BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 015869
         (399 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255544149|ref|XP_002513137.1| conserved hypothetical protein [Ricinus communis]
 gi|223548148|gb|EEF49640.1| conserved hypothetical protein [Ricinus communis]
          Length = 424

 Score =  627 bits (1616), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 307/399 (76%), Positives = 350/399 (87%), Gaps = 6/399 (1%)

Query: 1   MPSQQGNINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEH 60
           MPS      ++YL+TL+AS RPFLRGELE++DKNLP+LI+VLRSVGAGECWHKHGSFL+H
Sbjct: 1   MPSSLHQETSNYLQTLLASTRPFLRGELESVDKNLPALISVLRSVGAGECWHKHGSFLDH 60

Query: 61  LVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHL 120
           LVDIYRILKIW A D VCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG AAE+LIHL
Sbjct: 61  LVDIYRILKIWNASDSVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGPAAERLIHL 120

Query: 121 FCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPAD 180
           FCIVPRQPLIHDDLLF YSD ELV+HL  S++SL NAKEKG+F+AE++WRKKIN LLPA 
Sbjct: 121 FCIVPRQPLIHDDLLFNYSDSELVQHLLLSEISLKNAKEKGVFDAEDSWRKKINSLLPAA 180

Query: 181 GLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSAL 240
           G+TVK IKTGEDV V+RR+VA FVM+T+ADFSDQLF FQD LF+NSDG+LEF+GN+ ++L
Sbjct: 181 GITVKRIKTGEDVLVTRRIVAVFVMMTMADFSDQLFSFQDLLFDNSDGRLEFSGNNLASL 240

Query: 241 WPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPP 300
           WPGDGKPGLW+NS+SRMGAIY+LI REEEIF++ER     A G+ VD+ERDEDIEL VPP
Sbjct: 241 WPGDGKPGLWINSISRMGAIYTLIRREEEIFVEER---IRAGGIEVDEERDEDIELVVPP 297

Query: 301 VFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVL 360
           VF+ CTRILDA++QIEARDLYWEAVC   D SK  L+K EELLL CIEKNP+VGEPHVVL
Sbjct: 298 VFDKCTRILDARQQIEARDLYWEAVC---DLSKRGLDKVEELLLSCIEKNPYVGEPHVVL 354

Query: 361 GQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
            Q+YL K  FEEAEREA KG+ L+LEWGSPWDKR SWEG
Sbjct: 355 SQVYLTKDRFEEAEREAEKGVTLMLEWGSPWDKRTSWEG 393


>gi|224095202|ref|XP_002310360.1| predicted protein [Populus trichocarpa]
 gi|222853263|gb|EEE90810.1| predicted protein [Populus trichocarpa]
          Length = 430

 Score =  619 bits (1597), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 308/390 (78%), Positives = 347/390 (88%), Gaps = 6/390 (1%)

Query: 10  NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
            +YL+TL+ASARPFLRGELE+IDKNLPSLI+VLRSVGAGECWHKHGSFL+HLV+IYRILK
Sbjct: 16  TNYLQTLLASARPFLRGELESIDKNLPSLISVLRSVGAGECWHKHGSFLDHLVEIYRILK 75

Query: 70  IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           IWKAPD VCLCGLFHSAYSNSYVNLAIFDPNTGRDVVR HVG+AAE+LIHLFCIVPRQ L
Sbjct: 76  IWKAPDSVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRNHVGEAAERLIHLFCIVPRQSL 135

Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
           IHDDLLF+YSD ELVEHLKAS++SL NA EKGLFN EE+WRKK+  LLPA G+TVKHIK+
Sbjct: 136 IHDDLLFKYSDIELVEHLKASELSLRNAGEKGLFNGEESWRKKLASLLPASGITVKHIKS 195

Query: 190 GEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGL 249
           GEDV V+RR+V  F+++T+ADFSDQLFGFQD LF N DG+LEF GN+F ALWPGDGKPGL
Sbjct: 196 GEDVLVTRRMVGVFLLMTMADFSDQLFGFQDLLFENFDGRLEFLGNNFGALWPGDGKPGL 255

Query: 250 WMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRIL 309
           W+NS+SRMGAIYSLI REEEIFI+ERK +    G  VD+ERDEDIEL + PVFENCT++L
Sbjct: 256 WINSISRMGAIYSLIVREEEIFIEERKRAG---GFEVDRERDEDIELVLAPVFENCTQVL 312

Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
           DA+EQ+ ARDLYWEAVC   D SK  LE+AEELL+  IEKNPFVGEPHVVLGQ YL KG 
Sbjct: 313 DAREQVVARDLYWEAVC---DTSKGGLERAEELLVSSIEKNPFVGEPHVVLGQFYLTKGR 369

Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           FEEAE+EA +G+ LLLEWGSPWDKRMSWEG
Sbjct: 370 FEEAEKEAERGVTLLLEWGSPWDKRMSWEG 399


>gi|15231007|ref|NP_188633.1| uncharacterized protein [Arabidopsis thaliana]
 gi|63003792|gb|AAY25425.1| At3g19990 [Arabidopsis thaliana]
 gi|111074436|gb|ABH04591.1| At3g19990 [Arabidopsis thaliana]
 gi|332642795|gb|AEE76316.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 425

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 288/387 (74%), Positives = 334/387 (86%), Gaps = 4/387 (1%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           + ++ SARPFLRGELE ID  LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKA
Sbjct: 11  DAVLESARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKA 70

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P+ VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVR HVG+AAE LIHLFC+VPRQ LIHD+
Sbjct: 71  PESVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRDHVGEAAESLIHLFCVVPRQTLIHDE 130

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
           LLF+YSD ELVEHL  S+VSL NAKEKG+F+ +E WRKKIN L+P +G+ VKHIKTGE++
Sbjct: 131 LLFKYSDCELVEHLDCSEVSLRNAKEKGIFDGDEEWRKKINALVPENGVVVKHIKTGEEI 190

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNS 253
            VSRRVV  F+++T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLWMNS
Sbjct: 191 VVSRRVVGVFLLMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWMNS 250

Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
            SRMGAIYSLI REEEI ++ER+  A   G  V KERDEDI+L VPPVF  CT++LDAKE
Sbjct: 251 NSRMGAIYSLIVREEEILMEERRR-AFGSGFEVRKERDEDIDLVVPPVFSFCTKVLDAKE 309

Query: 314 QIEARDLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           QIEAR++YWE V  +SD SKE  LE+AEE LL CIEKNPFVGEPHV+L Q+YL K  F+E
Sbjct: 310 QIEAREMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFKE 367

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
           AEREA KGL+LLL+WGSPWDKRMSWEG
Sbjct: 368 AEREAEKGLLLLLQWGSPWDKRMSWEG 394


>gi|11994774|dbj|BAB03164.1| unnamed protein product [Arabidopsis thaliana]
          Length = 450

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 288/387 (74%), Positives = 334/387 (86%), Gaps = 4/387 (1%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           + ++ SARPFLRGELE ID  LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKA
Sbjct: 36  DAVLESARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKA 95

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P+ VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVR HVG+AAE LIHLFC+VPRQ LIHD+
Sbjct: 96  PESVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRDHVGEAAESLIHLFCVVPRQTLIHDE 155

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
           LLF+YSD ELVEHL  S+VSL NAKEKG+F+ +E WRKKIN L+P +G+ VKHIKTGE++
Sbjct: 156 LLFKYSDCELVEHLDCSEVSLRNAKEKGIFDGDEEWRKKINALVPENGVVVKHIKTGEEI 215

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNS 253
            VSRRVV  F+++T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLWMNS
Sbjct: 216 VVSRRVVGVFLLMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWMNS 275

Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
            SRMGAIYSLI REEEI ++ER+  A   G  V KERDEDI+L VPPVF  CT++LDAKE
Sbjct: 276 NSRMGAIYSLIVREEEILMEERRR-AFGSGFEVRKERDEDIDLVVPPVFSFCTKVLDAKE 334

Query: 314 QIEARDLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           QIEAR++YWE V  +SD SKE  LE+AEE LL CIEKNPFVGEPHV+L Q+YL K  F+E
Sbjct: 335 QIEAREMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFKE 392

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
           AEREA KGL+LLL+WGSPWDKRMSWEG
Sbjct: 393 AEREAEKGLLLLLQWGSPWDKRMSWEG 419


>gi|110743422|dbj|BAE99597.1| hypothetical protein [Arabidopsis thaliana]
          Length = 412

 Score =  591 bits (1523), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 288/382 (75%), Positives = 331/382 (86%), Gaps = 4/382 (1%)

Query: 19  SARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVC 78
           SARPFLRGELE ID  LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKAP+ VC
Sbjct: 3   SARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKAPESVC 62

Query: 79  LCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEY 138
           LCGLFHSAYSNSYVNLAIFDP+TGRDVVR HVG+AAE LIHLFC+VPRQ LIHD+LLF+Y
Sbjct: 63  LCGLFHSAYSNSYVNLAIFDPSTGRDVVRDHVGEAAESLIHLFCVVPRQTLIHDELLFKY 122

Query: 139 SDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRR 198
           SD ELVEHL  S+VSL NAKEKG+F+ +E WRKKIN L+P +G+ VKHIKTGE++ VSRR
Sbjct: 123 SDCELVEHLDCSEVSLRNAKEKGIFDGDEEWRKKINALVPENGVVVKHIKTGEEIVVSRR 182

Query: 199 VVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMG 258
           VV  F+++T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLWMNS SRMG
Sbjct: 183 VVGVFLLMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWMNSNSRMG 242

Query: 259 AIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEAR 318
           AIYSLI REEEI ++ER+  A   G  V KERDEDI+L VPPVF  CT++LDAKEQIEAR
Sbjct: 243 AIYSLIVREEEILMEERR-RAFGSGFEVRKERDEDIDLVVPPVFSFCTKVLDAKEQIEAR 301

Query: 319 DLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
           ++YWE V  +SD SKE  LE+AEE LL CIEKNPFVGEPHV+L Q+YL K  F+EAEREA
Sbjct: 302 EMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFKEAEREA 359

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
            KGL+LLL+WGSPWDKRMSWEG
Sbjct: 360 EKGLLLLLQWGSPWDKRMSWEG 381


>gi|297830676|ref|XP_002883220.1| binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329060|gb|EFH59479.1| binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 425

 Score =  588 bits (1517), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 288/387 (74%), Positives = 331/387 (85%), Gaps = 4/387 (1%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           + ++ SARPFLRGELE ID  LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKA
Sbjct: 11  DAVLESARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKA 70

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P+ VCLCGLFHSAYSNSYVNLAIFDP+TGR+VVR HVG+AAE LIHLFC+VPRQ LIHDD
Sbjct: 71  PESVCLCGLFHSAYSNSYVNLAIFDPSTGREVVRDHVGEAAESLIHLFCVVPRQTLIHDD 130

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
           LLF+YSD ELVEHL  S+VSL NAKEKG+F+ +E WRKKIN L+P +G  VKHIKTGE++
Sbjct: 131 LLFKYSDCELVEHLDCSEVSLRNAKEKGVFDGDEEWRKKINGLVPENGAVVKHIKTGEEI 190

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNS 253
            VSRRVV  F+ +T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLW+NS
Sbjct: 191 VVSRRVVGVFLFMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWLNS 250

Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
            SRMGAIYSLI REEEI ++ER+  A   G  V KERDEDIEL VPPVF  CT++LDAKE
Sbjct: 251 NSRMGAIYSLIVREEEILMEERRR-AFGSGFEVRKERDEDIELVVPPVFSFCTKVLDAKE 309

Query: 314 QIEARDLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           QIEAR++YWE V  +SD SKE  LE+AEE LL CIEKNPFVGEPHV+L Q+YL K  F E
Sbjct: 310 QIEAREMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFNE 367

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
           AEREA KGL+LLL+WGSPWDKRMSWEG
Sbjct: 368 AEREAEKGLLLLLQWGSPWDKRMSWEG 394


>gi|225463770|ref|XP_002268202.1| PREDICTED: uncharacterized protein LOC100257837 [Vitis vinifera]
          Length = 426

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 296/388 (76%), Positives = 340/388 (87%), Gaps = 6/388 (1%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L+TL+ASARPFLRGELEA+   LPSLI+VLRSVGAGECWHKHGSFLEHL D+Y+ILK+W
Sbjct: 14  HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLFDMYKILKLW 73

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           KA + VCLCGLFHSAYSNSYVNLAIFDP+TGRD VR HVGDAAE+LI+LFC+VPRQPLIH
Sbjct: 74  KAQESVCLCGLFHSAYSNSYVNLAIFDPSTGRDTVRAHVGDAAEELIYLFCVVPRQPLIH 133

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
           DDLLF YSD ELVEHLKAS+VS+ NAKE G+FN +EAWRKK+N L+P  G+TVKHIKTGE
Sbjct: 134 DDLLFRYSDDELVEHLKASEVSVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 193

Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
           DV VSRRVVA FVM+T+ADFSDQL+GFQD LF+NS+G+LEF+GNSF ALWPGDGKPGLW+
Sbjct: 194 DVLVSRRVVAIFVMMTMADFSDQLYGFQDALFDNSNGQLEFSGNSFGALWPGDGKPGLWL 253

Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
           NS+SRMGA+YSL+ REEEIFI+ERK      G  V K RDE++EL +PPV ENCTR+L A
Sbjct: 254 NSISRMGAVYSLLVREEEIFIEERKRVG---GDQVVKGRDEELELVIPPVLENCTRVLGA 310

Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
           +EQIEA DLYWEAVC   +GSK  L+ AE+LL RC+EKNPFVGEPHVVL QIYL  G FE
Sbjct: 311 EEQIEATDLYWEAVC---EGSKRGLDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGRFE 367

Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EAEREA KGL L+LEWG+PWDKRM+WEG
Sbjct: 368 EAEREAEKGLKLMLEWGNPWDKRMTWEG 395


>gi|225463772|ref|XP_002268460.1| PREDICTED: uncharacterized protein LOC100252690 [Vitis vinifera]
          Length = 426

 Score =  575 bits (1481), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 294/388 (75%), Positives = 340/388 (87%), Gaps = 6/388 (1%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L+TL+ASARPFLRGELEA+   LPSLI+VLRSVGAGECWHKHGSFLEHLVD+Y+ILK+W
Sbjct: 14  HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLVDMYKILKLW 73

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           KA + VCLCGLFHSAYSNSYVNLAIF+P+TGRD VR HVGDAAE+LI+LFC+VPRQPLIH
Sbjct: 74  KAQESVCLCGLFHSAYSNSYVNLAIFEPSTGRDTVRAHVGDAAEELIYLFCVVPRQPLIH 133

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
           DDLLF YSD ELVEHLKAS+VS+ NAKE G+FN +EAWRKK+N L+P  G+TVKHIKTGE
Sbjct: 134 DDLLFRYSDDELVEHLKASEVSVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 193

Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
           DV VSRRVVA FVM+T+ADFSDQ +GFQD LF+NS+G+LEF+GNS+ ALWPGDGKPGLW+
Sbjct: 194 DVLVSRRVVAIFVMMTMADFSDQHYGFQDALFDNSNGRLEFSGNSYGALWPGDGKPGLWL 253

Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
           NS+SRMGA+YSL+ REEEIFI+ERK      G  V K RDE++EL +PPV ENCTR+L A
Sbjct: 254 NSISRMGAVYSLLVREEEIFIEERKRVG---GDQVVKGRDEELELVIPPVLENCTRVLGA 310

Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
           +EQIEA DLYWEAVC   +GSK  L+ AE+LL RC+EKNPFVGEPHVVL QIYL  G FE
Sbjct: 311 EEQIEATDLYWEAVC---EGSKRGLDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGRFE 367

Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EAEREA KGL L+LEWG+PWDKRM+WEG
Sbjct: 368 EAEREAEKGLKLMLEWGNPWDKRMTWEG 395


>gi|449464960|ref|XP_004150197.1| PREDICTED: uncharacterized protein LOC101221411 [Cucumis sativus]
          Length = 423

 Score =  573 bits (1478), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 279/387 (72%), Positives = 326/387 (84%), Gaps = 10/387 (2%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           LE L+ S RPFLRGELE ++  LPSLI+VLRSVGAGECWHKHG+FL+HLVDIYRILKIWK
Sbjct: 16  LEPLLGSIRPFLRGELEKVNPILPSLISVLRSVGAGECWHKHGTFLDHLVDIYRILKIWK 75

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           A + VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVRGHVG+AAE+LIHLFCIVPRQ LIHD
Sbjct: 76  AQEPVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRGHVGEAAERLIHLFCIVPRQSLIHD 135

Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGED 192
           DLLF YSD ELV+HLK S+ SL NA E G FN +E WRKK+  +LPA+G+TVKHIKTGED
Sbjct: 136 DLLFHYSDSELVDHLKFSNTSLKNALENGHFNEDETWRKKLQSILPANGVTVKHIKTGED 195

Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
           V VSRRVVA F+M+TIADFSDQLFGFQD LF N++G+LEF GN++ +LWPG+GKPGLWMN
Sbjct: 196 VTVSRRVVAIFLMMTIADFSDQLFGFQDSLFENTNGRLEFPGNNYKSLWPGNGKPGLWMN 255

Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
           S SRMGAIY+LIAREE I I +RK +     +GV  ERDED+EL +PP+FENCTR+LDA 
Sbjct: 256 SASRMGAIYTLIAREESILIAQRKRNNE---LGVKTERDEDLELVIPPIFENCTRVLDAD 312

Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           +QI ARDLYWE VC       EK +++EELLL+C+ KNPF+GE HV LGQ+YL +G FEE
Sbjct: 313 DQITARDLYWEGVC-------EKSDRSEELLLKCVMKNPFIGEAHVALGQLYLGEGRFEE 365

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
           AE+EA +G+ L+L WGS WDKRM WEG
Sbjct: 366 AEKEAEEGVRLMLTWGSAWDKRMPWEG 392


>gi|388504458|gb|AFK40295.1| unknown [Medicago truncatula]
          Length = 428

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 272/390 (69%), Positives = 325/390 (83%), Gaps = 6/390 (1%)

Query: 11  DYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKI 70
           +YLETL+ SA+PFLR EL +ID  LPSLI +LRSVGA ECWHKHG+FLEHL+DI+RIL +
Sbjct: 13  EYLETLLTSAKPFLRNELISIDPKLPSLITILRSVGASECWHKHGTFLEHLIDIFRILHL 72

Query: 71  WKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
           WK+P  V LCGLFHSAYSNSYVNLAIFDP+T R+VVRGHVG  AE+LIHLFC+VPRQ LI
Sbjct: 73  WKSPYSVSLCGLFHSAYSNSYVNLAIFDPSTSREVVRGHVGVEAERLIHLFCVVPRQSLI 132

Query: 131 HDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG 190
           HDDLLF YSD+EL   L+ S++SL NAKEKG+FN +E+WRKK+  L+PADG+ VKHI+TG
Sbjct: 133 HDDLLFHYSDKELCHDLEKSELSLRNAKEKGIFNKDESWRKKLQGLVPADGIKVKHIRTG 192

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           EDV +SRRVVA FVM+T+ADF DQLFGFQD LF N DG+LEF GN+F A+WPG+GKPGLW
Sbjct: 193 EDVKLSRRVVAVFVMMTMADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLW 252

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV-GVDKERDEDIELPVPPVFENCTRIL 309
           +NS+SRMGA+Y+LI REEEIF++E+K      GV GVD ERDE IEL +PPVF  CT++L
Sbjct: 253 LNSISRMGAVYNLILREEEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVL 312

Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
           DA++QI ARDLYWEA+       +E LEK EELL++ IEKNPFVGEP+VVL Q+YL KG 
Sbjct: 313 DARDQIVARDLYWEAMI-----CEEGLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGR 367

Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           FEE E+EA +GL LLLEWG  WDKR+SWEG
Sbjct: 368 FEEGEKEAERGLTLLLEWGCHWDKRISWEG 397


>gi|449526223|ref|XP_004170113.1| PREDICTED: uncharacterized protein LOC101229416 [Cucumis sativus]
          Length = 423

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 278/387 (71%), Positives = 326/387 (84%), Gaps = 10/387 (2%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           LE L+ S RPFLRGELE ++  LPSLI+VLRSVGAGECWHKHG+FL+HLVDIYRILKIWK
Sbjct: 16  LEPLLGSIRPFLRGELEKVNPILPSLISVLRSVGAGECWHKHGTFLDHLVDIYRILKIWK 75

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           A + VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVRGHVG+AAE+LIHLFCIVPRQ LIHD
Sbjct: 76  AQEPVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRGHVGEAAERLIHLFCIVPRQSLIHD 135

Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGED 192
           DLLF YSD ELV+HLK S+ SL NA E G FN +E WRKK+  +LPA+G+TVKHIKTGED
Sbjct: 136 DLLFHYSDSELVDHLKFSNTSLKNALENGHFNEDETWRKKLQSILPANGVTVKHIKTGED 195

Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
           V VSRRVVA F+M+TIADFSDQLFGFQD LF N++G+LEF GN++ +LWPG+GKPGLWMN
Sbjct: 196 VTVSRRVVAIFLMMTIADFSDQLFGFQDSLFENTNGRLEFPGNNYKSLWPGNGKPGLWMN 255

Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
           S SRMGAIY+LIAREE I I +RK +     +GV  ERDED+EL +PP+FENCT++LDA 
Sbjct: 256 SASRMGAIYTLIAREESILIVQRKRNNE---LGVKTERDEDLELVIPPIFENCTKVLDAD 312

Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           +QI ARDLYWE VC       EK +++EELLL+C+ KNPF+GE HV LGQ+YL +G FEE
Sbjct: 313 DQITARDLYWEGVC-------EKSDRSEELLLKCVMKNPFIGEAHVALGQLYLGEGRFEE 365

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
           AE+EA +G+ L+L WGS WDKRM WEG
Sbjct: 366 AEKEAEEGVRLMLTWGSAWDKRMPWEG 392


>gi|357439007|ref|XP_003589780.1| hypothetical protein MTR_1g039190 [Medicago truncatula]
 gi|355478828|gb|AES60031.1| hypothetical protein MTR_1g039190 [Medicago truncatula]
          Length = 462

 Score =  571 bits (1471), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 272/390 (69%), Positives = 325/390 (83%), Gaps = 6/390 (1%)

Query: 11  DYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKI 70
           +YLETL+ SA+PFLR EL +ID  LPSLI +LRSVGA ECWHKHG+FLEHL+DI+RIL +
Sbjct: 13  EYLETLLTSAKPFLRNELISIDPKLPSLITILRSVGASECWHKHGTFLEHLIDIFRILHL 72

Query: 71  WKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
           WK+P  V LCGLFHSAYSNSYVNLAIFDP+T R+VVRGHVG  AE+LIHLFC+VPRQ LI
Sbjct: 73  WKSPYSVSLCGLFHSAYSNSYVNLAIFDPSTSREVVRGHVGVEAERLIHLFCVVPRQSLI 132

Query: 131 HDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG 190
           HDDLLF YSD+EL   L+ S++SL NAKEKG+FN +E+WRKK+  L+PADG+ VKHI+TG
Sbjct: 133 HDDLLFHYSDKELCHDLEKSELSLRNAKEKGIFNKDESWRKKLQGLVPADGIKVKHIRTG 192

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           EDV +SRRVVA FVM+T+ADF DQLFGFQD LF N DG+LEF GN+F A+WPG+GKPGLW
Sbjct: 193 EDVKLSRRVVAVFVMMTMADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLW 252

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV-GVDKERDEDIELPVPPVFENCTRIL 309
           +NS+SRMGA+Y+LI REEEIF++E+K      GV GVD ERDE IEL +PPVF  CT++L
Sbjct: 253 LNSISRMGAVYNLILREEEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVL 312

Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
           DA++QI ARDLYWEA+       +E LEK EELL++ IEKNPFVGEP+VVL Q+YL KG 
Sbjct: 313 DARDQIVARDLYWEAMI-----CEEGLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGR 367

Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           FEE E+EA +GL LLLEWG  WDKR+SWEG
Sbjct: 368 FEEGEKEAERGLTLLLEWGCHWDKRISWEG 397


>gi|358248656|ref|NP_001239662.1| uncharacterized protein LOC100806152 [Glycine max]
 gi|255646517|gb|ACU23736.1| unknown [Glycine max]
          Length = 429

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 278/392 (70%), Positives = 326/392 (83%), Gaps = 13/392 (3%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           +E L+ S RPFLRG+LE+IDKNLP L+ VL+SVGAGECWHKHGSFL HLVDI+RILK+WK
Sbjct: 15  IEKLLESGRPFLRGDLESIDKNLPRLVGVLQSVGAGECWHKHGSFLHHLVDIFRILKLWK 74

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           AP  VCLCGLFHSAYSNSYVNLAIFDP+TGR+VVR  VG+ AE LIHLFCIVPRQPLIHD
Sbjct: 75  APHSVCLCGLFHSAYSNSYVNLAIFDPSTGREVVRALVGEEAESLIHLFCIVPRQPLIHD 134

Query: 133 DLLFEYSDQELVEHLKASDVSLINAK--EKGLFNAE---EAWRKKINELLPADGLTVKHI 187
           DLLF YSD+ELV+HL  S++SL NAK  EKGLF+ +   E WRKK+  L+PA+G+ VKHI
Sbjct: 135 DLLFHYSDEELVQHLAQSEISLRNAKKMEKGLFDDDGELEGWRKKLQGLVPAEGVQVKHI 194

Query: 188 KTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKP 247
           +TGE V VSRR+VA F+M+T+ADF DQLFGFQD LF+N++G+LEF+GN+F ALWPGDGKP
Sbjct: 195 RTGEGVHVSRRIVAVFIMMTMADFCDQLFGFQDLLFDNANGRLEFSGNNFGALWPGDGKP 254

Query: 248 GLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTR 307
           GLW+NS+SRMGA+Y+LIAREEEIFIQERK     V V  D ER+EDIEL +PPVF+ C +
Sbjct: 255 GLWLNSISRMGAVYTLIAREEEIFIQERKRK-VGVAVVPDLERNEDIELVLPPVFDYCRK 313

Query: 308 ILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAK 367
           +L+A +QI ARDLYWEAVC       E   KAEELLL  IEKNPFVGEP+VVL Q+YL +
Sbjct: 314 VLEAGDQIVARDLYWEAVC-------EGGSKAEELLLESIEKNPFVGEPYVVLSQVYLTE 366

Query: 368 GEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           G FEEAE+ A +GL LLLEWG PWDKR SWEG
Sbjct: 367 GRFEEAEKHAERGLKLLLEWGCPWDKRTSWEG 398


>gi|242063594|ref|XP_002453086.1| hypothetical protein SORBIDRAFT_04g038110 [Sorghum bicolor]
 gi|241932917|gb|EES06062.1| hypothetical protein SORBIDRAFT_04g038110 [Sorghum bicolor]
          Length = 421

 Score =  499 bits (1286), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 233/387 (60%), Positives = 301/387 (77%), Gaps = 3/387 (0%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           L   +A+ARPFLRGE   +D  LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W 
Sbjct: 7   LADALAAARPFLRGEEAEVDPALPGLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLWG 66

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           APD V  CGL+HSAYSNSYVNLAIF P+TGR  V   VG  AE+L+HLFC+VPRQ L+HD
Sbjct: 67  APDAVARCGLYHSAYSNSYVNLAIFQPDTGRAHVAAVVGADAERLVHLFCVVPRQKLVHD 126

Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGED 192
           DLLF YSDQ+LV+ L +S+ SL +A+ +G+F+  + WR+KI  L+P DG+TVKHI+TGED
Sbjct: 127 DLLFHYSDQDLVDDLASSEASLRDAR-RGVFDDAQPWRRKIQRLIPPDGITVKHIRTGED 185

Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
           V +SRR+ ATF+++TIADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW+ 
Sbjct: 186 VALSRRIAATFLLMTIADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLWVT 245

Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
           S SRMGA+Y+LI REEEI+I +R    +  G      RDEDI+L +PPVF+ CT++L+A 
Sbjct: 246 SSSRMGALYTLIIREEEIYITQRAHGISRQGSNGSAGRDEDIDLVIPPVFKGCTKVLNAD 305

Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           +Q  ARDLYWEAVC   + S     K EELL + I KNPFVGEPH+VL Q+YL    + +
Sbjct: 306 DQKVARDLYWEAVCSGDEAS--DWLKVEELLQQSIAKNPFVGEPHLVLAQVYLNMERYGD 363

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
           A+ +A +GL LLLEWGS WDKRM+WEG
Sbjct: 364 AQMQAEEGLKLLLEWGSSWDKRMTWEG 390


>gi|222623966|gb|EEE58098.1| hypothetical protein OsJ_08971 [Oryza sativa Japonica Group]
          Length = 474

 Score =  496 bits (1277), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 238/389 (61%), Positives = 300/389 (77%), Gaps = 7/389 (1%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L   + +ARPFLRGE E +D  LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 61  HLAGALDAARPFLRGEEEQVDPALPKLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLW 120

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
            APD V  CGL+HSAYSNSYVNLAIF+P+ GRD VR  VG AAE+L+HLFC+VPR  LIH
Sbjct: 121 GAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRDHVRPIVGAAAERLVHLFCVVPRHQLIH 180

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE-EAWRKKINELLPADGLTVKHIKTG 190
           DDLLF Y+DQ+LV  L AS+ SL +A+ +GLF+ + EAWR KI  LLP  G+TVKHI+TG
Sbjct: 181 DDLLFHYADQDLVADLAASEASLQDAR-RGLFHHDGEAWRLKIQRLLPPHGITVKHIRTG 239

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           EDV +SRR+ ATF+++T+ADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW
Sbjct: 240 EDVALSRRIAATFLLMTMADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLW 299

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
             S+SRMGA+YSLI REEEI I +RK S      G + +RDEDIEL +PPVF  CT++L 
Sbjct: 300 TASISRMGALYSLIVREEEIHIAQRKHS----NNGQEDDRDEDIELVIPPVFNGCTQVLT 355

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           A +Q  ARDLYW+AVC   +   +   K EE+L RCI +NPFVGEPH+VL Q+ L    +
Sbjct: 356 ADDQKAARDLYWDAVCSGGEDETD-WRKVEEILRRCIGRNPFVGEPHLVLAQVLLNMEMY 414

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EEAE +   G+ LLLEWGS WDKRM WE 
Sbjct: 415 EEAEEQIEAGVKLLLEWGSSWDKRMPWEA 443


>gi|48716457|dbj|BAD23064.1| unknown protein [Oryza sativa Japonica Group]
 gi|215700944|dbj|BAG92368.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 419

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 238/389 (61%), Positives = 300/389 (77%), Gaps = 7/389 (1%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L   + +ARPFLRGE E +D  LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 6   HLAGALDAARPFLRGEEEQVDPALPKLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLW 65

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
            APD V  CGL+HSAYSNSYVNLAIF+P+ GRD VR  VG AAE+L+HLFC+VPR  LIH
Sbjct: 66  GAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRDHVRPIVGAAAERLVHLFCVVPRHQLIH 125

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE-EAWRKKINELLPADGLTVKHIKTG 190
           DDLLF Y+DQ+LV  L AS+ SL +A+ +GLF+ + EAWR KI  LLP  G+TVKHI+TG
Sbjct: 126 DDLLFHYADQDLVADLAASEASLQDAR-RGLFHHDGEAWRLKIQRLLPPHGITVKHIRTG 184

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           EDV +SRR+ ATF+++T+ADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW
Sbjct: 185 EDVALSRRIAATFLLMTMADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLW 244

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
             S+SRMGA+YSLI REEEI I +RK S      G + +RDEDIEL +PPVF  CT++L 
Sbjct: 245 TASISRMGALYSLIVREEEIHIAQRKHS----NNGQEDDRDEDIELVIPPVFNGCTQVLT 300

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           A +Q  ARDLYW+AVC   +   +   K EE+L RCI +NPFVGEPH+VL Q+ L    +
Sbjct: 301 ADDQKAARDLYWDAVCSGGEDETD-WRKVEEILRRCIGRNPFVGEPHLVLAQVLLNMEMY 359

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EEAE +   G+ LLLEWGS WDKRM WE 
Sbjct: 360 EEAEEQIEAGVKLLLEWGSSWDKRMPWEA 388


>gi|125541709|gb|EAY88104.1| hypothetical protein OsI_09539 [Oryza sativa Indica Group]
          Length = 419

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 238/389 (61%), Positives = 300/389 (77%), Gaps = 7/389 (1%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L   + +ARPFLRGE E +D  LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 6   HLAGALDAARPFLRGEEEQVDPALPKLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLW 65

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
            APD V  CGL+HSAYSNSYVNLAIF+P+ GRD VR  VG AAE+L+HLFC+VPR  LIH
Sbjct: 66  GAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRDHVRPIVGAAAERLVHLFCVVPRHQLIH 125

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE-EAWRKKINELLPADGLTVKHIKTG 190
           DDLLF Y+DQ+LV  L AS+ SL +A+ +GLF+ + EAWR KI  LLP  G+TVKHI+TG
Sbjct: 126 DDLLFHYADQDLVADLAASEASLQDAR-RGLFHHDGEAWRLKIQRLLPPHGITVKHIRTG 184

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           EDV +SRR+ ATF+++T+ADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW
Sbjct: 185 EDVALSRRIAATFLLMTMADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLW 244

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
             S+SRMGA+YSLI REEEI I +RK S      G + +RDEDIEL +PPVF  CT++L 
Sbjct: 245 TASISRMGALYSLIVREEEIHIAQRKHS----NNGQEDDRDEDIELVIPPVFNGCTQVLT 300

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           A +Q  ARDLYW+AVC   +   +   K EE+L RCI +NPFVGEPH+VL Q+ L    +
Sbjct: 301 ADDQKAARDLYWDAVCSGGEDETD-WRKVEEILRRCIGRNPFVGEPHLVLAQVLLNMEMY 359

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EEAE +   G+ LLLEWGS WDKRM WE 
Sbjct: 360 EEAEEQIEAGVKLLLEWGSSWDKRMPWEA 388


>gi|147832395|emb|CAN77784.1| hypothetical protein VITISV_014471 [Vitis vinifera]
          Length = 402

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 259/388 (66%), Positives = 297/388 (76%), Gaps = 30/388 (7%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L+TL+ASARPFLRGELEA+   LPS+I+VLRSVGAGECWHKHGSFLEHLVD+Y+ILK+W
Sbjct: 14  HLQTLLASARPFLRGELEAVHPKLPSIISVLRSVGAGECWHKHGSFLEHLVDMYKILKLW 73

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           KA + VCLCGLFHSAYSNSYVNLAIFDP+TGRD VR HVGDAAEKLIHLFC+VPRQPL+H
Sbjct: 74  KAQESVCLCGLFHSAYSNSYVNLAIFDPSTGRDTVRAHVGDAAEKLIHLFCVVPRQPLVH 133

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
           DDL+F  SD ELVEHLKAS+VS+ NAKE G+ N +EAWRKK+N L+P DG+TVKHIKTGE
Sbjct: 134 DDLIFRCSDDELVEHLKASEVSVRNAKEGGVVNGDEAWRKKVNSLVPPDGVTVKHIKTGE 193

Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
           DV VSRRVVA FVM+T+AD SD+ +GFQD         L F   +           GL M
Sbjct: 194 DVLVSRRVVAIFVMMTMADSSDKHYGFQD--------ALSFLATAMELC-------GLGM 238

Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
            SL      +S             K     VG+   K RDE++EL +PPV ENCTR+L A
Sbjct: 239 ASLGCGRRRFS------------SKRGKELVGIKWWKGRDEELELVIPPVLENCTRVLGA 286

Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
           +EQIEA DLYWEAVC   +GSK  L+ AEELL RC+EKNPFVGEPHVVL QIYL  G FE
Sbjct: 287 EEQIEATDLYWEAVC---EGSKRGLDWAEELLGRCVEKNPFVGEPHVVLAQIYLTNGRFE 343

Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EAEREA KGL L+LEWGSPWDKRM+WEG
Sbjct: 344 EAEREAEKGLKLMLEWGSPWDKRMTWEG 371


>gi|224031197|gb|ACN34674.1| unknown [Zea mays]
 gi|413939578|gb|AFW74129.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
          Length = 426

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 228/378 (60%), Positives = 288/378 (76%), Gaps = 12/378 (3%)

Query: 22  PFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCG 81
           PFLRGE   +D  LP L AVLR  GAGECWHKHG+FL HL+D++RIL++W APD V  CG
Sbjct: 30  PFLRGEEAEVDPALPGLAAVLRGAGAGECWHKHGTFLAHLLDVHRILRLWGAPDAVARCG 89

Query: 82  LFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQ 141
           L+HSAYSNSYVNLAIF P+TGR  V   VG  AE+L+HLFC+VPRQ L+HDDLLF YSDQ
Sbjct: 90  LYHSAYSNSYVNLAIFQPDTGRAHVAAVVGGPAERLVHLFCVVPRQQLVHDDLLFHYSDQ 149

Query: 142 ELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVA 201
           +L + L +S+ S+ +A+ +G+F+  + WR+KI  LLPADG+TVKHI+TGEDV +SRR+ A
Sbjct: 150 DLADDLASSEESVRDAR-RGVFDDAQPWRRKIQRLLPADGITVKHIRTGEDVSLSRRIAA 208

Query: 202 TFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIY 261
           TF+++TIADFSDQLF +QD LF+NS+G+LEF+GN++++LWPG GKPGLW+ S SRM A+Y
Sbjct: 209 TFLLMTIADFSDQLFDWQDRLFDNSNGRLEFSGNTWTSLWPGTGKPGLWVTSSSRMAALY 268

Query: 262 SLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLY 321
           +LI REEEI I +R              RDED++L +PPVF  CTR+L+A++Q  ARDLY
Sbjct: 269 TLIVREEEIHITQRARDTHG--------RDEDMDLVIPPVFNGCTRLLNAEDQKVARDLY 320

Query: 322 WEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGL 381
           WEAVCG          K E+LL + I KNPFVGEPH+VL Q+YL    + EA+ +A +GL
Sbjct: 321 WEAVCGE---EASDWLKVEQLLQQSIAKNPFVGEPHLVLAQVYLNMERYGEAQMQAEEGL 377

Query: 382 MLLLEWGSPWDKRMSWEG 399
            LLLEWGS WDKRM WEG
Sbjct: 378 RLLLEWGSSWDKRMPWEG 395


>gi|413939577|gb|AFW74128.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
          Length = 431

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 228/383 (59%), Positives = 288/383 (75%), Gaps = 17/383 (4%)

Query: 22  PFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCG 81
           PFLRGE   +D  LP L AVLR  GAGECWHKHG+FL HL+D++RIL++W APD V  CG
Sbjct: 30  PFLRGEEAEVDPALPGLAAVLRGAGAGECWHKHGTFLAHLLDVHRILRLWGAPDAVARCG 89

Query: 82  LFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQ 141
           L+HSAYSNSYVNLAIF P+TGR  V   VG  AE+L+HLFC+VPRQ L+HDDLLF YSDQ
Sbjct: 90  LYHSAYSNSYVNLAIFQPDTGRAHVAAVVGGPAERLVHLFCVVPRQQLVHDDLLFHYSDQ 149

Query: 142 ELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHI-----KTGEDVGVS 196
           +L + L +S+ S+ +A+ +G+F+  + WR+KI  LLPADG+TVKHI     +TGEDV +S
Sbjct: 150 DLADDLASSEESVRDAR-RGVFDDAQPWRRKIQRLLPADGITVKHISELHCRTGEDVSLS 208

Query: 197 RRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSR 256
           RR+ ATF+++TIADFSDQLF +QD LF+NS+G+LEF+GN++++LWPG GKPGLW+ S SR
Sbjct: 209 RRIAATFLLMTIADFSDQLFDWQDRLFDNSNGRLEFSGNTWTSLWPGTGKPGLWVTSSSR 268

Query: 257 MGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIE 316
           M A+Y+LI REEEI I +R              RDED++L +PPVF  CTR+L+A++Q  
Sbjct: 269 MAALYTLIVREEEIHITQRARDTHG--------RDEDMDLVIPPVFNGCTRLLNAEDQKV 320

Query: 317 ARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAERE 376
           ARDLYWEAVCG          K E+LL + I KNPFVGEPH+VL Q+YL    + EA+ +
Sbjct: 321 ARDLYWEAVCGE---EASDWLKVEQLLQQSIAKNPFVGEPHLVLAQVYLNMERYGEAQMQ 377

Query: 377 AGKGLMLLLEWGSPWDKRMSWEG 399
           A +GL LLLEWGS WDKRM WEG
Sbjct: 378 AEEGLRLLLEWGSSWDKRMPWEG 400


>gi|357138263|ref|XP_003570716.1| PREDICTED: uncharacterized protein LOC100827583 isoform 2
           [Brachypodium distachyon]
          Length = 423

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 233/395 (58%), Positives = 295/395 (74%), Gaps = 5/395 (1%)

Query: 6   GNINNDYLETLVASARPFLRGELE-AIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDI 64
           G+  +      +A+ARPFLRGE E   D +LP+L AVL + GAGECWHKHG+FL HL+D+
Sbjct: 2   GSSTSSTYADALAAARPFLRGEEELQPDPSLPALAAVLSAAGAGECWHKHGTFLAHLLDV 61

Query: 65  YRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIV 124
           +RIL++W APD V  CGL+HSAYSNSYVNLAIF+P+ GR  V G VG  AE+L+HLFC+V
Sbjct: 62  HRILRLWSAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRAHVAGVVGAEAERLVHLFCVV 121

Query: 125 PRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTV 184
           PRQ LIHDDLLF Y D++LV  L  S+ SL +A+  G+F  +E WR+KI  LLPA G+TV
Sbjct: 122 PRQQLIHDDLLFHYEDEDLVVDLARSEESLADAR-GGVFEEDEPWRRKIQRLLPASGITV 180

Query: 185 KHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGD 244
           KHI+TGEDV +SRRV A F+++T+ADFSDQLF +QD LF+N++G+LEF GN++++LWPG 
Sbjct: 181 KHIRTGEDVALSRRVAAAFLLMTMADFSDQLFDWQDRLFDNTNGRLEFRGNTWTSLWPGT 240

Query: 245 GKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFEN 304
           GKPGLW  S+SRMGA+YSLI REEEI+I  R  +    G      RDEDI L +PPVF  
Sbjct: 241 GKPGLWTTSISRMGALYSLIVREEEIYIAHRTHTTGQEG-DETATRDEDIALVIPPVFNG 299

Query: 305 CTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIY 364
           CT++L+A +Q  ARDLYWEAVC  SD       + EELL + + KNPFVGEP +VL Q+ 
Sbjct: 300 CTKVLNADDQKAARDLYWEAVC--SDEEAMDRHRVEELLRQTVAKNPFVGEPRLVLAQVC 357

Query: 365 LAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           L    +EEA+ +A +GL LLLEWGS WDKRM+WEG
Sbjct: 358 LNAEMYEEAQEQAEEGLKLLLEWGSSWDKRMTWEG 392


>gi|357138261|ref|XP_003570715.1| PREDICTED: uncharacterized protein LOC100827583 isoform 1
           [Brachypodium distachyon]
          Length = 421

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 237/400 (59%), Positives = 299/400 (74%), Gaps = 11/400 (2%)

Query: 1   MPSQQGNINNDYLETLVASARPFLRGELE-AIDKNLPSLIAVLRSVGAGECWHKHGSFLE 59
           MPS     ++ Y + L A+ARPFLRGE E   D +LP+L AVL + GAGECWHKHG+FL 
Sbjct: 1   MPS-----SSTYADAL-AAARPFLRGEEELQPDPSLPALAAVLSAAGAGECWHKHGTFLA 54

Query: 60  HLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIH 119
           HL+D++RIL++W APD V  CGL+HSAYSNSYVNLAIF+P+ GR  V G VG  AE+L+H
Sbjct: 55  HLLDVHRILRLWSAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRAHVAGVVGAEAERLVH 114

Query: 120 LFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPA 179
           LFC+VPRQ LIHDDLLF Y D++LV  L  S+ SL +A+  G+F  +E WR+KI  LLPA
Sbjct: 115 LFCVVPRQQLIHDDLLFHYEDEDLVVDLARSEESLADAR-GGVFEEDEPWRRKIQRLLPA 173

Query: 180 DGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSA 239
            G+TVKHI+TGEDV +SRRV A F+++T+ADFSDQLF +QD LF+N++G+LEF GN++++
Sbjct: 174 SGITVKHIRTGEDVALSRRVAAAFLLMTMADFSDQLFDWQDRLFDNTNGRLEFRGNTWTS 233

Query: 240 LWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVP 299
           LWPG GKPGLW  S+SRMGA+YSLI REEEI+I  R  +    G      RDEDI L +P
Sbjct: 234 LWPGTGKPGLWTTSISRMGALYSLIVREEEIYIAHRTHTTGQEG-DETATRDEDIALVIP 292

Query: 300 PVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVV 359
           PVF  CT++L+A +Q  ARDLYWEAVC  SD       + EELL + + KNPFVGEP +V
Sbjct: 293 PVFNGCTKVLNADDQKAARDLYWEAVC--SDEEAMDRHRVEELLRQTVAKNPFVGEPRLV 350

Query: 360 LGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           L Q+ L    +EEA+ +A +GL LLLEWGS WDKRM+WEG
Sbjct: 351 LAQVCLNAEMYEEAQEQAEEGLKLLLEWGSSWDKRMTWEG 390


>gi|326497943|dbj|BAJ94834.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 421

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 228/384 (59%), Positives = 295/384 (76%), Gaps = 5/384 (1%)

Query: 17  VASARPFLRGELEAI-DKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
           +A+ARP+LRGE +   D  LP+L AVLR+ GAGECWHKHG+FL HL+++YRIL++W +PD
Sbjct: 11  LAAARPYLRGEEDQCGDPALPALTAVLRAAGAGECWHKHGTFLAHLLEVYRILRLWASPD 70

Query: 76  CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
            V  CGL+HSAYSNSYVNLAIF+P+ GR  V   VGD AE+L+HLFC+VPRQ L+HDDLL
Sbjct: 71  AVARCGLYHSAYSNSYVNLAIFEPDVGRARVAAVVGDEAERLVHLFCVVPRQQLVHDDLL 130

Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
           F Y D +L   L  S+ S+++A+ +G+F+ +E WR+KI  LLPADG+TVKHI+TGEDV +
Sbjct: 131 FHYDDADLAADLARSEESVLDAR-RGVFDDDEPWRRKIQRLLPADGITVKHIRTGEDVAL 189

Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLS 255
           SRR+ ATF+M+T+ADFSDQLF +QD LF+N++G+LEF GN++++LWPG GKPGLW  S+S
Sbjct: 190 SRRIAATFLMMTMADFSDQLFDWQDRLFDNTNGRLEFRGNTWTSLWPGTGKPGLWTTSIS 249

Query: 256 RMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQI 315
           RMG +Y+LI REEEI++ +R  +    G      RDEDI L +PPVF+ CT++LDA +Q 
Sbjct: 250 RMGVLYNLIVREEEIYLAQRAHTTGKEG-NDSSTRDEDIALVIPPVFDGCTKVLDADDQK 308

Query: 316 EARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAER 375
            ARDLYWEAVC  SD       K EELL + + KNPFVGEP +VL Q+ L    +EEA+ 
Sbjct: 309 AARDLYWEAVC--SDDEATDRSKVEELLRQSVTKNPFVGEPRLVLAQMCLNAEMYEEAQE 366

Query: 376 EAGKGLMLLLEWGSPWDKRMSWEG 399
           +A +GL LLLEWGS WDKRM WEG
Sbjct: 367 QAEEGLKLLLEWGSSWDKRMPWEG 390


>gi|294460946|gb|ADE76045.1| unknown [Picea sitchensis]
          Length = 453

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 235/426 (55%), Positives = 296/426 (69%), Gaps = 38/426 (8%)

Query: 8   INNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRI 67
           + N  L T +  ARPFLRG  + +D +LP  + VLR VGA ECWHKHGSF +HLVD+YRI
Sbjct: 1   MANSELPTRLERARPFLRGAYDEVDPSLPEFMRVLRQVGASECWHKHGSFFDHLVDVYRI 60

Query: 68  LKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
           LK+W APD V  CGLFHSAYSNSYVNLAIF P+ GR  VR  +GD+AE+L+H+FCIVPRQ
Sbjct: 61  LKLWNAPDSVARCGLFHSAYSNSYVNLAIFKPDVGRQRVRDLIGDSAEELVHMFCIVPRQ 120

Query: 128 PLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGL----FNAEEA--------------- 168
           PLIHDDLLF ++D+ELV  L+ S+ ++  A+ + L       EE                
Sbjct: 121 PLIHDDLLFHFTDEELVSALEESEKTMNEAEAEDLELQKLKLEEKNGDWLDNIHDKVEIH 180

Query: 169 ---------------WRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSD 213
                          WR+KI ++LPA G+ VKHIK GEDV V RR+VA F+ +T+ADFSD
Sbjct: 181 KTHFKETLEGFILPEWRRKIQKILPAHGVRVKHIKNGEDVEVPRRLVAIFLFMTMADFSD 240

Query: 214 QLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQ 273
           QLF FQD LF+NS+G +EFAGN+++ALWPGDGKPGLW+NS+SRMGA+Y+LI REE++F  
Sbjct: 241 QLFSFQDVLFDNSNGIMEFAGNTWTALWPGDGKPGLWLNSISRMGALYTLIVREEKLF-- 298

Query: 274 ERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSK 333
           +R+           K   E+IEL +PPVF++C  +L   +Q+ ARDLYWEAVC    G K
Sbjct: 299 QREEGRKKNEEEKKKRGFEEIELVIPPVFDHCRSVLFPSDQVTARDLYWEAVC-MPPGEK 357

Query: 334 EKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDK 393
              EKAE LL +  E NPFVGEP VVL QIY ++G ++EAE+EA  GL L+LEWG+ WDK
Sbjct: 358 TH-EKAETLLAKACELNPFVGEPRVVLAQIYNSQGRYDEAEKEAETGLRLMLEWGTTWDK 416

Query: 394 RMSWEG 399
           RMSWEG
Sbjct: 417 RMSWEG 422


>gi|218191863|gb|EEC74290.1| hypothetical protein OsI_09541 [Oryza sativa Indica Group]
          Length = 413

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 219/389 (56%), Positives = 283/389 (72%), Gaps = 13/389 (3%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           ++ +VA+ARPFLRG+L  +D  LPSL++VL   GAGEC+HKHG+FL HL+D+YRIL++W 
Sbjct: 5   VDAVVAAARPFLRGDLSQVDPELPSLVSVLCDAGAGECYHKHGTFLAHLIDVYRILRLWG 64

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           APD V  CGL+HSAYSNSYVNLAIF P+  RD VRG +G AAE+L+HLFC+VPR  L+HD
Sbjct: 65  APDAVARCGLYHSAYSNSYVNLAIFQPDVARDHVRGIIGAAAERLVHLFCVVPRHQLMHD 124

Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE--EAWRKKINELLPADGLTVKHIKTG 190
           DL   Y+D EL  HL AS  SL  A++ G    +  EAWR K+  ++P +G+  +HI+TG
Sbjct: 125 DLHLRYTDAELTAHLAASQASLDAARKSGGGGGDPGEAWRAKLASVVPPEGVVARHIRTG 184

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           E V +SRRV+  F+++TIADFSDQ   +QD LF+N DG+LE+ G+++ ALWPG GKPGLW
Sbjct: 185 EAVALSRRVLGVFIVMTIADFSDQYTDYQDKLFDNEDGRLEYRGDNWRALWPGSGKPGLW 244

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
           M+++SR+ A+Y LIA +EEI    R+ +A    V   K  D ++EL +PPVFE C+++LD
Sbjct: 245 MSAMSRLAAVYRLIATDEEI----RELTAGERSVV--KREDAELELVIPPVFERCSKVLD 298

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           A EQ E RDLYWEAVCG  +       K E LL R IEKNPFVGEP +VL Q  L  G  
Sbjct: 299 AAEQKEGRDLYWEAVCGEWEEG-----KTERLLRRSIEKNPFVGEPWLVLAQALLNSGHD 353

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EEAE  A +G+ L++EWGS WDKRMSWEG
Sbjct: 354 EEAEAAAEEGVRLVVEWGSSWDKRMSWEG 382


>gi|326487554|dbj|BAK05449.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 221/396 (55%), Positives = 292/396 (73%), Gaps = 11/396 (2%)

Query: 7   NINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYR 66
           + N+D L++L+ +ARPFLRG+L A+D  LPSL++VL S GAGEC+HKHG+FL HL+D+YR
Sbjct: 29  STNDDALQSLLVAARPFLRGDLAAVDPELPSLVSVLVSAGAGECYHKHGTFLAHLLDVYR 88

Query: 67  ILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPR 126
           IL++W APD V  CGLFHS+YSNSYVNLAIF+P+  R  VRG VG+AAE+L+HLFC+VPR
Sbjct: 89  ILRLWGAPDAVARCGLFHSSYSNSYVNLAIFEPDVSRARVRGIVGEAAERLVHLFCVVPR 148

Query: 127 QPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLF--NAEEAWRKKINELLPADGLTV 184
             L+HDDL   Y+D EL +HL A+  SL  A+  G    +  E WR K+  ++P DG+ V
Sbjct: 149 HALMHDDLHLRYTDAELRDHLAAAGASLQAARSHGGRPEDKAEPWRLKLRSVVPEDGVVV 208

Query: 185 KHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGD 244
            HI+TGE V +SRRV+A FV++T+ADFSDQ   +QD LF N DG+LEFAG++++ALWPG 
Sbjct: 209 PHIRTGEPVALSRRVLAVFVLMTVADFSDQYTDYQDKLFRNDDGRLEFAGDNWAALWPGT 268

Query: 245 GKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFEN 304
           GKPGLWM++ SR+ A+Y LIA +E++   E +            E+D  +EL +PPVF+ 
Sbjct: 269 GKPGLWMSATSRLAALYRLIATDEQLRHMEEQVGPTKT---TADEQDAGLELCIPPVFDR 325

Query: 305 CTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIY 364
           C+++LD  EQI ARD+YWEA+C  SDG +E +E    LL RCI KNP+VGEP +VL Q+ 
Sbjct: 326 CSKVLDPGEQIAARDMYWEAIC--SDGKEEGVES---LLRRCIAKNPYVGEPWLVLAQVL 380

Query: 365 L-AKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           L   G +EEAE  A +GL L++EWGS WDKRMSWEG
Sbjct: 381 LNGGGRWEEAEAAAEEGLRLVVEWGSSWDKRMSWEG 416


>gi|115449991|ref|NP_001048601.1| Os02g0828500 [Oryza sativa Japonica Group]
 gi|48716459|dbj|BAD23066.1| unknown protein [Oryza sativa Japonica Group]
 gi|113538132|dbj|BAF10515.1| Os02g0828500 [Oryza sativa Japonica Group]
          Length = 414

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 218/389 (56%), Positives = 283/389 (72%), Gaps = 13/389 (3%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           ++ +VA+ARPFLRG+L  +D  LPSL++VL   GAGEC+HKHG+FL HL+D+YRIL++W 
Sbjct: 6   VDAVVAAARPFLRGDLSQVDPELPSLVSVLCDAGAGECYHKHGTFLAHLIDVYRILRLWG 65

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           APD V  CGL+HSAYSNSYVNLAIF P+  RD VRG +G AAE+L+HLFC+VPR  L+HD
Sbjct: 66  APDAVARCGLYHSAYSNSYVNLAIFQPDVARDHVRGIIGAAAERLVHLFCVVPRHQLMHD 125

Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE--EAWRKKINELLPADGLTVKHIKTG 190
           DL   Y+D EL  HL AS  SL  A++ G    +  EAWR K+  ++P +G+  +HI+TG
Sbjct: 126 DLHLRYTDAELTAHLAASQASLDAARKSGGGGGDPGEAWRAKLASVVPPEGVVARHIRTG 185

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           E V +SRRV+  F+++TIADFSDQ   +QD LF+N DG+LE+ G+++ A+WPG GKPGLW
Sbjct: 186 EAVALSRRVLGVFIVMTIADFSDQYTDYQDKLFDNEDGRLEYRGDNWRAVWPGSGKPGLW 245

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
           M+++SR+ A+Y LIA +EEI    R+ +A    V   K  D ++EL +PPVFE C+++LD
Sbjct: 246 MSAMSRLAAVYRLIATDEEI----RELTAGERSVV--KREDAELELVIPPVFERCSKVLD 299

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           A EQ E RDLYWEAVCG  +       K E LL R IEKNPFVGEP +VL Q  L  G  
Sbjct: 300 AAEQKEGRDLYWEAVCGEWEEG-----KTERLLRRSIEKNPFVGEPWLVLAQALLNSGHD 354

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EEAE  A +G+ L++EWGS WDKRMSWEG
Sbjct: 355 EEAEAAAEEGVRLVVEWGSSWDKRMSWEG 383


>gi|242063596|ref|XP_002453087.1| hypothetical protein SORBIDRAFT_04g038120 [Sorghum bicolor]
 gi|241932918|gb|EES06063.1| hypothetical protein SORBIDRAFT_04g038120 [Sorghum bicolor]
          Length = 431

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 215/406 (52%), Positives = 294/406 (72%), Gaps = 13/406 (3%)

Query: 1   MPSQQGNINNDY------LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKH 54
           M +QQG  + D       L +L+A A+PFLRGELE ID  +PS +++LRS GAGE  HK+
Sbjct: 1   MATQQGFSSTDLKTHRKNLPSLLAVAQPFLRGELEKIDPEIPSFLSILRSAGAGERHHKN 60

Query: 55  GSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAA 114
           G+FL HL++ +RI+++W AP  +  CGLFHS+Y+NSYVN++IF+ +T R+ V+  +G   
Sbjct: 61  GAFLAHLLNFHRIVRLWGAPSDIARCGLFHSSYANSYVNVSIFESSTTREDVQRLIGAPV 120

Query: 115 EKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKIN 174
           E+L  LFC VPR  LIH++L F+Y+D EL +HL ASD+S+  A+E G F+A EAWR+KI 
Sbjct: 121 ERLAFLFCAVPRHKLIHEELHFQYTDTELRDHLAASDLSIKAARETGTFDASEAWREKIC 180

Query: 175 ELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAG 234
           +LLP  G+  +H KTGE + +SRR++A FV++TIAD  DQ   +QD L++N +G+LEF G
Sbjct: 181 KLLPPKGMEARHFKTGEPISLSRRIIALFVLMTIADICDQYIDYQDKLYDNENGRLEFRG 240

Query: 235 NSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDI 294
           +++ ALWPG  KPGLWMN+ SR+G +Y+LI REEE++ QER      +G  +  +RDE+I
Sbjct: 241 DNWGALWPGTCKPGLWMNAASRLGVLYNLIIREEELYTQERNK----MGETIRLDRDEEI 296

Query: 295 ELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEK-LEKAEELLLRCIEKNPFV 353
            L +PPVF  CT++LD KEQI ARDLYWEA+C  SD  KE+  EK E++LL  I+KNPFV
Sbjct: 297 MLVIPPVFNYCTKVLDPKEQIAARDLYWEAIC--SDDRKERDWEKVEKVLLESIQKNPFV 354

Query: 354 GEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           GEP +V  Q+YL    + EA+ EA +GL LLLEWG  WDKRM+WE 
Sbjct: 355 GEPRLVAAQVYLNMERYAEAKEEAEEGLKLLLEWGISWDKRMTWES 400


>gi|242063598|ref|XP_002453088.1| hypothetical protein SORBIDRAFT_04g038130 [Sorghum bicolor]
 gi|241932919|gb|EES06064.1| hypothetical protein SORBIDRAFT_04g038130 [Sorghum bicolor]
          Length = 455

 Score =  423 bits (1087), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 209/393 (53%), Positives = 279/393 (70%), Gaps = 18/393 (4%)

Query: 13  LETLVASARPFLRGEL-EAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           + +L+ +ARPFLRGEL  A D  LPSL+AVLR+ GAGEC+HKHG+FL HL D+YRIL++W
Sbjct: 36  VSSLLDAARPFLRGELGSAEDPELPSLVAVLRAAGAGECYHKHGTFLAHLEDVYRILRLW 95

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
            A D V  CGLFHS+YSNSYV+LAIF P+TGR  VRG +G  AE+L+HLFC+VPR  L+ 
Sbjct: 96  GASDAVARCGLFHSSYSNSYVDLAIFQPDTGRAQVRGIIGADAERLVHLFCVVPRHHLVF 155

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEK-GLFNAEEAWRKKINELLPADGLTVKHIKTG 190
             L   Y+DQEL +HL A++     A++  GL      WR+K+  ++PA+G+ V HI+TG
Sbjct: 156 QQLQPRYTDQELRDHLAAAEAEAEIAQQPGGLLTTMSPWRQKLRSVVPAEGVVVSHIRTG 215

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           E V +SRRVVA F+++T+ADFSDQ   +QD LF+N DG+LEF G++++ALWPG GKPGLW
Sbjct: 216 EPVRLSRRVVAAFLLMTVADFSDQYTDYQDDLFDNDDGRLEFRGDNWAALWPGTGKPGLW 275

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDI----ELPVPPVFENCT 306
           ++++SR+ A+Y LIAR+E++ +           +G  +  D+D+    EL +PPVFE C+
Sbjct: 276 VSAMSRLAALYGLIARDEQLIM----------AIGEQQHDDDDVAAVGELVIPPVFERCS 325

Query: 307 RILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLA 366
           R+LD  EQ  ARDLYWEA+  ++   K K    E LL   I KNP VGEPH+VL Q+ L 
Sbjct: 326 RVLDPGEQKAARDLYWEAITSSTKDDKRK--DPEALLRESIAKNPLVGEPHLVLAQVLLN 383

Query: 367 KGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
            G + EA   A +G+ LLL+WGS WDKRMSWEG
Sbjct: 384 GGRYAEAAAAAARGVRLLLQWGSSWDKRMSWEG 416


>gi|357118082|ref|XP_003560788.1| PREDICTED: uncharacterized protein LOC100845551 [Brachypodium
           distachyon]
          Length = 423

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 211/405 (52%), Positives = 289/405 (71%), Gaps = 19/405 (4%)

Query: 1   MPSQ----QGNINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGS 56
           MP+      G + +   + L+A ARPFLRG+L A+D  LPSL++VL + GAGEC+HKHG+
Sbjct: 1   MPTNGDGISGGVISSGADALLAVARPFLRGDLGAVDPELPSLVSVLVAAGAGECYHKHGT 60

Query: 57  FLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEK 116
           FL HL+D++RIL++W APD V  CGL+HS+YSNSYVNLAIF P+  R  VR  VG  AE+
Sbjct: 61  FLAHLIDVFRILRLWAAPDAVARCGLYHSSYSNSYVNLAIFPPDISRGRVRAIVGAPAER 120

Query: 117 LIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINEL 176
           L+HLFC+VPR  L+H +L   Y+D EL  HL  +    +   ++G F+    WR K+  +
Sbjct: 121 LVHLFCVVPRHALLHQNLHLRYADAELRAHLARA----VAGDDEGYFS---EWRVKLRSV 173

Query: 177 LPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLF-NNSDGKLEFAGN 235
           +PA+G+  +HI+TGE V +SR+V+A FV++TIADFSDQ   +QD LF  N DG+ EFAG+
Sbjct: 174 VPAEGVVAEHIRTGEPVALSRKVLAAFVLMTIADFSDQYTDYQDKLFGGNEDGRFEFAGD 233

Query: 236 SFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIE 295
            ++ALWPG GKPGLWM+++SR+  +Y LIA +E + ++E       +G  V+ E D  +E
Sbjct: 234 DWAALWPGTGKPGLWMSAMSRLATLYRLIATDEALRLREED-----MGRPVNDE-DAGLE 287

Query: 296 LPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEK-LEKAEELLLRCIEKNPFVG 354
           L +PPVF++C+++LD  EQ+ ARDLYWEA+C + D +K+K     EELL R IEKNP+VG
Sbjct: 288 LVIPPVFDHCSKVLDPGEQVTARDLYWEAICSDEDMAKKKNTVVVEELLRRSIEKNPYVG 347

Query: 355 EPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           EP +VL Q+ L  G +EEAE EA +GL L+LEWGS WDKRM+WEG
Sbjct: 348 EPWLVLAQVLLNAGRYEEAEEEAEQGLKLVLEWGSSWDKRMTWEG 392


>gi|168023750|ref|XP_001764400.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684264|gb|EDQ70667.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 417

 Score =  368 bits (944), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 183/389 (47%), Positives = 255/389 (65%), Gaps = 10/389 (2%)

Query: 11  DYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKI 70
           D L   +A  RPFLR + E+ID  LP L+ +L + GAGECWHKHG+F EHL+  YRILK+
Sbjct: 8   DPLAAKIAQCRPFLREDYESIDSRLPGLLKILHARGAGECWHKHGTFYEHLLHTYRILKL 67

Query: 71  WKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
           W + D V L GLFHS+YSNSYVNLAIF P+  R V+RG +GD AE+L+H+FC++PRQ LI
Sbjct: 68  WGSSDTVALFGLFHSSYSNSYVNLAIFAPDVDRSVLRGLIGDEAEELVHIFCVIPRQELI 127

Query: 131 HDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG 190
            D LL   +D +L+  LK    + +   E    NA+    K+    +P +G+ VKHI+TG
Sbjct: 128 FDRLLHTITDDDLIASLKQYQGTKVGESEA---NAKGGHDKR-RLFVPKEGMVVKHIRTG 183

Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
           ED+ + RR+VA FV+LT+ADF DQ +G+QD LF N DG+  ++GN + ALWPG+ KPGLW
Sbjct: 184 EDIFLPRRLVALFVLLTMADFVDQFYGWQDQLFRNDDGEFRYSGNDWHALWPGNCKPGLW 243

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
           ++  SRMG + +++  E+       +    +V  G+ +     +E+ +PP+F  CT IL 
Sbjct: 244 ISIASRMGVLLNILLLED----AAEQAELHSVPNGLPERPFGTLEISLPPIFNMCTEILP 299

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
              Q  + + YWEAV  N +G    +  A  LL  C + NPFV EPH++L QI L +G++
Sbjct: 300 PVNQNRSLEQYWEAV--NIEGRDRLVNTAVPLLQSCCKLNPFVAEPHLLLAQILLGEGQY 357

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           E AE EA +G+ LLL+W + WDKRM+WEG
Sbjct: 358 ELAETEADEGVRLLLDWATNWDKRMTWEG 386


>gi|116791245|gb|ABK25908.1| unknown [Picea sitchensis]
          Length = 303

 Score =  352 bits (902), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 169/301 (56%), Positives = 215/301 (71%), Gaps = 34/301 (11%)

Query: 8   INNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRI 67
           + N  L T +  ARPFLRG  + +D +LP  + VLR VGA ECWHKHGSF +HLVD+YRI
Sbjct: 1   MANSELPTRLERARPFLRGAYDEVDPSLPEFMRVLRQVGASECWHKHGSFFDHLVDVYRI 60

Query: 68  LKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
           LK+W APD V  CGLFHSAYSNSYVNLAIF+P+ GR  VR  +GD+AE+L+H+FCIVPRQ
Sbjct: 61  LKLWNAPDSVARCGLFHSAYSNSYVNLAIFEPDVGRQRVRDLIGDSAEELVHMFCIVPRQ 120

Query: 128 PLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGL----FNAEEA--------------- 168
           PLIHDDLLF ++D+ELV  L+ S+ ++  A+ + L       EE                
Sbjct: 121 PLIHDDLLFHFTDEELVSALEESEKTMKEAEAEDLELQKLKLEEKNGDWLDNIHDKVEIH 180

Query: 169 ---------------WRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSD 213
                          WR+KI ++LPA G+ VKHIK GEDV V RR+VA F+ +T+ADFSD
Sbjct: 181 KTHFKETLEGFILPEWRRKIQKILPAHGVRVKHIKNGEDVEVPRRLVAIFLFMTMADFSD 240

Query: 214 QLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQ 273
           QLF FQD LF+NS+G +EFAGN+++ALWPGDGKPGLW+NS+SRMGA+Y+LI REE++F +
Sbjct: 241 QLFSFQDVLFDNSNGIMEFAGNTWTALWPGDGKPGLWLNSISRMGALYTLIVREEKLFQR 300

Query: 274 E 274
           E
Sbjct: 301 E 301


>gi|302810161|ref|XP_002986772.1| hypothetical protein SELMODRAFT_124836 [Selaginella moellendorffii]
 gi|300145426|gb|EFJ12102.1| hypothetical protein SELMODRAFT_124836 [Selaginella moellendorffii]
          Length = 382

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 193/380 (50%), Positives = 249/380 (65%), Gaps = 32/380 (8%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR FLR E  AID  LP  ++VLRS GA EC+HKHG+F +HLV++YRILK+W APD VC 
Sbjct: 4   ARLFLREEYAAIDPALPHWLSVLRSTGASECFHKHGTFFQHLVEVYRILKLWGAPDDVCR 63

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
           CGLFHS+YSNSYVNLAIF+P   R  V+  +GD AE+L+ LFCIVPRQ LIHD LLF +S
Sbjct: 64  CGLFHSSYSNSYVNLAIFEPGVDRSRVKDLIGDPAEQLVELFCIVPRQELIHDMLLFRFS 123

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
           D +L E   ASD S                 KK   LLP   +TVK+I+ GEDV + RR+
Sbjct: 124 DDQLQEA-AASDRSC----------------KKSKLLLPDGFITVKNIRNGEDVRLPRRI 166

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGA 259
           VA FV+LT+ADF DQL+ +QD LF+N DG+L F+GN + ALWPGD +PGLWM+S+SRMG 
Sbjct: 167 VAVFVLLTMADFCDQLYNWQDNLFDNRDGELRFSGNHYWALWPGDVQPGLWMSSISRMGL 226

Query: 260 IYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARD 319
           +   +  EEE   Q+     A             I L +PP+F+ C+++L   +QI ARD
Sbjct: 227 LLKGLLNEEE---QDNGICDAPAA----------IALKLPPLFDGCSKLLRPVDQIAARD 273

Query: 320 LYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGK 379
           LY EAV   +D  ++  +     L    E+NP++GEPH+VL Q+   +G FEEA  EA  
Sbjct: 274 LYLEAV--GADLKRQGFDWIAGKLRGACERNPWIGEPHLVLAQLCCLEGAFEEAAVEAAL 331

Query: 380 GLMLLLEWGSPWDKRMSWEG 399
           GL  LL+WG+ WDKR++WE 
Sbjct: 332 GLERLLQWGTAWDKRVAWEA 351


>gi|302772192|ref|XP_002969514.1| hypothetical protein SELMODRAFT_91578 [Selaginella moellendorffii]
 gi|300162990|gb|EFJ29602.1| hypothetical protein SELMODRAFT_91578 [Selaginella moellendorffii]
          Length = 382

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 193/380 (50%), Positives = 249/380 (65%), Gaps = 32/380 (8%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR FLR E  AID  LP  ++VLRS GA EC+HKHG+F +HLV++YRILK+W APD VC 
Sbjct: 4   ARLFLREEYAAIDPALPHWLSVLRSTGASECFHKHGTFFQHLVEVYRILKLWGAPDDVCR 63

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
           CGLFHS+YSNSYVNLAIF+P   R  V+  +GD AE+L+ LFCIVPRQ LIHD LLF +S
Sbjct: 64  CGLFHSSYSNSYVNLAIFEPGVDRSRVKDLIGDPAEQLVELFCIVPRQELIHDMLLFRFS 123

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
           D +L E   ASD S                 KK   LLP   +TVK+I+ GEDV + RR+
Sbjct: 124 DDQLQEA-AASDRS----------------GKKSKLLLPDGFITVKNIRNGEDVRLPRRI 166

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGA 259
           VA FV+LT+ADF DQL+ +QD LF+N DG+L F+GN + ALWPGD +PGLWM+S+SRMG 
Sbjct: 167 VAVFVLLTMADFCDQLYNWQDNLFDNRDGELRFSGNHYWALWPGDVQPGLWMSSISRMGL 226

Query: 260 IYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARD 319
           +   +  EEE   Q+     A             I L +PP+F+ C+++L   +QI ARD
Sbjct: 227 LLKGLLNEEE---QDNGICDAPAA----------IALKLPPLFDGCSKLLRPVDQIAARD 273

Query: 320 LYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGK 379
           LY EAV   +D  ++  +     L    E+NP++GEPH+VL Q+   +G FEEA  EA  
Sbjct: 274 LYLEAV--GADLKRQGFDWIAGKLRGACERNPWIGEPHLVLAQLCCLEGAFEEAAVEAAL 331

Query: 380 GLMLLLEWGSPWDKRMSWEG 399
           GL  LL+WG+ WDKR++WE 
Sbjct: 332 GLERLLQWGTAWDKRVAWEA 351


>gi|297742731|emb|CBI35365.3| unnamed protein product [Vitis vinifera]
          Length = 361

 Score =  340 bits (872), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 168/254 (66%), Positives = 192/254 (75%), Gaps = 43/254 (16%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L+TL+ASARPFLRGELEA+   LPSLI+VLRSVGAGECWHKHGSFLEHL D+Y+ILK+W
Sbjct: 78  HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLFDMYKILKLW 137

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           KA + VCLCGLFHSAYSNSYVNLAIFDP+TGRD VR HVGDAAE+LI+LFC+VPRQPLIH
Sbjct: 138 KAQESVCLCGLFHSAYSNSYVNLAIFDPSTGRDTVRAHVGDAAEELIYLFCVVPRQPLIH 197

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
           DDLLF YSD ELVEHLKAS+VS+ NAKE G+FN +EAWRKK+N L+P  G+TVKHIKTGE
Sbjct: 198 DDLLFRYSDDELVEHLKASEVSVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 257

Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
           DV VSRRVVA F                                           PGLW+
Sbjct: 258 DVLVSRRVVAIF-------------------------------------------PGLWL 274

Query: 252 NSLSRMGAIYSLIA 265
           NS+SRMGA+YSL+ 
Sbjct: 275 NSISRMGAVYSLLV 288



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/34 (73%), Positives = 28/34 (82%)

Query: 336 LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
           L+ AE+LL RC+EKNPFVGEPHVVL QIYL  G 
Sbjct: 303 LDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGR 336


>gi|297742729|emb|CBI35363.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score =  337 bits (864), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 164/254 (64%), Positives = 193/254 (75%), Gaps = 42/254 (16%)

Query: 12  YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           +L+TL+ASARPFLRGELEA+   LPSLI+VLRSVGAGECWHKHGSFLEHLVD+Y+ILK+W
Sbjct: 14  HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLVDMYKILKLW 73

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           KA + VCLCGLFHSAYSNSYVNLAIF+P+TGRD VR HV                     
Sbjct: 74  KAQESVCLCGLFHSAYSNSYVNLAIFEPSTGRDTVRAHV--------------------- 112

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
                                S+ NAKE G+FN +EAWRKK+N L+P  G+TVKHIKTGE
Sbjct: 113 ---------------------SVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 151

Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
           DV VSRRVVA FVM+T+ADFSDQ +GFQD LF+NS+G+LEF+GNS+ ALWPGDGKPGLW+
Sbjct: 152 DVLVSRRVVAIFVMMTMADFSDQHYGFQDALFDNSNGRLEFSGNSYGALWPGDGKPGLWL 211

Query: 252 NSLSRMGAIYSLIA 265
           NS+SRMGA+YSL+ 
Sbjct: 212 NSISRMGAVYSLLV 225



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/34 (73%), Positives = 28/34 (82%)

Query: 336 LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
           L+ AE+LL RC+EKNPFVGEPHVVL QIYL  G 
Sbjct: 240 LDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGR 273


>gi|325930777|gb|ADZ45830.1| TPR1 [Glycine max]
          Length = 268

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 163/244 (66%), Positives = 196/244 (80%), Gaps = 11/244 (4%)

Query: 159 EKGLFNAE---EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQL 215
           EKGLF+ +   E WRKK+  L+PA+G+ VKHI+TGE V VSRR+VA F+M+T+ADF DQL
Sbjct: 2   EKGLFDDDGELEGWRKKLQGLVPAEGVQVKHIRTGEGVHVSRRIVAVFIMMTMADFCDQL 61

Query: 216 FGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQER 275
           FGFQD LF+N++G+LEF+GN+F ALWPGDGKPGLW+NS+SRMGA+Y+LIAREEEIFIQER
Sbjct: 62  FGFQDLLFDNANGRLEFSGNNFGALWPGDGKPGLWLNSISRMGAVYTLIAREEEIFIQER 121

Query: 276 KTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEK 335
           K     V V  D ER+EDIEL +PPVF+ C ++L+A +QI ARDLYWEAVC       E 
Sbjct: 122 KRK-VGVAVVPDLERNEDIELVLPPVFDYCRKVLEAGDQIVARDLYWEAVC-------EG 173

Query: 336 LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRM 395
             KAEELLL  IEKNPFVGEP+VVL Q+YL +G FEEAE+ A +GL LLLEWG PWDKR 
Sbjct: 174 GSKAEELLLESIEKNPFVGEPYVVLSQVYLTEGRFEEAEKHAERGLKLLLEWGCPWDKRT 233

Query: 396 SWEG 399
           SWEG
Sbjct: 234 SWEG 237


>gi|302851869|ref|XP_002957457.1| hypothetical protein VOLCADRAFT_98509 [Volvox carteri f.
           nagariensis]
 gi|300257261|gb|EFJ41512.1| hypothetical protein VOLCADRAFT_98509 [Volvox carteri f.
           nagariensis]
          Length = 427

 Score =  307 bits (786), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 166/379 (43%), Positives = 222/379 (58%), Gaps = 32/379 (8%)

Query: 21  RPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLC 80
           R FLRG+L  ID +LP  + VL SVGAGECWHK  +F  HL ++Y+I KIW   D VC C
Sbjct: 32  RAFLRGDLSRIDPHLPEWLDVLSSVGAGECWHK--NFKSHLYEVYKICKIWGQDDDVCKC 89

Query: 81  GLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSD 140
            LFHSAYSNSYVNLAIF  +T R  +R  VG +AE LIH FC+VPR  L++DDLL     
Sbjct: 90  ALFHSAYSNSYVNLAIFKESTDRTRLRELVGHSAEDLIHTFCVVPRHNLVYDDLLARAPS 149

Query: 141 QELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVV 200
            E +  L        +                +  L+P +G+TV+HIKTGE + V R  V
Sbjct: 150 AEDLAALGVGGAGGDDGAAAAAAP------PSLRRLVPPEGITVRHIKTGEPLTVDRLTV 203

Query: 201 ATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAI 260
           A F+++T+ADF +QLF +QD +F N +G+L +AG +  +LWPG  KPGLW ++LSRMGA+
Sbjct: 204 ARFLVMTMADFCEQLFSWQDSMFENHNGRLLYAGTNHGSLWPGACKPGLWHSALSRMGAL 263

Query: 261 YSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDL 320
                   E             G  +         +P+PPVFE CT+++   +Q+ ARD 
Sbjct: 264 LRFACTGPE-------------GRPL---------VPLPPVFEGCTQVITEADQLAARDA 301

Query: 321 YWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKG 380
           YW AV   ++  K   ++A  LL      NP V EPHV+L QIY  +G++ +  R A + 
Sbjct: 302 YWVAVSECTETHKH--DEALRLLRTACWHNPHVAEPHVLLAQIYCQRGQWVDGRRHADRA 359

Query: 381 LMLLLEWGSPWDKRMSWEG 399
           + LL  WG+ WDKRMSWE 
Sbjct: 360 VRLLGCWGTAWDKRMSWEA 378


>gi|440792871|gb|ELR14079.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 377

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 157/385 (40%), Positives = 213/385 (55%), Gaps = 62/385 (16%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           E +   AR  LR EL   D  LP+L+  L   GA ECWHKHG+F +HL  ++RIL +WK 
Sbjct: 15  ELVPEHARALLREELHHCDPKLPALLEQLGKHGASECWHKHGTFKQHLFGVWRILYLWKQ 74

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
              VC CGL+HSAYSNSYVNLAIF P+TGR+ VR  VG+ AE+LIH FC +PRQ LI D 
Sbjct: 75  SQAVCRCGLYHSAYSNSYVNLAIFKPDTGRETVRSAVGNEAEELIHFFCTIPRQELIFDH 134

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG-ED 192
           +L  + D+ +                                  P +G+ V+HI+   E 
Sbjct: 135 ILPLHPDKPI----------------------------------PKEGVEVRHIRNKDEK 160

Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
           + VS +++   ++LTIADF DQ +G+QD LF N +G L  +G++ + LWP  GKPGLWM+
Sbjct: 161 IRVSPKLMKISLLLTIADFCDQWYGWQDLLFAN-EGGLTMSGDNPATLWPSTGKPGLWMS 219

Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
            LS+M  +   I  +E                  D E       P+PP+F NC+ +L+  
Sbjct: 220 RLSQMAVLSRRIHEQEP-----------------DDE-------PLPPIFNNCSTVLEPD 255

Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
            + EARDLYW  V   ++  + +    +  L + I  NPFV EP V+L QIY   G +EE
Sbjct: 256 SEREARDLYWRVVMHKTEQEQHEEAAED--LRKAIALNPFVAEPRVLLAQIYNVSGRYEE 313

Query: 373 AEREAGKGLMLLLEWGSPWDKRMSW 397
           AE +A + L LL EWG+ WDK   W
Sbjct: 314 AESQAQEALRLLFEWGTAWDKVPPW 338


>gi|357438997|ref|XP_003589775.1| hypothetical protein MTR_1g039140 [Medicago truncatula]
 gi|355478823|gb|AES60026.1| hypothetical protein MTR_1g039140 [Medicago truncatula]
          Length = 253

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 129/193 (66%), Positives = 155/193 (80%), Gaps = 6/193 (3%)

Query: 208 IADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIARE 267
           +ADF DQLFGFQD LF N DG+LEF GN+F A+WPG+GKPGLW+NS+SRMGA+Y+LI RE
Sbjct: 1   MADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLWLNSISRMGAVYNLILRE 60

Query: 268 EEIFIQERKTSAAAVGV-GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVC 326
           EEIF++E+K      GV GVD ERDE IEL +PPVF  CT++LDA++QI ARDLYWEA+ 
Sbjct: 61  EEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVLDARDQIVARDLYWEAMI 120

Query: 327 GNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLE 386
                 +E LEK EELL++ IEKNPFVGEP+VVL Q+YL KG FEE E+EA +GL LLLE
Sbjct: 121 -----CEEGLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGRFEEGEKEAERGLTLLLE 175

Query: 387 WGSPWDKRMSWEG 399
           WG  WDKR+SWEG
Sbjct: 176 WGCHWDKRISWEG 188


>gi|159467587|ref|XP_001691973.1| hypothetical protein CHLREDRAFT_170872 [Chlamydomonas reinhardtii]
 gi|158278700|gb|EDP04463.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 347

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 136/335 (40%), Positives = 190/335 (56%), Gaps = 61/335 (18%)

Query: 21  RPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLC 80
           R FLRG+L  +DK LP  + VL  VGAGECWHK  SF  HL ++Y++ KIW   D VC C
Sbjct: 27  RAFLRGDLSRVDKRLPEWMEVLSGVGAGECWHKKSSFKAHLFEVYKLCKIWGQDDDVCKC 86

Query: 81  GLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSD 140
            LFHSAYSNSYVNLAIF     R  ++  +G +AE+LIH FC+VPR  L++DD+L     
Sbjct: 87  ALFHSAYSNSYVNLAIFQEGADRGRLKELLGPSAEELIHTFCVVPRHNLVYDDVLAAAP- 145

Query: 141 QELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVV 200
                   ASD++ +                            VKHIKTGE + V R+ +
Sbjct: 146 -------SASDMAAM----------------------------VKHIKTGEPLLVPRKTI 170

Query: 201 ATFVMLTIADFSDQLFGFQDFLF-NNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGA 259
           A F+ +TIADF++QLF +QD +F N  DG+L +AG +  +LWPG  KPGLW ++LSRM  
Sbjct: 171 AQFLAMTIADFAEQLFSWQDAMFENEGDGRLLYAGGNARSLWPGPCKPGLWHSALSRM-- 228

Query: 260 IYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARD 319
             +L+ R                  G+     + + +P+PPVFE CT++L   +Q+ ARD
Sbjct: 229 --ALMLRH-----------------GLTDGSGQPL-VPLPPVFERCTQVLTEADQLAARD 268

Query: 320 LYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVG 354
            YW AVC +++  +   ++A  LL   ++ NP + 
Sbjct: 269 AYWRAVCDHTEPHQH--DEAARLLRAAVDHNPHIA 301


>gi|222623967|gb|EEE58099.1| hypothetical protein OsJ_08972 [Oryza sativa Japonica Group]
          Length = 656

 Score =  234 bits (597), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 107/187 (57%), Positives = 140/187 (74%), Gaps = 2/187 (1%)

Query: 13  LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
           ++ +VA+ARPFLRG+L  +D  LPSL++VL   GAGEC+HKHG+FL HL+D+YRIL++W 
Sbjct: 6   VDAVVAAARPFLRGDLSQVDPELPSLVSVLCDAGAGECYHKHGTFLAHLIDVYRILRLWG 65

Query: 73  APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           APD V  CGL+HSAYSNSYVNLAIF P+  RD VRG +G AAE+L+HLFC+VPR  L+HD
Sbjct: 66  APDAVARCGLYHSAYSNSYVNLAIFQPDVARDHVRGIIGAAAERLVHLFCVVPRHQLMHD 125

Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE--EAWRKKINELLPADGLTVKHIKTG 190
           DL   Y+D EL  HL AS  SL  A++ G    +  EAWR K+  ++P +G+  +HI+TG
Sbjct: 126 DLHLRYTDAELTAHLAASQASLDAARKSGGGGGDPGEAWRAKLASVVPPEGVVARHIRTG 185

Query: 191 EDVGVSR 197
           E V +SR
Sbjct: 186 EAVALSR 192


>gi|212722138|ref|NP_001131675.1| uncharacterized protein LOC100193035 [Zea mays]
 gi|194692216|gb|ACF80192.1| unknown [Zea mays]
 gi|195637840|gb|ACG38388.1| hypothetical protein [Zea mays]
 gi|223948543|gb|ACN28355.1| unknown [Zea mays]
 gi|413939580|gb|AFW74131.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
          Length = 214

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 112/194 (57%), Positives = 141/194 (72%), Gaps = 11/194 (5%)

Query: 206 LTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIA 265
           +TIADFSDQLF +QD LF+NS+G+LEF+GN++++LWPG GKPGLW+ S SRM A+Y+LI 
Sbjct: 1   MTIADFSDQLFDWQDRLFDNSNGRLEFSGNTWTSLWPGTGKPGLWVTSSSRMAALYTLIV 60

Query: 266 REEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAV 325
           REEEI I +R              RDED++L +PPVF  CTR+L+A++Q  ARDLYWEAV
Sbjct: 61  REEEIHITQRARDTHG--------RDEDMDLVIPPVFNGCTRLLNAEDQKVARDLYWEAV 112

Query: 326 CGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLL 385
           CG          K E+LL + I KNPFVGEPH+VL Q+YL    + EA+ +A +GL LLL
Sbjct: 113 CGE---EASDWLKVEQLLQQSIAKNPFVGEPHLVLAQVYLNMERYGEAQMQAEEGLRLLL 169

Query: 386 EWGSPWDKRMSWEG 399
           EWGS WDKRM WEG
Sbjct: 170 EWGSSWDKRMPWEG 183


>gi|413939579|gb|AFW74130.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
          Length = 195

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 105/167 (62%), Positives = 132/167 (79%), Gaps = 1/167 (0%)

Query: 22  PFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCG 81
           PFLRGE   +D  LP L AVLR  GAGECWHKHG+FL HL+D++RIL++W APD V  CG
Sbjct: 30  PFLRGEEAEVDPALPGLAAVLRGAGAGECWHKHGTFLAHLLDVHRILRLWGAPDAVARCG 89

Query: 82  LFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQ 141
           L+HSAYSNSYVNLAIF P+TGR  V   VG  AE+L+HLFC+VPRQ L+HDDLLF YSDQ
Sbjct: 90  LYHSAYSNSYVNLAIFQPDTGRAHVAAVVGGPAERLVHLFCVVPRQQLVHDDLLFHYSDQ 149

Query: 142 ELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIK 188
           +L + L +S+ S+ +A+ +G+F+  + WR+KI  LLPADG+TVKHI+
Sbjct: 150 DLADDLASSEESVRDAR-RGVFDDAQPWRRKIQRLLPADGITVKHIR 195


>gi|219118345|ref|XP_002179949.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409002|gb|EEC48935.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 393

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 129/367 (35%), Positives = 189/367 (51%), Gaps = 65/367 (17%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
           I+ LRS GA  CWHKH +FLEHL+ ++ IL++W     +   GLFHSAYSNSYVNLA+FD
Sbjct: 48  ISALRSRGAHRCWHKHSTFLEHLLSVHNILRLWGQGRTIGRVGLFHSAYSNSYVNLALFD 107

Query: 99  PNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAK 158
           P   R +++  +G+ AE L+HLFCI+ RQ ++ + LL +                     
Sbjct: 108 PAIERGMMKTLIGEDAEVLVHLFCIIDRQQVVVNTLLAQ--------------------- 146

Query: 159 EKGLFNAEEAWRKKINELLPADGLTVKHIK-TGEDVGVSRRVVATFVMLTIADFSDQLFG 217
                             +P  GLTV H++ + + V ++   +   V+ T+AD +DQ FG
Sbjct: 147 ----------------GFIPKQGLTVPHLRDSSKSVFLNSETLRMLVVFTMADIADQYFG 190

Query: 218 FQDFLFNNSD--------GKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEE 269
           +QD LF            GK + A +  +ALWPG  KPGLWM+ LS++  +         
Sbjct: 191 WQDTLFGGGGQEGSMIIPGKDDVARHDTTALWPGVSKPGLWMSYLSQLAEV-------AR 243

Query: 270 IFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNS 329
            F+   +         V++  DED     PPVF N T+ L   +++ ARD YW  V G  
Sbjct: 244 TFVPSTQD--------VNRSADEDRTADFPPVFANGTQSLSVSDEVAARDFYWSVVSGEV 295

Query: 330 DGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGS 389
             + E +E    +L  C++ NP+  EP V+L Q +L  G F+ A     + L L  +WG+
Sbjct: 296 TDNAEAIE----VLQLCVKMNPWAFEPLVILAQKHLHTGNFDSALNTTYRALDLQHQWGT 351

Query: 390 PWDKRMS 396
            WDKR+S
Sbjct: 352 AWDKRLS 358


>gi|407939355|ref|YP_006854996.1| hypothetical protein C380_13285 [Acidovorax sp. KKS102]
 gi|407897149|gb|AFU46358.1| hypothetical protein C380_13285 [Acidovorax sp. KKS102]
          Length = 362

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 116/386 (30%), Positives = 178/386 (46%), Gaps = 69/386 (17%)

Query: 16  LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
           L A A P L  E    D  L  ++  + +   G+ WHK G+F  HLV + R L +W+ P 
Sbjct: 12  LFARALPLLDDEWLTRDPELAPVLPTVLARNVGQDWHKAGTFRHHLVGVTRTLTVWQQPR 71

Query: 76  CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
            V L GL HS Y N++V+L  FDP   R  VR   G++AE L++LFC   R   +     
Sbjct: 72  DVRLLGLLHSVYGNAFVDLVKFDPAKERARVREIAGESAEHLVYLFCTQSRTQFV----- 126

Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
                Q+++ H   +D SL+  K+                              G+D  +
Sbjct: 127 -----QKVLAHALEADGSLVLQKD------------------------------GQDHVL 151

Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNN-SDGKLEFAGNSFSA-LWPGDGKPGLWMNS 253
           +   VA F+++++AD  +Q F +QD +F+   D +       ++A LWPG  +P      
Sbjct: 152 TPYEVAAFIIVSMADTIEQWFSWQDDIFSRFPDVQHRHQKAHWAASLWPGPMRPS----- 206

Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
             RM    + +A+                   +     +D+ LP+PPVF +CT+ L A  
Sbjct: 207 -GRMVHQINGLAK------------------ALQHPGLKDV-LPMPPVFAHCTQHLSAAN 246

Query: 314 QIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
           +  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+YL+ G  E+A
Sbjct: 247 EAAATSLYWSVI--QQDQPLVDLDVATGVLESAVRHNPWVGEPQMVLAQLYLSAGRNEDA 304

Query: 374 EREAGKGLMLLLEWGSPWDKRMSWEG 399
           +  A   L L   WG+ WDKR+ W+ 
Sbjct: 305 KAAAESALHLFSAWGNAWDKRVQWDA 330


>gi|421486214|ref|ZP_15933762.1| hypothetical protein QWC_26353 [Achromobacter piechaudii HLE]
 gi|400195559|gb|EJO28547.1| hypothetical protein QWC_26353 [Achromobacter piechaudii HLE]
          Length = 357

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 162/355 (45%), Gaps = 68/355 (19%)

Query: 47  AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
           AGE WHK G+F  HL+ +YR L +W  P  V L GLFHS Y N YV+L +FD    R  +
Sbjct: 38  AGEDWHKAGTFKHHLLGVYRTLTLWNQPREVRLLGLFHSVYGNEYVDLTLFDRERERSTL 97

Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
           R  +G  AE+ + LFC +PR   +   L                       + +G  N  
Sbjct: 98  REVLGTEAEEWVSLFCAMPRTKFVQAVL-----------------------QGQGAGN-- 132

Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
                        DGLT++    G+    + R VA F++++ AD  +Q   +QD +F   
Sbjct: 133 -------------DGLTLEGAD-GQMFTFTPRQVAAFIVVSAADIGEQWHSWQDEIFAGY 178

Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
               + +   N  S+LWPG  KP    N LS +  + + +             S      
Sbjct: 179 PQQQRRDLKTNWASSLWPGPLKPP--SNILSMLSHLLAPL-------------SGMPANT 223

Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
           G+          P+PP F+ C   L  +++  A  LYW+ +      +  +++ A  LL 
Sbjct: 224 GI----------PIPPAFDQCRATLSPRDEAAASALYWQVITRMHPLT--EMDSARHLLE 271

Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
             +  NP+VGEP ++L Q+ L  G+F+ AE+ A  GL  L  WG+ WDKR+ W G
Sbjct: 272 AAVAHNPWVGEPRLLLAQLALTSGDFDAAEQHATAGLAALQAWGTAWDKRIEWAG 326


>gi|365091191|ref|ZP_09328698.1| hypothetical protein KYG_08065 [Acidovorax sp. NO-1]
 gi|363416309|gb|EHL23429.1| hypothetical protein KYG_08065 [Acidovorax sp. NO-1]
          Length = 362

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 116/390 (29%), Positives = 180/390 (46%), Gaps = 77/390 (19%)

Query: 16  LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
           L A A P L  +  A D  L  ++ ++ +   G+ WHK G+F  HLV + R L +W+ P 
Sbjct: 12  LFARALPLLDDDWLARDAELAPVLPMVLARNVGQDWHKAGTFRHHLVGVTRTLTVWQQPR 71

Query: 76  CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
            V L GL HS Y N++V+L  FDP   R  VR   G++AE L++LFC   R   +     
Sbjct: 72  DVRLLGLLHSVYGNAFVDLVKFDPAKERARVREIAGESAEHLVYLFCTQSRTQFV----- 126

Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
                Q+++ H   +D SL+  K+            + + L P +               
Sbjct: 127 -----QKVLAHALEADGSLVLQKD-----------GQHHVLTPCE--------------- 155

Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFN------NSDGKLEFAGNSFSALWPGDGKPGL 249
               VA F+++++AD  +Q F +QD +F+      +   K  +A    ++LWPG  +P  
Sbjct: 156 ----VAAFIIVSMADTIEQWFSWQDDIFSRFPEVQHRHQKAHWA----ASLWPGPMRPS- 206

Query: 250 WMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRIL 309
                 RM    + +A+     +Q                      LP+PPVF+NCT+ L
Sbjct: 207 -----GRMVHQINGLAKA----LQHPGLKGL---------------LPMPPVFDNCTQHL 242

Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
            A  +  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+YL+ G 
Sbjct: 243 SAANEAAATSLYWSVI--QQDQPLVDLDVATGVLESAVRHNPWVGEPQMVLAQLYLSAGR 300

Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
            ++A+  A   L L   WG+ WDKR+ W+ 
Sbjct: 301 KDDAKAAAESALHLFSAWGNAWDKRVQWDA 330


>gi|427817671|ref|ZP_18984734.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
 gi|410568671|emb|CCN16724.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
          Length = 357

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 11  ARALLADHYAAIDADLPTLLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERATLRRYLGEEAEQWVHLFCAMPRTQFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326


>gi|33599383|ref|NP_886943.1| hypothetical protein BB0394 [Bordetella bronchiseptica RB50]
 gi|33566979|emb|CAE30892.1| conserved hypothetical protein [Bordetella bronchiseptica RB50]
          Length = 357

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 11  ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326


>gi|427812647|ref|ZP_18979711.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
 gi|410563647|emb|CCN21182.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
          Length = 357

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 11  ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 123 -------------RILAGEGRG-----------------ATGLVLQGAD-GQPLTLTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326


>gi|410418190|ref|YP_006898639.1| hypothetical protein BN115_0379 [Bordetella bronchiseptica MO149]
 gi|408445485|emb|CCJ57135.1| conserved hypothetical protein [Bordetella bronchiseptica MO149]
          Length = 357

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 11  ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERATLRRYLGEEAEQWVHLFCAMPRTQFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326


>gi|427823827|ref|ZP_18990889.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
 gi|410589092|emb|CCN04157.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
          Length = 357

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 181/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 11  ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F  CT +LDA ++  A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGRCTAVLDAGDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326


>gi|33594317|ref|NP_881961.1| hypothetical protein BP3435 [Bordetella pertussis Tohama I]
 gi|384205614|ref|YP_005591353.1| hypothetical protein BPTD_3387 [Bordetella pertussis CS]
 gi|408414206|ref|YP_006624913.1| hypothetical protein BN118_0137 [Bordetella pertussis 18323]
 gi|33564392|emb|CAE43698.1| conserved hypothetical protein [Bordetella pertussis Tohama I]
 gi|332383728|gb|AEE68575.1| hypothetical protein BPTD_3387 [Bordetella pertussis CS]
 gi|401776376|emb|CCJ61562.1| conserved hypothetical protein [Bordetella pertussis 18323]
          Length = 364

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 18  ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 77

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 78  LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 129

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 130 -------------RILAGEGRG-----------------ATGLVLQGAD-GQPLTLTPRQ 158

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 159 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 209

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 210 ARILDMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 253

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 254 AALYWQVITRMHPMT--EMDSAHHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 311

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 312 AAGLAALQAWGTSWDKRIEWSG 333


>gi|33595102|ref|NP_882745.1| hypothetical protein BPP0392 [Bordetella parapertussis 12822]
 gi|33565179|emb|CAE35976.1| conserved hypothetical protein [Bordetella parapertussis]
          Length = 364

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 18  ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 77

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 78  LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 129

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 130 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 158

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 159 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 209

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 210 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 253

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 254 AALYWQVITRMHPMT--EMDSAHHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 311

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 312 AAGLAALQAWGTSWDKRIEWSG 333


>gi|299529997|ref|ZP_07043424.1| hypothetical protein CTS44_04439 [Comamonas testosteroni S44]
 gi|298721977|gb|EFI62907.1| hypothetical protein CTS44_04439 [Comamonas testosteroni S44]
          Length = 365

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/398 (28%), Positives = 183/398 (45%), Gaps = 76/398 (19%)

Query: 10  NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
           N   E L   A+  L  E  + D +L  L+ V+   G G+ WHK G+F  HL  + R L 
Sbjct: 6   NQLDEALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 65

Query: 70  IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           +W+ P  V L GL HS Y N++V+L  FD    R+ ++  +G+ AE L++LFC + R   
Sbjct: 66  LWQQPRDVRLLGLLHSVYGNAFVDLVKFDAGNEREHLKRLIGEQAEHLVYLFCTMSRT-- 123

Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
                       + V+ L A D++                         ADG +++  + 
Sbjct: 124 ------------QFVQKLLAGDIA-------------------------ADG-SMQLERN 145

Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWP 242
           G    E + ++   VA F ++++AD  +Q F +Q+ +++     D   + A +  ++LWP
Sbjct: 146 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPAVDHSRQQAVHWAASLWP 205

Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDE-DIELPVPPV 301
           G  +P       SRM +  S                    G+G   +      +LP+PPV
Sbjct: 206 GPMRPS------SRMLSQIS--------------------GLGQALQHPALKTQLPLPPV 239

Query: 302 FENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLG 361
           F NC+++L A  +  A  LYW  +    D     L+ A   L + +  NP+VGEP +VL 
Sbjct: 240 FANCSQLLSAGNEAAAVALYWSVI--QLDQPLVDLDAATATLEQAVALNPWVGEPQMVLA 297

Query: 362 QIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           Q+YL+ G   +A R A   L     WG+ WDKR+ W+ 
Sbjct: 298 QLYLSAGRNTDAARAADSALQCFCAWGNAWDKRVQWDA 335


>gi|121607853|ref|YP_995660.1| hypothetical protein Veis_0863 [Verminephrobacter eiseniae EF01-2]
 gi|121552493|gb|ABM56642.1| conserved hypothetical protein [Verminephrobacter eiseniae EF01-2]
          Length = 363

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 189/391 (48%), Gaps = 71/391 (18%)

Query: 13  LET-LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
           LET L A A+  L  E  A D +L  ++  + +   G+ WHK G+F  HLV + R L +W
Sbjct: 8   LETDLFARAQWLLDPEWLARDPDLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLSLW 67

Query: 72  KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           + P  V L GL HS Y N++V+L  FDP   R  +R  VG++AE L++LFC   R     
Sbjct: 68  QQPRDVRLLGLLHSVYGNAFVDLVKFDPARERAQMRELVGESAEHLVYLFCTRSRT---- 123

Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
                ++  + L + L+A D SL+   E+G                  DG   +H+ T  
Sbjct: 124 -----QFVQRVLAQALQA-DGSLV--LEQG------------------DG---RHLLTPY 154

Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPGDGKPG 248
           +V V       F+++T+AD  +Q F +QD +   F   + + + A +  ++LWPG  +P 
Sbjct: 155 EVAV-------FIIVTMADAIEQWFSWQDDVYSGFPRLEPQRQQAVHWAASLWPGPMRPT 207

Query: 249 LWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRI 308
                  RM              +Q+    A A+     K+      LP+PPVF +C++ 
Sbjct: 208 ------GRM--------------LQQISALAQALQHPGLKQL-----LPLPPVFAHCSQR 242

Query: 309 LDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKG 368
           L A ++  A  LYW  +    +      + A  +L   +  NP+VGEP +VL Q+YLA G
Sbjct: 243 LSASDEAAATALYWSVIA--QEHPLLDPDVATNMLESAVRHNPWVGEPQMVLAQLYLAAG 300

Query: 369 EFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
             +EA+  A   L L   WG+ WDKR+ W+ 
Sbjct: 301 RKDEAKLAAASALQLFSAWGNSWDKRVQWDA 331


>gi|412340312|ref|YP_006969067.1| hypothetical protein BN112_3019 [Bordetella bronchiseptica 253]
 gi|408770146|emb|CCJ54936.1| conserved hypothetical protein [Bordetella bronchiseptica 253]
          Length = 357

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 182/382 (47%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     A+D +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V L
Sbjct: 11  ARALLADHYAAVDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R ++G+ AE+ +HLFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                         ++  + +G                 A GL ++    G+ + ++ R 
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + + + +  ++LWPG  KP          
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             I  +++R     +Q   T  A  G+            P PP F +CT +LDA ++  A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   I  NP+V EP ++L Q+ L   +++ A  +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326


>gi|293602157|ref|ZP_06684608.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
           43553]
 gi|292819427|gb|EFF78457.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
           43553]
          Length = 357

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 179/382 (46%), Gaps = 68/382 (17%)

Query: 20  ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
           AR  L     AID +LP+L+ +L +  AGE WHK G+F  HL+ +YR L +W  P  V +
Sbjct: 11  ARALLADHYAAIDADLPALLDMLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRM 70

Query: 80  CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
            GLFHS Y N YV+L +FD    R  +R  +G+ AE  + LFC +PR   +         
Sbjct: 71  LGLFHSVYGNEYVDLTLFDRERERSTLREVLGEEAEAWVSLFCAMPRTKFVQ-------- 122

Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
                        +++  +  G                  DG+ ++    G    ++ R 
Sbjct: 123 -------------AVLAGQGTG-----------------HDGMRLEGAD-GTVYALTPRQ 151

Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
           VA F++++ AD  +Q   +QD +F       + +   +  ++LWPG  KP    N LS +
Sbjct: 152 VAAFIVVSAADIGEQWHSWQDEIFAGYPQQQRRDLNTHWAASLWPGPLKPP--SNILSML 209

Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
             + + ++                       +   D  +P+PP F+ C   L A+++  A
Sbjct: 210 SHLLAPLS-----------------------QMPGDTGIPIPPAFDACRATLSARDEAAA 246

Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
             LYW+ +      +  +++ A  LL   +  NP+VGEP ++L Q+ L  G+F+EAE+ A
Sbjct: 247 SALYWQVITRMHPLT--EMDSARHLLEAAVAHNPWVGEPRLLLAQLALTTGDFDEAEQHA 304

Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
             GL  L  WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTAWDKRIEWAG 326


>gi|351730024|ref|ZP_08947715.1| hypothetical protein AradN_09609 [Acidovorax radicis N35]
          Length = 362

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 119/390 (30%), Positives = 182/390 (46%), Gaps = 77/390 (19%)

Query: 16  LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
           L A A+  L  E  A D +L  ++  + +   G+ WHK G+F  HLV + R L +W+ P 
Sbjct: 12  LFARAQTLLDEEWLARDPDLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLTMWQQPR 71

Query: 76  CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
            V L GL HS Y N++V+L  FDP   R  VR  VG++AE L+++FC   R   +     
Sbjct: 72  DVRLLGLLHSVYGNAFVDLVKFDPAQERARVRDLVGESAEHLVYVFCTQSRTQFV----- 126

Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
                Q+++ H   +D SL+                     L  DG+   H+ T  +   
Sbjct: 127 -----QKVLAHALEADGSLV---------------------LQKDGM--DHVLTPYE--- 155

Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFN------NSDGKLEFAGNSFSALWPGDGKPGL 249
               VATF+++++AD  +Q F +QD +F+      +   K  +A    ++LWPG  +P  
Sbjct: 156 ----VATFIIVSMADTIEQWFSWQDDIFSRFPEVQHRHQKAHWA----ASLWPGPMRPS- 206

Query: 250 WMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRIL 309
                 RM    + +A+     +Q                      LPVPPVF +CT+ L
Sbjct: 207 -----GRMVHQINGLAKA----LQHPGLKGL---------------LPVPPVFGHCTQHL 242

Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
            A  +  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+YL+ G 
Sbjct: 243 SAANEAAAASLYWSVI--QQDQPLVDLDVATGVLESAVRHNPWVGEPQMVLAQLYLSAGR 300

Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
            ++A+  A   L L   WG+ WDKR+ W+ 
Sbjct: 301 QDDAKAAAESALHLFSAWGNSWDKRVQWDA 330


>gi|359800019|ref|ZP_09302571.1| hypothetical protein KYC_23713 [Achromobacter arsenitoxydans SY8]
 gi|359362131|gb|EHK63876.1| hypothetical protein KYC_23713 [Achromobacter arsenitoxydans SY8]
          Length = 382

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 167/355 (47%), Gaps = 68/355 (19%)

Query: 47  AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
           AGE WHK G+F  HL+ +YR L +W  P  V L GLFHS Y N YV+L +FD    R+ +
Sbjct: 63  AGEDWHKAGTFKHHLLGVYRTLALWNQPREVRLLGLFHSVYGNEYVDLTLFDRERERNTL 122

Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
           R  +G+ AE+ + LFC +PR   +                      +++  +  G     
Sbjct: 123 REALGEEAEEWVSLFCAMPRTKFVQ---------------------AILQGQGAG----- 156

Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
                       ADGLT++    G+    + R VA F +++ AD  +Q   +QD +F   
Sbjct: 157 ------------ADGLTLEGAD-GQTYTFTPRQVAAFTVVSAADIGEQWHSWQDEIFAGY 203

Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
               + +   N  S+LWPG  KP    N LS +  + + ++R                  
Sbjct: 204 PQQQRRDLKTNWASSLWPGPLKPP--SNILSMLSHLLAPLSR------------------ 243

Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
                  +D  +P+PP F+ C   L  +++  A  LYW+ +      ++  ++ A  +L 
Sbjct: 244 -----MPQDTGIPIPPAFDACRATLSPRDEAAASALYWQVITRMHPLTE--MDSARHMLE 296

Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
             +E NP+VGEP ++L Q+ L  G+F+ AER A  GL  L  WG+ WDKR+ W G
Sbjct: 297 AAVEHNPWVGEPRLLLAQLALTAGDFDAAERHAAAGLAALQAWGTAWDKRIEWAG 351


>gi|121594684|ref|YP_986580.1| hypothetical protein Ajs_2343 [Acidovorax sp. JS42]
 gi|120606764|gb|ABM42504.1| conserved hypothetical protein [Acidovorax sp. JS42]
          Length = 368

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 117/394 (29%), Positives = 184/394 (46%), Gaps = 76/394 (19%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           + L   A+  L  E  A D +L  ++ ++ + G G+ WHK G+F  HLV + R L +W  
Sbjct: 12  DALFQRAQSLLDDEWLAHDADLAPVLPMVLARGVGQDWHKAGTFRHHLVGVARSLALWGQ 71

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P  V L GL HS Y N++V+L  FD  + R  ++  VG++AE L++LFC + R   +   
Sbjct: 72  PRDVRLLGLLHSVYGNAFVDLVKFDAASERGRLQALVGESAEHLVYLFCTMSRAQFVQKV 131

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
           L  E            +D SL+  K                   PA   TV H+   E  
Sbjct: 132 LAGELQ----------ADGSLVVEKNG-----------------PAPRQTV-HLAPYE-- 161

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWPGDGKPGLW 250
                 VA F ++++AD  +Q F +QD +++     D     A +  ++LWPG  +P   
Sbjct: 162 ------VAAFTIVSMADTIEQWFSWQDDIYSRFPAVDPSRPQAVHWAASLWPGPMRP--- 212

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
               +RM                   +  AA+G+ +        +LP+PPVF++C++ L 
Sbjct: 213 ---TARM------------------VSQIAALGLALQHPGLRG-QLPLPPVFDHCSQGLT 250

Query: 311 AKEQIEARDLYWEAV-----CGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYL 365
           A+++  A  LYW  +       + DG+   LE+A  L       NP+VGEP +VL Q+YL
Sbjct: 251 AQDEAAATALYWSVIQLAQPLVDLDGATATLEQAVRL-------NPWVGEPQMVLAQLYL 303

Query: 366 AKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           + G  ++A   A   L     WG+ WDKR+ W+ 
Sbjct: 304 SAGRAQDAACAARGALQAFSAWGNAWDKRVQWDA 337


>gi|241764610|ref|ZP_04762625.1| conserved hypothetical protein [Acidovorax delafieldii 2AN]
 gi|241365931|gb|EER60561.1| conserved hypothetical protein [Acidovorax delafieldii 2AN]
          Length = 361

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/388 (29%), Positives = 177/388 (45%), Gaps = 69/388 (17%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           +TL   A+  L  E    D +L  ++  + +   G+ WHK G+F  HLV + R L +W+ 
Sbjct: 10  QTLFIRAQALLDDEWITHDPDLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLTLWQQ 69

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P  V L GL HS Y NS+V+L  FDP + R  +R  VG  AE L++LFC   R   +   
Sbjct: 70  PRDVRLLGLLHSVYGNSFVDLVKFDPASERARLRELVGAPAEHLVYLFCTQSRTQFV--- 126

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
                  Q ++    A+D SL                      L  DG   +H+ T ++V
Sbjct: 127 -------QRVLGGGLAADGSLA---------------------LDKDG--QRHVLTPDEV 156

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNN-SDGKLEFAGNSFSA-LWPGDGKPGLWM 251
                  A F+++++AD  +Q F +QD +++     +    G  ++A LWPG  +P    
Sbjct: 157 -------AAFIVVSMADTIEQWFSWQDDIYSRFPHVQHRPQGVHWAASLWPGPMRP---- 205

Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
                 G +   IA                +G  +     + + LPVPPVF NCT+ L  
Sbjct: 206 -----TGRMLHQIA---------------GLGQALQHSGLKGL-LPVPPVFANCTQHLSV 244

Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
             +  A  LYW  +    D     L+ A  +L + +  NP+VGEP +VL Q+YL+ G  +
Sbjct: 245 AAEAAATSLYWSVI--QQDQPLVDLDVATAVLEQAVRHNPWVGEPQMVLAQLYLSAGRGD 302

Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           +A   A   L     WG+ WDKR+ W+ 
Sbjct: 303 DARHAAQSALQCFSAWGNSWDKRVQWDA 330


>gi|222110711|ref|YP_002552975.1| hypothetical protein Dtpsy_1514 [Acidovorax ebreus TPSY]
 gi|221730155|gb|ACM32975.1| conserved hypothetical protein [Acidovorax ebreus TPSY]
          Length = 368

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 117/394 (29%), Positives = 183/394 (46%), Gaps = 76/394 (19%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           + L   A+  L  E  A D +L  ++ ++ + G G+ WHK G+F  HLV + R L +W  
Sbjct: 12  DALFQRAQSLLDDEWLAHDADLAPVLPMVLARGVGQDWHKAGTFRHHLVGVARSLALWGQ 71

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P  V L GL HS Y N++V+L  FD  + R  ++  VG++AE L++LFC + R   +   
Sbjct: 72  PRDVRLLGLLHSVYGNAFVDLVKFDAASERGRLQALVGESAEHLVYLFCTMSRAQFVQKV 131

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
           L  E            +D SL+  K                   PA   TV H+   E  
Sbjct: 132 LAGELQ----------ADGSLVVEKNG-----------------PAPRQTV-HLAPYE-- 161

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWPGDGKPGLW 250
                 VA F ++++AD  +Q F +QD +++     D     A +  ++LWPG  +P   
Sbjct: 162 ------VAAFTIVSMADTIEQWFSWQDDIYSRFPAVDPSRPQAVHWAASLWPGPMRP--- 212

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
               +RM                   +  AA+G+ +        +LP+PPVF++C++ L 
Sbjct: 213 ---TARM------------------VSQIAALGLALQHPGLRG-QLPLPPVFDHCSQGLT 250

Query: 311 AKEQIEARDLYWEAV-----CGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYL 365
           A+++  A  LYW  +       + DG+   LE+A  L       NP+VGEP +VL Q+YL
Sbjct: 251 AQDEAAATALYWSVIQLAQPLVDLDGATATLEQAVRL-------NPWVGEPQMVLAQLYL 303

Query: 366 AKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           + G   +A   A   L     WG+ WDKR+ W+ 
Sbjct: 304 SAGRAHDAACAARGALQAFSAWGNAWDKRVQWDA 337


>gi|395008001|ref|ZP_10391692.1| hypothetical protein PMI14_04387 [Acidovorax sp. CF316]
 gi|394313946|gb|EJE50903.1| hypothetical protein PMI14_04387 [Acidovorax sp. CF316]
          Length = 364

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 115/386 (29%), Positives = 177/386 (45%), Gaps = 69/386 (17%)

Query: 16  LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
           L+A A P L  E  A D +L  ++  + +   G+ WHK G+F  HLV + R L +W+ P 
Sbjct: 15  LLARALPLLDEEWLARDADLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLAMWQQPR 74

Query: 76  CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
            V L GL HS Y N++V+L  FDP   R  VR   GD+AE L++LFC   R   +     
Sbjct: 75  DVRLLGLLHSVYGNAFVDLVKFDPAKERARVREIAGDSAEHLVYLFCTQSRTQFV----- 129

Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
                Q ++     +D SL+  K+            K + L P +               
Sbjct: 130 -----QRVLAGAFEADGSLVLEKD-----------GKPHTLTPYE--------------- 158

Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNS 253
               VA F+++++AD  +Q F +QD +++   +  +   A +  ++LWPG  +P      
Sbjct: 159 ----VAAFIIVSMADTIEQWFSWQDDIYSRFPTVQQRPQAVHWAASLWPGPMRPS----- 209

Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
               G + S I                A+G+ +     + + LP+PPVF  CT  L A +
Sbjct: 210 ----GRMVSQI---------------NALGLALQHPGLKGL-LPMPPVFNGCTAELSATD 249

Query: 314 QIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
           +  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+YL+ G   +A
Sbjct: 250 EAAATSLYWSVI--QQDQPLADLDVATAVLESAVRHNPWVGEPQMVLAQLYLSAGRNADA 307

Query: 374 EREAGKGLMLLLEWGSPWDKRMSWEG 399
              A   L     WG+ WDKR+ W+ 
Sbjct: 308 AAAASSALQQFSAWGNSWDKRVQWDA 333


>gi|347821540|ref|ZP_08874974.1| hypothetical protein VeAt4_20777 [Verminephrobacter aporrectodeae
           subsp. tuberculatae At4]
          Length = 362

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 178/386 (46%), Gaps = 69/386 (17%)

Query: 16  LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
           L + A+  L  E  A D +L  ++  + +   G+ WHK G+F  HLV + R L +W+ P 
Sbjct: 12  LFSRAQSLLDEEWLARDPDLAPMLPTVLARNVGQDWHKAGTFRHHLVGVARSLSVWQQPR 71

Query: 76  CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
            V L GL HS Y N++V+L  FDP   R  +RG +G++AE L++LFC   R   +     
Sbjct: 72  DVRLLGLLHSVYGNAFVDLVKFDPAKERARMRGLMGESAEHLVYLFCTQSRTQFV----- 126

Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
                Q ++ H   +D SL+                     L  DG   +H+ T  +V V
Sbjct: 127 -----QRVLAHALEADGSLV---------------------LEKDG--QRHVLTPYEVAV 158

Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSF--SALWPGDGKPGLWMNS 253
                  F+++++AD  +Q   +QD +++      +    +   ++LWPG  +P      
Sbjct: 159 -------FIIVSMADTIEQWSSWQDDVYSRFPDVQQRPQTAHWAASLWPGPMRP------ 205

Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
               G +   I               +A+G  +     +D+ LP+PPVF +C++ L A +
Sbjct: 206 ---TGRMLHQI---------------SALGQALQHPALKDL-LPIPPVFAHCSQNLSASD 246

Query: 314 QIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
           +  A  LYW  V    D     L+ A  +L   +  NP+VGE  + L Q+YL  G  +EA
Sbjct: 247 EAAASALYWAVV--QQDQPLVDLDVATGVLENAVRHNPWVGESQMTLAQLYLTAGRRDEA 304

Query: 374 EREAGKGLMLLLEWGSPWDKRMSWEG 399
              A   L L   WG+ WDKR+ WE 
Sbjct: 305 RAAASSALQLFCVWGNSWDKRVQWEA 330


>gi|418530048|ref|ZP_13095974.1| hypothetical protein CTATCC11996_10158 [Comamonas testosteroni ATCC
           11996]
 gi|371452601|gb|EHN65627.1| hypothetical protein CTATCC11996_10158 [Comamonas testosteroni ATCC
           11996]
          Length = 367

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 115/398 (28%), Positives = 181/398 (45%), Gaps = 76/398 (19%)

Query: 10  NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
           N   E L   A+  L  E  + D +L  L+ V+   G G+ WHK G+F  HL  + R L 
Sbjct: 6   NQLDEALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 65

Query: 70  IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           +W+ P  V L GL HS Y N++V+L  FD    R+ ++  +G+ AE L++LFC + R   
Sbjct: 66  LWQQPRDVRLLGLLHSVYGNAFVDLVKFDAGNEREQLKRLIGEQAEHLVYLFCTMSR--- 122

Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
                       + V+ L A D++                         ADG +++  + 
Sbjct: 123 -----------TQFVQKLLAGDIA-------------------------ADG-SMQLERN 145

Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWP 242
           G    E + ++   VA F ++++AD  +Q F +Q+ +++     D   + A +  ++LWP
Sbjct: 146 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPAVDHSRQQAVHWAASLWP 205

Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDE-DIELPVPPV 301
           G  +P       SRM +  S                    G+G   +      +LP+PPV
Sbjct: 206 GPMRPS------SRMLSQIS--------------------GLGQALQHPALKTQLPLPPV 239

Query: 302 FENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLG 361
           F NC+++L A  +  A  LYW  +    D     L+ A   L + +  NP+VGEP +VL 
Sbjct: 240 FANCSQLLSAGNEAAAVALYWSVI--QLDQPLVDLDAATATLEQAVSLNPWVGEPQMVLA 297

Query: 362 QIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           Q+YL  G   EA   A   L     WG+ WDKR+ W+ 
Sbjct: 298 QLYLTAGRNAEAAAAACGALQCFSSWGNAWDKRVQWDA 335


>gi|333915155|ref|YP_004488887.1| hypothetical protein DelCs14_3540 [Delftia sp. Cs1-4]
 gi|333745355|gb|AEF90532.1| hypothetical protein DelCs14_3540 [Delftia sp. Cs1-4]
          Length = 366

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 178/389 (45%), Gaps = 70/389 (17%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           E L   A+  L  E  A D +L  ++ V+ + G G+ WHK G+F  HL  + R L +W+ 
Sbjct: 14  EALFTRAQDLLDEEWLAKDADLAPVLPVVLARGVGQDWHKAGTFRHHLAGVARSLALWRQ 73

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P  V L GL HS Y N++V+L  FD  + R  +R  VG+ AE L++LFC + R       
Sbjct: 74  PRDVRLLGLLHSVYGNAFVDLVKFDAGSERGRLRELVGEGAEHLVYLFCTMSRT------ 127

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
                   + ++ + A D++     E G    E A                     G   
Sbjct: 128 --------QFMQKILAGDLA-----EDGSMALESA---------------------GRSQ 153

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPGDGKPGLW 250
            +S   VA F ++++AD  +Q F +Q+ +   F     + +   +  ++LWPG  +P   
Sbjct: 154 RLSAYEVAAFTIVSMADAMEQWFSWQEDIYSQFPQVQNQRQQGVHWAASLWPGPMRPA-- 211

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
               SRM +  S + R     +Q     A               +LP+PPVF++C+R L 
Sbjct: 212 ----SRMLSQISDLGRA----LQHPALRA---------------QLPLPPVFDHCSRQLP 248

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
             ++  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+YL+ G  
Sbjct: 249 PGDEAAAVSLYWSVI--QLDQPLVDLDAATAVLESAVRLNPWVGEPQMVLAQLYLSAGRA 306

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           ++A R A   L     WG+ WDKR+ W+ 
Sbjct: 307 DDAARAATSALQCFSGWGNAWDKRVQWDA 335


>gi|160898617|ref|YP_001564199.1| hypothetical protein Daci_3176 [Delftia acidovorans SPH-1]
 gi|160364201|gb|ABX35814.1| conserved hypothetical protein [Delftia acidovorans SPH-1]
          Length = 366

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 177/389 (45%), Gaps = 70/389 (17%)

Query: 14  ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
           E L   A+  L  E  A D +L  ++ V+ + G G+ WHK G+F  HL  + R L +W+ 
Sbjct: 14  EALFTRAQDLLDEEWLAKDADLAPVLPVVLARGVGQDWHKAGTFRHHLAGVARSLALWRQ 73

Query: 74  PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
           P  V L GL HS Y N++V+L  FD  + R  +R  VG+ AE L++LFC + R       
Sbjct: 74  PRDVRLLGLLHSVYGNAFVDLVKFDAGSERGRLRELVGEGAEHLVYLFCTMSRT------ 127

Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
                   + ++ + A D++     E G    E A                     G   
Sbjct: 128 --------QFMQKILAGDLA-----EDGSMALESA---------------------GRSQ 153

Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPGDGKPGLW 250
            +S   VA F ++++AD  +Q F +Q+ +   F     + +   +  ++LWPG  +P   
Sbjct: 154 RLSAYEVAAFTIVSMADAMEQWFSWQEDIYSQFPQVQNQRQQGVHWAASLWPGPMRPA-- 211

Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
               SRM +  S + R     +Q     A               +LP+PPVF++C+R L 
Sbjct: 212 ----SRMLSQISDLGRS----LQHPALRA---------------QLPLPPVFDHCSRQLS 248

Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
             ++  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+YL+ G  
Sbjct: 249 PGDEAAAVSLYWSVI--QLDQPLVDLDAATAVLESAVRLNPWVGEPQMVLAQLYLSAGRA 306

Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           + A R A   L     WG+ WDKR+ W+ 
Sbjct: 307 DGAARAAASALQCFSGWGNAWDKRVQWDA 335


>gi|221066382|ref|ZP_03542487.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
 gi|220711405|gb|EED66773.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
          Length = 367

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 117/397 (29%), Positives = 182/397 (45%), Gaps = 74/397 (18%)

Query: 10  NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
           N   E L   A+  L  E  + D +L  L+ V+   G G+ WHK G+F  HL  + R L 
Sbjct: 6   NQLDEALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 65

Query: 70  IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           +W+ P  V   GL HS Y N++V+L  FD    R+ ++  VG+ AE L++LFC + R   
Sbjct: 66  LWQQPRQVRQLGLLHSVYGNAFVDLVKFDAGNEREQLKSLVGEQAEHLVYLFCTMSRT-- 123

Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
                       + V+ L A +                         L ADG +++  + 
Sbjct: 124 ------------QFVQKLLAGE-------------------------LGADG-SLQIERN 145

Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWP 242
           G    E + ++   VA F ++++AD  +Q F +Q+ +++     D   + A +  ++LWP
Sbjct: 146 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPAVDHSRQQAVHWAASLWP 205

Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVF 302
           G  +P       SRM +  S + R              A+     K R     LP+PPVF
Sbjct: 206 GPMRPS------SRMLSQISGLGR--------------ALQHPALKTR-----LPLPPVF 240

Query: 303 ENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQ 362
            NC+++L A  +  A  LYW  +    D     L+ A   L + +  NP+VGEP +VL Q
Sbjct: 241 ANCSQLLSAGNEAAAVALYWSVI--QLDQPLVDLDAATATLEQAVALNPWVGEPQMVLAQ 298

Query: 363 IYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           +YL+ G   +A R A   L     WG+ WDKR+ W+ 
Sbjct: 299 LYLSAGRSADAARAAESALQCFCSWGNAWDKRVQWDA 335


>gi|264679041|ref|YP_003278948.1| hypothetical protein CtCNB1_2906 [Comamonas testosteroni CNB-2]
 gi|262209554|gb|ACY33652.1| conserved hypothetical protein [Comamonas testosteroni CNB-2]
          Length = 376

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 116/398 (29%), Positives = 180/398 (45%), Gaps = 76/398 (19%)

Query: 10  NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
           N     L   A+  L  E  + D +L  L+ V+   G G+ WHK G+F  HL  + R L 
Sbjct: 15  NQLDAALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 74

Query: 70  IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           +W+ P  V   GL HS Y N++V+L  FD    RD ++  VG+ AE L++LFC + R   
Sbjct: 75  LWQQPREVRQLGLLHSVYGNAFVDLVKFDAGNERDQLKRLVGEQAEHLVYLFCTMSRT-- 132

Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
                       + V+ L A +                         L ADG +++  + 
Sbjct: 133 ------------QFVQKLLAGE-------------------------LGADG-SLQIERN 154

Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWP 242
           G    E + ++   VA F ++++AD  +Q F +Q+ +   F + D   + A +  ++LWP
Sbjct: 155 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPSVDHSRQQAVHWAASLWP 214

Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDE-DIELPVPPV 301
           G  +P       SRM +  S                    G+G   +      +LP+PPV
Sbjct: 215 GPMRPS------SRMLSQIS--------------------GLGQALQHPALKTQLPLPPV 248

Query: 302 FENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLG 361
           F NC+++L A  +  A  LYW  +  N       L+ A   L + +  NP+VGEP +VL 
Sbjct: 249 FANCSQLLSAGNEAAAVALYWSVIQLNQ--PLVDLDAATATLEQAVTLNPWVGEPQMVLA 306

Query: 362 QIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           Q+YL+ G   +A R A   L     WG+ WDKR+ W+ 
Sbjct: 307 QLYLSAGRSADAARAAESALQCFCTWGNAWDKRVQWDA 344


>gi|422323799|ref|ZP_16404838.1| hypothetical protein HMPREF0005_01203 [Achromobacter xylosoxidans
           C54]
 gi|317401197|gb|EFV81843.1| hypothetical protein HMPREF0005_01203 [Achromobacter xylosoxidans
           C54]
          Length = 357

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 164/355 (46%), Gaps = 68/355 (19%)

Query: 47  AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
           AGE WHK G+F  HL+ +YR L +W  P  V L GLFHS Y N YV+L +FD    R+ +
Sbjct: 38  AGEDWHKAGTFKHHLLGVYRTLALWDQPREVRLLGLFHSVYGNEYVDLTLFDRERERNTL 97

Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
           R  +G  AE+ + LFC +PR   +                      +++  + +G     
Sbjct: 98  REALGIEAEEWVSLFCAMPRTRFVQ---------------------AILQGQGQG----- 131

Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
                        +GLT++    G+    + R VA F++++ AD  +Q   +QD +F   
Sbjct: 132 ------------PEGLTLESAD-GQVFTFTPRQVAAFIVVSAADIGEQWHSWQDEIFAGY 178

Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
               + +   +  ++LWPG  KP    N LS +  + + +             SA     
Sbjct: 179 PQQQRRDLKSHWAASLWPGPLKPP--SNILSMLSHLLAPL-------------SALPADT 223

Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
           G+          PVPP F++C   L  +++  A  LYW+ V       + +++ A  +L 
Sbjct: 224 GI----------PVPPAFDHCRATLSPRDEAAASALYWQVVTRMH--PQTEMDTARHMLE 271

Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
             +  NP+V EP ++L Q+ L  G+FE AE  A  GL  L  WG+ WDKR+ W G
Sbjct: 272 AAVAHNPWVAEPRLLLAQLALTAGDFEAAEGHAAAGLAALQAWGTAWDKRIEWAG 326


>gi|423014406|ref|ZP_17005127.1| hypothetical protein AXXA_08139 [Achromobacter xylosoxidans AXX-A]
 gi|338782587|gb|EGP46959.1| hypothetical protein AXXA_08139 [Achromobacter xylosoxidans AXX-A]
          Length = 357

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 164/355 (46%), Gaps = 68/355 (19%)

Query: 47  AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
           AGE WHK G+F  HL+ +YR L +W  P  V L GLFHS Y N YV+L +FD    R+ +
Sbjct: 38  AGEDWHKAGTFKHHLLGVYRTLTLWNQPREVRLLGLFHSVYGNEYVDLTLFDRERERNTL 97

Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
           R  +G  AE+ + LFC +PR   +                      +++  + +G     
Sbjct: 98  REVLGAEAEEWVSLFCAMPRTRFVQ---------------------AILQGQGRG----- 131

Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
                        +GLT++    G+    + R VA FV+++ AD  +Q   +QD +F   
Sbjct: 132 ------------PEGLTLEGAD-GQAFTFTPRQVAAFVVVSAADIGEQWHSWQDEIFAGY 178

Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
               + +   +  ++LWPG  KP    N LS +  + + +             SA     
Sbjct: 179 PQQQRRDLKSHWAASLWPGPLKPP--SNILSMLSHLLAPL-------------SALPADT 223

Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
           G+          PVPP F++C   L  +++  A  LYW+ V       + +++ A  +L 
Sbjct: 224 GI----------PVPPAFDHCRATLSPRDEAAASALYWQVVTRMH--PQTEMDTARHMLE 271

Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
             +  NP+VGEP ++L Q+ L  G+FE A   A  GL  L  WG+ WDKR+ W G
Sbjct: 272 AAVAHNPWVGEPRLLLAQLALTAGDFEAAAGHAAAGLAALQAWGTAWDKRIEWAG 326


>gi|297822077|ref|XP_002878921.1| hypothetical protein ARALYDRAFT_344261 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324760|gb|EFH55180.1| hypothetical protein ARALYDRAFT_344261 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 378

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 60/94 (63%), Positives = 75/94 (79%), Gaps = 1/94 (1%)

Query: 173 INELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEF 232
           IN L+P +G+ VKHIKTGE++ VSRRVV  F+++T+A+FSDQ FG QD  FN    +LEF
Sbjct: 122 INGLVPDNGVVVKHIKTGEEIVVSRRVVGVFLLMTMAEFSDQFFGLQDEFFNYG-WRLEF 180

Query: 233 AGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAR 266
            GN+ +ALWP +GKPGLW+NS SRMGAIYSLI R
Sbjct: 181 RGNNVTALWPRNGKPGLWLNSNSRMGAIYSLIVR 214


>gi|297837231|ref|XP_002886497.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297332338|gb|EFH62756.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1173

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 59/94 (62%), Positives = 74/94 (78%), Gaps = 1/94 (1%)

Query: 173 INELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEF 232
           IN L+P +G+ VKHIKTGE++ VSRRVV  F+++T+A+FS Q  G QD  FN    +LEF
Sbjct: 118 INGLVPDNGVVVKHIKTGEEIVVSRRVVGVFLLMTMAEFSGQFCGLQDEFFNYG-WRLEF 176

Query: 233 AGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAR 266
            GN+ +ALWPG+GKPGLW+NS SRMGAIYSLI R
Sbjct: 177 RGNNVTALWPGNGKPGLWLNSNSRMGAIYSLIVR 210


>gi|444916194|ref|ZP_21236314.1| hypothetical protein D187_08782 [Cystobacter fuscus DSM 2262]
 gi|444712516|gb|ELW53438.1| hypothetical protein D187_08782 [Cystobacter fuscus DSM 2262]
          Length = 568

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 154/364 (42%), Gaps = 103/364 (28%)

Query: 32  DKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSY 91
           D  +P L + L   G+    H  G+F +HL+   RIL+ W+    +C  G+FHSAYS  +
Sbjct: 294 DARVPMLASALAERGSDRFQHSVGTFRDHLLGTGRILRGWRQRSEICRAGMFHSAYSTQF 353

Query: 92  VNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASD 151
               +F P   R+ VR  +G+ AE L+ LFC V R  L                      
Sbjct: 354 FREHLFAPWE-RETVRELIGETAEALVFLFCTVDRFRL---------------------- 390

Query: 152 VSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADF 211
                        AE A R  ++      GL + ++ T +    S RVVA  +++ +A+ 
Sbjct: 391 ------------TAEVAKRHTLD-----GGLILHNLHTEQQQRFSPRVVAALLVVEMANL 433

Query: 212 SDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIF 271
           ++Q+  ++D                          PGLWM+ +S  G +         ++
Sbjct: 434 AEQV-AYRDR------------------------APGLWMSKVSHFGRVI-----RSHLY 463

Query: 272 IQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDG 331
                                     VPPVF++C+ +L  +E+  A   Y +A  G + G
Sbjct: 464 A-------------------------VPPVFDHCSAVLTPEEEKAALATYAQAHTG-APG 497

Query: 332 SKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPW 391
            ++ L +   L       NP+VGEPH+ L ++  A+GE  +AE  A +G  LL  WG+PW
Sbjct: 498 EEKHLRETVAL-------NPWVGEPHIRLVRLCAARGEHLKAEGHARRGAALLTAWGTPW 550

Query: 392 DKRM 395
           DK +
Sbjct: 551 DKSL 554


>gi|389875796|ref|YP_006373531.1| hypothetical protein TMO_a0544 [Tistrella mobilis KA081020-065]
 gi|388530751|gb|AFK55947.1| hypothetical protein TMO_a0544 [Tistrella mobilis KA081020-065]
          Length = 173

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/103 (46%), Positives = 66/103 (64%)

Query: 29  EAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYS 88
           E +D  LP+L+ +L S  AGE WHK G+F  HL+D+YR L +W  P  V + GLFHS YS
Sbjct: 31  EKVDAELPALLDLLFSRSAGEAWHKAGTFKNHLLDVYRTLALWDQPRDVRVLGLFHSVYS 90

Query: 89  NSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
           N +V L +FD +  R  +   VG+ AE+LIH FC++PR   + 
Sbjct: 91  NEFVALKLFDASAKRAELAAAVGEEAERLIHTFCVMPRTRFVQ 133


>gi|323453539|gb|EGB09410.1| hypothetical protein AURANDRAFT_71288 [Aureococcus anophagefferens]
          Length = 2421

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 155/384 (40%), Gaps = 74/384 (19%)

Query: 19   SARPFLRGELEAIDKNLPSLIAVLRSVGAGECW----HKHGSFLEHLVDIYRILKIWKAP 74
            S R F+  E  A+D+ +     VLR +GA   +    H   +F  HL   Y +L  W  P
Sbjct: 2055 SVRAFVYREWSALDEGIDD---VLRRMGAMSAFVEFPHARNTFFNHLKGTYGMLMAWNQP 2111

Query: 75   DCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDL 134
              V   GL H+AYS       IFDP   R  +R  VG  AE LI+ F  V R   +    
Sbjct: 2112 VAVARAGLAHTAYSGDLFEFFIFDPTVDRAEMRALVGGPAEALIYEFGTVARHEFVGLGH 2171

Query: 135  LFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVG 194
            +       L   L             G  +   A R +           V+  KT     
Sbjct: 2172 MMNNRSHRLARPLV------------GGLDDRVATRSR-----------VEGAKT----- 2203

Query: 195  VSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSL 254
            +S R VA  ++++IAD+ DQ      +  ++    L+      + ++PG  +P      +
Sbjct: 2204 LSNREVAELIVVSIADYLDQGVTTNGWRDHHQIDPLD------TRIFPGTVEP-----DV 2252

Query: 255  SRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQ 314
            S   A  +  A    + +                         +PP+F+ CT  +  +++
Sbjct: 2253 SLYWAAAACKAVRGHLAV-------------------------IPPIFDRCTADIAYEDE 2287

Query: 315  IEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAE 374
              ARD+YW  V    D ++   + A  ++   IE  P+VGE HV+L Q++   G   EA 
Sbjct: 2288 AAARDMYWRVV---QDSNRISGDAAITMIAAAIELLPYVGEMHVLLAQLHFRAGRHGEAR 2344

Query: 375  REAGKGLMLLLEWGSPWDKRMSWE 398
              A   L  +  +G  WDKR+++E
Sbjct: 2345 DHAAVALDRMYAFGGTWDKRLTYE 2368


>gi|389875795|ref|YP_006373530.1| hypothetical protein TMO_a0543 [Tistrella mobilis KA081020-065]
 gi|388530750|gb|AFK55946.1| hypothetical protein TMO_a0543 [Tistrella mobilis KA081020-065]
          Length = 187

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 65/112 (58%), Gaps = 2/112 (1%)

Query: 288 KERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCI 347
           ++   D+ LPVPPVF+ CTRIL   +++ A  LYW+ V      S      A  LL   I
Sbjct: 47  RDLPADLGLPVPPVFDRCTRILTQADEMAAVALYWQVVTRAHAISTPV--AAVSLLKTAI 104

Query: 348 EKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
             NP + EPH+VL QI L +G+++ A   A  GL +L  WG+ WDKR++W G
Sbjct: 105 AHNPDIAEPHLVLAQIALTQGDYDTAAAHARIGLDILSAWGTAWDKRIAWSG 156


>gi|255611683|ref|XP_002539332.1| conserved hypothetical protein [Ricinus communis]
 gi|223507070|gb|EEF23050.1| conserved hypothetical protein [Ricinus communis]
          Length = 237

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 104/216 (48%), Gaps = 34/216 (15%)

Query: 191 EDVGVSRRV----VATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPG 243
           E  G S+R+    VA F ++++AD  +Q F +Q+ +   F     + +   +  ++LWPG
Sbjct: 18  ESAGRSQRLSAYEVAAFTIVSMADAMEQWFSWQEDIYSQFPQVQNQRQQGVHWAASLWPG 77

Query: 244 DGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFE 303
             +P       SRM +  S + R     +Q     A               +LP+PPVF+
Sbjct: 78  PMRPA------SRMLSQISDLGRA----LQHPALRA---------------QLPLPPVFD 112

Query: 304 NCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQI 363
           +C+R L   ++  A  LYW  +    D     L+ A  +L   +  NP+VGEP +VL Q+
Sbjct: 113 HCSRQLSPGDEAAAVSLYWSVI--QLDQPLVDLDAATAVLESAVRLNPWVGEPQMVLAQL 170

Query: 364 YLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
           YL+ G  ++A R A   L     WG+ WDKR+ W+ 
Sbjct: 171 YLSAGRADDAARAATSALQCFSGWGNAWDKRVQWDA 206


>gi|147833692|emb|CAN64331.1| hypothetical protein VITISV_014667 [Vitis vinifera]
          Length = 846

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/78 (53%), Positives = 50/78 (64%), Gaps = 18/78 (23%)

Query: 182 LTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALW 241
           + VKHIK GEDV V                    FGFQD LF+NSD  LEF+GN++ ALW
Sbjct: 342 VIVKHIKIGEDVAV------------------LAFGFQDALFDNSDRCLEFSGNNYGALW 383

Query: 242 PGDGKPGLWMNSLSRMGA 259
           PGDGKPGLW+N++SRMG 
Sbjct: 384 PGDGKPGLWLNAISRMGT 401


>gi|361068359|gb|AEW08491.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156112|gb|AFG60290.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156114|gb|AFG60291.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156116|gb|AFG60292.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156118|gb|AFG60293.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156120|gb|AFG60294.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156122|gb|AFG60295.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156124|gb|AFG60296.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156126|gb|AFG60297.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156128|gb|AFG60298.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156132|gb|AFG60300.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156134|gb|AFG60301.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156136|gb|AFG60302.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156138|gb|AFG60303.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156140|gb|AFG60304.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156142|gb|AFG60305.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
          Length = 76

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 2/70 (2%)

Query: 292 EDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNP 351
           E+IEL  PPVF++C  +++  +Q+ ARDLY EA+C  +     K  KAE L ++  E NP
Sbjct: 9   EEIELVTPPVFDHCRAVVNPSDQVTARDLYMEAICMPAGEKNNK--KAEALFVKVCELNP 66

Query: 352 FVGEPHVVLG 361
           F+G+PHVVL 
Sbjct: 67  FIGDPHVVLA 76


>gi|383156108|gb|AFG60288.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156110|gb|AFG60289.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
 gi|383156130|gb|AFG60299.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
          Length = 76

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 2/70 (2%)

Query: 292 EDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNP 351
           E+IEL  PPVF++C  +++  +Q+ ARDLYWEA+   +     K  KAE L ++  E NP
Sbjct: 9   EEIELVTPPVFDHCRAVVNPSDQVTARDLYWEAISMPAGEKNNK--KAEALFVKVCELNP 66

Query: 352 FVGEPHVVLG 361
           F+G+PHVVL 
Sbjct: 67  FIGDPHVVLA 76


>gi|255088788|ref|XP_002506316.1| predicted protein [Micromonas sp. RCC299]
 gi|226521588|gb|ACO67574.1| predicted protein [Micromonas sp. RCC299]
          Length = 629

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 67/134 (50%), Gaps = 10/134 (7%)

Query: 1   MPSQQGNINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHK-HGSFLE 59
           +P    +++ + +E +VAS         E +  NL + +A L   G     H  +G F  
Sbjct: 365 LPFTDPSLHRNPIEAVVASV------AREDLTSNLDAKLAFLNGEGVAGVAHTGNGKFDA 418

Query: 60  HLVDIYRILKIWKAPDCVCLCGLFHSAY-SNSYVNLAIFDPNTGRDVVRGHVGDAAEKLI 118
           HLV +  +L+ W   D V   GLFHS Y +  +   ++  P   RD VR  +GD AE+L+
Sbjct: 419 HLVGVASVLRAWGCDDSVVDAGLFHSIYGTEGFQGFSL--PYERRDDVRALIGDRAERLV 476

Query: 119 HLFCIVPRQPLIHD 132
           H FC+V R  + +D
Sbjct: 477 HTFCVVDRLSVDND 490


>gi|297742727|emb|CBI35361.3| unnamed protein product [Vitis vinifera]
          Length = 85

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/37 (75%), Positives = 33/37 (89%)

Query: 290 RDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVC 326
           RDE++EL +PPV ENCTR+L A+EQIEA DLYWEAVC
Sbjct: 38  RDEELELVIPPVLENCTRVLGAEEQIEATDLYWEAVC 74


>gi|87118690|ref|ZP_01074589.1| hypothetical protein MED121_16724 [Marinomonas sp. MED121]
 gi|86166324|gb|EAQ67590.1| hypothetical protein MED121_16724 [Marinomonas sp. MED121]
          Length = 185

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 9/90 (10%)

Query: 41  VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
            L ++GAGE  H +GS + HL   +++LK W A D VC  GLFH+AY       A FD  
Sbjct: 10  ALEALGAGEFDHLNGSLISHLTGTFKMLKTWDAADYVCDAGLFHAAYGT-----AGFDDK 64

Query: 101 ----TGRDVVRGHVGDAAEKLIHLFCIVPR 126
               + R  +   +G  AE L++L+C   R
Sbjct: 65  MVSLSQRAKIAEVIGKEAEVLVYLYCSCDR 94


>gi|116625450|ref|YP_827606.1| prolyl 4-hydroxylase subunit alpha [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228612|gb|ABJ87321.1| Prolyl 4-hydroxylase, alpha subunit [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 348

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 45/93 (48%), Gaps = 1/93 (1%)

Query: 34  NLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVN 93
           N   L   LR +GA  C HK G+  +HLV  + IL+     D VC  G  HS Y  +   
Sbjct: 191 NFEKLSLFLRKMGATNCGHKEGTLHDHLVRTFAILEKRGYTDEVCFGGGLHSIYGTNIFQ 250

Query: 94  LAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPR 126
            +I  P + R ++    G+ AE+L  LF  + R
Sbjct: 251 QSILTPES-RSLISNEFGNGAEELAFLFSTLER 282


>gi|343497111|ref|ZP_08735191.1| hypothetical protein VINI7043_03738 [Vibrio nigripulchritudo ATCC
           27043]
 gi|342819941|gb|EGU54773.1| hypothetical protein VINI7043_03738 [Vibrio nigripulchritudo ATCC
           27043]
          Length = 177

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 44/89 (49%), Gaps = 9/89 (10%)

Query: 42  LRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNT 101
           L  +GAG+  H +GS  EHL     ILK W A + +   GLFH+AY       A FD N 
Sbjct: 8   LAELGAGDFQHLNGSLTEHLKGTQNILKQWGASEVLQTAGLFHAAYGT-----AGFDENM 62

Query: 102 ----GRDVVRGHVGDAAEKLIHLFCIVPR 126
                R  +   +GD  E L++L+C   R
Sbjct: 63  VSLDQRKYIAEIIGDEEEALVYLYCSCDR 91


>gi|323500047|ref|ZP_08105001.1| hypothetical protein VISI1226_08534 [Vibrio sinaloensis DSM 21326]
 gi|323314903|gb|EGA67960.1| hypothetical protein VISI1226_08534 [Vibrio sinaloensis DSM 21326]
          Length = 177

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 69/153 (45%), Gaps = 20/153 (13%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
            A L+++GAG+  H +GS   HL     +L  W A + + + GLFH+AY       A FD
Sbjct: 4   FAALKALGAGDFQHLNGSLEAHLQGTQALLNRWGADEVLQVAGLFHAAYGT-----AGFD 58

Query: 99  PN----TGRDVVRGHVGDAAEKLIHLFCIVPR----------QPLIHDDLLFEYSDQELV 144
            N      RD +   +G  AE L++L+C   R          +P+   D   E  + EL 
Sbjct: 59  DNMVSLAQRDKIASIIGPQAEALVYLYCSCDRDFVFPQFGLGKPIQFKDRFVE-REFELS 117

Query: 145 EHLKASDVSLINAKEKGLFNAEEAWRKKINELL 177
             L  +   L  A E  L  A + ++++  E L
Sbjct: 118 AELTVAFCELTVANELELVLASDVFKQQYGEEL 150


>gi|440701263|ref|ZP_20883463.1| hypothetical protein STRTUCAR8_04651 [Streptomyces turgidiscabies
           Car8]
 gi|440276098|gb|ELP64416.1| hypothetical protein STRTUCAR8_04651 [Streptomyces turgidiscabies
           Car8]
          Length = 171

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
           +  LR +GA +  H  G+ L HL  +YR+L  W+A   + L GL H+AY       A+  
Sbjct: 1   MGFLRQLGAEQIGHPGGTLLAHLCRVYRLLAAWEARPALRLAGLCHAAYGTDGFPTALLS 60

Query: 99  PNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
            +   ++ R  +G  AE+++H++  + R+
Sbjct: 61  LDRRAELAR-VIGPEAEEIVHVYAAMNRK 88


>gi|418712828|ref|ZP_13273557.1| hypothetical protein LEP1GSC099_4795 [Leptospira interrogans str.
           UI 08452]
 gi|410790597|gb|EKR84289.1| hypothetical protein LEP1GSC099_4795 [Leptospira interrogans str.
           UI 08452]
          Length = 179

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 1/78 (1%)

Query: 52  HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
           H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G
Sbjct: 21  HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79

Query: 112 DAAEKLIHLFCIVPRQPL 129
           + AE L++L+    R+ L
Sbjct: 80  EKAETLVYLYASCDREYL 97


>gi|418666230|ref|ZP_13227661.1| hypothetical protein LEP1GSC019_4608 [Leptospira interrogans
           serovar Pyrogenes str. 2006006960]
 gi|410758177|gb|EKR19776.1| hypothetical protein LEP1GSC019_4608 [Leptospira interrogans
           serovar Pyrogenes str. 2006006960]
          Length = 176

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 41  VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
           +L+ +      H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N
Sbjct: 7   ILKQLSVSNFEHLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTN 66

Query: 101 TGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
             +++V G +G+ AE L++L+    R  L
Sbjct: 67  ARKEIV-GIIGEKAETLVYLYGSCDRTHL 94


>gi|323491275|ref|ZP_08096460.1| hypothetical protein VIBR0546_12617 [Vibrio brasiliensis LMG 20546]
 gi|323314401|gb|EGA67480.1| hypothetical protein VIBR0546_12617 [Vibrio brasiliensis LMG 20546]
          Length = 178

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 1/88 (1%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
             +L+ +GAG+  H +GS   HL     +L  W + + +   GLFH+AY  +  +  +  
Sbjct: 4   FTMLKQLGAGDFQHLNGSLEAHLKGTQAVLNEWNSSELLQTAGLFHAAYGTAGFDENMVS 63

Query: 99  PNTGRDVVRGHVGDAAEKLIHLFCIVPR 126
            N  +++ +  +GDA E L++L+C   R
Sbjct: 64  LNQRQEIAQV-IGDAEEALVYLYCSCDR 90


>gi|260778426|ref|ZP_05887318.1| hypothetical protein VIC_003827 [Vibrio coralliilyticus ATCC
           BAA-450]
 gi|260604590|gb|EEX30885.1| hypothetical protein VIC_003827 [Vibrio coralliilyticus ATCC
           BAA-450]
          Length = 187

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 63/145 (43%), Gaps = 10/145 (6%)

Query: 42  LRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNT 101
           L S+GAG+  H +GS   HLV    +L  W A D V   GLFH+AY  +  + ++     
Sbjct: 8   LESLGAGDFQHLNGSLESHLVGTESLLAKWHASDTVRSAGLFHAAYGTAGFDESMVSLEM 67

Query: 102 GRDVVRGHVGDAAEKLIHLFCIVPR---------QPLIHDDLLFEYSDQELVEHLKASDV 152
            R  +   +G  AE L++L+C   R         +P I     F  S  E+ E       
Sbjct: 68  -RSEIAAVIGREAEDLVYLYCSCDRDYVFPQFDSKPFIQFRDRFTGSVFEMTEEQLHQFC 126

Query: 153 SLINAKEKGLFNAEEAWRKKINELL 177
            L  A E  L  A E ++ K  + L
Sbjct: 127 ELTVANELELVLASEEFKLKYGQEL 151


>gi|418704779|ref|ZP_13265646.1| hypothetical protein LEP1GSC096_2354 [Leptospira interrogans
           serovar Hebdomadis str. R499]
 gi|410765392|gb|EKR36092.1| hypothetical protein LEP1GSC096_2354 [Leptospira interrogans
           serovar Hebdomadis str. R499]
          Length = 179

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 1/93 (1%)

Query: 37  SLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAI 96
           +++  L+ +      H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +
Sbjct: 6   NILKQLKELSVSNFEHLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPV 65

Query: 97  FDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
              N  +++V G +G+ AE L++L+    R  L
Sbjct: 66  VGTNARKEIV-GIIGEKAETLVYLYGSCDRTHL 97


>gi|417759633|ref|ZP_12407667.1| hypothetical protein LEP1GSC027_1397 [Leptospira interrogans str.
           2002000624]
 gi|417776498|ref|ZP_12424335.1| hypothetical protein LEP1GSC025_4235 [Leptospira interrogans str.
           2002000621]
 gi|418671397|ref|ZP_13232749.1| hypothetical protein LEP1GSC026_2270 [Leptospira interrogans str.
           2002000623]
 gi|409944381|gb|EKN89964.1| hypothetical protein LEP1GSC027_1397 [Leptospira interrogans str.
           2002000624]
 gi|410573707|gb|EKQ36752.1| hypothetical protein LEP1GSC025_4235 [Leptospira interrogans str.
           2002000621]
 gi|410581658|gb|EKQ49467.1| hypothetical protein LEP1GSC026_2270 [Leptospira interrogans str.
           2002000623]
          Length = 152

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 52  HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
           H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G
Sbjct: 21  HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79

Query: 112 DAAEKLIHLFCIVPRQPL 129
           + AE L++L+    R  L
Sbjct: 80  EKAETLVYLYGSCDRTHL 97


>gi|333985947|ref|YP_004515157.1| hypothetical protein [Methylomonas methanica MC09]
 gi|333809988|gb|AEG02658.1| hypothetical protein Metme_4309 [Methylomonas methanica MC09]
          Length = 210

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 1/85 (1%)

Query: 42  LRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNT 101
           LR +GAGE  H +G  L HL   Y +L  W A + +C  GL+H+AY  +  +  +   N 
Sbjct: 10  LRELGAGEFEHLNGRLLSHLRGTYDLLHHWGANEVLCKAGLYHAAYGTAGYDEKLISLNE 69

Query: 102 GRDVVRGHVGDAAEKLIHLFCIVPR 126
            R  ++  +G  +E++++++C   R
Sbjct: 70  -RLQIQELIGLESEEVVYVYCACDR 93


>gi|24217151|ref|NP_714634.1| hypothetical protein LB_090 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45655655|ref|YP_003464.1| hypothetical protein LIC20072 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386076110|ref|YP_005990299.1| hypothetical protein LIF_B073 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417766358|ref|ZP_12414310.1| hypothetical protein LEP1GSC007_1983 [Leptospira interrogans
           serovar Bulgarica str. Mallika]
 gi|417786989|ref|ZP_12434674.1| hypothetical protein LEP1GSC077_0204 [Leptospira interrogans str.
           C10069]
 gi|418699087|ref|ZP_13260054.1| hypothetical protein LEP1GSC087_2323 [Leptospira interrogans
           serovar Bataviae str. L1111]
 gi|418710612|ref|ZP_13271382.1| hypothetical protein LEP1GSC097_1767 [Leptospira interrogans
           serovar Grippotyphosa str. UI 08368]
 gi|418725668|ref|ZP_13284286.1| hypothetical protein LEP1GSC104_0409 [Leptospira interrogans str.
           UI 12621]
 gi|418733770|ref|ZP_13290881.1| hypothetical protein LEP1GSC105_0275 [Leptospira interrogans str.
           UI 12758]
 gi|421086775|ref|ZP_15547623.1| hypothetical protein LEP1GSC173_0283 [Leptospira santarosai str.
           HAI1594]
 gi|421103931|ref|ZP_15564527.1| hypothetical protein LEP1GSC117_0124 [Leptospira interrogans
           serovar Icterohaemorrhagiae str. Verdun LP]
 gi|421128388|ref|ZP_15588603.1| hypothetical protein LEP1GSC020_0149 [Leptospira interrogans
           serovar Grippotyphosa str. 2006006986]
 gi|421135717|ref|ZP_15595837.1| hypothetical protein LEP1GSC009_0389 [Leptospira interrogans
           serovar Grippotyphosa str. Andaman]
 gi|24202189|gb|AAN51649.1| hypothetical protein LB_090 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45602626|gb|AAS72101.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353459772|gb|AER04316.1| hypothetical protein LIF_B073 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400351185|gb|EJP03425.1| hypothetical protein LEP1GSC007_1983 [Leptospira interrogans
           serovar Bulgarica str. Mallika]
 gi|409949841|gb|EKO04374.1| hypothetical protein LEP1GSC077_0204 [Leptospira interrogans str.
           C10069]
 gi|409961305|gb|EKO25052.1| hypothetical protein LEP1GSC104_0409 [Leptospira interrogans str.
           UI 12621]
 gi|410020022|gb|EKO86827.1| hypothetical protein LEP1GSC009_0389 [Leptospira interrogans
           serovar Grippotyphosa str. Andaman]
 gi|410366412|gb|EKP21804.1| hypothetical protein LEP1GSC117_0124 [Leptospira interrogans
           serovar Icterohaemorrhagiae str. Verdun LP]
 gi|410430804|gb|EKP75167.1| hypothetical protein LEP1GSC173_0283 [Leptospira santarosai str.
           HAI1594]
 gi|410434113|gb|EKP83254.1| hypothetical protein LEP1GSC020_0149 [Leptospira interrogans
           serovar Grippotyphosa str. 2006006986]
 gi|410761947|gb|EKR28118.1| hypothetical protein LEP1GSC087_2323 [Leptospira interrogans
           serovar Bataviae str. L1111]
 gi|410769201|gb|EKR44444.1| hypothetical protein LEP1GSC097_1767 [Leptospira interrogans
           serovar Grippotyphosa str. UI 08368]
 gi|410772951|gb|EKR52983.1| hypothetical protein LEP1GSC105_0275 [Leptospira interrogans str.
           UI 12758]
 gi|456966682|gb|EMG08216.1| hypothetical protein LEP1GSC151_5661 [Leptospira interrogans
           serovar Grippotyphosa str. LT2186]
 gi|456986895|gb|EMG22351.1| hypothetical protein LEP1GSC150_0599 [Leptospira interrogans
           serovar Copenhageni str. LT2050]
          Length = 179

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 52  HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
           H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G
Sbjct: 21  HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79

Query: 112 DAAEKLIHLFCIVPRQPL 129
           + AE L++L+    R  L
Sbjct: 80  EKAETLVYLYGSCDRTHL 97


>gi|421119635|ref|ZP_15579955.1| hypothetical protein LEP1GSC057_2418 [Leptospira interrogans str.
           Brem 329]
 gi|410347786|gb|EKO98659.1| hypothetical protein LEP1GSC057_2418 [Leptospira interrogans str.
           Brem 329]
          Length = 179

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 52  HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
           H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G
Sbjct: 21  HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79

Query: 112 DAAEKLIHLFCIVPRQPL 129
           + AE L++L+    R  L
Sbjct: 80  EKAETLVYLYGSCDRTHL 97


>gi|456825880|gb|EMF74258.1| hypothetical protein LEP1GSC148_0133 [Leptospira interrogans
           serovar Canicola str. LT1962]
          Length = 179

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 52  HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
           H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G
Sbjct: 21  HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79

Query: 112 DAAEKLIHLFCIVPRQPL 129
           + AE L++L+    R  L
Sbjct: 80  EKAETLVYLYGSCDRTHL 97


>gi|455793441|gb|EMF45139.1| hypothetical protein LEP1GSC067_2065 [Leptospira interrogans
           serovar Lora str. TE 1992]
          Length = 179

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 52  HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
           H +G  L HL+  ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G
Sbjct: 21  HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79

Query: 112 DAAEKLIHLFCIVPRQPL 129
           + AE L++L+    R  L
Sbjct: 80  EKAETLVYLYGSCDRTHL 97


>gi|440736559|ref|ZP_20916154.1| prolyl 4-hydroxylase subunit alpha [Pseudomonas fluorescens
           BRIP34879]
 gi|440382920|gb|ELQ19402.1| prolyl 4-hydroxylase subunit alpha [Pseudomonas fluorescens
           BRIP34879]
          Length = 410

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 6/111 (5%)

Query: 39  IAVLRSVGAGECWHK-HGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSN-SYVNLAI 96
           ++ L   GA +  H    S  +HL+    IL    A D V L G++HS YS  S+ ++ I
Sbjct: 245 LSFLAKTGAADQVHSGRSSLQDHLLGTATILSKTGAGDDVQLAGMYHSIYSTQSFHHVTI 304

Query: 97  FDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHL 147
             P   +D V G +G AAEKL+  F ++PR P +    L E   ++ +EHL
Sbjct: 305 --PIEDKDQVVGMIGPAAEKLVRAFSVLPR-PAVLSQALIE-KGEDWIEHL 351


>gi|398334432|ref|ZP_10519137.1| hypothetical protein LkmesMB_01530 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 182

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 1/94 (1%)

Query: 36  PSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLA 95
           P L+  L+S+ A    H HG  L HL+  + IL+ W +   +C  GL+H+ Y       +
Sbjct: 7   PILLDQLKSLHAQGKEHLHGDLLTHLIGTFYILEEWGSSIDLCNAGLYHAVYGTFAFQDS 66

Query: 96  IFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           + +  T R  V   +G  AE L++L+    R  L
Sbjct: 67  LLE-TTARKEVENLIGSKAETLVYLYGTCDRNHL 99


>gi|418690869|ref|ZP_13251976.1| hypothetical protein LEP1GSC080_1255 [Leptospira interrogans str.
           FPW2026]
 gi|400359905|gb|EJP15886.1| hypothetical protein LEP1GSC080_1255 [Leptospira interrogans str.
           FPW2026]
          Length = 154

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 58  LEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKL 117
           L HL++ ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G+ AE L
Sbjct: 2   LSHLLETFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVIGTNARKEIV-GIIGEKAETL 60

Query: 118 IHLFCIVPRQPL 129
           ++L+    R+ L
Sbjct: 61  VYLYASCDREYL 72


>gi|417951950|ref|ZP_12595024.1| hypothetical protein VISP3789_10999 [Vibrio splendidus ATCC 33789]
 gi|342802999|gb|EGU38379.1| hypothetical protein VISP3789_10999 [Vibrio splendidus ATCC 33789]
          Length = 177

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 41  VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
           +L+ +GAG+  H +GS   HL    ++L  W + + + + GLFH+AY  +  +  +   +
Sbjct: 6   MLQELGAGDFQHLNGSLEAHLKGTEQVLINWGSSELLQIAGLFHAAYGTAGFDENMVSLS 65

Query: 101 TGRDVVRGHVGDAAEKLIHLFCIVPR 126
             +D+ R  +G+  E L++L+C   R
Sbjct: 66  QRQDIAR-IIGNDEEALVYLYCSCDR 90


>gi|421116003|ref|ZP_15576396.1| hypothetical protein LEP1GSC069_1854 [Leptospira interrogans
           serovar Canicola str. Fiocruz LV133]
 gi|410012499|gb|EKO70597.1| hypothetical protein LEP1GSC069_1854 [Leptospira interrogans
           serovar Canicola str. Fiocruz LV133]
          Length = 124

 Score = 49.7 bits (117), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 1/64 (1%)

Query: 58  LEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKL 117
           L HL++ ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G+ AE L
Sbjct: 2   LSHLLETFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVIGTNARKEIV-GIIGEKAETL 60

Query: 118 IHLF 121
           ++L+
Sbjct: 61  VYLY 64


>gi|417772581|ref|ZP_12420469.1| hypothetical protein LEP1GSC014_0232 [Leptospira interrogans
           serovar Pomona str. Pomona]
 gi|418681886|ref|ZP_13243108.1| hypothetical protein LEP1GSC045_3464 [Leptospira interrogans
           serovar Pomona str. Kennewicki LC82-25]
 gi|400326403|gb|EJO78670.1| hypothetical protein LEP1GSC045_3464 [Leptospira interrogans
           serovar Pomona str. Kennewicki LC82-25]
 gi|409945258|gb|EKN95274.1| hypothetical protein LEP1GSC014_0232 [Leptospira interrogans
           serovar Pomona str. Pomona]
 gi|455669757|gb|EMF34815.1| hypothetical protein LEP1GSC201_0260 [Leptospira interrogans
           serovar Pomona str. Fox 32256]
          Length = 154

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 58  LEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKL 117
           L HL++ ++IL+ W +P+ +C  GLFH+ Y        +   N  +++V G +G+ AE L
Sbjct: 2   LSHLLETFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVIGTNARKEIV-GIIGEKAETL 60

Query: 118 IHLFCIVPRQPL 129
           ++L+    R  L
Sbjct: 61  VYLYGSCDRTHL 72


>gi|37680059|ref|NP_934668.1| hypothetical protein VV1875 [Vibrio vulnificus YJ016]
 gi|37198805|dbj|BAC94639.1| hypothetical protein [Vibrio vulnificus YJ016]
          Length = 180

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 9/92 (9%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
             +L+ +GAG+  H +GS   HL    ++L  W + + + + GLFH+AY       A FD
Sbjct: 7   FGMLQELGAGDFQHLNGSLEAHLKGTEKVLINWGSSELLQIAGLFHAAYGT-----AGFD 61

Query: 99  PN----TGRDVVRGHVGDAAEKLIHLFCIVPR 126
            N    + R  +   +G+  E L++L+C   R
Sbjct: 62  ENMVSLSQRQEIARIIGNDEEALVYLYCSCDR 93


>gi|388600182|ref|ZP_10158578.1| hypothetical protein VcamD_09820 [Vibrio campbellii DS40M4]
          Length = 177

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 9/92 (9%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
             +L+ +GAG+  H +GS   HL    ++L  W + + + + GLFH+AY       A FD
Sbjct: 4   FVMLQELGAGDFQHLNGSLEAHLKGTEQVLINWGSSELLQIAGLFHAAYGT-----AGFD 58

Query: 99  PN----TGRDVVRGHVGDAAEKLIHLFCIVPR 126
            N    + R  +   +G+  E L++L+C   R
Sbjct: 59  ENMVSLSQRQEIARIIGNDEEALVYLYCSCDR 90


>gi|428172851|gb|EKX41757.1| hypothetical protein GUITHDRAFT_112173 [Guillardia theta CCMP2712]
          Length = 287

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/164 (21%), Positives = 69/164 (42%), Gaps = 37/164 (22%)

Query: 52  HKHGS-FLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHV 110
           H HG  F +H+  +  IL+ W  P  +C+ GLFHS Y+       +F   T R  ++  +
Sbjct: 56  HTHGRLFPDHIRGVADILQQWNQPHEICMAGLFHSCYATEMFPWKLFS-LTERSRLQETL 114

Query: 111 GDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWR 170
           G   E+++ L+C                                    + G++   ++  
Sbjct: 115 GQEVEEIVFLYC---------------------------------TCSQNGIYRQVKSLM 141

Query: 171 KKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQ 214
           ++  + +PA+GL V +  TG+   +  ++ A  +++  AD  +Q
Sbjct: 142 RRGQD-IPAEGL-VTNYYTGQQCYLPHKLAAKVLVVLGADLMEQ 183


>gi|145341088|ref|XP_001415647.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575870|gb|ABO93939.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 40/87 (45%)

Query: 43  RSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTG 102
           R   A +  H  GSF  HLVD+   L  + AP  V   GL H+AY +     A+      
Sbjct: 73  REARATKLTHVRGSFDAHLVDVRDWLARFGAPRSVARAGLGHAAYGSELFPCAVASFGAH 132

Query: 103 RDVVRGHVGDAAEKLIHLFCIVPRQPL 129
           R + R   G A+E+L+ L+    ++  
Sbjct: 133 RSMFRAVCGRASERLMFLYATCSQRKF 159


>gi|300784829|ref|YP_003765120.1| hydrolase/acyltransferase [Amycolatopsis mediterranei U32]
 gi|399536712|ref|YP_006549374.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
 gi|299794343|gb|ADJ44718.1| predicted hydrolase/acyltransferase [Amycolatopsis mediterranei
           U32]
 gi|398317482|gb|AFO76429.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
          Length = 396

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 3/113 (2%)

Query: 16  LVASARPFLRGE-LEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAP 74
           L+A  RP    E + A+    P+L A+LR +GA +  H  G+ L+HL  +   +      
Sbjct: 204 LIARDRPVPLAETIAAVTAEWPAL-ALLRDLGAEQAGHPGGTLLDHLRRVRDQVADGGGS 262

Query: 75  DCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
               L  L H+AY        +  P+  R  +R  +GDAAE L+H + +  R 
Sbjct: 263 RRARLAALCHAAYGTDGFPHPLLPPDQ-RARLRAAIGDAAESLVHRYGVCDRS 314


>gi|443291914|ref|ZP_21031008.1| Conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
 gi|385885102|emb|CCH19115.1| Conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
          Length = 178

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 1/88 (1%)

Query: 40  AVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDP 99
           A LR  GA E  H  G+   HL  +   L +      V   GL H+AY     +LA+ DP
Sbjct: 9   AWLRRQGAEEIAHPGGNLYAHLCRVSERLALLGCDSDVQAAGLTHAAYGTDGFDLALLDP 68

Query: 100 NTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
              R V+R  +G  AE+L++L+    R+
Sbjct: 69  -ADRAVLRDLIGADAEELVYLYGACDRR 95


>gi|270264694|ref|ZP_06192959.1| hypothetical protein SOD_i01110 [Serratia odorifera 4Rx13]
 gi|270041377|gb|EFA14476.1| hypothetical protein SOD_i01110 [Serratia odorifera 4Rx13]
          Length = 190

 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 41/170 (24%), Positives = 64/170 (37%), Gaps = 43/170 (25%)

Query: 44  SVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNL-AIFD-PNT 101
           ++G  +  H     + HL+  Y IL      + VCL GLFH  Y    ++   I D P T
Sbjct: 17  TLGVHKVAHLDEPLMNHLIRTYEILCRMSCEEQVCLAGLFHGCYGTQGLHTDEIGDIPET 76

Query: 102 GRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKG 161
            R +VR   G   E L++ F ++                                     
Sbjct: 77  QRQIVRNCAGTQVESLVYHFSVM------------------------------------- 99

Query: 162 LFNAEEAWRKKINELLPADGL-TVKHIKTGEDVGVSRRVVATFVMLTIAD 210
              + E+  K    LL ADG+  +K  ++GE + V+R      + L +AD
Sbjct: 100 ---SYESLSKSFRNLLKADGIPALKDRRSGEAISVTREYFEKLLTLKLAD 146


>gi|384148102|ref|YP_005530918.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
 gi|340526256|gb|AEK41461.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
          Length = 373

 Score = 46.6 bits (109), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 3/113 (2%)

Query: 16  LVASARPFLRGE-LEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAP 74
           L+A  RP    E + A+    P+L A+LR +GA +  H  G+ L+HL  +   +      
Sbjct: 181 LIARDRPVPLAETIAAVTAEWPAL-ALLRDLGAEQAGHPGGTLLDHLRRVRDQVADGGGS 239

Query: 75  DCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
               L  L H+AY        +  P+  R  +R  +GDAAE L+H + +  R 
Sbjct: 240 RRARLAALCHAAYGTDGFPHPLLPPDQ-RARLRAAIGDAAESLVHRYGVCDRS 291


>gi|29827165|ref|NP_821799.1| hypothetical protein SAV_624 [Streptomyces avermitilis MA-4680]
 gi|29604263|dbj|BAC68334.1| hypothetical protein SAV_624 [Streptomyces avermitilis MA-4680]
          Length = 188

 Score = 45.1 bits (105), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 1/87 (1%)

Query: 41  VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
           +LR +GA E  H  G+ L HL  +   L +W+A   + L GL H+ Y       A+   +
Sbjct: 14  LLRELGATEIAHPGGTLLAHLQRVQEQLAVWEARTALQLAGLCHAFYGTDGFPAALLSLD 73

Query: 101 TGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
             RD +   +G  AE +++L+    R+
Sbjct: 74  R-RDELAAAIGVEAEGIVYLYASCDRK 99


>gi|410630511|ref|ZP_11341200.1| hypothetical protein GARC_1086 [Glaciecola arctica BSs20135]
 gi|410149953|dbj|GAC18067.1| hypothetical protein GARC_1086 [Glaciecola arctica BSs20135]
          Length = 178

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 5/100 (5%)

Query: 31  IDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNS 90
           +DK    LI     +GAGE  H  GS + HL     +LK W A   +   GL+H+AY  S
Sbjct: 1   MDKKFRLLI----ELGAGEFEHVDGSLIAHLEGTRCLLKEWNASQVLQDAGLYHAAYGTS 56

Query: 91  YVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
                +F+    R+ V   +G   E +++ +C   R    
Sbjct: 57  AFAQNLFEL-AQREEVATVIGSEIENIVYHYCACDRDAFF 95


>gi|303285608|ref|XP_003062094.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456505|gb|EEH53806.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 367

 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 47/93 (50%), Gaps = 2/93 (2%)

Query: 38  LIAVLRSVGAGECWHKHGSFLE-HLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAI 96
           + A+LR+ G+    H  G  L+ HL  +  ILK W+AP  + + GL H+A+ + ++ +  
Sbjct: 93  VAAMLRARGSSSTPHLDGVTLDDHLSHVASILKRWRAPRSIVVAGLAHTAFGSEFLPVRA 152

Query: 97  FDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
             P   R V     G  AE+++ L+    +  L
Sbjct: 153 M-PFASRRVAEETCGAWAERMMFLYGTASQSKL 184


>gi|108707196|gb|ABF94991.1| Histone-like transcription factor and archaeal histone family
           protein, expressed [Oryza sativa Japonica Group]
          Length = 358

 Score = 44.3 bits (103), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 11/63 (17%)

Query: 254 LSRMGAIYSLIAREEEI--FIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
           +SR+ A+Y L   +EEI     ER          V K +D ++EL +PPVF  C ++LDA
Sbjct: 1   MSRLAAVYRLFTTDEEIHELTGER---------SVVKTKDAELELVIPPVFVRCNKVLDA 51

Query: 312 KEQ 314
            EQ
Sbjct: 52  GEQ 54


>gi|224369129|ref|YP_002603293.1| hypothetical protein HRM2_20280 [Desulfobacterium autotrophicum
           HRM2]
 gi|223691846|gb|ACN15129.1| tetratricopeptide (TPR) domain protein [Desulfobacterium
           autotrophicum HRM2]
          Length = 761

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 44/92 (47%), Gaps = 12/92 (13%)

Query: 309 LDAKEQIEARDLYWEAVCGNSDG------------SKEKLEKAEELLLRCIEKNPFVGEP 356
           L  KE  +A+D Y+ A+  N D             +++  EKA    L  +EKNP +  P
Sbjct: 553 LAQKETNKAKDAYYRAIEANPDYLEPYGAIARIALAEQDREKAVIQYLTLLEKNPNLPAP 612

Query: 357 HVVLGQIYLAKGEFEEAEREAGKGLMLLLEWG 388
           H++LG IY     F++AE    K L +  E+ 
Sbjct: 613 HMMLGSIYETDKSFDKAEAHYRKALEINPEYA 644


>gi|326783900|ref|YP_004324294.1| hypothetical protein SSSM7_242 [Synechococcus phage S-SSM7]
 gi|310003912|gb|ADO98307.1| hypothetical protein SSSM7_242 [Synechococcus phage S-SSM7]
          Length = 152

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 5/92 (5%)

Query: 35  LPSLIAVLRSVGAGECWHKH--GSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYV 92
           +   I  L SVGA +  H H   S L H + +  +L  +  P      GLFHS Y N + 
Sbjct: 2   MDDYINYLVSVGADKIPHYHEGDSLLSHCIRVSGMLYSYGRPMYEVRAGLFHSVYGNEF- 60

Query: 93  NLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIV 124
              ++  N  R+ +   +G+ AE +++ FC +
Sbjct: 61  --QMYQINATRNEICDIIGEDAEHIVNTFCTL 90


>gi|428181274|gb|EKX50138.1| hypothetical protein GUITHDRAFT_135325 [Guillardia theta CCMP2712]
          Length = 165

 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 7/102 (6%)

Query: 74  PDCVCLCGLFHSAYSNS-YVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
           P  +C  GLFHS Y  S + + +I     GR VVR  VG+ AE L+ LF  + R   +H 
Sbjct: 10  PQRICAAGLFHSIYGTSIFKHTSIVPDARGRAVVREVVGEEAEHLVWLFHSIDRPGTLHA 69

Query: 133 DLL------FEYSDQELVEHLKASDVSLINAKEKGLFNAEEA 168
            +L      F   ++ L++    + + +   + + L   E A
Sbjct: 70  KILPCANGAFPAGEEVLLDKTDGTTIHVTRRQFEDLLAVEMA 111


>gi|303288031|ref|XP_003063304.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455136|gb|EEH52440.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 430

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 106/288 (36%), Gaps = 55/288 (19%)

Query: 38  LIAVLRSVGAGECWHKHG----SFLEHLVDIYRILKIWKAPDCVCLCGLFHSAY-SNSYV 92
           L+ V R +GA  C         S   H          W     +   GLFHS Y +  + 
Sbjct: 189 LLGVKRVLGASTCHPATSPNTTSPTNHTASFPSTSASWGCSPALQDAGLFHSVYGTEGFQ 248

Query: 93  NLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDV 152
              +  P   RD VR  VG  AE+L+  FC+V R  +          D +L +  K    
Sbjct: 249 GFCL--PYERRDDVRALVGAYAERLVWTFCVVDRLSV----------DNDLDDRTK---- 292

Query: 153 SLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVV-ATFVMLTIADF 211
           S +   E G F+ +            +DG  +  I    DV   RR     F+ LT+AD+
Sbjct: 293 SWLARPELGRFDVDA-----------SDG-ELSDISDDADVKTRRRDFWHDFITLTLADW 340

Query: 212 SDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIF 271
            +Q+           +G        F    PGD   G    + +RM  I +    +  + 
Sbjct: 341 LEQV-----------EGASRTTSEIFEWFEPGDAW-GYRRGAYARMVDIVAARYPKARVL 388

Query: 272 IQERKTSAAAVGVGVDKERDEDIELPV-PPVFENCTRILDAKEQIEAR 318
                    AV    + E+ + +  P+ PP+ E   R  +  E++E R
Sbjct: 389 YD-------AV-YAREGEKSKGVRQPITPPMSEAARRAREGMEKVEPR 428


>gi|284029839|ref|YP_003379770.1| hypothetical protein Kfla_1877 [Kribbella flavida DSM 17836]
 gi|283809132|gb|ADB30971.1| conserved hypothetical protein [Kribbella flavida DSM 17836]
          Length = 178

 Score = 42.7 bits (99), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 38/82 (46%)

Query: 46  GAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDV 105
           GA E  H  G+   HL  + + L    A D + L GL H+AY        +FD    R V
Sbjct: 15  GADELDHAGGTLYVHLHRVAKRLSSLGASDTLVLAGLAHAAYGTDGFPTHLFDWQHERPV 74

Query: 106 VRGHVGDAAEKLIHLFCIVPRQ 127
           +   VG  AE+L++ +    R+
Sbjct: 75  LEAVVGPEAEQLVYRYGACERE 96


>gi|424037490|ref|ZP_17776269.1| hypothetical protein VCHENC02_2542A, partial [Vibrio cholerae
           HENC-02]
 gi|408895483|gb|EKM31860.1| hypothetical protein VCHENC02_2542A, partial [Vibrio cholerae
           HENC-02]
          Length = 82

 Score = 41.6 bits (96), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 41  VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
           +L+ +GAG+  H +GS   HL    ++L  W + + + + GLFH+AY  +  +  +   +
Sbjct: 6   MLQELGAGDFQHLNGSLEAHLKGTEQVLIDWGSSELLQIAGLFHAAYGTAGFDEKMVSLS 65

Query: 101 TGRDVVRGHVGDAAEKLI 118
             +++ R  +G+  E L+
Sbjct: 66  QRQEIAR-IIGNDEEALV 82


>gi|359150105|ref|ZP_09182966.1| hypothetical protein StrS4_25971 [Streptomyces sp. S4]
          Length = 205

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 1/83 (1%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
           +A+L  +GAG   H  G    HL  ++  L  W A   + L GL H+ Y       A+  
Sbjct: 16  LALLGELGAGRLPHPGGDLATHLRRVHSQLATWGARPALQLAGLCHAFYGTDGFPTALL- 74

Query: 99  PNTGRDVVRGHVGDAAEKLIHLF 121
           P   R  +   +G+ AE+L++L+
Sbjct: 75  PLPRRAELAAVIGEEAEELVYLY 97


>gi|302552380|ref|ZP_07304722.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM
           40736]
 gi|302469998|gb|EFL33091.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM
           40736]
          Length = 182

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
           +A+L  +GA +  H  G+ + HL  +   L  W A   + L GL H+ Y       A+  
Sbjct: 12  VALLHELGAADLAHPGGTLIAHLQRVQGQLAAWDARPALQLAGLCHAFYGTDGFPEALL- 70

Query: 99  PNTGRDVVRGHVGDAAEKLIHLF 121
           P   R  +   VG  AE L++L+
Sbjct: 71  PLDRRSELTAVVGAEAEALVYLY 93


>gi|291222353|ref|XP_002731180.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1253

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/85 (24%), Positives = 40/85 (47%)

Query: 297 PVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEP 356
           P  PV + C  +LD   +     +    + G        ++ A+  L  C++ NP   + 
Sbjct: 435 PAGPVLKRCAAVLDPLTKAVPGLMEGLFLMGKVKFLSGDIDAAQSTLQHCLDINPTFSDA 494

Query: 357 HVVLGQIYLAKGEFEEAEREAGKGL 381
           H+++ QIY  +G F++A +   +GL
Sbjct: 495 HLLMAQIYAHQGSFKQASQSLEQGL 519


>gi|357439003|ref|XP_003589778.1| hypothetical protein MTR_1g039170 [Medicago truncatula]
 gi|355478826|gb|AES60029.1| hypothetical protein MTR_1g039170 [Medicago truncatula]
          Length = 71

 Score = 40.4 bits (93), Expect = 1.6,   Method: Composition-based stats.
 Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGE---CWH 52
          N  +   V   + FLR +L +I  NLPS I VLRSVGA E   C+H
Sbjct: 22 NSRISATVLINKLFLRNDLTSIITNLPSRITVLRSVGASESASCYH 67


>gi|302532161|ref|ZP_07284503.1| conserved hypothetical protein [Streptomyces sp. C]
 gi|302441056|gb|EFL12872.1| conserved hypothetical protein [Streptomyces sp. C]
          Length = 202

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 39  IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
           +A+ + VGA    H  G+ L HL+ +  +L  W A   +    L H+ Y    + L + +
Sbjct: 1   MALPKEVGAETTKHSGGTLLTHLLRVEALLASWGARPVLRTAALCHAFYGTDGLPLVLLE 60

Query: 99  PNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
            +  RD +   +G  AE+L++ +    R+
Sbjct: 61  LSE-RDRLAEAIGREAEELVYFYASCDRK 88


>gi|351713927|gb|EHB16846.1| Tetratricopeptide repeat protein 21A [Heterocephalus glaber]
          Length = 1426

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  + RC+E +P   + H++L QIYLA+G F
Sbjct: 609 ELENAQSTIQRCLELDPTSADAHLLLAQIYLAQGNF 644


>gi|380806327|gb|AFE75039.1| tetratricopeptide repeat protein 21A isoform 1, partial [Macaca
           mulatta]
          Length = 114

 Score = 39.7 bits (91), Expect = 2.6,   Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 17  ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 52


>gi|220928054|ref|YP_002504963.1| hypothetical protein Ccel_0602 [Clostridium cellulolyticum H10]
 gi|219998382|gb|ACL74983.1| Tetratricopeptide TPR_2 repeat protein [Clostridium cellulolyticum
           H10]
          Length = 586

 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 31/49 (63%), Gaps = 7/49 (14%)

Query: 327 GNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAER 375
           GN D +KE+ E       RCI+ NP   + +++LG+++++ G++ EA +
Sbjct: 305 GNIDEAKERFE-------RCIKNNPRQADAYILLGKLFMSVGQYSEASK 346


>gi|333372283|ref|ZP_08464215.1| hypothetical protein HMPREF9374_1960 [Desmospora sp. 8437]
 gi|332974490|gb|EGK11412.1| hypothetical protein HMPREF9374_1960 [Desmospora sp. 8437]
          Length = 305

 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 32/50 (64%)

Query: 332 SKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGL 381
            ++K E+ E +LL CI+K P  G  H  L +I+ A+G+ E+A ++  +GL
Sbjct: 88  KEDKPEEGERVLLECIDKYPDRGVAHTYLARIFAARGDREQAVQQLKEGL 137


>gi|410971684|ref|XP_003992295.1| PREDICTED: tetratricopeptide repeat protein 21A [Felis catus]
          Length = 1212

 Score = 39.3 bits (90), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   +PH+++  +YLA+G F
Sbjct: 468 ELENAQSTLQRCLELDPTSVDPHLLMSHVYLAQGNF 503


>gi|444525844|gb|ELV14172.1| Tetratricopeptide repeat protein 21A [Tupaia chinensis]
          Length = 1394

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 493 ELENAQSTLQRCLELDPTFVDAHLLMAQIYLAQGNF 528


>gi|74223484|dbj|BAE21602.1| unnamed protein product [Mus musculus]
          Length = 1233

 Score = 38.9 bits (89), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|408794648|ref|ZP_11206253.1| 4-alpha-glucanotransferase [Leptospira meyeri serovar Hardjo str.
           Went 5]
 gi|408461883|gb|EKJ85613.1| 4-alpha-glucanotransferase [Leptospira meyeri serovar Hardjo str.
           Went 5]
          Length = 574

 Score = 38.9 bits (89), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 71/178 (39%), Gaps = 33/178 (18%)

Query: 51  WHKHGSFLEHLVDIYRI------LKIWKAP--DCVCLCGLFHSAY---SNSYVNLAIFDP 99
           W    S+LEH   +YRI       +IW  P  D   L G FH  +   +  ++   + DP
Sbjct: 268 WKDRLSYLEHFFHLYRIDHVIGMYRIWAIPEADTTALHGWFHPQFGIATEEFIKEGL-DP 326

Query: 100 NTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKE 159
               D+           LIH F  +P   + + D   E   QEL E +KA    L    E
Sbjct: 327 KYFEDL----------GLIHEF--IPGHYIFYWDFWKEPGYQELEEEVKA---KLFPLSE 371

Query: 160 KGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFG 217
             +   E+ WR+   ++L         I   ED+G     V +F+  ++  F  Q+ G
Sbjct: 372 LHIAEEEKHWRQSGEKILEIFESFSAMIPCAEDLG----SVPSFIRDSL--FERQMIG 423


>gi|26325882|dbj|BAC26695.1| unnamed protein product [Mus musculus]
          Length = 1314

 Score = 38.9 bits (89), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|149018252|gb|EDL76893.1| rCG25664 [Rattus norvegicus]
          Length = 797

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|148677249|gb|EDL09196.1| tetratricopeptide repeat domain 21A, isoform CRA_b [Mus musculus]
          Length = 1282

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|225543311|ref|NP_083011.2| tetratricopeptide repeat protein 21A [Mus musculus]
 gi|150416165|sp|Q8C0S4.2|TT21A_MOUSE RecName: Full=Tetratricopeptide repeat protein 21A; Short=TPR
           repeat protein 21A; AltName: Full=Tetratricopeptide
           repeat-containing hedgehog modulator 2
          Length = 1314

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|67626016|gb|AAY78492.1| tetratricopeptide repeat-containing hedgehog modulator 2 [Mus
           musculus]
 gi|126632017|gb|AAI33700.1| Tetratricopeptide repeat domain 21A [Mus musculus]
          Length = 1314

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|12853191|dbj|BAB29674.1| unnamed protein product [Mus musculus]
          Length = 1314

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|392342139|ref|XP_003754513.1| PREDICTED: tetratricopeptide repeat protein 21A-like [Rattus
           norvegicus]
 gi|392350417|ref|XP_003750652.1| PREDICTED: tetratricopeptide repeat protein 21A-like [Rattus
           norvegicus]
          Length = 1314

 Score = 38.9 bits (89), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544


>gi|297463510|ref|XP_612284.5| PREDICTED: tetratricopeptide repeat protein 21A [Bos taurus]
          Length = 1463

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 659 ELEHAQSTLQRCLELDPTSVDTHLLMAQIYLAQGNF 694


>gi|237755427|ref|ZP_04584053.1| putative TPR repeat-containing protein [Sulfurihydrogenibium
           yellowstonense SS-5]
 gi|237692427|gb|EEP61409.1| putative TPR repeat-containing protein [Sulfurihydrogenibium
           yellowstonense SS-5]
          Length = 329

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 32/63 (50%)

Query: 321 YWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKG 380
           YW       D ++E LE A   L   I   P   + +   G++Y+ KGEFEEA+++  K 
Sbjct: 45  YWRGKVATHDLNQESLEVAIVELSEAIRLKPDYADAYFERGKVYIQKGEFEEAKKDLEKA 104

Query: 381 LML 383
           + L
Sbjct: 105 IQL 107


>gi|297488441|ref|XP_002696967.1| PREDICTED: tetratricopeptide repeat protein 21A [Bos taurus]
 gi|296475124|tpg|DAA17239.1| TPA: tetratricopeptide repeat domain 21A-like [Bos taurus]
          Length = 1362

 Score = 38.5 bits (88), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 588 ELEHAQSTLQRCLELDPTSVDTHLLMAQIYLAQGNF 623


>gi|188997368|ref|YP_001931619.1| TPR repeat-containing protein [Sulfurihydrogenibium sp. YO3AOP1]
 gi|188932435|gb|ACD67065.1| TPR repeat-containing protein [Sulfurihydrogenibium sp. YO3AOP1]
          Length = 329

 Score = 38.5 bits (88), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 32/63 (50%)

Query: 321 YWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKG 380
           YW       D ++E LE A   L   I   P   + +   G++Y+ KGEFEEA+++  K 
Sbjct: 45  YWRGKVATHDLNQESLEVAIVELSEAIRLKPDYADAYFERGRVYIQKGEFEEAKKDLEKA 104

Query: 381 LML 383
           + L
Sbjct: 105 IQL 107


>gi|383777412|ref|YP_005461978.1| hypothetical protein AMIS_22420 [Actinoplanes missouriensis 431]
 gi|381370644|dbj|BAL87462.1| hypothetical protein AMIS_22420 [Actinoplanes missouriensis 431]
          Length = 178

 Score = 38.5 bits (88), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 3/83 (3%)

Query: 40  AVLRSVGAGECWHKHGSFLEHLVDIYRIL-KIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
           A L+  G     H  G+  +HL  +   L ++   P+ V   GL H+AY     +LA+  
Sbjct: 9   AWLQERGTDTVRHPGGTLYQHLCRVSEHLSEVGHRPE-VQAAGLTHAAYGTDGFDLALLF 67

Query: 99  PNTGRDVVRGHVGDAAEKLIHLF 121
               RD +RG VGD AE+L++L+
Sbjct: 68  WQQ-RDELRGLVGDEAEELVYLY 89


>gi|395517048|ref|XP_003762694.1| PREDICTED: LOW QUALITY PROTEIN: tetratricopeptide repeat protein
           21A [Sarcophilus harrisii]
          Length = 1218

 Score = 38.5 bits (88), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYL++G F
Sbjct: 519 ELENAQSTLQRCLELDPTSADVHLLMAQIYLSQGNF 554


>gi|431910586|gb|ELK13654.1| Tetratricopeptide repeat protein 21A, partial [Pteropus alecto]
          Length = 1317

 Score = 38.5 bits (88), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 498 ELENAQSTLQRCLELDPTSVDAHLLMSQIYLAQGNF 533


>gi|297286110|ref|XP_002802914.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 4
           [Macaca mulatta]
          Length = 1302

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 498 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 533


>gi|378726907|gb|EHY53366.1| hypothetical protein HMPREF1120_01560 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 1358

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 26/40 (65%)

Query: 334 EKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
           +  EKA EL+L+ I  NP +   H +L +IY AKGE E+A
Sbjct: 375 QDTEKALELILKAIAVNPEIYAAHALLSEIYFAKGEDEKA 414


>gi|194379504|dbj|BAG63718.1| unnamed protein product [Homo sapiens]
          Length = 1302

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 498 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 533


>gi|21748475|emb|CAD31647.1| TRP domain containing protein [Homo sapiens]
          Length = 1320

 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|402860571|ref|XP_003894699.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Papio
           anubis]
          Length = 1419

 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 615 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 650


>gi|355559768|gb|EHH16496.1| hypothetical protein EGK_11783 [Macaca mulatta]
          Length = 1321

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 517 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 552


>gi|297286105|ref|XP_002802912.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 2
           [Macaca mulatta]
          Length = 1344

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 540 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 575


>gi|114586265|ref|XP_516376.2| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Pan
           troglodytes]
          Length = 1320

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|109041688|ref|XP_001083557.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 1
           [Macaca mulatta]
          Length = 1313

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544


>gi|402860577|ref|XP_003894702.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 4 [Papio
           anubis]
          Length = 1443

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 639 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 674


>gi|402860573|ref|XP_003894700.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Papio
           anubis]
          Length = 1242

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 608 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 643


>gi|355746800|gb|EHH51414.1| hypothetical protein EGM_10779 [Macaca fascicularis]
          Length = 1321

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 517 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 552


>gi|297286112|ref|XP_002802915.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 5
           [Macaca mulatta]
          Length = 1272

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503


>gi|297286107|ref|XP_002802913.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 3
           [Macaca mulatta]
          Length = 1320

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|397511536|ref|XP_003826127.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Pan
           paniscus]
          Length = 1320

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|290543551|ref|NP_001098983.2| tetratricopeptide repeat protein 21A isoform 1 [Homo sapiens]
          Length = 1272

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503


>gi|157502177|ref|NP_665698.2| tetratricopeptide repeat protein 21A isoform 2 [Homo sapiens]
 gi|296453007|sp|Q8NDW8.3|TT21A_HUMAN RecName: Full=Tetratricopeptide repeat protein 21A; Short=TPR
           repeat protein 21A; AltName: Full=Stress-inducible
           protein 2
          Length = 1320

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|410036712|ref|XP_003950106.1| PREDICTED: tetratricopeptide repeat protein 21A [Pan troglodytes]
          Length = 1143

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544


>gi|402860575|ref|XP_003894701.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 3 [Papio
           anubis]
          Length = 1371

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 567 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 602


>gi|397511540|ref|XP_003826129.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 3 [Pan
           paniscus]
          Length = 1272

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503


>gi|397511538|ref|XP_003826128.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Pan
           paniscus]
          Length = 1143

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544


>gi|332816435|ref|XP_003309749.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Pan
           troglodytes]
          Length = 1272

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503


>gi|119584968|gb|EAW64564.1| tetratricopeptide repeat domain 21A, isoform CRA_c [Homo sapiens]
          Length = 1320

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|67969406|dbj|BAE01054.1| unnamed protein product [Macaca fascicularis]
          Length = 601

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544


>gi|426340012|ref|XP_004033929.1| PREDICTED: LOW QUALITY PROTEIN: tetratricopeptide repeat protein
           21A [Gorilla gorilla gorilla]
          Length = 1320

 Score = 38.1 bits (87), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|354487661|ref|XP_003505990.1| PREDICTED: tetratricopeptide repeat protein 21A [Cricetulus
           griseus]
          Length = 1370

 Score = 38.1 bits (87), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+  L RC+E +P   + H+++ QIYLA+G F
Sbjct: 564 ELENAQSTLQRCLELDPNFVDAHLLMSQIYLAQGNF 599


>gi|332215575|ref|XP_003256920.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Nomascus
           leucogenys]
          Length = 1271

 Score = 38.1 bits (87), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503


>gi|332215573|ref|XP_003256919.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Nomascus
           leucogenys]
          Length = 1319

 Score = 38.1 bits (87), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551


>gi|297671686|ref|XP_002813963.1| PREDICTED: LOW QUALITY PROTEIN: tetratricopeptide repeat protein
           21A [Pongo abelii]
          Length = 1344

 Score = 38.1 bits (87), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 540 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 575


>gi|194379578|dbj|BAG63755.1| unnamed protein product [Homo sapiens]
          Length = 1272

 Score = 38.1 bits (87), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503


>gi|120538593|gb|AAI29949.1| TTC21A protein [Homo sapiens]
          Length = 1143

 Score = 38.1 bits (87), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544


>gi|118365236|ref|XP_001015839.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89297606|gb|EAR95594.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 430

 Score = 38.1 bits (87), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 28/44 (63%)

Query: 332 SKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAER 375
           S+ KLE+A++ LLR +E  P    P+  L QIY  +G  EEA++
Sbjct: 83  SQGKLEEAKKQLLRSLEIEPKYSHPYFQLSQIYYQQGMLEEAKQ 126


>gi|21752149|dbj|BAC04129.1| unnamed protein product [Homo sapiens]
          Length = 882

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 26/36 (72%)

Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
           +LE A+ +L RC+E +P   + H+++ QIYLA+G F
Sbjct: 78  ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 113


>gi|118381790|ref|XP_001024055.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89305822|gb|EAS03810.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 376

 Score = 38.1 bits (87), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 315 IEARDLYWEAVCGNSDGSKEK--LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
           I+  D Y++A    S    ++  +++A++ L + I+ NP   + H  LG IY  +GE EE
Sbjct: 285 IQINDQYYQAYYNLSSIYYDQGNIQEAKQCLEKAIKINPLYDQAHYNLGLIYYNQGELEE 344

Query: 373 AER 375
           A+R
Sbjct: 345 AKR 347


>gi|357418845|ref|YP_004933713.1| Tetratricopeptide TPR_1 repeat-containing protein [Thermovirga
           lienii DSM 17291]
 gi|355398188|gb|AER67616.1| Tetratricopeptide TPR_1 repeat-containing protein [Thermovirga
           lienii DSM 17291]
          Length = 377

 Score = 38.1 bits (87), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 31/46 (67%), Gaps = 1/46 (2%)

Query: 331 GSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAERE 376
           G+K+  +KAE+   + +E NP   EPH+ LG ++LA+G+ +  +RE
Sbjct: 312 GAKD-FKKAEDAYRKALELNPKFNEPHLGLGLVFLARGDLKGVKRE 356


>gi|167516170|ref|XP_001742426.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779050|gb|EDQ92664.1| predicted protein [Monosiga brevicollis MX1]
          Length = 242

 Score = 38.1 bits (87), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 315 IEAR-DLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
           IE R DL + A   N    K  L+ A+ LL+  IE+NP   E H  LG +Y   G+ ++A
Sbjct: 111 IERRSDLVYNAAVVN--LMKRNLQDAKALLIESIERNPSSAEAHNNLGCLYAELGQADKA 168

Query: 374 EREAGKGLMLL 384
                K L L+
Sbjct: 169 RTHLKKALQLM 179


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.138    0.421 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,628,863,298
Number of Sequences: 23463169
Number of extensions: 287407848
Number of successful extensions: 654610
Number of sequences better than 100.0: 203
Number of HSP's better than 100.0 without gapping: 171
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 654155
Number of HSP's gapped (non-prelim): 324
length of query: 399
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 254
effective length of database: 8,957,035,862
effective search space: 2275087108948
effective search space used: 2275087108948
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)