BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015869
(399 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255544149|ref|XP_002513137.1| conserved hypothetical protein [Ricinus communis]
gi|223548148|gb|EEF49640.1| conserved hypothetical protein [Ricinus communis]
Length = 424
Score = 627 bits (1616), Expect = e-177, Method: Compositional matrix adjust.
Identities = 307/399 (76%), Positives = 350/399 (87%), Gaps = 6/399 (1%)
Query: 1 MPSQQGNINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEH 60
MPS ++YL+TL+AS RPFLRGELE++DKNLP+LI+VLRSVGAGECWHKHGSFL+H
Sbjct: 1 MPSSLHQETSNYLQTLLASTRPFLRGELESVDKNLPALISVLRSVGAGECWHKHGSFLDH 60
Query: 61 LVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHL 120
LVDIYRILKIW A D VCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG AAE+LIHL
Sbjct: 61 LVDIYRILKIWNASDSVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGPAAERLIHL 120
Query: 121 FCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPAD 180
FCIVPRQPLIHDDLLF YSD ELV+HL S++SL NAKEKG+F+AE++WRKKIN LLPA
Sbjct: 121 FCIVPRQPLIHDDLLFNYSDSELVQHLLLSEISLKNAKEKGVFDAEDSWRKKINSLLPAA 180
Query: 181 GLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSAL 240
G+TVK IKTGEDV V+RR+VA FVM+T+ADFSDQLF FQD LF+NSDG+LEF+GN+ ++L
Sbjct: 181 GITVKRIKTGEDVLVTRRIVAVFVMMTMADFSDQLFSFQDLLFDNSDGRLEFSGNNLASL 240
Query: 241 WPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPP 300
WPGDGKPGLW+NS+SRMGAIY+LI REEEIF++ER A G+ VD+ERDEDIEL VPP
Sbjct: 241 WPGDGKPGLWINSISRMGAIYTLIRREEEIFVEER---IRAGGIEVDEERDEDIELVVPP 297
Query: 301 VFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVL 360
VF+ CTRILDA++QIEARDLYWEAVC D SK L+K EELLL CIEKNP+VGEPHVVL
Sbjct: 298 VFDKCTRILDARQQIEARDLYWEAVC---DLSKRGLDKVEELLLSCIEKNPYVGEPHVVL 354
Query: 361 GQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
Q+YL K FEEAEREA KG+ L+LEWGSPWDKR SWEG
Sbjct: 355 SQVYLTKDRFEEAEREAEKGVTLMLEWGSPWDKRTSWEG 393
>gi|224095202|ref|XP_002310360.1| predicted protein [Populus trichocarpa]
gi|222853263|gb|EEE90810.1| predicted protein [Populus trichocarpa]
Length = 430
Score = 619 bits (1597), Expect = e-175, Method: Compositional matrix adjust.
Identities = 308/390 (78%), Positives = 347/390 (88%), Gaps = 6/390 (1%)
Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
+YL+TL+ASARPFLRGELE+IDKNLPSLI+VLRSVGAGECWHKHGSFL+HLV+IYRILK
Sbjct: 16 TNYLQTLLASARPFLRGELESIDKNLPSLISVLRSVGAGECWHKHGSFLDHLVEIYRILK 75
Query: 70 IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
IWKAPD VCLCGLFHSAYSNSYVNLAIFDPNTGRDVVR HVG+AAE+LIHLFCIVPRQ L
Sbjct: 76 IWKAPDSVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRNHVGEAAERLIHLFCIVPRQSL 135
Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
IHDDLLF+YSD ELVEHLKAS++SL NA EKGLFN EE+WRKK+ LLPA G+TVKHIK+
Sbjct: 136 IHDDLLFKYSDIELVEHLKASELSLRNAGEKGLFNGEESWRKKLASLLPASGITVKHIKS 195
Query: 190 GEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGL 249
GEDV V+RR+V F+++T+ADFSDQLFGFQD LF N DG+LEF GN+F ALWPGDGKPGL
Sbjct: 196 GEDVLVTRRMVGVFLLMTMADFSDQLFGFQDLLFENFDGRLEFLGNNFGALWPGDGKPGL 255
Query: 250 WMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRIL 309
W+NS+SRMGAIYSLI REEEIFI+ERK + G VD+ERDEDIEL + PVFENCT++L
Sbjct: 256 WINSISRMGAIYSLIVREEEIFIEERKRAG---GFEVDRERDEDIELVLAPVFENCTQVL 312
Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
DA+EQ+ ARDLYWEAVC D SK LE+AEELL+ IEKNPFVGEPHVVLGQ YL KG
Sbjct: 313 DAREQVVARDLYWEAVC---DTSKGGLERAEELLVSSIEKNPFVGEPHVVLGQFYLTKGR 369
Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
FEEAE+EA +G+ LLLEWGSPWDKRMSWEG
Sbjct: 370 FEEAEKEAERGVTLLLEWGSPWDKRMSWEG 399
>gi|15231007|ref|NP_188633.1| uncharacterized protein [Arabidopsis thaliana]
gi|63003792|gb|AAY25425.1| At3g19990 [Arabidopsis thaliana]
gi|111074436|gb|ABH04591.1| At3g19990 [Arabidopsis thaliana]
gi|332642795|gb|AEE76316.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 288/387 (74%), Positives = 334/387 (86%), Gaps = 4/387 (1%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
+ ++ SARPFLRGELE ID LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKA
Sbjct: 11 DAVLESARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKA 70
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P+ VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVR HVG+AAE LIHLFC+VPRQ LIHD+
Sbjct: 71 PESVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRDHVGEAAESLIHLFCVVPRQTLIHDE 130
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
LLF+YSD ELVEHL S+VSL NAKEKG+F+ +E WRKKIN L+P +G+ VKHIKTGE++
Sbjct: 131 LLFKYSDCELVEHLDCSEVSLRNAKEKGIFDGDEEWRKKINALVPENGVVVKHIKTGEEI 190
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNS 253
VSRRVV F+++T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLWMNS
Sbjct: 191 VVSRRVVGVFLLMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWMNS 250
Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
SRMGAIYSLI REEEI ++ER+ A G V KERDEDI+L VPPVF CT++LDAKE
Sbjct: 251 NSRMGAIYSLIVREEEILMEERRR-AFGSGFEVRKERDEDIDLVVPPVFSFCTKVLDAKE 309
Query: 314 QIEARDLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
QIEAR++YWE V +SD SKE LE+AEE LL CIEKNPFVGEPHV+L Q+YL K F+E
Sbjct: 310 QIEAREMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFKE 367
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
AEREA KGL+LLL+WGSPWDKRMSWEG
Sbjct: 368 AEREAEKGLLLLLQWGSPWDKRMSWEG 394
>gi|11994774|dbj|BAB03164.1| unnamed protein product [Arabidopsis thaliana]
Length = 450
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 288/387 (74%), Positives = 334/387 (86%), Gaps = 4/387 (1%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
+ ++ SARPFLRGELE ID LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKA
Sbjct: 36 DAVLESARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKA 95
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P+ VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVR HVG+AAE LIHLFC+VPRQ LIHD+
Sbjct: 96 PESVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRDHVGEAAESLIHLFCVVPRQTLIHDE 155
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
LLF+YSD ELVEHL S+VSL NAKEKG+F+ +E WRKKIN L+P +G+ VKHIKTGE++
Sbjct: 156 LLFKYSDCELVEHLDCSEVSLRNAKEKGIFDGDEEWRKKINALVPENGVVVKHIKTGEEI 215
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNS 253
VSRRVV F+++T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLWMNS
Sbjct: 216 VVSRRVVGVFLLMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWMNS 275
Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
SRMGAIYSLI REEEI ++ER+ A G V KERDEDI+L VPPVF CT++LDAKE
Sbjct: 276 NSRMGAIYSLIVREEEILMEERRR-AFGSGFEVRKERDEDIDLVVPPVFSFCTKVLDAKE 334
Query: 314 QIEARDLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
QIEAR++YWE V +SD SKE LE+AEE LL CIEKNPFVGEPHV+L Q+YL K F+E
Sbjct: 335 QIEAREMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFKE 392
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
AEREA KGL+LLL+WGSPWDKRMSWEG
Sbjct: 393 AEREAEKGLLLLLQWGSPWDKRMSWEG 419
>gi|110743422|dbj|BAE99597.1| hypothetical protein [Arabidopsis thaliana]
Length = 412
Score = 591 bits (1523), Expect = e-166, Method: Compositional matrix adjust.
Identities = 288/382 (75%), Positives = 331/382 (86%), Gaps = 4/382 (1%)
Query: 19 SARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVC 78
SARPFLRGELE ID LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKAP+ VC
Sbjct: 3 SARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKAPESVC 62
Query: 79 LCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEY 138
LCGLFHSAYSNSYVNLAIFDP+TGRDVVR HVG+AAE LIHLFC+VPRQ LIHD+LLF+Y
Sbjct: 63 LCGLFHSAYSNSYVNLAIFDPSTGRDVVRDHVGEAAESLIHLFCVVPRQTLIHDELLFKY 122
Query: 139 SDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRR 198
SD ELVEHL S+VSL NAKEKG+F+ +E WRKKIN L+P +G+ VKHIKTGE++ VSRR
Sbjct: 123 SDCELVEHLDCSEVSLRNAKEKGIFDGDEEWRKKINALVPENGVVVKHIKTGEEIVVSRR 182
Query: 199 VVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMG 258
VV F+++T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLWMNS SRMG
Sbjct: 183 VVGVFLLMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWMNSNSRMG 242
Query: 259 AIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEAR 318
AIYSLI REEEI ++ER+ A G V KERDEDI+L VPPVF CT++LDAKEQIEAR
Sbjct: 243 AIYSLIVREEEILMEERR-RAFGSGFEVRKERDEDIDLVVPPVFSFCTKVLDAKEQIEAR 301
Query: 319 DLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
++YWE V +SD SKE LE+AEE LL CIEKNPFVGEPHV+L Q+YL K F+EAEREA
Sbjct: 302 EMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFKEAEREA 359
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
KGL+LLL+WGSPWDKRMSWEG
Sbjct: 360 EKGLLLLLQWGSPWDKRMSWEG 381
>gi|297830676|ref|XP_002883220.1| binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297329060|gb|EFH59479.1| binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 425
Score = 588 bits (1517), Expect = e-165, Method: Compositional matrix adjust.
Identities = 288/387 (74%), Positives = 331/387 (85%), Gaps = 4/387 (1%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
+ ++ SARPFLRGELE ID LPSLIAVL+SVGAGECWHKHGSFL+HL+DIY+ILK+WKA
Sbjct: 11 DAVLESARPFLRGELEKIDPKLPSLIAVLKSVGAGECWHKHGSFLDHLIDIYKILKLWKA 70
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P+ VCLCGLFHSAYSNSYVNLAIFDP+TGR+VVR HVG+AAE LIHLFC+VPRQ LIHDD
Sbjct: 71 PESVCLCGLFHSAYSNSYVNLAIFDPSTGREVVRDHVGEAAESLIHLFCVVPRQTLIHDD 130
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
LLF+YSD ELVEHL S+VSL NAKEKG+F+ +E WRKKIN L+P +G VKHIKTGE++
Sbjct: 131 LLFKYSDCELVEHLDCSEVSLRNAKEKGVFDGDEEWRKKINGLVPENGAVVKHIKTGEEI 190
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNS 253
VSRRVV F+ +T+ADFSDQLFGFQD LF N DG+LEF GN+ +ALWPG+GKPGLW+NS
Sbjct: 191 VVSRRVVGVFLFMTMADFSDQLFGFQDELFCNHDGRLEFRGNNVTALWPGNGKPGLWLNS 250
Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
SRMGAIYSLI REEEI ++ER+ A G V KERDEDIEL VPPVF CT++LDAKE
Sbjct: 251 NSRMGAIYSLIVREEEILMEERRR-AFGSGFEVRKERDEDIELVVPPVFSFCTKVLDAKE 309
Query: 314 QIEARDLYWEAVCGNSDGSKE-KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
QIEAR++YWE V +SD SKE LE+AEE LL CIEKNPFVGEPHV+L Q+YL K F E
Sbjct: 310 QIEAREMYWEVV--SSDTSKEGYLERAEERLLGCIEKNPFVGEPHVLLSQVYLGKKRFNE 367
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
AEREA KGL+LLL+WGSPWDKRMSWEG
Sbjct: 368 AEREAEKGLLLLLQWGSPWDKRMSWEG 394
>gi|225463770|ref|XP_002268202.1| PREDICTED: uncharacterized protein LOC100257837 [Vitis vinifera]
Length = 426
Score = 578 bits (1491), Expect = e-162, Method: Compositional matrix adjust.
Identities = 296/388 (76%), Positives = 340/388 (87%), Gaps = 6/388 (1%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L+TL+ASARPFLRGELEA+ LPSLI+VLRSVGAGECWHKHGSFLEHL D+Y+ILK+W
Sbjct: 14 HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLFDMYKILKLW 73
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
KA + VCLCGLFHSAYSNSYVNLAIFDP+TGRD VR HVGDAAE+LI+LFC+VPRQPLIH
Sbjct: 74 KAQESVCLCGLFHSAYSNSYVNLAIFDPSTGRDTVRAHVGDAAEELIYLFCVVPRQPLIH 133
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
DDLLF YSD ELVEHLKAS+VS+ NAKE G+FN +EAWRKK+N L+P G+TVKHIKTGE
Sbjct: 134 DDLLFRYSDDELVEHLKASEVSVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 193
Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
DV VSRRVVA FVM+T+ADFSDQL+GFQD LF+NS+G+LEF+GNSF ALWPGDGKPGLW+
Sbjct: 194 DVLVSRRVVAIFVMMTMADFSDQLYGFQDALFDNSNGQLEFSGNSFGALWPGDGKPGLWL 253
Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
NS+SRMGA+YSL+ REEEIFI+ERK G V K RDE++EL +PPV ENCTR+L A
Sbjct: 254 NSISRMGAVYSLLVREEEIFIEERKRVG---GDQVVKGRDEELELVIPPVLENCTRVLGA 310
Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
+EQIEA DLYWEAVC +GSK L+ AE+LL RC+EKNPFVGEPHVVL QIYL G FE
Sbjct: 311 EEQIEATDLYWEAVC---EGSKRGLDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGRFE 367
Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EAEREA KGL L+LEWG+PWDKRM+WEG
Sbjct: 368 EAEREAEKGLKLMLEWGNPWDKRMTWEG 395
>gi|225463772|ref|XP_002268460.1| PREDICTED: uncharacterized protein LOC100252690 [Vitis vinifera]
Length = 426
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 294/388 (75%), Positives = 340/388 (87%), Gaps = 6/388 (1%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L+TL+ASARPFLRGELEA+ LPSLI+VLRSVGAGECWHKHGSFLEHLVD+Y+ILK+W
Sbjct: 14 HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLVDMYKILKLW 73
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
KA + VCLCGLFHSAYSNSYVNLAIF+P+TGRD VR HVGDAAE+LI+LFC+VPRQPLIH
Sbjct: 74 KAQESVCLCGLFHSAYSNSYVNLAIFEPSTGRDTVRAHVGDAAEELIYLFCVVPRQPLIH 133
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
DDLLF YSD ELVEHLKAS+VS+ NAKE G+FN +EAWRKK+N L+P G+TVKHIKTGE
Sbjct: 134 DDLLFRYSDDELVEHLKASEVSVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 193
Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
DV VSRRVVA FVM+T+ADFSDQ +GFQD LF+NS+G+LEF+GNS+ ALWPGDGKPGLW+
Sbjct: 194 DVLVSRRVVAIFVMMTMADFSDQHYGFQDALFDNSNGRLEFSGNSYGALWPGDGKPGLWL 253
Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
NS+SRMGA+YSL+ REEEIFI+ERK G V K RDE++EL +PPV ENCTR+L A
Sbjct: 254 NSISRMGAVYSLLVREEEIFIEERKRVG---GDQVVKGRDEELELVIPPVLENCTRVLGA 310
Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
+EQIEA DLYWEAVC +GSK L+ AE+LL RC+EKNPFVGEPHVVL QIYL G FE
Sbjct: 311 EEQIEATDLYWEAVC---EGSKRGLDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGRFE 367
Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EAEREA KGL L+LEWG+PWDKRM+WEG
Sbjct: 368 EAEREAEKGLKLMLEWGNPWDKRMTWEG 395
>gi|449464960|ref|XP_004150197.1| PREDICTED: uncharacterized protein LOC101221411 [Cucumis sativus]
Length = 423
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 279/387 (72%), Positives = 326/387 (84%), Gaps = 10/387 (2%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
LE L+ S RPFLRGELE ++ LPSLI+VLRSVGAGECWHKHG+FL+HLVDIYRILKIWK
Sbjct: 16 LEPLLGSIRPFLRGELEKVNPILPSLISVLRSVGAGECWHKHGTFLDHLVDIYRILKIWK 75
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
A + VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVRGHVG+AAE+LIHLFCIVPRQ LIHD
Sbjct: 76 AQEPVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRGHVGEAAERLIHLFCIVPRQSLIHD 135
Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGED 192
DLLF YSD ELV+HLK S+ SL NA E G FN +E WRKK+ +LPA+G+TVKHIKTGED
Sbjct: 136 DLLFHYSDSELVDHLKFSNTSLKNALENGHFNEDETWRKKLQSILPANGVTVKHIKTGED 195
Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
V VSRRVVA F+M+TIADFSDQLFGFQD LF N++G+LEF GN++ +LWPG+GKPGLWMN
Sbjct: 196 VTVSRRVVAIFLMMTIADFSDQLFGFQDSLFENTNGRLEFPGNNYKSLWPGNGKPGLWMN 255
Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
S SRMGAIY+LIAREE I I +RK + +GV ERDED+EL +PP+FENCTR+LDA
Sbjct: 256 SASRMGAIYTLIAREESILIAQRKRNNE---LGVKTERDEDLELVIPPIFENCTRVLDAD 312
Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
+QI ARDLYWE VC EK +++EELLL+C+ KNPF+GE HV LGQ+YL +G FEE
Sbjct: 313 DQITARDLYWEGVC-------EKSDRSEELLLKCVMKNPFIGEAHVALGQLYLGEGRFEE 365
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
AE+EA +G+ L+L WGS WDKRM WEG
Sbjct: 366 AEKEAEEGVRLMLTWGSAWDKRMPWEG 392
>gi|388504458|gb|AFK40295.1| unknown [Medicago truncatula]
Length = 428
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 272/390 (69%), Positives = 325/390 (83%), Gaps = 6/390 (1%)
Query: 11 DYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKI 70
+YLETL+ SA+PFLR EL +ID LPSLI +LRSVGA ECWHKHG+FLEHL+DI+RIL +
Sbjct: 13 EYLETLLTSAKPFLRNELISIDPKLPSLITILRSVGASECWHKHGTFLEHLIDIFRILHL 72
Query: 71 WKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
WK+P V LCGLFHSAYSNSYVNLAIFDP+T R+VVRGHVG AE+LIHLFC+VPRQ LI
Sbjct: 73 WKSPYSVSLCGLFHSAYSNSYVNLAIFDPSTSREVVRGHVGVEAERLIHLFCVVPRQSLI 132
Query: 131 HDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG 190
HDDLLF YSD+EL L+ S++SL NAKEKG+FN +E+WRKK+ L+PADG+ VKHI+TG
Sbjct: 133 HDDLLFHYSDKELCHDLEKSELSLRNAKEKGIFNKDESWRKKLQGLVPADGIKVKHIRTG 192
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
EDV +SRRVVA FVM+T+ADF DQLFGFQD LF N DG+LEF GN+F A+WPG+GKPGLW
Sbjct: 193 EDVKLSRRVVAVFVMMTMADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLW 252
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV-GVDKERDEDIELPVPPVFENCTRIL 309
+NS+SRMGA+Y+LI REEEIF++E+K GV GVD ERDE IEL +PPVF CT++L
Sbjct: 253 LNSISRMGAVYNLILREEEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVL 312
Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
DA++QI ARDLYWEA+ +E LEK EELL++ IEKNPFVGEP+VVL Q+YL KG
Sbjct: 313 DARDQIVARDLYWEAMI-----CEEGLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGR 367
Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
FEE E+EA +GL LLLEWG WDKR+SWEG
Sbjct: 368 FEEGEKEAERGLTLLLEWGCHWDKRISWEG 397
>gi|449526223|ref|XP_004170113.1| PREDICTED: uncharacterized protein LOC101229416 [Cucumis sativus]
Length = 423
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 278/387 (71%), Positives = 326/387 (84%), Gaps = 10/387 (2%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
LE L+ S RPFLRGELE ++ LPSLI+VLRSVGAGECWHKHG+FL+HLVDIYRILKIWK
Sbjct: 16 LEPLLGSIRPFLRGELEKVNPILPSLISVLRSVGAGECWHKHGTFLDHLVDIYRILKIWK 75
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
A + VCLCGLFHSAYSNSYVNLAIFDP+TGRDVVRGHVG+AAE+LIHLFCIVPRQ LIHD
Sbjct: 76 AQEPVCLCGLFHSAYSNSYVNLAIFDPSTGRDVVRGHVGEAAERLIHLFCIVPRQSLIHD 135
Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGED 192
DLLF YSD ELV+HLK S+ SL NA E G FN +E WRKK+ +LPA+G+TVKHIKTGED
Sbjct: 136 DLLFHYSDSELVDHLKFSNTSLKNALENGHFNEDETWRKKLQSILPANGVTVKHIKTGED 195
Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
V VSRRVVA F+M+TIADFSDQLFGFQD LF N++G+LEF GN++ +LWPG+GKPGLWMN
Sbjct: 196 VTVSRRVVAIFLMMTIADFSDQLFGFQDSLFENTNGRLEFPGNNYKSLWPGNGKPGLWMN 255
Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
S SRMGAIY+LIAREE I I +RK + +GV ERDED+EL +PP+FENCT++LDA
Sbjct: 256 SASRMGAIYTLIAREESILIVQRKRNNE---LGVKTERDEDLELVIPPIFENCTKVLDAD 312
Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
+QI ARDLYWE VC EK +++EELLL+C+ KNPF+GE HV LGQ+YL +G FEE
Sbjct: 313 DQITARDLYWEGVC-------EKSDRSEELLLKCVMKNPFIGEAHVALGQLYLGEGRFEE 365
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
AE+EA +G+ L+L WGS WDKRM WEG
Sbjct: 366 AEKEAEEGVRLMLTWGSAWDKRMPWEG 392
>gi|357439007|ref|XP_003589780.1| hypothetical protein MTR_1g039190 [Medicago truncatula]
gi|355478828|gb|AES60031.1| hypothetical protein MTR_1g039190 [Medicago truncatula]
Length = 462
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 272/390 (69%), Positives = 325/390 (83%), Gaps = 6/390 (1%)
Query: 11 DYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKI 70
+YLETL+ SA+PFLR EL +ID LPSLI +LRSVGA ECWHKHG+FLEHL+DI+RIL +
Sbjct: 13 EYLETLLTSAKPFLRNELISIDPKLPSLITILRSVGASECWHKHGTFLEHLIDIFRILHL 72
Query: 71 WKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
WK+P V LCGLFHSAYSNSYVNLAIFDP+T R+VVRGHVG AE+LIHLFC+VPRQ LI
Sbjct: 73 WKSPYSVSLCGLFHSAYSNSYVNLAIFDPSTSREVVRGHVGVEAERLIHLFCVVPRQSLI 132
Query: 131 HDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG 190
HDDLLF YSD+EL L+ S++SL NAKEKG+FN +E+WRKK+ L+PADG+ VKHI+TG
Sbjct: 133 HDDLLFHYSDKELCHDLEKSELSLRNAKEKGIFNKDESWRKKLQGLVPADGIKVKHIRTG 192
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
EDV +SRRVVA FVM+T+ADF DQLFGFQD LF N DG+LEF GN+F A+WPG+GKPGLW
Sbjct: 193 EDVKLSRRVVAVFVMMTMADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLW 252
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV-GVDKERDEDIELPVPPVFENCTRIL 309
+NS+SRMGA+Y+LI REEEIF++E+K GV GVD ERDE IEL +PPVF CT++L
Sbjct: 253 LNSISRMGAVYNLILREEEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVL 312
Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
DA++QI ARDLYWEA+ +E LEK EELL++ IEKNPFVGEP+VVL Q+YL KG
Sbjct: 313 DARDQIVARDLYWEAMI-----CEEGLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGR 367
Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
FEE E+EA +GL LLLEWG WDKR+SWEG
Sbjct: 368 FEEGEKEAERGLTLLLEWGCHWDKRISWEG 397
>gi|358248656|ref|NP_001239662.1| uncharacterized protein LOC100806152 [Glycine max]
gi|255646517|gb|ACU23736.1| unknown [Glycine max]
Length = 429
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 278/392 (70%), Positives = 326/392 (83%), Gaps = 13/392 (3%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
+E L+ S RPFLRG+LE+IDKNLP L+ VL+SVGAGECWHKHGSFL HLVDI+RILK+WK
Sbjct: 15 IEKLLESGRPFLRGDLESIDKNLPRLVGVLQSVGAGECWHKHGSFLHHLVDIFRILKLWK 74
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
AP VCLCGLFHSAYSNSYVNLAIFDP+TGR+VVR VG+ AE LIHLFCIVPRQPLIHD
Sbjct: 75 APHSVCLCGLFHSAYSNSYVNLAIFDPSTGREVVRALVGEEAESLIHLFCIVPRQPLIHD 134
Query: 133 DLLFEYSDQELVEHLKASDVSLINAK--EKGLFNAE---EAWRKKINELLPADGLTVKHI 187
DLLF YSD+ELV+HL S++SL NAK EKGLF+ + E WRKK+ L+PA+G+ VKHI
Sbjct: 135 DLLFHYSDEELVQHLAQSEISLRNAKKMEKGLFDDDGELEGWRKKLQGLVPAEGVQVKHI 194
Query: 188 KTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKP 247
+TGE V VSRR+VA F+M+T+ADF DQLFGFQD LF+N++G+LEF+GN+F ALWPGDGKP
Sbjct: 195 RTGEGVHVSRRIVAVFIMMTMADFCDQLFGFQDLLFDNANGRLEFSGNNFGALWPGDGKP 254
Query: 248 GLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTR 307
GLW+NS+SRMGA+Y+LIAREEEIFIQERK V V D ER+EDIEL +PPVF+ C +
Sbjct: 255 GLWLNSISRMGAVYTLIAREEEIFIQERKRK-VGVAVVPDLERNEDIELVLPPVFDYCRK 313
Query: 308 ILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAK 367
+L+A +QI ARDLYWEAVC E KAEELLL IEKNPFVGEP+VVL Q+YL +
Sbjct: 314 VLEAGDQIVARDLYWEAVC-------EGGSKAEELLLESIEKNPFVGEPYVVLSQVYLTE 366
Query: 368 GEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
G FEEAE+ A +GL LLLEWG PWDKR SWEG
Sbjct: 367 GRFEEAEKHAERGLKLLLEWGCPWDKRTSWEG 398
>gi|242063594|ref|XP_002453086.1| hypothetical protein SORBIDRAFT_04g038110 [Sorghum bicolor]
gi|241932917|gb|EES06062.1| hypothetical protein SORBIDRAFT_04g038110 [Sorghum bicolor]
Length = 421
Score = 499 bits (1286), Expect = e-139, Method: Compositional matrix adjust.
Identities = 233/387 (60%), Positives = 301/387 (77%), Gaps = 3/387 (0%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
L +A+ARPFLRGE +D LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 7 LADALAAARPFLRGEEAEVDPALPGLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLWG 66
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
APD V CGL+HSAYSNSYVNLAIF P+TGR V VG AE+L+HLFC+VPRQ L+HD
Sbjct: 67 APDAVARCGLYHSAYSNSYVNLAIFQPDTGRAHVAAVVGADAERLVHLFCVVPRQKLVHD 126
Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGED 192
DLLF YSDQ+LV+ L +S+ SL +A+ +G+F+ + WR+KI L+P DG+TVKHI+TGED
Sbjct: 127 DLLFHYSDQDLVDDLASSEASLRDAR-RGVFDDAQPWRRKIQRLIPPDGITVKHIRTGED 185
Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
V +SRR+ ATF+++TIADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW+
Sbjct: 186 VALSRRIAATFLLMTIADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLWVT 245
Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
S SRMGA+Y+LI REEEI+I +R + G RDEDI+L +PPVF+ CT++L+A
Sbjct: 246 SSSRMGALYTLIIREEEIYITQRAHGISRQGSNGSAGRDEDIDLVIPPVFKGCTKVLNAD 305
Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
+Q ARDLYWEAVC + S K EELL + I KNPFVGEPH+VL Q+YL + +
Sbjct: 306 DQKVARDLYWEAVCSGDEAS--DWLKVEELLQQSIAKNPFVGEPHLVLAQVYLNMERYGD 363
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSWEG 399
A+ +A +GL LLLEWGS WDKRM+WEG
Sbjct: 364 AQMQAEEGLKLLLEWGSSWDKRMTWEG 390
>gi|222623966|gb|EEE58098.1| hypothetical protein OsJ_08971 [Oryza sativa Japonica Group]
Length = 474
Score = 496 bits (1277), Expect = e-138, Method: Compositional matrix adjust.
Identities = 238/389 (61%), Positives = 300/389 (77%), Gaps = 7/389 (1%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L + +ARPFLRGE E +D LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 61 HLAGALDAARPFLRGEEEQVDPALPKLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLW 120
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
APD V CGL+HSAYSNSYVNLAIF+P+ GRD VR VG AAE+L+HLFC+VPR LIH
Sbjct: 121 GAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRDHVRPIVGAAAERLVHLFCVVPRHQLIH 180
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE-EAWRKKINELLPADGLTVKHIKTG 190
DDLLF Y+DQ+LV L AS+ SL +A+ +GLF+ + EAWR KI LLP G+TVKHI+TG
Sbjct: 181 DDLLFHYADQDLVADLAASEASLQDAR-RGLFHHDGEAWRLKIQRLLPPHGITVKHIRTG 239
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
EDV +SRR+ ATF+++T+ADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW
Sbjct: 240 EDVALSRRIAATFLLMTMADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLW 299
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
S+SRMGA+YSLI REEEI I +RK S G + +RDEDIEL +PPVF CT++L
Sbjct: 300 TASISRMGALYSLIVREEEIHIAQRKHS----NNGQEDDRDEDIELVIPPVFNGCTQVLT 355
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
A +Q ARDLYW+AVC + + K EE+L RCI +NPFVGEPH+VL Q+ L +
Sbjct: 356 ADDQKAARDLYWDAVCSGGEDETD-WRKVEEILRRCIGRNPFVGEPHLVLAQVLLNMEMY 414
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EEAE + G+ LLLEWGS WDKRM WE
Sbjct: 415 EEAEEQIEAGVKLLLEWGSSWDKRMPWEA 443
>gi|48716457|dbj|BAD23064.1| unknown protein [Oryza sativa Japonica Group]
gi|215700944|dbj|BAG92368.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 419
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/389 (61%), Positives = 300/389 (77%), Gaps = 7/389 (1%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L + +ARPFLRGE E +D LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 6 HLAGALDAARPFLRGEEEQVDPALPKLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLW 65
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
APD V CGL+HSAYSNSYVNLAIF+P+ GRD VR VG AAE+L+HLFC+VPR LIH
Sbjct: 66 GAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRDHVRPIVGAAAERLVHLFCVVPRHQLIH 125
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE-EAWRKKINELLPADGLTVKHIKTG 190
DDLLF Y+DQ+LV L AS+ SL +A+ +GLF+ + EAWR KI LLP G+TVKHI+TG
Sbjct: 126 DDLLFHYADQDLVADLAASEASLQDAR-RGLFHHDGEAWRLKIQRLLPPHGITVKHIRTG 184
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
EDV +SRR+ ATF+++T+ADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW
Sbjct: 185 EDVALSRRIAATFLLMTMADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLW 244
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
S+SRMGA+YSLI REEEI I +RK S G + +RDEDIEL +PPVF CT++L
Sbjct: 245 TASISRMGALYSLIVREEEIHIAQRKHS----NNGQEDDRDEDIELVIPPVFNGCTQVLT 300
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
A +Q ARDLYW+AVC + + K EE+L RCI +NPFVGEPH+VL Q+ L +
Sbjct: 301 ADDQKAARDLYWDAVCSGGEDETD-WRKVEEILRRCIGRNPFVGEPHLVLAQVLLNMEMY 359
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EEAE + G+ LLLEWGS WDKRM WE
Sbjct: 360 EEAEEQIEAGVKLLLEWGSSWDKRMPWEA 388
>gi|125541709|gb|EAY88104.1| hypothetical protein OsI_09539 [Oryza sativa Indica Group]
Length = 419
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/389 (61%), Positives = 300/389 (77%), Gaps = 7/389 (1%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L + +ARPFLRGE E +D LP L AVLR+ GAGECWHKHG+FL HL+D++RIL++W
Sbjct: 6 HLAGALDAARPFLRGEEEQVDPALPKLAAVLRAAGAGECWHKHGTFLAHLLDVHRILRLW 65
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
APD V CGL+HSAYSNSYVNLAIF+P+ GRD VR VG AAE+L+HLFC+VPR LIH
Sbjct: 66 GAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRDHVRPIVGAAAERLVHLFCVVPRHQLIH 125
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE-EAWRKKINELLPADGLTVKHIKTG 190
DDLLF Y+DQ+LV L AS+ SL +A+ +GLF+ + EAWR KI LLP G+TVKHI+TG
Sbjct: 126 DDLLFHYADQDLVADLAASEASLQDAR-RGLFHHDGEAWRLKIQRLLPPHGITVKHIRTG 184
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
EDV +SRR+ ATF+++T+ADFSDQLF +QD LF+N++G+LEF+GN++++LWPG GKPGLW
Sbjct: 185 EDVALSRRIAATFLLMTMADFSDQLFDWQDRLFDNTNGRLEFSGNTWTSLWPGTGKPGLW 244
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
S+SRMGA+YSLI REEEI I +RK S G + +RDEDIEL +PPVF CT++L
Sbjct: 245 TASISRMGALYSLIVREEEIHIAQRKHS----NNGQEDDRDEDIELVIPPVFNGCTQVLT 300
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
A +Q ARDLYW+AVC + + K EE+L RCI +NPFVGEPH+VL Q+ L +
Sbjct: 301 ADDQKAARDLYWDAVCSGGEDETD-WRKVEEILRRCIGRNPFVGEPHLVLAQVLLNMEMY 359
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EEAE + G+ LLLEWGS WDKRM WE
Sbjct: 360 EEAEEQIEAGVKLLLEWGSSWDKRMPWEA 388
>gi|147832395|emb|CAN77784.1| hypothetical protein VITISV_014471 [Vitis vinifera]
Length = 402
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 259/388 (66%), Positives = 297/388 (76%), Gaps = 30/388 (7%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L+TL+ASARPFLRGELEA+ LPS+I+VLRSVGAGECWHKHGSFLEHLVD+Y+ILK+W
Sbjct: 14 HLQTLLASARPFLRGELEAVHPKLPSIISVLRSVGAGECWHKHGSFLEHLVDMYKILKLW 73
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
KA + VCLCGLFHSAYSNSYVNLAIFDP+TGRD VR HVGDAAEKLIHLFC+VPRQPL+H
Sbjct: 74 KAQESVCLCGLFHSAYSNSYVNLAIFDPSTGRDTVRAHVGDAAEKLIHLFCVVPRQPLVH 133
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
DDL+F SD ELVEHLKAS+VS+ NAKE G+ N +EAWRKK+N L+P DG+TVKHIKTGE
Sbjct: 134 DDLIFRCSDDELVEHLKASEVSVRNAKEGGVVNGDEAWRKKVNSLVPPDGVTVKHIKTGE 193
Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
DV VSRRVVA FVM+T+AD SD+ +GFQD L F + GL M
Sbjct: 194 DVLVSRRVVAIFVMMTMADSSDKHYGFQD--------ALSFLATAMELC-------GLGM 238
Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
SL +S K VG+ K RDE++EL +PPV ENCTR+L A
Sbjct: 239 ASLGCGRRRFS------------SKRGKELVGIKWWKGRDEELELVIPPVLENCTRVLGA 286
Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
+EQIEA DLYWEAVC +GSK L+ AEELL RC+EKNPFVGEPHVVL QIYL G FE
Sbjct: 287 EEQIEATDLYWEAVC---EGSKRGLDWAEELLGRCVEKNPFVGEPHVVLAQIYLTNGRFE 343
Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EAEREA KGL L+LEWGSPWDKRM+WEG
Sbjct: 344 EAEREAEKGLKLMLEWGSPWDKRMTWEG 371
>gi|224031197|gb|ACN34674.1| unknown [Zea mays]
gi|413939578|gb|AFW74129.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
Length = 426
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/378 (60%), Positives = 288/378 (76%), Gaps = 12/378 (3%)
Query: 22 PFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCG 81
PFLRGE +D LP L AVLR GAGECWHKHG+FL HL+D++RIL++W APD V CG
Sbjct: 30 PFLRGEEAEVDPALPGLAAVLRGAGAGECWHKHGTFLAHLLDVHRILRLWGAPDAVARCG 89
Query: 82 LFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQ 141
L+HSAYSNSYVNLAIF P+TGR V VG AE+L+HLFC+VPRQ L+HDDLLF YSDQ
Sbjct: 90 LYHSAYSNSYVNLAIFQPDTGRAHVAAVVGGPAERLVHLFCVVPRQQLVHDDLLFHYSDQ 149
Query: 142 ELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVA 201
+L + L +S+ S+ +A+ +G+F+ + WR+KI LLPADG+TVKHI+TGEDV +SRR+ A
Sbjct: 150 DLADDLASSEESVRDAR-RGVFDDAQPWRRKIQRLLPADGITVKHIRTGEDVSLSRRIAA 208
Query: 202 TFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIY 261
TF+++TIADFSDQLF +QD LF+NS+G+LEF+GN++++LWPG GKPGLW+ S SRM A+Y
Sbjct: 209 TFLLMTIADFSDQLFDWQDRLFDNSNGRLEFSGNTWTSLWPGTGKPGLWVTSSSRMAALY 268
Query: 262 SLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLY 321
+LI REEEI I +R RDED++L +PPVF CTR+L+A++Q ARDLY
Sbjct: 269 TLIVREEEIHITQRARDTHG--------RDEDMDLVIPPVFNGCTRLLNAEDQKVARDLY 320
Query: 322 WEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGL 381
WEAVCG K E+LL + I KNPFVGEPH+VL Q+YL + EA+ +A +GL
Sbjct: 321 WEAVCGE---EASDWLKVEQLLQQSIAKNPFVGEPHLVLAQVYLNMERYGEAQMQAEEGL 377
Query: 382 MLLLEWGSPWDKRMSWEG 399
LLLEWGS WDKRM WEG
Sbjct: 378 RLLLEWGSSWDKRMPWEG 395
>gi|413939577|gb|AFW74128.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
Length = 431
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 228/383 (59%), Positives = 288/383 (75%), Gaps = 17/383 (4%)
Query: 22 PFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCG 81
PFLRGE +D LP L AVLR GAGECWHKHG+FL HL+D++RIL++W APD V CG
Sbjct: 30 PFLRGEEAEVDPALPGLAAVLRGAGAGECWHKHGTFLAHLLDVHRILRLWGAPDAVARCG 89
Query: 82 LFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQ 141
L+HSAYSNSYVNLAIF P+TGR V VG AE+L+HLFC+VPRQ L+HDDLLF YSDQ
Sbjct: 90 LYHSAYSNSYVNLAIFQPDTGRAHVAAVVGGPAERLVHLFCVVPRQQLVHDDLLFHYSDQ 149
Query: 142 ELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHI-----KTGEDVGVS 196
+L + L +S+ S+ +A+ +G+F+ + WR+KI LLPADG+TVKHI +TGEDV +S
Sbjct: 150 DLADDLASSEESVRDAR-RGVFDDAQPWRRKIQRLLPADGITVKHISELHCRTGEDVSLS 208
Query: 197 RRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSR 256
RR+ ATF+++TIADFSDQLF +QD LF+NS+G+LEF+GN++++LWPG GKPGLW+ S SR
Sbjct: 209 RRIAATFLLMTIADFSDQLFDWQDRLFDNSNGRLEFSGNTWTSLWPGTGKPGLWVTSSSR 268
Query: 257 MGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIE 316
M A+Y+LI REEEI I +R RDED++L +PPVF CTR+L+A++Q
Sbjct: 269 MAALYTLIVREEEIHITQRARDTHG--------RDEDMDLVIPPVFNGCTRLLNAEDQKV 320
Query: 317 ARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAERE 376
ARDLYWEAVCG K E+LL + I KNPFVGEPH+VL Q+YL + EA+ +
Sbjct: 321 ARDLYWEAVCGE---EASDWLKVEQLLQQSIAKNPFVGEPHLVLAQVYLNMERYGEAQMQ 377
Query: 377 AGKGLMLLLEWGSPWDKRMSWEG 399
A +GL LLLEWGS WDKRM WEG
Sbjct: 378 AEEGLRLLLEWGSSWDKRMPWEG 400
>gi|357138263|ref|XP_003570716.1| PREDICTED: uncharacterized protein LOC100827583 isoform 2
[Brachypodium distachyon]
Length = 423
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 233/395 (58%), Positives = 295/395 (74%), Gaps = 5/395 (1%)
Query: 6 GNINNDYLETLVASARPFLRGELE-AIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDI 64
G+ + +A+ARPFLRGE E D +LP+L AVL + GAGECWHKHG+FL HL+D+
Sbjct: 2 GSSTSSTYADALAAARPFLRGEEELQPDPSLPALAAVLSAAGAGECWHKHGTFLAHLLDV 61
Query: 65 YRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIV 124
+RIL++W APD V CGL+HSAYSNSYVNLAIF+P+ GR V G VG AE+L+HLFC+V
Sbjct: 62 HRILRLWSAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRAHVAGVVGAEAERLVHLFCVV 121
Query: 125 PRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTV 184
PRQ LIHDDLLF Y D++LV L S+ SL +A+ G+F +E WR+KI LLPA G+TV
Sbjct: 122 PRQQLIHDDLLFHYEDEDLVVDLARSEESLADAR-GGVFEEDEPWRRKIQRLLPASGITV 180
Query: 185 KHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGD 244
KHI+TGEDV +SRRV A F+++T+ADFSDQLF +QD LF+N++G+LEF GN++++LWPG
Sbjct: 181 KHIRTGEDVALSRRVAAAFLLMTMADFSDQLFDWQDRLFDNTNGRLEFRGNTWTSLWPGT 240
Query: 245 GKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFEN 304
GKPGLW S+SRMGA+YSLI REEEI+I R + G RDEDI L +PPVF
Sbjct: 241 GKPGLWTTSISRMGALYSLIVREEEIYIAHRTHTTGQEG-DETATRDEDIALVIPPVFNG 299
Query: 305 CTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIY 364
CT++L+A +Q ARDLYWEAVC SD + EELL + + KNPFVGEP +VL Q+
Sbjct: 300 CTKVLNADDQKAARDLYWEAVC--SDEEAMDRHRVEELLRQTVAKNPFVGEPRLVLAQVC 357
Query: 365 LAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
L +EEA+ +A +GL LLLEWGS WDKRM+WEG
Sbjct: 358 LNAEMYEEAQEQAEEGLKLLLEWGSSWDKRMTWEG 392
>gi|357138261|ref|XP_003570715.1| PREDICTED: uncharacterized protein LOC100827583 isoform 1
[Brachypodium distachyon]
Length = 421
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 237/400 (59%), Positives = 299/400 (74%), Gaps = 11/400 (2%)
Query: 1 MPSQQGNINNDYLETLVASARPFLRGELE-AIDKNLPSLIAVLRSVGAGECWHKHGSFLE 59
MPS ++ Y + L A+ARPFLRGE E D +LP+L AVL + GAGECWHKHG+FL
Sbjct: 1 MPS-----SSTYADAL-AAARPFLRGEEELQPDPSLPALAAVLSAAGAGECWHKHGTFLA 54
Query: 60 HLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIH 119
HL+D++RIL++W APD V CGL+HSAYSNSYVNLAIF+P+ GR V G VG AE+L+H
Sbjct: 55 HLLDVHRILRLWSAPDAVARCGLYHSAYSNSYVNLAIFEPDVGRAHVAGVVGAEAERLVH 114
Query: 120 LFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPA 179
LFC+VPRQ LIHDDLLF Y D++LV L S+ SL +A+ G+F +E WR+KI LLPA
Sbjct: 115 LFCVVPRQQLIHDDLLFHYEDEDLVVDLARSEESLADAR-GGVFEEDEPWRRKIQRLLPA 173
Query: 180 DGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSA 239
G+TVKHI+TGEDV +SRRV A F+++T+ADFSDQLF +QD LF+N++G+LEF GN++++
Sbjct: 174 SGITVKHIRTGEDVALSRRVAAAFLLMTMADFSDQLFDWQDRLFDNTNGRLEFRGNTWTS 233
Query: 240 LWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVP 299
LWPG GKPGLW S+SRMGA+YSLI REEEI+I R + G RDEDI L +P
Sbjct: 234 LWPGTGKPGLWTTSISRMGALYSLIVREEEIYIAHRTHTTGQEG-DETATRDEDIALVIP 292
Query: 300 PVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVV 359
PVF CT++L+A +Q ARDLYWEAVC SD + EELL + + KNPFVGEP +V
Sbjct: 293 PVFNGCTKVLNADDQKAARDLYWEAVC--SDEEAMDRHRVEELLRQTVAKNPFVGEPRLV 350
Query: 360 LGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
L Q+ L +EEA+ +A +GL LLLEWGS WDKRM+WEG
Sbjct: 351 LAQVCLNAEMYEEAQEQAEEGLKLLLEWGSSWDKRMTWEG 390
>gi|326497943|dbj|BAJ94834.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 421
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 228/384 (59%), Positives = 295/384 (76%), Gaps = 5/384 (1%)
Query: 17 VASARPFLRGELEAI-DKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
+A+ARP+LRGE + D LP+L AVLR+ GAGECWHKHG+FL HL+++YRIL++W +PD
Sbjct: 11 LAAARPYLRGEEDQCGDPALPALTAVLRAAGAGECWHKHGTFLAHLLEVYRILRLWASPD 70
Query: 76 CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
V CGL+HSAYSNSYVNLAIF+P+ GR V VGD AE+L+HLFC+VPRQ L+HDDLL
Sbjct: 71 AVARCGLYHSAYSNSYVNLAIFEPDVGRARVAAVVGDEAERLVHLFCVVPRQQLVHDDLL 130
Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
F Y D +L L S+ S+++A+ +G+F+ +E WR+KI LLPADG+TVKHI+TGEDV +
Sbjct: 131 FHYDDADLAADLARSEESVLDAR-RGVFDDDEPWRRKIQRLLPADGITVKHIRTGEDVAL 189
Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLS 255
SRR+ ATF+M+T+ADFSDQLF +QD LF+N++G+LEF GN++++LWPG GKPGLW S+S
Sbjct: 190 SRRIAATFLMMTMADFSDQLFDWQDRLFDNTNGRLEFRGNTWTSLWPGTGKPGLWTTSIS 249
Query: 256 RMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQI 315
RMG +Y+LI REEEI++ +R + G RDEDI L +PPVF+ CT++LDA +Q
Sbjct: 250 RMGVLYNLIVREEEIYLAQRAHTTGKEG-NDSSTRDEDIALVIPPVFDGCTKVLDADDQK 308
Query: 316 EARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAER 375
ARDLYWEAVC SD K EELL + + KNPFVGEP +VL Q+ L +EEA+
Sbjct: 309 AARDLYWEAVC--SDDEATDRSKVEELLRQSVTKNPFVGEPRLVLAQMCLNAEMYEEAQE 366
Query: 376 EAGKGLMLLLEWGSPWDKRMSWEG 399
+A +GL LLLEWGS WDKRM WEG
Sbjct: 367 QAEEGLKLLLEWGSSWDKRMPWEG 390
>gi|294460946|gb|ADE76045.1| unknown [Picea sitchensis]
Length = 453
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 235/426 (55%), Positives = 296/426 (69%), Gaps = 38/426 (8%)
Query: 8 INNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRI 67
+ N L T + ARPFLRG + +D +LP + VLR VGA ECWHKHGSF +HLVD+YRI
Sbjct: 1 MANSELPTRLERARPFLRGAYDEVDPSLPEFMRVLRQVGASECWHKHGSFFDHLVDVYRI 60
Query: 68 LKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
LK+W APD V CGLFHSAYSNSYVNLAIF P+ GR VR +GD+AE+L+H+FCIVPRQ
Sbjct: 61 LKLWNAPDSVARCGLFHSAYSNSYVNLAIFKPDVGRQRVRDLIGDSAEELVHMFCIVPRQ 120
Query: 128 PLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGL----FNAEEA--------------- 168
PLIHDDLLF ++D+ELV L+ S+ ++ A+ + L EE
Sbjct: 121 PLIHDDLLFHFTDEELVSALEESEKTMNEAEAEDLELQKLKLEEKNGDWLDNIHDKVEIH 180
Query: 169 ---------------WRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSD 213
WR+KI ++LPA G+ VKHIK GEDV V RR+VA F+ +T+ADFSD
Sbjct: 181 KTHFKETLEGFILPEWRRKIQKILPAHGVRVKHIKNGEDVEVPRRLVAIFLFMTMADFSD 240
Query: 214 QLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQ 273
QLF FQD LF+NS+G +EFAGN+++ALWPGDGKPGLW+NS+SRMGA+Y+LI REE++F
Sbjct: 241 QLFSFQDVLFDNSNGIMEFAGNTWTALWPGDGKPGLWLNSISRMGALYTLIVREEKLF-- 298
Query: 274 ERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSK 333
+R+ K E+IEL +PPVF++C +L +Q+ ARDLYWEAVC G K
Sbjct: 299 QREEGRKKNEEEKKKRGFEEIELVIPPVFDHCRSVLFPSDQVTARDLYWEAVC-MPPGEK 357
Query: 334 EKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDK 393
EKAE LL + E NPFVGEP VVL QIY ++G ++EAE+EA GL L+LEWG+ WDK
Sbjct: 358 TH-EKAETLLAKACELNPFVGEPRVVLAQIYNSQGRYDEAEKEAETGLRLMLEWGTTWDK 416
Query: 394 RMSWEG 399
RMSWEG
Sbjct: 417 RMSWEG 422
>gi|218191863|gb|EEC74290.1| hypothetical protein OsI_09541 [Oryza sativa Indica Group]
Length = 413
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 219/389 (56%), Positives = 283/389 (72%), Gaps = 13/389 (3%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
++ +VA+ARPFLRG+L +D LPSL++VL GAGEC+HKHG+FL HL+D+YRIL++W
Sbjct: 5 VDAVVAAARPFLRGDLSQVDPELPSLVSVLCDAGAGECYHKHGTFLAHLIDVYRILRLWG 64
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
APD V CGL+HSAYSNSYVNLAIF P+ RD VRG +G AAE+L+HLFC+VPR L+HD
Sbjct: 65 APDAVARCGLYHSAYSNSYVNLAIFQPDVARDHVRGIIGAAAERLVHLFCVVPRHQLMHD 124
Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE--EAWRKKINELLPADGLTVKHIKTG 190
DL Y+D EL HL AS SL A++ G + EAWR K+ ++P +G+ +HI+TG
Sbjct: 125 DLHLRYTDAELTAHLAASQASLDAARKSGGGGGDPGEAWRAKLASVVPPEGVVARHIRTG 184
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
E V +SRRV+ F+++TIADFSDQ +QD LF+N DG+LE+ G+++ ALWPG GKPGLW
Sbjct: 185 EAVALSRRVLGVFIVMTIADFSDQYTDYQDKLFDNEDGRLEYRGDNWRALWPGSGKPGLW 244
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
M+++SR+ A+Y LIA +EEI R+ +A V K D ++EL +PPVFE C+++LD
Sbjct: 245 MSAMSRLAAVYRLIATDEEI----RELTAGERSVV--KREDAELELVIPPVFERCSKVLD 298
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
A EQ E RDLYWEAVCG + K E LL R IEKNPFVGEP +VL Q L G
Sbjct: 299 AAEQKEGRDLYWEAVCGEWEEG-----KTERLLRRSIEKNPFVGEPWLVLAQALLNSGHD 353
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EEAE A +G+ L++EWGS WDKRMSWEG
Sbjct: 354 EEAEAAAEEGVRLVVEWGSSWDKRMSWEG 382
>gi|326487554|dbj|BAK05449.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 221/396 (55%), Positives = 292/396 (73%), Gaps = 11/396 (2%)
Query: 7 NINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYR 66
+ N+D L++L+ +ARPFLRG+L A+D LPSL++VL S GAGEC+HKHG+FL HL+D+YR
Sbjct: 29 STNDDALQSLLVAARPFLRGDLAAVDPELPSLVSVLVSAGAGECYHKHGTFLAHLLDVYR 88
Query: 67 ILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPR 126
IL++W APD V CGLFHS+YSNSYVNLAIF+P+ R VRG VG+AAE+L+HLFC+VPR
Sbjct: 89 ILRLWGAPDAVARCGLFHSSYSNSYVNLAIFEPDVSRARVRGIVGEAAERLVHLFCVVPR 148
Query: 127 QPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLF--NAEEAWRKKINELLPADGLTV 184
L+HDDL Y+D EL +HL A+ SL A+ G + E WR K+ ++P DG+ V
Sbjct: 149 HALMHDDLHLRYTDAELRDHLAAAGASLQAARSHGGRPEDKAEPWRLKLRSVVPEDGVVV 208
Query: 185 KHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGD 244
HI+TGE V +SRRV+A FV++T+ADFSDQ +QD LF N DG+LEFAG++++ALWPG
Sbjct: 209 PHIRTGEPVALSRRVLAVFVLMTVADFSDQYTDYQDKLFRNDDGRLEFAGDNWAALWPGT 268
Query: 245 GKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFEN 304
GKPGLWM++ SR+ A+Y LIA +E++ E + E+D +EL +PPVF+
Sbjct: 269 GKPGLWMSATSRLAALYRLIATDEQLRHMEEQVGPTKT---TADEQDAGLELCIPPVFDR 325
Query: 305 CTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIY 364
C+++LD EQI ARD+YWEA+C SDG +E +E LL RCI KNP+VGEP +VL Q+
Sbjct: 326 CSKVLDPGEQIAARDMYWEAIC--SDGKEEGVES---LLRRCIAKNPYVGEPWLVLAQVL 380
Query: 365 L-AKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
L G +EEAE A +GL L++EWGS WDKRMSWEG
Sbjct: 381 LNGGGRWEEAEAAAEEGLRLVVEWGSSWDKRMSWEG 416
>gi|115449991|ref|NP_001048601.1| Os02g0828500 [Oryza sativa Japonica Group]
gi|48716459|dbj|BAD23066.1| unknown protein [Oryza sativa Japonica Group]
gi|113538132|dbj|BAF10515.1| Os02g0828500 [Oryza sativa Japonica Group]
Length = 414
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 218/389 (56%), Positives = 283/389 (72%), Gaps = 13/389 (3%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
++ +VA+ARPFLRG+L +D LPSL++VL GAGEC+HKHG+FL HL+D+YRIL++W
Sbjct: 6 VDAVVAAARPFLRGDLSQVDPELPSLVSVLCDAGAGECYHKHGTFLAHLIDVYRILRLWG 65
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
APD V CGL+HSAYSNSYVNLAIF P+ RD VRG +G AAE+L+HLFC+VPR L+HD
Sbjct: 66 APDAVARCGLYHSAYSNSYVNLAIFQPDVARDHVRGIIGAAAERLVHLFCVVPRHQLMHD 125
Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE--EAWRKKINELLPADGLTVKHIKTG 190
DL Y+D EL HL AS SL A++ G + EAWR K+ ++P +G+ +HI+TG
Sbjct: 126 DLHLRYTDAELTAHLAASQASLDAARKSGGGGGDPGEAWRAKLASVVPPEGVVARHIRTG 185
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
E V +SRRV+ F+++TIADFSDQ +QD LF+N DG+LE+ G+++ A+WPG GKPGLW
Sbjct: 186 EAVALSRRVLGVFIVMTIADFSDQYTDYQDKLFDNEDGRLEYRGDNWRAVWPGSGKPGLW 245
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
M+++SR+ A+Y LIA +EEI R+ +A V K D ++EL +PPVFE C+++LD
Sbjct: 246 MSAMSRLAAVYRLIATDEEI----RELTAGERSVV--KREDAELELVIPPVFERCSKVLD 299
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
A EQ E RDLYWEAVCG + K E LL R IEKNPFVGEP +VL Q L G
Sbjct: 300 AAEQKEGRDLYWEAVCGEWEEG-----KTERLLRRSIEKNPFVGEPWLVLAQALLNSGHD 354
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EEAE A +G+ L++EWGS WDKRMSWEG
Sbjct: 355 EEAEAAAEEGVRLVVEWGSSWDKRMSWEG 383
>gi|242063596|ref|XP_002453087.1| hypothetical protein SORBIDRAFT_04g038120 [Sorghum bicolor]
gi|241932918|gb|EES06063.1| hypothetical protein SORBIDRAFT_04g038120 [Sorghum bicolor]
Length = 431
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 215/406 (52%), Positives = 294/406 (72%), Gaps = 13/406 (3%)
Query: 1 MPSQQGNINNDY------LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKH 54
M +QQG + D L +L+A A+PFLRGELE ID +PS +++LRS GAGE HK+
Sbjct: 1 MATQQGFSSTDLKTHRKNLPSLLAVAQPFLRGELEKIDPEIPSFLSILRSAGAGERHHKN 60
Query: 55 GSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAA 114
G+FL HL++ +RI+++W AP + CGLFHS+Y+NSYVN++IF+ +T R+ V+ +G
Sbjct: 61 GAFLAHLLNFHRIVRLWGAPSDIARCGLFHSSYANSYVNVSIFESSTTREDVQRLIGAPV 120
Query: 115 EKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKIN 174
E+L LFC VPR LIH++L F+Y+D EL +HL ASD+S+ A+E G F+A EAWR+KI
Sbjct: 121 ERLAFLFCAVPRHKLIHEELHFQYTDTELRDHLAASDLSIKAARETGTFDASEAWREKIC 180
Query: 175 ELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAG 234
+LLP G+ +H KTGE + +SRR++A FV++TIAD DQ +QD L++N +G+LEF G
Sbjct: 181 KLLPPKGMEARHFKTGEPISLSRRIIALFVLMTIADICDQYIDYQDKLYDNENGRLEFRG 240
Query: 235 NSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDI 294
+++ ALWPG KPGLWMN+ SR+G +Y+LI REEE++ QER +G + +RDE+I
Sbjct: 241 DNWGALWPGTCKPGLWMNAASRLGVLYNLIIREEELYTQERNK----MGETIRLDRDEEI 296
Query: 295 ELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEK-LEKAEELLLRCIEKNPFV 353
L +PPVF CT++LD KEQI ARDLYWEA+C SD KE+ EK E++LL I+KNPFV
Sbjct: 297 MLVIPPVFNYCTKVLDPKEQIAARDLYWEAIC--SDDRKERDWEKVEKVLLESIQKNPFV 354
Query: 354 GEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
GEP +V Q+YL + EA+ EA +GL LLLEWG WDKRM+WE
Sbjct: 355 GEPRLVAAQVYLNMERYAEAKEEAEEGLKLLLEWGISWDKRMTWES 400
>gi|242063598|ref|XP_002453088.1| hypothetical protein SORBIDRAFT_04g038130 [Sorghum bicolor]
gi|241932919|gb|EES06064.1| hypothetical protein SORBIDRAFT_04g038130 [Sorghum bicolor]
Length = 455
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 209/393 (53%), Positives = 279/393 (70%), Gaps = 18/393 (4%)
Query: 13 LETLVASARPFLRGEL-EAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+ +L+ +ARPFLRGEL A D LPSL+AVLR+ GAGEC+HKHG+FL HL D+YRIL++W
Sbjct: 36 VSSLLDAARPFLRGELGSAEDPELPSLVAVLRAAGAGECYHKHGTFLAHLEDVYRILRLW 95
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
A D V CGLFHS+YSNSYV+LAIF P+TGR VRG +G AE+L+HLFC+VPR L+
Sbjct: 96 GASDAVARCGLFHSSYSNSYVDLAIFQPDTGRAQVRGIIGADAERLVHLFCVVPRHHLVF 155
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEK-GLFNAEEAWRKKINELLPADGLTVKHIKTG 190
L Y+DQEL +HL A++ A++ GL WR+K+ ++PA+G+ V HI+TG
Sbjct: 156 QQLQPRYTDQELRDHLAAAEAEAEIAQQPGGLLTTMSPWRQKLRSVVPAEGVVVSHIRTG 215
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
E V +SRRVVA F+++T+ADFSDQ +QD LF+N DG+LEF G++++ALWPG GKPGLW
Sbjct: 216 EPVRLSRRVVAAFLLMTVADFSDQYTDYQDDLFDNDDGRLEFRGDNWAALWPGTGKPGLW 275
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDI----ELPVPPVFENCT 306
++++SR+ A+Y LIAR+E++ + +G + D+D+ EL +PPVFE C+
Sbjct: 276 VSAMSRLAALYGLIARDEQLIM----------AIGEQQHDDDDVAAVGELVIPPVFERCS 325
Query: 307 RILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLA 366
R+LD EQ ARDLYWEA+ ++ K K E LL I KNP VGEPH+VL Q+ L
Sbjct: 326 RVLDPGEQKAARDLYWEAITSSTKDDKRK--DPEALLRESIAKNPLVGEPHLVLAQVLLN 383
Query: 367 KGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
G + EA A +G+ LLL+WGS WDKRMSWEG
Sbjct: 384 GGRYAEAAAAAARGVRLLLQWGSSWDKRMSWEG 416
>gi|357118082|ref|XP_003560788.1| PREDICTED: uncharacterized protein LOC100845551 [Brachypodium
distachyon]
Length = 423
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 211/405 (52%), Positives = 289/405 (71%), Gaps = 19/405 (4%)
Query: 1 MPSQ----QGNINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGS 56
MP+ G + + + L+A ARPFLRG+L A+D LPSL++VL + GAGEC+HKHG+
Sbjct: 1 MPTNGDGISGGVISSGADALLAVARPFLRGDLGAVDPELPSLVSVLVAAGAGECYHKHGT 60
Query: 57 FLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEK 116
FL HL+D++RIL++W APD V CGL+HS+YSNSYVNLAIF P+ R VR VG AE+
Sbjct: 61 FLAHLIDVFRILRLWAAPDAVARCGLYHSSYSNSYVNLAIFPPDISRGRVRAIVGAPAER 120
Query: 117 LIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINEL 176
L+HLFC+VPR L+H +L Y+D EL HL + + ++G F+ WR K+ +
Sbjct: 121 LVHLFCVVPRHALLHQNLHLRYADAELRAHLARA----VAGDDEGYFS---EWRVKLRSV 173
Query: 177 LPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLF-NNSDGKLEFAGN 235
+PA+G+ +HI+TGE V +SR+V+A FV++TIADFSDQ +QD LF N DG+ EFAG+
Sbjct: 174 VPAEGVVAEHIRTGEPVALSRKVLAAFVLMTIADFSDQYTDYQDKLFGGNEDGRFEFAGD 233
Query: 236 SFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIE 295
++ALWPG GKPGLWM+++SR+ +Y LIA +E + ++E +G V+ E D +E
Sbjct: 234 DWAALWPGTGKPGLWMSAMSRLATLYRLIATDEALRLREED-----MGRPVNDE-DAGLE 287
Query: 296 LPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEK-LEKAEELLLRCIEKNPFVG 354
L +PPVF++C+++LD EQ+ ARDLYWEA+C + D +K+K EELL R IEKNP+VG
Sbjct: 288 LVIPPVFDHCSKVLDPGEQVTARDLYWEAICSDEDMAKKKNTVVVEELLRRSIEKNPYVG 347
Query: 355 EPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
EP +VL Q+ L G +EEAE EA +GL L+LEWGS WDKRM+WEG
Sbjct: 348 EPWLVLAQVLLNAGRYEEAEEEAEQGLKLVLEWGSSWDKRMTWEG 392
>gi|168023750|ref|XP_001764400.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684264|gb|EDQ70667.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 417
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 183/389 (47%), Positives = 255/389 (65%), Gaps = 10/389 (2%)
Query: 11 DYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKI 70
D L +A RPFLR + E+ID LP L+ +L + GAGECWHKHG+F EHL+ YRILK+
Sbjct: 8 DPLAAKIAQCRPFLREDYESIDSRLPGLLKILHARGAGECWHKHGTFYEHLLHTYRILKL 67
Query: 71 WKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
W + D V L GLFHS+YSNSYVNLAIF P+ R V+RG +GD AE+L+H+FC++PRQ LI
Sbjct: 68 WGSSDTVALFGLFHSSYSNSYVNLAIFAPDVDRSVLRGLIGDEAEELVHIFCVIPRQELI 127
Query: 131 HDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG 190
D LL +D +L+ LK + + E NA+ K+ +P +G+ VKHI+TG
Sbjct: 128 FDRLLHTITDDDLIASLKQYQGTKVGESEA---NAKGGHDKR-RLFVPKEGMVVKHIRTG 183
Query: 191 EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLW 250
ED+ + RR+VA FV+LT+ADF DQ +G+QD LF N DG+ ++GN + ALWPG+ KPGLW
Sbjct: 184 EDIFLPRRLVALFVLLTMADFVDQFYGWQDQLFRNDDGEFRYSGNDWHALWPGNCKPGLW 243
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
++ SRMG + +++ E+ + +V G+ + +E+ +PP+F CT IL
Sbjct: 244 ISIASRMGVLLNILLLED----AAEQAELHSVPNGLPERPFGTLEISLPPIFNMCTEILP 299
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
Q + + YWEAV N +G + A LL C + NPFV EPH++L QI L +G++
Sbjct: 300 PVNQNRSLEQYWEAV--NIEGRDRLVNTAVPLLQSCCKLNPFVAEPHLLLAQILLGEGQY 357
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
E AE EA +G+ LLL+W + WDKRM+WEG
Sbjct: 358 ELAETEADEGVRLLLDWATNWDKRMTWEG 386
>gi|116791245|gb|ABK25908.1| unknown [Picea sitchensis]
Length = 303
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 169/301 (56%), Positives = 215/301 (71%), Gaps = 34/301 (11%)
Query: 8 INNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRI 67
+ N L T + ARPFLRG + +D +LP + VLR VGA ECWHKHGSF +HLVD+YRI
Sbjct: 1 MANSELPTRLERARPFLRGAYDEVDPSLPEFMRVLRQVGASECWHKHGSFFDHLVDVYRI 60
Query: 68 LKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
LK+W APD V CGLFHSAYSNSYVNLAIF+P+ GR VR +GD+AE+L+H+FCIVPRQ
Sbjct: 61 LKLWNAPDSVARCGLFHSAYSNSYVNLAIFEPDVGRQRVRDLIGDSAEELVHMFCIVPRQ 120
Query: 128 PLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGL----FNAEEA--------------- 168
PLIHDDLLF ++D+ELV L+ S+ ++ A+ + L EE
Sbjct: 121 PLIHDDLLFHFTDEELVSALEESEKTMKEAEAEDLELQKLKLEEKNGDWLDNIHDKVEIH 180
Query: 169 ---------------WRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSD 213
WR+KI ++LPA G+ VKHIK GEDV V RR+VA F+ +T+ADFSD
Sbjct: 181 KTHFKETLEGFILPEWRRKIQKILPAHGVRVKHIKNGEDVEVPRRLVAIFLFMTMADFSD 240
Query: 214 QLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQ 273
QLF FQD LF+NS+G +EFAGN+++ALWPGDGKPGLW+NS+SRMGA+Y+LI REE++F +
Sbjct: 241 QLFSFQDVLFDNSNGIMEFAGNTWTALWPGDGKPGLWLNSISRMGALYTLIVREEKLFQR 300
Query: 274 E 274
E
Sbjct: 301 E 301
>gi|302810161|ref|XP_002986772.1| hypothetical protein SELMODRAFT_124836 [Selaginella moellendorffii]
gi|300145426|gb|EFJ12102.1| hypothetical protein SELMODRAFT_124836 [Selaginella moellendorffii]
Length = 382
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 193/380 (50%), Positives = 249/380 (65%), Gaps = 32/380 (8%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR FLR E AID LP ++VLRS GA EC+HKHG+F +HLV++YRILK+W APD VC
Sbjct: 4 ARLFLREEYAAIDPALPHWLSVLRSTGASECFHKHGTFFQHLVEVYRILKLWGAPDDVCR 63
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
CGLFHS+YSNSYVNLAIF+P R V+ +GD AE+L+ LFCIVPRQ LIHD LLF +S
Sbjct: 64 CGLFHSSYSNSYVNLAIFEPGVDRSRVKDLIGDPAEQLVELFCIVPRQELIHDMLLFRFS 123
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
D +L E ASD S KK LLP +TVK+I+ GEDV + RR+
Sbjct: 124 DDQLQEA-AASDRSC----------------KKSKLLLPDGFITVKNIRNGEDVRLPRRI 166
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGA 259
VA FV+LT+ADF DQL+ +QD LF+N DG+L F+GN + ALWPGD +PGLWM+S+SRMG
Sbjct: 167 VAVFVLLTMADFCDQLYNWQDNLFDNRDGELRFSGNHYWALWPGDVQPGLWMSSISRMGL 226
Query: 260 IYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARD 319
+ + EEE Q+ A I L +PP+F+ C+++L +QI ARD
Sbjct: 227 LLKGLLNEEE---QDNGICDAPAA----------IALKLPPLFDGCSKLLRPVDQIAARD 273
Query: 320 LYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGK 379
LY EAV +D ++ + L E+NP++GEPH+VL Q+ +G FEEA EA
Sbjct: 274 LYLEAV--GADLKRQGFDWIAGKLRGACERNPWIGEPHLVLAQLCCLEGAFEEAAVEAAL 331
Query: 380 GLMLLLEWGSPWDKRMSWEG 399
GL LL+WG+ WDKR++WE
Sbjct: 332 GLERLLQWGTAWDKRVAWEA 351
>gi|302772192|ref|XP_002969514.1| hypothetical protein SELMODRAFT_91578 [Selaginella moellendorffii]
gi|300162990|gb|EFJ29602.1| hypothetical protein SELMODRAFT_91578 [Selaginella moellendorffii]
Length = 382
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 193/380 (50%), Positives = 249/380 (65%), Gaps = 32/380 (8%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR FLR E AID LP ++VLRS GA EC+HKHG+F +HLV++YRILK+W APD VC
Sbjct: 4 ARLFLREEYAAIDPALPHWLSVLRSTGASECFHKHGTFFQHLVEVYRILKLWGAPDDVCR 63
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
CGLFHS+YSNSYVNLAIF+P R V+ +GD AE+L+ LFCIVPRQ LIHD LLF +S
Sbjct: 64 CGLFHSSYSNSYVNLAIFEPGVDRSRVKDLIGDPAEQLVELFCIVPRQELIHDMLLFRFS 123
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
D +L E ASD S KK LLP +TVK+I+ GEDV + RR+
Sbjct: 124 DDQLQEA-AASDRS----------------GKKSKLLLPDGFITVKNIRNGEDVRLPRRI 166
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGA 259
VA FV+LT+ADF DQL+ +QD LF+N DG+L F+GN + ALWPGD +PGLWM+S+SRMG
Sbjct: 167 VAVFVLLTMADFCDQLYNWQDNLFDNRDGELRFSGNHYWALWPGDVQPGLWMSSISRMGL 226
Query: 260 IYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARD 319
+ + EEE Q+ A I L +PP+F+ C+++L +QI ARD
Sbjct: 227 LLKGLLNEEE---QDNGICDAPAA----------IALKLPPLFDGCSKLLRPVDQIAARD 273
Query: 320 LYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGK 379
LY EAV +D ++ + L E+NP++GEPH+VL Q+ +G FEEA EA
Sbjct: 274 LYLEAV--GADLKRQGFDWIAGKLRGACERNPWIGEPHLVLAQLCCLEGAFEEAAVEAAL 331
Query: 380 GLMLLLEWGSPWDKRMSWEG 399
GL LL+WG+ WDKR++WE
Sbjct: 332 GLERLLQWGTAWDKRVAWEA 351
>gi|297742731|emb|CBI35365.3| unnamed protein product [Vitis vinifera]
Length = 361
Score = 340 bits (872), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 168/254 (66%), Positives = 192/254 (75%), Gaps = 43/254 (16%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L+TL+ASARPFLRGELEA+ LPSLI+VLRSVGAGECWHKHGSFLEHL D+Y+ILK+W
Sbjct: 78 HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLFDMYKILKLW 137
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
KA + VCLCGLFHSAYSNSYVNLAIFDP+TGRD VR HVGDAAE+LI+LFC+VPRQPLIH
Sbjct: 138 KAQESVCLCGLFHSAYSNSYVNLAIFDPSTGRDTVRAHVGDAAEELIYLFCVVPRQPLIH 197
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
DDLLF YSD ELVEHLKAS+VS+ NAKE G+FN +EAWRKK+N L+P G+TVKHIKTGE
Sbjct: 198 DDLLFRYSDDELVEHLKASEVSVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 257
Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
DV VSRRVVA F PGLW+
Sbjct: 258 DVLVSRRVVAIF-------------------------------------------PGLWL 274
Query: 252 NSLSRMGAIYSLIA 265
NS+SRMGA+YSL+
Sbjct: 275 NSISRMGAVYSLLV 288
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 28/34 (82%)
Query: 336 LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
L+ AE+LL RC+EKNPFVGEPHVVL QIYL G
Sbjct: 303 LDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGR 336
>gi|297742729|emb|CBI35363.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 337 bits (864), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 164/254 (64%), Positives = 193/254 (75%), Gaps = 42/254 (16%)
Query: 12 YLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
+L+TL+ASARPFLRGELEA+ LPSLI+VLRSVGAGECWHKHGSFLEHLVD+Y+ILK+W
Sbjct: 14 HLQTLLASARPFLRGELEAVHPKLPSLISVLRSVGAGECWHKHGSFLEHLVDMYKILKLW 73
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
KA + VCLCGLFHSAYSNSYVNLAIF+P+TGRD VR HV
Sbjct: 74 KAQESVCLCGLFHSAYSNSYVNLAIFEPSTGRDTVRAHV--------------------- 112
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
S+ NAKE G+FN +EAWRKK+N L+P G+TVKHIKTGE
Sbjct: 113 ---------------------SVRNAKEGGVFNGDEAWRKKVNSLVPPGGVTVKHIKTGE 151
Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWM 251
DV VSRRVVA FVM+T+ADFSDQ +GFQD LF+NS+G+LEF+GNS+ ALWPGDGKPGLW+
Sbjct: 152 DVLVSRRVVAIFVMMTMADFSDQHYGFQDALFDNSNGRLEFSGNSYGALWPGDGKPGLWL 211
Query: 252 NSLSRMGAIYSLIA 265
NS+SRMGA+YSL+
Sbjct: 212 NSISRMGAVYSLLV 225
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 28/34 (82%)
Query: 336 LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
L+ AE+LL RC+EKNPFVGEPHVVL QIYL G
Sbjct: 240 LDWAEKLLGRCVEKNPFVGEPHVVLAQIYLTNGR 273
>gi|325930777|gb|ADZ45830.1| TPR1 [Glycine max]
Length = 268
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 163/244 (66%), Positives = 196/244 (80%), Gaps = 11/244 (4%)
Query: 159 EKGLFNAE---EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQL 215
EKGLF+ + E WRKK+ L+PA+G+ VKHI+TGE V VSRR+VA F+M+T+ADF DQL
Sbjct: 2 EKGLFDDDGELEGWRKKLQGLVPAEGVQVKHIRTGEGVHVSRRIVAVFIMMTMADFCDQL 61
Query: 216 FGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQER 275
FGFQD LF+N++G+LEF+GN+F ALWPGDGKPGLW+NS+SRMGA+Y+LIAREEEIFIQER
Sbjct: 62 FGFQDLLFDNANGRLEFSGNNFGALWPGDGKPGLWLNSISRMGAVYTLIAREEEIFIQER 121
Query: 276 KTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEK 335
K V V D ER+EDIEL +PPVF+ C ++L+A +QI ARDLYWEAVC E
Sbjct: 122 KRK-VGVAVVPDLERNEDIELVLPPVFDYCRKVLEAGDQIVARDLYWEAVC-------EG 173
Query: 336 LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRM 395
KAEELLL IEKNPFVGEP+VVL Q+YL +G FEEAE+ A +GL LLLEWG PWDKR
Sbjct: 174 GSKAEELLLESIEKNPFVGEPYVVLSQVYLTEGRFEEAEKHAERGLKLLLEWGCPWDKRT 233
Query: 396 SWEG 399
SWEG
Sbjct: 234 SWEG 237
>gi|302851869|ref|XP_002957457.1| hypothetical protein VOLCADRAFT_98509 [Volvox carteri f.
nagariensis]
gi|300257261|gb|EFJ41512.1| hypothetical protein VOLCADRAFT_98509 [Volvox carteri f.
nagariensis]
Length = 427
Score = 307 bits (786), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 166/379 (43%), Positives = 222/379 (58%), Gaps = 32/379 (8%)
Query: 21 RPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLC 80
R FLRG+L ID +LP + VL SVGAGECWHK +F HL ++Y+I KIW D VC C
Sbjct: 32 RAFLRGDLSRIDPHLPEWLDVLSSVGAGECWHK--NFKSHLYEVYKICKIWGQDDDVCKC 89
Query: 81 GLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSD 140
LFHSAYSNSYVNLAIF +T R +R VG +AE LIH FC+VPR L++DDLL
Sbjct: 90 ALFHSAYSNSYVNLAIFKESTDRTRLRELVGHSAEDLIHTFCVVPRHNLVYDDLLARAPS 149
Query: 141 QELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVV 200
E + L + + L+P +G+TV+HIKTGE + V R V
Sbjct: 150 AEDLAALGVGGAGGDDGAAAAAAP------PSLRRLVPPEGITVRHIKTGEPLTVDRLTV 203
Query: 201 ATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAI 260
A F+++T+ADF +QLF +QD +F N +G+L +AG + +LWPG KPGLW ++LSRMGA+
Sbjct: 204 ARFLVMTMADFCEQLFSWQDSMFENHNGRLLYAGTNHGSLWPGACKPGLWHSALSRMGAL 263
Query: 261 YSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDL 320
E G + +P+PPVFE CT+++ +Q+ ARD
Sbjct: 264 LRFACTGPE-------------GRPL---------VPLPPVFEGCTQVITEADQLAARDA 301
Query: 321 YWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKG 380
YW AV ++ K ++A LL NP V EPHV+L QIY +G++ + R A +
Sbjct: 302 YWVAVSECTETHKH--DEALRLLRTACWHNPHVAEPHVLLAQIYCQRGQWVDGRRHADRA 359
Query: 381 LMLLLEWGSPWDKRMSWEG 399
+ LL WG+ WDKRMSWE
Sbjct: 360 VRLLGCWGTAWDKRMSWEA 378
>gi|440792871|gb|ELR14079.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 377
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 157/385 (40%), Positives = 213/385 (55%), Gaps = 62/385 (16%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
E + AR LR EL D LP+L+ L GA ECWHKHG+F +HL ++RIL +WK
Sbjct: 15 ELVPEHARALLREELHHCDPKLPALLEQLGKHGASECWHKHGTFKQHLFGVWRILYLWKQ 74
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
VC CGL+HSAYSNSYVNLAIF P+TGR+ VR VG+ AE+LIH FC +PRQ LI D
Sbjct: 75 SQAVCRCGLYHSAYSNSYVNLAIFKPDTGRETVRSAVGNEAEELIHFFCTIPRQELIFDH 134
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTG-ED 192
+L + D+ + P +G+ V+HI+ E
Sbjct: 135 ILPLHPDKPI----------------------------------PKEGVEVRHIRNKDEK 160
Query: 193 VGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMN 252
+ VS +++ ++LTIADF DQ +G+QD LF N +G L +G++ + LWP GKPGLWM+
Sbjct: 161 IRVSPKLMKISLLLTIADFCDQWYGWQDLLFAN-EGGLTMSGDNPATLWPSTGKPGLWMS 219
Query: 253 SLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAK 312
LS+M + I +E D E P+PP+F NC+ +L+
Sbjct: 220 RLSQMAVLSRRIHEQEP-----------------DDE-------PLPPIFNNCSTVLEPD 255
Query: 313 EQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
+ EARDLYW V ++ + + + L + I NPFV EP V+L QIY G +EE
Sbjct: 256 SEREARDLYWRVVMHKTEQEQHEEAAED--LRKAIALNPFVAEPRVLLAQIYNVSGRYEE 313
Query: 373 AEREAGKGLMLLLEWGSPWDKRMSW 397
AE +A + L LL EWG+ WDK W
Sbjct: 314 AESQAQEALRLLFEWGTAWDKVPPW 338
>gi|357438997|ref|XP_003589775.1| hypothetical protein MTR_1g039140 [Medicago truncatula]
gi|355478823|gb|AES60026.1| hypothetical protein MTR_1g039140 [Medicago truncatula]
Length = 253
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 129/193 (66%), Positives = 155/193 (80%), Gaps = 6/193 (3%)
Query: 208 IADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIARE 267
+ADF DQLFGFQD LF N DG+LEF GN+F A+WPG+GKPGLW+NS+SRMGA+Y+LI RE
Sbjct: 1 MADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLWLNSISRMGAVYNLILRE 60
Query: 268 EEIFIQERKTSAAAVGV-GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVC 326
EEIF++E+K GV GVD ERDE IEL +PPVF CT++LDA++QI ARDLYWEA+
Sbjct: 61 EEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVLDARDQIVARDLYWEAMI 120
Query: 327 GNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLE 386
+E LEK EELL++ IEKNPFVGEP+VVL Q+YL KG FEE E+EA +GL LLLE
Sbjct: 121 -----CEEGLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGRFEEGEKEAERGLTLLLE 175
Query: 387 WGSPWDKRMSWEG 399
WG WDKR+SWEG
Sbjct: 176 WGCHWDKRISWEG 188
>gi|159467587|ref|XP_001691973.1| hypothetical protein CHLREDRAFT_170872 [Chlamydomonas reinhardtii]
gi|158278700|gb|EDP04463.1| predicted protein [Chlamydomonas reinhardtii]
Length = 347
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 190/335 (56%), Gaps = 61/335 (18%)
Query: 21 RPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLC 80
R FLRG+L +DK LP + VL VGAGECWHK SF HL ++Y++ KIW D VC C
Sbjct: 27 RAFLRGDLSRVDKRLPEWMEVLSGVGAGECWHKKSSFKAHLFEVYKLCKIWGQDDDVCKC 86
Query: 81 GLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSD 140
LFHSAYSNSYVNLAIF R ++ +G +AE+LIH FC+VPR L++DD+L
Sbjct: 87 ALFHSAYSNSYVNLAIFQEGADRGRLKELLGPSAEELIHTFCVVPRHNLVYDDVLAAAP- 145
Query: 141 QELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVV 200
ASD++ + VKHIKTGE + V R+ +
Sbjct: 146 -------SASDMAAM----------------------------VKHIKTGEPLLVPRKTI 170
Query: 201 ATFVMLTIADFSDQLFGFQDFLF-NNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGA 259
A F+ +TIADF++QLF +QD +F N DG+L +AG + +LWPG KPGLW ++LSRM
Sbjct: 171 AQFLAMTIADFAEQLFSWQDAMFENEGDGRLLYAGGNARSLWPGPCKPGLWHSALSRM-- 228
Query: 260 IYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARD 319
+L+ R G+ + + +P+PPVFE CT++L +Q+ ARD
Sbjct: 229 --ALMLRH-----------------GLTDGSGQPL-VPLPPVFERCTQVLTEADQLAARD 268
Query: 320 LYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVG 354
YW AVC +++ + ++A LL ++ NP +
Sbjct: 269 AYWRAVCDHTEPHQH--DEAARLLRAAVDHNPHIA 301
>gi|222623967|gb|EEE58099.1| hypothetical protein OsJ_08972 [Oryza sativa Japonica Group]
Length = 656
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 107/187 (57%), Positives = 140/187 (74%), Gaps = 2/187 (1%)
Query: 13 LETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWK 72
++ +VA+ARPFLRG+L +D LPSL++VL GAGEC+HKHG+FL HL+D+YRIL++W
Sbjct: 6 VDAVVAAARPFLRGDLSQVDPELPSLVSVLCDAGAGECYHKHGTFLAHLIDVYRILRLWG 65
Query: 73 APDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
APD V CGL+HSAYSNSYVNLAIF P+ RD VRG +G AAE+L+HLFC+VPR L+HD
Sbjct: 66 APDAVARCGLYHSAYSNSYVNLAIFQPDVARDHVRGIIGAAAERLVHLFCVVPRHQLMHD 125
Query: 133 DLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE--EAWRKKINELLPADGLTVKHIKTG 190
DL Y+D EL HL AS SL A++ G + EAWR K+ ++P +G+ +HI+TG
Sbjct: 126 DLHLRYTDAELTAHLAASQASLDAARKSGGGGGDPGEAWRAKLASVVPPEGVVARHIRTG 185
Query: 191 EDVGVSR 197
E V +SR
Sbjct: 186 EAVALSR 192
>gi|212722138|ref|NP_001131675.1| uncharacterized protein LOC100193035 [Zea mays]
gi|194692216|gb|ACF80192.1| unknown [Zea mays]
gi|195637840|gb|ACG38388.1| hypothetical protein [Zea mays]
gi|223948543|gb|ACN28355.1| unknown [Zea mays]
gi|413939580|gb|AFW74131.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
Length = 214
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 112/194 (57%), Positives = 141/194 (72%), Gaps = 11/194 (5%)
Query: 206 LTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIA 265
+TIADFSDQLF +QD LF+NS+G+LEF+GN++++LWPG GKPGLW+ S SRM A+Y+LI
Sbjct: 1 MTIADFSDQLFDWQDRLFDNSNGRLEFSGNTWTSLWPGTGKPGLWVTSSSRMAALYTLIV 60
Query: 266 REEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAV 325
REEEI I +R RDED++L +PPVF CTR+L+A++Q ARDLYWEAV
Sbjct: 61 REEEIHITQRARDTHG--------RDEDMDLVIPPVFNGCTRLLNAEDQKVARDLYWEAV 112
Query: 326 CGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLL 385
CG K E+LL + I KNPFVGEPH+VL Q+YL + EA+ +A +GL LLL
Sbjct: 113 CGE---EASDWLKVEQLLQQSIAKNPFVGEPHLVLAQVYLNMERYGEAQMQAEEGLRLLL 169
Query: 386 EWGSPWDKRMSWEG 399
EWGS WDKRM WEG
Sbjct: 170 EWGSSWDKRMPWEG 183
>gi|413939579|gb|AFW74130.1| hypothetical protein ZEAMMB73_472163 [Zea mays]
Length = 195
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 105/167 (62%), Positives = 132/167 (79%), Gaps = 1/167 (0%)
Query: 22 PFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCG 81
PFLRGE +D LP L AVLR GAGECWHKHG+FL HL+D++RIL++W APD V CG
Sbjct: 30 PFLRGEEAEVDPALPGLAAVLRGAGAGECWHKHGTFLAHLLDVHRILRLWGAPDAVARCG 89
Query: 82 LFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQ 141
L+HSAYSNSYVNLAIF P+TGR V VG AE+L+HLFC+VPRQ L+HDDLLF YSDQ
Sbjct: 90 LYHSAYSNSYVNLAIFQPDTGRAHVAAVVGGPAERLVHLFCVVPRQQLVHDDLLFHYSDQ 149
Query: 142 ELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIK 188
+L + L +S+ S+ +A+ +G+F+ + WR+KI LLPADG+TVKHI+
Sbjct: 150 DLADDLASSEESVRDAR-RGVFDDAQPWRRKIQRLLPADGITVKHIR 195
>gi|219118345|ref|XP_002179949.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409002|gb|EEC48935.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 393
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/367 (35%), Positives = 189/367 (51%), Gaps = 65/367 (17%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
I+ LRS GA CWHKH +FLEHL+ ++ IL++W + GLFHSAYSNSYVNLA+FD
Sbjct: 48 ISALRSRGAHRCWHKHSTFLEHLLSVHNILRLWGQGRTIGRVGLFHSAYSNSYVNLALFD 107
Query: 99 PNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAK 158
P R +++ +G+ AE L+HLFCI+ RQ ++ + LL +
Sbjct: 108 PAIERGMMKTLIGEDAEVLVHLFCIIDRQQVVVNTLLAQ--------------------- 146
Query: 159 EKGLFNAEEAWRKKINELLPADGLTVKHIK-TGEDVGVSRRVVATFVMLTIADFSDQLFG 217
+P GLTV H++ + + V ++ + V+ T+AD +DQ FG
Sbjct: 147 ----------------GFIPKQGLTVPHLRDSSKSVFLNSETLRMLVVFTMADIADQYFG 190
Query: 218 FQDFLFNNSD--------GKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEE 269
+QD LF GK + A + +ALWPG KPGLWM+ LS++ +
Sbjct: 191 WQDTLFGGGGQEGSMIIPGKDDVARHDTTALWPGVSKPGLWMSYLSQLAEV-------AR 243
Query: 270 IFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNS 329
F+ + V++ DED PPVF N T+ L +++ ARD YW V G
Sbjct: 244 TFVPSTQD--------VNRSADEDRTADFPPVFANGTQSLSVSDEVAARDFYWSVVSGEV 295
Query: 330 DGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGS 389
+ E +E +L C++ NP+ EP V+L Q +L G F+ A + L L +WG+
Sbjct: 296 TDNAEAIE----VLQLCVKMNPWAFEPLVILAQKHLHTGNFDSALNTTYRALDLQHQWGT 351
Query: 390 PWDKRMS 396
WDKR+S
Sbjct: 352 AWDKRLS 358
>gi|407939355|ref|YP_006854996.1| hypothetical protein C380_13285 [Acidovorax sp. KKS102]
gi|407897149|gb|AFU46358.1| hypothetical protein C380_13285 [Acidovorax sp. KKS102]
Length = 362
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 178/386 (46%), Gaps = 69/386 (17%)
Query: 16 LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
L A A P L E D L ++ + + G+ WHK G+F HLV + R L +W+ P
Sbjct: 12 LFARALPLLDDEWLTRDPELAPVLPTVLARNVGQDWHKAGTFRHHLVGVTRTLTVWQQPR 71
Query: 76 CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
V L GL HS Y N++V+L FDP R VR G++AE L++LFC R +
Sbjct: 72 DVRLLGLLHSVYGNAFVDLVKFDPAKERARVREIAGESAEHLVYLFCTQSRTQFV----- 126
Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
Q+++ H +D SL+ K+ G+D +
Sbjct: 127 -----QKVLAHALEADGSLVLQKD------------------------------GQDHVL 151
Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNN-SDGKLEFAGNSFSA-LWPGDGKPGLWMNS 253
+ VA F+++++AD +Q F +QD +F+ D + ++A LWPG +P
Sbjct: 152 TPYEVAAFIIVSMADTIEQWFSWQDDIFSRFPDVQHRHQKAHWAASLWPGPMRPS----- 206
Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
RM + +A+ + +D+ LP+PPVF +CT+ L A
Sbjct: 207 -GRMVHQINGLAK------------------ALQHPGLKDV-LPMPPVFAHCTQHLSAAN 246
Query: 314 QIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
+ A LYW + D L+ A +L + NP+VGEP +VL Q+YL+ G E+A
Sbjct: 247 EAAATSLYWSVI--QQDQPLVDLDVATGVLESAVRHNPWVGEPQMVLAQLYLSAGRNEDA 304
Query: 374 EREAGKGLMLLLEWGSPWDKRMSWEG 399
+ A L L WG+ WDKR+ W+
Sbjct: 305 KAAAESALHLFSAWGNAWDKRVQWDA 330
>gi|421486214|ref|ZP_15933762.1| hypothetical protein QWC_26353 [Achromobacter piechaudii HLE]
gi|400195559|gb|EJO28547.1| hypothetical protein QWC_26353 [Achromobacter piechaudii HLE]
Length = 357
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 162/355 (45%), Gaps = 68/355 (19%)
Query: 47 AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
AGE WHK G+F HL+ +YR L +W P V L GLFHS Y N YV+L +FD R +
Sbjct: 38 AGEDWHKAGTFKHHLLGVYRTLTLWNQPREVRLLGLFHSVYGNEYVDLTLFDRERERSTL 97
Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
R +G AE+ + LFC +PR + L + +G N
Sbjct: 98 REVLGTEAEEWVSLFCAMPRTKFVQAVL-----------------------QGQGAGN-- 132
Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
DGLT++ G+ + R VA F++++ AD +Q +QD +F
Sbjct: 133 -------------DGLTLEGAD-GQMFTFTPRQVAAFIVVSAADIGEQWHSWQDEIFAGY 178
Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
+ + N S+LWPG KP N LS + + + + S
Sbjct: 179 PQQQRRDLKTNWASSLWPGPLKPP--SNILSMLSHLLAPL-------------SGMPANT 223
Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
G+ P+PP F+ C L +++ A LYW+ + + +++ A LL
Sbjct: 224 GI----------PIPPAFDQCRATLSPRDEAAASALYWQVITRMHPLT--EMDSARHLLE 271
Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+ NP+VGEP ++L Q+ L G+F+ AE+ A GL L WG+ WDKR+ W G
Sbjct: 272 AAVAHNPWVGEPRLLLAQLALTSGDFDAAEQHATAGLAALQAWGTAWDKRIEWAG 326
>gi|365091191|ref|ZP_09328698.1| hypothetical protein KYG_08065 [Acidovorax sp. NO-1]
gi|363416309|gb|EHL23429.1| hypothetical protein KYG_08065 [Acidovorax sp. NO-1]
Length = 362
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 180/390 (46%), Gaps = 77/390 (19%)
Query: 16 LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
L A A P L + A D L ++ ++ + G+ WHK G+F HLV + R L +W+ P
Sbjct: 12 LFARALPLLDDDWLARDAELAPVLPMVLARNVGQDWHKAGTFRHHLVGVTRTLTVWQQPR 71
Query: 76 CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
V L GL HS Y N++V+L FDP R VR G++AE L++LFC R +
Sbjct: 72 DVRLLGLLHSVYGNAFVDLVKFDPAKERARVREIAGESAEHLVYLFCTQSRTQFV----- 126
Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
Q+++ H +D SL+ K+ + + L P +
Sbjct: 127 -----QKVLAHALEADGSLVLQKD-----------GQHHVLTPCE--------------- 155
Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFN------NSDGKLEFAGNSFSALWPGDGKPGL 249
VA F+++++AD +Q F +QD +F+ + K +A ++LWPG +P
Sbjct: 156 ----VAAFIIVSMADTIEQWFSWQDDIFSRFPEVQHRHQKAHWA----ASLWPGPMRPS- 206
Query: 250 WMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRIL 309
RM + +A+ +Q LP+PPVF+NCT+ L
Sbjct: 207 -----GRMVHQINGLAKA----LQHPGLKGL---------------LPMPPVFDNCTQHL 242
Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
A + A LYW + D L+ A +L + NP+VGEP +VL Q+YL+ G
Sbjct: 243 SAANEAAATSLYWSVI--QQDQPLVDLDVATGVLESAVRHNPWVGEPQMVLAQLYLSAGR 300
Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
++A+ A L L WG+ WDKR+ W+
Sbjct: 301 KDDAKAAAESALHLFSAWGNAWDKRVQWDA 330
>gi|427817671|ref|ZP_18984734.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
gi|410568671|emb|CCN16724.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
Length = 357
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 11 ARALLADHYAAIDADLPTLLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERATLRRYLGEEAEQWVHLFCAMPRTQFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326
>gi|33599383|ref|NP_886943.1| hypothetical protein BB0394 [Bordetella bronchiseptica RB50]
gi|33566979|emb|CAE30892.1| conserved hypothetical protein [Bordetella bronchiseptica RB50]
Length = 357
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 11 ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326
>gi|427812647|ref|ZP_18979711.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
gi|410563647|emb|CCN21182.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
Length = 357
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 11 ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 123 -------------RILAGEGRG-----------------ATGLVLQGAD-GQPLTLTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326
>gi|410418190|ref|YP_006898639.1| hypothetical protein BN115_0379 [Bordetella bronchiseptica MO149]
gi|408445485|emb|CCJ57135.1| conserved hypothetical protein [Bordetella bronchiseptica MO149]
Length = 357
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 11 ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERATLRRYLGEEAEQWVHLFCAMPRTQFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326
>gi|427823827|ref|ZP_18990889.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
gi|410589092|emb|CCN04157.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
Length = 357
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 181/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 11 ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F CT +LDA ++ A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGRCTAVLDAGDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326
>gi|33594317|ref|NP_881961.1| hypothetical protein BP3435 [Bordetella pertussis Tohama I]
gi|384205614|ref|YP_005591353.1| hypothetical protein BPTD_3387 [Bordetella pertussis CS]
gi|408414206|ref|YP_006624913.1| hypothetical protein BN118_0137 [Bordetella pertussis 18323]
gi|33564392|emb|CAE43698.1| conserved hypothetical protein [Bordetella pertussis Tohama I]
gi|332383728|gb|AEE68575.1| hypothetical protein BPTD_3387 [Bordetella pertussis CS]
gi|401776376|emb|CCJ61562.1| conserved hypothetical protein [Bordetella pertussis 18323]
Length = 364
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 18 ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 77
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 78 LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 129
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 130 -------------RILAGEGRG-----------------ATGLVLQGAD-GQPLTLTPRQ 158
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 159 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 209
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 210 ARILDMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 253
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 254 AALYWQVITRMHPMT--EMDSAHHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 311
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 312 AAGLAALQAWGTSWDKRIEWSG 333
>gi|33595102|ref|NP_882745.1| hypothetical protein BPP0392 [Bordetella parapertussis 12822]
gi|33565179|emb|CAE35976.1| conserved hypothetical protein [Bordetella parapertussis]
Length = 364
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 18 ARALLADHYAAIDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 77
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 78 LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 129
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 130 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 158
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 159 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 209
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 210 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 253
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 254 AALYWQVITRMHPMT--EMDSAHHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 311
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 312 AAGLAALQAWGTSWDKRIEWSG 333
>gi|299529997|ref|ZP_07043424.1| hypothetical protein CTS44_04439 [Comamonas testosteroni S44]
gi|298721977|gb|EFI62907.1| hypothetical protein CTS44_04439 [Comamonas testosteroni S44]
Length = 365
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/398 (28%), Positives = 183/398 (45%), Gaps = 76/398 (19%)
Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
N E L A+ L E + D +L L+ V+ G G+ WHK G+F HL + R L
Sbjct: 6 NQLDEALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 65
Query: 70 IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
+W+ P V L GL HS Y N++V+L FD R+ ++ +G+ AE L++LFC + R
Sbjct: 66 LWQQPRDVRLLGLLHSVYGNAFVDLVKFDAGNEREHLKRLIGEQAEHLVYLFCTMSRT-- 123
Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
+ V+ L A D++ ADG +++ +
Sbjct: 124 ------------QFVQKLLAGDIA-------------------------ADG-SMQLERN 145
Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWP 242
G E + ++ VA F ++++AD +Q F +Q+ +++ D + A + ++LWP
Sbjct: 146 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPAVDHSRQQAVHWAASLWP 205
Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDE-DIELPVPPV 301
G +P SRM + S G+G + +LP+PPV
Sbjct: 206 GPMRPS------SRMLSQIS--------------------GLGQALQHPALKTQLPLPPV 239
Query: 302 FENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLG 361
F NC+++L A + A LYW + D L+ A L + + NP+VGEP +VL
Sbjct: 240 FANCSQLLSAGNEAAAVALYWSVI--QLDQPLVDLDAATATLEQAVALNPWVGEPQMVLA 297
Query: 362 QIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
Q+YL+ G +A R A L WG+ WDKR+ W+
Sbjct: 298 QLYLSAGRNTDAARAADSALQCFCAWGNAWDKRVQWDA 335
>gi|121607853|ref|YP_995660.1| hypothetical protein Veis_0863 [Verminephrobacter eiseniae EF01-2]
gi|121552493|gb|ABM56642.1| conserved hypothetical protein [Verminephrobacter eiseniae EF01-2]
Length = 363
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 189/391 (48%), Gaps = 71/391 (18%)
Query: 13 LET-LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIW 71
LET L A A+ L E A D +L ++ + + G+ WHK G+F HLV + R L +W
Sbjct: 8 LETDLFARAQWLLDPEWLARDPDLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLSLW 67
Query: 72 KAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
+ P V L GL HS Y N++V+L FDP R +R VG++AE L++LFC R
Sbjct: 68 QQPRDVRLLGLLHSVYGNAFVDLVKFDPARERAQMRELVGESAEHLVYLFCTRSRT---- 123
Query: 132 DDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGE 191
++ + L + L+A D SL+ E+G DG +H+ T
Sbjct: 124 -----QFVQRVLAQALQA-DGSLV--LEQG------------------DG---RHLLTPY 154
Query: 192 DVGVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPGDGKPG 248
+V V F+++T+AD +Q F +QD + F + + + A + ++LWPG +P
Sbjct: 155 EVAV-------FIIVTMADAIEQWFSWQDDVYSGFPRLEPQRQQAVHWAASLWPGPMRPT 207
Query: 249 LWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRI 308
RM +Q+ A A+ K+ LP+PPVF +C++
Sbjct: 208 ------GRM--------------LQQISALAQALQHPGLKQL-----LPLPPVFAHCSQR 242
Query: 309 LDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKG 368
L A ++ A LYW + + + A +L + NP+VGEP +VL Q+YLA G
Sbjct: 243 LSASDEAAATALYWSVIA--QEHPLLDPDVATNMLESAVRHNPWVGEPQMVLAQLYLAAG 300
Query: 369 EFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+EA+ A L L WG+ WDKR+ W+
Sbjct: 301 RKDEAKLAAASALQLFSAWGNSWDKRVQWDA 331
>gi|412340312|ref|YP_006969067.1| hypothetical protein BN112_3019 [Bordetella bronchiseptica 253]
gi|408770146|emb|CCJ54936.1| conserved hypothetical protein [Bordetella bronchiseptica 253]
Length = 357
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 182/382 (47%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L A+D +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V L
Sbjct: 11 ARALLADHYAAVDADLPALLELLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRL 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R ++G+ AE+ +HLFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERATLRQYLGEEAEQWVHLFCAMPRTQFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
++ + +G A GL ++ G+ + ++ R
Sbjct: 123 -------------RILAGEGRG-----------------ATGLALQGAD-GQPLTLTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + + ++LWPG KP
Sbjct: 152 VAAFIVVSAADVGEQWHSWQDEIFAGYPHQERRDTSTHWAASLWPGPLKPP--------- 202
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
I +++R +Q T A G+ P PP F +CT +LDA ++ A
Sbjct: 203 ARILGMLSR----LLQPLSTLPAGTGI------------PTPPAFGHCTAVLDAGDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL I NP+V EP ++L Q+ L +++ A +A
Sbjct: 247 AALYWQVITRMHPMT--EMDSARHLLQAAIAHNPWVAEPRLLLAQLALTAQDYDTALEQA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTSWDKRIEWSG 326
>gi|293602157|ref|ZP_06684608.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
43553]
gi|292819427|gb|EFF78457.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
43553]
Length = 357
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 179/382 (46%), Gaps = 68/382 (17%)
Query: 20 ARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCL 79
AR L AID +LP+L+ +L + AGE WHK G+F HL+ +YR L +W P V +
Sbjct: 11 ARALLADHYAAIDADLPALLDMLFARSAGEDWHKAGTFKHHLLGVYRTLALWNQPREVRM 70
Query: 80 CGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYS 139
GLFHS Y N YV+L +FD R +R +G+ AE + LFC +PR +
Sbjct: 71 LGLFHSVYGNEYVDLTLFDRERERSTLREVLGEEAEAWVSLFCAMPRTKFVQ-------- 122
Query: 140 DQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRV 199
+++ + G DG+ ++ G ++ R
Sbjct: 123 -------------AVLAGQGTG-----------------HDGMRLEGAD-GTVYALTPRQ 151
Query: 200 VATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRM 257
VA F++++ AD +Q +QD +F + + + ++LWPG KP N LS +
Sbjct: 152 VAAFIVVSAADIGEQWHSWQDEIFAGYPQQQRRDLNTHWAASLWPGPLKPP--SNILSML 209
Query: 258 GAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEA 317
+ + ++ + D +P+PP F+ C L A+++ A
Sbjct: 210 SHLLAPLS-----------------------QMPGDTGIPIPPAFDACRATLSARDEAAA 246
Query: 318 RDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREA 377
LYW+ + + +++ A LL + NP+VGEP ++L Q+ L G+F+EAE+ A
Sbjct: 247 SALYWQVITRMHPLT--EMDSARHLLEAAVAHNPWVGEPRLLLAQLALTTGDFDEAEQHA 304
Query: 378 GKGLMLLLEWGSPWDKRMSWEG 399
GL L WG+ WDKR+ W G
Sbjct: 305 AAGLAALQAWGTAWDKRIEWAG 326
>gi|351730024|ref|ZP_08947715.1| hypothetical protein AradN_09609 [Acidovorax radicis N35]
Length = 362
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 182/390 (46%), Gaps = 77/390 (19%)
Query: 16 LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
L A A+ L E A D +L ++ + + G+ WHK G+F HLV + R L +W+ P
Sbjct: 12 LFARAQTLLDEEWLARDPDLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLTMWQQPR 71
Query: 76 CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
V L GL HS Y N++V+L FDP R VR VG++AE L+++FC R +
Sbjct: 72 DVRLLGLLHSVYGNAFVDLVKFDPAQERARVRDLVGESAEHLVYVFCTQSRTQFV----- 126
Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
Q+++ H +D SL+ L DG+ H+ T +
Sbjct: 127 -----QKVLAHALEADGSLV---------------------LQKDGM--DHVLTPYE--- 155
Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFN------NSDGKLEFAGNSFSALWPGDGKPGL 249
VATF+++++AD +Q F +QD +F+ + K +A ++LWPG +P
Sbjct: 156 ----VATFIIVSMADTIEQWFSWQDDIFSRFPEVQHRHQKAHWA----ASLWPGPMRPS- 206
Query: 250 WMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRIL 309
RM + +A+ +Q LPVPPVF +CT+ L
Sbjct: 207 -----GRMVHQINGLAKA----LQHPGLKGL---------------LPVPPVFGHCTQHL 242
Query: 310 DAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGE 369
A + A LYW + D L+ A +L + NP+VGEP +VL Q+YL+ G
Sbjct: 243 SAANEAAAASLYWSVI--QQDQPLVDLDVATGVLESAVRHNPWVGEPQMVLAQLYLSAGR 300
Query: 370 FEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
++A+ A L L WG+ WDKR+ W+
Sbjct: 301 QDDAKAAAESALHLFSAWGNSWDKRVQWDA 330
>gi|359800019|ref|ZP_09302571.1| hypothetical protein KYC_23713 [Achromobacter arsenitoxydans SY8]
gi|359362131|gb|EHK63876.1| hypothetical protein KYC_23713 [Achromobacter arsenitoxydans SY8]
Length = 382
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 167/355 (47%), Gaps = 68/355 (19%)
Query: 47 AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
AGE WHK G+F HL+ +YR L +W P V L GLFHS Y N YV+L +FD R+ +
Sbjct: 63 AGEDWHKAGTFKHHLLGVYRTLALWNQPREVRLLGLFHSVYGNEYVDLTLFDRERERNTL 122
Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
R +G+ AE+ + LFC +PR + +++ + G
Sbjct: 123 REALGEEAEEWVSLFCAMPRTKFVQ---------------------AILQGQGAG----- 156
Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
ADGLT++ G+ + R VA F +++ AD +Q +QD +F
Sbjct: 157 ------------ADGLTLEGAD-GQTYTFTPRQVAAFTVVSAADIGEQWHSWQDEIFAGY 203
Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
+ + N S+LWPG KP N LS + + + ++R
Sbjct: 204 PQQQRRDLKTNWASSLWPGPLKPP--SNILSMLSHLLAPLSR------------------ 243
Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
+D +P+PP F+ C L +++ A LYW+ + ++ ++ A +L
Sbjct: 244 -----MPQDTGIPIPPAFDACRATLSPRDEAAASALYWQVITRMHPLTE--MDSARHMLE 296
Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+E NP+VGEP ++L Q+ L G+F+ AER A GL L WG+ WDKR+ W G
Sbjct: 297 AAVEHNPWVGEPRLLLAQLALTAGDFDAAERHAAAGLAALQAWGTAWDKRIEWAG 351
>gi|121594684|ref|YP_986580.1| hypothetical protein Ajs_2343 [Acidovorax sp. JS42]
gi|120606764|gb|ABM42504.1| conserved hypothetical protein [Acidovorax sp. JS42]
Length = 368
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 184/394 (46%), Gaps = 76/394 (19%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
+ L A+ L E A D +L ++ ++ + G G+ WHK G+F HLV + R L +W
Sbjct: 12 DALFQRAQSLLDDEWLAHDADLAPVLPMVLARGVGQDWHKAGTFRHHLVGVARSLALWGQ 71
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P V L GL HS Y N++V+L FD + R ++ VG++AE L++LFC + R +
Sbjct: 72 PRDVRLLGLLHSVYGNAFVDLVKFDAASERGRLQALVGESAEHLVYLFCTMSRAQFVQKV 131
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
L E +D SL+ K PA TV H+ E
Sbjct: 132 LAGELQ----------ADGSLVVEKNG-----------------PAPRQTV-HLAPYE-- 161
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWPGDGKPGLW 250
VA F ++++AD +Q F +QD +++ D A + ++LWPG +P
Sbjct: 162 ------VAAFTIVSMADTIEQWFSWQDDIYSRFPAVDPSRPQAVHWAASLWPGPMRP--- 212
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
+RM + AA+G+ + +LP+PPVF++C++ L
Sbjct: 213 ---TARM------------------VSQIAALGLALQHPGLRG-QLPLPPVFDHCSQGLT 250
Query: 311 AKEQIEARDLYWEAV-----CGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYL 365
A+++ A LYW + + DG+ LE+A L NP+VGEP +VL Q+YL
Sbjct: 251 AQDEAAATALYWSVIQLAQPLVDLDGATATLEQAVRL-------NPWVGEPQMVLAQLYL 303
Query: 366 AKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+ G ++A A L WG+ WDKR+ W+
Sbjct: 304 SAGRAQDAACAARGALQAFSAWGNAWDKRVQWDA 337
>gi|241764610|ref|ZP_04762625.1| conserved hypothetical protein [Acidovorax delafieldii 2AN]
gi|241365931|gb|EER60561.1| conserved hypothetical protein [Acidovorax delafieldii 2AN]
Length = 361
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 177/388 (45%), Gaps = 69/388 (17%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
+TL A+ L E D +L ++ + + G+ WHK G+F HLV + R L +W+
Sbjct: 10 QTLFIRAQALLDDEWITHDPDLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLTLWQQ 69
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P V L GL HS Y NS+V+L FDP + R +R VG AE L++LFC R +
Sbjct: 70 PRDVRLLGLLHSVYGNSFVDLVKFDPASERARLRELVGAPAEHLVYLFCTQSRTQFV--- 126
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
Q ++ A+D SL L DG +H+ T ++V
Sbjct: 127 -------QRVLGGGLAADGSLA---------------------LDKDG--QRHVLTPDEV 156
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNN-SDGKLEFAGNSFSA-LWPGDGKPGLWM 251
A F+++++AD +Q F +QD +++ + G ++A LWPG +P
Sbjct: 157 -------AAFIVVSMADTIEQWFSWQDDIYSRFPHVQHRPQGVHWAASLWPGPMRP---- 205
Query: 252 NSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
G + IA +G + + + LPVPPVF NCT+ L
Sbjct: 206 -----TGRMLHQIA---------------GLGQALQHSGLKGL-LPVPPVFANCTQHLSV 244
Query: 312 KEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFE 371
+ A LYW + D L+ A +L + + NP+VGEP +VL Q+YL+ G +
Sbjct: 245 AAEAAATSLYWSVI--QQDQPLVDLDVATAVLEQAVRHNPWVGEPQMVLAQLYLSAGRGD 302
Query: 372 EAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+A A L WG+ WDKR+ W+
Sbjct: 303 DARHAAQSALQCFSAWGNSWDKRVQWDA 330
>gi|222110711|ref|YP_002552975.1| hypothetical protein Dtpsy_1514 [Acidovorax ebreus TPSY]
gi|221730155|gb|ACM32975.1| conserved hypothetical protein [Acidovorax ebreus TPSY]
Length = 368
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 183/394 (46%), Gaps = 76/394 (19%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
+ L A+ L E A D +L ++ ++ + G G+ WHK G+F HLV + R L +W
Sbjct: 12 DALFQRAQSLLDDEWLAHDADLAPVLPMVLARGVGQDWHKAGTFRHHLVGVARSLALWGQ 71
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P V L GL HS Y N++V+L FD + R ++ VG++AE L++LFC + R +
Sbjct: 72 PRDVRLLGLLHSVYGNAFVDLVKFDAASERGRLQALVGESAEHLVYLFCTMSRAQFVQKV 131
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
L E +D SL+ K PA TV H+ E
Sbjct: 132 LAGELQ----------ADGSLVVEKNG-----------------PAPRQTV-HLAPYE-- 161
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWPGDGKPGLW 250
VA F ++++AD +Q F +QD +++ D A + ++LWPG +P
Sbjct: 162 ------VAAFTIVSMADTIEQWFSWQDDIYSRFPAVDPSRPQAVHWAASLWPGPMRP--- 212
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
+RM + AA+G+ + +LP+PPVF++C++ L
Sbjct: 213 ---TARM------------------VSQIAALGLALQHPGLRG-QLPLPPVFDHCSQGLT 250
Query: 311 AKEQIEARDLYWEAV-----CGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYL 365
A+++ A LYW + + DG+ LE+A L NP+VGEP +VL Q+YL
Sbjct: 251 AQDEAAATALYWSVIQLAQPLVDLDGATATLEQAVRL-------NPWVGEPQMVLAQLYL 303
Query: 366 AKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+ G +A A L WG+ WDKR+ W+
Sbjct: 304 SAGRAHDAACAARGALQAFSAWGNAWDKRVQWDA 337
>gi|395008001|ref|ZP_10391692.1| hypothetical protein PMI14_04387 [Acidovorax sp. CF316]
gi|394313946|gb|EJE50903.1| hypothetical protein PMI14_04387 [Acidovorax sp. CF316]
Length = 364
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 115/386 (29%), Positives = 177/386 (45%), Gaps = 69/386 (17%)
Query: 16 LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
L+A A P L E A D +L ++ + + G+ WHK G+F HLV + R L +W+ P
Sbjct: 15 LLARALPLLDEEWLARDADLAPVLPTVLARNVGQDWHKAGTFRHHLVGVARSLAMWQQPR 74
Query: 76 CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
V L GL HS Y N++V+L FDP R VR GD+AE L++LFC R +
Sbjct: 75 DVRLLGLLHSVYGNAFVDLVKFDPAKERARVREIAGDSAEHLVYLFCTQSRTQFV----- 129
Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
Q ++ +D SL+ K+ K + L P +
Sbjct: 130 -----QRVLAGAFEADGSLVLEKD-----------GKPHTLTPYE--------------- 158
Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNN--SDGKLEFAGNSFSALWPGDGKPGLWMNS 253
VA F+++++AD +Q F +QD +++ + + A + ++LWPG +P
Sbjct: 159 ----VAAFIIVSMADTIEQWFSWQDDIYSRFPTVQQRPQAVHWAASLWPGPMRPS----- 209
Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
G + S I A+G+ + + + LP+PPVF CT L A +
Sbjct: 210 ----GRMVSQI---------------NALGLALQHPGLKGL-LPMPPVFNGCTAELSATD 249
Query: 314 QIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
+ A LYW + D L+ A +L + NP+VGEP +VL Q+YL+ G +A
Sbjct: 250 EAAATSLYWSVI--QQDQPLADLDVATAVLESAVRHNPWVGEPQMVLAQLYLSAGRNADA 307
Query: 374 EREAGKGLMLLLEWGSPWDKRMSWEG 399
A L WG+ WDKR+ W+
Sbjct: 308 AAAASSALQQFSAWGNSWDKRVQWDA 333
>gi|347821540|ref|ZP_08874974.1| hypothetical protein VeAt4_20777 [Verminephrobacter aporrectodeae
subsp. tuberculatae At4]
Length = 362
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 178/386 (46%), Gaps = 69/386 (17%)
Query: 16 LVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPD 75
L + A+ L E A D +L ++ + + G+ WHK G+F HLV + R L +W+ P
Sbjct: 12 LFSRAQSLLDEEWLARDPDLAPMLPTVLARNVGQDWHKAGTFRHHLVGVARSLSVWQQPR 71
Query: 76 CVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLL 135
V L GL HS Y N++V+L FDP R +RG +G++AE L++LFC R +
Sbjct: 72 DVRLLGLLHSVYGNAFVDLVKFDPAKERARMRGLMGESAEHLVYLFCTQSRTQFV----- 126
Query: 136 FEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGV 195
Q ++ H +D SL+ L DG +H+ T +V V
Sbjct: 127 -----QRVLAHALEADGSLV---------------------LEKDG--QRHVLTPYEVAV 158
Query: 196 SRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSF--SALWPGDGKPGLWMNS 253
F+++++AD +Q +QD +++ + + ++LWPG +P
Sbjct: 159 -------FIIVSMADTIEQWSSWQDDVYSRFPDVQQRPQTAHWAASLWPGPMRP------ 205
Query: 254 LSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKE 313
G + I +A+G + +D+ LP+PPVF +C++ L A +
Sbjct: 206 ---TGRMLHQI---------------SALGQALQHPALKDL-LPIPPVFAHCSQNLSASD 246
Query: 314 QIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
+ A LYW V D L+ A +L + NP+VGE + L Q+YL G +EA
Sbjct: 247 EAAASALYWAVV--QQDQPLVDLDVATGVLENAVRHNPWVGESQMTLAQLYLTAGRRDEA 304
Query: 374 EREAGKGLMLLLEWGSPWDKRMSWEG 399
A L L WG+ WDKR+ WE
Sbjct: 305 RAAASSALQLFCVWGNSWDKRVQWEA 330
>gi|418530048|ref|ZP_13095974.1| hypothetical protein CTATCC11996_10158 [Comamonas testosteroni ATCC
11996]
gi|371452601|gb|EHN65627.1| hypothetical protein CTATCC11996_10158 [Comamonas testosteroni ATCC
11996]
Length = 367
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 115/398 (28%), Positives = 181/398 (45%), Gaps = 76/398 (19%)
Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
N E L A+ L E + D +L L+ V+ G G+ WHK G+F HL + R L
Sbjct: 6 NQLDEALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 65
Query: 70 IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
+W+ P V L GL HS Y N++V+L FD R+ ++ +G+ AE L++LFC + R
Sbjct: 66 LWQQPRDVRLLGLLHSVYGNAFVDLVKFDAGNEREQLKRLIGEQAEHLVYLFCTMSR--- 122
Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
+ V+ L A D++ ADG +++ +
Sbjct: 123 -----------TQFVQKLLAGDIA-------------------------ADG-SMQLERN 145
Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWP 242
G E + ++ VA F ++++AD +Q F +Q+ +++ D + A + ++LWP
Sbjct: 146 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPAVDHSRQQAVHWAASLWP 205
Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDE-DIELPVPPV 301
G +P SRM + S G+G + +LP+PPV
Sbjct: 206 GPMRPS------SRMLSQIS--------------------GLGQALQHPALKTQLPLPPV 239
Query: 302 FENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLG 361
F NC+++L A + A LYW + D L+ A L + + NP+VGEP +VL
Sbjct: 240 FANCSQLLSAGNEAAAVALYWSVI--QLDQPLVDLDAATATLEQAVSLNPWVGEPQMVLA 297
Query: 362 QIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
Q+YL G EA A L WG+ WDKR+ W+
Sbjct: 298 QLYLTAGRNAEAAAAACGALQCFSSWGNAWDKRVQWDA 335
>gi|333915155|ref|YP_004488887.1| hypothetical protein DelCs14_3540 [Delftia sp. Cs1-4]
gi|333745355|gb|AEF90532.1| hypothetical protein DelCs14_3540 [Delftia sp. Cs1-4]
Length = 366
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 178/389 (45%), Gaps = 70/389 (17%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
E L A+ L E A D +L ++ V+ + G G+ WHK G+F HL + R L +W+
Sbjct: 14 EALFTRAQDLLDEEWLAKDADLAPVLPVVLARGVGQDWHKAGTFRHHLAGVARSLALWRQ 73
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P V L GL HS Y N++V+L FD + R +R VG+ AE L++LFC + R
Sbjct: 74 PRDVRLLGLLHSVYGNAFVDLVKFDAGSERGRLRELVGEGAEHLVYLFCTMSRT------ 127
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
+ ++ + A D++ E G E A G
Sbjct: 128 --------QFMQKILAGDLA-----EDGSMALESA---------------------GRSQ 153
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPGDGKPGLW 250
+S VA F ++++AD +Q F +Q+ + F + + + ++LWPG +P
Sbjct: 154 RLSAYEVAAFTIVSMADAMEQWFSWQEDIYSQFPQVQNQRQQGVHWAASLWPGPMRPA-- 211
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
SRM + S + R +Q A +LP+PPVF++C+R L
Sbjct: 212 ----SRMLSQISDLGRA----LQHPALRA---------------QLPLPPVFDHCSRQLP 248
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
++ A LYW + D L+ A +L + NP+VGEP +VL Q+YL+ G
Sbjct: 249 PGDEAAAVSLYWSVI--QLDQPLVDLDAATAVLESAVRLNPWVGEPQMVLAQLYLSAGRA 306
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
++A R A L WG+ WDKR+ W+
Sbjct: 307 DDAARAATSALQCFSGWGNAWDKRVQWDA 335
>gi|160898617|ref|YP_001564199.1| hypothetical protein Daci_3176 [Delftia acidovorans SPH-1]
gi|160364201|gb|ABX35814.1| conserved hypothetical protein [Delftia acidovorans SPH-1]
Length = 366
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 177/389 (45%), Gaps = 70/389 (17%)
Query: 14 ETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKA 73
E L A+ L E A D +L ++ V+ + G G+ WHK G+F HL + R L +W+
Sbjct: 14 EALFTRAQDLLDEEWLAKDADLAPVLPVVLARGVGQDWHKAGTFRHHLAGVARSLALWRQ 73
Query: 74 PDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDD 133
P V L GL HS Y N++V+L FD + R +R VG+ AE L++LFC + R
Sbjct: 74 PRDVRLLGLLHSVYGNAFVDLVKFDAGSERGRLRELVGEGAEHLVYLFCTMSRT------ 127
Query: 134 LLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDV 193
+ ++ + A D++ E G E A G
Sbjct: 128 --------QFMQKILAGDLA-----EDGSMALESA---------------------GRSQ 153
Query: 194 GVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPGDGKPGLW 250
+S VA F ++++AD +Q F +Q+ + F + + + ++LWPG +P
Sbjct: 154 RLSAYEVAAFTIVSMADAMEQWFSWQEDIYSQFPQVQNQRQQGVHWAASLWPGPMRPA-- 211
Query: 251 MNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILD 310
SRM + S + R +Q A +LP+PPVF++C+R L
Sbjct: 212 ----SRMLSQISDLGRS----LQHPALRA---------------QLPLPPVFDHCSRQLS 248
Query: 311 AKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
++ A LYW + D L+ A +L + NP+VGEP +VL Q+YL+ G
Sbjct: 249 PGDEAAAVSLYWSVI--QLDQPLVDLDAATAVLESAVRLNPWVGEPQMVLAQLYLSAGRA 306
Query: 371 EEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+ A R A L WG+ WDKR+ W+
Sbjct: 307 DGAARAAASALQCFSGWGNAWDKRVQWDA 335
>gi|221066382|ref|ZP_03542487.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
gi|220711405|gb|EED66773.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
Length = 367
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 182/397 (45%), Gaps = 74/397 (18%)
Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
N E L A+ L E + D +L L+ V+ G G+ WHK G+F HL + R L
Sbjct: 6 NQLDEALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 65
Query: 70 IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
+W+ P V GL HS Y N++V+L FD R+ ++ VG+ AE L++LFC + R
Sbjct: 66 LWQQPRQVRQLGLLHSVYGNAFVDLVKFDAGNEREQLKSLVGEQAEHLVYLFCTMSRT-- 123
Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
+ V+ L A + L ADG +++ +
Sbjct: 124 ------------QFVQKLLAGE-------------------------LGADG-SLQIERN 145
Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN---SDGKLEFAGNSFSALWP 242
G E + ++ VA F ++++AD +Q F +Q+ +++ D + A + ++LWP
Sbjct: 146 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPAVDHSRQQAVHWAASLWP 205
Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVF 302
G +P SRM + S + R A+ K R LP+PPVF
Sbjct: 206 GPMRPS------SRMLSQISGLGR--------------ALQHPALKTR-----LPLPPVF 240
Query: 303 ENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQ 362
NC+++L A + A LYW + D L+ A L + + NP+VGEP +VL Q
Sbjct: 241 ANCSQLLSAGNEAAAVALYWSVI--QLDQPLVDLDAATATLEQAVALNPWVGEPQMVLAQ 298
Query: 363 IYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+YL+ G +A R A L WG+ WDKR+ W+
Sbjct: 299 LYLSAGRSADAARAAESALQCFCSWGNAWDKRVQWDA 335
>gi|264679041|ref|YP_003278948.1| hypothetical protein CtCNB1_2906 [Comamonas testosteroni CNB-2]
gi|262209554|gb|ACY33652.1| conserved hypothetical protein [Comamonas testosteroni CNB-2]
Length = 376
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 116/398 (29%), Positives = 180/398 (45%), Gaps = 76/398 (19%)
Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILK 69
N L A+ L E + D +L L+ V+ G G+ WHK G+F HL + R L
Sbjct: 15 NQLDAALFERAKTLLDEEWLSRDADLAPLLPVVLERGVGQDWHKAGTFRHHLAGVARSLA 74
Query: 70 IWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
+W+ P V GL HS Y N++V+L FD RD ++ VG+ AE L++LFC + R
Sbjct: 75 LWQQPREVRQLGLLHSVYGNAFVDLVKFDAGNERDQLKRLVGEQAEHLVYLFCTMSRT-- 132
Query: 130 IHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKT 189
+ V+ L A + L ADG +++ +
Sbjct: 133 ------------QFVQKLLAGE-------------------------LGADG-SLQIERN 154
Query: 190 G----EDVGVSRRVVATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWP 242
G E + ++ VA F ++++AD +Q F +Q+ + F + D + A + ++LWP
Sbjct: 155 GPEPRETIRLTAYEVAVFAIVSMADSMEQWFSWQEDIYSRFPSVDHSRQQAVHWAASLWP 214
Query: 243 GDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDE-DIELPVPPV 301
G +P SRM + S G+G + +LP+PPV
Sbjct: 215 GPMRPS------SRMLSQIS--------------------GLGQALQHPALKTQLPLPPV 248
Query: 302 FENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLG 361
F NC+++L A + A LYW + N L+ A L + + NP+VGEP +VL
Sbjct: 249 FANCSQLLSAGNEAAAVALYWSVIQLNQ--PLVDLDAATATLEQAVTLNPWVGEPQMVLA 306
Query: 362 QIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
Q+YL+ G +A R A L WG+ WDKR+ W+
Sbjct: 307 QLYLSAGRSADAARAAESALQCFCTWGNAWDKRVQWDA 344
>gi|422323799|ref|ZP_16404838.1| hypothetical protein HMPREF0005_01203 [Achromobacter xylosoxidans
C54]
gi|317401197|gb|EFV81843.1| hypothetical protein HMPREF0005_01203 [Achromobacter xylosoxidans
C54]
Length = 357
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 164/355 (46%), Gaps = 68/355 (19%)
Query: 47 AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
AGE WHK G+F HL+ +YR L +W P V L GLFHS Y N YV+L +FD R+ +
Sbjct: 38 AGEDWHKAGTFKHHLLGVYRTLALWDQPREVRLLGLFHSVYGNEYVDLTLFDRERERNTL 97
Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
R +G AE+ + LFC +PR + +++ + +G
Sbjct: 98 REALGIEAEEWVSLFCAMPRTRFVQ---------------------AILQGQGQG----- 131
Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
+GLT++ G+ + R VA F++++ AD +Q +QD +F
Sbjct: 132 ------------PEGLTLESAD-GQVFTFTPRQVAAFIVVSAADIGEQWHSWQDEIFAGY 178
Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
+ + + ++LWPG KP N LS + + + + SA
Sbjct: 179 PQQQRRDLKSHWAASLWPGPLKPP--SNILSMLSHLLAPL-------------SALPADT 223
Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
G+ PVPP F++C L +++ A LYW+ V + +++ A +L
Sbjct: 224 GI----------PVPPAFDHCRATLSPRDEAAASALYWQVVTRMH--PQTEMDTARHMLE 271
Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+ NP+V EP ++L Q+ L G+FE AE A GL L WG+ WDKR+ W G
Sbjct: 272 AAVAHNPWVAEPRLLLAQLALTAGDFEAAEGHAAAGLAALQAWGTAWDKRIEWAG 326
>gi|423014406|ref|ZP_17005127.1| hypothetical protein AXXA_08139 [Achromobacter xylosoxidans AXX-A]
gi|338782587|gb|EGP46959.1| hypothetical protein AXXA_08139 [Achromobacter xylosoxidans AXX-A]
Length = 357
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 164/355 (46%), Gaps = 68/355 (19%)
Query: 47 AGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVV 106
AGE WHK G+F HL+ +YR L +W P V L GLFHS Y N YV+L +FD R+ +
Sbjct: 38 AGEDWHKAGTFKHHLLGVYRTLTLWNQPREVRLLGLFHSVYGNEYVDLTLFDRERERNTL 97
Query: 107 RGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAE 166
R +G AE+ + LFC +PR + +++ + +G
Sbjct: 98 REVLGAEAEEWVSLFCAMPRTRFVQ---------------------AILQGQGRG----- 131
Query: 167 EAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNN- 225
+GLT++ G+ + R VA FV+++ AD +Q +QD +F
Sbjct: 132 ------------PEGLTLEGAD-GQAFTFTPRQVAAFVVVSAADIGEQWHSWQDEIFAGY 178
Query: 226 -SDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGV 284
+ + + ++LWPG KP N LS + + + + SA
Sbjct: 179 PQQQRRDLKSHWAASLWPGPLKPP--SNILSMLSHLLAPL-------------SALPADT 223
Query: 285 GVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLL 344
G+ PVPP F++C L +++ A LYW+ V + +++ A +L
Sbjct: 224 GI----------PVPPAFDHCRATLSPRDEAAASALYWQVVTRMH--PQTEMDTARHMLE 271
Query: 345 RCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
+ NP+VGEP ++L Q+ L G+FE A A GL L WG+ WDKR+ W G
Sbjct: 272 AAVAHNPWVGEPRLLLAQLALTAGDFEAAAGHAAAGLAALQAWGTAWDKRIEWAG 326
>gi|297822077|ref|XP_002878921.1| hypothetical protein ARALYDRAFT_344261 [Arabidopsis lyrata subsp.
lyrata]
gi|297324760|gb|EFH55180.1| hypothetical protein ARALYDRAFT_344261 [Arabidopsis lyrata subsp.
lyrata]
Length = 378
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 60/94 (63%), Positives = 75/94 (79%), Gaps = 1/94 (1%)
Query: 173 INELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEF 232
IN L+P +G+ VKHIKTGE++ VSRRVV F+++T+A+FSDQ FG QD FN +LEF
Sbjct: 122 INGLVPDNGVVVKHIKTGEEIVVSRRVVGVFLLMTMAEFSDQFFGLQDEFFNYG-WRLEF 180
Query: 233 AGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAR 266
GN+ +ALWP +GKPGLW+NS SRMGAIYSLI R
Sbjct: 181 RGNNVTALWPRNGKPGLWLNSNSRMGAIYSLIVR 214
>gi|297837231|ref|XP_002886497.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332338|gb|EFH62756.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1173
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 59/94 (62%), Positives = 74/94 (78%), Gaps = 1/94 (1%)
Query: 173 INELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEF 232
IN L+P +G+ VKHIKTGE++ VSRRVV F+++T+A+FS Q G QD FN +LEF
Sbjct: 118 INGLVPDNGVVVKHIKTGEEIVVSRRVVGVFLLMTMAEFSGQFCGLQDEFFNYG-WRLEF 176
Query: 233 AGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAR 266
GN+ +ALWPG+GKPGLW+NS SRMGAIYSLI R
Sbjct: 177 RGNNVTALWPGNGKPGLWLNSNSRMGAIYSLIVR 210
>gi|444916194|ref|ZP_21236314.1| hypothetical protein D187_08782 [Cystobacter fuscus DSM 2262]
gi|444712516|gb|ELW53438.1| hypothetical protein D187_08782 [Cystobacter fuscus DSM 2262]
Length = 568
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 154/364 (42%), Gaps = 103/364 (28%)
Query: 32 DKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSY 91
D +P L + L G+ H G+F +HL+ RIL+ W+ +C G+FHSAYS +
Sbjct: 294 DARVPMLASALAERGSDRFQHSVGTFRDHLLGTGRILRGWRQRSEICRAGMFHSAYSTQF 353
Query: 92 VNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASD 151
+F P R+ VR +G+ AE L+ LFC V R L
Sbjct: 354 FREHLFAPWE-RETVRELIGETAEALVFLFCTVDRFRL---------------------- 390
Query: 152 VSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADF 211
AE A R ++ GL + ++ T + S RVVA +++ +A+
Sbjct: 391 ------------TAEVAKRHTLD-----GGLILHNLHTEQQQRFSPRVVAALLVVEMANL 433
Query: 212 SDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIF 271
++Q+ ++D PGLWM+ +S G + ++
Sbjct: 434 AEQV-AYRDR------------------------APGLWMSKVSHFGRVI-----RSHLY 463
Query: 272 IQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDG 331
VPPVF++C+ +L +E+ A Y +A G + G
Sbjct: 464 A-------------------------VPPVFDHCSAVLTPEEEKAALATYAQAHTG-APG 497
Query: 332 SKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPW 391
++ L + L NP+VGEPH+ L ++ A+GE +AE A +G LL WG+PW
Sbjct: 498 EEKHLRETVAL-------NPWVGEPHIRLVRLCAARGEHLKAEGHARRGAALLTAWGTPW 550
Query: 392 DKRM 395
DK +
Sbjct: 551 DKSL 554
>gi|389875796|ref|YP_006373531.1| hypothetical protein TMO_a0544 [Tistrella mobilis KA081020-065]
gi|388530751|gb|AFK55947.1| hypothetical protein TMO_a0544 [Tistrella mobilis KA081020-065]
Length = 173
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 66/103 (64%)
Query: 29 EAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYS 88
E +D LP+L+ +L S AGE WHK G+F HL+D+YR L +W P V + GLFHS YS
Sbjct: 31 EKVDAELPALLDLLFSRSAGEAWHKAGTFKNHLLDVYRTLALWDQPRDVRVLGLFHSVYS 90
Query: 89 NSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIH 131
N +V L +FD + R + VG+ AE+LIH FC++PR +
Sbjct: 91 NEFVALKLFDASAKRAELAAAVGEEAERLIHTFCVMPRTRFVQ 133
>gi|323453539|gb|EGB09410.1| hypothetical protein AURANDRAFT_71288 [Aureococcus anophagefferens]
Length = 2421
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 155/384 (40%), Gaps = 74/384 (19%)
Query: 19 SARPFLRGELEAIDKNLPSLIAVLRSVGAGECW----HKHGSFLEHLVDIYRILKIWKAP 74
S R F+ E A+D+ + VLR +GA + H +F HL Y +L W P
Sbjct: 2055 SVRAFVYREWSALDEGIDD---VLRRMGAMSAFVEFPHARNTFFNHLKGTYGMLMAWNQP 2111
Query: 75 DCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDL 134
V GL H+AYS IFDP R +R VG AE LI+ F V R +
Sbjct: 2112 VAVARAGLAHTAYSGDLFEFFIFDPTVDRAEMRALVGGPAEALIYEFGTVARHEFVGLGH 2171
Query: 135 LFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVG 194
+ L L G + A R + V+ KT
Sbjct: 2172 MMNNRSHRLARPLV------------GGLDDRVATRSR-----------VEGAKT----- 2203
Query: 195 VSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSL 254
+S R VA ++++IAD+ DQ + ++ L+ + ++PG +P +
Sbjct: 2204 LSNREVAELIVVSIADYLDQGVTTNGWRDHHQIDPLD------TRIFPGTVEP-----DV 2252
Query: 255 SRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDAKEQ 314
S A + A + + +PP+F+ CT + +++
Sbjct: 2253 SLYWAAAACKAVRGHLAV-------------------------IPPIFDRCTADIAYEDE 2287
Query: 315 IEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAE 374
ARD+YW V D ++ + A ++ IE P+VGE HV+L Q++ G EA
Sbjct: 2288 AAARDMYWRVV---QDSNRISGDAAITMIAAAIELLPYVGEMHVLLAQLHFRAGRHGEAR 2344
Query: 375 REAGKGLMLLLEWGSPWDKRMSWE 398
A L + +G WDKR+++E
Sbjct: 2345 DHAAVALDRMYAFGGTWDKRLTYE 2368
>gi|389875795|ref|YP_006373530.1| hypothetical protein TMO_a0543 [Tistrella mobilis KA081020-065]
gi|388530750|gb|AFK55946.1| hypothetical protein TMO_a0543 [Tistrella mobilis KA081020-065]
Length = 187
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 65/112 (58%), Gaps = 2/112 (1%)
Query: 288 KERDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCI 347
++ D+ LPVPPVF+ CTRIL +++ A LYW+ V S A LL I
Sbjct: 47 RDLPADLGLPVPPVFDRCTRILTQADEMAAVALYWQVVTRAHAISTPV--AAVSLLKTAI 104
Query: 348 EKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
NP + EPH+VL QI L +G+++ A A GL +L WG+ WDKR++W G
Sbjct: 105 AHNPDIAEPHLVLAQIALTQGDYDTAAAHARIGLDILSAWGTAWDKRIAWSG 156
>gi|255611683|ref|XP_002539332.1| conserved hypothetical protein [Ricinus communis]
gi|223507070|gb|EEF23050.1| conserved hypothetical protein [Ricinus communis]
Length = 237
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 104/216 (48%), Gaps = 34/216 (15%)
Query: 191 EDVGVSRRV----VATFVMLTIADFSDQLFGFQDFL---FNNSDGKLEFAGNSFSALWPG 243
E G S+R+ VA F ++++AD +Q F +Q+ + F + + + ++LWPG
Sbjct: 18 ESAGRSQRLSAYEVAAFTIVSMADAMEQWFSWQEDIYSQFPQVQNQRQQGVHWAASLWPG 77
Query: 244 DGKPGLWMNSLSRMGAIYSLIAREEEIFIQERKTSAAAVGVGVDKERDEDIELPVPPVFE 303
+P SRM + S + R +Q A +LP+PPVF+
Sbjct: 78 PMRPA------SRMLSQISDLGRA----LQHPALRA---------------QLPLPPVFD 112
Query: 304 NCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQI 363
+C+R L ++ A LYW + D L+ A +L + NP+VGEP +VL Q+
Sbjct: 113 HCSRQLSPGDEAAAVSLYWSVI--QLDQPLVDLDAATAVLESAVRLNPWVGEPQMVLAQL 170
Query: 364 YLAKGEFEEAEREAGKGLMLLLEWGSPWDKRMSWEG 399
YL+ G ++A R A L WG+ WDKR+ W+
Sbjct: 171 YLSAGRADDAARAATSALQCFSGWGNAWDKRVQWDA 206
>gi|147833692|emb|CAN64331.1| hypothetical protein VITISV_014667 [Vitis vinifera]
Length = 846
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/78 (53%), Positives = 50/78 (64%), Gaps = 18/78 (23%)
Query: 182 LTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFGFQDFLFNNSDGKLEFAGNSFSALW 241
+ VKHIK GEDV V FGFQD LF+NSD LEF+GN++ ALW
Sbjct: 342 VIVKHIKIGEDVAV------------------LAFGFQDALFDNSDRCLEFSGNNYGALW 383
Query: 242 PGDGKPGLWMNSLSRMGA 259
PGDGKPGLW+N++SRMG
Sbjct: 384 PGDGKPGLWLNAISRMGT 401
>gi|361068359|gb|AEW08491.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156112|gb|AFG60290.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156114|gb|AFG60291.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156116|gb|AFG60292.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156118|gb|AFG60293.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156120|gb|AFG60294.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156122|gb|AFG60295.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156124|gb|AFG60296.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156126|gb|AFG60297.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156128|gb|AFG60298.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156132|gb|AFG60300.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156134|gb|AFG60301.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156136|gb|AFG60302.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156138|gb|AFG60303.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156140|gb|AFG60304.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156142|gb|AFG60305.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
Length = 76
Score = 73.2 bits (178), Expect = 2e-10, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 2/70 (2%)
Query: 292 EDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNP 351
E+IEL PPVF++C +++ +Q+ ARDLY EA+C + K KAE L ++ E NP
Sbjct: 9 EEIELVTPPVFDHCRAVVNPSDQVTARDLYMEAICMPAGEKNNK--KAEALFVKVCELNP 66
Query: 352 FVGEPHVVLG 361
F+G+PHVVL
Sbjct: 67 FIGDPHVVLA 76
>gi|383156108|gb|AFG60288.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156110|gb|AFG60289.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
gi|383156130|gb|AFG60299.1| Pinus taeda anonymous locus CL113Contig2_01 genomic sequence
Length = 76
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 2/70 (2%)
Query: 292 EDIELPVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNP 351
E+IEL PPVF++C +++ +Q+ ARDLYWEA+ + K KAE L ++ E NP
Sbjct: 9 EEIELVTPPVFDHCRAVVNPSDQVTARDLYWEAISMPAGEKNNK--KAEALFVKVCELNP 66
Query: 352 FVGEPHVVLG 361
F+G+PHVVL
Sbjct: 67 FIGDPHVVLA 76
>gi|255088788|ref|XP_002506316.1| predicted protein [Micromonas sp. RCC299]
gi|226521588|gb|ACO67574.1| predicted protein [Micromonas sp. RCC299]
Length = 629
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 67/134 (50%), Gaps = 10/134 (7%)
Query: 1 MPSQQGNINNDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGECWHK-HGSFLE 59
+P +++ + +E +VAS E + NL + +A L G H +G F
Sbjct: 365 LPFTDPSLHRNPIEAVVASV------AREDLTSNLDAKLAFLNGEGVAGVAHTGNGKFDA 418
Query: 60 HLVDIYRILKIWKAPDCVCLCGLFHSAY-SNSYVNLAIFDPNTGRDVVRGHVGDAAEKLI 118
HLV + +L+ W D V GLFHS Y + + ++ P RD VR +GD AE+L+
Sbjct: 419 HLVGVASVLRAWGCDDSVVDAGLFHSIYGTEGFQGFSL--PYERRDDVRALIGDRAERLV 476
Query: 119 HLFCIVPRQPLIHD 132
H FC+V R + +D
Sbjct: 477 HTFCVVDRLSVDND 490
>gi|297742727|emb|CBI35361.3| unnamed protein product [Vitis vinifera]
Length = 85
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/37 (75%), Positives = 33/37 (89%)
Query: 290 RDEDIELPVPPVFENCTRILDAKEQIEARDLYWEAVC 326
RDE++EL +PPV ENCTR+L A+EQIEA DLYWEAVC
Sbjct: 38 RDEELELVIPPVLENCTRVLGAEEQIEATDLYWEAVC 74
>gi|87118690|ref|ZP_01074589.1| hypothetical protein MED121_16724 [Marinomonas sp. MED121]
gi|86166324|gb|EAQ67590.1| hypothetical protein MED121_16724 [Marinomonas sp. MED121]
Length = 185
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 9/90 (10%)
Query: 41 VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
L ++GAGE H +GS + HL +++LK W A D VC GLFH+AY A FD
Sbjct: 10 ALEALGAGEFDHLNGSLISHLTGTFKMLKTWDAADYVCDAGLFHAAYGT-----AGFDDK 64
Query: 101 ----TGRDVVRGHVGDAAEKLIHLFCIVPR 126
+ R + +G AE L++L+C R
Sbjct: 65 MVSLSQRAKIAEVIGKEAEVLVYLYCSCDR 94
>gi|116625450|ref|YP_827606.1| prolyl 4-hydroxylase subunit alpha [Candidatus Solibacter usitatus
Ellin6076]
gi|116228612|gb|ABJ87321.1| Prolyl 4-hydroxylase, alpha subunit [Candidatus Solibacter usitatus
Ellin6076]
Length = 348
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 45/93 (48%), Gaps = 1/93 (1%)
Query: 34 NLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVN 93
N L LR +GA C HK G+ +HLV + IL+ D VC G HS Y +
Sbjct: 191 NFEKLSLFLRKMGATNCGHKEGTLHDHLVRTFAILEKRGYTDEVCFGGGLHSIYGTNIFQ 250
Query: 94 LAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPR 126
+I P + R ++ G+ AE+L LF + R
Sbjct: 251 QSILTPES-RSLISNEFGNGAEELAFLFSTLER 282
>gi|343497111|ref|ZP_08735191.1| hypothetical protein VINI7043_03738 [Vibrio nigripulchritudo ATCC
27043]
gi|342819941|gb|EGU54773.1| hypothetical protein VINI7043_03738 [Vibrio nigripulchritudo ATCC
27043]
Length = 177
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 44/89 (49%), Gaps = 9/89 (10%)
Query: 42 LRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNT 101
L +GAG+ H +GS EHL ILK W A + + GLFH+AY A FD N
Sbjct: 8 LAELGAGDFQHLNGSLTEHLKGTQNILKQWGASEVLQTAGLFHAAYGT-----AGFDENM 62
Query: 102 ----GRDVVRGHVGDAAEKLIHLFCIVPR 126
R + +GD E L++L+C R
Sbjct: 63 VSLDQRKYIAEIIGDEEEALVYLYCSCDR 91
>gi|323500047|ref|ZP_08105001.1| hypothetical protein VISI1226_08534 [Vibrio sinaloensis DSM 21326]
gi|323314903|gb|EGA67960.1| hypothetical protein VISI1226_08534 [Vibrio sinaloensis DSM 21326]
Length = 177
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 69/153 (45%), Gaps = 20/153 (13%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
A L+++GAG+ H +GS HL +L W A + + + GLFH+AY A FD
Sbjct: 4 FAALKALGAGDFQHLNGSLEAHLQGTQALLNRWGADEVLQVAGLFHAAYGT-----AGFD 58
Query: 99 PN----TGRDVVRGHVGDAAEKLIHLFCIVPR----------QPLIHDDLLFEYSDQELV 144
N RD + +G AE L++L+C R +P+ D E + EL
Sbjct: 59 DNMVSLAQRDKIASIIGPQAEALVYLYCSCDRDFVFPQFGLGKPIQFKDRFVE-REFELS 117
Query: 145 EHLKASDVSLINAKEKGLFNAEEAWRKKINELL 177
L + L A E L A + ++++ E L
Sbjct: 118 AELTVAFCELTVANELELVLASDVFKQQYGEEL 150
>gi|440701263|ref|ZP_20883463.1| hypothetical protein STRTUCAR8_04651 [Streptomyces turgidiscabies
Car8]
gi|440276098|gb|ELP64416.1| hypothetical protein STRTUCAR8_04651 [Streptomyces turgidiscabies
Car8]
Length = 171
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+ LR +GA + H G+ L HL +YR+L W+A + L GL H+AY A+
Sbjct: 1 MGFLRQLGAEQIGHPGGTLLAHLCRVYRLLAAWEARPALRLAGLCHAAYGTDGFPTALLS 60
Query: 99 PNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
+ ++ R +G AE+++H++ + R+
Sbjct: 61 LDRRAELAR-VIGPEAEEIVHVYAAMNRK 88
>gi|418712828|ref|ZP_13273557.1| hypothetical protein LEP1GSC099_4795 [Leptospira interrogans str.
UI 08452]
gi|410790597|gb|EKR84289.1| hypothetical protein LEP1GSC099_4795 [Leptospira interrogans str.
UI 08452]
Length = 179
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 1/78 (1%)
Query: 52 HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G
Sbjct: 21 HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79
Query: 112 DAAEKLIHLFCIVPRQPL 129
+ AE L++L+ R+ L
Sbjct: 80 EKAETLVYLYASCDREYL 97
>gi|418666230|ref|ZP_13227661.1| hypothetical protein LEP1GSC019_4608 [Leptospira interrogans
serovar Pyrogenes str. 2006006960]
gi|410758177|gb|EKR19776.1| hypothetical protein LEP1GSC019_4608 [Leptospira interrogans
serovar Pyrogenes str. 2006006960]
Length = 176
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 41 VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
+L+ + H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N
Sbjct: 7 ILKQLSVSNFEHLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTN 66
Query: 101 TGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
+++V G +G+ AE L++L+ R L
Sbjct: 67 ARKEIV-GIIGEKAETLVYLYGSCDRTHL 94
>gi|323491275|ref|ZP_08096460.1| hypothetical protein VIBR0546_12617 [Vibrio brasiliensis LMG 20546]
gi|323314401|gb|EGA67480.1| hypothetical protein VIBR0546_12617 [Vibrio brasiliensis LMG 20546]
Length = 178
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 1/88 (1%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+L+ +GAG+ H +GS HL +L W + + + GLFH+AY + + +
Sbjct: 4 FTMLKQLGAGDFQHLNGSLEAHLKGTQAVLNEWNSSELLQTAGLFHAAYGTAGFDENMVS 63
Query: 99 PNTGRDVVRGHVGDAAEKLIHLFCIVPR 126
N +++ + +GDA E L++L+C R
Sbjct: 64 LNQRQEIAQV-IGDAEEALVYLYCSCDR 90
>gi|260778426|ref|ZP_05887318.1| hypothetical protein VIC_003827 [Vibrio coralliilyticus ATCC
BAA-450]
gi|260604590|gb|EEX30885.1| hypothetical protein VIC_003827 [Vibrio coralliilyticus ATCC
BAA-450]
Length = 187
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 63/145 (43%), Gaps = 10/145 (6%)
Query: 42 LRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNT 101
L S+GAG+ H +GS HLV +L W A D V GLFH+AY + + ++
Sbjct: 8 LESLGAGDFQHLNGSLESHLVGTESLLAKWHASDTVRSAGLFHAAYGTAGFDESMVSLEM 67
Query: 102 GRDVVRGHVGDAAEKLIHLFCIVPR---------QPLIHDDLLFEYSDQELVEHLKASDV 152
R + +G AE L++L+C R +P I F S E+ E
Sbjct: 68 -RSEIAAVIGREAEDLVYLYCSCDRDYVFPQFDSKPFIQFRDRFTGSVFEMTEEQLHQFC 126
Query: 153 SLINAKEKGLFNAEEAWRKKINELL 177
L A E L A E ++ K + L
Sbjct: 127 ELTVANELELVLASEEFKLKYGQEL 151
>gi|418704779|ref|ZP_13265646.1| hypothetical protein LEP1GSC096_2354 [Leptospira interrogans
serovar Hebdomadis str. R499]
gi|410765392|gb|EKR36092.1| hypothetical protein LEP1GSC096_2354 [Leptospira interrogans
serovar Hebdomadis str. R499]
Length = 179
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 37 SLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAI 96
+++ L+ + H +G L HL+ ++IL+ W +P+ +C GLFH+ Y +
Sbjct: 6 NILKQLKELSVSNFEHLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPV 65
Query: 97 FDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
N +++V G +G+ AE L++L+ R L
Sbjct: 66 VGTNARKEIV-GIIGEKAETLVYLYGSCDRTHL 97
>gi|417759633|ref|ZP_12407667.1| hypothetical protein LEP1GSC027_1397 [Leptospira interrogans str.
2002000624]
gi|417776498|ref|ZP_12424335.1| hypothetical protein LEP1GSC025_4235 [Leptospira interrogans str.
2002000621]
gi|418671397|ref|ZP_13232749.1| hypothetical protein LEP1GSC026_2270 [Leptospira interrogans str.
2002000623]
gi|409944381|gb|EKN89964.1| hypothetical protein LEP1GSC027_1397 [Leptospira interrogans str.
2002000624]
gi|410573707|gb|EKQ36752.1| hypothetical protein LEP1GSC025_4235 [Leptospira interrogans str.
2002000621]
gi|410581658|gb|EKQ49467.1| hypothetical protein LEP1GSC026_2270 [Leptospira interrogans str.
2002000623]
Length = 152
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 52 HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G
Sbjct: 21 HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79
Query: 112 DAAEKLIHLFCIVPRQPL 129
+ AE L++L+ R L
Sbjct: 80 EKAETLVYLYGSCDRTHL 97
>gi|333985947|ref|YP_004515157.1| hypothetical protein [Methylomonas methanica MC09]
gi|333809988|gb|AEG02658.1| hypothetical protein Metme_4309 [Methylomonas methanica MC09]
Length = 210
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 1/85 (1%)
Query: 42 LRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNT 101
LR +GAGE H +G L HL Y +L W A + +C GL+H+AY + + + N
Sbjct: 10 LRELGAGEFEHLNGRLLSHLRGTYDLLHHWGANEVLCKAGLYHAAYGTAGYDEKLISLNE 69
Query: 102 GRDVVRGHVGDAAEKLIHLFCIVPR 126
R ++ +G +E++++++C R
Sbjct: 70 -RLQIQELIGLESEEVVYVYCACDR 93
>gi|24217151|ref|NP_714634.1| hypothetical protein LB_090 [Leptospira interrogans serovar Lai
str. 56601]
gi|45655655|ref|YP_003464.1| hypothetical protein LIC20072 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386076110|ref|YP_005990299.1| hypothetical protein LIF_B073 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417766358|ref|ZP_12414310.1| hypothetical protein LEP1GSC007_1983 [Leptospira interrogans
serovar Bulgarica str. Mallika]
gi|417786989|ref|ZP_12434674.1| hypothetical protein LEP1GSC077_0204 [Leptospira interrogans str.
C10069]
gi|418699087|ref|ZP_13260054.1| hypothetical protein LEP1GSC087_2323 [Leptospira interrogans
serovar Bataviae str. L1111]
gi|418710612|ref|ZP_13271382.1| hypothetical protein LEP1GSC097_1767 [Leptospira interrogans
serovar Grippotyphosa str. UI 08368]
gi|418725668|ref|ZP_13284286.1| hypothetical protein LEP1GSC104_0409 [Leptospira interrogans str.
UI 12621]
gi|418733770|ref|ZP_13290881.1| hypothetical protein LEP1GSC105_0275 [Leptospira interrogans str.
UI 12758]
gi|421086775|ref|ZP_15547623.1| hypothetical protein LEP1GSC173_0283 [Leptospira santarosai str.
HAI1594]
gi|421103931|ref|ZP_15564527.1| hypothetical protein LEP1GSC117_0124 [Leptospira interrogans
serovar Icterohaemorrhagiae str. Verdun LP]
gi|421128388|ref|ZP_15588603.1| hypothetical protein LEP1GSC020_0149 [Leptospira interrogans
serovar Grippotyphosa str. 2006006986]
gi|421135717|ref|ZP_15595837.1| hypothetical protein LEP1GSC009_0389 [Leptospira interrogans
serovar Grippotyphosa str. Andaman]
gi|24202189|gb|AAN51649.1| hypothetical protein LB_090 [Leptospira interrogans serovar Lai
str. 56601]
gi|45602626|gb|AAS72101.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353459772|gb|AER04316.1| hypothetical protein LIF_B073 [Leptospira interrogans serovar Lai
str. IPAV]
gi|400351185|gb|EJP03425.1| hypothetical protein LEP1GSC007_1983 [Leptospira interrogans
serovar Bulgarica str. Mallika]
gi|409949841|gb|EKO04374.1| hypothetical protein LEP1GSC077_0204 [Leptospira interrogans str.
C10069]
gi|409961305|gb|EKO25052.1| hypothetical protein LEP1GSC104_0409 [Leptospira interrogans str.
UI 12621]
gi|410020022|gb|EKO86827.1| hypothetical protein LEP1GSC009_0389 [Leptospira interrogans
serovar Grippotyphosa str. Andaman]
gi|410366412|gb|EKP21804.1| hypothetical protein LEP1GSC117_0124 [Leptospira interrogans
serovar Icterohaemorrhagiae str. Verdun LP]
gi|410430804|gb|EKP75167.1| hypothetical protein LEP1GSC173_0283 [Leptospira santarosai str.
HAI1594]
gi|410434113|gb|EKP83254.1| hypothetical protein LEP1GSC020_0149 [Leptospira interrogans
serovar Grippotyphosa str. 2006006986]
gi|410761947|gb|EKR28118.1| hypothetical protein LEP1GSC087_2323 [Leptospira interrogans
serovar Bataviae str. L1111]
gi|410769201|gb|EKR44444.1| hypothetical protein LEP1GSC097_1767 [Leptospira interrogans
serovar Grippotyphosa str. UI 08368]
gi|410772951|gb|EKR52983.1| hypothetical protein LEP1GSC105_0275 [Leptospira interrogans str.
UI 12758]
gi|456966682|gb|EMG08216.1| hypothetical protein LEP1GSC151_5661 [Leptospira interrogans
serovar Grippotyphosa str. LT2186]
gi|456986895|gb|EMG22351.1| hypothetical protein LEP1GSC150_0599 [Leptospira interrogans
serovar Copenhageni str. LT2050]
Length = 179
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 52 HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G
Sbjct: 21 HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79
Query: 112 DAAEKLIHLFCIVPRQPL 129
+ AE L++L+ R L
Sbjct: 80 EKAETLVYLYGSCDRTHL 97
>gi|421119635|ref|ZP_15579955.1| hypothetical protein LEP1GSC057_2418 [Leptospira interrogans str.
Brem 329]
gi|410347786|gb|EKO98659.1| hypothetical protein LEP1GSC057_2418 [Leptospira interrogans str.
Brem 329]
Length = 179
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 52 HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G
Sbjct: 21 HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79
Query: 112 DAAEKLIHLFCIVPRQPL 129
+ AE L++L+ R L
Sbjct: 80 EKAETLVYLYGSCDRTHL 97
>gi|456825880|gb|EMF74258.1| hypothetical protein LEP1GSC148_0133 [Leptospira interrogans
serovar Canicola str. LT1962]
Length = 179
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 52 HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G
Sbjct: 21 HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79
Query: 112 DAAEKLIHLFCIVPRQPL 129
+ AE L++L+ R L
Sbjct: 80 EKAETLVYLYGSCDRTHL 97
>gi|455793441|gb|EMF45139.1| hypothetical protein LEP1GSC067_2065 [Leptospira interrogans
serovar Lora str. TE 1992]
Length = 179
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 52 HKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVG 111
H +G L HL+ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G
Sbjct: 21 HLNGDLLSHLLGTFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVVGTNARKEIV-GIIG 79
Query: 112 DAAEKLIHLFCIVPRQPL 129
+ AE L++L+ R L
Sbjct: 80 EKAETLVYLYGSCDRTHL 97
>gi|440736559|ref|ZP_20916154.1| prolyl 4-hydroxylase subunit alpha [Pseudomonas fluorescens
BRIP34879]
gi|440382920|gb|ELQ19402.1| prolyl 4-hydroxylase subunit alpha [Pseudomonas fluorescens
BRIP34879]
Length = 410
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 6/111 (5%)
Query: 39 IAVLRSVGAGECWHK-HGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSN-SYVNLAI 96
++ L GA + H S +HL+ IL A D V L G++HS YS S+ ++ I
Sbjct: 245 LSFLAKTGAADQVHSGRSSLQDHLLGTATILSKTGAGDDVQLAGMYHSIYSTQSFHHVTI 304
Query: 97 FDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHL 147
P +D V G +G AAEKL+ F ++PR P + L E ++ +EHL
Sbjct: 305 --PIEDKDQVVGMIGPAAEKLVRAFSVLPR-PAVLSQALIE-KGEDWIEHL 351
>gi|398334432|ref|ZP_10519137.1| hypothetical protein LkmesMB_01530 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 182
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 1/94 (1%)
Query: 36 PSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLA 95
P L+ L+S+ A H HG L HL+ + IL+ W + +C GL+H+ Y +
Sbjct: 7 PILLDQLKSLHAQGKEHLHGDLLTHLIGTFYILEEWGSSIDLCNAGLYHAVYGTFAFQDS 66
Query: 96 IFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
+ + T R V +G AE L++L+ R L
Sbjct: 67 LLE-TTARKEVENLIGSKAETLVYLYGTCDRNHL 99
>gi|418690869|ref|ZP_13251976.1| hypothetical protein LEP1GSC080_1255 [Leptospira interrogans str.
FPW2026]
gi|400359905|gb|EJP15886.1| hypothetical protein LEP1GSC080_1255 [Leptospira interrogans str.
FPW2026]
Length = 154
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 58 LEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKL 117
L HL++ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G+ AE L
Sbjct: 2 LSHLLETFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVIGTNARKEIV-GIIGEKAETL 60
Query: 118 IHLFCIVPRQPL 129
++L+ R+ L
Sbjct: 61 VYLYASCDREYL 72
>gi|417951950|ref|ZP_12595024.1| hypothetical protein VISP3789_10999 [Vibrio splendidus ATCC 33789]
gi|342802999|gb|EGU38379.1| hypothetical protein VISP3789_10999 [Vibrio splendidus ATCC 33789]
Length = 177
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 41 VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
+L+ +GAG+ H +GS HL ++L W + + + + GLFH+AY + + + +
Sbjct: 6 MLQELGAGDFQHLNGSLEAHLKGTEQVLINWGSSELLQIAGLFHAAYGTAGFDENMVSLS 65
Query: 101 TGRDVVRGHVGDAAEKLIHLFCIVPR 126
+D+ R +G+ E L++L+C R
Sbjct: 66 QRQDIAR-IIGNDEEALVYLYCSCDR 90
>gi|421116003|ref|ZP_15576396.1| hypothetical protein LEP1GSC069_1854 [Leptospira interrogans
serovar Canicola str. Fiocruz LV133]
gi|410012499|gb|EKO70597.1| hypothetical protein LEP1GSC069_1854 [Leptospira interrogans
serovar Canicola str. Fiocruz LV133]
Length = 124
Score = 49.7 bits (117), Expect = 0.002, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 1/64 (1%)
Query: 58 LEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKL 117
L HL++ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G+ AE L
Sbjct: 2 LSHLLETFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVIGTNARKEIV-GIIGEKAETL 60
Query: 118 IHLF 121
++L+
Sbjct: 61 VYLY 64
>gi|417772581|ref|ZP_12420469.1| hypothetical protein LEP1GSC014_0232 [Leptospira interrogans
serovar Pomona str. Pomona]
gi|418681886|ref|ZP_13243108.1| hypothetical protein LEP1GSC045_3464 [Leptospira interrogans
serovar Pomona str. Kennewicki LC82-25]
gi|400326403|gb|EJO78670.1| hypothetical protein LEP1GSC045_3464 [Leptospira interrogans
serovar Pomona str. Kennewicki LC82-25]
gi|409945258|gb|EKN95274.1| hypothetical protein LEP1GSC014_0232 [Leptospira interrogans
serovar Pomona str. Pomona]
gi|455669757|gb|EMF34815.1| hypothetical protein LEP1GSC201_0260 [Leptospira interrogans
serovar Pomona str. Fox 32256]
Length = 154
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 58 LEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKL 117
L HL++ ++IL+ W +P+ +C GLFH+ Y + N +++V G +G+ AE L
Sbjct: 2 LSHLLETFQILEEWGSPEFLCYAGLFHAVYGTFAYQNPVIGTNARKEIV-GIIGEKAETL 60
Query: 118 IHLFCIVPRQPL 129
++L+ R L
Sbjct: 61 VYLYGSCDRTHL 72
>gi|37680059|ref|NP_934668.1| hypothetical protein VV1875 [Vibrio vulnificus YJ016]
gi|37198805|dbj|BAC94639.1| hypothetical protein [Vibrio vulnificus YJ016]
Length = 180
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 9/92 (9%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+L+ +GAG+ H +GS HL ++L W + + + + GLFH+AY A FD
Sbjct: 7 FGMLQELGAGDFQHLNGSLEAHLKGTEKVLINWGSSELLQIAGLFHAAYGT-----AGFD 61
Query: 99 PN----TGRDVVRGHVGDAAEKLIHLFCIVPR 126
N + R + +G+ E L++L+C R
Sbjct: 62 ENMVSLSQRQEIARIIGNDEEALVYLYCSCDR 93
>gi|388600182|ref|ZP_10158578.1| hypothetical protein VcamD_09820 [Vibrio campbellii DS40M4]
Length = 177
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 9/92 (9%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+L+ +GAG+ H +GS HL ++L W + + + + GLFH+AY A FD
Sbjct: 4 FVMLQELGAGDFQHLNGSLEAHLKGTEQVLINWGSSELLQIAGLFHAAYGT-----AGFD 58
Query: 99 PN----TGRDVVRGHVGDAAEKLIHLFCIVPR 126
N + R + +G+ E L++L+C R
Sbjct: 59 ENMVSLSQRQEIARIIGNDEEALVYLYCSCDR 90
>gi|428172851|gb|EKX41757.1| hypothetical protein GUITHDRAFT_112173 [Guillardia theta CCMP2712]
Length = 287
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/164 (21%), Positives = 69/164 (42%), Gaps = 37/164 (22%)
Query: 52 HKHGS-FLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHV 110
H HG F +H+ + IL+ W P +C+ GLFHS Y+ +F T R ++ +
Sbjct: 56 HTHGRLFPDHIRGVADILQQWNQPHEICMAGLFHSCYATEMFPWKLFS-LTERSRLQETL 114
Query: 111 GDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKGLFNAEEAWR 170
G E+++ L+C + G++ ++
Sbjct: 115 GQEVEEIVFLYC---------------------------------TCSQNGIYRQVKSLM 141
Query: 171 KKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQ 214
++ + +PA+GL V + TG+ + ++ A +++ AD +Q
Sbjct: 142 RRGQD-IPAEGL-VTNYYTGQQCYLPHKLAAKVLVVLGADLMEQ 183
>gi|145341088|ref|XP_001415647.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575870|gb|ABO93939.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 40/87 (45%)
Query: 43 RSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTG 102
R A + H GSF HLVD+ L + AP V GL H+AY + A+
Sbjct: 73 REARATKLTHVRGSFDAHLVDVRDWLARFGAPRSVARAGLGHAAYGSELFPCAVASFGAH 132
Query: 103 RDVVRGHVGDAAEKLIHLFCIVPRQPL 129
R + R G A+E+L+ L+ ++
Sbjct: 133 RSMFRAVCGRASERLMFLYATCSQRKF 159
>gi|300784829|ref|YP_003765120.1| hydrolase/acyltransferase [Amycolatopsis mediterranei U32]
gi|399536712|ref|YP_006549374.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
gi|299794343|gb|ADJ44718.1| predicted hydrolase/acyltransferase [Amycolatopsis mediterranei
U32]
gi|398317482|gb|AFO76429.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
Length = 396
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 3/113 (2%)
Query: 16 LVASARPFLRGE-LEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAP 74
L+A RP E + A+ P+L A+LR +GA + H G+ L+HL + +
Sbjct: 204 LIARDRPVPLAETIAAVTAEWPAL-ALLRDLGAEQAGHPGGTLLDHLRRVRDQVADGGGS 262
Query: 75 DCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
L L H+AY + P+ R +R +GDAAE L+H + + R
Sbjct: 263 RRARLAALCHAAYGTDGFPHPLLPPDQ-RARLRAAIGDAAESLVHRYGVCDRS 314
>gi|443291914|ref|ZP_21031008.1| Conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
gi|385885102|emb|CCH19115.1| Conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
Length = 178
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 1/88 (1%)
Query: 40 AVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDP 99
A LR GA E H G+ HL + L + V GL H+AY +LA+ DP
Sbjct: 9 AWLRRQGAEEIAHPGGNLYAHLCRVSERLALLGCDSDVQAAGLTHAAYGTDGFDLALLDP 68
Query: 100 NTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
R V+R +G AE+L++L+ R+
Sbjct: 69 -ADRAVLRDLIGADAEELVYLYGACDRR 95
>gi|270264694|ref|ZP_06192959.1| hypothetical protein SOD_i01110 [Serratia odorifera 4Rx13]
gi|270041377|gb|EFA14476.1| hypothetical protein SOD_i01110 [Serratia odorifera 4Rx13]
Length = 190
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 64/170 (37%), Gaps = 43/170 (25%)
Query: 44 SVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNL-AIFD-PNT 101
++G + H + HL+ Y IL + VCL GLFH Y ++ I D P T
Sbjct: 17 TLGVHKVAHLDEPLMNHLIRTYEILCRMSCEEQVCLAGLFHGCYGTQGLHTDEIGDIPET 76
Query: 102 GRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKEKG 161
R +VR G E L++ F ++
Sbjct: 77 QRQIVRNCAGTQVESLVYHFSVM------------------------------------- 99
Query: 162 LFNAEEAWRKKINELLPADGL-TVKHIKTGEDVGVSRRVVATFVMLTIAD 210
+ E+ K LL ADG+ +K ++GE + V+R + L +AD
Sbjct: 100 ---SYESLSKSFRNLLKADGIPALKDRRSGEAISVTREYFEKLLTLKLAD 146
>gi|384148102|ref|YP_005530918.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
gi|340526256|gb|AEK41461.1| hydrolase/acyltransferase [Amycolatopsis mediterranei S699]
Length = 373
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 3/113 (2%)
Query: 16 LVASARPFLRGE-LEAIDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAP 74
L+A RP E + A+ P+L A+LR +GA + H G+ L+HL + +
Sbjct: 181 LIARDRPVPLAETIAAVTAEWPAL-ALLRDLGAEQAGHPGGTLLDHLRRVRDQVADGGGS 239
Query: 75 DCVCLCGLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
L L H+AY + P+ R +R +GDAAE L+H + + R
Sbjct: 240 RRARLAALCHAAYGTDGFPHPLLPPDQ-RARLRAAIGDAAESLVHRYGVCDRS 291
>gi|29827165|ref|NP_821799.1| hypothetical protein SAV_624 [Streptomyces avermitilis MA-4680]
gi|29604263|dbj|BAC68334.1| hypothetical protein SAV_624 [Streptomyces avermitilis MA-4680]
Length = 188
Score = 45.1 bits (105), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 41 VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
+LR +GA E H G+ L HL + L +W+A + L GL H+ Y A+ +
Sbjct: 14 LLRELGATEIAHPGGTLLAHLQRVQEQLAVWEARTALQLAGLCHAFYGTDGFPAALLSLD 73
Query: 101 TGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
RD + +G AE +++L+ R+
Sbjct: 74 R-RDELAAAIGVEAEGIVYLYASCDRK 99
>gi|410630511|ref|ZP_11341200.1| hypothetical protein GARC_1086 [Glaciecola arctica BSs20135]
gi|410149953|dbj|GAC18067.1| hypothetical protein GARC_1086 [Glaciecola arctica BSs20135]
Length = 178
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 5/100 (5%)
Query: 31 IDKNLPSLIAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNS 90
+DK LI +GAGE H GS + HL +LK W A + GL+H+AY S
Sbjct: 1 MDKKFRLLI----ELGAGEFEHVDGSLIAHLEGTRCLLKEWNASQVLQDAGLYHAAYGTS 56
Query: 91 YVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLI 130
+F+ R+ V +G E +++ +C R
Sbjct: 57 AFAQNLFEL-AQREEVATVIGSEIENIVYHYCACDRDAFF 95
>gi|303285608|ref|XP_003062094.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456505|gb|EEH53806.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 367
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 38 LIAVLRSVGAGECWHKHGSFLE-HLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAI 96
+ A+LR+ G+ H G L+ HL + ILK W+AP + + GL H+A+ + ++ +
Sbjct: 93 VAAMLRARGSSSTPHLDGVTLDDHLSHVASILKRWRAPRSIVVAGLAHTAFGSEFLPVRA 152
Query: 97 FDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPL 129
P R V G AE+++ L+ + L
Sbjct: 153 M-PFASRRVAEETCGAWAERMMFLYGTASQSKL 184
>gi|108707196|gb|ABF94991.1| Histone-like transcription factor and archaeal histone family
protein, expressed [Oryza sativa Japonica Group]
Length = 358
Score = 44.3 bits (103), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 11/63 (17%)
Query: 254 LSRMGAIYSLIAREEEI--FIQERKTSAAAVGVGVDKERDEDIELPVPPVFENCTRILDA 311
+SR+ A+Y L +EEI ER V K +D ++EL +PPVF C ++LDA
Sbjct: 1 MSRLAAVYRLFTTDEEIHELTGER---------SVVKTKDAELELVIPPVFVRCNKVLDA 51
Query: 312 KEQ 314
EQ
Sbjct: 52 GEQ 54
>gi|224369129|ref|YP_002603293.1| hypothetical protein HRM2_20280 [Desulfobacterium autotrophicum
HRM2]
gi|223691846|gb|ACN15129.1| tetratricopeptide (TPR) domain protein [Desulfobacterium
autotrophicum HRM2]
Length = 761
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 44/92 (47%), Gaps = 12/92 (13%)
Query: 309 LDAKEQIEARDLYWEAVCGNSDG------------SKEKLEKAEELLLRCIEKNPFVGEP 356
L KE +A+D Y+ A+ N D +++ EKA L +EKNP + P
Sbjct: 553 LAQKETNKAKDAYYRAIEANPDYLEPYGAIARIALAEQDREKAVIQYLTLLEKNPNLPAP 612
Query: 357 HVVLGQIYLAKGEFEEAEREAGKGLMLLLEWG 388
H++LG IY F++AE K L + E+
Sbjct: 613 HMMLGSIYETDKSFDKAEAHYRKALEINPEYA 644
>gi|326783900|ref|YP_004324294.1| hypothetical protein SSSM7_242 [Synechococcus phage S-SSM7]
gi|310003912|gb|ADO98307.1| hypothetical protein SSSM7_242 [Synechococcus phage S-SSM7]
Length = 152
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 5/92 (5%)
Query: 35 LPSLIAVLRSVGAGECWHKH--GSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYV 92
+ I L SVGA + H H S L H + + +L + P GLFHS Y N +
Sbjct: 2 MDDYINYLVSVGADKIPHYHEGDSLLSHCIRVSGMLYSYGRPMYEVRAGLFHSVYGNEF- 60
Query: 93 NLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIV 124
++ N R+ + +G+ AE +++ FC +
Sbjct: 61 --QMYQINATRNEICDIIGEDAEHIVNTFCTL 90
>gi|428181274|gb|EKX50138.1| hypothetical protein GUITHDRAFT_135325 [Guillardia theta CCMP2712]
Length = 165
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 7/102 (6%)
Query: 74 PDCVCLCGLFHSAYSNS-YVNLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHD 132
P +C GLFHS Y S + + +I GR VVR VG+ AE L+ LF + R +H
Sbjct: 10 PQRICAAGLFHSIYGTSIFKHTSIVPDARGRAVVREVVGEEAEHLVWLFHSIDRPGTLHA 69
Query: 133 DLL------FEYSDQELVEHLKASDVSLINAKEKGLFNAEEA 168
+L F ++ L++ + + + + + L E A
Sbjct: 70 KILPCANGAFPAGEEVLLDKTDGTTIHVTRRQFEDLLAVEMA 111
>gi|303288031|ref|XP_003063304.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455136|gb|EEH52440.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 430
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 106/288 (36%), Gaps = 55/288 (19%)
Query: 38 LIAVLRSVGAGECWHKHG----SFLEHLVDIYRILKIWKAPDCVCLCGLFHSAY-SNSYV 92
L+ V R +GA C S H W + GLFHS Y + +
Sbjct: 189 LLGVKRVLGASTCHPATSPNTTSPTNHTASFPSTSASWGCSPALQDAGLFHSVYGTEGFQ 248
Query: 93 NLAIFDPNTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDV 152
+ P RD VR VG AE+L+ FC+V R + D +L + K
Sbjct: 249 GFCL--PYERRDDVRALVGAYAERLVWTFCVVDRLSV----------DNDLDDRTK---- 292
Query: 153 SLINAKEKGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVV-ATFVMLTIADF 211
S + E G F+ + +DG + I DV RR F+ LT+AD+
Sbjct: 293 SWLARPELGRFDVDA-----------SDG-ELSDISDDADVKTRRRDFWHDFITLTLADW 340
Query: 212 SDQLFGFQDFLFNNSDGKLEFAGNSFSALWPGDGKPGLWMNSLSRMGAIYSLIAREEEIF 271
+Q+ +G F PGD G + +RM I + + +
Sbjct: 341 LEQV-----------EGASRTTSEIFEWFEPGDAW-GYRRGAYARMVDIVAARYPKARVL 388
Query: 272 IQERKTSAAAVGVGVDKERDEDIELPV-PPVFENCTRILDAKEQIEAR 318
AV + E+ + + P+ PP+ E R + E++E R
Sbjct: 389 YD-------AV-YAREGEKSKGVRQPITPPMSEAARRAREGMEKVEPR 428
>gi|284029839|ref|YP_003379770.1| hypothetical protein Kfla_1877 [Kribbella flavida DSM 17836]
gi|283809132|gb|ADB30971.1| conserved hypothetical protein [Kribbella flavida DSM 17836]
Length = 178
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 38/82 (46%)
Query: 46 GAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPNTGRDV 105
GA E H G+ HL + + L A D + L GL H+AY +FD R V
Sbjct: 15 GADELDHAGGTLYVHLHRVAKRLSSLGASDTLVLAGLAHAAYGTDGFPTHLFDWQHERPV 74
Query: 106 VRGHVGDAAEKLIHLFCIVPRQ 127
+ VG AE+L++ + R+
Sbjct: 75 LEAVVGPEAEQLVYRYGACERE 96
>gi|424037490|ref|ZP_17776269.1| hypothetical protein VCHENC02_2542A, partial [Vibrio cholerae
HENC-02]
gi|408895483|gb|EKM31860.1| hypothetical protein VCHENC02_2542A, partial [Vibrio cholerae
HENC-02]
Length = 82
Score = 41.6 bits (96), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 41 VLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFDPN 100
+L+ +GAG+ H +GS HL ++L W + + + + GLFH+AY + + + +
Sbjct: 6 MLQELGAGDFQHLNGSLEAHLKGTEQVLIDWGSSELLQIAGLFHAAYGTAGFDEKMVSLS 65
Query: 101 TGRDVVRGHVGDAAEKLI 118
+++ R +G+ E L+
Sbjct: 66 QRQEIAR-IIGNDEEALV 82
>gi|359150105|ref|ZP_09182966.1| hypothetical protein StrS4_25971 [Streptomyces sp. S4]
Length = 205
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+A+L +GAG H G HL ++ L W A + L GL H+ Y A+
Sbjct: 16 LALLGELGAGRLPHPGGDLATHLRRVHSQLATWGARPALQLAGLCHAFYGTDGFPTALL- 74
Query: 99 PNTGRDVVRGHVGDAAEKLIHLF 121
P R + +G+ AE+L++L+
Sbjct: 75 PLPRRAELAAVIGEEAEELVYLY 97
>gi|302552380|ref|ZP_07304722.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM
40736]
gi|302469998|gb|EFL33091.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM
40736]
Length = 182
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+A+L +GA + H G+ + HL + L W A + L GL H+ Y A+
Sbjct: 12 VALLHELGAADLAHPGGTLIAHLQRVQGQLAAWDARPALQLAGLCHAFYGTDGFPEALL- 70
Query: 99 PNTGRDVVRGHVGDAAEKLIHLF 121
P R + VG AE L++L+
Sbjct: 71 PLDRRSELTAVVGAEAEALVYLY 93
>gi|291222353|ref|XP_002731180.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1253
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/85 (24%), Positives = 40/85 (47%)
Query: 297 PVPPVFENCTRILDAKEQIEARDLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEP 356
P PV + C +LD + + + G ++ A+ L C++ NP +
Sbjct: 435 PAGPVLKRCAAVLDPLTKAVPGLMEGLFLMGKVKFLSGDIDAAQSTLQHCLDINPTFSDA 494
Query: 357 HVVLGQIYLAKGEFEEAEREAGKGL 381
H+++ QIY +G F++A + +GL
Sbjct: 495 HLLMAQIYAHQGSFKQASQSLEQGL 519
>gi|357439003|ref|XP_003589778.1| hypothetical protein MTR_1g039170 [Medicago truncatula]
gi|355478826|gb|AES60029.1| hypothetical protein MTR_1g039170 [Medicago truncatula]
Length = 71
Score = 40.4 bits (93), Expect = 1.6, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 10 NDYLETLVASARPFLRGELEAIDKNLPSLIAVLRSVGAGE---CWH 52
N + V + FLR +L +I NLPS I VLRSVGA E C+H
Sbjct: 22 NSRISATVLINKLFLRNDLTSIITNLPSRITVLRSVGASESASCYH 67
>gi|302532161|ref|ZP_07284503.1| conserved hypothetical protein [Streptomyces sp. C]
gi|302441056|gb|EFL12872.1| conserved hypothetical protein [Streptomyces sp. C]
Length = 202
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
Query: 39 IAVLRSVGAGECWHKHGSFLEHLVDIYRILKIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
+A+ + VGA H G+ L HL+ + +L W A + L H+ Y + L + +
Sbjct: 1 MALPKEVGAETTKHSGGTLLTHLLRVEALLASWGARPVLRTAALCHAFYGTDGLPLVLLE 60
Query: 99 PNTGRDVVRGHVGDAAEKLIHLFCIVPRQ 127
+ RD + +G AE+L++ + R+
Sbjct: 61 LSE-RDRLAEAIGREAEELVYFYASCDRK 88
>gi|351713927|gb|EHB16846.1| Tetratricopeptide repeat protein 21A [Heterocephalus glaber]
Length = 1426
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ + RC+E +P + H++L QIYLA+G F
Sbjct: 609 ELENAQSTIQRCLELDPTSADAHLLLAQIYLAQGNF 644
>gi|380806327|gb|AFE75039.1| tetratricopeptide repeat protein 21A isoform 1, partial [Macaca
mulatta]
Length = 114
Score = 39.7 bits (91), Expect = 2.6, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 17 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 52
>gi|220928054|ref|YP_002504963.1| hypothetical protein Ccel_0602 [Clostridium cellulolyticum H10]
gi|219998382|gb|ACL74983.1| Tetratricopeptide TPR_2 repeat protein [Clostridium cellulolyticum
H10]
Length = 586
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 31/49 (63%), Gaps = 7/49 (14%)
Query: 327 GNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAER 375
GN D +KE+ E RCI+ NP + +++LG+++++ G++ EA +
Sbjct: 305 GNIDEAKERFE-------RCIKNNPRQADAYILLGKLFMSVGQYSEASK 346
>gi|333372283|ref|ZP_08464215.1| hypothetical protein HMPREF9374_1960 [Desmospora sp. 8437]
gi|332974490|gb|EGK11412.1| hypothetical protein HMPREF9374_1960 [Desmospora sp. 8437]
Length = 305
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 32/50 (64%)
Query: 332 SKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKGL 381
++K E+ E +LL CI+K P G H L +I+ A+G+ E+A ++ +GL
Sbjct: 88 KEDKPEEGERVLLECIDKYPDRGVAHTYLARIFAARGDREQAVQQLKEGL 137
>gi|410971684|ref|XP_003992295.1| PREDICTED: tetratricopeptide repeat protein 21A [Felis catus]
Length = 1212
Score = 39.3 bits (90), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P +PH+++ +YLA+G F
Sbjct: 468 ELENAQSTLQRCLELDPTSVDPHLLMSHVYLAQGNF 503
>gi|444525844|gb|ELV14172.1| Tetratricopeptide repeat protein 21A [Tupaia chinensis]
Length = 1394
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 493 ELENAQSTLQRCLELDPTFVDAHLLMAQIYLAQGNF 528
>gi|74223484|dbj|BAE21602.1| unnamed protein product [Mus musculus]
Length = 1233
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|408794648|ref|ZP_11206253.1| 4-alpha-glucanotransferase [Leptospira meyeri serovar Hardjo str.
Went 5]
gi|408461883|gb|EKJ85613.1| 4-alpha-glucanotransferase [Leptospira meyeri serovar Hardjo str.
Went 5]
Length = 574
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 71/178 (39%), Gaps = 33/178 (18%)
Query: 51 WHKHGSFLEHLVDIYRI------LKIWKAP--DCVCLCGLFHSAY---SNSYVNLAIFDP 99
W S+LEH +YRI +IW P D L G FH + + ++ + DP
Sbjct: 268 WKDRLSYLEHFFHLYRIDHVIGMYRIWAIPEADTTALHGWFHPQFGIATEEFIKEGL-DP 326
Query: 100 NTGRDVVRGHVGDAAEKLIHLFCIVPRQPLIHDDLLFEYSDQELVEHLKASDVSLINAKE 159
D+ LIH F +P + + D E QEL E +KA L E
Sbjct: 327 KYFEDL----------GLIHEF--IPGHYIFYWDFWKEPGYQELEEEVKA---KLFPLSE 371
Query: 160 KGLFNAEEAWRKKINELLPADGLTVKHIKTGEDVGVSRRVVATFVMLTIADFSDQLFG 217
+ E+ WR+ ++L I ED+G V +F+ ++ F Q+ G
Sbjct: 372 LHIAEEEKHWRQSGEKILEIFESFSAMIPCAEDLG----SVPSFIRDSL--FERQMIG 423
>gi|26325882|dbj|BAC26695.1| unnamed protein product [Mus musculus]
Length = 1314
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|149018252|gb|EDL76893.1| rCG25664 [Rattus norvegicus]
Length = 797
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|148677249|gb|EDL09196.1| tetratricopeptide repeat domain 21A, isoform CRA_b [Mus musculus]
Length = 1282
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|225543311|ref|NP_083011.2| tetratricopeptide repeat protein 21A [Mus musculus]
gi|150416165|sp|Q8C0S4.2|TT21A_MOUSE RecName: Full=Tetratricopeptide repeat protein 21A; Short=TPR
repeat protein 21A; AltName: Full=Tetratricopeptide
repeat-containing hedgehog modulator 2
Length = 1314
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|67626016|gb|AAY78492.1| tetratricopeptide repeat-containing hedgehog modulator 2 [Mus
musculus]
gi|126632017|gb|AAI33700.1| Tetratricopeptide repeat domain 21A [Mus musculus]
Length = 1314
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|12853191|dbj|BAB29674.1| unnamed protein product [Mus musculus]
Length = 1314
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|392342139|ref|XP_003754513.1| PREDICTED: tetratricopeptide repeat protein 21A-like [Rattus
norvegicus]
gi|392350417|ref|XP_003750652.1| PREDICTED: tetratricopeptide repeat protein 21A-like [Rattus
norvegicus]
Length = 1314
Score = 38.9 bits (89), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSTLQRCLELDPTFVDAHLLMSQIYLAQGNF 544
>gi|297463510|ref|XP_612284.5| PREDICTED: tetratricopeptide repeat protein 21A [Bos taurus]
Length = 1463
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 659 ELEHAQSTLQRCLELDPTSVDTHLLMAQIYLAQGNF 694
>gi|237755427|ref|ZP_04584053.1| putative TPR repeat-containing protein [Sulfurihydrogenibium
yellowstonense SS-5]
gi|237692427|gb|EEP61409.1| putative TPR repeat-containing protein [Sulfurihydrogenibium
yellowstonense SS-5]
Length = 329
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 32/63 (50%)
Query: 321 YWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKG 380
YW D ++E LE A L I P + + G++Y+ KGEFEEA+++ K
Sbjct: 45 YWRGKVATHDLNQESLEVAIVELSEAIRLKPDYADAYFERGKVYIQKGEFEEAKKDLEKA 104
Query: 381 LML 383
+ L
Sbjct: 105 IQL 107
>gi|297488441|ref|XP_002696967.1| PREDICTED: tetratricopeptide repeat protein 21A [Bos taurus]
gi|296475124|tpg|DAA17239.1| TPA: tetratricopeptide repeat domain 21A-like [Bos taurus]
Length = 1362
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 588 ELEHAQSTLQRCLELDPTSVDTHLLMAQIYLAQGNF 623
>gi|188997368|ref|YP_001931619.1| TPR repeat-containing protein [Sulfurihydrogenibium sp. YO3AOP1]
gi|188932435|gb|ACD67065.1| TPR repeat-containing protein [Sulfurihydrogenibium sp. YO3AOP1]
Length = 329
Score = 38.5 bits (88), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 32/63 (50%)
Query: 321 YWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAEREAGKG 380
YW D ++E LE A L I P + + G++Y+ KGEFEEA+++ K
Sbjct: 45 YWRGKVATHDLNQESLEVAIVELSEAIRLKPDYADAYFERGRVYIQKGEFEEAKKDLEKA 104
Query: 381 LML 383
+ L
Sbjct: 105 IQL 107
>gi|383777412|ref|YP_005461978.1| hypothetical protein AMIS_22420 [Actinoplanes missouriensis 431]
gi|381370644|dbj|BAL87462.1| hypothetical protein AMIS_22420 [Actinoplanes missouriensis 431]
Length = 178
Score = 38.5 bits (88), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 3/83 (3%)
Query: 40 AVLRSVGAGECWHKHGSFLEHLVDIYRIL-KIWKAPDCVCLCGLFHSAYSNSYVNLAIFD 98
A L+ G H G+ +HL + L ++ P+ V GL H+AY +LA+
Sbjct: 9 AWLQERGTDTVRHPGGTLYQHLCRVSEHLSEVGHRPE-VQAAGLTHAAYGTDGFDLALLF 67
Query: 99 PNTGRDVVRGHVGDAAEKLIHLF 121
RD +RG VGD AE+L++L+
Sbjct: 68 WQQ-RDELRGLVGDEAEELVYLY 89
>gi|395517048|ref|XP_003762694.1| PREDICTED: LOW QUALITY PROTEIN: tetratricopeptide repeat protein
21A [Sarcophilus harrisii]
Length = 1218
Score = 38.5 bits (88), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYL++G F
Sbjct: 519 ELENAQSTLQRCLELDPTSADVHLLMAQIYLSQGNF 554
>gi|431910586|gb|ELK13654.1| Tetratricopeptide repeat protein 21A, partial [Pteropus alecto]
Length = 1317
Score = 38.5 bits (88), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 498 ELENAQSTLQRCLELDPTSVDAHLLMSQIYLAQGNF 533
>gi|297286110|ref|XP_002802914.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 4
[Macaca mulatta]
Length = 1302
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 498 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 533
>gi|378726907|gb|EHY53366.1| hypothetical protein HMPREF1120_01560 [Exophiala dermatitidis
NIH/UT8656]
Length = 1358
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 26/40 (65%)
Query: 334 EKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
+ EKA EL+L+ I NP + H +L +IY AKGE E+A
Sbjct: 375 QDTEKALELILKAIAVNPEIYAAHALLSEIYFAKGEDEKA 414
>gi|194379504|dbj|BAG63718.1| unnamed protein product [Homo sapiens]
Length = 1302
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 498 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 533
>gi|21748475|emb|CAD31647.1| TRP domain containing protein [Homo sapiens]
Length = 1320
Score = 38.1 bits (87), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|402860571|ref|XP_003894699.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Papio
anubis]
Length = 1419
Score = 38.1 bits (87), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 615 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 650
>gi|355559768|gb|EHH16496.1| hypothetical protein EGK_11783 [Macaca mulatta]
Length = 1321
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 517 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 552
>gi|297286105|ref|XP_002802912.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 2
[Macaca mulatta]
Length = 1344
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 540 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 575
>gi|114586265|ref|XP_516376.2| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Pan
troglodytes]
Length = 1320
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|109041688|ref|XP_001083557.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 1
[Macaca mulatta]
Length = 1313
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544
>gi|402860577|ref|XP_003894702.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 4 [Papio
anubis]
Length = 1443
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 639 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 674
>gi|402860573|ref|XP_003894700.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Papio
anubis]
Length = 1242
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 608 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 643
>gi|355746800|gb|EHH51414.1| hypothetical protein EGM_10779 [Macaca fascicularis]
Length = 1321
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 517 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 552
>gi|297286112|ref|XP_002802915.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 5
[Macaca mulatta]
Length = 1272
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503
>gi|297286107|ref|XP_002802913.1| PREDICTED: tetratricopeptide repeat protein 21A-like isoform 3
[Macaca mulatta]
Length = 1320
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|397511536|ref|XP_003826127.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Pan
paniscus]
Length = 1320
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|290543551|ref|NP_001098983.2| tetratricopeptide repeat protein 21A isoform 1 [Homo sapiens]
Length = 1272
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503
>gi|157502177|ref|NP_665698.2| tetratricopeptide repeat protein 21A isoform 2 [Homo sapiens]
gi|296453007|sp|Q8NDW8.3|TT21A_HUMAN RecName: Full=Tetratricopeptide repeat protein 21A; Short=TPR
repeat protein 21A; AltName: Full=Stress-inducible
protein 2
Length = 1320
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|410036712|ref|XP_003950106.1| PREDICTED: tetratricopeptide repeat protein 21A [Pan troglodytes]
Length = 1143
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544
>gi|402860575|ref|XP_003894701.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 3 [Papio
anubis]
Length = 1371
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 567 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 602
>gi|397511540|ref|XP_003826129.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 3 [Pan
paniscus]
Length = 1272
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503
>gi|397511538|ref|XP_003826128.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Pan
paniscus]
Length = 1143
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544
>gi|332816435|ref|XP_003309749.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Pan
troglodytes]
Length = 1272
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503
>gi|119584968|gb|EAW64564.1| tetratricopeptide repeat domain 21A, isoform CRA_c [Homo sapiens]
Length = 1320
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|67969406|dbj|BAE01054.1| unnamed protein product [Macaca fascicularis]
Length = 601
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544
>gi|426340012|ref|XP_004033929.1| PREDICTED: LOW QUALITY PROTEIN: tetratricopeptide repeat protein
21A [Gorilla gorilla gorilla]
Length = 1320
Score = 38.1 bits (87), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|354487661|ref|XP_003505990.1| PREDICTED: tetratricopeptide repeat protein 21A [Cricetulus
griseus]
Length = 1370
Score = 38.1 bits (87), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ L RC+E +P + H+++ QIYLA+G F
Sbjct: 564 ELENAQSTLQRCLELDPNFVDAHLLMSQIYLAQGNF 599
>gi|332215575|ref|XP_003256920.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 2 [Nomascus
leucogenys]
Length = 1271
Score = 38.1 bits (87), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503
>gi|332215573|ref|XP_003256919.1| PREDICTED: tetratricopeptide repeat protein 21A isoform 1 [Nomascus
leucogenys]
Length = 1319
Score = 38.1 bits (87), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 516 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 551
>gi|297671686|ref|XP_002813963.1| PREDICTED: LOW QUALITY PROTEIN: tetratricopeptide repeat protein
21A [Pongo abelii]
Length = 1344
Score = 38.1 bits (87), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 540 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 575
>gi|194379578|dbj|BAG63755.1| unnamed protein product [Homo sapiens]
Length = 1272
Score = 38.1 bits (87), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 468 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 503
>gi|120538593|gb|AAI29949.1| TTC21A protein [Homo sapiens]
Length = 1143
Score = 38.1 bits (87), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 509 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 544
>gi|118365236|ref|XP_001015839.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89297606|gb|EAR95594.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 430
Score = 38.1 bits (87), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 28/44 (63%)
Query: 332 SKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAER 375
S+ KLE+A++ LLR +E P P+ L QIY +G EEA++
Sbjct: 83 SQGKLEEAKKQLLRSLEIEPKYSHPYFQLSQIYYQQGMLEEAKQ 126
>gi|21752149|dbj|BAC04129.1| unnamed protein product [Homo sapiens]
Length = 882
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 335 KLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEF 370
+LE A+ +L RC+E +P + H+++ QIYLA+G F
Sbjct: 78 ELENAQSILQRCLELDPASVDAHLLMCQIYLAQGNF 113
>gi|118381790|ref|XP_001024055.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89305822|gb|EAS03810.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 376
Score = 38.1 bits (87), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 315 IEARDLYWEAVCGNSDGSKEK--LEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEE 372
I+ D Y++A S ++ +++A++ L + I+ NP + H LG IY +GE EE
Sbjct: 285 IQINDQYYQAYYNLSSIYYDQGNIQEAKQCLEKAIKINPLYDQAHYNLGLIYYNQGELEE 344
Query: 373 AER 375
A+R
Sbjct: 345 AKR 347
>gi|357418845|ref|YP_004933713.1| Tetratricopeptide TPR_1 repeat-containing protein [Thermovirga
lienii DSM 17291]
gi|355398188|gb|AER67616.1| Tetratricopeptide TPR_1 repeat-containing protein [Thermovirga
lienii DSM 17291]
Length = 377
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 31/46 (67%), Gaps = 1/46 (2%)
Query: 331 GSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEAERE 376
G+K+ +KAE+ + +E NP EPH+ LG ++LA+G+ + +RE
Sbjct: 312 GAKD-FKKAEDAYRKALELNPKFNEPHLGLGLVFLARGDLKGVKRE 356
>gi|167516170|ref|XP_001742426.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779050|gb|EDQ92664.1| predicted protein [Monosiga brevicollis MX1]
Length = 242
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 315 IEAR-DLYWEAVCGNSDGSKEKLEKAEELLLRCIEKNPFVGEPHVVLGQIYLAKGEFEEA 373
IE R DL + A N K L+ A+ LL+ IE+NP E H LG +Y G+ ++A
Sbjct: 111 IERRSDLVYNAAVVN--LMKRNLQDAKALLIESIERNPSSAEAHNNLGCLYAELGQADKA 168
Query: 374 EREAGKGLMLL 384
K L L+
Sbjct: 169 RTHLKKALQLM 179
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.138 0.421
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,628,863,298
Number of Sequences: 23463169
Number of extensions: 287407848
Number of successful extensions: 654610
Number of sequences better than 100.0: 203
Number of HSP's better than 100.0 without gapping: 171
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 654155
Number of HSP's gapped (non-prelim): 324
length of query: 399
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 254
effective length of database: 8,957,035,862
effective search space: 2275087108948
effective search space used: 2275087108948
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)