BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1550
(264 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242009326|ref|XP_002425440.1| Ser/Thr-rich protein T10 in DGCR region, putative [Pediculus
humanus corporis]
gi|212509266|gb|EEB12702.1| Ser/Thr-rich protein T10 in DGCR region, putative [Pediculus
humanus corporis]
Length = 280
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 130/233 (55%), Gaps = 21/233 (9%)
Query: 29 GGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GGTWLA S GK+G+LLNV + S P+ RG LV ++ + +EY+ LK E E +
Sbjct: 57 GGTWLAVSKKGKIGVLLNV-SQKSNPDKLGRGFLVANLLKTPLTTKEYMEILKIEKE--L 113
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+N +H+V E + Q N+ H+SN ++ T++ + V+ NS S PF
Sbjct: 114 YNTYHLVSAEFSPQG-NLLHYSNGKSNENVITEIDS---------EVFALSNSSMDS-PF 162
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM 208
QKVI GK+ F +I +K+ + LI +L +K + +YPD ID + + + E K
Sbjct: 163 QKVIKGKQIFNDICSKYKTIKDKDKLISELLTFLKWDEKHYPDTVID--SFSHIPEKIKP 220
Query: 209 RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYET-----KKWIHTH 256
YS + V I + +YGTRTH+IILVD N ++FHEWTL+ KWI TH
Sbjct: 221 NYSAIFVKIPTSYYGTRTHTIILVDWENKVEFHEWTLEEPILSQNDSKWIETH 273
>gi|328723712|ref|XP_001943801.2| PREDICTED: uncharacterized protein C22orf25 homolog [Acyrthosiphon
pisum]
Length = 273
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 76/217 (35%), Positives = 122/217 (56%), Gaps = 15/217 (6%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GGTWLA + GK LLNV + PN + RG LV +V+G SA EYL +L + +N
Sbjct: 55 EGGTWLALNIRGKFAALLNVFELDEIPNAKSRGQLVKNFVKGNSSALEYLQEL--QGSQN 112
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
V+NGF ++ + ++ S + F+N + P K + +V GFGN P
Sbjct: 113 VYNGFKLITITMSKLSLETHFFTNFVQDEGPPLK--------HLQKYVQGFGNG-----P 159
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
F KV G+++F +IV K+ LI+ +L ++K + +YPD E+ ++ D +
Sbjct: 160 FLKVAEGEKRFNDIVKKYGHSHYRDTLIKELLGVLKWDKLHYPDEELIKRTPKFPGTDNQ 219
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT 244
++S V V++ + YGTRTH+IIL+D + ++F+EWT
Sbjct: 220 KKFSSVFVEMPTRQYGTRTHTIILMDPDGKLEFNEWT 256
>gi|158290321|ref|XP_311935.4| AGAP002962-PA [Anopheles gambiae str. PEST]
gi|157017837|gb|EAA07544.4| AGAP002962-PA [Anopheles gambiae str. PEST]
Length = 293
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 126/247 (51%), Gaps = 23/247 (9%)
Query: 28 KGGTWLATSTNG----KLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+GGTWLA + KLG LLNV GEN RGP+V +++G+ ++ Y L
Sbjct: 56 EGGTWLAIGNHPAGQIKLGALLNVTGENKANVTNGRGPIVANFLKGELTSRAYSEQL--- 112
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKD-----HVYGF 138
++ F F+ V ++L ++ + V + +N + D GF
Sbjct: 113 LAQDNFGAFNFVSIQLDANGSDRSE-DRSEGRNQSGGAVLHTSNAPHSIDTCKTGQALGF 171
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRK-SESQNLIQNILDLMKNKQSNYPDPEIDRK 197
GNS +P QKV GKE+F EIV + N ++ L+ ++ L+K+ ++PDPE+ R+
Sbjct: 172 GNSTL-EKPLQKVCRGKEQFDEIVARCNVPFADKDALVHELMGLLKSDVRHFPDPELTRR 230
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL---DYETKKWIH 254
A + S V V + + YG+RTH++IL+DHNN M+F E TL D T +WI
Sbjct: 231 AG-----QHAEFLSSVNVRMPNGSYGSRTHTVILIDHNNHMEFIEQTLSGTDPHTGEWIT 285
Query: 255 THIRKTL 261
T I + L
Sbjct: 286 TRIEREL 292
>gi|195448871|ref|XP_002071850.1| GK10209 [Drosophila willistoni]
gi|194167935|gb|EDW82836.1| GK10209 [Drosophila willistoni]
Length = 277
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 86/243 (35%), Positives = 125/243 (51%), Gaps = 30/243 (12%)
Query: 28 KGGTWLATSTNG---KLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLT---DLK 81
+GGTWLA + K+G LLN+ GE N RG +V YV ++S++E+ + + +
Sbjct: 56 EGGTWLAIGHHQDVFKVGALLNLTGEPKPRNAIGRGMIVADYV--KESSKEFNSMNYNRR 113
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ N ++ F+ V +E+T S I SN+ P T Q YGFGNS
Sbjct: 114 LLKDCNQYSAFNFVSIEITPTSAQIQLLSNVPP-----------TLAQFEPGKSYGFGNS 162
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
S PFQKV G+++F V + S L ++L+L+KNK +PDPE+ R+A
Sbjct: 163 -LPSSPFQKVSHGQQQFEATVREHGNAS-VDVLSSHLLELLKNKHKFWPDPELQRRAP-- 218
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT---LDYETKKWIHTHIR 258
++ S + V I YG+RTH++ILVD N M F E T LD E +W+H HI
Sbjct: 219 ---NWGEGLSSLNVFIDHQGYGSRTHTVILVDSQNKMHFIEETMAGLDPEG-EWLHKHIE 274
Query: 259 KTL 261
K
Sbjct: 275 KQF 277
>gi|255918143|ref|NP_613049.2| ser/Thr-rich protein T10 in DGCR region [Mus musculus]
gi|26354360|dbj|BAC40808.1| unnamed protein product [Mus musculus]
gi|74219739|dbj|BAE40462.1| unnamed protein product [Mus musculus]
gi|74227275|dbj|BAE21736.1| unnamed protein product [Mus musculus]
gi|148665102|gb|EDK97518.1| DNA segment, Chr 16, human D22S680E, expressed, isoform CRA_c [Mus
musculus]
Length = 276
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/239 (31%), Positives = 121/239 (50%), Gaps = 36/239 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 53 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+I+ +L+ ++ V N+ E + V YG
Sbjct: 111 TEGHLYNGFNIIAADLSTSKGDV---------------VCYYGNRGEPEPIVLTPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + + L+ +LD++ N+++ PDP I+
Sbjct: 156 SNALLET-PWKKLCFGKQLFMEAVEQ-SEALPKDVLVTQLLDVLNNEEAQLPDPAIE--- 210
Query: 199 EADMDEDYKM----RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
D ++Y +Y+ VCV ++ YGTRT++IILVD N + F E + LD +T +W
Sbjct: 211 --DQGQEYVQPILNKYAAVCVRCAT--YGTRTNTIILVDANGHVTFTERSMLDKDTSRW 265
>gi|91084657|ref|XP_967518.1| PREDICTED: similar to AGAP002962-PA [Tribolium castaneum]
gi|270008630|gb|EFA05078.1| hypothetical protein TcasGA2_TC015175 [Tribolium castaneum]
Length = 271
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 114/221 (51%), Gaps = 27/221 (12%)
Query: 28 KGGTWLATSTNG---KLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
+GG WL S K G LLNV G R RGPLV Y+ G + A EY+T L E
Sbjct: 55 EGGMWLGVSLKPQMIKFGALLNVTGAPKREAQAGRGPLVYNYLAGGEPAPEYITKLVPEQ 114
Query: 85 EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
++ F++ ++E++ + T YH SN SP +T T +Q V FGNS
Sbjct: 115 ----YSAFNLFMVEVSNEIT-CYHHSN-----SPEDTLT-YTGRQ-----VLAFGNSTPQ 158
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
S PF KV G +KF EI+ N L+Q +++L+K ++ + PDPE++ +A +
Sbjct: 159 S-PFTKVKKGGQKFEEIIT--NYGGSRARLVQELINLLKCEELHLPDPELEARAPFGVG- 214
Query: 205 DYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
S + V + YGTRTHS+ILVD ++F E T+
Sbjct: 215 ----FLSSIYVRMEEGGYGTRTHSVILVDDGGNVEFVEHTM 251
>gi|350416464|ref|XP_003490957.1| PREDICTED: uncharacterized protein C22orf25-like [Bombus impatiens]
Length = 274
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 119/226 (52%), Gaps = 34/226 (15%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPN--GRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTWLA S GK G++LN+ E S + + RG LV +V SA YL L K
Sbjct: 56 EGGTWLAMSLTGKAGVVLNLSDEASLTDIPKQGRGFLVPNFVTSNDSAVSYLDKLYKTNN 115
Query: 86 EN-VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
EN ++N F +VL++ LQ+ ++ + S+ N T ++D++ GFGNS
Sbjct: 116 ENQIYNPFILVLID--LQNADVKYLSS----------SHNSTGPNSSQDNILGFGNSGL- 162
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
P++KV GKE F IV + S LI+ +L +K+K+ PDPE+ ++
Sbjct: 163 DIPYKKVEAGKEIFKNIVKDI-KVSRQMTLIEELLKFLKSKERYLPDPELQKRCSK---- 217
Query: 205 DYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTL 245
RY ++SSIF Y TRTHSI+LV+ NN + F E TL
Sbjct: 218 ----RYK----ELSSIFVSTDGYCTRTHSILLVNGNNELTFVEETL 255
>gi|354480611|ref|XP_003502498.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Cricetulus
griseus]
gi|344241494|gb|EGV97597.1| Ser/Thr-rich protein T10 in DGCR region [Cricetulus griseus]
Length = 276
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 121/235 (51%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P R RG LV ++ + YL K
Sbjct: 53 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPYARGRGELVTHFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+++ +L+ ++ V N+ E++ V YG
Sbjct: 111 TEGHLYNGFNLIAADLSTAKEDV---------------VCYYGNRGEHEPIVLTPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E+V + + LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWKKLCFGKQLFMEVVEQ-SEALPKDTLITHLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV ++ YGTRT++IILVD + + F E + L+ +T W
Sbjct: 214 R-EYVQPILSKYAAVCVRCAT--YGTRTNTIILVDADGHVTFTERSLLNKDTSHW 265
>gi|383852607|ref|XP_003701818.1| PREDICTED: uncharacterized protein C22orf25-like [Megachile
rotundata]
Length = 271
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 119/226 (52%), Gaps = 34/226 (15%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNG--RDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTWLA S GK G++LN+ E S N + RG L+ +V SA YL L E +
Sbjct: 56 EGGTWLALSMTGKAGVILNLSNEESLTNNPKQGRGALIPNFVTSNDSASSYLDKLYNENK 115
Query: 86 E-NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
+++N F ++L + L + N++ S+ PS +D + GF NS
Sbjct: 116 NGSLYNPFLLIL--INLYNANVHCLSSSINSIGPSLA----------EDSILGFSNSGL- 162
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
P++KV GKEKF IVN N S+ +LI+ +L +K+K+ + PDPE+ ++
Sbjct: 163 GVPYKKVEVGKEKFKSIVNNANV-SKQTHLIEGLLKFLKSKEKHLPDPELQKR------- 214
Query: 205 DYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTL 245
+ RY ++SSIF Y TRTHSI+LV+ N + F E TL
Sbjct: 215 -HLSRYK----ELSSIFVSAGEYSTRTHSILLVNGKNEVTFVEETL 255
>gi|324523220|gb|ADY48210.1| Ser/Thr-rich protein T10 in DGCR region [Ascaris suum]
Length = 288
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 128/238 (53%), Gaps = 26/238 (10%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSR--PNGRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
KGGTWL ++GK+GILL++L ++S P+ RG +V +Y+ +AEEY + K
Sbjct: 56 KGGTWLGIKSDGKVGILLSMLEKDSLMIPDAPTRGKIVNEYLHSDCTAEEYGQRIAKCA- 114
Query: 86 ENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK------DHVYGFG 139
++FNGF+++LL+ T Q+ + +V N ++Q++ VYGFG
Sbjct: 115 -HIFNGFNVLLLDRTKQTNG-----------QKAYRVVNFISRQDDATPETLGSGVYGFG 162
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
NS PF+KV +G F E + + N +S+ Q + + + ++++ ++PD ++ ++
Sbjct: 163 NS-LRRTPFKKVSYGARIFEEKIQQLNGQSK-QEIFEQFIGILRDDTCHHPDEQL--MSQ 218
Query: 200 ADMDEDYKMRYSRVCVDISSIF-YGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTH 256
D E+ S++ F YGTR+H++ILVD +D+ E + +K + + H
Sbjct: 219 TDQPEECSKAMSQLFFRFPPPFRYGTRSHTVILVDGLGHVDYLERSQIPPSKDFTNIH 276
>gi|195130583|ref|XP_002009731.1| GI15520 [Drosophila mojavensis]
gi|193908181|gb|EDW07048.1| GI15520 [Drosophila mojavensis]
Length = 282
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 124/245 (50%), Gaps = 32/245 (13%)
Query: 28 KGGTWLA---TSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
+GGTWLA K+G LLN+ GE N RG +V YV Q E + + +
Sbjct: 56 EGGTWLAIGGGKGVFKVGALLNLTGEPKPRNAVGRGMIVADYVR-QTDIEHNIVNYNQSL 114
Query: 85 EENV--FNGFHIVLLELTL--QSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ ++ F+ V +E+ Q I SN+ P P + YGFGN
Sbjct: 115 LADCTKYSAFNFVSIEIGSDGQPAQIKLLSNVPPTLLPF-----------EQGQCYGFGN 163
Query: 141 S--QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
S +C PF+KV +G+++F +IV +F +S+ + L +L L+KNK +PD E++R+A
Sbjct: 164 SLPEC---PFEKVQYGRKQFEQIVKEFG-QSDVETLSAELLQLLKNKHKFWPDAELERRA 219
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDY--ETKKWIHTH 256
+ + S + V I+ YG+RTHS+ILVD NN M F E T+ + +W HTH
Sbjct: 220 PS-----WGEGLSALNVHIAEHGYGSRTHSLILVDANNKMHFIEETMRGLDPSAEWEHTH 274
Query: 257 IRKTL 261
I K
Sbjct: 275 IEKQF 279
>gi|149019787|gb|EDL77935.1| similar to Ser/Thr-rich protein T10 in DGCR region (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 225
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/238 (31%), Positives = 121/238 (50%), Gaps = 34/238 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 2 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 59
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+++ +L+ ++ V N+ E + V YG
Sbjct: 60 TEGHLYNGFNLIAADLSTAKGDV---------------VCYYGNRGEPEPIVLTPGTYGL 104
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN---LIQNILDLMKNKQSNYPDPEID 195
N+ + P++K+ FGK+ F E V +SES L+ +L+++ N+++ PDP I+
Sbjct: 105 SNALLET-PWKKLCFGKQLFTEAV----ERSESLPKDILVTQLLEVLNNEEAQLPDPAIE 159
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + + +Y+ VCV S YGTRT++IILVD + + F E + LD +T W
Sbjct: 160 DQGQ-EYIQPILSKYAAVCV--RSATYGTRTNTIILVDADGHVTFTERSMLDKDTSCW 214
>gi|405970107|gb|EKC35040.1| Ser/Thr-rich protein T10 in DGCR region [Crassostrea gigas]
Length = 838
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 73/236 (30%), Positives = 123/236 (52%), Gaps = 23/236 (9%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
L+ + V+G TWL S GK+G+LLN+ E + RG LV ++ + + Y+
Sbjct: 84 LQSGVEHVEGCTWLGMSKTGKVGVLLNI-KEIPEEDKESRGCLVKDFLSCDLNCQSYIEK 142
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQ-ENKDHVYGF 138
K +++ +NGFH++LL+ T + +F+N + +++ N + D
Sbjct: 143 CVKPKQDS-YNGFHLLLLDCRSPHTEMAYFNN---------RDSDMKNGEIPLDDKCVCL 192
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK- 197
GNS P+QKV GK++F +I+ +FN+ + L +LD +K+K + YPD + RK
Sbjct: 193 GNSLSARTPWQKVEEGKKRFMDIIEQFNQVDKKSELTSKLLDFLKDK-TKYPDDPVLRKQ 251
Query: 198 --------AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+EAD+ D + S + V + YGTRT+SII +D++ +F E TL
Sbjct: 252 SEITNPNGSEADIARDVD-QLSSLFVYMPEYRYGTRTNSIITIDYDGQCEFMEKTL 306
>gi|157820081|ref|NP_001101793.1| transport and golgi organization 2 homolog [Rattus norvegicus]
gi|149019785|gb|EDL77933.1| similar to Ser/Thr-rich protein T10 in DGCR region (predicted),
isoform CRA_a [Rattus norvegicus]
gi|149019786|gb|EDL77934.1| similar to Ser/Thr-rich protein T10 in DGCR region (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 276
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 75/238 (31%), Positives = 121/238 (50%), Gaps = 34/238 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 53 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+++ +L+ ++ V N+ E + V YG
Sbjct: 111 TEGHLYNGFNLIAADLSTAKGDV---------------VCYYGNRGEPEPIVLTPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN---LIQNILDLMKNKQSNYPDPEID 195
N+ + P++K+ FGK+ F E V +SES L+ +L+++ N+++ PDP I+
Sbjct: 156 SNALLET-PWKKLCFGKQLFTEAV----ERSESLPKDILVTQLLEVLNNEEAQLPDPAIE 210
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + + +Y+ VCV S YGTRT++IILVD + + F E + LD +T W
Sbjct: 211 DQGQ-EYIQPILSKYAAVCV--RSATYGTRTNTIILVDADGHVTFTERSMLDKDTSCW 265
>gi|1711658|sp|P54797.1|T10_MOUSE RecName: Full=Ser/Thr-rich protein T10 in DGCR region
gi|406263|emb|CAA52612.1| T10 [Mus musculus]
Length = 276
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 75/239 (31%), Positives = 119/239 (49%), Gaps = 36/239 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 53 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+I+ +L+ ++ V N+ E + V YG
Sbjct: 111 TEGHLYNGFNIIAADLSTSKGDV---------------VCYYGNRGEPEPIVLTPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + + L+ +LD++ N+++ PDP I+
Sbjct: 156 SNALLET-PWKKLCFGKQLFMEAVEQ-SEALPKDVLVTQLLDVLNNEEAQLPDPAIE--- 210
Query: 199 EADMDEDYKM----RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
D ++Y +Y+ V +S YGTRT++IILVD N + F E + LD +T +W
Sbjct: 211 --DQGQEYVQPILNKYAAVWCRCAS--YGTRTNTIILVDANGHVTFTERSMLDKDTSRW 265
>gi|194043487|ref|XP_001929679.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 1 [Sus
scrofa]
Length = 276
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/222 (29%), Positives = 115/222 (51%), Gaps = 17/222 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L P+ R RG LV ++ + YL + KE
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLDPDARGRGELVSHFLTTDMDSLSYLKKVSKE 112
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+++NGF+++ +L+ + +I + D P + YG N+
Sbjct: 113 G--HLYNGFNLIAADLSTEKGDIICYYGNRGNDKPVVL----------EPGTYGLSNALL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F E V + ++ L+ +LD++ N+++ PDP I+ + +
Sbjct: 161 DT-PWKKLCFGKQLFLEAVGR-SQALPKDALVAQLLDMLNNEEAQLPDPAIEDQGR-EYV 217
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+ + +Y+ VCV S YGTRT+++ILVD + + F E ++
Sbjct: 218 QPFLSKYAAVCVRCSD--YGTRTNTVILVDADGHVTFTERSM 257
>gi|432094834|gb|ELK26242.1| hypothetical protein MDA_GLEAN10020097 [Myotis davidii]
Length = 316
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 116/230 (50%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L P+ R RG LV ++ + YL K
Sbjct: 93 EEGKEGGTWLGISTRGKLAALTNYLQPRQDPDARGRGELVTHFLTADVDSFSYLK--KVS 150
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
TE +++NGF+++ +L+ + +++ + P YG N+
Sbjct: 151 TEGHLYNGFNLLAADLSTEKGDVFCYYGNQGEPEPIVLAPG----------TYGLSNALL 200
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK F E V + ++ + L+ +LD++ N+++ PDP I+ + +
Sbjct: 201 DT-PWRKLCFGKRLFLEAVER-SQALPKEVLVAQLLDVLNNEEAQLPDPAIEDQGR-EYV 257
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT+++ILVD + + F E + LD + +W
Sbjct: 258 QPILSKYAAVCVRCPG--YGTRTNTVILVDVDGHVTFTERSMLDKDPSRW 305
>gi|260798558|ref|XP_002594267.1| hypothetical protein BRAFLDRAFT_201357 [Branchiostoma floridae]
gi|229279500|gb|EEN50278.1| hypothetical protein BRAFLDRAFT_201357 [Branchiostoma floridae]
Length = 289
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 119/219 (54%), Gaps = 17/219 (7%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GGTWL + G+L L N+L + PN + RG L+ ++ G ++ YL DL KE +
Sbjct: 58 EGGTWLGMTETGRLSALTNILQPSPDPNAKGRGSLITDFLRGDQTPLAYLEDLAKEG--H 115
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++NGF++V ++L+ ++ Y++SN++ D P KV + Y NS + P
Sbjct: 116 LYNGFNLVTMDLSKDASLAYYYSNVS-QDGP--KVLSPGQ--------YVISNSLLHT-P 163
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
FQK GK+ F EI+ + + + I+++L L+ N DP++ + + E +
Sbjct: 164 FQKASNGKKLFREILLE-KEEGDKDAFIESLLQLLDNNVQFPDDPQVKLQG-THLPETVR 221
Query: 208 MRYSRVCVDIS-SIFYGTRTHSIILVDHNNVMDFHEWTL 245
Y+ + V + YGTRT+++ILVDHNN + F E T+
Sbjct: 222 HGYTAIRVRTPRAANYGTRTNTVILVDHNNHVTFLEKTM 260
>gi|195043900|ref|XP_001991712.1| GH11912 [Drosophila grimshawi]
gi|193901470|gb|EDW00337.1| GH11912 [Drosophila grimshawi]
Length = 282
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/243 (34%), Positives = 119/243 (48%), Gaps = 32/243 (13%)
Query: 28 KGGTWLATSTNG---KLGILLNVLGENSRPNGRDRGPLVVKYVE---GQKSAEEYLTDLK 81
+GGTWLA + G K+G LLN+ GE + RG +V YV GQ Y L
Sbjct: 56 EGGTWLAIGSEGGIFKVGALLNLTGEPKPRDAVGRGMIVADYVRQSAGQYDINNYNQHLL 115
Query: 82 KETEENVFNGFHIVLLEL--TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ N ++ F+ V +E+ + + I SN+ P T + YGFG
Sbjct: 116 ADC--NKYSAFNFVSIEIGDSTKPAQIKLLSNVPP-----------TLASFQQGQCYGFG 162
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
NS S PF+KV G+ +F +IV++ S+ L +L L+KNK +PD E+ R+A
Sbjct: 163 NSLPDS-PFEKVRHGQTQFEQIVSQHGH-SDVATLSAELLQLLKNKHKFWPDGELKRRAP 220
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT---LDYETKKWIHTH 256
+ + S + V I YG+RTHS+ILVD NN + F E T LD +W TH
Sbjct: 221 S-----WGQGLSALNVHIEEHGYGSRTHSLILVDANNKLHFIEETMLGLD-PNGEWERTH 274
Query: 257 IRK 259
I K
Sbjct: 275 IEK 277
>gi|195393258|ref|XP_002055271.1| GJ19277 [Drosophila virilis]
gi|194149781|gb|EDW65472.1| GJ19277 [Drosophila virilis]
Length = 282
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/243 (33%), Positives = 120/243 (49%), Gaps = 28/243 (11%)
Query: 28 KGGTWLAT-STNG--KLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
+GGTWLA S +G K+G LLN+ GE N RG +V YV Q E + + +
Sbjct: 56 EGGTWLAIGSAHGIFKVGALLNLTGEPKPRNAVGRGMIVADYVR-QSDVEHNIINYNQSL 114
Query: 85 EENV--FNGFHIVLLEL--TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ ++ F+ V +E+ Q I SN+ P P + YGFGN
Sbjct: 115 LADCTKYSAFNFVSIEIGDVSQPAQIKLLSNVPPTLMPF-----------EQGQCYGFGN 163
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
S S PF+KV G+++F +IV++ S+ L +L L+KNK +PD E+ R+A +
Sbjct: 164 SLPDS-PFEKVRHGQQQFEQIVSEHGH-SDVATLSAELLQLLKNKHKFWPDAELKRRAPS 221
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDY--ETKKWIHTHIR 258
+ S + V I YG+RTHS+ILVD NN + F E T+ + +W THI
Sbjct: 222 -----WGEGLSALNVHIPEHGYGSRTHSLILVDANNKLHFIEETMAGLDPSGEWERTHIE 276
Query: 259 KTL 261
K
Sbjct: 277 KQF 279
>gi|417409552|gb|JAA51275.1| Hypothetical protein, partial [Desmodus rotundus]
Length = 306
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 115/230 (50%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + PN R RG LV ++ + YL K
Sbjct: 83 EEGKEGGTWLGISTRGKLAALTNYLQPHKDPNARGRGELVAHFLTTDMDSFSYLK--KVS 140
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
TE +++NGF+++ +L+ + +++ + P YG N+
Sbjct: 141 TEGHLYNGFNLIAADLSTEKGDVFCYYGNQGEPEPIVLAPG----------TYGLSNALL 190
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F E V + ++ LI +LD++ N+++ PDP ++ + +
Sbjct: 191 ET-PWKKLCFGKQLFLEAVEQ-SQALPKDVLIAQLLDVLNNEEAQLPDPALEDQGREYL- 247
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+Y+ VCV YGTRT+++ILVD + + F E + LD + W
Sbjct: 248 RPILSKYAAVCVRCPD--YGTRTNTVILVDADGHVTFTERSMLDKDPSHW 295
>gi|156545790|ref|XP_001605757.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Nasonia
vitripennis]
Length = 272
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 127/228 (55%), Gaps = 37/228 (16%)
Query: 28 KGGTWLATST-NGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEE 86
+GGTWLA + N ++G+LLN+ G G+ RG LV Y+ KS E+ +L K +E
Sbjct: 56 EGGTWLALNVKNARVGVLLNLNGVPKSIEGKGRGFLVRDYLTSTKSTIEFAKNLHKVNQE 115
Query: 87 -NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTS 145
+N +++V++E L+++++Y+ S + L+ + + T + G GNS
Sbjct: 116 TQSYNPYNLVMVE--LKNSDVYYLS--SELNHQEPTILSQT--------ILGCGNS-GME 162
Query: 146 QPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
+P++KV+ GK KF +IVN N + QN LI+++L L+K+++ PD E+ +++ +E
Sbjct: 163 KPYKKVVSGKNKFEKIVN--NASTAQQNDLIEDLLQLLKHEEKYLPDDELQKRSPNAFNE 220
Query: 205 DYKMRYSRVCVDISSIF-------YGTRTHSIILVDHNNVMDFHEWTL 245
+SSIF YGTRTHSI+LVD N + F E TL
Sbjct: 221 ------------LSSIFVQHEKENYGTRTHSIVLVDDLNNLTFVEETL 256
>gi|312371635|gb|EFR19769.1| hypothetical protein AND_21842 [Anopheles darlingi]
Length = 298
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 116/260 (44%), Gaps = 42/260 (16%)
Query: 28 KGGTWLATSTNG----KLGILLNVLGENSRPNGRDRGPLVVKYV--EGQKSAEEYLTDLK 81
+GGTWLA + +LG LLNV GE RGP+V Y+ G +A Y L
Sbjct: 56 EGGTWLAIGNHPPGRIRLGALLNVTGETKANVTNGRGPIVANYLTSNGDNTARYYSEQLL 115
Query: 82 KETEENVFNGFHIVLLELTLQSTN---------------IYHFSNIAPLDSPSTKVTNVT 126
FN I L L + + + H SN + +P
Sbjct: 116 ANDNYGAFNFVSIDLATLVEPANSGGDHAAPESEDGIGTVLHTSNTPHIIAPCAL----- 170
Query: 127 NKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRK-SESQNLIQNILDLMKNK 185
GFGNS +P QKV G+E+F EIV N +++ + +++DL+K+
Sbjct: 171 ------GKALGFGNSTL-DKPLQKVCRGRERFEEIVTNRNVPFADTDAFVDDLMDLLKSD 223
Query: 186 QSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+YPDPE+ R+A + S V V + YG+RT +IIL+DH N M+F E T+
Sbjct: 224 VQHYPDPELTRRAG-----QHAEYLSSVNVRVPGGRYGSRTRTIILIDHQNHMEFIEETM 278
Query: 246 ---DYETKKWIHTHIRKTLN 262
D E W + I + L+
Sbjct: 279 VNSDPENPDWKTSRIVRDLS 298
>gi|307173421|gb|EFN64374.1| Uncharacterized protein C22orf25 [Camponotus floridanus]
Length = 276
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/226 (33%), Positives = 118/226 (52%), Gaps = 34/226 (15%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENS--RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE-T 84
+GGTWLA S G+ ++LN++ EN+ + RGPL+ +V S E YL L KE
Sbjct: 56 EGGTWLAVSMRGRAAVILNLVNENNVISSAKKSRGPLISNFVTSNDSIEAYLNQLHKENI 115
Query: 85 EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
+ +N + L+ L L++ N Y+ S+ A P K + D + G GNS
Sbjct: 116 DGQPYNPY--CLVSLDLKNANTYYLSSDAKSTGP---------KMCDSD-IIGIGNSG-M 162
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
++KV GK++F +IV N S+ LI+ +++ +K++ PDP++ +
Sbjct: 163 DDSYKKVEVGKKEFKQIVQNVN-ISKQNILIEELINFLKSQTKCLPDPKLQK-------- 213
Query: 205 DYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTL 245
+Y Y ++SSIF YGTRTHSI+L+D +N + F E TL
Sbjct: 214 NYPTTYE----ELSSIFVSGDEYGTRTHSILLIDGSNQVTFVEETL 255
>gi|149758675|ref|XP_001488122.1| PREDICTED: uncharacterized protein C22orf25 homolog [Equus
caballus]
Length = 276
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 114/230 (49%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L P+ R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLDPDARGRGELVAHFLTTDMDSLTYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E +++NGF+++ +L+ + ++ + P YG N+
Sbjct: 111 AEGHLYNGFNLIAADLSTEKGDVVCYYGNRGDPEPVVLAPG----------TYGLSNALL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F +V + ++ +LI +LD++ N ++ PDP I+ + +
Sbjct: 161 ET-PWRKLCFGKQLFLAVVEQ-SQALPKDDLIAQLLDVLNNDEAQLPDPAIEDQGR-EYV 217
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT+++ILVD + + F E + LD + +W
Sbjct: 218 QPILSKYAAVCVRCPD--YGTRTNTVILVDADGHVTFTERSMLDKDPSRW 265
>gi|440899624|gb|ELR50899.1| hypothetical protein M91_13657 [Bos grunniens mutus]
Length = 276
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L N R RG LV +++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLNHNARGRGELVAQFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
E +++NGF+++ +L+ + ++ + N+ E + V YG
Sbjct: 111 AEGHLYNGFNLIAADLSAEKGDV---------------ICYYGNRGEREPVVLAPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + R+ L+ +LD++ N ++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVEQ-GRELPRDALVAQLLDVLSNDEAQLPDPAIEAQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT+++ILVD + + F E + L + +W
Sbjct: 214 R-EYVRPILSKYAAVCVRCPD--YGTRTNTVILVDADGHVTFTERSMLGSDPTRW 265
>gi|348585353|ref|XP_003478436.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Cavia
porcellus]
Length = 276
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/236 (30%), Positives = 121/236 (51%), Gaps = 21/236 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKLG L N L + R RG LV ++ + YL + +E
Sbjct: 53 EEGREGGTWLGISTRGKLGALTNYLQPQLDRDTRGRGELVTNFLTTDMDSLSYLKKVSQE 112
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+++NGF+++ +L T + I ++ N P V YG N+
Sbjct: 113 G--HLYNGFNLIAADLSTAKGDVICYYGNRG---EPEPIVLT--------PGTYGLSNAL 159
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ FGK+ F + V K + L+ +LD++ N+++ PDP I+ + + +
Sbjct: 160 LET-PWKKLCFGKQLFLKAV-KQSEALPKDALVAQLLDVLNNEEAQLPDPAIEDQGQ-EY 216
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW-IHTH 256
+ +Y+ VCV + YGTRT++IILVD N + F E + LD + +W +TH
Sbjct: 217 IQSILSKYAAVCV--RGLDYGTRTNTIILVDANGQVTFTERSMLDKDPSRWETNTH 270
>gi|380786085|gb|AFE64918.1| uncharacterized protein C22orf25 [Macaca mulatta]
gi|383409805|gb|AFH28116.1| hypothetical protein LOC128989 [Macaca mulatta]
Length = 276
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 117/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI N+LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIANLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + +W
Sbjct: 214 -GEYVQPVLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSRW 265
>gi|355784796|gb|EHH65647.1| hypothetical protein EGM_02446 [Macaca fascicularis]
Length = 317
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 117/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 196
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI N+LD++ N+++ PDP I+ +
Sbjct: 197 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIANLLDVLNNEEAQLPDPAIEDQG 254
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + +W
Sbjct: 255 -GEYVQPVLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSRW 306
>gi|355563467|gb|EHH20029.1| hypothetical protein EGK_02795 [Macaca mulatta]
Length = 317
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 117/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 196
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI N+LD++ N+++ PDP I+ +
Sbjct: 197 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIANLLDVLNNEEAQLPDPAIEDQG 254
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + +W
Sbjct: 255 -GEYVQPVLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSRW 306
>gi|351715467|gb|EHB18386.1| hypothetical protein GW7_11143 [Heterocephalus glaber]
Length = 276
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 115/230 (50%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKLG L N L + R RG LV ++ + YL + +E
Sbjct: 53 EEGKEGGTWLGISTGGKLGALTNYLQPRLDQDTRGRGELVTNFLTTDMDSLSYLKKVSQE 112
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+++NGF+++ +L+ ++ + P T + YG N+
Sbjct: 113 G--HLYNGFNLIAADLSTAKGDVICYYGNRGEPEPITLMPG----------TYGLSNALL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F E V + ++ + +LD++ N+++ PDP I+ + + +
Sbjct: 161 ET-PWRKLCFGKQLFLEAVEQ-SQVLPKDAFVAQLLDVLNNEEAQLPDPAIEDQGQ-EYI 217
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT++IILVD N + F E + LD + W
Sbjct: 218 QPLLNKYAAVCVRCPG--YGTRTNTIILVDANGQVTFTERSMLDKDPSCW 265
>gi|296191366|ref|XP_002743596.1| PREDICTED: uncharacterized protein C22orf25 isoform 2 [Callithrix
jacchus]
Length = 309
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 86 EEGKEGGTWLGISTRGKLAALTNYLQPQLDRQARGRGELVTHFLTTDVDSLSYLK--KVS 143
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 144 VEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 188
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 189 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIADLLDVLNNEEAQLPDPAIEDQG 246
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +YS VCV YGTRT++IILVD + + F E + LD + W
Sbjct: 247 R-EYVQPILSKYSAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMLDKDPSHW 298
>gi|395858824|ref|XP_003801758.1| PREDICTED: uncharacterized protein C22orf25 homolog [Otolemur
garnettii]
Length = 276
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 112/231 (48%), Gaps = 20/231 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLDREARGRGELVTHFLTTDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF+++ +L T + I ++ N P V YG N+
Sbjct: 111 MEGHLYNGFNLIAADLSTTRGDIICYYGNRG---DPEPVVL--------APGTYGLSNAL 159
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ FGK F E V + + L+ +LD++ N+++ PDP I+ + + +
Sbjct: 160 LET-PWRKLCFGKRLFQEAVERC-QALPKDTLVAQLLDVLSNEEAQLPDPAIEDQGQ-EY 216
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEW-TLDYETKKW 252
+ +YS VCV YGTRT++IILVD + + F E LD + +W
Sbjct: 217 VRPFLSKYSAVCVRCPG--YGTRTNTIILVDTDGHVTFTERNMLDKDPSRW 265
>gi|126012559|ref|NP_001030405.2| uncharacterized protein C22orf25 homolog [Bos taurus]
gi|116247766|sp|Q29RZ5.1|CV025_BOVIN RecName: Full=Uncharacterized protein C22orf25 homolog
gi|88758681|gb|AAI13298.1| Chromosome 22 open reading frame 25 ortholog [Bos taurus]
Length = 276
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 113/235 (48%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L N R RG LV +++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLNRNARGRGELVAQFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
E +++NGF+++ +L+ + ++ + N+ E + V YG
Sbjct: 111 AEGHLYNGFNLIAADLSAEKGDV---------------ICYYGNRGEREPVVLAPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + R+ L+ +L ++ N ++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-GRELPRDALVAQLLAVLSNDEAQLPDPAIEAQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT ++ILVD + + F E + L + +W
Sbjct: 214 R-EYVRPILSKYAAVCVRCPD--YGTRTSTVILVDADGHVTFTERSMLGSDPTRW 265
>gi|380016632|ref|XP_003692282.1| PREDICTED: uncharacterized protein C22orf25-like [Apis florea]
Length = 271
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 82/248 (33%), Positives = 123/248 (49%), Gaps = 41/248 (16%)
Query: 7 KPNPCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPN--GRDRGPLV 63
K +P G I+M + +GGTWL S GK G++LN+ E S N + RG L+
Sbjct: 40 KNHPVCLGGIDM-----ESGKEGGTWLGVSLTGKAGVVLNLSSLEKSSTNIPKQGRGFLI 94
Query: 64 VKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVT 123
++ + SA YL L K+ +E +N F +VL L LQ+ ++ + SN P++
Sbjct: 95 SNFIISKDSATSYLDQLHKKNKEVQYNPFLLVL--LNLQNADVKYLSNSQTSKGPTSA-- 150
Query: 124 NVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMK 183
+D++ GF N ++KV GKE F I+ + S+ LI+ +L MK
Sbjct: 151 --------EDNILGFSNG--FGVLYKKVEAGKEIFKNIIKNV-KVSKQTILIEELLTFMK 199
Query: 184 NKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIF------YGTRTHSIILVDHNNV 237
+K+ PD A++ + Y RY + SSIF Y TRTHSI+LV+ NN
Sbjct: 200 SKKRYLPD--------AELQKSYPTRYK----EYSSIFVSTDWGYCTRTHSILLVNGNNE 247
Query: 238 MDFHEWTL 245
+ F E TL
Sbjct: 248 ITFIEETL 255
>gi|402883557|ref|XP_003905280.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 3
[Papio anubis]
Length = 317
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 117/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 196
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 197 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLISDLLDVLNNEEAQLPDPAIEDQG 254
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IIL+D + + F E + +D + +W
Sbjct: 255 -GEYVQPVLSKYAAVCVRCPG--YGTRTNTIILIDADGHVTFTERSMMDKDLSRW 306
>gi|403304282|ref|XP_003942734.1| PREDICTED: uncharacterized protein C22orf25 homolog [Saimiri
boliviensis boliviensis]
gi|403304284|ref|XP_003942735.1| PREDICTED: uncharacterized protein C22orf25 homolog [Saimiri
boliviensis boliviensis]
Length = 276
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 113/227 (49%), Gaps = 27/227 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGREGGTWLGISTRGKLAALTNYLQPQLDRQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 VEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIADLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+ + +YS VCV YGTRT++IILVD + + F E ++
Sbjct: 214 R-EYVQPILSKYSAVCVRCPG--YGTRTNTIILVDADGHVTFTERSM 257
>gi|444724091|gb|ELW64710.1| hypothetical protein TREES_T100000630 [Tupaia chinensis]
Length = 225
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL +T GKL L N L R RG LV ++ + YL K
Sbjct: 2 EEGREGGTWLGITTRGKLAALTNYLQPRLDLEARGRGELVTHFLTTDMDSLSYLK--KVS 59
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+++ +L+ ++ V N+ E + V YG
Sbjct: 60 TEGHLYNGFNLIAADLSTAKGDV---------------VCYYGNRGEPEPVVLTPGTYGL 104
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK F E V + ++ L+ +LD++ N+++ PDP I+ +
Sbjct: 105 SNALLET-PWRKLCFGKRLFLEAVER-SQALPKDALVAQLLDVLNNEEAQLPDPAIEDQG 162
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT+++ILVD + + F E + LD + +W
Sbjct: 163 R-EYVQPMLSKYAAVCVRCPD--YGTRTNTVILVDADGHVTFTERSMLDKDPSRW 214
>gi|402883553|ref|XP_003905278.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 1
[Papio anubis]
gi|402883555|ref|XP_003905279.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 2
[Papio anubis]
Length = 276
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 117/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLISDLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IIL+D + + F E + +D + +W
Sbjct: 214 -GEYVQPVLSKYAAVCVRCPG--YGTRTNTIILIDADGHVTFTERSMMDKDLSRW 265
>gi|291413069|ref|XP_002722796.1| PREDICTED: transport and golgi organization 2-like [Oryctolagus
cuniculus]
Length = 276
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 117/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L P R RG LV ++ + YL K
Sbjct: 53 EEGREGGTWLGISTRGKLAALTNYLQPRQDPEARGRGELVTHFLTTDLDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
E +++NGF+++ +L+ ++ V N+ E + V YG
Sbjct: 111 VEGHLYNGFNLLAADLSTTKGDV---------------VCYYGNRGEPEPIVLAPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F + + + ++ L+ +LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLQALER-SQALPKDALVGQLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV + YGTRT++IILVD + + F E + LD + +W
Sbjct: 214 R-EYVQPILSKYAAVCVRGPN--YGTRTNTIILVDTDGHVTFTERSMLDRDPTRW 265
>gi|27371221|gb|AAH41339.1| Chromosome 22 open reading frame 25 [Homo sapiens]
Length = 276
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAANLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ NK++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNKEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 214 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 265
>gi|19584311|emb|CAD28454.1| hypothetical protein [Homo sapiens]
Length = 224
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 1 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 58
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 59 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 103
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 104 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIEDQG 161
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 162 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 213
>gi|343961337|dbj|BAK62258.1| em:AC006547.3 protein [Pan troglodytes]
gi|410211604|gb|JAA03021.1| chromosome 22 open reading frame 25 [Pan troglodytes]
gi|410261324|gb|JAA18628.1| chromosome 22 open reading frame 25 [Pan troglodytes]
gi|410293096|gb|JAA25148.1| chromosome 22 open reading frame 25 [Pan troglodytes]
gi|410293098|gb|JAA25149.1| chromosome 22 open reading frame 25 [Pan troglodytes]
gi|410351909|gb|JAA42558.1| chromosome 22 open reading frame 25 [Pan troglodytes]
Length = 276
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 VEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 214 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 265
>gi|126324743|ref|XP_001376894.1| PREDICTED: uncharacterized protein C22orf25-like [Monodelphis
domestica]
Length = 276
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 120/239 (50%), Gaps = 36/239 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GG+WL S GKL L N + + + RG LV ++ + + YL K
Sbjct: 53 EEGKEGGSWLGISKKGKLAALTNYMQPKLDKDAKGRGELVTHFLTTEVDSFSYLK--KIS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIY-HFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
TE +++NGF+++ +L +++ ++ N D SP VYG
Sbjct: 111 TEGHLYNGFNLIAADLNTNKGDVFCYYGNRGEHDPIVLSPG---------------VYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
NS + P++K+ +GK+ F E++N+ ++ ++LI +L +M N++ PDP I+
Sbjct: 156 SNSLLET-PWKKLQYGKQLFVEVINQ-SQDLAKEDLISELLKVMNNEECQLPDPAIE--- 210
Query: 199 EADMDEDYKM----RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKW 252
D +DY + +Y+ VCV YGTRT++I+L+D + F E ++ D + W
Sbjct: 211 --DQGKDYVLPILNKYAAVCVRCPG--YGTRTNTIVLIDAEGNVTFTERSMIDEDVTHW 265
>gi|73995915|ref|XP_850154.1| PREDICTED: uncharacterized protein C22orf25 [Canis lupus
familiaris]
Length = 276
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 112/230 (48%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRQDRDARGRGELVAHFLTTDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E +++NGF+++ +L+ + ++ + P YG N+
Sbjct: 111 AEGHLYNGFNLIAADLSTEKGDVICYYGNRGEPEPVVLAPG----------TYGLSNALL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F E V + ++ LI +L+++ N ++ PDP I+ + +
Sbjct: 161 ET-PWRKLCFGKQLFLEAVER-SQALPKDILIAQLLNVLNNDEAQLPDPAIEDQGR-EYV 217
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT+++ILVD + + F E + LD + W
Sbjct: 218 QPFLSKYAAVCVRCPG--YGTRTNTVILVDADGHVTFTERSMLDKDPSCW 265
>gi|119623408|gb|EAX03003.1| chromosome 22 open reading frame 25, isoform CRA_c [Homo sapiens]
gi|221040672|dbj|BAH12013.1| unnamed protein product [Homo sapiens]
gi|221045500|dbj|BAH14427.1| unnamed protein product [Homo sapiens]
Length = 317
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 196
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 197 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIEDQG 254
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 255 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 306
>gi|397485973|ref|XP_003814110.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 3 [Pan
paniscus]
Length = 317
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 196
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 197 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIEDQG 254
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 255 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 306
>gi|190570176|ref|NP_690870.3| uncharacterized protein C22orf25 [Homo sapiens]
gi|397485969|ref|XP_003814108.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 1 [Pan
paniscus]
gi|397485971|ref|XP_003814109.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 2 [Pan
paniscus]
gi|74709518|sp|Q6ICL3.1|CV025_HUMAN RecName: Full=Uncharacterized protein C22orf25
gi|47678241|emb|CAG30241.1| Em:AC006547.3 [Homo sapiens]
gi|109451270|emb|CAK54496.1| C22orf25 [synthetic construct]
gi|109451848|emb|CAK54795.1| C22orf25 [synthetic construct]
gi|119623406|gb|EAX03001.1| chromosome 22 open reading frame 25, isoform CRA_a [Homo sapiens]
gi|119623410|gb|EAX03005.1| chromosome 22 open reading frame 25, isoform CRA_a [Homo sapiens]
gi|261859460|dbj|BAI46252.1| chromosome 22 open reading frame 25 [synthetic construct]
Length = 276
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 214 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 265
>gi|297708290|ref|XP_002830902.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 2
[Pongo abelii]
gi|297708292|ref|XP_002830903.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 3
[Pongo abelii]
Length = 276
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 111/224 (49%), Gaps = 27/224 (12%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDMLIASLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + +Y+ VCV YGTRT++IILVD + + F E
Sbjct: 214 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTE 254
>gi|355733759|gb|AES11133.1| Ser/Thr-rich protein T10 in DGCR region [Mustela putorius furo]
Length = 275
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 111/230 (48%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRQDRDARGRGELVTHFLTTDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E +++NGF+++ +L+ + ++ + P YG N+
Sbjct: 111 AEGHLYNGFNLIAADLSTEKGDVICYYGNRGEPEPVVLAPG----------TYGLSNALL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F E V + ++ LI +L ++ N ++ PDP I+ + +
Sbjct: 161 ET-PWKKLCFGKQLFLEAVER-SQALPKDVLIAQLLHVLNNDEAQLPDPAIEDQGR-EYV 217
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + LD + W
Sbjct: 218 QPFLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMLDKDPSCW 265
>gi|426393564|ref|XP_004063088.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 2
[Gorilla gorilla gorilla]
Length = 317
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 196
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 197 SNALLET-PWRKLCFGKQLFLETVER-SQALPKDVLITSLLDVLNNEEAQLPDPAIEDQG 254
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 255 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 306
>gi|426393562|ref|XP_004063087.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 1
[Gorilla gorilla gorilla]
gi|426393566|ref|XP_004063089.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 3
[Gorilla gorilla gorilla]
gi|426393568|ref|XP_004063090.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 4
[Gorilla gorilla gorilla]
Length = 276
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLETVER-SQALPKDVLITSLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 214 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 265
>gi|410977158|ref|XP_003994977.1| PREDICTED: uncharacterized protein C22orf25 homolog [Felis catus]
Length = 276
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRQDRDARGRGELVTHFLTTDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
E +++NGF+++ +L+ + ++ V N+ E + V YG
Sbjct: 111 AEGHLYNGFNLIAADLSTEKGDV---------------VCYYGNRGEPEPVVLAPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F + V + +++ LI +L ++ N ++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLDAVER-SQELPKDALIAQLLHVLNNDEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + +Y+ VCV YGTRT+++ILVD + + F E + LD + W
Sbjct: 214 R-EYVQPFLSKYAAVCVRCPG--YGTRTNTVILVDADGHVTFTERSMLDKDPSCW 265
>gi|157130999|ref|XP_001655770.1| hypothetical protein AaeL_AAEL011962 [Aedes aegypti]
gi|157131001|ref|XP_001655771.1| hypothetical protein AaeL_AAEL011962 [Aedes aegypti]
gi|157131003|ref|XP_001655772.1| hypothetical protein AaeL_AAEL011962 [Aedes aegypti]
gi|108871679|gb|EAT35904.1| AAEL011962-PB [Aedes aegypti]
gi|108871680|gb|EAT35905.1| AAEL011962-PA [Aedes aegypti]
gi|108871681|gb|EAT35906.1| AAEL011962-PC [Aedes aegypti]
Length = 282
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 115/237 (48%), Gaps = 19/237 (8%)
Query: 28 KGGTWLATSTNG---KLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
+GGTWLA K+G LLNV GE+ RGP+V Y+ G S EEY L
Sbjct: 58 EGGTWLAIGGRDGIVKIGALLNVTGESKANITSGRGPIVANYLVGPLSNEEYSKQL---L 114
Query: 85 EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
+ + F++V +++ +I ++ + K+ + D GFGNS
Sbjct: 115 AIDNYGPFNLVSVQMDTNGGSIGGGVSVLHTSNSPCKIDHC-----ELDGPLGFGNSPL- 168
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
P +KV G+++F +IV+K + L+ ++ L+K+ + ++PD E+ R+
Sbjct: 169 EVPLEKVKHGRKQFGDIVDKRLEVQSKEALVDELMGLLKSDEIHFPDKELSRRV-----P 223
Query: 205 DYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK--KWIHTHIRK 259
+ + + + V YG+RT ++IL+DH N MDF E T+ +W THI +
Sbjct: 224 QFTEKLASINVRFPESGYGSRTRTVILIDHENHMDFIEDTMVTADPDGEWKRTHIER 280
>gi|432875348|ref|XP_004072797.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Oryzias
latipes]
Length = 276
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/249 (30%), Positives = 125/249 (50%), Gaps = 33/249 (13%)
Query: 12 TNGLIEMLLEVSQDAVK-GGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQ 70
TNG E+L + Q+ K GG+WL + GKL L N + N + RG LV Y+ +
Sbjct: 42 TNG--EILSGLDQECGKEGGSWLGINKRGKLAALTNYMEGNQNSDAAGRGFLVSNYLM-E 98
Query: 71 KSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQE 130
K + Y K TE +++NGF+++ + + + ++ N SP +
Sbjct: 99 KDLDSYSYLKKVSTEGHLYNGFNLITADFKAKQDTLCYYGNRG---SPEP-------IRL 148
Query: 131 NKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYP 190
N +YG NS + P++K++ GK+ F IVN ++ L+Q +L ++ N++ N P
Sbjct: 149 NPAGIYGLSNSLLDT-PWKKLLQGKQHFTNIVN--DQSLSCDGLVQELLTVLNNEELNIP 205
Query: 191 DPEIDRKAEADMDEDYKMRYSRVCVD-ISSIF-----YGTRTHSIILVDHNNVMDFHEWT 244
DP + E + YS+ ++ +SS+F YGTRT+++IL+D + F E T
Sbjct: 206 DPIL---------ESHGSFYSKPLIEALSSVFVRTPNYGTRTNTVILIDAEGNVTFTERT 256
Query: 245 -LDYETKKW 252
LD + KW
Sbjct: 257 MLDSDANKW 265
>gi|9634740|ref|NP_039033.1| T10 gene product [Fowlpox virus]
gi|7271568|gb|AAF44414.1|AF198100_61 ORF FPV070 T10 gene product [Fowlpox virus]
Length = 273
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 114/230 (49%), Gaps = 20/230 (8%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
D GGTWL +T+GK ++ N L PN RG LV Y+ S+ EYL L K
Sbjct: 51 DVKSGGTWLGINTDGKFSVVTNYLQPYEDPNFISRGNLVSDYLTSNISSREYLCYLSKRG 110
Query: 85 EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC- 143
+++NGF+++ + +S ++Y++SN + T + +YG NS
Sbjct: 111 --HLYNGFNLITASFSKESDDLYYYSNRSG-----------TAPERLGTGIYGLSNSLLD 157
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
S P KV GK+ F +I++ ++LI +L+++ + DP I + + D
Sbjct: 158 ISWP--KVCVGKKVFTDIIHAHKNDLNQESLITELLEMLNDTSPLPIDPRIQEQGQ-DFI 214
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKW 252
+S +CV YGTRT++I+ +D + ++F E T+ D +TK++
Sbjct: 215 RPMIKEFSSICVRADG--YGTRTNTIVTIDSHYSVNFIEKTITDMDTKEF 262
>gi|301770467|ref|XP_002920626.1| PREDICTED: uncharacterized protein C22orf25-like [Ailuropoda
melanoleuca]
Length = 276
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRQDRDARGRGELVTHFLTTDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
E +++NGF+++ +L+ + ++ + N+ E + V YG
Sbjct: 111 AEGHLYNGFNLIAADLSTEKGDV---------------ICYYGNRGEPEPIVLAPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI +L ++ N ++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIAELLHVLNNDEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + +Y+ VCV YGTRT+++ILVD + + F E + LD + W
Sbjct: 214 R-EYVQPFLSKYAAVCVRCPG--YGTRTNTVILVDTDGHVTFTERSMLDKDPSCW 265
>gi|56119064|ref|NP_001007837.1| transport and golgi organization 2 homolog [Gallus gallus]
gi|53136540|emb|CAG32599.1| hypothetical protein RCJMB04_30h1 [Gallus gallus]
Length = 276
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 120/242 (49%), Gaps = 20/242 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL S G++ L N + + N + RG LV ++ YL K
Sbjct: 53 EEGKEGGTWLGISKKGRMAALTNYMQPTTDKNAKGRGALVTNFLTSDLDCYSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF+++ +L T + I ++ N P N +YG N
Sbjct: 111 VEGHLYNGFNLIAADLNTTKGDVICYYGNKG---DPEPVFLN--------PGIYGLSNCL 159
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ +GK+ F E++N+ ++ ++L+Q +L +M N++ PDP I+ + + +
Sbjct: 160 LDT-PWKKLQYGKQLFTEVINR-SQDLAKEDLVQELLTVMNNQEPQLPDPAIEDQGKEYI 217
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKWIHTHIRKTL 261
+Y+ VCV YGTRT++++L+D + F E T+ + + +W + L
Sbjct: 218 RPILN-KYAAVCVRCPG--YGTRTNTVLLIDSEGNVTFTERTMINEDVSQWKTSSYEFKL 274
Query: 262 NM 263
+M
Sbjct: 275 HM 276
>gi|431904429|gb|ELK09814.1| hypothetical protein PAL_GLEAN10007686 [Pteropus alecto]
Length = 327
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 115/231 (49%), Gaps = 20/231 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL + N L RG LV ++ + YL K
Sbjct: 104 EEGKEGGTWLGISTRGKLAAITNYLQPKLDLEAPGRGKLVAHFLTTDMDSLSYLK--KVS 161
Query: 84 TEENVFNGFHIVLLELTLQSTN-IYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF+++ +L+ + + I ++ N P V + YG N+
Sbjct: 162 AEGHLYNGFNLIAADLSTEKGDVICYYGNRG---EPEPVVL--------EPGTYGLSNAL 210
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ FGK+ F E V + + + LI +LD++ N+++ PDP I+ + +
Sbjct: 211 LET-PWKKLCFGKQLFLEAVERC-QALPKEVLITQLLDVLNNEEAQLPDPAIEDQGRESV 268
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT+++ILVD + + F E + LD + +W
Sbjct: 269 -QPILSKYAAVCVRCPD--YGTRTNTVILVDADGHVTFTERSMLDKDPSRW 316
>gi|158260747|dbj|BAF82551.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 115/235 (48%), Gaps = 28/235 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + E + +D + W
Sbjct: 214 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTLTERSMMDKDLSHW 265
>gi|344294902|ref|XP_003419154.1| PREDICTED: uncharacterized protein C22orf25-like [Loxodonta
africana]
Length = 276
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 112/230 (48%), Gaps = 18/230 (7%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTQGKLAALTNYLQPRLDRDARGRGELVTHFLTTNIDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E +++NGF+++ +L+ ++ + P +YG N+
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVVCYYGNRGQPEPVVLTPG----------IYGLSNALL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ FGK+ F E V + S+ LI +LD++ N+++ PDP I + +
Sbjct: 161 DT-PWRKLSFGKQLFQEAVEQSQALSKD-TLIAQLLDVLNNEEAQLPDPAIMDQGR-EYV 217
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV + YGTRT++IILVD + F E + LD + W
Sbjct: 218 QPFLSKYAAVCVRGPN--YGTRTNTIILVDVEGHVTFTERSMLDKDPTCW 265
>gi|348522225|ref|XP_003448626.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like
[Oreochromis niloticus]
Length = 276
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 112/226 (49%), Gaps = 18/226 (7%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GG+WL S GKL + N + P+ + RG LV Y+ K + Y K +E +
Sbjct: 57 EGGSWLGISKRGKLAAITNYMEGRPNPDAQGRGFLVSHYLM-DKDQDSYSYLKKVSSEGH 115
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++NGF+++ E + + ++ N SP N +YG NS + P
Sbjct: 116 LYNGFNLITAEFKAKQDIVCYYGNRG---SPEPIRLNPAG-------IYGLSNSLLDT-P 164
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
++K++ GK+ F +V+ ++ L+Q +L ++ N++ N PDP + + + +
Sbjct: 165 WKKLLKGKQHFTSVVS--DQTLSCDGLVQELLSVLNNEELNTPDPIQETQGDC-YSKSMI 221
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
S VCV S YGTRT++IIL+D + F E T LD +T KW
Sbjct: 222 QALSAVCV--RSPHYGTRTNTIILIDAEGNVIFTERTMLDCDTTKW 265
>gi|291237095|ref|XP_002738475.1| PREDICTED: transport and golgi organization 2-like [Saccoglossus
kowalevskii]
Length = 322
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 128/275 (46%), Gaps = 51/275 (18%)
Query: 29 GGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GGTWL S G+L LLN+L N + RG LV +++G YL + KE ++
Sbjct: 56 GGTWLGISKEGRLASLLNILQPQLDLNAKGRGSLVTDFIKGDVDCLPYLQGIAKEG--HL 113
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF++++++LT +T S+I + S K+ + + +Y N+ +P+
Sbjct: 114 YNGFNLLVMDLTRSNT-----SDIPQIGYYSNKIGK--DPELLCPGIYSLCNAT-IDKPW 165
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM 208
+K I GK+KF EIVN +E + L+ +++ + N + +P+P+I + ++ +
Sbjct: 166 KKAIVGKKKFEEIVNSATTGTE-EKLVDQLIEFL-NDDTPHPEPQIHIQCSGELTPKQQR 223
Query: 209 RYSRVCVD------------------------------ISSIF-----YGTRTHSIILVD 233
S VCV + ++ YGTRT++IILV+
Sbjct: 224 ERSAVCVQSPDMGYGSRYYDTQPSEQNGRSYPDIILTRLGALLCRLPNYGTRTNTIILVN 283
Query: 234 HNNVMDFHEWTL----DYETKKWIHTHIRKTLNMA 264
N + + E TL D E KW+ + L+ A
Sbjct: 284 TANKVKYVERTLEEPIDIENLKWLRNNYDFNLSSA 318
>gi|18859691|ref|NP_572905.1| transport and golgi organization 2, isoform A [Drosophila
melanogaster]
gi|442616246|ref|NP_001259525.1| transport and golgi organization 2, isoform C [Drosophila
melanogaster]
gi|17861960|gb|AAL39457.1| LD02616p [Drosophila melanogaster]
gi|22832210|gb|AAF48295.2| transport and golgi organization 2, isoform A [Drosophila
melanogaster]
gi|220942866|gb|ACL83976.1| Tango2-PA [synthetic construct]
gi|220953052|gb|ACL89069.1| Tango2-PA [synthetic construct]
gi|440216744|gb|AGB95367.1| transport and golgi organization 2, isoform C [Drosophila
melanogaster]
Length = 283
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 117/242 (48%), Gaps = 30/242 (12%)
Query: 28 KGGTWLATSTNG---KLGILLNVLGENSRPNGRDRGPLVVKYV---EGQKSAEEYLTDLK 81
+GGTWLA + K+G LLN+ GE + RG +V YV + + S Y L
Sbjct: 56 EGGTWLAIGHSAGFFKVGALLNLTGEPKPRDAVGRGMIVADYVTRADEEHSILNYNERLL 115
Query: 82 KETEENVFNGFHIVLLEL--TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
K+ + ++ F+ V +E+ Q + SN+ P T + YGFG
Sbjct: 116 KDCTK--YSAFNFVSIEIGSASQPARVKLLSNVPP-----------TLEDFQNGECYGFG 162
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
NS S PF+KV GK++F IV K + ++ + L ++ L++NK +PD E+ +A
Sbjct: 163 NSLPHS-PFEKVRHGKQEFEAIV-KAHGEASVETLSAQLMQLLRNKHKFWPDDELKTRAP 220
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDY--ETKKWIHTHI 257
++ S + V I YG+RTHS++LVD N M F E T+ +W THI
Sbjct: 221 -----NWGEGLSSLNVHIEEHAYGSRTHSVVLVDSENKMHFIEETMTGLDPHGEWNKTHI 275
Query: 258 RK 259
K
Sbjct: 276 EK 277
>gi|405973536|gb|EKC38244.1| Uncharacterized protein C22orf25-like protein [Crassostrea gigas]
Length = 294
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/228 (30%), Positives = 110/228 (48%), Gaps = 23/228 (10%)
Query: 29 GGTWLATSTNGKLGILLNVLGENSRP-NGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
G TWL S GK+ +LLN++ S + RG LV ++ YL K +N
Sbjct: 58 GQTWLGMSKEGKIAVLLNIIEAKSGTFRYQRRGFLVSDFLYSNMDGHSYLEQCIKPKSDN 117
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF---GNSQCT 144
+ GFH++LL+ + T++ +F+N + N+ E+ GF GNS+
Sbjct: 118 -YKGFHLLLLDCSSTKTDVKYFNNRYTI-----------NRSEDIRDDKGFISLGNSRSE 165
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA------ 198
P+ KV GKEKF +IV FN K + +L+L+K+ DP ++ +
Sbjct: 166 KAPWIKVAKGKEKFKDIVKNFNNKDSKSKIESALLELLKDNTVYLNDPVLEMQTKNGKPF 225
Query: 199 EADMDEDYKM-RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
E D+ ++ + S + V + + GTRTHSII VD +F E+TL
Sbjct: 226 EGTSDKAKQIEQLSSLFVYMPELRRGTRTHSIITVDFEGNCEFLEFTL 273
>gi|170062264|ref|XP_001866592.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880234|gb|EDS43617.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 284
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 121/246 (49%), Gaps = 39/246 (15%)
Query: 28 KGGTWLATSTNGKLGI-----LLNVLGENSRPN-GRDRGPLVVKYVEGQKSAEEYLTDLK 81
+GGTWLA ++ GI LLNV GEN +PN RGP+V YV G S E+Y +
Sbjct: 62 EGGTWLAIGSDIPAGIIRVGALLNVTGEN-KPNVTSGRGPIVADYVSGTTSNEDYS---R 117
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTN--VTNKQENKDHVYGFG 139
+ + + F+ V +EL ++ H SN AP T +T +T + ++ + F
Sbjct: 118 RLLAADSYGAFNFVSVELDRTKASVLHASN-AP-----TGITRCELTARWDSATARWTF- 170
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNK---FNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
+ F V +IV+K + L++ + L+K+K+ +PD E+ R
Sbjct: 171 ----RWRRFDTV----AAVGDIVDKQLKVTTDCDRDALVEELFGLLKSKEKYFPDKELSR 222
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL---DYETKKWI 253
+A A + + + + V YG+RT ++ILVDH N M+F E T+ D E +W
Sbjct: 223 RAPAQAE-----KLASINVRFPEAGYGSRTRTVILVDHQNRMEFVEETMVSTDPE-GEWK 276
Query: 254 HTHIRK 259
THI++
Sbjct: 277 RTHIQR 282
>gi|395517374|ref|XP_003762852.1| PREDICTED: uncharacterized protein C22orf25 homolog [Sarcophilus
harrisii]
Length = 277
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 116/234 (49%), Gaps = 26/234 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GG+WL S GKL L N + + RG LV ++ + + YL K
Sbjct: 53 EEGKEGGSWLGISKKGKLAALTNYMQPKLDKGAKGRGELVTHFLTTEVDSFSYLK--KIS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+E +++NGF+++ +L +I+ + P VYG NS
Sbjct: 111 SEAHLYNGFNLIAADLNTTKGDIFCYYGNRGEQEPIVLTPG----------VYGLSNSLL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ +GK+ F +++++ ++ ++LI +L +M N++ PDP I+ D
Sbjct: 161 ET-PWKKLQYGKQLFVDVIDQ-SQDLSREDLIGELLKVMNNEECQLPDPAIE-----DQG 213
Query: 204 EDYKM----RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKW 252
+DY + +Y+ VCV YGTRT++I+L+D + F E ++ D + +W
Sbjct: 214 KDYILPILNKYAAVCVRCPG--YGTRTNTIVLIDAEGNVTFTERSMIDEDINQW 265
>gi|40556032|ref|NP_955117.1| CNPV094 T10-like protein [Canarypox virus]
gi|40233857|gb|AAR83440.1| CNPV094 T10-like protein [Canarypox virus]
Length = 275
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 113/224 (50%), Gaps = 20/224 (8%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENVF 89
GTWL + GK+ + N L PN RG LV Y+ + + EYL D+ + +++
Sbjct: 57 GTWLGMNRLGKISAITNYLQPTENPNATSRGYLVSNYLTSEVDSYEYLLDVSR--NGHLY 114
Query: 90 NGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQ 149
NGF+I+ L + ++SN + +SP ++T +YG NS P+
Sbjct: 115 NGFNIITASLGNGPDKLCYYSNRS--NSPPKELTQ---------GIYGISNS-LLDVPWT 162
Query: 150 KVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMR 209
K+ +GK+KF +IV ++ L ++L+L+ + DP I+ + + + K
Sbjct: 163 KLTYGKKKFTDIVT--SKYCSPDKLTASLLELLNDTTPVPIDPAIEIQGKEFIRPILK-E 219
Query: 210 YSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+S VCV YG+RT+++I+VD + + F E T LD E K+W
Sbjct: 220 FSAVCVKAKG--YGSRTNTVIIVDSDYNVSFTERTMLDTEAKEW 261
>gi|312067492|ref|XP_003136768.1| hypothetical protein LOAG_01180 [Loa loa]
gi|307768063|gb|EFO27297.1| hypothetical protein LOAG_01180 [Loa loa]
Length = 289
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 114/223 (51%), Gaps = 41/223 (18%)
Query: 29 GGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTD-LKKETE 85
GGTWL + NG+LG +L VL + P RG +V +Y++ + E Y+ + L KE +
Sbjct: 55 GGTWLGMTRNGRLGNILAVLENPTDEVPCAITRGKIVYEYLKSEMPPENYVVEQLSKEAQ 114
Query: 86 ENVFNGFHIVLL------ELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ +NGF+++LL E+ ++ FSN DSP+ VYGFG
Sbjct: 115 Q--YNGFNVILLHRLFNEEIERKTYFGVQFSNRH--DSPTAAFG---------PGVYGFG 161
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKS-ESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ +PF+K+ +G F E + + K+ Q L++ LD++ ++ S++PD ++ +
Sbjct: 162 NN-ALGKPFKKITYGLRLFEEKLKVLDDKNVNEQELMKQFLDILVDQTSHHPDEQLISQK 220
Query: 199 EADMDEDYKMRYSRVCVDISSIF--------YGTRTHSIILVD 233
E D D C +S +F YGTR+H+I+LV+
Sbjct: 221 EQDKDS---------CKLMSQLFYELPKPLRYGTRSHTIVLVN 254
>gi|326929556|ref|XP_003210928.1| PREDICTED: uncharacterized protein C22orf25-like [Meleagris
gallopavo]
Length = 276
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 111/224 (49%), Gaps = 27/224 (12%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL S G++ L N + + N + RG LV ++ YL K
Sbjct: 53 EEGKEGGTWLGISKKGRMAALTNYMQPTTDKNAKGRGALVTNFLTSDLDCYSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF+++ +L + + I ++ N P N +YG N
Sbjct: 111 VEGHLYNGFNLIAADLNSTKGDVICYYGNKG---EPEPVFLN--------PGIYGLSNCL 159
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ +GK+ F E++N+ ++ ++L+Q +L +M N++ PDP I+ D
Sbjct: 160 LDT-PWKKLQYGKQLFTEVINR-SQDLAKEDLVQELLTVMNNQEPQLPDPAIE-----DQ 212
Query: 203 DEDYKM----RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
++Y +Y+ VCV YGTRT++++L+D + F E
Sbjct: 213 GKEYIRPILNKYAAVCVRCPG--YGTRTNTVLLIDSEGNVTFTE 254
>gi|196011768|ref|XP_002115747.1| hypothetical protein TRIADDRAFT_59757 [Trichoplax adhaerens]
gi|190581523|gb|EDV21599.1| hypothetical protein TRIADDRAFT_59757 [Trichoplax adhaerens]
Length = 265
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/233 (29%), Positives = 111/233 (47%), Gaps = 41/233 (17%)
Query: 29 GGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GGTW+ S NG++ + N P + RG LV+K++ + S + YL ++ + +
Sbjct: 57 GGTWIGVSKNGRIAAITNTSRATHDPKLQSRGELVMKFLTSEVSPKAYLEEIAQVGHK-- 114
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF ++ + L + Y F N D ++ + N + F ++Q
Sbjct: 115 YNGFFLITIALRQDKIDGYIFCNKD--DEGIRHISEGLHGLSN----FAFNSNQT----- 163
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM 208
K+++GKE A+ V K + K + LI ++L +K+ + N PDP
Sbjct: 164 -KIVYGKELIADAVKKSSSKGD---LISSLLTALKDNKRNVPDP---------------- 203
Query: 209 RYSRVCVDISSI-----FYGTRTHSIILVDHNNVMDFHEWTLDY---ETKKWI 253
+S + ++SSI YGTRTH+IILVDH + F E T+ Y E KWI
Sbjct: 204 LHSYMYPELSSINVKLPTYGTRTHTIILVDHQQNVTFTEHTMQYPITENPKWI 256
>gi|57524602|ref|NP_001003781.1| transport and golgi organization 2 homolog [Danio rerio]
gi|50926009|gb|AAH79504.1| Zgc:100952 [Danio rerio]
Length = 273
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 110/222 (49%), Gaps = 20/222 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GG+WL S GKL L N L P+ + RG LV Y+ + YL K
Sbjct: 53 EEGKEGGSWLGISKRGKLAALTNYLEARQNPDAQGRGFLVSNYLTDNLDSLAYLR--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+E +++NGF+++ E + ++ N S++ ++ K +YG NS
Sbjct: 111 SEAHLYNGFNLLTAEFRANEDTLCYYGNKG-----SSEPIHL------KAGIYGLSNSLL 159
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ GK+ F+ +V+K L+Q +L ++ N++ N PDP + + +
Sbjct: 160 ET-PWRKLQHGKQLFSSVVSK---TLPPDGLVQELLHILNNEELNTPDPAQESQGVGYSN 215
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+ S VCV S YGTRT+++IL+D + F E T+
Sbjct: 216 AMLRA-LSAVCV--RSPGYGTRTNTVILIDREGNVSFTERTM 254
>gi|427798681|gb|JAA64792.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 254
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 109/210 (51%), Gaps = 20/210 (9%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENS--RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTW+A S +G++G LLN+L R RG LVV +++ YLT L KE +
Sbjct: 56 EGGTWMAVSKDGRIGALLNILQPTHCMDTTKRGRGFLVVDFIKSTMDGNAYLTKLMKEKD 115
Query: 86 ENVFNGFHIVLLELTL--QSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E FNGF V +E + ++ ++SN+ + + + FGNS
Sbjct: 116 E--FNGFLFVAIEAKPLKKKISMSYYSNL-----------QEGGPVQTEPGFHAFGNS-V 161
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
Q + KV +GK+KF EIV + NR S+ L++ I D + ++ ++YP + RK + +
Sbjct: 162 PPQFWAKVKYGKQKFEEIVRQNNRFSQKDQLLEKIYDFL-DESTSYPVDDAMRKQSQEPE 220
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
++ +++ + + YGTRT +++ V+
Sbjct: 221 STLRL-MNQMKYILPNYNYGTRTQTVLFVN 249
>gi|16553179|dbj|BAB71498.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 114/235 (48%), Gaps = 36/235 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 58 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 115
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 116 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 160
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + SQ L +++L+ N+++ PDP I+ +
Sbjct: 161 SNALLET-PWRKLCFGKQLFLEAVER------SQALPKDVLN---NEEAQLPDPAIEDQG 210
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 211 -GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 262
>gi|410922259|ref|XP_003974600.1| PREDICTED: uncharacterized protein C22orf25-like isoform 1
[Takifugu rubripes]
Length = 275
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 110/226 (48%), Gaps = 19/226 (8%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GG+WL + GKL + N L +S P+ + RG LV Y+ K + Y K E +
Sbjct: 57 EGGSWLGINKRGKLAGITNYLESHSNPDAQGRGFLVSNYLT-DKDQDSYSYLKKVSLEGH 115
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++NGF+++ E + + ++ N + K +YG NS + P
Sbjct: 116 LYNGFNLITAEFRAKQDVVCYYGNRGSPEPIHLKAG-----------IYGLSNSLLDT-P 163
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
++K++ GK F+ +V+ ++ L+Q +L ++ N++ N PDP + + +
Sbjct: 164 WKKLLRGKRHFSSVVD--DQTLSCDGLVQELLGVLNNEELNSPDPAQESHGDG-YSKPML 220
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKW 252
S VCV S YGTRT+++IL+D + F E T+ D +T W
Sbjct: 221 QALSAVCV--RSPDYGTRTNTVILIDAEGNVTFTERTMRDRDTSNW 264
>gi|426247584|ref|XP_004017561.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C22orf25
homolog [Ovis aries]
Length = 284
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 110/228 (48%), Gaps = 21/228 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV +++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLNHDARGRGELVAQFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTN-IYHFSNIAP-----LDSPSTKVTNVTNKQENKDHVYG 137
E +++NGF+++ +L+ + + I ++ N L P N YG
Sbjct: 111 AEGHLYNGFNLIAGDLSAEKGDVICYYGNRGEREPVGLSPPPPCWQNARPPA----GTYG 166
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N+ + P++K+ FGK+ F E V + R+ + L LD+ N ++ PDP I+ +
Sbjct: 167 LSNALLET-PWRKLCFGKQLFLEAVER-GRELPREXL----LDVRSNDEAQLPDPAIEAQ 220
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+ +Y+ VCV YGTRT+++ILVD + + F E ++
Sbjct: 221 GR-EYVRPILSKYAAVCVRCPD--YGTRTNTVILVDADGHVTFTERSM 265
>gi|149019789|gb|EDL77937.1| similar to Ser/Thr-rich protein T10 in DGCR region (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 229
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 101/207 (48%), Gaps = 27/207 (13%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 2 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 59
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+++ +L+ ++ V N+ E + V YG
Sbjct: 60 TEGHLYNGFNLIAADLSTAKGDV---------------VCYYGNRGEPEPIVLTPGTYGL 104
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + + L+ +L+++ N+++ PDP I+ +
Sbjct: 105 SNALLET-PWKKLCFGKQLFTEAVER-SESLPKDILVTQLLEVLNNEEAQLPDPAIEDQG 162
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTR 225
+ + + +Y+ VCV S YGTR
Sbjct: 163 Q-EYIQPILSKYAAVCV--RSATYGTR 186
>gi|149019788|gb|EDL77936.1| similar to Ser/Thr-rich protein T10 in DGCR region (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 280
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 101/207 (48%), Gaps = 27/207 (13%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 53 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGF 138
TE +++NGF+++ +L+ ++ V N+ E + V YG
Sbjct: 111 TEGHLYNGFNLIAADLSTAKGDV---------------VCYYGNRGEPEPIVLTPGTYGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P++K+ FGK+ F E V + + L+ +L+++ N+++ PDP I+ +
Sbjct: 156 SNALLET-PWKKLCFGKQLFTEAVER-SESLPKDILVTQLLEVLNNEEAQLPDPAIEDQG 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTR 225
+ + + +Y+ VCV S YGTR
Sbjct: 214 Q-EYIQPILSKYAAVCV--RSATYGTR 237
>gi|402593990|gb|EJW87917.1| hypothetical protein WUBG_01175 [Wuchereria bancrofti]
Length = 285
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 115/224 (51%), Gaps = 43/224 (19%)
Query: 29 GGTWLATSTNGKLGILLNVLGENSR---PNGRDRGPLVVKYVEGQKSAEEYLTD-LKKET 84
GGTWL S +G+LG +L VL EN+ P RG +V +Y++ + E+Y+ + L KE
Sbjct: 51 GGTWLGMSRDGRLGNVLAVL-ENTADEIPCAITRGKIVYEYLKSKMPPEDYVAEQLSKEA 109
Query: 85 EENVFNGFHIVLL------ELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++ +NGF+++LL E+ ++ FSN SP+ +YGF
Sbjct: 110 QQ--YNGFNVILLHRLFNEEIERKTYFGVQFSNRQ--GSPTAAFG---------PGIYGF 156
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKS-ESQNLIQNILDLMKNKQSNYPDPEIDRK 197
GN+ +PF+K+ +G F E + + ++ Q L++ LD++ ++ S+YPD ++ +
Sbjct: 157 GNN-VLGKPFKKITYGLRLFEEKLKILDDENVNEQELVKQFLDILVDQTSHYPDEQLISQ 215
Query: 198 AEADMDEDYKMRYSRVCVDISSIF--------YGTRTHSIILVD 233
+ D D C +S +F YGTR H+I+LV+
Sbjct: 216 KKQDKDR---------CKLMSQLFYELPEPLRYGTRCHTIVLVN 250
>gi|89273920|emb|CAJ81843.1| novel protein [Xenopus (Silurana) tropicalis]
Length = 275
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 111/226 (49%), Gaps = 19/226 (8%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GGTWL + GK L N L + + RG LV ++ + + YL K E +
Sbjct: 57 EGGTWLGINKRGKFAALTNYLQPKIDLHAKGRGSLVTSFLTSEIDSFSYLK--KISAEGH 114
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++NGF+++ + + + ++ S + + N VYG S + P
Sbjct: 115 LYNGFNLLAADFNTKEDVMCYYG------SKGEQEPLILNP-----GVYGLSCSLLDT-P 162
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
++K+ GK+ FA+I+ K + ++L+Q ++ +M N++ PDP I+ + + D
Sbjct: 163 WRKLQHGKKLFADIIRKIQDIAR-EDLVQELIKVMNNEEQQLPDPAIEEQGK-DFVRPIL 220
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
YS VCV S YGTRT++I+L+D + F E T L+ + +W
Sbjct: 221 SGYSAVCVRCSG--YGTRTNTIVLIDGEGQVTFTERTMLNSDVTQW 264
>gi|346471531|gb|AEO35610.1| hypothetical protein [Amblyomma maculatum]
Length = 283
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 113/225 (50%), Gaps = 26/225 (11%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENS--RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTW+A S +G++ LLN+L + P R RG LVV +++ YLT L E +
Sbjct: 56 EGGTWMAVSKSGRIAALLNILQPANIMDPTKRGRGFLVVDFIKSAVDGNTYLTKLMNEKD 115
Query: 86 ENVFNGFHIVLLEL--TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E FNGF + +E + ++ ++SN+ D K + Y FGNS
Sbjct: 116 E--FNGFLFIAIEAKPCQKEISMSYYSNLQ--DGGPVKA---------EPGFYAFGNS-V 161
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA---EA 200
+ + KV GK+KF E++ + + S+ L+ I D + + S+ D ++ +++ E
Sbjct: 162 PPKFWAKVKCGKKKFEELIKQNGQFSQRDQLLTKIFDFLDDSTSHPVDDDMRKQSLEPEC 221
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
++ +M+Y + S YGTRTH+++ V+ +F E T+
Sbjct: 222 NLKLMNQMKYV-----LPSWNYGTRTHTVLFVNGAGKAEFIEKTM 261
>gi|427798033|gb|JAA64468.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 292
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 117/237 (49%), Gaps = 36/237 (15%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENS--RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTW+A S +G++G LLN+L R RG LVV +++ YLT L KE +
Sbjct: 56 EGGTWMAVSKDGRIGALLNILQPTHCMDTTKRGRGFLVVDFIKSTMDGNAYLTKLMKEKD 115
Query: 86 ENVFNGFHIVLLEL-------------------TLQSTNIYH-FSN-IAPLDSPSTK--- 121
E FNGF V +E +Q+ +H F N + P P K
Sbjct: 116 E--FNGFLFVAIEAKPLKKKISMSYYSNLQEGGPVQTEPGFHAFGNSVPPQFXPLKKKIS 173
Query: 122 VTNVTNKQEN-----KDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQ 176
++ +N QE + + FGNS Q + KV +GK+KF EIV + NR S+ L++
Sbjct: 174 MSYYSNLQEGGPVQTEPGFHAFGNS-VPPQFWAKVKYGKQKFEEIVRQNNRFSQKDQLLE 232
Query: 177 NILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
I D + ++ ++YP + RK + + ++ +++ + + YGTRT +++ V+
Sbjct: 233 KIYDFL-DESTSYPVDDAMRKQSQEPESTLRLM-NQMKYILPNYNYGTRTQTVLFVN 287
>gi|291190306|ref|NP_001167100.1| Ser/Thr-rich protein T10 in DGCR region [Salmo salar]
gi|223648126|gb|ACN10821.1| Ser/Thr-rich protein T10 in DGCR region [Salmo salar]
Length = 274
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 112/230 (48%), Gaps = 20/230 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GG+WL + GKL L N L P+ + RG LV ++ Q S + Y +
Sbjct: 53 EEGKEGGSWLGINKRGKLAALTNYLEGRPNPDAQGRGFLVSNFLTDQ-SQDSYSYLKRVS 111
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+E +++NGF+++ E + + ++ N + +YG NS
Sbjct: 112 SEGHLYNGFNLLTAEFKAKEDTVCYYGNRGSAEPIRL-----------NPGIYGLSNSLL 160
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ GK F +VN+ +L+ ++L+++ N++ N PDP + + E
Sbjct: 161 ET-PWRKLQHGKRLFTSVVNQ---TLPCDSLVHDLLNVLNNEELNTPDPAQESQGEGYSS 216
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ S VCV S YGTRT++IIL+D + + F E T L+ + +W
Sbjct: 217 HMLR-SLSSVCV--RSPHYGTRTNTIILIDASGNVTFTERTMLNCDVSQW 263
>gi|170594361|ref|XP_001901932.1| Ser/Thr-rich protein T10 in DGCR region [Brugia malayi]
gi|158590876|gb|EDP29491.1| Ser/Thr-rich protein T10 in DGCR region, putative [Brugia malayi]
Length = 285
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 110/222 (49%), Gaps = 39/222 (17%)
Query: 29 GGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEE 86
GGTWL S +G+LG +L VL + P RG +V +Y++ + E+Y+ + + E
Sbjct: 51 GGTWLGMSRDGRLGNVLAVLENATDEIPCAITRGKIVYEYLKSEMPPEDYVAE-QLSNEA 109
Query: 87 NVFNGFHIVLL------ELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+NGF+++LL E+ ++ FSN SP+ +YGFGN
Sbjct: 110 QQYNGFNVILLHRLFNEEIERKTYFGVQFSNRQ--GSPTAAF---------GPGIYGFGN 158
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKS-ESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
S +PF+K+ +G F E + + ++ Q L++ LD++ ++ S+YPD ++ + +
Sbjct: 159 S-VLGKPFKKITYGLRLFEEKLKILDDENVNEQELMKQFLDILIDQTSHYPDEQLISQKK 217
Query: 200 ADMDEDYKMRYSRVCVDISSIF--------YGTRTHSIILVD 233
D D C +S +F YGTR H+I+LV+
Sbjct: 218 QDKDR---------CKLMSQLFYEFPEPVXYGTRCHTIVLVN 250
>gi|410922261|ref|XP_003974601.1| PREDICTED: uncharacterized protein C22orf25-like isoform 2
[Takifugu rubripes]
Length = 276
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/228 (28%), Positives = 112/228 (49%), Gaps = 22/228 (9%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GG+WL + GKL + N L +S P+ + RG LV Y+ K + Y K E +
Sbjct: 57 EGGSWLGINKRGKLAGITNYLESHSNPDAQGRGFLVSNYLT-DKDQDSYSYLKKVSLEGH 115
Query: 88 VFNGFHIVLLELTLQSTNI--YHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTS 145
++NGF+++ E + ++ Y+ + +P K +YG NS +
Sbjct: 116 LYNGFNLITAEFSTAKQDVVCYYGNRGSP------------EPIHLKAGIYGLSNSLLDT 163
Query: 146 QPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDED 205
P++K++ GK F+ +V+ ++ L+Q +L ++ N++ N PDP + + +
Sbjct: 164 -PWKKLLRGKRHFSSVVD--DQTLSCDGLVQELLGVLNNEELNSPDPAQESHGDG-YSKP 219
Query: 206 YKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKW 252
S VCV S YGTRT+++IL+D + F E T+ D +T W
Sbjct: 220 MLQALSAVCV--RSPDYGTRTNTVILIDAEGNVTFTERTMRDRDTSNW 265
>gi|52345676|ref|NP_001004885.1| transport and golgi organization 2 homolog [Xenopus (Silurana)
tropicalis]
gi|49522958|gb|AAH75282.1| MGC88919 protein [Xenopus (Silurana) tropicalis]
Length = 276
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 110/226 (48%), Gaps = 18/226 (7%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GGTWL + GK L N L + + RG LV ++ + + YL K E +
Sbjct: 57 EGGTWLGINKRGKFAALTNYLQPKIDLHAKGRGSLVTSFLTSEIDSFSYLK--KISAEGH 114
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++NGF+++ + ++ + P + N VYG S + P
Sbjct: 115 LYNGFNLLAADFNSTKEDVMCYYGSKGEQEPL-----ILNP-----GVYGLSCSLLDT-P 163
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
++K+ GK+ FA+I+ K ++ ++L+Q ++ +M N++ PDP I+ + + D
Sbjct: 164 WRKLQHGKKLFADIIRK-SQDIAREDLVQELIKVMNNEEQQLPDPAIEEQGK-DFVRPIL 221
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
YS VCV S YGTRT++I+L+D + F E T L+ + +W
Sbjct: 222 SGYSAVCVRCSG--YGTRTNTIVLIDGEGQVTFTERTMLNSDVTQW 265
>gi|320164730|gb|EFW41629.1| hypothetical protein CAOG_06761 [Capsaspora owczarzaki ATCC 30864]
Length = 276
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 113/236 (47%), Gaps = 22/236 (9%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKY-VEGQKSAEEYLTDLKKETEE 86
+GGTWL T+T+G++ L N + P R RG L + V+ Q S +++ L E E
Sbjct: 57 EGGTWLGTTTDGRVAFLTNCREASPPPELRGRGGLCANFLVDQQASPDKFAHSLASERHE 116
Query: 87 NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
F+GF++V+ ++ QS N + SN D S + + +G N +
Sbjct: 117 --FSGFNLVVGDI--QSGNFQYVSNRVNQDYQSVQPCVL----------HGVSNG-VLDE 161
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
P+ KV GK VN+ N +++ + ++ M+++Q D+ + + ++
Sbjct: 162 PWPKVTRGKANIDAAVNRAN--ADADQVAAHLASAMRDQQKC----SDDQLPKTGVPIEW 215
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIRKTLN 262
+ + S V V+ YGTR+ ++ +VDHN F+E T D ET +W +LN
Sbjct: 216 ERKLSPVFVEFPEAAYGTRSIAVQVVDHNGHSVFYEHTRDSETGEWKQQRFSFSLN 271
>gi|290562403|gb|ADD38598.1| protein C22orf25 [Lepeophtheirus salmonis]
Length = 287
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 117/236 (49%), Gaps = 44/236 (18%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYV-EGQ-KSAEEYLTDLKKETE 85
+GGTW A + GK+G L N+ S N RG L++ Y+ EG+ K++ +YL +L K
Sbjct: 54 RGGTWCAINEEGKIGFLTNIFTGQSY-NRLSRGSLIIDYLKEGKIKTSMDYLNELSKHG- 111
Query: 86 ENVFNGFHIVLLELTLQST--NIY-------HFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
NV+N F++ L+ Q + + Y H N PL + D
Sbjct: 112 -NVYNPFNLFLMSPNAQGSFDSFYYCPGLEDHIQNEGPL--------------QIHDSFI 156
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N +S P++K KF+ IV+++N S+ L +++ + ++ YPDP+I++
Sbjct: 157 GLSNHPLSS-PYRKTSTYLHKFSNIVHEYNDTSQQSTLTESLFNALQCTDKCYPDPQIEK 215
Query: 197 KAEADMDEDYKMRYSRVCVDISSIF--------YGTRTHSIILVDHNNVMDFHEWT 244
+ ++ ++ RY ++S+F YGTR+ ++IL+D+ + + F E T
Sbjct: 216 QC---LNPSFRSRYEF----LTSLFILSGPGSTYGTRSQTLILIDYFDNVYFTEKT 264
>gi|357123083|ref|XP_003563242.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like
[Brachypodium distachyon]
Length = 266
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 112/222 (50%), Gaps = 21/222 (9%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ ++ +G+L L NVL ++ PN R RG L +++++G K E T++ K
Sbjct: 46 GRDVLGGGTWMGSTKDGRLAFLTNVLEPDAMPNARTRGDLPLRFLQGNKGPLEVATEVAK 105
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF+++L +LT N+ + + P P+T Q ++ N++
Sbjct: 106 EADE--YNGFNLILADLT---RNVMVYVSNRPKGQPAT-------IQLVSPGLHVLSNAR 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNK-FNRKSESQNLIQNIL-DLMKNKQSNYPDPEIDRKAEA 200
S P+QK I + F E + K + + E++++ + ++ D K + P+ D E
Sbjct: 154 LDS-PWQKAIRLGKNFREFIRKNGDDEVEAKDIAERLMTDTTKADKDRLPNTGCDTNWEH 212
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + + V YGTR+ +++ V+++ +E
Sbjct: 213 GLSSIF------IEVQTDQGLYGTRSTAVLSVNYDGKASLYE 248
>gi|290462189|gb|ADD24142.1| protein C22orf25 [Lepeophtheirus salmonis]
Length = 287
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 116/236 (49%), Gaps = 44/236 (18%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYV-EGQ-KSAEEYLTDLKKETE 85
+GGTW A + GK+G L N+ S N RG L++ Y+ EG+ K++ +YL +L K
Sbjct: 54 RGGTWCAINEEGKIGFLTNIFTGQSY-NRLSRGSLIIDYLKEGKIKTSMDYLNELSKHG- 111
Query: 86 ENVFNGFHIVLLELTLQST--NIY-------HFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
NV+N ++ L+ Q + + Y H N PL + D
Sbjct: 112 -NVYNPLNLFLMSPNAQGSFDSFYYCPGLEDHIQNEGPL--------------QIHDSFI 156
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N +S P++K KF+ IV+++N S+ L +++ + ++ YPDP+I++
Sbjct: 157 GLSNHPLSS-PYRKTSTNLHKFSNIVHEYNDTSQQSTLTESLFNALQCTDKCYPDPQIEK 215
Query: 197 KAEADMDEDYKMRYSRVCVDISSIF--------YGTRTHSIILVDHNNVMDFHEWT 244
+ ++ ++ RY ++S+F YGTR+ ++IL+D+ + + F E T
Sbjct: 216 QC---LNPSFRSRYEF----LTSLFILSGPGSTYGTRSQTLILIDYFDNVYFTEKT 264
>gi|148665100|gb|EDK97516.1| DNA segment, Chr 16, human D22S680E, expressed, isoform CRA_a [Mus
musculus]
Length = 207
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 106/214 (49%), Gaps = 36/214 (16%)
Query: 49 GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYH 108
G S + R RG LV ++ + YL K TE +++NGF+I+ +L+ ++
Sbjct: 9 GTRSGSDARGRGELVSHFLTSDMDSLSYLK--KVSTEGHLYNGFNIIAADLSTSKGDV-- 64
Query: 109 FSNIAPLDSPSTKVTNVTNKQENKDHV-----YGFGNSQCTSQPFQKVIFGKEKFAEIVN 163
V N+ E + V YG N+ + P++K+ FGK+ F E V
Sbjct: 65 -------------VCYYGNRGEPEPIVLTPGTYGLSNALLET-PWKKLCFGKQLFMEAVE 110
Query: 164 KFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM----RYSRVCVDISS 219
+ + L+ +LD++ N+++ PDP I+ D ++Y +Y+ VCV ++
Sbjct: 111 Q-SEALPKDVLVTQLLDVLNNEEAQLPDPAIE-----DQGQEYVQPILNKYAAVCVRCAT 164
Query: 220 IFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
YGTRT++IILVD N + F E + LD +T +W
Sbjct: 165 --YGTRTNTIILVDANGHVTFTERSMLDKDTSRW 196
>gi|443699188|gb|ELT98798.1| hypothetical protein CAPTEDRAFT_220945 [Capitella teleta]
Length = 271
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 105/221 (47%), Gaps = 28/221 (12%)
Query: 28 KGGTWLATSTNGKLGILLNVLGE--NSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTWL S ++ LLN+L P+ + RG LV ++ ++S E YL+ + TE
Sbjct: 54 EGGTWLGMSKRRRIASLLNILQYPFGPDPSKKGRGFLVKDFLTNEESVESYLSRIA--TE 111
Query: 86 ENVFNGFHIVLLELTLQSTNIYH-FSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
+ + GFH+V ++ +Y+ F+ I+ F N +C
Sbjct: 112 GHSYGGFHLVCVDYRCGDHFLYYSFTCISI-----------------------FTNCKCY 148
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
++ + K + + F +IV + N + L ++L L+ + PDP ++ +A +
Sbjct: 149 NRQWAKAAYARPLFTDIVARLNSTQKQNELKLSLLRLLSDDTQTLPDPALEEQAGEALPH 208
Query: 205 DYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
S V V +SI YGTRT+SIILVD + +F E T+
Sbjct: 209 HIIAERSAVHVTSASINYGTRTNSIILVDKSGNCEFFERTM 249
>gi|332262771|ref|XP_003280432.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C22orf25
homolog [Nomascus leucogenys]
Length = 268
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 110/231 (47%), Gaps = 28/231 (12%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNXXXXXXX--------LVTHFLTTDVDSLSYLK--KVS 102
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF+++ +L T + I ++ N P V YG N+
Sbjct: 103 MEGHLYNGFNLIAADLSTAKGDVICYYGNRG---EPDPIVLT--------PGAYGLSNAL 151
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ FGK+ F E V + ++ LI N+LD++ N+++ PDP I+ + +
Sbjct: 152 LET-PWRKLCFGKQLFLEAVER-SQALPKDVLIANLLDVLNNEEAQLPDPAIEDQG-GEY 208
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 209 VQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDFSHW 257
>gi|115478555|ref|NP_001062871.1| Os09g0323500 [Oryza sativa Japonica Group]
gi|48716989|dbj|BAD23681.1| unknown proteingi|148228476|ref|NP_001083694.1| uncharacterized protein LOC399065 [Xenopus laevis]
gi|39645613|gb|AAH63724.1| MGC68627 protein [Xenopus laevis]
Length = 275
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 111/226 (49%), Gaps = 19/226 (8%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GGTWL + GK L N L + + RG LV ++ + + YL K E +
Sbjct: 57 EGGTWLGINKRGKFAALTNYLQPKIDLHAKGRGSLVTNFLTSEIDSFSYLK--KISAEGH 114
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++NGF+++ + + + ++ S + + N VYG S + P
Sbjct: 115 LYNGFNLLAADFNTKEDVMCYYG------SKGEQEPLILNP-----GVYGLSCSLLDT-P 162
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
++K+ GK+ FA+I+ K ++ +L+Q ++ +M N++ PDP I+ + + +
Sbjct: 163 WKKLQHGKKLFADIIRK-SQDIARDDLVQELIKVMNNEEQQLPDPAIEEQGK-EFVRPIL 220
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
YS VCV + YGTRT++I+L+D + F E T L+ + +W
Sbjct: 221 SSYSAVCVRCNG--YGTRTNTIVLIDGKGQVTFTERTMLNSDITQW 264
>gi|241638382|ref|XP_002409106.1| conserved hypothetical protein [Ixodes scapularis]
gi|215501296|gb|EEC10790.1| conserved hypothetical protein [Ixodes scapularis]
Length = 291
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 110/222 (49%), Gaps = 20/222 (9%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNG--RDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTWLA S G++ LLN+L ++ + + RG LVV +V+ + E YL L + E
Sbjct: 64 EGGTWLAVSKTGRIAALLNILQRSNIIDATKKGRGFLVVDFVQSSQDGETYLNKLMQ--E 121
Query: 86 ENVFNGFHIVLLELTLQSTNIY--HFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
N +NGF ++ +E + ++SN+ + T + + FGNS
Sbjct: 122 RNDYNGFLLITVETKPLKKKCFLNYYSNLQ--EGAPTHL---------DPGFHAFGNS-V 169
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ KV GK+ F IV + S+ + L+ + D +++ + YP + R ++D
Sbjct: 170 PPYFWSKVTGGKQMFEAIVKENCSFSQRETLVSKLFDFLQD-TTPYPVDDSMRSQSEELD 228
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
++R +R+ + YGTRTH+I+LV+ +F E T+
Sbjct: 229 ATLEIR-NRIKFALPKWNYGTRTHTIVLVNGQGKAEFIEKTM 269
>gi|326511277|dbj|BAJ87652.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 111/222 (50%), Gaps = 21/222 (9%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ ++ +G+L L NVL ++ P R RG L +++++G KS E T++ K
Sbjct: 46 GRDVLGGGTWMGSTKDGRLAFLTNVLEPDAMPGARTRGDLPLRFLQGNKSPLEVATEVAK 105
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF+++L +LT N+ + + P P+T Q ++ N++
Sbjct: 106 EADE--YNGFNLILADLT---RNVMVYVSNRPKGQPAT-------IQLVSPGLHVLSNAR 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNKF-NRKSESQNLIQNIL-DLMKNKQSNYPDPEIDRKAEA 200
S P+QK I + F E + K + + E++++ ++ D + + P+ D E
Sbjct: 154 LDS-PWQKAIRLGKNFREFIRKHGDDEVEAKDIADRLMTDTTRADKDRLPNTGCDPTWEH 212
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + + V YGTR+ +++ V+++ +E
Sbjct: 213 GLSSIF------IEVQTDEGLYGTRSTAVLSVNYDGEASLYE 248
>gi|413935037|gb|AFW69588.1| hypothetical protein ZEAMMB73_631728 [Zea mays]
Length = 261
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 112/226 (49%), Gaps = 34/226 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +G+L L NVL ++ P R RG L +K+++ KS E T++ +
Sbjct: 46 GRDVLGGGTWMGCTKDGRLAFLTNVLEPDAMPGARTRGDLPLKFLQSNKSPLEVATEVAE 105
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF+++L +LT TNI + + P P+T Q ++ N++
Sbjct: 106 EADE--YNGFNLILADLT---TNIMVYVSNRPKGQPAT-------IQLVSPGLHVLSNAR 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNR-KSESQNLIQNIL-DLMKNKQSNYPDPEIDRKAEA 200
S P+QK I + F E++ + + E +++++ ++ D K + P+ D E
Sbjct: 154 LDS-PWQKAILLGKNFRELLREHGADEVEVKDIVERLMTDTTKADKDRLPNTGCDPNWEH 212
Query: 201 DMDEDYKMRYSRVCVDISSIF----YGTRTHSIILVDHNNVMDFHE 242
+ SSIF YGTR+ +++ V+++ +E
Sbjct: 213 GL---------------SSIFIEGPYGTRSTAVLSVNYDGEASLYE 243
>gi|268083460|gb|ACY95278.1| unknown [Zea mays]
Length = 266
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 111/222 (50%), Gaps = 21/222 (9%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +G+L L NVL ++ P R RG L +K+++ KS E T++ +
Sbjct: 46 GRDVLGGGTWMGCTKDGRLAFLTNVLQPDAMPGARTRGDLPLKFLQSNKSPLEVATEVAE 105
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF+++L +LT TNI + + P P+T Q ++ N++
Sbjct: 106 EADE--YNGFNLILADLT---TNIMVYVSNRPKGQPAT-------IQLVSPGLHVLSNAR 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
S P+QK I + F E++ + + D+++ ++ + DR
Sbjct: 154 LDS-PWQKAILLGKNFRELLREHGADE------VEVKDIVERLMTDTTKADKDRLPNTGC 206
Query: 203 DEDYKMRYSRVCVDISS--IFYGTRTHSIILVDHNNVMDFHE 242
D +++ S + +++ + YGTR+ +++ V+++ +E
Sbjct: 207 DPNWEHGLSSIFIEVQTDQGPYGTRSTAVLSVNYDGEASLYE 248
>gi|391330110|ref|XP_003739507.1| PREDICTED: uncharacterized protein C22orf25-like [Metaseiulus
occidentalis]
Length = 274
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/240 (30%), Positives = 116/240 (48%), Gaps = 26/240 (10%)
Query: 28 KGGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQK--SAEEYLTDLKKE 83
+GG WL + NG+ LLN+L E R RG LVV + + + +EY L K+
Sbjct: 57 EGGAWLIFNKNGRFASLLNILQRDEELLAGKRGRGFLVVDFAKSSSTITPDEYAQSLYKD 116
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E FNGF +V ++L Q+ Y F ++ ++ V N+ + FGNS
Sbjct: 117 REN--FNGFTLVCVDLK-QAYGSY-FCSMGDME-----VQNLGAG------YHAFGNSIL 161
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
Q + KV+ GKEKF E+V K K + + L + + DLM N ++N+ E RK + D
Sbjct: 162 PRQ-WPKVVAGKEKFEEVVEK--NKHDEKLLEEALFDLM-NDETNHGVDEFMRK-QTDEP 216
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIRKTLNM 263
+D + I YG+RTH+++L+ + + E T+ E KW+ LN+
Sbjct: 217 QDLLTLRGALKFYIKEFNYGSRTHTVVLIRGDGSASYSEKTV--EDGKWVERRTDFQLNL 274
>gi|195626530|gb|ACG35095.1| Ser/Thr-rich protein T10 in DGCR region [Zea mays]
gi|413935038|gb|AFW69589.1| ser/Thr-rich protein T10 in DGCR region [Zea mays]
Length = 266
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 111/222 (50%), Gaps = 21/222 (9%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +G+L L NVL ++ P R RG L +K+++ KS E T++ +
Sbjct: 46 GRDVLGGGTWMGCTKDGRLAFLTNVLEPDAMPGARTRGDLPLKFLQSNKSPLEVATEVAE 105
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF+++L +LT TNI + + P P+T Q ++ N++
Sbjct: 106 EADE--YNGFNLILADLT---TNIMVYVSNRPKGQPAT-------IQLVSPGLHVLSNAR 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
S P+QK I + F E++ + + D+++ ++ + DR
Sbjct: 154 LDS-PWQKAILLGKNFRELLREHGADE------VEVKDIVERLMTDTTKADKDRLPNTGC 206
Query: 203 DEDYKMRYSRVCVDISS--IFYGTRTHSIILVDHNNVMDFHE 242
D +++ S + +++ + YGTR+ +++ V+++ +E
Sbjct: 207 DPNWEHGLSSIFIEVQTDQGPYGTRSTAVLSVNYDGEASLYE 248
>gi|297260577|ref|XP_001113606.2| PREDICTED: hypothetical protein LOC719416 [Macaca mulatta]
Length = 542
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 101/208 (48%), Gaps = 19/208 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF+++ +L T + I ++ N D +T T YG N+
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPD--PIVLTPGT---------YGLSNAL 159
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P++K+ FGK+ F E V + ++ LI N+LD++ N+++ PDP I+ + +
Sbjct: 160 LET-PWRKLCFGKQLFLEAVER-SQALPKDVLIANLLDVLNNEEAQLPDPAIEDQG-GEY 216
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSII 230
+ +Y+ VCV YGTR S +
Sbjct: 217 VQPVLSKYAAVCVRCPG--YGTRYCSPV 242
>gi|324520281|gb|ADY47596.1| Ser/Thr-rich protein T10 in DGCR region [Ascaris suum]
Length = 287
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 106/225 (47%), Gaps = 33/225 (14%)
Query: 30 GTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
GTWL T+G++G LL++ + + RG L ++++ +KS + L + +
Sbjct: 54 GTWLCMDTSGRIGNLLSITIPFHEVKQHAPSRGALPIEFLRSKKSPSRFCESLT--SIAD 111
Query: 88 VFNGFHIVLLE---------LTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+N F I+ L+ TL S + I P+ P VYGF
Sbjct: 112 AYNAFQILCLQRNNEDQYDLCTLASCFV---DRIQPIKYPPG--------------VYGF 154
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFN-RKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
GNS +PF+KV G + ++V++ + + +I +L++ ++ +PD ++ ++
Sbjct: 155 GNSP-PDKPFKKVQRGLDLMRKMVDELKIEEPCEEKIIARLLNIATDRVQCFPDEQLQKQ 213
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ E + R S I YGTR+HSIILVD +N + F+E
Sbjct: 214 CQRST-EICRFRTSLYVQYPDGIRYGTRSHSIILVDRSNRVTFYE 257
>gi|357165290|ref|XP_003580333.1| PREDICTED: uncharacterized protein C22orf25 homolog [Brachypodium
distachyon]
Length = 272
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 115/249 (46%), Gaps = 35/249 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GGTWL + GKL L NV + R R RG L V++++G+ EY T++ KE
Sbjct: 50 KDELGGGTWLGCTKGGKLAFLTNVREPSPRAGARSRGELPVRFLQGRLGPLEYATEIAKE 109
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
++ +NGF+++L + + S N+ + SN P +P V HV ++
Sbjct: 110 ADQ--YNGFNLILAD--VHSGNMVYISN-KPSGAP------VVQTVSPGSHVL---SNAA 155
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSES------QNLIQNILDLMKNKQSNYPDPEIDRK 197
P+ K + + + N+F +S Q + + ++D +K +S PD
Sbjct: 156 IDSPWPKAL----RLGQSFNRFLATHDSVEVPLEQMVEELMMDTVKADRSEVPD------ 205
Query: 198 AEADMDEDYKMRYSRVCVDI--SSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHT 255
+D D++ + S + +D YGTR+ I V + + F+E +L + W
Sbjct: 206 --TGVDPDWEYQLSSIFIDTEKGQARYGTRSMVAIAVKLDGEVTFYERSLA-SSSLWEEN 262
Query: 256 HIRKTLNMA 264
++ + MA
Sbjct: 263 LVQFRMEMA 271
>gi|194769790|ref|XP_001966984.1| GF21807 [Drosophila ananassae]
gi|190622779|gb|EDV38303.1| GF21807 [Drosophila ananassae]
Length = 349
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 102/209 (48%), Gaps = 29/209 (13%)
Query: 59 RGPLVVKYVEGQKSAEEYLT---DLKKETEENVFNGFHIVLLELTLQS--TNIYHFSNIA 113
RG +V +V Q L+ DL K+ + ++ F+ V +E+ S + SN+
Sbjct: 154 RGMIVADFVAKQDDRHSILSYNQDLLKDCTK--YSAFNFVSIEIGPSSLPARVKLLSNVP 211
Query: 114 PLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN 173
P T ++ YGFGNS PF+KV G+E F IV + + S +
Sbjct: 212 P-----------TLEEFENGGCYGFGNS-LPHTPFEKVRHGQEAFQAIVKEHGQAS-VET 258
Query: 174 LIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
L +++ L+K K +PD E+ R+A ++ S + V IS YG+RTH++ILVD
Sbjct: 259 LSGHLMQLLKKKHKFWPDVELKRRAP-----NWGEGLSSLNVHISEHGYGSRTHTVILVD 313
Query: 234 HNNVMDFHEWT---LDYETKKWIHTHIRK 259
+N M F E T LD E +W THI K
Sbjct: 314 GDNKMHFIEETMAGLDPEG-EWRRTHIEK 341
>gi|312067494|ref|XP_003136769.1| hypothetical protein LOAG_01181 [Loa loa]
gi|307768064|gb|EFO27298.1| hypothetical protein LOAG_01181 [Loa loa]
Length = 259
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 103/220 (46%), Gaps = 37/220 (16%)
Query: 30 GTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
GTWL + G + LL + +P+ RG L + +VE K + EEN
Sbjct: 53 GTWLCMNATGHISNLLTITVPIHQMKPDSLTRGALPINFVESNK-----------KPEEN 101
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
++ + + L QS++ + P V+GF NS + +P
Sbjct: 102 EYDQYEVA--GLMYQSSDKIEITRYPP-------------------GVHGFSNSP-SYEP 139
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
F+KV G EK + I+ + N K+ S+ +I+ +L L +K +PD ++ R+ +E
Sbjct: 140 FKKVQRGVEKMSGIIEEINSKNLSEGEIIERLLQLATDKYQCFPDDQLKRRC-GRSNELC 198
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
K R + I YGTR+H+II+VDHNN ++E +++
Sbjct: 199 KYRAAVFVRYPDGIPYGTRSHTIIVVDHNNRATYYEKSME 238
>gi|318037563|ref|NP_001187887.1| uncharacterized protein c22orf25 [Ictalurus punctatus]
gi|308324242|gb|ADO29256.1| uncharacterized protein c22orf25 [Ictalurus punctatus]
Length = 301
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 105/230 (45%), Gaps = 21/230 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GG+WL S GKL L N L S P+ RG LV Y+ + YL K
Sbjct: 81 EEGKEGGSWLGISKRGKLTTLTNYLEHKSNPDALGRGFLVSNYMTENVDSFSYLR--KVS 138
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+E ++NGF+++ E + ++ N + K +YG NS
Sbjct: 139 SEGQLYNGFNLLTAEFRASEDTMCYYGNRGSPEPVRLKAAG----------IYGLSNSLL 188
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+ P++K+ GK+ F+ +VNK ++ D ++ N PD + + E
Sbjct: 189 DT-PWRKMQRGKQHFSSVVNKTLPLKAGPGTAASLND----EELNTPDHMQENQGEG-YT 242
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+D+ S VCV YGTRT++IIL+D + + F E T L+ + +W
Sbjct: 243 KDWLKLLSAVCVRAPG--YGTRTNTIILIDSEDNVSFTERTMLNCDVTQW 290
>gi|194895429|ref|XP_001978251.1| GG17797 [Drosophila erecta]
gi|190649900|gb|EDV47178.1| GG17797 [Drosophila erecta]
Length = 334
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 103/208 (49%), Gaps = 27/208 (12%)
Query: 59 RGPLVVKYV---EGQKSAEEYLTDLKKETEENVFNGFHIVLLEL--TLQSTNIYHFSNIA 113
RG +V +V + + S Y L K+ + ++ F+ V +E+ + Q ++ SN+
Sbjct: 141 RGMIVADFVTQADLEHSILNYNESLLKDCTK--YSAFNFVSIEIGSSSQPASVKLLSNVP 198
Query: 114 PLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN 173
P T + YGFGNS S PF+KV GK++F EIV K + KS +
Sbjct: 199 P-----------TLEDFQNGECYGFGNSLPHS-PFEKVRHGKQEFEEIV-KAHGKSSVEA 245
Query: 174 LIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
L ++ L++NK +PD E+ R+A ++ S + V I YG+RTH+++LVD
Sbjct: 246 LTAQLMQLLRNKHKFWPDDELKRRAP-----NWGEGLSALNVHIEDHAYGSRTHTVVLVD 300
Query: 234 HNNVMDFHEWTLDY--ETKKWIHTHIRK 259
N M F E T+ +W THI +
Sbjct: 301 SENKMHFIEETMTGLDPHGEWSRTHIER 328
>gi|226505812|ref|NP_001146588.1| uncharacterized protein LOC100280184 [Zea mays]
gi|219887921|gb|ACL54335.1| unknown [Zea mays]
Length = 266
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 110/222 (49%), Gaps = 21/222 (9%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +G+ L NVL ++ P R RG L +K+++ KS E T++ +
Sbjct: 46 GRDVLGGGTWMGCTKDGRPAFLTNVLEPDAMPGARTRGDLPLKFLQSNKSPLEVATEVAE 105
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF+++L +LT TNI + + P P+T Q ++ N++
Sbjct: 106 EADE--YNGFNLILADLT---TNIMVYVSNRPKGQPAT-------IQLVSPGLHVLSNAR 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
S P+QK I + F E++ + + D+++ ++ + DR
Sbjct: 154 LDS-PWQKAILLGKNFRELLREHGADE------VEVKDIVERLMTDTTKADKDRLPNTGC 206
Query: 203 DEDYKMRYSRVCVDISS--IFYGTRTHSIILVDHNNVMDFHE 242
D +++ S + +++ + YGTR+ +++ V+++ +E
Sbjct: 207 DPNWEHGLSSIFIEVQTDQGPYGTRSTAVLSVNYDGEASLYE 248
>gi|116309703|emb|CAH66750.1| H0409D10.8 [Oryza sativa Indica Group]
gi|116309706|emb|CAH66752.1| OSIGBa0158F05.1 [Oryza sativa Indica Group]
Length = 286
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 118/254 (46%), Gaps = 36/254 (14%)
Query: 19 LLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L+ +D + GGTWL + +GKL L NV + + RG L V++++G + EY
Sbjct: 45 LILGGRDELGGGTWLGCTRDGKLAFLTNVREPGTLVGAKSRGELPVRFLQGNQCPLEYAE 104
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++ KE ++ +NGF++VL + +QS N+ + SN P P V K HV
Sbjct: 105 EIAKEADQ--YNGFNLVLAD--VQSGNMAYISN-RPEGDP------VVQKVLPGFHVLSN 153
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES-----QNLIQNI-LDLMKNKQSNYPDP 192
C P+ K++ + + N+F + Q +++ + +D +K +S PD
Sbjct: 154 AAIDC---PWPKML----RLGQSFNRFLATQDGAEVSLQQMVEELMMDPVKADKSAVPD- 205
Query: 193 EIDRKAEADMDEDYKMRYSRVCVDI--SSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+D D++ + S + +D YGTR+ + + V N + F+E L E+
Sbjct: 206 -------TGVDPDWEYQLSSIFIDTEKGQARYGTRSMTALAVKFNGEVTFYERYL--ESN 256
Query: 251 KWIHTHIRKTLNMA 264
W ++ L M+
Sbjct: 257 LWKENLMQFELEMS 270
>gi|255539238|ref|XP_002510684.1| Ser/Thr-rich protein T10 in DGCR region, putative [Ricinus
communis]
gi|223551385|gb|EEF52871.1| Ser/Thr-rich protein T10 in DGCR region, putative [Ricinus
communis]
Length = 248
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 112/224 (50%), Gaps = 27/224 (12%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DAV GGTWL S +G++ L NVL ++ P R RG L V ++E KS +E+ L KE
Sbjct: 24 RDAVAGGTWLGCSRSGRVAFLTNVLELHALPEARSRGELPVLFLESPKSPKEFAEMLVKE 83
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+ +NGF+++L +++ S ++ + SN P P QE ++ N++
Sbjct: 84 AHQ--YNGFNLILADIS--SKSMVYISN-RPKGEPVV-------VQEVSPGIHVLSNAKL 131
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSE--SQNLIQNIL-DLMKNKQSNYPDPEIDRKAEA 200
S P+ KV K F E ++ + + E + +++ ++ D ++ ++S P
Sbjct: 132 DS-PWPKVQRLKLNFKEQLDTYGGEDEIPVEGMLEKLMRDTVRAEKSGLP---------G 181
Query: 201 DMDEDYKMRYSRVCVDISSIF--YGTRTHSIILVDHNNVMDFHE 242
D++ S + V++ + YGTR+ + + V N + F+E
Sbjct: 182 ICSLDWEHNLSSIFVEVHTPLGCYGTRSTTALTVRANGEVSFYE 225
>gi|125549358|gb|EAY95180.1| hypothetical protein OsI_16997 [Oryza sativa Indica Group]
gi|125591302|gb|EAZ31652.1| hypothetical protein OsJ_15797 [Oryza sativa Japonica Group]
Length = 314
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 118/254 (46%), Gaps = 36/254 (14%)
Query: 19 LLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L+ +D + GGTWL + +GKL L NV + + RG L V++++G + EY
Sbjct: 73 LILGGRDELGGGTWLGCTRDGKLAFLTNVREPGTLVGAKSRGELPVRFLQGNQCPLEYAE 132
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++ KE ++ +NGF++VL + +QS N+ + SN P P V K HV
Sbjct: 133 EIAKEADQ--YNGFNLVLAD--VQSGNMAYISN-RPEGDP------VVQKVLPGFHVLSN 181
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES-----QNLIQNI-LDLMKNKQSNYPDP 192
C P+ K++ + + N+F + Q +++ + +D +K +S PD
Sbjct: 182 AAIDC---PWPKML----RLGQSFNRFLATQDGAEVSLQQMVEELMMDPVKADKSAVPD- 233
Query: 193 EIDRKAEADMDEDYKMRYSRVCVDI--SSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+D D++ + S + +D YGTR+ + + V N + F+E L E+
Sbjct: 234 -------TGVDPDWEYQLSSIFIDTEKGQARYGTRSMTALAVKFNGEVTFYERYL--ESN 284
Query: 251 KWIHTHIRKTLNMA 264
W ++ L M+
Sbjct: 285 LWKENLMQFELEMS 298
>gi|195478351|ref|XP_002100492.1| GE17093 [Drosophila yakuba]
gi|194188016|gb|EDX01600.1| GE17093 [Drosophila yakuba]
Length = 330
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 87/175 (49%), Gaps = 22/175 (12%)
Query: 89 FNGFHIVLLEL--TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
++ F+ V +E+ + Q + SN+ P T + YGFGNS
Sbjct: 168 YSAFNFVSIEIGSSSQPARVKLLSNVPP-----------TLEDFQNGECYGFGNS-LPHT 215
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
PF+KV G+++FA IV K + S + L ++ L++NK +PD E+ R+A ++
Sbjct: 216 PFEKVRHGQQEFAAIV-KAHGASSVETLSAQLMQLLRNKHKFWPDDELKRRAP-----NW 269
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDY--ETKKWIHTHIRK 259
S + V I YG+RTH+++LVD +N M F E T+ +W THI K
Sbjct: 270 GEGLSALNVHIEDHAYGSRTHTVVLVDSDNKMHFIEETMSGLDPLGEWSRTHIEK 324
>gi|357631596|gb|EHJ79065.1| hypothetical protein KGM_15502 [Danaus plexippus]
Length = 274
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 104/223 (46%), Gaps = 22/223 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
Q GTW+A + K LL L + + N + RG +V +V+ + +EY+ ++
Sbjct: 52 QPGSGSGTWMAINPVLKKLGLLLNLPGSKKENVQSRGRIVADFVKSHYNTKEYVEIVRDY 111
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY-GFGNSQ 142
E N F + LE + + TNVT+K E Y GFGNS
Sbjct: 112 ATE--CNDFIFISLEYGGSEPTVNSY-------------TNVTDKIEQWTETYLGFGNS- 155
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+P +KV GKE+ I +K+ + ++ LI+ + L+KN +SN PD +++ +
Sbjct: 156 LPDKPLKKVEAGKEQLKNICHKYCKITDKAELIEELTRLLKNTESNLPDSQLENRQPHLY 215
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
E S + V I YGTR H+I+LV + +D E ++
Sbjct: 216 KE-----LSSIFVSIPKAKYGTRAHTILLVTKSGYVDLIEISM 253
>gi|387929441|ref|ZP_10132118.1| hypothetical protein PB1_13534 [Bacillus methanolicus PB1]
gi|387586259|gb|EIJ78583.1| hypothetical protein PB1_13534 [Bacillus methanolicus PB1]
Length = 255
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 112/233 (48%), Gaps = 34/233 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D KGGTW+ + G+ + N + RP+ + RG LV Y+ G +EYL
Sbjct: 44 AGRDLEKGGTWMGVTRTGRFAAITNYRAPGHDRPDAKSRGFLVSDYLTGTDKPKEYLE-- 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K + + ++NGF++ L + ++Y++S I LD S KV ++G N
Sbjct: 102 KVQQDRGLYNGFNL----LVGDTESLYYYSPI--LDEIS-KVAPA---------IHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL-DLMKNKQSNYPDPEIDRKAE 199
+ P+ K+ GKEK + ++ K + L+ +IL D + + PD + ++ E
Sbjct: 146 A-VLDTPWPKIKKGKEKLTKAIS---YKIIDEALLLSILSDSEEAPEEELPDTGVGKEWE 201
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKW 252
+ + I S YGTR +I+++DH+N + F+E +L E ++W
Sbjct: 202 KMLSPIF----------IKSSQYGTRASTILMIDHDNNIVFNEKSLLPELRQW 244
>gi|225457562|ref|XP_002272189.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region [Vitis vinifera]
gi|297745560|emb|CBI40725.3| unnamed protein product [Vitis vinifera]
Length = 266
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 112/241 (46%), Gaps = 48/241 (19%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D V GGTWLA S G++ L NVL R RG L VK++E +KS EY ++ K
Sbjct: 44 GRDEVAGGTWLACSRQGRVAFLTNVLELRPRLRPFSRGELPVKFLESRKSPHEYAEEITK 103
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF++++ ++ S + + SN + P+ QE ++ N++
Sbjct: 104 EGDE--YNGFNLIVADIA--SKTMVYVSNRPEGEPPTI--------QEISPGIHVLSNAK 151
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILD-LMKNKQSNYPDPEIDRKAEAD 201
P+ KV + KF E++ K+ ES I+ +++ LM++K KA+
Sbjct: 152 -LGIPWHKVQRLRGKFKELLGKYG---ESDIPIKEMIEKLMRDKV----------KADES 197
Query: 202 MDEDYKMRYSRVC-----VDISSIF---------YGTRTHSIILVDHNNVMDFHEWTLDY 247
M +C D SSIF +GTR+ + + V + F+E L+
Sbjct: 198 M-------LPHICALDWECDQSSIFTDTETKMGHFGTRSTAALTVTATGEVTFYETCLEK 250
Query: 248 E 248
E
Sbjct: 251 E 251
>gi|147814820|emb|CAN74747.1| hypothetical protein VITISV_012025 [Vitis vinifera]
Length = 266
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/233 (27%), Positives = 111/233 (47%), Gaps = 32/233 (13%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D V GGTWLA S G++ L NVL R RG L VK++E +KS EY ++ K
Sbjct: 44 GRDEVAGGTWLACSRQGRVAFLTNVLELRPRLRPFSRGELPVKFLESRKSPHEYAEEITK 103
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +E +NGF++++ ++ S + + SN + P+ QE ++ N++
Sbjct: 104 EGDE--YNGFNLIVADIA--SKTMVYVSNRPEGEPPTI--------QEISPGIHVLSNAK 151
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKS-ESQNLIQNIL-DLMKNKQSNYP-----DPEID 195
P+ KV + KF E++ K+ + +I+ ++ D +K +S P D E D
Sbjct: 152 -LGIPWHKVQRLRGKFKELLGKYGESDIPIKEMIEKLMRDTVKADESMLPHICALDWECD 210
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
++ D + KM + +GTR+ + + V + F+E L+ E
Sbjct: 211 -QSSIFTDTETKMGH-----------FGTRSTAALTVTATGEVTFYETCLEKE 251
>gi|17566294|ref|NP_507692.1| Protein Y80D3A.9 [Caenorhabditis elegans]
gi|6425533|emb|CAB60443.1| Protein Y80D3A.9 [Caenorhabditis elegans]
Length = 279
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 108/221 (48%), Gaps = 16/221 (7%)
Query: 18 MLLEVSQDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEE 75
ML V + V GTWL +GK+G+LL++ ++ N RG +V +Y++ +
Sbjct: 41 MLSGVDEKDVARGTWLGMDAHGKVGMLLSITQPVDSKHKNAPSRGGIVNEYLKSGTDTTK 100
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
+ +L++ E+ FNGF V +E +T ++H + S + ++ + + D
Sbjct: 101 FYENLRENAEK--FNGFQFVGVEKN-PTTGLFH------VQSLTNQLVDQIEISKWNDKF 151
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
+ F NS PF+K FG + F E + + S Q + + + ++ + S +PD +I
Sbjct: 152 HVFSNSP-PHVPFKKTEFGLKMFEEKLKNTDEMSVEQ-IFEKLFEIATCRTSCFPDDQI- 208
Query: 196 RKAEADMDEDYKMRYSRVCVDISSI-FYGTRTHSIILVDHN 235
+A+ E + + V I YGTR+H++I+VD N
Sbjct: 209 -RAQTGFPEHIYKPLTSIFVRFPEIRRYGTRSHTLIVVDQN 248
>gi|38345399|emb|CAE03090.2| OSJNBa0017B10.5 [Oryza sativa Japonica Group]
Length = 403
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 117/254 (46%), Gaps = 36/254 (14%)
Query: 19 LLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L+ +D + GGTWL + +GKL L NV + RG L V++++G + EY
Sbjct: 162 LILGGRDELGGGTWLGCTRDGKLAFLTNVREPGTLVGPNVRGELPVRFLQGNQCPLEYAE 221
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++ KE ++ +NGF++VL + +QS N+ + SN P P V K HV
Sbjct: 222 EIAKEADQ--YNGFNLVLAD--VQSGNMAYISN-RPEGDP------VVQKVLPGFHVLSN 270
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES-----QNLIQNI-LDLMKNKQSNYPDP 192
C P+ K++ + + N+F + Q +++ + +D +K +S PD
Sbjct: 271 AAIDC---PWPKML----RLGQSFNRFLATQDGAEVSLQQMVEELMMDPVKADKSAVPD- 322
Query: 193 EIDRKAEADMDEDYKMRYSRVCVDI--SSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+D D++ + S + +D YGTR+ + + V N + F+E L E+
Sbjct: 323 -------TGVDPDWEYQLSSIFIDTEKGQARYGTRSMTALAVKFNGEVTFYERYL--ESN 373
Query: 251 KWIHTHIRKTLNMA 264
W ++ L M+
Sbjct: 374 LWKENLMQFELEMS 387
>gi|334130081|ref|ZP_08503884.1| hypothetical protein METUNv1_00899 [Methyloversatilis universalis
FAM5]
gi|333445117|gb|EGK73060.1| hypothetical protein METUNv1_00899 [Methyloversatilis universalis
FAM5]
Length = 260
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 97/220 (44%), Gaps = 30/220 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D V GGTWL ST+G+ L N RP+ R RG LV ++ G+ A E+
Sbjct: 44 AGRDQVSGGTWLGVSTDGRFAALTNYRDPARQRPDARSRGALVADFLTGRMPAAEHAAAC 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+E + +NGF+++L + + H P SP + + +H+ G
Sbjct: 104 MREAAQ--YNGFNLLLCD-GRDLIWVGHGGGHPPRSSPLPPGLHALS-----NHLPG--- 152
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN---LIQNILDLMKNKQSNYPDPEIDRK 197
P+ K++ +E FA + + +R + L + +LD ++ PD
Sbjct: 153 -----TPWPKLVRAREAFARVTGQGSRAATEARVDALFELLLDPTPAADADLPD------ 201
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNV 237
+ +++ R S V I+ YGTR+ +++++D +
Sbjct: 202 --TGVSLEWERRLSPVF--IAGDDYGTRSRTVLVIDPAAI 237
>gi|386764365|ref|NP_001245654.1| transport and golgi organization 2, isoform B [Drosophila
melanogaster]
gi|254939761|gb|ACT88143.1| IP21164p [Drosophila melanogaster]
gi|383293372|gb|AFH07368.1| transport and golgi organization 2, isoform B [Drosophila
melanogaster]
Length = 340
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 99/208 (47%), Gaps = 27/208 (12%)
Query: 59 RGPLVVKYV---EGQKSAEEYLTDLKKETEENVFNGFHIVLLEL--TLQSTNIYHFSNIA 113
RG +V YV + + S Y L K+ + ++ F+ V +E+ Q + SN+
Sbjct: 147 RGMIVADYVTRADEEHSILNYNERLLKDCTK--YSAFNFVSIEIGSASQPARVKLLSNVP 204
Query: 114 PLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN 173
P T + YGFGNS S PF+KV GK++F IV K + ++ +
Sbjct: 205 P-----------TLEDFQNGECYGFGNSLPHS-PFEKVRHGKQEFEAIV-KAHGEASVET 251
Query: 174 LIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
L ++ L++NK +PD E+ +A ++ S + V I YG+RTHS++LVD
Sbjct: 252 LSAQLMQLLRNKHKFWPDDELKTRAP-----NWGEGLSSLNVHIEEHAYGSRTHSVVLVD 306
Query: 234 HNNVMDFHEWTLDY--ETKKWIHTHIRK 259
N M F E T+ +W THI K
Sbjct: 307 SENKMHFIEETMTGLDPHGEWNKTHIEK 334
>gi|224083298|ref|XP_002306979.1| predicted protein [Populus trichocarpa]
gi|222856428|gb|EEE93975.1| predicted protein [Populus trichocarpa]
Length = 262
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 106/234 (45%), Gaps = 48/234 (20%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DAV GGTWLA S G++ + NVL + P + RG L V ++E KS +E+ L K+
Sbjct: 41 RDAVAGGTWLACSRTGRVAFITNVLELHPFPEAKSRGELPVLFLESTKSPKEFAEGLVKD 100
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+ +NGF+++L +++ S ++ + SN P P QE ++ N++
Sbjct: 101 AHQ--YNGFNLILADIS--SKSMVYLSN-RPEGEPVV-------IQEVSPGLHVLSNAKL 148
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLM-KNKQSNYPDPEIDRKAEADM 202
S P+ KV Q L N+ DL+ K +S P E+ K D
Sbjct: 149 DS-PWHKV--------------------QRLGLNLKDLLGKYGESEIPVKEVLEKLMRDK 187
Query: 203 DEDYKMRYSRVC-----VDISSIF---------YGTRTHSIILVDHNNVMDFHE 242
+ K R +C ++SSIF YGTR+ + + + + F+E
Sbjct: 188 VKADKSRLPGICSIDWEFNLSSIFVEIDTPLGCYGTRSTAALTIGAGGEVSFYE 241
>gi|195352408|ref|XP_002042704.1| GM17622 [Drosophila sechellia]
gi|194126735|gb|EDW48778.1| GM17622 [Drosophila sechellia]
Length = 326
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 118/288 (40%), Gaps = 79/288 (27%)
Query: 28 KGGTWLATSTNG---KLGILLNVLGENSRPNGRD-------------------------- 58
+GGTWLA + K+G LLN+ GE P RD
Sbjct: 56 EGGTWLAIGHSAGFFKVGALLNLTGE---PKPRDAVAHKNILQPHNHNSTGINLCPKPTT 112
Query: 59 --------------------RGPLVVKYV---EGQKSAEEYLTDLKKETEENVFNGFHIV 95
RG +V YV + + S Y L K+ + ++ F+ V
Sbjct: 113 SNSNSYRHPNPNQNPNTNPRRGMIVADYVTRADEEHSILNYNERLLKDCTK--YSAFNFV 170
Query: 96 LLEL--TLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIF 153
+E+ Q + SN+ P T + YGFGNS S PF+KV
Sbjct: 171 SIEIGSASQPARVKLLSNVPP-----------TLEDFQNGECYGFGNSLPHS-PFEKVRH 218
Query: 154 GKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRV 213
GK++F IV K + + + L ++ L++NK +PD E+ +A ++ S +
Sbjct: 219 GKQEFEAIV-KAHGGASVETLSAQLMQLLRNKHKFWPDDELKTRAP-----NWGEGLSAL 272
Query: 214 CVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDY--ETKKWIHTHIRK 259
V I YG+RTH+++LVD N M F E T+ +W THI K
Sbjct: 273 NVHIEEHAYGSRTHTVVLVDSENKMHFIEETMTGLDPHGEWNKTHIEK 320
>gi|339235387|ref|XP_003379248.1| conserved hypothetical protein [Trichinella spiralis]
gi|316978120|gb|EFV61140.1| conserved hypothetical protein [Trichinella spiralis]
Length = 279
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 68/226 (30%), Positives = 112/226 (49%), Gaps = 33/226 (14%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTWL S G++G+LLN++ RG LV +Y++ YL ++ +
Sbjct: 54 GTWLGISRRGRIGVLLNIMEPICSTKAEFSRGNLVKEYLQCTCDPLCYLQEMIVPHAQK- 112
Query: 89 FNGFHIVLLELTLQST----NIYHFSNIAPLDSPSTKVTNVTNKQENKD------HVYGF 138
++GF ++LLE L ++ N++ +SN Q N+ VYGF
Sbjct: 113 YSGFVLLLLERRLLNSRFQWNLFTYSN-----------------QSNRGIESFFTGVYGF 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES-QNLIQNILDLMKNKQSNYPDPEIDRK 197
GNS PF KV G ++F IV K + ++ + ++ ++L+L+K+ + N+PD ++ +
Sbjct: 156 GNS-VPDLPFLKVKRGLKRFENIVLKSHSSTQDPEPVLADLLNLLKDTKENFPDDQLQSQ 214
Query: 198 AEADMDEDYKMRYSRVCVD-ISSIFYGTRTHSIILVDHNNVMDFHE 242
A + + R SR D + YGTRTHSIIL+D + F E
Sbjct: 215 G-AHYETAMQKRLSRCFYDLLPHQEYGTRTHSIILIDGSGRALFVE 259
>gi|311271114|ref|XP_003133059.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 2 [Sus
scrofa]
Length = 236
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 95/186 (51%), Gaps = 17/186 (9%)
Query: 60 GPLVVKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPS 119
G LV ++ + YL + KE +++NGF+++ +L+ + +I + D P
Sbjct: 49 GELVSHFLTTDMDSLSYLKKVSKEG--HLYNGFNLIAADLSTEKGDIICYYGNRGNDKPV 106
Query: 120 TKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL 179
+ YG N+ + P++K+ FGK+ F E V + ++ L+ +L
Sbjct: 107 VL----------EPGTYGLSNALLDT-PWKKLCFGKQLFLEAVGR-SQALPKDALVAQLL 154
Query: 180 DLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMD 239
D++ N+++ PDP I+ + + + + +Y+ VCV S YGTRT+++ILVD + +
Sbjct: 155 DMLNNEEAQLPDPAIEDQGR-EYVQPFLSKYAAVCVRCSD--YGTRTNTVILVDADGHVT 211
Query: 240 FHEWTL 245
F E ++
Sbjct: 212 FTERSM 217
>gi|268558024|ref|XP_002637002.1| Hypothetical protein CBG09494 [Caenorhabditis briggsae]
Length = 276
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 111/224 (49%), Gaps = 34/224 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
QD +G TW + NG++G++L++ E+ PN RG +V ++ +++ + L
Sbjct: 48 QDHARG-TWTGINRNGRIGMMLSITQTQESKIPNAPSRGGIVNSFLNAHNTSDMVESLL- 105
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
++NGF +V LE N + + L + +V N+ Q N D + NS
Sbjct: 106 --IMAGIYNGFQLVCLE-----QNQFGLYEVRTLTN--QQVENIEVCQWN-DEYHVISNS 155
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
S+P+QK ++GK+ F+E + +++ + + + +L++ N++ YPD ++ +
Sbjct: 156 P-PSKPYQKAVYGKQLFSERL-QYSDEMTVDEVFETLLEIAVNRKQCYPDAQLQFQTRNT 213
Query: 202 MDEDYKMRYSRVCVDISSIF---------YGTRTHSIILVDHNN 236
+ Y+R +S+IF YGTR H++I +D NN
Sbjct: 214 HE------YNR---PLSAIFIKYPEGTREYGTRCHTLITIDQNN 248
>gi|133712623|gb|ABO37118.1| cuticular water permeability [Solanum habrochaites]
Length = 269
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 117/230 (50%), Gaps = 29/230 (12%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D V GGTWLA+STNGK L NVL ++ P+ + RG L V++++ KS E+ +L
Sbjct: 44 GKDEVGGGTWLASSTNGKW-FLTNVLEIHTLPHAKTRGDLPVRFLQSNKSPMEFAKELVN 102
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E E +NGF+++L + +++ + + +N P P T QE + ++ N++
Sbjct: 103 EGNE--YNGFNLILAD--IETKKMVYVTN-RPKGEPIT-------IQEVQPGIHVLSNAK 150
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSES---QNLIQNIL-DLMKNKQSNYPDPEIDRKA 198
S P+ K K F ++++ + E +++I+ ++ D K +S P
Sbjct: 151 LDS-PWPKAQRLKLNFKKMMDVYEVNDEKICVKDMIEKLMRDTTKADKSKLP-------- 201
Query: 199 EADMDEDYKMRYSRVCVDISSIF--YGTRTHSIILVDHNNVMDFHEWTLD 246
D+++ S + V++ + + YGTR+ + + ++ + F+E L+
Sbjct: 202 -CICSTDWELELSSIFVEVDTHWGCYGTRSTTALTIEVGGEVSFYELYLE 250
>gi|402702581|ref|ZP_10850560.1| hypothetical protein PfraA_22219 [Pseudomonas fragi A22]
Length = 248
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 96/217 (44%), Gaps = 37/217 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + P+ R RG LV ++ G +S EY D+
Sbjct: 44 AGRDLEAGGTWLGIGANGRFAALTNIREPHKPPSRRSRGELVSDFLSGNQSIGEYFADVG 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFS--NIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + E + GF+++L + ++H++ ++AP Q VYG
Sbjct: 104 RRSVE--YAGFNLLLGT----ADQLWHYNSQDVAP--------------QLLDAGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + AE++N + Q L+ + D S+ PD + E
Sbjct: 144 NAGLDT-PWPKLLKARAALAEVLN----DPQPQALLALLSDRQTAPFSDLPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + V ISS YGTR + ++V+ +
Sbjct: 199 SLLS----------SVFISSPSYGTRASTALIVNADG 225
>gi|399002068|ref|ZP_10704764.1| hypothetical protein PMI21_03352 [Pseudomonas sp. GM18]
gi|398125812|gb|EJM15275.1| hypothetical protein PMI21_03352 [Pseudomonas sp. GM18]
Length = 248
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 33/216 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ P R RG LV +++ G E+YL D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPLQPPAARSRGELVAQFLIGDMPIEDYLNDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + E + GF++ L + ++HF+ T K VYG N+
Sbjct: 104 RRSPE--YAGFNL----LIGNANELWHFN------------ARETEAVRLKPGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ K +E+++ + Q L+ + D + PD + + E
Sbjct: 146 GLDT-PWPKVLKAKAALSEVLD----DPQPQALLALLSDPQTAPLAELPDTGVGQATETL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNV 237
+ V I S YGTR + ++V + V
Sbjct: 201 LS----------SVFIKSQAYGTRASTALIVQADGV 226
>gi|313238942|emb|CBY13931.1| unnamed protein product [Oikopleura dioica]
Length = 269
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 17/213 (7%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENVF 89
G+WL S G++GIL NVL P G RG LV ++ G S EEY +L F
Sbjct: 55 GSWLTISEKGRIGILTNVLVAKPNPTGLARGKLVTEFAAGTASPEEYFRELPLAQ----F 110
Query: 90 NGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQ 149
N F++ L++L +++ ++ S+K N + Y NS + +
Sbjct: 111 NEFNLSLIDLC---SDVGAHGTWMTSNASSSKTVLAENVK------YVMSNSCELNSKWP 161
Query: 150 KVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMR 209
K + F E+ + +R + L++ IL+++++ +S I ++ +++ M+
Sbjct: 162 KAAKLRRNFDEL--ELDRPKD--ELVEKILEMLEDGESCGESELIAEQSCGFVEKAIGMK 217
Query: 210 YSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
YGTRTHS+I+VD N + F E
Sbjct: 218 KYNSVKITGDAKYGTRTHSVIVVDELNRVTFVE 250
>gi|359474959|ref|XP_003631559.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Vitis
vinifera]
gi|297744476|emb|CBI37738.3| unnamed protein product [Vitis vinifera]
Length = 272
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 113/229 (49%), Gaps = 20/229 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GGTWLA S +G+L L NV + P + RG L+V+++E +K+ E+ ++ KE
Sbjct: 45 RDGLAGGTWLACSRDGRLAFLTNVREVHPIPEAKSRGDLIVRFLESKKNPMEFAEEVMKE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
++ +NGF++++ +L S + + +N P +V E ++ N+
Sbjct: 105 ADK--YNGFNLIIADLC--SKTMIYITN-----RPREANVSVV---EVSPGIHVLSNASL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNR-KSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
S P+ K F E+++K+ + ++ +++ LMKN D EI
Sbjct: 153 DS-PWPKARRLGHNFKELLDKYGEGEIPTEEMVEK---LMKN---TIKDDEIVLPRIYPP 205
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ ++++ V D YGTR+ S + V + ++F+E L+ ET +
Sbjct: 206 EREHQLSSIFVDTDTPLGRYGTRSTSSVCVRASGEVNFYEKHLENETWR 254
>gi|358349515|ref|XP_003638781.1| Ser/Thr-rich protein T10 in DGCR region [Medicago truncatula]
gi|355504716|gb|AES85919.1| Ser/Thr-rich protein T10 in DGCR region [Medicago truncatula]
Length = 385
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 123/247 (49%), Gaps = 29/247 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GGTWLA S+ GK+ L NVL ++ P + RG L + +++ K+ +E+ LK+E
Sbjct: 45 RDEIGGGTWLACSSQGKVAFLTNVLELHTCPEAKTRGDLPLMFLKSSKNPKEFAESLKRE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+ +NGF++V+ ++ S ++ + SN P P T QE ++ N++
Sbjct: 105 AQ--YYNGFNLVIADIN--SKSMVYISN-RPKGQPIT-------VQEVPPGLHVLSNAKL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSE--SQNLIQNIL-DLMKNKQSNYPDPEIDRKAEA 200
S P+ K + +F E + K N + E + +I+ ++ D +K +S P+
Sbjct: 153 NS-PWHKAQRLQFRFKEHLAK-NGEGEIHVKEVIKKLMKDKIKADKSMLPNI-------C 203
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKW----IHTH 256
+D ++ + V V+ YGTR+ + + V + + F+E LD + W I H
Sbjct: 204 SLDWEFNLSSIFVEVETPLGVYGTRSSAALTVRSSGKVSFYEDYLD-DDNVWKDHVIDFH 262
Query: 257 IRKTLNM 263
I+K L +
Sbjct: 263 IQKKLKL 269
>gi|336323570|ref|YP_004603537.1| hypothetical protein Flexsi_1318 [Flexistipes sinusarabici DSM
4947]
gi|336107151|gb|AEI14969.1| protein of unknown function DUF833 [Flexistipes sinusarabici DSM
4947]
Length = 261
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 106/231 (45%), Gaps = 32/231 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + +G+ L N G P RG +V +++G Y L
Sbjct: 51 AGRDLQSGGTWLGVTKSGRFAALTNFREGTAKDPKKTSRGLIVKNFLQGNDDPLSYAKQL 110
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ ++N + GF++ L S +Y+FSN A P + + +YG N
Sbjct: 111 --DLKKNRYEGFNL----LFGSSDEMYYFSNRANSLLPIS------------EGIYGLSN 152
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K+ GK F EI+ S I ++ +++++ + PE
Sbjct: 153 ATLDT-PWPKINRGKRLFKEII------SCEDVQIDSLFEMLRDDKK----PEKRELPFT 201
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ E+++ S + V I+ YGTR+ S+IL+D+N+ ++F+E D K+
Sbjct: 202 GVSEEFEKELSPIFVRING--YGTRSSSVILIDYNDNVEFYEKNYDSSGKE 250
>gi|356515804|ref|XP_003526588.1| PREDICTED: LOW QUALITY PROTEIN: ser/Thr-rich protein T10 in DGCR
region-like [Glycine max]
Length = 275
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 109/227 (48%), Gaps = 26/227 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GTWLA ST G++ L NVL ++ P + RG L V +++ K +E+ LK
Sbjct: 45 RDEIAMGTWLACSTQGRVAFLTNVLELHTLPEAKSRGDLPVLFLKSSKKPKEFAESLK-- 102
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
E + +NGF+IV+ ++ S ++ + SN P P T +E ++ N +
Sbjct: 103 LEAHYYNGFNIVVADIV--SKSMVYISN-RPKGQPIT-------IKEVPPGLHVLSNDKL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNR-KSESQNLIQNIL-DLMKNKQSNYPDPEIDRKAEAD 201
S P+ K + + F E V K+ + + +IQ ++ D +K +S+ P
Sbjct: 153 DS-PWHKALRLEFSFKEHVAKYGEGEIPVKEVIQKLMKDKVKADKSSLP---------RI 202
Query: 202 MDEDYKMRYSRVCVDISSI--FYGTRTHSIILVDHNNVMDFHEWTLD 246
D++ S + V++ + YGTR+ + + V +F+E LD
Sbjct: 203 CSPDWEFNLSSIFVEVETPLGLYGTRSSAALTVRSRGEANFYEVYLD 249
>gi|17563216|ref|NP_506243.1| Protein R186.1 [Caenorhabditis elegans]
gi|3879196|emb|CAB01439.1| Protein R186.1 [Caenorhabditis elegans]
Length = 276
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 109/237 (45%), Gaps = 19/237 (8%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPN--GRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
GTWL + NG++G++L++ N RG +V +++ +++ + LKK +
Sbjct: 53 GTWLGLNENGRIGMMLSITQTQESKNLHAPSRGGIVNEFLNANDTSK-MIESLKKCASK- 110
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
+NGF +V +E ST +Y +A +V + KD + NS T +P
Sbjct: 111 -YNGFQLVAVEKN--STGLYEVRTLANQQVDEIEVCQL------KDEYHVVSNSPPT-KP 160
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
+QK + GK+ E + ++ S Q + + +L + KN YPD ++ + + E+Y
Sbjct: 161 YQKAVQGKKLLREHLENSDQFSVDQ-IFEKLLSIAKNTTQWYPDAQLQYQTQNV--EEYN 217
Query: 208 MRYSRVCVDI--SSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIRKTLN 262
S + + + YGTR H++I VD + ++ E L E W LN
Sbjct: 218 RPLSAIFIKYPEGTRMYGTRCHTLITVDQKDKINILERRLLPEQSTWHDARFEFVLN 274
>gi|156379202|ref|XP_001631347.1| predicted protein [Nematostella vectensis]
gi|156218386|gb|EDO39284.1| predicted protein [Nematostella vectensis]
Length = 290
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 112/230 (48%), Gaps = 31/230 (13%)
Query: 28 KGGTWLATSTNGKLGILLNVLG--EN--SRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+GGTWL + +G+ G L N EN S + RG LV +++ S +YL ++++
Sbjct: 56 EGGTWLGFTKSGRFGCLTNYRQSLENYASHKDAIGRGFLVSDFLKDDISTSDYLRNIQQN 115
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+ +NGF++++ EL S +K+ N ++ + GN
Sbjct: 116 GSK--YNGFNLLVGELCSDS---------------ESKIEWYCNCEDKQIKTLSSGNYAL 158
Query: 144 TSQ----PFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYP---DPEIDR 196
++ P+ KV+ G+++F +I+ K + Q L+ +L L++ Q +P D
Sbjct: 159 CNRTLNFPWPKVVHGQKRFEDILQK---SMDKQALLDGLLGLLQENQRYFPGEDDSNYCT 215
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ ++D+DYK + V + I YGTRTH++ILVD + + E T+D
Sbjct: 216 LLDKELDQDYKDACRSIFVQHNEIGYGTRTHTVILVDEAGHVTYMEKTMD 265
>gi|90084387|dbj|BAE91035.1| unnamed protein product [Macaca fascicularis]
Length = 159
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 68/118 (57%), Gaps = 6/118 (5%)
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
YG N+ + P++K+ FGK+ F E V + ++ LI N+LD++ N+++ PDP I+
Sbjct: 36 YGLSNALLET-PWRKLCFGKQLFLEAVGR-SQALPKDVLIANLLDVLNNEEAQLPDPAIE 93
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + +Y+ VCV YGTRT++IILVD + + F E + +D + +W
Sbjct: 94 DQG-GEYVQPVLSKYAAVCVRCPG--YGTRTNTIILVDADGRVTFTERSMMDKDLSRW 148
>gi|198467241|ref|XP_001354319.2| GA10818 [Drosophila pseudoobscura pseudoobscura]
gi|198149406|gb|EAL31372.2| GA10818 [Drosophila pseudoobscura pseudoobscura]
Length = 326
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 124/284 (43%), Gaps = 68/284 (23%)
Query: 28 KGGTWLAT-STNG--KLGILLNVLGE---------------------------------- 50
+GGTWLA S +G K+G LLN+ GE
Sbjct: 56 EGGTWLAIGSASGVFKVGALLNLTGEPKPRDAVAHKNILQPNNQNNHNNSHSSSNMSNIS 115
Query: 51 -NSRPNGRD-------RGPLVVKYVEGQKSAEEYL----TDLKKETEENVFNGFHIVLLE 98
+ RP D RG +V +V L + L+ T+ + FN I +
Sbjct: 116 NSKRPENNDNEHFLLGRGMIVADFVSQFDEPNSILNFNQSLLQDCTKYSAFNFVSIQIGP 175
Query: 99 LTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKF 158
+L + I SN+ P + + +E K YGFGNS PF+KV FG+++F
Sbjct: 176 KSLPA-QIRLLSNVPP---------TLNDFEEGK--CYGFGNS-LPGTPFEKVRFGQQQF 222
Query: 159 AEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDIS 218
++IV + + + L +L L+KNK+ +PD E+ R+A ++ S + V I
Sbjct: 223 SDIVEEHGTGT-VEALSAQLLMLLKNKKKFWPDAELKRRAP-----NWGEELSALNVHIG 276
Query: 219 SIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIRKTLN 262
YG+RTHSIILVD NN M F E D E ++R ++
Sbjct: 277 DRGYGSRTHSIILVDTNNKMHFFEENFDPEINSLEPCNLRTVVH 320
>gi|195173839|ref|XP_002027692.1| GL22571 [Drosophila persimilis]
gi|194114638|gb|EDW36681.1| GL22571 [Drosophila persimilis]
Length = 329
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 126/287 (43%), Gaps = 71/287 (24%)
Query: 28 KGGTWLAT-STNG--KLGILLNVLGE---------------------------------- 50
+GGTWLA S +G K+G LLN+ GE
Sbjct: 56 EGGTWLAIGSASGVFKVGALLNLTGEPKPRDAVAHKNILQPNNQNNHNNSHSHSHSHSSS 115
Query: 51 ----NSRPNGRD-------RGPLVVKYVEGQKSAEEYLTDLKKETEENV--FNGFHIVLL 97
+ RP D RG +V +V Q + + + ++ ++ F+ V +
Sbjct: 116 NMSNSKRPENNDNEHFLLGRGMIVADFVS-QFDEPNSILNFNQSLLQDCTKYSAFNFVSI 174
Query: 98 ELTLQS--TNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGK 155
E+ +S I SN+ P + + +E K YGFGNS PF+KV FG+
Sbjct: 175 EIGPKSLPAQIRLLSNVPP---------TLNDFEEGK--CYGFGNS-LPGTPFEKVRFGQ 222
Query: 156 EKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCV 215
++F++IV + + + L +L L+KNK+ +PD E+ R+A ++ S + V
Sbjct: 223 QQFSDIVEEHGTGT-VEALSAQLLMLLKNKKKFWPDAELKRRAP-----NWGEELSALNV 276
Query: 216 DISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIRKTLN 262
I YG+RTHSIILVD NN M F E D E ++R ++
Sbjct: 277 HIGDRGYGSRTHSIILVDTNNKMHFFEENFDPEINSLEPCNLRTVVH 323
>gi|359806332|ref|NP_001240971.1| uncharacterized protein LOC100820362 [Glycine max]
gi|255635670|gb|ACU18184.1| unknown [Glycine max]
Length = 273
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 108/225 (48%), Gaps = 22/225 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GTWLA ST G++ L NVL ++ P + RG L V +++ K +E+ LK
Sbjct: 45 RDEIAMGTWLACSTQGRVAFLTNVLELHTLPEAKSRGDLPVLFLKSSKKPKEFAESLK-- 102
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+E + +NGF++++ ++ S ++ + SN P P T QE ++ N +
Sbjct: 103 SEAHYYNGFNLIVADIV--SNSMVYISN-RPKGQPIT-------IQEVPPGLHVLSNDKL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNR-KSESQNLIQNIL-DLMKNKQSNYPDPEIDRKAEAD 201
S P+ K + + F E V K + + +IQ ++ D +K +++ P
Sbjct: 153 DS-PWHKALRLEFSFKEHVAKHGEGEIPVKEVIQKLMKDTVKADKNSLP-------RICS 204
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+D ++ + V V+ YGTR+ + + V + F+E LD
Sbjct: 205 LDWEFNLSSIFVEVETPLGLYGTRSSAALTVRSSGEASFYEVYLD 249
>gi|308500906|ref|XP_003112638.1| hypothetical protein CRE_30994 [Caenorhabditis remanei]
gi|308267206|gb|EFP11159.1| hypothetical protein CRE_30994 [Caenorhabditis remanei]
Length = 276
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 115/247 (46%), Gaps = 33/247 (13%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAE 74
E+L V + GTW + NG++G++L++ E+ N RG +V +++ ++
Sbjct: 40 EILSGVDEQDEARGTWTGLTKNGRIGMMLSITQTQESKDFNAPSRGGIVSSFLKANDTS- 98
Query: 75 EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-- 132
E + LKK+ + +NGF +V LE DS V +TN+Q +K
Sbjct: 99 EMMESLKKDAGK--YNGFQLVGLEQN---------------DSGLYDVKTLTNQQVDKIE 141
Query: 133 -----DHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS 187
D + NS T +P+QK I+G+ +E + N S Q + +N++ + +K
Sbjct: 142 VCEWNDEYHVISNSPLT-KPYQKAIYGRRLISERLRDSNEMSVEQ-VFENLMSIATDKTQ 199
Query: 188 NYPDPEIDRKAEADMDEDYKMRYSRVCVDI--SSIFYGTRTHSIILVDHNNVMDFHEWTL 245
YPD ++ + + + DE Y S + + + YGTR H+++ ++ ++ + E
Sbjct: 200 CYPDAQLQFQTQ-NTDE-YNQPLSAIFIKYPEGTREYGTRCHTLLTINQDDHVSILERRF 257
Query: 246 DYETKKW 252
E W
Sbjct: 258 LPEESTW 264
>gi|423098136|ref|ZP_17085932.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
Q2-87]
gi|397885644|gb|EJL02127.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
Q2-87]
Length = 248
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/213 (22%), Positives = 90/213 (42%), Gaps = 33/213 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL +G+ L N+ P + RG LV +++ G EYL++
Sbjct: 42 VHAGRDLEAGGTWLGVGADGRFAALTNIRDPGQLPAFKSRGELVARFLSGNLPIAEYLSE 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E F GF+++L + T ++HF+ T Q+ + VYG
Sbjct: 102 VVPRAGE--FGGFNLLLGD----GTQLWHFN------------ARDTRPQQLAEGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + E+++ + + + L+ + D + PD + E
Sbjct: 144 NAGLNT-PWPKLLKARAALVEVLD----EPQPEALLALLNDPQPAPVAELPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 199 TLLS----------SVFIASPAYGTRASTALIV 221
>gi|357475105|ref|XP_003607838.1| Ser/Thr-rich protein T10 in DGCR region [Medicago truncatula]
gi|355508893|gb|AES90035.1| Ser/Thr-rich protein T10 in DGCR region [Medicago truncatula]
Length = 275
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 114/235 (48%), Gaps = 33/235 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GGTWLA+S NG+L L N + P RG L +++++ KS EE+ ++ KE
Sbjct: 55 RDVLCGGTWLASSKNGRLAFLTNFRELQNIPQSNTRGDLPLRFLQSDKSPEEFAEEVLKE 114
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+++NGF++VL ++ ST +Y F + P+ +VT ++ N+
Sbjct: 115 A--HLYNGFNLVLADIC-TSTMVYVF------NRPNHGYLSVTPG------IHVLTNASL 159
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILD-LMKNKQSNYPDPEIDRKA--EA 200
+ P+ K + F E+V+++ + + I+ +++ LM N + D K
Sbjct: 160 DA-PWSKAERLRHSFKELVDQYG---DGEFPIKEMVEKLMTNTVKD------DDKCLLPG 209
Query: 201 DMDEDYKMRYSRVCVDI---SSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKW 252
+++ S + VD S YGTR+ S + V N + F+E LD K+W
Sbjct: 210 IRPPEFEFPLSSIFVDTQFPSVGPYGTRSTSALFVTSNKEVTFYEKHLD--QKQW 262
>gi|329910245|ref|ZP_08275263.1| hypothetical protein IMCC9480_115 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546231|gb|EGF31269.1| hypothetical protein IMCC9480_115 [Oxalobacteraceae bacterium
IMCC9480]
Length = 241
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/225 (23%), Positives = 96/225 (42%), Gaps = 29/225 (12%)
Query: 19 LLEVSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYL 77
L+ +D GGTW+ + G+ L NV R + R RG L ++ G +A+ Y+
Sbjct: 41 LVYAGRDLQGGGTWMGVTQTGRFAALTNVRAPGEQRADARSRGELAADFLTGTMTAKAYV 100
Query: 78 TDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYG 137
L ++ ++ +NGF++++ + + +SN A D V N Q +YG
Sbjct: 101 AQLAQDDDD--YNGFNLLVGD----RHELIWYSNRAADD--------VRNGQAITPGIYG 146
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N++ + P+ KV+ K +FA ++ + + + + D + P + +
Sbjct: 147 LSNARLDT-PWPKVVRTKAEFASLLCQ---GATDDAFFEMLTDTRRATDCRLPKTGLSLE 202
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + S VC+D YGTR +I +D + E
Sbjct: 203 LERAL--------SAVCIDTPD--YGTRASTIARLDTTDRASLSE 237
>gi|388490624|gb|AFK33378.1| unknown [Lotus japonicus]
gi|388512725|gb|AFK44424.1| unknown [Lotus japonicus]
Length = 271
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 110/225 (48%), Gaps = 22/225 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GTWLA S G++ L NVL ++ P + RG LVV +++ ++ +E+ LK
Sbjct: 45 RDEIAKGTWLACSREGRIACLTNVLEVHTLPEAKTRGDLVVSFLKSKEHPKEFSESLK-- 102
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
T+ + +NGF++++ +++ S ++ + +N P + + +E ++ N+
Sbjct: 103 TKAHYYNGFNLIVADIS--SKSMVYITN-----RPKRQSMTI---EEVSPGLHVLTNASL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKS-ESQNLIQNIL-DLMKNKQSNYPDPEIDRKAEAD 201
S P+ K + + F + + K+ + LI ++ D K ++S P
Sbjct: 153 DS-PWHKALRLEVGFKKQLAKYGVGDIPVKELIHKLMNDRTKAEESRLPHI-------CS 204
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+D +Y + V V+ YGTR+ + ++V N ++F E LD
Sbjct: 205 LDWEYDLSPIFVEVETPLGLYGTRSSAAVIVRWNGEVNFFEAYLD 249
>gi|384246671|gb|EIE20160.1| DUF833-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 280
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 100/232 (43%), Gaps = 37/232 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + G+ +L N G RG L V ++ G++S EYL L
Sbjct: 44 AGRDLKGGGTWLGITKTGRFALLTNFREPGFGSVKGTSRGALTVDFLRGEQSPLEYLKGL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ FNG ++++ +L +S + + +N K+ + + QE +YG N
Sbjct: 104 NAQ----AFNGVNLIVGDLKAKS--VAYLTN-------RGKIEELKHPQELPAGLYGISN 150
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + KV+ GKEK + + +I D+M +++ D ++
Sbjct: 151 GVLGDR-WVKVVRGKEKLSSLEGDLAEGHVPWEIIMG--DIMGDRERVTDDAQLP----- 202
Query: 201 DMDEDYKMRYSRVCVDISSIF----------YGTRTHSIILVDHNNVMDFHE 242
D Y R+ +SSIF YGTR+ ++++V + ++F E
Sbjct: 203 --DTGIPAHYERI---LSSIFVEPAEMPDGPYGTRSQTVVVVWRDGRVEFRE 249
>gi|356552868|ref|XP_003544784.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Glycine
max]
Length = 270
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 102/223 (45%), Gaps = 18/223 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GGTWLA S G++ L NVL S P + RG L V +++ K +E+ LK E
Sbjct: 45 RDEIAGGTWLACSREGRVAFLTNVLELRSLPEAKSRGDLPVSFLKSGKHPKEFAESLKME 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+ +NGF++++ ++ S + + SN +P P T +E ++ N++
Sbjct: 105 A--HYYNGFNLIVADIP--SKCMVYISN-SPKGQPIT-------IKEVSPGLHVLSNAKL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
S+ + K + F E + K+ + + L K K N P I +D
Sbjct: 153 DSK-WHKAQRLEVGFKEQLAKYGEGEIPVKEVVHKLMKDKTKADNSHLPHI-----CSLD 206
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
++ + V V+ YGTR+ + ++V + + F E LD
Sbjct: 207 WEFNLSSIFVEVETPLGLYGTRSSAALIVTSSEEVSFFEAYLD 249
>gi|408479572|ref|ZP_11185791.1| hypothetical protein PsR81_03394 [Pseudomonas sp. R81]
Length = 248
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 92/215 (42%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + G+ L N+ + P + RG LV +++ G E+YL D+
Sbjct: 44 AGRDQEAGGTWLGVNAEGRFAALTNIRDPHQPPARKSRGELVARFLNGSLPVEQYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ N T + K VYG N+
Sbjct: 104 GRSIE--YAGFNL----LVGTPGELWHYN------------ANHTEPTQLKAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K +E++N + + L++ + D + PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALSELLN----DPQPEALLEILSDPQIAPFAELPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVNADG 225
>gi|398911940|ref|ZP_10655736.1| hypothetical protein PMI29_01561 [Pseudomonas sp. GM49]
gi|398183095|gb|EJM70591.1| hypothetical protein PMI29_01561 [Pseudomonas sp. GM49]
Length = 248
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 94/220 (42%), Gaps = 43/220 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + P R RG LV +++ G+ S ++YL+D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPHQPPGRRSRGELVARFLTGEVSIDDYLSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DHVY 136
+ E + GF++ L + ++HF N +E++ V+
Sbjct: 104 ARSSE--YGGFNL----LVGNANELWHF-----------------NARESEAVMLPAGVH 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ K +E+++ + Q L+ + D + + PD +
Sbjct: 141 GLSNAGLDT-PWPKLLKAKAALSEVLD----DPQPQALLALLSDSQQAPFAELPDTGVGL 195
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 196 ATETLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|398871024|ref|ZP_10626342.1| hypothetical protein PMI34_01532 [Pseudomonas sp. GM74]
gi|398206859|gb|EJM93617.1| hypothetical protein PMI34_01532 [Pseudomonas sp. GM74]
Length = 248
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 93/220 (42%), Gaps = 43/220 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + P R RG LV +++ G+ S ++YL+D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPHQPPGRRSRGELVAQFLTGEVSIDDYLSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DHVY 136
+ E + GF++ L + ++HF N +E++ V+
Sbjct: 104 ARSPE--YGGFNL----LVGNANELWHF-----------------NARESEAVMLPAGVH 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ K +E+++ + Q L+ + D + PD +
Sbjct: 141 GLSNAGLDT-PWPKLLKAKAALSEVLD----DPQPQTLLALLSDSQPAPVAELPDTGVGL 195
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 196 ATETLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|356577141|ref|XP_003556686.1| PREDICTED: uncharacterized protein C22orf25-like [Glycine max]
Length = 268
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 111/231 (48%), Gaps = 21/231 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA+ GGTWL ++ +G++ L N N + RG L +++++G KS EE+ + KE
Sbjct: 45 KDALGGGTWLGSTRDGRIAFLTNFREVEMLSNPKTRGDLPLRFLQGNKSPEEFAEQVVKE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+E +NGF++VL ++ S+ +Y F N D S VT ++ N+
Sbjct: 105 ADE--YNGFNLVLADIC-TSSMVYVF-NRPNQDHLSLAQVVVT------PGIHVLTNAAL 154
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILD-LMKNKQSNYPDPEIDRKAEADM 202
+ P+ K + F E ++++ ES I+ +++ LM N D E
Sbjct: 155 DA-PWPKAERLRHNFKEFIDQYG---ESDFPIKEMVEKLMTN---TVKDEECMLPGIHPP 207
Query: 203 DEDYKMRYSRVCVDISSI-FYGTRTHSIILVDHNNVMDFHEWTLDYETKKW 252
+ + + V ++SS YGTR+ S + V N + F+E L E K+W
Sbjct: 208 EREQPLSSIFVEAELSSSGHYGTRSSSALFVKSNKEVTFYEKYL--EKKQW 256
>gi|268564823|ref|XP_002647227.1| Hypothetical protein CBG23817 [Caenorhabditis briggsae]
Length = 276
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 107/222 (48%), Gaps = 18/222 (8%)
Query: 18 MLLEVSQDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEE 75
+L V + V GTWL + GK+G+LL++ + N RG +V +++ + + E
Sbjct: 41 ILSGVDEKDVARGTWLGINPRGKIGMLLSITQPIDTKHTNAPSRGGIVNDFLKTE--SLE 98
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
+L +LK + E+ FNGF V +E +T ++ ++ +V K E+K HV
Sbjct: 99 FLENLKIKAEQ--FNGFQFVGIEKN-ANTGLFEVRSLTNQLVRDVEVV----KWEDKFHV 151
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
Y ++ PF+K +G E F + + + + E + + + ++ + +PD +I
Sbjct: 152 Y---SNSPPHIPFKKAEYGLELFQKSLEN-SDQLEREEIFGRLFEIATCQTQCFPDEQI- 206
Query: 196 RKAEADMDEDYKMRYSRVCVDISSI-FYGTRTHSIILVDHNN 236
+ + D + + V YGTR+H+IILVD N+
Sbjct: 207 -RVQTGFPGDVYRPLTSIFVRFPETRRYGTRSHTIILVDRND 247
>gi|398857113|ref|ZP_10612816.1| hypothetical protein PMI36_00708 [Pseudomonas sp. GM79]
gi|398241427|gb|EJN27079.1| hypothetical protein PMI36_00708 [Pseudomonas sp. GM79]
Length = 248
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 88/215 (40%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ P + RG LV +++ G S ++YL D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPLQPPARKSRGELVAQFLSGDMSIDDYLKDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L + ++HF + + T Q +YG N+
Sbjct: 104 GRSPE--YAGFNL----LIGNANELWHF---------NARETEAVMLQPG---IYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ K +E++ + Q L+ + D + PD + + E
Sbjct: 146 GLDT-PWPKVLKAKAALSEVLG----DPQPQALLALLSDPQTAPLAELPDTGVGQATETL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I S YGTR + ++V +
Sbjct: 201 LS----------SVFIKSPTYGTRASTALIVQADG 225
>gi|390458605|ref|XP_002743595.2| PREDICTED: uncharacterized protein C22orf25 isoform 1 [Callithrix
jacchus]
Length = 252
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 81/169 (47%), Gaps = 24/169 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDRQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 VEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS 187
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIADLLDVLNNEEA 202
>gi|221042284|dbj|BAH12819.1| unnamed protein product [Homo sapiens]
Length = 178
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 67/118 (56%), Gaps = 6/118 (5%)
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
YG N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++ PDP I+
Sbjct: 55 YGLSNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQLPDPAIE 112
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + +Y+ VCV YGTRT++IILVD + + F E + +D + W
Sbjct: 113 DQG-GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTERSMMDKDLSHW 167
>gi|398868555|ref|ZP_10623952.1| hypothetical protein PMI35_05909 [Pseudomonas sp. GM78]
gi|398233009|gb|EJN18957.1| hypothetical protein PMI35_05909 [Pseudomonas sp. GM78]
Length = 248
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/223 (21%), Positives = 95/223 (42%), Gaps = 43/223 (19%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G+ ++YL D
Sbjct: 42 VHAGRDLEAGGTWLGVGANGRFAALTNIRDPHRPPGRKSRGELVARFLTGEVPIDDYLDD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DH 134
+ + E + GF++ L + ++HF N +E++
Sbjct: 102 VVTRSPE--YGGFNL----LIGNTNELWHF-----------------NARESEAVMLPPG 138
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
VYG N+ + P+ K++ K +E+++ + Q+L+ + D + ++ PD +
Sbjct: 139 VYGLSNAGLDT-PWPKLLKAKAALSEVLD----DPQPQSLLALLSDSQQAPFADLPDTGV 193
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNV 237
E + V I+S YGTR + ++V +
Sbjct: 194 GLATETLLS----------SVFIASQSYGTRASTALIVQADGA 226
>gi|398839036|ref|ZP_10596286.1| hypothetical protein PMI18_01620 [Pseudomonas sp. GM102]
gi|398113666|gb|EJM03510.1| hypothetical protein PMI18_01620 [Pseudomonas sp. GM102]
Length = 248
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 86/215 (40%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ P + RG LV +++ G S ++YLTD+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPLQPPARKSRGELVAQFLSGDMSIDDYLTDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L + ++HF+ T +YG N+
Sbjct: 104 GRSPE--YAGFNL----LIGNANELWHFN------------ARETEAVMLPPGIYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ K E++ + Q L+ + D + PD + + E
Sbjct: 146 GLDT-PWPKVLKAKAALNEVLG----DPQPQALLALLSDPQTAPLAELPDTGVGQATETL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I S YGTR + ++V +
Sbjct: 201 LS----------SVFIKSPTYGTRASTALIVQADG 225
>gi|398893452|ref|ZP_10646166.1| hypothetical protein PMI31_04014 [Pseudomonas sp. GM55]
gi|398184352|gb|EJM71808.1| hypothetical protein PMI31_04014 [Pseudomonas sp. GM55]
Length = 248
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/218 (22%), Positives = 88/218 (40%), Gaps = 33/218 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G+ + YL D
Sbjct: 42 VHAGRDLEAGGTWLGVGANGRFAALTNIRDPHRPPGRKSRGELVARFLTGEIPIDHYLAD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + E + GF++ L + ++HF+ +N VYG
Sbjct: 102 VVDHSPE--YGGFNL----LIGNTHELWHFN------------ARESNALMLAPGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ KV+ K E+++ + Q L+ + D + PD + E
Sbjct: 144 NAGLDT-PWPKVLKAKAALREVLD----DPQPQALLALLSDAQTAPFAELPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNV 237
+ V I+S YGTR + ++V +
Sbjct: 199 TLLS----------SVFIASQSYGTRASTALIVQADGA 226
>gi|381209151|ref|ZP_09916222.1| hypothetical protein LGrbi_04378 [Lentibacillus sp. Grbi]
Length = 256
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 97/225 (43%), Gaps = 37/225 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D V GTWL + +G++ L N N + N + RG +V ++E + T L
Sbjct: 44 AGRDLVGMGTWLGITKSGRIAALTNYRDPNEDQQNKKSRGHIVKDFLESSMPVAGFFTGL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIA---PLDSPSTKVTNVTNKQENKDHVYG 137
K E +NV+ GF+ + +Y++SNI + SP T +G
Sbjct: 104 KHE--KNVYGGFNFI----AGNPEQLYYYSNIEDSISMVSPGT---------------HG 142
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N + P+ KV+ GKE+ A + + N E+ L I D + PD +
Sbjct: 143 LSN-EFLDTPWPKVMKGKERLASYLRQ-NATIETDPLFDIISDAEQAPDELLPDTGVGL- 199
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
D++ + S + + YGTR +++LVD NN + F E
Sbjct: 200 -------DFERKLSPLFIKTPE--YGTRCSTVLLVDRNNQVTFIE 235
>gi|332859156|ref|XP_003317148.1| PREDICTED: uncharacterized protein C22orf25 homolog [Pan
troglodytes]
gi|397485975|ref|XP_003814111.1| PREDICTED: uncharacterized protein C22orf25 homolog isoform 4 [Pan
paniscus]
gi|21751093|dbj|BAC03902.1| unnamed protein product [Homo sapiens]
Length = 214
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 63/114 (55%), Gaps = 5/114 (4%)
Query: 129 QENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSN 188
Q YG N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++
Sbjct: 84 QARGRGTYGLSNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQ 141
Query: 189 YPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
PDP I+ + + + +Y+ VCV YGTRT++IILVD + + F E
Sbjct: 142 LPDPAIEDQG-GEYVQPMLSKYAAVCVRCPG--YGTRTNTIILVDADGHVTFTE 192
>gi|425902171|ref|ZP_18878762.1| PF05742 family protein [Pseudomonas chlororaphis subsp.
aureofaciens 30-84]
gi|397881648|gb|EJK98137.1| PF05742 family protein [Pseudomonas chlororaphis subsp.
aureofaciens 30-84]
Length = 248
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 89/211 (42%), Gaps = 33/211 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL +G+ L N+ + P R RG LV +++ + S ++Y +D+
Sbjct: 44 AGRDLEAGGTWLGLGADGRFAALTNIRNPHQPPARRSRGELVARFLSSKMSIDDYFSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ V Q+ VYG N+
Sbjct: 104 GRSLE--YAGFNL----LVGTPDQLWHYN------------AQVREPQQLGSGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K E+++ + Q L+ + D ++ PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALQEVLD----DPQPQALLALLSDAQTAPFADLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 201 LS----------SVFIASPSYGTRASTALIV 221
>gi|340727962|ref|XP_003402302.1| PREDICTED: uncharacterized protein C22orf25 homolog [Bombus
terrestris]
Length = 169
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 64/117 (54%), Gaps = 15/117 (12%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPN--GRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
+GGTWLA S GK G++LN+ E S N + RG LV +V SA YL L K+
Sbjct: 56 EGGTWLAMSLTGKAGVVLNLSNEASSTNIPKQGRGFLVPNFVTSNDSALSYLDKLYKKNN 115
Query: 86 EN-VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
EN ++N F +VL++ LQ+ ++ + S+ N T ++D++ GFGNS
Sbjct: 116 ENQIYNPFILVLID--LQNADVKYLSS----------SHNSTGPNSSQDNILGFGNS 160
>gi|426412067|ref|YP_007032166.1| hypothetical protein PputUW4_05171 [Pseudomonas sp. UW4]
gi|426270284|gb|AFY22361.1| hypothetical protein PputUW4_05171 [Pseudomonas sp. UW4]
Length = 248
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/222 (22%), Positives = 92/222 (41%), Gaps = 43/222 (19%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G+ +EYL D
Sbjct: 42 VHAGRDLEAGGTWLGVGANGRFAALTNIRDPHRPPGRKSRGELVARFLAGEMPIDEYLAD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DH 134
+ + E + GF++ L ++HF N++E++
Sbjct: 102 VVDRSLE--YGGFNL----LVGNPHELWHF-----------------NQRESEAVMLAPG 138
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
VYG N+ + P+ K++ + +E+++ + Q L+ + D + PD +
Sbjct: 139 VYGLSNAGLDT-PWPKLLKARAALSEVLD----DPQPQALLALLSDAQTAPFAELPDTGV 193
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 194 GLATETLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|224131884|ref|XP_002328132.1| predicted protein [Populus trichocarpa]
gi|222837647|gb|EEE76012.1| predicted protein [Populus trichocarpa]
Length = 262
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 106/225 (47%), Gaps = 22/225 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D + GGTWL + +GK+ + NV S P + RG L ++++E K+ +EY +L KE
Sbjct: 45 KDELAGGTWLGCTRDGKIAFITNVREVKSIPQAKSRGDLTLRFLESNKNPKEYAEELSKE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
++ +NGF+++L +++ +S +Y + P + +VT HV N+
Sbjct: 105 ADQ--YNGFNLILADISSKSM-VYLTNRPKPENFIVMEVTPGM-------HV--LSNASL 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL--DLMKNKQSNYPDPEIDRKAEAD 201
S P+ K F +++ K++ + IL + +K+ +S P
Sbjct: 153 DS-PWPKAQRLGHGFKDLLEKYDEAELPTKEMAEILMTNTIKDDESMLP-------GIYP 204
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ ++++ + D YGTR+ + V + ++F+E LD
Sbjct: 205 SEREHQLSSIFIEADTPLGRYGTRSTCALSVKSSGEVNFYERYLD 249
>gi|402593989|gb|EJW87916.1| hypothetical protein WUBG_01174 [Wuchereria bancrofti]
Length = 269
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/230 (25%), Positives = 97/230 (42%), Gaps = 47/230 (20%)
Query: 30 GTWLATSTNGK----LGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETE 85
GTWL + G L I + + S RG L V +V+ K+ + + E
Sbjct: 53 GTWLCMNATGSISNLLAITVPIHQMKSSSLTLTRGALPVDFVKSNKNRKNFANYWSIPNE 112
Query: 86 ENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTS 145
+ + + LT Q + + P ++GF NS +
Sbjct: 113 YDQYE-----IAGLTYQPDDKIEITRYPP-------------------GIHGFSNSP-SC 147
Query: 146 QPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKNKQSNYPDPEIDRKAEADMDE 204
+PF+KV G EK I+ + N + S+ +I +L L +K YPD ++ R+
Sbjct: 148 EPFKKVQRGTEKVNVIIEEINSEDLSETEIIGRLLQLATDKYQCYPDDQLKRRCG----- 202
Query: 205 DYKMRYSRVCVDISSIF--------YGTRTHSIILVDHNNVMDFHEWTLD 246
R S +C ++IF YGTR+H+II+VD NN ++E +++
Sbjct: 203 ----RSSELCKYRAAIFVRYPDGIPYGTRSHTIIIVDRNNRATYYEKSME 248
>gi|221045428|dbj|BAH14391.1| unnamed protein product [Homo sapiens]
Length = 252
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 81/169 (47%), Gaps = 24/169 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS 187
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEA 202
>gi|399522759|ref|ZP_10763422.1| Ser/Thr-rich protein T10 in DGCR region [Pseudomonas
pseudoalcaligenes CECT 5344]
gi|399109623|emb|CCH39983.1| Ser/Thr-rich protein T10 in DGCR region [Pseudomonas
pseudoalcaligenes CECT 5344]
Length = 247
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/226 (21%), Positives = 93/226 (41%), Gaps = 44/226 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + G+ L N+ + R RG L Y+ GQ+SA YL +
Sbjct: 44 AGRDLQAGGTWLGVTAQGRFAALTNIRDPRQKAGPRSRGALTADYLLGQESAPAYLDRIM 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH-----VY 136
++ E + GF++++ + + ++HF N QE + +Y
Sbjct: 104 RDATE--YAGFNLLVGD----AQQLWHF-----------------NSQEGQPRQLESGIY 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ ++ +E ++ + ++L++ + D + PD +
Sbjct: 141 GLSNASLDT-PWPKLLRARDALSERIDGVD-----EDLLEMLSDSSQPADHLLPDTGVGL 194
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + V I+S YGTR +++ + N E
Sbjct: 195 ATERLLS----------SVFIASATYGTRASTVLNLKQNGGWSITE 230
>gi|308471686|ref|XP_003098073.1| hypothetical protein CRE_11355 [Caenorhabditis remanei]
gi|308269414|gb|EFP13367.1| hypothetical protein CRE_11355 [Caenorhabditis remanei]
Length = 278
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 106/218 (48%), Gaps = 14/218 (6%)
Query: 18 MLLEVSQDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEE 75
+L V + V GTWL +GK+G+LL++ ++ N RG +V ++++ + +
Sbjct: 41 ILSGVDEKDVARGTWLGIDEHGKVGMLLSITQPVDSKHTNAPSRGGIVNEFLKSGSDSSQ 100
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
+ +LK + E FNGF V +E +S ++ ++ ++T K +K HV
Sbjct: 101 FFENLKGKAES--FNGFQFVGIERNPKS-GLFEVRSLTNQLVDQIEIT----KWNDKFHV 153
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
Y ++ PF+K FG + F+E + K + + + + + + ++ + +PD +I
Sbjct: 154 Y---SNSPPHIPFKKAEFGLKIFSESLEKTD-ELDIDEIQRRLFEIATCRTHCFPDDQI- 208
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
R + YK S S YGTR+H++I+VD
Sbjct: 209 RVQTGFPGDVYKPLTSIFVRFPESRRYGTRSHTLIIVD 246
>gi|168059636|ref|XP_001781807.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666714|gb|EDQ53361.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 275
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 108/221 (48%), Gaps = 22/221 (9%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWLA S +G+L + N G+++ P RG L +++ KS YL ++
Sbjct: 46 RDGEAGGTWLAISESGRLAFVTNFREPGKDT-PGAVSRGELPTLFLKSSKSPTAYLEEVA 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
++ +NGF++++ + + + + + SN P P V KQ + ++ N+
Sbjct: 105 ARADK--YNGFNLIVAD--MNTKEMAYLSN-RPRGEP------VEVKQVSPG-LHSLSNA 152
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ GKEK +++++ + +N + I +L+ + EI R +
Sbjct: 153 NLDT-PWPKVLRGKEKIEVLLSQYPDQVIPENCL--IDELLTDSTRA----EISRLPKTG 205
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ E ++ +S + V + YGTR+ ++I V F+E
Sbjct: 206 LSEAHEHAFSPIFVYWDNPPYGTRSMTVIAVHKTGQTTFYE 246
>gi|221042028|dbj|BAH12691.1| unnamed protein product [Homo sapiens]
Length = 252
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 81/169 (47%), Gaps = 24/169 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSNIAPLD----SPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L T + I ++ N D +P T YG
Sbjct: 111 MEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGT---------------YGL 155
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS 187
N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++
Sbjct: 156 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEA 202
>gi|229593188|ref|YP_002875307.1| hypothetical protein PFLU5818 [Pseudomonas fluorescens SBW25]
gi|229365054|emb|CAY53237.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25]
Length = 248
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 91/215 (42%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + +G+ L N+ + P + RG LV +++ G E+YL D+
Sbjct: 44 AGRDQEAGGTWLGVNADGRFAALTNIRDPHQPPARKSRGELVARFLNGSLPIEDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ N T K VYG N+
Sbjct: 104 GRSIE--YAGFNL----LVGTRDELWHYN------------ANHTEPTRLKAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K +E++ + + L+ + D ++ PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALSELLG----DPQPEALLGILSDPQTAPFADLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVNADG 225
>gi|70733183|ref|YP_262956.1| hypothetical protein PFL_5898 [Pseudomonas protegens Pf-5]
gi|68347482|gb|AAY95088.1| conserved hypothetical protein [Pseudomonas protegens Pf-5]
Length = 248
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/213 (22%), Positives = 90/213 (42%), Gaps = 33/213 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ + P R RG LV +++ G ++YL D
Sbjct: 42 IHAGRDLEAGGTWLGVGAEGRFAALTNIRNPHQPPARRSRGELVARFLAGDVPLDDYLRD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + E + GF+++L + ++H++ P Q + VYG
Sbjct: 102 VVGRSVE--YAGFNLLL----GNAEQLWHYNARDP------------QPQRLEPGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + E+++ + Q L+ + D ++ PD + E
Sbjct: 144 NAGLDT-PWPKLLKARAALQEVLH----DPQPQALLALLSDPQTAPFADLPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ + V I+S YGTR + ++V
Sbjct: 199 SLLS----------SVFIASPSYGTRASTALIV 221
>gi|423694136|ref|ZP_17668656.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
SS101]
gi|387999012|gb|EIK60341.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
SS101]
Length = 248
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 91/215 (42%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + +G+ L N+ + P + RG LV +++ G ++YL D+
Sbjct: 44 AGRDQEAGGTWLGVNADGRFAALTNIRDPHQPPARKSRGELVARFLSGSLPIDDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ N T K VYG N+
Sbjct: 104 GRSIE--YAGFNL----LVGTRDELWHYN------------ANDTEPTLLKAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K F+ ++ E + L++ + D + PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAAFSAVLE----NPEPEALLEILSDPQTAPFAELPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVNADG 225
>gi|398975924|ref|ZP_10685950.1| hypothetical protein PMI24_02066 [Pseudomonas sp. GM25]
gi|398139916|gb|EJM28904.1| hypothetical protein PMI24_02066 [Pseudomonas sp. GM25]
Length = 248
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/217 (21%), Positives = 92/217 (42%), Gaps = 33/217 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P R RG LV ++ G S ++YL+D
Sbjct: 42 VHAGRDLEAGGTWLGLGANGRFAALTNIRDPHQPPARRSRGELVAGFLTGNLSIDDYLSD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + E + GF+++L + ++HF+ + + VYG
Sbjct: 102 VVARSPE--YAGFNLLL----GNANELWHFN------------ARSSEPVMLQPGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ K + +++ + + L+ + D ++ PD + E
Sbjct: 144 NAGLDT-PWPKLLKAKTALSAVLD----DPQPERLLALLSDAQTAPFADLPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + V I+S YGTR + ++V +
Sbjct: 199 SLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|242074008|ref|XP_002446940.1| hypothetical protein SORBIDRAFT_06g025430 [Sorghum bicolor]
gi|241938123|gb|EES11268.1| hypothetical protein SORBIDRAFT_06g025430 [Sorghum bicolor]
Length = 270
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 53/224 (23%), Positives = 105/224 (46%), Gaps = 26/224 (11%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +GKL L NV +S + RG L V++++G EY T++ K
Sbjct: 48 GRDELGGGTWMGCTRDGKLAFLTNVREPSSLIGAKTRGQLPVRFLQGSHGPLEYATEIAK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E ++ +NGF+++L ++ S + + SN P P V HV ++
Sbjct: 108 EADQ--YNGFNLILADVN--SGTMVYISN-RPGGDP------VIQTVAPGLHVL---SNA 153
Query: 143 CTSQPFQKVI-FGKEKFAEIVNKFNRKSESQNLIQNI-LDLMKNKQSNYPDPEIDRKAEA 200
P+ K + G+ + + ++ +++++ + +D + +S PD +D + E
Sbjct: 154 AIDSPWPKAMRLGQSFEGYLATHDDAEASLKHMVEELMMDTARPDRSMVPDTGVDPEWE- 212
Query: 201 DMDEDYKMRYSRVCVDIS--SIFYGTRTHSIILVDHNNVMDFHE 242
YK+ S + +D + YGTR+ + + + F+E
Sbjct: 213 -----YKL--SSIFIDTTKEQAQYGTRSMAALAAKLKGEVAFYE 249
>gi|387896201|ref|YP_006326498.1| hypothetical protein PflA506_5110 [Pseudomonas fluorescens A506]
gi|387160118|gb|AFJ55317.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
A506]
Length = 248
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 90/215 (41%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + +G+ L N+ + P + RG LV +++ G ++YL D+
Sbjct: 44 AGRDQEAGGTWLGVNADGRFAALTNIRDPHQPPARKSRGELVARFLSGSLPIDDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ N T K VYG N+
Sbjct: 104 GRSIE--YAGFNL----LVGTRDELWHYN------------ANDTEPTLLKAGVYGLANA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K F+ ++ E + L+ + D + PD + E+
Sbjct: 146 GLDT-PWPKLVKAKAAFSAVLE----NPEPEALLGILSDPQTAPFAELPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVNADG 225
>gi|91788385|ref|YP_549337.1| hypothetical protein Bpro_2522 [Polaromonas sp. JS666]
gi|91697610|gb|ABE44439.1| protein of unknown function DUF833 [Polaromonas sp. JS666]
Length = 295
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 103/243 (42%), Gaps = 36/243 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTWL + +G++ L NV G R RG LV++++E A +++ L+K
Sbjct: 48 RDLRAGGTWLGMTPDGRVAFLTNVREGYPQAAAPRSRGELVMRWLEANCEAGDFIRVLEK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++V+ + Q +N A L + Q +YG N+
Sbjct: 108 DGA--AYGGFNLVMGD--FQRDAWTWVTNKAALG---------WHAQPLAAGIYGLSNAA 154
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ----SNYPDPEIDRKA 198
+ P+ K + K+ A ++ + + L + + N+Q ++ PD +
Sbjct: 155 LDT-PWPKTVELKQTLATALSTTPAAHQPEVLQAPLWTALGNRQRAPLASLPDTGVPLVI 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIIL---VDHNNVMDFHEWTLDYETKKWIHT 255
EA + + VD YGTR+ +++L DH + W L E + T
Sbjct: 214 EAALSSAF--------VDFPENAYGTRSSTVLLASAADHGD--GVRRWDLQLEER----T 259
Query: 256 HIR 258
H+R
Sbjct: 260 HVR 262
>gi|291288498|ref|YP_003505314.1| hypothetical protein Dacet_2600 [Denitrovibrio acetiphilus DSM
12809]
gi|290885658|gb|ADD69358.1| protein of unknown function DUF833 [Denitrovibrio acetiphilus DSM
12809]
Length = 250
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 98/225 (43%), Gaps = 33/225 (14%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
D GGTW+ S NG+L L N + +P+ RG LV +++ +A Y L +
Sbjct: 47 DKYAGGTWMGISENGRLAALTNYRNPADMQPHRASRGKLVYDFLKSSLNATSYSGVLTET 106
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
++ +NG++++ ++ N+ ++SN K+ +V E +YG N
Sbjct: 107 AKQ--YNGYNLIFGDVD----NLCYYSN---------KIGDVL---ELNTGIYGLSN-HL 147
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
P+ KV GK K ++N R+ +++L+ + D P I + E +
Sbjct: 148 LDTPWSKVTNGKTKLRSVIN---REFFTEDLLSMMHDETIAPDDQLPSTGISSEKEKMLS 204
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
+ I S YGTR S+ILVD + F E + + E
Sbjct: 205 PMF----------IKSAEYGTRCTSVILVDRAGAVTFTERSFNSE 239
>gi|398939731|ref|ZP_10668824.1| hypothetical protein PMI27_02601 [Pseudomonas sp. GM41(2012)]
gi|398164053|gb|EJM52202.1| hypothetical protein PMI27_02601 [Pseudomonas sp. GM41(2012)]
Length = 248
Score = 60.5 bits (145), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 48/217 (22%), Positives = 89/217 (41%), Gaps = 33/217 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G S ++YL D
Sbjct: 42 VHAGRDLEAGGTWLGVGANGRFAALTNIRDPHQPPARKSRGELVARFLTGDMSIDDYLAD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + E + GF++ L + ++HF+ T VYG
Sbjct: 102 VVGRSLE--YAGFNL----LVGNANELWHFN------------ARETEAVMLPPGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + E+++ + Q L+ + D ++ PD + E
Sbjct: 144 NAGLDT-PWPKLLKARAALEEVLD----DPQPQALLALLNDPQTAPFADLPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V +
Sbjct: 199 TLLS----------SVFIASPTYGTRASTALIVQADG 225
>gi|359074996|ref|XP_003587243.1| PREDICTED: uncharacterized protein LOC100848859 [Bos taurus]
Length = 287
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 62/119 (52%), Gaps = 6/119 (5%)
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
YG N+ + P++K+ FGK+ F E V + R+ L+ +L ++ N ++ PDP I
Sbjct: 163 TYGLSNALLET-PWRKLCFGKQLFLEAVER-GRELPRDALVAQLLAVLSNDEAQLPDPAI 220
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT-LDYETKKW 252
+ + + +Y+ VCV YGTRT ++ILVD + + F E + L + +W
Sbjct: 221 EAQGR-EYVRPILSKYAAVCVRCPD--YGTRTSTVILVDADGHVTFTERSMLGSDPTRW 276
>gi|398955014|ref|ZP_10676246.1| hypothetical protein PMI26_04024 [Pseudomonas sp. GM33]
gi|398151758|gb|EJM40297.1| hypothetical protein PMI26_04024 [Pseudomonas sp. GM33]
Length = 248
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 49/222 (22%), Positives = 92/222 (41%), Gaps = 43/222 (19%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G+ + ++YL D
Sbjct: 42 VHAGRDLEAGGTWLGVGANGRFAALTNIRDPHRPPGRKSRGELVARFLTGELTIDDYLAD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DH 134
+ + E + GF++ L S ++HF N++E +
Sbjct: 102 VVDRSLE--YGGFNL----LVGNSHELWHF-----------------NQREAEAVMLAPG 138
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
VYG N+ + P+ K++ + +E++ + Q L+ + D + PD +
Sbjct: 139 VYGLSNAGLDT-PWPKLLKARAALSEVLA----DPQPQALLALLSDAQTAPFAELPDTGV 193
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 194 GLATETLLS----------SVFIASQSYGTRASTALVVQADG 225
>gi|399008052|ref|ZP_10710542.1| hypothetical protein PMI20_03450 [Pseudomonas sp. GM17]
gi|398118862|gb|EJM08587.1| hypothetical protein PMI20_03450 [Pseudomonas sp. GM17]
Length = 248
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/211 (21%), Positives = 88/211 (41%), Gaps = 33/211 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL +G+ L N+ + P R RG LV +++ + S ++Y +D+
Sbjct: 44 AGRDLEAGGTWLGLGADGRFAALTNIRNPHQPPARRSRGELVARFLSSKMSIDDYFSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ Q+ VYG N+
Sbjct: 104 GRSLE--YAGFNL----LVGTPDQLWHYN------------AQAREPQQLGSGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K E+++ + Q L+ + D ++ PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALQEVLH----DPQPQALLALLSDTQTAPFADLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 201 LS----------SVFIASPSYGTRASTALIV 221
>gi|312963679|ref|ZP_07778160.1| protein of unknown function (DUF833) superfamily [Pseudomonas
fluorescens WH6]
gi|311282188|gb|EFQ60788.1| protein of unknown function (DUF833) superfamily [Pseudomonas
fluorescens WH6]
Length = 248
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 92/215 (42%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + G+ L N+ + P + RG LV +++ G S E+YL D+
Sbjct: 44 AGRDQEAGGTWLGVNAEGRFAALTNIRDPHQPPARKSRGELVARFLSGSLSIEDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ + T + K VYG N+
Sbjct: 104 GRSIE--YAGFNL----LVGTQDELWHYN------------AHHTEPTQLKPGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K +E++ + + L++ + D + PD + E+
Sbjct: 146 GLDT-PWPKLLKTKAALSELLA----DPQPEALLEILSDPQIAPFAELPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPTYGTRASTALIVNADG 225
>gi|77461597|ref|YP_351104.1| hypothetical protein Pfl01_5376 [Pseudomonas fluorescens Pf0-1]
gi|77385600|gb|ABA77113.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
Length = 248
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/217 (21%), Positives = 93/217 (42%), Gaps = 33/217 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P R RG LV ++ G S ++YL+D
Sbjct: 42 VHAGRDLEAGGTWLGLGANGRFAALTNIRDPHQPPARRSRGELVAGFLTGNLSIDDYLSD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + + + GF+++L + ++HF + + + Q VYG
Sbjct: 102 VVARSPD--YAGFNLLL----GNANELWHF---------NARSSEPVMLQPG---VYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ K + +++ + + L+ + D ++ PD + E
Sbjct: 144 NAGLDT-PWPKLLKAKAALSAVLD----DPQPERLLALLSDAQTAPFADLPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + V I+S YGTR + ++V +
Sbjct: 199 SLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|341899638|gb|EGT55573.1| hypothetical protein CAEBREN_06787 [Caenorhabditis brenneri]
Length = 276
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 111/231 (48%), Gaps = 33/231 (14%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAE 74
E+L V + GTW + NG++G++L++ ++ P+ RG +V Y++ ++
Sbjct: 40 EILSGVDEQNEARGTWTGLNRNGRIGMMLSITQTQDSKNPDAPSRGGIVNSYLKAADTS- 98
Query: 75 EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH 134
+ L K+ +++NGF +V LE + +Y L + + + N + E D
Sbjct: 99 SMMNKLAKDA--SMYNGFQLVALEQN--NHGLYQ------LQTLTNQQVNDISICEWDDE 148
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
+ NS S+P+QK + GK E + + N+ + Q + + + ++ YPD ++
Sbjct: 149 YHVISNSP-PSKPYQKAVHGKRLLRERIQESNKMTVDQ-IFDELFVIATDETQCYPDTQL 206
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIF---------YGTRTHSIILVDHNN 236
+ + + DE Y+R +S+IF YGTR H+++ +D+ N
Sbjct: 207 QFQTQ-NTDE-----YNR---PLSAIFIRYPEGTRQYGTRCHTLLTIDNEN 248
>gi|421749849|ref|ZP_16187207.1| hypothetical protein B551_24014 [Cupriavidus necator HPC(L)]
gi|409771210|gb|EKN53584.1| hypothetical protein B551_24014 [Cupriavidus necator HPC(L)]
Length = 280
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 102/237 (43%), Gaps = 31/237 (13%)
Query: 17 EMLLEVSQDAVKG--GTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSA 73
E+L Q V G GTW+ + G+ L N + R + R RG LV ++ G+ S
Sbjct: 41 EVLAGRDQADVIGEPGTWMGVTAGGRFAALTNYRAPSEKRTDARSRGELVAGFLRGEASP 100
Query: 74 EEYLTDLKKET---EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQE 130
E+Y+ DL + +NGF+++ +L ++ +SN +P +P Q
Sbjct: 101 EQYVGDLARGAGIRSPGAYNGFNLLAGDL----DALWWYSNRSPDGAP----------QR 146
Query: 131 NKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNY 189
K +YG N+ + P+ KV + + + + + ++ L L+ + +
Sbjct: 147 LKPGLYGLSNALLDT-PWPKV---RSRVGALAETLAADTGAPGASVEPYLALLADTRQAC 202
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
D E+ + A E K+ S I S YGTR +++ V H+ D E + D
Sbjct: 203 -DFELPQTGVAPEWE--KLLSSAF---IRSPSYGTRASTVLRVRHDGRFDLSERSFD 253
>gi|121533903|ref|ZP_01665729.1| protein of unknown function DUF833 [Thermosinus carboxydivorans
Nor1]
gi|121307414|gb|EAX48330.1| protein of unknown function DUF833 [Thermosinus carboxydivorans
Nor1]
Length = 265
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 104/229 (45%), Gaps = 32/229 (13%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEE 75
+ +L +D + GTWL + +G+L L N S RP+ RG LV Y+ ++ +
Sbjct: 39 QPMLLAGRDLEQMGTWLGITRSGRLAALTNYRDPASYRPDALSRGKLVSGYLGSEQQPRD 98
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
YL + ++ E +NGF++ L +T ++++SN KVT V +
Sbjct: 99 YLAAVDRQANE--YNGFNL----LVGDATGLWYYSNR------HNKVTAVAPG------I 140
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
+G N + + KV G K A + + K + N+ D++ ++Q P P+ +
Sbjct: 141 HGLSNHLLDTA-WPKVAKGCHKLAACLAADDVKED------NLWDILADRQ---PAPDHE 190
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT 244
A E + + + I + YGTR +++L+D+ + F E T
Sbjct: 191 LPATGVSRE---LERALSPIFIVTPDYGTRAGTLLLIDYAGNVRFIERT 236
>gi|398924478|ref|ZP_10661240.1| hypothetical protein PMI28_00838 [Pseudomonas sp. GM48]
gi|398173354|gb|EJM61191.1| hypothetical protein PMI28_00838 [Pseudomonas sp. GM48]
Length = 248
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 91/220 (41%), Gaps = 43/220 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + P + RG LV +++ G+ ++YL+D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPHQPPGRKSRGELVARFLTGELPIDDYLSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKD-----HVY 136
+ E + GF++ L + ++HF N +E++ V+
Sbjct: 104 ARSSE--YGGFNL----LVGNANELWHF-----------------NARESEPVMLPAGVH 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ K E+++ + Q L+ + D + PD +
Sbjct: 141 GLSNAGLDT-PWPKLLKAKAALREVLD----DPQPQALLALLSDSQPAPVAELPDTGVGL 195
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 196 ATETLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|395799490|ref|ZP_10478771.1| hypothetical protein A462_29525 [Pseudomonas sp. Ag1]
gi|421141005|ref|ZP_15600999.1| hypothetical protein MHB_16746 [Pseudomonas fluorescens BBc6R8]
gi|395336594|gb|EJF68454.1| hypothetical protein A462_29525 [Pseudomonas sp. Ag1]
gi|404507776|gb|EKA21752.1| hypothetical protein MHB_16746 [Pseudomonas fluorescens BBc6R8]
Length = 248
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 93/215 (43%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL +G+ L N+ + P + RG LV +++ G S ++YL D+
Sbjct: 44 AGRDQEAGGTWLGVGADGRFAALTNIRDPHQPPARKSRGELVARFLSGSLSIDDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ A +P+ V VYG N+
Sbjct: 104 GRSIE--YAGFNL----LVGTRDELWHYN--AKESAPTPLVAG----------VYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K E+++ + + L+ + D ++ PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALTEVLD----DPQPEALLGILSDPQTAPFADLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVNADG 225
>gi|398855398|ref|ZP_10611893.1| hypothetical protein PMI37_06099 [Pseudomonas sp. GM80]
gi|398231794|gb|EJN17776.1| hypothetical protein PMI37_06099 [Pseudomonas sp. GM80]
Length = 248
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/217 (21%), Positives = 88/217 (40%), Gaps = 33/217 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ P + RG LV +++ G S ++YL+D
Sbjct: 42 VHAGRDLEAGGTWLGIGANGRFAALTNIRDPQQPPARKSRGELVARFLTGDLSIDDYLSD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E + GF+++L S ++HF+ ++ + VYG
Sbjct: 102 VVGRASE--YAGFNLLL----GNSHELWHFN------------ARLSEPIMLEPGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + +++ + L+ + D + PD + E
Sbjct: 144 NAGLDT-PWPKLLKARAALGAVLD----EPTPARLLGLLSDAQTAPSAELPDTGVGMATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V +
Sbjct: 199 TLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|281353793|gb|EFB29377.1| hypothetical protein PANDA_009400 [Ailuropoda melanoleuca]
Length = 186
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 81/169 (47%), Gaps = 24/169 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L + R RG LV ++ + YL K
Sbjct: 37 EEGKEGGTWLGISTRGKLAALTNYLQPRQDRDARGRGELVTHFLTTDMDSLSYLK--KVS 94
Query: 84 TEENVFNGFHIVLLELTLQSTNI--YHFSNIAP---LDSPSTKVTNVTNKQENKDHVYGF 138
E +++NGF+++ +L+ + ++ Y+ + P + +P T YG
Sbjct: 95 AEGHLYNGFNLIAADLSTEKGDVICYYGNRGEPEPIVLAPGT---------------YGL 139
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS 187
N+ + P++K+ FGK+ F E V + ++ LI +L ++ N ++
Sbjct: 140 SNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIAELLHVLNNDEA 186
>gi|330505462|ref|YP_004382331.1| hypothetical protein [Pseudomonas mendocina NK-01]
gi|328919748|gb|AEB60579.1| hypothetical protein MDS_4548 [Pseudomonas mendocina NK-01]
Length = 247
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 88/220 (40%), Gaps = 44/220 (20%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL +T G+ L N+ + R RG L Y++GQ SA YL +
Sbjct: 44 AGRDLQAGGTWLGITTQGRFAALTNIRDLRQKAGPRSRGALTADYLQGQDSAPAYLDRIM 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH-----VY 136
++ E + GF++++ + ++HF N QE + +Y
Sbjct: 104 RDAAE--YPGFNLLVGD----RHQLWHF-----------------NSQERRPRQLERGIY 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ K+ E + ++E + L+ + D PD +
Sbjct: 141 GMSNASLDT-PWPKLLSAKDALRERI-----EAEDEALLALLADRSPPADHLLPDTGVGL 194
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + + I YGTR +++ +D +
Sbjct: 195 ATERLLSTTF----------IIGEAYGTRASTVLNLDRDG 224
>gi|297801914|ref|XP_002868841.1| hypothetical protein ARALYDRAFT_912278 [Arabidopsis lyrata subsp.
lyrata]
gi|297314677|gb|EFH45100.1| hypothetical protein ARALYDRAFT_912278 [Arabidopsis lyrata subsp.
lyrata]
Length = 273
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 9/110 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D V GGTWL + +G+L L N +S P+ + RG L ++Y++ +KS E+ +++ E
Sbjct: 45 RDLVGGGTWLGCTRHGRLAFLTNFREASSFPDAKSRGDLPIRYLQSRKSPAEFAEEIQDE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIY------HFSNIAPLDSPSTKVTNVTN 127
+++NGF++V+ + L + IY H +A SP V + N
Sbjct: 105 I--SLYNGFNLVVAHV-LSKSMIYITNRPPHGHKLATQVSPGIHVLSNAN 151
>gi|409395697|ref|ZP_11246758.1| hypothetical protein C211_09893 [Pseudomonas sp. Chol1]
gi|409119634|gb|EKM96010.1| hypothetical protein C211_09893 [Pseudomonas sp. Chol1]
Length = 251
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 96/223 (43%), Gaps = 39/223 (17%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL ++G+ L N+ +P G R RG LV +Y+ G+ + +YL L
Sbjct: 45 GRDLQAGGTWLGVRSDGRFAALTNIR-IGGQPTGRRSRGELVERYLRGELAPADYLAGLS 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
++ + GF++ L + ++HF+ + + + VYG N+
Sbjct: 104 AGIDD--YAGFNL----LVGTARELWHFN------------SQSAAPRRLEAGVYGLCNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K+ + A + +++++L+Q + + PD E+
Sbjct: 146 DLDT-PWPKLRRARTALASQLE----LADTESLLQ-----LLDDHERAPDAELPSTG--- 192
Query: 202 MDEDYKMRYSRV--CVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + R+ V I+S YGTR + +L N +D HE
Sbjct: 193 ----VPLEWERLLSSVFIASTEYGTRASTALLRWQNGALDIHE 231
>gi|328771782|gb|EGF81821.1| hypothetical protein BATDEDRAFT_34595 [Batrachochytrium
dendrobatidis JAM81]
Length = 914
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 104/233 (44%), Gaps = 42/233 (18%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR--PNGRDRGPLVVKYVEGQKSAEE------------ 75
GTW+ + G+ + N S + RG LV Y+ E
Sbjct: 91 GTWIGMTQQGRFAFITNFREPPSSRCQTAKSRGLLVRDYLLESHPVNEKETSTTSSDPSL 150
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
Y+ L + EE +NGF++VL ++ +++ N + D P + ++ + Q +
Sbjct: 151 YVQKLSNQLEE--YNGFNLVLGQV---GGKMWYVGNRS--DKP---IQHLASTQ-----L 195
Query: 136 YGFGNSQC---TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDP 192
YG N T + + KVI GK+ F + +++ K Q+L+Q +L ++ N ++
Sbjct: 196 YGISNGILMSDTKETWPKVIHGKQLFTKALDEATDK---QDLVQRLLVVLSNSEAC---- 248
Query: 193 EIDRKAEADMDEDYKMRYSRVCVDISSIF---YGTRTHSIILVDHNNVMDFHE 242
D+ D + + S +CV S + Y TRTH+IILVDH+N F E
Sbjct: 249 PYDQLPPNMFDYNLESCLSPICVHCSQLARGQYATRTHTIILVDHDNHATFVE 301
>gi|410091419|ref|ZP_11287983.1| hypothetical protein AAI_12094 [Pseudomonas viridiflava UASWS0038]
gi|410093381|ref|ZP_11289866.1| hypothetical protein AAI_21657 [Pseudomonas viridiflava UASWS0038]
gi|409759246|gb|EKN44486.1| hypothetical protein AAI_21657 [Pseudomonas viridiflava UASWS0038]
gi|409761274|gb|EKN46360.1| hypothetical protein AAI_12094 [Pseudomonas viridiflava UASWS0038]
Length = 248
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 103/240 (42%), Gaps = 39/240 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTW+ G+ + N+ R RG LV K++ G++S E+Y+ ++
Sbjct: 44 AGRDLEAGGTWMGIGALGRFAAVTNIRDPGQTLGLRSRGELVAKFLAGEQSPEDYVAEMA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E + GF++ L + +Y+ S+ TN T + + VYG N+
Sbjct: 104 ARASE--YTGFNL----LAGDAVQLYYLSS-----------TNPTPRLLG-EGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K + A + + E +LDL+K+ Q A+AD
Sbjct: 146 GLNT-PWPKLL--KARAALTAQLDDPRPE------RLLDLLKDPQP---------AADAD 187
Query: 202 MDE-DYKMRYSRV--CVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIR 258
+ E + ++ V I+S YGTR ++++V+ + E + + + +R
Sbjct: 188 LPETGVGLATEKLLSSVFIASPNYGTRASTVLIVNADGTGQMVEHSFGPQGGRLGEVELR 247
>gi|424925479|ref|ZP_18348840.1| hypothetical protein I1A_004966 [Pseudomonas fluorescens R124]
gi|404306639|gb|EJZ60601.1| hypothetical protein I1A_004966 [Pseudomonas fluorescens R124]
Length = 248
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/217 (20%), Positives = 90/217 (41%), Gaps = 33/217 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G S ++YL D
Sbjct: 42 VHAGRDLEAGGTWLGIGANGRFAALTNIRDPHQPPARKSRGELVARFLTGDASIDDYLRD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E + GF+++L + ++HF+ + + +YG
Sbjct: 102 VVARAPE--YAGFNLLL----GNAHELWHFN------------ARTSEPVMLEPGIYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ K + +++ + L++ + D + PD + E
Sbjct: 144 NAGLDT-PWPKLLKAKAALSAVLD----DPQPARLLELLGDAQTAPFAELPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + V I+S YGTR + ++V +
Sbjct: 199 SLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|395762332|ref|ZP_10443001.1| hypothetical protein JPAM2_11385 [Janthinobacterium lividum PAMC
25724]
Length = 245
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/226 (22%), Positives = 97/226 (42%), Gaps = 33/226 (14%)
Query: 22 VSQDAVKGGTWL----ATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEY 76
+D GG+W+ A S + + + N+ R P+ RG LV Y+ G S ++Y
Sbjct: 44 AGRDLQAGGSWMGITQAGSGSSRFAAITNIRSPQERNPDAPSRGALVADYLAGTLSPQDY 103
Query: 77 LTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
+ ++ + +NGF++VL + + + SN D+ N Q + +Y
Sbjct: 104 IAQIRPGCK--AYNGFNLVLGD----AHTLIWLSNRGDDDA--------RNGQPLQPGIY 149
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ KV+ K +FA ++ + + + D + PD +
Sbjct: 150 GLSNALLDA-PWPKVLKTKAQFASLLCQ---GAPDDAYFDMLADTTRAPDFRLPDTGVPI 205
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E ++ S VC++ YGTRT +++ + N+ + HE
Sbjct: 206 DLEREL--------SAVCIETPG--YGTRTSTVVKLFPNSPGELHE 241
>gi|389683011|ref|ZP_10174345.1| protein of unknown function, DUF833 family [Pseudomonas
chlororaphis O6]
gi|388553218|gb|EIM16477.1| protein of unknown function, DUF833 family [Pseudomonas
chlororaphis O6]
Length = 248
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/211 (21%), Positives = 87/211 (41%), Gaps = 33/211 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL +G+ L N+ + P R RG LV +++ + S ++Y +D+
Sbjct: 44 AGRDLEAGGTWLGLGADGRFAALTNIRNPHQPPARRSRGELVARFLSNKMSIDDYFSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L ++H++ Q+ VYG N+
Sbjct: 104 GRSLE--YAGFNL----LIGTPDQLWHYN------------AQAREPQQLDSGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K E+++ Q L+ + D ++ PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALQEVLH----DPHPQALLALLSDAQTAPFADLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 201 LS----------SVFIASPSYGTRASTALIV 221
>gi|325914206|ref|ZP_08176558.1| hypothetical protein XVE_0424 [Xanthomonas vesicatoria ATCC 35937]
gi|325539590|gb|EGD11234.1| hypothetical protein XVE_0424 [Xanthomonas vesicatoria ATCC 35937]
Length = 255
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/225 (21%), Positives = 93/225 (41%), Gaps = 39/225 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ ++G++ ++ NV + +GR RG L+ Y+ G A Y DL
Sbjct: 45 AGRDLRSGGSWVGLGSDGRVAVVTNVRDPLATASGRSRGHLIADYLSGSLGAAGYAHDLA 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E F F+++L + + H SN PL ++ + ++G N
Sbjct: 105 GAGHE--FPPFNLLLCD----AERCEHLSNHPPL------------ARQLEPGIHGMSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS----NYPDPEIDRK 197
+ P+ K A +++ + ++ N +Q + D + N + + P +D
Sbjct: 147 PLDT-PWPKT----RALAGVLHDWC--AQDSNDLQPLWDALGNPATAPDQSLPSTGVDLS 199
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I+ YGTR +I+ +DH+ HE
Sbjct: 200 TERLLSTAF----------IAGPSYGTRASTIVALDHHGGGFIHE 234
>gi|109900315|ref|YP_663570.1| hypothetical protein Patl_4017 [Pseudoalteromonas atlantica T6c]
gi|109702596|gb|ABG42516.1| protein of unknown function DUF833 [Pseudoalteromonas atlantica
T6c]
Length = 255
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 106/237 (44%), Gaps = 45/237 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ +T GKL L N+ + + N RG LV +Y++ ++ YL+ +
Sbjct: 44 AGKDLRAGGTWMGINTQGKLAALTNIRDPKRNLANAVSRGELVSQYLQHSPRSDAYLSKM 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ E +NGF+++ + N++ F+N + + VYG N
Sbjct: 104 LADAEN--YNGFNLLYGDW----NNLHVFNN------------HTLQSERLTTGVYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
++ + P+ K + G + E + ++++ L+KN+ KA+
Sbjct: 146 AKLNT-PWPKTLHGMAQLTEYCQNPHTLE-----VESLFALLKNQT----------KAQD 189
Query: 201 DM--DEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTLDYETK 250
D+ + + R +SSIF YGTR+ +++LVD + + E T ++E +
Sbjct: 190 DILPQTGVPIEWER---KLSSIFIQSPEYGTRSSTLLLVDKQQRVSWQERTFNHEAQ 243
>gi|134298460|ref|YP_001111956.1| hypothetical protein Dred_0589 [Desulfotomaculum reducens MI-1]
gi|134051160|gb|ABO49131.1| protein of unknown function DUF833 [Desulfotomaculum reducens MI-1]
Length = 263
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 102/228 (44%), Gaps = 32/228 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D +GGTW+ +T G L N ++ P RG LV KY+ S E YL +L
Sbjct: 51 AGRDLEQGGTWMGLTTTGCFAALTNYRDPVHNNPQAPSRGYLVHKYLNSDVSPEYYLKNL 110
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E +NGF+ LL T Q+ IY++SN + K+ N +YG N
Sbjct: 111 PNGGAE--YNGFN--LLVGTTQA--IYYYSN---REKVIRKIAN---------GIYGLSN 152
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
++P+ KV GK+ A+ + K + + +M +++ PE +
Sbjct: 153 G-FLNEPWPKVSKGKKALADCLQGQEIKKDQ------LFKIMADQE----QPEDCELPQT 201
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
+ +++ SR+ I S YGTR+ ++++VD + F E + E
Sbjct: 202 GVSLEWERLLSRIF--IVSPCYGTRSSTVLMVDRKGHVQFWERSFTME 247
>gi|413919268|gb|AFW59200.1| ser/Thr-rich protein T10 in DGCR region [Zea mays]
Length = 270
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 100/224 (44%), Gaps = 26/224 (11%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +GKL L NV +S + RG L V++++G EY T++ K
Sbjct: 48 GRDELGGGTWMGCTRDGKLAFLTNVREPSSLIGAKTRGQLPVRFLQGSLCPLEYATEIAK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E ++ +NGF+++L ++ S + + SN P P V HV N+
Sbjct: 108 EADQ--YNGFNLILADVN--SGTMVYISN-RPGGDP------VIQTVAPGLHV--LSNAA 154
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
S P+ K + + F + + S Q + + ++D + +S PD
Sbjct: 155 INS-PWPKAMRLGQSFKRYLTIHDDAEASLKQMVEELMMDTARPDRSMVPD--------T 205
Query: 201 DMDEDYKMRYSRVCVDIS--SIFYGTRTHSIILVDHNNVMDFHE 242
D +++ + S + +D + YGTR+ + + F+E
Sbjct: 206 GDDPEWEYKLSSIFIDTAKEQARYGTRSMVALAAKLEGEVTFYE 249
>gi|378948184|ref|YP_005205672.1| hypothetical protein PSF113_0246 [Pseudomonas fluorescens F113]
gi|359758198|gb|AEV60277.1| Hypothetical protein PSF113_0246 [Pseudomonas fluorescens F113]
Length = 260
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/213 (21%), Positives = 87/213 (40%), Gaps = 33/213 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ P + RG LV +++ G S +YL +
Sbjct: 54 VHAGRDLEAGGTWLGIGAEGRFAALTNIRDPGQLPAFKSRGELVARFLTGNLSIADYLGE 113
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E + GF+++L + T ++H + T Q + +YG
Sbjct: 114 VVPRASE--YGGFNLLLGD----GTELWHHN------------ARDTQPQRLGEGLYGLS 155
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + +E+++ + Q L+ + D + PD + E
Sbjct: 156 NAGLNT-PWPKLLKARAALSEVLD----DPQPQALLALLNDPQPAPVAELPDTGVGVATE 210
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 211 TLLS----------SVFIASPAYGTRASTALIV 233
>gi|395495449|ref|ZP_10427028.1| hypothetical protein PPAM2_05251 [Pseudomonas sp. PAMC 25886]
Length = 248
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 92/215 (42%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL +G+ L N+ + P + RG LV +++ G ++YL D+
Sbjct: 44 AGRDQEAGGTWLGVGADGRFAALTNIRDPHQPPARKSRGELVARFLSGSLPIDDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF+ LL TL + ++H++ N + + VYG N+
Sbjct: 104 GRSIE--YAGFN--LLVGTLDA--LWHYN------------ANESAPTRLEAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K E+++ + + L+ + D ++ PD + E+
Sbjct: 146 GLDT-PWPKLLKAKAALTEVLD----DPQPEALLGILSDPQTAPFADLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVSADG 225
>gi|406660240|ref|ZP_11068374.1| hypothetical protein B879_00378 [Cecembia lonarensis LW9]
gi|405556118|gb|EKB51087.1| hypothetical protein B879_00378 [Cecembia lonarensis LW9]
Length = 261
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 103/237 (43%), Gaps = 31/237 (13%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ + N +N R + RG LV ++EG+ + YL +++
Sbjct: 43 GKDLRAGGTWLGMHPNGRFAAITNFRDLKNKREFSKSRGDLVKNFLEGEMDPKSYLEEIE 102
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
KE E + GF++ L +Y+ SN K+ V ++ + +YG N+
Sbjct: 103 KEKTE--YEGFNL----LVADQEGLYYLSN---------KMEGV---RQLRPGLYGLSNA 144
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
P+ K+ KE A ++ + + L LMK + S DP+
Sbjct: 145 -VLETPWPKLTKAKENLAHHISTASFEP---------LSLMKGQHSKETDPQEILPDTGA 194
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIR 258
E K+ S +++ + +YGT +++L H+ +D E D + + T ++
Sbjct: 195 TPEQEKL-LSAQFINVGN-YYGTVNSTVLLWKHSGEVDMMERKFDQVAETFQDTQVK 249
>gi|407684060|ref|YP_006799234.1| hypothetical protein AMEC673_10825 [Alteromonas macleodii str.
'English Channel 673']
gi|407245671|gb|AFT74857.1| hypothetical protein AMEC673_10825 [Alteromonas macleodii str.
'English Channel 673']
Length = 260
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/238 (28%), Positives = 103/238 (43%), Gaps = 53/238 (22%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKY-----VEGQKSA 73
L QD GTW+ + NG + L NV N N RG LV + VE + A
Sbjct: 42 LLAGQDLEANGTWMGVTRNGYVAALTNVRDPHNINKNAVSRGELVANWLKHDSVEMKSVA 101
Query: 74 E-EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
+ +YL L+K +NG++++ ++ S +Y+ NV N
Sbjct: 102 QSQYLATLEKTRHR--YNGYNLLFGDI--NSLKVYN---------------NVNNSTHII 142
Query: 133 D-HVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLI--QNILDLMKNKQSNY 189
D VYG N+ + P+ KV G V N QN+I +++ D+++
Sbjct: 143 DTGVYGLSNADIAT-PWPKVTQG-------VTALNNYVTKQNIINDEDLFDILR------ 188
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHE 242
D KA+ + D + Y R +SSIF YGTRT +++LVD NN + + E
Sbjct: 189 ----YDNKADDALLPDTGVGYEREKA-LSSIFIQSEKYGTRTSTLLLVDKNNTLTWKE 241
>gi|388468911|ref|ZP_10143121.1| protein of unknown function, DUF833 family [Pseudomonas synxantha
BG33R]
gi|388012491|gb|EIK73678.1| protein of unknown function, DUF833 family [Pseudomonas synxantha
BG33R]
Length = 248
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/215 (21%), Positives = 89/215 (41%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + +G+ L N+ + P + RG LV +++ G ++YL D+
Sbjct: 44 AGRDQEAGGTWLGVNADGRFAALTNIRDPHQPPARKSRGELVARFLSGSLPIDDYLADVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + E + GF++ L ++H++ N K VYG N+
Sbjct: 104 RRSIE--YAGFNL----LVGTRDELWHYN------------ANDAEPTLLKAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K F+ ++ E + L+ + D + PD + E
Sbjct: 146 GLDT-PWPKLVKAKAAFSAVLE----NPEPKALLGILSDPQTAPFAELPDTGVGLATENL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASPSYGTRASTALIVNADG 225
>gi|398995207|ref|ZP_10698096.1| hypothetical protein PMI22_02706 [Pseudomonas sp. GM21]
gi|398130774|gb|EJM20108.1| hypothetical protein PMI22_02706 [Pseudomonas sp. GM21]
Length = 248
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/220 (22%), Positives = 89/220 (40%), Gaps = 43/220 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + P+ + RG LV Y+ G + ++YL D+
Sbjct: 44 AGRDQEAGGTWLGIGANGRFAALTNIRDPHQPPSRKSRGELVAGYLLGDQPIDDYLNDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DHVY 136
+ E + GF++ L + ++HF N +E++ VY
Sbjct: 104 ARSLE--YAGFNL----LIGNANELWHF-----------------NARESEAVMLAPGVY 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ + E++ + Q L+ + D + PD +
Sbjct: 141 GLSNAGLNT-PWPKLLKARAALEEVLG----DPQPQALLALLADSQAAPFAELPDTGVGL 195
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 196 ATETLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|410640982|ref|ZP_11351508.1| hypothetical protein GCHA_1744 [Glaciecola chathamensis S18K6]
gi|410645309|ref|ZP_11355775.1| hypothetical protein GAGA_1317 [Glaciecola agarilytica NO2]
gi|410135081|dbj|GAC04174.1| hypothetical protein GAGA_1317 [Glaciecola agarilytica NO2]
gi|410139547|dbj|GAC09695.1| hypothetical protein GCHA_1744 [Glaciecola chathamensis S18K6]
Length = 255
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 100/230 (43%), Gaps = 31/230 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + GKL L N+ + + RG LV Y++ ++EEYL +
Sbjct: 44 AGKDLRAGGTWMGINQQGKLSALTNIRDPQRIASDAISRGELVSDYLQRSPNSEEYLAQI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E+ +NGF+++ + ++Y F+N S K+T VYG N
Sbjct: 104 SANAEK--YNGFNLLFGDW----NDLYVFNNHT---LQSDKLTT---------GVYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K + G + E ++ ES+ L + + K K P + + E
Sbjct: 146 ASLNT-PWPKTLQGMTQLTEYCQD-SQSLESEQLFALLKNQTKAKDDTLPQTGVPIEWER 203
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+ + I S YGTR+ +++LVD+ + + E T + E +
Sbjct: 204 KLSSIF----------IQSPEYGTRSSTLLLVDNQQQVSWQERTFNNEAQ 243
>gi|398900503|ref|ZP_10649531.1| hypothetical protein PMI30_01396 [Pseudomonas sp. GM50]
gi|398180903|gb|EJM68477.1| hypothetical protein PMI30_01396 [Pseudomonas sp. GM50]
Length = 248
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 88/215 (40%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ P + RG LV +++ G S ++YL D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPLQPPARKSRGELVAQFLSGDMSIDDYLKDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L + +++F + + T Q +YG N+
Sbjct: 104 GRSPE--YAGFNL----LIGNANELWYF---------NARETEAVMLQPG---IYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ K +E++ + Q L+ + D + P+ + + E
Sbjct: 146 GLDT-PWPKVLKAKAALSEVLG----DPQPQALLALLSDPQTAPLAELPNTGVGQATETL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I S YGTR + ++V +
Sbjct: 201 LS----------SVFIKSPTYGTRASTALIVQADG 225
>gi|113867576|ref|YP_726065.1| hypothetical protein H16_A1565 [Ralstonia eutropha H16]
gi|113526352|emb|CAJ92697.1| uncharacterized conserved protein [Ralstonia eutropha H16]
Length = 275
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 95/224 (42%), Gaps = 30/224 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + +G+ L N + R + R RG LV ++ G+++ +YL L E+
Sbjct: 56 GTWMGVNADGRFAALTNYRAPSEKRTDARSRGELVAGFLRGREAPFDYLDGLAG--EDGR 113
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF+++ +L ++ +SN A P Q + +YG N+ + P+
Sbjct: 114 YNGFNLLASDL----RELWWYSNRAASRQP----------QRLRPGLYGLSNALLDT-PW 158
Query: 149 QKVIFGKEKFAEIV--NKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
KV AE++ + + ++ +Q + D + P + + E + +
Sbjct: 159 PKVRSRVGALAEVLAADSGQANASAEPYLQMLADERQAADFELPATGVAPEWEKLLSSAF 218
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
I S YGTR +++ V H+ D E + D + +
Sbjct: 219 ----------IRSPLYGTRASTVLRVRHDGRFDLSERSFDADGR 252
>gi|408420955|ref|YP_006762369.1| hypothetical protein TOL2_C35080 [Desulfobacula toluolica Tol2]
gi|405108168|emb|CCK81665.1| conserved uncharacterized protein, DUF833 [Desulfobacula toluolica
Tol2]
Length = 259
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 50/226 (22%), Positives = 102/226 (45%), Gaps = 33/226 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D +GGTW NG L N +S RP+ RG ++V +++ +K E + L
Sbjct: 44 AGKDLEQGGTWFGVHKNGTFAALTNYRDPSSIRPDAPSRGEIIVDFLKSKKPPETHFNHL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K++ +N +NGF+++ +I+ FSN+ ++ ++G N
Sbjct: 104 KEK--QNPYNGFNLLFG----SKDDIFWFSNLK------------NTIEKIAPGIHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV GK+ +I+ +++ ++ ++ P+ D+ +
Sbjct: 146 -RFLDTPWPKVESGKKALQDIIC-------GTITFESLFSILTDQSI----PDDDQLPQT 193
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ +++ S + I S YGTR+ +++L+D N ++ E T D
Sbjct: 194 GVGLEWERMLSSLF--IHSDTYGTRSSTVMLMDQNGTIEITERTYD 237
>gi|226499108|ref|NP_001148760.1| Ser/Thr-rich protein T10 in DGCR region [Zea mays]
gi|195621960|gb|ACG32810.1| Ser/Thr-rich protein T10 in DGCR region [Zea mays]
Length = 270
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 100/224 (44%), Gaps = 26/224 (11%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +GKL L NV +S + RG L V++++G EY T++ K
Sbjct: 48 GRDELGGGTWMGCTRDGKLAFLTNVREPSSLIGAKTRGQLPVRFLQGSLCPLEYATEIAK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E ++ +NGF+++L ++ S + + SN P P V HV ++
Sbjct: 108 EADQ--YNGFNLILADVN--SGTMVYISN-RPGGDP------VIQTVAPGLHVL---SNA 153
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ K + + F + + S Q + + ++D + +S PD
Sbjct: 154 AINSPWPKAMRLGQGFKRYLAIHDDAEASLKQMVEELMMDTARPDRSMVPD--------T 205
Query: 201 DMDEDYKMRYSRVCVDIS--SIFYGTRTHSIILVDHNNVMDFHE 242
D +++ + S + +D + YGTR+ + + F+E
Sbjct: 206 GDDPEWEYKLSSIFIDTAKEQARYGTRSMVALAAKLEGEVTFYE 249
>gi|148665101|gb|EDK97517.1| DNA segment, Chr 16, human D22S680E, expressed, isoform CRA_b [Mus
musculus]
Length = 158
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 2/83 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 60 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 117
Query: 84 TEENVFNGFHIVLLELTLQSTNI 106
TE +++NGF+I+ +L+ ++
Sbjct: 118 TEGHLYNGFNIIAADLSTSKGDV 140
>gi|339325717|ref|YP_004685410.1| hypothetical protein CNE_1c15860 [Cupriavidus necator N-1]
gi|338165874|gb|AEI76929.1| hypothetical protein CNE_1c15860 [Cupriavidus necator N-1]
Length = 275
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/224 (21%), Positives = 93/224 (41%), Gaps = 30/224 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + +G+ L N + R + R RG LV ++ G + +YL L E+
Sbjct: 56 GTWMGVNADGRFAALTNYRAPSEKRTDARSRGELVAGFLRGHAAPFDYLDGLAG--EDGC 113
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF+++ +L ++ +SN + P Q + +YG N+ + P+
Sbjct: 114 YNGFNLLASDL----RELWWYSNRSASRQP----------QRLRPGLYGLSNALLDT-PW 158
Query: 149 QKVIFGKEKFAEIV--NKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
KV AE++ + + ++ +Q + D + P + + E + +
Sbjct: 159 PKVRSRVGALAEVLAADSGQANASAEPYLQMLADARQAADFELPATGVAPEWEKLLSSAF 218
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
I S YGTR +++ V H+ D E + D + +
Sbjct: 219 ----------IRSPLYGTRASTVLRVRHDGRFDLSERSFDADGR 252
>gi|13529419|gb|AAH05445.1| D16H22S680E protein [Mus musculus]
Length = 151
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 2/83 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 53 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNI 106
TE +++NGF+I+ +L+ ++
Sbjct: 111 TEGHLYNGFNIIAADLSTSKGDV 133
>gi|198418074|ref|XP_002128217.1| PREDICTED: similar to MGC88919 protein [Ciona intestinalis]
Length = 282
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 97/214 (45%), Gaps = 23/214 (10%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
Q VKGGTW + GK L N L E++ + RG +V ++E + + +Y+
Sbjct: 51 QPGVKGGTWFGVNKQGKFAFLTNYLTPKEDAFTDKVSRGFIVRNFLENEMNPHDYIASTL 110
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E H V + T Q+ N+ +F S+ N+ D VY
Sbjct: 111 MSREH---RPHHFVGGQFTNNCQNMNVSYFGT-----HDSSSPLNL------DDGVYALA 156
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL-DLMKNKQSNYPDPEIDRKA 198
S ++ ++KV G KF +++ N S ++ L+ N+ D++ +K YPD + +++
Sbjct: 157 CSTLGTK-WRKVNHGSNKFQKVIQ--NPNSSTKELMDNLFTDVLSDKTPLYPDALVKKQS 213
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ + E++ Y V + YGTR ++++V
Sbjct: 214 GSYLPEEFLKLYCSVMIS-GYPDYGTRMQTVLIV 246
>gi|398991808|ref|ZP_10694903.1| hypothetical protein PMI23_05412 [Pseudomonas sp. GM24]
gi|399013338|ref|ZP_10715647.1| hypothetical protein PMI19_02441 [Pseudomonas sp. GM16]
gi|398114006|gb|EJM03842.1| hypothetical protein PMI19_02441 [Pseudomonas sp. GM16]
gi|398137080|gb|EJM26147.1| hypothetical protein PMI23_05412 [Pseudomonas sp. GM24]
Length = 248
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 95/218 (43%), Gaps = 35/218 (16%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLT 78
+ +D GGTWL NG+ L N+ + +P R RG LV +++ G S ++YL
Sbjct: 42 VHAGRDLEAGGTWLGIGANGRFAALTNIR-DPQQPTARKSRGELVARFLTGDASIDDYLN 100
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++ + E + GF+++L S ++HF+ A L P + VYG
Sbjct: 101 EVVGHSAE--YAGFNLLL----GNSHELWHFN--ARLSEPVML----------ESGVYGV 142
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ K++ K + +++ + Q+L+ + D ++ PD +
Sbjct: 143 SNAGLDT-PWPKLLKAKAGLSAVLD----DPQPQHLLDLLSDAQTAPEAELPDTGVGLAT 197
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 198 ETLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|241780291|ref|XP_002400136.1| conserved hypothetical protein [Ixodes scapularis]
gi|215508553|gb|EEC18007.1| conserved hypothetical protein [Ixodes scapularis]
Length = 114
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRD---RGPLVVKYVEGQKSAEEYLTDL 80
++ GGTWL + GKL LLN++ G + RG LVVKY+EGQ YL DL
Sbjct: 4 EEGKAGGTWLGMNVTGKLASLLNIIQPLDEITGDEKLPRGHLVVKYLEGQHDGASYLRDL 63
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKV 122
+ E+ F+ F +V L++ +I LD+P +
Sbjct: 64 SRRAED--FDRFLLVTLDIRPSRQDIEATCYTNALDAPPVPL 103
>gi|301061449|ref|ZP_07202219.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300444488|gb|EFK08483.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 256
Score = 57.4 bits (137), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 100/226 (44%), Gaps = 33/226 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + G+L + N +S + RG LV ++ + E YL L
Sbjct: 44 AGRDLRDGGTWLGITRKGRLAAITNYRDPSSVNEDAPSRGALVSDFLLSGEDPETYLARL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K+ +NGF++VL+ + FSN + P Q+ + +YG N
Sbjct: 104 SKKAR--YYNGFNLVLM----AEGKYFWFSN---RNGP---------PQQLRPGIYGVCN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV GK + E++ SES N +++ ++ ++ S PD +
Sbjct: 146 -HLLNTPWPKVTEGKARLQEML------SESLN-PESVFQMLSDR-SIPPDVHL-----P 191
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
D + S + ++ YGTR+ +++ +D NN + F E T D
Sbjct: 192 DTGVGLEWERSLGAIFVARPGYGTRSSTLLTIDQNNHVTFVEKTHD 237
>gi|164659246|ref|XP_001730747.1| hypothetical protein MGL_1746 [Malassezia globosa CBS 7966]
gi|159104645|gb|EDP43533.1| hypothetical protein MGL_1746 [Malassezia globosa CBS 7966]
Length = 320
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 103/231 (44%), Gaps = 31/231 (13%)
Query: 25 DAVKGGTWLATSTNGKLGILLNV-------LGENSRPNGRDRGPLVVKYVEGQKSAEEYL 77
D + GGTW+ + +G L NV L ++ RP R RG LV+K+++ Q S
Sbjct: 57 DLLAGGTWMGITRSGSFAALTNVYEMPAPQLTQDGRPL-RSRGELVMKWLDAQHSEPGLS 115
Query: 78 TDLKKET--EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
E + F++++ +L+ + ++ + SN + + N EN+ V
Sbjct: 116 ASAIDEMYLSRGDYGSFNLLVGDLSEEGAHVRYISN----RTSDHEYNNAIRSNENQ-RV 170
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ--SNYPDPE 193
G NS +P+ KV G++ F ++++ +S LI+ + ++++ + P PE
Sbjct: 171 CGLSNSPL-QKPWPKVELGEKLFQSVLSE--ERSSQACLIERLFNVLQTSSYTRHVPVPE 227
Query: 194 IDR--------KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
R K + D+ M S + +YGTRT ++ILV +
Sbjct: 228 AMRHTIHVSPMKMPSKADQ---MHLSSTPNPDTFAWYGTRTSTVILVSRTS 275
>gi|413919267|gb|AFW59199.1| hypothetical protein ZEAMMB73_815680 [Zea mays]
Length = 336
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +GKL L NV +S + RG L V++++G EY T++ K
Sbjct: 48 GRDELGGGTWMGCTRDGKLAFLTNVREPSSLIGAKTRGQLPVRFLQGSLCPLEYATEIAK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSN 111
E ++ +NGF+++L + + S + + SN
Sbjct: 108 EADQ--YNGFNLILAD--VNSGTMVYISN 132
>gi|348029331|ref|YP_004872017.1| hypothetical protein GNIT_1915 [Glaciecola nitratireducens FR1064]
gi|347946674|gb|AEP30024.1| hypothetical protein GNIT_1915 [Glaciecola nitratireducens FR1064]
Length = 258
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 111/237 (46%), Gaps = 42/237 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEG----QKSAEEY 76
+D GGTW+ + NG + L N+ + PN RG LV+ +++G QK +
Sbjct: 44 AGRDEEAGGTWMGVNKNGSIAALTNIRAPDKDIPNPVTRGELVINFLKGSSKQQKKYAKT 103
Query: 77 LTDLKKETEENVFNGFHIVLLELTLQSTNIY--HFSNIAPLDSPSTKVTNVTNKQENKDH 134
L D K + +NG++ LL TL+ +Y H L+ D
Sbjct: 104 LADSKAQ-----YNGYN--LLFGTLEQLYVYNNHEDTCVQLE----------------DG 140
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
V+G N+ S P+ K+ G++ A K+ + ++ + +++ +L++N N P E
Sbjct: 141 VFGLSNASLNS-PWPKISTGRDALA----KYCQHADVLD-TEHLFELLRN---NKP-AED 190
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ + +++ R S + I S YGTR+ +++LVDH+ + + E T + + ++
Sbjct: 191 SMLPKTGVPIEWERRLSSIF--IQSPEYGTRSSTVLLVDHHQQVVWEERTFNAQAEQ 245
>gi|407687981|ref|YP_006803154.1| hypothetical protein AMBAS45_11020 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407291361|gb|AFT95673.1| hypothetical protein AMBAS45_11020 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 260
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 102/233 (43%), Gaps = 43/233 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKY-----VEGQKSA 73
L QD GTW+ + NG + L NV N N RG LV + VE + A
Sbjct: 42 LLAGQDLEANGTWMGVTRNGYVAALTNVRDPHNINKNAVSRGELVANWLKHDPVEMKPVA 101
Query: 74 E-EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
+ +YL L+K + +NG++++ ++ S +Y+ NV N
Sbjct: 102 QSQYLATLEKTRHQ--YNGYNLLFGDI--NSLKVYN---------------NVNNSTHII 142
Query: 133 D-HVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLI--QNILDLMKNKQSNY 189
D VYG N+ + P+ KV G + V+K QN+I +++ D+++
Sbjct: 143 DTGVYGLSNADIAT-PWPKVTQGVTALNDYVSK-------QNIINEEDLFDILRY----- 189
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
D + D D Y+ + + I S YGTRT +++LVD NN + + E
Sbjct: 190 -DNKADDALLPDTGVGYEWEKALSSIFIQSEKYGTRTSTLLLVDKNNTLTWKE 241
>gi|407694305|ref|YP_006819093.1| hypothetical protein B5T_00465 [Alcanivorax dieselolei B5]
gi|407251643|gb|AFT68750.1| hypothetical protein B5T_00465 [Alcanivorax dieselolei B5]
Length = 251
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 97/236 (41%), Gaps = 46/236 (19%)
Query: 25 DAVKGGTWLATSTNGKLGILLN--------VLGENSRPNGRDRGPLVVKYVEGQKSAEEY 76
D GGTWL +G+ ++ N +GE S RG L ++E +S E +
Sbjct: 44 DLRAGGTWLGIHRDGRFAVVTNYREPVAERTVGELS------RGLLPQAFLESHQSPELF 97
Query: 77 LTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
L E EE++++GF++ L ++++ SN P P K ++
Sbjct: 98 CLSL--EAEEHLYSGFNL----LVGDRESLWYLSNRGPAAQPV------------KPGLH 139
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES-QNLIQNILDLMKNKQSNYPDPEID 195
G N P+ KV GK++ A + + S ++L+ ++D + + + PD ++
Sbjct: 140 GLSNG-VLDDPWPKVERGKQRLARALEGAPPNTPSLRDLLGVVVDPYQPPEEDLPDTGVE 198
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE--WTLDYET 249
R+ E + + I S YGTR S +L+ E W D E
Sbjct: 199 RELERLVAPIF----------IQSQQYGTRASSAVLLPREGAPMMREQCWRADGEA 244
>gi|409042627|gb|EKM52111.1| hypothetical protein PHACADRAFT_128288 [Phanerochaete carnosa
HHB-10118-sp]
Length = 308
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/243 (21%), Positives = 100/243 (41%), Gaps = 33/243 (13%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTWL + G++ L N+ E+ RG L ++ ++ E +
Sbjct: 52 GRDLSAGGTWLGVNRAGRVAFLTNITEEHLSYTS-TRGALASAFLLPDRAGESLQDHVGH 110
Query: 83 ETEEN-VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNK------------- 128
EN + GF+++L ++ AP S VTN+
Sbjct: 111 VVRENRAYAGFNLLLFGPRASAS--------APGGVLSYDAACVTNRGGGGTITCRPLAD 162
Query: 129 QENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSN 188
E + G ++ + KV G F++++ ++ +L+Q + L+ + +S
Sbjct: 163 NERRCGALSNGVEGQGAEAWPKVCLGSSMFSDVLQFVTPETTEADLVQRLFHLLTSWRSE 222
Query: 189 YPDPEIDRKAEADMDEDYKMRYSRVCVDISS--IFYGTRTHSIILVDHNNVMDFHE---W 243
+P +A A++ ++ +V +S + YGTR +IILV + + F+E W
Sbjct: 223 HPP-----RARAELRNTIQIEPMQVRASETSDGLLYGTRLSTIILVRRDGSVSFYERDIW 277
Query: 244 TLD 246
LD
Sbjct: 278 VLD 280
>gi|381394283|ref|ZP_09920000.1| hypothetical protein GPUN_1002 [Glaciecola punicea DSM 14233 = ACAM
611]
gi|379329885|dbj|GAB55133.1| hypothetical protein GPUN_1002 [Glaciecola punicea DSM 14233 = ACAM
611]
Length = 255
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 109/235 (46%), Gaps = 50/235 (21%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNV-LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D + GGTW+ +T G + L N+ + + N RG LV K++ K E YL D
Sbjct: 44 AGKDLLAGGTWMGINTRGDVAALTNIRVPVPEKNNAVSRGELVTKFLSSNKQ-ESYL-DT 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K+T N +NG++++ L TN++ +++ LD+ + + K+ VYG N
Sbjct: 102 LKQTHAN-YNGYNLLFGNL----TNLHVYNS---LDNSAYSL---------KEGVYGLSN 144
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ S P+ K+ GK A+ S +++ +L+ N+
Sbjct: 145 ASLDS-PWPKLGMGKSALAQYCQHAKDLS-----FEHLFELLSNRA-------------- 184
Query: 201 DMDEDYKMRYSRVCVDI----SSIF-----YGTRTHSIILVDHNNVMDFHEWTLD 246
+ +D ++ + V V+I SSIF YGTR+ +++L+D+ + + E T
Sbjct: 185 -VAKDEELPNTGVSVEIEKMLSSIFICTPQYGTRSSTVLLIDNQQQVYWEERTFS 238
>gi|406686234|gb|AFS51429.1| hypothetical protein pECd1.7 [uncultured bacterium]
Length = 248
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/215 (21%), Positives = 87/215 (40%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + P + RG LV +++ G ++YL+D+
Sbjct: 44 AGRDIEAGGTWLGVGANGRFAALTNIRDPHQPPARKSRGELVARFLCGDMPIDDYLSDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF++ L + ++HF+ T VYG N+
Sbjct: 104 GRSLE--YAGFNL----LVGNANELWHFN------------ARETEAVLLAPGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + E+++ + Q L+ + D + PD + E
Sbjct: 146 GIDT-PWPKLLKARAALEEVLD----DPQPQALLALLNDSQTAPFAELPDTGVGLATETL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V +
Sbjct: 201 LSS----------VFIASPTYGTRASTALIVQADG 225
>gi|409911344|ref|YP_006889809.1| hypothetical protein KN400_0772 [Geobacter sulfurreducens KN400]
gi|298504912|gb|ADI83635.1| protein of unknown function DUF833 [Geobacter sulfurreducens KN400]
Length = 256
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/224 (20%), Positives = 96/224 (42%), Gaps = 35/224 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW + +G++ + N + R R RG LV ++ G ++ +L L
Sbjct: 44 AGRDLTAGGTWCGVTRDGRIAAVTNYRDPGAHRVGARSRGELVAGFLSGDEAPSRWLEHL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
++ + +NGF+++ + +++ SN SP + ++G N
Sbjct: 104 QRNGHD--YNGFNLIFGD----GNGLHYHSNRGAAASPLSP------------GIHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV G++ A ++ + + + ++ ++ N+ P P+
Sbjct: 146 -HLLDTPWPKVARGRDALARLL-----ATADEPAVDDLFAILANRT---PAPD-----HL 191
Query: 201 DMDEDYKMRYSRVC--VDISSIFYGTRTHSIILVDHNNVMDFHE 242
D + + R+ + I+S YGTR+ ++ILVD + F E
Sbjct: 192 LPDTGVSLDWERLLSPLFITSPTYGTRSSTVILVDRSGQCTFVE 235
>gi|395649077|ref|ZP_10436927.1| hypothetical protein Pext1s1_10887 [Pseudomonas extremaustralis
14-3 substr. 14-3b]
Length = 248
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/225 (22%), Positives = 96/225 (42%), Gaps = 41/225 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + +G+ L N+ + P + RG LV +++ G E+YL ++
Sbjct: 44 AGRDLEAGGTWLGVNADGRFAALTNIRDPHQPPARKSRGELVARFLGGSLPLEQYLAEIN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E + GF++ L ++H++ DS T++ VYG N+
Sbjct: 104 GRAIE--YAGFNL----LVGTREELWHYNA---RDSVPTQL---------PAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ----SNYPDPEIDRK 197
+ P+ K++ K E++ N + E+ +LD++ + Q + PD +
Sbjct: 146 GLDT-PWPKLLKAKAALGEVLE--NPQPET------LLDILSDPQTAPFAELPDTGVGLA 196
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E+ + V I+S YGTR + ++V+ + E
Sbjct: 197 TESLLS----------SVFIASPSYGTRASTALIVNADGTRQIVE 231
>gi|413919266|gb|AFW59198.1| hypothetical protein ZEAMMB73_815680 [Zea mays]
Length = 271
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +GKL L NV +S + RG L V++++G EY T++ K
Sbjct: 48 GRDELGGGTWMGCTRDGKLAFLTNVREPSSLIGAKTRGQLPVRFLQGSLCPLEYATEIAK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSN 111
E ++ +NGF+++L + + S + + SN
Sbjct: 108 EADQ--YNGFNLILAD--VNSGTMVYISN 132
>gi|449443390|ref|XP_004139460.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Cucumis
sativus]
Length = 263
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 52/84 (61%), Gaps = 3/84 (3%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D V GGTWL S GK+ + NVL + P+ + RG L + +++ KS +E+ ++K E
Sbjct: 45 RDEVGGGTWLGCSRKGKVAFVTNVLEPHEIPDAKSRGHLPLLFLQSTKSPKEFAEEIKAE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIY 107
+ +NGF++++ +++ + T IY
Sbjct: 105 AYQ--YNGFNLIVADISTK-TMIY 125
>gi|449525826|ref|XP_004169917.1| PREDICTED: ser/Thr-rich protein T10 in DGCR region-like [Cucumis
sativus]
Length = 263
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 52/84 (61%), Gaps = 3/84 (3%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+D V GGTWL S GK+ + NVL + P+ + RG L + +++ KS +E+ ++K E
Sbjct: 45 RDEVGGGTWLGCSRKGKVAFVTNVLEPHEIPDAKSRGHLPLLFLQSTKSPKEFAEEIKAE 104
Query: 84 TEENVFNGFHIVLLELTLQSTNIY 107
+ +NGF++++ +++ + T IY
Sbjct: 105 AYQ--YNGFNLIVADISTK-TMIY 125
>gi|440739368|ref|ZP_20918884.1| hypothetical protein A986_13817 [Pseudomonas fluorescens BRIP34879]
gi|440379815|gb|ELQ16399.1| hypothetical protein A986_13817 [Pseudomonas fluorescens BRIP34879]
Length = 248
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/215 (20%), Positives = 89/215 (41%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL S +G+ L N+ + P RG LV +++ G ++YL ++
Sbjct: 44 AGRDQEAGGTWLGVSADGRFAALTNIREPHQPPARTSRGELVARFLSGSLPIDQYLAEVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF+++L ++H++ + + K +YG N+
Sbjct: 104 GRSIE--YAGFNLLL----GTRDEMWHYN------------AHESQPTRLKPGIYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + E++ + Q L+ + D + PD + E+
Sbjct: 146 TLNT-PWPKLLKARAALGEVLE----DPQPQALLAIMSDPQTAPFAQLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASASYGTRVSTALIVNADG 225
>gi|115933480|ref|XP_001200927.1| PREDICTED: uncharacterized protein C22orf25 homolog
[Strongylocentrotus purpuratus]
Length = 228
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 107/247 (43%), Gaps = 35/247 (14%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEE 86
+GGTWL S G+L ++LN+ R + + RG LV ++ G+++ ++YL + + E+
Sbjct: 6 EGGTWLGMSKKGRLAVILNIFNPGGIRDDAKGRGALVSDFLTGEQTTDDYLKKIAEYGED 65
Query: 87 NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
+NGF++ L+++ Y + P Q+ + ++G NS +
Sbjct: 66 --YNGFNLFTLDISNGDAAYYSNKSSKP-------------PQKLQPGIFGVSNSTL-EK 109
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSESQNL--IQNIL---DLMKNKQSNYPDPEIDRKAEAD 201
P+ K K EI+N + S L + ++L +L+ +Q + + A
Sbjct: 110 PWPKANHLKTNLEEIINSSSDLSSEDLLKKLHSVLESCELLNGEQFKFKPGMTIEEMRAV 169
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL----DYETKKWIHTHI 257
+ + V + S YGTR+ ++I VD + + E +L D KW
Sbjct: 170 LPQ---------MVHVWSPVYGTRSSTVITVDAAGEVVYSEKSLEEPIDPSNLKWTTKEH 220
Query: 258 RKTLNMA 264
R L A
Sbjct: 221 RFHLTCA 227
>gi|255623260|ref|XP_002540348.1| Ser/Thr-rich protein T10 in DGCR region, putative [Ricinus
communis]
gi|223496514|gb|EEF22035.1| Ser/Thr-rich protein T10 in DGCR region, putative [Ricinus
communis]
Length = 172
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 76/178 (42%), Gaps = 28/178 (15%)
Query: 53 RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNI 112
RP+ RG LV ++ G SA EYL + E FNGF++VL + +Y +SN
Sbjct: 2 RPDAPTRGVLVADFLTGTLSAPEYLGRIAPGAGE--FNGFNLVLGD----RDALYWYSNR 55
Query: 113 APLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQ 172
A D N Q + VYG N + P+ KV+ K +FA ++ +
Sbjct: 56 AGGDP--------RNGQPLEPGVYGISNG-LLNDPWPKVVRTKAQFASLLCS---GAPED 103
Query: 173 NLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSII 230
+ + D + P E + D + S VC ISS YGTRT +++
Sbjct: 104 AYFEMLADTTRAPDMRLP--------ETGVPLDMERMLSAVC--ISSPGYGTRTSTVV 151
>gi|404448195|ref|ZP_11013188.1| hypothetical protein A33Q_02625 [Indibacter alkaliphilus LW1]
gi|403765816|gb|EJZ26691.1| hypothetical protein A33Q_02625 [Indibacter alkaliphilus LW1]
Length = 247
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 100/227 (44%), Gaps = 32/227 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL +G+ L N +N + + RG LV ++EG K+ EYL ++
Sbjct: 27 AGKDLRAGGTWLGMHPSGRFAALTNYRDLKNPKKYEKSRGDLVKNFLEGDKNPYEYLKEI 86
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E +E + GF++ L N+Y+ SN + S + ++ +YG N
Sbjct: 87 AAEMQE--YEGFNL----LVGDQNNLYYLSNKS-----SGGIKQLS------PGIYGLSN 129
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPD-PEIDRKAE 199
+ P++K++ KE E + + N + ES LMK + S + PE+
Sbjct: 130 A-VLETPWRKLVKAKENLEEHLAEGNFQMES---------LMKGQHSKETESPELLPDTG 179
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
A +++ + + V +YGT +++L D ++ E D
Sbjct: 180 ATPEQEILLSAQFINV---GNYYGTVNSTVLLWDEKGQVEMMERVFD 223
>gi|415885225|ref|ZP_11547153.1| hypothetical protein MGA3_08325 [Bacillus methanolicus MGA3]
gi|387590894|gb|EIJ83213.1| hypothetical protein MGA3_08325 [Bacillus methanolicus MGA3]
Length = 255
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 108/232 (46%), Gaps = 32/232 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D KGGTW+ + G+ + N R + + RG LV Y+ G +EYL
Sbjct: 44 AGRDLEKGGTWMGVTRTGRFAAITNYRAPGYDRLDAKSRGFLVSNYLTGSSKPKEYLE-- 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K + + ++NGF++ L + ++Y++S I K++ V V+G N
Sbjct: 102 KVQQDHKLYNGFNL----LVGDTESLYYYSPIL------NKISIV------PPGVHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV G EK + ++ N+ + L+ + D + + PD I +
Sbjct: 146 A-VLDTPWPKVKKGIEKLTQAIS--NKIIDESLLLSILSDSEEAPEEELPDTGIGK---- 198
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKW 252
D++ S + + S+ YGTR +I+ +D+++ + F+E +L + ++W
Sbjct: 199 ----DWEKLLSPIFIQSST--YGTRASTILTIDNDHHIVFNEKSLLPDLRQW 244
>gi|18420207|ref|NP_568038.1| uncharacterized protein [Arabidopsis thaliana]
gi|332661506|gb|AEE86906.1| uncharacterized protein [Arabidopsis thaliana]
Length = 275
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 105/218 (48%), Gaps = 34/218 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D V GGTWL + +G+L L N +S P + RG L ++Y++ +KS E+ +++
Sbjct: 44 GRDLVGGGTWLGCTRHGRLAFLTNFKEASSFPAAKSRGDLPLRYLQSEKSPAEFAEEIQD 103
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E +++NGF++V+ + S ++ + +N P VT V+ ++ N+
Sbjct: 104 EI--SLYNGFNLVVAHVL--SKSMIYITNRPP--HGDKLVTQVSPG------IHVLSNAN 151
Query: 143 CTSQPFQKVIFGKEKFAEIV-----NKFNRKSESQNLIQNILDLMKNKQSNYPD---PEI 194
S P+ K + +E F +++ +F K+ + ++ N + K++++ P PE
Sbjct: 152 LDS-PWPKCLRLREGFQQLLAENGSGEFPVKTMVEEVMTNTV---KDEETELPHVFTPET 207
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+Y + V + + YGTR+ S I+V
Sbjct: 208 ----------EYHLSSIFVDMQRPTGRYGTRSISAIIV 235
>gi|330806921|ref|YP_004351383.1| hypothetical protein PSEBR_a245 [Pseudomonas brassicacearum subsp.
brassicacearum NFM421]
gi|327375029|gb|AEA66379.1| Conserved hypothetical protein; putative exported protein
[Pseudomonas brassicacearum subsp. brassicacearum
NFM421]
Length = 248
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 85/213 (39%), Gaps = 33/213 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ P + RG LV +++ G S EYL++
Sbjct: 42 VHAGRDLEAGGTWLGIGAEGRFAALTNIRDPGQLPAFKSRGELVARFLTGNLSIAEYLSE 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E + GF+++L + ++H + T Q + +YG
Sbjct: 102 IVPRASE--YGGFNLLLGD----GVELWHHN------------ARDTQPQRLGEGLYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + E+++ + L+ + D + PD + E
Sbjct: 144 NAGLNT-PWPKLLKARAALGEVLD----DPRPEALLTLLNDPQPASVAELPDTGVGVATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 199 MLLS----------SVFIASPAYGTRASTALIV 221
>gi|410619912|ref|ZP_11330802.1| hypothetical protein GPLA_4061 [Glaciecola polaris LMG 21857]
gi|410160553|dbj|GAC34940.1| hypothetical protein GPLA_4061 [Glaciecola polaris LMG 21857]
Length = 255
Score = 56.6 bits (135), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 106/232 (45%), Gaps = 31/232 (13%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L +D GGTW+ + GKL L N+ + P+ RG LV Y++ +++YLT
Sbjct: 42 LLAGKDLRAGGTWMGINQWGKLAALTNIRDPQRIDPDALSRGELVSHYLQQSPDSKDYLT 101
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
L T N +NGF+++ + ++N+ ++ + + +T+ VYG
Sbjct: 102 RLNA-TGTN-YNGFNLLFGD----------WNNLRVFNNHTLQSETLTSG------VYGL 143
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ K + G + + + Q + DL+KN+ E D
Sbjct: 144 SNASLNT-PWPKTVQGMAQLTQYCQDPHTLESEQ-----LFDLLKNQTK----AEDDILP 193
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+ + D++ + S + I S YGTR+ +++L+D+ + + E T + E +
Sbjct: 194 QTGVPIDWERKLSSIF--IQSDEYGTRSSTLLLIDNQQHVTWQERTFNNEAE 243
>gi|260802716|ref|XP_002596238.1| hypothetical protein BRAFLDRAFT_117984 [Branchiostoma floridae]
gi|229281492|gb|EEN52250.1| hypothetical protein BRAFLDRAFT_117984 [Branchiostoma floridae]
Length = 193
Score = 56.6 bits (135), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 69/135 (51%), Gaps = 15/135 (11%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
+GGTWL + NG+L + N+L + N + RG L+ ++ G ++ YL L +E +
Sbjct: 56 EGGTWLGITENGRLSAITNILQSSPVHNAKGRGYLITDFLRGDQTPLAYLESLAEEG--H 113
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
FN F+++ ++L+ + + +++N++ DSP Y NS P
Sbjct: 114 TFNAFNLLTMDLS-EPACLAYYNNVS-HDSPKILAPG----------QYVVSNS-LPQTP 160
Query: 148 FQKVIFGKEKFAEIV 162
QK I+G++ F E++
Sbjct: 161 LQKTIYGEKIFNELL 175
>gi|447918886|ref|YP_007399454.1| hypothetical protein H045_19480 [Pseudomonas poae RE*1-1-14]
gi|445202749|gb|AGE27958.1| hypothetical protein H045_19480 [Pseudomonas poae RE*1-1-14]
Length = 248
Score = 56.6 bits (135), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/215 (20%), Positives = 89/215 (41%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL S +G+ L N+ + P RG LV +++ G ++YL ++
Sbjct: 44 AGRDQEAGGTWLGVSADGRFAALTNIREPHQPPARTSRGELVARFLSGSLPIDQYLAEVN 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF+++L ++H++ + + K +YG N+
Sbjct: 104 GRSIE--YAGFNLLL----GTRDEMWHYN------------AHESQPTRLKPGIYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + E++ + Q L+ + D + PD + E+
Sbjct: 146 TLNT-PWPKLLKARAALGEVLE----DPQPQALLAIMSDPQTAPFAQLPDTGVGLATESL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR + ++V+ +
Sbjct: 201 LS----------SVFIASASYGTRVSTALIVNADG 225
>gi|344304085|gb|EGW34334.1| hypothetical protein SPAPADRAFT_59746 [Spathaspora passalidarum
NRRL Y-27907]
Length = 312
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 116/230 (50%), Gaps = 21/230 (9%)
Query: 30 GTWLATSTNGKLGILLNVL-GENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
GTW+ +T+GK+ +L+N ++SR G RG L + Y+ +S +E+ +L+K + +
Sbjct: 55 GTWIGVTTSGKVAVLVNYHEADSSRAIGEISRGILPLDYLCSTESDDEWRENLEK-VQLD 113
Query: 88 VFNGFHIVLLELTLQ-STNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV-----YGFGNS 141
GF ++ +L+L +T+ + NI KV ++ + +D + +G NS
Sbjct: 114 KIGGFTLLYGQLSLDPNTSKINNLNIVSNRGEHGKVFQ-SDGAKTEDEIACKSTFGLSNS 172
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKNK--QSNYPDP-EIDRK 197
++P++KV G+ +++++ K SQ L+ DL+ N N D + ++K
Sbjct: 173 -LYNEPWKKVKIGEGLLDKLIDESVEKGYSQEVLVDKCFDLLSNDTYDRNIIDSKDFEKK 231
Query: 198 AEADMDEDY-----KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
A D + + R C+ + +YGTRT +IIL+D ++++E
Sbjct: 232 IIALRDSVFIPPLIRAGMKRDCIAVGK-YYGTRTQTIILLDKTGHLNYYE 280
>gi|423694754|ref|ZP_17669244.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
Q8r1-96]
gi|388009097|gb|EIK70348.1| protein of unknown function, DUF833 family [Pseudomonas fluorescens
Q8r1-96]
Length = 248
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 85/213 (39%), Gaps = 33/213 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ P + RG LV +++ G S EYL++
Sbjct: 42 VHAGRDLEAGGTWLGIGAEGRFAALTNIRDPGQLPAFKSRGELVARFLTGNLSIAEYLSE 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E + GF+++L + ++H + T Q + +YG
Sbjct: 102 IVPRASE--YGGFNLLLGD----GVELWHHN------------ARDTQPQRLGEGLYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ + E+++ + L+ + D + PD + E
Sbjct: 144 NAGLNT-PWPKLLKARTALGEVLD----DPRPEALLTLLNDPQPASVAELPDTGVGVATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ V I+S YGTR + ++V
Sbjct: 199 MLLS----------SVFIASPAYGTRASTALIV 221
>gi|332304659|ref|YP_004432510.1| hypothetical protein Glaag_0273 [Glaciecola sp. 4H-3-7+YE-5]
gi|332171988|gb|AEE21242.1| protein of unknown function DUF833 [Glaciecola sp. 4H-3-7+YE-5]
Length = 255
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 99/230 (43%), Gaps = 31/230 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + GKL L N+ + + RG LV Y++ S+E YL +
Sbjct: 44 AGKDLRAGGTWMGINQQGKLSALTNIRDPQRIASDAISRGELVSDYLQHSPSSEGYLAQV 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E+ +NGF+++ + ++Y F+N S K+T VYG N
Sbjct: 104 SANAEK--YNGFNLLFGDW----NDLYVFNNHT---LQSDKLTT---------GVYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K + G + E ++ ES+ L + + K K P + + E
Sbjct: 146 ASLNT-PWPKTLQGMTQLTEYCQD-SQSLESEQLFALLKNQTKAKDDTLPQTGVPIEWER 203
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+ + I S YGTR+ +++LVD+ + + E T + E +
Sbjct: 204 KLSSIF----------IQSPEYGTRSSTLLLVDNQQQVSWQERTFNNEAQ 243
>gi|410626018|ref|ZP_11336788.1| hypothetical protein GMES_1258 [Glaciecola mesophila KMM 241]
gi|410154638|dbj|GAC23557.1| hypothetical protein GMES_1258 [Glaciecola mesophila KMM 241]
Length = 255
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 108/237 (45%), Gaps = 45/237 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ +T GKL L N+ + + N RG LV +Y++ + YL+ +
Sbjct: 44 AGKDLRAGGTWMGINTQGKLAALTNIRDPKRNLANAVSRGELVSQYLQHSPKSSAYLSKM 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ E +NGF+ LL + N+++ + S K+T VYG N
Sbjct: 104 LADAEN--YNGFN--LLYGDWNNLNVFNNHTLQ-----SEKLTT---------GVYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
++ + P+ K + G + E +R E ++++ L+KN+ KA+
Sbjct: 146 AKLNT-PWPKTLQGMAQLTEYCQN-SRTLE----VESLFVLLKNQ----------TKAQD 189
Query: 201 DM--DEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTLDYETK 250
D+ + + R +SSIF YGTR+ +++LVD+ + + E T + E +
Sbjct: 190 DILPQTGVPIEWER---KLSSIFIQSPEYGTRSSTLLLVDNQQHVSWQERTFNSEAQ 243
>gi|39995898|ref|NP_951849.1| hypothetical protein GSU0792 [Geobacter sulfurreducens PCA]
gi|39982662|gb|AAR34122.1| protein of unknown function DUF833 [Geobacter sulfurreducens PCA]
Length = 256
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/224 (20%), Positives = 96/224 (42%), Gaps = 35/224 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW + +G++ + N + R R RG LV ++ G ++ +L L
Sbjct: 44 AGRDLTAGGTWCGVTRDGRIAAVTNYRDPGAHRVGARSRGELVAGFLGGDEAPSRWLEHL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
++ + +NGF+++ + +++ SN SP + ++G N
Sbjct: 104 QRNGHD--YNGFNLIFGD----GNGLHYHSNRGAAASPLSP------------GIHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV G++ A ++ + + + ++ ++ N+ P P+
Sbjct: 146 -HLLDTPWPKVARGRDALARLL-----ATADEPAVDDLFAILANRT---PAPD-----HL 191
Query: 201 DMDEDYKMRYSRVC--VDISSIFYGTRTHSIILVDHNNVMDFHE 242
D + + R+ + I+S YGTR+ ++ILVD + F E
Sbjct: 192 LPDTGVSLDWERLLSPLFITSPTYGTRSSTVILVDRSGQCTFVE 235
>gi|406597050|ref|YP_006748180.1| hypothetical protein MASE_10485 [Alteromonas macleodii ATCC 27126]
gi|406374371|gb|AFS37626.1| hypothetical protein MASE_10485 [Alteromonas macleodii ATCC 27126]
Length = 260
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 99/233 (42%), Gaps = 43/233 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKY-----VEGQKSA 73
L QD GTW+ + NG + L NV N N RG LV + VE + A
Sbjct: 42 LLAGQDLEANGTWMGVTRNGYVAALTNVRDPHNINKNAVSRGELVANWLKHDSVEMKSVA 101
Query: 74 E-EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
+ +YL L+K +NG++++ ++ S +Y+ NV N
Sbjct: 102 QSQYLATLEKTRHR--YNGYNLLFGDI--NSLKVYN---------------NVNNSTHII 142
Query: 133 D-HVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLI--QNILDLMKNKQSNY 189
D VYG N+ + P+ KV G V N QN+I +++ D+++
Sbjct: 143 DTGVYGLSNADIAT-PWPKVTQG-------VTALNNYVTKQNIINDEDLFDILRY----- 189
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
D + D D Y+ + + I S YGTRT +++LVD NN + + E
Sbjct: 190 -DNKADDALLPDTGVGYEWEKALSSIFIQSEKYGTRTSTLLLVDKNNTLTWKE 241
>gi|158520248|ref|YP_001528118.1| hypothetical protein Dole_0231 [Desulfococcus oleovorans Hxd3]
gi|158509074|gb|ABW66041.1| protein of unknown function DUF833 [Desulfococcus oleovorans Hxd3]
Length = 261
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 34/228 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL +T G+ + N + P RG LV ++ G + ++YL DL
Sbjct: 44 AGRDKKAGGTWLGVTTTGRFAAVTNFRDPRLANPAAPSRGALVSDFLSGTTAPDQYLLDL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
++E ++GF++V L ++++SN +V ++ ++G N
Sbjct: 104 QQENRP--YSGFNLVAGTL---DDGLFYYSNR------EQQVRRISPG------LHGLSN 146
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV G+ + +++K + +L++ + D + PD +
Sbjct: 147 -HLLDTPWPKVETGRNRLKGLLSK-GKAFSPDDLLEILHDTTMPPDNRLPDTGV------ 198
Query: 201 DMDEDYKMRYSRVC--VDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + RV + I S YGTR+ S+IL+D F E T +
Sbjct: 199 ------GIAWERVLSPMFIRSPDYGTRSSSVILIDGEGRGLFVERTFE 240
>gi|413919265|gb|AFW59197.1| hypothetical protein ZEAMMB73_815680 [Zea mays]
Length = 164
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D + GGTW+ + +GKL L NV +S + RG L V++++G EY T++ K
Sbjct: 48 GRDELGGGTWMGCTRDGKLAFLTNVREPSSLIGAKTRGQLPVRFLQGSLCPLEYATEIAK 107
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSN 111
E ++ +NGF+++L + + S + + SN
Sbjct: 108 EADQ--YNGFNLILAD--VNSGTMVYISN 132
>gi|4539334|emb|CAB37482.1| putative protein [Arabidopsis thaliana]
gi|7270810|emb|CAB80491.1| putative protein [Arabidopsis thaliana]
Length = 262
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 60/119 (50%), Gaps = 22/119 (18%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D V GGTWL + +G+L L N +S P + RG L ++Y++ +KS E+ +++
Sbjct: 19 GRDLVGGGTWLGCTRHGRLAFLTNFKEASSFPAAKSRGDLPLRYLQSEKSPAEFAEEIQD 78
Query: 83 ETEENVFNGFHIVLLE-------------------LTLQSTNIYHFSNIAPLDSPSTKV 122
E +++NGF++V+ +T S I+ SN A LDSP KV
Sbjct: 79 EI--SLYNGFNLVVAHVLSKSMIYITNRPPHGDKLVTQVSPGIHVLSN-ANLDSPWPKV 134
>gi|386014413|ref|YP_005932690.1| hypothetical protein PPUBIRD1_4935 [Pseudomonas putida BIRD-1]
gi|313501119|gb|ADR62485.1| Hypothetical protein, conserved [Pseudomonas putida BIRD-1]
Length = 248
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 85/215 (39%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ P R RG LV +++G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPGQAPGPRSRGELVAAFLQGELGVEAYLDQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + Q H AP + K VYG N+
Sbjct: 104 SRSGQ--YSGFNLLVGD--GQRLGYLHAREAAP--------------RLLKAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ ++ +++ E L+ + D + P+ + E
Sbjct: 146 GLDT-PWPKLVKARDGLESLLD----TPEPHRLLDLLADNEPAADGDLPETGVGLATEKL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR ++++VD
Sbjct: 201 LS----------SVFIASQNYGTRASTVLIVDEQG 225
>gi|398806787|ref|ZP_10565686.1| hypothetical protein PMI15_04555 [Polaromonas sp. CF318]
gi|398087152|gb|EJL77749.1| hypothetical protein PMI15_04555 [Polaromonas sp. CF318]
Length = 273
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 100/246 (40%), Gaps = 35/246 (14%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ + G++ L NV R RG LV +++E A ++ L+
Sbjct: 47 GRDLRAGGTWMGITPAGRIAFLTNVRQAQPEAAPRSRGELVTRWLESSDDAAGFVAALQD 106
Query: 83 ETEENVFNGFHIVLLELTLQS----TNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+ F GF++VL +L ++ TN Y +P+ + Q +YG
Sbjct: 107 QGA--AFGGFNLVLGDLRQKAWTWVTNKYE-------GAPA------LHAQALAPGLYGL 151
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ K + K A + + Q L+ L ++Q P
Sbjct: 152 SNAGLDT-PWPKTLELKRVLAAALQSGDGPEGLQKLLWKALG---SRQRALPADLPHTGV 207
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIR 258
+++E S V++ YGTR +++L ++ +W ++ E + TH R
Sbjct: 208 PPELEEA----LSSAFVEVPGRGYGTRCSTVLLATASD----SDWNIELEER----THGR 255
Query: 259 KTLNMA 264
+ L A
Sbjct: 256 EGLAAA 261
>gi|332142134|ref|YP_004427872.1| hypothetical protein MADE_1013700 [Alteromonas macleodii str. 'Deep
ecotype']
gi|327552156|gb|AEA98874.1| hypothetical protein MADE_1013700 [Alteromonas macleodii str. 'Deep
ecotype']
Length = 260
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 105/238 (44%), Gaps = 53/238 (22%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVE---GQKSAEE 75
L QD GTW+ + NGK+ L NV N N RG LV +++ QK E
Sbjct: 42 LLAGQDLEADGTWMGVTRNGKIAALTNVRDPHNINKNAVSRGELVANWLKQNPAQKDYAE 101
Query: 76 ---YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
YL L++ + +NG++++ ++T + +Y+ NV N +
Sbjct: 102 QSAYLATLEETRHQ--YNGYNLLFGDVT--ALRVYN---------------NVNNSTHSI 142
Query: 133 D-HVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQ--NILDLMKNKQSNY 189
D VYG N+ + P+ KV G V N + QN I ++ +++++
Sbjct: 143 DTGVYGLSNADIAT-PWPKVTQG-------VTALNVYVQQQNTINADDLFNILRH----- 189
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHE 242
+ KAE + D + Y +SSIF YGTRT +++LVD NN + + E
Sbjct: 190 -----ENKAEDALLPDTGVGYEWEKA-LSSIFIQTEKYGTRTSTLLLVDKNNTLTWKE 241
>gi|149019790|gb|EDL77938.1| similar to Ser/Thr-rich protein T10 in DGCR region (predicted),
isoform CRA_e [Rattus norvegicus]
Length = 110
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ GGTWL ST GKLG L N L P+ R RG LV ++ + YL K
Sbjct: 2 EEGKAGGTWLGISTRGKLGALTNYLQPRQEPDARGRGELVSHFLTSDMDSLSYLK--KVS 59
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHF 109
TE +++NGF+++ +L+ ++ +
Sbjct: 60 TEGHLYNGFNLIAADLSTAKGDVVCY 85
>gi|15451202|gb|AAK96872.1| putative protein [Arabidopsis thaliana]
gi|20148245|gb|AAM10013.1| putative protein [Arabidopsis thaliana]
Length = 253
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 51/85 (60%), Gaps = 3/85 (3%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D V GGTWL + +G+L L N +S P + RG L ++Y++ +KS E+ +++
Sbjct: 22 GRDLVGGGTWLGCTRHGRLAFLTNFKEASSFPAAKSRGDLPLRYLQSEKSPAEFAEEIQD 81
Query: 83 ETEENVFNGFHIVLLELTLQSTNIY 107
E +++NGF++V+ + L + IY
Sbjct: 82 EI--SLYNGFNLVVAHV-LSKSMIY 103
>gi|300311917|ref|YP_003776009.1| signal peptide protein [Herbaspirillum seropedicae SmR1]
gi|300074702|gb|ADJ64101.1| signal peptide protein [Herbaspirillum seropedicae SmR1]
Length = 276
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/212 (23%), Positives = 87/212 (41%), Gaps = 29/212 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + +G+ L NV + R + RG LV Y+ G + YL +
Sbjct: 44 AGRDLQGGGTWLGVTRDGRFAALTNVRAPSERRTDAPTRGQLVADYLAGNLDPQAYLRQI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ + ++ +NGF+++L + +SN D N Q VYG N
Sbjct: 104 EAQAQQ--YNGFNLLL----GNRDTMLWYSNRGQQDP--------RNGQPLDYGVYGVSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K+ K +FA ++ + + + + + D + PD I + E
Sbjct: 150 ALLDT-PWPKLTRAKAQFASLLCQ---GAPEETFFEMLTDATRANDCRLPDTGIGLEKER 205
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ + I S YGTR S++ V
Sbjct: 206 MLSPIF----------IRSPDYGTRCSSVVRV 227
>gi|340371321|ref|XP_003384194.1| PREDICTED: uncharacterized protein C22orf25-like [Amphimedon
queenslandica]
Length = 269
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/246 (25%), Positives = 108/246 (43%), Gaps = 48/246 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRP--NGRDRGPLVVKYVEGQKSAEEYLTD 79
+D + GG+WL S G+ +L N + + P G RG + + Y +
Sbjct: 47 AGKDGLNGGSWLGFSRRGRFAVLTNYRKDFNAPEFQGISRG---------ETTPTNYSST 97
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DH 134
L + + ++GF+++ L +Y +N+ +
Sbjct: 98 LMSNSSQ--YDGFNLITGSLGNNKEMVY-----------------CSNRSDEGACPLLGG 138
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
VYG NS S P+ KV GK+KF EI++ K +L+ +L L+ + +PDPE+
Sbjct: 139 VYGLSNSLLDS-PWMKVKEGKKKFKEIISSGLSK---DDLVSELLSLLGDDTCYHPDPEM 194
Query: 195 DRKAEADMDEDYKMR-YSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK--- 250
+ + + M+ +S + V + YGTRT+++ILVDH + + E T+ K
Sbjct: 195 HDTSHPE----FLMKAFSSIFVKAPGVRYGTRTNTVILVDHEGTVTYVERTMAEPIKEEQ 250
Query: 251 -KWIHT 255
+WI T
Sbjct: 251 VEWITT 256
>gi|398966510|ref|ZP_10681549.1| hypothetical protein PMI25_03271 [Pseudomonas sp. GM30]
gi|398146045|gb|EJM34813.1| hypothetical protein PMI25_03271 [Pseudomonas sp. GM30]
Length = 248
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/217 (20%), Positives = 89/217 (41%), Gaps = 33/217 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL NG+ L N+ + P + RG LV +++ G S ++Y D
Sbjct: 42 VHAGRDLEAGGTWLGIGANGRFAALTNIRDPHQPPARKSRGELVARFLTGDASIDDYWRD 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + E + GF+++L + ++HF+ + + +YG
Sbjct: 102 VVGRSPE--YAGFNLLL----GNAHELWHFN------------ARTSEPVMLEPGLYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ K + +++ + L+ + D + PD + E
Sbjct: 144 NAGLDT-PWPKLLKAKAALSAVLD----DPQPARLLDLLGDAQTAPFAELPDTGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + V I+S YGTR + ++V +
Sbjct: 199 SLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|340787006|ref|YP_004752471.1| hypothetical protein CFU_1816 [Collimonas fungivorans Ter331]
gi|340552273|gb|AEK61648.1| hypothetical protein CFU_1816 [Collimonas fungivorans Ter331]
Length = 263
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/237 (21%), Positives = 96/237 (40%), Gaps = 30/237 (12%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ + NV R + RG LV ++ GQK+A EY+ ++
Sbjct: 44 AGRDLQDGGTWIGITRSGRFAAITNVRAPAERRTDAPTRGTLVSDFLGGQKTAAEYVAEI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ + +NGF++++ + ++ +SN + N Q +YG N
Sbjct: 104 SDDAAK--YNGFNLLVGD----GKDLIWYSN--------KHQEDARNGQPLPAGIYGLSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + KV+ K +FA ++ + + + D ++ P + + E
Sbjct: 150 ASLDGC-WPKVVRTKAQFASLLCQ---GAPDACFFDMLSDTTRSGDCRLPSTGVGIELER 205
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL-DYETKKWIHTH 256
+ V I S YGTR +++ + N HE + IHTH
Sbjct: 206 VLSA----------VFIQSPDYGTRASTLVRIKANGSAMLHERVAPPFCPSSPIHTH 252
>gi|224370721|ref|YP_002604885.1| hypothetical protein HRM2_36620 [Desulfobacterium autotrophicum
HRM2]
gi|223693438|gb|ACN16721.1| conserved hypothetical protein [Desulfobacterium autotrophicum
HRM2]
Length = 258
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 91/232 (39%), Gaps = 35/232 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + G+ + N + R + RG +V ++E +L
Sbjct: 44 AGKDIKAGGTWLGVTKKGRFAAITNFRKPSLQRQDAPSRGGIVTDFLESNLDPATFLNRF 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K T+ FNGF++ L ++Y FSNI SP+ + YG N
Sbjct: 104 K--TQAQRFNGFNL----LAGDQNSLYWFSNIK--GSPTRLLPGF----------YGISN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV GK+ + +NK + + L + D + PD +
Sbjct: 146 -HLMNTPWPKVARGKKALEKCLNKTGDIT-TDALFPLLADRTRPHDDELPDTGVG----- 198
Query: 201 DMDEDYKMRYSRVC--VDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
M + R+ + I S YGTR +I+++ +D E T D K
Sbjct: 199 -------MAWERLLAPIFIESPTYGTRCSTILIITQTGEIDICERTFDQNQK 243
>gi|399018203|ref|ZP_10720387.1| hypothetical protein PMI16_01299 [Herbaspirillum sp. CF444]
gi|398101839|gb|EJL92039.1| hypothetical protein PMI16_01299 [Herbaspirillum sp. CF444]
Length = 273
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/217 (23%), Positives = 92/217 (42%), Gaps = 39/217 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + +G+ L NV + RP+ RG LV Y+ G + E+Y+ +
Sbjct: 44 AGRDLQAGGTWLGVTRDGRFAALTNVRAPSEKRPDAPTRGKLVSDYLSGTFTPEQYVEHI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ E +NGF++++ + + +SN D N + VYG N
Sbjct: 104 RPGAAE--YNGFNLIVGD----RETLLWYSNKGDDDP--------RNGKPLGYGVYGLSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ KV K +FA ++ + + + + + D + P +
Sbjct: 150 ALLDT-PWPKVTRAKAQFASLLCQ---GAPEETFFELLSDSTRANDCRLPSTGV------ 199
Query: 201 DMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILV 232
+ + R+ +SSIF YGTR+ +++ V
Sbjct: 200 ------SLEWERI---LSSIFICSPDYGTRSSTVVRV 227
>gi|407368028|ref|ZP_11114560.1| hypothetical protein PmanJ_29686 [Pseudomonas mandelii JR-1]
Length = 248
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/221 (22%), Positives = 91/221 (41%), Gaps = 51/221 (23%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
D GGTWL NG+ L N+ N P+ + RG LV ++ G ++YL+D+ +
Sbjct: 47 DLEAGGTWLGIGANGRFAALTNIRDPNQSPSRKSRGELVAGFLLGNMPIDDYLSDVVGRS 106
Query: 85 EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DHVYGFG 139
E + GF++ L + ++HF N +E++ +YG
Sbjct: 107 LE--YAGFNL----LIGNANELWHF-----------------NARESEAVMLAPGIYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ----SNYPDPEID 195
N+ + P+ K++ + E++ ++ +L L+K+ Q + PD +
Sbjct: 144 NAGLDT-PWPKLLKARAALEEVLEDPQPEA--------LLALLKDPQTAPFAELPDTGVG 194
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + V I+S YGTR + ++V +
Sbjct: 195 LATETLLS----------SVFIASPTYGTRASTALIVQADG 225
>gi|194289600|ref|YP_002005507.1| hypothetical protein RALTA_A1491 [Cupriavidus taiwanensis LMG
19424]
gi|193223435|emb|CAQ69440.1| conserved hypothetical protein, DUF833 [Cupriavidus taiwanensis LMG
19424]
Length = 275
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/224 (20%), Positives = 92/224 (41%), Gaps = 30/224 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + G+ L N + R + R RG LV ++ G + +YL L E+
Sbjct: 56 GTWMGVNAEGRFAALTNYRAPSEKRTDARSRGELVAGFLRGHAAPFDYLDRLAG--EDGA 113
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF+++ +L ++ +SN + P Q + +YG N+ + P+
Sbjct: 114 YNGFNLLASDL----RELWWYSNRSASRQP----------QRLRPGLYGLSNALLDT-PW 158
Query: 149 QKVIFGKEKFAEIV--NKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
KV AE++ + + ++ +Q + D + P + E + +
Sbjct: 159 PKVRSRVGALAEVLAADSGQANASAEPYLQMLADERQAADFELPSTGVAPDWEKLLSSAF 218
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
I S YGTR +++ + H+ D +E + D + +
Sbjct: 219 ----------IRSPMYGTRASTVLRIRHDGRFDLNERSFDADGR 252
>gi|152981285|ref|YP_001353680.1| hypothetical protein mma_1990 [Janthinobacterium sp. Marseille]
gi|151281362|gb|ABR89772.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
Length = 244
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 90/212 (42%), Gaps = 33/212 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ L N+ + R + RG LV ++ G S +EY+ +L
Sbjct: 44 AGRDLQGGGTWIGVTRDGRFAALTNIRAPSEMRDDAPSRGALVTNFLAGTMSPDEYIAEL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K + E+ +NGF++++ + S + FSN D N + +YG N
Sbjct: 104 KTQQED--YNGFNLLIGD----SKQLVWFSNRGDGDE--------RNGKSLGPGIYGLSN 149
Query: 141 S--QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
S C + KV+ K +FA ++ + + + + D P + +
Sbjct: 150 SLLDCA---WPKVVRTKAQFASLLCQ---GAPEDAYFEMLTDTTCASDCRLPKTGVSIER 203
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSII 230
E + S VC I S YGTR +++
Sbjct: 204 ERLL--------SAVC--IESPDYGTRVSTLV 225
>gi|407700320|ref|YP_006825107.1| hypothetical protein AMBLS11_10375 [Alteromonas macleodii str.
'Black Sea 11']
gi|407249467|gb|AFT78652.1| hypothetical protein AMBLS11_10375 [Alteromonas macleodii str.
'Black Sea 11']
Length = 260
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 98/233 (42%), Gaps = 43/233 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKY------VEGQKS 72
L QD GTW+ + NG L L NV N N RG LV + + +
Sbjct: 42 LLAGQDLEANGTWMGVTRNGNLAALTNVRDPYNINKNAVSRGELVANWLKQDPVISENAA 101
Query: 73 AEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
+YL L K + +NG++++ ++ + +Y+ NV N
Sbjct: 102 QSQYLKTLAKTRHQ--YNGYNLLFGQV--NALKVYN---------------NVNNSTHLI 142
Query: 133 DH-VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLI--QNILDLMKNKQSNY 189
D VYG N+ + P+ KV G E V+K ++LI ++ D++++
Sbjct: 143 DSGVYGLSNADIAT-PWPKVTQGVTALNEYVSK-------ESLINDDDLFDILRS----- 189
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
D + D D Y+ + + I + YGTRT +++LVD NN + + E
Sbjct: 190 -DNKADDHLLPDTGVGYEWEKALSSIFIQTEKYGTRTSTLLLVDKNNTLTWKE 241
>gi|345303460|ref|YP_004825362.1| hypothetical protein Rhom172_1608 [Rhodothermus marinus
SG0.5JP17-172]
gi|345112693|gb|AEN73525.1| protein of unknown function DUF833 [Rhodothermus marinus
SG0.5JP17-172]
Length = 256
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 99/224 (44%), Gaps = 32/224 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL S G+L ++ N + R GR RG L ++ G + YL
Sbjct: 44 AGRDLEAGGTWLGVSERGRLALVTNYREPDRRATGRRSRGWLTRDFLLGSEPPAAYLE-- 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ E N +NGF++ L + + + SN S +V +YG N
Sbjct: 102 RVLAEGNAYNGFNL----LVGDTETLAYGSN----RSDGIRVL--------APGLYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
++ + KV G F I+ + E+ +L L+ ++ + PD + R
Sbjct: 146 HLLGTR-WPKVTRGLTAFEAILQHDSIDPET------LLALLADR-TPAPDEALPRTG-- 195
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT 244
+D +++ R S + V S+ YGTR+ +++L++ + + F E T
Sbjct: 196 -LDLEWERRLSAIFVATSA--YGTRSSTVLLLEKDGSLTFVERT 236
>gi|326433255|gb|EGD78825.1| hypothetical protein PTSG_01800 [Salpingoeca sp. ATCC 50818]
Length = 303
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 94/215 (43%), Gaps = 29/215 (13%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKY-VEGQKSAEEYLTDLKKETEE 86
K GTW+ + G++G+L NV G + + R RG LV + V + +Y + K +
Sbjct: 54 KHGTWMGITRQGRVGVLTNVRGLHDDESPRGRGELVRDFLVHPHVTPWDYCERVAKHDRD 113
Query: 87 NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
+ GF+++L +L ++ + +N + S V V ++G N+ +
Sbjct: 114 --YGGFNLLLFDLAKRTEPAAYLTNRG---TSSGTVLGVPAG------IHGLSNA-ALNI 161
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
P+ KV GK + A++ N E L + DL+ + Q P E+ + D E +
Sbjct: 162 PWFKVERGKSRMAQM--DLNLPEE--ELTPLLFDLLSDTQKA-PASELPQTGLGDAIEHH 216
Query: 207 KMRYSRVCVDISSIF--------YGTRTHSIILVD 233
S VDI YGTR ++ILVD
Sbjct: 217 ---LSSTMVDIDPSLGLPHLGSGYGTRAQTVILVD 248
>gi|221134574|ref|ZP_03560879.1| hypothetical protein GHTCC_06572 [Glaciecola sp. HTCC2999]
Length = 256
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 96/232 (41%), Gaps = 43/232 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + NG + L N+ + R + RG LVV ++ + S E + L
Sbjct: 44 AGKDLQAGGTWLGVAKNGHIAALTNIRAPSKERFDAMTRGELVVNALQYKGSWEVHTQHL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K + +NGF++V + + N+ +S + K VYG N
Sbjct: 104 KNTATD--YNGFNLVYGD----------WKNLQVFNS------HTQEHHALKQGVYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSES------QNLIQNILDLMKNKQSNYPDPEI 194
+Q + P+ K G VN N +S + L + D + PD I
Sbjct: 146 AQLNT-PWPKTQQG-------VNALNALCQSKQPLVVEQLFAILSDPTQASDETLPDTGI 197
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ E KM S + I S YGTR ++I VDH++ +++ E + D
Sbjct: 198 AKPWE-------KMLSS---IFIKSPDYGTRCSTVITVDHHHALNWEERSYD 239
>gi|398835802|ref|ZP_10593158.1| hypothetical protein PMI40_03291 [Herbaspirillum sp. YR522]
gi|398214905|gb|EJN01473.1| hypothetical protein PMI40_03291 [Herbaspirillum sp. YR522]
Length = 270
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/212 (22%), Positives = 90/212 (42%), Gaps = 29/212 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL T+ +G+ L NV + R + RG LV Y+ G S +Y+ L
Sbjct: 44 AGRDLQGGGTWLGTTRDGRFAALTNVRSPSERRTDAPTRGALVSDYLTGSMSPTDYVQQL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ ++ +NGF++++ + + +SN D+ N + VYG N
Sbjct: 104 QARAQD--YNGFNLLVGD----RETLLWYSNRGQADA--------RNGRPLDYGVYGVSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K+ K +FA ++ + + + + + D + PD + + E
Sbjct: 150 ALLDT-PWPKLTRAKAQFASLLCQ---GAPEEAFFEMLTDATRANDCRLPDTGVGIERER 205
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ + I S YGTR +++ V
Sbjct: 206 MLSPIF----------IRSPDYGTRCSTVLRV 227
>gi|341895457|gb|EGT51392.1| hypothetical protein CAEBREN_15823 [Caenorhabditis brenneri]
Length = 256
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 89/179 (49%), Gaps = 13/179 (7%)
Query: 18 MLLEVSQDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEE 75
+L V + V GTWL +GK+G+LL++ ++ N RG +V +Y++ + +
Sbjct: 41 ILSGVDEKDVARGTWLGMDEHGKVGMLLSITQPVDSKHKNAPSRGGIVNEYLKSGIDSTQ 100
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
+ +LK + E +NGF V +E Q + ++ ++ ++T K +K HV
Sbjct: 101 FFENLKNKAEN--YNGFQFVGIERN-QESKLFEVRSLTNQLVDDIEIT----KWNDKFHV 153
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
Y ++ PF+K FG + F+E + K + + + ++ + ++ + S +PD +I
Sbjct: 154 Y---SNSPPHIPFKKTEFGLKIFSEKLEKTD-EMDVDDIFSRLFEIATCRTSCFPDDQI 208
>gi|344230756|gb|EGV62641.1| DUF833-domain-containing protein [Candida tenuis ATCC 10573]
gi|344230757|gb|EGV62642.1| hypothetical protein CANTEDRAFT_115200 [Candida tenuis ATCC 10573]
Length = 316
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/254 (25%), Positives = 103/254 (40%), Gaps = 52/254 (20%)
Query: 30 GTWLATSTNGKLGIL--------LNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
GTW+ T GKL +L LNV+ E SR G + V Y+ S EE+ +L+
Sbjct: 54 GTWIGVDTTGKLAVLVNYRESDDLNVISEVSR------GIIPVDYLSSDLSDEEWYDNLE 107
Query: 82 KETEENVFN------------GFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQ 129
K + GF ++ +L L + + NI V + +
Sbjct: 108 KSLALRIVGKKVDSIPLRRVGGFSLLYGQLKLNDSGKINKLNIISNRGDKGTVFDFDSPP 167
Query: 130 E---NKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ 186
+ + D G NS +P+ KV G+ E++NK + +Q Q + +L K
Sbjct: 168 DHFHSNDTTIGLSNS-LFYEPWNKVHLGRRALHEVINKSTEQHYTQQ--QLVEELFKVLS 224
Query: 187 SNYPDPEIDRKAEADMDEDYKMRYSR--VCV-------------DISSIFYGTRTHSIIL 231
+N D K+ D K+ R +CV D +YGTRT +IIL
Sbjct: 225 TNSYDQSFANKSMKD-----KLLGLRDTICVPPLETQFTDKELNDTIGKYYGTRTQTIIL 279
Query: 232 VDHNNVMDFHEWTL 245
+D + V+ ++E L
Sbjct: 280 LDKHGVLHYYEKVL 293
>gi|114320411|ref|YP_742094.1| hypothetical protein Mlg_1255 [Alkalilimnicola ehrlichii MLHE-1]
gi|114226805|gb|ABI56604.1| protein of unknown function DUF833 [Alkalilimnicola ehrlichii
MLHE-1]
Length = 258
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 99/244 (40%), Gaps = 32/244 (13%)
Query: 7 KPNPCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG--RDRGPLVV 64
+P + E + +D GGTW+ + G L N E SRP R RG LV+
Sbjct: 28 RPTRAAHWWHEPPILAGRDERAGGTWMGVAAQGHWAALTNYR-EPSRPRRGLRSRGQLVL 86
Query: 65 KYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTN 124
+ Q DL+ E+ ++GF+++ + +Y+ SN D P +V
Sbjct: 87 DAL--QTPPASLRVDLEPAALEHDYDGFNLLFGD----RERLYYVSN---RDQPLRQVPP 137
Query: 125 VTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKN 184
+G N + P+ KV G+E+ A ++ + ++ + DL+ +
Sbjct: 138 ---------GYHGLSNG-LLNDPWPKVRRGRERLAACLSGTWGERADTPPLEPLFDLLHD 187
Query: 185 KQSNYPDPEIDRKAEADMDEDYKMRYSRVCVD--ISSIFYGTRTHSIILVDHNNVMDFHE 242
+ PD E+ +D + + R I S YGTR +++L+D + F E
Sbjct: 188 DRCP-PDHEL-------VDTGVSLAWERRLAPMFIRSPEYGTRCSTLLLLDEGGEIHFAE 239
Query: 243 WTLD 246
D
Sbjct: 240 RRFD 243
>gi|431797365|ref|YP_007224269.1| hypothetical protein Echvi_2006 [Echinicola vietnamensis DSM 17526]
gi|430788130|gb|AGA78259.1| hypothetical protein Echvi_2006 [Echinicola vietnamensis DSM 17526]
Length = 253
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 98/222 (44%), Gaps = 31/222 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ +G+ L N ++ + N RG LV ++EG+ S EYLT++
Sbjct: 42 AGKDLKSGGTWMGVHPSGRFAALTNFRDFSTVKENPVSRGQLVKDFLEGRLSPREYLTNV 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+K ++ F GF++++ E +Y+ SN A QE ++G N
Sbjct: 102 QKCQDQ--FEGFNLLVGE----GDQLYYCSNYA------------EQVQEVSPGIHGLSN 143
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+P++KV+ K + ++ K + + L+ L +++ N P+ ++ + E
Sbjct: 144 G-LLDEPWKKVVASKAQMNALLQK--EEPSLEKLLAMHLSTAEDELENLPNTGVEPEVEQ 200
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + DI YGT +++ H+ +D E
Sbjct: 201 SLSAAF-------IRDIQG--YGTVNITVLCWGHDGKVDLLE 233
>gi|114567753|ref|YP_754907.1| hypothetical protein Swol_2245 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114338688|gb|ABI69536.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 269
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 107/246 (43%), Gaps = 47/246 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D +GGTW+ +TNG L N +S RP R RG LV Y+ + + Y DL
Sbjct: 59 AGRDLKEGGTWMGITTNGCFAALTNYRDPSSFRPEARSRGHLVKDYLNSNLAPKTYTADL 118
Query: 81 KKETEENVFNGFHIVLLELTLQSTN--IYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+ +NGF++ L TN +Y++SN K+ ++ ++G
Sbjct: 119 PNGG--SAYNGFNL------LMGTNDSMYYYSN-------REKII-----RQIPAGIHGL 158
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ KV G + A+++ + ++E + +M N++ PD
Sbjct: 159 SNALLNT-PWPKVSKGIKAMAQLIQHDDIEAE------QLFAMMSNQE--LPD------- 202
Query: 199 EADMDE---DYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDY---ETKKW 252
E D+ + +M + ++S YGT ++ILV+ N+ +F W Y + W
Sbjct: 203 EQDLPQTGVSLEMERMLAPIFVTSPDYGTCLTTVILVERNH--NFQFWERSYINGQPDSW 260
Query: 253 IHTHIR 258
+ R
Sbjct: 261 DEVYYR 266
>gi|429331515|ref|ZP_19212268.1| hypothetical protein CSV86_07016 [Pseudomonas putida CSV86]
gi|428763676|gb|EKX85838.1| hypothetical protein CSV86_07016 [Pseudomonas putida CSV86]
Length = 247
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/215 (21%), Positives = 84/215 (39%), Gaps = 34/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV Y+ G+ S E YL K
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPGQALGARSRGELVAGYLRGELSVEAYLD--K 101
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E +++GF++++ N + D+P + VYG N+
Sbjct: 102 VAGEARLYSGFNLLV-------GNAGQLGYLHAGDTPRML----------EAGVYGLSNA 144
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ +E E ++ + + L + + D + P + E
Sbjct: 145 ALDT-PWPKLVKAREALRERLD----DPQPEELFELLGDARPAADAELPSTGVGLATERL 199
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR ++++V+ +
Sbjct: 200 LS----------SVFIASQNYGTRASTVLVVEADG 224
>gi|89093748|ref|ZP_01166694.1| hypothetical protein MED92_04599 [Neptuniibacter caesariensis]
gi|89081878|gb|EAR61104.1| hypothetical protein MED92_04599 [Oceanospirillum sp. MED92]
Length = 258
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 98/220 (44%), Gaps = 42/220 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D + GTWL +T GK + N G N N R RG L +++ Q+SA+EYL L
Sbjct: 44 AGRDLEQLGTWLGFNTAGKFTAVTNYRDGRNPAKNRRSRGELTANFLKCQQSADEYLLQL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
++ +E F F++++ + T +Y+ SN E K +YG N
Sbjct: 104 EQNKQE--FGDFNLLVAD----QTGLYYCSNRG------------AETHELKPGIYGMSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K+ K+ +N +E +++ +M N+++ PD ++
Sbjct: 146 ALLDT-PWPKLQRVKQDLTNAINSSALTAE------HLMGIMGNRETA-PDKDLP----- 192
Query: 201 DMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHN 235
D + + R+ +SS F YGTR ++++ N
Sbjct: 193 --DTGISLNWERL---LSSSFIQADNYGTRATTLVMQKPN 227
>gi|421619702|ref|ZP_16060653.1| hypothetical protein B597_23682 [Pseudomonas stutzeri KOS6]
gi|409778314|gb|EKN58016.1| hypothetical protein B597_23682 [Pseudomonas stutzeri KOS6]
Length = 251
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/221 (20%), Positives = 96/221 (43%), Gaps = 35/221 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+A + +G+ + N+ R RG L +++ GQ+S +Y+ L
Sbjct: 45 GRDLQAGGTWMAATADGRFAAVTNIRAPGQPAGRRSRGELPERFLRGQQSPGDYVAALAD 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+E + GF+++L + +++ + + Q + VYG N+
Sbjct: 105 HRDE--YAGFNLLLGD----RNELWYLN------------AHTGRPQALESGVYGLSNAA 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK-AEAD 201
+ P+ K+ +E A+ ++ Q + +L L+ + P P D + +
Sbjct: 147 LNT-PWPKLKKAREALADELD--------QPAPERLLMLLAD-----PHPAADHELPQTG 192
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ +++ R S + I+S YGTR + ++ + ++ E
Sbjct: 193 VPLEWERRLSSIF--IASADYGTRASTALIRHADGALEIVE 231
>gi|388542842|ref|ZP_10146134.1| hypothetical protein PMM47T1_00620 [Pseudomonas sp. M47T1]
gi|388278928|gb|EIK98498.1| hypothetical protein PMM47T1_00620 [Pseudomonas sp. M47T1]
Length = 250
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 94/221 (42%), Gaps = 45/221 (20%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ R RG LV +++ + ++YL D+
Sbjct: 44 AGRDLEAGGTWLGLGPNGRFAALTNIRDPGQPLGRRSRGDLVARFLTSSMTIDDYLDDIA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKD---HVYGF 138
E ++GF+++L S + +F+ ++ KD VYG
Sbjct: 104 SRASE--YSGFNLLL----GASGRLCYFNQ---------------RERHGKDLSTGVYGL 142
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ K++ + AE ++K + +L+L+KN DR A
Sbjct: 143 SNAGLDT-PWPKLLKARGALAEHLHKPQ--------PETLLNLLKNA---------DRAA 184
Query: 199 EADM-DEDYKMRYSRV--CVDISSIFYGTRTHSIILVDHNN 236
+A++ D + + V I+S YGTR + ++V +
Sbjct: 185 DAELPDTGVGLATESLLSSVFIASPNYGTRASTALIVGADG 225
>gi|404402189|ref|ZP_10993773.1| hypothetical protein PfusU_20574 [Pseudomonas fuscovaginae UPB0736]
Length = 248
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/225 (20%), Positives = 96/225 (42%), Gaps = 41/225 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + + L N+ + P R RG LV +++ S +YL D+
Sbjct: 44 AGRDLEAGGTWLGVGAHRRFAALTNIRNPHQPPARRSRGELVARFLGEDCSIADYLQDIV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ E + GF+++L + + ++HF+ ++ ++ ++ +YG N+
Sbjct: 104 ARSPE--YAGFNLLLGD----AQELWHFN------------SHEGQPRQLQEGIYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS----NYPDPEIDRK 197
+ P+ K++ + + + + +++L L+ ++QS PD +
Sbjct: 146 GLDT-PWPKLLKARTALGQALER--------PTPEHLLALLGDRQSAAFAELPDTGVGLA 196
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E+ + V I+S YGTR + ++V+ + E
Sbjct: 197 TESLLS----------SVFIASPSYGTRASTALIVNADGTRHLLE 231
>gi|397697224|ref|YP_006535107.1| hypothetical protein T1E_4484 [Pseudomonas putida DOT-T1E]
gi|397333954|gb|AFO50313.1| hypothetical protein T1E_4484 [Pseudomonas putida DOT-T1E]
Length = 248
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/214 (21%), Positives = 85/214 (39%), Gaps = 33/214 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ P R RG LV +++G+ E YL
Sbjct: 42 VHAGRDLEAGGTWLGVGPQGRFAALTNIRDPGQAPGPRSRGELVAAFLQGELGVEAYLDQ 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + + ++GF++++ + Q H P + K VYG
Sbjct: 102 VASRSGQ--YSGFNLLVGD--GQRLGYLHAREATP--------------RLLKAGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ ++ +++ E L+ + D + P+ + E
Sbjct: 144 NAGLDT-PWPKLVKARDGLENLLD----TPEPHRLLALLADNEPAADGDLPETGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 199 KLLS----------SVFIASQNYGTRASTVLIVD 222
>gi|410861919|ref|YP_006977153.1| hypothetical protein amad1_11465 [Alteromonas macleodii AltDE1]
gi|410819181|gb|AFV85798.1| hypothetical protein amad1_11465 [Alteromonas macleodii AltDE1]
Length = 260
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 64/238 (26%), Positives = 104/238 (43%), Gaps = 53/238 (22%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVE---GQKSAEE 75
L QD GTW+ + NGK+ L NV N N RG LV +++ QK E
Sbjct: 42 LLAGQDLEADGTWMGVTRNGKIAALTNVRDPHNINKNAVSRGELVANWLKQNPAQKDYAE 101
Query: 76 ---YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
YL L++ + +NG++++ ++T + +Y+ NV N +
Sbjct: 102 QSAYLATLEETRHQ--YNGYNLLFGDVT--ALRVYN---------------NVNNSTHSI 142
Query: 133 D-HVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLI--QNILDLMKNKQSNY 189
D VYG N+ + P+ KV G V N + Q I ++ +++++
Sbjct: 143 DTGVYGLSNADIAT-PWPKVTQG-------VTALNVYVQQQKTINADDLFNILRH----- 189
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHE 242
+ KAE + D + Y +SSIF YGTRT +++LVD NN + + E
Sbjct: 190 -----ENKAEDALLPDTGVGYEWEKA-LSSIFIQTEKYGTRTSTLLLVDKNNTLTWKE 241
>gi|456013930|gb|EMF47567.1| hypothetical protein B481_1164 [Planococcus halocryophilus Or1]
Length = 252
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 98/237 (41%), Gaps = 55/237 (23%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D + GTWL + G++ L N + G RG +V ++ S EEYL
Sbjct: 44 AGRDLTQLGTWLGVTKQGRIAALTNFRDPTNLEAGPLSRGAVVKNFLAATSSPEEYL--- 100
Query: 81 KKETEENVFNGFHIVL--LELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
K + + GF++++ E L NI H S P P T +++N N
Sbjct: 101 -KSIDPKQYAGFNLIVGDAEKLLYYNNIQHESYEIP---PGTH--SLSNHFLNT------ 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
P+ KV GKEK A + + K++ + L + A
Sbjct: 149 --------PWPKVTKGKEKLASYMAQ-TEKADLEELFTIL-------------------A 180
Query: 199 EADMDEDYKMRYSRVCVDI----SSIF-----YGTRTHSIILVDHNNVMDFHEWTLD 246
+AD D + S V +D+ S IF YGTR+ +++LV H+N + F E + +
Sbjct: 181 DADHAPDTHLPNSGVSLDLERMLSPIFIKTPDYGTRSATVVLVSHDNTLTFAERSFE 237
>gi|198411761|ref|XP_002123681.1| PREDICTED: similar to MGC88919 protein, partial [Ciona
intestinalis]
Length = 229
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 88/195 (45%), Gaps = 22/195 (11%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
Q VKGGTW + GK L N L E++ + RG +V ++E + ++ +Y+
Sbjct: 51 QPGVKGGTWFGVNKQGKFAFLTNYLTPKEDAFTDKVSRGFIVRNFLENETNSHDYIASTL 110
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E H V + T Q+ N+ +F DS S + + D VY
Sbjct: 111 MSREH---RPHHFVGGQFTNNCQNMNVSYFGT---HDSSSPLILD--------DGVYALA 156
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL-DLMKNKQSNYPDPEIDRKA 198
S ++ ++KV G KF +++ N S + L+ N+ D++ +K YPD + +++
Sbjct: 157 CSTLGTK-WRKVDHGSVKFRKVIQ--NPNSSXKELMDNLFTDVLSDKTPLYPDALVKKQS 213
Query: 199 EADMDEDYKMRYSRV 213
+ + E++ Y V
Sbjct: 214 GSYLPEEFLKLYCSV 228
>gi|71021155|ref|XP_760808.1| hypothetical protein UM04661.1 [Ustilago maydis 521]
gi|46100285|gb|EAK85518.1| hypothetical protein UM04661.1 [Ustilago maydis 521]
Length = 328
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 106/243 (43%), Gaps = 44/243 (18%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRD----RGPLVVKYVE-------GQ 70
++D+ GGTWL S NG L N R P G D RG LV ++ G+
Sbjct: 49 ARDSTGGGTWLGVSENGAFAALTNFTESAPRIPVGMDGFESRGGLVRDWLTLQAAMCGGK 108
Query: 71 KSAEEYLTDLKK-----ETEENVFNGFHIVLLELTLQSTNIYHFSN-------------- 111
+S +E T +++ + + + GF++++ L+LQ + + +N
Sbjct: 109 RSLQEVSTQVQEYLNSVGAKGDRYPGFNLLVGALSLQGMVVGYVTNRTLQGQVVRDATVD 168
Query: 112 -IAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSE 170
PL +T+ ++ T+ G NS +QP++KV G F +IV+ + +S
Sbjct: 169 MFLPLPHGATEASSATSPP------VGMSNS-ILAQPWRKVTSGSHSFCQIVSSHHTQST 221
Query: 171 S-QNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSI 229
+ +++ + + DL+ SN P + + + ++ S +Y TRT ++
Sbjct: 222 TLEDMTEQLFDLLWTS-SNPPPSQRSELQNSVLISPLELPAS---ASTERDWYATRTSTV 277
Query: 230 ILV 232
I +
Sbjct: 278 ITI 280
>gi|322421113|ref|YP_004200336.1| hypothetical protein GM18_3632 [Geobacter sp. M18]
gi|320127500|gb|ADW15060.1| protein of unknown function DUF833 [Geobacter sp. M18]
Length = 252
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 94/227 (41%), Gaps = 33/227 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D V GGTWL ++ G++ L N + R RG LV ++E + A+EYL
Sbjct: 44 AGRDLVHGGTWLGVTSTGRIAALTNYRDPSDPHRHGPSSRGRLVSAFLEEEAGAKEYLDR 103
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
L++ + G+++ L S ++ +SN S ++ + + ++G
Sbjct: 104 LRQSGIP--YGGYNL----LVGTSERLFCYSN------KSDRLITI------EPGIHGLS 145
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N P+ KV GK+ A I+++ + ++ L + D PD + + E
Sbjct: 146 N-HLLDTPWPKVRRGKKGLARILSQ--ERFSTEELFALLADDTHAPDQELPDTGVGLELE 202
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + I YGTR S++LVD F E + +
Sbjct: 203 RLLSPIF----------IKGAQYGTRCSSVLLVDREGAATFIERSFE 239
>gi|146309230|ref|YP_001189695.1| hypothetical protein Pmen_4216 [Pseudomonas mendocina ymp]
gi|145577431|gb|ABP86963.1| protein of unknown function DUF833 [Pseudomonas mendocina ymp]
Length = 248
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/228 (20%), Positives = 91/228 (39%), Gaps = 43/228 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
L +D GGTWL + +G+ L N+ + P R RG L + ++ + YL
Sbjct: 42 LFAGRDLQAGGTWLGVTADGRFAALTNIRDPSQPPGRRSRGELPIDFLRSALEPQAYLQR 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH----- 134
+ + + + GF+++L + ++++ N QE++ H
Sbjct: 102 VADKAAD--YAGFNLLLGD----RRQLWYY-----------------NSQEDQPHALAEG 138
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
+YG N++ S P+ K+ K ++ +S+ L+ + D + + PD +
Sbjct: 139 LYGLSNARLDS-PWPKLQRAKAALENCLD----APDSERLLALLADTTQPAEHLLPDTGV 193
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I+S YGTR S +L+ + + E
Sbjct: 194 GPAVERLLATSF----------IASANYGTRASSALLLGADGSWELVE 231
>gi|421505844|ref|ZP_15952779.1| hypothetical protein A471_21279 [Pseudomonas mendocina DLHK]
gi|400343541|gb|EJO91916.1| hypothetical protein A471_21279 [Pseudomonas mendocina DLHK]
Length = 248
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/228 (20%), Positives = 91/228 (39%), Gaps = 43/228 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
L +D GGTWL + +G+ L N+ + P R RG L + ++ + YL
Sbjct: 42 LFAGRDLQAGGTWLGVTADGRFAALTNIRDPSQPPGRRSRGELPIDFLRSALEPQAYLQR 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH----- 134
+ + + + GF+++L + ++++ N QE++ H
Sbjct: 102 VADKAAD--YAGFNLLLGD----RRQLWYY-----------------NSQEDQPHPLAEG 138
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
+YG N++ S P+ K+ K ++ +S+ L+ + D + + PD +
Sbjct: 139 LYGLSNARLDS-PWPKLQRAKAALENCLD----APDSERLLALLADTTQPAEHLLPDTGV 193
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I+S YGTR S +L+ + + E
Sbjct: 194 GPAVERLLATSF----------IASANYGTRASSALLLGADGSWELVE 231
>gi|401884102|gb|EJT48275.1| hypothetical protein A1Q1_02694 [Trichosporon asahii var. asahii
CBS 2479]
Length = 517
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/226 (21%), Positives = 101/226 (44%), Gaps = 32/226 (14%)
Query: 30 GTWLATSTNGKLGILLNVLGE--NSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
GTWL + + ++G + N+ + P+ RG L+ ++ A+ ++D K+
Sbjct: 283 GTWLGITKDLRVGTVTNIRYPIVATPPDPPSRGMLLKSFLSAAPDAKVSVSDFLKDIPAK 342
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
+ GF+++L +L + + SN + T +T N D G NS QP
Sbjct: 343 AYVGFNLLLFDLQSSPAEVGYLSN-------RPEPTQLT---PNNDSCQGISNSPW-DQP 391
Query: 148 FQKVIFGKEKFAEIVNKFNRKSES-QNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
+ KV G+E+ A+ + + + + ++L+ +LDL+ P P + + D+
Sbjct: 392 YPKVTEGEERMAKTLEAWAMEGRNEEHLVTRMLDLLS------PAPPV--TSAKDLFRAT 443
Query: 207 KMRYSRVCVDISSI----------FYGTRTHSIILVDHNNVMDFHE 242
+++ + D ++ +YGTR ++I+V + + F E
Sbjct: 444 RVQPVIIGPDPNAPPADRPTEGGRWYGTRVSTVIIVRDDGHVLFVE 489
>gi|104784147|ref|YP_610645.1| hypothetical protein PSEEN5237 [Pseudomonas entomophila L48]
gi|95113134|emb|CAK17862.1| conserved hypothetical protein [Pseudomonas entomophila L48]
Length = 248
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/213 (22%), Positives = 83/213 (38%), Gaps = 35/213 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV Y+ G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPRGRFAALTNIRDPRQPLGPRSRGELVAAYLRGELGVEAYLDQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAP-LDSPSTKVTNVTNKQENKDHVYGFGN 140
+ + ++GF++++ + Q H P L SP VYG N
Sbjct: 104 SRSGQ--YSGFNLLVGD--GQQLGYLHAHEAGPRLLSPG---------------VYGLSN 144
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K++ + +++ + Q L+ + D S P+ + E
Sbjct: 145 AGLDT-PWPKLVKARSGLEGLLD----SDDPQRLLALLADAQPAPDSELPETGVGLATEK 199
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 200 LLS----------SVFIASQNYGTRASTVLIVD 222
>gi|343084094|ref|YP_004773389.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342352628|gb|AEL25158.1| protein of unknown function DUF833 [Cyclobacterium marinum DSM 745]
Length = 250
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/222 (22%), Positives = 98/222 (44%), Gaps = 31/222 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL TNGK L N +N +P + RG LV ++ G +S ++YL +
Sbjct: 42 AGKDLKGGGTWLGFHTNGKFAALTNYRDPQNDKPLSKTRGELVTNFLNGSQSPKDYLKQI 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+++ ++ +NGF++++ E + FSN Q+ ++G N
Sbjct: 102 EEKKKD--YNGFNLLVAE----KKELLVFSNYG------------GGVQQVPPGIHGLSN 143
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ KV K +++ + K + ++L L+++K+ + D
Sbjct: 144 AFLNT-PWPKVEAAKADLKKLLEHKSPK------LDDLLTLLQSKEKAPIELLPDTGIPM 196
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
D+++ ++ RV +YGT + + H+ + E
Sbjct: 197 DLEQTISSQFIRV-----EDYYGTVNTTALRWGHDGTVSIKE 233
>gi|325275836|ref|ZP_08141698.1| hypothetical protein G1E_20651 [Pseudomonas sp. TJI-51]
gi|324099032|gb|EGB97016.1| hypothetical protein G1E_20651 [Pseudomonas sp. TJI-51]
Length = 248
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV Y+ G+ E YL ++
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPRQAQGPRSRGELVAGYLRGELGVEAYLDEVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + + H ++AP + + VYG N+
Sbjct: 104 SRSGQ--YSGFNLLVGD--GRQLGYLHGPDVAP--------------RLLEAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + ++ E L+ + D PD + E
Sbjct: 146 GLDT-PWPKLVKARSGLQRLLE----TPEPPQLLALLADGEPAADGELPDTGVGMATEKL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR ++++VD
Sbjct: 201 LS----------SVFIASQSYGTRASTVLIVDDQG 225
>gi|441497454|ref|ZP_20979668.1| hypothetical protein C900_01861 [Fulvivirga imtechensis AK7]
gi|441438789|gb|ELR72119.1| hypothetical protein C900_01861 [Fulvivirga imtechensis AK7]
Length = 227
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 100/227 (44%), Gaps = 34/227 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ +T G++ +L N N + N RG LV +++ ++A +YL +
Sbjct: 14 AGRDMEAGGTWMGINTKGEISMLTNYRDPFNIKSNSPSRGHLVSGFLKNNENAHDYLQKI 73
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+E +++NGF+++ + +Y++ N + E +G N
Sbjct: 74 AREG--HLYNGFNMICGNV----DQLYYYGNYS------------KGVHEIALGFHGLSN 115
Query: 141 SQC-TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
+ TS P KV G EK + + + ++L + + D +K PD I + E
Sbjct: 116 ALLNTSWP--KVDKGLEKLKSAIK--GEEVQVESLFKTLYDDVKAPPHLLPDTGIGAEKE 171
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + I S YG+R +++LVD+ N + + E T D
Sbjct: 172 QVLS----------SIFIKSPGYGSRCSTVLLVDNENQIQYVERTYD 208
>gi|431928932|ref|YP_007241966.1| LOW QUALITY PROTEIN: hypothetical protein Psest_3861 [Pseudomonas
stutzeri RCH2]
gi|431827219|gb|AGA88336.1| LOW QUALITY PROTEIN: hypothetical protein Psest_3861 [Pseudomonas
stutzeri RCH2]
Length = 251
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 95/221 (42%), Gaps = 33/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTW+ + +G+ + N+ R RG L +Y+ + S +YLT+L
Sbjct: 44 AGRDLQAGGTWMGITAHGRFAAVTNIRAPGQPVGLRSRGELPAQYLRAELSPADYLTELT 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + GF++ L +++H L+S S + T + VYG N+
Sbjct: 104 ARRHD--YAGFNL----LVGDHQSLWH------LNSRSGEATRLGAG------VYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K+ + A ++ +++++ L+ + D + + P +
Sbjct: 146 GLDT-PWPKLQKARAALAAGLD----RADTERLLALLADPQQAAEHELP--------QTG 192
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ +++ R S + ISS YGTR + ++ + +D E
Sbjct: 193 VPLEWERRLSSIF--ISSPDYGTRASTALIRHADGALDILE 231
>gi|377820688|ref|YP_004977059.1| hypothetical protein BYI23_A012440 [Burkholderia sp. YI23]
gi|357935523|gb|AET89082.1| hypothetical protein BYI23_A012440 [Burkholderia sp. YI23]
Length = 273
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 91/213 (42%), Gaps = 29/213 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAE-EYLTD 79
+D GGTWL S +G+ L N + RPN RG LV ++ G++ A +YL
Sbjct: 49 AGRDLTGGGTWLGVSGDGRFAALTNYRAPSEVRPNAPTRGTLVSAFLAGERVAPMDYLRG 108
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ +E +NGF+++ + T + Y N A D+P + + V+G
Sbjct: 109 VAREGHR--YNGFNLLCGDFTRRELGWY--GNRA--DAPPVLL---------EAGVHGLS 153
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
NS + P+ K++ +E ++++ + + LI + D + P I + E
Sbjct: 154 NSLLNT-PWPKLVAQREALCDLIHA-DERPPLDVLIDTLRDPRIANDEHLPTTGISIERE 211
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ + I S YGTR+ + + V
Sbjct: 212 RVLSAAF----------IESPDYGTRSTTALRV 234
>gi|170719508|ref|YP_001747196.1| hypothetical protein PputW619_0321 [Pseudomonas putida W619]
gi|169757511|gb|ACA70827.1| protein of unknown function DUF833 [Pseudomonas putida W619]
Length = 248
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/214 (20%), Positives = 86/214 (40%), Gaps = 33/214 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ R RG LV Y++G+ E YL
Sbjct: 42 VHAGRDLEAGGTWLGVGPRGRFAALTNIRDPGQPLGSRSRGELVAAYLQGELGVEAYLDQ 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ +++ ++GF++++ + H AP + VYG
Sbjct: 102 VASRSKQ--YSGFNLLVGDGV--RLGYLHAREAAP--------------RLLAAGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ ++ ++ ++ Q L+ + D ++ P+ + E
Sbjct: 144 NAGLDT-PWPKLVKARDGLERLLG----SADPQRLMALLADGETAPEAELPETGVGVATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 199 KLLS----------SVFIASQNYGTRASTVLIVD 222
>gi|268316764|ref|YP_003290483.1| hypothetical protein Rmar_1205 [Rhodothermus marinus DSM 4252]
gi|262334298|gb|ACY48095.1| protein of unknown function DUF833 [Rhodothermus marinus DSM 4252]
Length = 256
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 98/222 (44%), Gaps = 32/222 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL S G+L ++ N + R G R RG L ++ G ++ YL
Sbjct: 44 AGRDLEAGGTWLGVSERGRLALVTNYREPDRRAVGRRSRGWLTRDFLLGSEAPAAYLE-- 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ E N +NGF++ L + + + SN S +V +YG N
Sbjct: 102 RVLAEGNAYNGFNL----LVGDTETLAYGSN----RSDGIRVL--------APGLYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
++ + KV G F ++ + + E+ +L L+ ++ + PD + R
Sbjct: 146 HLLGTR-WPKVTRGLAAFTSLLQEDSIDPEA------LLALLADR-TPAPDETLPRTG-- 195
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+D +++ R S + V + YGTR+ +++L + + + F E
Sbjct: 196 -LDLEWERRLSAIFVATPA--YGTRSSTVLLWEQDGKLTFVE 234
>gi|443472338|ref|ZP_21062367.1| Hypothetical protein ppKF707_3355 [Pseudomonas pseudoalcaligenes
KF707]
gi|442902680|gb|ELS28196.1| Hypothetical protein ppKF707_3355 [Pseudomonas pseudoalcaligenes
KF707]
Length = 250
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 91/225 (40%), Gaps = 41/225 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ + P GR RG L V+++ G+ E+L D
Sbjct: 44 AGRDLEAGGTWLGAGPRGRFAALTNIRDPRTPPVGRTRGELCVQFLRGEMGPGEFLEDAL 103
Query: 82 KETEENVFNGFHIVL---LELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+ + ++GF++++ EL + N+ P VYG
Sbjct: 104 RRAGD--YSGFNLLVGDDRELWFLNPRSGGPINLGP-------------------GVYGV 142
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL-DLMKNKQSNYPDPEIDRK 197
N+ + P+ KV GK AE + + S + + N+L D + P+ +
Sbjct: 143 SNADLDT-PWPKVERGKAAIAECL-----EPPSTDALLNLLHDPEQAPDHILPETGVGLN 196
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E +M S V I++ YGTR S ++V + + E
Sbjct: 197 TE-------RMLSS---VFIATRTYGTRASSALIVRADGSRELVE 231
>gi|339489871|ref|YP_004704399.1| hypothetical protein PPS_4992 [Pseudomonas putida S16]
gi|338840714|gb|AEJ15519.1| conserved hypothetical protein [Pseudomonas putida S16]
Length = 248
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 84/215 (39%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV Y++G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPAQALGPRSRGELVAAYLQGELGVEAYLDQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + + H + AP + + VYG N+
Sbjct: 104 SRSRQ--YSGFNLLVGDG--RQLGYLHARDAAP--------------RLLEAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + ++ E Q L+ + D + P+ + E
Sbjct: 146 GLDT-PWPKLVKARSGLEGLLET----PEPQRLLALLADAEPAPEGELPETGVGLATEKL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR ++++VD
Sbjct: 201 LS----------SVFIASQNYGTRASTVLIVDDQG 225
>gi|431804970|ref|YP_007231873.1| hypothetical protein B479_25205 [Pseudomonas putida HB3267]
gi|430795735|gb|AGA75930.1| hypothetical protein B479_25205 [Pseudomonas putida HB3267]
Length = 248
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 84/215 (39%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV Y++G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPAQALGPRSRGELVAAYLQGELGVEAYLDQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + + H + AP + + VYG N+
Sbjct: 104 SRSRQ--YSGFNLLVGD--GRQLGYLHAQDAAP--------------RLLEAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + ++ E Q L+ + D + P+ + E
Sbjct: 146 GLDT-PWPKLVKARSGLEGLLE----TPEPQRLLALLADAEPAPEGELPETGVGLATEKL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR ++++VD
Sbjct: 201 LS----------SVFIASQNYGTRASTVLIVDDQG 225
>gi|403236700|ref|ZP_10915286.1| hypothetical protein B1040_13094 [Bacillus sp. 10403023]
Length = 251
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 103/227 (45%), Gaps = 43/227 (18%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEE 75
E L +D K GTW+ + G+ L N N G+ RG LV + + E
Sbjct: 39 EPTLLAGRDLEKMGTWMGVTKAGRFAALTNYRDPNESSVGKQSRGKLVKEALTYNGEIAE 98
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
Y+ L+K + F G+++ L +Y++SN+ ++ NV K +
Sbjct: 99 YMKQLEKTNSQ--FPGYNL----LVGDPNQLYYYSNVG------KRLENV------KPGI 140
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
+G N + + KV GKE A+I+ + +NL++N+++L++N ++ P+
Sbjct: 141 HGLSNHLLNTN-WPKVEKGKEGLAQII-------KGENLVENLVELLQN--TDRPNDHFL 190
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNV 237
K ++ + R+ +SS+F YGTR+ +++L+ +
Sbjct: 191 PKTGVSLE------WERI---LSSLFIQTNGYGTRSSTVLLMSDTEI 228
>gi|393236554|gb|EJD44102.1| DUF833-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 333
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 75/165 (45%), Gaps = 16/165 (9%)
Query: 30 GTWLATSTNGKLGILLNV----LGENSRP--NGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
GTW+ + G+L +L N + + + P R RG +V ++ + + E+YL L +
Sbjct: 58 GTWMGITRQGRLAVLTNYREDDVADAAHPVAGTRSRGSMVTNWLTSRDAVEDYLRHLIHD 117
Query: 84 TEENVFN--GFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNK---QENKDHVYGF 138
E V N GF ++ +L H + PL S + ++V++ + D VYG
Sbjct: 118 -EGGVSNVGGFSLIFGKLHHTEAEGRH--RVPPLALISNRASDVSHTIWLSGDADEVYGL 174
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES-QNLIQNILDLM 182
NS +P+ KV GKE A V + ES + Q + L+
Sbjct: 175 SNSAFL-EPWPKVEHGKELLAAAVKDSVAQGESREQFTQRLFGLL 218
>gi|254282378|ref|ZP_04957346.1| conserved hypothetical protein [gamma proteobacterium NOR51-B]
gi|219678581|gb|EED34930.1| conserved hypothetical protein [gamma proteobacterium NOR51-B]
Length = 261
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 94/226 (41%), Gaps = 32/226 (14%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLT 78
+ +D GGTWL + G+ L NV R +G R RG +V ++ +S +L
Sbjct: 42 IAAGRDLDAGGTWLGVTRGGRFAALTNVREPPPRVSGPRSRGDIVTDFLTSAQSPAVWLA 101
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSN-IAPLDSPSTKVTNVTNKQENKDHVYG 137
+ + ++ F GF+++ L +Y SN + L++ S V ++N N D
Sbjct: 102 SYRDQFDQ--FQGFNLLFGSL---DEGLYFTSNRLGTLETVSPGVHGLSNGSLNDD---- 152
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
+ KV E I+ E+ L ++D + + N PD +
Sbjct: 153 ----------WPKVRGLCEGLTRILAT-ETTLEAPRLQPLLVDRTQPEDGNLPDTGVG-- 199
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEW 243
D++ R+ I S YGTR+ SI++V+ + F E+
Sbjct: 200 --LDLERLLAPRF------IVSEGYGTRSSSIVIVNRGREIRFSEY 237
>gi|294507538|ref|YP_003571596.1| hypothetical protein SRM_01723 [Salinibacter ruber M8]
gi|294343866|emb|CBH24644.1| Conserved hypothetical protein containing DUF833 [Salinibacter
ruber M8]
Length = 349
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 95/229 (41%), Gaps = 33/229 (14%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + G + NV E R + RG LV Y+ + + E YL L
Sbjct: 135 GRDLKAGGTWLGITRKGHWATVTNVRDERPRRDDAPSRGRLVADYLREEPAPEAYLDGL- 193
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
ETE + +NGF++ L +++SN + + ++G N+
Sbjct: 194 -ETEADQYNGFNV----LVGTPEKTFYYSN------------RDGTPRPVRSGIHGMSNA 236
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNYPDPEIDRKAEA 200
Q + KV G + E ++L I+ + D++ ++Q PD ++ +
Sbjct: 237 QLDDS-WPKVERGTSGLDALC-------EDKDLSIEALFDILDDRQPA-PDGQL---PQT 284
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYET 249
+ + + S +D YGTR ++ LV + + F E + D T
Sbjct: 285 GVGRETERMLSPPFIDGDEA-YGTRASTVFLVHRSGRVTFAERSFDGGT 332
>gi|409406458|ref|ZP_11254920.1| signal peptide protein [Herbaspirillum sp. GW103]
gi|386435007|gb|EIJ47832.1| signal peptide protein [Herbaspirillum sp. GW103]
Length = 282
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 90/212 (42%), Gaps = 29/212 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + +G+ L NV + R + RG LV Y+ + YL +
Sbjct: 44 AGRDLQGGGTWLGVTRDGRFAALTNVRAPSERRTDAPTRGQLVADYLASDLDVQAYLRRI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ E + +NGF+++L + + T ++ +SN D N Q + VYG N
Sbjct: 104 EPEAQR--YNGFNLLLGD---RETMVW-YSNRGQEDP--------RNGQPLEYGVYGVSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ K+ K +FA ++ + + + + + D + PD + + E
Sbjct: 150 ALLDT-PWPKLTRAKAQFASLLCQ---GAPEETFFEMLTDGTRANDCRLPDTGVGLEKER 205
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ + I S YGTR +++ V
Sbjct: 206 MLSPIF----------IRSPDYGTRCSTVLRV 227
>gi|325924994|ref|ZP_08186418.1| hypothetical protein XPE_0338 [Xanthomonas perforans 91-118]
gi|325925250|ref|ZP_08186657.1| hypothetical protein XPE_0588 [Xanthomonas perforans 91-118]
gi|325544338|gb|EGD15714.1| hypothetical protein XPE_0588 [Xanthomonas perforans 91-118]
gi|325544595|gb|EGD15954.1| hypothetical protein XPE_0338 [Xanthomonas perforans 91-118]
Length = 260
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/223 (21%), Positives = 83/223 (37%), Gaps = 35/223 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G SA Y L
Sbjct: 50 AGRDLRSGGSWVGLGRDGRAAVVTNVRDPLATASGRSRGHLIADYLSGSLSATAYGNALA 109
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E F F+++L + + H SN PL + ++G N
Sbjct: 110 GAAHE--FPPFNLLLCD----ADRCEHLSNHPPL------------ARRLAAGIHGMSNG 151
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRKAE 199
+ P+ K +++++ E Q L + + + P +D E
Sbjct: 152 PLDA-PWPKT----AALTRVLHRWCASGEEDLQPLWAALGNPAIAPDAALPQTGVDLPTE 206
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + IS YGTR +I+ DH+ HE
Sbjct: 207 RLLSAAF----------ISGPSYGTRASTIVAADHHGHGFIHE 239
>gi|333902450|ref|YP_004476323.1| hypothetical protein Psefu_4277 [Pseudomonas fulva 12-X]
gi|333117715|gb|AEF24229.1| protein of unknown function DUF833 [Pseudomonas fulva 12-X]
Length = 245
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/222 (22%), Positives = 94/222 (42%), Gaps = 41/222 (18%)
Query: 10 PCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEG 69
P GLI +D GGTWL G+ L N+ + R RG L +++++
Sbjct: 37 PQMPGLI-----AGRDLQAGGTWLGIGPGGRFAALTNIRDPQAEQGSRSRGELPLRFLQT 91
Query: 70 QKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQ 129
E +L +L+ E +E ++GF++++ + ++HF +S S + T +T
Sbjct: 92 PLEPEAFLEELRGEADE--YSGFNLLIGD----CQALWHF------NSQSGEATALTPG- 138
Query: 130 ENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNY 189
++G N+ + P+ K+ K A+ ++ + L + D K + +
Sbjct: 139 -----IHGVSNAGLNT-PWPKLERAKAALADAPDE-------RALFALLADTTKPEDAAL 185
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIIL 231
PD + + E + V I+S YGTR +++
Sbjct: 186 PDTGVGMELERLLG----------SVFIASPTYGTRASTLLF 217
>gi|410637089|ref|ZP_11347677.1| Ser/Thr-rich protein T10 in DGCR region [Glaciecola lipolytica E3]
gi|410143468|dbj|GAC14882.1| Ser/Thr-rich protein T10 in DGCR region [Glaciecola lipolytica E3]
Length = 254
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 100/226 (44%), Gaps = 31/226 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + NG+L L NV + N RG LV +++ Q S YL L
Sbjct: 44 AGRDLEAGGTWMGMNKNGRLCALTNVRDPQKILTNATSRGYLVSEFLTNQDSQLSYLAKL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
++ + +NG++++ + ++N+ ++ + K+ +T VYG N
Sbjct: 104 QESKHQ--YNGYNLMFGQ----------WNNLWVYNNHTDKLAKLTAG------VYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ S P+ K+ G K E + + + L +LD + + P K
Sbjct: 146 ADLDS-PWPKINQGVNKLKEHCQQ-AQALNTDKLFAILLDQTQARDELLP------KTGV 197
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+D + K+ + I S YGTR+ +++LV+ N + + E T D
Sbjct: 198 PIDWERKLS----SIFIQSPDYGTRSSTLLLVNKNKHVTWLEHTFD 239
>gi|403069756|ref|ZP_10911088.1| hypothetical protein ONdio_09189 [Oceanobacillus sp. Ndiop]
Length = 256
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/226 (22%), Positives = 98/226 (43%), Gaps = 31/226 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D ++ GTWL + G++ L N + P RG ++ Y+ +S E YL L
Sbjct: 44 AGRDLLQMGTWLGITKTGRIAALTNYRNPDLEEPGKYSRGEIITNYLISNQSPEAYLNAL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ + + ++GF++++ S +++++N ++ + N T+ N
Sbjct: 104 RAKRGD--YSGFNLIVG----NSEALFYYNN---MEDKFDLIPNGTHALSN--------- 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV+ G+ E N + E + L + + D + ++ P I E
Sbjct: 146 -HFLNTPWPKVVKGRTMLREYSNSHD-TVEIEKLFEIVADAEEAGDNDLPQTGITLDLER 203
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + I + YGTRT +I+L+D N+ + F E T D
Sbjct: 204 KLSPLF----------IQTPDYGTRTSTILLIDKNDQVTFVERTWD 239
>gi|61554042|gb|AAX46498.1| hypothetical protein DKFZp761P1121 [Bos taurus]
Length = 162
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L N R RG LV +++ + YL K
Sbjct: 2 EEGKEGGTWLGISTRGKLAALTNYLQPRLNRNARGRGELVAQFLTSDMDSLSYLK--KVS 59
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHF 109
E +++NGF+++ +L+ + ++ +
Sbjct: 60 AEGHLYNGFNLIAADLSAEKGDVICY 85
>gi|296478318|tpg|DAA20433.1| TPA: transport and golgi organization 2-like [Bos taurus]
Length = 276
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L N R RG LV +++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLNRNARGRGELVAQFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHF 109
E +++NGF+++ +L+ + ++ +
Sbjct: 111 AEGHLYNGFNLIAADLSAEKGDVICY 136
>gi|354559463|ref|ZP_08978712.1| protein of unknown function DUF833 [Desulfitobacterium
metallireducens DSM 15288]
gi|353542359|gb|EHC11822.1| protein of unknown function DUF833 [Desulfitobacterium
metallireducens DSM 15288]
Length = 259
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 96/224 (42%), Gaps = 36/224 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D +GGTW+ + NG+ L N + + RG LV Y+E S + YL L
Sbjct: 44 AGRDLEQGGTWMGIAMNGRFAALTNYRDPSQDKAKAPSRGNLVRGYLESDLSPDSYLQVL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+E E +NGF+ LL T QS +Y++SN KV K +G N
Sbjct: 104 NEEQIE--YNGFN--LLAGTTQS--LYYYSN-------REKVIRQVEK-----GFHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIV--NKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
S + + KV G F + NK N ++L + + D ++ P + +
Sbjct: 146 S-LLDEAWPKVKKGVNAFKGGLKENKIN----VEHLFEIMADQVRPDDQELPQTGVSLEL 200
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + Y I S YGTR+ +I+LVD+ N + F E
Sbjct: 201 ERMLSPLY----------IVSPDYGTRSTTILLVDNYNHVQFWE 234
>gi|254483601|ref|ZP_05096824.1| conserved hypothetical protein [marine gamma proteobacterium
HTCC2148]
gi|214036110|gb|EEB76794.1| conserved hypothetical protein [marine gamma proteobacterium
HTCC2148]
Length = 254
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 66/143 (46%), Gaps = 16/143 (11%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GG+W+ + +G + N E ++ R RG LV+ Y++G SA++YL L
Sbjct: 44 AGRDRQAGGSWMGVNRSGHFAAVTNSRDPEQTKAAERSRGELVLGYLQGDLSAQQYLDQL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ GF+++L + ++++ SN P S + +YG N
Sbjct: 104 S--LRAGAYAGFNLLLGD----REDLWYVSNSGPSSEQSPRRLTPG--------IYGLSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVN 163
+ + P+ KV GK+ AE++
Sbjct: 150 ASLDT-PWPKVQRGKQVMAELLQ 171
>gi|83648500|ref|YP_436935.1| hypothetical protein HCH_05861 [Hahella chejuensis KCTC 2396]
gi|83636543|gb|ABC32510.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
Length = 262
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/225 (22%), Positives = 91/225 (40%), Gaps = 32/225 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D +GGTW+ +TNG+ + N RG LV ++E + S ++ L+
Sbjct: 44 AGRDLQEGGTWMGATTNGRFAAVTNHRSGPREAAQLSRGALVRSFLESELSVGDFAAALE 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
++ + GF++++L + + +FSN Q VYG N
Sbjct: 104 PTSQS--YGGFNLMMLA----GSTLAYFSN-----------RKEPQYQRLTPGVYGLSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K+ GK +F + + N+ L + D PD + +AE
Sbjct: 147 LLNT-PWPKLTAGKRRFISAMTEPNQ----DELWALLADDAAAPDYLLPDTGVGVEAERL 201
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + I+S YGTR+ +++L D + V E T +
Sbjct: 202 LSSAF----------ITSKDYGTRSSTLLLHDASGVTQMWEKTFE 236
>gi|21230190|ref|NP_636107.1| hypothetical protein XCC0716 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66769820|ref|YP_244582.1| hypothetical protein XC_3519 [Xanthomonas campestris pv. campestris
str. 8004]
gi|188993035|ref|YP_001905045.1| hypothetical protein xccb100_3640 [Xanthomonas campestris pv.
campestris str. B100]
gi|21111727|gb|AAM40031.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66575152|gb|AAY50562.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
gi|167734795|emb|CAP53005.1| Conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 255
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 82/221 (37%), Gaps = 31/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G A Y +L
Sbjct: 45 AGRDLRSGGSWVGLGLDGRAAVVTNVRDPLATASGRSRGHLIADYLSGSADAATYARELA 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+E F F+++L + + H SN L + ++G N
Sbjct: 105 AAGQE--FPPFNLLLCD----AERCEHLSNHPAL------------ARALAPGIHGMSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ + K A + + + Q L + + ++ P +DR E
Sbjct: 147 PLDAH-WPKTTALTHALAAWCDSGS--DDLQPLWTALANPATAADADLPHTGVDRATERL 203
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I YGTR +I+ VDH+ HE
Sbjct: 204 LSAAF----------IRGASYGTRASTIVAVDHHGHGFIHE 234
>gi|90415499|ref|ZP_01223433.1| hypothetical protein GB2207_09286 [gamma proteobacterium HTCC2207]
gi|90332822|gb|EAS47992.1| hypothetical protein GB2207_09286 [marine gamma proteobacterium
HTCC2207]
Length = 254
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/232 (21%), Positives = 108/232 (46%), Gaps = 34/232 (14%)
Query: 6 EKPNPCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPN-----GRDRG 60
++P + +M + +D +GGTW+ S NG+ + N + S+P + RG
Sbjct: 27 QRPTLPMHWWQDMPVLAGRDEQQGGTWMGLSRNGRFAAVTN-FRDFSKPGFHEARPKSRG 85
Query: 61 PLVVKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPST 120
LV ++ Q SA+++ + ++ + V+ GF+++ IY + L++ +
Sbjct: 86 NLVTDFLCSQASAKDWADSVLEDFD--VYGGFNLL----------IYDGEQLLYLNNFNH 133
Query: 121 KVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILD 180
+V ++ + +Y N S P+ KV + + + E ++K + +Q L +++L
Sbjct: 134 QVRSL------EPGIYALSNHLLDS-PWPKVDYARRQLKETLSK---RGSNQQLAEDLLG 183
Query: 181 LMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
L++ Q+ YPD + + D++ R S + YGTR + I++
Sbjct: 184 LLEQNQT-YPDHLL---PSTGVPADWERRLSSAFIVAED--YGTRAATSIVL 229
>gi|424775329|ref|ZP_18202322.1| Ser/Thr-rich protein T10 [Alcaligenes sp. HPC1271]
gi|422889039|gb|EKU31419.1| Ser/Thr-rich protein T10 [Alcaligenes sp. HPC1271]
Length = 297
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 98/224 (43%), Gaps = 43/224 (19%)
Query: 22 VSQDAVKGGTWLA-TSTNGKLGILLNVLGENSRPNG--RDRGPLVVKYVEGQKSAEEYLT 78
+D GGTWL T + G+L +L N S N R RG L+ +++ ++S +YL
Sbjct: 44 AGRDLQAGGTWLGLTQSRGRLALLTNYREAPSNKNSKQRSRGELIPQFLRDERSPADYLE 103
Query: 79 DLKKETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
L +E + + GF++ +LE Q + ++SN P SP H+
Sbjct: 104 QLAQEGAD--YAGFNLFVLEWPDKQQPLRMGYYSNRHPSGSP---------------HML 146
Query: 137 GFG----NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDP 192
G G ++ + P+ K +F KE F+ S ++ + ++N++
Sbjct: 147 GPGVHVLSNAWLNTPWPKSLFLKETLQ--AQHFD---GSPPAVEGLFQGLRNQEV----- 196
Query: 193 EIDRKAEADMDE-DYKMRYSRVCVD--ISSIFYGTRTHSIILVD 233
A+AD+ + + + R+ I S YGTR ++I ++
Sbjct: 197 ----AADADLPQTGLSLEHERLLSSPFIVSPNYGTRCTTVITIN 236
>gi|393758012|ref|ZP_10346836.1| Ser/Thr-rich protein T10 [Alcaligenes faecalis subsp. faecalis NCIB
8687]
gi|393165704|gb|EJC65753.1| Ser/Thr-rich protein T10 [Alcaligenes faecalis subsp. faecalis NCIB
8687]
Length = 297
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 98/223 (43%), Gaps = 35/223 (15%)
Query: 22 VSQDAVKGGTWLA-TSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
+D GGTWL T G+L +L N + N R RG L+ +++ ++S +YL
Sbjct: 44 AGRDLQAGGTWLGLTQCRGRLALLTNYREIPGNKSGAQRSRGELIPQFLRDERSPLDYLA 103
Query: 79 DLKKETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
L ++ E + GF++ +L+ Q+ + ++SN P P Q V+
Sbjct: 104 QLARQGAE--YAGFNLFVLQWPTDTQALQLAYYSNRHPSGQP----------QRLDPGVH 151
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
N+ + P+ K +F KE F+ + + ++++ ++N+ D
Sbjct: 152 ILSNAWLNT-PWPKSLFLKETLQ--AQHFDGSAPA---VESLFQGLRNQ---------DL 196
Query: 197 KAEADMDE-DYKMRYSRVCVD--ISSIFYGTRTHSIILVDHNN 236
A+AD+ + + R+ I S YGTR ++I +D +
Sbjct: 197 AADADLPQTGLSLERERLLSSPFIVSPDYGTRCTTVITLDQHG 239
>gi|307202174|gb|EFN81661.1| Uncharacterized protein C22orf25-like protein [Harpegnathos
saltator]
Length = 135
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 3/75 (4%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPN--GRDRGPLVVKYVEGQKSAEEYLTDLKKE-T 84
+GGTWLA ST GK I+LN + + PN + RG L+ ++ S E YL L KE
Sbjct: 56 EGGTWLALSTKGKAAIILNFVNKEGVPNMSRKSRGSLIKNFITSNDSIELYLNKLHKENI 115
Query: 85 EENVFNGFHIVLLEL 99
+N + +VLL+L
Sbjct: 116 NIQPYNPYCLVLLDL 130
>gi|218780956|ref|YP_002432274.1| hypothetical protein Dalk_3116 [Desulfatibacillum alkenivorans
AK-01]
gi|218762340|gb|ACL04806.1| protein of unknown function DUF833 [Desulfatibacillum alkenivorans
AK-01]
Length = 262
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 97/230 (42%), Gaps = 40/230 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + G+ L NV + + RG LV +++ Q + ++YLT +
Sbjct: 44 AGRDLQGGGTWMGVNDRGRFAALTNVRDISRIKQEAKSRGLLVSGFLQNQDAPKDYLTTV 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKD-HVYGFG 139
+K+ ++ +N F++V +L +Y +T V NK D V+G
Sbjct: 104 QKKADQ--YNPFNLVAGDL----GGLYC-------------LTGVDNKVRKLDVGVFGLS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N + P+ KV GK A IV E L + L M PD ++
Sbjct: 145 NWGLDT-PWPKVKKGKNALASIVEN---ALEKGVLDREALFSMLADSETAPDDKLP---- 196
Query: 200 ADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWT 244
D + + RV +SSIF YGTR +++ V + ++ E T
Sbjct: 197 ---DTGVGLDWERV---LSSIFVKSPGYGTRCSTVLTVSFSGRVEVEERT 240
>gi|372489284|ref|YP_005028849.1| hypothetical protein Dsui_2658 [Dechlorosoma suillum PS]
gi|359355837|gb|AEV27008.1| hypothetical protein Dsui_2658 [Dechlorosoma suillum PS]
Length = 249
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/215 (20%), Positives = 81/215 (37%), Gaps = 34/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + + N + R RG L Y+ G S ++L+ L
Sbjct: 44 AGRDLQAGGTWLGIDRRRRFAAVTNYRDPKAPEGQRSRGDLTRDYLAGDLSPRDFLSGL- 102
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + GF++ L + ++ ++SN QE +YG N
Sbjct: 103 ---DGGAYGGFNLFLAD----RDSLCYYSN------------RQGRIQELPPGIYGLSN- 142
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
P+ K++ K +F+E + E++ L + D + + PD + + E
Sbjct: 143 HLLETPWPKLVAAKARFSEALQAL---PETEPLFHLLADPTRWPDEHLPDTGVSLEWERM 199
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + + S YGTR +++ VD
Sbjct: 200 LS----------AIFVQSPAYGTRASTVLTVDRQG 224
>gi|254428734|ref|ZP_05042441.1| conserved hypothetical protein [Alcanivorax sp. DG881]
gi|196194903|gb|EDX89862.1| conserved hypothetical protein [Alcanivorax sp. DG881]
Length = 257
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 90/213 (42%), Gaps = 32/213 (15%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGE-NSRPNG-RDRGPLVVKYVEGQKSAEEYLTDLKK 82
D GGTWL NG+ ++ N R G R RG L ++EGQ+S E++ L +
Sbjct: 44 DLTAGGTWLGIHRNGRFAVVTNYREPITGRQRGERSRGLLPFAFLEGQQSPEQFSHTLAE 103
Query: 83 ETEENVFNGFHIVLLELTLQST--NIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E++ + F++++ ++ +T ++++ N Q + ++G N
Sbjct: 104 --EQHHYGAFNLLVGTVSTDTTRDSLWYLGNRG------------AAPQAVQPGIHGLSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV K+ A+ + Q L+Q + D + + S PD + + E
Sbjct: 150 G-LLDDPWPKVERAKQNLADAIAA---GGNLQQLLQVVNDHYQPRDSELPDTGVGVELER 205
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I S YGTR S +++D
Sbjct: 206 LV----------APVFIQSETYGTRASSAVILD 228
>gi|410030927|ref|ZP_11280757.1| hypothetical protein MaAK2_17046 [Marinilabilia sp. AK2]
Length = 261
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 103/237 (43%), Gaps = 31/237 (13%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ + N +N R + RG LV ++EG+ + YL +++
Sbjct: 43 GKDLRAGGTWLGMHPNGRFAAITNFRDLKNKREFSKSRGDLVKNFLEGEMDPKSYLEEIE 102
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
KE +E GF++ L +Y+ SN KV V ++ +YG N+
Sbjct: 103 KEKKEYE--GFNL----LVADQEGLYYLSN---------KVEGV---RQLSPGLYGLSNA 144
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
P+ K+ KE A ++ + + L LMK +QS DP+
Sbjct: 145 -VLETPWPKLTKAKENLAHHISTASFEP---------LSLMKGQQSKETDPQEILPDTGA 194
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIR 258
E K+ S +++ + +YGT +++L H+ +D E D + + T ++
Sbjct: 195 TPEQEKL-LSAQFINVGN-YYGTVNSTVLLWKHSGEVDMMERKFDQVAETFKDTQVK 249
>gi|331006107|ref|ZP_08329441.1| hypothetical protein IMCC1989_2820 [gamma proteobacterium IMCC1989]
gi|330420086|gb|EGG94418.1| hypothetical protein IMCC1989_2820 [gamma proteobacterium IMCC1989]
Length = 279
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 101/232 (43%), Gaps = 32/232 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D V GTWL + G+ + NV + N RG L ++ G +AE YL +
Sbjct: 44 AGRDLVANGTWLGITKTGRFAAITNVREPSVVVDNPLSRGDLTRDFLMGNMNAETYLEQI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK--DHVYGF 138
++++ + GF++++ E T ++++SN N++NK+ ++ +YG
Sbjct: 104 --SSKQHRYCGFNLLVGEFTRSQQTLWYYSN--------RDTQNLSNKKYHRLDKGIYGL 153
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N ++ + KV GK E K + +++ + L + + PD ++ +
Sbjct: 154 SNHLLNTE-WPKVKAGKNFVEETTQKESFAQQTKPEKHDALRVYLENKQLAPDDQLPKTG 212
Query: 199 EADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDH---NNVMDFHE 242
+ Y R +S+ F YGTRT ++I +D N + F E
Sbjct: 213 ---------VSYPREKA-LSAAFILLPDYGTRTSTVITIDKTHTTNTISFSE 254
>gi|398878328|ref|ZP_10633453.1| hypothetical protein PMI33_03153 [Pseudomonas sp. GM67]
gi|398200585|gb|EJM87496.1| hypothetical protein PMI33_03153 [Pseudomonas sp. GM67]
Length = 248
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 53/122 (43%), Gaps = 16/122 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + + RG LV +++ G ++YL D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPHQPAARKSRGELVARFLSGDMPIDDYLDDVV 103
Query: 82 KETEENVFNGFHIVL-------------LELTLQSTNIYHFSNIAPLDSPSTKVTNVTNK 128
+ E + GF++++ LE L +Y SN A LD+P K+
Sbjct: 104 GRSLE--YAGFNLLIGNANELWHFNARELEAVLLPAGVYGLSN-AGLDTPWPKLLKARAA 160
Query: 129 QE 130
E
Sbjct: 161 LE 162
>gi|296478280|tpg|DAA20395.1| TPA: hypothetical protein LOC518209 [Bos taurus]
Length = 126
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L N R RG LV +++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPRLNRNARGRGELVAQFLTSDMDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL 99
E +++NGF+++ +L
Sbjct: 111 AEGHLYNGFNLIAADL 126
>gi|78046377|ref|YP_362552.1| hypothetical protein XCV0821 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78034807|emb|CAJ22452.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 260
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/223 (21%), Positives = 82/223 (36%), Gaps = 35/223 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G SA Y L
Sbjct: 50 AGRDLRSGGSWVGLGRDGRAAVVTNVRDPLATASGRSRGHLIADYLSGSLSATAYGNALA 109
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E F F+++L + + H SN PL + ++G N
Sbjct: 110 GAAHE--FPPFNLLLCD----ADRCEHLSNHPPL------------ARRLAAGIHGMSNG 151
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRKAE 199
+ P+ K +++++ E Q L + + P +D E
Sbjct: 152 PLDA-PWPKT----AALTRVLHRWCASGEEDLQPLWAAPGNPAIAPDAALPQTGVDLPTE 206
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + IS YGTR +I+ DH+ HE
Sbjct: 207 RLLSAAF----------ISGPSYGTRASTIVAADHHGHGFIHE 239
>gi|262202100|ref|YP_003273308.1| hypothetical protein Gbro_2166 [Gordonia bronchialis DSM 43247]
gi|262085447|gb|ACY21415.1| protein of unknown function DUF833 [Gordonia bronchialis DSM 43247]
Length = 256
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/218 (22%), Positives = 95/218 (43%), Gaps = 30/218 (13%)
Query: 22 VSQDAVKGGTWLATSTNG--KLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
+D + GGTW+ ++ ++ NV G R R RG L V ++ G +S EE+
Sbjct: 43 AGRDVLAGGTWMGVHAGALNRVAMVTNVREGIAQRSGVRSRGALPVDFLLGGQSPEEFAR 102
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+ + F+ ++ L S +++ +N P VT D V+G
Sbjct: 103 --RAAADAGDFDPVNL----LVADSGSMWWMTN-----RPQPTAQRVT------DGVHGL 145
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N S + KV G E+ A++V+ + + + ++ LDL+ ++ P DR
Sbjct: 146 SNGALDSD-WPKVTDGSERMAQLVSA---DAGAGSSVEPYLDLLADQHR----PAADRLP 197
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ + ++ S + V++ YGTR +++ V ++
Sbjct: 198 DTGVSAAFEADLSPIFVNMPG--YGTRASTVLRVGYDG 233
>gi|83816154|ref|YP_445646.1| hypothetical protein SRU_1524 [Salinibacter ruber DSM 13855]
gi|83757548|gb|ABC45661.1| Protein of unknown function (DUF833) superfamily [Salinibacter
ruber DSM 13855]
Length = 259
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 95/229 (41%), Gaps = 33/229 (14%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + G + NV E R + RG LV Y+ + + E YL L
Sbjct: 45 GRDLKAGGTWLGITRKGHWATVTNVRDERPRRDDAPSRGRLVADYLREELAPEAYLDGL- 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
ETE + +NGF++ L +++SN P + ++G N+
Sbjct: 104 -ETEADQYNGFNV----LVGTPEKTFYYSNRDGTPRPV------------RSGIHGMSNA 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNYPDPEIDRKAEA 200
Q + KV G + E ++L I+ + D++ ++Q PD ++ +
Sbjct: 147 QLDDS-WPKVERGTSGLDALC-------EDKDLSIEALFDILDDRQPA-PDGQL---PQT 194
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYET 249
+ + + S +D YGTR ++ LV + + F E + D T
Sbjct: 195 GVGRETERMLSPPFIDGDEA-YGTRASTVFLVHRSGRVTFAERSFDGGT 242
>gi|430809361|ref|ZP_19436476.1| hypothetical protein D769_23883 [Cupriavidus sp. HMR-1]
gi|429498170|gb|EKZ96684.1| hypothetical protein D769_23883 [Cupriavidus sp. HMR-1]
Length = 284
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 106/256 (41%), Gaps = 39/256 (15%)
Query: 19 LLEVSQDAVKGGTWLA---------TSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVE 68
L EV DA GTW+ T + L N + R + R RG LV ++
Sbjct: 48 LAEVIGDA---GTWMGIAGAQHGQDTPVGARFAALTNYRAPSEKRTDARSRGELVAHFLR 104
Query: 69 GQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNK 128
G ++ +YL DL +NGF+++ +L +++ +SN + P
Sbjct: 105 GDQTPADYLHDLAG--AHGAYNGFNLLASDL----HDLWWYSNRSKSRVP---------- 148
Query: 129 QENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSN 188
Q K +YG N+ + P+ KV E++ +R N+ + L +++Q+
Sbjct: 149 QRLKPGLYGLSNALLDT-PWPKVRSRVGAMCEVLAA-DRGQIGSNVESYLQLLAEDRQA- 205
Query: 189 YPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
P+ + + E K+ S I S YGTR +++ V H+ DF E + D +
Sbjct: 206 ---PDWELPSTGVTPEWEKLLSSAF---IRSPNYGTRASTVLRVMHDGRFDFVERSFDAD 259
Query: 249 TKKWIHTHIRKTLNMA 264
+ R LN+A
Sbjct: 260 GQT-GEVSYRGRLNLA 274
>gi|346723698|ref|YP_004850367.1| hypothetical protein XACM_0765 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648445|gb|AEO41069.1| hypothetical protein XACM_0765 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 260
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/223 (22%), Positives = 84/223 (37%), Gaps = 35/223 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G SA Y L
Sbjct: 50 AGRDLRSGGSWVGLGRDGRAAVVTNVRDPLATASGRSRGHLIADYLSGSLSATAYANALA 109
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E F F+++L + + H SN PL P ++G N
Sbjct: 110 GAAHE--FPPFNLLLCD----ADRCEHLSNHPPLARPLAA------------GIHGMSNG 151
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRKAE 199
+ P+ K +++++ E Q L + + + P +D E
Sbjct: 152 PLDA-PWPKT----AALTRVLHRWCASGEEDLQPLWAALGNPAIAPDAALPQTGVDLPTE 206
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + IS YGTR +I+ VDH+ HE
Sbjct: 207 RLLSAAF----------ISGPSYGTRASTIVAVDHHGHGFIHE 239
>gi|413962677|ref|ZP_11401904.1| hypothetical protein BURK_022235 [Burkholderia sp. SJ98]
gi|413928509|gb|EKS67797.1| hypothetical protein BURK_022235 [Burkholderia sp. SJ98]
Length = 272
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 90/214 (42%), Gaps = 29/214 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAE-EYLTD 79
+D GGTWL S +G+ L N N +PN RG LV ++ G++ A +YL
Sbjct: 49 AGRDLTGGGTWLGVSRDGRFAALTNYRAPGNMQPNAPTRGTLVSAFLAGERMAPLDYLQG 108
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ E +NGF+++ + + Y N A D+P + V+G
Sbjct: 109 VAHEGHR--YNGFNLLCGDFLRRQLGWY--GNRA--DAPPALL---------DAGVHGLS 153
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
NS + P+ K++ +E ++++ R S LI+ + D + P I + E
Sbjct: 154 NSLLNT-PWPKLVAQREALCDLIHADERPSLDV-LIETLRDPRIANDEHLPSTGISIERE 211
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ + I + YGTR+ + + V+
Sbjct: 212 RVLSAAF----------IETADYGTRSTTALRVN 235
>gi|384426560|ref|YP_005635917.1| hypothetical protein XCR_0882 [Xanthomonas campestris pv. raphani
756C]
gi|341935660|gb|AEL05799.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 255
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 45/221 (20%), Positives = 83/221 (37%), Gaps = 31/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G A Y +L
Sbjct: 45 AGRDLRSGGSWMGLGLDGRAAVVTNVRDPLATVSGRSRGHLIADYLSGSADAATYARELA 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+E F F+++L + + H SN L + ++G N
Sbjct: 105 AAGQE--FPPFNLLLCD----AERCEHLSNHPAL------------ARALAPGIHGMSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ + K + A + + + Q L + + ++ P +DR E
Sbjct: 147 PLDAH-WPKSTALTQALAAWCDSGS--DDLQPLWTALANPATAADADLPHTGVDRATERL 203
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I+ YGTR +I+ VD+ HE
Sbjct: 204 LSAAF----------ITGTSYGTRASTIVAVDYQGSGFIHE 234
>gi|328793706|ref|XP_001121829.2| PREDICTED: uncharacterized protein C22orf25 homolog [Apis
mellifera]
Length = 143
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 51/99 (51%), Gaps = 11/99 (11%)
Query: 7 KPNPCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPN--GRDRGPLV 63
K +P G I+M + +GGTWL S GK G+LLN+ E S N + RG LV
Sbjct: 40 KNHPVCLGGIDM-----ESGKEGGTWLGVSLTGKAGVLLNLSSLEKSSTNIPKQGRGFLV 94
Query: 64 VKYVEGQKSAEEYLTDLKKETEEN---VFNGFHIVLLEL 99
++ + S YL L KE +EN +N F +VLL L
Sbjct: 95 SNFIVSKHSTTSYLDQLHKENKENKEIQYNPFLLVLLNL 133
>gi|398845627|ref|ZP_10602654.1| hypothetical protein PMI38_02012 [Pseudomonas sp. GM84]
gi|398253393|gb|EJN38523.1| hypothetical protein PMI38_02012 [Pseudomonas sp. GM84]
Length = 248
Score = 50.1 bits (118), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 44/212 (20%), Positives = 85/212 (40%), Gaps = 33/212 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV +++G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPGQPLGPRSRGELVAGFLQGEFGVERYLDHVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ +++GF++++ + + H P + + VYG N+
Sbjct: 104 SRSA--LYSGFNLLVGD--GRQLGYLHGREATP--------------RLLQAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ +E ++ ++ Q L+ + D +S P+ + E
Sbjct: 146 GLDT-PWPKLVKAREGLEGLLG----SADPQRLLALLADAEPAAESELPETGVGLVTEKL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 201 LS----------SVFIASQNYGTRASTVLIVD 222
>gi|328958746|ref|YP_004376132.1| hypothetical protein CAR_c24610 [Carnobacterium sp. 17-4]
gi|328675070|gb|AEB31116.1| protein of unknown function DUF833 [Carnobacterium sp. 17-4]
Length = 241
Score = 50.1 bits (118), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 53/224 (23%), Positives = 107/224 (47%), Gaps = 37/224 (16%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGEN-SRP-NGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D ++ GTWL + G+ L N+ S P N RG +V Y+ +SA +L+DL+
Sbjct: 31 RDLLQMGTWLGITKEGRFAALTNIYDSAISVPENPVSRGQIVRDYLSTTQSASGFLSDLQ 90
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + + GF+I+L ++ +++HF+N + +V+++ K ++G N+
Sbjct: 91 DKRLD--YAGFNILLGDID----HLWHFNN------HTNQVSSL------KTGIHGLSNA 132
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKS---ESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
+ P+ KV+ K + + N KS + +L+ +D ++ P + A
Sbjct: 133 SL-NDPWPKVL----KVKSHLQQLNSKSTLFDPNDLLTAFMD------TSLPS---RKDA 178
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
K+ + I + YGT + +++LVD++N++ F E
Sbjct: 179 SLTTTTSLKLEKETPPIFIKTPEYGTVSTTVLLVDYDNIVTFIE 222
>gi|345856390|ref|ZP_08808876.1| hypothetical protein DOT_0224 [Desulfosporosinus sp. OT]
gi|344330514|gb|EGW41806.1| hypothetical protein DOT_0224 [Desulfosporosinus sp. OT]
Length = 255
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 104/219 (47%), Gaps = 41/219 (18%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ ++ G+L L N +S+ + + RG LV ++ +S +EY+ L+
Sbjct: 52 GTWMGITSTGRLAALTNYRDPSSQILSAKSRGFLVSNFLCNTESPKEYM--LEVVNNRTF 109
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF++++ +L ++ +F+ + S S +V K +YG N + P+
Sbjct: 110 YNGFNLLVGDLK----SLVYFNKL----SASAEVL--------KPGIYGLSN-HFLNTPW 152
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM 208
KV KE A + K + +L + + D+ + + + P+ + ++ E
Sbjct: 153 PKVQKSKEALANYLAK-QPFLKPHSLFEILADIEQAQDRDLPETGLSQERE--------- 202
Query: 209 RYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHE 242
R+ +SSIF YGTR+ ++IL+D +N + F E
Sbjct: 203 ---RL---LSSIFIPGTDYGTRSSTVILIDRHNHVIFTE 235
>gi|33602043|ref|NP_889603.1| hypothetical protein BB3067 [Bordetella bronchiseptica RB50]
gi|33576481|emb|CAE33559.1| conserved hypothetical protein [Bordetella bronchiseptica RB50]
Length = 293
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL S G+ ++ N N +P+ RG LV ++ G SA+EYL D+
Sbjct: 70 AGRDLQAGGTWLGVSDQGRYAVVTNFRDPRNIQPDAPSRGALVEAFLRGAASAQEYLRDV 129
Query: 81 KKETEENVFNGFHIVL 96
+NGF++++
Sbjct: 130 HARGAR--YNGFNLIV 143
>gi|406910657|gb|EKD50622.1| hypothetical protein ACD_62C00479G0003 [uncultured bacterium]
Length = 256
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/208 (22%), Positives = 93/208 (44%), Gaps = 22/208 (10%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
QD GG WL + G+L L N L ++ + RG LV Y+ G ++A YL D
Sbjct: 44 AGQDERHGGAWLGVNKYGRLAALTNYRKLPQDEDRRYKSRGQLVKNYLTGHQNAFAYLFD 103
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ ++ +E + F+++L E ++Y+F S +++ +T+ +YG
Sbjct: 104 VSQQYKE--YKPFNLLLGE----EDSLYYFC------SEQREISEITSPG-----IYGLS 146
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N P+ KV GK + + V F + + ++L + + + ++ P+ +D + E
Sbjct: 147 N-HLLDTPWPKVEQGKRELEKAV--FRDRFDLEDLFAILANQDEVEEEQLPETGLDWRWE 203
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTH 227
+ + + + V + I +G H
Sbjct: 204 RALAKVFVSHENYATVSSTVILWGWDNH 231
>gi|389816915|ref|ZP_10207809.1| hypothetical protein A1A1_07112 [Planococcus antarcticus DSM 14505]
gi|388464822|gb|EIM07147.1| hypothetical protein A1A1_07112 [Planococcus antarcticus DSM 14505]
Length = 210
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/224 (22%), Positives = 94/224 (41%), Gaps = 37/224 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D GTWL + G++ L N + +G+ RG +V ++ G S EYL +
Sbjct: 2 AGRDLTGMGTWLGVTKQGRIAALTNFRDPTNLESGQLSRGAVVKNFLTGTNSPVEYLQTI 61
Query: 81 KKETEENVFNGFHIVL--LELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+ + GF++++ E + NI S P P T +G
Sbjct: 62 DPKQ----YAGFNLIVGNAERLIYYNNIQEESYAIP---PGT---------------HGL 99
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N + P+ KV GKEK A +++ + K++ + + + D ++ PD +
Sbjct: 100 SN-HFLNTPWPKVTKGKEKLASYMSQID-KADLEKIFAILADDGHAPDTHLPDTGVGLDL 157
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I ++ YGTR+ +++LV H+ + F E
Sbjct: 158 ERMLSPMF----------IKTLDYGTRSATVVLVTHDGTLTFAE 191
>gi|421151405|ref|ZP_15611019.1| hypothetical protein PABE171_0361 [Pseudomonas aeruginosa ATCC
14886]
gi|404527309|gb|EKA37473.1| hypothetical protein PABE171_0361 [Pseudomonas aeruginosa ATCC
14886]
Length = 251
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 86/221 (38%), Gaps = 33/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + ++H++ V + VYG N+
Sbjct: 104 ERAAD--YSGFNLLIGD----RHQLWHYN------------PRVGPPRLLPAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K + + + Q L++ + D S PD + E
Sbjct: 146 ALDT-PWPKLL----KARAALAERLAEPHPQALLELLTDAAPAADSQLPDTGVGLATERL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ V I+S YGTR S++ V + + E
Sbjct: 201 LS----------SVFIASPSYGTRASSVVRVHADGTREMIE 231
>gi|386019102|ref|YP_005937126.1| hypothetical protein PSTAA_0464 [Pseudomonas stutzeri DSM 4166]
gi|327479074|gb|AEA82384.1| conserved hypothetical protein [Pseudomonas stutzeri DSM 4166]
Length = 251
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/220 (19%), Positives = 89/220 (40%), Gaps = 33/220 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ + G+ L N+ + R RG L +Y+ G+ +YL +L +
Sbjct: 45 GRDLQAGGTWMGVTARGRFAALTNIRAPDQPIGARSRGELPERYLRGELPPADYLAELAE 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + G+++ L ++H + ++ +E + VYG N+
Sbjct: 105 RRGD--YAGYNL----LVGDRQTLWHLN------------SHDGEPKELQPGVYGLSNAG 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K+ + I ++ E + L+ + D + P + +
Sbjct: 147 LDT-PWPKLRRARAALTAI----RQQPEPERLLALLADAEPAAEHELP--------QTGV 193
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+++ R S + I+S YGTR + ++ + +D E
Sbjct: 194 PLEWERRLSSIF--IASPEYGTRASTALIRHADGSLDIIE 231
>gi|145589152|ref|YP_001155749.1| hypothetical protein Pnuc_0969 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145047558|gb|ABP34185.1| protein of unknown function DUF833 [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
Length = 285
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/206 (21%), Positives = 91/206 (44%), Gaps = 17/206 (8%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYL-TDLKKETEEN 87
GTWL + GK L NV + + P+ R RG L + Y+ G +S ++ T+ K+ ++
Sbjct: 56 GTWLGFTKTGKFSALTNVRAPSEKNPDARTRGELSLLYLTGNQSPSNFIETNTKRFSQ-- 113
Query: 88 VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQP 147
+NGF++++ +L+ H+ + + + + + Q VYG N+ + P
Sbjct: 114 -YNGFNLLMADLSNPQNAEMHWVSNRMMMGQNVRPRKIFPPQPLSPGVYGLSNAMLDT-P 171
Query: 148 FQKVIFGKEKFAE-IVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
+ KV FA+ + + + ++ + D S P+ + ++ E + +
Sbjct: 172 WPKVNHRISAFAQALAMDSGQLKNTDQYLKLLADTHHASDSELPNTGVSKEWEKALSAAF 231
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILV 232
I + YGTR+ +++ V
Sbjct: 232 ----------IKTPSYGTRSSTVLRV 247
>gi|449296847|gb|EMC92866.1| hypothetical protein BAUCODRAFT_37782 [Baudoinia compniacensis UAMH
10762]
Length = 322
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 63/252 (25%), Positives = 98/252 (38%), Gaps = 33/252 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEG-QKSAEEYLTDLKKETEEN 87
GTWLA +T G++ L N E G+ RG L + YV + AE +K+ +
Sbjct: 54 GTWLAITTQGRIANLTNFREEGIEVSKGKSRGLLPLGYVTAVPEVAESDEAFIKQSLDLG 113
Query: 88 V--FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTS 145
V GF +V +L + ++ L + + V VT +G NS
Sbjct: 114 VDDMGGFSLVFGQLRAPREDGSR-PGLSILSNRTASVDGVTRICTKAGETHGLSNSHFGD 172
Query: 146 QPFQKVIFGKEKFAEIVNK-FNRKSESQNLIQNI-----LDLMKNKQSNYPDPEIDRKAE 199
+ + KV+ G++ E + +R + Q I ++ +D Q P R+
Sbjct: 173 RSWPKVVQGEQFLKEAIQANIDRGEDQQKFIDDLFHILSVDTFPQVQPGEPFALYARQMR 232
Query: 200 ------------ADMDEDYKMRYSRV--------CVDISSIFYGTRTHSIILVDHNNVMD 239
AD ++ +R VD S YGTR S++LVD V+
Sbjct: 233 NTIFVPPVRGKFADNQPADRLAAARFDALDPPNNAVDHGS--YGTRQQSVVLVDGEGVVT 290
Query: 240 FHEWTLDYETKK 251
F E TL E K
Sbjct: 291 FDERTLYREDGK 302
>gi|395446023|ref|YP_006386276.1| hypothetical protein YSA_04417 [Pseudomonas putida ND6]
gi|421523474|ref|ZP_15970103.1| hypothetical protein PPUTLS46_16593 [Pseudomonas putida LS46]
gi|388560020|gb|AFK69161.1| hypothetical protein YSA_04417 [Pseudomonas putida ND6]
gi|402752460|gb|EJX12965.1| hypothetical protein PPUTLS46_16593 [Pseudomonas putida LS46]
Length = 248
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/212 (20%), Positives = 83/212 (39%), Gaps = 33/212 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L N+ R RG LV +++G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPQGRFAALTNIRDPGQAQGPRSRGELVAAFLQGELGVEAYLDQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + Q H P + K VYG N+
Sbjct: 104 SRSGQ--YSGFNLLVGD--GQQLGYLHAREATP--------------RLLKAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ ++ +++ E L+ + D + P+ + E
Sbjct: 146 GLDT-PWPKLVKARDGLENLLD----TPEPHRLLALLADNEPAADGDLPETGVGLATEKL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 201 LS----------SVFIASQNYGTRASTVLIVD 222
>gi|452125384|ref|ZP_21937968.1| Ser/Thr-rich protein T10 [Bordetella holmesii F627]
gi|452128792|ref|ZP_21941369.1| Ser/Thr-rich protein T10 [Bordetella holmesii H558]
gi|451924614|gb|EMD74755.1| Ser/Thr-rich protein T10 [Bordetella holmesii F627]
gi|451925839|gb|EMD75977.1| Ser/Thr-rich protein T10 [Bordetella holmesii H558]
Length = 297
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/228 (21%), Positives = 91/228 (39%), Gaps = 33/228 (14%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAE 74
E L +D GGTW+ + NG+ ++ N +G + RG LV +Y+ G +
Sbjct: 39 EPRLYAGRDLRAGGTWMGAADNGRYAVITNYREVGHQVA-DAPSRGALVERYLAGSIAPH 97
Query: 75 EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH 134
+YL D+ ++ + +NGF++++ + +++SN T +
Sbjct: 98 DYLMDVARQA--SAYNGFNLIVGD----HQGAWYYSN------------RDTAPRYLSTG 139
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
VY N P+ K++ K A+++ + + L + D + P +
Sbjct: 140 VYALSN-HLLDTPWPKLVRLKTAVAQVL-AHSPQPNLPALFAALDDREPADDAALPRTGL 197
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I S YGTR +++L N V + HE
Sbjct: 198 PLARERLLSSPF----------IVSPDYGTRCSTVLLWRDNGVGELHE 235
>gi|389872282|ref|YP_006379701.1| Ser/Thr-rich protein T10 [Advenella kashmirensis WT001]
gi|388537531|gb|AFK62719.1| Ser/Thr-rich protein T10 [Advenella kashmirensis WT001]
Length = 261
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 95/226 (42%), Gaps = 40/226 (17%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
D GGTWLA + NG+ +L N + P RG LV +V+G +A +Y T + K
Sbjct: 47 DLSGGGTWLAINRNGRFAMLTNFRDPSGFIPQAPTRGLLVSNFVDGIMTAGDYATQVWKT 106
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
++ +NGF++++ ++ +++ N D+P K+T+ + N
Sbjct: 107 GDQ--YNGFNLIVGDV----NEVFYTGNRQ--DAPPQKLTHGSYILSN----------HL 148
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
P+ K E+ ++ + +Q + L+K+ P P+ + D
Sbjct: 149 LDTPWPKA----ERLRRGLDALTPDC-CPDALQQVFALLKDTT---PAPD-----DTLPD 195
Query: 204 EDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWT 244
+ R+ +SS F YGTR SII VD F E T
Sbjct: 196 TGIPLERERL---LSSPFIISENYGTRCSSIIAVDRAGEATFSELT 238
>gi|392424393|ref|YP_006465387.1| hypothetical protein Desaci_1014 [Desulfosporosinus acidiphilus
SJ4]
gi|391354356|gb|AFM40055.1| hypothetical protein Desaci_1014 [Desulfosporosinus acidiphilus
SJ4]
Length = 256
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/237 (22%), Positives = 99/237 (41%), Gaps = 40/237 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GTW+ + +G+ L N +S+ + RG LV Y+ ++S EY+ +
Sbjct: 44 AGRDLEMMGTWMGITRSGRFAALTNFRDPSSQIREPKSRGFLVRDYLSSKESPAEYIEKV 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K ++++N F++++ L+ + +FSN A QE K +YG N
Sbjct: 104 KHN--QSLYNPFNLLVGNLSC----LMYFSNQA------------REFQELKPGLYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV K+ + + Q L + + D + + P+ I + E
Sbjct: 146 -HFLDTPWPKVRKSKQALVNYLELTQEEIVPQKLFEILADTERAQDHELPNTGISLERE- 203
Query: 201 DMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTLDYETKKW 252
+M +S IF YGTR+ +++ +D N + F E + + K W
Sbjct: 204 ------RM--------LSPIFIEGNDYGTRSSTVLCLDRNYNVLFQERSFRGDNKSW 246
>gi|374579387|ref|ZP_09652481.1| hypothetical protein DesyoDRAFT_0728 [Desulfosporosinus youngiae
DSM 17734]
gi|374415469|gb|EHQ87904.1| hypothetical protein DesyoDRAFT_0728 [Desulfosporosinus youngiae
DSM 17734]
Length = 255
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 50/223 (22%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + +G+L L NV ++ N + RG LV ++ Q S EYL D+ +
Sbjct: 52 GTWMGITRSGRLAALTNVRDPAAQITNAQSRGMLVKNFLCSQDSPVEYLQDVA--LNDAF 109
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+N F++++ + +S +Y+ ++ E +YG N + P+
Sbjct: 110 YNPFNLLVGD---KSRLMYY--------------RQSSSIMELSPGIYGLSNHSLDT-PW 151
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM 208
KV+ K+ A + N + ++ NIL +DR+ +DY++
Sbjct: 152 PKVLKSKQALANYIE--NELAIDPQVLFNIL--------------VDREPA----QDYEL 191
Query: 209 RYSRVCVD----ISSIF-----YGTRTHSIILVDHNNVMDFHE 242
+ + + +SSIF YGTR+ +++L+D +N + F E
Sbjct: 192 PETGISYEFEKLLSSIFIQGSDYGTRSSTVLLIDRHNHVTFKE 234
>gi|392422907|ref|YP_006459511.1| hypothetical protein A458_19325 [Pseudomonas stutzeri CCUG 29243]
gi|390985095|gb|AFM35088.1| hypothetical protein A458_19325 [Pseudomonas stutzeri CCUG 29243]
Length = 251
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/221 (21%), Positives = 94/221 (42%), Gaps = 35/221 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ +T G+ L N+ R RG L +Y+ G +YL +L
Sbjct: 45 GRDLQAGGTWMGVTTGGRFAALTNIRAPGQVIGSRSRGELPEQYLRGNLPPADYLAELTG 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++ L ++H L+S S + T + + +YG N+
Sbjct: 105 RCAD--YAGFNL----LVGDRQALWH------LNSHSGEATAL------QPGIYGLSNAG 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK-AEAD 201
+ P+ K+ ++ A + + + +N+L L+ + P P + + +
Sbjct: 147 LDT-PWPKL---RKARAALATNLDPATP-----ENLLALLAD-----PSPAAEHELPQTG 192
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ +++ R S + I+S YGTR + ++ + +D E
Sbjct: 193 VPLEWERRLSSIF--IASPEYGTRASTALIRHADGALDILE 231
>gi|409421834|ref|ZP_11258957.1| hypothetical protein PsHYS_07302 [Pseudomonas sp. HYS]
Length = 248
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/215 (20%), Positives = 87/215 (40%), Gaps = 33/215 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + R RG LV ++ G+ E YL +
Sbjct: 44 AGRDLEAGGTWLGVGPNGRFAALTNIRDPHQPQGARSRGELVAAFLRGELGVEAYLDQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+++ ++GF++ L + + + + P Q + VYG N+
Sbjct: 104 SRSQQ--YSGFNL----LVADNAQLGYLNARQP------------TPQVLGEGVYGVSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ + + + + + L++ + D + S P+ + E
Sbjct: 146 GLDT-PWPKLLKARAGLQQHLA----DPQPERLLELLGDNLPAADSELPETGVGLGTERL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ V I+S YGTR ++++V+ +
Sbjct: 201 LSS----------VFIASQNYGTRASTVLIVEADG 225
>gi|148550251|ref|YP_001270353.1| hypothetical protein Pput_5051 [Pseudomonas putida F1]
gi|148514309|gb|ABQ81169.1| protein of unknown function DUF833 [Pseudomonas putida F1]
Length = 248
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/214 (20%), Positives = 84/214 (39%), Gaps = 33/214 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ R RG LV +++G+ E YL
Sbjct: 42 VHAGRDLEAGGTWLGVGPQGRFAALTNIRDPGQAQGPRSRGELVAAFLQGELGVEAYLDQ 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + + ++GF++++ + Q H P + K VYG
Sbjct: 102 VTSRSGQ--YSGFNLLVGD--GQQLGYLHAREATP--------------RLLKAGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ ++ +++ E L+ + D + P+ + E
Sbjct: 144 NAGLDT-PWPKLVKARDGLENLLD----TPEPHRLLALLADNEPAADGDLPETGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 199 KLLS----------SVFIASQNYGTRASTVLIVD 222
>gi|410461328|ref|ZP_11314979.1| hypothetical protein BAZO_18698 [Bacillus azotoformans LMG 9581]
gi|409925834|gb|EKN63034.1| hypothetical protein BAZO_18698 [Bacillus azotoformans LMG 9581]
Length = 252
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/227 (20%), Positives = 104/227 (45%), Gaps = 49/227 (21%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D K GTW+ + NG+ + N +G+ RG L++ Y++ K E Y+ +
Sbjct: 44 AGRDLEKMGTWMGVTKNGRFAAVTNYRDPKEVTDGKLSRGELILNYLKSSKMPENYMKVM 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
++ +E ++ G+++ L +Y+FSNI S K+ + ++G N
Sbjct: 104 EENSE--LYPGYNL----LVGSPDELYYFSNIT---KKSMKI---------EPGIHGVSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
++ + KV+ GKE I+ + + +I+++ +++N
Sbjct: 146 HLLNTE-WPKVVHGKESLFNILTQ-----KDDAMIESLFKILQNN--------------- 184
Query: 201 DMDEDYKMRYSRVCVD----ISSIF-----YGTRTHSIILVDHNNVM 238
D+ D ++ + V ++ +SSIF YGTR+ +++ ++ + ++
Sbjct: 185 DVAPDDRLPKTGVTIELERLLSSIFIKSEGYGTRSSTVLFMNDDEII 231
>gi|253699377|ref|YP_003020566.1| hypothetical protein GM21_0734 [Geobacter sp. M21]
gi|251774227|gb|ACT16808.1| protein of unknown function DUF833 [Geobacter sp. M21]
Length = 258
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 92/221 (41%), Gaps = 32/221 (14%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTWL +T+G++ L N + +G RG LV +++G EEYL L+
Sbjct: 45 GRDLLHGGTWLGITTSGRIAALTNYRDPADLVRHGLSRGRLVSGFLKGAGDIEEYLECLR 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ F G++ LL T + + Y S T T + ++G N
Sbjct: 105 EAA--GPFGGYN--LLVGTPKRLSCY---------SSKTDRTVILEP-----GIHGLSN- 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ GK+ + + F + + L + D PD + + E
Sbjct: 146 RLLDTPWPKVLRGKKALEQAL--FAADPDLEELFAILSDRSCPPDDQLPDTGVGLELERL 203
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I S YGTR+ +++LVD + F E
Sbjct: 204 LSPIF----------IESERYGTRSCTVLLVDRDGEARFVE 234
>gi|451997044|gb|EMD89510.1| hypothetical protein COCHEDRAFT_1022805 [Cochliobolus
heterostrophus C5]
Length = 318
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/245 (21%), Positives = 102/245 (41%), Gaps = 33/245 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR--PNGRDRGPLVVKYVEGQKSAEEYLTDL-KKETEE 86
GTW+ + G++ +L N E+ + R RG + +++ + EE + K+ E+
Sbjct: 54 GTWMGVTRQGRIAVLTNYREEDEKIVEGARSRGLIPNAWLKSDPAKEETTDEFAKRMIEQ 113
Query: 87 NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
+ G L + +A + + + V V + D + N+ + +
Sbjct: 114 DGVEGVGGFSLCYGFMQDVVKKGKGLAIVSNRTPDVHGVVRLLQKPDETHALSNAAYSDR 173
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMK------NKQSNYPD-------- 191
+ KVI G+E + + + +ES++ LIQ +L L++ K++ D
Sbjct: 174 SWPKVIHGEEWTKKAITESVAANESRDQLIQRLLALLRTDTMPRQKENEEWDMYLNQLRH 233
Query: 192 ----PEIDRKAEADMDEDYKMRYSRV-------CVDISSIFYGTRTHSIILVDHNNVMDF 240
P I R D E++KM + D +S YGT+ + ++L DH + +
Sbjct: 234 SIFIPAIGR----DHLEEHKMPAHEIGDVVKNKAADATSGCYGTQKNIVVLCDHQGKVTY 289
Query: 241 HEWTL 245
E TL
Sbjct: 290 LERTL 294
>gi|427814331|ref|ZP_18981395.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
gi|410565331|emb|CCN22886.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
Length = 293
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ S G+ ++ N N +P+ RG LV ++ G SA+EYL D+
Sbjct: 70 AGRDLQAGGTWMGVSDQGRYAVVTNFRDPRNIQPDAPSRGALVEAFLRGAASAQEYLRDV 129
Query: 81 KKETEENVFNGFHIVL 96
+NGF++++
Sbjct: 130 HARGAR--YNGFNLIV 143
>gi|33597640|ref|NP_885283.1| hypothetical protein BPP3104 [Bordetella parapertussis 12822]
gi|410473367|ref|YP_006896648.1| hypothetical protein BN117_2802 [Bordetella parapertussis Bpp5]
gi|412338195|ref|YP_006966950.1| hypothetical protein BN112_0869 [Bordetella bronchiseptica 253]
gi|33574068|emb|CAE38391.1| conserved hypothetical protein [Bordetella parapertussis]
gi|408443477|emb|CCJ50135.1| conserved hypothetical protein [Bordetella parapertussis Bpp5]
gi|408768029|emb|CCJ52787.1| conserved hypothetical protein [Bordetella bronchiseptica 253]
Length = 293
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ S G+ ++ N N +P+ RG LV ++ G SA+EYL D+
Sbjct: 70 AGRDLQAGGTWMGVSDQGRYAVVTNFRDPRNIQPDAPSRGALVEAFLRGAASAQEYLRDV 129
Query: 81 KKETEENVFNGFHIVL 96
+NGF++++
Sbjct: 130 HARGAR--YNGFNLIV 143
>gi|33593621|ref|NP_881265.1| hypothetical protein BP2651 [Bordetella pertussis Tohama I]
gi|384204912|ref|YP_005590651.1| hypothetical protein BPTD_2609 [Bordetella pertussis CS]
gi|408416291|ref|YP_006626998.1| hypothetical protein BN118_2463 [Bordetella pertussis 18323]
gi|33563694|emb|CAE42927.1| conserved hypothetical protein [Bordetella pertussis Tohama I]
gi|332383026|gb|AEE67873.1| hypothetical protein BPTD_2609 [Bordetella pertussis CS]
gi|401778461|emb|CCJ63888.1| conserved hypothetical protein [Bordetella pertussis 18323]
Length = 267
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ S +G+ ++ N N +P+ RG LV ++ G SA+EYL D+
Sbjct: 44 AGRDLQAGGTWMGVSDHGRYAVVTNFRDPRNIQPDAPSRGALVEAFLRGAASAQEYLHDV 103
Query: 81 KKETEENVFNGFHIVL 96
+NGF++++
Sbjct: 104 HARGAR--YNGFNLIV 117
>gi|388858093|emb|CCF48330.1| uncharacterized protein [Ustilago hordei]
Length = 326
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 100/227 (44%), Gaps = 23/227 (10%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRD----RGPLVVKYVEGQKS--AEE 75
++DA GGTWL + NG L N ++ P G D RG LV ++ Q S A+
Sbjct: 58 ARDATGGGTWLGITRNGAFATLTNFTELSAPLPEGMDLFESRGKLVRDWLSSQSSLKAKR 117
Query: 76 YLTDLKKETEENV---------FNGFHIVLLELTLQSTNIYHFSNI-APLDSPSTKVTNV 125
++K E E + F GF++++ ++ + T + + +N A + ++
Sbjct: 118 SAEEVKGEVERYLKGLDGKLDRFPGFNLLVGSISTEGTLVGYITNRNADASLTRNREPDI 177
Query: 126 TNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNK 185
+ V G ++ SQP+ KV G F +N+ + SE Q +IQ + +L+
Sbjct: 178 FLPTNPEGKVPGGMSNSVLSQPWSKVSQGSSTFLSTLNR-SHSSEDQ-IIQELYNLLWT- 234
Query: 186 QSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
S+ P P +++E + +Y TRT ++IL+
Sbjct: 235 -SSTPAPA--QRSELRNSVLISPLQLPTTAEGGRDWYATRTSTVILI 278
>gi|398882520|ref|ZP_10637487.1| hypothetical protein PMI32_01174 [Pseudomonas sp. GM60]
gi|398198672|gb|EJM85626.1| hypothetical protein PMI32_01174 [Pseudomonas sp. GM60]
Length = 248
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/220 (20%), Positives = 87/220 (39%), Gaps = 43/220 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL NG+ L N+ + + RG LV +++ G ++YL D+
Sbjct: 44 AGRDLEAGGTWLGVGANGRFAALTNIRDPHQPAARKSRGELVARFLSGDMPIDDYLDDVV 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK-----DHVY 136
+ E + GF++ L + ++HF N +E++ VY
Sbjct: 104 GRSLE--YAGFNL----LIGNANELWHF-----------------NARESEAVLLPAGVY 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K++ + ++ + + L+ + D + PD +
Sbjct: 141 GLSNAGLDT-PWPKLLKARAALEAVLE----DPQPEALLALLSDSQTAPFAELPDTGVGL 195
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E+ + V I+S YGTR + ++V +
Sbjct: 196 ATESLLS----------SVFIASQSYGTRASTALIVQADG 225
>gi|410620880|ref|ZP_11331737.1| hypothetical protein GPAL_0230 [Glaciecola pallidula DSM 14239 =
ACAM 615]
gi|410159578|dbj|GAC27111.1| hypothetical protein GPAL_0230 [Glaciecola pallidula DSM 14239 =
ACAM 615]
Length = 258
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 108/241 (44%), Gaps = 50/241 (20%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEG----QKSAEEY 76
+D GGTW+ + NG + L N+ N N RG LV+ ++ G QK +
Sbjct: 44 AGKDNEAGGTWMGINKNGSVSALTNIRALNKDVENAITRGELVLNFLLGSSKQQKKYAQT 103
Query: 77 LTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNI----APLDSPSTKVTNVTNKQENK 132
LTD + +NG++++ L +Y ++N LD
Sbjct: 104 LTDSRAR-----YNGYNLLFGTL----AQLYVYNNFEDTCVALD---------------- 138
Query: 133 DHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDP 192
D VYG N+ S P+ K+ G++ A K+ + ++ + +++ +L+ N D
Sbjct: 139 DGVYGLSNASLNS-PWPKISTGRDALA----KYCQHADVLD-TEHLFELLHN------DK 186
Query: 193 EIDRKA--EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
D A + + +++ R S + I S YGTR+ +++LVDH+ + + E + + + +
Sbjct: 187 LADDSALPKTGVPIEWERRLSSIF--IQSPEYGTRSSTVLLVDHDQHVVWEERSFNAQAE 244
Query: 251 K 251
+
Sbjct: 245 Q 245
>gi|452988901|gb|EME88656.1| hypothetical protein MYCFIDRAFT_35667 [Pseudocercospora fijiensis
CIRAD86]
Length = 317
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/240 (21%), Positives = 102/240 (42%), Gaps = 27/240 (11%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPN-GRDRGPLVVKYV----EGQKSAEEYLTDLKKET 84
GTWL + G++ +L N E N + RG + Y+ E Q++ EEY L ET
Sbjct: 54 GTWLGITKQGRIAVLTNFREEGVEVNKNKSRGAITNSYLCIPPESQETEEEYARRLLNET 113
Query: 85 EENVFNGFHIVLLELTLQSTNIYH-------FSNIAPLDSPSTKVTNVTNKQENKDHVYG 137
+ GF ++ +L +S I+ ++ L + ++ T++ + +G
Sbjct: 114 GIHDVGGFTLIFGKL--RSPKIHQNTPTTSTSGGLSILSNKTSSPTSLPRIATTRHETHG 171
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQ-NLIQNILDLMK----NKQSNY--- 189
NS + + K++ G+ ++ + + +SQ ++++ ++ K S++
Sbjct: 172 LSNSHYGDKTWPKIVLGETLLSQTIQNSTKTQQSQPEFLESLFQILSLDTLPKTSDWNAY 231
Query: 190 ----PDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
D + A+ ++ E V ++ YGT ++ILVD + F E TL
Sbjct: 232 VRHMRDSILISPAKGELAERMAGYVPATSVSETAA-YGTSKQTVILVDAKGKVTFVERTL 290
>gi|427821723|ref|ZP_18988785.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
gi|410586988|emb|CCN02018.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
Length = 293
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ S G+ ++ N N +P+ RG LV ++ G SA+EYL D+
Sbjct: 70 AGRDLQAGGTWMGVSDQGRYAVVTNFRDPRNIQPDAPSRGALVEAFLRGAASAQEYLHDV 129
Query: 81 KKETEENVFNGFHIVL 96
+NGF++++
Sbjct: 130 HARGAR--YNGFNLIV 143
>gi|410419875|ref|YP_006900324.1| hypothetical protein BN115_2086 [Bordetella bronchiseptica MO149]
gi|427821204|ref|ZP_18988267.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
gi|408447170|emb|CCJ58842.1| conserved hypothetical protein [Bordetella bronchiseptica MO149]
gi|410572204|emb|CCN20471.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
Length = 293
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ S G+ ++ N N +P+ RG LV ++ G SA+EYL D+
Sbjct: 70 AGRDLQAGGTWMGVSDQGRYAVVTNFRDPRNIQPDAPSRGALVEAFLRGAASAQEYLHDV 129
Query: 81 KKETEENVFNGFHIVL 96
+NGF++++
Sbjct: 130 HARGAR--YNGFNLIV 143
>gi|146280806|ref|YP_001170959.1| hypothetical protein PST_0411 [Pseudomonas stutzeri A1501]
gi|145569011|gb|ABP78117.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
Length = 251
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/220 (19%), Positives = 87/220 (39%), Gaps = 33/220 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ + G+ L N+ R RG L +Y+ G+ +YL L +
Sbjct: 45 GRDLQAGGTWMGVTARGRFAALTNIRAPGQPIGARSRGELPERYLRGELPPADYLAALAE 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + G+++ L ++H + ++ +E + VYG N+
Sbjct: 105 RRGD--YAGYNL----LVGDRQTLWHLN------------SHDGEPKELQPGVYGLSNAG 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K+ + I+ + E + L+ + D + P + +
Sbjct: 147 LDT-PWPKLRRARAALTSILQ----QPEPERLLALLADAEPAAEHELP--------QTGV 193
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+++ R S + I+S YGTR + ++ + +D E
Sbjct: 194 PLEWERRLSSIF--IASPEYGTRASTALIRHADGSLDIIE 231
>gi|443720872|gb|ELU10424.1| hypothetical protein CAPTEDRAFT_161211 [Capitella teleta]
Length = 252
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 88/199 (44%), Gaps = 21/199 (10%)
Query: 47 VLGENSRPN---GRDRGPLVVKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQS 103
+ G +++P G G LV ++ +A EY + + E +N F ++L+E +
Sbjct: 43 ISGTDAQPGSEGGTWLGYLVTDFLTSDHTAAEYYSKINLED----YNPFRLILVEKHFER 98
Query: 104 TNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVN 163
++ D S ++N N F NS ++P++K +FG+ FA+I+
Sbjct: 99 WE----ASCMSTDGHSLDLSNGINL---------FTNSSDVTRPWKKHVFGQHLFAKILA 145
Query: 164 KFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYG 223
+ +S+ L + L+ + + PD + + +D + R S D + YG
Sbjct: 146 EHKGIEQSEELANRLQGLLSDDTTYLPDRNLQDQGYFLSPDDIQTRSSIFMRDPAGK-YG 204
Query: 224 TRTHSIILVDHNNVMDFHE 242
TR ++IL++ + F E
Sbjct: 205 TRASTVILINQLDEAIFWE 223
>gi|385330324|ref|YP_005884275.1| hypothetical protein HP15_583 [Marinobacter adhaerens HP15]
gi|311693474|gb|ADP96347.1| protein containing DUF833 [Marinobacter adhaerens HP15]
Length = 257
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 101/244 (41%), Gaps = 38/244 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYV-EGQKSAEEYLTD 79
+D + GGTWLA +T G++ + NV E + GR RG L ++ + + ++ E YL D
Sbjct: 46 AGRDLLSGGTWLAINTAGEVSAVTNVR-EGAPETGRISRGELPLRALTDSREHLERYLLD 104
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ F+GF++V L + + ++FSN + ++ VYG
Sbjct: 105 TADQ-----FSGFNLV----QLTTADGWYFSN-----------RDAHPGRQIHRGVYGLS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRK 197
N P+ K++ ++ + + R + + LI + D PD + +
Sbjct: 145 N-HLLQTPWPKLLRLRQAAGDTIAAAGRDAATLHNELIPLLQDSTPAPDHMLPDTGVGLE 203
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE--WTLDYETKKWIHT 255
E + + I YGTR +++ V + ++ E W D T + H
Sbjct: 204 TERFLSSPF----------IVGSDYGTRATTVVTVSDSGEIEVTEQNWGPDARTGERRHF 253
Query: 256 HIRK 259
H R+
Sbjct: 254 HWRR 257
>gi|416877993|ref|ZP_11920123.1| hypothetical protein PA15_18624 [Pseudomonas aeruginosa 152504]
gi|334838914|gb|EGM17616.1| hypothetical protein PA15_18624 [Pseudomonas aeruginosa 152504]
Length = 251
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 85/221 (38%), Gaps = 33/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGSFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + ++H++ V + VYG N+
Sbjct: 104 ERAAD--YSGFNLLIGD----RHQLWHYN------------PRVGPPRLLPAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K + + + Q L++ + D S PD + E
Sbjct: 146 ALDT-PWPKLL----KARAALAERLAEPHPQALLELLTDAAPAADSQLPDTGVGLATERL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ V I+S YGTR S++ V + + E
Sbjct: 201 LS----------SVFIASPSYGTRASSVVRVHADGTREMIE 231
>gi|429210683|ref|ZP_19201849.1| hypothetical protein PM1_00578 [Pseudomonas sp. M1]
gi|428158097|gb|EKX04644.1| hypothetical protein PM1_00578 [Pseudomonas sp. M1]
Length = 250
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/230 (21%), Positives = 99/230 (43%), Gaps = 39/230 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ G+ L NV + S+P G R RG LV ++ G+++ +YL ++
Sbjct: 44 AGRDLEAGGTWMGLGPGGRFAALTNVR-DPSQPLGPRSRGELVAGFLRGRQAPLDYLAEV 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFS--NIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++ E ++GF++ L + + +F+ N P + P+ ++G
Sbjct: 103 QRRAAE--YSGFNL----LVGDARQLCYFNPRNGEPRELPAG--------------LHGL 142
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ K++ + + ++ ++E L+ + D + P+ +
Sbjct: 143 SNASLDT-PWPKLLRARAALEKHLD----EAEPGALLGLLGDRERPDDERLPETGVGLAT 197
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
E + V I+S YGTR S++ V + +F E + E
Sbjct: 198 ERLLS----------SVFIASANYGTRASSVLRVHADGRREFIERSFGPE 237
>gi|340727964|ref|XP_003402303.1| PREDICTED: hypothetical protein LOC100647164 [Bombus terrestris]
Length = 86
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 17/77 (22%)
Query: 174 LIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIF-----YGTRTHS 228
LI+ +L +K+K+ PDPE+ ++ + YK ++SSIF Y TRTHS
Sbjct: 3 LIEELLKFLKSKERYLPDPELQKRC----SKGYK--------ELSSIFVSTDGYCTRTHS 50
Query: 229 IILVDHNNVMDFHEWTL 245
I+LV+ NN + F E TL
Sbjct: 51 ILLVNGNNELTFVEETL 67
>gi|73541124|ref|YP_295644.1| hypothetical protein Reut_A1430 [Ralstonia eutropha JMP134]
gi|72118537|gb|AAZ60800.1| Protein of unknown function DUF833 [Ralstonia eutropha JMP134]
Length = 275
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/238 (21%), Positives = 95/238 (39%), Gaps = 31/238 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ +G+ L N + R + R RG LV ++ G + +YL L +
Sbjct: 56 GTWMGVGGDGRFAALTNYRAPSEKRTDARSRGELVSGFLRGHDAPADYLGTLAG--ADGC 113
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF+++ +L ++ +SN + P Q + +YG N+ + P+
Sbjct: 114 YNGFNLLASDL----NELWWYSNRSASRQP----------QRLRPGLYGLSNALLDT-PW 158
Query: 149 QKVIFGKEKFAEIV--NKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDY 206
KV AE + + + + ++ ++ + D + P + + E + +
Sbjct: 159 PKVRSRVGALAEALAADTGHANASAEPYLKLLADNRQAADFELPSTGVAPEWEKLLSSAF 218
Query: 207 KMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIHTHIRKTLNMA 264
I S YGTR +++ V H+ D E + D + R LN+A
Sbjct: 219 ----------IRSGNYGTRASTVLRVRHDGRFDLSERSFDASGQT-GEVSYRGKLNLA 265
>gi|339492519|ref|YP_004712812.1| hypothetical protein PSTAB_0442 [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
gi|338799891|gb|AEJ03723.1| hypothetical protein PSTAB_0442 [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
Length = 251
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/220 (19%), Positives = 87/220 (39%), Gaps = 33/220 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ + G+ L N+ R RG L +Y+ G +YL +L +
Sbjct: 45 GRDLQAGGTWMGVTARGRFAALTNIRAPGQAIGPRSRGELPEQYLRGNLPPADYLAELAE 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++ L ++H + ++ +E + VYG N+
Sbjct: 105 RRGD--YAGFNL----LVGDRQTLWHLN------------SHDGEPKELQHGVYGLSNAG 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K+ + I+ + E + L+ + D + P + +
Sbjct: 147 LDT-PWPKLRRARAALTAILQ----QPEPERLLALLADAEPAAEHELP--------QTGV 193
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+++ R S + I+S YGTR + ++ + +D E
Sbjct: 194 PLEWERRLSSIF--IASPEYGTRASTALIRHADGSLDIIE 231
>gi|237748468|ref|ZP_04578948.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13]
gi|229379830|gb|EEO29921.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13]
Length = 249
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/230 (22%), Positives = 92/230 (40%), Gaps = 37/230 (16%)
Query: 22 VSQDAVKGGTWLATS----TNGKLGILLNVLGENSRP-----NGRDRGPLVVKYVEGQKS 72
+D GGTWL + G+ G L P + RG LV Y+ S
Sbjct: 44 AGRDLEGGGTWLGVADRPGQTGRKGSRFAALTNVRMPHLMKDDAPTRGLLVSDYLASSLS 103
Query: 73 AEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
EEY+ D+K E +NGF++ L + + +SN + N N Q +
Sbjct: 104 PEEYIRDIKHEAHH--YNGFNL----LVGDADTLIWYSNYG--------LMNPKNGQALQ 149
Query: 133 DHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDP 192
+YG N+ + P+ KV+ + +F ++ + ++ + + PD
Sbjct: 150 PGIYGLSNA-LLNDPWPKVVRTRAQFGCLLGIGAPEDAYFEMLSD--------TATVPDK 200
Query: 193 EIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ R + +++ S VC I S YGTR +++ + ++ HE
Sbjct: 201 LLPRTG---VPYEWEKLLSAVC--IQSPDYGTRASTLVELYNDAPATLHE 245
>gi|419952511|ref|ZP_14468658.1| hypothetical protein YO5_18752 [Pseudomonas stutzeri TS44]
gi|387970556|gb|EIK54834.1| hypothetical protein YO5_18752 [Pseudomonas stutzeri TS44]
Length = 251
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 92/220 (41%), Gaps = 33/220 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTWL +G+ L N+ R RG LV +Y+ + + +YL L
Sbjct: 45 GRDLQAGGTWLGIRPDGRFAALTNIRNGGQPAGRRSRGELVERYLRDELAPADYLAGLAA 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++ L + ++HF++ + + E VYG N+
Sbjct: 105 GIGD--YAGFNL----LAGTARELWHFNS----------QSGAPRRLEAG--VYGLCNAD 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K+ + A + + I+ +L L+ +++ PD E+ +
Sbjct: 147 LDT-PWPKLRRARSALAARLELAD--------IEALLQLLDDREPA-PDAEL---PSTGV 193
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+++ R S + I+ YGTR + +L + +D HE
Sbjct: 194 SLEWERRLSSIF--ITGAEYGTRASTALLRWQDGAVDIHE 231
>gi|345020199|ref|ZP_08783812.1| hypothetical protein OTW25_02560 [Ornithinibacillus scapharcae
TW25]
Length = 255
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 98/231 (42%), Gaps = 41/231 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D ++ GTWL ST+G+ L N + +G+ RG +V ++ G +++ EYL
Sbjct: 44 AGRDLLQMGTWLGISTSGRFAGLTNYRHPDHFKSGKLSRGEIVTNFLIGNETSYEYLHKQ 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K ++ V G++I L + ++SN+ K+ V + ++G N
Sbjct: 104 VKRKDDYV--GYNI----LVGSPNELKYYSNV------EDKIIKV------QPGIHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV GK+K + + + + L + D + PD I + E
Sbjct: 146 -HFLDTPWPKVEIGKKKLQSYLEQV-ETVDPEELFSILADSEEASVDELPDTGIGLELER 203
Query: 201 DMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWTLD 246
+ SS+F YGTR +++ +DH+N + F E T +
Sbjct: 204 KL---------------SSMFIKMPDYGTRCSTVLTIDHDNNVTFIERTFE 239
>gi|358451413|ref|ZP_09161847.1| hypothetical protein KYE_18928 [Marinobacter manganoxydans MnI7-9]
gi|357224646|gb|EHJ03177.1| hypothetical protein KYE_18928 [Marinobacter manganoxydans MnI7-9]
Length = 257
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 100/244 (40%), Gaps = 38/244 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYV-EGQKSAEEYLTD 79
+D + GGTWLA + +G++ + NV E + GR RG L ++ + + + E YL D
Sbjct: 46 AGRDLLSGGTWLAVNADGEVSAVTNVR-EGTPETGRISRGELPLRALTDSRDRLEGYLLD 104
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F+GF++V L + + ++FSN + + VYG
Sbjct: 105 -----EADQFSGFNLV----HLTTGDGWYFSN-----------RDAHPGRRIHRGVYGLS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRK 197
N P+ K++ ++ + V R +E LI + D PD + +
Sbjct: 145 N-HLLQTPWPKLLRLRQAAGDTVAAAGRHAEKLHNELIPLLQDSTPAPDHLLPDTGVGLE 203
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE--WTLDYETKKWIHT 255
E + + I YGTR +++ V + ++ E W D T + H
Sbjct: 204 TERFLSSPF----------IVGSDYGTRATTVVTVSASGEIEVTEQNWGPDARTGERRHF 253
Query: 256 HIRK 259
H R+
Sbjct: 254 HWRR 257
>gi|94310770|ref|YP_583980.1| hypothetical protein Rmet_1832 [Cupriavidus metallidurans CH34]
gi|93354622|gb|ABF08711.1| conserved hypothetical protein [Cupriavidus metallidurans CH34]
Length = 284
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 105/256 (41%), Gaps = 39/256 (15%)
Query: 19 LLEVSQDAVKGGTWLA---------TSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVE 68
L EV DA GTW+ + + L N + R + R RG LV ++
Sbjct: 48 LAEVIGDA---GTWMGIAGAQHGLDSPVGARFAALTNYRAPSEKRTDARSRGELVAHFLR 104
Query: 69 GQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNK 128
G ++ +YL DL + +NGF++ LT +++ +SN + P
Sbjct: 105 GDQTPADYLHDLAG--DHGAYNGFNL----LTSDLHDLWWYSNRSKSRVP---------- 148
Query: 129 QENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSN 188
Q +YG N+ + P+ KV E++ +R N+ + L ++Q+
Sbjct: 149 QRLTPGLYGLSNALLDT-PWPKVRSRVGAMCEVLAA-DRGQIGSNVESYLQLLADDRQA- 205
Query: 189 YPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
PD E+ A E K+ S I S YGTR +++ V H+ DF E + D +
Sbjct: 206 -PDWELPSTGVAPEWE--KLLSSAF---IRSPNYGTRASTVLRVMHDGRFDFVERSFDAD 259
Query: 249 TKKWIHTHIRKTLNMA 264
+ R LN+A
Sbjct: 260 GQT-GEVSYRGRLNLA 274
>gi|451847959|gb|EMD61266.1| hypothetical protein COCSADRAFT_124896 [Cochliobolus sativus
ND90Pr]
Length = 318
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 100/245 (40%), Gaps = 33/245 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR--PNGRDRGPLVVKYVEGQKSAEEYLTDL-KKETEE 86
GTW+ + G++ +L N E+ R RG + +++ + EE D K+ E+
Sbjct: 54 GTWMGVTRQGRIAVLTNYREEDETIVEGARSRGLIPNAWLKSDPAKEETTDDFAKRMIEQ 113
Query: 87 NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
+ G L + +A + + + V V D + N+ + +
Sbjct: 114 DGVEGVGGFSLCYGFMQDVVKEAKGLAIVSNRTPDVHGVVRLLHKSDETHALSNAAYSDR 173
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMK------NKQSNYPD-------- 191
+ KVI G+E + + + +ES++ LIQ +L L++ K++ D
Sbjct: 174 SWPKVIHGEEWTKKAIAESVAANESRDQLIQRLLTLLRTDTMPRQKENEEWDMYLNQLRH 233
Query: 192 ----PEIDRKAEADMDEDYKMRYSRV-------CVDISSIFYGTRTHSIILVDHNNVMDF 240
P I R D E++KM + VD +S YGT+ + I+L D + +
Sbjct: 234 SIFIPAIGR----DHLEEHKMPAHEIGDVVKNKAVDATSGCYGTQKNIIVLCDRQGKVTY 289
Query: 241 HEWTL 245
E TL
Sbjct: 290 LERTL 294
>gi|374993505|ref|YP_004969004.1| hypothetical protein Desor_0806 [Desulfosporosinus orientis DSM
765]
gi|357211871|gb|AET66489.1| hypothetical protein Desor_0806 [Desulfosporosinus orientis DSM
765]
Length = 255
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/217 (22%), Positives = 100/217 (46%), Gaps = 31/217 (14%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + +G+ L N +++ N + RG LV Y+ G ++ YL ++ +++
Sbjct: 52 GTWMGITRSGRFAALTNFRDPSAQLTNPQSRGKLVSDYLCGNQNPVNYLQEVV--FGQDL 109
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+N F++++ + S++ L S + ++T + +YG N Q P+
Sbjct: 110 YNPFNLLVGDR----------SDMFFLCSKTPEITKIMPG------IYGLSNYQL-DYPW 152
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYKM 208
KV K+ F ++ +++ E Q L + + D K P+ I + E +
Sbjct: 153 PKVQKSKQAFTNYLD-VSKEIEPQALFEILADREKAPDHQLPNTGISYELEKLLSS---- 207
Query: 209 RYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
V I + YGTR+ ++IL++ ++ ++F E T
Sbjct: 208 ------VFIHGMDYGTRSSTVILIEQHHRVNFVEKTF 238
>gi|296386835|ref|ZP_06876334.1| hypothetical protein PaerPAb_01827 [Pseudomonas aeruginosa PAb1]
Length = 251
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 85/221 (38%), Gaps = 33/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPVGSFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ + ++GF++++ + ++H++ V + VYG N+
Sbjct: 104 ERAAD--YSGFNLLIGD----RHQLWHYN------------PRVGPPRLLPAGVYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K + + + Q L++ + D S PD + E
Sbjct: 146 ALDT-PWPKLL----KARAALAERLAEPHPQALLELLTDAAPAADSQLPDTGVGLATERL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ V I+S YGTR S++ V + + E
Sbjct: 201 LS----------SVFIASPSYGTRASSVVRVHADGTREMIE 231
>gi|134094616|ref|YP_001099691.1| hypothetical protein HEAR1392 [Herminiimonas arsenicoxydans]
gi|133738519|emb|CAL61564.1| conserved hypothetical protein [Herminiimonas arsenicoxydans]
Length = 260
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/212 (22%), Positives = 87/212 (41%), Gaps = 33/212 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ + N+ + RP+ RG LV+ Y+ G + Y+ ++
Sbjct: 44 AGRDLQGGGTWMGITRDGRFAAVTNIRAPSEIRPDAPTRGNLVLNYLAGTTTPTGYVEEI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ +NGF++ L + +SN D N + K +YG N
Sbjct: 104 SPHAAD--YNGFNL----LVGNKDELIWYSNRGEKDE--------RNGKPMKPGMYGLSN 149
Query: 141 S--QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
S CT + KV+ K +FA ++ + + + + D P + +
Sbjct: 150 SLLDCT---WPKVVSVKAEFASLLCQ---GAPEDAFFEMLSDTTCAADCRLPKTGVSLER 203
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSII 230
E + S VC I+S YGTR +++
Sbjct: 204 ERLL--------SAVC--ITSHDYGTRVSTLV 225
>gi|15614290|ref|NP_242593.1| hypothetical protein BH1727 [Bacillus halodurans C-125]
gi|10174344|dbj|BAB05446.1| BH1727 [Bacillus halodurans C-125]
Length = 269
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 102/228 (44%), Gaps = 40/228 (17%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLG--ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D KGGTWL + GK+ L NV ++S + RG +V Y+ SA+ ++ L
Sbjct: 45 GRDLKKGGTWLGFTKEGKIAALTNVRDSFKHSGIPFQSRGFIVQDYLTSNTSAQLFMERL 104
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ +E F+G+++V ++Y+++N + ++ K Y N
Sbjct: 105 IQTADE--FDGYNLVFGSY----HDLYYYTN------------RMEKGEKLKPGYYMLSN 146
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
Q S + K + + ++ R+ + + +I+ +L +M++++ +PD ++
Sbjct: 147 GQMNSH-WPKAVKVRTHLQSVL---EREQDIETIIKKLLAIMQDEE-RFPDEQLP----- 196
Query: 201 DMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEW 243
D + + R+ +S IF YGTR ++I+ F EW
Sbjct: 197 --DTGVGIEWERI---LSPIFINGKTYGTRATTVIICTDEGESFFLEW 239
>gi|295398258|ref|ZP_06808303.1| conserved hypothetical protein [Aerococcus viridans ATCC 11563]
gi|294973517|gb|EFG49299.1| conserved hypothetical protein [Aerococcus viridans ATCC 11563]
Length = 256
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNV-LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D +GGTWLA S GK L N + + RG L++ Y++ + SA +Y+++L+
Sbjct: 46 RDLKQGGTWLAFSKAGKFAALTNYPFADRQVADPISRGFLIMDYLDSEISASDYVSNLRY 105
Query: 83 ETEENVFNGFHIVL 96
E+ F G+H+++
Sbjct: 106 HREQ--FEGYHLLV 117
>gi|427404628|ref|ZP_18895368.1| hypothetical protein HMPREF9710_04964 [Massilia timonae CCUG 45783]
gi|425716799|gb|EKU79768.1| hypothetical protein HMPREF9710_04964 [Massilia timonae CCUG 45783]
Length = 253
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/226 (23%), Positives = 89/226 (39%), Gaps = 32/226 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-----DRGPLVVKYVEGQKSAEEY 76
+D GG+W+ S G G L P R RG LV Y+ G+ +
Sbjct: 44 AGRDLQGGGSWMGVSLTGPNGPRFAALTNIRSPQERRLDAPSRGALVADYLRGE--LDAA 101
Query: 77 LTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
+ +++NG+++VL + +Y +SN D P + + +Y
Sbjct: 102 AYVAELAARPDLYNGYNLVLGD----GETLYWYSNRG-KDDPRN------GQPLERGRIY 150
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N + P+ KV+ K +FA ++ Q ++ M + PD R
Sbjct: 151 GISNGLLDT-PWPKVLRTKAQFASLL--------CQGAPEDAYFEMLADTTRAPD---VR 198
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + + S VC I S YGTRT +++ + + + HE
Sbjct: 199 LPETGVPIEMERMLSPVC--IESPEYGTRTSTVVKLYRDAAPELHE 242
>gi|124009216|ref|ZP_01693897.1| dkfzp761p1121 protein [Microscilla marina ATCC 23134]
gi|123985201|gb|EAY25135.1| dkfzp761p1121 protein [Microscilla marina ATCC 23134]
Length = 255
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 95/222 (42%), Gaps = 32/222 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ NG+ L N +N + RG L + Y++ S + YL +
Sbjct: 44 AGKDLEAGGTWMGLHKNGRFTALTNYRDIDNIKTTAPSRGMLTLDYLQSNDSPKAYLEGI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ + +NGF++ LT + +Y+ SN K+ + + +YG N
Sbjct: 104 FENID--AYNGFNL----LTGNTEELYYLSNY------QHKIIQL------QAGIYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + F KV K+KF E+++ K E L+ + D K ++ + + E
Sbjct: 146 ALLDTDWF-KVQRLKKKFTEMIDA--PKVEENALLDLMYDPTKANKTEVQRTGLPIEREI 202
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I S YGT + + IL+D++N + F E
Sbjct: 203 MLSPMF----------IKSPQYGTCSSATILIDYDNKVRFTE 234
>gi|167036184|ref|YP_001671415.1| hypothetical protein PputGB1_5197 [Pseudomonas putida GB-1]
gi|166862672|gb|ABZ01080.1| protein of unknown function DUF833 [Pseudomonas putida GB-1]
Length = 248
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/216 (20%), Positives = 80/216 (37%), Gaps = 37/216 (17%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ R RG LV Y++G+ E YL
Sbjct: 42 VHAGRDLEAGGTWLGVGPQGRFAALTNIRDPGHALGPRSRGELVAAYLQGEMGVEAYLGQ 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH--VYG 137
+ + + ++GF++++ D N H VYG
Sbjct: 102 VASRSGQ--YSGFNLLV------------------GDGQQLGYLNAREATPRLLHAGVYG 141
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N+ + P+ K++ + ++ E L+ + D + + P+ +
Sbjct: 142 LSNAGLDT-PWPKLVKARSGLEGLLE----APEPHRLLALLADNEPAAEGDLPETGVGVA 196
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
E + V I+S YGTR ++++VD
Sbjct: 197 TEKLLS----------SVFIASQNYGTRASTVLIVD 222
>gi|150391443|ref|YP_001321492.1| hypothetical protein Amet_3713 [Alkaliphilus metalliredigens QYMF]
gi|149951305|gb|ABR49833.1| protein of unknown function DUF833 [Alkaliphilus metalliredigens
QYMF]
Length = 256
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 51/224 (22%), Positives = 102/224 (45%), Gaps = 35/224 (15%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSRPNGR--DRGPLVVKYVEGQKSAEEYLTDLKK 82
D KGGTW + G++ L N + S P+ RG L ++ S YL ++
Sbjct: 47 DLEKGGTWAGVTKEGRVAFLTNY-RDPSLPSTAPLSRGFLTRDFLIQGGSPLSYLENI-- 103
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+T ++ +NGF++++ L +++ +SNI P K +YG N+
Sbjct: 104 QTNQSKYNGFNLIVGTLN----DLWFYSNIENEIRPI------------KPGLYGLSNAL 147
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ KV GK++ A ++ ++ ++ + D++ + + PD ++ +
Sbjct: 148 LNT-PWFKVDRGKKRLAALL-------DTDFTVEQLFDILDDTEVP-PDGKLPKTGVP-- 196
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+M + I S YGTR+ ++IL+ + + F+E L+
Sbjct: 197 ---LEMERLLSTIHIDSPAYGTRSKTVILMTNKGELQFYEKALE 237
>gi|449666512|ref|XP_002161018.2| PREDICTED: uncharacterized protein C22orf25-like [Hydra
magnipapillata]
Length = 268
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/219 (22%), Positives = 98/219 (44%), Gaps = 27/219 (12%)
Query: 25 DAVKGGTWLATSTNGKLGILLNV-LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+ +GGTWL + +G++ L N+ E + RG LV ++ + + +YL L+K
Sbjct: 52 EGYEGGTWLGITRSGRVCFLTNISTNEVISSAQKGRGQLVADFLRSEINPSDYLLGLEKI 111
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC 143
+ F F++V + I +N+ L+ +TN +
Sbjct: 112 --KKTFRPFNLVAGLIESDFIYISTENNLCALEKGYLCLTNSS----------------- 152
Query: 144 TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
+P K + KF ++ FN + E +N+ + + L+ +K N ++ +++ + +
Sbjct: 153 LDKPCLKALNSLNKFKDL---FNSEEEIENIHEKLFTLLADKTKN---TDLCQESSDEWN 206
Query: 204 EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + D +++F GTRT ++I VD NN + F E
Sbjct: 207 SSIYIDPTTAPFDNANVF-GTRTSTVITVDINNHVVFQE 244
>gi|323490234|ref|ZP_08095450.1| hypothetical protein GPDM_12811 [Planococcus donghaensis MPA1U2]
gi|323396074|gb|EGA88904.1| hypothetical protein GPDM_12811 [Planococcus donghaensis MPA1U2]
Length = 252
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 103/235 (43%), Gaps = 51/235 (21%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D + GTWL + G++ L N + G RG +V ++ + S E+YL +
Sbjct: 44 AGRDLTQLGTWLGVTKQGRIAALTNFRDPTNLEAGPLSRGAVVKDFLANKSSPEDYLQSI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+ + GF++++ + + + +++NI + S ++ T+ N F N
Sbjct: 104 DPKQ----YAGFNLIVGD----AEKLVYYNNI---QNESYEIPPGTHALSNH-----FLN 147
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV GKE A + ++E +L + + ++ A+A
Sbjct: 148 T-----PWPKVTKGKENLA----SYMAQTEEADL-EELFTIL---------------ADA 182
Query: 201 DMDEDYKMRYSRVCVDI----SSIF-----YGTRTHSIILVDHNNVMDFHEWTLD 246
D D + S V +D+ S IF YGTR+ +++LV H+N + F E D
Sbjct: 183 DHAPDTHLPDSGVGLDLERMLSPIFIKTPDYGTRSATVVLVSHDNTLTFVERNFD 237
>gi|26991820|ref|NP_747245.1| hypothetical protein PP_5144 [Pseudomonas putida KT2440]
gi|24986934|gb|AAN70709.1|AE016714_7 conserved hypothetical protein [Pseudomonas putida KT2440]
Length = 248
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/214 (20%), Positives = 83/214 (38%), Gaps = 33/214 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+ +D GGTWL G+ L N+ R RG LV +++G+ YL
Sbjct: 42 VHAGRDLEAGGTWLGVGPQGRFAALTNIRDPGQAQGPRSRGELVAAFLQGELGVGAYLDQ 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + + ++GF++++ + Q H P + K VYG
Sbjct: 102 VASRSGQ--YSGFNLLVGD--GQRLGYLHAREATP--------------RLLKAGVYGLS 143
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ + P+ K++ ++ +++ E L+ + D + P+ + E
Sbjct: 144 NAGLDT-PWPKLVKARDGLENLLD----TPEPHRLLALLADNEPAADGDLPETGVGLATE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ V I+S YGTR ++++VD
Sbjct: 199 KLLS----------SVFIASQNYGTRASTVLIVD 222
>gi|340354540|ref|ZP_08677244.1| protein of hypothetical function DUF833 [Sporosarcina newyorkensis
2681]
gi|339623356|gb|EGQ27859.1| protein of hypothetical function DUF833 [Sporosarcina newyorkensis
2681]
Length = 259
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 49/225 (21%), Positives = 103/225 (45%), Gaps = 31/225 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D + GTWL + GK+ L N + G+ RG +V Y++ Q E++ L
Sbjct: 44 AGRDLEQMGTWLGINKQGKIAALTNYRDFSLPIKGKQSRGHIVSSYLQSQTPPREFMEQL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+E + GF++ L S ++++SN+ + +T YG N
Sbjct: 104 HANHDE--YTGFNV----LAGSSEELFYYSNM------EQSIRYLTTG------TYGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ KV E+ +++ + + ++ ++ +IL M + +P +++ +
Sbjct: 146 ALLDT-PWPKV----EETKRLLSDYTKTADKIDV--DILFEMMQRAEGFP---LEQLPDT 195
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+ E+ + S + I+S YGTR +++L+D ++ + F E T
Sbjct: 196 GVGEELESLLSSIF--ITSEGYGTRCTTVVLIDSDDRVYFEERTF 238
>gi|390943225|ref|YP_006406986.1| hypothetical protein Belba_1625 [Belliella baltica DSM 15883]
gi|390416653|gb|AFL84231.1| hypothetical protein Belba_1625 [Belliella baltica DSM 15883]
Length = 256
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 18/137 (13%)
Query: 6 EKPNPCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVV 64
E+P+ + G E +D GGTW+ NG+ L N +N + N + RG LV
Sbjct: 27 ERPS-ASLGKWEQGFFAGKDLKAGGTWMGMHPNGRFAALTNYRDLKNPKINPKTRGDLVK 85
Query: 65 KYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLE-------------LTLQSTNIYHFSN 111
++E Q EYL ++ E E++ + GF++++ + + +Y SN
Sbjct: 86 DFLESQIPPLEYLKEI--EAEKHNYEGFNLLVADQEHLCYLSNYKDGIEELQPGLYGLSN 143
Query: 112 IAPLDSPSTKVTNVTNK 128
A LD+P TK+ N+
Sbjct: 144 -ALLDTPWTKLNMAKNR 159
>gi|407802596|ref|ZP_11149436.1| hypothetical protein S7S_01669 [Alcanivorax sp. W11-5]
gi|407023232|gb|EKE34979.1| hypothetical protein S7S_01669 [Alcanivorax sp. W11-5]
Length = 251
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 87/213 (40%), Gaps = 43/213 (20%)
Query: 25 DAVKGGTWLATSTNGKLGILLNV---LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
D GGTWL + +G+ + N + E R RG L +++++G++S +Y +
Sbjct: 44 DLSAGGTWLGVTRDGRFAAVTNFREPVAERVSDAARSRGDLPMQFLQGRESPSDYTARIA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+++ + F++++ + +++ SN Q V+G N
Sbjct: 104 D--QQDTYRPFNLLVGD----GDQLWYLSNRG------------AAPQAVTPGVHGLSNG 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPD----PEIDRK 197
+ P+ KV GKEK A + ++ ++L+ + D + + PD PE++R
Sbjct: 146 LLDT-PWPKVTRGKEKLAHVGQG---EARPEHLLALLHDDWRPDDDHLPDTGVGPELERL 201
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSII 230
+ I S YGTR S +
Sbjct: 202 V--------------APIFIRSPQYGTRASSAV 220
>gi|187478505|ref|YP_786529.1| hypothetical protein BAV2013 [Bordetella avium 197N]
gi|115423091|emb|CAJ49622.1| conserved hypothetical protein [Bordetella avium 197N]
Length = 255
Score = 47.0 bits (110), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 52/247 (21%), Positives = 96/247 (38%), Gaps = 39/247 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSR--PNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D + GGTW+ + NG IL N E R P+ RG L ++ G A +Y+
Sbjct: 42 AGRDLLAGGTWMGAARNGHYAILTNYR-EVGRHLPDAPSRGKLAEHFLAGDLPARDYIAS 100
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ +E +NGF++++ + + +++ N + KD +Y
Sbjct: 101 VAREGGR--YNGFNLIVGD----RQDAWYYGN------------RGGEPRLLKDGLYALS 142
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N S P+ K + K + F + L + D + ++ PD + R+ E
Sbjct: 143 NHLLDS-PWPKSLRLKHA---VRQTFEHGLDLPALFAALGDRTVAEDASLPDTGLPRERE 198
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK----KWIHT 255
+ + I S YGTR +++L N + E D + +H
Sbjct: 199 RLLSSPF----------IVSADYGTRCSTVLLWRANGQGELVERRFDSAARLTGETALHF 248
Query: 256 HIRKTLN 262
H+ T++
Sbjct: 249 HLPGTVS 255
>gi|388568286|ref|ZP_10154706.1| hypothetical protein Q5W_3049 [Hydrogenophaga sp. PBC]
gi|388264486|gb|EIK90056.1| hypothetical protein Q5W_3049 [Hydrogenophaga sp. PBC]
Length = 273
Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/226 (19%), Positives = 92/226 (40%), Gaps = 22/226 (9%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTWL G++ L NV + GR RG LV ++++ + ++ L
Sbjct: 47 GRDLRDGGTWLGLGPTGRIAWLTNVRQPGAEHRGRSRGELVSRWLQSDTDSTGFVDTL-- 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++V+ + L+ N P D+P + + + V+ N+
Sbjct: 105 --DPAAYAGFNLVVGD--LRRGEWAWIGNRRP-DAPHAEQAPERHWRAIAPGVHTLSNAT 159
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQ--NLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ K + A+ + + S+ Q +L ++D ++ P + + E
Sbjct: 160 LNT-PWPKA----RRLAQALGDALKSSDEQPASLTGALIDTTLAADADLPRTGVPDEVER 214
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + V YGTR+ +++ + + + EWT D
Sbjct: 215 VLSSPF--------VRWPDHAYGTRSSTLLRLGADGELRIDEWTHD 252
>gi|445497041|ref|ZP_21463896.1| hypothetical protein Jab_2c06390 [Janthinobacterium sp. HH01]
gi|444787036|gb|ELX08584.1| hypothetical protein Jab_2c06390 [Janthinobacterium sp. HH01]
Length = 247
Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 49/216 (22%), Positives = 89/216 (41%), Gaps = 35/216 (16%)
Query: 22 VSQDAVKGGTWLATSTNG------KLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAE 74
+D GG+W+ + + + N+ + R + RG LV +++ G + +
Sbjct: 44 AGRDLQAGGSWMGITQPAEGRNASRFAAITNIRAPSEHRDDAPSRGHLVAEFLAGSMTPQ 103
Query: 75 EYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH 134
EY+ ++ + +NGF++VL + + FSN D N Q +
Sbjct: 104 EYVDAIRADA--GAYNGFNLVLSD----GDTLIWFSNRGDADP--------RNGQPLEPG 149
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
VYG N+ + P+ KV+ K +FA ++ + + + D P P+
Sbjct: 150 VYGLSNALLDA-PWPKVVKTKAQFASLLCL---GAPDDAFFEMLAD-------TTPAPD- 197
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSII 230
R E + D + S V I S YGTRT +++
Sbjct: 198 QRLPETGVPLDLERMLS--AVRIESPSYGTRTSTVV 231
>gi|347538023|ref|YP_004845447.1| hypothetical protein NH8B_0184 [Pseudogulbenkiania sp. NH8B]
gi|345641200|dbj|BAK75033.1| protein of unknown function [Pseudogulbenkiania sp. NH8B]
Length = 243
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 12/115 (10%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GG+WLA G+ + N+ +R R RG LV ++V G+ SA + LK
Sbjct: 30 GRDLQAGGSWLALDKRGRFAAITNIREGLARRGERTRGELVARFVTGEDSALAFAAWLKA 89
Query: 83 E----TEENVFNG-------FHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVT 126
E N+ G FH LT + I+ SN A LD+P K +
Sbjct: 90 EHGRYAPFNLLFGQLDDLYHFHSRGARLTRLTPGIHTLSN-ATLDTPWPKAQRLA 143
>gi|293605765|ref|ZP_06688139.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
43553]
gi|292815814|gb|EFF74921.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
43553]
Length = 256
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 91/213 (42%), Gaps = 33/213 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D + GGTW+ +T G+ ++ N G++ P RG LV Y+ G + +YL
Sbjct: 44 AGRDGLAGGTWMGVTTRGRFALVTNFREPGKHRDP-APSRGALVEDYLRGDYAPADYLA- 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + +NGF++++ + + ++ SN + +Q + +Y
Sbjct: 102 -RTHESDQAYNGFNLIVGD----TREAWYLSN-----------RDGAPRQLDPG-IYALS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N P+ K+ K F E++ + Q + + + + ++Q+ D D A
Sbjct: 145 N-HLLDTPWPKLARTKTAFTEVLGR-----SPQPDLPALFEALADRQTATDD---DMPAT 195
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
+ K+ S V S YGTR+ S++++
Sbjct: 196 GLPLDREKLLSSPFIV---SPDYGTRSSSVLVL 225
>gi|224826090|ref|ZP_03699193.1| protein of unknown function DUF833 [Pseudogulbenkiania ferrooxidans
2002]
gi|224601727|gb|EEG07907.1| protein of unknown function DUF833 [Pseudogulbenkiania ferrooxidans
2002]
Length = 259
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 12/115 (10%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GG+WLA G+ + N+ +R R RG LV ++V G+ SA + LK
Sbjct: 46 GRDLQAGGSWLALDKRGRFAAITNIREGLARRGERTRGELVARFVTGEDSALAFAAWLKA 105
Query: 83 E----TEENVFNG-------FHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVT 126
E N+ G FH LT + I+ SN A LD+P K +
Sbjct: 106 EHGRYAPFNLLFGQLDDLYHFHSRGARLTRLTPGIHTLSN-ATLDTPWPKAQRLA 159
>gi|357014701|ref|ZP_09079700.1| hypothetical protein PelgB_34955 [Paenibacillus elgii B69]
Length = 247
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/219 (21%), Positives = 97/219 (44%), Gaps = 35/219 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D ++ GTW+ + +G+ L N + G R RG LV Y+ G +S + Y
Sbjct: 40 AGRDLLRNGTWMGVTNSGRFAALTNYRDPAEQTEGKRSRGQLVSSYLIGNESPQAYTERA 99
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+E E + G+++++ + +Y++SNI ++ + VYG N
Sbjct: 100 ARERTE--YPGYNLLIGD----PNELYYYSNIG------------HEIRKLQPGVYGLSN 141
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + KV GK++ + ++ + + +L+++ + P P+ +A
Sbjct: 142 HLLDTD-WPKVTKGKKELEAQLEDVR-----EDRVDALFELLEDAE---PAPD-----DA 187
Query: 201 DMDEDYKMRYSRVC--VDISSIFYGTRTHSIILVDHNNV 237
+++ R+ + I S YGTR+ +I+L+ N +
Sbjct: 188 LPATGVPLQWERLLSPIYIRSDHYGTRSSTILLMTDNKL 226
>gi|21241540|ref|NP_641122.1| hypothetical protein XAC0770 [Xanthomonas axonopodis pv. citri str.
306]
gi|381171114|ref|ZP_09880264.1| conserved hypothetical protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|390990607|ref|ZP_10260890.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
str. LMG 859]
gi|418515278|ref|ZP_13081459.1| hypothetical protein MOU_00495 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418521029|ref|ZP_13087075.1| hypothetical protein WS7_08418 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|21106890|gb|AAM35658.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
gi|372554699|emb|CCF67865.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
str. LMG 859]
gi|380688509|emb|CCG36751.1| conserved hypothetical protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|410703005|gb|EKQ61502.1| hypothetical protein WS7_08418 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410707997|gb|EKQ66446.1| hypothetical protein MOU_00495 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 255
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/225 (20%), Positives = 85/225 (37%), Gaps = 35/225 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
L +D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G A Y
Sbjct: 43 LLAGRDLRSGGSWVGLDRDGRAAVVTNVRDPLATSSGRSRGHLIADYLSGSLGATVYAD- 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ F F+++L + + H SN PL ++ ++G
Sbjct: 102 -ALAGAAHAFPPFNLLLCD----ADRCEHLSNHPPL------------ARQLAAGIHGMS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRK 197
N + P+ K ++++++ + Q L + + + P +D
Sbjct: 145 NGPLDA-PWPKT----AALTKVLHRWCADGDEDLQPLWAALGNPAIAPDAALPQTGVDLA 199
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + IS YGTR +I+ VDH+ HE
Sbjct: 200 TERLLSAAF----------ISGPSYGTRASTIVAVDHHGHGFIHE 234
>gi|374369126|ref|ZP_09627163.1| hypothetical protein OR16_25973 [Cupriavidus basilensis OR16]
gi|373099276|gb|EHP40360.1| hypothetical protein OR16_25973 [Cupriavidus basilensis OR16]
Length = 331
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 70/168 (41%), Gaps = 21/168 (12%)
Query: 30 GTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + +G+ L N + R + R RG LV ++ G + YL+ L T
Sbjct: 51 GTWMGLAGDGRFAALTNYRAPSEKRTDARSRGELVAGFLRGDSDPQSYLSGLVPRT--GA 108
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQPF 148
+NGF+++ +L ++ +SN P Q K +YG N+ + P+
Sbjct: 109 YNGFNLLACDL----RELWWYSNRGTSPYP----------QRLKPGIYGLSNALLDT-PW 153
Query: 149 QKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
KV AE + + ++ LDL+ N + D E+ R
Sbjct: 154 PKVRSRVGALAETLAADTGLPGAS--VEPYLDLLANDRQAL-DAELPR 198
>gi|260949121|ref|XP_002618857.1| hypothetical protein CLUG_00016 [Clavispora lusitaniae ATCC 42720]
gi|238846429|gb|EEQ35893.1| hypothetical protein CLUG_00016 [Clavispora lusitaniae ATCC 42720]
Length = 316
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 56/273 (20%), Positives = 114/273 (41%), Gaps = 33/273 (12%)
Query: 10 PCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEG 69
P NG ++L + + GTW+ + GK+ +L+N E++ + RG L ++Y+
Sbjct: 36 PLKNG-TQILSPLDLGRPERGTWIGITDTGKVAVLVNYREESNFVSEVSRGILPLQYLTS 94
Query: 70 QKSAEEYLTDLKKETEENVFNGFHIVL-----LELTLQSTNIYHFSNIAPLDSPSTK--- 121
+++ L+K N G + L L + +I PL+ S +
Sbjct: 95 DLEDDDWYESLEKNLSMNSVTGGSVSLEQIGGFTLMYGKLELDSKGSIKPLNIMSNRGDK 154
Query: 122 --------VTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNK-FNRKSESQ 172
+ + + + +G NS S P+ KV G K E+V K K +
Sbjct: 155 GRIHSYEALDEIPHFEIACQPTFGLSNSLFYS-PWPKVTNGASKMGELVEKSVEHKYTQE 213
Query: 173 NLIQNILDLMKNKQSNYPDPEIDR-----KAEADMDEDYKMRYSRVCVDISSI------F 221
+L++ +L+ ++ DPEI + K ++ + D++++ +
Sbjct: 214 DLVEACFELL---STDTFDPEIRKDTSFSKKLQELPNSIFIPPLETNYDLATVSPMVGKY 270
Query: 222 YGTRTHSIILVDHNNVMDFHEWTLDYETKKWIH 254
YGTRT ++I++ + + ++E L + +H
Sbjct: 271 YGTRTQTVIMLHKSGTLHYYERDLHSDDTSDVH 303
>gi|333920264|ref|YP_004493845.1| hypothetical protein AS9A_2598 [Amycolicicoccus subflavus DQS3-9A1]
gi|333482485|gb|AEF41045.1| hypothetical protein AS9A_2598 [Amycolicicoccus subflavus DQS3-9A1]
Length = 243
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 44/221 (19%), Positives = 83/221 (37%), Gaps = 28/221 (12%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + N RG L ++ SA EY +
Sbjct: 31 AGRDELSGGTWMGVTAGQRFAAVTNFRQGTPAAGTLSRGDLPKDFLLSDLSAREYCDAVA 90
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E F GF + + ++ SN + T + ++G N+
Sbjct: 91 ARGSE--FGGFSL----FASDGSELWWISN-----------RSDTGPSIVQPGIHGLSNA 133
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ GK +FAE+ + ++ + + + D K + P + R E
Sbjct: 134 LLDT-PWPKVVDGKAEFAEVATADDGSADPEEYLAVLADTTKAPSRSLPSTGVPRLFE-- 190
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
K+ SR I YGTR +++ + + ++ E
Sbjct: 191 -----KLLSSRF---IRMGSYGTRASTVLRIRADGSIELTE 223
>gi|389757471|ref|ZP_10191673.1| hypothetical protein UU5_17947 [Rhodanobacter sp. 115]
gi|388430783|gb|EIL87911.1| hypothetical protein UU5_17947 [Rhodanobacter sp. 115]
Length = 252
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 43/222 (19%), Positives = 88/222 (39%), Gaps = 33/222 (14%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEY 76
++ + +D GGTWL + +G+ G++ NV G+ RG L+ Y+ G A +
Sbjct: 38 DLPIIAGRDLQAGGTWLGVTGDGRCGVVTNVRDPRDPQRGQSRGLLLTDYLGGSADAATH 97
Query: 77 LTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVY 136
L+ + + F++ LT + + ++ N P + VT V+
Sbjct: 98 AEALRACAAD--YRPFNL----LTFDAHDAFYLGN-----RPEARAQAVTPG------VH 140
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ K + + ++ +++ + L Q+ PD E+
Sbjct: 141 GLSNADFNT-PWPKT----RALMQRLQRWTTSADADDFAPLFAALADEHQA--PDDELP- 192
Query: 197 KAEADMDEDYKMRYSRVCVD--ISSIFYGTRTHSIILVDHNN 236
D + R+ I YGTR +++ + H++
Sbjct: 193 ------DTGVGLERERMLSPAFIRGEHYGTRASTVVAIGHDD 228
>gi|374336267|ref|YP_005092954.1| hypothetical protein GU3_12245 [Oceanimonas sp. GK1]
gi|372985954|gb|AEY02204.1| hypothetical protein GU3_12245 [Oceanimonas sp. GK1]
Length = 258
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + NG+ L NV + R RG LV ++ G++S +YL +
Sbjct: 43 AGRDLQAGGTWLGITQNGRFAALTNVRQGAPQTGRRSRGELVTGFLAGRQSPGDYLQQVL 102
Query: 82 KETEENVFNGFHIVLLEL 99
E + + GF++++ +L
Sbjct: 103 AEGQ--AYAGFNLLVGDL 118
>gi|319650574|ref|ZP_08004714.1| hypothetical protein HMPREF1013_01319 [Bacillus sp. 2_A_57_CT2]
gi|317397755|gb|EFV78453.1| hypothetical protein HMPREF1013_01319 [Bacillus sp. 2_A_57_CT2]
Length = 254
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 48/219 (21%), Positives = 98/219 (44%), Gaps = 33/219 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D K GTW+ + NG+ L N N +G R RG LV +++G S + +
Sbjct: 47 AGRDLSKMGTWMGVTKNGRFAALTNYRDPNEVTDGKRSRGDLVADFLKGSASPSRF---M 103
Query: 81 KKETE-ENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
K +E N + G++++ L ++++SN+ ++ VYG
Sbjct: 104 KIASEHRNSYPGYNLLAGNL----EELFYYSNVE------------DRIEQLGPGVYGVS 147
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N ++ + KV GKE ++I++ + + + + L++N P P+ D +
Sbjct: 148 NHVLNTE-WPKVKKGKEGLSKIIDH-----AAGDFTEALFTLLQNAD---PAPD-DMLPK 197
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVM 238
+ +++ S + I S YGTR+ +++L+ + +
Sbjct: 198 TGVSLEWERMLSPLF--IRSEGYGTRSSTVMLMSEDEIF 234
>gi|407716352|ref|YP_006837632.1| hypothetical protein Q91_1091 [Cycloclasticus sp. P1]
gi|407256688|gb|AFT67129.1| hypothetical protein Q91_1091 [Cycloclasticus sp. P1]
Length = 255
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 52/91 (57%), Gaps = 13/91 (14%)
Query: 25 DAVKGGTWLATSTNGKLGILLNV----LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
D V GG+WL+T T+G+L L NV EN++ RG LV ++ Q +A +++ L
Sbjct: 47 DHVAGGSWLSTDTSGRLAALTNVRKPPFIENTK---LSRGHLVRDFLSQQSTAPDFIEKL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSN 111
KK + + F+++L++ T ++++SN
Sbjct: 104 KKRRAD--YGLFNLLLMD----HTGLWYYSN 128
>gi|221043578|dbj|BAH13466.1| unnamed protein product [Homo sapiens]
Length = 198
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 3/89 (3%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 94 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 151
Query: 84 TEENVFNGFHIVLLEL-TLQSTNIYHFSN 111
E +++NGF+++ +L T + I ++ N
Sbjct: 152 MEGHLYNGFNLIAADLSTAKGDVICYYGN 180
>gi|149926473|ref|ZP_01914734.1| hypothetical protein LMED105_13778 [Limnobacter sp. MED105]
gi|149824836|gb|EDM84050.1| hypothetical protein LMED105_13778 [Limnobacter sp. MED105]
Length = 269
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 94/236 (39%), Gaps = 37/236 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPN----GRDRGPLVVKYVEGQKSAEE 75
L +D GGTWL +N + +L NV NS N RG LV+ ++ + + EE
Sbjct: 43 LWAGKDLQSGGTWLGIGSNAEFALLTNV--RNSTLNMPGTAPSRGQLVLDAIQTRATPEE 100
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTN---VTNKQENK 132
K + FN L+ LQS N+ SN S K+ N + +N
Sbjct: 101 -----KASLQYAGFN-----LIHGNLQSLNLLCTSN------QSLKLGNGLDYSVALQNG 144
Query: 133 DHVY--GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYP 190
H G+ N+ K +E A N + + Q L+ + + + + P
Sbjct: 145 LHSLSNGYLNAPWPKSRLLKTGLQQEIEANCSNHTSLAAFEQTLLGLLTNTRLAEDAELP 204
Query: 191 DPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + E + V I S YGTR+ ++IL+D +N + F E T +
Sbjct: 205 STGVPYEWEKMLS----------AVKIVSPLYGTRSSAVILLDRSNTVHFTEITFN 250
>gi|197117116|ref|YP_002137543.1| hypothetical protein Gbem_0721 [Geobacter bemidjiensis Bem]
gi|197086476|gb|ACH37747.1| protein of unknown function DUF833 [Geobacter bemidjiensis Bem]
Length = 258
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 91/221 (41%), Gaps = 32/221 (14%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTWL +T ++ L N + +G RG LV +++G +E YL L+
Sbjct: 45 GRDLLHGGTWLGITTTARIAALTNYRDPADLVRHGLSRGRLVSGFLKGTGDSEAYLECLR 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ F G++ LL T + + Y S T T++ ++G N
Sbjct: 105 EAA--GPFGGYN--LLVGTPERLSCY---------SSKTDRTSILEP-----GIHGLSN- 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV+ GK+ + + + + L + D + PD + + E
Sbjct: 146 RLLDTPWPKVVRGKKALEQAL--LAADPDLEELFAILSDRTRPPDDQLPDTGVGLELERL 203
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I S YGTR+ +++L+D + F E
Sbjct: 204 LSPIF----------IESERYGTRSCTVLLLDRDGEARFVE 234
>gi|372269490|ref|ZP_09505538.1| hypothetical protein MstaS_00364 [Marinobacterium stanieri S30]
Length = 260
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 92/221 (41%), Gaps = 35/221 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+A N + ++ N L N R RG LV ++ +A ++ L
Sbjct: 45 RDLEAGGSWMALGDNNRFVLVTNHRNLRLPKPENARSRGNLVADFINSPLAAPDFCNQL- 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E + GF+++L + +T YH+SNI+ Q V+G N+
Sbjct: 104 ---EAKDYAGFNLLLRD----TTGWYHYSNIS------------QRCQRLGAGVFGLSNA 144
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K + + + + + + + E+ L+ + + + K + PD + + E
Sbjct: 145 LLDT-PWPKTLRARARLQQQLAQGLPQPET--LLHLLHETSRPKDAYLPDTGLSLERE-- 199
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
R C I S YGTR S++L + + F E
Sbjct: 200 -------RLLSSCF-IQSADYGTRASSLLLQHRSGRLLFVE 232
>gi|403413042|emb|CCL99742.1| predicted protein [Fibroporia radiculosa]
Length = 218
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/168 (22%), Positives = 72/168 (42%), Gaps = 14/168 (8%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR---DRGPLVVKYVEGQKSAEEYLTDL 80
+D + GGTW S G++ +L N+ + P G+ RG L ++ + + ++
Sbjct: 55 RDLLAGGTWAGISRAGRVALLTNI----TEPAGKYTSSRGGLASSFLMPETPERAFRAEV 110
Query: 81 KKETEENV-FNGFHIVLLELTLQSTNIYHFSNIAPL--DSPSTKVTNVTNKQENKDHVYG 137
++ +N + GF+++LL +Q + A L +S V + + G
Sbjct: 111 ERIVAQNAKYAGFNLLLLSPVMQGQSNRALRMDAALVTNSGGGGVITTRDLSSEERRCGG 170
Query: 138 FGNS---QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLM 182
N Q S+ + KV G EI+N S+ L++ + D++
Sbjct: 171 MSNGIDRQGASE-WPKVKHGTHALQEIINTIPGNSDEPELVERLFDML 217
>gi|440747583|ref|ZP_20926840.1| hypothetical protein C943_3841 [Mariniradius saccharolyticus AK6]
gi|436484053|gb|ELP40073.1| hypothetical protein C943_3841 [Mariniradius saccharolyticus AK6]
Length = 241
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 64/146 (43%), Gaps = 28/146 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL G+ L N N + RG LV ++EG+ ++YL +
Sbjct: 25 AGKDIRAGGTWLGIHPGGRFATLTNFRDLRNKNKYEKSRGDLVKNFLEGKDHPKDYLEKI 84
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH----VY 136
+ E E ++GF++ L +++FS NKQE +Y
Sbjct: 85 QSEKGE--YDGFNL----LVADGDELFYFS----------------NKQEGIHRLDFGLY 122
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIV 162
G N+ + P++K++ KEK A+ +
Sbjct: 123 GLSNALLET-PWRKLLEAKEKLAQKI 147
>gi|186475805|ref|YP_001857275.1| hypothetical protein Bphy_1040 [Burkholderia phymatum STM815]
gi|184192264|gb|ACC70229.1| protein of unknown function DUF833 [Burkholderia phymatum STM815]
Length = 273
Score = 45.4 bits (106), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 83/188 (44%), Gaps = 38/188 (20%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAE---- 74
L +D V GGTWL + +G+ L N + R + RG LV ++ G++ E
Sbjct: 44 LLAGRDLVGGGTWLGMTRDGRFAALTNYRAPSEMRADAPTRGTLVSNWLSGERGIESGAP 103
Query: 75 -EYLTDLKKETEENVFNGFHIVLLELTLQSTNIY-HFSNIAP-LDSPSTKVTNVTNKQEN 131
+YL L+ + +++NGF++++ + T + Y + S AP L +P T
Sbjct: 104 LDYL--LRVAQDGDMYNGFNLLVGDWTRRELAWYCNRSPAAPALLAPGT----------- 150
Query: 132 KDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPD 191
+G N+ P+ K++ + + A + +S+ + ++ LM+ D
Sbjct: 151 ----HGISNA-VLDTPWPKLVRKRAELARTL-----ADDSRPPLTTLIGLMR-------D 193
Query: 192 PEIDRKAE 199
P + R E
Sbjct: 194 PRVARDDE 201
>gi|421483433|ref|ZP_15931010.1| hypothetical protein QWC_12501 [Achromobacter piechaudii HLE]
gi|400198677|gb|EJO31636.1| hypothetical protein QWC_12501 [Achromobacter piechaudii HLE]
Length = 267
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 48/229 (20%), Positives = 90/229 (39%), Gaps = 45/229 (19%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR------DRGPLVVKYVEGQKSAEE 75
+D + GGTW+ ++ G+ ++ N R GR RG LV ++ G S E
Sbjct: 44 AGRDGLAGGTWMGATSRGRYALVTNF-----REPGRHLDAAPSRGALVEDFLRGGTSPAE 98
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNI--APLDSPSTKVTNVTNKQENKD 133
YL + + ++ +NGF++++ + LQ+ ++ SN AP P
Sbjct: 99 YLARVHE--KDQAYNGFNLIVGD-ALQA---WYLSNRDGAPRALPPG------------- 139
Query: 134 HVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPE 193
+Y N P+ K+ K F ++++ + + L + D + P
Sbjct: 140 -IYALSN-HLLDTPWPKLARTKAAFTAVLDR-APQPDMPALFAALADRQGAEDDELPATG 196
Query: 194 IDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ R E + + I S YGTR+ S++ + + E
Sbjct: 197 LPRDREKLLSSPF----------IVSPNYGTRSSSVLALHADGAGQLEE 235
>gi|406695926|gb|EKC99223.1| hypothetical protein A1Q2_06423 [Trichosporon asahii var. asahii
CBS 8904]
Length = 504
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 48/227 (21%), Positives = 98/227 (43%), Gaps = 32/227 (14%)
Query: 27 VKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEE 86
V G L T+ G I ++ + P+ RG L+ ++ A+ ++D K+
Sbjct: 271 VLSGRDLGTAEKGTTNIRYPIVA--TPPDPPSRGMLLKSFLSAAPDAKVSVSDFLKDIPA 328
Query: 87 NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCTSQ 146
+ GF+++L +L + + SN + T +T N D G NS Q
Sbjct: 329 KDYVGFNLLLFDLQSSPAEVGYLSN-------RPEPTQLT---PNNDSCQGISNSPW-DQ 377
Query: 147 PFQKVIFGKEKFAEIVNKFNRKSES-QNLIQNILDLMKNKQSNYPDPEIDRKAEADMDED 205
P+ KV G+E+ A+ + + + + ++L+ ++DL+ P P + + D+
Sbjct: 378 PYPKVTEGEERMAKTLEAWAMEGRNEEDLVTRMMDLLS------PAPPV--TSAKDLFRA 429
Query: 206 YKMRYSRVCVDISSI----------FYGTRTHSIILVDHNNVMDFHE 242
+++ + D ++ +YGTR ++I+V + + F E
Sbjct: 430 TRVQPVIIGPDPNAPPADRPTEGGRWYGTRVSTVIIVRDDGHVLFVE 476
>gi|329895894|ref|ZP_08271222.1| hypothetical protein IMCC3088_1766 [gamma proteobacterium IMCC3088]
gi|328922112|gb|EGG29471.1| hypothetical protein IMCC3088_1766 [gamma proteobacterium IMCC3088]
Length = 256
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 47/221 (21%), Positives = 84/221 (38%), Gaps = 27/221 (12%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL + G+ L NV RG LV ++ Q S+ E + L
Sbjct: 44 AGRDLQAGGTWLGVNAQGRFAALTNVRETLPPATETSRGSLVGLWLSNQSSSMELVGQLA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
++G++++ F + + L S + N +YG N+
Sbjct: 104 DTAA--TYSGYNLI-------------FGDTSQLLWASNRGQNGFEYSYLASGIYGLSNA 148
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K+ K++ A + K + S I +L ++ + D
Sbjct: 149 SLNT-PWPKLRLAKQRLA-LQLKLSHPS--------IAELAWVTADTCTQDDLSHLGDTD 198
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
M+E + S I++ YGTR ++I ++ M HE
Sbjct: 199 MEESWLRALSAQW--IATPDYGTRAQTVITRENTGQMTCHE 237
>gi|410665207|ref|YP_006917578.1| hypothetical protein M5M_13540 [Simiduia agarivorans SA1 = DSM
21679]
gi|409027564|gb|AFU99848.1| hypothetical protein M5M_13540 [Simiduia agarivorans SA1 = DSM
21679]
Length = 257
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 83/215 (38%), Gaps = 38/215 (17%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GG+WL S + L N+ + RG L++ + G+ A L
Sbjct: 45 GKDLQAGGSWLMASEQARWAALTNIRNPAAPVAQGSRGTLILDALRGRAPA------LNA 98
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ G++++ EL+ I + +N + P + Q K YG N+
Sbjct: 99 DV-----GGYNLLKGELSPGGWQIDYHTNGGGIHPPGS--------QRLKPGCYGLSNAG 145
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ----SNYPDPEIDRKA 198
S P+ KV GK F V E +L Q + L+ ++ + PD + +
Sbjct: 146 LDS-PWPKVTQGKSAFVNAVG----TEEDADLRQALWALLAHRATADTAELPDTGVGLEW 200
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
E + + I S YGTR S++L+D
Sbjct: 201 ETRLSARF----------IVSPDYGTRASSLLLLD 225
>gi|238879786|gb|EEQ43424.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 338
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 114/253 (45%), Gaps = 41/253 (16%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGRD--RGPLVVKYVEGQKSAEEY-------LTDL 80
GTW+ +TNGK+ +L+N + R + RG L + Y+ G++S E+ D
Sbjct: 55 GTWIGVTTNGKIAVLVNYRENDQREAISEVSRGILPLDYLCGKESDNEWKDNLAATFRDG 114
Query: 81 KKETEENVFNGFHIVLLELTL--QSTNIYHFSNIA----------------PLDSPSTKV 122
+ + GF ++ +L++ Q+ I+H + ++ + + +
Sbjct: 115 RNTIDLKKIGGFSLLYGKLSINPQTGKIHHLNILSNRGDQGKVFKSKETSKDIANGEINL 174
Query: 123 TNVTNKQENKD--HVYGFGNSQCT-SQPFQKVIFGKEKFAEIVNK--FNRKSESQNLIQN 177
+N T+ E+ D H FG S ++P+ KV G+ ++++++ N S+ Q LI+
Sbjct: 175 SNDTSDTEDDDIPHKTTFGLSNSLYNKPWDKVKLGESLLSKLIDESIINNYSQDQ-LIEK 233
Query: 178 ILDLMKNKQSNYPDPEIDRKAEADMDEDYKM------RYSRVCVDISSI--FYGTRTHSI 229
L+ + N D + + Y + R + D ++ +YGTRT +I
Sbjct: 234 CFQLLSHDTYNSKILHSDDFEKKFAELKYSIFIPPIVRNNYYETDSVAVGKYYGTRTQTI 293
Query: 230 ILVDHNNVMDFHE 242
IL+D ++++E
Sbjct: 294 ILLDKYGNLNYYE 306
>gi|262194930|ref|YP_003266139.1| hypothetical protein [Haliangium ochraceum DSM 14365]
gi|262078277|gb|ACY14246.1| protein of unknown function DUF833 [Haliangium ochraceum DSM 14365]
Length = 267
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 50/224 (22%), Positives = 98/224 (43%), Gaps = 23/224 (10%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + G+ L N+ + + RG LV ++ YL L
Sbjct: 44 AGRDLEAGGTWMGVTRAGRWAALTNIHEPGVAHEDALSRGELVSGFLRNDDEPGAYLAAL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E E+ + GF++V+ +++ SN + P+ V V+G N
Sbjct: 104 --EAREHRYRGFNLVI----GTREQVWYVSNRS---GPARAV---------DAGVHGVSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + P+ KV+ G+ AE R ++ L++++ +L+ ++ ++ R+A
Sbjct: 146 ALLDT-PWPKVVRGRRLLAEAAEAAARDRDTPALLRSLHELLADR--DFDVAAALRRAGI 202
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT 244
D D D + + R + + + YGT + + ++VD + F E T
Sbjct: 203 DGDADPEA-WKRASLFLDTPGYGTCSSTAVVVDAQGRVTFSERT 245
>gi|260221722|emb|CBA30570.1| hypothetical protein Csp_C24210 [Curvibacter putative symbiont of
Hydra magnipapillata]
Length = 251
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 30 GTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTWL S GK+ + N G+ R RG LV +++ AEEY+ +L T+ +
Sbjct: 50 GTWLGCSRKGKMAAVTNFRSGKAPNSQARSRGELVTQFLSAALDAEEYINELS--TQADA 107
Query: 89 FNGFHIVLLE 98
+N F++++ +
Sbjct: 108 YNAFNLLVFD 117
>gi|418293944|ref|ZP_12905845.1| hypothetical protein PstZobell_11604 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
gi|379065328|gb|EHY78071.1| hypothetical protein PstZobell_11604 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
Length = 251
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 48/229 (20%), Positives = 96/229 (41%), Gaps = 51/229 (22%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ +T+G+ L N+ R RG L ++ +YL +L +
Sbjct: 45 GRDLQAGGTWMGITTSGRFAALTNIRAPGQPTGLRSRGELPEHFLRADLQPADYLAELAE 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++ L ++HF +S S T + + VYG N+
Sbjct: 105 RRGD--YAGFNL----LVGDHQALWHF------NSQSGLATAL------EAGVYGLSNAG 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K+ K+ A +V + ++ + Q +L L+ + Q +
Sbjct: 147 LDT-PWPKL---KKARAALVGELDKPNPEQ-----LLALLADPQ---------------L 182
Query: 203 DEDYKMRYSRVCVD----ISSIF-----YGTRTHSIILVDHNNVMDFHE 242
++++ ++ V ++ +SSIF YGTR + ++ + +D E
Sbjct: 183 AAEHELPHTGVPLEWERRLSSIFIVSPEYGTRASTALIRHADGALDILE 231
>gi|50549657|ref|XP_502299.1| YALI0D01760p [Yarrowia lipolytica]
gi|49648167|emb|CAG80485.1| YALI0D01760p [Yarrowia lipolytica CLIB122]
Length = 315
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 52/238 (21%), Positives = 99/238 (41%), Gaps = 34/238 (14%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGRD--RGPLVVKYVEGQKSAEEYLTDLKKETEEN 87
GTW+ + GK+ ++LN + + D RG +++ +S EE+ + K+ E+
Sbjct: 51 GTWIGCNKQGKIAVILNYQEKRDKDAIGDISRGKFPKEFLLSNESPEEFTKNFKRRYPED 110
Query: 88 ---VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
GF + ++ L FS I+ V ++ +D G NS
Sbjct: 111 KLQAAGGFSFLFGQVDLNGKT--QFSIISNRGGDHEWVA-----RDPEDQTIGISNS-LF 162
Query: 145 SQPFQKVIFGKEKFAEIVNK-FNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMD 203
P+ K G E+ E+V + S + L+ ++ D++ + + DP +K ++ +
Sbjct: 163 CNPWPKCALGIERLDEVVKESVESGSSRKELVDSLFDVLSH---DTFDPGA-KKGASEQE 218
Query: 204 EDYKMRYSRVCVDISSI----------------FYGTRTHSIILVDHNNVMDFHEWTL 245
+ +R S I++ YGTRT +II+V + + + E TL
Sbjct: 219 IHHALRQSVFIPVIAAAGYQEGQDWNVTHPFGKHYGTRTQTIIMVGKDGQLRYFEKTL 276
>gi|333893267|ref|YP_004467142.1| hypothetical protein ambt_09070 [Alteromonas sp. SN2]
gi|332993285|gb|AEF03340.1| hypothetical protein ambt_09070 [Alteromonas sp. SN2]
Length = 274
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 100/227 (44%), Gaps = 32/227 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKY-VEGQKSAEEYLTD 79
D GTW+ + NGK+ L N+ E + + RG LV + ++ YL
Sbjct: 64 AGHDLQAHGTWMGVTRNGKVAALTNIRAPETIKTDAISRGGLVADWLIDEAMKQPTYLNV 123
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
L+ + +NG+++V + ++S +Y+ + + T+ T Q VYG
Sbjct: 124 LRANRHK--YNGYNLVYGD--VKSLAVYN----------NFEDTHATLTQG----VYGLS 165
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N+ T+ P+ KV G + V++ N + +++ L + D K P+ I + E
Sbjct: 166 NANLTT-PWPKVTKGIASLTDYVSQ-NNQLDTEALFAILKDEDKASDHTLPNTGIGYEWE 223
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+ + I S YGTRT +++LV+ + + ++E T +
Sbjct: 224 KRLSSIF----------IQSPEYGTRTSTLLLVNSHQQIHWYERTFN 260
>gi|421078366|ref|ZP_15539320.1| protein of unknown function DUF833 [Pelosinus fermentans JBW45]
gi|392523556|gb|EIW46728.1| protein of unknown function DUF833 [Pelosinus fermentans JBW45]
Length = 258
Score = 44.7 bits (104), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 101/224 (45%), Gaps = 36/224 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRD---RGPLVVKYVEGQKSAEEYLT 78
+D + GTW+ + +G+ L N + R + D RG LV Y+ Q+S +EYL
Sbjct: 44 AGRDLKELGTWMGITVHGRFAALTNY--RDPRQHTTDALSRGHLVANYLSNQQSPQEYLG 101
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+ K+ ++ +NGF++++ +L +++++ N +V V V+G
Sbjct: 102 KIAKQADQ--YNGFNLLVGDL----QSLWYYGN------KQGQVQPVVPG------VHGL 143
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N S P+ K+ G+++ A+ + + + + L Q + + + P+ + +
Sbjct: 144 CNHLLNS-PWPKLEKGRQQLAQCLTQEDVFEDE--LWQILTNGEQAADDLLPNTGVGLEL 200
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I S YGTR+ +I+L+ + + F E
Sbjct: 201 ERTLSS----------IFIESPEYGTRSSTILLIRQDGWVTFVE 234
>gi|50949984|emb|CAH10533.1| hypothetical protein [Homo sapiens]
Length = 195
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
++ +GGTWL ST GKL L N L R RG LV ++ + YL K
Sbjct: 53 EEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLK--KVS 110
Query: 84 TEENVFNGFHIVLLEL 99
E +++NGF+++ +L
Sbjct: 111 MEGHLYNGFNLIAADL 126
>gi|183220409|ref|YP_001838405.1| hypothetical protein LEPBI_I1005 [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Paris)']
gi|189910523|ref|YP_001962078.1| hypothetical protein LBF_0973 [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Ames)']
gi|167775199|gb|ABZ93500.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Ames)']
gi|167778831|gb|ABZ97129.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Paris)']
Length = 264
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 6/81 (7%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D GGTWL ++ GKL L NV L + PN + RG LV +++E + L D
Sbjct: 44 AGKDLKAGGTWLGVNSFGKLAFLTNVRNLKKPPHPNPKSRGELVFRFLESDGNFS--LND 101
Query: 80 LKKE--TEENVFNGFHIVLLE 98
KE + ++ F GF++ L +
Sbjct: 102 YAKEVHSAKDNFEGFNLFLFD 122
>gi|392566115|gb|EIW59291.1| DUF833-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 313
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 50/251 (19%), Positives = 98/251 (39%), Gaps = 50/251 (19%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYV---EGQKSAEEYLTDL 80
+D + GGTW + +G+L +L N+ E R RG L ++ + + + E + +
Sbjct: 54 RDLLAGGTWAGLNRSGRLALLTNIT-EAPRKYASTRGDLTSSFLLPKDPKATLEAEIDEF 112
Query: 81 KKETEENVFNGFHIVLLE------------LTLQSTNIYHFSNIAPLDSPSTKVTNVTNK 128
E + + GF+++LL L L + P+ + +K
Sbjct: 113 LLENRDRAYAGFNLLLLSPAPARGDSGPSALALDGALATNSGGGGPI------TARMLSK 166
Query: 129 QENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSN 188
+E + G + + KV G + A ++N + + +++ + L+ K +
Sbjct: 167 EERECGGMSNGVDHHGASEWPKVKHGIQTLAAVLNSLPQDATEAEIVERLFALLTWKSEH 226
Query: 189 YPDPEIDRKAEADMDEDYKMRYSRVCVDISSI------------FYGTRTHSIILVDHNN 236
P +DR ++R + + VD I +YGTR ++IL+ +
Sbjct: 227 AP---VDRS---------ELRNT-IMVDPLPIGARESPDTPPPNYYGTRLSTVILIRRDG 273
Query: 237 VMDFHE---WT 244
F E WT
Sbjct: 274 TALFIERDVWT 284
>gi|452749675|ref|ZP_21949433.1| hypothetical protein B381_17924 [Pseudomonas stutzeri NF13]
gi|452006314|gb|EMD98588.1| hypothetical protein B381_17924 [Pseudomonas stutzeri NF13]
Length = 251
Score = 44.3 bits (103), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 88/220 (40%), Gaps = 33/220 (15%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ +T G+ L N+ R RG L Y+ G +YL +L
Sbjct: 45 GRDLQAGGTWMGVTTGGRFAALTNIRAPGQANGPRSRGELPEHYLRGNLPPADYLAELAG 104
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ + GF++ L ++H LDS S + T + VYG N+
Sbjct: 105 RRSD--YAGFNL----LVGDRQALWH------LDSHSGEATALPPG------VYGLSNAG 146
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K+ + A ++ + + L+ + D + P + +
Sbjct: 147 LDT-PWPKLRKARAALAANLD----PAAPEQLLALLADPSPAAEHELP--------QTGV 193
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+++ R S + I+S YGTR + ++ + V+D E
Sbjct: 194 PLEWERRLSSIF--IASPEYGTRASTALISHADGVLDIVE 231
>gi|294627716|ref|ZP_06706298.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|294664071|ref|ZP_06729472.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292598068|gb|EFF42223.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292606156|gb|EFF49406.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 255
Score = 44.3 bits (103), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 45/225 (20%), Positives = 84/225 (37%), Gaps = 35/225 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
L +D GG+W+ +G+ ++ NV + +G RG L+ Y+ G A Y
Sbjct: 43 LLAGRDLRSGGSWVGLGRDGRAAVVTNVRDPLATSSGHSRGHLIADYLSGSLGATVYAD- 101
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ F F+++L + + H SN PL ++ ++G
Sbjct: 102 -ALAGAAHAFPPFNLLLCD----ADRCEHLSNHPPL------------ARQLAAGIHGMS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSES--QNLIQNILDLMKNKQSNYPDPEIDRK 197
N + P+ K ++++++ + Q L + + + P +D
Sbjct: 145 NGPLDA-PWPKT----AALTQVLHRWCADGDEDLQPLWAALGNPAIAPDAALPQTGVDLM 199
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + IS YGTR +I+ VDH+ HE
Sbjct: 200 TERLLSAAF----------ISGPSYGTRASTIVAVDHHGHGFIHE 234
>gi|50410910|ref|XP_457001.1| DEHA2B00770p [Debaryomyces hansenii CBS767]
gi|49652666|emb|CAG84986.1| DEHA2B00770p [Debaryomyces hansenii CBS767]
Length = 325
Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 107/246 (43%), Gaps = 34/246 (13%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGRD--RGPLVVKYVEGQKSAEEYLTDLKK----- 82
GTW+A +T+GKL +L+N ++++ + RG L ++Y+ + EE+ T L K
Sbjct: 54 GTWIAVNTDGKLAVLVNYREKSNKDQISEISRGVLPLQYLISDELNEEWYTKLNKLISEK 113
Query: 83 -------ETEENVFN---GFHIVLLELTL-QSTNIYHFSNIAPLDSPSTKVTNVTNKQEN 131
++NV + GF I+ +L + +ST NI K+
Sbjct: 114 LPLGILDTVDDNVLSRIGGFSILYGQLNINKSTGTIEPLNILSNRGDVGKIFESNGDTSI 173
Query: 132 KDHV-----YGFGNSQCTSQPFQKVIFGKEKFAEIV-NKFNRKSESQNLIQNILDLMKNK 185
D + +G NS + P++KV G++ E + N LI++ +++
Sbjct: 174 SDDIEIQKTFGLSNS-LYNDPWRKVELGQKMLKETIQNSVKDNYTHDQLIESCFEILSYD 232
Query: 186 QSNYPDPEIDRKAEADMDEDYKMRYSRVCV-------DISSI--FYGTRTHSIILVDHNN 236
N + EA ++ + + + D S++ FYGTRT +II++D
Sbjct: 233 TYNRNVAKNGTPHEAILELRNSIYIPPIELGTVNQLPDSSTVGKFYGTRTQTIIILDKKG 292
Query: 237 VMDFHE 242
M ++E
Sbjct: 293 NMHYYE 298
>gi|334344925|ref|YP_004553477.1| hypothetical protein Sphch_1282 [Sphingobium chlorophenolicum L-1]
gi|334101547|gb|AEG48971.1| protein of unknown function DUF833 [Sphingobium chlorophenolicum
L-1]
Length = 252
Score = 44.3 bits (103), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 91/230 (39%), Gaps = 40/230 (17%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLG-ENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L +D + GGTWL S G+ I+ N+ G + RPN RG LV + G++ A
Sbjct: 42 LIAGRDLLSGGTWLGVSDQGRAAIVTNLRGYGDPRPNRASRGALVTDLLAGREVA----- 96
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
E N FN F+++ +++ +N P+ T++T +YG
Sbjct: 97 ------ELNDFNPFNLI----SIEGGQARFLTN-----RPAPLRTDLTPG------LYGL 135
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N + P+ K + K + + + + D ++++ + P I A
Sbjct: 136 SNGALDA-PWPKTLALKSALLDWLVAGAAAPLA------LFDALRSE--SLPHAGIAPDA 186
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
+D+ + + I + YGTR +II VD E D E
Sbjct: 187 PSDVPTEALTS----PIFIRNPVYGTRCSTIIAVDAQGHGTISERRFDAE 232
>gi|443900393|dbj|GAC77719.1| pleiotropic drug resistance proteins [Pseudozyma antarctica T-34]
Length = 478
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 79/179 (44%), Gaps = 23/179 (12%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRD----RGPLVVKYV---------- 67
++DA GGTWL + +G L N P G + RG LV ++
Sbjct: 222 ARDASGGGTWLGVTRSGAFATLTNFTEATPELPPGMERFESRGQLVRDWLLSSHPFPQTR 281
Query: 68 -EGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNI---APLDSPSTKVT 123
E +K ++YL ++ + + F GF++++ +++ Q T + + +N L P K+
Sbjct: 282 QEVEKDIQQYLNNVGDKLQS--FPGFNLLVGKVSKQGTVLGYITNRDSQGELAKPQPKIF 339
Query: 124 NVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLM 182
+ K E G NS QP+ KV G + F + + + + + LI ++ ++
Sbjct: 340 D-PPKIERVRMCRGMSNS-ILDQPWPKVHTGSQAFDAALARASTAPDQEQLIADLFSVL 396
>gi|167586962|ref|ZP_02379350.1| hypothetical protein BuboB_16592 [Burkholderia ubonensis Bu]
Length = 281
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 71/169 (42%), Gaps = 17/169 (10%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAE-EYL 77
L +D GGTWL S +G+ L N + R RG LV Y+ GQ A +YL
Sbjct: 44 LLAGRDLEAGGTWLGVSRDGRFAALTNYRAPFDIRAGAPTRGKLVSDYLGGQPVAPLDYL 103
Query: 78 TDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYG 137
L K V+NGF++++ + + + F N A D+ + V+
Sbjct: 104 ARLAKHAA--VYNGFNLLVGD--WKRRELAWFCNRAAEDAAGVDAPVLVAAG-----VHA 154
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ 186
N + + P+ KV+ + + ++ + + +++ LM++
Sbjct: 155 LSNGRLDT-PWPKVVRKRAELGTLLT-----DDPTPPLDDLIALMRDPH 197
>gi|387813235|ref|YP_005428717.1| hypothetical protein MARHY0811 [Marinobacter hydrocarbonoclasticus
ATCC 49840]
gi|381338247|emb|CCG94294.1| conserved hypothetical protein [Marinobacter hydrocarbonoclasticus
ATCC 49840]
Length = 257
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 93/220 (42%), Gaps = 40/220 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLT 78
L +D GGTWLA S NG++ + NV E S GR RG L V+ + S +
Sbjct: 44 LLAGRDLASGGTWLAISANGQVAAVTNVR-EGSPETGRSSRGELPVRALAEPAS----VL 98
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+E + + GF++V +L +++SN + ++ +G
Sbjct: 99 GAALNSEPDRYAGFNLV----SLTERGGWYYSN-----------RDAHPGRQLFRGSFGL 143
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N P+ K++ ++ FA +V + ++E +L +L L+++ S PD +
Sbjct: 144 SN-HLLQTPWPKLLRLRKAFATVVEAASHRTE--DLHNQLLPLLQDT-SPAPDAMLPDTG 199
Query: 199 EADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVD 233
E + +SS F YGTR +I+ +D
Sbjct: 200 VGIETERF----------LSSPFIVGSDYGTRATTIVTLD 229
>gi|389740204|gb|EIM81395.1| hypothetical protein STEHIDRAFT_66775 [Stereum hirsutum FP-91666
SS1]
Length = 321
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 94/249 (37%), Gaps = 30/249 (12%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQ-----KSAEEYLT 78
+D + GGTWL + G+L +L N+ E RG LV ++ +S + +
Sbjct: 52 RDLLAGGTWLGITRTGRLALLTNIT-EPMSSYKSSRGHLVTSFLAPTSSQHVRSLADEVD 110
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNK---------- 128
DL + + + GF+++LL + +P +S S +TN
Sbjct: 111 DLLSQRLDTPYAGFNMLLLAPRGTGLPDNCSTAASPHNSLSFSAELITNHGGGGPLAHRS 170
Query: 129 -QENKDHVYGFGNS--QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNK 185
E++ H G N + + KV+ G + EI+ +S + I
Sbjct: 171 LTESERHCGGMSNGVDGRGASEWPKVVRGTQSLQEILESLPTRSMENDPFLIIFFTSSRW 230
Query: 186 QSNYPDPEIDRKAE------ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMD 239
+S+ P +DR A + D K FYGTR +++LV +
Sbjct: 231 RSD--PPPVDRGTLRNTIHIAPLPIDPKAPIKEGAKVSGRRFYGTRLSTVVLVRRDGQAR 288
Query: 240 FHE---WTL 245
F E W L
Sbjct: 289 FLERDVWML 297
>gi|120555346|ref|YP_959697.1| hypothetical protein Maqu_2435 [Marinobacter aquaeolei VT8]
gi|120325195|gb|ABM19510.1| protein of unknown function DUF833 [Marinobacter aquaeolei VT8]
Length = 272
Score = 43.9 bits (102), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 94/220 (42%), Gaps = 40/220 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLT 78
L +D GGTWLA S NG++ + NV E S GR RG L V+ + S +
Sbjct: 59 LLAGRDLASGGTWLAISANGQVAAVTNVR-EGSPETGRSSRGELPVRALAEPAS----VL 113
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
+E + + GF++V +L +++SN + ++ +G
Sbjct: 114 GAALNSEPDRYAGFNLV----SLTERGGWYYSN-----------RDAHPGRQLFRGSFGL 158
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N P+ K++ ++ FA +V + ++E +L +L L+++ S PD +
Sbjct: 159 SN-HLLQTPWPKLLRLRKAFATVVEAASHRTE--DLHNQLLPLLQDT-SPAPDAMLP--- 211
Query: 199 EADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVD 233
D + R +SS F YGTR +I+ +D
Sbjct: 212 ----DTGVGIETERF---LSSPFIVGSDYGTRATTIVTLD 244
>gi|222479228|ref|YP_002565465.1| hypothetical protein Hlac_0796 [Halorubrum lacusprofundi ATCC
49239]
gi|222452130|gb|ACM56395.1| protein of unknown function DUF833 [Halorubrum lacusprofundi ATCC
49239]
Length = 277
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+DA GGTW+ S +G + + N + R + R RG LV +E SAE + ++
Sbjct: 51 APRDAEAGGTWIGLSASGVVAAVTNRWLDADRESDRSRGLLVRDCLEAD-SAEAAVRTVE 109
Query: 82 KETEENVFNGFHIVLLE 98
++ E ++GF++VL +
Sbjct: 110 RDLETRSYDGFNLVLAD 126
>gi|374703195|ref|ZP_09710065.1| hypothetical protein PseS9_07341 [Pseudomonas sp. S9]
Length = 266
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 45/219 (20%), Positives = 88/219 (40%), Gaps = 41/219 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL S +G+ L N+ R RG L ++ SA +Y+ LK
Sbjct: 62 AGRDLEAGGTWLGISGDGRFAALTNIRNMQLPAGRRSRGELPANFLRTSLSARDYIETLK 121
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ ++GF++++ + + +F+ ++ Q VYG N+
Sbjct: 122 VSAAD--YSGFNLLVGD----GQQLCYFN------------SHSGQAQTLSAGVYGLSNA 163
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ----SNYPDPEIDRK 197
+ S + KV + K+ A+ + ++ +L+L+ + Q + PD +
Sbjct: 164 ELDSA-WPKVDYAKQALAQQLESPRPEA--------MLELLSDNQVPNDNALPDTGVGIA 214
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
E + + I+S YGTR + ++V +
Sbjct: 215 TERLLGSAF----------IASQTYGTRASTALIVQADG 243
>gi|397688615|ref|YP_006525934.1| hypothetical protein PSJM300_17605 [Pseudomonas stutzeri DSM 10701]
gi|395810171|gb|AFN79576.1| hypothetical protein PSJM300_17605 [Pseudomonas stutzeri DSM 10701]
Length = 253
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 36/77 (46%), Gaps = 2/77 (2%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
L +D GGTW+ + G+ L N+ R RG L +Y+ EEYL
Sbjct: 42 LFAGRDLQAGGTWMGITPAGRFAALTNIRAPGEPLGSRSRGELPERYLRADMPPEEYLDR 101
Query: 80 LKKETEENVFNGFHIVL 96
L + E ++GF++++
Sbjct: 102 LAAQRRE--YSGFNLLV 116
>gi|390571729|ref|ZP_10251966.1| hypothetical protein WQE_25257 [Burkholderia terrae BS001]
gi|389936343|gb|EIM98234.1| hypothetical protein WQE_25257 [Burkholderia terrae BS001]
Length = 273
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 82/188 (43%), Gaps = 38/188 (20%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAE---- 74
L +D V GGTWL + +G+ L N + R + RG LV ++ G E
Sbjct: 44 LLAGRDLVGGGTWLGMTRDGRFAALTNYRAPHEMRADAPTRGTLVSNWLSGDHDIESGAP 103
Query: 75 -EYLTDLKKETEENVFNGFHIVLLELTLQSTNIY-HFSNIAP-LDSPSTKVTNVTNKQEN 131
+YL L+ + +++NGF++++ + T + Y + S AP L +P T
Sbjct: 104 LDYL--LRVAQDGDMYNGFNLLVGDWTRRELAWYCNRSPSAPTLLAPGT----------- 150
Query: 132 KDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPD 191
+G N+ P+ K++ + + A+ + E++ + ++ LM+ D
Sbjct: 151 ----HGISNA-VLDTPWPKLVRKRAELAQAITD-----EARPPLATLIGLMR-------D 193
Query: 192 PEIDRKAE 199
P + R E
Sbjct: 194 PHVARDDE 201
>gi|448449788|ref|ZP_21591885.1| hypothetical protein C470_04048 [Halorubrum litoreum JCM 13561]
gi|445812760|gb|EMA62748.1| hypothetical protein C470_04048 [Halorubrum litoreum JCM 13561]
Length = 259
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S G + + N + R R RG L+V+ SAE+ + +++E
Sbjct: 49 RDAEAGGTWIGVSEGGLVVAVTNRWLDADREGDRSRG-LLVRDCLTAASAEDAVRAVERE 107
Query: 84 TEENVFNGFHIVLLE 98
+E ++GF++VL +
Sbjct: 108 LDERAYDGFNLVLAD 122
>gi|448480560|ref|ZP_21604633.1| hypothetical protein C462_04550 [Halorubrum arcis JCM 13916]
gi|445822101|gb|EMA71875.1| hypothetical protein C462_04550 [Halorubrum arcis JCM 13916]
Length = 259
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S G + + N + R R RG L+V+ SAE+ + +++E
Sbjct: 49 RDAEAGGTWIGVSEGGLVVAVTNRWLDADREGDRSRG-LLVRDCLTAASAEDAVRAVERE 107
Query: 84 TEENVFNGFHIVLLE 98
+E ++GF++VL +
Sbjct: 108 LDERAYDGFNLVLAD 122
>gi|110835208|ref|YP_694067.1| hypothetical protein ABO_2347 [Alcanivorax borkumensis SK2]
gi|110648319|emb|CAL17795.1| conserved hypothetical protein [Alcanivorax borkumensis SK2]
Length = 257
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 85/213 (39%), Gaps = 32/213 (15%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGE--NSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
D GGTWL NG+ + N + +P R RG L +++G+++ E++ L K
Sbjct: 44 DLTAGGTWLGIHRNGRFAGVTNYREPITDRQPGERSRGLLPFAFLKGEQTPEQFSHHLAK 103
Query: 83 ETEENVFNGFHIVL--LELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E++ + F++++ + + ++++ N + + ++G N
Sbjct: 104 --EQHHYGAFNLLVGTVNADPEQDSLWYLGNRG------------ATPRAVRPGIHGLSN 149
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
P+ KV K+ A V Q L+Q + D + S PD + E
Sbjct: 150 G-LLDDPWPKVERAKQNLAAAVTA---GGNLQQLLQVVDDRYQPNDSELPDTGVGIDLER 205
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
+ + I S YGTR S +++D
Sbjct: 206 LI----------APIFIQSETYGTRASSAVILD 228
>gi|448429012|ref|ZP_21584557.1| hypothetical protein C473_16222 [Halorubrum terrestre JCM 10247]
gi|445675387|gb|ELZ27918.1| hypothetical protein C473_16222 [Halorubrum terrestre JCM 10247]
Length = 259
Score = 43.5 bits (101), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S G + + N + R R RG L+V+ SAE+ + +++E
Sbjct: 49 RDAEAGGTWIGVSEGGLVVAVTNRWLDADREGDRSRG-LLVRDCLTAASAEDAVRAVERE 107
Query: 84 TEENVFNGFHIVLLE 98
+E ++GF++VL +
Sbjct: 108 LDERAYDGFNLVLAD 122
>gi|107099323|ref|ZP_01363241.1| hypothetical protein PaerPA_01000335 [Pseudomonas aeruginosa PACS2]
gi|424943246|ref|ZP_18359009.1| hypothetical protein NCGM1179_4435 [Pseudomonas aeruginosa
NCMG1179]
gi|346059692|dbj|GAA19575.1| hypothetical protein NCGM1179_4435 [Pseudomonas aeruginosa
NCMG1179]
Length = 251
Score = 43.5 bits (101), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 44/221 (19%), Positives = 84/221 (38%), Gaps = 33/221 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ ++GF++++ + ++H++ V + +YG N+
Sbjct: 104 GRAAD--YSGFNLLIGD----RHQLWHYN------------PRVGPPRLLPAGIYGLSNA 145
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ K++ K + + + Q L++ + D PD + E
Sbjct: 146 ALDT-PWPKLL----KARAALAERLAEPHPQALMELLADAAPAADGQLPDTGVGLATERL 200
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ V I+S YGTR S++ V + + E
Sbjct: 201 LSS----------VFIASPSYGTRASSVVRVHADGTREMIE 231
>gi|448510785|ref|ZP_21615998.1| hypothetical protein C465_10636 [Halorubrum distributum JCM 9100]
gi|448523730|ref|ZP_21618917.1| hypothetical protein C466_09552 [Halorubrum distributum JCM 10118]
gi|445695539|gb|ELZ47641.1| hypothetical protein C465_10636 [Halorubrum distributum JCM 9100]
gi|445700803|gb|ELZ52794.1| hypothetical protein C466_09552 [Halorubrum distributum JCM 10118]
Length = 259
Score = 43.5 bits (101), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S G + + N + R R RG L+V+ SAE+ + +++E
Sbjct: 49 RDAEAGGTWIGVSEGGLVVAVTNRWLDADREGDRSRG-LLVRDCLTAASAEDAVRAVERE 107
Query: 84 TEENVFNGFHIVLLE 98
+E ++GF++VL +
Sbjct: 108 LDERAYDGFNLVLAD 122
>gi|170091142|ref|XP_001876793.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648286|gb|EDR12529.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 295
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 105/250 (42%), Gaps = 24/250 (9%)
Query: 8 PNPCTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYV 67
PNP + G I +D GGTWL + G++ +L N+ E ++ RG L ++
Sbjct: 36 PNPESPGNILS----GRDEKAGGTWLGINRAGRVALLTNIT-EPAKKFDSSRGELTSSFL 90
Query: 68 EGQKSAEEYLTDLKKETEEN-VFNGFHIVLLELTLQSTNIYHFSNIAPLD--SPSTKVTN 124
SA ++ K N VF GF+++LL L+ + ++ + S T +
Sbjct: 91 L-SDSAHPLQDEVGKIVSPNAVFAGFNLLLLSPALRPDGKLSYDSLLVTNHGSGGTITSR 149
Query: 125 VTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQ-NLIQNILDLMK 183
+ QE G + K+ I+ + K ++ N+ ++ +++
Sbjct: 150 FLSTQERSCGGVSNGVDGAGGNDWPKINHASHDLGAILRPSSTKEWTETNITDHLFEVLA 209
Query: 184 NKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISS-----IFYGTRTHSIILVDHNNVM 238
+ P+P DR + + + + ++ S+ FYGTR+ +++LV + V+
Sbjct: 210 WRS---PEPITDR---SQLRNTVHVAAIPITIEGSTEYVKPAFYGTRSSTVLLVRRDGVV 263
Query: 239 DFHE---WTL 245
F E W L
Sbjct: 264 LFIERDVWKL 273
>gi|336365633|gb|EGN93983.1| hypothetical protein SERLA73DRAFT_126376 [Serpula lacrymans var.
lacrymans S7.3]
Length = 306
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 91/248 (36%), Gaps = 38/248 (15%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSRPNGR---DRGPLVVKYVEGQKSAEEYLTDLK 81
D + GGTWL S GK+ +L N+ + P + RG LV ++ + ++L D+
Sbjct: 56 DVLAGGTWLGLSRTGKIAVLTNI----TEPLAKFTSSRGHLVSSFL--TSDSPKHLQDVV 109
Query: 82 KE--TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F GF+++LL ++ SN PL T VTN H
Sbjct: 110 GEIIPTDAKFAGFNLLLLSPAPSTS-----SNQRPLSFDGTYVTNRGAGGFITSHALSSS 164
Query: 140 NSQC----------TSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNY 189
C + + KV G I+ + + DL+ +
Sbjct: 165 QRHCGGMSNGIEDKGANEWPKVQHGIRSLKAILETSMPDRTEDQFTEELFDLL-----TW 219
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSI----FYGTRTHSIILVDHNNVMDFHE--- 242
PE R+ + V +SI YGTR ++ILV + + F E
Sbjct: 220 TSPEAPRQRSELRNTIQVNPLPIVSNPTASIGHRDLYGTRLSTVILVKRDGQVLFVERDR 279
Query: 243 WTLDYETK 250
W D E K
Sbjct: 280 WKQDAEKK 287
>gi|397734906|ref|ZP_10501609.1| hypothetical protein JVH1_6106 [Rhodococcus sp. JVH1]
gi|396929131|gb|EJI96337.1| hypothetical protein JVH1_6106 [Rhodococcus sp. JVH1]
Length = 254
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 47/223 (21%), Positives = 89/223 (39%), Gaps = 30/223 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + + NV G R RG L V Y+ G+ S +Y +
Sbjct: 42 AGRDLLAGGTWMGIADSLRFAAVTNVRGGAPSTGARSRGALPVDYLRGRLSPADYAEQVA 101
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F++++ + T +TN H +Q + V+G
Sbjct: 102 ATGVE--YGSFNLLVGDPTELRWATNRPH-----------------GRQQRVEPGVHGLS 142
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N++ + P+ KV GK+ FA ++ + ++ D++ + D D E
Sbjct: 143 NAELDT-PWPKVTGGKQAFAAALDADDGSPDADP--GTYFDVLADSDPAPWDSLPDTGIE 199
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+++ R+ I YGTR +++ V + D E
Sbjct: 200 PELERALSSRF------IHYGDYGTRASTLLRVRPDGTFDITE 236
>gi|288556169|ref|YP_003428104.1| hypothetical protein BpOF4_15815 [Bacillus pseudofirmus OF4]
gi|288547329|gb|ADC51212.1| hypothetical protein BpOF4_15815 [Bacillus pseudofirmus OF4]
Length = 260
Score = 43.5 bits (101), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 9/93 (9%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNV---LGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
+D +GGTWL + +G+L + N+ L E RG LV ++ A+EY+
Sbjct: 44 AGRDVERGGTWLGVTKSGRLACVTNIREPLNETEDHEFISRGELVRGFLSSTTPAKEYVE 103
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSN 111
++++ ++ F GF+++ + ++Y++SN
Sbjct: 104 SIRQQKDD--FQGFNLIAGTI----DDVYYYSN 130
>gi|421464595|ref|ZP_15913285.1| NRDE protein [Acinetobacter radioresistens WC-A-157]
gi|400205348|gb|EJO36329.1| NRDE protein [Acinetobacter radioresistens WC-A-157]
Length = 261
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 46/218 (21%), Positives = 97/218 (44%), Gaps = 31/218 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ ++ N G++ + RG LV ++E ++ + L
Sbjct: 43 AGKDLQSGGTWMGITASGRWAVITNFRDGKDRQHYPTSRGHLVQAFLESEQKPIRFAQML 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+K+ +E + GF++V+ + T + SN ++P + VY N
Sbjct: 103 EKQQQE--YAGFNLVIGD----QTQAVYMSNRG--EAPQVLASG----------VYVISN 144
Query: 141 SQCTSQPFQKVIFGKEKFA-EIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA- 198
S +QK + +++F E + R L + D++++++ ++D+ A
Sbjct: 145 G-LMSDSWQKTLHLRKRFTQEFLPMLQRPVAEHILESAVWDILEDER------KVDQNAL 197
Query: 199 -EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHN 235
E + +D ++ S I S YGTR + + + H+
Sbjct: 198 PETGISQDMELLLSSTF--IQSPMYGTRCSNFLRLYHH 233
>gi|311748601|ref|ZP_07722386.1| hypothetical protein ALPR1_20093 [Algoriphagus sp. PR1]
gi|126577125|gb|EAZ81373.1| hypothetical protein ALPR1_20093 [Algoriphagus sp. PR1]
Length = 262
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 46/212 (21%), Positives = 93/212 (43%), Gaps = 34/212 (16%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTW+ NG+ +L N NS R RG LV ++E S ++YL +
Sbjct: 44 KDLKSGGTWMGFHPNGRWALLTNYRDFNSPRKAQISRGKLVQDFLESTVSPKDYLASV-- 101
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
E ++ ++GF++ L +++FSN E ++G N
Sbjct: 102 EVNQDQYDGFNL----LVSDGEKLFYFSNFG------------KGILELAPGIHGLSNG- 144
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNYPDPEIDRKAEAD 201
+ P+ KV K + +++ S++ + ++L+ +K+K++ D D +
Sbjct: 145 LLNDPWPKVELAKSEL--------KQATSEHFTVDSLLETLKSKKTYSIDQLPDTGVPEE 196
Query: 202 MDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
M+ ++ R+ + YGT + + +++D
Sbjct: 197 MEIGLSAQFIRLGDN-----YGTVSSTALILD 223
>gi|420136912|ref|ZP_14644920.1| hypothetical protein PACIG1_0421, partial [Pseudomonas aeruginosa
CIG1]
gi|403250305|gb|EJY63753.1| hypothetical protein PACIG1_0421, partial [Pseudomonas aeruginosa
CIG1]
Length = 155
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
+ ++GF++++ + L IY SN A LD+P K+
Sbjct: 104 GRAAD--YSGFNLLIGDRHQLWHYNPRVGPPRLLPAGIYGLSNAA-LDTPWPKL 154
>gi|420247733|ref|ZP_14751126.1| hypothetical protein PMI06_01437 [Burkholderia sp. BT03]
gi|398070448|gb|EJL61748.1| hypothetical protein PMI06_01437 [Burkholderia sp. BT03]
Length = 273
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 38/188 (20%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAE---- 74
L +D V GGTWL + +G+ L N R + RG LV ++ G E
Sbjct: 44 LLAGRDLVGGGTWLGMTRDGRFAALTNYRAPREMRADAPTRGTLVSNWLSGDHGIESGAP 103
Query: 75 -EYLTDLKKETEENVFNGFHIVLLELTLQSTNIY-HFSNIAP-LDSPSTKVTNVTNKQEN 131
+YL L+ + +++NGF++++ + T + Y + S AP L +P T
Sbjct: 104 LDYL--LRVAQDGDMYNGFNLLVGDWTRRELAWYCNRSPSAPTLLAPGT----------- 150
Query: 132 KDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPD 191
+G N+ P+ K++ + + A+ + E++ + ++ LM+ D
Sbjct: 151 ----HGISNA-VLDTPWPKLVRKRAELAKAIT-----DEARPPLATLIGLMR-------D 193
Query: 192 PEIDRKAE 199
P + R E
Sbjct: 194 PHVARDDE 201
>gi|448446370|ref|ZP_21590689.1| hypothetical protein C471_14365 [Halorubrum saccharovorum DSM 1137]
gi|445684125|gb|ELZ36510.1| hypothetical protein C471_14365 [Halorubrum saccharovorum DSM 1137]
Length = 291
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S +G + + N + R R RG LV +E SAE + +++E
Sbjct: 58 RDAEAGGTWIGLSASGVVAAVTNRWLDADREGDRSRGLLVRDCLE-ADSAEGAVRTVERE 116
Query: 84 TEENVFNGFHIVLLE 98
E ++GF++VL +
Sbjct: 117 LESRSYDGFNLVLAD 131
>gi|392962876|ref|ZP_10328304.1| protein of unknown function DUF833 [Pelosinus fermentans DSM 17108]
gi|421056440|ref|ZP_15519357.1| protein of unknown function DUF833 [Pelosinus fermentans B4]
gi|421062418|ref|ZP_15524574.1| protein of unknown function DUF833 [Pelosinus fermentans B3]
gi|421064740|ref|ZP_15526585.1| protein of unknown function DUF833 [Pelosinus fermentans A12]
gi|421069764|ref|ZP_15530925.1| protein of unknown function DUF833 [Pelosinus fermentans A11]
gi|392437620|gb|EIW15482.1| protein of unknown function DUF833 [Pelosinus fermentans B4]
gi|392443127|gb|EIW20679.1| protein of unknown function DUF833 [Pelosinus fermentans B3]
gi|392449729|gb|EIW26827.1| protein of unknown function DUF833 [Pelosinus fermentans A11]
gi|392451551|gb|EIW28537.1| protein of unknown function DUF833 [Pelosinus fermentans DSM 17108]
gi|392460488|gb|EIW36785.1| protein of unknown function DUF833 [Pelosinus fermentans A12]
Length = 258
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 7/81 (8%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRD---RGPLVVKYVEGQKSAEEYLT 78
+D + GTW+ + +G+ L N + R + D RG LV Y++ Q+S +EYL
Sbjct: 44 AGRDLKELGTWMGITVHGRFAALTNY--RDPRQHTSDALSRGHLVANYLKNQQSPQEYLE 101
Query: 79 DLKKETEENVFNGFHIVLLEL 99
+ K+ ++ +NGF++++ +L
Sbjct: 102 KISKQADQ--YNGFNLLVGDL 120
>gi|406607056|emb|CCH41571.1| hypothetical protein BN7_1112 [Wickerhamomyces ciferrii]
Length = 290
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 107/231 (46%), Gaps = 28/231 (12%)
Query: 30 GTWLATST-NGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAE-EYLTDLKKETEE 86
GTW+A +T N K IL+N G N RG L YV + ++L +L + ++
Sbjct: 54 GTWIALNTENDKFAILVNYREGIKEAINPVSRGVLPQDYVVSNSTTRSQFLNELSAKYDD 113
Query: 87 ----NVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
+ GF+++ +L+ S +I N + +TK D +G NS
Sbjct: 114 GELLSKIGGFNLLFGDLSSNSFDIISNKNNSDFKIFTTK-----------DEYHGLSNSS 162
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSES---QNLIQNILDLMK-NKQSNYPDP---EID 195
+P++KV G++ E +N + + ++L++++ +L+ N N + +
Sbjct: 163 F-DEPWEKVRIGEQLLREYTENYNNSTSTINKEDLVESLFNLLSYNTMENITNDFQGNFE 221
Query: 196 RKAEADMDEDYKMR-YSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTL 245
+ K+R Y+R V ++ +YGTRT ++ILVD++ + + E L
Sbjct: 222 NIKNSIFIPPLKVRDYNRNNV-LAGQWYGTRTQTVILVDNHGHVSYIERNL 271
>gi|392981750|ref|YP_006480337.1| hypothetical protein PADK2_01700 [Pseudomonas aeruginosa DK2]
gi|392317255|gb|AFM62635.1| hypothetical protein PADK2_01700 [Pseudomonas aeruginosa DK2]
Length = 251
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
+ ++GF++++ + L IY SN A LD+P K+
Sbjct: 104 GRAAD--YSGFNLLIGDRHQLWHYNPRVGPPRLLPAGIYGLSNAA-LDTPWPKL 154
>gi|49082222|gb|AAT50511.1| PA0339, partial [synthetic construct]
Length = 252
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
+ ++GF++++ + L IY SN A LD+P K+
Sbjct: 104 GRAAD--YSGFNLLIGDRHQLWHYNPRVGPPRLLPAGIYGLSNAA-LDTPWPKL 154
>gi|15595536|ref|NP_249030.1| hypothetical protein PA0339 [Pseudomonas aeruginosa PAO1]
gi|218889080|ref|YP_002437944.1| hypothetical protein PLES_03361 [Pseudomonas aeruginosa LESB58]
gi|254237425|ref|ZP_04930748.1| hypothetical protein PACG_03500 [Pseudomonas aeruginosa C3719]
gi|254243438|ref|ZP_04936760.1| hypothetical protein PA2G_04253 [Pseudomonas aeruginosa 2192]
gi|386056406|ref|YP_005972928.1| hypothetical protein PAM18_0337 [Pseudomonas aeruginosa M18]
gi|418584387|ref|ZP_13148449.1| hypothetical protein O1O_06976 [Pseudomonas aeruginosa MPAO1/P1]
gi|418591119|ref|ZP_13155020.1| hypothetical protein O1Q_10926 [Pseudomonas aeruginosa MPAO1/P2]
gi|421157338|ref|ZP_15616719.1| hypothetical protein PABE173_0350 [Pseudomonas aeruginosa ATCC
25324]
gi|421178249|ref|ZP_15635864.1| hypothetical protein PAE2_0310 [Pseudomonas aeruginosa E2]
gi|421514953|ref|ZP_15961639.1| hypothetical protein A161_01735 [Pseudomonas aeruginosa PAO579]
gi|451984022|ref|ZP_21932281.1| COG3332 [Pseudomonas aeruginosa 18A]
gi|9946187|gb|AAG03728.1|AE004471_11 hypothetical protein PA0339 [Pseudomonas aeruginosa PAO1]
gi|126169356|gb|EAZ54867.1| hypothetical protein PACG_03500 [Pseudomonas aeruginosa C3719]
gi|126196816|gb|EAZ60879.1| hypothetical protein PA2G_04253 [Pseudomonas aeruginosa 2192]
gi|218769303|emb|CAW25063.1| hypothetical protein PLES_03361 [Pseudomonas aeruginosa LESB58]
gi|347302712|gb|AEO72826.1| hypothetical protein PAM18_0337 [Pseudomonas aeruginosa M18]
gi|375045885|gb|EHS38458.1| hypothetical protein O1O_06976 [Pseudomonas aeruginosa MPAO1/P1]
gi|375050012|gb|EHS42498.1| hypothetical protein O1Q_10926 [Pseudomonas aeruginosa MPAO1/P2]
gi|404348681|gb|EJZ75018.1| hypothetical protein A161_01735 [Pseudomonas aeruginosa PAO579]
gi|404548579|gb|EKA57526.1| hypothetical protein PAE2_0310 [Pseudomonas aeruginosa E2]
gi|404550707|gb|EKA59432.1| hypothetical protein PABE173_0350 [Pseudomonas aeruginosa ATCC
25324]
gi|451758258|emb|CCQ84804.1| COG3332 [Pseudomonas aeruginosa 18A]
gi|453045716|gb|EME93434.1| hypothetical protein H123_12950 [Pseudomonas aeruginosa PA21_ST175]
Length = 251
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
+ ++GF++++ + L IY SN A LD+P K+
Sbjct: 104 GRAAD--YSGFNLLIGDRHQLWHYNPRVGPPRLLPAGIYGLSNAA-LDTPWPKL 154
>gi|262380140|ref|ZP_06073295.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
gi|262298334|gb|EEY86248.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
Length = 261
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 31/218 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ ++ N G + + RG LV ++E ++ + L
Sbjct: 43 AGKDLQSGGTWMGITASGRWAVITNFRDGTDRQHYPTSRGHLVQAFLESEQKPIRFAQML 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+K+ +E + GF++V+ + T + SN ++P + VY N
Sbjct: 103 EKQQQE--YAGFNLVIGD----QTQAVYMSNRG--EAPQVLASG----------VYVISN 144
Query: 141 SQCTSQPFQKVIFGKEKFA-EIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA- 198
S +QK + +++F E + R L + D++++++ ++D+ A
Sbjct: 145 G-LMSDSWQKTLHLRKRFTQEFLPMLQRPVAEHILESAVWDILEDER------KVDQNAL 197
Query: 199 -EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHN 235
E + +D ++ S I S YGTR + + + H+
Sbjct: 198 PETGISQDMELLLSSTF--IQSPMYGTRCSNFLRLYHH 233
>gi|255321172|ref|ZP_05362338.1| ser/thr-rich protein t10 in dgcr region [Acinetobacter
radioresistens SK82]
gi|255301726|gb|EET80977.1| ser/thr-rich protein t10 in dgcr region [Acinetobacter
radioresistens SK82]
Length = 261
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 31/218 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ ++ N G + + RG LV ++E ++ + L
Sbjct: 43 AGKDLQSGGTWMGITASGRWAVITNFRDGTDRQHYPTSRGHLVQAFLESEQKPIRFAQML 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+K+ +E + GF++V+ + T + SN ++P + VY N
Sbjct: 103 EKQQQE--YAGFNLVIGD----QTQAVYMSNRG--EAPQVLASG----------VYVISN 144
Query: 141 SQCTSQPFQKVIFGKEKFA-EIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA- 198
S +QK + +++F E + R L + D++++++ ++D+ A
Sbjct: 145 G-LMSDSWQKTLHLRKRFTQEFLPMLQRPVAEHILESAVWDILEDER------KVDQNAL 197
Query: 199 -EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHN 235
E + +D ++ S I S YGTR + + + H+
Sbjct: 198 PETGISQDMELLLSSTF--IQSPMYGTRCSNFLRLYHH 233
>gi|226946826|ref|YP_002801899.1| hypothetical protein Avin_48200 [Azotobacter vinelandii DJ]
gi|226721753|gb|ACO80924.1| Conserved hypothetical protein [Azotobacter vinelandii DJ]
Length = 250
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 6/94 (6%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D GTWL + +G+ + N+ + S+P G R RG L ++ G+ S E YL +
Sbjct: 44 AGRDLESAGTWLGVAPDGRFAAVTNIR-DPSQPRGRRSRGELPAGFLLGELSPEAYLAQV 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAP 114
+ + + GF+ LL T YH AP
Sbjct: 103 AERARD--YTGFN--LLAGTAGELWYYHARETAP 132
>gi|448509095|ref|XP_003866058.1| aminotransferase [Candida orthopsilosis Co 90-125]
gi|380350396|emb|CCG20618.1| aminotransferase [Candida orthopsilosis Co 90-125]
Length = 318
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 101/261 (38%), Gaps = 66/261 (25%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGR--DRGPLVVKYVEGQKSAEEYLTDLKKETE-- 85
GTW+ +T+GK+ +L+N ++ + RG L + Y+E KS +E+ E +
Sbjct: 55 GTWIGVTTSGKIAVLVNYRDMDTESTMKQVSRGVLPLSYLEATKSDDEWRDSFSTEVQNG 114
Query: 86 ------ENVFNGFHIVLLELTLQSTNIYHFSNIAP--------------------LDSPS 119
+N+ GF ++ L ++ NI LD S
Sbjct: 115 NATLDLKNI-GGFTLLYGSLRIRPGGGMDHLNILSNKGHHGRVFEGRGGSQGVEDLDPIS 173
Query: 120 TKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNI 178
TK T +G NS P++KV G++ IV + +Q+ L++
Sbjct: 174 TKTT------------FGMSNS-LYDDPWKKVELGEKMLEGIVADSVKNLPTQDQLVEEC 220
Query: 179 LDLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSI--------------FYGT 224
L+ + N R D D K+ R + I + +YGT
Sbjct: 221 FQLLSHDTYN-------RAIMKQHDFDVKVMELRNSIFIPPLKREGVESHLVSIGDYYGT 273
Query: 225 RTHSIILVDHNNVMDFHEWTL 245
RT +IIL+D ++++E L
Sbjct: 274 RTQTIILLDKLGNLNYYERNL 294
>gi|421855060|ref|ZP_16287442.1| hypothetical protein ACRAD_05_00790 [Acinetobacter radioresistens
DSM 6976 = NBRC 102413]
gi|403189560|dbj|GAB73643.1| hypothetical protein ACRAD_05_00790 [Acinetobacter radioresistens
DSM 6976 = NBRC 102413]
Length = 261
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 31/218 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ ++ N G + + RG LV ++E ++ + L
Sbjct: 43 AGKDLQSGGTWMGITASGRWAVITNFRDGTDRQHYPTSRGHLVQAFLESEQKPIRFAQML 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
+K+ +E + GF++V+ + T + SN ++P + VY N
Sbjct: 103 EKQQQE--YAGFNLVIGD----QTQAVYMSNRG--EAPQVLASG----------VYVISN 144
Query: 141 SQCTSQPFQKVIFGKEKFA-EIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA- 198
S +QK + +++F E + R L + D++++++ ++D+ A
Sbjct: 145 G-LMSDSWQKTLHLRKRFTQEFLPMLQRPVAEHILESAVWDILEDER------KVDQNAL 197
Query: 199 -EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHN 235
E + +D ++ S I S YGTR + + + H+
Sbjct: 198 PETGISQDMELLLSSTF--IQSPMYGTRCSNFLRLYHH 233
>gi|302759024|ref|XP_002962935.1| hypothetical protein SELMODRAFT_165537 [Selaginella moellendorffii]
gi|300169796|gb|EFJ36398.1| hypothetical protein SELMODRAFT_165537 [Selaginella moellendorffii]
Length = 269
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 34/154 (22%), Positives = 69/154 (44%), Gaps = 13/154 (8%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D +GGTWL S G+L + N+ + RG L V++++ + + ++++ L
Sbjct: 45 AGRDGKEGGTWLGISATGRLAFVTNIREPLEKNAAVSRGFLPVQFLQSELNPKDHIDQLS 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ +NGF++V+ ++ Y S S K+ + ++ ++G N
Sbjct: 105 SQAMS--YNGFNLVVADVHSGEMACY---------SHSLKLGDTLTHDISRG-IHGVSNG 152
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLI 175
S + KV GK K E++ ++ K +I
Sbjct: 153 VFESN-WPKVDRGKRKLKELLERYPNKEIPDKII 185
>gi|359797572|ref|ZP_09300155.1| hypothetical protein KYC_11553 [Achromobacter arsenitoxydans SY8]
gi|359364375|gb|EHK66089.1| hypothetical protein KYC_11553 [Achromobacter arsenitoxydans SY8]
Length = 271
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 44/227 (19%), Positives = 84/227 (37%), Gaps = 41/227 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR------DRGPLVVKYVEGQKSAEE 75
+D GGTW+ + G+ ++ N R GR RG LV ++ G + +
Sbjct: 44 AGRDGQAGGTWMGVTAQGRYALVTNF-----REPGRHLDDAPSRGALVEDFLRGAMAPAD 98
Query: 76 YLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV 135
YL D+ + + +NGF++++ ++ ++ SN + V
Sbjct: 99 YLADVHR--VDQAYNGFNLIVGDI----RQAWYLSN------------RDGGPRALAPGV 140
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
Y N P+ K+ K F ++ + + + L + D ++ P +
Sbjct: 141 YALSN-HLLDTPWPKLARTKAAFTAVL-RGRPQPDLPALYAALADRNPADDADLPATGLP 198
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + + I S YGTR+ SI+ + + HE
Sbjct: 199 LDRERLLSSPF----------IVSPDYGTRSSSIMALHDGGAGELHE 235
>gi|255568307|ref|XP_002525128.1| conserved hypothetical protein [Ricinus communis]
gi|223535587|gb|EEF37255.1| conserved hypothetical protein [Ricinus communis]
Length = 172
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 13/82 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGP-----------LVVKYVEGQKS 72
+DAV GGTWL S G++ L NVL ++ P + RG ++ KS
Sbjct: 24 RDAVAGGTWLGCSRAGRVAFLTNVLELHALPEAKSRGNFQSFSWRKSKFFFFCIIQSAKS 83
Query: 73 AEEYLTDLKKETEENVFNGFHI 94
+++ L KE + +NGF++
Sbjct: 84 PKQFAEMLLKEAHQ--YNGFNL 103
>gi|452880485|ref|ZP_21957453.1| hypothetical protein G039_30524 [Pseudomonas aeruginosa VRFPA01]
gi|452183087|gb|EME10105.1| hypothetical protein G039_30524 [Pseudomonas aeruginosa VRFPA01]
Length = 251
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 52/115 (45%), Gaps = 18/115 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL G+ L NV + S+P G R RG LV ++ + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVR-DPSQPLGKRSRGELVADFLRDGGNPADYLARV 102
Query: 81 KKETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
K + ++GF++++ + L +Y SN A LDSP K+
Sbjct: 103 AKRAAD--YSGFNLLVGDRHQLWHYNPRVGPPLLLPAGVYGLSNAA-LDSPWPKL 154
>gi|119475455|ref|ZP_01615808.1| hypothetical protein GP2143_16586 [marine gamma proteobacterium
HTCC2143]
gi|119451658|gb|EAW32891.1| hypothetical protein GP2143_16586 [marine gamma proteobacterium
HTCC2143]
Length = 266
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 44/230 (19%), Positives = 94/230 (40%), Gaps = 28/230 (12%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPN-GRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + G+ + N ++ PN RG L ++ S +++L ++
Sbjct: 44 AGKDLQAGGTWMGVTRTGRFAAVTNHRSTHTPPNPAISRGALCTDFLRSNVSPQQFLQEI 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
K + + GF+++L + +++N + E K ++G N
Sbjct: 104 GKNRLD--YAGFNLLLGS----PNQLLYYAN------------QTGDAVELKAGIHGLSN 145
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ + KV GK + + + +E+++L+ ++D S PD I AE
Sbjct: 146 G-VLNDSWPKVEQGKSALSAALAQ---STEAEHLLHILMDDNPADASQVPDTGIGIDAEI 201
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETK 250
+ + +++ YGTR +++ +D N ++ E + K
Sbjct: 202 VLSSRFIPP-----INVRDNHYGTRNCTVLKIDRQNNSEWLEQAFSPDGK 246
>gi|116054067|ref|YP_788510.1| hypothetical protein PA14_04430 [Pseudomonas aeruginosa UCBPP-PA14]
gi|313112045|ref|ZP_07797829.1| hypothetical protein PA39016_004100005 [Pseudomonas aeruginosa
39016]
gi|355646546|ref|ZP_09054501.1| hypothetical protein HMPREF1030_03587 [Pseudomonas sp. 2_1_26]
gi|386068756|ref|YP_005984060.1| hypothetical protein NCGM2_5864 [Pseudomonas aeruginosa NCGM2.S1]
gi|421165196|ref|ZP_15623537.1| hypothetical protein PABE177_0362 [Pseudomonas aeruginosa ATCC
700888]
gi|421172059|ref|ZP_15629839.1| hypothetical protein PACI27_0316 [Pseudomonas aeruginosa CI27]
gi|115589288|gb|ABJ15303.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
gi|310884331|gb|EFQ42925.1| hypothetical protein PA39016_004100005 [Pseudomonas aeruginosa
39016]
gi|348037315|dbj|BAK92675.1| hypothetical protein NCGM2_5864 [Pseudomonas aeruginosa NCGM2.S1]
gi|354828505|gb|EHF12625.1| hypothetical protein HMPREF1030_03587 [Pseudomonas sp. 2_1_26]
gi|404538286|gb|EKA47829.1| hypothetical protein PACI27_0316 [Pseudomonas aeruginosa CI27]
gi|404542727|gb|EKA52038.1| hypothetical protein PABE177_0362 [Pseudomonas aeruginosa ATCC
700888]
Length = 251
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWL G+ L NV + R RG LV ++ G + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVRDPSQALGKRSRGELVADFLRGGGNPADYLAQVA 103
Query: 82 KETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
+ ++GF++++ + L +Y SN A LD+P K+
Sbjct: 104 GRAAD--YSGFNLLIGDRHQLWHYNPRVGPPRLLPAGVYGLSNAA-LDTPWPKL 154
>gi|448432544|ref|ZP_21585619.1| hypothetical protein C472_05075 [Halorubrum tebenquichense DSM
14210]
gi|445686964|gb|ELZ39263.1| hypothetical protein C472_05075 [Halorubrum tebenquichense DSM
14210]
Length = 256
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ + G + N + R R RG L+V+ SAEE + ++ +
Sbjct: 46 RDAEAGGTWIGVAEGGLAVAVTNRWLDADRDGDRSRG-LLVRDCLTADSAEEAVRTVEDD 104
Query: 84 TEENVFNGFHIVLLE 98
+E ++GF++VL +
Sbjct: 105 VDERAYDGFNLVLAD 119
>gi|119776761|ref|YP_929501.1| hypothetical protein Sama_3629 [Shewanella amazonensis SB2B]
gi|119769261|gb|ABM01832.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
Length = 253
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 48/224 (21%), Positives = 89/224 (39%), Gaps = 38/224 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
QD + GGTWL T G++ L N+ + R RG L+ +Y++ + +L
Sbjct: 44 AGQDLLAGGTWLGTHRQGRVAALTNIRSPQHHSDNRASRGELIPRYLQDPNNFPPWLM-- 101
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG- 139
T +N F+++ E + F+++ KQE + G
Sbjct: 102 ---THAANYNPFNLIFGE----GQRFFCFNSL---------------KQEQTELAPGCHA 139
Query: 140 -NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
++ + K+ G+ + K + E+ L++ + D K SN P + +
Sbjct: 140 ISNGAMDDIWPKMASGQRALEALSAKPGKPDETL-LLEMMRDESKPDDSNLPQTGVSLEW 198
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E + Y +R+ YGTR+ S++L+D +F E
Sbjct: 199 ERRLSSIY-IRHEE---------YGTRSTSLLLIDKAGQQEFTE 232
>gi|402571388|ref|YP_006620731.1| hypothetical protein Desmer_0828 [Desulfosporosinus meridiei DSM
13257]
gi|402252585|gb|AFQ42860.1| hypothetical protein Desmer_0828 [Desulfosporosinus meridiei DSM
13257]
Length = 260
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 89/215 (41%), Gaps = 33/215 (15%)
Query: 30 GTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + +G+L L N + + + RG LV ++ + YL ++ + +
Sbjct: 52 GTWMGITRSGRLAALTNFRDPSLLLTDPKSRGALVSNFLCLNEPPIGYLRNVN--SYQGS 109
Query: 89 FNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQC-TSQP 147
+N F++++ + S +Y+ A E K +YG N TS P
Sbjct: 110 YNPFNLIVGD---PSCLLYYSQQTA-------------ETLELKPGIYGLSNHLLDTSWP 153
Query: 148 FQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADMDEDYK 207
KV K+K A + E+ L +LD K + P+ ID + E + +
Sbjct: 154 --KVSESKQKLARYIES-RPTIETDPLFDILLDEEKANDRDLPNTGIDYELEKLLSSIF- 209
Query: 208 MRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
I YGTR+ +++L+ NN++ F E
Sbjct: 210 ---------IRGTVYGTRSSTVLLIARNNLVTFRE 235
>gi|448456104|ref|ZP_21594957.1| hypothetical protein C469_04362 [Halorubrum lipolyticum DSM 21995]
gi|445812939|gb|EMA62925.1| hypothetical protein C469_04362 [Halorubrum lipolyticum DSM 21995]
Length = 298
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 1/79 (1%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+DA GGTW+ S +G + + N + R R RG L+V+ SAE+ + ++
Sbjct: 81 APRDAEAGGTWIGLSGSGVVAAVTNRWLDADREGDRSRG-LLVRDCLTADSAEDAVRTVE 139
Query: 82 KETEENVFNGFHIVLLELT 100
++ E ++GF++VL + T
Sbjct: 140 RDLETRSYDGFNLVLADET 158
>gi|448495706|ref|ZP_21610151.1| hypothetical protein C463_16267 [Halorubrum californiensis DSM
19288]
gi|445687799|gb|ELZ40074.1| hypothetical protein C463_16267 [Halorubrum californiensis DSM
19288]
Length = 298
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+DA GGTW+ S G + + N + R R RG L+V+ SAE + ++
Sbjct: 49 APRDAEAGGTWIGVSEGGLVVAVTNRWLDADRDGDRSRG-LLVRDCLTAASAEAAVKSVE 107
Query: 82 KETEENVFNGFHIVLLE 98
+E +E + GF++VL +
Sbjct: 108 RELDERAYAGFNLVLAD 124
>gi|300704087|ref|YP_003745689.1| hypothetical protein RCFBP_11785 [Ralstonia solanacearum CFBP2957]
gi|421888151|ref|ZP_16319262.1| conserved hypothetical protein, DUF833 [Ralstonia solanacearum
K60-1]
gi|299071750|emb|CBJ43074.1| conserved protein of unknown function, DUF833 [Ralstonia
solanacearum CFBP2957]
gi|378966498|emb|CCF96010.1| conserved hypothetical protein, DUF833 [Ralstonia solanacearum
K60-1]
Length = 272
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 94/230 (40%), Gaps = 27/230 (11%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQ-KSAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ + S YL
Sbjct: 47 DLAQVVGHAGTWMGMARDGRFAALTNYRAPSERRPDARSRGELVADFLSAEDASIPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
LK T +NG+++ LT ++ SN A ++P + + VYG
Sbjct: 107 RLK--TRNGTYNGYNL----LTATRDELWWTSNRA--EAP----------RRLEPGVYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ KV FAE + + + + LDL+ + P A
Sbjct: 149 SNALLDT-PWFKVRHRIAAFAEALAADTGRHGNAIDVARYLDLLAETREA---PTGALPA 204
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
E K+ + I S YGTR +++ + H+ D E D +
Sbjct: 205 TGVAPEWEKLLSAAF---IRSPRYGTRASTVLRIRHDGCFDVTERRFDAQ 251
>gi|330805598|ref|XP_003290767.1| hypothetical protein DICPUDRAFT_155307 [Dictyostelium purpureum]
gi|325079080|gb|EGC32698.1| hypothetical protein DICPUDRAFT_155307 [Dictyostelium purpureum]
Length = 338
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 80/165 (48%), Gaps = 25/165 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGEN----SRPNGR-DRGPLVVKYVEGQKSAEEYLT 78
+D + GGTWL + GK I+LN+ +N S +G+ RG +V Y+ + S +Y++
Sbjct: 72 KDKIGGGTWLGVNNRGKFCIILNLYNKNFHDSSSGSGKLSRGKIVHNYLSSKISPFDYIS 131
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYH--FSNIAPLDSPSTKVTNVTN-----KQEN 131
++K + ++ F I+++ + S Y+ S++ P K NV K
Sbjct: 132 LMEKNRHQ--YHPF-ILIVGDIINSNEKYYCLLSHLLPPKELYNKELNVKTYPSYLKAIE 188
Query: 132 KDHVYGFGNSQCTSQPFQ----KVIFGKEKFAEIVNK-FNRKSES 171
+D V+G N P + KV GK++ ++ + + F +K+ S
Sbjct: 189 RDTVFGISN-----YPLEWNTPKVTKGKQRLSDTLKEIFEKKAHS 228
>gi|17546499|ref|NP_519901.1| signal peptide protein [Ralstonia solanacearum GMI1000]
gi|17428797|emb|CAD15482.1| putative signal peptide protein [Ralstonia solanacearum GMI1000]
gi|299066791|emb|CBJ37985.1| conserved protein of unknown function, DUF833 [Ralstonia
solanacearum CMR15]
Length = 272
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 94/230 (40%), Gaps = 27/230 (11%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQ-KSAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ + S YL
Sbjct: 47 DLAQVVGHAGTWMGMTRDGRFAALTNYRAPSERRPDARSRGELVANFLAAEDASIPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
LK V+NG+++ LT ++ SN A ++P ++ VYG
Sbjct: 107 QLK--ARNGVYNGYNL----LTATRDELWWTSNRA--EAP----------RQLAPGVYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ KV FAE + + + + LDL+ + P A
Sbjct: 149 SNALLDT-PWFKVRHRVAAFAEALAADTGRHGNAIDVARYLDLLAETREA---PLGALPA 204
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
E K+ + I S YGTR +++ + H+ D E D +
Sbjct: 205 TGVAPEWEKLLSAAF---IRSPRYGTRASTVLRIRHDGCFDVTERRFDAQ 251
>gi|89900124|ref|YP_522595.1| hypothetical protein Rfer_1330 [Rhodoferax ferrireducens T118]
gi|89344861|gb|ABD69064.1| protein of unknown function DUF833 [Rhodoferax ferrireducens T118]
Length = 266
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL S +G+L L N + R + RG LV Y++ SA +YL +L
Sbjct: 43 AGRDLQAGGTWLGISRSGRLAALTNHRNPAALRADAPSRGELVSAYLQTDASALDYLMEL 102
Query: 81 KKETEENVFNGFHIVLLELT 100
+ +N F++++ + T
Sbjct: 103 AGRAGD--YNPFNLLVFDGT 120
>gi|448501262|ref|ZP_21612130.1| hypothetical protein C464_08580 [Halorubrum coriense DSM 10284]
gi|445695350|gb|ELZ47457.1| hypothetical protein C464_08580 [Halorubrum coriense DSM 10284]
Length = 272
Score = 41.6 bits (96), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S G + + N + R R RG L+V+ SAE + +++E
Sbjct: 62 RDAEAGGTWIGVSEGGLVVAVTNRWTDAEREGERSRG-LLVRDCLTADSAEAAVRAVERE 120
Query: 84 TEENVFNGFHIVLLE 98
+E + GF++VL +
Sbjct: 121 LDERAYAGFNLVLAD 135
>gi|302757836|ref|XP_002962341.1| hypothetical protein SELMODRAFT_404056 [Selaginella moellendorffii]
gi|300169202|gb|EFJ35804.1| hypothetical protein SELMODRAFT_404056 [Selaginella moellendorffii]
Length = 269
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 32/144 (22%), Positives = 66/144 (45%), Gaps = 13/144 (9%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D +GGTWL S G+L + N+ + RG L V++++ + + ++++ L
Sbjct: 45 AGRDGKEGGTWLGISATGRLAFVTNIREPLEKNAAVSRGFLPVQFLQSELNPKDHIDQLS 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ +NGF++V+ ++ Y S S K+ + ++ ++G N
Sbjct: 105 SQAMS--YNGFNLVVADVHSGEMACY---------SHSLKLGDTLTHDISRG-IHGVSNG 152
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKF 165
S + KV GK K E++ ++
Sbjct: 153 VFESN-WPKVDRGKRKLKELLERY 175
>gi|389780850|ref|ZP_10194362.1| hypothetical protein UU7_10580 [Rhodanobacter spathiphylli B39]
gi|388435836|gb|EIL92728.1| hypothetical protein UU7_10580 [Rhodanobacter spathiphylli B39]
Length = 258
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 42/214 (19%), Positives = 83/214 (38%), Gaps = 29/214 (13%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKK 82
+D GGTWL + +G+ G++ NV G RG L+ Y+ G+ A + +L
Sbjct: 45 GRDREAGGTWLGVNEDGRCGVVTNVRDPADPQRGASRGLLITDYLAGKVDAPRHAQNLL- 103
Query: 83 ETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQ 142
T + F++ LT ++ N P ++ VT V+G N+
Sbjct: 104 -TAGAQYRPFNL----LTFDPHAAFYLGN-----RPEPRMQPVTPG------VHGLSNAD 147
Query: 143 CTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEADM 202
+ P+ K + ++ + +++ +L + D + PD + + E +
Sbjct: 148 FNT-PWPKTRALMRRLQAWIDNGD-EADFGDLFDALADERQAPDDQLPDTGVGLERERWL 205
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNN 236
+ I YGTR +++ + H+
Sbjct: 206 SSAF----------IRGERYGTRASTLVAIGHDG 229
>gi|241958338|ref|XP_002421888.1| conserved hypothetical protein [Candida dubliniensis CD36]
gi|223645233|emb|CAX39832.1| conserved hypothetical protein [Candida dubliniensis CD36]
Length = 329
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 107/254 (42%), Gaps = 52/254 (20%)
Query: 30 GTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEY-------LTDL 80
GTW+ +TNGK+ +L+N E + RG L + Y+ G++S E+ D
Sbjct: 55 GTWIGVTTNGKIAVLVNYRENDEKEAISEVSRGILPLDYLCGKESDNEWKDNLASTFRDG 114
Query: 81 KKETEENVFNGFHIVLLELTLQ-STNIYHFSNIAPLDSPSTKVTNVTNKQENKD------ 133
+ + GF ++ +LT+ T H NI KV +K +KD
Sbjct: 115 RTTIDLKKIGGFSLLYGKLTINPETGKIHHLNILSNRGDQGKV--FKSKANSKDIANGSI 172
Query: 134 -------HVYGFGNSQCT-SQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKN 184
H FG S ++P+ KV G+ + ++++ + SQ+ LI+ L+ +
Sbjct: 173 DVSDDIPHKTTFGLSNSLYNKPWDKVKLGELLLSNLIDESITNNYSQDELIEKCFQLLSH 232
Query: 185 KQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDI------------SSI----FYGTRTHS 228
N ++ D++ ++S + I SI +YGTRT +
Sbjct: 233 DTYN---------SKILNSNDFEKKFSELKYSIFIPPLIRNNYETDSIAVGKYYGTRTQT 283
Query: 229 IILVDHNNVMDFHE 242
IIL+D ++++E
Sbjct: 284 IILLDKYGNLNYYE 297
>gi|88799732|ref|ZP_01115306.1| hypothetical protein MED297_14165 [Reinekea blandensis MED297]
gi|88777466|gb|EAR08667.1| hypothetical protein MED297_14165 [Reinekea sp. MED297]
Length = 248
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 7/94 (7%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
QD GGTW+ +T+ + +L N+ G R RG LVV ++ Q + E + ++
Sbjct: 43 AGQDLKAGGTWMGVTTDRRFALLTNIRPGYVGVQGERSRGDLVVSFLAHQDTIETFHANI 102
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAP 114
+ E + + GF+++L + + ++ FS+ P
Sbjct: 103 RAEIQN--YAGFNLLLGD----ARRVFWFSSDHP 130
>gi|322802680|gb|EFZ22913.1| hypothetical protein SINV_16348 [Solenopsis invicta]
Length = 73
Score = 41.2 bits (95), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 34/61 (55%), Gaps = 17/61 (27%)
Query: 190 PDPEIDRKAEADMDEDYKMRYSRVCVDISSIF-----YGTRTHSIILVDHNNVMDFHEWT 244
PD E+ RK +RY D+SSIF YGTRTHSI+L+D +N + F E T
Sbjct: 4 PDDELQRKC--------PLRYE----DLSSIFVSGKEYGTRTHSILLIDGSNQVTFVEET 51
Query: 245 L 245
L
Sbjct: 52 L 52
>gi|448469023|ref|ZP_21600032.1| hypothetical protein C468_13878 [Halorubrum kocurii JCM 14978]
gi|445809850|gb|EMA59886.1| hypothetical protein C468_13878 [Halorubrum kocurii JCM 14978]
Length = 267
Score = 41.2 bits (95), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+DA GGTW+ S +G + + N + R + R RG L+V+ SAE+ + ++
Sbjct: 50 APRDAEAGGTWIGLSGSGVVAAVTNRWLDADRRSDRSRG-LLVRDCLTADSAEDAVRRVE 108
Query: 82 KETEENVFNGFHIVLLE 98
++ E ++GF++VL +
Sbjct: 109 RDLETRSYDGFNLVLAD 125
>gi|311106681|ref|YP_003979534.1| hypothetical protein AXYL_03499 [Achromobacter xylosoxidans A8]
gi|310761370|gb|ADP16819.1| hypothetical protein AXYL_03499 [Achromobacter xylosoxidans A8]
Length = 262
Score = 41.2 bits (95), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 44/223 (19%), Positives = 83/223 (37%), Gaps = 33/223 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSR--PNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D + GG+W+ +T G+ ++ N E + N RG LV Y+ G S +YL
Sbjct: 44 AGRDGLAGGSWMGVTTGGRHALVTNFR-EPGKLIQNAPSRGALVEDYLRGAASPADYLAQ 102
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
+ + +NGF++++ + + ++ SN + VY
Sbjct: 103 VHAAGQ--AYNGFNLIVGD----AHEAWYLSN------------RDGGPRRLAPGVYALS 144
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N P+ K+ K F E V + + + L+ + D + P + E
Sbjct: 145 N-HLLDTPWPKLARTKAAF-EAVLRSGPQPDLPALMAALADRQPAGDAELPATGLPLDRE 202
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I S YGTR+ S++ + + E
Sbjct: 203 RLLSSPF----------IVSPNYGTRSSSVLALRDDGAGQLDE 235
>gi|304311560|ref|YP_003811158.1| hypothetical protein HDN1F_19300 [gamma proteobacterium HdN1]
gi|301797293|emb|CBL45513.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
Length = 254
Score = 41.2 bits (95), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 48/216 (22%), Positives = 83/216 (38%), Gaps = 38/216 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + +G+ L NV N+ RG LV ++ G+ SA Y +
Sbjct: 44 AGRDLQAGGTWLGVTRSGRFAALTNVREPANAYQGSLSRGTLVSDFLSGKDSAVNYADSV 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK----DHVY 136
+ + GF+++L S K+ V+N+ + VY
Sbjct: 104 --HVIGSAYAGFNLLL--------------------SDGEKLVWVSNRDGDPRILTSGVY 141
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
G N+ + P+ KV GK +E + + ++ LI + D PD +
Sbjct: 142 GVSNALLDT-PWPKVAGGKAAISERLPLHQPEQQTAGLITLLQDGTVAPDEELPDTGVGL 200
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
E + + +R S+ YGTR + + +
Sbjct: 201 MMERILSPRF-IRTSQ---------YGTRASTALFM 226
>gi|261884661|ref|ZP_06008700.1| hypothetical protein CfetvA_05282 [Campylobacter fetus subsp.
venerealis str. Azul-94]
Length = 190
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 83/194 (42%), Gaps = 48/194 (24%)
Query: 60 GPLVVKYVEGQKSAEEYLTDLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPS 119
G +V ++ G AEEYL +L ++ +NGF++++ +Y++SNIA
Sbjct: 19 GEIVRNFLSGDLLAEEYLKELHSNKDQ--YNGFNLIVG----TPDELYYYSNIA------ 66
Query: 120 TKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL 179
K+ V + +G N P+ KV+ G N+
Sbjct: 67 GKIEKV------EPGTHGLSN-HLLDTPWPKVVKGT--------------------TNLR 99
Query: 180 DLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDI----SSIF-----YGTRTHSII 230
+ + P+P + A A+ D ++ + V +D+ SS+F YGTR +++
Sbjct: 100 AYAEQHEELQPEPLFEILANAEQAGDEELPNTGVGIDLERSLSSMFIKIPNYGTRCSTVV 159
Query: 231 LVDHNNVMDFHEWT 244
LVD +N + F E T
Sbjct: 160 LVDRHNQVTFVERT 173
>gi|161524551|ref|YP_001579563.1| hypothetical protein Bmul_1378 [Burkholderia multivorans ATCC
17616]
gi|189350693|ref|YP_001946321.1| hypothetical protein BMULJ_01865 [Burkholderia multivorans ATCC
17616]
gi|160341980|gb|ABX15066.1| protein of unknown function DUF833 [Burkholderia multivorans ATCC
17616]
gi|189334715|dbj|BAG43785.1| conserved hypothetical protein [Burkholderia multivorans ATCC
17616]
Length = 281
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 6/95 (6%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAE-EYLTD 79
+D GGTWL S +G+ L N + R RG LV Y+ G A +YL
Sbjct: 46 AGRDLEAGGTWLGVSRDGRFAALTNYRAPFDIRAGAPTRGKLVSDYLGGPAVAPLDYLAQ 105
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAP 114
L + V+NGF++++ + + + F N AP
Sbjct: 106 LAEHAA--VYNGFNLLVGD--WKRRELAWFCNRAP 136
>gi|221198169|ref|ZP_03571215.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|221208340|ref|ZP_03581343.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221215088|ref|ZP_03588055.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
gi|421476900|ref|ZP_15924757.1| PF05742 family protein [Burkholderia multivorans CF2]
gi|221165024|gb|EED97503.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
gi|221171753|gb|EEE04197.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221182101|gb|EEE14502.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|400227465|gb|EJO57465.1| PF05742 family protein [Burkholderia multivorans CF2]
Length = 281
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 6/95 (6%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAE-EYLTD 79
+D GGTWL S +G+ L N + R RG LV Y+ G A +YL
Sbjct: 46 AGRDLEAGGTWLGVSRDGRFAALTNYRAPFDIRAGAPTRGKLVSDYLGGPAVAPLDYLAQ 105
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAP 114
L + V+NGF++++ + + + F N AP
Sbjct: 106 LAEHAA--VYNGFNLLVGD--WKRRELAWFCNRAP 136
>gi|421473058|ref|ZP_15921205.1| PF05742 family protein [Burkholderia multivorans ATCC BAA-247]
gi|400221797|gb|EJO52225.1| PF05742 family protein [Burkholderia multivorans ATCC BAA-247]
Length = 281
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 6/95 (6%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAE-EYLTD 79
+D GGTWL S +G+ L N + R RG LV Y+ G A +YL
Sbjct: 46 AGRDLEAGGTWLGVSRDGRFAALTNYRAPFDIRAGAPTRGKLVSDYLGGPAVAPLDYLAQ 105
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAP 114
L + V+NGF++++ + + + F N AP
Sbjct: 106 LAEHAA--VYNGFNLLVGD--WKRRELAWFCNRAP 136
>gi|119623409|gb|EAX03004.1| chromosome 22 open reading frame 25, isoform CRA_d [Homo sapiens]
Length = 148
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 129 QENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSN 188
Q YG N+ + P++K+ FGK+ F E V + ++ LI ++LD++ N+++
Sbjct: 84 QARGRGTYGLSNALLET-PWRKLCFGKQLFLEAVER-SQALPKDVLIASLLDVLNNEEAQ 141
Query: 189 YPDPEID 195
PDP I+
Sbjct: 142 LPDPAIE 148
>gi|254515779|ref|ZP_05127839.1| conserved hypothetical protein [gamma proteobacterium NOR5-3]
gi|219675501|gb|EED31867.1| conserved hypothetical protein [gamma proteobacterium NOR5-3]
Length = 262
Score = 40.8 bits (94), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 51/129 (39%), Gaps = 21/129 (16%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL--GENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D GGTWL S +G++ + N+ R R RG L ++ G S E YL +
Sbjct: 44 AGRDLQAGGTWLGVSRSGRIAAITNIRDPAAGDRSALRSRGELTHNFLAGGASPETYLAE 103
Query: 80 LKKETEENVFNGFHIVLLELT----------------LQSTNIYHFSNIAPLDSPSTKVT 123
+ + + GF++++ + + IY SN A LD P KV
Sbjct: 104 IADRIGD--YQGFNLLVGDRQSLWYLHGNAADKTPPRALAPGIYALSNAA-LDVPWPKVR 160
Query: 124 NVTNKQENK 132
+ N
Sbjct: 161 RAREQLHNA 169
>gi|281206524|gb|EFA80710.1| hypothetical protein PPL_06296 [Polysphondylium pallidum PN500]
Length = 1181
Score = 40.8 bits (94), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 43/77 (55%), Gaps = 11/77 (14%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNV-----LGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
+D +GG+WL +T G+ ++LN + E P + RG ++ Y+ ++ ++YLT
Sbjct: 954 RDLKQGGSWLGINTKGQFAVVLNYRSIKNIVEPIYP--KSRGSVIEDYLLSGETPQQYLT 1011
Query: 79 DLKKETEE----NVFNG 91
LK+E ++ NVF G
Sbjct: 1012 KLKEEIDQYGPFNVFVG 1028
>gi|152987011|ref|YP_001345827.1| hypothetical protein PSPA7_0432 [Pseudomonas aeruginosa PA7]
gi|150962169|gb|ABR84194.1| hypothetical protein PSPA7_0432 [Pseudomonas aeruginosa PA7]
Length = 251
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 52/115 (45%), Gaps = 18/115 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL G+ L NV + S+P G R RG LV ++ + +YL +
Sbjct: 44 AGRDLEAGGTWLGLGPAGRFAALTNVR-DPSQPLGKRSRGELVADFLRDGGNPADYLARV 102
Query: 81 KKETEENVFNGFHIVLLE-------------LTLQSTNIYHFSNIAPLDSPSTKV 122
+ + ++GF++++ + L +Y SN A LDSP K+
Sbjct: 103 AERAAD--YSGFNLLVGDRHQLWHYNPRVGPPLLLPAGVYGLSNAA-LDSPWPKL 154
>gi|71907822|ref|YP_285409.1| hypothetical protein Daro_2200 [Dechloromonas aromatica RCB]
gi|71847443|gb|AAZ46939.1| Protein of unknown function DUF833 [Dechloromonas aromatica RCB]
Length = 249
Score = 40.8 bits (94), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 49/220 (22%), Positives = 86/220 (39%), Gaps = 38/220 (17%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKET 84
D GGTWL + G+ + NV RG L ++ + +A EY + +
Sbjct: 47 DLEAGGTWLGITGMGRFAAVTNVREPGVVKGLLSRGALTRNFLTSELTASEYASQIDYAQ 106
Query: 85 EENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNSQCT 144
++GF+++L + ++ + SN + VYG N Q
Sbjct: 107 ----YSGFNLLLSD----GESLVYCSN------------RDGQPRVLAPGVYGLSN-QLL 145
Query: 145 SQPFQKVIFGKEKFAEIVNKFNRKSESQNLI--QNILDLMKNKQSNYPDPEIDRKAEADM 202
P+ K++ +E+FAE + + + +L+ Q I+D KN S E +R A
Sbjct: 146 DSPWPKLLQARERFAEALPRLPDEPAFFDLLADQGIVD-DKNLPSTGAPIEWERLLSA-- 202
Query: 203 DEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + S YGTR +++ N + HE
Sbjct: 203 ------------IFVKSENYGTRASTLVWQGANGDITIHE 230
>gi|384106545|ref|ZP_10007452.1| hypothetical protein W59_34568 [Rhodococcus imtechensis RKJ300]
gi|383833881|gb|EID73331.1| hypothetical protein W59_34568 [Rhodococcus imtechensis RKJ300]
Length = 254
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 47/229 (20%), Positives = 85/229 (37%), Gaps = 34/229 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + + NV R RG L V Y+ G+ S +Y +
Sbjct: 42 AGRDLLAGGTWMGIADSLRFAAVTNVRDGAPSTGARSRGALPVDYLRGRLSPADYAAQVA 101
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F++++ + +TN H +Q + V+G
Sbjct: 102 ATGAE--YGSFNLLVGDPAELWWATNRPH-----------------GRRQRVEPGVHGLS 142
Query: 140 NSQCTSQPFQKVIFGKE--KFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N++ + P+ KV GK+ A + N ++ + D PD I
Sbjct: 143 NAELDT-PWPKVTGGKQAFAAALAADDGNPDADPGAYFDVLADSDPAPWEALPDTGI--- 198
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
E +++ R+ I YGTR +++ V + D E D
Sbjct: 199 -EPELERALSSRF------IHYGDYGTRASTLLRVRADGTFDITERRFD 240
>gi|328868562|gb|EGG16940.1| hypothetical protein DFA_07921 [Dictyostelium fasciculatum]
Length = 345
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 103/222 (46%), Gaps = 13/222 (5%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLN--VLGENSRPNGRDRGPLVVKYVEGQKS-AEEYLT 78
+D GGTW + G ILLN V E P+ + RG ++ ++++ S ++Y+
Sbjct: 103 AGRDLKFGGTWFGINNYGSFAILLNYTVPKELISPDLKSRGLIIPEFLKHPSSELKDYIE 162
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
++K E++++ ++V+ +++ IY+ N D+ + T D V
Sbjct: 163 TMQK--EKHLYPPLNLVV--GNVKTGEIYYICNYNKSDNNNNMNDTTTEAIILGDQVVTV 218
Query: 139 GNSQCTSQ-PFQKV--IFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEID 195
N + P KV + +E F + N N + ++ LD + N SN P++
Sbjct: 219 SNGNVNDKWPKMKVGELLLQEYFDNLNNNLNPNQQGYDINNIPLDGLWNILSNNQKPDLS 278
Query: 196 RKAEADMDEDYKMRYSRVCVD---ISSIFYGTRTHSIILVDH 234
+ ++E + + S V V+ I+ YGTRT SI++VD+
Sbjct: 279 DIPKVGLEETFAKQCSSVFVEEAPINGTNYGTRTSSILIVDN 320
>gi|398408089|ref|XP_003855510.1| hypothetical protein MYCGRDRAFT_36216 [Zymoseptoria tritici IPO323]
gi|339475394|gb|EGP90486.1| hypothetical protein MYCGRDRAFT_36216 [Zymoseptoria tritici IPO323]
Length = 318
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 38/172 (22%), Positives = 74/172 (43%), Gaps = 30/172 (17%)
Query: 27 VKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVE----GQKSAEEYLTDLK 81
V+ GTWL + G+ +L N E NG + RG ++ Y+ +++ +E++ L
Sbjct: 51 VEKGTWLGITRQGRFAVLTNFREEGVDENGAKSRGGMINSYLTVPPGSEETDDEFVQRLL 110
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHV------ 135
+ + GF ++ EL AP P + V+N+ E+ + +
Sbjct: 111 NDVGIHDVGGFTLLFGELR------------APRKLPGLAI--VSNRTESPNELRRIGTH 156
Query: 136 ----YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLM 182
+G NS + KV+ G+ E + K ++SQN I ++ +++
Sbjct: 157 LGETHGLSNSHYGDNSWPKVVNGERLLKEAITKNTSTNQSQNEFIASLFEIL 208
>gi|408793273|ref|ZP_11204883.1| NRDE protein [Leptospira meyeri serovar Hardjo str. Went 5]
gi|408464683|gb|EKJ88408.1| NRDE protein [Leptospira meyeri serovar Hardjo str. Went 5]
Length = 263
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQK--SAEEYL 77
+D GGTWL S+ GK+ L NV S P+ + RG LV+ +++ K S+ Y
Sbjct: 40 AGKDLKAGGTWLGASSLGKVSFLTNVRNFRNPSHPHPKSRGKLVLDFLQSPKDLSSRNYR 99
Query: 78 TDLKKETEENVFNGFHIVLLE 98
++ + F GF++ L +
Sbjct: 100 AEVFSNASQ--FEGFNLFLYD 118
>gi|424854270|ref|ZP_18278628.1| hypothetical protein OPAG_02752 [Rhodococcus opacus PD630]
gi|356664317|gb|EHI44410.1| hypothetical protein OPAG_02752 [Rhodococcus opacus PD630]
Length = 254
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 30/227 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + + NV R RG L V Y+ G+ S +Y +
Sbjct: 42 AGRDLLAGGTWMGIADSLRFAAVTNVRNGAPSTGARSRGALPVDYLRGRLSPADYAAQVA 101
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F++++ + +TN H +Q + V+G
Sbjct: 102 ATGAE--YGSFNLLVGDPAELWWATNRPH-----------------GRRQRVEPGVHGLS 142
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N++ + P+ KV GK+ FA + + ++ D++ + + D E
Sbjct: 143 NAELDT-PWPKVTGGKQAFAAALAADDGSPDADP--GAYFDVLADSDPAPWEALPDTGIE 199
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+++ R+ I YGTR +++ V + D E D
Sbjct: 200 PELERALSSRF------IHYADYGTRASTLLRVRPDGTFDITERRFD 240
>gi|294012348|ref|YP_003545808.1| hypothetical protein SJA_C1-23620 [Sphingobium japonicum UT26S]
gi|292675678|dbj|BAI97196.1| conserved hypothetical protein [Sphingobium japonicum UT26S]
Length = 250
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 80/215 (37%), Gaps = 40/215 (18%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L +D + GGTWL S G+ I+ N+ G + P+ RG LV + G+++A
Sbjct: 42 LIAGRDLLSGGTWLGVSEQGRAAIVTNLRGHGDPHPDRASRGALVTDLLSGREAAPPA-- 99
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
FN F+++L+E + + APL + + +YG
Sbjct: 100 ---------DFNPFNLILVE---GDRAQFLTNRPAPL------------RTDLAPGLYGL 135
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N + P+ K + K + + + L + PD D A
Sbjct: 136 SNGTLDA-PWPKTLALKSALLDWL--VAGADDPLTLFDALRAESLPHAGIAPDAPSDVPA 192
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
EA + + I + YGTR +I+ VD
Sbjct: 193 EAMVSPIF----------IRNPVYGTRCSTIVAVD 217
>gi|448358441|ref|ZP_21547123.1| hypothetical protein C482_10941 [Natrialba chahannaoensis JCM
10990]
gi|445646074|gb|ELY99066.1| hypothetical protein C482_10941 [Natrialba chahannaoensis JCM
10990]
Length = 304
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
Query: 19 LLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L+ QDA GGTW+ + +G + N + R RG LV +E +SA +
Sbjct: 49 LVVAPQDAEAGGTWIGFNEHGLFAGITNKWIDADLAGDRSRGLLVADVLE-ARSASDAAD 107
Query: 79 DLKKETEENVFNGFHIVLLELT 100
+++ T+ N ++GF +V+ + T
Sbjct: 108 IVEEATDGNEYDGFWLVIADAT 129
>gi|111021760|ref|YP_704732.1| hypothetical protein RHA1_ro04789 [Rhodococcus jostii RHA1]
gi|110821290|gb|ABG96574.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length = 254
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 45/227 (19%), Positives = 82/227 (36%), Gaps = 30/227 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + + NV R RG L V Y+ G+ S +Y +
Sbjct: 42 AGRDLLAGGTWMGIADSLRFAAVTNVRNGAPSTGARSRGALPVDYLRGRLSPADYAEQVA 101
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F++++ + T +TN H +Q + V+G
Sbjct: 102 ATGAE--YGSFNLLVGDPTELRWATNRPH-----------------GRQQRVEPGVHGLS 142
Query: 140 NSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
N++ + P+ KV GK+ D++ + D D E
Sbjct: 143 NAELDT-PWPKVTGGKQA--FAAALDADDGGPDADPGTYFDVLSDSDPAPWDSLPDTGIE 199
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
+++ R+ I YGTR +++ V + D E D
Sbjct: 200 PELERALSSRF------IHYDDYGTRASTLLRVRPDGTFDITERRFD 240
>gi|300691468|ref|YP_003752463.1| hypothetical protein RPSI07_1819 [Ralstonia solanacearum PSI07]
gi|299078528|emb|CBJ51183.1| conserved protein of unknown function, DUF833 [Ralstonia
solanacearum PSI07]
Length = 272
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 52/234 (22%), Positives = 94/234 (40%), Gaps = 35/234 (14%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQ-KSAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ + S YL
Sbjct: 47 DLAQVVGHAGTWMGMTRDGRFAALTNYRAPSERRPDARSRGELVANFLTAEDASIPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
LK +NG+++ LT ++ SN A ++P ++ VYG
Sbjct: 107 QLK--ARNGTYNGYNL----LTATRDELWWTSNRA--EAP----------RQLAPGVYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNY----PDPEI 194
N+ + P+ KV FAE + + + + LDL+ + + P +
Sbjct: 149 SNALLDT-PWFKVRHRIAAFAEALAADTGRHGNAIDVARYLDLLAETREAHTGALPSTGV 207
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
+ E + + I S YGTR +++ + H+ D E D +
Sbjct: 208 APEWEKLLSAAF----------IRSPRYGTRASTVLRIRHDGCFDVTERRFDAQ 251
>gi|448473376|ref|ZP_21601518.1| hypothetical protein C461_03882 [Halorubrum aidingense JCM 13560]
gi|445818888|gb|EMA68737.1| hypothetical protein C461_03882 [Halorubrum aidingense JCM 13560]
Length = 272
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ S +G + + N + R R RG L+V+ SAE + +++E
Sbjct: 53 RDASAGGTWIGLSASGLVVAVTNRWLDADRDGDRSRG-LLVRDCLTADSAEAAVRSVERE 111
Query: 84 TEENVFNGFHIVLLE 98
+ ++GF++VL +
Sbjct: 112 LDARSYDGFNLVLAD 126
>gi|71279983|ref|YP_268597.1| hypothetical protein CPS_1867 [Colwellia psychrerythraea 34H]
gi|71145723|gb|AAZ26196.1| conserved hypothetical protein [Colwellia psychrerythraea 34H]
Length = 254
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 92/216 (42%), Gaps = 39/216 (18%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQKSAEEYLTD 79
+D GGTWL S+ GK L N L + P + RG LV++ + K +
Sbjct: 43 AGKDLQAGGTWLGLSSTGKFAALTNFRKLPLSEAPK-KSRGDLVLQALADTKI--NMTAE 99
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK--DHVYG 137
L ++ + ++GF+++ L +Y + ++ N Q ++ D V+
Sbjct: 100 LAQQASQ--YHGFNLIYGSL----KQLYCYDSV--------------NNQSHQLSDGVHS 139
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N + K+ G++ +E + +S+ I+ + DLM N + P
Sbjct: 140 ICNG-ALDDIWPKMARGEKLLSETI-----RSQKNLSIEALFDLMTNDKQALP----HLL 189
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVD 233
E +DE+++ S + I S YGTRT +II D
Sbjct: 190 PETGLDEEWEQLLSAIF--IKSPTYGTRTTTIITQD 223
>gi|289663646|ref|ZP_06485227.1| hypothetical protein XcampvN_11369 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 255
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 35/223 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G A Y
Sbjct: 45 AGRDLRSGGSWVGLGQDGRAAVVTNVRDPLATASGRSRGHLIADYLSGSLGATAYAD--A 102
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ F F+++L + + H SN PL ++ ++G N
Sbjct: 103 LAGAAHAFPPFNLLLCD----ANRCEHLSNHPPL------------ARQLAAGIHGMSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKF--NRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
P + ++ + + + Q L + + + P +D E
Sbjct: 147 -----PLDALWPKTAALTNVLRHWCADGAEDLQPLWAALGNPAIAPDAALPQTGVDLATE 201
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I+ YGTR +I+ VDH HE
Sbjct: 202 RLLSAAF----------ITGPSYGTRASTIVAVDHRGHGFIHE 234
>gi|435846948|ref|YP_007309198.1| hypothetical protein Natoc_1603 [Natronococcus occultus SP4]
gi|433673216|gb|AGB37408.1| hypothetical protein Natoc_1603 [Natronococcus occultus SP4]
Length = 254
Score = 40.0 bits (92), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEY 76
E + +DA GGTW+ + +G + N G R RG LV +E +SA E
Sbjct: 39 EPRIVAPRDAEAGGTWIGYNEHGVFAGVTNRWGTPELAGDRSRGLLVADVLE-ARSAREA 97
Query: 77 LTDLKKETEENVFNGFHIVLLE 98
+ +ET ++GF++V+ +
Sbjct: 98 AEIVAEETAAREYDGFYLVIAD 119
>gi|390169229|ref|ZP_10221171.1| hypothetical protein SIDU_17111 [Sphingobium indicum B90A]
gi|389588185|gb|EIM66238.1| hypothetical protein SIDU_17111 [Sphingobium indicum B90A]
Length = 245
Score = 40.0 bits (92), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 12/80 (15%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGE-NSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L +D + GGTWL S G+ I+ N+ G + P+ RG LV + G+++A
Sbjct: 37 LIAGRDLLSGGTWLGVSEQGRAAIVTNLRGHGDPHPDRASRGALVTDLLSGREAAPPA-- 94
Query: 79 DLKKETEENVFNGFHIVLLE 98
FN F+++L+E
Sbjct: 95 ---------DFNPFNLILVE 105
>gi|419962181|ref|ZP_14478176.1| hypothetical protein WSS_A08699 [Rhodococcus opacus M213]
gi|414572474|gb|EKT83172.1| hypothetical protein WSS_A08699 [Rhodococcus opacus M213]
Length = 253
Score = 40.0 bits (92), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 47/229 (20%), Positives = 85/229 (37%), Gaps = 34/229 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + + NV R RG L V Y+ G+ S +Y +
Sbjct: 41 AGRDLLAGGTWMGIADSLRFAAVTNVRNGAPSTGARSRGALPVDYLRGRLSPADYAAQVA 100
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F++++ + +TN H +Q + V+G
Sbjct: 101 ATGAE--YGSFNLLVGDPAELWWATNRPH-----------------GRRQRVEPGVHGLS 141
Query: 140 NSQCTSQPFQKVIFGKE--KFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N++ + P+ KV GK+ A + N ++ + D PD I
Sbjct: 142 NAELDT-PWPKVTGGKQAFAAALAADDGNLDADPGAYFDVLADSDPASWEALPDTGI--- 197
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
E +++ R+ I YGTR +++ V + D E D
Sbjct: 198 -EPELERALSSRF------IHYGDYGTRASTLLRVRPDGTFDITERRFD 239
>gi|432337128|ref|ZP_19588580.1| hypothetical protein Rwratislav_19309 [Rhodococcus wratislaviensis
IFP 2016]
gi|430775937|gb|ELB91408.1| hypothetical protein Rwratislav_19309 [Rhodococcus wratislaviensis
IFP 2016]
Length = 254
Score = 40.0 bits (92), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 47/229 (20%), Positives = 85/229 (37%), Gaps = 34/229 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D + GGTW+ + + + + NV R RG L V Y+ G+ S +Y +
Sbjct: 42 AGRDLLAGGTWMGIADSLRFAAVTNVRNGAPSTGARSRGALPVDYLRGRLSPADYAAQVA 101
Query: 82 KETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFG 139
E + F++++ + +TN H +Q + V+G
Sbjct: 102 ATGAE--YGSFNLLVGDPAELWWATNRPH-----------------GRRQRVEPGVHGLS 142
Query: 140 NSQCTSQPFQKVIFGKE--KFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRK 197
N++ + P+ KV GK+ A + N ++ + D PD I
Sbjct: 143 NAELDT-PWPKVTGGKQAFAAALAADDGNLDADPGAYFDVLADSDPASWEALPDTGI--- 198
Query: 198 AEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLD 246
E +++ R+ I YGTR +++ V + D E D
Sbjct: 199 -EPELERALSSRF------IHYGDYGTRASTLLRVRPDGTFDITERRFD 240
>gi|390603419|gb|EIN12811.1| DUF833-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 298
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 55/249 (22%), Positives = 103/249 (41%), Gaps = 55/249 (22%)
Query: 23 SQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG---RDRGPLVVKYVEGQKSAEEYLTD 79
+D GGTWL + +GK+ +L N+ + P G RG LV ++ L+D
Sbjct: 53 GRDLKAGGTWLGLNRSGKIAVLTNI----TEPLGSYTSSRGHLVSSFL---------LSD 99
Query: 80 LKKETE-ENV------FNGFHIVLLELTLQSTNIYHFSNI-------------APLDSPS 119
L +T+ ENV + GF++V + ++ FS I PL +
Sbjct: 100 LPADTDVENVVTRNEKYAGFNLVYFDPITDASGSLSFSPIFLTNHGGGGPLTARPLTAEE 159
Query: 120 TKVTNVTNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNIL 179
+ ++N + G G + + KV G E++ S+ + L++++
Sbjct: 160 RRCGGLSNG------IDGQGADE-----WPKVQVGIRSMREVIES-AEASDERRLVEDLF 207
Query: 180 DLMKNKQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMD 239
+++ + P + + + E Y + + ++ FYGTR ++ILV + +
Sbjct: 208 HVLEWCTAIPPRERAEHRNTIHI-EPYPVAFDPTH---ATDFYGTRLSTVILVRRDGKVL 263
Query: 240 FHE---WTL 245
F E W L
Sbjct: 264 FVEREPWRL 272
>gi|299743027|ref|XP_001835494.2| hypothetical protein CC1G_08003 [Coprinopsis cinerea okayama7#130]
gi|298405464|gb|EAU86279.2| hypothetical protein CC1G_08003 [Coprinopsis cinerea okayama7#130]
Length = 313
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/173 (20%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR---DRGPLVVKYVEGQKSAEEYLTDL 80
+D GG+W + G++ +L N+ + P GR RG LV ++ KS +L
Sbjct: 53 RDIKAGGSWFGLNKEGRVALLTNI----TEPLGRYSSSRGALVSSFLL-SKSNHPLEDEL 107
Query: 81 KKETEEN-VFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKV--TNVTNKQENKDHVYG 137
K N F GF+++LL TL+S + + + + V + +E +
Sbjct: 108 GKIVSPNAAFAGFNLLLLSPTLRSDDTISYDALFVTNHGGGGVLTSRRLTPEEQRCGCMS 167
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYP 190
G + + KVI +F+ ++ ++++ + ++ +++ + + + P
Sbjct: 168 NGIDGKDASQWPKVIHATREFSAVLQSLSQEATETEIADHLFNVLASWRCSEP 220
>gi|83748618|ref|ZP_00945637.1| signal peptide [Ralstonia solanacearum UW551]
gi|207743123|ref|YP_002259515.1| hypothetical protein RSIPO_01290 [Ralstonia solanacearum IPO1609]
gi|421897526|ref|ZP_16327894.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
gi|83724742|gb|EAP71901.1| signal peptide [Ralstonia solanacearum UW551]
gi|206588732|emb|CAQ35695.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
gi|206594520|emb|CAQ61447.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609]
Length = 272
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 93/230 (40%), Gaps = 27/230 (11%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQ-KSAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ + S YL
Sbjct: 47 DLAQVVGHAGTWMGMARDGRFAALTNYRAPSERRPDARSRGELVADFLTAEDASIPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
LK +NG+++ LT ++ SN A ++P + + VYG
Sbjct: 107 RLK--ARNGTYNGYNL----LTATRDELWWTSNRA--EAP----------RRLEPGVYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ KV FAE + + + + LDL+ + P A
Sbjct: 149 SNALLDT-PWFKVRHRVAAFAEALAADTGRHGNAIDVARYLDLLAETREA---PTGALPA 204
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
E K+ + I S YGTR +++ + H+ D E D +
Sbjct: 205 TGVAPEWEKLLSAAF---IRSPRYGTRASTVLRIRHDGCFDVTERRFDAQ 251
>gi|425744500|ref|ZP_18862556.1| NRDE protein [Acinetobacter baumannii WC-323]
gi|425490655|gb|EKU56951.1| NRDE protein [Acinetobacter baumannii WC-323]
Length = 260
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
QD GGTW+ + +G+ IL N G + +P RG L+ ++E + + + L
Sbjct: 43 AGQDLQSGGTWMGVTASGRWAILTNFRNGRDQQPYTTSRGHLIQSFLESELAPIRFAQQL 102
Query: 81 KKETEENVFNGFHIVL 96
++ ++ + GF++ +
Sbjct: 103 EQRQQD--YAGFNLFV 116
>gi|386333478|ref|YP_006029648.1| hypothetical protein RSPO_c01812 [Ralstonia solanacearum Po82]
gi|334195927|gb|AEG69112.1| conserved hypothetical protein [Ralstonia solanacearum Po82]
Length = 272
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 27/230 (11%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQ-KSAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ + S YL
Sbjct: 47 DLAQVVGHAGTWMGMARDGRFAALTNYRAPSERRPDARSRGELVADFLTAEDASIPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
LK +NG+++ LT ++ SN A ++P + + VYG
Sbjct: 107 RLK--ARNGTYNGYNL----LTATRDELWWTSNRA--EAP----------RRLEPGVYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ KV FAE + + + LDL+ + P A
Sbjct: 149 SNALLDT-PWFKVRHRIAAFAEALAADTGRHGHAIDVARYLDLLAETREA---PTGALPA 204
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
E K+ + I S YGTR +++ + H+ D E D +
Sbjct: 205 TGVAPEWEKLLSAAF---IRSPRYGTRASTVLRIRHDGCFDVTERRFDAQ 251
>gi|344174723|emb|CCA86533.1| conserved hypothetical protein, DUF833 [Ralstonia syzygii R24]
Length = 272
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 52/234 (22%), Positives = 93/234 (39%), Gaps = 35/234 (14%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGENS-RPNGRDRGPLVVKYVEGQ-KSAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ + S YL
Sbjct: 47 DLAQVVGHAGTWMGMTRDGRFAALTNYRAPSERRPDARSRGELVANFLTAEDASIPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
LK +NG+++ LT ++ SN A ++P ++ VYG
Sbjct: 107 QLK--ARNGTYNGYNL----LTATRDELWWTSNRA--EAP----------RQLAPGVYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQ----SNYPDPEI 194
N+ + P+ KV FAE + + + + LDL+ + P +
Sbjct: 149 SNALLDT-PWFKVRHRIAAFAEALAADTGRHGNAIDVARYLDLLAETREAPTGALPSTGV 207
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
+ E + + I S YGTR +++ + H+ D E D +
Sbjct: 208 APEWEKLLSAAF----------IRSPRYGTRASTVLRIRHDGCFDVTERRFDAQ 251
>gi|289671029|ref|ZP_06492104.1| hypothetical protein XcampmN_21733 [Xanthomonas campestris pv.
musacearum NCPPB 4381]
Length = 255
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 35/223 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G+ ++ NV + +GR RG L+ Y+ G A Y
Sbjct: 45 AGRDLRSGGSWVDLGQDGRAAVVTNVRDPLATASGRSRGHLIADYLSGSLGATAYAD--A 102
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+ F F+++L + + H SN PL ++ ++G N
Sbjct: 103 LAGAAHAFPPFNLLLCD----ANRCEHLSNHPPL------------ARQLAAGIHGMSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKF--NRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAE 199
P + ++ + + + Q L + + + P +D E
Sbjct: 147 -----PLDALWPKTAALTNVLRHWCADGAEDLQPLWAALGNPAIAPDAALPQTGVDLATE 201
Query: 200 ADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
+ + I+ YGTR +I+ VDH HE
Sbjct: 202 RLLSAAF----------ITGPSYGTRASTIVAVDHRGHGFIHE 234
>gi|296412432|ref|XP_002835928.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295629725|emb|CAZ80085.1| unnamed protein product [Tuber melanosporum]
Length = 356
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/169 (21%), Positives = 72/169 (42%), Gaps = 16/169 (9%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRP--NGRDRGPLVVKYVEGQK----SAEEYLTDLKKE 83
GTWL + +G+L +L N ++S + + RG +V ++ + S E +L L +
Sbjct: 54 GTWLGITRSGRLAVLTNFCEDSSAAAISQKSRGSIVTSFLTSHEDLNGSTESWLHRLLAD 113
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDH--------V 135
+ GF ++ L S++ +N+ + + E+ H
Sbjct: 114 GKMKGVGGFSLLCGILRPSSSSSDPKANLEKIAVVCNRTAAGDEGVESSAHWIAGSKGET 173
Query: 136 YGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKN 184
+G NS P+ KV+ G+ AE + KSE + L++ + ++ +
Sbjct: 174 HGLSNS-LFDDPWPKVVIGRNLLAETIENSAGKSEDE-LLEGLFAILSH 220
>gi|325923943|ref|ZP_08185533.1| hypothetical protein XGA_4589 [Xanthomonas gardneri ATCC 19865]
gi|325545569|gb|EGD16833.1| hypothetical protein XGA_4589 [Xanthomonas gardneri ATCC 19865]
Length = 255
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 49/222 (22%), Positives = 86/222 (38%), Gaps = 33/222 (14%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+W+ +G++ ++ NV + +GR RG L+ Y+ G A Y +DL
Sbjct: 45 AGRDLRSGGSWVGLGNHGRVAVVTNVRDPLATASGRSRGHLIADYLAGSLDAAAYASDLA 104
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E F F+++L + + H SN PL + ++G N
Sbjct: 105 AAAHE--FPPFNLLLCD----AERCEHLSNHPPL------------ARTLAPGIHGMSNG 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKF-NRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+Q K A + N N + + +Q + + N + PD +
Sbjct: 147 PLDAQ--------WPKTAALTNALHNWCATDSDDLQPLWTALGNP-AIAPDAALHHTG-V 196
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
D+ + + + I+ YGTR +I+ VD HE
Sbjct: 197 DLATERLLSAAF----ITGASYGTRASTIVAVDQQGRGFIHE 234
>gi|23098506|ref|NP_691972.1| hypothetical protein OB1051 [Oceanobacillus iheyensis HTE831]
gi|22776732|dbj|BAC13007.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
Length = 256
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 95/224 (42%), Gaps = 31/224 (13%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYVEGQKSAEEYLTDL 80
+D GTWL + GK+ + N + G RG + V ++ + A +L+ L
Sbjct: 44 AGRDLKSKGTWLGITKQGKIAAITNYRNPSLPETGTYSRGKIPVDFLNNEIDASHFLSRL 103
Query: 81 KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGN 140
E ++++ G++ + N+Y ++NI +S V+ + + +K
Sbjct: 104 --EENKSLYAGYNALFGNYQ----NLYTYNNI--YNSSEQLVSGIHSLSNDK-------- 147
Query: 141 SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKAEA 200
+ P+ KV+ + + N+ ++ L + + + K + PD I + E
Sbjct: 148 ---LNTPWPKVVKARGLLQNHLES-NQDISNEALFRILQNADKPNDQDLPDTGIGLQLER 203
Query: 201 DMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWT 244
++ S + +D + YGTR +IIL+D NN F E T
Sbjct: 204 NL--------SSMFIDTGN--YGTRASTIILIDKNNDATFIERT 237
>gi|88706381|ref|ZP_01104086.1| protein containing DUF833 [Congregibacter litoralis KT71]
gi|88699317|gb|EAQ96431.1| protein containing DUF833 [Congregibacter litoralis KT71]
Length = 258
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 53/130 (40%), Gaps = 21/130 (16%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG--RDRGPLVVKYVEGQKSAEEYL 77
L +D GGTWL S G+ + N+ + G R RG L ++ ++ +YL
Sbjct: 42 LLAGRDLQAGGTWLGVSRGGRFAAITNIRDPAAEAGGAPRTRGELTTDFLTATQTPGDYL 101
Query: 78 TDLKKETEENVFNGFHIVLLEL----------------TLQSTNIYHFSNIAPLDSPSTK 121
++ N + GF++++ + + IY SN A LD P K
Sbjct: 102 AEVAG--RRNDYQGFNLLVGDRDSLWYLHGNRDEPSAPEPLAPGIYGLSNAA-LDVPWPK 158
Query: 122 VTNVTNKQEN 131
V ++ ++
Sbjct: 159 VQRARHRLQD 168
>gi|392579982|gb|EIW73109.1| hypothetical protein TREMEDRAFT_42185 [Tremella mesenterica DSM
1558]
Length = 407
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 6/93 (6%)
Query: 29 GGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GGTWL + + ++G++ NV P+ RG L+ ++ + E + D K + V
Sbjct: 96 GGTWLGMTKDLRIGVITNVRQPGLNPSPPSRGTLLKDFLSPSPHSAEKVHDYLKSHVDTV 155
Query: 89 --FNGFHIVLLEL----TLQSTNIYHFSNIAPL 115
+ GF+++L L T T F++ P+
Sbjct: 156 GEYEGFNLLLFRLHPPQTSNPTQTLQFASNTPI 188
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 82/180 (45%), Gaps = 24/180 (13%)
Query: 68 EGQKSAEEYLTDLKKETEENVF--NGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNV 125
EG+K E + + ET E + N + ++L+L S+ I + N
Sbjct: 215 EGEKEKERISLEDRWETPEIGYLSNRPKPIYVDLSLPSSPI---------------IENT 259
Query: 126 TNKQENKDHVYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKN 184
K E K +G NS +P+ KVIFG+E+ A+ + ++ E ++ LI+ + D++ +
Sbjct: 260 DKKIEIKTICHGLSNSPL-KEPWPKVIFGEERMAQSLTQWEENEEDEDKLIERMFDILSD 318
Query: 185 KQSNYPDPEIDRKAEADMDEDYKMRYSRVCVDIS--SIFYGTRTHSIILVDHNNVMDFHE 242
+ + +D K+ + K+ + + +YGTR ++ILV + + F E
Sbjct: 319 SKPI--NTLVDAKSSTTIPA-LKLGPNPLLPPKQGEGRWYGTRLSTVILVRDDATVVFVE 375
>gi|385301409|gb|EIF45599.1| duf833 domain-containing protein [Dekkera bruxellensis AWRI1499]
Length = 350
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 102/249 (40%), Gaps = 41/249 (16%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRP-NGRDRGPLVVKYVEGQKSAEEYLTDLKKETEENV 88
GTW+ + G++ +L+N N RG + +K+++ K A + E+
Sbjct: 51 GTWIGITKQGRVAVLVNYREPAMECLNKISRGSIPMKFLKSNKKAITW-----SESANKH 105
Query: 89 FNGFHIV----LLELTLQSTNIYHFSNIAPLDSPSTK---VTNVTNKQE----NKDHVY- 136
NGF V LL LQ++ I PL S + +V N +E D Y
Sbjct: 106 MNGFKDVGGFSLLFGELQASQ--GGKEIMPLHIMSNRYRETVSVFNTEETLNRQDDDRYH 163
Query: 137 --------GFGNSQCTSQPFQKVIFGKEKFAEIV-NKFNRKSESQNLIQNILDLMKNKQS 187
G NS +P+ KV GK+ E+ N +K L+ + +++
Sbjct: 164 KMQLFDTIGLSNSPYL-EPWPKVTEGKKLVEEVAENAVXQKWNKDKLVSKLFEVLTVAHP 222
Query: 188 NYPDPEIDRKAEADMDEDYK------MRYSR--VCVDISS---IFYGTRTHSIILVDHNN 236
+ P+ ++ E D+ K +R ++ C S YGTRT ++ILVD +
Sbjct: 223 SDPEKWLNYNLEEGFDQMPKSIFVPPLRKTKGPACSGSKSKPTSLYGTRTQTVILVDKHG 282
Query: 237 VMDFHEWTL 245
+ + E TL
Sbjct: 283 KVTYVEKTL 291
>gi|114328323|ref|YP_745480.1| hypothetical protein GbCGDNIH1_1659 [Granulibacter bethesdensis
CGDNIH1]
gi|114316497|gb|ABI62557.1| hypothetical protein GbCGDNIH1_1659 [Granulibacter bethesdensis
CGDNIH1]
Length = 240
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 3/83 (3%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEE 75
+M + D V GGTW+A + G + +LN G G R RG L + + +SAEE
Sbjct: 39 DMAVTGGLDTVAGGTWMAINDQGVVAAVLNRTGTLGPVAGKRSRGDLPILALH-YRSAEE 97
Query: 76 YLTDLKKETEENVFNGFHIVLLE 98
+ + + + + GFH+V+ +
Sbjct: 98 AVQAI-AQLDAGAWRGFHLVVAD 119
>gi|358056832|dbj|GAA97182.1| hypothetical protein E5Q_03858 [Mixia osmundae IAM 14324]
Length = 620
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 79/184 (42%), Gaps = 41/184 (22%)
Query: 25 DAVKGGTWLATSTNGKLGILLNVLGENSRPNGR-DRGPLVVKYV----EGQKSAEEYLTD 79
D + GTWL + G+L IL N + P + RG L ++ KS ++YL
Sbjct: 59 DLLAKGTWLGINRQGRLAILTNYAEPVASPGTKLSRGDLTKGWLMLEERSSKSMDDYLAR 118
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTN------VTNK--QEN 131
+++ ++ + GF+++L +LT D PS V+N+ +E
Sbjct: 119 VERTKDD--YPGFNLLLGQLT---------------DDPSDHAQKRYDLALVSNRSGREA 161
Query: 132 KDHVYGF--------GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMK 183
+ H+ G G C + KV G+ F +++ + K+E ++L + I DL+
Sbjct: 162 QRHLTGEERAGGMTNGGMSCGEAAWPKVKSGQPVFDQLIE--SNKTE-RDLKEGIWDLLS 218
Query: 184 NKQS 187
S
Sbjct: 219 KSGS 222
>gi|421780380|ref|ZP_16216869.1| hypothetical protein MCRH_1437 [Moraxella catarrhalis RH4]
gi|407812536|gb|EKF83321.1| hypothetical protein MCRH_1437 [Moraxella catarrhalis RH4]
Length = 145
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVL-GENSRPNGRDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTW+ + +G+ +L N G++ + G LV Y++ Q + ++LT L
Sbjct: 41 AGRDKQAGGTWMGVNLSGRWAVLTNFRDGQDKKLYATSHGTLVQNYLDSQMTPLQFLTCL 100
Query: 81 KKETEENVFNGFHIVL 96
K + + GF++++
Sbjct: 101 KPSQTD--YAGFNLIV 114
>gi|296813389|ref|XP_002847032.1| DUF833 domain-containing protein [Arthroderma otae CBS 113480]
gi|238842288|gb|EEQ31950.1| DUF833 domain-containing protein [Arthroderma otae CBS 113480]
Length = 322
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 51/242 (21%), Positives = 104/242 (42%), Gaps = 38/242 (15%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNG----RDRGPLVVKYV----EGQKSAEEYLTDLK 81
GTW+ + NG+L +L N EN+ P R RG ++ ++ + +A+EY+ +L
Sbjct: 58 GTWMGVTKNGRLAVLTNYR-ENT-PGAIVGLRSRGSIINAFLSLPPDSDMTAQEYIEELV 115
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
E GF + ++ F + + + ++ + ++ G N+
Sbjct: 116 AGGEGQAAGGFSLACGDV---------FGPLGIVSNRASAGDEIPWIATARNQTVGLSNT 166
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMK-------NKQSNYPD-- 191
+ + K++ G+ E + K ES++ LI +L ++ +++++ D
Sbjct: 167 AFGDRSWPKILQGERLMKEAIQKSYASGESEDALIHRLLGVLSVDNLPRLHERASLQDYL 226
Query: 192 PEIDRK----AEADMDEDYKMRYSRVCVDISSIF----YGTRTHSIILVDHNNVMDFHEW 243
P + A D DE + S + +++ YGT+ ++ILVD + + F E
Sbjct: 227 PSLSESIFIPAIGDPDEGVEEVVSH-GAEAHTLYMKGLYGTQKQTVILVDESGRVKFFER 285
Query: 244 TL 245
TL
Sbjct: 286 TL 287
>gi|374261239|ref|ZP_09619825.1| hypothetical protein LDG_6206 [Legionella drancourtii LLAP12]
gi|363538346|gb|EHL31754.1| hypothetical protein LDG_6206 [Legionella drancourtii LLAP12]
Length = 263
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 100/250 (40%), Gaps = 41/250 (16%)
Query: 20 LEVSQDAVKGGTWLATSTNGKLGILLNVLGENSR-PNGRDRGPLVVKYV--EGQKSAEEY 76
L QD GGTWL + N ++ N N+ P + RG LV Y+ + S Y
Sbjct: 42 LFAGQDLESGGTWLGVNKNSGFSLITNYRNPNAYDPLMQSRGLLVKNYLLDSTRTSPSAY 101
Query: 77 LTDL-KKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQEN-KDH 134
+ + N FN F + +T IY+ +NV NK +
Sbjct: 102 INKIVPNAMRYNPFNLF------VGTINTLIYY--------------SNVENKAKKLTPG 141
Query: 135 VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEI 194
+YG N P+ KV+ KE F + + ++ + I+++L + PD
Sbjct: 142 LYGISN-HLLDTPWYKVLRAKELFNKCLGILRIRANPEQ-IEDLLFPILEDCRLAPD--- 196
Query: 195 DRKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWIH 254
+ + + + + S + V + YGTR+ ++IL N+ + K +IH
Sbjct: 197 NLLPQTGVPIEVEKSLSSIFVTTPNHEYGTRSSTLILFTKENIF--------FSEKIFIH 248
Query: 255 ---THIRKTL 261
T ++KT+
Sbjct: 249 GKATSLKKTI 258
>gi|341901620|gb|EGT57555.1| hypothetical protein CAEBREN_16295 [Caenorhabditis brenneri]
Length = 290
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 5/118 (4%)
Query: 138 FGNSQCTSQPFQKVI-FGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQS-NYPDPEID 195
FGNS+ TS+P QK++ F KE F ++ + ++ NL++ LD MK +S N P +
Sbjct: 53 FGNSEDTSEPIQKMVKFAKELFPKL--QLYLILDASNLVE-FLDTMKFWRSFNIPLKTVK 109
Query: 196 RKAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKKWI 253
+ A++D++ C D+ ++ +HS+ D F + ++ +W+
Sbjct: 110 IGSAAEIDDELMKAVLEQCSDVRNLTIYYSSHSLFQYDFRAHAPFKFDFFELKSAEWV 167
>gi|354544947|emb|CCE41672.1| hypothetical protein CPAR2_802220 [Candida parapsilosis]
Length = 318
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 51/249 (20%), Positives = 103/249 (41%), Gaps = 42/249 (16%)
Query: 30 GTWLATSTNGKLGILLNVLGENSRPNGR--DRGPLVVKYVEGQKSAEEYLTDLKKETE-- 85
GTW+ +T+GK+ +L+N ++ + RG L + Y+E K+ +E+ +
Sbjct: 55 GTWIGVTTSGKIAVLVNYRDMDTESTMKQVSRGVLPLNYLETTKTDDEWRDSFSANIQNG 114
Query: 86 ------ENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTN---VTNKQENKDH-- 134
+N+ GF ++ L + N NI +V + N D
Sbjct: 115 NASLDLKNI-GGFTLLYGSLRVDEKNGIDHLNILSNKGHHGRVFERECSSQDMANPDRFA 173
Query: 135 ---VYGFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN-LIQNILDLMKNKQSNYP 190
+G NS + P++KV G++ ++++ + +Q+ L+++ L+ +
Sbjct: 174 TKTTFGMSNS-LYNDPWKKVDLGEQMLEKVIDASVENNCTQDQLVEDCFKLLSHDT---- 228
Query: 191 DPEIDRKAEADMDEDYKMRYSRVCVDISSI--------------FYGTRTHSIILVDHNN 236
+R D D K+ R + + + +YGTRT +IIL+D
Sbjct: 229 ---YERSIMKQNDFDVKVMELRNSIFVPPLKREGAETHLVSIGDYYGTRTQTIILLDKLG 285
Query: 237 VMDFHEWTL 245
++++E L
Sbjct: 286 NLNYYERNL 294
>gi|386288953|ref|ZP_10066092.1| hypothetical protein DOK_16043 [gamma proteobacterium BDW918]
gi|385277957|gb|EIF41930.1| hypothetical protein DOK_16043 [gamma proteobacterium BDW918]
Length = 249
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GG+WL S NG+ + N L E RG LV ++ +KS +YL L
Sbjct: 44 AGRDLEHGGSWLGISRNGRFAAVTN-LREVEHSGEHSRGDLVKDFLLSEKSTADYLEQL- 101
Query: 82 KETEENVFNGFHIV 95
E +++++ F+ +
Sbjct: 102 -EVKKSIYRPFNFI 114
>gi|390444478|ref|ZP_10232255.1| hypothetical protein A3SI_11409 [Nitritalea halalkaliphila LW7]
gi|389664485|gb|EIM75977.1| hypothetical protein A3SI_11409 [Nitritalea halalkaliphila LW7]
Length = 255
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 40/92 (43%), Gaps = 9/92 (9%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRP--NGRDRGPLVVKYVEGQKSAEEYLTD 79
+D GGTW+ NG+ L N + RP RG LV +VEG E YL
Sbjct: 42 AGKDLKAGGTWMGFHPNGRFAYLTNYR-DLRRPVQGALTRGDLVRDFVEGNLKPEAYLKR 100
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSN 111
+ + E + GF++ L +Y+ SN
Sbjct: 101 IAQRQER--YEGFNL----LVGTPEELYYLSN 126
>gi|91791459|ref|YP_561110.1| hypothetical protein Sden_0090 [Shewanella denitrificans OS217]
gi|91713461|gb|ABE53387.1| protein of unknown function DUF833 [Shewanella denitrificans OS217]
Length = 258
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 6/76 (7%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYVEGQKSAEEYLTDL 80
+D GGTWL + +GK L N+ + S +G R RG LVVK + S E D
Sbjct: 43 AGKDLQAGGTWLGLAQDGKFAGLTNLRTQTSNQDGVRSRGELVVKAL----SVEGLTPDW 98
Query: 81 KKETEENVFNGFHIVL 96
K N +N F++V
Sbjct: 99 LKSHAHN-YNPFNLVF 113
>gi|393213458|gb|EJC98954.1| hypothetical protein FOMMEDRAFT_95305, partial [Fomitiporia
mediterranea MF3/22]
Length = 293
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 53/248 (21%), Positives = 104/248 (41%), Gaps = 19/248 (7%)
Query: 16 IEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEE 75
I+ + +D GG+W + G++ +L N+ E S RG LV ++ + +E
Sbjct: 26 IDNFVLSGRDLQAGGSWFGINRVGRVALLTNISEEVSVRFRESRGHLVSSFLLPEAQDKE 85
Query: 76 YLTDLKKETEENVFNGFHIVLLELT--LQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKD 133
+ + + GF+I++L + + ++ + + + +T T E +
Sbjct: 86 HCARILANNHSAEYAGFNILVLTPSSDREDGSLAYEALLITNCKGGGAITGRTLYTEER- 144
Query: 134 HVYGFGNS--QCTSQPFQKVIFGKEKFAEIVNKFN----RKSESQNLIQNILDLMKNKQS 187
V G N + + KV G+ AE++ K R SE + L++ + ++ + S
Sbjct: 145 RVGGVSNGVDSVNGRSWPKVKTGEACLAEVLYKHQGTPARLSEDE-LVERLFGILSKQTS 203
Query: 188 NYPDPEIDRKAEADMDEDYKMRYSRVCVDISSI----FYGTRTHSIILVDHNNVMDFHE- 242
P+ + ++ +M S + D FYGTR +++LV + + F E
Sbjct: 204 PPPNNRFQLRTSIEI-PPLRMSGS-LSPDARPADGHNFYGTRLSTVLLVRRDGRVLFIER 261
Query: 243 --WTLDYE 248
W LD E
Sbjct: 262 DVWRLDDE 269
>gi|47227707|emb|CAG09704.1| unnamed protein product [Tetraodon nigroviridis]
Length = 208
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 3/69 (4%)
Query: 28 KGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYV-EGQKSAEEYLTDLKKETEE 86
+GG+WL S GKL + N L + P+ RG LV Y+ + ++ + YL K +
Sbjct: 57 EGGSWLGISKRGKLAGITNYLEKYLNPDAPGRGFLVSNYLMDKEQDSYSYLK--KVSLDS 114
Query: 87 NVFNGFHIV 95
+++NGF+++
Sbjct: 115 HLYNGFNLL 123
>gi|393213463|gb|EJC98959.1| DUF833-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 321
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 53/261 (20%), Positives = 99/261 (37%), Gaps = 36/261 (13%)
Query: 14 GLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSA 73
G+ E + D GGTW + G++ +L N+ + + RG LV ++
Sbjct: 41 GIREAFVLSGLDLQCGGTWFGINRAGRIALLTNINEQVGAHHKESRGHLVSSFLLPDAPD 100
Query: 74 EEYLTDLKKETEENVFNGFHIVLLELTLQS-TNIYHFSNIAPLDSPSTKVTNVTNKQENK 132
E + K+ + GF+++LL + Q TN + + +S +
Sbjct: 101 VEQFVQMLKDDPSADYPGFNLLLLSPSTQGQTNTLTYEALLITNSAGGGTITSRPLLAEE 160
Query: 133 DHVYGFGNS--QCTSQPFQKVIFGKEKFAEIVNKFNRKSESQN---LIQNILDLMKNKQS 187
+ G N + + KV G++ E++N+ K+ + L + + +++ + S
Sbjct: 161 RRIGGVSNGVDRINEGTWPKVKTGEDILVELLNESQTKASQPSDYELAERLFEILSLQSS 220
Query: 188 NYPDPEIDRKAEADMDEDYKMRYS----RVCVDISSI-----------FYGTRTHSIILV 232
P Y+MR S VC+ + + TR +I+LV
Sbjct: 221 PAP------------TNLYEMRTSIEVPPVCLRSNCLSPKSDSNDEHDLGATRLSTILLV 268
Query: 233 DHNNVMDFHE---WTLDYETK 250
+ + F E W LD E K
Sbjct: 269 GRDGHVLFIERDVWKLDGEGK 289
>gi|448393266|ref|ZP_21567591.1| hypothetical protein C477_15365 [Haloterrigena salina JCM 13891]
gi|445663681|gb|ELZ16423.1| hypothetical protein C477_15365 [Haloterrigena salina JCM 13891]
Length = 265
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
Query: 19 LLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLT 78
L+ +DA GGTW+ + +G + N + R RG LV +E +AE
Sbjct: 41 LVVAPRDAEAGGTWIGYNEHGVFAGITNKWTDADLAGERSRGLLVADVLEATSAAEAGAI 100
Query: 79 DLKKETEENVFNGFHIVLLELT 100
++ +T++ ++GF++V+ + T
Sbjct: 101 -VEAQTDDYEYDGFYLVVADET 121
>gi|126176418|ref|YP_001052567.1| hypothetical protein Sbal_4233 [Shewanella baltica OS155]
gi|386343182|ref|YP_006039548.1| hypothetical protein [Shewanella baltica OS117]
gi|418023066|ref|ZP_12662052.1| protein of unknown function DUF833 [Shewanella baltica OS625]
gi|125999623|gb|ABN63698.1| protein of unknown function DUF833 [Shewanella baltica OS155]
gi|334865583|gb|AEH16054.1| protein of unknown function DUF833 [Shewanella baltica OS117]
gi|353538068|gb|EHC07624.1| protein of unknown function DUF833 [Shewanella baltica OS625]
Length = 267
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 96/235 (40%), Gaps = 41/235 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYV-EGQKSAEEYLTD 79
+D GGTW + G++ + N+ +P R RG L+ + G +L
Sbjct: 44 AGKDLQAGGTWFGVNKQGQVAAVTNLRVPQKKPEAMRSRGELITMTLNSGSLVCPNWLI- 102
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK--DHVYG 137
E +N + F++V Q T++Y F++I NK K D +
Sbjct: 103 ---EHSDN-YQPFNLVF----GQGTDLYCFNSI--------------NKDTVKLADGFHA 140
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNYPDPEIDR 196
N + K+ G++ ++N +S NL +Q +L LMK+ P+ +
Sbjct: 141 ISNG-ALDDIWPKMAKGQQALEAVIN------QSDNLEVQALLQLMKDDSQ----PQDNE 189
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ +++ R + + + YGTR+ SI+L D M F E D + ++
Sbjct: 190 LPNTGVGIEWERRLAAIYIRHPD--YGTRSTSILLEDAEGGMHFTEVRYDGKGRQ 242
>gi|328860312|gb|EGG09418.1| hypothetical protein MELLADRAFT_61092 [Melampsora larici-populina
98AG31]
Length = 295
Score = 37.7 bits (86), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 52/235 (22%), Positives = 99/235 (42%), Gaps = 42/235 (17%)
Query: 25 DAVKGGTWLA-TSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
D + GGTWL T + G L N+ N P + + + S EEYL ++K
Sbjct: 52 DLIGGGTWLGYTKRLNRFGFLTNL--SNITPPMQKKSKWI--------SREEYLQEIKDN 101
Query: 84 TEENVFNGFHIVLLELTLQSTNIYHFS--------NIAPLDSPSTKVTNVTNKQENKDHV 135
+ ++ +G+++V+ E+ + S N ++S + + V +EN
Sbjct: 102 SALDLMDGYNLVIGEIQKNGEEVIMDSFCNRQTHHNEIQINSNFYQKSRVEEYKEN---- 157
Query: 136 YGFGN---SQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDP 192
G N S + KV GK+ + + K + +S + LI+++ D++ N
Sbjct: 158 -GISNGIKSDEEENEWIKVKLGKDLLTKRMIKESEESNQEKLIEDLFDILSQTDPN---- 212
Query: 193 EIDRKAEADMDEDYKMRYSRVCVDISSI-----FYGTRTHSIILVDHNNVMDFHE 242
+ + ++ +R + S I +YGT+T +++LV N + F E
Sbjct: 213 ------DEALPKNILIRPHHRLNEPSDIPNRQTWYGTKTQTLVLVSRNKKVTFIE 261
>gi|448345339|ref|ZP_21534236.1| hypothetical protein C485_06100 [Natrinema altunense JCM 12890]
gi|445635337|gb|ELY88507.1| hypothetical protein C485_06100 [Natrinema altunense JCM 12890]
Length = 267
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
Query: 17 EMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEY 76
E L+ +DA GGTW+ + +G L N + R RG LV +E +SA
Sbjct: 39 EPLIVAPRDAEAGGTWIGYNEDGVFVGLTNKWTDADLAGDRSRGHLVADVLE-ARSAAAA 97
Query: 77 LTDLKKETEENVFNGFHIVLLE 98
+ ++ T+ + ++GF++V+ +
Sbjct: 98 KSIVESATDTDEYSGFYLVVAD 119
>gi|448537836|ref|ZP_21622705.1| hypothetical protein C467_12876 [Halorubrum hochstenium ATCC
700873]
gi|445701796|gb|ELZ53769.1| hypothetical protein C467_12876 [Halorubrum hochstenium ATCC
700873]
Length = 256
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 24 QDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLKKE 83
+DA GGTW+ + G + + N + R RG L+V+ SAEE + ++ +
Sbjct: 46 RDAEAGGTWIGVAEGGLVVAVTNRWLDADGEGDRSRG-LLVRDCLTADSAEEAVRTVEDD 104
Query: 84 TEENVFNGFHIVLLE 98
E ++GF++VL +
Sbjct: 105 LNERAYDGFNLVLAD 119
>gi|150864464|ref|XP_001383289.2| hypothetical protein PICST_57105 [Scheffersomyces stipitis CBS
6054]
gi|149385720|gb|ABN65260.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 339
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 58/282 (20%), Positives = 112/282 (39%), Gaps = 58/282 (20%)
Query: 13 NGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNV--LGENSRPNGRDRGPLVVKYVEGQ 70
+G +++L + + GTW+ +T+GK+ +L+N + + RG L + Y+
Sbjct: 38 DGTMKILSPLDMARPEHGTWIGVTTSGKVAVLVNYREIDHAHSLSEVSRGILPLDYLCTN 97
Query: 71 KSAEEYLTDL--------KKETEENVFNGFHIVLLELTLQ-STNIYHFSNIAPLDSPSTK 121
KSA+++ L + + E + GF +V +L++ T + NI K
Sbjct: 98 KSADKWHRTLESSLSHVTRGKVELSQIGGFSLVYGQLSIDPKTGKLNHLNILSNRGDHGK 157
Query: 122 VTNVTNKQENKDH---------------------------VYGFGNSQCTSQPFQKVIFG 154
+ N++ +G NS +P++KV G
Sbjct: 158 IHASAKDNSNEEREEKEEADEEEEDDDEEDDLHGDISNKTTFGLSNS-LYYEPWKKVKLG 216
Query: 155 KEKFAEIVNKFNRKSESQN-LIQNILDLMKNKQSNYPDPEIDRKAEADMD---------- 203
+E E+V K SQ L+ L+ + N D E+ ++ +
Sbjct: 217 EELLHELVEKSKEMKLSQEALVSECFKLLSH---NTYDKEVAKQKDFSKKITELRNSIYI 273
Query: 204 ---EDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHE 242
E Y +R+ + +YGTRT +I+L+D ++++E
Sbjct: 274 PPLETYISPSARLLT--AGKYYGTRTQTILLLDRFGYLNYYE 313
>gi|126665228|ref|ZP_01736211.1| hypothetical protein MELB17_19209 [Marinobacter sp. ELB17]
gi|126630598|gb|EBA01213.1| hypothetical protein MELB17_19209 [Marinobacter sp. ELB17]
Length = 268
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 9/105 (8%)
Query: 11 CTNGLIEMLLEVSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQ 70
++ L + +D GGTWLA S NG + + NV + P RG L ++ ++
Sbjct: 39 ASHALASKPVLAGRDLQAGGTWLAVSANGVVSAVTNVREGSPEPGVISRGDLPLRALQQS 98
Query: 71 KSA-EEYLTDLKKETEENVFNGFHIVLLELT---LQSTNIYHFSN 111
+ A +L D + ++GF++V L+ + ++FSN
Sbjct: 99 RQALHRHLAD-----NASRYSGFNLVTLDAGQKRAAGSGGWYFSN 138
>gi|160873252|ref|YP_001552568.1| hypothetical protein Sbal195_0126 [Shewanella baltica OS195]
gi|378706488|ref|YP_005271382.1| hypothetical protein [Shewanella baltica OS678]
gi|160858774|gb|ABX47308.1| protein of unknown function DUF833 [Shewanella baltica OS195]
gi|315265477|gb|ADT92330.1| protein of unknown function DUF833 [Shewanella baltica OS678]
Length = 267
Score = 37.4 bits (85), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 52/235 (22%), Positives = 96/235 (40%), Gaps = 41/235 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYV-EGQKSAEEYLTD 79
+D GGTW + G++ + N+ +P R RG L+ + G +L
Sbjct: 44 AGKDLQAGGTWFGVNKQGQVAAVTNLRVPQKKPEAMRSRGELITMTLNSGSLVCPNWLI- 102
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK--DHVYG 137
E +N + F+++ Q T++Y F++I NK K D +
Sbjct: 103 ---EHSDN-YQPFNLIF----GQGTDLYCFNSI--------------NKDTVKLADGFHA 140
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNYPDPEIDR 196
N + K+ G++ ++N +S NL +Q +L LMK+ P+ +
Sbjct: 141 ISNG-ALDDIWPKMAKGQQALEAVIN------QSDNLEVQALLQLMKDDSQ----PQDNE 189
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ +++ R + + + YGTR+ SI+L D M F E D + ++
Sbjct: 190 LPNTGVGIEWERRLAAIYIRHPD--YGTRSTSILLEDAEGGMHFTEVRYDGKGRQ 242
>gi|187928515|ref|YP_001899002.1| hypothetical protein Rpic_1429 [Ralstonia pickettii 12J]
gi|187725405|gb|ACD26570.1| protein of unknown function DUF833 [Ralstonia pickettii 12J]
Length = 271
Score = 37.4 bits (85), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 92/230 (40%), Gaps = 27/230 (11%)
Query: 21 EVSQDAVKGGTWLATSTNGKLGILLNVLGEN-SRPNGRDRGPLVVKYVEGQK-SAEEYLT 78
+++Q GTW+ + +G+ L N + RP+ R RG LV ++ S YL
Sbjct: 47 DMAQVVGHAGTWMGMTRDGRFAALTNYRAPSERRPDARSRGELVSNFLMADDMSVPAYLD 106
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGF 138
L + +NG+++ LT ++ SN A ++P ++ +YG
Sbjct: 107 SLA--ARDRAYNGYNL----LTATRDELWWTSNRA--EAP----------RKLAPGLYGL 148
Query: 139 GNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDRKA 198
N+ + P+ KV FAE + + + + LDL+ + P A
Sbjct: 149 SNALLDT-PWFKVRHRMAAFAEALAADTGRHGNALDVARYLDLLSEAREA---PTGALPA 204
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYE 248
E K+ + I S YGTR +I+ + H+ D E D +
Sbjct: 205 TGVAPEWEKLLSAAF---IRSPRYGTRASTILRIRHDGCFDVTERRFDAQ 251
>gi|24371726|ref|NP_715768.1| DUF833 family protein [Shewanella oneidensis MR-1]
gi|24345506|gb|AAN53213.1| DUF833 family protein [Shewanella oneidensis MR-1]
Length = 266
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 51/235 (21%), Positives = 101/235 (42%), Gaps = 41/235 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGIL--LNVLGENSRPNGRDRGPLVVKYV-EGQKSAEEYLT 78
+D GGTW +T G+L + L V +N P R RG LV++ + G +L
Sbjct: 44 AGKDLQAGGTWFGVNTQGQLAGVTNLRVAQKNQEPM-RSRGELVIQALNSGSLICPNWLV 102
Query: 79 DLKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK--DHVY 136
+ ++ + F++V Q TN+Y F++I + K+T + N D V+
Sbjct: 103 EHAQD-----YQPFNLVF----GQGTNLYCFNSI---RRETVKLTQGFHAISNGAMDDVW 150
Query: 137 GFGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNLIQNILDLMKNKQSNYPDPEIDR 196
K+ G+ ++ + +Q +++++ LM++ P+ +
Sbjct: 151 ------------PKMAKGQRSLEALIQQ-----SAQLEVESLIKLMQDDSQ----PQDNE 189
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ + ++K R + + I YGTR+ SI+L + + F E D + ++
Sbjct: 190 LPDTGVGLEWKRRLA--AIYIRHPDYGTRSTSILLQNAQGEIQFTEVRYDGKGRR 242
>gi|66823273|ref|XP_644991.1| hypothetical protein DDB_G0272746 [Dictyostelium discoideum AX4]
gi|60473062|gb|EAL71010.1| hypothetical protein DDB_G0272746 [Dictyostelium discoideum AX4]
Length = 379
Score = 37.0 bits (84), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 13/84 (15%)
Query: 24 QDAVKGGTWLATSTNGKLGILLN--------VLGENSRPN-GR-DRGPLVVKY-VEGQKS 72
+D + GGTWLA + GK +LLN L +++ PN G+ RG +V Y ++ S
Sbjct: 50 RDQIGGGTWLAVNNKGKFCMLLNSYYRNEECFLPKSTSPNHGKMSRGAIVYDYLIDDNTS 109
Query: 73 AEEYLTDLKKETEENVFNGFHIVL 96
+Y+ K E + N + F I++
Sbjct: 110 PFDYIQ--KIERQRNNYQPFKIIV 131
>gi|373951566|ref|ZP_09611527.1| protein of unknown function DUF833 [Shewanella baltica OS183]
gi|386322615|ref|YP_006018732.1| hypothetical protein [Shewanella baltica BA175]
gi|333816760|gb|AEG09426.1| protein of unknown function DUF833 [Shewanella baltica BA175]
gi|373888166|gb|EHQ17058.1| protein of unknown function DUF833 [Shewanella baltica OS183]
Length = 267
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 51/235 (21%), Positives = 94/235 (40%), Gaps = 41/235 (17%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNG-RDRGPLVVKYV-EGQKSAEEYLTD 79
+D GGTW + G++ + N+ P R RG L+ + G +L +
Sbjct: 44 AGKDLQAGGTWFGVNKQGQVAAVTNLRVPQKNPEAMRSRGELITMTLNSGSLVCPNWLIE 103
Query: 80 LKKETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENK--DHVYG 137
+ + F++V Q T++Y F++I NK K D +
Sbjct: 104 -----HSDHYQPFNLVF----GQGTDLYCFNSI--------------NKDTVKLADGFHA 140
Query: 138 FGNSQCTSQPFQKVIFGKEKFAEIVNKFNRKSESQNL-IQNILDLMKNKQSNYPDPEIDR 196
N + K+ G++ ++N +S NL +Q +L LMK+ P+ +
Sbjct: 141 ISNG-ALDDIWPKMAKGQQALEAVIN------QSDNLEVQALLQLMKDDSQ----PQDNE 189
Query: 197 KAEADMDEDYKMRYSRVCVDISSIFYGTRTHSIILVDHNNVMDFHEWTLDYETKK 251
+ +++ R + + + YGTR+ SI+L D M F E D + ++
Sbjct: 190 LPNTGVGIEWERRLAAIYIRHPD--YGTRSTSILLEDAEGGMHFTEVRYDGKGRQ 242
>gi|149377244|ref|ZP_01894991.1| hypothetical protein MDG893_12605 [Marinobacter algicola DG893]
gi|149358432|gb|EDM46907.1| hypothetical protein MDG893_12605 [Marinobacter algicola DG893]
Length = 261
Score = 36.6 bits (83), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 43/214 (20%), Positives = 82/214 (38%), Gaps = 33/214 (15%)
Query: 22 VSQDAVKGGTWLATSTNGKLGILLNVLGENSRPNGRDRGPLVVKYVEGQKSAEEYLTDLK 81
+D GGTWLA +G + + NV + P RG L + + + A ++
Sbjct: 47 AGRDLQSGGTWLAVDEDGGVAAVTNVREGSQVPGAHSRGELPLMSLAESRDA----LAIR 102
Query: 82 KETEENVFNGFHIVLLELTLQSTNIYHFSNIAPLDSPSTKVTNVTNKQENKDHVYGFGNS 141
+T+++ ++GF++V L+ + Y+ + A P V T YG N
Sbjct: 103 LQTKKSDYSGFNLVRLD---EHAGWYYSNRDA---HPGRHVHRGT---------YGLSN- 146
Query: 142 QCTSQPFQKVIFGKEKFAEIVNKFNRKSES---QNLIQNILDLMKNKQSNYPDPEIDRKA 198
P+ K++ + +++ + NLI+ + D PD + +
Sbjct: 147 HLLQTPWPKLLRLRNSVTDLLEGASADRTDDLHHNLIERLQDTTPAPDHELPDTGVGKDT 206
Query: 199 EADMDEDYKMRYSRVCVDISSIFYGTRTHSIILV 232
E + + I YGTR +++ V
Sbjct: 207 ERFLSSPF----------IIGGDYGTRATTVVTV 230
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.384
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,250,751,229
Number of Sequences: 23463169
Number of extensions: 173211662
Number of successful extensions: 359368
Number of sequences better than 100.0: 623
Number of HSP's better than 100.0 without gapping: 219
Number of HSP's successfully gapped in prelim test: 404
Number of HSP's that attempted gapping in prelim test: 358626
Number of HSP's gapped (non-prelim): 685
length of query: 264
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 124
effective length of database: 9,074,351,707
effective search space: 1125219611668
effective search space used: 1125219611668
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 75 (33.5 bits)