BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 034235
(100 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225431557|ref|XP_002282176.1| PREDICTED: UPF0235 protein C15orf40 homolog [Vitis vinifera]
gi|147866968|emb|CAN83055.1| hypothetical protein VITISV_009894 [Vitis vinifera]
Length = 128
Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/90 (73%), Positives = 75/90 (83%), Gaps = 1/90 (1%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP+KKGK+KA S GSTQS N + PSCIR VPPSSVSIT+HAKPGSK SITD D
Sbjct: 1 MAPSKKGKAKANSGGSTQSSDSVN-SSFPSCIRFVPPSSVSITVHAKPGSKVSSITDFDD 59
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EA+GVQIDAPAKDGEANAALL+Y+SS++ V
Sbjct: 60 EALGVQIDAPAKDGEANAALLDYISSVVGV 89
>gi|30694498|ref|NP_175343.2| uncharacterized protein [Arabidopsis thaliana]
gi|34365605|gb|AAQ65114.1| At1g49170 [Arabidopsis thaliana]
gi|51971533|dbj|BAD44431.1| similar to serine/threonine kinase 9 gb|AAD28798.1 [Arabidopsis
thaliana]
gi|332194278|gb|AEE32399.1| uncharacterized protein [Arabidopsis thaliana]
Length = 126
Score = 124 bits (311), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/90 (71%), Positives = 74/90 (82%), Gaps = 3/90 (3%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP KKGK K KSA +S T + P+C+RL+ PSSV+ITIHAKPGSK+ SITDVSD
Sbjct: 1 MAPTKKGK-KTKSAA--ESTTVTESSSFPTCLRLLTPSSVAITIHAKPGSKAASITDVSD 57
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAALLEYMSS++ V
Sbjct: 58 EAVGVQIDAPARDGEANAALLEYMSSVLGV 87
>gi|351726281|ref|NP_001237633.1| uncharacterized protein LOC100527290 [Glycine max]
gi|255632017|gb|ACU16361.1| unknown [Glycine max]
Length = 126
Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/90 (71%), Positives = 77/90 (85%), Gaps = 3/90 (3%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
M PAK+GK+KA+ +T S+ K ND PSCIR VPPSSV+ITIHAKPGSKS S+TD+SD
Sbjct: 1 MPPAKRGKAKAEPKVTTPSE-KPND--FPSCIRSVPPSSVAITIHAKPGSKSASVTDISD 57
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAALL+Y+SS++ V
Sbjct: 58 EAVGVQIDAPARDGEANAALLDYISSVLGV 87
>gi|351722711|ref|NP_001238533.1| uncharacterized protein LOC100499954 [Glycine max]
gi|255627953|gb|ACU14321.1| unknown [Glycine max]
Length = 126
Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/90 (71%), Positives = 78/90 (86%), Gaps = 3/90 (3%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
M PAK+GK+KA+S +T S+ K ND PSCI+ VPPSSV+ITIHAKPG+KS SITD+SD
Sbjct: 1 MPPAKRGKAKAESKVTTPSE-KPND--FPSCIQSVPPSSVAITIHAKPGAKSASITDISD 57
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAALL+Y+SS++ V
Sbjct: 58 EAVGVQIDAPARDGEANAALLDYISSVLGV 87
>gi|7770327|gb|AAF69697.1|AC016041_2 F27J15.6 [Arabidopsis thaliana]
Length = 91
Score = 122 bits (305), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/87 (72%), Positives = 72/87 (82%), Gaps = 3/87 (3%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP KKGK K KSA +S T + P+C+RL+ PSSV+ITIHAKPGSK+ SITDVSD
Sbjct: 1 MAPTKKGK-KTKSAA--ESTTVTESSSFPTCLRLLTPSSVAITIHAKPGSKAASITDVSD 57
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSL 87
EAVGVQIDAPA+DGEANAALLEYMSS+
Sbjct: 58 EAVGVQIDAPARDGEANAALLEYMSSV 84
>gi|449467753|ref|XP_004151587.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cucumis sativus]
gi|449513315|ref|XP_004164293.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cucumis sativus]
Length = 156
Score = 120 bits (302), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 64/91 (70%), Positives = 75/91 (82%), Gaps = 3/91 (3%)
Query: 1 MAPAKKGKSKA-KSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
M PAKKGK+KA K+ S QS +K+N N PSC+R V PSSV+ITIHAKPGSK SITD
Sbjct: 29 MPPAKKGKTKAPKATESIQSSVKSN--NYPSCLRSVSPSSVAITIHAKPGSKIASITDFG 86
Query: 60 DEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
D+A+GVQIDAPAKDGEANAALL+YMSS++ V
Sbjct: 87 DDALGVQIDAPAKDGEANAALLDYMSSVLGV 117
>gi|297852564|ref|XP_002894163.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp.
lyrata]
gi|297340005|gb|EFH70422.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp.
lyrata]
Length = 126
Score = 119 bits (298), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/90 (68%), Positives = 73/90 (81%), Gaps = 3/90 (3%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP KKGK K KSA +S T P+C+RL+ PSSV+ITIHAKPGSK+ SITDVSD
Sbjct: 1 MAPTKKGK-KTKSAA--ESTTVTESSIFPACLRLLTPSSVAITIHAKPGSKAASITDVSD 57
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAALLE++SS++ V
Sbjct: 58 EAVGVQIDAPARDGEANAALLEFISSVLGV 87
>gi|224131202|ref|XP_002328480.1| predicted protein [Populus trichocarpa]
gi|222838195|gb|EEE76560.1| predicted protein [Populus trichocarpa]
Length = 131
Score = 114 bits (284), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/77 (76%), Positives = 66/77 (85%), Gaps = 2/77 (2%)
Query: 16 STQSK--IKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKD 73
STQSK + N PSCIR VPPSSV+ITIHAKPGSKS SITD+SDEAVGVQIDAPAKD
Sbjct: 16 STQSKPTAPSPQNNFPSCIRAVPPSSVAITIHAKPGSKSASITDLSDEAVGVQIDAPAKD 75
Query: 74 GEANAALLEYMSSLIHV 90
GEANAALL+Y+SS++ V
Sbjct: 76 GEANAALLDYISSVLGV 92
>gi|357464555|ref|XP_003602559.1| hypothetical protein MTR_3g095670 [Medicago truncatula]
gi|355491607|gb|AES72810.1| hypothetical protein MTR_3g095670 [Medicago truncatula]
Length = 125
Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/64 (82%), Positives = 61/64 (95%)
Query: 27 NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
N+PSCIR +PPSSV+ITIHAKPGSKS SITDVSDEAVGVQIDAPA+DGEANAALL+Y+SS
Sbjct: 24 NIPSCIRCMPPSSVAITIHAKPGSKSASITDVSDEAVGVQIDAPARDGEANAALLDYISS 83
Query: 87 LIHV 90
++ V
Sbjct: 84 VLGV 87
>gi|226496211|ref|NP_001150562.1| LOC100284194 [Zea mays]
gi|195640232|gb|ACG39584.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea
mays]
Length = 129
Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/90 (55%), Positives = 68/90 (75%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP K+GK+K + T P C+RL+PPS+V+I+IHAKPGSK +IT++ D
Sbjct: 1 MAPGKRGKAKGAAPPPPTXAASTGAVGFPGCLRLMPPSTVAISIHAKPGSKVATITEIGD 60
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 61 EAVGVQIDAPARDGEANAALVDFISSVLGV 90
>gi|255634374|gb|ACU17552.1| unknown [Glycine max]
Length = 77
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/64 (81%), Positives = 58/64 (90%), Gaps = 2/64 (3%)
Query: 22 KTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALL 81
K ND PSCIR VPPSSV+ITIHAKPGSKS S+TD+SDEAVGVQIDAPA+DGEANAALL
Sbjct: 2 KPND--FPSCIRSVPPSSVAITIHAKPGSKSASVTDISDEAVGVQIDAPARDGEANAALL 59
Query: 82 EYMS 85
+Y+S
Sbjct: 60 DYIS 63
>gi|414586173|tpg|DAA36744.1| TPA: COG1872 containing protein, Uncharacterized ACR, YggU
family, partial [Zea mays]
Length = 86
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 66/86 (76%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP K+GK+K + + T P C+RL+PPS+V+I+IHAKPGSK +IT++ D
Sbjct: 1 MAPGKRGKAKGAAPPPPTAAASTGAVGFPGCLRLMPPSTVAISIHAKPGSKVATITEIGD 60
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSS 86
EAVGVQIDAPA+DGEANAAL++++SS
Sbjct: 61 EAVGVQIDAPARDGEANAALVDFISS 86
>gi|326522380|dbj|BAK07652.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531514|dbj|BAJ97761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 129
Score = 98.2 bits (243), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 52/91 (57%), Positives = 70/91 (76%), Gaps = 1/91 (1%)
Query: 1 MAPAKKG-KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
MAP K+G K+KAK A + P C+RL+PPS+V+I+IHAKPGSK +IT+V
Sbjct: 1 MAPGKRGSKAKAKGAPAPAPAAAAAGGEFPGCLRLMPPSTVAISIHAKPGSKMATITEVG 60
Query: 60 DEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+EAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 61 EEAVGVQIDAPARDGEANAALVDFISSVLGV 91
>gi|242077752|ref|XP_002448812.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor]
gi|241939995|gb|EES13140.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor]
Length = 131
Score = 97.4 bits (241), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 42/63 (66%), Positives = 57/63 (90%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSL 87
P C+RL+PPS+V+I+IHAKPGSK +IT++ DEAVGVQIDAPA+DGEANAAL++++SS+
Sbjct: 30 FPGCLRLMPPSTVAISIHAKPGSKVATITEIGDEAVGVQIDAPARDGEANAALVDFISSV 89
Query: 88 IHV 90
+ V
Sbjct: 90 LGV 92
>gi|297603606|ref|NP_001054326.2| Os04g0686300 [Oryza sativa Japonica Group]
gi|215687220|dbj|BAG91785.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222629814|gb|EEE61946.1| hypothetical protein OsJ_16702 [Oryza sativa Japonica Group]
gi|255675903|dbj|BAF16240.2| Os04g0686300 [Oryza sativa Japonica Group]
Length = 129
Score = 97.4 bits (241), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 64/90 (71%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP K+ + + P+C+RL+PPS+V+I+I AKPGSK +IT++ D
Sbjct: 1 MAPGKQRGKAKGAPPPPAAPNAAAAGGFPACLRLMPPSTVAISIQAKPGSKLATITEIGD 60
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 61 EAVGVQIDAPARDGEANAALVDFISSVLGV 90
>gi|38345823|emb|CAD41928.2| OSJNBa0070M12.6 [Oryza sativa Japonica Group]
Length = 207
Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 64/90 (71%)
Query: 1 MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
MAP K+ + + P+C+RL+PPS+V+I+I AKPGSK +IT++ D
Sbjct: 79 MAPGKQRGKAKGAPPPPAAPNAAAAGGFPACLRLMPPSTVAISIQAKPGSKLATITEIGD 138
Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
EAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 139 EAVGVQIDAPARDGEANAALVDFISSVLGV 168
>gi|357166810|ref|XP_003580862.1| PREDICTED: UPF0235 protein C15orf40 homolog [Brachypodium
distachyon]
Length = 127
Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 40/64 (62%), Positives = 57/64 (89%)
Query: 27 NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
P C+RL+PPS+V+I++HAKPGSK +IT++ +EAVGVQIDAPA+DGEANAAL++++SS
Sbjct: 25 GFPRCLRLMPPSTVAISVHAKPGSKVATITEIGEEAVGVQIDAPARDGEANAALVDFISS 84
Query: 87 LIHV 90
++ V
Sbjct: 85 VLGV 88
>gi|326504128|dbj|BAK02850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 129
Score = 95.1 bits (235), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
Query: 1 MAPAKKG-KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
MAP K+G K+KAK A + P C+ L+PPS+V+I+IHAKPGSK +IT+V
Sbjct: 1 MAPGKRGSKAKAKGAPAPAPAAAAAGGEFPGCLHLMPPSTVAISIHAKPGSKMATITEVG 60
Query: 60 DEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+EAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 61 EEAVGVQIDAPARDGEANAALVDFISSVLGV 91
>gi|90399179|emb|CAJ86041.1| H0723C07.11 [Oryza sativa Indica Group]
gi|125550304|gb|EAY96126.1| hypothetical protein OsI_18003 [Oryza sativa Indica Group]
Length = 125
Score = 95.1 bits (235), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 41/60 (68%), Positives = 56/60 (93%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
C+RL+PPS+V+I+IHAKPGSK +IT++ DEAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 27 CLRLMPPSTVAISIHAKPGSKLATITEIGDEAVGVQIDAPARDGEANAALVDFISSVLGV 86
>gi|168032813|ref|XP_001768912.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679824|gb|EDQ66266.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 123
Score = 90.9 bits (224), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 59/82 (71%)
Query: 9 SKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQID 68
SK S+G ++ +P CIR + S+V+IT+HAKPGSK +ITD D AVGVQID
Sbjct: 5 SKGASSGKPEAVATAEKSRIPGCIRSLADSTVAITVHAKPGSKLSAITDTDDGAVGVQID 64
Query: 69 APAKDGEANAALLEYMSSLIHV 90
APA++GEANAALLEY++ ++ +
Sbjct: 65 APAREGEANAALLEYIAEVLGI 86
>gi|223975045|gb|ACN31710.1| unknown [Zea mays]
Length = 95
Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 38/56 (67%), Positives = 52/56 (92%)
Query: 35 VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+PPS+V+I+IHAKPGSK +IT++ DEAVGVQIDAPA+DGEANAAL++++SS++ V
Sbjct: 1 MPPSTVAISIHAKPGSKVATITEIGDEAVGVQIDAPARDGEANAALVDFISSVLGV 56
>gi|259089201|ref|NP_001158638.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss]
gi|225705490|gb|ACO08591.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss]
Length = 182
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/57 (56%), Positives = 41/57 (71%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V+I++HAKPGSK +ITDVS EAVGV I AP GEANA L+ Y+S ++ + E V
Sbjct: 93 VTISVHAKPGSKQNAITDVSIEAVGVAIAAPPTGGEANAELVRYLSKVLELKRSEVV 149
>gi|327289069|ref|XP_003229247.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Anolis
carolinensis]
Length = 119
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 39/52 (75%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
SV+I +HAKPGSK ++TD+S EAVG+ I AP DGEANA L Y+S ++ V
Sbjct: 29 SVTIAVHAKPGSKQNAVTDLSAEAVGIAIAAPPSDGEANAELCRYLSKVLEV 80
>gi|440790394|gb|ELR11677.1| hypothetical protein ACA1_260830 [Acanthamoeba castellanii str.
Neff]
Length = 116
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 43/62 (69%)
Query: 27 NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
LP CI +V +T+H KP +K ++TD+S EA+GV + AP +DGEANA L+++M+
Sbjct: 12 GLPQCIGTTKEGNVKLTVHVKPNAKISAVTDMSSEAIGVALAAPPRDGEANAELVDFMAG 71
Query: 87 LI 88
++
Sbjct: 72 VL 73
>gi|322792209|gb|EFZ16226.1| hypothetical protein SINV_80163 [Solenopsis invicta]
Length = 122
Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/71 (49%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 22 KTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAA 79
KT N+PS + L +V+I I AKPG+K +ITD+SDEAVG+ I AP +GEANA
Sbjct: 14 KTASTNVPSTGPVILDKDGNVAIKIQAKPGAKCNNITDISDEAVGIAISAPPTEGEANAE 73
Query: 80 LLEYMSSLIHV 90
L++Y++S V
Sbjct: 74 LVKYLASTFGV 84
>gi|327289071|ref|XP_003229248.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Anolis
carolinensis]
Length = 104
Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 39/52 (75%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
SV+I +HAKPGSK ++TD+S EAVG+ I AP DGEANA L Y+S ++ V
Sbjct: 47 SVTIAVHAKPGSKQNAVTDLSAEAVGIAIAAPPSDGEANAELCRYLSKVLEV 98
>gi|196001449|ref|XP_002110592.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens]
gi|190586543|gb|EDV26596.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens]
Length = 132
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 42/51 (82%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V ITI AKPGSK ++TD+S + +G+QI APA++GEAN+ L++++SS++ V
Sbjct: 41 VLITIKAKPGSKENAVTDISSDGIGIQIAAPAREGEANSELIKFLSSILKV 91
>gi|344284100|ref|XP_003413808.1| PREDICTED: UPF0235 protein C15orf40-like [Loxodonta africana]
Length = 154
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 5/95 (5%)
Query: 7 GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
G K AG+T +S+ K ++ LPS + + P V+I IHAKPGSK ++TD++ E
Sbjct: 26 GAEMPKKAGTTNKGKSQSKESERPLPSLGPVGVDPKGCVTIAIHAKPGSKQNAVTDLTAE 85
Query: 62 AVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AV + I AP +GEANA L Y+S ++ + + V
Sbjct: 86 AVNIAIAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|354499521|ref|XP_003511857.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cricetulus griseus]
Length = 126
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K+ S G Q+K + P V+I IHAKPGSK ++TD++ EAVGV I
Sbjct: 4 KAGTMSKGKNQTKEPETPPPPAGPVTTDPKGCVTIAIHAKPGSKQNAVTDLTTEAVGVAI 63
Query: 68 DAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AP +GEANA L Y+S ++ + + V
Sbjct: 64 AAPPSEGEANAELCRYLSKVLELRKSDVV 92
>gi|431920276|gb|ELK18311.1| hypothetical protein PAL_GLEAN10009540 [Pteropus alecto]
Length = 154
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 55/95 (57%), Gaps = 5/95 (5%)
Query: 7 GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
G K AG+T +S+ K + LP + + P V+I IHAKPGSK +ITDV+ E
Sbjct: 26 GTEMPKKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAITDVTAE 85
Query: 62 AVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AV V I AP +GEANA L Y+S ++ + + V
Sbjct: 86 AVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|426248144|ref|XP_004017825.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ovis aries]
Length = 126
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K A + G QSK + LP + + P VSI IHAKPGSK ++TDV+ EAV V
Sbjct: 4 KGGATNKGKNQSK--EPERPLPPLGPVTVDPKGGVSIAIHAKPGSKQNAVTDVTAEAVSV 61
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 62 AIAAPPTEGEANAELCRYLSKVLELRKSDVV 92
>gi|307176568|gb|EFN66055.1| UPF0235 protein C15orf40-like protein [Camponotus floridanus]
Length = 122
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 54/83 (65%), Gaps = 1/83 (1%)
Query: 9 SKAKSAGSTQSKIKTNDENLPSC-IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
SK +SA + + + T +N + + L +V I I AKPG+K +ITD+SDEAVG+ I
Sbjct: 2 SKQRSAKNEKMRKGTTTDNQSTGPVVLNKDGNVVIKIQAKPGAKCNNITDISDEAVGIAI 61
Query: 68 DAPAKDGEANAALLEYMSSLIHV 90
AP +GEANA L++Y++S+ +
Sbjct: 62 SAPPMEGEANAELVKYLASIFEL 84
>gi|332025796|gb|EGI65953.1| UPF0235 protein C15orf40-like protein [Acromyrmex echinatior]
Length = 139
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 23 TNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLE 82
T+D ++ S + L +V+I I AKPG+K ++TD+SDEA+G+ I AP +GEANA L++
Sbjct: 35 TDDPSMGSVV-LNKDGNVAIKIQAKPGAKCNNVTDISDEAIGIAISAPPTEGEANAELVK 93
Query: 83 YMSSLIHV 90
Y++S+ V
Sbjct: 94 YLASIFGV 101
>gi|386781826|ref|NP_001248190.1| uncharacterized protein LOC718713 [Macaca mulatta]
gi|380809518|gb|AFE76634.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
gi|380809520|gb|AFE76635.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
gi|380809522|gb|AFE76636.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
gi|380809524|gb|AFE76637.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
gi|380809526|gb|AFE76638.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
gi|380809528|gb|AFE76639.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
gi|380809530|gb|AFE76640.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
Length = 154
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 41/61 (67%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+ITIHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 60 PKGCVTITIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDV 119
Query: 96 V 96
V
Sbjct: 120 V 120
>gi|115495117|ref|NP_001068854.1| UPF0235 protein C15orf40 homolog [Bos taurus]
gi|122140809|sp|Q3ZBP8.1|CO040_BOVIN RecName: Full=UPF0235 protein C15orf40 homolog
gi|73587092|gb|AAI03180.1| Chromosome 15 open reading frame 40 ortholog [Bos taurus]
gi|296475539|tpg|DAA17654.1| TPA: hypothetical protein LOC509050 [Bos taurus]
Length = 126
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K A + G QSK + LP + + P VSI IHAKPGSK ++TDV+ EAV V
Sbjct: 4 KCGATNKGKNQSK--EPERPLPPLGPVTVDPKGGVSIAIHAKPGSKQNAVTDVTTEAVSV 61
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 62 AIAAPPTEGEANAELCRYLSKVLELRKSDVV 92
>gi|402875136|ref|XP_003901372.1| PREDICTED: UPF0235 protein C15orf40 homolog [Papio anubis]
Length = 154
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 41/61 (67%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+ITIHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 60 PKGCVTITIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDV 119
Query: 96 V 96
V
Sbjct: 120 V 120
>gi|328717531|ref|XP_003246233.1| PREDICTED: UPF0235 protein C15orf40 homolog [Acyrthosiphon pisum]
Length = 136
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 40/49 (81%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
V I I+AKPG+K+ +ITD+S + +GVQI+AP DGEANA L++Y+S ++
Sbjct: 45 VVIKINAKPGAKNNNITDISSDGIGVQINAPPTDGEANAELIKYLSKVL 93
>gi|260800225|ref|XP_002595035.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae]
gi|229280275|gb|EEN51046.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae]
Length = 98
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 39/52 (75%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S+ + IHAKPG+K+ +ITDV+ E VGVQI AP +GEANA L Y++ ++ V
Sbjct: 9 SILVAIHAKPGAKANAITDVTTETVGVQITAPPMEGEANAELCRYLAGVLEV 60
>gi|13385576|ref|NP_080353.1| UPF0235 protein C15orf40 homolog [Mus musculus]
gi|29839616|sp|Q9CRC3.1|CO040_MOUSE RecName: Full=UPF0235 protein C15orf40 homolog
gi|12851848|dbj|BAB29184.1| unnamed protein product [Mus musculus]
gi|12857526|dbj|BAB31031.1| unnamed protein product [Mus musculus]
gi|12859380|dbj|BAB31634.1| unnamed protein product [Mus musculus]
Length = 126
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 52/92 (56%), Gaps = 6/92 (6%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
K+ A S G Q+K E P V P V+I IHAKPGS+ ++TD+S EAVG
Sbjct: 4 KAGATSKGKNQTK---EPETAPPAAGPVATDPKGFVTIAIHAKPGSRQNAVTDLSTEAVG 60
Query: 65 VQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V I AP +GEANA L Y+S ++ + + V
Sbjct: 61 VAIAAPPSEGEANAELCRYLSKVLDLRKSDVV 92
>gi|311260654|ref|XP_001929217.2| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Sus scrofa]
Length = 154
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 54/95 (56%), Gaps = 5/95 (5%)
Query: 7 GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
G K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ E
Sbjct: 26 GAEMPKKAGATNKGKSQSKEQERPLPPLGPVTVDPKGCVTIAIHAKPGSKQNAVTDLTTE 85
Query: 62 AVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AV V I AP +GEANA L Y+S + + + V
Sbjct: 86 AVSVAIAAPPSEGEANAELCRYLSKVFELRKSDVV 120
>gi|355735946|gb|AES11838.1| hypothetical protein [Mustela putorius furo]
Length = 145
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TDV+ EAV V
Sbjct: 24 KKAGATNKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDVTAEAVSVA 83
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 84 IAAPPSEGEANAELCRYLSKVLELRKSDVV 113
>gi|395822693|ref|XP_003784647.1| PREDICTED: UPF0235 protein C15orf40 homolog [Otolemur garnettii]
Length = 154
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 55/89 (61%), Gaps = 4/89 (4%)
Query: 12 KSAGSTQSKIKTNDENLP----SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K++ +++ K ++N+ P + + + P V+I IHAKPGSK ++TD++ EAV V I
Sbjct: 32 KASATSKGKCQSNEPERPLPPLAPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVAI 91
Query: 68 DAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AP +GEANA L Y+S ++ + + V
Sbjct: 92 AAPPSEGEANAELCRYLSKILELRKSDVV 120
>gi|301789543|ref|XP_002930186.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ailuropoda
melanoleuca]
Length = 214
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 55/95 (57%), Gaps = 5/95 (5%)
Query: 7 GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
G K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TDV+ E
Sbjct: 86 GAEMPKKAGATNKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDVTAE 145
Query: 62 AVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AV V I AP +GEANA L Y+S ++ + + V
Sbjct: 146 AVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVV 180
>gi|355692948|gb|EHH27551.1| hypothetical protein EGK_17775 [Macaca mulatta]
gi|355778257|gb|EHH63293.1| hypothetical protein EGM_16229 [Macaca fascicularis]
Length = 127
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 41/61 (67%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+ITIHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 33 PKGCVTITIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDV 92
Query: 96 V 96
V
Sbjct: 93 V 93
>gi|440913185|gb|ELR62667.1| hypothetical protein M91_21008, partial [Bos grunniens mutus]
Length = 116
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P VSI IHAKPGSK ++TDV+ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 22 PKGGVSIAIHAKPGSKQNAVTDVTTEAVSVAIAAPPTEGEANAELCRYLSKVLELRKSDV 81
Query: 96 V 96
V
Sbjct: 82 V 82
>gi|432862407|ref|XP_004069840.1| PREDICTED: UPF0235 protein C15orf40 homolog [Oryzias latipes]
Length = 169
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 39/52 (75%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V++++H KPGSK +ITDVS EAVGV + AP DGEAN L+ Y++ ++ +
Sbjct: 80 AVTVSVHVKPGSKQSAITDVSAEAVGVSVGAPPTDGEANTELIRYLADVLDL 131
>gi|348523449|ref|XP_003449236.1| PREDICTED: UPF0235 protein C15orf40 homolog [Oreochromis niloticus]
Length = 184
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 42/59 (71%), Gaps = 5/59 (8%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI-----HVSV 92
+V+I +HAKPGSK SIT+VS EAV V I AP DGEANA L+ Y++ ++ H+S+
Sbjct: 95 AVTIRVHAKPGSKQSSITEVSTEAVSVAISAPPTDGEANAELIRYLAEVLDLKKSHISL 153
>gi|345485956|ref|XP_001604898.2| PREDICTED: UPF0235 protein C15orf40 homolog [Nasonia vitripennis]
Length = 147
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 42/52 (80%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V+I + AKPG+K +ITD S+E VG+ I AP ++GEANA L++Y++S+++V
Sbjct: 57 NVTIKVQAKPGAKQNNITDFSEETVGIAISAPPQEGEANAELVKYLASILNV 108
>gi|395747069|ref|XP_002825813.2| PREDICTED: UPF0235 protein C15orf40 homolog [Pongo abelii]
Length = 147
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 24 KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 83
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 84 IAAPPSEGEANAELCRYLSKVLELRKSDVV 113
>gi|20381446|gb|AAH27500.1| RIKEN cDNA 3110040N11 gene [Mus musculus]
gi|74206730|dbj|BAE41614.1| unnamed protein product [Mus musculus]
gi|148674975|gb|EDL06922.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus]
gi|148674977|gb|EDL06924.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus]
Length = 126
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 6/92 (6%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
K+ A S G Q+K E P V P V+I IHAKPGS+ ++TD+S EAVG
Sbjct: 4 KAGATSKGKNQTK---EPETAPPAAGPVATDPKGFVTIAIHAKPGSRQNAVTDLSTEAVG 60
Query: 65 VQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V I AP GEANA L Y+S ++ + + V
Sbjct: 61 VAIAAPPSQGEANAELCRYLSKVLDLRKSDVV 92
>gi|157743130|gb|AAI49509.1| C21H15orf40 protein [Bos taurus]
Length = 132
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P VSI IHAKPGSK ++TDV+ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 38 PKGGVSIAIHAKPGSKQNAVTDVTTEAVSVAIAAPPTEGEANAELCRYLSKVLELRKSDV 97
Query: 96 V 96
V
Sbjct: 98 V 98
>gi|149057385|gb|EDM08708.1| similar to RIKEN cDNA 3110040N11, isoform CRA_e [Rattus norvegicus]
Length = 130
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 50/93 (53%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K+ A S G Q+K + V+I IHAKPGSK ++TD++ EAVGV I
Sbjct: 4 KAGATSKGKNQTKEPETPPPPTGPVATDSKGFVTIAIHAKPGSKQNAVTDLNTEAVGVAI 63
Query: 68 DAPAKDGEANAALLEYMSSLIHVSVGEFVESVF 100
AP +GEANA L Y+S ++ + + V F
Sbjct: 64 AAPPSEGEANAELCRYLSKVLDLRKSDVVLDKF 96
>gi|296204187|ref|XP_002749223.1| PREDICTED: UPF0235 protein C15orf40 [Callithrix jacchus]
Length = 154
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 55/90 (61%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + +LP + + P V+I IHAKPGSK ++TD++ EA+ V
Sbjct: 31 KKAGATTKGKSQSKEPERSLPPSGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAINVA 90
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 91 IAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|335292316|ref|XP_003356704.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Sus scrofa]
Length = 150
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 54/95 (56%), Gaps = 5/95 (5%)
Query: 7 GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
G K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ E
Sbjct: 26 GAEMPKKAGATNKGKSQSKEQERPLPPLGPVTVDPKGCVTIAIHAKPGSKQNAVTDLTTE 85
Query: 62 AVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AV V I AP +GEANA L Y+S + + + V
Sbjct: 86 AVSVAIAAPPSEGEANAELCRYLSKVFELRKSDVV 120
>gi|67078514|ref|NP_001019920.1| UPF0235 protein C15orf40 homolog [Rattus norvegicus]
gi|392341258|ref|XP_003754291.1| PREDICTED: UPF0235 protein C15orf40 homolog [Rattus norvegicus]
gi|392349088|ref|XP_003750283.1| PREDICTED: UPF0235 protein C15orf40 homolog [Rattus norvegicus]
gi|81908918|sp|Q505I4.1|CO040_RAT RecName: Full=UPF0235 protein C15orf40 homolog
gi|63100368|gb|AAH94529.1| Similar to RIKEN cDNA 3110040N11 [Rattus norvegicus]
gi|149044058|gb|EDL97440.1| rCG63322 [Rattus norvegicus]
gi|149057383|gb|EDM08706.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus
norvegicus]
gi|149057387|gb|EDM08710.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus
norvegicus]
Length = 126
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K+ A S G Q+K + V+I IHAKPGSK ++TD++ EAVGV I
Sbjct: 4 KAGATSKGKNQTKEPETPPPPTGPVATDSKGFVTIAIHAKPGSKQNAVTDLNTEAVGVAI 63
Query: 68 DAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AP +GEANA L Y+S ++ + + V
Sbjct: 64 AAPPSEGEANAELCRYLSKVLDLRKSDVV 92
>gi|397488591|ref|XP_003815342.1| PREDICTED: UPF0235 protein C15orf40 homolog [Pan paniscus]
gi|410049554|ref|XP_001148978.2| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Pan
troglodytes]
Length = 147
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 24 KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 83
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 84 IAAPPSEGEANAELCRYLSKVLELRKSDVV 113
>gi|73951609|ref|XP_545872.2| PREDICTED: UPF0235 protein C15orf40 homolog [Canis lupus
familiaris]
Length = 152
Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 3/88 (3%)
Query: 12 KSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAVGVQID 68
K AG+T E P + V P SV+I I AKPGSK ++TDV+ EAV V I
Sbjct: 31 KKAGATNKGKSQEPERPPPSLGPVTVDPRGSVTIAIRAKPGSKQNAVTDVTAEAVSVAIA 90
Query: 69 APAKDGEANAALLEYMSSLIHVSVGEFV 96
AP +GEANA L Y+S ++ + + V
Sbjct: 91 APPSEGEANAELCRYLSKVLELRKSDVV 118
>gi|237858662|ref|NP_653198.2| UPF0235 protein C15orf40 isoform a [Homo sapiens]
gi|119582835|gb|EAW62431.1| chromosome 15 open reading frame 40, isoform CRA_a [Homo sapiens]
Length = 153
Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 30 KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 89
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 IAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|29839587|sp|Q8WUR7.1|CO040_HUMAN RecName: Full=UPF0235 protein C15orf40
gi|18043732|gb|AAH19820.1| Chromosome 15 open reading frame 40 [Homo sapiens]
gi|189053288|dbj|BAG35094.1| unnamed protein product [Homo sapiens]
Length = 126
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 3 KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 62
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 63 IAAPPSEGEANAELCRYLSKVLELRKSDVV 92
>gi|237858666|ref|NP_001153586.1| UPF0235 protein C15orf40 isoform c [Homo sapiens]
Length = 153
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 31 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 89 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|157123809|ref|XP_001653923.1| hypothetical protein AaeL_AAEL009675 [Aedes aegypti]
gi|108874207|gb|EAT38432.1| AAEL009675-PA [Aedes aegypti]
Length = 149
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 38/52 (73%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V I I AKPG+K+ ITD+ +E VGVQI AP DGEAN L++Y+S L+ +
Sbjct: 57 NVLIKIQAKPGAKTNGITDIGEEGVGVQIAAPPVDGEANTELVKYLSKLLEL 108
>gi|403258550|ref|XP_003921821.1| PREDICTED: UPF0235 protein C15orf40 homolog [Saimiri boliviensis
boliviensis]
Length = 150
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 32 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 89
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|149057384|gb|EDM08707.1| similar to RIKEN cDNA 3110040N11, isoform CRA_d [Rattus norvegicus]
Length = 181
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 40/57 (70%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V+I IHAKPGSK ++TD++ EAVGV I AP +GEANA L Y+S ++ + + V
Sbjct: 91 VTIAIHAKPGSKQNAVTDLNTEAVGVAIAAPPSEGEANAELCRYLSKVLDLRKSDVV 147
>gi|148236990|ref|NP_001089221.1| uncharacterized protein LOC734268 [Xenopus laevis]
gi|57920974|gb|AAH89152.1| MGC85153 protein [Xenopus laevis]
Length = 121
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 39/50 (78%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
SV+I+IHAKPG+K +ITDV+ +AVGV I AP +GEANA L Y+S ++
Sbjct: 30 SVTISIHAKPGAKQNAITDVTADAVGVAIAAPPTEGEANAELCRYLSKVL 79
>gi|410049556|ref|XP_003952769.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Pan
troglodytes]
Length = 143
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 25 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 82
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 83 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 113
>gi|291413562|ref|XP_002723040.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 154
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 55/91 (60%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A S G +QSK ++ LP + + P V+I IHAKPG+K ++TD++ EAV V
Sbjct: 32 KAGATSKGRSQSK--DHEAPLPPSGPVAVDPKGCVTIAIHAKPGAKQNAVTDLTAEAVSV 89
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|237858670|ref|NP_001153588.1| UPF0235 protein C15orf40 isoform e [Homo sapiens]
Length = 149
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 31 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 89 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|426380120|ref|XP_004056728.1| PREDICTED: UPF0235 protein C15orf40 homolog [Gorilla gorilla
gorilla]
Length = 153
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
+ AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 30 RKAGATTKDKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 89
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 IAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|237858664|ref|NP_001153585.1| UPF0235 protein C15orf40 isoform b [Homo sapiens]
Length = 167
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 31 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 89 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|149456878|ref|XP_001519576.1| PREDICTED: UPF0235 protein C15orf40 homolog, partial
[Ornithorhynchus anatinus]
Length = 119
Score = 60.8 bits (146), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 41/58 (70%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+V+I +HAKPG+K ++TDVS EAVGV I AP +GEANA L Y++ ++ + + V
Sbjct: 28 AVTIAVHAKPGAKQNAVTDVSVEAVGVAIAAPPSEGEANAELCRYLAKILELRKSDVV 85
>gi|237858668|ref|NP_001153587.1| UPF0235 protein C15orf40 isoform d [Homo sapiens]
Length = 167
Score = 60.8 bits (146), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 31 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 89 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|426380122|ref|XP_004056729.1| PREDICTED: UPF0235 protein C15orf40 homolog [Gorilla gorilla
gorilla]
Length = 153
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+I IHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 59 PKGCVTIAIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDV 118
Query: 96 V 96
V
Sbjct: 119 V 119
>gi|444722140|gb|ELW62843.1| BTB/POZ domain-containing protein 1 [Tupaia chinensis]
Length = 441
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/61 (49%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+I IHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ V +
Sbjct: 347 PKGCVTIAIHAKPGSKQNAVTDLTTEAVNVAIAAPPTEGEANAELCRYLSKVLEVRKSDV 406
Query: 96 V 96
V
Sbjct: 407 V 407
>gi|426380124|ref|XP_004056730.1| PREDICTED: UPF0235 protein C15orf40 homolog [Gorilla gorilla
gorilla]
Length = 149
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
+ AG+T +S+ K + LP + + P V+I IHAKPGSK ++TD++ EAV V
Sbjct: 30 RKAGATTKDKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 89
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 IAAPPSEGEANAELCRYLSKVLELRKSDVV 119
>gi|338717345|ref|XP_001498126.2| PREDICTED: UPF0235 protein C15orf40 homolog [Equus caballus]
Length = 126
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+I IHAKPGSK ++TDV+ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 32 PKGCVTIAIHAKPGSKQNAVTDVTAEAVSVAIAAPPSEGEANAELCRYLSKVLDLRKSDV 91
Query: 96 V 96
V
Sbjct: 92 V 92
>gi|332238617|ref|XP_003268500.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Nomascus
leucogenys]
Length = 154
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 53/90 (58%), Gaps = 5/90 (5%)
Query: 12 KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
K AG+T +S+ K + LP + + P V+I IHAKPG K ++TD++ EAV V
Sbjct: 31 KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGCKQNAVTDLTAEAVNVA 90
Query: 67 IDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 91 IAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|307199144|gb|EFN79854.1| UPF0235 protein C15orf40-like protein [Harpegnathos saltator]
Length = 100
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 43/59 (72%), Gaps = 5/59 (8%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI-----HVSV 92
+V+I I AKPG+K +ITD+++E VGV I AP +GEANA L++Y++S+ HVS+
Sbjct: 34 NVTIKIQAKPGAKCNNITDITNEGVGVAISAPPTEGEANAELVKYLASIFGLRKSHVSL 92
>gi|281348884|gb|EFB24468.1| hypothetical protein PANDA_020551 [Ailuropoda melanoleuca]
Length = 87
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+I IHAKPGSK ++TDV+ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 22 PKGCVTIAIHAKPGSKQNAVTDVTAEAVSVAIAAPPSEGEANAELCRYLSKVLELRKSDV 81
Query: 96 V 96
V
Sbjct: 82 V 82
>gi|349602907|gb|AEP98900.1| UPF0235 protein C15orf40-like protein, partial [Equus caballus]
Length = 116
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V+I IHAKPGSK ++TDV+ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 22 PKGCVTIAIHAKPGSKQNAVTDVTAEAVSVAIAAPPSEGEANAELCRYLSKVLDLRKSDV 81
Query: 96 V 96
V
Sbjct: 82 V 82
>gi|52345734|ref|NP_001004913.1| chromosome 15 open reading frame 40 [Xenopus (Silurana)
tropicalis]
gi|49523245|gb|AAH75357.1| MGC89060 protein [Xenopus (Silurana) tropicalis]
Length = 120
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 38/50 (76%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
SV I+IHAKPG+K +ITDV+ +AVGV I AP +GEANA L Y+S ++
Sbjct: 29 SVIISIHAKPGAKQNAITDVTADAVGVAIAAPPTEGEANAELCRYLSKVL 78
>gi|351704948|gb|EHB07867.1| hypothetical protein GW7_13591 [Heterocephalus glaber]
Length = 164
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 39/57 (68%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V+I IHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + + V
Sbjct: 74 VTIAIHAKPGSKQNAVTDLTTEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDVV 130
>gi|403284558|ref|XP_003933632.1| PREDICTED: UPF0235 protein C15orf40 homolog [Saimiri boliviensis
boliviensis]
Length = 141
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 40/58 (68%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
++S TIHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + + V
Sbjct: 23 NLSYTIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDVV 80
>gi|346466449|gb|AEO33069.1| hypothetical protein [Amblyomma maculatum]
Length = 137
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 39/51 (76%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V+I +HAKPG+ ITD+S+++VG+QI AP DGEAN L+ ++S L+++
Sbjct: 20 VAIRVHAKPGASVSRITDISNDSVGIQIAAPPVDGEANTELVRFLSKLLNL 70
>gi|332238619|ref|XP_003268501.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Nomascus
leucogenys]
Length = 150
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPG K ++TD++ EAV V
Sbjct: 32 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGCKQNAVTDLTAEAVNV 89
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|410960502|ref|XP_003986828.1| PREDICTED: UPF0235 protein C15orf40 homolog [Felis catus]
Length = 126
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 40/61 (65%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P V++ IHAKPGSK ++TD++ EAV V I AP +GEANA L Y+S ++ + +
Sbjct: 32 PKGCVTVAIHAKPGSKQNAVTDLTPEAVSVAIAAPPSEGEANAELCRYLSKVLKLRKSDV 91
Query: 96 V 96
V
Sbjct: 92 V 92
>gi|441616624|ref|XP_004088386.1| PREDICTED: UPF0235 protein C15orf40 homolog [Nomascus leucogenys]
Length = 168
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
K+ A + G +QSK + LP + + P V+I IHAKPG K ++TD++ EAV V
Sbjct: 32 KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGCKQNAVTDLTAEAVNV 89
Query: 66 QIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I AP +GEANA L Y+S ++ + + V
Sbjct: 90 AIAAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|126306449|ref|XP_001373757.1| PREDICTED: UPF0235 protein C15orf40 homolog [Monodelphis domestica]
Length = 146
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 37/55 (67%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P S++I IHAKPGSK +ITDV+ E V V I AP +GEAN L Y+S ++ +
Sbjct: 52 PSGSITIAIHAKPGSKQNAITDVTTENVSVAIAAPPSEGEANTELCRYLSKVLEL 106
>gi|126273662|ref|XP_001365632.1| PREDICTED: UPF0235 protein C15orf40 homolog [Monodelphis domestica]
Length = 146
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 39/61 (63%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P S++I IHAKPGSK +ITDV+ E V V I AP +GEAN L Y+S ++ + +
Sbjct: 52 PSGSITIAIHAKPGSKQNAITDVTTENVSVAIAAPPSEGEANTELCRYLSKVLELRKSDV 111
Query: 96 V 96
+
Sbjct: 112 I 112
>gi|380028163|ref|XP_003697778.1| PREDICTED: UPF0235 protein C15orf40 homolog [Apis florea]
Length = 144
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 2/81 (2%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K +K G Q T +E + I + +V+I I AKPG+K +ITD+S++AVGV I
Sbjct: 26 KQGSKKTGKNQKA--TVNEEVKGPISIDKNGNVAIKIQAKPGAKHNNITDISEDAVGVAI 83
Query: 68 DAPAKDGEANAALLEYMSSLI 88
AP +GEAN L++Y++S++
Sbjct: 84 SAPPVEGEANTELVKYIASVL 104
>gi|118353243|ref|XP_001009893.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila]
gi|89291659|gb|EAR89647.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila
SB210]
Length = 127
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 42/71 (59%), Gaps = 6/71 (8%)
Query: 26 ENLPSCIRLVPPS------SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAA 79
E P + VP S I+IHAKP SK I+ +SDE V + I AP KDGEANA
Sbjct: 17 EEKPKAVSNVPASIYEKGGKFFISIHAKPNSKISQISGISDEGVDINIAAPPKDGEANAE 76
Query: 80 LLEYMSSLIHV 90
L++Y+S ++ V
Sbjct: 77 LIDYISQVLGV 87
>gi|395502265|ref|XP_003755502.1| PREDICTED: UPF0235 protein C15orf40 homolog [Sarcophilus harrisii]
Length = 147
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 39/58 (67%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
S++I IHAKPGSK +ITDV+ E V V I AP +GEANA L Y+S ++ + + +
Sbjct: 56 SITIAIHAKPGSKQNAITDVTTENVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVI 113
>gi|350399290|ref|XP_003485480.1| PREDICTED: UPF0235 protein C15orf40 homolog [Bombus impatiens]
Length = 144
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 42/57 (73%)
Query: 32 IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
I L +V+I I AKPG+K +ITD+S++AVGV I AP +GEAN L++Y++S++
Sbjct: 48 ISLDKNGNVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASIL 104
>gi|340709930|ref|XP_003393552.1| PREDICTED: UPF0235 protein C15orf40 homolog [Bombus terrestris]
Length = 144
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 42/57 (73%)
Query: 32 IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
I L +V+I I AKPG+K +ITD+S++AVGV I AP +GEAN L++Y++S++
Sbjct: 48 ISLDKNGNVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASIL 104
>gi|328780498|ref|XP_392249.2| PREDICTED: UPF0235 protein C15orf40 homolog [Apis mellifera]
Length = 144
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 50/81 (61%), Gaps = 2/81 (2%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K +K G Q T +E I + +V+I I AKPG+K +ITD+S++AVGV I
Sbjct: 26 KQGSKKTGKNQKA--TVNEEAKGPISVDKNGNVTIKIQAKPGAKHNNITDISEDAVGVAI 83
Query: 68 DAPAKDGEANAALLEYMSSLI 88
AP +GEAN L++Y++S++
Sbjct: 84 SAPPVEGEANTELVKYLASVL 104
>gi|383852310|ref|XP_003701671.1| PREDICTED: UPF0235 protein C15orf40 homolog [Megachile rotundata]
Length = 144
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 40/50 (80%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
+V+I I AKPG+K +ITD+S++AVGV I AP +GEAN L++Y++S++
Sbjct: 55 NVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASVL 104
>gi|241835850|ref|XP_002415074.1| conserved hypothetical protein [Ixodes scapularis]
gi|215509286|gb|EEC18739.1| conserved hypothetical protein [Ixodes scapularis]
Length = 104
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 38/52 (73%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V+I +HAKPG+ ITD+ + VGVQI AP DGEANA L+ +++ ++++
Sbjct: 14 TVAIRVHAKPGASESRITDIGTDGVGVQIAAPPMDGEANAELVRFLAKVLNL 65
>gi|156373856|ref|XP_001629526.1| predicted protein [Nematostella vectensis]
gi|156216528|gb|EDO37463.1| predicted protein [Nematostella vectensis]
Length = 133
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 40/61 (65%)
Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIH 89
S ++ S+ + IHAKPG+K IT++S + VGVQI A K+GEAN L+ YMSS+
Sbjct: 32 SPFEIIGDGSILVKIHAKPGAKQNRITELSPDFVGVQIAAQPKEGEANDELVRYMSSVFG 91
Query: 90 V 90
V
Sbjct: 92 V 92
>gi|347966092|ref|XP_321597.4| AGAP001528-PA [Anopheles gambiae str. PEST]
gi|333470215|gb|EAA01322.4| AGAP001528-PA [Anopheles gambiae str. PEST]
Length = 164
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 36/49 (73%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ I AKPG+K+ ITDVS+E +G QI AP DGEAN L++Y+S L+ +
Sbjct: 72 VKILAKPGAKTSGITDVSEEGIGCQIAAPPIDGEANTELIKYLSKLLDL 120
>gi|170029973|ref|XP_001842865.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167865325|gb|EDS28708.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 157
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 38/59 (64%), Gaps = 2/59 (3%)
Query: 34 LVPPSS--VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V P S V I I AKPGSK IT + DE VGVQI AP DGEAN L++Y++ L+ +
Sbjct: 57 FVEPKSGNVLIKILAKPGSKFNGITGIEDEGVGVQIAAPPIDGEANTELVKYLAKLLDL 115
>gi|348580051|ref|XP_003475792.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cavia porcellus]
Length = 154
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 38/57 (66%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V+I IHAKPGSK ++TD++ EAV + I AP +GEAN L Y+S ++ + + V
Sbjct: 64 VTIAIHAKPGSKQSAVTDLTSEAVNIAIAAPPTEGEANTELCRYLSKVLELRKSDVV 120
>gi|195428527|ref|XP_002062324.1| GK17477 [Drosophila willistoni]
gi|194158409|gb|EDW73310.1| GK17477 [Drosophila willistoni]
Length = 127
Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 42/65 (64%)
Query: 26 ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
E P I L +++I I AKPG+K ITD+ E VGVQI AP +GEANA L++Y+S
Sbjct: 23 ETPPLPINLDKSGNIAIKILAKPGAKQNGITDIGLEGVGVQIAAPPSEGEANAELVKYLS 82
Query: 86 SLIHV 90
++ +
Sbjct: 83 KVLGL 87
>gi|363737754|ref|XP_413838.2| PREDICTED: UPF0235 protein C15orf40 homolog [Gallus gallus]
Length = 140
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 37/51 (72%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V +++ AKPGS+ ++TDV+ EAVGV I AP +GEANA L Y+S ++ V
Sbjct: 51 VRVSVRAKPGSRCSAVTDVTAEAVGVAIAAPPSEGEANAELCRYLSKVLGV 101
>gi|442751525|gb|JAA67922.1| Hypothetical protein [Ixodes ricinus]
Length = 145
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 38/52 (73%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V+I +HAKPG+ ITD+ + VG+QI AP DGEANA L+ +++ ++++
Sbjct: 55 TVAIRVHAKPGASESRITDIGTDGVGIQIAAPPMDGEANAELVRFLAKVLNL 106
>gi|47213227|emb|CAF89748.1| unnamed protein product [Tetraodon nigroviridis]
Length = 120
Score = 57.4 bits (137), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 38/57 (66%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+V+IT+HAKPGSK +T VS EAV V I AP DGEAN L+ +++ ++ + G
Sbjct: 31 AVTITVHAKPGSKHSRVTAVSTEAVEVAIAAPPVDGEANVELVRFLAEVLELKKGHL 87
>gi|307102424|gb|EFN50700.1| hypothetical protein CHLNCDRAFT_142618 [Chlorella variabilis]
Length = 127
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 43/62 (69%), Gaps = 1/62 (1%)
Query: 27 NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
LP+ I P+S I +HAKPGSK CS+T + +A+ V +DA DGEANAAL+E+++
Sbjct: 30 GLPTFITPTGPASCRIGVHAKPGSKVCSVT-LGPDALEVAVDAKPVDGEANAALIEFVAE 88
Query: 87 LI 88
++
Sbjct: 89 VL 90
>gi|349805669|gb|AEQ18307.1| hypothetical protein [Hymenochirus curtipes]
Length = 84
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 37/57 (64%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+V + IHAKPGSK +ITDV+ EAV V I AP +GEAN L Y++ ++ V E
Sbjct: 11 TVCVNIHAKPGSKQNAITDVTTEAVVVAIAAPPMEGEANLELCRYLAQVLEVKKSEV 67
>gi|194866006|ref|XP_001971712.1| GG14280 [Drosophila erecta]
gi|190653495|gb|EDV50738.1| GG14280 [Drosophila erecta]
Length = 127
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 6 KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
KGKSKA + S ++ K+ PS I + ++ I I AKPG+K IT + E VGV
Sbjct: 5 KGKSKA-AVESAKNVAKSTPAKEPSPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63
Query: 66 QIDAPAKDGEANAALLEYMSSLIHV 90
QI AP +GEANA L++++S ++ +
Sbjct: 64 QIAAPPSEGEANAELVKFLSKVLGL 88
>gi|390355931|ref|XP_003728660.1| PREDICTED: UPF0235 protein C15orf40 homolog [Strongylocentrotus
purpuratus]
Length = 147
Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 38/52 (73%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S+SI+I AKPG+K IT + +E VGVQI AP +G AN L++Y++S++ +
Sbjct: 55 SISISIQAKPGAKHNGITGIEEEGVGVQISAPPVEGAANTELVKYLASVLGL 106
>gi|390355950|ref|XP_003728665.1| PREDICTED: UPF0235 protein C15orf40 homolog [Strongylocentrotus
purpuratus]
Length = 130
Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 38/52 (73%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S+SI+I AKPG+K IT + +E VGVQI AP +G AN L++Y++S++ +
Sbjct: 38 SISISIQAKPGAKHNGITGIEEEGVGVQISAPPVEGAANTELVKYLASVLGL 89
>gi|296088594|emb|CBI37585.3| unnamed protein product [Vitis vinifera]
Length = 89
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/37 (70%), Positives = 32/37 (86%)
Query: 54 SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+I DEA+GVQIDAPAKDGEANAALL+Y+SS++ V
Sbjct: 14 TIDYFDDEALGVQIDAPAKDGEANAALLDYISSVVGV 50
>gi|242021057|ref|XP_002430963.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516183|gb|EEB18225.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 117
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 39/57 (68%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
++ + I AKPG+K+ +IT + +E +GVQI+A DGEAN+ L+ YMS L+ + E
Sbjct: 27 NIMLQIFAKPGAKTNAITGIDEEGIGVQINARPVDGEANSELVNYMSCLLGLRKTEI 83
>gi|195127445|ref|XP_002008179.1| GI11963 [Drosophila mojavensis]
gi|193919788|gb|EDW18655.1| GI11963 [Drosophila mojavensis]
Length = 125
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 52/85 (61%), Gaps = 6/85 (7%)
Query: 6 KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
KGK+ +K+ +T + N P + L +++I I AKPG+K ITD+ E VGV
Sbjct: 7 KGKATSKAGEATTPPV----SNTP--VTLDKSGNIAIKILAKPGAKQNGITDIGLEGVGV 60
Query: 66 QIDAPAKDGEANAALLEYMSSLIHV 90
QI AP +GEANA L++++S ++ +
Sbjct: 61 QIAAPPSEGEANAELVKFLSKVLGL 85
>gi|443689915|gb|ELT92199.1| hypothetical protein CAPTEDRAFT_102723 [Capitella teleta]
Length = 129
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 37/49 (75%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ I AKPG+K +ITDV+ + +GVQI AP DGEANA L++YMS ++ V
Sbjct: 40 LQICAKPGAKFNNITDVNADGIGVQIAAPPVDGEANAELVKYMSKVLGV 88
>gi|195014316|ref|XP_001984001.1| GH15254 [Drosophila grimshawi]
gi|193897483|gb|EDV96349.1| GH15254 [Drosophila grimshawi]
Length = 126
Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 37/52 (71%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++ I I AKPG+K ITD+ E VGVQI AP +GEANA L++++S ++ +
Sbjct: 35 NIVIKILAKPGAKQNGITDIGLEGVGVQIAAPPSEGEANAELVKFLSKVLGL 86
>gi|294916777|ref|XP_002778391.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886749|gb|EER10186.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 132
Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 41/61 (67%), Gaps = 2/61 (3%)
Query: 27 NLPSCIRLVPPSS-VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYM 84
+L SC+ P +S I I AKPG+K +T + E A+GVQ++APA+DGEAN LL ++
Sbjct: 27 DLLSCVTKCPRTSQARIAIRAKPGAKVSCLTGIDAEGALGVQLNAPARDGEANEELLSFL 86
Query: 85 S 85
S
Sbjct: 87 S 87
>gi|290996548|ref|XP_002680844.1| predicted protein [Naegleria gruberi]
gi|284094466|gb|EFC48100.1| predicted protein [Naegleria gruberi]
Length = 73
Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 37/53 (69%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
S+ +TI AKP S S I +++DE +GV I AP K+GEAN L +Y+S ++ VS
Sbjct: 1 SIRLTILAKPNSSSSQIANINDEEIGVHIAAPPKEGEANKELCDYVSGVLGVS 53
>gi|195491333|ref|XP_002093518.1| GE20708 [Drosophila yakuba]
gi|194179619|gb|EDW93230.1| GE20708 [Drosophila yakuba]
Length = 128
Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 6 KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
KGKSKA + S +S K+ S I + ++ I I AKPG+K IT + E VGV
Sbjct: 5 KGKSKA-AVESAKSDAKSTPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63
Query: 66 QIDAPAKDGEANAALLEYMSSLIHV 90
QI AP +GEANA L++++S ++ +
Sbjct: 64 QIAAPPSEGEANAELVKFLSKVLGL 88
>gi|194748771|ref|XP_001956818.1| GF24383 [Drosophila ananassae]
gi|190624100|gb|EDV39624.1| GF24383 [Drosophila ananassae]
Length = 126
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 37/52 (71%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++ I I AKPG+K IT +S E VGVQI AP +GEANA L++++S ++ +
Sbjct: 36 NICIQILAKPGAKQNGITGISTEGVGVQIAAPPSEGEANAELVKFLSKVLGL 87
>gi|195337079|ref|XP_002035160.1| GM14073 [Drosophila sechellia]
gi|194128253|gb|EDW50296.1| GM14073 [Drosophila sechellia]
Length = 128
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 6 KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
KGKSKA + S ++ K+ S I + ++ I I AKPG+K IT + E VGV
Sbjct: 5 KGKSKA-AVESAKNDAKSTPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63
Query: 66 QIDAPAKDGEANAALLEYMSSLIHV 90
QI AP +GEANA L++++S ++ +
Sbjct: 64 QIAAPPSEGEANAELVKFLSKVLGL 88
>gi|195587403|ref|XP_002083454.1| GD13347 [Drosophila simulans]
gi|194195463|gb|EDX09039.1| GD13347 [Drosophila simulans]
Length = 128
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 6 KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
KGKSKA + S ++ K+ S I + ++ I I AKPG+K IT + E VGV
Sbjct: 5 KGKSKA-AVESAKNDAKSTPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63
Query: 66 QIDAPAKDGEANAALLEYMSSLIHV 90
QI AP +GEANA L++++S ++ +
Sbjct: 64 QIAAPPSEGEANAELVKFLSKVLGL 88
>gi|145543083|ref|XP_001457228.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425043|emb|CAK89831.1| unnamed protein product [Paramecium tetraurelia]
Length = 106
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 35/52 (67%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S + I+AKP SK IT +SDEAV V I AP KDGEANA L ++++ + V
Sbjct: 17 SYFLVINAKPNSKVSQITGISDEAVDVNIAAPPKDGEANAELCDFVAQTLGV 68
>gi|391330297|ref|XP_003739600.1| PREDICTED: UPF0235 protein C15orf40 homolog [Metaseiulus
occidentalis]
Length = 160
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 35/51 (68%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V + IHAKPG+K ++T + EAV VQI AP DGEAN L++Y++ + +
Sbjct: 71 VGVRIHAKPGAKLSAVTGIGAEAVEVQIGAPPVDGEANTELVKYLAKALDL 121
>gi|24656569|ref|NP_647784.1| CG14966 [Drosophila melanogaster]
gi|7292328|gb|AAF47735.1| CG14966 [Drosophila melanogaster]
gi|289526411|gb|ADD01328.1| RE68649p [Drosophila melanogaster]
Length = 140
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 5/87 (5%)
Query: 8 KSKAKSAGSTQSKIKTNDENLP----SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAV 63
KSK KS +S K + + +P S I + ++ I I AKPG+K IT + E V
Sbjct: 15 KSKGKSKAGVESA-KNDAKAMPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGFEGV 73
Query: 64 GVQIDAPAKDGEANAALLEYMSSLIHV 90
GVQI AP +GEANA L++++S ++ +
Sbjct: 74 GVQIAAPPSEGEANAELVKFLSKVLGL 100
>gi|195376389|ref|XP_002046979.1| GJ12187 [Drosophila virilis]
gi|194154137|gb|EDW69321.1| GJ12187 [Drosophila virilis]
Length = 124
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 37/52 (71%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++ I I AKPG+K ITD+ + VGVQI AP +GEANA L++++S ++ +
Sbjct: 33 NIVIKILAKPGAKQNGITDIGLDGVGVQIAAPPSEGEANAELVKFLSKVLGL 84
>gi|145538297|ref|XP_001454854.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422631|emb|CAK87457.1| unnamed protein product [Paramecium tetraurelia]
Length = 106
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 35/52 (67%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S + I+AKP SK IT +SDEAV + I AP KDGEANA L ++++ + V
Sbjct: 17 SYYLVINAKPNSKVSQITGISDEAVDINIAAPPKDGEANAELCDFVAQTLGV 68
>gi|195160833|ref|XP_002021278.1| GL25246 [Drosophila persimilis]
gi|198465041|ref|XP_001353470.2| GA13391 [Drosophila pseudoobscura pseudoobscura]
gi|194118391|gb|EDW40434.1| GL25246 [Drosophila persimilis]
gi|198149990|gb|EAL30981.2| GA13391 [Drosophila pseudoobscura pseudoobscura]
Length = 125
Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 36/50 (72%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
++ I I AKPG+K IT++ E VGVQI AP +GEANA L++++S ++
Sbjct: 34 NICIKILAKPGAKHNGITNIDLEGVGVQIAAPPSEGEANAELVKFLSKVL 83
>gi|340508740|gb|EGR34382.1| hypothetical protein IMG5_013830 [Ichthyophthirius multifiliis]
Length = 150
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 34/51 (66%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ I I AKP SK ++ +SD+ V V I AP KDGEAN L++ SSL+++
Sbjct: 59 LYIQIKAKPNSKISQVSSISDDCVDVNIAAPPKDGEANEELIQLFSSLLNI 109
>gi|294949448|ref|XP_002786202.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239900359|gb|EER17998.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 132
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 40/61 (65%), Gaps = 2/61 (3%)
Query: 27 NLPSCIRLVPPSS-VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYM 84
+L SC+ P +S I I AKPG+K +T + E A+GVQ++A A+DGEAN LL ++
Sbjct: 27 DLLSCVTKCPRTSQARIAIRAKPGAKVSCLTGIDAEGALGVQLNASARDGEANEELLSFL 86
Query: 85 S 85
S
Sbjct: 87 S 87
>gi|291412810|ref|XP_002722671.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 154
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 52/89 (58%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
K+ A S G +QSK + + + P V+I IHAKPG+K ++TD++ EAV V I
Sbjct: 32 KAGATSKGRSQSKDREAPLPPSGSVAVDPKGCVTIAIHAKPGAKQNAVTDLTAEAVIVAI 91
Query: 68 DAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AP +GEANA L Y+S ++ + + V
Sbjct: 92 AAPPSEGEANAELCRYLSKVLELRKSDVV 120
>gi|449675515|ref|XP_004208423.1| PREDICTED: uncharacterized protein LOC100205424 [Hydra
magnipapillata]
Length = 1002
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 37/53 (69%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + + I+AKPG+K +TD+S E +G+Q+ A +DG+AN LL Y+S L ++
Sbjct: 911 NEIILQIYAKPGAKRNKVTDISVEFIGIQLAAQPRDGKANDELLSYLSELFNI 963
>gi|71668144|ref|XP_821011.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886377|gb|EAN99160.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 156
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 40/62 (64%), Gaps = 3/62 (4%)
Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S I + P +T+HAKPG++S S+ V+D A+ V++ AP +G+ANA L+E+M
Sbjct: 5 SFISQLRPGWFHLTVHAKPGARSSSLACHPAVTDAAIEVRVGAPPVEGKANAELVEFMQM 64
Query: 87 LI 88
L+
Sbjct: 65 LL 66
>gi|399217812|emb|CCF74699.1| unnamed protein product [Babesia microti strain RI]
Length = 97
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 35/51 (68%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V +T+ KPG+KS I ++ + + +QI AP +DGE N AL++Y+S ++ +
Sbjct: 10 VFLTVRVKPGAKSTKIVEIDENCLHLQIAAPPRDGECNEALIKYISKILGI 60
>gi|312084117|ref|XP_003144143.1| hypothetical protein LOAG_08565 [Loa loa]
gi|307760694|gb|EFO19928.1| hypothetical protein LOAG_08565 [Loa loa]
Length = 129
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 46/91 (50%), Gaps = 2/91 (2%)
Query: 7 GKSKAKSAGSTQSKIKT-NDENLP-SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVG 64
GK+K+K G + T N E L S I L + + IHAKP +K+ + D+ V
Sbjct: 2 GKNKSKIEGMVSVTMNTSNKEKLKDSAISLNKNGDIILRIHAKPNAKTTRVIDIGANEVE 61
Query: 65 VQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ I AP DG+AN AL+ M ++ + E
Sbjct: 62 LAIAAPPHDGQANEALINAMMDILELRKNEI 92
>gi|302879874|ref|YP_003848438.1| hypothetical protein Galf_2678 [Gallionella capsiferriformans
ES-2]
gi|302582663|gb|ADL56674.1| protein of unknown function DUF167 [Gallionella capsiferriformans
ES-2]
Length = 96
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 36/55 (65%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGE 94
+++T+H +PG+K I + EA+ +++ AP DG AN ALL+Y++ L V V +
Sbjct: 12 ITLTLHVQPGAKRSEICGLHGEALKLKLAAPPIDGRANEALLKYIAELFRVPVRQ 66
>gi|410907834|ref|XP_003967396.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40 homolog
[Takifugu rubripes]
Length = 115
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 36/52 (69%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V+IT+HAKPGSK SIT++ +++ AP DGEAN L+ +++ ++ +
Sbjct: 31 AVTITVHAKPGSKHSSITEIXWPFCLLELSAPPVDGEANVELIRFLAEVLEL 82
>gi|338715974|ref|XP_003363374.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40 homolog
[Equus caballus]
Length = 196
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
P V+I IHAKPGSK +I DV+ EAV + + AP +GE N L Y S ++
Sbjct: 94 PKGCVTIAIHAKPGSKQHAIPDVTAEAVSMDVAAPPLEGEVNTELCCYCSEVL 146
>gi|72388068|ref|XP_844458.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359409|gb|AAX79847.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70800991|gb|AAZ10899.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 165
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S I + P + IHAKPG++S ++ DEA+ V++ AP DG+AN L+E+M +
Sbjct: 5 SFIVQLRPGFFRLMIHAKPGARSTALAAQPQALDEALEVRLAAPPVDGKANTELVEFMQT 64
Query: 87 LI 88
L+
Sbjct: 65 LL 66
>gi|401412293|ref|XP_003885594.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool]
gi|325120013|emb|CBZ55566.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool]
Length = 531
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 29/88 (32%), Positives = 53/88 (60%), Gaps = 5/88 (5%)
Query: 2 APAKKGKSKAKSAGSTQSKIKTNDE--NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
A +K K +A + + + T D +LP +R+V +++ +HAKPG+K I ++
Sbjct: 11 AAVQKKKPEAGNKHEKKDERNTTDSQSDLPRFLRIVD-GGLTLAVHAKPGAKQSQIPSIN 69
Query: 60 DEA--VGVQIDAPAKDGEANAALLEYMS 85
++A + VQIDAPA++G AN L ++++
Sbjct: 70 EQAEQLDVQIDAPAREGAANEELCDFLA 97
>gi|452819902|gb|EME26952.1| hypothetical protein Gasu_54060 [Galdieria sulphuraria]
Length = 129
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 18 QSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEAN 77
Q + ++ +PS IR V + + + + KPGSK + ++E V + + A K GEAN
Sbjct: 17 QRPLMAHNNRIPSYIR-VKENHILLQVLVKPGSKRPGLMQTTEEEVIIHVGAQPKQGEAN 75
Query: 78 AALLEYMSSLIHV 90
L+E ++ L+HV
Sbjct: 76 QELVERLAKLLHV 88
>gi|261327632|emb|CBH10608.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
gi|289742777|gb|ADD20136.1| hypothetical protein Tb927.4.3080 [Glossina morsitans morsitans]
Length = 165
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S I + P + IHAKPG++S ++ DEA+ V++ AP DG+AN L+E+M +
Sbjct: 5 SFIVQLRPGFFRLMIHAKPGARSTALAAQPQALDEALEVRLAAPPVDGKANTELVEFMQT 64
Query: 87 LI 88
L+
Sbjct: 65 LL 66
>gi|71660987|ref|XP_817521.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70882718|gb|EAN95670.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 156
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 40/62 (64%), Gaps = 3/62 (4%)
Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S I + P +T+HAKPG++S S+ V+D A+ V+I AP +G+ANA L+++M
Sbjct: 5 SFISQLRPGWFHLTVHAKPGARSSSLACHPAVTDAALEVRIGAPPVEGKANAELVDFMQM 64
Query: 87 LI 88
L+
Sbjct: 65 LL 66
>gi|403343269|gb|EJY70959.1| UPF0235 protein C15orf40-like protein [Oxytricha trifallax]
Length = 148
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 33/49 (67%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
+ + I AKPGSKS I V D+ +GV + AP DG AN +LE+++S++
Sbjct: 58 IFMVIRAKPGSKSDEIFAVEDDYIGVAVQAPPLDGAANEGILEFLASVL 106
>gi|403224187|dbj|BAM42317.1| uncharacterized protein TOT_040000685 [Theileria orientalis strain
Shintoku]
Length = 1360
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFVESVF 100
+ ++ KPG+K + + + +QI AP +DGE N ALLEY++ L +S +F S F
Sbjct: 1291 LKVNVKPGAKQTQVVGEVEGLLSLQISAPPRDGECNKALLEYVAELCKLSAYQFSFSAF 1349
>gi|406936038|gb|EKD69854.1| hypothetical protein ACD_46C00713G0005 [uncultured bacterium]
Length = 92
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 35/49 (71%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+TI+ +PGSK I + D + ++++AP DG+ANAAL+ ++S+LI +
Sbjct: 7 LTIYLQPGSKKSEICGMHDGHIKIKLNAPPVDGKANAALILFLSNLIGI 55
>gi|358638832|dbj|BAL26129.1| hypothetical protein AZKH_3845 [Azoarcus sp. KH32C]
Length = 99
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 36/60 (60%)
Query: 32 IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+R SV +T+H +PG+K I + EA+ V++ AP DG+ANAAL +++ VS
Sbjct: 6 LREAADGSVVLTLHIQPGAKRTEIVGLHGEALKVRLAAPPVDGKANAALCAFLAEFCGVS 65
>gi|444725293|gb|ELW65866.1| T-cell surface glycoprotein CD3 delta chain [Tupaia chinensis]
Length = 178
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 36/51 (70%), Gaps = 1/51 (1%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V+I IHAKPGSK ++TD++ +AV V I AP GEA+A L ++S ++ +
Sbjct: 9 VTIAIHAKPGSKQNAVTDLTTKAVNVAITAPPA-GEASAELCRFLSKVLEL 58
>gi|257095142|ref|YP_003168783.1| hypothetical protein CAP2UW1_3597 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257047666|gb|ACV36854.1| protein of unknown function DUF167 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 96
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 37/53 (69%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
++++TIH +PG+K+ I +A+ V+I AP DG ANAAL+++++ + +S
Sbjct: 11 AITVTIHLQPGAKANEIAGRHGDALKVRITAPPVDGRANAALVDFLAQRLGLS 63
>gi|403346375|gb|EJY72583.1| UPF0235 protein C15orf40-like protein [Oxytricha trifallax]
Length = 148
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
+ + I AKPGSKS I V D+ GV + AP DG AN +LE+++S++
Sbjct: 58 IFMVIRAKPGSKSDEIFAVEDDYKGVAVQAPPLDGAANEGILEFLASVL 106
>gi|328772106|gb|EGF82145.1| hypothetical protein BATDEDRAFT_86894 [Batrachochytrium
dendrobatidis JAM81]
Length = 128
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 35/52 (67%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S+ + KPG+K + D+ +AVG+QI A A++GEANA L++ ++ ++ +
Sbjct: 38 SIRLNTLVKPGTKVSQVIDIQGDAVGIQIAAVAREGEANAELIQTVADVLKL 89
>gi|392512740|emb|CCI73951.1| ECU07_0355 [Encephalitozoon cuniculi GB-M1]
Length = 100
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 34/51 (66%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ +T+ AKP +++ SI +V+D V V I AP D +AN L+E++S ++ V
Sbjct: 11 LKLTVRAKPNARNTSIREVTDTEVVVNISAPPADNKANKELVEFISKMLKV 61
>gi|312375313|gb|EFR22710.1| hypothetical protein AND_14314 [Anopheles darlingi]
Length = 155
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 35/73 (47%), Gaps = 24/73 (32%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQID------------------------APAKDGEAN 77
+ I AKPG+K+ ITDVS+E VG QI AP DGEAN
Sbjct: 39 VKILAKPGAKTSGITDVSEEGVGCQIGMSLVDGSTLTLFDQTIQLHLSPLAAPPIDGEAN 98
Query: 78 AALLEYMSSLIHV 90
L+ Y+S L+ +
Sbjct: 99 TELIRYLSKLLEL 111
>gi|417407795|gb|JAA50493.1| Hypothetical protein, partial [Desmodus rotundus]
Length = 116
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P V+I IHAKPGSK +IT ++ EAV V + AP +GEANA L Y+S ++ +
Sbjct: 22 PKGCVTIAIHAKPGSKQNAITGLTAEAVSVAVAAPPSEGEANAELCRYLSKVLEL 76
>gi|119899753|ref|YP_934966.1| hypothetical protein azo3464 [Azoarcus sp. BH72]
gi|166232594|sp|A1KB74.1|Y3464_AZOSB RecName: Full=UPF0235 protein azo3464
gi|119672166|emb|CAL96080.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length = 98
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 35/59 (59%)
Query: 32 IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+R S+++T+H +PG++ + EA+ +++ AP DG+ANAAL +++ V
Sbjct: 5 VREAADGSLTLTLHIQPGARQTGFAGLHGEAMKIRLAAPPVDGKANAALCAFLADFCEV 63
>gi|320108335|ref|YP_004183925.1| hypothetical protein AciPR4_3175 [Terriglobus saanensis SP1PR4]
gi|319926856|gb|ADV83931.1| protein of unknown function DUF167 [Terriglobus saanensis SP1PR4]
Length = 104
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 33/51 (64%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V+ + +PG+K + + EAV + + APA DG+AN AL+ ++++L+ V
Sbjct: 16 VTFAVRVQPGAKRSGVVGIYGEAVKIALVAPAVDGKANEALVRFVATLLDV 66
>gi|302763307|ref|XP_002965075.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii]
gi|300167308|gb|EFJ33913.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii]
Length = 233
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%)
Query: 13 SAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAK 72
S S ++ D +P CI V + I + ++ +IT V+ + V V + APA
Sbjct: 117 SLSSMAAETNPQDAPVPPCISQTEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAA 176
Query: 73 DGEANAALLEYMSSLIHVSVGEF 95
GEAN LLEYM+ ++ + V +
Sbjct: 177 RGEANNELLEYMAKVLSLRVTQM 199
>gi|401428443|ref|XP_003878704.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494953|emb|CBZ30256.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 204
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
C+ V P + +T+HAKP +++ + +T EA ++I AP DG+ANA LL Y+
Sbjct: 14 CLVQVQPGAYLLTVHAKPSARASAFAAPVTPTLTEA-DLRIAAPPVDGQANAELLRYLGE 72
Query: 87 LI 88
LI
Sbjct: 73 LI 74
>gi|428182068|gb|EKX50930.1| hypothetical protein GUITHDRAFT_135010 [Guillardia theta CCMP2712]
Length = 722
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 6/61 (9%)
Query: 39 SVSITIHAKPGSKSCSITDVSD------EAVGVQIDAPAKDGEANAALLEYMSSLIHVSV 92
SV +H KPG+K IT+ D + VQI AP +DGEAN ++ +++SL +V
Sbjct: 350 SVQTALHVKPGAKVTRITNAEDIRTRRAGFIDVQIAAPPRDGEANEEVVAFIASLFNVKR 409
Query: 93 G 93
G
Sbjct: 410 G 410
>gi|384253020|gb|EIE26495.1| DUF167-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 120
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 26 ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGE 75
+NLP+ ++ + SVSI++H KPGSK S + +++ +V + I APA++GE
Sbjct: 18 KNLPTYVKQLSEDSVSISVHCKPGSKVNSFS-ITEGSVSLAISAPAREGE 66
>gi|341898004|gb|EGT53939.1| hypothetical protein CAEBREN_29926 [Caenorhabditis brenneri]
Length = 125
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 30/56 (53%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ + IHAKPG+K + + D V V I AP ++G AN L+ Y+ + + E
Sbjct: 35 IGLRIHAKPGAKKSCVVAIGDSEVDVAIGAPPREGAANEELISYLMQALGLRKNEL 90
>gi|157875650|ref|XP_001686209.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129283|emb|CAJ07823.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 204
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
C+ V P +T+HAKP +++ + +T V EA ++I AP +G+ANA LL Y+
Sbjct: 14 CLVQVQPGKYLLTVHAKPSARASAFAAPLTPVLTEA-DLRIAAPPVEGQANAELLRYLDE 72
Query: 87 LI 88
LI
Sbjct: 73 LI 74
>gi|296491729|tpg|DAA33762.1| TPA: C21H15orf40 protein-like [Bos taurus]
Length = 107
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
P ++ IH K GSK ++TDV+ EAV V I P +GEAN L +S ++ + +
Sbjct: 10 PKGGFTMAIHDKAGSKQNAMTDVTTEAVSVGIAGPPIEGEANVELCCCLSKILELRTSDV 69
Query: 96 V 96
V
Sbjct: 70 V 70
>gi|226942094|ref|YP_002797168.1| hypothetical protein LHK_03181 [Laribacter hongkongensis HLHK9]
gi|254801627|sp|C1D6C4.1|Y3181_LARHH RecName: Full=UPF0235 protein LHK_03181
gi|226717021|gb|ACO76159.1| Uncharacterized conserved protein [Laribacter hongkongensis
HLHK9]
Length = 97
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 34/53 (64%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
V +T+H +PG++ + + +A+ +++ AP DG+ANA LL +++ + VS
Sbjct: 10 GVRLTLHVQPGARRTEVAGLHGDALKIRLAAPPVDGKANACLLAFLARGLGVS 62
>gi|449268046|gb|EMC78919.1| UPF0235 protein C15orf40 like protein, partial [Columba livia]
Length = 75
Score = 43.1 bits (100), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%)
Query: 55 ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+TDV+ EAVGV I AP +GEANA L Y+S ++ V E
Sbjct: 1 VTDVTAEAVGVAIAAPPSEGEANAELCRYLSKVLEVKKSEV 41
>gi|167999197|ref|XP_001752304.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696699|gb|EDQ83037.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 231
Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ IT V+ + V V + APA GEAN LLEY
Sbjct: 127 QDAPVPPCISQIDGGLVQVAIEVEDRAQRSQITRVNADDVRVTVAAPAARGEANNELLEY 186
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 187 MGKVLGLRLTQM 198
>gi|242279866|ref|YP_002991995.1| hypothetical protein Desal_2400 [Desulfovibrio salexigens DSM
2638]
gi|242122760|gb|ACS80456.1| protein of unknown function DUF167 [Desulfovibrio salexigens DSM
2638]
Length = 106
Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 38/61 (62%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSL 87
LPS IR S +++ +PG+K+ IT ++V V+I+APA D +AN AL ++++
Sbjct: 8 LPSYIRPCGHGSWRVSVWVQPGAKNEGITGEYQDSVRVRINAPAVDNKANKALAAFVATR 67
Query: 88 I 88
+
Sbjct: 68 L 68
>gi|116784383|gb|ABK23322.1| unknown [Picea sitchensis]
gi|224284776|gb|ACN40118.1| unknown [Picea sitchensis]
Length = 232
Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLEY
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEY 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLTQM 199
>gi|422348904|ref|ZP_16429796.1| TIGR00251 family protein [Sutterella wadsworthensis 2_1_59BFAA]
gi|404658956|gb|EKB31818.1| TIGR00251 family protein [Sutterella wadsworthensis 2_1_59BFAA]
Length = 112
Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 33/50 (66%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
I +HA+PG+K ++ V + + + + +P DG+ANA L++Y+S + VS
Sbjct: 25 IAVHAQPGAKRTAVVGVHGDRLKIALASPPVDGKANATLIKYLSKGLGVS 74
>gi|388455770|ref|ZP_10138065.1| hypothetical protein FdumT_04323 [Fluoribacter dumoffii Tex-KL]
Length = 91
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 16/59 (27%), Positives = 35/59 (59%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V++ ++ +PG+K I + + + ++++ P +G AN ALL+Y++ L V V + +
Sbjct: 8 KDVTLHLYVQPGAKKTEIIGIHEGELKIRLNTPPIEGRANKALLQYIAQLFKVPVRQVI 66
>gi|428225074|ref|YP_007109171.1| hypothetical protein GEI7407_1628 [Geitlerinema sp. PCC 7407]
gi|427984975|gb|AFY66119.1| protein of unknown function DUF167 [Geitlerinema sp. PCC 7407]
Length = 75
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 33/51 (64%)
Query: 41 SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+IT+ KP +K IT SD ++ +Q+ AP DG+ANAAL+E ++ VS
Sbjct: 3 TITVKVKPRAKVQKITTASDGSLIIQLSAPPVDGKANAALIEILADHFGVS 53
>gi|168035145|ref|XP_001770071.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678597|gb|EDQ65053.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 232
Score = 42.7 bits (99), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ IT V+ + V V + APA GEAN LLEY
Sbjct: 128 QDAPVPPCISQIDGGLVQVGIEVEDRAQRSQITRVNADDVRVTVAAPAARGEANNELLEY 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLTQM 199
>gi|302756509|ref|XP_002961678.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii]
gi|300170337|gb|EFJ36938.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii]
Length = 233
Score = 42.4 bits (98), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%)
Query: 13 SAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAK 72
S S ++ D +P CI V + I + ++ +IT V+ + V V + APA
Sbjct: 117 SLSSVAAETNPQDAPVPPCISQTEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAA 176
Query: 73 DGEANAALLEYMSSLIHVSVGEF 95
GEAN LLEYM+ ++ + +
Sbjct: 177 RGEANNELLEYMAKVLSLRATQM 199
>gi|116780073|gb|ABK21543.1| unknown [Picea sitchensis]
Length = 265
Score = 42.4 bits (98), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 38/71 (53%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLEY
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEY 187
Query: 84 MSSLIHVSVGE 94
M ++ + + +
Sbjct: 188 MGKVLGLRLTQ 198
>gi|167854882|ref|ZP_02477658.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755]
gi|219871635|ref|YP_002476010.1| hypothetical protein HAPS_1504 [Haemophilus parasuis SH0165]
gi|254800538|sp|B8F6W0.1|Y1504_HAEPS RecName: Full=UPF0235 protein HAPS_1504
gi|167853949|gb|EDS25187.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755]
gi|219691839|gb|ACL33062.1| conserved hypothetical protein [Haemophilus parasuis SH0165]
Length = 97
Score = 42.4 bits (98), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + I +P + I + D + + I AP DG+ANA LL+Y+S L V
Sbjct: 10 PQGIRLRIFLQPKASRDQIVGLHDNELKIAITAPPIDGQANAHLLKYLSKLFKV 63
>gi|120602738|ref|YP_967138.1| hypothetical protein Dvul_1694 [Desulfovibrio vulgaris DP4]
gi|120562967|gb|ABM28711.1| protein of unknown function DUF167 [Desulfovibrio vulgaris DP4]
Length = 108
Score = 42.4 bits (98), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 29 PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
P+ + + ++ + +PG+K + V+D + V++ APA D +AN L YM+SL+
Sbjct: 7 PAYVEAMGDGRWALLVWVQPGAKKDGLAGVADGRLRVRLSAPAVDNKANRGLERYMASLL 66
Query: 89 HV 90
V
Sbjct: 67 GV 68
>gi|54298708|ref|YP_125077.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris]
gi|53752493|emb|CAH13925.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris]
Length = 95
Score = 42.0 bits (97), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S V I I+AKP +K + +SD+ + + + A ++GEAN LL ++S +
Sbjct: 8 SHVEIAIYAKPNAKKTKLMAISDDRLHIALHAKPQEGEANNELLFFISQFFKI 60
>gi|115471615|ref|NP_001059406.1| Os07g0295200 [Oryza sativa Japonica Group]
gi|34394983|dbj|BAC84531.1| unknown protein [Oryza sativa Japonica Group]
gi|113610942|dbj|BAF21320.1| Os07g0295200 [Oryza sativa Japonica Group]
gi|215765278|dbj|BAG86975.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218199457|gb|EEC81884.1| hypothetical protein OsI_25695 [Oryza sativa Indica Group]
gi|222636860|gb|EEE66992.1| hypothetical protein OsJ_23901 [Oryza sativa Japonica Group]
Length = 232
Score = 42.0 bits (97), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 39/72 (54%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
+D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLSQM 199
>gi|46579785|ref|YP_010593.1| hypothetical protein DVU1374 [Desulfovibrio vulgaris str.
Hildenborough]
gi|387153132|ref|YP_005702068.1| hypothetical protein Deval_1269 [Desulfovibrio vulgaris RCH1]
gi|46449200|gb|AAS95852.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
Hildenborough]
gi|311233576|gb|ADP86430.1| protein of unknown function DUF167 [Desulfovibrio vulgaris RCH1]
Length = 106
Score = 42.0 bits (97), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 29 PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
P+ + + ++ + +PG+K + V+D + V++ APA D +AN L YM+SL+
Sbjct: 5 PAYVEAMGDGRWALLVWVQPGAKKDGLAGVADGRLRVRLSAPAVDNKANRGLERYMASLL 64
Query: 89 HV 90
V
Sbjct: 65 GV 66
>gi|357111014|ref|XP_003557310.1| PREDICTED: UPF0235 protein C15orf40 homolog [Brachypodium
distachyon]
Length = 232
Score = 42.0 bits (97), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 39/72 (54%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
+D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 HDAPVPPCITQLQGGLVQVAIEVEDRAQRSAITRVNADDVRVAVAAPATRGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLSQM 199
>gi|145334887|ref|NP_001078789.1| uncharacterized protein [Arabidopsis thaliana]
gi|332010366|gb|AED97749.1| uncharacterized protein [Arabidopsis thaliana]
Length = 213
Score = 42.0 bits (97), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 109 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 168
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 169 MGRVLGLRLSQM 180
>gi|398022248|ref|XP_003864286.1| hypothetical protein, conserved [Leishmania donovani]
gi|322502521|emb|CBZ37604.1| hypothetical protein, conserved [Leishmania donovani]
Length = 204
Score = 42.0 bits (97), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 5/62 (8%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
C+ V P +T+HAKP +++ + +T EA ++I AP +G+ANA LL Y+
Sbjct: 14 CLVQVQPGKYLLTVHAKPSARASAFAAPLTPALTEA-DLRIAAPPVEGQANAELLRYLGE 72
Query: 87 LI 88
LI
Sbjct: 73 LI 74
>gi|146098877|ref|XP_001468495.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134072863|emb|CAM71579.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 204
Score = 42.0 bits (97), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 5/62 (8%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
C+ V P +T+HAKP +++ + +T EA ++I AP +G+ANA LL Y+
Sbjct: 14 CLVQVQPGKYLLTVHAKPSARASAFAAPLTPALTEA-DLRIAAPPVEGQANAELLRYLGE 72
Query: 87 LI 88
LI
Sbjct: 73 LI 74
>gi|9758285|dbj|BAB08809.1| unnamed protein product [Arabidopsis thaliana]
Length = 213
Score = 42.0 bits (97), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 109 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 168
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 169 MGRVLGLRLSQM 180
>gi|52842921|ref|YP_096720.1| hypothetical protein lpg2716 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|378778610|ref|YP_005187049.1| hypothetical protein lp12_2709 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
gi|52630032|gb|AAU28773.1| hypothetical protein lpg2716 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|364509425|gb|AEW52949.1| hypothetical protein lp12_2709 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
Length = 95
Score = 42.0 bits (97), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S V I I+AKP +K + +SD+ + + + A ++GEAN LL ++S +
Sbjct: 8 SHVEIAIYAKPNAKKSKLMAISDDRLHIALHAKPQEGEANNELLFFISQFFKI 60
>gi|85859583|ref|YP_461785.1| cytoplasmic protein [Syntrophus aciditrophicus SB]
gi|85722674|gb|ABC77617.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
Length = 118
Score = 42.0 bits (97), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 17/58 (29%), Positives = 33/58 (56%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ VS +H P S C++ + A+ +++ AP DG+AN LE+++ ++ V G+
Sbjct: 11 NGVSFCVHVLPRSAKCALAGAQEGALRIKLTAPPVDGKANDECLEFLAGILGVKKGQM 68
>gi|54295557|ref|YP_127972.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens]
gi|53755389|emb|CAH16885.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens]
Length = 95
Score = 42.0 bits (97), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 32/53 (60%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S V I I+AKP +K + +SD+++ + + A ++GEAN LL ++S +
Sbjct: 8 SHVEIAIYAKPNAKKSKLMAISDDSLHIALHAKPQEGEANNELLFFISQFFKI 60
>gi|297793925|ref|XP_002864847.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp.
lyrata]
gi|297310682|gb|EFH41106.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp.
lyrata]
Length = 232
Score = 42.0 bits (97), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGRVLGLRLSQM 199
>gi|255540269|ref|XP_002511199.1| conserved hypothetical protein [Ricinus communis]
gi|223550314|gb|EEF51801.1| conserved hypothetical protein [Ricinus communis]
Length = 232
Score = 41.6 bits (96), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLSQM 199
>gi|225456297|ref|XP_002283689.1| PREDICTED: uncharacterized protein LOC100247416 [Vitis vinifera]
gi|147823132|emb|CAN75279.1| hypothetical protein VITISV_030868 [Vitis vinifera]
gi|297734405|emb|CBI15652.3| unnamed protein product [Vitis vinifera]
Length = 232
Score = 41.6 bits (96), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLSQM 199
>gi|240949737|ref|ZP_04754069.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305]
gi|240295769|gb|EER46456.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305]
Length = 100
Score = 41.6 bits (96), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 30/54 (55%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P+ + + I +P + I + DE + + I AP DG ANA LL+++S L V
Sbjct: 11 PNGIRLRIFLQPKASRDQIVGLHDEELKIAITAPPVDGAANAHLLKFLSKLFKV 64
>gi|30697845|ref|NP_568972.2| uncharacterized protein [Arabidopsis thaliana]
gi|26452404|dbj|BAC43287.1| unknown protein [Arabidopsis thaliana]
gi|332010365|gb|AED97748.1| uncharacterized protein [Arabidopsis thaliana]
Length = 232
Score = 41.6 bits (96), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGRVLGLRLSQM 199
>gi|340053341|emb|CCC47629.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 160
Score = 41.6 bits (96), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 11/63 (17%)
Query: 29 PSCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMS 85
P C RL+ +HAKPG+++ S+ D ++ V+I AP DG+AN L+ +M
Sbjct: 12 PGCFRLI--------VHAKPGARATSLACRPAPGDASLEVKIAAPPVDGKANVELVAFMQ 63
Query: 86 SLI 88
+L+
Sbjct: 64 ALL 66
>gi|33151941|ref|NP_873294.1| hypothetical protein HD0778 [Haemophilus ducreyi 35000HP]
gi|47117473|sp|Q7VN15.1|Y778_HAEDU RecName: Full=UPF0235 protein HD_0778
gi|33148163|gb|AAP95683.1| conserved hypothetical protein [Haemophilus ducreyi 35000HP]
Length = 97
Score = 41.6 bits (96), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + I +P + I + D + V I AP DG ANA LL+Y+S L V
Sbjct: 10 PYGIRLRIFLQPKASRDQIVGLHDNELKVAITAPPVDGAANAYLLKYLSKLFKV 63
>gi|84996333|ref|XP_952888.1| proton translocating inorganic pyrophosphatase [Theileria annulata
strain Ankara]
gi|65303885|emb|CAI76264.1| proton translocating inorganic pyrophosphatase, putative [Theileria
annulata]
Length = 1204
Score = 41.6 bits (96), Expect = 0.065, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 30/46 (65%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+ + ++ KPGS+ I S+ + VQI AP ++GE N AL+E++S
Sbjct: 1096 ILLKVNVKPGSRQTQIIGESEGRLSVQIAAPPREGECNKALIEFIS 1141
>gi|255630115|gb|ACU15411.1| unknown [Glycine max]
Length = 225
Score = 41.6 bits (96), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISHLEGGLVQLAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLSQM 199
>gi|372487286|ref|YP_005026851.1| hypothetical protein Dsui_0600 [Dechlorosoma suillum PS]
gi|359353839|gb|AEV25010.1| TIGR00251 family protein [Dechlorosoma suillum PS]
Length = 96
Score = 41.2 bits (95), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 31/44 (70%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+T+H +PG+K + + +A+ +++ AP DG+ANAALL +++
Sbjct: 13 LTLHIQPGAKKTEVCGLHGDALKIRLAAPPVDGKANAALLAFVA 56
>gi|351724451|ref|NP_001237058.1| uncharacterized protein LOC100306581 [Glycine max]
gi|255628955|gb|ACU14822.1| unknown [Glycine max]
Length = 227
Score = 41.2 bits (95), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 38/71 (53%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISHLEGGLVQLAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGE 94
M ++ + + +
Sbjct: 188 MGKVLGLRLSQ 198
>gi|257465218|ref|ZP_05629589.1| hypothetical protein AM202_01815 [Actinobacillus minor 202]
gi|257450878|gb|EEV24921.1| hypothetical protein AM202_01815 [Actinobacillus minor 202]
Length = 100
Score = 41.2 bits (95), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + I +P + I + DE + + I AP DG ANA LL+++S L V
Sbjct: 11 PDGIRLRIFLQPKASRDQIVGLHDEELKIAITAPPVDGAANAHLLKFLSKLFKV 64
>gi|167630164|ref|YP_001680663.1| hypothetical protein HM1_2095 [Heliobacterium modesticaldum Ice1]
gi|259646567|sp|B0TGP1.1|Y2027_HELMI RecName: Full=UPF0235 protein Helmi_20270
gi|167592904|gb|ABZ84652.1| conserved hypothetical protein, uncharacterized acr, yggu family
[Heliobacterium modesticaldum Ice1]
Length = 96
Score = 41.2 bits (95), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 33/56 (58%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
P S+ I +P + + + D+A+ V++ AP DGEANAA L++++ + +S
Sbjct: 8 PGGSIRFRIRVQPRASKNEVCGLLDDALKVRLTAPPVDGEANAACLQFIAKTLGLS 63
>gi|442771071|gb|AGC71769.1| hypothetical protein [uncultured bacterium A1Q1_fos_504]
Length = 98
Score = 41.2 bits (95), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSL 87
+P+ +R V + I+ +PG+K + D A+ ++I+AP DG+ANAA++ +++
Sbjct: 3 IPTWLRECA-DGVEVFIYVQPGAKKTELAGDHDGALKLRINAPPVDGKANAAVIAFLADR 61
Query: 88 IHVSVGEFV 96
VS G+ +
Sbjct: 62 CGVSKGKVL 70
>gi|397668393|ref|YP_006509930.1| hypothetical protein LPV_3066 [Legionella pneumophila subsp.
pneumophila]
gi|395131804|emb|CCD10097.1| conserved protein of unknown function [Legionella pneumophila
subsp. pneumophila]
Length = 95
Score = 41.2 bits (95), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 18/60 (30%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+++ P V I I+AKP +K + +SD+ + + + A ++GEAN LL ++S +
Sbjct: 2 WFKIIHPH-VEIAIYAKPNAKKTKLMAISDDRLHIALHAKPQEGEANNELLFFISQFFKI 60
>gi|11499654|ref|NP_070896.1| hypothetical protein AF2072 [Archaeoglobus fulgidus DSM 4304]
gi|29839698|sp|O28207.1|Y2072_ARCFU RecName: Full=UPF0235 protein AF_2072
gi|2648454|gb|AAB89177.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length = 78
Score = 40.8 bits (94), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 40 VSITIHAKPGSKSCSIT-DVSDEAVGVQIDAPAKDGEANAALL 81
V I++H PGSK S + D AV V+I +PAK+G+AN LL
Sbjct: 8 VLISVHVSPGSKEVSFSYDEWRRAVEVRIKSPAKEGKANRELL 50
>gi|436842528|ref|YP_007326906.1| conserved protein of unknown function [Desulfovibrio
hydrothermalis AM13 = DSM 14728]
gi|432171434|emb|CCO24807.1| conserved protein of unknown function [Desulfovibrio
hydrothermalis AM13 = DSM 14728]
Length = 106
Score = 40.8 bits (94), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 37/61 (60%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSL 87
LPS IR S +++ +PG+++ +T +V V+I+APA D +AN AL ++++
Sbjct: 8 LPSYIRPCGSRSWKLSVWVQPGARTEGVTGEYQGSVRVRINAPAVDNKANKALARFVAAR 67
Query: 88 I 88
+
Sbjct: 68 L 68
>gi|145590207|ref|YP_001156804.1| hypothetical protein Pnuc_2029 [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
gi|145048613|gb|ABP35240.1| protein of unknown function DUF167 [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
Length = 98
Score = 40.8 bits (94), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 14/54 (25%), Positives = 32/54 (59%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P+ +++ +H +PG+K + + D + + + APA + +AN LL ++S + +
Sbjct: 10 PTGITLNLHCQPGAKVTKVVGLHDGCLKISLQAPAIENKANELLLGWLSKQLKI 63
>gi|149057381|gb|EDM08704.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus
norvegicus]
gi|149057386|gb|EDM08709.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus
norvegicus]
Length = 114
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 12/57 (21%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
V+I IHAKPGSK ++T AP +GEANA L Y+S ++ + + V
Sbjct: 36 VTIAIHAKPGSKQNAVT------------APPSEGEANAELCRYLSKVLDLRKSDVV 80
>gi|30697842|ref|NP_851256.1| uncharacterized protein [Arabidopsis thaliana]
gi|16226478|gb|AAL16178.1|AF428410_1 AT5g63440/MLE2_7 [Arabidopsis thaliana]
gi|22137224|gb|AAM91457.1| AT5g63440/MLE2_7 [Arabidopsis thaliana]
gi|332010364|gb|AED97747.1| uncharacterized protein [Arabidopsis thaliana]
Length = 205
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 37/68 (54%)
Query: 25 DENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYM 84
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+M
Sbjct: 129 DAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEFM 188
Query: 85 SSLIHVSV 92
+ + SV
Sbjct: 189 GRVPNQSV 196
>gi|167386103|ref|XP_001737619.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899553|gb|EDR26129.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 118
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 30/49 (61%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S+V I I KP +K+ + V D + V IDAP DG+AN ++ +M+S
Sbjct: 31 SNVIIEIEIKPNAKTSELQGVEDGILKVAIDAPPIDGKANTEVIAFMAS 79
>gi|83648992|ref|YP_437427.1| hypothetical protein HCH_06357 [Hahella chejuensis KCTC 2396]
gi|83637035|gb|ABC33002.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
Length = 102
Score = 40.8 bits (94), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 18/60 (30%), Positives = 32/60 (53%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
C+ L ++ + H +PG+K I +A+ ++I AP DG AN L+ +++ L V
Sbjct: 9 CVSLQDEQTLILQCHLQPGAKKDEIVGTHGDALKIKISAPPIDGRANQQLVRFLAKLCRV 68
>gi|387769243|ref|ZP_10125509.1| TIGR00251 family protein [Pasteurella bettyae CCUG 2042]
gi|386907199|gb|EIJ71914.1| TIGR00251 family protein [Pasteurella bettyae CCUG 2042]
Length = 95
Score = 40.8 bits (94), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 28/51 (54%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + I +P + I + DE + V I AP DG ANA LL+Y+S L V
Sbjct: 11 LRLKIFLQPKASKDQIIGIHDEELKVSITAPPVDGAANAHLLKYLSKLFKV 61
>gi|407693704|ref|YP_006818493.1| hypothetical protein ASU2_10630 [Actinobacillus suis H91-0380]
gi|407389761|gb|AFU20254.1| hypothetical protein ASU2_10630 [Actinobacillus suis H91-0380]
Length = 99
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 30/58 (51%)
Query: 33 RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
R+ P + + I +P + I + D + + I AP DG ANA LL+Y+S L V
Sbjct: 6 RIENPCGIRLRIFLQPKASRDQIVGLHDNELKIAITAPPVDGAANAHLLKYLSKLFKV 63
>gi|388510890|gb|AFK43511.1| unknown [Lotus japonicus]
Length = 232
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 37/72 (51%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISHLEGGLVQVAIEVDDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLSQM 199
>gi|406940280|gb|EKD73095.1| hypothetical protein ACD_45C00464G0002 [uncultured bacterium]
Length = 101
Score = 40.4 bits (93), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 16/57 (28%), Positives = 34/57 (59%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
++++I KP +K +I + D+A+ + + A G+AN AL+ Y++ L ++ E +
Sbjct: 10 INLSILVKPNAKRTAILKIDDQALTIALHATPHYGKANQALIAYLADLFQLTKKEVI 66
>gi|165976805|ref|YP_001652398.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae
serovar 3 str. JL03]
gi|226734144|sp|B0BQW9.1|Y1398_ACTPJ RecName: Full=UPF0235 protein APJL_1398
gi|165876906|gb|ABY69954.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae
serovar 3 str. JL03]
Length = 98
Score = 40.4 bits (93), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 30/58 (51%)
Query: 33 RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
R+ P + + I +P + I + D + + I AP DG ANA LL+Y+S L V
Sbjct: 6 RIENPYGIRLRIFLQPKASRDQIVGLHDSELKIAITAPPVDGAANAHLLKYLSKLFKV 63
>gi|71909498|ref|YP_287085.1| hypothetical protein Daro_3887 [Dechloromonas aromatica RCB]
gi|123626353|sp|Q478W6.1|Y3887_DECAR RecName: Full=UPF0235 protein Daro_3887
gi|71849119|gb|AAZ48615.1| Conserved hypothetical protein 251 [Dechloromonas aromatica RCB]
Length = 97
Score = 40.4 bits (93), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 7/74 (9%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS-- 85
+ R +++T+H +PG+K + +A+ +++ AP DG+AN AL+ +++
Sbjct: 1 MSDWFRQAANGCITLTLHIQPGAKKSEFAGLHGDALKIRLAAPPVDGKANEALIRFIADA 60
Query: 86 -----SLIHVSVGE 94
S +H+ G+
Sbjct: 61 LGLAKSAVHLKSGQ 74
>gi|133930345|ref|NP_001076616.1| Protein W01A8.2 [Caenorhabditis elegans]
gi|114420882|emb|CAL44973.1| Protein W01A8.2 [Caenorhabditis elegans]
Length = 127
Score = 40.4 bits (93), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 30/56 (53%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ + IHAKPG+K + + D V V I A ++G AN L+ Y+ S + + E
Sbjct: 37 IGLHIHAKPGAKKSCVVAIGDSEVDVAIGAAPREGAANEELISYLMSALGLRKNEL 92
>gi|190150700|ref|YP_001969225.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae
serovar 7 str. AP76]
gi|307264052|ref|ZP_07545650.1| hypothetical protein appser13_14550 [Actinobacillus
pleuropneumoniae serovar 13 str. N273]
gi|226734158|sp|B3GYF9.1|Y1431_ACTP7 RecName: Full=UPF0235 protein APP7_1431
gi|189915831|gb|ACE62083.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae
serovar 7 str. AP76]
gi|306870598|gb|EFN02344.1| hypothetical protein appser13_14550 [Actinobacillus
pleuropneumoniae serovar 13 str. N273]
Length = 97
Score = 40.4 bits (93), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 30/58 (51%)
Query: 33 RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
R+ P + + I +P + I + D + + I AP DG ANA LL+Y+S L V
Sbjct: 6 RIENPYGIRLRIFLQPKASRDQIVGLHDSELKIAITAPPVDGAANAHLLKYLSKLFKV 63
>gi|374261190|ref|ZP_09619777.1| hypothetical protein LDG_6154 [Legionella drancourtii LLAP12]
gi|363538577|gb|EHL31984.1| hypothetical protein LDG_6154 [Legionella drancourtii LLAP12]
Length = 91
Score = 40.0 bits (92), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 17/58 (29%), Positives = 33/58 (56%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ I ++ +PG+K I EA+ +++ AP +G AN ALL++++ + V + V
Sbjct: 9 QIIINLYIQPGAKHTEIAGFHGEALKIRLHAPPIEGRANEALLKFIAQIFAVPTRQVV 66
>gi|154705806|ref|YP_001424486.1| hypothetical cytosolic protein [Coxiella burnetii Dugway
5J108-111]
gi|154355092|gb|ABS76554.1| hypothetical cytosolic protein [Coxiella burnetii Dugway
5J108-111]
Length = 92
Score = 40.0 bits (92), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 14/53 (26%), Positives = 35/53 (66%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
S+ +T++ +PG+K I+ E + +++ AP +G+AN AL+++++ + ++
Sbjct: 4 SIILTVYIQPGAKQTQISGKHGEHIKIRLQAPPTEGKANKALIDFLAQRLKLN 56
>gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans
Length = 263
Score = 40.0 bits (92), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 30/56 (53%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ + IHAKPG+K + + D V V I A ++G AN L+ Y+ S + + E
Sbjct: 37 IGLHIHAKPGAKKSCVVAIGDSEVDVAIGAAPREGAANEELISYLMSALGLRKNEL 92
>gi|148674974|gb|EDL06921.1| RIKEN cDNA 3110040N11, isoform CRA_c [Mus musculus]
Length = 114
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 42/92 (45%), Gaps = 18/92 (19%)
Query: 8 KSKAKSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
K+ A S G Q+K E P V P V+I IHAKPGS+ ++T
Sbjct: 4 KAGATSKGKNQTK---EPETAPPAAGPVATDPKGFVTIAIHAKPGSRQNAVT-------- 52
Query: 65 VQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
AP GEANA L Y+S ++ + + V
Sbjct: 53 ----APPSQGEANAELCRYLSKVLDLRKSDVV 80
>gi|291615210|ref|YP_003525367.1| hypothetical protein Slit_2755 [Sideroxydans lithotrophicus ES-1]
gi|291585322|gb|ADE12980.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus
ES-1]
Length = 94
Score = 40.0 bits (92), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 14/46 (30%), Positives = 31/46 (67%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+++T+H +PG+K + + A+ +++ AP +G AN ALL++++
Sbjct: 12 LTLTLHIQPGAKRTEVAGLHGAALKIRLAAPPIEGRANEALLKFIA 57
>gi|394990021|ref|ZP_10382853.1| hypothetical protein SCD_02446 [Sulfuricella denitrificans skB26]
gi|393790286|dbj|GAB72492.1| hypothetical protein SCD_02446 [Sulfuricella denitrificans skB26]
Length = 99
Score = 40.0 bits (92), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 13/47 (27%), Positives = 31/47 (65%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+++T+H +PG+K + + +A+ +++ A A +G+AN LL+++
Sbjct: 11 RLTLTVHVQPGAKRTEVIGLHGDALKIRVAAAAVEGQANTRLLDFLR 57
>gi|416067426|ref|ZP_11582302.1| hypothetical protein D18P1_0403 [Aggregatibacter
actinomycetemcomitans serotype f str. D18P1]
gi|348002128|gb|EGY42839.1| hypothetical protein D18P1_0403 [Aggregatibacter
actinomycetemcomitans serotype f str. D18P1]
Length = 97
Score = 39.7 bits (91), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+++ + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 ANLRLRIFLQPKTAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|442320552|ref|YP_007360573.1| hypothetical protein MYSTI_03583 [Myxococcus stipitatus DSM
14675]
gi|441488194|gb|AGC44889.1| hypothetical protein MYSTI_03583 [Myxococcus stipitatus DSM
14675]
Length = 98
Score = 39.7 bits (91), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P V + + +P + + D + +Q+ AP DGEANAALLE+++ + V
Sbjct: 10 PEGVELALLIQPRASRTRVVGEHDGLLKIQLAAPPVDGEANAALLEFLAKKLGV 63
>gi|444348227|ref|ZP_21155939.1| hypothetical protein S23A_0823 [Aggregatibacter
actinomycetemcomitans serotype b str. S23A]
gi|443547513|gb|ELT56994.1| hypothetical protein S23A_0823 [Aggregatibacter
actinomycetemcomitans serotype b str. S23A]
Length = 73
Score = 39.7 bits (91), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+++ + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 ANLRLRIFLQPKAAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|387120571|ref|YP_006286454.1| hypothetical protein D7S_00821 [Aggregatibacter
actinomycetemcomitans D7S-1]
gi|416037652|ref|ZP_11573989.1| hypothetical protein H5P1_1770 [Aggregatibacter
actinomycetemcomitans serotype a str. H5P1]
gi|416045179|ref|ZP_11575274.1| hypothetical protein I63B_0735 [Aggregatibacter
actinomycetemcomitans serotype d str. I63B]
gi|416055471|ref|ZP_11579550.1| hypothetical protein SCC393_0537 [Aggregatibacter
actinomycetemcomitans serotype e str. SCC393]
gi|429734958|ref|ZP_19268955.1| TIGR00251 family protein [Aggregatibacter actinomycetemcomitans
Y4]
gi|444334140|ref|ZP_21149763.1| hypothetical protein A160_0973 [Aggregatibacter
actinomycetemcomitans serotype a str. A160]
gi|347995606|gb|EGY36772.1| hypothetical protein H5P1_1770 [Aggregatibacter
actinomycetemcomitans serotype a str. H5P1]
gi|347995660|gb|EGY36821.1| hypothetical protein I63B_0735 [Aggregatibacter
actinomycetemcomitans serotype d str. I63B]
gi|348003148|gb|EGY43804.1| hypothetical protein SCC393_0537 [Aggregatibacter
actinomycetemcomitans serotype e str. SCC393]
gi|385875063|gb|AFI86622.1| hypothetical protein D7S_00821 [Aggregatibacter
actinomycetemcomitans D7S-1]
gi|429150724|gb|EKX93620.1| TIGR00251 family protein [Aggregatibacter actinomycetemcomitans
Y4]
gi|443550821|gb|ELT58954.1| hypothetical protein A160_0973 [Aggregatibacter
actinomycetemcomitans serotype a str. A160]
Length = 97
Score = 39.7 bits (91), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+++ + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 ANLRLRIFLQPKAAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|440299717|gb|ELP92265.1| hypothetical protein EIN_118690 [Entamoeba invadens IP1]
Length = 129
Score = 39.7 bits (91), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 32/47 (68%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
V+I I+ KP S++ I + D + + IDAP DG+AN+ +++++++
Sbjct: 46 VTIEINVKPNSRNSEIQGIEDGLLKIAIDAPPVDGKANSEVVDFIAT 92
>gi|365966924|ref|YP_004948486.1| hypothetical protein ANH9381_0735 [Aggregatibacter
actinomycetemcomitans ANH9381]
gi|416074797|ref|ZP_11584695.1| hypothetical protein SCC1398_0565 [Aggregatibacter
actinomycetemcomitans serotype b str. SCC1398]
gi|416084621|ref|ZP_11587051.1| hypothetical protein I23C_0861 [Aggregatibacter
actinomycetemcomitans serotype b str. I23C]
gi|444337775|ref|ZP_21151706.1| hypothetical protein SCC4092_0652 [Aggregatibacter
actinomycetemcomitans serotype b str. SCC4092]
gi|348006605|gb|EGY47008.1| hypothetical protein SCC1398_0565 [Aggregatibacter
actinomycetemcomitans serotype b str. SCC1398]
gi|348010355|gb|EGY50407.1| hypothetical protein I23C_0861 [Aggregatibacter
actinomycetemcomitans serotype b str. I23C]
gi|365745837|gb|AEW76742.1| hypothetical protein ANH9381_0735 [Aggregatibacter
actinomycetemcomitans ANH9381]
gi|443546317|gb|ELT55992.1| hypothetical protein SCC4092_0652 [Aggregatibacter
actinomycetemcomitans serotype b str. SCC4092]
Length = 97
Score = 39.7 bits (91), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+++ + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 ANLRLRIFLQPKAAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|378821593|ref|ZP_09844473.1| TIGR00251 family protein [Sutterella parvirubra YIT 11816]
gi|378599538|gb|EHY32546.1| TIGR00251 family protein [Sutterella parvirubra YIT 11816]
Length = 109
Score = 39.3 bits (90), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 32/53 (60%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ V++ +H +PG+K + E + V + AP DG+AN AL+++++ + V
Sbjct: 19 AGVTVALHVQPGAKRSRVVGEHGERLKVALQAPPVDGKANQALVKFLAGVAGV 71
>gi|336466937|gb|EGO55101.1| hypothetical protein NEUTE1DRAFT_85194 [Neurospora tetrasperma
FGSC 2508]
gi|350288454|gb|EGZ69690.1| YggU-like protein [Neurospora tetrasperma FGSC 2509]
Length = 130
Score = 39.3 bits (90), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 2/54 (3%)
Query: 39 SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++ I H KPG+ +T ++DEAV + + A AK+GEAN A+++ +S +++
Sbjct: 27 TIYIHCHVKPGASKTREGVTSITDEAVEICVAAQAKEGEANKAVVKVLSEALNL 80
>gi|452944550|ref|YP_007500715.1| protein of unknown function DUF167 [Hydrogenobaculum sp. HO]
gi|452882968|gb|AGG15672.1| protein of unknown function DUF167 [Hydrogenobaculum sp. HO]
Length = 73
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 16/50 (32%), Positives = 30/50 (60%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+ + KP +K+ S+ + D+++ + I AP DG+AN L+ +S + VS
Sbjct: 3 LRVKVKPNAKTVSVEQLEDKSLKISIKAPPVDGKANEELINVLSEFLKVS 52
>gi|340518653|gb|EGR48893.1| predicted protein [Trichoderma reesei QM6a]
Length = 121
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 36 PPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + +H KPG+ I V+++A+ + + A AKDGEAN A++E +S ++ +
Sbjct: 19 PQGVLQLRLHVKPGASRNREGIQAVTEDAIELCVAAQAKDGEANQAVIEVLSEVLDI 75
>gi|342180720|emb|CCC90196.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 165
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 34/56 (60%), Gaps = 3/56 (5%)
Query: 37 PSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSSLIH 89
P + + AKPG+++ ++ + D+A+ V++ AP DG+AN L+ +M +L+
Sbjct: 12 PGCFHLAVRAKPGARTTALAARPQIIDDALEVRVAAPPVDGKANTELICFMQALLE 67
>gi|56476391|ref|YP_157980.1| hypothetical protein ebA1762 [Aromatoleum aromaticum EbN1]
gi|81358142|sp|Q5P6I2.1|Y954_AZOSE RecName: Full=UPF0235 protein AZOSEA09540
gi|56312434|emb|CAI07079.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
Length = 97
Score = 39.3 bits (90), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 18/60 (30%), Positives = 34/60 (56%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+R S+ +++H +PG+K EA+ +++ AP DG+ANAAL ++++ V
Sbjct: 3 WLREAADGSLVLSLHVQPGAKKTEFVGPHGEAMKLRLAAPPVDGKANAALTVFLAAFCGV 62
>gi|255022260|ref|ZP_05294254.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC
51756]
gi|340783351|ref|YP_004749958.1| hypothetical protein Atc_2609 [Acidithiobacillus caldus SM-1]
gi|254968316|gb|EET25884.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC
51756]
gi|340557502|gb|AEK59256.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
Length = 115
Score = 39.3 bits (90), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 16/56 (28%), Positives = 34/56 (60%)
Query: 41 SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
++T+ +PG++ + +A+ +++ A A DG ANAALL +++ + + G+ V
Sbjct: 14 TLTVQVQPGARDDCVVGYHGDALKIRLRARAVDGAANAALLSFLARRLDLGPGQVV 69
>gi|301774793|ref|XP_002922805.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40-like
[Ailuropoda melanoleuca]
Length = 129
Score = 39.3 bits (90), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 17/75 (22%)
Query: 33 RLVPP---------SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAK--------DGE 75
RL+PP V+I IHAKPGSK + TDV+ + V V I AP +GE
Sbjct: 20 RLLPPLGPVAVDPKGCVTIAIHAKPGSKQNATTDVTAKVVSVAITAPPPTPPAPRQSEGE 79
Query: 76 ANAALLEYMSSLIHV 90
ANA L +S ++ +
Sbjct: 80 ANAELSWCLSKVLEL 94
>gi|374313030|ref|YP_005059460.1| UPF0235 protein yggU [Granulicella mallensis MP5ACTX8]
gi|358755040|gb|AEU38430.1| UPF0235 protein yggU [Granulicella mallensis MP5ACTX8]
Length = 99
Score = 39.3 bits (90), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 15/54 (27%), Positives = 30/54 (55%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P S+ + PG+K ++T D ++ + + P DG AN AL+ ++S +++
Sbjct: 9 PGGCSLPVRVHPGAKQNAVTGTHDGSLKISLTTPPTDGRANTALIAFLSDRLNI 62
>gi|223944751|gb|ACN26459.1| unknown [Zea mays]
gi|414884282|tpg|DAA60296.1| TPA: COG1872 containing protein, Uncharacterized ACR, YggU family
[Zea mays]
Length = 194
Score = 39.3 bits (90), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 39/72 (54%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
+D +P CI + V + I + ++ +IT V+ + V V + A A GEAN+ LLE+
Sbjct: 90 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAALAARGEANSELLEF 149
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 150 MGKVLGLRLTQM 161
>gi|310822209|ref|YP_003954567.1| hypothetical protein STAUR_4962 [Stigmatella aurantiaca DW4/3-1]
gi|309395281|gb|ADO72740.1| conserved uncharacterized protein [Stigmatella aurantiaca
DW4/3-1]
Length = 98
Score = 39.3 bits (90), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 17/58 (29%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+P ++++P V + + +P + + D + +Q+ AP DGEANAAL+E+++
Sbjct: 1 MPPWLKVLP-EGVELAVLVQPRASRTRVVGEHDGMLKLQLAAPPVDGEANAALVEFLA 57
>gi|212723588|ref|NP_001132046.1| uncharacterized protein LOC100193457 [Zea mays]
gi|194693286|gb|ACF80727.1| unknown [Zea mays]
gi|414884279|tpg|DAA60293.1| TPA: hypothetical protein ZEAMMB73_531342 [Zea mays]
gi|414884280|tpg|DAA60294.1| TPA: hypothetical protein ZEAMMB73_531342 [Zea mays]
Length = 213
Score = 39.3 bits (90), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 39/73 (53%)
Query: 16 STQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGE 75
S ++ +D +P CI + V + I + ++ +IT V+ + V V + A A GE
Sbjct: 120 SVAAETNPHDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAALAARGE 179
Query: 76 ANAALLEYMSSLI 88
AN+ LLE+M +I
Sbjct: 180 ANSELLEFMGKVI 192
>gi|317153487|ref|YP_004121535.1| hypothetical protein Daes_1777 [Desulfovibrio aespoeensis Aspo-2]
gi|316943738|gb|ADU62789.1| protein of unknown function DUF167 [Desulfovibrio aespoeensis
Aspo-2]
Length = 102
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 16/55 (29%), Positives = 34/55 (61%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
I + +PG++ + V + V +++ APA D +AN AL+ +++S+++V + V
Sbjct: 16 IAVWVQPGARKSEVAGVYQQCVKIRLCAPAVDNKANKALVAFVASVLNVKKSQVV 70
>gi|66813694|ref|XP_641026.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4]
gi|74855711|sp|Q54UW1.1|U235_DICDI RecName: Full=UPF0235 protein
gi|60469052|gb|EAL67049.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4]
Length = 124
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 15/51 (29%), Positives = 30/51 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ I ++ P SK SI D+ + ++I P DG+AN ++E++S +++
Sbjct: 32 IKINVNVHPNSKESSIVSFEDQILSLRISEPPIDGKANIGVIEFLSKELNI 82
>gi|442751115|gb|JAA67717.1| Hypothetical protein [Ixodes ricinus]
Length = 172
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAP 70
+V+I +HAKPG+ ITD+ + VG+QI AP
Sbjct: 59 TVAIRVHAKPGASESRITDIGTDGVGIQIAAP 90
>gi|358385705|gb|EHK23301.1| hypothetical protein TRIVIDRAFT_139499, partial [Trichoderma
virens Gv29-8]
Length = 91
Score = 38.9 bits (89), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 36 PPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + +H KPG+ I V+D+ + + + A AKDGEAN A++E +S + +
Sbjct: 15 PQGILQLRLHVKPGASKTREGIQMVTDDVIELCVAAQAKDGEANQAVIEVLSEALDI 71
>gi|322513549|ref|ZP_08066649.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC
25976]
gi|322120620|gb|EFX92514.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC
25976]
Length = 99
Score = 38.9 bits (89), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 29/58 (50%)
Query: 33 RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
R+ P + + I +P + I + D + + I P DG ANA LL+Y+S L V
Sbjct: 6 RIENPCGIRLRIFLQPKASRDQIVGLHDNELKIAITTPPVDGAANAHLLKYLSKLFKV 63
>gi|326926706|ref|XP_003209539.1| PREDICTED: UPF0235 protein C15orf40 homolog [Meleagris gallopavo]
Length = 85
Score = 38.9 bits (89), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 24/34 (70%)
Query: 57 DVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+V+ EAVGV I AP +GEANA L Y+S ++ V
Sbjct: 13 NVTAEAVGVAIAAPPSEGEANAELCRYLSKVLQV 46
>gi|429329316|gb|AFZ81075.1| proton translocating inorganic pyrophosphatase, putative [Babesia
equi]
Length = 110
Score = 38.9 bits (89), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 32/51 (62%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
SS+ + ++ KPG+K + + + +QI AP ++G N AL+EY++ ++
Sbjct: 21 SSLILKVNVKPGAKQTQVVGAQEGQLTLQIAAPPREGACNEALVEYVAEVL 71
>gi|451947238|ref|YP_007467833.1| TIGR00251 family protein [Desulfocapsa sulfexigens DSM 10523]
gi|451906586|gb|AGF78180.1| TIGR00251 family protein [Desulfocapsa sulfexigens DSM 10523]
Length = 99
Score = 38.5 bits (88), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 15/49 (30%), Positives = 32/49 (65%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ ++ +P + S + D A+ + I AP DG+ANAA++++++S ++V
Sbjct: 14 LRVYVQPRASRNSFAGLHDNAMRLTITAPPVDGKANAAVIQFLASFLNV 62
>gi|383456913|ref|YP_005370902.1| hypothetical protein COCOR_04940 [Corallococcus coralloides DSM
2259]
gi|380730113|gb|AFE06115.1| hypothetical protein COCOR_04940 [Corallococcus coralloides DSM
2259]
Length = 99
Score = 38.5 bits (88), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ V +T+ +P + + D + +Q+ AP DGEANAAL+E+++ + V
Sbjct: 11 TGVELTVLVQPRASRTKVVGEHDGQLKIQLAAPPVDGEANAALVEFIAKTLGV 63
>gi|251793930|ref|YP_003008662.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus
NJ8700]
gi|422337117|ref|ZP_16418089.1| hypothetical protein HMPREF9335_01277 [Aggregatibacter
aphrophilus F0387]
gi|247535329|gb|ACS98575.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus
NJ8700]
gi|353345669|gb|EHB89960.1| hypothetical protein HMPREF9335_01277 [Aggregatibacter
aphrophilus F0387]
Length = 97
Score = 38.5 bits (88), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + + I +P + I + D+ + ++I AP DG+ANA LL+++S L V
Sbjct: 9 ADLRLRIFLQPKAAKDHIVGLHDDELKIRITAPPIDGQANAHLLKFLSKLFKV 61
>gi|85092099|ref|XP_959226.1| hypothetical protein NCU06879 [Neurospora crassa OR74A]
gi|28920629|gb|EAA29990.1| predicted protein [Neurospora crassa OR74A]
Length = 130
Score = 38.5 bits (88), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 2/54 (3%)
Query: 39 SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++ I H KPG+ +T ++DEAV + + A AK+GEAN A+++ +S +++
Sbjct: 27 TIYIHCHVKPGASKNREGVTSITDEAVEICVAAQAKEGEANKAVVKVLSEALNL 80
>gi|289164111|ref|YP_003454249.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150]
gi|288857284|emb|CBJ11111.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150]
Length = 91
Score = 38.5 bits (88), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 14/49 (28%), Positives = 30/49 (61%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ ++ +PG+K I + + + ++++AP +G AN LL+Y++ L V
Sbjct: 12 LYLYVQPGAKKSEIVGMHEGVLKIRLNAPPIEGRANKELLKYVAQLFKV 60
>gi|194688626|gb|ACF78397.1| unknown [Zea mays]
gi|414884278|tpg|DAA60292.1| TPA: hypothetical protein ZEAMMB73_531342 [Zea mays]
Length = 232
Score = 38.5 bits (88), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 39/72 (54%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
+D +P CI + V + I + ++ +IT V+ + V V + A A GEAN+ LLE+
Sbjct: 128 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAALAARGEANSELLEF 187
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 188 MGKVLGLRLTQM 199
>gi|434400838|ref|YP_007134842.1| UPF0235 protein yggU [Stanieria cyanosphaera PCC 7437]
gi|428271935|gb|AFZ37876.1| UPF0235 protein yggU [Stanieria cyanosphaera PCC 7437]
Length = 72
Score = 38.5 bits (88), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 15/52 (28%), Positives = 33/52 (63%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+ I+I KP S+ I +++D ++ +++ +P +DG+AN L+E ++ V+
Sbjct: 1 MKISIKVKPNSQQQKIEELADGSLIIRLKSPPRDGKANQELIEMLAKKFQVA 52
>gi|401826895|ref|XP_003887540.1| hypothetical protein EHEL_070320 [Encephalitozoon hellem ATCC
50504]
gi|392998546|gb|AFM98559.1| hypothetical protein EHEL_070320 [Encephalitozoon hellem ATCC
50504]
Length = 100
Score = 38.5 bits (88), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 32/51 (62%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ +++ A+P ++S + +V+D + V I+AP D +AN L ++S ++ +
Sbjct: 11 LRLSVWARPNARSTGVREVTDTEIVVNINAPPTDNKANKELTGFVSKMLKI 61
>gi|221632637|ref|YP_002521858.1| hypothetical protein trd_0618 [Thermomicrobium roseum DSM 5159]
gi|221157137|gb|ACM06264.1| DUF167 [Thermomicrobium roseum DSM 5159]
Length = 102
Score = 38.5 bits (88), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 15/55 (27%), Positives = 30/55 (54%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVG 93
+V + +P + + +A+ V++ AP +DGEAN A+L ++ +H+ G
Sbjct: 12 AVEFWVQVQPRAPRAEVAGSRRDALLVRVTAPPRDGEANEAVLRLLAETLHLPRG 66
>gi|226529615|ref|NP_001152637.1| uncharacterized protein LOC100286278 [Zea mays]
gi|195658405|gb|ACG48670.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea
mays]
Length = 194
Score = 38.5 bits (88), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 39/72 (54%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
+D +P CI + V + I + ++ ++T V+ + V V + A A GEAN+ LLE+
Sbjct: 90 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAVTRVNADDVRVTVAALAARGEANSELLEF 149
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 150 MGKVLGLRLTQM 161
>gi|432095255|gb|ELK26515.1| hypothetical protein MDA_GLEAN10001617 [Myotis davidii]
Length = 91
Score = 38.5 bits (88), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 48 PGSKSCS-ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
PG + + + D++ EAV V I AP +GEANA L Y+S ++ + + V
Sbjct: 8 PGRRWFALVRDLTTEAVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVV 57
>gi|270157502|ref|ZP_06186159.1| conserved hypothetical protein [Legionella longbeachae D-4968]
gi|269989527|gb|EEZ95781.1| conserved hypothetical protein [Legionella longbeachae D-4968]
Length = 80
Score = 38.5 bits (88), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 14/49 (28%), Positives = 30/49 (61%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ ++ +PG+K I + + + ++++AP +G AN LL+Y++ L V
Sbjct: 1 MYLYVQPGAKKSEIVGMHEGVLKIRLNAPPIEGRANKELLKYVAQLFKV 49
>gi|416050873|ref|ZP_11577157.1| hypothetical protein SC1083_0304 [Aggregatibacter
actinomycetemcomitans serotype e str. SC1083]
gi|418464692|ref|ZP_13035631.1| hypothetical protein RHAA1_03356 [Aggregatibacter
actinomycetemcomitans RhAA1]
gi|347993686|gb|EGY35027.1| hypothetical protein SC1083_0304 [Aggregatibacter
actinomycetemcomitans serotype e str. SC1083]
gi|359756647|gb|EHK90804.1| hypothetical protein RHAA1_03356 [Aggregatibacter
actinomycetemcomitans RhAA1]
Length = 97
Score = 38.5 bits (88), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 ADLRLRIFLQPKAAKDQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|261867124|ref|YP_003255046.1| hypothetical protein D11S_0417 [Aggregatibacter
actinomycetemcomitans D11S-1]
gi|415767506|ref|ZP_11483178.1| hypothetical protein D17P2_0150 [Aggregatibacter
actinomycetemcomitans D17P-2]
gi|416107251|ref|ZP_11590338.1| hypothetical protein SCC2302_1544 [Aggregatibacter
actinomycetemcomitans serotype c str. SCC2302]
gi|444346864|ref|ZP_21154822.1| hypothetical protein AAS4A_1734 [Aggregatibacter
actinomycetemcomitans serotype c str. AAS4A]
gi|261412456|gb|ACX81827.1| hypothetical protein D11S_0417 [Aggregatibacter
actinomycetemcomitans D11S-1]
gi|348005581|gb|EGY46058.1| hypothetical protein SCC2302_1544 [Aggregatibacter
actinomycetemcomitans serotype c str. SCC2302]
gi|348658442|gb|EGY76010.1| hypothetical protein D17P2_0150 [Aggregatibacter
actinomycetemcomitans D17P-2]
gi|443541156|gb|ELT51620.1| hypothetical protein AAS4A_1734 [Aggregatibacter
actinomycetemcomitans serotype c str. AAS4A]
Length = 97
Score = 38.5 bits (88), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 AGLRLRIFLQPKAAKDQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|449706804|gb|EMD46572.1| hypothetical protein EHI5A_162950 [Entamoeba histolytica KU27]
Length = 118
Score = 38.5 bits (88), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S+V I + KP +K+ I V D + V I++P DG+AN ++ +M+S
Sbjct: 31 SNVIIEVEIKPNAKTSEIQGVEDGLLKVSINSPPVDGKANTEVIAFMAS 79
>gi|375107692|ref|ZP_09753953.1| hypothetical protein BurJ1DRAFT_4417 [Burkholderiales bacterium
JOSHI_001]
gi|374668423|gb|EHR73208.1| hypothetical protein BurJ1DRAFT_4417 [Burkholderiales bacterium
JOSHI_001]
Length = 124
Score = 38.1 bits (87), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 17/62 (27%), Positives = 36/62 (58%)
Query: 29 PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
P+ +R P++ + + PG+K + + D A+ +++ AP DG+ANAAL + ++ +
Sbjct: 15 PTWLRADGPAACVLDVSVVPGAKRSELAGLHDGALRIKLAAPPVDGKANAALEKLLAQWL 74
Query: 89 HV 90
+
Sbjct: 75 DL 76
>gi|224826207|ref|ZP_03699310.1| protein of unknown function DUF167 [Pseudogulbenkiania
ferrooxidans 2002]
gi|224601844|gb|EEG08024.1| protein of unknown function DUF167 [Pseudogulbenkiania
ferrooxidans 2002]
Length = 108
Score = 38.1 bits (87), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S + +T+H +PG+K + A+ +++ AP +G+ANA LL +++ V
Sbjct: 21 SVIRLTLHVQPGAKKTDLAGEHGGALKLRLAAPPVEGKANAMLLAWLAERFEV 73
>gi|67482694|ref|XP_656664.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473879|gb|EAL51278.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 118
Score = 38.1 bits (87), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S+V I + KP +K+ I V D + V I++P DG+AN ++ +M+S
Sbjct: 31 SNVIIEVEIKPNAKTSEIQGVEDGLLKVSINSPPVDGKANTEVIAFMAS 79
>gi|386390193|ref|ZP_10074989.1| TIGR00251 family protein [Haemophilus paraphrohaemolyticus HK411]
gi|385693877|gb|EIG24509.1| TIGR00251 family protein [Haemophilus paraphrohaemolyticus HK411]
Length = 98
Score = 38.1 bits (87), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 28/54 (51%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + I +P + I + D + + I +P DG ANA LL+Y+S L V
Sbjct: 10 PDGIRLRIFLQPKASRDQIIGLHDGELKIAITSPPVDGAANAHLLKYLSKLFKV 63
>gi|308456125|ref|XP_003090529.1| hypothetical protein CRE_03518 [Caenorhabditis remanei]
gi|308262652|gb|EFP06605.1| hypothetical protein CRE_03518 [Caenorhabditis remanei]
Length = 102
Score = 38.1 bits (87), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 16/56 (28%), Positives = 30/56 (53%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ + IHAKPG+K + + + + V I A ++G AN L+ Y+ + + + E
Sbjct: 32 IGLRIHAKPGAKKSCVVAIGESEIDVSIGAAPREGAANEELISYLMAALGLRKNEL 87
>gi|324527094|gb|ADY48748.1| Unknown [Ascaris suum]
Length = 181
Score = 38.1 bits (87), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 31/49 (63%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
+ + IHAKP +K +T++++ + V I AP G+AN AL + ++ ++
Sbjct: 90 ILLKIHAKPNAKISRVTEINETEIEVAIAAPPHKGQANEALTDAIAEIL 138
>gi|358394345|gb|EHK43738.1| hypothetical protein TRIATDRAFT_284504 [Trichoderma atroviride
IMI 206040]
Length = 120
Score = 37.7 bits (86), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 36 PPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + +H KPG+ I V++E + + + A AKDGEAN A++E +S + +
Sbjct: 19 PQGILQLRLHVKPGASKNREGIQAVTEETIELCVAAQAKDGEANQAVIEVLSEALDI 75
>gi|347538136|ref|YP_004845560.1| hypothetical protein NH8B_0297 [Pseudogulbenkiania sp. NH8B]
gi|345641313|dbj|BAK75146.1| protein of unknown function [Pseudogulbenkiania sp. NH8B]
Length = 97
Score = 37.7 bits (86), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S + +T+H +PG+K + A+ +++ AP +G+ANA LL +++ V
Sbjct: 10 SVIRLTLHVQPGAKKTDLAGEHGGALKLRLAAPPVEGKANAMLLAWLAERFEV 62
>gi|224119652|ref|XP_002318126.1| predicted protein [Populus trichocarpa]
gi|222858799|gb|EEE96346.1| predicted protein [Populus trichocarpa]
Length = 231
Score = 37.7 bits (86), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCITQLG-GLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 186
Query: 84 MSSLIHVSVGEF 95
M ++ + + +
Sbjct: 187 MGRVLGLRLSQM 198
>gi|336261944|ref|XP_003345758.1| hypothetical protein SMAC_05915 [Sordaria macrospora k-hell]
gi|380090094|emb|CCC12177.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 133
Score = 37.7 bits (86), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 36/54 (66%), Gaps = 2/54 (3%)
Query: 39 SVSITIHAKPGS--KSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++ I H KPG+ + +T ++DEAV + + A AK+GEAN ++++ +S +++
Sbjct: 29 TIYIHCHVKPGASKQREGVTCITDEAVEICVAAQAKEGEANKSVVKVLSEALNI 82
>gi|338813357|ref|ZP_08625486.1| hypothetical protein ALO_14457 [Acetonema longum DSM 6540]
gi|337274716|gb|EGO63224.1| hypothetical protein ALO_14457 [Acetonema longum DSM 6540]
Length = 108
Score = 37.7 bits (86), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 31/51 (60%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V++ I +P S ++ + ++V ++AP DGEAN AL++ +S + H+
Sbjct: 18 VTVKIKLQPRSSRSEVSGLCGDSVKACVNAPPADGEANQALIQLLSDVFHI 68
>gi|268567307|ref|XP_002639944.1| Hypothetical protein CBG10764 [Caenorhabditis briggsae]
Length = 258
Score = 37.7 bits (86), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 31/56 (55%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ + IHAKPG+K + +++ + V I A ++G AN L+ Y+ S + + E
Sbjct: 33 IGLRIHAKPGAKKSGVVAINESEIDVAIGAAPREGAANEELVSYLMSALGLRKNEL 88
>gi|428203544|ref|YP_007082133.1| hypothetical protein Ple7327_3360 [Pleurocapsa sp. PCC 7327]
gi|427980976|gb|AFY78576.1| hypothetical protein Ple7327_3360 [Pleurocapsa sp. PCC 7327]
Length = 73
Score = 37.7 bits (86), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 29/52 (55%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+ I + KP SK I + DE++ + + +P DG+AN L+E ++ V+
Sbjct: 1 MKIQVKVKPNSKQQRIEEADDESLFIYLKSPPIDGKANQELIELLAKKFKVT 52
>gi|407041472|gb|EKE40757.1| ACR, YggU family COG1872 protein, putative [Entamoeba nuttalli
P19]
Length = 118
Score = 37.7 bits (86), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 29/49 (59%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
S V I + KP +K+ I V D + V I++P DG+AN ++ +M+S
Sbjct: 31 SDVIIEVEIKPNAKTSEIQGVEDGLLKVSINSPPVDGKANTEVIAFMAS 79
>gi|396081665|gb|AFN83280.1| hypothetical protein EROM_070290 [Encephalitozoon romaleae
SJ-2008]
Length = 100
Score = 37.7 bits (86), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 13/51 (25%), Positives = 32/51 (62%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ +++ A+P ++S I ++S + V ++AP + +AN L+ ++S ++ V
Sbjct: 11 LRLSVRARPSARSTGIREISGTEIVVNVNAPPVENKANKELVRFISKMLRV 61
>gi|315633577|ref|ZP_07888867.1| conserved hypothetical protein [Aggregatibacter segnis ATCC
33393]
gi|315477619|gb|EFU68361.1| conserved hypothetical protein [Aggregatibacter segnis ATCC
33393]
Length = 97
Score = 37.7 bits (86), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 29/51 (56%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 11 LRLRIFLQPKAAKDQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61
>gi|209694133|ref|YP_002262061.1| hypothetical protein VSAL_I0540 [Aliivibrio salmonicida LFI1238]
gi|208008084|emb|CAQ78225.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238]
Length = 83
Score = 37.7 bits (86), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 30/53 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGE 94
+ ++ +P + I + E + V I AP DG+ANA L++Y S L V+ G+
Sbjct: 12 LRLYLQPKASRDQIVGIHGEELKVAITAPPVDGKANAHLIKYFSKLFKVAKGK 64
>gi|416893474|ref|ZP_11924662.1| hypothetical protein ATCC33389_1782 [Aggregatibacter aphrophilus
ATCC 33389]
gi|347814028|gb|EGY30680.1| hypothetical protein ATCC33389_1782 [Aggregatibacter aphrophilus
ATCC 33389]
Length = 97
Score = 37.7 bits (86), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + + I +P + I + D+ + + I AP DG+ANA LL+++S L V
Sbjct: 9 ADLRLRIFLQPKAAKDHIVGLHDDELKISITAPPIDGQANAHLLKFLSKLFKV 61
>gi|269837719|ref|YP_003319947.1| hypothetical protein Sthe_1691 [Sphaerobacter thermophilus DSM
20745]
gi|269786982|gb|ACZ39125.1| protein of unknown function DUF167 [Sphaerobacter thermophilus
DSM 20745]
Length = 102
Score = 37.4 bits (85), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 15/50 (30%), Positives = 30/50 (60%)
Query: 41 SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+T+ P + + V+D A+ V++ AP +G AN AL E++++L+ +
Sbjct: 15 QVTVRVTPRASRTQVDGVADGALRVRLAAPPVEGAANRALTEFLANLLRL 64
>gi|395236297|ref|ZP_10414494.1| hypothetical protein A936_21497 [Enterobacter sp. Ag1]
gi|394728928|gb|EJF28948.1| hypothetical protein A936_21497 [Enterobacter sp. Ag1]
Length = 99
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/60 (28%), Positives = 34/60 (56%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
P + + ++ +P + S+ + + + V I AP DG+ANA L++Y++ V+ G+ V
Sbjct: 8 PDGLVLRLYIQPKASRDSLVGMHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKGQVV 67
>gi|406936857|gb|EKD70483.1| hypothetical protein ACD_46C00523G0009 [uncultured bacterium]
Length = 99
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/51 (29%), Positives = 28/51 (54%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
V++ I AKP +K + V ++ + + + A +GEAN L+ Y++ L
Sbjct: 8 HQVTLCIFAKPHAKQTVLLKVDNDGLHITLHAKPHEGEANKELISYLAKLF 58
>gi|284039659|ref|YP_003389589.1| hypothetical protein Slin_4812 [Spirosoma linguale DSM 74]
gi|283818952|gb|ADB40790.1| protein of unknown function DUF167 [Spirosoma linguale DSM 74]
Length = 86
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 42 ITIH--AKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+T+H AKPGSK + + + +I APA+DG+ANA L+E+++
Sbjct: 1 MTLHLKAKPGSKIDQLFYDAAGQLNAKIRAPAQDGKANAYLIEFLA 46
>gi|254409972|ref|ZP_05023752.1| conserved hypothetical protein [Coleofasciculus chthonoplastes
PCC 7420]
gi|196183008|gb|EDX77992.1| conserved hypothetical protein [Coleofasciculus chthonoplastes
PCC 7420]
Length = 74
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 31/49 (63%)
Query: 43 TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
++ KP SK+ SI +++D + V + +P DG+AN L+E ++ +V+
Sbjct: 4 SVKVKPNSKTQSIEEMADGTLKVNLKSPPVDGKANKELIELLAEKFNVT 52
>gi|59711034|ref|YP_203810.1| hypothetical protein VF_0427 [Vibrio fischeri ES114]
gi|423685140|ref|ZP_17659948.1| hypothetical protein VFSR5_0414 [Vibrio fischeri SR5]
gi|59479135|gb|AAW84922.1| conserved protein [Vibrio fischeri ES114]
gi|371495641|gb|EHN71236.1| hypothetical protein VFSR5_0414 [Vibrio fischeri SR5]
Length = 95
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 30/54 (55%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ ++ +P + I + E + + I AP DG+ANA L++Y S L V+ G+
Sbjct: 12 LRLYLQPKASRDQIVGIHGEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGKI 65
>gi|195953751|ref|YP_002122041.1| hypothetical protein HY04AAS1_1378 [Hydrogenobaculum sp. Y04AAS1]
gi|226734129|sp|B4U5M3.1|Y1378_HYDS0 RecName: Full=UPF0235 protein HY04AAS1_1378
gi|195933363|gb|ACG58063.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1]
Length = 73
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 14/50 (28%), Positives = 31/50 (62%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+ + KP +K+ S+ + D+++ + I +P +G+AN L++ +S + VS
Sbjct: 3 LRVKVKPNAKTVSVEQLEDKSLKISIKSPPVNGKANEELIKVLSEFLKVS 52
>gi|342883630|gb|EGU84084.1| hypothetical protein FOXB_05382 [Fusarium oxysporum Fo5176]
Length = 117
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 8/69 (11%)
Query: 30 SCIRLV------PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALL 81
S IR V P SV + + KPG+ + V+D+A+ + + A A++GEAN A++
Sbjct: 5 SAIRFVTGTKKSPLGSVHLQLRVKPGASKNREGVIAVADDAIELCVSAQAREGEANKAVV 64
Query: 82 EYMSSLIHV 90
+ +S ++ V
Sbjct: 65 QVLSGILGV 73
>gi|197335333|ref|YP_002155183.1| hypothetical protein VFMJ11_0427 [Vibrio fischeri MJ11]
gi|197316823|gb|ACH66270.1| conserved hypothetical protein [Vibrio fischeri MJ11]
Length = 95
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 30/54 (55%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ ++ +P + I + E + + I AP DG+ANA L++Y S L V+ G+
Sbjct: 12 LRLYLQPKASRDQIVGIHGEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGKI 65
>gi|82701713|ref|YP_411279.1| hypothetical protein Nmul_A0579 [Nitrosospira multiformis ATCC
25196]
gi|82409778|gb|ABB73887.1| Protein of unknown function DUF167 [Nitrosospira multiformis ATCC
25196]
Length = 105
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 14/49 (28%), Positives = 30/49 (61%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+T+H +PG++ + +A+ +++ AP +G AN ALL +++ + V
Sbjct: 21 LTLHIQPGARRTEVVGSHGDALKIKLAAPPVEGAANVALLAFLAGVFGV 69
>gi|296242831|ref|YP_003650318.1| hypothetical protein Tagg_1097 [Thermosphaera aggregans DSM
11486]
gi|296095415|gb|ADG91366.1| protein of unknown function DUF167 [Thermosphaera aggregans DSM
11486]
Length = 111
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSV 92
V + + KPGS+ T SDE V + PA+ G ANA+L++Y+S + + V
Sbjct: 23 VILQVRVKPGSEPEGFTIESDELV-FRTSEPAERGRANASLIKYLSRELKIPV 74
>gi|261416137|ref|YP_003249820.1| hypothetical protein Fisuc_1745 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|385791007|ref|YP_005822130.1| hypothetical protein FSU_2243 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261372593|gb|ACX75338.1| protein of unknown function DUF167 [Fibrobacter succinogenes
subsp. succinogenes S85]
gi|302328014|gb|ADL27215.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 74
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
++I +HA+ SK S+T D + V++ AP DG ANAA+ E ++ HV
Sbjct: 3 INIKVHAR--SKRESVTPQPDGSYKVEVKAPPVDGAANAAICELIADYFHV 51
>gi|374301322|ref|YP_005052961.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
gi|332554258|gb|EGJ51302.1| UPF0235 protein yggU [Desulfovibrio africanus str. Walvis Bay]
Length = 109
Score = 37.0 bits (84), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/61 (26%), Positives = 31/61 (50%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSL 87
LP C+ + P + I +PG+ + + +++ AP D +AN AL+ Y++ L
Sbjct: 13 LPGCVISIEPGVWRLNIWVQPGANRNEPVGLYQDCCKIKLSAPPVDNKANKALVVYIAGL 72
Query: 88 I 88
+
Sbjct: 73 L 73
>gi|302916303|ref|XP_003051962.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI
77-13-4]
gi|256732901|gb|EEU46249.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI
77-13-4]
Length = 75
Score = 37.0 bits (84), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 36/54 (66%), Gaps = 2/54 (3%)
Query: 39 SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
S+ + +H KPG+ + V+D+A+ + + A A++GEAN A+++ +SS++ V
Sbjct: 2 SLHLQLHVKPGASKNREGVIAVTDDAIELCVAAQAREGEANKAVVQVLSSVLGV 55
>gi|196231015|ref|ZP_03129875.1| protein of unknown function DUF167 [Chthoniobacter flavus
Ellin428]
gi|196224845|gb|EDY19355.1| protein of unknown function DUF167 [Chthoniobacter flavus
Ellin428]
Length = 93
Score = 37.0 bits (84), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 15/49 (30%), Positives = 30/49 (61%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + P ++ + V +AV V++ APA DG+AN AL ++++ ++ V
Sbjct: 7 LRLRIVPNARRSEVVGVHGDAVKVKVQAPAMDGKANEALRDFLAEVLTV 55
>gi|257056705|ref|YP_003134537.1| hypothetical protein Svir_27290 [Saccharomonospora viridis DSM
43017]
gi|256586577|gb|ACU97710.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
43017]
Length = 118
Score = 37.0 bits (84), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSLIHVS 91
P S + I KPG+K ++ + D A+G V + APA DG+AN A+ ++ + V
Sbjct: 15 PMSELRFAIRVKPGAKRDAVGGIWDGALGEALVVSVRAPAVDGKANEAVCRVLAEALSVR 74
>gi|78357213|ref|YP_388662.1| hypothetical protein [Desulfovibrio alaskensis G20]
gi|78219618|gb|ABB38967.1| protein of unknown function DUF167 [Desulfovibrio alaskensis G20]
Length = 118
Score = 37.0 bits (84), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 41 SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + A+PG+K I + D V +++ APA D +AN L+ +++ L V
Sbjct: 16 RLKVWAQPGAKHSGIAGLYDGRVRIRLSAPAVDNKANKELIRFVAQLCGV 65
>gi|307261870|ref|ZP_07543532.1| hypothetical protein appser12_14270 [Actinobacillus
pleuropneumoniae serovar 12 str. 1096]
gi|306868417|gb|EFN00232.1| hypothetical protein appser12_14270 [Actinobacillus
pleuropneumoniae serovar 12 str. 1096]
Length = 97
Score = 37.0 bits (84), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 29/58 (50%)
Query: 33 RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
R+ P + + I +P + I + D + + I A DG ANA LL+Y+S L V
Sbjct: 6 RIENPYGIRLRIFLQPKASRDQIVGLHDNELKIAITALPVDGAANAHLLKYLSKLFKV 63
>gi|387773933|ref|ZP_10129215.1| TIGR00251 family protein [Haemophilus parahaemolyticus HK385]
gi|386903425|gb|EIJ68240.1| TIGR00251 family protein [Haemophilus parahaemolyticus HK385]
Length = 97
Score = 36.6 bits (83), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 27/51 (52%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + I +P + I + D + + I AP DG ANA LL+Y+S L V
Sbjct: 13 IRLRILLQPKASRDQIVGLHDGELKIAITAPPVDGAANAHLLKYLSKLFQV 63
>gi|157374368|ref|YP_001472968.1| hypothetical protein Ssed_1229 [Shewanella sediminis HAW-EB3]
gi|189038638|sp|A8FSL7.1|Y1229_SHESH RecName: Full=UPF0235 protein Ssed_1229
gi|157316742|gb|ABV35840.1| protein of unknown function DUF167 [Shewanella sediminis HAW-EB3]
Length = 95
Score = 36.6 bits (83), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 30/55 (54%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + I V E + + I AP DG+ANA L++Y+S V G+ V
Sbjct: 13 LNLYIQPKASRDQIVGVHGEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGDIV 67
>gi|354567643|ref|ZP_08986811.1| UPF0235 protein yggU [Fischerella sp. JSC-11]
gi|353542101|gb|EHC11565.1| UPF0235 protein yggU [Fischerella sp. JSC-11]
Length = 75
Score = 36.6 bits (83), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 29/51 (56%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ I + KP SK I + +D ++ V + +P DG+AN L++ +S HV
Sbjct: 1 MQIKVKVKPNSKMQKIEEAADRSLIVYLKSPPVDGKANEELIKVLSENFHV 51
>gi|325577506|ref|ZP_08147868.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae
ATCC 33392]
gi|325160610|gb|EGC72734.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae
ATCC 33392]
Length = 95
Score = 36.6 bits (83), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 29/54 (53%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P + + I +P + I + D+ + + I AP DG+ANA LL+++S V
Sbjct: 8 PEGLRLKIILQPKASKDQIVGLHDDELKITITAPPVDGQANAHLLKFLSKTFKV 61
>gi|53728760|ref|ZP_00135410.2| COG1872: Uncharacterized conserved protein [Actinobacillus
pleuropneumoniae serovar 1 str. 4074]
gi|126208844|ref|YP_001054069.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae
serovar 5b str. L20]
gi|303250781|ref|ZP_07336976.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae
serovar 6 str. Femo]
gi|303252447|ref|ZP_07338612.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae
serovar 2 str. 4226]
gi|307246303|ref|ZP_07528382.1| hypothetical protein appser1_15050 [Actinobacillus
pleuropneumoniae serovar 1 str. 4074]
gi|307248416|ref|ZP_07530437.1| hypothetical protein appser2_13900 [Actinobacillus
pleuropneumoniae serovar 2 str. S1536]
gi|307250644|ref|ZP_07532582.1| hypothetical protein appser4_14180 [Actinobacillus
pleuropneumoniae serovar 4 str. M62]
gi|307253024|ref|ZP_07534909.1| hypothetical protein appser6_15320 [Actinobacillus
pleuropneumoniae serovar 6 str. Femo]
gi|307255287|ref|ZP_07537100.1| hypothetical protein appser9_15200 [Actinobacillus
pleuropneumoniae serovar 9 str. CVJ13261]
gi|307257450|ref|ZP_07539217.1| hypothetical protein appser10_14450 [Actinobacillus
pleuropneumoniae serovar 10 str. D13039]
gi|307259722|ref|ZP_07541443.1| hypothetical protein appser11_15170 [Actinobacillus
pleuropneumoniae serovar 11 str. 56153]
gi|166200384|sp|A3N228.1|Y1380_ACTP2 RecName: Full=UPF0235 protein APL_1380
gi|126097636|gb|ABN74464.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae
serovar 5b str. L20]
gi|302648720|gb|EFL78911.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae
serovar 2 str. 4226]
gi|302650386|gb|EFL80547.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae
serovar 6 str. Femo]
gi|306852773|gb|EFM84999.1| hypothetical protein appser1_15050 [Actinobacillus
pleuropneumoniae serovar 1 str. 4074]
gi|306855058|gb|EFM87240.1| hypothetical protein appser2_13900 [Actinobacillus
pleuropneumoniae serovar 2 str. S1536]
gi|306857316|gb|EFM89434.1| hypothetical protein appser4_14180 [Actinobacillus
pleuropneumoniae serovar 4 str. M62]
gi|306859482|gb|EFM91510.1| hypothetical protein appser6_15320 [Actinobacillus
pleuropneumoniae serovar 6 str. Femo]
gi|306861736|gb|EFM93717.1| hypothetical protein appser9_15200 [Actinobacillus
pleuropneumoniae serovar 9 str. CVJ13261]
gi|306864030|gb|EFM95946.1| hypothetical protein appser10_14450 [Actinobacillus
pleuropneumoniae serovar 10 str. D13039]
gi|306866190|gb|EFM98057.1| hypothetical protein appser11_15170 [Actinobacillus
pleuropneumoniae serovar 11 str. 56153]
Length = 97
Score = 36.6 bits (83), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 29/58 (50%)
Query: 33 RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
R+ P + + I +P + I + D + + I A DG ANA LL+Y+S L V
Sbjct: 6 RIENPCGIRLRIFLQPKASRDQIVGLHDNELKIAITALPVDGAANAHLLKYLSKLFKV 63
>gi|345428817|ref|YP_004821933.1| hypothetical protein PARA_02320 [Haemophilus parainfluenzae T3T1]
gi|301154876|emb|CBW14339.1| conserved protein [Haemophilus parainfluenzae T3T1]
Length = 95
Score = 36.6 bits (83), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 28/49 (57%)
Query: 37 PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
P + + I +P + I + D+ + + I AP DG+ANA LL+++S
Sbjct: 8 PEGLRLKIILQPKASKDQIVGLHDDELKITITAPPVDGQANAHLLKFLS 56
>gi|417772443|ref|ZP_12420332.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|418679788|ref|ZP_13241045.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|418706859|ref|ZP_13267697.1| TIGR00251 family protein [Leptospira interrogans serovar
Hebdomadis str. R499]
gi|418716666|ref|ZP_13276629.1| TIGR00251 family protein [Leptospira interrogans str. UI 08452]
gi|418728749|ref|ZP_13287320.1| TIGR00251 family protein [Leptospira interrogans str. UI 12758]
gi|421118710|ref|ZP_15579045.1| TIGR00251 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|421127704|ref|ZP_15587925.1| TIGR00251 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|421134285|ref|ZP_15594426.1| TIGR00251 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|400328389|gb|EJO80621.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|409945814|gb|EKN95829.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|410009734|gb|EKO67890.1| TIGR00251 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|410021584|gb|EKO88368.1| TIGR00251 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410434763|gb|EKP83898.1| TIGR00251 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|410763436|gb|EKR34165.1| TIGR00251 family protein [Leptospira interrogans serovar
Hebdomadis str. R499]
gi|410776601|gb|EKR56578.1| TIGR00251 family protein [Leptospira interrogans str. UI 12758]
gi|410787437|gb|EKR81169.1| TIGR00251 family protein [Leptospira interrogans str. UI 08452]
gi|455668218|gb|EMF33463.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
str. Fox 32256]
Length = 73
Score = 36.2 bits (82), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/51 (29%), Positives = 28/51 (54%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ T++ KP SK D + + + PA +G+AN A++E++S + V
Sbjct: 1 MKFTVYVKPNSKKVFFRKEEDGVLTIAVREPALEGKANEAVIEFISKEMKV 51
>gi|113461793|ref|YP_719862.1| hypothetical protein HS_1657 [Haemophilus somnus 129PT]
gi|170718106|ref|YP_001785139.1| hypothetical protein HSM_1819 [Haemophilus somnus 2336]
gi|123132241|sp|Q0I525.1|Y1657_HAES1 RecName: Full=UPF0235 protein HS_1657
gi|226696075|sp|B0UWD6.1|Y1819_HAES2 RecName: Full=UPF0235 protein HSM_1819
gi|112823836|gb|ABI25925.1| conserved hypothetical protein [Haemophilus somnus 129PT]
gi|168826235|gb|ACA31606.1| protein of unknown function DUF167 [Haemophilus somnus 2336]
Length = 99
Score = 36.2 bits (82), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/58 (27%), Positives = 32/58 (55%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
++ + I +P + + + D A+ + I AP DG+ANA LL+++S V+ + +
Sbjct: 10 NLRLRIFLQPKASKDHLIGLYDNALKISITAPPIDGQANAHLLKFLSKTFKVAKSQII 67
>gi|374814495|ref|ZP_09718232.1| protein C15orf40 [Treponema primitia ZAS-1]
Length = 97
Score = 36.2 bits (82), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 35/58 (60%), Gaps = 1/58 (1%)
Query: 28 LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+ +C+R + +S+ + I A PGS I +S+ + ++I A +DG+ANA L +++
Sbjct: 1 MTNCLR-IAGASILLDIKAVPGSSKSQIAGLSENRLRIKIAAAPEDGKANAELRSFLA 57
>gi|417841364|ref|ZP_12487468.1| UPF0235 protein [Haemophilus haemolyticus M19501]
gi|417844664|ref|ZP_12490705.1| UPF0235 protein [Haemophilus haemolyticus M21639]
gi|341949402|gb|EGT76006.1| UPF0235 protein [Haemophilus haemolyticus M19501]
gi|341956623|gb|EGT83044.1| UPF0235 protein [Haemophilus haemolyticus M21639]
Length = 95
Score = 36.2 bits (82), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 27/46 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+ + I +P + I + D+ + + I AP DG+ANA LL+++S
Sbjct: 11 IRLRIFLQPKASKDHIAGIHDDELKITITAPPVDGQANAHLLKFLS 56
>gi|427715535|ref|YP_007063529.1| hypothetical protein Cal7507_0193 [Calothrix sp. PCC 7507]
gi|427347971|gb|AFY30695.1| UPF0235 protein yggU [Calothrix sp. PCC 7507]
Length = 75
Score = 36.2 bits (82), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 30/51 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ IT+ KP +K I + SD ++ V + +P DG+AN L++ ++ +V
Sbjct: 1 MQITVKVKPNAKQQKIEEQSDGSLNVLLKSPPVDGKANEELIKLLAEKFNV 51
>gi|417839599|ref|ZP_12485773.1| Hypothetical protein GG7_0792 [Haemophilus haemolyticus M19107]
gi|341952137|gb|EGT78675.1| Hypothetical protein GG7_0792 [Haemophilus haemolyticus M19107]
Length = 95
Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 27/46 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+ + I +P + I + D+ + + I AP DG+ANA LL+++S
Sbjct: 11 IRLRIFLQPKASKDHIAGIHDDELKITITAPPVDGQANAHLLKFLS 56
>gi|114778113|ref|ZP_01453000.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1]
gi|114551531|gb|EAU54085.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1]
Length = 101
Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 13/53 (24%), Positives = 35/53 (66%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVS 91
+ + IHA+PG++ ++ + +A+ + + A+DG+AN A++ +++ +++S
Sbjct: 13 GLYVNIHAQPGARKPALRGMHGDALKIAVAEAAQDGKANEAIVRFIADALNLS 65
>gi|404496024|ref|YP_006720130.1| hypothetical protein Gmet_1164 [Geobacter metallireducens GS-15]
gi|418066794|ref|ZP_12704152.1| protein of unknown function DUF167 [Geobacter metallireducens
RCH3]
gi|78193635|gb|ABB31402.1| protein of unknown function DUF167 [Geobacter metallireducens
GS-15]
gi|373559932|gb|EHP86211.1| protein of unknown function DUF167 [Geobacter metallireducens
RCH3]
Length = 102
Score = 35.8 bits (81), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 30/51 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V+ ++H +P + I + +A+ +++ +P +GEAN EY++ L+ V
Sbjct: 19 VTFSVHVQPRASKNGICGIQGDAIKLRLTSPPVEGEANRLCTEYLAKLLKV 69
>gi|428215927|ref|YP_007089071.1| hypothetical protein Oscil6304_5676 [Oscillatoria acuminata PCC
6304]
gi|428004308|gb|AFY85151.1| hypothetical protein Oscil6304_5676 [Oscillatoria acuminata PCC
6304]
Length = 73
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 44 IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
I KP SK IT+ D ++ +++ +P DG+ANA L++ ++ V
Sbjct: 6 IKVKPNSKQQQITEEPDGSLTIRLKSPPVDGKANAELIKLLADKFEV 52
>gi|406981760|gb|EKE03161.1| hypothetical protein ACD_20C00238G0001 [uncultured bacterium]
Length = 103
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 29/51 (56%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ I++ P S C I D+ + +++D P +G+AN ++++S L+ V
Sbjct: 18 IKISVKVIPNSSRCEIAGTIDDILRIKLDVPPIEGKANEKCVKFLSKLLGV 68
>gi|332530697|ref|ZP_08406628.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624]
gi|332039865|gb|EGI76260.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624]
Length = 115
Score = 35.4 bits (80), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 29/52 (55%)
Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
SV + +H P + I + D A+ V++ AP DG+AN AL +++ + +
Sbjct: 22 SVLVDLHVMPNASRTQIQGLFDGALKVRLQAPPVDGKANEALRVWLAKTLSI 73
>gi|429857874|gb|ELA32714.1| duf167 domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 116
Score = 35.4 bits (80), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 36 PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI 88
P ++ + H KPG+ +T V+D V + + A A++GEAN A+++ +S ++
Sbjct: 16 PLGTLYLQCHVKPGASRVREGVTAVTDATVELCVSAQAREGEANKAVVKLLSEVL 70
>gi|302038855|ref|YP_003799177.1| hypothetical protein NIDE3568 [Candidatus Nitrospira defluvii]
gi|300606919|emb|CBK43252.1| conserved protein of unknown function DUF167 [Candidatus
Nitrospira defluvii]
Length = 105
Score = 35.4 bits (80), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 27/51 (52%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
V+I++H +P + + AV ++I AP DG ANA L +++ V
Sbjct: 20 VTISVHVQPKASRSECAGLHGHAVKIRIAAPPADGAANAELCRFLARCCEV 70
>gi|152979630|ref|YP_001345259.1| hypothetical protein Asuc_1977 [Actinobacillus succinogenes 130Z]
gi|171704390|sp|A6VQS7.1|Y1977_ACTSZ RecName: Full=UPF0235 protein Asuc_1977
gi|150841353|gb|ABR75324.1| protein of unknown function DUF167 [Actinobacillus succinogenes
130Z]
Length = 98
Score = 35.4 bits (80), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ + I +P + +I + DE + + I AP DG ANA L++Y+S V
Sbjct: 11 IRLRIMLQPKASKDAIIGLHDEELKISITAPPVDGAANAHLIKYLSKAFKV 61
>gi|170725690|ref|YP_001759716.1| hypothetical protein Swoo_1329 [Shewanella woodyi ATCC 51908]
gi|226695928|sp|B1KIX3.1|Y1329_SHEWM RecName: Full=UPF0235 protein Swoo_1329
gi|169811037|gb|ACA85621.1| protein of unknown function DUF167 [Shewanella woodyi ATCC 51908]
Length = 95
Score = 35.4 bits (80), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 29/54 (53%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEF 95
+ ++ +P + I V E + + I AP DG+ANA L++Y+S V G+
Sbjct: 13 LNLYIQPKASRDQIVGVHGEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGDI 66
>gi|336314737|ref|ZP_08569653.1| hypothetical protein Rhein_1022 [Rheinheimera sp. A13L]
gi|335881036|gb|EGM78919.1| hypothetical protein Rhein_1022 [Rheinheimera sp. A13L]
Length = 97
Score = 35.4 bits (80), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 14/46 (30%), Positives = 27/46 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
+ + +H +PG+ + +A+ V I AP DG+ANA L ++++
Sbjct: 10 ILLQLHVQPGASRDQFLGLHGDAIKVAIKAPPVDGKANAHLQQFLA 55
>gi|224133940|ref|XP_002321697.1| predicted protein [Populus trichocarpa]
gi|222868693|gb|EEF05824.1| predicted protein [Populus trichocarpa]
Length = 231
Score = 35.4 bits (80), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 24 NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
D +P CI + V + I + ++ +IT V+ + V V + APA GEAN LLE+
Sbjct: 128 QDAPVPPCISQLG-GLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 186
Query: 84 MSSLIHVSVGEF 95
+ ++ + + +
Sbjct: 187 VGKVLGLKLSQM 198
>gi|333990437|ref|YP_004523051.1| hypothetical protein JDM601_1797 [Mycobacterium sp. JDM601]
gi|333486405|gb|AEF35797.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length = 78
Score = 35.4 bits (80), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSLIHV 90
S ++ + KPGS+ + + D+ + V + PA DG+ANAA++ ++ V
Sbjct: 3 QSETVVVKVKPGSRKGPLVETDDDGQLTVYVREPAVDGKANAAVIRLLAQYFEV 56
>gi|39995970|ref|NP_951921.1| hypothetical protein GSU0864 [Geobacter sulfurreducens PCA]
gi|409911415|ref|YP_006889880.1| hypothetical protein KN400_0845 [Geobacter sulfurreducens KN400]
gi|39982735|gb|AAR34194.1| protein of unknown function DUF167 [Geobacter sulfurreducens PCA]
gi|307634758|gb|ADI83708.2| protein of unknown function DUF167 [Geobacter sulfurreducens
KN400]
Length = 107
Score = 35.4 bits (80), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 14/53 (26%), Positives = 31/53 (58%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ V+ ++H +P + I V EA+ +++ +P +GEAN +E+++ + V
Sbjct: 18 NGVTFSVHVQPRASRNEICGVQGEAIKLRLTSPPVEGEANRLCVEFLAKRLGV 70
>gi|262172428|ref|ZP_06040106.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451]
gi|261893504|gb|EEY39490.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451]
Length = 97
Score = 35.0 bits (79), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|408793795|ref|ZP_11205401.1| TIGR00251 family protein [Leptospira meyeri serovar Hardjo str.
Went 5]
gi|408462299|gb|EKJ86028.1| TIGR00251 family protein [Leptospira meyeri serovar Hardjo str.
Went 5]
Length = 75
Score = 35.0 bits (79), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 29/51 (56%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ +T+ KP +K + +S+ V++ +P DG+AN L+E +S HV
Sbjct: 1 MKLTVKVKPNNKQPGLEFLSETDCIVRLKSPPVDGKANEELIEVLSKHFHV 51
>gi|422921712|ref|ZP_16954922.1| hypothetical protein VCBJG01_0465 [Vibrio cholerae BJG-01]
gi|341648215|gb|EGS72280.1| hypothetical protein VCBJG01_0465 [Vibrio cholerae BJG-01]
Length = 96
Score = 35.0 bits (79), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|339484001|ref|YP_004695787.1| hypothetical protein Nit79A3_2621 [Nitrosomonas sp. Is79A3]
gi|338806146|gb|AEJ02388.1| UPF0235 protein yggU [Nitrosomonas sp. Is79A3]
Length = 98
Score = 35.0 bits (79), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 16/67 (23%), Positives = 39/67 (58%)
Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIH 89
S R +++ +T+H + G+K+ + +A+ +++ A +G+ANAALL++++
Sbjct: 2 SWYRYDDANNLVLTLHIQTGAKNTEAAGLHGDALRIKLAAAPVEGKANAALLKFLAKHFD 61
Query: 90 VSVGEFV 96
V + + +
Sbjct: 62 VPLSQVI 68
>gi|229530300|ref|ZP_04419688.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)]
gi|384423759|ref|YP_005633117.1| hypothetical protein VCLMA_A0416 [Vibrio cholerae LMA3984-4]
gi|229332073|gb|EEN97561.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)]
gi|327483312|gb|AEA77719.1| UPF0235 protein [Vibrio cholerae LMA3984-4]
Length = 96
Score = 35.0 bits (79), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|320352454|ref|YP_004193793.1| hypothetical protein Despr_0318 [Desulfobulbus propionicus DSM
2032]
gi|320120956|gb|ADW16502.1| protein of unknown function DUF167 [Desulfobulbus propionicus DSM
2032]
Length = 110
Score = 35.0 bits (79), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 12/60 (20%), Positives = 34/60 (56%)
Query: 31 CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
C++ + S+ + + +P + + + + + + +++ P DG+AN A++ Y++ L H+
Sbjct: 10 CLQRLADGSLLLRLQVQPRAAANHLAGLQGDMLKLRVTTPPVDGKANQAVVAYLAKLFHL 69
>gi|147674564|ref|YP_001215984.1| hypothetical protein VC0395_A0010 [Vibrio cholerae O395]
gi|153216287|ref|ZP_01950380.1| conserved hypothetical protein [Vibrio cholerae 1587]
gi|227116861|ref|YP_002818757.1| hypothetical protein VC395_0502 [Vibrio cholerae O395]
gi|262167148|ref|ZP_06034862.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27]
gi|424589734|ref|ZP_18029181.1| hypothetical protein VCCP103710_0503 [Vibrio cholerae CP1037(10)]
gi|172047756|sp|A5F9H9.1|Y1210_VIBC3 RecName: Full=UPF0235 protein VC0395_A0010/VC395_0502
gi|124114376|gb|EAY33196.1| conserved hypothetical protein [Vibrio cholerae 1587]
gi|146316447|gb|ABQ20986.1| conserved hypothetical protein [Vibrio cholerae O395]
gi|227012311|gb|ACP08521.1| conserved hypothetical protein [Vibrio cholerae O395]
gi|262024448|gb|EEY43135.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27]
gi|408036533|gb|EKG72959.1| hypothetical protein VCCP103710_0503 [Vibrio cholerae CP1037(10)]
Length = 96
Score = 35.0 bits (79), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|424658356|ref|ZP_18095613.1| hypothetical protein VCHE16_0509 [Vibrio cholerae HE-16]
gi|408055246|gb|EKG90184.1| hypothetical protein VCHE16_0509 [Vibrio cholerae HE-16]
Length = 96
Score = 35.0 bits (79), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|153828377|ref|ZP_01981044.1| conserved hypothetical protein [Vibrio cholerae 623-39]
gi|229519844|ref|ZP_04409278.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80]
gi|417819343|ref|ZP_12465960.1| hypothetical protein VCHE39_0818 [Vibrio cholerae HE39]
gi|417823596|ref|ZP_12470188.1| hypothetical protein VCHE48_1515 [Vibrio cholerae HE48]
gi|419829034|ref|ZP_14352523.1| hypothetical protein VCHC1A2_1398 [Vibrio cholerae HC-1A2]
gi|419831815|ref|ZP_14355282.1| hypothetical protein VCHC61A2_0442 [Vibrio cholerae HC-61A2]
gi|421353281|ref|ZP_15803615.1| hypothetical protein VCHE45_0604 [Vibrio cholerae HE-45]
gi|422305891|ref|ZP_16393078.1| hypothetical protein VCCP1035_0510 [Vibrio cholerae CP1035(8)]
gi|422908995|ref|ZP_16943647.1| hypothetical protein VCHE09_0473 [Vibrio cholerae HE-09]
gi|422916199|ref|ZP_16950540.1| hypothetical protein VCHC02A1_0505 [Vibrio cholerae HC-02A1]
gi|423816157|ref|ZP_17715143.1| hypothetical protein VCHC55C2_0515 [Vibrio cholerae HC-55C2]
gi|423848221|ref|ZP_17718930.1| hypothetical protein VCHC59A1_0557 [Vibrio cholerae HC-59A1]
gi|423878800|ref|ZP_17722538.1| hypothetical protein VCHC60A1_0507 [Vibrio cholerae HC-60A1]
gi|423946497|ref|ZP_17733405.1| hypothetical protein VCHE40_0475 [Vibrio cholerae HE-40]
gi|423975656|ref|ZP_17736954.1| hypothetical protein VCHE46_0479 [Vibrio cholerae HE-46]
gi|423996620|ref|ZP_17739886.1| hypothetical protein VCHC02C1_0516 [Vibrio cholerae HC-02C1]
gi|424015317|ref|ZP_17755167.1| hypothetical protein VCHC55B2_0506 [Vibrio cholerae HC-55B2]
gi|424018431|ref|ZP_17758233.1| hypothetical protein VCHC59B1_0515 [Vibrio cholerae HC-59B1]
gi|424623803|ref|ZP_18062283.1| hypothetical protein VCHC50A1_0509 [Vibrio cholerae HC-50A1]
gi|424628378|ref|ZP_18066687.1| hypothetical protein VCHC51A1_0505 [Vibrio cholerae HC-51A1]
gi|424632334|ref|ZP_18070453.1| hypothetical protein VCHC52A1_0510 [Vibrio cholerae HC-52A1]
gi|424635421|ref|ZP_18073445.1| hypothetical protein VCHC55A1_0514 [Vibrio cholerae HC-55A1]
gi|424639212|ref|ZP_18077112.1| hypothetical protein VCHC56A1_0478 [Vibrio cholerae HC-56A1]
gi|424647495|ref|ZP_18085175.1| hypothetical protein VCHC57A1_0508 [Vibrio cholerae HC-57A1]
gi|443526351|ref|ZP_21092436.1| hypothetical protein VCHC78A1_00498 [Vibrio cholerae HC-78A1]
gi|148876086|gb|EDL74221.1| conserved hypothetical protein [Vibrio cholerae 623-39]
gi|229343132|gb|EEO08116.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80]
gi|340041199|gb|EGR02166.1| hypothetical protein VCHE39_0818 [Vibrio cholerae HE39]
gi|340048225|gb|EGR09147.1| hypothetical protein VCHE48_1515 [Vibrio cholerae HE48]
gi|341636077|gb|EGS60780.1| hypothetical protein VCHE09_0473 [Vibrio cholerae HE-09]
gi|341640719|gb|EGS65298.1| hypothetical protein VCHC02A1_0505 [Vibrio cholerae HC-02A1]
gi|395955054|gb|EJH65659.1| hypothetical protein VCHE45_0604 [Vibrio cholerae HE-45]
gi|408016251|gb|EKG53804.1| hypothetical protein VCHC50A1_0509 [Vibrio cholerae HC-50A1]
gi|408021366|gb|EKG58623.1| hypothetical protein VCHC52A1_0510 [Vibrio cholerae HC-52A1]
gi|408027591|gb|EKG64554.1| hypothetical protein VCHC56A1_0478 [Vibrio cholerae HC-56A1]
gi|408027669|gb|EKG64628.1| hypothetical protein VCHC55A1_0514 [Vibrio cholerae HC-55A1]
gi|408037290|gb|EKG73689.1| hypothetical protein VCHC57A1_0508 [Vibrio cholerae HC-57A1]
gi|408059200|gb|EKG93972.1| hypothetical protein VCHC51A1_0505 [Vibrio cholerae HC-51A1]
gi|408622223|gb|EKK95211.1| hypothetical protein VCHC1A2_1398 [Vibrio cholerae HC-1A2]
gi|408627891|gb|EKL00681.1| hypothetical protein VCCP1035_0510 [Vibrio cholerae CP1035(8)]
gi|408636828|gb|EKL08950.1| hypothetical protein VCHC55C2_0515 [Vibrio cholerae HC-55C2]
gi|408644094|gb|EKL15800.1| hypothetical protein VCHC60A1_0507 [Vibrio cholerae HC-60A1]
gi|408645206|gb|EKL16867.1| hypothetical protein VCHC59A1_0557 [Vibrio cholerae HC-59A1]
gi|408652222|gb|EKL23447.1| hypothetical protein VCHC61A2_0442 [Vibrio cholerae HC-61A2]
gi|408662087|gb|EKL33059.1| hypothetical protein VCHE40_0475 [Vibrio cholerae HE-40]
gi|408666193|gb|EKL36990.1| hypothetical protein VCHE46_0479 [Vibrio cholerae HE-46]
gi|408854525|gb|EKL94278.1| hypothetical protein VCHC02C1_0516 [Vibrio cholerae HC-02C1]
gi|408862102|gb|EKM01651.1| hypothetical protein VCHC55B2_0506 [Vibrio cholerae HC-55B2]
gi|408869977|gb|EKM09259.1| hypothetical protein VCHC59B1_0515 [Vibrio cholerae HC-59B1]
gi|443455344|gb|ELT19126.1| hypothetical protein VCHC78A1_00498 [Vibrio cholerae HC-78A1]
Length = 96
Score = 35.0 bits (79), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|367031698|ref|XP_003665132.1| hypothetical protein MYCTH_2308510 [Myceliophthora thermophila
ATCC 42464]
gi|347012403|gb|AEO59887.1| hypothetical protein MYCTH_2308510 [Myceliophthora thermophila
ATCC 42464]
Length = 119
Score = 35.0 bits (79), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 45 HAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
+ KPG+ IT V+DEAV + + A A++GEAN A++ +S ++ +
Sbjct: 25 NVKPGASKNREGITSVNDEAVEICVAAQAREGEANKAVIRVLSEVLDL 72
>gi|121590704|ref|ZP_01678036.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|153819156|ref|ZP_01971823.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
gi|153826562|ref|ZP_01979229.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
gi|227080668|ref|YP_002809219.1| hypothetical protein VCM66_0443 [Vibrio cholerae M66-2]
gi|229507096|ref|ZP_04396602.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286]
gi|298501011|ref|ZP_07010812.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|421350320|ref|ZP_15800686.1| hypothetical protein VCHE25_1531 [Vibrio cholerae HE-25]
gi|254803911|sp|C3LRX5.1|Y443_VIBCM RecName: Full=UPF0235 protein VCM66_0443
gi|121547435|gb|EAX57544.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|126510301|gb|EAZ72895.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
gi|149739654|gb|EDM53868.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
gi|227008556|gb|ACP04768.1| conserved hypothetical protein [Vibrio cholerae M66-2]
gi|229355841|gb|EEO20761.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286]
gi|297540259|gb|EFH76319.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|395954442|gb|EJH65052.1| hypothetical protein VCHE25_1531 [Vibrio cholerae HE-25]
Length = 96
Score = 35.0 bits (79), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|15640485|ref|NP_230112.1| hypothetical protein VC0458 [Vibrio cholerae O1 biovar El Tor
str. N16961]
gi|153823175|ref|ZP_01975842.1| conserved hypothetical protein [Vibrio cholerae B33]
gi|229509068|ref|ZP_04398556.1| hypothetical protein VCE_000471 [Vibrio cholerae B33]
gi|229519736|ref|ZP_04409179.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9]
gi|229606248|ref|YP_002876896.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236]
gi|254850689|ref|ZP_05240039.1| conserved hypothetical protein [Vibrio cholerae MO10]
gi|255744295|ref|ZP_05418248.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101]
gi|262147280|ref|ZP_06028079.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1]
gi|360037099|ref|YP_004938862.1| hypothetical protein Vch1786_I2779 [Vibrio cholerae O1 str.
2010EL-1786]
gi|379740331|ref|YP_005332300.1| hypothetical protein O3Y_02125 [Vibrio cholerae IEC224]
gi|417812443|ref|ZP_12459103.1| hypothetical protein VCHC49A2_1432 [Vibrio cholerae HC-49A2]
gi|417815305|ref|ZP_12461939.1| hypothetical protein VCHCUF01_0533 [Vibrio cholerae HCUF01]
gi|418331166|ref|ZP_12942116.1| hypothetical protein VCHC06A1_0498 [Vibrio cholerae HC-06A1]
gi|418336323|ref|ZP_12945222.1| hypothetical protein VCHC23A1_0655 [Vibrio cholerae HC-23A1]
gi|418342703|ref|ZP_12949503.1| hypothetical protein VCHC28A1_0498 [Vibrio cholerae HC-28A1]
gi|418347867|ref|ZP_12952603.1| hypothetical protein VCHC43A1_0506 [Vibrio cholerae HC-43A1]
gi|418354310|ref|ZP_12957034.1| hypothetical protein VCHC61A1_1305 [Vibrio cholerae HC-61A1]
gi|419824929|ref|ZP_14348436.1| hypothetical protein VCCP10336_0523 [Vibrio cholerae CP1033(6)]
gi|421315748|ref|ZP_15766320.1| hypothetical protein VCCP10325_0510 [Vibrio cholerae CP1032(5)]
gi|421319246|ref|ZP_15769805.1| hypothetical protein VCCP103811_0499 [Vibrio cholerae CP1038(11)]
gi|421323279|ref|ZP_15773808.1| hypothetical protein VCCP104114_0473 [Vibrio cholerae CP1041(14)]
gi|421327685|ref|ZP_15778201.1| hypothetical protein VCCP104215_1439 [Vibrio cholerae CP1042(15)]
gi|421330686|ref|ZP_15781168.1| hypothetical protein VCCP104619_0534 [Vibrio cholerae CP1046(19)]
gi|421334284|ref|ZP_15784754.1| hypothetical protein VCCP104821_0431 [Vibrio cholerae CP1048(21)]
gi|421338181|ref|ZP_15788620.1| hypothetical protein VCHC20A2_0516 [Vibrio cholerae HC-20A2]
gi|421345745|ref|ZP_15796130.1| hypothetical protein VCHC46A1_0527 [Vibrio cholerae HC-46A1]
gi|422890498|ref|ZP_16932923.1| hypothetical protein VCHC40A1_0477 [Vibrio cholerae HC-40A1]
gi|422901298|ref|ZP_16936676.1| hypothetical protein VCHC48A1_0481 [Vibrio cholerae HC-48A1]
gi|422905481|ref|ZP_16940339.1| hypothetical protein VCHC70A1_0498 [Vibrio cholerae HC-70A1]
gi|422912202|ref|ZP_16946732.1| hypothetical protein VCHFU02_0496 [Vibrio cholerae HFU-02]
gi|422924681|ref|ZP_16957719.1| hypothetical protein VCHC38A1_0501 [Vibrio cholerae HC-38A1]
gi|423143727|ref|ZP_17131345.1| hypothetical protein VCHC19A1_0496 [Vibrio cholerae HC-19A1]
gi|423148711|ref|ZP_17136072.1| hypothetical protein VCHC21A1_0503 [Vibrio cholerae HC-21A1]
gi|423152502|ref|ZP_17139704.1| hypothetical protein VCHC22A1_0485 [Vibrio cholerae HC-22A1]
gi|423155286|ref|ZP_17142425.1| hypothetical protein VCHC32A1_0499 [Vibrio cholerae HC-32A1]
gi|423159145|ref|ZP_17146119.1| hypothetical protein VCHC33A2_0485 [Vibrio cholerae HC-33A2]
gi|423163823|ref|ZP_17150619.1| hypothetical protein VCHC48B2_0476 [Vibrio cholerae HC-48B2]
gi|423729844|ref|ZP_17703165.1| hypothetical protein VCHC17A1_0514 [Vibrio cholerae HC-17A1]
gi|423747070|ref|ZP_17711352.1| hypothetical protein VCHC50A2_0521 [Vibrio cholerae HC-50A2]
gi|423891674|ref|ZP_17725366.1| hypothetical protein VCHC62A1_0496 [Vibrio cholerae HC-62A1]
gi|423926451|ref|ZP_17729983.1| hypothetical protein VCHC77A1_0502 [Vibrio cholerae HC-77A1]
gi|424001006|ref|ZP_17744099.1| hypothetical protein VCHC17A2_0504 [Vibrio cholerae HC-17A2]
gi|424005166|ref|ZP_17748154.1| hypothetical protein VCHC37A1_0630 [Vibrio cholerae HC-37A1]
gi|424022959|ref|ZP_17762626.1| hypothetical protein VCHC62B1_0497 [Vibrio cholerae HC-62B1]
gi|424025977|ref|ZP_17765597.1| hypothetical protein VCHC69A1_0496 [Vibrio cholerae HC-69A1]
gi|424585359|ref|ZP_18024955.1| hypothetical protein VCCP10303_0505 [Vibrio cholerae CP1030(3)]
gi|424593979|ref|ZP_18033322.1| hypothetical protein VCCP1040_0502 [Vibrio cholerae CP1040(13)]
gi|424597915|ref|ZP_18037117.1| hypothetical protein VCCP104417_0506 [Vibrio Cholerae CP1044(17)]
gi|424600679|ref|ZP_18039838.1| hypothetical protein VCCP1047_0494 [Vibrio cholerae CP1047(20)]
gi|424605596|ref|ZP_18044563.1| hypothetical protein VCCP1050_0509 [Vibrio cholerae CP1050(23)]
gi|424609311|ref|ZP_18048174.1| hypothetical protein VCHC39A1_0495 [Vibrio cholerae HC-39A1]
gi|424612231|ref|ZP_18051042.1| hypothetical protein VCHC41A1_0512 [Vibrio cholerae HC-41A1]
gi|424616108|ref|ZP_18054803.1| hypothetical protein VCHC42A1_0500 [Vibrio cholerae HC-42A1]
gi|424620871|ref|ZP_18059402.1| hypothetical protein VCHC47A1_0517 [Vibrio cholerae HC-47A1]
gi|424643688|ref|ZP_18081446.1| hypothetical protein VCHC56A2_0509 [Vibrio cholerae HC-56A2]
gi|424651610|ref|ZP_18089138.1| hypothetical protein VCHC57A2_0503 [Vibrio cholerae HC-57A2]
gi|424655558|ref|ZP_18092864.1| hypothetical protein VCHC81A2_0506 [Vibrio cholerae HC-81A2]
gi|440708664|ref|ZP_20889325.1| hypothetical protein VC4260B_00700 [Vibrio cholerae 4260B]
gi|443502508|ref|ZP_21069500.1| hypothetical protein VCHC64A1_00491 [Vibrio cholerae HC-64A1]
gi|443506416|ref|ZP_21073213.1| hypothetical protein VCHC65A1_00495 [Vibrio cholerae HC-65A1]
gi|443510250|ref|ZP_21076922.1| hypothetical protein VCHC67A1_00494 [Vibrio cholerae HC-67A1]
gi|443514087|ref|ZP_21080631.1| hypothetical protein VCHC68A1_00490 [Vibrio cholerae HC-68A1]
gi|443517900|ref|ZP_21084322.1| hypothetical protein VCHC71A1_00490 [Vibrio cholerae HC-71A1]
gi|443522482|ref|ZP_21088732.1| hypothetical protein VCHC72A2_00493 [Vibrio cholerae HC-72A2]
gi|443530386|ref|ZP_21096402.1| hypothetical protein VCHC7A1_01504 [Vibrio cholerae HC-7A1]
gi|443534158|ref|ZP_21100077.1| hypothetical protein VCHC80A1_00463 [Vibrio cholerae HC-80A1]
gi|443537740|ref|ZP_21103597.1| hypothetical protein VCHC81A1_01284 [Vibrio cholerae HC-81A1]
gi|449054312|ref|ZP_21732980.1| Hypothetical protein B839_03460 [Vibrio cholerae O1 str. Inaba
G4222]
gi|29839647|sp|Q9KUQ7.1|Y458_VIBCH RecName: Full=UPF0235 protein VC_0458
gi|9654883|gb|AAF93631.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor
str. N16961]
gi|126519301|gb|EAZ76524.1| conserved hypothetical protein [Vibrio cholerae B33]
gi|229344425|gb|EEO09400.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9]
gi|229353993|gb|EEO18927.1| hypothetical protein VCE_000471 [Vibrio cholerae B33]
gi|229368903|gb|ACQ59326.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236]
gi|254846394|gb|EET24808.1| conserved hypothetical protein [Vibrio cholerae MO10]
gi|255738235|gb|EET93627.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101]
gi|262031274|gb|EEY49889.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1]
gi|340043291|gb|EGR04250.1| hypothetical protein VCHCUF01_0533 [Vibrio cholerae HCUF01]
gi|340043823|gb|EGR04780.1| hypothetical protein VCHC49A2_1432 [Vibrio cholerae HC-49A2]
gi|341625813|gb|EGS51240.1| hypothetical protein VCHC70A1_0498 [Vibrio cholerae HC-70A1]
gi|341627186|gb|EGS52512.1| hypothetical protein VCHC48A1_0481 [Vibrio cholerae HC-48A1]
gi|341627511|gb|EGS52814.1| hypothetical protein VCHC40A1_0477 [Vibrio cholerae HC-40A1]
gi|341641138|gb|EGS65697.1| hypothetical protein VCHFU02_0496 [Vibrio cholerae HFU-02]
gi|341648634|gb|EGS72678.1| hypothetical protein VCHC38A1_0501 [Vibrio cholerae HC-38A1]
gi|356421656|gb|EHH75150.1| hypothetical protein VCHC06A1_0498 [Vibrio cholerae HC-06A1]
gi|356422010|gb|EHH75497.1| hypothetical protein VCHC21A1_0503 [Vibrio cholerae HC-21A1]
gi|356426926|gb|EHH80209.1| hypothetical protein VCHC19A1_0496 [Vibrio cholerae HC-19A1]
gi|356433104|gb|EHH86297.1| hypothetical protein VCHC23A1_0655 [Vibrio cholerae HC-23A1]
gi|356434758|gb|EHH87928.1| hypothetical protein VCHC22A1_0485 [Vibrio cholerae HC-22A1]
gi|356438061|gb|EHH91117.1| hypothetical protein VCHC28A1_0498 [Vibrio cholerae HC-28A1]
gi|356443244|gb|EHH96067.1| hypothetical protein VCHC32A1_0499 [Vibrio cholerae HC-32A1]
gi|356447978|gb|EHI00763.1| hypothetical protein VCHC43A1_0506 [Vibrio cholerae HC-43A1]
gi|356450404|gb|EHI03126.1| hypothetical protein VCHC33A2_0485 [Vibrio cholerae HC-33A2]
gi|356454086|gb|EHI06741.1| hypothetical protein VCHC61A1_1305 [Vibrio cholerae HC-61A1]
gi|356456470|gb|EHI09069.1| hypothetical protein VCHC48B2_0476 [Vibrio cholerae HC-48B2]
gi|356648253|gb|AET28308.1| hypothetical protein Vch1786_I2779 [Vibrio cholerae O1 str.
2010EL-1786]
gi|378793841|gb|AFC57312.1| hypothetical protein O3Y_02125 [Vibrio cholerae IEC224]
gi|395922489|gb|EJH33305.1| hypothetical protein VCCP10325_0510 [Vibrio cholerae CP1032(5)]
gi|395923124|gb|EJH33936.1| hypothetical protein VCCP104114_0473 [Vibrio cholerae CP1041(14)]
gi|395925571|gb|EJH36368.1| hypothetical protein VCCP103811_0499 [Vibrio cholerae CP1038(11)]
gi|395931419|gb|EJH42164.1| hypothetical protein VCCP104215_1439 [Vibrio cholerae CP1042(15)]
gi|395934539|gb|EJH45277.1| hypothetical protein VCCP104619_0534 [Vibrio cholerae CP1046(19)]
gi|395937814|gb|EJH48525.1| hypothetical protein VCCP104821_0431 [Vibrio cholerae CP1048(21)]
gi|395946544|gb|EJH57207.1| hypothetical protein VCHC20A2_0516 [Vibrio cholerae HC-20A2]
gi|395948414|gb|EJH59064.1| hypothetical protein VCHC46A1_0527 [Vibrio cholerae HC-46A1]
gi|395964018|gb|EJH74261.1| hypothetical protein VCHC57A2_0503 [Vibrio cholerae HC-57A2]
gi|395964105|gb|EJH74347.1| hypothetical protein VCHC56A2_0509 [Vibrio cholerae HC-56A2]
gi|395967073|gb|EJH77176.1| hypothetical protein VCHC42A1_0500 [Vibrio cholerae HC-42A1]
gi|395975687|gb|EJH85168.1| hypothetical protein VCHC47A1_0517 [Vibrio cholerae HC-47A1]
gi|395977759|gb|EJH87158.1| hypothetical protein VCCP10303_0505 [Vibrio cholerae CP1030(3)]
gi|395979271|gb|EJH88630.1| hypothetical protein VCCP1047_0494 [Vibrio cholerae CP1047(20)]
gi|408010207|gb|EKG48079.1| hypothetical protein VCHC39A1_0495 [Vibrio cholerae HC-39A1]
gi|408016967|gb|EKG54491.1| hypothetical protein VCHC41A1_0512 [Vibrio cholerae HC-41A1]
gi|408037621|gb|EKG74009.1| hypothetical protein VCCP1040_0502 [Vibrio cholerae CP1040(13)]
gi|408044972|gb|EKG80848.1| hypothetical protein VCCP104417_0506 [Vibrio Cholerae CP1044(17)]
gi|408046805|gb|EKG82470.1| hypothetical protein VCCP1050_0509 [Vibrio cholerae CP1050(23)]
gi|408057530|gb|EKG92375.1| hypothetical protein VCHC81A2_0506 [Vibrio cholerae HC-81A2]
gi|408611953|gb|EKK85309.1| hypothetical protein VCCP10336_0523 [Vibrio cholerae CP1033(6)]
gi|408627743|gb|EKL00546.1| hypothetical protein VCHC17A1_0514 [Vibrio cholerae HC-17A1]
gi|408642288|gb|EKL14038.1| hypothetical protein VCHC50A2_0521 [Vibrio cholerae HC-50A2]
gi|408658783|gb|EKL29843.1| hypothetical protein VCHC77A1_0502 [Vibrio cholerae HC-77A1]
gi|408660102|gb|EKL31132.1| hypothetical protein VCHC62A1_0496 [Vibrio cholerae HC-62A1]
gi|408849210|gb|EKL89238.1| hypothetical protein VCHC37A1_0630 [Vibrio cholerae HC-37A1]
gi|408849742|gb|EKL89751.1| hypothetical protein VCHC17A2_0504 [Vibrio cholerae HC-17A2]
gi|408874518|gb|EKM13688.1| hypothetical protein VCHC62B1_0497 [Vibrio cholerae HC-62B1]
gi|408881450|gb|EKM20335.1| hypothetical protein VCHC69A1_0496 [Vibrio cholerae HC-69A1]
gi|439975760|gb|ELP51867.1| hypothetical protein VC4260B_00700 [Vibrio cholerae 4260B]
gi|443433120|gb|ELS75638.1| hypothetical protein VCHC64A1_00491 [Vibrio cholerae HC-64A1]
gi|443436951|gb|ELS83061.1| hypothetical protein VCHC65A1_00495 [Vibrio cholerae HC-65A1]
gi|443440824|gb|ELS90505.1| hypothetical protein VCHC67A1_00494 [Vibrio cholerae HC-67A1]
gi|443444595|gb|ELS97864.1| hypothetical protein VCHC68A1_00490 [Vibrio cholerae HC-68A1]
gi|443448433|gb|ELT05063.1| hypothetical protein VCHC71A1_00490 [Vibrio cholerae HC-71A1]
gi|443451551|gb|ELT11805.1| hypothetical protein VCHC72A2_00493 [Vibrio cholerae HC-72A2]
gi|443458587|gb|ELT25982.1| hypothetical protein VCHC7A1_01504 [Vibrio cholerae HC-7A1]
gi|443462659|gb|ELT33691.1| hypothetical protein VCHC80A1_00463 [Vibrio cholerae HC-80A1]
gi|443466565|gb|ELT41222.1| hypothetical protein VCHC81A1_01284 [Vibrio cholerae HC-81A1]
gi|448266105|gb|EMB03335.1| Hypothetical protein B839_03460 [Vibrio cholerae O1 str. Inaba
G4222]
Length = 96
Score = 35.0 bits (79), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|237749382|ref|ZP_04579862.1| predicted protein [Oxalobacter formigenes OXCC13]
gi|229380744|gb|EEO30835.1| predicted protein [Oxalobacter formigenes OXCC13]
Length = 100
Score = 34.7 bits (78), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 14/44 (31%), Positives = 26/44 (59%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
I + P +K I EA+ +++ AP DG+AN AL+++++
Sbjct: 15 IAVQVSPNAKKTEIVSSDGEALRIRLQAPPVDGKANEALVQFIA 58
>gi|121728574|ref|ZP_01681595.1| conserved hypothetical protein [Vibrio cholerae V52]
gi|153802603|ref|ZP_01957189.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
gi|229512520|ref|ZP_04401991.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21]
gi|254291169|ref|ZP_04961965.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
gi|121629130|gb|EAX61573.1| conserved hypothetical protein [Vibrio cholerae V52]
gi|124121866|gb|EAY40609.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
gi|150422863|gb|EDN14814.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
gi|229350413|gb|EEO15362.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21]
Length = 96
Score = 34.7 bits (78), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|340966729|gb|EGS22236.1| hypothetical protein CTHT_0017530 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 120
Score = 34.7 bits (78), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 17/57 (29%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 36 PPSSVSITIHAKPGS--KSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHV 90
P ++ + + KPG+ + I V D+AV + + A A++GEAN A+++ +S ++ +
Sbjct: 17 PNGTLYLNCNVKPGASKQREGIASVGDDAVEICVAAQAREGEANKAVIKVLSEVLDL 73
>gi|229525149|ref|ZP_04414554.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis
VL426]
gi|229338730|gb|EEO03747.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis
VL426]
Length = 96
Score = 34.7 bits (78), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|297580597|ref|ZP_06942523.1| conserved hypothetical protein [Vibrio cholerae RC385]
gi|297535013|gb|EFH73848.1| conserved hypothetical protein [Vibrio cholerae RC385]
Length = 96
Score = 34.7 bits (78), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|262191069|ref|ZP_06049276.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93]
gi|429886754|ref|ZP_19368298.1| hypothetical protein OSU_1905 [Vibrio cholerae PS15]
gi|262033045|gb|EEY51576.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93]
gi|429226359|gb|EKY32484.1| hypothetical protein OSU_1905 [Vibrio cholerae PS15]
Length = 96
Score = 34.7 bits (78), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 42 ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGEFV 96
+ ++ +P + SI + E + V I AP DG+ANA L +Y++ L V+ G V
Sbjct: 13 LRLYIQPKASRDSIIGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVV 67
>gi|374584225|ref|ZP_09657317.1| UPF0235 protein yggU [Leptonema illini DSM 21528]
gi|373873086|gb|EHQ05080.1| UPF0235 protein yggU [Leptonema illini DSM 21528]
Length = 75
Score = 34.7 bits (78), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 5/55 (9%)
Query: 43 TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLI-----HVSV 92
++H KP S + + D + V++ +P +DG+ANA L+E ++ + HVS+
Sbjct: 5 SVHVKPKSSKQGVHVLEDGTLEVRVHSPPEDGKANAELIETLARHLGVKRRHVSI 59
>gi|345429265|ref|YP_004822383.1| Signal Recognition Particle (SRP) receptor: membrane binding
protein/conserved protein [Haemophilus parainfluenzae
T3T1]
gi|301155326|emb|CBW14792.1| fused Signal Recognition Particle (SRP) receptor: membrane binding
protein/conserved protein [Haemophilus parainfluenzae
T3T1]
Length = 485
Score = 34.3 bits (77), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 13/101 (12%)
Query: 11 AKSAGSTQSKIKTNDENLPSCIRLV------PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
A +AG Q+K DE L +R++ P + +T+ A G + S + DEAVG
Sbjct: 367 ADTAGRLQNKNNLMDE-LKKIVRVMRKYDETAPHEIMLTLDAGTGQNAISQAKLFDEAVG 425
Query: 65 V------QIDAPAKDGEANAALLEYMSSLIHVSVGEFVESV 99
+ ++D AK G A ++ + ++ VGE +E +
Sbjct: 426 LTGISLTKLDGTAKGGVIFAIADQFNLPIRYIGVGEKIEDL 466
>gi|407983885|ref|ZP_11164522.1| hypothetical protein C731_2485 [Mycobacterium hassiacum DSM
44199]
gi|407374462|gb|EKF23441.1| hypothetical protein C731_2485 [Mycobacterium hassiacum DSM
44199]
Length = 76
Score = 34.3 bits (77), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSLIHVSVGE 94
+S ++ + KPGS+ + +V D+ A+ V + A DG+ANAA++ ++ VS E
Sbjct: 2 TSQTVVVRVKPGSRKGPLVEVDDDGALTVYVPERAVDGKANAAVVRLLAEHFGVSRSE 59
>gi|313673537|ref|YP_004051648.1| hypothetical protein Calni_1577 [Calditerrivibrio nitroreducens
DSM 19672]
gi|312940293|gb|ADR19485.1| protein of unknown function DUF167 [Calditerrivibrio
nitroreducens DSM 19672]
Length = 86
Score = 34.3 bits (77), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 15/55 (27%), Positives = 32/55 (58%)
Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGE 94
+ I I+ +PG+K + + + ++I +P DG+AN L+ ++S +++S E
Sbjct: 1 MRIKIYVQPGAKKTAYDGEFNGCIKIKIKSPPTDGKANEELISFISQSLNLSKKE 55
>gi|357037199|ref|ZP_09098999.1| UPF0235 protein yggU [Desulfotomaculum gibsoniae DSM 7213]
gi|355361364|gb|EHG09119.1| UPF0235 protein yggU [Desulfotomaculum gibsoniae DSM 7213]
Length = 95
Score = 34.3 bits (77), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 31/57 (54%)
Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSLIHVSVGE 94
VSI I +P + ++ V + A+ V++ AP DG AN A +++ L+ V+ G
Sbjct: 9 GGVSIKIRVQPRAAKNQVSGVMEGALKVRLTAPPVDGAANKACCAFVAELLGVAKGR 65
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.307 0.123 0.332
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,431,268,870
Number of Sequences: 23463169
Number of extensions: 47841091
Number of successful extensions: 111019
Number of sequences better than 100.0: 392
Number of HSP's better than 100.0 without gapping: 341
Number of HSP's successfully gapped in prelim test: 51
Number of HSP's that attempted gapping in prelim test: 110645
Number of HSP's gapped (non-prelim): 392
length of query: 100
length of database: 8,064,228,071
effective HSP length: 69
effective length of query: 31
effective length of database: 6,445,269,410
effective search space: 199803351710
effective search space used: 199803351710
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 69 (31.2 bits)