BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780981|ref|YP_003065394.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter asiaticus str. psy62] (122 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780981|ref|YP_003065394.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter asiaticus str. psy62] gi|254040658|gb|ACT57454.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter asiaticus str. psy62] Length = 122 Score = 220 bits (561), Expect = 4e-56, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKKYFTILTMLFVSSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQ 60 MKKYFTILTMLFVSSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQ Sbjct: 1 MKKYFTILTMLFVSSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQ 60 Query: 61 ENPKKQFSEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGH 120 ENPKKQFSEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGH Sbjct: 61 ENPKKQFSEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGH 120 Query: 121 RY 122 RY Sbjct: 121 RY 122 >gi|254780984|ref|YP_003065397.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter asiaticus str. psy62] gi|254040661|gb|ACT57457.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter asiaticus str. psy62] Length = 125 Score = 47.1 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 20/125 (16%) Query: 1 MKKYFTILTMLFVSSAIN---------PCGIEEDNLKSSPLPHIA--LESLSAENEK--K 47 MKKY T+LT+L +S+ +N P E+ + + P + L L AENEK K Sbjct: 1 MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60 Query: 48 ELSEHEKKVIESQENPKKQFSEHEKKETDD---PKSARKENIVMKKTFSQKSKKYTPYFD 104 E + +++ EN KK F EHEKK T + RK NI F + KY Y + Sbjct: 61 EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNI---NQFYEARGKYR-YGN 116 Query: 105 HYMTN 109 Y N Sbjct: 117 GYYRN 121 >gi|160937427|ref|ZP_02084788.1| hypothetical protein CLOBOL_02318 [Clostridium bolteae ATCC BAA-613] gi|158439496|gb|EDP17246.1| hypothetical protein CLOBOL_02318 [Clostridium bolteae ATCC BAA-613] Length = 512 Score = 41.7 bits (96), Expect = 0.034, Method: Composition-based stats. Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 11/62 (17%) Query: 38 ESLSAENE--------KKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENIVMK 89 ESL +NE + EL +EK+V+ +EN +K+ EKKE D R+ENI+ K Sbjct: 64 ESLKTKNELEKETKERRAELQRYEKRVLSKEENVEKKADALEKKEAD---LVRRENILSK 120 Query: 90 KT 91 +T Sbjct: 121 RT 122 >gi|195995537|ref|XP_002107637.1| hypothetical protein TRIADDRAFT_51343 [Trichoplax adhaerens] gi|190588413|gb|EDV28435.1| hypothetical protein TRIADDRAFT_51343 [Trichoplax adhaerens] Length = 7662 Score = 41.7 bits (96), Expect = 0.034, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 9/83 (10%) Query: 22 IEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEH---------EK 72 +EE++++ A E LSAE +K L +HEK++ Q+N + + ++H E+ Sbjct: 6977 LEEEDVEFQKKAAEAKEKLSAEEFQKMLDDHEKELESMQQNQEIEIAKHRDQLNSKLEER 7036 Query: 73 KETDDPKSARKENIVMKKTFSQK 95 + + ARK+ I M+K Q+ Sbjct: 7037 RRRKQQQLARKQEIEMQKKLLQQ 7059 >gi|239628380|ref|ZP_04671411.1| RNA binding metal dependent phosphohydrolase [Clostridiales bacterium 1_7_47_FAA] gi|239518526|gb|EEQ58392.1| RNA binding metal dependent phosphohydrolase [Clostridiales bacterium 1_7_47FAA] Length = 513 Score = 41.3 bits (95), Expect = 0.049, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 11/62 (17%) Query: 38 ESLSAENE--------KKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENIVMK 89 ESL +NE + EL +EK+V+ +EN +K+ EKKE D R+EN++ K Sbjct: 65 ESLKTKNELEKETKERRAELQRYEKRVLSKEENVEKKADALEKKEAD---LVRRENVLSK 121 Query: 90 KT 91 +T Sbjct: 122 RT 123 >gi|225388514|ref|ZP_03758238.1| hypothetical protein CLOSTASPAR_02250 [Clostridium asparagiforme DSM 15981] gi|225045445|gb|EEG55691.1| hypothetical protein CLOSTASPAR_02250 [Clostridium asparagiforme DSM 15981] Length = 515 Score = 40.5 bits (93), Expect = 0.074, Method: Composition-based stats. Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 11/62 (17%) Query: 38 ESLSAENE--------KKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENIVMK 89 ESL +NE + EL +EK+V+ +EN +K+ EKKE D R+EN++ K Sbjct: 67 ESLKTKNELEKETKERRAELQRYEKRVLSKEENVEKKADALEKKEAD---LVRRENVLSK 123 Query: 90 KT 91 ++ Sbjct: 124 RS 125 >gi|328766389|gb|EGF76443.1| hypothetical protein BATDEDRAFT_28354 [Batrachochytrium dendrobatidis JAM81] Length = 201 Score = 40.1 bits (92), Expect = 0.094, Method: Composition-based stats. Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 10/86 (11%) Query: 21 GIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKS 80 G E LK + H L+ L +NE KEL + +I+ +E+ +K+ +EH P S Sbjct: 124 GAAEARLK---MQHEVLDGL--KNELKELVKKASDIIKKEESLEKKLTEHLS-----PGS 173 Query: 81 ARKENIVMKKTFSQKSKKYTPYFDHY 106 + I+ KT S K Y +FDH+ Sbjct: 174 SANGGILKLKTVSGYKKCYEFFFDHF 199 >gi|20090163|ref|NP_616238.1| type IV secretion system protein [Methanosarcina acetivorans C2A] gi|19915148|gb|AAM04718.1| type IV secretion system protein [Methanosarcina acetivorans C2A] Length = 838 Score = 39.4 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 4/52 (7%) Query: 38 ESLSAENEKKE--LSEHEKKVIESQENPKK--QFSEHEKKETDDPKSARKEN 85 E+ ENE+KE E+E+K + QEN +K Q E+EKKE ++ RKEN Sbjct: 144 ENNRQENERKENQRQENERKENQRQENERKENQRQENEKKENQRQENERKEN 195 >gi|118405113|ref|NP_001072267.1| guanylate binding protein 2, interferon-inducible [Xenopus (Silurana) tropicalis] gi|113197897|gb|AAI21266.1| hypothetical protein MGC145306 [Xenopus (Silurana) tropicalis] Length = 591 Score = 39.4 bits (90), Expect = 0.18, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 30/49 (61%) Query: 38 ESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENI 86 E+ AE +++ L E EK + ES EN KK F +HEK + ++ R++ I Sbjct: 494 ETAEAERQRQILEEKEKYLQESMENQKKSFEQHEKMLIEKMEADRQKLI 542 >gi|328766551|gb|EGF76605.1| hypothetical protein BATDEDRAFT_28464 [Batrachochytrium dendrobatidis JAM81] Length = 201 Score = 38.6 bits (88), Expect = 0.25, Method: Composition-based stats. Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%) Query: 43 ENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPY 102 +NE KEL + +I+ +E+ +K+ +EH P S+ I+ KT S K Y + Sbjct: 141 KNELKELVKKASDIIKKEESLEKKLTEHLS-----PGSSANGGILKLKTVSGYKKCYEFF 195 Query: 103 FDHY 106 FDH+ Sbjct: 196 FDHF 199 >gi|328793299|ref|XP_397196.4| PREDICTED: hypothetical protein LOC413757 [Apis mellifera] Length = 622 Score = 38.6 bits (88), Expect = 0.26, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 49/87 (56%), Gaps = 2/87 (2%) Query: 13 VSSAINPCGIE-EDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHE 71 +SS +N ++ ++N SS ++ +++S E EK+ +S+ E V S + + S+ Sbjct: 199 LSSKLNKVSVDIDNNYSSSRKENLISKNISDEEEKENISKEENNVAASFQTSFQNISKIN 258 Query: 72 KKETD-DPKSARKENIVMKKTFSQKSK 97 K + + S++KEN+++K F ++ K Sbjct: 259 KISVEINNNSSKKENLILKSIFDEEKK 285 >gi|154316179|ref|XP_001557411.1| hypothetical protein BC1G_03675 [Botryotinia fuckeliana B05.10] gi|150846461|gb|EDN21654.1| hypothetical protein BC1G_03675 [Botryotinia fuckeliana B05.10] Length = 1959 Score = 38.2 bits (87), Expect = 0.38, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 44/89 (49%), Gaps = 13/89 (14%) Query: 16 AINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKET 75 ++ P + +D+ K +PL +S E+ + EK + S+EN ++ ++H E Sbjct: 387 SLKPTPVHKDD-KQAPLR----SEISDEHSPHKRKSSEKHIKMSKENERRDINDHSDGEF 441 Query: 76 DDPKSARKENIVMKKTFSQKSKKYTPYFD 104 D P+S+RK+ +KS+ +P D Sbjct: 442 DRPQSSRKQR--------RKSRSPSPSVD 462 >gi|295425041|ref|ZP_06817747.1| PTS family maltose/glucose porter, IIABC component [Lactobacillus amylolyticus DSM 11664] gi|295065238|gb|EFG56140.1| PTS family maltose/glucose porter, IIABC component [Lactobacillus amylolyticus DSM 11664] Length = 285 Score = 37.4 bits (85), Expect = 0.56, Method: Composition-based stats. Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 4/68 (5%) Query: 49 LSEHEKKVIESQENPKKQFSEHEKKETDDPK-SARKENIVMKKTFSQKSKKYTPY-FDHY 106 ++ H ++VIE ++ +Q H+K E DP S + N +++K ++ K K PY F +Y Sbjct: 3 IARHRRQVIEKIKSNAEQKKFHKKAEIGDPVLSKEQSNALIRKFWANKDK--APYLFFNY 60 Query: 107 MTNGHLNL 114 + G NL Sbjct: 61 LLRGVFNL 68 >gi|62857693|ref|NP_001016773.1| guanylate binding protein 1, interferon-inducible [Xenopus (Silurana) tropicalis] gi|89270408|emb|CAJ81522.1| novel protein similar to guanylate binding protein 4 [Xenopus (Silurana) tropicalis] Length = 595 Score = 37.4 bits (85), Expect = 0.62, Method: Composition-based stats. Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 38 ESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENIVM-KKTFSQK 95 E+ AE +++ L E++K + ES EN KK F +HEK + + R++ I ++ +QK Sbjct: 495 ETAEAERQRQILEENKKYLQESMENQKKSFEQHEKMLIEKMDADRQKLIAENERVIAQK 553 >gi|154299111|ref|XP_001549976.1| serine/threonine-protein kinase [Botryotinia fuckeliana B05.10] gi|150857571|gb|EDN32763.1| serine/threonine-protein kinase [Botryotinia fuckeliana B05.10] Length = 1184 Score = 37.4 bits (85), Expect = 0.69, Method: Composition-based stats. Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%) Query: 13 VSSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVI 57 V P GIEE NLK LP E LS ENE E+ +H +V+ Sbjct: 381 VFDVFRPWGIEEKNLKLMILPQPKPE-LSTENEISEMDQHRNRVV 424 >gi|157278622|ref|NP_001098411.1| forkhead box Q2 [Danio rerio] gi|156230377|gb|AAI51983.1| Foxq2 protein [Danio rerio] Length = 244 Score = 37.1 bits (84), Expect = 0.82, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 11/90 (12%) Query: 24 EDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARK 83 EDN+ P L+S E E+K LSE + + E QEN + + H K E D K A+ Sbjct: 36 EDNVN----PSEDLQSTVNEPEQKTLSEQDSEKSEEQENDEDHENTHVKSEGTDEKPAQS 91 Query: 84 ENIVMKKTFSQKSKK-------YTPYFDHY 106 ++ +K Y DHY Sbjct: 92 YIALISMAILDSDEKKLLLCDIYQWIMDHY 121 >gi|301610289|ref|XP_002934696.1| PREDICTED: interferon-induced guanylate-binding protein 2-like [Xenopus (Silurana) tropicalis] Length = 420 Score = 37.1 bits (84), Expect = 0.85, Method: Composition-based stats. Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 38 ESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKENIVM-KKTFSQK 95 E+ AE +++ L E++K + ES EN KK F +HEK + + R++ I ++ +QK Sbjct: 320 ETAEAERQRQILEENKKYLQESMENQKKSFEQHEKMLIEKMDADRQKLIAENERVIAQK 378 >gi|328786591|ref|XP_624181.3| PREDICTED: glycoprotein 150 [Apis mellifera] Length = 908 Score = 36.7 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 23/71 (32%), Positives = 35/71 (49%) Query: 22 IEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSA 81 + E N+ +S +P I L+ E KE S K I +E+ S +E+ ET K Sbjct: 20 LNEANIDASHVPLIKSNHLNDSPELKEESLTATKAISEKESYSTDASINERTETVADKEL 79 Query: 82 RKENIVMKKTF 92 RK+++ KTF Sbjct: 80 RKDDLQFMKTF 90 >gi|209877184|ref|XP_002140034.1| translation initiation factor SUI1 family protein [Cryptosporidium muris RN66] gi|209555640|gb|EEA05685.1| translation initiation factor SUI1 family protein [Cryptosporidium muris RN66] Length = 187 Score = 36.7 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 1/86 (1%) Query: 12 FVSSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHE 71 FV + CG+ D + P P I LE + + S+ K +I+S E Q H+ Sbjct: 15 FVQVYCSKCGLPPDYCEYGPTPEICLELKKSNVNVNKTSDINKNIIDSLETEGNQ-QIHD 73 Query: 72 KKETDDPKSARKENIVMKKTFSQKSK 97 K+ D ++ + E + +K +KSK Sbjct: 74 KQSKDICETNQNETVNLKLYKVKKSK 99 >gi|227871832|ref|ZP_03990234.1| 2',3'-cyclic-nucleotide 2'-phosphodiesterase [Oribacterium sinus F0268] gi|227842333|gb|EEJ52561.1| 2',3'-cyclic-nucleotide 2'-phosphodiesterase [Oribacterium sinus F0268] Length = 522 Score = 36.7 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 19/60 (31%), Positives = 36/60 (60%), Gaps = 8/60 (13%) Query: 25 DNLKSSPLPHIALESLSAENE--------KKELSEHEKKVIESQENPKKQFSEHEKKETD 76 +N K L E+L A+NE ++E+S+ EK++++ +E+ +K+ + EKKE+D Sbjct: 61 ENKKREALLEAKEEALKAKNELDKEIKDRRREISDLEKRILKREESSEKKAANLEKKESD 120 >gi|326503804|dbj|BAK02688.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 821 Score = 36.3 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 16/50 (32%), Positives = 28/50 (56%) Query: 23 EEDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEK 72 E D S+P H E + +K++ E+EKKV+E Q+ K+ + ++K Sbjct: 696 EHDGFASTPPKHTMTEKWIMDYQKRKYGENEKKVLEQQKAHKRMSASYQK 745 >gi|196008339|ref|XP_002114035.1| hypothetical protein TRIADDRAFT_58085 [Trichoplax adhaerens] gi|190583054|gb|EDV23125.1| hypothetical protein TRIADDRAFT_58085 [Trichoplax adhaerens] Length = 804 Score = 36.3 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 1/82 (1%) Query: 38 ESLSAENEKKELSEHEKKVIESQ-ENPKKQFSEHEKKETDDPKSARKENIVMKKTFSQKS 96 + +S +EK + + V ES E+P+++ E +K ++ D A + I ++ + S Sbjct: 60 DKVSNLDEKDQADDQGDNVKESSDESPEEKADEEQKIKSVDGNDALQNGINAEQDDDKNS 119 Query: 97 KKYTPYFDHYMTNGHLNLPQNN 118 ++ T +H MTNG P N+ Sbjct: 120 QEKTDRVNHGMTNGDREEPTND 141 >gi|126304351|ref|XP_001382128.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 331 Score = 36.3 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 18/49 (36%), Positives = 29/49 (59%) Query: 36 ALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKE 84 A+E L EKKELS+ E K S+ + K+ FSE E+++++ +E Sbjct: 241 AVEDLGNSEEKKELSDEEDKEDISEGDDKEDFSEKEEEKSNSNDETTEE 289 >gi|86151083|ref|ZP_01069299.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 260.94] gi|315124455|ref|YP_004066459.1| hypothetical protein ICDCCJ07001_922 [Campylobacter jejuni subsp. jejuni ICDCCJ07001] gi|85842253|gb|EAQ59499.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 260.94] gi|315018177|gb|ADT66270.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni ICDCCJ07001] Length = 272 Score = 35.9 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 3/68 (4%) Query: 46 KKELSEHEKKVIESQENPKKQF--SEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYF 103 KK + E + K ++NPKK+F S++ K ET +PK K I +K SQK +KYT Sbjct: 44 KKAIIEAKAKKRILKKNPKKEFTSSKNHKPETHEPKQENKIEIKNEKNKSQK-EKYTVKS 102 Query: 104 DHYMTNGH 111 D + H Sbjct: 103 DEIFDSFH 110 >gi|86152862|ref|ZP_01071067.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni HB93-13] gi|85843747|gb|EAQ60957.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni HB93-13] Length = 272 Score = 35.9 bits (81), Expect = 1.9, Method: Composition-based stats. Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 3/68 (4%) Query: 46 KKELSEHEKKVIESQENPKKQF--SEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYF 103 KK + E + K ++NPKK+F S++ K ET +PK K I +K SQK +KYT Sbjct: 44 KKAIIEAKAKKRILKKNPKKEFTSSKNHKPETHEPKQENKIEIKNEKNKSQK-EKYTVKS 102 Query: 104 DHYMTNGH 111 D + H Sbjct: 103 DEIFDSFH 110 >gi|86150682|ref|ZP_01068903.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni CF93-6] gi|88596264|ref|ZP_01099501.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 84-25] gi|121612959|ref|YP_001000639.1| hypothetical protein CJJ81176_0980 [Campylobacter jejuni subsp. jejuni 81-176] gi|167005566|ref|ZP_02271324.1| hypothetical protein Cjejjejuni_05060 [Campylobacter jejuni subsp. jejuni 81-176] gi|218562576|ref|YP_002344355.1| hypothetical protein Cj0957c [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|283956363|ref|ZP_06373843.1| hypothetical protein C1336_000250134 [Campylobacter jejuni subsp. jejuni 1336] gi|85838863|gb|EAQ56131.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni CF93-6] gi|87248882|gb|EAQ71845.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 81-176] gi|88191105|gb|EAQ95077.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 84-25] gi|112360282|emb|CAL35077.1| hypothetical protein Cj0957c [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|283792083|gb|EFC30872.1| hypothetical protein C1336_000250134 [Campylobacter jejuni subsp. jejuni 1336] gi|284926190|gb|ADC28542.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni IA3902] gi|315928513|gb|EFV07817.1| Putative uncharacterized protein [Campylobacter jejuni subsp. jejuni 305] Length = 272 Score = 35.9 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 3/68 (4%) Query: 46 KKELSEHEKKVIESQENPKKQF--SEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYF 103 KK + E + K ++NPKK+F S++ K ET +PK K I +K SQK +KYT Sbjct: 44 KKAIIEAKAKKRILKKNPKKEFTSSKNHKPETHEPKQENKIEIKNEKNKSQK-EKYTVKS 102 Query: 104 DHYMTNGH 111 D + H Sbjct: 103 DEIFDSFH 110 >gi|153951735|ref|YP_001397951.1| hypothetical protein JJD26997_0824 [Campylobacter jejuni subsp. doylei 269.97] gi|152939181|gb|ABS43922.1| conserved hypothetical protein [Campylobacter jejuni subsp. doylei 269.97] Length = 272 Score = 35.5 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 3/68 (4%) Query: 46 KKELSEHEKKVIESQENPKKQF--SEHEKKETDDPKSARKENIVMKKTFSQKSKKYTPYF 103 KK + E + K ++NPKK+F S++ K ET +PK K I +K SQK +KYT Sbjct: 44 KKAIIEAKAKKRILKKNPKKEFTSSKNHKSETHEPKQEIKIEIKNEKNKSQK-EKYTVKS 102 Query: 104 DHYMTNGH 111 D + H Sbjct: 103 DEIFDSFH 110 >gi|66546392|ref|XP_392295.2| PREDICTED: hypothetical protein LOC408761 isoform 2 [Apis mellifera] Length = 1166 Score = 35.5 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 28/98 (28%), Positives = 43/98 (43%), Gaps = 13/98 (13%) Query: 31 PLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARK---ENI- 86 P A E + + E++++ + K I+ N EHE++ETD PK+ K ENI Sbjct: 666 PRQQQATERIQSPIEREKIDD--KSAID---NANHSDDEHEREETDGPKTISKTALENIA 720 Query: 87 ----VMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGH 120 M+ FS K Y + NG + + N H Sbjct: 721 RTGVTMQFCFSDKPISNKSYLPGFKQNGQKPMDEQNQH 758 >gi|156088517|ref|XP_001611665.1| hypothetical protein [Babesia bovis T2Bo] gi|154798919|gb|EDO08097.1| hypothetical protein BBOV_III005340 [Babesia bovis] Length = 918 Score = 35.1 bits (79), Expect = 2.9, Method: Composition-based stats. Identities = 16/57 (28%), Positives = 34/57 (59%), Gaps = 2/57 (3%) Query: 42 AENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDP--KSARKENIVMKKTFSQKS 96 ++N+ ++ ++E + S+ +P +Q + +KKET+DP + +R V F++KS Sbjct: 36 SQNKSQKQKKNENASVYSENHPHRQAHDAQKKETNDPSKRESRLRGTVPDSQFAEKS 92 >gi|162849467|emb|CAN10434.1| pB407L [African swine fever virus OURT 88/3] Length = 412 Score = 35.1 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 47/110 (42%), Gaps = 7/110 (6%) Query: 14 SSAINPCGIEEDNLKSSPLPH-IALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEK 72 + +I + +D+++S +A S S + LS H KK ES+ + K FS+ Sbjct: 131 NGSIRSGSVRDDSVRSGKTRRGLACNSSSRNDRGYSLSTHRKKYAESEASQKTAFSKR-- 188 Query: 73 KETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGHRY 122 D K+ E+ +K+ +K+ +H +NG N H Y Sbjct: 189 ----DRKNHYAESEYSEKSIKPSTKQVDRLINHLRSNGDPNSFYKKDHDY 234 >gi|162849294|emb|CAN10184.1| pB407L [African swine fever virus Benin 97/1] Length = 417 Score = 35.1 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 47/110 (42%), Gaps = 7/110 (6%) Query: 14 SSAINPCGIEEDNLKSSPLPH-IALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEK 72 + +I + +D+++S +A S S + LS H KK ES+ + K FS+ Sbjct: 136 NGSIRSGSVRDDSVRSGKTRRGLACNSSSRNDRGYSLSTHRKKYAESEASQKTAFSKR-- 193 Query: 73 KETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGHRY 122 D K+ E+ +K+ +K+ +H +NG N H Y Sbjct: 194 ----DRKNHYAESEYSEKSIKPSTKQVDRLINHLRSNGDPNSFYKKDHDY 239 >gi|308460872|ref|XP_003092735.1| hypothetical protein CRE_24245 [Caenorhabditis remanei] gi|308252572|gb|EFO96524.1| hypothetical protein CRE_24245 [Caenorhabditis remanei] Length = 593 Score = 35.1 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 39/78 (50%) Query: 24 EDNLKSSPLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARK 83 +D K+S I E + + E ++L + +KV+ES+ K++ + ++K + + + Sbjct: 486 QDPSKASIYSQICEEMMEIKKELEDLKNNYEKVVESEAKLKEELTLEKEKNRETILTLKA 545 Query: 84 ENIVMKKTFSQKSKKYTP 101 EN ++ Q K TP Sbjct: 546 ENAANERVIQQLLDKLTP 563 >gi|9628192|ref|NP_042778.1| pB407L [African swine fever virus] gi|82051542|sp|Q65173|VF407_ASFB7 RecName: Full=Uncharacterized protein B407L; Short=pB407L gi|780454|gb|AAA65314.1| pB407L [African swine fever virus] gi|291289529|emb|CBH29186.1| BA71V-B407L [African swine fever virus E75] gi|1097475|prf||2113434CN B407L gene Length = 407 Score = 35.1 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 47/110 (42%), Gaps = 7/110 (6%) Query: 14 SSAINPCGIEEDNLKSSPLPH-IALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEK 72 + +I + +D+++S +A S S + LS H KK ES+ + K FS+ Sbjct: 126 NGSIRSGSVRDDSVRSGKTRRGLACNSSSRNDRGYSLSTHRKKYAESEASQKTAFSKR-- 183 Query: 73 KETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGHRY 122 D K+ E+ +K+ +K+ +H +NG N H Y Sbjct: 184 ----DRKNHYAESEYSEKSIKPSTKQVDRLINHLRSNGDPNSFYKKDHDY 229 >gi|328777259|ref|XP_003249305.1| PREDICTED: hypothetical protein LOC408761 isoform 1 [Apis mellifera] Length = 1108 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 28/98 (28%), Positives = 43/98 (43%), Gaps = 13/98 (13%) Query: 31 PLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARK---ENI- 86 P A E + + E++++ + K I+ N EHE++ETD PK+ K ENI Sbjct: 666 PRQQQATERIQSPIEREKIDD--KSAID---NANHSDDEHEREETDGPKTISKTALENIA 720 Query: 87 ----VMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNNGH 120 M+ FS K Y + NG + + N H Sbjct: 721 RTGVTMQFCFSDKPISNKSYLPGFKQNGQKPMDEQNQH 758 >gi|291398522|ref|XP_002715912.1| PREDICTED: Bromodomain testis-specific protein-like [Oryctolagus cuniculus] Length = 953 Score = 34.7 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 26/54 (48%) Query: 32 LPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKSARKEN 85 L + +NEK + ++KV ENP+K+F + + KET +K N Sbjct: 437 LSQVPFRKPKKKNEKSQRENKKEKVNNRDENPRKKFKQMKPKETSKSNQPKKRN 490 >gi|260940879|ref|XP_002615279.1| hypothetical protein CLUG_04161 [Clavispora lusitaniae ATCC 42720] gi|238850569|gb|EEQ40033.1| hypothetical protein CLUG_04161 [Clavispora lusitaniae ATCC 42720] Length = 1633 Score = 34.4 bits (77), Expect = 4.7, Method: Composition-based stats. Identities = 16/39 (41%), Positives = 25/39 (64%), Gaps = 1/39 (2%) Query: 42 AENEKKELSEHEKKVIESQENPKKQFSEHEKKETDDPKS 80 +++ KELSE K +SQ++PK+QF + K + D KS Sbjct: 138 SQDSPKELSEDSPKE-QSQDSPKEQFQDSPKDSSQDSKS 175 >gi|324326202|gb|ADY21462.1| hypothetical protein YBT020_11090 [Bacillus thuringiensis serovar finitimus YBT-020] Length = 331 Score = 34.4 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 29/106 (27%), Positives = 51/106 (48%), Gaps = 11/106 (10%) Query: 2 KKYFTILTMLFV-----SSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKV 56 KK F I +LFV SS NP E+ S + E ++ ++ E E ++K Sbjct: 93 KKQFIITAILFVIFGALSSISNPTS-EKTTATSKKVASNTTEQKDSDKKELEKKEADEKA 151 Query: 57 IESQENPKKQFSEHEKKETDDPK----SARKENIVMKKTFSQKSKK 98 + QE+ K+Q E +K+ D+ + ARK+ K+ ++++K Sbjct: 152 -QKQEDEKRQAEEQARKQEDEKRQADEQARKQQEEQKRLADEQTRK 196 >gi|321476963|gb|EFX87922.1| hypothetical protein DAPPUDRAFT_305608 [Daphnia pulex] Length = 347 Score = 34.4 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 11/90 (12%) Query: 12 FVSSAINPCGIEEDNLKSS-----PLPHIALESLSAENEKKELSE------HEKKVIESQ 60 F+ ++ C IE N SS PLP AL+ L ++ + S+ HE++ IES+ Sbjct: 252 FIQHWVDSCEIETANNNSSDSDTQPLPMSALDPLWTDDPEPTFSKEEPTKNHEEEKIESR 311 Query: 61 ENPKKQFSEHEKKETDDPKSARKENIVMKK 90 P+ Q ++ + T P+S + +V + Sbjct: 312 NKPELQDNKILEPSTSTPRSIQSREMVTPR 341 >gi|206975258|ref|ZP_03236172.1| conserved domain protein [Bacillus cereus H3081.97] gi|217959697|ref|YP_002338249.1| hypothetical protein BCAH187_A2296 [Bacillus cereus AH187] gi|206746679|gb|EDZ58072.1| conserved domain protein [Bacillus cereus H3081.97] gi|217067730|gb|ACJ81980.1| conserved domain protein [Bacillus cereus AH187] Length = 331 Score = 34.4 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 29/106 (27%), Positives = 51/106 (48%), Gaps = 11/106 (10%) Query: 2 KKYFTILTMLFV-----SSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKV 56 KK F I +LFV SS NP E+ S + E ++ ++ E E ++K Sbjct: 93 KKQFIITAILFVIFGALSSISNPTS-EKTTATSKKVASNTTEQKDSDKKELEKKEADEKT 151 Query: 57 IESQENPKKQFSEHEKKETDDPK----SARKENIVMKKTFSQKSKK 98 + QE+ K+Q E +K+ D+ + ARK+ K+ ++++K Sbjct: 152 -QKQEDEKRQAEEQARKQEDEKRQADEQARKQQEEQKRLADEQTRK 196 >gi|222095784|ref|YP_002529841.1| hypothetical protein BCQ_2124 [Bacillus cereus Q1] gi|221239842|gb|ACM12552.1| conserved hypothetical protein [Bacillus cereus Q1] Length = 331 Score = 34.4 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 29/106 (27%), Positives = 51/106 (48%), Gaps = 11/106 (10%) Query: 2 KKYFTILTMLFV-----SSAINPCGIEEDNLKSSPLPHIALESLSAENEKKELSEHEKKV 56 KK F I +LFV SS NP E+ S + E ++ ++ E E ++K Sbjct: 93 KKQFIITAILFVIFGALSSISNPTS-EKTTATSKKVASNTTEQKDSDKKELEKKEADEKA 151 Query: 57 IESQENPKKQFSEHEKKETDDPK----SARKENIVMKKTFSQKSKK 98 + QE+ K+Q E +K+ D+ + ARK+ K+ ++++K Sbjct: 152 -QKQEDEKRQAEEQARKQEDEKRQADEQARKQQEEQKRLADEQTRK 196 >gi|118381042|ref|XP_001023683.1| hypothetical protein TTHERM_00732850 [Tetrahymena thermophila] gi|89305450|gb|EAS03438.1| hypothetical protein TTHERM_00732850 [Tetrahymena thermophila SB210] Length = 1530 Score = 34.0 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 10/99 (10%) Query: 23 EEDNLKSSPLPHIALESLSAENEKKELSEHE--KKVIESQENP--KKQFSEHEKKETDDP 78 + L SP H + + ++N + S+HE + V E+Q N K+Q E ++ DD Sbjct: 1225 KNQKLDKSP-THTQSQEIQSKNSSRIPSKHESSQDVNENQSNAFQKQQKIEQRQRSIDDQ 1283 Query: 79 KSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQN 117 KENI M + SQK K+ D+ N+PQ+ Sbjct: 1284 NKVEKENIQMSQKNSQKEKE-----DNSKIQQQNNIPQH 1317 >gi|126306475|ref|XP_001374375.1| PREDICTED: similar to centrosome-associated protein 350 [Monodelphis domestica] Length = 3131 Score = 34.0 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 6/55 (10%) Query: 56 VIESQENPKKQFSEHEKKETDDPKSARKENIVM----KKTFSQKSKKYTPYFDHY 106 +I+ Q +F +H + TD + + E++V+ ++T K++KY+PYFD Y Sbjct: 1471 LIDHQRQQHSEFMKHLRARTDTDR--KSESVVLSQSKEETSDSKNQKYSPYFDSY 1523 >gi|123478011|ref|XP_001322170.1| hypothetical protein [Trichomonas vaginalis G3] gi|121905011|gb|EAY09947.1| hypothetical protein TVAG_482210 [Trichomonas vaginalis G3] Length = 392 Score = 34.0 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 12/106 (11%) Query: 14 SSAINPCGIEEDNLKSS-PLPHIALESLSAENEKKELSEHEKKVIESQENPKKQFSEHEK 72 SSA N EED K + L +AL S S++N+ SE +KK+ E+ K S + + Sbjct: 179 SSAKNTKLSEEDQFKLNLELTDVALHS-SSQNK----SEQQKKLSEAD----KILSPYAE 229 Query: 73 KETDDPKSARKENIVMKKTFSQKSKKYTPYFDHYMTNGHLNLPQNN 118 ET DPK A + ++ + SQK K+Y Y G N PQ N Sbjct: 230 TETSDPKLAYRRDLYIADIHSQK-KRYEESLPFYERAGD-NFPQAN 273 >gi|47522868|ref|NP_999188.1| osteopontin precursor [Sus scrofa] gi|311262844|ref|XP_003129381.1| PREDICTED: osteopontin-like isoform 1 [Sus scrofa] gi|311262846|ref|XP_003129383.1| PREDICTED: osteopontin-like isoform 3 [Sus scrofa] gi|311262848|ref|XP_003129382.1| PREDICTED: osteopontin-like isoform 2 [Sus scrofa] gi|129262|sp|P14287|OSTP_PIG RecName: Full=Osteopontin; AltName: Full=Bone sialoprotein 1; AltName: Full=Secreted phosphoprotein 1; Short=SPP-1; Flags: Precursor gi|2121|emb|CAA34594.1| unnamed protein product [Sus scrofa] gi|262204805|dbj|BAI48023.1| secreted phosphoprotein 1 [Sus scrofa] Length = 303 Score = 33.6 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 8/82 (9%) Query: 23 EEDNLKSSPLPHIALESLSAENEKK------ELSEHEKKVIESQENPK--KQFSEHEKKE 74 E+D+ ++S ++E+ S E K+ + S VIESQENPK ++F HE K Sbjct: 216 EKDSQETSQPDDRSVETRSQEQSKEYTIKTYDGSNEHSNVIESQENPKVSQEFHSHEDKL 275 Query: 75 TDDPKSARKENIVMKKTFSQKS 96 D KS +++ ++ + +S Sbjct: 276 VPDSKSEEDKHLKLRVSHELES 297 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.126 0.350 Lambda K H 0.267 0.0393 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,015,092,311 Number of Sequences: 14124377 Number of extensions: 30212819 Number of successful extensions: 135487 Number of sequences better than 10.0: 274 Number of HSP's better than 10.0 without gapping: 136 Number of HSP's successfully gapped in prelim test: 138 Number of HSP's that attempted gapping in prelim test: 134011 Number of HSP's gapped (non-prelim): 1495 length of query: 122 length of database: 4,842,793,630 effective HSP length: 89 effective length of query: 33 effective length of database: 3,585,724,077 effective search space: 118328894541 effective search space used: 118328894541 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 75 (33.6 bits)