BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781072|ref|YP_003065485.1| hypothetical protein CLIBASIA_04870 [Candidatus Liberibacter asiaticus str. psy62] (190 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781072|ref|YP_003065485.1| hypothetical protein CLIBASIA_04870 [Candidatus Liberibacter asiaticus str. psy62] gi|254040749|gb|ACT57545.1| hypothetical protein CLIBASIA_04870 [Candidatus Liberibacter asiaticus str. psy62] Length = 190 Score = 329 bits (844), Expect = 1e-88, Method: Composition-based stats. Identities = 190/190 (100%), Positives = 190/190 (100%) Query: 1 MGYSVSASNNDTSKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYG 60 MGYSVSASNNDTSKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYG Sbjct: 1 MGYSVSASNNDTSKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYG 60 Query: 61 AYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEIT 120 AYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEIT Sbjct: 61 AYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEIT 120 Query: 121 KAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFA 180 KAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFA Sbjct: 121 KAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFA 180 Query: 181 EKLEGKFVRK 190 EKLEGKFVRK Sbjct: 181 EKLEGKFVRK 190 >gi|71993010|ref|NP_001023349.1| Kinesin-Like Protein family member (klp-12) [Caenorhabditis elegans] gi|58081827|emb|CAB07273.2| C. elegans protein T01G1.1a, confirmed by transcript evidence [Caenorhabditis elegans] gi|58081863|emb|CAB05214.2| C. elegans protein T01G1.1a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 1605 Score = 41.2 bits (95), Expect = 0.064, Method: Composition-based stats. Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 9/111 (8%) Query: 33 INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYI---DFRTELKY 89 I + + K+ EL E+ + +RD+V K + + K + + + + D R ELK Sbjct: 629 IRETYEKKLRELMERIKDTETERDRVLNEGGKRGGNNEQMKAIKQEYELKITDLRKELKK 688 Query: 90 FKALTKYYKSVVA----ELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKE 136 +AL K + V+A EL+E KS E+ ++ KA L + NE KK+ Sbjct: 689 IEALDKEHLKVIAKSQRELQEKTRLKS--EVVDLKKAKVELIKKMNEDKKK 737 >gi|11559273|dbj|BAB18763.1| kinesin like protein KLP-12 [Caenorhabditis elegans] Length = 1609 Score = 41.2 bits (95), Expect = 0.066, Method: Composition-based stats. Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 9/111 (8%) Query: 33 INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYI---DFRTELKY 89 I + + K+ EL E+ + +RD+V K + + K + + + + D R ELK Sbjct: 633 IRETYEKKLRELMERIKDTETERDRVLNEGGKRGGNNEQMKAIKQEYELKITDLRKELKK 692 Query: 90 FKALTKYYKSVVA----ELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKE 136 +AL K + V+A EL+E KS E+ ++ KA L + NE KK+ Sbjct: 693 IEALDKEHLKVIAKSQRELQEKTRLKS--EVVDLKKAKVELIKKMNEDKKK 741 >gi|193206546|ref|NP_001122796.1| Kinesin-Like Protein family member (klp-12) [Caenorhabditis elegans] gi|148472918|emb|CAN86621.1| C. elegans protein T01G1.1c, confirmed by transcript evidence [Caenorhabditis elegans] gi|148472926|emb|CAN86600.1| C. elegans protein T01G1.1c, confirmed by transcript evidence [Caenorhabditis elegans] Length = 1580 Score = 40.8 bits (94), Expect = 0.081, Method: Composition-based stats. Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 9/111 (8%) Query: 33 INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYI---DFRTELKY 89 I + + K+ EL E+ + +RD+V K + + K + + + + D R ELK Sbjct: 629 IRETYEKKLRELMERIKDTETERDRVLNEGGKRGGNNEQMKAIKQEYELKITDLRKELKK 688 Query: 90 FKALTKYYKSVVA----ELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKE 136 +AL K + V+A EL+E KS E+ ++ KA L + NE KK+ Sbjct: 689 IEALDKEHLKVIAKSQRELQEKTRLKS--EVVDLKKAKVELIKKMNEDKKK 737 >gi|291460769|ref|ZP_06600159.1| putative nickase [Oribacterium sp. oral taxon 078 str. F0262] gi|291416728|gb|EFE90447.1| putative nickase [Oribacterium sp. oral taxon 078 str. F0262] Length = 475 Score = 40.1 bits (92), Expect = 0.14, Method: Composition-based stats. Identities = 32/136 (23%), Positives = 63/136 (46%), Gaps = 12/136 (8%) Query: 4 SVSASNNDTSKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYD 63 SVS + D + + I+ E I+++ +L ++A+K K +D +Y AY Sbjct: 324 SVSELKESMVTLRDKNYKTTRTIKDTEKKIDEK-----TQLIDQAEKYLKHKD-IYKAYT 377 Query: 64 KVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAV 123 K+ +K+ FY + E+ F++ KY K + E + + K +E+ + K Sbjct: 378 KLKKNKQD------TFYNEHTAEIILFESARKYLKEHLGESKSLNISKLKLEVTALKKEK 431 Query: 124 DTLTRAYNEYKKEIRE 139 D+L + +KE+ + Sbjct: 432 DSLYSQIIDLRKEVEQ 447 >gi|268536404|ref|XP_002633337.1| C. briggsae CBR-KLP-12 protein [Caenorhabditis briggsae] gi|187034478|emb|CAP26295.1| CBR-KLP-12 protein [Caenorhabditis briggsae AF16] Length = 1874 Score = 40.1 bits (92), Expect = 0.16, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 9/111 (8%) Query: 33 INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYI---DFRTELKY 89 I + K+ EL E+ + +RD+V K + + K + + + + D R ELK Sbjct: 910 IRDTYEKKLRELMERIKDTETERDRVLNEGGKRGGNNEQMKAIKQEYELKISDLRKELKK 969 Query: 90 FKALTKYYKSVVA----ELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKE 136 +AL + + V+A EL+E KS E+ ++ KA L + NE KK+ Sbjct: 970 IEALDREHLKVIAKSQRELQEKTRLKS--EVVDLKKAKVELIKKMNEDKKK 1018 >gi|227485075|ref|ZP_03915391.1| pyruvate synthase [Anaerococcus lactolyticus ATCC 51172] gi|227236908|gb|EEI86923.1| pyruvate synthase [Anaerococcus lactolyticus ATCC 51172] Length = 1174 Score = 39.7 bits (91), Expect = 0.19, Method: Composition-based stats. Identities = 19/47 (40%), Positives = 26/47 (55%) Query: 13 SKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVY 59 SK P A F F+Q + S + + F +ELYE+A+KA K R Y Sbjct: 1122 SKEPSASFQEFIQGEVRYSSLKRSFPETADELYEEAEKAAKARYNTY 1168 >gi|15614758|ref|NP_243061.1| aspartate aminotransferase [Bacillus halodurans C-125] gi|10174814|dbj|BAB05914.1| aspartate aminotransferase [Bacillus halodurans C-125] Length = 403 Score = 38.9 bits (89), Expect = 0.36, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 33/61 (54%) Query: 90 FKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIELGF 149 F ++ +Y + V R G G + + ++ + +DTLT+ +EYKK +++ + GF Sbjct: 271 FPSVEQYMAAFVFANRTLGFGNATVTVQRMISKMDTLTQEQHEYKKRRDAMVQVLEDAGF 330 Query: 150 D 150 + Sbjct: 331 E 331 >gi|71061468|ref|NP_001804.2| centromere-associated protein E [Homo sapiens] gi|160358869|sp|Q02224|CENPE_HUMAN RecName: Full=Centromere-associated protein E; AltName: Full=Centromere protein E; Short=CENP-E; AltName: Full=Kinesin-related protein CENPE; Flags: Precursor gi|119626576|gb|EAX06171.1| centromere protein E, 312kDa, isoform CRA_c [Homo sapiens] gi|225356542|gb|AAI56502.1| Centromere protein E, 312kDa [synthetic construct] Length = 2701 Score = 38.5 bits (88), Expect = 0.46, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 66/123 (53%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N K+ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEIENLNLAQKLHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K V +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETVDKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|119626575|gb|EAX06170.1| centromere protein E, 312kDa, isoform CRA_b [Homo sapiens] Length = 2664 Score = 38.5 bits (88), Expect = 0.48, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 66/123 (53%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N K+ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEIENLNLAQKLHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K V +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETVDKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|119626574|gb|EAX06169.1| centromere protein E, 312kDa, isoform CRA_a [Homo sapiens] Length = 2665 Score = 38.5 bits (88), Expect = 0.48, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 66/123 (53%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N K+ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEIENLNLAQKLHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K V +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETVDKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|328868498|gb|EGG16876.1| actin binding protein [Dictyostelium fasciculatum] Length = 1786 Score = 38.1 bits (87), Expect = 0.54, Method: Composition-based stats. Identities = 37/157 (23%), Positives = 72/157 (45%), Gaps = 9/157 (5%) Query: 33 INKEFVTKVEELY----EKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELK 88 +N E K+E+ E + K K+ +++ G +D+ L A ID+++ Sbjct: 635 LNLEAAKKLEDELQIEKEYSLKVKKELEELIGQHDREKGETSRLHGLLSARDIDYQS--- 691 Query: 89 YFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF-IEL 147 + ++ S EL E L E EE A+ L ++++ I+EL+ + +E+ Sbjct: 692 LVSKMDQFATSHAKELEELKLSLQTCE-EERKLALQQLQDTKSDHQSSIKELMNKHRLEI 750 Query: 148 GFDQCDECDLCSEKADVIQKKRIAFEMVEREFAEKLE 184 Q EC +C++K I + + + ER F E+++ Sbjct: 751 DEIQIQECVICNDKQLEISRLIVNLDEKERGFIEEMD 787 >gi|227485074|ref|ZP_03915390.1| pyruvate synthase [Anaerococcus lactolyticus ATCC 51172] gi|227236907|gb|EEI86922.1| pyruvate synthase [Anaerococcus lactolyticus ATCC 51172] Length = 1175 Score = 38.1 bits (87), Expect = 0.56, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 3/55 (5%) Query: 13 SKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSS 67 SK P F F+Q + S + + F +ELY++A+KA K+R Y +Y K+++ Sbjct: 1123 SKEPTGSFQEFIQGEVRYSSLKRSFPETADELYQEAEKAAKER---YESYKKLAN 1174 >gi|296242083|ref|YP_003649570.1| hypothetical protein Tagg_0343 [Thermosphaera aggregans DSM 11486] gi|296094667|gb|ADG90618.1| hypothetical protein Tagg_0343 [Thermosphaera aggregans DSM 11486] Length = 426 Score = 38.1 bits (87), Expect = 0.59, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 7/85 (8%) Query: 41 VEELYEKAQK----AHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKY 96 V+E+ EKA K AH K++ S KSP+ELS R+ LK+F ++ + Sbjct: 10 VKEIMEKAVKSELLAHPHVKKLWELSGDKSLFTKSPEELSSLRRDLLRSSLKFFAEMSDF 69 Query: 97 YKSVVAELREFGLGKSAIEIEEITK 121 Y+ + G+ + E+E++ K Sbjct: 70 YEEL---FNRIGIDPGSAELEDLVK 91 >gi|309367027|emb|CAP20393.2| hypothetical protein CBG_23571 [Caenorhabditis briggsae AF16] Length = 1136 Score = 37.8 bits (86), Expect = 0.72, Method: Composition-based stats. Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 12/101 (11%) Query: 32 VINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYID--------F 83 V+NKE ++++ E+ + H+K +KV A K KS +EL+ + +D Sbjct: 640 VLNKE---ELQKFRERLDEEHQKAEKVREAEKKYLDAMKSTRELTDTYKMDVERLNGAVL 696 Query: 84 RTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVD 124 RTEL + K L K+ K + E GL +S IE + D Sbjct: 697 RTELAHRKRLAKHLKEDPKKYTE-GLIESVYGIEVNNRQFD 736 >gi|268567508|ref|XP_002647798.1| Hypothetical protein CBG23571 [Caenorhabditis briggsae] Length = 857 Score = 37.8 bits (86), Expect = 0.72, Method: Composition-based stats. Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 12/101 (11%) Query: 32 VINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYID--------F 83 V+NKE ++++ E+ + H+K +KV A K KS +EL+ + +D Sbjct: 368 VLNKE---ELQKFRERLDEEHQKAEKVREAEKKYLDAMKSTRELTDTYKMDVERLNGAVL 424 Query: 84 RTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVD 124 RTEL + K L K+ K + E GL +S IE + D Sbjct: 425 RTELAHRKRLAKHLKEDPKKYTE-GLIESVYGIEVNNRQFD 464 >gi|317058831|ref|ZP_07923316.1| SWF/SNF family helicase [Fusobacterium sp. 3_1_5R] gi|313684507|gb|EFS21342.1| SWF/SNF family helicase [Fusobacterium sp. 3_1_5R] Length = 1009 Score = 37.8 bits (86), Expect = 0.73, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 11/84 (13%) Query: 72 PKELSKAFYIDFRTELKYFKALTK--YYKSVVAELREFGLGKSAI-------EIEEITKA 122 P ++ K YID E K F + YY + A +RE GLGK+ E+ +IT Sbjct: 772 PDKIEKTIYIDMNVEHKKFYEERRNYYYNMIHASIREKGLGKAQFFILQALNELRQITSC 831 Query: 123 VDTLTRAYNEYKKEIRELIEEFIE 146 + + KKE+ LIE+ +E Sbjct: 832 PEVKNSYISSSKKEM--LIEQIVE 853 >gi|257465912|ref|ZP_05630223.1| SWF/SNF family helicase [Fusobacterium gonidiaformans ATCC 25563] Length = 1134 Score = 37.8 bits (86), Expect = 0.78, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 11/84 (13%) Query: 72 PKELSKAFYIDFRTELKYFKALTK--YYKSVVAELREFGLGKSAI-------EIEEITKA 122 P ++ K YID E K F + YY + A +RE GLGK+ E+ +IT Sbjct: 897 PDKIEKTIYIDMNVEHKKFYEERRNYYYNMIHASIREKGLGKAQFFILQALNELRQITSC 956 Query: 123 VDTLTRAYNEYKKEIRELIEEFIE 146 + + KKE+ LIE+ +E Sbjct: 957 PEVKNSYISSSKKEM--LIEQIVE 978 >gi|257452288|ref|ZP_05617587.1| SWF/SNF family helicase [Fusobacterium sp. 3_1_5R] Length = 1134 Score = 37.8 bits (86), Expect = 0.78, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 11/84 (13%) Query: 72 PKELSKAFYIDFRTELKYFKALTK--YYKSVVAELREFGLGKSAI-------EIEEITKA 122 P ++ K YID E K F + YY + A +RE GLGK+ E+ +IT Sbjct: 897 PDKIEKTIYIDMNVEHKKFYEERRNYYYNMIHASIREKGLGKAQFFILQALNELRQITSC 956 Query: 123 VDTLTRAYNEYKKEIRELIEEFIE 146 + + KKE+ LIE+ +E Sbjct: 957 PEVKNSYISSSKKEM--LIEQIVE 978 >gi|325294834|ref|YP_004281348.1| Isoleucyl-tRNA synthetase [Desulfurobacterium thermolithotrophum DSM 11699] gi|325065282|gb|ADY73289.1| Isoleucyl-tRNA synthetase [Desulfurobacterium thermolithotrophum DSM 11699] Length = 931 Score = 37.8 bits (86), Expect = 0.79, Method: Composition-based stats. Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 10/112 (8%) Query: 22 SFLQIRSKESVINKEFVTKVE----ELYEKAQKAHKKRDKVYGAYDKVSSHKK--SPKEL 75 S++ R KESV +EF E EL E+ +K K + V A +K +HK E Sbjct: 776 SYIPGREKESVFLEEFPLFCESPDGELLERWEKLMKVKSLVNKALEKARAHKMIGHSLEA 835 Query: 76 SKAFYID--FRTELKYFKALTKY-YKSVVAELREFGLGKS-AIEIEEITKAV 123 S Y+D R L+ +K L Y + + AE+++F S ++ +EI K V Sbjct: 836 SITVYVDGELREFLERYKELLPYIFITSGAEVKQFAAAPSCSVGDDEILKGV 887 >gi|315917068|ref|ZP_07913308.1| SWF/SNF family helicase [Fusobacterium gonidiaformans ATCC 25563] gi|313690943|gb|EFS27778.1| SWF/SNF family helicase [Fusobacterium gonidiaformans ATCC 25563] Length = 1143 Score = 37.8 bits (86), Expect = 0.84, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 11/84 (13%) Query: 72 PKELSKAFYIDFRTELKYFKALTK--YYKSVVAELREFGLGKSAI-------EIEEITKA 122 P ++ K YID E K F + YY + A +RE GLGK+ E+ +IT Sbjct: 906 PDKIEKTIYIDMNVEHKKFYEERRNYYYNMIHASIREKGLGKAQFFILQALNELRQITSC 965 Query: 123 VDTLTRAYNEYKKEIRELIEEFIE 146 + + KKE+ LIE+ +E Sbjct: 966 PEVKNSYISSSKKEM--LIEQIVE 987 >gi|320580809|gb|EFW95031.1| DNA repair protein RAD50 [Pichia angusta DL-1] Length = 732 Score = 37.8 bits (86), Expect = 0.86, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 52/102 (50%), Gaps = 11/102 (10%) Query: 25 QIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFR 84 QIR E+ + E+ TK+ E+ + +++ + ++ Y + ++ KE Y + Sbjct: 484 QIRQLETQLQSEYSTKLGEMTQLSRQIQSVKHELQRDYKDID--ERYTKE-----YASLQ 536 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTL 126 T+L LT YYK++ + + EF ++++EI + +D L Sbjct: 537 TKLSMVTDLTTYYKAIDSAVMEF----HQVKMKEINRIIDEL 574 >gi|262281162|ref|ZP_06058944.1| conserved hypothetical protein [Acinetobacter calcoaceticus RUH2202] gi|262257393|gb|EEY76129.1| conserved hypothetical protein [Acinetobacter calcoaceticus RUH2202] Length = 212 Score = 37.4 bits (85), Expect = 0.94, Method: Composition-based stats. Identities = 28/118 (23%), Positives = 57/118 (48%), Gaps = 6/118 (5%) Query: 31 SVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDF-RTELKY 89 S ++E K+++ Y ++ A KRD+ A K ++++ + + I F R E++Y Sbjct: 99 SARHRELDIKLKKAYSNSKTALIKRDQALLAIKKQINYQQEQLDQLQQDKILFKRQEMEY 158 Query: 90 FKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIEL 147 F+ + V+ E + + + I + + ++TLT+ YN E +IE +L Sbjct: 159 FRK-----REVIPERHKTLIENNLNNIYRVKQTIETLTKTYNRTNSEYENIIERLKKL 211 >gi|189461674|ref|ZP_03010459.1| hypothetical protein BACCOP_02338 [Bacteroides coprocola DSM 17136] gi|189431646|gb|EDV00631.1| hypothetical protein BACCOP_02338 [Bacteroides coprocola DSM 17136] Length = 597 Score = 37.4 bits (85), Expect = 0.94, Method: Composition-based stats. Identities = 28/102 (27%), Positives = 46/102 (45%), Gaps = 16/102 (15%) Query: 89 YFKALTKYYKSVVAELREFGLGKSAIEIEE-ITKAVDTLTRAYNEYKKEIREL---IEEF 144 Y+ A K+YK+ VAEL E GL K E + + + + R + EIR L + ++ Sbjct: 199 YYVAFDKHYKAEVAELMEKGLSKEEAEAQSPLMQEAREMLRKWEANDPEIRALWKKMNDW 258 Query: 145 IELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFAEKLEGK 186 + GFD+ + ++F+ + E LEGK Sbjct: 259 VYAGFDE------------TYKMMGVSFDKIYYESETYLEGK 288 >gi|330684666|gb|EGG96368.1| acetyltransferase, GNAT family [Staphylococcus epidermidis VCU121] Length = 176 Score = 37.4 bits (85), Expect = 1.1, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 8/89 (8%) Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIE-IEEITKAVDTLTRAYNEYKKEIRELIEE 143 +E K + ALT SV ELR GLGK+ ++ +EE K+ T + RE Sbjct: 75 SEEKTYTALTVASLSVKKELRNQGLGKALVQAVEERAKSEGYTTIVVD------RE-TSY 127 Query: 144 FIELGFDQCDECDLCSEKADVIQKKRIAF 172 F +LG+ +E DL SE DV Q + F Sbjct: 128 FTQLGYQLANEYDLYSEDEDVTQPLLVKF 156 >gi|196008767|ref|XP_002114249.1| hypothetical protein TRIADDRAFT_27651 [Trichoplax adhaerens] gi|190583268|gb|EDV23339.1| hypothetical protein TRIADDRAFT_27651 [Trichoplax adhaerens] Length = 903 Score = 37.4 bits (85), Expect = 1.1, Method: Composition-based stats. Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 25/121 (20%) Query: 25 QIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFR 84 +++ ES N++ +++ YEK Q+ K K+ Y++ + KE+S Sbjct: 774 ELQDLESANNQKLMSE----YEKYQELQAKSQKMQEEYERQLNEMDESKEVS-------- 821 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 + LT+YY++ + E KS+ ++E+ + LTR Y+E K++I E ++ Sbjct: 822 -----LQDLTEYYENKLQE-------KSS-QLEQANEEQQQLTREYDETKRQIEEDVDRE 868 Query: 145 I 145 I Sbjct: 869 I 869 >gi|29865|emb|CAA78727.1| CENP-E [Homo sapiens] Length = 2663 Score = 37.0 bits (84), Expect = 1.2, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 66/123 (53%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N ++ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEIENLNLAQELHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K V +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETVDKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|382658|prf||1819485A CENP-E protein Length = 2663 Score = 37.0 bits (84), Expect = 1.2, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 66/123 (53%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N ++ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEIENLNLAQELHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K V +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETVDKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|257451737|ref|ZP_05617036.1| hypothetical protein F3_01639 [Fusobacterium sp. 3_1_5R] gi|317058296|ref|ZP_07922781.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] gi|313683972|gb|EFS20807.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] Length = 494 Score = 37.0 bits (84), Expect = 1.2, Method: Composition-based stats. Identities = 34/127 (26%), Positives = 58/127 (45%), Gaps = 12/127 (9%) Query: 13 SKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSP 72 S + D + + I+ E N + EL ++++K K RD Y Y K+ KKS Sbjct: 352 SALRDKNYKTTRAIKDTEKKTNDKV-----ELIDQSEKYLKHRD-TYKTYVKL---KKSK 402 Query: 73 KELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNE 132 +E FY + E+ F++ KY K + E + + K +E + K DTL + Sbjct: 403 QE---DFYNEHTAEIILFESARKYLKEHLGESKTLAISKWKLEATALKKEKDTLYSQILD 459 Query: 133 YKKEIRE 139 +KE+ + Sbjct: 460 MRKEVEQ 466 >gi|326485258|gb|EGE09268.1| hypothetical protein TEQG_08218 [Trichophyton equinum CBS 127.97] Length = 599 Score = 37.0 bits (84), Expect = 1.3, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 4/112 (3%) Query: 55 RDKVYGAYDKVSSHKK--SPKELSKAFYIDFRTE--LKYFKALTKYYKSVVAELREFGLG 110 RD + A D+++ +P S+ + + TE L F AL + K ++E+ F Sbjct: 82 RDLLAQATDEIAQEPNHGAPIHSSQFGIVRWSTEEKLALFSALARKGKGAISEIASFIGS 141 Query: 111 KSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKA 162 KS IE+++ ++ R ++ +K+ R + I + DEC L E A Sbjct: 142 KSRIEVQQYIMGLEQALREHHITEKKFRGITFSDIHAAAEISDECCLALENA 193 >gi|302508980|ref|XP_003016450.1| conserved hypothetical protein [Arthroderma benhamiae CBS 112371] gi|291180020|gb|EFE35805.1| conserved hypothetical protein [Arthroderma benhamiae CBS 112371] Length = 588 Score = 37.0 bits (84), Expect = 1.4, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 37/77 (48%) Query: 86 ELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFI 145 +L F AL + K + E+ F KS IE+++ ++ R ++ +K+ R + I Sbjct: 117 KLALFSALARKGKGAIPEIASFIGSKSRIEVQQYIMGLEQALRVHHITEKKFRAITFSDI 176 Query: 146 ELGFDQCDECDLCSEKA 162 + DEC L EKA Sbjct: 177 HAAAEISDECCLALEKA 193 >gi|123392888|ref|XP_001300313.1| hypothetical protein [Trichomonas vaginalis G3] gi|121881331|gb|EAX87383.1| hypothetical protein TVAG_452940 [Trichomonas vaginalis G3] Length = 1490 Score = 37.0 bits (84), Expect = 1.4, Method: Composition-based stats. Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 22/98 (22%) Query: 87 LKYFKALTK------YYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIREL 140 L++ K LTK + K V+ L +F + S +++ ++++ V ++ +NE EI+ Sbjct: 924 LEFDKNLTKLISKEFFVKMVLDSLNKFCIKNSVVKLSKLSEIVYKYSKIFNENLTEIKSA 983 Query: 141 IEEFIELGFDQCDECDLCSEKADVIQKKRIAFEMVERE 178 EFIE I+K R+A E ERE Sbjct: 984 ATEFIE----------------QFIRKSRVASESDERE 1005 >gi|326469784|gb|EGD93793.1| hypothetical protein TESG_01326 [Trichophyton tonsurans CBS 112818] Length = 599 Score = 37.0 bits (84), Expect = 1.4, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 4/112 (3%) Query: 55 RDKVYGAYDKVSSHKK--SPKELSKAFYIDFRTE--LKYFKALTKYYKSVVAELREFGLG 110 RD + A D+++ +P S+ + + TE L F AL + K ++E+ F Sbjct: 82 RDLLAQATDEIAQEPNHGAPIHSSQFGIVRWSTEEKLALFSALARKGKGAISEIASFIGS 141 Query: 111 KSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKA 162 KS IE+++ ++ R ++ +K+ R + I + DEC L E A Sbjct: 142 KSRIEVQQYIMGLEQALREHHITEKKFRGITFSDIHAAAEISDECCLALENA 193 >gi|329764872|ref|ZP_08256463.1| SMC domain-containing protein [Candidatus Nitrosoarchaeum limnia SFB1] gi|329138658|gb|EGG42903.1| SMC domain-containing protein [Candidatus Nitrosoarchaeum limnia SFB1] Length = 1152 Score = 36.6 bits (83), Expect = 1.7, Method: Composition-based stats. Identities = 37/153 (24%), Positives = 76/153 (49%), Gaps = 26/153 (16%) Query: 25 QIRSKESVINKEFVTKVEELYEKAQK-------AHKKRDKVYGAYDKVSSHKKSPKEL-S 76 ++ +K+S + K + T + E +K + ++K+ ++ G YD++SS +EL + Sbjct: 777 RVNAKKSDLEKLYTTIMNEYRDKDSQLTAIQVEENRKQSQIKGLYDEISSLNSQHQELEN 836 Query: 77 KAFYIDFRTELKYFKALTKYYK---------SVVAELREFG--LGKSAIEIEEITKAVDT 125 K ++ R E K +T K S + +L+E+ L + + +E+TK +++ Sbjct: 837 KIKELEIRKESKTATLVTLREKEQELISTSGSSIGQLKEYDDKLKVLSEKDKELTKEINS 896 Query: 126 LTRAYNEYKKEIRELIEEFIEL-------GFDQ 151 L R + +++R+LIE +L GFD+ Sbjct: 897 LERQSDSLTRDLRDLIENETKLQTLLTSFGFDK 929 >gi|308457870|ref|XP_003091296.1| CRE-KLP-12 protein [Caenorhabditis remanei] gi|308257401|gb|EFP01354.1| CRE-KLP-12 protein [Caenorhabditis remanei] Length = 937 Score = 36.6 bits (83), Expect = 1.8, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 9/111 (8%) Query: 33 INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYI---DFRTELKY 89 I + + K+ EL + + +RD+V K + + K + + + + D R ELK Sbjct: 625 IRETYENKLRELMARIKDTETERDRVLNEGGKRGGNNEQMKAIKQEYELKISDLRKELKK 684 Query: 90 FKALTKYYKSVVA----ELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKE 136 +AL K + V+A EL+E KS E+ ++ KA L + NE KK+ Sbjct: 685 IEALDKEHLKVIAKSQRELQEKTRLKS--EVVDLKKAKVELIKKMNEDKKK 733 >gi|114595515|ref|XP_001170168.1| PREDICTED: centromere protein E isoform 1 [Pan troglodytes] gi|114595517|ref|XP_001170275.1| PREDICTED: centromere-associated protein E isoform 2 [Pan troglodytes] Length = 2701 Score = 36.2 bits (82), Expect = 2.0, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 65/123 (52%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N K+ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEMENLNLAQKLHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETADKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|114595519|ref|XP_517376.2| PREDICTED: centromere protein E isoform 5 [Pan troglodytes] gi|114595521|ref|XP_001170294.1| PREDICTED: centromere protein E isoform 3 [Pan troglodytes] Length = 2665 Score = 36.2 bits (82), Expect = 2.0, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 65/123 (52%), Gaps = 7/123 (5%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKEL---SKAFYIDFR 84 SK + N K+ E E+ + K+RD + + + + KE +KA ++ + Sbjct: 1864 SKLEMENLNLAQKLHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQ 1923 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 ELK + L+K +K +LRE + + I+I +I K +D ++ +E +K+I+EL ++ Sbjct: 1924 QELKTARMLSKEHKETADKLRE-KISEKTIQISDIQKDLD---KSKDELQKKIQELQKKE 1979 Query: 145 IEL 147 ++L Sbjct: 1980 LQL 1982 >gi|283769219|ref|ZP_06342123.1| conserved domain protein [Bulleidia extructa W1219] gi|283104195|gb|EFC05574.1| conserved domain protein [Bulleidia extructa W1219] Length = 306 Score = 36.2 bits (82), Expect = 2.1, Method: Composition-based stats. Identities = 33/136 (24%), Positives = 59/136 (43%), Gaps = 12/136 (8%) Query: 4 SVSASNNDTSKIPDAKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYD 63 S+S + + D + + I+ E I+ +L + A+K K +D Y AY Sbjct: 155 SLSELKENIVTLRDKNYKTTRTIKDTEKKIDDRV-----QLIDHAEKYLKHKD-TYKAYT 208 Query: 64 KVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAV 123 K+ +K+ FY + E+ F++ KY K + E + + K EI + K Sbjct: 209 KLKKNKQD------TFYNEHTAEIILFESAKKYLKEHLGESKTINISKWKSEIGTLRKEK 262 Query: 124 DTLTRAYNEYKKEIRE 139 DTL + +KE+ + Sbjct: 263 DTLYSQITDIRKEVEQ 278 >gi|194671953|ref|XP_612396.4| PREDICTED: A-kinase anchor protein 11 [Bos taurus] gi|297480991|ref|XP_002691818.1| PREDICTED: A kinase (PRKA) anchor protein 11 [Bos taurus] gi|296481877|gb|DAA23992.1| A kinase (PRKA) anchor protein 11 [Bos taurus] Length = 1905 Score = 36.2 bits (82), Expect = 2.2, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 12/66 (18%) Query: 91 KALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIELGFD 150 +A+ +Y VV + E GLG + + E TKAVD +T Y +++ L+ + Sbjct: 1542 QAIEQYANKVVGDTLELGLGSATFHVSEATKAVDRIT-----YAEKLSPLVGQ------- 1589 Query: 151 QCDECD 156 C CD Sbjct: 1590 ACRSCD 1595 >gi|153954813|ref|YP_001395578.1| transcriptional regulator [Clostridium kluyveri DSM 555] gi|146347671|gb|EDK34207.1| Transcriptional regulator [Clostridium kluyveri DSM 555] Length = 281 Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats. Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 8/72 (11%) Query: 75 LSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEE------ITKAVDTLTR 128 L K Y+D +T +Y+ L + ELR+ LG S EI+E +TK++ LT Sbjct: 32 LFKPTYVDPKTGYRYYSVLQYEKLGTIKELRQ--LGMSLKEIKEYFNNRHVTKSLSILTN 89 Query: 129 AYNEYKKEIREL 140 + E K++I++L Sbjct: 90 KHEELKEKIKDL 101 >gi|302661475|ref|XP_003022405.1| conserved hypothetical protein [Trichophyton verrucosum HKI 0517] gi|291186348|gb|EFE41787.1| conserved hypothetical protein [Trichophyton verrucosum HKI 0517] Length = 593 Score = 36.2 bits (82), Expect = 2.4, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 37/77 (48%) Query: 86 ELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFI 145 +L F AL + K + E+ F KS IE+++ ++ R ++ +K+ R + I Sbjct: 117 KLALFSALARKGKGAIPEIASFIGSKSRIEVQQYIMGLEQALREHHITEKKFRGITFSDI 176 Query: 146 ELGFDQCDECDLCSEKA 162 + DEC L EKA Sbjct: 177 HAAAEISDECCLALEKA 193 >gi|219855273|ref|YP_002472395.1| hypothetical protein CKR_1930 [Clostridium kluyveri NBRC 12016] gi|219568997|dbj|BAH06981.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 294 Score = 36.2 bits (82), Expect = 2.5, Method: Composition-based stats. Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 8/72 (11%) Query: 75 LSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEE------ITKAVDTLTR 128 L K Y+D +T +Y+ L + ELR+ LG S EI+E +TK++ LT Sbjct: 45 LFKPTYVDPKTGYRYYSVLQYEKLGTIKELRQ--LGMSLKEIKEYFNNRHVTKSLSILTN 102 Query: 129 AYNEYKKEIREL 140 + E K++I++L Sbjct: 103 KHEELKEKIKDL 114 >gi|146416945|ref|XP_001484442.1| hypothetical protein PGUG_03823 [Meyerozyma guilliermondii ATCC 6260] Length = 797 Score = 35.8 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 30/104 (28%), Positives = 48/104 (46%), Gaps = 8/104 (7%) Query: 88 KYFKALTKYYKSVVAELREFGLGKSAIEIE--EITKAVDTLTRAYNEYKKEIRELIEE-- 143 K +A K + +++ + K +E E E+ + +D+L YNE K E+ L+EE Sbjct: 104 KQLEASEKSVDAAESKIEMLSVDKEVLETENEELRQELDSLHVKYNELKLELEALMEESE 163 Query: 144 ----FIELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFAEKL 183 +L Q D DL E+ +I K A E + +EKL Sbjct: 164 LNRQLEQLVSAQVDVSDLSEEEIKLILVKNKALEQKLEDNSEKL 207 >gi|290968471|ref|ZP_06560010.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781467|gb|EFD94056.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 862 Score = 35.8 bits (81), Expect = 2.7, Method: Composition-based stats. Identities = 30/120 (25%), Positives = 57/120 (47%), Gaps = 15/120 (12%) Query: 28 SKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTEL 87 +K++ NK++ K+E L ++ + K KVYG DK + + + +F EL Sbjct: 458 NKQTSYNKDY-NKIEYLVKQVNVTYSKVKKVYGEIDK-------NQFILSDYTSEFNGEL 509 Query: 88 KYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIEL 147 F + K+++ + + EIE I K ++L +E+ K+++ L EEF + Sbjct: 510 --FGNINDILKNILTAINNI-----SAEIETIAKNENSLRTIKDEFDKDMKSLKEEFASI 562 >gi|332799906|ref|YP_004461405.1| UvrD/REP helicase [Tepidanaerobacter sp. Re1] gi|332697641|gb|AEE92098.1| UvrD/REP helicase [Tepidanaerobacter sp. Re1] Length = 970 Score = 35.8 bits (81), Expect = 2.7, Method: Composition-based stats. Identities = 19/53 (35%), Positives = 34/53 (64%), Gaps = 2/53 (3%) Query: 114 IEIEEITKAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKADVIQ 166 +E+E + K +D L R Y+E +RE +EE + G++Q + D+ SE+ DV++ Sbjct: 536 LEVENVFKFLD-LCRDYDEQGYTLREFLEELSDFGWEQQEAIDV-SEEEDVVK 586 >gi|148270224|ref|YP_001244684.1| CRISPR-associated RAMP Csm4 family protein [Thermotoga petrophila RKU-1] gi|170288807|ref|YP_001739045.1| CRISPR-associated RAMP Csm4 family protein [Thermotoga sp. RQ2] gi|147735768|gb|ABQ47108.1| CRISPR-associated protein, Csm4 family [Thermotoga petrophila RKU-1] gi|170176310|gb|ACB09362.1| CRISPR-associated RAMP protein, Csm4 family [Thermotoga sp. RQ2] Length = 285 Score = 35.8 bits (81), Expect = 3.3, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 69/151 (45%), Gaps = 17/151 (11%) Query: 2 GYSVSASNNDTSKIP----DAKFGSFLQIRSKESVINKEFVTKVEE---LYEKAQKAHKK 54 G+ V N S +P D +G+ + K S ++F K++ L+ + + Sbjct: 8 GFRVGRGNETDSTLPTIHSDTIYGAVVYHAFKWSEKAEDFARKLKVSSLLFVRGDRLLVP 67 Query: 55 RDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAI 114 + +Y Y ++SPKEL KA Y+ F +L ++++ K +V E + + Sbjct: 68 KPFLYNTYSLEEDDEESPKELKKASYV-FLDKLANYQSIKKASGAVCKE--------NPV 118 Query: 115 EIEEITK-AVDTLTRAYNEYKKEIRELIEEF 144 EI I + ++D +T + N Y E+ + E F Sbjct: 119 EIVRIPRNSLDRITSSSNLYFVEVAFVKEGF 149 >gi|156093824|ref|XP_001612950.1| hypothetical protein [Plasmodium vivax SaI-1] gi|148801824|gb|EDL43223.1| hypothetical protein, conserved [Plasmodium vivax] Length = 1074 Score = 35.4 bits (80), Expect = 3.4, Method: Composition-based stats. Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 12/126 (9%) Query: 40 KVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKY--FKALTKYY 97 K+ ++ EK + K + YDK H P E+ + +F+T KY K T YY Sbjct: 847 KMAKIVEKYAEEIPKDSERSLRYDK-EEHIDDPDEMDDLLFGEFKTLEKYGTHKTSTFYY 905 Query: 98 KSVVAE--LREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRE-------LIEEFIELG 148 + + LR+F + E+EE+ + + L+ + Y+ E + L+E+F+EL Sbjct: 906 EMTCFDERLRDFEINTKLKEMEEVPEKWELLSLYWQSYRNERHKYLAVKKYLLEKFLELK 965 Query: 149 FDQCDE 154 +Q E Sbjct: 966 TNQSTE 971 >gi|290994729|ref|XP_002679984.1| predicted protein [Naegleria gruberi] gi|284093603|gb|EFC47240.1| predicted protein [Naegleria gruberi] Length = 1593 Score = 35.4 bits (80), Expect = 3.6, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 73/169 (43%), Gaps = 17/169 (10%) Query: 33 INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKA 92 INK F+ ++ L+E A++ + ++ + K ++++ R + K Sbjct: 425 INKSFIVHLDSLFEVARQVFSTCVERGVIIQRIHIYFKEFFDITRTLLQGERKKSTELKK 484 Query: 93 LTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIREL------------ 140 + Y+ VV E+ + L K E+E + T+ E+KK+I ++ Sbjct: 485 VINQYEKVVKEVSDEKL-KMKQELETLLTKSATIVSENEEFKKKISQIPQYEVVMNLIQN 543 Query: 141 ---IEEFI-ELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFAEKLEG 185 EEF+ + G D DE + +K V KKR+A + + A+ + G Sbjct: 544 KKKWEEFVNDGGEDLFDEDGVGGDKKGVTPKKRMAKQRSQNGEADSVSG 592 >gi|190347455|gb|EDK39725.2| hypothetical protein PGUG_03823 [Meyerozyma guilliermondii ATCC 6260] Length = 797 Score = 35.4 bits (80), Expect = 3.7, Method: Composition-based stats. Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 8/104 (7%) Query: 88 KYFKALTKYYKSVVAELREFGLGKSAIEIE--EITKAVDTLTRAYNEYKKEIRELIEE-- 143 K +A K + +++ + K +E E E+ + +D+L YNE K E+ L+EE Sbjct: 104 KQLEASEKSVDAAESKIEMLSVDKEVLETENEELRQELDSLHVKYNELKSELEALMEESE 163 Query: 144 ----FIELGFDQCDECDLCSEKADVIQKKRIAFEMVEREFAEKL 183 +L Q D DL E+ I K A E + +EKL Sbjct: 164 LNRQLEQLVSAQVDVSDLSEEEIKSILVKNKALEQKLEDNSEKL 207 >gi|331703101|ref|YP_004399788.1| hypothetical protein MLC_0810 [Mycoplasma mycoides subsp. capri LC str. 95010] gi|328801656|emb|CBW53809.1| Hypothetical protein, predicted lipoprotein [Mycoplasma mycoides subsp. capri LC str. 95010] Length = 563 Score = 35.4 bits (80), Expect = 3.8, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 59/123 (47%), Gaps = 9/123 (7%) Query: 25 QIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFR 84 +IR E+ +NK + +E+L + KA K R+ + ++ K +E+ K ++ Sbjct: 387 EIRKIEADLNKLKQSTIEKLTKDNNKAEKARNDLPSLESSLNKMKTEKEEIDKHLHLIND 446 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEF 144 K K LT+Y K+V G EIE+ TK+ T+ EY+ ++ +L+ E Sbjct: 447 QVSKDKKYLTEYKKNV---------GLLEEEIEQATKSESYWTKTLKEYEADLNKLLAEK 497 Query: 145 IEL 147 E+ Sbjct: 498 SEI 500 >gi|157871447|ref|XP_001684273.1| hypothetical protein [Leishmania major strain Friedlin] gi|68127341|emb|CAJ05669.1| conserved hypothetical protein [Leishmania major strain Friedlin] Length = 827 Score = 35.4 bits (80), Expect = 3.9, Method: Composition-based stats. Identities = 43/180 (23%), Positives = 71/180 (39%), Gaps = 19/180 (10%) Query: 17 DAKFGSFLQIRSKESV-----INKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKS 71 D F F RS+E + N +VEE ++A+K HK R + + + Sbjct: 130 DLDFARFELTRSQEEMKLLRASNHNLKNEVEEYVQRAEKEHKARMSLEAKMSAEGATRTE 189 Query: 72 PKELSKAFYIDFRTE-LKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAY 130 E K + R E K +T+ + EF ++ EE+ A + L R Sbjct: 190 EIEFLKKTIDELRAENAKRLDYMTQQQED------EFQARLQSMR-EELAGAAEELARLT 242 Query: 131 NE--YKKEIRELIEEFIELGFDQCDE----CDLCSEKADVIQKKRIAFEMVEREFAEKLE 184 E ++ REL E + ++ D C + D +QK+ A E R+F ++ E Sbjct: 243 AEAAQAQKARELSESKLTAALTDLEKSRAVVDDCKARIDALQKECHALEQQHRDFVQQQE 302 >gi|154250244|ref|YP_001411069.1| exodeoxyribonuclease VII [Fervidobacterium nodosum Rt17-B1] gi|154154180|gb|ABS61412.1| Exodeoxyribonuclease VII [Fervidobacterium nodosum Rt17-B1] Length = 495 Score = 35.4 bits (80), Expect = 4.0, Method: Composition-based stats. Identities = 32/138 (23%), Positives = 59/138 (42%), Gaps = 17/138 (12%) Query: 13 SKIPDAKFGSFLQI-RSKESVINKEFVTKVEELYEK-AQKAHKKRDKVYGAYDKVSSHKK 70 +K+ D + + +I R K + +E + E LY++ + K ++K + + K Sbjct: 324 AKLSDTNYKTVKEIFRGKIQSVEEELIDSAEVLYKRFSLKEYEKMISMDFIKNLSEKMKY 383 Query: 71 SPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAY 130 E SK +D+ KY K + E + ++E TK DTL + Y Sbjct: 384 DISEFSKKLMLDY-----------KYLKDTL----ELKINNINQQVERYTKLSDTLEKEY 428 Query: 131 NEYKKEIRELIEEFIELG 148 + + E EEF+++G Sbjct: 429 SNLSNLVEEKKEEFVKIG 446 >gi|28211476|ref|NP_782420.1| signaling protein [Clostridium tetani E88] gi|28203917|gb|AAO36357.1| signaling protein [Clostridium tetani E88] Length = 449 Score = 35.4 bits (80), Expect = 4.1, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 4/55 (7%) Query: 83 FRTELKYFKALTKYYKSVVAELREFGLGKSAIEIE----EITKAVDTLTRAYNEY 133 F E +Y+K + KYY +++ E +E + K+ +E E E D LT YN Y Sbjct: 245 FVFEKRYYKIINKYYNTIIREKQELVMEKTKVENEYKRLESIAITDPLTNLYNRY 299 >gi|225848205|ref|YP_002728368.1| hypothetical protein SULAZ_0374 [Sulfurihydrogenibium azorense Az-Fu1] gi|225644704|gb|ACN99754.1| hypothetical protein SULAZ_0374 [Sulfurihydrogenibium azorense Az-Fu1] Length = 382 Score = 35.4 bits (80), Expect = 4.1, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 54/117 (46%) Query: 24 LQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDF 83 ++I KE+ + +EE ++ + A+K ++Y + + KS + AF + Sbjct: 199 IEIILKEAEKQSPVLKSLEETLKRDRFAYKLAKEIYYPDVSIGTTYKSKERFQDAFSVSV 258 Query: 84 RTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIREL 140 L + T + + +V E + F + + +I+ + LT YNEYK +++L Sbjct: 259 NLNLNFPFWRTLHQEQIVLERKLFVIAQEEQKIQTLNNLKTQLTTFYNEYKYNLQKL 315 >gi|327303794|ref|XP_003236589.1| hypothetical protein TERG_03634 [Trichophyton rubrum CBS 118892] gi|326461931|gb|EGD87384.1| hypothetical protein TERG_03634 [Trichophyton rubrum CBS 118892] Length = 588 Score = 35.1 bits (79), Expect = 4.5, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 4/112 (3%) Query: 55 RDKVYGAYDKVSSHKK--SPKELSKAFYIDFRTE--LKYFKALTKYYKSVVAELREFGLG 110 RD + A D+++ +P S+ + + TE L F AL + K ++E+ Sbjct: 82 RDLLIQATDEIAQEPNHGAPIHSSQFGIVRWSTEEKLALFSALARKGKGAISEIASSIGS 141 Query: 111 KSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEKA 162 KS IE+++ ++ R ++ +K+ R + I + DEC L EKA Sbjct: 142 KSRIEVQQYIMGLEQALREHHLTEKKFRCITFSDIHAAAEISDECCLALEKA 193 >gi|253741491|gb|EES98360.1| Kinesin-2 [Giardia intestinalis ATCC 50581] Length = 718 Score = 35.1 bits (79), Expect = 4.7, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 77/174 (44%), Gaps = 32/174 (18%) Query: 22 SFLQIRSKESVI-NKEFVTKVEEL----YEKAQKAHKKRDKVYGAYDKVSSHKKSPKELS 76 + ++S+ S I N FV K E+ +KAQ+ R + + A + HK++ + Sbjct: 470 TLTDLKSQLSAIKNSVFVVKQLEIKNKVLDKAQQKLTTRQEKHNALQQALQHKQNEHQTK 529 Query: 77 KAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAY----NE 132 A + +L+ FKA + K+ EI E+ + +D +T + E Sbjct: 530 TAEVLSAVEKLERFKA---------------DIAKTEAEINEVNQEIDDITEQHALSIEE 574 Query: 133 YKKEIRE------LIEEFIELGFDQCD--ECDLCSEKADVIQKKRIAFEMVERE 178 ++E++E L++ I+ QC+ + + +E + K I+ E V++E Sbjct: 575 ERRELKEVDKRSALLDAIIQAFIPQCEVTKAEALAEYDEEAMKWVISPEKVDKE 628 >gi|147919772|ref|YP_686482.1| hypothetical protein RCIX1999 [uncultured methanogenic archaeon RC-I] gi|110621878|emb|CAJ37156.1| hypothetical protein RCIX1999 [uncultured methanogenic archaeon RC-I] Length = 1632 Score = 35.1 bits (79), Expect = 5.6, Method: Composition-based stats. Identities = 38/138 (27%), Positives = 60/138 (43%), Gaps = 14/138 (10%) Query: 26 IRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRT 85 +R+ S I + T++ + E + D +K+ + LSKA +D+R+ Sbjct: 795 VRADVSSIREMTRTEMADFKEAFGSDIRAFDGYRDDLNKLRDSMEQVNSLSKAV-MDYRS 853 Query: 86 ELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRELIEEFI 145 ELK K + + + V + REF G+ EEI + D R Y K IRE+I Sbjct: 854 ELKSIKGMVEKFAEDVKQAREFAKGR-----EEILRLKDEF-RLYTNDIKTIREIIG--- 904 Query: 146 ELGFDQCDECDLCSEKAD 163 DQ DE + + D Sbjct: 905 ----DQKDELNAVKKTLD 918 >gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti] gi|108873380|gb|EAT37605.1| set domain protein [Aedes aegypti] Length = 1480 Score = 34.7 bits (78), Expect = 5.8, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 57/118 (48%), Gaps = 4/118 (3%) Query: 18 AKFGSFLQIRSKESVINKEFVTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSK 77 AK S ++ R+K SV EE YE +K + R AY+ + S + LSK Sbjct: 587 AKAKSTVRRRTK-SVSLGYHSNSFEESYESGRKRQRSRTPESPAYEALPLSSNSTESLSK 645 Query: 78 AFYIDFRTELKYFKALTKYYKSVV--AELREFGLGKSAIEIEEITKAVDTLTRAYNEY 133 ++ + + +++Y++S+ +++ + G S+ E E+IT+ ++ YN + Sbjct: 646 RIKLETNEQDLFRNTISRYFQSMAEGSDVVDTASGTSS-ECEDITEMLEFDKEIYNNF 702 >gi|255087754|ref|XP_002505800.1| predicted protein [Micromonas sp. RCC299] gi|226521070|gb|ACO67058.1| predicted protein [Micromonas sp. RCC299] Length = 1616 Score = 34.7 bits (78), Expect = 6.1, Method: Composition-based stats. Identities = 25/99 (25%), Positives = 48/99 (48%), Gaps = 6/99 (6%) Query: 63 DKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKA 122 D+V+++ + +L +A ++ E+ ++K + +AEL E GLG+ +++ Sbjct: 104 DRVAAYSRK-MQLIRAGDLNAAAEISTV-GVSKQVREAIAELEEDGLGRDVADVDTTADR 161 Query: 123 VDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSEK 161 V T + E R+LI F+ +C+ C CS K Sbjct: 162 VVWTTNSMVES----RDLIRNFLASVPAKCENCGCCSPK 196 >gi|160879186|ref|YP_001558154.1| methyl-accepting chemotaxis sensory transducer [Clostridium phytofermentans ISDg] gi|160427852|gb|ABX41415.1| methyl-accepting chemotaxis sensory transducer [Clostridium phytofermentans ISDg] Length = 489 Score = 34.7 bits (78), Expect = 6.8, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 55/109 (50%), Gaps = 4/109 (3%) Query: 38 VTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYY 97 V+ V +++ Q +++ A + SS + E+++ D + +KY K++++ Sbjct: 281 VSSVGDIHNSMQGIKLAANEISSAMETSSSDAQRLTEMTQEVSKDAQESVKYSKSISEID 340 Query: 98 KSVVAELREF--GL--GKSAIEIEEITKAVDTLTRAYNEYKKEIRELIE 142 + E+RE GL G A+ EE+ ++ +A++E+ ++ +++ Sbjct: 341 DRLSHEIREMFEGLSTGNQAVTNEELQLVIEKAVKAHSEWMVNLKNIVD 389 >gi|253581922|ref|ZP_04859146.1| SWF/SNF family helicase [Fusobacterium varium ATCC 27725] gi|251836271|gb|EES64808.1| SWF/SNF family helicase [Fusobacterium varium ATCC 27725] Length = 926 Score = 34.7 bits (78), Expect = 7.3, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 11/87 (12%) Query: 69 KKSPKELSKAFYIDFRTELK--YFKALTKYYKSVVAELREFGLGKSAI-------EIEEI 119 K P ++ K +I+ E K Y + + YYK V +++RE G+GK+ E+ +I Sbjct: 686 KDLPDKIEKTMFIEMNVEQKKLYEERRSYYYKMVNSQIRENGIGKTQFFILQALNELRQI 745 Query: 120 TKAVDTLTRAYNEYKKEIRELIEEFIE 146 T + + K+E+ L+ +E Sbjct: 746 TSCPEAKSNGVTSSKREV--LVNNIVE 770 >gi|15644552|ref|NP_229605.1| hypothetical protein TM1808 [Thermotoga maritima MSB8] gi|4982389|gb|AAD36871.1|AE001818_6 conserved hypothetical protein [Thermotoga maritima MSB8] Length = 292 Score = 34.3 bits (77), Expect = 7.4, Method: Composition-based stats. Identities = 37/156 (23%), Positives = 71/156 (45%), Gaps = 22/156 (14%) Query: 2 GYSVSASNNDTSKIP----DAKFGSFLQIRSKESVINKEFVTKVEE---LYEKAQKAHKK 54 G+ V N S +P D +G+ + K S ++F K++ L+ + + Sbjct: 15 GFRVGRGNETDSTLPTIHSDTIYGAVVYHAFKWSEKAEDFAKKLKVSSLLFVRGDRLLVP 74 Query: 55 RDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYYKSVVAELREFGLGKSAI 114 + +Y Y ++SPKEL KA Y+ F +L ++++ + +V E + + Sbjct: 75 KPFLYNTYSLEEDDEESPKELKKASYV-FLDKLADYRSIKEASGAVCKE--------NPV 125 Query: 115 EIEEITK-AVDTLTRAYNEYKKEIRELIEEFIELGF 149 EI I + ++D +T + N Y E+ F++ GF Sbjct: 126 EIVRIPRNSLDRITSSSNLYFVEV-----AFVKEGF 156 >gi|222475702|ref|YP_002564223.1| hypothetical protein Hlac_2766 [Halorubrum lacusprofundi ATCC 49239] gi|222454073|gb|ACM58337.1| hypothetical protein Hlac_2766 [Halorubrum lacusprofundi ATCC 49239] Length = 782 Score = 34.3 bits (77), Expect = 8.3, Method: Composition-based stats. Identities = 17/42 (40%), Positives = 24/42 (57%) Query: 119 ITKAVDTLTRAYNEYKKEIRELIEEFIELGFDQCDECDLCSE 160 +T AVDT R +++ +E R L+ EFIE+ EC C E Sbjct: 330 LTGAVDTYIRYFDDLPEEERSLLNEFIEVSEIDTFECYRCHE 371 >gi|317063195|ref|ZP_07927680.1| SWF/SNF family helicase [Fusobacterium ulcerans ATCC 49185] gi|313688871|gb|EFS25706.1| SWF/SNF family helicase [Fusobacterium ulcerans ATCC 49185] Length = 926 Score = 34.3 bits (77), Expect = 8.7, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 11/87 (12%) Query: 69 KKSPKELSKAFYIDFRTELK--YFKALTKYYKSVVAELREFGLGKSAI-------EIEEI 119 K P ++ K +I+ E K Y + + YYK V +++RE G+GK+ E+ +I Sbjct: 686 KDLPDKIEKTMFIEMNAEQKKLYEERRSYYYKMVNSQIRENGIGKTQFFILQALNELRQI 745 Query: 120 TKAVDTLTRAYNEYKKEIRELIEEFIE 146 T + + K+E+ L+ +E Sbjct: 746 TSCPEAKSNGVTSSKREV--LVNNIVE 770 >gi|257468947|ref|ZP_05633041.1| SWF/SNF family helicase [Fusobacterium ulcerans ATCC 49185] Length = 915 Score = 34.3 bits (77), Expect = 9.1, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 11/87 (12%) Query: 69 KKSPKELSKAFYIDFRTELK--YFKALTKYYKSVVAELREFGLGKSAI-------EIEEI 119 K P ++ K +I+ E K Y + + YYK V +++RE G+GK+ E+ +I Sbjct: 675 KDLPDKIEKTMFIEMNAEQKKLYEERRSYYYKMVNSQIRENGIGKTQFFILQALNELRQI 734 Query: 120 TKAVDTLTRAYNEYKKEIRELIEEFIE 146 T + + K+E+ L+ +E Sbjct: 735 TSCPEAKSNGVTSSKREV--LVNNIVE 759 >gi|149194280|ref|ZP_01871377.1| RNA polymerase sigma factor RpoD [Caminibacter mediatlanticus TB-2] gi|149135455|gb|EDM23934.1| RNA polymerase sigma factor RpoD [Caminibacter mediatlanticus TB-2] Length = 604 Score = 34.3 bits (77), Expect = 9.1, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 54/115 (46%), Gaps = 15/115 (13%) Query: 28 SKESVINKEFV---TKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFR 84 +KE NK+ ++ EL++ +KA K+ +KV YD++ P++ Sbjct: 178 TKEGKKNKKISLREKRINELFKNLEKAKKEWEKVIDEYDRILKDTSMPED---------- 227 Query: 85 TELKYFKALTKYYKSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKKEIRE 139 L LT K V L E LG ++ I EI K V+T + +++KE+++ Sbjct: 228 ERLHKILQLTFKKKKVKDALNE--LGYTSKLINEIVKTVETALKGDKDFEKELKK 280 >gi|323704403|ref|ZP_08115982.1| chromosome segregation ATPase-like protein [Thermoanaerobacterium xylanolyticum LX-11] gi|323536469|gb|EGB26241.1| chromosome segregation ATPase-like protein [Thermoanaerobacterium xylanolyticum LX-11] Length = 1389 Score = 33.9 bits (76), Expect = 9.8, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 45/98 (45%), Gaps = 22/98 (22%) Query: 38 VTKVEELYEKAQKAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYY 97 +T ++++Y+ A+ + +++ +DK S+ +K + AFY+ Sbjct: 1002 LTDIDDMYKAAKTIQSQYKEMFAKFDKDSATEK----IRNAFYV---------------- 1041 Query: 98 KSVVAELREFGLGKSAIEIEEITKAVDTLTRAYNEYKK 135 + EL E+GL +EIE+ +DT Y + ++ Sbjct: 1042 --CINELVEYGLSLDVVEIEQTDTDIDTYQEIYKDLRR 1077 >gi|293401347|ref|ZP_06645490.1| putative phage minor capsid protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304985|gb|EFE46231.1| putative phage minor capsid protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 594 Score = 33.9 bits (76), Expect = 9.8, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 7/74 (9%) Query: 50 KAHKKRDKVYGAYDKVSSHKKSPKELSKAFYIDFRTELKYFKALTKYY------KSVVAE 103 K +R+K+YG K+SS ++ E S ID RT L+ KALTK Y K +V+E Sbjct: 370 KRQSRREKIYGTDSKISSKLQNFDE-SSLKDIDERTILEVDKALTKIYEDYPHMKGIVSE 428 Query: 104 LREFGLGKSAIEIE 117 ++ G + E++ Sbjct: 429 VKLVEKGTAVAELD 442 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.315 0.132 0.356 Lambda K H 0.267 0.0414 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,344,299,256 Number of Sequences: 14124377 Number of extensions: 43178059 Number of successful extensions: 174080 Number of sequences better than 10.0: 144 Number of HSP's better than 10.0 without gapping: 15 Number of HSP's successfully gapped in prelim test: 129 Number of HSP's that attempted gapping in prelim test: 173984 Number of HSP's gapped (non-prelim): 191 length of query: 190 length of database: 4,842,793,630 effective HSP length: 131 effective length of query: 59 effective length of database: 2,992,500,243 effective search space: 176557514337 effective search space used: 176557514337 T: 11 A: 40 X1: 15 ( 6.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 77 (34.3 bits)