BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781226|ref|YP_003065639.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] (129 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781226|ref|YP_003065639.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] gi|254040903|gb|ACT57699.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] Length = 129 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 129/129 (100%), Positives = 129/129 (100%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY Sbjct: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYKRG Sbjct: 121 IEELIYKRG 129 >gi|317120691|gb|ADV02514.1| hypothetical protein SC1_gp170 [Liberibacter phage SC1] gi|317120835|gb|ADV02656.1| hypothetical protein SC1_gp170 [Candidatus Liberibacter asiaticus] Length = 129 Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats. Identities = 123/129 (95%), Positives = 126/129 (97%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 M RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI+E+ HQYYLKNKDKY Sbjct: 1 MERKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIKENNHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYK+ Sbjct: 121 IEELIYKKA 129 >gi|317120733|gb|ADV02555.1| hypothetical protein SC2_gp170 [Liberibacter phage SC2] gi|317120794|gb|ADV02615.1| hypothetical protein SC2_gp170 [Candidatus Liberibacter asiaticus] Length = 129 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 123/129 (95%), Positives = 125/129 (96%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 M RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI+E HQYYLKNKDKY Sbjct: 1 MERKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIQEYSHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYK+ Sbjct: 121 IEELIYKKA 129 >gi|34365522|tpg|DAA01288.1| TPA_exp: replicase/helicase/endonuclease [Danio rerio] Length = 3007 Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 58/121 (47%), Gaps = 18/121 (14%) Query: 19 KRRYYLK--NRDKILERRRRRYLKN---KDKIRESYHQYYLKN---KDKYREYKRRYYLK 70 KR+Y +++++ +R+Y +N K++++ + Y +N K++ + +R Y + Sbjct: 1063 KRKYDQNALHKERVKNMSKRKYEQNALHKERVKNMSKRKYEQNALHKERVKNMSKRKYEQ 1122 Query: 71 N---RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 N +++++ ++ Y + + + M K E AL +E K K + L Y+ Sbjct: 1123 NALHKERVKNMSKMKYEQNSLHKERV---KNMSKMKYEQNALHKE----KVKCMSRLKYQ 1175 Query: 128 R 128 + Sbjct: 1176 Q 1176 Score = 64.7 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 58/123 (47%), Gaps = 16/123 (13%) Query: 19 KRRYYLK--NRDKILERRRRRYLKN---KDKIRESYHQYYLKN---KDKYREYKRRYYLK 70 KR+Y +++++ +R+Y +N K++++ Y +N K++ + + Y + Sbjct: 1099 KRKYEQNALHKERVKNMSKRKYEQNALHKERVKNMSKMKYEQNSLHKERVKNMSKMKYEQ 1158 Query: 71 N---RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE--IARLKAKPIEELI 125 N ++K++ +R Y++ + + M K ++ L R+ IA K K E Sbjct: 1159 NALHKEKVKCMSRLKYQQNMLHKKTV---KEMSKRKYKLNQLHRQNVIAINKKKYHENAE 1215 Query: 126 YKR 128 +K+ Sbjct: 1216 HKK 1218 Score = 60.4 bits (145), Expect = 8e-08, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 39/94 (41%), Gaps = 13/94 (13%) Query: 41 NKDKIRESYHQYYLKN----KDKYREYKRRYYLK--NRDKMREKARQSYRKLYSKDSWIA 94 K+K + Y KN K K +Y + +++K++ +++ Y + + Sbjct: 1018 YKEKKNNACKIRYQKNVLYKKKKRNASITKYCVNALHKEKVKNMSKRKYDQNALHKERV- 1076 Query: 95 PEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 + M K E AL +E + K + + Y++ Sbjct: 1077 --KNMSKRKYEQNALHKE----RVKNMSKRKYEQ 1104 Score = 54.3 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 47/112 (41%), Gaps = 14/112 (12%) Query: 19 KRRYYLK--NRDKILERRRRRYLKN---KDKIRESYHQYYLKN---KDKYREYKRRYYLK 70 KR+Y +++++ + +Y +N K++++ Y +N K+K + R Y + Sbjct: 1117 KRKYEQNALHKERVKNMSKMKYEQNSLHKERVKNMSKMKYEQNALHKEKVKCMSRLKYQQ 1176 Query: 71 N---RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 N + ++E +++ Y+ + + K A ++ + K Sbjct: 1177 NMLHKKTVKEMSKRKYKLNQLHRQNVIA---INKKKYHENAEHKKRVKAGMK 1225 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 41/108 (37%), Gaps = 16/108 (14%) Query: 16 REYKRRYYLKN---RDKILERRRRRYLKN---KDKIRESYHQYYLKN---KDKYREYKRR 66 + + Y +N ++++ + +Y +N K+K++ Y +N K +E +R Sbjct: 1131 KNMSKMKYEQNSLHKERVKNMSKMKYEQNALHKEKVKCMSRLKYQQNMLHKKTVKEMSKR 1190 Query: 67 YYLKN---RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 Y N R + ++ Y + + GM + E+ Sbjct: 1191 KYKLNQLHRQNVIAINKKKYHENAEHKKRVKA----GMKHKREQIKEK 1234 >gi|157953109|ref|YP_001498001.1| hypothetical protein NY2A_B805R [Paramecium bursaria Chlorella virus NY2A] gi|155123336|gb|ABT15204.1| hypothetical protein NY2A_B805R [Paramecium bursaria Chlorella virus NY2A] Length = 340 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 31/99 (31%), Positives = 57/99 (57%), Gaps = 2/99 (2%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y+R+Y +N++ ILE+ R+ Y +NK++I+E QY +NK+ E R+YY +N++++ Sbjct: 21 KDYQRQYREENKEHILEKARQYYEENKEEIQEKKRQYREENKEHILEKARQYYEENKEEI 80 Query: 76 REKARQSYRKLYSKDS--WIAPEEPMGMTKAEIEALERE 112 +EK RQ Y K ++ + K + RE Sbjct: 81 QEKNRQYYEDNKDKIRQYREDNKDKILEYKRQYYEDNRE 119 Score = 73.2 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 31/73 (42%), Positives = 44/73 (60%), Gaps = 4/73 (5%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E KR+Y +N++ ILE+ R+ Y +NK++I+E QYY NKDK R+Y N+DK+ Sbjct: 52 EKKRQYREENKEHILEKARQYYEENKEEIQEKNRQYYEDNKDKI----RQYREDNKDKIL 107 Query: 77 EKARQSYRKLYSK 89 E RQ Y K Sbjct: 108 EYKRQYYEDNREK 120 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 22/95 (23%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E R+YY +N+++I E++R+ +NK+ I E QYY +NK++ +E R+YY N+DK+R Sbjct: 37 EKARQYYEENKEEIQEKKRQYREENKEHILEKARQYYEENKEEIQEKNRQYYEDNKDKIR 96 Query: 77 ----------------------EKARQSYRKLYSK 89 EK +Q ++ K Sbjct: 97 QYREDNKDKILEYKRQYYEDNREKIQQYRASVHGK 131 Score = 38.1 bits (87), Expect = 0.42, Method: Composition-based stats. Identities = 13/35 (37%), Positives = 23/35 (65%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESY 49 ++ R+Y N+DKILE +R+ Y N++KI++ Sbjct: 91 NKDKIRQYREDNKDKILEYKRQYYEDNREKIQQYR 125 >gi|157953051|ref|YP_001497943.1| hypothetical protein NY2A_B747L [Paramecium bursaria Chlorella virus NY2A] gi|155123278|gb|ABT15146.1| hypothetical protein NY2A_B747L [Paramecium bursaria Chlorella virus NY2A] Length = 309 Score = 63.9 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 43/75 (57%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 REY+ Y KN++K+ + ++ Y +NK+ +++ + Y +NKD + + Y +N++ Sbjct: 7 KTREYQEEYRTKNKEKLQQNKKDYYQQNKELLKKKRNDRYHQNKDVILQQQNEYCQQNKE 66 Query: 74 KMREKARQSYRKLYS 88 +++ R+ LY+ Sbjct: 67 MIKKNQRKRNDILYT 81 >gi|157953770|ref|YP_001498661.1| hypothetical protein AR158_C580L [Paramecium bursaria Chlorella virus AR158] gi|156068418|gb|ABU44125.1| hypothetical protein AR158_C580L [Paramecium bursaria Chlorella virus AR158] Length = 310 Score = 63.9 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 29/79 (36%), Positives = 46/79 (58%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 ++ R+Y N +KI ER R+R +N +K RE + KN +KYREY+R+Y + N +K Sbjct: 24 KKINNRKYCTNNAEKIKERGRKRRAENPEKFRERDRKRREKNPEKYREYERKYRIANPEK 83 Query: 75 MREKARQSYRKLYSKDSWI 93 +E R+ + +Y K I Sbjct: 84 FKEYERKRHATIYRKYQHI 102 Score = 33.9 bits (76), Expect = 8.3, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 38/97 (39%), Gaps = 8/97 (8%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 +++ R+ N +KI+E + +N +K+RE R+ KN + YR+ Sbjct: 21 KEQKKINNRKYCTNNAEKIKERGRKRRAENPEKFRERDRKRREKN--------PEKYREY 72 Query: 87 YSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 K PE+ + + R+ ++ + Sbjct: 73 ERKYRIANPEKFKEYERKRHATIYRKYQHIRDGALRR 109 >gi|13470657|ref|NP_102226.1| hypothetical protein mll0428 [Mesorhizobium loti MAFF303099] gi|14021399|dbj|BAB48012.1| mll0428 [Mesorhizobium loti MAFF303099] Length = 241 Score = 63.9 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 40/87 (45%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 R+ L + R E +R+ +N +K E RR L++ D+++ + Y + N +K Sbjct: 29 REELLAKGRSPASLEKQRKRRAENPEKTAEHRRLFKLRHPDRVKAGWKAYRVANSEKIAA 88 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSK 89 Y Y ++++ ++ R K K Sbjct: 89 YYATYSVEHKAEILATDRVYRAKNRDK 115 >gi|167044410|gb|ABZ09087.1| hypothetical protein ALOHA_HF4000APKG6D3ctg6g4 [uncultured marine crenarchaeote HF4000_APKG6D3] Length = 308 Score = 61.6 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 34/85 (40%), Positives = 52/85 (61%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 VL+ EER RRE R++ +N +KI E R+ KN +KI E+ +Y +N +K RE + Sbjct: 2 VLSDEERKRKRRELTRKWKKENPEKIREGGRKYRAKNPEKISEAKKKYNRENPEKVREKR 61 Query: 65 RRYYLKNRDKMREKARQSYRKLYSK 89 R+YY +NR+K+REK + + K Sbjct: 62 RKYYEENREKIREKQIKYLEENPEK 86 Score = 60.1 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 53/87 (60%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 +++ EE +E + ++Y +NR+KI ERRR+ Y +N +KI E + Y KN +K E Sbjct: 75 KQIKYLEENPEKIKEARSKHYEENREKINERRRKDYEENHEKINERRRKDYKKNPEKINE 134 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSK 89 +R+Y +N +K+RE R+ Y K + K Sbjct: 135 RRRKYRKENPEKVRESKRKDYEKNHEK 161 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 61/117 (52%), Gaps = 7/117 (5%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 K E RE +R+YY +NR+KI E++ + +N +KI+E+ ++Y +N++K E Sbjct: 46 KKKYNRENPEKVREKRRKYYEENREKIREKQIKYLEENPEKIKEARSKHYEENREKINER 105 Query: 64 KRRYYLKNRDKMREKARQSYRKL-------YSKDSWIAPEEPMGMTKAEIEALEREI 113 +R+ Y +N +K+ E+ R+ Y+K K PE+ + + E +I Sbjct: 106 RRKDYEENHEKINERRRKDYKKNPEKINERRRKYRKENPEKVRESKRKDYEKNHEKI 162 Score = 53.9 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 32/115 (27%), Positives = 62/115 (53%), Gaps = 7/115 (6%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 E K++Y +N +K+ E+RR+ Y +N++KIRE +Y +N +K +E + ++Y +NR+K+ Sbjct: 43 SEAKKKYNRENPEKVREKRRKYYEENREKIREKQIKYLEENPEKIKEARSKHYEENREKI 102 Query: 76 REKARQSYRKLY-------SKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 E+ R+ Y + + KD PE+ + + ++ K K E+ Sbjct: 103 NERRRKDYEENHEKINERRRKDYKKNPEKINERRRKYRKENPEKVRESKRKDYEK 157 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 40/68 (58%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E +R+Y +N +K+ E +R+ Y KN +KI+E ++ +N + + + ++YY KN + Sbjct: 134 ERRRKYRKENPEKVRESKRKDYEKNHEKIKERVKKWQKENPELRKSHLKKYYEKNPVALT 193 Query: 77 EKARQSYR 84 ++ + R Sbjct: 194 KEMTKYRR 201 >gi|255947256|ref|XP_002564395.1| Pc22g03540 [Penicillium chrysogenum Wisconsin 54-1255] gi|211591412|emb|CAP97642.1| Pc22g03540 [Penicillium chrysogenum Wisconsin 54-1255] Length = 669 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 3/102 (2%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 + R+Y KN DKI +RR KN D+I+ ++ KN DK + KR + KN DK+ Sbjct: 415 KAKNRKYQRKNADKIRAKRREYERKNADRIKARLREWRRKNADKIKARKREWRRKNADKI 474 Query: 76 REKARQSYRKLYSK---DSWIAPEEPMGMTKAEIEALEREIA 114 + R+ YR+ + ++ E KA +RE A Sbjct: 475 KANNREYYRENADRIKANNREYQRENADRIKARNREYQRENA 516 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 35/100 (35%), Positives = 48/100 (48%), Gaps = 3/100 (3%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMRE 77 Y RRYY KN KI + R+ KN DKIR +Y KN D+ + R + KN DK++ Sbjct: 402 YSRRYYRKNAGKIKAKNRKYQRKNADKIRAKRREYERKNADRIKARLREWRRKNADKIKA 461 Query: 78 KARQSYRKLYSK---DSWIAPEEPMGMTKAEIEALEREIA 114 + R+ RK K ++ E KA +RE A Sbjct: 462 RKREWRRKNADKIKANNREYYRENADRIKANNREYQRENA 501 Score = 54.7 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 40/81 (49%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 + KR + KN DKI R Y +N D+I+ + +Y +N D+ + R Y +N D++ Sbjct: 460 KARKREWRRKNADKIKANNREYYRENADRIKANNREYQRENADRIKARNREYQRENADRI 519 Query: 76 REKARQSYRKLYSKDSWIAPE 96 + + R+ R+ + E Sbjct: 520 KARNREYQRENADRIKARNRE 540 Score = 51.6 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 24/76 (31%), Positives = 37/76 (48%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 + R Y +N D+I R R KN D+I S ++ KN D+ KR ++ KN D++ Sbjct: 520 KARNREYQRENADRIKARNREYQRKNADRISASIQEWRRKNADRVSASKREWHRKNADRV 579 Query: 76 REKARQSYRKLYSKDS 91 + R+ RK K Sbjct: 580 NARQRERRRKNAGKTK 595 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 50/108 (46%), Gaps = 4/108 (3%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 R +R Y KN D+I R R KN DKI+ ++ KN DK + R YY +N D++ Sbjct: 430 RAKRREYERKNADRIKARLREWRRKNADKIKARKREWRRKNADKIKANNREYYRENADRI 489 Query: 76 ----REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 RE R++ ++ +++ E + E R+KA+ Sbjct: 490 KANNREYQRENADRIKARNREYQRENADRIKARNREYQRENADRIKAR 537 Score = 50.0 bits (118), Expect = 9e-05, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 50/107 (46%), Gaps = 4/107 (3%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 + R + KN DKI R+R KN DKI+ + +YY +N D+ + R Y +N D++ Sbjct: 445 KARLREWRRKNADKIKARKREWRRKNADKIKANNREYYRENADRIKANNREYQRENADRI 504 Query: 76 ----REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 RE R++ ++ +++ E + E + R+ A Sbjct: 505 KARNREYQRENADRIKARNREYQRENADRIKARNREYQRKNADRISA 551 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 3/101 (2%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 + R Y +N D+I R R +N D+I+ +Y KN D+ + + KN D++ Sbjct: 505 KARNREYQRENADRIKARNREYQRENADRIKARNREYQRKNADRISASIQEWRRKNADRV 564 Query: 76 REKARQSYRKLYSK---DSWIAPEEPMGMTKAEIEALEREI 113 R+ +RK + + G TK I+ ER++ Sbjct: 565 SASKREWHRKNADRVNARQRERRRKNAGKTKPRIQERERKM 605 Score = 44.3 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 24/65 (36%), Positives = 34/65 (52%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 +N D+ RR Y KN KI+ +Y KN DK R +R Y KN D+++ + R+ R Sbjct: 394 QNTDREDSYSRRYYRKNAGKIKAKNRKYQRKNADKIRAKRREYERKNADRIKARLREWRR 453 Query: 85 KLYSK 89 K K Sbjct: 454 KNADK 458 >gi|238776850|ref|NP_001154919.1| nipped-B-like protein [Danio rerio] Length = 2856 Score = 57.4 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 29/114 (25%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYH-QYYLKNKDKY 60 GR ++ CR + + + K+R+K ++ K++DK+R+ + K++DK Sbjct: 763 GRDKKRSKDLEKCREKDQDKELEKDREKNQDKE---LEKDRDKVRDKDRDKVREKDRDKV 819 Query: 61 REYKR-RYYLKNRDKMREKAR-QSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 RE R + K+R+K+RE+ R + K K+ E+ + + ERE Sbjct: 820 REKDRDKLREKDREKIRERDRDKGREKDRDKEQVKTREKDQEKERLKDRDKERE 873 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 7/121 (5%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRR-RRYLKNKDKIRESYH-QYYLKNKDKYREYK 64 ++R R + + + K+RDK+ E+ R + K+++KIRE + K++DK + Sbjct: 796 LEKDRDKVRDKDRDKVREKDRDKVREKDRDKLREKDREKIRERDRDKGREKDRDKEQVKT 855 Query: 65 RRY-----YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 R LK+RDK REK R R +E +A + R+K + Sbjct: 856 REKDQEKERLKDRDKEREKVRDKGRDRDRDQEKKRNKELTEDKQAPEQRSRPNSPRVKQE 915 Query: 120 P 120 P Sbjct: 916 P 916 Score = 39.6 bits (91), Expect = 0.16, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 55/135 (40%), Gaps = 12/135 (8%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 GR+ + R R + + + K RDK +E+ R + KD+++E +K++ + Sbjct: 707 GREKELEKGRDKERVKDQEKDQEKGRDKEVEKGR-YKERVKDRVKEQEKVR---DKEQVK 762 Query: 62 EYKRRY-------YLKNRDKMREKARQSYR-KLYSKDSWIAPEEPMGMTKAEIEALEREI 113 ++ K++DK EK R+ + K KD ++ + + RE Sbjct: 763 GRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKDRDKVRDKDRDKVREKDRDKVREK 822 Query: 114 ARLKAKPIEELIYKR 128 R K + + + Sbjct: 823 DRDKLREKDREKIRE 837 >gi|2947224|gb|AAC39306.1| ORF4 [Lactococcus phage phi31] Length = 246 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 27/86 (31%), Positives = 46/86 (53%) Query: 12 MLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 +CR+ + Y ++KILE+++R Y KN DKI+E YY +NK K R+ ++ + N Sbjct: 64 KVCRKAKDKARYEAKKEKILEQKKRYYEKNADKIKERQLGYYNENKGKCRQSEKDWCKNN 123 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEE 97 + R +S Y +S + +E Sbjct: 124 PTRRRMTCAKSRTLKYGSESTLTEKE 149 Score = 42.0 bits (97), Expect = 0.028, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 48/116 (41%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 ++ E + K + + + + + RY K+KI E +YY KN DK +E Sbjct: 40 QEKELDEHNFPKNFKRKGGFEGRCKVCRKAKDKARYEAKKEKILEQKKRYYEKNADKIKE 99 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 + YY +N+ K R+ + + ++ + +E E+E +K+ Sbjct: 100 RQLGYYNENKGKCRQSEKDWCKNNPTRRRMTCAKSRTLKYGSESTLTEKEWLEIKS 155 >gi|169628899|ref|YP_001702548.1| bacteriophage protein [Mycobacterium abscessus ATCC 19977] gi|169240866|emb|CAM61894.1| Bacteriophage protein [Mycobacterium abscessus] Length = 215 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 33/74 (44%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 C R R Y +N + R+ Y ++++K ++ N ++ E R+YY N+ Sbjct: 46 CTRRTNRASYSRNFEGGKARKAAAYKRDREKFLARNRRWRADNYERELETNRKYYHDNKA 105 Query: 74 KMREKARQSYRKLY 87 ++R ++ Y Sbjct: 106 QVRAWHKRYYEANK 119 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 20/58 (34%), Positives = 32/58 (55%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 + K Y ++R+K L R RR N ++ E+ +YY NK + R + +RYY N+D Sbjct: 63 KARKAAAYKRDREKFLARNRRWRADNYERELETNRKYYHDNKAQVRAWHKRYYEANKD 120 Score = 35.0 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 25/56 (44%) Query: 34 RRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSK 89 R Y +N + + Y ++++K+ RR+ N ++ E R+ Y ++ Sbjct: 51 NRASYSRNFEGGKARKAAAYKRDREKFLARNRRWRADNYERELETNRKYYHDNKAQ 106 >gi|155370580|ref|YP_001426114.1| hypothetical protein FR483_N482R [Paramecium bursaria Chlorella virus FR483] gi|155123900|gb|ABT15767.1| hypothetical protein FR483_N482R [Paramecium bursaria Chlorella virus FR483] Length = 318 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 33/67 (49%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +E R Y N++KI ++ + Y N +K + + + Y + K+ R + +N +K+ Sbjct: 27 KEKARLYRENNKEKISKKEKEAYAANPEKKKANANARYAEKKEDIRAKVKAERAENPEKI 86 Query: 76 REKARQS 82 R + R Sbjct: 87 RAQNRSY 93 >gi|321445182|gb|EFX60620.1| hypothetical protein DAPPUDRAFT_123552 [Daphnia pulex] Length = 455 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 32/76 (42%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 C RE++ Y +N ++ E+RR N Q+ N D++ + ++ L N + Sbjct: 330 CNREWQAASYQRNLERNREKRRLHAKVNAAVNNARSRQWAKDNADRHNKNGLQWRLANPE 389 Query: 74 KMREKARQSYRKLYSK 89 K E + + K Sbjct: 390 KSAEMRAKWVAENLHK 405 >gi|9632057|ref|NP_048846.1| hypothetical protein PBCV1_A490L [Paramecium bursaria Chlorella virus 1] gi|1620161|gb|AAC96857.1| Lys-, Glu-rich [Paramecium bursaria Chlorella virus 1] Length = 310 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 44/73 (60%) Query: 12 MLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + RE+ ++ Y K+ +K ER+R+ +N DK RE + Y ++ +K+REY R+Y+ ++ Sbjct: 21 KIKNREWDKKRYAKDPEKFRERKRKEREENIDKFRERARERYAEDPEKFREYDRKYHAED 80 Query: 72 RDKMREKARQSYR 84 +K RE+ R Sbjct: 81 PEKKRERNRLRNA 93 Score = 41.6 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 34/72 (47%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 + K E ++RY K+ +K RE + +N DK+RE R Y ++ +K RE R+ + + Sbjct: 21 KIKNREWDKKRYAKDPEKFRERKRKEREENIDKFRERARERYAEDPEKFREYDRKYHAED 80 Query: 87 YSKDSWIAPEEP 98 K Sbjct: 81 PEKKRERNRLRN 92 >gi|168698018|ref|ZP_02730295.1| Homing nuclease of HNH family with NUMOD4 and IENR domains [Gemmata obscuriglobus UQM 2246] Length = 223 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 22/70 (31%), Positives = 34/70 (48%), Gaps = 11/70 (15%) Query: 15 RREYKRRYYLKNRD-----------KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 RREY R YY NR+ K +RR Y +NK+ ++ + Y +NK+K+ Sbjct: 109 RREYSRLYYHANRERIISLAVPRVEKRKAGQRRYYQQNKEAVQAAVKIYRQENKEKFNAN 168 Query: 64 KRRYYLKNRD 73 RR ++ D Sbjct: 169 ARRKRAESPD 178 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 31/74 (41%), Gaps = 11/74 (14%) Query: 23 YLKNRDKILERRRRRYLKNKDKI-----------RESYHQYYLKNKDKYREYKRRYYLKN 71 + + + E R Y N+++I + +YY +NK+ + + Y +N Sbjct: 102 RDRQKTERREYSRLYYHANRERIISLAVPRVEKRKAGQRRYYQQNKEAVQAAVKIYRQEN 161 Query: 72 RDKMREKARQSYRK 85 ++K AR+ + Sbjct: 162 KEKFNANARRKRAE 175 Score = 43.5 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 33/70 (47%), Gaps = 1/70 (1%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 R + R+ R+ +RRYY +N++ + + +NK+K + + ++ D + E Sbjct: 123 RIISLAVPRVEKRKAGQRRYYQQNKEAVQAAVKIYRQENKEKFNANARRKRAESPD-FSE 181 Query: 63 YKRRYYLKNR 72 +R + + Sbjct: 182 KERGWSKSDI 191 Score = 37.7 bits (86), Expect = 0.51, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 33/81 (40%), Gaps = 13/81 (16%) Query: 22 YYLKNR--DKILERRRRRYLKNKDKIRESYHQYYLKNKDKY-----------REYKRRYY 68 + NR + + + + K + RE YY N+++ + +RRYY Sbjct: 84 WRTNNRPSNLVRLTEQEARDRQKTERREYSRLYYHANRERIISLAVPRVEKRKAGQRRYY 143 Query: 69 LKNRDKMREKARQSYRKLYSK 89 +N++ ++ + ++ K Sbjct: 144 QQNKEAVQAAVKIYRQENKEK 164 >gi|9632045|ref|NP_048834.1| hypothetical protein PBCV1_A478L [Paramecium bursaria Chlorella virus 1] gi|1620149|gb|AAC96845.1| Lys-, Arg-rich; contains eukaryotic putative RNA-binding region RNP-1 signature; similar to PBCV-1 ORF A267L, corresponds to GenBank Accession Number U42580 [Paramecium bursaria Chlorella virus 1] Length = 310 Score = 52.0 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 24/79 (30%), Positives = 44/79 (55%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 ++ R+Y K+ +KI ER R+R K+ +K E + ++ +K+RE R+Y + N +K Sbjct: 24 KKINNRKYCAKHTEKIRERDRKRREKDLEKYNEIKRKQRAEDPEKFRERARKYRIANPEK 83 Query: 75 MREKARQSYRKLYSKDSWI 93 +E R+ + +Y K I Sbjct: 84 FKEYERKRHATIYRKYQHI 102 >gi|221110910|ref|XP_002161653.1| PREDICTED: similar to predicted protein, partial [Hydra magnipapillata] Length = 533 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 42/91 (46%), Gaps = 7/91 (7%) Query: 26 NRDKILERRRRRYLKN-----KDKIRESYHQYYLKN-KDKYREYKRRYYLKN-RDKMREK 78 N + + E R +N ++K+R++ + +N K+ RE R +N ++K+RE Sbjct: 67 NFENVKENIRENISENIRKKVREKVRKNVRENVRENVKENVRENVRENVRENIKEKVREN 126 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 R++ R+ ++ + E K + + Sbjct: 127 VRENVRENVRENVRESVRENASGDKYNVSEI 157 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 38/95 (40%), Gaps = 15/95 (15%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKYR 61 R+ ++ R R + ++ R+ + E R +N ++ I+E + +N Sbjct: 76 RENISENIRKKVREKVRKNVRENVRENVKENVRENVRENVRENIKEKVRENVRENV---- 131 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPE 96 R+ +RE R+S R+ S D + E Sbjct: 132 ----------RENVRENVRESVRENASGDKYNVSE 156 >gi|315121958|ref|YP_004062447.1| hypothetical protein CKC_01040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122926|ref|YP_004063415.1| hypothetical protein CKC_05910 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495360|gb|ADR51959.1| hypothetical protein CKC_01040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496328|gb|ADR52927.1| hypothetical protein CKC_05910 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 209 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 32/103 (31%), Positives = 51/103 (49%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 K L + + R EY+R+YY N++ +E +R+ KNKDKI E Y Q+Y KNKDK E Sbjct: 47 KRLYRQRKKEERNEYQRQYYQNNKEARIEYQRQYLQKNKDKINEYYRQHYQKNKDKLTEN 106 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI 106 R+YY + ++ E R ++ + + + Sbjct: 107 SRQYYQRKKEARIEYQRLYRQRKKEERKAYRRQYIQNTEDKQQ 149 Score = 48.1 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 42/86 (48%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 + + R EY+R Y + +++ RR+ +DK + S YY KN++K Y R+Y+ Sbjct: 112 QRKKEARIEYQRLYRQRKKEERKAYRRQYIQNTEDKQQASSRLYYQKNREKLIAYSRQYH 171 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIA 94 N++K R RQ R + A Sbjct: 172 RNNKEKRRAYDRQYRRDIKQGKLREA 197 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 25/68 (36%), Positives = 39/68 (57%) Query: 22 YYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 Y KN+DK+ E R+ KN+DK+ E+ Y + K++ EY+R+YY N++ E RQ Sbjct: 20 YIQKNKDKLTEYYRQYQQKNRDKLTENKRLYRQRKKEERNEYQRQYYQNNKEARIEYQRQ 79 Query: 82 SYRKLYSK 89 +K K Sbjct: 80 YLQKNKDK 87 >gi|311748598|ref|ZP_07722383.1| putative peptidase M56, BlaR1 [Algoriphagus sp. PR1] gi|126577122|gb|EAZ81370.1| putative peptidase M56, BlaR1 [Algoriphagus sp. PR1] Length = 692 Score = 49.7 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 17/88 (19%), Positives = 41/88 (46%), Gaps = 7/88 (7%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRE---SYHQYYLKNKDKYREYKR 65 +E +E+++ Y K ++ + + N+ K++E + + + K EY+R Sbjct: 604 KEFEAKMKEWQKAYEPKMKE-FEAKMKEWQKSNEPKLKEFEEKMKAWEKEMQPKMEEYQR 662 Query: 66 R---YYLKNRDKMREKARQSYRKLYSKD 90 + + +N DK++E ++ +L D Sbjct: 663 KMEVWQKENADKIKEFQKKLEEQLNKSD 690 Score = 43.1 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 47/129 (36%), Gaps = 18/129 (13%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERR---RRRYLKNKDKIRE---SYHQY---YLK 55 T E +M E + + +N ++ E + N+ K++E ++ Y Sbjct: 560 EKTMEPKMKEFEEKMKAWEKENEPRMKEYEQKMKEWEKANEPKMKEFEAKMKEWQKAYEP 619 Query: 56 NKDKYREYKRRYYLKN-------RDKMREKARQSYRKL--YSKDSWIAPEEPMGMTKAEI 106 ++ + + N +KM+ ++ K+ Y + + +E K Sbjct: 620 KMKEFEAKMKEWQKSNEPKLKEFEEKMKAWEKEMQPKMEEYQRKMEVWQKENADKIKEFQ 679 Query: 107 EALEREIAR 115 + LE ++ + Sbjct: 680 KKLEEQLNK 688 Score = 33.5 bits (75), Expect = 10.0, Method: Composition-based stats. Identities = 13/87 (14%), Positives = 29/87 (33%), Gaps = 7/87 (8%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY---KRRYYLKNRDKMREKARQSYR 84 K E + K K+ E + +N+ + +EY + + N KM+E + Sbjct: 554 KKAAEWEKTMEPKMKE-FEEKMKAWEKENEPRMKEYEQKMKEWEKANEPKMKEFEAKMKE 612 Query: 85 KLYSKDSWIAPEEPMGMTKAEIEALER 111 + ++ + E + Sbjct: 613 ---WQKAYEPKMKEFEAKMKEWQKSNE 636 >gi|170676322|ref|YP_001742083.1| putative endonuclease protein [Salmonella phage E1] gi|170321632|emb|CAM33151.1| phage endonuclease protein [Salmonella phage Vi II-E1] Length = 334 Score = 49.7 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 19/56 (33%), Positives = 32/56 (57%) Query: 12 MLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY 67 C+ E+++ Y + +I E+ +R Y KNK+K E Q+ KNK R ++R+Y Sbjct: 206 KKCQAEFRKNIYKLKKAEINEKSKRYYDKNKEKHNEKSKQWRQKNKGVVRYHQRKY 261 Score = 43.9 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 25/99 (25%), Positives = 41/99 (41%), Gaps = 1/99 (1%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 EE + + E R+ Y K +I E +YY KNK+K+ E +++ Sbjct: 188 EENFNKNKSSLDGLRPDCKKCQAEFRKNIYKLKKAEINEKSKRYYDKNKEKHNEKSKQWR 247 Query: 69 LKNRDKMREKARQS-YRKLYSKDSWIAPEEPMGMTKAEI 106 KN+ +R R+ Y K + SW E + E+ Sbjct: 248 QKNKGVVRYHQRKYKYLKRQAIPSWFEHERVLIEKVYEM 286 >gi|302035568|ref|YP_003795890.1| hypothetical protein NIDE0179 [Candidatus Nitrospira defluvii] gi|300603632|emb|CBK39963.1| protein of unknown function [Candidatus Nitrospira defluvii] Length = 175 Score = 48.5 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 2/68 (2%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 +LT +ER R +++ N + ER+ + Y N++ + Q+ N ++ Sbjct: 1 MLTDDERKERNRLSCKKWRENNPEYNRERQAKWYAANRNDVISRARQWQASNPERANALI 60 Query: 65 R--RYYLK 70 R ++ + Sbjct: 61 REGKWRKQ 68 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 24/57 (42%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 + +++ ++ N + RE ++Y N++ R++ N ++ R+ Sbjct: 6 ERKERNRLSCKKWRENNPEYNRERQAKWYAANRNDVISRARQWQASNPERANALIRE 62 Score = 38.1 bits (87), Expect = 0.36, Method: Composition-based stats. Identities = 12/45 (26%), Positives = 23/45 (51%) Query: 42 KDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 K++ R S ++ N + RE + ++Y NR+ + +ARQ Sbjct: 8 KERNRLSCKKWRENNPEYNRERQAKWYAANRNDVISRARQWQASN 52 >gi|290343638|ref|YP_003495005.1| hypothetical protein OTV1_166 [Ostreococcus tauri virus 1] gi|260161053|emb|CAY39754.1| hypothetical protein OTV1_166 [Ostreococcus tauri virus 1] Length = 222 Score = 48.5 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 37/70 (52%) Query: 12 MLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 +CR++Y + Y +KN++K+ E+ + Y N+ +I+E ++ N + E +R Y + Sbjct: 32 KVCRKKYMKAYEVKNKEKLSEKAKEYYENNRVQIKERVRNHWNDNASEINEKRRERYNND 91 Query: 72 RDKMREKARQ 81 ++ + Sbjct: 92 EQYHTKRLAE 101 Score = 37.0 bits (84), Expect = 1.00, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 37/80 (46%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + E ++ K + + + + + +KNK+K+ E +YY N+ + +E Sbjct: 9 EETKSIEHFGKHKQMKDGHLNQCKVCRKKYMKAYEVKNKEKLSEKAKEYYENNRVQIKER 68 Query: 64 KRRYYLKNRDKMREKARQSY 83 R ++ N ++ EK R+ Y Sbjct: 69 VRNHWNDNASEINEKRRERY 88 >gi|195546630|ref|YP_002117658.1| p100 [Rhizobium phage 16-3] gi|102642566|gb|ABF71347.1| p100 [Rhizobium phage 16-3] Length = 265 Score = 47.4 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 2/79 (2%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLK--NKDKYREYKRR 66 ++ R Y++ N ++ +N +KI+ES ++Y K K+K RE + Sbjct: 58 KDGKDKRYAYQKAKRAANPEQRKAWDAAYRERNPEKIKESQSKHYAKPEAKEKRRESSSK 117 Query: 67 YYLKNRDKMREKARQSYRK 85 + +N +K+REK R+S K Sbjct: 118 WAEENIEKVREKTRRSQAK 136 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 12/59 (20%), Positives = 24/59 (40%) Query: 34 RRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 R+ KDK N ++ + + Y +N +K++E + Y K +K+ Sbjct: 53 RKSYEKDGKDKRYAYQKAKRAANPEQRKAWDAAYRERNPEKIKESQSKHYAKPEAKEKR 111 >gi|312599309|gb|ADQ91332.1| hypothetical protein BpV2_165 [Bathycoccus sp. RCC1105 virus BpV2] Length = 226 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 40/85 (47%), Gaps = 2/85 (2%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 C REY++ +Y +NR+ LER ++ Y +NK+ I+ + + N ++ +R Y D Sbjct: 35 CVREYRKGWYKENREYDLERHKKYYEQNKEHIKARVRKNWNDNAEEINTKRRELYKT--D 92 Query: 74 KMREKARQSYRKLYSKDSWIAPEEP 98 + R + Y K+ + Sbjct: 93 DVYRTKRLEQCEKYRKEKRPENRKN 117 >gi|34365520|tpg|DAA01286.1| TPA_exp: replicase/helicase/endonuclease [Danio rerio] Length = 2353 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 42/121 (34%), Gaps = 12/121 (9%) Query: 6 LTPEERMLCRREYKRRYYLKN---RDKILERRRRRYLKNKDKIRESYH----QYYLKNKD 58 EE ++ + Y N R+K + + Y K +KIRE Y NK+ Sbjct: 344 KASEEYREKHKQAMKNNYWTNVTFREKQKQAMKNSYEK-SEKIRERKKIAGFANYRDNKE 402 Query: 59 KYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 K + + Y KN ++ RE + Y D ++ + R + Sbjct: 403 KRKCNAKNKY-KNDEQYRENKKCKAMNSYRHDEQYREKK---KKYVIYNYKNNKQYRDRL 458 Query: 119 K 119 K Sbjct: 459 K 459 Score = 39.3 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 44/109 (40%), Gaps = 17/109 (15%) Query: 14 CRREYKRRYYLKNRDKILERRR----RRYLKNKDKIRESYHQYYLKNKDKYREYKR---- 65 +++ + Y K+ +KI ER++ Y NK+K + + Y KN ++YRE K+ Sbjct: 370 KQKQAMKNSYEKS-EKIRERKKIAGFANYRDNKEKRKCNAKNKY-KNDEQYRENKKCKAM 427 Query: 66 -------RYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIE 107 +Y K + + + + + +++ + + Sbjct: 428 NSYRHDEQYREKKKKYVIYNYKNNKQYRDRLKNYVIRNYRTNVKYKQQH 476 Score = 37.0 bits (84), Expect = 0.87, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 42/102 (41%), Gaps = 19/102 (18%) Query: 3 RKVLTPEERMLCRREYKR-----RYYLKNRDKILERRRRRYLKNKDKIRESYH-----QY 52 + + E+ RE K+ Y ++K + +Y KN ++ RE+ Y Sbjct: 373 QAMKNSYEKSEKIRERKKIAGFANYRDN-KEKRKCNAKNKY-KNDEQYRENKKCKAMNSY 430 Query: 53 YLKNKDKYREYKRRY----YLKNRDKMREKARQSYRKLYSKD 90 ++YRE K++Y Y N+ R++ + + Y + Sbjct: 431 RHD--EQYREKKKKYVIYNYKNNKQY-RDRLKNYVIRNYRTN 469 >gi|160885908|ref|ZP_02066911.1| hypothetical protein BACOVA_03913 [Bacteroides ovatus ATCC 8483] gi|156108721|gb|EDO10466.1| hypothetical protein BACOVA_03913 [Bacteroides ovatus ATCC 8483] Length = 197 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 4/77 (5%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKN----KDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 + + Y ++NRDK+L RR+ N ++ R SY + Y NK+K E R+Y L ++ Sbjct: 118 KNNKAYRIRNRDKLLARRKELRNANIEHYRELERASYKRRYKVNKEKILEKNRKYQLAHK 177 Query: 73 DKMREKARQSYRKLYSK 89 ++RE + Y+K SK Sbjct: 178 SEIREYMKVYYQKNKSK 194 Score = 33.9 bits (76), Expect = 7.4, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 36/83 (43%), Gaps = 5/83 (6%) Query: 37 RYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPE 96 Y++N++KI ++ Y ++N+DK ++ N + RE R SY K + + Sbjct: 108 YYIENREKILKNNKAYRIRNRDKLLARRKELRNANIEHYRELERASY-----KRRYKVNK 162 Query: 97 EPMGMTKAEIEALEREIARLKAK 119 E + + + + R K Sbjct: 163 EKILEKNRKYQLAHKSEIREYMK 185 >gi|332283226|ref|YP_004418911.1| hypothetical protein PT7_P051 [Pusillimonas sp. T7-7] gi|330430954|gb|AEC22287.1| hypothetical protein PT7_P051 [Pusillimonas sp. T7-7] Length = 247 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 36/79 (45%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 R + Y N +K E +R K D+ R +YY NK++ + + R Y + N + Sbjct: 56 KRLATVKAYREANPEKASEAKRVARGKKIDQYRAREKEYYENNKERAKAWYREYRVANIE 115 Query: 74 KMREKARQSYRKLYSKDSW 92 ++ E +R +K + S Sbjct: 116 RLTEASRAYKKKNAKELSR 134 Score = 42.7 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 35/74 (47%) Query: 13 LCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 C++ KR YL+NR+K L + N +K E+ K D+YR ++ YY N+ Sbjct: 40 PCKKISKREEYLRNREKRLATVKAYREANPEKASEAKRVARGKKIDQYRAREKEYYENNK 99 Query: 73 DKMREKARQSYRKL 86 ++ + R+ Sbjct: 100 ERAKAWYREYRVAN 113 >gi|187929387|ref|YP_001899874.1| hypothetical protein Rpic_2308 [Ralstonia pickettii 12J] gi|187726277|gb|ACD27442.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 207 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 11/99 (11%), Positives = 37/99 (37%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 C E + + + + + ++++++ + YY +N ++ R ++ KN + Sbjct: 47 CSLERTLSHRKAHPEHRKQSDAQYRERHRERVNAASISYYWRNAERLRPVLAQWAKKNPE 106 Query: 74 KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + ++K + + +A + E Sbjct: 107 RAKAAQIVWKAANFAKRNAHEAKRRARKLQATPKWANTE 145 Score = 39.3 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 15/96 (15%), Positives = 37/96 (38%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R++ +Y ++R+++ Y +N +++R Q+ KN ++ + + + N K Sbjct: 63 RKQSDAQYRERHRERVNAASISYYWRNAERLRPVLAQWAKKNPERAKAAQIVWKAANFAK 122 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 + + A E + +AL Sbjct: 123 RNAHEAKRRARKLQATPKWANTERIEAFYRTADALN 158 >gi|222636869|gb|EEE67001.1| hypothetical protein OsJ_23913 [Oryza sativa Japonica Group] Length = 877 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 42/91 (46%), Gaps = 7/91 (7%) Query: 25 KNRDK----ILERRRRRYLKNKDKIRESYHQYYLKNKDKY--REYKRRYYLKNRDKMREK 78 KN ++ + +R+R + ++DK + + ++ ++ E +R + + ++ +EK Sbjct: 89 KNVEELQQIVKKRKREKTQSDRDKDKGKEKERMEEH-ERRPGGERERERHDQEKELEKEK 147 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 R + K+ E + KAE ++L Sbjct: 148 DRAERDRDQDKEKEKLHTERIDKVKAEEDSL 178 >gi|218199463|gb|EEC81890.1| hypothetical protein OsI_25707 [Oryza sativa Indica Group] Length = 839 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 42/91 (46%), Gaps = 7/91 (7%) Query: 25 KNRDK----ILERRRRRYLKNKDKIRESYHQYYLKNKDKY--REYKRRYYLKNRDKMREK 78 KN ++ + +R+R + ++DK + + ++ ++ E +R + + ++ +EK Sbjct: 89 KNVEELQQIVKKRKREKTQSDRDKDKGKEKERMEEH-ERRPGGERERERHDQEKELEKEK 147 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 R + K+ E + KAE ++L Sbjct: 148 DRAERDRDQDKEKEKLHTERIDKVKAEEDSL 178 >gi|221101353|ref|XP_002168998.1| PREDICTED: similar to helicase-like protein [Hydra magnipapillata] Length = 1176 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 39/118 (33%), Gaps = 13/118 (11%) Query: 6 LTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR 65 L E R +C+ E E RR+ +N + E + +N + E R Sbjct: 5 LRAEIRRICQNERNIARRE-------EINRRQNERNATRREELNRRQNERNATRQEELNR 57 Query: 66 RYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 R +N + E R+ + ++ E + + E R + R I Sbjct: 58 RQNERNATRREELNRRQNERNATR------REELNRRQNERNEARRALLRCGMHCIAR 109 >gi|221109657|ref|XP_002168950.1| PREDICTED: similar to F59H6.5 [Hydra magnipapillata] Length = 1199 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 16/120 (13%), Positives = 35/120 (29%), Gaps = 2/120 (1%) Query: 6 LTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR 65 L E R C+ E + + ER R + ++ E + + E Sbjct: 5 LRAEIRRTCQNERNIARREEINRRQNERNATRREELNRRLNERNATRQEELNRRQNERNA 64 Query: 66 RYYLKNRDKMREKARQSYRK-LYSKDSWIAP-EEPMGMTKAEIEALEREIARLKAKPIEE 123 + + E+ + + ++ A E + + E R + + I Sbjct: 65 TRREELNHRQNERNATRREELNHRQNERNATRREELNRRQNERNEARRALLKCGMHCIAR 124 >gi|77955946|gb|ABB05534.1| Hel [Xiphophorus maculatus] Length = 2816 Score = 44.3 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 46/130 (35%), Gaps = 20/130 (15%) Query: 11 RMLCRREYKRRYYLK--NRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 R + Y +Y K R KI + Y KN +E ++ + + Y+++ Sbjct: 908 RQKKNQRYTTKYRTKMVYRQKIKLNKTNTYKKNL-LYKEQKKKWIRNKYRQDQTYQKKLR 966 Query: 69 LK----------NRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 +R+K+++ + Y S + + + + K + RE K Sbjct: 967 QVSISKYKEDKCHREKVKQISIAKYNTDQSHREKV---KLISIAKYNTDQSHRE----KV 1019 Query: 119 KPIEELIYKR 128 K I YK Sbjct: 1020 KQISIAKYKE 1029 Score = 35.0 bits (79), Expect = 3.8, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 40/93 (43%), Gaps = 15/93 (16%) Query: 14 CRREYKR-----RYY--LKNRDKILERRRRRY---LKNKDKIRESYHQYYLKNK---DKY 60 C RE + +Y +R+K+ +Y +++K+++ Y +NK +K Sbjct: 978 CHREKVKQISIAKYNTDQSHREKVKLISIAKYNTDQSHREKVKQISIAKYKENKSHREKV 1037 Query: 61 REYKRRYYLK--NRDKMREKARQSYRKLYSKDS 91 + + R+ YL ++ + + +++ K Sbjct: 1038 KHFSRKQYLNPQHKIHIISNVKLKRQEIKMKSK 1070 >gi|198476752|ref|XP_002132434.1| GA25185 [Drosophila pseudoobscura pseudoobscura] gi|198137836|gb|EDY69836.1| GA25185 [Drosophila pseudoobscura pseudoobscura] Length = 4986 Score = 43.9 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 17/94 (18%), Positives = 37/94 (39%), Gaps = 6/94 (6%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR------EYKRRYYLKNRDKMREK 78 ++++K R+ + + +E + + + K+K R E +R +K R+ Sbjct: 1037 EDKEKEKAHPERKTQTHHQEEKEKERKSHQEEKEKERKAQQQEEKERDERKAKEEKERKA 1096 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + +K + +E K E E E+E Sbjct: 1097 QEEREQKERDEKEQQRVQEQREQEKREREQREKE 1130 Score = 39.6 bits (91), Expect = 0.13, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 42/125 (33%), Gaps = 11/125 (8%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRES---------YHQYY 53 ++ E + + E +R +K + + R K +D+ + + Sbjct: 1066 QEEKEKERKAQQQEEKERDERKAKEEKERKAQEEREQKERDEKEQQRVQEQREQEKRERE 1125 Query: 54 LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREI 113 + K+ + R +R+K +E++ + ++S ++ M E Sbjct: 1126 QREKEHREKEHRE--RDHREKEKEQSSRRSISDSDQESRMSRMREMSSYHKSKSDASSEA 1183 Query: 114 ARLKA 118 + A Sbjct: 1184 SSFYA 1188 Score = 37.0 bits (84), Expect = 0.80, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 43/117 (36%), Gaps = 24/117 (20%) Query: 17 EYKRRYYLKNRDKILERRRR---------------------RYLKNKDKIRESYHQYYLK 55 E + + R+K ++ R ++K+K + + Sbjct: 993 ERSEKEQRQEREKAHQKERHEREKRKQQEREEKEKERKAHHHQPEDKEKEKAHPERKTQT 1052 Query: 56 NKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + +E +R+ + + ++K E+ Q + +D A EE + E E ER+ Sbjct: 1053 HHQEEKEKERKSHQEEKEK--ERKAQQQEE-KERDERKAKEEKERKAQEEREQKERD 1106 >gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura] gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura] Length = 3214 Score = 43.9 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 52/128 (40%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + + +KN+ E Sbjct: 1142 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNA-KLEQQRKEKEMKNQQAIEER 1194 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1195 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKELLLAAEMERERRRQHMNL 1254 Query: 121 IEELIYKR 128 I L +R Sbjct: 1255 IRMLEVRR 1262 >gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi] gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi] Length = 3415 Score = 43.5 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + + LKN+ E Sbjct: 1182 EIAKQQQAMRDAKKQQKEELARNKEKARQ------EKNA-KLEQQRKEKELKNQQAIEER 1234 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1235 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKELLLAAEMERERRRQHMNL 1294 Query: 121 IEELIYKR 128 I L +R Sbjct: 1295 IRMLELRR 1302 >gi|317158138|ref|XP_001826850.2| hypothetical protein AOR_1_450034 [Aspergillus oryzae RIB40] Length = 1678 Score = 43.5 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 29/114 (25%), Positives = 45/114 (39%), Gaps = 9/114 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI--RESYHQYYLKNKDKYR 61 K L + L RR+ + + K R LE +R+ +N DK+ E + KNK+ Sbjct: 1257 KELKSLQEDLERRDQGMKEHEKKRQHDLEEAKRKVKENNDKLAALEKDVKDAKKNKEDAE 1316 Query: 62 EYKRRYYLKNRDKMRE----KARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 + R K +++ E K R+ KL E K +EA R Sbjct: 1317 KRYRNELKKQQEERNEIVARKRREYEEKLR---KLENDERDYRTKKENLEATHR 1367 >gi|261367712|ref|ZP_05980595.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] gi|282570507|gb|EFB76042.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] Length = 100 Score = 43.5 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 20/57 (35%), Positives = 30/57 (52%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRK 85 K+ ++R Y N+DK+ +Y N+DK +R Y NRDK+ E R+ Y K Sbjct: 38 KVAAQQRAWYEANRDKVAAQQRAWYEANRDKVAAQQRAYREANRDKVAEYNRKYYAK 94 Score = 42.3 bits (98), Expect = 0.023, Method: Composition-based stats. Identities = 21/57 (36%), Positives = 31/57 (54%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 +R +Y NRDK+ ++R Y N+DK+ Y N+DK EY R+YY K + Sbjct: 40 AAQQRAWYEANRDKVAAQQRAWYEANRDKVAAQQRAYREANRDKVAEYNRKYYAKRK 96 >gi|168058265|ref|XP_001781130.1| predicted protein [Physcomitrella patens subsp. patens] gi|162667448|gb|EDQ54078.1| predicted protein [Physcomitrella patens subsp. patens] Length = 2302 Score = 43.1 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 51/132 (38%), Gaps = 5/132 (3%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYL---KNKD-KIRESYHQYYLKNKD 58 RK +++ KR++ + + +E+RRR N++ + R+ K Sbjct: 1823 RKEEDWKKKEAENAARKRKHEAAEKKEKMEKRRRLEEVMKANREMEERQRLELERKAQKQ 1882 Query: 59 KYREYKRRYYLKNRDKMREKAR-QSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 K E + ++++ + R + +++ + E + E +E A + Sbjct: 1883 KALEEMEKERKTIEEEIKRQKRLEKEKEVEQRRKKEEERELAWLESKEATRKRKEEAAKQ 1942 Query: 118 AKPIEELIYKRG 129 K +E Y++ Sbjct: 1943 LKLLESEGYQQA 1954 Score = 39.6 bits (91), Expect = 0.15, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 51/129 (39%), Gaps = 12/129 (9%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYL--KNKDKIRESYH--QYYLKNKD- 58 K+ ++ L + ++ K ++ R+R++ + K+K+ + + N++ Sbjct: 1808 KMDQDKQERLLEEQRRKEEDWKKKEAENAARKRKHEAAEKKEKMEKRRRLEEVMKANREM 1867 Query: 59 KYREYKRRYYLKNRDK-MREKARQSY---RKLYSKDSWIAPEEPMGMTKAEIEALEREIA 114 + R+ + K + E ++ ++ + +E K E ERE+A Sbjct: 1868 EERQRLELERKAQKQKALEEMEKERKTIEEEIKRQKRLEKEKEVEQRRKKEE---ERELA 1924 Query: 115 RLKAKPIEE 123 L++K Sbjct: 1925 WLESKEATR 1933 >gi|115901570|ref|XP_783551.2| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115959369|ref|XP_001179820.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 10045 Score = 43.1 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 27/125 (21%), Positives = 53/125 (42%), Gaps = 9/125 (7%) Query: 5 VLTPEERMLCRR-EYKRRYYLKN-RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 L E L R+ E ++ +N ++K+ E+RRR K K ++ H L+ ++ R+ Sbjct: 9312 ELESLEDGLKRKLEKRKSDLNRNMKNKMAEKRRR-----KTKKLQTQHDKELEKIEEVRQ 9366 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIE 122 + N +++R K R+ + + K EE E E +RE + + Sbjct: 9367 KLESERMANEERIR-KEREDFEENEKKKMARELEEYESKLTDE-EKKDREKNEKRINGLN 9424 Query: 123 ELIYK 127 + + Sbjct: 9425 KRKDQ 9429 Score = 42.7 bits (99), Expect = 0.018, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 43/111 (38%), Gaps = 12/111 (10%) Query: 10 ERMLCRREYKRRYYLKNR--DKILERRRRRYLKNKDKIRESYHQYYLK-------NKDKY 60 +R L +R+ +KN+ +K + ++ ++ DK E + K N+++ Sbjct: 9321 KRKLEKRKSDLNRNMKNKMAEKRRRKTKKLQTQH-DKELEKIEEVRQKLESERMANEERI 9379 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLY--SKDSWIAPEEPMGMTKAEIEAL 109 R+ + + + KM + + KL K E+ + + L Sbjct: 9380 RKEREDFEENEKKKMARELEEYESKLTDEEKKDREKNEKRINGLNKRKDQL 9430 Score = 35.4 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 23/63 (36%), Gaps = 1/63 (1%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + ERM E R+ + ++ R + + K+ + + KN+ + Sbjct: 9365 RQKLESERM-ANEERIRKEREDFEENEKKKMARELEEYESKLTDEEKKDREKNEKRINGL 9423 Query: 64 KRR 66 +R Sbjct: 9424 NKR 9426 >gi|148228559|ref|NP_001082528.1| otogelin [Xenopus laevis] gi|49903364|gb|AAH76719.1| LOC398539 protein [Xenopus laevis] Length = 2414 Score = 42.7 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 46/138 (33%), Gaps = 24/138 (17%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI---RESYHQYYLKNKDKY 60 K+ T E + + + KN +K + + ++ + + NK + Sbjct: 171 KMKTNERKGKDHNDSSKESKEKNHEKRQNNKNGWLKDSSEENDSASKEKKDHDKNNKVRT 230 Query: 61 RE-------YKRRYYLKNRDKMREKARQSYR--------------KLYSKDSWIAPEEPM 99 + + + +N + EK+++ ++ K + K+ + P+ Sbjct: 231 NQNKGNDHDHSSKESKENSHEKGEKSKKRWQNGSAEEKDDSSKEKKDHEKNDKVKPKHNK 290 Query: 100 GMTKAEIEALEREIARLK 117 G + +E + K Sbjct: 291 GNGHDDSSKESKENSHEK 308 Score = 38.5 bits (88), Expect = 0.29, Method: Composition-based stats. Identities = 14/100 (14%), Positives = 38/100 (38%), Gaps = 12/100 (12%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKI--RESYHQYYLKNKDKYREYK---------RRY 67 + + +K + ++R + ++ + + KN DK + + Sbjct: 242 SKESKENSHEKGEKSKKRWQNGSAEEKDDSSKEKKDHEKN-DKVKPKHNKGNGHDDSSKE 300 Query: 68 YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIE 107 +N + REK + ++K +++ +E T E++ Sbjct: 301 SKENSHEKREKNKNRWQKGSAEEKDDGSKEKKDKTLEEMD 340 Score = 37.0 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 40/113 (35%), Gaps = 12/113 (10%) Query: 19 KRRYYLKNRDKIL----------ERRRRRYLKNKDKIRESYHQYYLKNKDK--YREYKRR 66 K ++ + DK+ + + KN +K + + + + + ++ +++ Sbjct: 161 KEKWSHEKNDKMKTNERKGKDHNDSSKESKEKNHEKRQNNKNGWLKDSSEENDSASKEKK 220 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 + KN + + + SK+S E +K + E +K Sbjct: 221 DHDKNNKVRTNQNKGNDHDHSSKESKENSHEKGEKSKKRWQNGSAEEKDDSSK 273 >gi|238507994|ref|XP_002385198.1| hypothetical protein AFLA_009530 [Aspergillus flavus NRRL3357] gi|220688717|gb|EED45069.1| hypothetical protein AFLA_009530 [Aspergillus flavus NRRL3357] Length = 1686 Score = 42.7 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 48/111 (43%), Gaps = 3/111 (2%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI--RESYHQYYLKNKDKYR 61 K L + L RR+ + + K R LE +R+ +N DK+ E + KNK+ Sbjct: 1265 KELKSLQEDLERRDQGMKEHEKKRQHDLEEAKRKVKENNDKLAALEKDVKDAKKNKEDAE 1324 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE-IEALER 111 + R K +++ E + R+ K + +E TK E +EA R Sbjct: 1325 KRYRNELKKQQEERNEIVARKRREYEEKLRKLENDERDYRTKKENLEATHR 1375 >gi|242372089|ref|ZP_04817663.1| CPA1 family monovalent cation:proton (H+) antiporter-1 [Staphylococcus epidermidis M23864:W1] gi|242350201|gb|EES41802.1| CPA1 family monovalent cation:proton (H+) antiporter-1 [Staphylococcus epidermidis M23864:W1] Length = 735 Score = 42.7 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 34/89 (38%), Gaps = 9/89 (10%) Query: 30 ILERRRRRYLKNKDKIRESYHQYY--LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 RR+ L +K + + +NK+K RE +R Y +R +++E+ ++ Y Sbjct: 555 RKNRRKEERLVHKQAHAKLKEAHQPSSENKEKIRE-QREAYKNSRKELKEQREIRKQQFY 613 Query: 88 ------SKDSWIAPEEPMGMTKAEIEALE 110 K I + K E + Sbjct: 614 GSYKQVQKILRIVNHHIVLRMKEEQNSSN 642 Score = 37.7 bits (86), Expect = 0.50, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 41/100 (41%), Gaps = 6/100 (6%) Query: 2 GRKVLTPEERMLCRREYKRRYYL-----KNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 RK EER++ ++ + + +N++KI E+ R Y ++ +++E + Sbjct: 554 ARKNRRKEERLVHKQAHAKLKEAHQPSSENKEKIREQ-REAYKNSRKELKEQREIRKQQF 612 Query: 57 KDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPE 96 Y++ ++ + N + + + S + + Sbjct: 613 YGSYKQVQKILRIVNHHIVLRMKEEQNSSNVLEVSLVINQ 652 >gi|156340221|ref|XP_001620388.1| hypothetical protein NEMVEDRAFT_v1g223169 [Nematostella vectensis] gi|156205242|gb|EDO28288.1| predicted protein [Nematostella vectensis] Length = 387 Score = 42.3 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 32/105 (30%), Gaps = 28/105 (26%) Query: 6 LTPEERMLCRREY------KRRYYLKNRDKILERRRRRYLKNK------DKIRESYHQYY 53 L+ E+R C+R Y Y + ++K KN +K R S Y Sbjct: 263 LSSEKRRACKRAYVQNKRANTAYREREKEKAKISASD---KNPDKRAQIEKRRASKRSYI 319 Query: 54 LKNKDKYREYKRRYYLKN-------------RDKMREKARQSYRK 85 K K ++ N +K R R +K Sbjct: 320 QKKKANTTFREKEKEKANISVAYNSPDKQDQIEKRRASKRAYMQK 364 >gi|254729466|ref|YP_003084148.1| hypothetical protein PSS2_gp004 [Cyanophage PSS2] gi|254211618|gb|ACT65566.1| hypothetical protein [Cyanophage PSS2] gi|265524819|gb|ACY75711.1| predicted protein [Cyanophage PSS2] Length = 216 Score = 42.3 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 25/100 (25%), Positives = 41/100 (41%), Gaps = 6/100 (6%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL 69 E R Y R Y NR+KI R+ +R +N + RE+ ++ N Y ++ L Sbjct: 13 EEHEARLAYARNYREANREKINARQNQRRRENPEANREAARKFRRNNPG----YHSQWRL 68 Query: 70 KNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 N +K E R + + + E KA+ A+ Sbjct: 69 DNPEKNAEIRRI--ELIKQTERYRNDSEYREKKKAKQRAI 106 >gi|307182688|gb|EFN69813.1| Probable multidrug resistance-associated protein lethal(2)03659 [Camponotus floridanus] Length = 1174 Score = 42.3 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 8/67 (11%), Positives = 28/67 (41%), Gaps = 2/67 (2%) Query: 42 KDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW--IAPEEPM 99 K+ +E+ + Y +N ++ Y + ++ E +++ + Y ++ + ++ Sbjct: 416 KENTKENTKEKYKENANERENYNQNLNENYKETANENYKENVNENYKENVKNDLTNQKNS 475 Query: 100 GMTKAEI 106 A+ Sbjct: 476 DRMNADQ 482 Score = 38.9 bits (89), Expect = 0.22, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 3/79 (3%) Query: 1 MGRKVLTPEERMLCR--REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKD 58 M +V +PE +++ +E + Y +N ++ + K+ E+Y + +N Sbjct: 403 MYDEVASPETKLMKENTKENTKEKYKENANERENYNQNLNENYKETANENYKENVNENY- 461 Query: 59 KYREYKRRYYLKNRDKMRE 77 K KN D+M Sbjct: 462 KENVKNDLTNQKNSDRMNA 480 >gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae] gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae] Length = 3047 Score = 42.3 bits (98), Expect = 0.023, Method: Composition-based stats. Identities = 27/128 (21%), Positives = 52/128 (40%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ KN++K + KN K+ + + LKN+ E Sbjct: 1087 EIAKQQQAMRDAKKLQKEELAKNKEKARQ------EKNA-KMEQQRKEKELKNQQAIEER 1139 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1140 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKEMLLAAEMERERRRQHMNL 1199 Query: 121 IEELIYKR 128 I L +R Sbjct: 1200 IRMLEIRR 1207 >gi|326492544|dbj|BAK02055.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 459 Score = 42.0 bits (97), Expect = 0.025, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 45/116 (38%), Gaps = 11/116 (9%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 RK + R ++ + +N++ + + N + I E+ Q Y + E Sbjct: 26 RKAFKSKAR-----DFHPDKHTENKEFYSAKFQEITEANNELIDEAKKQAYDSKYRAHAE 80 Query: 63 YKRRYYLKN------RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 KR+Y +N RD + ++ ++ + K E+ TKA I L E Sbjct: 81 RKRKYAEENASFRAMRDALSKREKEHDDQRSQKKQKTMAEQKASETKATIRRLMTE 136 >gi|302916785|ref|XP_003052203.1| predicted protein [Nectria haematococca mpVI 77-13-4] gi|256733142|gb|EEU46490.1| predicted protein [Nectria haematococca mpVI 77-13-4] Length = 2027 Score = 42.0 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 52/115 (45%), Gaps = 3/115 (2%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL 69 ++ L + E +R+ + ++ ERR+ + K+K + + K+ + + + +Y Sbjct: 1869 DKELKKLEQRRKKLDEKVARLHERRQSKLQGEKEKDAATLAKMREKHDKEVAKQEEKYRR 1928 Query: 70 KNR--DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIE 122 + + ++ RE A Q + + + E+ + E ER++AR + +E Sbjct: 1929 EIKKLEEKRE-AEQRKAEERRRKALEREEKNNLSLELERVRAERDLARRQIDLLE 1982 Score = 37.3 bits (85), Expect = 0.73, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 44/117 (37%), Gaps = 4/117 (3%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 RK L + L R + ++K + K+ ++ + +Y + K E Sbjct: 1879 RKKLDEKVARLHERRQSK--LQGEKEKDAATLAKMREKHDKEVAKQEEKYRREIKK--LE 1934 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 KR + ++ R KA + K + + +I+ LE ++ L+A+ Sbjct: 1935 EKREAEQRKAEERRRKALEREEKNNLSLELERVRAERDLARRQIDLLESQVGELQAQ 1991 Score = 36.6 bits (83), Expect = 1.0, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 45/104 (43%), Gaps = 2/104 (1%) Query: 26 NRD-KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 +++ K LE+RR++ + ++ E K+K + K+ ++ ++ + R Sbjct: 1869 DKELKKLEQRRKKLDEKVARLHERRQSKLQGEKEKDAATLAKMREKHDKEVAKQEEKYRR 1928 Query: 85 KLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 ++ + E+ + +ALERE + +E + +R Sbjct: 1929 EIKKLEEKREAEQRKAEERRR-KALEREEKNNLSLELERVRAER 1971 >gi|74181644|dbj|BAE32543.1| unnamed protein product [Mus musculus] Length = 932 Score = 42.0 bits (97), Expect = 0.029, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 39/111 (35%), Gaps = 9/111 (8%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRR------YLKNKDKIRESYHQYYLKN 56 R ER R + K + + ++K + + ++ D + ++ + Sbjct: 541 RDREGERERNKERSKEKDKMREREKEKEAGKELKYPWRELMKDEDSDPYKFKIKEFEDID 600 Query: 57 KDKYREYK---RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 K R RR K++DK +++ R K + + + KA Sbjct: 601 AAKVRLKDGIVRREREKHKDKKKDRERSKREKDKRERERLKEKNREDKIKA 651 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 22/97 (22%), Positives = 36/97 (37%), Gaps = 15/97 (15%) Query: 38 YLKNKDKIRESYHQY-YLKNKDKYREYKRRYYLKNRD--------KMREKARQSYRKLYS 88 +NK++ +E K K+ +E K + +D K++E K+ Sbjct: 547 RERNKERSKEKDKMREREKEKEAGKELKYPWRELMKDEDSDPYKFKIKEFEDIDAAKVRL 606 Query: 89 KDSWIAPEEPMGMTKA------EIEALEREIARLKAK 119 KD + E K + E +RE RLK K Sbjct: 607 KDGIVRREREKHKDKKKDRERSKREKDKRERERLKEK 643 >gi|34395322|dbj|BAC84349.1| putative WD-40 repeat protein family [Oryza sativa Japonica Group] gi|50508526|dbj|BAD30810.1| WD-40 repeat protein-like [Oryza sativa Japonica Group] Length = 609 Score = 42.0 bits (97), Expect = 0.030, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 42/91 (46%), Gaps = 7/91 (7%) Query: 25 KNRDK----ILERRRRRYLKNKDKIRESYHQYYLKNKDKY--REYKRRYYLKNRDKMREK 78 KN ++ + +R+R + ++DK + + ++ ++ E +R + + ++ +EK Sbjct: 89 KNVEELQQIVKKRKREKTQSDRDKDKGKEKERMEEH-ERRPGGERERERHDQEKELEKEK 147 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 R + K+ E + KAE ++L Sbjct: 148 DRAERDRDQDKEKEKLHTERIDKVKAEEDSL 178 >gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis] gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis] Length = 2976 Score = 42.0 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + + LKN+ E Sbjct: 1120 EIAKQQQAMRDAKKQQKEELARNKEKARQ------EKNA-KLEQQRKEKELKNQQAIEER 1172 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1173 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKELLLAAEMERERRRQHMNL 1232 Query: 121 IEELIYKR 128 I L +R Sbjct: 1233 IRMLEVRR 1240 >gi|314055250|ref|YP_004063588.1| hypothetical protein OtV2_155 [Ostreococcus tauri virus 2] gi|313575141|emb|CBI70154.1| hypothetical protein OtV2_155 [Ostreococcus tauri virus 2] Length = 223 Score = 42.0 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 37/68 (54%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 C+ Y+++Y N++++L ++ Y N ++I+E +++ N + E +R Y + + Sbjct: 35 CKSNYQKKYGQDNKERLLLNAKKHYEANTEQIKERVRKHWNDNATEINEKRRERYNNDEE 94 Query: 74 KMREKARQ 81 +++ + Sbjct: 95 YKKKRMEE 102 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 46/117 (39%) Query: 12 MLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 ++ K + + + +++ NK+++ + ++Y N ++ +E R+++ N Sbjct: 18 FPKHKQMKDGHLNQCKSCKSNYQKKYGQDNKERLLLNAKKHYEANTEQIKERVRKHWNDN 77 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 ++ EK R+ Y E K E ++ A + EL KR Sbjct: 78 ATEINEKRRERYNNDEEYKKKRMEEVSKSNAKCRGERRKKAKEERSASYLLELFRKR 134 >gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis] gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis] Length = 3244 Score = 41.6 bits (96), Expect = 0.034, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 52/128 (40%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + + +KN+ E Sbjct: 1145 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNA-KLEQQRKEKEMKNQQAIEER 1197 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1198 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKELLLAAEMERERRRQHMNL 1257 Query: 121 IEELIYKR 128 I L +R Sbjct: 1258 IRMLEVRR 1265 >gi|195169095|ref|XP_002025363.1| GL12280 [Drosophila persimilis] gi|194108831|gb|EDW30874.1| GL12280 [Drosophila persimilis] Length = 1832 Score = 41.6 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 47/101 (46%), Gaps = 5/101 (4%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLK---NKDKYREYKR-RYYLKNRDKMREKA 79 KNR E+ RR +K+K+RE ++ K+K R+ + R+ +++D+ RE++ Sbjct: 363 EKNRKSSKEKHRR-RDVSKEKLRERSKEHPKDMYMPKEKNRDRSKDRHRERSKDRHRERS 421 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 ++ K +++ ++ K E+ + KP Sbjct: 422 KEKKSKETTRERSKDDKKNAKRPKDLKESSKESHTNHAEKP 462 Score = 38.1 bits (87), Expect = 0.41, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 44/111 (39%), Gaps = 6/111 (5%) Query: 3 RKVLTPEERMLCRREYKRRY--YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 R V + R + K Y KNRD+ +R R ++KD+ RE + K + Sbjct: 376 RDVSKEKLRERSKEHPKDMYMPKEKNRDRSKDRHR---ERSKDRHRERSKEKKSKETTRE 432 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 R + K ++E +++S+ +++ P+E E R Sbjct: 433 RSKDDKKNAKRPKDLKESSKESH-TNHAEKPRDFPKEISHRDNKEQPKEHR 482 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 45/124 (36%), Gaps = 14/124 (11%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRR-YLKNKDKIRESYHQYYLKNKDKY 60 GR +E + +R +N + E+ KN+ +E + + +K+K Sbjct: 326 GRSADKHKENA-ANEKDGKRSKERNSETSKEKSAEAPNEKNRKSSKEKHRR-RDVSKEKL 383 Query: 61 REYKR-----------RYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 RE + + +++D+ RE+++ +R+ + + Sbjct: 384 RERSKEHPKDMYMPKEKNRDRSKDRHRERSKDRHRERSKEKKSKETTRERSKDDKKNAKR 443 Query: 110 EREI 113 +++ Sbjct: 444 PKDL 447 >gi|223648228|gb|ACN10872.1| GTPase IMAP family member 7 [Salmo salar] Length = 686 Score = 41.2 bits (95), Expect = 0.045, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 40/95 (42%), Gaps = 5/95 (5%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R + ++ + K ++ ++R + +++I++ + N ++ E + K+R + Sbjct: 540 RIQREKEWLQKEEEERIQREKEWME--RERIKKETRERV--NMERIEEDRMERMEKDRME 595 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 ++ K + + +E + K E E + Sbjct: 596 RERMEKERIEKDRVERERVE-KERIKKEKVEREKV 629 Score = 39.6 bits (91), Expect = 0.16, Method: Composition-based stats. Identities = 15/96 (15%), Positives = 44/96 (45%), Gaps = 7/96 (7%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 D+ ++R + K +++ + ++ +++ ++ R N +++ E + K Sbjct: 538 DERIQREKEWLQKEEEERIQREKEWME--RERIKKETRE--RVNMERIEEDRMERMEKDR 593 Query: 88 SKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 + + +E + + E E +E+E R+K + +E Sbjct: 594 MERERME-KERIEKDRVERERVEKE--RIKKEKVER 626 >gi|195014854|ref|XP_001984090.1| GH15196 [Drosophila grimshawi] gi|193897572|gb|EDV96438.1| GH15196 [Drosophila grimshawi] Length = 1895 Score = 41.2 bits (95), Expect = 0.052, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 43/119 (36%), Gaps = 4/119 (3%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRR-RRYLKNKDKIRESYHQYYLKNKDKYR 61 R+ T R R + RY K R++ E++R R N+ + E + Y + + ++ Sbjct: 378 RESKTASSRDRPRERSRERYKEKKRERSNEKQRARSKEGNRSR--ERRFERYKEKSEHFK 435 Query: 62 EYKRRY-YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 R K D +EK + + S +P ++A AK Sbjct: 436 AKPRERSREKRLDSSKEKQLDRSNEKRRELSKERQRDPNSSKDKTTSRSTDKLASTAAK 494 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 41/99 (41%), Gaps = 1/99 (1%) Query: 31 LERRRRRYLKNKDKIRESYHQYYLKNKDKYRE-YKRRYYLKNRDKMREKARQSYRKLYSK 89 E+ +R +++ +++ K ++ RE + R++ RE+ ++ R+ ++ Sbjct: 348 REQGKRSRDTPREREENHRNRFREKRSERSRESKTASSRDRPRERSRERYKEKKRERSNE 407 Query: 90 DSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 +E + E + + KAKP E KR Sbjct: 408 KQRARSKEGNRSRERRFERYKEKSEHFKAKPRERSREKR 446 Score = 37.3 bits (85), Expect = 0.67, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 38/106 (35%), Gaps = 4/106 (3%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 R+ + R + R ++R++ E++R ++ +K R + +++ E Sbjct: 370 REKRSERSRESKTASSRDRPRERSRERYKEKKR---ERSNEKQRARSKEGNRS-RERRFE 425 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEA 108 + + K RE++R+ + E E + Sbjct: 426 RYKEKSEHFKAKPRERSREKRLDSSKEKQLDRSNEKRRELSKERQR 471 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 44/116 (37%), Gaps = 6/116 (5%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYY-LKNKDKYREYKRR-YYLKNRDKMR 76 +R ++RD ER + ++K E + ++D+ RE R Y K R++ Sbjct: 347 QREQGKRSRDTPREREENHRNRFREKRSERSRESKTASSRDRPRERSRERYKEKKRERSN 406 Query: 77 EKARQS-YRKLYSKDSWIAPEEPMG---MTKAEIEALEREIARLKAKPIEELIYKR 128 EK R S++ + K + E+ + K K ++ KR Sbjct: 407 EKQRARSKEGNRSRERRFERYKEKSEHFKAKPRERSREKRLDSSKEKQLDRSNEKR 462 >gi|167520504|ref|XP_001744591.1| hypothetical protein [Monosiga brevicollis MX1] gi|163776922|gb|EDQ90540.1| predicted protein [Monosiga brevicollis MX1] Length = 600 Score = 41.2 bits (95), Expect = 0.053, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 2/61 (3%) Query: 38 YLKNKDKIRESYHQYYLKNKDKYREYKRRY-YLKNRDKMREKARQSYRKLYSKDSWIAPE 96 +++ +E + + +D+ RE +R K R+K REK R+ R+ S Sbjct: 541 RETERERDQERDRERDQE-RDQEREKQREKQREKQREKQREKQREKQRETESDQERERAR 599 Query: 97 E 97 E Sbjct: 600 E 600 Score = 38.1 bits (87), Expect = 0.37, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 31/61 (50%), Gaps = 3/61 (4%) Query: 23 YLKNRDKILERRRRR-YLKNKDKIRESYHQYYLKNKDKYREYKRRY-YLKNRDKMREKAR 80 R++ ER R R +++++ ++ Q K ++K RE +R D+ RE+AR Sbjct: 541 RETERERDQERDRERDQERDQEREKQREKQ-REKQREKQREKQREKQRETESDQERERAR 599 Query: 81 Q 81 + Sbjct: 600 E 600 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 28/57 (49%), Gaps = 4/57 (7%) Query: 8 PEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 +ER R + + + K R+K E++R K ++K RE + ++++ R + Sbjct: 548 DQERDRERDQERDQEREKQREKQREKQR---EKQREKQREKQRE-TESDQERERARE 600 >gi|171769783|sp|A2R7P5|NST1_ASPNC RecName: Full=Stress response protein nst1 gi|134083288|emb|CAK46843.1| unnamed protein product [Aspergillus niger] Length = 1201 Score = 41.2 bits (95), Expect = 0.053, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 7/94 (7%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD--- 73 E +RR + + K E R+ + + + + +++ E +R+ + ++ Sbjct: 592 EEQRRKREEQKKK-REAERKAQEAERLRKEAEKQKRLREERERQAEIERKQREQ-KELEK 649 Query: 74 KMREKARQSY-RKLYSKDSWIAP-EEPMGMTKAE 105 K RE+ARQ+ R+ +KD E T E Sbjct: 650 KRREEARQNELREKKTKDERERKLREAAPKTDYE 683 >gi|313225529|emb|CBY07003.1| unnamed protein product [Oikopleura dioica] Length = 327 Score = 40.8 bits (94), Expect = 0.056, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 1/90 (1%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 E+ L +R+ R ++ RDKI ++ + K ++K E + K+ ++ RE R Sbjct: 44 REKSLEQRDRLREKSIEQRDKIRDKSIEQREKFREKSIEQREKLREKSVEQ-REKFREKS 102 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEP 98 L+ ++K REK R+ + ++ + P Sbjct: 103 LEQQEKFREKGREIRERAVRENWYTIPNAI 132 Score = 39.3 bits (90), Expect = 0.17, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKILERR-------RRRYLKNKDKIRESYHQYYLKNKDK-- 59 +E+ L +RE R + + R+K E+ R + ++ +DKIR+ + K ++K Sbjct: 22 KEKTLEQREKIRHHASEQREKFREKSLEQRDRLREKSIEQRDKIRDKSIEQREKFREKSI 81 Query: 60 -YREYKRRYYLKNRDKMREKARQSYRKLYSKDSWI 93 RE R ++ R+K REK+ + K K I Sbjct: 82 EQREKLREKSVEQREKFREKSLEQQEKFREKGREI 116 Score = 38.1 bits (87), Expect = 0.39, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 2/107 (1%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMRE 77 K Y K ++K LE+R + ++ RE + + L+ +D+ RE K RDK E Sbjct: 13 KKEVYREKIKEKTLEQREKIRHHASEQ-REKFREKSLEQRDRLREKSIEQRDKIRDKSIE 71 Query: 78 KARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE-REIARLKAKPIEE 123 + + K + + + K ++LE +E R K + I E Sbjct: 72 QREKFREKSIEQREKLREKSVEQREKFREKSLEQQEKFREKGREIRE 118 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 3/85 (3%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN---KDKYREYKRRYYLKN 71 +RE R L+ RD++ E+ + K +DK E ++ K+ ++K RE K Sbjct: 39 QREKFREKSLEQRDRLREKSIEQRDKIRDKSIEQREKFREKSIEQREKLREKSVEQREKF 98 Query: 72 RDKMREKARQSYRKLYSKDSWIAPE 96 R+K E+ + K E Sbjct: 99 REKSLEQQEKFREKGREIRERAVRE 123 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 36/98 (36%), Gaps = 3/98 (3%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN---KDKYREYKRRYYLKNR 72 RE + L+ R+KI + K ++K E + K+ +DK R+ K R Sbjct: 18 REKIKEKTLEQREKIRHHASEQREKFREKSLEQRDRLREKSIEQRDKIRDKSIEQREKFR 77 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 +K E+ + K + + K + E Sbjct: 78 EKSIEQREKLREKSVEQREKFREKSLEQQEKFREKGRE 115 >gi|168024673|ref|XP_001764860.1| predicted protein [Physcomitrella patens subsp. patens] gi|162683896|gb|EDQ70302.1| predicted protein [Physcomitrella patens subsp. patens] Length = 1367 Score = 40.8 bits (94), Expect = 0.059, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 44/107 (41%), Gaps = 12/107 (11%) Query: 22 YYLKN--RDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK 78 Y+ N R+K R + +N +K+++ Q + + ++ R +R+K +E+ Sbjct: 1070 YHKNNLKREKNSSRNAKDRIRENTEKVKDRESQRNKEREHTVQKISERERESDREKHKER 1129 Query: 79 AR--------QSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 + +S + ++ + K+ + +ERE + K Sbjct: 1130 EKDKDRGKGTKSRGRDSERERRKSTSSQFDR-KSTLNKMEREKSYNK 1175 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 38/104 (36%), Gaps = 16/104 (15%) Query: 20 RRYYLKN-RDKILERRRRRYLKN---KDKIRES-YHQYYLKNKDKYREYKRRYYLKNRD- 73 + N + E Y KN ++K +N +K ++ + +N++ Sbjct: 1051 KDRDQSNAKQDAKEAEGDGYHKNNLKREKNSSRNAKDRIRENTEKVKDRES---QRNKER 1107 Query: 74 -----KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 K+ E+ R+S R+ ++ TK+ ERE Sbjct: 1108 EHTVQKISERERESDRE--KHKEREKDKDRGKGTKSRGRDSERE 1149 >gi|110004991|emb|CAK99322.1| hypothetical transmembrane protein [Spiroplasma citri] Length = 1058 Score = 40.8 bits (94), Expect = 0.061, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 29/76 (38%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R +R ++ E ++ +KN D ++ + Y + + + + KN Sbjct: 884 HRAQIKRLRRAHKVYYQEELKKLAIKNPDLKLKALKKQYREQLHRVQLAAKDIKSKNHKI 943 Query: 75 MREKARQSYRKLYSKD 90 +RE ++ Y + Sbjct: 944 IREIRKEFRVYNYEHN 959 Score = 38.5 bits (88), Expect = 0.33, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 37/100 (37%), Gaps = 6/100 (6%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD-KMREKARQS 82 +N+ I R + + R + +K Y+E ++ +KN D K++ +Q Sbjct: 863 KQNKKTINIREYETFFAYMSEHRAQIKRLRRAHKVYYQEELKKLAIKNPDLKLKALKKQY 922 Query: 83 YRKLYS-----KDSWIAPEEPMGMTKAEIEALEREIARLK 117 +L+ KD + + + E E L+ Sbjct: 923 REQLHRVQLAAKDIKSKNHKIIREIRKEFRVYNYEHNFLQ 962 >gi|115471661|ref|NP_001059429.1| Os07g0405100 [Oryza sativa Japonica Group] gi|113610965|dbj|BAF21343.1| Os07g0405100 [Oryza sativa Japonica Group] gi|215767231|dbj|BAG99459.1| unnamed protein product [Oryza sativa Japonica Group] Length = 555 Score = 40.8 bits (94), Expect = 0.064, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 3/82 (3%) Query: 30 ILERRRRRYLKNKDKIRESYHQYYLKNKDKY--REYKRRYYLKNRDKMREKARQSYRKLY 87 + +R+R + ++DK + + ++ ++ E +R + + ++ +EK R + Sbjct: 98 VKKRKREKTQSDRDKDKGKEKERMEEH-ERRPGGERERERHDQEKELEKEKDRAERDRDQ 156 Query: 88 SKDSWIAPEEPMGMTKAEIEAL 109 K+ E + KAE ++L Sbjct: 157 DKEKEKLHTERIDKVKAEEDSL 178 >gi|167044082|gb|ABZ08766.1| hypothetical protein ALOHA_HF4000APKG5B22ctg2g4 [uncultured marine crenarchaeote HF4000_APKG5B22] Length = 82 Score = 40.8 bits (94), Expect = 0.066, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 44/82 (53%), Gaps = 4/82 (4%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERRRR----RYLKNKDKIRESYHQYYLKNKDKY 60 ++ P + RE R++ +N +K E R+ KN +K+R++ +Y +N++K+ Sbjct: 1 MILPYKDSEKARESGRKWRKENPEKAKESSRKSNRKYRKKNLEKVRKAEKKYREENREKF 60 Query: 61 REYKRRYYLKNRDKMREKARQS 82 R+Y KN +K +E +R+ Sbjct: 61 IRASRKYRKKNLEKAKESSRKW 82 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 37/71 (52%), Gaps = 4/71 (5%) Query: 25 KNRDKILERRRRRYLKNKDKIRESY----HQYYLKNKDKYREYKRRYYLKNRDKMREKAR 80 K+ +K E R+ +N +K +ES +Y KN +K R+ +++Y +NR+K +R Sbjct: 6 KDSEKARESGRKWRKENPEKAKESSRKSNRKYRKKNLEKVRKAEKKYREENREKFIRASR 65 Query: 81 QSYRKLYSKDS 91 + +K K Sbjct: 66 KYRKKNLEKAK 76 >gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba] gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba] Length = 3129 Score = 40.4 bits (93), Expect = 0.073, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + LKN+ E Sbjct: 1068 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNS-KLEQQRKDKELKNQQAIEER 1120 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1121 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKEMLLAAEMERERRRQHMSL 1180 Query: 121 IEELIYKR 128 I L +R Sbjct: 1181 IRMLELRR 1188 >gi|262231748|ref|NP_082024.2| transcription initiation factor TFIID subunit 3 [Mus musculus] gi|110287981|sp|Q5HZG4|TAF3_MOUSE RecName: Full=Transcription initiation factor TFIID subunit 3; AltName: Full=140 kDa TATA box-binding protein-associated factor; AltName: Full=TBP-associated factor 3; AltName: Full=Transcription initiation factor TFIID 140 kDa subunit; Short=TAF(II)140; Short=TAF140; Short=TAFII-140; Short=TAFII140 gi|13374178|emb|CAC34476.1| TAFII140 protein [Mus musculus] gi|123233028|emb|CAM23088.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor [Mus musculus] gi|148676053|gb|EDL08000.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor [Mus musculus] gi|187951735|gb|AAI37618.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor [Mus musculus] gi|187957546|gb|AAI37619.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor [Mus musculus] Length = 932 Score = 40.4 bits (93), Expect = 0.077, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 34/98 (34%), Gaps = 8/98 (8%) Query: 15 RREYKRRY-YLKNRDKILERRRRRYL----KNKDKIRESYHQYYLKNKDKYREYK---RR 66 +E + K ++ E + ++ D + ++ + K R RR Sbjct: 554 SKEKDKMREREKEKEAGKELKYPWRELMKDEDSDPYKFKIKEFEDIDAAKVRLKDGIVRR 613 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 K++DK +++ R K + + + KA Sbjct: 614 EREKHKDKKKDRERSKREKDKRERERLKEKNREDKIKA 651 >gi|57242908|gb|AAH89030.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor [Mus musculus] Length = 948 Score = 40.4 bits (93), Expect = 0.078, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 34/98 (34%), Gaps = 8/98 (8%) Query: 15 RREYKRRY-YLKNRDKILERRRRRYL----KNKDKIRESYHQYYLKNKDKYREYK---RR 66 +E + K ++ E + ++ D + ++ + K R RR Sbjct: 570 SKEKDKMREREKEKEAGKELKYPWRELMKDEDSDPYKFKIKEFEDIDAAKVRLKDGIVRR 629 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 K++DK +++ R K + + + KA Sbjct: 630 EREKHKDKKKDRERSKREKDKRERERLKEKNREDKIKA 667 >gi|20065976|ref|NP_612842.1| putative tape-measure protein [Clostridium phage phi3626] gi|168211289|ref|ZP_02636914.1| phage tail tape measure protein, family [Clostridium perfringens B str. ATCC 3626] gi|19908307|gb|AAL96783.1| putative tape-measure protein [Clostridium phage phi3626] gi|170710730|gb|EDT22912.1| phage tail tape measure protein, family [Clostridium perfringens B str. ATCC 3626] Length = 961 Score = 40.4 bits (93), Expect = 0.080, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 43/114 (37%), Gaps = 10/114 (8%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKD--KIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++ + KI E + ++ K E + K++ ++ + Y N+ Sbjct: 86 KQKSKQSELAKKISETSDKYKKSVQETGKNSEESKKL----KEELKQLEAEYTKNNKAID 141 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKRG 129 + S ++ +S A EE K +E + +E+ + K E I K G Sbjct: 142 TNVIKISNAEIKLNNSKKALEEN----KKALENINKELDKSKIDKFSEGIGKAG 191 >gi|198465721|ref|XP_002135027.1| GA23468 [Drosophila pseudoobscura pseudoobscura] gi|198150286|gb|EDY73654.1| GA23468 [Drosophila pseudoobscura pseudoobscura] Length = 1924 Score = 40.4 bits (93), Expect = 0.080, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 46/101 (45%), Gaps = 5/101 (4%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLK---NKDKYREYKR-RYYLKNRDKMREKA 79 KNR E+ RR +K+K RE ++ K+K R+ + R+ +++D+ RE++ Sbjct: 363 EKNRKSSKEKHRR-RDVSKEKPRERSKEHPKDMYMPKEKNRDRSKDRHRERSKDRHRERS 421 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 ++ K +++ ++ K E+ + KP Sbjct: 422 KEKKSKETTRERSKDDKKNAKRPKDLKESSKESHTNHAEKP 462 Score = 39.3 bits (90), Expect = 0.20, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 45/117 (38%), Gaps = 4/117 (3%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 P ER + KNRD+ +R R ++KD+ RE + K + R + Sbjct: 382 KPRERSKEHPKDMYMPKEKNRDRSKDRHR---ERSKDRHRERSKEKKSKETTRERSKDDK 438 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 K ++E +++S+ +++ P+E E R + + E+ Sbjct: 439 KNAKRPKDLKESSKESH-TNHAEKPRDFPKEISHRDNKEQPKEHRSKEGMHEQSAEK 494 >gi|50549217|ref|XP_502079.1| YALI0C21186p [Yarrowia lipolytica] gi|49647946|emb|CAG82399.1| YALI0C21186p [Yarrowia lipolytica] Length = 1567 Score = 40.4 bits (93), Expect = 0.082, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 46/108 (42%), Gaps = 18/108 (16%) Query: 17 EYKRRYYL---KNRD-KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 ++ ++++ K ++ K E+ +R K+ ++E +Y NK + Sbjct: 720 KFGKKWWSRSGKTKEAKRQEKETKRLE--KEAVKERERLWYENNK-----------KEKE 766 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 D++RE R+ K + ++ P ++ AL+ +K+KP Sbjct: 767 DRLREMRREK-EKTVVAEEAVSEHTPATKSRNAFAALKTVSQLVKSKP 813 Score = 39.3 bits (90), Expect = 0.17, Method: Composition-based stats. Identities = 10/59 (16%), Positives = 24/59 (40%), Gaps = 1/59 (1%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 G+K + + + ++ ++ + ER R Y NK + + + + K+K Sbjct: 722 GKKWWSRSGKTKEAKRQEKETKRLEKEAVKERERLWYENNKKEKEDRLREMRRE-KEKT 779 >gi|326407828|gb|ADZ64899.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 704 Score = 40.4 bits (93), Expect = 0.082, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 47/119 (39%), Gaps = 7/119 (5%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRES----YHQYYLKNKDK 59 + L +E + +R KR+ +N + + + + + + + + KNK+K Sbjct: 541 QKLQAKEEKIKQRLEKRKQKFENPNLLKRKVADYRKQYQQRKNATNPNLAQKVEQKNKEK 600 Query: 60 YREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 + Y KN+ +++ +A + + D + K +AL +E +A Sbjct: 601 -NASIQALYDKNKTQLQARAIERRLREAVPDQEKPNLSELAKNK--QKALAKEKLDERA 656 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 43/107 (40%), Gaps = 8/107 (7%) Query: 24 LKNRDKILER--RRRRYLKNKDKIRESYHQYYLKNKDKYREYK----RRYYLKNRDKMRE 77 +KI +R +R++ +N + ++ Y + + + ++ KN++K Sbjct: 544 QAKEEKIKQRLEKRKQKFENPNLLKRKVADYRKQYQQRKNATNPNLAQKVEQKNKEK-NA 602 Query: 78 KARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE-REIARLKAKPIEE 123 + Y K ++ A E + + E E+A+ K K + + Sbjct: 603 SIQALYDKNKTQLQARAIERRLREAVPDQEKPNLSELAKNKQKALAK 649 >gi|260834587|ref|XP_002612291.1| hypothetical protein BRAFLDRAFT_122528 [Branchiostoma floridae] gi|229297668|gb|EEN68300.1| hypothetical protein BRAFLDRAFT_122528 [Branchiostoma floridae] Length = 2617 Score = 40.4 bits (93), Expect = 0.084, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 51/116 (43%), Gaps = 17/116 (14%) Query: 17 EYKRRYYLKNR--DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 E ++++ + + +++ E + ++ + + + K+ + +K + ++DK Sbjct: 354 EERKKFEEEQKTWNRLQEMHEQALREDAKYKKRLSFDDWNRQKELEKIHKMEEHEDDKDK 413 Query: 75 MREKARQSYRKLYSKD--SWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 + ++ +K+ +++ W+ + E E LE+E + K E Y+R Sbjct: 414 --NQQKERRQKMSAEEYQKWV--------LRKEQEELEKERQKQKE---AEKDYQR 456 >gi|323450322|gb|EGB06204.1| hypothetical protein AURANDRAFT_65873 [Aureococcus anophagefferens] Length = 1681 Score = 40.4 bits (93), Expect = 0.085, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 55/133 (41%), Gaps = 14/133 (10%) Query: 4 KVLTPEERMLCRREYKRRYYLKNR-------DKILERRRRRYLKNKDKIRESYHQYYL-K 55 + ER++ E R + + ++ +E +RR+ ++++ R+ + Sbjct: 162 RDKAAFERIVAETEKHRAEEEELQALRDLLWEEEMEAQRRKEEQDREAKRQQNKRDMALA 221 Query: 56 NKDKYREYKRRYYLKNRDKMR--EKARQSYRKLYSKDSWIAPEEPMGMTKA----EIEAL 109 N++ +++ ++ +++ R E R + + + IA K +++ + Sbjct: 222 NEEMLVAKQKQREVERKEEQRLVETMRHKFAEDERNERLIADRRAADKAKYIEAVKLQKI 281 Query: 110 EREIARLKAKPIE 122 ER+ +AK E Sbjct: 282 ERQEVYERAKAAE 294 >gi|221506508|gb|EEE32125.1| kelch motif domain-containing protein, putative [Toxoplasma gondii VEG] Length = 818 Score = 40.4 bits (93), Expect = 0.089, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 4/109 (3%) Query: 6 LTPEERMLCRREYK-RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 L E+R L R + + + R+K+ + ++ Y K K++ R++ + + K E + Sbjct: 329 LRKEKRDLVRAKKEFEEERKQAREKLQQEKKIEYEKIKEERRKAQAEMATQIKQIQVERE 388 Query: 65 RRYYLKNRDKMR---EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 N ++ R EK R L K+ + E + K I Sbjct: 389 DARRKLNAERGRFEEEKEAHRRRTLLEKERFRQEAEALDAEKRRIVDAN 437 >gi|221486805|gb|EEE25051.1| kelch motif domain-containing protein, putative [Toxoplasma gondii GT1] Length = 818 Score = 40.4 bits (93), Expect = 0.089, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 4/109 (3%) Query: 6 LTPEERMLCRREYK-RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 L E+R L R + + + R+K+ + ++ Y K K++ R++ + + K E + Sbjct: 329 LRKEKRDLVRAKKEFEEERKQAREKLQQEKKIEYEKIKEERRKAQAEMATQIKQIQVERE 388 Query: 65 RRYYLKNRDKMR---EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 N ++ R EK R L K+ + E + K I Sbjct: 389 DARRKLNAERGRFEEEKEAHRRRTLLEKERFRQEAEALDAEKRRIVDAN 437 >gi|237832079|ref|XP_002365337.1| kelch motif domain-containing protein [Toxoplasma gondii ME49] gi|211963001|gb|EEA98196.1| kelch motif domain-containing protein [Toxoplasma gondii ME49] Length = 818 Score = 40.4 bits (93), Expect = 0.089, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 4/109 (3%) Query: 6 LTPEERMLCRREYK-RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 L E+R L R + + + R+K+ + ++ Y K K++ R++ + + K E + Sbjct: 329 LRKEKRDLVRAKKEFEEERKQAREKLQQEKKIEYEKIKEERRKAQAEMATQIKQIQVERE 388 Query: 65 RRYYLKNRDKMR---EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 N ++ R EK R L K+ + E + K I Sbjct: 389 DARRKLNAERGRFEEEKEAHRRRTLLEKERFRQEAEALDAEKRRIVDAN 437 >gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster] gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster] Length = 2999 Score = 40.0 bits (92), Expect = 0.096, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + LKN+ E Sbjct: 1069 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNS-KLEQQRKDKELKNQQAVEER 1121 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1122 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKEMLLAAEMERERRRQHMSL 1181 Query: 121 IEELIYKR 128 I L +R Sbjct: 1182 IRMLELRR 1189 >gi|281349155|gb|EFB24739.1| hypothetical protein PANDA_012644 [Ailuropoda melanoleuca] Length = 1041 Score = 40.0 bits (92), Expect = 0.10, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 3/76 (3%) Query: 2 GRKVLTPEERMLCRREYKR--RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDK 59 G ++ PE + RE R +Y + ++ R ++D+ RE + ++D+ Sbjct: 21 GSEMDLPEYKEPKHREPDRDSKYRERPSERDGHTSRENPHGDRDRDREKQRERKKDSRDR 80 Query: 60 YREYKR-RYYLKNRDK 74 RE + +Y ++ +K Sbjct: 81 DRERHKEKYREQDTEK 96 Score = 37.0 bits (84), Expect = 0.80, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 1/91 (1%) Query: 3 RKVLTPEERML-CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 ++ E+ L E Y + + + +R + + ++ + + ++D+ R Sbjct: 8 KEDKKHREKKLHKGSEMDLPEYKEPKHREPDRDSKYRERPSERDGHTSRENPHGDRDRDR 67 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 E +R +RD+ RE+ ++ YR+ ++ S Sbjct: 68 EKQRERKKDSRDRDRERHKEKYREQDTEKSH 98 Score = 33.9 bits (76), Expect = 7.9, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 33/91 (36%), Gaps = 4/91 (4%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY----YLKNR 72 + +++ K K E Y + K + + +Y + ++ R ++R Sbjct: 8 KEDKKHREKKLHKGSEMDLPEYKEPKHREPDRDSKYRERPSERDGHTSRENPHGDRDRDR 67 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGMTK 103 +K RE+ + S + + E+ + Sbjct: 68 EKQRERKKDSRDRDRERHKEKYREQDTEKSH 98 >gi|258574525|ref|XP_002541444.1| conserved hypothetical protein [Uncinocarpus reesii 1704] gi|237901710|gb|EEP76111.1| conserved hypothetical protein [Uncinocarpus reesii 1704] Length = 1192 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 34/86 (39%), Gaps = 6/86 (6%) Query: 40 KNKDKIRESYHQYYLKNKDKYREYKRRYYLKN------RDKMREKARQSYRKLYSKDSWI 93 N ++ + +N++ Y Y + + ++EK R++ R SW+ Sbjct: 504 ANPQDKKDQKKDEHQENEEYPVAYNEAYEEDDYGEPLWKKYIKEKHRETMRLPIFGWSWM 563 Query: 94 APEEPMGMTKAEIEALEREIARLKAK 119 +G +I +E+ARL + Sbjct: 564 PSLPLIGKKVDKIYHCRKELARLNLE 589 >gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta] gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta] Length = 3148 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + LKN+ E Sbjct: 1069 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNS-KLEQQRKDKELKNQQAVEER 1121 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1122 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKEMLLAAEMERERRRQHMSL 1181 Query: 121 IEELIYKR 128 I L +R Sbjct: 1182 IRMLELRR 1189 >gi|326406227|gb|ADZ63298.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 697 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 49/119 (41%), Gaps = 7/119 (5%) Query: 4 KVLTPEERMLCRREYKRRYYLKN----RDKILERRRRRYLKNKDKIRESYHQYYLKNKDK 59 + L +E + +R KR+ +N + K+ + R++ + + KNK+K Sbjct: 534 QKLQAKEEKIKQRLEKRKQKFENPNLLKRKVADYRKQYQQRKTMNNPNLAQKMEQKNKEK 593 Query: 60 YREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 + Y KN+ +++ +A + + D + + K +AL +E +A Sbjct: 594 -NASIQALYDKNKTQLQARAIERRMREAVPDQEKSNLSELAKNK--QKALAKEKLDERA 649 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 43/107 (40%), Gaps = 8/107 (7%) Query: 24 LKNRDKILER--RRRRYLKNKDKIRESYHQYYLKNKDKYREYK----RRYYLKNRDKMRE 77 +KI +R +R++ +N + ++ Y + + + ++ KN++K Sbjct: 537 QAKEEKIKQRLEKRKQKFENPNLLKRKVADYRKQYQQRKTMNNPNLAQKMEQKNKEK-NA 595 Query: 78 KARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE-REIARLKAKPIEE 123 + Y K ++ A E M + E E+A+ K K + + Sbjct: 596 SIQALYDKNKTQLQARAIERRMREAVPDQEKSNLSELAKNKQKALAK 642 >gi|328792397|ref|XP_001122367.2| PREDICTED: hypothetical protein LOC726647 [Apis mellifera] Length = 673 Score = 40.0 bits (92), Expect = 0.12, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 41/111 (36%), Gaps = 22/111 (19%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYL------------KNKDKYREYKRR 66 + +N I E+ + Y K+ ++ YY N++ + + + Sbjct: 573 TKIELHENIPFISEQLSKIYKDYKENFKQKQEDYYETEQHMEHKKFAPNNQNNILKNENK 632 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 + N + Y LY++++ I +EP+ + E +E + K Sbjct: 633 WIQNNI------QTKDYDVLYNENNKINFQEPIDYQQEE----NKEREQRK 673 >gi|301776168|ref|XP_002923504.1| PREDICTED: WD repeat-containing protein 60-like [Ailuropoda melanoleuca] Length = 1062 Score = 39.6 bits (91), Expect = 0.12, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 3/76 (3%) Query: 2 GRKVLTPEERMLCRREYKR--RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDK 59 G ++ PE + RE R +Y + ++ R ++D+ RE + ++D+ Sbjct: 42 GSEMDLPEYKEPKHREPDRDSKYRERPSERDGHTSRENPHGDRDRDREKQRERKKDSRDR 101 Query: 60 YREYKR-RYYLKNRDK 74 RE + +Y ++ +K Sbjct: 102 DRERHKEKYREQDTEK 117 Score = 37.0 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 1/91 (1%) Query: 3 RKVLTPEERML-CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 ++ E+ L E Y + + + +R + + ++ + + ++D+ R Sbjct: 29 KEDKKHREKKLHKGSEMDLPEYKEPKHREPDRDSKYRERPSERDGHTSRENPHGDRDRDR 88 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 E +R +RD+ RE+ ++ YR+ ++ S Sbjct: 89 EKQRERKKDSRDRDRERHKEKYREQDTEKSH 119 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 33/91 (36%), Gaps = 4/91 (4%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY----YLKNR 72 + +++ K K E Y + K + + +Y + ++ R ++R Sbjct: 29 KEDKKHREKKLHKGSEMDLPEYKEPKHREPDRDSKYRERPSERDGHTSRENPHGDRDRDR 88 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGMTK 103 +K RE+ + S + + E+ + Sbjct: 89 EKQRERKKDSRDRDRERHKEKYREQDTEKSH 119 >gi|301111203|ref|XP_002904681.1| Crinkler (CRN) family protein [Phytophthora infestans T30-4] gi|262095998|gb|EEY54050.1| Crinkler (CRN) family protein [Phytophthora infestans T30-4] Length = 731 Score = 39.6 bits (91), Expect = 0.12, Method: Composition-based stats. Identities = 27/113 (23%), Positives = 44/113 (38%), Gaps = 7/113 (6%) Query: 24 LKNRDKILERRRRR-------YLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 +NR+K ER R+R + K E + K ++ RE R+ +NR++ R Sbjct: 175 EENREKNRERDRKRAEENREENRERDRKRAEENRERDRKRAEENRERDRKRAEENRERDR 234 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKRG 129 ++A++ R +AE E R A+ EL KR Sbjct: 235 KQAKEFRRLAEETRKLAEENRERDRKRAEETRKLAEEIRKLAEEKRELDRKRA 287 Score = 38.1 bits (87), Expect = 0.44, Method: Composition-based stats. Identities = 12/84 (14%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKD--KYREYKRRYYLKNRDKMREKARQ 81 +NR++ +R +++ + E+ + + K+ + E R+ +NR++ R++A + Sbjct: 205 EENRERDRKRAEENRERDRKRAEENRERDRKQAKEFRRLAEETRKLAEENRERDRKRAEE 264 Query: 82 SYRKLYSKDSWIAPEEPMGMTKAE 105 + + + + +A+ Sbjct: 265 TRKLAEEIRKLAEEKRELDRKRAQ 288 >gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster] gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster] Length = 3131 Score = 39.6 bits (91), Expect = 0.12, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 10/128 (7%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + LKN+ E Sbjct: 1069 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNS-KLEQQRKDKELKNQQAVEER 1121 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI---EALEREIARLKAKP 120 K+R +R K E ++ K + I +E + E+ +ERE R Sbjct: 1122 KKRQEELDRLKQEELLKKQQEKEKRRQEAILAKEQELQKQKEMLLAAEMERERRRQHMSL 1181 Query: 121 IEELIYKR 128 I L +R Sbjct: 1182 IRMLELRR 1189 >gi|195377218|ref|XP_002047389.1| GJ13409 [Drosophila virilis] gi|194154547|gb|EDW69731.1| GJ13409 [Drosophila virilis] Length = 1893 Score = 39.6 bits (91), Expect = 0.13, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 8/123 (6%) Query: 3 RKVLTPEERMLC----RREYKRRYYL--KNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 R+ + +ER C +RE + NR + R+ + DK E Q + Sbjct: 407 RQRINSKERRECSNDKQRERSKDIRECSNNRQSERSKDRKEFPN--DKNIERPKQRRESS 464 Query: 57 KDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 DK RE + + ++ + R K + ++ + +K + + E L Sbjct: 465 YDKQRERSKDKRIGDKPRERSKEKLDRSIEKQRERSKEKKRDSDQSKEKSTSRSTEKLSL 524 Query: 117 KAK 119 ++K Sbjct: 525 ESK 527 Score = 38.9 bits (89), Expect = 0.23, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 51/126 (40%), Gaps = 8/126 (6%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ + +ER+ R R+ K ++I ER+R + ++ + + ++ Sbjct: 380 RIDSSKERLKGRS--SERFREKKSERISERQRINSKERRECSNDKQRERSKDIRECSNNR 437 Query: 64 KRRYYLKNRD----KMREKARQSYRKLYSKDSWIAPEEPMGMTKAE--IEALEREIARLK 117 + ++ K E+ +Q Y K + ++ +G E E L+R I + + Sbjct: 438 QSERSKDRKEFPNDKNIERPKQRRESSYDKQRERSKDKRIGDKPRERSKEKLDRSIEKQR 497 Query: 118 AKPIEE 123 + E+ Sbjct: 498 ERSKEK 503 >gi|195587988|ref|XP_002083743.1| GD13891 [Drosophila simulans] gi|194195752|gb|EDX09328.1| GD13891 [Drosophila simulans] Length = 1889 Score = 39.6 bits (91), Expect = 0.13, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 43/109 (39%), Gaps = 8/109 (7%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRR-------RRYLKNKDKIRESYHQYYLK 55 R + R + + + + + + R +R K+K+K E + + Sbjct: 401 RDRTRNKTREKSQARDQTKRKATDAEHTRVKSRKRSSSRDKRREKSKEKNGEKLKEKSTE 460 Query: 56 NKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 ++ E K + N +K+RE++++ + +SK+ ++ + Sbjct: 461 RSERRNESKVKNKDTNEEKLRERSKEK-DRNHSKERLHERKQNKSEERK 508 Score = 37.0 bits (84), Expect = 0.99, Method: Composition-based stats. Identities = 14/104 (13%), Positives = 40/104 (38%), Gaps = 5/104 (4%) Query: 6 LTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR 65 + +R R + + + KN +K+ E+ R ++ ES + N++K RE + Sbjct: 430 VKSRKRSSSRDKRREKSKEKNGEKLKEKSTER----SERRNESKVKNKDTNEEKLRERSK 485 Query: 66 RYYLKN-RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEA 108 + ++++ E+ + + + + + + Sbjct: 486 EKDRNHSKERLHERKQNKSEERKPEAKKVRNARENSKEGTDHKK 529 >gi|317036171|ref|XP_001397737.2| stress response protein nst1 [Aspergillus niger CBS 513.88] Length = 1153 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 38/103 (36%), Gaps = 3/103 (2%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E +RR + + K E R+ + + + + +++ E +R+ + + + Sbjct: 544 EEQRRKREEQKKK-REAERKAQEAERLRKEAEKQKRLREERERQAEIERKQREQKELEKK 602 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMT--KAEIEALEREIARLK 117 + +L K + E + K + E E+ + K Sbjct: 603 RREEARQNELREKKTKDERERKLREAAPKTDYEGQEKRDPQAK 645 >gi|30690730|ref|NP_849509.1| ATU2AF65A; RNA binding / nucleic acid binding / nucleotide binding [Arabidopsis thaliana] gi|19310597|gb|AAL85029.1| putative splicing factor [Arabidopsis thaliana] gi|332661289|gb|AEE86689.1| Splicing factor U2af large subunit A [Arabidopsis thaliana] Length = 542 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 1/68 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 +RE + +R K ER + K++++ E + ++ +K +E R +R++ Sbjct: 38 KRESRDHERETSRSKDREREKG-RDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRER 96 Query: 75 MREKARQS 82 +R Sbjct: 97 HHRSSRHR 104 >gi|157147968|ref|YP_001455287.1| hypothetical protein CKO_03775 [Citrobacter koseri ATCC BAA-895] gi|157085173|gb|ABV14851.1| hypothetical protein CKO_03775 [Citrobacter koseri ATCC BAA-895] Length = 378 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 8/66 (12%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 NR+K E + +Y N +++ +Y ++ + R+ NR + + R+ + Sbjct: 173 ANREKYREYYQGKYSLNDEQM-----AHYE---EEQLQLAARWSQPNRAAIAQMCRERHL 224 Query: 85 KLYSKD 90 L S D Sbjct: 225 ALASHD 230 >gi|115964466|ref|XP_001191416.1| PREDICTED: similar to WD repeat domain 60 [Strongylocentrotus purpuratus] Length = 1077 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 48/115 (41%), Gaps = 7/115 (6%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY- 67 +E+ R K R +++K ERR+ K+ K ++ + K++ +E + Sbjct: 77 KEKDKDRERRKDREGESSKEKDRERRKD-REKDSSKDKDRREKDRDGRKEREKESSKEKD 135 Query: 68 -----YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 K+R+K +++ R+ + + + A G + + E++ R K Sbjct: 136 RGEKDRSKHREKEKDRDREKKERSHRDNKESAERTDSGKKERHRDRDEKKREREK 190 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 53/131 (40%), Gaps = 8/131 (6%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRR---RRYLKNKDKIRESYH---QYYLKNK 57 K + + + + + K RDK ER R R ++DK E + K K Sbjct: 20 KEARSSGKSRHKDKDRDKEREKPRDKDRERHRDKDRDKDSHRDKEHERKRDKDRDRQKEK 79 Query: 58 DKYREYKRRYY-LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 DK RE ++ +++K RE+ R+ K SKD ++ G + E E+ + + Sbjct: 80 DKDRERRKDREGESSKEKDRER-RKDREKDSSKDKDRREKDRDGRKEREKESSKEKDRGE 138 Query: 117 KAKPIEELIYK 127 K + K Sbjct: 139 KDRSKHREKEK 149 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 46/117 (39%), Gaps = 6/117 (5%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNK------DKIRESYHQYYLKNKDKYRE 62 ++R + K R K+R+K + + R K++ +K K++ K+RE Sbjct: 87 KDREGESSKEKDRERRKDREKDSSKDKDRREKDRDGRKEREKESSKEKDRGEKDRSKHRE 146 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 ++ + +++ ++S + S + + E E+ R+ + K Sbjct: 147 KEKDRDREKKERSHRDNKESAERTDSGKKERHRDRDEKKREREKESGSRDGDKSSTK 203 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 47/122 (38%), Gaps = 15/122 (12%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRR-YLKNKDKIRESYHQY--------- 52 R+ ++R R + + + ++++ +R + R K KDK RE Sbjct: 39 REKPRDKDRERHRDKDRDKDSHRDKEHERKRDKDRDRQKEKDKDRERRKDREGESSKEKD 98 Query: 53 --YLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 K+++K + K+RD +E+ ++S KD + + + + Sbjct: 99 RERRKDREKDSSKDKDRREKDRDGRKEREKES---SKEKDRGEKDRSKHREKEKDRDREK 155 Query: 111 RE 112 +E Sbjct: 156 KE 157 >gi|42573197|ref|NP_974695.1| ATU2AF65A; RNA binding / nucleic acid binding / nucleotide binding [Arabidopsis thaliana] gi|332661288|gb|AEE86688.1| Splicing factor U2af large subunit A [Arabidopsis thaliana] Length = 565 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 1/68 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 +RE + +R K ER + K++++ E + ++ +K +E R +R++ Sbjct: 38 KRESRDHERETSRSKDREREKG-RDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRER 96 Query: 75 MREKARQS 82 +R Sbjct: 97 HHRSSRHR 104 >gi|328766753|gb|EGF76806.1| hypothetical protein BATDEDRAFT_28303 [Batrachochytrium dendrobatidis JAM81] Length = 424 Score = 39.6 bits (91), Expect = 0.15, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 45/96 (46%), Gaps = 2/96 (2%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIR--ESYHQYYLKNKDKYREYKRR 66 ER + +E + + L+ ++I E + +R KD ++ +++ Y + +++ +++ R Sbjct: 328 RERPMSNKERRNKRELEELERIPEPKNKRRKNYKDYLQCAKNWDLYSEQQQEQLKKWYRN 387 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMT 102 YY +N++ R + +Y + + M Sbjct: 388 YYSQNKESKRLRYCNNYANNSDIERGRLQKYKMSKM 423 >gi|115762736|ref|XP_785240.2| PREDICTED: similar to WD repeat domain 60, partial [Strongylocentrotus purpuratus] Length = 1073 Score = 39.6 bits (91), Expect = 0.15, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 48/115 (41%), Gaps = 7/115 (6%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY- 67 +E+ R K R +++K ERR+ K+ K ++ + K++ +E + Sbjct: 73 KEKDKDRERRKDREGESSKEKDRERRKD-REKDSSKDKDRREKDRDGRKEREKESSKEKD 131 Query: 68 -----YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 K+R+K +++ R+ + + + A G + + E++ R K Sbjct: 132 RGEKDRSKHREKEKDRDREKKERSHRDNKESAERTDSGKKERHRDRDEKKREREK 186 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 53/131 (40%), Gaps = 8/131 (6%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRR---RRYLKNKDKIRESYH---QYYLKNK 57 K + + + + + K RDK ER R R ++DK E + K K Sbjct: 16 KEARSSGKSRHKDKDRDKEREKPRDKDRERHRDKDRDKDSHRDKEHERKRDKDRDRQKEK 75 Query: 58 DKYREYKRRYY-LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 DK RE ++ +++K RE+ R+ K SKD ++ G + E E+ + + Sbjct: 76 DKDRERRKDREGESSKEKDRER-RKDREKDSSKDKDRREKDRDGRKEREKESSKEKDRGE 134 Query: 117 KAKPIEELIYK 127 K + K Sbjct: 135 KDRSKHREKEK 145 Score = 35.0 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 46/117 (39%), Gaps = 6/117 (5%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNK------DKIRESYHQYYLKNKDKYRE 62 ++R + K R K+R+K + + R K++ +K K++ K+RE Sbjct: 83 KDREGESSKEKDRERRKDREKDSSKDKDRREKDRDGRKEREKESSKEKDRGEKDRSKHRE 142 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 ++ + +++ ++S + S + + E E+ R+ + K Sbjct: 143 KEKDRDREKKERSHRDNKESAERTDSGKKERHRDRDEKKREREKESGSRDGDKSSTK 199 Score = 33.9 bits (76), Expect = 6.7, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 47/122 (38%), Gaps = 15/122 (12%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRR-YLKNKDKIRESYHQY--------- 52 R+ ++R R + + + ++++ +R + R K KDK RE Sbjct: 35 REKPRDKDRERHRDKDRDKDSHRDKEHERKRDKDRDRQKEKDKDRERRKDREGESSKEKD 94 Query: 53 --YLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 K+++K + K+RD +E+ ++S KD + + + + Sbjct: 95 RERRKDREKDSSKDKDRREKDRDGRKEREKES---SKEKDRGEKDRSKHREKEKDRDREK 151 Query: 111 RE 112 +E Sbjct: 152 KE 153 >gi|332661290|gb|AEE86690.1| Splicing factor U2af large subunit A [Arabidopsis thaliana] Length = 551 Score = 39.6 bits (91), Expect = 0.15, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 1/68 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 +RE + +R K ER + K++++ E + ++ +K +E R +R++ Sbjct: 38 KRESRDHERETSRSKDREREKG-RDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRER 96 Query: 75 MREKARQS 82 +R Sbjct: 97 HHRSSRHR 104 >gi|15234495|ref|NP_195387.1| ATU2AF65A; RNA binding / nucleic acid binding / nucleotide binding [Arabidopsis thaliana] gi|75318082|sp|O23212|U2A2A_ARATH RecName: Full=Splicing factor U2af large subunit A; AltName: Full=U2 auxiliary factor 65 kDa subunit A; AltName: Full=U2 small nuclear ribonucleoprotein auxiliary factor large subunit A; Short=U2 snRNP auxiliary factor large subunit A gi|18087531|gb|AAL58899.1|AF462805_1 At4g35590/C7A10_670 [Arabidopsis thaliana] gi|4006898|emb|CAB16828.1| splicing factor-like protein [Arabidopsis thaliana] gi|7270617|emb|CAB80335.1| splicing factor-like protein [Arabidopsis thaliana] gi|23506119|gb|AAN28919.1| At4g35590/C7A10_670 [Arabidopsis thaliana] gi|24030414|gb|AAN41365.1| putative splicing factor [Arabidopsis thaliana] gi|332661287|gb|AEE86687.1| Splicing factor U2af large subunit A [Arabidopsis thaliana] Length = 573 Score = 39.6 bits (91), Expect = 0.16, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 1/68 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 +RE + +R K ER + K++++ E + ++ +K +E R +R++ Sbjct: 38 KRESRDHERETSRSKDREREKG-RDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRER 96 Query: 75 MREKARQS 82 +R Sbjct: 97 HHRSSRHR 104 >gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis] gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis] Length = 3086 Score = 39.6 bits (91), Expect = 0.16, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 53/149 (35%), Gaps = 29/149 (19%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDK-ILERRRRRYLKNKDKIRESYH--QYYLKNKDKY 60 ++ ++ M ++ ++ +N++K E+ + + K+K ++ + K +++ Sbjct: 1121 EIAKQQQAMRDAKKQQKEELARNKEKARQEKNAKLEQQRKEKELKNQQAIEERKKRQEEL 1180 Query: 61 ------------------------REYKRRYYLKNRDKMR--EKARQSYRKLYSKDSWIA 94 E +R ++ + +R E R+ + K + Sbjct: 1181 DRLKQEELLKKQQELQKQKELLLAAEMERERRRQHMNLIRMLELRRKFEEREKKKHQLVL 1240 Query: 95 PEEPMGMTKAEIEALEREIARLKAKPIEE 123 + + + EI +L KP E+ Sbjct: 1241 DRLILRERRMAERKRDAEILQLIRKPSED 1269 >gi|156376995|ref|XP_001630643.1| predicted protein [Nematostella vectensis] gi|156217668|gb|EDO38580.1| predicted protein [Nematostella vectensis] Length = 1140 Score = 39.3 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 39/85 (45%), Gaps = 3/85 (3%) Query: 6 LTPEERMLCRREYKRRYYLK-NRDKILERRRRRYLKNKDKIRESYHQYYLK--NKDKYRE 62 T ER RR R+ Y K N K ++ R+R +++ K + K N+D+ R+ Sbjct: 3 KTSTERSQIRRSKLRKDYAKYNEYKHKDKERKRAERDRPKEQSQQEIDRKKKANRDRVRK 62 Query: 63 YKRRYYLKNRDKMREKARQSYRKLY 87 ++ KN+ K R + + K Y Sbjct: 63 FRLLQKGKNKAKARTTPKSNNEKAY 87 >gi|325118717|emb|CBZ54268.1| conserved hypothetical protein [Neospora caninum Liverpool] Length = 973 Score = 39.3 bits (90), Expect = 0.17, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 4/74 (5%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM- 75 E + K ++ ER R + + K+ + Y + K++ RE +RRY +++ Sbjct: 364 ENDAKKREKQIERENERSRAKEQR---KLEKQERAKYRQEKERLREEQRRYRNLRKEEQE 420 Query: 76 REKARQSYRKLYSK 89 ++ R Y + S+ Sbjct: 421 LDRQRGKYAESRSR 434 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 9/66 (13%), Positives = 28/66 (42%), Gaps = 5/66 (7%) Query: 32 ERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDS 91 E ++ K ++ E + +K + + +Y + ++++RE+ R+ + K+ Sbjct: 364 ENDAKKREKQIERENERSRAKEQRKLEK--QERAKYRQE-KERLREEQRRY--RNLRKEE 418 Query: 92 WIAPEE 97 + Sbjct: 419 QELDRQ 424 Score = 33.9 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 42/93 (45%), Gaps = 8/93 (8%) Query: 16 REYKRRYYLKN-----RD--KILERRRRRYLKNKDKIRESYHQYYLKNKDK-YREYKRRY 67 ++ +++ +N ++ K+ ++ R +Y + K+++RE +Y K++ + +R Sbjct: 368 KKREKQIERENERSRAKEQRKLEKQERAKYRQEKERLREEQRRYRNLRKEEQELDRQRGK 427 Query: 68 YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMG 100 Y ++R + + + S+ P G Sbjct: 428 YAESRSRGMDYDGGYRPGGHYGRSYGRSYSPSG 460 >gi|300021651|ref|YP_003754262.1| hypothetical protein Hden_0114 [Hyphomicrobium denitrificans ATCC 51888] gi|299523472|gb|ADJ21941.1| hypothetical protein Hden_0114 [Hyphomicrobium denitrificans ATCC 51888] Length = 242 Score = 39.3 bits (90), Expect = 0.17, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 30/63 (47%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 T E+R E +R++ N DK R +R Y KN DK R + + ++ ++ R+ Sbjct: 42 TVEQRYQHDTERRRKWRQNNPDKERARHKRYYAKNTDKHRATCARVRMERLEERRQKDIE 101 Query: 67 YYL 69 Y Sbjct: 102 TYR 104 Score = 37.7 bits (86), Expect = 0.47, Method: Composition-based stats. Identities = 21/58 (36%), Positives = 32/58 (55%) Query: 31 LERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYS 88 ERRR+ N DK R + +YY KN DK+R R ++ ++ R+K ++YR S Sbjct: 51 TERRRKWRQNNPDKERARHKRYYAKNTDKHRATCARVRMERLEERRQKDIETYRLTRS 108 >gi|195470186|ref|XP_002087389.1| GE16637 [Drosophila yakuba] gi|194173490|gb|EDW87101.1| GE16637 [Drosophila yakuba] Length = 4999 Score = 39.3 bits (90), Expect = 0.20, Method: Composition-based stats. Identities = 14/105 (13%), Positives = 38/105 (36%), Gaps = 4/105 (3%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 + + ++ K+ + +R R K ++K + ++K ++ K Sbjct: 2007 KEQREKEIREKDLREKEQRERDNREKELREKELREKEMREKEQREKELHREKDQREK--- 2063 Query: 74 KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 + REK + ++S + + + + E++ L A Sbjct: 2064 EHREKEQSRRAMEVEQESRVGRMRELSSYQKSKMDIAGEVSSLAA 2108 >gi|281203478|gb|EFA77678.1| hypothetical protein PPL_12287 [Polysphondylium pallidum PN500] Length = 349 Score = 38.9 bits (89), Expect = 0.21, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 34/89 (38%), Gaps = 14/89 (15%) Query: 39 LKNKDKIRESYHQYYLK--NKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPE 96 ++++ R +Y K N++K R+ Y R+ + E Sbjct: 5 EEHREACRVYMQKYRSKPENREKIRQQNAEYR----------KRKKLESIRQVSEIGNLE 54 Query: 97 EPMGMTKAEIEALEREI--ARLKAKPIEE 123 + + ++ I LE+++ A K +E+ Sbjct: 55 QNLDSYQSRISMLEKDLKEALFKVAILEQ 83 >gi|325847095|ref|ZP_08169921.1| acyltransferase [Anaerococcus hydrogenalis ACS-025-V-Sch4] gi|325481067|gb|EGC84112.1| acyltransferase [Anaerococcus hydrogenalis ACS-025-V-Sch4] Length = 605 Score = 38.9 bits (89), Expect = 0.21, Method: Composition-based stats. Identities = 9/51 (17%), Positives = 27/51 (52%) Query: 42 KDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 K+K ++ + +NK+ + ++Y K+++K + + +K+ K ++ Sbjct: 384 KEKDMKNLEKRISENKENIEKNNKKYINKDKNKEKTNQKLPNKKISKKKAY 434 >gi|46125097|ref|XP_387102.1| hypothetical protein FG06926.1 [Gibberella zeae PH-1] Length = 2022 Score = 38.9 bits (89), Expect = 0.21, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 47/96 (48%), Gaps = 3/96 (3%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYS 88 +I ERR+ + + +K+K + K++ + + + +Y + R K+ EK + RK Sbjct: 1885 RIEERRQSKLVGDKEKDATMLAKLREKHEKEVAKQEEKYRREMR-KLEEKRERDQRKAEE 1943 Query: 89 KDSWIAPEEPMGMTKAEIE--ALEREIARLKAKPIE 122 + A +E E+E ER++A+ + + +E Sbjct: 1944 RRRKAAEQEEKNKLSLELERVRAERDVAQRQIELLE 1979 Score = 38.9 bits (89), Expect = 0.27, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 43/104 (41%), Gaps = 2/104 (1%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 E ++ + +++K + K++ ++ + +Y + + E KR + ++ Sbjct: 1887 EERRQSKLVGDKEKDATMLAKLREKHEKEVAKQEEKYRRE--MRKLEEKRERDQRKAEER 1944 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 R KA + K + + +IE LE ++ L+A+ Sbjct: 1945 RRKAAEQEEKNKLSLELERVRAERDVAQRQIELLEGQVGELQAQ 1988 >gi|298715358|emb|CBJ27986.1| conserved unknown protein [Ectocarpus siliculosus] Length = 1621 Score = 38.9 bits (89), Expect = 0.22, Method: Composition-based stats. Identities = 14/75 (18%), Positives = 38/75 (50%), Gaps = 4/75 (5%) Query: 25 KNRDKILERR-RRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 K+ ++ ER ++ + +K+K +E+ ++ + E R+ K + ++ R + Sbjct: 1446 KDAEEKRERALQQYRIWSKEKSKEARKARRVEQAKQAEEAHRKRRRK-AEGLKAYRR--W 1502 Query: 84 RKLYSKDSWIAPEEP 98 +L SK+++ +P+ Sbjct: 1503 LRLASKEAYFSPQNN 1517 Score = 38.1 bits (87), Expect = 0.43, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 36/73 (49%), Gaps = 4/73 (5%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 +GR EE+ R ++Y + +++K E R+ R ++ + E++ + K + Sbjct: 1441 LGRGHKDAEEK---RERALQQYRIWSKEKSKEARKARRVEQAKQAEEAHRKRRRK-AEGL 1496 Query: 61 REYKRRYYLKNRD 73 + Y+R L +++ Sbjct: 1497 KAYRRWLRLASKE 1509 >gi|322510447|gb|ADX05762.1| hypothetical protein OLV1 [Organic Lake virophage] Length = 205 Score = 38.9 bits (89), Expect = 0.24, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 32/56 (57%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 + + E ++ Y ++K +I E QYY +K ++Y++ Y ++++K++E + Sbjct: 111 EDLKEYHKQYYQEHKAEIVEKKKQYYQDHKADLKQYEKLYREEHKEKIKEYHKLYR 166 Score = 37.7 bits (86), Expect = 0.51, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 50/88 (56%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +EY ++YY +++ +I+E++++ Y +K +++ Y ++K+K +EY + Y ++++K+ Sbjct: 114 KEYHKQYYQEHKAEIVEKKKQYYQDHKADLKQYEKLYREEHKEKIKEYHKLYREEHKEKI 173 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTK 103 +EK Y +++ +K Sbjct: 174 KEKVECKYCAKLLSKQYMSRHHKTCKSK 201 >gi|307111906|gb|EFN60140.1| hypothetical protein CHLNCDRAFT_56599 [Chlorella variabilis] Length = 1852 Score = 38.9 bits (89), Expect = 0.24, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 41/107 (38%), Gaps = 8/107 (7%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYH------QYYLKNKDKYRE 62 +E+ML +E K R++ L R K+ +++ + + + +K R+ Sbjct: 609 QEKMLREKERMDMKVAKERERELARLEGERRKHLERVMKEQKKLIALQEKERERLEKMRQ 668 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 + + ++ ++ Q R++ + P E +AL Sbjct: 669 REEKKRE--KEVLKALQAQEKREMRLRQREAGVTGPRDDADIEWDAL 713 >gi|221118534|ref|XP_002167966.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 1058 Score = 38.9 bits (89), Expect = 0.25, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 29/87 (33%), Gaps = 8/87 (9%) Query: 15 RREYKRRYYLKNRD--KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 RE + ++ N K E+R + N ++ + K + R K + Sbjct: 427 NREKREKFKESNVRNLKNREKREKFKESNVRNLKNREKREKFKESNVRNLKNREKREKFK 486 Query: 73 D------KMREKARQSYRKLYSKDSWI 93 + K REK + + + I Sbjct: 487 ESNVRNLKNREKREKFKESNFQGEMHI 513 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 11/83 (13%), Positives = 26/83 (31%) Query: 22 YYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 Y KN E+R + N ++ + K + R K ++ + Sbjct: 419 YGFKNHHSNREKREKFKESNVRNLKNREKREKFKESNVRNLKNREKREKFKESNVRNLKN 478 Query: 82 SYRKLYSKDSWIAPEEPMGMTKA 104 ++ K+S + + + Sbjct: 479 REKREKFKESNVRNLKNREKREK 501 >gi|158291201|ref|XP_001230848.2| AGAP004686-PB [Anopheles gambiae str. PEST] gi|157017784|gb|EAU77034.2| AGAP004686-PB [Anopheles gambiae str. PEST] Length = 777 Score = 38.9 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 10/60 (16%), Positives = 29/60 (48%), Gaps = 3/60 (5%) Query: 28 DKILERRRRRYLKNKDKIR--ESYHQYY-LKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 + + ++ Y K ++ E ++ +++ +E +R Y K+RD+ +++ + R Sbjct: 168 EHVSSSGKKYYYNCKTEVSQWEKPREWIEKESRVLVKEQQRDYREKDRDRDKDRCGEKER 227 >gi|158291199|ref|XP_001688402.1| AGAP004686-PA [Anopheles gambiae str. PEST] gi|157017783|gb|EDO64195.1| AGAP004686-PA [Anopheles gambiae str. PEST] Length = 900 Score = 38.9 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 10/60 (16%), Positives = 29/60 (48%), Gaps = 3/60 (5%) Query: 28 DKILERRRRRYLKNKDKIR--ESYHQYY-LKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 + + ++ Y K ++ E ++ +++ +E +R Y K+RD+ +++ + R Sbjct: 291 EHVSSSGKKYYYNCKTEVSQWEKPREWIEKESRVLVKEQQRDYREKDRDRDKDRCGEKER 350 >gi|221111106|ref|XP_002164743.1| PREDICTED: similar to ATP-dependent DNA helicase PIF1 [Hydra magnipapillata] Length = 682 Score = 38.9 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 26/108 (24%), Positives = 51/108 (47%), Gaps = 7/108 (6%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 L+PEE M+ +R Y K + K + LK +++ R +YY +NK+K +K Sbjct: 550 ELSPEELMM-----RREYERKRKKKNSKLSPEELLKKRERDRNRKKKYYTENKNKIL-HK 603 Query: 65 RRYYLKNRDKMREKAR-QSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 + + + R +RE+ + + Y K Y S + +K ++ ++ Sbjct: 604 QILHKQGRMYIREENQDEWYGKWYRPHSLTQWNKHKSSSKWKVNTIKE 651 >gi|154315015|ref|XP_001556831.1| hypothetical protein BC1G_04849 [Botryotinia fuckeliana B05.10] gi|172052491|sp|A6RW62|NST1_BOTFB RecName: Full=Stress response protein nst1 gi|150848387|gb|EDN23580.1| hypothetical protein BC1G_04849 [Botryotinia fuckeliana B05.10] Length = 1168 Score = 38.9 bits (89), Expect = 0.27, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 31/79 (39%), Gaps = 1/79 (1%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 R + + R + + + +K E +R + +N+ K + K E Sbjct: 525 RAMAEEKARKDAEKAAEEASLREIEEKKAEAQRLKREENRKKKEAQKKADEEERVRKESE 584 Query: 63 YKRRYYLKNRDKMREKARQ 81 +RR + R++ E+ R+ Sbjct: 585 KQRRLQEQ-RERQAEQERK 602 >gi|67540830|ref|XP_664189.1| hypothetical protein AN6585.2 [Aspergillus nidulans FGSC A4] gi|40738924|gb|EAA58114.1| hypothetical protein AN6585.2 [Aspergillus nidulans FGSC A4] gi|259480162|tpe|CBF71041.1| TPA: DEAH-box RNA helicase (Dhr1), putative (AFU_orthologue; AFUA_6G04330) [Aspergillus nidulans FGSC A4] Length = 956 Score = 38.5 bits (88), Expect = 0.28, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 34/83 (40%), Gaps = 9/83 (10%) Query: 46 RESYHQYYLK--------NKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 R+ ++ + N + + + ++K+RE+ R + K+ +K + Sbjct: 7 RQRKQKHRQQAANAAVDTNAAEIQPLSKSEREARKEKLREELRAQHTKISAKKQKRLDKY 66 Query: 98 PMGMTKAEIE-ALEREIARLKAK 119 K E AL +++A+ K + Sbjct: 67 IENKLKKEENTALLKKLAQSKVE 89 >gi|195350023|ref|XP_002041541.1| GM16721 [Drosophila sechellia] gi|194123314|gb|EDW45357.1| GM16721 [Drosophila sechellia] Length = 2358 Score = 38.5 bits (88), Expect = 0.28, Method: Composition-based stats. Identities = 18/98 (18%), Positives = 44/98 (44%), Gaps = 6/98 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK---NKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + + ++ K+ + +R R K +KD + + + K+ +RE +R + Sbjct: 1999 KEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEIREKEQREKELHREKDQRERE-H 2057 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 R+K + + + D+ +A ++ G AE++A+ Sbjct: 2058 REKEQSRRAMDVEQEGQLDTPLANKD--GKRTAEMQAI 2093 >gi|115944197|ref|XP_001187929.1| PREDICTED: similar to zinc finger protein 291 [Strongylocentrotus purpuratus] Length = 1885 Score = 38.5 bits (88), Expect = 0.28, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 44/122 (36%), Gaps = 8/122 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 ++ +++ ++ RR+ + E+ +Y + ++K K + Sbjct: 1133 EERQRKNEEKAAKEEAVQERRKALEGERIARLEAMKKYRREQEEKMNALKEQREQAREST 1192 Query: 75 MREKARQSYRKLY-----SKDSWIAPEEPMGMTKAEIEALER---EIARLKAKPIEELIY 126 ++EKA++ ++L ++ + E + E R KA + L + Sbjct: 1193 VKEKAKEREQRLSALTAAQNAKEEEIQKKIQQKHDESQKRHNQQIEQKREKAIELSTLRH 1252 Query: 127 KR 128 R Sbjct: 1253 YR 1254 >gi|123233027|emb|CAM23087.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor [Mus musculus] Length = 779 Score = 38.5 bits (88), Expect = 0.30, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 34/98 (34%), Gaps = 8/98 (8%) Query: 15 RREYKRRY-YLKNRDKILERRRRRYL----KNKDKIRESYHQYYLKNKDKYREYK---RR 66 +E + K ++ E + ++ D + ++ + K R RR Sbjct: 401 SKEKDKMREREKEKEAGKELKYPWRELMKDEDSDPYKFKIKEFEDIDAAKVRLKDGIVRR 460 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 K++DK +++ R K + + + KA Sbjct: 461 EREKHKDKKKDRERSKREKDKRERERLKEKNREDKIKA 498 >gi|224045523|ref|XP_002196817.1| PREDICTED: transmembrane channel-like 2 [Taeniopygia guttata] Length = 973 Score = 38.5 bits (88), Expect = 0.31, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R + Y+ KNRD R + K++ K + Q +NK+K + KR +N+ K Sbjct: 123 RASRRTNYHRKNRDGKKAREEGSHQKDRGKTALNKDQKESENKEK-KTTKRGKQGENKQK 181 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELI 125 K S S D EE + M K ++E ++ I+ ++ KP + Sbjct: 182 KGGKNSTSSASSNSSDEESLSEEELVMLKEQVEQKKKLISTMRNKPWRMMK 232 >gi|46107368|ref|XP_380743.1| hypothetical protein FG00567.1 [Gibberella zeae PH-1] Length = 2627 Score = 38.5 bits (88), Expect = 0.33, Method: Composition-based stats. Identities = 15/96 (15%), Positives = 39/96 (40%), Gaps = 8/96 (8%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNK------ 57 ++ + L R L+ + E ++ ++++R+ + K K Sbjct: 2533 ELFKTKHEALKTRAQILMESLQKKFGTSEYSKQLLAI-REEVRKKREERSSKRKIDAIIK 2591 Query: 58 -DKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 +KY KR+ + KN+ +++ ++++ S W Sbjct: 2592 PEKYGRDKRKKFEKNKSRIKTRSKEQKDMRQSFKRW 2627 >gi|169606838|ref|XP_001796839.1| hypothetical protein SNOG_06469 [Phaeosphaeria nodorum SN15] gi|121748600|sp|Q0UP45|SPB4_PHANO RecName: Full=ATP-dependent rRNA helicase SPB4 gi|111065180|gb|EAT86300.1| hypothetical protein SNOG_06469 [Phaeosphaeria nodorum SN15] Length = 633 Score = 38.5 bits (88), Expect = 0.35, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 42/94 (44%), Gaps = 2/94 (2%) Query: 27 RDKILERRR-RRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRK 85 +DK E+ R ++K++ + Q+ ++++K +++ ++ REK + Sbjct: 532 KDKAREKLRLEELERDKEEGTKK-KQHKKEDREKSAWTEQKESKATKEVRREKKKSKREH 590 Query: 86 LYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 + +A IE + +++A+ +A+ Sbjct: 591 ERLAKMTDEERKEEDRVQAMIEQMRKKVAKQEAE 624 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 39/94 (41%), Gaps = 6/94 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R + + +++++ ++++ ++++K + + K+ RE K+ R Sbjct: 536 REKLRLEELERDKEEGT-KKKQHKKEDREKSAWTEQKESKATKEVRREKKKSKREHER-- 592 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEA 108 A+ + + +D A E M A+ EA Sbjct: 593 ---LAKMTDEERKEEDRVQAMIEQMRKKVAKQEA 623 >gi|221054494|ref|XP_002258386.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium knowlesi strain H] gi|193808455|emb|CAQ39158.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium knowlesi strain H] Length = 3192 Score = 38.5 bits (88), Expect = 0.35, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 38/83 (45%), Gaps = 9/83 (10%) Query: 28 DKILERRRRRYLKNK----DKIRESYHQYYLKN---KDKYREYKRRYYLKNRD--KMREK 78 +++ ER++ R L K +I + + Y + KDK +E Y +++D KM E+ Sbjct: 3078 ERLKERKKLRRLNKKLRKCQQITNTKKKRYNEQMLEKDKIKEKTALLYKEHKDLTKMLEQ 3137 Query: 79 ARQSYRKLYSKDSWIAPEEPMGM 101 + Y + DS E GM Sbjct: 3138 MKGQYINISRNDSETFSSEEYGM 3160 >gi|182418538|ref|ZP_02949825.1| phage tail tape measure protein, family, core region domain protein [Clostridium butyricum 5521] gi|182377607|gb|EDT75156.1| phage tail tape measure protein, family, core region domain protein [Clostridium butyricum 5521] Length = 1707 Score = 38.1 bits (87), Expect = 0.36, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 42/106 (39%), Gaps = 6/106 (5%) Query: 4 KVLTPEERMLCRREYKRRY--YLKN-RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 +VL + K+ Y +KN + + E+ + +++ K + + Sbjct: 1417 EVLQAKMNNTASEADKKAYAIKIKNAKAALAEKEKEWDREDEKKKLQEKIDNINSKAEAQ 1476 Query: 61 REYKRRYYLKNRDKMREKAR---QSYRKLYSKDSWIAPEEPMGMTK 103 +E + Y ++K E+ + + Y KL DS A M +T+ Sbjct: 1477 KEALKEEYEATKEKFDEQIKVAEEYYEKLLETDSINAQARYMLLTR 1522 >gi|237668517|ref|ZP_04528501.1| phage tail tape measure protein, family, core region domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656865|gb|EEP54421.1| phage tail tape measure protein, family, core region domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1718 Score = 38.1 bits (87), Expect = 0.41, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 42/106 (39%), Gaps = 6/106 (5%) Query: 4 KVLTPEERMLCRREYKRRY--YLKN-RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 +VL + K+ Y +KN + + E+ + +++ K + + Sbjct: 1428 EVLQAKMNNTASEADKKAYAIKIKNAKAALAEKEKEWDREDEKKKLQEKIDNINSKAEAQ 1487 Query: 61 REYKRRYYLKNRDKMREKAR---QSYRKLYSKDSWIAPEEPMGMTK 103 +E + Y ++K E+ + + Y KL DS A M +T+ Sbjct: 1488 KEALKEEYEATKEKFDEQIKVAEEYYEKLLETDSINAQARYMLLTR 1533 >gi|154422195|ref|XP_001584110.1| Beige/BEACH domain containing protein [Trichomonas vaginalis G3] gi|121918355|gb|EAY23124.1| Beige/BEACH domain containing protein [Trichomonas vaginalis G3] Length = 2734 Score = 38.1 bits (87), Expect = 0.42, Method: Composition-based stats. Identities = 19/120 (15%), Positives = 49/120 (40%), Gaps = 7/120 (5%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + + ++ ++ + +N +K+ E++ +N + +NK K ++ Sbjct: 1141 ENKEEKGKIQAEKQKFEKIQDEN-EKVDEKK----EEN-EVKSNDNKDLDQENKIKNQKS 1194 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL-EREIARLKAKPIE 122 ++ +KN +K+ ++ +S K E K E + E+EI + P++ Sbjct: 1195 QQENQVKNEEKVEKQPNESQTKENEGKQLNKDETKDQTNKNEENPVQEKEINKENVSPVK 1254 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 44/103 (42%), Gaps = 15/103 (14%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRE-------SYHQYYLKN 56 KV +E + + +N+ K + ++ +KN++K+ + ++ N Sbjct: 1165 KVDEKKEENEVKSNDNKDLDQENKIKNQKSQQENQVKNEEKVEKQPNESQTKENEGKQLN 1224 Query: 57 KDKYREYKRRYYLK-------NRDKMRE-KARQSYRKLYSKDS 91 KD+ ++ + N++ + K+ +++ K+ +K+ Sbjct: 1225 KDETKDQTNKNEENPVQEKEINKENVSPVKSDENHDKIRNKEE 1267 >gi|70725623|ref|YP_252537.1| hypothetical protein SH0622 [Staphylococcus haemolyticus JCSC1435] gi|68446347|dbj|BAE03931.1| unnamed protein product [Staphylococcus haemolyticus JCSC1435] Length = 735 Score = 38.1 bits (87), Expect = 0.43, Method: Composition-based stats. Identities = 11/95 (11%), Positives = 32/95 (33%), Gaps = 8/95 (8%) Query: 22 YYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 + ++ E R N+ ++E +H +NK + + Y + R K++++ + Sbjct: 550 RFKARKESRKETHRIHKE-NRGLLKEEHHATPEQNKMDIKAKREEYKAE-RQKVKDEKKA 607 Query: 82 SYRKLY------SKDSWIAPEEPMGMTKAEIEALE 110 + + + + E + Sbjct: 608 QREECANSFNKVQQLMRVVNHNIFMKMRQEQNSSN 642 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 38/106 (35%), Gaps = 7/106 (6%) Query: 11 RMLCRREYKRRYYLKNRDKILERRRRRY---LKNKDKIRESYHQY---YLKNKDKYREYK 64 R R+E ++ + +++ + + +NK I+ +Y K KD+ + + Sbjct: 550 RFKARKESRKETHRIHKENRGLLKEEHHATPEQNKMDIKAKREEYKAERQKVKDEKKA-Q 608 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 R + +K+++ R ++ K + I Sbjct: 609 REECANSFNKVQQLMRVVNHNIFMKMRQEQNSSNVLEVSLIINQYH 654 >gi|261328986|emb|CBH11964.1| ATP-dependent RNA helicase, putative [Trypanosoma brucei gambiense DAL972] Length = 1009 Score = 38.1 bits (87), Expect = 0.43, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 45/113 (39%), Gaps = 5/113 (4%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY- 67 +++ + +R K +K L R + K+K+ ++ + +K + Sbjct: 72 KKKKRLEVKSERNLRTKEAEKQLSLAREKVEKHKEYVKSEACAKQKEKAEKRKALNDMRV 131 Query: 68 ----YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 +N +K++++ S ++ Y E G T+ EA++R+ L Sbjct: 132 SLHERRENEEKLQKQRVVSSQQEYISKLVANIAEEEGRTRQVREAIDRDTLDL 184 >gi|72390565|ref|XP_845577.1| pre-mRNA splicing factor ATP-dependent RNA helicase [Trypanosoma brucei TREU927] gi|62358798|gb|AAX79251.1| pre-mRNA splicing factor ATP-dependent RNA helicase, putative [Trypanosoma brucei] gi|70802112|gb|AAZ12018.1| pre-mRNA splicing factor ATP-dependent RNA helicase, putative [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 1009 Score = 38.1 bits (87), Expect = 0.44, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 45/113 (39%), Gaps = 5/113 (4%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY- 67 +++ + +R K +K L R + K+K+ ++ + +K + Sbjct: 72 KKKKRLEVKSERNLRTKEAEKQLSLAREKVEKHKEYVKSEACAKQKEKAEKRKALNDMRV 131 Query: 68 ----YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 +N +K++++ S ++ Y E G T+ EA++R+ L Sbjct: 132 SLHERRENEEKLQKQRVVSSQQEYISKLVANIAEEEGRTRQVREAIDRDTLDL 184 >gi|24658459|ref|NP_647974.1| CG5146 [Drosophila melanogaster] gi|23094104|gb|AAF50767.2| CG5146 [Drosophila melanogaster] Length = 1929 Score = 38.1 bits (87), Expect = 0.46, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 46/121 (38%), Gaps = 7/121 (5%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 + R + + K+++K+ ++ KN+D +E ++ +K E K + Sbjct: 435 RAKSRKRSSSRDKRREKSKEKLGDK-----EKNEDLSKEKLEGKSTEHSEKRNESKVKNK 489 Query: 69 LKNRDKMREKARQS-YRKLYSKDSWIAPEEPMGMTKA-EIEALEREIARLKAKPIEELIY 126 N + +K R+ + +SK+ + + E + + I K P + Sbjct: 490 DTNEENANKKLRERSKDRNHSKERLHERTQNKSEERKPEAKKVRNAIENSKEAPTDRKKD 549 Query: 127 K 127 K Sbjct: 550 K 550 >gi|309357428|emb|CAP35738.2| hypothetical protein CBG_18257 [Caenorhabditis briggsae AF16] Length = 2277 Score = 38.1 bits (87), Expect = 0.46, Method: Composition-based stats. Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 6/96 (6%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYH--QYYLKNKDKYREYK 64 E++ + Y+ R +NR R+ + + +++ + KNKD+ Sbjct: 24 LAEQKSWHQLAYEDRLRRENRKIPKICRKAFFRWHLEQLEARKKFKRLREKNKDESIYVA 83 Query: 65 RRYY---LKNRDKMR-EKARQSYRKLYSKDSWIAPE 96 +++Y K+ K R E + Q KL SK+ IA E Sbjct: 84 KQWYYVLKKHPLKPRVEWSFQRVHKLSSKNERIAQE 119 >gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni] gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni] Length = 3148 Score = 37.7 bits (86), Expect = 0.47, Method: Composition-based stats. Identities = 20/151 (13%), Positives = 54/151 (35%), Gaps = 33/151 (21%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKIL--------ERRRRRYLKNKDKIRESYHQYYLK 55 ++ ++ M ++ ++ +N++K ++R+ + LKN+ I E + Sbjct: 1155 EIAKQQQAMRDAKKQQKEELARNKEKARQEKNAKLEQQRKEKELKNQQAIEERKK--RQE 1212 Query: 56 NKDKYR---------------------EYKRRYYLKNRDKMR--EKARQSYRKLYSKDSW 92 D+ + E +R ++ + +R E R+ + K Sbjct: 1213 ELDRLKQEELIKKQQELQKQKELLLAAEMERERRRQHMNLIRMLEVRRKFEEREKKKHQL 1272 Query: 93 IAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 + + + + +I +L +P E+ Sbjct: 1273 VLDRLILRERRMAERKRDADILQLIRRPNED 1303 >gi|268574580|ref|XP_002642269.1| Hypothetical protein CBG18257 [Caenorhabditis briggsae] Length = 2245 Score = 37.7 bits (86), Expect = 0.47, Method: Composition-based stats. Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 6/96 (6%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYH--QYYLKNKDKYREYK 64 E++ + Y+ R +NR R+ + + +++ + KNKD+ Sbjct: 13 LAEQKSWHQLAYEDRLRRENRKIPKICRKAFFRWHLEQLEARKKFKRLREKNKDESIYVA 72 Query: 65 RRYY---LKNRDKMR-EKARQSYRKLYSKDSWIAPE 96 +++Y K+ K R E + Q KL SK+ IA E Sbjct: 73 KQWYYVLKKHPLKPRVEWSFQRVHKLSSKNERIAQE 108 >gi|332020478|gb|EGI60893.1| Vesicular glutamate transporter 3 [Acromyrmex echinatior] Length = 888 Score = 37.7 bits (86), Expect = 0.49, Method: Composition-based stats. Identities = 12/100 (12%), Positives = 37/100 (37%), Gaps = 11/100 (11%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN---KDKIRESYHQYYLKNKDKY 60 K ++ + ++ ++ +N + E+ + KN ++ ++ +N + Sbjct: 556 KQSKKRDKHVKEQKKRKNSSERNNEDSSEKH--LHEKNSYKPEQKWKNKRTKRQENGEVI 613 Query: 61 REYKRRYYLKNRDKMREKARQS--YRKLYSKDSWIAPEEP 98 E +N +K + + + K++ +EP Sbjct: 614 NEEGN----RNIEKKLKYLKDKTGTQNKKDKNAKERSKEP 649 Score = 37.0 bits (84), Expect = 0.83, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYY--LKN---KDKYREY 63 +ER + + K RDK ++ +++ + ++ E + + KN ++ + Sbjct: 543 KERKVDVEYRTKDKQSKKRDKHVKEQKK-RKNSSERNNEDSSEKHLHEKNSYKPEQKWKN 601 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 KR +N + + E + R + K ++ + K + A ER Sbjct: 602 KRTKRQENGEVINE---EGNRNIEKKLKYLKDKTG-TQNKKDKNAKER 645 >gi|156538281|ref|XP_001603300.1| PREDICTED: similar to GA10623-PA [Nasonia vitripennis] Length = 2101 Score = 37.7 bits (86), Expect = 0.49, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 44/119 (36%), Gaps = 8/119 (6%) Query: 6 LTPEERMLCRREYKRRYYLKNRDKILERRRRRYL-KNKDKIRESYHQYYL-------KNK 57 +E +E R + +++ E RR +N+ + E + +N+ Sbjct: 768 KLAKEEAQRHKEQARLIKEQEKNERQEAVRRERELRNQQLLEERERRRQHMALIRALENR 827 Query: 58 DKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 K E +++ +++ K R++ ++ + +P+ + EI R+ Sbjct: 828 RKMEEREKKRLEARAERIASKERRAEQRRMELELVEQIRKPVEDMELTDHKPLPEIKRM 886 >gi|302817814|ref|XP_002990582.1| hypothetical protein SELMODRAFT_428998 [Selaginella moellendorffii] gi|300141750|gb|EFJ08459.1| hypothetical protein SELMODRAFT_428998 [Selaginella moellendorffii] Length = 1015 Score = 37.7 bits (86), Expect = 0.50, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 57/132 (43%), Gaps = 13/132 (9%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLK--------NKDKIRESYHQYYL 54 RK E R+ E + + K ++ RR++ + +++K +E Sbjct: 443 RKRKAEEARIAKEHEAQDKRMKKEMERQEAMRRKKEDQERKESEKVHREKQKELERLARE 502 Query: 55 KNKDKY---REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 K +++ R++K+ Y + +K+ +K Q KL K+ +E + A A + Sbjct: 503 KAREEERLQRQHKKEY--ERMEKLLQKENQRVAKLRQKEEMRREKEATKLRAAYERATAK 560 Query: 112 EIARLKAKPIEE 123 ++A+L I++ Sbjct: 561 KLAKLSTGLIDD 572 >gi|47223161|emb|CAG11296.1| unnamed protein product [Tetraodon nigroviridis] Length = 1267 Score = 37.7 bits (86), Expect = 0.50, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 31/79 (39%), Gaps = 6/79 (7%) Query: 41 NKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMG 100 N +KI Y + + RE ++ ++M+ + + + P+ + Sbjct: 211 NMEKIVSEVKDQYEEIAARSREEAENWHRTKFEQMKA------EANHYGEGFCNPKYEIS 264 Query: 101 MTKAEIEALEREIARLKAK 119 K +I L+ EI K++ Sbjct: 265 KLKQKINRLQIEIETSKSQ 283 >gi|308492011|ref|XP_003108196.1| hypothetical protein CRE_10060 [Caenorhabditis remanei] gi|308249044|gb|EFO92996.1| hypothetical protein CRE_10060 [Caenorhabditis remanei] Length = 4373 Score = 37.7 bits (86), Expect = 0.52, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 34/82 (41%) Query: 46 RESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 R + ++Y KN K + Y K ++ E + + +DS + P+ + M Sbjct: 2722 RAQWKKWYEKNIAKAEQKDFVYRTKTEEEKDEMDIEEFFTNQEQDSQVLPDSDVSMLLEA 2781 Query: 106 IEALEREIARLKAKPIEELIYK 127 IE + + K++ + Y+ Sbjct: 2782 IEQKKFTLVDEKSRKQSDERYE 2803 >gi|195337671|ref|XP_002035452.1| GM14709 [Drosophila sechellia] gi|194128545|gb|EDW50588.1| GM14709 [Drosophila sechellia] Length = 1759 Score = 37.7 bits (86), Expect = 0.52, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 36/78 (46%), Gaps = 4/78 (5%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 RDK E+ + KN D +E + + ++ E K + N +K+RE++++ + Sbjct: 439 RDKRREKSK---EKNGDLSKEKLKEISTERSERRNESKVKNKDTNEEKLRERSKEK-DRN 494 Query: 87 YSKDSWIAPEEPMGMTKA 104 +SK+ ++ + Sbjct: 495 HSKERLHERKQNKSEERK 512 Score = 33.9 bits (76), Expect = 8.0, Method: Composition-based stats. Identities = 16/112 (14%), Positives = 43/112 (38%), Gaps = 2/112 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN-RD 73 R + + + KN D E+ + + ++ ES + N++K RE + + ++ Sbjct: 439 RDKRREKSKEKNGDLSKEKLKEISTERSERRNESKVKNKDTNEEKLRERSKEKDRNHSKE 498 Query: 74 KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELI 125 ++ E+ + + + + + + ++ + L K E L Sbjct: 499 RLHERKQNKSEERKPEAKKVRNARENSKEGTDHKK-DKLLGPLGDKTDERLK 549 >gi|115738156|ref|XP_781895.2| PREDICTED: similar to zinc finger protein 291 [Strongylocentrotus purpuratus] Length = 1863 Score = 37.7 bits (86), Expect = 0.54, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 44/122 (36%), Gaps = 8/122 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 ++ +++ ++ RR+ + E+ +Y + ++K K + Sbjct: 1133 EERQRKNEEKAAKEEAVQERRKALEGERIARLEAMKKYRREQEEKMNALKEQREQAREST 1192 Query: 75 MREKARQSYRKLY-----SKDSWIAPEEPMGMTKAEIEALER---EIARLKAKPIEELIY 126 ++EKA++ ++L ++ + E + E R KA + L + Sbjct: 1193 VKEKAKEREQRLSALTAAQNAKEEEIQKKIQQKHDESQKRHNQQIEQKREKAIELSTLRH 1252 Query: 127 KR 128 R Sbjct: 1253 YR 1254 >gi|91795028|ref|YP_564679.1| peptidoglycan-binding LysM [Shewanella denitrificans OS217] gi|91717030|gb|ABE56956.1| Peptidoglycan-binding LysM [Shewanella denitrificans OS217] Length = 5236 Score = 37.7 bits (86), Expect = 0.55, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 29/92 (31%), Gaps = 20/92 (21%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYL-------KNKDKYR----------EYKRR 66 N+ KI + ++ +K+ Q +N+ E R Sbjct: 4842 QDNKAKIAIKSQQVTEDAIEKMESKQQQRRQSFIEGALENQQDINDKITKLGETVEKNRI 4901 Query: 67 YYLKNRDKMREKARQS---YRKLYSKDSWIAP 95 DK+RE+ R+ +R K+ P Sbjct: 4902 QREAESDKIRERYRERDINFRINNPKEELFTP 4933 >gi|58336988|ref|YP_193573.1| recombinase A [Lactobacillus acidophilus NCFM] gi|227903547|ref|ZP_04021352.1| recombination protein A [Lactobacillus acidophilus ATCC 4796] gi|76363334|sp|Q5FL82|RECA_LACAC RecName: Full=Protein RecA; AltName: Full=Recombinase A gi|58254305|gb|AAV42542.1| recombination protein [Lactobacillus acidophilus NCFM] gi|227868434|gb|EEJ75855.1| recombination protein A [Lactobacillus acidophilus ATCC 4796] Length = 363 Score = 37.7 bits (86), Expect = 0.59, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 32/68 (47%), Gaps = 9/68 (13%) Query: 16 REYKRRYYLKNRD---KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 RE ++Y ++ D K+ E+ R+ Y ++ I + +N +K +E + +N+ Sbjct: 302 RENAKKYLEEHPDIYQKVQEQVRQAYGIDEKSIADR------ENPEKIKEKREETSEENK 355 Query: 73 DKMREKAR 80 EK + Sbjct: 356 TDNSEKTK 363 Score = 33.9 bits (76), Expect = 8.3, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 34/76 (44%), Gaps = 12/76 (15%) Query: 22 YYLKNRDKI---LERRRRRYLKNKD---KIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +Y D+I E ++ ++ D K++E Q Y ++ + +N +K+ Sbjct: 290 WYSYKSDRIGQGRENAKKYLEEHPDIYQKVQEQVRQAYGIDEKSIADR------ENPEKI 343 Query: 76 REKARQSYRKLYSKDS 91 +EK ++ + + +S Sbjct: 344 KEKREETSEENKTDNS 359 >gi|212697507|ref|ZP_03305635.1| hypothetical protein ANHYDRO_02077 [Anaerococcus hydrogenalis DSM 7454] gi|212675506|gb|EEB35113.1| hypothetical protein ANHYDRO_02077 [Anaerococcus hydrogenalis DSM 7454] Length = 524 Score = 37.7 bits (86), Expect = 0.60, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 28/53 (52%), Gaps = 4/53 (7%) Query: 42 KDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQS--YRKLYSKDSW 92 K+K ++ + +NK+ + ++Y N+DK EK ++ +K+ K ++ Sbjct: 303 KEKDMKNLEKRISENKENIEKNNKKYI--NKDKNTEKTKEKLPNKKISKKKAY 353 >gi|169604943|ref|XP_001795892.1| hypothetical protein SNOG_05488 [Phaeosphaeria nodorum SN15] gi|160706670|gb|EAT86552.2| hypothetical protein SNOG_05488 [Phaeosphaeria nodorum SN15] Length = 1185 Score = 37.3 bits (85), Expect = 0.61, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 44/107 (41%), Gaps = 13/107 (12%) Query: 27 RDKILERRRRRYLKNKDKI-RESYHQYYLKNKDKYRE-------YKRRYYLKNRD--KMR 76 ++ +R+ ++ D I + ++ NK+ +E + + R+ +++ Sbjct: 561 KEHKKAHKRKYRDESPDDIPLKKLKKHKEMNKEPVKEAKAAKEPKVSKEPKETREPKEVK 620 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 E ++ + KD AP + T + E E + A + P++E Sbjct: 621 E-PKEPKEQKEPKDKGKAPAKAPSKTPIKEE--EPKEATPRETPVQE 664 >gi|302693336|ref|XP_003036347.1| hypothetical protein SCHCODRAFT_81569 [Schizophyllum commune H4-8] gi|300110043|gb|EFJ01445.1| hypothetical protein SCHCODRAFT_81569 [Schizophyllum commune H4-8] Length = 1937 Score = 37.3 bits (85), Expect = 0.63, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 38/88 (43%) Query: 36 RRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAP 95 R +K++D+ +E + +++N+D+ + RY ++ R+ + Y + ++D P Sbjct: 502 ERRIKDRDERKEREKREWIRNRDRANQGSGRYRDRDDSYGRDYESRPYSRGANRDYESRP 561 Query: 96 EEPMGMTKAEIEALEREIARLKAKPIEE 123 G E R +++P Sbjct: 562 YSRGGDRDYEPRPYSRGGGDYESRPYSR 589 >gi|71896371|ref|NP_001026185.1| hypothetical protein LOC421045 [Gallus gallus] gi|82083028|sp|Q5ZML6|CR019_CHICK RecName: Full=Uncharacterized protein C18orf19 homolog gi|53127288|emb|CAG31027.1| hypothetical protein RCJMB04_1k21 [Gallus gallus] Length = 275 Score = 37.3 bits (85), Expect = 0.64, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 13/75 (17%) Query: 21 RYYLKN-----RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +Y KN + E + R + KDKI E + KDK E + +DK+ Sbjct: 206 KYLRKNGYMSTPPPVKEYLQDRMEETKDKITEKM----EETKDKITEKM----EETKDKI 257 Query: 76 REKARQSYRKLYSKD 90 EK +++ K+ K Sbjct: 258 TEKIQETKDKVSFKK 272 >gi|290343600|ref|YP_003494967.1| hypothetical protein OTV1_128 [Ostreococcus tauri virus 1] gi|260161015|emb|CAY39716.1| hypothetical protein OTV1_128 [Ostreococcus tauri virus 1] Length = 83 Score = 37.3 bits (85), Expect = 0.65, Method: Composition-based stats. Identities = 20/53 (37%), Positives = 34/53 (64%) Query: 38 YLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKD 90 Y +KDK +ES +Y +NKDK +E + YY +N++K++E+ R Y + K+ Sbjct: 29 YDYDKDKNKESCRKYREQNKDKVKEQNKLYYERNKEKIKEQKRLQYEEKKKKE 81 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 28/49 (57%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 +E R+Y +N+DK+ E+ + Y +NK+KI+E Y + K K + Sbjct: 35 KNKESCRKYREQNKDKVKEQNKLYYERNKEKIKEQKRLQYEEKKKKENQ 83 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 20/52 (38%), Positives = 31/52 (59%) Query: 23 YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 Y ++DK E R+ +NKDK++E YY +NK+K +E KR Y + + K Sbjct: 29 YDYDKDKNKESCRKYREQNKDKVKEQNKLYYERNKEKIKEQKRLQYEEKKKK 80 >gi|296804456|ref|XP_002843080.1| conserved hypothetical protein [Arthroderma otae CBS 113480] gi|238845682|gb|EEQ35344.1| conserved hypothetical protein [Arthroderma otae CBS 113480] Length = 1303 Score = 37.3 bits (85), Expect = 0.65, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 44/108 (40%), Gaps = 2/108 (1%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 KV + +R + ++ + ++KI +R + L NK R + N +K + Sbjct: 895 KVASERQRRQAAMQREKGKREEAKEKI--KRLQDNLANKIADRNRVEKELKDNVEKAKLK 952 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 + + R + EK +Q+ ++ + + TK + L++ Sbjct: 953 RAALIQRKRREYDEKLKQAKQQEADSCRRLEELKYRQRTKYGTDFLKK 1000 >gi|315151163|gb|EFT95179.1| conserved hypothetical protein [Enterococcus faecalis TX0012] Length = 666 Score = 37.3 bits (85), Expect = 0.66, Method: Composition-based stats. Identities = 17/102 (16%), Positives = 40/102 (39%), Gaps = 6/102 (5%) Query: 9 EERMLCRREYKRRYYLK--NRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE-YKR 65 +E ++ + K N K + ++R + D + + + N+ K ++ ++ Sbjct: 464 KENTNQSKQKSKPSRQKKVNEPKQTNKEQKRNSHSLDSLSKKQQELRSNNQSKQQQFTQK 523 Query: 66 RYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIE 107 + KN++ E R+ + + K+ P TK E Sbjct: 524 QDKEKNKE---ETGRKYKQSQHRKNEQKKPTTKSSSTKGNQE 562 >gi|299469649|emb|CBN76503.1| hypothetical protein Esi_0000_0113 [Ectocarpus siliculosus] Length = 935 Score = 37.3 bits (85), Expect = 0.66, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 47/125 (37%), Gaps = 4/125 (3%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + + R RE +R+ + R+ RR+ + + E + K +K REY Sbjct: 314 QEQREKRRAAAERETRRKEKQRAREIAEAARRKALEA--ELLAEMEKEKARKRLEKIREY 371 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 K + + + + A + + ++ + A + E E + R K KP E Sbjct: 372 KEKKAARELAEKKSAAAKREEERRAQAAEEA-KRLAAEASFERERAN-WLERAKRKPRAE 429 Query: 124 LIYKR 128 KR Sbjct: 430 EKAKR 434 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 27/118 (22%), Positives = 46/118 (38%), Gaps = 5/118 (4%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 +RE + + + R ER RR K + + + + E ++ K +K Sbjct: 308 QREREFQEQREKRRAAAERETRRKEKQRAREIAEAARRKALEAELLAEMEKEKARKRLEK 367 Query: 75 MREKA-RQSYRKLYSKDSWIAPEEPMGMTKAEIE----ALEREIARLKAKPIEELIYK 127 +RE +++ R+L K S A E +A E A E R +A +E K Sbjct: 368 IREYKEKKAARELAEKKSAAAKREEERRAQAAEEAKRLAAEASFERERANWLERAKRK 425 >gi|189523699|ref|XP_001923800.1| PREDICTED: titin [Danio rerio] Length = 32757 Score = 37.3 bits (85), Expect = 0.67, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 41/91 (45%), Gaps = 5/91 (5%) Query: 26 NRDKILERRRRRY----LKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 ++K ++ +N+ K + + Y K ++K R+ + Y+ KN+DK++ +Q Sbjct: 9342 KKEKETAHLKKEVFLSVKQNEKKFPKGKNIYPRKEEEKLRKQEYSYFPKNKDKIKFLKKQ 9401 Query: 82 SYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + +D I P E ++ + L E Sbjct: 9402 EPLSVK-QDEKILPREKEVFSQKGFQLLHEE 9431 >gi|328766850|gb|EGF76902.1| hypothetical protein BATDEDRAFT_28128 [Batrachochytrium dendrobatidis JAM81] Length = 975 Score = 37.3 bits (85), Expect = 0.67, Method: Composition-based stats. Identities = 13/83 (15%), Positives = 36/83 (43%), Gaps = 2/83 (2%) Query: 14 CRREYKRRYYLKNRD-KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 R+ +++ KI E+ R + + + ++ + +N +K E +R+ K Sbjct: 260 KRKADGAEIRSSSKEAKIKEKESRDIERQRKENEKAAEKQRKEN-EKAIEKQRKEDEKAL 318 Query: 73 DKMREKARQSYRKLYSKDSWIAP 95 ++ +K ++ + SK++ Sbjct: 319 ERAAKKCKKDAERKASKEAKELN 341 >gi|312221786|emb|CBY01726.1| similar to ATP-dependent rRNA helicase spb4 [Leptosphaeria maculans] Length = 636 Score = 37.3 bits (85), Expect = 0.69, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 39/103 (37%), Gaps = 16/103 (15%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 + R R ++ Y N+ E ++R +NK + + + K+ RE K Sbjct: 532 KDKAREQQRLVEQKEY-EANK---AEGKKRYVTENKKESAWTQQKDARATKELRREKKAA 587 Query: 67 YYLKNR------------DKMREKARQSYRKLYSKDSWIAPEE 97 +R ++++ + +K+ +++ A +E Sbjct: 588 KREHDRVAKLTDAERKEEERLKAMVDEVRQKVKEQEAKEAADE 630 Score = 37.3 bits (85), Expect = 0.71, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 38/99 (38%), Gaps = 3/99 (3%) Query: 21 RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKAR 80 Y K R++ ++ Y NK E +Y +NK + +++ ++ REK Sbjct: 530 AYKDKAREQQRLVEQKEYEANKA---EGKKRYVTENKKESAWTQQKDARATKELRREKKA 586 Query: 81 QSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 A + KA ++ + +++ +AK Sbjct: 587 AKREHDRVAKLTDAERKEEERLKAMVDEVRQKVKEQEAK 625 >gi|139947514|ref|NP_001077335.1| telomerase reverse transcriptase [Danio rerio] gi|124257924|gb|ABM92944.1| telomerase reverse transcriptase [Danio rerio] Length = 1098 Score = 37.3 bits (85), Expect = 0.75, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 37/101 (36%), Gaps = 8/101 (7%) Query: 19 KRRYYLKNRDKILERRRRRYL-------KNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 ++ N I +RRR + + + N++ +RR + Sbjct: 212 SKKRTRDNEKYISVKRRRVKETVNNNNGNYRSLCFAISKKRAIDNEENISLKRRRMEETD 271 Query: 72 R-DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 + K+R + +S SK EE + + + +E +++ Sbjct: 272 QVAKIRNENHESQSFAISKKRARDNEENISLKRQRMEEIDQ 312 >gi|190339728|gb|AAI63300.1| Telomerase reverse transcriptase [Danio rerio] Length = 1098 Score = 37.3 bits (85), Expect = 0.77, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 37/101 (36%), Gaps = 8/101 (7%) Query: 19 KRRYYLKNRDKILERRRRRYL-------KNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 ++ N I +RRR + + + N++ +RR + Sbjct: 212 SKKRTRDNEKYISVKRRRVKETVNNNNGNYRSLCFAISKKRAIDNEENISLKRRRMEETD 271 Query: 72 R-DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 + K+R + +S SK EE + + + +E +++ Sbjct: 272 QVAKIRNENHESQSFAISKKRARDNEENISLKRQRMEEIDQ 312 >gi|215422509|dbj|BAG85376.1| telomerase reverse transcriptase [Danio rerio] Length = 1098 Score = 37.3 bits (85), Expect = 0.78, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 37/101 (36%), Gaps = 8/101 (7%) Query: 19 KRRYYLKNRDKILERRRRRYL-------KNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 ++ N I +RRR + + + N++ +RR + Sbjct: 212 SKKRTRDNEKYISVKRRRVKETVNNNNGNYRSLCFAISKKRAIDNEENISLKRRRMEETD 271 Query: 72 R-DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 + K+R + +S SK EE + + + +E +++ Sbjct: 272 QVAKIRNENHESQSFAISKKRARDNEENISLKRQRMEEIDQ 312 >gi|302409914|ref|XP_003002791.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102] gi|261358824|gb|EEY21252.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102] Length = 1159 Score = 37.0 bits (84), Expect = 0.82, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 49/123 (39%), Gaps = 16/123 (13%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKN---KDKIRESYHQYYLKNKDK-YREYKRRYYLKNR 72 + + K ++K + + + + K K++ +Y Q NK+K +E + R Sbjct: 326 KKMQESLRKEKEKKKQTHKEKDDQTTQLKQKVKATYEQMRQTNKEKARKESVLKEKQDKR 385 Query: 73 DKMRE----------KARQSYRKLYSKDSWIAPEEPMGMTK--AEIEALEREIARLKAKP 120 KM E ++S S+ + + + + E AL+ E A+L+A+ Sbjct: 386 RKMSETVDKCNTDVGTMKKSRDGFSSEREHVQQDRDLQVKHLSEENAALQEEFAQLEAEY 445 Query: 121 IEE 123 E Sbjct: 446 KER 448 >gi|237830805|ref|XP_002364700.1| rhoptry protein, putative [Toxoplasma gondii ME49] gi|211962364|gb|EEA97559.1| rhoptry protein, putative [Toxoplasma gondii ME49] gi|221507581|gb|EEE33185.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length = 958 Score = 37.0 bits (84), Expect = 0.83, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 2/69 (2%) Query: 3 RKVLTPEERM-LCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 RK L E+R R E ++ + R K+LE+R + L+ K +I+E + K ++ R Sbjct: 749 RKELKEEQRKARERAEAEKMVAEEERKKLLEQRTQEELETKRRIQE-EKELREKQAEEMR 807 Query: 62 EYKRRYYLK 70 + + K Sbjct: 808 KQEEEQRKK 816 >gi|308498706|ref|XP_003111539.1| hypothetical protein CRE_03004 [Caenorhabditis remanei] gi|308239448|gb|EFO83400.1| hypothetical protein CRE_03004 [Caenorhabditis remanei] Length = 1448 Score = 37.0 bits (84), Expect = 0.84, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 39/106 (36%), Gaps = 5/106 (4%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNK-----DKYREYKRRYYLKNRDKMREKARQ 81 + K+ E +R + K K E Q K + + +N +++E + Sbjct: 1034 KKKLTEAEKRLEMTEKKKALEMKKQTEKYEKMTSKCENLESRIKELEEENAAEIKEHVAK 1093 Query: 82 SYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 +K+ + + MG E E +E+ K K ++E K Sbjct: 1094 YEQKVKKYEELEKKMKEMGREMKENEKKMKELDMKKDKELDEYKTK 1139 >gi|74193687|dbj|BAE22792.1| unnamed protein product [Mus musculus] Length = 554 Score = 37.0 bits (84), Expect = 0.85, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 34/98 (34%), Gaps = 8/98 (8%) Query: 15 RREYKRRY-YLKNRDKILERRRRRYL----KNKDKIRESYHQYYLKNKDKYREYK---RR 66 +E + K ++ E + ++ D + ++ + K R RR Sbjct: 176 SKEKDKMREREKEKEAGKELKYPWRELMKDEDSDPYKFKIKEFEDIDAAKVRLKDGIVRR 235 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 K++DK +++ R K + + + KA Sbjct: 236 EREKHKDKKKDRERSKREKDKRERERLKEKNREDKIKA 273 >gi|20806882|ref|NP_622053.1| hypothetical protein TTE0369 [Thermoanaerobacter tengcongensis MB4] gi|20515355|gb|AAM23657.1| hypothetical protein TTE0369 [Thermoanaerobacter tengcongensis MB4] Length = 440 Score = 37.0 bits (84), Expect = 0.88, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 40/101 (39%), Gaps = 8/101 (7%) Query: 35 RRRYL-KNKDKIRESYHQYYLKNKDKYREYKRR-----YYLKNRDKMREKARQSYRKLYS 88 ++ +N+ ++Y N++K R Y ++ Y K ++ + Y K+ + Sbjct: 335 KKFREARNRILKAAEKIKFYRDNREKVRFYYKKLLISLYQKKALERGNYSVKDVYEKVTA 394 Query: 89 KDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKRG 129 S+ EPM E + K + +EE I + G Sbjct: 395 --SFENVREPMERVSRLYEKVRYGKYYPKREEVEEFIKEIG 433 >gi|123507969|ref|XP_001329532.1| hypothetical protein [Trichomonas vaginalis G3] gi|121912488|gb|EAY17309.1| conserved hypothetical protein [Trichomonas vaginalis G3] Length = 395 Score = 37.0 bits (84), Expect = 0.96, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 28/63 (44%), Gaps = 4/63 (6%) Query: 31 LERRRRRYLKNKDKIRE----SYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 +R + + +NKDKI E +Y ++ + ++ ++ + +K ++ Y K Sbjct: 203 ANKREKYFEENKDKIIEESEKRKAEYAKMTPEEAAKARKERRSQHNQLIMKKRQEKYEKQ 262 Query: 87 YSK 89 K Sbjct: 263 QRK 265 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 11/81 (13%), Positives = 39/81 (48%), Gaps = 4/81 (4%) Query: 15 RREYKRRYYLKNRDKILE----RRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK 70 + +Y+ +N+DKI+E R+ ++ ++ + ++ + ++ Y K Sbjct: 202 NANKREKYFEENKDKIIEESEKRKAEYAKMTPEEAAKARKERRSQHNQLIMKKRQEKYEK 261 Query: 71 NRDKMREKARQSYRKLYSKDS 91 + K++ + +Q+ + ++++ Sbjct: 262 QQRKLQGQTQQAAPQDFNQNQ 282 >gi|330801236|ref|XP_003288635.1| hypothetical protein DICPUDRAFT_152902 [Dictyostelium purpureum] gi|325081308|gb|EGC34828.1| hypothetical protein DICPUDRAFT_152902 [Dictyostelium purpureum] Length = 1853 Score = 37.0 bits (84), Expect = 0.97, Method: Composition-based stats. Identities = 16/118 (13%), Positives = 40/118 (33%), Gaps = 20/118 (16%) Query: 15 RREYKRRYYLKNRDKILERRRRRY-LKNKDKIRESYHQYYLK-NKDKYREYKRRYY--LK 70 RE R + R+ +R + ++K+ + ++ +K+ E ++ + + Sbjct: 1383 HRESHREHRSSEREHRSDRSKEHRSDRDKEHRSDRDKEHRSDRDKEHRSEREKDQHRSDR 1442 Query: 71 NRD----------------KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 N++ + R + +R KD +G + + RE Sbjct: 1443 NKEHRSGERDKEHRSDRDKEHRSDRDKEHRSEREKDQHREARTSIGEREKDQNRDHRE 1500 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 9/88 (10%), Positives = 32/88 (36%), Gaps = 4/88 (4%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLK-NKDKYREYKRRYY-LKNRDKMREKARQS 82 +++D E R ++ + ++ +K+ + + + ++++ E+ + Sbjct: 1378 RDKDHHRESHREHRSSEREHRSDRSKEHRSDRDKEHRSDRDKEHRSDRDKEHRSEREKDQ 1437 Query: 83 YR--KLYSKDSWIAPEEPMGMTKAEIEA 108 +R + S +E E + Sbjct: 1438 HRSDRNKEHRSGERDKEHRSDRDKEHRS 1465 >gi|67925378|ref|ZP_00518728.1| hypothetical protein CwatDRAFT_0726 [Crocosphaera watsonii WH 8501] gi|67852774|gb|EAM48183.1| hypothetical protein CwatDRAFT_0726 [Crocosphaera watsonii WH 8501] Length = 297 Score = 37.0 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 9/82 (10%) Query: 16 REYKRRYYLKNRDKILERR---RRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 RE+ R Y NR + R R N+ + + N D +++ +N Sbjct: 62 REFDRGYRQDNRQINRDGRDFDRGYRQDNRQGNFDDRRENRQGNFDD----RQQNRQENI 117 Query: 73 DKMREKARQS--YRKLYSKDSW 92 D ++ +++ R+ + D W Sbjct: 118 DNRQQNRQENIDNRQDFRNDRW 139 >gi|298251442|ref|ZP_06975245.1| transposase IS605 OrfB [Ktedonobacter racemifer DSM 44963] gi|297546034|gb|EFH79902.1| transposase IS605 OrfB [Ktedonobacter racemifer DSM 44963] Length = 554 Score = 36.6 bits (83), Expect = 1.0, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 7/69 (10%) Query: 5 VLTPEERMLCR--REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQY-----YLKNK 57 L PE R +CR R+ R+ N E+ R + NK + +Y K Sbjct: 282 ELAPEARAICRLQRQMDRQRRAANPKHYDEKGRIKKQGNKKLRWKQSKRYQVTRRRKATK 341 Query: 58 DKYREYKRR 66 ++ R+ Sbjct: 342 ERKLAAHRK 350 >gi|270289241|ref|ZP_06195543.1| adherence factor [Chlamydia muridarum Weiss] Length = 3154 Score = 36.6 bits (83), Expect = 1.0, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 23/57 (40%), Gaps = 1/57 (1%) Query: 57 KDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREI 113 K++ EY R+ N++K+ ++ +L S I + + K E+ Sbjct: 300 KEEIEEY-RKLKESNKEKIASVVKKLNEQLGSARVHIKDIKELNSMKHSQNIYNYEM 355 >gi|320583686|gb|EFW97899.1| hypothetical protein HPODL_0529 [Pichia angusta DL-1] Length = 1260 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 45/97 (46%), Gaps = 2/97 (2%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R+E R+ + ++ ER R + + + +IR + + K++ RE +R K + + Sbjct: 270 RKEQDRKERERQENERKERERVQRERKEQEIRAQQLRD-QQRKEQQREMERVEEEKRKKE 328 Query: 75 MREKARQSYR-KLYSKDSWIAPEEPMGMTKAEIEALE 110 + E+ +Q + + E + ++ E+E ++ Sbjct: 329 LLERQKQEIELRDRQQREQRLEMERLEQSRREMEKMQ 365 >gi|221090707|ref|XP_002169539.1| PREDICTED: similar to F59H6.5, partial [Hydra magnipapillata] Length = 993 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 45/135 (33%), Gaps = 17/135 (12%) Query: 6 LTPEERMLCRREYKRRYYLK--------NRDKILERRRRRYLKNKDKIRESYHQYYLKNK 57 L E R +C+ E + N + E RR+ +N + E + +N Sbjct: 5 LRVEIRRICQNERNIARREEINHRQNERNATRREELNRRQNERNATRQEELNRRQNKRNA 64 Query: 58 DKYREYKRRYYLKN---RDKM----REKARQSYRK-LYSKDSWIAP-EEPMGMTKAEIEA 108 + E R +N R+++ E+ + + ++ A E + + E Sbjct: 65 TRQEELNHRQNERNATRREELNCCQNERNATRREELNHRQNERNATRREELNRRQNERNE 124 Query: 109 LEREIARLKAKPIEE 123 R + R I Sbjct: 125 ARRALVRCGMHCIAR 139 >gi|158602708|gb|ABW74630.1| telomerase reverse transcriptase [Danio rerio] Length = 1091 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 13/110 (11%), Positives = 39/110 (35%), Gaps = 5/110 (4%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 + V ++R +Y + ++ + + + + + N++ Sbjct: 200 KTVAISKKRTRDNEKYISVKRRRVKETVNNKNGNY----RSRCFAISKKRARDNEENISL 255 Query: 63 YKRRYYLKNRD-KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 +RR +++ K+ + S SK EE + + + +E ++ Sbjct: 256 KRRRMEETDQEAKICNENHGSQSFTISKKRARDNEENISLKRQRMEETDQ 305 >gi|225868016|ref|YP_002743964.1| primosomal protein N' [Streptococcus equi subsp. zooepidemicus] gi|225701292|emb|CAW98291.1| putative primosomal protein N' [Streptococcus equi subsp. zooepidemicus] Length = 794 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 41/98 (41%), Gaps = 5/98 (5%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 +DK + + Y NK ++E + + N+ K R+ + Y L N K + A+ Sbjct: 166 KDKKTIKIEKYYQVNKSALQE----HKISNRAKKRQALKAYLLANPQKDK-LAKLYQDYS 220 Query: 87 YSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 S+ + + + + + E +KA P+ +L Sbjct: 221 REVVSYFVANQLIELEEQRVNRSEAYFNHIKAAPLLKL 258 >gi|300637998|gb|ADK26163.1| Ycf1 [Asimina obovata] Length = 597 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 54/124 (43%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYY-LKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYK 64 +ER L +R+ ++ K ++++ ++ + + K K+++ + + + + + K Sbjct: 280 KAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQKEKERLNKK 339 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 340 EKERLNKKEKERLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 399 Query: 125 IYKR 128 +R Sbjct: 400 RKRR 403 Score = 33.5 bits (75), Expect = 9.0, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 40/93 (43%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER R+ K K+++ + + + + + K + L ++K R Sbjct: 271 KDYEIFNFTKAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQ 330 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + + E E L ++ Sbjct: 331 KEKERLNKKEKERLNKKEKERLNQKEKEQLNQK 363 >gi|221487790|gb|EEE26022.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length = 958 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 2/69 (2%) Query: 3 RKVLTPEERM-LCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 RK L E+R L R E ++ + R K+LE+R + L+ K +I+E + K ++ R Sbjct: 749 RKELKEEQRKALERAEAEKIVAEEERKKLLEQRTQEELETKRRIQE-EKELREKQAEEMR 807 Query: 62 EYKRRYYLK 70 + + K Sbjct: 808 KQEEEQRKK 816 >gi|300637970|gb|ADK26149.1| Ycf1 [Asimina tetramera] gi|300637980|gb|ADK26154.1| Ycf1 [Asimina angustifolia] gi|300637982|gb|ADK26155.1| Ycf1 [Asimina pygmaea] gi|300638018|gb|ADK26173.1| Ycf1 [Asimina angustifolia] Length = 598 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 54/124 (43%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYY-LKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYK 64 +ER L +R+ ++ K ++++ ++ + + K K+++ + + + + + K Sbjct: 281 KAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQKEKERLNKK 340 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 341 EKERLNKKEKERLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 400 Query: 125 IYKR 128 +R Sbjct: 401 RKRR 404 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 40/93 (43%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER R+ K K+++ + + + + + K + L ++K R Sbjct: 272 KDYEIFNFTKAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQ 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + + E E L ++ Sbjct: 332 KEKERLNKKEKERLNKKEKERLNQKEKEQLNQK 364 >gi|300638016|gb|ADK26172.1| Ycf1 [Asimina pygmaea] Length = 586 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 54/124 (43%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYY-LKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYK 64 +ER L +R+ ++ K ++++ ++ + + K K+++ + + + + + K Sbjct: 281 KAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQKEKERLNKK 340 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 341 EKERLNKKEKERLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 400 Query: 125 IYKR 128 +R Sbjct: 401 RKRR 404 Score = 33.5 bits (75), Expect = 9.3, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 40/93 (43%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER R+ K K+++ + + + + + K + L ++K R Sbjct: 272 KDYEIFNFTKAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQ 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + + E E L ++ Sbjct: 332 KEKERLNKKEKERLNKKEKERLNQKEKEQLNQK 364 >gi|145296787|ref|YP_001139608.1| molecular chaperone DnaK [Corynebacterium glutamicum R] gi|167016953|sp|A4QHJ0|DNAK_CORGB RecName: Full=Chaperone protein DnaK; AltName: Full=HSP70; AltName: Full=Heat shock 70 kDa protein; AltName: Full=Heat shock protein 70 gi|140846707|dbj|BAF55706.1| hypothetical protein [Corynebacterium glutamicum R] Length = 618 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 12/78 (15%), Positives = 27/78 (34%), Gaps = 5/78 (6%) Query: 40 KNKDKIRESYHQYYLK-----NKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIA 94 K+ + + + + N + R++ +N +K+ E + + Sbjct: 491 KDAEAHADEDKKRREEQEVRNNAESLVYQTRKFVEENSEKVSEDLKAKVEEAAKGVEEAL 550 Query: 95 PEEPMGMTKAEIEALERE 112 E + KA +E L E Sbjct: 551 KGEDLEAIKAAVEKLNTE 568 >gi|115386330|ref|XP_001209706.1| hypothetical protein ATEG_07020 [Aspergillus terreus NIH2624] gi|114190704|gb|EAU32404.1| hypothetical protein ATEG_07020 [Aspergillus terreus NIH2624] Length = 1205 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 35/85 (41%), Gaps = 3/85 (3%) Query: 35 RRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIA 94 R+R KNK + ++ N + + R+K+RE+ + + K+ SK Sbjct: 7 RQRKQKNKQRDSKTAP--VDTNAAEILPTSKNDKEARREKLREQLKDQHPKVSSKKQKRL 64 Query: 95 PEEPMGMTKAEIE-ALEREIARLKA 118 + K E L +++A+ K Sbjct: 65 DKYIENKLKKEENLELLKKLAQSKI 89 >gi|19553990|ref|NP_601992.1| molecular chaperone DnaK [Corynebacterium glutamicum ATCC 13032] gi|62391635|ref|YP_227037.1| molecular chaperone DnaK [Corynebacterium glutamicum ATCC 13032] gi|24211638|sp|Q8NLY6|DNAK_CORGL RecName: Full=Chaperone protein DnaK; AltName: Full=HSP70; AltName: Full=Heat shock 70 kDa protein; AltName: Full=Heat shock protein 70 gi|21325573|dbj|BAC00194.1| Molecular chaperone and 70 kDa heat shock chaperonin protein dnaK [Corynebacterium glutamicum ATCC 13032] gi|41326977|emb|CAF20821.1| Heat shock protein hsp70 [Corynebacterium glutamicum ATCC 13032] Length = 618 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 12/78 (15%), Positives = 27/78 (34%), Gaps = 5/78 (6%) Query: 40 KNKDKIRESYHQYYLK-----NKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIA 94 K+ + + + + N + R++ +N +K+ E + + Sbjct: 491 KDAEAHADEDKKRREEQEVRNNAESLVYQTRKFVEENSEKVSEDLKAKVEEAAKGVEEAL 550 Query: 95 PEEPMGMTKAEIEALERE 112 E + KA +E L E Sbjct: 551 KGEDLEAIKAAVEKLNTE 568 >gi|322702569|gb|EFY94206.1| hypothetical protein MAA_10332 [Metarhizium anisopliae ARSEF 23] Length = 669 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 38/114 (33%), Gaps = 14/114 (12%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 + ++ Y +D+ ++ ++K+ E + K +K K+ +K Sbjct: 510 SKRLQKAYAQAVKDRAKA------VQEREKVLERRRKKAAKEAEK--------QAKDAEK 555 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 R K + K ++ A E E + E K K L KR Sbjct: 556 TRAKLEKQVEKQRVREEKQASESVAAEGHEEQDGTEDAERPKKLKKFCALPSKR 609 >gi|208751335|gb|ACI31281.1| RNA polymerase beta' subunit [Volvox carteri] gi|261888156|gb|ACY06011.1| beta' subunit of RNA polymerase [Volvox carteri f. nagariensis] Length = 2094 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 1/69 (1%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 K L ++RML ++ K+ K E + KN + + + + KNK + Sbjct: 524 KSLGSKQRMLAKKGNKKEMITNKITKKTESNTKLEPKNSELLLIYISKNWRKNKKLLYQQ 583 Query: 64 KR-RYYLKN 71 R +YYLKN Sbjct: 584 NRKKYYLKN 592 >gi|310795759|gb|EFQ31220.1| stress response protein NST1 [Glomerella graminicola M1.001] Length = 1212 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 13/93 (13%), Positives = 36/93 (38%), Gaps = 9/93 (9%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 ++ + ++K E+ R+ + K K++ R ++ + ++RE+ Sbjct: 644 QKRLQEQQEKRAEQERKAREA---------REKAQKLKEEARLREKENREQKEKEIRERK 694 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + K++ + K E L++E Sbjct: 695 EKQDQAKRDKEAKAKADREAKADKESSERLKQE 727 >gi|198421981|ref|XP_002130898.1| PREDICTED: similar to mitogen-activated protein kinase kinase kinase kinase 4 [Ciona intestinalis] Length = 1022 Score = 36.2 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 40/94 (42%), Gaps = 3/94 (3%) Query: 21 RYYLKNRDKILERRRRRYLK--NKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK 78 + +N+++ ++R R K N ++ + + + D RE RR K ++ RE+ Sbjct: 350 KLQQENKERSELQKRERQQKELNDERRKNEKKKEDRRKSDDQRENLRREQAKKEERAREQ 409 Query: 79 ARQSYRKLYSKDSWIA-PEEPMGMTKAEIEALER 111 R+ + K+ +E + + E E + Sbjct: 410 RRKLDAERKRKEEEARLMQERIQQQRREQEEAHQ 443 >gi|260684143|ref|YP_003215428.1| two-component sensor histidine kinase [Clostridium difficile CD196] gi|260687802|ref|YP_003218936.1| two-component sensor histidine kinase [Clostridium difficile R20291] gi|260210306|emb|CBA64617.1| two-component sensor histidine kinase [Clostridium difficile CD196] gi|260213819|emb|CBE05794.1| two-component sensor histidine kinase [Clostridium difficile R20291] Length = 778 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 36/106 (33%), Gaps = 10/106 (9%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK------IRESYHQYYLKNKDKYREY 63 E + +YYL +++ + NK+ ++ + +N+ K E Sbjct: 127 EHFIKASGKNLKYYLVDKE----SNFEFFNDNKELGVLDTTKNDASKENSDENQSKNTEM 182 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 K ++E R + KD + + G+ + I + Sbjct: 183 NDEERQKQITDIKENYRFYVVMNFDKDGKVNIVDSYGVNQKVINQM 228 >gi|126700189|ref|YP_001089086.1| two-component sensor histidine kinase [Clostridium difficile 630] Length = 778 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 36/106 (33%), Gaps = 10/106 (9%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK------IRESYHQYYLKNKDKYREY 63 E + +YYL +++ + NK+ ++ + +N+ K E Sbjct: 127 EHFIKASGKNLKYYLVDKE----SNFEFFNDNKELGVLDTTKNDASKENSDENQSKNTEM 182 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 K ++E R + KD + + G+ + I + Sbjct: 183 NDEERQKQITDIKENYRFYVVMNFDKDGKVNIVDSYGVNQKVINQM 228 >gi|16330312|ref|NP_441040.1| DNA gyrase B subunit [Synechocystis sp. PCC 6803] gi|2501296|sp|P77966|GYRB_SYNY3 RecName: Full=DNA gyrase subunit B; Contains: RecName: Full=Ssp gyrB intein gi|1652801|dbj|BAA17720.1| DNA gyrase B subunit [Synechocystis sp. PCC 6803] Length = 1078 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 52/138 (37%), Gaps = 16/138 (11%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRD--------KILERRRRRYLKNKDKIRESYHQY 52 M ++ +PE R + ++ + ++ + + +N +++ + +Y Sbjct: 646 MSARMQSPETRAILSKQAQAQWQNETYKLTMMESWRSFYDSNEDYRQQNAEQLNRAQQEY 705 Query: 53 Y--LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 + +N+ E R+++ +N RQ Y + K W PE K E Sbjct: 706 WAQAENRTAQAERVRQHFAQNP-----GLRQQYSENAVK-QWNNPELLKWRQKKTKEQWT 759 Query: 111 REIARLKAKPIEELIYKR 128 E + + + + Y++ Sbjct: 760 PEFREKRREALAQTYYRK 777 >gi|328887726|emb|CAJ69459.2| Two-component sensor histidine kinase [Clostridium difficile] Length = 773 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 36/106 (33%), Gaps = 10/106 (9%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK------IRESYHQYYLKNKDKYREY 63 E + +YYL +++ + NK+ ++ + +N+ K E Sbjct: 122 EHFIKASGKNLKYYLVDKE----SNFEFFNDNKELGVLDTTKNDASKENSDENQSKNTEM 177 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 K ++E R + KD + + G+ + I + Sbjct: 178 NDEERQKQITDIKENYRFYVVMNFDKDGKVNIVDSYGVNQKVINQM 223 >gi|255307603|ref|ZP_05351774.1| two-component sensor histidine kinase [Clostridium difficile ATCC 43255] Length = 777 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 36/106 (33%), Gaps = 10/106 (9%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK------IRESYHQYYLKNKDKYREY 63 E + +YYL +++ + NK+ ++ + +N+ K E Sbjct: 126 EHFIKASGKNLKYYLVDKE----SNFEFFNDNKELGVLDTTKNDASKENSDENQSKNTEM 181 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 K ++E R + KD + + G+ + I + Sbjct: 182 NDEERQKQITDIKENYRFYVVMNFDKDGKVNIVDSYGVNQKVINQM 227 >gi|195349197|ref|XP_002041133.1| GM15205 [Drosophila sechellia] gi|194122738|gb|EDW44781.1| GM15205 [Drosophila sechellia] Length = 594 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 27/55 (49%), Gaps = 2/55 (3%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 +N++ + E + +NK+ + E+ + +NK++ + +N++ + E Sbjct: 152 ENKEVVSE-NKEVVSENKEVVSEN-KEVVSENKEEVVSENKEVVSENKEVVSENK 204 >gi|254976161|ref|ZP_05272633.1| two-component sensor histidine kinase [Clostridium difficile QCD-66c26] gi|255093551|ref|ZP_05323029.1| two-component sensor histidine kinase [Clostridium difficile CIP 107932] gi|255315294|ref|ZP_05356877.1| two-component sensor histidine kinase [Clostridium difficile QCD-76w55] gi|255517962|ref|ZP_05385638.1| two-component sensor histidine kinase [Clostridium difficile QCD-97b34] gi|255651078|ref|ZP_05397980.1| two-component sensor histidine kinase [Clostridium difficile QCD-37x79] gi|306520932|ref|ZP_07407279.1| two-component sensor histidine kinase [Clostridium difficile QCD-32g58] Length = 777 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 36/106 (33%), Gaps = 10/106 (9%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK------IRESYHQYYLKNKDKYREY 63 E + +YYL +++ + NK+ ++ + +N+ K E Sbjct: 126 EHFIKASGKNLKYYLVDKE----SNFEFFNDNKELGVLDTTKNDASKENSDENQSKNTEM 181 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 K ++E R + KD + + G+ + I + Sbjct: 182 NDEERQKQITDIKENYRFYVVMNFDKDGKVNIVDSYGVNQKVINQM 227 >gi|195978662|ref|YP_002123906.1| primosome assembly protein PriA [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975367|gb|ACG62893.1| helicase PriA essential for oriC/DnaA-independent DNA replication [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 794 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 41/98 (41%), Gaps = 5/98 (5%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 +DK + + Y NK ++E + + N+ K R+ + Y L N K + A+ Sbjct: 166 KDKKTIKIEKYYQVNKSALQE----HKISNRAKKRQALKTYLLANPQKDK-LAKLYQDYS 220 Query: 87 YSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 S+ + + + + + E +KA P+ +L Sbjct: 221 REVVSYFVANQLIELEERRVNRSEDYFNHIKAAPLLKL 258 >gi|255101734|ref|ZP_05330711.1| two-component sensor histidine kinase [Clostridium difficile QCD-63q42] Length = 777 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 36/106 (33%), Gaps = 10/106 (9%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK------IRESYHQYYLKNKDKYREY 63 E + +YYL +++ + NK+ ++ + +N+ K E Sbjct: 126 EHFIKASGKNLKYYLVDKE----SNFEFFNDNKELGVLDTTKNDASKENSDENQSKNTEM 181 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 K ++E R + KD + + G+ + I + Sbjct: 182 NDEERQKQITDIKENYRFYVVMNFDKDGKVNIVDSYGVNQKVINQM 227 >gi|300638030|gb|ADK26179.1| Ycf1 [Asimina reticulata] Length = 598 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 51/124 (41%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDK--YREYK 64 +ER L +R+ ++ K + L ++ + L K+K + + + NK + K Sbjct: 281 KAQERYLRQRKKEKERLNKKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNKKEKERLNKK 340 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 341 EKERLNKKEKERLNKKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 400 Query: 125 IYKR 128 +R Sbjct: 401 RKRR 404 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 40/93 (43%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER R+ K K+++ + + + + + K + L ++K + Sbjct: 272 KDYEIFNFTKAQERYLRQRKKEKERLNKKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNK 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + K E E L ++ Sbjct: 332 KEKERLNKKEKERLNKKEKERLNKKEKEQLNQK 364 >gi|224046036|ref|XP_002187561.1| PREDICTED: hypothetical protein [Taeniopygia guttata] Length = 286 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 12/75 (16%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +EY + + +DKI E+ + KDKI E + KDK + +DK+ Sbjct: 221 KEYLQDRMEETKDKITEKM----EETKDKITEKM----EETKDKITGKM----EETKDKI 268 Query: 76 REKARQSYRKLYSKD 90 K +++ K+ K Sbjct: 269 TGKIQETKDKVSFKK 283 >gi|194206723|ref|XP_001918295.1| PREDICTED: similar to microtubule-associated protein 1A [Equus caballus] Length = 3059 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 25/108 (23%), Positives = 50/108 (46%), Gaps = 9/108 (8%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRES------YHQYYLKNKDKYREYKRRYYLKNR 72 ++ + +D LE++ + + KDK+ E + L KD+ E + +K + Sbjct: 1765 QKDKAEEQKDMDLEQKGKALEQ-KDKVLEKKDQALEQKYWALGQKDEALEQNNK-SVKQK 1822 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE-IEALEREIARLKAK 119 DK+ E+ Q L +D + P++ + E +EA+E++ L K Sbjct: 1823 DKVLEEKTQEQESLVQEDKTMKPKKILEEKPPEKVEAVEQKEEALPEK 1870 >gi|221488723|gb|EEE26937.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length = 2685 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 36/112 (32%), Gaps = 5/112 (4%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 E R E R++ + R ++++I + + K RE R Sbjct: 2275 EHRQRVESERPSDRRQGAREETRQGEAREERSDREQIEAVRRGWRREEAVKPREDTRGDR 2334 Query: 69 LKNRD-----KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIAR 115 +D K+++ + + + +K++ A EI R Sbjct: 2335 EAAKDSCRSRKVQKTCEEEDSEDELCARLERRQRTTDSSKSDEMATHEEIRR 2386 >gi|300637992|gb|ADK26160.1| Ycf1 [Asimina reticulata] Length = 597 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 51/124 (41%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDK--YREYK 64 +ER L +R+ ++ K + L ++ + L K+K + + + NK + K Sbjct: 281 KAQERYLRQRKKEKERLNKKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNKKEKERLNKK 340 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 341 EKERLNKKEKERLNKKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 400 Query: 125 IYKR 128 +R Sbjct: 401 RKRR 404 Score = 33.5 bits (75), Expect = 10.0, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 40/93 (43%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER R+ K K+++ + + + + + K + L ++K + Sbjct: 272 KDYEIFNFTKAQERYLRQRKKEKERLNKKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNK 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + K E E L ++ Sbjct: 332 KEKERLNKKEKERLNKKEKERLNKKEKEQLNQK 364 >gi|20090163|ref|NP_616238.1| type IV secretion system protein [Methanosarcina acetivorans C2A] gi|19915148|gb|AAM04718.1| type IV secretion system protein [Methanosarcina acetivorans C2A] Length = 838 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 22/56 (39%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 + + +N K +R+ +N+ + E +N+ K + + +N+ Sbjct: 141 KKQENNRQENERKENQRQENERKENQRQENERKENQRQENEKKENQRQENERKENK 196 >gi|194017149|ref|ZP_03055761.1| phage tail like protein [Bacillus pumilus ATCC 7061] gi|194011017|gb|EDW20587.1| phage tail like protein [Bacillus pumilus ATCC 7061] Length = 1477 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 35/110 (31%), Gaps = 13/110 (11%) Query: 15 RREYKRRYYLKNRDKILE------RRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 +E ++ N +K E + ++ Y K + RE + + + K+ + Sbjct: 822 EQEILKKIKEGNEEKQKEIEKHENKVKKIYEKAAKEHRELTKEEWKEVKEHTQAM----- 876 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 N++ + + A + +A + A+ K Sbjct: 877 --NKEIETALTKSMDEQTLISKKLKAESSNLSAKQAAATVKHSKDAKDKV 924 >gi|110005909|gb|ABG48500.1| titin a [Danio rerio] Length = 32757 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 5/91 (5%) Query: 26 NRDKILERRRRRY----LKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 ++K ++ +N+ K + Y K ++K R+ + Y+ KN DK++ +Q Sbjct: 9342 KKEKETAHLKKEVFLSVKQNEKKFPKRKDIYPRKEEEKLRKQEYSYFPKNEDKIKFLKKQ 9401 Query: 82 SYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + +D I P E ++ + L E Sbjct: 9402 EPLSVK-QDEKILPREKEVFSQKGFQLLHEE 9431 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 68/137 (49%), Gaps = 18/137 (13%) Query: 1 MGRKVLTPEE-RMLCRREYKRRYYLK--------NRDKILERRRRRYLKNKDKIRESYHQ 51 M ++ + +E + L ++E + + K N K +R+ K ++K+R+ + Sbjct: 9327 MEDEIKSEKETKSLHKKEKETAHLKKEVFLSVKQNEKKFPKRKDIYPRKEEEKLRKQEYS 9386 Query: 52 YYLKNKDKYREYKRRY--YLKNRDKMREKARQSYRK-----LYSKDSWIAPEEPMGMTKA 104 Y+ KN+DK + K++ +K +K+ + ++ + + L+ + ++P + + ++ Sbjct: 9387 YFPKNEDKIKFLKKQEPLSVKQDEKILPREKEVFSQKGFQLLHEEKEVLSPHKNVYLSYT 9446 Query: 105 EIEAL--EREIARLKAK 119 E + + +++ + L AK Sbjct: 9447 EEDEVPPQKKDSPLPAK 9463 >gi|110741990|dbj|BAE98934.1| splicing factor like protein [Arabidopsis thaliana] Length = 341 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 1/68 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 +RE + +R K ER + K++++ E + ++ +K +E R +R++ Sbjct: 38 KRESRDHERETSRSKDREREKG-RDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRER 96 Query: 75 MREKARQS 82 +R Sbjct: 97 HHRSSRHR 104 >gi|167769520|ref|ZP_02441573.1| hypothetical protein ANACOL_00854 [Anaerotruncus colihominis DSM 17241] gi|167668488|gb|EDS12618.1| hypothetical protein ANACOL_00854 [Anaerotruncus colihominis DSM 17241] Length = 614 Score = 35.8 bits (81), Expect = 1.9, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 47/140 (33%), Gaps = 21/140 (15%) Query: 1 MGRKVLTPEERMLCRREYKRRYYL---KNRDKILERRRRRYLK--------------NKD 43 + ++ L R L E Y + +K L R R N Sbjct: 182 LAQQELNSARRDLQDHEDDLEYADSVMNDLEKELNRARNEMNDAERELELAEREERENGT 241 Query: 44 KIRESYHQYYLKNKDKYREYKRRYYLKNRDK---MREKARQSYRKLYSKDSWIAPEEPM- 99 + + + Y + K+KY + + +R + +R + S ++ +A ++ Sbjct: 242 PVPQEVRERYEQAKEKYDAVNKEWVEADRSYGSDISSYSRAYRQARVSYENALANKKAAE 301 Query: 100 GMTKAEIEALEREIARLKAK 119 ++ L+ EI + + Sbjct: 302 NRASRNLDTLQDEITGQQIQ 321 >gi|225428352|ref|XP_002280049.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 1019 Score = 35.8 bits (81), Expect = 1.9, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 4/84 (4%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIR---ESYHQY-YLKNKD 58 R E R R E ++ K+R+K ++ R + K + + E K+++ Sbjct: 30 RDKEKKEHRESRRSEREKSSDDKHREKERDKHRDKRDKERTRSSRDDERDRARERTKDRE 89 Query: 59 KYREYKRRYYLKNRDKMREKARQS 82 K R+ R+K RE+ ++S Sbjct: 90 KERDRDHEGRDAEREKERERDKES 113 >gi|300637996|gb|ADK26162.1| Ycf1 [Asimina obovata] Length = 598 Score = 35.8 bits (81), Expect = 1.9, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 54/124 (43%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYY-LKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYK 64 +ER L +R+ ++ K ++++ ++ + + K K+++ + + K + + K Sbjct: 281 KAQERYLRQRKKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNKKEKERLNKK 340 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 341 EKERLNKKEKERLNKKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 400 Query: 125 IYKR 128 +R Sbjct: 401 RKRR 404 Score = 34.3 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 40/93 (43%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER R+ K K+++ + + + + + K + L ++K R Sbjct: 272 KDYEIFNFTKAQERYLRQRKKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNK 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + K E E L ++ Sbjct: 332 KEKERLNKKEKERLNKKEKERLNKKEKEQLNQK 364 >gi|300637984|gb|ADK26156.1| Ycf1 [Asimina angustifolia] gi|300638020|gb|ADK26174.1| Ycf1 [Asimina pygmaea] Length = 598 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 54/124 (43%), Gaps = 2/124 (1%) Query: 7 TPEERMLCRREYKRRYY-LKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYK 64 +ER L +R+ ++ K ++++ ++ + + K K+++ + + + + + K Sbjct: 281 KAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNQKEKERLNQK 340 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 + L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 341 EKERLNKKEKERLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKEL 400 Query: 125 IYKR 128 +R Sbjct: 401 RKRR 404 >gi|224083075|ref|XP_002188579.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog, Drosophila) [Taeniopygia guttata] Length = 3849 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 40/102 (39%), Gaps = 15/102 (14%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 ++ + R +KDK + + KDK RE R +N+ RE ++ RK Sbjct: 715 ERGKNKDRAVEELSKDKDADKGLE-----KDKTREKDREREKENK---RESKKEKRRKGS 766 Query: 88 SKDSWIA-------PEEPMGMTKAEIEALEREIARLKAKPIE 122 S A P+E + A + ++ R K+ ++ Sbjct: 767 EIQSSTALFPVGKMPKEKVSEDAAATSSAKKPAGRKKSTAVD 808 >gi|332288399|ref|YP_004419251.1| hypothetical protein UMN179_00317 [Gallibacterium anatis UMN179] gi|330431295|gb|AEC16354.1| hypothetical protein UMN179_00317 [Gallibacterium anatis UMN179] Length = 1336 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 44/110 (40%), Gaps = 13/110 (11%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYL-KNKDKYRE-------YKRRYYL 69 Y+ +Y L+++ K+ + YY +K K + Y Sbjct: 382 YQWKYEQTKDTTWLDKKANLIKGYKELVSNYNKLYYQPDSKTKINDIEAYFKYKHNEYVE 441 Query: 70 KNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 KN + + EK R + KL E + + K ++ E+EIA+LK++ Sbjct: 442 KNINSITEKNRITTDKLTQ-----DLEAALNINKNAVKDKEKEIAKLKSE 486 >gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum] Length = 2221 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 16/116 (13%) Query: 16 REYKRRYYLKNR---DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 + ++ K+R + ++ R + I+E + ++ + L+NR Sbjct: 933 KRKRQEELEKHRHEEQQRKQQEREFKRQQAAIIKEQERERRRQHMALVKA------LENR 986 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 K+ E+ R+ + L K A +E K E +E EI KP E+L + Sbjct: 987 RKLEERERKKQQLLAEKQ---ANKE----KKMEQRKMELEILSELRKPCEDLELDQ 1035 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 14/95 (14%), Positives = 37/95 (38%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R++ +R + + I E+ R R ++ ++ ++ L+ +++ ++ N++K Sbjct: 950 RKQQEREFKRQQAAIIKEQERERRRQHMALVKALENRRKLEERERKKQQLLAEKQANKEK 1009 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 E+ + L E E E + Sbjct: 1010 KMEQRKMELEILSELRKPCEDLELDQKPLPEYERI 1044 >gi|167390910|ref|XP_001739558.1| hypothetical protein [Entamoeba dispar SAW760] gi|165896725|gb|EDR24060.1| hypothetical protein, conserved [Entamoeba dispar SAW760] Length = 773 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 39/100 (39%), Gaps = 14/100 (14%) Query: 24 LKNR-DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK---- 78 KN+ +K L R + K K+ I + + + E + K +++++E+ Sbjct: 420 EKNQLEKKLLEREKLEEKAKELI---ESERRSETRKSLVEESKIEREKMKERIKERWSEE 476 Query: 79 -ARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 + +L K +E E ++++ +LK Sbjct: 477 EIKSRNERLSEKRKRERKDEI-----FEDNDIKKKTWQLK 511 >gi|291240071|ref|XP_002739944.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 1861 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 15/66 (22%), Positives = 29/66 (43%), Gaps = 5/66 (7%) Query: 21 RYYLKNRDKILERRRRRYLKNK----DKIRESYHQYYLKNKDKYREYKRRY-YLKNRDKM 75 +YY ++K+ R+ K DK + Y + + + R+Y R +NRD+ Sbjct: 113 KYYESEKNKLRSPDRKYKDDYKSTSSDKYKSRSQDRYKEKRGRSRDYDRHRDRSRNRDRG 172 Query: 76 REKARQ 81 R ++ Sbjct: 173 RSHDKE 178 >gi|156027200|gb|ABU42272.1| erythrocyte membrane protein [Plasmodium falciparum] Length = 126 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 28/63 (44%), Gaps = 4/63 (6%) Query: 19 KRRYYLKNR----DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 K Y N+ DK+ E+ ++ + K D + + Y + D Y + + ++ NR Sbjct: 15 KDLYRRDNKKVKTDKLQEQLKKYFQKIHDGLTNGVKERYQDDGDNYFQLREDWWDANRAT 74 Query: 75 MRE 77 + E Sbjct: 75 IWE 77 >gi|326921540|ref|XP_003207015.1| PREDICTED: WD repeat-containing protein 60-like [Meleagris gallopavo] Length = 927 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 47/120 (39%), Gaps = 19/120 (15%) Query: 24 LKNRDKILERRRRRYL--KNKDKIRESYHQYYLKNKDKYREYKR-------RYYLKNRDK 74 K+R+K L++ + + +KD+ ++ K++ RE R + + +++ Sbjct: 41 RKHREKKLQKEKVGHDIGTHKDEYKDRGKS-----KERTRERLRSVERDRDKMRERRKER 95 Query: 75 MREKARQSY-----RKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKRG 129 R++ R+ + K+ + P++ + E + + + + L G Sbjct: 96 HRDQDRERHQDKLKESSLEKERYSVPKDKDRESNREHDKMHKRHEIKQTDMQNNLKSFEG 155 Score = 34.3 bits (77), Expect = 5.5, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 43/117 (36%), Gaps = 8/117 (6%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 G+ ER+ + + + +++ ++ R R+ +K K + Y KDK R Sbjct: 68 GKSKERTRERLRSVERDRDKMRERRKERHRDQDRERHQ-DKLKESSLEKERYSVPKDKDR 126 Query: 62 EYKRRYYLKNR-DKMREKARQSYRKLYS------KDSWIAPEEPMGMTKAEIEALER 111 E R + ++ ++++ Q+ K + +G + E + Sbjct: 127 ESNREHDKMHKRHEIKQTDMQNNLKSFEGRDKGHHRRREKSRHRLGKDRLSKERSHK 183 Score = 33.9 bits (76), Expect = 7.5, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 40/109 (36%), Gaps = 11/109 (10%) Query: 23 YLKNRDKILERRRRRYLKNKDKIRESYHQY-------YLKNKDKYREYKRRYYLKNRDKM 75 K++++ ER R +++DK+RE + ++K K ++ Y +DK Sbjct: 67 RGKSKERTRERLRS-VERDRDKMRERRKERHRDQDRERHQDKLKESSLEKERYSVPKDKD 125 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 RE R+ + M E ++ R + K L Sbjct: 126 RESNREH---DKMHKRHEIKQTDMQNNLKSFEGRDKGHHRRREKSRHRL 171 >gi|195546614|ref|YP_002117642.1| p084 [Rhizobium phage 16-3] gi|102642554|gb|ABF71335.1| p084 [Rhizobium phage 16-3] Length = 224 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 29/60 (48%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 DK+ ++R+ + +D + Y ++ ++YR R N D+ EK R +Y++ Sbjct: 54 DKLRSEQKRQTQEYRDHYNAWARKKYAEDPERYRLKVRILRESNIDRALEKERATYKRTK 113 >gi|170717234|ref|YP_001784352.1| YadA domain-containing protein [Haemophilus somnus 2336] gi|168825363|gb|ACA30734.1| YadA domain protein [Haemophilus somnus 2336] Length = 3749 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 49/110 (44%), Gaps = 11/110 (10%) Query: 7 TPEERMLCRREYK-RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR 65 E ++L +E + Y ++ + + R+ +N ++I K K Y+R Sbjct: 839 LDELKLLVTKEQDYKNYAKVKKEYLKLKARK--ERNNEEINL------ENIKSKVEAYER 890 Query: 66 RY--YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREI 113 +Y + +K++ + ++Y+ L ++++ PE K +E ++I Sbjct: 891 KYSDFKNVAEKLKTEDEKNYKLLLTQNAKQDPELEKKKRKEWLEQAYKDI 940 >gi|242011262|ref|XP_002426374.1| pnuts protein, putative [Pediculus humanus corporis] gi|212510451|gb|EEB13636.1| pnuts protein, putative [Pediculus humanus corporis] Length = 1173 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 50/103 (48%), Gaps = 5/103 (4%) Query: 17 EYKRRYY-LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR-RYYLKNRDK 74 + + Y KNRDK +R + +KD+ + + + +K + R+ ++ +Y K +D Sbjct: 390 KNSKEYDSEKNRDKSKDR-KSSRDSDKDRKKSDHKSH-KGDKSRERDREKDKYKNKEKDC 447 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 ++EK ++S K K+ + +E ++ +++ L+ Sbjct: 448 LKEKHKES-DKSKLKEKYGKEKEGGKTSEKVQTQADKDKDTLE 489 >gi|304310507|ref|YP_003810105.1| Translation initiation factor IF-2 [gamma proteobacterium HdN1] gi|301796240|emb|CBL44448.1| Translation initiation factor IF-2 [gamma proteobacterium HdN1] Length = 901 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 43/107 (40%), Gaps = 8/107 (7%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNK--DKY 60 RK + EE E K+ +NR K E +R+ +++ +I E + ++ Sbjct: 194 RKEKSAEEDRRSAEEQKK--REENRRKAEEELKRKTMEDAKRIAEQLESRRSDSPSAEEL 251 Query: 61 REYK----RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTK 103 + RR Y + D+ +++R+ K S PE + + Sbjct: 252 LDSSDLIVRRAYEDSFDEEEKQSRRRRGSNRKKSSKSQPERKLVQRE 298 >gi|242075108|ref|XP_002447490.1| hypothetical protein SORBIDRAFT_06g001810 [Sorghum bicolor] gi|241938673|gb|EES11818.1| hypothetical protein SORBIDRAFT_06g001810 [Sorghum bicolor] Length = 602 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 1/56 (1%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR 65 ++++ +R+ +R +DK E+ R + + E + + K K++ RE R Sbjct: 136 QQIVKKRKLERSQTEHEKDKGKEKERN-EEHERRPVGERERERHDKEKEQVREKDR 190 >gi|301608384|ref|XP_002933783.1| PREDICTED: MAX gene-associated protein [Xenopus (Silurana) tropicalis] Length = 2922 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 27/104 (25%), Positives = 41/104 (39%), Gaps = 2/104 (1%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 K E RE K+ +N D+ E ++R +N DK RE +N DK ++ Sbjct: 2015 KEDKQHENDDKSREDKQHSIEENDDESREDKQRGIEENNDKSREDKQHGIEENDDKSKDD 2074 Query: 64 KRRYYLKNRDKMREKARQSYRKL--YSKDSWIAPEEPMGMTKAE 105 K+ +N DK +E + + SKD E E Sbjct: 2075 KQHGIEENDDKSKEDKQHGIEENDDKSKDDKQHGIEENDDKSKE 2118 >gi|161376725|ref|YP_001569007.1| hypothetical protein BMQ_pBM30007 [Bacillus megaterium QM B1551] gi|52352749|gb|AAU43808.1| conserved hypothetical protein [Bacillus megaterium QM B1551] Length = 253 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 31/60 (51%), Gaps = 2/60 (3%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY--LKNRDKM 75 K +Y ++D++ R+ + NKD+++ ++ NKD+ + K + KN+ K+ Sbjct: 2 RKDKYSSSDKDEVKVRKNKNSSSNKDEVKVRKNKNSSSNKDEVKVKKNKNSSLSKNKAKI 61 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 34/80 (42%), Gaps = 14/80 (17%) Query: 33 RRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 R+ + +KD+++ ++ NKD+ + K + N+D+++ K Sbjct: 2 RKDKYSSSDKDEVKVRKNKNSSSNKDEVKVRKNKNSSSNKDEVKVKK------------- 48 Query: 93 IAPEEPMGMTKAEIEALERE 112 + KA+IE+ +E Sbjct: 49 -NKNSSLSKNKAKIESAHQE 67 >gi|308484047|ref|XP_003104224.1| CRE-EIF-3.B protein [Caenorhabditis remanei] gi|308258193|gb|EFP02146.1| CRE-EIF-3.B protein [Caenorhabditis remanei] Length = 725 Score = 35.4 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 46/106 (43%), Gaps = 13/106 (12%) Query: 1 MGRKVLTPEERMLCRREYKR----RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 +G ++ T + R LCR+ R ++ + K+ E++++ KN K + + + Sbjct: 586 LGYRIFTFQGRELCRKNLDRLAQFKWRPRPPVKLSEQKQKEIKKNLKKTAAKFIK--QDD 643 Query: 57 KDKYREYKRRYYLKNR-----DKMREKARQSYRKLYSKDSWIAPEE 97 +K R + + + D +R + R+ +++ IA + Sbjct: 644 DEKCRASQEVVEKRRKIMAAFDIIRNRNREQLE--ATREQRIALRK 687 >gi|224061597|ref|XP_002300559.1| predicted protein [Populus trichocarpa] gi|222847817|gb|EEE85364.1| predicted protein [Populus trichocarpa] Length = 1716 Score = 35.4 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 45/120 (37%), Gaps = 17/120 (14%) Query: 21 RYYLKNR----DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 +Y N+ +K+ + R+ ++ E + N+ K E K + +N K Sbjct: 305 KYEEANKKFETEKLKVTKERKRADSEMAKAEVKKKLAEANRKKLAEEKS--HTENLSKQL 362 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMTKAE-----------IEALEREIARLKAKPIEELI 125 E RQ +L + + E + AE +E +++ KAK +EE Sbjct: 363 EDVRQRIEELQKAEEYQLQLESLKKEAAESKSKLASETLKLEDANKKLEAEKAKVMEERK 422 >gi|198285443|gb|ACH85260.1| DEAD (Asp-Glu-Ala-Asp) box polypeptide 55 [Salmo salar] Length = 606 Score = 35.4 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 1/94 (1%) Query: 36 RRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ-SYRKLYSKDSWIA 94 R K ++K+R+ + +++ +R+ KN+ ++K R+ RK +K + Sbjct: 484 RYKDKQREKLRQKLLIELKEKREERAAAPKRHNDKNKAWSKQKNRKDRRRKTAAKRNRDE 543 Query: 95 PEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 E E+ R + RLK I E +++ Sbjct: 544 GSEMDDEDMKELLKDTRLLKRLKKGQISEEDFEK 577 >gi|198464513|ref|XP_001353254.2| GA13284 [Drosophila pseudoobscura pseudoobscura] gi|198149751|gb|EAL30757.2| GA13284 [Drosophila pseudoobscura pseudoobscura] Length = 1138 Score = 35.4 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 29/87 (33%), Gaps = 6/87 (6%) Query: 23 YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK--NRDKMREKAR 80 + + I ++ Y NKD I ++ +E +R + N K + + Sbjct: 856 EERRKTGIQAWEQKYYELNKDIIEAK----REAEEETRKEMERIEREEQLNTRKRQAEDE 911 Query: 81 QSYRKLYSKDSWIAPEEPMGMTKAEIE 107 + +A E M + E+ Sbjct: 912 EENADNSKPTQGLAYNERMAVLWDELN 938 >gi|224048581|ref|XP_002192240.1| PREDICTED: similar to KIAA0244 [Taeniopygia guttata] Length = 2069 Score = 35.4 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 47/121 (38%), Gaps = 4/121 (3%) Query: 2 GRKVL-TPEERMLCRREYKRRYYLKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDK 59 GR+ + + +R Y + K + K ++ E + +N++K Sbjct: 1924 GREEQHLERGFSRGKNDRQRLYNETHHQKKDRHEKEWSNEKYWEQDSERNRR-RDRNQEK 1982 Query: 60 YREYK-RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 RE K R +++++ R S PE+ K++ +A E++ R K+ Sbjct: 1983 ERERKSREEGQRDKERARSPHSDRAADGKSPRETRNPEKKTEKPKSDEQAHEKDKEREKS 2042 Query: 119 K 119 K Sbjct: 2043 K 2043 >gi|256086894|ref|XP_002579619.1| serine-rich repeat protein [Schistosoma mansoni] gi|238665082|emb|CAZ35858.1| serine-rich repeat protein , putative [Schistosoma mansoni] Length = 662 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 13/87 (14%), Positives = 35/87 (40%), Gaps = 4/87 (4%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYL----KNKDKYREYKRRYYLKNRDKMREKARQSYR 84 K+ E + +K++ + ++ +NK + + + Y +KN+ +++K ++ Sbjct: 56 KLKEHEKEMEEIAAEKLQTTKNKARKLDTIENKKRLKTKEAEYQVKNKQGIKDKRSKNKV 115 Query: 85 KLYSKDSWIAPEEPMGMTKAEIEALER 111 + E A+I +R Sbjct: 116 NETEGKNDENNERTKTTKPAKINTQKR 142 >gi|189347580|ref|YP_001944109.1| hypothetical protein Clim_2106 [Chlorobium limicola DSM 245] gi|189341727|gb|ACD91130.1| conserved hypothetical protein [Chlorobium limicola DSM 245] Length = 1327 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 11/66 (16%), Positives = 30/66 (45%), Gaps = 6/66 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKI------RESYHQYYLKNKDKYREYKRRYYL 69 R R + +N+D + +R++ N+ K + + + +++ ++ R Y Sbjct: 710 RRLMRSHSRQNQDFLKAWHKRKHEWNEKKAFFNGRRHKRTSKRWKLKEERRKQNNRSNYE 769 Query: 70 KNRDKM 75 KN++ + Sbjct: 770 KNKELI 775 >gi|332295592|ref|YP_004437515.1| Smr protein/MutS2 [Thermodesulfobium narugense DSM 14796] gi|332178695|gb|AEE14384.1| Smr protein/MutS2 [Thermodesulfobium narugense DSM 14796] Length = 774 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 19/101 (18%), Positives = 36/101 (35%), Gaps = 11/101 (10%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR-----------EYKRR 66 YK+ L N++KILE R+ + +++ + Y + + E KR Sbjct: 525 YKKVEELNNKEKILENERKFLKEKFEQLERDMKEQYEEKISQLNKLLSSVNLVAKELKRN 584 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIE 107 +D + E ++ + + E K EI Sbjct: 585 LNQPKKDLVLESLKKWEELSSLINEKVLKREETSKDKIEIN 625 >gi|308159406|gb|EFO61938.1| Hypothetical protein GLP15_489 [Giardia lamblia P15] Length = 546 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 6/85 (7%) Query: 35 RRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIA 94 +R + NKD++ Y ++ + +EY R Y + ++E R K+ ++ + Sbjct: 214 KRPHSMNKDELLTFKQSYKQRSLELLKEYDRIYEI----MIKEYGRDRVEKMIRRNERSS 269 Query: 95 PEEPMGMTK--AEIEALEREIARLK 117 + K E LER++ ++ Sbjct: 270 LFSRSSLRKEARNFEKLERDLEEME 294 >gi|195163027|ref|XP_002022355.1| GL26558 [Drosophila persimilis] gi|194104316|gb|EDW26359.1| GL26558 [Drosophila persimilis] Length = 1138 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 29/87 (33%), Gaps = 6/87 (6%) Query: 23 YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK--NRDKMREKAR 80 + + I ++ Y NKD I ++ +E +R + N K + + Sbjct: 856 EERRKTGIQAWEQKYYELNKDIIEAK----REAEEETRKEMERIEREEQLNTRKRQAEDE 911 Query: 81 QSYRKLYSKDSWIAPEEPMGMTKAEIE 107 + +A E M + E+ Sbjct: 912 EENADNSKPTQGLAYNERMAVLWDELN 938 >gi|160693740|gb|ABX46570.1| non-structural polyprotein [Turkey astrovirus] Length = 1125 Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 33/85 (38%), Gaps = 6/85 (7%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 +K E + NK I+E + NK R+ KNR+KM+E + + Sbjct: 136 EKTREAKENLEKANK--IQEQLDKVIESNK----ALHRKIQEKNREKMQEYMVRLHNTQK 189 Query: 88 SKDSWIAPEEPMGMTKAEIEALERE 112 +D W+ + ++ +E Sbjct: 190 DRDDWVQRCSRLEQENVTLQKRLKE 214 >gi|268566341|ref|XP_002639696.1| C. briggsae CBR-TAG-214 protein [Caenorhabditis briggsae] Length = 1173 Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 24/117 (20%), Positives = 44/117 (37%), Gaps = 5/117 (4%) Query: 17 EYKRRYYLKNRDKILERRRR--RYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK-NRD 73 E +R+ KNR + E+ + RY+ +DK +E D + + N+D Sbjct: 881 EQERKERKKNRGESKEKHEKERRYIDEQDKKKERKRDRNADEYDSDAPQVKIKREEKNKD 940 Query: 74 KMREKARQSYRKLYSKD--SWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 +++ R+ KD E + + ERE K K E+ ++ Sbjct: 941 DQQDRDRKKEDPENGKDWRRKERDREVKDKKHDKYKEKEREDDNEKVKQREKEKERQ 997 >gi|160693764|gb|ABX46588.1| non-structural polyprotein [Turkey astrovirus] Length = 1121 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 33/85 (38%), Gaps = 6/85 (7%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 +K E + NK I+E + NK R+ KNR+KM+E + + Sbjct: 136 EKSREAKENLEKANK--IQEQLDKVIESNK----ALHRKIQEKNREKMQEYMVRLHNTQK 189 Query: 88 SKDSWIAPEEPMGMTKAEIEALERE 112 +D W+ + ++ +E Sbjct: 190 DRDDWVQRCSRLEQENVNLQKRLKE 214 >gi|301310090|ref|ZP_07216029.1| MutS2 family protein [Bacteroides sp. 20_3] gi|300831664|gb|EFK62295.1| MutS2 family protein [Bacteroides sp. 20_3] Length = 822 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 14/111 (12%), Positives = 45/111 (40%), Gaps = 9/111 (8%) Query: 18 YKRRYYLKNRDKILERRRR------RYLKNKDKIRESYHQYYLKNK---DKYREYKRRYY 68 +RY+ R I ++ ++ RY ++ + + + + + K + Sbjct: 538 RDKRYWESKRQNIRQQEKKLEDVTSRYEQDLEAVNKQRKEIIREAKAEAQRILAEANAKI 597 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 +++E + + ++ + + + T+ E + + R++A+LK + Sbjct: 598 ENTVREIKEAQAEKEQTKLARKALEEFKNSVMATEEEDDKIARKMAKLKER 648 >gi|150007883|ref|YP_001302626.1| DNA mismatch repair protein MutS [Parabacteroides distasonis ATCC 8503] gi|256840140|ref|ZP_05545649.1| DNA mismatch repair protein MutS [Parabacteroides sp. D13] gi|262381617|ref|ZP_06074755.1| DNA mismatch repair protein MutS [Bacteroides sp. 2_1_33B] gi|298376756|ref|ZP_06986711.1| MutS2 family protein [Bacteroides sp. 3_1_19] gi|149936307|gb|ABR43004.1| DNA mismatch repair protein MutS [Parabacteroides distasonis ATCC 8503] gi|256739070|gb|EEU52395.1| DNA mismatch repair protein MutS [Parabacteroides sp. D13] gi|262296794|gb|EEY84724.1| DNA mismatch repair protein MutS [Bacteroides sp. 2_1_33B] gi|298266634|gb|EFI08292.1| MutS2 family protein [Bacteroides sp. 3_1_19] Length = 822 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 14/111 (12%), Positives = 45/111 (40%), Gaps = 9/111 (8%) Query: 18 YKRRYYLKNRDKILERRRR------RYLKNKDKIRESYHQYYLKNK---DKYREYKRRYY 68 +RY+ R I ++ ++ RY ++ + + + + + K + Sbjct: 538 RDKRYWESKRQNIRQQEKKLEDVTSRYEQDLEAVNKQRKEIIREAKAEAQRILAEANAKI 597 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 +++E + + ++ + + + T+ E + + R++A+LK + Sbjct: 598 ENTVREIKEAQAEKEQTKLARKALEEFKNSVMATEEEDDKIARKMAKLKER 648 >gi|312232057|ref|NP_987087.2| non-structural polyprotein 1AB [Turkey astrovirus 2] Length = 1638 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 17/85 (20%), Positives = 33/85 (38%), Gaps = 6/85 (7%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 +K E + NK I+E + NK R+ +NR+KM+E + + Sbjct: 136 EKTREAKENLEKANK--IQEQLDKVIESNK----ALHRKIQERNREKMQEYMVRLHNTQK 189 Query: 88 SKDSWIAPEEPMGMTKAEIEALERE 112 +D W+ + ++ +E Sbjct: 190 DRDDWVQRCSRLEQENVTLQKRLKE 214 >gi|255013416|ref|ZP_05285542.1| DNA mismatch repair protein MutS [Bacteroides sp. 2_1_7] Length = 822 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 14/111 (12%), Positives = 45/111 (40%), Gaps = 9/111 (8%) Query: 18 YKRRYYLKNRDKILERRRR------RYLKNKDKIRESYHQYYLKNK---DKYREYKRRYY 68 +RY+ R I ++ ++ RY ++ + + + + + K + Sbjct: 538 RDKRYWESKRQNIRQQEKKLEDVTSRYEQDLEAVNKQRKEIIREAKAEAQRILAEANAKI 597 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 +++E + + ++ + + + T+ E + + R++A+LK + Sbjct: 598 ENTVREIKEAQAEKEQTKLARKALEEFKNSVMATEEEDDKIARKMAKLKER 648 >gi|296809211|ref|XP_002844944.1| SNF2 family ATP-dependent chromatin-remodeling factor snf21 [Arthroderma otae CBS 113480] gi|238844427|gb|EEQ34089.1| SNF2 family ATP-dependent chromatin-remodeling factor snf21 [Arthroderma otae CBS 113480] Length = 1660 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 15/121 (12%) Query: 9 EERMLCRRE-----YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + R R KR+ N L R K I + L + E Sbjct: 311 KGRKRAREAEALDNQKRKRRRANFLGKLARVFE-----KQAIAGTERFARLHEAEVIEEV 365 Query: 64 KRRYYLKNRDKMREKARQSYRKL---YSKDSWIAPEEPMGMTK--AEIEALEREIARLKA 118 +++ + +++ ++ R+ R+ + A ++ + + AE E L+REIAR + Sbjct: 366 QQKEIMDEKERKKDMQRKRRRENTVRAEMEKIQAAQKKVSKMEDDAEKEKLQREIARSRK 425 Query: 119 K 119 K Sbjct: 426 K 426 >gi|302849266|ref|XP_002956163.1| hypothetical protein VOLCADRAFT_97128 [Volvox carteri f. nagariensis] gi|300258466|gb|EFJ42702.1| hypothetical protein VOLCADRAFT_97128 [Volvox carteri f. nagariensis] Length = 2003 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 14/102 (13%), Positives = 39/102 (38%), Gaps = 9/102 (8%) Query: 15 RREYKRRYYLKNRDKILE----RRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK 70 RE R + N +++ + ++ N ++ E+ + + E + + Sbjct: 1388 DRERARAEHEANINRLKAEWRAKTQQTLELNDARLTEAESMH-----EALVEQVKETNKR 1442 Query: 71 NRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++++ E RQ + S +A + + + E + + E Sbjct: 1443 HKEEHTELLRQRKEVEATNLSRLAEAKAVHKVRLESRSADNE 1484 >gi|156052809|ref|XP_001592331.1| hypothetical protein SS1G_06572 [Sclerotinia sclerotiorum 1980] gi|171704395|sp|A7EMM3|NST1_SCLS1 RecName: Full=Stress response protein nst1 gi|154704350|gb|EDO04089.1| hypothetical protein SS1G_06572 [Sclerotinia sclerotiorum 1980 UF-70] Length = 1171 Score = 35.0 bits (79), Expect = 3.0, Method: Composition-based stats. Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 1/54 (1%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 +K E +R + +N+ K + K E +RR + R++ E+ R+ Sbjct: 602 EKKAEEQRLKREENRKKKEAQKKADEEERVRKEAEKQRRLQEQ-RERQAEQERK 654 Score = 33.9 bits (76), Expect = 6.7, Method: Composition-based stats. Identities = 8/53 (15%), Positives = 24/53 (45%), Gaps = 1/53 (1%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 + E +R +NR K E +++ + + + + + +++ E +R+ Sbjct: 603 KKAEEQRLKREENRKK-KEAQKKADEEERVRKEAEKQRRLQEQRERQAEQERK 654 >gi|261260073|sp|A8WWH5|POP1_CAEBR RecName: Full=Protein pop-1; AltName: Full=Posterior pharynx defect protein 1 gi|309364647|emb|CAP24984.2| CBR-POP-1 protein [Caenorhabditis briggsae AF16] Length = 439 Score = 35.0 bits (79), Expect = 3.0, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 33/69 (47%), Gaps = 4/69 (5%) Query: 22 YYLKNRDKILERRRRRYLKNKDKIRESYHQYY---LKNKDKYREYKRRYYLKNRDKMRE- 77 Y +NR K+LE ++ + +E +++ + + KY E ++ +++K + Sbjct: 209 YMKENRPKLLEEVGNDQKQSAELNKELGKRWHDLPKEEQQKYFEMAKKDRESHKEKYPQW 268 Query: 78 KARQSYRKL 86 AR++Y Sbjct: 269 SARENYAVN 277 >gi|34365518|tpg|DAA01285.1| TPA_exp: replicase/helicase/endonuclease [Danio rerio] Length = 2181 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 3/104 (2%) Query: 8 PEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY 67 +E + + + K E+ + R KN N+ ++ Sbjct: 186 AQENKAMMSKKNKSRRRREMRKTTEKFKSRKSKNTTASYAMKKHKQRDNERDRYAACLQF 245 Query: 68 YLKNRDKMREKARQS-YRKLYSKD--SWIAPEEPMGMTKAEIEA 108 ++ +REK Q + + KD + +E + K E A Sbjct: 246 RTNKKEAVREKYSQDAHFRNKKKDCSNRRYKDEIVLSKKKEYMA 289 >gi|307187162|gb|EFN72405.1| Death-inducer obliterator 1 [Camponotus floridanus] Length = 2322 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 41/106 (38%), Gaps = 21/106 (19%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYL---------------KNKDKIR 46 +K +ER +E R RD + + + R K +DK + Sbjct: 1493 AKKSKGMDERRRKDKERGRE-----RDGVKSKSKDRRERSHSRHRRSRDRDRSKTRDKSK 1547 Query: 47 ESYHQYYLKNKDKYREYKR-RYYLKNRDKMREKARQSYRKLYSKDS 91 E + +DK RE ++ + +N+D+ + ++S + K+ Sbjct: 1548 ERKSRNKESRRDKDREREKDKDRDRNKDRDKTSRKESRERGRQKEK 1593 Score = 34.6 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 41/126 (32%), Gaps = 26/126 (20%) Query: 4 KVLTPEERMLCRREYKR---RYYLKNRDKILERRRRRYLKNKD----------------- 43 K R + + K++++ ER K+KD Sbjct: 1481 KDKEDRLRSRMDAKKSKGMDERRRKDKERGRERDGV-KSKSKDRRERSHSRHRRSRDRDR 1539 Query: 44 -KIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMT 102 K R+ + +NK+ R+ R K++D+ R K R K K+S + Sbjct: 1540 SKTRDKSKERKSRNKESRRDKDRE-REKDKDRDRNKDR---DKTSRKESRERGRQKEKDR 1595 Query: 103 KAEIEA 108 ++ Sbjct: 1596 HKSSDS 1601 Score = 33.9 bits (76), Expect = 8.6, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 50/115 (43%), Gaps = 8/115 (6%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 R+ + +N++ ++ R + KDK R+ K++DK + R + ++ Sbjct: 1541 KTRDKSKERKSRNKESRRDKDR---EREKDKDRD-----RNKDRDKTSRKESRERGRQKE 1592 Query: 74 KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 K R K+ S+ + S+ + + + ++E + KPIE+ +++ Sbjct: 1593 KDRHKSSDSWMRNRSESRESKDVDKKEEDRKKETEKKKETSPSVEKPIEDRLWRH 1647 >gi|156397400|ref|XP_001637879.1| predicted protein [Nematostella vectensis] gi|156224995|gb|EDO45816.1| predicted protein [Nematostella vectensis] Length = 1034 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 38/103 (36%), Gaps = 6/103 (5%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R+ KR+ K++++ + K KD+ ++ L +K ++ KR + Sbjct: 307 RKPAKRKEREKHKEEQANDSNWKQEKGKDREKKQEKIGKLVKGEKSKKQKRGEREDHGKT 366 Query: 75 MREKARQ------SYRKLYSKDSWIAPEEPMGMTKAEIEALER 111 ++ + + SK +P+ K E + R Sbjct: 367 SKDNRKSTSPKEPRKTQNESKHQRESPDAKSTANKKESQTGSR 409 >gi|326673092|ref|XP_686400.5| PREDICTED: hypothetical protein LOC553518 [Danio rerio] Length = 1738 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 3/82 (3%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKYR 61 RK E R +Y ++R+K ER ++ KD+ E + + + + R Sbjct: 1658 RKDSRDREHYDKHHHKSR-HYERDREKHRERSHSHKERHSKDERYEKHRERRHSSHHRDR 1716 Query: 62 EYKRRYYLKNRDKMREKARQSY 83 + R + +K R+ +R Sbjct: 1717 DKHRS-RDSDHEKHRKDSRDRR 1737 Score = 33.9 bits (76), Expect = 7.9, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 13/90 (14%) Query: 8 PEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYL-----KNKDKYRE 62 EE + Y + + R Y ++++K RE H + + +K+RE Sbjct: 1653 SEETHRKDSRDREHYDKHH------HKSRHYERDREKHRERSHSHKERHSKDERYEKHRE 1706 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 + + ++RDK R +R S + + KDS Sbjct: 1707 RRHSSHHRDRDKHR--SRDSDHEKHRKDSR 1734 >gi|281211768|gb|EFA85930.1| hypothetical protein PPL_01163 [Polysphondylium pallidum PN500] Length = 1358 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 50/113 (44%), Gaps = 2/113 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R E ++ K +++ +++ + + + K E + K +++ ++ + + ++ Sbjct: 730 RIEKQKELVQKQKEEKERKQKLKEEELRMKKLEEQKKMEEKQRERQKKKEDEARKREEER 789 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 + +Q K + K +E +AE++ ++ + R K K +E+ + Sbjct: 790 AKAAEKQLDPKAHEKSEKKKRQEQANR-EAELKKQQQRLDRNK-KALEQSKQQ 840 Score = 34.3 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 10/104 (9%), Positives = 43/104 (41%), Gaps = 4/104 (3%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 ++ ++ ++ + +++++++ + K + + +K ++ ++ + + + + K Sbjct: 722 QKARSKKERIEKQKELVQKQKEEKER---KQKLKEEELRMKKLEEQKKMEEKQRERQKKK 778 Query: 75 MREKARQSYRKLYSKDSWIAPE-EPMGMTKAEIEALEREIARLK 117 E ++ + + + + P+ K E RE K Sbjct: 779 EDEARKREEERAKAAEKQLDPKAHEKSEKKKRQEQANREAELKK 822 Score = 33.5 bits (75), Expect = 9.6, Method: Composition-based stats. Identities = 15/116 (12%), Positives = 49/116 (42%), Gaps = 3/116 (2%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 +K L +++ R+ K + + R K LE +++ K +++ ++ + +++ R Sbjct: 734 QKELVQKQKEEKERKQKLK-EEELRMKKLEEQKKMEEKQRERQKKKEDEARK--REEERA 790 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 L + + + ++ + + ++ + K +E +++ +KA Sbjct: 791 KAAEKQLDPKAHEKSEKKKRQEQANREAELKKQQQRLDRNKKALEQSKQQTTAVKA 846 >gi|326468559|gb|EGD92568.1| SNF2 family helicase/ATPase [Trichophyton tonsurans CBS 112818] Length = 1686 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 15/121 (12%) Query: 9 EERMLCRR-----EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + R R KR+ N L R K I + L + E Sbjct: 337 KGRKRAREMEALDNQKRKRRRANFLGKLARVFE-----KQAIAGTERFARLHEAEVIEEV 391 Query: 64 KRRYYLKNRDKMREKARQSYRKL---YSKDSWIAPEEPMGMTK--AEIEALEREIARLKA 118 +++ + +++ ++ R+ R+ + A ++ + + AE E L+REIAR + Sbjct: 392 QQKEIMDEKERKKDMQRKRRRENTVRAEMEKIQAAQKKVSKMEDDAEKEKLQREIARSRK 451 Query: 119 K 119 K Sbjct: 452 K 452 >gi|168030227|ref|XP_001767625.1| transcription initiation factor TFIID, subunit TAF1 [Physcomitrella patens subsp. patens] gi|162681154|gb|EDQ67584.1| transcription initiation factor TFIID, subunit TAF1 [Physcomitrella patens subsp. patens] Length = 2228 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 35/85 (41%), Gaps = 4/85 (4%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 R+ + EER +E ++ + E+ +R+ + + + +++ R Sbjct: 1956 ARERVELEERQRKEKEKND--RDRHSQHMKEKEKRQREREAKLREAAELEARKAQEERER 2013 Query: 62 EYKRRYYLK--NRDKMREKARQSYR 84 + + +R+K+REK+ + Sbjct: 2014 QVGEEMRQQQRDREKIREKSLKFKH 2038 >gi|149637208|ref|XP_001510761.1| PREDICTED: similar to THO complex 2 [Ornithorhynchus anatinus] Length = 1648 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 7/83 (8%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNK--DKYREYKRRYYLKNRDKMREKARQS 82 N ++ E + Y+ + I + + K DK RE R R+K EK R+ Sbjct: 1481 NNLSELKEPSAKHYINHTTPILSKSKEREVDKKDLDKSRERSRE-----REKKDEKDRKE 1535 Query: 83 YRKLYSKDSWIAPEEPMGMTKAE 105 ++ +S P++ K E Sbjct: 1536 RKRDHSNSDREVPQDSTKRRKEE 1558 >gi|326479960|gb|EGE03970.1| SNF2 family helicase/ATPase [Trichophyton equinum CBS 127.97] Length = 1688 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 15/121 (12%) Query: 9 EERMLCRR-----EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + R R KR+ N L R K I + L + E Sbjct: 339 KGRKRAREMEALDNQKRKRRRANFLGKLARVFE-----KQAIAGTERFARLHEAEVIEEV 393 Query: 64 KRRYYLKNRDKMREKARQSYRKL---YSKDSWIAPEEPMGMTK--AEIEALEREIARLKA 118 +++ + +++ ++ R+ R+ + A ++ + + AE E L+REIAR + Sbjct: 394 QQKEIMDEKERKKDMQRKRRRENTVRAEMEKIQAAQKKVSKMEDDAEKEKLQREIARSRK 453 Query: 119 K 119 K Sbjct: 454 K 454 >gi|83775599|dbj|BAE65719.1| unnamed protein product [Aspergillus oryzae] Length = 376 Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 26/104 (25%), Positives = 44/104 (42%), Gaps = 2/104 (1%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI--RESYHQYYLKNKDKYR 61 K L + L RR+ + + K R LE +R+ +N DK+ E + KNK+ Sbjct: 243 KELKSLQEDLERRDQGMKEHEKKRQHDLEEAKRKVKENNDKLAALEKDVKDAKKNKEDAE 302 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 + R K +++ E + R+ K + +E TK E Sbjct: 303 KRYRNELKKQQEERNEIVARKRREYEEKLRKLENDERDYRTKKE 346 >gi|331003230|ref|ZP_08326737.1| hypothetical protein HMPREF0491_01599 [Lachnospiraceae oral taxon 107 str. F0167] gi|330412883|gb|EGG92263.1| hypothetical protein HMPREF0491_01599 [Lachnospiraceae oral taxon 107 str. F0167] Length = 1549 Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 1/102 (0%) Query: 21 RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKAR 80 Y +K+ + +KI S +Y + DK + Y + DK A+ Sbjct: 39 EYAKDTDEKLKAAKSEYEKDVDEKINTSKSEY-EEEADKKLNATKSEYEEEADKKLNAAK 97 Query: 81 QSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIE 122 Y K ++ A E + + + + + + KP E Sbjct: 98 SEYEKESDENLDSAKSEYTKEANKILNSAKADTQKEENKPKE 139 Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 38/99 (38%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK 78 + Y + DK L + Y + DK + Y K D+ + + Y K +K+ Sbjct: 66 SKSEYEEEADKKLNATKSEYEEEADKKLNAAKSEYEKESDENLDSAKSEYTKEANKILNS 125 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 A+ +K +K + + EA+ ++I K Sbjct: 126 AKADTQKEENKPKELRLIIELDKDSLVEEAINKDIDYDK 164 >gi|327300603|ref|XP_003234994.1| SNF2 family helicase/ATPase [Trichophyton rubrum CBS 118892] gi|326462346|gb|EGD87799.1| SNF2 family helicase/ATPase [Trichophyton rubrum CBS 118892] Length = 1691 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 15/121 (12%) Query: 9 EERMLCRRE-----YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + R R KR+ N L R K I + L + E Sbjct: 342 KGRKRAREAEALDNQKRKRRRANFLGKLARVFE-----KQAIAGTERFARLHEAEVIEEV 396 Query: 64 KRRYYLKNRDKMREKARQSYRKL---YSKDSWIAPEEPMGMTK--AEIEALEREIARLKA 118 +++ + +++ ++ R+ R+ + A ++ + + AE E L+REIAR + Sbjct: 397 QQKEIMDEKERKKDMQRKRRRENTVRAEMEKIQAAQKKVSKMEDDAEKEKLQREIARSRK 456 Query: 119 K 119 K Sbjct: 457 K 457 >gi|326428465|gb|EGD74035.1| hypothetical protein PTSG_12353 [Salpingoeca sp. ATCC 50818] Length = 2540 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 48/123 (39%), Gaps = 10/123 (8%) Query: 4 KVLTPEERMLCRREYKRRYYLKNR------DKILERRRRRYLKNKDKIRESYHQYYLKNK 57 K+ T E +++ +E + +N+ +K+ ++ + +N+ + + ++ Sbjct: 323 KLATTEGQLVAEKERTKSLEAENKALRADVEKLTKQVEQLKAENEKLSKARERRSMEFHR 382 Query: 58 DKYREYKRRYYLKNRDKMREKAR--QSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIAR 115 ++ + + + + ++ E+AR Q++ K + E E + Sbjct: 383 EE--DQRVKLEKELKETKTEQARLNQAHEAETQKAKKELDAAAVRQKHLEQEVAKANDKT 440 Query: 116 LKA 118 K+ Sbjct: 441 AKS 443 >gi|270000966|gb|EEZ97413.1| hypothetical protein TcasGA2_TC011242 [Tribolium castaneum] Length = 761 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 11/111 (9%) Query: 20 RRYYLKNRDKILERRRRRYL---------KNKDKIRESYHQYYLKNKDKYREYKRRYYLK 70 + Y + R+ +LE+ + KNK + + Q +K+K + + K Sbjct: 482 KEYVEEERENLLEKMKSYRPQGTIFEICGKNKSQEYQIMKQKRSFHKEKITSFHQNQEQK 541 Query: 71 NRDK--MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 +K + K + K S+D A + + K ++E L + + K K Sbjct: 542 ENEKQLLNHKDETTLAKSSSEDIDDAFDRVITPKKRKLENLYKRNKKQKTK 592 >gi|302803755|ref|XP_002983630.1| hypothetical protein SELMODRAFT_422901 [Selaginella moellendorffii] gi|300148467|gb|EFJ15126.1| hypothetical protein SELMODRAFT_422901 [Selaginella moellendorffii] Length = 924 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 53/130 (40%), Gaps = 9/130 (6%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDK----IRESYHQYYLKNKD 58 RK E R+ E + + K ++ RR++ + + + RE + ++ Sbjct: 42 RKRKAEEARIAKEHEAQDKRMKKEMERQEAMRRKKEDQERKESEKVHREKQKELERLARE 101 Query: 59 KYREYKRRYYLKNR-----DKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREI 113 K RE +R + +K+ +K Q KL K+ +E + A A +++ Sbjct: 102 KAREEERLQRQHKKEYERMEKLLQKENQRVAKLRQKEEMRREKEATKLRAAYERATAKKL 161 Query: 114 ARLKAKPIEE 123 A+L I++ Sbjct: 162 AKLSTGLIDD 171 >gi|212535682|ref|XP_002147997.1| SNF2 family helicase/ATPase (Ino80), putative [Penicillium marneffei ATCC 18224] gi|210070396|gb|EEA24486.1| SNF2 family helicase/ATPase (Ino80), putative [Penicillium marneffei ATCC 18224] Length = 2500 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 46/126 (36%), Gaps = 16/126 (12%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDK-----ILERRRRRYLKNKDKIR----------E 47 R + E + Y K++ + ILE + + +N+ ++ Sbjct: 1122 RAEKSDVEDGTGFEAELKLYIAKSKKRAMEADILESHKNKRRRNEFLLKLSANFANGAEA 1181 Query: 48 SYHQYYLKN-KDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEI 106 ++ N ++ E +R+ +++ ++ R+ R+ + E K + Sbjct: 1182 GGERFRRANEREVIAEVQRKEVQDEKERKKDMQRKRRRENTVRVETAKKLEAERRAKRTV 1241 Query: 107 EALERE 112 + +E+E Sbjct: 1242 DPVEKE 1247 >gi|170049659|ref|XP_001857967.1| vitellogenin-A1 [Culex quinquefasciatus] gi|167871399|gb|EDS34782.1| vitellogenin-A1 [Culex quinquefasciatus] Length = 2060 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 33/81 (40%), Gaps = 4/81 (4%) Query: 19 KRRYYLKNRD---KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y N K + + Y + +K E + +Y+ + + E ++Y +N Sbjct: 936 DKQYDSSNISEDYKSADYYYKYYQQFPEKKSEYFEKYFKTH-ESQNEMNKKYNEQNSKYF 994 Query: 76 REKARQSYRKLYSKDSWIAPE 96 ++ + KL + D A E Sbjct: 995 QKYSTTRIAKLLNIDPQEAEE 1015 >gi|224002454|ref|XP_002290899.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220974321|gb|EED92651.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 1421 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 34/109 (31%), Gaps = 2/109 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 E + NR I E R ++ + K + + KN + + K D+ Sbjct: 286 EEERLKA--EANRKAIAEARAKKDSEEKAAEQARLKKEEEKNAADTKAAEEAKLKKEEDE 343 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 R ++ + + E A + ++ + K + E Sbjct: 344 RRASEAKAAEEDRIRKDTEEKRIAEAKALEEQRAADAKLKAEQEKKVAE 392 >gi|220675860|emb|CAX12571.1| novel protein similar to vertebrate PHD finger protein 3 (PHF3) [Danio rerio] Length = 1691 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 3/82 (3%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKYR 61 RK E R +Y ++R+K ER ++ KD+ E + + + + R Sbjct: 1611 RKDSRDREHYDKHHHKSR-HYERDREKHRERSHSHKERHSKDERYEKHRERRHSSHHRDR 1669 Query: 62 EYKRRYYLKNRDKMREKARQSY 83 + R + +K R+ +R Sbjct: 1670 DKHRS-RDSDHEKHRKDSRDRR 1690 Score = 33.9 bits (76), Expect = 8.7, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 13/90 (14%) Query: 8 PEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYL-----KNKDKYRE 62 EE + Y + + R Y ++++K RE H + + +K+RE Sbjct: 1606 SEETHRKDSRDREHYDKHH------HKSRHYERDREKHRERSHSHKERHSKDERYEKHRE 1659 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 + + ++RDK R +R S + + KDS Sbjct: 1660 RRHSSHHRDRDKHR--SRDSDHEKHRKDSR 1687 >gi|54289293|gb|AAV31930.1| vitellogenin C1 [Culex quinquefasciatus] Length = 2111 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 33/81 (40%), Gaps = 4/81 (4%) Query: 19 KRRYYLKNRD---KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y N K + + Y + +K E + +Y+ + + E ++Y +N Sbjct: 991 DKQYDSSNISEDYKSADYYYKYYQQFPEKKSEYFEKYFKTH-ESQNEMNKKYNEQNSKYF 1049 Query: 76 REKARQSYRKLYSKDSWIAPE 96 ++ + KL + D A E Sbjct: 1050 QKYSTTRIAKLLNIDPQEAEE 1070 >gi|170049661|ref|XP_001857970.1| vitellogenin-A1 [Culex quinquefasciatus] gi|167871400|gb|EDS34783.1| vitellogenin-A1 [Culex quinquefasciatus] Length = 2111 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 33/81 (40%), Gaps = 4/81 (4%) Query: 19 KRRYYLKNRD---KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y N K + + Y + +K E + +Y+ + + E ++Y +N Sbjct: 987 DKQYDSSNISEDYKSADYYYKYYQQFPEKKSEYFEKYFKTH-ESQNEMNKKYNEQNSKYF 1045 Query: 76 REKARQSYRKLYSKDSWIAPE 96 ++ + KL + D A E Sbjct: 1046 QKYSTTRIAKLLNIDPQEAEE 1066 >gi|288927681|ref|ZP_06421528.1| MutS2 family protein [Prevotella sp. oral taxon 317 str. F0108] gi|288330515|gb|EFC69099.1| MutS2 family protein [Prevotella sp. oral taxon 317 str. F0108] Length = 846 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 6/114 (5%) Query: 18 YKRRYYLKNRDKILERRRR------RYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 +RY+ R I ++ + RY K+ ++I ++ + K++ E R + Sbjct: 547 RDKRYWENKRATIHKQEKELQAAIERYEKDIEEIGKTRKDVLKRAKEQAEELLRESNKRI 606 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELI 125 +RE K +K E + A E K + I+ Sbjct: 607 ETTIREIKEAQAEKERTKRIREELSEFRTSVEQVDAAANDEFIAKKIEQIKRRK 660 >gi|146421944|ref|XP_001486915.1| hypothetical protein PGUG_00292 [Meyerozyma guilliermondii ATCC 6260] Length = 819 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 46/102 (45%), Gaps = 13/102 (12%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNK------DKYREYKRRYYLK 70 EYK R ++NR+ R++R +N ++ R++ + + + D KR + + Sbjct: 551 EYKERIRIENRE----RKKRWREENAERNRDNDLRLRVLKRASLMFGDNDSAEKRAWAEE 606 Query: 71 --NRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE 110 NR K++ A+Q +++ PE+ +TK + Sbjct: 607 EFNRRKVKRLAKQQKEDDKPRENR-RPEKNPELTKPLHDVFH 647 >gi|225557093|gb|EEH05380.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR] Length = 1120 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 48/121 (39%), Gaps = 3/121 (2%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + L ++ R E ++ K +D+ + + I+E +K RE Sbjct: 914 EALETSKKSSERLERQKMEIAKLKDEKAILAKELDEA-RTTIKEGGGTDAE--LEKTREE 970 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 RR +N R ++ + Y++ + + E++ LE E+A LKAK + Sbjct: 971 VRRLTKENASLQRTVQQERSQAEYTRQQYQNASTSAAQSAMELQQLEEELANLKAKAAGD 1030 Query: 124 L 124 L Sbjct: 1031 L 1031 >gi|225007707|gb|ACN78619.1| acetylcholinesterase 1 [Liposcelis bostrychophila] Length = 937 Score = 35.0 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 5/93 (5%) Query: 26 NRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYKRRYY--LKNRDK-MREKARQ 81 N++ E + Y +NK+K + + K +K ++D+ + E R Sbjct: 59 NKEWTKEWENKYYIQENKEKDMKRKIGHREKRIKNLGFHKSENRGKESSKDRGVSETGRS 118 Query: 82 SYRKLYSKDSWIAPEEPMGMTKAEIEALEREIA 114 S +++ E T AE E + +A Sbjct: 119 RDHNTKSAENYE-NIETSTQTVAEYEDFQNTVA 150 >gi|240956661|ref|XP_002400030.1| hypothetical protein IscW_ISCW000843 [Ixodes scapularis] gi|215490650|gb|EEC00293.1| hypothetical protein IscW_ISCW000843 [Ixodes scapularis] Length = 597 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 5/69 (7%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E ++ K+ +K E+R+ K K++ + K++ +E +R L +RDK Sbjct: 482 EMEKERSEKHAEKQREKRKLHREKLKERKAAEVKR-----KEEEQEKQRFLSLSDRDKRA 536 Query: 77 EKARQSYRK 85 A + + K Sbjct: 537 LAAERRFAK 545 >gi|118443035|ref|YP_877770.1| penicillin-binding protein 2 [Clostridium novyi NT] gi|118133491|gb|ABK60535.1| penicillin-binding protein 2 [Clostridium novyi NT] Length = 1049 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 3/74 (4%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 RK E R +EY + Y +K +DK + + +KDK+ E + Y KD++ Sbjct: 220 RKEANGEARF---KEYLKSYDIKEKDKKKQYEQLYKNLDKDKVYEGLLKSYNITKDRFSL 276 Query: 63 YKRRYYLKNRDKMR 76 ++R ++ +D ++ Sbjct: 277 DEKRKFIMIKDSIK 290 >gi|17552208|ref|NP_498392.1| THO Complex (transcription factor/nuclear export) subunit family member (thoc-2) [Caenorhabditis elegans] gi|14573810|gb|AAB47593.3| Tho complex (transcription factor/nuclear export) subunit protein 2 [Caenorhabditis elegans] Length = 1437 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 15/76 (19%), Positives = 31/76 (40%), Gaps = 2/76 (2%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 + + K+ + ++R ++KDK + NK+ RE + +N + Sbjct: 1281 SKKDEKESPKDTESEKGEGKKRRERSKDKETGKEKRLEKDNKE--REKDKERKRENGENQ 1338 Query: 76 REKARQSYRKLYSKDS 91 +E+ +K SK S Sbjct: 1339 KEQKILKDQKQKSKPS 1354 >gi|82704910|ref|XP_726748.1| hypothetical protein [Plasmodium yoelii yoelii str. 17XNL] gi|23482290|gb|EAA18313.1| Protein kinase domain, putative [Plasmodium yoelii yoelii] Length = 1018 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 31/77 (40%), Gaps = 11/77 (14%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 +R Y Y N+ I E R +N ++ +RE+ + +N R+ Sbjct: 666 KRLYDILYNKNNKKNIRENARENARENARENVRENVRENVRENA----------RENARE 715 Query: 74 KMREKARQSYRKLYSKD 90 +RE ++ K SKD Sbjct: 716 NVRENEKKYAHKNDSKD 732 >gi|259533520|sp|B0KWH8|THOC2_CALJA RecName: Full=THO complex subunit 2; Short=Tho2 gi|166832197|gb|ABY90124.1| THO complex 2 isoform 1 (predicted) [Callithrix jacchus] Length = 1600 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 31/84 (36%), Gaps = 2/84 (2%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 + K E ++ K++++ RE + K++ R++ D ++ + ++ Sbjct: 1449 KSKEREMDKKDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLIKRRKEENGTMG 1508 Query: 87 YSKDSWIAPEEP--MGMTKAEIEA 108 SK +P E E Sbjct: 1509 VSKHKSESPCESPYPNEKDKEKNK 1532 >gi|67521858|ref|XP_658990.1| hypothetical protein AN1386.2 [Aspergillus nidulans FGSC A4] gi|40746060|gb|EAA65216.1| hypothetical protein AN1386.2 [Aspergillus nidulans FGSC A4] gi|259488276|tpe|CBF87598.1| TPA: IQ calmodulin-binding motif domain protein (AFU_orthologue; AFUA_1G08920) [Aspergillus nidulans FGSC A4] Length = 960 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 14/75 (18%), Positives = 28/75 (37%), Gaps = 4/75 (5%) Query: 2 GRKVLTPEERMLCRRE--YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQY--YLKNK 57 R+ L E C E ++ NR+K R ++ +K+ E+ + Y + Sbjct: 114 ARERLLAEITAKCEEEVRRAKKKAEDNREKKAAEHARLRIEMAEKLAEAEKRRLLYQQTP 173 Query: 58 DKYREYKRRYYLKNR 72 ++R K + Sbjct: 174 RRHRTSSSTLEEKRK 188 >gi|6979936|gb|AAF34661.1|AF221715_1 split ends long isoform [Drosophila melanogaster] Length = 5554 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 8/107 (7%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK---NKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + + ++ K+ + +R R K +KD + + + K+ +RE +R Sbjct: 1992 KEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQRERE-- 2049 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 REK + ++ + + + E + L A Sbjct: 2050 ---HREKEQSRRAMDVEQEGRGGRMRELSSYQKSKMDIAGEASSLTA 2093 >gi|6467825|gb|AAF13218.1|AF188205_1 Spen RNP motif protein long isoform [Drosophila melanogaster] Length = 5533 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 8/107 (7%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK---NKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + + ++ K+ + +R R K +KD + + + K+ +RE +R Sbjct: 1998 KEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQRERE-- 2055 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 REK + ++ + + + E + L A Sbjct: 2056 ---HREKEQSRRAMDVEQEGRGGRMRELSSYQKSKMDIAGEASSLTA 2099 >gi|24580583|ref|NP_722616.1| split ends, isoform C [Drosophila melanogaster] gi|6715140|gb|AAF26299.1|AF184612_1 split ends [Drosophila melanogaster] gi|22945598|gb|AAN10511.1| split ends, isoform C [Drosophila melanogaster] Length = 5476 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 8/107 (7%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK---NKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + + ++ K+ + +R R K +KD + + + K+ +RE +R Sbjct: 1941 KEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQRERE-- 1998 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 REK + ++ + + + E + L A Sbjct: 1999 ---HREKEQSRRAMDVEQEGRGGRMRELSSYQKSKMDIAGEASSLTA 2042 >gi|24580581|ref|NP_524718.2| split ends, isoform B [Drosophila melanogaster] gi|10727420|gb|AAF51534.2| split ends, isoform B [Drosophila melanogaster] Length = 5533 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 8/107 (7%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK---NKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + + ++ K+ + +R R K +KD + + + K+ +RE +R Sbjct: 1998 KEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQRERE-- 2055 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 REK + ++ + + + E + L A Sbjct: 2056 ---HREKEQSRRAMDVEQEGRGGRMRELSSYQKSKMDIAGEASSLTA 2099 >gi|24580579|ref|NP_722615.1| split ends, isoform A [Drosophila melanogaster] gi|46397733|sp|Q8SX83|SPEN_DROME RecName: Full=Protein split ends gi|10727421|gb|AAF51535.2| split ends, isoform A [Drosophila melanogaster] Length = 5560 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 8/107 (7%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK---NKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 + + ++ K+ + +R R K +KD + + + K+ +RE +R Sbjct: 1998 KEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQRERE-- 2055 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 REK + ++ + + + E + L A Sbjct: 2056 ---HREKEQSRRAMDVEQEGRGGRMRELSSYQKSKMDIAGEASSLTA 2099 >gi|325093724|gb|EGC47034.1| conserved hypothetical protein [Ajellomyces capsulatus H88] Length = 1137 Score = 34.6 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 48/121 (39%), Gaps = 3/121 (2%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + L ++ R E ++ K +D+ + + I+E +K RE Sbjct: 905 EALETSKKSSERLERQKMEIAKLKDEKAILAKELDEA-RTTIKEGGGTDAE--LEKTREE 961 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 RR +N R ++ + Y++ + + E++ LE E+A LKAK + Sbjct: 962 VRRLTKENASLQRTVQQERSQAEYTRQQYQNASTSAAQSAMELQQLEEELANLKAKAAGD 1021 Query: 124 L 124 L Sbjct: 1022 L 1022 >gi|156358477|ref|XP_001624545.1| predicted protein [Nematostella vectensis] gi|156211332|gb|EDO32445.1| predicted protein [Nematostella vectensis] Length = 1269 Score = 34.6 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 34/77 (44%), Gaps = 10/77 (12%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNK----------DKYREYKRRYYLKNRDK 74 +N+++ + R K +D IR+ + + K +K ++ R+ N++K Sbjct: 141 RNKEEQELKDRMENEKREDTIRKQHRLQWEKKARAEEIAKEQEEKLKDDIRKSKEYNKEK 200 Query: 75 MREKARQSYRKLYSKDS 91 +RE + ++ ++ Sbjct: 201 IREMQNELFKHNEKQER 217 Score = 33.5 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 29/63 (46%), Gaps = 1/63 (1%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 ++ + +K+ + R+ NK+KIRE ++ + N +K K ++ ++ Sbjct: 171 KKARAEEIAKEQEEKLKDDIRKSKEYNKEKIREMQNELFKHN-EKQERLKGVSSQEDIER 229 Query: 75 MRE 77 + E Sbjct: 230 IAE 232 >gi|300637986|gb|ADK26157.1| Ycf1 [Asimina angustifolia] Length = 584 Score = 34.6 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 52/122 (42%), Gaps = 6/122 (4%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 +ER L +R+ ++ + + L ++ + L K+K + + + K++ + ++ Sbjct: 275 KAQERYLRQRKKEKERLNQKEKEQLNQKEKEQLNKKEKEQLNKKE-----KERLNQKEKE 329 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIY 126 L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 330 -RLNKKEKERLNKKEKEQLNKKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKELRK 388 Query: 127 KR 128 +R Sbjct: 389 RR 390 >gi|296399395|gb|ADH10517.1| transmembrane channel-like 2 [Zonotrichia albicollis] Length = 903 Score = 34.6 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 16/103 (15%) Query: 33 RRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 RR+ KN++ + K++ K K + +N++K + R + K Sbjct: 56 RRKNYQRKNRNGKKAREEGSQKKDRGKTAPNKDQKESENKEK-KTTKRGKEGENKQKKGR 114 Query: 93 IA---------------PEEPMGMTKAEIEALEREIARLKAKP 120 EE + M K ++E ++ I+ ++ KP Sbjct: 115 KNSTSSASSNSSDEESLSEEELAMLKEQVEQKKKLISTMRNKP 157 >gi|160693744|gb|ABX46573.1| non-structural polyprotein [Turkey astrovirus] Length = 1125 Score = 34.6 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 33/85 (38%), Gaps = 6/85 (7%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLY 87 +K E + NK I+E + NK R+ KNR+KM+E + + Sbjct: 136 EKSREAKENLEKANK--IQEQLDKVIESNK----ALHRKIQEKNREKMQEYMVRLHNTQK 189 Query: 88 SKDSWIAPEEPMGMTKAEIEALERE 112 +D W+ + ++ +E Sbjct: 190 DRDDWVQRCSRLEQENVNLQKRLKE 214 >gi|328770470|gb|EGF80512.1| hypothetical protein BATDEDRAFT_25153 [Batrachochytrium dendrobatidis JAM81] Length = 577 Score = 34.6 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 28/62 (45%), Gaps = 1/62 (1%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 ++ + C + + Y + ++++ + R Y +NK+ R Y Y N D R ++Y Sbjct: 517 KDYLQCAKNWD-LYSEQQQEQLKKWYRNYYTQNKESKRLRYRNNYANNSDIERGRLQKYK 575 Query: 69 LK 70 Sbjct: 576 KA 577 >gi|324499828|gb|ADY39936.1| Ubiquitin carboxyl-terminal hydrolase 7 [Ascaris suum] Length = 1862 Score = 34.6 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 18/123 (14%), Positives = 51/123 (41%), Gaps = 14/123 (11%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRR-RYLKNKDKIRESYHQYYLKNKDKYRE 62 ++ E++ + + ++K + ++ KN+ K +E + + ++ ++ RE Sbjct: 1209 RLKKSEQKHANTMKQIQAQLQAEKEKHEKTVQQLNQKKNELKKQEKVLKQHERDAERCRE 1268 Query: 63 YK----------RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + KN D++R+ R+ S+D+ + + +AE L + Sbjct: 1269 LQDKVDYLTKELQEMRQKNADEVRKLQRELQ---SSQDTRKNMADEIKGREAEQAELRNQ 1325 Query: 113 IAR 115 ++ Sbjct: 1326 LSE 1328 >gi|46116454|ref|XP_384245.1| hypothetical protein FG04069.1 [Gibberella zeae PH-1] Length = 1186 Score = 34.6 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 13/109 (11%), Positives = 43/109 (39%), Gaps = 1/109 (0%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + E ++E +++ + R + R+RR + ++K E + + + + Sbjct: 595 EQKARAEEKRKQKEAQKKAEEEARLRKEAERQRRLHEQREKQAEQERKARETKEKEKKLK 654 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + ++++ +K R++ + ++ +E + RE Sbjct: 655 EEQ-RQRDKEAREQKEREAQERKDKQERDKRDKEARAAKAQKDAQAARE 702 >gi|171920546|ref|ZP_02931818.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|188024348|ref|ZP_02997020.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|188518442|ref|ZP_03003930.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 11 str. ATCC 33695] gi|188524201|ref|ZP_03004263.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|195867391|ref|ZP_03079395.1| ribose/galactose ABC transporter [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|198273540|ref|ZP_03206076.1| ribose/galactose ABC transporter [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|225551229|ref|ZP_03772175.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 8 str. ATCC 27618] gi|171903243|gb|EDT49532.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|188018614|gb|EDU56654.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|188998128|gb|EDU67225.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 11 str. ATCC 33695] gi|195659796|gb|EDX53176.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|195660867|gb|EDX54120.1| ribose/galactose ABC transporter [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|198250060|gb|EDY74840.1| ribose/galactose ABC transporter [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|225379044|gb|EEH01409.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 8 str. ATCC 27618] Length = 736 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 34/96 (35%), Gaps = 9/96 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y R Y KN + I E + NK +QY + K+ +E + KN+ + Sbjct: 379 KDYYRDYENKNNNTISEYKAIYIEINK-----KRNQYLQELKELRKELDK---FKNQKQY 430 Query: 76 REKARQS-YRKLYSKDSWIAPEEPMGMTKAEIEALE 110 Q Y Y K ++ + E Sbjct: 431 NGVNEQYQYTVNYIKTTYNNWLKVEYSKYLEQMKNN 466 >gi|150866807|ref|XP_001386530.2| hypothetical protein PICST_33868 [Scheffersomyces stipitis CBS 6054] gi|149388063|gb|ABN68501.2| predicted protein [Scheffersomyces stipitis CBS 6054] Length = 1403 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 6/77 (7%) Query: 21 RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNK------DKYREYKRRYYLKNRDK 74 Y K R + ER++R +N ++ +++ + + K + K+ + + +K Sbjct: 1057 EYREKIRSENRERKKRWREENAERNKDNDLRSRVLKKATSLFGEAESAEKKAWADEEFNK 1116 Query: 75 MREKARQSYRKLYSKDS 91 R+K K S Sbjct: 1117 RRQKRLTRKLKGSETKS 1133 >gi|225550769|ref|ZP_03771718.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 2 str. ATCC 27814] gi|225379923|gb|EEH02285.1| ABC transporter, permease protein [Ureaplasma urealyticum serovar 2 str. ATCC 27814] Length = 736 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 34/96 (35%), Gaps = 9/96 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y R Y KN + I E + NK +QY + K+ +E + KN+ + Sbjct: 379 KDYYRDYENKNNNTISEYKAIYIEINK-----KRNQYLQELKELRKELDK---FKNQKQY 430 Query: 76 REKARQS-YRKLYSKDSWIAPEEPMGMTKAEIEALE 110 Q Y Y K ++ + E Sbjct: 431 NGVNEQYQYTVNYIKTTYNNWLKVEYSKYLEQMKNN 466 >gi|218261146|ref|ZP_03476084.1| hypothetical protein PRABACTJOHN_01748 [Parabacteroides johnsonii DSM 18315] gi|218224207|gb|EEC96857.1| hypothetical protein PRABACTJOHN_01748 [Parabacteroides johnsonii DSM 18315] Length = 825 Score = 34.6 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 46/120 (38%), Gaps = 13/120 (10%) Query: 18 YKRRYYLKNRDKILERRRR------RYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 +RY+ R I ++ ++ RY ++ + + + + K+ E +R N Sbjct: 540 RDKRYWESKRQNIRQQEKKLEDITARYEQDLEAVNKQRKEIMRNAKE---EAQRILSEAN 596 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE---IARLKAKPIEELIYKR 128 K+ R+ K+ + + KA + A E E IAR AK E K+ Sbjct: 597 A-KIENTIREIKEAQAEKEQTKLARKALEEFKASVIAAEEEDDKIARKMAKLQERKERKK 655 >gi|324499806|gb|ADY39927.1| Ubiquitin carboxyl-terminal hydrolase 7 [Ascaris suum] Length = 1904 Score = 34.6 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 18/123 (14%), Positives = 51/123 (41%), Gaps = 14/123 (11%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRR-RYLKNKDKIRESYHQYYLKNKDKYRE 62 ++ E++ + + ++K + ++ KN+ K +E + + ++ ++ RE Sbjct: 1251 RLKKSEQKHANTMKQIQAQLQAEKEKHEKTVQQLNQKKNELKKQEKVLKQHERDAERCRE 1310 Query: 63 YK----------RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + KN D++R+ R+ S+D+ + + +AE L + Sbjct: 1311 LQDKVDYLTKELQEMRQKNADEVRKLQRELQ---SSQDTRKNMADEIKGREAEQAELRNQ 1367 Query: 113 IAR 115 ++ Sbjct: 1368 LSE 1370 >gi|148673022|gb|EDL04969.1| mCG1028802 [Mus musculus] Length = 309 Score = 34.6 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 39/89 (43%), Gaps = 7/89 (7%) Query: 43 DKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDS-----WIAPE- 96 +K E Y L N + YK Y KN +++R+ ++ ++ + D W+ E Sbjct: 183 EKKTEQSMSYPLHNSSQIIAYKNAKYHKNINEIRQFYQKQHQNWHVIDGFHSKWWVWNEV 242 Query: 97 -EPMGMTKAEIEALEREIARLKAKPIEEL 124 + + I+ ++I KA I++L Sbjct: 243 LKDIQQMNKYIQTYMKKIKEGKAACIDKL 271 >gi|39953989|ref|XP_364038.1| hypothetical protein MGG_08883 [Magnaporthe oryzae 70-15] gi|145021174|gb|EDK05303.1| hypothetical protein MGG_08883 [Magnaporthe oryzae 70-15] Length = 746 Score = 34.6 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 44/118 (37%), Gaps = 8/118 (6%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 T E+ L + +RR + KI ER + +K+K E + +K + Sbjct: 584 TQNEKELRKLHDRRRRVHEKIAKIQERSSTKRNGDKEK-DEREMAKLKEKHEKELARQEE 642 Query: 67 YYLKNRDKMREKA-------RQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 Y + K+ E+ Q +K ++ + KAE + ++I LK Sbjct: 643 KYQRELRKIEERRQHEERKAEQRRKKQVEREEKANIALELDRVKAERDVARKQIDILK 700 >gi|307176865|gb|EFN66209.1| Vesicular glutamate transporter 2 [Camponotus floridanus] Length = 848 Score = 34.6 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 45/118 (38%), Gaps = 15/118 (12%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYL--KNKDKYREYKRRYYLKNRDKM- 75 K ++ K +DK + +++R + ++ E+ + Y KN D ++ + K + + Sbjct: 553 KDKHQSKGKDKHSKEQKKRKSSSLERNVETPSEKYSEAKNSDNVKQKLKNKRNKREENIV 612 Query: 76 -------REKARQSYRKLYSKDSWIAPEEPMGM-----TKAEIEALEREIARLKAKPI 121 K + Y K +K + + +K E + EI + K + Sbjct: 613 ISEGASKNSKDKPKYSKDKTKTETQNKRDKIVKEQSRESKKHFETSDSEIRQKSEKSL 670 >gi|301774030|ref|XP_002922417.1| PREDICTED: protein FAM186A-like [Ailuropoda melanoleuca] Length = 2258 Score = 34.6 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 50/111 (45%), Gaps = 4/111 (3%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMRE 77 +K+ Y + N+ + + +L+ K+ + + + K+ ++E +++ K ++ E Sbjct: 852 WKKSYEMTNK---KLQEKESWLQTKEGKQGQQKRKQWQEKEMWKEQQKQRMQKQTEQDEE 908 Query: 78 KARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 + QS + Y K E + ++ LE+E + K +E+ ++ Sbjct: 909 QM-QSEEEDYQKPKQQQLEAWNKKMEKQVVPLEKEKGQQKRHVQKEVRHQE 958 Score = 33.5 bits (75), Expect = 10.0, Method: Composition-based stats. Identities = 16/133 (12%), Positives = 48/133 (36%), Gaps = 30/133 (22%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQY-------YLKNKDKYR--------------- 61 +++K +RR + ++KI ++ + K+ + Sbjct: 816 RDHKEKEKQRREEHLQEGQEKIFGKGLKHQLQEGSFWKKSYEMTNKKLQEKESWLQTKEG 875 Query: 62 ----EYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALE--REIAR 115 + ++++ ++ +E+ +Q +K +D E K + + LE + Sbjct: 876 KQGQQKRKQWQE--KEMWKEQQKQRMQKQTEQDEEQMQSEEEDYQKPKQQQLEAWNKKME 933 Query: 116 LKAKPIEELIYKR 128 + P+E+ ++ Sbjct: 934 KQVVPLEKEKGQQ 946 >gi|302497699|ref|XP_003010849.1| peroxisomal membrane protein receptor Pex19, putative [Arthroderma benhamiae CBS 112371] gi|291174394|gb|EFE30209.1| peroxisomal membrane protein receptor Pex19, putative [Arthroderma benhamiae CBS 112371] Length = 451 Score = 34.6 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 32/97 (32%), Gaps = 7/97 (7%) Query: 38 YLKNKDKIRESYHQ-------YYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKD 90 L NK+ + E + + +NK+K E R Y + + R+ K +S + Sbjct: 264 QLSNKEMLYEPLKELDDKFGPWLRENKEKIGEEDRERYELQASLVSDIVRKFEEKNFSDE 323 Query: 91 SWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 + A + A ++E + Sbjct: 324 KPEDRAYIWEKMQKMQAAGSPPDDLISAPFMDESKLQ 360 >gi|255552107|ref|XP_002517098.1| Vicilin GC72-A precursor, putative [Ricinus communis] gi|223543733|gb|EEF45261.1| Vicilin GC72-A precursor, putative [Ricinus communis] Length = 613 Score = 34.6 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 48/118 (40%), Gaps = 10/118 (8%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 R++ ++R C E + + ++K ER+ R + +++K + QY ++ E Sbjct: 99 RELQKCQQR--CEEEPIKE-REREQEKGQERQPREHELDREKSDDPRKQY-----ERCLE 150 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 + R K + K R + + W E G +E E REI + K P Sbjct: 151 ICE--RQEGRQKQQCKRRCYTQYEEQQKEWEEREHGGGGGNSETETRSREIEQHKNNP 206 >gi|300638012|gb|ADK26170.1| Ycf1 [Asimina reticulata] Length = 561 Score = 34.6 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 50/122 (40%), Gaps = 6/122 (4%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 +ER L +R+ ++ K + L ++ + L K+K + + + NK K + Sbjct: 281 KAQERYLRQRKKEKERLNKKEKEQLNKKEKEQLNQKEKEQLNQKEKEQLNK------KEK 334 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIY 126 L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 335 ERLNKKEKERLNKKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKELRK 394 Query: 127 KR 128 +R Sbjct: 395 RR 396 >gi|326916345|ref|XP_003204468.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 3-like [Meleagris gallopavo] Length = 2064 Score = 34.6 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 7/109 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQY---YLKNKDKYREYK-RRYYLK 70 ++ +R Y + K + N +K E + +N++K RE K R + Sbjct: 1933 DQQRQRFYSDSHHQKKERHEKEW---NNEKYWEQDSERNRRRDRNQEKERERKSREEGHR 1989 Query: 71 NRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 +++++R S PE+ K E +A E++ R K+K Sbjct: 1990 DKERLRLPHSDRGADGKSPRETRNPEKKTEKPKPEEQAQEKDKEREKSK 2038 >gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans] gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans] Length = 2944 Score = 34.6 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 48/120 (40%), Gaps = 7/120 (5%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + LKN+ E Sbjct: 1069 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNS-KLEQQRKDKELKNQQAVEER 1121 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 +RR + +M E R+ + K + + + + EI +L +P E+ Sbjct: 1122 ERRRQHMSLIRMLELRRKFEDREKKKHQLVLDRLLLRERRMAERKRDAEILQLIRRPNED 1181 >gi|307172721|gb|EFN64027.1| Myosin light chain kinase, smooth muscle [Camponotus floridanus] Length = 7605 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 19/101 (18%), Positives = 42/101 (41%), Gaps = 6/101 (5%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNK-DKIRESYHQYYLKNKDKYR 61 RK + +E+ K + ++K+ E + KN+ +K+++ K + K + Sbjct: 4235 RKKVQAKEKEKADEIKKEQELKDIKEKVKEATEQYDKKNEIEKLQKKETDEIDKTQQKEQ 4294 Query: 62 EYKRRYY-----LKNRDKMREKARQSYRKLYSKDSWIAPEE 97 E+ R ++ +++E +Q K KD +E Sbjct: 4295 EHVRIEKTEIVEKDHKIEVKEIKKQEEDKTIKKDGKEQEKE 4335 Score = 34.3 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 52/109 (47%), Gaps = 10/109 (9%) Query: 19 KRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNK-DKYREYKRRYYLKNRDKMR 76 +++ K ++K E ++ + LK+ K+K++E+ QY KN+ +K ++ + K + K + Sbjct: 4235 RKKVQAKEKEKADEIKKEQELKDIKEKVKEATEQYDKKNEIEKLQKKETDEIDKTQQKEQ 4294 Query: 77 EKARQSYRKLYSKDSWIAPEEP--------MGMTKAEIEALEREIARLK 117 E R ++ KD I +E + E E + + R+K Sbjct: 4295 EHVRIEKTEIVEKDHKIEVKEIKKQEEDKTIKKDGKEQEKEDEKAKRIK 4343 >gi|221058090|ref|XP_002261553.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] gi|194247558|emb|CAQ40958.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] Length = 2918 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 39/119 (32%), Gaps = 11/119 (9%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKY 60 G +L E+ +E + ++K + KN D+ Y +N K Sbjct: 2538 GGDILRDEKGAEAAKEKNYEDEGEAKEKNYDDEGAAKEKNYDDEGAAKEKNYDDENAAKE 2597 Query: 61 REY-------KRRYYLKNRDKMREKARQS--YRKLYSKDSWIAPEEPMGMTKAEIEALE 110 + Y ++ Y +N K + + K Y D A E+ M + + Sbjct: 2598 KNYDDEDAAKEKNYDDENAAKEKNYDDEDAAKEKNYD-DENAAKEKNYDMKTRQRKEKH 2655 >gi|194893986|ref|XP_001977982.1| GG19345 [Drosophila erecta] gi|190649631|gb|EDV46909.1| GG19345 [Drosophila erecta] Length = 1507 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 30/78 (38%), Gaps = 4/78 (5%) Query: 8 PEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRES--YHQYYLKNKDKYREYKR 65 +ER R +YK+ Y R + + KD + +YY ++ + Y + Sbjct: 1379 SKERY--RPKYKQEYNSNERKQRRQNEEHDRRNRKDDHSKEHGKKKYYDQSSNNRGRYNK 1436 Query: 66 RYYLKNRDKMREKARQSY 83 N ++ EK ++ Sbjct: 1437 NSKKCNEEQSNEKCKERN 1454 >gi|169865260|ref|XP_001839233.1| hypothetical protein CC1G_12586 [Coprinopsis cinerea okayama7#130] gi|116499699|gb|EAU82594.1| hypothetical protein CC1G_12586 [Coprinopsis cinerea okayama7#130] Length = 215 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 12/36 (33%), Positives = 19/36 (52%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 +++ E+ RR Y KNKD I + Y NK ++ Sbjct: 9 KEQKREKNRRSYHKNKDSINARRRERYKDNKGATKQ 44 >gi|154492823|ref|ZP_02032449.1| hypothetical protein PARMER_02462 [Parabacteroides merdae ATCC 43184] gi|154087128|gb|EDN86173.1| hypothetical protein PARMER_02462 [Parabacteroides merdae ATCC 43184] Length = 825 Score = 34.6 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 46/120 (38%), Gaps = 13/120 (10%) Query: 18 YKRRYYLKNRDKILERRRR------RYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 +RY+ R I ++ ++ RY ++ + + + + K+ E +R N Sbjct: 540 RDKRYWESKRQNIRQQEKKLEDITARYEQDLEAVNKQRKEIMRNAKE---EAQRILSEAN 596 Query: 72 RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE---IARLKAKPIEELIYKR 128 K+ R+ K+ + + KA + A E E IAR AK E K+ Sbjct: 597 A-KIENTIREIKEAQAEKEQTKLARKALEEFKASVIAAEEEDGKIARKMAKLQERKERKK 655 >gi|326519276|dbj|BAJ96637.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 708 Score = 34.6 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 41/104 (39%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +E K++ LK + R +++ + + + +N + Y + R YL+ +K+ Sbjct: 397 KEKKKQILLKTTTEGKLRAKQKRAEEAALREKQEEERRTENPELYFDELRARYLELSEKV 456 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 ++ RQ + + E + + E L A + K Sbjct: 457 DQRKRQKVNGNNNSSGAVGRGERLNAVQKERMRLLTTAAFDRGK 500 >gi|194853276|ref|XP_001968134.1| GG24702 [Drosophila erecta] gi|190660001|gb|EDV57193.1| GG24702 [Drosophila erecta] Length = 5592 Score = 34.6 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 14/104 (13%), Positives = 35/104 (33%), Gaps = 7/104 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 + + ++ K+ + +R R K+ + + + K+ +RE +R Sbjct: 2011 KEQKEKEIREKDLREKEQRERDNRE--KELREKEMREKEQREKELHREKDQRERE----- 2063 Query: 75 MREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKA 118 REK + + + + + E++ L A Sbjct: 2064 HREKEQSRRAMDVELEGRAGRMRELSSYQKSKMDITSEVSSLTA 2107 >gi|300638004|gb|ADK26166.1| Ycf1 [Asimina triloba] Length = 576 Score = 34.6 bits (78), Expect = 5.1, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 47/113 (41%), Gaps = 5/113 (4%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYY----LKNKDKYREYKRRYYLKNRDKM 75 + Y + N K ER R+ K K+++ + + K K++ + ++ L ++K Sbjct: 271 KDYEIFNFTKAQERYLRQRKKEKERLNQKEKEEKERLNQKEKEQLNQKEKE-RLNQKEKE 329 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 R ++ + + + + + + LE++ A I+EL +R Sbjct: 330 RLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKELRKRR 382 >gi|302797228|ref|XP_002980375.1| hypothetical protein SELMODRAFT_419836 [Selaginella moellendorffii] gi|300151991|gb|EFJ18635.1| hypothetical protein SELMODRAFT_419836 [Selaginella moellendorffii] Length = 1354 Score = 34.6 bits (78), Expect = 5.1, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 43/125 (34%), Gaps = 11/125 (8%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 KV R + + K + +N++K L + + ++ N++ Sbjct: 1023 KVKLENARRVAALKQKIQDLKENKEKTLSTYKAELAAAMAEHEAQSKKFAKLNQETEERA 1082 Query: 64 KRRYY-------LKN----RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + +N + + E RQS + M KAE+EA E Sbjct: 1083 RANWKLACEAVAAENMAIAKAAVEEYERQSKFVHEENMRRASQIAAMLKKKAEVEAQNEE 1142 Query: 113 IARLK 117 + R+K Sbjct: 1143 VTRVK 1147 >gi|296258175|gb|ADH04224.1| vitellogenin 1a [Culex tarsalis] Length = 2117 Score = 34.3 bits (77), Expect = 5.1, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 1/43 (2%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 K + + Y + DK E + +YY + + E ++Y +N Sbjct: 1005 KSADYYYKYYQQFPDKKSEYFEKYYKAH-ESQNEMNKKYNEQN 1046 >gi|312903890|ref|ZP_07763062.1| Cna protein B-type domain protein [Enterococcus faecalis TX0635] gi|310632834|gb|EFQ16117.1| Cna protein B-type domain protein [Enterococcus faecalis TX0635] Length = 814 Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 10/84 (11%) Query: 16 REYKRRY--YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 +EY + Y Y K+ D+ E+++ D +E ++ +N +K Y++ +D Sbjct: 40 KEYNKEYAQYEKDLDEYNEKKQEH-----DDAQEEANRIEKENAEKKAAYEKVLEQYQKD 94 Query: 74 KMREKARQSYRKL--YSKDSWIAP 95 + E ++ SK +I P Sbjct: 95 -LEEYKKKQDEINDHSSKGQYIGP 117 >gi|239977075|sp|A8DZJ1|BAZ1B_XENLA RecName: Full=Tyrosine-protein kinase BAZ1B; AltName: Full=Bromodomain adjacent to zinc finger domain protein 1B; AltName: Full=Williams syndrome transcription factor homolog Length = 1441 Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 30/55 (54%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 E+++++ + ++ E R++ K +I+E + K +++ E ++RY ++ Sbjct: 504 EHRKQWAVMTEEQREEYMRKKREALKARIKEKTRERKQKEREERLEKQKRYEDQD 558 >gi|268565963|ref|XP_002639597.1| C. briggsae CBR-POP-1 protein [Caenorhabditis briggsae] Length = 428 Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 33/69 (47%), Gaps = 4/69 (5%) Query: 22 YYLKNRDKILERRRRRYLKNKDKIRESYHQYY---LKNKDKYREYKRRYYLKNRDKMRE- 77 Y +NR K+LE ++ + +E +++ + + KY E ++ +++K + Sbjct: 200 YMKENRPKLLEEVGNDQKQSAELNKELGKRWHDLPKEEQQKYFEMAKKDRESHKEKYPQW 259 Query: 78 KARQSYRKL 86 AR++Y Sbjct: 260 SARENYAVN 268 >gi|190344510|gb|EDK36194.2| hypothetical protein PGUG_00292 [Meyerozyma guilliermondii ATCC 6260] Length = 819 Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 12/90 (13%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNK------DKYREYKRRYYLK 70 EYK R ++NR+ R++R +N ++ R++ + + + D KR + + Sbjct: 551 EYKERIRIENRE----RKKRWREENAERNRDNDLRSRVLKRASLMFGDNDSAEKRAWAEE 606 Query: 71 --NRDKMREKARQSYRKLYSKDSWIAPEEP 98 NR K++ A+Q +++ + P Sbjct: 607 EFNRRKVKRLAKQQKEDDKPRENRRPEKNP 636 >gi|331002249|ref|ZP_08325768.1| hypothetical protein HMPREF0491_00630 [Lachnospiraceae oral taxon 107 str. F0167] gi|330411343|gb|EGG90759.1| hypothetical protein HMPREF0491_00630 [Lachnospiraceae oral taxon 107 str. F0167] Length = 1719 Score = 34.3 bits (77), Expect = 5.3, Method: Composition-based stats. Identities = 9/84 (10%), Positives = 35/84 (41%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 +N+++ + + +N+++ + S +N+++ + ++ +NR++ + + Sbjct: 241 ENKEESKPSEKPKTPENREESKPSEKPKTPENREESKPSEKPKAPENREESKPSEKPKTP 300 Query: 85 KLYSKDSWIAPEEPMGMTKAEIEA 108 + + E+P E Sbjct: 301 ENKEESKPSESEKPKAPENKEESK 324 >gi|326920284|ref|XP_003206404.1| PREDICTED: inner centromere protein A-like [Meleagris gallopavo] Length = 634 Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 47/117 (40%), Gaps = 2/117 (1%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 +ER K++ + R K +E +RR + RE + L+ +++ + + Sbjct: 144 KERQRLESLRKKQEAEEQRKKKVEEEKRRRQAEMKQKREERLRKALQARERVEQMEEEKK 203 Query: 69 LKNRDKMREKARQSY--RKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 + K + + + K S ++ + E + +++I +LK ++E Sbjct: 204 KRMEQKFLQSDEKVRLSQVREEKGSEERSKKKLSKKPWESDVRKQKILKLKGDGLQE 260 >gi|290988211|ref|XP_002676815.1| FH2 domain-containing protein [Naegleria gruberi] gi|284090419|gb|EFC44071.1| FH2 domain-containing protein [Naegleria gruberi] Length = 1344 Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 53/121 (43%), Gaps = 6/121 (4%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 K+ T +E+ L R ++Y L N E + Y +I ++ + K K+ E Sbjct: 511 KIKTEKEKQLADRVKTQQYQLNNLQSERESILQEYETKIVEISKTLEE-IEKKKNSLSEE 569 Query: 64 KRRYYLKNRDKMREKARQSYRKLY-SKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIE 122 K ++ + + E+ Q+ +L ++ +E KAE+ L +EI +LK + Sbjct: 570 KEKFLKE----IEEQTAQTRSELSETEKEKATQQEKHSHIKAELTQLHQEIEKLKNEVFS 625 Query: 123 E 123 + Sbjct: 626 K 626 >gi|256070881|ref|XP_002571770.1| arylformamidase [Schistosoma mansoni] gi|238656918|emb|CAZ28000.1| arylformamidase [Schistosoma mansoni] Length = 967 Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 28/67 (41%), Gaps = 3/67 (4%) Query: 30 ILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSK 89 + + R N RE + + ++ RE +R++ NR K+ K R+ + K Sbjct: 57 LEAKERYNLNYNITFSRERNEEIWQ---EREREAQRQWEENNRQKILLKQREEETQRKIK 113 Query: 90 DSWIAPE 96 W + + Sbjct: 114 AEWESKQ 120 >gi|325119819|emb|CBZ55372.1| unnamed protein product [Neospora caninum Liverpool] Length = 945 Score = 34.3 bits (77), Expect = 5.5, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 41/89 (46%), Gaps = 1/89 (1%) Query: 31 LERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKD 90 E R+R +++++ ++ + + K RE + R + K+ E+ R+S R+ KD Sbjct: 823 QEAERQRVAEDEERKKKEAQRKAEEEAAKLREEEARKRDEEAAKLAEQ-RESLRRAEQKD 881 Query: 91 SWIAPEEPMGMTKAEIEALEREIARLKAK 119 + KA+ + + +A+ + K Sbjct: 882 QQAKQKLTESAEKAQNQKKIQAVAQKEGK 910 >gi|25145616|ref|NP_500551.2| hypothetical protein F55F10.1 [Caenorhabditis elegans] gi|20198837|gb|AAC17540.2| Hypothetical protein F55F10.1 [Caenorhabditis elegans] Length = 4368 Score = 34.3 bits (77), Expect = 5.5, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 26/61 (42%), Gaps = 3/61 (4%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKD---KYREYKRRYYLKN 71 R +++Y KN K ++ K +++ E + + N + + ++ R+ KN Sbjct: 2679 TRAQWKKWYEKNIAKAEQKDFVYRTKTEEEKDEMDLEEFFANHEQDHQVQKKNRKKCSKN 2738 Query: 72 R 72 Sbjct: 2739 P 2739 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 25/74 (33%), Gaps = 3/74 (4%) Query: 46 RESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 R + ++Y KN K + Y K ++ E + + + +D + + K Sbjct: 2680 RAQWKKWYEKNIAKAEQKDFVYRTKTEEEKDEMDLEEFFANHEQDHQVQKK---NRKKCS 2736 Query: 106 IEALEREIARLKAK 119 +K K Sbjct: 2737 KNPSNYANILMKIK 2750 >gi|123966319|ref|YP_001011400.1| aminopeptidase N [Prochlorococcus marinus str. MIT 9515] gi|123200685|gb|ABM72293.1| probable aminopeptidase N [Prochlorococcus marinus str. MIT 9515] Length = 869 Score = 34.3 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 32/76 (42%), Gaps = 1/76 (1%) Query: 39 LKNKDKIRESYHQYYLKNKDKYREYKRRY-YLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 NK+ E + +Y +N ++ R++ K K++ + +K Y + + Sbjct: 436 ENNKEINIEKFKTWYKQNGTPLVKFNRKWDREKEILKIKVSQINANKKNYYNERPLIIPI 495 Query: 98 PMGMTKAEIEALEREI 113 +G+ + E + + I Sbjct: 496 NIGILLSSYEKINKTI 511 >gi|118088827|ref|XP_426199.2| PREDICTED: similar to KIAA0244 [Gallus gallus] Length = 2065 Score = 34.3 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 7/109 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQY---YLKNKDKYREYK-RRYYLK 70 ++ +R Y + K + N +K E + +N++K RE K R + Sbjct: 1934 DQQRQRFYSDSHHQKKERHEKDW---NNEKYWEQDSERNRRRDRNQEKERERKSREEGHR 1990 Query: 71 NRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 +++++R S PE+ K E +A E++ R K+K Sbjct: 1991 DKERLRLPHSDRSADGKSPRETRNPEKKTEKPKPEEQAQEKDKEREKSK 2039 >gi|296258177|gb|ADH04225.1| vitellogenin 1b [Culex tarsalis] Length = 2119 Score = 34.3 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 1/43 (2%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 K + + Y + DK E + +YY + + E ++Y +N Sbjct: 1007 KSADYYYKYYQQFPDKKSEYFEKYYKAH-ESQNEMNKKYNEQN 1048 >gi|294954863|ref|XP_002788332.1| hypothetical protein Pmar_PMAR026718 [Perkinsus marinus ATCC 50983] gi|239903644|gb|EER20128.1| hypothetical protein Pmar_PMAR026718 [Perkinsus marinus ATCC 50983] Length = 1274 Score = 34.3 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 48/117 (41%), Gaps = 6/117 (5%) Query: 11 RMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK 70 + L E +R + E ++R+ + IR++ + + + R+ + + Sbjct: 1062 KRLAAAERIATETEASRQALGEEAKKRHE---EAIRKNDRL-RKEAWEAIKLRNRK-HEE 1116 Query: 71 NRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 + DK+RE+A ++ + + + E + A E RE + ++ +Y+ Sbjct: 1117 DIDKVRERAAKARLEHHQRVK-EMYERKISRKDAAGEKAAREAFVARQLRLKREMYE 1172 >gi|323302933|gb|EGA56737.1| Mrm1p [Saccharomyces cerevisiae FostersB] Length = 412 Score = 34.3 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 36/110 (32%), Gaps = 6/110 (5%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL-KNRDKMR 76 K+++ N DK +R+ + K + + + KN+ K+ Sbjct: 45 RKKQWETLNEDKASWFKRKY-----AHVHAREQDRAADPYGKKKAHVEKLKEIKNQAKLN 99 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIY 126 +K+ +S + + + P+ ++ + L+Y Sbjct: 100 QKSHKSKFQNKDIALKLMNDNPIFEYVYGTNSVYAALLNPSRNCHSRLLY 149 >gi|315160858|gb|EFU04875.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0645] Length = 1182 Score = 34.3 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 10/84 (11%) Query: 16 REYKRRY--YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 +EY + Y Y K+ D+ E+++ D +E ++ +N +K Y++ +D Sbjct: 40 KEYNKEYAQYEKDLDEYNEKKQEH-----DDAQEEANRIEKENAEKKAAYEKVLEQYQKD 94 Query: 74 KMREKARQSYRKL--YSKDSWIAP 95 + E ++ SK +I P Sbjct: 95 -LEEYKKKQDEINDHSSKGQYIGP 117 >gi|68059385|ref|XP_671679.1| hypothetical protein [Plasmodium berghei strain ANKA] gi|56488075|emb|CAI03998.1| hypothetical protein PB301478.00.0 [Plasmodium berghei] Length = 324 Score = 34.3 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 30/66 (45%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYS 88 K + + Y ++ K + S+ +Y ++NKDK E+ + Y N+ MR + K Sbjct: 241 KKMAYEKNNYKDDEQKKKISHDEYLIQNKDKLYEFYKEYQKNNKTHMRNDLPVMWAKNNF 300 Query: 89 KDSWIA 94 + I Sbjct: 301 QQFIIN 306 >gi|262274543|ref|ZP_06052354.1| electron transport complex protein RnfC [Grimontia hollisae CIP 101886] gi|262221106|gb|EEY72420.1| electron transport complex protein RnfC [Grimontia hollisae CIP 101886] Length = 690 Score = 34.3 bits (77), Expect = 5.8, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 27/58 (46%) Query: 66 RYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 +YY + + ++R + + +K+ + A + M K E R+ A + K +E+ Sbjct: 436 QYYRQAKSEIRTRNAEQQAAERAKERFEAKKARMERDKKAREERHRQAAENRRKAMEK 493 >gi|156364639|ref|XP_001626454.1| predicted protein [Nematostella vectensis] gi|156213330|gb|EDO34354.1| predicted protein [Nematostella vectensis] Length = 644 Score = 34.3 bits (77), Expect = 5.8, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 43/123 (34%), Gaps = 32/123 (26%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYH---------- 50 + RK ++ +R + YY KNR+K + + R +++ + Sbjct: 5 VSRKQAAALKKKEIKRAKDKDYYEKNRNKKIAQVIERRKARREETKGKSRTRTEIKKAAS 64 Query: 51 ----------------QYYLKNKDKYREYKRRYYLKNRD------KMREKARQSYRKLYS 88 + ++K RE RRY KN++ K+ + + ++ S Sbjct: 65 KKRRKEQRATAREELERKRQIAREKTRERVRRYRQKNKEEQGQEKKILQNTGERFKNRTS 124 Query: 89 KDS 91 K Sbjct: 125 KKR 127 >gi|239615284|gb|EEQ92271.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3] Length = 781 Score = 34.3 bits (77), Expect = 5.8, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 44/125 (35%), Gaps = 8/125 (6%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKIL------ERRRRRYLKNKDKIRESYHQYYLKN 56 R ER + R + ++K ER + + +++K + + Sbjct: 28 RADRNEHERTQEKERADREKRERAQEKERADRNEHERTQEKERADREKRERAQEKERADR 87 Query: 57 KDKYREYKRRYYLKN-RDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIAR 115 ++ R ++ +N R++ +EK R + + A + P E+ ++ Sbjct: 88 NERERAQEKERADRNERERAQEKERADREE-RGRMRLEADQSPTAFLVYLQHVKEKLLST 146 Query: 116 LKAKP 120 +P Sbjct: 147 FSVEP 151 >gi|229072238|ref|ZP_04205445.1| Spore germination protein gerIA [Bacillus cereus F65185] gi|228710895|gb|EEL62863.1| Spore germination protein gerIA [Bacillus cereus F65185] Length = 728 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 39/104 (37%), Gaps = 1/104 (0%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLK-NKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 R++ K N E+ NK++ R H N+ K ++ K D Sbjct: 7 RKKKKSNKPEANETDNQEQHSNNQEDDNKEQTRSMKHNKGKNNEQKDSSQNKQQSAKQED 66 Query: 74 KMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 ++K + + ++ S+D ++ + A + + ++ K Sbjct: 67 SSQDKQQNAKQEDSSQDKQQNAKQGDSSQDKQQNAKQEDSSQDK 110 >gi|238925261|ref|YP_002938778.1| DNA mismatch repair protein [Eubacterium rectale ATCC 33656] gi|259511157|sp|C4ZI07|MUTS2_EUBR3 RecName: Full=MutS2 protein gi|238876937|gb|ACR76644.1| DNA mismatch repair protein [Eubacterium rectale ATCC 33656] Length = 792 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 40/97 (41%), Gaps = 4/97 (4%) Query: 32 ERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD--KMREKARQSYRK-LYS 88 E+ + K +I + Q KN+ + + N + K+ + A++ K + + Sbjct: 533 EKEQLEINSYKAEIEKLKKQLEEKNERIDKSKDKILREANEEAYKILQDAKELADKTIRN 592 Query: 89 KDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELI 125 + + + PM + E AL R+ K K + ++ Sbjct: 593 FNKYGQGQAPMSQMEKERSAL-RDKMNDKEKKLSDIK 628 >gi|309371464|emb|CAP24992.3| CBR-EIF-3.B protein [Caenorhabditis briggsae AF16] Length = 716 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 11/90 (12%) Query: 1 MGRKVLTPEERMLCRREYKR----RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 +G ++ T + R LCR+ R ++ + K+ E+++ KN K + + + Sbjct: 577 LGYRIFTFQGRELCRKNLDRLAQFKWRPRPPVKLSEQKQNEIRKNLKKTAAKFVK--QDD 634 Query: 57 KDKYREYKRRYYLKNR-----DKMREKARQ 81 +K R + + + D +R + R+ Sbjct: 635 DEKCRASQEVVEKRRKIMAAFDVIRNRNRE 664 >gi|312075231|ref|XP_003140325.1| hypothetical protein LOAG_04740 [Loa loa] gi|307764511|gb|EFO23745.1| hypothetical protein LOAG_04740 [Loa loa] Length = 1395 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 50/128 (39%), Gaps = 7/128 (5%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 R L + R+ ++ ++ +N +K + ++ KN K + + + + K + Sbjct: 1126 RDELKDDSRVTKKKNDEKETR-RNEEKDSRKEGDKFRKNDRKDEKMTSKRWDNKEGKTKG 1184 Query: 63 YKRRYYLKNRDKM-----REKARQSY-RKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 + +N +K+ +E R+ + K S S + P + + + ++ Sbjct: 1185 EREAKRCENAEKVEPEPTKESRRKEHGEKRKSDASRSDHKRPRLRSYGSHDEQRSDSSKS 1244 Query: 117 KAKPIEEL 124 K + E+ Sbjct: 1245 DQKFLSEI 1252 >gi|223635141|sp|A8WX15|EIF3B_CAEBR RecName: Full=Eukaryotic translation initiation factor 3 subunit B; Short=eIF3b; AltName: Full=Eukaryotic translation initiation factor 3 subunit 9 Length = 722 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 11/90 (12%) Query: 1 MGRKVLTPEERMLCRREYKR----RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 +G ++ T + R LCR+ R ++ + K+ E+++ KN K + + + Sbjct: 583 LGYRIFTFQGRELCRKNLDRLAQFKWRPRPPVKLSEQKQNEIRKNLKKTAAKFVK--QDD 640 Query: 57 KDKYREYKRRYYLKNR-----DKMREKARQ 81 +K R + + + D +R + R+ Sbjct: 641 DEKCRASQEVVEKRRKIMAAFDVIRNRNRE 670 >gi|237833469|ref|XP_002366032.1| hypothetical protein TGME49_023460 [Toxoplasma gondii ME49] gi|211963696|gb|EEA98891.1| hypothetical protein TGME49_023460 [Toxoplasma gondii ME49] Length = 2491 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 2/51 (3%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDK 59 E +E KNR+K + R + KN++K + + KN++K Sbjct: 2216 RESARSNKERNVEKEDKNREK-EDTNREKEDKNREK-EDKNREKEDKNREK 2264 >gi|323335423|gb|EGA76709.1| Mrm1p [Saccharomyces cerevisiae Vin13] gi|323346586|gb|EGA80873.1| Mrm1p [Saccharomyces cerevisiae Lalvin QA23] Length = 412 Score = 34.3 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 36/110 (32%), Gaps = 6/110 (5%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL-KNRDKMR 76 K+++ N DK +R+ + K + + + KN+ K+ Sbjct: 45 RKKQWETLNEDKASWFKRKY-----AHVHAREQDRAADPYGKKKAHVEKLKEIKNQAKLN 99 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIY 126 +K+ +S + + + P+ ++ + L+Y Sbjct: 100 QKSHKSKFQNKDIALKLMNDNPIFEYVYGTNSVYAALLNPSRNCHSRLLY 149 >gi|300637988|gb|ADK26158.1| Ycf1 [Asimina triloba] Length = 577 Score = 34.3 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 47/113 (41%), Gaps = 5/113 (4%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYY----LKNKDKYREYKRRYYLKNRDKM 75 + Y + N K ER R+ K K+++ + + K K++ + ++ L ++K Sbjct: 272 KDYEIFNFTKAQERYLRQRKKEKERLNQKEKEEKERLNQKEKEQLNQKEKE-RLNQKEKE 330 Query: 76 REKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 R ++ + + + + + + LE++ A I+EL +R Sbjct: 331 RLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKELRKRR 383 >gi|307203942|gb|EFN82849.1| Death-inducer obliterator 1 [Harpegnathos saltator] Length = 2352 Score = 34.3 bits (77), Expect = 6.1, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 38/97 (39%), Gaps = 2/97 (2%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR-Y 67 +ER +E R NR K +RR R +++ + K+K++ + Sbjct: 1511 DERRKKDKERGRE-RDSNRSKGKDRRERSRSRHRHSRDRDRSKTRDKSKEQKSGRGKESK 1569 Query: 68 YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKA 104 K++++ ++K R+ + KDS + Sbjct: 1570 REKDKERDKDKVRERNKDRDRKDSRERGRQNERDRHK 1606 >gi|325089964|gb|EGC43274.1| PHD finger domain-containing protein [Ajellomyces capsulatus H88] Length = 978 Score = 34.3 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 25/115 (21%), Positives = 50/115 (43%), Gaps = 14/115 (12%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY--- 60 ++ + +ER E + ++I + K K E Y + +++ Sbjct: 337 RIASKQERERQEMEAAEEARKREAERIAALK----ELEKQKKIEKERLYRMMTREQRLKD 392 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAP---EEPMGMTKAEIEALERE 112 RE KR+ + + K+ E+A+ K+ S +S ++ + M K +IEAL+ E Sbjct: 393 REEKRKLHEEELAKIAEEAK----KIESGESRMSERHLKAEMEKRKKDIEALQNE 443 >gi|38649400|gb|AAH63220.1| gnl3-prov protein [Xenopus (Silurana) tropicalis] Length = 550 Score = 34.3 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 42/115 (36%), Gaps = 14/115 (12%) Query: 4 KVLTPEERMLCRREY--KRRYYLKNRDKILERRRRRYLKNKDKIR-----------ESYH 50 K+ +R+ C + Y +++ NR E ++ K K +I Sbjct: 7 KLKKGSKRLSCHKRYKIQKKVREHNRKARKEAKKSGTRKQKKEISVPNNAPFKAEILQEA 66 Query: 51 QYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 Q + +++ ++ ++ K K R K + +K K ++ G AE Sbjct: 67 QRRRQQEEELKQNRKLERQKEVAK-RRKLDEKKKKNSEKREKRDNKKNKGTKAAE 120 >gi|291527503|emb|CBK93089.1| MutS2 family protein [Eubacterium rectale M104/1] Length = 792 Score = 34.3 bits (77), Expect = 6.3, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 40/97 (41%), Gaps = 4/97 (4%) Query: 32 ERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD--KMREKARQSYRK-LYS 88 E+ + K +I + Q KN+ + + N + K+ + A++ K + + Sbjct: 533 EKEQLEINSYKAEIEKLKKQLEEKNERIDKSKDKILREANEEAYKILQDAKELADKTIRN 592 Query: 89 KDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELI 125 + + + PM + E AL R+ K K + ++ Sbjct: 593 FNKYGQGQAPMSQMEKERSAL-RDKMNDKEKKLSDIK 628 >gi|307704515|ref|ZP_07641423.1| immunogenic secreted protein [Streptococcus mitis SK597] gi|307621928|gb|EFO00957.1| immunogenic secreted protein [Streptococcus mitis SK597] Length = 879 Score = 34.3 bits (77), Expect = 6.3, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 44/140 (31%), Gaps = 29/140 (20%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRES-----------YHQ 51 RK E L ++ + L +R E ++ L NKD Q Sbjct: 13 RKRRIKEGSYLNNKKTPKEARLDSRVAFKEAKKNYRLANKDYKLHKGPQATARGLFFKQQ 72 Query: 52 YYLKNKDKYREYKRRYYLKNRD------KMREKARQSYRKLYSKD------------SWI 93 +K K D +++++A+QS R D W+ Sbjct: 73 RDRAKLEKQIATTLYKRAKKADDSYIPNRLKKRAKQSARMNARHDVEKIVRDNEILGDWV 132 Query: 94 APEEPMGMTKAEIEALEREI 113 + + K ++ A ++ + Sbjct: 133 QARQNIRQFKYQVNASKKAL 152 >gi|46124859|ref|XP_386983.1| hypothetical protein FG06807.1 [Gibberella zeae PH-1] Length = 778 Score = 34.3 bits (77), Expect = 6.3, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 37/75 (49%), Gaps = 7/75 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +++ + R+K++E+R++R K +K + + K+ +E +++ K+ +K Sbjct: 600 KDHAKAVRE--REKLVEKRQKRAQKESEKALKEAARRE---KEALKESQKK--EKDAEKE 652 Query: 76 REKARQSYRKLYSKD 90 +++ K K+ Sbjct: 653 QQRQASEREKQAQKE 667 >gi|298714686|emb|CBJ27611.1| conserved unknown protein [Ectocarpus siliculosus] Length = 2173 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 32/77 (41%), Gaps = 3/77 (3%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKD---KIRESYHQYYLKNKDKYREYKRR 66 ER R+ + K R+ E+ R R K + K+ + + K++++ + R Sbjct: 574 ERQAERQAAHVKKLEKQREIAEEKERARRAKEAERLAKLEQKRLEKEAKDRERAEQQARE 633 Query: 67 YYLKNRDKMREKARQSY 83 + +K R + + Y Sbjct: 634 QARRGIEKERARLAKRY 650 >gi|291524453|emb|CBK90040.1| MutS2 family protein [Eubacterium rectale DSM 17629] Length = 792 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 40/97 (41%), Gaps = 4/97 (4%) Query: 32 ERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD--KMREKARQSYRK-LYS 88 E+ + K +I + Q KN+ + + N + K+ + A++ K + + Sbjct: 533 EKEQLEINSYKAEIEKLKKQLEEKNERIDKSKDKILREANEEAYKILQDAKELADKTIRN 592 Query: 89 KDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELI 125 + + + PM + E AL R+ K K + ++ Sbjct: 593 FNKYGQGQAPMSQMEKERSAL-RDKMNDKEKKLSDIK 628 >gi|194751205|ref|XP_001957917.1| GF10653 [Drosophila ananassae] gi|190625199|gb|EDV40723.1| GF10653 [Drosophila ananassae] Length = 1128 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 23/112 (20%), Positives = 39/112 (34%), Gaps = 15/112 (13%) Query: 23 YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNK-----DKYREYKRRYYLKNRDKMRE 77 + + I ++ Y NKD I K +K ++ R + + E Sbjct: 852 EERRKVGIQAWEQKYYELNKDIIEAKREAEAETRKEMERLEKEEQFASRKRKADEEGGEE 911 Query: 78 KARQSYRK-LYSKDSWIAPEE---------PMGMTKAEIEALEREIARLKAK 119 K Q+Y Y++ + +E M + + E L RE A K K Sbjct: 912 KKGQNYAGLNYNERMAVKWDELNLNRMMTILMSRKQMDPEKLARETALEKEK 963 >gi|190407514|gb|EDV10781.1| ribose methyltransferase for mitochondrial 21S rRNA [Saccharomyces cerevisiae RM11-1a] Length = 385 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 36/110 (32%), Gaps = 6/110 (5%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL-KNRDKMR 76 K+++ N DK +R+ + K + + + KN+ K+ Sbjct: 18 RKKQWETLNEDKASWFKRKY-----AHVHAREQDRAADPYGKKKAHVEKLKEIKNQAKLN 72 Query: 77 EKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIY 126 +K+ +S + + + P+ ++ + L+Y Sbjct: 73 QKSHKSKFQNKDIALKLMNDNPIFEYVYGTNSVYAALLNPSRNCHSRLLY 122 >gi|308448072|ref|XP_003087606.1| hypothetical protein CRE_13862 [Caenorhabditis remanei] gi|308254357|gb|EFO98309.1| hypothetical protein CRE_13862 [Caenorhabditis remanei] Length = 493 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 34/82 (41%) Query: 46 RESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 R + ++Y KN K + Y K ++ E + + +DS + P+ + M Sbjct: 317 RAQWKKWYEKNIAKAEQKDFVYRTKTEEEKDEMDIEEFFTNQEQDSQVLPDSDVSMLLEA 376 Query: 106 IEALEREIARLKAKPIEELIYK 127 IE + + K++ + Y+ Sbjct: 377 IEQKKFTLVDEKSRKQSDERYE 398 >gi|209554013|ref|YP_002284427.1| ribose/galactose ABC transporter [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|209541514|gb|ACI59743.1| ribose/galactose ABC transporter [Ureaplasma urealyticum serovar 10 str. ATCC 33699] Length = 736 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 33/96 (34%), Gaps = 9/96 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y R Y KN + I E + NK +QY + K+ +E + KN+ + Sbjct: 379 KDYYRDYENKNNNTISEYKAIYIEINK-----KRNQYLQELKELRKELDK---FKNQKQY 430 Query: 76 REKARQSYRK-LYSKDSWIAPEEPMGMTKAEIEALE 110 Q Y K ++ + E Sbjct: 431 NGVNEQYQHTVNYIKTTYNNWLKVEYSKYLEQMKNN 466 >gi|297793051|ref|XP_002864410.1| hypothetical protein ARALYDRAFT_357817 [Arabidopsis lyrata subsp. lyrata] gi|297310245|gb|EFH40669.1| hypothetical protein ARALYDRAFT_357817 [Arabidopsis lyrata subsp. lyrata] Length = 1781 Score = 34.3 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 36/95 (37%), Gaps = 4/95 (4%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL 69 +R R E ++ R +I E +R++ ++ E + + + Sbjct: 1598 DRKRQREEEDKKLKEAKRQRIAEIQRQQREADEKLQAEKELKRQAMEARTKAQKVLKADQ 1657 Query: 70 KNRDKMREKAR-QSY---RKLYSKDSWIAPEEPMG 100 N +K+R +AR +SY S+ S + + Sbjct: 1658 SNAEKIRGQARSKSYSSDDTNASRSSRDTDFKVIS 1692 >gi|300795801|ref|NP_001178685.1| THO complex subunit 2 [Rattus norvegicus] gi|109512524|ref|XP_001058768.1| PREDICTED: THO complex 2 [Rattus norvegicus] Length = 1594 Score = 34.3 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 30/84 (35%), Gaps = 2/84 (2%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 + K E ++ K++++ RE + K++ R++ D + + ++ Sbjct: 1449 KSKEREMDKKDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDITKRRKEENGTMG 1508 Query: 87 YSKDSWIAPEEPM--GMTKAEIEA 108 SK +P E E Sbjct: 1509 VSKHKSESPCESQYPNEKDKEKNK 1532 >gi|147899081|ref|NP_001086608.1| ATP-dependent RNA helicase DDX55 [Xenopus laevis] gi|82199952|sp|Q6AZV7|DDX55_XENLA RecName: Full=ATP-dependent RNA helicase DDX55; AltName: Full=DEAD box protein 55 gi|50603912|gb|AAH77172.1| Ddx55-prov protein [Xenopus laevis] Length = 594 Score = 34.3 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 6/61 (9%) Query: 21 RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKAR 80 Y KNR+K ++ + K+K+ +++ KNK ++ R+ + + K+ K + Sbjct: 485 AYKDKNREKQRQKMLKER---KEKLETEGRKHFAKNKAWSKQKARK---EKKQKVALKRK 538 Query: 81 Q 81 + Sbjct: 539 K 539 >gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster] Length = 3109 Score = 34.3 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 48/120 (40%), Gaps = 7/120 (5%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 ++ ++ M ++ ++ +N++K + KN K+ + LKN+ E Sbjct: 1069 EIAKQQQAMRDAKKLQKEELARNKEKARQ------EKNS-KLEQQRKDKELKNQQAVEER 1121 Query: 64 KRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEE 123 +RR + +M E R+ + K + + + + EI +L +P E+ Sbjct: 1122 ERRRQHMSLIRMLELRRKFEDREKKKHQLVLDRLLLRERRMAERKRDAEILQLIRRPNED 1181 >gi|326504734|dbj|BAK06658.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 592 Score = 34.3 bits (77), Expect = 6.6, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 43/101 (42%), Gaps = 18/101 (17%) Query: 8 PEERMLCRREYKRRY---YLKNRDKILERRR-RRYLKNKDKIRESYHQYYLK-----NKD 58 ER RE++ R +R+K+ E+ R K ++K + + + + Sbjct: 57 DRERHRSGREHRDREDGKEKADREKVREKDEGRDREKVREKDGGRSREKVREKDESRDPE 116 Query: 59 KYREY-----KRRYYLKN----RDKMREKARQSYRKLYSKD 90 K RE + + K+ R+K+REK R+ +L ++ Sbjct: 117 KVREKEESRDREKVREKDGSRDREKVREKEREGRDRLMERE 157 >gi|18139836|gb|AAL60161.1|AF412333_1 Williams syndrome transcription factor [Xenopus laevis] Length = 1079 Score = 34.3 bits (77), Expect = 6.6, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY 67 E+++++ + ++ E R++ K +I+E + K +++ E ++RY Sbjct: 142 EHRKQWAVMTEEQREEYMRKKREALKVRIKEKARERKQKEREERLEKQKRY 192 >gi|18376049|emb|CAD21055.1| hypothetical protein [Neurospora crassa] Length = 1367 Score = 34.3 bits (77), Expect = 6.6, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 33/78 (42%), Gaps = 11/78 (14%) Query: 8 PEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY-REYKRR 66 P+ RE +R++ E R+ Y NK +++ + + E +R+ Sbjct: 1269 PQAESPSVRERRRKH---------EYDRKWYWNNKALLQQEKDEQATQPGPHIESERQRQ 1319 Query: 67 YYLKNRDKMREKARQSYR 84 L +R++M++ R+ Sbjct: 1320 ARL-HRERMKKYQREKRA 1336 >gi|326523595|dbj|BAJ92968.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 592 Score = 33.9 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 43/101 (42%), Gaps = 18/101 (17%) Query: 8 PEERMLCRREYKRRY---YLKNRDKILERRR-RRYLKNKDKIRESYHQYYLK-----NKD 58 ER RE++ R +R+K+ E+ R K ++K + + + + Sbjct: 57 DRERHRSGREHRDREDGKEKADREKVREKDEGRDREKVREKDGGRSREKVREKDESRDPE 116 Query: 59 KYREY-----KRRYYLKN----RDKMREKARQSYRKLYSKD 90 K RE + + K+ R+K+REK R+ +L ++ Sbjct: 117 KVREKEESRDREKVREKDGSRDREKVREKEREGRDRLMERE 157 >gi|326494786|dbj|BAJ94512.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 592 Score = 33.9 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 43/101 (42%), Gaps = 18/101 (17%) Query: 8 PEERMLCRREYKRRY---YLKNRDKILERRR-RRYLKNKDKIRESYHQYYLK-----NKD 58 ER RE++ R +R+K+ E+ R K ++K + + + + Sbjct: 57 DRERHRSGREHRDREDGKEKADREKVREKDEGRDREKVREKDGGRSREKVREKDESRDPE 116 Query: 59 KYREY-----KRRYYLKN----RDKMREKARQSYRKLYSKD 90 K RE + + K+ R+K+REK R+ +L ++ Sbjct: 117 KVREKEESRDREKVREKDGSRDREKVREKEREGRDRLMERE 157 >gi|57525164|ref|NP_001006185.1| ATP-dependent RNA helicase DDX55 [Gallus gallus] gi|82197869|sp|Q5ZLN8|DDX55_CHICK RecName: Full=ATP-dependent RNA helicase DDX55; AltName: Full=DEAD box protein 55 gi|53129029|emb|CAG31355.1| hypothetical protein RCJMB04_5g4 [Gallus gallus] Length = 591 Score = 33.9 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 3/93 (3%) Query: 37 RYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK-ARQSYRKLYSKDSWIAP 95 KN++K + Q + K++ ++ ++KN+ ++K R+ RKL +K Sbjct: 486 FKDKNREK--QRQKQLEQQRKEREESEGKKKFIKNKSWSKQKAKREKKRKLTAKRKREEG 543 Query: 96 EEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 + E+ R + RLK I E +++ Sbjct: 544 SDMEDEDMEELLNDTRLLKRLKKGKISEEEFEK 576 >gi|268530390|ref|XP_002630321.1| C. briggsae CBR-EIF-3.B protein [Caenorhabditis briggsae] Length = 722 Score = 33.9 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 11/90 (12%) Query: 1 MGRKVLTPEERMLCRREYKR----RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 +G ++ T + R LCR+ R ++ + K+ E+++ KN K + + + Sbjct: 583 LGYRIFTFQGRELCRKNLDRLAQFKWRPRPPVKLSEQKQNEIRKNLKKTAAKFVK--QDD 640 Query: 57 KDKYREYKRRYYLKNR-----DKMREKARQ 81 +K R + + + D +R + R+ Sbjct: 641 DEKCRASQEVVEKRRKIMAAFDVIRNRNRE 670 >gi|322787841|gb|EFZ13753.1| hypothetical protein SINV_04777 [Solenopsis invicta] Length = 482 Score = 33.9 bits (76), Expect = 7.0, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 35/83 (42%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 EE+ L KR Y+++ + +RR+RY K+ R Y K+ +RR Sbjct: 204 EEQDLHNYAKKRLRYVEDIENNRAKRRQRYQKDLINNRVKQLHRYKKDLINNLAKQRRRN 263 Query: 69 LKNRDKMREKARQSYRKLYSKDS 91 ++K R RQ Y K + Sbjct: 264 EDKKEKNRFNKRQRYSKRSQHER 286 >gi|307186615|gb|EFN72119.1| hypothetical protein EAG_05501 [Camponotus floridanus] Length = 1093 Score = 33.9 bits (76), Expect = 7.0, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 39/111 (35%), Gaps = 9/111 (8%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERR-----RRRYLKNKDKIRESYHQYYLKNKDKYR 61 + + E ++ Y N ++I ER+ + N K S Y KN + Y Sbjct: 401 KDKNSLRIEVESEKNYNSNNTNEIYERQSNKYEKLYKQNNPKKESASLKDIYEKNYESYN 460 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 + + N ++ +R +Y K I E + E ++E Sbjct: 461 LSRGN--IDNCERQSYSSRTDSVLMYEKIDKIGRNE--TNSTYEQTDRKKE 507 >gi|62510617|sp|Q6P4W5|GNL3_XENTR RecName: Full=Guanine nucleotide-binding protein-like 3; AltName: Full=Nucleostemin-like protein Length = 548 Score = 33.9 bits (76), Expect = 7.0, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 42/115 (36%), Gaps = 14/115 (12%) Query: 4 KVLTPEERMLCRREY--KRRYYLKNRDKILERRRRRYLKNKDKIR-----------ESYH 50 K+ +R+ C + Y +++ NR E ++ K K +I Sbjct: 5 KLKKGSKRLSCHKRYKIQKKVREHNRKARKEAKKSGTRKQKKEISVPNNAPFKAEILQEA 64 Query: 51 QYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 Q + +++ ++ ++ K K R K + +K K ++ G AE Sbjct: 65 QRRRQQEEELKQNRKLERQKEVAK-RRKLDEKKKKNSEKREKRDNKKNKGTKAAE 118 >gi|296236333|ref|XP_002763278.1| PREDICTED: THO complex subunit 2-like [Callithrix jacchus] Length = 1365 Score = 33.9 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 31/84 (36%), Gaps = 2/84 (2%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 + K E ++ K++++ RE + K++ R++ D ++ + ++ Sbjct: 1221 KSKEREMDKKDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLIKRRKEENGTMG 1280 Query: 87 YSKDSWIAPEEP--MGMTKAEIEA 108 SK +P E E Sbjct: 1281 VSKHKSESPCESPYPNEKDKEKNK 1304 >gi|291397324|ref|XP_002715062.1| PREDICTED: WD repeat domain 60 isoform 2 [Oryctolagus cuniculus] Length = 1014 Score = 33.9 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 36/71 (50%), Gaps = 2/71 (2%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 E + +++ + ER + + ++ + ++ +D+ ++ +R+ ++RDK Sbjct: 44 ESETDLPEHKEHKYRDPEREAKYRERLTEREAHASREHPRGERDREKQRERKKDARDRDK 103 Query: 75 MREKARQSYRK 85 EK+R+ YR+ Sbjct: 104 --EKSREKYRE 112 >gi|71998426|ref|NP_001022469.1| Eukaryotic Initiation Factor family member (eif-3.B) [Caenorhabditis elegans] gi|75028999|sp|Q9XWI6|EIF3B_CAEEL RecName: Full=Eukaryotic translation initiation factor 3 subunit B; Short=eIF3b; AltName: Full=Eukaryotic translation initiation factor 3 subunit 9 gi|3881103|emb|CAA21681.1| C. elegans protein Y54E2A.11a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 725 Score = 33.9 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 46/103 (44%), Gaps = 13/103 (12%) Query: 1 MGRKVLTPEERMLCRREYKR----RYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKN 56 +G ++ T + R LCR+ R ++ + K+ E+++R KN K + + + Sbjct: 586 LGYRIFTFQGRELCRKNLDRLAQFKWRPRPPVKLSEQKQREIKKNLKKTAAKFIK--QDD 643 Query: 57 KDKYREYKRRYYLKNR-----DKMREKARQSYRKLYSKDSWIA 94 +K R + + + D +R + R+ + ++D I+ Sbjct: 644 DEKCRASQEVVEKRRKIMAAFDIIRSRNRE--QLDATRDERIS 684 >gi|149635100|ref|XP_001510757.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 867 Score = 33.9 bits (76), Expect = 7.3, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 52/140 (37%), Gaps = 17/140 (12%) Query: 4 KVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESY-----------HQY 52 K L + +CR++ + + R +I R ++ Y + + ++R Sbjct: 684 KRLQEFKERICRQKLAQTKVKEKRQQI-ARAKKYYEEYRVQLRAKMMRARTREERIFKNL 742 Query: 53 YLKNKDKYREYKRRYYLKNRDKMREKARQSYR-----KLYSKDSWIAPEEPMGMTKAEIE 107 + + + ++ R +++K E+ RQ + Y +D + E + + EI Sbjct: 743 FEEGLEIQKQRLRELRAYSKEKRDEQKRQHQNELESMENYYRDQFSMLAEAVSQERQEIN 802 Query: 108 ALEREIARLKAKPIEELIYK 127 E+ + K E K Sbjct: 803 LREKAQTKTLRKIKREFRSK 822 >gi|126341598|ref|XP_001378834.1| PREDICTED: similar to transmembrane channel-like protein 2 [Monodelphis domestica] Length = 1028 Score = 33.9 bits (76), Expect = 7.3, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 2/70 (2%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 CR+E K + + K L +++ K+++ E ++ K + KR N++ Sbjct: 85 CRKEEKEEGSREIKIKALHQKKDVRRKHREMFGED--EWNKAEKSSNGKRKRERKASNKE 142 Query: 74 KMREKARQSY 83 + K ++ Sbjct: 143 QKEAKNKEKQ 152 >gi|308161777|gb|EFO64212.1| Seryl-tRNA synthetase [Giardia lamblia P15] Length = 457 Score = 33.9 bits (76), Expect = 7.4, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 9/92 (9%) Query: 26 NRDKILERRRRRYLKNK--DKIRESYHQYYLKNKDKYREYKRRYYLKNRD---KMREK-- 78 N +KI E +RRRY + DK+ E + K+ + + + N+D ++ K Sbjct: 15 NPEKIKESQRRRYASTEVVDKVIELDKAWVSKDYEA-SQLSKELNQLNKDFGAAVKAKSP 73 Query: 79 -ARQSYRKLYSKDSWIAPEEPMGMTKAEIEAL 109 S + +K ++ +G AE A+ Sbjct: 74 TDELSKKIASNKQETERVKKEVGQLLAERNAV 105 >gi|169844613|ref|XP_001829027.1| hypothetical protein CC1G_01707 [Coprinopsis cinerea okayama7#130] gi|116509767|gb|EAU92662.1| hypothetical protein CC1G_01707 [Coprinopsis cinerea okayama7#130] Length = 935 Score = 33.9 bits (76), Expect = 7.4, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 30/60 (50%), Gaps = 3/60 (5%) Query: 24 LKNRDKILERRRRRYLKNKDKI-RESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQS 82 ++R+K+ + + +++I + Q + +NK +E + + R+K +K RQS Sbjct: 770 ERHREKMRDIQAPLTRAEREQIELAAAKQRWERNKAIEKEAMEKREKEQREK--QKRRQS 827 >gi|302815166|ref|XP_002989265.1| hypothetical protein SELMODRAFT_447609 [Selaginella moellendorffii] gi|300143008|gb|EFJ09703.1| hypothetical protein SELMODRAFT_447609 [Selaginella moellendorffii] Length = 434 Score = 33.9 bits (76), Expect = 7.5, Method: Composition-based stats. Identities = 13/80 (16%), Positives = 33/80 (41%), Gaps = 4/80 (5%) Query: 3 RKVLTPEERMLCRREYKRRYY----LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKD 58 RK L +E ++E K+ K+ + + ++ ++ + D + ++ + + Sbjct: 89 RKELRKKELKRNKKERKKVREVGILKKDPEALKDQIKKLDTQKADGALDKTRRHKKRQLE 148 Query: 59 KYREYKRRYYLKNRDKMREK 78 + + DKM+EK Sbjct: 149 DTLNLVLKKRKEYVDKMKEK 168 >gi|253741836|gb|EES98697.1| Seryl-tRNA synthetase [Giardia intestinalis ATCC 50581] Length = 457 Score = 33.9 bits (76), Expect = 7.6, Method: Composition-based stats. Identities = 23/96 (23%), Positives = 41/96 (42%), Gaps = 11/96 (11%) Query: 26 NRDKILERRRRRYL--KNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 N +KI E +RRRY + DK+ E + K+ + + R N+D Sbjct: 15 NPEKIKESQRRRYASIEAVDKVIELDKAWVAKDYEA-GQLSRELNQLNKD--------FG 65 Query: 84 RKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAK 119 + +K + + K E E + +E+A+L A+ Sbjct: 66 AAMKAKSPTDELSKKIATNKQETERVRKEVAQLLAE 101 >gi|224002298|ref|XP_002290821.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220974243|gb|EED92573.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 831 Score = 33.9 bits (76), Expect = 7.6, Method: Composition-based stats. Identities = 12/67 (17%), Positives = 28/67 (41%), Gaps = 5/67 (7%) Query: 26 NRDKILERR--RRRYLKNKDKIRESYHQYYLKNKDK-YREYKRRYYL--KNRDKMREKAR 80 N+ I E+ ++ + K ++K K K++ E R +N ++RE+ Sbjct: 742 NKAHIEEQNSIKQYFEKLEEKEAAKRLARETKRKEEARNERMRSRREKLENAAELREQRM 801 Query: 81 QSYRKLY 87 + + + Sbjct: 802 ERWSTMS 808 >gi|260831888|ref|XP_002610890.1| hypothetical protein BRAFLDRAFT_126279 [Branchiostoma floridae] gi|229296259|gb|EEN66900.1| hypothetical protein BRAFLDRAFT_126279 [Branchiostoma floridae] Length = 1722 Score = 33.9 bits (76), Expect = 7.7, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 23/98 (23%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYL-------KNKDKY----REY 63 R E + Y + + K E+ K K ++Y + +++ RE Sbjct: 5 RAEIQWAYRERKKAKDGEQ-------YKAKEAARVKKHYKPAVSFSHEARERRNAQAREK 57 Query: 64 KRRYYLKNRDKMREKARQS-----YRKLYSKDSWIAPE 96 RRY + + KM E + Y L S DS A E Sbjct: 58 MRRYRARKQFKMEEYNKPHWTNIEYASLTSHDSHCADE 95 >gi|298714387|emb|CBJ27444.1| myosin-like protein [Ectocarpus siliculosus] Length = 1143 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 43/104 (41%), Gaps = 6/104 (5%) Query: 29 KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQS-----Y 83 K R+R++ +N D ++ Y K + E +R +++ +A + Sbjct: 775 KSELRKRKKEKENVDALQAKIKAYEQAAKAREAELRRE-QEAAIARIKAEAEATARAEMR 833 Query: 84 RKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 K ++ ++ K + E +E++ A +KA+ EE K Sbjct: 834 EKARLEEERRVRKDHEAKVKVQEEKMEKDAAEMKARIEEEYRVK 877 >gi|218563708|ref|NP_001136259.1| tyrosine-protein kinase BAZ1B [Xenopus laevis] gi|157390157|emb|CAJ29032.1| Williams syndrome transcription factor [Xenopus laevis] Length = 1441 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 30/55 (54%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 E+++++ + ++ E R++ K +I+E + K +++ E ++RY ++ Sbjct: 504 EHRKQWAVMTEEQREEYMRKKREALKARIKEKTRERKQKEREERLEKQKRYEDQD 558 >gi|116487535|gb|AAI26048.1| Wstf protein [Xenopus laevis] Length = 777 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 30/55 (54%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 E+++++ + ++ E R++ K +I+E + K +++ E ++RY ++ Sbjct: 504 EHRKQWAVMTEEQREEYMRKKREALKARIKEKTRERKQKEREERLEKQKRYEDQD 558 >gi|300638028|gb|ADK26178.1| Ycf1 [Asimina parviflora] Length = 600 Score = 33.9 bits (76), Expect = 7.9, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 39/93 (41%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER + K K+++ + + + + + K + L ++K R Sbjct: 272 KDYEIFNFTKAQERYLHQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNK 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + K E E L ++ Sbjct: 332 KEKERLNKKEKERLNQKEKERLNKKEKERLNKK 364 >gi|260913786|ref|ZP_05920261.1| hemagglutination activity domain protein PfhB [Pasteurella dagmatis ATCC 43325] gi|260632099|gb|EEX50275.1| hemagglutination activity domain protein PfhB [Pasteurella dagmatis ATCC 43325] Length = 2845 Score = 33.9 bits (76), Expect = 7.9, Method: Composition-based stats. Identities = 13/109 (11%), Positives = 39/109 (35%), Gaps = 9/109 (8%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 EE+ + Y N R+ ++ H+ + + K++ + Sbjct: 1054 LNEEKYVKDDAYYDARRTNN---------ERFENLPRDLQRRLHEEFSRRKEEQNAKRLE 1104 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIAR 115 K +++ K +Q ++ + ++ + + E +A E ++ + Sbjct: 1105 LLRKAKEEKEAKNQQRNDNYKVEEKRLENDKIIKQKELEAQANEEKLIK 1153 >gi|225871066|ref|YP_002747013.1| primosomal protein N' [Streptococcus equi subsp. equi 4047] gi|225700470|emb|CAW94892.1| putative primosomal protein N' [Streptococcus equi subsp. equi 4047] Length = 794 Score = 33.9 bits (76), Expect = 8.0, Method: Composition-based stats. Identities = 18/98 (18%), Positives = 39/98 (39%), Gaps = 5/98 (5%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 +DK + Y N+ + ++ + N+ K R+ + Y L N K + A+ Sbjct: 166 KDKKTIKTETYYQVNR----LALQEHKISNRAKKRQALKAYLLANPQKDK-LAKLYQDYS 220 Query: 87 YSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 S+ + + + + + E +KA P+ +L Sbjct: 221 REVVSYFVTNQLIELEERRVNRSEDYFNHIKAAPLLKL 258 >gi|258597525|ref|XP_001350695.2| conserved Plasmodium protein [Plasmodium falciparum 3D7] gi|254945392|gb|AAN36375.2| conserved Plasmodium protein [Plasmodium falciparum 3D7] Length = 2001 Score = 33.9 bits (76), Expect = 8.0, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 39/92 (42%), Gaps = 1/92 (1%) Query: 6 LTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKR 65 +E +E ++ Y KN++ + + +N ++ Y KN ++ ++ Sbjct: 1638 KRAKEAERKNKE-QKNYEDKNKEDKNDEEKNDEEENYEQKNYEQKNYEQKNYEQKNYEQK 1696 Query: 66 RYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 Y KN ++ + + +K Y + ++ ++ Sbjct: 1697 NYEQKNYEQKNYEQKNYEQKNYEQKNYEQKKK 1728 >gi|304317232|ref|YP_003852377.1| MutS2 family protein [Thermoanaerobacterium thermosaccharolyticum DSM 571] gi|302778734|gb|ADL69293.1| MutS2 family protein [Thermoanaerobacterium thermosaccharolyticum DSM 571] Length = 786 Score = 33.9 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 37/93 (39%), Gaps = 4/93 (4%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNK----DKIRESYHQYYLKN 56 + R+V + +E +R + +K E+ R+ K + I + Sbjct: 539 LKRQVNSVKEEYERKRRQTEAERDRIIEKAREKARKILENTKSTADEIIAKLREAEKSDK 598 Query: 57 KDKYREYKRRYYLKNRDKMREKARQSYRKLYSK 89 K+K E R+ +N +M E ++S +Y K Sbjct: 599 KNKLIEEARKKLKENISEMEESLKKSEVPVYKK 631 >gi|9631835|ref|NP_048621.1| hypothetical protein PBCV1_A267L [Paramecium bursaria Chlorella virus 1] gi|1181430|gb|AAC96635.1| A267L [Paramecium bursaria Chlorella virus 1] Length = 314 Score = 33.9 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 1/53 (1%) Query: 55 KNKDKYREYKRRYYLKNRDKMREKARQS-YRKLYSKDSWIAPEEPMGMTKAEI 106 K+ +KY+EY+ Y KN++K+RE + + + I E + K + Sbjct: 4 KDPEKYKEYQTWYRSKNKEKLRECDSKRIQERNKNALDSITSGEIIDKKKWDY 56 >gi|327409587|ref|YP_004347007.1| hypothetical protein LAU_0039 [Lausannevirus] gi|326784761|gb|AEA06895.1| hypothetical protein LAU_0039 [Lausannevirus] Length = 91 Score = 33.9 bits (76), Expect = 8.2, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 30/55 (54%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 ++K L +R+ Y +K++ R+ ++Y NK+ ++ + ++ + E+ ++ Sbjct: 4 KEKRLASQRKYYESHKEEERQRLKEWYETNKEIIKKKRAEKRKTDKARKLEEKKE 58 >gi|327268535|ref|XP_003219052.1| PREDICTED: intersectin-1-like [Anolis carolinensis] Length = 1719 Score = 33.9 bits (76), Expect = 8.2, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 39/106 (36%), Gaps = 9/106 (8%) Query: 16 REYKRRYYLKNRDKILE------RRRRRYLKNKDKIRESYHQ---YYLKNKDKYREYKRR 66 R+ + R K+ E + + + + + +K K++ R+ + Sbjct: 583 RDQLDEVEKETRSKLQEIDIFNNQLKELREIHNKQQLQKQKNIEADRIKQKEQERKTELE 642 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 N+ + +E+ RQ ++ ++ P + + E L ++ Sbjct: 643 KQKDNQRRTQERERQWLDRIQQEEEHPKPRKIQDEETQKWEELNKK 688 >gi|49116962|gb|AAH72944.1| Wstf protein [Xenopus laevis] Length = 777 Score = 33.9 bits (76), Expect = 8.3, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 30/55 (54%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN 71 E+++++ + ++ E R++ K +I+E + K +++ E ++RY ++ Sbjct: 504 EHRKQWAVMTEEQREEYMRKKREALKARIKEKTRERKQKEREERLEKQKRYEDQD 558 >gi|118376418|ref|XP_001021391.1| hypothetical protein TTHERM_00317100 [Tetrahymena thermophila] gi|89303158|gb|EAS01146.1| hypothetical protein TTHERM_00317100 [Tetrahymena thermophila SB210] Length = 1863 Score = 33.9 bits (76), Expect = 8.3, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 46/123 (37%), Gaps = 19/123 (15%) Query: 23 YLKNRDKILERRRRRYLKN-KDKIRESYHQYYLKNKDKYREY-----------------K 64 N++ + + Y +N K++ + H++ ++N + ++Y + Sbjct: 1185 QQANQNDLKN-TKEYYEENVKERKNTNEHEHDIENHQRQQQYFDKLNTAQIIEKQFSIWE 1243 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 ++ N + ++ +++ + + + ++ + K E E + +E Sbjct: 1244 KKEDADNHEDPKQLLKENQDGNEIDEIFDSNQQKLEQKKDESEKQRNSVKSATESDAQEF 1303 Query: 125 IYK 127 Y+ Sbjct: 1304 RYQ 1306 >gi|47565158|ref|ZP_00236201.1| spore germination protein IA [Bacillus cereus G9241] gi|47557944|gb|EAL16269.1| spore germination protein IA [Bacillus cereus G9241] Length = 678 Score = 33.9 bits (76), Expect = 8.3, Method: Composition-based stats. Identities = 12/89 (13%), Positives = 32/89 (35%), Gaps = 3/89 (3%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK 78 +++ L +K + N++ + + NKD E K+ ++++ R+K Sbjct: 7 RKKKKLNTTEKNETDNSEQQPNNQEDDNKKQTRSMKHNKDSNGEQKK---EEHKESSRDK 63 Query: 79 ARQSYRKLYSKDSWIAPEEPMGMTKAEIE 107 + + + S E ++ Sbjct: 64 QQDKSVQNQQQHSKQDNSEQNQQQHSKQN 92 >gi|309365641|emb|CAP22948.2| hypothetical protein CBG_01673 [Caenorhabditis briggsae AF16] Length = 4379 Score = 33.9 bits (76), Expect = 8.4, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 34/82 (41%) Query: 46 RESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAE 105 R + ++Y KN K + Y K ++ E + + +DS + P+ + + Sbjct: 2714 RAQWKKWYEKNIAKAEQKDFIYRTKTEEEKDEMDIEEFFAKQEQDSQVLPDSDVSLLLEA 2773 Query: 106 IEALEREIARLKAKPIEELIYK 127 +E + + K++ + Y+ Sbjct: 2774 VEQKKFSLVDDKSRKQSDERYE 2795 >gi|302658601|ref|XP_003021002.1| hypothetical protein TRV_04867 [Trichophyton verrucosum HKI 0517] gi|291184877|gb|EFE40384.1| hypothetical protein TRV_04867 [Trichophyton verrucosum HKI 0517] Length = 1522 Score = 33.9 bits (76), Expect = 8.4, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 15/121 (12%) Query: 9 EERMLCRRE-----YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 + R R KR+ N L R K I + L + E Sbjct: 192 KGRKRAREAEALDNQKRKRRRANFLGKLARVFE-----KQAIAGTERFARLHEAEVIEEV 246 Query: 64 KRRYYLKNRDKMREKARQSYRKL---YSKDSWIAPEEPMGMTK--AEIEALEREIARLKA 118 +++ + +++ ++ R+ R+ + A ++ + + AE E L+REIAR + Sbjct: 247 QQKEIMDEKERKKDMQRKRRRENTVRAEMEKIQAAQKKVSKMEDDAEKEKLQREIARSRK 306 Query: 119 K 119 K Sbjct: 307 K 307 >gi|158297171|ref|XP_317442.4| AGAP008017-PA [Anopheles gambiae str. PEST] gi|157015066|gb|EAA12387.4| AGAP008017-PA [Anopheles gambiae str. PEST] Length = 2930 Score = 33.9 bits (76), Expect = 8.4, Method: Composition-based stats. Identities = 11/75 (14%), Positives = 34/75 (45%), Gaps = 3/75 (4%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 +++ +++ R K N++K R+ + +N +++ + + LKN+ E Sbjct: 995 QRIEIAKQQQALRDAKKLAKEEMNKNKEKARQAKEAERN-ERLEQQRKERELKNQQALEE 1053 Query: 63 YKRRYYLKNRDKMRE 77 +R ++ +++ Sbjct: 1054 RER--RRQHMALIKQ 1066 >gi|301625882|ref|XP_002942130.1| PREDICTED: hypothetical protein LOC100486353 [Xenopus (Silurana) tropicalis] Length = 2359 Score = 33.9 bits (76), Expect = 8.6, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 50/121 (41%), Gaps = 10/121 (8%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYY 68 +ER R YK RY +++ RR L+ +E Y + Y + + YK RY Sbjct: 2181 KERY--SRRYKLRYDQGYKER---YSRRYKLRYDQGYKERYSRRYKERYSRR--YKLRYD 2233 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYKR 128 + + + +Q Y + Y ++ + + + + ER R K + +L Y + Sbjct: 2234 HGYKKRYSRRYKQRYDQGY-RERYSRRYKQRYKLRYDQGYRERYSRRYKQR--YKLRYDQ 2290 Query: 129 G 129 G Sbjct: 2291 G 2291 >gi|327409723|ref|YP_004347143.1| hypothetical protein LAU_0180 [Lausannevirus] gi|326784897|gb|AEA07031.1| hypothetical protein LAU_0180 [Lausannevirus] Length = 88 Score = 33.5 bits (75), Expect = 8.8, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 29/56 (51%) Query: 26 NRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 + +R+R +N + ++ ++Y+KN+ K E K + N+DK+ EK + Sbjct: 9 KKKTNADRQREYRERNPEVAKQRQREWYMKNRAKLLEQKAEKRIANKDKIFEKREK 64 >gi|321469903|gb|EFX80881.1| hypothetical protein DAPPUDRAFT_303739 [Daphnia pulex] Length = 1924 Score = 33.5 bits (75), Expect = 8.8, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 41/103 (39%), Gaps = 3/103 (2%) Query: 26 NRDK-ILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 N+ K LE R+R + + + +++ + + + REK R Sbjct: 1279 NKAKEALEERKRALEAERQAKVALLQEKRRQREERIDREHQEKEKERLELAREKQRDREE 1338 Query: 85 KLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 ++ + + A + + +I+ + E AR + IE++ K Sbjct: 1339 RMSALQA--AHQASQSQLQRKIQQKQEESARRHEENIEQIRQK 1379 >gi|300638022|gb|ADK26175.1| Ycf1 [Asimina pygmaea] Length = 606 Score = 33.5 bits (75), Expect = 9.1, Method: Composition-based stats. Identities = 17/121 (14%), Positives = 52/121 (42%), Gaps = 1/121 (0%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRY-LKNKDKIRESYHQYYLKNKDKYREYKRRY 67 +E+ ++ K + K ++++ ++ + + K K+++ + + K + + K + Sbjct: 292 KEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNKKEKERLNKKEKE 351 Query: 68 YLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIYK 127 L ++K R ++ + + + + + + LE++ A I+EL + Sbjct: 352 RLNKKEKERLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKELRKR 411 Query: 128 R 128 R Sbjct: 412 R 412 >gi|67469501|ref|XP_650729.1| WD domain containing protein [Entamoeba histolytica HM-1:IMSS] gi|56467381|gb|EAL45343.1| WD domain containing protein [Entamoeba histolytica HM-1:IMSS] Length = 764 Score = 33.5 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 50/125 (40%), Gaps = 9/125 (7%) Query: 3 RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYRE 62 +KV E+ L ++ +R + +++E R + + + E K K++ +E Sbjct: 405 QKVELDEKSQLEKKLIEREKLEEKAKELIESER--RSETRKSLVEESKIEREKMKERIKE 462 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEP-MGMTKAEIEALEREIARLKAKPI 121 ++++ + + K PE+ + +++ LE + K+ I Sbjct: 463 RWSE------EEIKPRNERLSEKRKRDRKDEIPEDNEIKKKTWQLKELEICDSIKKSIMI 516 Query: 122 EELIY 126 +++Y Sbjct: 517 NKVMY 521 >gi|255934056|ref|XP_002558309.1| Pc12g15060 [Penicillium chrysogenum Wisconsin 54-1255] gi|211582928|emb|CAP81133.1| Pc12g15060 [Penicillium chrysogenum Wisconsin 54-1255] Length = 961 Score = 33.5 bits (75), Expect = 9.3, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 34/68 (50%), Gaps = 3/68 (4%) Query: 15 RREYKRRYYLKNRD--KILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 R+E ++ + + R +I E +R + + K RE Q +++ + +E + +R Sbjct: 426 RKEREKVWREEERKQREIEEEEKRVRDEERKKKRELERQKDEEDRARRKEIDEQ-RRADR 484 Query: 73 DKMREKAR 80 ++ RE+ R Sbjct: 485 ERQREEQR 492 >gi|300637968|gb|ADK26148.1| Ycf1 [Asimina triloba] Length = 587 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 51/122 (41%), Gaps = 8/122 (6%) Query: 7 TPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRR 66 +ER L +R+ K ++++ ++ + + K +E + K K++ + ++ Sbjct: 280 KAQERYLRQRK-------KEKERLNQKEKEEKERLNQKEKEEKERLNQKEKEQLNQKEKE 332 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEELIY 126 L ++K R ++ + + + + + + LE++ A I+EL Sbjct: 333 -RLNQKEKERLNQKEKEQLNQKEKEQPGQKNLGSDPQNQQKGLEKDYAESDINKIKELRK 391 Query: 127 KR 128 +R Sbjct: 392 RR 393 >gi|320167376|gb|EFW44275.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 1156 Score = 33.5 bits (75), Expect = 9.5, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 38/98 (38%), Gaps = 5/98 (5%) Query: 27 RDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK-NRDKMREKARQSYRK 85 ++++L ++ + ++ + + KN+++ KR+ K +D +E+ R Sbjct: 343 KEELLRQKEDAKRQKDEQKLQRELEVQKKNQEREDAKKRKEDEKRQKDLAKEEERVRKEA 402 Query: 86 LYSKDSWIAPEEPMG----MTKAEIEALEREIARLKAK 119 ++ + K E + + +A AK Sbjct: 403 AKAQQEAERAKRIADEAALKAKREADKENKRLADEAAK 440 >gi|298251435|ref|ZP_06975238.1| transposase IS4 family protein [Ktedonobacter racemifer DSM 44963] gi|297546027|gb|EFH79895.1| transposase IS4 family protein [Ktedonobacter racemifer DSM 44963] Length = 557 Score = 33.5 bits (75), Expect = 9.5, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 44/120 (36%), Gaps = 5/120 (4%) Query: 3 RKVLTPEERMLCRREYKRRYYLK-----NRDKILERRRRRYLKNKDKIRESYHQYYLKNK 57 RK +R ++ +++ K + + + + + + E ++ + Sbjct: 405 RKHKEERQRQRKSKQSEQKRQEKVQAQAHEAECRAQHKNARAQRTREQEEQRTRHREEKA 464 Query: 58 DKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 + RE++ + R+K+ Q + +++ E +A+ + E AR + Sbjct: 465 QRVREHEAHRAQRTREKVSRTQEQEHEHERKREARRQRHEEQEQIRAQRQLWHEECARTR 524 >gi|298207258|ref|YP_003715437.1| glutamyl-tRNA synthetase [Croceibacter atlanticus HTCC2559] gi|83849894|gb|EAP87762.1| glutamyl-tRNA synthetase [Croceibacter atlanticus HTCC2559] Length = 1146 Score = 33.5 bits (75), Expect = 9.5, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 44/111 (39%), Gaps = 5/111 (4%) Query: 9 EERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI--RESYHQYYLKNKDKYREYKRR 66 EER E ++ + + + + +K+ +Q KN ++ E +R Sbjct: 585 EERFQRNEEQLKKNEELLEELQKVQDKINKEELSEKLEKLAKQNQNEEKNLEQLLELTKR 644 Query: 67 YYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 YY+ +K K + KL K ++ + TK E L +E +L+ Sbjct: 645 YYV---EKKALKVAEDLEKLAQKQEELSKNDGDENTKEAQEELNKEFKKLQ 692 >gi|298250994|ref|ZP_06974798.1| transposase IS4 family protein [Ktedonobacter racemifer DSM 44963] gi|297548998|gb|EFH82865.1| transposase IS4 family protein [Ktedonobacter racemifer DSM 44963] Length = 557 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 44/120 (36%), Gaps = 5/120 (4%) Query: 3 RKVLTPEERMLCRREYKRRYYLK-----NRDKILERRRRRYLKNKDKIRESYHQYYLKNK 57 RK +R ++ +++ K + + + + + + E ++ + Sbjct: 405 RKHKEERQRQRKSKQSEQKRQEKVQAQAHEAECRAQHKNARAQRTREQEEQRTRHREEKA 464 Query: 58 DKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLK 117 + RE++ + R+K+ Q + +++ E +A+ + E AR + Sbjct: 465 QRVREHEAHRAQRTREKVSRTQEQEHEHERKREARRQRHEEQEQIRAQRQLWHEECARTR 524 >gi|300637972|gb|ADK26150.1| Ycf1 [Asimina parviflora] Length = 606 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 39/93 (41%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 + Y + N K ER + K K+++ + + + + + K + L ++K R Sbjct: 272 KDYEIFNFTKAQERYLHQRKKEKERLNQKEKEQLNQKEKEQLNQKEKEQLNQKEKERLNK 331 Query: 80 RQSYRKLYSKDSWIAPEEPMGMTKAEIEALERE 112 ++ R + + +E + K E E L ++ Sbjct: 332 KEKERLNKKEKERLNKKEKERLNKKEKERLNKK 364 >gi|255946730|ref|XP_002564132.1| Pc22g00870 [Penicillium chrysogenum Wisconsin 54-1255] gi|211591149|emb|CAP97375.1| Pc22g00870 [Penicillium chrysogenum Wisconsin 54-1255] Length = 1140 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 12/88 (13%), Positives = 32/88 (36%), Gaps = 15/88 (17%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNR---------DKILERRRRRYL----KNKDKIRE 47 + ++ LT E+ R R + K R ++ ++ + + + K +E Sbjct: 976 IAKQALTASEKKALRAGNMRLFRAKRRFSDIEDGEEEEKADQEKVVRERIETQRRQKKQE 1035 Query: 48 SYHQYYLKNK--DKYREYKRRYYLKNRD 73 + + K ++ + R K ++ Sbjct: 1036 HERKRREEQKKQEELDKKTREEREKQKE 1063 >gi|322711668|gb|EFZ03241.1| stress response protein nst1 [Metarhizium anisopliae ARSEF 23] Length = 1209 Score = 33.5 bits (75), Expect = 9.8, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 51/125 (40%), Gaps = 5/125 (4%) Query: 6 LTPEERMLCRREYKRRYYL-KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 EE +RE R+ + + R++ E+ R+ + + + Q ++K+ + Sbjct: 611 KKAEEEARLKREADRQRRIHEQRERQAEQERKAREAKEREKKLKEEQ-RQRDKEARDMRE 669 Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKPIEEL 124 R + ++K +E+ ++ +K E K + ++ ++A+PI+ Sbjct: 670 RE-AQERKEK-QERDKREKEARAAKAHRELS-EAAQRAKEDKANIKSSTQTVQAQPIQIA 726 Query: 125 IYKRG 129 +G Sbjct: 727 KRGQG 731 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.303 0.135 0.354 Lambda K H 0.267 0.0415 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,009,634,541 Number of Sequences: 14124377 Number of extensions: 41716141 Number of successful extensions: 972841 Number of sequences better than 10.0: 10000 Number of HSP's better than 10.0 without gapping: 9205 Number of HSP's successfully gapped in prelim test: 16573 Number of HSP's that attempted gapping in prelim test: 637659 Number of HSP's gapped (non-prelim): 179163 length of query: 129 length of database: 4,842,793,630 effective HSP length: 95 effective length of query: 34 effective length of database: 3,500,977,815 effective search space: 119033245710 effective search space used: 119033245710 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 75 (33.5 bits)