RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16212
(233 letters)
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 54.4 bits (130), Expect = 1e-08
Identities = 42/124 (33%), Positives = 70/124 (56%), Gaps = 1/124 (0%)
Query: 6 SEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKE 65
+E I E + +E K+ +EE K+A E+ K+ ++ K+ EE K+A EE K+ E
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA-EELKKKEAE 1713
Query: 66 GKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEEV 125
K+ ++E K+A EE K +EE K+ +EE K+ +EE K+ EE K + + K EE+
Sbjct: 1714 EKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773
Query: 126 TQTR 129
+ +
Sbjct: 1774 RKEK 1777
Score = 50.9 bits (121), Expect = 2e-07
Identities = 36/110 (32%), Positives = 66/110 (60%)
Query: 15 KEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGK 74
+E K+ +++ K+ E K+ +EE K+ +E K E + +EE K+ ++E K+A ++ K
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Query: 75 RASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEE 124
+A+E K+ +EE K+A E K+ +EE K+A E K + + IK + +E
Sbjct: 1689 KAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738
Score = 50.5 bits (120), Expect = 3e-07
Identities = 29/99 (29%), Positives = 53/99 (53%)
Query: 15 KEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGK 74
+E K+ K+ +EE K+A+E K ++ E + +E ++ +E K+ +D K
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Query: 75 RASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQL 113
+ +EE K+A E K+A E+ K+A E K A+ + K +
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423
Score = 49.4 bits (117), Expect = 7e-07
Identities = 26/94 (27%), Positives = 52/94 (55%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E + E K+ +EE K+ ++ K+ ++E K+A E K +E ++ + ++ ++
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
Query: 76 ASEEGKRASEEGKRASEEGKRASEEGKRASEEGK 109
EE K+ ++ K+ +EE K+A E K+A E+ K
Sbjct: 1372 KKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Score = 49.0 bits (116), Expect = 1e-06
Identities = 39/118 (33%), Positives = 63/118 (53%), Gaps = 2/118 (1%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E + E + +EE K+ E+ K+ E K+ EE K+A EE K + E + ++E K+
Sbjct: 1616 EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675
Query: 76 ASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCR--KCGEEVTQTRLE 131
+EE K+A E+ K+A+E K+ +EE K+A E K + K K EE + + E
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE 1733
Score = 47.4 bits (112), Expect = 3e-06
Identities = 36/114 (31%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E + E K+ +EE K+A EE K+ ++E K+A E K+A E + + E K+ ++E K+
Sbjct: 1441 EEAKKADEAKKKAEEAKKA-EEAKKKAEEAKKADEAKKKAEE--AKKADEAKKKAEEAKK 1497
Query: 76 ASEEGKRASEEGKRASE----EGKRASEEGKRASEEGKGVQLDIIKCRKCGEEV 125
++E K+A+E K+A E E + ++E K+A E K + + +K +E+
Sbjct: 1498 KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADEL 1551
Score = 47.1 bits (111), Expect = 4e-06
Identities = 35/99 (35%), Positives = 57/99 (57%), Gaps = 8/99 (8%)
Query: 16 EGKRGLKEGKRASEEGKRASE-----EGKRGSKEGKRALEEGKRASEEGKRGSKEGKRAS 70
K+ E K+ +EE K+A E E + + E K+ EE K+A E K+ E + +
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA--EEAKKA 1472
Query: 71 DEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGK 109
DE K+ +EE K+A +E K+ +EE K+ ++E K+A+E K
Sbjct: 1473 DEAKKKAEEAKKA-DEAKKKAEEAKKKADEAKKAAEAKK 1510
Score = 47.1 bits (111), Expect = 4e-06
Identities = 31/104 (29%), Positives = 59/104 (56%), Gaps = 6/104 (5%)
Query: 15 KEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGK 74
+ + + ++ EE K+ ++ K+ ++E K+A +E K+ +EE K+ + E K+A+ K
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAEEDKKKADELKKAAAAKK 1418
Query: 75 RASE-----EGKRASEEGKRASEEGKRASEEGKRASEEGKGVQL 113
+A E E K+ ++E K+ +EE K+A E K+A E K +
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
Score = 45.5 bits (107), Expect = 1e-05
Identities = 36/94 (38%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E K+ E K+ +EE K+A +E K+ ++E K+A E K+A E + + E K+ ++E K+
Sbjct: 1428 EEKKKADEAKKKAEEAKKA-DEAKKKAEEAKKAEEAKKKAEE--AKKADEAKKKAEEAKK 1484
Query: 76 ASEEGKRASEEGKRASEEGKRASEEGKRASEEGK 109
A E K+A E K+A +E K+A+E K+A E K
Sbjct: 1485 ADEAKKKAEEAKKKA-DEAKKAAEAKKKADEAKK 1517
Score = 45.1 bits (106), Expect = 1e-05
Identities = 30/105 (28%), Positives = 55/105 (52%), Gaps = 6/105 (5%)
Query: 16 EGKRGLKEGKRASEEGKRA------SEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRA 69
E K+ E K+ +EE K+A +EE K+ + K+ EE K+A+E K ++
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Query: 70 SDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLD 114
++ + +E ++ EE K+ ++ K+ +EE K+A E K + D
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403
Score = 45.1 bits (106), Expect = 2e-05
Identities = 31/112 (27%), Positives = 57/112 (50%), Gaps = 3/112 (2%)
Query: 2 SKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKR 61
+K+++E ++A K+ E K+ K+A E K+ ++E K+
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA--EEKKKADEAKK 1398
Query: 62 GSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQL 113
++E K+ +DE K+A+ K+A +E K+ +EE K+A E K+A E K +
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKA-DEAKKKAEEKKKADEAKKKAEEAKKADEA 1449
Score = 44.7 bits (105), Expect = 2e-05
Identities = 29/106 (27%), Positives = 58/106 (54%)
Query: 4 ESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGS 63
E + + + E ++ +K + E + E + ++E K+ +E+ K+ E K+ +
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
Query: 64 KEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGK 109
+E K+A +E K + E + +EE K+ +EE K+A E+ K+A+E K
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Score = 42.8 bits (100), Expect = 1e-04
Identities = 38/114 (33%), Positives = 64/114 (56%), Gaps = 8/114 (7%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E + +E K+ +EE K+A E K+ E + +E K+ +EE K+ + E K+A++ K+
Sbjct: 1454 EEAKKAEEAKKKAEEAKKADEAKKKA--EEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Query: 76 ASE----EGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEEV 125
A E E + ++E K+A EE K+A +E K+A E+ K +L + K EE
Sbjct: 1512 ADEAKKAEEAKKADEAKKA-EEAKKA-DEAKKAEEKKKADELKKAEELKKAEEK 1563
Score = 40.9 bits (95), Expect = 4e-04
Identities = 33/102 (32%), Positives = 55/102 (53%), Gaps = 7/102 (6%)
Query: 23 EGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKR 82
E + ++E K+ +EE K+ + E K+A E K+A E K E + +DE K+A E +
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK---AEEAKKADEAKKAEE--AK 1534
Query: 83 ASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEE 124
++E K+A E K+ ++E K+A E K + + K EE
Sbjct: 1535 KADEAKKAEE--KKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
Score = 39.7 bits (92), Expect = 0.001
Identities = 33/98 (33%), Positives = 54/98 (55%), Gaps = 6/98 (6%)
Query: 16 EGKRGLKEGKRASEEGKRASE----EGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASD 71
E R E K+A EE K+A E E K+ + E K+ EE K+A +E K+ ++E K+ +D
Sbjct: 1275 EEARKADELKKA-EEKKKADEAKKAEEKKKADEAKKKAEEAKKA-DEAKKKAEEAKKKAD 1332
Query: 72 EGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGK 109
K+ +EE K+A+E K +E +E + +E +
Sbjct: 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370
Score = 38.6 bits (89), Expect = 0.002
Identities = 30/86 (34%), Positives = 53/86 (61%), Gaps = 7/86 (8%)
Query: 28 SEEGKRASE----EGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRA 83
+EE ++A E E K+ + E K+A E K+ ++E K+ ++E K+A DE K+ +EE K+
Sbjct: 1274 AEEARKADELKKAEEKKKADEAKKA--EEKKKADEAKKKAEEAKKA-DEAKKKAEEAKKK 1330
Query: 84 SEEGKRASEEGKRASEEGKRASEEGK 109
++ K+ +EE K+A+E K +E
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Score = 37.8 bits (87), Expect = 0.004
Identities = 32/102 (31%), Positives = 54/102 (52%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E K+ +E K+A E+ A + + K + +EE + EE K+ E + ++E K
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 76 ASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIK 117
+EE K+A EE K+ + K+ +EE K+A E K + + IK
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662
Score = 36.7 bits (84), Expect = 0.011
Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 2/114 (1%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKR 75
E R +E + A ++A+ + + K + E K+ ++E K+ E K+ +DE K+
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA--EEKKKADEAKK 1309
Query: 76 ASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEEVTQTR 129
+EE K+A E K+A E K+A K+A E K + + +E
Sbjct: 1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363
Score = 29.0 bits (64), Expect = 3.1
Identities = 23/91 (25%), Positives = 41/91 (45%), Gaps = 4/91 (4%)
Query: 23 EGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKR 82
E +R +EE ++A + K E + EE K+ +EE K+ +E +
Sbjct: 1209 EEERKAEEARKAEDAKK---AEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHF 1265
Query: 83 ASEEGKRASEEGKRASEEGKRASEEGKGVQL 113
A + +EE R ++E K+A E+ K +
Sbjct: 1266 ARRQAAIKAEE-ARKADELKKAEEKKKADEA 1295
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 34.2 bits (78), Expect = 0.054
Identities = 31/111 (27%), Positives = 46/111 (41%), Gaps = 7/111 (6%)
Query: 6 SEVPGTITPKEGKR-GLKEGKRASEEGKRASEEGKRGSK-----EGKRALEEGKRASEEG 59
+E G T +EG+R EG+ E G A +EG+ +K EG+ E EG
Sbjct: 638 AEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEG 697
Query: 60 KRGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGKG 110
+ +KE + E EG E + EEG+ +EG+G
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIE-TGEEGEEVEDEGEG 747
Score = 33.4 bits (76), Expect = 0.090
Identities = 37/150 (24%), Positives = 54/150 (36%), Gaps = 15/150 (10%)
Query: 24 GKRASEEGKRASE-EGKRGSKEGKRAL--EEGKRASEEGKRGSKEGKRASD---EGKRAS 77
G+R EEG+R +E EG+ G + G A E + E G +R + EG+ +
Sbjct: 642 GERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEA 701
Query: 78 EEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEEVTQTRLESYLSAL 137
+E E E EG E I+ + GEEV E
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGE--------IETGEEGEEVED-EGEGEAEGK 752
Query: 138 EFIEAQLDRMKHTSYRTTLEVGFESIHSGQ 167
+E + DR + T G E G+
Sbjct: 753 HEVETEGDRKETEHEGETEAEGKEDEDEGE 782
Score = 32.7 bits (74), Expect = 0.18
Identities = 35/134 (26%), Positives = 50/134 (37%), Gaps = 10/134 (7%)
Query: 3 KESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGK-- 60
K E G I KE + E EG E + EEG+ +EG+
Sbjct: 690 KGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIE-TGEEGEEVEDEGEGE 748
Query: 61 -RGSKEGKRASDEGKRASE-----EGKRASEEGK-RASEEGKRASEEGKRASEEGKGVQL 113
G E + D + E EGK +EG+ +A E+G+ +EG E +G
Sbjct: 749 AEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETE 808
Query: 114 DIIKCRKCGEEVTQ 127
K G+ TQ
Sbjct: 809 AGEKDEHEGQSETQ 822
Score = 31.9 bits (72), Expect = 0.35
Identities = 35/126 (27%), Positives = 55/126 (43%), Gaps = 5/126 (3%)
Query: 2 SKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALE-EGKRASEEGK 60
++ T EG E + EEG+ +EG+ G EGK +E EG R E +
Sbjct: 710 TEAEEVEHEGETEAEGTEDEGEIE-TGEEGEEVEDEGE-GEAEGKHEVETEGDRKETEHE 767
Query: 61 RGSK-EGKRASDEGK-RASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLDIIKC 118
++ EGK DEG+ +A E+G+ +EG E + +E G++ EG+
Sbjct: 768 GETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTE 827
Query: 119 RKCGEE 124
K
Sbjct: 828 VKDETG 833
Score = 31.5 bits (71), Expect = 0.41
Identities = 28/109 (25%), Positives = 43/109 (39%), Gaps = 6/109 (5%)
Query: 2 SKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKR 61
++ E+P ++ G E K A +G E + EG E +
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKG----ETEAEEVEHEGETEAEGTEDEGEIET 734
Query: 62 GSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASE-EGKRASEEGK 109
G EG+ DEG+ +E EG R E + +E EGK +EG+
Sbjct: 735 GE-EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGE 782
Score = 31.1 bits (70), Expect = 0.59
Identities = 20/95 (21%), Positives = 31/95 (32%), Gaps = 5/95 (5%)
Query: 14 PKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEG 73
E + + +R E+ G E K A +G E + EG
Sbjct: 670 EGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKG----ETEAEEVEHEGETEAEG 725
Query: 74 KRASEEGKRASEEGKRASEEGKRASEEGKRASEEG 108
E + EEG+ +EG+ +E EG
Sbjct: 726 TEDEGEIE-TGEEGEEVEDEGEGEAEGKHEVETEG 759
Score = 30.0 bits (67), Expect = 1.1
Identities = 25/82 (30%), Positives = 39/82 (47%), Gaps = 5/82 (6%)
Query: 31 GKRASEEGKRGSKEGKRALEEGKRASE-EGKRGSKEGKRASDEGKRASEEGKRASEEGKR 89
G + + G+R EEG+R +E EG+ G + G A EG+ +G+ SE
Sbjct: 628 GDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGE-TETKGENESEGEIP 686
Query: 90 ASEEGKRASE---EGKRASEEG 108
A +G++ E E K A +G
Sbjct: 687 AERKGEQEGEGEIEAKEADHKG 708
Score = 28.0 bits (62), Expect = 4.8
Identities = 28/102 (27%), Positives = 46/102 (45%), Gaps = 6/102 (5%)
Query: 17 GKRGLKEGKRASEEGKRASE-EGKRGSKEGK-RALEEGKRASEEGKRGSKEGKRASDEGK 74
EG R E + +E EGK EG+ +A E+G+ +EG G E + ++ G+
Sbjct: 752 KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811
Query: 75 RASEEGK----RASEEGKRASEEGKRASEEGKRASEEGKGVQ 112
+ EG+ E K + E + +E A ++ KGV
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 33.6 bits (77), Expect = 0.067
Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 8/88 (9%)
Query: 22 KEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGK 81
KE A E+ K+A E K+ + + K+A E +A+ K +A E KRA+ K
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAK------AKAEAEAKRAAAAAK 161
Query: 82 RASEEGKR--ASEEGKRASEEGKRASEE 107
+A+ E K+ +E K+A+ E K+ +E
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEA 189
Score = 32.9 bits (75), Expect = 0.12
Identities = 30/97 (30%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
Query: 15 KEGKRGLKEGKRASEEGKRASEEG---KRGSKEGKRALEEGKRASEEGKRGSKEGKRASD 71
E +R KE ++A E ++ + E K+ KE A E+ K+A E K+ + + K+A +
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136
Query: 72 EGKRASEEGK-RASEEGKRASEEGKRASEEGKRASEE 107
+A+ K +A E KRA+ K+A+ E K+ +E
Sbjct: 137 AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEA 173
Score = 29.8 bits (67), Expect = 1.1
Identities = 28/92 (30%), Positives = 51/92 (55%), Gaps = 4/92 (4%)
Query: 22 KEGKRASEEGKRASEEGKRGSKEGKR--ALEEGK-RASEEGKRGSKEGKRASDEGKRASE 78
E +R +E ++A E ++ + E +R LE+ + A E+ K+ + K+A+ + K+A E
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136
Query: 79 EGKRASEEGK-RASEEGKRASEEGKRASEEGK 109
+A+ K +A E KRA+ K+A+ E K
Sbjct: 137 AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168
Score = 29.4 bits (66), Expect = 1.5
Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 9/94 (9%)
Query: 18 KRGLKEGKRASEEGKRASEEGKRGSK-EGKRALEEGKRASEEGKRGSKEGKRASDEGKRA 76
K+ + K+A E +A+ K ++ E KRA K+A+ E K+ K ++ K+A
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK-----KAEAEAAKKA 179
Query: 77 SEEGKRASEE---GKRASEEGKRASEEGKRASEE 107
+ E K+ +E K A+E K+A E K+ +
Sbjct: 180 AAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAA 213
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
and secretion / Signal transduction mechanisms].
Length = 408
Score = 33.4 bits (76), Expect = 0.074
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 1/107 (0%)
Query: 1 MSKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGK 60
S +S++ G + +E + + EE EG + +E +L E A EE
Sbjct: 286 RSADSAKEIGLLI-EEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVS 344
Query: 61 RGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEE 107
+ E A++E EE + EE ++E A EE ASEE
Sbjct: 345 QLISEIAAATEEQTAVLEEINASIEELDDVTQENAAAVEELAAASEE 391
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 32.3 bits (73), Expect = 0.20
Identities = 21/114 (18%), Positives = 45/114 (39%), Gaps = 13/114 (11%)
Query: 15 KEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGK-RGSKEGKRASDEG 73
+E + + +A ++ A + + E ++ +E K + KE K+ ++
Sbjct: 220 EELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQ 279
Query: 74 KRASE----EGKRASEEGKRASE--------EGKRASEEGKRASEEGKGVQLDI 115
KR E E K+ EE +A + E K + +E + E + + +
Sbjct: 280 KREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333
Score = 29.2 bits (65), Expect = 2.1
Identities = 19/104 (18%), Positives = 46/104 (44%), Gaps = 9/104 (8%)
Query: 12 ITPKEGKRGLKEGKRASEEGKRAS-EEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRAS 70
E ++ +E K + +S +E K+ ++ KR +E +A E K+ +E +A
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE---KAQIEIKKNDEEALKAK 301
Query: 71 DEG-----KRASEEGKRASEEGKRASEEGKRASEEGKRASEEGK 109
D + + K A ++ A ++ + +E+ ++ + +
Sbjct: 302 DHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVE 345
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 31.4 bits (71), Expect = 0.26
Identities = 16/71 (22%), Positives = 32/71 (45%)
Query: 33 RASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRASE 92
RA K + KR+ ++ ++ E + KE +R E KR ++E KR G +
Sbjct: 148 RALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGG 207
Query: 93 EGKRASEEGKR 103
+ ++++
Sbjct: 208 QSGLSTKDEPP 218
Score = 30.2 bits (68), Expect = 0.80
Identities = 14/99 (14%), Positives = 34/99 (34%)
Query: 26 RASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASE 85
S+ A K ++ +E + S + + + KR E+ ++ E
Sbjct: 121 THSKVAHEAEIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKE 180
Query: 86 EGKRASEEGKRASEEGKRASEEGKGVQLDIIKCRKCGEE 124
+ ++ EE +R E+ + G + + +
Sbjct: 181 DRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPK 219
Score = 30.2 bits (68), Expect = 0.83
Identities = 20/82 (24%), Positives = 35/82 (42%)
Query: 26 RASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASE 85
RA K E KR K+ ++ E + +E +R +E KR +DE KR G +
Sbjct: 148 RALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGG 207
Query: 86 EGKRASEEGKRASEEGKRASEE 107
+ ++++ + K E
Sbjct: 208 QSGLSTKDEPPKEKRQKHHDPE 229
Score = 28.3 bits (63), Expect = 2.9
Identities = 19/89 (21%), Positives = 40/89 (44%), Gaps = 2/89 (2%)
Query: 19 RGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASE 78
R L K E KR+ ++ ++ E + +E +R +E KR + E KR G +
Sbjct: 148 RALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGG 207
Query: 79 EGKRASEEGKRASEEGKRASEEGKRASEE 107
+ ++++ +E ++ + +R E
Sbjct: 208 QSGLSTKDEPP--KEKRQKHHDPERRLEP 234
Score = 27.5 bits (61), Expect = 5.1
Identities = 17/84 (20%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 17 GKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRA 76
K + KR+ ++ ++ E + KE +R +E KR ++E KR G + + +
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLS 212
Query: 77 SEEGKRASEEGKRASEEGKRASEE 100
+++ +E ++ + +R E
Sbjct: 213 TKDEPP--KEKRQKHHDPERRLEP 234
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.8 bits (69), Expect = 0.63
Identities = 21/77 (27%), Positives = 35/77 (45%)
Query: 4 ESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGS 63
ES++ P +E LK KR E+ +EE +R + + + E + S G
Sbjct: 60 ESTKKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGE 119
Query: 64 KEGKRASDEGKRASEEG 80
EG+ S + + +EEG
Sbjct: 120 GEGEGESSDSRSVNEEG 136
Score = 28.1 bits (62), Expect = 4.3
Identities = 18/77 (23%), Positives = 33/77 (42%)
Query: 39 KRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRAS 98
K+ +E L+ KR E+ ++E +R + + + E + S EG+ S
Sbjct: 67 KKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGES 126
Query: 99 EEGKRASEEGKGVQLDI 115
+ + +EEG DI
Sbjct: 127 SDSRSVNEEGSSDPKDI 143
Score = 28.1 bits (62), Expect = 4.4
Identities = 17/70 (24%), Positives = 30/70 (42%)
Query: 32 KRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRAS 91
K+ EE K KR E+ +EE +R + + + + + S EG+ S
Sbjct: 67 KKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGES 126
Query: 92 EEGKRASEEG 101
+ + +EEG
Sbjct: 127 SDSRSVNEEG 136
>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
(chemotaxis sensory transducer). Thought to undergo
reversible methylation in response to attractants or
repellants during bacterial chemotaxis.
Length = 262
Score = 29.9 bits (68), Expect = 0.92
Identities = 29/81 (35%), Positives = 38/81 (46%)
Query: 27 ASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEE 86
A EE EEG +E ALEE + EE +E A+DE SEE A +E
Sbjct: 166 AMEESSSEVEEGVELVEETGDALEEIVDSVEEIADLVQEIAAATDEQAAGSEEVNAAIDE 225
Query: 87 GKRASEEGKRASEEGKRASEE 107
+ ++E SEE A+EE
Sbjct: 226 IAQVTQETAAMSEEISAAAEE 246
>gnl|CDD|204670 pfam11508, DUF3218, Protein of unknown function (DUF3218). This
family of proteins with unknown function appears to be
restricted to Pseudomonas.
Length = 213
Score = 30.0 bits (67), Expect = 0.96
Identities = 22/72 (30%), Positives = 30/72 (41%), Gaps = 10/72 (13%)
Query: 104 ASEEGKGVQLDII----KCRKCGEEVTQTRLESYLSALEFIEAQ-----LDRMKHTSYRT 154
A E GV + K R C E+ L S L+ I+A+ LDR K +
Sbjct: 66 AQEIANGVTIGFRMVNEKERVCNVEILPL-LTSVAQNLDRIKARFGSGYLDRFKGSPNVY 124
Query: 155 TLEVGFESIHSG 166
+VGF + SG
Sbjct: 125 PTDVGFSTDASG 136
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 30.3 bits (68), Expect = 1.0
Identities = 20/108 (18%), Positives = 43/108 (39%), Gaps = 1/108 (0%)
Query: 2 SKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEG-KRALEEGKRASEEGK 60
K ++ KE + +K+ + ++ KR SE+ K + R L + +++ +
Sbjct: 214 IKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDE 273
Query: 61 RGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEG 108
+ G R+S KR S+ ++ S ++ K + G
Sbjct: 274 KNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNG 321
Score = 27.6 bits (61), Expect = 6.7
Identities = 15/92 (16%), Positives = 31/92 (33%)
Query: 11 TITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRAS 70
T K+ ++ + G R+S KR S+ ++ S ++ K + G S
Sbjct: 266 TKENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNS 325
Query: 71 DEGKRASEEGKRASEEGKRASEEGKRASEEGK 102
+ +S ++ R + K
Sbjct: 326 NNDDTSSSNHLGSNRISNRNPSSPYKKQTTTK 357
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 29.5 bits (67), Expect = 1.5
Identities = 11/52 (21%), Positives = 25/52 (48%)
Query: 63 SKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQLD 114
E K+ + ++A ++ ++ E A ++ + A+++ K E K V D
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 28.4 bits (64), Expect = 3.3
Identities = 13/59 (22%), Positives = 28/59 (47%)
Query: 2 SKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGK 60
+E G ++P E K+ K+ ++A ++ ++ E K+ + A ++ K E K
Sbjct: 396 GEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Score = 28.4 bits (64), Expect = 4.2
Identities = 14/72 (19%), Positives = 34/72 (47%)
Query: 42 SKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEG 101
++ + E G + E K+ K+ ++A + ++ E A ++ + A+++ K E
Sbjct: 394 AEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453
Query: 102 KRASEEGKGVQL 113
K+ + G +L
Sbjct: 454 KKVDPDPLGEKL 465
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 29.4 bits (66), Expect = 1.8
Identities = 22/87 (25%), Positives = 34/87 (39%), Gaps = 2/87 (2%)
Query: 25 KRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRAS 84
K +AS G GS E E A+ +GSK+ K S K + E
Sbjct: 382 KEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDD 441
Query: 85 EEGKRASEEGKRASEEGKRASEEGKGV 111
EE ++GK+ ++G+ S +
Sbjct: 442 EEDAP--KKGKKNQKKGRDKSSKVPSD 466
Score = 27.5 bits (61), Expect = 8.5
Identities = 13/38 (34%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 14 PKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEE 51
PK+GK+ K+G+ S + S+ G G KE ++ E+
Sbjct: 446 PKKGKKNQKKGRDKSSKVPSDSKAG--GKKESVKSQED 481
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 29.1 bits (65), Expect = 2.1
Identities = 24/110 (21%), Positives = 45/110 (40%), Gaps = 3/110 (2%)
Query: 2 SKESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKR 61
++E V + K+ K S K +++ K K+ K A K
Sbjct: 184 TREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKK--AAKTAVSAKKA 241
Query: 62 GSKEGKRASDEGKRASEEGKRASEEGKRASEEGKR-ASEEGKRASEEGKG 110
K A K A + K+A++ K+A+++ + A++ K A++ KG
Sbjct: 242 AKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKG 291
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 29.0 bits (65), Expect = 2.3
Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 5/84 (5%)
Query: 30 EGKRASEEGKRGSKEGKRALEEGKRASE-EGKRGSKEGKRASDEGKRASEEGKRASEEGK 88
E K+ K +K K+ + + +RA+E E R +++ + A+ RA +RA+ + K
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARA----RRAAVKAK 386
Query: 89 RASEEGKRASEEGKRASEEGKGVQ 112
+ +E+ +EE KG
Sbjct: 387 KKGLIDASPNEDTPSENEESKGSP 410
>gnl|CDD|206779 cd11386, MCP_signal, Methyl-accepting chemotaxis protein (MCP),
signaling domain. Methyl-accepting chemotaxis proteins
(MCPs or chemotaxis receptors) are an integral part of
the transmembrane protein complex that controls
bacterial chemotaxis, together with the histidine kinase
CheA, the receptor-coupling protein CheW,
receptor-modification enzymes, and localized
phosphatases. MCPs contain a four helix trans membrane
region, an N-terminal periplasmic ligand binding domain,
and a C-terminal HAMP domain followed by a cytoplasmic
signaling domain. This C-terminal signaling domain
dimerizes into a four-helix bundle and interacts with
CheA through the adaptor protein CheW.
Length = 200
Score = 28.4 bits (64), Expect = 2.6
Identities = 28/95 (29%), Positives = 41/95 (43%), Gaps = 10/95 (10%)
Query: 23 EGKRASEEGK---------RA-SEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDE 72
E RA E G+ R +EE +KE + +EE + +EE +E +E
Sbjct: 90 EAARAGEAGRGFAVVADEVRKLAEESAEAAKEIEELIEEIQEQTEEAVEAMEETSEEVEE 149
Query: 73 GKRASEEGKRASEEGKRASEEGKRASEEGKRASEE 107
G EE RA EE + EE +E A++E
Sbjct: 150 GVELVEETGRAFEEIVASVEEVADGIQEISAATQE 184
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 28.6 bits (64), Expect = 2.8
Identities = 15/83 (18%), Positives = 39/83 (46%)
Query: 25 KRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRAS 84
++ + + +E ++ ++ E+ + A + ++ ++ A K+A + K+A
Sbjct: 56 QQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAE 115
Query: 85 EEGKRASEEGKRASEEGKRASEE 107
E+ K+A E + + E K +E
Sbjct: 116 EKQKQAEEAKAKQAAEAKAKAEA 138
Score = 27.1 bits (60), Expect = 8.1
Identities = 17/83 (20%), Positives = 39/83 (46%)
Query: 18 KRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRAS 77
++ K + +E ++ E+ +++ + A + ++ E+ K K+A K+A
Sbjct: 56 QQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAE 115
Query: 78 EEGKRASEEGKRASEEGKRASEE 100
E+ K+A E + + E K +E
Sbjct: 116 EKQKQAEEAKAKQAAEAKAKAEA 138
>gnl|CDD|184860 PRK14858, tatA, twin arginine translocase protein A; Provisional.
Length = 108
Score = 27.5 bits (61), Expect = 2.8
Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 19 RGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASE 78
RGL E K+A+++ K++ +E R ++E K E+ +E + + E K D+ +
Sbjct: 34 RGLAEFKKATDDFKQSMQEESRTAEE-KEKAEKLAETKKEAE--APEAKAEEDQAPKPKG 90
Query: 79 EGKRASEEGKRASEEGK 95
G+ + +A + K
Sbjct: 91 AGEPPATVASKAGDGAK 107
Score = 26.4 bits (58), Expect = 7.4
Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 5/64 (7%)
Query: 47 RALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASE 106
R L E K+A+++ K+ +E R ++E +E E K+ +E + +EE +
Sbjct: 34 RGLAEFKKATDDFKQSMQEESRTAEE-----KEKAEKLAETKKEAEAPEAKAEEDQAPKP 88
Query: 107 EGKG 110
+G G
Sbjct: 89 KGAG 92
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 28.8 bits (64), Expect = 2.8
Identities = 16/85 (18%), Positives = 27/85 (31%)
Query: 21 LKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEG 80
L R EG+ A +E R E + A E+ + G + G G
Sbjct: 393 LPRPPRVPVEGEEADDEAGDSVGTIFREAREQRAAEEQRRGGGRSGPGGGSRSGSVGGGG 452
Query: 81 KRASEEGKRASEEGKRASEEGKRAS 105
+R ++ EG+ +
Sbjct: 453 RRDGAGADGKPRPRRKPRVEGEADA 477
>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F. This family represents the
oligoendopeptidase F clade of the family of larger M3 or
thimet (for thiol-dependent metallopeptidase)
oligopeptidase family. Lactococcus lactis PepF
hydrolyzed peptides of 7 and 17 amino acids with fairly
broad specificity. The homolog of lactococcal PepF in
group B Streptococcus was named PepB (PMID:8757883),
with the name difference reflecting a difference in
species of origin rather activity; substrate profiles
were quite similar. Differences in substrate specificity
should be expected in other species. The gene is
duplicated in Lactococcus lactis on the plasmid that
bears it. A shortened second copy is found in Bacillus
subtilis [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 591
Score = 28.8 bits (65), Expect = 3.1
Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 16/87 (18%)
Query: 140 IEAQLDRMKHTSYRTTLEVGFES-----IHSGQSLVPESIQKSLMDIKYDVEKKLCSIRM 194
+ ++ T R TL FE I G+ L E +L +I ++ KK +
Sbjct: 440 LLERISNFFGTFTRQTLFAEFEYEAYELIEEGEPLTAE----TLNEIYANLLKKYFGDLV 495
Query: 195 SMKSSGLKEKKLVETYGIRPSGYYLPF 221
+ + L +R +Y+ F
Sbjct: 496 KID----EGAGLT---WMRIPHFYMGF 515
>gnl|CDD|219478 pfam07586, HXXSHH, Protein of unknown function (DUF1552). A family
of proteins identified in Rhodopirellula baltica.
Length = 298
Score = 28.0 bits (63), Expect = 3.7
Identities = 8/55 (14%), Positives = 22/55 (40%)
Query: 127 QTRLESYLSALEFIEAQLDRMKHTSYRTTLEVGFESIHSGQSLVPESIQKSLMDI 181
+ +L+ YL+++ IE +L++ + + G + + D+
Sbjct: 180 RAKLDEYLTSVREIEKRLEKAEAWLDVPKAKPDAPKSDPGDPEDFPEEIRLMYDL 234
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.4 bits (63), Expect = 3.9
Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 21 LKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEG 80
LK + A +E + E+ + ++E ++ EE ++ ++ ++ +E R K A+E+
Sbjct: 92 LKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQAR-----KAAAEQK 146
Query: 81 KRASEEGKRASEEGKR--ASEEGKRASEEGK 109
K+A +A+ E + A+ E K+ +EE
Sbjct: 147 KKAEAAKAKAAAEAAKLKAAAEAKKKAEEAA 177
Score = 28.0 bits (62), Expect = 5.3
Identities = 24/93 (25%), Positives = 43/93 (46%), Gaps = 3/93 (3%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSK---EGKRALEEGKRASEEGKRGSKEGKRASDE 72
E K+ +E +A+EE K +E K E K A E+ K +E + K+ + A++E
Sbjct: 168 EAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227
Query: 73 GKRASEEGKRASEEGKRASEEGKRASEEGKRAS 105
A ++ A + +A+ K A + A+
Sbjct: 228 KAAAEKKKAAAKAKADKAAAAAKAAERKAAAAA 260
Score = 27.2 bits (60), Expect = 8.8
Identities = 23/98 (23%), Positives = 49/98 (50%), Gaps = 5/98 (5%)
Query: 15 KEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEE--GKRASEEGKRGSKEGKRASDE 72
+E + L++ + ++E ++ +EE ++ ++ ++ EE K A+E+ K+ +A+ E
Sbjct: 100 QERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAE 159
Query: 73 GKR---ASEEGKRASEEGKRASEEGKRASEEGKRASEE 107
+ A+E K+A E K A E +A + E
Sbjct: 160 AAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197
>gnl|CDD|237546 PRK13889, PRK13889, conjugal transfer relaxase TraA; Provisional.
Length = 988
Score = 28.1 bits (63), Expect = 4.7
Identities = 17/94 (18%), Positives = 32/94 (34%), Gaps = 2/94 (2%)
Query: 16 EGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALE--EGKRASEEGKRGSKEGKRASDEG 73
E RG+ +G R + + + + A++ + RA
Sbjct: 759 EKLRGMFDGLRLPDPVPGPEAGRRPERESAAATTDAPARTVAADPEAALRQARTRALVRH 818
Query: 74 KRASEEGKRASEEGKRASEEGKRASEEGKRASEE 107
RA + R E+G + +E ++A EE
Sbjct: 819 ARAVDAIFRMQEQGGPVLPHQVKELQEARKAFEE 852
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 27.9 bits (62), Expect = 5.1
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 4/55 (7%)
Query: 49 LEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKR 103
LEE +R E+ + E +R ++E +R EE +A+ E RA + K E+ +
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEEQRRREEE--KAAMEADRA--QAKAEVEKRRE 301
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 27.6 bits (61), Expect = 5.8
Identities = 23/110 (20%), Positives = 43/110 (39%)
Query: 4 ESSEVPGTITPKEGKRGLKEGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGS 63
E S++ KE + E +RA+ K + + EG E ++
Sbjct: 33 EISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAK 92
Query: 64 KEGKRASDEGKRASEEGKRASEEGKRASEEGKRASEEGKRASEEGKGVQL 113
+ K A+ +A+ K+ E + +EE K A++ A+ + K L
Sbjct: 93 AKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAAL 142
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 27.8 bits (62), Expect = 5.8
Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 27 ASEEGKRASEEGKRGSK--EGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRAS 84
A+EEGK + + K +K EG+++ ++ K+ ++ G+ GK A D K E K
Sbjct: 13 AAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAIE 72
Query: 85 EE 86
E
Sbjct: 73 EH 74
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 27.6 bits (62), Expect = 6.5
Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 3/79 (3%)
Query: 34 ASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRASEEGKRASE- 92
A G+ SK + K E ++ +KE +A+ A ++++ +
Sbjct: 1 AESNGEPLSKNALKKRL--KAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDP 58
Query: 93 EGKRASEEGKRASEEGKGV 111
RA+ K + KGV
Sbjct: 59 TQYRANRLKKVEALRAKGV 77
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 27.5 bits (61), Expect = 7.1
Identities = 20/90 (22%), Positives = 30/90 (33%), Gaps = 5/90 (5%)
Query: 23 EGKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKR 82
E R E A E + R E + S + ++E +RA + +
Sbjct: 293 ETAREEERELDARTEALEREADALRTRLEALQGSP-AYQDAEELERARADAEALQAAAAD 351
Query: 83 A----SEEGKRASEEGKRASEEGKRASEEG 108
A E R EE +R EE R +
Sbjct: 352 ARQAIREAESRLEEERRRLDEEAGRLDDAE 381
Score = 27.5 bits (61), Expect = 8.9
Identities = 25/130 (19%), Positives = 46/130 (35%), Gaps = 9/130 (6%)
Query: 26 RASEEGKRASEEGKRGSK--EGKRALEEGKRASEEGKRGSKEGKRASDEG-KRASEEGKR 82
+ R E +GS + LE + +E + + + ++A E R EE +R
Sbjct: 310 EREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRR 369
Query: 83 ASEEGKRASEEGK--RASEEGKRASEEGKGVQL----DIIKCRKCGEEVTQTRLESYLSA 136
EE R + + RA+ E + E G+ + E L++
Sbjct: 370 LDEEAGRLDDAERELRAAREQLARAAERAGLSPAHTAEPDAALAAQELQELGALDARRQD 429
Query: 137 LEFIEAQLDR 146
+ + AQ
Sbjct: 430 ADRVIAQRSE 439
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 27.6 bits (61), Expect = 7.5
Identities = 21/84 (25%), Positives = 33/84 (39%)
Query: 25 KRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRASDEGKRASEEGKRAS 84
KR E + S SK + E AS+E + + +E + EE +
Sbjct: 405 KRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEE 464
Query: 85 EEGKRASEEGKRASEEGKRASEEG 108
EEG+ EE + ++ G EG
Sbjct: 465 EEGEDEEEEEEVEADNGSEEEMEG 488
>gnl|CDD|213932 TIGR04319, SerAla_Lrha_rpt, surface protein repeat Ser-Ala-175.
This serine and alanine-rich surface protein repeat,
about 175 amino acids long, occurs up to nine times in
surface proteins of some Lactobacillus strains,
particularly in Lactobacillus rhamnosus. Members
proteins have the N-terminal variant signal sequence
described by TIGR03715 and C-terminal LPXTG signals for
surface attachment by sortase.
Length = 175
Score = 26.8 bits (59), Expect = 8.7
Identities = 18/81 (22%), Positives = 27/81 (33%), Gaps = 6/81 (7%)
Query: 24 GKRASEEGKRASEEGKRGSKEGKRALEEGKRASEEGKRGSKEGKRAS------DEGKRAS 77
+AS+ AS + + +AL E +AS S A+ A
Sbjct: 93 ANQASKAASNASSLADKANSAASKALSEALQASSAAAIASSAASSAATLAGSLASANDAK 152
Query: 78 EEGKRASEEGKRASEEGKRAS 98
+ AS+ AS AS
Sbjct: 153 SDASAASDAASSASVVASSAS 173
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 27.2 bits (60), Expect = 9.6
Identities = 14/77 (18%), Positives = 23/77 (29%), Gaps = 5/77 (6%)
Query: 31 GKRASEEGKRGSKEGKR-ALEEGKRA-SEEGKRGSKEGKRASDEGKRASEEGKRASEEGK 88
R E G+ + + E R S R + R G+R + + +
Sbjct: 1 RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPN---R 57
Query: 89 RASEEGKRASEEGKRAS 105
G R+ R S
Sbjct: 58 YYRPRGDRSYRRDDRRS 74
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.305 0.125 0.332
Gapped
Lambda K H
0.267 0.0742 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,049,014
Number of extensions: 1025721
Number of successful extensions: 1621
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1134
Number of HSP's successfully gapped: 261
Length of query: 233
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 139
Effective length of database: 6,768,326
Effective search space: 940797314
Effective search space used: 940797314
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 57 (25.7 bits)