RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6121
(126 letters)
>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain. This family includes
the Thrombospondin N-terminal-like domain, a Laminin G
subfamily.
Length = 124
Score = 43.2 bits (102), Expect = 2e-06
Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 12 DGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRR 64
G DG D+L + ++DG + + L G + + + +D QWH +S+ R
Sbjct: 14 GGEDGLDFLALELEDGRLVLRYDL--GSGGSVLLLSGKKLNDGQWHRVSVSRD 64
>gnl|CDD|214598 smart00282, LamG, Laminin G domain.
Length = 132
Score = 42.7 bits (101), Expect = 5e-06
Identities = 13/52 (25%), Positives = 25/52 (48%), Gaps = 2/52 (3%)
Query: 12 DGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHR 63
G DYL + ++DG + L+ +G + + +D QWH +++ R
Sbjct: 19 GSKGGGDYLALELRDGRL--VLRYDLGSGPARLTSDPTPLNDGQWHRVAVER 68
>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are
usually Ca++ mediated receptors that can have binding
sites for steroids, beta1 integrins, heparin,
sulfatides, fibulin-1, and alpha-dystroglycans.
Proteins that contain LamG domains serve a variety of
purposes including signal transduction via cell-surface
steroid receptors, adhesion, migration and
differentiation through mediation of cell adhesion
molecules.
Length = 151
Score = 40.9 bits (96), Expect = 3e-05
Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 12 DGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHR 63
+G D+L + ++DG + + L G L + +D QWH +S+ R
Sbjct: 41 GSQNGGDFLALELEDGRLVLRYDLGSGSL--VLSSKT-PLNDGQWHSVSVER 89
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.1 bits (92), Expect = 2e-04
Identities = 19/52 (36%), Positives = 34/52 (65%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
K+ E++K+ + K +K EEEE E++K+E++++ EEEE +EE+ E
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 38.7 bits (91), Expect = 3e-04
Identities = 17/48 (35%), Positives = 33/48 (68%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E+ +K+RE++ +E++ + ++ EE++EE+KEK EEE+ +EE
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 36.8 bits (86), Expect = 0.001
Identities = 20/50 (40%), Positives = 38/50 (76%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
+++EKKK+ ++EE+E+ E+E+ EE+KEE++E+ EEE+ +EE ++
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 36.8 bits (86), Expect = 0.001
Identities = 21/54 (38%), Positives = 35/54 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
K EE+KK ++ K + ++ EEEE +E+KEE+KE+ EEE +E+ E + +
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 36.4 bits (85), Expect = 0.002
Identities = 17/52 (32%), Positives = 33/52 (63%)
Query: 69 TGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
TG K +K K+ +K E++ +E+ +E++ + +KKE+ EEEE +++ E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 34.1 bits (79), Expect = 0.013
Identities = 19/46 (41%), Positives = 34/46 (73%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
KE++KK K EEEE+E+ E++E E+++EE++ + E+EE +E+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 33.7 bits (78), Expect = 0.014
Identities = 17/41 (41%), Positives = 28/41 (68%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
KKK E++ E+E+ E+ +EEE EE +EEK+E+ E++
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 29.1 bits (66), Expect = 0.56
Identities = 15/36 (41%), Positives = 31/36 (86%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
+EEE++K ++++ +EEE+E+ EEE+ EE++++KK+
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 38.4 bits (89), Expect = 3e-04
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
G G + E+++ E++ EEEE+E+ EEEE EE+ EE
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 37.7 bits (87), Expect = 7e-04
Identities = 18/31 (58%), Positives = 25/31 (80%)
Query: 83 DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
D EEEE+E+ EEEE EE++EE++E+ E EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 37.7 bits (87), Expect = 7e-04
Identities = 17/31 (54%), Positives = 25/31 (80%)
Query: 87 EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+ E+E+ EEEE EE++EE++E+ EEEE EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 37.3 bits (86), Expect = 0.001
Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 65 IQEGTGCGKEEEKKKRRE---DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
E G K++EK D + EE+E+ EEEE EE++EE++E+ EEEE +E
Sbjct: 838 NAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 36.9 bits (85), Expect = 0.001
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 15/83 (18%)
Query: 53 DNQWHHLSIHRRIQEGTGC-GKEEEKKKRREDKVEEEEDEK--------------LEEEE 97
+ H + T + E++ E++ E ++DEK EEEE
Sbjct: 810 GEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEE 869
Query: 98 MEEQKEEKKEKVEEEEVKEERRE 120
EE++EE++E+ EEEE +EE E
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEE 892
Score = 29.2 bits (65), Expect = 0.58
Identities = 16/53 (30%), Positives = 25/53 (47%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
G E+E + ++ EE EDE E E + + E + ++ E E E EG
Sbjct: 725 GTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKED 777
Score = 28.4 bits (63), Expect = 1.3
Identities = 15/51 (29%), Positives = 19/51 (37%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
G E + E EE E E E E+ + E+ E E K E G
Sbjct: 633 GDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEG 683
Score = 28.0 bits (62), Expect = 1.4
Identities = 14/52 (26%), Positives = 24/52 (46%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
G E E++ E K E E + ++ E EQ+ E + + +E + K E
Sbjct: 664 GGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEV 715
Score = 28.0 bits (62), Expect = 1.7
Identities = 19/55 (34%), Positives = 28/55 (50%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
E G + E+ + ED+ E E + K E E ++KE + E E E KE+ EG
Sbjct: 727 EDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781
Score = 27.7 bits (61), Expect = 2.2
Identities = 13/36 (36%), Positives = 25/36 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
+EEE+++ E++ EEEE+E+ EE + + E ++K
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904
Score = 27.7 bits (61), Expect = 2.2
Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 1/56 (1%)
Query: 68 GTGCGKEEEKKKRREDKV-EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
G G+E E+ E + EE E +E E E + E + E E K E+ G
Sbjct: 642 GERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEG 697
Score = 26.9 bits (59), Expect = 3.8
Identities = 12/56 (21%), Positives = 22/56 (39%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
G+ E+ ++ E E + E E E E K E+ E E++ + +
Sbjct: 652 PTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707
Score = 26.5 bits (58), Expect = 4.7
Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 1/60 (1%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK-EKVEEEEVKEERREGGRGG 125
E E E + E +E E E EE E E + E + E E E + +R+E G
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 37.9 bits (88), Expect = 5e-04
Identities = 15/52 (28%), Positives = 33/52 (63%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
++ + + E EE + K +EE++++++ EKK+K + +E KE+R++ R
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 34.8 bits (80), Expect = 0.007
Identities = 13/67 (19%), Positives = 28/67 (41%)
Query: 52 DDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
DDN L++ + +E K++ K ++ + ++ E + E + E
Sbjct: 340 DDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAE 399
Query: 112 EEVKEER 118
+ KEE+
Sbjct: 400 AKAKEEK 406
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 37.2 bits (87), Expect = 0.001
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ R+ DK EEE+EK+ + EE++EE +EK EE++ +E +
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAK 299
Score = 32.6 bits (75), Expect = 0.034
Identities = 17/48 (35%), Positives = 32/48 (66%), Gaps = 6/48 (12%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV-KEERR 119
++EE ++++E+K +EE + KL + EEQ+ K+EE+E K+ R+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQR-----KLEEKERKKQARK 322
Score = 32.2 bits (74), Expect = 0.039
Identities = 16/47 (34%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
+EEE+K + EEE E+ +E++ E++KEE++ K+ + +E+R+
Sbjct: 266 EEEEEKILKAA--EEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRK 310
Score = 31.5 bits (72), Expect = 0.092
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE+ K E++ +EE EK EE++ EE++ + + EE+ K E +E
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKE 315
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 35.4 bits (81), Expect = 0.004
Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 3/48 (6%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
K+RE+ VE+ + E E++ E++E +KEK +E E +E RE R +
Sbjct: 579 KKREEAVEKAKREA--EQKAREEREREKEKEKERE-REREREAERAAK 623
Score = 31.2 bits (70), Expect = 0.13
Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 3/43 (6%)
Query: 78 KKRRE--DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
KKR E +K + E ++K EE E KE++KE+ E E + ER
Sbjct: 579 KKREEAVEKAKREAEQKAREERERE-KEKEKEREREREREAER 620
Score = 31.2 bits (70), Expect = 0.13
Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 73 KEE--EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKE 107
+EE EK KR ++ EE E+ +E+E E ++E ++E
Sbjct: 581 REEAVEKAKREAEQKAREEREREKEKEKERERERERE 617
Score = 29.3 bits (65), Expect = 0.56
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKE 103
++ +E K E ++K RE++ E+E EK E E E + E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 34.4 bits (79), Expect = 0.011
Identities = 22/44 (50%), Positives = 34/44 (77%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
EK+ +E EEEE+EK E++E EE+ +K+E+V+EEE KEE++
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71
Score = 33.6 bits (77), Expect = 0.015
Identities = 18/47 (38%), Positives = 36/47 (76%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+EE+++ +E+K EEEE +EEE++E++E++++K + ++VKE E
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83
Score = 33.2 bits (76), Expect = 0.025
Identities = 20/47 (42%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK---EKVEEEEVKEE 117
EEE++K + + EE+ +K EE + EE+KEEKK +KV+E + E
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Score = 32.4 bits (74), Expect = 0.044
Identities = 14/41 (34%), Positives = 28/41 (68%)
Query: 80 RREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ +K +E+E+ E+EE +E++E+ +K EE + +EE+ E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 30.5 bits (69), Expect = 0.19
Identities = 12/42 (28%), Positives = 29/42 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
+++E++++ DK EE ++E+ +EE+ ++ K+ K+ E E +
Sbjct: 46 EKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87
Score = 26.7 bits (59), Expect = 3.7
Identities = 16/43 (37%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEK-KEKVEEEEV 114
KEE+K++ + +EEE ++ EE+E +++K +K KE E E+
Sbjct: 44 KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 33.3 bits (76), Expect = 0.023
Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 71 CGKEEEKKKRREDKVEEEEDEKLEE-EEMEEQKEEKKEKVEEEEVKEERREG 121
CG E +R E VEE E ++E ++EE+ E++ EK+ +EE +R
Sbjct: 356 CGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVA 407
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 32.9 bits (76), Expect = 0.035
Identities = 20/45 (44%), Positives = 28/45 (62%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
EE ++ E K EE E E E+++E+ EEKKEK++EEE K
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEE 570
Score = 26.3 bits (59), Expect = 6.0
Identities = 21/53 (39%), Positives = 37/53 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
K +E+ + +++K++EEED+ LEE E E Q+ K+ K E +E+ +E R+ +GG
Sbjct: 548 KLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 32.8 bits (74), Expect = 0.038
Identities = 19/58 (32%), Positives = 33/58 (56%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ +E EE KK EDK + EE +K EE+E + + KKE E ++ +E +++
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710
Score = 32.4 bits (73), Expect = 0.052
Identities = 15/48 (31%), Positives = 32/48 (66%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K EE +K+ + +++E +E + EE+++++ E+K+K EE + EE +
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Score = 32.4 bits (73), Expect = 0.054
Identities = 19/48 (39%), Positives = 33/48 (68%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE E+KK+ E+ + EE+ K++ EE +++ EE K+K EE + EE ++
Sbjct: 1710 KEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Score = 31.6 bits (71), Expect = 0.086
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 5/49 (10%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKE-----KVEEEEVKE 116
K EE KK+ E+ + EEDEK E ++++ EE K+ K E EE K+
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 30.9 bits (69), Expect = 0.17
Identities = 19/45 (42%), Positives = 29/45 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K EE+KK+ E ++E +EK + EE+++ +EE K K EE K E
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
Score = 30.9 bits (69), Expect = 0.18
Identities = 18/45 (40%), Positives = 29/45 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
KE E+KK+ E+ + EE+ K++ E ++ EE K+K EE + EE
Sbjct: 1641 KEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685
Score = 30.5 bits (68), Expect = 0.19
Identities = 17/45 (37%), Positives = 30/45 (66%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
KE E+ K+ E+ ++E +EK + EE+++ +EE K K EE + + E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740
Score = 30.5 bits (68), Expect = 0.26
Identities = 18/58 (31%), Positives = 34/58 (58%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ E + EEKKK E K EEE++ EE ++ +E+KK+ E ++ +E+ ++
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Score = 30.1 bits (67), Expect = 0.27
Identities = 19/44 (43%), Positives = 30/44 (68%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
EE+KKK E K +EEE +K+ + EE+K+ ++ + E+E V EE
Sbjct: 1740 EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783
Score = 29.0 bits (64), Expect = 0.85
Identities = 17/45 (37%), Positives = 22/45 (48%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K EE KK E E E E E EE+ E ++K EE + K +
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Score = 29.0 bits (64), Expect = 0.87
Identities = 19/48 (39%), Positives = 30/48 (62%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K EE KK ED+ + E K E EE ++ +E KK++ EE++ EE ++
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Score = 28.6 bits (63), Expect = 0.89
Identities = 19/57 (33%), Positives = 36/57 (63%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
RI+E +EE+K K E K EE K EE + E++++K E+++++E +E+++
Sbjct: 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
Score = 28.6 bits (63), Expect = 1.0
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K EE KK+ ++ ++ E+ K EEE + + EE K++ EE++ K E
Sbjct: 1703 KAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE 1747
Score = 27.0 bits (59), Expect = 3.0
Identities = 13/48 (27%), Positives = 31/48 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K+ E+ K++ D++++ K + +E +++ EEKK+ E ++ EE ++
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 27.0 bits (59), Expect = 3.2
Identities = 15/47 (31%), Positives = 29/47 (61%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE KK E+K++ E+ K EE+ ++ +E KK + +E++ E ++
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Score = 26.6 bits (58), Expect = 4.3
Identities = 14/45 (31%), Positives = 33/45 (73%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K+ E+ K + +++++ E+EK + E++++++ E+K+K EE + EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657
Score = 26.6 bits (58), Expect = 4.4
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEE-QKEEKKEKVEEEEVKEERR 119
K EE KK E K EE+ + E ++ EE +K E+K+K EE + EE +
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576
Score = 26.3 bits (57), Expect = 5.4
Identities = 10/48 (20%), Positives = 31/48 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K++ ++ ++ D+ +++ +E + +E +++ EE K+K +E + E ++
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510
Score = 26.3 bits (57), Expect = 5.5
Identities = 19/57 (33%), Positives = 35/57 (61%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ E K EE KK E K++ EE +K EEE+ + ++ +KKE E+++ +E ++
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655
Score = 26.3 bits (57), Expect = 5.6
Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 4/55 (7%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEM----EEQKEEKKEKVEEEEVKEERREGGR 123
E KK E+ + EE +K E EE E +K E++ K++ EE K+E E +
Sbjct: 1690 AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744
Score = 26.3 bits (57), Expect = 5.6
Identities = 18/48 (37%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K EEKKK E K + EED+K + +E+++ KK+ E ++ EE+++
Sbjct: 1386 KAEEKKKADEAKKKAEEDKK-KADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 26.3 bits (57), Expect = 6.0
Identities = 16/48 (33%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KEE KKK K + EE +K +E +++ EE K+K +E + ++
Sbjct: 1373 KEEAKKKADAAKKKAEEKKK--ADEAKKKAEEDKKKADELKKAAAAKK 1418
Score = 26.3 bits (57), Expect = 6.8
Identities = 17/50 (34%), Positives = 32/50 (64%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
EEE K + E+ +E E++K + EE ++ +EEKK+ ++ +E++ E R
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
Score = 26.3 bits (57), Expect = 7.2
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEE-----EEMEEQKEEKKEKVEEEEVKEE 117
K EEKKK E K + EE +K +E EE +++ + K+K EE + E
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346
Score = 25.9 bits (56), Expect = 7.5
Identities = 16/47 (34%), Positives = 32/47 (68%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE KK E+K++ EE +K EE+ ++ +E KK++ E++++ ++E
Sbjct: 1719 EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKE 1765
Score = 25.9 bits (56), Expect = 8.6
Identities = 15/47 (31%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
K EE KK E K + EE +K + + +++ EE K+ E + + E
Sbjct: 1310 KAEEAKKADEAKKKAEEAKK-KADAAKKKAEEAKKAAEAAKAEAEAA 1355
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 32.6 bits (74), Expect = 0.039
Identities = 20/49 (40%), Positives = 33/49 (67%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+EEKKK++E EE +D K +EE E++ ++KEK +E++V+E R
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160
Score = 29.1 bits (65), Expect = 0.58
Identities = 15/48 (31%), Positives = 33/48 (68%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE + + +E++ E+E+ ++ ++++ E+ KEE K++ +EE KE+R
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Score = 28.3 bits (63), Expect = 1.3
Identities = 14/51 (27%), Positives = 34/51 (66%), Gaps = 1/51 (1%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
++E+ K+ ++ K +E+ E+ ++ + +E+ +EK+ E+E+ KE++ E R
Sbjct: 109 EKEQVKEEKK-KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158
Score = 28.3 bits (63), Expect = 1.3
Identities = 17/46 (36%), Positives = 29/46 (63%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
K +E+ K R+ K E +E +E+E E++K+ ++ + EEE K ER
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER 168
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.6 bits (75), Expect = 0.041
Identities = 12/45 (26%), Positives = 24/45 (53%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K+ KK+R+ +K E+E+ + + + + KK K + E K+
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 31.9 bits (73), Expect = 0.041
Identities = 17/48 (35%), Positives = 30/48 (62%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ ++E + EEEE+ ++EE + EEQ +E EK + +E+RRE
Sbjct: 91 RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRE 138
Score = 29.9 bits (68), Expect = 0.19
Identities = 12/48 (25%), Positives = 30/48 (62%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE+E+++ E +VEE ++E+ +E +E++ + K + E ++++
Sbjct: 99 KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146
Score = 29.6 bits (67), Expect = 0.28
Identities = 15/46 (32%), Positives = 30/46 (65%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
++K+K E++ E E +E EEE+++E E++ K++ E+ +E R
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141
Score = 27.3 bits (61), Expect = 1.8
Identities = 14/46 (30%), Positives = 30/46 (65%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
+EEE++ E+ EEE+ ++L E+E+ + K EK+ + E ++ + +
Sbjct: 103 EEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
Score = 26.9 bits (60), Expect = 2.1
Identities = 12/49 (24%), Positives = 28/49 (57%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
G ++++K+ E++ E E+ EE+ E ++E + E+ + ER++
Sbjct: 95 GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.5 bits (74), Expect = 0.043
Identities = 14/60 (23%), Positives = 35/60 (58%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
R Q+ K E++++++++ + EE ++ + E E K+ +KE++ +E K++ E +
Sbjct: 66 RQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
Score = 29.0 bits (65), Expect = 0.62
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 6/54 (11%)
Query: 73 KEEEKKKRREDKVEE------EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE++KK+ + + EE E E+L++ E E ++++K EE K+ +
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
Score = 29.0 bits (65), Expect = 0.66
Identities = 9/43 (20%), Positives = 27/43 (62%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
++K+ ++++++ E E+L +E ++Q EE ++ ++ + E
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE 135
Score = 27.5 bits (61), Expect = 2.3
Identities = 12/44 (27%), Positives = 24/44 (54%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
E+E+ K+ E + +++K + EE +Q K+++ EE K
Sbjct: 98 AEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141
Score = 26.3 bits (58), Expect = 4.9
Identities = 11/44 (25%), Positives = 18/44 (40%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+E+KK+ E + +K EE + K K E E +
Sbjct: 114 QEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAA 157
Score = 26.3 bits (58), Expect = 5.4
Identities = 10/44 (22%), Positives = 20/44 (45%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E+E+ +E K + EE K + ++ +E + + K E
Sbjct: 107 EKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE 150
Score = 25.5 bits (56), Expect = 9.7
Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 74 EEEKKKRRED----KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E KK+ E K E +K E E + E K+K E E K+ E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAE 214
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 32.3 bits (74), Expect = 0.048
Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 12/65 (18%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEE------------EMEEQKEEKKEKVEE 111
I +G +EE ++ +E + EED+K +EE + E+ +E +K+
Sbjct: 491 TITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPA 550
Query: 112 EEVKE 116
EE ++
Sbjct: 551 EEKEK 555
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 32.4 bits (74), Expect = 0.049
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
+ E+ ED+ EE+ ++ EEEE E+ +EE + E+ + G
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDS 416
Score = 29.7 bits (67), Expect = 0.31
Identities = 15/52 (28%), Positives = 31/52 (59%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
K +K++ R D ++ EE ++ E+EE E++ +E +E+ E+ +E + G
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404
Score = 28.2 bits (63), Expect = 1.2
Identities = 10/48 (20%), Positives = 21/48 (43%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
EEE+++ E + EE ED + E + E + + + + +
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423
Score = 28.2 bits (63), Expect = 1.4
Identities = 14/47 (29%), Positives = 26/47 (55%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E+E++++R D+ EEEE E EEE + +++ E + E +
Sbjct: 374 EDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420
Score = 27.0 bits (60), Expect = 2.9
Identities = 11/54 (20%), Positives = 19/54 (35%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
E+E ++ + E EE+E + EE Q E +V +
Sbjct: 371 DEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKES 424
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 32.1 bits (74), Expect = 0.053
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 84 KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+++ EE+ +E+ME ++ E+ E++ K E
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 29.0 bits (66), Expect = 0.71
Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 2/71 (2%)
Query: 47 NRLRFDDNQ--WHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
+L ++ W L + GT K E R + + E ++ E ++ EE
Sbjct: 474 EQLGLEEELTSWESLLEWGGLPAGTKVAKGEPLFPRIDVEEEIAYIKEQMEGSAPKEPEE 533
Query: 105 KKEKVEEEEVK 115
K++K E+ E+
Sbjct: 534 KEKKPEKPEIT 544
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 32.3 bits (73), Expect = 0.055
Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 6/54 (11%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK-----EERREGG 122
EE R++ V +++DE +E++ +E+ +E++E+ EEEE+K +E EGG
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEE-EEEEIKGFDDEDEEDEGG 193
Score = 30.4 bits (68), Expect = 0.22
Identities = 13/39 (33%), Positives = 29/39 (74%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+D+ E+E+D+ E++E EE++EE+ + ++E+ ++E E
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194
Score = 26.2 bits (57), Expect = 6.6
Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 5/87 (5%)
Query: 34 KLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKL 93
K +G LDM I R + + + ++E + +D E++E+E+
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDD-----DDEDEDEDDDDEEDDEEEEE 175
Query: 94 EEEEMEEQKEEKKEKVEEEEVKEERRE 120
EEEE++ +E +E E+ E+ E
Sbjct: 176 EEEEIKGFDDEDEEDEGGEDFTYEKSE 202
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 32.0 bits (73), Expect = 0.056
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 65 IQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
I + + E+ R+D+ EE+E+E+ EE+E E KE ++EE +E+ E
Sbjct: 372 ILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGP---SKEHSDDEEFEEDDVESKYE 428
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 31.6 bits (72), Expect = 0.057
Identities = 11/46 (23%), Positives = 32/46 (69%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EEK+K + K + ++ + ++++ +++K++K EK +E+E +++ +
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 28.9 bits (65), Expect = 0.60
Identities = 10/50 (20%), Positives = 31/50 (62%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+ +KKK ++ K ++++ + ++++ E+K+EK+ + + E++ + E
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132
Score = 26.6 bits (59), Expect = 3.0
Identities = 11/38 (28%), Positives = 24/38 (63%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVE 110
K+++KK ++D E++DEK E+++E+ + E +
Sbjct: 97 KDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134
Score = 25.4 bits (56), Expect = 8.2
Identities = 11/50 (22%), Positives = 28/50 (56%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
+ + K+++K K ++D ++++ EK +E+E E++ E+ + E
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132
Score = 25.4 bits (56), Expect = 8.8
Identities = 10/46 (21%), Positives = 32/46 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
KE E+K++ + K ++ + +K ++++ ++ K++ K + ++E+ E++
Sbjct: 76 KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 31.8 bits (72), Expect = 0.067
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 3/71 (4%)
Query: 46 PNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR---EDKVEEEEDEKLEEEEMEEQK 102
R RF + + S+ E + G ++K+RR + +EDE +E+ + +
Sbjct: 327 LLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYE 386
Query: 103 EEKKEKVEEEE 113
+E+K + EE++
Sbjct: 387 KERKRRREEDK 397
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.8 bits (72), Expect = 0.070
Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 3/51 (5%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE---KKEKVEEEEVKEERRE 120
E D+++E++DEK +++ E KE+ KKE+V + V +E
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE 55
Score = 26.8 bits (59), Expect = 3.4
Identities = 14/44 (31%), Positives = 26/44 (59%), Gaps = 4/44 (9%)
Query: 78 KKRREDKVEEEEDEKLEEEEMEEQKE--EKKEKV--EEEEVKEE 117
+ RRE + + + +++E ++QK E KEKV ++EEV +
Sbjct: 4 RNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTD 47
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.8 bits (72), Expect = 0.074
Identities = 18/39 (46%), Positives = 27/39 (69%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ EEE E+ EEEE EE++EE++ + EE E +EE E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEE 475
Score = 31.0 bits (70), Expect = 0.12
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
EEE+ E++ EEEE+E+ +E E EE ++E++E+ E + E G
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488
Score = 31.0 bits (70), Expect = 0.13
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 71 CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
+EE ++ E++ EEEE+E+ EEE E +EE++E + +EE G
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492
Score = 31.0 bits (70), Expect = 0.15
Identities = 18/42 (42%), Positives = 29/42 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
+E ++ + EEEE+E+ EEEE E ++EE +++ EEEEV
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476
Score = 30.7 bits (69), Expect = 0.16
Identities = 20/54 (37%), Positives = 33/54 (61%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
+E E+++ E++ EEEE+E+ EE+E EE++ E +E+ EE E E G
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGS 489
Score = 29.9 bits (67), Expect = 0.31
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
E +EEE+++ E++ E+E +E+ E+E EE++ E EEE
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEM 486
Score = 29.1 bits (65), Expect = 0.63
Identities = 18/58 (31%), Positives = 30/58 (51%)
Query: 68 GTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
+ E +E + EE +E+ EEEE EE++E++ E+ E E+ +EE G
Sbjct: 424 ASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481
Score = 29.1 bits (65), Expect = 0.66
Identities = 17/58 (29%), Positives = 32/58 (55%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ + G+ + ++ E E+E+ EEEE EE+++E +E+ E+E +EE E
Sbjct: 420 DPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
Score = 28.3 bits (63), Expect = 0.99
Identities = 20/56 (35%), Positives = 31/56 (55%)
Query: 66 QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
QE EE+++ E++ EEE++ + EE E EE++EE + EE E EG
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492
Score = 27.2 bits (60), Expect = 2.9
Identities = 18/52 (34%), Positives = 32/52 (61%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
EEE+++ E++ EE+E E+ E E+ EE++E + + EEE++ G G
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497
Score = 26.0 bits (57), Expect = 8.0
Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 74 EEEKKKRREDKVEEEEDEKL----EEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
EE++ + E + EEEE+E EEEME E + E EE E R G
Sbjct: 458 EEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 31.6 bits (73), Expect = 0.081
Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 12/65 (18%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEE------------EMEEQKEEKKEKVEE 111
I +G EE ++ ++ + EED+K +E + E+ +E +KV
Sbjct: 493 TITASSGLSDEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPA 552
Query: 112 EEVKE 116
+E ++
Sbjct: 553 DEKEK 557
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 31.6 bits (71), Expect = 0.083
Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 78 KKRREDKVEEEEDEKLEEEEMEEQKEEK--KEKVEEEEVKEERREGGRGGR 126
+K+R +++E EE E+LE E ME + E+ +E++E E ++ +R E R R
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDR 506
Score = 30.9 bits (69), Expect = 0.17
Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 67 EGTGCGKEEEKKKRREDKV-EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
E KE +K E K+ E++ E+LE EE E + E+ E++E E ++ ER E R
Sbjct: 436 ERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERER 493
Score = 25.9 bits (56), Expect = 7.3
Identities = 18/75 (24%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 48 RLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR--EDKVEEEEDEKLEEEEMEEQKEEK 105
R R + H ++ +I E + E +++ R +++E E E+LE E +E ++ E
Sbjct: 437 RARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLE- 495
Query: 106 KEKVEEEEVKEERRE 120
++++E + + RE
Sbjct: 496 RDRLERDRLDRLERE 510
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 31.6 bits (72), Expect = 0.085
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
EEE K ++++++K E + + +K++KK+K ++E E
Sbjct: 44 EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSE 86
Score = 30.5 bits (69), Expect = 0.21
Identities = 10/37 (27%), Positives = 24/37 (64%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
KK ++EDK E + + ++++ +++K+E K + E +
Sbjct: 55 KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 31.2 bits (70), Expect = 0.090
Identities = 17/45 (37%), Positives = 30/45 (66%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+E + +RRE+K++E ED++ +E E EEQ E++ E E+ E+
Sbjct: 139 EERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQ 183
Score = 28.1 bits (62), Expect = 1.2
Identities = 15/46 (32%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
++E+ +K R+++ EE +E++E E EE+ +E++++ E+E +EE+
Sbjct: 124 RQEQIEKARQER--EELEERMEWERREEKIDEREDQEEQEREREEQ 167
Score = 25.8 bits (56), Expect = 6.6
Identities = 20/60 (33%), Positives = 38/60 (63%), Gaps = 4/60 (6%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKL--EEEEMEEQK--EEKKEKVEEEEVKEERR 119
RI++ K+++ KK++ + +E+ EK E EE+EE+ E ++EK++E E +EE+
Sbjct: 102 RIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQE 161
Score = 25.4 bits (55), Expect = 9.6
Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ EEK RED+ EE+E E+ E+ E+ + + E +E++E + E +
Sbjct: 146 RREEKIDEREDQ-EEQEREREEQTIEEQSDDSEHEIIEQDESETESDD 192
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 31.2 bits (71), Expect = 0.093
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 81 REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E++ E EE E+ EE+ ++ K+ E +E +E RR
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.0 bits (71), Expect = 0.096
Identities = 10/41 (24%), Positives = 25/41 (60%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+KKK+++ K E +EK ++ + E+++E ++ V+ +
Sbjct: 84 KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAA 124
>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
Length = 424
Score = 31.2 bits (72), Expect = 0.11
Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
+E + +EE + LEEEE+EE+ K E+ EE++ R E G
Sbjct: 316 DELLYAVAELLEETPEFPLEEEEVEEEVYYKFEE-EEKDFTITRDEDGV 363
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 31.2 bits (71), Expect = 0.11
Identities = 16/38 (42%), Positives = 25/38 (65%)
Query: 83 DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
D EE + EK E+E ++EQK +K K +EE+ K+E +
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 28.8 bits (65), Expect = 0.70
Identities = 11/39 (28%), Positives = 27/39 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
KEE ++++ E + +E+ + ++ EE+K+++ EK+E+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 28.8 bits (65), Expect = 0.77
Identities = 12/38 (31%), Positives = 25/38 (65%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E + E+EE E L+E++ + ++++EK ++E K E+
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 28.5 bits (64), Expect = 0.91
Identities = 12/43 (27%), Positives = 24/43 (55%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
+ +++KR ++EE +K E E++E+ K E + +E K
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDK 603
Score = 28.5 bits (64), Expect = 1.1
Identities = 10/37 (27%), Positives = 28/37 (75%)
Query: 84 KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K++++E+ + E+EE E KE+K+ + +++ +++++E
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582
Score = 26.9 bits (60), Expect = 3.5
Identities = 10/41 (24%), Positives = 26/41 (63%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
+E+E+K+ +++ + +K EE++ +E ++ +K K+ E
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 31.1 bits (70), Expect = 0.11
Identities = 17/52 (32%), Positives = 24/52 (46%)
Query: 70 GCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
EEE++ D EEEEDE++ + E E E+ EEE + E
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGED 224
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 31.0 bits (71), Expect = 0.12
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
++ + + EEE EE++EE++E+ EEE
Sbjct: 289 KEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 29.4 bits (67), Expect = 0.43
Identities = 15/44 (34%), Positives = 23/44 (52%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
+D ++EE E L + EE++E+ EEEE +E E G
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAAG 324
Score = 27.5 bits (62), Expect = 1.9
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
K +++++E + + EE++EE++E+ EEE +EE G
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAAG 324
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.4 bits (71), Expect = 0.12
Identities = 15/49 (30%), Positives = 27/49 (55%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
G E + ++ EEE + K ++EE ++KE KK K ++E K + +
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQA 49
Score = 28.3 bits (63), Expect = 1.4
Identities = 14/54 (25%), Positives = 32/54 (59%), Gaps = 6/54 (11%)
Query: 73 KEEEKKKRREDKVEEEEDEKLE------EEEMEEQKEEKKEKVEEEEVKEERRE 120
+E E+KK++E+K +E+E +KL+ + +++ Q+ V ++ K+ R+
Sbjct: 16 EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKR 69
Score = 26.0 bits (57), Expect = 7.9
Identities = 12/55 (21%), Positives = 31/55 (56%)
Query: 68 GTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
G+ E EKK E+++E ++ ++ + +E E +K + +K + +++ ++ G
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDG 55
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.0 bits (70), Expect = 0.12
Identities = 16/82 (19%), Positives = 30/82 (36%), Gaps = 2/82 (2%)
Query: 45 RPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQK-E 103
R NRLR W +H E + EK + + K E + + E E+ + +
Sbjct: 309 RRNRLRMIITPWRAPELHAENAEIKK-TRTAEKNEAKARKKEIAQKRRAAEREINREARQ 367
Query: 104 EKKEKVEEEEVKEERREGGRGG 125
E+ + + + + G
Sbjct: 368 ERAAAMARARARRAAVKAKKKG 389
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 30.8 bits (70), Expect = 0.14
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 87 EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
E +D KL EE E+ E EEE + E++E +
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194
Score = 27.7 bits (62), Expect = 1.3
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 81 REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
+D EE E E KEE+ + E++E K
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193
Score = 26.9 bits (60), Expect = 2.5
Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 1/36 (2%)
Query: 74 EEEKKKRREDKVEEE-EDEKLEEEEMEEQKEEKKEK 108
++ K E E EEE E +K+E K+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 30.8 bits (69), Expect = 0.15
Identities = 13/53 (24%), Positives = 30/53 (56%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
G + K + + + + ++ +E E+K+ ++E+ EEE +E ++E G+G
Sbjct: 188 GSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQG 240
Score = 30.0 bits (67), Expect = 0.24
Identities = 12/49 (24%), Positives = 24/49 (48%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+ + K ++E+ +E ++E K E+K+ EE+E +E E
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEA 233
Score = 27.7 bits (61), Expect = 1.7
Identities = 16/60 (26%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 59 LSIHRRIQEGTGCGKEEEKKKRREDKVEE-EEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
LS+ +Q K+E+ + ++ ++E + +EK +EE EE++ E++ K EE + ++
Sbjct: 184 LSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 30.6 bits (69), Expect = 0.16
Identities = 33/102 (32%), Positives = 45/102 (44%), Gaps = 10/102 (9%)
Query: 20 LNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKK 79
L+VA +DG AI L D Y P+ DD RI+ EEE+ +
Sbjct: 34 LDVAARDGDDAIELAEPETSDDPYGNPDPFGEDDEG--------RIEVRISEDGEEEEVE 85
Query: 80 RREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
E+ +EEE+ + E EE E + +E E E EE EG
Sbjct: 86 EGEE--DEEEEGEEESEEFEPLGDTPEELTEASEQLEEHEEG 125
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 30.6 bits (69), Expect = 0.18
Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 2/49 (4%)
Query: 74 EEEKKKRRED--KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE K K+ + E E EK +EE ++Q EE+ + E K++ E
Sbjct: 122 EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Score = 29.8 bits (67), Expect = 0.33
Identities = 13/45 (28%), Positives = 21/45 (46%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
KE E++ E ++ E + EE ++Q EE K K E +
Sbjct: 92 KELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136
Score = 28.6 bits (64), Expect = 0.74
Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 3/50 (6%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEE---EMEEQKEEKKEKVEEEEVKEERRE 120
+E +KK + E E+ E+ E+E++ +K + E+ ++ E
Sbjct: 67 QERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEE 116
Score = 27.9 bits (62), Expect = 1.4
Identities = 14/48 (29%), Positives = 20/48 (41%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E +KK E K + E K + EE + + E K K E + E
Sbjct: 169 AEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216
Score = 27.9 bits (62), Expect = 1.7
Identities = 11/48 (22%), Positives = 28/48 (58%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE+E++K+ E + EE E ++ E+ +++ E++ + + E+ +
Sbjct: 65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Score = 26.7 bits (59), Expect = 3.6
Identities = 11/58 (18%), Positives = 29/58 (50%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
R +E E+ K+ E ++ E+++ + EE + ++ + + E E +++ +E
Sbjct: 89 ARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKE 146
Score = 26.7 bits (59), Expect = 3.8
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E EKK + E K + EE+ K + ++K + +K E E K +
Sbjct: 138 AEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAK 182
Score = 26.0 bits (57), Expect = 7.3
Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 3/46 (6%)
Query: 74 EEEKKKRR--EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
EEE K + E K + E +K E E + K E K K + EE K +
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAE-AKAKAEAKAKAKAEEAKAK 197
Score = 25.6 bits (56), Expect = 7.9
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 4/48 (8%)
Query: 74 EEEKKKR----REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E K + E K +EE ++ EEE + E K+K E + K E
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.5 bits (69), Expect = 0.18
Identities = 17/52 (32%), Positives = 28/52 (53%)
Query: 66 QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+E E+++ K +K +EE +E+ EEE+ E E +KE E+ E E
Sbjct: 68 EEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVE 119
Score = 28.9 bits (65), Expect = 0.59
Identities = 18/47 (38%), Positives = 28/47 (59%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
KEEEK+ + E++ D + E+EE EE+ EE+ E+ +E KE
Sbjct: 65 VKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111
Score = 26.6 bits (59), Expect = 3.0
Identities = 14/46 (30%), Positives = 26/46 (56%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
++E + E K EE+E E++E + E++ E+ EEE +E+
Sbjct: 54 SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Score = 26.2 bits (58), Expect = 5.4
Identities = 16/55 (29%), Positives = 30/55 (54%)
Query: 66 QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+E E+K+ + + + E+EE E+ EEE EE +E +++ EE+ +E
Sbjct: 67 EEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121
Score = 25.9 bits (57), Expect = 7.2
Identities = 13/48 (27%), Positives = 28/48 (58%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++++ E+ EEE++ E++ ++ EK+++ EEE +EE E
Sbjct: 54 SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.1 bits (68), Expect = 0.19
Identities = 12/45 (26%), Positives = 24/45 (53%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
EEE ++ D+ E+E+ + + E E++K E + E+E +
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 29.7 bits (67), Expect = 0.28
Identities = 12/40 (30%), Positives = 25/40 (62%)
Query: 81 REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
RE ++E+E+ EEE +E ++E + + E ++++RE
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128
Score = 28.2 bits (63), Expect = 0.86
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 83 DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
EEEE E+ +E E++ + + +E+ K E
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREV 129
Score = 26.6 bits (59), Expect = 2.9
Identities = 14/57 (24%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+I +EE++ E E E+++ E E KE+KK +V + + ++E+ +
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQED--PPETKTESKEKKKREVPKPKTEKEKPK 140
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 30.4 bits (69), Expect = 0.20
Identities = 11/46 (23%), Positives = 23/46 (50%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE+++ + ++E + K ++E E EE EE+ ++ E
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE 278
Score = 28.0 bits (63), Expect = 1.3
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
K+ ++ E++ EE + E E + K+E++ EE+
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEED 263
Score = 28.0 bits (63), Expect = 1.6
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KK + + EE E+E + E+E E K K E+E EE
Sbjct: 224 KKLKETSETEEREEET--DVEIETTSETKGTKQEQEGSTEEDPS 265
Score = 27.7 bits (62), Expect = 2.0
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 90 DEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
E E EE EE+ + + E E + ++ +EG
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEG 258
Score = 27.7 bits (62), Expect = 2.1
Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 74 EEEKKKRREDKVEEEEDEKL--EEEEMEEQKEEKKEKVEEEEVKEE 117
E K ++E + EED L EE+E ++ E+ + +E K+E
Sbjct: 247 SETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292
Score = 26.5 bits (59), Expect = 4.9
Identities = 11/45 (24%), Positives = 22/45 (48%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+ E + E K ++E E EE+ EEK++ + E++ +
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Score = 26.1 bits (58), Expect = 5.6
Identities = 9/32 (28%), Positives = 21/32 (65%)
Query: 88 EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E+ ++ + + E+K +K+EK E+ + +E+ R
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679
>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain.
Length = 131
Score = 29.6 bits (67), Expect = 0.21
Identities = 13/60 (21%), Positives = 27/60 (45%), Gaps = 3/60 (5%)
Query: 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGT 69
D+L + ++DG + ++ L G +R + +D +WH + + R + GT
Sbjct: 12 YNGTQTERDFLALELRDGRLEVSYDLGSGAA--VVRSGD-KLNDGKWHSVELERNGRSGT 68
>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor.
Length = 456
Score = 30.4 bits (68), Expect = 0.21
Identities = 10/38 (26%), Positives = 19/38 (50%)
Query: 81 REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
R + +EE DEK ++++ E + +VE + E
Sbjct: 51 RSCQGQEEPDEKTQDQQSLSDVERAEPEVEASRGRGEG 88
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.7 bits (69), Expect = 0.21
Identities = 14/48 (29%), Positives = 25/48 (52%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
+E ++ + E+DEK+ E+ EE +E +E E+ +E E G
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQG 4091
Score = 29.2 bits (65), Expect = 0.56
Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 3/53 (5%)
Query: 65 IQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
++ GTG G + ED+ LEE EE + + E E + E
Sbjct: 3812 LESGTGLGSGVGAEDITNTL---NEDDDLEELANEEDTANQSDLDESEARELE 3861
Score = 25.7 bits (56), Expect = 9.5
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
E+ +K ED EE E E E + +E+ E+ E E
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 30.5 bits (69), Expect = 0.21
Identities = 14/43 (32%), Positives = 23/43 (53%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
E E+ +R E+ E L E+E E++ E ++ + EEEE
Sbjct: 407 EAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 27.1 bits (60), Expect = 2.6
Identities = 13/47 (27%), Positives = 21/47 (44%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E +R E++ + E E+ E E E+ EE E + E+ E
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFE 432
Score = 26.3 bits (58), Expect = 5.4
Identities = 15/44 (34%), Positives = 22/44 (50%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E E K E + EE+E L+ E E + E+ E+ EE+ E
Sbjct: 379 EAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422
Score = 25.9 bits (57), Expect = 6.5
Identities = 15/48 (31%), Positives = 21/48 (43%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+E E++ E EED +L E + E + E EE EE E
Sbjct: 400 REAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447
Score = 25.5 bits (56), Expect = 8.7
Identities = 13/47 (27%), Positives = 22/47 (46%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
E + ED E + EEEE +++ E++ + E+ E EE
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARL 420
Score = 25.5 bits (56), Expect = 8.7
Identities = 12/44 (27%), Positives = 23/44 (52%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
EE ++ E+++E E+ E+ EE+ + E E+E + E
Sbjct: 395 EEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELE 438
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 30.4 bits (68), Expect = 0.23
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 83 DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
D V EEE+E EE+ E ++E + V+E+ +EE E
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175
Score = 30.0 bits (67), Expect = 0.29
Identities = 17/34 (50%), Positives = 25/34 (73%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
E++VE EEDE+ E+E + +EK E+ EEEE+K
Sbjct: 143 EEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELK 176
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 30.4 bits (69), Expect = 0.23
Identities = 16/29 (55%), Positives = 23/29 (79%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
EEEE+E+ E E EE++EE++E+ EEEV
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERTFEEEV 334
Score = 29.2 bits (66), Expect = 0.53
Identities = 16/43 (37%), Positives = 27/43 (62%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
+++RR E +E+ EEEE ++EE++E+ EE +EE R
Sbjct: 293 RRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVR 335
Score = 28.5 bits (64), Expect = 1.1
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E E+E+ EEEE+ E++EE++E+ E +E R
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRA 336
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 30.2 bits (68), Expect = 0.24
Identities = 14/45 (31%), Positives = 29/45 (64%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E+K+++ E+K EE + E+++EE EE+K E E+ ++ ++
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 29.8 bits (67), Expect = 0.26
Identities = 17/62 (27%), Positives = 32/62 (51%)
Query: 62 HRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
IQ K + KK R +D + EE E + E+K +KK+ +++E K++++E
Sbjct: 103 LMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162
Query: 122 GR 123
+
Sbjct: 163 KK 164
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 30.2 bits (69), Expect = 0.27
Identities = 15/47 (31%), Positives = 22/47 (46%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EEE+ ++ E K E EE+ K E EE + E++ K E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEA 85
Score = 29.0 bits (66), Expect = 0.69
Identities = 12/47 (25%), Positives = 18/47 (38%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EEE +K + EEE + EE + EE+ + E
Sbjct: 40 EEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAA 86
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 29.6 bits (67), Expect = 0.28
Identities = 15/45 (33%), Positives = 30/45 (66%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
EEK ++ E++VEE E E+ E+E+EE+ E++ + E + ++ +
Sbjct: 1 MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.0 bits (68), Expect = 0.30
Identities = 11/44 (25%), Positives = 29/44 (65%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+ + +D +E++DE ++++++++ EEKKE E E++ ++
Sbjct: 141 DLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184
Score = 30.0 bits (68), Expect = 0.34
Identities = 11/45 (24%), Positives = 28/45 (62%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
E++ + +D V++E++EK E +E+E+ ++ +E++ + R
Sbjct: 154 EDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 29.8 bits (67), Expect = 0.31
Identities = 11/36 (30%), Positives = 16/36 (44%)
Query: 71 CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK 106
C + R D ++ E +E EME +E KK
Sbjct: 534 CWSLRNSTEERIDLDADDWTEDEDENEMETDEERKK 569
Score = 28.3 bits (63), Expect = 1.1
Identities = 8/28 (28%), Positives = 16/28 (57%)
Query: 88 EEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
EE L+ ++ E ++E + + +EE K
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKK 569
Score = 28.3 bits (63), Expect = 1.1
Identities = 6/28 (21%), Positives = 13/28 (46%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
EE D ++ +E + E + E ++
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKK 569
>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
Length = 177
Score = 29.4 bits (66), Expect = 0.32
Identities = 16/44 (36%), Positives = 23/44 (52%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ + ++VEEE + EE +EE EE EK E E E E
Sbjct: 2 SEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADE 45
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 29.7 bits (67), Expect = 0.33
Identities = 15/49 (30%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 73 KEEEKKKR----REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+E E+ +R + +K EEE K + + E +EE++ EE +++EE
Sbjct: 378 REAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEE 426
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.8 bits (67), Expect = 0.33
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 7/61 (11%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEK-------KEKVEEEEVKEERREGGRGG 125
K + KK ED+ E+EE E + E +EE++ E+ K++ E+EEV G R G
Sbjct: 294 KRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRG 353
Query: 126 R 126
R
Sbjct: 354 R 354
Score = 28.3 bits (63), Expect = 1.1
Identities = 12/49 (24%), Positives = 25/49 (51%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+E E++ + D + E+E + + +E ++E K E + +EE E
Sbjct: 241 KEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEE 289
Score = 27.1 bits (60), Expect = 3.2
Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E+E K E EEE EE E++K ++ +K+ E+E ++E E
Sbjct: 269 EDEDEPKPSGERSDSEEE-----TEEKEKEKRKRLKKMMEDEDEDEEME 312
Score = 26.3 bits (58), Expect = 5.4
Identities = 13/49 (26%), Positives = 23/49 (46%), Gaps = 1/49 (2%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K +EKK+++E ++E EEE + + E E + E+ E
Sbjct: 224 KKTKEKKEKKEASESTVKEES-EEESGKRDVILEDESAEPTGLDEDEDE 271
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.7 bits (68), Expect = 0.34
Identities = 14/44 (31%), Positives = 34/44 (77%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
++ E ++RE+++E++E E ++++ E+KEE+ E++ EE+++E
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 27.4 bits (62), Expect = 1.9
Identities = 13/48 (27%), Positives = 33/48 (68%), Gaps = 1/48 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K E++ ++E+ ++ + E LE+ E E +K+EK+ + +++E++++ E
Sbjct: 86 KLEKRLLQKEENLDRK-LELLEKREEELEKKEKELEQKQQELEKKEEE 132
Score = 26.7 bits (60), Expect = 3.9
Identities = 18/51 (35%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKE---EKKEKVEEEEVKEERRE 120
++EE R+ + +E+ E+E ++E+ EQK+ EKKE+ EE ++E+ +E
Sbjct: 93 QKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 26.3 bits (59), Expect = 5.0
Identities = 14/52 (26%), Positives = 29/52 (55%), Gaps = 4/52 (7%)
Query: 73 KEEEKKKRREDKVEEEE---DEKLEEEEMEEQKEEKKEKVE-EEEVKEERRE 120
K +E ++ + +EE + + +E +E ++E K + E E+E++E R E
Sbjct: 32 KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE 83
Score = 25.5 bits (57), Expect = 8.8
Identities = 15/45 (33%), Positives = 29/45 (64%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E++ + R +++++ E L++EE ++K E EK EEE K+E+
Sbjct: 74 EKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 29.6 bits (67), Expect = 0.35
Identities = 15/44 (34%), Positives = 29/44 (65%), Gaps = 2/44 (4%)
Query: 72 GKEEEKKKRREDKVE--EEEDEKLEEEEMEEQKEEKKEKVEEEE 113
G +EE + +K E EEED+ E E+ +E+ E+++E+ +++E
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 29.3 bits (66), Expect = 0.36
Identities = 14/50 (28%), Positives = 29/50 (58%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
K E ++ + E +E++K +EE++E ++ EK+E +E+ E E +
Sbjct: 9 KHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58
Score = 26.6 bits (59), Expect = 3.4
Identities = 13/42 (30%), Positives = 28/42 (66%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+K+ ++ K E E++ +E E +E+ + K+E +E EE+++E
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKE 42
Score = 26.6 bits (59), Expect = 3.4
Identities = 19/46 (41%), Positives = 26/46 (56%)
Query: 71 CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
C KE E K+ + K E+ E E++E+EE+ E EE E EE E
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 29.3 bits (66), Expect = 0.36
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
E +K RE V E + K EE +++E E+K E+ E+ V++ R
Sbjct: 40 ETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEAREDAVRKSR 84
Score = 25.5 bits (56), Expect = 7.8
Identities = 18/49 (36%), Positives = 27/49 (55%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
GK E K + E+ +++ E E+L E ++EE K K E+ EE RE
Sbjct: 29 GKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 29.3 bits (66), Expect = 0.40
Identities = 19/48 (39%), Positives = 33/48 (68%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ EE+KK E + E ++E+ EEE E++KEE++ K EE+ ++E+ E
Sbjct: 27 EREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74
Score = 27.7 bits (62), Expect = 1.5
Identities = 21/47 (44%), Positives = 36/47 (76%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EEE+++ R+ E+ E E+ EEEE+EE++E+KKE+ E +E +E+ R+
Sbjct: 24 EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70
Score = 27.0 bits (60), Expect = 2.8
Identities = 13/46 (28%), Positives = 32/46 (69%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
EE+++ R+++ E EE+ + ++EE E ++ E++ + E+EE ++ +
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLK 79
Score = 26.2 bits (58), Expect = 4.7
Identities = 17/46 (36%), Positives = 31/46 (67%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ ++++RE + EE E+ K EE+ E +++E++E EE E K+E E
Sbjct: 15 QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Score = 26.2 bits (58), Expect = 4.9
Identities = 18/48 (37%), Positives = 34/48 (70%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K EEK+ RR+ + EEE+ + ++ E+++ E+KE+ E EE +E+++E
Sbjct: 10 KLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKE 57
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 29.5 bits (67), Expect = 0.41
Identities = 9/33 (27%), Positives = 15/33 (45%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
R +EEE + EE EE++E + +
Sbjct: 308 YLRRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 29.0 bits (65), Expect = 0.51
Identities = 13/43 (30%), Positives = 26/43 (60%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+EE ++ E + EEEE +L ++ EE+++ K K ++ + E
Sbjct: 144 QEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDE 186
Score = 26.3 bits (58), Expect = 4.4
Identities = 17/44 (38%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 77 KKKRREDKVEEEEDEKLE-EEEMEEQKEEKKEKVEEEEVKEERR 119
K K + + +EE +E LE E+E EEQ+ +K EEE+ +R+
Sbjct: 135 KNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 29.0 bits (65), Expect = 0.53
Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 73 KEEEKKKRREDKV--EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+++ +K + E +KL +E +E+K++ K+K +V +
Sbjct: 39 QDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQLFV 89
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 28.9 bits (65), Expect = 0.54
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
E +K+ +E E E+ E++E +++KE KKEK E+++ KE+
Sbjct: 132 SETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 27.8 bits (62), Expect = 1.2
Identities = 18/50 (36%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 72 GKEEEKKKRREDKVEEEED-EKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE K +E +VEEEE EK +++E++++K+EKK+K E+ + ++
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Score = 26.2 bits (58), Expect = 4.7
Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+ + E+E K+E+E E ++EEKKEK +++EVK+E++E
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEA-EVEEEEKKEKKKKKEVKKEKKEK 170
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.4 bits (64), Expect = 0.58
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 88 EEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
EE+ EE++EE++++ EEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 26.5 bits (59), Expect = 2.5
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 85 VEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
EE++ EE++EE+KE+ EEE
Sbjct: 69 AAAAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 25.7 bits (57), Expect = 4.3
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 94 EEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
E+K+E++E+ EE+E EE G G
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEEAAAGLGA 103
Score = 24.9 bits (55), Expect = 9.9
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEE 104
EE+ E+ EEEE +E+ EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEE 95
>gnl|CDD|218274 pfam04801, Sin_N, Sin-like protein conserved region. Family of
higher eukaryotic proteins. SIN was identified as a
protein that interacts specifically with SXL (sex
lethal) in a yeast two-hybrid assay. The interaction is
mediated by one of the SXL RNA binding domains.
Length = 422
Score = 28.9 bits (65), Expect = 0.58
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
++K+K+ ED +E+ED E EE +Q K + E E+ ++ R
Sbjct: 142 KDKRKKEEDTADEDEDPDEEAEEELKQVTVKFSRPETEKQRKRRE 186
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 28.2 bits (63), Expect = 0.61
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEE 112
+EK EE+E E++KEE+KE+ EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 25.5 bits (56), Expect = 5.0
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 94 EEEEMEEQKEEKKEKVEEEEVKEERREG 121
EE+ EE++EEKK++ E+EE +EE G
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 25.2 bits (55), Expect = 6.4
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 95 EEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
EE+ EEK+E+ ++EE KEE E G
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 24.8 bits (54), Expect = 9.2
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 87 EEEDEKLEEEEMEEQKEEKKEKVEEEE 113
E+ EE+ EE+K+E++++ EEEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 28.8 bits (64), Expect = 0.62
Identities = 12/50 (24%), Positives = 21/50 (42%), Gaps = 1/50 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
KEE K + + + ++ L+ E E K K + +E + EE
Sbjct: 146 KEEAKAAAKAAALAKAKEASLKSAE-AELKASKGKSIEVNVIGEEDEGEK 194
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.8 bits (64), Expect = 0.66
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 6/61 (9%)
Query: 66 QEGTGCGKEEEKKKRREDKVEE------EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
Q+ + E+++KK+ E EE E E+L++ E E K ++++K EE K+ +
Sbjct: 70 QQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQL 129
Query: 120 E 120
E
Sbjct: 130 E 130
Score = 27.6 bits (61), Expect = 2.0
Identities = 14/44 (31%), Positives = 28/44 (63%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E+E+ K +E + + EE EK + E ++Q+E+ ++ E++ K E
Sbjct: 107 EKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAE 150
Score = 26.5 bits (58), Expect = 4.8
Identities = 11/43 (25%), Positives = 28/43 (65%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
E+E+ K+ E + + ++++ + EE E+Q + ++++ EE+ K
Sbjct: 99 EQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKA 141
Score = 26.1 bits (57), Expect = 5.6
Identities = 14/48 (29%), Positives = 19/48 (39%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E KKK E EE + E ++K E + K E+ K E
Sbjct: 166 AAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEA 213
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 28.9 bits (65), Expect = 0.73
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 71 CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
C ++K E++ E+EE+E+ EEE E + EE
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 28.5 bits (64), Expect = 0.85
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 81 REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
R V E+ + EEEE E+++EE++E+ EE EE
Sbjct: 338 RCTWVNPEQKD--EEEEQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 27.7 bits (62), Expect = 1.6
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 3/39 (7%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEV---KEERREG 121
E EE+E+ EE ++++E +E+ EEE KEE G
Sbjct: 169 EGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207
Score = 26.9 bits (60), Expect = 2.9
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
E++DE+ E+E+ EE++EE++ + E E
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376
Score = 26.6 bits (59), Expect = 4.4
Identities = 15/33 (45%), Positives = 20/33 (60%)
Query: 85 VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
VE E E+ EEEE + +E+ E EEEE + E
Sbjct: 165 VELREGEEEEEEEEVGEADEEDEGEEEEEEEPE 197
Score = 25.8 bits (57), Expect = 7.8
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
EEEE+E+ EEE E + E+ E EV+E
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRE 318
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 28.7 bits (65), Expect = 0.78
Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEE----MEEQKEEKKEKVEEEEVKE 116
K+ E+ + K +E + K E EE +E+ +E+ +K+ E + K+
Sbjct: 511 KDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKK 558
Score = 26.8 bits (60), Expect = 3.5
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 7/51 (13%)
Query: 74 EEEKKKRREDKVEEEEDEKL---EEEEM----EEQKEEKKEKVEEEEVKEE 117
++K +E K+ L E E M EE E K++ E E K E
Sbjct: 482 AKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNE 532
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.9 bits (65), Expect = 0.81
Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 76 EKKKR--REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE--VKEERR 119
K+ R ++ E + + E + +K++KKEK +EEE +E+ R
Sbjct: 719 GLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.5 bits (64), Expect = 1.00
Identities = 13/43 (30%), Positives = 25/43 (58%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
GK+ E K + + ++ +K E+++ EE K E+K ++E E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEA 772
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 28.5 bits (64), Expect = 0.82
Identities = 8/34 (23%), Positives = 17/34 (50%)
Query: 87 EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E +D K +EE E+++E ++ ++ E
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVE 191
Score = 27.7 bits (62), Expect = 1.5
Identities = 6/35 (17%), Positives = 18/35 (51%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
++K++ ++ +E ++ ++ E KK+K
Sbjct: 162 AKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 26.2 bits (58), Expect = 4.1
Identities = 9/39 (23%), Positives = 18/39 (46%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
E + K++E+ EE E L + +K E +++
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 28.5 bits (64), Expect = 0.83
Identities = 17/60 (28%), Positives = 32/60 (53%), Gaps = 3/60 (5%)
Query: 61 IHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+ R ++ G + + KKR ++ E +EK EE+++Q+E K +EE + ER+
Sbjct: 34 MDAREKKIAGELADADTKKREAEQERREYEEKN--EELDQQREVLLTKA-KEEAQAERQR 90
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 28.1 bits (63), Expect = 0.84
Identities = 14/49 (28%), Positives = 33/49 (67%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
G+++ K+K R DK++ +++E +E+E EE+ + + E+E + E+++
Sbjct: 64 GRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKK 112
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.5 bits (63), Expect = 0.85
Identities = 16/48 (33%), Positives = 28/48 (58%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+EEEK++ RE++ E EE E + + E + + +E +EE+ E E
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEE 159
Score = 26.2 bits (57), Expect = 6.6
Identities = 19/49 (38%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 74 EEEKKKRREDK--VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE KKKR E + +EEEE + +EE + +EE++++ +EE++ R E
Sbjct: 217 EELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAE 265
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 28.7 bits (64), Expect = 0.86
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 6/56 (10%)
Query: 75 EEKKKRREDK-----VEEE-EDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
+E KK RE++ ++E ED + EEE+ E++ + + EEEE K +R G
Sbjct: 259 DEYKKIREERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLR 314
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 28.3 bits (63), Expect = 0.87
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E+EK+ ++ + E ++ EE EE+KEE+ E V+++EV +R
Sbjct: 65 EKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKR 110
Score = 27.5 bits (61), Expect = 1.7
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
KK E + E+E ++ + + EEQK E +E +E+E E RE
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPERED 77
Score = 26.7 bits (59), Expect = 3.6
Identities = 15/55 (27%), Positives = 31/55 (56%)
Query: 66 QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+E +E K + E++ E E+ + E+E E ++E+ E+ E+EE+ +E +
Sbjct: 39 KEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEK 93
Score = 25.6 bits (56), Expect = 8.7
Identities = 16/51 (31%), Positives = 32/51 (62%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
K +E++++ E + E+E E E E + ++++KE++ +EE KEE E +
Sbjct: 51 KSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVK 101
>gnl|CDD|148593 pfam07074, TRAP-gamma, Translocon-associated protein, gamma subunit
(TRAP-gamma). This family consists of several
eukaryotic translocon-associated protein, gamma subunit
(TRAP-gamma) sequences. The translocation site
(translocon), at which nascent polypeptides pass through
the endoplasmic reticulum membrane, contains a component
previously called 'signal sequence receptor' that is now
renamed as 'translocon-associated protein' (TRAP). The
TRAP complex is comprised of four membrane proteins
alpha, beta, gamma and delta which are present in a
stoichiometric relation, and are genuine neighbors in
intact microsomes. The gamma subunit is predicted to
span the membrane four times.
Length = 170
Score = 28.2 bits (63), Expect = 0.92
Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 6/40 (15%)
Query: 80 RREDKVEEE------EDEKLEEEEMEEQKEEKKEKVEEEE 113
RRE+ V E +D+K+ +E +E+ KK +V + E
Sbjct: 80 RREEAVTREVNRKLADDKKMSRKEKDERILWKKNEVADYE 119
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 28.4 bits (64), Expect = 0.94
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 74 EEEKKKRREDKVEEEEDE-KLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E E KK+ +K+ E+E + E+ + E+++E K+EE + K E E
Sbjct: 315 ERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLE 362
Score = 27.7 bits (62), Expect = 1.6
Identities = 16/48 (33%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEE-EEMEEQKEEKKEKVEEEEVKEER 118
K EK + +E ++EE E +EE +E+ + EE ++K+E+ E + E+
Sbjct: 320 KKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 28.5 bits (63), Expect = 0.96
Identities = 18/56 (32%), Positives = 30/56 (53%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
E G + ++ R + EEEEDE +EE E E+ E +E+ +++E E+ E
Sbjct: 89 ELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144
Score = 27.3 bits (60), Expect = 2.1
Identities = 21/64 (32%), Positives = 35/64 (54%)
Query: 59 LSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
L+I + +E G KEEE+ + E+ E EE E +EEE +++ EK ++ E + +E
Sbjct: 93 LNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDEN 152
Query: 119 REGG 122
G
Sbjct: 153 ELAG 156
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 28.5 bits (64), Expect = 1.0
Identities = 7/43 (16%), Positives = 21/43 (48%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+E ++ + +E +E+ E ++E+ + +EE+ +
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK 43
Score = 26.2 bits (58), Expect = 6.2
Identities = 11/55 (20%), Positives = 23/55 (41%), Gaps = 1/55 (1%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
KE K E E EE++ E + ++E+ ++ +++ E + E
Sbjct: 12 AKELRKLTEDEKLAEAEEEKA-EYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65
Score = 25.8 bits (57), Expect = 7.3
Identities = 8/52 (15%), Positives = 20/52 (38%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
E ++++ E +EE +KL+ E ++ + + + E
Sbjct: 24 LAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEE 75
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 28.0 bits (62), Expect = 1.1
Identities = 15/40 (37%), Positives = 28/40 (70%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
EEE + ++VEE+E+E + E+ ++++E+ E+VEEE
Sbjct: 20 EEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 28.1 bits (63), Expect = 1.2
Identities = 18/43 (41%), Positives = 29/43 (67%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
++E+K D E EE EK EEEE EE++ E++ ++EEE+ +
Sbjct: 2 SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQE 44
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 28.1 bits (62), Expect = 1.3
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 6/56 (10%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE------EVKEERREGG 122
EEE+ + E+ E EED+K E++ + +E+K K +EE +VK+ GG
Sbjct: 241 AEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRNIAGG 296
Score = 27.7 bits (61), Expect = 1.6
Identities = 17/33 (51%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
K DKV EEE E +EE EE +E+KKE +E+
Sbjct: 234 KNDGDKVAEEE-EFELDEEHEEAEEDKKEALEK 265
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.0 bits (63), Expect = 1.3
Identities = 18/52 (34%), Positives = 32/52 (61%)
Query: 66 QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
G G EEE++ +D+ EE++D+ LEEEE + +++E E+E+ +E
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366
Score = 26.1 bits (58), Expect = 6.2
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 82 EDKVEEEED--EKLEEEEMEEQKEEKKEKVEEEEVKEER 118
+D EEEED EEE+ E++ + ++ EEEE ++E+
Sbjct: 340 DDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378
Score = 26.1 bits (58), Expect = 6.8
Identities = 14/47 (29%), Positives = 30/47 (63%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EEE ED+ ++++D + EEE+++ EE+ E+ E+ + +++ E
Sbjct: 325 EEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 27.8 bits (62), Expect = 1.3
Identities = 11/41 (26%), Positives = 27/41 (65%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
EKK + + + +E+++ +EEE EE++EE ++ ++++
Sbjct: 154 SMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
Score = 27.4 bits (61), Expect = 1.7
Identities = 16/42 (38%), Positives = 27/42 (64%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
E+K K E + +EEDEK EEEE EE++E++ ++++
Sbjct: 155 MLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Score = 26.3 bits (58), Expect = 4.3
Identities = 16/58 (27%), Positives = 27/58 (46%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
I + G K KR+ EEE++ E+ M E+K ++ E + +E E+ E
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEE 178
Score = 25.9 bits (57), Expect = 6.8
Identities = 15/45 (33%), Positives = 31/45 (68%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
EEE+ + + E++ ++LE E+++E+ E+ +E+ EEEE ++E
Sbjct: 143 TEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 28.0 bits (62), Expect = 1.3
Identities = 16/80 (20%), Positives = 30/80 (37%), Gaps = 3/80 (3%)
Query: 40 LDMYIRPNRLRFDD---NQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEE 96
+D+ NR++ + + + + E E+ E + E E DE + EE
Sbjct: 306 VDLERNGNRIKELADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEE 365
Query: 97 EMEEQKEEKKEKVEEEEVKE 116
E + E + E + E
Sbjct: 366 VAEMELLEVQVDDLAERLAE 385
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 28.0 bits (63), Expect = 1.4
Identities = 10/36 (27%), Positives = 18/36 (50%), Gaps = 3/36 (8%)
Query: 78 KKRREDKVEE--EEDEKLEEEEMEEQKEEKKEKVEE 111
K +R + + + EE+E+ E E EE + +E
Sbjct: 264 KAKRAEILAQRAEEEEESSEGAAETI-EEPELDLET 298
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 27.9 bits (62), Expect = 1.5
Identities = 10/49 (20%), Positives = 20/49 (40%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
E+ + + K +E++D+ EEE + EE++ E
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269
>gnl|CDD|176483 cd08365, APC10-like1, APC10-like DOC1 domains of E3 ubiquitin
ligases that mediate substrate ubiquitination. This
model represens the APC10-like DOC1 domain of
multi-domain proteins present in E3 ubiquitin ligases.
E3 ubiquitin ligases mediate substrate ubiquitination
(or ubiquitylation), a component of the ubiquitin-26S
proteasome pathway for selective proteolytic
degradation. APC10/DOC1 domains such as those present
in HECT (Homologous to the E6-AP Carboxyl Terminus) and
Cullin-RING (Really Interesting New Gene) E3 ubiquitin
ligase proteins, HECTD3, and CUL7, respectively, are
also included here. CUL7 is a member of the Cullin-RING
ligase family and functions as a molecular scaffold
assembling a SCF-ROC1-like E3 ubiquitin ligase complex
consisting of Skp1, CUL7, Fbx29 F-box protein, and ROC1
(RING-box protein 1) and promotes ubiquitination. CUL7
is a multi-domain protein with a C-terminal cullin
domain that binds ROC1 and a centrally positioned
APC10/DOC1 domain. HECTD3 contains a C-terminal HECT
domain which contains the active site for ubiquitin
transfer onto substrates, and an N-terminal APC10/DOC1
domain which is responsible for substrate recognition
and binding. An APC10/DOC1 domain homolog is also
present in HERC2 (HECT domain and RLD2), a large
multi-domain protein with three RCC1-like domains
(RLDs), additional internal domains including zinc
finger ZZ-type and Cyt-b5 (Cytochrome b5-like
Heme/Steroid binding) domains, and a C-terminal HECT
domain. Recent studies have shown that the protein
complex HERC2-RNF8 coordinates ubiquitin-dependent
assembly of DNA repair factors on damaged chromosomes.
Also included in this hierarchy is an uncharacterized
APC10/DOC1-like domain found in a multi-domain protein,
which also contains CUB, zinc finger ZZ-type, and
EF-hand domains. The APC10/DOC1 domain forms a
beta-sandwich structure that is related in architecture
to the galactose-binding domain-like fold; their
sequences are quite dissimilar, however, and are not
included here.
Length = 131
Score = 27.5 bits (61), Expect = 1.5
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 8 CWLGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRL 49
W DGS GS ++ + +K + +L LA+ D P R+
Sbjct: 28 YWQSDGSQGSHWIRLKMKPDVLVRHLSLAVDATDSSYMPQRV 69
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 27.9 bits (62), Expect = 1.6
Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 8/38 (21%)
Query: 84 KVEEEEDEKLEEEEMEEQK--------EEKKEKVEEEE 113
K EE ++LEE E E+ +E+KE++E EE
Sbjct: 77 KEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEE 114
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 26.8 bits (60), Expect = 1.6
Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
Query: 82 EDKVEEEEDEKLE--EEEMEEQKEEKKEKVEEEEVKEERRE 120
+D +E E E E+ E +K EKKEK E++++EE +E
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEK-LEKKLEEELKE 87
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 28.0 bits (63), Expect = 1.7
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 8/48 (16%)
Query: 76 EKKKRREDKVEEEEDEKLEE-----EEMEEQKEEKKEKVE--EEEVKE 116
EK +R + +EE E E+LEE EE +EQ+EE + + E EEEV E
Sbjct: 348 EKIERYQADLEELE-ERLEEQNEVVEEADEQQEENEARAEAAEEEVDE 394
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 27.7 bits (61), Expect = 1.7
Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+EE K ++ K + + E E+ E KE + +K + E V E+
Sbjct: 293 NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQK-KREPVAED 336
Score = 26.5 bits (58), Expect = 4.7
Identities = 15/45 (33%), Positives = 24/45 (53%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K E +K + E K +EE K ++ + + K+E K +E E KE
Sbjct: 280 KREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 27.4 bits (61), Expect = 1.8
Identities = 9/30 (30%), Positives = 20/30 (66%)
Query: 87 EEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
EE EK ++E +E +E++ K ++E+ ++
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEK 121
Score = 26.6 bits (59), Expect = 3.1
Identities = 8/34 (23%), Positives = 22/34 (64%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
EE +++ +E E +++E + +E++ K+E++
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 25.5 bits (56), Expect = 7.9
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
+K ++E ++ EE+E + K+EKKEK E++
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 25.5 bits (56), Expect = 8.0
Identities = 13/44 (29%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
EE +K +++ +E E+++ + + E+ KE+K++KV E+ K++
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEK-KEKKEKKVAEKLAKKK 134
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 27.7 bits (62), Expect = 1.9
Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 4/48 (8%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEK----KEKVEEEEVKEER 118
+EK++ K+EEE +LE +E +K+ + +EK E + EE+
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEK 242
Score = 27.3 bits (61), Expect = 2.5
Identities = 12/62 (19%), Positives = 28/62 (45%), Gaps = 3/62 (4%)
Query: 59 LSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
S + + K+ + K E++ E ++ EE + + +EE ++E +E E+
Sbjct: 166 ASAKEELDQ---LSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEK 222
Query: 119 RE 120
+
Sbjct: 223 QL 224
Score = 25.8 bits (57), Expect = 8.4
Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 5/52 (9%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKE-----EKKEKVEEEEVKEERRE 120
EEE+ +R + EE KLEEE + + EK+ ++E E KEE R+
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRK 237
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 27.3 bits (60), Expect = 2.0
Identities = 10/53 (18%), Positives = 25/53 (47%)
Query: 68 GTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
G G K+K + E+E E++ E+ ++ +++ +E + +E +
Sbjct: 87 GLAIGLPIVKRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQ 139
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 27.6 bits (61), Expect = 2.0
Identities = 13/75 (17%), Positives = 32/75 (42%)
Query: 52 DDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
D + + +G EE + K + + E E+ E+ E E+++ ++E
Sbjct: 276 DGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLS 335
Query: 112 EEVKEERREGGRGGR 126
++ K+ ++ G+
Sbjct: 336 KKGKKLKKLKGKKNG 350
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 27.4 bits (61), Expect = 2.0
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
K +K + +EE +E +EE ++ +K+++K+K
Sbjct: 400 KTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKK 435
Score = 26.2 bits (58), Expect = 5.6
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKV 109
K E++ K+ E+K+E++EDE E E+ E+ +K E +
Sbjct: 290 KLEKELKKLENKLEKQEDELEELEKAAEELRQKGELL 326
Score = 26.2 bits (58), Expect = 6.1
Identities = 13/49 (26%), Positives = 22/49 (44%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K+ +K K + ++ + E E E + EK E ++ EE RE
Sbjct: 369 FKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIRE 417
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 27.0 bits (60), Expect = 2.1
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
K++++K E+K E +D+K EE E + E+ +E
Sbjct: 70 KKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109
Score = 25.5 bits (56), Expect = 7.5
Identities = 11/45 (24%), Positives = 24/45 (53%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
++ KK+ ++K+ EEE + +++ E+ E + EE +E
Sbjct: 64 TTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQE 108
>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter. [Transport and binding
proteins, Other].
Length = 953
Score = 27.4 bits (61), Expect = 2.1
Identities = 9/48 (18%), Positives = 19/48 (39%), Gaps = 3/48 (6%)
Query: 75 EEKKKRREDKVE---EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E + E+ + + EK E + + K K+ + E++ R
Sbjct: 838 TESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWKITDAELQSNVR 885
>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
Length = 673
Score = 27.4 bits (62), Expect = 2.2
Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 1/40 (2%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEK-KEKVEEEEVKEE 117
R EDK E E +E + + EE + E
Sbjct: 528 TRIEDKQIEALIEASKEAAAAKAAAAAAAAPLAEEPIAET 567
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 27.1 bits (61), Expect = 2.2
Identities = 14/44 (31%), Positives = 31/44 (70%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+++E +++E+ +EE+E E ++ E+KEE+ E++ EE+ +E
Sbjct: 96 RKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 27.2 bits (61), Expect = 2.4
Identities = 12/58 (20%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLE--EEEMEEQKEEKKEKVEEE--EVKEERRE 120
++++ + +++ ++ +E+LE E+E++E EE+ ++E E E+++E
Sbjct: 69 LNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKE-LEEEISELENEIKELEQEIER 125
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 27.4 bits (61), Expect = 2.4
Identities = 12/53 (22%), Positives = 21/53 (39%), Gaps = 5/53 (9%)
Query: 76 EKKKRREDKVEEEEDEK-LEEEEMEEQKEEKKEKVE----EEEVKEERREGGR 123
KK+ +E K E + +E++ K+E EE + E + R
Sbjct: 365 LAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 26.9 bits (60), Expect = 2.4
Identities = 15/47 (31%), Positives = 28/47 (59%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
K+ R +++E EE+EK +EEE E K + E+ +K++ +E +
Sbjct: 114 KQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160
Score = 26.5 bits (59), Expect = 3.4
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
EEEK+ E + + E+ +QK ++ +K E+EE++
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMR 167
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 27.4 bits (60), Expect = 2.5
Identities = 14/69 (20%), Positives = 30/69 (43%)
Query: 52 DDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
+ + +E + KE+++++ + +EE + +E+ E+KEE E E
Sbjct: 273 SEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEEN 332
Query: 112 EEVKEERRE 120
E E +
Sbjct: 333 ERHTELLAD 341
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 27.3 bits (60), Expect = 2.5
Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 2/43 (4%)
Query: 82 EDKVEEEEDEKLEEEEMEE--QKEEKKEKVEEEEVKEERREGG 122
ED+ +E E L E+E E EE E +E E K
Sbjct: 954 EDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRP 996
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 27.0 bits (60), Expect = 2.6
Identities = 13/42 (30%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 72 GKEEEKKKRREDKVEE--EEDEKLEEEEMEEQKEEKKEKVEE 111
K+ +++K+ + +EE +++ + E EE E ++ KEK EE
Sbjct: 95 AKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136
Score = 25.8 bits (57), Expect = 7.2
Identities = 10/37 (27%), Positives = 30/37 (81%), Gaps = 1/37 (2%)
Query: 84 KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+++E E +KL++++ + ++E +K++ +++E +EER++
Sbjct: 183 RLQEWELKKLKQQQQKREEERRKQRKKQQE-EEERKQ 218
Score = 25.5 bits (56), Expect = 8.0
Identities = 13/39 (33%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
+E E KK ++ + + EE+ + + ++ +E +EE+K+K EE
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQE-EEERKQKAEE 222
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 26.5 bits (59), Expect = 2.6
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 1/49 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKL-EEEEMEEQKEEKKEKVEEEEVKEERRE 120
K + +KK+ E K E E+ K+ EE+ E + E+ ++++EE E + RRE
Sbjct: 21 KAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRE 69
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 27.1 bits (60), Expect = 2.6
Identities = 23/95 (24%), Positives = 40/95 (42%), Gaps = 11/95 (11%)
Query: 19 YLNVAIKDGGVAINLKLAMGKLDMYIRPNRLR-FDDNQWH--HLSIHRRIQEGTGCGKEE 75
Y V + + + ++ A G P R +D+N H + I E +EE
Sbjct: 379 YAEVVVPERALVLDWVFADGP------PGNARNYDNNGRQDFHAIVPNNIPEELYWVEEE 432
Query: 76 EK--KKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
+ +K +E++ EE + + E+ K E KEK
Sbjct: 433 HQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEK 467
Score = 26.8 bits (59), Expect = 3.6
Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 5/50 (10%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK-----EERRE 120
E+K+R +K+ +EE E+ + E + ++EE+K +E + + E+RRE
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 27.1 bits (60), Expect = 2.7
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 91 EKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
E+ EEE EE++EE++E E E EE G
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPG 429
Score = 27.1 bits (60), Expect = 3.1
Identities = 17/41 (41%), Positives = 21/41 (51%)
Query: 85 VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
EEEE+E+ EEEE E + +E V EV E GG
Sbjct: 403 EEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPEMPMPAAGGG 443
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 27.0 bits (60), Expect = 2.8
Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 1/49 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
K E V EEE+ + E EE+ E ++ +E ++E REG
Sbjct: 24 ANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQE-LQEGIEEGAREG 71
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 27.3 bits (60), Expect = 2.8
Identities = 16/58 (27%), Positives = 26/58 (44%)
Query: 57 HHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
H S+ R I + EEEKK ++ + E E E+ E + E+ + EE +
Sbjct: 398 HRPSLLRFISTYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDK 455
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 27.1 bits (60), Expect = 2.9
Identities = 11/45 (24%), Positives = 17/45 (37%), Gaps = 1/45 (2%)
Query: 81 REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
+E+K +E D E E++ E K+E G G
Sbjct: 882 KENKDSKEGDADAEVAEVDAA-PGGAVDHEPPVAKQEGNAEGGDG 925
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 27.2 bits (61), Expect = 2.9
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
K EE K R E + E EK E ++ E + +++ V
Sbjct: 447 KAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAA 490
>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
Length = 1119
Score = 27.1 bits (60), Expect = 2.9
Identities = 17/63 (26%), Positives = 30/63 (47%)
Query: 54 NQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
N H +++H E + KE K + E K K + +EME+ E+ K++ E ++
Sbjct: 562 NNNHRVTVHLEAVESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADD 621
Query: 114 VKE 116
E
Sbjct: 622 DPE 624
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 26.8 bits (60), Expect = 3.0
Identities = 10/49 (20%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLE-EEEMEEQKEEKKEKVEEEEVKEERRE 120
K +++K+ R+ + +++ K EE E+ E +E + + + E++
Sbjct: 24 KRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKA 72
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 26.8 bits (60), Expect = 3.1
Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVE-EEEVKEERRE 120
K K EEEE +L +E E EKK E +EE+ EE +
Sbjct: 16 KPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDD 58
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 26.7 bits (59), Expect = 3.1
Identities = 17/33 (51%), Positives = 21/33 (63%)
Query: 88 EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE E+E EE KEE KEK E+++KE RR
Sbjct: 93 EEALDEEKERREESKEEMKEKKFEKKLKELRRA 125
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 26.9 bits (59), Expect = 3.1
Identities = 11/47 (23%), Positives = 19/47 (40%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
+ R E + E E E + E +K + E E V++E +
Sbjct: 84 RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQ 130
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 26.5 bits (59), Expect = 3.2
Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK 106
KEE + ++R +K E E K EEE E +++++K
Sbjct: 83 KEERELRKRAEK-EALEQAKKEEELREAKRQQRK 115
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.2 bits (58), Expect = 3.3
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEME 99
+++ E++ EEEE+E+ EEE M
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 25.8 bits (57), Expect = 4.2
Identities = 12/20 (60%), Positives = 17/20 (85%)
Query: 94 EEEEMEEQKEEKKEKVEEEE 113
EEEE EE++EE++E+ EEE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95
Score = 25.0 bits (55), Expect = 7.4
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 94 EEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
EE++EE++E+ EEEE EE G G
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMAGLGA 102
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 26.5 bits (59), Expect = 3.3
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK 106
E ++ + E K++EEE ++ EE E ++E+KK
Sbjct: 94 EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
Score = 24.9 bits (55), Expect = 9.3
Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 2/38 (5%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+R E + +E++KL+EEE + Q EE KE E E K+
Sbjct: 91 ERAEAQRARDEEKKLDEEEAKRQHEEAKE--REREKKK 126
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 26.4 bits (59), Expect = 3.4
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVE 110
KEE +K+ E + + E EK EEEE + +++ +++
Sbjct: 136 KEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173
Score = 26.0 bits (58), Expect = 5.3
Identities = 15/47 (31%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
+ E+ K E ++++ E+EK EE+E++ E + K+E E +EE
Sbjct: 117 QAEQGKSELEQEIKKLEEEK---EELEKRVAELEAKLEAIEKREEEE 160
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 26.6 bits (59), Expect = 3.4
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 3/50 (6%)
Query: 74 EEEKKKRREDKVEEEE---DEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+E++KK RE K++ E + K E +++ E+ + K E V EE VKE E
Sbjct: 49 DEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEE 98
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 26.8 bits (59), Expect = 3.6
Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 2/43 (4%)
Query: 73 KEEEKKKRREDKV--EEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
K +E K R + EEE+++ LEE + +E + E E E E
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESES 675
Score = 26.4 bits (58), Expect = 4.6
Identities = 15/55 (27%), Positives = 29/55 (52%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+ G+EE++ E +E+E E +E+ E E + + + + +E+E E EG
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEG 696
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 26.7 bits (59), Expect = 3.6
Identities = 10/46 (21%), Positives = 24/46 (52%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E +++ R+ ++EE+ + +E EE+ + +++EEV
Sbjct: 401 EAEEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDR 446
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 26.8 bits (59), Expect = 3.7
Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
K+K +E K +++ K+E +E + E K EE
Sbjct: 798 KQKPKE-KGPNDKEIKIESPSVETEGERCTIKQREE 832
>gnl|CDD|234660 PRK00139, murE,
UDP-N-acetylmuramoylalanyl-D-glutamate--2,
6-diaminopimelate ligase; Provisional.
Length = 460
Score = 26.6 bits (60), Expect = 3.7
Identities = 11/51 (21%), Positives = 18/51 (35%), Gaps = 18/51 (35%)
Query: 8 CWLGDGSDGSDYLNVAIKDGGVAI------------------NLKLAMGKL 40
G DG D++ AI +G A+ +L+ A+ L
Sbjct: 33 ALPGHKVDGRDFIAQAIANGAAAVVAEADGEAGTGVPVIIVPDLRKALALL 83
>gnl|CDD|167284 PRK01833, tatA, twin arginine translocase protein A; Provisional.
Length = 74
Score = 25.6 bits (56), Expect = 3.8
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
KK +DK ++ E EK+E +E +++ KEK
Sbjct: 40 KKAMADDKPKDAEFEKVEAKEAASTEQKAKEK 71
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 26.3 bits (58), Expect = 3.9
Identities = 8/57 (14%), Positives = 25/57 (43%)
Query: 48 RLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
+ D + + R+++ + +K + K+ ++ +E E+ + E K++
Sbjct: 135 GGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKK 191
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 26.4 bits (58), Expect = 3.9
Identities = 20/45 (44%), Positives = 33/45 (73%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
K +EKK+R E+ +E E+ + E++ +E+ EEK++K E EE+KEE
Sbjct: 151 KRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEE 195
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 26.7 bits (60), Expect = 4.0
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
EE+ E+LEE E +EE+++K E++ +
Sbjct: 63 LEEQRERLEELAPELLEEEEEKKEEKKGLPP 93
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 26.5 bits (59), Expect = 4.0
Identities = 11/49 (22%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++EE K +D E+D+ ++++ +E E+ EE+R
Sbjct: 135 EEDEESSKSEDD----EDDDDDDDDDDIATRERSLERRRRRREWEEKRA 179
Score = 26.1 bits (58), Expect = 6.9
Identities = 10/51 (19%), Positives = 25/51 (49%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
R+ + ++++DE+ +EE+ E K E E ++++ ++ R
Sbjct: 114 PRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERS 164
Score = 25.7 bits (57), Expect = 8.8
Identities = 8/51 (15%), Positives = 24/51 (47%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
E D E+++D++ +EE EE + + ++ ++++ ++
Sbjct: 112 DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 26.7 bits (60), Expect = 4.0
Identities = 12/46 (26%), Positives = 28/46 (60%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
+ +++ + + +EEEDEKL +E +E K+ + EK ++ + + +
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK 211
>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
(SSRP1) Pleckstrin homology (PH) domain, repeat 1.
SSRP1 is a component of FACT (facilitator of chromatin
transcription), an essential chromatin reorganizing
factor. In yeast FACT (yFACT) is composed of three
proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
human and mouse, respectively. The middle domain of the
Pob3 subunit (Pob3-M) has an unusual double pleckstrin
homology (PH) architecture. yFACT interacts in a
physiologically important way with the central
single-strand DNA binding factor RPA to promote a step
in DNA Replication. Coordinated function by yFACT and
RPA is important during nucleosome deposition. These
results support the model that the FACT family has an
essential role in constructing nucleosomes during DNA
replication, and suggest that RPA contributes to this
process. Members of this cd are composed of the first
PH-like repeat. PH domains have diverse functions, but
in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 137
Score = 26.0 bits (58), Expect = 4.1
Identities = 16/31 (51%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 83 DKVEEEEDE-KLEEEEMEEQKEEKKEKVEEE 112
DK EE E E L EEE+EE+ + K EK E
Sbjct: 85 DKDEEIELELNLTEEELEEKYKGKLEKEYEG 115
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 26.6 bits (59), Expect = 4.2
Identities = 7/32 (21%), Positives = 20/32 (62%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E+E++ E E+ K++++ + ++++ EE
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106
Score = 26.6 bits (59), Expect = 4.2
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 85 VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
++ EDEK E ++ KK++ E + K+E E
Sbjct: 71 DQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 25.7 bits (57), Expect = 4.2
Identities = 14/57 (24%), Positives = 32/57 (56%)
Query: 56 WHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
++ I ++ +EE ++ RE K + E+++ E + E++EE+K+++ EE
Sbjct: 52 YYKKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 26.1 bits (58), Expect = 4.3
Identities = 13/41 (31%), Positives = 30/41 (73%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
++ E+K+R +++ E +E+++ E + QKEE+++++EE E
Sbjct: 16 EKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELE 56
Score = 25.3 bits (56), Expect = 8.1
Identities = 16/45 (35%), Positives = 28/45 (62%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+K R D E+ E+++ E+EE E ++E++KE + KEER +
Sbjct: 6 DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREK 50
>gnl|CDD|235364 PRK05201, hslU, ATP-dependent protease ATP-binding subunit HslU;
Provisional.
Length = 443
Score = 26.6 bits (60), Expect = 4.3
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 7/42 (16%)
Query: 77 KKKRREDKVEEEEDEKLEEEE------MEEQKEEKKEKVEEE 112
KKK+R+ KV+E + L EEE MEE K+E E+VE+
Sbjct: 210 KKKKRKLKVKEAR-KILIEEEAAKLIDMEEIKQEAIERVEQN 250
>gnl|CDD|176486 cd08665, APC10-CUL7, APC10-like DOC1 domain of CUL7, subunit of
the SCF-ROC1-like E3 ubiquitin ligase complex that
mediates substrate ubiquitination. This model
represents the APC10/DOC1 domain present in CUL7, a
subunit of the SCF-ROC1-like E3 Ubiquitin (Ub) ligase
complex, which mediates substrate ubiquitination (or
ubiquitylation), and is a component of the
ubiquitin-26S proteasome pathway for selective
proteolytic degradation. CUL7 is a member of the
Cullin-RING ligase family and functions as a molecular
scaffold assembling the SCF-ROC1-like E3 Ub ligase
complex consisting of the adapter protein Skp1, CUL7,
the WD40 repeat-containing F-box Fbw8 (also known as
Fbx29), and ROC1 (RING-box protein 1). CUL7 is a large
protein with a C-terminal cullin domain that binds ROC1
and additional domains, including an APC10/DOC1 domain.
While the Fbw8 protein is responsible for substrate
protein recognition, the ROC1 RING domain recruits an
Ub-charged E2 Ub-conjugating enzyme for substrate
ubiquitination. It remains to be determined how CUL7
binds to the Skp1-Fbw8 heterodimer. The CUL7 E3 Ub
ligase has been implicated in the proteasomal
degradation of the cellular proteins, cyclin D1, an
important regulator of the G1 to S-phase cell cycle
progression, and insulin receptor substrate 1, a
critical component of the signaling pathways downstream
of the insulin and insulin-like growth factor 1
receptor. CUL7 appears to be an important regulator of
placental development. Germ line mutations of CUL7 are
linked to 3-M syndrome and Yakuts short stature
syndrome.
Length = 131
Score = 26.0 bits (57), Expect = 4.4
Identities = 13/41 (31%), Positives = 20/41 (48%)
Query: 9 WLGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRL 49
W +GS GS Y+N+ + G V L + + D P R+
Sbjct: 28 WESNGSTGSHYINIHMHRGVVIRQLYMLVASEDSSYMPARV 68
>gnl|CDD|151491 pfam11044, TMEMspv1-c74-12, Plectrovirus spv1-c74 ORF 12
transmembrane protein. This is a family of proteins
expressed by Plectroviruses. The plectroviruses are
single-stranded DNA viruses belonging to the
Inoviridae. Except that it is a putative transmembrane
protein the function is not known.
Length = 49
Score = 24.8 bits (54), Expect = 4.5
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 56 WHHLSIHRRIQEGTGCGKEEEKKKRREDK 84
W LSI+++I++ G K++++ KR+E
Sbjct: 20 WIGLSIYQKIRQIRGKKKDKKEIKRKESN 48
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 26.6 bits (59), Expect = 4.5
Identities = 11/48 (22%), Positives = 27/48 (56%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
E+ ++ +++ E+E + + +E+E +E+K EK+ E + + G
Sbjct: 193 EDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEG 240
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 26.4 bits (58), Expect = 4.6
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEE 104
+ V E ++E++EEEE Q EE
Sbjct: 397 KRVVREVDEEEIEEEEEAMQPEE 419
Score = 26.0 bits (57), Expect = 6.2
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKE 103
K+ RE EE E+E+ + E + E
Sbjct: 397 KRVVREVDEEEIEEEEEAMQPEEMEME 423
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 26.5 bits (59), Expect = 4.7
Identities = 18/39 (46%), Positives = 23/39 (58%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
K+E KK +ED EE +E EE EE+ E +KEK E
Sbjct: 138 KDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176
>gnl|CDD|235240 PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional.
Length = 456
Score = 26.2 bits (59), Expect = 4.9
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
KK+ R D + EE E EE KE +E
Sbjct: 93 KKRYRADHLIEELGIDAEGLSNEELKELIREN 124
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 26.0 bits (57), Expect = 4.9
Identities = 19/48 (39%), Positives = 30/48 (62%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE E K + + +EEE+E+ EEE E + E +E++ E+E +EE E
Sbjct: 38 KENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85
Score = 25.6 bits (56), Expect = 7.7
Identities = 14/47 (29%), Positives = 36/47 (76%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
+EE+++ E+++EE ED + EEE +E+++EE++++ + ++K+ ++
Sbjct: 51 DEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97
Score = 25.2 bits (55), Expect = 8.7
Identities = 20/48 (41%), Positives = 33/48 (68%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KE+ K+ + K E++ED++ EEEE EE+ EE ++ +EEE+ E+ E
Sbjct: 33 KEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEE 80
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 26.1 bits (58), Expect = 5.0
Identities = 14/48 (29%), Positives = 26/48 (54%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+EE K+ ++ E +++ E +E+ EE+ E++ EE E EG
Sbjct: 80 KEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEG 127
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 26.1 bits (56), Expect = 5.0
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 64 RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQ--KEEKKEKVEEEEVKEERRE 120
+ ++GT G ++ +D+ EEEE+ + EEE EE +EE +E+ E EE EE E
Sbjct: 43 KGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPE 101
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 25.9 bits (57), Expect = 5.1
Identities = 5/39 (12%), Positives = 12/39 (30%), Gaps = 3/39 (7%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
L + E + + ++ +EE R+
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKK---REEHRK 139
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 26.6 bits (59), Expect = 5.1
Identities = 13/48 (27%), Positives = 27/48 (56%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
EE + + ++ EE + + + EE++E+ E + ++EE E + E E
Sbjct: 326 EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
>gnl|CDD|180707 PRK06811, PRK06811, RNA polymerase factor sigma-70; Validated.
Length = 189
Score = 26.1 bits (58), Expect = 5.2
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
+K + ++++ ++ L EE E + KE EE
Sbjct: 90 RKLTKNNEIDSIDEFILISEESIENEIILKENKEE 124
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 26.5 bits (58), Expect = 5.5
Identities = 14/81 (17%), Positives = 36/81 (44%), Gaps = 3/81 (3%)
Query: 41 DMYIRPNRLRFDDNQWHH---LSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEE 97
D +RP +N+ ++ + + G+E + E +E + D K E+++
Sbjct: 57 DYELRPRVSSSWNNESYNRLPIKTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQK 116
Query: 98 MEEQKEEKKEKVEEEEVKEER 118
E+++ E ++++ E+
Sbjct: 117 SLEEQKIAPEIPVKQQIDSEK 137
>gnl|CDD|188965 cd09566, SAM_liprin-beta1,2_repeat2, SAM domain of liprin-beta1,2
proteins repeat 2. SAM (sterile alpha motif) domain
repeat 2 of liprin-beta1,2 proteins is a
protein-protein interaction domain. Liprin-beta
proteins contain three copies (repeats) of SAM domain.
They may form heterodimers with liprin-alpha proteins
through their SAM domains. It was suggested based on
bioinformatic approaches that the second SAM domain of
liprin-beta potentially is able to form polymers.
Liprins were originally identified as LAR (leukocyte
common antigen-related) transmembrane protein-tyrosine
phosphatase-interacting proteins. They participate in
mammary gland development, in axon guidance, and in the
maintenance of lymphatic vessel integrity.
Length = 63
Score = 25.0 bits (55), Expect = 5.5
Identities = 10/27 (37%), Positives = 12/27 (44%), Gaps = 7/27 (25%)
Query: 47 NRLRFDD-------NQWHHLSIHRRIQ 66
+ L DD + HH SI R IQ
Sbjct: 34 HYLTVDDLLHLKVTSALHHASIRRGIQ 60
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 26.0 bits (58), Expect = 5.7
Identities = 14/39 (35%), Positives = 25/39 (64%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E+K+ EEE EKLE+++ +E+K++ +E EE +
Sbjct: 8 EEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQS 46
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 25.9 bits (58), Expect = 5.8
Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 76 EKKKRREDKVEEEEDEKLE--EEEMEEQKEEKKEKVEEEEVK 115
EKK + DK +EE E LE +E +EE K + ++ +EE+K
Sbjct: 144 EKKAKSGDKEAKEELELLEKIKEHLEEGKPARTLELTDEEIK 185
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 26.2 bits (58), Expect = 5.8
Identities = 7/28 (25%), Positives = 15/28 (53%)
Query: 99 EEQKEEKKEKVEEEEVKEERREGGRGGR 126
E+ + +++ + E K ER++ R R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPR 610
Score = 26.2 bits (58), Expect = 6.8
Identities = 12/47 (25%), Positives = 32/47 (68%), Gaps = 1/47 (2%)
Query: 75 EEKKKRREDKVEEEEDEK-LEEEEMEEQKEEKKEKVEEEEVKEERRE 120
E +++R ++K + +++ K L EE Q+ E++E+V++ + + ++R+
Sbjct: 678 ERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQVQPRRKQRQ 724
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 25.7 bits (57), Expect = 6.0
Identities = 11/51 (21%), Positives = 26/51 (50%)
Query: 71 CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
G + KKK+ + + E ++ +K E + + + KK ++ + E+ + G
Sbjct: 117 YGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLEQFQSG 167
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 26.1 bits (58), Expect = 6.0
Identities = 18/44 (40%), Positives = 30/44 (68%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
K +EK + +KVEE+ +E+ EEE E ++EE++E EE++ K
Sbjct: 3 DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEK 46
>gnl|CDD|153125 cd02434, Nodulin-21_like_3, Nodulin-21 and CCC1-related protein
family. Nodulin-21_like_3: This is a family of proteins
closely related to nodulin-21, a plant nodule-specific
protein that may be involved in symbiotic nitrogen
fixation. This family is also related to CCC1, a yeast
vacuole transmembrane protein that functions as an iron
and manganese transporter. .
Length = 225
Score = 25.8 bits (57), Expect = 6.0
Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 2/43 (4%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKE--EKKEKVEEE 112
K E E K EE E E E E E E K EE
Sbjct: 57 TKAERDFLHSEKKREEWEIENYPEGEKSEMVEIYSLKGLSEEV 99
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 26.2 bits (58), Expect = 6.0
Identities = 13/44 (29%), Positives = 25/44 (56%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E+E+K+ + KV + K + E+KE+ K + EE+ ++E
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 25.9 bits (57), Expect = 6.0
Identities = 12/51 (23%), Positives = 21/51 (41%)
Query: 69 TGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
T K+E+++ E + E+ EE + + E K + EV E
Sbjct: 67 TKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 26.2 bits (58), Expect = 6.4
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 61 IHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
+ R+++E KEEE +++ K+ E+ E LEE + E K E+ EE++++
Sbjct: 89 LQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 26.2 bits (57), Expect = 6.4
Identities = 12/58 (20%), Positives = 26/58 (44%), Gaps = 6/58 (10%)
Query: 65 IQEGTGCGKEEEKKKRREDK------VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+ +E KK R + + E+EK E++ + +K +K ++ E++ E
Sbjct: 99 VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 25.4 bits (56), Expect = 6.5
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 95 EEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
E + +E +E+ +EEE +EE + G
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDDDMLFG 106
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 26.1 bits (58), Expect = 6.6
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
++ EE E E ++EK E+ +EEE KE E + G
Sbjct: 1 MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQG 40
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 25.8 bits (56), Expect = 6.6
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
KK +ED+ +E+E+E+ EEEE E + EE ++ + G G
Sbjct: 122 AGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNG 174
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 25.8 bits (57), Expect = 6.7
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 84 KVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
K +EE++ E EE+E + + E E
Sbjct: 141 KTDEEKELLREVEELESRLATEPSPAPELE 170
Score = 25.8 bits (57), Expect = 8.2
Identities = 9/28 (32%), Positives = 11/28 (39%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
EE+E + EE E E EE
Sbjct: 144 EEKELLREVEELESRLATEPSPAPELEE 171
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 25.7 bits (57), Expect = 6.9
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 84 KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+V+E+E + E EE EEQKEE++ K+ EE ++
Sbjct: 134 EVDEDESSEEEIEEFEEQKEEEEAKLAEEALEA 166
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 25.9 bits (57), Expect = 6.9
Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+KK E+ EEEEDE L + + E+ + ++E E+
Sbjct: 178 RREKKEEEE--EEEEDEALIKSLSFGPETEEDRRRADDEDSED 218
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 26.2 bits (58), Expect = 6.9
Identities = 7/47 (14%), Positives = 19/47 (40%), Gaps = 1/47 (2%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
E ++ + ++E + E E +K ++ E ++ E+
Sbjct: 230 LISELEEVIAEI-QDELESLRSELEALAEKIAEELLAVREILEIEKA 275
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 25.9 bits (56), Expect = 7.0
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
EE + E+ VEE +E EE EE EE E+ ++E +E E
Sbjct: 1089 EENVEEIEENVEENVEENAEEN-AEENAEENAEEYDDENPEEHNEE 1133
Score = 25.9 bits (56), Expect = 8.9
Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 60 SIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLE---EEEMEEQKEEKKEKVEEEEVKE 116
+I +E EE ++ E+ VEE +E +E EE +EE EE E+ EE ++E
Sbjct: 946 NIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEE 1005
Query: 117 ERRE 120
E
Sbjct: 1006 NVEE 1009
Score = 25.5 bits (55), Expect = 9.8
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 75 EEKKKRREDKVEEEEDEKLEE--EEMEEQKEEKKEKVEEEEVKEERRE 120
EE ++ E+ +EE +E +EE EE+EE EE E+ EE +E E
Sbjct: 1070 EEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEE 1117
>gnl|CDD|150747 pfam10107, Endonuc_Holl, Endonuclease related to archaeal Holliday
junction resolvase. This domain is found in various
predicted bacterial endonucleases which are distantly
related to archaeal Holliday junction resolvases.
Length = 153
Score = 25.4 bits (56), Expect = 7.2
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKV 109
E+ + RE + E + K EE + +E+KE +K+ V
Sbjct: 29 EQWRNAEREAQAMREAEAKFEEWQKKEEKEIRKDAV 64
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 25.5 bits (57), Expect = 7.4
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
++ EE+E+ EEE EE++EE+ E+ E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 25.6 bits (56), Expect = 7.4
Identities = 14/48 (29%), Positives = 30/48 (62%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
K E+KKK E K ++ ++ + E+ +++++ EK+E + K++ RE
Sbjct: 67 KFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERE 114
>gnl|CDD|133433 cd05297, GH4_alpha_glucosidase_galactosidase, Glycoside Hydrolases
Family 4; Alpha-glucosidases and alpha-galactosidases.
Glucosidases cleave glycosidic bonds to release glucose
from oligosaccharides. Alpha-glucosidases and
alpha-galactosidases release alpha-D-glucose and
alpha-D-galactose, respectively, via the hydrolysis of
alpha-glycopyranoside bonds. Some bacteria
simultaneously translocate and phosphorylate
disaccharides via the phosphoenolpyruvate-dependent
phosphotransferase system (PEP-PTS). After
translocation, these phospho-disaccharides may be
hydrolyzed by the GH4 glycoside hydrolases such as the
alpha-glucosidases. Other organsisms (such as archaea
and Thermotoga maritima) lack the PEP-PTS system, but
have several enzymes normally associated with the
PEP-PTS operon. Alpha-glucosidases and
alpha-galactosidases are part of the NAD(P)-binding
Rossmann fold superfamily, which includes a wide variety
of protein families including the NAD(P)-binding domains
of alcohol dehydrogenases, tyrosine-dependent
oxidoreductases, glyceraldehyde-3-phosphate
dehydrogenases, formate/glycerate dehydrogenases,
siroheme synthases, 6-phosphogluconate dehydrogenases,
aminoacid dehydrogenases, repressor rex, and NAD-binding
potassium channel domains, among others.
Length = 423
Score = 26.0 bits (58), Expect = 7.4
Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 5/54 (9%)
Query: 73 KEEEKKKRREDKVEE-----EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
KE +K E +E EE EE + + +K E + VK
Sbjct: 260 KETKKIWYGEFNEDEYGGRDEEQGWEWYEERLKLILAEIDKEELDPVKRSGEYA 313
>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein. Mei5
is one of a pair of meiosis-specific proteins which
facilitate the loading of Dmc1 on to Rad51 on DNA at
double-strand breaks during recombination. Recombination
is carried out by a large protein complex based around
the two RecA homologues, Rad51 and Dmc1. This complex
may play both a catalytic and a structural role in the
interaction between homologous chromosomes during
meiosis. Mei5 is seen to contain a coiled-coli region.
Length = 212
Score = 25.5 bits (56), Expect = 7.7
Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 4/51 (7%)
Query: 75 EEKKKRREDKVEEEEDE--KLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
E +K++ E +V+EE D+ +L+ E E K + E +E +K+ + R
Sbjct: 128 EAEKRKLEYQVDEESDDLRRLKLVEKYEIKNDLSEL--QELIKKWEKAKQR 176
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 25.9 bits (57), Expect = 7.8
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 74 EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
+ +K + +D ++ E E+ + E++EEK++ +EE
Sbjct: 785 QAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEE 822
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 25.8 bits (56), Expect = 7.9
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
KE+EK+ ++ K E E +L+ + K E K K E E
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIE 275
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 25.9 bits (58), Expect = 8.0
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 79 KRREDKVEEEEDEKLEEEEMEEQKEE 104
++RE K E E+ EEEE E +EE
Sbjct: 327 RKREKKAAELAAEEAEEEEAAEPEEE 352
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 26.0 bits (57), Expect = 8.1
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 6/63 (9%)
Query: 61 IHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQK-----EEKKEKVEEEEVK 115
I +++ + KKK R D + + E LE EM+ E K+K+EE
Sbjct: 327 IIKKLSVLQKDYNDYIKKKSRYDDLNNQILE-LEGYEMDYNSYLKSIESLKKKIEEYSKN 385
Query: 116 EER 118
ER
Sbjct: 386 IER 388
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 25.6 bits (56), Expect = 8.3
Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 67 EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
G + +K ED +++ + +EE+E++E +EE++ + EEEE E+
Sbjct: 236 GPKGSDVKNKKSDDEEDD--DQDGDYVEEKELKEDEEEEETEEEEEEEDED 284
>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
This family of eukaryotic proteins includes
Saccharomyces cerevisiae Ydl063c and Chaetomium
thermophilum Syo1, which mediate the co-import of two
ribosomal proteins, Rpl5 and Rpl11 (which both interact
with 5S rRNA) into the nucleus. Import precedes their
association with rRNA and subsequent ribosome assembly
in the nucleolus. The primary structure of syo1 is a
mixture of Armadillo- (ARM, N-terminal part of syo1) and
HEAT-repeats (C-terminal part of syo1).
Length = 597
Score = 25.7 bits (57), Expect = 8.3
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 88 EEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
E D+K +E +EQ +E +E+ E++E E
Sbjct: 304 ELDKKDDEAMEDEQSDEDEEEDEDDEASET 333
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 25.2 bits (55), Expect = 8.3
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 92 KLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
KL ++ EE++EE E +EEE+ E R+
Sbjct: 92 KLTSKKEEEKEEEIPEPTKEEELLGEIRD 120
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 25.5 bits (56), Expect = 8.5
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
+D V+E+E+++ EE E + E++E EE EVK
Sbjct: 42 DDVVDEDEEDEAVVEEDENELTEEEED-EEGEVK 74
>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
This group consists of Eukaryotic TruB proteins similar
to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
makes psi55 in the T loop of both cytoplasmic and
mitochondrial tRNAs. Psi55 is almost universally
conserved. Pseudouridine synthases catalyze the
isomerization of specific uridines in an RNA molecule to
pseudouridines (5-ribosyluracil, psi).
Length = 312
Score = 25.5 bits (56), Expect = 8.5
Identities = 15/43 (34%), Positives = 17/43 (39%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
+E K E E EE+E EE K K E EE E
Sbjct: 211 KELKTFAEVTELTATAEGDPVEEVEATHEESKRKSEVEEEANE 253
>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3). This family
consists of several optic atrophy 3 (OPA3) proteins.
OPA3 deficiency causes type III 3-methylglutaconic
aciduria (MGA) in humans. This disease manifests with
early bilateral optic atrophy, spasticity,
extrapyramidal dysfunction, ataxia, and cognitive
deficits, but normal longevity.
Length = 134
Score = 25.3 bits (56), Expect = 8.6
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 91 EKLEEEEMEEQKEEKKEKVEEEEVKEER 118
E +EEE++++ EE + +V E E++ ER
Sbjct: 103 EAKKEEELQQELEELEARVGELELEIER 130
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 25.8 bits (58), Expect = 8.6
Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 11/60 (18%)
Query: 76 EKKKRRE--DKVEEEEDEKLEEEEMEEQKEEKK----EKVEEEEV-----KEERREGGRG 124
+K++R D ++EE E L EE EE+ E++ +K+E++ V +E R GR
Sbjct: 261 DKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKEAFKKLEKKIVRRRILEEGIRIDGRK 320
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 25.7 bits (56), Expect = 8.6
Identities = 10/56 (17%), Positives = 25/56 (44%)
Query: 63 RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
R+ G EE + ++ E+ ++ +E+ EQ+ E ++ E+ + +
Sbjct: 146 RQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKA 201
>gnl|CDD|224849 COG1938, COG1938, Archaeal enzymes of ATP-grasp superfamily
[General function prediction only].
Length = 244
Score = 25.4 bits (56), Expect = 8.7
Identities = 16/32 (50%), Positives = 24/32 (75%), Gaps = 2/32 (6%)
Query: 91 EKLEE--EEMEEQKEEKKEKVEEEEVKEERRE 120
+KLE+ EE+EEQ E+ E++E+EE + ER E
Sbjct: 207 DKLEKEAEEIEEQLEKLAEQLEKEEERVEREE 238
>gnl|CDD|219320 pfam07172, GRP, Glycine rich protein family. This family of
proteins includes several glycine rich proteins as well
as two nodulins 16 and 24. The family also contains
proteins that are induced in response to various
stresses.
Length = 91
Score = 24.7 bits (54), Expect = 8.8
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 97 EMEEQKEEKKEKVE-EEEVKEERREGGRGG 125
E+ EK E E EV++++ GG GG
Sbjct: 22 EVAAADLSNTEKSESENEVQDDKYGGGGGG 51
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 25.2 bits (55), Expect = 8.9
Identities = 12/38 (31%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 86 EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
+EE+E+ EE E ++Q +E +V++ + E E G
Sbjct: 95 RQEEEEEEEENEKQQQSDE--AQVQQHQQHEHPAENGE 130
>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 843
Score = 25.7 bits (57), Expect = 9.1
Identities = 9/43 (20%), Positives = 19/43 (44%), Gaps = 1/43 (2%)
Query: 71 CGKEEEKKKRREDKVEE-EEDEKLEEEEMEEQKEEKKEKVEEE 112
G EE +R + + E E++ + E+ ++ KV +
Sbjct: 779 AGLPEEVIERAREILAELEKESRSSNLELNQKDLSLFPKVLKA 821
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 25.3 bits (56), Expect = 9.1
Identities = 12/46 (26%), Positives = 26/46 (56%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
+KK+ + E D+K EE+K++KK+ +++++K+ G
Sbjct: 157 KKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVG 202
>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
Length = 694
Score = 25.5 bits (56), Expect = 9.3
Identities = 8/39 (20%), Positives = 19/39 (48%)
Query: 77 KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
+++ E + ++ EE E K+ KE++ E++
Sbjct: 325 RRRAARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 25.8 bits (57), Expect = 9.4
Identities = 15/53 (28%), Positives = 28/53 (52%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
+ +E +++R D E+ + + E E + +KEE +E++EE E E G
Sbjct: 834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886
>gnl|CDD|221117 pfam11460, DUF3007, Protein of unknown function (DUF3007). This is
a family of uncharacterized proteins found in bacteria
and eukaryotes.
Length = 104
Score = 25.0 bits (55), Expect = 9.4
Identities = 5/37 (13%), Positives = 19/37 (51%), Gaps = 1/37 (2%)
Query: 76 EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
++++ + + +E+L ++ + E+ E ++ E
Sbjct: 66 QQRRDYREAYDALTNEEL-QKRFDSLSPEELEALQAE 101
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 25.2 bits (55), Expect = 9.5
Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 15/86 (17%)
Query: 56 WHHLSIHRRIQEGTGCGKEE-----EKKKRREDKVE-------EEED---EKLEEEEMEE 100
L R + KE +K RR++ +E E+ D K +E++ +
Sbjct: 51 IQKLRPRWRSLQKRSSEKEITVFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQL 110
Query: 101 QKEEKKEKVEEEEVKEERREGGRGGR 126
++E++++K+E + +E + +
Sbjct: 111 EEEKRRQKIEMWDRMQEGKSYKVSAK 136
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 25.3 bits (55), Expect = 9.5
Identities = 9/52 (17%), Positives = 23/52 (44%)
Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
G KK+ + ++ ++ E +E K E++E ++ E+++
Sbjct: 53 GLVTSKKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAA 104
>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
modification, protein turnover, chaperones].
Length = 623
Score = 25.7 bits (57), Expect = 9.6
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 75 EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
E K + K + + + EE+E + ++E+K+ K E VKE
Sbjct: 472 EGKPFKSITKGDLDLELLEEEDEADSEEEKKEFKPLLERVKE 513
>gnl|CDD|227415 COG5083, SMP2, Uncharacterized protein involved in plasmid
maintenance [General function prediction only].
Length = 580
Score = 25.6 bits (56), Expect = 9.7
Identities = 10/32 (31%), Positives = 14/32 (43%), Gaps = 2/32 (6%)
Query: 59 LSIHRRIQEGTGCGKEEEKKKRREDKVEEEED 90
L ++ R QE G G + K D V +E
Sbjct: 114 LILNEREQESLGSGSDLSKLN--FDIVLKEPK 143
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 25.4 bits (56), Expect = 9.8
Identities = 14/45 (31%), Positives = 25/45 (55%)
Query: 73 KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
KE+++ E+ +EEE + + ++ EK +K EEEE +E
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492
>gnl|CDD|234425 TIGR03974, rSAM_six_Cys, SCIFF radical SAM maturase. Members of
this protein family are predicted radical SAM enzymes
universally associated with Six Cysteines in Forty-Five
protein, or SCIFF (family TIGR03973), a predicted
ribosomal natural product precursor that is nearly
universal in the class Clostridia. Similarity of this
family to radical SAM maturases (PqqE and subtilosin A
maturase) found in the vicinity of other peptide
precursors suggests this protein is the SCIFF radical
SAM maturase [Cellular processes, Biosynthesis of
natural products].
Length = 451
Score = 25.3 bits (56), Expect = 9.9
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 90 DEKLEEEEMEEQKEEKKEKVEEEEVKE 116
+E ++E EE KE+ K K EE+++E
Sbjct: 35 EEGKDKETPEEIKEKLKGKYSEEDIEE 61
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 25.0 bits (55), Expect = 9.9
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 82 EDKVEEEEDEKLEEEEMEEQKEEKKEKV 109
+D E+EE+E EE++ E++ EE++ V
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPV 142
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.133 0.375
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,766,780
Number of extensions: 661256
Number of successful extensions: 11136
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7337
Number of HSP's successfully gapped: 2320
Length of query: 126
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 41
Effective length of database: 7,167,512
Effective search space: 293867992
Effective search space used: 293867992
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 53 (24.0 bits)