RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1625
(130 letters)
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 74.3 bits (183), Expect = 5e-18
Identities = 65/116 (56%), Positives = 82/116 (70%)
Query: 1 MGSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
GSSAGAGSGEFHVYR LRR+EY R + + EK +KE DEEFQQK +EK++ E+KTAKK
Sbjct: 27 QGSSAGAGSGEFHVYRALRRREYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKK 86
Query: 61 RAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSER 116
RAKR K+KQK K K K K KEE+E S+ +S DE + +ED +E E+ E+
Sbjct: 87 RAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 35.3 bits (82), Expect = 0.005
Identities = 14/71 (19%), Positives = 38/71 (53%)
Query: 56 KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
K KK K +++ +K + + KK K + ++ E++ + E E +++++ +E E +
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Query: 116 RKTEIEQDENS 126
+ E ++ + +
Sbjct: 467 EEEEEKKKKQA 477
Score = 33.0 bits (76), Expect = 0.034
Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 33 SEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDD 92
+E+E+ EF ++ + + K KR + K++ K K K + +EEEEE E
Sbjct: 395 TEEEI---EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Query: 93 SKSEDENSDKDEDNKEGNEEVSERKT 118
+ ++E ++ E+ KE EE +++
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 31.8 bits (73), Expect = 0.074
Identities = 18/72 (25%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 59 KKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
KK K++ +K+ K +K + E K+E+++ K ++E +++++ KE +E E +
Sbjct: 406 KKATKKI---KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 119 EIEQDENSVQED 130
E E++E ++
Sbjct: 463 EEEKEEEEEKKK 474
Score = 26.4 bits (59), Expect = 5.4
Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 63 KRLKRKQKLKNKPKKVKTEGKEEEEES-EDDSKSEDENSDKDEDNKEGNEEVSERKTEIE 121
+ L +K K KK+ + +++ EE ++ K K+E+ +E E+ E K E E
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Query: 122 QDE 124
++
Sbjct: 460 EEA 462
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 34.6 bits (80), Expect = 0.009
Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 2/78 (2%)
Query: 31 EKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESE 90
K E ++K + +++T K +K R K+ Q + + K K + +
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKK--KSDSSSED 1367
Query: 91 DDSKSEDENSDKDEDNKE 108
DD D++ D+D+++ E
Sbjct: 1368 DDDSEVDDSEDEDDEDDE 1385
Score = 27.7 bits (62), Expect = 2.0
Identities = 18/92 (19%), Positives = 42/92 (45%), Gaps = 1/92 (1%)
Query: 40 EEFQQKIQEKRQTFEDKTAKKRAK-RLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
EE ++ +++ + +K + K RK KLK K KK K ++ +++ S+
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201
Query: 99 NSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
+SD+ + + + +Q+++ Q+
Sbjct: 1202 DSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKT 1233
Score = 26.2 bits (58), Expect = 5.9
Identities = 15/86 (17%), Positives = 37/86 (43%), Gaps = 6/86 (6%)
Query: 32 KSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESED 91
+ + +E ++E+ + E + AK++ RLK K K K + K+E+++ +
Sbjct: 1131 LEDLDKFEE----ALEEQEEVEEKEIAKEQ--RLKSKTKGKASKLRKPKLKKKEKKKKKS 1184
Query: 92 DSKSEDENSDKDEDNKEGNEEVSERK 117
+ + S + ++E +
Sbjct: 1185 SADKSKKASVVGNSKRVDSDEKRKLD 1210
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 34.1 bits (78), Expect = 0.011
Identities = 25/101 (24%), Positives = 41/101 (40%), Gaps = 3/101 (2%)
Query: 11 EFHVYRHLRRKEYARQKHIQEKSEKELLDE--EFQQKI-QEKRQTFEDKTAKKRAKRLKR 67
E H +K +K+ + +KE+ + +++I +E RQ A+ RA+R
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383
Query: 68 KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKE 108
K K K E E E S + E + E N+E
Sbjct: 384 KAKKKGLIDASPNEDTPSENEESKGSPPQVEATTTAEPNRE 424
Score = 27.9 bits (62), Expect = 1.6
Identities = 17/93 (18%), Positives = 39/93 (41%), Gaps = 15/93 (16%)
Query: 38 LDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQK-------------LKNKPKKVKTEGKE 84
L +EF++ E+R + +K +K+A ++ L + ++K
Sbjct: 279 LTDEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTA 338
Query: 85 EEEESEDDSKSEDEN--SDKDEDNKEGNEEVSE 115
E+ E++ K + + + E N+E +E +
Sbjct: 339 EKNEAKARKKEIAQKRRAAEREINREARQERAA 371
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.0 bits (75), Expect = 0.031
Identities = 18/77 (23%), Positives = 33/77 (42%)
Query: 34 EKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDS 93
++L DE ++ + E Q +++A L K+ + + KE+EEE DD
Sbjct: 3904 NEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDV 3963
Query: 94 KSEDENSDKDEDNKEGN 110
+DE ++N
Sbjct: 3964 GIDDEIQPDIQENNSQP 3980
Score = 30.0 bits (67), Expect = 0.39
Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 6/92 (6%)
Query: 33 SEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDD 92
E + LDEE ++ + +K + + + + K+ E ES+
Sbjct: 3883 EENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSN------EQSAANNESDLV 3936
Query: 93 SKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
SK +D + +D+D +E +E DE
Sbjct: 3937 SKEDDNKALEDKDRQEKEDEEEMSDDVGIDDE 3968
Score = 28.4 bits (63), Expect = 1.2
Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 88 ESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
++D SE+ENSD +E+N++ +EEV++ E NS+ E
Sbjct: 3867 VTKDSVVSENENSDSEEENQDLDEEVNDIP---EDLSNSLNE 3905
Score = 27.3 bits (60), Expect = 3.0
Identities = 19/102 (18%), Positives = 37/102 (36%), Gaps = 6/102 (5%)
Query: 34 EKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEE-----EEE 88
KE D E + +Q++ E+ + L +K+ +G EE EE
Sbjct: 4019 NKEEADAEKDEPMQDE-DPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEES 4077
Query: 89 SEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
+ED KS++E + + + + + E+
Sbjct: 4078 TEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEE 4119
>gnl|CDD|237115 PRK12489, PRK12489, anaerobic C4-dicarboxylate transporter;
Reviewed.
Length = 443
Score = 32.5 bits (75), Expect = 0.036
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 5/48 (10%)
Query: 18 LRR-KEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKR 64
LRR K+ + QE+ L D EF++ + T DK K AK
Sbjct: 193 LRRGKDLDKDPEFQER----LKDPEFREYVYGTTATLLDKKLPKSAKL 236
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 32.2 bits (74), Expect = 0.051
Identities = 19/117 (16%), Positives = 44/117 (37%), Gaps = 13/117 (11%)
Query: 22 EYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
+Y RQ K+E +++E ++K +E + FE + A+ ++ R+ + K + +
Sbjct: 429 QYYRQ----AKAEIRAIEQE-KKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAK 483
Query: 82 GKEE--------EEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
K+ + + ++ + DN RK + + Q
Sbjct: 484 DKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAA 540
>gnl|CDD|222127 pfam13436, Gly-zipper_OmpA, Glycine-zipper containing OmpA-like
membrane domain.
Length = 116
Score = 31.1 bits (71), Expect = 0.055
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 2 GSSAGAGSGEFHVYRHLRRKEYARQKHIQEK 32
G++AGAGSG++ Y RR + A ++ + K
Sbjct: 86 GAAAGAGSGQYSSYGAQRRYDNAYRQCMYAK 116
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 32.1 bits (73), Expect = 0.059
Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 1/77 (1%)
Query: 32 KSEKELLDEEFQQKIQEKRQTFEDKTAKKRA-KRLKRKQKLKNKPKKVKTEGKEEEEESE 90
+ E+ LL E +K + +EDK R + P+ E EE E
Sbjct: 530 RDEQILLIESTYPIKSKKIKYYEDKFFTSRLLPPTFVPTQEPYDPRANNNEASEETETIT 589
Query: 91 DDSKSEDENSDKDEDNK 107
+E++ D DED+
Sbjct: 590 VPENNEEDEEDDDEDDD 606
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 31.7 bits (72), Expect = 0.061
Identities = 19/73 (26%), Positives = 35/73 (47%), Gaps = 1/73 (1%)
Query: 56 KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSE-DENSDKDEDNKEGNEEVS 114
KT+KK+ ++K+ K KK K + + E +E+ S SE +E +DK D + + +
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNT 108
Query: 115 ERKTEIEQDENSV 127
+ E +
Sbjct: 109 AAEVNHEDVIDLS 121
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.9 bits (72), Expect = 0.068
Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 1/94 (1%)
Query: 22 EYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
+ +H E E + E Q + Q +D+T ++ + + + K K V
Sbjct: 797 AEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNA-ENQGEAKQDEKGVDGG 855
Query: 82 GKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
G + +SE++ + E+E +++E+ +E EE E
Sbjct: 856 GGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 30.0 bits (67), Expect = 0.32
Identities = 14/49 (28%), Positives = 29/49 (59%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
E K++E+ + S+ +S+++E+ +E EE E + E E++E +E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 28.0 bits (62), Expect = 1.4
Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 2/91 (2%)
Query: 40 EEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDEN 99
E+ ++ K D + A+ ++ + ++ E EES +++ E E
Sbjct: 614 EQLSRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGET 673
Query: 100 SDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
K E+ EG +ERK E E E ++
Sbjct: 674 ETKGENESEGEIP-AERKGEQE-GEGEIEAK 702
Score = 26.5 bits (58), Expect = 5.8
Identities = 13/50 (26%), Positives = 21/50 (42%)
Query: 79 KTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQ 128
+ E E + E++S+ E K E EG E E + E + V+
Sbjct: 667 AEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716
Score = 26.5 bits (58), Expect = 5.8
Identities = 31/132 (23%), Positives = 50/132 (37%), Gaps = 5/132 (3%)
Query: 2 GSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDE---EFQQKIQEKRQTFEDKTA 58
G G G E H E +H E + DE E ++ +E E +
Sbjct: 691 GEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750
Query: 59 KKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
K + +K + + EGKE+E+E E + + E K ++ EG E
Sbjct: 751 GKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEM--KGDEGAEGKVEHEGETE 808
Query: 119 EIEQDENSVQED 130
E+DE+ Q +
Sbjct: 809 AGEKDEHEGQSE 820
Score = 25.7 bits (56), Expect = 8.9
Identities = 21/99 (21%), Positives = 44/99 (44%), Gaps = 13/99 (13%)
Query: 39 DEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE-GKEEEEESEDDSK--- 94
DE + K++ + +T + + + + + K + + E E + E++ D K
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853
Query: 95 ---------SEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
SE+E +++E+ +E EE E + E E +E
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 31.9 bits (73), Expect = 0.072
Identities = 15/69 (21%), Positives = 29/69 (42%)
Query: 20 RKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVK 79
+ E+ L ++F +KI+EK + E+ +R + KL+ KK++
Sbjct: 300 EPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLE 359
Query: 80 TEGKEEEEE 88
K E+
Sbjct: 360 DLEKRLEKL 368
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.6 bits (71), Expect = 0.094
Identities = 23/91 (25%), Positives = 49/91 (53%), Gaps = 5/91 (5%)
Query: 44 QKIQEKRQTFEDKTAKKRAKRLKRKQ-KLKNKPKKVKTEGKEEEEESEDDSKSEDENSDK 102
++ +EK + ++ + AK+ K ++ N P + + EG+ E E S+ S +E+ +SD
Sbjct: 81 KRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDP 140
Query: 103 ---DEDNKEGNEEV-SERKTEIEQDENSVQE 129
D+DN+ + + S + E + D ++ Q+
Sbjct: 141 KDIDQDNRSSSPSIPSPQDNESDSDSSAQQQ 171
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 31.3 bits (72), Expect = 0.10
Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 3/63 (4%)
Query: 24 ARQKHIQEKSEKEL--LDEEFQQKIQEKRQTFEDKTA-KKRAKRLKRKQKLKNKPKKVKT 80
Q+ +K E E+ L+E++++ +E + DK K K K+K+K + K K K+
Sbjct: 29 LAQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88
Query: 81 EGK 83
+ +
Sbjct: 89 KPR 91
Score = 28.3 bits (64), Expect = 1.3
Identities = 14/78 (17%), Positives = 32/78 (41%), Gaps = 14/78 (17%)
Query: 1 MGSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
+ + GE V +L + K ++E+ + LLD++ + K K A+K
Sbjct: 29 LAQRKKSQKGELEV-TNLNE----QYKEMKEELKAALLDKK-ELKAWHK--------AQK 74
Query: 61 RAKRLKRKQKLKNKPKKV 78
+ ++ + K ++
Sbjct: 75 KKEKQEAKAAKAKSKPRL 92
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 31.6 bits (72), Expect = 0.10
Identities = 19/101 (18%), Positives = 48/101 (47%)
Query: 30 QEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES 89
QE ++ + EE + K ++ ++ + ++ LK + K +E +E EEE
Sbjct: 805 QELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEI 864
Query: 90 EDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
++ +++++E + D K+ N+ + + I+ +S +
Sbjct: 865 QNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIK 905
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 31.1 bits (71), Expect = 0.13
Identities = 20/84 (23%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 5 AGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKR 64
A +G + V + + ++ ++E+ E+EL + E +++I E + K +K+AK+
Sbjct: 46 AESGERKSAVDKLAMKPLEEIEEELREEYEEELKEYEAEKEIWEAEK----KGLEKKAKK 101
Query: 65 LKRKQKLKNKPKKVKTEGKEEEEE 88
+K K + + E + EE E
Sbjct: 102 AIKKGKDEEALAEELLELEAEEPE 125
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.0 bits (70), Expect = 0.15
Identities = 19/76 (25%), Positives = 37/76 (48%)
Query: 30 QEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES 89
E +K L +EE ++K +++ + E + K +A + + K KL+ + T ++ E+
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 90 EDDSKSEDENSDKDED 105
EDEN + D
Sbjct: 66 SRKRDVEDENPEDFID 81
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 31.0 bits (70), Expect = 0.15
Identities = 16/62 (25%), Positives = 25/62 (40%)
Query: 43 QQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDK 102
++ +EK Q + AK++ R+ K PKK + E E E S E +
Sbjct: 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENV 1234
Query: 103 DE 104
E
Sbjct: 1235 AE 1236
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 30.2 bits (69), Expect = 0.16
Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 25 RQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKV 78
K ++E E LLD++ + K EK + K K +AK K+K K + ++
Sbjct: 50 EYKDLKESLEAALLDKK-ELKAWEKAEK---KAEKAKAKAEKKKAKKEEPKPRL 99
Score = 29.0 bits (66), Expect = 0.41
Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 24 ARQKHIQEKSEKEL--LDEEFQQKIQEKRQTFEDK-TAKKRAKRLKRKQKLKNKPKKVKT 80
QK +K E E+ L+EE++ + DK K K K+ +K K K +K K
Sbjct: 31 LAQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKK- 89
Query: 81 EGKEEEEES 89
K+EE +
Sbjct: 90 -AKKEEPKP 97
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 30.7 bits (69), Expect = 0.17
Identities = 11/53 (20%), Positives = 26/53 (49%)
Query: 73 NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDEN 125
++P + + E + E E + + E+ D +++ G EE+ + + DE+
Sbjct: 218 DQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDES 270
Score = 28.0 bits (62), Expect = 1.7
Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
E EE DD++SEDE D D+D NE+ + + E E E S ++
Sbjct: 198 ELAEEMGDDTESEDE-EDGDDDQPTENEQEEQGEGEGEGQEGSAPQE 243
Score = 27.6 bits (61), Expect = 2.0
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
E+EE+ +DD +E+E ++ E EG E + +++E E
Sbjct: 210 EDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRE 250
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 30.6 bits (69), Expect = 0.20
Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 82 GKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
G + EE D+ +D+ D +ED++ G EE SE E E+DE +
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREE-SEGSDESEEDEAEATDG 270
Score = 29.0 bits (65), Expect = 0.63
Identities = 14/52 (26%), Positives = 22/52 (42%)
Query: 73 NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
++P + EE D EDE D + +EG + +E + E DE
Sbjct: 239 DQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE 290
Score = 28.3 bits (63), Expect = 1.1
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
+EE+ +DD +E+S+ + EG++E E + E E
Sbjct: 231 ADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGE 271
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.5 bits (70), Expect = 0.23
Identities = 17/103 (16%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 27 KHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEE 86
K I E+++KE + ++ + E ++ + + +R+ +L+ K++ + + +
Sbjct: 41 KRILEEAKKEA-EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99
Query: 87 EESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+ E K E+E K+++ ++ +E+ +++ E+E+ +
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.1 bits (68), Expect = 0.24
Identities = 13/71 (18%), Positives = 27/71 (38%)
Query: 49 KRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKE 108
KRQ F RA ++ + K+K ++++ + K + + K
Sbjct: 2 KRQDFFRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKL 61
Query: 109 GNEEVSERKTE 119
++E E K +
Sbjct: 62 HSQEKKEEKKK 72
Score = 26.3 bits (58), Expect = 5.0
Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 5/86 (5%)
Query: 12 FHVYRHLRRKEYARQKHIQ-----EKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLK 66
F R +R E + K + E K ++ +K + + + KK + K
Sbjct: 7 FRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEK 66
Query: 67 RKQKLKNKPKKVKTEGKEEEEESEDD 92
+++K K K KKV + + +D+
Sbjct: 67 KEEKKKPKKKKVPLQVNPAQLFVDDE 92
>gnl|CDD|217952 pfam04189, Gcd10p, Gcd10p family. eIF-3 is a multi-subunit complex
that stimulates translation initiation in vitro at
several different steps. This family corresponds to the
gamma subunit if eIF3. The Yeast protein Gcd10p has also
been shown to be part of a complex with the
methyltransferase Gcd14p that is involved in modifying
tRNA.
Length = 299
Score = 30.0 bits (68), Expect = 0.29
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 40 EEFQQKIQEKRQTFEDKTAKKRAKRLKRKQK 70
EE +K+ E +TF KTA + K LKRK+K
Sbjct: 124 EEIIEKLVENSETFHTKTAFSQEKYLKRKKK 154
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 30.1 bits (68), Expect = 0.33
Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 63 KRLKRKQKLKNKPKKVKTEG----KEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
K + ++ KN V G KEE++ ++ ++ ED + E +KE E++ + K
Sbjct: 214 KEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKE 273
Query: 119 EIE 121
+ +
Sbjct: 274 KAK 276
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 29.7 bits (67), Expect = 0.41
Identities = 12/109 (11%), Positives = 49/109 (44%), Gaps = 3/109 (2%)
Query: 14 VYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKN 73
+ + R + A + + + E E E + +E ++ +++ R +++L+
Sbjct: 25 LKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKE--LLLRERNQQRQEARR-EREELQR 81
Query: 74 KPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQ 122
+ +++ + ++ + +E E++ ++++ E+ E + +++
Sbjct: 82 EEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDN 130
Score = 25.8 bits (57), Expect = 8.3
Identities = 18/117 (15%), Positives = 43/117 (36%), Gaps = 3/117 (2%)
Query: 10 GEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRK- 68
+ + R QK +EK E LL+ + + +Q E + ++ +R + +
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 69 -QKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
QK + + + E + E + + +E K+ + R + ++
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQ-LDNELYRVAGLTPEQ 142
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.8 bits (67), Expect = 0.42
Identities = 22/81 (27%), Positives = 46/81 (56%), Gaps = 1/81 (1%)
Query: 45 KIQEKRQTFEDKTAKKRAKRLKR-KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKD 103
K+ E+ ++ D + KR + LK +KL++ KK+ E + E ++K+++E ++
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410
Query: 104 EDNKEGNEEVSERKTEIEQDE 124
E+ K+ E+ E K + ++DE
Sbjct: 411 ENEKKQKEQADEDKEKRQKDE 431
Score = 26.7 bits (59), Expect = 4.1
Identities = 21/91 (23%), Positives = 49/91 (53%), Gaps = 6/91 (6%)
Query: 23 YARQKHIQE-KSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
A K +E KS +L ++ Q+ ++E + +D T K+L + + ++ + +
Sbjct: 347 LALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYT-----KKLGEVKDETDASEEAEAK 401
Query: 82 GKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
KEE+ + E++ K + E +D+D++ ++ +E
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 29.6 bits (67), Expect = 0.42
Identities = 9/46 (19%), Positives = 24/46 (52%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+EE + ED+ S+ E+ + D+D+ + ++ + ++ + E
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWE 175
Score = 29.6 bits (67), Expect = 0.45
Identities = 14/40 (35%), Positives = 20/40 (50%)
Query: 80 TEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTE 119
E EE+EES ED++ D D+D+ E ER+
Sbjct: 131 EESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRR 170
Score = 28.4 bits (64), Expect = 1.2
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTE 119
+ EE +E +++S +++ D D+D+ + + ER E
Sbjct: 128 DDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166
Score = 28.0 bits (63), Expect = 1.7
Identities = 10/42 (23%), Positives = 27/42 (64%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQ 122
+ ++EE E+D +S D+D+D+ + +++++ R+ +E+
Sbjct: 126 DDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER 167
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 29.7 bits (67), Expect = 0.48
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 10/53 (18%)
Query: 82 GKEEEEESEDDSKSEDENSDKDEDNKEGNE---EVSERKTE-------IEQDE 124
GK EE E+ + +D+ S K N N + S +K I+Q+E
Sbjct: 841 GKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Score = 27.4 bits (61), Expect = 2.9
Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 3/65 (4%)
Query: 26 QKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEG-KE 84
QK ++ + E EE +E QT A A LK+ K K K+ K+ K+
Sbjct: 834 QKLMENAGKMEEYVEE--NGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQ 891
Query: 85 EEEES 89
EE E+
Sbjct: 892 EERET 896
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 29.4 bits (66), Expect = 0.48
Identities = 14/68 (20%), Positives = 28/68 (41%)
Query: 57 TAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSER 116
A + K RKQK +NK + + K+ ++E + D ++ KE E
Sbjct: 12 PAAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAI 71
Query: 117 KTEIEQDE 124
+ + ++
Sbjct: 72 EEHLSIEQ 79
>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
Glycoprotein E (gE) of Alphaherpesvirus forms a complex
with glycoprotein I (gI) (pfam01688), functioning as an
immunoglobulin G (IgG) Fc binding protein. gE is
involved in virus spread but is not essential for
propagation.
Length = 437
Score = 29.3 bits (66), Expect = 0.60
Identities = 12/56 (21%), Positives = 20/56 (35%)
Query: 57 TAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
++R ++ P E E+ DS+S+DE D D D +
Sbjct: 375 VTRRRRRKPYAVILNPFSPVYTSLPTNELFEDVSSDSESDDEEFDSDSDREPSGPS 430
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 29.2 bits (65), Expect = 0.60
Identities = 12/52 (23%), Positives = 30/52 (57%)
Query: 64 RLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
+K++ ++ K+ + E EEE++ E+D +D+ + D+D+ + ++ E
Sbjct: 31 TMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDE 82
Score = 27.3 bits (60), Expect = 2.4
Identities = 14/61 (22%), Positives = 34/61 (55%)
Query: 70 KLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
K +N +K+ E +EE E EDD + +D++ D ++++ + +++ + + E + D +
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDD 92
Query: 130 D 130
Sbjct: 93 S 93
Score = 26.5 bits (58), Expect = 5.7
Identities = 17/70 (24%), Positives = 35/70 (50%)
Query: 57 TAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSER 116
T K+ +KL + ++ E ++++EE +DD EDE+ D D+D+++ +E +
Sbjct: 27 TLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDD 86
Query: 117 KTEIEQDENS 126
T +
Sbjct: 87 STLHDDSSAD 96
>gnl|CDD|222025 pfam13286, HD_assoc, Phosphohydrolase-associated domain. This
domain is found on bacterial and archaeal
metal-dependent phosphohydrolases.
Length = 91
Score = 27.8 bits (63), Expect = 0.62
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 12/57 (21%)
Query: 13 HVYRHLRRK-EYARQKHIQEK------SEKELLDEEFQQKIQEKRQTFEDKTAKKRA 62
+VYRH + E + + I + ++ ELL EF+ + + D A+ R
Sbjct: 21 YVYRHPEVQREEYKARRIVRELFEALMADPELLPPEFRARWEA-----ADDDARARV 72
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 28.8 bits (65), Expect = 0.73
Identities = 11/73 (15%), Positives = 33/73 (45%), Gaps = 14/73 (19%)
Query: 38 LDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKP--------------KKVKTEGK 83
L++E ++ +Q++ D+ +K K K+ ++ +++P V +
Sbjct: 465 LEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPDPIE 524
Query: 84 EEEEESEDDSKSE 96
+ E+E ++++
Sbjct: 525 KTSLETEVSNEAK 537
>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
Length = 211
Score = 28.7 bits (64), Expect = 0.79
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 2/60 (3%)
Query: 49 KRQTFEDKTAKKRAKRLKRKQKLKN--KPKKVKTEGKEEEEESEDDSKSEDENSDKDEDN 106
K+ E K L ++ KLK KP+K+ G E E + E N KD D
Sbjct: 38 KKSAPEQVPTGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGNPCKDLDK 97
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.5 bits (64), Expect = 0.87
Identities = 18/86 (20%), Positives = 39/86 (45%), Gaps = 8/86 (9%)
Query: 31 EKSEKELLDEEFQ------QKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKE 84
+KS+ + + + ++I E+K A K+ K++ K KK +E K+
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKK--SEKKK 74
Query: 85 EEEESEDDSKSEDENSDKDEDNKEGN 110
++++ + + KSE E + K+
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSK 100
Score = 28.1 bits (63), Expect = 1.5
Identities = 15/79 (18%), Positives = 37/79 (46%), Gaps = 2/79 (2%)
Query: 53 FEDKTAKKRAKRL--KRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGN 110
F T +K + + K++ + EEE + S +D+ DK+ ++K+ +
Sbjct: 11 FFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKS 70
Query: 111 EEVSERKTEIEQDENSVQE 129
E+ ++K E ++ ++ +
Sbjct: 71 EKKKKKKKEKKEPKSEGET 89
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.2 bits (63), Expect = 0.92
Identities = 19/75 (25%), Positives = 31/75 (41%)
Query: 56 KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
K KK+ RK K KK E EEE+ + + E ++ D DN E + + E
Sbjct: 49 KQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQE 108
Query: 116 RKTEIEQDENSVQED 130
+ + ++D
Sbjct: 109 SASANSLSDIDNEDD 123
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 28.7 bits (64), Expect = 0.93
Identities = 15/50 (30%), Positives = 21/50 (42%), Gaps = 2/50 (4%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
++ E ESE D ED D+ ED+ E NE V + + V
Sbjct: 667 AIEDSESESESDG--EDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNL 714
Score = 27.9 bits (62), Expect = 1.8
Identities = 13/59 (22%), Positives = 31/59 (52%)
Query: 68 KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENS 126
K + EG+EE+E+ +++ +++ + ED++ +E E E EQ++++
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDA 691
Score = 27.9 bits (62), Expect = 1.8
Identities = 12/53 (22%), Positives = 27/53 (50%)
Query: 77 KVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+T+ E+E E+ +DS+SE E+ +D + E ++ + + D+ +
Sbjct: 655 LEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 0.97
Identities = 29/115 (25%), Positives = 54/115 (46%), Gaps = 13/115 (11%)
Query: 19 RRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRK--------QK 70
R + A +K + E SE EE + + + + ++ KK +R +RK ++
Sbjct: 75 RSVKRATKKTVVEISEPL---EEGSELVVNEDAALDKES-KKTPRRTRRKAAAASSDVEE 130
Query: 71 LKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDE-DNKEGNEEVSERKTEIEQDE 124
K + K K ++ +E +D SE E SD +E + E SE + ++E+D+
Sbjct: 131 EKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDD 185
>gnl|CDD|222988 PHA03105, PHA03105, EEV glycoprotein; Provisional.
Length = 188
Score = 28.1 bits (62), Expect = 1.0
Identities = 15/49 (30%), Positives = 32/49 (65%)
Query: 58 AKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDN 106
A R+ + ++ K+KN+ + ++ E K+ + E E+DS + +EN + D++N
Sbjct: 82 ANFRSSKKEKSFKIKNEKENIEVEIKDCDNEFEEDSNNNEENLNWDDNN 130
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 28.4 bits (63), Expect = 1.1
Identities = 10/49 (20%), Positives = 22/49 (44%)
Query: 73 NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIE 121
P +V G + SED+ ++ S+ +D ++ ++ E K +
Sbjct: 762 EGPSRVPERGNHDLLHSEDEMADDEAESENMDDYEDSDDNAYESKDHAD 810
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 28.5 bits (63), Expect = 1.4
Identities = 11/47 (23%), Positives = 24/47 (51%)
Query: 75 PKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIE 121
++ EE+ D+ +D++ D+DED+ + ++ E + E E
Sbjct: 133 KRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 28.3 bits (63), Expect = 1.4
Identities = 15/54 (27%), Positives = 34/54 (62%), Gaps = 1/54 (1%)
Query: 76 KKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
K++K + +E++EE ++ K+E+ + + +E+ K E+ E K IE ++ ++E
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYE-KVNIEANKKFIKE 329
>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
Length = 702
Score = 28.0 bits (62), Expect = 1.4
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 76 KKVKTEGKEEEEESEDDSKSEDENSDKDED 105
K + T +++EEE + D K ED+ ++ +
Sbjct: 7 KLIGTSEEDDEEEMDMDVKEEDDGDRRNRE 36
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 28.2 bits (63), Expect = 1.4
Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 4/87 (4%)
Query: 30 QEKSEKELLDEEFQQKIQEKRQTFED---KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEE 86
+ D +Q+I+ K KK+AK KR+ + EG +
Sbjct: 198 EALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSD 257
Query: 87 EE-SEDDSKSEDENSDKDEDNKEGNEE 112
++ ED +S+ ++SD D +++G +
Sbjct: 258 DDDDEDAIESDLDDSDDDVSDEDGEDL 284
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.0 bits (63), Expect = 1.5
Identities = 20/108 (18%), Positives = 49/108 (45%), Gaps = 9/108 (8%)
Query: 26 QKHIQEKSEKELLDEEFQQKIQEKRQTFE--DKTAKKRAKRLKRKQKLKNKPKKVKTEGK 83
Q + ++++LL+ E + Q+ QT DK +++ + + KQ+L P K++
Sbjct: 42 QAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLR---- 97
Query: 84 EEEEESEDDSKSEDENSDKDED-NKEGNEEVSERKTEIEQDENSVQED 130
+ ++E ++ +D + + E + ++ R + + Q D
Sbjct: 98 --QAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQND 143
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 28.3 bits (63), Expect = 1.5
Identities = 20/60 (33%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 18 LRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKK 77
L K +K +E++E+E EE Q++ +E++ E A+ +A+ KR++KL+N KK
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEE-QRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKK 309
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 28.1 bits (62), Expect = 1.6
Identities = 14/50 (28%), Positives = 29/50 (58%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
E E EEE + S D+ SD+ ++++E +E + + ++++S +ED
Sbjct: 930 ESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED 979
>gnl|CDD|162284 TIGR01282, nifD, nitrogenase molybdenum-iron protein alpha chain.
Nitrogenase consists of alpha (NifD) and beta (NifK)
subunits of the molybdenum-iron protein and an
ATP-binding iron-sulfur protein (NifH). This model
describes a large clade of NifD proteins, but excludes
a lineage that contains putative NifD and NifD homologs
from species with vanadium-dependent nitrogenases
[Central intermediary metabolism, Nitrogen fixation].
Length = 466
Score = 28.1 bits (63), Expect = 1.6
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 46 IQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGK 83
I E + + +K AKKR+K L + VK+ K
Sbjct: 2 IDEVLEVYPEKAAKKRSKHLVVNEPEGKSDCGVKSNIK 39
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 28.0 bits (62), Expect = 1.6
Identities = 16/50 (32%), Positives = 26/50 (52%)
Query: 80 TEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
E EEEEE E++ + E++ S+++E E EE E E++ E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491
Score = 26.8 bits (59), Expect = 4.6
Identities = 19/93 (20%), Positives = 43/93 (46%)
Query: 38 LDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSED 97
L+E + ++ T E++ K++ + + + P K + E + +S+ E+
Sbjct: 384 LEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEE 443
Query: 98 ENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
+++E+ +E EE E + E +DE +E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.5 bits (62), Expect = 1.7
Identities = 16/62 (25%), Positives = 31/62 (50%)
Query: 30 QEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES 89
Q KS K+ L E ++ + + K A+ K K + +++ K+++ + KE E ES
Sbjct: 45 QAKSIKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAES 104
Query: 90 ED 91
+
Sbjct: 105 DH 106
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 27.6 bits (62), Expect = 1.8
Identities = 12/56 (21%), Positives = 24/56 (42%)
Query: 55 DKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGN 110
KK K+ ++ +K K + K K++ E + +K D + K + + G
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGE 463
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 27.7 bits (61), Expect = 1.9
Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)
Query: 80 TEGKEEEEESEDDSKSEDENS--DKDEDNKEGNEEV-SERKTEIEQDENSV 127
E +E + EDDS S S ++++ ++ E + IE D+ V
Sbjct: 460 QESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGV 510
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 27.6 bits (61), Expect = 2.0
Identities = 12/58 (20%), Positives = 25/58 (43%), Gaps = 4/58 (6%)
Query: 55 DKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES----EDDSKSEDENSDKDEDNKE 108
A RAK+ +++ +K K+++ E E E + + K D + + K+
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.5 bits (62), Expect = 2.0
Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 14/93 (15%)
Query: 20 RKEYARQK---HIQEKSEKELLDEEFQQKIQEKRQTFED-----------KTAKKRAKRL 65
++E +K +E E ++E QQ I+E ++ ++ A +A L
Sbjct: 550 KEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHEL 609
Query: 66 KRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
+K NK + K + K++++E +++ K DE
Sbjct: 610 IEARKRLNKANEKKEKKKKKQKEKQEELKVGDE 642
>gnl|CDD|189012 cd09605, M3A, Peptidase M3A family includes Thimet oligopeptidase,
dipeptidyl carboxypeptidase and mitochondrial
intermediate peptidase. The peptidase M3-like family,
also called neurolysin-like family, is part of the
"zincins" metallopeptidases, and includes M3, M2 and M32
families of metallopeptidases. The M3 family is
subdivided into two subfamilies: the widespread M3A,
which comprises a number of high-molecular mass endo-
and exopeptidases from bacteria, archaea, protozoa,
fungi, plants and animals, and the small M3B, whose
members are enzymes primarily from bacteria. Well-known
mammalian/eukaryotic M3A endopeptidases are the thimet
oligopeptidase (TOP; endopeptidase 3.4.24.15),
neurolysin (alias endopeptidase 3.4.24.16), and the
mitochondrial intermediate peptidase. The first two are
intracellular oligopeptidases, which act only on
relatively short substrates of less than 20 amino acid
residues, while the latter cleaves N-terminal
octapeptides from proteins during their import into the
mitochondria. The M3A subfamily also contains several
bacterial endopeptidases, collectively called
oligopeptidases A, as well as a large number of
bacterial carboxypeptidases, called dipeptidyl
peptidases (Dcp; Dcp II; peptidyl dipeptidase; EC
3.4.15.5). The peptidases in the M3 family contain the
HEXXH motif that forms the active site in conjunction
with a C-terminally-located Glutamic acid (Glu) residue.
A single zinc ion is ligated by the side-chains of the
two Histidine (His) residues, and the more C-terminal
Glu. Most of the peptidases are synthesized without
signal peptides or propeptides, and function
intracellularly. The structure of neurolysin shows
similarities to those of angiotensin-converting enzyme
(ACE; peptidyl-dipeptidase A) peptidase unit 2 belonging
to peptidase family M2. ACE is an enzyme responsible for
cleavage of dipeptides from the C-termini of proteins,
notably converting angiotensin I to angiotensin II in
mammals. There are similarities to the thermostable
carboxypeptidases from Pyrococcus furiosus
carboxypeptidase (PfuCP), and Thermus aquaticus (TaqCP),
belonging to peptidase family M32. Little is known about
function of this family, including carboxypeptidases Taq
and Pfu.
Length = 590
Score = 27.5 bits (62), Expect = 2.0
Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 6/64 (9%)
Query: 11 EFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFE------DKTAKKRAKR 64
EF +L YAR K + E E LD E ++ ++ + FE + ++R +
Sbjct: 73 EFSTELNLNEDLYARLKAVYESRESLSLDPEQRRLLELLLRDFERSGAHLPEEKRERLRE 132
Query: 65 LKRK 68
L +
Sbjct: 133 LNEE 136
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 27.6 bits (61), Expect = 2.1
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 9/89 (10%)
Query: 51 QTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDED----- 105
F+D+ A + + K+ K+ KKV ++E S S E+ D D+D
Sbjct: 3 DGFKDRAAFIKGAKDIAKEVKKHAAKKVNKALDRAQDEYSQRSYSRFEDEDDDDDFPAPA 62
Query: 106 ----NKEGNEEVSERKTEIEQDENSVQED 130
E ++ E + + E ++D
Sbjct: 63 DGYSRGEAADDEEEGEASSDATEGHDEDD 91
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 27.5 bits (61), Expect = 2.1
Identities = 17/83 (20%), Positives = 24/83 (28%), Gaps = 7/83 (8%)
Query: 28 HIQEKSEKELLDEEFQQKIQEKRQTF-------EDKTAKKRAKRLKRKQKLKNKPKKVKT 80
H+Q +L D + + + E K KK K K KPKK K
Sbjct: 110 HLQMLHHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169
Query: 81 EGKEEEEESEDDSKSEDENSDKD 103
+G + D
Sbjct: 170 KGSVSNRSVKMPGIDPRSKPDWK 192
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 27.4 bits (60), Expect = 2.4
Identities = 30/101 (29%), Positives = 63/101 (62%), Gaps = 2/101 (1%)
Query: 24 ARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGK 83
A +++ + +E+ EE ++K +E ++ ED+ KK A+ LK++ + K +++K +
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE--KKAAEALKKEAEEAKKAEELKKKEA 1712
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
EE++++E+ K+E+EN K E+ K+ EE ++ E ++DE
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753
Score = 26.3 bits (57), Expect = 5.9
Identities = 27/107 (25%), Positives = 62/107 (57%), Gaps = 2/107 (1%)
Query: 18 LRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKK 77
LR+ E A++ +E +E++ ++K + + + + AK +A+ LK+ ++ K K ++
Sbjct: 1580 LRKAEEAKK--AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ 1637
Query: 78 VKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
+K + EE++++E+ K+E+EN K + + EE ++ E ++ E
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 26.8 bits (60), Expect = 2.6
Identities = 8/34 (23%), Positives = 20/34 (58%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERK 117
EE +E E++ + ++S +DED + + ++ +
Sbjct: 113 EEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGR 146
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 27.4 bits (61), Expect = 2.7
Identities = 10/27 (37%), Positives = 13/27 (48%)
Query: 89 SEDDSKSEDENSDKDEDNKEGNEEVSE 115
SEDDS+ D D+D G + E
Sbjct: 33 SEDDSEDPASLDDDDDDRDSGGGMLEE 59
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 27.2 bits (61), Expect = 2.7
Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 1/43 (2%)
Query: 57 TAKKRAKRLKRKQKLKNKPKKVKT-EGKEEEEESEDDSKSEDE 98
A +KR +++ + K KP + E +EEEEES DD D
Sbjct: 319 LAYLLSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDP 361
>gnl|CDD|184220 PRK13663, PRK13663, hypothetical protein; Provisional.
Length = 493
Score = 27.2 bits (61), Expect = 2.7
Identities = 14/62 (22%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 29 IQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEE 88
+E S++E++ +Q + K+ D+ A K+ + L + + +KV +E+ EE
Sbjct: 295 CREASKQEIIRRYYQTLVDFKK-GGVDEEAVKKIELLMNDAGITPEDRKVVVAAREKAEE 353
Query: 89 SE 90
+
Sbjct: 354 TG 355
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 26.6 bits (59), Expect = 2.7
Identities = 12/42 (28%), Positives = 26/42 (61%)
Query: 65 LKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDN 106
L RK+ N P ++ +G + +++ EDD E+ +++ D++N
Sbjct: 34 LSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDEN 75
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 27.2 bits (61), Expect = 2.8
Identities = 15/53 (28%), Positives = 30/53 (56%)
Query: 36 ELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEE 88
+ L EE ++I+E ++ + KKR ++ +K+K K K KK + K++ +
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 27.3 bits (60), Expect = 2.8
Identities = 15/59 (25%), Positives = 24/59 (40%)
Query: 71 LKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+K K V EE SE +S++ E +D K+ + E + EI + E
Sbjct: 78 IKTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSE 136
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 27.3 bits (61), Expect = 2.9
Identities = 14/44 (31%), Positives = 27/44 (61%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
+G ++E+E +DD E+E D D ++E +EE + E +++E
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371
Score = 25.7 bits (57), Expect = 8.6
Identities = 11/52 (21%), Positives = 33/52 (63%)
Query: 79 KTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
+ E +EE+ ++D + +D++ +++E++ + ++E + + E DE+ +E+
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372
Score = 25.7 bits (57), Expect = 9.7
Identities = 12/41 (29%), Positives = 28/41 (68%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
E+EE+ +DD + E+E+ D ++ ++ +E S+ + + E++E
Sbjct: 333 EDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEE 373
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 27.1 bits (60), Expect = 3.8
Identities = 18/53 (33%), Positives = 26/53 (49%)
Query: 60 KRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
K K K K N ++ + +E+EDD EDE+ D+DED E +E
Sbjct: 858 KLVKNFISKFKELNDLTDLEEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
Members of this protein family are polyribonucleotide
nucleotidyltransferase, also called polynucleotide
phosphorylase. Some members have been shown also to have
additional functions as guanosine pentaphosphate
synthetase and as poly(A) polymerase (see model
TIGR02696 for an exception clade, within this family)
[Transcription, Degradation of RNA].
Length = 684
Score = 26.7 bits (60), Expect = 3.9
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 9/78 (11%)
Query: 26 QKHIQEKSEKE-------LLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKV 78
+ + E++ KE +DEE + K++E K A + ++ +R L ++V
Sbjct: 210 IEELAEEAGKEKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEEV 269
Query: 79 KTE--GKEEEEESEDDSK 94
+EEEEE D K
Sbjct: 270 LEALAEEEEEEEEADREK 287
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 26.5 bits (59), Expect = 4.1
Identities = 17/57 (29%), Positives = 34/57 (59%)
Query: 68 KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
K+KL +KV+ + +EEEEE +++ E+E D++E ++ + TEI++ +
Sbjct: 6 KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKD 62
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 26.6 bits (59), Expect = 4.1
Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 4/81 (4%)
Query: 39 DEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
Q E+ D + +K ++K + +++ + ++D +SE+E
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANS----EDKEDKGDAEKEDEESEEE 94
Query: 99 NSDKDEDNKEGNEEVSERKTE 119
N ++DE++ + NE+ +E KTE
Sbjct: 95 NEEEDEESSDENEKETEEKTE 115
Score = 26.2 bits (58), Expect = 4.8
Identities = 14/77 (18%), Positives = 38/77 (49%)
Query: 54 EDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEV 113
+ ++ AK+ ++ + + K + + E+ ED +E E+ + +E+N+E +EE
Sbjct: 43 QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102
Query: 114 SERKTEIEQDENSVQED 130
S+ + +++ +
Sbjct: 103 SDENEKETEEKTESNVE 119
Score = 25.5 bits (56), Expect = 9.5
Identities = 17/77 (22%), Positives = 34/77 (44%)
Query: 53 FEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
F + + A + +K ++ E KEEE+E+ + ED+ + ED + E
Sbjct: 36 FPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEEN 95
Query: 113 VSERKTEIEQDENSVQE 129
E + +++E +E
Sbjct: 96 EEEDEESSDENEKETEE 112
>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
Length = 438
Score = 26.5 bits (59), Expect = 4.2
Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 4/52 (7%)
Query: 60 KRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNE 111
K++ R KR++ KN K+ +EE E+ EDD K+ D + + E+ K E
Sbjct: 81 KQSLREKRRKASKNTIKR----KREEIEKLEDDIKNLDVDDEMYEEIKTDLE 128
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 26.8 bits (59), Expect = 4.4
Identities = 11/43 (25%), Positives = 15/43 (34%)
Query: 88 ESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
ES D S E D+DED + + NS +
Sbjct: 112 ESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQ 154
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 26.5 bits (59), Expect = 4.5
Identities = 10/43 (23%), Positives = 24/43 (55%)
Query: 66 KRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKE 108
K+KQ+ K + K+ K ++++++ ++D E +E +E
Sbjct: 183 KKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEERQRE 225
Score = 26.1 bits (58), Expect = 5.4
Identities = 7/43 (16%), Positives = 21/43 (48%)
Query: 55 DKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSED 97
++ K+ ++KQ+ K + + + +E+E+ + + E
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEG 218
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 26.7 bits (59), Expect = 4.6
Identities = 19/101 (18%), Positives = 31/101 (30%), Gaps = 25/101 (24%)
Query: 54 EDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEE-------------------------E 88
D A A+ L+ + + +P+ E + +
Sbjct: 374 WDAFALWHARVLRGEFAERRRPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGD 433
Query: 89 SEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
EDD ED D DED+ ++V E + E E
Sbjct: 434 DEDDDDWEDLGFDLDEDDVYDLKDVDEWRLERRIATAERWE 474
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 26.4 bits (59), Expect = 4.7
Identities = 12/42 (28%), Positives = 20/42 (47%)
Query: 45 KIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEE 86
K ++ + + K++ K KR + KN K+ K G EE
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 26.6 bits (59), Expect = 5.0
Identities = 16/72 (22%), Positives = 28/72 (38%)
Query: 59 KKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
++R K ++ Q K + K E + E D+ + + E S GN++ E
Sbjct: 308 RRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDP 367
Query: 119 EIEQDENSVQED 130
+ E D D
Sbjct: 368 QEEADGQGSSTD 379
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 26.6 bits (59), Expect = 5.0
Identities = 14/121 (11%), Positives = 42/121 (34%), Gaps = 18/121 (14%)
Query: 1 MGSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
M S G + + Q I+++++ + + + ++ +E
Sbjct: 823 MESVPSGGGSQ-----DIIADHQGHQAIIEQRTQDSGIRNDVKHQVDNMVTEYEGNIGDT 877
Query: 61 RAKRLKRKQKLKNKPKKVKTEGKEE---------EEESED----DSKSEDENSDKDEDNK 107
+ + +K + +++ K E EE+S + S +E + ++ +
Sbjct: 878 QNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQ 937
Query: 108 E 108
+
Sbjct: 938 D 938
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 26.5 bits (58), Expect = 5.1
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 82 GKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQD 123
G E +++ S DE D D+D+ E N++ + + + D
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCD 311
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 26.5 bits (59), Expect = 5.1
Identities = 7/41 (17%), Positives = 16/41 (39%)
Query: 58 AKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
AK+ + K++ K +G+ + + DD +
Sbjct: 19 AKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPT 59
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 26.4 bits (58), Expect = 5.1
Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 55 DKTAKKRAKRLKRKQKL--KNKPKKVKTEGKEEEEESEDDSKSEDE 98
D++ KK+ ++ Q K P K + +EEE+ SE+D+ + +
Sbjct: 3 DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGD 48
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 26.4 bits (59), Expect = 5.3
Identities = 15/62 (24%), Positives = 30/62 (48%)
Query: 44 QKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKD 103
+++ TF ++ ++ A+ + +K++ E EE EE +DD +D + D D
Sbjct: 322 KELCWDLMTFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDD 381
Query: 104 ED 105
E
Sbjct: 382 EG 383
>gnl|CDD|177775 PLN00180, PLN00180, NDF6 (NDH-dependent flow 6); Provisional.
Length = 180
Score = 25.9 bits (57), Expect = 5.4
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 23 YARQKHIQEKSEKELLDEEFQQKIQEK 49
Y RQ+ I+E + EL EE ++I++K
Sbjct: 141 YERQEDIEESARAELWREELIEEIEQK 167
>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4. All proteins in this
family for which functions are known are involved in
targeting nucleotide excision repair to specific regions
of the genome.This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 713
Score = 26.4 bits (58), Expect = 5.5
Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 1/91 (1%)
Query: 39 DEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
+ E Q K + ++ E++ + +R K+ + + P +V + S+D S +
Sbjct: 21 EAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLNEL-PHEVVLMCNLDSTHSDDRVVSVPD 79
Query: 99 NSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+ E+ E+ R+ E N V E
Sbjct: 80 SLSVSEEIPSREEDYDSREFEDVYLSNLVAE 110
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 25.8 bits (57), Expect = 5.8
Identities = 12/48 (25%), Positives = 22/48 (45%)
Query: 81 EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQ 128
+ +E EEE ED+ +E S K+ N ++ IE ++ +
Sbjct: 16 DSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATSNPK 63
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 26.2 bits (58), Expect = 6.1
Identities = 11/44 (25%), Positives = 19/44 (43%)
Query: 47 QEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESE 90
QE + +K +KK AK+ K + + K E+E +
Sbjct: 3 QEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDD 46
>gnl|CDD|238566 cd01146, FhuD, Fe3+-siderophore binding domain FhuD. These
proteins have been shown to function as initial
receptors in ABC transport of Fe3+-siderophores in many
eubacterial species. They belong to the TroA-like
superfamily of helical backbone metal receptor proteins
that share a distinct fold and ligand binding mechanism.
A typical TroA-like protein is comprised of two
globular subdomains connected by a long alpha helix and
binds its specific ligands in the cleft between these
domains.
Length = 256
Score = 26.1 bits (58), Expect = 6.3
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 2/31 (6%)
Query: 30 QEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
+E++EK L E+ Q++ E RQ DK K
Sbjct: 115 EEEAEKLL--AEYDQRLAELRQKLPDKGPKP 143
>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
Length = 903
Score = 26.4 bits (58), Expect = 6.4
Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 4/59 (6%)
Query: 72 KNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTE----IEQDENS 126
K+ K T EE+ + D ++ +++ +E N + E KTE + E +
Sbjct: 763 KDYSSKETTRSVEEKGNAPDLDNEYEDEKLDNKNMRENNVDEVELKTELGTNVADTEQT 821
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 26.0 bits (58), Expect = 6.5
Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 9/89 (10%)
Query: 25 RQKHIQEKSEKELLDEEFQQKIQEKRQ-TFEDKTAKKRAKRLKR--------KQKLKNKP 75
+++ I + L D+E ++ +++ + EDK K+R + ++ LK +
Sbjct: 489 KEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEG 548
Query: 76 KKVKTEGKEEEEESEDDSKSEDENSDKDE 104
K+ K++ EE+ + K E E DK+E
Sbjct: 549 DKLPEADKKKVEEAIEWLKEELEGEDKEE 577
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 26.3 bits (58), Expect = 6.8
Identities = 13/47 (27%), Positives = 23/47 (48%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
E+ DD + EDE +++ED EG + E E+D+ + +
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYE 428
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 25.9 bits (57), Expect = 7.3
Identities = 13/47 (27%), Positives = 26/47 (55%)
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
+ EE D+ +S D ++DED+ + +E+ + + E +S+ ED
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSED 253
Score = 25.5 bits (56), Expect = 9.6
Identities = 13/48 (27%), Positives = 23/48 (47%)
Query: 83 KEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
+ E +SED+ +D D+D+D E E S + D +S + +
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEME 262
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 25.8 bits (57), Expect = 7.4
Identities = 26/101 (25%), Positives = 54/101 (53%), Gaps = 2/101 (1%)
Query: 17 HLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPK 76
HL+ + +A + E +E + +E ++I++ ++ +E+K K K+ +K+K K+K K
Sbjct: 42 HLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDK 101
Query: 77 KVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERK 117
K + + E++ E ++ED+ D + E +SE K
Sbjct: 102 KDDKKDDKSEKKDEK--EAEDKLEDLTKSYSETLSTLSELK 140
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 25.8 bits (57), Expect = 7.9
Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 70 KLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQ 128
K+K +P+ V G +EE+ E K+E+E + D E +E E + E E D+ +
Sbjct: 264 KVKGEPEVV---GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 26.1 bits (58), Expect = 7.9
Identities = 14/67 (20%), Positives = 27/67 (40%)
Query: 54 EDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEV 113
E ++R + + + ++ K K E + EE E E+ DK ED +
Sbjct: 228 ETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILK 287
Query: 114 SERKTEI 120
++ E+
Sbjct: 288 EKKDEEL 294
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 25.9 bits (57), Expect = 8.2
Identities = 9/57 (15%), Positives = 24/57 (42%)
Query: 73 NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+ + + E +EEE+ S++ + E E+S+++ + E+ +
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADK 422
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 25.7 bits (57), Expect = 8.3
Identities = 21/90 (23%), Positives = 44/90 (48%), Gaps = 1/90 (1%)
Query: 40 EEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDEN 99
E QQ++ E ++ + ++ A+ + Q+LK + ++ E + ++E +
Sbjct: 69 PELQQELAELQEEL-AELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA 127
Query: 100 SDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
+ DE+N+E EE++E K E E E +
Sbjct: 128 IELDEENRELREELAELKQENEALEAENER 157
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 25.7 bits (57), Expect = 8.5
Identities = 25/99 (25%), Positives = 42/99 (42%), Gaps = 9/99 (9%)
Query: 28 HIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKR--AKRLKRKQKLKNKPKKVKTE--GK 83
HI++ E+ E +K++ + K R +L ++ K K E K
Sbjct: 32 HIEDLKEEL--SNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIK 89
Query: 84 EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQ 122
+ EEE E E E + +E+ E E+ E + EIE+
Sbjct: 90 DVEEELEKI---EKEIKELEEEISELENEIKELEQEIER 125
>gnl|CDD|223522 COG0445, GidA, Flavin-dependent tRNA uridine
5-carboxymethylaminomethyl modification enzyme GidA
[Cell cycle control, cell division, chromosome
partitioning].
Length = 621
Score = 26.0 bits (58), Expect = 8.5
Identities = 9/48 (18%), Positives = 21/48 (43%), Gaps = 8/48 (16%)
Query: 37 LLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKE 84
L+D+E ++ +K++ E + ++R + P +V E
Sbjct: 454 LVDDERYERFLKKKENIE--------EEIERLKSTWVTPSEVAKELLA 493
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 25.6 bits (56), Expect = 8.8
Identities = 20/94 (21%), Positives = 35/94 (37%), Gaps = 7/94 (7%)
Query: 19 RRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKV 78
+Y K + +ELL E F + D ++ RK+K + P
Sbjct: 309 YSSDYQAPKPEPIYTWEELLRERFPSDLFAISSL-PDSDSEASDSGPTRKRKRRRVPPL- 366
Query: 79 KTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
E ED+ S+++ D +++ K EE
Sbjct: 367 -----PEYSSDEDEDDSDEDEVDYEKERKRRREE 395
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 25.4 bits (56), Expect = 9.3
Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 1/74 (1%)
Query: 22 EYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
E + K E E Q I+ + +++ +KR + + K+ L + + +
Sbjct: 53 EDGKLAADLLKLEVARKKERLNQ-IRARISQLKEEIEQKRERIEELKRALAQRRSDLSSA 111
Query: 82 GKEEEEESEDDSKS 95
+ E+ +
Sbjct: 112 SYQLEKRRASQLEK 125
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 25.6 bits (57), Expect = 9.5
Identities = 12/47 (25%), Positives = 24/47 (51%), Gaps = 2/47 (4%)
Query: 35 KELLDE--EFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVK 79
KE+L E E + ++ ++R+ + + ++R L+ K NK V
Sbjct: 307 KEILKEFLEHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVI 353
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.296 0.119 0.298
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,290,625
Number of extensions: 573388
Number of successful extensions: 4898
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3119
Number of HSP's successfully gapped: 1294
Length of query: 130
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 44
Effective length of database: 7,123,158
Effective search space: 313418952
Effective search space used: 313418952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.9 bits)
S2: 53 (24.0 bits)