RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3835
(1659 letters)
>gnl|CDD|212794 cd11860, SH3_DLG5, Src homology 3 domain of Disks Large homolog 5.
DLG5 is a multifunctional scaffold protein that is
located at sites of cell-cell contact and is involved in
the maintenance of cell shape and polarity. Mutations in
the DLG5 gene are associated with Crohn's disease (CD)
and inflammatory bowel disease (IBD). DLG5 is a member of
the MAGUK (membrane-associated guanylate kinase) protein
family, which is characterized by the presence of a core
of three domains: PDZ, SH3, and guanylate kinase (GuK).
The GuK domain in MAGUK proteins is enzymatically
inactive; instead, the domain mediates protein-protein
interactions and associates intramolecularly with the SH3
domain. DLG5 contains 4 PDZ domains as well as an
N-terminal domain of unknown function. SH3 domains are
protein interaction domains that bind to proline-rich
ligands with moderate affinity and selectivity,
preferentially to PxxP motifs. They play versatile and
diverse roles in the cell including the regulation of
enzymes, changing the subcellular localization of
signaling pathway components, and mediating the formation
of multiprotein complex assemblies.
Length = 63
Score = 115 bits (291), Expect = 6e-31
Identities = 45/65 (69%), Positives = 53/65 (81%), Gaps = 2/65 (3%)
Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
Y+RALF R+ + D +L F KDDILYVDNTMFNGV G WRAWLVD +G +++CGIIPSK
Sbjct: 1 YVRALFDRSAENED--ELSFKKDDILYVDNTMFNGVFGQWRAWLVDEEGRKRKCGIIPSK 58
Query: 1394 YKVEE 1398
YKVEE
Sbjct: 59 YKVEE 63
>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
signaling molecules, often in tandem arrangements. May be
responsible for specific protein-protein interactions, as
most PDZ domains bind C-terminal polypeptides, and
binding to internal (non-C-terminal) polypeptides and
even to lipids has been demonstrated. In this subfamily
of PDZ domains an N-terminal beta-strand forms the
peptide-binding groove base, a circular permutation with
respect to PDZ domains found in proteases.
Length = 82
Score = 79.1 bits (196), Expect = 1e-17
Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 3/82 (3%)
Query: 1129 LRRVHIDKS-VEPLGIQIQC--LDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRS 1185
+R V + K LG ++ GG+FVS V A + GL+VGD++LEV G+++
Sbjct: 1 VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60
Query: 1186 ATYQLAASVLRQCGNSITMLVQ 1207
T++ A +L+ G+ +T+ V+
Sbjct: 61 LTHEEAVELLKNSGDEVTLTVR 82
Score = 67.6 bits (166), Expect = 9e-14
Identities = 31/80 (38%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 1237 PRFLMIETRKCSNLGISLVGGN--AVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRA 1294
R + + LG SL GG GI+V V+ G GLR GDRILE NG +
Sbjct: 1 VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60
Query: 1295 ATAEEAAYELAKPADKVTVL 1314
T EEA L D+VT+
Sbjct: 61 LTHEEAVELLKNSGDEVTLT 80
Score = 55.3 bits (134), Expect = 2e-09
Identities = 26/82 (31%), Positives = 45/82 (54%), Gaps = 8/82 (9%)
Query: 812 LVTTQLQLNNSYDHGLTL------ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVD 865
+ T L+ + G +L G+++ ++ PG A + G L VGDR+L +N +V+
Sbjct: 1 VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGG-LRVGDRILEVNGVSVE 59
Query: 866 GLSSAREGMALLSDSPDVLTIT 887
GL+ E + LL +S D +T+T
Sbjct: 60 GLTHE-EAVELLKNSGDEVTLT 80
Score = 54.9 bits (133), Expect = 3e-09
Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 6/72 (8%)
Query: 192 GLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMA 251
G L D G G+++ ++ PG A + G L VGDR+L +N +V+GL+ E +
Sbjct: 15 GFSLRGGKDSG----GGIFVSRVEPGGPAERGG-LRVGDRILEVNGVSVEGLTHE-EAVE 68
Query: 252 LLSDSPDVLTIT 263
LL +S D +T+T
Sbjct: 69 LLKNSGDEVTLT 80
Score = 34.8 bits (81), Expect = 0.031
Identities = 9/38 (23%), Positives = 20/38 (52%)
Query: 766 KVNDCIVRVNNLDCSNVSKRMVLETVRSSGTSALMVIR 803
+V D I+ VN + ++ +E +++SG + +R
Sbjct: 45 RVGDRILEVNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82
Score = 30.2 bits (69), Expect = 1.4
Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 6/42 (14%)
Query: 72 SPNEALGLELVGGRDDPFYPNDSSIYVSNIVKGSVTD-GKLK 112
P LG L GG+D + I+VS + G + G L+
Sbjct: 9 DPGGGLGFSLRGGKD-----SGGGIFVSRVEPGGPAERGGLR 45
>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2. Also
called DHR (Dlg homologous region) or GLGF (relatively
well conserved tetrapeptide in these domains). Some PDZs
have been shown to bind C-terminal polypeptides; others
appear to bind internal (non-C-terminal) polypeptides.
Different PDZs possess different binding specificities.
Length = 85
Score = 77.0 bits (190), Expect = 6e-17
Identities = 29/85 (34%), Positives = 46/85 (54%), Gaps = 2/85 (2%)
Query: 1128 ELRRVHIDKSVEPLGIQIQ--CLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRS 1185
E R V ++K LG + + GGV VS+V S A++ GL+VGD +LEV G ++
Sbjct: 1 EPRLVELEKGGGGLGFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEG 60
Query: 1186 ATYQLAASVLRQCGNSITMLVQYSP 1210
T+ A +L++ G +T+ V
Sbjct: 61 LTHLEAVDLLKKAGGKVTLTVLRGG 85
Score = 67.4 bits (165), Expect = 1e-13
Identities = 33/73 (45%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 1244 TRKCSNLGISLVGG--NAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAA 1301
+ LG SLVGG G+ V SV GS AGLR GD ILE NGT + T EA
Sbjct: 8 EKGGGGLGFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEGLTHLEAV 67
Query: 1302 YELAKPADKVTVL 1314
L K KVT+
Sbjct: 68 DLLKKAGGKVTLT 80
Score = 50.1 bits (120), Expect = 1e-07
Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 206 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 265
G+ + + PGS AAK G L VGD +L +N +V+GL+ + LL + +T+T L
Sbjct: 25 GGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGTSVEGLTHLEA-VDLLKKAGGKVTLTVL 82
Query: 266 KP 267
+
Sbjct: 83 RG 84
Score = 50.1 bits (120), Expect = 1e-07
Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 830 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 889
G+ + + PGS AAK G L VGD +L +N +V+GL+ + LL + +T+T L
Sbjct: 25 GGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGTSVEGLTHLEA-VDLLKKAGGKVTLTVL 82
Query: 890 KP 891
+
Sbjct: 83 RG 84
Score = 32.4 bits (74), Expect = 0.27
Identities = 7/40 (17%), Positives = 18/40 (45%)
Query: 766 KVNDCIVRVNNLDCSNVSKRMVLETVRSSGTSALMVIRRR 805
+V D I+ VN ++ ++ ++ +G + + R
Sbjct: 45 RVGDVILEVNGTSVEGLTHLEAVDLLKKAGGKVTLTVLRG 84
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 79.0 bits (195), Expect = 2e-14
Identities = 64/315 (20%), Positives = 137/315 (43%), Gaps = 21/315 (6%)
Query: 297 NETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELR 356
LE R Q + QLE +E ++L + L S + L+ EL EL
Sbjct: 691 KSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELE 750
Query: 357 SGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELA 416
+ L + + E++++ L++ ++L ++LE
Sbjct: 751 ------------EELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEEL 798
Query: 417 QEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQH 476
+EE+ +++ + + +E + + R L+Q+ + + + +E +E L +++++
Sbjct: 799 EEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKEL 858
Query: 477 EDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV 536
E+ +E+ A + + +LK L EE+ +E + SE + +E+EKL E L +
Sbjct: 859 EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELE 918
Query: 537 RKIKTLEMENKELVE-----DKKTLSYQIETLKREIASALHDRD----KALKECNDLRER 587
K++ LE+E EL E + TL ++E + + +A++E ++ ER
Sbjct: 919 AKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEER 978
Query: 588 FGEFTAAKEECQREA 602
+ E + +E+ +
Sbjct: 979 YEELKSQREDLEEAK 993
Score = 78.6 bits (194), Expect = 2e-14
Identities = 69/367 (18%), Positives = 148/367 (40%), Gaps = 38/367 (10%)
Query: 301 RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC 360
R +EL+ + AQLE +E SL+++ L + L R+L EL
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE---- 715
Query: 361 TQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEV 420
+ L + + + E ++ + L + ++L ++LE +EE+
Sbjct: 716 --------RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEEL 767
Query: 421 ARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV 480
+++ + +E +R L++ E E +E L + +++ +
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQE--------------ELEELEEELEEAERRLDALE 813
Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
+E+ + R + ++++ L EE ++ + E + + KE+E+L E+L + + +
Sbjct: 814 RELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873
Query: 541 TLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
LE ++ K L + E L+ E+ + + +E LRER E A E +
Sbjct: 874 ELE-------DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926
Query: 601 EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDK 660
E E E + E+E + + NL A +E + + + ++
Sbjct: 927 EL-PELEEELEEEYEDTLETELEREIERLEE----EIEALGPVNLRAIEEYEEVEERYEE 981
Query: 661 LQAELQD 667
L+++ +D
Sbjct: 982 LKSQRED 988
Score = 77.8 bits (192), Expect = 3e-14
Identities = 54/321 (16%), Positives = 130/321 (40%), Gaps = 26/321 (8%)
Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERN 442
K E L + +L ++L+ + E+ ++ E++ ++ R+
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719
Query: 443 GLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTE 502
LK++ A + + E +E L +++++ E+ + + L E
Sbjct: 720 ELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLE--------------ELEE 765
Query: 503 ERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIET 562
E + + + + E + + ++ + L E+L + +++ E L + ++L + E
Sbjct: 766 ELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825
Query: 563 LKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKE 622
L++EI + ++ ++ ++L E E EE + E E E +E
Sbjct: 826 LEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEE 885
Query: 623 RNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQE 682
+ E+E + ++ + E +EI++LR+ ++L+A+L+ V L E ++E
Sbjct: 886 KEELEE-------ELRELESELAELKEEIEKLRERLEELEAKLERLEVELP-ELEEELEE 937
Query: 683 AEVSKRRRDWAFSERDKIEVI 703
+ +IE +
Sbjct: 938 EYEDTLETEL----EREIERL 954
Score = 74.4 bits (183), Expect = 4e-13
Identities = 53/283 (18%), Positives = 115/283 (40%), Gaps = 11/283 (3%)
Query: 421 ARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV 480
+R K+ +E E L+ Q + E ++ L ++++Q E+
Sbjct: 656 SRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE 715
Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
+++ ++L++L +E + E + + + +E+L E+L +
Sbjct: 716 RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775
Query: 541 TLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
L+ E +EL E ++ L ++E L+ E+ A D +E L +R EE +
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEE 835
Query: 601 EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDK 660
E + D +E E+E +L + ++ + E E+ L + ++
Sbjct: 836 EIEELE-------EKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888
Query: 661 LQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEVI 703
L+ EL++ L E +E E + R + ++ +++EV
Sbjct: 889 LEEELRELESE----LAELKEEIEKLRERLEELEAKLERLEVE 927
Score = 72.1 bits (177), Expect = 2e-12
Identities = 56/298 (18%), Positives = 123/298 (41%), Gaps = 23/298 (7%)
Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQ----QCTAAIRQWDIALRERNE 464
+ LE ++ + ++KQ E + ++ A R + + + + + +E E
Sbjct: 184 TEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEE 243
Query: 465 YQEALAKVQQQHEDAVKEINTAM--------------AVRMKATKDLKRLTEERNAAMQE 510
+E L++++++ E+ +E+ A + ++L L EE E
Sbjct: 244 LEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGE 303
Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR---EI 567
SL+ + + E+E+L E L + KI+ L+ E +E + L + L+ E+
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363
Query: 568 ASALHDRDKALKECND-LRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEV 626
L + L+E + LRE E A E + E + E + + ++
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDL 423
Query: 627 EAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAE 684
+ +L + ++ + + E N+E++ L + ++L+ L++ L E +Q E
Sbjct: 424 KEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL-AELQEELQRLE 480
Score = 71.7 bits (176), Expect = 3e-12
Identities = 49/261 (18%), Positives = 111/261 (42%), Gaps = 4/261 (1%)
Query: 377 HYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNT 436
+ E +++E L + ++L A ++E + E+ ++++ E++ +E
Sbjct: 233 KLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLE 292
Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKD 496
E L+ + + + + E E +E L +++++ E +E+ + + +
Sbjct: 293 LKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQL 352
Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHK----EMEKLSEDLTQAVRKIKTLEMENKELVED 552
L L E + ++ S ++ E + + + E+ +L +L + +++ L+ E + L E
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEER 412
Query: 553 KKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMR 612
+ LS ++E LK E+ + ++ E +L E E EE + E A
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAEL 472
Query: 613 MGERDSSRKERNEVEAASMDL 633
E KE + +EA L
Sbjct: 473 QEELQRLEKELSSLEARLDRL 493
Score = 64.4 bits (157), Expect = 5e-10
Identities = 50/294 (17%), Positives = 108/294 (36%), Gaps = 34/294 (11%)
Query: 291 LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR 350
L + + E + + +EL + + +LE A +E L+S+ +L + + L
Sbjct: 229 LLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQE 288
Query: 351 ELLEL-----------------RSGCCTQPESDDAGSADSLNQH--YMSALNKYETIKDE 391
ELLEL + E + + + L + ET+ +E
Sbjct: 289 ELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE 348
Query: 392 YDALRKRFDDLINTHSTAVDKLELAQEE-VARIKKQCEDMNQEKNTALRERNGLKQQCTA 450
+ L ++ + L EE ++++ ++ E E LK
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELK----- 403
Query: 451 AIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
RE +E L ++ ++ ED +E+ A + +L+ L EE ++
Sbjct: 404 ---------REIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQ 454
Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
+ + +E+ +L E+L + +++ +LE L +++ L+
Sbjct: 455 LEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLE 508
Score = 64.4 bits (157), Expect = 5e-10
Identities = 60/329 (18%), Positives = 131/329 (39%), Gaps = 38/329 (11%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
EL + + +LE +E S L + +L + + ++E+ EL+S
Sbjct: 225 LELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKS---------- 274
Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQC 427
+ E +++E + L++ +L ++ L +E + ++ +
Sbjct: 275 ----------------ELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENEL 318
Query: 428 EDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAM 487
E++ + + LK++ L ER E L ++ + E+A +E+ +
Sbjct: 319 EELEERLEELKEKIEALKEE-----------LEERETLLEELEQLLAELEEAKEELEEKL 367
Query: 488 AVRMKATKDLK-RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN 546
+ ++ ++L L EE E + I +E + + +E+E L E L + +++ L+ E
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL 427
Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHAR 606
KEL + + L ++E L E+ ++ +L E + ++E
Sbjct: 428 KELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLE 487
Query: 607 FEYAMRMGERDSSRKERNEVEAASMDLFG 635
E+ +S+ R +EA L G
Sbjct: 488 ARLDRLEAEQRASQGVRAVLEALESGLPG 516
Score = 56.6 bits (137), Expect = 1e-07
Identities = 38/239 (15%), Positives = 99/239 (41%), Gaps = 11/239 (4%)
Query: 307 LKELEYYRGQHQAAMAQLEAAAQES----SSLRSKYSDLISDKQRLDRELLELRSGCCTQ 362
++ELE R Q + +LE +E+ +L + L ++RL++E+ EL +
Sbjct: 781 IEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELE-----E 835
Query: 363 PESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVAR 422
+ D L + + E +K+E + L ++L + ++ E +EE+
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895
Query: 423 IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL--AKVQQQHEDAV 480
++ + ++ +E L+ + + E E E +++++ E
Sbjct: 896 LESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLE 955
Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKI 539
+EI V ++A ++ + + E + + ++ + + +E+L ++ + ++
Sbjct: 956 EEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFKET 1014
Score = 43.9 bits (104), Expect = 9e-04
Identities = 57/383 (14%), Positives = 134/383 (34%), Gaps = 62/383 (16%)
Query: 110 KLKKKEEVAR-IKKQCEDMNQEKNTALRERNGLKQ----QCTAAIRQWDIALRERNEYQE 164
L++ E++ ++KQ E + ++ A R + + + + + +E E +E
Sbjct: 187 NLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEE 246
Query: 165 ALAKVQQQHEDAVKEINTAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEG 224
L++++++ E+ +E+ A L +L + L+ L K + EG
Sbjct: 247 ELSRLEEELEELQEELEEAEKEIEELKS-ELEELREELEELQEELLELKE---EIEELEG 302
Query: 225 SLAVGDRVLSINNRTVDGLSSAREG--MALLSDSPDVLTITTLKPTSLGEHSPRVHRKSV 282
+++ L ++ L E + + ++ TL E
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL----EELE-------- 350
Query: 283 NSSTQTMPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEA----AAQESSSLRSKY 338
Q + E + L+ELE + +A+LEA E L+ +
Sbjct: 351 ----QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREI 406
Query: 339 SDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKR 398
L +RL L + L + + E ++ E + L +
Sbjct: 407 ESLEERLERLSERLED-------------------LKEELKELEAELEELQTELEELNEE 447
Query: 399 FDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
++L D+L+ + E+A ++++ + + +E ++ + L+ A
Sbjct: 448 LEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLE------------A 495
Query: 459 LRERNEYQEALAKVQQQHEDAVK 481
+ ++ A+ + + V
Sbjct: 496 EQRASQGVRAVLEALESGLPGVY 518
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 77.4 bits (191), Expect = 5e-14
Identities = 98/591 (16%), Positives = 223/591 (37%), Gaps = 81/591 (13%)
Query: 112 KKKEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ 171
+ +EE+ ++++ E + + E +Q AA R+ L + ++L ++Q+
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAERE----LAQLQARLDSLERLQE 499
Query: 172 QHEDAVKEINTAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDR 231
E + + + + L G+ G+ E + + G AA E A+G R
Sbjct: 500 NLEGFSEGVKALLKNQSGLSGI-------LGVLSE----LISVDEGYEAAIE--AALGGR 546
Query: 232 VLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSSTQTMPL 291
+ V+ L++A++ +A L + ++ +T L S+ + +
Sbjct: 547 ---LQAVVVENLNAAKKAIAFLKQN-ELGRVTFLPLDSIKGTEIQGND------------ 590
Query: 292 YVRQHNETVRRGDHTLKELEYYRGQHQAAMA----------QLEAAAQESSSLRSKYSDL 341
R+ + + K+L + + + A++ L+ A + + LR Y +
Sbjct: 591 --REILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR-I 647
Query: 342 ISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDD 401
++ LD +L+ G T G + N S L + I++ L ++ ++
Sbjct: 648 VT----LDGDLV-RPGGVIT-------GGSAKTNS---SILERRREIEE----LEEKIEE 688
Query: 402 LINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRE 461
L +K+ ++ +A ++K+ E++ +E +E L +Q +A + E
Sbjct: 689 LE-------EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAE 741
Query: 462 RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEER-------NAAMQEYSLI 514
+ +E +A++ ++ + EI +A ++L E +E +
Sbjct: 742 VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL 801
Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR 574
D + E+ L+E+ ++++LE + L QIE L +I S +
Sbjct: 802 REALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEI 861
Query: 575 DKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLF 634
++ + +L + + R E E +R+E+ +L
Sbjct: 862 EELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR 921
Query: 635 GKCQKERMDNLEANQEIDRLR-KHTDKLQAELQDQSVNYLFILTESIQEAE 684
K + + ID L+ + +++ L++ I + +EA
Sbjct: 922 EKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKI-EDDEEEAR 971
Score = 63.5 bits (155), Expect = 1e-09
Identities = 60/349 (17%), Positives = 132/349 (37%), Gaps = 54/349 (15%)
Query: 292 YVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQE-SSSLRS---------KYSDL 341
Y + ET R+ + T + + +LE E L+S +Y +L
Sbjct: 170 YKERRKETERKLERT-----------RENLDRLEDILNELERQLKSLERQAEKAERYKEL 218
Query: 342 ISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHY---MSALNKYETIKDEYDALRKR 398
++ + L+ LL + L + L + E +E A +
Sbjct: 219 KAELRELELALL--------------VLRLEELREELEELQEELKEAEEELEELTAELQE 264
Query: 399 FDDLINTHSTAV----DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQ 454
++ + V +++E Q+E+ + + + Q+K L++Q Q
Sbjct: 265 LEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQ 324
Query: 455 WDIALRERNEYQEALAKV-------QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAA 507
+ + +E E LA++ +++ E E+ A + L+ L E+
Sbjct: 325 LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETL 384
Query: 508 MQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVE-----DKKTLSYQIET 562
+ + + + S++ E+E+L L + + + L+ E +EL++ + K L ++E
Sbjct: 385 RSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEE 444
Query: 563 LKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAM 611
L+ E+ + ++ + +LRE E A + +RE +
Sbjct: 445 LEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493
Score = 60.1 bits (146), Expect = 1e-08
Identities = 57/310 (18%), Positives = 126/310 (40%), Gaps = 45/310 (14%)
Query: 293 VRQHNETVRRGDHTLKELEYYRGQHQAAMAQ----LEAAAQESSSLRSKYSDLISDKQRL 348
V Q E + + L ELE + + + + L A E L ++ L + + L
Sbjct: 742 VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL 801
Query: 349 DRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHST 408
L ELR E L + +L +
Sbjct: 802 REALDELR---------------------------------AELTLLNEEAANLRERLES 828
Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEA 468
++ + + +++Q E+++++ + E L++ + + L ER +EA
Sbjct: 829 LERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEA 888
Query: 469 LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKL 528
LA ++ + E+ +E+ + R + ++L+ L E+ + D++ E+L
Sbjct: 889 LALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL---QERL 945
Query: 529 SEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD-KALKECNDLRER 587
SE+ + + + + LE ++ +D++ +++ L+ +I L + A++E +L+ER
Sbjct: 946 SEEYSLTLEEAEALE---NKIEDDEEEARRRLKRLENKIK-ELGPVNLAAIEEYEELKER 1001
Query: 588 FGEFTAAKEE 597
+ TA KE+
Sbjct: 1002 YDFLTAQKED 1011
Score = 58.9 bits (143), Expect = 2e-08
Identities = 54/276 (19%), Positives = 100/276 (36%), Gaps = 42/276 (15%)
Query: 432 QEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRM 491
E+ + E L+++ + + E ++ L +++++ E KE+
Sbjct: 673 LERRREIEE---LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS 729
Query: 492 KATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVE 551
KDL RL E + + + E + E+E+L E L +A ++ E E
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE------ 783
Query: 552 DKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAM 611
IE L+ +I ++ +E LRE E A EA + R
Sbjct: 784 --------IEELEAQI-------EQLKEELKALREALDELRAELTLLNEEAANLRERLES 828
Query: 612 RMGERDSSRKERNEVEAASMDLFGKCQKERM--DNLEANQEIDRLRKHTDKLQAELQDQS 669
ER + ER + Q E + D EI+ L + ++L++EL+
Sbjct: 829 L--ERRIAATERRLEDLEE-------QIEELSEDIESLAAEIEELEELIEELESELEA-- 877
Query: 670 VNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDS 705
+L E E R +++ ++S
Sbjct: 878 -----LLNERASLEEALALLRSELEELSEELRELES 908
Score = 58.9 bits (143), Expect = 2e-08
Identities = 65/372 (17%), Positives = 150/372 (40%), Gaps = 37/372 (9%)
Query: 291 LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR 350
L V + E + +EL+ + + A+L+ ++ LR + S+L + + L +
Sbjct: 229 LLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQK 288
Query: 351 ELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAV 410
EL L A L Q + ++ + + L + ++L +
Sbjct: 289 ELYAL------------ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELA 336
Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
++L +E++ +K++ E + E L E ++ + + + + + + +A
Sbjct: 337 EELAELEEKLEELKEELESLEAE----LEELEAELEELESRLEELE---EQLETLRSKVA 389
Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
+++ Q EI R++A L+RL + R QE ++ + + E+++L
Sbjct: 390 QLELQIASLNNEI-----ERLEAR--LERLEDRRERLQQEIEELLKKLEE--AELKELQA 440
Query: 531 DLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKA---LKECNDLRER 587
+L + +++ L+ E + L E + L ++E ++ + +A + + L L+E
Sbjct: 441 ELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQEN 500
Query: 588 FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAA-SMDLFGKCQKERMDNLE 646
F+ + + + +G EAA L G+ Q ++NL
Sbjct: 501 LEGFSEGV----KALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVVENLN 556
Query: 647 -ANQEIDRLRKH 657
A + I L+++
Sbjct: 557 AAKKAIAFLKQN 568
Score = 50.4 bits (121), Expect = 8e-06
Identities = 49/292 (16%), Positives = 110/292 (37%), Gaps = 37/292 (12%)
Query: 381 ALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE 440
+ + E +++E + L++ + +L+ +E++ ++ + ++ +E +E
Sbjct: 230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKE 289
Query: 441 RNGLKQQCTAAIRQWDIALRERN--EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLK 498
L +I+ E+ +E LA +++Q E+ ++ + + ++L
Sbjct: 290 LYALAN---------EISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA 340
Query: 499 RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSY 558
L E+ + + +E+E L +L + +++ LE +EL E
Sbjct: 341 ELEEKL--------------EELKEELESLEAELEELEAELEELESRLEELEE------- 379
Query: 559 QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKE---ECQREAFHARFEYAMRMGE 615
Q+ETL+ ++A E L R +E + E E ++ +
Sbjct: 380 QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ 439
Query: 616 RDSSRKERNEVEAASMDLFGKCQKERMDNL--EANQEIDRLRKHTDKLQAEL 665
+ E E + E + EA Q +D + +LQA L
Sbjct: 440 AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARL 491
Score = 38.9 bits (91), Expect = 0.028
Identities = 53/396 (13%), Positives = 143/396 (36%), Gaps = 58/396 (14%)
Query: 115 EEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHE 174
E++ ++++ ++ + +E L+++ ++ + R+ + ++ LA+++ + E
Sbjct: 684 EKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVE 743
Query: 175 DAVKEIN-TAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVL 233
+ I + +T ++ + + A+ + ++
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEER-----------LEEAEEELAEAEAEIEELEAQIE 792
Query: 234 SINNRTV---DGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSSTQTMP 290
+ + L R + LL++ + SL R+ + Q
Sbjct: 793 QLKEELKALREALDELRAELTLLNEE---AANLRERLESLERRIAATERRLEDLEEQI-- 847
Query: 291 LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQL----EAAAQESSSLRSKYSDLISDKQ 346
+ +E + ++ELE + ++ + L + + + LRS+ +L + +
Sbjct: 848 ---EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELR 904
Query: 347 RLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRF------- 399
L+ + ELR + L + + E ++ D L++R
Sbjct: 905 ELESKRSELR------------RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLT 952
Query: 400 -DDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
++ + D E A+ + R++ + +++ A+ E LK++
Sbjct: 953 LEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEEYEELKER----------- 1001
Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKAT 494
+E L + ++ E+A++EI+ R K T
Sbjct: 1002 YDFLTAQKEDLTEAKETLEEAIEEIDREARERFKDT 1037
Score = 38.5 bits (90), Expect = 0.042
Identities = 60/330 (18%), Positives = 118/330 (35%), Gaps = 46/330 (13%)
Query: 114 KEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA---LRERNEYQEALAKVQ 170
++++AR++ + E + + +E L+ + + + A L E E L
Sbjct: 732 RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791
Query: 171 QQHEDAVKEINTAMAVTNYLDGLQ--LNNSYDHGLTLESGLYICKISPGSLAAKEGSLAV 228
+Q ++ +K + A LD L+ L + L L + +AA E L
Sbjct: 792 EQLKEELKALREA------LDELRAELTLLNEEAANLRERL---ESLERRIAATERRLED 842
Query: 229 GDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSSTQT 288
+ + + ++ L++ E L + + L + S + S +
Sbjct: 843 LEEQIEELSEDIESLAAEIEE---LEELIEELESELEALLNERA-SLEEALALLRSELEE 898
Query: 289 MPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQE-SSSLRSKYSDLI----- 342
+ +R+ L+EL Q + + LE L +YS +
Sbjct: 899 LSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEA 958
Query: 343 ------SDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALR 396
D++ R L L L ++A+ +YE +K+ YD L
Sbjct: 959 LENKIEDDEEEARRRLKRLE------------NKIKELGPVNLAAIEEYEELKERYDFLT 1006
Query: 397 KRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
+ +DL + A + LE A EE+ R ++
Sbjct: 1007 AQKEDL----TEAKETLEEAIEEIDREARE 1032
Score = 34.6 bits (80), Expect = 0.55
Identities = 64/436 (14%), Positives = 144/436 (33%), Gaps = 64/436 (14%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
++LE R + Q+ + E L ++ L ++RL +E+ EL +
Sbjct: 379 EQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKEL 438
Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA---RIK 424
+ L + + E +++ + LR+ ++ A +L Q + R++
Sbjct: 439 QAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQ 498
Query: 425 KQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQH-------- 476
+ E ++ L+ ++GL ++ + Y+ A+
Sbjct: 499 ENLEGFSEGVKALLKNQSGLSG---ILGVLSEL-ISVDEGYEAAIEAALGGRLQAVVVEN 554
Query: 477 -EDAVKEIN----------TAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEM 525
A K I T + + +++ E ++ + + + ++
Sbjct: 555 LNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKL 614
Query: 526 EKL----------SEDLTQAVRKIKTLEMENKELVEDK---------------------- 553
K +DL A+ K L + + D
Sbjct: 615 RKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILE 674
Query: 554 -----KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFE 608
+ L +IE L+ +IA + KE +L E + EE R+ R +
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD 734
Query: 609 YAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQ 668
A E + + ++ +L + ++ EA +E+ ++L+A+++
Sbjct: 735 LARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794
Query: 669 SVNYLFILTESIQEAE 684
L L E++ E
Sbjct: 795 K-EELKALREALDELR 809
>gnl|CDD|214504 smart00072, GuKc, Guanylate kinase homologues. Active enzymes
catalyze ATP-dependent phosphorylation of GMP to GDP.
Structure resembles that of adenylate kinase. So-called
membrane-associated guanylate kinase homologues (MAGUKs)
do not possess guanylate kinase activities; instead at
least some possess protein-binding functions.
Length = 174
Score = 65.8 bits (161), Expect = 5e-12
Identities = 39/185 (21%), Positives = 76/185 (41%), Gaps = 28/185 (15%)
Query: 1481 GALSDTVTDKLLQDFPDKFVRCAP--------------EIMHCPQAAMEKGLADNLFVDY 1526
G T+ +L+Q+ PD F R + + E + LF+++
Sbjct: 2 GVGKGTLLAELIQEIPDAFERVVSHTTRPPRPGEVNGVDYHFVSKEEFEDDIKSGLFLEW 61
Query: 1527 -RKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQHVYPIVLLIKFKSTKQIK 1585
+G Y+ TS ++ +K H +LD+ GV++L + +YPIV+ I S+++++
Sbjct: 62 GEYEGNYYG-TSKETIRQVAEKGK-HCLLDIDPQGVKQLRKAQLYPIVIFIAPPSSEELE 119
Query: 1586 EIKEIRDGRYSSLDKVTAKAAKEMYEHGLKIECEHKHYISAVIP-AGVNIAYMCTQIQTA 1644
R T++ ++ K + H VI + AY +++
Sbjct: 120 RRLRQR-------GTETSERIQKRLAAAQKEA-QEYHLFDYVIVNDDLEDAY--EELKEI 169
Query: 1645 VELEQ 1649
+E EQ
Sbjct: 170 LEAEQ 174
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 70.9 bits (174), Expect = 5e-12
Identities = 66/306 (21%), Positives = 130/306 (42%), Gaps = 27/306 (8%)
Query: 379 MSALNKYETIKD-----EYDA-LRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQ 432
MS + + + I + E+D K ++L +++L+L +E ++Q E + +
Sbjct: 150 MSPVERRKIIDEIAGVAEFDRKKEKALEEL-EEVEENIERLDLIIDEK---RQQLERLRR 205
Query: 433 EKNTALR---------ERNG--LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVK 481
E+ A R E G L ++ A RQ + R+ +E L K+ ++ + K
Sbjct: 206 EREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEK 265
Query: 482 EINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
+ + + K +K L EE ++E + + E+ L + + R+++
Sbjct: 266 RLEEIEQLLEELNKKIKDLGEEEQLRVKE------KIGELEAEIASLERSIAEKERELED 319
Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQRE 601
E +L + L +IE L+REI RDK +E +L+E + A EE +E
Sbjct: 320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379
Query: 602 AFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKL 661
R E + + ++E NE++ L + Q+ + + N I + ++L
Sbjct: 380 FAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINEL 439
Query: 662 QAELQD 667
+ E +D
Sbjct: 440 EEEKED 445
Score = 57.0 bits (138), Expect = 9e-08
Identities = 55/313 (17%), Positives = 125/313 (39%), Gaps = 29/313 (9%)
Query: 330 ESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETI- 388
E L + L K+ ++R+L L + L + E +
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLE-----EELEKLTEEISELEKRLEEIEQLLEELN 278
Query: 389 -------KDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
++E ++++ +L ++ + + E+ +++ + E + L E
Sbjct: 279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEI 338
Query: 442 NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKD-LKRL 500
L+++ I + + R++ E A+++++ ED E+ + T+D LK
Sbjct: 339 EELERE----IEEER---KRRDKLTEEYAELKEELEDLRAELE-EVDKEFAETRDELKDY 390
Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQI 560
E+ +E + + E D + +E+++LSE+L I +E + EL E+K+ + +I
Sbjct: 391 REKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEI 450
Query: 561 ETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSR 620
+ + ++ D K +E DL+E + + ++E + E A + +S
Sbjct: 451 KKQEWKLEQLAADLSKYEQELYDLKEEY-------DRVEKELSKLQRELAEAEAQARASE 503
Query: 621 KERNEVEAASMDL 633
+ A L
Sbjct: 504 ERVRGGRAVEEVL 516
Score = 53.9 bits (130), Expect = 7e-07
Identities = 69/397 (17%), Positives = 158/397 (39%), Gaps = 64/397 (16%)
Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSA 381
A+L+ + L+ + S L S+ +R++ L EL E DA +
Sbjct: 674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQ------ELSDASRKIGEIE----- 722
Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
+ E ++ E + L++R ++L S+ ++E + E+ ++ + E++ ++ +
Sbjct: 723 -KEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL 781
Query: 442 NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT 501
N L+ + + + E ++ +E +++++ + + +++N + T + + L
Sbjct: 782 NDLEAR--LSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN-------RLTLEKEYLE 832
Query: 502 EERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIE 561
+E ++ + + S+ KE+E L+ + +++ LE ++L L + +
Sbjct: 833 KEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERD 892
Query: 562 TLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRK 621
L+ ++ ++ + R+R E A E + E + E + +
Sbjct: 893 ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE-----------LSEIEDPKG 941
Query: 622 ERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD-QSVNYLFILTESI 680
E E+ + L Q E +++ E++ + VN L +I
Sbjct: 942 EDEEIPEEELSL-EDVQAEL-----------------QRVEEEIRALEPVNML-----AI 978
Query: 681 QEAEVSKRR-------RDWAFSERDKI-EVIDSYHTK 709
QE E +R R ER I E I+ Y K
Sbjct: 979 QEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
Score = 50.1 bits (120), Expect = 1e-05
Identities = 48/230 (20%), Positives = 95/230 (41%), Gaps = 39/230 (16%)
Query: 309 ELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDA 368
E+ +LE A + + L ++ L+++ + L+RE+ E R
Sbjct: 302 EIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER------------ 349
Query: 369 GSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCE 428
D L +Y +K+E + LR +++ + D+L+ +E++ E
Sbjct: 350 KRRDKLT-------EEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKL-------E 395
Query: 429 DMNQEKNTALRERNGL---KQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINT 485
+ +E N RE + L Q+ + + + A+ E + +++++ ED EI
Sbjct: 396 KLKREINELKRELDRLQEELQRLSEELADLNAAI-AGIE--AKINELEEEKEDKALEIK- 451
Query: 486 AMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
K L++L + + QE + E D V KE+ KL +L +A
Sbjct: 452 ------KQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEA 495
Score = 48.5 bits (116), Expect = 3e-05
Identities = 58/345 (16%), Positives = 121/345 (35%), Gaps = 60/345 (17%)
Query: 301 RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC 360
R D +EL + ++E QE L+ + +L D L++E+ ++S
Sbjct: 702 NRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKS--- 758
Query: 361 TQPESDDAGSADSLNQHYMSALNKYE-TIKDEYDALRKRFDDLINTHSTAVDKLELAQEE 419
E + ++ + L+K E + D L I + ++ EE
Sbjct: 759 ---ELKEL---EARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLE------EE 806
Query: 420 VARIKKQCEDMNQ-------EKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL--- 469
V+RI+ + ++ Q EK +E L++Q ++ + E L
Sbjct: 807 VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQ----RIDLKEQIKSIEKEIENLNGK 862
Query: 470 -AKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKL 528
+++++ E+ + + K+ L + ++ + ++ + K + +L
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922
Query: 529 SEDLTQAVRKIKTLEMENKELVE------DKKTLSYQIETLKREIASALHD--------- 573
L ++ +E E E + + +++ ++ EI AL
Sbjct: 923 KAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEI-RALEPVNMLAIQEY 981
Query: 574 -------------RDKALKECNDLRERFGEFTAAKEECQREAFHA 605
R K +E + ER E+ K E EAF A
Sbjct: 982 EEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFEA 1026
Score = 47.0 bits (112), Expect = 1e-04
Identities = 54/336 (16%), Positives = 121/336 (36%), Gaps = 68/336 (20%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
KE+E + + +LE ++ SSL + ++ S+ + L+ + EL
Sbjct: 723 KEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE----------- 771
Query: 368 AGSADSLNQHYMSALNKYE-TIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
L+K E + D L I + ++ EEV+RI+ +
Sbjct: 772 ------------EDLHKLEEALNDLEARLSHSRIPEIQAELSKLE------EEVSRIEAR 813
Query: 427 CEDMNQ-------EKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL----AKVQQQ 475
++ Q EK +E L++Q ++ + E L +++++
Sbjct: 814 LREIEQKLNRLTLEKEYLEKEIQELQEQ----RIDLKEQIKSIEKEIENLNGKKEELEEE 869
Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
E+ + + K+ L + ++ + ++ + K + +L L
Sbjct: 870 LEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEAL 929
Query: 536 VRKIKTLEMENKELVE------DKKTLSYQIETLKREIAS-------ALHDRDKALKECN 582
++ +E E E + + +++ ++ EI + A+ + ++ LK +
Sbjct: 930 EEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLD 989
Query: 583 DL----------RERFGEFTAAKEECQREAFHARFE 608
+L R+ E E+ +RE F FE
Sbjct: 990 ELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFE 1025
Score = 43.5 bits (103), Expect = 0.001
Identities = 41/219 (18%), Positives = 93/219 (42%), Gaps = 19/219 (8%)
Query: 464 EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK 523
E+ K ++ E+ + I + + + L+RL ER A + Y ++ E+
Sbjct: 167 EFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKA-ERYQALLKEKREY-- 223
Query: 524 EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND 583
E +L ++ R+ + +E + L E+ + L+ +I L++ + ++ K+ D
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD 283
Query: 584 L--------RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRK----ERNEVEAASM 631
L +E+ GE A +R + E + + + E +++ A
Sbjct: 284 LGEEEQLRVKEKIGELEAEIASLER----SIAEKERELEDAEERLAKLEAEIDKLLAEIE 339
Query: 632 DLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSV 670
+L + ++ER + +E L++ + L+AEL++
Sbjct: 340 ELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDK 378
Score = 37.7 bits (88), Expect = 0.067
Identities = 49/277 (17%), Positives = 98/277 (35%), Gaps = 41/277 (14%)
Query: 291 LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR 350
L R + + L +LE + +A + ++E + + I + Q
Sbjct: 784 LEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRI 843
Query: 351 ELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAV 410
+L E + E+ + + L + ++ L+K D+L
Sbjct: 844 DLKEQIKSIEKEIENLNGKKEE-LEEELEELEAALRDLESRLGDLKKERDELEAQLRELE 902
Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
K+E + ++ + +K+ ++ + K AL E + I E E + +L
Sbjct: 903 RKIEELEAQIEKKRKRLSEL-KAKLEALEEEL-------SEIEDPKGEDEEIPEEELSLE 954
Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
VQ + + ++++ L A+QEY + V K +++L E
Sbjct: 955 DVQAE--------------LQRVEEEIRALEPVNMLAIQEY-------EEVLKRLDELKE 993
Query: 531 DLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREI 567
K LE E K ++E + E KRE+
Sbjct: 994 -------KRAKLEEERKAILE----RIEEYEKKKREV 1019
>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF). PDZ domains
are found in diverse signaling proteins.
Length = 80
Score = 61.5 bits (150), Expect = 1e-11
Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 2/79 (2%)
Query: 1240 LMIETRKCSNLGISLVGGNA--VGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATA 1297
+ +E LG SLVGG+ GI+V V G + GL+ GDRIL NG DL +
Sbjct: 2 VTLEKSGRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLSH 61
Query: 1298 EEAAYELAKPADKVTVLAQ 1316
+EA L +VT+
Sbjct: 62 DEAVLALKGSGGEVTLTIL 80
Score = 54.2 bits (131), Expect = 6e-09
Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 1137 SVEPLGIQIQ--CLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASV 1194
LG + G+FVS V A GLQ GD++L + G ++ + ++ A
Sbjct: 8 GRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLSHDEAVLA 67
Query: 1195 LRQCGNSITMLVQ 1207
L+ G +T+ +
Sbjct: 68 LKGSGGEVTLTIL 80
Score = 45.3 bits (108), Expect = 8e-06
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 208 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 265
G+++ ++ PG AA+ G L GDR+LSIN + ++ LS + L S +T+T L
Sbjct: 25 GIFVSEVLPGG-AAEAGGLQEGDRILSINGQDLENLSHDEA-VLALKGSGGEVTLTIL 80
Score = 45.3 bits (108), Expect = 8e-06
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 832 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 889
G+++ ++ PG AA+ G L GDR+LSIN + ++ LS + L S +T+T L
Sbjct: 25 GIFVSEVLPGG-AAEAGGLQEGDRILSINGQDLENLSHDEA-VLALKGSGGEVTLTIL 80
>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region) or
GLGF (after a conserved sequence motif). Many PDZ domains
bind C-terminal polypeptides, though binding to internal
(non-C-terminal) polypeptides and even to lipids has been
demonstrated. Heterodimerization through PDZ-PDZ domain
interactions adds to the domain's versatility, and PDZ
domain-mediated interactions may be modulated dynamically
through target phosphorylation. Some PDZ domains play a
role in scaffolding supramolecular complexes. PDZ domains
are found in diverse signaling proteins in bacteria,
archebacteria, and eurkayotes. This CD contains two
distinct structural subgroups with either a N- or
C-terminal beta-strand forming the peptide-binding groove
base. The circular permutation placing the strand on the
N-terminus appears to be found in Eumetazoa only, while
the C-terminal variant is found in all three kingdoms of
life, and seems to co-occur with protease domains. PDZ
domains have been named after PSD95(post synaptic density
protein), DlgA (Drosophila disc large tumor suppressor),
and ZO1, a mammalian tight junction protein.
Length = 70
Score = 58.9 bits (143), Expect = 8e-11
Identities = 29/65 (44%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYEL-AKPA 1308
LG S+ GG G+ V SV+ GS AGL+ GD IL NGTD++ T E+ A L +
Sbjct: 3 LGFSIRGGTEGGVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEVG 62
Query: 1309 DKVTV 1313
+KVT+
Sbjct: 63 EKVTL 67
Score = 54.6 bits (132), Expect = 3e-09
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 1140 PLGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASVLRQC- 1198
LG I+ GGV V +V S A + GLQ GD +L V G ++++ T + A +L++
Sbjct: 2 GLGFSIRGGTEGGVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEV 61
Query: 1199 GNSITMLV 1206
G +T+ V
Sbjct: 62 GEKVTLTV 69
Score = 43.4 bits (103), Expect = 2e-05
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 201 HGLTL----ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDS 256
G ++ E G+ + + PGS A + G L GD +L++N V L+ L +
Sbjct: 3 LGFSIRGGTEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAELLKKEV 61
Query: 257 PDVLTITTL 265
+ +T+T
Sbjct: 62 GEKVTLTVR 70
Score = 43.4 bits (103), Expect = 2e-05
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 825 HGLTL----ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDS 880
G ++ E G+ + + PGS A + G L GD +L++N V L+ L +
Sbjct: 3 LGFSIRGGTEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAELLKKEV 61
Query: 881 PDVLTITTL 889
+ +T+T
Sbjct: 62 GEKVTLTVR 70
>gnl|CDD|201353 pfam00625, Guanylate_kin, Guanylate kinase.
Length = 183
Score = 60.4 bits (147), Expect = 4e-10
Identities = 35/153 (22%), Positives = 65/153 (42%), Gaps = 19/153 (12%)
Query: 1475 RPVILVGAL---SDTVTDKLLQDFPDKFVRCAPEIMHCPQA--------------AMEKG 1517
RP++L G + LL ++P+KF P+ ME
Sbjct: 3 RPIVLSGPSGVGKSHIKKALLDEYPEKFGYSVSHTTRPPRPGEVDGKDYHFVSKEEMEND 62
Query: 1518 LADNLFVDYRKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQHVYPIVLLIK 1577
++ N F++Y + + TS A++ ++ ILDV + GV++L + + PI + IK
Sbjct: 63 ISANEFLEYAEFNGNYYGTSKEAIEQIA-ESGKICILDVDIQGVKQLRKAELSPISVFIK 121
Query: 1578 FKSTKQI-KEIKEIRDGRYSSLDKVTAKAAKEM 1609
S K + + +K + ++K A +E
Sbjct: 122 PPSLKVLQRRLKRRGTEQEEKINKRMEAAEQEF 154
>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ family.
This family consists of a set proteins various
designated DegP, heat shock protein HtrA, and protease
DO. The ortholog in Pseudomonas aeruginosa is designated
MucD and is found in an operon that controls mucoid
phenotype. This family also includes the DegQ (HhoA)
paralog in E. coli which can rescue a DegP mutant, but
not the smaller DegS paralog, which cannot. Members of
this family are located in the periplasm and have
separable functions as both protease and chaperone.
Members have a trypsin domain and two copies of a PDZ
domain. This protein protects bacteria from thermal and
other stresses and may be important for the survival of
bacterial pathogens.// The chaperone function is dominant
at low temperatures, whereas the proteolytic activity is
turned on at elevated temperatures [Protein fate, Protein
folding and stabilization, Protein fate, Degradation of
proteins, peptides, and glycopeptides].
Length = 428
Score = 57.6 bits (140), Expect = 3e-08
Identities = 49/200 (24%), Positives = 72/200 (36%), Gaps = 41/200 (20%)
Query: 1141 LGIQIQCLDS-----------GGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSA--- 1186
LG+ IQ + S G V+ V S A + GL+ GD + V G + S
Sbjct: 236 LGVTIQEVTSDLAKSLGLEKQRGALVAQVLPGSPAEKAGLKAGDVITSVNGKPISSFADL 295
Query: 1187 TYQLAAS---------VLRQCGN-SITMLVQYSPDKYHELEGSGSSSAENESVSGRGSGE 1236
+ +LR+ +IT+ + SP+ SSS ++
Sbjct: 296 RRAIGTLKPGKKVTLGILRKGKEKTITVTLGASPE------EQASSSNPFLGLTVANLSP 349
Query: 1237 PRFLMIETRKCSNLGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAAT 1296
E RK L G+ G+ V V SGS AGL+ GD IL N + +
Sbjct: 350 ------EIRKELRLK-----GDVKGVVVTKVVSGSPAARAGLQPGDVILSVNQQPVSSVA 398
Query: 1297 AEEAAYELAKPADKVTVLAQ 1316
AK +V +L
Sbjct: 399 ELRKVLARAKKGGRVALLIL 418
Score = 32.2 bits (74), Expect = 2.5
Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 4/46 (8%)
Query: 208 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALL 253
G+ + K+ GS AA+ G L GD +LS+N + V SS E +L
Sbjct: 363 GVVVTKVVSGSPAARAG-LQPGDVILSVNQQPV---SSVAELRKVL 404
Score = 32.2 bits (74), Expect = 2.5
Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 4/46 (8%)
Query: 832 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALL 877
G+ + K+ GS AA+ G L GD +LS+N + V SS E +L
Sbjct: 363 GVVVTKVVSGSPAARAG-LQPGDVILSVNQQPV---SSVAELRKVL 404
Score = 31.4 bits (72), Expect = 3.9
Identities = 26/102 (25%), Positives = 44/102 (43%), Gaps = 13/102 (12%)
Query: 780 SNVSKRMVLETVRSSGTSALMVIRRRRVGARSLVTTQLQLNNSYDHGLTLESGLYICKIS 839
SN++K V++ + G ++R +G T L S GL + G + ++
Sbjct: 215 SNMAKN-VVDQLIEGGK-----VKRGWLGVTIQEVTS-DLAKSL--GLEKQRGALVAQVL 265
Query: 840 PGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP 881
PGS A K G L GD + S+N + + SS + +
Sbjct: 266 PGSPAEKAG-LKAGDVITSVNGKPI---SSFADLRRAIGTLK 303
Score = 31.0 bits (71), Expect = 6.3
Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 202 GLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP 257
GL + G + ++ PGS A K G L GD + S+N + + SS + +
Sbjct: 252 GLEKQRGALVAQVLPGSPAEKAG-LKAGDVITSVNGKPI---SSFADLRRAIGTLK 303
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 57.1 bits (138), Expect = 7e-08
Identities = 50/305 (16%), Positives = 126/305 (41%), Gaps = 22/305 (7%)
Query: 390 DEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
++Y+ L + ++I +++LE E+ + + +E+ L++ ++++
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEE-- 224
Query: 450 AAIRQWDIALRERNEYQEALAKVQ---QQHEDAVKEINTAMAVRMKATKDLKRLTEERNA 506
++ + +E +E LA+++ ++ E+ + ++ ++A K + E
Sbjct: 225 ---QEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELER 281
Query: 507 AMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN---KELVEDKKTLSYQIETL 563
++E + + + +E+E+L E+L ++ LE K L E + L ++E L
Sbjct: 282 LLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKL 341
Query: 564 KREIASA---LHDRDKALKE-CNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSS 619
+ E+ ++ K L+E +L ER E + E R + +
Sbjct: 342 ESELEELAEEKNELAKLLEERLKELEERLEELEK-----ELEKALERLKQLEEAIQELKE 396
Query: 620 RKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTES 679
E+ AA ++ + ++ + E +E++ L + KL+ ++ L I +
Sbjct: 397 ELA--ELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELA 454
Query: 680 IQEAE 684
+
Sbjct: 455 GAGEK 459
Score = 52.8 bits (127), Expect = 2e-06
Identities = 65/430 (15%), Positives = 168/430 (39%), Gaps = 63/430 (14%)
Query: 304 DHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQP 363
+ ++ELE +A + +LE ++ SL + L ++L+ EL EL
Sbjct: 297 EREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEEL------AE 350
Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI 423
E ++ L + E +E + ++ + + A ++ +EE+A +
Sbjct: 351 EKNELAKLLE------ERLKELEERLEELEKELEKALERLKQLEEA---IQELKEELAEL 401
Query: 424 KKQCEDMNQEKNTALRERNGLKQQCT------AAIRQWDIALRERNEYQEALA------- 470
E++ +E +E L+++ + + L + LA
Sbjct: 402 SAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP 461
Query: 471 ------------KVQQQHEDAVKEINTAMAVRMKATK---DLKRLTEERNAAMQEYSLIM 515
++ + +E ++E+ ++ + + +++ L +E +E ++
Sbjct: 462 VCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELL 521
Query: 516 SERDSVHKEMEKLSEDLTQAVRKIKTL--EMENKELVEDKKTLSYQIETLKREIASALHD 573
+++ +E+E+ E L + +++ L +++ ++L E+ + L +++ LK +
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLL 581
Query: 574 RDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDL 633
R + + +LRER E +E + ++ E + + + NE+E A +L
Sbjct: 582 RTRKEEL-EELRERLKELKKKLKELEER--LSQLEELL---QSLELSEAENELEEAEEEL 635
Query: 634 FGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWA 693
+ ++ L E++ L + EL+++ L I+ +
Sbjct: 636 -----ESELEKLNLQAELEELLQ---AALEELEEKVEE----LEAEIRRELQRIENEEQL 683
Query: 694 FSERDKIEVI 703
+ +++E +
Sbjct: 684 EEKLEELEQL 693
Score = 47.8 bits (114), Expect = 5e-05
Identities = 49/294 (16%), Positives = 121/294 (41%), Gaps = 42/294 (14%)
Query: 301 RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC 360
+H + LE Y + + +L +E+ LR + +L + + L+ EL+EL
Sbjct: 467 LPEEHEKELLELYELELEELEEELSREKEEAE-LREEIEELEKELRELEEELIEL----- 520
Query: 361 TQPESDDAGSADSLNQHYMSALNKYETIK------------DEYDALRKRFDDLINTH-- 406
+ E + + + L + E +K + + + +L+
Sbjct: 521 LELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL 580
Query: 407 -STAVDKLELAQEEVARIKK---QCEDMNQEKNTALRERNGLKQ--------QCTAAIRQ 454
T ++LE +E + +KK + E+ + L+ + + + +
Sbjct: 581 LRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELE 640
Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLI 514
L + E +E L ++ E+ V+E+ + ++ ++ ++L E+ +E +
Sbjct: 641 ---KLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL----EELEQL 693
Query: 515 MSERDSVHKEMEKLSEDLT---QAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
E + + +E+E+L + L Q + ++++ + E +EL ++ + L +E L+
Sbjct: 694 EEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 56.2 bits (135), Expect = 1e-07
Identities = 74/343 (21%), Positives = 141/343 (41%), Gaps = 49/343 (14%)
Query: 307 LKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESD 366
LKELE + + +LE A +E + D EL +LR+ E
Sbjct: 63 LKELESTKRTVEDLKLKLEKAEKEEQQAKQ------------DSELAKLRA------EEL 104
Query: 367 DAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
+ G + + Y++A + +++K+E +R+ +D L+ A+ + E A +K+
Sbjct: 105 EQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKRAEEAICASKVNEKK 164
Query: 427 CEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTA 486
E++ +E A + A E+++ +E K ++ E ++ +
Sbjct: 165 VEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRETYEKELKEAEKELERLKQD 224
Query: 487 MAVRMKATKDLKRLTE--ERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT--- 541
+ KDL++L E ++Q+ IM+ SV KE+E+ +L +A + K+
Sbjct: 225 ----LDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRN 280
Query: 542 --------LEMENKELVEDKKTLSYQIETLKREIASALHDR---------DKALKECNDL 584
LE E KEL E + + E E AS+L KA ++ ++
Sbjct: 281 LVESLKQELEEEKKELEELR-----EKEGEAEEAASSLEAELNRTKSEKESKAREKMVEI 335
Query: 585 RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVE 627
+ + ++ EE ++EA AR E E + ++ E
Sbjct: 336 PLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAE 378
Score = 48.1 bits (114), Expect = 3e-05
Identities = 61/261 (23%), Positives = 110/261 (42%), Gaps = 13/261 (4%)
Query: 413 LELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKV 472
L LAQEE ++KK+ E +EK L+E K+ + + A +E + ++
Sbjct: 38 LMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEKEEQQAKQDSELA 97
Query: 473 QQQHEDAVKEI-NTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
+ + E+ + I + + AT +L + EE QEY ++ ERD+ K E
Sbjct: 98 KLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKRAE----- 152
Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERF-GE 590
+A+ K E + +EL ++ + +E A +R A E + RE + E
Sbjct: 153 --EAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRETYEKE 210
Query: 591 FTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKE----RMDNLE 646
A++E +R E + S+ E + E + M KE + + +
Sbjct: 211 LKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEK 270
Query: 647 ANQEIDRLRKHTDKLQAELQD 667
A +E LR + L+ EL++
Sbjct: 271 AAEEEKSLRNLVESLKQELEE 291
Score = 48.1 bits (114), Expect = 4e-05
Identities = 58/295 (19%), Positives = 108/295 (36%), Gaps = 7/295 (2%)
Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI-KKQCEDMNQEKNTAL 438
S E +K + + K A + E ++ + + ++ E ++
Sbjct: 68 STKRTVEDLKLKLEKAEKEEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVK 127
Query: 439 RERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLK 498
E ++Q+ A + + D AL+ E A +++ E+ KEI +A
Sbjct: 128 EELRKIRQEYDALVEERDAALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHL 187
Query: 499 RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV---RKIKTLEMENKELVEDKKT 555
EER A E + KE EK E L Q + + ++ L + EL +K
Sbjct: 188 EAEEERIGAALEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKE 247
Query: 556 LSYQ--IETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRM 613
+S + ++ +E+ A + +KA +E LR EE ++E R +
Sbjct: 248 ISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAE 307
Query: 614 GERDSSRKERNEV-EAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD 667
S E N K + + +A+ E + RK + + EL+
Sbjct: 308 EAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRK 362
Score = 37.7 bits (87), Expect = 0.049
Identities = 58/277 (20%), Positives = 114/277 (41%), Gaps = 21/277 (7%)
Query: 308 KELEYYRGQHQAA--MAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC--TQP 363
KELE + + +L A+ E SL+ + S + S + +EL E ++ +
Sbjct: 216 KELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVA-SVLKELEEAKANLEKAAEE 274
Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALR-KRFDDLINTHSTAVDKLELAQEEVAR 422
E +SL Q + E K E + LR K + S + E+ ++
Sbjct: 275 EKSLRNLVESLKQ-------ELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKESK 327
Query: 423 IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKE 482
+++ ++ + A E +++ AA + E + + AL + + E A+KE
Sbjct: 328 AREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAALKE 387
Query: 483 INTAMAVRMKATKDLKRLTEERNAAMQEYS----LIMSERDSV---HKEMEKLSE-DLTQ 534
A A A ++K L E +A + L + E + + +E E+L+E +
Sbjct: 388 AEAAKAAEALALAEIKALQESEESAKADSPRKITLSLEEYEELSKKAEEAEELAEKKVAA 447
Query: 535 AVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
A+ +++ + E ++ + S +IE K + AL
Sbjct: 448 ALAQVEEAKESENESLKKLEEASKEIEEKKAALEEAL 484
Score = 33.8 bits (77), Expect = 0.91
Identities = 40/227 (17%), Positives = 81/227 (35%), Gaps = 27/227 (11%)
Query: 461 ERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDL---KRLTEERNAAMQEYSLIMSE 517
+ L Q++ KE+ A +++ K+L KR E+ +++ E
Sbjct: 30 RVLVKETELMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAE---KE 86
Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEMENKEL-VEDKKTLSYQIETLKREIASALHDRDK 576
++ E + + I+ LE+E + ++ ++ +++E + + +RD
Sbjct: 87 EQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDA 146
Query: 577 ALKEC--------------NDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSR-- 620
ALK +L + + E A E E+D R
Sbjct: 147 ALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRET 206
Query: 621 --KERNEVEAASMDLFGKC--QKERMDNLEANQEIDRLRKHTDKLQA 663
KE E E L +K+ EA+ E++ L+K + +
Sbjct: 207 YEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMAS 253
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 54.7 bits (132), Expect = 4e-07
Identities = 84/409 (20%), Positives = 160/409 (39%), Gaps = 68/409 (16%)
Query: 293 VRQHNETVRRGDHTLKE-LEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRE 351
V E + D L++ LE R QA + E+ +++ L + +L + L+ E
Sbjct: 312 VEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESE 371
Query: 352 LLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVD 411
L E R + E + E +++E + LR+RF D A D
Sbjct: 372 LEEAREAVEDRRE-------------------EIEELEEEIEELRERFGDAPVDLGNAED 412
Query: 412 KLELAQEEVARIKKQCEDM---------NQEKNTALRERNGLKQQCTAAIRQWDIA---- 458
LE +EE ++++ ++ E+ AL E G +C +
Sbjct: 413 FLEELREERDELREREAELEATLRTARERVEEAEALLEA-GKCPECGQPVEGSPHVETIE 471
Query: 459 -LRERNEYQEA-LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
RER E EA L ++++ E+ + + A + +A ++RL E R + LI
Sbjct: 472 EDRERVEELEAELEDLEEEVEEVEERLERAEDLV-EAEDRIERLEERREDLEE---LIAE 527
Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR-D 575
R+++ ++ E+ E +A LE E +E KRE A+ + +
Sbjct: 528 RRETIEEKRERAEELRERA----AELEAEAEE---------------KREAAAEAEEEAE 568
Query: 576 KALKECNDLRERFGEFTAAKEECQR--EAFHARFEYAMRMGERDSSRKERNEVEAASMDL 633
+A +E +L + E E +R A + + R+ E+ +
Sbjct: 569 EAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRER 628
Query: 634 FGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQE 682
+ ++ER LEA + R+ ++ + + + ++ YL + E + E
Sbjct: 629 LAE-KRERKRELEAEFDEARI----EEAREDKE-RAEEYLEQVEEKLDE 671
Score = 45.4 bits (108), Expect = 3e-04
Identities = 42/193 (21%), Positives = 84/193 (43%), Gaps = 17/193 (8%)
Query: 516 SERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD 575
SE + +E+E+ E QA T + ++ L E ++ ++ETL+ EI
Sbjct: 213 SELAELDEEIERYEEQREQARE---TRDEADEVLEEHEERRE-ELETLEAEIEDLRETIA 268
Query: 576 KALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFG 635
+ +E +L E + EE + E E + + ++ R E+E +L
Sbjct: 269 ETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328
Query: 636 KCQKERMDNLEANQEIDRLRKHTDKLQ---AELQDQSVNYLFILTESIQEAEVSKRRRDW 692
+ ++ R+ N+E + LR+ D L+ EL++++ L ++EA + R
Sbjct: 329 RLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAE----LESELEEAREAVEDR-- 382
Query: 693 AFSERDKIEVIDS 705
R++IE ++
Sbjct: 383 ----REEIEELEE 391
Score = 41.9 bits (99), Expect = 0.003
Identities = 63/381 (16%), Positives = 133/381 (34%), Gaps = 66/381 (17%)
Query: 309 ELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDA 368
E+E Y Q + A + A + + +L + L+ E+ +LR T E++
Sbjct: 221 EIERYEEQREQARETRDEADEVLEEHEERREEL----ETLEAEIEDLRE---TIAETER- 272
Query: 369 GSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLEL----------AQE 418
+ E + +E LR+R ++L + + L +E
Sbjct: 273 ---------------EREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARRE 317
Query: 419 EVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHED 478
E+ ++ D +E A + N + L ER E +++++ +
Sbjct: 318 ELEDRDEELRDRLEECRVAAQAHNEEAESLREDADD----LEERAE------ELREEAAE 367
Query: 479 AVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRK 538
E+ A ++++ L EE + + + + +E+L E+ + +
Sbjct: 368 LESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELRER 427
Query: 539 IKTLEM----------ENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERF 588
LE E + L+E K + + + ++E E
Sbjct: 428 EAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELE-- 485
Query: 589 GEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEAN 648
+E + E R E + +R +ER E + ER + +E
Sbjct: 486 ---DLEEEVEEVEERLERAEDLVEAEDRIERLEERRE-------DLEELIAERRETIEEK 535
Query: 649 QE-IDRLRKHTDKLQAELQDQ 668
+E + LR+ +L+AE +++
Sbjct: 536 RERAEELRERAAELEAEAEEK 556
Score = 39.6 bits (93), Expect = 0.014
Identities = 58/319 (18%), Positives = 122/319 (38%), Gaps = 50/319 (15%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSG--------- 358
+E+E R + A L A LR + +L + L+ L R
Sbjct: 391 EEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLE 450
Query: 359 ---C--CTQP--ESDDAGSADSLNQHYMSALNKYETIKDEYDALR-------------KR 398
C C QP S + + + + E +++E + + R
Sbjct: 451 AGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDR 510
Query: 399 FDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
+ L + + +E + +++ E++ + E ++ A + + A
Sbjct: 511 IERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEA 570
Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTE------ERNAAMQEYS 512
E E LA+++++ E +++ I T +A A +++RL E E N +E
Sbjct: 571 REEVAELNSKLAELKERIE-SLERIRTLLAAIADAEDEIERLREKREALAELNDERRER- 628
Query: 513 LIMSERDSVHKEM---------EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
++E+ +E+ E+ ED +A ++ +E + EL E++ L +I +
Sbjct: 629 --LAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAV 686
Query: 564 KREIAS--ALHDRDKALKE 580
+ E+ L +R +AL+
Sbjct: 687 ENELEELEELRERREALEN 705
Score = 34.6 bits (80), Expect = 0.48
Identities = 37/183 (20%), Positives = 68/183 (37%), Gaps = 13/183 (7%)
Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR---EIASALHDRDKALKECN 582
EK +DL ++ LE E EL E+ + Q E + E L + ++ +E
Sbjct: 198 EKEEKDL---HERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELE 254
Query: 583 DLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERM 642
L + E +RE R E A + + R+ E+E DL + +
Sbjct: 255 TLEAEIEDLRETIAETERE----REELAEEVRDL---RERLEELEEERDDLLAEAGLDDA 307
Query: 643 DNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEV 702
D + L ++L+ L++ V E+ E + + A R++
Sbjct: 308 DAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAE 367
Query: 703 IDS 705
++S
Sbjct: 368 LES 370
>gnl|CDD|212793 cd11859, SH3_ZO, Src homology 3 domain of the Tight junction
proteins, Zonula occludens (ZO) proteins. ZO proteins
are scaffolding proteins that associate with each other
and with other proteins of the tight junction, zonula
adherens, and gap junctions. They play roles in
regulating cytoskeletal dynamics at these cell junctions.
They are considered members of the MAGUK
(membrane-associated guanylate kinase) protein family,
which is characterized by the presence of a core of three
domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
domain in MAGUK proteins is enzymatically inactive;
instead, the domain mediates protein-protein interactions
and associates intramolecularly with the SH3 domain.
Vertebrates contain three ZO proteins (ZO-1, ZO-2, and
ZO-3) with redundant and non-redundant roles. They
contain three PDZ domains, followed by SH3 and GuK
domains; in addition, ZO-1 and ZO-2 contains a
proline-rich (PR) actin binding domain at the C-terminus
while ZO-3 contains this PR domain between the second and
third PDZ domains. The C-terminal regions of the three ZO
proteins are unique. The SH3 domain of ZO-1 has been
shown to bind ZONAB, ZAK, afadin, and Galpha12. SH3
domains are protein interaction domains that bind to
proline-rich ligands with moderate affinity and
selectivity, preferentially to PxxP motifs. They play
versatile and diverse roles in the cell including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies.
Length = 62
Score = 46.9 bits (112), Expect = 1e-06
Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
Y+R F + +L F K ++ +V +T++ G G W+A V + + G+IP+K
Sbjct: 1 YIRTHFDY--EKPAKGELSFKKGEVFHVVDTLYQGTVGSWQAVRVGRNHQELERGVIPNK 58
Query: 1394 YKVE 1397
+ E
Sbjct: 59 SRAE 62
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 51.7 bits (124), Expect = 3e-06
Identities = 78/360 (21%), Positives = 142/360 (39%), Gaps = 60/360 (16%)
Query: 318 QAAMAQLEAAAQESSSLRS-KYSDLISDKQRLDRELLELRSG------CCTQPESDDAGS 370
Q +AQLE ++ + L S KY+ L S ++L E L + C + E +
Sbjct: 220 QERLAQLEGSSLGPNQLGSKKYNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKE-LAE 278
Query: 371 ADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDM 430
N S + + +KDE D LR+ + DK + + +V KK+ ED+
Sbjct: 279 LQHRNDELTSLAAESQALKDEIDVLRE-----------SSDKAKKLEAQVETYKKKLEDL 327
Query: 431 NQEKNT--ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA 488
N + L ERN + Q T Q + L++ N + L ++Q ++ +++
Sbjct: 328 NDLRRQVKLLEERNAMYMQNT---VQLEEELKKANAARGQLETYKRQVQELHAKLSEESK 384
Query: 489 VRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKE 548
K + KRL E+ A +E +++ERDS L + +++ + + +
Sbjct: 385 KADKLEFEYKRLEEKLEALQKEKERLLAERDS-----------LRETNEELRCGQAQQDQ 433
Query: 549 LVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFE 608
L + S + L E+ + + RE+ Q E
Sbjct: 434 LTQADAGTSPSGDNLAAELLPS------------EYREKL-------IRLQHE------N 468
Query: 609 YAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQ 668
+R+G+ S + E++ D + + ANQ I L++ + LQ LQ+Q
Sbjct: 469 KMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQ 528
Score = 47.5 bits (113), Expect = 6e-05
Identities = 99/502 (19%), Positives = 193/502 (38%), Gaps = 84/502 (16%)
Query: 132 NTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ---QHEDAVKEINTAMAVTN 188
A E++ L Q+C +Q + E+N Q+ K+Q+ Q E + N +
Sbjct: 182 KEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKY 241
Query: 189 YLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSARE 248
L QL + LE+ +I L ++ L+ D L+S
Sbjct: 242 NLLQSQLEQLQEENFRLEAARDDYRIKCEEL----------EKELAELQHRNDELTSLAA 291
Query: 249 GMALLSDSPDVLTITTLK----PTSLGEHSPRVH-----RKSVNSSTQTMPLYV---RQH 296
L D DVL ++ K + + ++ R+ V + +Y+ Q
Sbjct: 292 ESQALKDEIDVLRESSDKAKKLEAQVETYKKKLEDLNDLRRQVKLLEERNAMYMQNTVQL 351
Query: 297 NETVRRGDHTLKELEYYR-------GQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRL- 348
E +++ + +LE Y+ + + + E L K L +K+RL
Sbjct: 352 EEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLL 411
Query: 349 -DRELL-----ELRSG-----CCTQPESDDAGSADSLNQHYMSA---------------- 381
+R+ L ELR G TQ ++ + S D+L + +
Sbjct: 412 AERDSLRETNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKML 471
Query: 382 -LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE 440
L + + + L++ +D ++ +L LA + + +++Q ED+ + AL+E
Sbjct: 472 RLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQK----ALQE 527
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
+ + + + + L + +E E L K ++Q E+ + + ++ ++ + +
Sbjct: 528 QGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQK 587
Query: 501 TEERNAAMQE-YSLIMSERDSVHK-----------EMEKLSEDLTQAVRKIKTLEMENKE 548
+E AM+E Y + + V K E++ L + LT+ ++I+ LE E
Sbjct: 588 KDEDMRAMEERYKKYVEKAREVIKTLDPKQNPASPEIQLLKKQLTERDKRIRHLESE--- 644
Query: 549 LVEDKKTLSYQIETLKREIASA 570
E K + Q E L I SA
Sbjct: 645 -YEKAKPMREQEEKL---IVSA 662
Score = 39.4 bits (92), Expect = 0.018
Identities = 41/219 (18%), Positives = 89/219 (40%), Gaps = 15/219 (6%)
Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
L K + + + +++ +L S K + + E R++++ E + +EK L ER
Sbjct: 355 LKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLLAER 414
Query: 442 NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT 501
+ L++ + ++++ +A A ++ E+ + R K L RL
Sbjct: 415 DSLRE----TNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPS-EYREK----LIRLQ 465
Query: 502 EERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIE 561
E L + + S ++ + +L + L A R+ LE + + + L Q+E
Sbjct: 466 HENKM------LRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVE 519
Query: 562 TLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
L++ + + + + L E + A EE Q+
Sbjct: 520 DLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQK 558
Score = 39.0 bits (91), Expect = 0.025
Identities = 55/240 (22%), Positives = 97/240 (40%), Gaps = 25/240 (10%)
Query: 435 NTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ---QHEDAVKEINTAMAVR- 490
A E++ L Q+C +Q + E+N Q+ K+Q+ Q E + N + +
Sbjct: 182 KEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKY 241
Query: 491 MKATKDLKRLTEER---NAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENK 547
L++L EE AA +Y + E + E++ +++LT + + L+ E
Sbjct: 242 NLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQALKDEID 301
Query: 548 ELVE--DK-KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFH 604
L E DK K L Q+ET K K L++ NDLR + EE
Sbjct: 302 VLRESSDKAKKLEAQVETYK-----------KKLEDLNDLRRQ----VKLLEERNAMYMQ 346
Query: 605 ARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAE 664
+ + + +++R + + +L K +E + E RL + + LQ E
Sbjct: 347 NTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKE 406
Score = 30.5 bits (69), Expect = 8.5
Identities = 32/167 (19%), Positives = 64/167 (38%), Gaps = 19/167 (11%)
Query: 506 AAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
A+QE +MS+ E Q +K + KE E+K L+ + L +
Sbjct: 148 TAIQE---LMSKEQGSSPSRESAGNLDQQ----LKKALEDLKEAQEEKDELAQRCHELDK 200
Query: 566 EIASALHDRDKALKECNDLRERFGEFTAAKEE----CQREAFHARFEYAMRMGERDSSRK 621
++ +++ +E L+ER + + ++ + + + ++
Sbjct: 201 QVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKYNLLQSQL-------EQLQE 253
Query: 622 ERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQ 668
E +EAA D KC++ + E D L + QA L+D+
Sbjct: 254 ENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQA-LKDE 299
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 52.0 bits (125), Expect = 3e-06
Identities = 83/469 (17%), Positives = 178/469 (37%), Gaps = 61/469 (13%)
Query: 295 QHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISD-KQRL----D 349
+ E ++R + L +L R QL A SL + ++ +
Sbjct: 508 EALEALQRAERRLLQL---RQALDELELQLSPQA---GSLLHFLRNEAPGWEESIGKVIS 561
Query: 350 RELLELRSGCCTQPESDDAGSADSLN--QHYMSALNKYETIKDEYDALRKRFDDLINTHS 407
ELLE P+ + +D+L + L+ + +E LR+R
Sbjct: 562 PELLER---TDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANE-TELRERLQQAEEALQ 617
Query: 408 TAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALR-ERNEYQ 466
+AV K + A+E++ + + E+ + + A +Q +++ + +++ +
Sbjct: 618 SAVAKQKQAEEQLVQANAELEEQKRAEAEARTA----LKQARLDLQRLQNEQQSLKDKLE 673
Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLK-----RLTEERNAAMQEYSLIMSERDSV 521
A+A+ +QQ E +++++ + ++ + E R + ++ ++ E D+
Sbjct: 674 LAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQ 733
Query: 522 HKEMEKLSEDL-TQAVRKIKTLEMENKELVEDK-------KTLSYQIETLKREIASALHD 573
++ E TQA ++K L+ + + K L QIE L+ I
Sbjct: 734 LAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVR 793
Query: 574 RDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSS----RKERNEVEAA 629
R + + ++E + + +EE A+++ E +SS ++E +
Sbjct: 794 RPEVREYRAFMQETWLHRDSLREERPN--------LAIQLRELESSAEELQQELTRLIKD 845
Query: 630 SMDLFGKCQKERMDN----LEANQEIDRLRKHTDKLQAELQDQSVNYL-FILTESIQEAE 684
+ K ++ER + ++ + LR +L + + N ++E + + E
Sbjct: 846 TKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQLAELKEPANANQAEGSISERLDQLE 905
Query: 685 VSKRRRDWAFSERDK-IEVIDSYHTKHAGRHKRGKELDKNGGTWPKARS 732
KR+R + K +E D H G L + W K RS
Sbjct: 906 EFKRKRKRLSGDLKKFLERFDGVIKDH-----SGSGLYET---WQKLRS 946
Score = 40.1 bits (94), Expect = 0.014
Identities = 65/407 (15%), Positives = 131/407 (32%), Gaps = 79/407 (19%)
Query: 311 EYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGS 370
E + QLEA E + + + RL R L S T E +
Sbjct: 420 EDLQALESQLRQQLEAGKLEFNEEEYEL------ELRLGRLKQRLDSATATPEELEQLEI 473
Query: 371 ADSLNQHYMSALNKY----ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
D + + E ++ E LRKR D+ + A +L ++ + ++ Q
Sbjct: 474 NDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQ 533
Query: 427 C--------EDMNQEK-----------NTALRERNGLKQQCTAAIR-------QWDIALR 460
+ E + L ER L Q D+
Sbjct: 534 LSPQAGSLLHFLRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRL 593
Query: 461 ERNEY-------QEALAKVQ---QQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
+ +Y +E L + + Q K+ + ++ KR E A+++
Sbjct: 594 DVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQ 653
Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
L + +R ++ K + + ++ + L Q++ L + +
Sbjct: 654 ARLDL-QRLQNEQQSLK--------DKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAF 704
Query: 571 LHDRDKALKECNDLRERFGEFTAA--KEECQREAFHARFEYAMRMGERDSSRKERNEVEA 628
L +ALK+ D RE E A E + + A+ A+ + K R +
Sbjct: 705 L----EALKD--DFRELRTERLAKWQVVEGELDNQLAQLSAAIE--AARTQAKARLK--- 753
Query: 629 ASMDLFGKCQKERMDNLEAN----QEIDRLRKHTDKLQAELQDQSVN 671
+ +K+ L + + L++ ++L+ ++ +V
Sbjct: 754 -------ELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVR 793
Score = 38.5 bits (90), Expect = 0.037
Identities = 45/271 (16%), Positives = 91/271 (33%), Gaps = 50/271 (18%)
Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAI----RQWDIALRERNEYQ 466
++ E QE R+++Q + + A E N A + + ++ ++ ++
Sbjct: 281 EEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFE 340
Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEME 526
+A + Q D + I + + + L LT + ++Y + +
Sbjct: 341 DADIEQLQADLDQLPSIRSELE---EVEARLDALTGKHQDVQRKYE----------RLKQ 387
Query: 527 KLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRE 586
K+ E L + + K E + + + K I L + LR+
Sbjct: 388 KIKEQLERDLEKNN----------ERLAAIREEKDRQKAAIEEDLQ------ALESQLRQ 431
Query: 587 RFGEFTAAKEECQREAFHARFEYAMRMGERDSSRK------ERNEVEAASMDLFGKCQKE 640
+ A K E E + +R+G E E + + K Q+E
Sbjct: 432 QLE---AGKLEFNEEEYE----LELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEE 484
Query: 641 ----RMDNLEANQEIDRLRKHTDKLQAELQD 667
+ + E+ +LRK D+ LQ
Sbjct: 485 QEQAEANVEQLQSELRQLRKRRDEALEALQR 515
Score = 37.4 bits (87), Expect = 0.076
Identities = 71/420 (16%), Positives = 148/420 (35%), Gaps = 67/420 (15%)
Query: 313 YRGQHQAAMAQLEAAAQESSSLRSKYSDLISD-KQRLDRELLELRSGCCTQPESDDAGSA 371
R + + A+L+A + ++ KY L K++L+R+L + E D A
Sbjct: 357 IRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKA 416
Query: 372 DSLNQHY---MSALNK-YETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIK--- 424
+ + S L + E K E++ + + +D EE+ +++
Sbjct: 417 A-IEEDLQALESQLRQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEIND 475
Query: 425 KQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV---- 480
+ E +E+ A L+ + ++ D AL + L +++Q ++
Sbjct: 476 EALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLS 535
Query: 481 ----------------------KEINTAMAVRMKATKDL--KRLTEERNAAMQEYSLIMS 516
K I+ + R DL + + + + SL +
Sbjct: 536 PQAGSLLHFLRNEAPGWEESIGKVISPELLER----TDLDPQLVEGSDSDTLYGVSLDLQ 591
Query: 517 --ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR 574
+ +L E L QA +++ + K+ E + ++E KR A A
Sbjct: 592 RLDVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEAR--- 648
Query: 575 DKALKECNDLRER-FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDL 633
ALK+ +R E + K++ + + + + +++A L
Sbjct: 649 -TALKQARLDLQRLQNEQQSLKDKLEL----------AIAERKQQAETQLRQLDAQLKQL 697
Query: 634 FGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSV---NYLFILTESIQEAEVSKRRR 690
+++ LEA + D R+ + A+ Q N L L+ +I+ A + R
Sbjct: 698 ----LEQQQAFLEALK--DDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKAR 751
>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant zinc
metalloprotases, presumably membrane-associated or
integral membrane proteases, which may be involved in
signalling and regulatory mechanisms. May be responsible
for substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, and binding to
internal (non-C-terminal) polypeptides and even to lipids
has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 79
Score = 46.1 bits (110), Expect = 4e-06
Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 8/80 (10%)
Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAK-PA 1308
LG GG + + V GS AGL+ GDRIL NG + + E+ + + P
Sbjct: 3 LGFV-PGGPPIEPVIGEVVPGSPAAKAGLKAGDRILAINGQ--KIKSWEDLVDAVQENPG 59
Query: 1309 DKVTVLAQYSVERYNEVKDK 1328
+T+ +VER E
Sbjct: 60 KPLTL----TVERNGETITL 75
Score = 34.1 bits (79), Expect = 0.060
Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 1155 VSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASVLRQCGNSITMLVQ 1207
+ V S A++ GL+ GD++L + G ++S L +V G +T+ V+
Sbjct: 16 IGEVVPGSPAAKAGLKAGDRILAINGQKIKSWE-DLVDAVQENPGKPLTLTVE 67
Score = 31.8 bits (73), Expect = 0.45
Identities = 14/29 (48%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 213 KISPGSLAAKEGSLAVGDRVLSINNRTVD 241
++ PGS AAK G L GDR+L+IN + +
Sbjct: 18 EVVPGSPAAKAG-LKAGDRILAINGQKIK 45
Score = 31.8 bits (73), Expect = 0.45
Identities = 14/29 (48%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 837 KISPGSLAAKEGSLAVGDRVLSINNRTVD 865
++ PGS AAK G L GDR+L+IN + +
Sbjct: 18 EVVPGSPAAKAG-LKAGDRILAINGQKIK 45
>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain.
Length = 81
Score = 44.9 bits (107), Expect = 9e-06
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 4/78 (5%)
Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPAD 1309
+G+ +V G+ V SV+ GS AGL+ GD IL +G + + T KP D
Sbjct: 3 IGVRVVQNEGTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSLTELIEVILNGKPGD 62
Query: 1310 KVTVLAQYSVERYNEVKD 1327
V + +V R + K
Sbjct: 63 TVKL----TVYRDGKKKT 76
Score = 34.5 bits (80), Expect = 0.041
Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 206 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP-DVLTIT 263
+G+ + + GS AAK G L GD +LSI+ + V+ L+ E +L+ P D + +T
Sbjct: 12 GTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIE--VILNGKPGDTVKLT 67
Score = 34.5 bits (80), Expect = 0.041
Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 830 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP-DVLTIT 887
+G+ + + GS AAK G L GD +LSI+ + V+ L+ E +L+ P D + +T
Sbjct: 12 GTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIE--VILNGKPGDTVKLT 67
Score = 32.6 bits (75), Expect = 0.24
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 1141 LGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSAT 1187
+G+++ + GV V +V E S A++ GL+ GD +L + G + S T
Sbjct: 3 IGVRVVQNEGTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSLT 49
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 50.0 bits (119), Expect = 1e-05
Identities = 89/440 (20%), Positives = 157/440 (35%), Gaps = 62/440 (14%)
Query: 304 DHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQP 363
+ KE+E R A A+ + AAQE S +++ L ++ + L + ++R
Sbjct: 269 NRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQTELETLANQDSDMR------- 321
Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI 423
D L + A + ++ E DALR + NT + L+ AQEE A
Sbjct: 322 -----QHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATY 376
Query: 424 KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEI 483
+ EDM R+R ++ R+ Q+ + +Q+ + +
Sbjct: 377 AGEIEDM--------RDR-------------YEKTERKLRVLQKKIENLQETFRRKERRL 415
Query: 484 NTAMAVRMKATKDL---KRLTEERNAAMQEYSLIMSERDSVHKEMEKL-SEDLTQAVRKI 539
R+++ + E+ A+ E I+ E+ E+ ++
Sbjct: 416 KEE-KERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEF 474
Query: 540 KTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEEC- 598
+ L+ E + L Q+E LK E++ ++ K+ +DL E +E+
Sbjct: 475 EDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQL---KQRSDLERAHIELEKIREKHE 531
Query: 599 ----QREAFHARFEYAMRMGE--------RDSSRKERNEVEAASMDLFGKCQKERMDNLE 646
+ + A E A R R S RNEV+ +D K ++ER D
Sbjct: 532 KLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRL-LDRLEKAEQERDD--- 587
Query: 647 ANQEIDRLRKHTDKLQAELQDQSVNYLFILTE----SIQEAEVSKRRRDWAFSERDKIEV 702
E RL K +K Q L Q E S + E ++R E +
Sbjct: 588 TEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQS 647
Query: 703 IDSYHTKHAGRHKRGKELDK 722
H + A + + K
Sbjct: 648 TQIGHAQAAAHNHIEHHVQK 667
Score = 33.4 bits (76), Expect = 1.4
Identities = 54/282 (19%), Positives = 107/282 (37%), Gaps = 19/282 (6%)
Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
+ E + E + L+K + + ++++ ++ + ++ + T +E+
Sbjct: 4 QLENAQRENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLR 63
Query: 444 LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE 503
+ Q+ + AL+E + Q L ++QQ+ E + E + K+ R+ E
Sbjct: 64 VTQEENQHAQLTVQALQEELKSQRELNRLQQKLEKSGNETGELQQLE-KSEGQFSRIQAE 122
Query: 504 RNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
R E L+ + + ME+L + L +IK L + K E L
Sbjct: 123 RERQAMELFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERL 182
Query: 564 KREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKER 623
R + + + L E+ + + RE H + + ERD ++ E
Sbjct: 183 GR-----IREAEDQQSHLEVLLEQKEK----EHMMLREEIHQKLQ-----MERDDAKTEA 228
Query: 624 NEVEAASMDLFGKCQKERMDNLE-ANQEIDRLRKHTDKLQAE 664
++ +D KE LE A EI RL+ D +A+
Sbjct: 229 SQ---KLIDEKDTKIKEFEKMLEKAENEIYRLQSRCDTSEAD 267
>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
proteases, such as DegP/HtrA, which are oligomeric
proteins involved in heat-shock response, chaperone
function, and apoptosis. May be responsible for substrate
recognition and/or binding, as most PDZ domains bind
C-terminal polypeptides, though binding to internal
(non-C-terminal) polypeptides and even to lipids has been
demonstrated. In this subfamily of protease-associated
PDZ domains a C-terminal beta-strand forms the
peptide-binding groove base, a circular permutation with
respect to PDZ domains found in Eumetazoan signaling
proteins.
Length = 90
Score = 44.2 bits (105), Expect = 2e-05
Identities = 22/53 (41%), Positives = 28/53 (52%)
Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPADKVTV 1313
G+ V SV GS AGL+ GD IL NG +++ A KP DKVT+
Sbjct: 25 GVLVASVDPGSPAAKAGLKPGDVILAVNGKPVKSVADLRRALAELKPGDKVTL 77
Score = 36.5 bits (85), Expect = 0.010
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 202 GLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSARE 248
GL G+ + + PGS AAK G L GD +L++N + V ++ R
Sbjct: 19 GLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRR 64
Score = 36.5 bits (85), Expect = 0.010
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 826 GLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSARE 872
GL G+ + + PGS AAK G L GD +L++N + V ++ R
Sbjct: 19 GLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRR 64
Score = 31.8 bits (73), Expect = 0.55
Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 13/79 (16%)
Query: 1141 LGIQIQCL-----------DSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQ 1189
LG+ +Q L D+ GV V++V S A++ GL+ GD +L V G ++S
Sbjct: 3 LGVTVQDLTPDLAEELGLKDTKGVLVASVDPGSPAAKAGLKPGDVILAVNGKPVKSVA-D 61
Query: 1190 LAASVLR-QCGNSITMLVQ 1207
L ++ + G+ +T+ V
Sbjct: 62 LRRALAELKPGDKVTLTVL 80
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 46.1 bits (110), Expect = 4e-05
Identities = 21/105 (20%), Positives = 43/105 (40%), Gaps = 11/105 (10%)
Query: 460 RERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERD 519
+ ++ L +++Q+ + +I R + TEER ++E + E
Sbjct: 69 TRLEKLKKELEELKQRIAELQAQIEKLKKGREE--------TEERTELLEELKQLEKELK 120
Query: 520 SVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
+ E+EK ++ + + K+K E K E + I +LK
Sbjct: 121 KLKAELEKYEKNDPERIEKLK---EETKVAKEAANRWTDNIFSLK 162
Score = 30.7 bits (70), Expect = 4.0
Identities = 13/68 (19%), Positives = 28/68 (41%), Gaps = 8/68 (11%)
Query: 108 DGKLKKKEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 167
K K + ++KK+ E++ Q L+ Q + + ER E E L
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRI-------AELQAQIEKLKKGREETE-ERTELLEELK 113
Query: 168 KVQQQHED 175
+++++ +
Sbjct: 114 QLEKELKK 121
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 47.0 bits (112), Expect = 6e-05
Identities = 39/235 (16%), Positives = 83/235 (35%), Gaps = 15/235 (6%)
Query: 359 CCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQE 418
+ D + + + K +D+ L K+ L ++ +L +
Sbjct: 28 AAAFSAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETAD 87
Query: 419 EVARIKKQCEDMNQEKN---TALRERNG-LKQQCTAAIRQWD---IALRERNEYQEALAK 471
++ +++KQ D+N N RE+ L +Q A R AL E +A
Sbjct: 88 DLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPE--DAQRS 145
Query: 472 V--QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLS 529
V + + K L + E A E + ++SE+ + ++ +L
Sbjct: 146 VRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLL 205
Query: 530 EDLTQAVRK----IKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
E+ + + + + + + +EL ++ L +I + + A A A
Sbjct: 206 EERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAA 260
Score = 36.6 bits (85), Expect = 0.11
Identities = 37/212 (17%), Positives = 69/212 (32%), Gaps = 28/212 (13%)
Query: 295 QHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLE 354
+ +R +LE + +A LEA E++ K I+D L+ L
Sbjct: 49 ALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIAD---LNARLNA 105
Query: 355 LRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLE 414
L Q A+ L S N AL +D + A+
Sbjct: 106 LE----VQEREQRRRLAEQLAALQRSGRN-------PPPALLVSPEDAQRSVRLAIYYGA 154
Query: 415 LAQEEVARI---KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
L RI K + + + E+ L L E+ Q LA+
Sbjct: 155 LNPARAERIDALKATLKQLAAVRAEIAAEQAELTT-----------LLSEQRAQQAKLAQ 203
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEE 503
+ ++ + + ++N+ ++ K ++L+
Sbjct: 204 LLEERKKTLAQLNSELSADQKKLEELRANESR 235
Score = 33.9 bits (78), Expect = 0.70
Identities = 42/217 (19%), Positives = 74/217 (34%), Gaps = 33/217 (15%)
Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
+ + KE+ L + + + + LE + K L + I +L+ ++ D K
Sbjct: 39 QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETE-------IASLEAQLIETADDLKK 91
Query: 577 ALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMG------ERDSSRKERNEVEAAS 630
K+ DL R + E QR + R G S + V A
Sbjct: 92 LRKQIADLNARLNALEVQERE-QRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAI 150
Query: 631 MDLFGKCQKER---MDNLEANQ-EIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVS 686
+G R +D L+A ++ +R QAEL L+E +
Sbjct: 151 --YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTL-------LSEQRAQQAKL 201
Query: 687 KRRRDWAFSERDKIEVIDSYHTKHAGRHKRGKELDKN 723
+ ER K +++ + K+ +EL N
Sbjct: 202 AQLL----EERKKTL--AQLNSELSADQKKLEELRAN 232
>gnl|CDD|212961 cd12028, SH3_ZO-3, Src homology 3 domain of the Tight junction
protein, Zonula occludens protein 3. ZO-3 is a
scaffolding protein that associates with other ZO
proteins and other proteins of the tight junction, zonula
adherens, and gap junctions. ZO proteins play roles in
regulating cytoskeletal dynamics at these cell junctions.
ZO-3 is critical for epidermal barrier function. It
regulates cyclin D1-dependent cell proliferation. It is
considered a member of the MAGUK (membrane-associated
guanylate kinase) protein family, which is characterized
by the presence of a core of three domains: PDZ, SH3, and
guanylate kinase (GuK). The GuK domain in MAGUK proteins
is enzymatically inactive; instead, the domain mediates
protein-protein interactions and associates
intramolecularly with the SH3 domain. The C-terminal
region of ZO-3 is the smallest of the three ZO proteins.
The SH3 domain of the related protein ZO-1 has been shown
to bind ZONAB, ZAK, afadin, and Galpha12. SH3 domains are
protein interaction domains that bind to proline-rich
ligands with moderate affinity and selectivity,
preferentially to PxxP motifs. They play versatile and
diverse roles in the cell including the regulation of
enzymes, changing the subcellular localization of
signaling pathway components, and mediating the formation
of multiprotein complex assemblies.
Length = 65
Score = 42.2 bits (99), Expect = 7e-05
Identities = 22/67 (32%), Positives = 33/67 (49%), Gaps = 2/67 (2%)
Query: 1331 DSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
DSFY+R F D L F + ++ +V +TM G G W A + D + GII
Sbjct: 1 DSFYIRTHFDYEPD--PPSGLSFTRGEVFHVLDTMHRGKLGSWLAVRMGRDLREMEKGII 58
Query: 1391 PSKYKVE 1397
P++ + E
Sbjct: 59 PNQSRAE 65
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 47.4 bits (113), Expect = 8e-05
Identities = 57/264 (21%), Positives = 112/264 (42%), Gaps = 42/264 (15%)
Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKN--TALR 439
+ + E I++ K ++++ + +L +EE+ +++K+ +++ + K L
Sbjct: 185 IKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELE 244
Query: 440 ERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
+ + + R+ + +RE E E L K ++ E+ VKE+ K+LK
Sbjct: 245 KELESLEG---SKRKLEEKIRELEERIEELKKEIEELEEKVKEL-----------KELKE 290
Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQ 559
EE + Y + E + K + +L E++ +IK LE + + L E KK +
Sbjct: 291 KAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKK----K 346
Query: 560 IETLKREIAS-----ALHDRDKALK-ECNDLRERFGEFT------------AAKEECQRE 601
++ L++ + L++ KA K E L++R T AKEE + E
Sbjct: 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406
Query: 602 AFHARFEYAMRMGERDSSRKERNE 625
+ R+GE KE +
Sbjct: 407 ----ISKITARIGELKKEIKELKK 426
Score = 46.6 bits (111), Expect = 1e-04
Identities = 52/231 (22%), Positives = 106/231 (45%), Gaps = 34/231 (14%)
Query: 384 KYETIKDEYDALR----------KRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
+YE +K++ L+ ++ ++L + KL+ +EE+A + K+ E++ E
Sbjct: 526 EYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585
Query: 434 KNTALRERNG-LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMK 492
L ER L+ + D A +E ++ L K++++ + A +E+ +
Sbjct: 586 SVEELEERLKELEPFYNEYLELKD-AEKELEREEKELKKLEEELDKAFEELA-------E 637
Query: 493 ATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVED 552
K L+ L +E ++YS +E E+L E+ + R++ L E +EL +
Sbjct: 638 TEKRLEELRKELEELEKKYS---------EEEYEELREEYLELSRELAGLRAELEELEKR 688
Query: 553 KKTLSYQIETLKREIAS------ALHDRDKALKECNDLRERFGEFTAAKEE 597
++ + +E LK E+ L +KAL+ +LRE+ ++ A +E
Sbjct: 689 REEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKE 739
Score = 38.9 bits (91), Expect = 0.030
Identities = 48/213 (22%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSER 518
++ R E E K + E+ +KE K+L+ + E N E + E
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKE----------KEKELEEVLREINEISSELPELREEL 223
Query: 519 DSVHK---EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD 575
+ + K E+E+L E++ + +++++LE ++L E + L +IE LK+EI
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI-------- 275
Query: 576 KALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFG 635
+L E+ E KE+ + EY + E E+E L
Sbjct: 276 ------EELEEKVKELKELKEKAE--------EYIKLSEFYEEYLDELREIEKRLSRLEE 321
Query: 636 KCQ--KERMDNLEA-NQEIDRLRKHTDKLQAEL 665
+ +ER+ LE + ++ L+K +L+ L
Sbjct: 322 EINGIEERIKELEEKEERLEELKKKLKELEKRL 354
Score = 38.5 bits (90), Expect = 0.035
Identities = 59/393 (15%), Positives = 139/393 (35%), Gaps = 91/393 (23%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
K L + +++ E + +L + L++ L EL
Sbjct: 314 KRLSRLEEEINGIEERIK----ELEEKEERLEELKKKLKELEKRLEELEE---------- 359
Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDL--------INTHSTAVDKLELAQEE 419
+ + K+E + L+KR L + A +++E +
Sbjct: 360 ----------RHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISK 409
Query: 420 VARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRER--NEYQEALAKVQQQHE 477
+ + + +E A+ E K +C R+ R+ EY L +++++ +
Sbjct: 410 ITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELK 469
Query: 478 DA----------VKEINTAMA-----VRMKATKDLKRLTEER---------NAAMQEYSL 513
+ ++E+ + +++K + + EE+ +EY
Sbjct: 470 EIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEK 529
Query: 514 IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA----S 569
+ + + E++ L ++L K++ L+ + EL + L ++ L +E+
Sbjct: 530 LKEKLIKLKGEIKSLKKEL----EKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585
Query: 570 ALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAA 629
++ + ++ LKE + E A++E +R E +K E++ A
Sbjct: 586 SVEELEERLKELEPFYNEYLELKDAEKELER--------------EEKELKKLEEELDKA 631
Query: 630 SMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQ 662
+L E + ++ LRK ++L+
Sbjct: 632 FEEL-----------AETEKRLEELRKELEELE 653
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 47.1 bits (112), Expect = 8e-05
Identities = 35/268 (13%), Positives = 85/268 (31%), Gaps = 16/268 (5%)
Query: 402 LINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRE 461
L+ + E + ++ E+ + + A ++Q A+ + L
Sbjct: 671 LLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQWYDALL--EAGLGG 728
Query: 462 RNEYQEALAKVQQQHE--DAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIM--SE 517
R AL ++ + ++ + +DL R EE A + + M +
Sbjct: 729 RASPAGALDALELLQNIKEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTP 788
Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKA 577
D + +++ + + L E +E ++ + ++ + + + L
Sbjct: 789 ADETARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLR--AAR 846
Query: 578 LKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKC 637
+L A + R R R + A + L
Sbjct: 847 CTTIEELL------AAVERSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAALDPDE 900
Query: 638 QKERMDNLEANQEIDRLRKHTDKLQAEL 665
R++ L ++I+ L + ++L E+
Sbjct: 901 LPARLEEL--ARDIEELEEELNELAQEV 926
Score = 42.4 bits (100), Expect = 0.002
Identities = 57/360 (15%), Positives = 123/360 (34%), Gaps = 59/360 (16%)
Query: 293 VRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDREL 352
+++ E ++E++ + + L+ A +E ++LR + L +KQRL+R
Sbjct: 156 LKELKELEAE----IREVQLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLERL- 210
Query: 353 LELRSGCCTQPESDDAGSADSLN--QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAV 410
L + L + D +R+++ A
Sbjct: 211 ---------------RRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAAR 255
Query: 411 DKLELAQEEVARIKKQCEDMN-----QEKNTALRERNGLKQQCTAAIRQWDIALRE--RN 463
LEL E + ++ + ++++ + A+ + + + A + D+ RE
Sbjct: 256 RNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNA--EQDLPDREGEIA 313
Query: 464 EYQEALAKVQQQ-----HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSER 518
+EA A + Q E+AV+ + ++A + + + L + + A Q
Sbjct: 314 NAREAAAALLAQIGPDADEEAVESLRPSLAAK----ETVTELEKRKEALDQALKSARDAL 369
Query: 519 DSVHKEMEKLSEDLTQAVRKIKTLEMENK-ELVEDKKTLSYQIETLKREIASALHDRDKA 577
+ +E++++ L + + + L++E+A A
Sbjct: 370 EERERELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVA-------VA 422
Query: 578 LKECNDLRERFGEFTAAKEEC---------QREAFHARFEYAMRMGERDSSRKERNEVEA 628
+E R G + EE Q +AF E M + R E+E
Sbjct: 423 KRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAE--EIMQAKRQKRDRLLELEE 480
Score = 34.7 bits (80), Expect = 0.57
Identities = 62/391 (15%), Positives = 127/391 (32%), Gaps = 34/391 (8%)
Query: 309 ELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR----ELLELRSGCCTQPE 364
E R + +AA LE + +L+++ ++ D++ L + E L + G E
Sbjct: 243 RYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAE 302
Query: 365 SD---DAGSADSLNQHYMSALNKYETIKDEYD--------ALRKRFDDLINTHSTAVDKL 413
D G + + + L + DE A ++ +L L
Sbjct: 303 QDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEALDQAL 362
Query: 414 ELAQEEVARIKKQCEDMNQEKNT--ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
+ A++ + +++ + + + + L+ A+R DI QE
Sbjct: 363 KSARDALEERERELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVAVA 422
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
+++ A+ + + E+ A ++E IM + + +L ED
Sbjct: 423 -KRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEED 481
Query: 532 LTQAVRKIKTL----EMENKELVEDKKTLSYQIETLKREIASALHDR----DKALKECND 583
L +++ L + +E V + + R + L ++A++ +
Sbjct: 482 LETLELQLRHLDAAGAVPTEEEVAAARARRDALWQDIRRGYALLEPTASAYEEAVESADQ 541
Query: 584 LRER-FGEFTAAKE----ECQREAFHARFEYA---MRMGERDSSRKERNEVEAASMDLFG 635
L +R E E Q EA R E + + E + +
Sbjct: 542 LADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQAQWAAAGLP 601
Query: 636 KCQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
E D L + K +AEL
Sbjct: 602 LTPAEMEDWLAERATAREQVRAYFKARAELD 632
Score = 31.7 bits (72), Expect = 4.4
Identities = 33/168 (19%), Positives = 67/168 (39%), Gaps = 10/168 (5%)
Query: 365 SDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLIN-THSTAVDKLELAQEEVARI 423
D A +A+ L + A + D R L+ T +++L A E
Sbjct: 804 RDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVERSDTY 863
Query: 424 KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEI 483
++ + + + T +R GL + A + A + +E L ++ + E+ +E+
Sbjct: 864 RELRKRIAALERTLVRAGGGLSLEALVA----EAAALDPDELPARLEELARDIEELEEEL 919
Query: 484 NTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
N A ++L R+ AA + +ER+S+ ++ L+E
Sbjct: 920 NELAQEVGAAKQELARMDGGSTAA-----ELEAERESLLAQLRDLAER 962
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 46.3 bits (111), Expect = 1e-04
Identities = 33/153 (21%), Positives = 66/153 (43%), Gaps = 38/153 (24%)
Query: 461 ERNEYQEALAK----VQQQHEDAVKEIN------------TAMAVRMKATKDLKRLTEER 504
E EY+ + Q++ E+A +E+ T KD + LTEE
Sbjct: 109 ESEEYEARKEEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEE 168
Query: 505 NAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
A+ E ER+ + +++++L E+L + +R+++ LE E +E ++ L
Sbjct: 169 FEALPE-----EEREELEEKIDELEEELQEILRQLRELEREARE----------KLRELD 213
Query: 565 REIASALHDRDKALKEC-NDLRERFGEFTAAKE 596
RE+A A+ ++L+E++ +
Sbjct: 214 REVALF------AVGPLLDELKEKYADLPKVLA 240
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 45.0 bits (107), Expect = 1e-04
Identities = 25/180 (13%), Positives = 62/180 (34%), Gaps = 22/180 (12%)
Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
E + + K LK+ E A + + E + + ++ +L ++ + +IK
Sbjct: 17 LEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIK 76
Query: 541 TLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
E + + +++ ++ L EI A + E +L E + E+
Sbjct: 77 RAEEKLSAVKDER-----ELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIED--- 128
Query: 601 EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDK 660
++ + + A L + + R + E + + + L++ D
Sbjct: 129 --------------LKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLDP 174
Score = 43.5 bits (103), Expect = 4e-04
Identities = 36/170 (21%), Positives = 69/170 (40%), Gaps = 15/170 (8%)
Query: 460 RERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY-----SLI 514
E ++AL K + + E K + A+ + ++ ++ E ++E +
Sbjct: 24 PRIKEIRKALKKAKAELEALNKAL-EALEIELEDLENQVSQLESEIQEIRERIKRAEEKL 82
Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR 574
+ +D +E+ L+ ++ A +I +LE E EL+E+ + L +IE LK +
Sbjct: 83 SAVKDE--RELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLE------ 134
Query: 575 DKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERN 624
+ K + R E A E +E R E ++ S ER
Sbjct: 135 -RLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLDPELLSEYERI 183
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 46.2 bits (109), Expect = 1e-04
Identities = 45/216 (20%), Positives = 85/216 (39%), Gaps = 4/216 (1%)
Query: 452 IRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
+ Q D + + L Q + A E A + KA ++ + + +E AA Q
Sbjct: 73 VFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNL 132
Query: 512 SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
+ E + K+ + L L + + LE + + L +K L LK ++
Sbjct: 133 AKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLK 192
Query: 572 HDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASM 631
+ +E +L R A EE R A A + A + +RD+ ++ + AA
Sbjct: 193 LRSAQIEQEAQNLATRANAAQARTEELARRA-AAAQQTAQAIQQRDAQISQKAQQIAARA 251
Query: 632 DLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD 667
+ + ++ ++ LE RL + +L+A Q
Sbjct: 252 EQIRE-RERQLQRLE--TAQARLEQEVAQLEAYYQA 284
Score = 43.1 bits (101), Expect = 0.001
Identities = 19/138 (13%), Positives = 51/138 (36%)
Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
+L + E+ + + E+ A E +Q+ A ++ A + + Q+ LA+
Sbjct: 82 QLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELAR 141
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
+ +Q +D + T R + + L + + + S+ + ++ ++
Sbjct: 142 LTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQE 201
Query: 532 LTQAVRKIKTLEMENKEL 549
+ + +EL
Sbjct: 202 AQNLATRANAAQARTEEL 219
Score = 38.9 bits (90), Expect = 0.024
Identities = 36/238 (15%), Positives = 79/238 (33%), Gaps = 41/238 (17%)
Query: 298 ETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRS 357
V + D +L R + A + AA E + RS+ K R +RE +
Sbjct: 71 SGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSE-----LQKARQEREAVRQEL 125
Query: 358 GCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQ 417
+ Q+ A + + + L+ R
Sbjct: 126 --------------AAARQNLAKAQQELARLTKQAQDLQTR------------------- 152
Query: 418 EEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHE 477
+ + +Q + + + + L Q ++ + L+ R+ E A+
Sbjct: 153 --LKTLAEQRRQLEAQAQSLQASQKQL-QASATQLKSQVLDLKLRSAQIEQEAQNLATRA 209
Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
+A + +A R A + + ++R+A + + + ++ R +E E+ + L A
Sbjct: 210 NAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETA 267
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 45.1 bits (107), Expect = 2e-04
Identities = 52/274 (18%), Positives = 105/274 (38%), Gaps = 43/274 (15%)
Query: 327 AAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYE 386
+E+S L K +L + REL E A L + + +
Sbjct: 32 LRKEASELAEKRDELNAKV----RELREK---------------AQELREERDEINEEVQ 72
Query: 387 TIKDEYDALRKRFDDL---INTHSTAVDKLELAQEEVARIKK---QCEDMNQEKNTALRE 440
+K++ D + + +L ++ L + +++ + E Q E
Sbjct: 73 ELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEE 132
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
L Q+ R E ++A K +++E + ++ KA + +++
Sbjct: 133 ERELVQKIK----------ELRKELEDA-KKALEENEKLKELKAEIDELKKKAREIHEKI 181
Query: 501 TEERNAAMQEYSLIM----SERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTL 556
E N A QEY M E D + KE ++L E+ + +KI L E + L + + L
Sbjct: 182 QELANEA-QEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELREL 240
Query: 557 SYQIETLKREIASA--LHDRDKALKECNDLRERF 588
+I+ L+ + +A R++ + ++ E+F
Sbjct: 241 EKKIKALRAKEKAAKRREKREELKERAEEIYEKF 274
Score = 42.0 bits (99), Expect = 0.002
Identities = 47/269 (17%), Positives = 115/269 (42%), Gaps = 36/269 (13%)
Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALR 439
K + +K+E + L+++ D+L S +K + +V ++++ +++ +E++
Sbjct: 10 ELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINE 69
Query: 440 ERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR---MKATKD 496
E LK+ +R+E L ++++++ + ++ N ++
Sbjct: 70 EVQELKE--------------KRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLERE 115
Query: 497 LKRL----------TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN 546
++RL EE +Q+ + + K+ + +E L + +I L+ +
Sbjct: 116 IERLEKKQQTSVLTPEEERELVQKIKEL-RKELEDAKKALEENEKLKELKAEIDELKKKA 174
Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHAR 606
+E+ E + L+ + + E+ + D+ KE ++L E F E + +E E + +
Sbjct: 175 REIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQ 234
Query: 607 FEY--------AMRMGERDSSRKERNEVE 627
E A+R E+ + R+E+ E
Sbjct: 235 NELRELEKKIKALRAKEKAAKRREKREEL 263
Score = 41.6 bits (98), Expect = 0.003
Identities = 68/327 (20%), Positives = 134/327 (40%), Gaps = 72/327 (22%)
Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
K + +EE+ +K++ +++ +E + +R+ L + + ER+E E +
Sbjct: 13 LKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEV- 71
Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
Q+ ++ EIN + K ++LK E N + ++ L
Sbjct: 72 ---QELKEKRDEINAKLQELRKEYRELKEKRNEFN--------------LGGRSIKSLER 114
Query: 531 DLTQAVRKIKTLEM---ENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
++ + +K +T + E +ELV+ +I+ L++E L D KAL+E L+E
Sbjct: 115 EIERLEKKQQTSVLTPEEERELVQ-------KIKELRKE----LEDAKKALEENEKLKEL 163
Query: 588 FGEFTAAKEECQREAFHARF--------EYAMRM----GERDSSRKERNEVEAASMDLFG 635
E K++ H + EY M E D RKE +E+
Sbjct: 164 KAEIDELKKK--AREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHE------- 214
Query: 636 KCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFS 695
+ +E +++ID L + LQ EL++ L ++++ E + +RR+
Sbjct: 215 -------EFVELSKKIDELHEEFRNLQNELRE-----LEKKIKALRAKEKAAKRREKREE 262
Query: 696 ERDKIEVIDSYHTKHAGRHKRGKELDK 722
+++ E I + KRG++L
Sbjct: 263 LKERAEEIYE-------KFKRGEKLTT 282
Score = 36.2 bits (84), Expect = 0.10
Identities = 50/236 (21%), Positives = 99/236 (41%), Gaps = 19/236 (8%)
Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
L L +R +E + +RD + KE +L+E + K++ L + +EL E++
Sbjct: 5 LDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREER 64
Query: 554 KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAA---KEECQREAFHARFEYA 610
++ +++ LK + + KE +L+E+ EF + +RE R E
Sbjct: 65 DEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREI--ERLEK- 121
Query: 611 MRMGERDSSRKERNEVEAASMDLFGKCQKERMD---NLEANQEIDRLRKHTDKLQAELQD 667
+ + +ER V+ + +KE D LE N+++ L+ D+L+ + ++
Sbjct: 122 KQQTSVLTPEEERELVQKIK-----ELRKELEDAKKALEENEKLKELKAEIDELKKKARE 176
Query: 668 QSVNYLFILTESIQEAEVSKRRRDWAFSERDKI-EVIDSYHTKHAGRHKRGKELDK 722
+ E EA+ F E D++ + D H + K+ EL +
Sbjct: 177 ----IHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHE 228
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 44.9 bits (106), Expect = 4e-04
Identities = 59/322 (18%), Positives = 134/322 (41%), Gaps = 36/322 (11%)
Query: 362 QPESDDAGSAD-SLNQHYMSALNKYETIKDEYDALRKRFDDL---INTHSTAVDKLELAQ 417
Q E D S D + + + + + +++ YD L+ D L I+ +KL+ +
Sbjct: 137 QGEMDSLISGDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSN 196
Query: 418 EEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHE 477
E+ IKKQ D + + L+E L + A+ ++ NE +L ++ ++E
Sbjct: 197 LELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELS-SLEDMKNRYE 255
Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNA-----------AMQEYSLIMSERDSVHKEME 526
+K + +++ ++ K L E + +Y ++ ++ + +
Sbjct: 256 SEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILS 315
Query: 527 KLSEDLTQ---AVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK---ALKE 580
+ ++ + ++K+ L+ + + ++ K + + L +I LK
Sbjct: 316 NIDAEINKYHAIIKKLSVLQKDYNDYIKKKS----RYDDLNNQILELEGYEMDYNSYLKS 371
Query: 581 CNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKE 640
L+++ E++ K + AF + ++ + D+ +KE NE+ D+ K
Sbjct: 372 IESLKKKIEEYS--KNIERMSAFISEI-LKIQEIDPDAIKKELNEINVKLQDISSK---- 424
Query: 641 RMDNLEANQEIDRLRKHTDKLQ 662
+ +L NQ I LR++ D+L
Sbjct: 425 -VSSL--NQRIRALRENLDELS 443
Score = 39.9 bits (93), Expect = 0.015
Identities = 63/366 (17%), Positives = 128/366 (34%), Gaps = 60/366 (16%)
Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED--MNQEKNTALRE 440
N Y K YD L + +L L+ + +KK+ E+ N E+ +A
Sbjct: 339 NDYIKKKSRYDDLNNQILELEGYEMDYNSYLK----SIESLKKKIEEYSKNIERMSAFIS 394
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
Q+ D +E NE L Q V +N R++A ++
Sbjct: 395 EILKIQEIDP-----DAIKKELNEINVKL----QDISSKVSSLN----QRIRALRE-NLD 440
Query: 501 TEERNAAMQEYSLIMS------ERDSVHKEMEKLSEDLTQAVRKIKTLEME----NKELV 550
RN M + + + + +E ++ KI+ +E+E ++++V
Sbjct: 441 ELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIV 500
Query: 551 EDKKTLSY-------QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR--- 600
+ KK Y + +I SA D + + N+L+++ ++ K +
Sbjct: 501 DLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKL 560
Query: 601 EAFHARF-EYAMRMGERDS-----SRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRL 654
E ++ + + +R NE++ DL + R+ +E
Sbjct: 561 EDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDL-----ESRLQEIEIG--FPDD 613
Query: 655 RKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDSYHTKHAGRH 714
+ + DK E+++++ N E + +K + R KI+ +
Sbjct: 614 KSYIDKSIREIENEANNL----NNKYNEIQENKILIE---KLRGKIDNYKKQIAEIDSII 666
Query: 715 KRGKEL 720
KE+
Sbjct: 667 PDLKEI 672
>gnl|CDD|212795 cd11861, SH3_DLG-like, Src Homology 3 domain of Disks large homolog
proteins. The DLG-like proteins are scaffolding proteins
that cluster at synapses and are also called PSD
(postsynaptic density)-95 proteins or SAPs
(synapse-associated proteins). They play important roles
in synaptic development and plasticity, cell polarity,
migration and proliferation. They are members of the
MAGUK (membrane-associated guanylate kinase) protein
family, which is characterized by the presence of a core
of three domains: PDZ, SH3, and guanylate kinase (GuK).
The GuK domain in MAGUK proteins is enzymatically
inactive; instead, the domain mediates protein-protein
interactions and associates intramolecularly with the SH3
domain. DLG-like proteins contain three PDZ domains and
varying N-terminal regions. All DLG proteins exist as
alternatively-spliced isoforms. Vertebrates contain four
DLG proteins from different genes, called DLG1-4. DLG4
and DLG2 are found predominantly at postsynaptic sites
and they mediate surface ion channel and receptor
clustering. DLG3 is found axons and some presynaptic
terminals. DLG1 interacts with AMPA-type glutamate
receptors and is critical in their maturation and
delivery to synapses. The SH3 domain of DLG4 binds and
clusters the kainate subgroup of glutamate receptors via
two proline-rich sequences in their C-terminal tail. It
also binds AKAP79/150 (A-kinase anchoring protein). SH3
domains are protein interaction domains that bind to
proline-rich ligands with moderate affinity and
selectivity, preferentially to PxxP motifs. They play
versatile and diverse roles in the cell including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies.
Length = 61
Score = 39.6 bits (93), Expect = 4e-04
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 6/63 (9%)
Query: 1334 YMRALFSRTGDLGDAL---QLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
Y+RALF L L F DIL+V N W+A V +G ++ G+I
Sbjct: 1 YVRALFDYDPSRDSGLPSQGLSFKFGDILHVTNA---SDDEWWQARRVTPNGEEEEVGVI 57
Query: 1391 PSK 1393
PSK
Sbjct: 58 PSK 60
>gnl|CDD|214620 smart00326, SH3, Src homology 3 domains. Src homology 3 (SH3)
domains bind to target proteins through sequences
containing proline and hydrophobic amino acids.
Pro-containing polypeptides may bind to SH3 domains in 2
different binding orientations.
Length = 56
Score = 39.4 bits (93), Expect = 4e-04
Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 10/64 (15%)
Query: 1331 DSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
+ +RAL+ T D +L F K DI+ V W W R + G+
Sbjct: 1 EGPQVRALYDYTAQDPD--ELSFKKGDIITVLEK----SDDGW--WKGRLG--RGKEGLF 50
Query: 1391 PSKY 1394
PS Y
Sbjct: 51 PSNY 54
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 44.4 bits (106), Expect = 5e-04
Identities = 76/356 (21%), Positives = 137/356 (38%), Gaps = 78/356 (21%)
Query: 384 KYETIKDEYDALRKRFDDLINTHS--TAVDKLELAQEEVARIKKQCEDMNQEKN-----T 436
+ E +KD Y LRK L N S A+D+LE +++ ++++ +
Sbjct: 141 EVEQLKDLYRELRKSL--LANRFSFGPALDELE---KQLENLEEEFSQFVELTESGDYVE 195
Query: 437 ALRERNGLKQQCTAAIRQWDIA-----LRE-RNEYQEALAKVQQQHEDAV---------- 480
A + L+++ AA+ Q + L+E + E + L +++ + + V
Sbjct: 196 AREILDQLEEE-LAALEQ-IMEEIPELLKELQTELPDQLQELKAGYRELVEEGYHLDHLD 253
Query: 481 --KEINTAMAVRMKATKDLKRL----TEERNAAMQE-----YSLIMSE---RDSVHKEME 526
KEI + L+ L EE+N +QE Y ++ E R V K +
Sbjct: 254 IEKEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKYVEKNSD 313
Query: 527 KLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRE 586
L + L A + K L+ E + + SY + S L + K+ L +
Sbjct: 314 TLPDFLEHAKEQNKELKEEIDRVKQ-----SYTLN------ESELESVRQLEKQLESLEK 362
Query: 587 RFGEFTAAKEECQREAF---HARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMD 643
++ E T E Q A+ E + K+ E+E L Q R D
Sbjct: 363 QYDEITERIAE-QEIAYSELQEELEEIL---------KQLEEIEKEQEKLSEMLQGLRKD 412
Query: 644 NLEANQEIDRLRK--HTDKLQAE------LQDQSVNYLFILTESIQE--AEVSKRR 689
LEA ++++R R H K E L + + F +++ I+ E+ ++
Sbjct: 413 ELEAREKLERYRNKLHEIKRYLEKSNLPGLPEDYLEMFFEVSDEIEALAEELEEKP 468
Score = 30.6 bits (70), Expect = 8.4
Identities = 53/337 (15%), Positives = 115/337 (34%), Gaps = 88/337 (26%)
Query: 391 EYDALRKRFDDLINTHSTAVDK-LELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
+++ R+++D+++ +++ L A+E + + + ++ + +
Sbjct: 65 KFEEWRQKWDEIVTNSLPDIEEQLFEAEELNDKFR------------FRKAKHEINE--- 109
Query: 450 AAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN--AA 507
D+ + + E L ++ + E +E+ + + K L L + A
Sbjct: 110 -IESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSL--LANRFSFGPA 166
Query: 508 MQEYSLIMSERDSVHKEMEKLSE--DLTQAVRKIKTLEMENKEL-----------VEDKK 554
+ E + + + +L+E D +A + LE E L E +
Sbjct: 167 LDELEKQLENLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQT 226
Query: 555 TLSYQIETLK---RE------------IASALHDRDKALKECNDLRERFGEFTAAKEECQ 599
L Q++ LK RE I + D + + E L E + A+E+ +
Sbjct: 227 ELPDQLQELKAGYRELVEEGYHLDHLDIEKEIQDLKEQIDENLALLEEL-DLDEAEEKNE 285
Query: 600 R-----EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRL 654
+ + E + K R VE + D L ++
Sbjct: 286 EIQERIDQLYDILEREV---------KARKYVE------------KNSDTLP--DFLEHA 322
Query: 655 RKHTDKLQAELQ--DQSVNYLFILTESIQEAEVSKRR 689
++ +L+ E+ QS + L ES E+ R
Sbjct: 323 KEQNKELKEEIDRVKQS----YTLNES----ELESVR 351
>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
tail-specific-, and tricorn proteases, which function in
posttranslational protein processing, maturation, and
disassembly or degradation, in Bacteria, Archaea, and
plant chloroplasts. May be responsible for substrate
recognition and/or binding, as most PDZ domains bind
C-terminal polypeptides, and binding to internal
(non-C-terminal) polypeptides and even to lipids has been
demonstrated. In this subfamily of protease-associated
PDZ domains a C-terminal beta-strand forms the
peptide-binding groove base, a circular permutation with
respect to PDZ domains found in Eumetazoan signaling
proteins.
Length = 85
Score = 39.9 bits (94), Expect = 6e-04
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 1/54 (1%)
Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELA-KPADKVTV 1313
G+ + SV GS AG++ GD I+ +G + + E+ L K KV +
Sbjct: 14 GLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVRL 67
Score = 35.7 bits (83), Expect = 0.020
Identities = 14/68 (20%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 1141 LGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASVLR-QCG 1199
+G++++ D GG+ +++V S A++ G++ GD ++ + G + + + +LR + G
Sbjct: 4 IGLELKY-DDGGLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAG 62
Query: 1200 NSITMLVQ 1207
+ + ++
Sbjct: 63 TKVRLTLK 70
Score = 34.1 bits (79), Expect = 0.067
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 202 GLTL---ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSA 246
GL L + GL I + PGS AAK G + GD +++I+ VDGLS
Sbjct: 5 GLELKYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLE 51
Score = 34.1 bits (79), Expect = 0.067
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 826 GLTL---ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSA 870
GL L + GL I + PGS AAK G + GD +++I+ VDGLS
Sbjct: 5 GLELKYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLE 51
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 41.5 bits (98), Expect = 0.001
Identities = 22/104 (21%), Positives = 46/104 (44%), Gaps = 18/104 (17%)
Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEME 526
E L ++QQ+ + +E+ + + L L +E QE S + +E + + KE+
Sbjct: 66 ERLPELQQELAELQEELA-------ELQEQLAELQQENQELKQELSTLEAELERLQKELA 118
Query: 527 KLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
++ + A+ EL E+ + L ++ LK+E +
Sbjct: 119 RIKQLSANAI-----------ELDEENRELREELAELKQENEAL 151
Score = 32.7 bits (75), Expect = 1.1
Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 2/86 (2%)
Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQ--EKNTALRE 440
+ ++++ L++ +L ST +LE Q+E+ARIK+ + + E+N LRE
Sbjct: 80 EELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELRE 139
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQ 466
+Q A+ + L+E + +
Sbjct: 140 ELAELKQENEALEAENERLQENEQRR 165
Score = 32.7 bits (75), Expect = 1.1
Identities = 26/139 (18%), Positives = 58/139 (41%), Gaps = 39/139 (28%)
Query: 416 AQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQ 475
A+E + ++++ ++ QE+ L+E+ QQ E E ++ L+ ++ +
Sbjct: 64 ARERLPELQQELAEL-QEELAELQEQLAELQQ-------------ENQELKQELSTLEAE 109
Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
E KE+ +K+L+ +E +L E+L +
Sbjct: 110 LERLQKELAR-----------IKQLSANAIELDEEN--------------RELREELAEL 144
Query: 536 VRKIKTLEMENKELVEDKK 554
++ + LE EN+ L E+++
Sbjct: 145 KQENEALEAENERLQENEQ 163
Score = 30.0 bits (68), Expect = 7.2
Identities = 15/94 (15%), Positives = 39/94 (41%), Gaps = 4/94 (4%)
Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
+L QEE+A +++Q ++ QE +E + L+ + + + L + +
Sbjct: 74 ELAELQEELAELQEQLAELQQENQELKQELSTLEAE----LERLQKELARIKQLSANAIE 129
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN 505
+ +++ + +E+ + +RL E
Sbjct: 130 LDEENRELREELAELKQENEALEAENERLQENEQ 163
>gnl|CDD|212690 cd00174, SH3, Src Homology 3 domain superfamily. Src Homology 3
(SH3) domains are protein interaction domains that bind
proline-rich ligands with moderate affinity and
selectivity, preferentially to PxxP motifs. Thus, they
are referred to as proline-recognition domains (PRDs).
SH3 domains are less selective and show more diverse
specificity compared to other PRDs. They have been shown
to bind peptide sequences that lack the PxxP motif;
examples include the PxxDY motif of Eps8 and the RKxxYxxY
sequence in SKAP55. SH3 domain containing proteins play
versatile and diverse roles in the cell, including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies, among others. Many members of this
superfamily are adaptor proteins that associate with a
number of protein partners, facilitating complex
formation and signal transduction.
Length = 51
Score = 37.4 bits (88), Expect = 0.002
Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 10/61 (16%)
Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
Y RAL+ D +L F K DI+ V W W + +G R+ G+ P+
Sbjct: 1 YARALYDYEAQDDD--ELSFKKGDIITVLEK----DDDGW--WEGELNGGRE--GLFPAN 50
Query: 1394 Y 1394
Y
Sbjct: 51 Y 51
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 41.8 bits (99), Expect = 0.002
Identities = 56/263 (21%), Positives = 108/263 (41%), Gaps = 37/263 (14%)
Query: 309 ELEYYRGQHQAAMAQLEAA-AQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
++E R + A ++L + QE LR + +L +++ RL E+ LR
Sbjct: 33 KIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLR----------- 81
Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQC 427
+A+ + Y LN ++ +++ LRK D+ K+E QEE+A +KK
Sbjct: 82 -LAAEDFREKYEDELNLRQSAENDIVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNH 140
Query: 428 ED-----MNQEKNTALRE-RNGLKQQCTAAIR----QWD-IALRERNEYQEALAKVQQQH 476
E+ +Q + E K T A+ Q++ +A + R QEA + +
Sbjct: 141 EEEVRELQSQIQGQVNVEMDAARKLDLTKALAEIRAQYEELAEKNR---QEAEEWYKSKL 197
Query: 477 EDAVKEINT-AMAVRMKAT--KDLKRLTEERNAAMQEYSLIMSERDSVHKEM----EKLS 529
E+ + A+R +L+R + E + ++ S+ +++ E+
Sbjct: 198 EELQQAAARNGDALRSAKEEITELRRQIQSLEI---ELQSLKKQKASLERQLAELEERYE 254
Query: 530 EDLTQAVRKIKTLEMENKELVED 552
+L I LE E ++L +
Sbjct: 255 LELADYQDTISELEEELQQLKAE 277
>gnl|CDD|212964 cd12031, SH3_DLG1, Src Homology 3 domain of Disks Large homolog 1.
DLG1, also called synapse-associated protein 97 (SAP97),
is a scaffolding protein that clusters at synapses and
plays an important role in synaptic development and
plasticity. DLG1 plays roles in regulating cell polarity,
proliferation, migration, and cycle progression. It
interacts with AMPA-type glutamate receptors and is
critical in their maturation and delivery to synapses. It
also interacts with PKCalpha and promotes wound healing.
DLG1 is a member of the MAGUK (membrane-associated
guanylate kinase) protein family, which is characterized
by the presence of a core of three domains: PDZ, SH3, and
guanylate kinase (GuK). The GuK domain in MAGUK proteins
is enzymatically inactive; instead, the domain mediates
protein-protein interactions and associates
intramolecularly with the SH3 domain. DLG1 contains three
PDZ domains. SH3 domains are protein interaction domains
that bind to proline-rich ligands with moderate affinity
and selectivity, preferentially to PxxP motifs. They play
versatile and diverse roles in the cell including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies.
Length = 67
Score = 38.1 bits (88), Expect = 0.002
Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 1332 SFYMRALFS--RTGDLGDALQ-LRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCG 1388
S Y+RALF +T D G Q L F DIL+V N + W+A V ADG ++ G
Sbjct: 2 SLYVRALFDYDKTKDSGLPSQGLNFKFGDILHVVNASDDE---WWQARQVTADGESEEIG 58
Query: 1389 IIPSKYKVE 1397
+IPSK +VE
Sbjct: 59 VIPSKRRVE 67
>gnl|CDD|212959 cd12026, SH3_ZO-1, Src homology 3 domain of the Tight junction
protein, Zonula occludens protein 1. ZO-1 is a
scaffolding protein that associates with other ZO
proteins and other proteins of the tight junction, zonula
adherens, and gap junctions. ZO proteins play roles in
regulating cytoskeletal dynamics at these cell junctions.
ZO-1 plays an essential role in embryonic development. It
regulates the assembly and dynamics of the cortical
cytoskeleton at cell-cell junctions. It is considered a
member of the MAGUK (membrane-associated guanylate
kinase) protein family, which is characterized by the
presence of a core of three domains: PDZ, SH3, and
guanylate kinase (GuK). The GuK domain in MAGUK proteins
is enzymatically inactive; instead, the domain mediates
protein-protein interactions and associates
intramolecularly with the SH3 domain. The C-terminal
region of ZO-1 is the largest of the three ZO proteins
and contains an actin-binding region and domains of
unknown function designated alpha and ZU5. The SH3 domain
of ZO-1 has been shown to bind ZONAB, ZAK, afadin, and
Galpha12. SH3 domains are protein interaction domains
that bind to proline-rich ligands with moderate affinity
and selectivity, preferentially to PxxP motifs. They play
versatile and diverse roles in the cell including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies.
Length = 65
Score = 37.8 bits (87), Expect = 0.003
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Query: 1331 DSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
DSFY+R F + L F+K ++ V +T++NG G W A + + + GII
Sbjct: 1 DSFYIRTHFEYEKE--SPYGLSFNKGEVFRVVDTLYNGKLGSWLAIRIGKNHKEVERGII 58
Query: 1391 PSKYKVE 1397
P+K + E
Sbjct: 59 PNKNRAE 65
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 41.6 bits (98), Expect = 0.004
Identities = 74/343 (21%), Positives = 139/343 (40%), Gaps = 71/343 (20%)
Query: 345 KQRLDRELLELRSGCCTQPESDDAGSA-DSLNQHYMSALNKYETIKDEYDA-LRKR---F 399
++RL+ E EL++ ++A +A + + A + I+ E + L ++ F
Sbjct: 463 RRRLEAEKDELQAAL------EEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEF 516
Query: 400 DDLINTHSTAVDKLELAQE-------EVARIKKQCE-DMNQEKNTALRERNGLKQQCTAA 451
++ H A++ L+ E E +R+KK+ E D+N+ L+ A
Sbjct: 517 ENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINE-----------LEIALDHA 565
Query: 452 IRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
+ A + +YQ+ + ++Q Q E+ + + ++ + E R A++
Sbjct: 566 NKANAEAQKNVKKYQQQVKELQTQVEEEQR--------AREDAREQLAVAERRATALEA- 616
Query: 512 SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL---KR--- 565
E+E+L L QA R K E E E E L+ Q +L KR
Sbjct: 617 ------------ELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLE 664
Query: 566 -EIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERN 624
E+A+ D D+A+ E ER ++ Q +A E E++ S +
Sbjct: 665 GELAALQSDLDEAVNELKAAEERA-------KKAQADAARLAEELRQ---EQEHS--QHL 712
Query: 625 EVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD 667
E ++ K + R+D EA + +K KL+A +++
Sbjct: 713 ERLRKQLESQVKELQVRLDEAEAAA-LKGGKKMIQKLEARVRE 754
Score = 37.7 bits (88), Expect = 0.058
Identities = 76/395 (19%), Positives = 139/395 (35%), Gaps = 85/395 (21%)
Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC---TQPESDDAGSADSLNQHY 378
AQL + SLR + + K L+R+L + + ++ ES+ A A+ L +
Sbjct: 264 AQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELK 323
Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
K +++ +A + D L T S +LE Q E+ R ++ +++
Sbjct: 324 KKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFD 383
Query: 439 RERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLK 498
+ K++ + D A RE L +++ + E+ ++ L+
Sbjct: 384 KILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVE-----------ALR 432
Query: 499 RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSY 558
R E ++ E+ L++ L + R + LE + L
Sbjct: 433 R-----------------ENKNLQDEIHDLTDQLGEGGRNVHELEKARRRL--------- 466
Query: 559 QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDS 618
E K E+ +AL + + AL+ R Q E R E R+ E++
Sbjct: 467 --EAEKDELQAALEEAEAALELEESKVLR----------AQVELSQIRSEIERRLAEKEE 514
Query: 619 SRKERNEVEAASMDLFGKCQKERMDNLEAN--------QEIDRLRK-------------- 656
E E K + +++L+A E RL+K
Sbjct: 515 ------EFENTR-----KNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALD 563
Query: 657 HTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRD 691
H +K AE Q Y + E + E +R R+
Sbjct: 564 HANKANAEAQKNVKKYQQQVKELQTQVEEEQRARE 598
Score = 36.9 bits (86), Expect = 0.11
Identities = 57/314 (18%), Positives = 128/314 (40%), Gaps = 49/314 (15%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSS--------LRSKYSDLISDKQRLDRELLELRSGC 359
K+LE QH+ A+A L Q++ + L+ + + +K +L E+ +L +
Sbjct: 109 KDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQL 168
Query: 360 CTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEE 419
+ +A+ + S L++ + DE L+++ +DL + S +L Q E
Sbjct: 169 --DQIAKAKLNAEKKAKQLESQLSELQVKLDE---LQRQLNDLTSQKS----RL---QSE 216
Query: 420 VARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRER----------------- 462
+ + +Q E+ + + + ++ L+ Q A R + RER
Sbjct: 217 NSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSL 276
Query: 463 -----------NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
E + L+K + + + + A+R + ++LK+ ++ + ++E
Sbjct: 277 REQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKLNQKISELEEA 336
Query: 512 SLIM-SERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
+ ++ DS+ K +L +L +++ EL + +K + KR++
Sbjct: 337 AEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDEL 396
Query: 571 LHDRDKALKECNDL 584
+ D A +E +L
Sbjct: 397 QAELDTAQREARNL 410
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 36.9 bits (86), Expect = 0.006
Identities = 18/56 (32%), Positives = 30/56 (53%)
Query: 521 VHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
+ +E+L + QAV I L+ME +EL E+ + LS + E L+ E +R+
Sbjct: 2 SLELLEELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERNA 57
Score = 29.2 bits (66), Expect = 2.8
Identities = 12/49 (24%), Positives = 26/49 (53%)
Query: 399 FDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQ 447
++L + AV+ + L Q E+ +K++ E +++E E LK++
Sbjct: 6 LEELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEE 54
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 40.8 bits (96), Expect = 0.006
Identities = 28/192 (14%), Positives = 62/192 (32%), Gaps = 30/192 (15%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
+ ++ Y + A+ +LE+ + S+ D + L L L
Sbjct: 289 ELIDKYESHIEKALEELES----ILDTEKENSEFKLDVEELKALLEALEE---------- 334
Query: 368 AGSADSLNQH------YMSALNKYETIKDEYDALRKRFD---DLINTHSTAVDKLELAQE 418
+ Q S + E+I D +++ D +LI H+ +D L+ +
Sbjct: 335 --ILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKN 392
Query: 419 EVAR-----IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 473
+ + + + ++ + I+Q + ++ + + L K
Sbjct: 393 KAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQL 452
Query: 474 QQHEDAVKEINT 485
E EIN
Sbjct: 453 TNIEPTADEINK 464
Score = 38.1 bits (89), Expect = 0.045
Identities = 51/278 (18%), Positives = 107/278 (38%), Gaps = 48/278 (17%)
Query: 339 SDLISDKQRLD-----RELLELRSGC--CTQPESDDAGSADSLNQHYMSALNKYETIKDE 391
S+LI++ D E E C C Q +++ L H+ +E
Sbjct: 232 SELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEE--RKAELEAHF----------DEE 279
Query: 392 YDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAA 451
Y L ++ ++LI+ + + ++K E + +K+ + + L++
Sbjct: 280 YQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEI---- 335
Query: 452 IRQWDIALRERNEY------QEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN 505
+ + L E+ + E++ + + D + IN + + + + L +E+N
Sbjct: 336 LEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIR---EHNEKIDNLKKEKN 392
Query: 506 AAMQEYSLIMSERDSV-HKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
A + + + +L ED+ ++ K LE L ++ K L +I+ L+
Sbjct: 393 KA----------KKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALE 442
Query: 565 REIAS---ALHDRDKALKECNDLRERFG--EFTAAKEE 597
+EI L + + E N L + +G F KE
Sbjct: 443 KEIKELEKQLTNIEPTADEINKLLKAYGFGNFELEKEA 480
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 40.9 bits (95), Expect = 0.006
Identities = 82/458 (17%), Positives = 178/458 (38%), Gaps = 44/458 (9%)
Query: 272 EHSPRVHRKSVNSSTQTMPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQES 331
+H ++K +N + + L + Q E ++ +K+L + + + QLE E
Sbjct: 225 QHLEEEYKKEINDKEKQVSLLLIQITEK----ENKMKDLTFLLEESRDKANQLE----EK 276
Query: 332 SSLRSK-YSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKD 390
+ L+ + +LI K L +EL +++ + +L + A + +
Sbjct: 277 TKLQDENLKELIEKKDHLTKELEDIKMSL-----QRSMSTQKALEEDLQIATKTICQLTE 331
Query: 391 EYDALRKRFDDLINTHSTAVDKLELAQ---EEVARIKKQCEDMN-----------QEKNT 436
E +A + + HS V + E EE+ R ++Q + N Q+K++
Sbjct: 332 EKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSS 391
Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEY---QEALAKVQQQHEDAVKEINTAMAVRMKA 493
L E K + + L E + ++ K+ ++ + +E+ + R K
Sbjct: 392 ELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKE 451
Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
DL+ + + Y E + + E+EK + L +ENKEL ++
Sbjct: 452 IHDLEIQLTAIKTSEEHYL---KEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEA 508
Query: 554 KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRM 613
++ +++ + +I + ++ LK+ +L E+ E + E E ++
Sbjct: 509 SDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKL 568
Query: 614 GERD----SSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQS 669
+ + S E + E L KC + N+ I+ L + L+ + ++
Sbjct: 569 DKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAEN 628
Query: 670 VNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDSYH 707
+ E +V+K + A +++ E+ID+Y
Sbjct: 629 KQL------NAYEIKVNKLELELASAKQKFEEIIDNYQ 660
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 40.5 bits (96), Expect = 0.007
Identities = 40/183 (21%), Positives = 79/183 (43%), Gaps = 23/183 (12%)
Query: 402 LINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE---------RNGLKQQCTAAI 452
+ A LE A++E IKK+ +E+ LR RN L++
Sbjct: 33 IKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKL----- 87
Query: 453 RQWDIALRERN--EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
+ + +E N E L K +++ E KE+ K ++L+ L EE+ ++
Sbjct: 88 -EKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
Query: 511 YSLIMSE--RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVED--KKTLSYQIETLKRE 566
S + +E ++ + +++E+ + +A IK +E E KE + K+ L+ I+ +
Sbjct: 147 ISGLTAEEAKEILLEKVEEEAR--HEAAVLIKEIEEEAKEEADKKAKEILAQAIQRCAAD 204
Query: 567 IAS 569
+
Sbjct: 205 HVA 207
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 40.1 bits (94), Expect = 0.007
Identities = 64/311 (20%), Positives = 131/311 (42%), Gaps = 34/311 (10%)
Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
+L+K +T +++ + L KR +L+ L+ Q+ +++K+ + + E + A+
Sbjct: 11 NQSLSKLDTPEEKLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDKLQSEHSKAI 70
Query: 439 RERNGLKQQCTAAIRQ----WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKAT 494
R+ L+ C + + +E +E ++ ++ + +K+I M
Sbjct: 71 LARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPN 130
Query: 495 KDLKRLTEERNAAMQE----YSLIMSERDSVHKEME---KLSE-DLTQAVRKIKTLEMEN 546
L EE ++E Y L + + K + +L++ L QA + + + ++
Sbjct: 131 HKLCEENEELREKLKELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKH 190
Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKE-CNDLRERFGEFTAAKEECQREAFHA 605
K E + Q+ TLK + +K L+ N ++F EF + E F
Sbjct: 191 KREKEILLKEAAQVSTLK--------ETEKELRAQLNLYSDKFKEFQDTLNK-SNEVF-L 240
Query: 606 RFEYAMRMGERDSSRKERNEVEAASMDL-FGKCQKERMDNLEANQ----EIDRLRKHTDK 660
F+ M E+ S + ++ E E + + K K ++ E Q EI++L+K DK
Sbjct: 241 TFKQEM---EKMSKKIKKLEKENLAWKRKWEKSNKALLEMAEERQKRDKEIEKLKKKIDK 297
Query: 661 LQA---ELQDQ 668
L+ LQ +
Sbjct: 298 LEKLCRALQAE 308
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 39.1 bits (91), Expect = 0.009
Identities = 36/196 (18%), Positives = 64/196 (32%), Gaps = 37/196 (18%)
Query: 375 NQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEK 434
+ L + D K F +L + D+ L + E ++ + +
Sbjct: 2 RDEILRRLK-------DRDRREKAFSELFEQYRQLADRTRLLKVENDALQAEKYEQQSSH 54
Query: 435 NTALRERNGLKQQCTAAIRQWDIALRE--------RNEYQEALAKVQQQHEDAVKEINTA 486
+++ A + Q LRE R E + L + + E +EI
Sbjct: 55 SSSPSADGPGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQ- 113
Query: 487 MAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL-----------TQA 535
K + L E + E + + +E EK +E L
Sbjct: 114 ------LEKTIAELRSEITSLETE----IRDLREELQEKEKDNETLQDELISLNIELNAL 163
Query: 536 VRKIKTLEMENKELVE 551
K++ L+ EN+ELVE
Sbjct: 164 EEKLRKLQKENQELVE 179
Score = 31.3 bits (71), Expect = 2.6
Identities = 33/181 (18%), Positives = 66/181 (36%), Gaps = 22/181 (12%)
Query: 430 MNQEKNTALRERNGLKQQCTAAIRQWDIALR-------ERNEYQEALAKVQQQHEDAVKE 482
E L++R+ ++ + Q+ E + Q + Q H +
Sbjct: 1 WRDEILRRLKDRDRREKAFSELFEQYRQLADRTRLLKVENDALQAEKYEQQSSHSSSPSA 60
Query: 483 INTAMAVRMKATKD-LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
+ + L +L EE ++ + ++ E+E+L ++ Q + I
Sbjct: 61 DGPGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAE 120
Query: 542 LEMENKELVEDKKTL-------SYQIETLKREIAS------ALHDRDKALKECND-LRER 587
L E L + + L ETL+ E+ S AL ++ + L++ N L ER
Sbjct: 121 LRSEITSLETEIRDLREELQEKEKDNETLQDELISLNIELNALEEKLRKLQKENQELVER 180
Query: 588 F 588
+
Sbjct: 181 W 181
>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
membrane].
Length = 406
Score = 40.0 bits (94), Expect = 0.010
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 5/80 (6%)
Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELA-KPA 1308
+GI L + G+ V S GS AG++ GD I++ +G + + +EA + KP
Sbjct: 102 IGIELQMEDIGGVKVVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPG 161
Query: 1309 DKVTVLAQYSVERYNEVKDK 1328
KVT+ ++ R K
Sbjct: 162 TKVTL----TILRAGGGKPF 177
Score = 37.3 bits (87), Expect = 0.057
Identities = 18/75 (24%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 1134 IDKSVEPLGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAAS 1193
+GI++Q D GGV V + + S A++ G++ GD ++++ G ++ + A
Sbjct: 95 TSGEFGGIGIELQMEDIGGVKVVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVK 154
Query: 1194 VLR-QCGNSITMLVQ 1207
++R + G +T+ +
Sbjct: 155 LIRGKPGTKVTLTIL 169
>gnl|CDD|212960 cd12027, SH3_ZO-2, Src homology 3 domain of the Tight junction
protein, Zonula occludens protein 2. ZO-2 is a
scaffolding protein that associates with other ZO
proteins and other proteins of the tight junction, zonula
adherens, and gap junctions. ZO proteins play roles in
regulating cytoskeletal dynamics at these cell junctions.
ZO-2 plays an essential role in embryonic development. It
is critical for the blood-testis barrier integrity and
male fertility. It also regulates the expression of
cyclin D1 and cell proliferation. It is considered a
member of the MAGUK (membrane-associated guanylate
kinase) protein family, which is characterized by the
presence of a core of three domains: PDZ, SH3, and
guanylate kinase (GuK). The GuK domain in MAGUK proteins
is enzymatically inactive; instead, the domain mediates
protein-protein interactions and associates
intramolecularly with the SH3 domain. The C-terminal
region of ZO-2 contains an actin-binding region and a
domain of unknown function designated beta. The SH3
domain of the related protein ZO-1 has been shown to bind
ZONAB, ZAK, afadin, and Galpha12. SH3 domains are protein
interaction domains that bind to proline-rich ligands
with moderate affinity and selectivity, preferentially to
PxxP motifs. They play versatile and diverse roles in the
cell including the regulation of enzymes, changing the
subcellular localization of signaling pathway components,
and mediating the formation of multiprotein complex
assemblies.
Length = 63
Score = 36.0 bits (83), Expect = 0.010
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 1330 GDSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGI 1389
GDSF++R F +L +L F + +I V +T+++G G+W A + G + G+
Sbjct: 1 GDSFFIRTHFEYEKELPQSLA--FTRGEIFRVVDTLYDGKLGNWLAVRI---GNELEKGL 55
Query: 1390 IPSKYKVE 1397
IP+K + E
Sbjct: 56 IPNKSRAE 63
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 40.2 bits (94), Expect = 0.011
Identities = 48/248 (19%), Positives = 96/248 (38%), Gaps = 35/248 (14%)
Query: 372 DSLNQHYMSALNKYETIKDEYDALRKRFDDL------INTHSTAVDKLELAQEEVARIKK 425
+ L + L ET+ + R +D+ I + + + +E+ I +
Sbjct: 539 EGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQ 598
Query: 426 QCEDMNQE----KNTALRERNGLKQQCTAAIRQ-WDIALRE------------RNEYQEA 468
Q E++ E K + E+N L+++ + + + L+E + Y EA
Sbjct: 599 QIEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHEA 658
Query: 469 LAK--VQQQHEDAVKEINTAMAVRMKAT------KDLKRLTEERNAAMQEYSLIMSERDS 520
+K +Q + E MK+ K+LK E +N + ++I + ++
Sbjct: 659 KSKEDLQTLLNTSKNEYEKLE--FMKSDNIDNIIKNLK--KELQNLLSLKENIIKKQLNN 714
Query: 521 VHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
+ +++ T +K+ E KE E + +QI K E L++ DK L +
Sbjct: 715 IEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPD 774
Query: 581 CNDLRERF 588
+ E F
Sbjct: 775 GKNTYEEF 782
Score = 33.3 bits (76), Expect = 1.6
Identities = 42/256 (16%), Positives = 99/256 (38%), Gaps = 59/256 (23%)
Query: 337 KYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALR 396
KYSD I KQR +L+E Y + L+
Sbjct: 1910 KYSDDIDKKQRDCNKLVE-------------------------DGNEIYLKSTAINE-LK 1943
Query: 397 KRFDDLINTHSTAVDKLELAQEEVARIKK-QCED------MNQEKNTALRE-RNGLKQQC 448
+ + N S +K++ +++ + K C D + +E+ L++ RN Q+
Sbjct: 1944 NMINSVKNKESAISNKIDNVSNKLSELNKITCNDESYDEILEKEEYEELKDLRNSFNQEK 2003
Query: 449 TAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAM 508
+ + N+ +E + ++ K + T +KA++++K++ E + +
Sbjct: 2004 AETLNNLKL-----NKIKEDFNSYKNLLDELEKSVKT-----LKASENIKKIVENKKTS- 2052
Query: 509 QEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA 568
I + + + KE+E ++ L + ++K +E+ + I+ ++ +I+
Sbjct: 2053 --IDAINTNIEDIEKEIESINPSLDELLKKGHKIEISRYTSI---------IDNVQTKIS 2101
Query: 569 ---SALHDRDKALKEC 581
++D +K +
Sbjct: 2102 NDSKNINDIEKKAQIY 2117
Score = 32.9 bits (75), Expect = 1.9
Identities = 42/210 (20%), Positives = 75/210 (35%), Gaps = 39/210 (18%)
Query: 374 LNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
+N +Y + Y T Y K++ D IN +E EE+ + +QC
Sbjct: 118 VNIYYFKEMIHYAT---SYYDDLKKYSDKINE------DVEPLNEEIIKNIEQC---LGN 165
Query: 434 KNTALRERNGLKQQCTAAIRQ--WDIALRE----RNEYQEALAKVQQQHEDAVKEINTAM 487
KN L+ +R+ +D E + + L ++ ++ +K+IN +
Sbjct: 166 KNDLDNLIIVLENPEKYNVRKTLYDEKFNEYKNKKEAFYNCLKNKKEDYDKKIKKINNEI 225
Query: 488 AVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVH-KEMEK---------LSEDLTQAVR 537
K LK + N + + M E + E+ E L
Sbjct: 226 ------RKLLKNIKCTGNMCKTDTYVDMVELYLLRVNEVPSNNYDNYLNRAKELLESGSD 279
Query: 538 KIKTLEMENKELVEDKKTLSYQIETLKREI 567
I ++ E + D KT+ Y I ++ EI
Sbjct: 280 LINKIKKE----LGDNKTI-YSINFIQEEI 304
Score = 32.9 bits (75), Expect = 2.3
Identities = 45/237 (18%), Positives = 101/237 (42%), Gaps = 31/237 (13%)
Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
+ + + Y+ L K+ DDLI D +EL + + K+ E+ + + L +
Sbjct: 999 EIDKLNVNYNILNKKIDDLIKKQH--DDIIELIDKLIKEKGKEIEEKVDQYISLLEK--- 1053
Query: 444 LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA----VRMKATKDLKR 499
+K + ++ DI + + +E + ++++ E +K+I+ ++ K+ + +
Sbjct: 1054 MKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVN 1113
Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL-TQAVRKIKTLEMENKEL--------- 549
+E+N + Y+ + ++K+MEK ++L + I E+ E+
Sbjct: 1114 ADKEKNKQTEHYNKKKKSLEKIYKQMEKTLKELENMNLEDITLNEVNEIEIEYERILIDH 1173
Query: 550 --------VEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEEC 598
+ KT+ +IE+ K++I + KE ND F E+ A ++
Sbjct: 1174 IVEQINNEAKKSKTIMEEIESYKKDIDQV---KKNMSKERNDHLTTF-EYNAYYDKA 1226
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 39.2 bits (92), Expect = 0.014
Identities = 27/136 (19%), Positives = 50/136 (36%), Gaps = 33/136 (24%)
Query: 490 RMKATKDLK-RLTEERNAAMQEYSLIMSERDSVHKEMEKLSE------------------ 530
RMK + LK L E ++Y L+M E + ++ KL +
Sbjct: 138 RMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDE 197
Query: 531 -------DLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND 583
+L +A K+K L E V+ + L +++ L+ +I + ++
Sbjct: 198 LEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIE-------DLTNKKSE 250
Query: 584 LRERFGEFTAAKEECQ 599
L E E+C+
Sbjct: 251 LNTEIAEAEKKLEQCR 266
Score = 33.8 bits (78), Expect = 0.67
Identities = 40/207 (19%), Positives = 85/207 (41%), Gaps = 40/207 (19%)
Query: 491 MKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELV 550
A+ D+K L +++ ++ ++ + +++ ++ KL E L + L+ + L
Sbjct: 105 FSASPDVKLLMDKQFQLVKTFARLEAKKMW-YEWRMKLLEGL------KEGLDENLEGLK 157
Query: 551 EDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYA 610
ED K L ++E L I L DR AL+E + ++E +
Sbjct: 158 EDYKLLMKELELLN-SIKPKLRDRKDALEE------ELRQLKQLEDELED---------- 200
Query: 611 MRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSV 670
+ E++ A L +K + + ++++ L + +L+++++D
Sbjct: 201 ----------CDPTELDRAKEKL----KKLLQEIMIKVKKLEELEEELQELESKIED-LT 245
Query: 671 NYLFILTESIQEAE-VSKRRRDWAFSE 696
N L I EAE ++ R + F E
Sbjct: 246 NKKSELNTEIAEAEKKLEQCRGFTFKE 272
Score = 32.3 bits (74), Expect = 2.0
Identities = 24/115 (20%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL-RERNGL 444
E +K++Y L K + L + D+ + +EE+ ++K+ +++ T L R + L
Sbjct: 154 EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKL 213
Query: 445 KQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
K+ ++ I +++ E +E L +++ + ED + + +A K L++
Sbjct: 214 KKLL----QEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQ 264
Score = 31.5 bits (72), Expect = 3.1
Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 3/87 (3%)
Query: 390 DEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
E D +++ L+ V KLE +EE+ ++ + ED+ +K+ E +++
Sbjct: 204 TELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKK-L 262
Query: 450 AAIRQWDIALRERNEYQEALAKVQQQH 476
R + +E + +E L +Q
Sbjct: 263 EQCRGFTF--KEIEKLKEQLKLLQSLT 287
Score = 31.1 bits (71), Expect = 4.9
Identities = 31/155 (20%), Positives = 61/155 (39%), Gaps = 20/155 (12%)
Query: 411 DKLELAQEEVARIKKQCEDMN------QEKNTALRERNGLKQQCTAAIRQWDIALRERNE 464
+ LE +E+ + K+ E +N +++ AL E +Q + D +R
Sbjct: 151 ENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDR-- 208
Query: 465 YQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK- 523
+E L K+ Q+ VK++ + ++ LT +++ E I + +
Sbjct: 209 AKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTE---IAEAEKKLEQC 265
Query: 524 ------EMEKLSE--DLTQAVRKIKTLEMENKELV 550
E+EKL E L Q++ K ++ L
Sbjct: 266 RGFTFKEIEKLKEQLKLLQSLTGWKITKLSGNTLS 300
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 39.6 bits (92), Expect = 0.014
Identities = 49/284 (17%), Positives = 113/284 (39%), Gaps = 36/284 (12%)
Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
K + +V+ ++ + E + + + L++Q A + ERN AL
Sbjct: 158 KFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERN----ALLA 213
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
+ +++ + E+ +D++ LT++ QE ++ + E+E++ +
Sbjct: 214 QRAENQQRILELE----------QDIQTLTQK----KQENDRVLEGTQDIEAELERMKGE 259
Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEF 591
L Q ++K T++ ++E L + E L ++ + + + ++ L E
Sbjct: 260 LKQRLKK-MTIQRRDEETERIDLQL--ENEQLHEDLRTLQERLESSQQKAGLLGEELASL 316
Query: 592 TAAKEECQREAFHARFEYA-MRMGERDSSRKERNEV-----EAASMDLFGKCQKERMDNL 645
+ ++ E +R E A M + + + E ++ + K+R+ L
Sbjct: 317 GSLRDHTIAELHQSRLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKL 376
Query: 646 EANQEIDRL-------RKHTDKLQAELQDQSVNYLFILTESIQE 682
A E+ +L R +KL+ EL + L+E+ +E
Sbjct: 377 SA--ELLKLEEWLQEERSQREKLEVELGIEKDCNRVQLSENRRE 418
Score = 38.4 bits (89), Expect = 0.038
Identities = 67/319 (21%), Positives = 126/319 (39%), Gaps = 50/319 (15%)
Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
K+ ++ + +R R + L + +K E +E+ + E M+ E+N L +R
Sbjct: 158 KFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAE 217
Query: 444 LKQQC----------TAAIRQWDIAL-----------RERNEYQEALAKVQQQHEDAVKE 482
+Q+ T ++ D L R + E ++ L K+ Q D E
Sbjct: 218 NQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETE 277
Query: 483 I------NTAMAVRMKATKDLKRLTEERNAAM-QEYSLIMSERDSVHKEMEKLSEDLTQA 535
N + ++ ++ ++++ + +E + + S RD E+ + + Q
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337
Query: 536 VRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAK 595
++ L N L E + + + ETL++ SA D+D+ K +L + E +
Sbjct: 338 SLQLSQL---NLALKEGQSQWAQERETLRQ---SAEADKDRIQKLSAELLKL--EEWLQE 389
Query: 596 EECQREAFHARF--EYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEI-- 651
E QRE E + +R+E +E+ +A L QKE+ E QE+
Sbjct: 390 ERSQREKLEVELGIEKDCNRVQLSENRRELSELRSALRVL----QKEKEQLQEEKQELLD 445
Query: 652 ------DRLRKHTDKLQAE 664
RL K D+ AE
Sbjct: 446 YIRVLELRLDKEADEKWAE 464
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 39.4 bits (92), Expect = 0.016
Identities = 31/132 (23%), Positives = 53/132 (40%), Gaps = 22/132 (16%)
Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE-----KNTALRE 440
E K Y ++ + LI S +LE E + ++ K+ E + +E + RE
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV---KEINTAMAVRMKATKDL 497
RN + E QEAL ++++ E + KE A +K+ +DL
Sbjct: 560 RNKKLEL--------------EKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDL 605
Query: 498 KRLTEERNAAMQ 509
+L E + Q
Sbjct: 606 VKLKETKQKIPQ 617
Score = 31.7 bits (72), Expect = 3.9
Identities = 28/106 (26%), Positives = 54/106 (50%), Gaps = 9/106 (8%)
Query: 482 EINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
EIN + L++ E++N +++ ++ E++ + KE+E+ E+L + R K
Sbjct: 512 EINVLIE----KLSALEKELEQKNEHLEK---LLKEQEKLKKELEQEMEELKERERNKKL 564
Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
E+E KE E K L ++E++ RE+ + K +K DL +
Sbjct: 565 -ELE-KEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKL 608
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 39.7 bits (92), Expect = 0.017
Identities = 53/264 (20%), Positives = 112/264 (42%), Gaps = 16/264 (6%)
Query: 414 ELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 473
EL + E + ++ + ++KN ALR+ K+ A I + E + + AK
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615
Query: 474 QQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQ-----EYSLIMSERDSVHKEMEKL 528
++ + +E+ A + K + K+ EE+ A + E + I + ++ E +K
Sbjct: 1616 EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675
Query: 529 -SEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
+E+ +A K K+ E+ K + E LK++ A ++ K + + +
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAK----KAEELKKKEAEEKKKAEELKKAEEENKIK 1731
Query: 588 FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEA 647
E EE +++A A+ + E+ + E E + ++ + +KE + E
Sbjct: 1732 AEEAKKEAEEDKKKAEEAKKD----EEEKKKIAHLKKEEEKKAEEI--RKEKEAVIEEEL 1785
Query: 648 NQEIDRLRKHTDKLQAELQDQSVN 671
++E ++ R DK ++ D N
Sbjct: 1786 DEEDEKRRMEVDKKIKDIFDNFAN 1809
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 39.6 bits (93), Expect = 0.018
Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 45/275 (16%)
Query: 298 ETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRS 357
+ RR + L + Q Q +QLE A + S+L L+ L E L R
Sbjct: 841 QLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNR----LLPRLNLLADETLADRV 896
Query: 358 GCCTQPESDDAGSADSLNQHYMSALNKYETI-------KDEYDALRKRFDDLINTHSTAV 410
+ + D+A A Q + +AL + E I ++++ L++ + T A
Sbjct: 897 EEI-REQLDEAEEAKRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAK 955
Query: 411 DKLELAQEEVARIK----KQCEDM---NQEKNTALRERNGLKQ------QCTAAIRQWDI 457
+ E V R + +M N + N LR+R L+Q + +RQ
Sbjct: 956 QQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEKLRQR--LEQAEQERTRAREQLRQAQA 1013
Query: 458 ALRERNEYQEAL-----AKVQQQHEDAVKEIN-------TAMAVRMKATKD-----LKRL 500
L + N+ +L AK +Q ++ +E+ + R +A +D L
Sbjct: 1014 QLAQYNQVLASLKSSYDAK-RQMLQELKQELQDLGVPADSGAEERARARRDELHARLSAN 1072
Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
RN ++ + +E D++ K++ KL D +
Sbjct: 1073 RSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEM 1107
Score = 33.8 bits (78), Expect = 1.1
Identities = 50/248 (20%), Positives = 96/248 (38%), Gaps = 38/248 (15%)
Query: 314 RGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLD------------RELLELRSGCCT 361
Q + A+ EAA +E L+S+ +D +Q LD + LE C
Sbjct: 375 DEQQEENEARAEAAEEEVDELKSQLADY---QQALDVQQTRAIQYQQAVQALERAKQLCG 431
Query: 362 QPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA 421
P+ + D L + K + +E +L ++ A + E A + V
Sbjct: 432 LPDLTADNAEDWLEE----FQAKEQEATEELLSLEQKLSV----AQAAHSQFEQAYQLVR 483
Query: 422 RI-----KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRE-RNEYQEALAKVQQQ 475
+I + + D+ +E LRE+ L +Q ++Q + L E ++ QQ+
Sbjct: 484 KIAGEVSRSEAWDVARELLRRLREQRHLAEQ----LQQLRMRLSELEQRLRQ-----QQR 534
Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
E + E + + +L++L EE A ++ S +SE + + E L
Sbjct: 535 AERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQAR 594
Query: 536 VRKIKTLE 543
++++
Sbjct: 595 IQRLAARA 602
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 39.3 bits (92), Expect = 0.019
Identities = 57/330 (17%), Positives = 101/330 (30%), Gaps = 67/330 (20%)
Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLE----LAQEE 419
S ++L KYE + E LR ++ + + K EE
Sbjct: 204 PSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEE 263
Query: 420 VARIKKQCEDMNQEKNTALRERNGLKQQC----------TAAIRQWDI------------ 457
++++Q +++ + + L + Q
Sbjct: 264 REQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQE 323
Query: 458 ALRER-NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
L ER E E+L K+ + VKEI +A K +
Sbjct: 324 ELEERDKELLESLPKLALP-AEHVKEIAAELAEIDKPATTDSEIPH-------------- 368
Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
E+ +L E L Q V+ +EL + K L ++ L+ E+A DK
Sbjct: 369 --RLSGSELTQL-EVLIQQVK---------RELQDAKSQLLKELRELEEELAEV----DK 412
Query: 577 ALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGK 636
+ E+ + E Q E F + E E + E +++ K
Sbjct: 413 KI-STIPSEEQIAQLLEELGEAQNELFRSEAEIE----ELLRQLETLKE----AIEALRK 463
Query: 637 CQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
E+ E++R DK + L+
Sbjct: 464 TLDEKTKQKINAFELERAITIADKAKKTLK 493
Score = 32.0 bits (73), Expect = 3.6
Identities = 16/59 (27%), Positives = 27/59 (45%)
Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDL 584
K SE + + +I+ LE E KE E + L+ +I L+ E+ A + K+
Sbjct: 198 RKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSE 256
>gnl|CDD|212963 cd12030, SH3_DLG4, Src Homology 3 domain of Disks Large homolog 4.
DLG4, also called postsynaptic density-95 (PSD95) or
synapse-associated protein 90 (SAP90), is a scaffolding
protein that clusters at synapses and plays an important
role in synaptic development and plasticity. It is
responsible for the membrane clustering and retention of
many transporters and receptors such as potassium
channels and PMCA4b, a P-type ion transport ATPase, among
others. DLG4 is a member of the MAGUK
(membrane-associated guanylate kinase) protein family,
which is characterized by the presence of a core of three
domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
domain in MAGUK proteins is enzymatically inactive;
instead, the domain mediates protein-protein interactions
and associates intramolecularly with the SH3 domain. DLG4
contains three PDZ domains. The SH3 domain of DLG4 binds
and clusters the kainate subgroup of glutamate receptors
via two proline-rich sequences in their C-terminal tail.
It also binds AKAP79/150 (A-kinase anchoring protein).
SH3 domains are protein interaction domains that bind to
proline-rich ligands with moderate affinity and
selectivity, preferentially to PxxP motifs. They play
versatile and diverse roles in the cell including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies.
Length = 66
Score = 35.3 bits (81), Expect = 0.020
Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 10/71 (14%)
Query: 1332 SFYMRALFS--RTGDLG---DALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQ 1386
FY+RALF +T D G AL RF D+L+V + G W+A V +D ++
Sbjct: 1 GFYIRALFDYDKTKDCGFLSQALSFRF--GDVLHV---IDAGDEEWWQARRVHSDSETEE 55
Query: 1387 CGIIPSKYKVE 1397
G IPSK +VE
Sbjct: 56 IGFIPSKRRVE 66
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 36.9 bits (86), Expect = 0.021
Identities = 26/121 (21%), Positives = 47/121 (38%), Gaps = 32/121 (26%)
Query: 473 QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY--------------------- 511
+++ E A E+ A A + L++L + R Q+
Sbjct: 12 EKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFIR 71
Query: 512 ---SLIMSERDSVH---KEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
I ++ + +E+E E L +A R++K LE +++KK Y+ E KR
Sbjct: 72 QLDQRIQQQQQELALLQQEVEAKRERLLEARRELKALEK-----LKEKKQKEYRAEEAKR 126
Query: 566 E 566
E
Sbjct: 127 E 127
>gnl|CDD|212962 cd12029, SH3_DLG3, Src Homology 3 domain of Disks Large homolog 3.
DLG3, also called synapse-associated protein 102
(SAP102), is a scaffolding protein that clusters at
synapses and plays an important role in synaptic
development and plasticity. Mutations in DLG3 cause
midgestational embryonic lethality in mice and may be
associated with nonsyndromic X-linked mental retardation
in humans. It interacts with the NEDD4 (neural precursor
cell-expressed developmentally downregulated 4) family of
ubiquitin ligases and promotes apical tight junction
formation. DLG3 is a member of the MAGUK
(membrane-associated guanylate kinase) protein family,
which is characterized by the presence of a core of three
domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
domain in MAGUK proteins is enzymatically inactive;
instead, the domain mediates protein-protein interactions
and associates intramolecularly with the SH3 domain. DLG3
contains three PDZ domains. SH3 domains are protein
interaction domains that bind to proline-rich ligands
with moderate affinity and selectivity, preferentially to
PxxP motifs. They play versatile and diverse roles in the
cell including the regulation of enzymes, changing the
subcellular localization of signaling pathway components,
and mediating the formation of multiprotein complex
assemblies.
Length = 67
Score = 35.1 bits (80), Expect = 0.022
Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 6/69 (8%)
Query: 1332 SFYMRALFS--RTGDLGDALQ-LRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCG 1388
S Y+RALF RT D Q L F DIL+V N + W+A LV G +Q G
Sbjct: 2 SLYVRALFDYDRTRDSCLPSQGLSFSYGDILHVINASDDE---WWQARLVTPHGESEQIG 58
Query: 1389 IIPSKYKVE 1397
+IPSK +VE
Sbjct: 59 VIPSKKRVE 67
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 39.1 bits (91), Expect = 0.026
Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 46/292 (15%)
Query: 428 EDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQ-----HEDAVKE 482
++ Q +NT+LR L Q+ T++ + A +R+E A+ QQ + + +
Sbjct: 70 DENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQ 129
Query: 483 INTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTL 542
+ + + A K++ L + R A+++ I++E++++ ++ L L++ +IK
Sbjct: 130 LEDLVGMIQNAEKNILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLA 189
Query: 543 EME--NKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEE--- 597
E + E++E+ Q+E L+ E+ + E + E KEE
Sbjct: 190 AQEKIHVEILEE------QLEKLRNELLI------RGATEGLCVHSLSKELDVLKEENML 237
Query: 598 --CQREAFHARFEYAMRMGERDSS-RKERNEVEAASMDLFGK-----------------C 637
+ A ER KER+ ++A+ +L K C
Sbjct: 238 LKDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDC 297
Query: 638 QKERMDNLEANQEIDRLRKHTDKLQAELQ--DQSVNYLFILTESIQEAEVSK 687
E+++NL+ +DR +K L + + L S++EA VSK
Sbjct: 298 WWEKVENLQD--LLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKEANVSK 347
Score = 34.5 bits (79), Expect = 0.64
Identities = 74/361 (20%), Positives = 134/361 (37%), Gaps = 79/361 (21%)
Query: 331 SSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKD 390
SS + K I+ KQR L+ + Q E+ + L Q ++
Sbjct: 43 SSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTVMELPQ---------KSTSS 93
Query: 391 EYDALRKRFDDLINTHSTAVDK--LELAQEEVARIKKQCEDM-----NQEKNTALRERNG 443
+ D R + A+D +++ Q ED+ N EKN L
Sbjct: 94 DDD--HNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKNILL----- 146
Query: 444 LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE 503
L Q A+ + L E+ Q + ++ + + I A ++ ++L +
Sbjct: 147 LNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKL 206
Query: 504 RNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
RN + +E VH ++L L+ EN L +D I+ L
Sbjct: 207 RNELLIR---GATEGLCVHSLSKELD-----------VLKEENMLLKDD-------IQFL 245
Query: 564 KREIASALHDRDKAL----KECN----DLRERFGEFTAAKEEC------QREAFHARFEY 609
K E+ + + ++ + KE + LRE +F A+E+ Q + + + E
Sbjct: 246 KAEL-IEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVEN 304
Query: 610 AMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQS 669
+ +R + N+VE A++ L+ NQ+ LR DKL+A L++ +
Sbjct: 305 LQDLLDRAT-----NQVEKAAL------------VLDQNQD---LRDKVDKLEASLKEAN 344
Query: 670 V 670
V
Sbjct: 345 V 345
Score = 34.1 bits (78), Expect = 0.76
Identities = 48/266 (18%), Positives = 110/266 (41%), Gaps = 46/266 (17%)
Query: 374 LNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
LNQ + AL E I E +AL+ + + L S +++LA +E ++ E + +
Sbjct: 147 LNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKL 206
Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERN------------------EYQEALAKVQQQ 475
+N L + ++ D+ L+E N E +E + K++++
Sbjct: 207 RNELLIRGATEGLCVHSLSKELDV-LKEENMLLKDDIQFLKAELIEVAETEERVFKLEKE 265
Query: 476 H---EDAVKEINTAMAVR---------------MKATKDLKRLTEERNAAMQEYSLIMSE 517
+ +++E+ + V + ++L+ L + +++ +L++ +
Sbjct: 266 RSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQ 325
Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK-KTLSYQIETLKREIASALHDRDK 576
+ +++KL L +A + EL++ K K L +++ EI S + +
Sbjct: 326 NQDLRDKVDKLEASLKEA--NVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQE 383
Query: 577 ALKECNDLRERFGEFTAAKEECQREA 602
++KE D + KEE ++ +
Sbjct: 384 SIKEFQDTLSKL------KEESKKRS 403
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 36.0 bits (84), Expect = 0.028
Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 13/88 (14%)
Query: 484 NTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLE 543
N + R+ +T L+RL E + E + + +ERD +E+ V+ + E
Sbjct: 15 NVQLVERLSST--LRRLEGELASLKDELARLEAERDEARQEI----------VKLTEENE 62
Query: 544 MENKELVEDKKTLSYQIETLKREIASAL 571
E +EL ++ + L ++E L++ + L
Sbjct: 63 -ELRELKKEIEELEKELEDLEQRYETTL 89
Score = 29.1 bits (66), Expect = 7.3
Identities = 20/106 (18%), Positives = 41/106 (38%), Gaps = 29/106 (27%)
Query: 318 QAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQH 377
+ + +LE E +SL+ + + L +++ +E+++L
Sbjct: 23 SSTLRRLEG---ELASLKDELARLEAERDEARQEIVKL---------------------- 57
Query: 378 YMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI 423
+ +K E + L K +DL + T LEL E+ R+
Sbjct: 58 -TEENEELRELKKEIEELEKELEDLEQRYETT---LELLGEKSERV 99
Score = 29.1 bits (66), Expect = 8.0
Identities = 25/105 (23%), Positives = 43/105 (40%), Gaps = 29/105 (27%)
Query: 450 AAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQ 509
+ +R+ + E ++ LA+++ + ++A +EI EE +
Sbjct: 24 STLRRLEG---ELASLKDELARLEAERDEARQEIVKLT-----------EENEELRELKK 69
Query: 510 EYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEM--ENKELVED 552
E +E+EK EDL Q R TLE+ E E VE+
Sbjct: 70 EI-----------EELEKELEDLEQ--RYETTLELLGEKSERVEE 101
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 38.4 bits (89), Expect = 0.034
Identities = 46/179 (25%), Positives = 85/179 (47%), Gaps = 27/179 (15%)
Query: 400 DDLINTHST----AVDKLELAQEEVARIKKQCEDMNQEKNTALRERN-----GLKQQCTA 450
+ L+ H+ AV E A+ E A +K+Q ED+ +K A +R LK +C
Sbjct: 2 EALVKQHAKVAEEAVSGWEKAEAEAAALKQQLEDVTLKKLAA-EDRASHLDGALK-ECMR 59
Query: 451 AIRQWDIALRERNE--YQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAM 508
+RQ +RE E Q+A + ++ E E+ + + K+L R E N A+
Sbjct: 60 QLRQ----VREEQEQKIQDAATEKTKEWEKVKAELEKKLL---ELQKELARAAAE-NDAL 111
Query: 509 QEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREI 567
S + E+ K + +LSE+ ++A +I+ L+ + ++ +L Y++ L +E+
Sbjct: 112 ---SRSLQEKS---KMIMELSEEKSRAESEIEDLKARLESAEKENSSLKYEVHVLSKEL 164
>gnl|CDD|212965 cd12032, SH3_DLG2, Src Homology 3 domain of Disks Large homolog 2.
DLG2, also called postsynaptic density-93 (PSD93) or
Channel-associated protein of synapse-110 (chapsyn 110),
is a scaffolding protein that clusters at synapses and
plays an important role in synaptic development and
plasticity. The DLG2 delta isoform binds inwardly
rectifying potassium Kir2 channels, which determine
resting membrane potential in neurons. It regulates the
spatial and temporal distribution of Kir2 channels within
neuronal membranes. DLG2 is a member of the MAGUK
(membrane-associated guanylate kinase) protein family,
which is characterized by the presence of a core of three
domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
domain in MAGUK proteins is enzymatically inactive;
instead, the domain mediates protein-protein interactions
and associates intramolecularly with the SH3 domain. DLG2
contains three PDZ domains. SH3 domains are protein
interaction domains that bind to proline-rich ligands
with moderate affinity and selectivity, preferentially to
PxxP motifs. They play versatile and diverse roles in the
cell including the regulation of enzymes, changing the
subcellular localization of signaling pathway components,
and mediating the formation of multiprotein complex
assemblies.
Length = 74
Score = 34.7 bits (79), Expect = 0.036
Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 6/71 (8%)
Query: 1332 SFYMRALF--SRTGDLGDALQ-LRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCG 1388
S Y+RA+F ++ D G Q L F DIL+V N + W+A V DG ++ G
Sbjct: 5 SLYVRAMFDYEKSKDSGLPSQGLSFRYGDILHVINASDDE---WWQARRVTPDGDSEEMG 61
Query: 1389 IIPSKYKVEEE 1399
+IPSK +VE +
Sbjct: 62 VIPSKRRVERK 72
>gnl|CDD|185320 PRK15422, PRK15422, septal ring assembly protein ZapB; Provisional.
Length = 79
Score = 34.7 bits (79), Expect = 0.044
Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 7/62 (11%)
Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLR 585
EKL + QA+ I L+ME +EL E +LS +E+ +A H R++ +E N L+
Sbjct: 7 EKLEAKVQQAIDTITLLQMEIEELKEKNNSLS-------QEVQNAQHQREELERENNHLK 59
Query: 586 ER 587
E+
Sbjct: 60 EQ 61
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 38.1 bits (89), Expect = 0.046
Identities = 30/197 (15%), Positives = 72/197 (36%), Gaps = 21/197 (10%)
Query: 376 QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED------ 429
+ K + + ++D+L+ T ++E +E+ + ED
Sbjct: 201 NKNIEEQRKKN--GENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALN 258
Query: 430 -MNQE------------KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQH 476
+N K + E+ G+ CT I + + + + + L ++
Sbjct: 259 KLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKL 318
Query: 477 EDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV 536
+ A+ E+ M + +K L L + + Q ++ + V +E+L +
Sbjct: 319 DTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNA 378
Query: 537 RKIKTLEMENKELVEDK 553
++ L+ E ++V+ K
Sbjct: 379 EELAKLQDELDKIVKTK 395
Score = 33.4 bits (77), Expect = 1.0
Identities = 42/240 (17%), Positives = 88/240 (36%), Gaps = 36/240 (15%)
Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKN---- 435
S ++K KD+ L ++ L +++ + + +K+ + K
Sbjct: 166 SEMDK--LNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYD 223
Query: 436 TALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHED---AVKEINTAMAVRMK 492
+ E +K + E + L + ED A+ ++NTA A ++K
Sbjct: 224 ELVEEAKTIKAE--------------IEELTDELLNLVMDIEDPSAALNKLNTAAA-KIK 268
Query: 493 A-----TKDLKRLTE--ERNAAMQEYS----LIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
+ K +K + Q+ S I +D + KE++ E L A+ +++
Sbjct: 269 SKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKL-KELQHSLEKLDTAIDELEE 327
Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQRE 601
+ E E + L +I T K+ + + + K +L+ F + + Q E
Sbjct: 328 IMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDE 387
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 35.7 bits (83), Expect = 0.053
Identities = 28/156 (17%), Positives = 64/156 (41%), Gaps = 24/156 (15%)
Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEA 468
++ Q E+ R+ ++ ED ++ + + A + +Y+
Sbjct: 1 LEEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEA----------QQKYERE 50
Query: 469 LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKL 528
L +H + ++E+ + K++ +L E +A E S +S ++ + L
Sbjct: 51 L----VKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELS---EAEESWEEQKKML 103
Query: 529 SEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
++L++ ++I+ L +NK L + QIE L
Sbjct: 104 EDELSELEKRIEELNEQNKLLHD-------QIELLS 132
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 37.9 bits (88), Expect = 0.059
Identities = 54/254 (21%), Positives = 86/254 (33%), Gaps = 38/254 (14%)
Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLD----------RELLELRS 357
+ELE R +A A A Q R S L +++RLD REL R
Sbjct: 333 EELERARADAEALQAAAADARQAI---REAESRLEEERRRLDEEAGRLDDAERELRAARE 389
Query: 358 GCCTQPESDDAGSADSLN------QHYMSALNKYETIKDEYD---ALRKRFDDLINTHST 408
E A + + L + + + D A R L+
Sbjct: 390 QLARAAERAGLSPAHTAEPDAALAAQELQELGALDARRQDADRVIAQRSEQVALLRRRDD 449
Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQC-------TAAIRQWDIALRE 461
D+ A+ A + + +++++E A R L + A + W LRE
Sbjct: 450 VADR---AEATHAAARARRDELDEEAEQA-AARAELADEAVHREGARLAWVDAWQAQLRE 505
Query: 462 -----RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
++ ALA + E +AV LT ER A ++
Sbjct: 506 LTILAVDDQPGALADLDSWDALLQGEAPVRVAVYSAVQPLADELTRERAALRLAEEVLEE 565
Query: 517 ERDSVHKEMEKLSE 530
ERD++ E E+L +
Sbjct: 566 ERDALRTERERLEQ 579
Score = 34.0 bits (78), Expect = 0.83
Identities = 35/237 (14%), Positives = 66/237 (27%), Gaps = 31/237 (13%)
Query: 397 KRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWD 456
++D L A D+LE A+EE + + E + +E + L Q
Sbjct: 276 TQYDQLSRDLGRARDELETAREEERELDARTEALEREADALRTRLEAL---------QGS 326
Query: 457 IALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
A ++ E + A A + A + ++ +RL EE
Sbjct: 327 PAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRLDEEAGRLDD------- 379
Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
+L Q R + + E L+ Q + + D D+
Sbjct: 380 -------AERELRAAREQLARAAERAGLSPAHTAEPDAALAAQELQELGALDARRQDADR 432
Query: 577 ALKECNDLRERFGEFTAA----KEECQREAFHARFEYAMRMGERDSSRKERNEVEAA 629
+ + R + + AR E + + + A
Sbjct: 433 VIAQ----RSEQVALLRRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADEA 485
Score = 32.9 bits (75), Expect = 2.1
Identities = 36/208 (17%), Positives = 67/208 (32%), Gaps = 14/208 (6%)
Query: 307 LKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQ-PES 365
+ + A +LE A +E L ++ L + L L L+ Q E
Sbjct: 275 QTQYDQLSRDLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEE 334
Query: 366 DDAGSADSLNQHYMSALNKYETIKDE----YDALRKRFDDLINTHSTAVDKLELAQEEVA 421
+ AD+ +A E + R+R D+ A +L A+E++A
Sbjct: 335 LERARADAEALQ--AAAADARQAIREAESRLEEERRRLDEEAGRLDDAERELRAAREQLA 392
Query: 422 RIKKQC--EDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDA 479
R ++ + + A + Q A+ + + ++ +D
Sbjct: 393 RAAERAGLSPAHTAEPDAALAAQ--ELQELGALDARRQDADRVIAQRSEQVALLRRRDDV 450
Query: 480 VKEINTAMAVRMKATKDLKRLTEERNAA 507
A +A +D L EE A
Sbjct: 451 ADRAEATHA-AARARRD--ELDEEAEQA 475
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 37.6 bits (87), Expect = 0.059
Identities = 45/225 (20%), Positives = 87/225 (38%), Gaps = 35/225 (15%)
Query: 243 LSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKS---------VNSSTQTMPLYV 293
++ R+G+ + +S D K + S ++ ++ V S M +
Sbjct: 411 VTKERDGLRAILNSYD-------KELTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQL 463
Query: 294 RQHNETV----RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLD 349
+ E V R + E++ + Q + L + +++LR K L ++ RL
Sbjct: 464 SELEEDVGQQKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLR 523
Query: 350 RELLELRSGCCTQPESDDAGSAD------SLN---QHYMSALNKYETIKDEYDALRKRFD 400
+E L D ++ SLN + A E ++ E + L++R
Sbjct: 524 QEKSLLEMKLEHLCLQGDYNASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQ 583
Query: 401 DLINTHSTAVDKLE-----LAQEEVARIKKQCEDMNQEKNTALRE 440
L S D + ++ +E+A++KKQ E + KN L+E
Sbjct: 584 ALEEGKSQPGDLEKAVGSHISSKEIAQLKKQVESAEK-KNQRLKE 627
Score = 31.8 bits (72), Expect = 3.7
Identities = 50/282 (17%), Positives = 101/282 (35%), Gaps = 61/282 (21%)
Query: 380 SALNKYETIKDEYDALRKRFDDL-INTHSTAVDKLELAQEEVARIKK-----QCEDMNQE 433
A N+ ++E L+ + D + + D+L+ A+E ++RIK QC N +
Sbjct: 125 KAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNAD 184
Query: 434 KNTAL--RERNGLKQQCTAAIRQWDIALRERNEYQEALA--------------KVQQQHE 477
L E L++Q ++ A ++ A ++++ +
Sbjct: 185 TELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQ 244
Query: 478 D-AVKEINTAMAVRMKATKDLKRLTEERNA---AMQEYSLIMSE---------------R 518
D V + +++ + E N +M+E + ++ E R
Sbjct: 245 DAEVVKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMR 304
Query: 519 DSVHK---EMEKLSEDLTQ-----------------AVRKIKTLEMENKELVEDKKTLSY 558
+ + E EKL +L R+I L+ E +L E ++S
Sbjct: 305 EKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISS 364
Query: 559 QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
+ L+ + +R KA+ E +L+++ A QR
Sbjct: 365 SAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQR 406
Score = 31.8 bits (72), Expect = 3.7
Identities = 41/240 (17%), Positives = 89/240 (37%), Gaps = 18/240 (7%)
Query: 324 LEAAAQESSSLRSKYSDLISDKQRLDRELLE----LRSGCCTQPESDDAGSADS-LNQHY 378
L++ + +R K +DL +K++L+ EL L+ DD L
Sbjct: 293 LQSRLERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEE 352
Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
+ K +I L L AV ++ ++++ +K + +
Sbjct: 353 LQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVT 412
Query: 439 RERNGLKQQCTA-----AIRQWDIALRERNE--------YQEALAKVQQQHEDAVKEINT 485
+ER+GL+ + L +R E Q LAK++ Q + +++
Sbjct: 413 KERDGLRAILNSYDKELTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQ 472
Query: 486 AMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEME 545
++K L E+ + + S + +++ ++E L + + ++ LEM+
Sbjct: 473 QKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMK 532
>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
also known as guanylate kinase (GKase), catalyzes the
reversible phosphoryl transfer from adenosine
triphosphate (ATP) to guanosine monophosphate (GMP) to
yield adenosine diphosphate (ADP) and guanosine
diphosphate (GDP). It plays an essential role in the
biosynthesis of guanosine triphosphate (GTP). This enzyme
is also important for the activation of some antiviral
and anticancer agents, such as acyclovir, ganciclovir,
carbovir, and thiopurines.
Length = 137
Score = 35.6 bits (83), Expect = 0.060
Identities = 26/121 (21%), Positives = 53/121 (43%), Gaps = 24/121 (19%)
Query: 1476 PVILVGALS----DTVTDKLLQDFPDKFV-------RCAP---EIM----HC-PQAAMEK 1516
++L G S T+ +LL++F F R P E+ H + E+
Sbjct: 1 LIVLSGP-SGVGKSTLLKRLLEEFDPNFGFSVSHTTR-KPRPGEVDGVDYHFVSKEEFER 58
Query: 1517 GLADNLFVDYRKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQH--VYPIVL 1574
+ + F+++ + + TS AAV++A + VIL++ + G ++ + + I +
Sbjct: 59 LIENGEFLEWAEFHGNYYGTSKAAVEEALAEGK-IVILEIDVQGARQVKKSYPDAVSIFI 117
Query: 1575 L 1575
L
Sbjct: 118 L 118
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 36.9 bits (86), Expect = 0.097
Identities = 47/226 (20%), Positives = 93/226 (41%), Gaps = 37/226 (16%)
Query: 390 DEYDALRKRFDDLIN--THSTAVDKLEL----AQEEVARIKKQCEDMNQEKNTALRERNG 443
L F+DLI+ D LE A+EE+ ++ K+ ++ E+
Sbjct: 136 PVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEE--EEELERA 193
Query: 444 LKQQCTAAIRQWDIALRERNEYQEA--LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT 501
LK++ + + + L R E +EA +++ + E +E+ +++ +L+R
Sbjct: 194 LKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQ--ELERQA 251
Query: 502 EE-----RNA-AMQEYSL-----------IMSERDSVHKEMEKLS---EDLTQAVRKIKT 541
E +N A+Q L + ER+ ++ +L+ + L +A+
Sbjct: 252 EAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSE 311
Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKAL-KECNDLRE 586
E EN ++ + L +E LK + S + L KE + L+E
Sbjct: 312 AEDENHKV----QQLWLAVEALKSALKSGSAGSPRPLVKELDALKE 353
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 37.1 bits (87), Expect = 0.10
Identities = 17/100 (17%), Positives = 45/100 (45%), Gaps = 10/100 (10%)
Query: 384 KYETIKDEYDALRKRFDDLINT-HSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERN 442
E ++++ + L++ D L+ A ++ A++E I K+ + + +++
Sbjct: 549 LKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKA-- 606
Query: 443 GLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKE 482
+ A ++ L + NE +E K Q++ ++ +K
Sbjct: 607 ---HELIEARKR----LNKANEKKEKKKKKQKEKQEELKV 639
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 36.8 bits (85), Expect = 0.10
Identities = 24/122 (19%), Positives = 52/122 (42%), Gaps = 15/122 (12%)
Query: 453 RQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYS 512
+ A +ER+ Y++ L+K++ Q+ + N +L L +E + E
Sbjct: 36 SELRDAEKERDTYKQYLSKLESQNVE---ISNYE-----ALDSELDELKKEEERLLDELE 87
Query: 513 LIMSERDSVHKEM-------EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
+ E D + E+ E+L + Q +R+ + N +L ++ ++L Q E
Sbjct: 88 ELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLN 147
Query: 566 EI 567
++
Sbjct: 148 QL 149
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 36.9 bits (85), Expect = 0.10
Identities = 56/396 (14%), Positives = 124/396 (31%), Gaps = 54/396 (13%)
Query: 314 RGQHQAAMAQLEAAAQESSSLR----SKYSDLISDKQRLDRELLELRSGCCTQPESDD-- 367
+ ++ + E S L KQ L++EL LR +S
Sbjct: 186 FAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYL 245
Query: 368 ------AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA 421
Q + E ++ + L + + + + L + V
Sbjct: 246 TQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERI--NRARKAAPLAAHIKAVT 303
Query: 422 RIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVK 481
+I++Q + ++ E + +R R L + A ++Q E + L + Q
Sbjct: 304 QIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQ----QSSIEEQRRLLQTLHSQEIHIRD 359
Query: 482 EINTAMAVRMKATKD------LKRLTEERNAAMQEYSLIMSERDSVHKEMEK-LSEDLTQ 534
A ++R + + + L +++ Q+ + E D + +E +
Sbjct: 360 AHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAF 419
Query: 535 AVRKIK--------TLEMENKELVEDKKTLSYQIETLK----REIASALHDRDKALKECN 582
+ + L+ EL T + Q E L+ +E A +L +R++ L+
Sbjct: 420 RDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479
Query: 583 DLRERFGEFTAAKEE------------CQREAFHARFEYAMRMGERDSSRKERNEVEAAS 630
+ + A C + + R +R E A
Sbjct: 480 QIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQ 539
Query: 631 M-----DLFGKCQKERMDNLEANQEIDRLRKHTDKL 661
+ D++ + ER +++ +++ L
Sbjct: 540 LETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSIL 575
Score = 35.7 bits (82), Expect = 0.24
Identities = 40/287 (13%), Positives = 85/287 (29%), Gaps = 24/287 (8%)
Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
+++ ++ A+++ ED+ + + ++R LK+Q + + I + N +E +
Sbjct: 529 RMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPN 588
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
+Q + L + + + + +E+
Sbjct: 589 LQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQ-CSQELALKLTA 647
Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEF 591
L +E V + + R AL++ +E+ +
Sbjct: 648 LHA------LQLTLTQERVREHALSIRVLPKELLAS------RQLALQKMQSEKEQLTYW 695
Query: 592 TAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEI 651
+CQ + +E NE+E AS L NQ +
Sbjct: 696 KEMLAQCQ-TLLRELETHIEE------YDREFNEIENASSSLGSDLAAR---EDALNQSL 745
Query: 652 DRLRKHTD-KLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSER 697
L L+A + N + AE+S + F R
Sbjct: 746 KELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNR 792
>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
Provisional.
Length = 593
Score = 36.5 bits (84), Expect = 0.12
Identities = 46/203 (22%), Positives = 80/203 (39%), Gaps = 33/203 (16%)
Query: 369 GSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCE 428
G A Y +++ K +T K YDA K+ + + L+ A A+ + E
Sbjct: 134 GEAQEATDLYEASIKKTDTAKSVYDAAEKK----LTQAQNKLQSLDPADPGYAQAEAAVE 189
Query: 429 DMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA 488
Q A + L + A ++ A + + L K Q TA A
Sbjct: 190 ---QAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQ----------GTANA 236
Query: 489 VRMKATKDLKRLTEERN-AAMQEYSLIMSER-DSVHKEMEK-LSEDLT--QAVRKIKTLE 543
A+++ E+ N + + +++M+ + V K E+ L DL A+++ + E
Sbjct: 237 ----ASQNQVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAE 292
Query: 544 MENKELVEDKKTLSYQIETLKRE 566
ME KK+ +Q ET K E
Sbjct: 293 ME-------KKSAEFQEETRKAE 308
Score = 31.1 bits (70), Expect = 6.5
Identities = 17/116 (14%), Positives = 40/116 (34%), Gaps = 4/116 (3%)
Query: 473 QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL 532
++ KE TA+ +AT + ++ + A Y + +++ L
Sbjct: 119 KEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKLQSLDPAD 178
Query: 533 TQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERF 588
+ +E KE E K+ ++ A D ++ +++ +F
Sbjct: 179 PGYAQAEAAVEQAGKEATEAKE----ALDKATDATVKAGTDAKAKAEKADNILTKF 230
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 35.8 bits (83), Expect = 0.13
Identities = 32/217 (14%), Positives = 67/217 (30%), Gaps = 27/217 (12%)
Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVAR----IKKQCEDMNQEKNTALR 439
++E KD L ++ L S +L + E+ K + + +
Sbjct: 5 EFEEAKDYILKLEEQLKKL----SKQAQRLVKRRRELGSALGEFGKALIKLAKCEEE--- 57
Query: 440 ERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
L + + + + L + + + + +KE + D
Sbjct: 58 VGGELGEALSKLGKAAE-ELSSL--SEAQANQELVKLLEPLKEYLRYCQAVKETLDD--- 111
Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQ 559
R A+ + + S ++EKL K++ LE E +E + +
Sbjct: 112 ----RADALLTLQSLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKR 167
Query: 560 IETLKREIASAL--HDRDKALKECNDLRERFGEFTAA 594
E + + L ++A DL+ EF
Sbjct: 168 YEEISERLKEELKRFHEERA----RDLKAALKEFARL 200
Score = 31.2 bits (71), Expect = 4.1
Identities = 9/58 (15%), Positives = 23/58 (39%)
Query: 376 QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
AL +++K + + + + + L K+E +EE+ + E+ +
Sbjct: 110 DDRADALLTLQSLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKR 167
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 36.9 bits (85), Expect = 0.13
Identities = 59/370 (15%), Positives = 145/370 (39%), Gaps = 38/370 (10%)
Query: 351 ELLELRSGCCT------QPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLIN 404
+L + CC Q E++ L A +K ++ + E KR D+++
Sbjct: 671 QLTDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLG 730
Query: 405 THSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNE 464
++L ++E+ ++ + + +N++ R +N + E
Sbjct: 731 LAPGRQSIIDLKEKEIPELRNKLQKVNRDIQ---RLKNDI-------------------E 768
Query: 465 YQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKE 524
QE L E++ K T + + + +LK + ER A Q L S+ D ++
Sbjct: 769 EQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDV--ERKIAQQAAKLQGSDLDRTVQQ 826
Query: 525 MEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDL 584
+ + ++ + + + N++L++D++ +++ E+ S L+
Sbjct: 827 VNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQF 886
Query: 585 RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDN 644
E+ E + + RE A+ E + ++ E +S + K ++++++
Sbjct: 887 EEQLVELSTEVQSLIREIKDAK-EQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVND 945
Query: 645 LEANQEIDRLRKHTDKLQAELQDQSVNYLF-----ILTESIQEAEVSKRRRDWAFSERDK 699
++ +++ + + ++ ++QD +YL + T + Q E K + R
Sbjct: 946 IK--EKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLM 1003
Query: 700 IEVIDSYHTK 709
+ ID+ +
Sbjct: 1004 RQDIDTQKIQ 1013
Score = 31.2 bits (70), Expect = 6.5
Identities = 73/393 (18%), Positives = 149/393 (37%), Gaps = 47/393 (11%)
Query: 325 EAAAQESSSLRSKYSDLISDKQRLDRELLELRS-GCCTQPESDDAGSADSLNQHYMSALN 383
+AA + S L + +KQ EL + S + D QH S N
Sbjct: 811 QAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQ---QEQIQHLKSKTN 867
Query: 384 KYETIK---DEYDALRKRFDDLINTHSTAVDKL----ELAQEEVARIKKQCEDMNQEKNT 436
+ ++ K R++F++ + ST V L + A+E+ + ++ E QEK
Sbjct: 868 ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEE 927
Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEYQEAL-AKVQQQHEDAVK----EINTAMAVRM 491
+ + ++ + ++ + Y + + K+Q +D +K E+NT A
Sbjct: 928 LISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLE 987
Query: 492 KATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVE 551
+ K +++ E+ Q+ D+ + L ++LT R+ + E+E +
Sbjct: 988 ECEKHQEKINEDMRLMRQDI-------DTQKIQERWLQDNLTLRKRENELKEVEEELKQH 1040
Query: 552 DKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYA- 610
K+ Q+ +K+E + D + R + + ++E +F A
Sbjct: 1041 LKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFRDAE 1100
Query: 611 ---------MRMGERDSS--RKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTD 659
MR E + ++ A M K +M+ E N+ I L + T
Sbjct: 1101 EKYREMMIVMRTTELVNKDLDIYYKTLDQAIM----KFHSMKME--EINKIIRDLWRST- 1153
Query: 660 KLQAELQDQSVNYLFILTESIQEAEVSKRRRDW 692
+ Q + Y+ I +++ + S +RR++
Sbjct: 1154 -----YRGQDIEYIEIRSDADENVSASDKRRNY 1181
>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 79
Score = 33.4 bits (76), Expect = 0.13
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 525 MEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
EKL + QA+ I L+ME +EL E +LS +++ + + AL ++ LKE
Sbjct: 6 FEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQR-EALERENEQLKE 60
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 36.7 bits (85), Expect = 0.14
Identities = 50/306 (16%), Positives = 94/306 (30%), Gaps = 35/306 (11%)
Query: 394 ALRKRFDDLINTHSTAVDKLELAQEEVARIKKQC--EDMNQEKNTALRERNGLKQQCTAA 451
A I + S AV + + A ++ + D E L ++ ++Q
Sbjct: 503 AQIVFLSAEIKSSSRAVREEKAAVTDIPEELARLLITDELPELAVDLLVQSRIRQHWQQL 562
Query: 452 IRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR--------------MKATKDL 497
+ D + A + + E A+ + KDL
Sbjct: 563 RKALDQLEAAYEALEGRFAAAEAAMAEWQSEWEEALDELGLSRELSPEQQLDILSTMKDL 622
Query: 498 KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE----------DLTQAVRKIKTLEMENK 547
K+L +++ + + + E+ + + +E L L R E++ +
Sbjct: 623 KKLMQKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFIDLSTLFCVQRLRVAAELQKE 682
Query: 548 E---LVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFH 604
E +E + ++ R L +C AA+EE Q
Sbjct: 683 EARLALEGNIERTKELNDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQLTQRE 742
Query: 605 ARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAE 664
+R E E + E S L + KE L ID L + ++L A+
Sbjct: 743 SRLESLEAQLEGVA-----AEAYELSASLDQRELKEEELALLEEA-IDALDEEVEELHAQ 796
Query: 665 LQDQSV 670
+ S
Sbjct: 797 VAALSR 802
Score = 34.0 bits (78), Expect = 0.95
Identities = 42/321 (13%), Positives = 97/321 (30%), Gaps = 40/321 (12%)
Query: 316 QHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLN 375
Q + Q E+ + Y L+ ++ L ELR D +L
Sbjct: 185 QLLEKLKQERNEIDEAEKEYATYHKLLESRRAEHARLAELR-----SELRADRDHIRALR 239
Query: 376 QHY--MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
L +++ ++ E L +R ++L V +LE + + + + + + +
Sbjct: 240 DAVELWPRLQEWKQLEQE---LTRRREELATFPRDGVLRLEKREAHLQKTEAEIDALLVR 296
Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKA 493
+ L ++ + R Q + K A
Sbjct: 297 LAELKDLASQLIP-----AKEAVLQALVRLHQQLSEIKASAFELTETLAGIEADL----- 346
Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
R EE +ER + +E + + R++K E + +++K
Sbjct: 347 -----RDKEEAAGNG-----FEAERVHDLRSLECMLRYQSSQ-RELKQTEAAYCKRLDEK 395
Query: 554 KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFT------AAKEECQREAFHARF 607
+ + E R+ L D ++ ++ ++ RE A
Sbjct: 396 RLFEDEAEEEARQ---RLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQ 452
Query: 608 EYAMRMGERDSSRKERNEVEA 628
+ ++ + +++ E
Sbjct: 453 KQRFLREKQTAFERQKTEHTK 473
Score = 32.5 bits (74), Expect = 2.7
Identities = 44/295 (14%), Positives = 99/295 (33%), Gaps = 27/295 (9%)
Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSA 381
A+++++++ ++ +D+ + RL + D + QH+
Sbjct: 510 AEIKSSSRAVREEKAAVTDIPEELARLLITDE-------LPELAVDLLVQSRIRQHWQQL 562
Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
+ ++ Y+AL RF + + E A +E+ + ++ Q + +
Sbjct: 563 RKALDQLEAAYEALEGRFAAAEAAMAEWQSEWEEALDELG-LSRELSPEQQLDILSTMKD 621
Query: 442 NGLKQQCTAAI-RQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
Q A + Q E+ ++E + + E +++T V + +
Sbjct: 622 LKKLMQKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFIDLSTLFCV-QRLRVAAELQ 680
Query: 501 TEERNAAMQEYSLIMSE--------RDSVHKEMEKL--------SEDLTQAVRKIKTLEM 544
EE A++ E + KE+ L + +A R+ + L
Sbjct: 681 KEEARLALEGNIERTKELNDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQLTQ 740
Query: 545 ENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQ 599
L + L + E++++L R+ +E L E EE
Sbjct: 741 RESRLESLEAQLE-GVAAEAYELSASLDQRELKEEELALLEEAIDALDEEVEELH 794
>gnl|CDD|139585 PRK13460, PRK13460, F0F1 ATP synthase subunit B; Provisional.
Length = 173
Score = 35.0 bits (80), Expect = 0.14
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 11/126 (8%)
Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLI 514
WD+ L+ +E + +IN A +R++A LK N+A E + I
Sbjct: 38 WDVILKALDERASGVQN----------DINKASELRLEAEALLKDYEARLNSAKDEANAI 87
Query: 515 MSE-RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHD 573
++E + K KL E+ V+ K ++ EL + K Q + ++ I A
Sbjct: 88 VAEAKSDALKLKNKLLEETNNEVKAQKDQAVKEIELAKGKALSQLQNQIVEMTITIASKV 147
Query: 574 RDKALK 579
+K LK
Sbjct: 148 LEKQLK 153
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 36.2 bits (83), Expect = 0.15
Identities = 56/265 (21%), Positives = 109/265 (41%), Gaps = 20/265 (7%)
Query: 375 NQHYMSALNKYET-IKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
N +++A+ E +D++ + K D L+ V LE+ E+A+ K C +++
Sbjct: 129 NNRFIAAIILSEKQCQDQFKDMNKSCDALLFKLGEKVKTLEM---EIAKEKAIC---SKD 182
Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKA 493
K + L + ++Q A + ++ +E+ +E L KVQ K+ A +
Sbjct: 183 KESLLAGKRQAEEQLEACGKARELQHQEQQLAEENLQKVQALCIPLDKDKFEADLLNAWR 242
Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKI-KTLEMENKELVED 552
+ R + N + + E S+ + + L ++ + ++ + L + + + +
Sbjct: 243 DSIIPRSLD--NLGYNLHHPLGPELASIRRACDHLPGIMSSKIEELARGLRADIERVARE 300
Query: 553 KKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMR 612
L Q ++ +A + KA KE RE + EC R+ A E A
Sbjct: 301 NADLQRQKLEAEQAAQAAQEAKAKAEKEAQ-ARE-----AKLQAECARQTQLALEEKAAL 354
Query: 613 MGERDSSRKE----RNEVEAASMDL 633
ERD+ KE + E E M+L
Sbjct: 355 RAERDNLAKELEAKKREAEQLRMEL 379
>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically periplasmic,
contain C-terminal PDZ domain [Posttranslational
modification, protein turnover, chaperones].
Length = 347
Score = 36.0 bits (83), Expect = 0.16
Identities = 17/53 (32%), Positives = 25/53 (47%)
Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPADKVTV 1313
G V V GS AG++ GD I NG + + + AA +P D+V +
Sbjct: 271 GAVVLGVLPGSPAAKAGIKAGDIITAVNGKPVASLSDLVAAVASNRPGDEVAL 323
Score = 34.1 bits (78), Expect = 0.63
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 200 DHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDV 259
GL + +G + + PGS AAK G + GD + ++N + V LS +A +V
Sbjct: 263 ALGLPVAAGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNRPGDEV 321
Score = 34.1 bits (78), Expect = 0.63
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 824 DHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDV 883
GL + +G + + PGS AAK G + GD + ++N + V LS +A +V
Sbjct: 263 ALGLPVAAGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNRPGDEV 321
>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
and chromosome partitioning].
Length = 212
Score = 35.1 bits (81), Expect = 0.17
Identities = 25/190 (13%), Positives = 59/190 (31%), Gaps = 22/190 (11%)
Query: 388 IKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE-KNTALRERNGLKQ 446
++E D DD+I D E +E ++ + +++ +E K A E + Q
Sbjct: 24 DEEEVDEF---LDDVI-------DDYEQLLDENEDLEDEIDELKEELKEAADAEDSQAIQ 73
Query: 447 QCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNA 506
Q + + + + K V A R + + E+
Sbjct: 74 QAETEAEE---LKQAAEAEADDILKRASAQAQRVFGKAQYKADRYLEDAKIAQELEDLKR 130
Query: 507 AMQEY-----SLIMSERDS-VHKEMEKLSEDLTQAVRKI--KTLEMENKELVEDKKTLSY 558
Q S I ++ S ++ +++ V + + + + +
Sbjct: 131 QAQVERQRLRSDIEAQLASAKQEDWDEILRSTVDEVEAANEEAERLADAAQADADRLRDE 190
Query: 559 QIETLKREIA 568
+ ++A
Sbjct: 191 CDIYVDTKLA 200
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 35.1 bits (81), Expect = 0.17
Identities = 18/80 (22%), Positives = 31/80 (38%)
Query: 509 QEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA 568
+ + + ER + KE+ + + +I LE + EL L + E +EIA
Sbjct: 103 DDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIA 162
Query: 569 SALHDRDKALKECNDLRERF 588
+ K +E D F
Sbjct: 163 RLKSEASKIKQELVDAELEF 182
Score = 30.9 bits (70), Expect = 3.8
Identities = 34/150 (22%), Positives = 57/150 (38%), Gaps = 34/150 (22%)
Query: 523 KEMEKLSEDLTQAV--RKIKTLEMENKELVE---DKKTLSYQIETLKREIASALHDRDKA 577
+E++KL D + + K EL + + + L ++ L S D+ K
Sbjct: 57 EEVKKLQIDDSISEFESLSKCF----SELEKHGFNVQALQSRLNKL----LSLKDDQTKK 108
Query: 578 LKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKC 637
L+E R+ + A KE ++E E ++ E +R K
Sbjct: 109 LEE----RKGLEKEIAEKEISRQELDSEIAELERKILE-----LQRQAALL-------KE 152
Query: 638 QKERMDNLEANQEIDRLRKHTDKLQAELQD 667
+KE D EI RL+ K++ EL D
Sbjct: 153 KKEAEDK-----EIARLKSEASKIKQELVD 177
>gnl|CDD|219499 pfam07653, SH3_2, Variant SH3 domain. SH3 (Src homology 3) domains
are often indicative of a protein involved in signal
transduction related to cytoskeletal organisation. First
described in the Src cytoplasmic tyrosine kinase. The
structure is a partly opened beta barrel.
Length = 53
Score = 32.1 bits (74), Expect = 0.18
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 12/65 (18%)
Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
Y R +F D +L K D++ V + W W + G R G++PS
Sbjct: 1 YGRVIFDYVAT--DPNELSLKKGDVVKV----LDKDDNGW--WEGERGGRR---GLVPSS 49
Query: 1394 YKVEE 1398
Y VEE
Sbjct: 50 Y-VEE 53
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 36.1 bits (83), Expect = 0.19
Identities = 54/337 (16%), Positives = 135/337 (40%), Gaps = 39/337 (11%)
Query: 372 DSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMN 431
D + Y S L + + L+ F+ ++ +T + L+ + + ++ ++
Sbjct: 235 DYFTESYKSFLKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKIS 294
Query: 432 QEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRM 491
Q+ T + LK + ++ E+ L K++ + E +EI +
Sbjct: 295 QKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKA-----L 349
Query: 492 KATKD-LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELV 550
++ D L + ++ + +++ L+ ER+ + +E++K++ + + +K+ ++E + +
Sbjct: 350 QSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIF 409
Query: 551 EDKKTLSYQIETLKREI----ASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHAR 606
+ + Q ++L + I + H+ + + + N E+ + E + +
Sbjct: 410 KSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINI--EQLFPKGSGINESIK-----K 462
Query: 607 FEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEA--NQEIDRLRKHTDKLQAE 664
+ D ++ E S+ L +E + NL+ N+ L K +L
Sbjct: 463 SILEL----NDEIQERIKTEENKSITL-----EEDIKNLKHDINELTQILEKLELEL--- 510
Query: 665 LQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIE 701
N F L++ E E+ +R E +K+E
Sbjct: 511 ---SEANSKFELSKEENERELVAQR-----IEIEKLE 539
Score = 31.9 bits (72), Expect = 4.0
Identities = 44/316 (13%), Positives = 108/316 (34%), Gaps = 20/316 (6%)
Query: 280 KSVNSSTQTMPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYS 339
K+ N + + ++ ++ + L+ +++ + ++ +QE K
Sbjct: 277 KTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLK 336
Query: 340 DLISDKQRLDRELLELRSGCCTQPESDDAGSAD--SLNQHYMSALNKYETIKDEYDALRK 397
I K+ + L Q + +NQ + + I + D L K
Sbjct: 337 SEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTK 396
Query: 398 RFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWD- 456
KLE AQ ++K + R R+ + + + +
Sbjct: 397 SVKS---------RKLE-AQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINI 446
Query: 457 -----IALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
++++ ++ + ++ +K + K+LK E +++
Sbjct: 447 EQLFPKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDINELTQILEKL 506
Query: 512 SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
L +SE +S + ++ +E A ++I+ ++E EL + I ++ + S
Sbjct: 507 ELELSEANSKFELSKEENERELVA-QRIEIEKLEK-ELNDLNLLSKTSILDAEQLVQSTE 564
Query: 572 HDRDKALKECNDLRER 587
D+ + N R +
Sbjct: 565 IKLDELKVDLNRKRYK 580
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 36.0 bits (83), Expect = 0.20
Identities = 37/161 (22%), Positives = 66/161 (40%), Gaps = 15/161 (9%)
Query: 395 LRKRFDDLINTHSTAVDKLELAQEEV----ARIKKQCEDMNQEKNTALRERNGLKQQCTA 450
LR+R + + +L+ EEV R+K+ E Q+ AL E G ++ A
Sbjct: 94 LRER----LQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELA 149
Query: 451 AIRQWDIALR---ERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN-- 505
A+ + R R ++ QQ ++ A + +A L+ +ERN
Sbjct: 150 ALSREVGKQRGLLSRGLATFKRDRILQQQWR-EEQEKYDAADKARAIYALQTKADERNLE 208
Query: 506 AAMQEYSLIMSERDSV-HKEMEKLSEDLTQAVRKIKTLEME 545
+Q S + V KE+E + + +A +I+ LE +
Sbjct: 209 TVLQSLSQADFQLAGVAEKELETVEARIKEARYEIEELENK 249
>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
proteases 1 [Cell envelope biogenesis, outer membrane].
Length = 375
Score = 35.6 bits (82), Expect = 0.20
Identities = 21/76 (27%), Positives = 29/76 (38%), Gaps = 4/76 (5%)
Query: 1252 ISLVGGNAVGI--YVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPAD 1309
+ V G V V S AGLR GDRI+ +G + A+ ++ L A
Sbjct: 119 LFFVIGLVPVASPVVGEVAPKSAAALAGLRPGDRIVAVDGEKV--ASWDDVRRLLVAAAG 176
Query: 1310 KVTVLAQYSVERYNEV 1325
V L V R +
Sbjct: 177 DVFNLLTILVIRLDGE 192
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 34.1 bits (79), Expect = 0.22
Identities = 20/98 (20%), Positives = 41/98 (41%), Gaps = 2/98 (2%)
Query: 390 DEYDALRKRFDDLINTHSTAVDKLELA-QEEVARIKKQCEDMNQEKNTALRERNGLKQQC 448
A +K+ + + K E Q+E +++KQ +++E A ++ KQQ
Sbjct: 31 PAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQE 90
Query: 449 TAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTA 486
+Q + + QE L + + + A+KE+
Sbjct: 91 LQQKQQ-AAQQELQQKQQELLQPIYDKIDKAIKEVAKE 127
Score = 29.1 bits (66), Expect = 9.9
Identities = 23/103 (22%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 483 INTAMAVR-MKATKDL-KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
++ + A K K+L +E E E ++++K + L++ RK K
Sbjct: 21 VDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAK 80
Query: 541 TLEMENK--ELVEDKKTLSYQIETLKREIASALHDR-DKALKE 580
E++ K EL + ++ +++ ++E+ ++D+ DKA+KE
Sbjct: 81 QQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKE 123
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 33.7 bits (78), Expect = 0.26
Identities = 19/99 (19%), Positives = 36/99 (36%), Gaps = 18/99 (18%)
Query: 463 NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE---RNAAMQE--YSLIMSE 517
+ L +++Q+H+ R+ + + L E+ + SL +E
Sbjct: 31 GDISPFLPELRQKHKA-----------RIAKDPEFQYLNEDIAWLKEERDDKTVSLNEAE 79
Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLE--MENKELVEDKK 554
R + +E EK RK E +L +DK+
Sbjct: 80 RKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKE 118
>gnl|CDD|212785 cd11851, SH3_RIM-BP, Src homology 3 domains of Rab3-interacting
molecules (RIMs) binding proteins. RIMs binding proteins
(RBPs, RIM-BPs) associate with calcium channels present
in photoreceptors, neurons, and hair cells; they interact
simultaneously with specific calcium channel subunits,
and active zone proteins, RIM1 and RIM2. RIMs are part of
the matrix at the presynaptic active zone and are
associated with synaptic vesicles through their
interaction with the small GTPase Rab3. RIM-BPs play a
role in regulating synaptic transmission by serving as
adaptors and linking calcium channels with the synaptic
vesicle release machinery. RIM-BPs contain three SH3
domains and two to three fibronectin III repeats.
Invertebrates contain one, while vertebrates contain at
least two RIM-BPs, RIM-BP1 and RIM-BP2. RIM-BP1 is also
called peripheral-type benzodiazapine receptor associated
protein 1 (PRAX-1). Mammals contain a third protein,
RIM-BP3. RIM-BP1 and RIM-BP2 are predominantly expressed
in the brain where they display overlapping but distinct
expression patterns, while RIM-BP3 is almost exclusively
expressed in the testis and is essential in
spermiogenesis. The SH3 domains of RIM-BPs bind to the
PxxP motifs of RIM1, RIM2, and L-type (alpha1D) and
N-type (alpha1B) calcium channel subunits. SH3 domains
are protein interaction domains that bind to proline-rich
ligands with moderate affinity and selectivity. They play
versatile and diverse roles in the cell including the
regulation of enzymes, changing the subcellular
localization of signaling pathway components, and
mediating the formation of multiprotein complex
assemblies.
Length = 62
Score = 31.9 bits (73), Expect = 0.26
Identities = 18/71 (25%), Positives = 26/71 (36%), Gaps = 24/71 (33%)
Query: 1335 MRALFS---RTGDLGDAL--QLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQ---- 1385
M AL+ T D +L FH D++ V M D DGF
Sbjct: 2 MVALYDYNPETMSPNDDPEEELSFHAGDVVRVYGPM-------------DEDGFYYGELE 48
Query: 1386 --QCGIIPSKY 1394
+ G++PS +
Sbjct: 49 GGRKGLVPSNF 59
>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
[General function prediction only].
Length = 558
Score = 35.5 bits (82), Expect = 0.29
Identities = 21/113 (18%), Positives = 35/113 (30%), Gaps = 4/113 (3%)
Query: 1243 ETRKCSNLGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAA--TAEEA 1300
+ R+ LG+ V + V G Y AGL GD+I+ NG + +
Sbjct: 446 KPREAYYLGLK-VKSEGGHEKITFVFPGGPAYKAGLSPGDKIVAINGISDQLDRYKVNDK 504
Query: 1301 AYELAKPADK-VTVLAQYSVERYNEVKDKPGDSFYMRALFSRTGDLGDALQLR 1352
+ L + + + P L + LG+ R
Sbjct: 505 IQVHVFREGRLREFLVKLGGDPTAQYIILPIGDRNPAQLANAAAWLGEIASAR 557
>gnl|CDD|153337 cd07653, F-BAR_CIP4-like, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein
4 and similar proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. This subfamily is composed of
Cdc42-Interacting Protein 4 (CIP4), Formin Binding
Protein 17 (FBP17), FormiN Binding Protein 1-Like
(FNBP1L), and similar proteins. CIP4 and FNBP1L are
Cdc42 effectors that bind Wiskott-Aldrich syndrome
protein (WASP) and function in endocytosis. CIP4 and
FBP17 bind to the Fas ligand and may be implicated in
the inflammatory response. CIP4 may also play a role in
phagocytosis. Members of this subfamily typically
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. In addition, some members such as FNBP1L contain
a central Cdc42-binding HR1 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 251
Score = 34.9 bits (81), Expect = 0.30
Identities = 35/155 (22%), Positives = 72/155 (46%), Gaps = 9/155 (5%)
Query: 400 DDLINTHSTAVDKLELAQEEVAR-IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
+D+ H + L V + +K ++ QE+ L E + L+Q+ ++I+Q + +
Sbjct: 75 NDIAGQHELIAENL---NSNVCKELKTLISELRQERKKHLSEGSKLQQKLESSIKQLEKS 131
Query: 459 LRE-RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
+ ++EA K +Q++E A ++N A KA + T+ A EY+ + +
Sbjct: 132 KKAYEKAFKEAE-KAKQKYEKADADMNLTKADVEKAKANANLKTQAAEEAKNEYAAQLQK 190
Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEME-NKELVE 551
+ ++ + S DL Q K++ L+ + VE
Sbjct: 191 FN--KEQRQHYSTDLPQIFDKLQELDEKRINRTVE 223
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 33.8 bits (78), Expect = 0.30
Identities = 28/113 (24%), Positives = 53/113 (46%), Gaps = 7/113 (6%)
Query: 458 ALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
A R + E Q LA+ +Q+ E+A ++ A + +A K+ +++ EE A +E + E
Sbjct: 49 AERLKEEAQALLAEYEQELEEAREQ---ASEIIEQAKKEAEQIAEEIKAEAEEELERIKE 105
Query: 518 RD--SVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA 568
+ E E+ E+L V ++ E +L+ K + Q + + IA
Sbjct: 106 AAEAEIEAEKERALEELRAEVAELAVAIAE--KLLGKKVDEAAQKDLIDAFIA 156
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 34.3 bits (79), Expect = 0.36
Identities = 42/216 (19%), Positives = 85/216 (39%), Gaps = 28/216 (12%)
Query: 458 ALRERNEYQEALAKVQQQHEDAVKE----INTAMAVRMK---ATKDLKRLTEERNAAMQE 510
A RE+ E +E + ++++ A KE TA+ + K ++ + L ++ + +E
Sbjct: 3 AEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEE 62
Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
+ E + +E E+L ++ +A ++ LE E ++ + L++E+ A
Sbjct: 63 NRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAET-------RQLQQELREA 115
Query: 571 LHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAAS 630
++A R+ E AA A E + E +
Sbjct: 116 QEAHERA-------RQELLEAAAAPTAPPHVAAPVNGEQLEP-DDNGEEASADLETDPDM 167
Query: 631 MDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
D ++ER+ E N+ RL+ L++EL
Sbjct: 168 KDR---SEEERVTYAEKNE---RLQTQLQALKSELA 197
Score = 32.8 bits (75), Expect = 1.4
Identities = 42/203 (20%), Positives = 76/203 (37%), Gaps = 20/203 (9%)
Query: 381 ALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE 440
L +YE E + K+ ++ LE +E+ ++ E +E + E
Sbjct: 27 ELEEYEETALELEEKLKQEEE-------EAQLLEKKADELEEENRRLE---EEAAASEEE 76
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
R L+ + A + +A E E K + + +E+ A +A ++L
Sbjct: 77 RERLEAEVDEATAE--VAKLE-----EEREKKEAETRQLQQELREAQEAHERARQELLEA 129
Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQI 560
A + + E+ E+ S DL E E E + L Q+
Sbjct: 130 AAAPTAPPHVAAPVNGEQLEPDDNGEEASADLETDPDMKDRSEEERVTYAEKNERLQTQL 189
Query: 561 ETLKREIASALHDRDKALKECND 583
+ LK E+A+A RD++ + ND
Sbjct: 190 QALKSELAAA---RDESKETAND 209
>gnl|CDD|150503 pfam09840, DUF2067, Uncharacterized protein conserved in archaea
(DUF2067). This domain, found in various archaeal
proteins, has no known function.
Length = 190
Score = 33.9 bits (78), Expect = 0.38
Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 3/72 (4%)
Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTL 556
L+RL+ Y + + + + ++ ++L +A+R++K L E + K
Sbjct: 15 LERLSRLLKDI---YLSVEIKGNKLKVKVFGTEKELKEAIRRVKELAREVRRKYRRKGLF 71
Query: 557 SYQIETLKREIA 568
Y + L RE+
Sbjct: 72 EYPLSDLFRELG 83
>gnl|CDD|226081 COG3551, COG3551, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 402
Score = 35.0 bits (80), Expect = 0.39
Identities = 21/155 (13%), Positives = 55/155 (35%), Gaps = 24/155 (15%)
Query: 388 IKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQ 447
++ E+ + + + + + R +Q D+
Sbjct: 268 LEREFSRAMPLLEYVGELLWQSASSHNES---MTRFSEQVADLAMS-------------- 310
Query: 448 CTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAA 507
A RQ + LR+R A +Q+E A++E++ ++V + +R E +
Sbjct: 311 -YTAQRQLNDGLRDRLLEAGARD---EQNERAMRELSRRLSVCREELVSAQRNLAETDNL 366
Query: 508 MQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTL 542
+++ + ++ ++ T+ +R + L
Sbjct: 367 LRDRDDTHARLMAI---LDSRFWRFTKPLRNLSRL 398
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
and secretion / Signal transduction mechanisms].
Length = 408
Score = 34.6 bits (79), Expect = 0.47
Identities = 21/112 (18%), Positives = 53/112 (47%), Gaps = 3/112 (2%)
Query: 463 NEYQEALAKVQQQHEDAVKEINT---AMAVRMKATKDLKRLTEERNAAMQEYSLIMSERD 519
E + ++Q + DAV+ + ++ +K ++ E AA++E S ++SE
Sbjct: 292 KEIGLLIEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIA 351
Query: 520 SVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
+ +E + E++ ++ ++ + EN VE+ S +++ L ++ +
Sbjct: 352 AATEEQTAVLEEINASIEELDDVTQENAAAVEELAAASEELKELAEKLLELV 403
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 31.5 bits (72), Expect = 0.50
Identities = 16/48 (33%), Positives = 28/48 (58%)
Query: 523 KEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
+E EKLS+ + IK L +NKEL ++ L ++E L++E+ +
Sbjct: 19 EEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENL 66
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 34.1 bits (79), Expect = 0.52
Identities = 64/327 (19%), Positives = 132/327 (40%), Gaps = 50/327 (15%)
Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLK 445
+ IK E +R D+++ ++L+ EE R +K+ E+ + + L+E+ ++
Sbjct: 30 KRIKAEEKEEERRIDEMME-----EERLKALAEEEERERKRKEERREGRAV-LQEQ--IE 81
Query: 446 QQCTAAIRQWDIALRERNEYQEALAKVQQQHE-----------DAVKEINTAMAVRMKAT 494
++ +++ L+ER + E + ++Q++ E +EI+ R++
Sbjct: 82 EREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERK 141
Query: 495 KDLKRLTEERNAAMQEYSL--------IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN 546
++ K E + EY +ER +E E+ L + + E
Sbjct: 142 EEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREEL 201
Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEEC---QREAF 603
EL D YQ E ++E + +K ++ +L+ E KEE +R
Sbjct: 202 DELRADL----YQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257
Query: 604 HARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQA 663
A E + D E E E A +K RM LE +E+++ + ++ +A
Sbjct: 258 EAERERMLEKQAED----EELEQENA--------EKRRMKRLEHRRELEQQIEEKEERRA 305
Query: 664 ELQDQSVNYLFILTESIQEAEVSKRRR 690
+++ + E ++E E ++ R
Sbjct: 306 AEREEELEEG----ERLREEEAERQAR 328
Score = 32.2 bits (74), Expect = 2.6
Identities = 24/136 (17%), Positives = 51/136 (37%), Gaps = 5/136 (3%)
Query: 413 LELAQEEVARIKKQCEDMNQEKNTALRE--RNGLKQQCTAAIRQWDI-ALRERNEYQEAL 469
+L QEE R ++Q E EK ++ + ++Q + E E + L
Sbjct: 206 ADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERML 265
Query: 470 AKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAA-MQEYSLIMSERDSVHKEMEKL 528
K + E + R++ ++L++ EE+ E + E + + +E +
Sbjct: 266 EKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEAER 325
Query: 529 SEDLTQA-VRKIKTLE 543
+ + R +K
Sbjct: 326 QARIEEERQRLLKEHA 341
>gnl|CDD|238491 cd00991, PDZ_archaeal_metalloprotease, PDZ domain of archaeal zinc
metalloprotases, presumably membrane-associated or
integral membrane proteases, which may be involved in
signalling and regulatory mechanisms. May be responsible
for substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, and binding to
internal (non-C-terminal) polypeptides and even to lipids
has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 79
Score = 31.6 bits (72), Expect = 0.53
Identities = 18/54 (33%), Positives = 25/54 (46%)
Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPADKVTVL 1314
G+ + V GS +A L TGD I NGT + A + KP + +TV
Sbjct: 11 GVVIVGVIVGSPAENAVLHTGDVIYSINGTPITTLEDFMEALKPTKPGEVITVT 64
Score = 30.4 bits (69), Expect = 1.3
Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 1/50 (2%)
Query: 221 AKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSL 270
A+ L GD + SIN + L E + +V+T+T L T+
Sbjct: 23 AENAVLHTGDVIYSINGTPITTLEDFMEALKPTKPG-EVITVTVLPSTTK 71
Score = 30.4 bits (69), Expect = 1.3
Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 1/50 (2%)
Query: 845 AKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSL 894
A+ L GD + SIN + L E + +V+T+T L T+
Sbjct: 23 AENAVLHTGDVIYSINGTPITTLEDFMEALKPTKPG-EVITVTVLPSTTK 71
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 33.8 bits (78), Expect = 0.61
Identities = 31/132 (23%), Positives = 61/132 (46%), Gaps = 12/132 (9%)
Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE----RNAA--MQEYSLIMSERDS 520
E L +++ E K++N +++ K K L+ L EE R A + + E +
Sbjct: 154 ENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPL--ELEK 211
Query: 521 VHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA----LHDRDK 576
+E+ LS +++ ++++ L+ E +EL + L+ + L EIA A R
Sbjct: 212 ARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGW 271
Query: 577 ALKECNDLRERF 588
+ KE + L+ +
Sbjct: 272 SAKEISKLKAKV 283
Score = 31.2 bits (71), Expect = 4.6
Identities = 21/128 (16%), Positives = 46/128 (35%), Gaps = 26/128 (20%)
Query: 375 NQHYMSALNKYETIKDE----YDALRKRFDDLIN----THSTAVDKLELAQEE------- 419
+ LN +IK + AL++ L + +LE A++E
Sbjct: 163 EELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVK 222
Query: 420 VARIKKQCEDMNQEKN---TALRERNGLKQQCTAAIRQWDIALRER--------NEYQEA 468
++ +KQ E++ QE A+ K + I + + E ++ +
Sbjct: 223 ISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKEISKLKAK 282
Query: 469 LAKVQQQH 476
++ +Q+
Sbjct: 283 VSLLQKLT 290
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 33.5 bits (77), Expect = 0.69
Identities = 35/176 (19%), Positives = 67/176 (38%), Gaps = 27/176 (15%)
Query: 458 ALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
A +ER EY+E ++ QQ E + A A + +RL +E A +E I +
Sbjct: 55 AEQERREYEEKNEELDQQREVLL---TKAKEE---AQAERQRLLDE---AREEADEIREK 105
Query: 518 R-DSVHKEMEKLSEDLTQAVRKI------KTLEMENKELVEDK--KTLSYQIETLKRE-- 566
+++ +E LS++L + K L +E++ ++ TL +
Sbjct: 106 WQEALRREQAALSDELRRRTGAEVFAIARKVLTDLADTDLEERMVDVFVQRLRTLDPDEK 165
Query: 567 --IASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSR 620
+A AL D + +R F +E+ R+ +R+ +
Sbjct: 166 AALAEALADSGNPVL----VRSAFELPEEQREQI-RDTIRETLGPEIRLRFQTEPD 216
Score = 32.7 bits (75), Expect = 1.2
Identities = 26/93 (27%), Positives = 37/93 (39%), Gaps = 19/93 (20%)
Query: 583 DLRER--FGEFTAAKEECQREAFHARFEYAMRMGERDSSR-----KERNEVEAASMDLFG 635
D RE+ GE A + +REA R EY + E D R K + E +A L
Sbjct: 35 DAREKKIAGELADA-DTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLD 93
Query: 636 KCQKERMDNLEANQEIDRLRKH-TDKLQAELQD 667
EA +E D +R+ + L+ E
Sbjct: 94 ----------EAREEADEIREKWQEALRREQAA 116
>gnl|CDD|238490 cd00990, PDZ_glycyl_aminopeptidase, PDZ domain associated with
archaeal and bacterial M61 glycyl-aminopeptidases. May be
responsible for substrate recognition and/or binding, as
most PDZ domains bind C-terminal polypeptides, and
binding to internal (non-C-terminal) polypeptides and
even to lipids has been demonstrated. In this subfamily
of protease-associated PDZ domains a C-terminal
beta-strand is presumed to form the peptide-binding
groove base, a circular permutation with respect to PDZ
domains found in Eumetazoan signaling proteins.
Length = 80
Score = 31.2 bits (71), Expect = 0.73
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 1141 LGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMR 1184
LG+ + + G V+ V + S A + GL GD+L+ V G +
Sbjct: 3 LGLTLDK-EEGLGKVTFVRDDSPADKAGLVAGDELVAVNGWRVD 45
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved in
a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 31.8 bits (73), Expect = 0.80
Identities = 21/81 (25%), Positives = 36/81 (44%), Gaps = 15/81 (18%)
Query: 462 RNEYQEALAKVQQQHEDA-VKEINTAMAV---------RMKATKDLKRLTEERNAAMQEY 511
R E + AL ++++ EDA ++ MA R KAT+ LK+L ++ E
Sbjct: 23 RVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLKQLEKKLEETSDE- 81
Query: 512 SLIMSERDSVHKEMEKLSEDL 532
E + +++ K DL
Sbjct: 82 ----KEIKDLKEQLSKAEVDL 98
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription factor
N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 33.7 bits (77), Expect = 0.82
Identities = 38/193 (19%), Positives = 55/193 (28%), Gaps = 37/193 (19%)
Query: 887 TTLKPTSLGEHSPRVHRKSVNSSTQTMPLYVRYGDQPPPDSYQVPKTSTKSH------DP 940
+ + P S +S PPP S ++ +P
Sbjct: 141 SPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPPPMSSPDSSYPSEHRFQRQLSEP 200
Query: 941 AKPSPSSSGGGGKWDSLREKMQGVRRRGGGEEGKRSNRNS-SPPVTFEQEQVDAIAELDS 999
P P G G R+ R+ S PP F+QE D + E
Sbjct: 201 CLPFPPPPGRGS-----RDGRPPYHRQM-------SEPLVPYPPQGFKQEYHDPLYE--- 245
Query: 1000 FVHTHSPSDGSSPHY---HKAPR---------SLASAHSHPGG-PNNGAGGTSYETSSYH 1046
P+ G PH + PR S++ G +N G SY+
Sbjct: 246 --EAGVPNQGPFPHPMMIKQEPRDFTYDSEVPGCHSSYGRAEGFYSNRHEGFSYDKEPRL 303
Query: 1047 SYDAAGTFPRKKE 1059
YD P + E
Sbjct: 304 FYDDTCVVPERLE 316
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 33.9 bits (78), Expect = 0.84
Identities = 20/112 (17%), Positives = 39/112 (34%), Gaps = 5/112 (4%)
Query: 523 KEMEKLSEDLTQAVRKIKT--LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
KE+ + +L + +RK+ E +E + L +I+ L EI D+ +
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 581 CNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMD 632
E G +E A + +R G+ + + + D
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRAY---LRGGDDALAEERKALSTGTDSD 110
>gnl|CDD|219925 pfam08598, Sds3, Sds3-like. Repression of gene transcription is
mediated by histone deacetylases containing
repressor-co-repressor complexes, which are recruited to
promoters of target genes via interactions with
sequence-specific transcription factors. The
co-repressor complex contains a core of at least seven
proteins. This family represents the conserved region
found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
Length = 184
Score = 32.7 bits (75), Expect = 0.91
Identities = 23/132 (17%), Positives = 47/132 (35%), Gaps = 22/132 (16%)
Query: 379 MSALNKYETIKDEYDALRKR-FDDLINTHSTAVDKLELAQEEVA----RIKKQCEDMNQE 433
AL++ I+ ++ LR + + + + T +LEL + K E+ +
Sbjct: 1 QEALDELAKIEKQFAELRDQLYRERLAQLQT---ELELLLQGTHPEYLEPLKDLEERRDD 57
Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKA 493
+ R K +C EY+ +Q+ E + + + +
Sbjct: 58 RLKVAELRREYKLECI------------EREYEAERQAAKQEFEKEKRLLRERL--LEEL 103
Query: 494 TKDLKRLTEERN 505
+ + RL EER
Sbjct: 104 EEKIYRLEEERR 115
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3 ubiquitin
ligase that has been shown to act as a transcriptional
activator through direct activator interactions.
Length = 96
Score = 31.1 bits (71), Expect = 0.94
Identities = 17/61 (27%), Positives = 33/61 (54%), Gaps = 3/61 (4%)
Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQ---AVRKIKTLEMENKELVEDK 553
RLT E+ A Q+Y M +D++ E +KL+ L + + ++K LE E + +++
Sbjct: 33 KLRLTAEKAKADQKYFAAMRSKDALDAENKKLNTQLNKSSELIEQLKDLENELRRRIKEL 92
Query: 554 K 554
+
Sbjct: 93 E 93
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 33.8 bits (77), Expect = 0.96
Identities = 33/177 (18%), Positives = 67/177 (37%), Gaps = 16/177 (9%)
Query: 395 LRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQ 454
+ A K+E A+ E + +M + +++ G + +A R+
Sbjct: 300 QETLVRQFTVEQTQAKSKVESAKIE----TDRAREMETLAHDNVKQIVGAQHGILSAKRE 355
Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLI 514
A+ +R A A + VK + A A + + L E NAA
Sbjct: 356 G--AVDKRRTISTARAGLDA----LVKGLGGA------APESAEELLELNNAARLTVDEY 403
Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
+ R+++ ++ ED T+AV + K + E L + + Y++ ++ + L
Sbjct: 404 PAAREALESAGQRNVEDRTRAVDEFKAADQELSSLSKGSSNIEYRLLQVRENLCQDL 460
Score = 31.5 bits (71), Expect = 5.5
Identities = 43/242 (17%), Positives = 82/242 (33%), Gaps = 30/242 (12%)
Query: 295 QHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLE 354
+ E ++R HTL L + + + A + A +L + + R+ +E E
Sbjct: 249 EQVEDIKRQIHTLDPLVQLKNRREKAQQSKDHANALKKALPTVGN-------RIKKEEQE 301
Query: 355 LRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLE 414
A +K E+ K E R ++ V ++
Sbjct: 302 TL--------------VRQFTVEQTQAKSKVESAKIE----TDRAREMETLAHDNVKQIV 343
Query: 415 LAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ 474
AQ + K+ E ++ T R GL + E E A
Sbjct: 344 GAQHGILSAKR--EGAVDKRRTISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVD 401
Query: 475 QHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQ 534
++ A + + +A + +D R +E AA QE S + ++ + ++ E+L Q
Sbjct: 402 EYPAAREALESA---GQRNVEDRTRAVDEFKAADQELSSLSKGSSNIEYRLLQVRENLCQ 458
Query: 535 AV 536
+
Sbjct: 459 DL 460
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 33.1 bits (76), Expect = 0.98
Identities = 36/203 (17%), Positives = 75/203 (36%), Gaps = 17/203 (8%)
Query: 468 ALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEK 527
+ ++ + ED K + A+ +L + + A+ + + + EK
Sbjct: 14 NINELLDKAEDPEKMLEQAI---RDMESELAKARQALAQAIARQKQLERKLEEAQARAEK 70
Query: 528 LSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
L E A++ E +E +E+K++L + L+ E+ A +K K+ L ++
Sbjct: 71 LEEKAELALQAGN--EDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQK 128
Query: 588 FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEA 647
E A KE +A A + ++ +++M F + +E+++ EA
Sbjct: 129 IAELRAKKE--ALKARKAAAKAQEKVNRSLGGGS-----SSSAMAAFER-MEEKIEEREA 180
Query: 648 N---QEIDRLRKHTDKLQAELQD 667
D L E
Sbjct: 181 RAEAAAELAEGSGDD-LDKEFAQ 202
>gnl|CDD|169691 PRK09173, PRK09173, F0F1 ATP synthase subunit B; Validated.
Length = 159
Score = 32.4 bits (74), Expect = 1.0
Identities = 22/73 (30%), Positives = 36/73 (49%), Gaps = 7/73 (9%)
Query: 456 DIALRERNEYQEALAKVQQQHEDA---VKEINTAMAVRMKA-TKDLKRLTEE---RNAAM 508
A R R E Q+ LA+ Q++ ++A +I A +A T + KR TEE R +
Sbjct: 43 AEARRLREEAQQLLAEYQRKRKEAEKEAADIVAAAEREAEALTAEAKRKTEEYVARRNKL 102
Query: 509 QEYSLIMSERDSV 521
E + +E D++
Sbjct: 103 AEQKIAQAETDAI 115
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 33.7 bits (78), Expect = 1.0
Identities = 26/130 (20%), Positives = 49/130 (37%), Gaps = 34/130 (26%)
Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDK-----LELAQEEVARIKKQCEDMNQEKNTALRE 440
++ + DAL K+ A DK LE + +I +Q E+ Q
Sbjct: 39 ADVQAQLDALNKQ------KLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQ-------- 84
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
LKQQ A + LR+ Q L ++ +++ +E + +++R L+
Sbjct: 85 ---LKQQLAQAPAK----LRQ---AQAELEALKDDNDEETRETLSTLSLR-----QLESR 129
Query: 501 TEERNAAMQE 510
+ +Q
Sbjct: 130 LAQTLDQLQN 139
Score = 32.2 bits (74), Expect = 3.5
Identities = 24/117 (20%), Positives = 52/117 (44%), Gaps = 6/117 (5%)
Query: 370 SADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED 429
+ + Q + ALNK + ++ E +++ DL T + +DK++ +EE ++K+Q
Sbjct: 37 TEADV-QAQLDALNKQKLLEAEDKLVQQ---DLEQTLAL-LDKIDRQKEETEQLKQQLAQ 91
Query: 430 MNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTA 486
+ A E LK R+ ++ + + LA+ Q ++A ++
Sbjct: 92 APAKLRQAQAELEALKDDNDEETRE-TLSTLSLRQLESRLAQTLDQLQNAQNDLAEY 147
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 33.4 bits (76), Expect = 1.1
Identities = 65/280 (23%), Positives = 117/280 (41%), Gaps = 63/280 (22%)
Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSA 381
A+L+ A Q S LR++ S L S ++ L +L +L+
Sbjct: 420 AELQQARQNESELRNQISLLTSLERSLKSDLGQLKK------------------------ 455
Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED-MNQEKNTALRE 440
E D L+ + + +++ + DK + E R+K + + +N EK A E
Sbjct: 456 ---------ENDMLQTKLNSMVS--AKQKDKQSMQSME-KRLKSEADSRVNAEKQLA-EE 502
Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
+ K++ A R A R E E+L + +Q E +K++ +K ++ R+
Sbjct: 503 KKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEH----DLKLKEEECRM 558
Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKI--KTLEMENKELVED--KKTL 556
E+ +++Y +E EK +E L A++ + K L +EN E K L
Sbjct: 559 LEKEAQELRKY-----------QESEKETEVLMSALQAMQDKNLMLENSLSAETRLKLDL 607
Query: 557 SYQIETLKR--EIAS-ALHDRDKALKECNDLRERFGEFTA 593
+ +KR EIA A++ R+ E DL+ + + A
Sbjct: 608 FSALGDVKRQLEIAHGAIYKRE---AEIVDLKSKIADILA 644
Score = 33.4 bits (76), Expect = 1.4
Identities = 35/167 (20%), Positives = 68/167 (40%), Gaps = 13/167 (7%)
Query: 496 DLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKT 555
D+K+L E A Q S + ++ + L DL Q ++ L+ + +V K+
Sbjct: 414 DIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQK 473
Query: 556 LSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGE 615
++++++ + S R A K+ + ++R K+E + A A + A E
Sbjct: 474 DKQSMQSMEKRLKSEADSRVNAEKQLAEEKKR-------KKEEEETAARAAAQAAASREE 526
Query: 616 RDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQ 662
S K+ + DL + +K D +E L K +L+
Sbjct: 527 CAESLKQAKQ------DLEMEIKKLEHDLKLKEEECRMLEKEAQELR 567
>gnl|CDD|227406 COG5074, COG5074, t-SNARE complex subunit, syntaxin [Intracellular
trafficking and secretion].
Length = 280
Score = 32.9 bits (75), Expect = 1.3
Identities = 49/288 (17%), Positives = 98/288 (34%), Gaps = 82/288 (28%)
Query: 297 NETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELR 356
N+ + + + +++ H+ + ++ +S LR + S L R L
Sbjct: 31 NKNLSVYEKEINQID---NLHKDLLTEVFEE--QSRKLRRSLDNFSSQTTDLQRNL---- 81
Query: 357 SGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELA 416
D SA+ H + + + +R++F LI +D
Sbjct: 82 --------KKDIKSAERDGIHL-------ANKQAQAENVRQKFLKLIQ-DYRIIDS-NYR 124
Query: 417 QEEVARIKKQCEDMN-----QEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
+EE + ++Q E A+ + NG + A + R E + ALA+
Sbjct: 125 EEEKEQARRQYIIAQPEATEDEVEAAINDVNGQQVFSQALLNA-----NRRGEAKTALAE 179
Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
VQ +H +++K++ EK +
Sbjct: 180 VQARH------------------QEIKKI-------------------------EKTMAE 196
Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALK 579
LTQ ++ L +E +E V+ + +E + + + DKA+K
Sbjct: 197 LTQLFNDMEELVIEQQENVD---VIDKNVEDAQENVEQGVGHTDKAVK 241
>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related. This family
represents a region within kinesin-related proteins from
higher plants. Many family members also contain the
pfam00225 domain. Kinesins are ATP-driven microtubule
motor proteins that produce directed force. Some family
members are associated with the phragmoplast, a
structure composed mainly of microtubules that executes
cytokinesis in higher plants.
Length = 488
Score = 33.3 bits (76), Expect = 1.3
Identities = 40/213 (18%), Positives = 89/213 (41%), Gaps = 34/213 (15%)
Query: 340 DLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRF 399
+L +++ ++ +EL S + + L A+ + I ++Y L++++
Sbjct: 303 ELEANRSLAEKLEMELES---------EKKCTEELKDALQRAMQGHARIIEQYAELQEKY 353
Query: 400 DDLINTHSTAVDKLELAQEEVARI-KKQCEDMNQEKNTALRERNGLKQQCTA--AIRQWD 456
+DL+ H ++ +E ++ A+ K R N L + +A A R+
Sbjct: 354 NDLLEKHRRIMEGIEDVKKAAAKAGVKGAG---------SRFANALAAELSALRAERE-- 402
Query: 457 IALRERNEYQEALAKVQQQHEDAVKEINTA--MAVRMKATKDLKRLTEERNAAMQEYSLI 514
+ER ++ +Q Q D + + A + VR++ ++ L +ER ++
Sbjct: 403 ---KERELLKKENKSLQIQLRDTAEAVQAAGELLVRLREAEEAATLAQERAMLAEQ---- 455
Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENK 547
E + + K++EKL + +K E+K
Sbjct: 456 --ENEKLKKQIEKLKRKHKMEIVTMKQYLAESK 486
Score = 33.3 bits (76), Expect = 1.3
Identities = 39/232 (16%), Positives = 74/232 (31%), Gaps = 45/232 (19%)
Query: 344 DKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLI 403
+ + + LL C + E DA + + + + + K L
Sbjct: 59 ESEARESTLLCSS---CKKKEDTDAVVQEISDCINLQLVPVDNSEKAVEKVL-------- 107
Query: 404 NTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQ------CTAAIRQW-- 455
R + + E++ E+ + + N L QQ C A I Q
Sbjct: 108 --------------AGAIRREMELEELCVEQAAKIEQLNRLVQQYKHERECNALIDQTKE 153
Query: 456 ----------DIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN 505
D L E + +H+ +K +K T +LKR+ EE
Sbjct: 154 DKIARLESLMDGVLPTEEFIDEEFLSLMNEHK-LLKSDYENHPEVLKETIELKRIQEELE 212
Query: 506 AAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLS 557
+ + + M E++ + +E++ L L + K+ L+ L
Sbjct: 213 SF-KNFYFDMGEKEVLLEEIQDLKSQLQCMLSSSKSTARTRSSLLARSFQLR 263
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 33.4 bits (77), Expect = 1.4
Identities = 12/102 (11%), Positives = 37/102 (36%), Gaps = 13/102 (12%)
Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERN 442
N ++ E L+++ + + + E Q+E+ ++ ++ +++ +
Sbjct: 142 NLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLE 201
Query: 443 GLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEIN 484
L+++ QE K ++ + A K +
Sbjct: 202 QLQEKAAETS-------------QERKQKRKEITDQAAKRLE 230
Score = 32.6 bits (75), Expect = 2.2
Identities = 30/159 (18%), Positives = 54/159 (33%), Gaps = 51/159 (32%)
Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
+ L Q+EV +K+Q E +EK Q + + AL
Sbjct: 142 NLLHALQQEVLTLKQQLELQAREK------------------AQSQALAEAQQQELVALE 183
Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
+ + E+ +E+ L++L E+ QE +K E
Sbjct: 184 GLAAELEEKQQELEA----------QLEQLQEKAAETSQE-------------RKQKRKE 220
Query: 531 DLTQAVRKIKTLEMENKELVE----------DKKTLSYQ 559
QA ++++ E E + L++ D KTL +
Sbjct: 221 ITDQAAKRLELSEEETRILIDQQLRKAGWEADSKTLRFS 259
>gnl|CDD|225387 COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular
trafficking and secretion].
Length = 554
Score = 33.1 bits (76), Expect = 1.5
Identities = 24/119 (20%), Positives = 35/119 (29%), Gaps = 16/119 (13%)
Query: 1315 AQYSVERYNEVKDKPGDS----FYMRALFSRTGDLGDALQLRF-HKDDILYVDNTMFNGV 1369
+ +E +V D G S + R +L L H+ Y+D+T
Sbjct: 296 YRQVLEGPFDVLDYQGKSQNLSLRLSHPLLRNRSSKLSLGLGLSHRRSENYLDDTEIEVQ 355
Query: 1370 PGHWRAWLVDADGFRQQCGIIPSKYKVEEELLLRRSLGDL------ESDARRATRRSFF 1422
A + G R + + L RR LG L E T R
Sbjct: 356 RRRLTAVELGLSGRRY---LGGGTL--DASLGYRRGLGALGASKDDEETNGEPTSRFKK 409
>gnl|CDD|232932 TIGR00345, GET3_arsA_TRC40, transport-energizing ATPase,
TRC40/GET3/ArsA family. Members of this family are
ATPases that energize transport, although with different
partner proteins for different functions. Recent
findings show that TRC40 (GET3 in yeast) in involved in
the insertion of tail-anchored membrane proteins in
eukaryotes. A similar function is expected for members
of this family in archaea. However, the earliest
discovery of a function for this protein family is ArsA,
an arsenic resistance protein that partners with ArsB
(see pfam02040) for As(III) efflux [Hypothetical
proteins, Conserved].
Length = 284
Score = 32.4 bits (74), Expect = 1.5
Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 3/65 (4%)
Query: 324 LEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALN 383
+AA +E R+K + I L + G P D+ + D +H A N
Sbjct: 55 PQAALEE---YRAKLVEQIKGNLPDGDMLGDQLEGAALSPGIDEIAAFDEFLKHMTDAEN 111
Query: 384 KYETI 388
+++ +
Sbjct: 112 EFDVV 116
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 33.0 bits (76), Expect = 1.6
Identities = 35/157 (22%), Positives = 71/157 (45%), Gaps = 17/157 (10%)
Query: 408 TAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRER----- 462
A +E A++E +KK E + + K + R L+++ R+ ++ ER
Sbjct: 33 LAKRIIEEAKKEAETLKK--EALLEAKEEVHKLRAELERE--LKERRNELQRLERRLLQR 88
Query: 463 ----NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE- 517
+ E+L K ++ E KE++ + ++L+ L E+ ++ S + E
Sbjct: 89 EETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEE 148
Query: 518 -RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
++ + +E+E E +A + IK +E E KE + K
Sbjct: 149 AKEILLEEVE--EEARHEAAKLIKEIEEEAKEEADKK 183
>gnl|CDD|216263 pfam01044, Vinculin, Vinculin family.
Length = 850
Score = 33.2 bits (76), Expect = 1.7
Identities = 46/252 (18%), Positives = 83/252 (32%), Gaps = 49/252 (19%)
Query: 478 DAVKEINTAMAVRMKA---TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQ 534
D +K++ + MK ++ E R+ ++ + M ++
Sbjct: 192 DTLKKLAPILITAMKIFVRHPGVEEARENRDYTAKQ----------MSDAMNEIIR---- 237
Query: 535 AVRKIKTLEMENKELVEDKKTLSYQIETLKREI----ASALHDRDKALKECNDLRERFGE 590
V ++ + + K L L+ ++ L I A+ + +R + E G
Sbjct: 238 -VLQLTSYTVMKKALSGIGSELAAALDWLDDPIIIDPATYIEERCRPSLEERIESIISGA 296
Query: 591 FTAAKEECQREAFHARFEYAMRMGERDSSRKERNE-VEAASMDL---FGKCQKERMDNLE 646
A C R + E N A DL + K + D
Sbjct: 297 ALMADSLCTRRN------------RGQAIVAECNGAGRQALQDLLSEYMKLAGRKDDLDA 344
Query: 647 ANQEIDRLRKHTDKLQAELQDQSVN-----YLFILTESIQEAEVSKRRRDWAFSERDKIE 701
IDR+RK T L+ +L+ V+ +L T E +K R+ F E+ +
Sbjct: 345 LCDAIDRMRKKTRDLRRQLRKAVVDHVSDSFLDTTTPLKLLIEAAKAGREENFEEKAQN- 403
Query: 702 VIDSYHTKHAGR 713
+HA R
Sbjct: 404 -----FQEHANR 410
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 33.1 bits (76), Expect = 1.7
Identities = 54/293 (18%), Positives = 94/293 (32%), Gaps = 58/293 (19%)
Query: 315 GQHQAAMAQLEAAAQ-ESSSLRSKYSDLIS--DKQRLDRELLELRSGCCTQP-------- 363
G A+A LE AAQ + R+ ++S + D+ L + QP
Sbjct: 411 GDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNL 470
Query: 364 -------ESDDAGSADSLNQ-------HYMSALN--KYETIKDEYDALRKRFDDLINTHS 407
+ D A + ++ + + +A N + + + D +RF+ ++
Sbjct: 471 LGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDP 530
Query: 408 TAVD------KLELAQ----EEVARIKKQCEDMNQEKNTAL------RERNGLKQQCTAA 451
+ L L E VA ++K E QE AL + LK+
Sbjct: 531 KNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAIL 590
Query: 452 IRQWDIALRERNEYQEALAKVQQQ---HEDAVKEINTAMAVRMKATKDLKRLTEERNAAM 508
D A + E L + Q AV +A++ + L L A
Sbjct: 591 NEAADAA-PDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLL-----ADA 644
Query: 509 QEYSLIMSERDSVHKEMEK---LSEDLTQAVRKIKTLEMENKELVEDKKTLSY 558
+M +++ L D T+A + L + K KK
Sbjct: 645 YA---VMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKS 694
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 32.8 bits (75), Expect = 2.0
Identities = 38/253 (15%), Positives = 82/253 (32%), Gaps = 28/253 (11%)
Query: 389 KDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQC 448
+ K F+ L + + EE + ED+ + + + L
Sbjct: 729 AKLFGRFLKAFNALKK-----LYEFAYTLEEEIQYL---EDLLKARLEKDYQSKYLD--- 777
Query: 449 TAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR-----MKATKDLKRLTEE 503
+ RE + E+ A + ++ I + T + K E
Sbjct: 778 ---LIDNLNDAREIKDK-ESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAYE 833
Query: 504 RNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
+ E + D K+ KL E L +A+ + + +++ E +E+ L+ + E
Sbjct: 834 SLKELIERIIKEWIEDLRQKK--KLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEF 891
Query: 564 KREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKER 623
K+ + ++ L +DL G+ E + + E++ +
Sbjct: 892 KQFAEEEGLNEEE-LAFYDDLALNGGKLPENGTELVE-----KLAKEKSLREKNKDDWKA 945
Query: 624 NEVEAASMDLFGK 636
E A + F +
Sbjct: 946 KEEVEAKLKAFFR 958
>gnl|CDD|227398 COG5066, SCS2, VAMP-associated protein involved in inositol
metabolism [Intracellular trafficking and secretion].
Length = 242
Score = 31.9 bits (72), Expect = 2.2
Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 8/76 (10%)
Query: 1053 TFPRKKENQR-----FRIPSNPSVAFKTSGTKMSNGSIEKTSERGSPMPTFHVEVLSPGL 1107
TF N+ + S V FK T + + PM T VEV+ GL
Sbjct: 10 TFYVPLTNKSKEMFSVQNNSPEPVGFKVKTTAPKDYCVRPNMGLIEPMSTVEVEVILQGL 69
Query: 1108 RQNKRNSLPDYCCSHK 1123
PD+ C K
Sbjct: 70 ---TEEPAPDFKCRDK 82
>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
Fer (Fes related) tyrosine kinases. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Fes (feline sarcoma), also called
Fps (Fujinami poultry sarcoma), and Fer (Fes related)
are cytoplasmic (or nonreceptor) tyrosine kinases that
play roles in haematopoiesis, inflammation and immunity,
growth factor signaling, cytoskeletal regulation, cell
migration and adhesion, and the regulation of cell-cell
interactions. Although Fes and Fer show redundancy in
their biological functions, they show differences in
their expression patterns. Fer is ubiquitously expressed
while Fes is expressed predominantly in myeloid and
endothelial cells. Fes and Fer contain an N-terminal
F-BAR domain, an SH2 domain, and a C-terminal catalytic
kinase domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules. The
F-BAR domain of Fes is critical in its role in
microtubule nucleation and bundling.
Length = 237
Score = 32.0 bits (73), Expect = 2.3
Identities = 23/117 (19%), Positives = 45/117 (38%), Gaps = 11/117 (9%)
Query: 370 SADSLNQHYMSALNKY----ETIKDEYDALRKRFDDLINTHSTAVDKLELA----QEEVA 421
A++L + L K Y R++ D+ + V+KL+ E+
Sbjct: 84 HAEALESGTLDKLTLLIKDKRKAKKAYQEERQQIDEQYKKLTDEVEKLKSEYQKLLEDYK 143
Query: 422 RIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHED 478
K + E+ + R+ + + + A R+ + N+Y AL + Q+ ED
Sbjct: 144 AAKSKFEEAVVKGGRGGRKLDKARDKYQKACRKLHLC---HNDYVLALLEAQEHEED 197
>gnl|CDD|212969 cd12036, SH3_MPP5, Src Homology 3 domain of Membrane Protein,
Palmitoylated 5 (or MAGUK p55 subfamily member 5). MPP5,
also called PALS1 (Protein associated with Lin7) or Nagie
oko protein in zebrafish or Stardust in Drosophila, is a
scaffolding protein which associates with Crumbs homolog
1 (CRB1), CRB2, or CRB3 through its PDZ domain and with
PALS1-associated tight junction protein (PATJ) or
multi-PDZ domain protein 1 (MUPP1) through its L27
domain. The resulting tri-protein complexes are core
proteins of the Crumb complex, which localizes at tight
junctions or subapical regions, and is involved in the
maintenance of apical-basal polarity in epithelial cells
and the morphogenesis and function of photoreceptor
cells. MPP5 is critical for the proper stratification of
the retina and is also expressed in T lymphocytes where
it is important for TCR-mediated activation of NFkB.
Drosophila Stardust exists in several isoforms, some of
which show opposing functions in photoreceptor cells,
which suggests that the relative ratio of different
Crumbs complexes regulates photoreceptor homeostasis.
MPP5 contains two L27 domains followed by the core of
three domains characteristic of MAGUK
(membrane-associated guanylate kinase) proteins: PDZ,
SH3, and guanylate kinase (GuK). In addition, it also
contains the Hook (Protein 4.1 Binding) motif in between
the SH3 and GuK domains. The GuK domain in MAGUK proteins
is enzymatically inactive; instead, the domain mediates
protein-protein interactions and associates
intramolecularly with the SH3 domain. SH3 domains are
protein interaction domains that bind to proline-rich
ligands with moderate affinity and selectivity,
preferentially to PxxP motifs. They play versatile and
diverse roles in the cell including the regulation of
enzymes, changing the subcellular localization of
signaling pathway components, and mediating the formation
of multiprotein complex assemblies.
Length = 63
Score = 29.3 bits (66), Expect = 2.3
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 1349 LQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
L L F K DIL+V + P W+A+ + + G+IPSK
Sbjct: 21 LGLSFQKGDILHVISQ---EDPNWWQAYREGEEDNQSLAGLIPSK 62
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 32.0 bits (73), Expect = 2.4
Identities = 49/260 (18%), Positives = 95/260 (36%), Gaps = 31/260 (11%)
Query: 371 ADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDM 430
A L+ + +K ++ E ++ + L N K++ Q+E+ + K + +
Sbjct: 26 AALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKK- 84
Query: 431 NQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR 490
Q++ L+E + ER E + A+ Q + A I+ + +
Sbjct: 85 LQKEIAELKEN-----------------IVERQELLKKRARAMQVNGTATSYID--VILN 125
Query: 491 MKATKDL-KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKEL 549
K+ DL R+T + ++ + ++ + L E K++TL EL
Sbjct: 126 SKSFSDLISRVTAISVIVDADKKIL----EQQKEDKKSLEEKQAALEDKLETLVALQNEL 181
Query: 550 VEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEY 609
+L+ Q IA+ AL E L E+ A E EA A+ E
Sbjct: 182 ETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEEQ----KALAEAAAAEA--AKQEA 235
Query: 610 AMRMGERDSSRKERNEVEAA 629
A + ++ + + A
Sbjct: 236 AAKAAAQEQAALQAAATAAQ 255
>gnl|CDD|191863 pfam07818, HCNGP, HCNGP-like protein. This family comprises
sequences bearing significant similarity to the mouse
transcriptional regulator protein HCNGP. This protein is
localised to the nucleus and is thought to be involved
in the regulation of beta-2-microglobulin genes.
Length = 96
Score = 30.0 bits (68), Expect = 2.4
Identities = 17/92 (18%), Positives = 35/92 (38%), Gaps = 12/92 (13%)
Query: 100 NIVKGSVTDGKLKKKEEVARIKKQCEDMNQ--EKNTALR---------ERNGLKQQCTA- 147
KG + K E ++K+Q NQ + + R + + + T
Sbjct: 3 PEPKGECSPELNAKIEHFLKLKRQGTHFNQKLQNSKEFRNPSILEKLMDFFDIDEYGTNY 62
Query: 148 AIRQWDIALRERNEYQEALAKVQQQHEDAVKE 179
+D + +Y E LAK Q++ ++ ++
Sbjct: 63 PKDIFDPHVFPSWDYYEELAKEQKEEQEKREK 94
>gnl|CDD|220484 pfam09943, DUF2175, Uncharacterized protein conserved in archaea
(DUF2175). This domain, found in various hypothetical
archaeal proteins, has no known function.
Length = 101
Score = 30.1 bits (68), Expect = 2.4
Identities = 22/92 (23%), Positives = 38/92 (41%), Gaps = 6/92 (6%)
Query: 1520 DNLFVDYRKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQHVYPIVLLIKFK 1579
LF K ++EC V+DA K L + RL IVL + +
Sbjct: 15 GELFTFTSKGPVHYECL----VEDASKKIKGDK--PEDLVALLRLEDYLHEGIVLAKELE 68
Query: 1580 STKQIKEIKEIRDGRYSSLDKVTAKAAKEMYE 1611
+ +EIK++ L+K+ AK +++ +
Sbjct: 69 QVAESEEIKKVIRQVRKDLEKLAAKLTRKIEK 100
>gnl|CDD|185252 PRK15354, PRK15354, type III secretion system protein SsaK;
Provisional.
Length = 224
Score = 31.6 bits (71), Expect = 2.5
Identities = 23/84 (27%), Positives = 37/84 (44%), Gaps = 22/84 (26%)
Query: 594 AKEECQREAFHARFEYAMRMGE---RDSSRKERNEVEAASMDLFGKCQKERMDNLEANQE 650
A+E+ + HA A R E RD+ R +R + +E QE
Sbjct: 34 AQEQQAKRVSHAIVSSAYRKAEKIIRDAYRYQREQ------------------KVEQQQE 75
Query: 651 IDRLRKHT-DKLQAELQDQSVNYL 673
+ LRK+T +K++ E +Q V +L
Sbjct: 76 LACLRKNTLEKMEVEWLEQHVKHL 99
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 30.7 bits (70), Expect = 2.5
Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 5/77 (6%)
Query: 462 RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY--SLIMSERD 519
+ LA+ +QQ A E + + A K+ ++L EE A Q+ L+ S R
Sbjct: 46 LKQAAALLAEAEQQLAQARAEAS---EIINNAKKEAQKLKEEILAEAQKDAERLLESARA 102
Query: 520 SVHKEMEKLSEDLTQAV 536
+ +E E+ +L Q V
Sbjct: 103 EIEQEKEQALAELRQQV 119
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 32.1 bits (73), Expect = 2.6
Identities = 42/180 (23%), Positives = 72/180 (40%), Gaps = 31/180 (17%)
Query: 516 SERDSVHKEMEKLSEDLTQAVR---------KIKTLEMENKELVEDKKTLSYQIETLKRE 566
SE +S + E + + K+K LE EN+ L + L + T + +
Sbjct: 138 SEDESSESTPLRPQESSSSSHGCFQLEALQEKLKLLEEENEHLRSEASHLKTETVTYEEK 197
Query: 567 IASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEV 626
++D K L+E ND E A K E + + ++E +
Sbjct: 198 EQQLVNDCVKQLREANDQIASLSEELAKKTE-----------------DLERQQEEITHL 240
Query: 627 EAASMDLFGKCQKERMDNLEANQ----EIDRLRKHTDKLQAELQDQSVNYLFILTESIQE 682
+ +DL KC+ ++N E Q D R+ +LQ ELQD+ + +L E+ +E
Sbjct: 241 LSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQ-ELQDKYAECMEMLHEAQEE 299
>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
Length = 616
Score = 32.3 bits (73), Expect = 2.7
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 10/65 (15%)
Query: 615 ERDSSRKER-NEVEAA----SMDLFG---KCQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
++D S +R N+ + A D FG + KE+MDNLE + D LRKH +L +L
Sbjct: 544 DKDKSDDDRLNDADKAMRKLEADGFGDRYRKIKEKMDNLEDD--YDDLRKHAIELPKKLD 601
Query: 667 DQSVN 671
+QS N
Sbjct: 602 NQSGN 606
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 32.0 bits (73), Expect = 2.7
Identities = 52/303 (17%), Positives = 95/303 (31%), Gaps = 68/303 (22%)
Query: 318 QAAMAQLEAAAQESSSLRSKYSDL--ISDKQRLDRELLELRSGCCTQPESDDAGSADSLN 375
Q A L AAQ ++ S DL + K RLD R + E +A+
Sbjct: 196 QKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPP 255
Query: 376 QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKN 435
+ E D ++R +L E QE++ ++ E++ E
Sbjct: 256 A----------EMDTEEDRTKEREAEL-----------EALQEQIDELESSIEEVLSE-- 292
Query: 436 TALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV--------------K 481
+ + I+Q + L + E L + + + V
Sbjct: 293 ---------IKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQA 343
Query: 482 EINTAM-----------AVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
+ + A R + + L E+ E + E + ++E+L
Sbjct: 344 LVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELES 403
Query: 531 DLTQAVRKIKTLEMENKELVEDKKTLSY---------QIETLKREIASALHDRDKALKEC 581
+L + K L E + + +Y I+ K +I L D KE
Sbjct: 404 ELQTKEQLYKQLLDEYENAPKSVSRSAYTRRILEIIKNIKKQKEDIDKILSDTRSLQKEI 463
Query: 582 NDL 584
N++
Sbjct: 464 NNI 466
Score = 32.0 bits (73), Expect = 2.9
Identities = 29/177 (16%), Positives = 63/177 (35%), Gaps = 37/177 (20%)
Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
E++T + +L+ L E+ + ++SE ++ ++++++E+LT ++ +
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315
Query: 541 TLEMENK------ELVED-----------------------------KKTLSYQIETLKR 565
LE E K EL+ D + L + +LK
Sbjct: 316 ELEEEYKIKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375
Query: 566 EIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKE 622
+ + + + L E LR + E E +E + + + SR
Sbjct: 376 KNRNKEDETQRQLDEIKKLRNKIEE--LESELQTKEQLYKQLLDEYENAPKSVSRSA 430
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.8 bits (72), Expect = 2.8
Identities = 27/114 (23%), Positives = 45/114 (39%), Gaps = 24/114 (21%)
Query: 490 RMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKEL 549
R+ DLK EE ++E + E++ + KE+E+L + + ++K LE+EN L
Sbjct: 126 RVPEYMDLKEDYEELKEKLEE---LQKEKEELLKELEELEAEYEEVQERLKRLEVENSRL 182
Query: 550 VEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAF 603
E K E DL++R+ E E + E
Sbjct: 183 EEM---------------------LKKLPGEVYDLKKRWDELEPGVELPEEELI 215
>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
Length = 473
Score = 32.0 bits (73), Expect = 2.9
Identities = 36/149 (24%), Positives = 61/149 (40%), Gaps = 36/149 (24%)
Query: 1152 GVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRS------------ATYQLAASVLRQCG 1199
G FVS V +S A++ G++ GD + + G + S +L +LR G
Sbjct: 312 GAFVSQVLPNSSAAKAGIKAGDVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRD-G 370
Query: 1200 NSITMLVQYSPDKYHELEGSGSSSAENESVSGRGSGEPRFLMIETRKCSNLGISLVGGNA 1259
+ + V EL+ S + ++ ++ F IE + SN G
Sbjct: 371 KPVNVNV--------ELQQSSQNQVDSSNI---------FNGIEGAELSN------KGGD 407
Query: 1260 VGIYVHSVQSGSLGYSAGLRTGDRILEYN 1288
G+ V +V+ G+ GL+ GD I+ N
Sbjct: 408 KGVVVDNVKPGTPAAQIGLKKGDVIIGAN 436
Score = 31.7 bits (72), Expect = 3.6
Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 3/58 (5%)
Query: 192 GLQLNNSYDHGLTLES--GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAR 247
G +LN+ + +++ G ++ ++ P S AAK G A GD + S+N + + ++ R
Sbjct: 294 GTELNSELAKAMKVDAQRGAFVSQVLPNSSAAKAGIKA-GDVITSLNGKPISSFAALR 350
>gnl|CDD|148810 pfam07412, Geminin, Geminin. This family contains the eukaryotic
protein geminin (approximately 200 residues long).
Geminin inhibits DNA replication by preventing the
incorporation of MCM complex into prereplication
complex, and is degraded during the mitotic phase of the
cell cycle. It has been proposed that geminin inhibits
DNA replication during S, G2, and M phases and that
geminin destruction at the metaphase-anaphase transition
permits replication in the succeeding cell cycle.
Length = 200
Score = 31.4 bits (71), Expect = 3.0
Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 10/68 (14%)
Query: 498 KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLS 557
K + EER A+ E +E EKL +++ Q +I L+ EN+EL E + +
Sbjct: 110 KEVAEERRKALYE----------ALQENEKLHKEIEQKDEEIARLKKENEELAELAEHVQ 159
Query: 558 YQIETLKR 565
Y E ++R
Sbjct: 160 YMAEMIER 167
>gnl|CDD|221998 pfam13234, rRNA_proc-arch, rRNA-processing arch domain. Mtr4 is
the essential RNA helicase, and is an exosome-activating
cofactor. This arch domain is carried in Mtr4 and Ski2
(the cytosolic homologue of Mtr4). The arch domain is
required for proper 5.8S rRNA processing, and appears to
function independently of canonical helicase activity.
Length = 266
Score = 31.8 bits (73), Expect = 3.0
Identities = 26/118 (22%), Positives = 51/118 (43%), Gaps = 34/118 (28%)
Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNA---AMQEYSLIMSER--------DSVHKEME 526
+ I+ +VR+ KDL+ E R + +++E + +R D + ++M+
Sbjct: 158 SCIDAIS---SVRLYLPKDLRSE-EARESVLKSLKE----VKKRFPDGIPLLDPI-EDMK 208
Query: 527 KLSEDLTQAVRKIKTLE--MENKELVED------------KKTLSYQIETLKREIASA 570
+ + +RKI+ LE + + L K L +I+ LK++I SA
Sbjct: 209 IKDDSFKKLLRKIEVLESRLLSNPLHNSPRLEELYDQYSKKVELEEEIKALKKKIKSA 266
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 32.1 bits (74), Expect = 3.0
Identities = 72/308 (23%), Positives = 124/308 (40%), Gaps = 58/308 (18%)
Query: 384 KYETIKDEYDALRKRFDDLINTHS--TAVDKLELAQEEVARIKKQCEDMNQEKN--TALR 439
+ E +KD+Y LRK L + S A+D+LE +E+ +Q ++ + + A
Sbjct: 137 EVEELKDKYRELRKTL--LAKSFSYGPALDELEKQLDELEEEFEQFVELTESGDYLEARE 194
Query: 440 ERNGLKQQCTAAIRQWDIA-----LRE-RNEYQEALAKVQQ------------QHEDAVK 481
L+++ T A+ Q + L+E +NE+ + L +++ H D K
Sbjct: 195 VLLKLEEE-TDALEQ-KMEEIPPLLKELQNEFPDQLEELKAGYREMTEEGYHFDHLDIEK 252
Query: 482 EINTAMAVRMKATKDLKRL----TEERNAAMQE-----YSLIMSE---RDSVHKEMEKLS 529
E+ + L+ L EE N ++E Y ++ E + V K ++KL+
Sbjct: 253 ELQDLKEQIDQNLALLEELDLDEAEEENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLT 312
Query: 530 EDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND-LRERF 588
+ L A + K L +E L + SY + + E L + + L+ D L ER
Sbjct: 313 DFLEHAREQNKQLLLELDRLQQ-----SYTLNEDELETVRELEKQLEELEAQYDQLVERI 367
Query: 589 GEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEAN 648
E A E Q E K+ E+E +L Q R D LEA
Sbjct: 368 AEKKVAYSELQ-----EELEEIE---------KQLEEIEKEQEELSESLQGLRKDELEAR 413
Query: 649 QEIDRLRK 656
+++ R+
Sbjct: 414 EKLQEYRQ 421
Score = 31.8 bits (73), Expect = 4.0
Identities = 60/335 (17%), Positives = 115/335 (34%), Gaps = 86/335 (25%)
Query: 391 EYDALRKRFDDLINTHSTAVDK-LELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
++ R+++DD++ +++ L A+E + + + K A++E L
Sbjct: 61 TFEEWRQKWDDIVTNSLPDIEELLFEAEELNDKFR-----FLKAKK-AIKEIEELLDTIE 114
Query: 450 AAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN--AA 507
I Q E L ++++ E KE+ + K L L + + A
Sbjct: 115 EDIEQ----------ILEELNELKESEEKNRKEVEELKDKYRELRKTL--LAKSFSYGPA 162
Query: 508 MQEYSLIMSERDSVHKEMEKLSE--DLTQAVRKIKTLEMENKELVED-----------KK 554
+ E + E + ++ +L+E D +A + LE E L + +
Sbjct: 163 LDELEKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEETDALEQKMEEIPPLLKELQN 222
Query: 555 TLSYQIETLK---------------REIASALHDRDKALKECNDLRERFGEFTAAKEECQ 599
Q+E LK +I L D + + + L E + A+EE +
Sbjct: 223 EFPDQLEELKAGYREMTEEGYHFDHLDIEKELQDLKEQIDQNLALLEEL-DLDEAEEENE 281
Query: 600 R-----EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRL 654
+ + E EV+A K ++ +D L ++
Sbjct: 282 EIEERIDTLYDILE---------------KEVKA------KKFVEKNIDKLT--DFLEHA 318
Query: 655 RKHTDKLQAELQ--DQSVNYLFILTESIQEAEVSK 687
R+ +L EL QS + L E E E +
Sbjct: 319 REQNKQLLLELDRLQQS----YTLNED--ELETVR 347
>gnl|CDD|237296 PRK13188, PRK13188, bifunctional UDP-3-O-[3-hydroxymyristoyl]
N-acetylglucosamine
deacetylase/(3R)-hydroxymyristoyl-[acyl-carrier-protein]
dehydratase; Reviewed.
Length = 464
Score = 31.8 bits (73), Expect = 3.1
Identities = 15/33 (45%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 1148 LDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCG 1180
L+ GV V TV EH LA+ GL + + L+E+ G
Sbjct: 67 LEKNGVKVHTV-EHVLAALYGLGIDNCLIELDG 98
>gnl|CDD|206365 pfam14197, Cep57_CLD_2, Centrosome localisation domain of PPC89.
The N-terminal region of the fission yeast spindle pole
body protein PPC89 has low similarity to the human Cep57
protein. The CLD or centrosome localisation domain of
Cep57 and PPC89 is found at the N-terminus. This region
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57
and PPC89.
Length = 69
Score = 28.8 bits (65), Expect = 3.4
Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 1/63 (1%)
Query: 524 EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND 583
E L L A RK+ E+E K L ++ + Q+ E L ++AL++ +
Sbjct: 6 ENLTLRNRLDSATRKVAVHEIELKRLNRERDSALRQLGVAYLECQE-LKAENEALRKELE 64
Query: 584 LRE 586
Sbjct: 65 ELR 67
>gnl|CDD|204632 pfam11365, DUF3166, Protein of unknown function (DUF3166). This
eukaryotic family of proteins has no known function.
Length = 97
Score = 29.8 bits (67), Expect = 3.4
Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 9/55 (16%)
Query: 323 QLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQH 377
QL+ +E++ +R K + L +K +++ EL + +S G DS +H
Sbjct: 9 QLQFVKEEAALMRKKLAKLEEEKDKVEGELQKYKSK---------YGDLDSGAEH 54
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 30.9 bits (70), Expect = 3.5
Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 8/125 (6%)
Query: 464 EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK 523
E ++LA Q+ E E+ + + +R E R A Q+ E + K
Sbjct: 36 ENLDSLASENQELE---VELELLREDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKK 92
Query: 524 EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD-KALKECN 582
E++K EDL + VR+ LE++ K L + L + LK+E ++LH+R K LK
Sbjct: 93 ELQKKIEDLEENVRQ---LELKAKNLSDQVSRLEERETELKKEY-NSLHERYTKLLKNYV 148
Query: 583 DLRER 587
+ ER
Sbjct: 149 EYVER 153
>gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional.
Length = 449
Score = 31.6 bits (72), Expect = 3.7
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 1267 VQSGSLGYSAGLRTGDRILEYNGTDLRA 1294
VQ S AGL+ GDRI++ +G L
Sbjct: 228 VQPNSAASKAGLQAGDRIVKVDGQPLTQ 255
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from
laminin A, B1 and B2 may come together to form a triple
helical coiled-coil structure.
Length = 263
Score = 31.2 bits (71), Expect = 3.7
Identities = 41/220 (18%), Positives = 73/220 (33%), Gaps = 36/220 (16%)
Query: 475 QHEDAVKEINTAMAVRM---------KATKDLKRLTEERNAAMQEYSLIMSERDSVHKEM 525
Q + + +N A+ T+ L +A ++ + + ++
Sbjct: 1 QLDSVLLSLNLTGALPAPYKILYNIENLTQQLSGRLSPISATKRQLEDLEKALSKLAQDA 60
Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLR 585
E L + QA+ K + + + + KTL+ I+ L+ I KE +
Sbjct: 61 ENLQKKANQALAKAQKVNAAIERTLGHAKTLAEAIKNLRDNI-----------KEILEKL 109
Query: 586 ERFGE--FTAAKEECQREAFHARFEYAMRMGERD-SSRKERNEVE--------AASMDLF 634
GE F R A+ + ERD + + E E F
Sbjct: 110 AGLGENAFALPGSALSRMLAEAQR-MLGEIRERDFQKQLQNAEAELKAAEELLNRVQTWF 168
Query: 635 GKCQKERMDNLEANQEIDRLRKHTDKLQA--ELQDQSVNY 672
Q+E EA + D L ++ KL EL D++
Sbjct: 169 QTPQEENEGLFEALR--DDLNEYEAKLSDLRELLDEAQAK 206
>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
of Class IA Phosphoinositide 3-kinase Regulatory
subunits. PI3Ks catalyze the transfer of the
gamma-phosphoryl group from ATP to the 3-hydroxyl of the
inositol ring of D-myo-phosphatidylinositol (PtdIns) or
its derivatives. They play an important role in a
variety of fundamental cellular processes, including
cell motility, the Ras pathway, vesicle trafficking and
secretion, immune cell activation, and apoptosis. They
are classified according to their substrate specificity,
regulation, and domain structure. Class IA PI3Ks are
heterodimers of a p110 catalytic (C) subunit and a
p85-related regulatory (R) subunit. The R subunit
down-regulates PI3K basal activity, stabilizes the C
subunit, and plays a role in the activation downstream
of tyrosine kinases. All R subunits contain two SH2
domains that flank an intervening helical domain (iSH2),
which binds to the N-terminal adaptor-binding domain
(ABD) of the catalytic subunit. In vertebrates, there
are three genes (PIK3R1, PIK3R2, and PIK3R3) that encode
for different Class IA PI3K R subunits.
Length = 152
Score = 30.7 bits (70), Expect = 3.7
Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 14/74 (18%)
Query: 503 ERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIET 562
E+ M+ L+ S + + E+L EDL + V + LE E + +
Sbjct: 73 EKQRLMENNELLKSRLKELEESKEQLEEDLRKQVAYNRELERE--------------MNS 118
Query: 563 LKREIASALHDRDK 576
LK E+ +D+
Sbjct: 119 LKPELMQLRKQKDQ 132
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 32.1 bits (73), Expect = 3.7
Identities = 99/568 (17%), Positives = 188/568 (33%), Gaps = 110/568 (19%)
Query: 130 EKNTALRERNGLKQQC------TAAIRQWDIALRERNEYQEALA-----KVQQQHEDAVK 178
EK AL ++ G+K A+ R +D NE L +Q++ + +K
Sbjct: 372 EKEEALFDKRGIKALSLFFIDKIASYRSYD------NELPAKLKGLFEKLYKQKYAEVLK 425
Query: 179 EINTAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGD-------- 230
+ +YL + HG G + S KEG A+ D
Sbjct: 426 KELDRSFYRDYLQSTDDDILKVHG-----GYF-------SKDNKEGDEAIEDEIDLILKD 473
Query: 231 --RVLSINN--RTVDGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSST 286
++LS + R + + REG D+P+V TI L+ + S S
Sbjct: 474 KEKLLSFDEPLRFIFSKWALREGW----DNPNVFTICKLRSSG-----------SEISKL 518
Query: 287 QT----MPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLI 342
Q + L V ++ E V + EL + + +A +E + +
Sbjct: 519 QEVGRGLRLAVNENGERVTKDFDFPNELTVLVNESEKDFV--KALQKEIN--DESFIKQG 574
Query: 343 SDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDL 402
K + L L+ PE+ + L + LN+ E + E D L
Sbjct: 575 FAKDEEELSKLILKERGKVLPETLEKLLFGDLIKDDYLDLNENEFAEKE-----PELDKL 629
Query: 403 INTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWD------ 456
+ A +K++ ++ + + + K ++ R ++ A W+
Sbjct: 630 KDELYEAFEKIKDLIKDRLIGADRIRNAKERKKEKIKIRKENFKEFKAL---WEKINHKA 686
Query: 457 ---IALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT------EERNA- 506
+ E + + ++ + ++I+ ++ + + E +A
Sbjct: 687 VLQYQIDSEALIDEIIKNILREFAVSAEQISYTTQKKVITMNNGLIIEEAGEYFEREDAI 746
Query: 507 --AMQEYSLIMSE--------RDSVHKEMEKLSED-LTQAVRKIKTLEMEN-KELVEDKK 554
+ +Y +S R S+HK +E + E+ L + K E K+L
Sbjct: 747 VWSSMKYHEFLSALANKTKLPRKSLHKVLEAIKENKLNE----FKKNPQEGLKQLSNIFL 802
Query: 555 TLSYQ-----IETLKREIASALHDRDKALKECNDLRERF-GEFTAAKEECQREAFHARFE 608
+ I R D E ++RE E KE A +E
Sbjct: 803 EIQLLNIVHKISYQMRLTTIKNSKTDAFYDEDGEIREELKAELGRKKENANSPAREYLYE 862
Query: 609 YAMRMGERDSSRKERNEVEAASMDLFGK 636
+ + ++ E+E + +FGK
Sbjct: 863 EIFYQVDSEIEKEFVEELENTGVIVFGK 890
>gnl|CDD|188670 cd08715, RGS_RGS1, Regulator of G protein signaling (RGS) domain
found in the RGS1 protein. The RGS (Regulator of
G-protein Signaling) domain is an essential part of the
RGS1 protein. RGS1 is a member of the R4/RGS subfamily
of the RGS family, a diverse group of multifunctional
proteins that regulate cellular signaling events
downstream of G-protein coupled receptors (GPCRs). RGS
proteins play critical regulatory roles as GTPase
activating proteins (GAPs) of the heterotrimeric
G-protein G-alpha subunits. The RGS domain controls
G-protein signaling by accelerating the GTPase activity
of the G-alpha subunit which leads to G protein
deactivation and promotes desensitization. As a major
G-protein regulator, RGS domain containing proteins are
involved in many crucial cellular processes such as
regulation of intracellular trafficking, glial
differentiation, embryonic axis formation, skeletal and
muscle development, and cell migration during early
embryogenesis. RGS 1 is expressed predominantly in
hematopoietic compartments, including T and B
lymphocytes, and may play a major role in
chemokine-mediated homing of lymphocytes to secondary
lymphoid organs. In addition, RGS1 interacts with
calmodulin and 14-3-3 protein outside of the GPCR
pathway.
Length = 114
Score = 29.9 bits (67), Expect = 3.8
Identities = 15/45 (33%), Positives = 19/45 (42%)
Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDS 520
DA K+IN R K +K T Q+ I+ ERDS
Sbjct: 57 QSDAAKQINIDFRTRESTAKKIKAPTPTCFDEAQKVIYILMERDS 101
>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Protein
kinase C and Casein kinase Substrate in Neurons (PACSIN)
proteins, also called Synaptic dynamin-associated
proteins (Syndapins), act as regulators of cytoskeletal
and membrane dynamics. They bind both dynamin and
Wiskott-Aldrich syndrome protein (WASP), and may provide
direct links between the actin cytoskeletal machinery
through WASP and dynamin-dependent endocytosis.
Vetebrates harbor three isoforms with distinct
expression patterns and specific functions. PACSINs
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 31.1 bits (71), Expect = 3.8
Identities = 26/105 (24%), Positives = 44/105 (41%), Gaps = 10/105 (9%)
Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSL----I 514
+E E ++ AK Q+ +K++ A A K K ++ N A + SL +
Sbjct: 111 FKETKEAEDGFAKAQKPWAKLLKKVEKAKKAYHAACKAEKSAQKQENNAKSDTSLSPDQV 170
Query: 515 MSERDSVHK---EMEKLSEDLTQAVRKIKTLE---MENKELVEDK 553
+D V K E+ K + +A+ + ME+ E V DK
Sbjct: 171 KKLQDKVEKCKQEVSKTKDKYEKALEDLNKYNPRYMEDMEQVFDK 215
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 30.2 bits (69), Expect = 3.9
Identities = 14/110 (12%), Positives = 39/110 (35%), Gaps = 9/110 (8%)
Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
+ + + + + K+ + + KL+ +++K +++
Sbjct: 10 LQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLK------EKLQKDAATLSEAAREKK 63
Query: 439 RER-NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAM 487
+ Q+ ++ L++R QE L K+ + A+KE+
Sbjct: 64 EKELQKKVQEFQRKQQKLQQDLQKR--QQEELQKILDKINKAIKEVAKKK 111
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 30.5 bits (69), Expect = 4.1
Identities = 24/121 (19%), Positives = 55/121 (45%), Gaps = 11/121 (9%)
Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINT-AMAVRMKATKDLKRLTEERNAAMQEYSL 513
W+ +R++ Y+EA+ ++Q ++ +E + + + ++LK A +E
Sbjct: 51 WNAYVRKQ--YEEAIELAKKQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENER 108
Query: 514 IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHD 573
+ ++ +S+ K E+L ++++ L + ED +TL I R++A D
Sbjct: 109 LKNQNESLQKRNEELE-------KELEKLRQRLSTIEEDYQTL-IDIMDRARKLAVVEDD 160
Query: 574 R 574
Sbjct: 161 E 161
>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B. SpoIVB, the
stage IV sporulation protein B of endospore-forming
bacteria such as Bacillus subtilis, is a serine
proteinase, expressed in the spore (rather than mother
cell) compartment, that participates in a proteolytic
activation cascade for Sigma-K. It appears to be
universal among endospore-forming bacteria and occurs
nowhere else [Cellular processes, Sporulation and
germination].
Length = 402
Score = 31.5 bits (72), Expect = 4.1
Identities = 15/54 (27%), Positives = 28/54 (51%), Gaps = 16/54 (29%)
Query: 1252 ISLV-GGNAVGIYVHSVQSGSLGYS---------------AGLRTGDRILEYNG 1289
I ++ GG ++G+ +++ +G+S AG++ GDRIL+ NG
Sbjct: 89 IKVIPGGQSIGVKLNTKGVLVVGFSDIETEKGKIHSPGEEAGIQIGDRILKING 142
>gnl|CDD|222066 pfam13353, Fer4_12, 4Fe-4S single cluster domain. This family
includes proteins containing domains which bind to
iron-sulfur clusters. Members include bacterial
ferredoxins, various dehydrogenases, and various
reductases. The structure of the domain is an
alpha-antiparallel beta sandwich.
Length = 140
Score = 30.2 bits (69), Expect = 4.3
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 57 QEWDTEVLDMNLTNLSPNEALGLELVGGRDDPFYP 91
+ + E+ D + +L GL L+GG +P P
Sbjct: 34 EPFTEELEDEIIEDLKDPYISGLTLLGG--EPLLP 66
>gnl|CDD|233816 TIGR02302, aProt_lowcomp, TIGR02302 family protein. Members of
this family are long (~850 residue) bacterial proteins
from the alpha Proteobacteria. Each has 2-3 predicted
transmembrane helices near the N-terminus and a long
C-terminal region that includes stretches of
Gln/Gly-rich low complexity sequence, predicted by TMHMM
to be outside the membrane. In Bradyrhizobium japonicum,
two tandem reading frames are together homologous the
single members found in other species; the cutoffs
scores are set low enough that the longer scores above
the trusted cutoff and the shorter above the noise
cutoff for this model.
Length = 851
Score = 31.5 bits (71), Expect = 4.4
Identities = 34/143 (23%), Positives = 60/143 (41%), Gaps = 33/143 (23%)
Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKD 496
ALR+ I D++ ER L Q +DA++ R + ++
Sbjct: 478 ALRDVADNLWSLALGIEDGDLSDAER-----RLRAAQDALKDALE--------RGASDEE 524
Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELV--EDKK 554
+K+LT++ AAMQ Y M +L++ L +++ N +++ +D +
Sbjct: 525 IKQLTDKLRAAMQTY-------------MRQLAQQLRNNPQQLARPLDPNTKVLRQQDLQ 571
Query: 555 TLSYQIETLKREIASALHDRDKA 577
+ QIE L R DRD+A
Sbjct: 572 NMMDQIENLARS-----GDRDQA 589
>gnl|CDD|234999 PRK02101, PRK02101, hypothetical protein; Validated.
Length = 255
Score = 31.0 bits (71), Expect = 4.5
Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 5/75 (6%)
Query: 1290 TDLRAATAEEAAYELAKPADKVTVLAQYSVERYNEVKDKPGDSFYMRALFSRTGDLGDAL 1349
LR + ++ A L K +DK LA + R+ + + + + A+ + TGD+ D L
Sbjct: 36 DALRELSPDDLA-SLMKISDK---LAGLNAARFQDWQPEFTTANARPAILAFTGDVYDGL 91
Query: 1350 QLR-FHKDDILYVDN 1363
+ D+ Y
Sbjct: 92 DADSLSEADLDYAQQ 106
>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
Provisional.
Length = 389
Score = 31.2 bits (71), Expect = 4.5
Identities = 18/53 (33%), Positives = 23/53 (43%)
Query: 1256 GGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPA 1308
G G+ V + G AG+R GD IL +GT + EAA L P
Sbjct: 98 DGPPAGLVVVAPAPGGPAARAGIRPGDVILAIDGTSTEGLSLYEAADRLQGPE 150
>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
Length = 353
Score = 31.1 bits (71), Expect = 4.8
Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 10/81 (12%)
Query: 802 IRRRRV--GARSLVTTQLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSI 859
IR RV G + ++ ++ G+ G+ + ++SP AAK G + V D ++S+
Sbjct: 248 IRDGRVIRGYIGIGGREIAPLHAQGGGIDQLQGIVVNEVSPDGPAAKAG-IQVNDLIISV 306
Query: 860 NNR-------TVDGLSSAREG 873
NN+ T+D ++ R G
Sbjct: 307 NNKPAISALETMDQVAEIRPG 327
Score = 30.7 bits (70), Expect = 6.4
Identities = 17/60 (28%), Positives = 32/60 (53%), Gaps = 8/60 (13%)
Query: 197 NSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNR-------TVDGLSSAREG 249
++ G+ G+ + ++SP AAK G + V D ++S+NN+ T+D ++ R G
Sbjct: 269 HAQGGGIDQLQGIVVNEVSPDGPAAKAG-IQVNDLIISVNNKPAISALETMDQVAEIRPG 327
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 31.6 bits (71), Expect = 4.9
Identities = 62/291 (21%), Positives = 111/291 (38%), Gaps = 39/291 (13%)
Query: 406 HSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEY 465
H+ + +L Q++VA ++++ +QE+ R + + + E +
Sbjct: 329 HADSTCQL---QKQVAELQEEVTSQSQEQAILQRSLQDKEAEVEVERMNSKVLQGELSRA 385
Query: 466 QEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEM 525
QEA ++QQQ A ++ L+ + E +++ Q M+E + ++
Sbjct: 386 QEARRRLQQQVASAEEQ--------------LQLVVEAVHSSQQGLLATMAEVEVALAQL 431
Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALK-ECNDL 584
LS L+ AVRK+ T+ + LV K L+ L++E + + L E L
Sbjct: 432 PSLSNRLSYAVRKVHTI----QGLVARKVALA----QLRQESSPPAPAEAEDLSTELRQL 483
Query: 585 RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDN 644
RE A E Q AR R+ +ER + + L + Q+ +
Sbjct: 484 REERNRLDA---ELQ---LSARLIQQEVGRAREQGEEERQRLAEVAQQLERELQESQESL 537
Query: 645 LEANQEIDRLRK-------HTDKLQAELQDQSVNYLFILTESIQEAEVSKR 688
A +++ R L+ EL Q Y L E + E E R
Sbjct: 538 ASAGAQLEAARAGQQESTEEAASLRQELTQQQEVYGQALQEKVAEVETRLR 588
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 30.9 bits (70), Expect = 4.9
Identities = 25/147 (17%), Positives = 58/147 (39%), Gaps = 11/147 (7%)
Query: 463 NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVH 522
NE E L + + ++E + R + L+ L EER ++E +
Sbjct: 60 NELGEQLIEEGHPDAEEIQERLEELNQRWEE---LRELAEERRQRLEEALDLQQF----F 112
Query: 523 KEMEKLSEDLTQAVRKIKTLEM-ENKELVEDKKTLSYQIETLKREIASALHDRDKALKEC 581
++ + L + L + + + ++ ++ E VE+ + + L+ E+ + +
Sbjct: 113 RDADDLEQWLEEKEAALASEDLGKDLESVEELLK---KHKELEEELEAHEPRLKSLNELA 169
Query: 582 NDLRERFGEFTAAKEECQREAFHARFE 608
+L E + E + E + R+E
Sbjct: 170 EELLEEGHPDADEEIEEKLEELNERWE 196
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 31.7 bits (72), Expect = 5.0
Identities = 37/180 (20%), Positives = 46/180 (25%), Gaps = 16/180 (8%)
Query: 920 GDQPPPDSYQVPKTSTKSHDPAKPSPSSSGGGGKWDSLREKMQGVRRRGGGEEGKRSNRN 979
+ P +S+ PSPSS G G S R R E S +
Sbjct: 275 WNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR---ESSSSSTSS 331
Query: 980 SSPPVTFEQEQVDAIAELDSFVHTHSPSDGSSPHYHKAPRSLASAHSHPGGPNNGAGGTS 1039
SS V SPS P + A
Sbjct: 332 SSESS--RGAAVSPGPSPSR-----SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384
Query: 1040 YETSSYHSYDAAGTFPRKKENQRF---RIPSNPSVAFKTSGTKMSNGSIEKTSERGSPMP 1096
T AG R+ RF R +P A SG + + S G P P
Sbjct: 385 -PTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS--GEPWP 441
>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
The binding of the LYPxL motif of late HIV p6Gag and
EIAV p9Gag to this domain is necessary for viral
budding.This domain is generally central between an
N-terminal Bro1 domain, pfam03097 and a C-terminal
proline-rich domain. The retroviruses thus used this
domain to hijack the ESCRT system of the cell.
Length = 292
Score = 31.1 bits (71), Expect = 5.2
Identities = 40/275 (14%), Positives = 96/275 (34%), Gaps = 60/275 (21%)
Query: 320 AMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYM 379
L+ E LR+KY + T+P S L
Sbjct: 48 IKELLDEEESEDEQLRAKYGERW------------------TRPPS------SELTAELR 83
Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQ--EKNTA 437
L KY + + + KLE +E + + E++ ++
Sbjct: 84 EELRKYRGY----------LEQASESDAQLRSKLEEHEENLELLSGPEEELEALLPSSSP 133
Query: 438 LRERNGLKQ--QCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATK 495
+ +Q + + + + ER + E L ++ +D + ++ +A+ +
Sbjct: 134 SKTPEVSEQISRLRELLNKLNELKAEREKLLEELK--EKAQDDDISKL--LIALNKLGSS 189
Query: 496 DLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKT 555
+ ++L EE ++++ ++ V + + K E L + ++ N+E ++++K
Sbjct: 190 NEEQLFEEE---LKKFD---PLQERVEQNLSKQEELLKE-------IQEANEEFLQERK- 235
Query: 556 LSYQIETLKREIASALHDRDKALKECNDLRERFGE 590
++ ++E L + A + +L E
Sbjct: 236 ----SDSEQKEREKVLQKLENAYDKYKELSSNLEE 266
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 31.4 bits (71), Expect = 5.2
Identities = 41/174 (23%), Positives = 77/174 (44%), Gaps = 15/174 (8%)
Query: 416 AQEEVARIKKQCE-DMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ 474
A++E ++ + E D K TA RE LK++ ++ E +Y+E + + +
Sbjct: 40 AEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEAKE------EARKYREEIEQEFK 93
Query: 475 QHEDAVKEINTAMAVRMKA--TKDLKRLTEERNAAMQEYSLIMSER--DSVHKEMEKLSE 530
+K+I + + R + KD ++E+ +E SL + D +++EKL E
Sbjct: 94 SERQELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEE 153
Query: 531 DLTQAVRKIKTLEMENKE---LVEDKKTLSYQIETLKREIASALHDR-DKALKE 580
+ ++ L L E + L+++I T RE + DR DK K+
Sbjct: 154 QKKAELERVAALSQAEAREIILAETENKLTHEIATRIREAEREVKDRSDKMAKD 207
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.1 bits (70), Expect = 5.3
Identities = 22/104 (21%), Positives = 45/104 (43%), Gaps = 5/104 (4%)
Query: 411 DKLELAQEEVARIKK----QCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQ 466
+ + ++ +K + E + Q + L+ + KQ A +Q + +++ E
Sbjct: 80 QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAE-KQAQLEQKQQEEQA 138
Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
A Q++ +A K A A ++KA + K+ EE A +E
Sbjct: 139 RKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEE 182
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 30.7 bits (70), Expect = 5.3
Identities = 34/198 (17%), Positives = 80/198 (40%), Gaps = 22/198 (11%)
Query: 362 QPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA 421
P + L Q ++ L++ + K + D +K+ DD A +L ++E+
Sbjct: 11 LPTPEQKALIQDLEQA-LNFLDEIDKSKQKADQYQKQIDD-------APKELRELRQELE 62
Query: 422 RIKKQCEDMNQE-KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV 480
+KK + E N +L + L+Q+ + Q + + L ++Q + E A
Sbjct: 63 ALKKTDAPVFPELANLSLSQ---LEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQ 119
Query: 481 KEINTAMAVRMKATKDLKR-------LTEERNAAMQ-EYSLIMSERDSVHKEMEKLSE-- 530
++++ A + L+ L + + +Q E + + ++ + + E+ +
Sbjct: 120 QQLSEARRRLQEIRNRLQALSPGGTPLAQAQRTLLQAELAALKAQIEELELELLSNNNRQ 179
Query: 531 DLTQAVRKIKTLEMENKE 548
+L + R + +E E
Sbjct: 180 ELLRLQRDLLKKRIERLE 197
>gnl|CDD|147120 pfam04803, Cor1, Cor1/Xlr/Xmr conserved region. Cor1 is a
component of the chromosome core in the meiotic prophase
chromosomes. Xlr is a lymphoid cell specific protein.
Xlm is abundantly transcribed in testis in a
tissue-specific and developmentally regulated manner.
The protein is located in the nuclei of spermatocytes,
early in the prophase of the first meiotic division, and
later becomes concentrated in the XY nuclear subregion
where it is in particular associated with the axes of
sex chromosomes.
Length = 130
Score = 29.6 bits (67), Expect = 5.4
Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 12/102 (11%)
Query: 396 RKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQW 455
RKR + E E+V K Q E Q+ N QQ +QW
Sbjct: 8 RKRLETYTKASFKGS---EQKLEQV--WKTQQEQR-QKLN------QEYSQQFLTLFQQW 55
Query: 456 DIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDL 497
+ ++ E +E L+ + +Q + ++ R+K K+L
Sbjct: 56 EADAQKFEEQEEKLSNMFRQQQKLFQQARIVQNQRLKTIKEL 97
>gnl|CDD|223400 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA
replication, recombination, and repair].
Length = 638
Score = 31.2 bits (71), Expect = 5.5
Identities = 34/153 (22%), Positives = 55/153 (35%), Gaps = 33/153 (21%)
Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
YE +K+E + + LI +LEL+ EE E++ E+ G
Sbjct: 484 LYEKLKNELGNVGELQPLLIP------IRLELSPEEADV---------LEEHKEELEKLG 528
Query: 444 LKQQ----CTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
+ + + A+R L + + + + E KDLK
Sbjct: 529 FEIESFGENSVAVRSVPAMLGKAEVQELIRELLDD-----LLE---------GKLKDLKE 574
Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL 532
L EE A M S + + R+ +EM L DL
Sbjct: 575 LLEELAATMACRSAVKAGRELSAEEMNALLRDL 607
>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 350 to 473 amino acids in length. This domain is
found associated with pfam07923.
Length = 402
Score = 31.1 bits (71), Expect = 5.6
Identities = 17/90 (18%), Positives = 38/90 (42%), Gaps = 6/90 (6%)
Query: 479 AVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRK 538
++ E + +++ K+L EER M+ ER + + D++++V++
Sbjct: 3 SIIEAGELFSKKVRMKLATKQLWEEREKFMKY------ERGWDTESLVGDPLDISESVKE 56
Query: 539 IKTLEMENKELVEDKKTLSYQIETLKREIA 568
+K + + KK ++ T EI
Sbjct: 57 LKLEMYTSLAEDQPKKEEIERLSTDSEEIP 86
>gnl|CDD|169982 PRK09578, PRK09578, periplasmic multidrug efflux lipoprotein
precursor; Reviewed.
Length = 385
Score = 30.9 bits (70), Expect = 5.6
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 314 RGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLE 354
R A+A+ EAA + R +Y DL+ D+ +R+ E
Sbjct: 103 RDAAAGALAKAEAAHLAALDKRRRYDDLVRDRAVSERDYTE 143
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 29.5 bits (67), Expect = 5.7
Identities = 12/60 (20%), Positives = 30/60 (50%), Gaps = 4/60 (6%)
Query: 417 QEEVARIKKQCEDMNQEKNTALRE-RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQ 475
QE++ +++++ + + + A++E R QQ +Q + +ER + E ++ Q
Sbjct: 26 QEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVE---QERAKRLEQKNQLLFQ 82
>gnl|CDD|233008 TIGR00524, eIF-2B_rel, eIF-2B alpha/beta/delta-related
uncharacterized proteins. This model, eIF-2B_rel,
describes half of a superfamily, where the other half
consists of eukaryotic translation initiation factor 2B
(eIF-2B) subunits alpha, beta, and delta. It is unclear
whether the eIF-2B_rel set is monophyletic, or whether
they are all more closely related to each other than to
any eIF-2B subunit because the eIF-2B clade is highly
derived. Members of this branch of the family are all
uncharacterized with respect to function and are found
in the Archaea, Bacteria, and Eukarya, although a number
are described as putative translation intiation factor
components. Proteins found by eIF-2B_rel include at
least three clades, including a set of uncharacterized
eukaryotic proteins, a set found in some but not all
Archaea, and a set universal so far among the Archaea
and closely related to several uncharacterized bacterial
proteins [Unknown function, General].
Length = 303
Score = 30.9 bits (70), Expect = 5.8
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 2/64 (3%)
Query: 378 YMSALNKYETIKDEYDA-LRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNT 436
++A +E+ L K D L++T TAV+ L A E V + E + + K +
Sbjct: 31 ALAARKIETDNVEEFKEDLEKAADFLLSTRPTAVN-LFWALERVLNSAENGESVEEAKES 89
Query: 437 ALRE 440
LRE
Sbjct: 90 LLRE 93
>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
Provisional.
Length = 638
Score = 31.3 bits (71), Expect = 6.1
Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 3/76 (3%)
Query: 464 EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK 523
+YQ+ L+ VQ+Q + A +A KD KR E Q + E + K
Sbjct: 514 DYQQWLSDVQKQENQTDEAPKENNANSAQARKDQKRREAELRTQTQP---LRKEIARLEK 570
Query: 524 EMEKLSEDLTQAVRKI 539
EMEKL+ L QA K+
Sbjct: 571 EMEKLNAQLAQAEEKL 586
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 30.8 bits (69), Expect = 6.2
Identities = 35/184 (19%), Positives = 65/184 (35%), Gaps = 20/184 (10%)
Query: 410 VDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL 469
+ KL A E KK +D N + L ERN Q +++ + NE E
Sbjct: 92 ISKLSAAVENAKNEKKAIDDKNAQIAKELAERNAKIQSNIEELKKIN------NEAFELS 145
Query: 470 AKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLS 529
V + + K+ + + L ++ + ++ ++ E+E
Sbjct: 146 KTVNKTIAEVEKKFKIPKDFKEQLENFADDLLDKSRQIDEFTTVTSTQEGFTLAELESFK 205
Query: 530 EDLTQAVRKIKT-----LEMENKEL---------VEDKKTLSYQIETLKREIASALHDRD 575
E T +K+ L+ EL VE+ K LS++I + + + +
Sbjct: 206 EITTTWFNGMKSEWARVLDAWKNELTEINSIIKGVEELKKLSHEISEFSNSVKKTISELE 265
Query: 576 KALK 579
K K
Sbjct: 266 KKFK 269
>gnl|CDD|235076 PRK02842, PRK02842, light-independent protochlorophyllide reductase
subunit N; Provisional.
Length = 427
Score = 31.0 bits (71), Expect = 6.2
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 9/56 (16%)
Query: 1446 YSDSG-----TLNEDSVSLASYQRVIRLNSEPV-LRPVILVGALSDTVTDKLLQDF 1495
YS SG T ED+V LA+ V P ++LVG+L+D V D+L +F
Sbjct: 135 YSGSGLETTFTQGEDAV-LAAL--VPFCPEAPADHPSLVLVGSLADVVEDQLTLEF 187
>gnl|CDD|203998 pfam08606, Prp19, Prp19/Pso4-like. This regions is found
specifically in PRP19-like protein. The region
represented by this family covers the sequence
implicated in self-interaction and a coiled-coiled
motif. PRP19-like proteins form an oligomer that is
necessary for spliceosome assembly.
Length = 70
Score = 28.4 bits (64), Expect = 6.2
Identities = 12/45 (26%), Positives = 23/45 (51%), Gaps = 7/45 (15%)
Query: 555 TLSYQIETLKREIASALHDRD-------KALKECNDLRERFGEFT 592
L Q++ ++E++ AL+ D + +KE ++ RE E T
Sbjct: 26 ELRKQLDQTRQELSHALYQHDAACRVIARLIKERDEAREALAELT 70
Score = 28.0 bits (63), Expect = 8.2
Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 7/55 (12%)
Query: 116 EVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 170
E ++KQ + QE + AL Q AA R ++ER+E +EALA++
Sbjct: 23 ETFELRKQLDQTRQELSHAL-------YQHDAACRVIARLIKERDEAREALAELT 70
Score = 28.0 bits (63), Expect = 8.2
Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 7/55 (12%)
Query: 419 EVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 473
E ++KQ + QE + AL Q AA R ++ER+E +EALA++
Sbjct: 23 ETFELRKQLDQTRQELSHAL-------YQHDAACRVIARLIKERDEAREALAELT 70
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 29.5 bits (67), Expect = 6.2
Identities = 20/117 (17%), Positives = 45/117 (38%), Gaps = 22/117 (18%)
Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
KE++ A+A + + + + + MS + E+EKL ++L + ++
Sbjct: 11 KEVDKAIAKEKAKWEKKQEEKKSEAEKLAK----MSAEEKAEYELEKLEKELEELEAELA 66
Query: 541 TLEMENK---------------ELV--EDKKTLSYQIETLKREIASALHD-RDKALK 579
E++ + +LV ED + ++ K+ A+ + LK
Sbjct: 67 RRELKAEAKKMLSEKGLPVELLDLVVGEDAEETKANVKAFKKLFDKAVEAGVKERLK 123
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 6.3
Identities = 17/88 (19%), Positives = 36/88 (40%), Gaps = 11/88 (12%)
Query: 460 RERNEYQEALAKVQQQHEDA--VKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
+E + +++++ +KEI +A + KA K +ER +
Sbjct: 290 KELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKK----KLKRR 345
Query: 518 RDSVHKEMEK-----LSEDLTQAVRKIK 540
+ HK E L ++L+ ++R +K
Sbjct: 346 KLGKHKYPEPPLEVVLPDELSGSLRTLK 373
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 30.3 bits (69), Expect = 6.3
Identities = 14/71 (19%), Positives = 33/71 (46%), Gaps = 3/71 (4%)
Query: 497 LKRLTEERNAAMQEYSLIMSERDSVH--KEMEKLSEDLTQAVRKIKTLEMENKELVEDKK 554
L R+ +E + Y + E ++ + + ++ ++IK LE E +EL +
Sbjct: 86 LLRVRDELRMTIAAYQ-TLYESSVAFGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVA 144
Query: 555 TLSYQIETLKR 565
L ++E +++
Sbjct: 145 ELEAKLEAIEK 155
>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated. The HNOBA domain is
found associated with the HNOB domain and pfam00211 in
soluble cyclases and signalling proteins. The HNOB
domain is predicted to function as a heme-dependent
sensor for gaseous ligands, and transduce diverse
downstream signals, in both bacteria and animals.
Length = 209
Score = 30.3 bits (69), Expect = 6.5
Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 24/81 (29%)
Query: 493 ATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVED 552
AT+DL L E+ A + + K +EKL + L +A R+++ E
Sbjct: 151 ATRDLILLGEQARA-----------QLKLKKRLEKLKDKLEEAHRELE---------EEK 190
Query: 553 KKT--LSYQIETLKREIASAL 571
KKT L Y + L + +A+ L
Sbjct: 191 KKTDDLLYSM--LPKSVANRL 209
>gnl|CDD|184236 PRK13679, PRK13679, hypothetical protein; Provisional.
Length = 168
Score = 29.9 bits (68), Expect = 6.7
Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 387 TIKDEYDALRKRFDD---LINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
I+D ++ RKR+D LI H T + E++ E++ I ++ + E
Sbjct: 12 KIQDFANSYRKRYDPHYALIPPHITLKEPFEISDEQLDSIVEELRAIASE 61
>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region. Family of
proteins with a conserved region found in PAPA-1, a
PAP-1 binding protein.
Length = 89
Score = 28.5 bits (64), Expect = 6.7
Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 4/76 (5%)
Query: 110 KLKKKEEVARIKK-QCEDMNQE-KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 167
K+ E AR +K Q E +E K + + LK+Q R+ I + + QE A
Sbjct: 2 MQLKRAEAARRRKNQSEKRAEEEKMETINKL--LKKQARRKKREDKIKKGDEKKAQEKAA 59
Query: 168 KVQQQHEDAVKEINTA 183
+ ++ ++ ++
Sbjct: 60 RAEKAPPPMIRYVSGP 75
>gnl|CDD|217145 pfam02616, ScpA_ScpB, ScpA/B protein. ScpA and ScpB participate in
chromosomal partition during cell division. It may act
via the formation of a condensin-like complex containing
smc that pull DNA away from mid-cell into both cell
halves. These proteins are part of the Kleisin
superfamily.
Length = 177
Score = 30.0 bits (68), Expect = 6.8
Identities = 24/114 (21%), Positives = 47/114 (41%), Gaps = 10/114 (8%)
Query: 485 TAMAVRMKATKDLKRLTEERNAAMQEY-SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLE 543
++ +RMKA L+ +++ + E+ + E V K + L + V++ +
Sbjct: 43 ASILLRMKAEALLRSFSKKPV-TLDEFIKTLEEELVEVEKSVLDLIHAFNKLVKRKRKNR 101
Query: 544 MENKELVEDKKTLSYQIETLKREIASALHDRDK----ALKECNDLRERFGEFTA 593
+ K E L +E + E+ + L R K LK+ ++ G F A
Sbjct: 102 LRKKNETE----LEVSVEEVLEELLAFLGVRLKEVLTKLKKFPSKKDLVGAFVA 151
>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
(chemotaxis sensory transducer). Thought to undergo
reversible methylation in response to attractants or
repellants during bacterial chemotaxis.
Length = 262
Score = 30.3 bits (69), Expect = 7.5
Identities = 15/63 (23%), Positives = 31/63 (49%)
Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVR 537
D V+EI A + ++++ +E QE + + E + +E+ L+E+L + V
Sbjct: 200 DLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETAAMSEEISAAAEELSGLAEELDELVE 259
Query: 538 KIK 540
+ K
Sbjct: 260 RFK 262
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.9 bits (71), Expect = 7.5
Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 7/93 (7%)
Query: 462 RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSV 521
+ E + +Q++ K + R+ K +L E+ ++ L
Sbjct: 217 PEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLREKLKEELKSLRL-------T 269
Query: 522 HKEMEKLSEDLTQAVRKIKTLEMENKELVEDKK 554
K++++L E L ++++ E E LVE K
Sbjct: 270 SKQIDELVEQLRDINKRVRGQERELLRLVERLK 302
>gnl|CDD|148314 pfam06632, XRCC4, DNA double-strand break repair and V(D)J
recombination protein XRCC4. This family consists of
several eukaryotic DNA double-strand break repair and
V(D)J recombination protein XRCC4 sequences. In the
non-homologous end joining pathway of DNA double-strand
break repair, the ligation step is catalyzed by a
complex of XRCC4 and DNA ligase IV. It is thought that
XRCC4 and ligase IV are essential for alignment-based
gap filling, as well as for final ligation of the
breaks.
Length = 331
Score = 30.6 bits (69), Expect = 7.8
Identities = 55/282 (19%), Positives = 110/282 (39%), Gaps = 38/282 (13%)
Query: 477 EDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV 536
D +E + R K +L++ EY+ S ++S+H EK +D++ +
Sbjct: 50 SDISQEADDMAMEREKYVDELRKALVSGAGPAGEYTFNFS-KESLHFSYEKNLKDVSFRL 108
Query: 537 RKIKTLEMENKELVEDKKTLSYQIETLKREIASALH---DRDKALKECNDLRERFGEFTA 593
K ++ N V ++ + Y ++T+ A H + ++ L++ ND++ RF + +
Sbjct: 109 GSFKLEKVPNPAEV-IRELICYCLDTIAELQAKNEHLQKENERLLRDWNDIQGRFEKCVS 167
Query: 594 AKEECQREAFHARFEYAM--RMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEI 651
KE + + + RF + + + S K +E + + ++++ + +++
Sbjct: 168 GKEALETD-LYKRFILVLNEKKAKIRSLHKLLSEAQELEKSI----KQKKETSACSDKTP 222
Query: 652 DRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDSYHTKHA 711
D K+ D S T+ QEA A +D + T A
Sbjct: 223 DEDSKY---------DGS-------TDEEQEAPPKPSESMPAAVSKDDSLISSPDITDIA 266
Query: 712 GRHKRGKELDKNGGTWPKARSGPVIENGTGTILHPRRHKERL 753
KR + + KN GT PK T + KE+
Sbjct: 267 PSRKRRQRMQKNLGTEPKMA----------TQEQQLQEKEKP 298
>gnl|CDD|222389 pfam13801, Metal_resist, Heavy-metal resistance. This is a
metal-binding protein which is involved in resistance to
heavy-metal ions. The protein forms a four-helix hooked
hairpin, consisting of two long alpha helices each
flanked by a shorter alpha helix. It binds a metal ion
in a type-2 like centre. It contains two copies of an
LTXXQ motif.
Length = 125
Score = 29.1 bits (66), Expect = 7.9
Identities = 21/88 (23%), Positives = 28/88 (31%), Gaps = 9/88 (10%)
Query: 583 DLRERFGEFTAAKEECQREAFHARFEYAMR--MGERDSSRKERNEVEAASMDLFGKCQKE 640
R G A QR A A R R + R R E+ A + E
Sbjct: 30 RRRPGLGPAALALPPEQRRALRAALRAHRREVRPLRRALRAARRELLALL-------RAE 82
Query: 641 RMDNLEANQEIDRLRKHTDKLQAELQDQ 668
D + R R+ LQA +Q+
Sbjct: 83 PFDPAALRAALARQREARAALQARVQEA 110
>gnl|CDD|218452 pfam05130, FlgN, FlgN protein. This family includes the FlgN
protein and export chaperone involved in flagellar
synthesis.
Length = 140
Score = 29.3 bits (66), Expect = 8.1
Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 12/97 (12%)
Query: 514 IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALH- 572
++ E + +E+ +L E+ +A++ +E EL E+K+ L Q+E L+ E L
Sbjct: 5 LLEEELELLEELLELLEEEQEALKAGDIEALE--ELTEEKQELLAQLEQLEEERQQLLAE 62
Query: 573 ----DRDKALKEC-----NDLRERFGEFTAAKEECQR 600
L E +LRE + E EE Q
Sbjct: 63 LGLSPGPAGLSELLAAEDEELRELWQELKELLEELQE 99
>gnl|CDD|225575 COG3031, PulC, Type II secretory pathway, component PulC
[Intracellular trafficking and secretion].
Length = 275
Score = 30.3 bits (68), Expect = 8.9
Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 2/61 (3%)
Query: 1256 GGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYEL--AKPADKVTV 1313
G + GSL Y +GL+ GD + N DL ++ P+ ++TV
Sbjct: 203 NEKIEGYRFEPGKDGSLFYKSGLQRGDIAVAINNLDLTDPEDMFRLLQMLRNMPSLQLTV 262
Query: 1314 L 1314
+
Sbjct: 263 I 263
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 30.4 bits (69), Expect = 9.2
Identities = 41/229 (17%), Positives = 78/229 (34%), Gaps = 23/229 (10%)
Query: 372 DSLNQHYM---SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCE 428
+ + + Y + L + E L D+++ Q+ + I+K
Sbjct: 333 ERVKESYRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALT 392
Query: 429 DMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA 488
D+ E+ L++ A + R +++ E +++ + + E
Sbjct: 393 DIEDEQEKVQEHLTSLRKDELEA---RENLERLKSKLHEIKRYMEKSNLPGLPE------ 443
Query: 489 VRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKE 548
A E +M E V ME +S + A + TLE E +E
Sbjct: 444 ----------TFLSLFFTAGHEIQDLMKELSEVPINMEAVSALVDIATEDMNTLEDETEE 493
Query: 549 LVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEE 597
+VE L+ Q+ S + K+L E L E ++ A+ E
Sbjct: 494 VVE-NAVLAEQLIQYGNRYRSRNAEVAKSLNEAERLFENAFDYDASFEI 541
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.129 0.366
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 82,784,939
Number of extensions: 8263287
Number of successful extensions: 8258
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7588
Number of HSP's successfully gapped: 552
Length of query: 1659
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1549
Effective length of database: 6,058,662
Effective search space: 9384867438
Effective search space used: 9384867438
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 66 (29.1 bits)