RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10797
(517 letters)
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 55.9 bits (135), Expect = 3e-08
Identities = 32/151 (21%), Positives = 76/151 (50%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
EE R++++LE Q +L++++A ++E E L + +E + E+++E+ L ++L E +
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
+ L K + + E++ L+E+++ L ++E + LE + L+
Sbjct: 765 EELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRE 824
Query: 416 NAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
E+E+ + ++ + K E + +L+
Sbjct: 825 RLEQEIEELEEEIEELEEKLDELEEELEELE 855
Score = 51.6 bits (124), Expect = 8e-07
Identities = 29/151 (19%), Positives = 69/151 (45%), Gaps = 3/151 (1%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
+ E + EE ++++LE++ L++ +A K+E E L + + + E+++E+
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802
Query: 345 SRLLQQLRE-KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
++L + + ++ +L EE + L E++ L ++E + + +L
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEEL-EEEIEELEEKLDELEEELEELEKELEEL 861
Query: 404 EEKERFLQTVLTNAEKEL-HLRNQAMDLNKR 433
+E+ L+ E EL L + +L +
Sbjct: 862 KEELEELEAEKEELEDELKELEEEKEELEEE 892
Score = 50.9 bits (122), Expect = 2e-06
Identities = 32/141 (22%), Positives = 64/141 (45%), Gaps = 10/141 (7%)
Query: 284 DSKRDERKKLADE-EAMRKIKQLEDQTH--QLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
+ E ++ +E E E + L++++A + E + NE+E + E +
Sbjct: 350 EQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESL 409
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
+E+ RL ++L + L E + + + E + L E+++ L Q+E + +
Sbjct: 410 EERLERLSERLED-------LKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462
Query: 401 RKLEEKERFLQTVLTNAEKEL 421
++LE + LQ L EKEL
Sbjct: 463 KELERELAELQEELQRLEKEL 483
Score = 49.7 bits (119), Expect = 3e-06
Identities = 28/162 (17%), Positives = 73/162 (45%), Gaps = 4/162 (2%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
+ E ++ EE + + +LE+ +L+++++ +E E L + + +++
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR---EELAELEA 390
Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
+ + + +L E L + ++ + +EE L +++ L ++E ++ + +
Sbjct: 391 ELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE 450
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
LEE+ L+ L E+EL + + K + S ++ D
Sbjct: 451 LEEQLEELRDRLKELERELAELQEELQ-RLEKELSSLEARLD 491
Score = 48.2 bits (115), Expect = 9e-06
Identities = 39/148 (26%), Positives = 75/148 (50%), Gaps = 9/148 (6%)
Query: 283 RDSKRDERKKLADE--EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
R + ++E ++L +E EA R++ LE + L+++ +QE E L E+E + +++
Sbjct: 788 RQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS-------LQI 393
+E+ L ++L E + +L E+ + K EEK+ L E+++ L +I
Sbjct: 848 EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEI 907
Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKEL 421
E + + +LE K L+ L E+EL
Sbjct: 908 EKLRERLEELEAKLERLEVELPELEEEL 935
Score = 45.1 bits (107), Expect = 8e-05
Identities = 32/151 (21%), Positives = 73/151 (48%), Gaps = 1/151 (0%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+ + E+ ++++LE++ +L++++ ++ E L E+E +A ++E+
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIE 782
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L ++ + + +L E ++ + E ++L ++ + L +IE + I +LEE
Sbjct: 783 ELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEE 842
Query: 406 KERFLQTVLTNAEKEL-HLRNQAMDLNKRKA 435
K L+ L EKEL L+ + +L K
Sbjct: 843 KLDELEEELEELEKELEELKEELEELEAEKE 873
Score = 40.9 bits (96), Expect = 0.002
Identities = 36/174 (20%), Positives = 88/174 (50%), Gaps = 8/174 (4%)
Query: 273 LEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEV 332
L + A ++ ++ RK+L EE ++ +LE++ +LQ+++ ++E E L +E+E
Sbjct: 222 LRELELALLLAK-LKELRKEL--EELEEELSRLEEELEELQEELEEAEKEIEELKSELEE 278
Query: 333 TGQAFEDMQEQNSRLLQQLREKDD--ANFKLMTERIKSNQLHKLAREEKDTLREQVQALS 390
+ E++QE+ L +++ E + + + E +++ R E+ L+E+++AL
Sbjct: 279 LREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEE--LKEKIEALK 336
Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
++E T + +LE+ L+ E++L + ++ +A+ + +
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE-ELFEALREELAELE 389
Score = 39.3 bits (92), Expect = 0.005
Identities = 31/167 (18%), Positives = 73/167 (43%), Gaps = 6/167 (3%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
+ ++++ + E +I L ++ +L+ ++ ++ E L ++E + E+ +
Sbjct: 291 LELKEEIEELE--GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE 348
Query: 347 LLQQL----REKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
L Q L K++ KL + +L + REE L ++ + ++E + I
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIES 408
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
LEE+ L L + ++EL ++ + + E + +L+ L
Sbjct: 409 LEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455
Score = 39.3 bits (92), Expect = 0.005
Identities = 37/177 (20%), Positives = 77/177 (43%), Gaps = 25/177 (14%)
Query: 285 SKRDERKKLADEEAMRKIKQ-------LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF 337
SK ERK EEA RK+++ LED +L+KQ+ +++ E +
Sbjct: 168 SKYKERK----EEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAER--------Y 215
Query: 338 EDMQEQNSRL--LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEA 395
++++ + L L + + +L E ++ EE + L+E+++ +IE
Sbjct: 216 QELKAELRELELALLLAKLKELRKEL--EELEEEL--SRLEEELEELQEELEEAEKEIEE 271
Query: 396 MHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVII 452
+ + + +L E+ LQ L ++E+ + L + + E +L+ L +
Sbjct: 272 LKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEEL 328
Score = 35.8 bits (83), Expect = 0.064
Identities = 33/156 (21%), Positives = 68/156 (43%), Gaps = 30/156 (19%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
+ E+++L DE +K+LE++ +L++++ + E L E+E + E+++
Sbjct: 865 LEELEAEKEELEDE-----LKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEA 919
Query: 343 QNSRLLQQLREKDDAN---FKLMTERIKSNQLHKLAR-------------EEKDTLREQV 386
+ RL +L E ++ ++ E ++ +L EE + + E+
Sbjct: 920 KLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERY 979
Query: 387 QALSLQIEAMHTAIRKLEE---------KERFLQTV 413
+ L Q E + A KL E +ERF +T
Sbjct: 980 EELKSQREDLEEAKEKLLEVIEELDKEKRERFKETF 1015
Score = 31.2 bits (71), Expect = 1.6
Identities = 29/131 (22%), Positives = 56/131 (42%), Gaps = 16/131 (12%)
Query: 319 HKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE 378
+K+ A E++ + +++ Q +L ++L+ + L EE
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLL-----------EE 706
Query: 379 KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL-HLRNQAMDLNKRKA-I 436
LR Q++ L Q+E + + LEE+ LQ+ L E+EL L + +L +R +
Sbjct: 707 ---LRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEEL 763
Query: 437 ESAQSAADLKL 447
E + + L
Sbjct: 764 EEELESLEEAL 774
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 52.8 bits (127), Expect = 3e-07
Identities = 27/157 (17%), Positives = 73/157 (46%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
E +++++LE++ QL+K++ ++ AL ++ E ++E+ ++L ++L E +
Sbjct: 701 AELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
+L ++ + A E + L Q++ L +++A+ A+ +L + L
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820
Query: 416 NAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVII 452
N + L + + +R+ + + +L + +
Sbjct: 821 NLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857
Score = 47.7 bits (114), Expect = 1e-05
Identities = 33/172 (19%), Positives = 74/172 (43%), Gaps = 15/172 (8%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ--- 343
E K E ++ +LE++ +L++++ + + E E L E+E E+++EQ
Sbjct: 327 ELESKLDELAE---ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLET 383
Query: 344 -NSRLLQQLREKDDANFKLMTERIKSNQL-HKLAR-------EEKDTLREQVQALSLQIE 394
S++ Q + N ++ + +L + R K +++ L ++E
Sbjct: 384 LRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELE 443
Query: 395 AMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
+ + +L+E+ L+ L +EL QA+D +R+ + L+
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLE 495
Score = 45.4 bits (108), Expect = 7e-05
Identities = 24/123 (19%), Positives = 52/123 (42%)
Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF 359
+ +L + L ++ A ++ E+L + T + ED++EQ L + +
Sbjct: 803 EALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIE 862
Query: 360 KLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
+L + + E+ +L E + L ++E + +R+LE K L+ L +
Sbjct: 863 ELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELRE 922
Query: 420 ELH 422
+L
Sbjct: 923 KLA 925
Score = 42.7 bits (101), Expect = 4e-04
Identities = 29/167 (17%), Positives = 68/167 (40%), Gaps = 17/167 (10%)
Query: 284 DSKRDERKKLADEEAM--RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+ R+E ++L +E ++++L + +L++++ + E L E+E + +
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALA 294
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE--------------QVQ 387
+ SRL QQ + + L + + + + D L E +++
Sbjct: 295 NEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELE 354
Query: 388 ALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH-LRNQAMDLNKR 433
+L ++E + + +LE + L+ L ++ L Q LN
Sbjct: 355 SLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401
Score = 42.0 bits (99), Expect = 0.001
Identities = 46/186 (24%), Positives = 88/186 (47%), Gaps = 34/186 (18%)
Query: 285 SKRDERKKLADEEAMRKI-------KQLEDQTHQLQKQVAT-HKQEEEAL--------LN 328
SK ER+K E RK+ +LED ++L++Q+ + +Q E+A L
Sbjct: 168 SKYKERRK----ETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELR 223
Query: 329 EME--VTGQAFEDMQEQNSRLLQQL----REKDDANFKLMTERIKSNQLHKLAREEKDTL 382
E+E + E+++E+ L ++L E ++ +L K +L +L E +
Sbjct: 224 ELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEEL-RLEVSELEEE 282
Query: 383 REQVQA--LSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
E++Q +L E I +LE++++ L+ L N E++L ++ + K E A+
Sbjct: 283 IEELQKELYALANE-----ISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337
Query: 441 SAADLK 446
A+L+
Sbjct: 338 ELAELE 343
Score = 41.2 bits (97), Expect = 0.001
Identities = 37/168 (22%), Positives = 69/168 (41%), Gaps = 25/168 (14%)
Query: 264 DSASNIYGSLEFRKFACVVRDSKRDERKKLADE--EAMRKIKQLEDQTHQLQKQVATHKQ 321
++ N SLE R E ++L++E E K +L + +L++++A +
Sbjct: 876 EALLNERASLEEA------LALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929
Query: 322 EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKL----- 374
E L ++ + + + L K + + + R+K N++ +L
Sbjct: 930 RLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNL 989
Query: 375 -AREEKDTLREQVQALSLQIEAMHTAIRKLEE---------KERFLQT 412
A EE + L+E+ L+ Q E + A LEE +ERF T
Sbjct: 990 AAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARERFKDT 1037
Score = 41.2 bits (97), Expect = 0.001
Identities = 32/162 (19%), Positives = 75/162 (46%), Gaps = 8/162 (4%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
S+ +++K++ E ++ LE Q +L+ Q+ + + + L E+ + E+++E+
Sbjct: 298 SRLEQQKQILRE----RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
L +L E + +L + + + + R + L Q+ +L+ +IE + + +LE
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413
Query: 405 EKERFLQTVLTNAEKELH---LRNQAMDLN-KRKAIESAQSA 442
++ LQ + K+L L+ +L + +E Q
Sbjct: 414 DRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
Score = 41.2 bits (97), Expect = 0.001
Identities = 35/193 (18%), Positives = 77/193 (39%), Gaps = 4/193 (2%)
Query: 253 KSRWFDFRSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQL 312
+ + + LE V + +R + E +I++LE++ +
Sbjct: 718 RKELEELSRQISALRKDLARLEAE----VEQLEERIAQLSKELTELEAEIEELEERLEEA 773
Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 372
++++A + E E L ++E + + ++E L +L ++ L +
Sbjct: 774 EEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI 833
Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNK 432
+ L EQ++ LS IE++ I +LEE L++ L E +A+ L +
Sbjct: 834 AATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLR 893
Query: 433 RKAIESAQSAADL 445
+ E ++ +L
Sbjct: 894 SELEELSEELREL 906
Score = 40.0 bits (94), Expect = 0.003
Identities = 22/103 (21%), Positives = 51/103 (49%)
Query: 320 KQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 379
++E E L ++E + ++++ + L ++L E ++ +L E + ++ R++
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735
Query: 380 DTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH 422
L +V+ L +I + + +LE + L+ L AE+EL
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778
Score = 39.7 bits (93), Expect = 0.004
Identities = 38/208 (18%), Positives = 69/208 (33%), Gaps = 37/208 (17%)
Query: 252 YKSRWFDFRSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKI-KQLEDQTH 310
K R D L + A + + ER+ A E + + +Q+E+ +
Sbjct: 794 LKEELKALREALDELRAELTLLN-EEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
Query: 311 Q----------LQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFK 360
L++ + + E EALLNE E + S L
Sbjct: 853 DIESLAAEIEELEELIEELESELEALLNERA---SLEEALALLRSEL------------- 896
Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKE 420
S +L +L ++ LR +++ L ++ + + LE + LQ L+ E
Sbjct: 897 ----EELSEELREL-ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLS----E 947
Query: 421 LHLRNQAMDLNKRKAIESAQSAADLKLH 448
+ IE + A +L
Sbjct: 948 EYSLTLEEAEALENKIEDDEEEARRRLK 975
Score = 39.3 bits (92), Expect = 0.005
Identities = 33/180 (18%), Positives = 74/180 (41%), Gaps = 21/180 (11%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+ E ++L + ++++L ++ +LQ+++ ++E E L E++ + E+++ + S
Sbjct: 218 LKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVS 277
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L +++ E + L E + Q ++ RE L Q++ L Q+E KL+E
Sbjct: 278 ELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLE---ELESKLDE 334
Query: 406 KERFLQTV-----------------LTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLH 448
L + L E EL ++ + +E+ +S
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE-ELEEQLETLRSKVAQLEL 393
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 50.4 bits (121), Expect = 5e-07
Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 4/141 (2%)
Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSN 369
+K + K E EAL +E ED++ Q S+L +Q++RE+ + ++
Sbjct: 30 RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDER 89
Query: 370 QLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
+L L E +E++ +L ++ + I KLE++ L+ L EK L ++
Sbjct: 90 ELRALNIEI-QIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLE 148
Query: 430 LNKRKAIESAQSAADLKLHLV 450
+ E Q + + L
Sbjct: 149 EEVAEIREEGQELSSKREELK 169
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 47.7 bits (114), Expect = 1e-05
Identities = 44/200 (22%), Positives = 78/200 (39%), Gaps = 43/200 (21%)
Query: 287 RDERKKLADEEAMRK--IKQLEDQTHQL-----QKQVATH-KQEEEALLNEMEVT----G 334
R +K + E RK + +E L K A+ K++ E +NE+E+
Sbjct: 507 RRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHAN 566
Query: 335 QAFEDMQEQNSRLLQQLRE-----------KDDA--NFKLMTERIK-------------- 367
+A + Q+ + QQ++E ++DA + R
Sbjct: 567 KANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSALE 626
Query: 368 -SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
+ + K A E E+V L+ Q ++ RKLE + LQ+ L A EL
Sbjct: 627 QAERARKQAETELAEASERVNELTAQNSSLIAQKRKLEGELAALQSDLDEAVNELKA--- 683
Query: 427 AMDLNKRKAIESAQSAADLK 446
A + K+ ++A+ A +L+
Sbjct: 684 AEERAKKAQADAARLAEELR 703
Score = 37.3 bits (87), Expect = 0.018
Identities = 36/155 (23%), Positives = 72/155 (46%), Gaps = 21/155 (13%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
+S+ +E K+ +EE+ + L+ Q QL+ + ++L ++E +A +++ Q
Sbjct: 242 ESQLEEAKRSLEEESRERAN-LQAQLRQLE-------HDLDSLREQLEEESEAKAELERQ 293
Query: 344 NSR---LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
S+ +QQ R K ++ L E ++ E K L +++ L EA +
Sbjct: 294 LSKANAEIQQWRSKFESEGALRAEELE---------ELKKKLNQKISELEEAAEAANAKC 344
Query: 401 RKLEEKERFLQTVLTNAEKEL-HLRNQAMDLNKRK 434
LE+ + LQ+ L + + EL A +L K++
Sbjct: 345 DSLEKTKSRLQSELEDLQIELERANAAASELEKKQ 379
Score = 31.2 bits (71), Expect = 1.8
Identities = 32/133 (24%), Positives = 54/133 (40%), Gaps = 9/133 (6%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
E A +LE + K +A K++ + L E++ + ++ + RL +L E
Sbjct: 366 ERANAAASELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELK 425
Query: 356 DANFKLMTERIK-SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVL 414
D L E +++H L D L E + + + A R+LE ++ LQ L
Sbjct: 426 DQVEALRRENKNLQDEIHDL----TDQLGEGGRNV----HELEKARRRLEAEKDELQAAL 477
Query: 415 TNAEKELHLRNQA 427
AE L L
Sbjct: 478 EEAEAALELEESK 490
Score = 30.8 bits (70), Expect = 2.0
Identities = 30/149 (20%), Positives = 65/149 (43%), Gaps = 12/149 (8%)
Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
A EEA ++ E + + Q +++ + E E L E E + FE+ ++ + R ++ L+
Sbjct: 476 ALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKE---EEFENTRKNHQRAIESLQA 532
Query: 354 KDDANFKLMTE--RIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
+A K E R+K KL + + L + + ++K +++ + LQ
Sbjct: 533 TLEAEAKGKAEASRLKK----KLEGDINE-LEIALDHANKANAEAQKNVKKYQQQVKELQ 587
Query: 412 TVLTNAEKELH--LRNQAMDLNKRKAIES 438
T + ++ A+ + A+E+
Sbjct: 588 TQVEEEQRAREDAREQLAVAERRATALEA 616
Score = 30.4 bits (69), Expect = 3.2
Identities = 30/159 (18%), Positives = 77/159 (48%), Gaps = 16/159 (10%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
E K + E + K LE+ Q ++ +AT +++ + +NE+ + E +Q+Q ++
Sbjct: 95 ELNKKREAELAKLRKDLEEANLQHEEALATLRKKHQDAINEL---SEQIEQLQKQKAKA- 150
Query: 349 QQLREKDDANFKLMTERIKSN--QLHKLAREEKDTLREQVQA----LSLQIEAMHTAIRK 402
EK+ + + + + + Q+ K A+ + +Q+++ L ++++ + +
Sbjct: 151 ----EKEKSQLQAEVDDLLAQLDQIAK-AKLNAEKKAKQLESQLSELQVKLDELQRQLND 205
Query: 403 LEEKERFLQTVLTNAEKELH-LRNQAMDLNKRKAIESAQ 440
L ++ LQ+ ++ ++L Q +L+K K+ +Q
Sbjct: 206 LTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQ 244
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 45.8 bits (109), Expect = 4e-05
Identities = 22/118 (18%), Positives = 45/118 (38%), Gaps = 22/118 (18%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEM-EVTGQAFEDMQEQNS-RLLQQLREKDDAN 358
+++ L + QL E + L NE+ E + ++ +QN +LL+ LRE +
Sbjct: 92 QLQLLIESLAQLS-------SEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEK- 143
Query: 359 FKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTN 416
+ + E+ L +I+ + I++L ++ L L
Sbjct: 144 ------------FREQLEQRIHESAEERSTLLEEIDRLLGEIQQLAQEAGNLTAALKG 189
Score = 28.9 bits (65), Expect = 6.8
Identities = 30/136 (22%), Positives = 47/136 (34%), Gaps = 17/136 (12%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
+ + ++ +LE +L V A E E +Q N L+Q+L E
Sbjct: 22 QLLLRLGRLEQMLGELA-AVLEQLLLLLAFRAEAEQLRTFARSLQALNLELIQELNE--- 77
Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFL----QT 412
+ R++ L RE+ L E + LS + + A EE R L Q
Sbjct: 78 -----LKARLQQQLLQS--REQLQLLIESLAQLSSEFQ--ELANEIFEELNRRLAELNQQ 128
Query: 413 VLTNAEKELHLRNQAM 428
L K L +
Sbjct: 129 NLKQLLKPLREVLEKF 144
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.1 bits (107), Expect = 8e-05
Identities = 34/177 (19%), Positives = 76/177 (42%), Gaps = 22/177 (12%)
Query: 285 SKRDERKKLADE------EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
S+ + DE +A RKI ++E + QL+++ K+ E L ++ Q E
Sbjct: 695 SELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIE 754
Query: 339 D----MQEQNSRLLQQLREKDDANFKLMTERIKSNQLH---KLAREEKDTLREQVQALSL 391
+ ++E +R+ + E+D + +++ H + E L E+V +
Sbjct: 755 NVKSELKELEARIEE--LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEA 812
Query: 392 QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL-NKRKAIESAQSAADLKL 447
++ + + +L ++ +L+ ++ L+ Q +DL + K+IE + K
Sbjct: 813 RLREIEQKLNRLTLEKEYLE------KEIQELQEQRIDLKEQIKSIEKEIENLNGKK 863
Score = 40.4 bits (95), Expect = 0.002
Identities = 35/159 (22%), Positives = 69/159 (43%), Gaps = 28/159 (17%)
Query: 290 RKKLAD-----EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEME------------- 331
++K+ + R I + E + ++++A + E + LL E+E
Sbjct: 293 KEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRR 352
Query: 332 -VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQ--LHKLARE------EKDTL 382
+ + +++E+ L +L E D F + +K + L KL RE E D L
Sbjct: 353 DKLTEEYAELKEELEDLRAELEEV-DKEFAETRDELKDYREKLEKLKREINELKRELDRL 411
Query: 383 REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
+E++Q LS ++ ++ AI +E K L+ + E+
Sbjct: 412 QEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEI 450
Score = 39.3 bits (92), Expect = 0.006
Identities = 20/121 (16%), Positives = 54/121 (44%), Gaps = 3/121 (2%)
Query: 290 RKKLADEEAMR-KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
R + A+ + +R +++ L+ + LQ ++ + + L E+ + +++++ +L
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729
Query: 349 QQLREKDDANFKLMTERIKSNQLHKLA-REEKDTLREQVQALSLQIEAMHTAIRKLEEKE 407
Q+ EK + + E + S + + E L +++ L + + A+ LE +
Sbjct: 730 QE-EEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788
Query: 408 R 408
Sbjct: 789 S 789
Score = 38.9 bits (91), Expect = 0.008
Identities = 32/146 (21%), Positives = 72/146 (49%), Gaps = 14/146 (9%)
Query: 284 DSKRDERKKLADE-------EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
D KR + ++L E +A+ K K+ E + ++L K+ ++++EA+ ++ +
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKR-EYEGYELLKEKEALERQKEAIERQLASLEEE 252
Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
E + E+ S L ++L E + + + ++IK L EE+ ++E++ L +I ++
Sbjct: 253 LEKLTEEISELEKRLEEIEQL-LEELNKKIK-----DLGEEEQLRVKEKIGELEAEIASL 306
Query: 397 HTAIRKLEEKERFLQTVLTNAEKELH 422
+I + E + + L E E+
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEID 332
Score = 36.6 bits (85), Expect = 0.037
Identities = 20/144 (13%), Positives = 58/144 (40%), Gaps = 12/144 (8%)
Query: 283 RDSKRDERKKLADEEA-----MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF 337
++ D R +L + + ++K ++ +L++++ K+E + L E++ +
Sbjct: 363 KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEEL 422
Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
D+ + + ++ E ++ + IK + + L + ++ +
Sbjct: 423 ADLNAAIAGIEAKINELEEE-KEDKALEIKK------QEWKLEQLAADLSKYEQELYDLK 475
Query: 398 TAIRKLEEKERFLQTVLTNAEKEL 421
++E++ LQ L AE +
Sbjct: 476 EEYDRVEKELSKLQRELAEAEAQA 499
Score = 32.3 bits (74), Expect = 0.67
Identities = 16/84 (19%), Positives = 39/84 (46%), Gaps = 7/84 (8%)
Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
E ++ + S L +LR ++ +L E +++ +E + L ++ + L ++E +
Sbjct: 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELE 743
Query: 398 TAIRKLEEKERFLQTVLTNAEKEL 421
+ LE++ + N + EL
Sbjct: 744 EDLSSLEQE-------IENVKSEL 760
Score = 32.3 bits (74), Expect = 0.69
Identities = 33/215 (15%), Positives = 73/215 (33%), Gaps = 40/215 (18%)
Query: 287 RDERKKLADE--EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
E ++L ++ + +IK +E + L + ++E E L + D++++
Sbjct: 832 EKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKER 891
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA---------------- 388
L QLRE + +L + K + + + + L E++
Sbjct: 892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEEL 951
Query: 389 ----LSLQIEAMHTAIRKLEE------------KERFL-----QTVLTNAEKELHLRNQA 427
+ +++ + IR LE +R + L K + R +
Sbjct: 952 SLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEE 1011
Query: 428 MDLNKRKAIESAQSAADLKLHLVI-IYTNGPARLE 461
+ KR+ A A + + + + G L
Sbjct: 1012 YEKKKREVFMEAFEAINENFNEIFAELSGGTGELI 1046
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 44.1 bits (105), Expect = 1e-04
Identities = 34/143 (23%), Positives = 67/143 (46%), Gaps = 21/143 (14%)
Query: 281 VVRDSKRD--ERKKL----ADEEAMRKIKQLEDQTHQ----LQKQVATHKQEEEALLNEM 330
++ ++K++ KK A EE + +LE + + LQ+ Q EE L +M
Sbjct: 37 IIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKM 96
Query: 331 EVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR-------EEKDTLR 383
E + E+++++ L + + D+ +L E + + Q +L R E K+ L
Sbjct: 97 ESLDKKEENLEKKEKELSNKEKNLDEKEEEL--EELIAEQREELERISGLTQEEAKEILL 154
Query: 384 EQVQALSLQIEAMHTAIRKLEEK 406
E+V+ + EA I+++EE+
Sbjct: 155 EEVEE-EARHEAAKL-IKEIEEE 175
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 43.9 bits (104), Expect = 2e-04
Identities = 38/159 (23%), Positives = 68/159 (42%), Gaps = 13/159 (8%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEM--EVTGQAFEDMQEQNS 345
E D E+ I +++ QL K++A K EEE L E + ++E+
Sbjct: 153 KEDNLKDDLES--LIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELL 210
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT---AIRK 402
L+ + +L ER K +L K E+ LR++++ + E A++
Sbjct: 211 ARLESKEAALEKQLRLEFEREKE-ELRKKYEEK---LRQELERQAEAHEQKLKNELALQA 266
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMD-LNKR-KAIESA 439
+E + F + + E+E + R + LN R K +E A
Sbjct: 267 IELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKA 305
Score = 34.6 bits (80), Expect = 0.13
Identities = 31/150 (20%), Positives = 58/150 (38%), Gaps = 21/150 (14%)
Query: 274 EFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE-----EEALLN 328
E + +++ + + KL +E R + QL+ + K+E EE L
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQ 245
Query: 329 EMEVTGQAFED-------MQ--EQNSRLLQQLREKDD---ANFKLMTERIKSN--QLHKL 374
E+E +A E +Q E ++++EK + + S L K
Sbjct: 246 ELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKA 305
Query: 375 AREEKDTLRE--QVQALSLQIEAMHTAIRK 402
+ E +VQ L L +EA+ +A++
Sbjct: 306 LDSRSEAEDENHKVQQLWLAVEALKSALKS 335
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 43.1 bits (102), Expect = 2e-04
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 45/171 (26%)
Query: 288 DER-KKLADE--EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
D R KK+A E +A K ++ E + + +++ Q+ E LL +A E+ Q +
Sbjct: 35 DAREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLT------KAKEEAQAER 88
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE-QVQALSLQIEAMHTAIRKL 403
RLL + AREE D +RE +AL + A+ +R+
Sbjct: 89 QRLLDE------------------------AREEADEIREKWQEALRREQAALSDELRRR 124
Query: 404 EEKERFL--QTVLTN-AEKELHL--------RNQAMDLNKRKAIESAQSAA 443
E F + VLT+ A+ +L R + +D +++ A+ A + +
Sbjct: 125 TGAEVFAIARKVLTDLADTDLEERMVDVFVQRLRTLDPDEKAALAEALADS 175
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 43.6 bits (104), Expect = 2e-04
Identities = 36/143 (25%), Positives = 71/143 (49%), Gaps = 21/143 (14%)
Query: 281 VVRDSKRD--ERKKLADEEAMRKIKQLEDQTHQ--------LQKQVATHKQEEEALLNEM 330
++ ++K++ KK A EA +I +L ++ + LQK Q+EE L ++
Sbjct: 43 ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKL 102
Query: 331 EVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR-------EEKDTLR 383
E+ + E+++++ L Q+ +E + +L E + QL +L R E K+ L
Sbjct: 103 ELLEKREEELEKKEKELEQKQQELEKKEEEL--EELIEEQLQELERISGLTAEEAKEILL 160
Query: 384 EQVQALSLQIEAMHTAIRKLEEK 406
E+V+ + EA I+++EE+
Sbjct: 161 EKVEE-EARHEAAVL-IKEIEEE 181
Score = 32.1 bits (74), Expect = 0.77
Identities = 30/139 (21%), Positives = 65/139 (46%), Gaps = 23/139 (16%)
Query: 286 KRDERKKL--ADEEAMRKIKQLEDQTHQLQKQVATH-KQEEEALLNEMEVTGQAFEDMQE 342
K+ K+ A+EEA R +++ + + ++K+ K+E L NE E + + +
Sbjct: 26 KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFE---KELRERRN 82
Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
+ +L ++L +K E +L L + E++ L ++ + L + + +
Sbjct: 83 ELQKLEKRLLQK---------EENLDRKLELLEKREEE-LEKKEKELEQKQQ-------E 125
Query: 403 LEEKERFLQTVLTNAEKEL 421
LE+KE L+ ++ +EL
Sbjct: 126 LEKKEEELEELIEEQLQEL 144
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3 ubiquitin
ligase that has been shown to act as a transcriptional
activator through direct activator interactions.
Length = 96
Score = 40.0 bits (94), Expect = 2e-04
Identities = 22/96 (22%), Positives = 46/96 (47%)
Query: 326 LLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQ 385
L E+ +AF+++ + + +L + +L E+ K++Q + A KD L +
Sbjct: 1 LQKELVSLEKAFKELSSLALKKVAELTILEQKKLRLTAEKAKADQKYFAAMRSKDALDAE 60
Query: 386 VQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
+ L+ Q+ I +L++ E L+ + EK+L
Sbjct: 61 NKKLNTQLNKSSELIEQLKDLENELRRRIKELEKQL 96
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 41.9 bits (99), Expect = 3e-04
Identities = 29/133 (21%), Positives = 62/133 (46%), Gaps = 16/133 (12%)
Query: 282 VRDSKRDE----RKKLADEEAMR-KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
VR K E + L+D + R ++ +L+ + +LQ+++A +++ L E + Q
Sbjct: 43 VRTPKGREGWVLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQE 102
Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
++ + RL ++L +++ +N + EE LRE++ L + EA+
Sbjct: 103 LSTLEAELERLQKELAR---------IKQLSANAI--ELDEENRELREELAELKQENEAL 151
Query: 397 HTAIRKLEEKERF 409
+L+E E+
Sbjct: 152 EAENERLQENEQR 164
Score = 29.6 bits (67), Expect = 2.8
Identities = 14/80 (17%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 368 SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
+L +L +EE L+EQ+ L + + + + LE + LQ L ++ +
Sbjct: 72 QQELAEL-QEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130
Query: 428 MDLNKR--KAIESAQSAADL 445
+ N+ + + + +
Sbjct: 131 DEENRELREELAELKQENEA 150
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 43.2 bits (102), Expect = 3e-04
Identities = 27/162 (16%), Positives = 67/162 (41%), Gaps = 13/162 (8%)
Query: 282 VRDSKRDERKKLADEEA--MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED 339
++ + + +L + E+ + +K E++ +L++ ++ EE + E+ + E
Sbjct: 247 EKERLEELKARLLEIESLELEALKIREEELRELERL---LEELEEKIERLEELEREIEEL 303
Query: 340 MQEQNSRLLQQLREKDD--ANFKLMTERIK--SNQLHKLAREEKDTLREQVQALSLQIEA 395
+E+ L L E ++ K + ER++ +L KL E ++ E+ + L E
Sbjct: 304 -EEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEER 362
Query: 396 MHT---AIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRK 434
+ + +LE++ L E+ + + +
Sbjct: 363 LKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404
Score = 40.9 bits (96), Expect = 0.002
Identities = 33/155 (21%), Positives = 74/155 (47%), Gaps = 18/155 (11%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTH---QLQKQVATHKQEE---EALLNEMEVTGQAFE 338
+ E ++L EE KI++LE+ +L++++ + E LL +++ + E
Sbjct: 274 EELRELERLL-EELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLE 332
Query: 339 DMQEQN---SRLLQQLREKDDANFKLMTERIKSNQLH-KLAREEKDTLREQVQALSLQIE 394
++E+ L++L E+ + KL+ ER+K + + +E + E+++ L I+
Sbjct: 333 KLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQ 392
Query: 395 AMHT-------AIRKLEEKERFLQTVLTNAEKELH 422
+ A+ +++E+ L+ L E+EL
Sbjct: 393 ELKEELAELSAALEEIQEELEELEKELEELERELE 427
Score = 40.5 bits (95), Expect = 0.002
Identities = 30/163 (18%), Positives = 69/163 (42%), Gaps = 22/163 (13%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTH--QLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
D + + + ++K+++++++ +L++++ ++ L E E + +
Sbjct: 200 DIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLL 259
Query: 342 EQNSRLLQQL--REKDDANFKLMTERI--KSNQLHKLARE----------------EKDT 381
E S L+ L RE++ + + E + K +L +L RE E +
Sbjct: 260 EIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEE 319
Query: 382 LREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLR 424
L E++++L ++E + + KLE + L K L R
Sbjct: 320 LLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEER 362
Score = 38.6 bits (90), Expect = 0.008
Identities = 34/175 (19%), Positives = 76/175 (43%), Gaps = 15/175 (8%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVA-THKQEEEALLNEMEVTGQAFEDMQEQ 343
+ E ++ EE K+++LE+ +L++ Q+ + L ++E Q ++ +
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKE-LLE 576
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL---------QIE 394
RLL+ +E+ + + +K K E L E +Q+L L E
Sbjct: 577 ELRLLRTRKEE-LEELRERLKELKKKL--KELEERLSQLEELLQSLELSEAENELEEAEE 633
Query: 395 AMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
+ + + KL + L+ +L A +EL + + ++ R+ ++ ++ L+ L
Sbjct: 634 ELESELEKLNLQAE-LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL 687
Score = 38.2 bits (89), Expect = 0.012
Identities = 34/180 (18%), Positives = 74/180 (41%), Gaps = 25/180 (13%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
+E + A EE K+++LE + + +++ +Q EE L ++ + E ++E+ L
Sbjct: 649 EELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEE-LEQLREELEEL 707
Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK- 406
L++L E + E ++S + E + L+++++ L +E + +R+ K
Sbjct: 708 LKKLGEIEQLI-----EELESR------KAELEELKKELEKLEKALE-LLEELREKLGKA 755
Query: 407 -------ERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPAR 459
L + A + L + LN+ +V++Y G R
Sbjct: 756 GLRADILRNLLAQIEAEANEIL----SKLSLNRYDLRRLTIRKDGNGGLVVVVYDGGEVR 811
Score = 35.9 bits (83), Expect = 0.054
Identities = 36/166 (21%), Positives = 82/166 (49%), Gaps = 9/166 (5%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+R+ ++ +KKL + E + Q+ +L + ++ EE L +E+E +++
Sbjct: 591 LRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELE-KLNLQAELE 649
Query: 342 EQNSRLLQQLREKD---DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
E L++L EK +A + +RI++ + + EE + L E+++ L ++E
Sbjct: 650 ELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELE---E 706
Query: 399 AIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
++KL E E+ ++ L + + EL + ++ + KA+E + +
Sbjct: 707 LLKKLGEIEQLIEE-LESRKAELEELKKELEKLE-KALELLEELRE 750
Score = 35.9 bits (83), Expect = 0.064
Identities = 26/160 (16%), Positives = 71/160 (44%), Gaps = 13/160 (8%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
+ + + L +E +++L ++ L++++ +++ E L +E+E + + +
Sbjct: 305 EELEGLRALLEE-----LEELLEKLKSLEERLEKLEEKLEKLESELE----ELAEEKNEL 355
Query: 345 SRLLQQL-REKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
++LL++ +E ++ +L E K+ + K E ++E + L+ A+ +L
Sbjct: 356 AKLLEERLKELEERLEELEKELEKALERLKQLEEA---IQELKEELAELSAALEEIQEEL 412
Query: 404 EEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
EE E+ L+ + E+ + + + + A
Sbjct: 413 EELEKELEELERELEELEEEIKKLEEQINQLESKELMIAE 452
Score = 35.1 bits (81), Expect = 0.090
Identities = 38/173 (21%), Positives = 74/173 (42%), Gaps = 23/173 (13%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATH-------KQEEEALLNEMEVTGQAFE 338
R++ + EE K+L + +L + + +++ E L N +E + E
Sbjct: 491 SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKE 550
Query: 339 DMQEQN--------SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS 390
+Q Q LQ+L+E + +L R + +L +L RE L+++++ L
Sbjct: 551 KLQLQQLKEELRQLEDRLQELKELLE---ELRLLRTRKEELEEL-RERLKELKKKLKELE 606
Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
++ + ++ LE E + L AE+EL + LN + +E AA
Sbjct: 607 ERLSQLEELLQSLELSEAENE--LEEAEEELESELEK--LNLQAELEELLQAA 655
Score = 33.6 bits (77), Expect = 0.30
Identities = 29/170 (17%), Positives = 64/170 (37%), Gaps = 19/170 (11%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ-- 343
+ +E + EE LE+ +L++ +E E L E+E + E+ Q
Sbjct: 386 QLEEAIQELKEELAELSAALEEIQEELEELEKEL-EELERELEELEEEIKKLEEQINQLE 444
Query: 344 -NSRLLQQLR---------------EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQ 387
++ +L E + +L ++ + +E+ LRE+++
Sbjct: 445 SKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIE 504
Query: 388 ALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIE 437
L ++ + + +L E E L+ L ++L + ++ K K
Sbjct: 505 ELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 42.3 bits (99), Expect = 6e-04
Identities = 40/172 (23%), Positives = 68/172 (39%), Gaps = 20/172 (11%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
+ K +E + +++ E++ ++LQ + T + + L E+E A M+ + R
Sbjct: 234 DEKDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAA 293
Query: 349 QQLREKDDANFKLMTE-RIKSNQ-------LHKL------AREEKDTLREQVQALSLQIE 394
Q+L K L TE +NQ L KL A +EK L+ +V AL ++E
Sbjct: 294 QELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELE 353
Query: 395 AMHTAIRKLEEKERFLQTVLTNAEKELH-----LRNQAMDLNK-RKAIESAQ 440
H + K + Q E+ L +K IE+ Q
Sbjct: 354 RKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQ 405
Score = 31.9 bits (72), Expect = 1.0
Identities = 34/156 (21%), Positives = 63/156 (40%), Gaps = 15/156 (9%)
Query: 300 RKIKQLEDQTHQLQ---KQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
RK L +T LQ ++ AT+ E E + + E T + +Q++ L + R K+
Sbjct: 354 RKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKER 413
Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR-----KLEEKERF-- 409
K ER++S Q + L + + IE + + EE E +
Sbjct: 414 -RLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKK 472
Query: 410 ----LQTVLTNAEKELHLRNQAMDLNKRKAIESAQS 441
L+ + N + +L R ++L K + + A +
Sbjct: 473 EFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASN 508
Score = 31.5 bits (71), Expect = 1.1
Identities = 31/142 (21%), Positives = 60/142 (42%), Gaps = 7/142 (4%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL------LNEMEVTGQAFEDMQEQNSRLLQ 349
E ++ L D +LQ ++ +Q +A L EM + +E++S L
Sbjct: 124 ERQAMELFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERLG 183
Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
++RE +D L + + H + REE + Q++ + EA I + + K +
Sbjct: 184 RIREAEDQQSHLEVLLEQKEKEHMMLREEIHQ-KLQMERDDAKTEASQKLIDEKDTKIKE 242
Query: 410 LQTVLTNAEKELHLRNQAMDLN 431
+ +L AE E++ D +
Sbjct: 243 FEKMLEKAENEIYRLQSRCDTS 264
Score = 31.1 bits (70), Expect = 1.7
Identities = 30/160 (18%), Positives = 66/160 (41%), Gaps = 19/160 (11%)
Query: 277 KFACVVRDSKRDERKKL--------ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN 328
+ A + + R++ +KL A+ E+ + ++ T + + A + E + LL+
Sbjct: 517 ERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRLLD 576
Query: 329 EMEVTGQAFEDMQEQNSRLLQQL---------REKDDANFKLMTERIKSNQLHKLAREEK 379
+E Q +D + + RL ++L +++ ++ ER + L + R EK
Sbjct: 577 RLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERLEK 636
Query: 380 DTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
E+ S QI A E +Q + ++ ++
Sbjct: 637 SEAEEETLRQSTQIGHAQAAAHNHIEHH--VQKLESDLKQ 674
Score = 30.8 bits (69), Expect = 1.9
Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 9/128 (7%)
Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 372
+ Q + + E E E+ + E +Q + RL Q L +D+ KL+
Sbjct: 113 EGQFSRIQAERERQAMELFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSA 172
Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ-AMDLN 431
K E+ + L +A Q LE+KE+ +E+H + Q D
Sbjct: 173 KAEEEDSERLGRIREAEDQQSHLEVL----LEQKEKEHMM----LREEIHQKLQMERDDA 224
Query: 432 KRKAIESA 439
K +A +
Sbjct: 225 KTEASQKL 232
>gnl|CDD|131391 TIGR02338, gimC_beta, prefoldin, beta subunit, archaeal.
Chaperonins are cytosolic, ATP-dependent molecular
chaperones, with a conserved toroidal architecture, that
assist in the folding of nascent and/or denatured
polypeptide chains. The group I chaperonin system
consists of GroEL and GroES, and is found (usually) in
bacteria and organelles of bacterial origin. The group
II chaperonin system, called the thermosome in Archaea
and TRiC or CCT in the Eukaryota, is structurally
similar but only distantly related. Prefoldin, also
called GimC, is a complex in Archaea and Eukaryota, that
works with group II chaperonins. Members of this protein
family are the archaeal clade of the beta class of
prefoldin subunit. Closely related, but outside the
scope of this family are the eukaryotic beta-class
prefoldin subunits, Gim-1,3,4 and 6. The alpha class
prefoldin subunits are more distantly related.
Length = 110
Score = 38.5 bits (90), Expect = 0.001
Identities = 32/122 (26%), Positives = 53/122 (43%), Gaps = 19/122 (15%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
+ ++ QL+ QLQ VAT KQ+ EA L E E + L++L D
Sbjct: 4 QVQNQLAQLQQLQQQLQA-VATQKQQVEAQLKEAE--------------KALEELERLPD 48
Query: 357 ANFKLMTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
+ N L K +EE L+E+ + L L+++ + +L E+ + LQ +
Sbjct: 49 ---DTPVYKSVGNLLVKTDKEEAIQELKEKKETLELRVKTLQRQEERLREQLKELQEKIQ 105
Query: 416 NA 417
A
Sbjct: 106 EA 107
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 41.4 bits (97), Expect = 0.001
Identities = 27/151 (17%), Positives = 73/151 (48%), Gaps = 15/151 (9%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
+ LA EE R++K L+ Q + + + ++E ++L E + FE+ + L+
Sbjct: 555 KHDLAREEFQRRVKLLQLQKEKQLEDIQDCREERKSLSERAEKLAEKFEEAKYNQELLVN 614
Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
+ + +L+ + ++QL L+ E+D + +++Q ++ Q++ + I+++++K+ +
Sbjct: 615 RCK-------RLL--QSANSQLPVLSDSERD-MSKELQRINKQLQHLANGIKQVKKKKNY 664
Query: 410 LQTVLTNAEKELHLRNQAMDLN--KRKAIES 438
+ + + + + L + K I
Sbjct: 665 QRYHM---ASQESPKKSSYTLPEKQHKTITE 692
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 38.8 bits (91), Expect = 0.001
Identities = 24/111 (21%), Positives = 52/111 (46%), Gaps = 17/111 (15%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+R E+ +EA ++KQ + ++Q+A + E ++N + Q
Sbjct: 30 ERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQ---------- 79
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDT---LREQVQALSLQI 393
+L +++ +A ER+ + ++ +E++ LR+QV AL++QI
Sbjct: 80 KLKEEIL--AEAQKDA--ERLLESARAEIEQEKEQALAELRQQVAALAVQI 126
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 40.1 bits (94), Expect = 0.001
Identities = 30/147 (20%), Positives = 65/147 (44%), Gaps = 16/147 (10%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
EEA R+ ++LE++ Q+++ + ++E L E E T E+ +Q Q L EK
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKE----LEEYEETALELEEKLKQEEEEAQLL-EKK 55
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI-RKLEEKERFLQTVL 414
+ R++ EE++ L +V + ++ + +K E +
Sbjct: 56 ADELEEENRRLEEEAAAS--EEERERLEAEVDEATAEVAKLEEEREKKEAETRQL----- 108
Query: 415 TNAEKELHLRNQAMDLNKRKAIESAQS 441
++EL +A + +++ +E+A +
Sbjct: 109 ---QQELREAQEAHERARQELLEAAAA 132
Score = 36.6 bits (85), Expect = 0.021
Identities = 30/168 (17%), Positives = 60/168 (35%), Gaps = 7/168 (4%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
R + EE +LE++ Q +++ +++ + L E + +E+ RL
Sbjct: 22 RRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLE 81
Query: 349 QQLREKDDANFKLMTERIK----SNQLHKLAREEKDTLREQVQALSLQIEAMHTA---IR 401
++ E KL ER K + QL + RE ++ Q L A
Sbjct: 82 AEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLEAAAAPTAPPHVAA 141
Query: 402 KLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
+ ++ A +L D ++ + + A+ L+ L
Sbjct: 142 PVNGEQLEPDDNGEEASADLETDPDMKDRSEEERVTYAEKNERLQTQL 189
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from
laminin A, B1 and B2 may come together to form a triple
helical coiled-coil structure.
Length = 263
Score = 40.1 bits (94), Expect = 0.002
Identities = 29/142 (20%), Positives = 53/142 (37%), Gaps = 21/142 (14%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
+ IK L D ++ +++A + N + G A M + R+L ++RE+D
Sbjct: 89 KTLAEAIKNLRDNIKEILEKLAGLGE------NAFALPGSALSRMLAEAQRMLGEIRERD 142
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR--------KLEEKE 407
+ + K A E + ++ Q + E + A+R KL +
Sbjct: 143 -----FQKQLQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLR 197
Query: 408 RFLQTVL--TNAEKELHLRNQA 427
L T + L+L NQ
Sbjct: 198 ELLDEAQAKTRDAERLNLANQK 219
Score = 36.6 bits (85), Expect = 0.019
Identities = 30/148 (20%), Positives = 50/148 (33%), Gaps = 32/148 (21%)
Query: 297 EAMRKI-KQLEDQTHQLQ---KQVATHKQEEEALLNEMEVTGQAFEDMQEQ---NSRLLQ 349
A KI +E+ T QL ++ K++ E L + Q E++Q++ Q
Sbjct: 16 PAPYKILYNIENLTQQLSGRLSPISATKRQLEDLEKALSKLAQDAENLQKKANQALAKAQ 75
Query: 350 QLREKDDANFKLMTERIKS----------------NQLHKLAREEKDTLREQVQALSLQI 393
++ A + K+ +L L + + +
Sbjct: 76 KVN----AAIERTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEA 131
Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKEL 421
+ M IR ER Q L NAE EL
Sbjct: 132 QRMLGEIR-----ERDFQKQLQNAEAEL 154
>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968). This
family of proteins has no known function.
Length = 192
Score = 39.3 bits (92), Expect = 0.002
Identities = 31/114 (27%), Positives = 45/114 (39%), Gaps = 34/114 (29%)
Query: 293 LADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ--NSRLLQQ 350
LAD E R QLE Q QL++Q+A + E L ++++ QEQ +R Q
Sbjct: 100 LADVEIRRA--QLEAQKAQLERQIAASEARAERLQADLQLAQA-----QEQQVAARQQQ- 151
Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
AR+E L Q QA Q+ + IR+L+
Sbjct: 152 ------------------------ARQEAVALEAQRQAAQAQLRKLQRQIRQLQ 181
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 40.3 bits (95), Expect = 0.002
Identities = 27/128 (21%), Positives = 56/128 (43%), Gaps = 13/128 (10%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQ-LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
++ +E ++ +E R+ + L++Q + +K+ ++E E L E E + E +
Sbjct: 52 LKALAEEEERERKRKEERREGRAVLQEQIEEREKR---RQEEYEERLQEREQMDEIIERI 108
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
QE++ Q+ REK KL E + N+ +E E+ + + +
Sbjct: 109 QEEDEAEAQEKREKQK---KLREEIDEFNEERIERKE------EEKEREREEELKILEYQ 159
Query: 401 RKLEEKER 408
R+ E+E
Sbjct: 160 REKAEREE 167
Score = 36.4 bits (85), Expect = 0.028
Identities = 24/127 (18%), Positives = 50/127 (39%), Gaps = 18/127 (14%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+R ER+K EEA ++ +Q ++ ++Q+ ++ + E E E M E+ +
Sbjct: 214 ERKERQKEK-EEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEE---AERERMLEKQA 269
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLA----REEKDTLREQVQALSLQIEAMHTAIR 401
+ +E +R H+ EEK+ R + L+ +R
Sbjct: 270 EDEELEQENA-------EKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGER---LR 319
Query: 402 KLEEKER 408
+ E + +
Sbjct: 320 EEEAERQ 326
Score = 36.4 bits (85), Expect = 0.028
Identities = 27/141 (19%), Positives = 57/141 (40%), Gaps = 9/141 (6%)
Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD-DANFKL 361
K+ + Q + ++ A K+EE + ME +E+ R ++ R + +
Sbjct: 20 KERDAQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQ 79
Query: 362 MTERIKSNQLHKLAREEKDTLREQVQALSLQI--EAMHTAIRKLEEKERFLQTVL-TNAE 418
+ ER + + EE+ REQ+ + +I E A K E++++ + + N E
Sbjct: 80 IEERE---KRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEE 136
Query: 419 KELH--LRNQAMDLNKRKAIE 437
+ + + K +E
Sbjct: 137 RIERKEEEKEREREEELKILE 157
Score = 34.9 bits (81), Expect = 0.092
Identities = 23/121 (19%), Positives = 49/121 (40%), Gaps = 8/121 (6%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-EDMQEQNSRL 347
ER+K EE ++ + Q+ + + ++E+EA E + E++ E N
Sbjct: 82 EREKRRQEEY----EERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFN-EE 136
Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKE 407
+ +E++ + +I Q K REE+ + + + E +R +E+
Sbjct: 137 RIERKEEEKEREREEELKILEYQREKAEREEE--REAERRERKEEKEREVARLRAQQEEA 194
Query: 408 R 408
Sbjct: 195 E 195
Score = 34.1 bits (79), Expect = 0.14
Identities = 25/150 (16%), Positives = 63/150 (42%), Gaps = 12/150 (8%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
ER++ + + + ++ ++ + + + K+E+E + + + ED +E+ L
Sbjct: 146 EREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELR 205
Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQV-QALSLQIEA-MHTAIRKLEEK 406
L +++ ER K Q K E++ ++++ +A QIE + E+
Sbjct: 206 ADLYQEEY-------ER-KERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257
Query: 407 ERFLQTVLTNAEKELHLRNQAMDLNKRKAI 436
E + +L ++ + + KR+
Sbjct: 258 EAERERML--EKQAEDEELEQENAEKRRMK 285
Score = 33.3 bits (77), Expect = 0.26
Identities = 26/156 (16%), Positives = 61/156 (39%), Gaps = 11/156 (7%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
++E + A E+ ++ K E+ ++++ ++E+E E ++ + +
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKI-LEYQREKAEREE 167
Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLA-REEKDTLREQVQALSLQIEAMHTAIRKLEE 405
+ R + + R+++ Q REE D LR L E R+ E+
Sbjct: 168 EREAERRERKEEKEREVARLRAQQEEAEDEREELDELRAD-----LYQEEYERKERQKEK 222
Query: 406 KERFLQTVLTNAEKELHL-RNQAMDLNKRKAIESAQ 440
+E + ++EL R + ++ + + E
Sbjct: 223 EEAEKR---RRQKQELQRAREEQIEEKEERLQEERA 255
Score = 29.1 bits (66), Expect = 6.0
Identities = 10/74 (13%), Positives = 32/74 (43%), Gaps = 2/74 (2%)
Query: 281 VVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
++ DE + E A ++ + + +L++Q+ ++ A E G+ +
Sbjct: 264 MLEKQAEDEELEQ--ENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREE 321
Query: 341 QEQNSRLLQQLREK 354
+ + +++ R++
Sbjct: 322 EAERQARIEEERQR 335
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 40.4 bits (95), Expect = 0.002
Identities = 25/124 (20%), Positives = 54/124 (43%), Gaps = 2/124 (1%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+RD + R +A + + + L + L+++ ++E L +E+E +A ED +
Sbjct: 326 LRDRLEECR--VAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRR 383
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
E+ L +++ E + + + + REE+D LRE+ L + +
Sbjct: 384 EEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVE 443
Query: 402 KLEE 405
+ E
Sbjct: 444 EAEA 447
Score = 36.2 bits (84), Expect = 0.046
Identities = 31/129 (24%), Positives = 61/129 (47%), Gaps = 20/129 (15%)
Query: 283 RDSKRDERKKLADEEAMRK--IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
+ R++R+ LA+ R+ + + ++ +L+ + +EA + E + E+
Sbjct: 608 IERLREKREALAELNDERRERLAEKRERKRELEAEF------DEARIEEAREDKERAEEY 661
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLA--REEKDTLREQVQALSLQIEAM 396
EQ L +LRE+ D + I + N+L +L RE ++ L +V+AL EA+
Sbjct: 662 LEQVEEKLDELREERDD----LQAEIGAVENELEELEELRERREALENRVEAL----EAL 713
Query: 397 HTAIRKLEE 405
+ +LE
Sbjct: 714 YDEAEELES 722
Score = 31.5 bits (72), Expect = 1.3
Identities = 36/195 (18%), Positives = 68/195 (34%), Gaps = 48/195 (24%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+ER + EEA +LE + + ++ V ++E E L E+E + F D
Sbjct: 353 DLEERAEELREEA----AELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLG 408
Query: 346 RL---LQQLRE-KDDANFKLMT---------ERIKSNQLHKLA----------------- 375
L++LRE +D+ + ER++ + A
Sbjct: 409 NAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVE 468
Query: 376 -----REEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH-LRNQAMD 429
RE + L +++ L ++E + + + E+ L AE + L + D
Sbjct: 469 TIEEDRERVEELEAELEDLEEEVEEVEERLERAED--------LVEAEDRIERLEERRED 520
Query: 430 LNKRKAIESAQSAAD 444
L + A
Sbjct: 521 LEELIAERRETIEEK 535
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 39.9 bits (94), Expect = 0.002
Identities = 34/170 (20%), Positives = 76/170 (44%), Gaps = 35/170 (20%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVT----------------- 333
+K DE + ++ LE + LQ+++A K+ E + E++
Sbjct: 109 RKDLDEATLARV-DLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDAARKLDL 167
Query: 334 GQAFEDMQEQNSRLLQQLREKDDANFKLMTERI-----KSNQLHKLAREEKDTLREQVQA 388
+A +++ Q L ++ R++ + +K E + ++ + A+EE LR Q+Q+
Sbjct: 168 TKALAEIRAQYEELAEKNRQEAEEWYKSKLEELQQAAARNGDALRSAKEEITELRRQIQS 227
Query: 389 LSLQIEAMHTAIRKLEE-----KERF------LQTVLTNAEKEL-HLRNQ 426
L ++++++ LE +ER+ Q ++ E+EL L+ +
Sbjct: 228 LEIELQSLKKQKASLERQLAELEERYELELADYQDTISELEEELQQLKAE 277
>gnl|CDD|177131 MTH00059, ND2, NADH dehydrogenase subunit 2; Provisional.
Length = 289
Score = 39.1 bits (92), Expect = 0.003
Identities = 28/117 (23%), Positives = 47/117 (40%), Gaps = 16/117 (13%)
Query: 118 FGIFRLSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFV 177
FG+F +VY FS +SS V ++ + +F ++ CF + S VFV
Sbjct: 96 FGLFPFM--FWVYRVFS----NSSWVVIWLISVVMKFP----VLFFCFFLQGGGLSLVFV 145
Query: 178 YHCFSIFRFSSSTVFVY-----HCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCF 229
+I F + Y + + SSS V CF + +++Y+ F
Sbjct: 146 DCFLTIL-FCGFLFWFYSLNWEYVWCHISLSSSATLVVACFCLSIDLCFFLYLYYFF 201
Score = 34.1 bits (79), Expect = 0.12
Identities = 28/114 (24%), Positives = 45/114 (39%), Gaps = 18/114 (15%)
Query: 133 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 192
F +F F +VY FS +SS V ++ + +F ++ CF + S VF
Sbjct: 96 FGLFPFM---FWVYRVFS----NSSWVVIWLISVVMKFP----VLFFCFFLQGGGLSLVF 144
Query: 193 VYHCFSIFRFSSSTVFVY-----HCFSIFRFSSSTIFVYHCFDI-FSLTSTLFV 240
V +I F + Y + + SSS V CF + L L++
Sbjct: 145 VDCFLTIL-FCGFLFWFYSLNWEYVWCHISLSSSATLVVACFCLSIDLCFFLYL 197
Score = 30.3 bits (69), Expect = 2.2
Identities = 27/110 (24%), Positives = 44/110 (40%), Gaps = 17/110 (15%)
Query: 149 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 208
F +F F +VY FS +SS V ++ + +F ++ CF + S VF
Sbjct: 96 FGLFPFM---FWVYRVFS----NSSWVVIWLISVVMKFP----VLFFCFFLQGGGLSLVF 144
Query: 209 VYHCFSI------FRFSSSTIFVYHCFDIFSLTSTLFVSICTSAFQMSFY 252
V +I F F S C S ++TL V+ + + F+
Sbjct: 145 VDCFLTILFCGFLFWFYSLNWEYVWCHISLSSSATLVVACFCLSIDLCFF 194
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 39.7 bits (93), Expect = 0.003
Identities = 34/209 (16%), Positives = 76/209 (36%), Gaps = 29/209 (13%)
Query: 265 SASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEE 324
AS + ++ F+ D + + +K +KI++ +DQ +L+KQ+ + + E
Sbjct: 18 LASLLSAAVLAAAFSAAADDKQLKQIQK-EIAALEKKIREQQDQRAKLEKQLKSLETEIA 76
Query: 325 AL---LNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERI--------------- 366
+L L E + + L L ++ + + E++
Sbjct: 77 SLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALL 136
Query: 367 -------KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
+S +L E++ AL ++ + ++ ++ L T+L+
Sbjct: 137 VSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRA 196
Query: 420 ELHLRNQAMDLNKRKAIESAQSAADLKLH 448
+ A L +RK AQ ++L
Sbjct: 197 Q--QAKLAQLLEERKKTL-AQLNSELSAD 222
Score = 30.1 bits (68), Expect = 3.5
Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 287 RDERKKLAD--EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEME 331
R ++ KLA EE + + QL + QK++ + E L NE+
Sbjct: 195 RAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIA 241
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 39.3 bits (92), Expect = 0.003
Identities = 14/51 (27%), Positives = 23/51 (45%)
Query: 304 QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK 354
L +Q LQK + KQE E +L + Q ++ L +L++K
Sbjct: 71 ALANQLTALQKAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQQK 121
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 39.3 bits (92), Expect = 0.004
Identities = 31/175 (17%), Positives = 60/175 (34%), Gaps = 24/175 (13%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
R +K + +I+ LE + + ++ QE L NE+E E
Sbjct: 195 LRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELE----------EAQR 244
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQ----------ALSLQIEA 395
L ++ L ER + + K + R Q++ + +++
Sbjct: 245 SLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDS 304
Query: 396 MHTAIRKLEEKERFLQT--VLTNAEKEL--HLRNQAMDLNKRKAIESAQSAADLK 446
++K E+ ++ T L +KEL L A+ K I + + D
Sbjct: 305 TKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKEIAAELAEIDKP 359
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 38.5 bits (90), Expect = 0.005
Identities = 35/165 (21%), Positives = 72/165 (43%), Gaps = 17/165 (10%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
+ K+ +LE + QL++++ K++ + L E + +++ + L ++ +E
Sbjct: 2 LAMLDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELR 61
Query: 356 DANFKLMTE-----------RIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
+ ++ E K +L K RE K+ E I+++ I +LE
Sbjct: 62 EERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGG-RSIKSLEREIERLE 120
Query: 405 EKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
+K+ QT + E+E L + +L RK +E A+ A + L
Sbjct: 121 KKQ---QTSVLTPEEERELVQKIKEL--RKELEDAKKALEENEKL 160
Score = 34.3 bits (79), Expect = 0.13
Identities = 29/154 (18%), Positives = 65/154 (42%), Gaps = 30/154 (19%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
RK+L E +K + ++ +L+ ++ K++ + +++ + + QE + +++
Sbjct: 144 RKEL---EDAKKALEENEKLKELKAEIDELKKKAREIHEKIQ---ELANEAQEYHEEMIK 197
Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
E D+ R+E D L E+ LS +I+ +H R L+ + R
Sbjct: 198 LFEEADEL------------------RKEADELHEEFVELSKKIDELHEEFRNLQNELRE 239
Query: 410 LQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
L+ + K L + + +R+ E + A
Sbjct: 240 LEKKI----KALRAKEK--AAKRREKREELKERA 267
Score = 31.2 bits (71), Expect = 1.3
Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 8/89 (8%)
Query: 285 SKRDERKKLADE------EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
+ DE +K ADE E +KI +L ++ LQ ++ +++ +AL + + + +
Sbjct: 200 EEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREK 259
Query: 339 DMQEQNSRLLQQLREKDDANFKLMTERIK 367
E+ +++ EK KL TE +
Sbjct: 260 R--EELKERAEEIYEKFKRGEKLTTEELL 286
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 38.9 bits (91), Expect = 0.007
Identities = 30/150 (20%), Positives = 66/150 (44%), Gaps = 18/150 (12%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
EA+ K+K+ E + + ++E ++ + E ++E+NS L ++L E
Sbjct: 391 AEALSKVKEEERPREKEGTE-EEERREITVYEKRIKKLEETVERLEEENSELKRELEELK 449
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
KL ++L + RE +D +R+ +I A I +LE++ + +
Sbjct: 450 REIEKL------ESELERFRREVRDKVRK-----DREIRARDRRIERLEKELEEKKKRVE 498
Query: 416 NAEKELH--LRNQAMDLNKR----KAIESA 439
E++L + + ++L+ + K +E
Sbjct: 499 ELERKLAELRKMRKLELSGKGTPVKVVEKL 528
Score = 30.1 bits (68), Expect = 3.3
Identities = 24/123 (19%), Positives = 56/123 (45%), Gaps = 14/123 (11%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEA-LLNEMEVTGQAFEDMQEQN 344
+ ER + + E R++++L+ + +L+ ++ ++E + + E+ +
Sbjct: 429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREI-----RARDRRI 483
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH-TAIRKL 403
RL ++L EK K E ++ +L +L + K L + + +E + AI +
Sbjct: 484 ERLEKELEEK-----KKRVEELE-RKLAELRKMRKLELSGKGTPVK-VVEKLTLEAIEEA 536
Query: 404 EEK 406
EE+
Sbjct: 537 EEE 539
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 38.6 bits (89), Expect = 0.008
Identities = 47/167 (28%), Positives = 79/167 (47%), Gaps = 12/167 (7%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
K DE KK A EEA +K + + + +K K E EA +E E A E+ E
Sbjct: 1316 KADEAKKKA-EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE----AAEEKAEAAE 1370
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
+ ++ ++K DA K E+ K+++ K A E+K E +A + + +A A +K EE
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA-DEAKKKAEE 1429
Query: 406 KERF--LQTVLTNAEKELHLRNQAMDLNK----RKAIESAQSAADLK 446
K++ + A+K + +A + K +K E A+ A + K
Sbjct: 1430 KKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476
Score = 37.0 bits (85), Expect = 0.026
Identities = 33/174 (18%), Positives = 68/174 (39%), Gaps = 21/174 (12%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+++ +E KL +EE K ++ + K K EEE E + +
Sbjct: 1589 AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ------LKKKE 1642
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
+ + ++L++ ++ N +IK+ + K A E+K E +A + +A +
Sbjct: 1643 AEEKKKAEELKKAEEEN------KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Query: 402 KLEEKERFLQTVLTNAE---------KELHLRNQAMDLNKRKAIESAQSAADLK 446
+ EE ++ + AE K + K++A E + A + K
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
Score = 33.2 bits (75), Expect = 0.49
Identities = 38/163 (23%), Positives = 73/163 (44%), Gaps = 9/163 (5%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
K +E KK +++ + K+ E+ + ++ K+ EEA + E+ + + M+ + +
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
+ ++ + K + K E+ K QL K EEK +A L+ I+ EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK------KAEELKKAEEENKIKAAEE 1666
Query: 406 --KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
K + A+K +A + K++A E A+ A +LK
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA-EEAKKAEELK 1708
Score = 31.3 bits (70), Expect = 1.7
Identities = 27/138 (19%), Positives = 63/138 (45%), Gaps = 4/138 (2%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED---MQE 342
K+ E+ K + E +K ++L+ + + + A ++ E + E +A ED E
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
+ ++ ++ ++ K E+ K+ +L K A EE E+ + + + + +K
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKKKAEELKK-AEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Query: 403 LEEKERFLQTVLTNAEKE 420
EE+++ + + EK+
Sbjct: 1752 DEEEKKKIAHLKKEEEKK 1769
Score = 30.5 bits (68), Expect = 2.9
Identities = 37/166 (22%), Positives = 75/166 (45%), Gaps = 15/166 (9%)
Query: 286 KRDERKKLADE-----EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
K DE KK A+E EA +K ++ + + + +K K+ +EA +A E
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA--------KKAEEAK 1522
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
+ ++ ++ ++ D+A K E+ K+++L K +K +++ + E + A+
Sbjct: 1523 KADEAKKAEEAKKADEA--KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580
Query: 401 RKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
RK EE ++ + + K + +KA E+ A +LK
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626
Score = 29.7 bits (66), Expect = 5.6
Identities = 32/161 (19%), Positives = 68/161 (42%), Gaps = 8/161 (4%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
++ E + EEA +K + + + +K K+ EE E+ A + +
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
+ + ++K D K E K+++ K A E K E+ + + + + A +K EE
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA--EEAKKKAEEAKKADEAKKKAEE 1481
Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
++ + +K + +A + +KA E+ + A + K
Sbjct: 1482 AKKADEA----KKKAEEAKKKADEA--KKAAEAKKKADEAK 1516
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 37.3 bits (87), Expect = 0.010
Identities = 36/196 (18%), Positives = 60/196 (30%), Gaps = 17/196 (8%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
EEA I +LE+Q +L KQ + L + + G+A + + + +L E
Sbjct: 7 EEAKDYILKLEEQLKKLSKQAQRLVKRRRELGSALGEFGKALIKLAKCEEEVGGELGEAL 66
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQ-------VQALSLQIEAMHTAIRKLEEKER 408
K E A +E L E QA+ ++ A+ L+ ++
Sbjct: 67 SKLGKAAEELSSL--SEAQANQELVKLLEPLKEYLRYCQAVKETLDDRADALLTLQSLKK 124
Query: 409 FLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPA-------RLE 461
L + EK + E + A Y R
Sbjct: 125 DLASKKAQLEKLKAAPGIKPA-KVEELEEELEEAESALEEARKRYEEISERLKEELKRFH 183
Query: 462 FGKDREFSAKFVNFRR 477
+ R+ A F R
Sbjct: 184 EERARDLKAALKEFAR 199
>gnl|CDD|218452 pfam05130, FlgN, FlgN protein. This family includes the FlgN
protein and export chaperone involved in flagellar
synthesis.
Length = 140
Score = 36.2 bits (84), Expect = 0.011
Identities = 15/75 (20%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
+E + +++QLE++ QL ++ L + + ++ ++ LL++L+E +
Sbjct: 43 QELLAQLEQLEEERQQLLAELGL-SPGPAGLSELLAAEDEELRELWQELKELLEELQELN 101
Query: 356 DANFKLMTERIKSNQ 370
+ N +L+ + ++ NQ
Sbjct: 102 ERNGRLLEQSLEFNQ 116
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 37.9 bits (89), Expect = 0.011
Identities = 32/166 (19%), Positives = 64/166 (38%), Gaps = 39/166 (23%)
Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHK-------QEEEALLNEMEVTGQAFEDMQE----- 342
+E+ +++QL+D +L+K + ++ E E L +E F ++ E
Sbjct: 135 EEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQFVELTESGDYV 194
Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL----------- 391
+ +L QL E+ A + + E I +L +E + L +Q+Q L
Sbjct: 195 EAREILDQLEEELAA-LEQIMEEIP-----ELLKELQTELPDQLQELKAGYRELVEEGYH 248
Query: 392 --------QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
+I+ + I + L L AE++ + +D
Sbjct: 249 LDHLDIEKEIQDLKEQIDENLALLEELD--LDEAEEKNEEIQERID 292
Score = 32.1 bits (74), Expect = 0.76
Identities = 38/148 (25%), Positives = 66/148 (44%), Gaps = 31/148 (20%)
Query: 295 DEEAMRKIKQLEDQTHQLQKQV-------ATHKQ---EEEALLNEMEVTGQAFEDMQEQN 344
+E + ++QLE Q L+KQ A + E + L E+ + E QE+
Sbjct: 343 NESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKL 402
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAR---------------EEKDTLREQVQAL 389
S +LQ LR KD+ + ER + N+LH++ R E + ++++AL
Sbjct: 403 SEMLQGLR-KDELEAREKLERYR-NKLHEIKRYLEKSNLPGLPEDYLEMFFEVSDEIEAL 460
Query: 390 SLQIEA----MHTAIRKLEEKERFLQTV 413
+ ++E M R LEE ++T+
Sbjct: 461 AEELEEKPINMEAVNRLLEEATEDVETL 488
Score = 31.7 bits (73), Expect = 1.0
Identities = 27/146 (18%), Positives = 60/146 (41%), Gaps = 31/146 (21%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL------LQQ---L 351
+I QL D L+++V K E+ + E +EQN L ++Q L
Sbjct: 290 RIDQLYDI---LEREVKARKYVEK----NSDTLPDFLEHAKEQNKELKEEIDRVKQSYTL 342
Query: 352 REKDDANFKLMTERIKS--NQLHKLA-------------REEKDTLREQVQALSLQIEAM 396
E + + + + ++++S Q ++ +EE + + +Q++ + + E +
Sbjct: 343 NESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKL 402
Query: 397 HTAIRKLEEKERFLQTVLTNAEKELH 422
++ L + E + L +LH
Sbjct: 403 SEMLQGLRKDELEAREKLERYRNKLH 428
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 37.8 bits (88), Expect = 0.015
Identities = 27/138 (19%), Positives = 52/138 (37%), Gaps = 3/138 (2%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
KR A E+ +I++ E + + + + ALL T +
Sbjct: 799 RLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVE 858
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
S ++LR++ A + + L L E +++ A +E + I +L
Sbjct: 859 RSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAALDPDELPAR---LEELARDIEEL 915
Query: 404 EEKERFLQTVLTNAEKEL 421
EE+ L + A++EL
Sbjct: 916 EEELNELAQEVGAAKQEL 933
Score = 32.4 bits (74), Expect = 0.63
Identities = 28/157 (17%), Positives = 61/157 (38%), Gaps = 22/157 (14%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
++ ++ + + E++ L+K+ ++ E +E + + E+ + L QQL
Sbjct: 177 KDLVKALDEAEEELANLRKE----LRQLEKEKQRLERLRRLLPLLAERKA-LEQQLAALG 231
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ---- 411
+ + AR E R ++ L+ ++EA+ + ++ E L
Sbjct: 232 EVI--DLPPDAVERYEE--ARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAA 287
Query: 412 -TVL-------TNAEKELHLRNQAMDLNKRKAIESAQ 440
L NAE++L R + N R+A +
Sbjct: 288 IEALHQQRGEYRNAEQDLPDREGEIA-NAREAAAALL 323
Score = 31.3 bits (71), Expect = 1.4
Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 14/101 (13%)
Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
EE R I++LE++ ++L ++V KQE L M+ G +++ + LL QLR+
Sbjct: 904 RLEELARDIEELEEELNELAQEVGAAKQE----LARMD-GGSTAAELEAERESLLAQLRD 958
Query: 354 KDDANFKLMTER-IKSNQLHKLAREEKDTLREQVQALSLQI 393
+ ER ++ +L R+ + REQ Q L
Sbjct: 959 --------LAERYLELALASRLLRKAIERYREQRQPPVLNR 991
Score = 30.5 bits (69), Expect = 2.4
Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 17/151 (11%)
Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF 359
++ ++ L + A EAL + A +D+ ++ +
Sbjct: 270 AELDEISLDEELLAQAAAI-----EALHQQRGEYRNAEQDLPDREGEIANAREAAAALLA 324
Query: 360 KLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI----RKLEEKERFLQTVLT 415
++ + + + +E V L + EA+ A+ LEE+ER L+ V
Sbjct: 325 QIGPDADEE---AVESLRPSLAAKETVTELEKRKEALDQALKSARDALEERERELKQV-- 379
Query: 416 NAEKELH-LRNQAMDLNKRKAIESAQSAADL 445
+L L + R A+ A A D+
Sbjct: 380 --RAQLAALPTVTVPEPLRAALADALRAGDI 408
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 35.3 bits (82), Expect = 0.016
Identities = 22/124 (17%), Positives = 55/124 (44%), Gaps = 13/124 (10%)
Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLM 362
++ ++ + E LL + E E+++++ L + L + D +
Sbjct: 6 RREMEEVQLALDAKREEFERREELLKQRE------EELEKKEEELQESLIKFDKFLKENE 59
Query: 363 TERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK-------ERFLQTVLT 415
+R ++ + + ++ + E+++ L ++E + I KLEEK E FL++V+
Sbjct: 60 AKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVE 119
Query: 416 NAEK 419
+ +
Sbjct: 120 RSPE 123
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 37.6 bits (87), Expect = 0.017
Identities = 23/170 (13%), Positives = 66/170 (38%), Gaps = 9/170 (5%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
K + + + +K E++ LQ+ + ++E E+ E+E + + ++N
Sbjct: 217 KEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENK 276
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
++ + +++ L E + + E L + +++ ++KLE+
Sbjct: 277 EEEKEKKLQEEELKLLAKEE-------EELKSELLKLERRKVDDEEKLKESEKELKKLEK 329
Query: 406 KERFLQTVLTNAEKEL--HLRNQAMDLNKRKAIESAQSAADLKLHLVIIY 453
+ + + + EKEL + + + + +E Q + ++
Sbjct: 330 ELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 37.3 bits (86), Expect = 0.019
Identities = 33/157 (21%), Positives = 63/157 (40%), Gaps = 20/157 (12%)
Query: 293 LADEEAMRKIKQLED---QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
L + + QL+D Q L+ ++ T + E+ A E E + +++ + Q
Sbjct: 64 LLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ 123
Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
+L K+ Q LAR L +Q Q L +++ + R+LE + +
Sbjct: 124 ELAA-------ARQNLAKAQQE--LAR-----LTKQAQDLQTRLKTLAEQRRQLEAQAQS 169
Query: 410 L---QTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
L Q L + +L + + L + + AQ+ A
Sbjct: 170 LQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLA 206
Score = 36.9 bits (85), Expect = 0.027
Identities = 28/124 (22%), Positives = 56/124 (45%), Gaps = 3/124 (2%)
Query: 326 LLNEMEVTGQA-FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE 384
LLN +G +D++ Q L +L TER + + AR+E++ +R+
Sbjct: 64 LLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ 123
Query: 385 QVQALSLQIEAMHTAIRKLEEKERFLQTVLTN-AEKELHLRNQAMDL-NKRKAIESAQSA 442
++ A + + +L ++ + LQT L AE+ L QA L +K ++++ +
Sbjct: 124 ELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQ 183
Query: 443 ADLK 446
+
Sbjct: 184 LKSQ 187
Score = 35.4 bits (81), Expect = 0.079
Identities = 30/156 (19%), Positives = 58/156 (37%), Gaps = 19/156 (12%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQ---KQVATHKQEEEALLNEMEVTGQAFE 338
VR R+ LA +A +++ +L Q LQ K +A +++ EA ++ + + +
Sbjct: 121 VRQELAAARQNLA--KAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQ 178
Query: 339 DMQEQNSRLLQQL---------REKDDAN----FKLMTERIKSNQLHKLAREEK-DTLRE 384
Q + L ++ A + TE + +
Sbjct: 179 ASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA 238
Query: 385 QVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKE 420
Q+ + QI A IR+ E + + L+T E+E
Sbjct: 239 QISQKAQQIAARAEQIRERERQLQRLETAQARLEQE 274
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 37.4 bits (87), Expect = 0.023
Identities = 39/146 (26%), Positives = 76/146 (52%), Gaps = 11/146 (7%)
Query: 296 EEAMRKIKQLED---QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
E+ +R+I L+D L + + K+ E L ++ T E ++E+ L + LR
Sbjct: 148 EKVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLR 207
Query: 353 EKDDANFKLMTERIK-SNQLHKLAREEK--DTLREQVQALSLQIEAMHTAIRKLEEKERF 409
E + ++ +E + +L KL +E K + L+E+++ L ++E++ + RKLEEK R
Sbjct: 208 EIN----EISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263
Query: 410 LQTVLTNAEKELH-LRNQAMDLNKRK 434
L+ + +KE+ L + +L + K
Sbjct: 264 LEERIEELKKEIEELEEKVKELKELK 289
Score = 35.4 bits (82), Expect = 0.071
Identities = 30/139 (21%), Positives = 69/139 (49%), Gaps = 9/139 (6%)
Query: 300 RKIKQLED---QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
R+I++LE +T +++ + ++E E +L E+ +++E+ +L ++++E ++
Sbjct: 176 RRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE 235
Query: 357 ANFKLMTERIKSNQLHKLARE-EKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
+ E I+ + + E K L E+++ L +IE + I +LEEK + L+ +
Sbjct: 236 -----LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKE 290
Query: 416 NAEKELHLRNQAMDLNKRK 434
AE+ + L +
Sbjct: 291 KAEEYIKLSEFYEEYLDEL 309
Score = 34.7 bits (80), Expect = 0.14
Identities = 33/174 (18%), Positives = 78/174 (44%), Gaps = 9/174 (5%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQE---EEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
E+ + ++L+++ +L+ ++ + K+E E L ++ + ++++E+ + LL++L
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELE 580
Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQT 412
E + + + ER+K L E L++ + L + + + +L++ L
Sbjct: 581 ELGFESVEELEERLKE--LEPFYN-EYLELKDAEKELEREEKELKKLEEELDKAFEELAE 637
Query: 413 VLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPARLEFGKDR 466
E+ LR + +L K+ + E + + L L A LE + R
Sbjct: 638 TEKRLEE---LRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688
Score = 32.3 bits (74), Expect = 0.72
Identities = 31/151 (20%), Positives = 62/151 (41%), Gaps = 22/151 (14%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
E + E ++LE++ +L++++ K+E E L E+ + L
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL---------------EEKVKEL 285
Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQALSLQI---EAMHTAIRKL 403
++L+EK + KL L +L EK L E++ + +I E + +L
Sbjct: 286 KELKEKAEEYIKL--SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEEL 343
Query: 404 EEKERFLQTVLTNAEKELHLRNQAMDLNKRK 434
++K + L+ L E+ L +A +
Sbjct: 344 KKKLKELEKRLEELEERHELYEEAKAKKEEL 374
Score = 31.6 bits (72), Expect = 1.2
Identities = 38/174 (21%), Positives = 76/174 (43%), Gaps = 22/174 (12%)
Query: 260 RSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATH 319
L + + + LE F V +R + E + +L+D +L+++
Sbjct: 566 DELEEELAELLKELEELGFESVEELEERLKEL----EPFYNEYLELKDAEKELEREEKEL 621
Query: 320 KQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 379
K+ EE L +AFE++ E RL ++LR++ + E+ S + ++ REE
Sbjct: 622 KKLEEEL-------DKAFEELAETEKRL-EELRKELEE-----LEKKYSEEEYEELREEY 668
Query: 380 DTLREQVQALSLQIEAMH----TAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
L ++ L ++E + + LE+ + L+ A+KEL +A++
Sbjct: 669 LELSRELAGLRAELEELEKRREEIKKTLEKLKEELEE-REKAKKELEKLEKALE 721
Score = 30.4 bits (69), Expect = 2.6
Identities = 31/135 (22%), Positives = 66/135 (48%), Gaps = 13/135 (9%)
Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE- 353
E + K +++ +L+K++ E+ L E+E +A E+++E+ S++ ++ E
Sbjct: 360 RHELYEEAKAKKEELERLKKRLTGLTPEK--LEKELEELEKAKEEIEEEISKITARIGEL 417
Query: 354 -KDDANFKLMTERIKSNQL------HKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
K+ K E +K + +L E + L E+ + +++ + ++++EEK
Sbjct: 418 KKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEE---YTAELKRIEKELKEIEEK 474
Query: 407 ERFLQTVLTNAEKEL 421
ER L+ L EK L
Sbjct: 475 ERKLRKELRELEKVL 489
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 35.9 bits (83), Expect = 0.023
Identities = 27/143 (18%), Positives = 68/143 (47%), Gaps = 14/143 (9%)
Query: 271 GSLEFRKFACVVRDSKRDERKKLAD-EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNE 329
G L A V + + + E +++ +L+++ L++++A + E E L +
Sbjct: 56 GLLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKED 115
Query: 330 MEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQAL 389
+++ + + ++E+ L+ L E + K + + ++ +L + REE + LRE+++ L
Sbjct: 116 LQLLRELLKSLEER----LESLEE----SIKELAKELR--ELRQDLREEVEELREELERL 165
Query: 390 SLQIEAMHTAIRKLEEKERFLQT 412
E + ++E + L+
Sbjct: 166 Q---ENLQRLQEAIQELQSLLEQ 185
Score = 34.7 bits (80), Expect = 0.049
Identities = 24/116 (20%), Positives = 53/116 (45%), Gaps = 14/116 (12%)
Query: 310 HQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSN 369
Q +++ E + L+E++ + ED++E+ + L +L + + L
Sbjct: 71 PQKLEELQGELSELKQQLSELQ---EELEDLEERIAELESELEDLKEDLQLLRELLKSLE 127
Query: 370 Q-LHKLAREEKDT----------LREQVQALSLQIEAMHTAIRKLEEKERFLQTVL 414
+ L L K+ LRE+V+ L ++E + +++L+E + LQ++L
Sbjct: 128 ERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183
Score = 27.8 bits (62), Expect = 9.6
Identities = 14/69 (20%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
R+ + ++L E IK+L + +L++ + +E L ++ Q ++ +
Sbjct: 120 RELLKSLEERL--ESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQ 177
Query: 343 QNSRLLQQL 351
+ LL+QL
Sbjct: 178 ELQSLLEQL 186
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 37.0 bits (86), Expect = 0.024
Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 7/111 (6%)
Query: 317 ATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 376
A ++ E L + E Q+ Q+Q L Q + + + E K AR
Sbjct: 599 AANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEA---RTALKQAR 655
Query: 377 EEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
+ L+ + Q+L ++E ++ E + L+ + +A+ + L Q
Sbjct: 656 LDLQRLQNEQQSLKDKLELAIAERKQQAETQ--LRQL--DAQLKQLLEQQQ 702
Score = 37.0 bits (86), Expect = 0.031
Identities = 22/162 (13%), Positives = 66/162 (40%), Gaps = 8/162 (4%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
R + +D +AD +A+R I+++ + +LQ+ +L ++ +E
Sbjct: 222 RLNPQDVEHWIADIQALRAIQKVAPEIEKLQEDFEQL----LSLELRLQHLHGELVADEE 277
Query: 343 QNSRLLQQLRE-KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
+ + ++ +E K+ +L T + + +E ++ A ++E +
Sbjct: 278 RLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKG 337
Query: 402 KLEEKERFLQTVLTNAEKELHLRNQAMDLNKR-KAIESAQSA 442
E+ + ++ + + ++ +R++ ++ R A+
Sbjct: 338 AFEDAD--IEQLQADLDQLPSIRSELEEVEARLDALTGKHQD 377
Score = 35.8 bits (83), Expect = 0.070
Identities = 33/179 (18%), Positives = 59/179 (32%), Gaps = 34/179 (18%)
Query: 293 LADEEAMR-KIKQLED--QTH-----QLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
A+E +R +++Q E+ Q+ Q ++Q+ E E A + +
Sbjct: 599 AANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDL 658
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
RL + + D + ER + A + L Q++ L Q +A
Sbjct: 659 QRLQNEQQSLKDKLELAIAERKQQ------AETQLRQLDAQLKQLLEQQQAF-----LEA 707
Query: 405 EKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA--ADLKLHLVIIYTNGPARLE 461
K+ F + L K + +E A L + T ARL+
Sbjct: 708 LKDDFRE--LRTERLA-----------KWQVVEGELDNQLAQLSAAIEAARTQAKARLK 753
Score = 35.1 bits (81), Expect = 0.094
Identities = 40/197 (20%), Positives = 77/197 (39%), Gaps = 27/197 (13%)
Query: 272 SLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL-NEM 330
++ RK + + D + K E+ ++ L+D +L+ + Q E L N++
Sbjct: 675 AIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQL 734
Query: 331 EVTGQAFEDMQEQNSRLLQQLREKDD--------------------ANFKLMTERIKSNQ 370
A E + Q L++L+++ D + ERI +
Sbjct: 735 AQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRR 794
Query: 371 LHKLAREEKDTLREQ---VQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
+ RE + ++E +L + + +R+LE LQ LT K+ LR +
Sbjct: 795 -PE-VREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKK 852
Query: 428 MDLNKRKAIESAQSAAD 444
++ +RKA+E D
Sbjct: 853 LE-QERKALEKQLDQLD 868
Score = 31.6 bits (72), Expect = 1.3
Identities = 18/155 (11%), Positives = 57/155 (36%), Gaps = 11/155 (7%)
Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEM-----EVTGQAFEDMQEQN---SRLLQQLRE 353
+ E++ + Q++ K L + E + +++ N + +L
Sbjct: 272 LVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELEL 331
Query: 354 KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
+D I+ Q ++ ++R +++ + +++A+ + ++ K L+
Sbjct: 332 LEDQKGAFEDADIEQLQADL---DQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQK 388
Query: 414 LTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLH 448
+ + +N R+ + ++A + L
Sbjct: 389 IKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQ 423
Score = 30.8 bits (70), Expect = 2.4
Identities = 24/159 (15%), Positives = 59/159 (37%), Gaps = 18/159 (11%)
Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK 354
+R+ + +T + + + L ++ + E++Q++ +RL++ + +
Sbjct: 793 RRPEVREYRAFMQETWLHRDSLREERP---NLAIQLRELESSAEELQQELTRLIKDTKLR 849
Query: 355 DDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS----------LQIEAMHTAIRKLE 404
KL ER + E LR++++ L+ + + +LE
Sbjct: 850 ---RKKLEQERKALEKQLDQLDELLRGLRDEMRQLAELKEPANANQAEGSISER-LDQLE 905
Query: 405 EKERFLQTVLTNAEKEL-HLRNQAMDLNKRKAIESAQSA 442
E +R + + + +K L D + E+ Q
Sbjct: 906 EFKRKRKRLSGDLKKFLERFDGVIKDHSGSGLYETWQKL 944
Score = 30.4 bits (69), Expect = 3.0
Identities = 22/109 (20%), Positives = 42/109 (38%), Gaps = 5/109 (4%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
+ + K +EE +L +L AT E L +E+ +A E QE+
Sbjct: 431 QQLEAGKLEFNEEEYELELRLGRLKQRLDSATAT----PEELEQ-LEINDEALEKAQEEQ 485
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI 393
+ + + +L R ++ + + A LR+ + L LQ+
Sbjct: 486 EQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQL 534
Score = 29.7 bits (67), Expect = 5.0
Identities = 23/113 (20%), Positives = 45/113 (39%), Gaps = 16/113 (14%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
K A + R + + +L+ +A KQ+ E L +++ + Q
Sbjct: 651 LKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLD---------AQLK-----Q 696
Query: 351 LREKDDANFKLMTERIKSNQLHKLA--REEKDTLREQVQALSLQIEAMHTAIR 401
L E+ A + + + + + +LA + + L Q+ LS IEA T +
Sbjct: 697 LLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAK 749
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 36.7 bits (85), Expect = 0.025
Identities = 30/163 (18%), Positives = 65/163 (39%), Gaps = 20/163 (12%)
Query: 290 RKKLADEEAMRK----IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+ ++EA K + + QL++Q+ K+E++ L NE Q +Q++ +
Sbjct: 43 TLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQ--IKQFELALQDEIA 100
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL---QIEAM----HT 398
+L + + +L +L +L++E + L+ + + +
Sbjct: 101 KLEALELLNLEKDKELELL---EKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFE 157
Query: 399 AIRKLEEKERFLQTVLTNAEKELHLRNQAMD-LNKRKAIESAQ 440
+KLEE L+ E++LH N ++ + ES
Sbjct: 158 NEKKLEES---LELEREKFEEQLHEANLDLEFKENEEQRESKW 197
Score = 29.8 bits (67), Expect = 3.8
Identities = 23/131 (17%), Positives = 53/131 (40%), Gaps = 15/131 (11%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+E+ K + +I +LE L+ +E E L E++ + + + +
Sbjct: 83 LFNEQIKQFELALQDEIAKLE----ALELLNLEKDKELELLEKELDELSKELQKQLQNTA 138
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIE---------AM 396
++++ RE + +L E K + L E ++ EQ+ +L +E +
Sbjct: 139 EIIEKKRENNKNEERLKFENEKKLEE-SLELE-REKFEEQLHEANLDLEFKENEEQRESK 196
Query: 397 HTAIRKLEEKE 407
++KL+ +
Sbjct: 197 WAILKKLKRRA 207
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 36.1 bits (84), Expect = 0.026
Identities = 29/128 (22%), Positives = 51/128 (39%), Gaps = 25/128 (19%)
Query: 285 SKRDERKKLAD------EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
K ++ +K+ + E + K +Q L + +A KQ E L E + E
Sbjct: 20 DKAEDPEKMLEQAIRDMESELAKARQ------ALAQAIARQKQLERKL-EEAQARA---E 69
Query: 339 DMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
++E+ LQ E D A L E +S + + L ++Q Q+E +
Sbjct: 70 KLEEKAELALQAGNE-DLAREAL--EEKQSLE------DLAKALEAELQQAEEQVEKLKK 120
Query: 399 AIRKLEEK 406
+ LE+K
Sbjct: 121 QLAALEQK 128
Score = 30.4 bits (69), Expect = 1.7
Identities = 22/132 (16%), Positives = 41/132 (31%), Gaps = 41/132 (31%)
Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQ---------------- 335
LA E A+ + + LED L+ ++ +++ E L ++ Q
Sbjct: 85 DLARE-ALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARK 143
Query: 336 ----AFEDMQEQ--------NSRLLQQLREKDD---ANFKLMTERIKSNQLHKLAREEKD 380
A E + +++ EK + A + E LA D
Sbjct: 144 AAAKAQEKVNRSLGGGSSSSAMAAFERMEEKIEEREARAEAAAE---------LAEGSGD 194
Query: 381 TLREQVQALSLQ 392
L ++ Q
Sbjct: 195 DLDKEFAQAGAQ 206
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper.
Length = 54
Score = 33.0 bits (76), Expect = 0.029
Identities = 12/34 (35%), Positives = 24/34 (70%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE 322
E+KK +EE ++K+LE++ QL+++V ++E
Sbjct: 21 EKKKQREEELEERVKELEEENAQLRQKVEQLEKE 54
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 36.7 bits (86), Expect = 0.032
Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 318 THKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK-DDANFKL-MT------------ 363
++ +E + E QE+ ++L E+ + F L T
Sbjct: 112 EYEARKEEIEEEF----------QEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDG 161
Query: 364 ERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
E + + L EE++ L E++ L E + +R+L E ER + L ++E+
Sbjct: 162 EPLTEEEFEALPEEEREELEEKIDELE---EELQEILRQLRELEREAREKLRELDREV 216
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 34.1 bits (79), Expect = 0.034
Identities = 21/121 (17%), Positives = 51/121 (42%), Gaps = 24/121 (19%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFK 360
I + + Q Q + +A ++ E L E+E+ + E + +D
Sbjct: 3 LINKFQ-QLQQQLQLLAQQIKQLETQLKEIELVLEELELLD-------------EDTKVY 48
Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKE 420
+ + L +++K+ ++E+++ + E + I+ LE++ L+ L ++E
Sbjct: 49 KLIGDV-------LVKQDKEEVKEELEE---RKETLEKEIKTLEKQLEKLEKELEELKEE 98
Query: 421 L 421
L
Sbjct: 99 L 99
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 36.0 bits (83), Expect = 0.035
Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 7/91 (7%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
DS+ DE KK +E + ++++LE + L ++ ++E+E L NE Q +
Sbjct: 69 DSELDELKK-EEERLLDELEELEKEDDDLDGELVELQEEKEQLENEEL---QYLREYNLF 124
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKL 374
+ LQ + NQL KL
Sbjct: 125 DRNNLQLEDNLQSLELQY---EYSLNQLDKL 152
Score = 31.8 bits (72), Expect = 0.94
Identities = 24/104 (23%), Positives = 47/104 (45%), Gaps = 7/104 (6%)
Query: 306 EDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTER 365
L ++ K+EEE LL+E+E ++++++ L +L E + +L E
Sbjct: 62 ISNYEALDSELDELKKEEERLLDELE-------ELEKEDDDLDGELVELQEEKEQLENEE 114
Query: 366 IKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
++ + + L L + +Q+L LQ E + KL + F
Sbjct: 115 LQYLREYNLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRKTNIF 158
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 35.3 bits (82), Expect = 0.036
Identities = 32/143 (22%), Positives = 60/143 (41%), Gaps = 24/143 (16%)
Query: 296 EEAMRKIKQ-LEDQTH-------QLQKQVATHKQEEEALLNEMEVTGQAFEDM------- 340
E+A +IK D TH L++++A K+ EE M Q + +
Sbjct: 8 EKAFNEIKNYYNDITHNNLELIKSLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKA 67
Query: 341 QEQNSRLLQQLR--EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
+E+ L ++L+ EKD + K + R+K + +E L+ + + L + E +
Sbjct: 68 EEEVEELRKKLKDYEKDKQSLKNLKARLKELE------KELKNLKWESEVLEQRFEKVER 121
Query: 399 AIRKLEEK-ERFLQTVLTNAEKE 420
+L +K E +Q V +
Sbjct: 122 ERDELYDKFEAAIQDVQQKTGLK 144
Score = 31.0 bits (71), Expect = 0.93
Identities = 32/147 (21%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
K +E + E ++ K+L + + +++V E L +++ + + ++ +
Sbjct: 41 KNEEHNEKLMAEIAQENKRLVEPLKKAEEEV-------EELRKKLKDYEKDKQSLKNLKA 93
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA---------------LS 390
RL + EK+ N K +E + + K+ R E+D L ++ +A L
Sbjct: 94 RLKEL--EKELKNLKWESE-VLEQRFEKVER-ERDELYDKFEAAIQDVQQKTGLKNLLLE 149
Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNA 417
++EA++ LE+KE L VL A
Sbjct: 150 QKLEALNEE---LEKKEAQLNEVLAAA 173
>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
(chemotaxis sensory transducer). Thought to undergo
reversible methylation in response to attractants or
repellants during bacterial chemotaxis.
Length = 262
Score = 35.7 bits (83), Expect = 0.041
Identities = 32/145 (22%), Positives = 64/145 (44%), Gaps = 20/145 (13%)
Query: 276 RKFACVVRDSKRDERKKLADE--EAMRKIKQL----EDQTHQLQKQVATHKQEEEALLNE 329
R FA V DE +KLA+ E+ ++I+ L +++T++ + E E +
Sbjct: 126 RGFAVVA-----DEVRKLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVEL 180
Query: 330 MEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT--ERIKS--NQLHKLAREEKDTLREQ 385
+E TG A E++ + + ++E A + E + + +++ ++ +E E
Sbjct: 181 VEETGDALEEIVDSVEEIADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETAAMSEEI 240
Query: 386 VQALSLQIEAMHTAIRKLEE-KERF 409
A E + +L+E ERF
Sbjct: 241 SAA----AEELSGLAEELDELVERF 261
>gnl|CDD|218051 pfam04380, BMFP, Membrane fusogenic activity. BMFP consists of two
structural domains, a coiled-coil C-terminal domain via
which the protein self-associates as a trimer, and an
N-terminal domain disordered at neutral pH but adopting
an amphipathic alpha-helical structure in the presence
of phospholipid vesicles, high ionic strength, acidic pH
or SDS. BMFP interacts with phospholipid vesicles though
the predicted amphipathic alpha-helix induced in the
N-terminal half of the protein and promotes aggregation
and fusion of vesicles in vitro.
Length = 79
Score = 32.9 bits (76), Expect = 0.045
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 10/83 (12%)
Query: 330 MEVTGQAFEDMQEQNSRLL---QQLREKDDANFKLMTERIKSNQLHKL---AREEKDTLR 383
M+ + +++ + + L Q R + + N + + L KL REE D R
Sbjct: 1 MQDPNRLLDELAKLLTDALGAAQGPRREAEKNVRAQLQS----ALSKLDLVTREEFDVQR 56
Query: 384 EQVQALSLQIEAMHTAIRKLEEK 406
+ ++EA+ + LE K
Sbjct: 57 AVLARTREKLEALEARVAALEAK 79
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 35.8 bits (83), Expect = 0.046
Identities = 23/135 (17%), Positives = 56/135 (41%), Gaps = 26/135 (19%)
Query: 294 ADEEAMRKIKQLEDQ----THQLQKQVATHKQEEEALLNEMEVTG------QAFEDMQEQ 343
A EE ++K + Q++++ ++E E L+ E + G + E+ E+
Sbjct: 79 AKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKGREEGLEE 138
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLH------KLAR----EEKDTLREQVQALSLQI 393
L++QL + + K E ++S++ +A+ E + +E AL +
Sbjct: 139 VRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKKVIKEISENSKEIALALVKE- 197
Query: 394 EAMHTAIRKLEEKER 408
++++++
Sbjct: 198 -----LLKEVKDATD 207
>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353).
Members of this family of uncharacterized proteins have
no known function.
Length = 319
Score = 35.6 bits (82), Expect = 0.049
Identities = 27/143 (18%), Positives = 51/143 (35%), Gaps = 7/143 (4%)
Query: 285 SKRDERKKLAD--EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
SK + K L E+ + Q + LQ + K+ E L E+ Q ++
Sbjct: 6 SKTEALKILGQDLEKCQNERDQFKLMAETLQMRYRNLKKSEREL--ELVAACQGDMKIKP 63
Query: 343 --QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
+ L Q L + + N +L T+ + Q + + LR+ + ++
Sbjct: 64 GTKRQDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIKLLRQTIAQQRVEFMGGGVR- 122
Query: 401 RKLEEKERFLQTVLTNAEKELHL 423
EE+ + + EK L
Sbjct: 123 ESFEERNDLVSQLEKLREKCKQL 145
Score = 29.8 bits (67), Expect = 3.1
Identities = 32/162 (19%), Positives = 65/162 (40%), Gaps = 28/162 (17%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQ---EEEALLNEMEVTGQAFEDMQEQ 343
R+E ++L+ + +++L + +LQ + +Q ++ V E +E+
Sbjct: 78 REENQQLSTD-----VQELRQRKAELQGDIKLLRQTIAQQRVEFMGGGVR----ESFEER 128
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLARE---EKDTLREQVQALSLQIEAMHTA- 399
N L+ QL EK K + ++S L E E+D + + L+ ++ +
Sbjct: 129 N-DLVSQL-EKLREKCKQLESDLRS--LLDEKEEVLMERDEYQRKAHRLNNELSYLLGGD 184
Query: 400 ------IRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKA 435
+ L + R+L+ + E+E L L K K
Sbjct: 185 PRRVVDVDALVMENRYLKEKINTLEEESELIKMT--LAKYKQ 224
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 35.9 bits (83), Expect = 0.050
Identities = 40/172 (23%), Positives = 62/172 (36%), Gaps = 29/172 (16%)
Query: 284 DSKRDE--RKKLADEEA----------MRKIKQLEDQTHQLQKQVATHKQEEEALLNEME 331
+KR E RKK ++A ++KQLE + Q+Q KQ EEA
Sbjct: 73 SAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ---KKQAEEAAKQAAL 129
Query: 332 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL 391
QA E + A K E ++ K A E + + +A
Sbjct: 130 KQKQAEE----------AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK-KAEAEA--- 175
Query: 392 QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
+A A +K E + + E + +A K+KA A++AA
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAA 227
>gnl|CDD|113545 pfam04778, LMP, LMP repeated region. This family consists of a
repeated sequence element found in the LMP group of
surface-located membrane proteins of Mycoplasma hominis.
The the number of repeats in the protein affects the
tendency of cells to spontaneously aggregate.
Agglutination may be an important factor in
colonisation. Non-agglutinating microorganisms might
easily be distributed whereas aggregation might provide
a better chance to avoid an antibody response since some
of the epitopes may be buried.
Length = 157
Score = 34.5 bits (79), Expect = 0.051
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 9/76 (11%)
Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGP 457
T + KL + + +Q T +KE+ NQA+ N +++SA+S+ D K V T
Sbjct: 4 TLLAKLTDNDNTIQKAKTELDKEIQKANQAVASNNTASMQSAKSSLDAK---VTEITK-- 58
Query: 458 ARLE-FGKDREFSAKF 472
+LE F KD++ AKF
Sbjct: 59 -KLETFNKDKD--AKF 71
Score = 29.5 bits (66), Expect = 2.8
Identities = 22/128 (17%), Positives = 49/128 (38%), Gaps = 23/128 (17%)
Query: 285 SKRDERKKLADEEAMRKIKQ-LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
K ++ + +M+ K L+ + ++ K++ T ++++A E+E T + ++
Sbjct: 28 QKANQAVASNNTASMQSAKSSLDAKVTEITKKLETFNKDKDAKFKELEQTRKDIDEFINT 87
Query: 344 N------SRLLQQLREKDD----------------ANFKLMTERIKSNQLHKLAREEKDT 381
N S L++ L K D AN +L K+ +
Sbjct: 88 NKNNPNYSTLVKDLTSKKDSKNSVTNSSNKSDIEAANTELKQALAKAKAAKDQIDNANKS 147
Query: 382 LREQVQAL 389
++EQ+
Sbjct: 148 IKEQLNDS 155
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 36.0 bits (83), Expect = 0.059
Identities = 44/190 (23%), Positives = 72/190 (37%), Gaps = 27/190 (14%)
Query: 278 FACVVRD----SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL------ 327
CV R + E +LA E + + K+L D+ L+ ++ H++E L
Sbjct: 667 LFCVQRLRVAAELQKEEARLALEGNIERTKELNDE---LRAELELHRKEILDLFDCGTAD 723
Query: 328 NEMEVTGQAFEDMQEQNSR-----LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTL 382
E A E+ Q L QL ++L S +L EE L
Sbjct: 724 TEDAFREAAREEQQLTQRESRLESLEAQLEGVAAEAYELSA----SLDQRELKEEELALL 779
Query: 383 REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
E + AL ++E +H + L + L+ T AE + DL E A+
Sbjct: 780 EEAIDALDEEVEELHAQVAALSRQIAQLEGGGTVAELRQRRESLKEDLE-----EKARKW 834
Query: 443 ADLKLHLVII 452
A L+L + ++
Sbjct: 835 ASLRLAVQVL 844
Score = 33.3 bits (76), Expect = 0.41
Identities = 32/173 (18%), Positives = 64/173 (36%), Gaps = 20/173 (11%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+R + RK L EA + LE + + +A + E E L+E+ ++ + + Q
Sbjct: 555 IRQHWQQLRKALDQLEA--AYEALEGRFAAAEAAMAEWQSEWEEALDELGLSRELSPEQQ 612
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR--EEKDTLREQVQALSLQIEA---- 395
++ L++ + K+ H++AR EE+ E+V+ L +EA
Sbjct: 613 LDILSTMKDLKKLM---------QKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFID 663
Query: 396 ---MHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADL 445
+ R E + E + + D + + + DL
Sbjct: 664 LSTLFCVQRLRVAAELQKEEARLALEGNIERTKELNDELRAELELHRKEILDL 716
Score = 30.2 bits (68), Expect = 3.5
Identities = 34/192 (17%), Positives = 69/192 (35%), Gaps = 13/192 (6%)
Query: 266 ASNIYGSLEFRKFACVVRDSKRDERKKLADE----EAMRKIKQLEDQTHQLQKQVATH-- 319
S GS K V+ K + + K+KQ ++ + +K+ AT+
Sbjct: 150 FSGTSGSPASTKLLEVLNKEADSLYKPSGRNPQINQLLEKLKQERNEIDEAEKEYATYHK 209
Query: 320 ----KQEEEALLNEMEVTGQAFED-MQEQNS--RLLQQLREKDDANFKLMTERIKSNQLH 372
++ E A L E+ +A D ++ L +L+E +L R +
Sbjct: 210 LLESRRAEHARLAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQELTRRREELATFP 269
Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNK 432
+ + +Q +I+A+ + +L++ L + L +Q + K
Sbjct: 270 RDGVLRLEKREAHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQLSEIK 329
Query: 433 RKAIESAQSAAD 444
A E ++ A
Sbjct: 330 ASAFELTETLAG 341
>gnl|CDD|224300 COG1382, GimC, Prefoldin, chaperonin cofactor [Posttranslational
modification, protein turnover, chaperones].
Length = 119
Score = 33.4 bits (77), Expect = 0.061
Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 19/122 (15%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
E ++ QL+ QLQK V KQ+ EA L E+E + E + E D
Sbjct: 7 EVQAQLAQLQQLQQQLQK-VILQKQQLEAQLKEIEKALEELEKLDE------------DA 53
Query: 357 ANFKLMTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
+K + N L K+++EE D L E+ + L L+I+ + KL+E+ LQ+ +
Sbjct: 54 PVYKKV-----GNLLVKVSKEEAVDELEERKETLELRIKTLEKQEEKLQERLEELQSEIQ 108
Query: 416 NA 417
A
Sbjct: 109 KA 110
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 35.8 bits (82), Expect = 0.066
Identities = 34/160 (21%), Positives = 54/160 (33%), Gaps = 18/160 (11%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
++ K L + + QK + +E+ ALL E +QE
Sbjct: 484 QEQMTAALKALLAFQQQIADLSGAKEKASDQKSLLWKAEEQYALLKEE----AKQRQLQE 539
Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
Q + L + +++ T S QL +L ++ D QA Q E R+
Sbjct: 540 QKA--LLEHKKE--------TLEYTS-QLAELLDQQADRFELSAQAAGSQKERGSDLYRE 588
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
+ L A EL A+DL + A SA
Sbjct: 589 ALAQ---NAAALNKALNELAAYWSALDLLQGDWKAGALSA 625
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 35.2 bits (81), Expect = 0.067
Identities = 27/144 (18%), Positives = 67/144 (46%), Gaps = 9/144 (6%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN----EMEVTGQAFEDMQEQN 344
E+ KL +EE L + L+ + T++++E+ L+N ++ + E+
Sbjct: 168 EKLKLLEEENEH----LRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEEL 223
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
++ + L + + L+++ + + K E + L++ + A + +++L+
Sbjct: 224 AKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQELQ 283
Query: 405 EKERFLQTVLTNAEKEL-HLRNQA 427
+K +L A++EL +LRN+
Sbjct: 284 DKYAECMEMLHEAQEELKNLRNKG 307
>gnl|CDD|214699 smart00503, SynN, Syntaxin N-terminal domain. Three-helix domain
that (in Sso1p) slows the rate of its reaction with the
SNAP-25 homologue Sec9p.
Length = 117
Score = 33.5 bits (77), Expect = 0.076
Identities = 15/88 (17%), Positives = 38/88 (43%), Gaps = 5/88 (5%)
Query: 302 IKQLEDQTHQLQKQVATHKQE---EEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDAN 358
I+++ +LQK ++ L ++E + + ++ L++L +++ N
Sbjct: 17 IQKISQNVAELQKLHEELLTPPDADKELREKLERLIDDIKRLAKEIRAKLKELEKENLEN 76
Query: 359 FKLMTE--RIKSNQLHKLAREEKDTLRE 384
+ R + Q KL ++ K+ + E
Sbjct: 77 RASGSASDRTRKAQTEKLRKKFKEVMNE 104
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 35.6 bits (82), Expect = 0.077
Identities = 36/160 (22%), Positives = 77/160 (48%), Gaps = 15/160 (9%)
Query: 294 ADEEAMR-KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
+ E +I +L+ + QL+ +++ + E E L + + + E ++ Q RL +++
Sbjct: 27 SKEAYYLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIK 86
Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQAL---SLQIEAMHTAIRKLEEKERF 409
E +L+ + ++L EE +L++QV L ++ E + IR+LEE+
Sbjct: 87 EYKFREARLLQD---YSEL----EEENISLQKQVSVLRQSQVEFEGLKHEIRRLEEETEL 139
Query: 410 LQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
L + L A + L+ A + +A+E+ ++ + K L
Sbjct: 140 LNSQLEEAAR---LKEIA-EKQLEEALETLKTEREQKNAL 175
Score = 35.2 bits (81), Expect = 0.092
Identities = 28/138 (20%), Positives = 63/138 (45%), Gaps = 12/138 (8%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
A+R ++ ++Q L + E+ + EV E ++ + + ++ E
Sbjct: 311 NALRNLQASKEQQDGLDSEKDRGSHEDG---DYYEVDINGPEILECKYRVAVSEVGELKA 367
Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK----ERFLQT 412
+L + + N+L + +EKD LR ++Q L+ ++ ++ + + +E+ E+ L+
Sbjct: 368 ---ELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDLEKELRK 424
Query: 413 VLTNA-EKELHLRNQAMD 429
+ A E + L N A D
Sbjct: 425 LTHVAGESQGSL-NAAQD 441
Score = 29.8 bits (67), Expect = 3.8
Identities = 50/241 (20%), Positives = 84/241 (34%), Gaps = 39/241 (16%)
Query: 291 KKLAD---EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL-LNEME--VTGQAF----EDM 340
K++A+ EEA+ +K +Q + L+K+++ H E+ L+ + + G F E
Sbjct: 151 KEIAEKQLEEALETLKTEREQKNALRKELSHHLNLEDFYSLSNLSISLDGLKFSEDPEAS 210
Query: 341 QEQNSRL-LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE----QVQALSLQIEA 395
E N+ + + +L A D E ++Q L Q+E
Sbjct: 211 TEPNNDGEDENGHLNGGPGLAKSLGTPRKGELFPPAPGVSDLFSELNLSEIQKLKQQLEQ 270
Query: 396 MHTAIRKLEEKERFLQTVLTNAEKEL------------HLRNQAMDLNKRKAIESAQSAA 443
+ L + LQT L +E L HL ++ + S
Sbjct: 271 VEREKASLLANLQELQTQLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQDGLDSEK 330
Query: 444 DLKLHLVIIY----TNGPARLEFGKDREFSAKFVNFRRKSYFFVEFLDCLRTRYETRQRK 499
D H Y NGP LE K R ++ + + L L+ RY Q +
Sbjct: 331 DRGSHEDGDYYEVDINGPEILE-CKYRVAVSEVGELKAE-------LKALKARYNELQER 382
Query: 500 C 500
Sbjct: 383 Y 383
>gnl|CDD|192486 pfam10224, DUF2205, Predicted coiled-coil protein (DUF2205). This
entry represent a highly conserved 100 residue region
which is likely to be a coiled-coil structure. The exact
function is unknown.
Length = 80
Score = 32.5 bits (74), Expect = 0.079
Identities = 19/62 (30%), Positives = 29/62 (46%), Gaps = 7/62 (11%)
Query: 362 MTERIKSNQLHKLAREEKDTLREQV-------QALSLQIEAMHTAIRKLEEKERFLQTVL 414
M+ R S L L +E + L EQ + LS ++EA+ KL + +FLQ +
Sbjct: 1 MSPRRDSEDLEALEKEARQRLIEQALELQDSLEDLSQRVEAVKEENLKLRSENQFLQQYI 60
Query: 415 TN 416
N
Sbjct: 61 EN 62
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 35.2 bits (81), Expect = 0.081
Identities = 38/177 (21%), Positives = 79/177 (44%), Gaps = 27/177 (15%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM---QEQN 344
+ KL EE+ + K LE + +L++++ ++E+E LL E + + E++ Q Q
Sbjct: 374 ELHAKL-SEESKKADK-LEFEYKRLEEKLEALQKEKERLLAERDSLRETNEELRCGQAQQ 431
Query: 345 SRLLQQL-----------REKDDANFKLMTERIK-SNQLHKLAREEKDTLREQVQALSLQ 392
+L Q E + ++ R++ N++ +L +E + E++ L
Sbjct: 432 DQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQEGSE--NERITELQQL 489
Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
+E + +LE + R ++ L L+ Q DL +KA++ S ++ L
Sbjct: 490 LEDANRRNNELETQLR------LANQRILELQQQVEDL--QKALQEQGSKSEDSSLL 538
Score = 33.6 bits (77), Expect = 0.27
Identities = 35/146 (23%), Positives = 63/146 (43%), Gaps = 28/146 (19%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
E+A R+ +LE Q +++ +Q+ E L ++ G ED S+L + L
Sbjct: 491 EDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLE--- 547
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
QLH+ A EE REQ++ L+ + RK+ E L+ L
Sbjct: 548 --------------QLHE-ANEELQKKREQIE--ELEPDQDQNLSRKIAE----LEAALQ 586
Query: 416 NAEKELHLRNQAMDLNKRKAIESAQS 441
++++ +AM+ +K +E A+
Sbjct: 587 KKDEDM----RAMEERYKKYVEKARE 608
Score = 33.6 bits (77), Expect = 0.30
Identities = 46/198 (23%), Positives = 79/198 (39%), Gaps = 34/198 (17%)
Query: 260 RSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLAD-EEAMRKIKQLEDQTHQLQKQVAT 318
S + G+L+ + +K L D +EA + +L + H+L KQV
Sbjct: 157 EQGSSPSRESAGNLD------------QQLKKALEDLKEAQEEKDELAQRCHELDKQVLL 204
Query: 319 HKQEEEALLNEMEVTGQAFEDM--------QEQNSRLLQ---QLREKDDANFKLMT---- 363
++E+ +L E E + + Q + + QL + + NF+L
Sbjct: 205 LQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKYNLLQSQLEQLQEENFRLEAARDD 264
Query: 364 ERIKSNQLHKLARE------EKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNA 417
RIK +L K E E +L + QAL +I+ + + K ++ E ++T
Sbjct: 265 YRIKCEELEKELAELQHRNDELTSLAAESQALKDEIDVLRESSDKAKKLEAQVETYKKKL 324
Query: 418 EKELHLRNQAMDLNKRKA 435
E LR Q L +R A
Sbjct: 325 EDLNDLRRQVKLLEERNA 342
Score = 32.1 bits (73), Expect = 0.81
Identities = 37/166 (22%), Positives = 76/166 (45%), Gaps = 29/166 (17%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL-NEMEVTGQAFEDMQ 341
R+++ + + +LA++ + +Q+ED LQ+Q K E+ +LL +++E + +
Sbjct: 496 RNNELETQLRLANQRILELQQQVEDLQKALQEQ--GSKSEDSSLLKSKLEEHLEQLHEAN 553
Query: 342 EQNSRLLQQLRE---------------------KDDANFKLMTERIKSNQLHKLAREEKD 380
E+ + +Q+ E K D + + M ER K + K ARE
Sbjct: 554 EELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAMEERYKK-YVEK-AREVIK 611
Query: 381 TLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
TL + S +I+ + +L E+++ ++ + + EK +R Q
Sbjct: 612 TLDPKQNPASPEIQLLKK---QLTERDKRIRHLESEYEKAKPMREQ 654
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 35.3 bits (82), Expect = 0.086
Identities = 27/110 (24%), Positives = 45/110 (40%), Gaps = 21/110 (19%)
Query: 288 DERKKLADEEAM---RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
E A E M ++++QL+ + +L + ++AL E +G+ FED Q+
Sbjct: 571 SESVSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVT 630
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIE 394
+ Q L ER + E+D L + QAL +IE
Sbjct: 631 EYMQQLL------------ERERE------LTVERDELAARKQALDEEIE 662
Score = 30.7 bits (70), Expect = 2.2
Identities = 26/129 (20%), Positives = 52/129 (40%), Gaps = 29/129 (22%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
R++ + EEA R ++Q + QL+ V+ + + E + E Q ++ Q+
Sbjct: 900 REQLDEA--EEAKRFVQQHGNALAQLEPIVSVLQSDPE----QFEQLKQDYQQAQQTQRD 953
Query: 347 LLQQLREKDD-----ANFKL------------MTERIKSNQLHKLAREEKDTLREQVQAL 389
QQ + A+F + E+++ Q + A +E+ REQ++
Sbjct: 954 AKQQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEKLR--QRLEQAEQERTRAREQLR-- 1009
Query: 390 SLQIEAMHT 398
Q +A
Sbjct: 1010 --QAQAQLA 1016
Score = 30.7 bits (70), Expect = 2.6
Identities = 29/143 (20%), Positives = 56/143 (39%), Gaps = 30/143 (20%)
Query: 290 RKKLADEEAMR-----KIKQLEDQTHQ-------LQKQVATHKQEEEALLNEMEVTG-QA 336
R++L E R +++Q + Q Q L+ +Q + L E++ G A
Sbjct: 991 RQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPA 1050
Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
+E+ + R++ A L R + NQL K Q+ +++ +
Sbjct: 1051 DSGAEER----ARARRDELHAR--LSANRSRRNQLEK-----------QLTFCEAEMDNL 1093
Query: 397 HTAIRKLEEKERFLQTVLTNAEK 419
+RKLE ++ + NA+
Sbjct: 1094 TKKLRKLERDYHEMREQVVNAKA 1116
>gnl|CDD|222465 pfam13942, Lipoprotein_20, YfhG lipoprotein. This family includes
the YfhG protein from E. coli. Members of this family
have an N-terminal lipoprotein attachment site. The
members of this family are functionally uncharacterized.
Length = 180
Score = 34.2 bits (79), Expect = 0.092
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 11/81 (13%)
Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
LLQ RE +L ER + +L + + E D LREQ L ++E T RKLE
Sbjct: 100 LLQLWREGQVLQLQLAEERARYQRLQQSSDSELDRLREQQARLQYELE---TTTRKLEN- 155
Query: 407 ERFLQTVLTNAEKELHLRNQA 427
LT+ E++L R Q
Sbjct: 156 -------LTDIERQLSSRKQG 169
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 35.2 bits (81), Expect = 0.095
Identities = 36/138 (26%), Positives = 55/138 (39%), Gaps = 23/138 (16%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQ---VATHKQEEEALLNEMEVTGQAFED 339
D+ E KKL +E A + QL+ T QLQ+ + +QEE+AL +
Sbjct: 532 LDALEKEVKKLGEEGAALR-GQLDALTKQLQRDESEAQSLRQEEQALTQQW--------- 581
Query: 340 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTA 399
Q + L L+ +DD L + QL L+ Q L QI A +
Sbjct: 582 -QAVCASLNITLQPQDDIQPWLDAQEEHERQLRLLS---------QRHELQGQIAAHNQQ 631
Query: 400 IRKLEEKERFLQTVLTNA 417
I + +++ Q L A
Sbjct: 632 IIQYQQQIEQRQQQLLTA 649
Score = 32.9 bits (75), Expect = 0.56
Identities = 36/176 (20%), Positives = 61/176 (34%), Gaps = 39/176 (22%)
Query: 308 QTHQLQKQVATHKQEEEALLNEMEVTG-QAFEDMQEQNSRLLQQLREKDDANFKLMTERI 366
T Q ++V T Q AL + + ++Q Q L L E D F+ +
Sbjct: 308 HTRQQIEEVNTRLQSTMALRARIRHHAAKQSAELQAQQQSLNTWLAEHD--RFRQWNNEL 365
Query: 367 ------------KSNQLHKLA------REEKDTL---------REQVQALSLQIE----- 394
QL + ++ + L E AL+ E
Sbjct: 366 AGWRAQFSQQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTADEVAAALAQHAEQRPLR 425
Query: 395 ----AMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
A+H I +++ LQ + N +E RN A++ +++ E Q AD+K
Sbjct: 426 QRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVK 481
Score = 32.1 bits (73), Expect = 0.85
Identities = 24/96 (25%), Positives = 41/96 (42%), Gaps = 5/96 (5%)
Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
A EE R+++ L Q H+LQ Q+A H Q+ ++E Q + L Q
Sbjct: 604 AQEEHERQLRLLS-QRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLPQ--- 659
Query: 354 KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQAL 389
+D + R + Q + + E L+ ++Q L
Sbjct: 660 -EDEEASWLATRQQEAQSWQQRQNELTALQNRIQQL 694
Score = 32.1 bits (73), Expect = 0.99
Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 10/81 (12%)
Query: 372 HKLAREEKDTLREQVQALSL----QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
HK AR E + L+ Q ++L Q++++ +++ L ++E+ L T ++ L+
Sbjct: 193 HKSARTELEKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQQQQSLNW---- 248
Query: 428 MDLNKRKAIESAQSAADLKLH 448
L + ++ S L
Sbjct: 249 --LTRLDELQQEASRRQQALQ 267
>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
Members of this family function as part of the Mediator
(Med) complex, which links DNA-bound transcriptional
regulators and the general transcription machinery,
particularly the RNA polymerase II enzyme. They play a
role in basal transcription by mediating activation or
repression according to the specific complement of
transcriptional regulators bound to the promoter.
Length = 176
Score = 33.5 bits (77), Expect = 0.12
Identities = 18/94 (19%), Positives = 37/94 (39%), Gaps = 20/94 (21%)
Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
L + L DD + +K + H+ ++ LR +V++L +I
Sbjct: 3 LAEDLISADDE----LKSALKELEEHQENQQRILELRAEVESLDEKI------------- 45
Query: 407 ERFLQTVLTNAEKELH--LRNQAMDLNKRKAIES 438
+ + L AEKEL +++ ++ + S
Sbjct: 46 -KDILKELKEAEKELRTLPDLSSINKANKRKVSS 78
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 34.6 bits (79), Expect = 0.12
Identities = 39/165 (23%), Positives = 72/165 (43%), Gaps = 21/165 (12%)
Query: 287 RDERKKLADEEAMRKI-KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+K+++ ++ + K+LE+ L+K +EE++L N +E Q E+ +++
Sbjct: 242 ESLQKEISIMASVASVLKELEEAKANLEKA----AEEEKSLRNLVESLKQELEEEKKELE 297
Query: 346 RLLQQLREKDDANFKLMTERIKSNQ-LHKLAREEKDTLREQVQALSLQIE-------AMH 397
L ++ E ++A L E ++ ARE+ + ++Q S + E A
Sbjct: 298 ELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAR 357
Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
+RKL KE QT EL L K E+A++A
Sbjct: 358 EELRKL--KEEAEQTKAALETAELRLEAA------LKEAEAAKAA 394
Score = 33.8 bits (77), Expect = 0.20
Identities = 35/171 (20%), Positives = 66/171 (38%), Gaps = 37/171 (21%)
Query: 286 KRDERKKLADEEAMRKIKQL-------EDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
KR E A + +K+++L ++ + ++E E + + +E
Sbjct: 149 KRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRETYE 208
Query: 339 DMQEQNSRLLQQLREKDDANF---KLMT-------------------------ERIKSNQ 370
++ + L++L++ D KL E K+N
Sbjct: 209 KELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKAN- 267
Query: 371 LHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
L K A EEK +LR V++L ++E + +L EKE + ++ E EL
Sbjct: 268 LEKAAEEEK-SLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAEL 317
Score = 32.3 bits (73), Expect = 0.59
Identities = 26/139 (18%), Positives = 53/139 (38%), Gaps = 18/139 (12%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNE----MEVTGQAFEDM 340
S+++ + + E K++Q + + ++E EA E E Q +
Sbjct: 322 SEKESKAREKMVEIPLKLQQASSEAEE-------ARKEAEAAREELRKLKEEAEQTKAAL 374
Query: 341 QEQNSRLLQQLREKDDANF--KLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
+ RL L+E + A L IK+ Q + + + + + ++L +E
Sbjct: 375 ETAELRLEAALKEAEAAKAAEALALAEIKALQESEESAKA-----DSPRKITLSLEEYEE 429
Query: 399 AIRKLEEKERFLQTVLTNA 417
+K EE E + + A
Sbjct: 430 LSKKAEEAEELAEKKVAAA 448
Score = 32.3 bits (73), Expect = 0.67
Identities = 39/183 (21%), Positives = 77/183 (42%), Gaps = 8/183 (4%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQ-KQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+ K +++ K E A + ++LE +L+ ++ T E +++ E+ Q ++ +
Sbjct: 82 KAEKEEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALV 141
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
E+ L++ E A K +L K K++L E+ A L+ E
Sbjct: 142 EERDAALKRAEE---AICASKVNEKKVEELTKEIIAMKESL-ERAHAAHLEAEEERIGAA 197
Query: 402 KLEEKER-FLQTVLTNAEKELHLRNQAMDLNK--RKAIESAQSAADLKLHLVIIYTNGPA 458
++++R + L AEKEL Q +D K K E++ L+ + I+ +
Sbjct: 198 LEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASV 257
Query: 459 RLE 461
E
Sbjct: 258 LKE 260
Score = 31.1 bits (70), Expect = 1.8
Identities = 37/163 (22%), Positives = 66/163 (40%), Gaps = 24/163 (14%)
Query: 292 KLADEEAMRKIKQL---EDQTHQL-------QKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
LA EEA + K+L E + Q+ ++ V K + E E + Q E +
Sbjct: 39 MLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEKEEQQAKQDSELAK 98
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
+ L Q ++E + + T + S +EE +R++ AL +E A++
Sbjct: 99 LRAEELEQGIQELEVERYITATAELDS------VKEELRKIRQEYDAL---VEERDAALK 149
Query: 402 KLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
+ EE + E EL AM ++++E A +A
Sbjct: 150 RAEEAICASKVNEKKVE-ELTKEIIAM----KESLERAHAAHL 187
>gnl|CDD|113156 pfam04375, HemX, HemX. This family consists of several bacterial
HemX proteins. The hemX gene is not essential for haem
synthesis in B. subtilis. HemX is a polytopic membrane
protein which by an unknown mechanism down-regulates the
level of HemA.
Length = 372
Score = 34.5 bits (79), Expect = 0.14
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 8/85 (9%)
Query: 335 QAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQAL 389
Q +Q + L L QL + +A + + LA+ EK L + + L
Sbjct: 53 QQVAGLQIKLEALAQELTQLAQALEAQLAQEVRVLIRDNQEALAQLEKVLGQLDKSLAEL 112
Query: 390 SLQIEAMHTAIRK---LEEKERFLQ 411
++ A+ A R L E + L+
Sbjct: 113 QDRVAAISGANRNDWLLAEADFLLK 137
>gnl|CDD|151039 pfam10473, Cenp-F_leu_zip, Leucine-rich repeats of kinetochore
protein Cenp-F/LEK1. Cenp-F, a centromeric kinetochore,
microtubule-binding protein consisting of two
1,600-amino acid-long coils, is essential for the full
functioning of the mitotic checkpoint pathway. There are
several leucine-rich repeats along the sequence of LEK1
that are considered to be zippers, though they do not
appear to be binding DNA directly in this instance.
Length = 140
Score = 32.8 bits (75), Expect = 0.14
Identities = 28/136 (20%), Positives = 60/136 (44%), Gaps = 13/136 (9%)
Query: 276 RKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQK-------QVATHKQEEEALLN 328
+K V+ K ER+ ++ K++ LE + ++ + K E E L
Sbjct: 3 KKQLHVLEKLKESEREADLLKD---KVENLERELEMSEENQELVILEAENSKAEVETLKT 59
Query: 329 EMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA 388
++E ++ +D++ L+ EK++ +L ++ + ++L KL ++ L E+ Q
Sbjct: 60 KIEEMAESLKDLE---LDLVTVRSEKENLTKQLQEKQERVSELEKLNSSTENLLEEKEQE 116
Query: 389 LSLQIEAMHTAIRKLE 404
E TA+ L+
Sbjct: 117 KIQMKEESKTAVEMLQ 132
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 33.7 bits (78), Expect = 0.15
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 3/63 (4%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL---LNEMEVTGQAFEDMQEQNSRLLQQLR 352
+ +++I+ L QLQ+ VA +QE +L + ++E T Q + + L+Q
Sbjct: 60 RQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQIEQIEKTRQGLVPLMYRMIDGLEQFV 119
Query: 353 EKD 355
D
Sbjct: 120 ALD 122
Score = 29.5 bits (67), Expect = 4.0
Identities = 12/63 (19%), Positives = 28/63 (44%)
Query: 367 KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
++ ++++ D +Q Q L +I + I L LQ ++ N ++E+ Q
Sbjct: 33 QTVAAAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQ 92
Query: 427 AMD 429
++
Sbjct: 93 QIE 95
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 33.6 bits (77), Expect = 0.15
Identities = 26/131 (19%), Positives = 53/131 (40%), Gaps = 21/131 (16%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
++ + L+ + ++ +E L+K EAL E+ + E + E +
Sbjct: 17 SEKEELLSSTDYGDDLESVEAL---LKKH--------EALEAELAAHEERVEALNELGEQ 65
Query: 347 LLQQLREKDDANFKLMTERIKS-----NQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
L+++ + + ER++ +L +LA E + L E + +A +
Sbjct: 66 LIEEGHPDAEE----IQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD-LEQ 120
Query: 402 KLEEKERFLQT 412
LEEKE L +
Sbjct: 121 WLEEKEAALAS 131
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 33.0 bits (76), Expect = 0.15
Identities = 28/104 (26%), Positives = 47/104 (45%), Gaps = 12/104 (11%)
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
LLQQ D F+ E L KL E + L+ ++ L Q+E + + L+
Sbjct: 40 SLLQQRDR--DLEFRESLEE----TLRKL-EAEIERLQNTIERLKTQLEDLERELALLQA 92
Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNK-RKAIES--AQSAADLK 446
KER L+ L E+ L+N+ ++ + + I+ Q +LK
Sbjct: 93 KERQLEKKLKTLEQ--KLKNEKEEVQRLKNIIQQRKTQYNHELK 134
Score = 29.6 bits (67), Expect = 2.4
Identities = 21/102 (20%), Positives = 51/102 (50%), Gaps = 19/102 (18%)
Query: 290 RKKLADEEAMRK-IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
RK A+ E ++ I++L+ Q L++++A + +E L +++ L
Sbjct: 60 RKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQL--------------EKKLKTLE 105
Query: 349 QQLR-EKDDANFKLMT-ERIKSNQLHKLAREEK--DTLREQV 386
Q+L+ EK++ ++ K+ H+L + ++ + L+E++
Sbjct: 106 QKLKNEKEEVQRLKNIIQQRKTQYNHELKKRDREIEKLKERL 147
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 34.3 bits (79), Expect = 0.16
Identities = 37/173 (21%), Positives = 56/173 (32%), Gaps = 33/173 (19%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+RD+ K+LA E K+LE + +L+ ++ + E E L A E Q
Sbjct: 276 ERDKIKQLASE----LEKKLEKELKKLENKLEKQEDELEELEKA------AEEL--RQKG 323
Query: 346 RLLQQ--------LREKDDANFKLMTERIKSNQLHKLAREEKDT-------LREQVQALS 390
LL L+ A+F E K E L+ L
Sbjct: 324 ELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLD 383
Query: 391 LQIEAMHTAIRKLEEKERFLQT-----VLTNAEKELHLRNQ-AMDLNKRKAIE 437
Q+ + AI E + L+ + +EL KRK E
Sbjct: 384 RQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKE 436
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 33.8 bits (78), Expect = 0.16
Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 23/168 (13%)
Query: 284 DSKRDERKKLAD-----EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL-LNEMEVTGQAF 337
D ++K ++A +++++L + L+K A E L L+++E
Sbjct: 31 DEIDKSKQKADQYQKQIDDAPKELRELRQELEALKKTDAPVFPELANLSLSQLE------ 84
Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
+ + + S+L + + N +L+ + + + A+++ R ++Q +I
Sbjct: 85 QRLAQTLSQLQELQEQLQQENSQLIELQTRPER----AQQQLSEARRRLQ----EIRNRL 136
Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADL 445
A+ Q L A EL ++ + S + +L
Sbjct: 137 QALSPGGTPLAQAQRTLLQA--ELAALKAQIEEL-ELELLSNNNRQEL 181
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 33.7 bits (77), Expect = 0.16
Identities = 19/85 (22%), Positives = 39/85 (45%), Gaps = 8/85 (9%)
Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
E++Q++ LL++L E + A ++ + ER+K E L E ++ L ++ +
Sbjct: 144 LEELQKEKEELLKELEELE-AEYEEVQERLKR------LEVENSRLEEMLKKLPGEVYDL 196
Query: 397 HTAIRKLEEKERFLQTVL-TNAEKE 420
+LE + L ++ KE
Sbjct: 197 KKRWDELEPGVELPEEELISDLVKE 221
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 34.5 bits (79), Expect = 0.17
Identities = 32/163 (19%), Positives = 64/163 (39%), Gaps = 41/163 (25%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
R+E+++ + A K K+ E +L+K++ ++E E N
Sbjct: 65 REEQEQKIQDAATEKTKEWEKVKAELEKKLLELQKELARAAAE--------------NDA 110
Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
L + L+EK +L EEK +++ L ++E+ L+ +
Sbjct: 111 LSRSLQEKSKMIMEL--------------SEEKSRAESEIEDLKARLESAEKENSSLKYE 156
Query: 407 ERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
L KEL +RN+ +++ ++ ++A K HL
Sbjct: 157 VHVLS-------KELEIRNEERNMS-LQSADAAS-----KQHL 186
>gnl|CDD|214436 MTH00151, ND5, NADH dehydrogenase subunit 5; Provisional.
Length = 565
Score = 34.0 bits (79), Expect = 0.18
Identities = 16/76 (21%), Positives = 22/76 (28%), Gaps = 7/76 (9%)
Query: 180 CFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFSLT---S 236
S FS S S +F + + SS + D SL+
Sbjct: 6 LSSFLLFSYSVFLFPLLGYFGLNGISLIF---EWELLSVSSCSFSFELVLDWVSLSFSGV 62
Query: 237 TLFVSICTSAFQMSFY 252
+S C F S Y
Sbjct: 63 VCLISGCVFLF-SSSY 77
Score = 29.0 bits (66), Expect = 6.3
Identities = 16/74 (21%), Positives = 21/74 (28%), Gaps = 3/74 (4%)
Query: 132 CFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTV 191
S FS S S +F + + SS + S S V
Sbjct: 6 LSSFLLFSYSVFLFPLLGYFGLNGISLIF---EWELLSVSSCSFSFELVLDWVSLSFSGV 62
Query: 192 FVYHCFSIFRFSSS 205
+F FSSS
Sbjct: 63 VCLISGCVFLFSSS 76
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 33.6 bits (77), Expect = 0.20
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 281 VVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVAT 318
+ ++ R + K +EEA++KI LED+ L+ Q+A
Sbjct: 103 LRQEEPRLQSLKPPNEEALQKISALEDELAFLRAQIAK 140
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 33.3 bits (76), Expect = 0.21
Identities = 31/135 (22%), Positives = 57/135 (42%), Gaps = 9/135 (6%)
Query: 307 DQTHQLQKQVATHKQEEEALLNEMEVTGQAFED-MQEQNSRLLQQLREKDDANFKLMTER 365
+Q QL + K E +AL E + + E + KL R
Sbjct: 24 EQYRQLADRTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAKL---R 80
Query: 366 IKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRN 425
+ +LHK R E L +++ L+ ++E + I++LE+ L++ +T+ E E+
Sbjct: 81 EELTELHK-KRGE---LAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLR 136
Query: 426 QAMDLNKRKAIESAQ 440
+ + K K E+ Q
Sbjct: 137 EELQ-EKEKDNETLQ 150
Score = 29.0 bits (65), Expect = 4.3
Identities = 24/137 (17%), Positives = 57/137 (41%), Gaps = 24/137 (17%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
+ KL +E K+ E L + E E + +++ + + L
Sbjct: 74 QKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTI-----AELRSEITSL 128
Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK- 406
++R+ + + E+ K N+ TL++++ +L++++ A+ +RKL+++
Sbjct: 129 ETEIRDLREE----LQEKEKDNE----------TLQDELISLNIELNALEEKLRKLQKEN 174
Query: 407 ----ERFLQTVLTNAEK 419
ER++ AE
Sbjct: 175 QELVERWMAKKGQEAEA 191
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 34.2 bits (79), Expect = 0.21
Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 7/75 (9%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE--DM 340
+ + + LA E K ++LE Q QLQ++ A + +E E+T QA + ++
Sbjct: 175 QQQELVALEGLA-AELEEKQQELEAQLEQLQEKAA--ETSQERKQKRKEITDQAAKRLEL 231
Query: 341 QEQNSRLL--QQLRE 353
E+ +R+L QQLR+
Sbjct: 232 SEEETRILIDQQLRK 246
Score = 31.8 bits (73), Expect = 1.1
Identities = 25/108 (23%), Positives = 45/108 (41%), Gaps = 12/108 (11%)
Query: 306 EDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT-- 363
E+ H LQ++V T KQ+ E E + E Q++ L E ++ +L
Sbjct: 141 ENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQL 200
Query: 364 ERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKE-RFL 410
E+++ ++E K +E + ++E L E+E R L
Sbjct: 201 EQLQEKAAET-SQERKQKRKEITDQAAKRLE--------LSEEETRIL 239
>gnl|CDD|236745 PRK10722, PRK10722, hypothetical protein; Provisional.
Length = 247
Score = 33.4 bits (77), Expect = 0.21
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 11/81 (13%)
Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
L Q R+ L ER + +L + + E D LR+Q Q L Q+E RKLE
Sbjct: 146 LYQLWRDGQALQLALAEERQRYQKLQQSSDSELDALRQQQQRLQYQLE---LTTRKLEN- 201
Query: 407 ERFLQTVLTNAEKELHLRNQA 427
LT+ E++L R QA
Sbjct: 202 -------LTDIERQLSSRKQA 215
>gnl|CDD|189017 cd09610, M3B_PepF_5, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7 and
17 amino acids with fairly broad specificity. The PepF
gene is duplicated in L. lactis on the plasmid that
bears it, while a shortened second copy is found in
Bacillus subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 472
Score = 33.7 bits (78), Expect = 0.22
Identities = 36/148 (24%), Positives = 56/148 (37%), Gaps = 29/148 (19%)
Query: 289 ERKKLADE--EAMRKIKQLEDQTHQLQKQ--VATH--KQEEEALLNEMEVTGQ-AFEDMQ 341
E KL + A+ I +L D H L+K A H + EE +LNE TG+ A+
Sbjct: 7 EWAKLDEAKAAALLAIPELADYRHYLEKVRRYAPHTLSEPEEKILNEKSPTGRGAWV--- 63
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
LR + D + E + L R +R++ A +
Sbjct: 64 RLFDETTSSLRFEFDGEKLTLEE------ILALLRSPDREVRKK---------AAKALTK 108
Query: 402 KLEEKERFL----QTVLTNAEKELHLRN 425
L++ R L TV+ + + E LR
Sbjct: 109 GLKKNSRTLTLIYNTVVKDKDIEDRLRG 136
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 34.1 bits (78), Expect = 0.23
Identities = 29/177 (16%), Positives = 64/177 (36%), Gaps = 18/177 (10%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE- 342
K K + +++ + K LE Q + ++ K + E L ME +A+ED+++
Sbjct: 20 AVKNINVLKSSIKDSTQFWKMLEKQQKLTKDGLSASKGKYEGLSEAMEKQKKAYEDLKQE 79
Query: 343 --QNSRLLQQLREKDDANFKLMTE-----RIKSNQL------HKLAREEKDTLREQVQAL 389
+ +R Q ++ T+ R S Q L ++E L+ + L
Sbjct: 80 VKEVNRATQASKKAYQEYNAQYTQAENKLRSLSGQFGVASEQLMLQQKEIQRLQYAISTL 139
Query: 390 S----LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
+ Q + K + + + + ++ N+ ++ K+ A
Sbjct: 140 NKSMAAQARLLEQTGNKFGTADAKVVGLRESFGRQTEALNKQLERTKKVADALTYVL 196
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.9 bits (77), Expect = 0.23
Identities = 38/172 (22%), Positives = 71/172 (41%), Gaps = 16/172 (9%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEV----TGQAFED 339
D ++ K++ E + KI +L+++ L+ + +++ L +ME T + D
Sbjct: 248 DPLKNRLKEI--EHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLND 305
Query: 340 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTA 399
+ + R +REK+ E K N+ +L +EK L + L LQ +
Sbjct: 306 LYHNHQR---TVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEH 362
Query: 400 IRKLEEKERFLQTVLT------NAEKELHLRNQAMDLNKRKAIESAQSAADL 445
IR + + L T L E ++N L + + A++AA L
Sbjct: 363 IRARDSLIQSLATRLELDGFERGPFSERQIKN-FHTLVIERQEDEAKTAAQL 413
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 33.4 bits (77), Expect = 0.24
Identities = 28/144 (19%), Positives = 59/144 (40%), Gaps = 18/144 (12%)
Query: 293 LADEEAMRKIK--QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
LAD ++ + L+D L+ ++ + E L + A + + L Q+
Sbjct: 65 LADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQE 124
Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFL 410
L + Q+ A + + L +Q+ AL Q+ A+ A+ E+++R
Sbjct: 125 LDSEK--------------QVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRES 170
Query: 411 QTVLTNAEKELH--LRNQAMDLNK 432
Q + + + L+ L + +LN+
Sbjct: 171 QAKIADLGRRLNVALAQRVQELNR 194
Score = 29.2 bits (66), Expect = 5.6
Identities = 29/134 (21%), Positives = 50/134 (37%), Gaps = 22/134 (16%)
Query: 261 SLSDSASNIYGSLEFRKFACVVRDSKRDERKKL--ADEEAMRKIKQLEDQTHQLQKQVAT 318
L DS +N+ SL + ER +L E E + +L +++ +
Sbjct: 78 DLQDSVANLRASL----------SAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127
Query: 319 HKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR--EKDDANFKLMTERIKSNQLHKLAR 376
KQ L ++E+ Q ++ Q + L L EK D ++ L R
Sbjct: 128 EKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRE--------SQAKIADLGR 179
Query: 377 EEKDTLREQVQALS 390
L ++VQ L+
Sbjct: 180 RLNVALAQRVQELN 193
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 30.6 bits (70), Expect = 0.25
Identities = 18/44 (40%), Positives = 29/44 (65%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEV 332
ERKK EE RK++QLE + +L+K++ ++E E L +E+E
Sbjct: 22 ERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELEE 65
Score = 28.7 bits (65), Expect = 0.96
Identities = 12/61 (19%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 287 RDERKKLADEEAMRK-IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+ R++ + EA R+ ++ + + +L+++V + E E L E+E + E ++ +
Sbjct: 5 KRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELE 64
Query: 346 R 346
Sbjct: 65 E 65
>gnl|CDD|238345 cd00632, Prefoldin_beta, Prefoldin beta; Prefoldin is a hexameric
molecular chaperone complex, composed of two
evolutionarily related subunits (alpha and beta), which
are found in both eukaryotes and archaea. Prefoldin
binds and stabilizes newly synthesized polypeptides
allowing them to fold correctly. The hexameric
structure consists of a double beta barrel assembly with
six protruding coiled-coils. The alpha prefoldin
subunits have two beta hairpin structures while the beta
prefoldin subunits (this CD) have only one hairpin that
is most similar to the second hairpin of the alpha
subunit. The prefoldin hexamer consists of two alpha and
four beta subunits and is assembled from the beta
hairpins of all six subunits. The alpha subunits
initially dimerize providing a structural nucleus for
the assembly of the beta subunits. In archaea, there is
usually only one gene for each subunit while in
eukaryotes there two or more paralogous genes encoding
each subunit adding heterogeneity to the structure of
the hexamer.
Length = 105
Score = 31.6 bits (72), Expect = 0.25
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 23/117 (19%)
Query: 308 QTHQLQKQ---VATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDAN-FKLMT 363
Q QLQ+Q +Q+ EA LNE +A E+++ + DDA +KL+
Sbjct: 7 QLQQLQQQLQAYIVQRQKVEAQLNE---NKKALEELE----------KLADDAEVYKLV- 52
Query: 364 ERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
N L K +EE + L+E+++ + L+I+ + L+EK + LQ + A+K
Sbjct: 53 ----GNVLVKQEKEEARTELKERLETIELRIKRLERQEEDLQEKLKELQEKIQQAQK 105
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 33.8 bits (77), Expect = 0.26
Identities = 44/184 (23%), Positives = 79/184 (42%), Gaps = 16/184 (8%)
Query: 281 VVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
V R E+ + A + K ++LE+Q + Q + E ALL + Q ++
Sbjct: 166 VSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILEL 225
Query: 341 QEQNSRLLQQLREKD-------DANFKLMTERIKSNQ-LHKLAREEKDTLREQVQALSLQ 392
++ L Q+ +E D D +L + + Q L K+ + +D E++ L L+
Sbjct: 226 EQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERID-LQLE 284
Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKEL----HLRNQAMDLNKRKAIESAQSA---ADL 445
E +H +R L+E+ Q +EL LR+ + + +ESAQ + + L
Sbjct: 285 NEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQMSLQLSQL 344
Query: 446 KLHL 449
L L
Sbjct: 345 NLAL 348
Score = 29.6 bits (66), Expect = 5.4
Identities = 30/138 (21%), Positives = 57/138 (41%), Gaps = 5/138 (3%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
DE L E+ + + + LQ Q+ ++E L+ + DM+ + +L
Sbjct: 117 DELVTLEKEDDGDDLLLVVPKATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQL 176
Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTL---REQVQALSLQIEA-MHTAIRKL 403
LR + +L + ++ + E++ L R + Q L++E + T +K
Sbjct: 177 ETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKK 236
Query: 404 EEKERFL-QTVLTNAEKE 420
+E +R L T AE E
Sbjct: 237 QENDRVLEGTQDIEAELE 254
Score = 29.2 bits (65), Expect = 6.6
Identities = 25/115 (21%), Positives = 46/115 (40%), Gaps = 13/115 (11%)
Query: 304 QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT 363
QL L++ + QE E L E + + + +L + L+E+ KL
Sbjct: 340 QLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQEERSQREKLEV 399
Query: 364 E--------RIKSNQLHKLAREEKDTLR----EQVQALSLQIEAMHTAIRKLEEK 406
E R++ ++ + E + LR E+ Q L + + + IR LE +
Sbjct: 400 ELGIEKDCNRVQLSENRRELSELRSALRVLQKEKEQ-LQEEKQELLDYIRVLELR 453
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 33.8 bits (77), Expect = 0.26
Identities = 24/155 (15%), Positives = 57/155 (36%), Gaps = 12/155 (7%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
++D+R+KL D R + + L++ V E+ + ++ + +QEQ +
Sbjct: 595 EKDDRRKLGDRSTYRLGSTNDAKVETLRETVKAMLSREDFYMIKIMRQQGEYIKLQEQ-A 653
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L ++ + A+ L + + E E++ + A L+
Sbjct: 654 NALAHIQALNFASIDLPSA--------QRQIAELQARLERLTHTQ---SDIAIAKAALDA 702
Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
+ + + ++E+ KR A+ S +
Sbjct: 703 AQTRQKVLERQYQQEVTECAGLKKDLKRAAMLSRK 737
Score = 30.4 bits (68), Expect = 3.0
Identities = 35/176 (19%), Positives = 69/176 (39%), Gaps = 22/176 (12%)
Query: 289 ERKKLADEEAMRKIKQLE---DQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
E E+ R+I L+ ++ +K + K AL + G + +EQ +
Sbjct: 245 EGAYEQVEDIKRQIHTLDPLVQLKNRREKAQQS-KDHANALKKALPTVGNRIKK-EEQET 302
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE----QVQALSLQIEAMHTAIR 401
+ Q E+ A K+ + +I++++ ++ D +++ Q LS + E R
Sbjct: 303 LVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQHGILSAKREGAVDKRR 362
Query: 402 KLEEKERFLQTVLT--------NAEKELHLRNQAM----DLN-KRKAIESAQSAAD 444
+ L ++ +AE+ L L N A + R+A+ESA
Sbjct: 363 TISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVDEYPAAREALESAGQRNV 418
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 33.8 bits (77), Expect = 0.26
Identities = 21/116 (18%), Positives = 55/116 (47%), Gaps = 6/116 (5%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
+ K + L+ +++ + V KQ+ + ++E E +E+ ++ L+
Sbjct: 298 KTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEE----IKALQSNI 353
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI--RKLEEKERF 409
D K + ++ S + +L +E++ L ++ +++Q + + ++ RKLE + F
Sbjct: 354 DELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIF 409
Score = 29.6 bits (66), Expect = 5.6
Identities = 31/137 (22%), Positives = 59/137 (43%), Gaps = 10/137 (7%)
Query: 295 DEEAMRKIK-QLEDQTHQLQKQVATHKQEEEALLN---EMEVTGQAFEDMQEQNSRLLQQ 350
E + +++K E H + +A K + + L E Q + ++E+ R L+
Sbjct: 251 YEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREK-WRALKS 309
Query: 351 LREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKER 408
K + M ++ + +L KL + E + E+++AL I+ +H +RK +
Sbjct: 310 DSNKYENYVNAMKQKSQEWPGKLEKL-KSEIELKEEEIKALQSNIDELHKQLRK--QGIS 366
Query: 409 FLQTVLTNAEKELHLRN 425
Q L N E+E R
Sbjct: 367 TEQFELMNQEREKLTRE 383
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 33.7 bits (78), Expect = 0.26
Identities = 26/140 (18%), Positives = 61/140 (43%), Gaps = 19/140 (13%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVT--GQAFEDMQEQNSRLLQQ--- 350
EA + +LE++T L++++ +E LL E++ Q E+++ + ++
Sbjct: 190 LEAREVLLKLEEETDALEQKM----EEIPPLLKELQNEFPDQ-LEELKAGYREMTEEGYH 244
Query: 351 LREKD-DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH-------TAIRK 402
D + + + E+I N L L + D E+ + + +I+ ++ A +
Sbjct: 245 FDHLDIEKELQDLKEQIDQN-LALLEELDLDEAEEENEEIEERIDTLYDILEKEVKAKKF 303
Query: 403 LEEKERFLQTVLTNAEKELH 422
+E+ L L +A ++
Sbjct: 304 VEKNIDKLTDFLEHAREQNK 323
Score = 33.7 bits (78), Expect = 0.27
Identities = 29/128 (22%), Positives = 60/128 (46%), Gaps = 24/128 (18%)
Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLREKDD 356
+++++LE Q QL +++A K L E+E + E+++++ L LQ LR KD+
Sbjct: 351 KQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIEKEQEELSESLQGLR-KDE 409
Query: 357 ANFKLMTERIKSNQLHKLAR---------------EEKDTLREQVQALSLQIEA----MH 397
+ + + +LH++ R E + ++++AL+ ++ M
Sbjct: 410 LEAREKLQEYR-QKLHEIKRYLEKSNLPGLPEDYLEYFFVVSDEIEALADELNEVPINMD 468
Query: 398 TAIRKLEE 405
R+LEE
Sbjct: 469 AVNRQLEE 476
Score = 33.3 bits (77), Expect = 0.37
Identities = 31/167 (18%), Positives = 68/167 (40%), Gaps = 39/167 (23%)
Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHK-------QEEEALLNEMEVTGQAFEDMQE---- 342
++E+ +++++L+D+ +L+K + E E L+E+E + F ++ E
Sbjct: 130 SEEKNRKEVEELKDKYRELRKTLLAKSFSYGPALDELEKQLDELEEEFEQFVELTESGDY 189
Query: 343 -QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS----------- 390
+ +L +L E+ DA + M E I L +E ++ +Q++ L
Sbjct: 190 LEAREVLLKLEEETDALEQKM-EEIP-----PLLKELQNEFPDQLEELKAGYREMTEEGY 243
Query: 391 --------LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
+++ + I + L L AE+E + +D
Sbjct: 244 HFDHLDIEKELQDLKEQIDQNLALLEELD--LDEAEEENEEIEERID 288
Score = 31.4 bits (72), Expect = 1.4
Identities = 30/152 (19%), Positives = 62/152 (40%), Gaps = 29/152 (19%)
Query: 296 EEAMRKIKQLEDQTH-QLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ----- 349
EE +I++ D + L+K+V K E+ ++ E +EQN +LL
Sbjct: 277 EEENEEIEERIDTLYDILEKEVKAKKFVEK----NIDKLTDFLEHAREQNKQLLLELDRL 332
Query: 350 ----QLREKD--------------DANFKLMTERIKSN-QLHKLAREEKDTLREQVQALS 390
L E + +A + + ERI + +EE + + +Q++ +
Sbjct: 333 QQSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIE 392
Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELH 422
+ E + +++ L + E + L ++LH
Sbjct: 393 KEQEELSESLQGLRKDELEAREKLQEYRQKLH 424
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 32.9 bits (76), Expect = 0.26
Identities = 32/127 (25%), Positives = 62/127 (48%), Gaps = 30/127 (23%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN---EMEVTGQAFEDMQEQ 343
++ R +L +E K+L + L ++ + +++EE+L E+ Q E+ +E+
Sbjct: 74 KERRNELQRQE-----KRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEE 128
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTA--- 399
L+++ +++ ERI S L EE K+ L E+V+A E H A
Sbjct: 129 LEELIEEQQQE--------LERI-SG----LTAEEAKEILLEEVEA-----ELKHEAAKM 170
Query: 400 IRKLEEK 406
I+++EE+
Sbjct: 171 IKEIEEE 177
>gnl|CDD|219189 pfam06818, Fez1, Fez1. This family represents the eukaryotic Fez1
protein. Fez1 contains a leucine-zipper region with
similarity to the DNA-binding domain of the
cAMP-responsive activating-transcription factor 5. There
is evidence that Fez1 inhibits cancer cell growth
through regulation of mitosis, and that its alterations
result in abnormal cell growth. Note that some family
members contain more than one copy of this region.
Length = 200
Score = 32.6 bits (74), Expect = 0.29
Identities = 31/137 (22%), Positives = 56/137 (40%), Gaps = 6/137 (4%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
EA ++++ E+Q +LQ + T E E NE++ E ++E+ RL +L E +
Sbjct: 42 EARAQLRESEEQLQELQDALRTKTLELEVCENELQRKKSEAELLREKAGRLEAELAELRE 101
Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI-----RKLEEKERFLQ 411
A A+ ++ ++ L Q+E + + R+ E+ F Q
Sbjct: 102 ALASCPAPLPVLLTESDEAKAQRQAGEAGLRELRRQVERLRAELRYERRRREEQASSFEQ 161
Query: 412 TVLT-NAEKELHLRNQA 427
EKE +R Q
Sbjct: 162 ERRVWQEEKEKVIRYQK 178
>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
His-Domain type N23 protein tyrosine phosphatase and
related domains. This family contains the V-shaped (V)
domain of mammalian His-Domain type N23 protein tyrosine
phosphatase (HD-PTP, also known as PTPN23) and related
domains. It belongs to the V_Alix_like superfamily which
includes the V domains of Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, mammalian Alix
(apoptosis-linked gene-2 interacting protein X/ also
known as apoptosis-linked gene-2 interacting protein 1,
AIP1), and related domains. HD_PTP interacts with the
ESCRT (Endosomal Sorting Complexes Required for
Transport) system, and participates in cell migration
and endosomal trafficking. The related Alix V-domain
(belonging to a different family in this superfamily)
contains a binding site, partially conserved in the
superfamily, for the retroviral late assembly (L) domain
YPXnL motif. The Alix V-domain is also a dimerization
domain. In addition to the V-domain, HD_PTP also has an
N-terminal Bro1-like domain, a proline-rich region
(PRR), a catalytically inactive tyrosine phosphatase
domain, and a region containing a PEST motif. Bro1-like
domains bind components of the ESCRT-III complex,
specifically to CHMP4 in the case of HD-PTP. The
Bro1-like domain of HD-PTP can also bind human
immunodeficiency virus type 1 (HIV-1) nucleocapsid.
HD-PTP is encoded by the PTPN23 gene, a tumor suppressor
gene candidate frequently absent in human kidney,
breast, lung, and cervical tumors. This family also
contains Drosophila Myopic, which promotes epidermal
growth factor receptor (EGFR) signaling, and
Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
promotes Notch signaling.
Length = 337
Score = 33.0 bits (76), Expect = 0.30
Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 43/158 (27%)
Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE---KDDANFKL 361
L++ +L + E+EA+ E++ +M++Q L QQLR+ +DD KL
Sbjct: 168 LDELQKKLPSPSLLDRPEDEAIEKELKRILNKVNEMRKQRRSLEQQLRDAIHEDDITSKL 227
Query: 362 MT------ERIKSNQLHK---------------------------------LAREEKDTL 382
+T E + +L K A E
Sbjct: 228 VTTTGGDMEDLFKEELKKHDQLVNLIEQNLAAQENILKALTEANAKYAPVRKALSETKQK 287
Query: 383 RE-QVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
RE + +L EA ++K ++ F + + N K
Sbjct: 288 RESTISSLIASYEAYEDLLKKSQKGIDFYKKLEGNVSK 325
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 33.5 bits (76), Expect = 0.32
Identities = 39/156 (25%), Positives = 67/156 (42%), Gaps = 7/156 (4%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
K E K ++E + ++ E + H L+ Q+ K EE L E+E E + +N
Sbjct: 429 KIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNI 488
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L + N +L E S+ +L + ++D + + Q E M I LEE
Sbjct: 489 ELTAHCDKLLLENKELTQE--ASDMTLELKKHQEDIINCKKQE-----ERMLKQIENLEE 541
Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQS 441
KE L+ L + +E + + K+ E+A+S
Sbjct: 542 KEMNLRDELESVREEFIQKGDEVKCKLDKSEENARS 577
Score = 33.2 bits (75), Expect = 0.42
Identities = 35/151 (23%), Positives = 66/151 (43%), Gaps = 15/151 (9%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
+DE K E+ K+LED LQ+ ++T K EE ++++ + + E+
Sbjct: 280 QDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEE----DLQIATKTICQLTEEKEA 335
Query: 347 LLQQLREKDDANFKLMTE-RIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
+++L + A+ ++TE + L +L R E+ L + L + + +LEE
Sbjct: 336 QMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEE 395
Query: 406 KERF----------LQTVLTNAEKELHLRNQ 426
+F L+ +L EK L + Q
Sbjct: 396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQ 426
Score = 32.8 bits (74), Expect = 0.58
Identities = 42/182 (23%), Positives = 82/182 (45%), Gaps = 23/182 (12%)
Query: 274 EFRKFACVVRDSKRDERKKLADEEAMRKI--KQLEDQTHQLQ-------------KQVAT 318
EF C + + R E+++L E KI +L+ ++ +L+ +++
Sbjct: 353 EFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKK 412
Query: 319 HKQEEEALLNEMEVTGQAFEDMQ--EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 376
E+E LL+E + + E+++ EQ L Q REK+ + ++ IK+++ H L
Sbjct: 413 ILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKE 472
Query: 377 -EEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKA 435
E+ T E+ + ++++ A H LE KE + ++ L L+ D+ K
Sbjct: 473 VEDLKTELEKEKLKNIELTA-HCDKLLLENKELTQEA----SDMTLELKKHQEDIINCKK 527
Query: 436 IE 437
E
Sbjct: 528 QE 529
Score = 30.5 bits (68), Expect = 2.5
Identities = 37/168 (22%), Positives = 86/168 (51%), Gaps = 14/168 (8%)
Query: 272 SLEFRKFACVVRDSKRDERKKLADEEAMRKI----KQLEDQTHQLQKQVATHKQEEEALL 327
++E +K + + + + + K + E ++KI ++L D+ Q +K K +E+ L+
Sbjct: 383 TMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELI 442
Query: 328 NEMEVTGQAFEDMQEQNSRLLQQ----LREKDDANFKLMTERIKSNQL----HKLAREEK 379
++ + D++ Q + + L+E +D +L E++K+ +L KL E K
Sbjct: 443 FLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENK 502
Query: 380 DTLREQVQALSLQIEAMHTAIRKLE-EKERFLQTVLTNAEKELHLRNQ 426
+ L ++ ++L+++ I + ++ER L+ + EKE++LR++
Sbjct: 503 E-LTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDE 549
Score = 30.1 bits (67), Expect = 3.8
Identities = 37/131 (28%), Positives = 61/131 (46%), Gaps = 12/131 (9%)
Query: 320 KQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD----DANFKLMTERIKSNQLH--- 372
K++ E + + E + D ++Q S LL Q+ EK+ D F L R K+NQL
Sbjct: 218 KEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277
Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHL-----RNQA 427
KL E L E+ L+ ++E + ++++ ++ L+ L A K + Q
Sbjct: 278 KLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQM 337
Query: 428 MDLNKRKAIES 438
+LNK KA S
Sbjct: 338 EELNKAKAAHS 348
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 33.6 bits (77), Expect = 0.32
Identities = 27/170 (15%), Positives = 64/170 (37%), Gaps = 13/170 (7%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEE--ALLNEMEVTGQAFEDMQEQNSR 346
E + A RK Q ++++ + + + +E G A + +
Sbjct: 803 ESAERELARAARKAAAAAAAWKQARRELERDAADLDLPTDPDALEAVGLALKRFGDHLHT 862
Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTL---REQVQALSLQIEAMHTAI--- 400
L +RE A + +R ++ + ARE + R + + SL++ + ++
Sbjct: 863 LEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM 922
Query: 401 -RKLEEKERFLQTVLTNAEKELHLRNQAM----DLNKRKAIESAQSAADL 445
++ + + L + +EL +A+ + R + A++ A L
Sbjct: 923 VDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATL 972
Score = 32.9 bits (75), Expect = 0.48
Identities = 29/179 (16%), Positives = 54/179 (30%), Gaps = 19/179 (10%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLE---DQTHQLQKQVATHKQ---EEEALLNE----ME 331
D+ R + L A + ++LE LQ A +Q E E+ L E ++
Sbjct: 312 EADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRLD 371
Query: 332 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK------DTLREQ 385
+D + + +QL A ++ + LA +E D R+
Sbjct: 372 EEAGRLDDAERELRAAREQLAR--AAERAGLSPAHTAEPDAALAAQELQELGALDARRQD 429
Query: 386 VQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
+ Q +R+ ++ + A + E A A
Sbjct: 430 ADRVIAQRSEQVALLRRRDDVADRAEATHAAA-RARRDELDEEAEQAAARAELADEAVH 487
>gnl|CDD|183473 PRK12361, PRK12361, hypothetical protein; Provisional.
Length = 547
Score = 33.4 bits (77), Expect = 0.34
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 15/100 (15%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQK-QVAT-HKQEEEALLNEMEVT---------GQAFED 339
L EE +++IKQ+ +T +L K Q+ K E+ LN + G +++
Sbjct: 202 PDLTVEEVLQQIKQIR-KTARLNKRQLRALEKMLEQGKLNIHKRAWLIANPVSGGGKWQE 260
Query: 340 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 379
EQ + ++L+ D KL T I + L K AR+
Sbjct: 261 YGEQ---IQRELKAYFDLTVKLTTPEISAEALAKQARKAG 297
>gnl|CDD|234669 PRK00153, PRK00153, hypothetical protein; Validated.
Length = 104
Score = 30.9 bits (71), Expect = 0.36
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 4/38 (10%)
Query: 299 MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
M ++ L Q Q+Q+++ QEE L +MEV G+A
Sbjct: 2 MGNMQNLMKQAQQMQEKMQK-MQEE---LAQMEVEGEA 35
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 33.0 bits (76), Expect = 0.36
Identities = 16/109 (14%), Positives = 45/109 (41%), Gaps = 18/109 (16%)
Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKL 361
I Q + +K + + + EA E + + Q++ ++++ + K
Sbjct: 192 ILQTDQALTAKEKAIEAERAKAEAAEAEQ----ELLREKQKEEEQMMEAQERSYQEHVKQ 247
Query: 362 MTERIKSNQ-----------LHKLAREE---KDTLREQVQALSLQIEAM 396
+ E++++ + HKL +E K+ + + ++L +I+ +
Sbjct: 248 LIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEIQDL 296
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 33.1 bits (75), Expect = 0.37
Identities = 30/124 (24%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 316 VATHKQEEEA--LLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE--------R 365
+ + K EE A L ++E + D+Q Q Q + +A K E +
Sbjct: 278 IMSSKIEELARGLRADIERVARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQ 337
Query: 366 IKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRN 425
+ + +LA EEK LR + L+ ++LE K+R + + EL +R
Sbjct: 338 AECARQTQLALEEKAALRAERDNLA----------KELEAKKREAEQLRM----ELAIRI 383
Query: 426 QAMD 429
A+D
Sbjct: 384 SALD 387
>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain. These
proteins contain several 22 residue repeats which form a
pair of alpha helices. This family includes:
Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
E.
Length = 191
Score = 32.2 bits (74), Expect = 0.38
Identities = 30/169 (17%), Positives = 67/169 (39%), Gaps = 34/169 (20%)
Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLM 362
+QL + +++ +E EAL E+ + ED++ + L +L+ K N + +
Sbjct: 19 EQLGPYAQEFWARLS---KETEALREEL---QKDLEDVRARLQPYLDELKAKVGQNLEEL 72
Query: 363 TERIKS--NQLHKLAREEKDTLREQVQALSLQIEAMHTAIR------------------- 401
+R+ +L K + + LR ++ + E + +R
Sbjct: 73 RQRLAPYAEELRKRLNRDAEELRRKLAPYA---EELRDRLRQNVEALRARLGPYVEELRQ 129
Query: 402 KLEEKERFLQTVLTNAEKELHLR-NQAMDLNKRKAIESAQSAADLKLHL 449
KL E+ L+ + +E + ++ ++ + K A+ DL+ L
Sbjct: 130 KLAERLEELKESVGPYAEEYKAQLSEQVEELREKLEPYAE---DLREKL 175
Score = 29.5 bits (67), Expect = 3.1
Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 27/133 (20%)
Query: 286 KRDE-RKKLADE-EAMRK---------IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG 334
DE + K+ E +R+ K+L +L++++A + +E L V
Sbjct: 57 YLDELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKLAPYAEELRDRL-RQNV-- 113
Query: 335 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIK--SNQLHKLAREEKDTLREQVQALSLQ 392
E ++ + +++LR+K + ER++ + A E K L EQV+ L +
Sbjct: 114 ---EALRARLGPYVEELRQK-------LAERLEELKESVGPYAEEYKAQLSEQVEELREK 163
Query: 393 IEAMHTAIR-KLE 404
+E +R KLE
Sbjct: 164 LEPYAEDLREKLE 176
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 31.8 bits (73), Expect = 0.39
Identities = 19/70 (27%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEE-EALLNEMEVTGQAFEDMQEQN 344
+ ++ K E +K K+L+ + +LQKQ AT +E +A E++ Q + Q+
Sbjct: 39 QLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAA 98
Query: 345 SRLLQQLREK 354
+ LQQ +++
Sbjct: 99 QQELQQKQQE 108
Score = 29.1 bits (66), Expect = 3.2
Identities = 12/86 (13%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 324 EALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLR 383
+ +L+E A + ++++ +L +L++K+ + ++++ L+ E + +
Sbjct: 24 QKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKE-LQKEEQKLQKQAA-TLSEEARKAKQ 81
Query: 384 EQVQALSLQIE-AMHTAIRKLEEKER 408
+++Q +++ A ++L++K++
Sbjct: 82 QELQQKQQELQQKQQAAQQELQQKQQ 107
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 31.5 bits (72), Expect = 0.39
Identities = 8/47 (17%), Positives = 28/47 (59%)
Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
++ ++L+++ +L++++ +K+ EE L + V + E+++ +
Sbjct: 39 KENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQK 85
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 31.7 bits (73), Expect = 0.39
Identities = 27/110 (24%), Positives = 51/110 (46%), Gaps = 16/110 (14%)
Query: 288 DER-KKLAD--EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-EDMQEQ 343
+ER KK+AD A R K+LE + + Q+A + E ++ + + E+ + +
Sbjct: 34 EERQKKIADGLAAAERAKKELELAQAKYEAQLAEARAEAAEIIEQAKKRAAQIIEEAKAE 93
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI 393
+++ + A ++ ER + AREE LR+QV L++
Sbjct: 94 AEAEAARIKAQ--AQAEIEQERKR-------AREE---LRKQVADLAVAG 131
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA
a protein that suppresses sigma54-dependent
transcription. The PspA protein, a negative regulator of
the Escherichia coli phage shock psp operon, is produced
when virulence factors are exported through secretins in
many Gram-negative pathogenic bacteria and its homologue
in plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to
the AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 32.4 bits (74), Expect = 0.39
Identities = 28/184 (15%), Positives = 62/184 (33%), Gaps = 35/184 (19%)
Query: 275 FRKFACVVRDSKRDERKKLADEEAM--RKIKQLEDQ----THQLQKQVATHKQEEEALLN 328
F++ ++R + + K D E M + I+ ++ + L + +A KQ E L
Sbjct: 3 FKRLLRLLRANIHEGLDKAEDPEKMLEQAIRDMQSELGKARQALAQVIARQKQLERKL-- 60
Query: 329 EMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--------NQLHKLAREEKD 380
+ E ++ ++ L + ++ + I + R +
Sbjct: 61 -----EEQKEQAKKLENKARAALTKGNEELAREALAEIATLEKQAEALETQLTQQRSAVE 115
Query: 381 TLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
LR+Q+ AL K+++ + + + +N S +
Sbjct: 116 QLRKQLAALE----------TKIQQLKAKKTALKARLKAAKA----QEAVNTSLGSASTE 161
Query: 441 SAAD 444
SA D
Sbjct: 162 SATD 165
>gnl|CDD|177136 MTH00064, ND6, NADH dehydrogenase subunit 6; Provisional.
Length = 151
Score = 31.8 bits (73), Expect = 0.40
Identities = 34/156 (21%), Positives = 56/156 (35%), Gaps = 32/156 (20%)
Query: 123 LSSTVFVYHCFSI-FRFSSSTVFVYHCFSIF--RFSSSTVFVYHCFSIFRFS-SSTVF-- 176
L +Y + F FSS V Y+C + SS + C+ ++ FS S +F
Sbjct: 3 LEVLFSLYFIVLVLFSFSSHPV--YYCVLLVVNSLISSLI----CYLVYGFSWYSLLFCL 56
Query: 177 -----VYHCF----------SIFRFSS-STVFVYHCFSIFRFSSSTVFVYHC---FSIFR 217
VY F S +S+ + V+ F VF FS +
Sbjct: 57 VYVGGVYILFIFVSVFSPNSSFVSYSNLGELGVFFFFFFGLLLCVLVFYSLVNVEFSFYL 116
Query: 218 FSSSTIFVYHCFDIFSLTSTLFVSICTSAFQMSFYK 253
+SS Y + + + +S S + S+Y+
Sbjct: 117 CTSSEGVFYLFLCLVLMFGFIVLSFIVS-SKTSYYR 151
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 33.0 bits (75), Expect = 0.40
Identities = 30/146 (20%), Positives = 66/146 (45%), Gaps = 10/146 (6%)
Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQL--R 352
+ E +I L L+ + K+E + L ++ A ++++ + +Q + R
Sbjct: 429 ESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSA----KQKDKQSMQSMEKR 484
Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIE---AMHTAIRKLEEKERF 409
K +A+ ++ E+ + + + EE+ R QA + + E ++ A + LE + +
Sbjct: 485 LKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKK 544
Query: 410 LQTVLTNAEKE-LHLRNQAMDLNKRK 434
L+ L E+E L +A +L K +
Sbjct: 545 LEHDLKLKEEECRMLEKEAQELRKYQ 570
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 31.8 bits (73), Expect = 0.43
Identities = 20/120 (16%), Positives = 54/120 (45%), Gaps = 23/120 (19%)
Query: 340 MQEQNSRLLQQLREKDDANFKL--------------MTERI--KSNQLHKLAREEKDTL- 382
++ +++ ++L +K++ L + E+I ++ +L +L + +T+
Sbjct: 18 LKHTLAKIQEKLEQKEELGEGLHLIDFEQLQIENQALNEKIEERNKELKRLKLKYTNTVH 77
Query: 383 -----REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL-HLRNQAMDLNKRKAI 436
+E++ L ++E + +++ EE+ L+ L + E LR Q L ++ +
Sbjct: 78 ILTHLKEKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLRQQGGL 137
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 33.1 bits (75), Expect = 0.43
Identities = 30/135 (22%), Positives = 60/135 (44%), Gaps = 21/135 (15%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
R+ +ER++LA+ +QLE + + Q+ +A+ + EA GQ ++ E
Sbjct: 509 REQGEEERQRLAEVA-----QQLERELQESQESLASAGAQLEAAR-----AGQ--QESTE 556
Query: 343 QNSRLLQQLREKDDANFKLMTERIKS------NQLHKLAREEKDTLREQVQALSLQIEAM 396
+ + L Q+L ++ + + + E++ QL R + RE +A+ +
Sbjct: 557 EAASLRQELTQQQEVYGQALQEKVAEVETRLREQLSDTERRLNEARREHAKAVVSLRQIQ 616
Query: 397 HTAIRKLEEKERFLQ 411
A + EKER +
Sbjct: 617 RQAAQ---EKERNQE 628
Score = 33.1 bits (75), Expect = 0.45
Identities = 26/118 (22%), Positives = 52/118 (44%), Gaps = 27/118 (22%)
Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE-DMQEQNSRLLQQ 350
K + QL+ Q +LQ++V + QE+ L ++ + E +++ NS++LQ
Sbjct: 323 KAQELSHADSTCQLQKQVAELQEEVTSQSQEQAILQRSLQ--DKEAEVEVERMNSKVLQG 380
Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA----LSLQIEAMHTAIRKLE 404
+ A+E + L++QV + L L +EA+H++ + L
Sbjct: 381 ELSR--------------------AQEARRRLQQQVASAEEQLQLVVEAVHSSQQGLL 418
Score = 32.7 bits (74), Expect = 0.49
Identities = 30/114 (26%), Positives = 47/114 (41%), Gaps = 11/114 (9%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
R LA E +RK LE+ Q ++ QE+ + L + E S L
Sbjct: 121 RAALAGAEEVRK--NLEEGGQQELEEAQRLHQEQLSSLTQA---------HLEALSSLRS 169
Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
+ + + L T R + A+ E DTLREQ+ ++EA T + +L
Sbjct: 170 KAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEAQVTLVEQL 223
Score = 30.0 bits (67), Expect = 3.2
Identities = 30/146 (20%), Positives = 66/146 (45%), Gaps = 11/146 (7%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
E + ++ LE + K +A + E + L ++ T E+++ Q + L++QLR
Sbjct: 172 EGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQ---EELEAQVT-LVEQLR--- 224
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
K + E++ + + E+ L + V+ L A+ + L+ + + L +L
Sbjct: 225 ----KYVGEQVVAESRSQAWELERGELLQTVKHLQEDRAALQATVELLQVRVQSLTHILA 280
Query: 416 NAEKELHLRNQAMDLNKRKAIESAQS 441
E+EL + Q +D+ + + +S
Sbjct: 281 LQEEELTRKAQPVDILEPEFARKCRS 306
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 32.7 bits (74), Expect = 0.44
Identities = 35/196 (17%), Positives = 77/196 (39%), Gaps = 24/196 (12%)
Query: 262 LSDSASNI-YGSLEFRKFACVVR---DSKRDERKKLAD------EEAMRKIKQLEDQTHQ 311
LS + S++ S+ +K +R + + R+ + D +E ++ +QL+++ +
Sbjct: 165 LSGNVSDVDTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDK 224
Query: 312 LQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ----LREKDDANFKLMTERIK 367
Q +Q+ + Q D Q R QQ L + D + +++
Sbjct: 225 KQIDADKAQQKAD--------FAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVA 276
Query: 368 SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
NQ ++ + + + + +AL + ++ + E+ + A+K+ A
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK--REPVA 334
Query: 428 MDLNKRKAIESAQSAA 443
DL K K AQ +
Sbjct: 335 EDLQKTKPQVEAQPTS 350
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 33.0 bits (75), Expect = 0.45
Identities = 32/194 (16%), Positives = 62/194 (31%), Gaps = 21/194 (10%)
Query: 285 SKRDERKKLADEEAMR-----KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG--QAF 337
S+ + + + L H LQ+Q T Q+ ++L E+++ QA
Sbjct: 352 SQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQAT 411
Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
D + R LQ +L + + Q L+ +
Sbjct: 412 IDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAA---------ITCTAQCEKLEKIHLQ 462
Query: 398 TAIRKLEEKERFLQTV--LTNAEKELH-LRNQAMDLNKRKAIESAQSAADLKLHLVIIYT 454
+ + L+E+E+ LQT + E + + + + S I
Sbjct: 463 ESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDN 522
Query: 455 NGP--ARLEFGKDR 466
GP R++ G+
Sbjct: 523 PGPLTRRMQRGEQT 536
Score = 32.2 bits (73), Expect = 0.69
Identities = 29/197 (14%), Positives = 62/197 (31%), Gaps = 31/197 (15%)
Query: 275 FRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG 334
F V + E + E+ TH L+ A QE + + + +
Sbjct: 764 FNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQC 823
Query: 335 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQALSLQ 392
E + ++ + L +L E KS L ++ + + +Q+ L+ +
Sbjct: 824 ---ETLVQEEEQFLSRLEE-------------KSATLGEITHQLLKYEECSKQLAQLTQE 867
Query: 393 IEAMHTAIRKLE---------EKERFLQTVL-TNAEKELHLRNQA---MDLNKRKAIESA 439
+ KL + + ++ + + L NQ+ + +A
Sbjct: 868 QAKIIQLSDKLNGINQIKIQFDGDALIKFLHEITLYANVRLANQSEGRFHGRYADSHVNA 927
Query: 440 QSAADLKLHLVIIYTNG 456
+ L L + YT
Sbjct: 928 RKYQGLALLVADAYTGS 944
Score = 31.9 bits (72), Expect = 1.0
Identities = 24/166 (14%), Positives = 55/166 (33%), Gaps = 9/166 (5%)
Query: 268 NIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL 327
N++ ++ + A + K+ K ++ L T + KQ E L
Sbjct: 171 NLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQL--LTLCTPCMPDTYHERKQVLEKEL 228
Query: 328 NEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQL------HKLAREEKDT 381
+ Q + ++ + E+ L R + +L + +E +
Sbjct: 229 KHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERIN- 287
Query: 382 LREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
+ L+ I+A+ ++ + LQ+ + + K L R
Sbjct: 288 RARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAH 333
Score = 28.8 bits (64), Expect = 8.9
Identities = 24/194 (12%), Positives = 50/194 (25%), Gaps = 32/194 (16%)
Query: 279 ACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEM-----EVT 333
C + K ++ +++++ +Q Q+ Q K A L E+ +
Sbjct: 448 TCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLC 507
Query: 334 GQAFEDMQ------------------EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA 375
G EQ L+ E ++ S +
Sbjct: 508 GSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLK----- 562
Query: 376 REEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKA 435
E+ +++ L+ I L+ LQ + EK +
Sbjct: 563 -EQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLT---EKLSEAEDMLACEQHALL 618
Query: 436 IESAQSAADLKLHL 449
+ + L
Sbjct: 619 RKLQPEQDLQDVRL 632
>gnl|CDD|219487 pfam07611, DUF1574, Protein of unknown function (DUF1574). A
family of hypothetical proteins in Leptospira
interrogans.
Length = 345
Score = 32.5 bits (74), Expect = 0.46
Identities = 21/100 (21%), Positives = 39/100 (39%), Gaps = 11/100 (11%)
Query: 167 IFRFSS-STVFVYHCFSIFRFSSSTV---FVYHCFSIFRFSSSTVFVYHCFSIFRFSSST 222
++ FSS VY+ + + + +S V + S +F+S++ S +S
Sbjct: 91 LYNFSSPVATPVYYLYWLEKIASGGVKPDLIVIESSPEQFNSNSTTFKD--SNLAYSFDP 148
Query: 223 IFVYHCFDIFSLT--STLFVSICTSAFQMSFYKSRWFDFR 260
IF+ D S + F + F S+ K R +
Sbjct: 149 IFILRYADTLSKEDVNYYFAN---RLFGTSYNKPRIKNVI 185
Score = 31.4 bits (71), Expect = 1.2
Identities = 15/84 (17%), Positives = 30/84 (35%), Gaps = 4/84 (4%)
Query: 115 DSSFGIFRLSSTVFVYHCFSIFRFSS-STVFVYHCFSIFRFSSSTV---FVYHCFSIFRF 170
S F Y + ++ FSS VY+ + + + +S V + S +F
Sbjct: 71 SSRMLYFSNDDIEKKYPDWDLYNFSSPVATPVYYLYWLEKIASGGVKPDLIVIESSPEQF 130
Query: 171 SSSTVFVYHCFSIFRFSSSTVFVY 194
+S++ + F + Y
Sbjct: 131 NSNSTTFKDSNLAYSFDPIFILRY 154
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 32.4 bits (74), Expect = 0.46
Identities = 27/137 (19%), Positives = 56/137 (40%), Gaps = 23/137 (16%)
Query: 289 ERKKLADEEAMRKIKQ---LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
RK+ E ++ + L+ Q + +Q A + EE +L T + +++ +N
Sbjct: 116 ARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENI 175
Query: 346 RLL---------QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
R ++ RE +D N +++ + EE++T+ E ++ I
Sbjct: 176 RAKIEAEARGRAKEERENEDINREMLKLKAN---------EERETVLESIKTTFSHIGGG 226
Query: 397 HTAIRKLEEKERFLQTV 413
A+ L +K + TV
Sbjct: 227 FRAL--LTDKSKLTMTV 241
Score = 28.6 bits (64), Expect = 7.1
Identities = 21/112 (18%), Positives = 46/112 (41%), Gaps = 11/112 (9%)
Query: 288 DERKKLADEEAM--RKIKQLEDQ--THQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
+ER+K E+ ++ Q +D+ + QK++ +++ E LL E + E M+
Sbjct: 92 EERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRA 151
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLH-------KLAREEKDTLREQVQA 388
+ ++R + + ++ K RE +D RE ++
Sbjct: 152 TEEEILEMRRETIEEEAELERENIRAKIEAEARGRAKEERENEDINREMLKL 203
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 32.6 bits (74), Expect = 0.46
Identities = 34/171 (19%), Positives = 60/171 (35%), Gaps = 28/171 (16%)
Query: 283 RDSKRDERKKLADEEAMR-----KIKQLEDQTHQL----QKQVATHKQEEEALLNEMEVT 333
+ ++ + K+ A++E ++ ++K E Q Q Q+ +QEE+A E
Sbjct: 87 QVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146
Query: 334 GQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI 393
+A + + + + K A K E A EE E A +
Sbjct: 147 KKAEAAKAKAAA---EAAKLKAAAEAKKKAEEAAK------AAEEAKAKAEAAAA---KK 194
Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
+A A E+ + AEK +A + KA + AA
Sbjct: 195 KAEAEAKAAAEKAKA-EAEAKAKAEK------KAEAAAEEKAAAEKKKAAA 238
Score = 31.1 bits (70), Expect = 1.4
Identities = 25/137 (18%), Positives = 58/137 (42%), Gaps = 7/137 (5%)
Query: 308 QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIK 367
Q ++Q Q ++ K+ E+ + E Q E+++ + + ++L++ + K ++ +
Sbjct: 63 QYGRIQSQQSSAKKGEQQRKKKEE---QVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119
Query: 368 SNQLHKLAREEKDTLREQV--QALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRN 425
+ + K A+ E+ EQ A + +A + E + A+K+
Sbjct: 120 AEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK--LKAAAEAKKKAEEAA 177
Query: 426 QAMDLNKRKAIESAQSA 442
+A + K KA +A
Sbjct: 178 KAAEEAKAKAEAAAAKK 194
>gnl|CDD|219286 pfam07072, DUF1342, Protein of unknown function (DUF1342). This
family consists of several hypothetical bacterial
proteins of around 250 residues in length. Members of
this family are often known as YacF after the
Escherichia coli protein. The function of this family is
unknown.
Length = 211
Score = 32.1 bits (74), Expect = 0.47
Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 10/57 (17%)
Query: 302 IKQLEDQTHQLQK-----QVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
+K LE Q L K V E LL+E+E QA + R+ Q L+E
Sbjct: 46 LKDLERQRQTLLKWRGNPGVDQEA--LEQLLDEIE---QASAALNAAPGRIGQSLKE 97
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 32.0 bits (73), Expect = 0.54
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 5/56 (8%)
Query: 285 SKRDERKKLADEEAMRK-IKQLED----QTHQLQKQVATHKQEEEALLNEMEVTGQ 335
+RDE + EE + K K+++D + +K A +++E E L E +
Sbjct: 126 EERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQFLAERDDFSS 181
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 32.4 bits (74), Expect = 0.55
Identities = 29/139 (20%), Positives = 60/139 (43%), Gaps = 16/139 (11%)
Query: 289 ERKKLADEEAMRK-----IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
++ EE K ++ L++Q +L+ + E +AL ++++ + ++++
Sbjct: 254 PPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQE 313
Query: 344 NSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
N L ++ + K K E + N + KL + E++ L+ Q EA T
Sbjct: 314 NEELEEEYKIK-----KRTVELLPDAENNVAKLQALVVAS-SERLLELAQQWEAHRTP-- 365
Query: 402 KLEEKERFLQTVLTNAEKE 420
L ++ R L+ N E E
Sbjct: 366 -LIDEYRSLKEKNRNKEDE 383
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 30.0 bits (68), Expect = 0.57
Identities = 15/75 (20%), Positives = 35/75 (46%), Gaps = 9/75 (12%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATH-------KQEEEALLNEMEVTGQAFEDM 340
KKLA+++ I QL ++ +L K+ H + + + L E+ + E +
Sbjct: 2 SLEKKLAEKDEQ--IAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKL 59
Query: 341 QEQNSRLLQQLREKD 355
+++ L ++L+ +
Sbjct: 60 EKELENLEERLKRAE 74
Score = 29.2 bits (66), Expect = 0.94
Identities = 22/73 (30%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 347 LLQQLREKDDANFKLMTERIK-SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L ++L EKD+ +LM E K S + K LR + + L +I + + KLE+
Sbjct: 3 LEKKLAEKDEQIAQLMEEGEKLSKKELKH-NNTIKKLRAKNKELEKEIAELKKKLEKLEK 61
Query: 406 KERFLQTVLTNAE 418
+ L+ L AE
Sbjct: 62 ELENLEERLKRAE 74
Score = 26.5 bits (59), Expect = 8.6
Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 369 NQLHKLAREEKDT----LREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
+ L K E KD L E+ + LS + + I+KL K + L+ + +K+L
Sbjct: 1 SSLEKKLAE-KDEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKL 56
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been
characterized: one with RAB1A that has been activated by
GTP-binding and another with isoform CASP of CUTL1.
Length = 509
Score = 32.5 bits (74), Expect = 0.58
Identities = 27/170 (15%), Positives = 47/170 (27%), Gaps = 42/170 (24%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-------EDMQE 342
+ KL +A K + E L E+ A ++ E
Sbjct: 159 QDKLLLLDAAVKRHDGN--VITAVLIFLKRTLKLEILFQELAERQTALRHLSKHLQEEGE 216
Query: 343 QNSRL-------------LQQLREKDDANFKLMTERI---KSNQLHKLA----------- 375
L LQQ R+K + +RI K L +
Sbjct: 217 LQQLLKLLRAEGESEKQELQQYRQKAHRILQSKEKRINFLKEGCLFEGLDSSTAQIELEE 276
Query: 376 -REEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLR 424
+ E + ++E++ L QI + E ++R + R
Sbjct: 277 LKHESEHVQEEITKLEGQII-----QLRSEAQDREAEASGEAESFRKQPR 321
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 30.7 bits (70), Expect = 0.63
Identities = 12/54 (22%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 302 IKQLEDQTHQLQKQVATHKQ----EEEALLNEMEVTGQAFEDMQEQNSRLLQQL 351
I QL+ + Q +++ ++ +++ L +E+ + E++ EQN L Q+
Sbjct: 75 IAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQNKLLHDQI 128
Score = 28.4 bits (64), Expect = 4.0
Identities = 23/130 (17%), Positives = 56/130 (43%), Gaps = 24/130 (18%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED-MQEQ--NSRLLQQLREKDDA 357
++K+L ++ ++++ +++ E Q +E + + + LQ LR+
Sbjct: 11 ELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRK---- 66
Query: 358 NFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNA 417
QL++L ++E L+ + ++ ++ A EE+++ L+ L+
Sbjct: 67 ------------QLNEL-KKEIAQLKAEAESAQAELS---EAEESWEEQKKMLEDELSEL 110
Query: 418 EKELH-LRNQ 426
EK + L Q
Sbjct: 111 EKRIEELNEQ 120
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 31.0 bits (71), Expect = 0.63
Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 16/90 (17%)
Query: 314 KQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSN---- 369
KQ+ T + +EE ++QEQ ++L Q+L++ + K + E K +
Sbjct: 10 KQIVTERWKEELQ-----------AELQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQI 58
Query: 370 -QLHKLAREEKDTLREQVQALSLQIEAMHT 398
Q+ + +E+ EQ L QIE +
Sbjct: 59 EQIQQQVEQERAKRLEQKNQLLFQIEQVQE 88
Score = 27.6 bits (62), Expect = 8.2
Identities = 16/85 (18%), Positives = 35/85 (41%), Gaps = 14/85 (16%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG-QAFEDMQEQNSRLLQ 349
K+ E ++ QLE + QL + + + + E+ Q E +Q+Q +
Sbjct: 18 KEELQAELQEQLTQLEQELQQL-------EFQGQKAIKEIRKQSAQQIEQIQQQVEQERA 70
Query: 350 QLREKDDANFKLMTERIKSNQLHKL 374
+ E+ + +I+ Q+ +L
Sbjct: 71 KRLEQK----NQLLFQIE--QVQEL 89
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class
3 flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 30.5 bits (69), Expect = 0.65
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 293 LADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
L EE ++ + ++ Q LQ Q+A +Q++ + + ED Q S
Sbjct: 44 LTAEEKQKQQQLIQQQIQMLQAQIAQLQQQQS--EEAQQQQQSSIEDSPSQTS 94
>gnl|CDD|214457 MTH00205, ND4, NADH dehydrogenase subunit 4; Provisional.
Length = 448
Score = 32.3 bits (74), Expect = 0.65
Identities = 24/126 (19%), Positives = 43/126 (34%), Gaps = 3/126 (2%)
Query: 116 SSFGIFRLSSTVFVY--HCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSS 173
S FG+F L +FV +S F + F T F+ FS S
Sbjct: 2 SIFGVFSLLIILFVLKFKDWSFFSNLLLIGLFFSSFFFSNSFLMTSFISSSFSFDSLSLP 61
Query: 174 TVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFS 233
+ + + ++ V+ + + +V + I FS + +++ S
Sbjct: 62 LI-MLSFWISLLMVLASYKVFKSNKNSKLFTVSVIMLLLILILSFSVDNLLLFYITFEAS 120
Query: 234 LTSTLF 239
L TL
Sbjct: 121 LIPTLI 126
Score = 30.3 bits (69), Expect = 2.7
Identities = 21/128 (16%), Positives = 34/128 (26%), Gaps = 13/128 (10%)
Query: 133 FSIFRFS-SSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTV 191
F +F V + +S F + F T F+ FS S +
Sbjct: 4 FGVFSLLIILFVLKFKDWSFFSNLLLIGLFFSSFFFSNSFLMTSFISSSFSFDSLSLPLI 63
Query: 192 FVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFSLTSTLFVSICTSAFQMSF 251
S + + S F S+ + L L +S + F
Sbjct: 64 ----MLSFW-----ISLLMVLASYKVFKSNKNSKLFTVSVIMLLLILILSFSVDNLLL-F 113
Query: 252 YKSRWFDF 259
Y F+
Sbjct: 114 YI--TFEA 119
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.0 bits (73), Expect = 0.66
Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 22/97 (22%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
K+ +E A +K+LE++T + EMEV E+++E SR
Sbjct: 121 VEKEREEELAGDAMKKLENRTADSK--------------REMEVLE-RLEELKELQSR-- 163
Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQ 385
+ D + M E + + + EE++
Sbjct: 164 -----RADVDVNSMLEALFRREKKEEEEEEEEDEALI 195
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 30.9 bits (70), Expect = 0.66
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 371 LHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
L + ++E++ LR++ + L + +RKLE+K + T E L L N
Sbjct: 44 LEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEKLQETWHEHNLALANF 99
>gnl|CDD|225647 COG3105, COG3105, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 138
Score = 30.9 bits (70), Expect = 0.67
Identities = 20/105 (19%), Positives = 46/105 (43%), Gaps = 12/105 (11%)
Query: 281 VVRDSKRDERKKLADEEAMRKIK-QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED 339
+ R + R +++ + + K+K QL++ ++++ H LL + Q ++
Sbjct: 25 IARLTNRKLKQQQKLQYELEKVKAQLDEY----RQELVKHFARSAELLKTL---AQDYQK 77
Query: 340 MQEQ----NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
+ + ++ LL +L KD+ L+ E SN + +D
Sbjct: 78 LYQHMAKSSTSLLPELSAKDNPFRNLLAEEEASNDQAPVQMPPRD 122
>gnl|CDD|220173 pfam09311, Rab5-bind, Rabaptin-like protein. Members of this
family are predominantly found in Rabaptin and allow for
binding to the GTPase Rab5. This interaction is
necessary and sufficient for Rab5-dependent recruitment
of Rabaptin5 to early endosomal membranes.
Length = 180
Score = 31.2 bits (71), Expect = 0.68
Identities = 26/135 (19%), Positives = 48/135 (35%), Gaps = 14/135 (10%)
Query: 318 THKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR- 376
+ Q +EA+ ++++ + + ++ Q RL Q+ N L E + Q + +
Sbjct: 5 SFSQVQEAVQDQLQELEASRQKLRAQLRRLQQE-------NQWLRGELALTQQELQASEQ 57
Query: 377 ------EEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL 430
EE L+ ++ TA LEE+ + L + L A
Sbjct: 58 EVIQLPEEVKHLQFLCLQRREELIESRTASEHLEEELKSEIDPLKELKPNLEEELAAEIE 117
Query: 431 NKRKAIESAQSAADL 445
+ E Q A L
Sbjct: 118 LPQIQSEKDQLQASL 132
Score = 29.3 bits (66), Expect = 3.3
Identities = 19/113 (16%), Positives = 42/113 (37%), Gaps = 17/113 (15%)
Query: 304 QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ-NSR--LLQQLREKDDANFK 360
QL ++ LQ ++E A E ++E+ S L++L+ +
Sbjct: 61 QLPEEVKHLQFLCLQRREELIESR-------TASEHLEEELKSEIDPLKELKPNLEE--- 110
Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
+ + EKD L+ + + ++ +H + + E + R+ V
Sbjct: 111 --ELAAEIELPQIQS--EKDQLQASLYEIPARLRTLHNLVIQYESQGRYEVAV 159
>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO. This family
contains the bacterial type III secretion protein YscO,
which is approximately 150 residues long. YscO has been
shown to be required for high-level expression and
secretion of the anti-host proteins V antigen and Yops
in Yersinia pestis.
Length = 152
Score = 31.2 bits (71), Expect = 0.69
Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 3/115 (2%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
KKL E A + +K+ E L + A H++ +++L + + Q + +
Sbjct: 9 KKLRAERAEKAVKRQE---QALVQARARHREAQQSLQDYRQWRQQEENRLFAELQGATVL 65
Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L+E + ++ R L + E K L + Q L + + A + E+
Sbjct: 66 LKELEKWQQQVGLLRENEASLEQQLAEAKQRLEAERQRLRQARQQLQEARKAQEK 120
>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 262
Score = 31.6 bits (72), Expect = 0.71
Identities = 19/88 (21%), Positives = 27/88 (30%), Gaps = 9/88 (10%)
Query: 272 SLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVA--THKQEEEALLNE 329
S A R + ++ QLE Q QLQ ++ QE + N+
Sbjct: 28 SGAPLSSAGNGSVEDRVTPLERVQNAHSYRLTQLEQQLRQLQGKIEELRGIQELQYQNNQ 87
Query: 330 MEVTGQAFEDMQEQNSRLLQQLREKDDA 357
QE+N L L A
Sbjct: 88 NVER-------QEENEARLDSLESGRQA 108
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 32.2 bits (73), Expect = 0.72
Identities = 25/158 (15%), Positives = 64/158 (40%), Gaps = 19/158 (12%)
Query: 290 RKKLADEEA-----MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
+K++AD+E +++I++L + + K L + ++ + +++
Sbjct: 203 KKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAE 262
Query: 345 SRLLQQLREKDDANFKLMTER---IKSNQLHKLAREEKD---------TLREQVQALSLQ 392
S L + + + +K + ER I ++ ++K D ++ + + +
Sbjct: 263 SDL--SMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAE 320
Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL 430
I H I+KL ++ + + L NQ ++L
Sbjct: 321 INKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILEL 358
Score = 32.2 bits (73), Expect = 0.83
Identities = 32/176 (18%), Positives = 66/176 (37%), Gaps = 17/176 (9%)
Query: 279 ACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
A V E + E +++ LE + +++ K + + E+E
Sbjct: 573 ALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLN 632
Query: 339 DMQ---EQNSRLLQQLREKDDANFKL----MTERIKS-NQLHKLAREEKDTLREQVQALS 390
+ ++N L+++LR K D N+K + I ++ + +D L++
Sbjct: 633 NKYNEIQENKILIEKLRGKID-NYKKQIAEIDSIIPDLKEITSRINDIEDNLKK----SR 687
Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
++ +LE L+T + ++ N+ L K I+ A DLK
Sbjct: 688 KALDDAKANRARLESTIEILRTRINELSDRINDINE--TLESMKKIKKA--IGDLK 739
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 31.8 bits (73), Expect = 0.73
Identities = 34/134 (25%), Positives = 55/134 (41%), Gaps = 15/134 (11%)
Query: 300 RKIKQLEDQTHQLQKQV--ATHK-QEEEALL--NEMEVTG-----QAFEDMQEQNSRLLQ 349
+KIKQLE++ + ++++ A K +E E E EV Q E+ E++ L
Sbjct: 1 KKIKQLEEELDKAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLA 60
Query: 350 QLREKDDANFKLMTERIKS-NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKER 408
EK + K E + L + ++ + + + L E A RK EE R
Sbjct: 61 TALEKLEEAEKAADESERGRKVLENRSLKDDEKMEQLEAQLKEAKEIAEEADRKYEEVAR 120
Query: 409 FLQTVLTNAEKELH 422
L V E +L
Sbjct: 121 KLVVV----EGDLE 130
>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
Length = 653
Score = 32.0 bits (73), Expect = 0.75
Identities = 18/107 (16%), Positives = 45/107 (42%), Gaps = 18/107 (16%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
E MR+ + + + +KQ+ K + ++LL E T ++E + ++L+++ +
Sbjct: 508 ERMRQEAEKYAEEDRRRKQLIELKNQADSLLYSYEST------LKENGELISEELKQRAE 561
Query: 357 ANFKLMTERIKSNQLHKLARE---EKDTLREQVQALSLQIEAMHTAI 400
K QL + + L++Q++ + A+ +
Sbjct: 562 Q---------KVEQLEAALTDPNISLEELKQQLEEFQQALLAIGAEV 599
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 32.1 bits (73), Expect = 0.78
Identities = 29/166 (17%), Positives = 59/166 (35%), Gaps = 37/166 (22%)
Query: 296 EEAMRKIKQLEDQ----------------THQLQK-----QVATHKQEEEALLNEMEVTG 334
+E+ +K+ QL++Q H + + +EE A L+
Sbjct: 99 QESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSREVGKQ 158
Query: 335 QAFEDMQEQ---NSRLLQQLREKDDANFKLMTERIKSNQLHKLARE-EKDTLREQVQALS 390
+ R+LQQ ++ + K+ ++ L + ++ L +Q+LS
Sbjct: 159 RGLLSRGLATFKRDRILQQQWREEQEKYD---AADKARAIYALQTKADERNLETVLQSLS 215
Query: 391 LQIEAMHTAIRKLEEKERFLQTV---LTNAEKELHLRNQAMDLNKR 433
+ E+ L+TV + A E+ ++LN R
Sbjct: 216 QADFQLAGV------AEKELETVEARIKEARYEIEELENKLNLNTR 255
>gnl|CDD|240364 PTZ00332, PTZ00332, paraflagellar rod protein; Provisional.
Length = 589
Score = 32.2 bits (73), Expect = 0.80
Identities = 33/163 (20%), Positives = 73/163 (44%), Gaps = 16/163 (9%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQ--VATHKQEEEALLNEM-EVTGQAFED---MQ 341
D ++A+E+ K++ E + + + +EE +++ EV QA ++ M+
Sbjct: 207 DGEMQVAEEQMWTKVQLQERLIELVADKFRLIGKCEEENKSFSKIHEVQKQANQETSQMK 266
Query: 342 EQNSRLLQQ----LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
+ RL Q+ L+ DA K E ++ + + +E+ + + + +
Sbjct: 267 DAKRRLKQRCETDLKHIHDAIQKADLEDAEAMKRYATNKEKSE------RFIRENEDRQE 320
Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
A K+++ ER LQ + T +E+ R + D +++ +E Q
Sbjct: 321 EAWNKIQDLERQLQRLGTERFEEVKRRIEENDREEKRRVEYQQ 363
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 31.8 bits (72), Expect = 0.82
Identities = 32/145 (22%), Positives = 60/145 (41%), Gaps = 12/145 (8%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE-EEALLNEMEVTGQAFEDMQEQNSRLL 348
RK +E +R I++ ++ + Q+ Q+ L + V + Q+
Sbjct: 78 RKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKLEELSR 137
Query: 349 ------QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
Q L+ + DA + E+I+ NQ +L RE +TL Q Q L+ + +A
Sbjct: 138 HLKEENQCLQIQLDALVQECNEKIEENQ--ELNRELAETLAYQ-QELNDEYQATFVEQHN 194
Query: 403 LEEKERFLQTVLTNAEKEL--HLRN 425
+ +K + L ++L +RN
Sbjct: 195 MLDKRQAYIGKLEAKVQDLMCEIRN 219
Score = 29.5 bits (66), Expect = 4.6
Identities = 27/148 (18%), Positives = 65/148 (43%), Gaps = 24/148 (16%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
+ +RK+ + ++ +++ + ++E + L + QN +L+ QL
Sbjct: 74 QTKIRKLTEQDEGLRNIRESIEERQKESDQL--------------KLQNQKLVNQLGH-- 117
Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
A + + + +L +L+R L+E+ Q L +Q++A+ + E+ + L L
Sbjct: 118 -ARGVFIKTKGRYQKLEELSRH----LKEENQCLQIQLDALVQECNEKIEENQELNRELA 172
Query: 416 NA---EKELHLRNQAMDLNKRKAIESAQ 440
++EL+ QA + + ++ Q
Sbjct: 173 ETLAYQQELNDEYQATFVEQHNMLDKRQ 200
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 31.6 bits (72), Expect = 0.88
Identities = 31/139 (22%), Positives = 70/139 (50%), Gaps = 23/139 (16%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ- 341
D+K ++ ++ A++E + K+ ++ + QV+T K+ E+ L ++ + F++ Q
Sbjct: 172 NDAKLEQAQEQAEQEQEKH-KREKEILLKEAAQVSTLKETEKELRAQLNLYSDKFKEFQD 230
Query: 342 ----------------EQNSRLLQQLREKDDANFKLMTERIKSNQ-LHKLAREEKDTLRE 384
E+ S+ +++L EK++ +K E KSN+ L ++A EE+ +
Sbjct: 231 TLNKSNEVFLTFKQEMEKMSKKIKKL-EKENLAWKRKWE--KSNKALLEMA-EERQKRDK 286
Query: 385 QVQALSLQIEAMHTAIRKL 403
+++ L +I+ + R L
Sbjct: 287 EIEKLKKKIDKLEKLCRAL 305
Score = 30.4 bits (69), Expect = 2.4
Identities = 35/161 (21%), Positives = 60/161 (37%), Gaps = 28/161 (17%)
Query: 287 RDERKKLADEEAMRK---IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
++E K +DEE ++ ++ + +Q Q+ H L E E + +++ EQ
Sbjct: 91 KEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLKELIEQ 150
Query: 344 NSRLLQQLREKD-DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
+LRE+ + K + N KL + Q +A +
Sbjct: 151 -----YELREQHFEKLLKTKDLEQQLNDA-KLEQA--------------QEQAEQEQEKH 190
Query: 403 LEEKERFLQT---VLTNAEKELHLRNQ-AMDLNKRKAIESA 439
EKE L+ V T E E LR Q + +K K +
Sbjct: 191 KREKEILLKEAAQVSTLKETEKELRAQLNLYSDKFKEFQDT 231
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 31.6 bits (73), Expect = 0.92
Identities = 17/70 (24%), Positives = 34/70 (48%), Gaps = 10/70 (14%)
Query: 347 LLQQLREKDDANFKLMTE----RIKSNQL----HKLAREEKDT--LREQVQALSLQIEAM 396
+ +L E D+ +L TE + + N L + R+ +D L +V+ L +I+A+
Sbjct: 26 DVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKAL 85
Query: 397 HTAIRKLEEK 406
+ +LE +
Sbjct: 86 EAELDELEAE 95
>gnl|CDD|182409 PRK10361, PRK10361, DNA recombination protein RmuC; Provisional.
Length = 475
Score = 31.9 bits (72), Expect = 0.93
Identities = 36/146 (24%), Positives = 65/146 (44%), Gaps = 25/146 (17%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQ--EEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
+ A +K +QL ++ + + A +Q + E E E+ +Q N+ L LRE
Sbjct: 26 QHAQQKAEQLAEREEMVAELSAAKQQITQSEHWRAECELLNNEVRSLQSINTSLEADLRE 85
Query: 354 KDDANFKLMTERIKSNQLHKLAREEKDTLREQV---QALSLQIEAMHTAI-----RKLEE 405
+T R+++ Q H D +R+ + Q LS Q E + I R+++E
Sbjct: 86 --------VTTRMEAAQQHA-----DDKIRQMINSEQRLSEQFENLANRIFEHSNRRVDE 132
Query: 406 KER-FLQTVLTNAEKELH-LRNQAMD 429
+ R L ++L+ ++L R Q D
Sbjct: 133 QNRQSLNSLLSPLREQLDGFRRQVQD 158
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 31.9 bits (73), Expect = 0.93
Identities = 25/116 (21%), Positives = 44/116 (37%), Gaps = 10/116 (8%)
Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
+ L E+ +A D Q L +D E IK L K
Sbjct: 88 SQVLRLEAEVARL-RAEADSQAAIE-FPDDLLSAEDPAVP---ELIK--GQQSLFESRKS 140
Query: 381 TLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAI 436
TLR Q++ + QI+ + + L+ + + L+ L +EL R + L ++ +
Sbjct: 141 TLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRK---LKEKGLV 193
Score = 29.2 bits (66), Expect = 5.3
Identities = 37/163 (22%), Positives = 66/163 (40%), Gaps = 32/163 (19%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
+ + R + E + +IKQLE + LQ Q+ +Q+ E + E+E + E
Sbjct: 139 KSTLRAQL-----ELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLV 193
Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
LL+ RE+ + A+ E L +++ L QI+ + +
Sbjct: 194 SRLELLELERERAE------------------AQGELGRLEAELEVLKRQIDE--LQLER 233
Query: 403 LEEKERFLQTV---LTNAEKEL----HLRNQAMDLNKRKAIES 438
+ ++ F + V LT A+ L N+A D +R I S
Sbjct: 234 QQIEQTFREEVLEELTEAQARLAELRERLNKARDRLQRLIIRS 276
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 0.96
Identities = 30/141 (21%), Positives = 60/141 (42%), Gaps = 25/141 (17%)
Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLM 362
K LE L + + K++ + L+ E+E+ + + +LR++ DA L
Sbjct: 140 KLLEGLKEGLDENLEGLKEDYKLLMKELELL-----------NSIKPKLRDRKDA---LE 185
Query: 363 TERIKSNQLHKLARE-------EKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVL- 414
E QL +L E E D +E+++ L +I + +LEE+ + L++ +
Sbjct: 186 EE---LRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIE 242
Query: 415 TNAEKELHLRNQAMDLNKRKA 435
K+ L + + K+
Sbjct: 243 DLTNKKSELNTEIAEAEKKLE 263
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 31.8 bits (73), Expect = 0.99
Identities = 24/120 (20%), Positives = 46/120 (38%), Gaps = 18/120 (15%)
Query: 300 RKIKQLEDQTHQLQKQVATHKQEE---EALLNEMEVTG--QAFEDMQEQNSRLLQQLRE- 353
K+++ D+ L + E E +L E + FE+ + + ++LRE
Sbjct: 386 PKVEKKLDK-LGLSIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKRLREE 444
Query: 354 --KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH-TAIRKLEEKERFL 410
K D + + E ++ L +L EK L+ Q H +R+ + + L
Sbjct: 445 AAKIDPSLEGALEANEAKILKQLEFLEKRLLKAQ--------RRKHEELLRQFDRLQTAL 496
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.5 bits (72), Expect = 1.1
Identities = 28/135 (20%), Positives = 63/135 (46%), Gaps = 6/135 (4%)
Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQL 351
KL EE ++ LE+ + +++ K ++ + E+E E + + + + +
Sbjct: 319 KLDVEELKALLEALEEILEKNLQKLEE-KLKDPSTSIELESITDLIESINDIIDAINELI 377
Query: 352 REKDD--ANFKLMTERIKSNQ-LHKLAREEKD--TLREQVQALSLQIEAMHTAIRKLEEK 406
RE ++ N K + K LH +A ++D +++ + L I ++ I++LE +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
Query: 407 ERFLQTVLTNAEKEL 421
+ L+ + EK+L
Sbjct: 438 IKALEKEIKELEKQL 452
Score = 30.0 bits (68), Expect = 3.4
Identities = 39/161 (24%), Positives = 70/161 (43%), Gaps = 16/161 (9%)
Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK 354
DEE I+QLE+ + + + +E E++L+ E F+ E+ LL+ L E
Sbjct: 277 DEEYQELIEQLEELIDKYESHIEKALEELESILD-TEKENSEFKLDVEELKALLEALEEI 335
Query: 355 DDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVL 414
+ N + + E++K + + E L E + + I+A++ IR+ EK L+
Sbjct: 336 LEKNLQKLEEKLK-DPSTSIELESITDLIESINDI---IDAINELIREHNEKIDNLKKEK 391
Query: 415 TNAEKEL--HLRNQAMDL---------NKRKAIESAQSAAD 444
A+K+L HL + + KAI S +
Sbjct: 392 NKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIK 432
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 31.8 bits (72), Expect = 1.1
Identities = 29/158 (18%), Positives = 61/158 (38%), Gaps = 12/158 (7%)
Query: 301 KIKQLEDQTHQLQKQVA----THKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLRE 353
++ + + + QL+ ++ HK+ + L + + +E ++N L L+ L E
Sbjct: 62 ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121
Query: 354 KD-DANFKLMTERIKSNQLH-KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
+ A + ++ L KL E E+ L E++ L E + Q
Sbjct: 122 LEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQ 181
Query: 412 ---TVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
T L E EL + ++ +++ E+ + L
Sbjct: 182 NADTELKLLESELEELREQLEECQKELAEAEKKLQSLT 219
>gnl|CDD|215715 pfam00097, zf-C3HC4, Zinc finger, C3HC4 type (RING finger). The
C3HC4 type zinc-finger (RING finger) is a cysteine-rich
domain of 40 to 60 residues that coordinates two zinc
ions, and has the consensus sequence:
C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C
where X is any amino acid. Many proteins containing a
RING finger play a key role in the ubiquitination
pathway.
Length = 40
Score = 28.2 bits (63), Expect = 1.1
Identities = 5/17 (29%), Positives = 8/17 (47%)
Query: 487 DCLRTRYETRQRKCPKC 503
C+ + E+ CP C
Sbjct: 24 KCILSWLESGNVTCPLC 40
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 29.5 bits (67), Expect = 1.2
Identities = 17/78 (21%), Positives = 40/78 (51%), Gaps = 11/78 (14%)
Query: 347 LLQQLREKDDANFKLMTE----RIKSNQLHKL------AREEKDTLREQVQALSLQIEAM 396
+ +L E D+ +L E + + N+L K +E+ + L +V+ L +++A+
Sbjct: 27 DVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKAL 86
Query: 397 HTAIRKLEEK-ERFLQTV 413
+R+LE + ++ L ++
Sbjct: 87 EAELRELEAELDKLLLSI 104
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 31.2 bits (71), Expect = 1.3
Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 14/74 (18%)
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
Q+Q + LL QL+++ E S KL RE ++TL L+ QI+ ++ +I
Sbjct: 67 QQQRASLLAQLKKQ---------EEAISQASRKL-RETQNTL----NQLNKQIDELNASI 112
Query: 401 RKLEEKERFLQTVL 414
KLE+++ + +L
Sbjct: 113 AKLEQQQAAQERLL 126
>gnl|CDD|223091 COG0012, COG0012, Predicted GTPase, probable translation factor
[Translation, ribosomal structure and biogenesis].
Length = 372
Score = 31.0 bits (71), Expect = 1.4
Identities = 35/158 (22%), Positives = 52/158 (32%), Gaps = 41/158 (25%)
Query: 269 IYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN 328
SLE R R KR + K D+E +++++ + EE L
Sbjct: 137 DLESLEKR----WERLEKRAKAGKKLDKEL--------------KEELSLLGKLEEHLEE 178
Query: 329 EMEVTGQAFEDMQEQNSRLLQQLR--------------EKDDANFKLMTERIKSNQLHKL 374
G E++ L L E D AN +R+K +L
Sbjct: 179 GKPARGLDLSKWSEEDLEALASLNLLTAKPMLYVANVSEDDLANLNEYVKRLK-----EL 233
Query: 375 AREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQT 412
A +E +V +S IE + EEK FL
Sbjct: 234 AAKENA----EVVPVSAAIELELRELADAEEKGEFLIE 267
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 30.8 bits (70), Expect = 1.4
Identities = 19/92 (20%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
+K ++L + E RK Q + + + +K+ QEE L E + T +
Sbjct: 146 TVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEE--RLAEAKET-------E 196
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHK 373
N + L++ E+++ K + +K +L++
Sbjct: 197 RINLKSLERYEEQEEEKKKAKIQALKKRRLYE 228
>gnl|CDD|178657 PLN03110, PLN03110, Rab GTPase; Provisional.
Length = 216
Score = 30.7 bits (69), Expect = 1.4
Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 11/84 (13%)
Query: 335 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQALSLQ 392
Q F+++Q R L++LR+ D+N +M KS N L +A E+ L E+ + LS
Sbjct: 98 QTFDNVQ----RWLRELRDHADSNIVIMMAGNKSDLNHLRSVAEEDGQALAEK-EGLSF- 151
Query: 393 IEAMHTAIRKLEEKERFLQTVLTN 416
+ T+ + E+ QT+L
Sbjct: 152 ---LETSALEATNVEKAFQTILLE 172
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome partitioning].
Length = 1480
Score = 31.4 bits (71), Expect = 1.4
Identities = 26/132 (19%), Positives = 61/132 (46%), Gaps = 10/132 (7%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
R++L EA R + +Q Q Q Q++ + Q +L ++ + +++ + + LQ
Sbjct: 990 RQRLEQAEAERTRAR--EQLRQHQAQLSQYNQ----VLASLKSSYDTKKELLNELQQELQ 1043
Query: 350 QLREKDDANFKLMTERIKSNQLHKL---AREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
+ + D+ RI+ ++LH R ++ L +Q+ +++ + +RKLE
Sbjct: 1044 DIGVRADSG-AEERARIRRDELHAQLSTNRSRRNQLEKQLTFCEAEMDNLTRKLRKLERD 1102
Query: 407 ERFLQTVLTNAE 418
++ + A+
Sbjct: 1103 YFEMREQVVTAK 1114
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 31.2 bits (69), Expect = 1.4
Identities = 30/123 (24%), Positives = 52/123 (42%), Gaps = 12/123 (9%)
Query: 261 SLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHK 320
S++D + SLE + V K+ E+KK KQ+ QK A
Sbjct: 46 SINDQRKEAFASLELTEQPQQVEKVKKSEKKKAQ--------KQIAKDHEAEQKVNAKKA 97
Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRL----LQQLREKDDANFKLMTERIKSNQLHKLAR 376
E+EA E E +A ++ + + + +Q+ +EK +A+ K + K + K +
Sbjct: 98 AEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEK 157
Query: 377 EEK 379
EK
Sbjct: 158 AEK 160
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 30.9 bits (70), Expect = 1.4
Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 9/87 (10%)
Query: 304 QLEDQTHQLQ--KQVATHKQEEEALLNEM--EVTGQAFEDMQEQNSRLLQQLREKDDANF 359
+ +D + Q Q A ++++ L + QA E EQ LL + RE
Sbjct: 44 RWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEAREDVATAR 103
Query: 360 KLMTERIKSNQLHKLAREEKDTLREQV 386
E QL + +E L++Q
Sbjct: 104 DEWLE-----QLEREKQEFFKALQQQT 125
Score = 30.2 bits (68), Expect = 2.1
Identities = 15/84 (17%), Positives = 36/84 (42%), Gaps = 9/84 (10%)
Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKL 361
I+ ++ + ++ + +Q ++ E E Q + +++Q + + Q +E D
Sbjct: 31 IQVMKKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADE---- 86
Query: 362 MTERIKSNQLHKLAREEKDTLREQ 385
+ L ARE+ T R++
Sbjct: 87 -----QRQHLLNEAREDVATARDE 105
>gnl|CDD|181787 PRK09343, PRK09343, prefoldin subunit beta; Provisional.
Length = 121
Score = 29.7 bits (67), Expect = 1.5
Identities = 24/111 (21%), Positives = 48/111 (43%), Gaps = 19/111 (17%)
Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKL 361
+ QL+ QL++ + K + + L E+ + L++L + D
Sbjct: 13 LAQLQQLQQQLER-LLQQKSQIDLELREIN--------------KALEELEKLPD---DT 54
Query: 362 MTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
+I N L K+ + + + L+E+ + L L+ + +KL EK + LQ
Sbjct: 55 PIYKIVGNLLVKVDKTKVEKELKERKELLELRSRTLEKQEKKLREKLKELQ 105
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 30.1 bits (68), Expect = 1.5
Identities = 13/65 (20%), Positives = 28/65 (43%)
Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
R E K+ A ++ + ++E E L N+ E + E+++++ +
Sbjct: 70 RKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEK 129
Query: 347 LLQQL 351
L Q+L
Sbjct: 130 LRQRL 134
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 29.9 bits (68), Expect = 1.5
Identities = 15/67 (22%), Positives = 34/67 (50%)
Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQL 351
KL E A + ++LE++ +L+++ +QE +L + + + E ++EQ ++L
Sbjct: 6 KLEAENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKL 65
Query: 352 REKDDAN 358
E +
Sbjct: 66 EESEKLA 72
Score = 28.3 bits (64), Expect = 4.7
Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 8/87 (9%)
Query: 348 LQQLR-EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
+ L+ E ++A + E ++ + K +E +++ +L + + + + KLEE+
Sbjct: 2 MNALKLEAENAQERA--EELE--EKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQ 57
Query: 407 ERFLQTVLTNAEKELHLRNQAMDLNKR 433
L+ E+ L A L +R
Sbjct: 58 ---LKEAKEKLEESEKLATNAEALTRR 81
>gnl|CDD|215791 pfam00210, Ferritin, Ferritin-like domain. This family contains
ferritins and other ferritin-like proteins such as
members of the DPS family and bacterioferritins.
Length = 142
Score = 30.0 bits (68), Expect = 1.6
Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 11/67 (16%)
Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQT 412
E + K +TE + +L +LA EE D + + + EE E FL+
Sbjct: 86 EAALEHEKEVTESL--RELIELAEEEGDYATADF---------LEWFLEEQEEHEWFLEA 134
Query: 413 VLTNAEK 419
+L ++
Sbjct: 135 LLEKLKR 141
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 30.3 bits (69), Expect = 1.6
Identities = 21/74 (28%), Positives = 40/74 (54%), Gaps = 12/74 (16%)
Query: 290 RKKLADEEAM----RKIKQLEDQTHQLQKQVATHKQEEEAL----LNEMEVTGQAFED-- 339
RK L E+ ++IK+LE++ +L+K+VA + + EA+ E ++ + D
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172
Query: 340 --MQEQNSRLLQQL 351
+++QN +L QL
Sbjct: 173 AFLKKQNQQLKSQL 186
>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
complex, found in both eukaryotes and archaea, that
binds and stabilizes newly synthesized polypeptides
allowing them to fold correctly. The complex contains
two alpha and four beta subunits, the two subunits being
evolutionarily related. In archaea, there is usually
only one gene for each subunit while in eukaryotes there
two or more paralogous genes encoding each subunit
adding heterogeneity to the structure of the hexamer.
The structure of the complex consists of a double beta
barrel assembly with six protruding coiled-coils.
Length = 129
Score = 29.6 bits (67), Expect = 1.6
Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 11/127 (8%)
Query: 302 IKQLEDQTHQLQKQ---VATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDAN 358
+++L Q QLQ+Q + Q+ EA L E E + E +++ +
Sbjct: 1 LQELAAQLQQLQQQLEALQQQLQKLEAQLTEYEKAKETLETLKKAEEEKELLVPLGAGLF 60
Query: 359 FK---LMTER----IKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFL 410
K ++ + + + + EE + L+++++ L QIE + + KL+++ L
Sbjct: 61 VKAEVKDDDKVLVDLGTGVYVEKSLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITEL 120
Query: 411 QTVLTNA 417
Q L
Sbjct: 121 QEELQQL 127
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.0 bits (70), Expect = 1.6
Identities = 16/76 (21%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE 364
++ +L+ + ++Q+ +AL E T +A E +++Q L +L E+ + +
Sbjct: 432 IDSMQRELEMWRSEYRQQAQALQQERRATEEALEPLKDQ----LAEL-EQAIKDQRSKIS 486
Query: 365 RIKSNQLHKLAREEKD 380
+++N L A+ +K
Sbjct: 487 NVRANILKNEAKIQKL 502
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 30.4 bits (69), Expect = 1.7
Identities = 36/188 (19%), Positives = 65/188 (34%), Gaps = 50/188 (26%)
Query: 272 SLEFRKFACVVRDSKRDERKKLADEEA-----MRKIKQLEDQTHQLQKQVATHKQEEEAL 326
+EF+ A K K + +EE K ++LE +LQ+ + + E
Sbjct: 17 VIEFKDLA--NAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQEGIE--EGAREGY 72
Query: 327 LNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQV 386
+ +E+ E+ + L +L + E + L+E V
Sbjct: 73 EEGFQ---LGYEEGFEEGQEEGRVLERL--------------AKLIAEFQAELEALKEVV 115
Query: 387 QALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK----ELHLRNQAMDLNKRKAIESA-QS 441
+ ++ +Q VL A K EL L +A+ R+A+E
Sbjct: 116 E-------------------KQLVQLVLEIARKVIGKELELDPEALLAAVREALEEVPLF 156
Query: 442 AADLKLHL 449
AA + L +
Sbjct: 157 AAAITLRV 164
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 30.0 bits (68), Expect = 1.7
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 19/82 (23%)
Query: 375 AREEKDTLREQVQALSLQI----------EAMHTAIRKLEEKERFLQTVLTNAEKELHLR 424
A EE+D+L++Q++ QI E + I KL+++ +T E +L
Sbjct: 25 ANEERDSLKKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQ---NKTAKEEYEAKLA-- 79
Query: 425 NQAMDLNKRKAIESAQSAADLK 446
D AIE A A +
Sbjct: 80 ----DTKLNNAIELALKKAKAR 97
>gnl|CDD|238105 cd00179, SynN, Syntaxin N-terminus domain; syntaxins are nervous
system-specific proteins implicated in the docking of
synaptic vesicles with the presynaptic plasma membrane;
they are a family of receptors for intracellular
transport vesicles; each target membrane may be
identified by a specific member of the syntaxin family;
syntaxins contain a moderately well conserved
amino-terminal domain, called Habc, whose structure is
an antiparallel three-helix bundle; a linker of about 30
amino acids connects this to the carboxy-terminal
region, designated H3 (t_SNARE), of the syntaxin
cytoplasmic domain; the highly conserved H3 region forms
a single, long alpha-helix when it is part of the core
SNARE complex and anchors the protein on the cytoplasmic
surface of cellular membranes; H3 is not included in
defining this domain.
Length = 151
Score = 29.9 bits (68), Expect = 1.7
Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 8/90 (8%)
Query: 302 IKQLEDQTHQLQKQV----ATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL---QQLREK 354
I ++ + +LQK + E L E+E Q + + ++ L ++ E+
Sbjct: 15 IDKISEDVEELQKLHSQLLTAPDADPE-LKQELESLVQEIKKLAKEIKGKLKELEESNEQ 73
Query: 355 DDANFKLMTERIKSNQLHKLAREEKDTLRE 384
++A +RI+ Q L+++ + + E
Sbjct: 74 NEALNGSSVDRIRKTQHSGLSKKFVEVMTE 103
>gnl|CDD|217894 pfam04102, SlyX, SlyX. The SlyX protein has no known function. It
is short less than 80 amino acids and is found close to
the slyD gene. The SlyX protein has a conserved PPH(Y/W)
motif at its C-terminus. The protein may be a
coiled-coil structure.
Length = 68
Score = 28.3 bits (64), Expect = 1.8
Identities = 15/59 (25%), Positives = 33/59 (55%), Gaps = 8/59 (13%)
Query: 304 QLEDQTHQLQKQVATHKQEEEAL--LNEMEVTGQAFE--DMQEQNSRLLQQLREKDDAN 358
LE++ +L+ ++A +E+ + LN+ V Q + +Q Q L+++L+E + +N
Sbjct: 1 SLEERIEELEIRLA---FQEDTIEELNQ-VVAEQQRQIDRLQRQLRLLVERLKEVESSN 55
>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737). This
family consists of several uncharacterized mammalian
proteins of unknown function.
Length = 187
Score = 30.1 bits (68), Expect = 1.8
Identities = 28/124 (22%), Positives = 51/124 (41%), Gaps = 23/124 (18%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
E ++L E ++ + + + +E LL E Q +QE+ +R+
Sbjct: 47 SELRRLIAGELKGALED-AKRPSEETAGGLQSSEVKEDLLKRYE---QEQAIVQEELARI 102
Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKE 407
+ RE++ A E++ L RE+ +E+ +A L R+LEEKE
Sbjct: 103 AK--REREAAE-----EQLSRAVL----REKASAEQERQKAKHLA--------RQLEEKE 143
Query: 408 RFLQ 411
L+
Sbjct: 144 AELK 147
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 30.8 bits (70), Expect = 1.8
Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 4/50 (8%)
Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
E K +Q + HQL Q A + L + + QA E + SR
Sbjct: 368 ELRVKTEQAQASVHQLDSQFAQL----DGKLADAQSAQQALEQQYQDLSR 413
>gnl|CDD|226419 COG3904, COG3904, Predicted periplasmic protein [Function unknown].
Length = 245
Score = 30.3 bits (68), Expect = 1.8
Identities = 13/47 (27%), Positives = 20/47 (42%)
Query: 325 ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQL 371
A L EM + + M ++QL K+ KL+T R + L
Sbjct: 197 AYLREMGIGPGLLQMMLATPPSDIRQLGLKEMTAMKLVTSRDAVDLL 243
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 30.8 bits (70), Expect = 1.9
Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 5/79 (6%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEAL---LNEMEVTGQAFEDMQEQNSRLLQQLREKDDA 357
++K+LE ++ + + + L L E+E ED QE+ L LR KD+
Sbjct: 355 ELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLR-KDEL 413
Query: 358 NFKLMTERIKSNQLHKLAR 376
+ ER+KS +LH++ R
Sbjct: 414 EARENLERLKS-KLHEIKR 431
Score = 29.7 bits (67), Expect = 4.7
Identities = 28/132 (21%), Positives = 55/132 (41%), Gaps = 12/132 (9%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
EEA + L ++ +A + E L +++ + DM+E+ L
Sbjct: 200 EEAEEHMIALRSIMERIPSLLAELQTELPGQLQDLK---AGYRDMKEEGYHLEHV---NI 253
Query: 356 DANFKLMTERIKSN--QLHKL----AREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
D+ + + E++ N L +L A EE ++E++++L +E A +EE
Sbjct: 254 DSRLERLKEQLVENSELLTQLELDEAEEELGLIQEKIESLYDLLEREVEAKNVVEENLPI 313
Query: 410 LQTVLTNAEKEL 421
L L A++
Sbjct: 314 LPDYLEKAKENN 325
>gnl|CDD|237953 PRK15378, PRK15378, inositol phosphate phosphatase SopB;
Provisional.
Length = 564
Score = 30.6 bits (69), Expect = 2.0
Identities = 26/116 (22%), Positives = 42/116 (36%), Gaps = 9/116 (7%)
Query: 333 TGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ 392
T +AF+ +Q+ +Q L + A K R + L R + T +Q
Sbjct: 14 TQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEIIVL----RNKGATWGNYLQHQKTS 69
Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKE----LHLRNQAMDLNKRKAIESAQSAAD 444
++H + T L A K+ + NQ M+L K KA A +
Sbjct: 70 NHSLHNLYNLQRDLLTVAATALKLAGKQDPVLKAMANQ-MELAKVKADRPATKQEE 124
>gnl|CDD|223181 COG0103, RpsI, Ribosomal protein S9 [Translation, ribosomal
structure and biogenesis].
Length = 130
Score = 29.4 bits (67), Expect = 2.1
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 274 EFRKFACVVRDSKRDERKK 292
+K + RD +R ERKK
Sbjct: 97 ALKKAGLLTRDPRRVERKK 115
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 30.6 bits (69), Expect = 2.2
Identities = 21/120 (17%), Positives = 39/120 (32%), Gaps = 13/120 (10%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
+ + + + E + + +Q A E A ++E+ +A E +
Sbjct: 358 AANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEAAKAEAEA 417
Query: 346 R------LLQQLREKDDAN---FKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
+ + +REK A + + E I+ A K VQAL E
Sbjct: 418 QAAEIKAEAEAIREKGKAEAEAKRALAEAIQVLGDAAAAELFKAL----VQALPEVAEEA 473
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 29.6 bits (67), Expect = 2.5
Identities = 25/125 (20%), Positives = 50/125 (40%), Gaps = 13/125 (10%)
Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEAL------LNEMEVTGQAFED-MQEQN 344
+L + + +QL + QL+ ++EE L L E+E +A E +E++
Sbjct: 24 QLLRRKRALE-RQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEES 82
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI----EAMHTAI 400
L LR + + E ++ + L R L ++ L Q+ E++ +
Sbjct: 83 KNLHPVLRLLESEVLEEN-ELLQDSLLELSERNFSPNLDPELLDLLKQLNKHLESLQGNL 141
Query: 401 RKLEE 405
+L
Sbjct: 142 EQLAG 146
>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
unknown].
Length = 272
Score = 30.0 bits (67), Expect = 2.5
Identities = 22/130 (16%), Positives = 41/130 (31%), Gaps = 12/130 (9%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-EDMQ 341
+D D K E+ + Q++ H + +N ++ + D
Sbjct: 113 QDINEDPAKAENVEKVPESDAIEALEKQLTQQKSEKHN---SSAINFIDELNKRLWSDPF 169
Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI-----EAM 396
+ RL +Q RE+ + K L A + D L +
Sbjct: 170 VSSQRLRKQFRERKKIE---KKQEAKDLSLKNRAALDIDILPSSSDKDKALLLLDNELGK 226
Query: 397 HTAIRKLEEK 406
+ IRKL+ +
Sbjct: 227 NKFIRKLDYR 236
>gnl|CDD|221380 pfam12017, Tnp_P_element, Transposase protein. Protein in this
family are transposases found in insects. This region is
about 230 amino acids in length and is found associated
with pfam05485.
Length = 236
Score = 29.8 bits (67), Expect = 2.5
Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 5/47 (10%)
Query: 280 CVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEAL 326
V+ + E K L +KI+ LE + H L++Q+ +Q E++L
Sbjct: 10 KVINHTTLMENKSLR-----QKIRALEKEMHSLRQQLEESQQLEKSL 51
>gnl|CDD|184695 PRK14471, PRK14471, F0F1 ATP synthase subunit B; Provisional.
Length = 164
Score = 29.4 bits (66), Expect = 2.5
Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 16/118 (13%)
Query: 320 KQEEEALLNEMEVTGQAFEDMQE---QNSRLLQQLREKDDANFK---LMTERIKSNQLHK 373
K+ E+++ N + +A ++MQ N RLL++ R + DA K + E++ ++ +
Sbjct: 38 KEREDSIKNALASAEEARKEMQNLQADNERLLKEARAERDAILKEAREIKEKMIADAKEE 97
Query: 374 LAREEKDTLREQVQALSLQIE---AMH-----TAIRKLEEKERFLQTVLTNAEKELHL 423
A+ E D + EQ +A S++ E AM A +E E+ L+ L+N EK+ L
Sbjct: 98 -AQVEGDKMIEQAKA-SIESEKNAAMAEIKNQVANLSVEIAEKVLRKELSNKEKQHKL 153
>gnl|CDD|214448 MTH00173, ATP6, ATP synthase F0 subunit 6; Provisional.
Length = 231
Score = 29.8 bits (68), Expect = 2.5
Identities = 29/133 (21%), Positives = 49/133 (36%), Gaps = 22/133 (16%)
Query: 133 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 192
FS F +S+ S + S + ++ S SSS + F +F + S+
Sbjct: 6 FSSFDDHNSSFSS---LSFLMWLLSLMSLFFFSSSVWVSSSNLSSV--FKLFVLTVSS-Q 59
Query: 193 VYHCFSIFR--FSSSTVFVYHCFSI---------FRFSSSTIFVYHCFDIFSLTSTLFVS 241
V + FS ++ F I F FS ++ + FSL L++S
Sbjct: 60 VTRSSGLNLGGFSLLLSSLF-LFLISLNLSGLLPFVFSVTSHLAF----TFSLALPLWLS 114
Query: 242 ICTSAFQMSFYKS 254
+ S + KS
Sbjct: 115 LILSGLFYNPSKS 127
>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus.
Length = 139
Score = 29.1 bits (66), Expect = 2.5
Identities = 13/56 (23%), Positives = 29/56 (51%), Gaps = 5/56 (8%)
Query: 379 KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRK 434
K RE+ + +++ + ++KL +K ++L++ L + +L N L K+K
Sbjct: 42 KRRYRERRKK---ELKKLEQTLKKLRDKNKYLESQLDYYKS--YLDNVLSTLQKKK 92
>gnl|CDD|203394 pfam06133, DUF964, Protein of unknown function (DUF964). This
family consists of several relatively short bacterial
and archaeal hypothetical sequences. The function of
this family is unknown.
Length = 108
Score = 28.6 bits (65), Expect = 2.5
Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 11/68 (16%)
Query: 294 ADEEAMRKIKQLEDQTHQLQ----------KQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
ADEEA + I + + ++Q K+V QE + ++ + F++ ++
Sbjct: 31 ADEEAQKLIDEFQKLQEEIQEKQMFGKEIPKEVQQKIQELKREIDLNPLV-AEFKEAEQA 89
Query: 344 NSRLLQQL 351
LLQ++
Sbjct: 90 LQDLLQEV 97
>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A. Members of this
family are the phage shock protein PspA, from the phage
shock operon. This is a narrower family than the set of
PspA and its homologs, sometimes several in a genome, as
described by pfam04012. PspA appears to maintain the
protonmotive force under stress conditions that include
overexpression of certain phage secretins, heat shock,
ethanol, and protein export defects [Cellular processes,
Adaptations to atypical conditions].
Length = 219
Score = 29.9 bits (68), Expect = 2.5
Identities = 29/96 (30%), Positives = 46/96 (47%), Gaps = 13/96 (13%)
Query: 339 DMQEQNSRLLQQLRE-KDDANFKLMTERIKSNQLHKLARE---EKDTLREQVQALSLQIE 394
+++ + SRL Q+ + +D A L R LAR EK +E +AL ++
Sbjct: 56 ELERRVSRLEAQVADWQDKAELALSKGR------EDLARAALIEKQKAQELAEALERELA 109
Query: 395 AMHTAIRKLEEKERFLQTVLTNA---EKELHLRNQA 427
A+ + KL+E LQ L A +K L +R+QA
Sbjct: 110 AVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQA 145
>gnl|CDD|224011 COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases
[Cell envelope biogenesis, outer membrane / Carbohydrate
transport and metabolism].
Length = 588
Score = 30.3 bits (69), Expect = 2.6
Identities = 17/95 (17%), Positives = 32/95 (33%), Gaps = 22/95 (23%)
Query: 149 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 208
S +S + + F +F +Y +++R S S +F I + ++
Sbjct: 24 LSASLWSGAQLLTICVFILFG-------LYR--AVWRASVSDLF-----IILKAVLTSAL 69
Query: 209 VYHCFSIFR--FSSSTIFVYHCFDIFSLTSTLFVS 241
V F R S + VY L + +
Sbjct: 70 VSLLFFSTRTDLPRSVVLVY------WLLLFVLIG 98
Score = 29.9 bits (68), Expect = 3.5
Identities = 17/80 (21%), Positives = 29/80 (36%), Gaps = 16/80 (20%)
Query: 133 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 192
S +S + + F +F +Y +++R S S +F I + ++
Sbjct: 24 LSASLWSGAQLLTICVFILFG-------LYR--AVWRASVSDLF-----IILKAVLTSAL 69
Query: 193 VYHCFSIFR--FSSSTVFVY 210
V F R S V VY
Sbjct: 70 VSLLFFSTRTDLPRSVVLVY 89
>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
Members of this protein family are phage lysis
regulatory protein, including the well-studied protein
LysB (lysis protein B) of Enterobacteria phage P2. For
members of this family, genes are found in phage or in
prophage regions of bacterial genomes, typically near a
phage lysozyme or phage holin.
Length = 135
Score = 29.3 bits (66), Expect = 2.6
Identities = 10/52 (19%), Positives = 21/52 (40%)
Query: 375 AREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
A + ++ + + Q+ + ++ EE + L+ L A L R Q
Sbjct: 31 ANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARALLAQREQ 82
>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 79
Score = 28.0 bits (62), Expect = 2.6
Identities = 11/50 (22%), Positives = 26/50 (52%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
K++Q D LQ ++ K++ +L E++ E ++ +N +L ++
Sbjct: 12 KVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREALERENEQLKEE 61
>gnl|CDD|225468 COG2916, Hns, DNA-binding protein H-NS [General function prediction
only].
Length = 128
Score = 29.0 bits (65), Expect = 2.8
Identities = 19/77 (24%), Positives = 35/77 (45%), Gaps = 14/77 (18%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL--LQQLREKD--D 356
++ LE + + ++QV +QEEEA A +++E+ + +++L KD
Sbjct: 16 YLELLE-EMLEKEEQVVQERQEEEA---------AAIAEIEERQEKYGTIRELLIKDGIT 65
Query: 357 ANFKLMTERIKSNQLHK 373
L TE K + K
Sbjct: 66 PEELLNTEVTKPRRKAK 82
>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p. In Saccharomyces
cerevisiae, Sec2p is a GDP/GTP exchange factor for
Sec4p, which is required for vesicular transport at the
post-Golgi stage of yeast secretion.
Length = 90
Score = 28.1 bits (63), Expect = 3.0
Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 19/78 (24%)
Query: 283 RDSKRDERKKLADEEAMRKIKQ-LEDQTHQL----QKQVATHKQEEEALLNEMEVTGQAF 337
D +++K+L E+ +K++Q +ED T L K VA ++E EA+
Sbjct: 1 ADLAEEKKKRLRAEKEKKKLEQEVEDLTASLFDEANKMVADARREREAV----------- 49
Query: 338 EDMQEQNSRLLQQLREKD 355
+ +N +L +QL+EK+
Sbjct: 50 ---EIKNEKLEEQLKEKE 64
>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
The binding of the LYPxL motif of late HIV p6Gag and
EIAV p9Gag to this domain is necessary for viral
budding.This domain is generally central between an
N-terminal Bro1 domain, pfam03097 and a C-terminal
proline-rich domain. The retroviruses thus used this
domain to hijack the ESCRT system of the cell.
Length = 292
Score = 30.0 bits (68), Expect = 3.0
Identities = 36/134 (26%), Positives = 62/134 (46%), Gaps = 17/134 (12%)
Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQV----ATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
R KL EE ++ L +L+ + + E ++ + +++ +
Sbjct: 104 RSKL--EEHEENLELLSGPEEELEALLPSSSPSKTPEVSEQISRLRELLNKLNELKAERE 161
Query: 346 RLLQQLREK--DDANFKLMTE--RIKSNQLHKLAREEK---DTLREQV-QALSLQIEAMH 397
+LL++L+EK DD KL+ ++ S+ +L EE D L+E+V Q LS Q E +
Sbjct: 162 KLLEELKEKAQDDDISKLLIALNKLGSSNEEQLFEEELKKFDPLQERVEQNLSKQEELL- 220
Query: 398 TAIRKLEEKERFLQ 411
I+ E E FLQ
Sbjct: 221 KEIQ--EANEEFLQ 232
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 30.0 bits (68), Expect = 3.0
Identities = 25/126 (19%), Positives = 46/126 (36%), Gaps = 13/126 (10%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
+ A E+A + E Q L+ Q+A+ + E + QA +++ + L
Sbjct: 86 PRDYRAALEQAEAALAAAEAQLRNLRAQLASAQALIAQ--AEAQDLDQAQNELERRA-EL 142
Query: 348 LQ------QLREKDDANFKLM---TERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
Q + ++ A + ++ Q LA E + Q Q S + A+
Sbjct: 143 AQRGVVSREELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGAQAQVASAE-AALDQ 201
Query: 399 AIRKLE 404
A LE
Sbjct: 202 AKLDLE 207
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 28.7 bits (65), Expect = 3.0
Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 10/60 (16%)
Query: 340 MQEQNSRLLQQLREKDDAN---FKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
++++ +RL E+D+A KL E + +L K E + L ++++ L + E
Sbjct: 36 LKDELARLE---AERDEARQEIVKLTEENEELRELKK----EIEELEKELEDLEQRYETT 88
>gnl|CDD|179749 PRK04147, PRK04147, N-acetylneuraminate lyase; Provisional.
Length = 293
Score = 30.0 bits (68), Expect = 3.0
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
EEK++ L+ V A+ ++ L Q +N +A E A+ A +L
Sbjct: 56 TEEKKQVLEIVAEEAKGKVKLIAQVGSVNTAEAQELAKYATELG 99
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.2 bits (68), Expect = 3.1
Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 22/121 (18%)
Query: 240 VSICTSAFQMSFYKSRWFDFRSLSDSASN------IYGSLEFRKFACVVRDSKRDERKKL 293
+ I A++M F +F+ +++ D+ + I G ++ F + + KR E +KL
Sbjct: 205 LHIPKEAYKMDFV---FFNGQNVYDNNDHKDFCVEIEGGMDEHSFEDFLLEEKRRELEKL 261
Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
A EEA R+ +Q E+Q + +++ A +A + EV E+ LQ L +
Sbjct: 262 AKEEAERE-RQAEEQRRREEEKAAMEADRAQA---KAEV---------EKRREKLQNLLK 308
Query: 354 K 354
K
Sbjct: 309 K 309
>gnl|CDD|221352 pfam11975, Glyco_hydro_4C, Family 4 glycosyl hydrolase C-terminal
domain.
Length = 231
Score = 29.7 bits (67), Expect = 3.1
Identities = 9/58 (15%), Positives = 20/58 (34%), Gaps = 6/58 (10%)
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPARL 460
L++ E+ + L KE L + ++ + + + IY + A
Sbjct: 102 LDKVEKRIFEQLEKWYKEPELIVKPKKIDDKGSRYYGD------AAIAAIYNDKEAVF 153
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 29.1 bits (66), Expect = 3.2
Identities = 17/78 (21%), Positives = 36/78 (46%), Gaps = 9/78 (11%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVAT----HKQEEEALLNEMEVTGQAF---- 337
+ ++ K E + K+L+ +LQK AT ++++E L + Q
Sbjct: 22 QLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKL 81
Query: 338 -EDMQEQNSRLLQQLREK 354
+D+Q++ LQ++ +K
Sbjct: 82 QQDLQKRQQEELQKILDK 99
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 28.8 bits (65), Expect = 3.2
Identities = 13/76 (17%), Positives = 32/76 (42%), Gaps = 2/76 (2%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
+ KR+E+++ +E+ + ++ E + + Q ++ E L + D ++
Sbjct: 14 IEEKREEKEREKEEK--ERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDED 71
Query: 343 QNSRLLQQLREKDDAN 358
+ L +Q R D
Sbjct: 72 YDEELKEQERWDDPMA 87
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 29.6 bits (67), Expect = 3.2
Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 20/97 (20%)
Query: 346 RLLQQLREK------DDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ----IEA 395
+ L+ L+EK D+ F +++ + E++ +EQV+ L Q +
Sbjct: 37 KTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKEVLSKEQVKLLKTQDLNYVRT 96
Query: 396 MHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNK 432
A K EK ++ELHL + K
Sbjct: 97 KRQAEAKKIEKL----------KEELHLTDSGSAGPK 123
>gnl|CDD|184282 PRK13730, PRK13730, conjugal transfer pilus assembly protein TrbC;
Provisional.
Length = 212
Score = 29.5 bits (66), Expect = 3.3
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 1/48 (2%)
Query: 342 EQNSRLLQQLREKDDANFKLMTER-IKSNQLHKLAREEKDTLREQVQA 388
+Q L +QLREK D K E+ + N L + D L + QA
Sbjct: 35 KQQENLSRQLREKPDHQLKAWAEKQVLENPLQRSDNHFLDELVRKQQA 82
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 30.0 bits (68), Expect = 3.4
Identities = 26/124 (20%), Positives = 59/124 (47%), Gaps = 6/124 (4%)
Query: 276 RKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQ 335
RK + R+ K +E ++ +++ + K + ++ ++ K A ++EEE +ME+ +
Sbjct: 91 RKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEE---KQMEILEK 147
Query: 336 AFEDMQEQNSRLLQQLREKDDANFKLMTERIK-SNQLHKLAREEKDTLREQVQALS--LQ 392
A + + + L+ A K ER + ++ + R++ D L+ ++ LQ
Sbjct: 148 ALKSYLKIVKEENKSLQRLAKALQKESEERTQDETKMIEEYRDKIDALKNAIEVEKEGLQ 207
Query: 393 IEAM 396
EA+
Sbjct: 208 EEAI 211
>gnl|CDD|193185 pfam12709, Kinetocho_Slk19, Central kinetochore-associated. This
is a family of proteins integrally involved in the
central kinetochore. Slk19 is a yeast member and it may
play an important role in the timing of nuclear
migration. It may also participate, directly or
indirectly, in the maintenance of centromeric tensile
strength during mitotic stagnation, for instance during
activation of checkpoint controls, when cells need to
preserve nuclear integrity until cell cycle progression
can be resumed.
Length = 87
Score = 28.1 bits (63), Expect = 3.4
Identities = 14/41 (34%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEME 331
KKL +E + K L+ + QL+KQ+ T ++E++ LL ++
Sbjct: 49 KKL--DELELENKALKQEIEQLKKQLDTERKEKQELLKLLD 87
>gnl|CDD|221270 pfam11855, DUF3375, Protein of unknown function (DUF3375). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 479 to 499 amino acids in
length.
Length = 471
Score = 30.0 bits (68), Expect = 3.7
Identities = 27/156 (17%), Positives = 58/156 (37%), Gaps = 24/156 (15%)
Query: 259 FRSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVAT 318
FR + D A + +FR+ R R R+++ D + R ++ ++
Sbjct: 177 FRQILDLARELPA--DFRRVEDNFRQLDRQLRERIIDWDGSRG--EVLEEIFD------- 225
Query: 319 HKQEEEALLNEMEVTGQAFEDMQEQ-NSRLLQQLREKDDANFKLMTERIKSNQLHKLARE 377
+A+ + + G++F + ++ E D+ + + L E
Sbjct: 226 ---GYDAIADSDQ--GRSFRAFWDFLLDP--ERQEELDEL----LRAVLARPPAQGLDPE 274
Query: 378 EKDTLREQVQALSLQIEAMHTAIRKLEEK-ERFLQT 412
++ LR + L E + R+L E RF+ +
Sbjct: 275 LREFLRRLHRDLLEAGEEVQRTRRRLSESLRRFVDS 310
>gnl|CDD|211667 TIGR01530, nadN, NAD pyrophosphatase/5'-nucleotidase NadN. This
model describes NadN of Haemophilus influenzae and a
small number of close homologs in pathogenic,
Gram-negative bacteria. NadN is a periplasmic enzyme
that cleaves NAD (nicotinamide adenine dinucleotide) to
NMN (nicotinamide mononucleotide) and AMP. The NMN must
be converted by a 5'-nucleotidase to nicotinamide
riboside for import. NadN belongs a large family of
5'-nucleotidases and has NMN 5'-nucleotidase activity
for NMN, AMP, etc [Transport and binding proteins,
Other, Biosynthesis of cofactors, prosthetic groups, and
carriers, Pyridine nucleotides].
Length = 545
Score = 29.9 bits (67), Expect = 3.7
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEM--EVTGQA 336
DERKK D K L+D + + +K E++ L E+ +TG A
Sbjct: 303 DERKKALDTLKSMKSISLDDHDAKTDSLIEKYKSEKDRLAQEIVGVITGSA 353
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 29.6 bits (67), Expect = 3.8
Identities = 19/118 (16%), Positives = 44/118 (37%), Gaps = 15/118 (12%)
Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
+E KLA + ++ + +++ +Q++ +++ K+E E E+ R
Sbjct: 52 NEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRER-----------IEELKRA 100
Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
L Q R + + +R S L + + + R + + +L E
Sbjct: 101 LAQRRSDLSSASYQLEKRRASQ----LEKLQDEIKRTRSKLNALHSLLAEKRSFLCRE 154
>gnl|CDD|219841 pfam08438, MMR_HSR1_C, GTPase of unknown function C-terminal. This
domain is found at the C-terminus of pfam01926 in
archaeal and eukaryotic GTP-binding proteins. The
C-terminal domain of the GTP-binding proteins is
necessary for the complete activity of the protein of
interacting with the 50S ribosome and binding of both
adenine and guanine nucleotides, with a preference for
guanine nucleotides.
Length = 109
Score = 28.3 bits (64), Expect = 3.9
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 400 IRKLEEKERFLQTVLTNAEKELHLRNQA 427
I KL+EK V T+AE EL LR A
Sbjct: 14 IEKLKEKYPDEPVVPTSAEAELALRKAA 41
>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3. Aip3p/Bud6p is a
regulator of cell and cytoskeletal polarity in
Saccharomyces cerevisiae that was previously identified
as an actin-interacting protein. Actin-interacting
protein 3 (Aip3p) localizes at the cell cortex where
cytoskeleton assembly must be achieved to execute
polarized cell growth, and deletion of AIP3 causes gross
defects in cell and cytoskeletal polarity. Aip3p
localization is mediated by the secretory pathway,
mutations in early- or late-acting components of the
secretory apparatus lead to Aip3p mislocalization.
Length = 426
Score = 29.6 bits (67), Expect = 4.0
Identities = 31/130 (23%), Positives = 52/130 (40%), Gaps = 11/130 (8%)
Query: 314 KQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDA------NFKLMTERIK 367
K + E ++L E+ V Q + ++ + EK D + + R
Sbjct: 148 KLNKEQRAELKSLQRELAVLRQTHNSFFTEIKESIKDILEKIDKFKSSSLSASGSSNRAY 207
Query: 368 SNQLHKLAREEKDTLREQVQALSLQIEAMH--TAIRKLEEKERFLQTV---LTNAEKELH 422
K E+ D+L +V L IEA+ A R + ++ L+TV L A KEL
Sbjct: 208 VESSKKKLSEDSDSLLTKVDDLQDIIEALRKDVAQRGVRPSKKQLETVQKELETARKELK 267
Query: 423 LRNQAMDLNK 432
+ +D+ K
Sbjct: 268 KMEEYIDIEK 277
>gnl|CDD|133905 PHA00489, PHA00489, scaffolding protein.
Length = 101
Score = 28.0 bits (62), Expect = 4.1
Identities = 22/97 (22%), Positives = 37/97 (38%), Gaps = 6/97 (6%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
+ KL D E + + QL++ + E E L +E ED+
Sbjct: 5 LEEHEAILNKLGDPELTESERT--EALQQLRESYGSFHSEYEELTEALEKLTAEKEDLIV 62
Query: 343 QNSRLLQQL----REKDDANFKLMTERIKSNQLHKLA 375
NS+L +QL E+++ K +E I L +
Sbjct: 63 SNSKLFRQLGPTKEEEEEDKQKSFSETITIEDLEGNS 99
>gnl|CDD|223790 COG0718, COG0718, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 105
Score = 28.0 bits (63), Expect = 4.2
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 4/38 (10%)
Query: 299 MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
M +++L Q Q+QK++ QEE L + EVTG+A
Sbjct: 4 MMDMQKLMKQAQQMQKKMQ-KMQEE---LAQKEVTGKA 37
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 29.7 bits (68), Expect = 4.4
Identities = 26/126 (20%), Positives = 53/126 (42%), Gaps = 13/126 (10%)
Query: 332 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQAL 389
VTG+A + + LL++L A + ER+++ +L +L +E + + A
Sbjct: 687 VTGEAALEYLNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAAAA 746
Query: 390 SLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESA----QSAADL 445
+ + A + + + L + + + LR A DL + + S + D
Sbjct: 747 AGDLLAQAKEVNGV----KVLAAQVEGVDAK-ALRTLADDL--KDKLGSGVVVLATVEDG 799
Query: 446 KLHLVI 451
K+ LV+
Sbjct: 800 KVSLVV 805
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 29.3 bits (66), Expect = 4.5
Identities = 14/68 (20%), Positives = 29/68 (42%), Gaps = 5/68 (7%)
Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
+EEE L E+ + FE + Q+L E + + E+ + + K ++ +
Sbjct: 150 EEEE--LAELRQQQRQFEQRRNAELAETQRLEEAER---RRREEKERRKKQDKERKQREK 204
Query: 381 TLREQVQA 388
E++ A
Sbjct: 205 ETAEKIAA 212
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This
family consist of exonuclease VII, large subunit
EC:3.1.11.6 This enzyme catalyzes exonucleolytic
cleavage in either 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme
is composed of one large subunit and 4 small ones.
Length = 295
Score = 29.2 bits (66), Expect = 4.7
Identities = 34/155 (21%), Positives = 56/155 (36%), Gaps = 30/155 (19%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNE------MEVTGQAFEDMQEQNSRL-- 347
E ++K++ LE + + K +Q+ LL E + Q E + E RL
Sbjct: 146 TELLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSLSRRLLEQHEERLAELRDRLIS 205
Query: 348 ----LQQLREKDDANFKLMTERIKSNQL-HKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
L ++ L E K++ L +KLA L Q++ALS +
Sbjct: 206 SIQNLLSRKQSRLERLILNRELEKNSLLENKLATA---NLTAQLKALSPL--------KT 254
Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIE 437
L +R V K + A +L +E
Sbjct: 255 L---KRGFAIVRRKDGKIVT---SAAELKPGDNLE 283
>gnl|CDD|233885 TIGR02479, FliA_WhiG, RNA polymerase sigma factor, FliA/WhiG
family. Most members of this family are the flagellar
operon sigma factor FliA, controlling transcription of
bacterial flagellar genes by RNA polymerase. An
exception is the sigma factor WhiG in the genus
Streptomyces, involved in the production of sporulating
aerial mycelium.
Length = 224
Score = 29.2 bits (66), Expect = 4.8
Identities = 32/134 (23%), Positives = 56/134 (41%), Gaps = 25/134 (18%)
Query: 299 MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ-NSRLLQQLREKDDA 357
+K ++LE +L+ ++ EEE + E+ + + + + N+ L L E ++
Sbjct: 79 RQKARKLERAIRELEARLGREPTEEE-IAEELGMDLKEYRQALNEINALSLVSLDELLES 137
Query: 358 N-----FKLMTERIKS-NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
E KS + +L REE EA+ AI L E+E Q
Sbjct: 138 GDDGGSLIDRIEDDKSEDPEEELEREEL-------------REALAEAIESLSERE---Q 181
Query: 412 TVLT-NAEKELHLR 424
VL+ +EL+L+
Sbjct: 182 LVLSLYYYEELNLK 195
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 29.3 bits (66), Expect = 4.9
Identities = 29/136 (21%), Positives = 58/136 (42%), Gaps = 11/136 (8%)
Query: 317 ATHKQEEEALLNEMEVTGQA--------FEDMQEQNSRLLQQLREKDDANFKLMTERIKS 368
A K E E +L +++VT + E M + + + L E KL + S
Sbjct: 190 AEWKLEVERVLPQLKVTIKTDAKDWRSHLEQMHQLKKAISKSLPETKQYLKKLSQD--LS 247
Query: 369 NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAM 428
L K++ EK L Q++ L + + +++EK ++ +EL+ ++ +
Sbjct: 248 KALEKISSREK-HLNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEEL 306
Query: 429 DLNKRKAIESAQSAAD 444
+ K++ E S +D
Sbjct: 307 EQVKQEMEERGASMSD 322
>gnl|CDD|217472 pfam03280, Lipase_chap, Proteobacterial lipase chaperone protein.
Length = 189
Score = 28.8 bits (65), Expect = 4.9
Identities = 15/70 (21%), Positives = 28/70 (40%), Gaps = 8/70 (11%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
R R L EA +++ QL+ + Q++ + E + +L D +Q
Sbjct: 124 QDLRAARAALVGPEAAQRLAQLDQERAAWQQRYDDYLAERDQILA------AGLSDADKQ 177
Query: 344 NSRLLQQLRE 353
+ QLR+
Sbjct: 178 --AQIAQLRQ 185
>gnl|CDD|188641 cd00954, NAL, N-Acetylneuraminic acid aldolase, also called
N-acetylneuraminate lyase (NAL). N-Acetylneuraminic
acid aldolase, also called N-acetylneuraminate lyase
(NAL), which catalyses the reversible aldol reaction of
N-acetyl-D-mannosamine and pyruvate to give
N-acetyl-D-neuraminic acid (D-sialic acid). It has a
widespread application as biocatalyst for the synthesis
of sialic acid and its derivatives. This enzyme has been
shown to be quite specific for pyruvate as the donor,
but flexible to a variety of D- and, to some extent,
L-hexoses and pentoses as acceptor substrates. NAL is
member of dihydrodipicolinate synthase family that
comprises several pyruvate-dependent class I aldolases.
Length = 288
Score = 29.2 bits (66), Expect = 5.1
Identities = 12/52 (23%), Positives = 26/52 (50%), Gaps = 3/52 (5%)
Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADL 445
E + +EE+++ + V A+ ++ L LN +++ E A+ A +L
Sbjct: 47 EGFLLS---VEERKQIAEIVAEAAKGKVTLIAHVGSLNLKESQELAKHAEEL 95
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 28.9 bits (65), Expect = 5.1
Identities = 37/173 (21%), Positives = 77/173 (44%), Gaps = 47/173 (27%)
Query: 292 KLADEEAM-----RKIKQLEDQTHQLQKQV-------------------ATHKQEEEALL 327
K A++E +K+++LE + +L Q+ A + E+ L
Sbjct: 27 KAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLK 86
Query: 328 NEMEVTGQAFEDMQEQNSRL----LQQLREKDDANFKLMTERIKS-NQLHK-------LA 375
E E ++ E+ + + ++L++K K+M ER KS ++L+K L
Sbjct: 87 KEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSYDKLYKAYKKALNLE 146
Query: 376 RE------EKDTLREQVQALSLQIEAMHTAIRKL-EEKERFLQ-TVLTNAEKE 420
+E +KD +++ +I+A++ + K+ + K++F + T N EK+
Sbjct: 147 KELYEYLKDKDLTLKELDE---KIKAVNQSYEKIQKAKKKFNEYTKKYNKEKQ 196
>gnl|CDD|192780 pfam11594, Med28, Mediator complex subunit 28. Mediator is a large
complex of up to 33 proteins that is conserved from
plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Subunit Med28 of the
Mediator may function as a scaffolding protein within
Mediator by maintaining the stability of a submodule
within the head module, and components of this submodule
act together in a gene-regulatory programme to suppress
smooth muscle cell differentiation. Thus, mammalian
Mediator subunit Med28 functions as a repressor of
smooth muscle-cell differentiation, which could have
implications for disorders associated with abnormalities
in smooth muscle cell growth and differentiation,
including atherosclerosis, asthma, hypertension, and
smooth muscle tumours.
Length = 106
Score = 28.0 bits (62), Expect = 5.2
Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 367 KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAE 418
K N+L + K TL+E+ Q L +++ K + K + + +LT+AE
Sbjct: 27 KRNELS--VFKPKKTLKEEAQKLKEEMQRKDQLQTKHDSKIDYWENLLTDAE 76
>gnl|CDD|148013 pfam06152, Phage_min_cap2, Phage minor capsid protein 2. Family of
related phage minor capsid proteins.
Length = 361
Score = 29.3 bits (66), Expect = 5.2
Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 5/94 (5%)
Query: 262 LSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEA---MRKIKQLEDQTHQLQKQVAT 318
LS +IY LE F ++R K + +LAD + K+ +L Q K +A
Sbjct: 11 LSQPVVDIYSKLELDLFDNMIRRLKTRKSLQLADNVEQWQIEKLNELGLLNKQNIKLIAK 70
Query: 319 H-KQEEEALLNEMEVTG-QAFEDMQEQNSRLLQQ 350
+ EEA+ ++ G + +D+ +Q S L +
Sbjct: 71 YTGVAEEAITRVIKNAGLKIIKDIDKQLSEALNK 104
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.4 bits (67), Expect = 5.3
Identities = 17/83 (20%), Positives = 29/83 (34%), Gaps = 24/83 (28%)
Query: 383 REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH--------LRNQAMDLNKR- 433
++ + ++ + I LEE ER L+ AE L L + L +
Sbjct: 508 KKLIGEDKEKLNEL---IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEE 564
Query: 434 ------------KAIESAQSAAD 444
+AI+ A+ AD
Sbjct: 565 DKLLEEAEKEAQQAIKEAKKEAD 587
Score = 29.0 bits (66), Expect = 8.2
Identities = 26/128 (20%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
+ +R+ +K EEA +K+ E +L+++ ++EE+ LL E E Q
Sbjct: 527 ELERELEQKA--EEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ------- 577
Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
+++ +++ D K + + K A E + + L+ E +K
Sbjct: 578 ---AIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARK----RLNKANEKKEKKKKKQ 630
Query: 404 EEKERFLQ 411
+EK+ L+
Sbjct: 631 KEKQEELK 638
>gnl|CDD|234437 TIGR03998, thiol_BshC, bacillithiol biosynthesis cysteine-adding
enzyme BshC. Members of this protein family are BshC,
an enzyme required for bacillithiol biosynthesis and
described as a cysteine-adding enzyme. Bacillithiol is a
low-molecular-weight thiol, an analog of glutathione and
mycothiol, and is found largely in the Firmicutes
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Glutathione and analogs].
Length = 528
Score = 29.2 bits (66), Expect = 5.5
Identities = 15/99 (15%), Positives = 37/99 (37%), Gaps = 11/99 (11%)
Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 372
K+ +Q+ E + E +A E + + L ++L D + + E+ +
Sbjct: 400 LKERFLERQDPEPIDALFE---EAKEQITAAHKPLREEL-AAVDPTLQGLAEKNLAKITK 455
Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
++ EK LR + +++ + + L+
Sbjct: 456 QVEFLEKRLLRAIRRKH-------DVELQQFDRIQTNLR 487
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 28.5 bits (64), Expect = 5.5
Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 5/91 (5%)
Query: 337 FEDMQEQNSRLLQQLR----EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ 392
E ++E N RL Q + A KL+ + Q K +++ + L E V+ L L+
Sbjct: 52 LELLREDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELK 111
Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHL 423
+ + + +LEE+E L+ N+ E +
Sbjct: 112 AKNLSDQVSRLEERETELKKEY-NSLHERYT 141
>gnl|CDD|222255 pfam13600, DUF4140, N-terminal domain of unknown function
(DUF4140). This family is often found at the N-terminus
of its member proteins, with DUF4139, pfam13598, at the
C-terminus.
Length = 101
Score = 27.6 bits (62), Expect = 5.5
Identities = 12/41 (29%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEAL 326
+RD A A + +LE++ L+ ++A + E+EAL
Sbjct: 56 RRDYLPDEATSPA---LAELEEELRALEDELAALEAEKEAL 93
>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
Length = 536
Score = 29.1 bits (65), Expect = 5.8
Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 11/91 (12%)
Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
E + ++ + K + E I N+ KD+ R +L + +AI
Sbjct: 388 YEFKHNVNDYFSQQKNVPVKSLEEIIAFNK--------KDSKRRIKYGQTLIEASEKSAI 439
Query: 401 RKLEEKERFLQTVLTNAEKEL--HLRNQAMD 429
K E ++ +QT NA+KEL +L + +D
Sbjct: 440 TKDEFEKV-VQTSQENAKKELDRYLVEKGLD 469
>gnl|CDD|225521 COG2974, RdgC, DNA recombination-dependent growth factor C [DNA
replication, recombination, and repair].
Length = 303
Score = 28.9 bits (65), Expect = 5.9
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 364 ERIKSNQLHKLAREEKDTLREQV 386
E I++ Q KL ++EKD L++ V
Sbjct: 88 EEIEAKQGRKLKKKEKDELKDDV 110
>gnl|CDD|191593 pfam06721, DUF1204, Protein of unknown function (DUF1204). This
family represents the C-terminus of a number of
Arabidopsis thaliana hypothetical proteins of unknown
function. Family members contain a conserved DFD motif.
Length = 232
Score = 28.9 bits (64), Expect = 6.0
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 14/81 (17%)
Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE 364
LE K+ A K AL E E +A D+ R +DDA E
Sbjct: 7 LEKCLESASKEAAHAKDAHAALAVEKEKMAEALADLA----------RCQDDAE----EE 52
Query: 365 RIKSNQLHKLAREEKDTLREQ 385
+ K+++L A+ EK LRE+
Sbjct: 53 KEKTDELADRAKSEKKRLRER 73
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 28.0 bits (63), Expect = 6.0
Identities = 19/112 (16%), Positives = 45/112 (40%), Gaps = 15/112 (13%)
Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE------ 342
+ ++ +E+A ++ + + + +L+ Q+ + E + A E
Sbjct: 9 DLREKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQR 68
Query: 343 ----QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQA 388
+ R+ QQ +E ++ +R +L + RE K + L+E+ Q
Sbjct: 69 FIRQLDQRIQQQQQELALLQQEVEAKR---ERLLEARRELKALEKLKEKKQK 117
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 29.1 bits (65), Expect = 6.0
Identities = 23/81 (28%), Positives = 41/81 (50%), Gaps = 8/81 (9%)
Query: 296 EEAMRKIKQLED---QTHQLQKQVATHKQEEEALL----NEMEVTGQAFEDMQEQNSRLL 348
EE +RK + LE+ + + QKQ+A K + E E+ Q FE ++ + S+
Sbjct: 11 EEELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKK 70
Query: 349 QQLR-EKDDANFKLMTERIKS 368
++ + EK D K++ + I S
Sbjct: 71 KKFKKEKVDVRVKVVKKWINS 91
>gnl|CDD|214787 smart00721, BAR, BAR domain.
Length = 239
Score = 28.9 bits (65), Expect = 6.4
Identities = 27/105 (25%), Positives = 39/105 (37%), Gaps = 14/105 (13%)
Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
K+ ERK L + A K+K+ + + +K K EEE + Q FE+ Q
Sbjct: 141 KKLERKLLDYDSARHKLKKAKKSK-EKKKDEKLAKAEEE-----LRKAKQEFEESNAQLV 194
Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS 390
L QL F N L L + + RE + L
Sbjct: 195 EELPQLVASRVDFFV--------NCLQALIEAQLNFHRESYKLLQ 231
>gnl|CDD|234967 PRK01642, cls, cardiolipin synthetase; Reviewed.
Length = 483
Score = 29.0 bits (66), Expect = 6.4
Identities = 7/46 (15%), Positives = 16/46 (34%)
Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN 328
+R ER +L + ++ L+ H ++ + L
Sbjct: 60 LGKRRAERARLMWPSTAKWLRDLKACKHIFAEENSEVAAPLFRLCE 105
>gnl|CDD|151391 pfam10944, DUF2630, Protein of unknown function (DUF2630). This
bacterial family of proteins have no known function.
Length = 81
Score = 26.9 bits (60), Expect = 6.6
Identities = 10/44 (22%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query: 362 MTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
M ++ ++++L EE + LR ++Q + H + +LE
Sbjct: 1 MDDQDILARINELVAEEHE-LRAKLQQGEIDESEEHARLARLEV 43
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 28.9 bits (65), Expect = 6.7
Identities = 24/158 (15%), Positives = 58/158 (36%), Gaps = 18/158 (11%)
Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
+ L E ++ ++ E+ L+ K+ ++ L N Q E+ + ++
Sbjct: 152 RALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPN-----KQDLEEYK-------KR 199
Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQV----QALSLQIEAMHTAIRKLEE- 405
L + K E K + ++E ++++ L + + + +L++
Sbjct: 200 LEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKA 259
Query: 406 -KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
+ + A K L+N A+DL + + Q
Sbjct: 260 PQNDLKRLENKYAIKSGDLKNFAVDLFGPEIRKYLQKF 297
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 28.5 bits (64), Expect = 6.9
Identities = 17/89 (19%), Positives = 42/89 (47%), Gaps = 10/89 (11%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFK 360
KI+ + + +LQK+ + E E+L N++E +++Q++ + + +K
Sbjct: 32 KIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQ-SKAEIKKLQKEIA 90
Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQAL 389
+ E I E ++ L+++ +A+
Sbjct: 91 ELKENI---------VERQELLKKRARAM 110
>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
Length = 628
Score = 29.2 bits (66), Expect = 7.3
Identities = 18/134 (13%), Positives = 47/134 (35%), Gaps = 18/134 (13%)
Query: 123 LSSTVFVYHCFSIFRFSSSTVF----VYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVY 178
S + ++ I F SS + ++S + + S+ ++ +
Sbjct: 10 SSINLSIFIILLISIFFSSNLPNISTSNASLGGNNNNNSIIN-------YNNSNFSISLL 62
Query: 179 HCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFSLTSTL 238
+ + +++ + + + + + S F+ S F++ + I + L
Sbjct: 63 KTLAFLSLINLLIWI-----LLGCPNINISLSNWLSNTLFNFSLSFIFDFYFILFFSVAL 117
Query: 239 FVSICTSAFQMSFY 252
FV+ S + S Y
Sbjct: 118 FVT--WSIIEFSHY 129
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 29.3 bits (66), Expect = 7.4
Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 19/142 (13%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE--EEALLN-EM--EVTGQAFED 339
+ R E +L E A ++ +QL+ L+ Q+ + +Q E AL + E+ E +G +
Sbjct: 200 NNRQELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAERALESTELLAEQSGDLPKS 259
Query: 340 MQEQ---N---SRLL-QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ 392
+ Q N S+ L QQ + D + +R ++Q ++ R+ +TLREQ Q L +
Sbjct: 260 IVAQFKINRELSQALNQQAQRMDLIASQ---QRQAASQTLQV-RQALNTLREQSQWLGVS 315
Query: 393 I---EAMHTAIRKLEEKERFLQ 411
EA+ + +L E + Q
Sbjct: 316 NALGEALRAQVARLPEMPKPQQ 337
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 28.0 bits (63), Expect = 7.6
Identities = 22/96 (22%), Positives = 42/96 (43%), Gaps = 12/96 (12%)
Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKER 408
++L+E+ A + Q + ARE+ + EQ + + QI A E E
Sbjct: 50 ERLKEEAQALLA------EYEQELEEAREQASEIIEQAKKEAEQI-----AEEIKAEAEE 98
Query: 409 FLQTVLTNAEKELHL-RNQAMDLNKRKAIESAQSAA 443
L+ + AE E+ + +A++ + + E A + A
Sbjct: 99 ELERIKEAAEAEIEAEKERALEELRAEVAELAVAIA 134
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 27.6 bits (62), Expect = 7.7
Identities = 13/46 (28%), Positives = 29/46 (63%), Gaps = 4/46 (8%)
Query: 284 DSKRDERKKLA----DEEAMRKIKQLEDQTHQLQKQVATHKQEEEA 325
+ K+ E +KLA +E+A ++++LE + +L+ ++A + + EA
Sbjct: 29 EEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEA 74
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 28.0 bits (63), Expect = 7.7
Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAI 436
I ++ AIR+L++K+ L+ + E E ++ A NK A+
Sbjct: 3 ILSLRKAIRELDKKQESLEKKIKKLEAE--IKKLAKKGNKDAAL 44
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 28.5 bits (64), Expect = 8.0
Identities = 24/126 (19%), Positives = 58/126 (46%), Gaps = 10/126 (7%)
Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLRE-KDD 356
+++ LE L++ + K++EE L ++ + ++++ L + LR+ D+
Sbjct: 141 RMQLLEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADE 200
Query: 357 ANFKLMTERIKSNQLHK--LAREEK-DTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
N + + ++ + + + K R+Q++ L +++ + AI L K+ L
Sbjct: 201 LN---LCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEE 257
Query: 414 LTNAEK 419
+ AEK
Sbjct: 258 IAEAEK 263
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 28.9 bits (65), Expect = 8.0
Identities = 15/94 (15%), Positives = 40/94 (42%), Gaps = 12/94 (12%)
Query: 274 EFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVT 333
RK ++ + E+++ +EE +R+ K+ + Q + +++ E +
Sbjct: 284 AIRK----GQEQQLQEKERRREEEQLRE-KEWDRQAINQARAAVLLERQ------ERRLR 332
Query: 334 GQAFEDMQEQNSRL-LQQLREKDDANFKLMTERI 366
+ + + N +L +Q +++ N +L T
Sbjct: 333 KELRKQLDSANQQLAKEQKAQQEYLNKELYTNAP 366
Score = 28.5 bits (64), Expect = 9.5
Identities = 41/208 (19%), Positives = 70/208 (33%), Gaps = 43/208 (20%)
Query: 276 RKFACVVRDSKRDERKKLA-----DEEAMRK----------IKQLEDQTHQLQKQVATHK 320
K A + D R E A EE R+ E+Q +++ +
Sbjct: 178 EKQADTLHDQLRVELDHRAQELASLEEECRRAVEQATADFNKALAEEQR---ERERREKQ 234
Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA----- 375
+E+E L E+ Q D+ +N + + +++ +R K +LA
Sbjct: 235 EEQEDNLAEIY--NQLTSDLLTENP---DVAQSSFGPH-RVIPDRWKGMSPEQLAAIRKG 288
Query: 376 ----REEKDTLREQVQALSLQIEAM-HTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL 430
+EK+ RE+ Q + + R ER + LR Q
Sbjct: 289 QEQQLQEKERRREEEQLREKEWDRQAINQARAAVLLERQER------RLRKELRKQLDSA 342
Query: 431 NKRKAIESAQSAADLKLHLVIIYTNGPA 458
N++ A E A + +YTN P
Sbjct: 343 NQQLAKE---QKAQQEYLNKELYTNAPT 367
>gnl|CDD|182123 PRK09866, PRK09866, hypothetical protein; Provisional.
Length = 741
Score = 29.0 bits (65), Expect = 8.1
Identities = 17/74 (22%), Positives = 33/74 (44%), Gaps = 1/74 (1%)
Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTN 416
AN L R +++L A++ ++ L + L++ E + I ++EE + LQ
Sbjct: 412 ANASLYALRSAAHKLLNYAQQAREYLDFRAHGLNVACEQLRQNIHQVEESLQLLQLNQAQ 471
Query: 417 AEKEL-HLRNQAMD 429
E+ H A+
Sbjct: 472 VSGEIKHEIELALT 485
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 28.7 bits (64), Expect = 8.4
Identities = 49/253 (19%), Positives = 91/253 (35%), Gaps = 68/253 (26%)
Query: 282 VRDSKRDERKKLADEEAMR---KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
+R S+ D R KLA +E + +QLE ++L + AT +L E +
Sbjct: 177 MRLSETDARIKLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKE-------LD 229
Query: 339 DMQEQNSRLLQQLREKDDANF---KLMTERIKSNQLHKLAREE---KDTLRE-------- 384
++E+N L KDD F +L+ ++ KL +E +LRE
Sbjct: 230 VLKEENMLL------KDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVA 283
Query: 385 ---QVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQS 441
+ LQ + K+E + L EK + +Q DL R ++ +
Sbjct: 284 QEDVSKLSPLQYDCW---WEKVENLQDLLDRATNQVEKAALVLDQNQDL--RDKVDKLE- 337
Query: 442 AADLKLHLVIIYTNGPARLEFGKDREFSAKFVNFRRKSYFFVEFLDCLRTRYETRQRKCP 501
A L+ +FS+ ++ L+ + + + +
Sbjct: 338 ----------------ASLKEANVSKFSS-------------YKVELLQQKLKLLEERLQ 368
Query: 502 KCNAKANDYHRLY 514
+ + + Y +LY
Sbjct: 369 ASDHEIHSYIQLY 381
>gnl|CDD|233419 TIGR01450, recC, exodeoxyribonuclease V, gamma subunit. This model
describes the gamma subunit of exodeoxyribonuclease V.
Species containing this protein should also have the
alpha (TIGR01447) and beta (TIGR00609) subunits.
Candidates from Borrelia and from the Chlamydias differ
dramatically and score between trusted and noise cutoffs
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 1067
Score = 28.9 bits (65), Expect = 8.5
Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 18/81 (22%)
Query: 326 LLNEMEVTGQ--AFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLR 383
LL E+E + + AF + LL QL + D + + R N+ L +++
Sbjct: 302 LLPELEQSQEIDAFLEATPDT--LLHQL-QADILHLE---PRPVDNKRPSLPLDDR---- 351
Query: 384 EQVQALSLQIEAMHTAIRKLE 404
S+Q A H+ +R++E
Sbjct: 352 ------SIQFHACHSPLREVE 366
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 28.5 bits (64), Expect = 8.6
Identities = 16/96 (16%), Positives = 35/96 (36%), Gaps = 11/96 (11%)
Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
R +RK + K K+ + + K+ K++ L + E +E
Sbjct: 272 PVRPKRKT---KAQRNKEKR-RKELEREAKEEKQLKKKLAQLA-------RLKEIAKEVA 320
Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
+ + R+K+ + +++K +L K E
Sbjct: 321 QKEKARARKKEQRKERGEKKKLKRRKLGKHKYPEPP 356
>gnl|CDD|219969 pfam08687, ASD2, Apx/Shroom domain ASD2. This region is found in
the actin binding protein Shroom which mediates apical
contriction in epithelial cells and is required for
neural tube closure.
Length = 274
Score = 28.5 bits (64), Expect = 8.9
Identities = 29/100 (29%), Positives = 42/100 (42%), Gaps = 21/100 (21%)
Query: 315 QVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKL 374
+ K E LL +M+ DMQ+ Q E D+ + E I S KL
Sbjct: 69 STSAAKAE---LLIKMK-------DMQQ-----GQSEPEPDNDLQQKKVELISSLG-KKL 112
Query: 375 A--REEKDTLREQVQA---LSLQIEAMHTAIRKLEEKERF 409
EE+++L E +QA L ++EA K E E+F
Sbjct: 113 QDLHEEQESLLEDIQANNRLGEEVEAKVKQACKPNEFEKF 152
>gnl|CDD|201451 pfam00804, Syntaxin, Syntaxin. Syntaxins are the prototype family
of SNARE proteins. They usually consist of three main
regions - a C-terminal transmembrane region, a central
SNARE domain which is characteristic of and conserved in
all syntaxins (pfam05739), and an N-terminal domain that
is featured in this entry. This domain varies between
syntaxin isoforms; in syntaxin 1A it is found as three
alpha-helices with a left-handed twist. It may fold back
on the SNARE domain to allow the molecule to adopt a
'closed' configuration that prevents formation of the
core fusion complex - it thus has an auto-inhibitory
role. The function of syntaxins is determined by their
localisation. They are involved in neuronal exocytosis,
ER-Golgi transport and Golgi-endosome transport, for
example. They also interact with other proteins as well
as those involved in SNARE complexes. These include
vesicle coat proteins, Rab GTPases, and tethering
factors.
Length = 103
Score = 26.9 bits (60), Expect = 9.4
Identities = 22/94 (23%), Positives = 43/94 (45%), Gaps = 7/94 (7%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHK---QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
EE +I+++ +LQK ++ L E+E Q + + + L+ L+
Sbjct: 10 EEIREEIQKIRQNLEELQKLHKRILTAPDSDKELREELEELTQEIKQLARAIKKKLKSLQ 69
Query: 353 EKDDANFKLMT----ERIKSNQLHKLAREEKDTL 382
+ + + L +RI+ NQL L+++ KD L
Sbjct: 70 QLNKSEQGLGHSSAVDRIRKNQLANLSKKFKDVL 103
>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
and chromosome partitioning].
Length = 212
Score = 28.2 bits (63), Expect = 9.5
Identities = 20/149 (13%), Positives = 55/149 (36%), Gaps = 18/149 (12%)
Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF 359
+ + LED+ +L++++ E+ +Q ++L++ +A
Sbjct: 44 DENEDLEDEIDELKEELKEAADAED-------------SQAIQQAETEAEELKQAAEAEA 90
Query: 360 KLMTERI--KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNA 417
+ +R ++ ++ A+ + D E + ++ ++E + + ++ R +
Sbjct: 91 DDILKRASAQAQRVFGKAQYKADRYLEDAK-IAQELEDLKRQAQVERQRLRSDIEAQLAS 149
Query: 418 EKELHLRNQAMDLNKR--KAIESAQSAAD 444
K+ A E A+ AD
Sbjct: 150 AKQEDWDEILRSTVDEVEAANEEAERLAD 178
>gnl|CDD|163611 pfam10046, BLOC1_2, Biogenesis of lysosome-related organelles
complex-1 subunit 2. Members of this family of proteins
play a role in cellular proliferation, as well as in the
biogenesis of specialized organelles of the
endosomal-lysosomal system.
Length = 99
Score = 26.9 bits (60), Expect = 9.5
Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 4/64 (6%)
Query: 346 RLLQQLREKDDANFKLMTERIKS--NQLHKLAR--EEKDTLREQVQALSLQIEAMHTAIR 401
+LL+++ E +K M + K ++ L + EE +Q+ A+ Q+ + A+
Sbjct: 24 KLLEKMNENTALKYKKMRQSAKGLRVEIEALNQKYEELKPYLQQIDAIEQQVTTLEEAVY 83
Query: 402 KLEE 405
+L+E
Sbjct: 84 ELDE 87
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 27.2 bits (61), Expect = 9.6
Identities = 24/135 (17%), Positives = 54/135 (40%), Gaps = 23/135 (17%)
Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN-SRLLQQLREK 354
++A R + + + + Q ++++ +Q ++ +GQ + +N + L E
Sbjct: 1 DKAARNLAEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDE- 59
Query: 355 DDANFKLMTERIKSNQLH-KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
I Q + A ++ + REQ Q + + + + KL E+E+
Sbjct: 60 ----------AIAQQQQELEQAEKQVEQAREQWQEANQERKKL----EKLLEREK----- 100
Query: 414 LTNAEKELHLRNQAM 428
+KE + R Q +
Sbjct: 101 -KKEQKEENRREQKL 114
>gnl|CDD|235389 PRK05287, PRK05287, hypothetical protein; Provisional.
Length = 250
Score = 28.2 bits (64), Expect = 9.7
Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 11/57 (19%)
Query: 302 IKQLEDQTHQLQK-----QVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
+K+LE Q +LQK V EALL E+E QA + R+ Q LRE
Sbjct: 64 LKELERQRQKLQKWRGNPGVDQEA--LEALLQELE---QASAALNAAP-RIGQSLRE 114
>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex
component.
Length = 233
Score = 28.0 bits (63), Expect = 9.8
Identities = 24/97 (24%), Positives = 36/97 (37%), Gaps = 21/97 (21%)
Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE 364
L + +L KQ+ + E E+L + +A QE+ + +L E E
Sbjct: 127 LSNTIDELNKQIELLEAEVESLSVQR-KKRKADAAKQER----IAELEEL--------LE 173
Query: 365 RIKSNQ------LHKLAREEKDTLREQVQALSLQIEA 395
R K + L L E D EQV + IE
Sbjct: 174 RHKFHISKLELILRLLENGELDP--EQVDDIKEDIEY 208
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.325 0.134 0.390
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,339,082
Number of extensions: 2470202
Number of successful extensions: 5613
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5003
Number of HSP's successfully gapped: 633
Length of query: 517
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 416
Effective length of database: 6,457,848
Effective search space: 2686464768
Effective search space used: 2686464768
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 61 (27.4 bits)