RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2547
(1807 letters)
>gnl|CDD|215682 pfam00055, Laminin_N, Laminin N-terminal (Domain VI).
Length = 237
Score = 271 bits (695), Expect = 5e-83
Identities = 127/246 (51%), Positives = 158/246 (64%), Gaps = 18/246 (7%)
Query: 40 CYPATGNLLIGRENKLRASSTCGLHGNSTFCIVSFLEDK-KKCFTCNSSPRFDNNPAFSH 98
CYPATGNL I R L A+STCGLHG +CI+S L+ + KKCF C+S+ NP SH
Sbjct: 1 CYPATGNLAINRA--LSATSTCGLHGPEPYCILSHLQPRDKKCFLCDSN---SPNPRESH 55
Query: 99 KIHNIVYATVPRTKTISWWQSE---NGVE--NVTIQLDMEAEFHFTHLIIYFKTFRPAAM 153
I + P+ T WWQSE NGV+ NVTI LD+EAEFHFT++II FKTFRPAAM
Sbjct: 56 PISFLTDTFNPQDTT--WWQSETMQNGVQYPNVTITLDLEAEFHFTYVIIKFKTFRPAAM 113
Query: 154 LIERSHDFGKTWQVYRYFAADCAKSFPGIPRDGLRKL---TDVYCESRYSSNNPSSGGEV 210
+ ERS DFG TW Y+Y+A C ++PGIPR +R DV C SRYS P + GEV
Sbjct: 114 IYERSSDFG-TWIPYQYYAYSCESTYPGIPRRPIRTGRAEDDVICTSRYSDIEPLTEGEV 172
Query: 211 ILRVLPPSLHHSNIDPYSSDVQNLIKMTNLRINFTKLHTLGDDLLDTRRQIQEKYYYAIT 270
I L N DP S +Q +K TN+RI +LHTLGD+LLD+ ++ EKYYYAI+
Sbjct: 173 IFSTLEGRPSADNFDP-SPRLQEWLKATNIRITLQRLHTLGDNLLDSDPEVLEKYYYAIS 231
Query: 271 QMIVRG 276
++V G
Sbjct: 232 DIVVGG 237
>gnl|CDD|214532 smart00136, LamNT, Laminin N-terminal domain (domain VI).
N-terminal domain of laminins and laminin-related
protein such as Unc-6/ netrins.
Length = 238
Score = 253 bits (648), Expect = 1e-76
Identities = 99/250 (39%), Positives = 139/250 (55%), Gaps = 19/250 (7%)
Query: 34 SCERKSCYPATGNLLIGRENKLRASSTCGLHGNSTFCI-VSFLEDKKKCFTCNSSPRFDN 92
+ +SCYP NL GRE + A+STCG G +C V E KKC C++
Sbjct: 1 AGRPRSCYPPFVNLAFGRE--VTATSTCGEPGPERYCKLVGHTEQGKKCDYCDA-----R 53
Query: 93 NPAFSHKIHNIVYATVPRTKTISWWQSEN---GVENVTIQLDMEAEFHFTHLIIYFKTFR 149
NP SH N+ P T WWQSE G +NV + LD+ EFH T++I+ F + R
Sbjct: 54 NPRRSHPAENLTDGNNPNNPT--WWQSEPLSNGPQNVNLTLDLGKEFHVTYVILKFCSPR 111
Query: 150 PAAMLIERSHDFGKTWQVYRYFAADCAKSFPGIPRDGLRKL--TDVYCESRYSSNNPSSG 207
P+ ++ERS DFGKTWQ ++YF++DC ++F PR + K +V C S YS P G
Sbjct: 112 PSLWILERS-DFGKTWQPWQYFSSDCRRTFGRPPRGPITKGNEDEVICTSEYSDIVPLEG 170
Query: 208 GEVILRVLPPSLHHSNIDPYSSDV-QNLIKMTNLRINFTKLHTLGDDLLDTRRQIQEKYY 266
GE+ +L S D +S V Q + TN+R+ T+L TLGD+L+D R ++ +YY
Sbjct: 171 GEIAFSLLEG--RPSATDFDNSPVLQEWVTATNIRVRLTRLRTLGDELMDDRPEVTRRYY 228
Query: 267 YAITQMIVRG 276
YAI+ + V G
Sbjct: 229 YAISDIAVGG 238
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 86.3 bits (214), Expect = 1e-16
Identities = 101/607 (16%), Positives = 230/607 (37%), Gaps = 103/607 (16%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
E LL K+++++ +E +L+ + ++++++++ E +
Sbjct: 225 LELALLLAKLKELRKELEELEE---ELSRLEEELEELQEELEEAE-------------KE 268
Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
I + L +L EE+++L + E+L+ + +EG + L R+ + +L+ + E
Sbjct: 269 IEELKSELEELREELEELQE--ELLELKEEI---EELEGEISLLRERLE---ELENELEE 320
Query: 1334 TEKQINDAERQCKRTENLVTKTFPQ--------------FTQGQEENEKALQRLNEKITS 1379
E+++ + + + + + + + + +E+ L+ L E +
Sbjct: 321 LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA 380
Query: 1380 LEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLIN 1439
L + L +L + + R + L E + E + ++
Sbjct: 381 LREELAELEAELAEIRNE-----------------------LEELKREIESLEERLERLS 417
Query: 1440 EKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANIL 1499
E+ +E K + +E + E + + E E + EE+ + ++E
Sbjct: 418 ERLEDLKEELK--ELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEE 475
Query: 1500 AKEAYDHAASVRNKTAAYVA--STSNITKQLDEFLNA--PGA--TLADIRNISGLAVNKN 1553
+ +S+ + A S + + E L + PG +A++ + V +
Sbjct: 476 LQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAEL-----IKVKEK 530
Query: 1554 IQSNPEQIRERA--NEINNVIKSLTDIDTILTETAGDLAKANDLKR----KANLTKAGAD 1607
E E A N + V+ + + + + A + K N R + K
Sbjct: 531 Y----ETALEAALGNRLQAVV---VENEEVAKK-AIEFLKENKAGRATFLPLDRIKPLRS 582
Query: 1608 STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
+ + + ++ E A++ D + +DL+ A++ + + +
Sbjct: 583 LKSDAAPGFLGLASDLIDFDPKYEPAVRFVLGDTL--------VVDDLEQARRLARKLRI 634
Query: 1668 KVKL------LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQET 1721
K ++ L E SI G RS++ + E K LE+E+A + Q L +
Sbjct: 635 KYRIVTLDGDLVEPSGSITGGSRNK-RSSLAQKRELKELEEELAELEAQLEKLEEELKSL 693
Query: 1722 DNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNI 1781
N+L S + R + + L + +L A ++L + ++ E EL EL + +
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753
Query: 1782 AELQKRI 1788
ELQ+R+
Sbjct: 754 EELQERL 760
Score = 70.1 bits (172), Expect = 1e-11
Identities = 51/360 (14%), Positives = 129/360 (35%), Gaps = 63/360 (17%)
Query: 1478 KAEEVFRGITTAKQESHAANILA--KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAP 1535
K + + + E A + A ++ + S++N+ + + +QL+E
Sbjct: 659 KRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE--- 715
Query: 1536 GATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDL 1595
R + L + + + E++ + + + + + L +++ L E L + +
Sbjct: 716 -------RQLEEL--KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEE 766
Query: 1596 KRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDL 1655
A +++ L E + ++ A+ + A ++L +
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELEELEEE---LEEAERRLDALERELESLEQRR 823
Query: 1656 DDAQQKSNETNVKVKLLQERLKSIQA--GFLQNGRSAM-----DVENEEKNLEKEVALAQ 1708
+ +Q+ E +++ L+E+L ++ L+ + ++E E++ LE E+ +
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELE 883
Query: 1709 KQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS------------------ 1750
++ L +E +++L E R R + L K +L
Sbjct: 884 EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTL 943
Query: 1751 --------------------VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
VN A +++ E E + +S+ +L + +L + I+
Sbjct: 944 ETELEREIERLEEEIEALGPVNLRA-IEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002
Score = 61.7 bits (150), Expect = 3e-09
Identities = 57/371 (15%), Positives = 127/371 (34%), Gaps = 51/371 (13%)
Query: 1442 ESKAEELRKHMIILSE-SNSAK---DYAETSKKL--INEKEAKAEEVFRGITT-----AK 1490
+K EE RK ++ E + +K E +KL E + E++ + +
Sbjct: 151 NAKPEERRK---LIEEAAGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLER 207
Query: 1491 QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAV 1550
Q A +A + A +A + K+L+E ++ +
Sbjct: 208 QAEKAERYQELKAELRELEL----ALLLAKLKELRKELEELEEELSRLEEELEELQ---- 259
Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTK 1610
E++ E EI + L ++ +L + + + +
Sbjct: 260 --------EELEEAEKEIEELKSELEELRE-------ELEELQEELLELKEEIEELEGEI 304
Query: 1611 NTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVK 1670
+ + + L E + E ++ K+ I A +++L + L++ +Q E +
Sbjct: 305 S---LLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361
Query: 1671 LLQERLKSIQAGF---LQNGRSAMDVENEEK--------NLEKEVALAQKQASGLRSRYQ 1719
L+E+L ++ + R + E L++E+ +++ L R +
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421
Query: 1720 ETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSK 1779
+ +L + + L + +L +L E E + EL L K
Sbjct: 422 DLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEK 481
Query: 1780 NIAELQKRIQS 1790
++ L+ R+
Sbjct: 482 ELSSLEARLDR 492
Score = 53.2 bits (128), Expect = 1e-06
Identities = 74/439 (16%), Positives = 165/439 (37%), Gaps = 77/439 (17%)
Query: 1259 DPNGVGKTLGNTTQRINLAR-LALNDLTEEIKKLNK-TGEMLKENATLLQEN-NVEGAL- 1314
+P+G T G+ +R +LA+ L +L EE+ +L ++ +E +L E ++E L
Sbjct: 647 EPSGS-ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLE 705
Query: 1315 ELTRQAHDTSLK---LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQ 1371
EL RQ + + LK + E+++ + + + E + + + EE ++ L+
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELE----ELEEELEELQERLE 761
Query: 1372 RLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYA 1431
L E++ SLE+ L L E I ++
Sbjct: 762 ELEEELESLEEALAKLKEEI---------------------------------EELEEKR 788
Query: 1432 ETSKKLINEKESKAEELRKHMIILSE----SNSAKDYAETSKKLINEKEAKAEEVFRGIT 1487
+ ++ + E E + EE + + L ++ E + + E+ + EE +
Sbjct: 789 QALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848
Query: 1488 TAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISG 1547
+E KE + + + + + ++L+E ++R +
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE----------ELRELES 898
Query: 1548 LAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKAN---DLKRKANLTKA 1604
+ E++RER E+ ++ L L E + + +L+R+ +
Sbjct: 899 ELAELKEEI--EKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEE 956
Query: 1605 GADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE 1664
++ V + E ++ E +++ ++D+ A++ L ++ +LD
Sbjct: 957 EIEALGP-VNL--RAIEEYEEVEERYE-ELKSQREDLEEAKEKLLEVIEELD-------- 1004
Query: 1665 TNVKVKLLQERLKSIQAGF 1683
K + +E I F
Sbjct: 1005 -KEKRERFKETFDKINENF 1022
Score = 44.7 bits (106), Expect = 5e-04
Identities = 52/320 (16%), Positives = 108/320 (33%), Gaps = 63/320 (19%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMN-----NEDPNGVGKTLG 1268
E LE +E+++ +E R +L L +++ +Q+ + E+ + + L
Sbjct: 695 NELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELE 754
Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQEN--NVEGALELTR-------- 1318
+R+ L L E + KL + E L+E LQE +E LE
Sbjct: 755 ELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALER 814
Query: 1319 ----------QAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEK 1368
+ +L+E+ E E+++++ E + + E + + + EE E
Sbjct: 815 ELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL----EELEA 870
Query: 1369 ALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAK 1428
+ L +++ LE+ +L E + + + L+E
Sbjct: 871 EKEELEDELKELEEEKEELEEELRELESE-----------------------LAELKEEI 907
Query: 1429 DYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITT 1488
+ + + K + E + E ++ I E + E +
Sbjct: 908 EKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEA-LGPV-- 964
Query: 1489 AKQESHAANILAKEAYDHAA 1508
N+ A E Y+
Sbjct: 965 --------NLRAIEEYEEVE 976
Score = 44.7 bits (106), Expect = 6e-04
Identities = 43/276 (15%), Positives = 99/276 (35%), Gaps = 50/276 (18%)
Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIM-NNEDPNGVGKTLGNTTQRINLA 1277
LE+ + +K +E L ++++++++ + + + L + QR
Sbjct: 770 LEEALAKLKEEIEELEEKRQALQ---EELEELEEELEEAERRLDALERELESLEQRRERL 826
Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQ 1337
+ +L EEI++L +E L+E E EL + + +L+ + E E +
Sbjct: 827 EQEIEELEEEIEEL-------EEKLDELEEELEELEKEL-EELKEELEELEAEKEELEDE 878
Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGD 1397
+ + E + + E + + E ++ +++L E++ LE L L +
Sbjct: 879 LKELEEEKEELEEELRE----LESELAELKEEIEKLRERLEELEAKLERLEVEL------ 928
Query: 1398 PCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSE 1457
+ E L E E + E L + + L
Sbjct: 929 -------------------------PELEEELEEEYEDTLETELEREIERLEEEIEALGP 963
Query: 1458 SNSA--KDYAETSKKLINEKEAKAEEVFRGITTAKQ 1491
N ++Y E ++ E +++ E++ +
Sbjct: 964 VNLRAIEEYEEVEERY-EELKSQREDLEEAKEKLLE 998
Score = 44.3 bits (105), Expect = 6e-04
Identities = 32/179 (17%), Positives = 71/179 (39%), Gaps = 14/179 (7%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
+E E LE+K+++++ +E + L + +++++ + + L +
Sbjct: 835 EEIEELEEKLDELEEELEELEKE---LEELKEELEELEAEKEE------LEDELKELEEE 885
Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELT-RQAHDTSLKLKEQST 1332
L +L E+ +L + E L+E L+ +EL + T
Sbjct: 886 KEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLET 945
Query: 1333 ETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
E E++I E + + + + ++ EE E+ + L + LE+ L E+I
Sbjct: 946 ELEREIERLEEEIEALGPVNLRAIEEY----EEVEERYEELKSQREDLEEAKEKLLEVI 1000
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 80.6 bits (199), Expect = 5e-15
Identities = 93/552 (16%), Positives = 210/552 (38%), Gaps = 72/552 (13%)
Query: 1213 TKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ 1272
+ E LE+KIE ++ L L L + + +++ + L + +
Sbjct: 280 ERLLEELEEKIERLEELEREIEE----LEEELEGLRALLEELEELLEK------LKSLEE 329
Query: 1273 RINLARLALNDLTEEIKKL----NKTGEMLKENATLLQENN------VEGALELTRQAHD 1322
R+ L L E+++L N+ ++L+E L+E +E ALE +Q +
Sbjct: 330 RLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEE 389
Query: 1323 TSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEK 1382
+LKE+ E + + + + + E + + + EE E+ +++L E+I LE
Sbjct: 390 AIQELKEELAELSAALEEIQEELEELEKELEEL----ERELEELEEEIKKLEEQINQLES 445
Query: 1383 NLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE 1442
+ EL G+ C +C G + + E + + E+
Sbjct: 446 KELMIAEL--AGAGEKCP-VC-GQELPE-----------EHEKELLELYELELEELEEEL 490
Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
S+ +E + + E E + E E +E ++ N+L +
Sbjct: 491 SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELE-----EKLEKLENLLEEL 545
Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIR 1562
+ + + ++L E L L +R + ++ E+++
Sbjct: 546 EELKEKLQLQQLKEELRQLEDRLQELKELL----EELRLLRT-----RKEELEELRERLK 596
Query: 1563 ERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTE 1622
E ++ + + L+ ++ +L A+ N+L+ + ++ K +Q ++ L +
Sbjct: 597 ELKKKLKELEERLSQLEELLQSLELSEAE-NELEEA--EEELESELEKLNLQAELEELLQ 653
Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL--LQERLKSIQ 1680
A + E ++ + +I R++L +I N+ ++ ++ +L L+E L+ +
Sbjct: 654 AALEE--LEEKVEELEAEI---RRELQRIENEEQLEEKLEELEQLEEELEQLREELEELL 708
Query: 1681 AGFLQNGRSAMDVENEEKNLE---KEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRA 1737
+ + ++E+ + LE KE+ +K L E + L KA
Sbjct: 709 KKLGEIEQLIEELESRKAELEELKKELEKLEKALELL-----EELREKLGKAGLRADILR 763
Query: 1738 RGQM-LLNKASQ 1748
+ +A++
Sbjct: 764 NLLAQIEAEANE 775
Score = 69.4 bits (170), Expect = 1e-11
Identities = 98/559 (17%), Positives = 202/559 (36%), Gaps = 80/559 (14%)
Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
L L EE+K+L K E+ +E +E +E +E + + KE+ E + ++ +
Sbjct: 205 LEALEEELKELKKLEEIQEE----QEEEELEQEIEALEERLAELEEEKERLEELKARLLE 260
Query: 1341 AERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL---------I 1391
E + + + + EE E+ ++RL E +E+ +L L +
Sbjct: 261 IESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEEL 320
Query: 1392 CDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEE---- 1447
+K + L K SE + KL+ E+ + EE
Sbjct: 321 LEKLKSLEERL--EKLEEKL------EKLESELEELAEEKNELAKLLEERLKELEERLEE 372
Query: 1448 --------------LRKHMIILSESNSAKDYA-ETSKKLINEKEAKAEEVFRGITTAKQE 1492
L + + L E + A E ++ + E E + EE+ R + ++E
Sbjct: 373 LEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432
Query: 1493 SHAANILAKEAYDHAASVRNKTAAYVASTSNI----TKQLDEFLNAPGATLADIRNISGL 1548
I E + + A +A ++L E L ++ +
Sbjct: 433 -----IKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHE---KELLELYELELE 484
Query: 1549 AVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADS 1608
+ + + ++ E EI + K L +++ L E + + L K
Sbjct: 485 ELEEELSRE-KEEAELREEIEELEKELRELEEELIELLELEEALKEEL-EEKLEKL---- 538
Query: 1609 TKNTVQKIVDVLTEARTAQ-----DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSN 1663
+N ++++ ++ + + Q E +Q K+ + R L +L++ +++
Sbjct: 539 -ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL-LRTRKEELEELRERLK 596
Query: 1664 ETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV--ALAQKQASG-----LRS 1716
E K+K L+ERL ++ L + ENE + E+E+ L + L++
Sbjct: 597 ELKKKLKELEERLSQLEE--LLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQA 654
Query: 1717 RYQETDNKL------LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQ 1770
+E + K+ + + + + L + QL ++L E
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEI 714
Query: 1771 ESELTELSKNIAELQKRIQ 1789
E + EL AEL++ +
Sbjct: 715 EQLIEELESRKAELEELKK 733
Score = 69.0 bits (169), Expect = 2e-11
Identities = 65/474 (13%), Positives = 154/474 (32%), Gaps = 88/474 (18%)
Query: 1364 EENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE 1423
E+ EK + L E I + + +L + +
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQL----------------------------SELL 198
Query: 1424 SNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVF 1483
+ ++L ++ K EE+++ + E +L EKE E
Sbjct: 199 EDIEDLLEALEEEL--KELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKA 256
Query: 1484 RGITTAKQESHAANILA---KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLA 1540
R + E A I +E + K I + +E A
Sbjct: 257 RLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGL----RA 312
Query: 1541 DIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKAN 1600
+ + + + + + ER ++ ++ L L E +LAK L+ +
Sbjct: 313 LLEEL------EELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKL--LEERLK 364
Query: 1601 LTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQ 1660
+ + + ++K ++ L + A + + + +++L ++ +L++ ++
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424
Query: 1661 KSNETNVKVKLLQERLKSIQAG--FLQNGRSAMDV--------------------ENEEK 1698
+ E ++K L+E++ +++ + A + E E +
Sbjct: 425 ELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELE 484
Query: 1699 NLEKEVALAQKQASGLRSRYQETDNKL-------------------LSKAESSGLKRARG 1739
LE+E++ K+ + LR +E + +L + + L+
Sbjct: 485 ELEEELSRE-KEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543
Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
++ K +L+QL + EL L EL++ ++ +
Sbjct: 544 ELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEE-LRERLK 596
Score = 54.0 bits (130), Expect = 8e-07
Identities = 49/284 (17%), Positives = 110/284 (38%), Gaps = 36/284 (12%)
Query: 1527 QLDEFLNAPGATLADI-RNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET 1585
+ D FL + +I + GL + + E ++E E I+ L + L E
Sbjct: 144 EFDAFLKSKPKERKEILDELFGLEKYEKLS---ELLKEVIKEAKAKIEELEGQLSELLED 200
Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAAR 1645
DL +A + + K + E + E A++ ++ +
Sbjct: 201 IEDLLEALEEELKELKKLEEIQEEQ-----------EEEELEQEIE-ALEERLAELEEEK 248
Query: 1646 KDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE---K 1702
+ L ++ L + + E +K+ +E L+ ++ + +E + + LE +
Sbjct: 249 ERLEELKARLLEIESLELE---ALKIREEELRELERLLEE-------LEEKIERLEELER 298
Query: 1703 EVALAQKQASGLRSRYQETDNKL-LSKAESSGLKRARGQM--LLNKASQLSVNTTAKLKQ 1759
E+ +++ GLR+ +E + L K+ L++ ++ L ++ +L+ K
Sbjct: 299 EIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKL 358
Query: 1760 LNETEAMFNSQ----ESELTELSKNIAELQKRIQSCINFIVDKS 1799
L E + E EL + + + +L++ IQ + + S
Sbjct: 359 LEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELS 402
Score = 48.6 bits (116), Expect = 4e-05
Identities = 43/328 (13%), Positives = 112/328 (34%), Gaps = 20/328 (6%)
Query: 1470 KLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLD 1529
+ K + +E+ + ++ + L KE A + + ++ + L
Sbjct: 147 AFLKSKPKERKEILDELF-GLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLL 205
Query: 1530 EFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDT-ILTETAGD 1588
E L L + I + ++ E + ER E+ + L ++ +L + +
Sbjct: 206 EALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLE 265
Query: 1589 LAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAK-DDISAARKD 1647
L + + + + + ++++ ++ E + E+ A +++ +
Sbjct: 266 LEALKIREEELRELERLLEELEEKIERLEELEREI--EELEEELEGLRALLEELEELLEK 323
Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKS----IQAGFLQNGRSAMDVENEEKNLEKE 1703
L + L+ ++K + +++ L E ++ + ++E E + +
Sbjct: 324 LKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALER 383
Query: 1704 VALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET 1763
+ ++ L+ E L E L + + +L++L E
Sbjct: 384 LKQLEEAIQELKEELAELSAALEEIQEE-----------LEELEKELEELERELEELEEE 432
Query: 1764 EAMFNSQESELTELSKNIAELQKRIQSC 1791
Q ++L IAEL + C
Sbjct: 433 IKKLEEQINQLESKELMIAELAGAGEKC 460
>gnl|CDD|214543 smart00180, EGF_Lam, Laminin-type epidermal growth factor-like
domai.
Length = 46
Score = 68.1 bits (167), Expect = 3e-14
Identities = 33/46 (71%), Positives = 33/46 (71%)
Query: 773 CDCDPTGSTSKYCDQLGGNCVCKPNVVGRRCDRCAPGTYGFGPEGC 818
CDCDP GS S CD G C CKPNV GRRCDRCAPG YG GP GC
Sbjct: 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
Score = 58.9 bits (143), Expect = 6e-11
Identities = 18/42 (42%), Positives = 26/42 (61%)
Query: 821 CDCNHVGALDNICDVVSGQCKCRAQTYGRECDQCSPGYWNYP 862
CDC+ G+ CD +GQC+C+ GR CD+C+PGY+
Sbjct: 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42
Score = 57.7 bits (140), Expect = 1e-10
Identities = 22/47 (46%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 1079 CDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNPNVEC 1125
CDCDP GS T C+ TGQC+CK GR+CD+C ++G+ C
Sbjct: 1 CDCDPGGSASGT-CDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
Score = 56.6 bits (137), Expect = 3e-10
Identities = 29/56 (51%), Positives = 32/56 (57%), Gaps = 10/56 (17%)
Query: 404 CDCDPGGSLDDGICDPYTDDVLVSGKCHCKPNVGGRRCDMCKDGFWNFNETNPNGC 459
CDCDPGGS CDP T G+C CKPNV GRRCD C G++ P GC
Sbjct: 1 CDCDPGGSASGT-CDPDT------GQCECKPNVTGRRCDRCAPGYYGD---GPPGC 46
Score = 53.9 bits (130), Expect = 3e-09
Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 462 CTCNIQGTINSQGCNKDTGDCTCKRNVEGRDCNQCLPHFWGLS 504
C C+ G+ + C+ DTG C CK NV GR C++C P ++G
Sbjct: 1 CDCDPGGSASGT-CDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42
Score = 53.1 bits (128), Expect = 5e-09
Identities = 23/46 (50%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 1128 CDCDPSGSATSQCHQNNGTCVCHQGIGGVRCDTCARGYIG-TAPDC 1172
CDCDP GSA+ C + G C C + G RCD CA GY G P C
Sbjct: 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
Score = 50.4 bits (121), Expect = 6e-08
Identities = 21/49 (42%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 1028 CTCNILGIDHSKGPCDRTTGQCPCLPNVIGVSCDRCLKNHWKIASGTGC 1076
C C+ G + G CD TGQC C PNV G CDRC ++ GC
Sbjct: 1 CDCDPGG--SASGTCDPDTGQCECKPNVTGRRCDRCAPGYYG-DGPPGC 46
Score = 45.8 bits (109), Expect = 2e-06
Identities = 20/39 (51%), Positives = 24/39 (61%)
Query: 514 CDCDPGASYDNNCDVISGQCKCKPHASGRTCSTPEQSYY 552
CDCDPG S CD +GQC+CKP+ +GR C YY
Sbjct: 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYY 39
Score = 39.6 bits (93), Expect = 3e-04
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 867 CICN--GHADL-CDSKTGTCISCRNSTDGENCERCINGYYGDPRLG 909
C C+ G A CD TG C C+ + G C+RC GYYGD G
Sbjct: 1 CDCDPGGSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDGPPG 45
Score = 38.8 bits (91), Expect = 6e-04
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 976 CDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGN 1019
CDC + + G CD TG+C +C+ + G C C G++G+
Sbjct: 1 CDC--DPGGSASGTCDPDTGQC-ECKPNVTGRRCDRCAPGYYGD 41
Score = 36.1 bits (84), Expect = 0.005
Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 7/50 (14%)
Query: 278 CSCYGHASRCLPISEADTKPDMVHGRCECTHNTQGLNCEKCIDFYNDLPW 327
C C S A D G+CEC N G C++C Y
Sbjct: 1 CDCDPGGS-------ASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGP 43
Score = 35.7 bits (83), Expect = 0.008
Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 15/54 (27%)
Query: 341 CNCN---NHSTKCHFDEAVYNSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDP 391
C+C+ + S C D G C+ C+ N GR C+RC +Y D
Sbjct: 1 CDCDPGGSASGTCDPD-----------TGQCE-CKPNVTGRRCDRCAPGYYGDG 42
Score = 34.2 bits (79), Expect = 0.029
Identities = 18/50 (36%), Positives = 21/50 (42%), Gaps = 8/50 (16%)
Query: 917 CPCPGTVESGLSYADTCQLDPRSQDVICECKEGYRGTRCDGCLDNYFGNP 966
C C G S + TC D CECK G RCD C Y+G+
Sbjct: 1 CDCD----PGGSASGTCDPD----TGQCECKPNVTGRRCDRCAPGYYGDG 42
>gnl|CDD|238012 cd00055, EGF_Lam, Laminin-type epidermal growth factor-like domain;
laminins are the major noncollagenous components of
basement membranes that mediate cell adhesion, growth
migration, and differentiation; the laminin-type
epidermal growth factor-like module occurs in tandem
arrays; the domain contains 4 disulfide bonds (loops
a-d) the first three resemble epidermal growth factor
(EGF); the number of copies of this domain in the
different forms of laminins is highly variable ranging
from 3 up to 22 copies.
Length = 50
Score = 65.8 bits (161), Expect = 2e-13
Identities = 28/49 (57%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 773 CDCDPTGSTSKYCDQLGGNCVCKPNVVGRRCDRCAPGTYGF--GPEGCK 819
CDC+ GS S CD G C CKPN GRRCDRCAPG YG GC+
Sbjct: 2 CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
Score = 61.2 bits (149), Expect = 9e-12
Identities = 20/44 (45%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 1078 PCDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNP 1121
PCDC+ GS+ +C+ TGQC+CK GR+CD+C ++G P
Sbjct: 1 PCDCNGHGSLS-GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLP 43
Score = 60.8 bits (148), Expect = 1e-11
Identities = 22/44 (50%), Positives = 30/44 (68%)
Query: 820 ACDCNHVGALDNICDVVSGQCKCRAQTYGRECDQCSPGYWNYPT 863
CDCN G+L CD +GQC+C+ T GR CD+C+PGY+ P+
Sbjct: 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPS 44
Score = 59.3 bits (144), Expect = 4e-11
Identities = 26/58 (44%), Positives = 32/58 (55%), Gaps = 8/58 (13%)
Query: 403 RCDCDPGGSLDDGICDPYTDDVLVSGKCHCKPNVGGRRCDMCKDGFWNFNETNPNGCE 460
CDC+ GSL G CDP T G+C CKPN GRRCD C G++ + GC+
Sbjct: 1 PCDCNGHGSLS-GQCDPGT------GQCECKPNTTGRRCDRCAPGYYGL-PSQGGGCQ 50
Score = 56.2 bits (136), Expect = 5e-10
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 461 PCTCNIQGTINSQGCNKDTGDCTCKRNVEGRDCNQCLPHFWGLSESEEYGCK 512
PC CN G+++ Q C+ TG C CK N GR C++C P ++GL GC+
Sbjct: 1 PCDCNGHGSLSGQ-CDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGG-GCQ 50
Score = 55.1 bits (133), Expect = 1e-09
Identities = 21/49 (42%), Positives = 23/49 (46%)
Query: 1127 PCDCDPSGSATSQCHQNNGTCVCHQGIGGVRCDTCARGYIGTAPDCSPC 1175
PCDC+ GS + QC G C C G RCD CA GY G C
Sbjct: 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGC 49
Score = 50.8 bits (122), Expect = 3e-08
Identities = 19/43 (44%), Positives = 24/43 (55%)
Query: 513 PCDCDPGASYDNNCDVISGQCKCKPHASGRTCSTPEQSYYSPP 555
PCDC+ S CD +GQC+CKP+ +GR C YY P
Sbjct: 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLP 43
Score = 48.5 bits (116), Expect = 3e-07
Identities = 23/51 (45%), Positives = 27/51 (52%), Gaps = 3/51 (5%)
Query: 1028 CTCNILGIDHSKGPCDRTTGQCPCLPNVIGVSCDRCLKNHWKIAS-GTGCE 1077
C CN G G CD TGQC C PN G CDRC ++ + S G GC+
Sbjct: 2 CDCNGHGSLS--GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
Score = 48.1 bits (115), Expect = 3e-07
Identities = 23/47 (48%), Positives = 26/47 (55%), Gaps = 4/47 (8%)
Query: 866 RCICNGHADL---CDSKTGTCISCRNSTDGENCERCINGYYGDPRLG 909
C CNGH L CD TG C C+ +T G C+RC GYYG P G
Sbjct: 1 PCDCNGHGSLSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQG 46
Score = 47.0 bits (112), Expect = 1e-06
Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 7/53 (13%)
Query: 277 SCSCYGHASRCLPISEADTKPDMVHGRCECTHNTQGLNCEKCIDFYNDLPWKP 329
C C GH S + D G+CEC NT G C++C Y LP +
Sbjct: 1 PCDCNGHGSL-------SGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
Score = 42.7 bits (101), Expect = 3e-05
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 975 PCDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGNALEQN 1024
PCDC N +L G CD TG+C +C+ +T G C C G++G +
Sbjct: 1 PCDC--NGHGSLSGQCDPGTGQC-ECKPNTTGRRCDRCAPGYYGLPSQGG 47
Score = 39.6 bits (93), Expect = 4e-04
Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 15/54 (27%)
Query: 341 CNCNNHSTK---CHFDEAVYNSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDP 391
C+CN H + C G C+ C+ NT GR C+RC +Y P
Sbjct: 2 CDCNGHGSLSGQCDPG-----------TGQCE-CKPNTTGRRCDRCAPGYYGLP 43
Score = 39.3 bits (92), Expect = 4e-04
Identities = 21/59 (35%), Positives = 25/59 (42%), Gaps = 9/59 (15%)
Query: 916 PCPCPGTVESGLSYADTCQLDPRSQDVICECKEGYRGTRCDGCLDNYFGNPQERGGSCE 974
PC C G S + C CECK G RCD C Y+G P +GG C+
Sbjct: 1 PCDCNGHG----SLSGQC----DPGTGQCECKPNTTGRRCDRCAPGYYGLP-SQGGGCQ 50
>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V). This
family is like pfam00008 but has 8 conserved cysteines
instead of six.
Length = 49
Score = 65.1 bits (159), Expect = 3e-13
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 773 CDCDPTGSTSKYCDQLGGNCVCKPNVVGRRCDRCAPGTYGFGPEGCKAC 821
CDC+P GS S CD G C+CKP V GR CDRC PG YG + + C
Sbjct: 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
Score = 64.3 bits (157), Expect = 6e-13
Identities = 27/56 (48%), Positives = 31/56 (55%), Gaps = 7/56 (12%)
Query: 404 CDCDPGGSLDDGICDPYTDDVLVSGKCHCKPNVGGRRCDMCKDGFWNFNETNPNGC 459
CDC+P GSL D CDP T G+C CKP V GR CD CK G++ GC
Sbjct: 1 CDCNPHGSLSDT-CDPET------GQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
Score = 61.2 bits (149), Expect = 7e-12
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 1079 CDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNPNVECFPC 1128
CDC+P GS+ C+ TGQC CK G GR CD+C+ ++G P+ C
Sbjct: 1 CDCNPHGSLS-DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
Score = 58.9 bits (143), Expect = 5e-11
Identities = 21/43 (48%), Positives = 28/43 (65%)
Query: 821 CDCNHVGALDNICDVVSGQCKCRAQTYGRECDQCSPGYWNYPT 863
CDCN G+L + CD +GQC C+ GR CD+C PGY+ P+
Sbjct: 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPS 43
Score = 55.1 bits (133), Expect = 1e-09
Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 462 CTCNIQGTINSQGCNKDTGDCTCKRNVEGRDCNQCLPHFWGLSESEEYGC 511
C CN G+++ C+ +TG C CK V GR C++C P ++GL GC
Sbjct: 1 CDCNPHGSLSDT-CDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
Score = 53.1 bits (128), Expect = 7e-09
Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 1128 CDCDPSGSATSQCHQNNGTCVCHQGIGGVRCDTCARGYIGTAPDCSPCGEC 1178
CDC+P GS + C G C+C G+ G CD C GY G D P C
Sbjct: 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSD--PGQGC 49
Score = 51.2 bits (123), Expect = 3e-08
Identities = 21/51 (41%), Positives = 23/51 (45%), Gaps = 4/51 (7%)
Query: 1028 CTCNILGIDHSKGPCDRTTGQCPCLPNVIGVSCDRCLKNHW--KIASGTGC 1076
C CN G CD TGQC C P V G CDRC ++ G GC
Sbjct: 1 CDCNPHGSLS--DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
Score = 43.9 bits (104), Expect = 1e-05
Identities = 20/43 (46%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
Query: 867 CICNGHADL---CDSKTGTCISCRNSTDGENCERCINGYYGDP 906
C CN H L CD +TG C C+ G +C+RC GYYG P
Sbjct: 1 CDCNPHGSLSDTCDPETGQC-LCKPGVTGRHCDRCKPGYYGLP 42
Score = 42.3 bits (100), Expect = 4e-05
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 514 CDCDPGASYDNNCDVISGQCKCKPHASGRTC 544
CDC+P S + CD +GQC CKP +GR C
Sbjct: 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHC 31
Score = 41.2 bits (97), Expect = 1e-04
Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 976 CDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGNA 1020
CDC+ + L CD TG+C C+ G HC CK G++G
Sbjct: 1 CDCNPHGSL--SDTCDPETGQC-LCKPGVTGRHCDRCKPGYYGLP 42
Score = 39.6 bits (93), Expect = 4e-04
Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 8/57 (14%)
Query: 917 CPCPGTVESGLSYADTCQLDPRSQDVICECKEGYRGTRCDGCLDNYFGNPQERGGSC 973
C C S +DTC + C CK G G CD C Y+G P + G C
Sbjct: 1 CDCNPHG----SLSDTC----DPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
Score = 38.9 bits (91), Expect = 6e-04
Identities = 16/55 (29%), Positives = 23/55 (41%), Gaps = 15/55 (27%)
Query: 341 CNCNNHSTK---CHFDEAVYNSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDPN 392
C+CN H + C G C C+ GR+C+RCK +Y P+
Sbjct: 1 CDCNPHGSLSDTCDP-----------ETGQCL-CKPGVTGRHCDRCKPGYYGLPS 43
Score = 36.6 bits (85), Expect = 0.004
Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 13/58 (22%)
Query: 278 CSCYGHAS---RCLPISEADTKPDMVHGRCECTHNTQGLNCEKCIDFYNDLPWKPAFG 332
C C H S C D G+C C G +C++C Y LP P G
Sbjct: 1 CDCNPHGSLSDTC----------DPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQG 48
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 74.3 bits (183), Expect = 5e-13
Identities = 115/636 (18%), Positives = 237/636 (37%), Gaps = 112/636 (17%)
Query: 1216 FELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRIN 1275
+E K + +V ++ + +++++ I E K L + IN
Sbjct: 160 YENAYKNLGEVIKEIKRRIERLEKF---IKRTENIEELIKEKE------KELEEVLREIN 210
Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK-----LKEQ 1330
L +L EE++KL K + L+E ++E E + + L+ L+E+
Sbjct: 211 EISSELPELREELEKLEKEVKELEELKEEIEELEKE---LESLEGSKRKLEEKIRELEER 267
Query: 1331 STETEKQINDAERQCKRTENL---------VTKTFPQFTQGQEENEKALQRLNEKITSLE 1381
E +K+I + E + K + L +++ + ++ E EK L RL E+I +E
Sbjct: 268 IEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327
Query: 1382 KNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEK 1441
+ + +L E L E + E +L E
Sbjct: 328 ERIKELEEKEERLE-------------------ELKK-KLKELEKRLEELEERHELYEEA 367
Query: 1442 ESKAEELRKHMIILSESNSAK-----DYAETSKKLINEKEAKAEEVFRGITTAKQESHAA 1496
++K EEL + L+ K + E +K+ I E+ +K + +E A
Sbjct: 368 KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKA 427
Query: 1497 NILAKEAY------------DHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRN 1544
K+A +H + + Y A I K+L E ++R
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHRKELLEE---YTAELKRIEKELKEIEEKERKLRKELRE 484
Query: 1545 ISGLAVN--------------KNIQS-----NPEQIRERANEINNVIKSLTDIDTILTET 1585
+ + K ++ N E++ ++A E + + L + +
Sbjct: 485 LEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSL 544
Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTE---------ARTAQDMAE----- 1631
+L K +LK+K + D + + +++ L E +++
Sbjct: 545 KKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEY 604
Query: 1632 VAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM 1691
+ ++ A+ ++ K+L ++ +LD A ++ ET +++ L++ L+ ++ + +
Sbjct: 605 LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKY--SEEEYE 662
Query: 1692 DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLS-KAESSGLKRARGQM-LLNKA-SQ 1748
++ E L +E+A + + L R +E L K E ++A+ ++ L KA +
Sbjct: 663 ELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALER 722
Query: 1749 LSV--NTTAKLKQLNETEAMFNSQE------SELTE 1776
+ K K L + A+ E ELTE
Sbjct: 723 VEELREKVKKYKALLKERALSKVGEIASEIFEELTE 758
Score = 51.6 bits (124), Expect = 4e-06
Identities = 111/586 (18%), Positives = 209/586 (35%), Gaps = 133/586 (22%)
Query: 1252 KQIMNNEDPNGVGKTLGNTTQRINLARLALNDL---TEEIKKLNKTGEMLKENATLLQEN 1308
+QI+ +D K LG + I L TE I++L K E KE +L+E
Sbjct: 152 RQILGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKE--KELEEVLREI 209
Query: 1309 NVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN-- 1366
N ++ + E +++ E++ K E + + + ++E
Sbjct: 210 N----------------EISSELPELREELEKLEKEVKELE----ELKEEIEELEKELES 249
Query: 1367 -EKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESN 1425
E + ++L EKI LE+ + +L + I + + E
Sbjct: 250 LEGSKRKLEEKIRELEERIEELKKEIEELE-----------------------EKVKELK 286
Query: 1426 SAKDYAETSKKLINEKESKAEELRKHMIILSE-SNSAKDYAETSKKLINEKEAKAEEVFR 1484
K+ AE KL E +ELR+ LS E K+L EKE + EE+ +
Sbjct: 287 ELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKEL-EEKEERLEELKK 345
Query: 1485 GITTAK------QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGAT 1538
+ + +E H AK + ++ + + + K+L+E A
Sbjct: 346 KLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL--TPEKLEKELEELEKA---- 399
Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTD------------------ID- 1579
+ I+ +I R E+ IK L +
Sbjct: 400 ------------KEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTE 447
Query: 1580 ----TILTETAGDLAK-ANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAI 1634
+L E +L + +LK + + ++++ VL + + E+A
Sbjct: 448 EHRKELLEEYTAELKRIEKELKEIEEKER----KLRKELRELEKVLKKESELIKLKELA- 502
Query: 1635 QTAKDDISAARKDLSQITNDLDDAQQKSNE-TNVKVKL---------LQERLKSIQAGFL 1684
+ + + L + +L++ ++K+ E +K KL L++ L+ ++ L
Sbjct: 503 ----EQLKELEEKLKKY--NLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEE--L 554
Query: 1685 QNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS-----GLKRARG 1739
+ + +E + LE+E+A K+ L E + L + E LK A
Sbjct: 555 KKKLAE--LEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEK 612
Query: 1740 QM--LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAE 1783
++ + +L ++L ETE EL EL K +E
Sbjct: 613 ELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE 658
Score = 40.4 bits (95), Expect = 0.011
Identities = 37/248 (14%), Positives = 97/248 (39%), Gaps = 36/248 (14%)
Query: 1542 IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANL 1601
+R I GL +N N ++ + EI I+ L + + +++
Sbjct: 151 VRQILGLDDYENAYKNLGEVIK---EIKRRIERLEKF----------IKRTENIEELIKE 197
Query: 1602 TKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQK 1661
+ + + +I L E R E+ + + K+L ++ ++++ +++
Sbjct: 198 KEKELEEVLREINEISSELPELR-----EEL------EKLEKEVKELEELKEEIEELEKE 246
Query: 1662 SNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQET 1721
+ L+E+++ ++ +++ E + LE++V K+ Y +
Sbjct: 247 LESLEGSKRKLEEKIRELE-------ERIEELKKEIEELEEKVK-ELKELKEKAEEYIKL 298
Query: 1722 DNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNI 1781
++ + + + L++ + ++K+L E E + +L EL K +
Sbjct: 299 -SEFYEEYLD---ELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRL 354
Query: 1782 AELQKRIQ 1789
EL++R +
Sbjct: 355 EELEERHE 362
Score = 37.4 bits (87), Expect = 0.087
Identities = 28/185 (15%), Positives = 79/185 (42%), Gaps = 14/185 (7%)
Query: 1211 AYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNT 1270
K+ LEKK+++++ + + +L +++++++++ E L +
Sbjct: 553 ELKKKLAELEKKLDELEEELAELLKELEELG--FESVEELEERLKELEPFYNEYLELKDA 610
Query: 1271 TQRIN--LARLAL--NDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK 1326
+ + L +L + ++L +T + L+E L+E + + E + + L+
Sbjct: 611 EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLE 670
Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPD 1386
L + ++ + E++ + + + K +E + ++ +++ LEK L
Sbjct: 671 LSRELAGLRAELEELEKRREEIKKTLEKL--------KEELEEREKAKKELEKLEKALER 722
Query: 1387 LNELI 1391
+ EL
Sbjct: 723 VEELR 727
Score = 35.4 bits (82), Expect = 0.33
Identities = 49/289 (16%), Positives = 105/289 (36%), Gaps = 48/289 (16%)
Query: 1517 YVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLT 1576
++ T NI + + E L +I IS ++ E E+ + K +
Sbjct: 184 FIKRTENIEELIKEKEKELEEVLREINEIS------------SELPELREELEKLEKEVK 231
Query: 1577 DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT 1636
+++ E ++ + +K + +KI + E R + E+
Sbjct: 232 ELE----ELKEEIEELEKELESLEGSKRKLE------EKIRE--LEERIEELKKEI---- 275
Query: 1637 AKDDISAARKDLSQITND----------LDDAQQKSNETNVKVKLLQERLKSIQAGFLQN 1686
+++ K+L ++ ++ + E ++ L+E + I+ ++
Sbjct: 276 --EELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEE-RIKE 332
Query: 1687 GRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKA 1746
+ E K KE+ +K+ L R++ + K E LK+ + K
Sbjct: 333 LEEKEERLEELKKKLKEL---EKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKL 389
Query: 1747 -SQLSVNTTAKL---KQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
+L AK +++++ A + E+ EL K I EL+K C
Sbjct: 390 EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKC 438
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 74.3 bits (183), Expect = 5e-13
Identities = 110/657 (16%), Positives = 229/657 (34%), Gaps = 137/657 (20%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
++ E L++++E ++A +E +L + + ++++++Q+ T R
Sbjct: 344 EKLEELKEELESLEAELEELEA---ELEELESRLEELEEQL---------------ETLR 385
Query: 1274 --INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
+ L + L EI++L E L++ LQ+ E +L +L+ +
Sbjct: 386 SKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELK---ELQAEL 442
Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
E E+++ + + + +R E + + + EE E+AL ++ L+ L L L
Sbjct: 443 EELEEELEELQEELERLEEALEELREEL----EEAEQALDAAERELAQLQARLDSLERLQ 498
Query: 1392 CDKRGDPC------DNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLIN--EKES 1443
+ G N G +G G LSE LI+ E
Sbjct: 499 ENLEGFSEGVKALLKNQSGLSGIL---------GVLSE-------------LISVDEGYE 536
Query: 1444 KAEE--LRKHM--IILSESNSAKDYAETSKK-----------------LINEKEAKAEEV 1482
A E L + +++ N+AK K+ I + + +
Sbjct: 537 AAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKN 596
Query: 1483 FRG--------ITTAKQESHA-----ANILAKEAYDHAASVRNKTAAYVASTSNITKQLD 1529
G + + A +L + D+A + K +T
Sbjct: 597 IEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPG---YRIVTLD-G 652
Query: 1530 EFLNAPGATL--ADIRNISGLAVNKNIQSNPEQIRERANEINN----VIKSLTDIDTILT 1583
+ + G + N S L + I+ E+I E +I + + +++ +
Sbjct: 653 DLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEE 712
Query: 1584 ETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV---------------------DVLTE 1622
E + +L R+ + + + V+++ + L E
Sbjct: 713 ELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEE 772
Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAG 1682
A AE I+ + I +++L + LD+ + + N + L+ERL+S++
Sbjct: 773 AEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR 832
Query: 1683 FLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQML 1742
R D+E + + L +++ + +E +L S+ E+ +RA +
Sbjct: 833 IAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEA 888
Query: 1743 LNKASQLSVNTTAKLK-----------QLNETEAMFNSQESELTELSKNIAELQKRI 1788
L + +L+ +L E E L L I LQ+R+
Sbjct: 889 LALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERL 945
Score = 72.4 bits (178), Expect = 2e-12
Identities = 96/566 (16%), Positives = 196/566 (34%), Gaps = 102/566 (18%)
Query: 1272 QRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
+ I + L L EI +L + ++L+E L+ E +L +L E+
Sbjct: 281 EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESK-LDELAEEL 339
Query: 1332 TETEKQINDAERQCKRTENLVTK---TFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN 1388
E E+++ + + + + E + + + EE E+ L+ L K+ LE + LN
Sbjct: 340 AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN 399
Query: 1389 ELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKDYAETSKKLINEKESKAEE 1447
+ + + ++L E E ++
Sbjct: 400 ------------------------------NEIERLEARLERLEDRRERLQQEIEELLKK 429
Query: 1448 LRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHA 1507
L + + ++ E ++ + E + + E + + ++E A A
Sbjct: 430 LEEAELKELQAEL-----EELEEELEELQEELERLEEALEELREELEEAEQALDAAEREL 484
Query: 1508 ASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIR----- 1562
A ++ + + + + + A ++N SGL+ + S E I
Sbjct: 485 AQLQARLDS-------LERLQENLEGFSEGVKALLKNQSGLSGILGVLS--ELISVDEGY 535
Query: 1563 ERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAG---ADSTKNTVQKIVDV 1619
E A I + + ++ E KA ++ L + DS K T +I
Sbjct: 536 EAA--IEAALGG--RLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGT--EIQGN 589
Query: 1620 LTEARTAQDMAEVAIQTAKDDISAA---RKDLS------QITNDLDDAQQKSNETNVKVK 1670
E E + AKD + RK LS + +DLD+A + + + +
Sbjct: 590 DREILKNI---EGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR 646
Query: 1671 LLQERLKSIQAGFLQNGRSA------MDVENEEKNLEKEVALAQ---------------- 1708
++ ++ G + G SA ++ E + LE+++ +
Sbjct: 647 IVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKE 706
Query: 1709 -----KQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET 1763
++ LR +E ++ + + A + L + +QLS T ++ E
Sbjct: 707 LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEEL 766
Query: 1764 EAMFNSQESELTELSKNIAELQKRIQ 1789
E E EL E I EL+ +I+
Sbjct: 767 EERLEEAEEELAEAEAEIEELEAQIE 792
Score = 58.9 bits (143), Expect = 3e-08
Identities = 81/414 (19%), Positives = 149/414 (35%), Gaps = 78/414 (18%)
Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQ------ENNVEGALELTRQAHD 1322
+ + R L +L EE+++L K E L + L+ E VE E Q
Sbjct: 695 ELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSK 754
Query: 1323 TSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEK 1382
+L+ + E E+++ +AE + E + + Q Q +EE L+ L E + L
Sbjct: 755 ELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEE----LKALREALDELRA 810
Query: 1383 NLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKL---IN 1439
L LNE + R A + L E ++ +E + L I
Sbjct: 811 ELTLLNEEAANLRERLESLERRIAATER------RLEDLEEQ--IEELSEDIESLAAEIE 862
Query: 1440 EKESKAEELRKHMIILS-ESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANI 1498
E E EEL + L E S ++ + + E + E+ + ++E
Sbjct: 863 ELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR- 921
Query: 1499 LAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNP 1558
+ A + + N+ ++L E + TL + A+ I+ +
Sbjct: 922 ------EKLAQLELRLEGLEVRIDNLQERLSEEYSL---TLEEA-----EALENKIEDDE 967
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLK-RKANLTKAGADSTKNTVQKIV 1617
E+ R R + N IK L ++ A + + +LK R LT Q
Sbjct: 968 EEARRRLKRLENKIKELGPVN----LAA--IEEYEELKERYDFLTA----------Q--- 1008
Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDD-AQQKSNETNVKVK 1670
K+D++ A++ L + ++D A+++ +T +V
Sbjct: 1009 --------------------KEDLTEAKETLEEAIEEIDREARERFKDTFDQVN 1042
Score = 58.5 bits (142), Expect = 3e-08
Identities = 58/295 (19%), Positives = 124/295 (42%), Gaps = 45/295 (15%)
Query: 1531 FLNAPGATLADIRNI---SGLA-----------VNKNIQSNPEQIR-------------E 1563
F+N L DI+++ +GL +++ I++ PE+ R E
Sbjct: 113 FINGQPCRLKDIQDLFLDTGLGKRSYSIIEQGKISEIIEAKPEERRAIFEEAAGISKYKE 172
Query: 1564 RANE----INNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGAD--STKNTVQK-- 1615
R E + ++L ++ IL E L L+R+A + + + ++
Sbjct: 173 RRKETERKLERTRENLDRLEDILNELERQL---KSLERQAEKAERYKELKAELRELELAL 229
Query: 1616 IVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQER 1675
+V L E R + ++++ A ++L ++T +L + ++K E ++V L+E
Sbjct: 230 LVLRLEELREELE-------ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEE 282
Query: 1676 LKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLK 1735
++ +Q +E +++ L + +A ++Q L ++ +E ++KL AE
Sbjct: 283 IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342
Query: 1736 RARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
+ + L + L +L E E+ E +L L +A+L+ +I S
Sbjct: 343 EEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIAS 397
Score = 56.2 bits (136), Expect = 2e-07
Identities = 104/604 (17%), Positives = 206/604 (34%), Gaps = 139/604 (23%)
Query: 1273 RINLARLALNDLTEEIKKLNKTGEMLKENATLLQE-----NNVEGALELTRQAHDTSLKL 1327
R NL RL D+ E+++ K+ E E A +E +E AL L + + L
Sbjct: 185 RENLDRL--EDILNELERQLKSLERQAEKAERYKELKAELRELELAL-LVLRLEE----L 237
Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQ-EENEKALQRLNEKITSLEKNLPD 1386
+E+ E ++++ +AE + + + + + + EE + L E+I L+K L
Sbjct: 238 REELEELQEELKEAEEELEELTAELQE-----LEEKLEELRLEVSELEEEIEELQKELYA 292
Query: 1387 LNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLIN----EKE 1442
L I E + + E E
Sbjct: 293 LANEISRL-----------------------EQQKQILRERLANLERQLEELEAQLEELE 329
Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
SK +EL + L+E + + + + + + E +E + +E
Sbjct: 330 SKLDELAE---ELAELEEKLEELKEELESLEAELEELEAE-------LEELESRLEELEE 379
Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLAD-IRNISGL---AVNKNIQSNP 1558
+ S + +AS +N ++L+ L + I L ++
Sbjct: 380 QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ 439
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKAN----DLKRKANLTKAGADSTKNTVQ 1614
++ E E+ + + L ++ L E +L +A +R+ +A DS + +
Sbjct: 440 AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE 499
Query: 1615 K--------------------IVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
I+ VL+E + + E AI+ A + + + + +
Sbjct: 500 NLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAA---LGGRLQAV--VVEN 554
Query: 1655 LDDAQQ-----KSNE---------TNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNL 1700
L+ A++ K NE ++K +Q + I A D+ + L
Sbjct: 555 LNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKL 614
Query: 1701 EK-------------EVALAQKQASGLRSRYQ--------------------ETDNKLLS 1727
K ++ A + A LR Y+ +T++ +L
Sbjct: 615 RKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILE 674
Query: 1728 KAESSGLKRARGQM--LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQ 1785
+ ++ ++ L K ++L K+L E E EL ELS+ I+ L+
Sbjct: 675 R--RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
Query: 1786 KRIQ 1789
K +
Sbjct: 733 KDLA 736
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
the myosin-like domain [Function unknown].
Length = 499
Score = 61.2 bits (148), Expect = 3e-09
Identities = 41/188 (21%), Positives = 82/188 (43%), Gaps = 17/188 (9%)
Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
A T ++ A +Q A+ + A R++L+ +L AQQ+ + + LQ RLK++
Sbjct: 97 KRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTL 156
Query: 1680 ---------QAGFLQNGRSAM-----DVENEEKNLEKEVALAQKQASGLRSR---YQETD 1722
QA LQ + + ++++ +L+ A +++A L +R Q
Sbjct: 157 AEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQART 216
Query: 1723 NKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIA 1782
+L +A ++ Q + SQ + A+ +Q+ E E E+ L + +A
Sbjct: 217 EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVA 276
Query: 1783 ELQKRIQS 1790
+L+ Q+
Sbjct: 277 QLEAYYQA 284
Score = 33.8 bits (77), Expect = 0.87
Identities = 45/257 (17%), Positives = 88/257 (34%), Gaps = 37/257 (14%)
Query: 1559 EQIRERANEIN-NVIKSL--------------TDIDTILTETAGDLAKANDLKRKANLTK 1603
E +R+ N+ K+ T + T+ + A+A L+ +
Sbjct: 119 EAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQ 178
Query: 1604 AGADSTKNTVQKI----VDVLTEARTAQDMAEVAIQTAKD--DISAARKDLSQITNDLD- 1656
A A K+ V + + EA+ A A ++ +AA + +Q D
Sbjct: 179 ASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA 238
Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRS 1716
QK+ + + + ++ER + +Q +E + LE+EVA +
Sbjct: 239 QISQKAQQIAARAEQIRERERQLQ-----------RLETAQARLEQEVAQLEAYYQAYVR 287
Query: 1717 RYQETDNKLLSKAESSGLKR-ARGQMLLNKASQL--SVNTTAKLKQLNETEAMFNSQESE 1773
Q+ + + +R A+ Q +QL S N A L +L + Q
Sbjct: 288 LRQQAAATQRGQVLAGAAQRVAQAQAQAQAQAQLLSSANRPAAL-RLRRSPRRGRRQRPV 346
Query: 1774 LTELSKNIAELQKRIQS 1790
++ ++ S
Sbjct: 347 TRHTTRRRRPATRQSPS 363
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is involved
in determining the site of synaptic vesicle fusion. The
C-terminus is a PDZ-binding motif that binds directly to
RIM (a small G protein Rab-3A effector). The family also
contains four coiled-coil domains.
Length = 774
Score = 59.6 bits (144), Expect = 1e-08
Identities = 106/654 (16%), Positives = 220/654 (33%), Gaps = 105/654 (16%)
Query: 1205 QTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVG 1264
+ SG T E + LEK + R + +L ++ T++ ++ ++
Sbjct: 95 KLEKSGNETGELQQLEKSEGQFSRIQAERERQAMELFLLRDTLEKLQLRM---------- 144
Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTS 1324
+ L T L+ EEIKKL + + +A +E++ L R+A D
Sbjct: 145 ERLQQT----------LDARDEEIKKLLEMLQSKGLSAKAEEEDS--ERLGRIREAEDQQ 192
Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNL 1384
L+ + EK+ + + + + E ++K + + KI EK L
Sbjct: 193 SHLEVLLEQKEKEHMMLREEIHQKLQM-----ERDDAKTEASQKLIDEKDTKIKEFEKML 247
Query: 1385 PDLNELI------CDKRGDPCDNLCGGAGCGKCGGMWCSNGT------LSESNSAKDYAE 1432
I CD + L + LS + +
Sbjct: 248 EKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQ 307
Query: 1433 TSKKLINEKESKAEELRKHMIILSESNSAKDYA---------------ETSKKLINEKEA 1477
T + ++ ++R+H+ L E + + E + +K A
Sbjct: 308 TE---LETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTA 364
Query: 1478 KAEEVFRGITTAKQESH-----------AANILAKEAYDHAASVRNKTAAY------VAS 1520
+ T E +L K+ + + R K + S
Sbjct: 365 SLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRS 424
Query: 1521 TSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQ------------IRERANEI 1568
T D L LA+ I K + E+ + E+
Sbjct: 425 LQTDTNT-DTALEKLEKALAEKERIIERL--KEQRDRDERYEQEEFETYKKEFEDLKEEV 481
Query: 1569 NNVIKSLTDIDTILTETAGDLAK--ANDLKRKANLTKAGA--DSTKNTVQKIVDVLTEAR 1624
N+ L++ + L +++K +N LK++++L +A + + +K+ L R
Sbjct: 482 QNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLR 541
Query: 1625 TAQ------DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLK- 1677
+ ++ D + AR ++ ++ + L+ A+Q+ ++T ++ L + L+
Sbjct: 542 ANPESADRGSAVDAGTSRSRADSAGARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEK 601
Query: 1678 -SIQAGFLQNGRSAMDVENEEKNLEK-EVALAQKQASGLRSRYQETDNKLLSKAESSGLK 1735
Q A +E E K+ E E A +++ +++ ++A +
Sbjct: 602 AQRHLTKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQSTQIGHAQAAAHNHI 661
Query: 1736 RARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
Q L + QL + QL +++ + EL AE +K ++
Sbjct: 662 EHHVQKLESDLKQLRAEREQLVAQLEKSQQSLMEFQQELNAAE---AERRKHLE 712
Score = 38.5 bits (89), Expect = 0.039
Identities = 55/301 (18%), Positives = 105/301 (34%), Gaps = 47/301 (15%)
Query: 1215 EFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRI 1274
EFE +K+ ED+K V+N + + L + + ++ +N+ QR
Sbjct: 466 EFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQL-----------KQRS 514
Query: 1275 NLARL--ALNDLTEEIKKLNKTGEMLKENATLLQ-----ENNVEGALELTRQAHDTSLKL 1327
+L R L + E+ +KL K + L+ N + + + A + +L
Sbjct: 515 DLERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRL 574
Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDL 1387
++ + E++ +D E + R + K T+ QE+ E K L + L
Sbjct: 575 LDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERL 634
Query: 1388 NELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEE 1447
+ ++ T A + + + ES ++
Sbjct: 635 EKSEAEEET-------------------LRQSTQIGHAQAAAHNHIEHHV-QKLESDLKQ 674
Query: 1448 LRKHM--IILSESNSAKDYAETSKKLIN---EKEAKAEEVFRGITTAKQESHAANILAKE 1502
LR ++ S + E ++L E+ EE KQE+ A I K+
Sbjct: 675 LRAEREQLVAQLEKSQQSLMEFQQELNAAEAERRKHLEENRE----LKQEALLAAISEKD 730
Query: 1503 A 1503
A
Sbjct: 731 A 731
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827). This
family consists of several plant proteins of unknown
function. Several sequences in this family are described
as being "myosin heavy chain-like".
Length = 484
Score = 57.7 bits (139), Expect = 4e-08
Identities = 95/473 (20%), Positives = 171/473 (36%), Gaps = 82/473 (17%)
Query: 1286 EEIKKLNKTGEMLKENATL--LQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAER 1343
EE KL K E+ E L L+E LE T++ + LKLK + E E+Q
Sbjct: 43 EEANKLKKELEV-AEKEKLQVLKE------LESTKRTVED-LKLKLEKAEKEEQ------ 88
Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLC 1403
Q K+ L + QG +E E R L+ +L ++ +
Sbjct: 89 QAKQDSELAKLRAEELEQGIQELEVE--RYITATAELDSVKEELRKIRQE---------- 136
Query: 1404 GGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSES----N 1459
+ + E ++A AE + E K EEL K +I + ES +
Sbjct: 137 -------------YDALVEERDAALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAH 183
Query: 1460 SAKDYAETSKKLINEK--------EAKAEEVFRGITTAKQESHAANILAK-EAYDHAASV 1510
+A AE + + E + +E + + KQ+ L K
Sbjct: 184 AAHLEAEEERIGAALEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELES 243
Query: 1511 RNKTAAYVASTSNITKQLDE---FLNAPGATLADIRNISG-LAVN-KNIQSNPEQIRERA 1565
K + +AS +++ K+L+E L +RN+ L + + E++RE+
Sbjct: 244 LQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKE 303
Query: 1566 NEINNVIKSLTDIDTILTETAGDLA---------KANDLKRKANLTKAGADSTKNTVQKI 1616
E SL A K +A + A++ + ++K+
Sbjct: 304 GEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKL 363
Query: 1617 VDVLTEARTAQDMAEVAIQTAKDDISAAR----KDLSQITNDLDDAQQKSNETNVKVKLL 1672
+ + + A + AE+ ++ A + AA+ L++I + + ++ K+ L
Sbjct: 364 KEEAEQTKAALETAELRLEAALKEAEAAKAAEALALAEIKALQESEESAKADSPRKITLS 423
Query: 1673 QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKL 1725
E + + + E E+ EK+VA A Q + E+ KL
Sbjct: 424 LEEYEEL----------SKKAEEAEELAEKKVAAALAQVEEAKESENESLKKL 466
Score = 46.1 bits (109), Expect = 1e-04
Identities = 63/370 (17%), Positives = 143/370 (38%), Gaps = 34/370 (9%)
Query: 1432 ETSKKLINEKESKAEELRKHMIILSESNSAK--DYAETSKKLINEKEAKAEEVFRGITTA 1489
ET L E+ A +L+K + + +E + E++K+ + + + K E+ + A
Sbjct: 35 ETELMLAQEE---ANKLKKELEV-AEKEKLQVLKELESTKRTVEDLKLKLEKAEKEEQQA 90
Query: 1490 KQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA 1549
KQ+S A + A+E + + + + K+ + L + R+ +
Sbjct: 91 KQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKR 150
Query: 1550 VNKNIQS---NPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGA 1606
+ I + N +++ E EI + +SL E + A K +
Sbjct: 151 AEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAA--LEKDQDRETYE 208
Query: 1607 DSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQ------ITNDLDDAQQ 1660
K +++ + + +D+ + + A ++ + +K++S + +L++A+
Sbjct: 209 KELKEAEKELERLKQDLDPEKDLEK--LAEASAELESLQKEISIMASVASVLKELEEAKA 266
Query: 1661 KSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQE 1720
+ + K L+ ++S++ ++E E+K LE+ L +K+ +
Sbjct: 267 NLEKAAEEEKSLRNLVESLK----------QELEEEKKELEE---LREKEGEAEEAASSL 313
Query: 1721 TDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET--EAMFNSQESELTELS 1778
+K+E R + + K Q S K+ E +E+E T+ +
Sbjct: 314 EAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAA 373
Query: 1779 KNIAELQKRI 1788
AEL+
Sbjct: 374 LETAELRLEA 383
Score = 43.4 bits (102), Expect = 0.001
Identities = 54/292 (18%), Positives = 98/292 (33%), Gaps = 44/292 (15%)
Query: 1211 AYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNT 1270
Y KE + EK++E +K ++ + L ++ ++K+I V K L
Sbjct: 206 TYEKELKEAEKELERLKQDLD-PEKDLEKLAEASAELESLQKEISIMASVASVLKELEEA 264
Query: 1271 TQRINLA-------RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDT 1323
+ A R + L +E+++ K E L+E +E EL R +
Sbjct: 265 KANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEK 324
Query: 1324 SLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKN 1383
K +E+ E ++ A + + + + +EE E+ L E
Sbjct: 325 ESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALET----AELR 380
Query: 1384 LPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKES 1443
L + + +A+ A K + E E
Sbjct: 381 LEAALKEA------------------------------EAAKAAEALALAEIKALQESEE 410
Query: 1444 --KAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQES 1493
KA+ RK + L E AE +++L +K A A ++ ES
Sbjct: 411 SAKADSPRKITLSLEEYEELSKKAEEAEELAEKKVAAALAQVEEAKESENES 462
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 57.6 bits (139), Expect = 5e-08
Identities = 108/648 (16%), Positives = 225/648 (34%), Gaps = 101/648 (15%)
Query: 1183 DSILRGLADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTV 1242
DS++ G Q K++ + I + ++ L+ I+ ++A + N L
Sbjct: 141 DSLISGDPAQRKKILDEILEINSLERN------YDKLKDVIDMLRAEISNIDYLEEKLKS 194
Query: 1243 MLTTIDDMKKQIMNNEDPNG-VGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLK-- 1299
++++KKQI ++E + K + + N A N+L + +L+ +M
Sbjct: 195 SNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY 254
Query: 1300 ---------ENATLLQENNVEGALE--LTRQAHDTSLKLKEQSTETEKQINDAERQCKRT 1348
+ + L++NN LE + +D K + + K ND E + +
Sbjct: 255 ESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQIL 314
Query: 1349 ENLVTKTFPQFTQGQEENEKA---LQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
N+ + K LQ+ + DLN I + G
Sbjct: 315 SNIDAEI-----NKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYE------- 362
Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYA 1465
+ NS E+ KK I E E + I +
Sbjct: 363 ----------------MDYNSYLKSIESLKKKIEEYSKNIERMSAF--ISEILKIQEIDP 404
Query: 1466 ETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNIT 1525
+ KK +NE K +++ +++ Q A E + + ++ V T+
Sbjct: 405 DAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVCPVCGTTLGE 464
Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET 1585
++ + +N + + E+IRE E+ ++ + + D+
Sbjct: 465 EKSNHIINHYNEKKSRLE---------------EKIREIEIEVKDIDEKIVDLKKRKEYL 509
Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAK----DDI 1641
+ + N + N + S + ++ I + E + D E K +D+
Sbjct: 510 ESE--EINKSINEYNKIE----SARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDL 563
Query: 1642 SAARKD----LSQITN-DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQ----NGRSAMD 1692
+ R L+ I+ D++ + +SNE ++ L+ RL+ I+ GF +S +
Sbjct: 564 DSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIRE 623
Query: 1693 VENEEKNLEKEVALAQ----------KQASGLRSRYQETDNKLLSKAESSGLKRARGQML 1742
+ENE NL + Q + + + E D+ + E + +R +
Sbjct: 624 IENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEIT----SRINDI 679
Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
+ + E+ + + ELS I ++ + ++S
Sbjct: 680 EDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLES 727
Score = 40.3 bits (94), Expect = 0.012
Identities = 86/536 (16%), Positives = 182/536 (33%), Gaps = 80/536 (14%)
Query: 1287 EIKKLNKTGEMLKENATLLQEN--NVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQ 1344
EI L + + LK+ +L+ N++ E + ++ +K+Q + EK + ++
Sbjct: 160 EINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKE 219
Query: 1345 CKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCG 1404
+R + + L L+ SLE I D
Sbjct: 220 IERLSIEYNNAMDDYNNLKSA----LNELS----SLEDMKNRYESEIKTAESD------- 264
Query: 1405 GAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE-SKAEELRKHMIILSESNSAKD 1463
+ L ++N K+ E K+IN+ + + ++ + K
Sbjct: 265 ------------LSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQ 312
Query: 1464 YAETSKKLINEKEA---KAEEVFRGITTAKQESHAANILAKEAYDHAASVR-------NK 1513
IN+ A K + Q+ + I K YD +
Sbjct: 313 ILSNIDAEINKYHAIIKKLSVL--------QKDYNDYIKKKSRYDDLNNQILELEGYEMD 364
Query: 1514 TAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIK 1573
+Y+ S ++ K+++E+ A I I + + +P+ I++ NEIN ++
Sbjct: 365 YNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQ-----EIDPDAIKKELNEINVKLQ 419
Query: 1574 SL-TDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
+ + + ++ ++L R + Q + V + E
Sbjct: 420 DISSKVSSLNQRIRALRENLDELSRNMEMLNG---------QSVCPV-----CGTTLGEE 465
Query: 1633 AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMD 1692
+ + + L + +++ + +E V +K +E L+S + N + ++
Sbjct: 466 KSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIE 525
Query: 1693 VENEE----KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
+ K E+ + +++RY+ + L +S L A + L
Sbjct: 526 SARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLN-ALAVISLIDIET 584
Query: 1749 LSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKN 1804
+ KQLN+ ES L E+ + + I I I +++NN N
Sbjct: 585 NRSRSNEIKKQLNDL-------ESRLQEIEIGFPDDKSYIDKSIREIENEANNLNN 633
Score = 31.0 bits (70), Expect = 6.7
Identities = 57/337 (16%), Positives = 118/337 (35%), Gaps = 54/337 (16%)
Query: 1212 YTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTT 1271
Y ++ LE+KI +++ V++ DL ++ + NE
Sbjct: 474 YNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLED 533
Query: 1272 QRINLARLA-----LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK 1326
+I + L ++ K L K ++ + + L V +++
Sbjct: 534 IKIKINELKDKHDKYEEIKNRYKSL-KLEDLDSKRTSWLNALAVISLIDIE--------T 584
Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLE--KNL 1384
+ +S E +KQ+ND E R + + F + +K+++ + + +L N
Sbjct: 585 NRSRSNEIKKQLNDLE---SRLQEIEI----GFPDDKSYIDKSIREIENEANNLNNKYNE 637
Query: 1385 PDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
N+++ +K DN + + D K I + +
Sbjct: 638 IQENKILIEKLRGKIDN-------------------YKKQIAEIDSIIPDLKEITSRIND 678
Query: 1445 AEE----LRKHM-IILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANIL 1499
E+ RK + + + E + INE + ++ + + K+ A L
Sbjct: 679 IEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDL 738
Query: 1500 --AKEAYDHA---ASVRNKTAAYVASTSNITKQLDEF 1531
+EA+D + A +R + + TS K L EF
Sbjct: 739 KRLREAFDKSGVPAMIRKSASQAM--TSLTRKYLFEF 773
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 55.8 bits (135), Expect = 2e-07
Identities = 109/646 (16%), Positives = 235/646 (36%), Gaps = 110/646 (17%)
Query: 1213 TKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQI--MNNEDPNGVGKTLGNT 1270
++ LE+++E + + + ++ +L +++ K+I + E+ V + +G
Sbjct: 243 ERQLASLEEELEKLTEEISELEKRLEEIEQLL---EELNKKIKDLGEEEQLRVKEKIGEL 299
Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKE--NATLLQENNVEGALELTRQAHDTSLK-- 1326
I ++ + E++ + L+ + L + +E +E + L
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIE-EERKRRDKLTEE 358
Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPD 1386
E E E + E V K F + ++ + L++L +I L++ L
Sbjct: 359 YAELKEELEDLRAELEE--------VDKEFAETRDELKDYREKLEKLKREINELKRELDR 410
Query: 1387 LNELICDKRGDPCD--NLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
L E + + D G ++ N ++ E I ++E K
Sbjct: 411 LQEELQRLSEELADLNAAIAGI--------------EAKINELEEEKEDKALEIKKQEWK 456
Query: 1445 AEELRKHMIILSESNSAK--DYAETSKKL--INEKEAKAEEVFRGITTAKQESHAANIL- 1499
E+L + + +Y K+L + + A+AE R + A +
Sbjct: 457 LEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVL 516
Query: 1500 ----------------AKEAYDHAASVR--NKTAAYVASTSNITKQLDEFL---NAPGAT 1538
E Y A V N+ V + K+ E L A AT
Sbjct: 517 KASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRAT 576
Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDI----------------DTIL 1582
+ + ++ + + + E + VI D+ DT++
Sbjct: 577 FLPLNKM------RDERRDLSILSE-----DGVIGFAVDLVEFDPKYEPAFKYVFGDTLV 625
Query: 1583 TETAGDLAK-----------ANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
E + A+ +L K+ G+ + + I+ +E Q + E
Sbjct: 626 VEDI-EAARRLMGKYRMVTLEGELFEKSGAMTGGSRAPRG---GILFSRSEPAELQRLRE 681
Query: 1632 VAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM 1691
++ K ++S+ + +L +I N LD+ Q+ ++ + K+ +++ ++ ++ +
Sbjct: 682 -RLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLE 740
Query: 1692 DVENEEKNLEKEVALAQKQASGLRSR---YQETDNKL---LSKAESSGLKRARGQMLLNK 1745
++E + +LE+E+ + + L +R +E +KL L+ E+ L +R + +
Sbjct: 741 ELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEAR-LSHSRIPEIQAE 799
Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
S+L + +L E E N E L K I ELQ++
Sbjct: 800 LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL 845
Score = 55.8 bits (135), Expect = 2e-07
Identities = 64/384 (16%), Positives = 122/384 (31%), Gaps = 75/384 (19%)
Query: 1423 ESNSAKDYAETSKKLINEKESKAEEL-RKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
E +S + + ++E + + RK I E + E K+ + E E
Sbjct: 689 ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748
Query: 1482 VFRGITTAKQE--SHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATL 1539
+ + I K E A I E H +L+E LN A L
Sbjct: 749 LEQEIENVKSELKELEARIEELEEDLH--------------------KLEEALNDLEARL 788
Query: 1540 AD--IRNISGL--AVNKNIQSNPEQIRERANEINNVIKSLTDIDTIL-TETAGDLAKAND 1594
+ I I + + + I R EI + LT L E + D
Sbjct: 789 SHSRIPEIQAELSKLEEEVSR----IEARLREIEQKLNRLTLEKEYLEKEIQELQEQRID 844
Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTA-QDMAEVAIQTAKD------DISAARKD 1647
LK + + ++ +++ + L E A +D+ K+ + +
Sbjct: 845 LKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERK 904
Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALA 1707
+ ++ ++ +++ +E K++ L+E L I+ + D E E+ L E A
Sbjct: 905 IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGE------DEEIPEEELSLEDVQA 958
Query: 1708 QKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMF 1767
+ Q R E N A Q + E
Sbjct: 959 ELQRVEEEIRALEPVN--------------------MLAIQ----------EYEEVLKRL 988
Query: 1768 NSQESELTELSKNIAELQKRIQSC 1791
+ + + +L + + +RI+
Sbjct: 989 DELKEKRAKLEEERKAILERIEEY 1012
Score = 55.5 bits (134), Expect = 3e-07
Identities = 63/375 (16%), Positives = 143/375 (38%), Gaps = 71/375 (18%)
Query: 1432 ETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQ 1491
E ++ I + +E R+ + ++L E+E KAE ++ + K+
Sbjct: 180 EEVEENIERLDLIIDEKRQQL----------------ERLRRERE-KAER-YQALLKEKR 221
Query: 1492 ESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADI-RNISGLAV 1550
E E Y+ A I +QL L + IS L
Sbjct: 222 EY--------EGYELLKEKE----ALERQKEAIERQLASL----EEELEKLTEEISEL-- 263
Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTD-----IDTILTETAGDLAKA-----------ND 1594
K + E+I + E+N IK L + + + E ++A D
Sbjct: 264 EKRL----EEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELED 319
Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
+ + +A D ++++ + E R +D K+++ R +L ++ +
Sbjct: 320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379
Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGL 1714
+ + + + K++ L+ + ++ R ++ E + L +E+A +G+
Sbjct: 380 FAETRDELKDYREKLEKLKREINELK-------RELDRLQEELQRLSEELADLNAAIAGI 432
Query: 1715 RSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESEL 1774
++ E + + KA +K+ K QL+ + + ++L + + ++ E EL
Sbjct: 433 EAKINELEEEKEDKALE--IKKQEW-----KLEQLAADLSKYEQELYDLKEEYDRVEKEL 485
Query: 1775 TELSKNIAELQKRIQ 1789
++L + +AE + + +
Sbjct: 486 SKLQRELAEAEAQAR 500
Score = 48.1 bits (115), Expect = 5e-05
Identities = 75/471 (15%), Positives = 143/471 (30%), Gaps = 140/471 (29%)
Query: 1267 LGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE-NNVEGALELTR-QAHDTS 1324
L + I L +++ +++++L + E + LL+E EG L +A +
Sbjct: 179 LEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQ 238
Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNL 1384
+ E+Q+ E + E+ + + L +++ +E+ L
Sbjct: 239 KE------AIERQLASLEEE------------------LEKLTEEISELEKRLEEIEQLL 274
Query: 1385 PDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
+LN+ I D + E K+ I E E++
Sbjct: 275 EELNKKIKDLGEE-------------------------EQL-------RVKEKIGELEAE 302
Query: 1445 AEELRKHMIILSESNS-AKDYAETSKKL---INEKEAKAEEVFRGITTAKQESHAANILA 1500
L + ++E +D E KL I++ A+ EE+ R I ++
Sbjct: 303 IASLER---SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKL---- 355
Query: 1501 KEAYDHAASVRNKT---AAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSN 1557
E Y V T+ DE K+ +
Sbjct: 356 TEEYAELKEELEDLRAELEEVDKEFAETR--DEL--------------------KDYREK 393
Query: 1558 PEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV 1617
E+++ NE+ + L + L+E DL + KI
Sbjct: 394 LEKLKREINELKRELDRLQEELQRLSEELADLN--AAIAGI--------------EAKIN 437
Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLK 1677
++ E +I L Q+ DL +Q+ + + +++ L
Sbjct: 438 ELEEEKEDKAL-----------EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELS 486
Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSK 1728
+Q E A AQ +AS R R ++L
Sbjct: 487 KLQ-------------------RELAEAEAQARASEERVRGGRAVEEVLKA 518
Score = 38.1 bits (89), Expect = 0.054
Identities = 51/310 (16%), Positives = 105/310 (33%), Gaps = 56/310 (18%)
Query: 1209 SGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLG 1268
SGA T I ++ R ++ +K+++ + + L
Sbjct: 652 SGAMTGGSRAPRGGILFSRSEPAELQRLRER-------LEGLKREL------SSLQSELR 698
Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE--NNVE---GALELTRQAHDT 1323
R++ L+D + +I ++ K E L++ L+E +E +LE + +
Sbjct: 699 RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKS 758
Query: 1324 SLK--------LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNE 1375
LK L+E + E+ +ND E + + P+ + E+ + R+
Sbjct: 759 ELKELEARIEELEEDLHKLEEALNDLEARLSHSR------IPEIQAELSKLEEEVSRIEA 812
Query: 1376 KITSLEKNLPDLN---ELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAE 1432
++ +E+ L L E + + + + + S + E
Sbjct: 813 RLREIEQKLNRLTLEKEYLEKEIQELQEQR---------------IDLKEQIKSIEKEIE 857
Query: 1433 TSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQE 1492
E E + EEL + L + K+ +E EA+ E+ R I + +
Sbjct: 858 NLNGKKEELEEELEELEAALRDLESR-----LGDLKKE-RDELEAQLRELERKIEELEAQ 911
Query: 1493 SHAANILAKE 1502
E
Sbjct: 912 IEKKRKRLSE 921
Score = 37.4 bits (87), Expect = 0.094
Identities = 49/305 (16%), Positives = 105/305 (34%), Gaps = 64/305 (20%)
Query: 1215 EFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRI 1274
E L K+ E ++ E R L ++ + ++I E +R+
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLE---EELEKLTEEISELE-------------KRL 267
Query: 1275 NLARLALNDLTEEIKKLNKTGEMLKENATLLQ--------ENNVEGALELTRQAHDTSLK 1326
L +L ++IK L + E L+ + + E ++ A + K
Sbjct: 268 EEIEQLLEELNKKIKDLGE-EEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAK 326
Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNE----------K 1376
L+ + + +I + ER+ + K ++ + +EE E L E +
Sbjct: 327 LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386
Query: 1377 ITSLEKNLPDL----NELI--CDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKD 1429
+ + L L NEL D+ + L + L++ + +
Sbjct: 387 LKDYREKLEKLKREINELKRELDRLQEELQRL---------------SEELADLNAAIAG 431
Query: 1430 YAETSKKLINEKESKAEELRKHMIILSESN--SAKDYAETSKKLINEKEAKAEEVFRGIT 1487
+L EKE KA E++K E ++ + + + + + V + ++
Sbjct: 432 IEAKINELEEEKEDKALEIKK-----QEWKLEQLAADLSKYEQELYDLKEEYDRVEKELS 486
Query: 1488 TAKQE 1492
++E
Sbjct: 487 KLQRE 491
Score = 35.0 bits (81), Expect = 0.45
Identities = 40/181 (22%), Positives = 66/181 (36%), Gaps = 28/181 (15%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
KE E L K E+++ +E + DL L D+KK+ E L ++
Sbjct: 854 KEIENLNGKKEELEEELEELEAALRDLESRLG---DLKKERDELE------AQLRELERK 904
Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
I + + + +L E L+E + +++ E E+ E
Sbjct: 905 IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED----------------EEIPE 948
Query: 1334 TEKQINDAERQCKRTENLVTKTFP---QFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
E + D + + +R E + P Q EE K L L EK LE+ + E
Sbjct: 949 EELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILER 1008
Query: 1391 I 1391
I
Sbjct: 1009 I 1009
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 55.3 bits (134), Expect = 2e-07
Identities = 101/572 (17%), Positives = 205/572 (35%), Gaps = 154/572 (26%)
Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
++ L L +N+ ++KKLN TGE + E RQ D +
Sbjct: 32 ERKEELMNLPVNEELSKVKKLNLTGESEET-------------FEEWRQKWD------DI 72
Query: 1331 ST----ETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKN--- 1383
T + E+ + +AE F +F + + KA++ + E + ++E++
Sbjct: 73 VTNSLPDIEELLFEAEEL--------NDKF-RFLKAK----KAIKEIEELLDTIEEDIEQ 119
Query: 1384 -LPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE 1442
L +LNEL ES E ++K + E +
Sbjct: 120 ILEELNELK-------------------------------ESE------EKNRKEVEELK 142
Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
K ELRK +L++S S Y +L E + +E+ + + + + L E
Sbjct: 143 DKYRELRKT--LLAKSFS---YGPALDEL----EKQLDELEEEFEQFVELTESGDYL--E 191
Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA---------VNKN 1553
A + + +T A I L E N L +++ G + +
Sbjct: 192 AREVLLKLEEETDALEQKMEEIPPLLKELQNEFPDQLEELKA--GYREMTEEGYHFDHLD 249
Query: 1554 IQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTV 1613
I+ + ++E+ ++ +++ L DL +A + + + +
Sbjct: 250 IEKELQDLKEQIDQNLALLEEL------------DLDEAEE----------ENEEIEERI 287
Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQK---SNETNVKVK 1670
+ D+L + A+ E I D + AR+ Q+ +LD QQ + + V+
Sbjct: 288 DTLYDILEKEVKAKKFVEKNIDKLTDFLEHAREQNKQLLLELDRLQQSYTLNEDELETVR 347
Query: 1671 LLQERLKSIQAGFLQNGRSAMDVENEEK---NLEKEVALAQKQASGLRSRYQETDNKL-- 1725
L+++L+ ++A + Q + ++ L++E+ +KQ + +E L
Sbjct: 348 ELEKQLEELEAQYDQ---LVERIAEKKVAYSELQEELEEIEKQLEEIEKEQEELSESLQG 404
Query: 1726 LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLN------ETEAMFNSQESELTELSK 1779
L K E AR + L +L L++ N + F E+ L+
Sbjct: 405 LRKDE----LEAR-EKLQEYRQKLH-EIKRYLEKSNLPGLPEDYLEYFFVVSDEIEALAD 458
Query: 1780 -------NIAELQKR---IQSCINFIVDKSNN 1801
N+ + ++ + + +K+
Sbjct: 459 ELNEVPINMDAVNRQLEEATDDVETLKEKTEE 490
Score = 47.6 bits (114), Expect = 6e-05
Identities = 98/550 (17%), Positives = 188/550 (34%), Gaps = 171/550 (31%)
Query: 1217 ELLEKKIEDVKA----LVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ 1272
+ +E+ IE + L E+ ++ ++ + +++K ++ + G
Sbjct: 111 DTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAKSF------SYGPALD 164
Query: 1273 RINLARLA-LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
+ +L L + E+ +L ++G+ L +A + LKL+E++
Sbjct: 165 ELE-KQLDELEEEFEQFVELTESGDYL--------------------EAREVLLKLEEET 203
Query: 1332 TETEKQINDAERQCKRTENLVTKTFP-QFT-------QGQEEN--------EKALQRLNE 1375
E+++ + K +N FP Q + EE EK LQ L E
Sbjct: 204 DALEQKMEEIPPLLKELQN----EFPDQLEELKAGYREMTEEGYHFDHLDIEKELQDLKE 259
Query: 1376 KITSLEKNLPDL--------NELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSA 1427
+I L +L NE I ++ D L L + A
Sbjct: 260 QIDQNLALLEELDLDEAEEENEEIEER----IDTL---------------YDILEKEVKA 300
Query: 1428 KDYAETSKKLINEKESKAEELRKHMII----LSESNSAKDYAETSKKLINEKEAKAEEVF 1483
K + E + + + A E K +++ L +S Y +NE E E
Sbjct: 301 KKFVEKNIDKLTDFLEHAREQNKQLLLELDRLQQS-----Y------TLNEDEL---ETV 346
Query: 1484 RGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIR 1543
R + KQ L + + K AY + +++++ L
Sbjct: 347 RELE--KQLEE----LEAQYDQLVERIAEKKVAY-SELQEELEEIEKQL----------- 388
Query: 1544 NISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA-NDLKR---KA 1599
E+I + E++ ++ L + E + + +++KR K+
Sbjct: 389 ---------------EEIEKEQEELSESLQGLRKDELEAREKLQEYRQKLHEIKRYLEKS 433
Query: 1600 NLTKAGAD------STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
NL D + ++ + D L E +M V + L + T+
Sbjct: 434 NLPGLPEDYLEYFFVVSDEIEALADELNEVPI--NMDAVN------------RQLEEATD 479
Query: 1654 DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
D++ ++K+ E L E+L IQ N ++ +EVA A K+A
Sbjct: 480 DVETLKEKTEELVDNA-TLAEQL--IQYA------------NRYRSDNEEVAEALKEAER 524
Query: 1714 L--RSRYQET 1721
L Y+E
Sbjct: 525 LFENYDYKEA 534
Score = 36.0 bits (84), Expect = 0.22
Identities = 39/178 (21%), Positives = 80/178 (44%), Gaps = 31/178 (17%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
KE + L+++I+ AL+E DL ++++++I TL + ++
Sbjct: 252 KELQDLKEQIDQNLALLEEL-----DLDEAEEENEEIEERI----------DTLYDILEK 296
Query: 1274 INLARLALNDLTEEIKKL-NKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQST 1332
A+ + + I KL + ++N LL E L+ +Q++ + E
Sbjct: 297 EVKAK---KFVEKNIDKLTDFLEHAREQNKQLLLE------LDRLQQSYTLNEDELETVR 347
Query: 1333 ETEKQINDAERQCKR-TENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNE 1389
E EKQ+ + E Q + E + K +++ QEE L+ + +++ +EK +L+E
Sbjct: 348 ELEKQLEELEAQYDQLVERIAEKKVA-YSELQEE----LEEIEKQLEEIEKEQEELSE 400
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino acids
in length. This family contains a P-loop motif suggesting
it is a nucleotide binding protein. It may be involved in
replication.
Length = 1198
Score = 55.1 bits (133), Expect = 4e-07
Identities = 34/168 (20%), Positives = 63/168 (37%), Gaps = 28/168 (16%)
Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVK---LLQERLK 1677
TE R AE A+Q+A A + L Q +L++ ++ E +K L +RL+
Sbjct: 603 TELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQ 662
Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRA 1737
+ Q ++ + LE +A ++QA + +L A+ L
Sbjct: 663 NEQ-------------QSLKDKLELAIAERKQQA-------ETQLRQL--DAQLKQLLEQ 700
Query: 1738 RGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQ 1785
+ L T +L + E +++L +LS I +
Sbjct: 701 QQAFLEALKDDFRELRTERLAKWQVVEG---ELDNQLAQLSAAIEAAR 745
Score = 48.9 bits (117), Expect = 3e-05
Identities = 91/552 (16%), Positives = 186/552 (33%), Gaps = 106/552 (19%)
Query: 1282 NDLTEEIKKLNKTGEMLKE--NATLLQENNVEGALE--LTRQAHDTSLKLKEQSTETEKQ 1337
L +++K N+ ++E + ALE L +Q L+ E+ E E +
Sbjct: 391 EQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELR 450
Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN--ELICDKR 1395
+ +++ + T T P+ + E N++AL++ E+ E N+ L KR
Sbjct: 451 LGRLKQR----LDSATAT-PEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKR 505
Query: 1396 GDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIIL 1455
D A L + A E +L + S LR
Sbjct: 506 RDEALEALQRA-----------ERRLLQLRQA--LDELELQLSPQAGSLLHFLRNE---- 548
Query: 1456 SESNSAKDYAETSKKLINEK------------EAKAEEVFRGITTAKQESHAANILAKEA 1503
A + E+ K+I+ + E + G++ L +
Sbjct: 549 -----APGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVS-----------LDLQR 592
Query: 1504 YDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRE 1563
D N+T +L E L L AV K + EQ+ +
Sbjct: 593 LDVPDYAANET------------ELRERLQQAEEALQS-------AVAK-QKQAEEQLVQ 632
Query: 1564 RANEINNVIKSLTDIDTILTETAGDLAKANDLKR--KANLTKAGADSTKNTVQKIVDVLT 1621
E+ ++ + T L + DL + + ++ K L A A+ + ++ +
Sbjct: 633 ANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDA 692
Query: 1622 EARTAQDMAEVAIQTAKDDISAARKDLSQ----ITNDLDD------AQQKSNETNVKVKL 1671
+ + + + ++ KDD R + + +LD+ A ++ T K +L
Sbjct: 693 QLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARL 752
Query: 1672 LQERLKSIQAGFLQN----GRSAMDVENEEKNLEKEVALAQKQASGLRSR---YQETDNK 1724
+ LK L + + +++ + + LE + + +R QET
Sbjct: 753 --KELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLH 810
Query: 1725 LLS--------KAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTE 1776
S + L+ + ++ + T + K+L + Q +L E
Sbjct: 811 RDSLREERPNLAIQLRELESSAEELQQELTRLIKD-TKLRRKKLEQERKALEKQLDQLDE 869
Query: 1777 LSKNIAELQKRI 1788
L + + + +++
Sbjct: 870 LLRGLRDEMRQL 881
Score = 43.9 bits (104), Expect = 0.001
Identities = 31/204 (15%), Positives = 71/204 (34%), Gaps = 25/204 (12%)
Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
L+ + + S ++ + L +A + + A A+ + AR DL ++ N
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQN- 663
Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGL 1714
++ + + + ER + + ++ + K L ++ + L
Sbjct: 664 -----EQQSLKDKLELAIAERKQQAE-------TQLRQLDAQLKQLLEQQQAFLEA---L 708
Query: 1715 RSRYQETDNKLLSK-AESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNS---- 1769
+ ++E + L+K G + L A+LK+L +
Sbjct: 709 KDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELK--KQYDRELASL 766
Query: 1770 --QESELTELSKNIAELQKRIQSC 1791
+ + EL + I EL+ I+
Sbjct: 767 DVDPNTVKELKRQIEELETTIERI 790
Score = 39.7 bits (93), Expect = 0.018
Identities = 46/291 (15%), Positives = 106/291 (36%), Gaps = 58/291 (19%)
Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRK 1598
+ADI+ + + +++ ++ + L ++ L G+L D +R
Sbjct: 232 IADIQALRAI----------QKVAPEIEKLQEDFEQLLSLELRLQHLHGELV--ADEERL 279
Query: 1599 ANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLD-- 1656
A + ++ Q + RT +D + A ++SAA L+ ++L+
Sbjct: 280 AEEQEERQEAKNRLRQ-------QLRTLEDQLKEARDELNQELSAANAKLAADRSELELL 332
Query: 1657 ----------DAQQKS-------------NETNVKVKLLQERLKSIQAGFLQNGRSAMDV 1693
D +Q E ++ L + + +Q + + + +
Sbjct: 333 EDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERL-KQKIKE 391
Query: 1694 ENE---EKNLEKEVALAQKQASGL---RSRYQETDNKLLSKAESSGLKRARGQML----L 1743
+ E EKN E+ A+ +++ Q +++L + E+ L+ + L
Sbjct: 392 QLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRL 451
Query: 1744 NKASQLSVNTTA---KLKQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
+ Q + TA +L+QL + + E + N+ +LQ ++
Sbjct: 452 GRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQL 502
Score = 32.4 bits (74), Expect = 2.7
Identities = 32/179 (17%), Positives = 65/179 (36%), Gaps = 31/179 (17%)
Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNE----DPNGVGKTLGNTTQRI 1274
L+ ++ + A +E R+ + ++KKQ + E D +
Sbjct: 730 LDNQLAQLSAAIEAA-RTQAK-----ARLKELKKQ-YDRELASLDVDP-----------N 771
Query: 1275 NLARLA--LNDLTEEIKKLNKTGEMLKENATLLQENNV-EGALELTRQAHDTSLKLKEQS 1331
+ L + +L I+++ ++E +QE + +L R L Q
Sbjct: 772 TVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPN------LAIQL 825
Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
E E + +++ R + Q ++ EK L +L+E + L + L EL
Sbjct: 826 RELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQLAEL 884
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile protein
found in all eukaryote cell types. This family consists
of the coiled-coil myosin heavy chain tail region. The
coiled-coil is composed of the tail from two molecules of
myosin. These can then assemble into the macromolecular
thick filament. The coiled-coil region provides the
structural backbone the thick filament.
Length = 859
Score = 53.5 bits (129), Expect = 9e-07
Identities = 125/657 (19%), Positives = 245/657 (37%), Gaps = 155/657 (23%)
Query: 1208 ASGAYTKEFELLEKK---IEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVG 1264
A GA + EL +K+ + ++ +E + + ++K+ + +
Sbjct: 86 AGGATAAQIELNKKREAELAKLRKDLEEANLQHEE------ALATLRKK--HQD------ 131
Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKT--------GEMLKEN----ATLLQENNVEG 1312
A+N+L+E+I++L K ++ E A L Q +
Sbjct: 132 ---------------AINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKL 176
Query: 1313 ALELTRQAHDTSL-KLKEQSTETEKQINDAERQCKR--TENLVTKTFPQFTQGQEENEKA 1369
E + ++ L +L+ + E ++Q+ND Q R +EN T+ EE E
Sbjct: 177 NAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENS------DLTRQLEEAEAQ 230
Query: 1370 LQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSES--NSA 1427
+ L++ + LE L + + ++ + L
Sbjct: 231 VSNLSKLKSQLESQLEEAKRSLEEESRE--------------------RANLQAQLRQLE 270
Query: 1428 KDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEV--FRG 1485
D ++L E E+KAE R+ LS++N+ + + K +E +AEE+ +
Sbjct: 271 HDLDSLREQLEEESEAKAELERQ----LSKANA--EIQQWRSKFESEGALRAEELEELKK 324
Query: 1486 ITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLD----EFLNAPGATLAD 1541
K E + A + K + + S + +L+ E A A
Sbjct: 325 KLNQKIS---------ELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASEL 375
Query: 1542 IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA----NDLKR 1597
+ +K + ++ E E++ + ++ T L +L + L+R
Sbjct: 376 EKKQKNF--DKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALRR 433
Query: 1598 K--------ANLTKAGADSTKNT--VQKIVDVL----TEARTAQDMAEVAIQTAKDDISA 1643
+ +LT + +N ++K L E + A + AE A++ + +
Sbjct: 434 ENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESKVLR 493
Query: 1644 ARKDLSQITNDLD-DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEK 1702
A+ +LSQI ++++ +K E K Q ++S+QA + + +K LE
Sbjct: 494 AQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEG 553
Query: 1703 -----EVAL---------AQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLN---- 1744
E+AL AQK +YQ+ +L ++ E +RAR
Sbjct: 554 DINELEIALDHANKANAEAQKNV----KKYQQQVKELQTQVEEE--QRAREDAREQLAVA 607
Query: 1745 --KASQLSVNTTAKLKQLNETEAMFNSQ------ESELTELSKNIAELQKRIQSCIN 1793
+A+ L A+L++L A+ ++ E+EL E S+ + EL + S I
Sbjct: 608 ERRATALE----AELEEL--RSALEQAERARKQAETELAEASERVNELTAQNSSLIA 658
Score = 43.5 bits (103), Expect = 0.001
Identities = 111/563 (19%), Positives = 195/563 (34%), Gaps = 113/563 (20%)
Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEK 1336
A A ++L ++ K +K K LQ ++ A R +LK + E +
Sbjct: 368 ANAAASELEKKQKNFDKILAEWKRKVDELQAE-LDTAQREARNLSTELFRLKNELEELKD 426
Query: 1337 QINDAERQCKRTENLVTKTFPQFTQGQE---ENEKALQRLNEKITSLEKNLPDLNELICD 1393
Q+ R+ K ++ + Q +G E EKA +RL + L+ L + +
Sbjct: 427 QVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALEL 486
Query: 1394 KRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMI 1453
+ S ++ ++ +E ++L EKE + E RK+
Sbjct: 487 EE---------------------SKVLRAQVELSQIRSEIERRL-AEKEEEFENTRKNHQ 524
Query: 1454 ILSESNSAKDYAET---------SKKL---INEKEAKAEEVFRGITTAKQESHAANILAK 1501
ES A AE KKL INE E + + A++ K
Sbjct: 525 RAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHANKANAEAQKNVKKYQQQVK 584
Query: 1502 EAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA--VNKNIQSNPE 1559
E + + ++ A A L ++R+ A K ++
Sbjct: 585 ELQTQVEEEQRAREDAREQLAVAERR----ATALEAELEELRSALEQAERARKQAETELA 640
Query: 1560 QIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRK--ANLTKAGADSTKNTVQKIV 1617
+ ER NE+ SL KRK L +D +
Sbjct: 641 EASERVNELTAQNSSLIAQ-----------------KRKLEGELAALQSD--------LD 675
Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKS--NETNVK-----VK 1670
+ + E + A++ A+ A D +++ +L Q+ S E K VK
Sbjct: 676 EAVNELKAAEERAKKAQA-----------DAARLAEELRQEQEHSQHLERLRKQLESQVK 724
Query: 1671 LLQERLKSIQAGFLQNGRSAMDV-ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
LQ RL +A L+ G+ + E + LE E+ Q+ R+ ET K L K
Sbjct: 725 ELQVRLDEAEAAALKGGKKMIQKLEARVRELEAELDGEQR-------RHAET-QKNLRKM 776
Query: 1730 E------SSGLKRARGQMLLNKASQLSVNTTAKLK----QLNETEAMFNSQESELTELSK 1779
E ++ + + + L AK+K QL E E + S+ + +
Sbjct: 777 ERRVKELQFQVEEDKKNLERLQ--DLVDKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQR 834
Query: 1780 NIAELQKR---IQSCINFIVDKS 1799
+ + ++R + +N + KS
Sbjct: 835 ELEDAEERADTAERSLNKLRAKS 857
Score = 36.9 bits (86), Expect = 0.12
Identities = 41/212 (19%), Positives = 82/212 (38%), Gaps = 33/212 (15%)
Query: 1593 NDLKRK---ANLTKAGADSTKNTVQKIVDVLTEART----AQDMAEVAIQTAKDDISAAR 1645
N L RK + + + + V ++ + E ++ E + A+ AR
Sbjct: 11 NQLYRKESELSQLSSKLEDEQALVAQLQKKIKELEARIRELEEELEAE-RAARAKAEKAR 69
Query: 1646 KDLS----QITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE 1701
DLS +++ L++A + E K +A + + D+E E
Sbjct: 70 ADLSRELEELSERLEEAGGATAAQ-------IELNKKREAELAKLRK---DLEEANLQHE 119
Query: 1702 KEVALAQKQASGLRSRYQET-DNKLLSKAESSGLKRARGQMLL---NKASQLSVNTTAKL 1757
+ +A +K+ + E + L K ++ ++ + Q+ + +QL AKL
Sbjct: 120 EALATLRKKHQDAINELSEQIEQ--LQKQKAK-AEKEKSQLQAEVDDLLAQLDQIAKAKL 176
Query: 1758 KQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
E ES+L+EL + ELQ+++
Sbjct: 177 ----NAEKKAKQLESQLSELQVKLDELQRQLN 204
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 53.1 bits (127), Expect = 2e-06
Identities = 97/564 (17%), Positives = 203/564 (35%), Gaps = 90/564 (15%)
Query: 1263 VGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQEN--NVEGALELTRQA 1320
+ + + L + +++ +E K L +T E+ KE QE V L+ +
Sbjct: 410 AAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGS 469
Query: 1321 HDTSLKLKEQSTETEKQINDAER----QCKRTE--NLVTKTFPQFTQGQEENEKALQRLN 1374
D L+L ++ + E++++ AE+ + + E +L + + ++ +++ Q +
Sbjct: 470 SDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNH 529
Query: 1375 EKIT-----SLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKD 1429
T L K+ D +E I + D L G + K
Sbjct: 530 HTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLG---------------YFPNKKQ 574
Query: 1430 YAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE-VFRGITT 1488
+ E + L K L+ K++ + E+ + E+ +F +
Sbjct: 575 LEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGS 634
Query: 1489 AKQESHAANILA--KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNIS 1546
+ES + +++ A + TA Y + IT+ DE N S
Sbjct: 635 QDEESDLERLKEEIEKSSKQRAMLAGATAVY---SQFITQLTDE-------------NQS 678
Query: 1547 GLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGA 1606
V + + +++E +++ + ++ D L T +L K +++ + A
Sbjct: 679 CCPVCQRVFQTEAELQEFISDLQSKLRLAPDK---LKSTESELKK---KEKRRDEMLGLA 732
Query: 1607 DSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETN 1666
++ + + E R IQ K+DI L I + + A K T+
Sbjct: 733 PGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESA--KVCLTD 790
Query: 1667 VKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLL 1726
V + ER + M++++ E+ + ++ A + Q S L Q+ + +
Sbjct: 791 VTI---MERFQ-------------MELKDVERKIAQQAA--KLQGSDLDRTVQQVNQEKQ 832
Query: 1727 SKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIA---- 1782
K ++ + L K Q + +Q+ ++ N +SE ++ N+
Sbjct: 833 EKQHELDTVVSKIE-LNRKLIQ------DQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQ 885
Query: 1783 ------ELQKRIQSCINFIVDKSN 1800
EL +QS I I D
Sbjct: 886 FEEQLVELSTEVQSLIREIKDAKE 909
Score = 41.6 bits (97), Expect = 0.005
Identities = 80/483 (16%), Positives = 180/483 (37%), Gaps = 68/483 (14%)
Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQE------NNVEGALELTRQAHDTSLKLKEQSTET 1334
L L EEI+K +K ML + + + + + ++ T +L+E ++
Sbjct: 641 LERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFISDL 700
Query: 1335 EKQINDAERQCKRTENLVTKTFPQFTQ---GQEENEKALQRLNEKITSLEKNLPDLNELI 1391
+ ++ A + K TE+ + K + + + + ++I L L +N I
Sbjct: 701 QSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDI 760
Query: 1392 CDKRGDPCDN--LCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELR 1449
+ D + L G + E SAK T ++ + + +++
Sbjct: 761 QRLKNDIEEQETLLGT--------------IMPEEESAKV-CLTDVTIMERFQMELKDVE 805
Query: 1450 KHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAAS 1509
+ + + D T +++ EK+ K E + + + + +E H S
Sbjct: 806 RKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHE-LDTVVSKIELNRKLIQDQQEQIQHLKS 864
Query: 1510 VRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEIN 1569
N+ + ++ +F EQ+ E + E+
Sbjct: 865 KTNELKSEKLQIGTNLQRRQQF--------------------------EEQLVELSTEVQ 898
Query: 1570 NVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTA--- 1626
++I+ + D + D + K L + S K K+ D+ + +
Sbjct: 899 SLIREIKDAKEQDSPLE--TFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGY 956
Query: 1627 QDMAEVAIQTAKDDISAARK-DLSQITNDLDDAQQKSNETNVKVKLLQERLKS--IQAGF 1683
E IQ KDD ++ +L+ + L++ ++ + N ++L+++ + + IQ +
Sbjct: 957 MKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQERW 1016
Query: 1684 LQNGRSAMDVENEEKNLEKEVA-----LAQKQASGLRSRYQETDNKL--LSKAESSGLKR 1736
LQ+ + ENE K +E+E+ + Q Q ++ +Q+ + + + + L R
Sbjct: 1017 LQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGR 1076
Query: 1737 ARG 1739
+G
Sbjct: 1077 QKG 1079
Score = 36.9 bits (85), Expect = 0.13
Identities = 76/401 (18%), Positives = 146/401 (36%), Gaps = 60/401 (14%)
Query: 1432 ETSKKLINEKESKAEELR------KHMI--ILSESNSAKDYAETSKKLINEK---EAKAE 1480
E+S++++ E++ + L+ +H + I+ N K K++ + E K E
Sbjct: 234 ESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKME 293
Query: 1481 EVFRGITTAKQ-----ESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAP 1535
+VF+G T +Q +H + KE + + + ++ E L
Sbjct: 294 KVFQG--TDEQLNDLYHNHQRTVREKE--RELVDCQRELEKLNKERRLLNQEKTELLVEQ 349
Query: 1536 GATL--ADIRNISGLAVNKNIQSNPEQIR----ERANEINNVIKSLTDIDTILTETAGDL 1589
G AD A + IQS ++ ER IK+ T++ E D
Sbjct: 350 GRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFH---TLVIERQEDE 406
Query: 1590 AKA-----NDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAA 1644
AK DL+ K L + AD +I D ++ + ++ ++++
Sbjct: 407 AKTAAQLCADLQSKERLKQEQAD-------EIRDEKKGLGRTIELKKEILEKKQEELKFV 459
Query: 1645 RKDLSQITNDLDDAQQKSN------------ETNVKVKLLQERLKSIQAGFLQNGRS--A 1690
K+L Q+ D + E N + L++ +KS+Q R
Sbjct: 460 IKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRK 519
Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQM-LLNKASQL 1749
+D E E+ N + Q L ++ + ++ + K +S + QL
Sbjct: 520 LDQEMEQLNHHTT---TRTQMEML-TKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQL 575
Query: 1750 SVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
+K K++N+T EL L +N + ++S
Sbjct: 576 EDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELES 616
Score = 36.9 bits (85), Expect = 0.14
Identities = 92/565 (16%), Positives = 191/565 (33%), Gaps = 78/565 (13%)
Query: 1267 LGNTTQRINLARLALNDLTEEIKKLNKTG-EMLKENATLLQENNVEGALELTRQAHDTSL 1325
L N + I + L EIK L +M K+N+ L L
Sbjct: 250 LKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSEL-------------------EL 290
Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
K+++ T++Q+ND +RT + + E+ K + LN++ T L
Sbjct: 291 KMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQG 350
Query: 1386 DLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKA 1445
L +L D+ + S T E + + + +++ N
Sbjct: 351 RL-QLQADRH--QEHIR-------ARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVI 400
Query: 1446 EEL-RKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAY 1504
E + + ++ +E + + + R I K+ IL K+
Sbjct: 401 ERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKE------ILEKKQ- 453
Query: 1505 DHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRER 1564
++ + + D L R +S N ++ ++++
Sbjct: 454 ---EELKFV----IKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSL 506
Query: 1565 ANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEAR 1624
NE ++ + L +D E + LTK D + ++KI ++
Sbjct: 507 QNEKADLDRKLRKLDQ---EMEQLNHHTTTRTQMEMLTKDKMDKDEQ-IRKIKSRHSDEL 562
Query: 1625 TAQ-------DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLK 1677
T+ E + + +I+ R L+++ +L +Q N N +++ +E+L
Sbjct: 563 TSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLS 622
Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQ------ASGLRSRYQET---------- 1721
S + + D E++ + L++E+ + KQ A+ + S++
Sbjct: 623 SYEDKLFD-VCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCP 681
Query: 1722 --DNKLLSKAESSGLKRARGQMLL---NKASQLSVNTTAKLKQLNETEAMFNSQESELTE 1776
++AE L +K K K+ +E + ++S +
Sbjct: 682 VCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDL 741
Query: 1777 LSKNIAELQKRIQSCINFIVDKSNN 1801
K I EL+ ++Q I N+
Sbjct: 742 KEKEIPELRNKLQKVNRDIQRLKND 766
Score = 35.0 bits (80), Expect = 0.51
Identities = 35/184 (19%), Positives = 76/184 (41%), Gaps = 16/184 (8%)
Query: 1612 TVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL 1671
T++++ + M ++ K+ R ++ L+ +++ ++
Sbjct: 190 TLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDP 249
Query: 1672 LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSR----YQETDNKLLS 1727
L+ RLK I+ N M ++NE K L+ +K S L + +Q TD +L
Sbjct: 250 LKNRLKEIE----HNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQL-- 303
Query: 1728 KAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
+ L + + K +L V+ +L++LN+ + N +++EL + + +LQ
Sbjct: 304 ----NDLYHNHQRTVREKEREL-VDCQRELEKLNKERRLLNQEKTEL-LVEQGRLQLQAD 357
Query: 1788 IQSC 1791
Sbjct: 358 RHQE 361
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists of
several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 51.9 bits (124), Expect = 3e-06
Identities = 86/594 (14%), Positives = 194/594 (32%), Gaps = 85/594 (14%)
Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
LE+K + E + +L V L K + E + L
Sbjct: 89 IELERKASTLAENYERELDRNLELEVRL-------KALEELE--KKAENEAAEAEEEAKL 139
Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE- 1335
+ L+ E KL E + A E+ +A + +LK +E E
Sbjct: 140 LKDKLD---AESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEE 196
Query: 1336 -KQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDK 1394
++ + ++ ++ +N ++ L E++ E++ + +
Sbjct: 197 LREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQL 256
Query: 1395 RGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMII 1454
P E + ++ + + + E EEL
Sbjct: 257 LQIP--------------------ELERELAALREENRKLRSMKEDNELLKEELEDLQSR 296
Query: 1455 LSESNSAKDYAETSK----KLINE-KEAK--AEEVFRGITTAKQESHAANILAKEAYDHA 1507
L ++ + KL NE K K +++ + T S +L E
Sbjct: 297 LERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLK 356
Query: 1508 ASVRNKTAAYVASTSNITKQL--DEFLNA-PGATLADIRNISGLAVNKNIQSNPEQIRER 1564
+N + + A T Q E A + + A+ + +Q + +
Sbjct: 357 E--KNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKE 414
Query: 1565 ANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEAR 1624
+ + ++ S T + + + + L+ +L + S ++ + L E
Sbjct: 415 RDGLRAILNSYDKELTETSVSGQLMKR---LEEAEDLVQ-KVQSHLAKMENQLSELEE-- 468
Query: 1625 TAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFL 1684
+V Q KD + ++ + L ++ + L+ ++++++
Sbjct: 469 ------DVGQQ--KDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLER--- 517
Query: 1685 QNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLK--RARGQML 1742
+ R + E+ LE ++ + L+ Y + K+L + + + + Q +
Sbjct: 518 ERDR----LRQEKSLLEMKL-----EHLCLQGDYNASRTKVLHMSLNPASEAEQIAKQTI 568
Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL------SKNIAELQKRIQS 1790
++ KLK+ + SQ +L + SK IA+L+K+++S
Sbjct: 569 EALQAECE-----KLKERLQALEEGKSQPGDLEKAVGSHISSKEIAQLKKQVES 617
Score = 43.8 bits (103), Expect = 0.001
Identities = 44/275 (16%), Positives = 92/275 (33%), Gaps = 35/275 (12%)
Query: 1522 SNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPE-QIRERANEINNVIKSLTDIDT 1580
S + L +FL+ P + L V+ + S+ + Q+ E + L +
Sbjct: 17 SILRSDLPKFLSQP------LEGSHHLGVSTSALSSLQKQVEESMTLLQRA--ELIRSKS 68
Query: 1581 ILTETAGDLAKAN--------DLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
L + +L + +L+RKA+ + + ++ L + AE
Sbjct: 69 KLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAEN 128
Query: 1633 AIQTAKDDISAARKDLSQITNDL--DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSA 1690
A+++ + L + L + Q I+ + A
Sbjct: 129 EAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESIS--------RIKNDLSEMQCRA 180
Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS 1750
+ + E K LE E+ ++Q + E + KL L + N S
Sbjct: 181 QNADTELKLLESELEELREQLEECQKELAEAEKKL------QSLTSEQASSADN--SVKI 232
Query: 1751 VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQ 1785
+ +LK+ + + S + +L ++ + EL
Sbjct: 233 KHLEEELKRYEQDAEVVKSMKEQLLQIPELERELA 267
Score = 38.8 bits (90), Expect = 0.031
Identities = 83/493 (16%), Positives = 161/493 (32%), Gaps = 105/493 (21%)
Query: 1314 LELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRL 1373
++L + L+ K E E++ + +R E E L+ L
Sbjct: 71 IQLENELMQKELEHKRAQIELERKASTLAENYERE-----------LDRNLELEVRLKAL 119
Query: 1374 NEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKD-YAE 1432
E E + E E+ KD
Sbjct: 120 EELEKKAENEAAEAEE---------------------------------EAKLLKDKLDA 146
Query: 1433 TSKKLINEKESKAEELRKHMIILSESNSAKDY------AET-SKKLINEKE---AKAEEV 1482
S KL NEKE + +E ++ I N + A+T K L +E E + EE
Sbjct: 147 ESLKLQNEKEDQLKEAKES--ISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEEC 204
Query: 1483 FRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADI 1542
+ + A+++ + + D+ SV+ K L+E L +
Sbjct: 205 QKELAEAEKKLQSLTSEQASSADN--SVKIK-------------HLEEELKRYEQDAEVV 249
Query: 1543 R-NISGLAVNKNIQSNPEQIRERANEINNVIKS-------LTDIDTILTETAGDLAKAND 1594
+ L ++ +RE ++ ++ + L D+ + L K D
Sbjct: 250 KSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLAD 309
Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDI-SAARKDLSQI-T 1652
L+ + + S K+ +Q D+ RT D++ + +++ +
Sbjct: 310 LELEKEKLENELKSWKSLLQ---DIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSA 366
Query: 1653 NDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQAS 1712
L+ Q+ LQ + + L+ + ++ + L++ + L K+
Sbjct: 367 KQLETTLQQ----------LQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERD 416
Query: 1713 GLRSRYQETDNKL----LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET----E 1764
GLR+ D +L +S L+ A L+ K +L +L E +
Sbjct: 417 GLRAILNSYDKELTETSVSGQLMKRLEEAED--LVQKVQSHLAKMENQLSELEEDVGQQK 474
Query: 1765 AMFNSQESELTEL 1777
N+ E+E+ L
Sbjct: 475 DRNNTLETEIKLL 487
Score = 33.0 bits (75), Expect = 1.9
Identities = 32/151 (21%), Positives = 61/151 (40%), Gaps = 11/151 (7%)
Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALA 1707
L Q+ N+L QK E L+ + ++ + + +++E K LE+ A
Sbjct: 70 LIQLENEL---MQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKA 126
Query: 1708 QKQASGLRSRYQETDNKL------LSKAESSGLKRARGQ--MLLNKASQLSVNTTAKLKQ 1759
+ +A+ + +KL L + LK A+ + N S++ +
Sbjct: 127 ENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTE 186
Query: 1760 LNETEAMFNSQESELTELSKNIAELQKRIQS 1790
L E+ +L E K +AE +K++QS
Sbjct: 187 LKLLESELEELREQLEECQKELAEAEKKLQS 217
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 51.9 bits (125), Expect = 4e-06
Identities = 71/373 (19%), Positives = 140/373 (37%), Gaps = 73/373 (19%)
Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKL-INEKEAKA--EEVFRGITTAKQESHAANIL 1499
+L KH+I S + A DY + + ++ +EA E++ E + +
Sbjct: 253 QSDRDLFKHLITESTNYVAADYMRHANERRVHLEEALELRRELYTSRRQLAAEQYRLVEM 312
Query: 1500 AKE-----------------AYDHAASVRN------KTAAYVASTSNITKQLDEFLNAPG 1536
A+E A DH V+ K Y A + ++L+E
Sbjct: 313 ARELAELNEAESDLEQDYQAASDHLNLVQTALRQQEKIERYQADLEELEERLEE------ 366
Query: 1537 ATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLK 1596
V + E+ RA ++D + ++ A D +A D++
Sbjct: 367 ----------QNEVVEEADEQQEENEARAEAAE------EEVDELKSQLA-DYQQALDVQ 409
Query: 1597 RKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLD 1656
+ T+A Q+ V L A+ + ++ A+D + + + T +L
Sbjct: 410 Q----TRAIQ------YQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELL 459
Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS-AMDVENE-EKNLEKEVALAQKQASGL 1714
+QK + ++ + ++ + RS A DV E + L ++ LA+ Q L
Sbjct: 460 SLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAE-QLQQL 518
Query: 1715 RSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESEL 1774
R R E + +L + +R + K +++ +L+QL E E+ L
Sbjct: 519 RMRLSELEQRLRQQQR---AERLLAE--FCKRLGKNLDDEDELEQLQE------ELEARL 567
Query: 1775 TELSKNIAELQKR 1787
LS++++E ++R
Sbjct: 568 ESLSESVSEARER 580
Score = 33.4 bits (77), Expect = 1.5
Identities = 32/146 (21%), Positives = 54/146 (36%), Gaps = 29/146 (19%)
Query: 1643 AARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEK 1702
R++L L Q + E ++ L E ++ Q+ ++A D N +
Sbjct: 290 ELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLE----QDYQAASDHLNLVQT--- 342
Query: 1703 EVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNE 1762
AL Q++ RYQ +L + E Q V A +Q E
Sbjct: 343 --ALRQQEKIE---RYQADLEELEERLEE----------------QNEVVEEAD-EQQEE 380
Query: 1763 TEAMFNSQESELTELSKNIAELQKRI 1788
EA + E E+ EL +A+ Q+ +
Sbjct: 381 NEARAEAAEEEVDELKSQLADYQQAL 406
Score = 33.0 bits (76), Expect = 2.0
Identities = 38/192 (19%), Positives = 78/192 (40%), Gaps = 41/192 (21%)
Query: 1606 ADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET 1665
A T+ ++ LTE Q A + + A + ++ ++ L+ A+Q+
Sbjct: 947 AQQTQRDAKQQAFALTEVV--QRRAHFSYEDAAEMLAKNSDLNEKLRQRLEQAEQERTR- 1003
Query: 1666 NVKVKLLQERLKSIQAGFLQNG------RSAMDVENEE-KNLEKEVALAQKQASGLRSRY 1718
+E+L+ QA Q +S+ D + + + L++E+ Q G+
Sbjct: 1004 ------AREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQEL-----QDLGVP--- 1049
Query: 1719 QETDNKLLSKAESSGLKRAR-GQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL 1777
A+S +RAR + L+ +LS N + + N+ E E+E+ L
Sbjct: 1050 ----------ADSGAEERARARRDELHA--RLSANRSRR----NQLEKQLTFCEAEMDNL 1093
Query: 1778 SKNIAELQKRIQ 1789
+K + +L++
Sbjct: 1094 TKKLRKLERDYH 1105
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 49.2 bits (118), Expect = 5e-06
Identities = 39/191 (20%), Positives = 76/191 (39%), Gaps = 36/191 (18%)
Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
R +L KA + + K D + M E AI+ + +++ AR+ L+Q
Sbjct: 4 FSRLKDLVKANINEL---LDKAED-------PEKMLEQAIRDMESELAKARQALAQAI-- 51
Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEE---KNLEKEVALAQKQA 1711
A+QK E K++ Q R + ++ + A+ NE+ + LE E + A
Sbjct: 52 ---ARQKQLER--KLEEAQARAEKLE----EKAELALQAGNEDLAREALE-EKQSLEDLA 101
Query: 1712 SGLRSRYQETDNKLL-SKAESSGLKR----ARGQMLLNKASQLSVNTTAKLKQL------ 1760
L + Q+ + ++ K + + L++ R + KA + + K+ +
Sbjct: 102 KALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARKAAAKAQEKVNRSLGGGSS 161
Query: 1761 NETEAMFNSQE 1771
+ A F E
Sbjct: 162 SSAMAAFERME 172
Score = 34.6 bits (80), Expect = 0.34
Identities = 28/156 (17%), Positives = 50/156 (32%), Gaps = 26/156 (16%)
Query: 1247 IDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGE--MLKENATL 1304
I DM+ ++ + L R L + +KL + E + N L
Sbjct: 33 IRDMESELAK------ARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDL 86
Query: 1305 LQE-----NNVEGALELTRQ----AHDTSLKLKEQSTETEKQINDAERQCKRTENL---V 1352
+E ++E + A + KLK+Q E++I + + K
Sbjct: 87 AREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAK-KEALKARKAA 145
Query: 1353 TKTFPQFTQ-----GQEENEKALQRLNEKITSLEKN 1383
K + + A +R+ EKI E
Sbjct: 146 AKAQEKVNRSLGGGSSSSAMAAFERMEEKIEEREAR 181
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 51.7 bits (123), Expect = 5e-06
Identities = 84/367 (22%), Positives = 153/367 (41%), Gaps = 41/367 (11%)
Query: 1431 AETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAK 1490
A+ +KK E++ KA+E +K +++ AK AE +KK + + KAEE + AK
Sbjct: 1292 ADEAKK--AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Query: 1491 QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAV 1550
E+ AA A+ A + A + K K+ +E A
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKA---------------- 1393
Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTK 1610
+ E+ +++A+E+ + D + A + KA++ K+KA K AD K
Sbjct: 1394 -DEAKKKAEEDKKKADELKKAAAAKKKADE-AKKKAEEKKKADEAKKKAEEAKK-ADEAK 1450
Query: 1611 NTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKV- 1669
+ EA+ A++ + A + K D K ++ D+A++K+ E K
Sbjct: 1451 KKAE-------EAKKAEEAKKKAEEAKKAD---EAKKKAEEAKKADEAKKKAEEAKKKAD 1500
Query: 1670 ---KLLQERLKSIQAGFLQNGRSAMDVEN-EEKNLEKEVALAQ--KQASGLRSRYQETDN 1723
K + + K+ +A + + A + + EE E A+ K+A L+ +
Sbjct: 1501 EAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKA 1560
Query: 1724 KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQ---LNETEAMFNSQESELTELSKN 1780
+ KAE + M L KA + A++++ L E E ++E++ E +K
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 1781 IAELQKR 1787
AE K+
Sbjct: 1621 KAEELKK 1627
Score = 39.7 bits (92), Expect = 0.019
Identities = 70/369 (18%), Positives = 143/369 (38%), Gaps = 20/369 (5%)
Query: 1423 ESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEV 1482
E+ +A++ AE ++K E + KA+ +K +++ AK AE KK +E + A
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAK 1417
Query: 1483 FRGITTAK--QESHAANILAKEAYD-HAASVRNKTAAYVASTSNITKQLDEFLNAPGATL 1539
+ K +E A+ K+A + A K A K+ +E A A
Sbjct: 1418 KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK- 1476
Query: 1540 ADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKA 1599
+ + E+ +++A+E ++ D + A + KA++ +KA
Sbjct: 1477 ---KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA--KKAEEAKKADEA-KKA 1530
Query: 1600 NLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQ 1659
K ++ K +K D L +A + E + + + A+
Sbjct: 1531 EEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE 1590
Query: 1660 QKSNETNVKVKLLQERLKSIQAGFLQNGR-SAMDVENEEKNLEKEVALAQKQASGLRSRY 1718
+ E +K+ ++++K+ +A + + A +++ E+ +K L +K+A +
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650
Query: 1719 QETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELS 1778
+ L KAE +A K ++ + K+ E E + E +
Sbjct: 1651 E------LKKAEEENKIKAAE---EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
Query: 1779 KNIAELQKR 1787
K EL+K+
Sbjct: 1702 KKAEELKKK 1710
Score = 30.9 bits (69), Expect = 7.9
Identities = 63/287 (21%), Positives = 103/287 (35%), Gaps = 49/287 (17%)
Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQC 1345
EE KK + ++E L +E A E + K+K + + ++ Q
Sbjct: 1584 EEAKKAEE--ARIEEVMKLYEEEKKMKAEEAKKAEEA---KIKAEELKKAEEEKKKVEQL 1638
Query: 1346 KRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
K+ E K + + +EEN+ +K +K + + D++
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK---------- 1688
Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYA 1465
+ + K AE +KK K+ +AEE +K + K A
Sbjct: 1689 ----------------KAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732
Query: 1466 ETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNIT 1525
E +KK E + KAEE + K+ +H L KE A +R + A I
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKKIAH----LKKEEEKKAEEIRKEKEAV------IE 1782
Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVI 1572
++LDE + R + K+I N I E E N VI
Sbjct: 1783 EELDE--------EDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVI 1821
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 49.7 bits (119), Expect = 5e-06
Identities = 51/241 (21%), Positives = 93/241 (38%), Gaps = 35/241 (14%)
Query: 1552 KNIQSNPEQIRERANEINNVIKSLT-DIDTI---LTETAGDLAKA-NDLKRKANLTKAGA 1606
KNIQ+ E + + EI + I L +ID + + ++A+ ++ + L K A
Sbjct: 48 KNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKKRA 107
Query: 1607 DS--TKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDL--SQIT--NDLDDAQQ 1660
+ T +DV+ +++ D+ TA I A K + Q L++ Q
Sbjct: 108 RAMQVNGTATSYIDVILNSKSFSDLISRV--TAISVIVDADKKILEQQKEDKKSLEEKQA 165
Query: 1661 KSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEK-EVALAQKQASGLRSRYQ 1719
+ + LQ L++ + +++ ALA K+AS L +
Sbjct: 166 ALEDKLETLVALQNELETQLN----------SLNSQKAEKNALIAALAAKEASALGEKAA 215
Query: 1720 ETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSK 1779
+ K L AE++ + A+ + A+Q Q T A Q S +TE +
Sbjct: 216 LEEQKAL--AEAAAAEAAKQEAAAKAAAQ-----EQAALQAAATAA----QPSAVTESAS 264
Query: 1780 N 1780
Sbjct: 265 A 265
Score = 45.9 bits (109), Expect = 1e-04
Identities = 35/196 (17%), Positives = 72/196 (36%), Gaps = 39/196 (19%)
Query: 1622 EARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQA 1681
A D IQ +S +K+ I N+++ + E K+ LQ+ + +A
Sbjct: 25 FAALLSDK----IQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKA 80
Query: 1682 GFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQ------------ETDN------ 1723
+ + ++ ++N+ + L +K+A R+ + +
Sbjct: 81 EIKKLQK---EIAELKENIVERQELLKKRA---RAMQVNGTATSYIDVILNSKSFSDLIS 134
Query: 1724 -----KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQL----NETEAMFNSQESEL 1774
++ A+ L++ + + Q ++ KL+ L NE E NS S+
Sbjct: 135 RVTAISVIVDADKKILEQQKEDKKSLEEKQAALED--KLETLVALQNELETQLNSLNSQK 192
Query: 1775 TELSKNIAELQKRIQS 1790
E + IA L + S
Sbjct: 193 AEKNALIAALAAKEAS 208
Score = 32.0 bits (73), Expect = 2.5
Identities = 40/243 (16%), Positives = 78/243 (32%), Gaps = 75/243 (30%)
Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKA---LQRLNEKITSLEKNLPDLN 1388
+E +K+ + + + + +N + + + + Q+E +++ +++L ++I L++N+ +
Sbjct: 41 SELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQ 100
Query: 1389 ELICDKRGDPCDNLCGGAGCGKCGGMWCSNGT------LSESNSAKDYA----------- 1431
EL+ KR M + + S S D
Sbjct: 101 ELL-KKRAR---------------AMQVNGTATSYIDVILNSKSFSDLISRVTAISVIVD 144
Query: 1432 ---------ETSKKLINEK----ESKAEELRKHMIILSES----------------NSAK 1462
+ KK + EK E K E L L A
Sbjct: 145 ADKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAA 204
Query: 1463 DYAETSKK---LINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVA 1519
A + L +K A ++ AA A+E AA TAA +
Sbjct: 205 KEASALGEKAALEEQKALAEAAAAE----AAKQEAAAKAAAQEQ---AALQAAATAAQPS 257
Query: 1520 STS 1522
+ +
Sbjct: 258 AVT 260
Score = 30.5 bits (69), Expect = 7.6
Identities = 11/50 (22%), Positives = 22/50 (44%)
Query: 1755 AKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKN 1804
+ Q+ E ++ + + E+ + I +LQK I IV++ K
Sbjct: 56 SLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKK 105
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 51.2 bits (123), Expect = 5e-06
Identities = 55/375 (14%), Positives = 123/375 (32%), Gaps = 60/375 (16%)
Query: 1452 MIILSESNS-AKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAA-NILAKEAYDHAAS 1509
+ L E N + E KK + + E K E++ I ++E A N +A+ A
Sbjct: 83 VFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAK 142
Query: 1510 VRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEIN 1569
+ SN+++ L ++ + + +N + + E+++ + +
Sbjct: 143 KYD---------SNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLF 193
Query: 1570 NVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDM 1629
+ K + T+ ++ +A L++ + S L D
Sbjct: 194 SSNKPELALLTLSVIDFDEIEQAEILEKSIIGSSDVPISE----------LINNLGNSDW 243
Query: 1630 AEVAIQTAKDD---------ISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQ 1680
+ ++ ++ I+ RK ++ D+ Q+ E +++ L ++ +S
Sbjct: 244 VKEGLEYHEEGDTCPFCQQTITEERKA--ELEAHFDEEYQELIE---QLEELIDKYESHI 298
Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
L+ S +D E E + +V + L ++ KL K
Sbjct: 299 EKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEK------------ 346
Query: 1741 MLLNKASQLS--VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR---------IQ 1789
L S + T ++ +N+ N E E N+ + + + +
Sbjct: 347 --LKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVA 404
Query: 1790 SCINFIVDKSNNYKN 1804
I K
Sbjct: 405 ELKEDIDAYQKEKKG 419
Score = 45.8 bits (109), Expect = 2e-04
Identities = 20/181 (11%), Positives = 69/181 (38%), Gaps = 18/181 (9%)
Query: 1505 DHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRER 1564
+ + + +I ++ + + +++ + A+ + ++ N +++ E+
Sbjct: 289 ELIDKYESHIEKALEELESIL-DTEKENSEFKLDVEELKALLE-ALEEILEKNLQKLEEK 346
Query: 1565 ANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEAR 1624
+ + I+ L I ++ + N+L R+ N + + +A+
Sbjct: 347 LKDPSTSIE-LESITDLIESINDIIDAINELIREHN----------EKIDNLKKEKNKAK 395
Query: 1625 TAQDMAEVA-----IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
+ VA I + + K ++ + ++ + + ++K L+++L +I
Sbjct: 396 KKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNI 455
Query: 1680 Q 1680
+
Sbjct: 456 E 456
Score = 33.1 bits (76), Expect = 1.7
Identities = 24/125 (19%), Positives = 53/125 (42%), Gaps = 14/125 (11%)
Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL 1325
L + T I ++ + E I++ N+ + LK+ + + L L + +
Sbjct: 355 ELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKA-KKKLW--LHLVAELKEDID 411
Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
+++ EK IN E++ K Q + EK ++ L +++T++E
Sbjct: 412 AYQKEKKGLEKAINSLEKEIK-----------QLEAEIKALEKEIKELEKQLTNIEPTAD 460
Query: 1386 DLNEL 1390
++N+L
Sbjct: 461 EINKL 465
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have LPXTG
C-terminal anchoring motifs and a substantial number have
the KxYKxGKxW putative sorting signal at the N-terminus.
The tetracycline resistance plasmid pCF10 in Enterococcus
faecalis promotes conjugal plasmid transfer in response
to sex pheromones, but PgrA/Sec10 encoded by that
plasmid, a member of this family, specifically inhibits
the ability of cells to receive homologous plasmids. The
phenomenon is called surface exclusion.
Length = 356
Score = 50.1 bits (120), Expect = 6e-06
Identities = 35/181 (19%), Positives = 54/181 (29%), Gaps = 39/181 (21%)
Query: 1494 HAANILAKEAYDHAASVRNKTAAYVA-STSNITKQLDEFLNAPGATLADIRNISGLAVNK 1552
HA N+L + AY+ S SN F+N + +AD
Sbjct: 200 HAQNLLGDGKGN--------PGAYLGVSISNDGGVNIHFVNFNDSYIADGNKFDK----- 246
Query: 1553 NIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDL--AKANDLKRKANLTKAGADSTK 1610
I N SL + L DL A+ +A LT A
Sbjct: 247 -------------TPIPNTPNSLKALQAALATAQADLAAAQTALNTAQAALTTAQTAYAA 293
Query: 1611 --NTVQKIVDVLTEARTAQD-MAEVAIQTAKDDISAAR-------KDLSQITNDLDDAQQ 1660
+ L A+ A+ + TA+ ++ A + L+ + DL Q
Sbjct: 294 AQAALATAQKELANAQAQALQTAQNNLATAQAALANAEARLAKAKEALANLNADLAKKQA 353
Query: 1661 K 1661
Sbjct: 354 A 354
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and
secretion / Signal transduction mechanisms].
Length = 408
Score = 49.6 bits (118), Expect = 1e-05
Identities = 46/323 (14%), Positives = 121/323 (37%), Gaps = 45/323 (13%)
Query: 1417 SNGTLSE--SNSAKDYAETSKKLINEKESKAEELRKHMIILSE--SNSAKDYAETSKKLI 1472
+ G L++ S+ D K NE ++ + +E S ++++ A ++ +L
Sbjct: 99 AAGDLTKRIDESSNDEFGQLAKSFNEMILNLRQIIDAVQDNAEALSGASEEIAASATELS 158
Query: 1473 NEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFL 1532
+ +AE + + ++ S +A A + AA + S + ++ E +
Sbjct: 159 ARADQQAESLEEVASAIEELSETVKEVAFNA--------KEAAALASEASQVAEEGGEEV 210
Query: 1533 NAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET------- 1585
+ ++ I + + +++ E + EI + I++I +T
Sbjct: 211 RQ---AVEQMQEI-----AEELAEVVKKLSESSQEIEEITSV---INSIAEQTNLLALNA 259
Query: 1586 ------AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKD 1639
AG+ + A+ + A+ + ++ ++I ++ E + A ++ +
Sbjct: 260 AIEAARAGEAGRG--FAVVADEVRKLAERSADSAKEIGLLIEEIQNEAADAVEHMEESAS 317
Query: 1640 DISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVE---NE 1696
++S K + + + L + E + + + + A + S +++ E
Sbjct: 318 EVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINASIEELDDVTQE 377
Query: 1697 EKNLEKEVALA----QKQASGLR 1715
+E+A A ++ A L
Sbjct: 378 NAAAVEELAAASEELKELAEKLL 400
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 50.1 bits (120), Expect = 1e-05
Identities = 102/551 (18%), Positives = 183/551 (33%), Gaps = 115/551 (20%)
Query: 1273 RINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL-----KL 1327
+ L L +ND ++KKL+ TG+ E E RQ D + +
Sbjct: 37 KNELLNLPVNDEISKVKKLHLTGQ-------------TETKFEEWRQKWDDIVTNSFADV 83
Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDL 1387
+E E E + D R + ++ + Q T +E+ E+ L+ LNE + S EKN
Sbjct: 84 EEHLFEAE-ALADKFR-FNKAKHEIDDIEQQLTLIEEDIEQILEDLNELVESEEKN---- 137
Query: 1388 NELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKL---INEKESK 1444
+ D L +D + E E K
Sbjct: 138 -----SEEIDHVLEL--------------------YEELRRDVLANRHQYGEAAPELEKK 172
Query: 1445 AEELRKHMIILSESNSAKDYAETSKKLINEKE------AKAEEVFRGITTAKQESHAANI 1498
E + + + E S+ DY E + L +E + E + + + E
Sbjct: 173 LENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMERIPSLLAELQTELPGQLQ 232
Query: 1499 LAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNP 1558
K Y K Y NI +L+ + +
Sbjct: 233 DLKAGYR-----DMKEEGYHLEHVNIDSRLERLKEQLVENSELLTQL-----------EL 276
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
++ E I I+SL D+ + E AK + L ++K +
Sbjct: 277 DEAEEELGLIQEKIESLYDL--LEREVE---AKNVVEENLPILPDY--------LEKAKE 323
Query: 1619 VLTEARTAQDMAEV--AIQTAKDDISAAR---KDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
+++ V + + A+ ++ + R K+L ++ + LD+ + V LQ
Sbjct: 324 NNEH--LKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQ 381
Query: 1674 ERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSG 1733
+ L+ I+ V+ +L K+ A++ L+S+ E + + K+ G
Sbjct: 382 DNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKLHEIK-RYMEKSNLPG 440
Query: 1734 LKRARGQMLLNKASQLSVNTTAKLKQLNE---------------TEAMFNSQESELTELS 1778
L L A +K+L+E TE M N+ E E E+
Sbjct: 441 LPE-TFLSLFFTAGH---EIQDLMKELSEVPINMEAVSALVDIATEDM-NTLEDETEEVV 495
Query: 1779 KNIAELQKRIQ 1789
+N ++ IQ
Sbjct: 496 ENAVLAEQLIQ 506
Score = 34.3 bits (79), Expect = 0.69
Identities = 90/491 (18%), Positives = 173/491 (35%), Gaps = 125/491 (25%)
Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
E +E+ +ED+ LVE+ ++S ++ +L +++++ ++ N G +
Sbjct: 118 EDIEQILEDLNELVESEEKNSEEIDHVLELYEELRRDVLANRHQYG---------EAAPE 168
Query: 1277 ARLALNDLTEEIKK---LNKTGEMLKENATLLQENNVEGALE-LTRQAHDTSLKLKEQST 1332
L ++ EE+ + L +G+ ++ L + AL + + SL L E T
Sbjct: 169 LEKKLENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMERI--PSL-LAELQT 225
Query: 1333 ETEKQINDAERQCKRTENLVTKT--FPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
E Q+ D + + + + + L+RL E+ L +N L +L
Sbjct: 226 ELPGQLQDLKAGYRD---MKEEGYHLEHV-----NIDSRLERLKEQ---LVENSELLTQL 274
Query: 1391 ICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRK 1450
D+ AE LI EK +L +
Sbjct: 275 ELDE------------------------------------AEEELGLIQEKIESLYDLLE 298
Query: 1451 HMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAY----DH 1506
+ AK+ E + ++ + KA+E K+E KE+Y
Sbjct: 299 REV------EAKNVVEENLPILPDYLEKAKEN---NEHLKEEIE----RVKESYRLAETE 345
Query: 1507 AASVR---NKTAAYVASTSNITKQLDEFLNAPGA---TLADIRNISGLAVNKNIQSNPEQ 1560
SVR + + I + ++ A L +I L +I+ E+
Sbjct: 346 LGSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEK--AL---TDIEDEQEK 400
Query: 1561 IRERANEINNVIKSLTDIDTILTETAGDLAKA-NDLKR---KANLTKAGADST-----KN 1611
++E + SL + E L +++KR K+NL G T
Sbjct: 401 VQEH-------LTSLRKDELEARENLERLKSKLHEIKRYMEKSNLP--GLPETFLSLFFT 451
Query: 1612 TVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL 1671
+I D++ E ++EV I ++ A + T D++ + ++ E L
Sbjct: 452 AGHEIQDLMKE------LSEVPI-----NMEAVSALVDIATEDMNTLEDETEEVVENAVL 500
Query: 1672 LQERLKSIQAG 1682
+ + IQ G
Sbjct: 501 AE---QLIQYG 508
>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
Length = 780
Score = 49.8 bits (119), Expect = 1e-05
Identities = 55/393 (13%), Positives = 113/393 (28%), Gaps = 37/393 (9%)
Query: 1422 SESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
A D AE K +E + ++L N+ + + T +N +
Sbjct: 178 GGVKGAADGAEKLKDGTDEASNGNKKLS------DLLNTLNNSSATFSDGLNALTSGLTT 231
Query: 1482 VFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLAD 1541
+ G+ E A + N+ +S + + L A L
Sbjct: 232 LTDGLNQLDSGLGTLAAGIGELKQGAEQL-NEGIGEFSSGLSELNSGVQDLAAGVPQLNQ 290
Query: 1542 IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANL 1601
IS LA ++ + + + + + + T ++ + A+ L ++
Sbjct: 291 G--ISALAAGLSLPDSLGD---QFSSLQEALTQIAQGLKQKTSSSLEAAQ-GSLSSLQSM 344
Query: 1602 TKAGADSTKNTVQKIVDVLTEARTAQDMAEVA---------IQTAKDDISAARKDLSQIT 1652
VD L Q + E + T D +S L Q +
Sbjct: 345 LALSKSLDLTAEGATVDALGAPDGVQWLDESQKTLATLSELLSTGIDGVSEGLDALEQAS 404
Query: 1653 NDLDDAQQKSNETNVKV----KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQ 1708
L + K ++ L + + ++ L +A
Sbjct: 405 AQLAKSLAKLKTAVAQIAASIAQLLPGASEVL-----KTLKSKGLDKLLNQLNGALAKGS 459
Query: 1709 KQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFN 1768
S ++ + S +GL L + ++ LS + L QL + +
Sbjct: 460 NALVQGLSDANDSFRSITSAQLKAGLNT-----LADGSNDLS-SLGPGLGQLADGSKLLA 513
Query: 1769 SQESELTELSKNIAELQKRIQSCINFIVDKSNN 1801
SEL S + + + + + D +
Sbjct: 514 DGLSELNTGSAQLRDGLGELSDGLTELADSLQD 546
Score = 42.1 bits (99), Expect = 0.003
Identities = 56/389 (14%), Positives = 118/389 (30%), Gaps = 56/389 (14%)
Query: 1424 SNSAKDYAETSKKLINEKESKAEELRK-----HMIILSESNSAKDYAETSKKLINEK--- 1475
+ KD + + + L + + ++ + +L +
Sbjct: 187 AEKLKDGTDEASNGNKKLSDLLNTLNNSSATFSDGLNALTSGLTTLTDGLNQLDSGLGTL 246
Query: 1476 EAKAEEVFRGITTAKQESHA-------ANILAKEAYDHAASVRNKTAAYVASTSNITKQL 1528
A E+ +G + N ++ + +A A S
Sbjct: 247 AAGIGELKQGAEQLNEGIGEFSSGLSELNSGVQDLAAGVPQLNQGISALAAGLSLPDSLG 306
Query: 1529 DEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDID------TIL 1582
D+F + + I+ K S + + +++ +D T+
Sbjct: 307 DQFSS----LQEALTQIAQGLKQKTSSSLEAAQGSL-SSLQSMLALSKSLDLTAEGATVD 361
Query: 1583 TETAGDLAKANDLKRKANLTKAGADST-KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDI 1641
A D + D +K T + ST + V + +D L +A + ++TA I
Sbjct: 362 ALGAPDGVQWLDESQKTLATLSELLSTGIDGVSEGLDALEQASAQLAKSLAKLKTAVAQI 421
Query: 1642 SAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE 1701
+A+ L +++ + + +L K A L G S
Sbjct: 422 AASIAQLLPGASEVLKTLKSKGLDKLLNQLNGALAKGSNA--LVQGLSD----------- 468
Query: 1702 KEVALAQKQASGLRSRYQETDNKLLSKAE--SSGLKRARGQMLLNKASQLSVNTTAKLK- 1758
A + S ++ + N L + SS L S+L + ++L
Sbjct: 469 ---ANDSFR-SITSAQLKAGLNTLADGSNDLSSLGPGLGQ---LADGSKLLADGLSELNT 521
Query: 1759 ---QLNETEAMFNSQESELTELSKNIAEL 1784
QL + + LTEL+ ++ +
Sbjct: 522 GSAQLRDGLGELS---DGLTELADSLQDA 547
Score = 35.6 bits (82), Expect = 0.28
Identities = 30/169 (17%), Positives = 59/169 (34%), Gaps = 27/169 (15%)
Query: 1626 AQDMAEVAIQTAKDDISAARKDLSQITNDLD--DAQQKSNETNVKVKLLQERLKSIQAGF 1683
A + E A ++IS ++T A + VK + + ++ G
Sbjct: 141 APKITEKAADKLLNEIS------KELTETYTKVVAFPTIYDLGGGVKGAADGAEKLKDGT 194
Query: 1684 LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAE--SSGLKR--ARG 1739
+E N K+++ + + + + N L S + GL + +
Sbjct: 195 -----------DEASNGNKKLSDLLNTLNNSSATFSDGLNALTSGLTTLTDGLNQLDSGL 243
Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRI 1788
L +L +QLNE F+S SEL +++A ++
Sbjct: 244 GTLAAGIGELK----QGAEQLNEGIGEFSSGLSELNSGVQDLAAGVPQL 288
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 49.2 bits (117), Expect = 2e-05
Identities = 92/608 (15%), Positives = 169/608 (27%), Gaps = 122/608 (20%)
Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQC 1345
E+ KK + ++++E L + L+L LK + + Q+ +
Sbjct: 168 EKRKKKERLKKLIEETENLAELIIDLEELKLQEL----KLKEQAKKALEYYQLKEKLE-- 221
Query: 1346 KRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
ENL+ + + + + + + L R ++ K + E I +
Sbjct: 222 LEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKE---- 277
Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKL-INEKESKAEELRKHMIILSESNSAKDY 1464
+ ++ +K+ + E E RK S K+
Sbjct: 278 -------------EEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEL 324
Query: 1465 AETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNI 1524
+ K+L EKE E + +E +E + ++ K +
Sbjct: 325 KKLEKELKKEKEEIEE-----LEKELKELEIKREAEEEEEEQLEKLQEK--LEQLEEELL 377
Query: 1525 TKQLDEFLNAPGATLAD------IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDI 1578
K+ E + ++N ++ EQ + E + +
Sbjct: 378 AKKKLES-ERLSSAAKLKEEELELKNEEEKEAKLLLE-LSEQEEDLLKEEKKEELKIVEE 435
Query: 1579 DTILTETAG------------DLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTA 1626
ET K K + ++ TK L R
Sbjct: 436 LEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQK 495
Query: 1627 QDMA------------------------------------EVAIQTAKDDISAARKDLSQ 1650
+ A VA+ K IS A
Sbjct: 496 LEEASQKESKAREGLAVLLALIKDGVGLRIISAAGRGGDLGVAVANYKVAISVAVIVEVS 555
Query: 1651 ITNDLDDAQQKS-----------NETNVKVKLLQERLKSIQAG-------FLQNGRSAMD 1692
D D +QK + LL+ LKSI Q ++ ++
Sbjct: 556 AVADEVDERQKLVRALTELPLPARTFVLLDPLLKLPLKSIAVLEIDPALNLAQLNKATLE 615
Query: 1693 VENEEKN------------LEKEVALAQKQASGLR---SRYQETDNKLLSKAESSGLKRA 1737
+ ++K L K + A+ + SGLR S + K KA S L +
Sbjct: 616 ADEDDKRAKVVEGILKDTELTKLLESAKAKESGLRKGVSLEEGLAEKSELKASLSELTKE 675
Query: 1738 RGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVD 1797
+ S AK + L E + ++ EL K E ++ + + D
Sbjct: 676 LLAEQELQEKAESEL--AKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQD 733
Query: 1798 KSNNYKNC 1805
K N
Sbjct: 734 KINEELKL 741
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 48.7 bits (117), Expect = 3e-05
Identities = 94/568 (16%), Positives = 192/568 (33%), Gaps = 153/568 (26%)
Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
++ L L +ND E++KKLN TG+ ++ E RQ D E
Sbjct: 36 ERKQELENLPVNDELEKVKKLNLTGQSEEK-------------FEEWRQKWD------EI 76
Query: 1331 STET----EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKN--- 1383
T + E+Q+ +AE F +F + + + + + +E++
Sbjct: 77 VTNSLPDIEEQLFEAEEL--------NDKF-RFRKAK----HEINEIESLLDLIEEDIEQ 123
Query: 1384 -LPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE 1442
L +L EL+ ES E +++ + + +
Sbjct: 124 ILEELQELL-------------------------------ESE------EKNREEVEQLK 146
Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
ELRK +L+ S + +L E + E + + + + + + + E
Sbjct: 147 DLYRELRKS--LLANRFS---FGPALDEL----EKQLENLEEEFSQFVELTESGDYV--E 195
Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA---------VNKN 1553
A + + + AA I + L E L +++ G + +
Sbjct: 196 AREILDQLEEELAALEQIMEEIPELLKELQTELPDQLQELKA--GYRELVEEGYHLDHLD 253
Query: 1554 IQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKAND--------LKRKANLTKAG 1605
I+ + ++E +I+ + L ++D L E + + L+R+ KA
Sbjct: 254 IEKEIQDLKE---QIDENLALLEELD--LDEAEEKNEEIQERIDQLYDILEREV---KA- 304
Query: 1606 ADSTKNTVQKIVDVLTE--ARTAQDMAEVAIQT---------AKDDISAAR---KDLSQI 1651
+ V+K D L + + E+ + + ++ + R K L +
Sbjct: 305 ----RKYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESL 360
Query: 1652 TNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQA 1711
D+ ++ E + LQE L+ I + + + + L K+ A+++
Sbjct: 361 EKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKL 420
Query: 1712 SGLRSRYQETDNKLLSKAESSGL-----------KRARGQM--LLNKASQLSVNTTAKLK 1758
R++ E L E S L ++ L + + +N A +
Sbjct: 421 ERYRNKLHEIKRYL----EKSNLPGLPEDYLEMFFEVSDEIEALAEELEEKPINMEAVNR 476
Query: 1759 QLNETEAMFNSQESELTEL--SKNIAEL 1784
L E + E E EL + + E
Sbjct: 477 LLEEATEDVETLEEETEELVENATLTEQ 504
Score = 48.7 bits (117), Expect = 3e-05
Identities = 46/255 (18%), Positives = 100/255 (39%), Gaps = 51/255 (20%)
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGA--DSTKNTVQKI 1616
E+ R++ +EI SL DI+ L E K K K + + + D + +++I
Sbjct: 67 EEWRQKWDEIVTN--SLPDIEEQLFEAEELNDKFRFRKAKHEINEIESLLDLIEEDIEQI 124
Query: 1617 VDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERL 1676
++ L E +++ ++ KD RK L + L+++L
Sbjct: 125 LEELQELLESEEKNREEVEQLKDLYRELRKSLL----------ANRFSFGPALDELEKQL 174
Query: 1677 KSIQAGFLQNGRSAMDVENEEKN----------LEKEVALAQKQASGLRSRYQETDNKL- 1725
++++ F S E + LE+E+A ++ + +E +L
Sbjct: 175 ENLEEEF-----SQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTELP 229
Query: 1726 --LSKAESSGLKRARGQML--------LNKASQLSVNTTAKLKQLNETEAMFNSQESELT 1775
L + + +G + +++ L+ ++ +Q++E A+ +E +L
Sbjct: 230 DQLQELK-AGYR----ELVEEGYHLDHLDIEKEI----QDLKEQIDENLALL--EELDLD 278
Query: 1776 ELSKNIAELQKRIQS 1790
E + E+Q+RI
Sbjct: 279 EAEEKNEEIQERIDQ 293
Score = 44.4 bits (106), Expect = 5e-04
Identities = 92/538 (17%), Positives = 191/538 (35%), Gaps = 169/538 (31%)
Query: 1217 ELLEKKIEDVKA----LVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ 1272
+L+E+ IE + L+E+ ++ ++ + +++K ++ N + G
Sbjct: 115 DLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRF------SFGPALD 168
Query: 1273 RINLARLA-LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
+ +L L + + +L ++G+ + +A + +L+E+
Sbjct: 169 ELE-KQLENLEEEFSQFVELTESGDYV--------------------EAREILDQLEEEL 207
Query: 1332 TETEKQINDAE---RQCKRTENLVTKTFP-QFTQ---G----QEEN--------EKALQR 1372
E+ + + ++ + P Q + G EE EK +Q
Sbjct: 208 AALEQIMEEIPELLKELQ-------TELPDQLQELKAGYRELVEEGYHLDHLDIEKEIQD 260
Query: 1373 LNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAE 1432
L E+I ++NL L EL D+ E N ++ E
Sbjct: 261 LKEQI---DENLALLEELDLDE--------------------------AEEKN--EEIQE 289
Query: 1433 TSKKL--INEKESKAE-ELRKHMIILSE--SNSAKDYAETSKKL--------INEKEAKA 1479
+L I E+E KA + K+ L + ++ + E +++ +NE E
Sbjct: 290 RIDQLYDILEREVKARKYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESEL-- 347
Query: 1480 EEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATL 1539
E R + KQ L K+ + + + AY S + ++L+E
Sbjct: 348 -ESVRQLE--KQL----ESLEKQYDEITERIAEQEIAY----SELQEELEE--------- 387
Query: 1540 ADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA-NDLKR- 1597
I E+I + +++ +++ L + E +++KR
Sbjct: 388 --------------ILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRY 433
Query: 1598 --KANLTKAGAD--STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
K+NL D V ++ L A+++ E I ++ A + L + T
Sbjct: 434 LEKSNLPGLPEDYLEMFFEVSDEIEAL-----AEELEEKPI-----NMEAVNRLLEEATE 483
Query: 1654 DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQA 1711
D++ ++++ E L E+L IQ N ++ +EVA A +A
Sbjct: 484 DVETLEEETEELVENA-TLTEQL--IQYA------------NRYRSDNEEVAEALNEA 526
Score = 37.1 bits (87), Expect = 0.11
Identities = 60/345 (17%), Positives = 122/345 (35%), Gaps = 74/345 (21%)
Query: 1471 LINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHA----ASVRNKTAAYVASTSNITK 1526
L++ E E++ + ES N +E + +R A S
Sbjct: 113 LLDLIEEDIEQILEELQELL-ESEEKN---REEVEQLKDLYRELRKSLLANRFSFGPALD 168
Query: 1527 QLDEFLNAPGATLADIRNISGL---AVNKNIQSNPEQIRE---RANEINNVIKSLTD-ID 1579
+L++ L N+ V + + RE + E ++ + + I
Sbjct: 169 ELEKQLE----------NLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIP 218
Query: 1580 TILTETAGDL-AKANDLK---RKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQ 1635
+L E +L + +LK R+ L + G + E IQ
Sbjct: 219 ELLKELQTELPDQLQELKAGYRE--LVEEGYHLDHLDI-----------------EKEIQ 259
Query: 1636 TAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERL-KSIQAGFLQNGRSAMDVE 1694
K+ I L ++ DLD+A++K+ E ++ L + L + ++A VE
Sbjct: 260 DLKEQIDENLALLEEL--DLDEAEEKNEEIQERIDQLYDILEREVKAR--------KYVE 309
Query: 1695 NEEKNLEKEVALAQKQASGLRSRYQETD----NKLLSKAESSGLKRARG-----QMLLNK 1745
L + A++Q L+ +E D + L+++E L+ R + L +
Sbjct: 310 KNSDTLPDFLEHAKEQNKELK---EEIDRVKQSYTLNESE---LESVRQLEKQLESLEKQ 363
Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
+++ + +E + +L E+ K +L + +Q
Sbjct: 364 YDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQG 408
Score = 35.2 bits (82), Expect = 0.38
Identities = 42/188 (22%), Positives = 75/188 (39%), Gaps = 51/188 (27%)
Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQI------MNNE--DPNGVGK 1265
KE + L+++I++ AL+E DL ++++++I + E V K
Sbjct: 256 KEIQDLKEQIDENLALLEEL-----DLDEAEEKNEEIQERIDQLYDILEREVKARKYVEK 310
Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL 1325
+ A+ +L EEI + +K++ TL E LE RQ
Sbjct: 311 NSDTLPDFLEHAKEQNKELKEEI-------DRVKQSYTL-----NESELESVRQ------ 352
Query: 1326 KLKEQSTETEKQINDAERQCKR-TENLVTKTFPQFTQGQEENEKALQRL----------N 1374
EKQ+ E+Q TE + + +++ QEE E+ L++L +
Sbjct: 353 --------LEKQLESLEKQYDEITERIAEQEIA-YSELQEELEEILKQLEEIEKEQEKLS 403
Query: 1375 EKITSLEK 1382
E + L K
Sbjct: 404 EMLQGLRK 411
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA a
protein that suppresses sigma54-dependent transcription.
The PspA protein, a negative regulator of the Escherichia
coli phage shock psp operon, is produced when virulence
factors are exported through secretins in many
Gram-negative pathogenic bacteria and its homologue in
plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to the
AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 47.0 bits (112), Expect = 3e-05
Identities = 38/213 (17%), Positives = 75/213 (35%), Gaps = 51/213 (23%)
Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
KR L +A + K D + M E AI+ + ++ AR+ L+Q+
Sbjct: 3 FKRLLRLLRANIHEG---LDKAED-------PEKMLEQAIRDMQSELGKARQALAQVI-- 50
Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEE---------KNLEK--- 1702
A+QK E K++ +E+ K ++ R+A+ NEE LEK
Sbjct: 51 ---ARQKQLER--KLEEQKEQAKKLE----NKARAALTKGNEELAREALAEIATLEKQAE 101
Query: 1703 ----EVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLK 1758
++ + LR + + K+ +A+ L + +
Sbjct: 102 ALETQLTQQRSAVEQLRKQLAALETKIQQ-------LKAKKTALKARLKAAKAQ-----E 149
Query: 1759 QLNETEAMFNSQE--SELTELSKNIAELQKRIQ 1789
+N + +++ + + I E + R
Sbjct: 150 AVNTSLGSASTESATDSFERIEEKIEEREARAD 182
Score = 43.1 bits (102), Expect = 5e-04
Identities = 29/122 (23%), Positives = 40/122 (32%), Gaps = 16/122 (13%)
Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEK 1336
AR AL EE+ + E L E ATL E E Q +L++Q E
Sbjct: 74 ARAALTKGNEELAR-----EALAEIATL--EKQAEALETQLTQQRSAVEQLRKQLAALET 126
Query: 1337 QINDAERQ---CKRTENLVTKTFPQFTQ-----GQEENEKALQRLNEKITSLEKNLPDLN 1388
+I + + K K E + +R+ EKI E
Sbjct: 127 KIQQLKAKKTALKARLK-AAKAQEAVNTSLGSASTESATDSFERIEEKIEEREARADAAA 185
Query: 1389 EL 1390
EL
Sbjct: 186 EL 187
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases, possibly
by mediating the dimerisation of kinase domains or by
targeting these kinases to the nuclear pore complex. MLP1
and MLP2 are involved in the process of telomere length
regulation, where they are thought to interact with
proteins such as Tel1p and modulate their activity.
Length = 132
Score = 44.6 bits (106), Expect = 5e-05
Identities = 25/127 (19%), Positives = 47/127 (37%), Gaps = 36/127 (28%)
Query: 1628 DMAEVAIQTAKDDISAARKDLSQITNDLDD-------AQQK------------------- 1661
+ ++ + A + + ++ DL++ AQQK
Sbjct: 6 SSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALR 65
Query: 1662 --SNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQ 1719
NE ++ L+ +S QA + S E ++K LE E++ +K+ L +
Sbjct: 66 KQLNELKKEIAQLKAEAESAQAELSEAEESW---EEQKKMLEDELSELEKRIEELNEQ-- 120
Query: 1720 ETDNKLL 1726
NKLL
Sbjct: 121 ---NKLL 124
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 45.7 bits (109), Expect = 8e-05
Identities = 42/192 (21%), Positives = 81/192 (42%), Gaps = 29/192 (15%)
Query: 1608 STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
+ + K + L EA+ + AE + A+ ++++ + + + DL+ ++++
Sbjct: 5 QLEEELDKAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLATALE 64
Query: 1668 KVKLLQERLK----SIQA-GFLQNGRSAMDVENEEK--NLEKEVALAQKQASGLRSRYQE 1720
K L+E K S + L+N RS D +EK LE ++ A++ A +Y+E
Sbjct: 65 K---LEEAEKAADESERGRKVLEN-RSLKD---DEKMEQLEAQLKEAKEIAEEADRKYEE 117
Query: 1721 TDNKL------LSKAESSGLKRARGQMLLNKASQLS---VNTTAKLKQLNETEAMFNSQE 1771
KL L +AE R + +K +L LK L +E + +E
Sbjct: 118 VARKLVVVEGDLERAE------ERAEAAESKIVELEEELKVVGNNLKSLEVSEEKASQRE 171
Query: 1772 SELTELSKNIAE 1783
E +++ E
Sbjct: 172 DSYEEKIRDLTE 183
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 47.2 bits (112), Expect = 8e-05
Identities = 108/528 (20%), Positives = 186/528 (35%), Gaps = 97/528 (18%)
Query: 1290 KLNKTGEMLKENA---TLLQEN-NVEGALELTRQAHDTSLKLKEQSTETEK----QINDA 1341
+L K E+ KEN LLQ+ E L+ + S + T T++ Q +
Sbjct: 179 RLRKQREVQKENEFYMQLLQQALPKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGG 238
Query: 1342 ERQCKRTENLVTKTFPQFTQGQEENE------KALQRLNEKITSLEKNLPDLNELICDKR 1395
++ N T +G++ + ++ N + L + E+I +
Sbjct: 239 NHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTI-EVIENHS 297
Query: 1396 G--DPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMI 1453
P + G S T S S+SA + SK + K S +
Sbjct: 298 NKSRPSSSSTNG-----------SKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPK 346
Query: 1454 ILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNK 1513
S +N + + S NE + K +++ + A V N
Sbjct: 347 SHSSANGSVPSSSVSD---NESKQKRA----SKSSSGARDSKKDASGMSANGT---VENC 396
Query: 1514 TAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIK 1573
ST + ++L+ DI K +Q+ +Q R+ +E+ N I
Sbjct: 397 IPENKISTPSAIERLE----------QDI---------KKLQAELQQARQNESELRNQIS 437
Query: 1574 SLTDIDTILTETAGDLAKAND-LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
LT ++ L G L K ND L+ K N + K ++Q + L ++ AE
Sbjct: 438 LLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEK 497
Query: 1633 AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMD 1692
+ K RK + T AQ ++ +E +S++ + D
Sbjct: 498 QLAEEK-----KRKKEEEETAARAAAQAAASR--------EECAESLKQ-------AKQD 537
Query: 1693 VENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRA------RGQMLLNKA 1746
+E E K LE ++ L +++ L QE S+ E+ L A + ML N
Sbjct: 538 LEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDKNLMLEN-- 595
Query: 1747 SQLSVNTTAKL----------KQLNETEAMFNSQESELTELSKNIAEL 1784
LS T KL +QL +E+E+ +L IA++
Sbjct: 596 -SLSAETRLKLDLFSALGDVKRQLEIAHGAIYKREAEIVDLKSKIADI 642
Score = 32.6 bits (74), Expect = 2.4
Identities = 28/115 (24%), Positives = 44/115 (38%), Gaps = 15/115 (13%)
Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEK 1336
+R+ E KK K E A + E E SLK +Q E E
Sbjct: 491 SRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAE--------SLKQAKQDLEMEI 542
Query: 1337 QINDAERQCKRTENLVTKTFPQFTQGQEENEK-------ALQRLNEKITSLEKNL 1384
+ + + + K E + + Q + +E+EK ALQ + +K LE +L
Sbjct: 543 KKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDKNLMLENSL 597
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 46.7 bits (111), Expect = 1e-04
Identities = 68/412 (16%), Positives = 133/412 (32%), Gaps = 80/412 (19%)
Query: 1405 GAGCGKCGGM-----------WCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMI 1453
AG G G + W G E N A + K+L E E + +L+
Sbjct: 122 AAGVGSLGSVRDELEKEADELWKPRGRKPEINVAL---KELKEL--EAEIREVQLK---- 172
Query: 1454 ILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNK 1513
+ + + K L EA+ E + ++E K+ + R +
Sbjct: 173 -------TRTWKDLVKAL---DEAEEE-----LANLRKELRQLEKE-KQRLE-----RLR 211
Query: 1514 TAA-YVASTSNITKQLDEFLNAP------GATLADIRNISGLAVNKNIQSNPEQIRERAN 1566
+A + +QL + R A +N+ E + ER
Sbjct: 212 RLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELR-AARRNL----ELLTERLE 266
Query: 1567 EINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV------DVL 1620
+ + ++ +D L A + + ++ A D + +L
Sbjct: 267 ALQAELDEIS-LDEELLAQAAAIEALHQ--QRGEYRNAEQDLPDREGEIANAREAAAALL 323
Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLS-------QITNDLDDAQQKSNETNVKVKLLQ 1673
+ D V + AA++ ++ + L A+ E ++K ++
Sbjct: 324 AQIGPDADEEAVE---SLRPSLAAKETVTELEKRKEALDQALKSARDALEERERELKQVR 380
Query: 1674 ERLKSIQA--------GFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKL 1725
+L ++ L + A D++ LE+EVA+A+++ + SR L
Sbjct: 381 AQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVAVAKRELAQALSRLGLWRGDL 440
Query: 1726 LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL 1777
+ Q L +A ++ K +L E E + E +L L
Sbjct: 441 EELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHL 492
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 46.2 bits (110), Expect = 2e-04
Identities = 61/320 (19%), Positives = 113/320 (35%), Gaps = 43/320 (13%)
Query: 1472 INEKEAKAEEVFRGITTAKQE-SHAANIL--AKEAYDHAASVRNKTAAYVASTSNITK-Q 1527
I EA+ +E QE +H N L A+ + + + + Q
Sbjct: 211 IEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLERQ 270
Query: 1528 LDEFLNAPGATLADIRNISG----LAVNKN-IQSNPEQIR--ERANEINNVIKSLTDIDT 1580
L E A A A +R ++ L + N + S Q++ E++ + + L + D
Sbjct: 271 LKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEELEERDK 330
Query: 1581 ILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDD 1640
L E+ LA + ++ A D T +I L+ + Q EV IQ K +
Sbjct: 331 ELLESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQ--LEVLIQQVKRE 388
Query: 1641 ISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQA-GFLQNGRSAM-DVENEEK 1698
+ A+ L + +L++ + +++ +I + + + + +NE
Sbjct: 389 LQDAKSQLLKELRELEEELA----------EVDKKISTIPSEEQIAQLLEELGEAQNELF 438
Query: 1699 NLEKEVALAQKQASGLRSRYQETDNKLLSKAESSG------------------LKRARGQ 1740
E E+ +Q L+ + L K + LK R +
Sbjct: 439 RSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKINAFELERAITIADKAKKTLKEFREK 498
Query: 1741 MLLNKASQLSVNTTAKLKQL 1760
+L K QL T K+L
Sbjct: 499 LLERKLQQLEEEITKSFKKL 518
Score = 39.7 bits (93), Expect = 0.017
Identities = 43/185 (23%), Positives = 72/185 (38%), Gaps = 27/185 (14%)
Query: 1215 EFELLEKKIEDVK-ALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
E LE I+ VK L + ++ +L + + ++ K+I + + L
Sbjct: 374 ELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLE----- 428
Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
L + E+ + E E E E A+E R+ D K K + E
Sbjct: 429 ------ELGEAQNELFRS----EAEIEELLRQLETLKE-AIEALRKTLDEKTKQKINAFE 477
Query: 1334 TEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICD 1393
E+ I A++ KT +F ++ E+ LQ+L E+IT K L + LI
Sbjct: 478 LERAITIADK--------AKKTLKEF--REKLLERKLQQLEEEITKSFKKLMRKHNLISR 527
Query: 1394 KRGDP 1398
+ DP
Sbjct: 528 LKIDP 532
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 45.8 bits (109), Expect = 2e-04
Identities = 31/183 (16%), Positives = 66/183 (36%), Gaps = 26/183 (14%)
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
+++ E A I I+ LTD L D + L K N A S QK++
Sbjct: 223 DELVEEAKTIKAEIEELTDELLNLVMDIEDPS--AALN-KLNTAAAKIKSKIEQFQKVIK 279
Query: 1619 VLTEART----AQDMAE------------VAIQTAKDDISAARKDLSQITNDLDDAQQKS 1662
+ + Q ++E +Q + + + A +L +I ++ ++ +K
Sbjct: 280 MYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKL 339
Query: 1663 NETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
E K+ ++ L ++ A V+ + L+ E ++ + L+ +
Sbjct: 340 LELKNKISTNKQSLITLV-------DKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIV 392
Query: 1723 NKL 1725
Sbjct: 393 KTK 395
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 45.4 bits (108), Expect = 3e-04
Identities = 61/379 (16%), Positives = 136/379 (35%), Gaps = 65/379 (17%)
Query: 1436 KLINEKESKAEELRKHMIILSESNSAKDYA----ETSKKLINEKEAKAEEVFRGITTAK- 1490
+ +N ES+ EL + + ++ A + + +++ E E + EE + T +
Sbjct: 206 ERLNGLESELAELDE---EIERYEEQREQARETRDEADEVLEEHEERREE----LETLEA 258
Query: 1491 -----QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNI 1545
+E+ A +E + + ++ D+ L G AD +
Sbjct: 259 EIEDLRETIAETEREREELAEEVRDLRERL------EELEEERDDLLAEAGLDDADAEAV 312
Query: 1546 SGL-----AVNKNIQSNPEQIRERANEINNVIKSLT-DIDTILTETAGDLAKANDLKRKA 1599
++ ++ E+ R A N +SL D D + +A +L+ +
Sbjct: 313 EARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESEL 372
Query: 1600 NLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQ 1659
+ + + ++++ + + E R A V + A+D + + + D+ +
Sbjct: 373 EEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAED-----FLEE--LREERDELR 425
Query: 1660 QKSNETNVKVKLLQERLKSIQAGFLQNGR--------------SAMDVENEEKNLEKEVA 1705
++ E ++ +ER++ +A L+ G+ ++ E+ E+ E E
Sbjct: 426 EREAELEATLRTARERVEEAEA-LLEAGKCPECGQPVEGSPHVETIE-EDRERVEELEAE 483
Query: 1706 LAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEA 1765
L L +E + + L +AE R + L + L + + E
Sbjct: 484 LED-----LEEEVEEVEER-LERAEDLVEAEDRIERLEERREDLE-------ELIAERRE 530
Query: 1766 MFNSQESELTELSKNIAEL 1784
+ EL + AEL
Sbjct: 531 TIEEKRERAEELRERAAEL 549
Score = 43.5 bits (103), Expect = 0.001
Identities = 94/548 (17%), Positives = 183/548 (33%), Gaps = 109/548 (19%)
Query: 1248 DDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE 1307
D +K QI E+ + +R+N L +L EEI++ + E +E T +
Sbjct: 190 DQLKAQIEEKEEKDLH--------ERLNGLESELAELDEEIERYEEQREQARE--TRDEA 239
Query: 1308 NNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENE 1367
+ V E R+ +T L+ + + + I + ER +R E
Sbjct: 240 DEVLEEHEERREELET---LEAEIEDLRETIAETER--EREE------------------ 276
Query: 1368 KALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSA 1427
L E++ L + L +L E D A G + A
Sbjct: 277 -----LAEEVRDLRERLEELEEERDDLL----------AEAGL------------DDADA 309
Query: 1428 KDYAETSKKLINEKESKAEELRKHMIILSESNS-AKDYAETSKKLINEKEAKAEEVFRGI 1486
+ ++L + E + L + + N A+ E + L E +AEE
Sbjct: 310 EAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDL----EERAEE----- 360
Query: 1487 TTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNIS 1546
+E A L E + +V ++ I + + F +AP D+ N
Sbjct: 361 ---LRE--EAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAP----VDLGNAE 411
Query: 1547 GLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAG- 1605
E++RE +E+ L L + +A L + G
Sbjct: 412 DFL---------EELREERDELREREAELE---ATLRTARERVEEAEALLEAGKCPECGQ 459
Query: 1606 --ADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSN 1663
S V+ + E R + E ++ ++++ + L + DL +A+ +
Sbjct: 460 PVEGS------PHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAE-DLVEAEDRIE 512
Query: 1664 ETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEK-NLEKEVALAQKQASGLRSRYQETD 1722
+ + L+E + + ++ R + E LE E ++ A+ +E
Sbjct: 513 RLEERREDLEELIAERRET-IEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAR 571
Query: 1723 NKLLSKAESSGLKRARGQMLLNKASQLSV--NTTAKLKQLNETEAMFNSQESE----LTE 1776
++ + R + L + L+ + ++++L E E L E
Sbjct: 572 EEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAE 631
Query: 1777 LSKNIAEL 1784
+ EL
Sbjct: 632 KRERKREL 639
Score = 43.5 bits (103), Expect = 0.001
Identities = 102/555 (18%), Positives = 190/555 (34%), Gaps = 135/555 (24%)
Query: 1215 EFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRI 1274
E E LE +IED++ + T R +L + D+++++ E+
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELA---EEVRDLRERLEELEEERDDL---------- 298
Query: 1275 NLARLALNDLTEEI---------KKLNKTGEMLKENATLLQE--NNVEGALELTRQAHDT 1323
LA L+D E + + + L+E Q E E +
Sbjct: 299 -LAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEER 357
Query: 1324 SLKLKEQSTETEKQINDAERQC-KRTENL---------VTKTFP----QFTQGQEENEKA 1369
+ +L+E++ E E ++ +A R E + + + F ++ E+
Sbjct: 358 AEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEEL 417
Query: 1370 LQ---RLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCG----------GMWC 1416
+ L E+ LE L E + + L C +CG +
Sbjct: 418 REERDELREREAELEATLRTARERVEEAE-----ALLEAGKCPECGQPVEGSPHVETIEE 472
Query: 1417 SNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKE 1476
+ E + + E + + E+ +AE+L + ++ E ++LI E+
Sbjct: 473 DRERVEELEAELEDLEEEVEEVEERLERAEDLVE---AEDRIERLEERREDLEELIAERR 529
Query: 1477 AKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPG 1536
EE + A L + A + A K A + + +E
Sbjct: 530 ETIEE----------KRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREE------ 573
Query: 1537 ATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLK 1596
+A++ S LA K + E+IR I + +I+ L E LA+ ND +
Sbjct: 574 --VAELN--SKLAELKERIESLERIRTLLAAIADAED---EIER-LREKREALAELNDER 625
Query: 1597 RKANLTKAGADSTKNTVQKIVDVLTEART-----AQDMAEVAIQTAKDDISAARKDLSQI 1651
R + L E R + E I+ A++D A + L Q+
Sbjct: 626 R--------------------ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQV 665
Query: 1652 TNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE----KEVALA 1707
LD+ +++ ++ LQ + + VENE + LE + AL
Sbjct: 666 EEKLDELREERDD-------LQAEIGA--------------VENELEELEELRERREALE 704
Query: 1708 QKQASGLRSRYQETD 1722
+ L + Y E +
Sbjct: 705 NRV-EALEALYDEAE 718
Score = 42.3 bits (100), Expect = 0.003
Identities = 86/437 (19%), Positives = 158/437 (36%), Gaps = 107/437 (24%)
Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDP-NGVGKTLGNTTQRINLA 1277
LE ++E+ + VE R I++++++I + LGN +
Sbjct: 368 LESELEEAREAVE--DRRE--------EIEELEEEIEELRERFGDAPVDLGNAEDFLEEL 417
Query: 1278 RLALNDLTEEIKKLNKT----GEMLKENATLLQENN-------VEGA--LELTRQAHDTS 1324
R ++L E +L T E ++E LL+ VEG+ +E + +
Sbjct: 418 REERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERV 477
Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNL 1384
+L+ + + E+++ + E + +R E+LV E E ++RL E+
Sbjct: 478 EELEAELEDLEEEVEEVEERLERAEDLV------------EAEDRIERLEER-------R 518
Query: 1385 PDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
DL ELI ++R ET I EK +
Sbjct: 519 EDLEELIAERR------------------------------------ET----IEEKRER 538
Query: 1445 AEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAY 1504
AEELR+ + + AE ++ E E +AEE + + A E+
Sbjct: 539 AEELRE------RAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSK-LAELKERIESL 591
Query: 1505 DHAASVRNKTAAYVASTSNIT---KQLDEFLNAPGATLADIRN-ISGLA--VNKNIQSNP 1558
+ ++ A + + L E + LA+ R L ++
Sbjct: 592 ERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEA 651
Query: 1559 EQIRERANE-INNVIKSLTDIDTILTETAGDLAKA-----NDLKRKANLTKAGADSTKNT 1612
+ +ERA E + V + L + L E DL N+L+ L + ++ +N
Sbjct: 652 REDKERAEEYLEQVEEKLDE----LREERDDLQAEIGAVENELEELEEL-RERREALENR 706
Query: 1613 VQKIVDVLTEARTAQDM 1629
V+ + + EA + M
Sbjct: 707 VEALEALYDEAEELESM 723
Score = 40.0 bits (94), Expect = 0.013
Identities = 43/229 (18%), Positives = 93/229 (40%), Gaps = 23/229 (10%)
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
QI E+ E ++ + L +++ L E ++ + + + +A T+ AD +++
Sbjct: 194 AQIEEK--EEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADE-------VLE 244
Query: 1619 VLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKS 1678
E R + E I+ ++ I+ ++ ++ ++ D +++ E L+E
Sbjct: 245 EHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEE-------LEEERDD 297
Query: 1679 IQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRAR 1738
+ A + A VE + LE + + R Q + + S E + R
Sbjct: 298 LLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEER 357
Query: 1739 GQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
+ L +A++L +L E + E+ EL + I EL++R
Sbjct: 358 AEELREEAAELE-------SELEEAREAVEDRREEIEELEEEIEELRER 399
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
chromosome partitioning].
Length = 420
Score = 45.1 bits (107), Expect = 3e-04
Identities = 36/213 (16%), Positives = 81/213 (38%), Gaps = 27/213 (12%)
Query: 1602 TKAGADSTKNTVQKIVDV---LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
A K ++I + + E + + E +++ + +I++ L + +DL
Sbjct: 33 AAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKL 92
Query: 1659 QQKSNETNVKVKLL------QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQAS 1712
+++ + N ++ L Q R + Q LQ RS + ++ + + A
Sbjct: 93 RKQIADLNARLNALEVQEREQRRRLAEQLAALQ--RSGRNPPPALLVSPEDAQRSVRLAI 150
Query: 1713 GLR----SRYQETDN-----KLLSK------AESSGLKRARGQMLLNKASQLSVNTTAKL 1757
+R + D K L+ AE + L + + ++L+ +
Sbjct: 151 YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE-QRAQQAKLAQLLEERK 209
Query: 1758 KQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
K L + + ++ + +L EL N + L+ I S
Sbjct: 210 KTLAQLNSELSADQKKLEELRANESRLKNEIAS 242
Score = 36.6 bits (85), Expect = 0.10
Identities = 48/268 (17%), Positives = 89/268 (33%), Gaps = 60/268 (22%)
Query: 1549 AVNKNIQSNPEQIRERANEINNVIKSL-TDIDTI---LTETAGDLAKAND---------- 1594
A+ K I+ + +++ ++ +KSL T+I ++ L ETA DL K
Sbjct: 49 ALEKKIR----EQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLN 104
Query: 1595 ------LKRKANLTK-AGAD--STKNTVQKIV----DVLTEARTAQDMAEV--AIQTAKD 1639
+++ L + A S +N ++ D R A + A D
Sbjct: 105 ALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERID 164
Query: 1640 DISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKN 1699
+ A K L+ + ++ A +++ L S Q Q + A +E +K
Sbjct: 165 ALKATLKQLAAVRAEI--AAEQAE---------LTTLLSEQRA--QQAKLAQLLEERKKT 211
Query: 1700 ---LEKEVALAQK-------QASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQL 1749
L E++ QK S L++ + E++ A +
Sbjct: 212 LAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEA-- 269
Query: 1750 SVNTTAKLKQLNETEAMFNSQESELTEL 1777
T + + E M S L
Sbjct: 270 --KRTGETYKPTAPEKMLISSTGGFGAL 295
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 44.3 bits (105), Expect = 3e-04
Identities = 39/266 (14%), Positives = 89/266 (33%), Gaps = 62/266 (23%)
Query: 1552 KNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKN 1611
K ++ E+++E+ +E+ L K ++L K
Sbjct: 16 KQLKEEIEELKEKRDELRKEASELA-------------EKRDELNAK------------- 49
Query: 1612 TVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL 1671
V+++ + E R +D +Q K+ L ++ + + ++K NE N+ +
Sbjct: 50 -VRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRS 108
Query: 1672 LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAES 1731
++ + I+ R + E+E L QK LR ++ L +
Sbjct: 109 IKSLEREIE-------RLEKKQQTSVLTPEEERELVQK-IKELRKELEDAKKALEENEKL 160
Query: 1732 SGLK------RARGQMLLNKASQLSVNTTAKLKQLNET---------------------E 1764
LK + + + + K +L+ +++ +
Sbjct: 161 KELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELS 220
Query: 1765 AMFNSQESELTELSKNIAELQKRIQS 1790
+ E L + EL+K+I++
Sbjct: 221 KKIDELHEEFRNLQNELRELEKKIKA 246
Score = 44.3 bits (105), Expect = 4e-04
Identities = 53/279 (18%), Positives = 101/279 (36%), Gaps = 43/279 (15%)
Query: 1435 KKLINEKESKAEELRKHMIILSE-----SNSAKDYAETSKKLINEKEAKAEEVFRGITTA 1489
K+ I E + K +ELRK L+E + ++ E +++L E++ EEV
Sbjct: 19 KEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEV----QEL 74
Query: 1490 KQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA 1549
K++ N +E ++ ++ +EF + + R I L
Sbjct: 75 KEKRDEINAKLQELRKEYRELK--------------EKRNEFNLGGRSIKSLEREIERLE 120
Query: 1550 -VNKNIQSNPEQIRERANEINNVIKSLTDIDTILT---ETAGDLAKANDLKRKANLTKAG 1605
+ PE+ RE +I + K L D L + A+ ++LK+KA
Sbjct: 121 KKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180
Query: 1606 ADSTKNTVQKIVDVLTEA-------RTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
N Q+ + + + R D I ++ + N+L +
Sbjct: 181 IQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELREL 240
Query: 1659 QQK-----SNETNVKVK----LLQERLKSIQAGFLQNGR 1688
++K + E K + L+ER + I F + +
Sbjct: 241 EKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEK 279
Score = 31.6 bits (72), Expect = 3.7
Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 12/116 (10%)
Query: 1684 LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLL 1743
L+ R + E EE +++ +K+AS L + E + K+ E + R +
Sbjct: 11 LELKRKQLKEEIEELKEKRDEL--RKEASELAEKRDELNAKVRELREKAQELREERDEIN 68
Query: 1744 NKASQL-----SVNTTA-----KLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
+ +L +N + ++L E FN + L + I L+K+ Q
Sbjct: 69 EEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQ 124
>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis
sensory transducer). Thought to undergo reversible
methylation in response to attractants or repellants
during bacterial chemotaxis.
Length = 262
Score = 44.2 bits (105), Expect = 3e-04
Identities = 48/264 (18%), Positives = 95/264 (35%), Gaps = 51/264 (19%)
Query: 1436 KLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHA 1495
+ + E + AEE E + E I E A A+E+ +A + +
Sbjct: 4 EAVEEIAAGAEEQA------EELEELAERMEELSASIEEVAANADEIAATAQSAAEAAEE 57
Query: 1496 ANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISG----LAVN 1551
++A +R V + ++L+E + G ++ I +I+ LA+N
Sbjct: 58 GREAVEDAITAMDQIREV----VEEAVSAVEELEESSDEIGEIVSVIDDIADQTNLLALN 113
Query: 1552 KNI-------------------QSNPEQIRERANEINNVIK--------SLTDIDTILTE 1584
I + E+ E A EI ++IK ++ ++ +E
Sbjct: 114 AAIEAARAGEAGRGFAVVADEVRKLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSE 173
Query: 1585 TAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTE--------ARTAQDMAEVAIQT 1636
+ + + + VQ+I E ++A+V +T
Sbjct: 174 VEEGVELVEETGDALEEIVDSVEEIADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQET 233
Query: 1637 AK--DDISAARKDLSQITNDLDDA 1658
A ++ISAA ++LS + +LD+
Sbjct: 234 AAMSEEISAAAEELSGLAEELDEL 257
Score = 36.9 bits (86), Expect = 0.071
Identities = 41/217 (18%), Positives = 75/217 (34%), Gaps = 38/217 (17%)
Query: 1606 ADSTKNTVQKIVDVLTE-ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE 1664
+ ++++ + E A A ++A A A+ R+ + +D ++ E
Sbjct: 20 LEELAERMEELSASIEEVAANADEIAATAQSAAEA-AEEGREAVEDAITAMDQIREVVEE 78
Query: 1665 TNVKVKLLQERLKSIQ--------------------------AGFLQNGRSAMDVENEEK 1698
V+ L+E I AG + GR V +E +
Sbjct: 79 AVSAVEELEESSDEIGEIVSVIDDIADQTNLLALNAAIEAARAG--EAGRGFAVVADEVR 136
Query: 1699 NLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQL------SVN 1752
L + A + K+ L QE N+ ++ E S + G L+ + SV
Sbjct: 137 KLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGDALEEIVDSVE 196
Query: 1753 TTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
A L Q A + Q + E++ I E+ + Q
Sbjct: 197 EIADLVQ-EIAAAT-DEQAAGSEEVNAAIDEIAQVTQ 231
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in the
lipid bilayer. The MscS proteins are heptamers of three
transmembrane subunits with seven converging M3 domains,
and this MscS_porin is towards the N-terminal of the
molecules. The high concentration of negative charges at
the extracellular entrance of the pore helps select the
cations for efflux.
Length = 239
Score = 43.8 bits (104), Expect = 4e-04
Identities = 41/216 (18%), Positives = 81/216 (37%), Gaps = 42/216 (19%)
Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTK 1610
K I P+++RE E+ + K+ + L + L+++ T + +
Sbjct: 44 QKQIDDAPKELRELRQELEALKKTDAPVFPELANL-----SLSQLEQRLAQTLSQLQELQ 98
Query: 1611 NTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVK 1670
+Q+ L E +T + A+ +S AR+ L +I N L
Sbjct: 99 EQLQQENSQLIELQTR-------PERAQQQLSEARRRLQEIRNRLQALSPGGTP------ 145
Query: 1671 LLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA-----LAQKQASGLRSRYQETD--- 1722
L Q + +QA +A+ + EE LE L + Q L+ R + +
Sbjct: 146 LAQAQRTLLQA-----ELAALKAQIEELELELLSNNNRQELLRLQRDLLKKRIERLEAEL 200
Query: 1723 --------NKLLSKAESSGLKRARGQMLLNKASQLS 1750
K L+++E + ++ A + L +++ L
Sbjct: 201 QALQNAINRKRLAESEQA-VEEA--ERLAEQSAGLP 233
Score = 41.5 bits (98), Expect = 0.002
Identities = 43/215 (20%), Positives = 79/215 (36%), Gaps = 32/215 (14%)
Query: 1593 NDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQIT 1652
N L L K + + + ++ L A + + I +K +K +
Sbjct: 1 NQLD---QLNKRKLPTPEQ--KALIQDL---EQALNFLD-EIDKSKQKADQYQKQIDDAP 51
Query: 1653 NDLDDAQQ-----KSNETNVKVKL-------LQERLKSIQAGFLQNGRSAMDVENEEKNL 1700
+L + +Q K + V +L L++RL + + ++ L
Sbjct: 52 KELRELRQELEALKKTDAPVFPELANLSLSQLEQRLAQTLSQLQELQEQLQQENSQLIEL 111
Query: 1701 EKEVALAQKQASGLRSRYQETDNKLLS-KAESSGLKRARGQMLLNKASQLSVNTTAKLKQ 1759
+ AQ+Q S R R QE N+L + + L +A+ + LL A++ +
Sbjct: 112 QTRPERAQQQLSEARRRLQEIRNRLQALSPGGTPLAQAQ-RTLLQAELAA---LKAQI-E 166
Query: 1760 LNETEAMFNSQESELT-----ELSKNIAELQKRIQ 1789
E E + N+ EL L K I L+ +Q
Sbjct: 167 ELELELLSNNNRQELLRLQRDLLKKRIERLEAELQ 201
Score = 31.1 bits (71), Expect = 4.7
Identities = 20/109 (18%), Positives = 41/109 (37%), Gaps = 17/109 (15%)
Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS 1750
+D N+ K E + ++ N L + + S K + Q ++ A +
Sbjct: 3 LDQLNKRKLPTPEQKALIQD-------LEQALN-FLDEIDKSKQKADQYQKQIDDAPKEL 54
Query: 1751 VNTTAKLKQLNETEAM----FNSQ-----ESELTELSKNIAELQKRIQS 1790
+L+ L +T+A + E L + + ELQ+++Q
Sbjct: 55 RELRQELEALKKTDAPVFPELANLSLSQLEQRLAQTLSQLQELQEQLQQ 103
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 45.2 bits (107), Expect = 4e-04
Identities = 116/660 (17%), Positives = 212/660 (32%), Gaps = 132/660 (20%)
Query: 1180 DNWDSILRGLADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVE-------N 1232
+ + + + ++ NIK TG +++E + V + N
Sbjct: 209 NKKEDYDKKIKKINNEIRKLLKNIKCTGNMCKTDTYVDMVELYLLRVNEVPSNNYDNYLN 268
Query: 1233 TTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVG---KTLGNTTQRINLARLALNDLTEEIK 1289
+ + I+ +KK++ +N+ + + +G+ +R N +K
Sbjct: 269 RAKELLESGS--DLINKIKKELGDNKTIYSINFIQEEIGDIIKRYNF----------HLK 316
Query: 1290 KLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTE 1349
K+ K KE +Q NN+ ++ LK++ E+ K + +
Sbjct: 317 KIEKG----KEYIKRIQNNNIPPQVKKDE--------LKKKYFESAKHYASFKFSLEMLS 364
Query: 1350 NLVTKTFPQFTQGQEENEKALQRLNEKI-----TSLEKNLPDLNELICDKR---GDPCDN 1401
+ EK L L K+ +E L D I
Sbjct: 365 --------MLDSLLIKKEKILNNLFNKLFGDLKEKIETLL-DSEYFISKYTNIISLSEHT 415
Query: 1402 LCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSA 1461
L K S DYA S I E + K +E + N
Sbjct: 416 LKAAEDVLK-----------ENSQKIADYALYSNLEIIEIKKKYDE---------KINEL 455
Query: 1462 KDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVAST 1521
K K LI+ ++ + I K++ D S K +S
Sbjct: 456 KKSINQLKTLISIMKSF---------------YDLIISEKDSMD---SKEKK----ESSD 493
Query: 1522 SNITKQLDEFLNAPGATLADIRNISGLAVN--KNIQSNPEQIRERANEINNVIKSLTDID 1579
SN +++DE L + I+ + + N KNI+ I NEI +I+ +
Sbjct: 494 SNYQEKVDELLQ----IINSIKEKNNIVNNNFKNIEDYYITIEGLKNEIEGLIELIKY-- 547
Query: 1580 TILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKI---VDVLTEARTAQDMAEVAIQT 1636
L L + L ++ + KN ++ I VD + + + D + IQ
Sbjct: 548 --------YLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQQ 599
Query: 1637 AKDDISAARKDLSQITNDLDDAQQK--SNETNVKVKLLQERLKSI------QAGFLQNGR 1688
++ I+ A + + N+ +D Q+K LQE L + +
Sbjct: 600 IEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHEAK 659
Query: 1689 SAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
S D++ + E + S L K E L + ++ Q
Sbjct: 660 SKEDLQTLLNTSKNEYEKLEFMKS---DNIDNIIKNL--KKELQNLLSLKENII---KKQ 711
Query: 1749 LSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN----FIVDKSNNYKN 1804
L+ LN+ +N +S + E + +L+ IN FI+ N K+
Sbjct: 712 LNNIEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKD 771
Score = 42.9 bits (101), Expect = 0.002
Identities = 114/673 (16%), Positives = 237/673 (35%), Gaps = 126/673 (18%)
Query: 1194 TKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQ 1253
++ T+A +K E + ++ V + ++ + ++ + L I +M +
Sbjct: 1234 EELTTEAKGLKGEANRSTNVDELKEIKL---QVFSYLQQVIKENNKMENALHEIKNMYEF 1290
Query: 1254 IMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKE-NATLLQ----EN 1308
+++ D + K + N+T++ A + K+L KT ++K+ A + Q +N
Sbjct: 1291 LISI-DSEKILKEILNSTKK------AEEFSNDAKKELEKTDNLIKQVEAKIEQAKEHKN 1343
Query: 1309 NVEGALELTRQAHDTSL--KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN 1366
+ G+LE + + ++KE+ + K+IN ++ +
Sbjct: 1344 KIYGSLEDKQIDDEIKKIEQIKEEISNKRKEIN----------KYLSNIKSNKEKCDLHV 1393
Query: 1367 EKALQRLNEKITSLEKNLP----DLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLS 1422
A R +KI L K+ + E+ K D + KC +
Sbjct: 1394 RNA-SRGKDKIDFLNKHEAIEPSNSKEVNIIKITDNIN---------KC------KQYSN 1437
Query: 1423 ESNSAKDYAETSKKLINEKESKAEELRKHMIIL----------SESNSAKDYAETSKKLI 1472
E+ ++ A+ + I + E + + + IL E+ + D +I
Sbjct: 1438 EAMETENKADENNDSIIKYEKEITNILNNSSILGKKTKLEKKKKEATNIMDDINGEHSII 1497
Query: 1473 NEKEAKAEEVFRGITTAKQESHAANILA----KEAYDHAASVRNKTAAYVASTSNITKQL 1528
K K+ E + ++L AY+ + + + NI ++
Sbjct: 1498 KTKLTKSSEKLNQLNEQPNIKREGDVLNNDKSTIAYETIQYNLGRVKHNLLNILNIKDEI 1557
Query: 1529 DEFLNAPGATLADIRNISGLAVNKN-----------------IQSNPEQIRERANEINNV 1571
+ LN + DI IS + NKN I + + ++N +
Sbjct: 1558 ETILNKAQDLMRDISKISKIVENKNLENLNDKEADYVKYLDNILKEKQLMEAEYKKLNEI 1617
Query: 1572 IKSLTDIDTILTETAG--DLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDM 1629
+ +I+ L + ++ + K DSTK ++ +V+ + +
Sbjct: 1618 YSDVDNIEKELKKHKKNYEIGLLEKVIEINKNIKLYMDSTKESLNSLVNNFSSLFNNFYL 1677
Query: 1630 AEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS 1689
+ I +++ +K L++I NE ++QE++K + S
Sbjct: 1678 NKYNI---NENLEKYKKKLNEI----------YNEFMESYNIIQEKMKEV---------S 1715
Query: 1690 AMDVE-NEEKNLEKEVALAQKQASGLRSRYQETDNKL--LSKAESSGLKRARGQMLLN-- 1744
DV+ NE K L +E AQK+ L ++ +E L + K ES + L
Sbjct: 1716 NDDVDYNEAKTLREE---AQKEEVNLNNKEEEAKKYLNDIKKQESFRFILYMKEKLDELS 1772
Query: 1745 ---KASQLSVNTTAK-LKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINF------ 1794
K V+ +K+ E N + + L + K + + ++
Sbjct: 1773 KMCKQQYNIVDEGYNYIKKKIEYIKTLNDENNLSDSL-NQAEDKNKEVANLTHYTNKNEA 1831
Query: 1795 -----IVDKSNNY 1802
V KS N+
Sbjct: 1832 KNLLGHVVKSANF 1844
Score = 39.0 bits (91), Expect = 0.027
Identities = 120/619 (19%), Positives = 220/619 (35%), Gaps = 120/619 (19%)
Query: 1195 KVITDASNIKQTG---ASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMK 1251
+ + NIK+ G + T +E ++ + VK + N ++ +L D+
Sbjct: 1509 NQLNEQPNIKREGDVLNNDKSTIAYETIQYNLGRVKHNLLNILNIKDEIETILNKAQDLM 1568
Query: 1252 KQIMNNEDPNGVGKTLGNTTQRINLARLA--LNDLTEEIKKLNKTGEMLKENATLLQE-- 1307
+ I + + NL L D + + + K ++++ L E
Sbjct: 1569 RDI----------SKISKIVENKNLENLNDKEADYVKYLDNILKEKQLMEAEYKKLNEIY 1618
Query: 1308 ---NNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQE 1364
+N+E L+ ++ ++ L EK I + +
Sbjct: 1619 SDVDNIEKELKKHKKNYEIGL--------LEKVIEINKNIKLYMD--------------- 1655
Query: 1365 ENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSES 1424
+++L L +SL N LN+ + +NL +
Sbjct: 1656 STKESLNSLVNNFSSLFNNF-YLNKY------NINENL---------------EKYKKKL 1693
Query: 1425 NSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFR 1484
N + S +I EK + + +E+ + ++ A+ + +N KE +A++
Sbjct: 1694 NEIYNEFMESYNIIQEKMKEVSN---DDVDYNEAKTLREEAQKEEVNLNNKEEEAKKYLN 1750
Query: 1485 GITTAKQESHAANILAKEAYDHAASVRNKTAAYVA--STSNITKQLDEFLNAPGATLADI 1542
I KQES + KE D S K + + I K+++ TL D
Sbjct: 1751 DIK--KQESFRFILYMKEKLDEL-SKMCKQQYNIVDEGYNYIKKKIEYI-----KTLNDE 1802
Query: 1543 RNISGL---AVNKNIQSNPEQIRERANEINN----VIKSLTDID-TILTETAG-DLAKAN 1593
N+S A +KN + NE N V+KS I I+T +L
Sbjct: 1803 NNLSDSLNQAEDKNKEVANLTHYTNKNEAKNLLGHVVKSANFIGIKIMTGLQPTELTPDA 1862
Query: 1594 DLKRKANLT---KAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAK--DDISAARKDL 1648
L+ LT + +D + + + L + QD + ++Q K DDI ++D
Sbjct: 1863 SLETAPELTFESENNSDLELDHLSSNKNELDVYKNIQDAYKSSLQILKYSDDIDKKQRDC 1922
Query: 1649 SQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQ 1708
+++ D NE +K + E I V+N+E + ++
Sbjct: 1923 NKLVED-------GNEIYLKSTAINELKNMIN-----------SVKNKESAISNKIDNVS 1964
Query: 1709 KQASGLRSRYQETDNK---LLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEA 1765
+ S L ++ D +L K E LK R KA T +LN+ +
Sbjct: 1965 NKLSEL-NKITCNDESYDEILEKEEYEELKDLRNSFNQEKA------ETLNNLKLNKIKE 2017
Query: 1766 MFNSQESELTELSKNIAEL 1784
FNS ++ L EL K++ L
Sbjct: 2018 DFNSYKNLLDELEKSVKTL 2036
Score = 39.0 bits (91), Expect = 0.027
Identities = 78/515 (15%), Positives = 176/515 (34%), Gaps = 92/515 (17%)
Query: 1230 VENTTRSSHDLTVMLTT----IDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLT 1285
++ + S +L+ +T I+++KK+I+ + NT
Sbjct: 2152 IDKANKLSSELSEAVTNSEEIIENIKKEIIEINE---------NT--------------- 2187
Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTS--LKLKEQSTETEKQINDAER 1343
E+ L T + LKE N++ + + +KL+E +EK + ++
Sbjct: 2188 -EMNTLENTADKLKELY-----ENLKKKKNIINNIYKKINFIKLQEIENSSEKYNDISK- 2240
Query: 1344 QCKRTENLVTKTFPQFTQGQE--ENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDN 1401
N V +T Q ++ +N+ + + +KI EK L +++ + +
Sbjct: 2241 ----LFNNVVET-----QKKKLLDNKNKINNIKDKINDKEKELINVDSSFTLESIKTFNE 2291
Query: 1402 LCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSES--N 1459
+ S E + +K E H++ + N
Sbjct: 2292 IYDD--------------IKSNIGDLYKL-EDTNNDELKKVKLYIENITHLLNRINTLIN 2336
Query: 1460 SAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVA 1519
+Y + + E E + + N L +E ++ K +
Sbjct: 2337 DLDNYQDENYGKDKNIELNNENN----SYIIKTKEKINNLKEEFSKLLKNI--KRNNTLC 2390
Query: 1520 STSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTD-- 1577
+ +NI + ++ I+ + N + QI E NEI N++
Sbjct: 2391 NNNNIKDFISNIGK----SVETIK--QRFSSNLPEKEKLHQIEENLNEIKNIMNETKRIS 2444
Query: 1578 -----IDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
+ IL + + K N+ A ++ + +KI L A
Sbjct: 2445 NVDAFTNKILQDIDNEKNKENNN-MNAEKIDDLIENVTSHNEKIKSELLIINDALR---- 2499
Query: 1633 AIQTAKDDISAARKDLSQITNDLDD-AQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM 1691
++ KD+++ L++ N+ ++ A+ + + + L+ + +
Sbjct: 2500 RVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLNELLSYIDNEIK 2559
Query: 1692 DVENEEKNLEK--EVALAQKQASGLRSRYQETDNK 1724
++ENE+ L + ++ ++K+ + S QE +
Sbjct: 2560 ELENEKLKLLEKAKIEESRKERERIESETQEDNTD 2594
Score = 34.8 bits (80), Expect = 0.53
Identities = 71/417 (17%), Positives = 150/417 (35%), Gaps = 70/417 (16%)
Query: 1432 ETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAK- 1490
K+ N ++ +EL K + L S + K E K I+ E++ + I +
Sbjct: 2013 NKIKEDFNSYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESINP 2072
Query: 1491 ------QESHAANI---------LAKEAYDHAASVR---NKTAAYVA-------STSNIT 1525
++ H I + + + + ++ K Y+A S
Sbjct: 2073 SLDELLKKGHKIEISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIKNNYNSIKKDI 2132
Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSL------TDID 1579
L+E+ + + + NI NK E + I N+ K + T+++
Sbjct: 2133 STLNEYFDEKQVSNYILTNID--KANKLSSELSEAVTNSEEIIENIKKEIIEINENTEMN 2190
Query: 1580 TILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDV-LTEARTAQ----DMAEV-- 1632
T+ +LK+K N N +KI + L E + D++++
Sbjct: 2191 TLENTADKLKELYENLKKKKN-------IINNIYKKINFIKLQEIENSSEKYNDISKLFN 2243
Query: 1633 -AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ-ERLKSIQAGFLQNGRSA 1690
++T K + + ++ I + ++D +++ + L + I N
Sbjct: 2244 NVVETQKKKLLDNKNKINNIKDKINDKEKELINVDSSFTLESIKTFNEIYDDIKSNIGDL 2303
Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS 1750
+E+ + K+V L + + L +R N L + + + K +K +L+
Sbjct: 2304 YKLEDTNNDELKKVKLYIENITHLLNRINTLINDLDNYQDENYGK--------DKNIELN 2355
Query: 1751 VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCVS 1807
+ + + E N+ + E ++L KNI N + +NN K+ +S
Sbjct: 2356 NENNSYIIKTKEK---INNLKEEFSKLLKNIKR---------NNTLCNNNNIKDFIS 2400
Score = 33.3 bits (76), Expect = 1.6
Identities = 109/599 (18%), Positives = 202/599 (33%), Gaps = 119/599 (19%)
Query: 1247 IDDMKKQIMN-NEDPNGVGKTLGNTTQRINLARLALNDLTEE----IKKLN------KTG 1295
+D++ K I N N++ N + KTL R N + + L KL T
Sbjct: 865 VDNIIKDIENMNKNINII-KTLNIAINRSNSNKQLVEHLLNNKIDLKNKLEQHMKIINTD 923
Query: 1296 EMLKENATLLQENNV-EGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTK 1354
++++N L NN+ + ++ +Q DT K+ + EK + ++ K N
Sbjct: 924 NIIQKNEKLNLLNNLNKEKEKIEKQLSDT--KINNLKMQIEKTLEYYDKS-KENINGNDG 980
Query: 1355 TFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGM 1414
T + ++ + + +I L N LN+ I D D
Sbjct: 981 THLE---KLDKEKDEWEHFKSEIDKLNVNYNILNKKIDDLIKKQHD-------------- 1023
Query: 1415 WCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSE------------SNSAK 1462
D E KLI EK + EE I L E + K
Sbjct: 1024 --------------DIIELIDKLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKK 1069
Query: 1463 DYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTS 1522
K+ I E K E + + I E + ++NK+ +V +
Sbjct: 1070 YKNPKIKEEIKLLEEKVEALLKKI--------------DENKNKLIEIKNKSHEHVVNAD 1115
Query: 1523 NITKQLDEFLNAPGATLADI-RNISGLAVNKNIQS--NPEQIRERANEINNVIKSLTD-- 1577
+ E N +L I + + K ++ N NE+N I+ +
Sbjct: 1116 KEKNKQTEHYNKKKKSLEKIYKQME-----KTLKELENMNLEDITLNEVNE-IEIEYERI 1169
Query: 1578 -IDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT 1636
ID I+ + + K+ + + K D K + K +
Sbjct: 1170 LIDHIVEQINNEAKKSKTIMEEIESYKKDIDQVKKNMSK--------ERNDHLTTFEYNA 1221
Query: 1637 AKDDISAARKDLSQITNDLDDAQQKSN-ETNVKVKLLQERLKSIQAGFLQNGRSAMDVEN 1695
D +A+ +++ ++T + + ++N TNV LK I+ + + N
Sbjct: 1222 YYDKATASYENIEELTTEAKGLKGEANRSTNVD------ELKEIKLQVFSYLQQVIKENN 1275
Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRAR-GQMLLNKASQLSVNTT 1754
+ +N E+ + L S E K + + + + + L K L
Sbjct: 1276 KMENALHEI---KNMYEFLISIDSEKILKEILNSTKKAEEFSNDAKKELEKTDNLIKQVE 1332
Query: 1755 AKLKQLNETE-AMFNSQESE--------LTELSKNIAELQKRIQSCINFIVDKSNNYKN 1804
AK++Q E + ++ S E + + ++ + I+ +K I + SN N
Sbjct: 1333 AKIEQAKEHKNKIYGSLEDKQIDDEIKKIEQIKEEISNKRKEINKYL------SNIKSN 1385
Score = 32.5 bits (74), Expect = 2.7
Identities = 115/630 (18%), Positives = 212/630 (33%), Gaps = 106/630 (16%)
Query: 1212 YTKEFELLEKKIEDVKALVENTTRSSHDLTVMLT-TIDDMKKQIMNNEDPNGVGKTLGNT 1270
Y ++ L+K I +K L+ + +S +DL + ++D +K+ ++ + L
Sbjct: 448 YDEKINELKKSINQLKTLI-SIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQI 506
Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
I +N+ + I+ T E LK N +EG +EL + + L +
Sbjct: 507 INSIKEKNNIVNNNFKNIEDYYITIEGLK--------NEIEGLIELIKYYLQSIETLIKD 558
Query: 1331 STETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
ND + + K E V K + LN++I ++ + + +L
Sbjct: 559 EKLKRSMKNDIKNKIKYIEENVDHI------------KDIISLNDEIDNIIQQIEELINE 606
Query: 1391 ICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKDYAETSKKLINEKESKAEELR 1449
+ + K G L E + + + K L +E +SK +
Sbjct: 607 ALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHEAKSKEDLQT 666
Query: 1450 KHMIILSE------------SNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAAN 1497
+E N K+ + + L++ KE ++ I S N
Sbjct: 667 LLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNNIEQDISNS--LN 724
Query: 1498 ILAKEAYDHAASVRNKTAAYVASTS-------NITKQLDEF---LNAPGATLADIRNISG 1547
+ D +++ Y I + +EF L L D +N
Sbjct: 725 QYTIKYND----LKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTY- 779
Query: 1548 LAVNKNIQSNPEQIRERANEINNVIKSLTD-IDTILTETAGDLAKANDL--KRKANLTKA 1604
+ + I + N+I+N I L + L N L K +A+ K
Sbjct: 780 ----EEFLQYKDTILNKENKISNDINILKENKKNNQDL----LNSYNILIQKLEAHTEKN 831
Query: 1605 GADSTKNTVQKIVDVLTEARTAQDMAEV--AIQTAKD---DISAARKDLSQI-------- 1651
K +QK + E Q + DI K+++ I
Sbjct: 832 DE-ELKQLLQKFPTEDENLNLKELEKEFNENNQIVDNIIKDIENMNKNINIIKTLNIAIN 890
Query: 1652 ---TNDLDDAQQKSNETNVKVKLLQERLKSIQA-GFLQNGRSAMDVENEEKNLEKEVALA 1707
+N +N+ ++K KL Q +K I +Q + N NL KE
Sbjct: 891 RSNSNKQLVEHLLNNKIDLKNKLEQ-HMKIINTDNIIQKN----EKLNLLNNLNKEKEKI 945
Query: 1708 QKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAK----LKQLNET 1763
+KQ LS + + LK + + L + N L++L++
Sbjct: 946 EKQ---------------LSDTKINNLK-MQIEKTLEYYDKSKENINGNDGTHLEKLDKE 989
Query: 1764 EAMFNSQESELTELSKNIAELQKRIQSCIN 1793
+ + +SE+ +L+ N L K+I I
Sbjct: 990 KDEWEHFKSEIDKLNVNYNILNKKIDDLIK 1019
Score = 31.0 bits (70), Expect = 8.8
Identities = 74/542 (13%), Positives = 177/542 (32%), Gaps = 101/542 (18%)
Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLN----KTGEMLKENATLLQENNVEGALELTRQAH 1321
+ + I + +++ ++ + K E+L+ ++ ++NN+ +
Sbjct: 468 IMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNI--VNNNFKNIE 525
Query: 1322 DTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLE 1381
D + ++ E E I + + E L+ ++ + KI +E
Sbjct: 526 DYYITIEGLKNEIEGLIELIKYYLQSIETLIKD--------EKLKRSMKNDIKNKIKYIE 577
Query: 1382 KNLPDLNELI-----CDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKK 1436
+N+ + ++I D + L ++E+ K+ K
Sbjct: 578 ENVDHIKDIISLNDEIDNIIQQIEEL------------------INEALFNKEKFINEKN 619
Query: 1437 LINEK-ESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHA 1495
+ EK + + K + + + K L +E ++K E++ + T+K E
Sbjct: 620 DLQEKVKYILNKFYKGDLQELLDELSHFLDDH-KYLYHEAKSK-EDLQTLLNTSKNEY-- 675
Query: 1496 ANILAKEAYDHAASVRNKTAAYVASTSNITKQL-DEFLNAPGATLADIRNISGLAVNKNI 1554
L D+ ++ + + ++ + + + LN + ++I
Sbjct: 676 -EKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNN---------------IEQDI 719
Query: 1555 QSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQ 1614
++ Q + N++ + I + K + + + K+ +
Sbjct: 720 SNSLNQYTIKYNDLKSSI--------------------EEYKEE----EEKLEVYKHQII 755
Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQE 1674
+ D D+ + + D K N+ + + +L+E
Sbjct: 756 NRKNEFILHLYEND----------KDLPDGKNTYEEFLQYKDTILNKENKISNDINILKE 805
Query: 1675 RLKSIQAGFLQNGRSAMDVENE-EKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSG 1733
K+ Q +E EKN E+ L QK + E +N L + E
Sbjct: 806 NKKNNQDLLNSYNILIQKLEAHTEKNDEELKQLLQKFPT-------EDENLNLKELEKEF 858
Query: 1734 LKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
+ + + K + +K LN NS + + L N +L+ +++ +
Sbjct: 859 NENNQIVDNIIKDIENMNKNINIIKTLNIAINRSNSNKQLVEHLLNNKIDLKNKLEQHMK 918
Query: 1794 FI 1795
I
Sbjct: 919 II 920
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 44.7 bits (105), Expect = 6e-04
Identities = 114/638 (17%), Positives = 248/638 (38%), Gaps = 125/638 (19%)
Query: 1202 NIKQTGASGAYTKEFELLEKKIE-DVKALVENTTRSSHDLTVMLTTIDDMKK---QIMNN 1257
NIK + + L++KI+ +KAL E +S + + + I+D++ + ++N
Sbjct: 1106 NIKYADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISN 1165
Query: 1258 EDPNGVGKTLGNTTQRINLARLALNDLTEEIKKL-NKTGEMLKENATLLQENNVEGALEL 1316
+DP + K + N +I+ + ++ +EIKKL N+ E+ K+ +L + +
Sbjct: 1166 DDPEEIEKKIENIVTKIDKKK----NIYDEIKKLLNEIAEIEKDKTSLEEVKGIN----- 1216
Query: 1317 TRQAHDTSL-KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNE 1375
++ +L KL + + EK K++E++
Sbjct: 1217 --LSYGKNLGKLFLEKIDEEK---------KKSEHM------------------------ 1241
Query: 1376 KITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSA-KDYAETS 1434
I ++E + DL+E+ K P G M T + S+ KD+ S
Sbjct: 1242 -IKAMEAYIEDLDEI---KEKSPEIENEMGIEMDIKAEM----ETFNISHDDDKDHHIIS 1293
Query: 1435 KKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESH 1494
KK + ++R+ + + E S + KK + + A++ I E
Sbjct: 1294 KK----HDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNE-- 1347
Query: 1495 AANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKN- 1553
ANI + + ++ Y K + + L+ + I++ L K+
Sbjct: 1348 IANIYNILKLNKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLEECKSK 1407
Query: 1554 IQSNPEQIRERANEINNVIKSLTDIDT-ILTETAG--------------------DLAKA 1592
I+S + +I+ IK + ++ IL+E + ++ A
Sbjct: 1408 IESTLD-----DKDIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLFKNIEMA 1462
Query: 1593 ND-----LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKD 1647
++ LK K + D N +++ +D + D AI+ K+ +KD
Sbjct: 1463 DNKSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKD 1522
Query: 1648 LSQITNDL------DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM---DVENEEK 1698
++++ N + + ++ + +K +++ K L+ +S +++ E+
Sbjct: 1523 VTELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKF---ILEAEKSEQKIKEIKKEKF 1579
Query: 1699 NLEKEVA---LAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
+E + A + K A ++ + +NK L ++ + K + T +
Sbjct: 1580 RIEDDAAKNDKSNKAAIDIQLSLENFENKFLKISD-----------IKKKINDCLKETES 1628
Query: 1756 KLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
K+++ +SQ++EL E N+ LQ+ ++S +
Sbjct: 1629 IEKKISSFS--IDSQDTELKENGDNLNSLQEFLESLKD 1664
Score = 42.7 bits (100), Expect = 0.002
Identities = 101/495 (20%), Positives = 192/495 (38%), Gaps = 89/495 (17%)
Query: 1220 EKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARL 1279
EKK D K+L+ +S I++ + I + + K NT + I
Sbjct: 889 EKKFNDSKSLINEINKS----------IEEEYQNINTLKKVDEYIKICENTKESIEKFHN 938
Query: 1280 ALNDLTEEIKKLNKTGEMLKENATLLQ--ENNVEGAL-----ELTRQAHDTSLKLKE-QS 1331
N L E LNK + +KE+ + + ++ + L EL + D SL E ++
Sbjct: 939 KQNILKE---ILNKNIDTIKESNLIEKSYKDKFDNTLIDKINELDKAFKDASLNDYEAKN 995
Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
E K ND + NL +E EKA + +KI KN+P++ I
Sbjct: 996 NELIKYFNDLK------ANLGKNKENMLYHQFDEKEKATNDIEQKIEDANKNIPNIEIAI 1049
Query: 1392 CD---KRGDPCDNLCG-GAGCGKCGGMWCSNGTLSESNSAKD-----------------Y 1430
D + G + + ++ N K+ Y
Sbjct: 1050 HTSIYNIIDEIEKEIGKNIELLNKEILEEAEINITNFNEIKEKLKHYNFDDFGKEENIKY 1109
Query: 1431 AETSKKLINEKESKAEELRKHMIILSE-SNSAKDYAETSKKLINEKEAKAEEVFRGITTA 1489
A+ K+ ++ ++ +++ H+ L E +++Y + K IN+ E A++
Sbjct: 1110 ADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPE 1169
Query: 1490 KQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEF--LNAPGATLADIRNISG 1547
+ E NI+ K + K Y I K L+E + +L +++ I+
Sbjct: 1170 EIEKKIENIVTK--------IDKKKNIY----DEIKKLLNEIAEIEKDKTSLEEVKGIN- 1216
Query: 1548 LAVNKNIQS-NPEQIRERANEINNVIKSL----TDIDTILT---ETAGDLAKANDLKRKA 1599
L+ KN+ E+I E + ++IK++ D+D I E ++ D+K +
Sbjct: 1217 LSYGKNLGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEM 1276
Query: 1600 NLTKAGAD-------STKNTVQKIVDVLTEA-RTAQDMAEVAIQTAKDDISAARKDLSQI 1651
D +K + I D+ ++ + +D +E + DI+ +K+L +
Sbjct: 1277 ETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSE------ESDINDIKKELQK- 1329
Query: 1652 TNDLDDAQQKSNETN 1666
+L DAQ+ +++ N
Sbjct: 1330 --NLLDAQKHNSDIN 1342
Score = 41.2 bits (96), Expect = 0.007
Identities = 97/472 (20%), Positives = 182/472 (38%), Gaps = 97/472 (20%)
Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNN-EDPNGVGKTLGNTTQRINLA 1277
+E+KIED + N + H T + ID+++K+I N E N + +
Sbjct: 1031 IEQKIEDANKNIPNIEIAIH--TSIYNIIDEIEKEIGKNIELLNK------EILEEAEIN 1082
Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQ 1337
N++ E++K N +E N++ A E+ + D +K + +
Sbjct: 1083 ITNFNEIKEKLKHYNFD--------DFGKEENIKYADEINKIKDD----IKNLDQKIDHH 1130
Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGD 1397
I E K++EN + + Q ++ +KA+ N+ +EK + ++ I DK+ +
Sbjct: 1131 IKALEEIKKKSENYIDEIKAQINDLEDVADKAIS--NDDPEEIEKKIENIVTKI-DKKKN 1187
Query: 1398 PCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKD-------------YAETSKKL----IN 1439
D + L+E + KD Y + KL I+
Sbjct: 1188 IYDEI---------------KKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFLEKID 1232
Query: 1440 EKESKAEELRKHM--IILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAAN 1497
E++ K+E + K M I + E ++ E + KAE I+ + H +
Sbjct: 1233 EEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDH--H 1290
Query: 1498 ILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSN 1557
I++K+ ++ + +R K+ + S +DI +I K +Q N
Sbjct: 1291 IISKKHDENISDIREKSLKIIEDFSE---------------ESDINDI-----KKELQKN 1330
Query: 1558 PEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV 1617
++ ++IN + + +I IL K N +K+ + K + + I
Sbjct: 1331 LLDAQKHNSDINLYLNEIANIYNIL--------KLNKIKKIIDEVKEYTKEIEENNKNIK 1382
Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKV 1669
D L D +E I+ KDDI+ + S+I + LDD +K
Sbjct: 1383 DEL-------DKSEKLIKKIKDDIN-LEECKSKIESTLDDKDIDECIKKIKE 1426
>gnl|CDD|202204 pfam02321, OEP, Outer membrane efflux protein. The OEP family (Outer
membrane efflux protein) form trimeric channels that
allow export of a variety of substrates in Gram negative
bacteria. Each member of this family is composed of two
repeats. The trimeric channel is composed of a 12
stranded all beta sheet barrel that spans the outer
membrane, and a long all helical barrel that spans the
periplasm.
Length = 186
Score = 42.5 bits (100), Expect = 6e-04
Identities = 20/112 (17%), Positives = 47/112 (41%), Gaps = 7/112 (6%)
Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQE 1674
K + A+ + AE ++ A+ + R +++Q +L A+++ ++L +E
Sbjct: 78 KRRARVKAAKAQLEAAEAELEQARRQL---RLEVAQAYFNLLAAKEQLELAKQALELAEE 134
Query: 1675 RLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLL 1726
L+ +A + S +DV E L + A+ + + + +L
Sbjct: 135 ALELAEARYEAGLISLLDVLQAEVELLE----ARLELLNAEADLELARAQLE 182
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 41.1 bits (97), Expect = 0.001
Identities = 22/137 (16%), Positives = 57/137 (41%), Gaps = 16/137 (11%)
Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
KI D L EA ++ A+ + + ++ AR+ S+I ++ A++++ ++ +
Sbjct: 40 AKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEI---IEQAKKEAE------QIAE 90
Query: 1674 ERLKSIQAGFLQN-GRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS 1732
E + + + ++E E++ +E+ + + + L E KLL K
Sbjct: 91 EIKAEAEEELERIKEAAEAEIEAEKERALEEL---RAEVAELAVAIAE---KLLGKKVDE 144
Query: 1733 GLKRARGQMLLNKASQL 1749
++ + + +
Sbjct: 145 AAQKDLIDAFIAELGEN 161
Score = 37.7 bits (88), Expect = 0.018
Identities = 23/114 (20%), Positives = 46/114 (40%), Gaps = 11/114 (9%)
Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
K L ER I + R + + E+E+ A++QAS + + ++ ++ +
Sbjct: 33 KALDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEI 92
Query: 1730 ESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAE 1783
+ A ++ K + A+ + E E +E+ EL+ IAE
Sbjct: 93 K----AEAEEELERIKEA-------AEAEIEAEKERALEELRAEVAELAVAIAE 135
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene family
are HOOK1, HOOK2 and HOOK3. Different domains have been
identified in the three human HOOK proteins, and it was
demonstrated that the highly conserved NH2-domain
mediates attachment to microtubules, whereas the central
coiled-coil motif mediates homodimerisation and the more
divergent C-terminal domains are involved in binding to
specific organelles (organelle-binding domains). It has
been demonstrated that endogenous HOOK3 binds to Golgi
membranes, whereas both HOOK1 and HOOK2 are localised to
discrete but unidentified cellular structures. In mice
the Hook1 gene is predominantly expressed in the testis.
Hook1 function is necessary for the correct positioning
of microtubular structures within the haploid germ cell.
Disruption of Hook1 function in mice causes abnormal
sperm head shape and fragile attachment of the flagellum
to the sperm head.
Length = 713
Score = 43.6 bits (103), Expect = 0.001
Identities = 56/239 (23%), Positives = 94/239 (39%), Gaps = 57/239 (23%)
Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQE-----NNVEGALEL-TRQAHDTSLKLKEQSTET 1334
LNDL ++K L + M +N L+E N G LE RQ + KL E+S +
Sbjct: 327 LNDLRRQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKA 386
Query: 1335 EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN-ELICD 1393
+K E + KR +E+ E ALQ+ E++ + +L + N EL C
Sbjct: 387 DK----LEFEYKRL--------------EEKLE-ALQKEKERLLAERDSLRETNEELRC- 426
Query: 1394 KRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDY--AETSKKLIN-EKESKA----- 1445
G + + S N A + +E +KLI + E+K
Sbjct: 427 -----------GQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQ 475
Query: 1446 --------EELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAA 1496
EL++ L ++N + ET +L N++ + ++ + A QE +
Sbjct: 476 EGSENERITELQQL---LEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSK 531
Score = 36.3 bits (84), Expect = 0.17
Identities = 33/143 (23%), Positives = 62/143 (43%), Gaps = 41/143 (28%)
Query: 1647 DLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVAL 1706
+L Q+ L+DA +++NE +++L +R+ +Q VE+ +K L
Sbjct: 485 ELQQL---LEDANRRNNELETQLRLANQRILELQQ----------QVEDLQKAL------ 525
Query: 1707 AQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAM 1766
Q+Q SK+E S L +++ + L + + + K +Q+ E E
Sbjct: 526 -QEQG---------------SKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPD 569
Query: 1767 FNSQESELTELSKNIAELQKRIQ 1789
+ LS+ IAEL+ +Q
Sbjct: 570 QDQN------LSRKIAELEAALQ 586
>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
Length = 156
Score = 41.1 bits (97), Expect = 0.001
Identities = 23/102 (22%), Positives = 45/102 (44%), Gaps = 17/102 (16%)
Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELT--RQAHDTSLKLKEQSTETEKQI 1338
+LT+ + L K E+ K+ L + E LE+T ++ + E ++
Sbjct: 59 RKNLTKASEILAKANELTKQYEQELSKARKEAQLEITQSQKEAKEIV---------ENEL 109
Query: 1339 NDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSL 1380
A+ K ++L+ + Q + EKAL+ L E++ +L
Sbjct: 110 KQAQ---KYIDSLLNEATKQLEA---QKEKALKSLEEQVDTL 145
>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein. This family
consists of several eukaryotic paraflagellar rod
component proteins. The eukaryotic flagellum represents
one of the most complex macromolecular structures found
in any organism and contains more than 250 proteins. In
addition to its locomotive role, the flagellum is
probably involved in nutrient uptake since receptors for
host low-density lipoproteins are localised on the
flagellar membrane as well as on the flagellar pocket
membrane.
Length = 289
Score = 42.3 bits (100), Expect = 0.002
Identities = 46/229 (20%), Positives = 82/229 (35%), Gaps = 58/229 (25%)
Query: 1290 KLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTE 1349
+ ++ + L++N Q+ EL R+ D + E+ E E++I ER+ +R
Sbjct: 46 QRRESEKFLQQN-VEQQQACWRAIQELERELRDLAE---ERREEVERRIEAVEREEQRRT 101
Query: 1350 NLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI---CDKRGDPCDNLCGGA 1406
+ F ++++ L+R E + + L E + CD D L
Sbjct: 102 -----DYESFLNFAAQHKQRLRRTLENCDAALELARSLEEYVQEGCDHIPAKQDKL---- 152
Query: 1407 GCGKCGGMWCSNGTLSESNSA---------KDYAETSKKLINEKESKAEELRKHMI---- 1453
LSE A + T +L +KE + EEL + +
Sbjct: 153 -----------QQALSELLLAVRKERLEAFRALYLTLGELTYKKERRLEELDRQIRYYEH 201
Query: 1454 ---ILSES--NSAKDYAETSKKL-------------INEKEAKAEEVFR 1484
E+ +AK+Y+E K+L + K+ A E F
Sbjct: 202 QRESAMETLDPNAKEYSEAKKELLEQRAEVEGEINALRAKQDAALEDFE 250
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil domain
of unknwon function.
Length = 126
Score = 39.5 bits (93), Expect = 0.002
Identities = 23/115 (20%), Positives = 50/115 (43%), Gaps = 21/115 (18%)
Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
EE ++E L +K+ L+ + +K L + E+ KR R + + +L
Sbjct: 27 EELLKQREEELEKKEEE-LQESLIKF-DKFLKENEA---KRRRAEKKAEEEKKLRKEKEE 81
Query: 1756 KLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINF------IVDKSNNYKN 1804
++K+L EL EL I +L+++++ + +V++S ++
Sbjct: 82 EIKELKA----------ELEELKAEIEKLEEKLEEYQPYEEFLESVVERSPEFEE 126
Score = 35.3 bits (82), Expect = 0.072
Identities = 21/108 (19%), Positives = 50/108 (46%), Gaps = 7/108 (6%)
Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKV-KLLQE-RLK 1677
L E R + ++A+ +++ + L Q +L+ +++ E+ +K K L+E K
Sbjct: 2 LLEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAK 61
Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKL 1725
+A + A + + K E+E+ + + L++ ++ + KL
Sbjct: 62 RRRA-----EKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKL 104
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 42.6 bits (101), Expect = 0.002
Identities = 50/289 (17%), Positives = 92/289 (31%), Gaps = 72/289 (24%)
Query: 1506 HAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERA 1565
+A R + + + +++ QLD L + + A +K +Q + EQ
Sbjct: 23 SSAFARAASNGDLPTEADVQAQLD--------ALNKQKLLE--AEDKLVQQDLEQTLALL 72
Query: 1566 NEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEART 1625
++I+ + + L + L +A L + T+ T+ R
Sbjct: 73 DKIDRQKEETEQLKQQLAQAPAKLRQAQ--AELEALKDDNDEETRETLST-----LSLRQ 125
Query: 1626 AQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQ 1685
+ L+Q + L +AQ E N ++ LQ + + QA
Sbjct: 126 LESR------------------LAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQA---- 163
Query: 1686 NGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNK 1745
AL R Q+ N L R ++LL
Sbjct: 164 -------------------ALYANSQ-----RLQQIRNLLKGGKVGGKALRPSQRVLLQ- 198
Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQ-----ESELTELSKNIAELQKRIQ 1789
A Q +N A+ L N+Q + + L+ I L+ ++Q
Sbjct: 199 AEQALLN--AQN-DLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQ 244
Score = 33.0 bits (76), Expect = 1.9
Identities = 21/119 (17%), Positives = 48/119 (40%), Gaps = 15/119 (12%)
Query: 1282 NDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDA 1341
D+ ++ LNK + E+ + Q ++E L L + KE++ + ++Q+ A
Sbjct: 39 ADVQAQLDALNKQKLLEAEDKLVQQ--DLEQTLALLDKIDRQ----KEETEQLKQQLAQA 92
Query: 1342 ERQCKR-TENLVTKTFPQFTQGQEENEKA----LQ-RLNEKITSLE---KNLPDLNELI 1391
+ ++ L + +E L+ RL + + L+ +L + N +
Sbjct: 93 PAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQL 151
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 41.6 bits (98), Expect = 0.003
Identities = 27/173 (15%), Positives = 63/173 (36%), Gaps = 20/173 (11%)
Query: 1572 IKSLTDIDTILTETAGDLAKAND-LKRKANLTKAGADSTKNTVQKIVDVL-TEARTAQDM 1629
++ L + L E + + + L + NL +S K ++K + L E + + +
Sbjct: 142 MQLLEGLKEGLEENLEGMKRDEELLNKDLNL----INSIKPKLRKKLQALKEEIASLRQL 197
Query: 1630 AEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS 1689
A+ ++ AR++L ++ + + +++ E LQ+ L+ +
Sbjct: 198 ADELNLCDPLELEKARQELRSLSVKISEKRKQLEE-------LQQELQELTIAIEALTNK 250
Query: 1690 AMDVENEEKNLEKEVALAQ----KQASGLRSRYQETDNKL---LSKAESSGLK 1735
++ E EK + K+ S L+++ S L
Sbjct: 251 KSELLEEIAEAEKIREECRGWSAKEISKLKAKVSLLQKLTGWKFLSLSGSTLS 303
Score = 31.5 bits (72), Expect = 3.9
Identities = 35/180 (19%), Positives = 62/180 (34%), Gaps = 48/180 (26%)
Query: 1213 TKEFELLEK---KIEDVK-ALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLG 1268
++ ELL K I +K L + ++ + D++ N DP
Sbjct: 160 KRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADEL-----NLCDPL------- 207
Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL-KL 1327
+ AR L L+ +I + K L+E L QE Q ++ L
Sbjct: 208 ----ELEKARQELRSLSVKISEKRK---QLEE---LQQE----------LQELTIAIEAL 247
Query: 1328 KEQSTETEKQINDAER---QCKRTENLVTKTFPQFTQGQEENEKALQRL-NEKITSLEKN 1383
+ +E ++I +AE+ +C+ + LQ+L K SL +
Sbjct: 248 TNKKSELLEEIAEAEKIREECRGWSAKEISKL-------KAKVSLLQKLTGWKFLSLSGS 300
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 41.6 bits (98), Expect = 0.003
Identities = 55/278 (19%), Positives = 101/278 (36%), Gaps = 53/278 (19%)
Query: 1519 ASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDI 1578
++ S+I +Q L + L D R + +++++ E A EI KSL +
Sbjct: 128 SAPSSIKEQKSSELKKVDSQLPDPRAL---LKGEDLKT-----VETAEEI---EKSLKAM 176
Query: 1579 DTILTETAGDLAKANDL----KRKANLTKAGADSTKN---TVQKIVDVLTEARTAQDMAE 1631
+ DL DL KR + K + ++ + E + +
Sbjct: 177 QQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQK-- 234
Query: 1632 VAIQTAKDDISAARKDL--------SQITNDLDDAQQKS--NETNVK---VKLLQERLKS 1678
I++AK+D+ +K L NDL + K ++K V L ++
Sbjct: 235 --IKSAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAIKSGDLKNFAVDLFGPEIRK 292
Query: 1679 IQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNK---LLSKAESSGLK 1735
FL+ A + N+ K KE A + A+G +++ + L+ KA+ SG
Sbjct: 293 YLQKFLKYYDQAEPLLNKSKKEPKEEA-VETLANGEDYDFKDDNPYPDFLIKKADVSGKM 351
Query: 1736 RARG--------------QMLLNKASQLSVNTTAKLKQ 1759
LL K + ++++ A K
Sbjct: 352 PDGDVGEIKGEVSNITNDHDLLGKPTTINLSAEASGKG 389
>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
outer membrane / Intracellular trafficking and
secretion].
Length = 457
Score = 41.7 bits (98), Expect = 0.004
Identities = 35/239 (14%), Positives = 69/239 (28%), Gaps = 42/239 (17%)
Query: 1580 TILTETAG---DLAKANDLKRKANLTKAGADSTKNTVQK--------IVDVLTEARTAQD 1628
++ E A DL A + A T A A+ +K +DVL +A
Sbjct: 151 SLAAEVATAYFDLLAAQEQLALAEETLAAAEEQLELAEKRYDAGLATRLDVL-QAEAQLA 209
Query: 1629 MAEVAIQTAKDDISAARKDLSQITN-DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNG 1687
A + A+ ++ AR L+++ + + + +
Sbjct: 210 SARAQLAAAQAQLAQARNALARLLGLEPGELLDAPEPEALPALPPVLPDGLPSEALARRP 269
Query: 1688 --RSAMDVENEEKNLEKEVALAQKQ------ASGLRSRYQETDNKLLSKAESSGL----- 1734
+A E + + A+ + R + L + S
Sbjct: 270 DILAA---EAQLAAANANIGAARAAFLPTLSLTASYGRSSTNLSGLFGSSSRSWSVGPGL 326
Query: 1735 -------KRARGQMLLNKASQLSVNTTAKLKQLNETEA-MFNSQESELTELSKNIAELQ 1785
R R ++ + ++ + A L Q +T L L + +LQ
Sbjct: 327 SLPIFDGGRLRARV---RQAEAQYD--AALAQYEQTVLTARQEVADALAALEAALEQLQ 380
Score = 37.8 bits (88), Expect = 0.046
Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 12/105 (11%)
Query: 1620 LTEARTAQDMA----EVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQER 1675
+ +A D A E + TA+ +++ A L L +Q V+ QE
Sbjct: 340 VRQAEAQYDAALAQYEQTVLTARQEVADALAALEAALEQLQALRQ-------AVEAAQEA 392
Query: 1676 LKSIQAGFLQNGRSAMDVENEEKNLEK-EVALAQKQASGLRSRYQ 1719
L+ + + R+ +DV + ++ L + AL Q + L +
Sbjct: 393 LELARERYQAGVRTLLDVLDAQRTLLQARQALLQARYDYLVALVN 437
>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV. The
binding of the LYPxL motif of late HIV p6Gag and EIAV
p9Gag to this domain is necessary for viral budding.This
domain is generally central between an N-terminal Bro1
domain, pfam03097 and a C-terminal proline-rich domain.
The retroviruses thus used this domain to hijack the
ESCRT system of the cell.
Length = 292
Score = 41.1 bits (97), Expect = 0.004
Identities = 45/281 (16%), Positives = 90/281 (32%), Gaps = 65/281 (23%)
Query: 1558 PEQIRERANEI------NNVIKSLTDIDTILTETAGDLAKANDL------KRKANLTKAG 1605
P + E+A EI + +SL D+ + L + +L + + K G
Sbjct: 8 PPSLLEKAEEIRSEGGIERLERSLDDLPELSQRNEEILDEIKELLDEEESEDEQLRAKYG 67
Query: 1606 ADST-----------KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
T + ++K L +A + ++ ++++ ++
Sbjct: 68 ERWTRPPSSELTAELREELRKYRGYLEQASESDAQLRSKLEEHEENLELLSGPEEELEAL 127
Query: 1655 LDDAQQ-KSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
L + K+ E + ++ L+E L + ++ E + L +E
Sbjct: 128 LPSSSPSKTPEVSEQISRLRELLNKLNE-----------LKAEREKLLEE---------- 166
Query: 1714 LRSRYQETDN-KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQL-NETEAMFNSQE 1771
L+ + Q+ D KLL K QL K L E + QE
Sbjct: 167 LKEKAQDDDISKLLIALN----KLGSSNE-----EQLFEEELKKFDPLQERVEQNLSKQE 217
Query: 1772 SELTEL---------SKNIAELQKRIQSCINFIVDKSNNYK 1803
L E+ + QK + + + + + YK
Sbjct: 218 ELLKEIQEANEEFLQERKSDSEQKEREKVLQKLENAYDKYK 258
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels of
palindromic sequences in DNA. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1042
Score = 41.5 bits (97), Expect = 0.005
Identities = 82/560 (14%), Positives = 154/560 (27%), Gaps = 87/560 (15%)
Query: 1270 TTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKE 1329
++ L + L L E +++ ++ L + +E L+ + +++E
Sbjct: 218 HERKQVLEKE-LKHLREALQQTQQSHAYLTQKREAQEE-----QLKKQQLLKQLRARIEE 271
Query: 1330 ---QSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPD 1386
Q E+ R K L TQ +++ ++ L K+ S K L
Sbjct: 272 LRAQEAVLEETQERINRARKA-APLAAHI-KAVTQIEQQAQRIHTELQSKMRSRAKLLMK 329
Query: 1387 LNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAE 1446
+ + S+ +D E I E +
Sbjct: 330 RAAHVKQQSSIEEQR-------------RLLQTLHSQEIHIRDAHE-VATSIREISCQQH 375
Query: 1447 ELRKHMIILSESNSA---KDYAETSKKLINEKE--------------------AKAEEVF 1483
L +H+ L + + K + + I ++E AK ++
Sbjct: 376 TLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQEL 435
Query: 1484 RGITTAKQESHAANIL--AKEAYDHAASVRNKTAA---YVASTSNITKQLDEFLNAPGAT 1538
+ + K H + + I Q A
Sbjct: 436 QQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLAR 495
Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRK 1598
L +Q P + N + DID T
Sbjct: 496 L------------LELQEEPCPLCGSCIHPN---PARQDIDNPGPLTRRMQRGEQ---TY 537
Query: 1599 ANLTKAGAD------STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQIT 1652
A L + D S + + + + E + Q + + + +L IT
Sbjct: 538 AQLETSEEDVYHQLTSERKQRASLKEQMQEIQ--QSFSILTQC--DNRSKEDIPNLQNIT 593
Query: 1653 NDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQ--KQ 1710
L D +K +E + Q L LQ + DV + +E+AL
Sbjct: 594 VRLQDLTEKLSEAEDMLACEQHALLRK----LQPEQDLQDVRLHLQQCSQELALKLTALH 649
Query: 1711 ASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQ 1770
A L + LS A Q+ L K T + L + + +
Sbjct: 650 ALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLREL 709
Query: 1771 ESELTELSKNIAELQKRIQS 1790
E+ + E + E++ S
Sbjct: 710 ETHIEEYDREFNEIENASSS 729
Score = 36.5 bits (84), Expect = 0.15
Identities = 62/395 (15%), Positives = 118/395 (29%), Gaps = 66/395 (16%)
Query: 1218 LLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLA 1277
+++ + L + H LT +K+Q+ + + TQ N +
Sbjct: 529 RMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQ------SFSILTQCDNRS 582
Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQ 1337
+ + +L +L E L E +L +L + ++L Q E
Sbjct: 583 KEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELA 642
Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGD 1397
+ L Q T QE + I L K L +L K
Sbjct: 643 L-----------KLTALHALQLTLTQERVREHALS----IRVLPKELLASRQLALQKMQS 687
Query: 1398 PCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSE 1457
+ L K+ + L+ E E+ EE + +
Sbjct: 688 EKEQLTY----------------------WKEMLAQCQTLLRELETHIEEYDREFNEIEN 725
Query: 1458 -SNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILA--------KEAYDHAA 1508
S+S + +N+ + R + A+ E+H N E AA
Sbjct: 726 ASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHLAA 785
Query: 1509 SVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEI 1568
++ T K L+ + + DI N+ + + + ++ E++
Sbjct: 786 EIQFFNRLREEDTHL-LKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKS--- 841
Query: 1569 NNVIKSLTDIDTILTETAGDLAKANDLKRKANLTK 1603
+L +I L + K+ A LT+
Sbjct: 842 ----ATLGEITHQLLKY------EECSKQLAQLTQ 866
Score = 34.9 bits (80), Expect = 0.54
Identities = 70/508 (13%), Positives = 145/508 (28%), Gaps = 101/508 (19%)
Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQC 1345
+ K L+ E+L + LL T DT + K+ + K + +A +Q
Sbjct: 187 AKKKSLHGKAELLTLRSQLLTL--------CTPCMPDTYHERKQVLEKELKHLREALQQT 238
Query: 1346 KRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
+++ +T Q + QEE K Q L + +E+
Sbjct: 239 QQSHAYLT----QKREAQEEQLKKQQLLKQLRARIEELRAQEAV---------------- 278
Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYA 1465
L E+ + A + L ++ + ++ I +E
Sbjct: 279 ---------------LEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTE-------- 315
Query: 1466 ETSKKLINEKEAKAEEVF-RGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNI 1524
+ K ++ + KQ+S H+ + + A VA
Sbjct: 316 ------LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVA----- 364
Query: 1525 TKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTE 1584
T + ++I L Q + + + DI
Sbjct: 365 TSIREISCQQ----HTLTQHIHTLQ----------QQKTTLTQKLQSLCKELDILQREQA 410
Query: 1585 TAG-DLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISA 1643
T + DL+ + A A + Q+ ++ AI
Sbjct: 411 TIDTRTSAFRDLQGQ----LAHAKKQQELQQRYAELC----------AAAITCTAQCEKL 456
Query: 1644 ARKDLSQITNDLDDAQQKSNETNVKVKLLQE-RLKSIQAGFLQNGRSAMDVENEEKNLEK 1702
+ L + L + +Q+ + LQE R K++ L +
Sbjct: 457 EKIHLQESAQSLKEREQQ--LQTKEQIHLQETRKKAVVLARLLE-LQEEPCPLCGSCIHP 513
Query: 1703 EVALAQKQASG-LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLN 1761
A G L R Q + S + + + L +++++
Sbjct: 514 NPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLK----EQMQEIQ 569
Query: 1762 ETEAMFNSQESELTELSKNIAELQKRIQ 1789
++ ++ ++ E N+ + R+Q
Sbjct: 570 QSFSILTQCDNRSKEDIPNLQNITVRLQ 597
Score = 31.9 bits (72), Expect = 4.0
Identities = 27/143 (18%), Positives = 49/143 (34%), Gaps = 23/143 (16%)
Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALEL---TRQAHDTSLKLKEQST 1332
AR L TE N E+ T + +++ ++ R+ LK E
Sbjct: 751 QARTVLKARTEA--HFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEI 808
Query: 1333 ETEK----QINDA--ERQCKRTENLVTK------TFPQFTQGQEENEKALQRLNE----- 1375
E I + E + E +++ T + T + E+ ++L +
Sbjct: 809 GQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQ 868
Query: 1376 -KITSLEKNLPDLNELICDKRGD 1397
KI L L +N++ GD
Sbjct: 869 AKIIQLSDKLNGINQIKIQFDGD 891
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from laminin
A, B1 and B2 may come together to form a triple helical
coiled-coil structure.
Length = 263
Score = 40.4 bits (95), Expect = 0.005
Identities = 41/174 (23%), Positives = 67/174 (38%), Gaps = 27/174 (15%)
Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLAR 1278
L K +D + L + L + I +TLG+ +
Sbjct: 53 LSKLAQDAENLQK-------KANQALAKAQKVNAAIE---------RTLGHAKTLAEAIK 96
Query: 1279 LALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKE-QSTETEKQ 1337
+++ E ++KL L ENA L + + L Q L E + + +KQ
Sbjct: 97 NLRDNIKEILEKL----AGLGENAFALPGSALSRMLAEA-QRM-----LGEIRERDFQKQ 146
Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
+ +AE + K E L+ + F QEENE + L + + E L DL EL+
Sbjct: 147 LQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELL 200
Score = 35.1 bits (81), Expect = 0.27
Identities = 50/247 (20%), Positives = 98/247 (39%), Gaps = 34/247 (13%)
Query: 1523 NITKQLDEFLNAPGATLADI----RNISGLAVN-KNIQSNPEQIRERANEINNVIKSLTD 1577
N+T+QL L+ AT + + +S LA + +N+Q Q +A ++N I+
Sbjct: 27 NLTQQLSGRLSPISATKRQLEDLEKALSKLAQDAENLQKKANQALAKAQKVNAAIERTLG 86
Query: 1578 IDTILTETAGDL-AKANDLKRKA-----NLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
L E +L ++ K N + + + +L E R +
Sbjct: 87 HAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEAQRMLGEIRERDFQKQ 146
Query: 1632 VAIQTAKDDISAARKDLSQI----------TNDL-DDAQQKSNETNVKVKLLQERLKSIQ 1680
+Q A+ ++ AA + L+++ L + + NE K+ L+E L Q
Sbjct: 147 --LQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELLDEAQ 204
Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
A R A + + +E +++ S L+++ +E L+ S L +A
Sbjct: 205 A----KTRDAERLNLANQKRLREFQEKKQEVSELKNQAEEH----LATGRDS-LDQANLL 255
Query: 1741 -MLLNKA 1746
L+++A
Sbjct: 256 LQLMDQA 262
Score = 34.7 bits (80), Expect = 0.32
Identities = 49/241 (20%), Positives = 81/241 (33%), Gaps = 50/241 (20%)
Query: 1454 ILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNK 1513
IL + I+ + + E++ + ++ Q+ A L K+A A K
Sbjct: 21 ILYNIENLTQQLSGRLSPISATKRQLEDLEKALSKLAQD---AENLQKKANQALAKA-QK 76
Query: 1514 TAAYVASTSNITKQLDEFLNAPGATLADIRN-ISGLAVN-------------KNIQSNPE 1559
A + T K L E + + +I ++GL N Q
Sbjct: 77 VNAAIERTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEAQRMLG 136
Query: 1560 QIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDV 1619
+IRER D L +L A +L N VQ
Sbjct: 137 EIRER------------DFQKQLQNAEAELKAAEELL--------------NRVQTWF-- 168
Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
+T Q+ E + +DD++ LS + LD+AQ K+ + Q+RL+
Sbjct: 169 ----QTPQEENEGLFEALRDDLNEYEAKLSDLRELLDEAQAKTRDAERLNLANQKRLREF 224
Query: 1680 Q 1680
Q
Sbjct: 225 Q 225
Score = 32.4 bits (74), Expect = 1.9
Identities = 34/217 (15%), Positives = 67/217 (30%), Gaps = 41/217 (18%)
Query: 1189 LADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSH-----DLTVM 1243
L + + + A + A + L + I++++ ++ +
Sbjct: 63 LQKKANQALAKAQKVNA--AIERTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALP 120
Query: 1244 LTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENAT 1303
+ + M + + LG +R +L + E+K + ++
Sbjct: 121 GSALSRMLAEAQ---------RMLGEIRERDFQKQLQNAE--AELKAAEELLNRVQTWFQ 169
Query: 1304 LLQENNVEGALE-LTRQAHDTSLKLK---EQSTETEKQINDAERQCKRTENLVTKTFPQF 1359
QE N EG E L ++ KL E E + + DAER
Sbjct: 170 TPQEEN-EGLFEALRDDLNEYEAKLSDLRELLDEAQAKTRDAERL--------------- 213
Query: 1360 TQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRG 1396
N+K L+ EK + + E + R
Sbjct: 214 ---NLANQKRLREFQEKKQEVSELKNQAEEHLATGRD 247
>gnl|CDD|184061 PRK13454, PRK13454, F0F1 ATP synthase subunit B'; Provisional.
Length = 181
Score = 39.4 bits (92), Expect = 0.006
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 14/80 (17%)
Query: 1574 SLTDIDTILTETAG----DLAKANDLKRKANL-----TKAGADSTKNTVQKIVDVLTEAR 1624
+L I +L E G DLA A +LK+KA KA AD+ + Q+IV E R
Sbjct: 52 ALPRIGAVLAERQGTITNDLAAAEELKQKAVEAEKAYNKALADA-RAEAQRIV---AETR 107
Query: 1625 TA-QDMAEVAIQTAKDDISA 1643
Q +VAI A +I+A
Sbjct: 108 AEIQAELDVAIAKADAEIAA 127
>gnl|CDD|147988 pfam06120, Phage_HK97_TLTM, Tail length tape measure protein. This
family consists of the tail length tape measure protein
from bacteriophage HK97 and related sequences from
Escherichia coli O157:H7.
Length = 311
Score = 40.6 bits (94), Expect = 0.007
Identities = 49/224 (21%), Positives = 83/224 (37%), Gaps = 29/224 (12%)
Query: 1460 SAKDYAETSKKLINE-KEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYV 1518
SA DYA+T +++ ++ KE E A + A N ++ A +++ + Y
Sbjct: 52 SAIDYADTIEEIRDKLKEMSLPEASANEGKANEALDAQNRAIEDQEQKIAEIKDAISGYN 111
Query: 1519 ASTSNITKQLDEFL-----NAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIK 1573
A + +L + AT LAV + EQ+ + EI +I+
Sbjct: 112 AKLAANGASEGGWLVNHLTSMDDATEEAADFADALAVE---EERLEQMLAKRKEIQQLIE 168
Query: 1574 SLTD--IDTI-------------LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
+ D ID I L E G+ A+ + L N A N K
Sbjct: 169 DIEDQAIDLIRNKAAEMAGAVQSLLEMNGEHAEFDRLLGLGNKLLAARQGLANIPMKYAG 228
Query: 1619 VLTEARTAQDMAEVAIQTAKDDISAAR-----KDLSQITNDLDD 1657
+ A + + AI+ A+ D+ + K ++Q D DD
Sbjct: 229 PVIPAADLDNKQKDAIEKAQRDLELSGLKGEAKAIAQAEFDADD 272
>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
Provisional.
Length = 593
Score = 40.7 bits (95), Expect = 0.007
Identities = 43/196 (21%), Positives = 73/196 (37%), Gaps = 36/196 (18%)
Query: 1634 IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI----------QAGF 1683
QTA + A D A+ + K+ Q +L+S+ +A
Sbjct: 129 FQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKLQSLDPADPGYAQAEAAV 188
Query: 1684 LQNGRSAMDVEN--EEKNLEKEVALAQK-----QASGLRSRYQETDNKLLSKAESSGLKR 1736
Q G+ A + + ++ A +A + +++Q T N S G +
Sbjct: 189 EQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQNQVSQGEQD 248
Query: 1737 -----ARGQMLLNKASQL-SVNTTAKLKQLNETEAMFNS-QESELTELSKNIAELQK--- 1786
AR ML+ ++ NT L+ N+ A+FN+ QE E+ K AE Q+
Sbjct: 249 NLSNVARLTMLMAMFIEIVGKNTEESLQ--NDL-ALFNALQEGRQAEMEKKSAEFQEETR 305
Query: 1787 ------RIQSCINFIV 1796
RI CI ++
Sbjct: 306 KAEETNRIMGCIGKVL 321
Score = 39.2 bits (91), Expect = 0.023
Identities = 48/283 (16%), Positives = 91/283 (32%), Gaps = 41/283 (14%)
Query: 1476 EAKAEEVFRGITTAKQESHAAN-ILAKEAYDHAASVRNKTAAYVASTSNITKQLD---EF 1531
EA E V + K A ++A +A D A ++ +A +T + D E
Sbjct: 21 EAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAI--NTVGLKPPTDAAREK 78
Query: 1532 LNAPGATLADIRNISGLAVNKN--------------IQSNPEQIRERANEINNVIKSLTD 1577
L++ G + + L + + I+S E + + E + +
Sbjct: 79 LSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQE 138
Query: 1578 IDTILTETAGDL----AKANDLKRKANLTKA---GADSTKNTVQKIVDVLTEARTAQDMA 1630
+ + + + ++K + D + + +A A
Sbjct: 139 ATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEA 198
Query: 1631 EVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS- 1689
+ A+ A D A D D+ K T Q ++ + L N
Sbjct: 199 KEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAAS--QNQVSQGEQDNLSNVARL 256
Query: 1690 ----AMDVE----NEEKNLEKEVALAQKQASGLRSRYQETDNK 1724
AM +E N E++L+ ++AL G R E + K
Sbjct: 257 TMLMAMFIEIVGKNTEESLQNDLALFNALQEG---RQAEMEKK 296
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is thought
to translocate protons through membrane (inner membrane
in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1) subunits.
This domain should not be confused with the ab CF(1)
proteins (in the head of the ATP synthase) which are
found in pfam00006.
Length = 132
Score = 38.4 bits (90), Expect = 0.007
Identities = 24/106 (22%), Positives = 45/106 (42%), Gaps = 13/106 (12%)
Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
+KI + + EA A + A+ ++ AR + S+I N+ QK E + Q
Sbjct: 33 EKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEE--ILAEAQ 90
Query: 1674 ERLKSIQAGFLQNGRSAMDVENEE----KNLEKEV-ALAQKQASGL 1714
+ + + + ++E E+ L ++V ALA + A L
Sbjct: 91 KDAERLLE------SARAEIEQEKEQALAELRQQVAALAVQIAEKL 130
Score = 36.1 bits (84), Expect = 0.038
Identities = 21/114 (18%), Positives = 44/114 (38%), Gaps = 11/114 (9%)
Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
K+L ER + I + E+++A A+ +AS + + ++ KL +
Sbjct: 26 KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI 85
Query: 1730 ESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAE 1783
+ K A ++L + +++ E E ++ L+ IAE
Sbjct: 86 LAEAQKDAE-RLLESARAEIEQ----------EKEQALAELRQQVAALAVQIAE 128
>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein.
Length = 304
Score = 40.2 bits (94), Expect = 0.007
Identities = 29/193 (15%), Positives = 60/193 (31%), Gaps = 36/193 (18%)
Query: 1552 KNIQSNPEQ--IRERANEINNVIK---SLTDIDTILTETAGDLAKANDLKRKANLTKAGA 1606
+Q + + + +K L +D + A D A+A +A+ A
Sbjct: 3 AEVQP-QVSGIVIRILVKEGDRVKAGDVLVRLDPTPYQAALDRAEAELAAAQADAQARLA 61
Query: 1607 DSTKNTVQKIV------------DVLTEARTAQD----MAEVA-IQTAKDDISAARKDLS 1649
+ ++ +Q +V R A A A ++ A+D++ +A L
Sbjct: 62 RAERDRLQALVAQAAIQSEQEADAARAALREALAASRGAASKAQLEAARDNLRSALALLP 121
Query: 1650 QITNDLDDAQ-----QKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV 1704
+ +Q +N + + +L Q E L +
Sbjct: 122 ISLGRIGASQATLGALLNNLQA-QGLAARAQLDEAYLQLAQ-------TAAELLRLIRAP 173
Query: 1705 ALAQKQASGLRSR 1717
A ++ S + R
Sbjct: 174 AQGERVQSEAQLR 186
>gnl|CDD|147523 pfam05384, DegS, Sensor protein DegS. This is small family of
Bacillus DegS proteins. The DegS-DegU two-component
regulatory system of Bacillus subtilis controls various
processes that characterize the transition from the
exponential to the stationary growth phase, including the
induction of extracellular degradative enzymes,
expression of late competence genes and down-regulation
of the sigma D regulon. The family also contains one
sequence from Thermoanaerobacter tengcongensis which is
described as a sensory transduction histidine kinase.
Length = 159
Score = 38.7 bits (91), Expect = 0.007
Identities = 26/167 (15%), Positives = 70/167 (41%), Gaps = 33/167 (19%)
Query: 1633 AIQTAKDDI----SAARKDLSQITNDLDDAQQKSNETNVKVKLLQE-------RLKSIQA 1681
++ +K++I AR++ ++ +L++ +++ +E +V L++ RL +
Sbjct: 10 TVEESKEEIFEIAEQARQEYERLEQELEEVKEEVSEVIKEVDKLEKQERRARKRLAEVSR 69
Query: 1682 GFLQNGRSAM-DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
F + + + + +L+ ++AL +++ LR R E + +L LK
Sbjct: 70 NFDKYSEEDIREAYEKAHDLQVKLALLRQEEKQLRKRRDELERRLK------NLKE---- 119
Query: 1741 MLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
+ +A L L L +L ++ + + + +++
Sbjct: 120 -TIERAENLVSQVGVVLNYLT----------GDLKQVGEQLEDAKQK 155
>gnl|CDD|226055 COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope
biogenesis, outer membrane].
Length = 372
Score = 40.2 bits (94), Expect = 0.009
Identities = 28/143 (19%), Positives = 53/143 (37%), Gaps = 30/143 (20%)
Query: 1610 KNTVQKIVDVLTEARTA-------------QDMAEVAIQTAKDDISAARKDLSQITNDLD 1656
+ VQK + + +A + AEV + +S +L I LD
Sbjct: 185 EEEVQKAEERVKKASNDLTDYRIKNGVFDPKAQAEVQMSL----VSKLEDELIVIQAQLD 240
Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQ---------NGRSAMDVENEEKNLEKEVALA 1707
+ N N ++ L+ R++S++ LQ + +S + E + L E A
Sbjct: 241 TVKSVMNPENPQIPGLKARIESLRKQLLQEKQAISAGGSSQSLSNQAAEFQRLYLENTFA 300
Query: 1708 QKQASGLRSRYQ----ETDNKLL 1726
+K + + + E D + L
Sbjct: 301 EKAYAAALTSLESARIEADRQQL 323
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 40.6 bits (95), Expect = 0.010
Identities = 43/230 (18%), Positives = 83/230 (36%), Gaps = 27/230 (11%)
Query: 1562 RERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLT 1621
R R I + L +D L E A +L +R AGA S ++
Sbjct: 738 RARLRRIAELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLRAAHRRA-A 796
Query: 1622 EARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQA 1681
EA + AE + A +AA Q +L+ + + ++ + L+++
Sbjct: 797 EAERQAESAERELARAARKAAAAAAAWKQARRELER---DAADLDLPTDP--DALEAVGL 851
Query: 1682 GFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQM 1741
+ G +E +E+ A +A+ R+R ++ AE + RA
Sbjct: 852 ALKRFGDHLHTLE----VAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAE--- 904
Query: 1742 LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
+A + +L+ L E+ + + E+ +AE + + S
Sbjct: 905 -AEEA-------SLRLRTLEESVG------AMVDEIRARLAETRAALASG 940
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 39.2 bits (92), Expect = 0.015
Identities = 26/133 (19%), Positives = 59/133 (44%), Gaps = 17/133 (12%)
Query: 1590 AKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLS 1649
++ +L +L + G +++V + L+ A + + + +AA
Sbjct: 60 SQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAG 119
Query: 1650 QITNDLDDAQQKSNETNVKVKLLQERLKSI--QAGFLQNGRSAMDVENEEKNLEKE---- 1703
++ +LD +Q S +V+LL +++ ++ Q L+ +A+D E+++ E +
Sbjct: 120 ELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALE---AALDAS-EKRDRESQAKIA 175
Query: 1704 -------VALAQK 1709
VALAQ+
Sbjct: 176 DLGRRLNVALAQR 188
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 39.8 bits (94), Expect = 0.015
Identities = 27/121 (22%), Positives = 54/121 (44%), Gaps = 13/121 (10%)
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
+ I E ++N +I SL +++ L + +A L ++ A+ K +++ +
Sbjct: 509 KLIGEDKEKLNELIASLEELERELEQ---KAEEAEALLKE-------AEKLKEELEEKKE 558
Query: 1619 VLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKS 1678
L E + + E A + A+ I A+K+ +I +L Q+ +VK L E K
Sbjct: 559 KLQEEE-DKLLEE-AEKEAQQAIKEAKKEADEIIKELRQLQKGGY-ASVKAHELIEARKR 615
Query: 1679 I 1679
+
Sbjct: 616 L 616
>gnl|CDD|235136 PRK03598, PRK03598, putative efflux pump membrane fusion protein;
Provisional.
Length = 331
Score = 39.2 bits (92), Expect = 0.016
Identities = 35/160 (21%), Positives = 67/160 (41%), Gaps = 27/160 (16%)
Query: 1590 AKANDLKRKANLTK--AGADSTKNTVQKIVDV---LTEARTAQDMAEVAIQTAKDDISAA 1644
AKAN +A L AG ++I + +A+ A D A+ +
Sbjct: 86 AKANVSVAQAQLDLMLAGYRD-----EEIAQARAAVKQAQAAYDYAQNFYNRQQG--LWK 138
Query: 1645 RKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV 1704
+ +S NDL++A+ ++ +K Q++L + G + + +
Sbjct: 139 SRTISA--NDLENARSSRDQAQATLKSAQDKLSQYREGN-----------RPQDIAQAKA 185
Query: 1705 ALAQKQASGLRSRYQETDNKLLSKAESSGLKRAR--GQML 1742
+LAQ QA+ ++ D +L++ ++ + L RA G ML
Sbjct: 186 SLAQAQAALAQAELNLQDTELIAPSDGTILTRAVEPGTML 225
>gnl|CDD|193488 cd04083, CBM6_discoidin-like, Carbohydrate Binding Module 6 (CBM6)
domains; many are found in association with discoidin
domain(s). This family includes carbohydrate binding
module 6 (CBM6) domains that are appended to several
carbohydrate binding enzymes; many are found in
association with carbohydrate-binding coagulation factor
5/8 C-terminal (discoidin) domain(s), for example the
CBM domain of Agl, an alpha-1,3-glucanase from Bacillus
circulans KA-304. Some CBM6 domains belonging to this
family are appended to glycoside hydrolase (GH) family
domains, including glycoside hydrolase family 31 (GH31),
for example the CBM domain of CtsY, a
3-alpha-isomaltosyltranasferase from Arthrobacter
globiformis A19. These CBM6s are non-catalytic
carbohydrate binding domains that facilitate the strong
binding of the GH catalytic modules with their
dedicated, insoluble substrates. CBM6 is an unusual CBM
as it represents a chimera of two distinct binding sites
with different modes of binding: binding site I within
the loop regions and binding site II on the concave face
of the beta-sandwich fold. GH31 has a wide range of
hydrolytic activities such as alpha-glucosidase,
alpha-xylosidase, 6-alpha-glucosyltransferase, or
alpha-1,4-glucan lyase, cleaving a terminal carbohydrate
moiety from a substrate that may be a starch or a
glycoprotein. Most characterized GH31 enzymes are
alpha-glucosidases.
Length = 124
Score = 37.0 bits (86), Expect = 0.016
Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 19/68 (27%)
Query: 561 YEAENANCKTDKCVVEIRQPLGGG----NENTW-TGPGFMKGFENTG-LVFEIDNIPTPM 614
YEAE+A L GG E T +G G++ GF G V N+P
Sbjct: 3 YEAEDAT-------------LSGGATVATEATGYSGTGYVVGFTAVGASVTFTVNVPEAG 49
Query: 615 DYDIVVRY 622
Y + +RY
Sbjct: 50 TYSLTLRY 57
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit. Within
the family is an example from a sodium-translocating
rather than proton-translocating ATP synthase [Energy
metabolism, ATP-proton motive force interconversion].
Length = 147
Score = 37.4 bits (87), Expect = 0.017
Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 17/104 (16%)
Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE--TNVKVKLL 1672
KI D L A A+ A +A + A+ + A+ + +I +++A ++ +E K +
Sbjct: 30 KIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEI---IENANKRGSEILEEAKAEAR 86
Query: 1673 QERLKSIQAGFLQNGRSAMDVENEE--KNLEKEVA-----LAQK 1709
+ER K R+ ++ E E+ + L K+VA A+K
Sbjct: 87 EEREKIKAQ-----ARAEIEAEKEQAREELRKQVADLSVLGAEK 125
Score = 31.6 bits (72), Expect = 1.5
Identities = 26/110 (23%), Positives = 45/110 (40%), Gaps = 15/110 (13%)
Query: 1698 KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKL 1757
+ +KE ALAQK+A + ++ +++ A RG +L +A + K+
Sbjct: 39 ERAKKEAALAQKKAQVILKEAKDEAQEIIENANK------RGSEILEEAKAEAREEREKI 92
Query: 1758 KQLNETE--AMFNSQESEL----TELSKNIAE--LQKRIQSCIN-FIVDK 1798
K E A EL +LS AE +++ I ++DK
Sbjct: 93 KAQARAEIEAEKEQAREELRKQVADLSVLGAEKIIERNIDKQAQKDLIDK 142
>gnl|CDD|114524 pfam05802, EspB, Enterobacterial EspB protein. EspB is a
type-III-secreted pore-forming protein of
enteropathogenic Escherichia coli (EPEC) which is
essential for EPEC pathogenesis. EspB is also found in
Citrobacter rodentium.
Length = 317
Score = 38.9 bits (90), Expect = 0.022
Identities = 23/100 (23%), Positives = 44/100 (44%), Gaps = 3/100 (3%)
Query: 1568 INNVIKSLTDIDTILTETAGDLA-KANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTA 1626
IN+ K +DI T + A+++ KA L KA +S + ++ + +A
Sbjct: 119 INSATKGASDIAQKATSASSKAVNAASEVATKA-LVKA-TESVADAAEEASSTMQQAMAT 176
Query: 1627 QDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETN 1666
A DD++ + + SQ+ + DA QK++ +
Sbjct: 177 ATKAASRTSGVADDVATSAQKASQVAEEAADAAQKASRLS 216
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 38.1 bits (89), Expect = 0.022
Identities = 31/171 (18%), Positives = 69/171 (40%), Gaps = 23/171 (13%)
Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
+++ + +K + L + ++D++ Q+ E + +RI
Sbjct: 24 PRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLE------SEIQEIRERIKR 77
Query: 1277 ARLALNDLT--EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTET 1334
A L+ + E++ LN ++ KE + N++E L +L E+ +
Sbjct: 78 AEEKLSAVKDERELRALNIEIQIAKE-----RINSLEDELA----------ELMEEIEKL 122
Query: 1335 EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
EK+I D + + +R E + + + + E + Q L+ K L++ L
Sbjct: 123 EKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLD 173
Score = 38.1 bits (89), Expect = 0.025
Identities = 29/176 (16%), Positives = 72/176 (40%), Gaps = 18/176 (10%)
Query: 1559 EQIRERANEINNVIKS----LTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQ 1614
+++ R EI +K L ++ L +L + + +
Sbjct: 20 DRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAE 79
Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQE 1674
+ + + + R + + + IQ AK+ I++ +L+ +L + +K + +++ L+E
Sbjct: 80 EKLSAVKDERELRAL-NIEIQIAKERINSLEDELA----ELMEEIEKLEK---EIEDLKE 131
Query: 1675 RLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAE 1730
RL+ ++ + + +E E + +E + L+ + D +LLS+ E
Sbjct: 132 RLERLEKNLAEA-EAR--LEEEVAEIREEGQELSSKREELKE---KLDPELLSEYE 181
Score = 36.2 bits (84), Expect = 0.12
Identities = 20/141 (14%), Positives = 56/141 (39%), Gaps = 4/141 (2%)
Query: 1639 DDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDV---EN 1695
+ A+ +L + L+ + + + +V L+ ++ I+ ++ + E
Sbjct: 31 KALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRER-IKRAEEKLSAVKDER 89
Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
E + L E+ +A+++ + L E ++ + + R + L ++
Sbjct: 90 ELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEE 149
Query: 1756 KLKQLNETEAMFNSQESELTE 1776
++ ++ E +S+ EL E
Sbjct: 150 EVAEIREEGQELSSKREELKE 170
Score = 34.6 bits (80), Expect = 0.37
Identities = 27/118 (22%), Positives = 54/118 (45%), Gaps = 12/118 (10%)
Query: 1673 QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS 1732
++ LK +A ++ +E E ++LE +V+ + + +R R + + KL S
Sbjct: 30 RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL------S 83
Query: 1733 GLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
+K R LN Q AK + + + + E E+ +L K I +L++R++
Sbjct: 84 AVKDERELRALNIEIQ-----IAKERINSLEDELAELME-EIEKLEKEIEDLKERLER 135
>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function (DUF948).
This family consists of bacterial sequences several of
which are thought to be general stress proteins.
Length = 90
Score = 35.6 bits (83), Expect = 0.028
Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 1560 QIRERANEINNVIKSLT-DIDTILTETAGDLAKANDL 1595
+ + +E+N + LT +D I TET LAK N L
Sbjct: 23 SLSKTLDEVNKTLAGLTKQVDGITTETEDLLAKTNVL 59
>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
DevB family. Members of this protein family are found
mostly in the Cyanobacteria, but also in the
Planctomycetes. DevB from Anabaena sp. strain PCC 7120 is
partially characterized as a membrane fusion protein of
the DevBCA ABC exporter, probably a glycolipid exporter,
required for heterocyst formation. Most Cyanobacteria
have one member only, but Nostoc sp. PCC 7120 has seven
members.
Length = 327
Score = 38.3 bits (89), Expect = 0.028
Identities = 27/141 (19%), Positives = 49/141 (34%), Gaps = 20/141 (14%)
Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKAN-DLKRKANLTKAGA------DSTKN 1611
+ A + K + L +L A ++ R +L + GA DS
Sbjct: 83 IAAQRAARAAAKLFKDVAAQQATLNRLEAELETAQREVDRYRSLFRDGAVSASDLDSKAL 142
Query: 1612 TVQKIVDVLTEAR----TAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
++ + L EA D A A+ + +++ DL+Q A+ KS
Sbjct: 143 KLRTAEEELEEALASRSEQIDGARAALASLAEEVRETDVDLAQ-------AEVKSALEA- 194
Query: 1668 KVKLLQERLKSIQAGFLQNGR 1688
V+ + L+ +GR
Sbjct: 195 -VQQAEALLELTYVKAPIDGR 214
>gnl|CDD|218489 pfam05192, MutS_III, MutS domain III. This domain is found in
proteins of the MutS family (DNA mismatch repair
proteins) and is found associated with pfam00488,
pfam05188, pfam01624 and pfam05190. The MutS family of
proteins is named after the Salmonella typhimurium MutS
protein involved in mismatch repair; other members of the
family included the eukaryotic MSH 1,2,3, 4,5 and 6
proteins. These have various roles in DNA repair and
recombination. Human MSH has been implicated in
non-polyposis colorectal carcinoma (HNPCC) and is a
mismatch binding protein. The aligned region corresponds
with domain III, which is central to the structure of
Thermus aquaticus MutS as characterized in.
Length = 290
Score = 38.2 bits (89), Expect = 0.031
Identities = 47/260 (18%), Positives = 95/260 (36%), Gaps = 56/260 (21%)
Query: 1557 NPEQIRER-----------ANEINNVIKSLTDIDTILTETAGDLAKANDLKR-KANLTKA 1604
+ ++I R E+ +K + D++ +L+ A A DL +++L
Sbjct: 47 DLDEINARLDAVEELLEDLREELREALKGIPDLERLLSRLALGRASPRDLLALRSSLEAL 106
Query: 1605 ----GADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKD----LSQITNDLD 1656
++ + + D L ++ E AI +D + +D +LD
Sbjct: 107 PALRKLLASAPLLADLADQL-PLPELLELLERAID---EDPPLSLRDGGVIKDGYDPELD 162
Query: 1657 DAQQKSNETNVKVKLLQER------LKSIQAGF-LQNGRSAMDVENEEKNLEKEVALAQK 1709
+ + +E K+ L ER +KS++ G+ G ++V+ + + + ++
Sbjct: 163 ELRALLDELREKLAELLERERERTGIKSLKVGYNRVFGYYVIEVKASKADKVPGDYI-RR 221
Query: 1710 QASGLRSRYQ-----ETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETE 1764
+ R+ E + KLL +A L L++L E
Sbjct: 222 STTKNAVRFTTPELKELERKLLE--------------AEERALALE---KEILEEL--LE 262
Query: 1765 AMFNSQESELTELSKNIAEL 1784
+ + L EL+ IAEL
Sbjct: 263 RVLSEYAELLQELADAIAEL 282
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes. This
domain is about 200 amino acids in length. This domain is
found associated with pfam00013, pfam01966. This domain
has a single completely conserved residue A that may be
functionally important.
Length = 201
Score = 37.5 bits (88), Expect = 0.032
Identities = 37/147 (25%), Positives = 63/147 (42%), Gaps = 34/147 (23%)
Query: 1641 ISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNL 1700
IS+A ++ QI +++A+ K E K LL+ + + + R+ + E E K
Sbjct: 29 ISSAEEEAKQI---IEEAK-KEAEALKKEALLEAKEEIHKL------RA--EAERELKER 76
Query: 1701 EKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQL 1760
E+ Q+Q + +LL K E+ L R + + L K L K K+L
Sbjct: 77 RNEL---QRQ-----------EKRLLQKEET--LDR-KDESLEKKEESLE----EKEKEL 115
Query: 1761 NETEAMFNSQESELTEL-SKNIAELQK 1786
+ +E EL EL + EL++
Sbjct: 116 AARQQQLEEKEEELEELIEEQQQELER 142
>gnl|CDD|227109 COG4768, COG4768, Uncharacterized protein containing a divergent
version of the methyl-accepting chemotaxis-like domain
[General function prediction only].
Length = 139
Score = 36.3 bits (84), Expect = 0.035
Identities = 22/83 (26%), Positives = 32/83 (38%), Gaps = 5/83 (6%)
Query: 1567 EINNVIKSLT-DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTV----QKIVDVLT 1621
E+ +K LT +D I ET L K N L A D + V Q + D+
Sbjct: 35 EVAKTLKGLTSQVDGITHETEELLHKTNTLAEDVQGKVATLDPVFDAVKDLGQSVSDLNQ 94
Query: 1622 EARTAQDMAEVAIQTAKDDISAA 1644
R A A++ + I+ A
Sbjct: 95 SVRHLATRATNAVEKNEKKIAQA 117
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 37.3 bits (87), Expect = 0.039
Identities = 24/115 (20%), Positives = 50/115 (43%), Gaps = 8/115 (6%)
Query: 1679 IQAGFLQNGRSAMD----VENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGL 1734
+ + +L + SA + ++ E L++E+A Q+Q + L+ QE +L S E+
Sbjct: 53 VLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQEL-STLEAELE 111
Query: 1735 KRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
+ + + + S ++ + ++L E A + E L LQ+ Q
Sbjct: 112 RLQKELARIKQLSANAIELDEENRELREELAE---LKQENEALEAENERLQENEQ 163
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 38.4 bits (89), Expect = 0.040
Identities = 38/236 (16%), Positives = 87/236 (36%), Gaps = 35/236 (14%)
Query: 1541 DIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLA-----KANDL 1595
D +IS V + ++ + E+ ++ ++ + + D + + A+
Sbjct: 173 DTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKA 232
Query: 1596 KRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDL 1655
++KA+ + AD ++ V++ EA+ A+ + K+D A +I
Sbjct: 233 QQKADFAQDNADKQRDEVRQ---KQQEAKNLPKPADT--SSPKEDKQVAENQKREIEKAQ 287
Query: 1656 DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLR 1715
+ ++ E LK + + EK E + AQK+ +
Sbjct: 288 IEIKKNDEEALKAKDHKAFDLKQ-------------ESKASEKEAEDKELEAQKKREPVA 334
Query: 1716 SRYQETDNKLLSKA--------ESS----GLKRARGQMLLNKASQLSVNTTAKLKQ 1759
Q+T ++ ++ +SS GLK L++ + + T +L++
Sbjct: 335 EDLQKTKPQVEAQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLKTEVRLRE 390
Score = 31.9 bits (72), Expect = 3.9
Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 4/83 (4%)
Query: 1427 AKDYAETSKKLINEKESKAEELRKHM-IILSESNSAKDYAETSKKLINEKEAKAEEVFRG 1485
D + +K + + A++ R + E+ + A+TS E + AE R
Sbjct: 226 QIDADKAQQKA-DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSP--KEDKQVAENQKRE 282
Query: 1486 ITTAKQESHAANILAKEAYDHAA 1508
I A+ E + A +A DH A
Sbjct: 283 IEKAQIEIKKNDEEALKAKDHKA 305
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 37.6 bits (88), Expect = 0.044
Identities = 31/133 (23%), Positives = 64/133 (48%), Gaps = 23/133 (17%)
Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV-ALAQKQA---SGLRSRYQETDNKL 1725
+ L +RL A ++ R +E + K LE ++ L QK++ S L S Y++ +L
Sbjct: 7 QNLNDRL----ASYIDKVRF---LEQQNKELEAKIEELRQKKSAEPSRLYSLYEQEIREL 59
Query: 1726 LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETE-AMFNSQESELTELSKNI--- 1781
+ + +RAR Q+ ++ +L+ ++ E E + S E+++ L K++
Sbjct: 60 RKQLDELTNERARLQLEIDNL-RLAA---EDFREKYEDELNLRQSAENDIVGLRKDLDEA 115
Query: 1782 ----AELQKRIQS 1790
+L+ +I+S
Sbjct: 116 TLARVDLEMKIES 128
>gnl|CDD|216100 pfam00754, F5_F8_type_C, F5/8 type C domain. This domain is also
known as the discoidin (DS) domain family.
Length = 128
Score = 35.6 bits (82), Expect = 0.051
Identities = 12/62 (19%), Positives = 23/62 (37%), Gaps = 5/62 (8%)
Query: 112 KTISWWQSENGVENVTIQLDMEAEFHFTHLIIYF----KTFRPAAMLIERSHDFGKTWQV 167
+ W S+ G +Q+D+ T ++ + IE S+D G+ W
Sbjct: 23 NGSTAWSSKWGDAPQWLQVDLGKPKKITGVVTQGRQDGGNGYVKSYKIEYSND-GENWTE 81
Query: 168 YR 169
+
Sbjct: 82 VK 83
>gnl|CDD|146106 pfam03302, VSP, Giardia variant-specific surface protein.
Length = 397
Score = 37.6 bits (87), Expect = 0.056
Identities = 76/360 (21%), Positives = 108/360 (30%), Gaps = 69/360 (19%)
Query: 851 CDQCSPGYWNYPTCQRCICNGHADLCDSKTGTCISCRNSTDGENCERCINGYYGDPRLGI 910
CD+C PGY +C + KT C +C N E CE C + Y P
Sbjct: 1 CDECKPGYELSADKTKCTSSAPC-----KTENCKACSND-KREVCEECNSNNYLTPTSQC 54
Query: 911 DIPCRPCPCPGTVESGLSYADTCQLDPRS--QDVICECKEGYRGTRCDGCLDNYFGNPQE 968
C G Y T + + + + CK +C C D ++ +
Sbjct: 55 IDDCAKI--------GNYYYTTNANNKKICKECTVANCKTCEDQGQCQACNDGFYKSGDA 106
Query: 969 RGGSCEPCDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGNALEQNCTEC 1028
E C + + C LTGK L RY +G C G C C
Sbjct: 107 CSPCHESCKTCSGGTASDCTEC--LTGKAL--RYGNDGTKGT-CGEGCTTGTGAGACKTC 161
Query: 1029 TCNILGIDHSK-----------GPCD----RTTGQCPCLPNVIGVSCDRCLKNHWKIASG 1073
I G + G C R T C G+ C C ++++ G
Sbjct: 162 GLTIDGTSYCSECATETEYPQNGVCTSTAARATATCKASSVANGM-CSSCANGYFRMNGG 220
Query: 1074 TGCEPCDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNPN--VECFPCDCD 1131
E ++ G+ C+ G C + + N V C P C
Sbjct: 221 ---------------CYETTKFPGKSVCEEANSGGTCQKEAPGYKLNNGDLVTCSP-GCK 264
Query: 1132 PSGSAT-------------SQCHQNNGTCVCHQGIGGVRCDTCARGYIGTAPDCSPCGEC 1178
S T C + + +C G C TCA GY + C C
Sbjct: 265 TCTSNTVCTTCMDGYVKTSDSCTKCDSSCETCTG-ATTTCKTCATGYYKSGTGCVSCTSS 323
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 37.9 bits (88), Expect = 0.060
Identities = 47/271 (17%), Positives = 86/271 (31%), Gaps = 44/271 (16%)
Query: 1559 EQIRERANEINN-------VIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKN 1611
Q+ + A N + + + D L E + A + K T+
Sbjct: 217 AQVLQDAEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEA 276
Query: 1612 TVQKIVDVL-TEARTAQDMAEVAIQTAK----------DDISAARKDLSQITNDLD---D 1657
V EA A+ +AE AIQ K + A + I + +
Sbjct: 277 EVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELE 336
Query: 1658 AQQKSNETNVKVKLLQERLKSI-----QAGFLQNGRSAMDVENEEKNLEKEVALAQKQAS 1712
Q++S N + QE K+ G E EE ++ AL +
Sbjct: 337 PQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEA 396
Query: 1713 GLRSRYQETDNKLLSKAESSGLKRARG----QMLLNKASQLSVNTTAKLKQLNET-EAMF 1767
+ + + +KAE+ + A + + K + L E + +
Sbjct: 397 AEQEQVEIAVRAEAAKAEAE-AQAAEIKAEAEAIREKGKA----EAEAKRALAEAIQVLG 451
Query: 1768 NSQESE--------LTELSKNIAELQKRIQS 1790
++ +E L E+++ A+ K I S
Sbjct: 452 DAAAAELFKALVQALPEVAEEAAQPMKNIDS 482
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins and
components of bacterial type III secretion systems.
Length = 141
Score = 35.8 bits (83), Expect = 0.061
Identities = 18/114 (15%), Positives = 44/114 (38%), Gaps = 17/114 (14%)
Query: 1610 KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET---N 1666
+ +QK++D+ ++ A++ + A+ + L Q+ ++ +Q++ E
Sbjct: 1 EFRLQKLLDL---REKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAG 57
Query: 1667 VKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQE 1720
L F++ + ++E+AL Q++ R R E
Sbjct: 58 TSALELSN-----YQRFIRQLDQRIQ------QQQQELALLQQEVEAKRERLLE 100
>gnl|CDD|224176 COG1256, FlgK, Flagellar hook-associated protein [Cell motility and
secretion].
Length = 552
Score = 37.8 bits (88), Expect = 0.062
Identities = 43/189 (22%), Positives = 70/189 (37%), Gaps = 35/189 (18%)
Query: 1623 ARTAQDMAEVAIQTAKDDISAA------RKDLSQITNDLDDAQQKSNETNVKVK------ 1670
A + + A+ A+ +IS A R+ + Q TN + T V V
Sbjct: 10 ALSGLNAAQAALDVTGHNISNANTPGYSRQRVIQTTNIPYLGGGLNVGTGVNVVSIQRLR 69
Query: 1671 ----LLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNK-- 1724
Q R + Q+ +L S + + E L++ S L + + N
Sbjct: 70 DEFLTNQYRNANSQSSYLDTRASQLS--------QLESLLSEPSESSLSTLLNDFFNSLQ 121
Query: 1725 -LLSKAESSGLKRARGQMLLNKASQLSV---NTTAKLKQL-NETEAMFNSQESELTELSK 1779
L S + + Q +L+KA L NT +L L + A + E+ L K
Sbjct: 122 ELASNPSDTAAR----QAVLSKAQTLVNQINNTYEQLTDLRKDINAEIAATVDEVNSLLK 177
Query: 1780 NIAELQKRI 1788
IA+L K+I
Sbjct: 178 QIADLNKQI 186
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the synaptonemal
complex. Synaptonemal complexes are structures that are
formed between homologous chromosomes during meiotic
prophase.
Length = 787
Score = 37.8 bits (87), Expect = 0.072
Identities = 137/648 (21%), Positives = 255/648 (39%), Gaps = 104/648 (16%)
Query: 1188 GLADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTI 1247
GL+ +K+ +A IK+ S E + E K+++ + ++E ++ +L +
Sbjct: 75 GLSRLYSKLYKEAEKIKKWKVS--IEAELKQKENKLQENRKIIEAQRKAIQELQFENEKV 132
Query: 1248 D-DMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQ 1306
++++I N+D + K T NL + E+ KK E ++ L
Sbjct: 133 SLKLEEEIQENKD---LIKENNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLN 189
Query: 1307 ENNVEGAL----ELTRQAHDTSL----KLKEQ-------STETEKQINDAERQCKRTENL 1351
NN+E + EL QA + L KLKE E +K+IND E+Q
Sbjct: 190 -NNIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLLIQ 248
Query: 1352 VTK---TFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGC 1408
+T+ T EE+ +L EK ++NL +L E D+L
Sbjct: 249 ITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIE--------KKDHLTKELED 300
Query: 1409 GKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRK----HMIILSESNSAKDY 1464
K + + + +T +L EKE++ EEL K H +++E +
Sbjct: 301 IKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCS 360
Query: 1465 AETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNI 1524
E + ++ K E+ + IT Q+ S+
Sbjct: 361 LEELLRTEQQRLEKNEDQLKIITMELQKK----------------------------SSE 392
Query: 1525 TKQLDEFLNAPGATLADIRNISG-----LAVNKNIQSNPEQIRERANEINNVI----KSL 1575
+++ +F N L +++ I L K + E+++ + E+ ++ K +
Sbjct: 393 LEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEI 452
Query: 1576 TDIDTILTETAGD----LAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
D++ LT L + DL K L K + + T +L Q+ ++
Sbjct: 453 HDLEIQLTAIKTSEEHYLKEVEDL--KTELEKEKLKNIELTAHCDKLLLENKELTQEASD 510
Query: 1632 VAIQTAK--DDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNG-- 1687
+ ++ K +DI +K ++ +++ ++K E N L++ L+S++ F+Q G
Sbjct: 511 MTLELKKHQEDIINCKKQEERMLKQIENLEEK--EMN-----LRDELESVREEFIQKGDE 563
Query: 1688 -RSAMD-VENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNK 1745
+ +D E +++E EV +KQ L ++ N L + E+ K + L +
Sbjct: 564 VKCKLDKSEENARSIEYEVLKKEKQMKILENKC----NNLKKQIEN---KNKNIEELHQE 616
Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAEL----QKRIQ 1789
L +A+ KQLN E N E EL + E+ QK I+
Sbjct: 617 NKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIE 664
Score = 36.6 bits (84), Expect = 0.14
Identities = 113/586 (19%), Positives = 225/586 (38%), Gaps = 95/586 (16%)
Query: 1248 DDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE 1307
++ KK+I + E V L T++ N + DLT +++ L+E L E
Sbjct: 229 EEYKKEINDKE--KQVSLLLIQITEKEN----KMKDLTFLLEESRDKANQLEEKTKLQDE 282
Query: 1308 N---NVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQE 1364
N +E LT++ D + L ++S T+K + + + + TKT Q T E
Sbjct: 283 NLKELIEKKDHLTKELEDIKMSL-QRSMSTQKALEEDLQ-------IATKTICQLT---E 331
Query: 1365 ENEKALQRLNEK-------ITSLEKNLPDLNELI------CDKRGDPCDNLCGGAGCGKC 1411
E E ++ LN+ +T E L EL+ +K D +
Sbjct: 332 EKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKII--------- 382
Query: 1412 GGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKL 1471
T+ + + E +K N KE + EEL+ IL+E D + +K+
Sbjct: 383 --------TMELQKKSSELEEMTK-FKNNKEVELEELK---KILAEDEKLLDEKKQFEKI 430
Query: 1472 INEKEAKAEEVFRGITTAKQESHAANI-----------LAKEAYDHAASVRNKTAAYVAS 1520
E + K +E+ + ++E H I KE D + + +
Sbjct: 431 AEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIEL 490
Query: 1521 TSNITKQLDEFLNAPGATLADIRNISGLAVN-KNIQSNPEQIRERANEINNVIKSLTDID 1579
T++ K L E + S + + K Q + +++ + I++L + +
Sbjct: 491 TAHCDKLLLE-------NKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKE 543
Query: 1580 TILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKD 1639
L + + + + +K + K D ++ + I + + + E K
Sbjct: 544 MNLRDELESVRE--EFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKK 601
Query: 1640 DISAARKDLSQITNDLDDAQQKSNETN-------VKVKLLQERLKSIQAGF---LQNGRS 1689
I K++ ++ + ++K + N +KV L+ L S + F + N +
Sbjct: 602 QIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQK 661
Query: 1690 AMDVEN-EEKNLEKEVALAQKQASGLRSRYQETDNKLLSK-AESSGL----KRARGQMLL 1743
++ + E+ L +EV A+ A +E D + K AE L K +++
Sbjct: 662 EIEDKKISEEKLLEEVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIE 721
Query: 1744 NKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
+ S+L + + +Q + A+ E EL+ + + L+K+++
Sbjct: 722 ERDSELGLYKNKEQEQSSAKAAL----EIELSNIKAELLSLKKQLE 763
Score = 33.2 bits (75), Expect = 1.5
Identities = 59/290 (20%), Positives = 112/290 (38%), Gaps = 32/290 (11%)
Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
E L+ K +++ L++ + HDL + LT I ++ +
Sbjct: 432 EELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKE------------------- 472
Query: 1277 ARLALNDLTEEIKKLN-KTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE 1335
+ DL E++K K E+ LL EN ELT++A D +L+LK+ +
Sbjct: 473 ----VEDLKTELEKEKLKNIELTAHCDKLLLENK-----ELTQEASDMTLELKKHQEDII 523
Query: 1336 KQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKR 1395
ER K+ ENL K + + E+ +Q+ +E L+K+ + + +
Sbjct: 524 NCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVL 583
Query: 1396 GDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIIL 1455
C + E ++ A K K+ A E++ + + L
Sbjct: 584 KKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLEL 643
Query: 1456 SESNSAKDYAE---TSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
+++ + + E +K I +K+ E++ + AK + A L KE
Sbjct: 644 ELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKAKAIADEAVKLQKE 693
>gnl|CDD|224380 COG1463, Ttg2C, ABC-type transport system involved in resistance to
organic solvents, periplasmic component [Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 359
Score = 37.1 bits (86), Expect = 0.074
Identities = 39/217 (17%), Positives = 69/217 (31%), Gaps = 34/217 (15%)
Query: 1191 DQTTKVITDAS-NIKQTGASGA--------YTKEFELLEK---KIEDVK-ALVENTTRSS 1237
D + D++ +I+ TG G +L +E + +
Sbjct: 88 DSDYPIPADSTASIRTTGLLGEKYIELTPGDAGGPKLRLGATIPLERTQVPIDLEDLLG- 146
Query: 1238 HDLTVMLTTIDDMK-KQIMN--NEDPNGVGKTLGNTTQRINLARL--ALNDLTEEIKKLN 1292
DL ++L +D + I+N G G L NLA+ ALN +I L
Sbjct: 147 -DLLLLLGGLDPDRLNAILNEAAAALAGTGPQLNALLD--NLAQFTDALNARDGDIGAL- 202
Query: 1293 KTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDA----ERQCKRT 1348
+ LL +++ A + + D L ++DA
Sbjct: 203 -----IANLNQLL--DSLAAASDQLDRLLDNLATLTAALAARRDALDDALAALSALAATV 255
Query: 1349 ENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
+L+ + P Q L + + LE+ L
Sbjct: 256 NDLLAENRPNLNQALANLRPLATLLVDYLPGLEQLLH 292
>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3. Aip3p/Bud6p is a
regulator of cell and cytoskeletal polarity in
Saccharomyces cerevisiae that was previously identified
as an actin-interacting protein. Actin-interacting
protein 3 (Aip3p) localizes at the cell cortex where
cytoskeleton assembly must be achieved to execute
polarized cell growth, and deletion of AIP3 causes gross
defects in cell and cytoskeletal polarity. Aip3p
localization is mediated by the secretory pathway,
mutations in early- or late-acting components of the
secretory apparatus lead to Aip3p mislocalization.
Length = 426
Score = 37.0 bits (86), Expect = 0.091
Identities = 31/198 (15%), Positives = 75/198 (37%), Gaps = 12/198 (6%)
Query: 1469 KKLINEKEAKAEEVFRGITTA--KQES---HAANILAKEAYDHAASVRNKTAAYVASTSN 1523
KK I+++ + + A Q A + A + + +S+S
Sbjct: 80 KKHIDDEIDTLQNELDEVKQALESQREAIQRLKERQQNSAANIARPAASPSPVLASSSSA 139
Query: 1524 ITKQLDEFLNAPGATLADIRNIS-GLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTIL 1582
I+ + A+++++ LAV + + E I ++++ + +
Sbjct: 140 ISLA-NNPDKLNKEQRAELKSLQRELAVLRQTHN--SFFTEIKESIKDILEKIDKFKSSS 196
Query: 1583 TETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDIS 1642
+G +A K L+ +DS V + D++ R +D+A+ ++ +K +
Sbjct: 197 LSASGSSNRAYVESSKKKLS-EDSDSLLTKVDDLQDIIEALR--KDVAQRGVRPSKKQLE 253
Query: 1643 AARKDLSQITNDLDDAQQ 1660
+K+L +L ++
Sbjct: 254 TVQKELETARKELKKMEE 271
Score = 30.8 bits (70), Expect = 7.2
Identities = 29/185 (15%), Positives = 68/185 (36%), Gaps = 34/185 (18%)
Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLAR 1278
++ +I+ ++ ++ ++ + + + ++ N+AR
Sbjct: 83 IDDEIDTLQNELDEVKQALESQREAIQRLKERQQNS------------------AANIAR 124
Query: 1279 LALNDL------TEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQST 1332
A + + I N ++ KE L+ +++ L + RQ H+ T
Sbjct: 125 PAASPSPVLASSSSAISLANNPDKLNKEQRAELK--SLQRELAVLRQTHN------SFFT 176
Query: 1333 ETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELIC 1392
E ++ I D + + ++ + E+ K +L+E SL + DL ++I
Sbjct: 177 EIKESIKDILEKIDKFKSSSLSASGSSNRAYVESSKK--KLSEDSDSLLTKVDDLQDIIE 234
Query: 1393 DKRGD 1397
R D
Sbjct: 235 ALRKD 239
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an obligate
intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification of
the inclusion membrane by a set of C. trachomatis-encoded
proteins collectively designated Incs. One of these Incs,
IncA, is functionally associated with the homotypic
fusion of inclusions. This family probably includes
members of the wider Inc family rather than just IncA.
Length = 186
Score = 35.9 bits (83), Expect = 0.098
Identities = 20/113 (17%), Positives = 50/113 (44%), Gaps = 5/113 (4%)
Query: 1634 IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDV 1693
++ + ++S ++ LS++ +L+D +++ E +++ L+E L+ ++ +
Sbjct: 74 LEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESL 133
Query: 1694 ENEEKNLEKEVALAQK----QASGLRSRYQETDNKLLSKAES-SGLKRARGQM 1741
E K L KE+ ++ + LR + L E+ L+ Q+
Sbjct: 134 EESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
Score = 32.8 bits (75), Expect = 0.99
Identities = 14/66 (21%), Positives = 34/66 (51%), Gaps = 4/66 (6%)
Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
+LK+Q +E ++++ D E + E + + + L+ L E++ SLE+++
Sbjct: 83 ELKQQLSELQEELEDLEERIAELE----SELEDLKEDLQLLRELLKSLEERLESLEESIK 138
Query: 1386 DLNELI 1391
+L + +
Sbjct: 139 ELAKEL 144
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts
with SMC proteins [Cell division and chromosome
partitioning].
Length = 622
Score = 36.9 bits (85), Expect = 0.12
Identities = 60/343 (17%), Positives = 121/343 (35%), Gaps = 30/343 (8%)
Query: 1472 INEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEF 1531
I + + + ++ I A + S L +E + S NK Y + + ++ E+
Sbjct: 273 IANLKTQNDNLYEKIQEAMKISQKIKTL-REKWRALKSDSNK---YENYVNAMKQKSQEW 328
Query: 1532 LNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAK 1591
+ L I+ E+I+ + I+ + K L + +
Sbjct: 329 PGK----------LEKLKSE--IELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQE 376
Query: 1592 ANDLKRKANLTKAGADS-------TKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAA 1644
L R+ + +D K Q I L + D I ++ I
Sbjct: 377 REKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHN 436
Query: 1645 RKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV 1704
D S N + S K + E IQ ++ +E + KNL+ ++
Sbjct: 437 VNDSSLKINIEQLFPKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDI 496
Query: 1705 ALAQKQASGLRSRYQETDNK-LLSKAESSGLKRA-RGQM--LLNKASQLSVNTTAKLKQL 1760
+ L E ++K LSK E+ A R ++ L + + L++ + +
Sbjct: 497 NELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLNLLSKTSILDA 556
Query: 1761 NETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYK 1803
E + S E +L EL ++ + +I + ++D ++ +K
Sbjct: 557 ---EQLVQSTEIKLDELKVDLNRKRYKIHKQVIHVIDITSKFK 596
Score = 36.5 bits (84), Expect = 0.16
Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 35/173 (20%)
Query: 1216 FELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRIN 1275
F+ LEK + +L++N TRS + ++D +I N E G
Sbjct: 409 FKSLEKTLRQYDSLIQNITRSRSQIG---HNVNDSSLKI-NIEQLFPKG----------- 453
Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE 1335
+ + E IKK + E N E + + + S+ L+E +
Sbjct: 454 ------SGINESIKKS-------------ILELNDEIQERIKTEENK-SITLEEDIKNLK 493
Query: 1336 KQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN 1388
IN+ + ++ E +++ +F +EENE+ L +I LEK L DLN
Sbjct: 494 HDINELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLN 546
>gnl|CDD|218762 pfam05816, TelA, Toxic anion resistance protein (TelA). This family
consists of several prokaryotic TelA like proteins. TelA
and KlA are associated with tellurite resistance and
plasmid fertility inhibition.
Length = 333
Score = 36.4 bits (85), Expect = 0.12
Identities = 30/119 (25%), Positives = 50/119 (42%), Gaps = 36/119 (30%)
Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL 1325
L + + A+ND T E +L +NA LL++N++E A ++A +
Sbjct: 236 ALKRQKLAADAQK-AVNDTTNE---------LLLKNAELLKQNSIEIA----KEAERGVI 281
Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN-------EKALQRLNEKI 1377
+ ET K+ NL+ T + Q QEE EK L++L E++
Sbjct: 282 DI-----ETLKKAFQ---------NLIE-TIDETLQIQEEGRQKRKQAEKELEQLEEEL 325
>gnl|CDD|219690 pfam07996, T4SS, Type IV secretion system proteins. Members of this
family are components of the type IV secretion system.
They mediate intracellular transfer of macromolecules via
a mechanism ancestrally related to that of bacterial
conjugation machineries.
Length = 195
Score = 35.8 bits (83), Expect = 0.12
Identities = 31/168 (18%), Positives = 61/168 (36%), Gaps = 22/168 (13%)
Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAG 1682
+ A+ + K I A++ + IT +N V L ++ +S+
Sbjct: 18 LQQLAQWAQ-QLDQLKQQIQQAKQQYNSITGSRGLGALLNNPALRSV-LPKD-WQSVYDL 74
Query: 1683 FLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQML 1742
L + + K+L + L + SR A+ A G+
Sbjct: 75 VLDGY-GGLS--SSAKSLYSKEKLFDTCSYPSGSRASICQRSADKAAQDK----AVGEQA 127
Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
++A+ N ++++QL + + + K IA+LQ RIQ+
Sbjct: 128 YDQAT----NRLSQIEQL--------LSQIDSAKDPKEIADLQARIQA 163
>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
kinase Etk; Provisional.
Length = 726
Score = 36.8 bits (85), Expect = 0.13
Identities = 41/189 (21%), Positives = 82/189 (43%), Gaps = 46/189 (24%)
Query: 1600 NLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQ 1659
LT G D + +I++ + Q++A A Q ++ + ++ L ++ ++LD A+
Sbjct: 231 ELTMTGDDP--QLITRILNSIANNYLQQNIARQAAQDSQS-LEFLQRQLPEVRSELDQAE 287
Query: 1660 QKSN---------ETNVKVKLLQERLKSI----------QAGFLQ-------NGRSAMD- 1692
+K N + N++ K + E++ ++ +A Q R+ ++
Sbjct: 288 EKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQLNELTFREAEISQLYKKDHPTYRALLEK 347
Query: 1693 ---VENEEKNLEKEV-ALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
+E E K L K V A+ Q LR LS+ +G RA LLN+ +
Sbjct: 348 RQTLEQERKRLNKRVSAMPSTQQEVLR----------LSRDVEAG--RAVYLQLLNRQQE 395
Query: 1749 LSVNTTAKL 1757
LS++ ++ +
Sbjct: 396 LSISKSSAI 404
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
Provisional.
Length = 387
Score = 36.3 bits (84), Expect = 0.13
Identities = 33/130 (25%), Positives = 41/130 (31%), Gaps = 22/130 (16%)
Query: 1427 AKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGI 1486
K AE KK E KA K +AK AE KK E + KA
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKK--AEAEAAAKAAAEAKKKAEAEAKKKAAAE---- 214
Query: 1487 TTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQ---LDEFLNAPGATLADIR 1543
K + A AK A + A+ AA A + K +D+
Sbjct: 215 AKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFG---------- 264
Query: 1544 NISGLAVNKN 1553
GL KN
Sbjct: 265 ---GLDSGKN 271
Score = 32.1 bits (73), Expect = 3.0
Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 5/84 (5%)
Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRS 1716
QQ++ E K QERLK ++ L A + + + + K+ AL QKQA +
Sbjct: 84 KEQQQAEELQQKQAAEQERLKQLEKERLA----AQEQKKQAEEAAKQAALKQKQAEEAAA 139
Query: 1717 RYQETDNKLLSKAESSGLKRARGQ 1740
+ K ++AE+ A +
Sbjct: 140 K-AAAAAKAKAEAEAKRAAAAAKK 162
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869). This
family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 36.9 bits (85), Expect = 0.14
Identities = 54/385 (14%), Positives = 125/385 (32%), Gaps = 38/385 (9%)
Query: 1417 SNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKL--INE 1474
S S+ + + E E E + + K E + I+E
Sbjct: 399 SADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVLKLVME-KHIISEISE 457
Query: 1475 KEAKAEEVFRGITTAKQESHAANILAKEAYD----HAASVRNKT-AAYVASTSNITKQLD 1529
E + + +A A R++ V+ S T L+
Sbjct: 458 ILEDIEAELDSLERSSNGDDENEEVAMVGSLVEESSAVIERSQELEGAVSRISEFTSVLE 517
Query: 1530 EFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDL 1589
+ L +++ ++ + N KSL D+ +E +
Sbjct: 518 HEVTVCQDLLDG---------KADLEKFIQEFSLTLEWVVNQEKSLQDVSVEASEIKKNF 568
Query: 1590 AKANDLKRKANLTKAGADSTKNTVQKIVDVL----TEARTAQDMAEVAIQTAKDDISAAR 1645
+++ N + + + ++ L + ++ +K+ +
Sbjct: 569 LGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKCKDDLEESKNKLQETE 628
Query: 1646 KDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA 1705
K L ++ ++LD +Q+ ++ ++K + E +S++ ++ + +LE E+
Sbjct: 629 KKLEELKSELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELE 688
Query: 1706 LAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEA 1765
++ L ++ +E + K+ + + Q K+KQ E A
Sbjct: 689 KERQNHQELIAKCRELEEKIERAEQ-------------EENMQKLDEEEQKIKQEKEIAA 735
Query: 1766 MFNSQESELTELSKNIAELQKRIQS 1790
+L E + I L K++QS
Sbjct: 736 ----AAEKLAECQETILSLGKQLQS 756
>gnl|CDD|172947 PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional.
Length = 175
Score = 35.2 bits (81), Expect = 0.14
Identities = 32/133 (24%), Positives = 57/133 (42%), Gaps = 17/133 (12%)
Query: 1561 IRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADST------KNTVQ 1614
+ ER I + I +A D A+A K + L KA A++ K +
Sbjct: 47 LEEREKGIQSSIDR--------AHSAKDEAEAILRKNRELLAKADAEADKIIREGKEYAE 98
Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDI-SAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
K+ +TE A A+ I +AK++I R+ L + N++ D K E ++ L
Sbjct: 99 KLRAEITE--KAHTEAKKMIASAKEEIEQEKRRALDVLRNEVADLAVKGAEKIIRTSLDA 156
Query: 1674 ERLKSIQAGFLQN 1686
++ K + +Q+
Sbjct: 157 DKQKKVVDSMIQD 169
>gnl|CDD|117396 pfam08826, DMPK_coil, DMPK coiled coil domain like. This domain is
found in the myotonic dystrophy protein kinase (DMPK) and
adopts a coiled coil structure. It plays a role in
dimerisation.
Length = 61
Score = 32.5 bits (74), Expect = 0.15
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 1660 QKSNETNVKVK-LLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSR 1717
Q + E ++ K LQE L+ ++A + + E + + LE EV +K+ LR+R
Sbjct: 3 QSALEAEIRAKQSLQEELEKVKAANINFESKLQEAEAKNRELEAEVRQLKKRMEELRAR 61
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular medium
across both inner and outer membranes. The secretion
signal is found in the C-terminus of the transported
protein. This model represents the adaptor protein
between the ATP-binding cassette (ABC) protein of the
inner membrane and the outer membrane protein, and is
called the membrane fusion protein. This model selects a
subfamily closely related to HlyD; it is defined narrowly
and excludes, for example, colicin V secretion protein
CvaA and multidrug efflux proteins [Protein fate, Protein
and peptide secretion and trafficking].
Length = 423
Score = 36.1 bits (84), Expect = 0.15
Identities = 33/175 (18%), Positives = 63/175 (36%), Gaps = 17/175 (9%)
Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQ 1680
EA A+ ++V A+ AR L + + + + + + E +K Q
Sbjct: 79 VEADAAELESQVLRLEAE----VAR--LRAEADSQAAIEFPDDLLSAEDPAVPELIKGQQ 132
Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
+ F +S + + E + + Q +A + Q + + S L+ AR +
Sbjct: 133 SLFESR-KSTLRAQLELILAQ----IKQLEAELAGLQAQLQALRQQLEVISEELE-ARRK 186
Query: 1741 MLLNKASQL--SVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
+ K L + ++ E + E+EL L + I ELQ Q
Sbjct: 187 L---KEKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQ 238
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 36.2 bits (84), Expect = 0.17
Identities = 35/138 (25%), Positives = 57/138 (41%), Gaps = 30/138 (21%)
Query: 1610 KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAAR-KDLSQITNDLDDAQQ-----KSN 1663
K + QK +L E + Q E A R K L+ + + L QQ ++N
Sbjct: 192 KQSQQK--TLLYEQQAQQQKLEQA--------RNERKKTLTGLESSLQKDQQQLSELRAN 241
Query: 1664 ETNVKVKLLQ-ERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
E+ ++ + + ER +A E E + + V QKQA S Y+ T+
Sbjct: 242 ESRLRDSIARAEREAKARA------------EREAREAAR-VRDKQKQAKRKGSTYKPTE 288
Query: 1723 NKLLSKAESSGLKRARGQ 1740
++ + + GL R RGQ
Sbjct: 289 SERSLMSRTGGLGRPRGQ 306
Score = 31.6 bits (72), Expect = 4.9
Identities = 23/115 (20%), Positives = 49/115 (42%), Gaps = 42/115 (36%)
Query: 1673 QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS 1732
+++LKSIQ D+ K EK V Q+Q + L ++ ++ + + +++S
Sbjct: 46 RDQLKSIQQ----------DIA--AK--EKSVRQQQQQRASLLAQLKKQEEAI---SQAS 88
Query: 1733 GLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
++L ET+ N ++ EL+ +IA+L+++
Sbjct: 89 -------------------------RKLRETQNTLNQLNKQIDELNASIAKLEQQ 118
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 36.6 bits (85), Expect = 0.17
Identities = 29/112 (25%), Positives = 47/112 (41%), Gaps = 19/112 (16%)
Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAH----------DTSLKL 1327
R LN+ +E + + +LQ ++ LE +RQA D+ +L
Sbjct: 85 RQQLNNERDEPRSVPPNMSTDALEQEILQVSS--QLLEKSRQAQQEQDRAREISDSLSQL 142
Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFP----QFTQGQEENEKALQRLNE 1375
+Q TE +Q+N+ ER R + L T P Q T Q E+ ++E
Sbjct: 143 PQQQTEARRQLNEIER---RLQTLGTPNTPLAQAQLTALQAESAALKALVDE 191
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents the
N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 33.7 bits (78), Expect = 0.18
Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 17/110 (15%)
Query: 1552 KNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKA-GADSTK 1610
K I+ NPE ++E+ + + D+D +L D +R+ + + +
Sbjct: 5 KLIRENPEAVKEK---LRKRGVDVLDVDELL---------ELDEERRELQVELEELQAER 52
Query: 1611 NTVQKIVDVL--TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
N + K + + +AEV + KD++ A +L ++ +LD
Sbjct: 53 NELSKEIGKAKKKKEDAEALIAEV--KELKDELKALEAELRELEAELDKL 100
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 35.0 bits (81), Expect = 0.19
Identities = 18/88 (20%), Positives = 39/88 (44%), Gaps = 7/88 (7%)
Query: 1629 MAEVAIQTA-KDDISAARKDLSQITNDLDDAQQKSNETNV---KVKLLQERLKSIQAGFL 1684
M++ +T D + S+ ++ ++ NE ++ L+ +L+ ++ +L
Sbjct: 1 MSDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYL 60
Query: 1685 QNGRSAMDVENEEKNLEKEVALAQKQAS 1712
R+ + EN K E+E A+K A
Sbjct: 61 ---RAQAEFENLRKRTEREREEAKKYAI 85
>gnl|CDD|163494 TIGR03782, Bac_Flav_CT_J, Bacteroides conjugative transposon TraJ
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage. This family is related conjugation system
proteins in the Proteobacteria, including TrbL of
Agrobacterium Ti plasmids and VirB6 [Cellular processes,
DNA transformation].
Length = 322
Score = 35.8 bits (83), Expect = 0.19
Identities = 20/86 (23%), Positives = 45/86 (52%), Gaps = 8/86 (9%)
Query: 1501 KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQ 1560
K+ ++ A VR+ AY+ S ++LDE G + +D++ ++G+ + + + +
Sbjct: 108 KDKLEYEAMVRDPETAYLVSDEEFDRKLDEL----GWSPSDMKTMAGMYMERAAYNMKKS 163
Query: 1561 IRERANEINNVI---KSLTDIDTILT 1583
+R+ E+ ++ +L IDT+ T
Sbjct: 164 VRDWFRELLELLFQAAALV-IDTLRT 188
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This family
consist of exonuclease VII, large subunit EC:3.1.11.6
This enzyme catalyzes exonucleolytic cleavage in either
5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme is
composed of one large subunit and 4 small ones.
Length = 295
Score = 35.7 bits (83), Expect = 0.20
Identities = 32/133 (24%), Positives = 53/133 (39%), Gaps = 21/133 (15%)
Query: 1630 AEVAIQTAKDDISAARKDLSQITNDLDDA-QQKSNETNVKVKLLQERLKSIQAGFLQNGR 1688
AE+ + + + L + L A + + + ++ LL+ERLKS+ L+
Sbjct: 138 AELLVPDRTE----LLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSLSRRLLEQ-- 191
Query: 1689 SAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
+EE+ E L L SR Q S+ E L R + L +
Sbjct: 192 ------HEERLAELRDRLISS-IQNLLSRKQ-------SRLERLILNRELEKNSLLENKL 237
Query: 1749 LSVNTTAKLKQLN 1761
+ N TA+LK L+
Sbjct: 238 ATANLTAQLKALS 250
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 34.5 bits (80), Expect = 0.21
Identities = 30/148 (20%), Positives = 59/148 (39%), Gaps = 32/148 (21%)
Query: 1654 DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
D+ +S K L++ K +QA +++ +EK L+KE QKQA+
Sbjct: 22 DVQKVLSESPAGKAAQKQLEKEFKKLQA----------ELQKKEKELQKEEQKLQKQAA- 70
Query: 1714 LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQ-ES 1772
LS+ ++A+ Q L K +L A ++L + +
Sbjct: 71 -----------TLSEEA----RKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYD 115
Query: 1773 ELTELSKNIAELQKRIQSCINFIVDKSN 1800
++ + K +A+ + + ++DKS
Sbjct: 116 KIDKAIKEVAKEKG-----YDLVLDKSA 138
>gnl|CDD|220306 pfam09599, IpaC_SipC, Salmonella-Shigella invasin protein C
(IpaC_SipC). This entry represents a family of proteins
associated with bacterial type III secretion systems,
which are injection machines for virulence factors into
host cell cytoplasm. Characterized members of this
protein family are known to be secreted and are described
as invasins, including IpaC from Shigella flexneri and
SipC from Salmonella typhimurium. Members may be referred
to as invasins, pathogenicity island effectors, and cell
invasion proteins.
Length = 337
Score = 35.6 bits (82), Expect = 0.22
Identities = 45/265 (16%), Positives = 78/265 (29%), Gaps = 45/265 (16%)
Query: 1498 ILAKEAYDHAAS--VRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKN-- 1553
+L+ +A AA+ VR AA +S Q+ + SG A+ KN
Sbjct: 110 LLSFDATKSAAASMVREGKAALSSSIIGSATQVGITGVGAKLQAKGLSKQSG-ALKKNLK 168
Query: 1554 -IQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNT 1612
+ + ++N D + + + D+TK
Sbjct: 169 PSNKLSAEAAQLKLQLNGQNAP---------------KLVADELKHVKIKRTTGDATKML 213
Query: 1613 VQKIVDVLTEARTAQDMAEVAIQTAKDDISA-ARKDLSQITND--LDDAQQKSNETNVKV 1669
A AE Q ++ S + DL Q + AQ
Sbjct: 214 ----QKNKITASNKALSAEHKAQLSQRVESLQDKIDLHQQAYEQNTLKAQAMQM----IG 265
Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
L+ S AG + G EE ++ + + A + ET
Sbjct: 266 DLIMR--MSANAGNISGGSGQYAATQEEAE-QQISQASSRTA---STASNETR------- 312
Query: 1730 ESSGLKRARGQMLLNKASQLSVNTT 1754
E+S + Q LL ++ + +
Sbjct: 313 EASRKTKQLIQELLKILESINQSKS 337
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like GTP-binding
Golgi protein, and contribute to Golgi organisation in
animal and plant cells. The Rab6-binding domain appears
to be the same region as this C-terminal family.
Length = 121
Score = 33.7 bits (78), Expect = 0.23
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 1276 LARLAL--NDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK-LKEQST 1332
LARL ++ +EI KL + E L+E ++E +E LE Q ++T+L+ L E+S
Sbjct: 40 LARLEAERDEARQEIVKLTEENEELRELKKEIEE--LEKELEDLEQRYETTLELLGEKSE 97
Query: 1333 ETEKQIND 1340
E+ D
Sbjct: 98 RVEELRAD 105
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner membrane
complex of itself with TolQ and TolR to the outer
membrane complex of TolB and OprL (also called Pal). Most
of the length of the protein consists of low-complexity
sequence that may differ in both length and composition
from one species to another, complicating efforts to
discriminate TolA (the most divergent gene in the tol-pal
system) from paralogs such as TonB. Selection of members
of the seed alignment and criteria for setting scoring
cutoffs are based largely conserved operon struction.
//The Tol-Pal complex is required for maintaining outer
membrane integrity. Also involved in transport (uptake)
of colicins and filamentous DNA, and implicated in
pathogenesis. Transport is energized by the proton motive
force. TolA is an inner membrane protein that interacts
with periplasmic TolB and with outer membrane porins
ompC, phoE and lamB [Transport and binding proteins,
Other, Cellular processes, Pathogenesis].
Length = 346
Score = 35.6 bits (82), Expect = 0.23
Identities = 41/135 (30%), Positives = 56/135 (41%), Gaps = 11/135 (8%)
Query: 1423 ESNSAKDYAETSKKLINEKESKA-EELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
E AK AE K E E KA EE +K +E + A +KK E + KAE
Sbjct: 122 EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ----AEEEAKAKAAAEAKKKAAEAKKKAEA 177
Query: 1482 VFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNA-PGATLA 1540
AK E+ A A+EA A + + K AA A+ + A A A
Sbjct: 178 E----AKAKAEA-KAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEA 232
Query: 1541 DIRNISGLAVNKNIQ 1555
++ +I GLA N +
Sbjct: 233 ELGDIFGLASGSNAE 247
Score = 34.0 bits (78), Expect = 0.67
Identities = 27/86 (31%), Positives = 35/86 (40%), Gaps = 15/86 (17%)
Query: 1431 AETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAK 1490
AE + K EK+ +AEE + AK AE K E E KA+E + +
Sbjct: 107 AEQAAKQAEEKQKQAEEAK-----------AKQAAEAKAKAEAEAEKKAKEEAK----KQ 151
Query: 1491 QESHAANILAKEAYDHAASVRNKTAA 1516
E A A EA AA + K A
Sbjct: 152 AEEEAKAKAAAEAKKKAAEAKKKAEA 177
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 34.4 bits (80), Expect = 0.24
Identities = 21/103 (20%), Positives = 43/103 (41%), Gaps = 13/103 (12%)
Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET--NVKVKL 1671
+KI D L A A+ E+A + ++ AR + ++I ++ A++++ + K +
Sbjct: 38 KKIADGLAAAERAKKELELAQAKYEAQLAEARAEAAEI---IEQAKKRAAQIIEEAKAEA 94
Query: 1672 LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA-----LAQK 1709
E + + + E L K+VA A+K
Sbjct: 95 EAEAARIKAQAQAEIEQERKRAREE---LRKQVADLAVAGAEK 134
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are RNase
Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in turnover
of yitJ riboswitch [Transcription, Degradation of RNA].
Length = 514
Score = 35.7 bits (83), Expect = 0.26
Identities = 31/170 (18%), Positives = 60/170 (35%), Gaps = 58/170 (34%)
Query: 1624 RTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGF 1683
R A+ A + AK I A+K+ + +K K ++ + R
Sbjct: 21 RIAEKKLGSAEELAKRIIEEAKKEA--------ETLKKEALLEAKEEVHKLR-------- 64
Query: 1684 LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLL 1743
LE+E+ ++ R+ Q + +LL + E+ L R
Sbjct: 65 --------------AELERELK--ER-----RNELQRLERRLLQREET--LDR------- 94
Query: 1744 NKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
K++ L++ E +E EL+ KN+ E ++ ++ I
Sbjct: 95 ------------KMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA 132
>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug resistance
protein A). [Transport and binding proteins, Other].
Length = 334
Score = 35.2 bits (81), Expect = 0.29
Identities = 45/191 (23%), Positives = 72/191 (37%), Gaps = 42/191 (21%)
Query: 1577 DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT 1636
+ + ++ +G + + N + K G DVL R AE+A+
Sbjct: 41 NQLQVSSQVSGSVIEVN--VDDTDYVKQG------------DVL--VRLDPTNAELALAK 84
Query: 1637 AKDDISAARK--DLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVE 1694
A+ +++A + +IT Q E+ K+KL Q R K +QA
Sbjct: 85 AEANLAALVRQTKQLEITV---QQLQAKVESL-KIKLEQAREKLLQA------------- 127
Query: 1695 NEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTT 1754
E +L + V L +K GL SR + + + + L A + LN L T
Sbjct: 128 --ELDLRRRVPLFKK---GLISREELDHARKALLSAKAALNAAIQEQ-LNANQALVRGTP 181
Query: 1755 AKLKQLNETEA 1765
K KQ EA
Sbjct: 182 LK-KQPAVQEA 191
Score = 34.4 bits (79), Expect = 0.52
Identities = 29/137 (21%), Positives = 47/137 (34%), Gaps = 23/137 (16%)
Query: 1594 DLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
DL+R+ L K G S + L AR A A+ A+ A + A + L + T
Sbjct: 130 DLRRRVPLFKKGLISREE--------LDHARKALLSAKAALNAAIQEQLNANQALVRGTP 181
Query: 1654 DLDDAQQKSNETNVKVKLLQ-ERLK------------SIQAG-FLQNGRSAMDVENEEKN 1699
+ + +K L +R +Q G + G+ M V E
Sbjct: 182 LKKQPAVQEAKERLKTAWLALKRTVIRAPFDGYVARRFVQVGQVVSPGQPLMAVVPAE-Q 240
Query: 1700 LEKEVALAQKQASGLRS 1716
+ E + Q +R
Sbjct: 241 MYVEANFKETQLKNVRI 257
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This family
consists of several eukaryotic proteins of unknown
function.
Length = 536
Score = 35.1 bits (81), Expect = 0.35
Identities = 41/234 (17%), Positives = 87/234 (37%), Gaps = 21/234 (8%)
Query: 1579 DTILTETAGDLAKANDLKRKANLTKA-GADSTKNTVQ--KIVDVLTEARTAQDMAEVAIQ 1635
+ + ++ A+L DS + + V A A ++ +
Sbjct: 200 EYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMD 259
Query: 1636 TAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQ-NGR-SAMDV 1693
T +D +L + Q++ +E ++ + +K++ + Q N + +
Sbjct: 260 TEEDRTKEREAELEAL-------QEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQ 312
Query: 1694 ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNT 1753
ENEE LE+E + K+ L + +N + A+ L A + LL A Q +
Sbjct: 313 ENEE--LEEEYKIK-KRTVELLP---DAENNV---AKLQALVVASSERLLELAQQWEAHR 363
Query: 1754 TAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCVS 1807
T + + + ++E E I +L+ +I+ + + K YK +
Sbjct: 364 TPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLD 417
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 34.9 bits (80), Expect = 0.36
Identities = 30/110 (27%), Positives = 41/110 (37%), Gaps = 15/110 (13%)
Query: 1427 AKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGI 1486
AK AE + K E ++KAE +AK AE K ++AKAE +
Sbjct: 169 AKKKAEEAAKAAEEAKAKAEAA-----------AAKKKAEAEAKA-AAEKAKAEAEAKAK 216
Query: 1487 TTAKQESHAANILAKE---AYDHAASVRNKTAAYVASTSNITKQLDEFLN 1533
K E+ A A E A A + + AA A LD+
Sbjct: 217 AEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFG 266
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 35.0 bits (80), Expect = 0.37
Identities = 26/136 (19%), Positives = 47/136 (34%), Gaps = 6/136 (4%)
Query: 1423 ESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEV 1482
N E ++ ++KA L K +E + ++ A+ K +AKA +
Sbjct: 49 PKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAAL 108
Query: 1483 FR----GITTAKQESHAA--NILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPG 1536
+ G +E AA A A AA++ + + ++ D+
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168
Query: 1537 ATLADIRNISGLAVNK 1552
A A + LA K
Sbjct: 169 AAAAAKAKAAALAKQK 184
>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
unknown].
Length = 108
Score = 32.8 bits (75), Expect = 0.37
Identities = 25/138 (18%), Positives = 44/138 (31%), Gaps = 35/138 (25%)
Query: 1254 IMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGA 1313
M G + + + +L K GE+ E A +
Sbjct: 3 YMGEGLRKLALIGAGLAA-------ETAEKVQKLVDELVKKGELNAEEAKRFVD------ 49
Query: 1314 LELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRL 1373
+L RQA KE E E++I + + + + L
Sbjct: 50 -DLLRQA-------KEAQGELEEKIPRKIEEMLSDLEVARQ-------------SEMDEL 88
Query: 1374 NEKITSLEKNLPDL-NEL 1390
E++ +LE+ + DL N+L
Sbjct: 89 TERVDALERQVADLENKL 106
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 35.3 bits (81), Expect = 0.44
Identities = 27/145 (18%), Positives = 44/145 (30%), Gaps = 26/145 (17%)
Query: 1663 NETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
N+T + LKS S D K LEK+ L + S + +Y+
Sbjct: 16 NDTIAVKNI--NVLKS----------SIKDSTQFWKMLEKQQKLTKDGLSASKGKYEGLS 63
Query: 1723 NKLLS--------KAESSGLKRA------RGQMLLNKASQLSVNTTAKLKQLNETEAMFN 1768
+ K E + RA Q + +Q + Q
Sbjct: 64 EAMEKQKKAYEDLKQEVKEVNRATQASKKAYQEYNAQYTQAENKLRSLSGQFGVASEQLM 123
Query: 1769 SQESELTELSKNIAELQKRIQSCIN 1793
Q+ E+ L I+ L K + +
Sbjct: 124 LQQKEIQRLQYAISTLNKSMAAQAR 148
>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex component.
Length = 233
Score = 34.2 bits (79), Expect = 0.49
Identities = 34/177 (19%), Positives = 59/177 (33%), Gaps = 53/177 (29%)
Query: 1244 LTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLK---- 1299
L + D + +++ N N +Q+ L +DL EIKKL + + +K
Sbjct: 21 LESFDSIYEKLENAT----------NASQKEKLE----SDLKREIKKLQRLRDQIKTWLS 66
Query: 1300 ----ENATLLQENN--VEGALE---------------------------LTRQAHDTSLK 1326
++ L EN +E A+E ++ +
Sbjct: 67 SSDIKDKDSLLENRKLIETAMERFKAVEKEMKTKAFSKEGLSAASKLDPKEKEKAEVCDF 126
Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQ--EENEKALQRLNEKITSLE 1381
L E KQI E + + K + + E E+ L+R I+ LE
Sbjct: 127 LSNTIDELNKQIELLEAEVESLSVQRKKRKADAAKQERIAELEELLERHKFHISKLE 183
>gnl|CDD|226371 COG3853, TelA, Uncharacterized protein involved in tellurite
resistance [Inorganic ion transport and metabolism].
Length = 386
Score = 34.7 bits (80), Expect = 0.50
Identities = 36/143 (25%), Positives = 60/143 (41%), Gaps = 32/143 (22%)
Query: 1244 LTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARL-ALNDLTEEIKKLNKTGEMLKENA 1302
+TTI K Q+ + TL Q+ L A+N+ T E+L NA
Sbjct: 268 ITTIPIWKNQVA-------IALTLKR--QKKVLKAQQAVNE---------TTNELLLSNA 309
Query: 1303 TLLQENNVEGALELTRQAHDT-SLKLKEQST-----ETEKQINDAERQCKRTENLVTKTF 1356
+L+EN++E A + R D +LK +QS ET + + ++ T
Sbjct: 310 EMLKENSIEIAKQAERPTIDIETLKKTQQSVVAAIEETSQIQEEGRKKRADTIR------ 363
Query: 1357 PQFTQGQEENEKALQRLNEKITS 1379
+ + +EE +K L L + I+
Sbjct: 364 -ELDKLEEEIKKKLILLKQNISQ 385
Score = 33.5 bits (77), Expect = 1.1
Identities = 40/242 (16%), Positives = 70/242 (28%), Gaps = 66/242 (27%)
Query: 1190 ADQTTKVITDASNIKQTG---ASGAYTKEFELLEKKIEDVKALVEN-------------- 1232
A KV TDA + E +L + + L +
Sbjct: 11 ATINDKV-TDADLPEPENSEIVQKEKKAERDLQLENTKKADELAKQIDVSNQQSLLDFGA 69
Query: 1233 -----TTRSSHDL--TVMLTTIDDMKK------QIMNNEDPN---------------GVG 1264
S+ + V +T + D+ K I+ DP+
Sbjct: 70 EAQEKAAAFSNRMLNEVKITDVGDLGKMLEQLMFILKRFDPDELPSDKKGFLTKIFGRSK 129
Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATL--LQENNVEGALELTRQAHD 1322
++ + ++ + E + K K E+ ++N L L E N E L +
Sbjct: 130 SSIQEIFSKYQTIGAQIDRIIESLSK-GKD-ELTRDNKMLELLYEKNREYFEHLEKYIAA 187
Query: 1323 TSLKLKEQSTET---EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITS 1379
LK +E TE K ++ Q Q E + RL +++
Sbjct: 188 GELKDEELETEIIPELKTKAESGNQMD-------------VQQVNELTLFINRLEQRVYD 234
Query: 1380 LE 1381
L
Sbjct: 235 LL 236
>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
complex, found in both eukaryotes and archaea, that binds
and stabilizes newly synthesized polypeptides allowing
them to fold correctly. The complex contains two alpha
and four beta subunits, the two subunits being
evolutionarily related. In archaea, there is usually only
one gene for each subunit while in eukaryotes there two
or more paralogous genes encoding each subunit adding
heterogeneity to the structure of the hexamer. The
structure of the complex consists of a double beta barrel
assembly with six protruding coiled-coils.
Length = 129
Score = 32.7 bits (75), Expect = 0.51
Identities = 20/133 (15%), Positives = 46/133 (34%), Gaps = 17/133 (12%)
Query: 1558 PEQIRERAN----EINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTV 1613
+++ + ++ + + L ++ LTE LK+ +
Sbjct: 1 LQELAAQLQQLQQQLEALQQQLQKLEAQLTEYEKAKETLETLKKAEEEKELLVPLGAGLF 60
Query: 1614 QK---------IVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE 1664
K +VD+ T + + E AI+ K + K + ++ L+ Q + E
Sbjct: 61 VKAEVKDDDKVLVDLGTGVYVEKSLEE-AIEFLKKRLETLEKQIEKLEKQLEKLQDQITE 119
Query: 1665 TNVKVKLLQERLK 1677
+ LQ+ +
Sbjct: 120 LQEE---LQQLQQ 129
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 33.7 bits (77), Expect = 0.58
Identities = 31/146 (21%), Positives = 48/146 (32%), Gaps = 16/146 (10%)
Query: 1260 PNGVGKT---------LGNTTQRINLARLALNDLTE-EIKKLNKTGEMLKENATLLQENN 1309
PNG GKT L T R+ ++ + EI+K + E T
Sbjct: 27 PNGSGKTTILDAIRWALYGKTSRLKKSKGRGIVKGDIEIEKDEGKKKTYVE-ITFENNKG 85
Query: 1310 VEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQ----GQEE 1365
+ T K K+ + + E Q + + L+ GQE
Sbjct: 86 KLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQ-EFIDELLKSDKEILPLLLYLGQER 144
Query: 1366 NEKALQRLNEKITSLEKNLPDLNELI 1391
E+ + EK LE+ +L EL
Sbjct: 145 LEELKFKRKEKKERLEELEKELEELE 170
>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
Length = 144
Score = 32.8 bits (75), Expect = 0.61
Identities = 33/170 (19%), Positives = 57/170 (33%), Gaps = 51/170 (30%)
Query: 1552 KNIQSNPEQIRERANEINNVIKSL----TDIDTILTETAGDLAKANDLKRKANLTKAGAD 1607
+ +Q I E+ I+S+ T + ++ G KA
Sbjct: 20 QKLQEELSSIDMMKMELLKSIESMEGLKTSEEILIPLGPGAFLKA--------------- 64
Query: 1608 STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
KIVD + AI DI KD+S++ D +KS E
Sbjct: 65 -------KIVD-----------PDKAILGVGSDI-YLEKDVSEVIEDF----KKSVEELD 101
Query: 1668 KVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSR 1717
K K +E K I+ ++ K LEK +++ + ++ +
Sbjct: 102 KTK--KEGNKKIE-------ELNKEITKLRKELEKRAQAIEQRQAQMKPK 142
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in humans
and is thought to affect keratinocyte proliferation.
Length = 739
Score = 34.6 bits (79), Expect = 0.64
Identities = 28/112 (25%), Positives = 55/112 (49%), Gaps = 11/112 (9%)
Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTS 1324
++L + ++ AR + TEE L + L Q+ V G L + +
Sbjct: 535 ESLASAGAQLEAARAGQQESTEEAASLRQE---------LTQQQEVYGQA-LQEKVAEVE 584
Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEK 1376
+L+EQ ++TE+++N+A R+ + + + Q Q +E N++ L+RL E+
Sbjct: 585 TRLREQLSDTERRLNEARREHAKAVVSLRQIQRQAAQEKERNQE-LRRLQEE 635
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 32.6 bits (75), Expect = 0.73
Identities = 17/108 (15%), Positives = 42/108 (38%), Gaps = 11/108 (10%)
Query: 1694 ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNT 1753
E E K + E+ +K+ L+ + Q D LS+A + + + L K +
Sbjct: 24 EKEFKKRQAELEKLEKELQKLKEKLQ-KDAATLSEAA----REKKEKELQKKVQEFQRKQ 78
Query: 1754 TAKLKQLNETEA-MFNSQESELTELSKNIAELQKRIQSCINFIVDKSN 1800
+ L + + ++ + K +A+ + + ++D++
Sbjct: 79 QKLQQDLQKRQQEELQKILDKINKAIKEVAKKKG-----YDLVLDRNA 121
>gnl|CDD|149682 pfam08702, Fib_alpha, Fibrinogen alpha/beta chain family. Fibrinogen
is a protein involved in platelet aggregation and is
essential for the coagulation of blood. This domain forms
part of the central coiled coiled region of the protein
which is formed from two sets of three non-identical
chains (alpha, beta and gamma).
Length = 146
Score = 32.7 bits (75), Expect = 0.79
Identities = 9/42 (21%), Positives = 18/42 (42%)
Query: 1645 RKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQN 1686
K + + N LD ++ + VK +++ L+ Q N
Sbjct: 35 DKRIEDLENLLDQLANSTSSAHQYVKHIKDSLRGDQKQAQPN 76
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 33.5 bits (77), Expect = 0.82
Identities = 39/166 (23%), Positives = 82/166 (49%), Gaps = 24/166 (14%)
Query: 1644 ARKDLS---QIT-NDLDDAQQKSNETNVKVK----LLQERLKSIQAGFLQNGRSAMDVEN 1695
RK+L+ Q T D+ D ++ + N K+ L+E+LK + + + +
Sbjct: 104 KRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLKELIEQYELREQH-FEKLL 162
Query: 1696 EEKNLEKEVALA-----QKQASGLRSRY-QETDNKLLSKAESSGLKRARGQMLLNKASQL 1749
+ K+LE+++ A Q+QA + ++ +E + L A+ S LK ++ +QL
Sbjct: 163 KTKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKEL----RAQL 218
Query: 1750 SVNTTAKLKQ----LNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
++ + K K+ LN++ +F + + E+ ++SK I +L+K +
Sbjct: 219 NLYSD-KFKEFQDTLNKSNEVFLTFKQEMEKMSKKIKKLEKENLAW 263
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 33.9 bits (78), Expect = 0.82
Identities = 27/96 (28%), Positives = 44/96 (45%), Gaps = 7/96 (7%)
Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS--VNT 1753
+E + E AL + + LR R Q+ + ++ E L+R + L+ K QL
Sbjct: 44 KEAEEKLEAALLEAKELLLRERNQQ---RQEARREREELQREEER-LVQKEEQLDARAEK 99
Query: 1754 TAKLK-QLNETEAMFNSQESELTELSKNIAELQKRI 1788
L+ QL E E +++E EL EL K + R+
Sbjct: 100 LDNLENQLEEREKALSARELELEELEKQLDNELYRV 135
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family. It
is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are multi-domain
proteins that act as scaffolding proteins and transducers
of a variety of signaling pathways that link membrane
dynamics and the underlying actin cytoskeleton. Most
members contain an N-terminal IMD, an SH3 domain, and a
WASP homology 2 (WH2) actin-binding motif at the
C-terminus, exccept for MIM which does not carry an SH3
domain. Some members contain additional domains and
motifs. The IMD domain binds and bundles actin filaments,
binds membranes and produces membrane protrusions, and
interacts with the small GTPase Rac.
Length = 223
Score = 33.1 bits (76), Expect = 0.83
Identities = 42/215 (19%), Positives = 79/215 (36%), Gaps = 52/215 (24%)
Query: 1589 LAKANDLKRKAN-LTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKD 1647
+ ++ L++A + + KI ++ +++R +Q++ E Q I K
Sbjct: 26 IKAGKKYQKALQALSQAAKVFF-DALAKIGELASQSRGSQELGEALKQ-----IVDTHKS 79
Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALA 1707
+ + S E K E + + EK LE +
Sbjct: 80 I-----------EASLEQVAK-AFHGELILPL-----------------EKKLELD---- 106
Query: 1708 QKQASGLRSRYQETDNKLLSKAESSGLKRARGQML-LNKASQLSVNTTAKLKQLNETEAM 1766
QK + Y+ + K + L +AR ++ L K SQ T ++L++
Sbjct: 107 QKVINKFEKDYK-KEYK----QKREDLDKARSELKKLQKKSQ-KSGTGKYQEKLDQALEE 160
Query: 1767 FNSQESELTELSKNI---AELQKRIQSCINFIVDK 1798
N ++ EL A L++R + C F+VDK
Sbjct: 161 LNDKQKELEAFVSQGLRDALLEERRRYC--FLVDK 193
>gnl|CDD|112039 pfam03207, OspD, Borrelia outer surface protein D (OspD).
Length = 264
Score = 33.3 bits (75), Expect = 0.87
Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 22/205 (10%)
Query: 1424 SNSAKDYAETSKKL--INEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
+NS K+ T+ L NE E + + ++S SA D + ++K +++ AE
Sbjct: 69 TNSLKEAKNTTDNLNASNEANKVVEAVINAVNLIS---SAADQVKGAQKNMHDLAQMAEI 125
Query: 1482 VFRGITTAKQESHAANILAKEAYDHAASV-RNKTAAY--------VASTSNITKQLDEFL 1532
I + ++ A LAKEAY+ + +N Y S S+ +Q+ + +
Sbjct: 126 DLEKIKESSDKAIFAANLAKEAYNLTKAAEQNMQKLYKEQEEILESESDSDEIEQVSDEI 185
Query: 1533 NAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA 1592
A + + +A +++ E I ++ N +K +D I TETA + K
Sbjct: 186 KQDSAEIKQAKEAVEIAWKATVEAKDELI-----DVENAVKEA--LDKIKTETANN-TKL 237
Query: 1593 NDLKRKANLTKAGADSTKNTVQKIV 1617
D++ A L A + Q++V
Sbjct: 238 ADIEEAAELVLQIAKNAAEIAQEVV 262
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA binding.
This is the middle region of a family of TATA element
modulatory factor 1 proteins conserved in eukaryotes that
contains at its N-terminal section a number of leucine
zippers that could potentially form coiled coil
structures. The whole proteins bind to the TATA element
of some RNA polymerase II promoters and repress their
activity. by competing with the binding of TATA binding
protein. TMFs are evolutionarily conserved golgins that
bind Rab6, a ubiquitous ras-like GTP-binding Golgi
protein, and contribute to Golgi organisation in animal
and plant cells.
Length = 74
Score = 30.7 bits (70), Expect = 0.93
Identities = 18/70 (25%), Positives = 29/70 (41%), Gaps = 2/70 (2%)
Query: 1719 QETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQ-LNETEAMFNSQESELTEL 1777
+ KL K E G+ L K + + NT KL+ E E + +L +L
Sbjct: 1 SSLEKKLAEKDEQIAQLMEEGEKLSKKELKHN-NTIKKLRAKNKELEKEIAELKKKLEKL 59
Query: 1778 SKNIAELQKR 1787
K + L++R
Sbjct: 60 EKELENLEER 69
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed by
the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport and
binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 33.7 bits (77), Expect = 1.0
Identities = 22/132 (16%), Positives = 42/132 (31%), Gaps = 17/132 (12%)
Query: 1267 LGNTTQRINLARLALNDLTEEIKKLNKTGEM-LKENATLLQENNVEGALELTRQAHDTSL 1325
L T R+ AL+ + + L G K + L Q+ E A D +
Sbjct: 137 LEETIGRLREELAALSREVGKQRGLLSRGLATFKRDRILQQQWREEQE---KYDAADKAR 193
Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
+ T+ +++ E ++ Q EK L+ + +I +
Sbjct: 194 AIYALQTKADERNL---------ETVLQSLSQADFQLAGVAEKELETVEARIKEARYEIE 244
Query: 1386 DLNELICDKRGD 1397
+L +K
Sbjct: 245 ELE----NKLNL 252
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging family
of proteins that among the multispan membrane proteins
display a novel topology. Mouse prominin and human
prominin (mouse)-like 1 (PROML1) are predicted to contain
five membrane spanning domains, with an N-terminal domain
exposed to the extracellular space followed by four,
alternating small cytoplasmic and large extracellular,
loops and a cytoplasmic C-terminal domain. The exact
function of prominin is unknown although in humans
defects in PROM1, the gene coding for prominin, cause
retinal degeneration.
Length = 807
Score = 33.8 bits (78), Expect = 1.0
Identities = 32/157 (20%), Positives = 59/157 (37%), Gaps = 31/157 (19%)
Query: 1607 DSTKNTVQKIVDVLT--EARTAQDMAEVAIQTAKDDISA-----------ARKDLSQITN 1653
+S K + I +LT R +D+A + DDI+ +L+
Sbjct: 576 ESLKVNLSTI-TLLTPEARRNLEDLASSGL----DDINFSNFLKQIQKPLTEVNLNSFAE 630
Query: 1654 DLDDAQQKSNETNVKVKLLQE--RLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQA 1711
+LD +KV L E L+SIQ F+ S ++ L V + +
Sbjct: 631 ELDALANNLPNGKLKVALKNEALDLRSIQQNFVPPMESLLE------KLNSNVKTLDESS 684
Query: 1712 SGLRSRYQETDNKLLSKAESS-GLKRARGQMLLNKAS 1747
+ L Q +L++ +++ G ++N+ S
Sbjct: 685 NDL----QNAVKDVLAEVQAAQGFLNNNASEIVNEES 717
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 31.8 bits (73), Expect = 1.1
Identities = 18/94 (19%), Positives = 41/94 (43%), Gaps = 11/94 (11%)
Query: 1628 DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ-ERLKSIQAGFLQN 1686
D A + A+ ++ A + L ++ + +Q+ + + + L++ QA F+
Sbjct: 1 DKAARNLAEAQQELEQAEEKLEELQQYRLEYRQQLSG---SGQGISAAELRNYQA-FISA 56
Query: 1687 GRSAMDVENEEKNLEKEVALAQKQASGLRSRYQE 1720
A+ ++E+ A+KQ R ++QE
Sbjct: 57 LDEAIA------QQQQELEQAEKQVEQAREQWQE 84
>gnl|CDD|217789 pfam03915, AIP3, Actin interacting protein 3.
Length = 424
Score = 33.5 bits (77), Expect = 1.1
Identities = 37/195 (18%), Positives = 78/195 (40%), Gaps = 10/195 (5%)
Query: 1469 KKLINEKEAKAEEVFRGITTA--KQESHAANILAKEAYDHAASVRNKTAAYVASTS-NIT 1525
KK I++ + E + + A Q+S L++ + A V AA S+ N +
Sbjct: 80 KKHIDDSLGELSEKIKSLKQALDDQQSTILQRLSELQQNTAKEVARPAAAPPPSSLPNSS 139
Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET 1585
D A L +R LAV + S+ E I+++++ + +
Sbjct: 140 SSPDSINPAGLKELQSLRR--ELAVLRQTHSD--FKSEIQASISSILEKINAFKSASLNA 195
Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAAR 1645
+G +A K L+ +DS V + D++ R +D+A+ ++ + +
Sbjct: 196 SGSSNRAYMESSKKKLS-EDSDSLLTKVDDLQDIIEALR--KDVAQRGVRPGPKQLETVQ 252
Query: 1646 KDLSQITNDLDDAQQ 1660
K++ + +L ++
Sbjct: 253 KEIQKAEKELKKMEE 267
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 33.7 bits (77), Expect = 1.1
Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 9/97 (9%)
Query: 1692 DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSV 1751
+ E K L+KE+ L K+ + R +E + + K+E LK+ + L +A+ L
Sbjct: 61 TAKRESKALKKELLLEAKEEA--RKYREEIEQEF--KSERQELKQIESR-LTERATSLD- 114
Query: 1752 NTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRI 1788
K + L+ E S+E LT+ SK+I E ++++
Sbjct: 115 ---RKDENLSSKEKTLESKEQSLTDKSKHIDEREEQV 148
>gnl|CDD|179335 PRK01773, hscB, co-chaperone HscB; Provisional.
Length = 173
Score = 32.4 bits (74), Expect = 1.1
Identities = 33/131 (25%), Positives = 58/131 (44%), Gaps = 30/131 (22%)
Query: 1621 TEARTA-QDMAEV--AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ---- 1673
E R A Q AEV A+Q KD I A ++ T + + ++KS + L+Q
Sbjct: 43 QEQRLAMQKSAEVNDALQILKDPILRAEAIIALNTGEQQNLEEKSTQD--MAFLMQQMEW 100
Query: 1674 -ERLKSIQ--------AGFLQNGRSAMDVENEEKNLEKEV--ALAQKQ---ASGLRSRYQ 1719
E+L+ I+ F + +++ E++ + E+ AL +Q AS + R +
Sbjct: 101 REQLEEIEQQQDEDALTAFSK------EIKQEQQAILTELSTALNSQQWQQASQINDRLR 154
Query: 1720 ETDNKLLSKAE 1730
KL+ + E
Sbjct: 155 FI-KKLIIEIE 164
>gnl|CDD|145995 pfam03148, Tektin, Tektin family. Tektins are cytoskeletal proteins.
They have been demonstrated in such cellular sites as
centrioles, basal bodies, and along ciliary and flagellar
doublet microtubules. Tektins form unique protofilaments,
organised as longitudinal polymers of tektin heterodimers
with axial periodicity matching tubulin. Tektin
polypeptides consist of several alpha-helical regions
that are predicted to form coiled coils. Indeed, tektins
share considerable structural similarities with
intermediate filament proteins. Possible functional roles
for tektins are: stabilisation of tubulin protofilaments;
attachment of A and B-tubules in ciliary/flagellar
microtubule doublets and C-tubules in centrioles; binding
of axonemal components.
Length = 384
Score = 33.3 bits (77), Expect = 1.1
Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 2/58 (3%)
Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKAL-QRLNEKITSLEKNL 1384
+Q + E Q + AER + L+ +T Q + K L +R+++ I + L
Sbjct: 6 YQQYNQAEAQRDLAERLRHESRRLINETDATTKWTQADVTKKLGERISD-INFWKSEL 62
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 33.3 bits (77), Expect = 1.2
Identities = 50/237 (21%), Positives = 84/237 (35%), Gaps = 47/237 (19%)
Query: 1548 LAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKAND-LKRKANLTKAGA 1606
L + SN E++ E I N ++ L+ D +T L+ L+ +L +
Sbjct: 210 LEEERKRLSNSEKLAE---AIQNALELLSGED----DTVSALSLLGRALEALEDL--SEY 260
Query: 1607 DST-KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET 1665
D + + + L E A + ++ D++ L ++ L + + +
Sbjct: 261 DGKLSELAELLEEALYELEEASE----ELRAYLDELEFDPNRLEEVEERLFALKSLARKY 316
Query: 1666 NVKVKLLQERLKSIQAGF--LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDN 1723
V ++ L E L I+ L N E + LEKEV + +
Sbjct: 317 GVTIEDLLEYLDKIKEELAQLDNS------EESLEALEKEVKKLKAE------------- 357
Query: 1724 KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMF----NSQESELTE 1776
LL AE+ L R A +L TA+LK L +A F E T
Sbjct: 358 -LLEAAEA--LSAIR----KKAAKELEKEVTAELKALAMEKARFTVELKPLEESPTA 407
Score = 32.2 bits (74), Expect = 3.0
Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 16/154 (10%)
Query: 1637 AKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENE 1696
A++ A + Q +L+D Q+K E + LLQ +L+ ++ LQ G ++E E
Sbjct: 155 AQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDE-ELEEE 213
Query: 1697 EKNLEKEVALAQKQASGLRSRYQETDNK-LLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
K L LA+ + L E D LS L RA L S+
Sbjct: 214 RKRLSNSEKLAEAIQNALELLSGEDDTVSALSL-----LGRALEA--LEDLSEYD----- 261
Query: 1756 KLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
+L+E + EL E S+ + ++
Sbjct: 262 --GKLSELAELLEEALYELEEASEELRAYLDELE 293
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 32.3 bits (74), Expect = 1.2
Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 10/68 (14%)
Query: 1436 KLINEKESKAEELR-KHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITT-AKQES 1493
+ +E + KA EL+ K+ L +AK E KK++ E ++KAE V+ I A +E
Sbjct: 46 EKADEDQKKARELKLKNERELK---NAK---EEGKKIVEEYKSKAENVYEEIVKEAHEE- 98
Query: 1494 HAANILAK 1501
A I+ +
Sbjct: 99 -ADLIIER 105
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 33.1 bits (76), Expect = 1.3
Identities = 66/288 (22%), Positives = 96/288 (33%), Gaps = 50/288 (17%)
Query: 1529 DEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGD 1588
+EF P I + + EQ +E + N+V K ++ D LT+
Sbjct: 109 EEFFQDPMLNELLYSKIGKVRYQRLKLRFEEQGKEITRDFNDVFKLNSESD--LTQNPHF 166
Query: 1589 LAKANDL-KRKANLTKAGADSTKNTVQKIVDVLTE-----ARTAQDMAEVAIQTA--KDD 1640
++ D K L K G T QK + T M + + D+
Sbjct: 167 TNESFDFELFKKLLVKQGLPRLAIT-QKDKLTIYSLAEKYKLTWDQMLNLIKNSLVNNDE 225
Query: 1641 IS--AARKDLSQI----TNDLDDAQQKSNETNVKVK---LLQERLKSIQAGFLQNGRSAM 1691
I RK LSQI N KSN + L E K I
Sbjct: 226 IDTEELRKLLSQIYKNEHNTKPTLSTKSNPEDFTEANQNLNTESKKVIS----------- 274
Query: 1692 DVENEEKNLEK-------EVALAQKQA-SGLRSRYQETD---NKLLSKAESSGLKRARGQ 1740
D+E LEK ++ L + L S D N LL + LK
Sbjct: 275 DLEL-TFPLEKLKSIRDGKITLEDLKLVEELISDQNLLDGVINVLL---DYVLLKN---D 327
Query: 1741 MLLNKASQLSV-NTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
LNK L + N A+ K EA+ +++ + L+K + KR
Sbjct: 328 SKLNKKYILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKR 375
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 32.3 bits (74), Expect = 1.3
Identities = 29/130 (22%), Positives = 56/130 (43%), Gaps = 35/130 (26%)
Query: 1609 TKNTVQKIVDVL-----------------------TEARTAQDMAEVAI---------QT 1636
K VQK +D L E + +++ ++ + Q
Sbjct: 31 GKTAVQKALDELVDEGKIICKEYGKQKIYLCNQDQFELPSDEELNKLDMEIEELREEVQL 90
Query: 1637 AKDDISAARKDLSQITNDL--DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMD-V 1693
K D S +L +T+DL ++ Q++ E +V+ ++E+L+S++ G+ M+ V
Sbjct: 91 LKQDCSTLEIELKSLTSDLTTEELQEEIQELKKEVREIEEKLESLEEGWKPVTPEEMEKV 150
Query: 1694 ENEEKNLEKE 1703
+ E K+L KE
Sbjct: 151 KKEYKDLHKE 160
>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
trafficking and secretion].
Length = 283
Score = 32.9 bits (75), Expect = 1.3
Identities = 30/183 (16%), Positives = 72/183 (39%), Gaps = 17/183 (9%)
Query: 1214 KEFELLEKKIEDV-KALVENTTRSSHDLTVMLTTIDDMKKQI--MNNEDPNGVGKTLGNT 1270
+E + + I + K ++T S D + ID++ K++ + N
Sbjct: 45 QELTAVRRSISRLGKVYAKHTEPSFSDKSEKEDEIDELSKKVNQDLQRCEKILKTKYKNL 104
Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
++L + TE ++ G+ +++ + ++ V A L + +D +E+
Sbjct: 105 QSSFLQSKLLRDLNTECME-----GQRIQQKSAQFRKYQVLQAKFLRNKNNDQHPLEEEE 159
Query: 1331 STETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
E+ +Q + + L + E + + +E+I +L + + +LNE+
Sbjct: 160 DEESLSS--LGSQQTLQQQGLSNEEL-------EYQQILITERDEEIKNLARGIYELNEI 210
Query: 1391 ICD 1393
D
Sbjct: 211 FRD 213
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD and
GSK-3beta in intact cells.
Length = 711
Score = 33.3 bits (76), Expect = 1.3
Identities = 81/402 (20%), Positives = 148/402 (36%), Gaps = 74/402 (18%)
Query: 1429 DYAE------TSKKLI-NEKESKAE-ELRKHMI-ILSES----NSAKDYAETSKKLINEK 1475
DY+E + +K + ++S+ E E KH I L E NS + A K+ I EK
Sbjct: 98 DYSELEEENISLQKQVSVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARLKE-IAEK 156
Query: 1476 EAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAP 1535
+ EE + T K E N L KE H S SN++ LD
Sbjct: 157 QL--EE---ALETLKTEREQKNALRKELSHH------LNLEDFYSLSNLSISLD------ 199
Query: 1536 GATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTIL-TETAGDLAK--- 1591
GL +++ +++ E + + N + + L T G+L
Sbjct: 200 -----------GLKFSEDPEASTEPNND-GEDENGHLNGGPGLAKSLGTPRKGELFPPAP 247
Query: 1592 -ANDLKRKANLT-----KAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDI---- 1641
+DL + NL+ K + + ++ L E +T + +E A+ + +
Sbjct: 248 GVSDLFSELNLSEIQKLKQQLEQVEREKASLLANLQELQTQLEQSEGALSEQHEKVNRLT 307
Query: 1642 ---SAAR---KDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVEN 1695
+A R Q + + S+E ++ + ++ + +++
Sbjct: 308 EHLNALRNLQASKEQQDGLDSEKDRGSHEDGDYYEVDINGPEILECKYRVAVSEVGELKA 367
Query: 1696 EEKNLEKEVALAQKQAS----GLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSV 1751
E K L+ Q++ LR+ Q KLLS +SS + R L + +
Sbjct: 368 ELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDL-----EKEL 422
Query: 1752 NTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
+ E++ N+ + EL S+ +A+L + C N
Sbjct: 423 RKLTHVAG--ESQGSLNAAQDELVTFSEELAQLYHHVCMCNN 462
>gnl|CDD|235558 PRK05683, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 676
Score = 33.5 bits (77), Expect = 1.3
Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELT----ELSKNIAELQKRI 1788
Q+LL +A LS + QLN+ + NSQ S +T L+ +IA K+I
Sbjct: 130 QLLLTQAQGLSKRFNSLSSQLNQQNSNINSQLSAMTDQVNNLTTSIASYNKQI 182
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated Ndc80
complex and is involved in chromosome segregation.
Length = 117
Score = 31.5 bits (72), Expect = 1.3
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALE 1315
++ LA+ LNDL E+ L E LKE L+E VEG
Sbjct: 6 NEKFRLAK-ELNDLESELNNLQSELEELKERLEELEEQEVEGDEV 49
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 32.5 bits (74), Expect = 1.4
Identities = 19/70 (27%), Positives = 29/70 (41%)
Query: 1719 QETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELS 1778
Q D L K E+ L+ + + + +S S + + E E ELTEL
Sbjct: 28 QLADRTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAKLREELTELH 87
Query: 1779 KNIAELQKRI 1788
K EL +R+
Sbjct: 88 KKRGELAQRL 97
Score = 32.1 bits (73), Expect = 1.7
Identities = 25/111 (22%), Positives = 43/111 (38%), Gaps = 4/111 (3%)
Query: 1284 LTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAER 1343
L E L + + + + G+ + KL+E+ TE K E
Sbjct: 36 LKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAKLREELTELHK--KRGEL 93
Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDK 1394
+ L+ Q + ++ EK + L +ITSLE + DL E + +K
Sbjct: 94 A--QRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLREELQEK 142
Score = 31.7 bits (72), Expect = 2.6
Identities = 19/101 (18%), Positives = 37/101 (36%), Gaps = 19/101 (18%)
Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
L L EE+ +L+K L + LL + + E+ + + E +I
Sbjct: 76 LAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEK----TIAEL----RSEITS 127
Query: 1341 AERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLE 1381
E + + + +E EK + L +++ SL
Sbjct: 128 LETEIRDLREEL-----------QEKEKDNETLQDELISLN 157
>gnl|CDD|218979 pfam06295, DUF1043, Protein of unknown function (DUF1043). This
family consists of several hypothetical bacterial
proteins of unknown function.
Length = 128
Score = 31.7 bits (72), Expect = 1.4
Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 7/56 (12%)
Query: 1430 YAETSKKLINEKESKAEELRKHMIILSESNSA-------KDYAETSKKLINEKEAK 1478
A++SK L+ E ++ K + ESN +DY+E + L K+ K
Sbjct: 73 MAKSSKFLLPELPAQDNPFIKKLSKNEESNENEEVKEQPRDYSEGASGLFRNKKKK 128
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural protein
GP20 sequences of around 180 residues in length. The
function of this family is unknown.
Length = 156
Score = 31.9 bits (73), Expect = 1.4
Identities = 27/130 (20%), Positives = 54/130 (41%), Gaps = 18/130 (13%)
Query: 1264 GKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDT 1323
GK + N +++ A + L +++K +K E LK+ + + E EL ++
Sbjct: 12 GKDIQNPKSKLDKANEERDSLKKQLKNRDKQIEDLKK-----KVKDNE---ELQKKIEKL 63
Query: 1324 SLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRL--NEKITSLE 1381
+ K E E ++ D + N + + + N KA++ L +K+ +
Sbjct: 64 KQQNKTAKEEYEAKLAD-----TKLNNAIELALKKA---KARNAKAVKALLDKDKLKLDD 115
Query: 1382 KNLPDLNELI 1391
L L+E I
Sbjct: 116 DGLKGLDEQI 125
>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
Length = 568
Score = 33.3 bits (76), Expect = 1.4
Identities = 29/130 (22%), Positives = 52/130 (40%), Gaps = 7/130 (5%)
Query: 1422 SESNSAKDYAETSKKLINEKESKAEELRKHMI-ILSESNSAKDYAETSKKLINEKEAKAE 1480
+E++ A++ AE + ++ R SE+N K YA++ + + +AE
Sbjct: 173 NEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAE 232
Query: 1481 EVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLA 1540
+ TA + A A A+ + N A S + +Q D T A
Sbjct: 233 SSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANRAKQQADR-----AKTEA 287
Query: 1541 D-IRNISGLA 1549
D + N++G A
Sbjct: 288 DKLGNMNGFA 297
>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
of Class IA Phosphoinositide 3-kinase Regulatory
subunits. PI3Ks catalyze the transfer of the
gamma-phosphoryl group from ATP to the 3-hydroxyl of the
inositol ring of D-myo-phosphatidylinositol (PtdIns) or
its derivatives. They play an important role in a variety
of fundamental cellular processes, including cell
motility, the Ras pathway, vesicle trafficking and
secretion, immune cell activation, and apoptosis. They
are classified according to their substrate specificity,
regulation, and domain structure. Class IA PI3Ks are
heterodimers of a p110 catalytic (C) subunit and a
p85-related regulatory (R) subunit. The R subunit
down-regulates PI3K basal activity, stabilizes the C
subunit, and plays a role in the activation downstream of
tyrosine kinases. All R subunits contain two SH2 domains
that flank an intervening helical domain (iSH2), which
binds to the N-terminal adaptor-binding domain (ABD) of
the catalytic subunit. In vertebrates, there are three
genes (PIK3R1, PIK3R2, and PIK3R3) that encode for
different Class IA PI3K R subunits.
Length = 152
Score = 31.8 bits (73), Expect = 1.5
Identities = 19/92 (20%), Positives = 31/92 (33%), Gaps = 22/92 (23%)
Query: 1714 LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAK---LKQLNETEAMFNSQ 1770
L + +E + + L K+ L K ++LS K L+ E MF Q
Sbjct: 6 LAKKLKEINKEYLDKSR-------EYDELYEKYNKLSQEIQLKRQALEAFEEAVKMFEEQ 58
Query: 1771 ------------ESELTELSKNIAELQKRIQS 1790
E L +N L+ R++
Sbjct: 59 LRTQEKFQKEAQPHEKQRLMENNELLKSRLKE 90
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The mitochondrial
release factors are prfA-like, although not included
above the trusted cutoff for this model. RF-1 does not
have a translational frameshift [Protein synthesis,
Translation factors].
Length = 360
Score = 33.1 bits (76), Expect = 1.5
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 21/119 (17%)
Query: 1703 EVALAQKQASGLRSRYQETDNKLLSKAE-SSGLKRARGQMLLNKASQLS-VNTTAK---- 1756
+ +L +K S L RY+E LLS E S + R L + SQL + +
Sbjct: 2 KPSLLEKLES-LLERYEEL-EALLSDPEVISDQDKLRK--LSKEYSQLEEIVDCYREYQQ 57
Query: 1757 -LKQLNETEAMFNSQ--------ESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCV 1806
+ + E + + + EL EL + I EL+++++ + + N+ KN +
Sbjct: 58 AQEDIKEAKEILEESDPEMREMAKEELEELEEKIEELEEQLK--VLLLPKDPNDEKNVI 114
>gnl|CDD|213765 TIGR03060, PS_II_psb29, photosystem II biogenesis protein Psp29.
Psp29, originally designated sll1414 in Synechocystis
6803, is found universally in Cyanobacteria and in
Arabidopsis. It was isolated and partially sequenced from
purified photosystem II (PS II) in Synechocystis. While
its function is unknown, mutant studies show an
impairment in photosystem II biogenesis and/or stability,
rather than in PS II core function [Energy metabolism,
Photosynthesis].
Length = 214
Score = 32.3 bits (74), Expect = 1.5
Identities = 28/132 (21%), Positives = 54/132 (40%), Gaps = 19/132 (14%)
Query: 1557 NPEQIRERANEINNVIKSLT--DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQ 1614
+PEQ+RE A ++ K +I + LT+ A D + S +
Sbjct: 87 DPEQLREDAKQLLEQAKGKGLDEILSWLTQ-ANLSNGGGDTLQGIAGRHKFKYSRLFAI- 144
Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAA--------RKDLSQITNDLDDAQQKSNETN 1666
+ +L EA +D+ E + ++S A KDL ++L+ +Q
Sbjct: 145 GLYSLLEEAAPDKDIDEEDLNEILKELSEALGLSYDRVEKDLDLYKSNLEKMKQ------ 198
Query: 1667 VKVKLLQERLKS 1678
++L++E L++
Sbjct: 199 -ALELMEETLEA 209
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 33.0 bits (76), Expect = 1.5
Identities = 47/298 (15%), Positives = 87/298 (29%), Gaps = 39/298 (13%)
Query: 1456 SESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTA 1515
+E + ++ + KKL + ++K I A + + + + T
Sbjct: 8 AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTD 67
Query: 1516 AYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSL 1575
S K A A + L +K + ++ I L
Sbjct: 68 DATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVL 127
Query: 1576 TDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQ 1635
D + D +D+ + D + V + E ++
Sbjct: 128 NQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE--------AKELE 179
Query: 1636 TAKDDISAARKDLSQITNDLDDAQQKSNET----NVKVKLLQERLKSIQAGFLQNGRSAM 1691
DD + L A++ + T VK LK I G+ +
Sbjct: 180 KLSDDDDFVWDEDDSE--ALRQARKDAKLTATADPVK-----AYLKQI-------GKVKL 225
Query: 1692 DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGL-------KRARGQML 1742
N E+EV LA++ +GL + + + L L KRA+ +L
Sbjct: 226 ------LNAEEEVELAKRIEAGLYAEELLAEGEKLDPELRRDLQWIGRDGKRAKNHLL 277
>gnl|CDD|128778 smart00502, BBC, B-Box C-terminal domain. Coiled coil region
C-terminal to (some) B-Box domains.
Length = 127
Score = 31.5 bits (72), Expect = 1.7
Identities = 21/113 (18%), Positives = 37/113 (32%), Gaps = 25/113 (22%)
Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLS-----KAESSGLKRA---RGQML 1742
+E L K+ A + L S QE + KA L+ A R + L
Sbjct: 3 EALEELLTKLRKKAAELEDALKQLISIIQEVEENAADVEAQIKAAFDELRNALNKRKKQL 62
Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFI 1795
L + + K L + +++ + Q+++ INF
Sbjct: 63 LED-----LEEQKENKLKV------------LEQQLESLTQKQEKLSHAINFT 98
>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation, ribosomal
structure and biogenesis].
Length = 363
Score = 33.0 bits (76), Expect = 1.7
Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 21/93 (22%)
Query: 1714 LRSRYQETDNKLLSKAE-SSGLKRARGQMLLNKA-SQLS--VNTTAKLKQ----LNETEA 1765
L RY+E LLS E S R L+K ++L V + K+ L + +
Sbjct: 12 LLERYEEL-EALLSDPEVISDPDEYRK---LSKEYAELEPIVEKYREYKKAQEDLEDAKE 67
Query: 1766 MFNSQ---------ESELTELSKNIAELQKRIQ 1789
M + E E+ EL I EL++ ++
Sbjct: 68 MLAEEKDPEMREMAEEEIKELEAKIEELEEELK 100
>gnl|CDD|218625 pfam05531, NPV_P10, Nucleopolyhedrovirus P10 protein. This family
consists of several nucleopolyhedrovirus P10 proteins
which are thought to be involved in the morphogenesis of
the polyhedra.
Length = 75
Score = 30.1 bits (68), Expect = 1.8
Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 7/63 (11%)
Query: 1619 VLTEARTAQDMAEV-----AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
+L + R D+ V A+QT DD+ A D++++ LD + KV +Q
Sbjct: 5 ILLQIR--ADIKAVDDKVDALQTQVDDLRANLPDVTELNEKLDAQSAQLTTLQTKVTNIQ 62
Query: 1674 ERL 1676
L
Sbjct: 63 SIL 65
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This family
of bacterial proteins has no known function. The proteins
are in the region of 500-600 amino acid residues in
length.
Length = 450
Score = 32.6 bits (75), Expect = 1.8
Identities = 15/73 (20%), Positives = 36/73 (49%), Gaps = 8/73 (10%)
Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKA--------LQRLNEK 1376
+LKE+ E E+++ + E + ++ + L+ K + ++ ++L+E+
Sbjct: 330 PELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEE 389
Query: 1377 ITSLEKNLPDLNE 1389
+ LE+ L +L E
Sbjct: 390 LEELEEELKELKE 402
Score = 32.6 bits (75), Expect = 2.4
Identities = 21/118 (17%), Positives = 43/118 (36%), Gaps = 13/118 (11%)
Query: 1676 LKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLK 1735
K ++A L + E EV + + L+ +E + +L E L+
Sbjct: 305 GKEVKAKELGSPA--------GTKTEIEVGVDFPE---LKEELKELEEELKELEEE--LE 351
Query: 1736 RARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
+ + + K +QL + EL EL + + EL++ ++S +
Sbjct: 352 KIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEELKELKEELESLYS 409
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 32.5 bits (74), Expect = 2.0
Identities = 27/117 (23%), Positives = 46/117 (39%), Gaps = 11/117 (9%)
Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
L L EE + L LK +E + + +Q + + ++ S E K+ D
Sbjct: 170 LKLLEEENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEELAKKTED 229
Query: 1341 AERQCKRTENLVTKTFPQFTQGQE-------ENEKALQRLNEKITSLEKNLPDLNEL 1390
ERQ + +L++ Q Q+ ENE+ Q L + + +L EL
Sbjct: 230 LERQQEEITHLLS----QIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQEL 282
Score = 30.2 bits (68), Expect = 9.5
Identities = 57/290 (19%), Positives = 104/290 (35%), Gaps = 77/290 (26%)
Query: 1540 ADIRNISGLAVNKNIQSNPEQIRE-------RANEINNVIKSLTDIDT---ILTETAGDL 1589
D + + + + EQ E AN + + K+ DID +L E DL
Sbjct: 20 DDDWKTPRASPDARLDLSTEQRAETLKYLALCANRVGQMTKTYNDIDAVTRLLEEKERDL 79
Query: 1590 AKAND-----LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAA 1644
A LK+ L KN + E + A+D+I
Sbjct: 80 ELAARIGQSLLKQNKVL------MEKN----------------EKLEEQLGKARDEILQL 117
Query: 1645 RKDLSQ-------ITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEE 1697
R +L+ ++ ++++ +S+E+ L+ ++ +G ++ E+
Sbjct: 118 RHELNLKDELLQFYSDADEESEDESSES--------TPLRPQESSSSSHGCFQLEALQEK 169
Query: 1698 -KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAK 1756
K LE+E + +AS L+ ET + E Q L+N
Sbjct: 170 LKLLEEENEHLRSEASHLK---TETVT--YEEKE---------QQLVNDC---------- 205
Query: 1757 LKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCV 1806
+KQL E S EL + ++++ Q+ I ++ IVD K+
Sbjct: 206 VKQLREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYA 255
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 32.9 bits (76), Expect = 2.0
Identities = 49/225 (21%), Positives = 78/225 (34%), Gaps = 29/225 (12%)
Query: 1427 AKDYAETSKKLINEKESKAEELRKHMIILSES------------NSAKDYAETSKKLINE 1474
AK+ +SK+L+ + + E++ H + E AK AE + E
Sbjct: 10 AKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAE 69
Query: 1475 KEAKAEEVFRGITTAKQESHAANI-LAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLN 1533
+EAKAE E+ AA A+ A D AA A + K+
Sbjct: 70 EEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPK 129
Query: 1534 APGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTE--TAGDLA- 1590
R K + + R R + + ++ E T +LA
Sbjct: 130 KKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAE 189
Query: 1591 ----KANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
KA ++ +K L K G +T N L + TA+ +AE
Sbjct: 190 KMAVKAAEVIKK--LFKLGVMATIN------QSL-DQETAELLAE 225
>gnl|CDD|237687 PRK14357, glmU, bifunctional N-acetylglucosamine-1-phosphate
uridyltransferase/glucosamine-1-phosphate
acetyltransferase; Provisional.
Length = 448
Score = 32.8 bits (75), Expect = 2.0
Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 23/103 (22%)
Query: 1224 EDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ-RINLARLALN 1282
+K L+E R D+T+++ + EDP G G+ + + + RI +
Sbjct: 105 NTLKRLIEEHNRKGADVTILVADL----------EDPTGYGRIIRDGGKYRI----VEDK 150
Query: 1283 DLTEEIKKLNK--------TGEMLKENATLLQENNVEGALELT 1317
D EE KK+ + +G+ L E ++ N +G LT
Sbjct: 151 DAPEEEKKIKEINTGIYVFSGDFLLEVLPKIKNENAKGEYYLT 193
>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
Staphylococcus aureus plasmid proteins with no
characterized function.
Length = 229
Score = 32.3 bits (73), Expect = 2.1
Identities = 26/129 (20%), Positives = 53/129 (41%), Gaps = 31/129 (24%)
Query: 1594 DLKRKANLTKAGADSTKNTVQK------------IVDVLTEARTAQDMAEVAIQTAKDDI 1641
+++ + LT + + +++ I+ + E + QD+ E ++ K ++
Sbjct: 20 NMQMNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKYIAEIKENQDIREKELKAIKSEL 79
Query: 1642 SAARKDLS----QITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQN--GRSAMDVEN 1695
A +D +I ND V LLQE+LK + A +++ G+ V
Sbjct: 80 KEATEDFQDKGKKIHNDF-------------VDLLQEKLKHVDAEDIEDELGKDIYGVRE 126
Query: 1696 EEKNLEKEV 1704
E + +EV
Sbjct: 127 EIDRMLQEV 135
>gnl|CDD|236491 PRK09377, tsf, elongation factor Ts; Provisional.
Length = 290
Score = 32.4 bits (75), Expect = 2.3
Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 4/62 (6%)
Query: 1582 LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT---AK 1638
LTE GD+ KA + RK L KA + + + +V + + EV +T AK
Sbjct: 27 LTEADGDIEKAIEWLRKKGLAKAAKKAGRVAAEGLVAAKVDGNKGV-LVEVNSETDFVAK 85
Query: 1639 DD 1640
++
Sbjct: 86 NE 87
>gnl|CDD|213361 cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of
the Escherichia coli CorA-Salmonella typhimurium ZntB
family. A uncharacterized subfamily of the Escherichia
coli CorA-Salmonella typhimurium ZntB (EcCorA-ZntB_like)
family of the MIT superfamily of essential membrane
proteins involved in transporting divalent cations
(uptake or efflux) across membranes.The EcCorA-ZntB-like
family includes the Mg2+ transporters Escherichia coli
and Salmonella typhimurium CorAs, which can also
transport Co2+, and Ni2+. Structures of the intracellular
domain of EcCorA-ZntB-like family members, Vibrio
parahaemolyticus and Salmonella typhimurium ZntB, form
funnel-shaped homopentamers, the tip of the funnel is
formed from two C-terminal transmembrane (TM) helices
from each monomer, and the large opening of the funnel
from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just after
TM1, mutation within this motif is known to abolish Mg2+
transport through Salmonella typhimurium CorA. Natural
variants such as GVN and GIN, such as occur in some ZntB
family proteins, may be associated with the transport of
different divalent cations, such as zinc and cadmium. The
functional diversity of MIT transporters may also be due
to minor structural differences regulating gating,
substrate selection, and transport.
Length = 289
Score = 32.1 bits (74), Expect = 2.3
Identities = 26/116 (22%), Positives = 44/116 (37%), Gaps = 39/116 (33%)
Query: 1559 EQIRERANEINNVIKS---------LTDIDTILT--ETAGDLAKAND-----LKRKANLT 1602
QI + +E+ ++ L D++ L T+ K+N+ LKR +L
Sbjct: 125 RQINKERDELEQELRKSTKNKELFQLLDLEKSLVYFSTS---LKSNEIVLEKLKRSRSLK 181
Query: 1603 KAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
D + + DV+ E + A +MAE+ S I + L DA
Sbjct: 182 LDEEDK-----ELLEDVIIENKQAIEMAEIY---------------SDILSGLMDA 217
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 32.0 bits (73), Expect = 2.4
Identities = 20/103 (19%), Positives = 37/103 (35%), Gaps = 13/103 (12%)
Query: 1694 ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNT 1753
E E+ LE + + A L+ +E + + G + + Q
Sbjct: 40 EELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGF------EEGQEEGRV 93
Query: 1754 TAKLKQL-----NETEAMFNSQESELTELSKNIAE--LQKRIQ 1789
+L +L E EA+ E +L +L IA + K ++
Sbjct: 94 LERLAKLIAEFQAELEALKEVVEKQLVQLVLEIARKVIGKELE 136
>gnl|CDD|206121 pfam13950, Epimerase_Csub, UDP-glucose 4-epimerase C-term subunit.
This domain is the very C-terminal subunit of
UDP-glucose 4-epimerase.
Length = 62
Score = 29.3 bits (67), Expect = 2.4
Identities = 9/17 (52%), Positives = 11/17 (64%), Gaps = 1/17 (5%)
Query: 442 DMCKDGFWNFNETNPNG 458
DMC+D W + NPNG
Sbjct: 46 DMCRDS-WRWQSKNPNG 61
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 32.4 bits (74), Expect = 2.4
Identities = 22/114 (19%), Positives = 44/114 (38%), Gaps = 10/114 (8%)
Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQEN--NVEGALELTRQAHDTSLK---LKEQST 1332
L +E+++L K E L++ LL N +E L+ R A + ++ +
Sbjct: 299 ENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKS 358
Query: 1333 ETEKQINDAE-RQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
+T + ++ K+ + Q E ++A+ T+LEK
Sbjct: 359 KTPSENAQRYFKKYKKLKGAKVNLDRQL----SELKEAIAYYESAKTALEKAEG 408
>gnl|CDD|223342 COG0264, Tsf, Translation elongation factor Ts [Translation,
ribosomal structure and biogenesis].
Length = 296
Score = 32.3 bits (74), Expect = 2.5
Identities = 20/107 (18%), Positives = 40/107 (37%), Gaps = 7/107 (6%)
Query: 1582 LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDV-LTEARTAQDMAEVAIQT---A 1637
L E GD+ KA + R+ + KA + + + ++ + + EV +T A
Sbjct: 27 LEEANGDIEKAIEWLREKGIAKAAKKAGRIAAEGLIAAKVDGDGKKAVLVEVNCETDFVA 86
Query: 1638 KDD--ISAARKDLSQI-TNDLDDAQQKSNETNVKVKLLQERLKSIQA 1681
K+ A K D ++ K ++E + ++ A
Sbjct: 87 KNAEFQELANKIAKAALEKKPADVEELKAAFEPGGKTVEEEIAALIA 133
>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A. Members of this
family are the phage shock protein PspA, from the phage
shock operon. This is a narrower family than the set of
PspA and its homologs, sometimes several in a genome, as
described by pfam04012. PspA appears to maintain the
protonmotive force under stress conditions that include
overexpression of certain phage secretins, heat shock,
ethanol, and protein export defects [Cellular processes,
Adaptations to atypical conditions].
Length = 219
Score = 31.9 bits (73), Expect = 2.5
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 4/62 (6%)
Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
L E + AQ++AE + +++A + L+++ D+ Q K E + K L R ++
Sbjct: 91 LIEKQKAQELAE----ALERELAAVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQAA 146
Query: 1680 QA 1681
+
Sbjct: 147 SS 148
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.4 bits (75), Expect = 2.7
Identities = 51/233 (21%), Positives = 87/233 (37%), Gaps = 64/233 (27%)
Query: 1285 TEEIKKLNKTGEMLKENATLLQENNVEGA-LELTRQAHDTSLKLKEQSTETEKQINDAER 1343
+IK+ + + + E A E + A LE + H + +++ E ++ E+
Sbjct: 30 EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEK 89
Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLC 1403
+ + E + + + +EE EK + L +K LEK +L ELI
Sbjct: 90 RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELI------------ 137
Query: 1404 GGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK--AEELRKHMIILS--ESN 1459
E ++L E+ S AEE ++ I+L E
Sbjct: 138 ---------------------------EEQLQEL--ERISGLTAEEAKE--ILLEKVEEE 166
Query: 1460 SAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILA----KEAYDHAA 1508
+ + A LI E E +A+E A ++ A ILA + A DH A
Sbjct: 167 ARHEAA----VLIKEIEEEAKE------EADKK--AKEILAQAIQRCAADHVA 207
>gnl|CDD|203372 pfam06009, Laminin_II, Laminin Domain II. It has been suggested that
the domains I and II from laminin A, B1 and B2 may come
together to form a triple helical coiled-coil structure.
Length = 138
Score = 30.9 bits (70), Expect = 2.7
Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 18/98 (18%)
Query: 1698 KNLEKEVALAQKQASGLRSRYQETDNKL-----LSKAESSGLKRARGQM--LLNKASQLS 1750
N+ K++ S +++ + + L L + + + A + L A L
Sbjct: 13 ANVLKQLEPISTNLSNTKAKLSKLNGSLEETNELVNSANKAVDDAGRSVRKLEKLAPDL- 71
Query: 1751 VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRI 1788
KLK L + E LS+NI+ +++ I
Sbjct: 72 ---LDKLKPLKQ-------LEVANQSLSRNISRIKELI 99
Score = 30.1 bits (68), Expect = 4.3
Identities = 16/114 (14%), Positives = 43/114 (37%), Gaps = 4/114 (3%)
Query: 1602 TKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQK 1661
+K A T ++ L T + + + + ++ +DDA +
Sbjct: 1 SKELAREANETAANVLKQLEPISTNLSNTKAKLSKLNGSLEETNELVNSANKAVDDAGRS 60
Query: 1662 SNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLR 1715
+ L ++LK ++ L+ ++ +++ +A A+K A+ ++
Sbjct: 61 VRKLEKLAPDLLDKLKPLKQ--LEVANQSLSRNISR--IKELIAQARKAANSIK 110
>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 247
Score = 32.0 bits (73), Expect = 2.7
Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 4/86 (4%)
Query: 1629 MAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGR 1688
M QT+K + S R + +L Q+K N +V+ L+ +L S++ +
Sbjct: 34 MLAAVFQTSKGE-SVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVRRS-VLTDD 91
Query: 1689 SAMDVENEEKNLEKEVALAQKQASGL 1714
+A+ E+ + L GL
Sbjct: 92 AAL--EDRLEKLRMLAGSVPVTGPGL 115
>gnl|CDD|217457 pfam03255, ACCA, Acetyl co-enzyme A carboxylase carboxyltransferase
alpha subunit. Acetyl co-enzyme A carboxylase
carboxyltransferase is composed of an alpha and beta
subunit.
Length = 145
Score = 31.1 bits (71), Expect = 2.8
Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 15/69 (21%)
Query: 1215 EFEL----LEKKIEDVKALVE-NTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGN 1269
EFE LE KI++++ L N S ++ + +D +K++I +N P
Sbjct: 4 EFEKPLAELEAKIDELRKLARKNDVDVSDEIHRLEKKLDKLKREIYSNLTP--------- 54
Query: 1270 TTQRINLAR 1278
QR+ LAR
Sbjct: 55 -WQRVQLAR 62
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal element
functions, Plasmid functions].
Length = 472
Score = 32.3 bits (74), Expect = 2.8
Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 27/108 (25%)
Query: 1281 LNDLTEEIKKLNK-TGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIN 1339
L L E+K+L K +++ EN L EN +L+++ ++QI
Sbjct: 61 LRTLVAEVKELRKRLAKLISENEALKAENE----------------RLQKREQSIDQQIQ 104
Query: 1340 DAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDL 1387
A V + T+ E+ + Q+L I L++ L +
Sbjct: 105 QA----------VQSETQELTKEIEQLKSERQQLQGLIDQLQRRLAGV 142
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set of
eukaryotic tropomyosins. Within the yeast Tmp1 and Tmp2,
biochemical and sequence analyses indicate that Tpm2p
spans four actin monomers along a filament, whereas Tpmlp
spans five. Despite its shorter length, Tpm2p can compete
with Tpm1p for binding to F-actin. Over-expression of
Tpm2p in vivo alters the axial budding of haploids to a
bipolar pattern, and this can be partially suppressed by
co-over-expression of Tpm1p. This suggests distinct
functions for the two tropomyosins, and indicates that
the ratio between them is important for correct
morphogenesis. The family also contains higher eukaryote
Tmp3 members.
Length = 143
Score = 30.6 bits (70), Expect = 3.0
Identities = 42/178 (23%), Positives = 65/178 (36%), Gaps = 53/178 (29%)
Query: 1311 EGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQ---EENE 1367
E A E + + +L++++ E E++I +++ ++ E V K Q + + EE+E
Sbjct: 10 ENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESE 69
Query: 1368 KALQR---LNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSES 1424
K L +I LE EL
Sbjct: 70 KLATNAEALTRRIQLLE------EEL---------------------------------E 90
Query: 1425 NSAKDYAETSKKLINEKESKAEEL---RKHMIILSESNSAKDYAETSKKLINEKEAKA 1479
S K ET++KL E + KAEE K + + K Y E KK KEAK
Sbjct: 91 ESEKRLKETTEKL-READKKAEESERKVKALENERDEWEEK-YEELEKKY---KEAKK 143
Score = 29.9 bits (68), Expect = 6.7
Identities = 25/137 (18%), Positives = 55/137 (40%), Gaps = 39/137 (28%)
Query: 1622 EARTAQDMAEVAIQTAKD----------DISAARKDLSQITNDLD-------DAQQKSNE 1664
EA AQ+ AE + K+ +I++ +K Q+ +++ +A++K E
Sbjct: 8 EAENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEE 67
Query: 1665 T----------NVKVKLLQERLKSIQAGFLQNGRSAM------DVENEEKNLEKEVALAQ 1708
+ +++LL+E L+ + + D + EE E++V +
Sbjct: 68 SEKLATNAEALTRRIQLLEEELEESEK----RLKETTEKLREADKKAEE--SERKVKALE 121
Query: 1709 KQASGLRSRYQETDNKL 1725
+ +Y+E + K
Sbjct: 122 NERDEWEEKYEELEKKY 138
>gnl|CDD|225559 COG3014, COG3014, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 449
Score = 32.2 bits (73), Expect = 3.0
Identities = 19/83 (22%), Positives = 29/83 (34%), Gaps = 9/83 (10%)
Query: 1653 NDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA----LAQ 1708
ND A+ + N N + Q R K +Q +D N+E+ A +
Sbjct: 139 NDSAKARVEFNRANER----QRRAKEFYYEEVQKAIKEIDSSKHNINMERSRAEVSEILN 194
Query: 1709 KQASGLRSRYQETDNKLLSKAES 1731
S +Y E LL+ S
Sbjct: 195 NTYSNYLDKY-EAYQGLLNPYVS 216
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 32.1 bits (73), Expect = 3.1
Identities = 25/105 (23%), Positives = 44/105 (41%), Gaps = 13/105 (12%)
Query: 1284 LTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAER 1343
++I++ +++ +L +E N ++A S K KE S E+ IN +
Sbjct: 28 RYKQIEQEDQS-RILNTLEEFEKEAN-------EKRAQYRSAKKKELSQLEEQLINQKKE 79
Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNE----KITSLEKNL 1384
Q + K F Q + +AL+ LN ++ LEK L
Sbjct: 80 QKNLFNEQI-KQFELALQDEIAKLEALELLNLEKDKELELLEKEL 123
Score = 30.6 bits (69), Expect = 8.2
Identities = 35/158 (22%), Positives = 67/158 (42%), Gaps = 27/158 (17%)
Query: 1640 DISAARKDLSQITNDLDDAQQK-SNETNVKVK----LLQERLKSIQAGFLQNGRSAMDVE 1694
SA +K+LSQ+ L + +++ N N ++K LQ+ + ++A L N ++ +
Sbjct: 58 YRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLN----LEKD 113
Query: 1695 NEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTT 1754
E + LEKE L +E +L + AE KR N ++ +
Sbjct: 114 KELELLEKE----------LDELSKELQKQLQNTAEIIEKKRE------NNKNEERLKFE 157
Query: 1755 AKLKQLNETEAMFNSQESELTELSKNI--AELQKRIQS 1790
+ K E E +L E + ++ E +++ +S
Sbjct: 158 NEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRES 195
>gnl|CDD|232995 TIGR00487, IF-2, translation initiation factor IF-2. This model
discriminates eubacterial (and mitochondrial) translation
initiation factor 2 (IF-2), encoded by the infB gene in
bacteria, from similar proteins in the Archaea and
Eukaryotes. In the bacteria and in organelles, the
initiator tRNA is charged with N-formyl-Met instead of
Met. This translation factor acts in delivering the
initator tRNA to the ribosome. It is one of a number of
GTP-binding translation factors recognized by the pfam
model GTP_EFTU [Protein synthesis, Translation factors].
Length = 587
Score = 32.0 bits (73), Expect = 3.4
Identities = 20/78 (25%), Positives = 32/78 (41%), Gaps = 3/78 (3%)
Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE 1335
LA +++ IKKL G M+ N L +E A E +++ + TE E
Sbjct: 17 LANKMNIKVSDIIKKLMLLGVMVTINQVLDKETAELVAEEF---GVKVEVRVTLEETEAE 73
Query: 1336 KQINDAERQCKRTENLVT 1353
+Q D+ +VT
Sbjct: 74 EQDEDSGDLLVERPPVVT 91
>gnl|CDD|206779 cd11386, MCP_signal, Methyl-accepting chemotaxis protein (MCP),
signaling domain. Methyl-accepting chemotaxis proteins
(MCPs or chemotaxis receptors) are an integral part of
the transmembrane protein complex that controls bacterial
chemotaxis, together with the histidine kinase CheA, the
receptor-coupling protein CheW, receptor-modification
enzymes, and localized phosphatases. MCPs contain a four
helix trans membrane region, an N-terminal periplasmic
ligand binding domain, and a C-terminal HAMP domain
followed by a cytoplasmic signaling domain. This
C-terminal signaling domain dimerizes into a four-helix
bundle and interacts with CheA through the adaptor
protein CheW.
Length = 200
Score = 31.1 bits (71), Expect = 3.6
Identities = 13/57 (22%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
A +A +AE + Q A + R+ N ++ + +E V+ L+E I
Sbjct: 11 AASADQVAETS-QQAAELAEKGREAAEDAINQMNQIDESVDEAVSAVEELEESSAEI 66
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 30.3 bits (69), Expect = 3.6
Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 1/65 (1%)
Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQG-QEENEKALQRLNEKITSLEKNLP 1385
L+EQ T+ E+++ E Q ++ + K Q + Q++ E+ + E+ L +
Sbjct: 25 LQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAKRLEQKNQLLFQIE 84
Query: 1386 DLNEL 1390
+ EL
Sbjct: 85 QVQEL 89
>gnl|CDD|183114 PRK11388, PRK11388, DNA-binding transcriptional regulator DhaR;
Provisional.
Length = 638
Score = 32.0 bits (73), Expect = 3.6
Identities = 30/129 (23%), Positives = 55/129 (42%), Gaps = 16/129 (12%)
Query: 1216 FEL----LEKKIEDVKALVENTTRS-SHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNT 1270
FE+ L + ED+ ALV N RS + L DD ++++ P + L +
Sbjct: 488 FEITIPPLRMRREDIPALVNNKLRSLEKRFSTRLKIDDDALARLVSYRWPGNDFE-LRSV 546
Query: 1271 TQRINLA----RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALE---LTRQAHDT 1323
+ + L+ R+ L+DL E + T ++ +AT L + LE + A
Sbjct: 547 IENLALSSDNGRIRLSDLPEHLFTEQATDDV---SATRLSTSLSLAELEKEAIINAAQVC 603
Query: 1324 SLKLKEQST 1332
+++E +
Sbjct: 604 GGRIQEMAA 612
>gnl|CDD|112355 pfam03532, OMS28_porin, OMS28 porin.
Length = 253
Score = 31.3 bits (70), Expect = 3.7
Identities = 38/210 (18%), Positives = 86/210 (40%), Gaps = 17/210 (8%)
Query: 1198 TDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLT-------------VML 1244
+D + K+ + + + K EDV + +E SS +L M
Sbjct: 40 SDQKDNKKLDQKDQVNQALDTINKVTEDVSSKLEGVRESSLELVESNDAGVVKKFVGSMS 99
Query: 1245 TTIDDMKKQIMNNEDPNGVGKTLGNTTQRIN-LARLALNDLTEEIKKLNKTGE--MLKEN 1301
D K ++ +++ V K G + N + ++ + E K ++ GE L E
Sbjct: 100 LMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEMSKKAVQETQKAVSVAGEATFLIEK 159
Query: 1302 ATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQ 1361
+L ++ LELT++ ++KE +E+ +++ ++ ++ N+V P +
Sbjct: 160 QIMLNKSPNNKELELTKEEFAKVDEVKETLMASERALDETVQEAQKVLNMVNGLNPS-NK 218
Query: 1362 GQEENEKALQRLNEKITSLEKNLPDLNELI 1391
Q +K + + + + + DL +++
Sbjct: 219 DQVLAKKDVAKAISNVVKVAQGARDLTKVM 248
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 3.7
Identities = 30/113 (26%), Positives = 46/113 (40%), Gaps = 10/113 (8%)
Query: 1284 LTEEIKK-LNKTGEMLKENATLL--QENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
L E +K+ L++ E LKE+ LL + + R D L+E+ + KQ+ D
Sbjct: 141 LLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDA---LEEELRQ-LKQLED 196
Query: 1341 AERQCKRTE--NLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
C TE K + Q K L+ L E++ LE + DL
Sbjct: 197 ELEDCDPTELDRAKEK-LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKK 248
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members of
this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded and
aggregated proteins [Protein fate, Protein folding and
stabilization].
Length = 852
Score = 31.9 bits (73), Expect = 4.0
Identities = 18/83 (21%), Positives = 38/83 (45%), Gaps = 21/83 (25%)
Query: 1698 KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQML--------------L 1743
++LEKE+A +++ + L +++ KA G+++ + ++ L
Sbjct: 437 EDLEKELAELEEEYADLEEQWK------AEKAAIQGIQQIKEEIEQVRLELEQAEREGDL 490
Query: 1744 NKASQLSVNTTAKL-KQLNETEA 1765
KA++L +L K+L EA
Sbjct: 491 AKAAELQYGKLPELEKRLQAAEA 513
>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
Length = 238
Score = 31.2 bits (71), Expect = 4.2
Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
Query: 1307 ENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN 1366
V+ A+ +A D S + E+ TE E ++ + E E+ ++ +N
Sbjct: 7 PLEVKLAVISESEAEDNSPESSEEVTEQEAELTNPEGDAAEAESSPDSGSAA-SETAADN 65
Query: 1367 EKALQRLNEKITSLEKNLPDLNE 1389
L +L +++ SL++ L +LN
Sbjct: 66 AARLAQLEQELESLKQELEELNS 88
>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353).
Members of this family of uncharacterized proteins have
no known function.
Length = 319
Score = 31.4 bits (71), Expect = 4.3
Identities = 15/82 (18%), Positives = 36/82 (43%), Gaps = 2/82 (2%)
Query: 1635 QTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVE 1694
Q + R++ Q++ D+ + +Q+ E +KLL++ + + F G +
Sbjct: 68 QDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIKLLRQTIAQQRVEF--MGGGVRESF 125
Query: 1695 NEEKNLEKEVALAQKQASGLRS 1716
E +L ++ +++ L S
Sbjct: 126 EERNDLVSQLEKLREKCKQLES 147
Score = 31.4 bits (71), Expect = 4.6
Identities = 42/261 (16%), Positives = 93/261 (35%), Gaps = 30/261 (11%)
Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGD--LAKANDLKRKANLTKAGADS 1608
+++ E+ RE +++ ++ L E GD L + +++ G
Sbjct: 67 RQDLSQLLEKYREENQQLSTDVQELRQR---KAELQGDIKLLRQTIAQQRVEFMGGGVRE 123
Query: 1609 TKNTVQKIVDVLTEARTAQDMAEVAIQTA---KDDISAARKDLSQITNDLDDAQQKSNET 1665
+ +V L + R E +++ K+++ R + + + L +NE
Sbjct: 124 SFEERNDLVSQLEKLREKCKQLESDLRSLLDEKEEVLMERDEYQRKAHRL------NNEL 177
Query: 1666 NVKVKLLQERLKSIQAGFLQNG--RSAMDVENEEKNLEKEVALAQKQA-------SGLRS 1716
+ + R+ + A ++N + ++ EE L K KQ + +++
Sbjct: 178 SYLLGGDPRRVVDVDALVMENRYLKEKINTLEEESELIKMTLAKYKQMLEAKNKKTMVKA 237
Query: 1717 RYQETDN-------KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNS 1769
+ N K + + +S + S L TA L N+ +
Sbjct: 238 GEGDDKNSTGVISMKQVKELLASEAIELELPVKAATISDLKSLATALLDLCNDKQMALAH 297
Query: 1770 QESELTELSKNIAELQKRIQS 1790
Q L +AEL+K++ +
Sbjct: 298 QRRTNKILGNRVAELEKKLAT 318
>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 613
Score = 31.6 bits (72), Expect = 4.5
Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 30/167 (17%)
Query: 1455 LSESNSAKDYAETSKKLINEKEAKAEEV-FRGITTAKQESHAA-NILAKEAYDHAASVRN 1512
++++ +Y + + E ++ GI Q+ A N A D A
Sbjct: 78 YKKASTELEYTDYEFSTLQEASQYFPDLDDTGILKDLQDYFNAWNDFASNPKDSAQKQ-- 135
Query: 1513 KTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVI 1572
+ + T+ L + L ++ VN E+++ +EIN++
Sbjct: 136 -------ALAQKTETLTNNIKDTRERLDTLQK----KVN-------EELKVTVDEINSLG 177
Query: 1573 KSLTDIDTIL--TETAGDLAKANDLKRKA-----NLTK-AGADSTKN 1611
K + +I+ + E L AN+L+ K L+K GA+ K+
Sbjct: 178 KQIAEINKQIKEVEAGKTLKHANELRDKRDELELTLSKLVGANVFKS 224
>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
Length = 359
Score = 31.2 bits (72), Expect = 4.9
Identities = 27/94 (28%), Positives = 41/94 (43%), Gaps = 21/94 (22%)
Query: 1713 GLRSRYQETDNKLLSKAES-SGLKRARGQMLLNKA-SQLS--VNTTAKLKQ----LNETE 1764
L RY+E LLS E S KR R L+K ++L V + KQ L E +
Sbjct: 10 ALEERYEEL-EALLSDPEVISDQKRFR---KLSKEYAELEPIVEAYREYKQAQEDLEEAK 65
Query: 1765 AMFNSQ---------ESELTELSKNIAELQKRIQ 1789
M + + EL EL + + EL++ ++
Sbjct: 66 EMLEEESDPEMREMAKEELKELEERLEELEEELK 99
>gnl|CDD|218651 pfam05600, DUF773, Protein of unknown function (DUF773). This family
contains several eukaryotic sequences which are thought
to be CDK5 activator-binding proteins, however, the
function of this family is unknown.
Length = 506
Score = 31.6 bits (72), Expect = 5.1
Identities = 28/142 (19%), Positives = 53/142 (37%), Gaps = 23/142 (16%)
Query: 1550 VNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADST 1609
NK+ Q IRE+ IN I+ + + + I+ +G + + K T
Sbjct: 26 CNKDWQKAVLAIREK---INAAIQDMPEHEEIVQLLSGAYINYFHCLQIVEILKGTEADT 82
Query: 1610 KNTV-----------QKIV-----DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
KN Q+IV D + A AQ + ++ +I A RK +++
Sbjct: 83 KNIFGRYSSQRMKDWQEIVRLYEKDNVYLAEAAQIL----VRNVNYEIPALRKQIAKCQQ 138
Query: 1654 DLDDAQQKSNETNVKVKLLQER 1675
+ ++K + +E+
Sbjct: 139 LQQELERKEEDCKRSAADSREQ 160
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a number
of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and mature
spermatozoa, together with the detection of Gas8 also in
cilia at the apical surfaces of epithelial cells lining
the pulmonary bronchi and Fallopian tubes suggests that
the Gas8 protein may have a role in the functioning of
motile cellular appendages. Gas8 is a microtubule-binding
protein localised to regions of dynein regulation in
mammalian cells.
Length = 201
Score = 30.7 bits (70), Expect = 5.1
Identities = 34/181 (18%), Positives = 80/181 (44%), Gaps = 38/181 (20%)
Query: 1634 IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQER---LKSIQAGFLQNGRSA 1690
I++ K++I+ +K+ + + Q++ +K +E L+ + ++ +S
Sbjct: 29 IKSLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL 88
Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQ--ETD-NKLLSKAESS--------GLKRARG 1739
+++ K LEKE+ + ++ L R++ E + ++L K E++ GLK
Sbjct: 89 KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKTGLKN--- 145
Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL--SKNI-----AELQKRIQSCI 1792
+LL + KL+ LNE +E++L E+ + N+ + +++ +
Sbjct: 146 -LLLEQ----------KLEALNE---ELEKKEAQLNEVLAAANLDPAALQAVSEKLDDVL 191
Query: 1793 N 1793
Sbjct: 192 E 192
>gnl|CDD|233894 TIGR02492, flgK_ends, flagellar hook-associated protein FlgK. The
flagellar hook-associated protein FlgK of bacterial
flagella has conserved N- and C-terminal domains. The
central region is highly variable in length and sequence,
and often contains substantial runs of low-complexity
sequence. This model is built from an alignment of FlgK
sequences with the central region excised. Note that
several other proteins of the flagellar apparatus also
are homologous in the N- and C-terminal regions to FlgK,
but are excluded from this model [Cellular processes,
Chemotaxis and motility].
Length = 324
Score = 31.2 bits (71), Expect = 5.1
Identities = 38/191 (19%), Positives = 68/191 (35%), Gaps = 37/191 (19%)
Query: 1623 ARTAQDMAEVAIQTAKDDISAA------RKDLSQITNDLDDAQQKSNETNVKVK------ 1670
A++ +++A+ ++I+ A R+ K T VKV
Sbjct: 6 AKSGLSASQIALSVTSNNIANANTPGYSRQRAVLTAVPAGGQGGKQIGTGVKVTSIRRIQ 65
Query: 1671 ----LLQERLKSIQAGFLQNGRSAM----DVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
Q R + AG+ + S + + N+ + + + QE
Sbjct: 66 DQFLDNQYRRANSSAGYADSRASGLSKIEQIFNDLDG-----SGLSTYLNNFFNALQE-- 118
Query: 1723 NKLLSKAESSGLKRARGQMLLNKASQLSV---NTTAKLKQL-NETEAMFNSQESELTELS 1778
L +S L+ Q +L A L+ T+++L+ L A S +E+ L
Sbjct: 119 --LAKNPDSEALR----QAVLESAQALANQFNQTSSELQDLRKNINAEIKSAVTEINSLL 172
Query: 1779 KNIAELQKRIQ 1789
K IA L K I+
Sbjct: 173 KQIASLNKEIK 183
>gnl|CDD|237212 PRK12808, PRK12808, flagellin; Provisional.
Length = 476
Score = 31.5 bits (71), Expect = 5.5
Identities = 43/249 (17%), Positives = 82/249 (32%), Gaps = 28/249 (11%)
Query: 1550 VNKNIQSNPEQIRERANEINNVIKSL------TDIDTILTETAGDLAKANDLKRKAN--- 1600
+ N Q N + + + IN I++L I L + NDL AN
Sbjct: 124 IADNTQFNDKNLLNSNSSIN--IQTLDSSDGDKQIGIDLKSATLEALGINDLSIGANTDI 181
Query: 1601 --LTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
++ A KNT+ K EA+ A D A+ + +I A D + DDA
Sbjct: 182 EAVSTKIAALDKNTIAK---ATVEAKAAFDKAKDDTKAEDSNILDAAADGFK-DGKADDA 237
Query: 1659 QQKSNETNVKVKLLQ-----ERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
+ + E ++ + F + +D ++ L K +
Sbjct: 238 AKDVEAIKTALSAFTGAATLEEAEAAKTAFEVAQKDLVDTYTKKAALTK------DAVAD 291
Query: 1714 LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESE 1773
L + + +K + +A+ N ++ + ++ N + E+
Sbjct: 292 LDTAKTTSTRSKAAKDLVAAYDKAKSGAKPNDVAKAYLEAKMAYEKDNNAIDGKSKLEAA 351
Query: 1774 LTELSKNIA 1782
L K+
Sbjct: 352 DDALEKDAI 360
>gnl|CDD|185750 cd09237, V_ScBro1_like, Protein-interacting V-domain of Saccharomyces
cerevisiae Bro1 and related domains. This family
contains the V-shaped (V) domain of Saccharomyces
cerevisiae Bro1, and related domains. It belongs to the
V_Alix_like superfamily which also includes the V-domain
of Saccharomyces cerevisiae Rim20 (also known as PalA),
mammalian Alix (apoptosis-linked gene-2 interacting
protein X), His-Domain type N23 protein tyrosine
phosphatase (HD-PTP, also known as PTPN23), and related
domains. Bro1 interacts with the ESCRT (Endosomal Sorting
Complexes Required for Transport) system, and
participates in endosomal trafficking. The mammalian Alix
V-domain (belonging to a different family) contains a
binding site, partially conserved in the superfamily, for
the retroviral late assembly (L) domain YPXnL motif. The
Alix V-domain is also a dimerization domain. Bro1 also
has an N-terminal Bro1-like domain, which binds Snf7, a
component of the ESCRT-III complex, and a C-terminal
proline-rich region (PRR). The C-terminal portion
(V-domain and PRR) of S. cerevisiae Bro1 interacts with
Doa4, a ubiquitin thiolesterase needed to remove
ubiquitin from MVB cargoes. It interacts with a YPxL
motif in the Doa4s catalytic domain to stimulate its
deubiquitination activity.
Length = 356
Score = 31.1 bits (71), Expect = 5.8
Identities = 37/195 (18%), Positives = 63/195 (32%), Gaps = 48/195 (24%)
Query: 1203 IKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNG 1262
I S + + ELL + ++ S +DL + I+ M+K+I+
Sbjct: 62 IVSGLKSSSVDSQLELLRPQSASWVNEID---SSYNDLDEEMKEIEKMRKKILA------ 112
Query: 1263 VGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKEN----ATLLQENNVEGALELTR 1318
TQ + L L E++ KL K+ + +L+ + AL L
Sbjct: 113 ------KWTQSP-SSSLT-ASLREDLVKLKKSLVEASASDEKLFSLVDPVKEDIALLLNG 164
Query: 1319 QAHDTSLKLKEQSTETE----------------KQINDAERQCKRTENLVTKTFPQFTQG 1362
+ L S E KQI E + +
Sbjct: 165 GSLWEELFGFSSSGSPEPSLLDLDDSQNEQTVLKQIKQLEELLEDLNLIK---------- 214
Query: 1363 QEENEKALQRLNEKI 1377
EE ++ L+ L +KI
Sbjct: 215 -EERQRVLKDLKQKI 228
>gnl|CDD|215452 PLN02843, PLN02843, isoleucyl-tRNA synthetase.
Length = 974
Score = 31.3 bits (71), Expect = 6.2
Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 9/64 (14%)
Query: 344 NNHSTKCHFDEAVY-NSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDPNKEFSD-PEVC 401
++ + C Y GR+ GV + +G CERC + Y FSD P +C
Sbjct: 903 DDETAACGSYTGEYVEGAGRVWVGVSRA-----DGSKCERC--WNYSPAVGSFSDHPTLC 955
Query: 402 LRCD 405
RC
Sbjct: 956 ERCY 959
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 30.8 bits (70), Expect = 6.3
Identities = 23/112 (20%), Positives = 43/112 (38%), Gaps = 20/112 (17%)
Query: 1300 ENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQ- 1358
E +E E L+ T +A KL+ Q E++ + E + +R +
Sbjct: 70 EIVEAAEEEAKEELLKKTDEASSIIEKLQMQ---IEREQEEWEEELERLIEEAKAEGYEE 126
Query: 1359 -FTQGQEEN---------------EKALQRLNEKITSLEKNLPDLNELICDK 1394
+ +G+EE EK +++ E + S E+ + +L I K
Sbjct: 127 GYEKGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKK 178
>gnl|CDD|238105 cd00179, SynN, Syntaxin N-terminus domain; syntaxins are nervous
system-specific proteins implicated in the docking of
synaptic vesicles with the presynaptic plasma membrane;
they are a family of receptors for intracellular
transport vesicles; each target membrane may be
identified by a specific member of the syntaxin family;
syntaxins contain a moderately well conserved
amino-terminal domain, called Habc, whose structure is an
antiparallel three-helix bundle; a linker of about 30
amino acids connects this to the carboxy-terminal region,
designated H3 (t_SNARE), of the syntaxin cytoplasmic
domain; the highly conserved H3 region forms a single,
long alpha-helix when it is part of the core SNARE
complex and anchors the protein on the cytoplasmic
surface of cellular membranes; H3 is not included in
defining this domain.
Length = 151
Score = 29.9 bits (68), Expect = 6.4
Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 14/116 (12%)
Query: 1524 ITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILT 1583
I++ ++E L L + ++S ++I++ A EI +K L + +
Sbjct: 18 ISEDVEELQKLHSQLLTAPDADPELK--QELESLVQEIKKLAKEIKGKLKELEESNEQNE 75
Query: 1584 ETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQ-DMAEVAIQTAK 1638
G + D RK +K V+V+TE AQ E + +
Sbjct: 76 ALNGS---SVDRIRKT--------QHSGLSKKFVEVMTEFNKAQRKYRERYKERIQ 120
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere subunit.
CENP-Q is one of the components that assembles onto the
CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying centromere
identity. CENP-A nucleosomes directly recruit a proximal
CENP-A nucleosome associated complex (NAC) comprised of
CENP-M, CENP-N and CENP-T, CENP-U(50), CENP-C and CENP-H.
Assembly of the CENP-A NAC at centromeres is dependent on
CENP-M, CENP-N and CENP-T. Additionally, there are seven
other subunits which make up the CENP-A-nucleosome distal
(CAD) centromere, CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q,
CENP-R and CENP-S, also assembling on the CENP-A NAC.
Fta7 is the equivalent component of the fission yeast
Sim4 complex.
Length = 159
Score = 30.0 bits (68), Expect = 6.5
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 1742 LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
LL + L A+LKQL E +E+EL + + + EL+K +
Sbjct: 25 LLRRKRALERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAK 72
>gnl|CDD|236274 PRK08485, PRK08485, DNA polymerase III subunit delta'; Validated.
Length = 206
Score = 30.4 bits (69), Expect = 6.7
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 7/60 (11%)
Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
++L +L L D+ E +K+L K ++ KE L E+ L ++ + L E+ E
Sbjct: 121 LDLKKLDLKDIYEFLKELEKENKLSKEELKELIES-------LLKECVKYKIPLNEEELE 173
>gnl|CDD|214362 CHL00098, tsf, elongation factor Ts.
Length = 200
Score = 30.0 bits (68), Expect = 7.5
Identities = 40/181 (22%), Positives = 74/181 (40%), Gaps = 36/181 (19%)
Query: 1582 LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD--VLTEARTAQDMAEVAIQTAKD 1639
L E GD KA + R+ L A ST+ T + +++ + T + + E+ +T
Sbjct: 23 LQEANGDFEKALESLRQKGLASANKKSTRITTEGLIESYIHTGGKLGV-LVEINCET--- 78
Query: 1640 DISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKN 1699
D A R++ ++ ++ A Q + NVK S D+ E N
Sbjct: 79 DFVARREEFQKLAKNI--AMQIAACPNVKYV------------------SLEDIPEEIIN 118
Query: 1700 LEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKA----SQLSVNTTA 1755
LEK++ + L+++ +E K++ E KR + LL++ ++V
Sbjct: 119 LEKKIESEKDD---LQNKPEEIKEKIV---EGRIKKRLKELSLLDQPFIRDQSITVEELI 172
Query: 1756 K 1756
K
Sbjct: 173 K 173
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.4 bits (69), Expect = 8.6
Identities = 26/129 (20%), Positives = 49/129 (37%), Gaps = 10/129 (7%)
Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQ 1680
EA +M+E ++ + DD D S + + + K K +R K
Sbjct: 229 PEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKE-- 286
Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
+ ++E E K EK++ Q + L+ +E K ++A ++ RG+
Sbjct: 287 -------KRRKELEREAK-EEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGE 338
Query: 1741 MLLNKASQL 1749
K +L
Sbjct: 339 KKKLKRRKL 347
>gnl|CDD|235588 PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed.
Length = 912
Score = 30.8 bits (71), Expect = 9.4
Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 1/30 (3%)
Query: 375 TEGRNCERCKQYFYKDPNKEFSDPEVCLRC 404
EG CERC Y +D + P +C RC
Sbjct: 878 AEGEKCERCWHYR-EDVGSDAEHPTLCGRC 906
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.134 0.422
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 86,377,504
Number of extensions: 8177261
Number of successful extensions: 9534
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9088
Number of HSP's successfully gapped: 453
Length of query: 1807
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1696
Effective length of database: 6,014,308
Effective search space: 10200266368
Effective search space used: 10200266368
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 66 (29.1 bits)