RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4474
(2174 letters)
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 69.3 bits (170), Expect = 2e-11
Identities = 117/800 (14%), Positives = 297/800 (37%), Gaps = 132/800 (16%)
Query: 732 LERSKHIVDDIEKKKDDLKPIQKEAQ----------QLSEWDSSIVPKVSDLETRWAAVK 781
L+R + I++++E++ LK ++++A+ +L E + ++ L R ++
Sbjct: 188 LDRLEDILNELERQ---LKSLERQAEKAERYKELKAELRELELAL------LVLRLEELR 238
Query: 782 SAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHE 841
+ +L + +Q+ E+ L ++ L ++ ++ +E++Q
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELR--------LEVSELEEEIEELQK-- 288
Query: 842 KLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPD 901
+ + N + E+ + +L ++ Q++ E
Sbjct: 289 ----ELYALANEISRLEQQKQILRERLANLE----RQLEELEAQLEELES---------- 330
Query: 902 NKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAK 961
K+D L +E +++++ + L L +E E ++E +E++
Sbjct: 331 -----------------KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
Query: 962 LKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLT 1021
L+ELE +E L LQ ++ L + + +
Sbjct: 374 LEELEEQLETLRSKVAQLELQIASLN-NEIERLEARLER--LEDRRERLQQEIEELLKKL 430
Query: 1022 VNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVST 1081
+ +Q +L +E+ + ++++ L+ ++EEA+ L AE +LA+ +
Sbjct: 431 EEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQ-LQA 489
Query: 1082 QAEDISEALTVYQKLLDEINSWKNSQSDEDLVH--LADSIKP-------LDEVV-ERIEN 1131
+ + + + + + + +QS + L++ I ++ + R++
Sbjct: 490 RLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQA 549
Query: 1132 HAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQE 1191
V L + + V F+ + + E+ GN + + I+ + + D+ +
Sbjct: 550 VVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVK 609
Query: 1192 YEA-----------------ILATA--------------SDKGDQLSSDGTISDRNEITE 1220
++ L A + GD + G I+ + T
Sbjct: 610 FDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTN 669
Query: 1221 Q-LQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSR-LH 1278
+ R++++ L + +E+L ++ + +L ELEE+ E L + SR +
Sbjct: 670 SSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS 729
Query: 1279 NLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDK 1338
LRK + +L + E+ +L EL E+ + E ++ L E + EL+
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789
Query: 1339 QRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANE-HEAYKQAYNEAYDWLRKAKLGA 1397
Q +++ + K + EA+D+ + +E R+
Sbjct: 790 Q------------IEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837
Query: 1398 QANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEI 1457
+ D ++ ++ ++ I+ + + E ++LI++ + +L + ++ + +
Sbjct: 838 RRLEDL---EEQIEELSEDIESLAAEIEELEELIEELESELEALLNE-----RASLEEAL 889
Query: 1458 NQITLDWTNLQNTLQDIDKH 1477
+ + L L++++
Sbjct: 890 ALLRSELEELSEELRELESK 909
Score = 62.4 bits (152), Expect = 3e-09
Identities = 128/818 (15%), Positives = 308/818 (37%), Gaps = 101/818 (12%)
Query: 635 MTVLRDSWDRLNRDLKSTSTQLKSYIARWNELDDLYNRFNMWLSGVENKLNEPSPVFTEI 694
+ LR+ + L +LK +L+ A EL++ + +S +E ++ E
Sbjct: 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE-------- 285
Query: 695 DLNTKWQQYVNDELEWENHSQNTLQWLDNIRSDMKTLLERSKHIVDDIEKKKDDLKPIQK 754
L + N+ E Q + L N+ ++ L + + + +++ ++L +++
Sbjct: 286 -LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEE 344
Query: 755 EAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQ 814
+ ++L E S+ ++ +LE ++S + + ++ + Q
Sbjct: 345 KLEELKEELESLEAELEELEAELEELESRLEEL--------------EEQLETLRSKVAQ 390
Query: 815 ISFQLMAHNSLYINNRQQTLEQIQHH-EKLLQDYSSYQNAVQDAEKWLLQISF----QLM 869
+ Q+ + N+ + LE+++ E+L Q+ +++AE LQ + +
Sbjct: 391 LELQIASLNNEIERLEAR-LERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEEL 449
Query: 870 AHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKL----APCFSAEEGSPIKDKIDRLEK 925
+ ++ LE+++ + + + + + +L A S E + K
Sbjct: 450 EELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVK 509
Query: 926 ETKAQSDRIQDALKTLNQLIGHRKEFEVDVENV-EAKLKELEV--------AVEGDVKTN 976
+ L L++LI + +E +E +L+ + V A+ +
Sbjct: 510 ALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNE 569
Query: 977 N--VLMLQELLNKYAQLNEEAGE-LNMMISNVSIATEHMNLNDADRLTVN---------- 1023
V L K ++ E L + + +A + + + R ++
Sbjct: 570 LGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVD 629
Query: 1024 --DSVSSMQKRYAQLSRTI-----------------EDRISTVNNKLQSVKRLQHKVEEA 1064
D+ + K+ R + S++ + + ++ L+ K+EE
Sbjct: 630 DLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEEL 689
Query: 1065 KALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDE 1124
+ + + E LA + + E++ E L +K L+E++ + L + L+
Sbjct: 690 EEKIAELEKALAE-LRKELEELEEELEQLRKELEELS--------RQISALRKDLARLEA 740
Query: 1125 VVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKY-- 1182
VE++E ++ L + I ++ + E + E + ++ +I++
Sbjct: 741 EVEQLEE----RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796
Query: 1183 --DKIIADIQEYEAILATASDKGDQLSSDGTISDRN--EITEQLQSRKQQLHNLRKTVEK 1238
+ + E A L +++ L +R +L+ ++Q+ L + +E
Sbjct: 797 ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIES 856
Query: 1239 LRQQNEKRAAESEKLGAELEEIIEALHARETAVPS-SSRLHNLRKTVEKLRQQNEKRAAE 1297
L + E+ E+L +ELE ++ + E A+ S L L + + +L + + E
Sbjct: 857 LAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRE 916
Query: 1298 SEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYET 1357
E+L +L ++ L E + L L E S+ +E + + + +R +
Sbjct: 917 LEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKR 976
Query: 1358 LKKNIAE-------AIDKFETQANEHEAYKQAYNEAYD 1388
L+ I E AI+++E ++ + +
Sbjct: 977 LENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTE 1014
Score = 47.0 bits (112), Expect = 1e-04
Identities = 62/279 (22%), Positives = 117/279 (41%), Gaps = 34/279 (12%)
Query: 1771 ELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGN 1830
EL+ + E + L R ++ +++L QL+ ++E L + ++ LA +
Sbjct: 688 ELEEKIAELEKALAELRKELEEL---EEELEQLRKELEELSRQISALRKDLARLEAEVEQ 744
Query: 1831 VLKR-AQKKIE-AEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHS 1888
+ +R AQ E E + + LE+ + L E + +E L + E L E
Sbjct: 745 LEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE---IEELEAQIEQLKE--ELK 799
Query: 1889 PIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDE 1948
+RE L+A L E +E +E+ + AT + ++
Sbjct: 800 ALREALDELRAELTLL-----------------NEEAANLRERLESLERRIAATERRLED 842
Query: 1949 LEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLK 2008
LE I+E ++IE+ + E + ++ + L+++ E A+ LRS+L +
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLN----ERASLEEALALLRSELEE 898
Query: 2009 LKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKL 2047
L E L++ E + + EE +KLAQ+E + L
Sbjct: 899 LSEELRELESKRSELR---RELEELREKLAQLELRLEGL 934
Score = 47.0 bits (112), Expect = 1e-04
Identities = 50/298 (16%), Positives = 118/298 (39%), Gaps = 20/298 (6%)
Query: 1109 DEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEV 1168
E+L L D + L+ ++ +E A K + L+ + L ++ + E+
Sbjct: 185 RENLDRLEDILNELERQLKSLERQAE-KAERYKELKAELREL--ELALLVLRLEELREEL 241
Query: 1169 DGNSATVQEKIDKYDKIIADIQEYEAILATASDK------------GDQLSSDGTISDRN 1216
+ ++E ++ +++ A++QE E L + + + IS
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301
Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSR 1276
+ + L+ R L + +E ++ E + E + AELEE +E L ++ +
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL--EAE 359
Query: 1277 LHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVEL 1336
L L +E+L + E+ + E L + ++ + + + L + ++
Sbjct: 360 LEELEAELEELESRLEELEEQLETLRS---KVAQLELQIASLNNEIERLEARLERLEDRR 416
Query: 1337 DKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAK 1394
++ + E+ + + L+ + E ++ E E E ++A E + L +A+
Sbjct: 417 ERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAE 474
Score = 40.0 bits (94), Expect = 0.016
Identities = 66/379 (17%), Positives = 151/379 (39%), Gaps = 31/379 (8%)
Query: 731 LLERSKHIVD---DIEKKKDDLKPIQKEAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAK 787
+LER + I + IE+ ++ + ++K +L + + ++ L + A
Sbjct: 672 ILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISAL 731
Query: 788 HSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDY 847
DL+ A+ + + K L ++ ++ ++ E E+L
Sbjct: 732 RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791
Query: 848 SSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAPC 907
+ ++ + L ++ +L N N ++ LE ++ + + + ++
Sbjct: 792 EQLKEELKALREALDELRAELTLLNEEA-ANLRERLESLERRIAATERRLEDLEEQIEE- 849
Query: 908 FSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEV 967
+E+ + +I+ LE+ + ++ L L ++E + +L+ELE
Sbjct: 850 -LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELES 908
Query: 968 AVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNV-SIATEH--MNLNDADRLTVN- 1023
+++ L+EL K AQL L + I N+ +E + L +A+ L
Sbjct: 909 KR-SELRRE----LEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKI 963
Query: 1024 -DSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQ 1082
D ++R +L I + VN L + +EE + L ++ + ++ Q
Sbjct: 964 EDDEEEARRRLKRLENKI-KELGPVN--LAA-------IEEYEELKERYDF-----LTAQ 1008
Query: 1083 AEDISEALTVYQKLLDEIN 1101
ED++EA ++ ++EI+
Sbjct: 1009 KEDLTEAKETLEEAIEEID 1027
Score = 37.7 bits (88), Expect = 0.087
Identities = 62/325 (19%), Positives = 127/325 (39%), Gaps = 29/325 (8%)
Query: 1732 KKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQ 1791
+++ EL+ +L EE + L L+ E+KL EL+ + E + +++ +
Sbjct: 238 REELEELQEELKE---AEEELEELTAELQELEEKLE---ELRLEVSELEEEIEELQKELY 291
Query: 1792 DATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEA-EKDIGDCTN 1850
+ L Q K + E+ ++L + L +++ E+ + +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
Query: 1851 LDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYA---ALVTS 1907
+LE +A L+E+ + ++E+ ++ E +E S + + L++ +L L
Sbjct: 352 --ELESLEAELEELEAELEELESRLEELEEQLE-TLRSKVAQLELQIASLNNEIERLEAR 408
Query: 1908 VQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQW 1967
++ L + E+ + E L K EE + E +ELE ++E Q+ +E
Sbjct: 409 LERLEDRRERLQQEIEELLKKLEE-----AELKELQAELEELEEELEELQEELERLEEAL 463
Query: 1968 NSFRETLAQMTAWLDSVEKNIKQ---EIATPWTTTQELRSKLLKLKEILKKCEKGVGDHE 2024
RE L + LD+ E+ + Q + + + L +K +LK G
Sbjct: 464 EELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILG 523
Query: 2025 AFVDKYEECSQKLAQVEAKYAKLIE 2049
+ V+ Y IE
Sbjct: 524 VLSELIS--------VDEGYEAAIE 540
Score = 35.0 bits (81), Expect = 0.52
Identities = 42/281 (14%), Positives = 106/281 (37%), Gaps = 23/281 (8%)
Query: 1888 SPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECD 1947
++EE + L +Q L ++E+ + +E + EE++ L+ + +
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301
Query: 1948 ELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLL 2007
+ + ++E N+E Q + + E L + E ++L
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESK--------LDELAEELAEL--EEKLEELKE 351
Query: 2008 KLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTAL 2067
+L+ + + E+ + E + EE ++L + +K A+L ++ ++ L
Sbjct: 352 ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQL------ELQIASLNNEIERL 405
Query: 2068 LTEKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILS 2127
++ +++ + E + ++++LQ +E L +++ + +
Sbjct: 406 EARLERLEDRR-------ERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELER 458
Query: 2128 QVEAKYAKLIEPETSYEELQKRQGQLTALLAEKVDMDVLLN 2168
EA E E + + L + +L L A ++ L
Sbjct: 459 LEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE 499
Score = 35.0 bits (81), Expect = 0.54
Identities = 70/376 (18%), Positives = 153/376 (40%), Gaps = 55/376 (14%)
Query: 1517 NLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGIIGMTELESEEVK-----DLIQVGVHG 1571
+++LE+ L+E K A+ +K L + + + ELE E + + + +
Sbjct: 678 EIEELEEKIEELEE---KIAELEKALAELRKE----LEELEEELEQLRKELEELSRQISA 730
Query: 1572 LQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKV 1631
L+ +L L +E ++A ++E ++ +I +L+ L++ ++ L ++
Sbjct: 731 LRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEE------AEEELAEAEAEI 784
Query: 1632 NEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQG 1691
E++ ++ +L+ L + ++ +L + NE +L E+ +++ +
Sbjct: 785 EELEAQIEQLKEELKALREALDELRAEL-TLLNEE-AANLRERLESLERRIAATERRLED 842
Query: 1692 LESRLVQWNEYETSLGKLITWLNDTENVLKNFTLQPTLNEKKQQRELKGKLSSWCGYEES 1751
LE ++ + +E SL I L + L++ L+ LNE+ E L S EE
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELES-ELEALLNERASLEEALALLRS--ELEEL 899
Query: 1752 VATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLH 1811
L + E K EL+ L+E + +L + + + L++++ +
Sbjct: 900 SEELREL----ESKR---SELRRELEELREKLAQLELRLEGL---EVRIDNLQERLSEEY 949
Query: 1812 QPSEQSKQQLATITTRHGNVLKRAQKKIEAE-KDIGDCTNL------------------- 1851
+ + + L +R K++E + K++G NL
Sbjct: 950 SLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGP-VNLAAIEEYEELKERYDFLTAQ 1008
Query: 1852 -DDLEKYKALLQEVIS 1866
+DL + K L+E I
Sbjct: 1009 KEDLTEAKETLEEAIE 1024
Score = 33.1 bits (76), Expect = 2.1
Identities = 49/272 (18%), Positives = 102/272 (37%), Gaps = 13/272 (4%)
Query: 417 ENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSSKNTLEKWIEGFQKKIE-AE 475
E + ++I ++ L+ L + K + L + I +K + E
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
Query: 476 KDIGDCTN-LDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALY 534
++ + L K L+ I + L + E L E A I E +++ L
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEER--LEEAEEELAEAEAE--IEELEAQIEQLK 795
Query: 535 AALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECAGNGSKNVLKERLDT 594
L + L + L+ E A E L + A + + + ++E +
Sbjct: 796 EELKALREALDE-LRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ-IEELSED 853
Query: 595 VNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQQDSLREAMTVLRDSWDRLNRDLKSTST 654
+ +A I E + L+ L+ L+ ++ SL EA+ +LR + L+ +L+ +
Sbjct: 854 IESLAAEIEELEELIEELESELEALLN-----ERASLEEALALLRSELEELSEELRELES 908
Query: 655 QLKSYIARWNELDDLYNRFNMWLSGVENKLNE 686
+ EL + + + L G+E +++
Sbjct: 909 KRSELRRELEELREKLAQLELRLEGLEVRIDN 940
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 62.0 bits (151), Expect = 4e-09
Identities = 143/822 (17%), Positives = 321/822 (39%), Gaps = 109/822 (13%)
Query: 1177 EKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTV 1236
EK ++Y ++ A+++E E L ++ E+ ++L+ +++L L + +
Sbjct: 210 EKAERYQELKAELRELELALL--------------LAKLKELRKELEELEEELSRLEEEL 255
Query: 1237 EKLRQQNEKRAAESEKLGAELEEIIEALHARETAV-PSSSRLHNLRKTVEKLRQQNEKRA 1295
E+L+++ E+ E E+L +ELEE+ E L + + + L + LR++ E+
Sbjct: 256 EELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELE 315
Query: 1296 AESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRY 1355
E E+L LEE+ E + A + ++ LL E+ + EL++ + EL K ++
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSA-LLEEL 374
Query: 1356 ETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKAD 1415
E L + + E + + E + E + + L + + E+ K+
Sbjct: 375 EELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL--KELEA 432
Query: 1416 KIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQNTLQDID 1475
+++++ L E + +++ + + + + +E+ ++ + L+ L ++
Sbjct: 433 ELEELQTELEELNEELEELEEQLEELRDR-----LKELERELAELQEELQRLEKELSSLE 487
Query: 1476 KHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEGLHKE 1535
+ + + LE G ++ EKY+ L+ L
Sbjct: 488 ARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKVK-----EKYETALEAALGNR 542
Query: 1536 ADN--------KKVLIDSFNKGIIG-MTELESEEVKDLIQV------GVHGLQVEL---- 1576
K I+ + G T L + +K L + G GL +L
Sbjct: 543 LQAVVVENEEVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDAAPGFLGLASDLIDFD 602
Query: 1577 -------NALLQN-IEAEINKVANAAQERKAMQDKISKLQS--WLKQYIDYSGNKQALQS 1626
+L + + + + A + ++ +I L G++ S
Sbjct: 603 PKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRSS 662
Query: 1627 --RLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSS 1684
+ ++ E++E E+ +L+ L + +++ +L S + +L + E+ +
Sbjct: 663 LAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDL--LEELRRQLEELERQLEE 720
Query: 1685 LSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLKNFTLQPTLNEKKQQRELKGKLSS 1744
L LE L Q L + + L + LQ L E +++ E
Sbjct: 721 LKRELAALEEELEQLQSRLEELEEELEELEEELE-----ELQERLEELEEELE------- 768
Query: 1745 WCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLK 1804
+ E + E++ ++++A + + ++ ++ L L+
Sbjct: 769 ---------------SLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALE 813
Query: 1805 DKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQEV 1864
++ESL Q E+ +Q++ + ++ ++K++ ++ L++LEK L+E
Sbjct: 814 RELESLEQRRERLEQEIEELEEE----IEELEEKLDELEE-----ELEELEKELEELKEE 864
Query: 1865 IS-HNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQT 1923
+ ++ E L D+ + L EE L+ L + + L ++EK
Sbjct: 865 LEELEAEKEELEDELKEL---------EEEKEELEEELRELESELAELKEEIEKLRERLE 915
Query: 1924 EFLAKKEEVEAWLHKAHATVQEC--DELEAGIKEEQKNIETQ 1963
E AK E +E L + ++E D LE ++ E + +E +
Sbjct: 916 ELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957
Score = 57.4 bits (139), Expect = 9e-08
Identities = 64/339 (18%), Positives = 149/339 (43%), Gaps = 14/339 (4%)
Query: 1055 KRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVH 1114
K L+ ++ E +A L+K E +L + + + + L ++ L+E+ + + +L
Sbjct: 670 KELEEELAELEAQLEKLEEELKS-LKNELRSLEDLLEELRRQLEEL-ERQLEELKRELAA 727
Query: 1115 LADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGF---ITETTARVGEVDGN 1171
L + ++ L +E +E + L L+E+ L ++ + + + E++
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEK 787
Query: 1172 SATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHN 1231
+QE++++ ++ + + + + + + + E+ E+++ +++L
Sbjct: 788 RQALQEELEELEEELEEAER-RLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDE 846
Query: 1232 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HARETAVPSSSRLHNLRKTVEKLRQQ 1290
L + +E+L ++ E+ E E+L AE EE+ + L E L L + +L+++
Sbjct: 847 LEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEE 906
Query: 1291 NEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQP 1350
EK E+L A+LE + L E ++ LE ++E + +RLE + + P
Sbjct: 907 IEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLE---TELEREIERLEEEIEALGP 963
Query: 1351 Y----VQRYETLKKNIAEAIDKFETQANEHEAYKQAYNE 1385
++ YE +++ E + E E + E
Sbjct: 964 VNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002
Score = 55.5 bits (134), Expect = 3e-07
Identities = 134/774 (17%), Positives = 308/774 (39%), Gaps = 95/774 (12%)
Query: 627 QQDSLREAMTVLRDSWDRLNRDLKSTSTQLKSYIARWNELDDLYNRFNMWLSGVENKLNE 686
+ + L E ++ L + + L +L+ +++ + EL + L ++ ++ E
Sbjct: 240 ELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE 299
Query: 687 PSPVFTEIDLNTKWQQYVNDELEWENHSQNTLQWLDNIRSDMKTLLERSKHIVDDIEKKK 746
+ EI L + + + +ELE + L+ L +K LE + +++++E+
Sbjct: 300 ---LEGEISLLRERLEELENELE---ELEERLEELKEKIEALKEELEERETLLEELEQLL 353
Query: 747 DDLKPIQKEAQQ-LSEWDSSIVPKVSDLETRWAAVKSA---WDAKHSDLSSEIAQYSSYQ 802
+L+ ++E ++ LS + L A +++ + +L EI
Sbjct: 354 AELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERL 413
Query: 803 N----AVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLL-------QDYSSYQ 851
++D ++ L ++ +L + ++ E + E+L ++ + Q
Sbjct: 414 ERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQ 473
Query: 852 NAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVI------LSPDNKLA 905
+Q EK L + +L + Q ++ E L V + K
Sbjct: 474 EELQRLEKELSSLEARLDRLEAE-QRASQGVRAVLEALESGLPGVYGPVAELIKVKEKYE 532
Query: 906 PCFSAEEGSPIKDKI---------------------------DRLEKETKAQSDRIQDAL 938
A G+ ++ + DR++ +SD L
Sbjct: 533 TALEAALGNRLQAVVVENEEVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDAAPGFL 592
Query: 939 KTLNQLIGHRKEFEVDVENV------------------EAKLKELEVAVEGDV------K 974
+ LI ++E V V + ++K V ++GD+
Sbjct: 593 GLASDLIDFDPKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSI 652
Query: 975 TNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYA 1034
T + L + +L E EL + + + E + + ++ D + ++++
Sbjct: 653 TGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLE 712
Query: 1035 QLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQ 1094
+L R +E+ + + +++LQ ++EE + L++ E +L + + E++ E L +
Sbjct: 713 ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEE-LQERLEELEEELESLE 771
Query: 1095 KLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVP---KLKSLLLLREQFTTLI 1151
+ L ++ + + +E L + ++ L+E +E E +L+SL RE+ I
Sbjct: 772 EALAKLKE-EIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 1152 MQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGT 1211
++ I E ++ E++ +++++ +++ +++E EA D+ +L
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKEL---EELKEELEELEAEKEELEDELKELE---- 883
Query: 1212 ISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAV 1271
++ E+ E+L+ + +L L++ +EKLR++ E+ A+ E+L EL E+ E L
Sbjct: 884 -EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDT 942
Query: 1272 PSSS---RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
+ + L + +E L N + E E++ EE+ E A + L
Sbjct: 943 LETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKL 996
Score = 54.3 bits (131), Expect = 8e-07
Identities = 67/357 (18%), Positives = 144/357 (40%), Gaps = 34/357 (9%)
Query: 1051 LQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDE 1110
+ ++R + +E + LL++ E QL + QAE YQ+L E+ + +
Sbjct: 178 ERKLERTEENLERLEDLLEELEKQL-EKLERQAEKAER----YQELKAELRELELALLLA 232
Query: 1111 DLVHLADSIKPLDEVVERIENH----------AVPKLKSLLLLREQFTTLIMQIVGFITE 1160
L L ++ L+E + R+E A +++ L E+ + ++ + E
Sbjct: 233 KLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLE 292
Query: 1161 TTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITE 1220
+ E++G + ++E++++ + +++E E L +K + L + E
Sbjct: 293 LKEEIEELEGEISLLRERLEELEN---ELEELEERLEELKEKIEALKEELE-----ERET 344
Query: 1221 QLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNL 1280
L+ +Q L L + E+L ++ E E+L L E + L A + + L L
Sbjct: 345 LLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI--RNELEEL 402
Query: 1281 RKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSV-------- 1332
++ +E L ++ E+ + E L EL+E+ L +T ++ L L E+
Sbjct: 403 KREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462
Query: 1333 -DVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYD 1388
++E + L+ + ++ + E + + EA + Y
Sbjct: 463 KELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYG 519
Score = 53.6 bits (129), Expect = 1e-06
Identities = 82/421 (19%), Positives = 167/421 (39%), Gaps = 70/421 (16%)
Query: 1653 ENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITW 1712
S R + + +LA L EK L +K L S E L +L
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEEL--- 714
Query: 1713 LNDTENVLKNFTLQPTLNEKKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMEL 1772
++Q ELK +L++ EE + L L+ E++L + E
Sbjct: 715 -------------------ERQLEELKRELAA---LEEELEQLQSRLEELEEELEELEEE 752
Query: 1773 KTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVL 1832
L E+ +L+ ++ S ++ L +LK++IE L + + +++L + L
Sbjct: 753 LEELQERLEELE------EELESLEEALAKLKEEIEELEEKRQALQEELEELEE----EL 802
Query: 1833 KRAQKKIEAEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIRE 1892
+ A+++++ AL +E+ S E L + E L E
Sbjct: 803 EEAERRLD------------------ALERELESLEQRRERLEQEIEEL---------EE 835
Query: 1893 ETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAG 1952
E L+ L ++ L ++E+ + E A+KEE+E L + +E +E
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895
Query: 1953 IKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPW--TTTQELRSKLLKLK 2010
++ E ++ + + E L L+ +++E+ + T EL ++ +L+
Sbjct: 896 LESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLE 955
Query: 2011 EILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTE 2070
E ++ V +++YEE ++ +++++ L E + E+L + +L E
Sbjct: 956 EEIEALGP-VN--LRAIEEYEEVEERYEELKSQREDLEE---AKEKLLEVIEELDKEKRE 1009
Query: 2071 K 2071
+
Sbjct: 1010 R 1010
Score = 48.6 bits (116), Expect = 5e-05
Identities = 46/237 (19%), Positives = 102/237 (43%), Gaps = 10/237 (4%)
Query: 1158 ITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNE 1217
+ + + + ++++ +++ + + +++ L ++ S E
Sbjct: 683 LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQ-LQSRLEE 741
Query: 1218 ITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS-- 1275
+ E+L+ +++L L++ +E+L ++ E KL E+EE+ E A + +
Sbjct: 742 LEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEE 801
Query: 1276 ------RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEV 1329
RL L + +E L Q+ E+ E E+L E+EE+ E L E ++ L L E+
Sbjct: 802 LEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEEL 861
Query: 1330 TSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEA 1386
EL+ ++ EL + + + E L++ + E + E E ++ E
Sbjct: 862 KEELEELEAEKEELEDELKE-LEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917
Score = 44.3 bits (105), Expect = 8e-04
Identities = 128/855 (14%), Positives = 315/855 (36%), Gaps = 123/855 (14%)
Query: 788 HSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDY 847
+ L + + + E+ L ++ +L + L+ E+L ++
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEE-----AEKEIEELKSE--LEELREEL 283
Query: 848 SSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAPC 907
Q + + ++ + ++ ++ + + + LE+++ + L++ I + +L
Sbjct: 284 EELQEELLELKEEIEELEGEI-SLLRERLEELENELEELEERLEELKEKIEALKEEL--- 339
Query: 908 FSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEV 967
EE + +++++L E + + +++ L L + + E E + +L ELE
Sbjct: 340 ---EERETLLEELEQLLAELEEAKEELEEKLSALLEEL------EELFEALREELAELE- 389
Query: 968 AVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVS 1027
EL +L E E+ + + +E + + + +
Sbjct: 390 --------------AELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELE 435
Query: 1028 SMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDIS 1087
+Q +L+ +E+ + +K L+ ++ E + LQ+ E +L S+ +
Sbjct: 436 ELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKEL----SSLEARLD 491
Query: 1088 EALTVYQKLLDEINSWKNSQSDEDLVH--LADSIKPLDEVVERIENHAVPKLKSLLLLRE 1145
+ + +S V+ +A+ IK ++ +E +L+++++ E
Sbjct: 492 RLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKVKEKYETALEAALGNRLQAVVVENE 551
Query: 1146 QFTTLIMQI--------VGFI---TETTARVGEVDGNSATVQEKID--KYDK-------- 1184
+ ++ F+ R + D + D +D
Sbjct: 552 EVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDAAPGFLGLASDLIDFDPKYEPAVRF 611
Query: 1185 ------IIADIQEYEAILATASDK-------GDQLSSDGTISDRNEITEQLQSRKQQLHN 1231
++ D+++ + K GD + G+I+ + ++K++L
Sbjct: 612 VLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRSSLAQKRELKE 671
Query: 1232 LRKTVEKLRQQ---NEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLR 1288
L + + +L Q E+ + LE+++E L +L L + +E+L+
Sbjct: 672 LEEELAELEAQLEKLEEELKSLKNELRSLEDLLEEL---------RRQLEELERQLEELK 722
Query: 1289 QQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEV 1348
++ E E+L + LEE+ E L E ++ L L E+ L++ +L
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE--- 779
Query: 1349 QPYVQRYETLKKNIAEAIDKFETQANEHE-AYKQAYNEAYDWLRKAKLGAQANADCHGEQ 1407
++ E ++ + E +++ E + E E E ++ + Q + +
Sbjct: 780 --EIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEEL---E 834
Query: 1408 QTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNL 1467
+ ++ +K+ ++ + L E +K ++ LK + E KE + E+ ++ + L
Sbjct: 835 EEIEELEEKLDELEEELEELEKELE---ELKEELEELE--AEKEELEDELKELEEEKEEL 889
Query: 1468 QNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKAL 1527
+ L++++ A+ LE K+E + + E+Y+
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELE-------AKLERLEVELPELEEELEEEYEDT 942
Query: 1528 LQEGLHKEADNKKVLIDSF---NKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQNIE 1584
L+ L +E + + I++ N I E E + EL + +++E
Sbjct: 943 LETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYE------------ELKSQREDLE 990
Query: 1585 AEINKVANAAQERKA 1599
K+ +E
Sbjct: 991 EAKEKLLEVIEELDK 1005
Score = 43.5 bits (103), Expect = 0.001
Identities = 66/347 (19%), Positives = 146/347 (42%), Gaps = 20/347 (5%)
Query: 1832 LKRAQKKIEAEKDIGDC--TNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSP 1889
L+R ++ +E +D+ + L+ LE+ + +++ L E + LA
Sbjct: 181 LERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL----ELALLLAKLKE 236
Query: 1890 IREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDEL 1949
+R+E L+ + L ++ L ++E+ + E ++ EE+ L + + E E
Sbjct: 237 LRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE 296
Query: 1950 EAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKL 2009
++ E + + + + E L + L + +K+E+ T +EL L +L
Sbjct: 297 IEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAEL 356
Query: 2010 KEILKKCEKGVGDHEAFVD-KYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALL 2068
+E ++ E+ + ++ +E ++LA++EA+ A++ EEL++ L L
Sbjct: 357 EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRN---ELEELKREIESLEERL 413
Query: 2069 TEKVDMDVLLNACV-DLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILS 2127
+ L + +L ++ E +E E L ++E+L++ +E L D++ E +
Sbjct: 414 ERLSERLEDLKEELKELEAELEELQTEL--EELNEELEELEEQLEELRDRLKELERELAE 471
Query: 2128 QVEAKYAKLIEPETSYEELQKRQGQLTALLAEKVDMDVLLNASVKVN 2174
E +EL + +L L AE+ + +
Sbjct: 472 LQEELQRL-------EKELSSLEARLDRLEAEQRASQGVRAVLEALE 511
Score = 42.4 bits (100), Expect = 0.004
Identities = 67/352 (19%), Positives = 148/352 (42%), Gaps = 30/352 (8%)
Query: 1575 ELNALLQNIEAEINK---VANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKV 1631
L LL+ +E ++ K A A+ + ++ ++ +L+ L ++ L+ +++
Sbjct: 190 RLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLA-KLKELRKELEELEEEL 248
Query: 1632 NEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAM--YRDLANLRYDFEKCVSSLSDVK 1689
+ ++E E+ +L+ IE +L E +L L+ + E+ +S ++
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308
Query: 1690 QGLESRLVQWNEYETSLGKLITWLNDTEN-VLKNFTLQPTLNE-----KKQQRELKGKLS 1743
+ LE + E E L +L + + + + TL L + ++ + EL+ KLS
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368
Query: 1744 SWCGYEESVATLLDWLKVTEKKLGDEM-ELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQ 1802
E + L + L+ +L E+ E++ L+E K +++ + + L
Sbjct: 369 ---ALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE---ERLERLSERLED 422
Query: 1803 LKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQ 1862
LK++++ L E+ + +L + L+ ++++E +D L +LE+ A LQ
Sbjct: 423 LKEELKELEAELEELQTELEELNEE----LEELEEQLEELRD-----RLKELERELAELQ 473
Query: 1863 EVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQ 1914
E + L+ L L A + + + + V G V++
Sbjct: 474 EELQRLEKE--LSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAE 523
Score = 42.0 bits (99), Expect = 0.004
Identities = 47/333 (14%), Positives = 134/333 (40%), Gaps = 24/333 (7%)
Query: 916 IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKT 975
++D ++ L ++ + ++++ + L L ++ + +E +E +L+ELE
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELE--------E 751
Query: 976 NNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQ 1035
+ + L +L L + + E + + + + ++R
Sbjct: 752 ELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA 811
Query: 1036 LSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQK 1095
L R +E Q ++ L+ ++EE + L + E +L + + E++ E L +
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE-LEKELEELKEELEELEA 870
Query: 1096 LLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQI- 1154
+E+ ++L L + + L+E + +E+ + + LRE+ L ++
Sbjct: 871 EKEELE--------DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLE 922
Query: 1155 VGFITETTARVGEVDGNSATVQEKIDKYDKII-ADIQEYEAILATASDKGDQLSSDGTIS 1213
+ + T++ ++++ + + +I+ + A ++ +++
Sbjct: 923 RLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVE-----E 977
Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKR 1246
E+ Q + ++ L + +E+L ++ +R
Sbjct: 978 RYEELKSQREDLEEAKEKLLEVIEELDKEKRER 1010
Score = 40.5 bits (95), Expect = 0.011
Identities = 59/317 (18%), Positives = 123/317 (38%), Gaps = 32/317 (10%)
Query: 1229 LHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLR 1288
L L K +EKL +Q AE + EL+ + L L + E+L
Sbjct: 195 LEELEKQLEKLERQ--AEKAERYQ---ELKAELRELELALLLAKLKELRKELEELEEELS 249
Query: 1289 QQNEKRAAESEKLGA---ELEEIIEALHARETAVKTLPVLLLEVTS--VDVELDKQRLEL 1343
+ E+ E+L E+EE+ L ++ L LLE+ ++E + L
Sbjct: 250 RLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRE 309
Query: 1344 ATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADC 1403
+E++ ++ E + + E I+ + + E E + + A+L
Sbjct: 310 RLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ-----LLAELEEAKEE-- 362
Query: 1404 HGEQQTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLD 1463
++ ++++++ ++L E ++ +A N L E + +EI +
Sbjct: 363 --LEEKLSALLEELEELFEALREELAELEAELAEIRNEL--------EELKREIESLEER 412
Query: 1464 WTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEK 1523
L L+D+ + + + + + L + +E ++++E +D L +LE+
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRD-----RLKELER 467
Query: 1524 YKALLQEGLHKEADNKK 1540
A LQE L +
Sbjct: 468 ELAELQEELQRLEKELS 484
Score = 35.8 bits (83), Expect = 0.36
Identities = 50/237 (21%), Positives = 103/237 (43%), Gaps = 42/237 (17%)
Query: 1924 EFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDS 1983
++ +KEE E L + ++ ++L ++++ + +E Q + ++E A++
Sbjct: 169 KYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELR----- 223
Query: 1984 VEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAK 2043
+ E+A +ELR +L +L+E L + E+ + + + +++ E+ ++E
Sbjct: 224 -----ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEK------EIEEL 272
Query: 2044 YAKLIEPETSYEELQKRQGQLTALLTEKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQ 2103
++L E EELQ+ +L + E LL E E L +
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEISLLR----------ERLEE-----LENE 317
Query: 2104 MEKLQQAVEALYDKITVTESLILSQVEAKYAKLIEPETSYEELQKRQGQLTALLAEK 2160
+E+L++ +E L +KI + +L E ET EEL++ +L E
Sbjct: 318 LEELEERLEELKEKIEALK-----------EELEERETLLEELEQLLAELEEAKEEL 363
Score = 35.5 bits (82), Expect = 0.38
Identities = 73/374 (19%), Positives = 156/374 (41%), Gaps = 47/374 (12%)
Query: 182 KRYEDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQL-WDKLSSLTER--ELKGKL 238
+R EDL++ + K + +LE + +++Q+L+ +E + L KL L + EL+ +L
Sbjct: 189 ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEEL 248
Query: 239 SSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQ 298
S EE + L + L+ EK++ ELK+ L+E + +L+ Q++LL+
Sbjct: 249 SR---LEEELEELQEELEEAEKEIE---ELKSELEELREELEEL----------QEELLE 292
Query: 299 LKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKFTEAYEGIVSIHQAYTKAVLDTQEW 358
LK++IE L + +++ + R + ++ E E + +A + + + +
Sbjct: 293 LKEEIEEL-------EGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345
Query: 359 IDATYNAVNMWGDLTLERVSLHSNL-----ERLKNLEKELGRIGHKTVALKNNVLESTGP 413
++ + + E S L E + L +EL + + ++N
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE------- 398
Query: 414 IGKENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSSKNTLEKWIEGFQKKIE 473
E + +EI + L L+ + + + + + + L + +E ++++E
Sbjct: 399 --LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLE 456
Query: 474 AEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQAL 533
+D L +LE+ A LQE + L+ L L A + +
Sbjct: 457 ELRD-----RLKELERELAELQEELQRLEKE--LSSLEARLDRLEAEQRASQGVRAVLEA 509
Query: 534 YAALVTSVQGLVSQ 547
+ + V G V++
Sbjct: 510 LESGLPGVYGPVAE 523
Score = 35.5 bits (82), Expect = 0.41
Identities = 39/274 (14%), Positives = 110/274 (40%), Gaps = 16/274 (5%)
Query: 143 TLQEVLSHKRIIESLLDKSKSLPQINKDQALEKAIVSVNKRYEDLVDGILKTISQLEESL 202
L+ L + L + ++ + + + + ++ ++ K I +L+ L
Sbjct: 217 ELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSEL 276
Query: 203 ----DIFQQFQQLQKAYQEDQKQLWDKLSSLTERELKGKLSSWCGYEESVATLLDWLKVT 258
+ ++ Q+ +E+ ++L ++S L ER L+ + EE + L + ++
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRER-LEELENELEELEERLEELKEKIEAL 335
Query: 259 EKKLGDEMELKTTLDEKKAQL-----QVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQS 313
+++L + L L++ A+L ++ + ++ L++++ L +
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI 395
Query: 314 KQQLATITTRHGNVLKRAQKFTEAYEGIVSIHQAYTKAVLDTQEWIDATYNAVNMWGDLT 373
+ +L + ++ +R ++ +E E + + + + Q ++ + +
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEE-- 453
Query: 374 LERVSLHSNLERLKNLEKELGRIGHKTVALKNNV 407
L +RLK LE+EL + + L+ +
Sbjct: 454 ----QLEELRDRLKELERELAELQEELQRLEKEL 483
Score = 35.1 bits (81), Expect = 0.56
Identities = 69/358 (19%), Positives = 149/358 (41%), Gaps = 17/358 (4%)
Query: 1773 KTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVL 1832
K L E + +L + + LK+++ SL E+ ++QL + + +
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELK---SLKNELRSLEDLLEELRRQLEELERQLEELK 722
Query: 1833 KRAQKKIEAEKDI-GDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIR 1891
+ E + + L+ E+ + L +E+ +E L ++ E+L E A ++
Sbjct: 723 RELAALEEELEQLQSRLEELE--EELEELEEELEELQERLEELEEELESLEEALAK--LK 778
Query: 1892 EETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEA 1951
EE L+ AL ++ L ++E+ + E +E + ++E +E
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIE 838
Query: 1952 GIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKE 2011
++E+ +E + + E L + L++ ++ ++ E+ +EL +L +L+
Sbjct: 839 ELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES 898
Query: 2012 ILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEE--LQKRQGQLTALLT 2069
L + ++ + E ++ EE KL ++E + +L E E L+ + L
Sbjct: 899 ELAELKE---EIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLE 955
Query: 2070 EKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILS 2127
E+++ +N + YE E +E L+ Q E L++A E L + I +
Sbjct: 956 EEIEALGPVNLRAI---EEYEEVEER-YEELKSQREDLEEAKEKLLEVIEELDKEKRE 1009
Score = 34.3 bits (79), Expect = 0.96
Identities = 65/331 (19%), Positives = 139/331 (41%), Gaps = 42/331 (12%)
Query: 1855 EKYKALLQEVISHNSDMETLNDKCEALMEL------------AAHSPIREETLRLQALYA 1902
+ + L +E+ S +++ +L D E L + + EE +LQ+
Sbjct: 681 AQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLE 740
Query: 1903 ALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIET 1962
L ++ L ++E+ E + E +E L K ++E +E ++EE + +E
Sbjct: 741 ELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEE 800
Query: 1963 QYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVGD 2022
+ + + L + L+ + ++QEI +EL KL +L+E L++ EK
Sbjct: 801 ELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK---- 856
Query: 2023 HEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEKVDMDVLLNACV 2082
+ EE ++L ++EA+ +L E +EL++ + +L L E L + +
Sbjct: 857 ------ELEELKEELEELEAEKEEL---EDELKELEEEKEELEEELRE-------LESEL 900
Query: 2083 DLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILSQVEAKYAKLIEPETS 2142
+ E E E L ++E+L+ + L +++ L + + +E E
Sbjct: 901 AELKEEIEKLRERLEE-LEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIE 959
Query: 2143 Y---------EELQKRQGQLTALLAEKVDMD 2164
EE ++ + + L +++ D++
Sbjct: 960 ALGPVNLRAIEEYEEVEERYEELKSQREDLE 990
Score = 32.0 bits (73), Expect = 4.7
Identities = 63/352 (17%), Positives = 149/352 (42%), Gaps = 21/352 (5%)
Query: 175 KAIVSVNKRYEDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTEREL 234
++ ++ + ++L + + + +QLE+ + + + ++ ++ ++L +L L ER+L
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEEL-ERQL 718
Query: 235 KGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQ 294
+ EE + L L+ E++L + E L E+ +L+ ++ S ++
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE------EELESLEE 772
Query: 295 DLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKFTEAYEGIVSIHQAYTKAVLD 354
L +LK++IE L + + +++L + +R E + + + + +
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEE 832
Query: 355 TQEWIDATYNAVNMWGDLTLERVSLHSNLERLKN----LEKELGRIGHKTVALKNNVLES 410
+E I+ ++ +L E L LE LK LE E + + L+ E
Sbjct: 833 LEEEIEELEEKLD---ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889
Query: 411 TGPIGKENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSS-----KNTLEKWI 465
+ + E+ ++ + L+ L+ ++ + + ++TLE +
Sbjct: 890 EEEL--RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETEL 947
Query: 466 EGFQKKIEAEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMEL 517
E +++E E + NL +E+Y+ + + S E L + E L+E+
Sbjct: 948 EREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEV 999
Score = 31.6 bits (72), Expect = 5.8
Identities = 46/271 (16%), Positives = 111/271 (40%), Gaps = 45/271 (16%)
Query: 1893 ETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAG 1952
E L+ A L ++ L +++ ++ EE+ L + ++E A
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727
Query: 1953 IKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEI 2012
++EE + ++++ + +++E+ +EL+ +L +L+E
Sbjct: 728 LEEELEQLQSRL---------------------EELEEELEELEEELEELQERLEELEEE 766
Query: 2013 LKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEKV 2072
L+ E+ + + +++ EE Q L + + E EE ++R L +
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELEEL------EEELEEAERRLDAL------ER 814
Query: 2073 DMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLI---LSQV 2129
+++ L L ++ E L ++E+L++ ++ L +++ E + ++
Sbjct: 815 ELESLEQRRERLEQEIEE---------LEEEIEELEEKLDELEEELEELEKELEELKEEL 865
Query: 2130 EAKYAKLIEPETSYEELQKRQGQLTALLAEK 2160
E A+ E E +EL++ + +L L E
Sbjct: 866 EELEAEKEELEDELKELEEEKEELEEELREL 896
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 55.2 bits (133), Expect = 4e-07
Identities = 81/512 (15%), Positives = 202/512 (39%), Gaps = 62/512 (12%)
Query: 1186 IADIQEYEAILATASDKGDQLSS-----DGTISDRNEITEQLQSRKQQLHNLRKTVEKLR 1240
+ +++YE + + + + +G +S+ E E L ++ K +E+++
Sbjct: 163 LFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQ 222
Query: 1241 QQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEK 1300
++ E+ E E E E + E +E +RL + + + E+ E E+
Sbjct: 223 EEQEEEELEQEIEALE-ERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELER 281
Query: 1301 LGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDK--QRLELATKEVQPYVQRYETL 1358
L ELEE IE L E ++ L L + ++ EL++ ++L+ + ++ ++ E L
Sbjct: 282 LLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKL 341
Query: 1359 KKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKADKIK 1418
+ + E ++ A E + E + L K A + K + I+
Sbjct: 342 ESELEELAEEKNELAKLLEERLKELEERLEELEKELEKAL---------ERLKQLEEAIQ 392
Query: 1419 QITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQ---------- 1468
++ + L E +++ + + E + +E+ ++ + L+
Sbjct: 393 ELKEELAELSAALEEIQEELEELEKE-----LEELERELEELEEEIKKLEEQINQLESKE 447
Query: 1469 NTLQDIDKHHAKCLSLWNDFL-----SSKNTLEKWIEGFQKKIESEKDIGD-CTNLDDLE 1522
+ ++ KC + E +E ++++ EK+ + +++LE
Sbjct: 448 LMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELE 507
Query: 1523 KYKALLQEGLHKEADNKKVLIDSFNKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQN 1582
K L+E L + + ++ L + E+++ L L L+
Sbjct: 508 KELRELEEELIELLELEEALKEEL-----------EEKLEKLEN---------LLEELEE 547
Query: 1583 IEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKVNEIQESFPEIS 1642
++ ++ ++ +E + ++D++ +L+ L++ K+ L+ +++ E+++ E+
Sbjct: 548 LKEKL-QLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE 606
Query: 1643 ---TKLQTLSDHIENSSRKLPSRTNEAMYRDL 1671
++L+ L +E S + E
Sbjct: 607 ERLSQLEELLQSLELSEAENELEEAEEELESE 638
Score = 45.9 bits (109), Expect = 3e-04
Identities = 111/592 (18%), Positives = 240/592 (40%), Gaps = 45/592 (7%)
Query: 1507 ESEKDIGDCTNLDDLEKYKALLQEGLHKEADNKKVL---IDSFNKGIIGMTELESEEVKD 1563
E ++ + + L+ EK LL+E + + + L + + I + E EE+K+
Sbjct: 155 ERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKE 214
Query: 1564 LIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGNKQA 1623
L ++ + E L Q IEA ++A +E++ +++ ++L + +A
Sbjct: 215 LKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLEL------EA 268
Query: 1624 LQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTN--------EAMYRDLANLR 1675
L+ R +++ E++ E+ K++ L + E + L +L
Sbjct: 269 LKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLE 328
Query: 1676 YDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLKNFT---LQPTLNEK 1732
EK L ++ LE + NE L + + L + L+ L+ +
Sbjct: 329 ERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLE 388
Query: 1733 KQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQD 1792
+ +ELK +L E A L + + E+ + EL+ L+E + +++
Sbjct: 389 EAIQELKEEL------AELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQ 442
Query: 1793 ATSHQQDLLQLKDKIE-----SLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGD 1847
S + + +L E P E K+ L L+ ++++ EK+ +
Sbjct: 443 LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELE----LEELEEELSREKEEAE 498
Query: 1848 -CTNLDDLEK-YKALLQEVISHNSDMETLND----KCEALMELAAHSPIREETLRLQALY 1901
+++LEK + L +E+I E L + K E L L +E L+LQ L
Sbjct: 499 LREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLK 558
Query: 1902 AALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIE 1961
+ ++ + ++++ L + +KEE+E + ++ ELE + + ++ ++
Sbjct: 559 E-ELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQ 617
Query: 1962 TQYL--QWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKG 2019
+ L N E ++ + L+ + + E +EL K+ +L+ +++ +
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELL-QAALEELEEKVEELEAEIRRELQR 676
Query: 2020 VGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEK 2071
+ + E +K EE Q ++E +L E E+++ +L + E
Sbjct: 677 IENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAEL 728
Score = 44.4 bits (105), Expect = 8e-04
Identities = 65/511 (12%), Positives = 188/511 (36%), Gaps = 48/511 (9%)
Query: 835 EQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQ 894
E++ E+ + + + + E L+ + ++ LE+++ + L+
Sbjct: 239 ERLAELEEEKERLEELKARLLEIESLELEALKIREEE----LRELERLLEELEEKIERLE 294
Query: 895 DVILSPDNKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVD 954
++ + E + ++++ L ++ K+ +R++ + L +L +E +
Sbjct: 295 ELEREIEELEE---ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351
Query: 955 VENVEAKLKELEVAVE-------------GDVKTNNVLMLQELLNKYAQLNEEAGELNMM 1001
+ L+E +E + +QEL + A+L+ E+
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411
Query: 1002 ISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQ--------- 1052
+ + + + + +++ +L I S +
Sbjct: 412 LEELE----------KELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP 461
Query: 1053 --SVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDE 1110
+ + +E L + +L +S + ++ +E ++L E+ + +
Sbjct: 462 VCGQELPEEHEKELLELYELELEELEEELS-REKEEAELREEIEELEKELRELEEELIEL 520
Query: 1111 DLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQI-VGFITETTARVGEVD 1169
+ L E +E L+ L L+E+ ++ + + + + E+
Sbjct: 521 LE-----LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELL 575
Query: 1170 GNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQL 1229
++ + ++ +++ ++E + L ++ QL + +E +L+ +++L
Sbjct: 576 EELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEEL 635
Query: 1230 HNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQ 1289
+ + + + E A E+L ++EE+ + + + +L + +E+L +
Sbjct: 636 ESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEE 695
Query: 1290 QNEKRAAESEKLGAELEEIIEALHARETAVK 1320
+ E+ E E+L +L EI + + E+
Sbjct: 696 ELEQLREELEELLKKLGEIEQLIEELESRKA 726
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 49.7 bits (119), Expect = 2e-05
Identities = 70/366 (19%), Positives = 151/366 (41%), Gaps = 49/366 (13%)
Query: 1022 VNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVST 1081
+ + + +++ + L + + ++ Q + K+ E ++K QL +
Sbjct: 679 LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE----IEKEIEQLEQEEEK 734
Query: 1082 QAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLL 1141
E + E L EI + K +L L I+ L+E + ++E A+ L++ L
Sbjct: 735 LKERLEELEEDLSSLEQEIENVK-----SELKELEARIEELEEDLHKLEE-ALNDLEARL 788
Query: 1142 LLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASD 1201
+R+ E+ + ++E++ + + + +I E L +
Sbjct: 789 S-------------------HSRIPEIQAELSKLEEEVSRIEARLREI---EQKLNRLTL 826
Query: 1202 KGDQLSSD--GTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEE 1259
+ + L + R ++ EQ++S ++++ NL E+L ++ E+ A L + L +
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886
Query: 1260 I---IEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
+ + L A+ + ++ L +EK R++ + A+ E L EL EI + E
Sbjct: 887 LKKERDELEAQLREL--ERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944
Query: 1317 TAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPY----VQRYETLKKNIAEAIDKFETQ 1372
+ L DV+ + QR+E + ++P +Q YE + K + E +K
Sbjct: 945 EIPEEELSL------EDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKL 998
Query: 1373 ANEHEA 1378
E +A
Sbjct: 999 EEERKA 1004
Score = 47.4 bits (113), Expect = 1e-04
Identities = 50/300 (16%), Positives = 113/300 (37%), Gaps = 18/300 (6%)
Query: 1032 RYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALT 1091
RY L + + L+ + L+ + E + L E +L + ++ + ++ + L
Sbjct: 212 RYQALLKEKREYEGYE--LLKEKEALERQKEAIERQLASLEEELEK-LTEEISELEKRLE 268
Query: 1092 VYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLI 1151
++LL+E+N +E+ + + + I L+ + +E K + L E+ L
Sbjct: 269 EIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLE 328
Query: 1152 MQI-------VGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGD 1204
+I E D + E ++ + + A+++E + A D+
Sbjct: 329 AEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELK 388
Query: 1205 QLSS--DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIE 1262
+ + NE+ +L +++L L + + L A+ +L E E+
Sbjct: 389 DYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448
Query: 1263 ALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
+ +E +L L + K Q+ E +++ EL ++ L E +
Sbjct: 449 EIKKQEW------KLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARAS 502
Score = 45.4 bits (108), Expect = 4e-04
Identities = 59/377 (15%), Positives = 150/377 (39%), Gaps = 62/377 (16%)
Query: 916 IKDKIDRLEKETKAQSDRIQDALKTLNQ-----LIGHRKEFEVDVENVEAKLKELEVAVE 970
+ +++RL +E + +++R Q LK + L+ ++ E E +E +L LE +E
Sbjct: 196 KRQQLERLRRE-REKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254
Query: 971 GDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQ 1030
L E ++ +L + E+ ++ + + +L + ++L V + + ++
Sbjct: 255 K---------LTEEIS---ELEKRLEEIEQLLE--ELNKKIKDLGEEEQLRVKEKIGELE 300
Query: 1031 KRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEAL 1090
A L R+I + K + ++ + ++ + +A + K +++ E++ +
Sbjct: 301 AEIASLERSIAE-------KERELEDAEERLAKLEAEIDKLLAEI--------EELEREI 345
Query: 1091 TVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTL 1150
+K D+ L + L E +E + K R++
Sbjct: 346 EEERKRRDK---------------LTEEYAELKEELEDLRAELEEVDKEFAETRDEL--- 387
Query: 1151 IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDG 1210
+ + + E+ +QE++ + + +AD+ A +A K ++L
Sbjct: 388 -KDYREKLEKLKREINELKRELDRLQEELQRLSEELADL---NAAIAGIEAKINEL---- 439
Query: 1211 TISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1270
++ + +++ ++ +L L + K Q+ E +++ EL ++ L E
Sbjct: 440 -EEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498
Query: 1271 VPSSSRLHNLRKTVEKL 1287
+S + VE++
Sbjct: 499 ARASEERVRGGRAVEEV 515
Score = 43.5 bits (103), Expect = 0.001
Identities = 64/329 (19%), Positives = 132/329 (40%), Gaps = 39/329 (11%)
Query: 1557 ESEEVKDLIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYID 1616
E + +++ ++ GL+ EL++L + N++ +QE KI +++ +
Sbjct: 675 ELQRLRERLE----GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEI----- 725
Query: 1617 YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNE--AMYRDLANL 1674
+ L+ +K+ E E E+ L +L IEN +L + DL L
Sbjct: 726 -----EQLEQEEEKLKERLE---ELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKL 777
Query: 1675 RYDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLKNFTLQPTLNEKK- 1733
+ LS + + ++ E + ++ L + E L TL+ EK+
Sbjct: 778 EEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEI 835
Query: 1734 -----QQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRT 1788
Q+ +LK ++ S E++ + L+ ++L + L+ + L+
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR---DLESRLGDLK---- 888
Query: 1789 IYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEK--DIG 1846
++ + L +L+ KIE L E+ +++L+ + + + + +IE K D
Sbjct: 889 --KERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE-LSEIEDPKGEDEE 945
Query: 1847 DCTNLDDLEKYKALLQEVISHNSDMETLN 1875
LE +A LQ V +E +N
Sbjct: 946 IPEEELSLEDVQAELQRVEEEIRALEPVN 974
Score = 42.4 bits (100), Expect = 0.004
Identities = 52/256 (20%), Positives = 101/256 (39%), Gaps = 35/256 (13%)
Query: 1175 VQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTIS------DRNEITEQLQSRKQQ 1228
++E+++ + ++ +Q + D+ Q SD + + ++ ++ + K++
Sbjct: 679 LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKER 738
Query: 1229 LHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAV------PSSSRLHNLRK 1282
L L + + L Q+ E +E ++L A +EE+ E LH E A+ S SR+ ++
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQA 798
Query: 1283 TVEKL---RQQNEKRAAE--------------SEKLGAELEEIIEALHARETAVKTLPVL 1325
+ KL + E R E EK EL+E L + +++
Sbjct: 799 ELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE--- 855
Query: 1326 LLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNE 1385
+E + E ++ LE ++ R LKK E + + E + +
Sbjct: 856 -IENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914
Query: 1386 AYDWL--RKAKLGAQA 1399
L KAKL A
Sbjct: 915 KRKRLSELKAKLEALE 930
Score = 41.2 bits (97), Expect = 0.007
Identities = 50/282 (17%), Positives = 102/282 (36%), Gaps = 48/282 (17%)
Query: 1177 EKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTV 1236
EK ++Y ++ + +EYE + + + + E+L+ +++ L K +
Sbjct: 208 EKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRL 267
Query: 1237 EKLRQQNEKRAAESEKLGAE------------------LEEIIEALH-----ARETAVPS 1273
E++ Q E+ + + LG E LE I A E
Sbjct: 268 EEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKL 327
Query: 1274 SSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVD 1333
+ + L +E+L ++ E+ +KL E E + L L E+ VD
Sbjct: 328 EAEIDKLLAEIEELEREIEEERKRRDKLTEEYAE----------LKEELEDLRAELEEVD 377
Query: 1334 VELDKQRLELAT--KEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLR 1391
E + R EL ++++ + LK+ + ++ + + E A +
Sbjct: 378 KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEA--K 435
Query: 1392 KAKLGAQANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDK 1433
+L ++ +DKA +IK+ L + + K
Sbjct: 436 INEL-----------EEEKEDKALEIKKQEWKLEQLAADLSK 466
Score = 39.3 bits (92), Expect = 0.033
Identities = 62/336 (18%), Positives = 135/336 (40%), Gaps = 47/336 (13%)
Query: 1733 KQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQD 1792
++R++ +++ ++ L+ L+ E+ + L +DEK+ QL+ R +
Sbjct: 153 VERRKIIDEIAGVAEFDRKKEKALEELEEVEENIE---RLDLIIDEKRQQLERLRREREK 209
Query: 1793 ATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLD 1852
A +Q LL+ K + E + L+R ++ IE + L
Sbjct: 210 AERYQ-ALLKEKREYE-------------GYELLKEKEALERQKEAIERQ--------LA 247
Query: 1853 DLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLV 1912
LE+ L E IS + L + + L EL I++ Q + ++ +
Sbjct: 248 SLEEELEKLTEEISELE--KRLEEIEQLLEELNKK--IKDLGEEEQLRVKEKIGELEAEI 303
Query: 1913 SQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSFRE 1972
+ +E++++ K+ E+E A + + + + E + +E + + R+
Sbjct: 304 ASLERSIA------EKERELE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRD 353
Query: 1973 TLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVGDHEAFVDKYEE 2032
L + A L ++++ E+ E R +L +E L+K ++ + + + +D+ +E
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQE 413
Query: 2033 CSQKL--------AQVEAKYAKLIEPETSYEELQKR 2060
Q+L A + AK+ E E E+
Sbjct: 414 ELQRLSEELADLNAAIAGIEAKINELEEEKEDKALE 449
Score = 35.8 bits (83), Expect = 0.31
Identities = 49/307 (15%), Positives = 119/307 (38%), Gaps = 38/307 (12%)
Query: 718 LQWLDNIRSDMKTLLERSKHIVDDIEKKKDDLK----PIQKEAQQLSEWDSSIVPKVSDL 773
L+ L S +++ L R ++ +D++ ++ D I+KE +QL + + + ++ +L
Sbjct: 683 LEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEL 742
Query: 774 ETRWAAVK---SAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNR 830
E ++++ ++ +L + I + + +++A N L
Sbjct: 743 EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL-------------NDLEARLS 789
Query: 831 QQTLEQIQHH-EKLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHH 889
+ +IQ KL ++ S + +++ E+ L +++ + I Q+ ++
Sbjct: 790 HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKE-IQELQEQRIDLKEQ 848
Query: 890 EKLLQDVILSPDNKLAPCFS------------AEEGSPIKDKIDRLEKETKAQSDRIQDA 937
K ++ I + + K +K + D LE + + +I++
Sbjct: 849 IKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908
Query: 938 LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEA-- 995
+ + E + +E +E +L E+E D + + E + Q EE
Sbjct: 909 EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIR 968
Query: 996 --GELNM 1000
+NM
Sbjct: 969 ALEPVNM 975
Score = 35.0 bits (81), Expect = 0.60
Identities = 63/294 (21%), Positives = 130/294 (44%), Gaps = 42/294 (14%)
Query: 1791 QDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTN 1850
Q+ ++ L +L++ + SL Q E K +L + R ++ ++ + ++
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR----IEELEEDLHKLEE-----A 780
Query: 1851 LDDLEKYKA--LLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSV 1908
L+DLE + + E+ + S +E + EA + I ++ RL L +
Sbjct: 781 LNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLR-----EIEQKLNRLTLEKEYLEKEI 835
Query: 1909 QGLVSQV------EKNLSDQTEFL-AKKEEVEAWLHKAHATVQECDELEAGIKEE----- 1956
Q L Q K++ + E L KKEE+E L + A +++ + +K+E
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895
Query: 1957 ------QKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLK 2010
++ IE Q R+ L+++ A L+++E+ + EI P +E+ + L L+
Sbjct: 896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELS-EIEDPKGEDEEIPEEELSLE 954
Query: 2011 EILKKCEKGVGDHEAF-------VDKYEECSQKLAQVEAKYAKLIEPETSYEEL 2057
++ + ++ + A + +YEE ++L +++ K AKL E + E
Sbjct: 955 DVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILER 1008
Score = 32.7 bits (75), Expect = 2.7
Identities = 58/337 (17%), Positives = 116/337 (34%), Gaps = 55/337 (16%)
Query: 1729 LNEKKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRT 1788
L ++K++ E L E + L E++L L + E + +L+
Sbjct: 216 LLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEK---LTEEISELEKRLEEIEQ 272
Query: 1789 IYQDATSHQQDL-----LQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEK 1843
+ ++ +DL L++K+KI L ++ +A +R + E
Sbjct: 273 LLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIA--------EKERELEDAEERL 324
Query: 1844 DIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETL--RLQALY 1901
+ + LL E+ ++E + + L E A E L L+ +
Sbjct: 325 AKLE-------AEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVD 377
Query: 1902 AALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIE 1961
+ L + L + E++ L + +Q E A + IE
Sbjct: 378 KEFAETRDEL-KDYREKLEKLKR---EINELKRELDRLQEELQRLSEELADLNAAIAGIE 433
Query: 1962 TQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVG 2021
+ + L+ +++ EI + KL +L L K E+ +
Sbjct: 434 AKINE--------------LEEEKEDKALEIK-------KQEWKLEQLAADLSKYEQELY 472
Query: 2022 DHEAFVDKYE----ECSQKLAQVEAKYAKLIEPETSY 2054
D + D+ E + ++LA+ EA+ A+ E
Sbjct: 473 DLKEEYDRVEKELSKLQRELAEAEAQ-ARASEERVRG 508
Score = 32.7 bits (75), Expect = 2.9
Identities = 56/299 (18%), Positives = 118/299 (39%), Gaps = 32/299 (10%)
Query: 912 EGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEG 971
EG + + + LE++ +A ++ + L +L E E +E +E L+EL ++
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIK- 282
Query: 972 DVKTNNVLMLQELLNKYAQLNEEAGELNMMISNV--SIATEHMNLNDADRLTVNDSV--S 1027
+ ++ E+ GEL I+++ SIA + L DA+
Sbjct: 283 ----------DLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEID 332
Query: 1028 SMQKRYAQLSRTIEDRISTVNNKLQSVKR--LQHKVEEAKALLQKAESQLARPVSTQAED 1085
+ +L R IE+ + +R L + E K L+ ++L V + +
Sbjct: 333 KLLAEIEELEREIEE---------ERKRRDKLTEEYAELKEELEDLRAELEE-VDKEFAE 382
Query: 1086 ISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLRE 1145
+ L Y++ L+++ N E L L + ++ L E + + + L E
Sbjct: 383 TRDELKDYREKLEKLKREINELKRE-LDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441
Query: 1146 QFTTL---IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQ-EYEAILATAS 1200
+ I + + + A + + + ++E+ D+ +K ++ +Q E A A
Sbjct: 442 EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQAR 500
Score = 32.0 bits (73), Expect = 5.0
Identities = 42/219 (19%), Positives = 88/219 (40%), Gaps = 19/219 (8%)
Query: 1495 LEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGIIGMT 1554
LE+ + +++IE+ K L +LE L+E LHK + L + I
Sbjct: 742 LEEDLSSLEQEIENVKS-----ELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI 796
Query: 1555 ELESEEVKDLIQVGVHGLQVELNALLQNIEAEINKV-ANAAQERKAMQDKISKLQSWLKQ 1613
+ E ++++ + + A L+ IE ++N++ K +Q+ + +Q
Sbjct: 797 QAELSKLEEEVS--------RIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQ 848
Query: 1614 YIDYSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNE--AMYRDL 1671
+ L + +++ E E E+ L+ L + + ++ + + R +
Sbjct: 849 IKSIEKEIENLNGKKEELEEELE---ELEAALRDLESRLGDLKKERDELEAQLRELERKI 905
Query: 1672 ANLRYDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLI 1710
L EK LS++K LE+ + +E E G+
Sbjct: 906 EELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 46.2 bits (110), Expect = 2e-04
Identities = 96/503 (19%), Positives = 204/503 (40%), Gaps = 74/503 (14%)
Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA---------- 1266
E E+L+ + L ++ +E+ ++ E+R E +K ELEE ++ L
Sbjct: 238 EEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIK 297
Query: 1267 -RETAVPSSSRLHNLRKTVEKLRQQN---EKRAAESEKLGAELEEI---IEALHARETAV 1319
E L + K + +L ++ E+R E E+ LEE+ ++ L R +
Sbjct: 298 LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357
Query: 1320 KTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETL---KKNIAEAIDKFETQANEH 1376
+ L E + EL++ + L + + E L K+ I E I K + E
Sbjct: 358 EERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL 417
Query: 1377 EAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDKTVA 1436
+ + +A + L+KAK G + T + + + +++ T L
Sbjct: 418 KKEIKELKKAIEELKKAK----GKCPVCGRELTEEHRKELLEEYTAEL------------ 461
Query: 1437 LKNNVLESTGPIGKENINQEINQITLDWTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLE 1496
+ I +E+ +I L+ L++++ K L ++ + K E
Sbjct: 462 --------------KRIEKELKEIEEKERKLRKELRELE----KVLKKESELIKLKELAE 503
Query: 1497 KWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGIIGMTEL 1556
+ ++ ++K++ NL++LEK KA E L ++ K I S K + + EL
Sbjct: 504 Q-LKELEEKLKK-------YNLEELEK-KAEEYEKLKEKLIKLKGEIKSLKKELEKLEEL 554
Query: 1557 ESEEVKDLIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYID 1616
+ + ++ + L+ EL LL+ +E + +ER + +L+ + +Y++
Sbjct: 555 KKK--LAELEKKLDELEEELAELLKELEELGFESVEELEER------LKELEPFYNEYLE 606
Query: 1617 YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRK---LPSRTNEAMYRDLAN 1673
++ L+ ++ +++E + +L +E ++ L + +E Y +L
Sbjct: 607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELRE 666
Query: 1674 LRYDFEKCVSSLSDVKQGLESRL 1696
+ + ++ L + LE R
Sbjct: 667 EYLELSRELAGLRAELEELEKRR 689
Score = 40.0 bits (94), Expect = 0.017
Identities = 99/494 (20%), Positives = 212/494 (42%), Gaps = 39/494 (7%)
Query: 919 KIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNV 978
+I L+ A + + +K + + I ++F EN+E +KE E +E ++
Sbjct: 153 QILGLDDYENAY-KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLRE--- 208
Query: 979 LMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSR 1038
+ E+ ++ +L EE +L + + E + + + ++ S ++++ +L
Sbjct: 209 --INEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEE 266
Query: 1039 TIED---RISTVNNKLQSVKRLQHKVEE---AKALLQKAESQLARPVSTQAEDISEALTV 1092
IE+ I + K++ +K L+ K EE ++ +L + + + E +
Sbjct: 267 RIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELRE-IEKRLSRLEEEING 325
Query: 1093 YQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE--NHAVPKLKSLLLLREQFTTL 1150
++ + E+ K + +E L + K L+E+ ER E A K + L L+++ T L
Sbjct: 326 IEERIKELEE-KEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL 384
Query: 1151 -IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDK----GDQ 1205
++ + E E++ + + +I + K I ++++ L A K G +
Sbjct: 385 TPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRE 444
Query: 1206 LSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQ---------NEKRAAESEKLGAE 1256
L+ + E T +L+ +++L + + KLR++ E + ++L +
Sbjct: 445 LTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQ 504
Query: 1257 LEEIIEAL--HARETAVPSSSRLHNLRKTVEKLRQQN---EKRAAESEKLGAELEEIIEA 1311
L+E+ E L + E + L++ + KL+ + +K + E+L +L E+ +
Sbjct: 505 LKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKK 564
Query: 1312 LHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFET 1371
L E + L L E+ VE ++RL KE++P+ Y LK E + +
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERL----KELEPFYNEYLELKDAEKELEREEKE 620
Query: 1372 QANEHEAYKQAYNE 1385
E +A+ E
Sbjct: 621 LKKLEEELDKAFEE 634
Score = 35.8 bits (83), Expect = 0.29
Identities = 92/499 (18%), Positives = 205/499 (41%), Gaps = 73/499 (14%)
Query: 1582 NIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDY----SGNKQALQSRLDKVNEIQES 1637
++ N N + K ++ +I +L+ ++K+ + ++ L+ L ++NEI
Sbjct: 156 GLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSE 215
Query: 1638 FPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQGLESRLV 1697
PE+ +L+ L ++ +L E + E L + + LE R+
Sbjct: 216 LPELREELEKLEKEVK-ELEELKEEIEELEKELES-----LEGSKRKLEEKIRELEERIE 269
Query: 1698 QWNEYETSLGKLITWLNDTENVLKNF-TLQPTLNE-KKQQRELKGKLSSWCGYEESVATL 1755
+ + L + + L + + + + L E + RE++ +LS +
Sbjct: 270 ELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEER 329
Query: 1756 LDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSE 1815
+ L+ E++L + + L+++ +L+ +Y++A + +++L +LK ++ L E
Sbjct: 330 IKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL--TPE 387
Query: 1816 QSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLN 1875
+ +++L L++A+++IE E+ + + +++ L
Sbjct: 388 KLEKELEE--------LEKAKEEIE--------------EEISKITARIGELKKEIKELK 425
Query: 1876 DKCEALMELAAHSPI--REETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVE 1933
E L + P+ RE L + K L EE
Sbjct: 426 KAIEELKKAKGKCPVCGRE------------------LTEEHRKEL---------LEEYT 458
Query: 1934 AWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSF---RETLAQMTAWLDSVEKNIKQ 1990
A L + ++E +E E +++E + +E + + +E Q+ + ++K +
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518
Query: 1991 EIATPWTTTQELRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLI-- 2048
E+ ++L+ KL+KLK +K +K + E K E +KL ++E + A+L+
Sbjct: 519 ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE 578
Query: 2049 ---EPETSYEELQKRQGQL 2064
S EEL++R +L
Sbjct: 579 LEELGFESVEELEERLKEL 597
Score = 33.1 bits (76), Expect = 2.1
Identities = 76/424 (17%), Positives = 172/424 (40%), Gaps = 41/424 (9%)
Query: 911 EEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFE-VDVENVEAKLKELEVAV 969
++ ++ +++ LE+ + ++A +L +K + E +E +L+ELE A
Sbjct: 345 KKLKELEKRLEELEERHEL----YEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKA- 399
Query: 970 EGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIA-----------TEHMNLNDAD 1018
K + ++ + +L +E EL I + A TE +
Sbjct: 400 ----KEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLE 455
Query: 1019 RLTVN-DSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLAR 1077
T + K + R + + + L+ L K++E L++ E +L
Sbjct: 456 EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELI-KLKELAEQLKELEEKLK- 513
Query: 1078 PVSTQAEDISEALTVYQKLLDEINSWKNSQS--DEDLVHLADSIKPLDEVVERIENHAVP 1135
E++ + Y+KL +++ K ++L L + K L E+ ++++
Sbjct: 514 --KYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDE-LEE 570
Query: 1136 KLKSLL--LLREQFTTL--IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQE 1191
+L LL L F ++ + + + + E+ ++ + + K+ ++ +
Sbjct: 571 ELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDK 630
Query: 1192 YEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLR---QQNEKR-- 1246
LA + ++L + ++ E+ + +++ L + + LR ++ EKR
Sbjct: 631 AFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRRE 690
Query: 1247 --AAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRA-AESEKLGA 1303
EKL ELEE +A E + R+ LR+ V+K + ++RA ++ ++ +
Sbjct: 691 EIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKERALSKVGEIAS 750
Query: 1304 ELEE 1307
E+ E
Sbjct: 751 EIFE 754
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 45.1 bits (106), Expect = 5e-04
Identities = 48/209 (22%), Positives = 89/209 (42%), Gaps = 9/209 (4%)
Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
+E ++ + K++ +K E ++ +E + AE K E ++ EA A E
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545
Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVE 1335
+ + K E+L++ EK+ AE K E+ AL E A K + EV + E
Sbjct: 1546 KKADELKKAEELKKAEEKKKAEEAK--KAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603
Query: 1336 LDKQRLELATKEVQP-----YVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWL 1390
K + E A K + +++ E KK + + K + + E K+A E + +
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA--EEENKI 1661
Query: 1391 RKAKLGAQANADCHGEQQTTKDKADKIKQ 1419
+ A+ +A D ++ K + D+ K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Score = 35.1 bits (80), Expect = 0.51
Identities = 42/203 (20%), Positives = 76/203 (37%), Gaps = 28/203 (13%)
Query: 1227 QQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEK 1286
++ ++K E + E + AE E+ E+ + EA A ++ + RK ++
Sbjct: 1224 KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKA-DE 1282
Query: 1287 LRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATK 1346
L++ EK+ A+ K E ++ EA E + ++ + A K
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAE--------------------EAKKADEAKK 1322
Query: 1347 EVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGE 1406
+ + ++ + KK EA E E EA D A+ A+A E
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA-------DEAEAAEEKAEAAEKKKEE 1375
Query: 1407 QQTTKDKADKIKQITKSLPEGQK 1429
+ D A K + K E +K
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKK 1398
Score = 34.0 bits (77), Expect = 1.4
Identities = 43/206 (20%), Positives = 75/206 (36%), Gaps = 7/206 (3%)
Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPS 1273
D + E ++ + + + E R+ E R AE K + E A E
Sbjct: 1138 DARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAE 1197
Query: 1274 SSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVD 1333
+R + E+ R+ E R AE K +++ EA E A K E + +
Sbjct: 1198 DARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA-----EEERNNE 1252
Query: 1334 VELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKA 1393
+ +A + + E +K A+ + K E + EA K + D +K
Sbjct: 1253 EIRKFEEARMAHFARRQAAIKAEEARK--ADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310
Query: 1394 KLGAQANADCHGEQQTTKDKADKIKQ 1419
A+ + + + K KAD K+
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKK 1336
Score = 31.6 bits (71), Expect = 6.8
Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 4/105 (3%)
Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAEL----EEIIEALHARETAV 1271
+E ++ + K+ +K E + + K+ AE K E EE +A A++ A
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493
Query: 1272 PSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
+ + +K E ++ +E + AE K E ++ EA A E
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 44.5 bits (106), Expect = 6e-04
Identities = 49/286 (17%), Positives = 107/286 (37%), Gaps = 36/286 (12%)
Query: 1048 NNKLQSVKRLQHKVEEAKALLQKAESQL----ARPVSTQAEDISEALTVYQKLLDEINSW 1103
N +L+ ++ L K+ EA L+ +L + + E + ++ L++I
Sbjct: 42 NERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKI--- 98
Query: 1104 KNSQSDEDLVHLADSIKPLDEVVERIENH--AVPKLKSL--LLLREQFTTLIMQIVGFIT 1159
++++ L + I L+ ++ +E + + L + VG +
Sbjct: 99 -----EKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVP 153
Query: 1160 ETTARVGEVDGNSATVQEKIDKYDKIIA-------DIQEYEAILATASDKGDQLSSDGTI 1212
E +++ + V+ + E E L + +L +GT
Sbjct: 154 EDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTP 213
Query: 1213 SDR-NEITEQLQSRKQQLHNLRKTVEKLRQQNEKR-AAESEKLGAELEEIIEALHARETA 1270
S+ EI E+L+ +++ +L + +++L ++ + A E L ELE +T
Sbjct: 214 SELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKTD 273
Query: 1271 --------VPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI 1308
VP R+ L++ ++K E + E EE+
Sbjct: 274 KTFAIEGWVP-EDRVKKLKELIDKA--TGGSAYVEFVEPDEEEEEV 316
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 42.8 bits (101), Expect = 8e-04
Identities = 40/200 (20%), Positives = 83/200 (41%), Gaps = 4/200 (2%)
Query: 451 WNDFLSSKNTLEKWIEGFQKKIEAEKDIGDCTNLDDL-EKYKALLQEVISHNSDMETLND 509
FL + LE W+ ++ + + D +++ L +K++AL E+ +H +E LN+
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 510 KCEALMELAAH--SPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEA 567
E L+E I+E L + L + ++E+ L Q +F +++E
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQ-QFFRDADDLEQ 120
Query: 568 WLHKAHATVQECAGNGSKNVLKERLDTVNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQ 627
WL + A + ++E L + E + + + L + + L+ D
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 628 QDSLREAMTVLRDSWDRLNR 647
+ + E + L + W+ L
Sbjct: 181 DEEIEEKLEELNERWEELLE 200
Score = 42.4 bits (100), Expect = 0.001
Identities = 24/193 (12%), Positives = 78/193 (40%), Gaps = 9/193 (4%)
Query: 699 KWQQYVNDELEWENHSQNTLQWLDNIRS--DMKTLLERSKHIVDDIEKKKDDLKPIQKEA 756
++ + ++ W + + L D ++ LL++ + + ++ ++ ++ + +
Sbjct: 4 QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELG 63
Query: 757 QQLSEWDS----SIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWL 812
+QL E I ++ +L RW ++ + + L E + D E+WL
Sbjct: 64 EQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRL-EEALDLQQFFRDADDLEQWL 122
Query: 813 LQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMAHN 872
+ L + + + + E ++ H++L ++ +++ ++ + ++ +
Sbjct: 123 EEKEAALASED--LGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 873 SLYINNRQQTLEQ 885
I + + L +
Sbjct: 181 DEEIEEKLEELNE 193
Score = 36.7 bits (85), Expect = 0.086
Identities = 39/212 (18%), Positives = 88/212 (41%), Gaps = 6/212 (2%)
Query: 346 QAYTKAVLDTQEWIDATYNAVNMWGDLTLERVSLHSNLERLKNLEKELGRIGHK--TVAL 403
Q + + + + W+ ++ S+ + L++ + LE EL + +
Sbjct: 3 QQFLRDADELEAWLSEKEELLSSTDYGDDLE-SVEALLKKHEALEAELAAHEERVEALNE 61
Query: 404 KNNVLESTGPIGKENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSSKNTLEK 463
L G E I + + ++ W L+ + + + L + F + LE+
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQ-FFRDADDLEQ 120
Query: 464 WIEGFQKKIEAEKDIGDCTNLDDL-EKYKALLQEVISHNSDMETLNDKCEALMELAAHSP 522
W+E + + +E D ++++L +K+K L +E+ +H +++LN+ E L+E
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 523 IREETLRLQALYAALVTSVQGLVSQVEKNLSD 554
E +L+ L + L + +K L +
Sbjct: 181 DEEIEEKLEELNERW-EELLELAEERQKKLEE 211
Score = 33.6 bits (77), Expect = 0.85
Identities = 32/161 (19%), Positives = 63/161 (39%), Gaps = 20/161 (12%)
Query: 1442 LESTGPIGKENINQEINQITLDWTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEG 1501
L G E I + + ++ W L+ ++ + + L L F + LE+W+E
Sbjct: 66 LIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQ-FFRDADDLEQWLE- 123
Query: 1502 FQKKIESEKDIGDCTNLDDLEKYKALLQ--EGLHKEADNKKVLIDSFNKGIIGMTELESE 1559
E E + DLE + LL+ + L +E + + + S N E
Sbjct: 124 -----EKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLN-----------E 167
Query: 1560 EVKDLIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAM 1600
++L++ G E+ L+ + ++ A+ER+
Sbjct: 168 LAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKK 208
Score = 32.8 bits (75), Expect = 1.5
Identities = 17/99 (17%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 1146 QFTTLIMQIVGFITETTARVG--EVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKG 1203
QF + ++ E A + ++ + +V+E + K+ ++ +++ +E L + ++
Sbjct: 110 QFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELA 169
Query: 1204 DQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQ 1242
++L +G EI E+L+ ++ L + E+ +++
Sbjct: 170 EELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKK 208
Score = 32.0 bits (73), Expect = 2.4
Identities = 37/179 (20%), Positives = 78/179 (43%), Gaps = 14/179 (7%)
Query: 141 KTTLQEVLSHKRIIESLLDKSKSLPQINKDQA--LEKAIVSVNKRYEDLVDGILKTISQL 198
+ E+ +H+ +E+L + + L + A +++ + +N+R+E+L + + +L
Sbjct: 43 EALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRL 102
Query: 199 EESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTERELKGKLSSWCGYEESVATLLDWLKVT 258
EE+LD+ Q F+ L E+E ESV LL K
Sbjct: 103 EEALDLQQFFRDADDL-----------EQWLEEKEAALASEDLGKDLESVEELLKKHKEL 151
Query: 259 EKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQL 317
E++L +L+E +L + ++ L +L ++ E L + +E+ +++L
Sbjct: 152 EEELEAHEPRLKSLNELAEEL-LEEGHPDADEEIEEKLEELNERWEELLELAEERQKKL 209
Score = 31.3 bits (71), Expect = 4.3
Identities = 25/108 (23%), Positives = 50/108 (46%), Gaps = 4/108 (3%)
Query: 1832 LKRAQKKIEAEKDIGDCTNLDDL-EKYKALLQEVISHNSDMETLNDKCEALMELAAH--S 1888
L ++ + + D +++ L +K++AL E+ +H +E LN+ E L+E
Sbjct: 16 LSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAE 75
Query: 1889 PIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWL 1936
I+E L + L + ++E+ L Q +F +++E WL
Sbjct: 76 EIQERLEELNQRWEELRELAEERRQRLEEALDLQ-QFFRDADDLEQWL 122
Score = 30.5 bits (69), Expect = 7.1
Identities = 25/158 (15%), Positives = 65/158 (41%), Gaps = 1/158 (0%)
Query: 950 EFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNV-SIA 1008
+F D + +EA L E E + +++ ++ LL K+ L E + + +
Sbjct: 4 QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELG 63
Query: 1009 TEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALL 1068
+ + D + + + + +R+ +L E+R + L + + + + L
Sbjct: 64 EQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLE 123
Query: 1069 QKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNS 1106
+K + + + E + E L +++L +E+ + +
Sbjct: 124 EKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPR 161
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 41.6 bits (98), Expect = 0.005
Identities = 108/525 (20%), Positives = 208/525 (39%), Gaps = 68/525 (12%)
Query: 926 ETKAQSDRIQDA--LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQE 983
+ KAQ + ++ + LN L E + ++E E + +E + E
Sbjct: 191 QLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQ-REQARETRDE--------ADE 241
Query: 984 LLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDR 1043
+L ++ + EE L I ++ E + + +R + + V +++R +L +D
Sbjct: 242 VLEEHEERREELETLEAEIEDL---RETIAETEREREELAEEVRDLRERLEELEEERDDL 298
Query: 1044 ISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKL------L 1097
++ + ++ + EE L+ + +L + + L L
Sbjct: 299 LAEAGLDDADAEAVEARREE----LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDL 354
Query: 1098 DEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGF 1157
+E ++ E L ++ + +++ E IE L E+ L + G
Sbjct: 355 EERAEELREEAAELESELEEAREAVEDRREEIEE-----------LEEEIEEL-RERFGD 402
Query: 1158 ITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASD---KGDQLSSDGTISD 1214
+G + ++E+ D+ + E EA L TA + + + L G
Sbjct: 403 ---APVDLGNAEDFLEELREERDE---LREREAELEATLRTARERVEEAEALLEAG---- 452
Query: 1215 RNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSS 1274
+ E Q + H +T+E+ R++ E+ AE E L E+EE+ E L E V +
Sbjct: 453 --KCPECGQPVEGSPH--VETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAE 508
Query: 1275 SRLHNL---RKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA----RETAVKTLPVLLL 1327
R+ L R+ +E+L + + E + EL E L A + A
Sbjct: 509 DRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAE 568
Query: 1328 EVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAY 1387
E EL+ + EL KE ++R TL IA+A D+ E + EA + +E
Sbjct: 569 EAREEVAELNSKLAEL--KERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERR 626
Query: 1388 DWL-----RKAKLGAQANADCHGEQQTTKDKADK-IKQITKSLPE 1426
+ L RK +L A+ + E + K++A++ ++Q+ + L E
Sbjct: 627 ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDE 671
Score = 36.2 bits (84), Expect = 0.28
Identities = 50/233 (21%), Positives = 103/233 (44%), Gaps = 23/233 (9%)
Query: 911 EEGSPIKDKIDRLE--KETKAQSDRIQDALKTLNQLIG-HRKEFEVDVENVEAKLK---E 964
EE ++++++R E E + + +R+++ + L +LI R+ E E E + E
Sbjct: 489 EEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAE 548
Query: 965 LEVAVEGDVKTNNVLML--QELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTV 1022
LE E + +E + A+LN + EL I ++ E + A
Sbjct: 549 LEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESL----ERIRTLLAAIADA 604
Query: 1023 NDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHK-----VEEAKALLQKAESQLAR 1077
D + ++++ L+ ++R + K + + L+ + +EEA+ ++AE L +
Sbjct: 605 EDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQ 664
Query: 1078 PVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE 1130
V + +++ E L EI + +N E+L L + + L+ VE +E
Sbjct: 665 -VEEKLDELREERD---DLQAEIGAVENEL--EELEELRERREALENRVEALE 711
Score = 33.5 bits (77), Expect = 1.6
Identities = 70/400 (17%), Positives = 153/400 (38%), Gaps = 72/400 (18%)
Query: 917 KDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTN 976
+++I+ LE+E + +R DA L +E + + + + ELE +
Sbjct: 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRT----- 437
Query: 977 NVLMLQELLNKYAQLNEEAG--------ELNMMISNVSIATEHMNLNDADRLTVNDSVSS 1028
+E + + L E E + + + E + +A+ + + V
Sbjct: 438 ----ARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEE 493
Query: 1029 MQKRYAQLS--RTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDI 1086
+++R + EDRI + + + ++ L + E + +++ + + +A ++
Sbjct: 494 VEERLERAEDLVEAEDRIERLEERREDLEEL---IAERRETIEEKRERAEE-LRERAAEL 549
Query: 1087 SEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQ 1146
++ E +++E +A+ L E+ ERIE ++ ++++LL
Sbjct: 550 EAEAEEKREAAAE----AEEEAEEAREEVAELNSKLAELKERIE--SLERIRTLLAAIA- 602
Query: 1147 FTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQL 1206
+ + + + E D E LA ++ +L
Sbjct: 603 -------------DAEDEIERLREKREALAELND----------ERRERLAEKRERKREL 639
Query: 1207 SSDGTISDRNEITEQLQSRKQQLHN-LRKTVEKLRQQNEKRAAESEKLGA------ELEE 1259
++ D I E+ + K++ L + EKL + E+R ++GA ELEE
Sbjct: 640 EAE---FDEARI-EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695
Query: 1260 IIEALHARETAVPSSSRLHNLRKTVE--------KLRQQN 1291
+ E A E V + L++ + +E +LRQ+N
Sbjct: 696 LRERREALENRVEALEALYDEAEELESMYGDLRAELRQRN 735
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 40.5 bits (95), Expect = 0.010
Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 29/212 (13%)
Query: 1212 ISDRNEITEQLQSR-KQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1270
S + I EQ S K+ L L+ ++ K +E++ L+ + + R+
Sbjct: 127 ASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKD 186
Query: 1271 VPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEE-----------IIEALHARETAV 1319
+P+ L +K +E +++++ K E +K+ E ++ I A + +
Sbjct: 187 LPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDK 246
Query: 1320 KTLPVLLLEVTSVDVELDKQRLE----------------LATKEVQPYVQRYETLKKNIA 1363
K L L E+ + D +RLE L E++ Y+Q++
Sbjct: 247 KQLKADLAELKKA-PQNDLKRLENKYAIKSGDLKNFAVDLFGPEIRKYLQKFLKYYDQAE 305
Query: 1364 EAIDKFETQANEHEAYKQAYNEAYDWLRKAKL 1395
++K + + E A E YD+
Sbjct: 306 PLLNKSKKEPKEEAVETLANGEDYDFKDDNPY 337
Score = 31.2 bits (71), Expect = 6.8
Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 8/99 (8%)
Query: 141 KTTLQEVLSHKRIIESLLDKSKSLPQINKDQALEKAIVSVNKRYEDLVDGILKTISQLEE 200
L+ V + + I +SL + + KD ++ + KR E + +K +L++
Sbjct: 157 GEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQK 216
Query: 201 SLDIFQQ--------FQQLQKAYQEDQKQLWDKLSSLTE 231
+ F + Q+++ A + Q + L E
Sbjct: 217 IKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAE 255
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 40.0 bits (94), Expect = 0.013
Identities = 46/245 (18%), Positives = 108/245 (44%), Gaps = 29/245 (11%)
Query: 1034 AQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVY 1093
+++ + +D+I +N ++Q++ +++ K + E+I+ Y
Sbjct: 166 SEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQR---KKNGENIARKQNKY 222
Query: 1094 QKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE--NHAVPKLKSLLLLREQFTTLI 1151
+L++E + K ++ +E L + + +++ + N A K+KS + EQF +I
Sbjct: 223 DELVEEAKTIK-AEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKI---EQFQKVI 278
Query: 1152 ------------MQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATA 1199
Q I+E R+ ++ +Q ++K D I +++E
Sbjct: 279 KMYEKGGVCPTCTQQ---ISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQ 335
Query: 1200 SDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEE 1259
S K +L + + + ++ IT +++K ++ +E+L+ + A E KL EL++
Sbjct: 336 SKKLLELKNKISTNKQSLITLVDKAKK-----VKAAIEELQAEFVDNAEELAKLQDELDK 390
Query: 1260 IIEAL 1264
I++
Sbjct: 391 IVKTK 395
Score = 35.0 bits (81), Expect = 0.56
Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 12/152 (7%)
Query: 1570 HGLQVELNALLQNIEAEINKVANA----AQERKAMQDKISKLQSWLKQYIDYS---GNKQ 1622
EL L N+ +I + A ++ KI + Q +K Y Q
Sbjct: 233 KAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQ 292
Query: 1623 ALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKL-----PSRTNEAMYRDLANLRYD 1677
+ D++ +I++ E+ L+ L I+ + S+ + ++ +
Sbjct: 293 QISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQS 352
Query: 1678 FEKCVSSLSDVKQGLESRLVQWNEYETSLGKL 1709
V VK +E ++ + L KL
Sbjct: 353 LITLVDKAKKVKAAIEELQAEFVDNAEELAKL 384
Score = 33.4 bits (77), Expect = 1.4
Identities = 34/211 (16%), Positives = 76/211 (36%), Gaps = 39/211 (18%)
Query: 917 KDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTN 976
I+ K+ R Q+ L + K + ++E + +L L + + + +
Sbjct: 201 NKNIEEQRKKNGENIARKQNKYDELVEEA---KTIKAEIEELTDELLNLVM--DIEDPSA 255
Query: 977 NVLMLQELLNKYAQLNEE----AGELNMMISNVSIATEHMNLNDADRL--TVNDSVSSMQ 1030
L +L A++ + + M T +++ + D + +Q
Sbjct: 256 A---LNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQ 312
Query: 1031 KRYAQLSRTIED-------------RISTVNNKL----QSVKRLQHKVEEAKALLQKAES 1073
+L I++ ++ + NK+ QS+ L K ++ KA +++ ++
Sbjct: 313 HSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQA 372
Query: 1074 QLARPVSTQAEDISEALTVYQKLLDEINSWK 1104
+ D +E L Q LD+I K
Sbjct: 373 EFV--------DNAEELAKLQDELDKIVKTK 395
Score = 31.9 bits (73), Expect = 5.1
Identities = 40/205 (19%), Positives = 85/205 (41%), Gaps = 20/205 (9%)
Query: 1762 TEKKLGDEMELKTTLDEKKAQLQ-VYRTIYQDATSHQQDLLQLKDKIESLHQPSEQ---S 1817
T K +E K E A+ Q Y + ++A + + ++ +L D++ +L E +
Sbjct: 199 TYNKNIEEQRKKN--GENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAA 256
Query: 1818 KQQLATITTRHGNVLKRAQKKIEAEKDIGDC-TNLDDLEKYKALLQEVISHNSDMET--- 1873
+L T + + +++ QK I+ + G C T + + + ++ +++
Sbjct: 257 LNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLE 316
Query: 1874 -LNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNLSD-------QTEF 1925
L+ + L E+ E++ +L L + T+ Q L++ V+K Q EF
Sbjct: 317 KLDTAIDELEEIM--DEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374
Query: 1926 LAKKEEVEAWLHKAHATVQECDELE 1950
+ EE+ + V+ EL
Sbjct: 375 VDNAEELAKLQDELDKIVKTKSELV 399
Score = 31.1 bits (71), Expect = 8.8
Identities = 40/261 (15%), Positives = 104/261 (39%), Gaps = 51/261 (19%)
Query: 1065 KALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDE 1124
K +++ Q+ + + + I + + Y K ++E ++ E++ + DE
Sbjct: 173 KDKIRELNQQIQT-LDMKIDHIQQQIKTYNKNIEEQRK----KNGENIARKQNKY---DE 224
Query: 1125 VVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDK 1184
+VE + K+ + E+ T ++ +V I + +A + +++ +A ++ KI+++ K
Sbjct: 225 LVEEAKTI-----KAEI---EELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQK 276
Query: 1185 II-------------ADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHN 1231
+I I E + DK +L ++ + ++L+ + +
Sbjct: 277 VIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQ--HSLEKLDTAIDELEEIMDEFNE 334
Query: 1232 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQN 1291
K + +L+ + I + ++ ++ +E+L+ +
Sbjct: 335 QSKKLLELKNK------------------ISTNKQSLITLVDKAK--KVKAAIEELQAEF 374
Query: 1292 EKRAAESEKLGAELEEIIEAL 1312
A E KL EL++I++
Sbjct: 375 VDNAEELAKLQDELDKIVKTK 395
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 40.1 bits (94), Expect = 0.015
Identities = 26/170 (15%), Positives = 62/170 (36%), Gaps = 9/170 (5%)
Query: 72 EKDLDTLKNTT---DVEAKNSLLQKLLTEKDQAGHKLTYLTSSG---EKLYLDTAAKGRE 125
+K LK T + L ++ L E ++ L E ++
Sbjct: 858 KKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSS 917
Query: 126 VVRQQLRALRDSKCSKTTLQEVLSHKRIIESLLDKSKSLPQINKDQALEKAIVSVNKRYE 185
+ + L + L+++L++ + E + LP++NK +E + ++ YE
Sbjct: 918 DLIENLEFKTELI---ARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYE 974
Query: 186 DLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTERELK 235
DL+ + + ++ + F++ + L + L E ++
Sbjct: 975 DLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVE 1024
Score = 31.2 bits (71), Expect = 7.8
Identities = 82/468 (17%), Positives = 161/468 (34%), Gaps = 75/468 (16%)
Query: 729 KTLLERSKHIVDDIEK----KKDDLKPIQKEAQQLSEWDSSIVPKVSDLE---------- 774
+T+ +S V+ E+ K D+K I E +S I+ L
Sbjct: 867 ETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFK 926
Query: 775 ----TRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMA---HNSLYI 827
R + + D + S E + + + E L + S + +++ +
Sbjct: 927 TELIARLKKLLNNIDLE-EGPSIEYVKLPELNK-LHEVESKLKETSEEYEDLLKKSTILV 984
Query: 828 NNRQQTLEQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQ 887
+ ++++ +K L + S A+Q++ K L ++ ++ S +
Sbjct: 985 REGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSAS--KIISSESTEL 1042
Query: 888 HHEKLLQDVILSPDNKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGH 947
K LQ + + + ++L+ KA R +++L QL
Sbjct: 1043 SILKPLQKLK----------------GLLLLENNQLQARYKALKLRRENSLLDDKQL--- 1083
Query: 948 RKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLN-------KYAQLNEEAGELNM 1000
+E+ E LK + V + +V N++ +L K L E + L+
Sbjct: 1084 -----YQLESTENLLKTINVK-DLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLSQ 1137
Query: 1001 MISNVSIATEHMNLNDAD-----RLTVNDSVSSMQKRYAQL-SRTIEDRISTVNNKLQSV 1054
+++ + + +++ + +++ S A R + + +KL S
Sbjct: 1138 LVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKL-SS 1196
Query: 1055 KRLQHKVEEAKALLQKAESQLARPVSTQAEDISEA--LTVYQKLLDEINSWKN------S 1106
+ E AL K S R + + ISE T Y L N+ S
Sbjct: 1197 SEVNDLKNELIALFSKIFSGWPRGDKLK-KLISEGWVPTEYSTSLKGFNNLNKKFDTPAS 1255
Query: 1107 QSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLL--LLREQFTTLIM 1152
S+E L+ L +SI L + E + SLL + F L
Sbjct: 1256 MSNEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYINVGLFNALRT 1303
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 39.8 bits (93), Expect = 0.018
Identities = 60/300 (20%), Positives = 110/300 (36%), Gaps = 49/300 (16%)
Query: 1108 SDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGE 1167
D++L +AD+++ LDE + +E +L++L F + + T R
Sbjct: 218 DDDELTDVADALEQLDEYRDELE-----RLEALERALRNFLQ---RYRRY-ARTMLRRRA 268
Query: 1168 VDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQ 1227
SA Q YD++ D+ L TA ++ +L + +R + L++R +
Sbjct: 269 TRLRSAQTQ-----YDQLSRDLGRARDELETAREEERELDARTEALEREA--DALRTRLE 321
Query: 1228 QLHNLR-----KTVEKLRQQNE---KRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
L + +E+ R E AA++ + E E +E R + R
Sbjct: 322 ALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEE--ERRRLDEEAGR--- 376
Query: 1280 LRKTVEKLRQQNEKRAAESEKLG---AELEEIIEALHARETAVKTLPVLLLEVTSVDVEL 1336
L +LR E+ A +E+ G A E AL A+E L E+ ++D
Sbjct: 377 LDDAERELRAAREQLARAAERAGLSPAHTAEPDAALAAQE---------LQELGALDARR 427
Query: 1337 DK-----QRLELATKEVQPYVQ---RYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYD 1388
+ ++ R E D+ + +A + A + +EA
Sbjct: 428 QDADRVIAQRSEQVALLRRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADEAVH 487
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 39.5 bits (92), Expect = 0.025
Identities = 81/447 (18%), Positives = 179/447 (40%), Gaps = 70/447 (15%)
Query: 714 SQNTLQWLDNIRSDMKTLLERSKHIVDD-IEKKKDDLKPIQKEAQQLSEWDSSIVPKVSD 772
+++ L + K L ER I++D + K ++ + K + + ++
Sbjct: 261 AESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAE 320
Query: 773 LETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQ 832
+ A +K K S L + Y ++ D +L++ M +NS Y+ + +
Sbjct: 321 INKYHAIIK-----KLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNS-YLKSIES 374
Query: 833 TLEQIQHHEKLLQDYSSYQNAVQDAE--------KWLLQISFQLMAHNSLYINNRQQTLE 884
++I+ + K ++ S++ + + + K L +I+ +L +S + Q+
Sbjct: 375 LKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRA 434
Query: 885 QIQHHEKLLQDV-ILSPDNKLAPCFS--------------AEEGSPIKDKIDRLEKETKA 929
++ ++L +++ +L+ + C + E+ S +++KI +E E K
Sbjct: 435 LRENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKD 494
Query: 930 QSDRIQDALKTL---------------NQLIGHRKEFEVDVENVEAKLKELEVAVEGDVK 974
++I D K N++ R + E D++ +LK+ E
Sbjct: 495 IDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLE-DIKIKINELKDKHDKYEEIKN 553
Query: 975 TNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQ---- 1030
L L++L +K +IS + I T N+ + +ND S +Q
Sbjct: 554 RYKSLKLEDLDSKRTSWL----NALAVISLIDIETNRSRSNEIKK-QLNDLESRLQEIEI 608
Query: 1031 -----KRYAQLS-RTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLA-------- 1076
K Y S R IE+ + +NNK ++ + +E+ + + + Q+A
Sbjct: 609 GFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPD 668
Query: 1077 -RPVSTQAEDISEALTVYQKLLDEINS 1102
+ ++++ DI + L +K LD+ +
Sbjct: 669 LKEITSRINDIEDNLKKSRKALDDAKA 695
Score = 38.0 bits (88), Expect = 0.062
Identities = 102/585 (17%), Positives = 217/585 (37%), Gaps = 49/585 (8%)
Query: 1157 FITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSS-DGTISDR 1215
F+ GE+D + + + KI+ +I E ++ D + IS+
Sbjct: 129 FLNSIFVGQGEMD---SLISGDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNI 185
Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
+ + E+L+S +L N++K + +EK + + K L A + S
Sbjct: 186 DYLEEKLKSSNLELENIKKQIA----DDEKSHSITLKEIERLSIEYNN--AMDDYNNLKS 239
Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVE 1335
L+ L + + + L ELE+ K L +++ + V
Sbjct: 240 ALNELSSLEDMKNRYESEIKTAESDLSMELEKNNY--------YKELEERHMKIINDPVY 291
Query: 1336 LDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKL 1395
++ + K + + L NI I+K+ + ++ YN+ +
Sbjct: 292 KNRNYINDYFKYKNDIENKKQIL-SNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDD 350
Query: 1396 GAQANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQ 1455
+ G + I+ + K + E K I++ A + +L I + I +
Sbjct: 351 LNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEIL-KIQEIDPDAIKK 409
Query: 1456 EINQITLDWTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDC 1515
E+N+I + LQ+ + + + +L + +E + G +G+
Sbjct: 410 ELNEINVK---LQDISSKVSSLNQRIRALRENLDELSRNMEM-LNGQSVCPVCGTTLGEE 465
Query: 1516 TNLDDLEKY---KALLQEGL-HKEADNKKVLIDSFNKGIIGMTE-LESEEVKDLIQVGVH 1570
+ + Y K+ L+E + E + K ID + E LESEE+ I +
Sbjct: 466 KSNHIINHYNEKKSRLEEKIREIEIEVKD--IDEKIVDLKKRKEYLESEEINKSIN--EY 521
Query: 1571 GLQVELNALLQNIEAEINKVANAAQERKAMQDKI---------SKLQSWLK-----QYID 1616
A L++I+ +IN++ + + + ++++ SK SWL ID
Sbjct: 522 NKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLID 581
Query: 1617 YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRY 1676
N+ ++N+++ EI +I+ S R++ + N + N
Sbjct: 582 IETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEAN--NLNNKYNEIQ 639
Query: 1677 DFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLK 1721
+ + + L + ++ + + L ++ + +ND E+ LK
Sbjct: 640 ENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLK 684
Score = 33.3 bits (76), Expect = 1.7
Identities = 87/540 (16%), Positives = 196/540 (36%), Gaps = 59/540 (10%)
Query: 1530 EGLHKEADNKKVLIDSFNKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQNIEAEINK 1589
L + D K +ID I + LE + + +EL + + I +
Sbjct: 162 NSLERNYDKLKDVIDMLRAEISNIDYLEEK---------LKSSNLELENIKKQIADDEKS 212
Query: 1590 VANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKVNEIQESFPEISTKLQTL- 1648
+ +E + + + + + Y + L S D N + + L
Sbjct: 213 HSITLKEIERLSIEY---NNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMEL 269
Query: 1649 --SDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQGLESRLVQWNEYETSL 1706
+++ + + N+ +Y++ + D+ K + + + KQ L + + N+Y +
Sbjct: 270 EKNNYYKELEERHMKIINDPVYKNRNYIN-DYFKYKNDIENKKQILSNIDAEINKYHAII 328
Query: 1707 GKLITWLNDTENVLKNFTLQPTLNEKKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKL 1766
KL D + +K + LN Q EL+G + Y +S+ +L ++ K +
Sbjct: 329 KKLSVLQKDYNDYIKKKSRYDDLN--NQILELEGYEMDYNSYLKSIESLKKKIEEYSKNI 386
Query: 1767 GD-EMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATIT 1825
+ L ++ + + L + K+ SL+Q ++ L ++
Sbjct: 387 ERMSAFISEILKIQEIDPDAIKKELNEINV---KLQDISSKVSSLNQRIRALRENLDELS 443
Query: 1826 TRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKY---KALLQEVISH-NSDMETLNDKCEAL 1881
R+ +L +G+ + + Y K+ L+E I +++ +++K L
Sbjct: 444 -RNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDL 502
Query: 1882 MELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHA 1941
+ + EE + Y + L + L H
Sbjct: 503 KKRKEYLE-SEEINKSINEY---------------NKIESARADLEDIKIKINELKDKHD 546
Query: 1942 TVQECDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQE 2001
+E +K E +++++ W + +V I +I T + + E
Sbjct: 547 KYEEIKNRYKSLKLE--DLDSKRTSWLN-----------ALAVISLI--DIETNRSRSNE 591
Query: 2002 LRSKLLKLKEILKKCEKGVGDHEAFVDKY-EECSQKLAQVEAKYAKLIEPETSYEELQKR 2060
++ +L L+ L++ E G D ++++DK E + + KY ++ E + E+L+ +
Sbjct: 592 IKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGK 651
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the autophagic
process and in the vacuolar protein sorting pathway.
Apg14p may be a component specifically required for the
function of Apg6/Vps30p through the autophagic pathway.
There are 17 auto-phagosomal component proteins which are
categorized into six functional units, one of which is
the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
complex and the Atg2-Atg18 complex are essential for
nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol 3-phosphate
(PtdIns(3)P) at the pre-autophagosomal structure (PAS).
The localisation of this complex at the PAS is controlled
by Atg14. Autophagy mediates the cellular response to
nutrient deprivation, protein aggregation, and pathogen
invasion in humans, and malfunction of autophagy has been
implicated in multiple human diseases including cancer.
This effect seems to be mediated through direct
interaction of the human Atg14 with Beclin 1 in the human
phosphatidylinositol 3-kinase class III complex.
Length = 307
Score = 38.5 bits (90), Expect = 0.030
Identities = 22/116 (18%), Positives = 50/116 (43%), Gaps = 1/116 (0%)
Query: 1190 QEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAE 1249
Q+ E L A+++ +L++D + E+L + ++ L++ +E+ R++ E+
Sbjct: 41 QKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRA 100
Query: 1250 SEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAEL 1305
+ ++L L R + +++T KL + A + L EL
Sbjct: 101 LAQRRSDLSSASYQLEKRRASQL-EKLQDEIKRTRSKLNALHSLLAEKRSFLCREL 155
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 37.7 bits (88), Expect = 0.032
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
E+ E+L ++QL L++ ++L+Q+ AE E+L EL I + S+
Sbjct: 76 AELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL----------SA 125
Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIE 1310
L + +LR++ + E+E L AE E + E
Sbjct: 126 NAIELDEENRELREELAELKQENEALEAENERLQE 160
Score = 37.3 bits (87), Expect = 0.044
Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 3/80 (3%)
Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVP 1272
+ E+ EQL +Q+ L++ + L + E+ E ++ IE
Sbjct: 80 EELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENREL-- 137
Query: 1273 SSSRLHNLRKTVEKLRQQNE 1292
L L++ E L +NE
Sbjct: 138 -REELAELKQENEALEAENE 156
Score = 33.8 bits (78), Expect = 0.64
Identities = 23/101 (22%), Positives = 48/101 (47%), Gaps = 7/101 (6%)
Query: 1220 EQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
E+L +Q+L L++ + +L++Q + E+++L EL + L + +R+
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQK---ELARIKQ 122
Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVK 1320
L +L ++N + E EL++ EAL A ++
Sbjct: 123 LSANAIELDEENRELREELA----ELKQENEALEAENERLQ 159
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 38.5 bits (90), Expect = 0.047
Identities = 48/258 (18%), Positives = 79/258 (30%), Gaps = 42/258 (16%)
Query: 1109 DEDLVHLAD---SIKPLDEVVER-IENHAVPKLKSLLLLREQFTTLIMQIVGFITETTAR 1164
D LVHL D K L E+ + + A L SLL + + + G +
Sbjct: 27 DFGLVHLEDLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLFILPEE 86
Query: 1165 VGEVDGNSATVQEKIDKYDKIIADIQEYEAILATA--------SDKGDQLSSDGTISDRN 1216
V +++ +++E I +K ++++E L D D L ++ +
Sbjct: 87 VEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPLAYLD 146
Query: 1217 EITEQLQSRK-----QQLHNLRKTVEKLRQQNEKRAA-ESEKLGAELEEIIEALHARETA 1270
L+ K L K E L E A E + A + ++
Sbjct: 147 FDLSLLRGLKFLLVRLGLVRREKL-EALVGVIEDEVALYGENVEASVVIVVAHGAEDLDK 205
Query: 1271 V--------------------PSSSRLHNLRKTVEK---LRQQNEKRAAESEKLGAELEE 1307
V PS E L + A +EK+ EL
Sbjct: 206 VSKILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLA 265
Query: 1308 IIEALHARETAVKTLPVL 1325
+ E L + L L
Sbjct: 266 VREILEIEKALGDVLSKL 283
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of the
C-terminal region.
Length = 546
Score = 38.4 bits (89), Expect = 0.049
Identities = 34/124 (27%), Positives = 55/124 (44%), Gaps = 11/124 (8%)
Query: 1160 ETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEIT 1219
E R+ E++ + T+ +K + D+++ Q+ EA L + Q TI R+E T
Sbjct: 217 ENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEET 276
Query: 1220 E--QLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRL 1277
E LQ +QLH E LR E+ + +K G EE+ R+ + + L
Sbjct: 277 ERIDLQLENEQLH------EDLRTLQERLESSQQKAGLLGEELASLGSLRDHTI---AEL 327
Query: 1278 HNLR 1281
H R
Sbjct: 328 HQSR 331
>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
Length = 341
Score = 37.2 bits (87), Expect = 0.085
Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 15/93 (16%)
Query: 1050 KLQSVKRLQHKVEEAKALLQKAESQLARPVSTQ------AEDISEALTVYQKLLDEINSW 1103
+L+ +++ H +EA+ALL + AR + + AL +Q L E
Sbjct: 124 RLRRLRQRAHLRDEARALLHSHDVGEARAFCEKLAKQAGIPQLHPALQRWQASLHET--- 180
Query: 1104 KNSQSDEDLVHLADSI--KPLDEVVER-IENHA 1133
+D +L+ L + PLD R I A
Sbjct: 181 ---HNDAELLRLYEREVLPPLDAQARRLISRAA 210
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 37.6 bits (87), Expect = 0.093
Identities = 44/142 (30%), Positives = 69/142 (48%), Gaps = 19/142 (13%)
Query: 1210 GTISDRNEITEQLQSRKQQLHNLRKTV--------EKLRQQNEKRAAESEKLGAELEEII 1261
GTIS E+ EQ+QS +Q+L +L++ V EK+R++ + + K ELE+
Sbjct: 1066 GTIS--RELREQIQSVRQELESLQRAVVTPVVTDEEKVRERYDALIENTSKRITELEKAK 1123
Query: 1262 EA-LHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR--ETA 1318
A L + A+ S L+NL++ V LR NEK + + + IE L +
Sbjct: 1124 LANLDVVKKAI---SNLNNLQQEVTLLR--NEKIRMHTGTDKVDFSD-IEKLEKQLQVID 1177
Query: 1319 VKTLPVLLLEVTSVDVELDKQR 1340
K LLEVT L+K++
Sbjct: 1178 TKLADAYLLEVTKQISALEKEK 1199
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino acids
in length. This family contains a P-loop motif suggesting
it is a nucleotide binding protein. It may be involved in
replication.
Length = 1198
Score = 37.4 bits (87), Expect = 0.11
Identities = 45/286 (15%), Positives = 103/286 (36%), Gaps = 28/286 (9%)
Query: 1025 SVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAE 1084
+ + R + + + ++RLQ++ + K L+ A ++ + TQ
Sbjct: 629 QLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLR 688
Query: 1085 DISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLR 1144
+ L LL++ ++ + D+ + + V ++N
Sbjct: 689 QLDAQLKQ---LLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDN------------- 732
Query: 1145 EQFTTLIMQIVGFITETTARVGEVDGNSATVQEKID----KYDKIIADIQEYEAILATAS 1200
Q L I T+ AR+ E+ +D ++ I+E E + +
Sbjct: 733 -QLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIA 791
Query: 1201 DKGDQLSS-----DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGA 1255
+ ++ T R+ + E+ + QL L + E+L+Q+ + +++
Sbjct: 792 VRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRK 851
Query: 1256 ELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKL 1301
+LE+ +AL + + L LR + +L + E A +
Sbjct: 852 KLEQERKALEKQLDQL--DELLRGLRDEMRQLAELKEPANANQAEG 895
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 37.1 bits (86), Expect = 0.12
Identities = 43/212 (20%), Positives = 74/212 (34%), Gaps = 45/212 (21%)
Query: 1209 DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRA---------AESEKLGAELEE 1259
++ + L +++L NLRK + +L ++ ++ AE + L +L
Sbjct: 170 QLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLERLRRLLPLLAERKALEQQLAA 229
Query: 1260 IIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI----------- 1308
+ E + AV R R + R+ E E L AEL+EI
Sbjct: 230 LGEVIDLPPDAV---ERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAA 286
Query: 1309 -IEALHAR----ETAVKTLPVLLLEVTS-----------------VDVELDKQRLELATK 1346
IEALH + A + LP E+ + + + A +
Sbjct: 287 AIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKE 346
Query: 1347 EVQPYVQRYETLKKNIAEAIDKFETQANEHEA 1378
V +R E L + + A D E + E +
Sbjct: 347 TVTELEKRKEALDQALKSARDALEERERELKQ 378
Score = 35.9 bits (83), Expect = 0.29
Identities = 90/596 (15%), Positives = 195/596 (32%), Gaps = 80/596 (13%)
Query: 732 LERSKHIVDDIEKKKDDLKPIQKEAQQLSEWDS-----SIVPKVSDLETRWAAVKSAWDA 786
L + + +EK+K L+ +++ L+E + + + +V DL A
Sbjct: 190 LANLRKELRQLEKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARA 249
Query: 787 KHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQD 846
+ + + A+Q L +IS ++ L Q E L Q
Sbjct: 250 ELRAARRNLELLTERLEALQAE---LDEISLD------------EELLAQAAAIEALHQQ 294
Query: 847 YSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAP 906
Y+NA QD + E LL + PD
Sbjct: 295 RGEYRNAEQDLPD--REGE-------------IANAREAA---AALLAQ--IGPDADEEA 334
Query: 907 CFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELE 966
S K+ + LEK +A ++ A L +E E +++ V A+L L
Sbjct: 335 VESLRPSLAAKETVTELEKRKEALDQALKSARDAL-------EERERELKQVRAQLAALP 387
Query: 967 VAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSV 1026
+ + + A L E+ + ++ A + L D +
Sbjct: 388 TVTVPEPLRAALADALRAGDIDAALAALEQEVAVAKRELAQALSRLGLWRGDLEELVALP 447
Query: 1027 SSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDI 1086
++ + E+ + K + L+ +E + L+ ++ A V T+ E +
Sbjct: 448 VPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHLDA--AGAVPTEEE-V 504
Query: 1087 SEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQ 1146
+ A L +I + ++++ D++ +R+ A
Sbjct: 505 AAARARRDALWQDIRRGYALLEPTASAYE-EAVESADQLADRLLREA------------- 550
Query: 1147 FTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQL 1206
+ + + + ++++ +K + ++ A L + +
Sbjct: 551 -------------QLVGELQSLRQQEEAARRRLEQLEKELEVLELALAAL-REAWQAQWA 596
Query: 1207 SSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA 1266
++ ++ E+ + L R +R K R + + +L A L ++A+
Sbjct: 597 AAGLPLTPA-EMEDWLAERATAREQVR-AYFKARAELDALLDRRARLRAALRAALKAVAI 654
Query: 1267 RETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
+ L R+ +E+ +Q ++A+ E+L + EA + A L
Sbjct: 655 VLPGEELAELLELARQLLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSAL 710
Score = 35.5 bits (82), Expect = 0.40
Identities = 40/195 (20%), Positives = 81/195 (41%), Gaps = 18/195 (9%)
Query: 1981 LDSVEKNIKQEIATPWT---TTQELRSKLLKLKEI---LKKCEKGVGDHEAFVDKYEECS 2034
L SV +++E W E+ L +LKE+ +++ + + V +E
Sbjct: 128 LGSVRDELEKEADELWKPRGRKPEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAE 187
Query: 2035 QKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEKVDMDVL--LNACVDLCDKVYES- 2091
++LA + + +L E + L++ + L L K L L +DL E
Sbjct: 188 EELANLRKELRQL---EKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERY 244
Query: 2092 -TSEPGHEPLRVQMEKLQ---QAVEALYDKITVTESLI--LSQVEAKYAKLIEPETSYEE 2145
+ R +E L +A++A D+I++ E L+ + +EA + + E + ++
Sbjct: 245 EEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAEQD 304
Query: 2146 LQKRQGQLTALLAEK 2160
L R+G++
Sbjct: 305 LPDREGEIANAREAA 319
>gnl|CDD|233885 TIGR02479, FliA_WhiG, RNA polymerase sigma factor, FliA/WhiG family.
Most members of this family are the flagellar operon
sigma factor FliA, controlling transcription of bacterial
flagellar genes by RNA polymerase. An exception is the
sigma factor WhiG in the genus Streptomyces, involved in
the production of sporulating aerial mycelium.
Length = 224
Score = 36.1 bits (84), Expect = 0.13
Identities = 27/110 (24%), Positives = 50/110 (45%), Gaps = 11/110 (10%)
Query: 1224 SRKQQLHNLRKTVEKLRQQNEKRAAESE---KLGAELEEIIEALHAR--------ETAVP 1272
S +Q+ L + + +L + + E E +LG +L+E +AL+ + +
Sbjct: 77 SLRQKARKLERAIRELEARLGREPTEEEIAEELGMDLKEYRQALNEINALSLVSLDELLE 136
Query: 1273 SSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
S +L +E + ++ + E E+L L E IE+L RE V +L
Sbjct: 137 SGDDGGSLIDRIEDDKSEDPEEELEREELREALAEAIESLSEREQLVLSL 186
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 36.9 bits (86), Expect = 0.15
Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 14/92 (15%)
Query: 1220 EQLQSRKQQLHNLRK------TVEKLRQQNEKR-------AAESEKLGAELEEIIEALHA 1266
EQLQ + +L L + E+L + KR E E+L ELE +E+L
Sbjct: 513 EQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSE 572
Query: 1267 R-ETAVPSSSRLHNLRKTVEKLRQQNEKRAAE 1297
A L + ++ Q+ RA
Sbjct: 573 SVSEARERRMALRQQLEQLQARIQRLAARAPA 604
Score = 34.9 bits (81), Expect = 0.67
Identities = 87/426 (20%), Positives = 161/426 (37%), Gaps = 73/426 (17%)
Query: 1244 EKRAAESEKLGA---ELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEK 1300
+ AAE +L EL E+ EA E ++S +L LRQQ + E+
Sbjct: 300 RQLAAEQYRLVEMARELAELNEAESDLEQDYQAAS--DHLNLVQTALRQQE-----KIER 352
Query: 1301 LGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKK 1360
A+LEE+ E L + V+ E + R E A +EV + LK
Sbjct: 353 YQADLEELEERLEEQNEVVEEADEQQEEN--------EARAEAAEEEV-------DELKS 397
Query: 1361 NIA---EAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKADKI 1417
+A +A+D +T+A +++ QA A L A NA+ + + K +
Sbjct: 398 QLADYQQALDVQQTRAIQYQQAVQALERAKQLCGLPDLTA-DNAE--DWLEEFQAKEQEA 454
Query: 1418 KQITKSLPEGQKLIDKTVALK------NNVLESTGPIGKENINQEINQITLDWTNLQNTL 1471
+ SL QKL A V + G + + ++ ++
Sbjct: 455 TEELLSL--EQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLA 512
Query: 1472 QDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEG 1531
+ + + + L L + E+ + F K++ + + D+LE+ + L+
Sbjct: 513 EQLQQLRMRLSELEQR-LRQQQRAERLLAEFCKRLGK-----NLDDEDELEQLQEELEAR 566
Query: 1532 LHKEADNKKVLIDSFNKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQNIEAEINKVA 1591
L E SE V + + ++ L L+ ++A I ++A
Sbjct: 567 L----------------------ESLSESVSEARER-----RMALRQQLEQLQARIQRLA 599
Query: 1592 NAAQERKAMQDKISKLQSWLKQYIDYS-GNKQALQSRLDKVNEIQESFPEISTKLQTLSD 1650
A A QD +++L+ + + S + +Q L++ E+ E++ + Q L +
Sbjct: 600 ARAPAWLAAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDE 659
Query: 1651 HIENSS 1656
IE S
Sbjct: 660 EIERLS 665
Score = 31.1 bits (71), Expect = 9.5
Identities = 28/133 (21%), Positives = 58/133 (43%), Gaps = 23/133 (17%)
Query: 193 KTISQLEESLDIFQ----QFQQLQKAYQE---DQKQLWDKLSSLTE-RELKGKLSSWCGY 244
++QLE + + Q QF+QL++ YQ+ Q+ + +LTE + + S Y
Sbjct: 918 NALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFS----Y 973
Query: 245 EESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIE 304
E++ + L +L L ++ Q + RT ++ Q L +++
Sbjct: 974 EDAA-----------EMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVL 1022
Query: 305 SLHQPSEQSKQQL 317
+ + S +K+Q+
Sbjct: 1023 ASLKSSYDAKRQM 1035
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 36.4 bits (84), Expect = 0.15
Identities = 30/154 (19%), Positives = 67/154 (43%), Gaps = 34/154 (22%)
Query: 1240 RQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESE 1299
R++ + + A + L ++ ++ E L N+R+++E+ ++++++ +++
Sbjct: 60 RRERQLKTAAGQLLQTKIRKLTE----------QDEGLRNIRESIEERQKESDQLKLQNQ 109
Query: 1300 KLGAELEEIIEALHARETAVKT------LPVLLLEVTSVDVELDKQRLELATKEVQPYVQ 1353
KL +L HAR +KT L L + + L Q L+ +E ++
Sbjct: 110 KLVNQL------GHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQ-LDALVQECNEKIE 162
Query: 1354 RYETLKKNIAEAIDKFETQANEHEAYKQAYNEAY 1387
+ L + +AE + AY+Q N+ Y
Sbjct: 163 ENQELNRELAETL-----------AYQQELNDEY 185
Score = 33.3 bits (76), Expect = 1.4
Identities = 22/90 (24%), Positives = 47/90 (52%), Gaps = 1/90 (1%)
Query: 1222 LQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HARETAVPSSSRLHNL 1280
LQ++ ++L + + +R+ E+R ES++L + ++++ L HAR + + R L
Sbjct: 73 LQTKIRKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKL 132
Query: 1281 RKTVEKLRQQNEKRAAESEKLGAELEEIIE 1310
+ L+++N+ + + L E E IE
Sbjct: 133 EELSRHLKEENQCLQIQLDALVQECNEKIE 162
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and
secretion / Signal transduction mechanisms].
Length = 408
Score = 36.5 bits (84), Expect = 0.17
Identities = 36/281 (12%), Positives = 106/281 (37%), Gaps = 25/281 (8%)
Query: 980 MLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRT 1039
++ + + L+ + E+ + +S + + + V ++ + + +++
Sbjct: 132 IIDAVQDNAEALSGASEEIAASATELSARADQQAESLEE---VASAIEELSETVKEVAFN 188
Query: 1040 IEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPV---STQAEDISEALTVYQKL 1096
++ + + Q + +V +A +Q+ +LA V S +++I E +V +
Sbjct: 189 AKEAAALASEASQVAEEGGEEVRQAVEQMQEIAEELAEVVKKLSESSQEIEEITSVINSI 248
Query: 1097 LDEIN---------SWKNSQSDEDLVHLADSIKPL--------DEVVERIENHAVPKLKS 1139
++ N + + ++ +AD ++ L E+ IE +
Sbjct: 249 AEQTNLLALNAAIEAARAGEAGRGFAVVADEVRKLAERSADSAKEIGLLIEE-IQNEAAD 307
Query: 1140 LLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATA 1199
+ E+ + + + V + ET + +GE+ V + I + + + +
Sbjct: 308 AVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINAS 367
Query: 1200 SDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLR 1240
++ D ++ + + E+ + K+ L + V K +
Sbjct: 368 IEELDDVTQE-NAAAVEELAAASEELKELAEKLLELVAKFK 407
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
chromosome partitioning].
Length = 420
Score = 35.9 bits (83), Expect = 0.24
Identities = 36/218 (16%), Positives = 79/218 (36%), Gaps = 27/218 (12%)
Query: 916 IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKT 975
+D+ +LEK+ K+ + L E D++ + ++ +L +
Sbjct: 57 QQDQRAKLEKQLKSLE-------TEIASLEAQLIETADDLKKLRKQIADLNARLNA---- 105
Query: 976 NNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQ 1035
+ + + +L E+ L N A ++ DA R + Y
Sbjct: 106 ----LEVQEREQRRRLAEQLAALQRSGRNPPPA-LLVSPEDAQR------SVRLAIYYGA 154
Query: 1036 LSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQK 1095
L+ +RI + L+ + ++ ++ +A L S+ Q +++ L +K
Sbjct: 155 LNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRA----QQAKLAQLLEERKK 210
Query: 1096 LLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHA 1133
L ++NS ++ + L L + L + E A
Sbjct: 211 TLAQLNSELSADQKK-LEELRANESRLKNEIASAEAAA 247
Score = 31.2 bits (71), Expect = 6.8
Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 2/99 (2%)
Query: 1222 LQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN-L 1280
+R +++ L+ T+++L + AAE +L L E + + L
Sbjct: 156 NPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQL 215
Query: 1281 RKTVEKLRQQNEKRAAESEKLGAELEEII-EALHARETA 1318
+ +++ E+ A +L E+ A ARE A
Sbjct: 216 NSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAA 254
>gnl|CDD|234078 TIGR02980, SigBFG, RNA polymerase sigma-70 factor, sigma-B/F/G
subfamily. This group of similar sigma-70 factors
includes clades found in Bacilli (including the
sporulation factors SigF:TIGR02885 and SigG:TIGR02850 as
well as SigB:TIGR02941), and the high GC gram positive
bacteria (Actinobacteria) where a variable number of them
are found depending on the lineage.
Length = 227
Score = 35.3 bits (82), Expect = 0.26
Identities = 28/111 (25%), Positives = 45/111 (40%), Gaps = 24/111 (21%)
Query: 1225 RKQQLHN-LRKTVEKLRQQNEKR--AAE-SEKLGAELEEIIEALHARETAVPSSSRLHNL 1280
R ++L + K E+L Q+ + AE +E+LG EE++EAL A S L
Sbjct: 84 RLKELGVKINKATEELTQRLGRSPTIAEIAERLGVSEEEVVEALEAGNAYQALS--LD-- 139
Query: 1281 RKTVEKLRQQNEKRAAESEKLGAE------------LEEIIEALHARETAV 1319
+ + A + LG E L+ ++ AL RE +
Sbjct: 140 ----ASIEDDDGDPIALLDTLGDEDDALETVEDRLALKPLLAALPERERRI 186
>gnl|CDD|168831 PRK07122, PRK07122, RNA polymerase sigma factor SigF; Reviewed.
Length = 264
Score = 34.8 bits (80), Expect = 0.38
Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 10/100 (10%)
Query: 1073 SQLARPVSTQAEDISEAL---TVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERI 1129
S+LA + E++ E L + Y I+S S D+ +AD++ +D +++I
Sbjct: 147 SELAAELGMDREEVVEGLVAGSAYN--TLSIDSGGGSGDDDAR-AIADTLGDVDAGLDQI 203
Query: 1130 ENH-AVPKLKSLLLLREQFTTLIMQIVGFITET--TARVG 1166
EN A+ L + L RE+ T L+++ +T+T RVG
Sbjct: 204 ENREALRPLLAALPERER-TVLVLRFFESMTQTQIAERVG 242
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 35.3 bits (82), Expect = 0.44
Identities = 32/142 (22%), Positives = 57/142 (40%), Gaps = 24/142 (16%)
Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLR------------QQNEKRAAESEKLGAELEEI 1260
+ ++ E+ + R Q+ L+ +E+L ++ KR + SEKL ++
Sbjct: 171 RELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKRLSNSEKLAEAIQNA 230
Query: 1261 IEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVK 1320
+E L + V + S L + +E L + + K + +E L L E+ EA +
Sbjct: 231 LELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSELAELLEEALYELEEASEELRAYLD 290
Query: 1321 TLPVLLLEVTSVDVELDKQRLE 1342
L E D RLE
Sbjct: 291 EL------------EFDPNRLE 300
Score = 31.8 bits (73), Expect = 5.2
Identities = 47/281 (16%), Positives = 101/281 (35%), Gaps = 86/281 (30%)
Query: 841 EKLLQDYSSYQNAVQDAEKWLLQ----------ISFQLMAHNSLYI--------NNRQQT 882
E + Y +++ A ++ E + + FQL L + ++
Sbjct: 157 EAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKR 216
Query: 883 LEQIQHHEKLLQDV-----ILSPDNKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDA 937
L + EKL + + +LS ++ +T + + A
Sbjct: 217 L---SNSEKLAEAIQNALELLSGED-----------------------DTVSALSLLGRA 250
Query: 938 LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGE 997
L+ L L + + E +E L ELE A E EL +L +
Sbjct: 251 LEALEDLSEYDGKLSELAELLEEALYELEEASE------------ELRAYLDELEFDPNR 298
Query: 998 LNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYA-------QLSRTIEDRISTVNNK 1050
L + +RL ++ S+ ++Y + I++ ++ ++N
Sbjct: 299 LEEV---------------EERL---FALKSLARKYGVTIEDLLEYLDKIKEELAQLDNS 340
Query: 1051 LQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALT 1091
+S++ L+ +V++ KA L +A L+ A+++ + +T
Sbjct: 341 EESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEVT 381
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles, clathrin-coated
and synaptic vesicles. They have important roles in
membrane trafficking processes. The 116kDa subunit
(subunit a) in the V-type ATPase is part of the V0
functional domain responsible for proton transport. The a
subunit is a transmembrane glycoprotein with multiple
putative transmembrane helices it has a hydrophilic amino
terminal and a hydrophobic carboxy terminal. It has roles
in proton transport and assembly of the V-type ATPase
complex. This subunit is encoded by two homologous gene
in yeast VPH1 and STV1.
Length = 707
Score = 35.0 bits (81), Expect = 0.47
Identities = 44/256 (17%), Positives = 87/256 (33%), Gaps = 27/256 (10%)
Query: 913 GSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGD 972
G P+KD + + + +++ + L I KE E ++E++E ++ ELE +
Sbjct: 51 GIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEI---KEVEENLESLEKEINELE--EWLN 105
Query: 973 VKTNNVLMLQELLN---KYAQLNEEAGEL-NMMISNVSIATEHMNLNDADRLTVNDSVSS 1028
V L E L + + L+ + L + +N + +
Sbjct: 106 VLDEEKSFLDENLEELSELSNLDIDFKYLRGAEGLKLGFVAGVINREKLE--AFERELWR 163
Query: 1029 MQKRYAQLSRTIEDRISTVNNKLQSV---KRLQHKVEEAKALLQKAESQLARPVSTQAED 1085
+ Y + + E K KV++ +L +L V +
Sbjct: 164 ACRGYIRQAEIEEPLEDPKKTVFIIFFVGKEDLDKVKK---ILDSFGFELY-DVPETEGE 219
Query: 1086 ISEALTVYQKLLDEINSWKNSQSDE---DLVHLADSIKPLDEVVERIENHAVPKLKSLLL 1142
SE ++ K ++E+ LV +AD + DE V + K L
Sbjct: 220 RSELISKVNKRIEELQRVLEQTESHLEKVLVKIADELLAWDEQVSKE------KAVYETL 273
Query: 1143 LREQFTTLIMQIVGFI 1158
+ T + G++
Sbjct: 274 NLFNYDTKTLIAEGWV 289
>gnl|CDD|177326 PHA01817, PHA01817, hypothetical protein.
Length = 479
Score = 35.1 bits (80), Expect = 0.48
Identities = 22/74 (29%), Positives = 37/74 (50%), Gaps = 6/74 (8%)
Query: 670 YNRFNMWLSGVENKLNEPSPVFTEIDLN-TKWQQYVNDE-----LEWENHSQNTLQWLDN 723
Y RFN+WL G E+K+ + EI ++ T + Y +E L W NH+ +L +
Sbjct: 33 YGRFNVWLGGNESKIRQVLKAVKEIGVSPTLFAVYEKNEGFSSGLGWLNHTSARGDYLTD 92
Query: 724 IRSDMKTLLERSKH 737
+ + L+ +SK
Sbjct: 93 AKFIARKLVSQSKQ 106
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 35.1 bits (81), Expect = 0.48
Identities = 29/199 (14%), Positives = 70/199 (35%), Gaps = 22/199 (11%)
Query: 1175 VQEKIDKYDKIIADIQEYEAILATAS---------DKGDQLSSDGTISDRN-EITEQLQS 1224
++ KY +I ++ + I S + + + D E+ +
Sbjct: 768 EKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFND 827
Query: 1225 RKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTV 1284
+ +L++ +E++ ++ + + +KL L E I A++ L +
Sbjct: 828 KNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLD-----TAEKLEELY 882
Query: 1285 EKLRQQNE-KRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVT------SVDVELD 1337
+++ E K+ AE E L E + L + L+E + D
Sbjct: 883 ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDD 942
Query: 1338 KQRLELATKEVQPYVQRYE 1356
+ E +++ + +R+
Sbjct: 943 WKAKEEVEAKLKAFFRRFI 961
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 35.1 bits (82), Expect = 0.49
Identities = 18/83 (21%), Positives = 43/83 (51%), Gaps = 8/83 (9%)
Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI--IEALHARETA 1270
+ +E+ E++++ ++L L K +E+L+ + AA + L A+ +E+ ++ L A+
Sbjct: 712 AKPSELPERVEALLEELKELEKELEQLKAK--LAAAAAGDLLAQAKEVNGVKVLAAQVEG 769
Query: 1271 VPSSSRLHNLRKTVEKLRQQNEK 1293
V + + LR + L+ +
Sbjct: 770 VDAKA----LRTLADDLKDKLGS 788
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
the myosin-like domain [Function unknown].
Length = 499
Score = 35.0 bits (80), Expect = 0.50
Identities = 40/258 (15%), Positives = 98/258 (37%), Gaps = 14/258 (5%)
Query: 1139 SLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILAT 1198
L LR + T +++ ++ + + A + + + ++ + + D I ++ L T
Sbjct: 33 RLFGLRPKQTAVLVTVLTGMLISAATLAILFLLNRNLRSGVFQLDDIRPQLRALRTELGT 92
Query: 1199 ASDKGDQLSSDGTISDRNEIT------EQLQSRKQQLH-NLRKTVEKLRQQNEKRAAESE 1251
A + ++ + R+E+ E ++ NL K ++L + ++
Sbjct: 93 AQGEKRAAETERE-AARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQT 151
Query: 1252 KLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEA 1311
+L E+ + ++ S +L ++ + R+A+ E+ L A
Sbjct: 152 RLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANA 211
Query: 1312 LHARETAVKTLPVLL----LEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAID 1367
AR + + D ++ ++ ++A + Q ++ E + + A
Sbjct: 212 AQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQ--IRERERQLQRLETAQA 269
Query: 1368 KFETQANEHEAYKQAYNE 1385
+ E + + EAY QAY
Sbjct: 270 RLEQEVAQLEAYYQAYVR 287
Score = 32.3 bits (73), Expect = 3.7
Identities = 33/159 (20%), Positives = 63/159 (39%), Gaps = 10/159 (6%)
Query: 1892 EETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEA 1951
+E RL L T ++ L Q + + A +++++A + + V + A
Sbjct: 137 QELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSA 196
Query: 1952 GIKEEQKNIETQYLQWNSFRETLAQMTA----------WLDSVEKNIKQEIATPWTTTQE 2001
I++E +N+ T+ + E LA+ A D+ Q+IA +E
Sbjct: 197 QIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRE 256
Query: 2002 LRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQV 2040
+L +L+ + E+ V EA+ Y Q+ A
Sbjct: 257 RERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAAT 295
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 35.0 bits (81), Expect = 0.55
Identities = 28/177 (15%), Positives = 69/177 (38%), Gaps = 11/177 (6%)
Query: 1023 NDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQ 1082
++ + ++ +L E I +L+S+ + + E K +++ ++ L +
Sbjct: 277 DEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALL-EALEEI 335
Query: 1083 AEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVV----ERIENHA--VPK 1136
E + L +KL D S + + + + D I ++E++ E+I+N K
Sbjct: 336 LEKNLQKLE--EKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNK 393
Query: 1137 LKSLL--LLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQE 1191
K L L + I + ++ ++ +I +K I ++++
Sbjct: 394 AKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEK 450
Score = 31.1 bits (71), Expect = 7.8
Identities = 80/412 (19%), Positives = 161/412 (39%), Gaps = 70/412 (16%)
Query: 1211 TISDRN-EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR-- 1267
T+ + N EI Q++ K++L L + +E+L + EK+ E EK + + A+
Sbjct: 85 TLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY 144
Query: 1268 ETAVPSSSRLHNLRKT-VEKLRQQNEKRAAES------EKLGAELEEIIEALHARETAVK 1320
++ + + + N +K EKL ++ + + E+L A+++ + + E A+
Sbjct: 145 DSNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLFSSNKP-ELALL 203
Query: 1321 TLPVLLLE------------VTSVDVELDK--QRLELAT--KEVQPY------------- 1351
TL V+ + + S DV + + L + KE Y
Sbjct: 204 TLSVIDFDEIEQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQT 263
Query: 1352 --VQRYETLKKNIAEAIDKFETQANEH-EAYKQAYNEAYDWLRKAKLGAQANADCHGEQQ 1408
+R L+ + E + Q E + Y+ +A + L + N++ + +
Sbjct: 264 ITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVE 323
Query: 1409 TTKDKADKIKQITKSLPEGQKLIDKTVALKNNV-LESTGPIGKE------NINQEI---N 1458
K + +++I + QKL +K ++ LES + + IN+ I N
Sbjct: 324 ELKALLEALEEILEKNL--QKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHN 381
Query: 1459 QITLDWTNLQNTLQDIDKHH--AKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCT 1516
+ + +N + H A+ + + K LEK I +K+I+ +
Sbjct: 382 EKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLE------ 435
Query: 1517 NLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGI----IGMTELESEEVKDL 1564
+++ + ++E L K+ N + D NK + G ELE E
Sbjct: 436 --AEIKALEKEIKE-LEKQLTNIEPTADEINKLLKAYGFGNFELEKEAEDKG 484
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 35.0 bits (81), Expect = 0.58
Identities = 26/117 (22%), Positives = 43/117 (36%), Gaps = 16/117 (13%)
Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAEL----EEIIEALHARE 1268
S R +T + + L L++ V RQ+ + AAE +L AEL I + L
Sbjct: 15 SRRELLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAELIQQELAINDQL---- 70
Query: 1269 TAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGA---ELEEIIEALHARETAVKTL 1322
S L+ + + L + + A +L + E I L + L
Sbjct: 71 -----SQALNQQTERLNALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRIL 122
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.8 bits (81), Expect = 0.59
Identities = 30/151 (19%), Positives = 62/151 (41%), Gaps = 21/151 (13%)
Query: 1220 EQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
+ + K++L+ L ++E+L ++ E++A E+E L E E++ E L
Sbjct: 509 KLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELE-------------- 554
Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQ 1339
EK + E+ E+ E ++ I+ A++ A + + L +
Sbjct: 555 -----EKKEKLQEEEDKLLEEAEKEAQQAIKE--AKKEADEIIKELRQLQKGGYASVKAH 607
Query: 1340 RLELATKEVQPYVQRYETLKKNIAEAIDKFE 1370
L A K + ++ E KK E ++ +
Sbjct: 608 ELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 33.6 bits (78), Expect = 1.6
Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 22/101 (21%)
Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
NE+ L+ +++L + E L + E+EKL ELEE E L E
Sbjct: 519 NELIASLEELERELEQKAEEAEALLK-------EAEKLKEELEEKKEKLQEEE------- 564
Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
K +E+ ++ ++ E++K E +EII+ L +
Sbjct: 565 -----DKLLEEAEKEAQQAIKEAKK---EADEIIKELRQLQ 597
Score = 31.3 bits (72), Expect = 6.3
Identities = 28/134 (20%), Positives = 48/134 (35%), Gaps = 30/134 (22%)
Query: 1256 ELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1315
+L E+I L L + +E+ ++ E E+EKL ELEE E L
Sbjct: 517 KLNELIA-------------SLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563
Query: 1316 ETAVKTLPVLLLEVTSVDVELDK---QRLELATKEVQPYVQRYETLKKNIAEAIDKFETQ 1372
E + E +K Q ++ A KE ++ L+K ++ E
Sbjct: 564 EDKLLE-------------EAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELI 610
Query: 1373 ANEHEAYKQAYNEA 1386
+ +A +
Sbjct: 611 -EARKRLNKANEKK 623
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 34.7 bits (80), Expect = 0.67
Identities = 35/125 (28%), Positives = 59/125 (47%), Gaps = 24/125 (19%)
Query: 1028 SMQKRYAQLSRT--------IEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPV 1079
+++++Y LSR +E +S+ + +LQ L V+ A LQ A+++LA
Sbjct: 403 ALEQQYQDLSRNRDDWMIAEVEQMLSSASQQLQ----LTGNVQLALIALQNADARLATSD 458
Query: 1080 STQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKS 1139
S QA + +A+ +I K + S DL LA IK LD+ + +I+ A+P
Sbjct: 459 SPQAVAVRKAIA------QDIERLKAAPS-ADLTGLA--IK-LDDAIAKID--ALPLSGE 506
Query: 1140 LLLLR 1144
L
Sbjct: 507 ALPPH 511
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists of
several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 34.5 bits (79), Expect = 0.82
Identities = 82/397 (20%), Positives = 151/397 (38%), Gaps = 43/397 (10%)
Query: 938 LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQ--ELLNKYAQLNEEA 995
L++L + V ++++E +LK E E VK+ +LQ EL + A L EE
Sbjct: 215 LQSLTSEQASSADNSVKIKHLEEELKRYEQDAE-VVKSMKEQLLQIPELERELAALREEN 273
Query: 996 GELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVK 1055
+L M + + E L D +L R + R + +L+ K
Sbjct: 274 RKLRSMKEDNELLKE--ELEDLQS---------------RLERFEKMREKLADLELEKEK 316
Query: 1056 RLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHL 1115
L+++++ K+LLQ L P +D+S + V Q + KN L
Sbjct: 317 -LENELKSWKSLLQDIGLNLRTP-----DDLSRRIVVLQNE-ELQLKEKNGSISSSAKQL 369
Query: 1116 ADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITET---TARVGEVDGNS 1172
+++ L ++ + + K L L+ L ++ E A + D
Sbjct: 370 ETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKEL 429
Query: 1173 ATVQEKIDKYDKIIAD---IQEYEAILATASDKGDQLSSDGTI-SDRNE--------ITE 1220
++ +Q+ ++ LA ++ +L D DRN + E
Sbjct: 430 TETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNNTLETEIKLLKE 489
Query: 1221 QLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI-IEALHARETAVPSSSRLHN 1279
QL S ++ L +++ LR + E E ++L E + ++ H ++SR
Sbjct: 490 QLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLEHLCLQGDYNASRTKV 549
Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
L ++ + + E L AE E++ E L A E
Sbjct: 550 LHMSLNPASEAEQIAKQTIEALQAECEKLKERLQALE 586
>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
Length = 780
Score = 34.4 bits (79), Expect = 0.82
Identities = 40/264 (15%), Positives = 72/264 (27%), Gaps = 29/264 (10%)
Query: 1410 TKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQN 1469
K AD +++ E K L N T + +N +T T L +
Sbjct: 180 VKGAADGAEKLKDGTDEASNGNKKLSDLLN-----TLNNSSATFSDGLNALTSGLTTLTD 234
Query: 1470 TLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQ 1529
L +D + + L + I F + + DL L
Sbjct: 235 GLNQLDSGLGTLAAGIGELKQGAEQLNEGIGEFSSGLSELNSG-----VQDLAAGVPQLN 289
Query: 1530 EGLHKEADNKKVL-------------IDSFNKGIIGMTELESEEVKDLIQVGVHGLQVEL 1576
+G+ A + + +G+ T E + + LQ L
Sbjct: 290 QGISALAAGLSLPDSLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLS----SLQSML 345
Query: 1577 NALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGN-KQALQSRLDKVNEIQ 1635
+A +Q + + Q L + + LD + +
Sbjct: 346 ALSKSLDLTAEGATVDALGAPDGVQW-LDESQKTLATLSELLSTGIDGVSEGLDALEQAS 404
Query: 1636 ESFPEISTKLQTLSDHIENSSRKL 1659
+ KL+T I S +L
Sbjct: 405 AQLAKSLAKLKTAVAQIAASIAQL 428
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome
partitioning].
Length = 1480
Score = 34.5 bits (79), Expect = 0.84
Identities = 33/178 (18%), Positives = 76/178 (42%), Gaps = 20/178 (11%)
Query: 795 IAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDYSSYQNAV 854
+ + ++ + + +++S +Q L Q Q E+LL D+ Q
Sbjct: 502 LREGPDQRHLAEQVQPLRMRLS------------ELEQRLRQQQSAERLLADFCKRQGKN 549
Query: 855 QDAEKWLLQISFQ--LMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAPCFSAEE 912
DAE+ L+ S ++N ++ ++ ++ LQ I S + AP + A +
Sbjct: 550 LDAEELEALHQELEALIESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQR-APVWLAAQ 608
Query: 913 GSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVE 970
+ +++L +++ + QD + + QL+ +E V+ + + A+ L+ +E
Sbjct: 609 -----NALEQLSEQSGEEFTDSQDVTEYMQQLLEREREATVERDELGARKNALDEEIE 661
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 34.6 bits (79), Expect = 0.86
Identities = 66/395 (16%), Positives = 156/395 (39%), Gaps = 38/395 (9%)
Query: 916 IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKT 975
++D + KE DR+ K L L ++ ++E+ E +L E K
Sbjct: 575 LEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYED------KL 628
Query: 976 NNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQ 1035
+V Q+ + +L EE + + + ++ AT + N S + +R Q
Sbjct: 629 FDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQ 688
Query: 1036 LSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQK 1095
+++ IS + +KL+ E +K ++ + I + +K
Sbjct: 689 TEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIID----LKEK 744
Query: 1096 LLDEINSWKNSQSDEDLVHLADSIKPLDEVVERI--ENHAVPKLKSLLLLREQFTTLIMQ 1153
+ E+ + K + + D+ L + I+ + ++ I E + + + + E+F +
Sbjct: 745 EIPELRN-KLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKD 803
Query: 1154 IVGFITETTARVGEVDGN------SATVQEKIDKYDKIIADIQEYEAILATASDKGDQLS 1207
+ I + A++ D + + QEK + D +++ I+ +
Sbjct: 804 VERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKL------------ 851
Query: 1208 SDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HA 1266
I D+ E + L+S+ +L + + + Q+ ++ + +L E++ +I + A
Sbjct: 852 ----IQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDA 907
Query: 1267 RETAVPSSSRLHNLRKTVEKL--RQQNEKRAAESE 1299
+E P + L ++ E+L ++ + A+ +
Sbjct: 908 KEQDSPLETFLEKDQQEKEELISSKETSNKKAQDK 942
Score = 32.7 bits (74), Expect = 3.3
Identities = 81/522 (15%), Positives = 189/522 (36%), Gaps = 61/522 (11%)
Query: 468 FQKKIEAEKDIGDCTNLDDLEKYKALLQEVIS-HNSDMETLNDKC---EALMELAAHSPI 523
+ E K + + + + L +EV S N + + + +L H+
Sbjct: 474 LELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTT 533
Query: 524 REETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECAGNG 583
R + L ++ + S+ L+ + K+++E WLH +
Sbjct: 534 RTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSK---------SK 584
Query: 584 SKNVLKERLDTVNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQQDSLREAMTV--LRDS 641
N ++RL +N + + ++ ++ ++ + L S +D L +
Sbjct: 585 EINQTRDRLAKLNKELASLEQNKNHINNELESKEEQL----SSYEDKLFDVCGSQDEESD 640
Query: 642 WDRLNRDLKSTSTQLKSYIARWNELDDLYNRFNMWLSGVENKLNEPSPVFTEIDLNTKWQ 701
+RL +++ +S Q +Y++F ++ + ++ PV +
Sbjct: 641 LERLKEEIEKSSKQRAMLAGATA----VYSQF---ITQLTDENQSCCPVCQRVFQTEAEL 693
Query: 702 QYVNDELEWENHS--------QNTLQWLDNIRSDMKTLLERSKHIVDDIEKKKDDLK-PI 752
Q +L+ + ++ L+ + R +M L + I+D EK+ +L+ +
Sbjct: 694 QEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKL 753
Query: 753 QKEAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWL 812
QK + + + I + + L T +SA +++ +Q ++D E+ +
Sbjct: 754 QKVNRDIQRLKNDIEEQETLLGTIMPEEESA-----KVCLTDVTIMERFQMELKDVERKI 808
Query: 813 LQISFQLMAHNSL--YINNRQQTLEQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMA 870
Q + +L + Q+ E+ + ++ + +QD ++ + + +
Sbjct: 809 AQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNE 868
Query: 871 HNSLYIN------NRQQTLEQIQHHEKLLQDVI---------LSPDNKLAPCFSAEEGSP 915
S + RQQ EQ+ +Q +I SP E+
Sbjct: 869 LKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEEL 928
Query: 916 IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVEN 957
I K E K D++ D + + + G+ K+ E +++
Sbjct: 929 ISSK----ETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQD 966
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 34.3 bits (79), Expect = 0.90
Identities = 26/108 (24%), Positives = 48/108 (44%), Gaps = 5/108 (4%)
Query: 1258 EEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARET 1317
E+ R R+ L +TVE+L ++N + E E+L E+E++ L
Sbjct: 405 EKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRR 464
Query: 1318 AVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEA 1365
V+ E+ + D +++ EL K+ +R E L++ +AE
Sbjct: 465 EVRDKVRKDREIRARDRRIERLEKELEEKK-----KRVEELERKLAEL 507
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of proteins
contain a band 4.1 domain (pfam00373), at their amino
terminus. This family represents the rest of these
proteins.
Length = 244
Score = 33.6 bits (77), Expect = 0.98
Identities = 27/107 (25%), Positives = 44/107 (41%), Gaps = 16/107 (14%)
Query: 1208 SDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1267
+ T + E +Q + Q L +E+ ++ E+ AA SE+ LE ++ A
Sbjct: 31 YEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAE 90
Query: 1268 ETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA 1314
V KL ++ EK+ AE+ +L EL E EA
Sbjct: 91 ----------------VAKLEEEREKKEAETRQLQQELREAQEAHER 121
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction endonuclease-like
fold, RmuC family [General function prediction only].
Length = 448
Score = 33.9 bits (78), Expect = 1.1
Identities = 26/101 (25%), Positives = 48/101 (47%), Gaps = 12/101 (11%)
Query: 1215 RNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAES-EKLGAELEEIIEALHARETAVPS 1273
+ + +QL ++QL L +++ +L + ++ A E E+L L E+ + +
Sbjct: 79 KARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQ---------N 129
Query: 1274 SSRLHNLRKTV-EKLRQQNEKR-AAESEKLGAELEEIIEAL 1312
+L + V EK R+Q E+R +E+ LEEI L
Sbjct: 130 LKQLLKPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLL 170
>gnl|CDD|226081 COG3551, COG3551, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 402
Score = 33.8 bits (77), Expect = 1.1
Identities = 22/147 (14%), Positives = 44/147 (29%), Gaps = 11/147 (7%)
Query: 90 LLQKLLTEKDQAGHKLTYLTSSGEKLYLDTAAKGREVVRQQLRALRDSKCSKTTLQEVLS 149
L L D+ + + A K R + +L K + L
Sbjct: 190 HLSDLRQHPDRLVAAVN--------QRVGRAPKKRFLDDVRLFLDHSLKHFASALAAAAG 241
Query: 150 HKRIIESLLDKSKSLPQI---NKDQALEKAIVSVNKRYEDLVDGILKTISQLEESLDIFQ 206
+ + + + + L + + LE+ E + + + ++ S ES+ F
Sbjct: 242 SFQALLEVHEALQQLAVDRFDIEWKGLEREFSRAMPLLEYVGELLWQSASSHNESMTRFS 301
Query: 207 QFQQLQKAYQEDQKQLWDKLSSLTERE 233
+ Q+QL D L
Sbjct: 302 EQVADLAMSYTAQRQLNDGLRDRLLEA 328
>gnl|CDD|129992 TIGR00914, 2A0601, heavy metal efflux pump, CzcA family. This model
represents a family of H+/heavy metal cation antiporters.
This family is one of several subfamilies within the
scope of Pfam model pfam00873 [Cellular processes,
Detoxification, Transport and binding proteins, Cations
and iron carrying compounds].
Length = 1051
Score = 34.0 bits (78), Expect = 1.2
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 10/40 (25%)
Query: 1414 ADKIKQITKSLPEGQK---------LIDKTVA-LKNNVLE 1443
DK++ I K+LPEG + L+D +A +K N+LE
Sbjct: 310 GDKLETINKTLPEGVEIVTTYDRSQLVDAAIATVKKNLLE 349
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 31.5 bits (72), Expect = 1.2
Identities = 15/99 (15%), Positives = 42/99 (42%), Gaps = 6/99 (6%)
Query: 699 KWQQYVNDELEWENHSQNTLQWLDNIRS--DMKTLLERSKHIVDDIEKKKDDLKPIQKEA 756
++ + ++ W + L D + ++ LL++ + ++E ++ ++ + +
Sbjct: 2 QFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELG 61
Query: 757 QQLSEWD----SSIVPKVSDLETRWAAVKSAWDAKHSDL 791
+QL E I ++ +L RW +K + + L
Sbjct: 62 EQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 29.6 bits (67), Expect = 5.4
Identities = 19/95 (20%), Positives = 44/95 (46%), Gaps = 5/95 (5%)
Query: 1748 YEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKI 1807
+ L WL+ E+ L E +L L+ +A L+ + + +H++ + L +
Sbjct: 3 FLRDADELEAWLEEKEQLLASE-DLGKDLESVEALLKKHEAFEAELEAHEERVEALNELG 61
Query: 1808 ESL----HQPSEQSKQQLATITTRHGNVLKRAQKK 1838
E L H +E+ +++L + R + + A+++
Sbjct: 62 EQLIEEGHPDAEEIEERLEELNERWEELKELAEER 96
>gnl|CDD|217653 pfam03639, Glyco_hydro_81, Glycosyl hydrolase family 81. Family of
eukaryotic beta-1,3-glucanases. Within the Aspergillus
fumigatus protein two perfectly conserved Glu residues
(E550 or E554) have been proposed as putative
nucleophiles of the active site of the Engl1
endoglucanase, while the proton donor would be D475. The
endo-beta-1,3-glucanase activity is essential for
efficient spore release.
Length = 694
Score = 33.7 bits (77), Expect = 1.2
Identities = 21/102 (20%), Positives = 32/102 (31%), Gaps = 6/102 (5%)
Query: 530 LQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECAGNGSKNVLK 589
LQ + A V S E NL D F K A + + + + K
Sbjct: 348 LQEIREAANKDVNVSDSSAESNL-DSMYFSGKIIAKYARI--LLVASEIL---YDEVLTK 401
Query: 590 ERLDTVNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQQDSL 631
L + M R+ +L DT K + ++ S
Sbjct: 402 ITLYKLKMAMPRLLGNFQPYPLLYDTKWKGIISSGWGSTSSQ 443
>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 755
Score = 33.7 bits (77), Expect = 1.4
Identities = 20/125 (16%), Positives = 34/125 (27%), Gaps = 23/125 (18%)
Query: 1220 EQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
+L +R +L + + K + + ++AA A L+ E A P S
Sbjct: 298 AELPARVAEL---QAQLNKAQHELAQKAAPLAAAQAALDAPAETATAPSAPAPQVS---- 350
Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQ 1339
AES + A + + V +
Sbjct: 351 ----------------AESSPAQPGSYLLAAPGDAPLGELAQAQSARERLAEESVPAAEP 394
Query: 1340 RLELA 1344
R LA
Sbjct: 395 RSRLA 399
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 33.8 bits (78), Expect = 1.5
Identities = 19/97 (19%), Positives = 40/97 (41%), Gaps = 13/97 (13%)
Query: 1029 MQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISE 1088
+Q++ L I + L V+RL +++E + L++ + +LA ++
Sbjct: 710 LQEQEDLLKE-IASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAK 768
Query: 1089 AL-----TVYQKLLDEINSWKNSQSDEDLVHLADSIK 1120
A V K +D + ++L +AD +K
Sbjct: 769 AEEIGGVKVLAKEVDGADM-------KELREIADDLK 798
>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
His-Domain type N23 protein tyrosine phosphatase and
related domains. This family contains the V-shaped (V)
domain of mammalian His-Domain type N23 protein tyrosine
phosphatase (HD-PTP, also known as PTPN23) and related
domains. It belongs to the V_Alix_like superfamily which
includes the V domains of Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, mammalian Alix
(apoptosis-linked gene-2 interacting protein X/ also
known as apoptosis-linked gene-2 interacting protein 1,
AIP1), and related domains. HD_PTP interacts with the
ESCRT (Endosomal Sorting Complexes Required for
Transport) system, and participates in cell migration and
endosomal trafficking. The related Alix V-domain
(belonging to a different family in this superfamily)
contains a binding site, partially conserved in the
superfamily, for the retroviral late assembly (L) domain
YPXnL motif. The Alix V-domain is also a dimerization
domain. In addition to the V-domain, HD_PTP also has an
N-terminal Bro1-like domain, a proline-rich region (PRR),
a catalytically inactive tyrosine phosphatase domain, and
a region containing a PEST motif. Bro1-like domains bind
components of the ESCRT-III complex, specifically to
CHMP4 in the case of HD-PTP. The Bro1-like domain of
HD-PTP can also bind human immunodeficiency virus type 1
(HIV-1) nucleocapsid. HD-PTP is encoded by the PTPN23
gene, a tumor suppressor gene candidate frequently absent
in human kidney, breast, lung, and cervical tumors. This
family also contains Drosophila Myopic, which promotes
epidermal growth factor receptor (EGFR) signaling, and
Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
promotes Notch signaling.
Length = 337
Score = 33.0 bits (76), Expect = 1.5
Identities = 39/171 (22%), Positives = 68/171 (39%), Gaps = 29/171 (16%)
Query: 935 QDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVK---------------TNNVL 979
D +K L + +G + DVE A L E+E +E + +V
Sbjct: 71 PDTIKNLVEAMGELSDVYQDVE---AMLNEIESLLEEEELQEKEFQEAVGKRGSSIAHVT 127
Query: 980 MLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRT 1039
L+ L KY + +E+A + SN + MNL+ A+ + + +QK +L
Sbjct: 128 ELKRELKKYKEAHEKASQ-----SNTELHKA-MNLHIANLKLLAGPLDELQK---KLPSP 178
Query: 1040 IEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEAL 1090
+ +KR+ +KV E + + E QL + +DI+ L
Sbjct: 179 SLLDRPEDEAIEKELKRILNKVNEMRKQRRSLEQQLRDAI--HEDDITSKL 227
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and may
also be involved in protein-protein interactions.
Length = 218
Score = 32.7 bits (75), Expect = 1.6
Identities = 37/188 (19%), Positives = 72/188 (38%), Gaps = 29/188 (15%)
Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA-----RETAV 1271
E ++ + K + L + ++KL +Q ++ +LG+ L E +AL E
Sbjct: 1 EEDQEFEEAKDYILKLEEQLKKLSKQAQRLVKRRRELGSALGEFGKALIKLAKCEEEVGG 60
Query: 1272 PSSSRLHNLRKTVEKLRQQNEKRA-AESEKLGAELEEIIEALHARETAVKTLPVLLLEVT 1330
L L K E+L +E +A E KL L+E + A + + LL +
Sbjct: 61 ELGEALSKLGKAAEELSSLSEAQANQELVKLLEPLKEYLRYCQAVKETLDDRADALLTLQ 120
Query: 1331 SVDVELDKQRLELATKEVQPYV-----------------------QRYETLKKNIAEAID 1367
S+ +L ++ +L + P + +RYE + + + E +
Sbjct: 121 SLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKRYEEISERLKEELK 180
Query: 1368 KFETQANE 1375
+F +
Sbjct: 181 RFHEERAR 188
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 33.1 bits (76), Expect = 1.6
Identities = 28/127 (22%), Positives = 56/127 (44%), Gaps = 2/127 (1%)
Query: 1190 QEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAE 1249
QE + + + + + +RN+ ++ + +++L + + + +Q + RA +
Sbjct: 40 QEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEK 99
Query: 1250 SEKLGAELEEIIEALHARETAVPS-SSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI 1308
+ L +LEE +AL ARE + +L N V L + + R + L AELEE
Sbjct: 100 LDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPE-QARKLLLKLLDAELEEE 158
Query: 1309 IEALHAR 1315
+
Sbjct: 159 KAQRVKK 165
>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase
B; EC 3.4.24.-) is mostly bacterial and includes
oligoendopeptidase F from Lactococcus lactis. This enzyme
hydrolyzes peptides containing between 7 and 17 amino
acids with fairly broad specificity. The PepF gene is
duplicated in L. lactis on the plasmid that bears it,
while a shortened second copy is found in Bacillus
subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis
has been identified in Bacillus subtilis as an inhibitor
of sporulation initiation when over expressed from a
multicopy plasmid.
Length = 581
Score = 33.3 bits (77), Expect = 1.6
Identities = 25/141 (17%), Positives = 59/141 (41%), Gaps = 17/141 (12%)
Query: 949 KEFEVDVENVEAKLKELEVAVEGDVKTNN--VLMLQELLNKYAQLNEEAGELNMMISNVS 1006
EF+ D++ ++ + EL+ ++ + ++ V L+ L+ +L E L +S +
Sbjct: 13 PEFQEDLKKLKQLIDELKSLLDALLPSDEDAVETLEALITLLQELLELLRTLGAFVSCLL 72
Query: 1007 IATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIED---RISTVN-NKLQSVKRLQ---- 1058
A + D + + ++ + QL ++ +S + + L + L+
Sbjct: 73 SA----DTTDEEAQALLSQLNQLSASLEQLLTLLDKKLAALSEEDLDALLADPELKEHAF 128
Query: 1059 ---HKVEEAKALLQKAESQLA 1076
+ +AK LL + E +L
Sbjct: 129 FLEERRRQAKHLLSEEEEELI 149
Score = 31.4 bits (72), Expect = 7.4
Identities = 47/253 (18%), Positives = 88/253 (34%), Gaps = 61/253 (24%)
Query: 1145 EQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGD 1204
+F + ++ I E + + + + E ++ +I +QE +L T
Sbjct: 13 PEFQEDLKKLKQLIDELKSLLDALLPSDEDAVETLEA---LITLLQELLELLRTLGAFVS 69
Query: 1205 QLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL 1264
L S T +E + L S+ QL + + L + +E +L+ ++
Sbjct: 70 CLLSADT---TDEEAQALLSQLNQLSASLEQLLTLLDKKLAALSE-----EDLDALLA-- 119
Query: 1265 HARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-----HA----R 1315
L +E+ R+Q L E EE+I L HA
Sbjct: 120 ---------DPELKEHAFFLEERRRQA------KHLLSEEEEELIAKLSVDGLHAWGKLY 164
Query: 1316 ETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANE 1375
+T TL + +E D + EL+ + + N+A D +A
Sbjct: 165 DTLTSTL--------KIPIEEDGEEKELSAGQAR-----------NLAYDPD----RAVR 201
Query: 1376 HEAYKQAYNEAYD 1388
AY+ A +A++
Sbjct: 202 KAAYE-AELKAWE 213
>gnl|CDD|220180 pfam09325, Vps5, Vps5 C terminal like. Vps5 is a sorting nexin that
functions in membrane trafficking. This is the C terminal
dimerisation domain.
Length = 236
Score = 32.6 bits (75), Expect = 1.7
Identities = 39/182 (21%), Positives = 77/182 (42%), Gaps = 30/182 (16%)
Query: 1216 NEITEQLQSRKQQLHNLRKTVEKL-RQQNEKRAAESEKLGAELEEIIEALHARETAVPSS 1274
+ + + S + QL L K +E L Q+ E +A E + + L + E + S
Sbjct: 27 EDKRQYIDSLESQLKKLYKALELLVNQRKELASAT-----GEFAKSLAMLASLELSTGLS 81
Query: 1275 SRLHNLRKTVEKLRQQNEKRA-AESEKLGAELEEII-------EALHARETAVKTLPVLL 1326
L L + E++++ E++A + LG ++E + + R A +
Sbjct: 82 RALSQLAEVEERIKELLERQALQDVLTLGETIDEYLRLIGSVKAVFNQRVKAWQQWQNAE 141
Query: 1327 LEVTSVDVELDK---------QRLELATKEVQPYVQRYETLKK---NIAEAI----DKFE 1370
+++ +L K +L+ A KEV ++ + KK +I+E I ++FE
Sbjct: 142 QDLSKKRAQLAKLKGAGKSRQDKLQQAKKEVDEAERKVQQAKKEFEDISETIKKELERFE 201
Query: 1371 TQ 1372
T+
Sbjct: 202 TE 203
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This family
consists of several hypothetical bacterial sequences. The
function of this family is unknown.
Length = 106
Score = 31.1 bits (71), Expect = 1.8
Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 8/77 (10%)
Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQ-------NEKRAAESEKLGAELEEIIEALHA 1266
D+ E+ + L +QQL+ L + +L+QQ N + E+E L LEE+ +
Sbjct: 2 DKKELFDALSELEQQLNQLLAELGELKQQLAELLEENAELRIENEHLRERLEELEQEQKE 61
Query: 1267 RETAVPSSSRLH-NLRK 1282
+E + NL +
Sbjct: 62 KEKGKKKLGEGYDNLAR 78
>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
Length = 359
Score = 32.7 bits (76), Expect = 2.2
Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 22/104 (21%)
Query: 1057 LQHKVEEAKALLQKAESQLARP--VSTQA------------EDISEALTVYQKLLDEINS 1102
+ K+E + ++ E+ L+ P +S Q E I EA Y++ +++
Sbjct: 4 MLDKLEALEERYEELEALLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAQEDLEE 63
Query: 1103 WK---NSQSDEDLVHLA-DSIKPLDEVVERIENHAVPKLKSLLL 1142
K +SD ++ +A + +K L+E +E +E +LK LLL
Sbjct: 64 AKEMLEEESDPEMREMAKEELKELEERLEELEE----ELKILLL 103
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 32.6 bits (74), Expect = 2.6
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 675 MWLSGVENKLNEPSPVFTEIDLNTK 699
+WL G K NEP+P T I NTK
Sbjct: 47 IWLGGRSKKENEPAPPSTMIATNTK 71
>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
Provisional.
Length = 638
Score = 32.8 bits (75), Expect = 2.7
Identities = 32/148 (21%), Positives = 58/148 (39%), Gaps = 21/148 (14%)
Query: 1178 KIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLH---NLRK 1234
K++ +D D+++Y+ L+ + +Q + Q+RK Q LR
Sbjct: 504 KVEPFD---GDLEDYQQWLSDVQKQENQTDEA----PKENNANSAQARKDQKRREAELRT 556
Query: 1235 TVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKT--VEKLRQQNE 1292
+ LR++ + E EKL A+L + E + S RK L+QQ
Sbjct: 557 QTQPLRKEIARLEKEMEKLNAQLAQA-------EEKLGDSELYDQSRKAELTACLQQQAS 609
Query: 1293 KRAA--ESEKLGAELEEIIEALHARETA 1318
++ E E E +E +E + +
Sbjct: 610 AKSGLEECEMAWLEAQEQLEQMLLEGQS 637
>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
DevB family. Members of this protein family are found
mostly in the Cyanobacteria, but also in the
Planctomycetes. DevB from Anabaena sp. strain PCC 7120 is
partially characterized as a membrane fusion protein of
the DevBCA ABC exporter, probably a glycolipid exporter,
required for heterocyst formation. Most Cyanobacteria
have one member only, but Nostoc sp. PCC 7120 has seven
members.
Length = 327
Score = 32.5 bits (74), Expect = 2.7
Identities = 46/242 (19%), Positives = 94/242 (38%), Gaps = 26/242 (10%)
Query: 961 KLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRL 1020
++K+L VA V+ VL + + EL++ + + A RL
Sbjct: 26 RIKKLLVAEGDRVQAGQVLAELDSRPERTA------ELDVARTQLDEAKA--------RL 71
Query: 1021 T-VNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARP- 1078
V + + +R V + ++ RL+ ++E A+ + + S
Sbjct: 72 AQVRAGAKKGEIAAQRAARAAAKLFKDVAAQQATLNRLEAELETAQREVDRYRSLFRDGA 131
Query: 1079 VSTQAEDISEALTVYQKLLDEINSWKNSQSDE------DLVHLADSIKPLDEVVERIENH 1132
VS D S+AL + + +E+ S+S++ L LA+ ++ D + + E
Sbjct: 132 VSASDLD-SKALKL-RTAEEELEEALASRSEQIDGARAALASLAEEVRETDVDLAQAEVK 189
Query: 1133 AVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEY 1192
+ + + T + I G + + AR GEV G+ ++ + ++ A + Y
Sbjct: 190 SALEAVQQAEALLELTYVKAPIDGRVLKIHAREGEVIGSEGILE--MGDTSQMYAVAEVY 247
Query: 1193 EA 1194
E
Sbjct: 248 ET 249
>gnl|CDD|181087 PRK07701, flgL, flagellar hook-associated protein FlgL; Validated.
Length = 298
Score = 32.3 bits (74), Expect = 2.8
Identities = 20/100 (20%), Positives = 34/100 (34%), Gaps = 23/100 (23%)
Query: 337 AYEGIVSIHQAYTKAVLDTQEWIDATYNAVNMWGDLTLERVSLHSNLERLKNLEKELGRI 396
Y ++ + Y K D + W++ T +A++ D+ L R
Sbjct: 51 RYRTDLAEVEQYQKNASDAKSWLENTESALDQATDI--------------------LQRA 90
Query: 397 GHKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQNT 436
V N T ++ I QEI Q+ + NT
Sbjct: 91 RELAVQAANGTNTQT---DRQAIAQEIEQLKEQLIQIANT 127
>gnl|CDD|233003 TIGR00512, salvage_mtnA, S-methyl-5-thioribose-1-phosphate isomerase.
The delineation of this family was based in part on a
discussion and neighbor-joining phylogenetic study, by
Kyrpides and Woese, of archaeal and other proteins
homologous to the alpha, beta, and delta subunits of
eukaryotic initiation factor 2B (eIF-2B), a five-subunit
molecule that catalyzes GTP recycling for eIF-2. This
clade is now recognized to include the methionine salvage
pathway enzyme MtnA [Amino acid biosynthesis, Aspartate
family].
Length = 331
Score = 32.2 bits (74), Expect = 2.8
Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 8/67 (11%)
Query: 1245 KRAAESEKLGAELEEIIEALH-ARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGA 1303
+ A E E+ A LEE ++ L +R TAV NL ++++R E ++ A
Sbjct: 61 READEREEFKALLEEKLQYLVSSRPTAV-------NLSWALDRMRAALEAAKTVADIKEA 113
Query: 1304 ELEEIIE 1310
L E
Sbjct: 114 LLAEAER 120
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.8 bits (72), Expect = 3.3
Identities = 35/171 (20%), Positives = 64/171 (37%), Gaps = 33/171 (19%)
Query: 1226 KQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVE 1285
K+ L++ +E+L+++ E+ E E+L AE EE+ E RL L
Sbjct: 134 KEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQE-------------RLKRLEVENS 180
Query: 1286 KLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELAT 1345
+L + +K E L +E+ + E + + +L K+ L LA
Sbjct: 181 RLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS-------------DLVKETLNLAP 227
Query: 1346 KEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLG 1396
K+++ Y +K + + A E +A + L K
Sbjct: 228 KDIEGQGYIYAEDEKEVEILLGTVYIAAPSRE-------DAVEELEIIKEA 271
Score = 30.6 bits (69), Expect = 7.6
Identities = 13/60 (21%), Positives = 30/60 (50%)
Query: 911 EEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVE 970
+E + +++ LE E + +R++ ++L K+ +V +++ + ELE VE
Sbjct: 149 KEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVE 208
>gnl|CDD|181620 PRK09040, PRK09040, hypothetical protein; Provisional.
Length = 214
Score = 31.5 bits (72), Expect = 3.3
Identities = 17/38 (44%), Positives = 25/38 (65%), Gaps = 3/38 (7%)
Query: 1564 LIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQ 1601
LI VGV G+Q+EL++ L EAE+ + AQ RK ++
Sbjct: 31 LILVGVIGVQLELSSKL---EAEVKQRQAEAQRRKTLE 65
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels of
palindromic sequences in DNA. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1042
Score = 32.2 bits (73), Expect = 3.6
Identities = 56/310 (18%), Positives = 97/310 (31%), Gaps = 32/310 (10%)
Query: 1776 LDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRA 1835
L E + L + +QDL + LH + L +T H L
Sbjct: 603 LSEAEDMLACEQHALLRKLQPEQDLQDV-----RLHLQQCSQELALK-LTALHALQLTLT 656
Query: 1836 QKKIEAEKDIGDCTNLDDLEKYKALLQEVISH----NSDMETLNDKCEALMELAAHSPIR 1891
Q+++ + L + LQ++ S E L L EL H
Sbjct: 657 QERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETH---- 712
Query: 1892 EETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEA 1951
E + +S G S + + L + + KA + E
Sbjct: 713 IEEYDREFNEIENASSSLG--SDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEV 770
Query: 1952 GIKEEQKNIETQYLQWN--SFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKL 2009
Q E +L F + T L ++E I QEI +
Sbjct: 771 TA-ALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPS-------------DE 816
Query: 2010 KEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLT 2069
+ +CE V + E F+ + EE S L ++ + K E +L + Q ++ L
Sbjct: 817 DILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876
Query: 2070 EKVDMDVLLN 2079
+ ++ +
Sbjct: 877 KLNGINQIKI 886
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 32.1 bits (74), Expect = 3.6
Identities = 10/44 (22%), Positives = 27/44 (61%)
Query: 185 EDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSS 228
E+LV+ + K I ES + + +++++ +QE +++ +++L
Sbjct: 94 EELVEELKKEIPAAFESEEYEARKEEIEEEFQEKREEAFEELEE 137
Score = 31.7 bits (73), Expect = 4.7
Identities = 11/39 (28%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 185 EDLVDGILKTISQLEESL-DIFQQFQQLQKAYQEDQKQL 222
E+ + + + I +LEE L +I +Q ++L++ +E ++L
Sbjct: 174 EEEREELEEKIDELEEELQEILRQLRELEREAREKLREL 212
>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein.
Length = 554
Score = 32.0 bits (72), Expect = 3.8
Identities = 41/226 (18%), Positives = 81/226 (35%), Gaps = 34/226 (15%)
Query: 918 DKIDRL------EKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEG 971
D E ++ RI + L++ H++E +V+ + + +++LE +
Sbjct: 202 DDFAGKFRNSDIELRFADENFRIILEIMDLSEFQNHQQEGKVNQDEILETIEQLESEI-- 259
Query: 972 DVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVN----DSVS 1027
N+ + E AG N +I + +A E + N + D ++
Sbjct: 260 -FDINSFFA--------NFIEESAGVHNHIIED--LADEFASENAIALSAIGDLKDDFLA 308
Query: 1028 SMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDIS 1087
++ R L ED + +SV+R V L+ L + +D+S
Sbjct: 309 FLEDRIKNLIEMSEDSLK------ESVQRNIDFVNSGFQELEDFSIGLKEELGGLKKDLS 362
Query: 1088 EALTVYQKLLDEINSWKNSQSDEDLVHLA--DSIKPLDEVVERIEN 1131
E + +EI N + HL + + L + E+ N
Sbjct: 363 EQQNLEA---EEILQGFNDFLHDSKDHLKTDEILPELIDFKEKFLN 405
>gnl|CDD|163497 TIGR03785, marine_sort_HK, proteobacterial dedicated sortase system
histidine kinase. This histidine kinase protein is
paired with an adjacent response regulator (TIGR03787)
gene. It co-occurs with a variant sortase enzyme
(TIGR03784), usually in the same gene neighborhood, in
proteobacterial species most of which are marine, and
with an LPXTG motif-containing sortase target conserved
protein (TIGR03788). Sortases and LPXTG proteins are far
more common in Gram-positive bacteria, where sortase
systems mediate attachment to the cell wall or
cross-linking of pilin structures. We give this predicted
sensor histidine kinase the gene symbol psdS, for
Proteobacterial Dedicated Sortase system Sensor histidine
kinase.
Length = 703
Score = 32.0 bits (73), Expect = 3.9
Identities = 17/61 (27%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 1603 KISKLQSWLKQYID----YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRK 1658
+I +L + ID SG A +SR D++ ++ SF ++ +L+ + ++EN S +
Sbjct: 433 RIRRLSDDAEAAIDSQGRISGAIPASRSR-DEIGDLSRSFAQMVARLRQYTHYLENMSSR 491
Query: 1659 L 1659
L
Sbjct: 492 L 492
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 32.2 bits (74), Expect = 4.0
Identities = 36/215 (16%), Positives = 63/215 (29%), Gaps = 27/215 (12%)
Query: 1237 EKLRQQNEKRAAESEKLGAELEEIIEALHARET--------AVPSSSRLHNLRKTVEKLR 1288
+ ++ E RAA+ + A ++A A T A P +S + R+ +
Sbjct: 471 ARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQA 530
Query: 1289 QQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEV 1348
+ + + + + A+ AR A K +V+ K + A
Sbjct: 531 RARQAEKQAAAAADPKKAAVAAAI-ARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARA 589
Query: 1349 QPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQ 1408
+ KK +A + K+A A KAK Q E
Sbjct: 590 KA--------KKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPV 641
Query: 1409 TTKD----------KADKIKQITKSLPEGQKLIDK 1433
+ KA K Q + + K
Sbjct: 642 DPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPK 676
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This family
of bacterial proteins has no known function. The proteins
are in the region of 500-600 amino acid residues in
length.
Length = 450
Score = 31.8 bits (73), Expect = 4.2
Identities = 24/99 (24%), Positives = 45/99 (45%), Gaps = 17/99 (17%)
Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQNEK-RAAESEKLGAELEEIIEALHARETAVP 1272
D E+ E+L+ +++L L + +EK+++ +K +L E E +E
Sbjct: 328 DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLE---------- 377
Query: 1273 SSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEA 1311
L +T EKL ++ E+ E ++L ELE +
Sbjct: 378 ------KLLETKEKLSEELEELEEELKELKEELESLYSE 410
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts
with SMC proteins [Cell division and chromosome
partitioning].
Length = 622
Score = 31.9 bits (72), Expect = 4.3
Identities = 26/124 (20%), Positives = 56/124 (45%), Gaps = 10/124 (8%)
Query: 1540 KVLIDSFNKGIIGMTELES------EEVKDLIQVGVHGLQVELNAL---LQNIEAEINKV 1590
K+L D F + +LE +E+K + VH + ++ L N+ +I +
Sbjct: 231 KLLFDYFTESYKSFLKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEA 290
Query: 1591 ANAAQERKAMQDKISKLQSWLKQYIDY-SGNKQALQSRLDKVNEIQESFPEISTKLQTLS 1649
+Q+ K +++K L+S +Y +Y + KQ Q K+ +++ +++ L
Sbjct: 291 MKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQ 350
Query: 1650 DHIE 1653
+I+
Sbjct: 351 SNID 354
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains are
found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of helix
C. Although the domain occurs in ultiple repeats along
sequences, the domains are actually stable on their own
- ie they act, biophysically, like domains rather than
repeats that along function when aggregated.
Length = 105
Score = 30.0 bits (68), Expect = 4.4
Identities = 15/99 (15%), Positives = 40/99 (40%), Gaps = 6/99 (6%)
Query: 699 KWQQYVNDELEWENHSQNTLQWLDNIRS--DMKTLLERSKHIVDDIEKKKDDLKPIQKEA 756
++ + +D W + L D + ++ LL++ K + ++ +D ++ + + A
Sbjct: 5 QFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELA 64
Query: 757 QQLSEWD----SSIVPKVSDLETRWAAVKSAWDAKHSDL 791
++L I ++ +L RW + + L
Sbjct: 65 EKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKL 103
>gnl|CDD|133400 cd04773, HTH_TioE_rpt2, Second Helix-Turn-Helix DNA binding domain of
the regulatory protein TioE. Putative helix-turn-helix
(HTH) regulatory protein, TioE, and related proteins.
TioE is part of the thiocoraline gene cluster, which is
involved in the biosynthesis of the antitumor
thiocoraline from the marine actinomycete,
Micromonospora. These proteins share the N-terminal DNA
binding domain with other transcription regulators of the
MerR superfamily that promote transcription by
reconfiguring the spacer between the -35 and -10 promoter
elements. Proteins in this family are unique within the
MerR superfamily in that they are composed of just two
adjacent MerR-like N-terminal domains; this CD mainly
contains the C-terminal or second repeat (rpt2) of these
tandem MerR-like domain proteins.
Length = 108
Score = 30.0 bits (68), Expect = 4.6
Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 5/43 (11%)
Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1318
L + VE+LR +E L A LE+ AL R A
Sbjct: 59 LLEQIATVVEQLRHAGG-----TEALAAALEQRRVALTQRGRA 96
Score = 29.6 bits (67), Expect = 5.0
Identities = 15/54 (27%), Positives = 21/54 (38%), Gaps = 5/54 (9%)
Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1270
+ L+ L + VE+LR +E L A LE+ AL R A
Sbjct: 48 RLIHLLRRGGYLLEQIATVVEQLRHAGG-----TEALAAALEQRRVALTQRGRA 96
>gnl|CDD|222097 pfam13393, tRNA-synt_His, Histidyl-tRNA synthetase. This is a family
of class II aminoacyl-tRNA synthetase-like and ATP
phosphoribosyltransferase regulatory subunits.
Length = 308
Score = 31.4 bits (72), Expect = 5.4
Identities = 21/82 (25%), Positives = 29/82 (35%), Gaps = 3/82 (3%)
Query: 1241 QQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEK 1300
Q+ R A K ELEE++ L + L L E L + + S
Sbjct: 169 QEEALRDALQRKDLPELEELLAELGLDPALADALLALPELYGDPEVL-DEALELLPGSPA 227
Query: 1301 LGA--ELEEIIEALHARETAVK 1320
A ELE + L A V+
Sbjct: 228 AAALDELEALAALLEALGPGVR 249
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 31.7 bits (72), Expect = 5.6
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 18/83 (21%)
Query: 918 DKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDV-----ENVEAKLKELEVAVEGD 972
DK R +TK Q+D + +T QL KE V E VEAKLKEL+ A+
Sbjct: 558 DKEKRDAVDTKNQADSV--VYQTEKQL----KELGDKVPADVKEKVEAKLKELKDAIASG 611
Query: 973 VKTNNVLMLQELLNKYAQLNEEA 995
Q++ + A LN+E
Sbjct: 612 -------STQKMKDAMAALNQEV 627
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is part
of complex B but is not, however, required for the core
subunits to stay associated. This protein is known as
Huntington-interacting protein-1 in humans.
Length = 355
Score = 31.2 bits (71), Expect = 5.6
Identities = 17/79 (21%), Positives = 38/79 (48%), Gaps = 7/79 (8%)
Query: 1218 ITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA-------RETA 1270
+ QL+ Q+ R+T+ +++++ + + +L EL EI E L R +
Sbjct: 260 LNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELEQVKQEMEERGAS 319
Query: 1271 VPSSSRLHNLRKTVEKLRQ 1289
+ S L +++ + KL++
Sbjct: 320 MSDGSPLVKIKQAITKLKE 338
>gnl|CDD|217331 pfam03036, Perilipin, Perilipin family. The perilipin family
includes lipid droplet-associated protein (perilipin)
and adipose differentiation-related protein
(adipophilin).
Length = 390
Score = 31.3 bits (71), Expect = 5.6
Identities = 15/45 (33%), Positives = 22/45 (48%)
Query: 505 ETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVE 549
+D +E S R T +LQ +LV+SVQGL + V+
Sbjct: 290 NQEDDARPEQLESRTLSLTRSLTQQLQTTCLSLVSSVQGLPANVQ 334
Score = 31.3 bits (71), Expect = 5.6
Identities = 15/45 (33%), Positives = 22/45 (48%)
Query: 1872 ETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVE 1916
+D +E S R T +LQ +LV+SVQGL + V+
Sbjct: 290 NQEDDARPEQLESRTLSLTRSLTQQLQTTCLSLVSSVQGLPANVQ 334
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding protein
(GBP) is induced by interferon-gamma during macrophage
induction. This family contains GBP1 and GPB2, both
GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 31.1 bits (71), Expect = 5.7
Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 1927 AKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEK 1986
AK++ +EA KA A E + L KEE++ +E Q S++E + Q+ +++ +
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQE---RSYQEHVKQLIEKMEAERE 257
Query: 1987 NIKQEIATPWTTTQELRSKLLKLKEILKK 2015
+ E +L+ + LKE K
Sbjct: 258 KLLAEQE--RMLEHKLQEQEELLKEGFKT 284
>gnl|CDD|218632 pfam05549, Allexi_40kDa, Allexivirus 40kDa protein.
Length = 271
Score = 31.3 bits (71), Expect = 5.8
Identities = 27/119 (22%), Positives = 48/119 (40%), Gaps = 23/119 (19%)
Query: 706 DELEWENHS--QNTLQWLDNIRSD---------------MKTLLER----SKHIVDDIEK 744
DEL + H+ QN+L+WL +I + T L R + + ++
Sbjct: 37 DELYGQLHALHQNSLEWLTHISHNVDSIINKLNPVNLTSQGTPLSRLRDALRTLTRTVDS 96
Query: 745 KKDDLKPIQKEAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQN 803
+ I+ Q S S ++ K++ +ET A+ + D S L+S +A S
Sbjct: 97 IYSVEQKIELSTQTPS--SSKLLKKLTSIETSLEALHAKLDELTSSLTSTLAGPSVSPT 153
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have LPXTG
C-terminal anchoring motifs and a substantial number have
the KxYKxGKxW putative sorting signal at the N-terminus.
The tetracycline resistance plasmid pCF10 in Enterococcus
faecalis promotes conjugal plasmid transfer in response
to sex pheromones, but PgrA/Sec10 encoded by that
plasmid, a member of this family, specifically inhibits
the ability of cells to receive homologous plasmids. The
phenomenon is called surface exclusion.
Length = 356
Score = 31.2 bits (71), Expect = 5.9
Identities = 17/116 (14%), Positives = 38/116 (32%)
Query: 992 NEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKL 1051
N+ ++ + N S + + +S+ ++Q A + + +N
Sbjct: 222 NDGGVNIHFVNFNDSYIADGNKFDKTPIPNTPNSLKALQAALATAQADLAAAQTALNTAQ 281
Query: 1052 QSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQ 1107
++ Q A+A L A+ +LA + + L Q L +
Sbjct: 282 AALTTAQTAYAAAQAALATAQKELANAQAQALQTAQNNLATAQAALANAEARLAKA 337
>gnl|CDD|214880 smart00880, CHAD, The CHAD domain is an alpha-helical domain
functionally associated with some members of the
adenylate cyclase family. It has conserved histidines
that may chelate metals.
Length = 262
Score = 31.2 bits (71), Expect = 6.1
Identities = 20/94 (21%), Positives = 30/94 (31%), Gaps = 14/94 (14%)
Query: 1228 QLHNLRKTVEKLRQ-----QNEKRAAESEKLGAELEEIIEALH-ARETAVPSSSRLHNLR 1281
LH LR + +LR + + L AEL + L R+ V L
Sbjct: 24 ALHQLRVALRRLRSALRLFRPVLPREAAAALRAELRWLARELGPLRDLDV--------LL 75
Query: 1282 KTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1315
+ + +L + L A LE A
Sbjct: 76 ERLLELLAALLPELPALDALVAALEARRAAARRA 109
>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1. This entry is of a
family of proteins all approximately 300 residues in
length. The proteins have a single C-terminal
trans-membrane domain and a SNARE [soluble NSF
(N-ethylmaleimide-sensitive fusion protein) attachment
protein receptor] domain of approximately 60 residues.
The SNARE domains are essential for membrane fusion and
are conserved from yeasts to humans. Use1 is one of the
three protein subunits that make up the SNARE complex
and it is specifically required for Golgi-endoplasmic
reticulum retrograde transport.
Length = 251
Score = 30.9 bits (70), Expect = 6.6
Identities = 41/177 (23%), Positives = 69/177 (38%), Gaps = 25/177 (14%)
Query: 64 TDDWLTGLEKDLDTLKNTTDVEAKNSLLQKLLTEKDQAGHKLTYLTSSGEKLYLDTAAKG 123
+ D LT + +D LK + E +S +EK A KL S G T +
Sbjct: 51 SVDVLTEYRRRVDFLKGLLEAEKPSSP-----SEKALANQKL----SRGRA---PTVDEP 98
Query: 124 REVVRQQLRALRDSKCSKTTLQEVLSHKRIIESLLD--KSKSLPQINKDQALEKAIVSVN 181
R +++ S+ + +E+LS + D S IN D+A +I V
Sbjct: 99 RSPASKEIHQQTKSRYTSELRKELLSGSTSLRRSTDAKDRDSSGPINDDEASAASIDQVL 158
Query: 182 KRY----EDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTEREL 234
+ + E L + +L L+E Q Q+ +ED + L + L ++ L
Sbjct: 159 QYHQNLQEKLAEEMLLLARNLKE------QSLAAQQIIKEDTEIL-SSSAKLADQNL 208
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.1 bits (70), Expect = 6.6
Identities = 41/209 (19%), Positives = 79/209 (37%), Gaps = 24/209 (11%)
Query: 1209 DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1268
D + D + +Q + Q + +K ++ +++ E+ A E + A +E ++ L
Sbjct: 51 DAVMVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKER 110
Query: 1269 TAVPSSSRLHNLRKTVEKLRQQNEKRAAESE-KLGAELEEIIEALHARETAVKTLPVLLL 1327
+ + EK Q +K+ E K AE ++ EA A+ A
Sbjct: 111 LKAQEQQK---QAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA--------- 158
Query: 1328 EVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAY 1387
E + + A K+ + + E K A K + +A A ++A EA
Sbjct: 159 EAAKLKAAAE------AKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEA- 211
Query: 1388 DWLRKAKLGAQANADCHGEQQTTKDKADK 1416
+AK A+ A+ E++ +K
Sbjct: 212 ----EAKAKAEKKAEAAAEEKAAAEKKKA 236
>gnl|CDD|224702 COG1788, AtoD, Acyl CoA:acetate/3-ketoacid CoA transferase, alpha
subunit [Lipid metabolism].
Length = 220
Score = 30.7 bits (70), Expect = 6.7
Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 1/32 (3%)
Query: 690 VFTEIDLNTK-WQQYVNDELEWENHSQNTLQW 720
+ + I N + +Q + ELE E Q TL
Sbjct: 74 IASYIGSNPEFGRQMLAGELEVELVPQGTLAE 105
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 31.1 bits (71), Expect = 6.8
Identities = 27/119 (22%), Positives = 52/119 (43%), Gaps = 9/119 (7%)
Query: 1751 SVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDA------TSHQQDLLQLK 1804
A LL L +++L L+ LD+KK +L+ Q + QQ L +L
Sbjct: 262 DSAELLQQLDQLQRRL--HRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDELA 319
Query: 1805 DKIE-SLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQ 1862
++ +L KQ+L +T R ++R Q++++ + D L++ + L+
Sbjct: 320 IRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLE 378
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 31.2 bits (70), Expect = 7.1
Identities = 52/238 (21%), Positives = 98/238 (41%), Gaps = 31/238 (13%)
Query: 1888 SPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEF--LAKKEEVEAWLHKAHATVQE 1945
S ++ +R LY+ L+ +S + NLS + F LAK ++ L+ ++
Sbjct: 58 SNLQATEIRADGLYSQLLG-----LSASQANLSKELNFSLLAKDAIMQMLLN----ARRD 108
Query: 1946 CDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSK 2005
D + A ++ Q ++ Y+ N F + A + S EK + + + L
Sbjct: 109 LDRINASFRQCQGDLII-YINNNRF------IAAIILS-EKQCQDQFKDMNKSCDAL--- 157
Query: 2006 LLKLKEILKKCEKGVGDHEAFVDKYEEC---------SQKLAQVEAKYAKLIEPETSYEE 2056
L KL E +K E + +A K +E Q A +A+ + E + + E
Sbjct: 158 LFKLGEKVKTLEMEIAKEKAICSKDKESLLAGKRQAEEQLEACGKARELQHQEQQLAEEN 217
Query: 2057 LQKRQGQLTALLTEKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEAL 2114
LQK Q L +K + D+L + + ++ H PL ++ +++A + L
Sbjct: 218 LQKVQALCIPLDKDKFEADLLNAWRDSIIPRSLDNLGYNLHHPLGPELASIRRACDHL 275
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 31.4 bits (71), Expect = 7.2
Identities = 37/186 (19%), Positives = 63/186 (33%), Gaps = 29/186 (15%)
Query: 1223 QSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPS-----SSRL 1277
R Q++ L + +++ R + ++ E LE A HAR + S +
Sbjct: 177 SGRNPQINQLLEKLKQERNEIDEAEKEYATYHKLLESR-RAEHARLAELRSELRADRDHI 235
Query: 1278 HNLRKTVEKLRQQNEKRAAESE------------KLGA-ELEEIIEALHARETAVKTLPV 1324
LR VE + E + E E + G LE+ L E + L V
Sbjct: 236 RALRDAVELWPRLQEWKQLEQELTRRREELATFPRDGVLRLEKREAHLQKTEAEIDALLV 295
Query: 1325 ----------LLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQAN 1374
L+ ++ + + ++ + ETL A+ DK E N
Sbjct: 296 RLAELKDLASQLIPAKEAVLQALVRLHQQLSEIKASAFELTETLAGIEADLRDKEEAAGN 355
Query: 1375 EHEAYK 1380
EA +
Sbjct: 356 GFEAER 361
>gnl|CDD|185746 cd08915, V_Alix_like, Protein-interacting V-domain of mammalian Alix
and related domains. This superfamily contains the
V-shaped (V) domain of mammalian Alix (apoptosis-linked
gene-2 interacting protein X), His-Domain type N23
protein tyrosine phosphatase (HD-PTP, also known as
PTPN23), Bro1 and Rim20 (also known as PalA) from
Saccharomyces cerevisiae, and related domains. Alix,
HD-PTP, Bro1, and Rim20 all interact with the ESCRT
(Endosomal Sorting Complexes Required for Transport)
system. Alix, also known as apoptosis-linked gene-2
interacting protein 1 (AIP1), participates in membrane
remodeling processes during the budding of enveloped
viruses, vesicle budding inside late endosomal
multivesicular bodies (MVBs), and the abscission
reactions of mammalian cell division. It also functions
in apoptosis. HD-PTP functions in cell migration and
endosomal trafficking, Bro1 in endosomal trafficking, and
Rim20 in the response to the external pH via the Rim101
pathway. The Alix V-domain contains a binding site,
partially conserved in this superfamily, for the
retroviral late assembly (L) domain YPXnL motif. The Alix
V-domain is also a dimerization domain. Members of this
superfamily have an N-terminal Bro1-like domain, which
binds components of the ESCRT-III complex. The Bro1-like
domains of Alix and HD-PTP can also bind human
immunodeficiency virus type 1 (HIV-1) nucleocapsid. Many
members, including Alix, HD-PTP, and Bro1, also have a
proline-rich region (PRR), which binds multiple partners
in Alix, including Tsg101 (tumor susceptibility gene 101,
a component of ESCRT-1) and the apoptotic protein ALG-2.
The C-terminal portion (V-domain and PRR) of Bro1
interacts with Doa4, a ubiquitin thiolesterase needed to
remove ubiquitin from MVB cargoes; it interacts with a
YPxL motif in Doa4s catalytic domain to stimulate its
deubiquitination activity. Rim20 may bind the ESCRT-III
subunit Snf7, bringing the protease Rim13 (a
YPxL-containing transcription factor) into proximity with
Rim101, and promoting the proteolytic activation of
Rim101. HD-PTP is encoded by the PTPN23 gene, a tumor
suppressor gene candidate often absent in human kidney,
breast, lung, and cervical tumors. HD-PTP has a
C-terminal catalytically inactive tyrosine phosphatase
domain.
Length = 342
Score = 31.2 bits (71), Expect = 7.3
Identities = 34/151 (22%), Positives = 57/151 (37%), Gaps = 8/151 (5%)
Query: 1227 QQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN-LRKTVE 1285
++L LR+ VE+L Q+ E+ E +L L R PSS L + V
Sbjct: 80 KELSKLRQNVEELLQECEELLEEEAAEDDQLRAKFGTLRWR---RPSSDEAAKELYEKVT 136
Query: 1286 KLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELAT 1345
KLR E+ + ++ E I L L + S LD + E+
Sbjct: 137 KLRGYLEQASNSDNEVLQCYESIDPNL---VLLCGGYKELKAFIPSPYPALDPEVSEVV- 192
Query: 1346 KEVQPYVQRYETLKKNIAEAIDKFETQANEH 1376
++P + L+K I + E ++ +
Sbjct: 193 SSLRPLLNEVSELEKERERFISELEIKSRNN 223
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal element
functions, Plasmid functions].
Length = 472
Score = 31.2 bits (71), Expect = 7.3
Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 10/100 (10%)
Query: 1228 QLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKL 1287
++ LRK + KL +NE AE+E+L + I + + + + + + K
Sbjct: 67 EVKELRKRLAKLISENEALKAENERLQKREQSIDQQI---------QQAVQSETQELTKE 117
Query: 1288 RQQNEKRAAESEKLGAELEEIIEALHARETAVKT-LPVLL 1326
+Q + + + L +L+ + + + + LPV L
Sbjct: 118 IEQLKSERQQLQGLIDQLQRRLAGVLTGPSGGGSDLPVGL 157
>gnl|CDD|135648 PRK05898, dnaE, DNA polymerase III DnaE; Validated.
Length = 971
Score = 31.4 bits (71), Expect = 7.8
Identities = 22/97 (22%), Positives = 45/97 (46%), Gaps = 14/97 (14%)
Query: 413 PIGKENINQEINQITLDWTNLQNTLQVIDKH---------HAKCLSIWNDFLSSKNTLEK 463
PI +N+N+ +N++ ++ +L + DK C+S N L++ + K
Sbjct: 192 PIQLDNLNKVLNELKVEIHDLPINIIKYDKQNSIISSEILKQLCISGLNKRLNANDGQVK 251
Query: 464 WIEGFQKKIEAEKDIGDCTNLDDLEKYKALLQEVISH 500
I + K+++ E DI + DD Y ++ + I+
Sbjct: 252 KI--YVKRLKYELDIINEKQFDD---YFLIVYDFINF 283
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in exopolysaccharide
biosynthesis [Cell envelope biogenesis, outer membrane].
Length = 458
Score = 30.9 bits (70), Expect = 8.2
Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 29/227 (12%)
Query: 1143 LREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDK---YDKIIADIQEYEAILATA 1199
LR + Q+ F + G + Q+ A + + EA LA+
Sbjct: 207 LRARLQEAEAQVEDFRAQHGL-TDAARGQLLSEQQLSALNTQLQSARARLAQAEARLASL 265
Query: 1200 SDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEE 1259
+ + E + +Q +QQ +R+ + L + + + L A+L E
Sbjct: 266 LQLLPLGREAAALREVLE-SPTIQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAE 324
Query: 1260 IIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAV 1319
+ A E ++ A E+ A LE+ + L R + +
Sbjct: 325 -LRQQIAAELR---------------QILASLPNELALLEQQEAALEKELAQLKGRLSKL 368
Query: 1320 KTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAI 1366
L V L E+ + E + E +QRY+ L A I
Sbjct: 369 PKLQVQLRELER-EAEAARSLYET-------LLQRYQELSIQEASPI 407
>gnl|CDD|203573 pfam07071, DUF1341, Protein of unknown function (DUF1341). This
family consists of several hypothetical bacterial
proteins of around 220 residues in length. The function
of this family is unknown.
Length = 218
Score = 30.4 bits (69), Expect = 8.2
Identities = 17/43 (39%), Positives = 20/43 (46%), Gaps = 2/43 (4%)
Query: 1421 TKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLD 1463
K L E + K A LE TG I +N +EI QI LD
Sbjct: 160 LKHLDEYAAVA-KACAEHGFYLEPTGGIDLDNF-EEIVQIALD 200
>gnl|CDD|183293 PRK11730, fadB, multifunctional fatty acid oxidation complex
subunit alpha; Reviewed.
Length = 715
Score = 31.0 bits (71), Expect = 8.2
Identities = 25/80 (31%), Positives = 34/80 (42%), Gaps = 26/80 (32%)
Query: 503 DMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNL----SDQTEF 558
D TL EAL L A S ++ L+TS K+ +D TEF
Sbjct: 32 DRATLASLGEALDALEAQSDLK----------GLLLTS--------AKDAFIVGADITEF 73
Query: 559 L----AKKEEVEAWLHKAHA 574
L A +EE+ WLH A++
Sbjct: 74 LSLFAAPEEELSQWLHFANS 93
Score = 31.0 bits (71), Expect = 8.2
Identities = 25/80 (31%), Positives = 34/80 (42%), Gaps = 26/80 (32%)
Query: 1870 DMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNL----SDQTEF 1925
D TL EAL L A S ++ L+TS K+ +D TEF
Sbjct: 32 DRATLASLGEALDALEAQSDLK----------GLLLTS--------AKDAFIVGADITEF 73
Query: 1926 L----AKKEEVEAWLHKAHA 1941
L A +EE+ WLH A++
Sbjct: 74 LSLFAAPEEELSQWLHFANS 93
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile protein
found in all eukaryote cell types. This family consists
of the coiled-coil myosin heavy chain tail region. The
coiled-coil is composed of the tail from two molecules of
myosin. These can then assemble into the macromolecular
thick filament. The coiled-coil region provides the
structural backbone the thick filament.
Length = 859
Score = 31.2 bits (71), Expect = 8.3
Identities = 84/494 (17%), Positives = 188/494 (38%), Gaps = 113/494 (22%)
Query: 916 IKDKIDRLEK-ETKAQSDRIQ------DALKTLNQLIGHRKEFEVDVENVEAKLKELEVA 968
+ ++I++L+K + KA+ ++ Q D L L+Q+ + E + +E++L EL+V
Sbjct: 136 LSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVK 195
Query: 969 VEGDVKTNNVLMLQELLNKYAQLNEEAGEL------------------NMMISNVSIATE 1010
++ + L +L ++ ++L E +L + + S + A
Sbjct: 196 LDELQRQ-----LNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKR 250
Query: 1011 HMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQK 1070
+ +R + + ++ L +E+ L+ ++ +A A +Q+
Sbjct: 251 SLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAE-------LERQLSKANAEIQQ 303
Query: 1071 AESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE 1130
S+ + +AE++ E + + E L+E E
Sbjct: 304 WRSKFESEGALRAEELEELKKKLNQKISE----------------------LEEAAEAAN 341
Query: 1131 --NHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIAD 1188
++ K KS ++ + + + + ++ +++K +DKI+A+
Sbjct: 342 AKCDSLEKTKS-------------RLQSELEDLQIELERANAAASELEKKQKNFDKILAE 388
Query: 1189 IQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAA 1248
+ D ++ + + ++ +L L+ +E+L+ Q E
Sbjct: 389 WK---------------RKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALRR 433
Query: 1249 ESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI 1308
E++ L E+ ++ + L +H L K +L AE ++L A LEE
Sbjct: 434 ENKNLQDEIHDLTDQLGE------GGRNVHELEKARRRL-------EAEKDELQAALEEA 480
Query: 1309 IEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDK 1368
AL E+ V +E++ + E++++ LA KE + +E +KN AI+
Sbjct: 481 EAALELEESKVLR---AQVELSQIRSEIERR---LAEKE-----EEFENTRKNHQRAIES 529
Query: 1369 FETQANEHEAYKQA 1382
+ K
Sbjct: 530 LQATLEAEAKGKAE 543
>gnl|CDD|189016 cd09609, M3B_PepF_4, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase
B; EC 3.4.24.-) is mostly bacterial and includes
oligoendopeptidase F from Lactococcus lactis. This enzyme
hydrolyzes peptides containing between 7 and 17 amino
acids with fairly broad specificity. The PepF gene is
duplicated in L. lactis on the plasmid that bears it,
while a shortened second copy is found in Bacillus
subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis
has been identified in Bacillus subtilis as an inhibitor
of sporulation initiation when over expressed from a
multicopy plasmid.
Length = 586
Score = 31.0 bits (71), Expect = 8.4
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 16/124 (12%)
Query: 1215 RNEITEQLQSRKQQLHNLRKT-------VEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1267
R+E+ +L L +L KT ++KL++ + + + + E I+ AL
Sbjct: 1 RSEVPVEL---TWDLTDLFKTEEAFEAALKKLKELAKDFKKKYKGKLTDAETILSALDDY 57
Query: 1268 ETAVPSSSRLHN---LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPV 1324
E + SRL + L ++ + +N+ RAA K + +I L E+ + L
Sbjct: 58 EEILELISRLSHYASLPQSTDGTDPENQARAA---KFDSLAAKINAKLSFFESELLALDE 114
Query: 1325 LLLE 1328
+LE
Sbjct: 115 KILE 118
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.6 bits (69), Expect = 8.6
Identities = 29/134 (21%), Positives = 56/134 (41%), Gaps = 13/134 (9%)
Query: 185 EDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTERELKGKLSSWCGY 244
+ L++ + + E+ D ++Q+ ++ + S L E + +
Sbjct: 28 DILLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKE--------- 78
Query: 245 EESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQ-DATSHQQDLLQLKDKI 303
EE + L+ L+ + L E+ L E+K QL+ Y + ++ LQL+D +
Sbjct: 79 EERLLDELEELEKEDDDLDGELV---ELQEEKEQLENEELQYLREYNLFDRNNLQLEDNL 135
Query: 304 ESLHQPSEQSKQQL 317
+SL E S QL
Sbjct: 136 QSLELQYEYSLNQL 149
>gnl|CDD|222679 pfam14322, SusD-like_3, Starch-binding associating with outer
membrane. SusD is a secreted starch-binding protein with
an N-terminal lipid tail that allows it to associate with
the outer membrane.
Length = 190
Score = 30.0 bits (68), Expect = 8.6
Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 1161 TTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGD 1204
TTA E + ATVQE YD+II D++E +L
Sbjct: 131 TTATEPEQNLPRATVQEV---YDQIIKDLEEAIPLLPDEIPSVP 171
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 30.8 bits (70), Expect = 9.2
Identities = 60/291 (20%), Positives = 117/291 (40%), Gaps = 21/291 (7%)
Query: 1042 DRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLAR--PVSTQAEDISEALTVYQKLLDE 1099
D I + L +V R + K E ++L + E+ A S + ED+++ + + L+E
Sbjct: 182 DLIDRLAGDLTNVLRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEE 241
Query: 1100 INSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFI- 1158
S + L E E++E + R Q L + +
Sbjct: 242 AQRSLESLEKK----FRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297
Query: 1159 -------TETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDK--GDQLSSD 1209
T+ + E + QE++++ DK + + A+ A + + D
Sbjct: 298 IPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKEIAAELAEID 357
Query: 1210 GTISDRNEITEQL-QSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1268
+ +EI +L S QL L + V++ Q+ K E ELEE + + +
Sbjct: 358 KPATTDSEIPHRLSGSELTQLEVLIQQVKR-ELQDAKSQLLKEL--RELEEELAEVDKKI 414
Query: 1269 TAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HARETA 1318
+ +PS ++ L + + + + + + AE E+L +LE + EA+ R+T
Sbjct: 415 STIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTL 465
Score = 30.8 bits (70), Expect = 9.5
Identities = 53/294 (18%), Positives = 96/294 (32%), Gaps = 48/294 (16%)
Query: 1759 LKVTEKKLGDEMELKTTLDE-------KKAQLQVYRTIYQDATSHQQDLLQLKDKIESLH 1811
L ++ + L E K Q + Y + Q+ + +L + + +ESL
Sbjct: 191 LTNVLRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLE 250
Query: 1812 QP-----------SEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNL--------- 1851
+ EQ ++QL I + ++ + NL
Sbjct: 251 KKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQ 310
Query: 1852 -DDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQG 1910
++ + L QE + E + E+L +LA + ++ + A L +
Sbjct: 311 KEEQSQQNQLTQEELE-----ERDKELLESLPKLALPAEH------VKEIAAELAEIDKP 359
Query: 1911 LVSQVEKNL----SDQTEFLAKKEEVEAWLHKAHATV-QECDELEAGIKEEQKNIET--Q 1963
+ E S+ T+ ++V+ L A + + +E ELE + E K I T
Sbjct: 360 ATTDSEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPS 419
Query: 1964 YLQWNSFRETLAQMTAWLDSVEKNIKQ--EIATPWTTTQELRSKLLKLKEILKK 2015
Q E L + L E I++ E K L K K
Sbjct: 420 EEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKI 473
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number of
Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 30.8 bits (70), Expect = 9.3
Identities = 18/79 (22%), Positives = 42/79 (53%), Gaps = 6/79 (7%)
Query: 2000 QELRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLI---EPETS-YE 2055
Q+L+ +L + + + E V + A +++ E + + +VEA+ +L E S YE
Sbjct: 313 QQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRDYEVNKSNYE 372
Query: 2056 ELQKRQGQLTALLTEKVDM 2074
+L R+ A +++++++
Sbjct: 373 QLLTRRES--AEVSKQMEV 389
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.127 0.349
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 106,788,951
Number of extensions: 10696749
Number of successful extensions: 15350
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13895
Number of HSP's successfully gapped: 1006
Length of query: 2174
Length of database: 10,937,602
Length adjustment: 112
Effective length of query: 2062
Effective length of database: 5,969,954
Effective search space: 12310045148
Effective search space used: 12310045148
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 67 (29.7 bits)