RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8596
(237 letters)
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 59.3 bits (144), Expect = 3e-10
Identities = 56/231 (24%), Positives = 105/231 (45%), Gaps = 19/231 (8%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYES 63
E ++SLE +I +KE+++ E+ + + E E + ER+I E + +
Sbjct: 298 ELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTE 357
Query: 64 EVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
E EL K EL+ E+ ++ R L + E+LK E+ +
Sbjct: 358 EYAEL-----KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKR------ 406
Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
ELDRLQE+ + +L A + ++A + E++ +++K + ++ +L
Sbjct: 407 -----ELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLA 461
Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQAT 234
A L + E+ +KEEY+R + LS+L E +A + S+E + +A
Sbjct: 462 ADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAV 512
Score = 44.7 bits (106), Expect = 2e-05
Identities = 43/222 (19%), Positives = 95/222 (42%), Gaps = 11/222 (4%)
Query: 19 IGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEEL-----ELGKS 73
I +K +Q+ +LR +R++A++ + ++ E E L E+ + +L
Sbjct: 193 IDEKRQQLERLRREREKAERYQALLKEKREY---EGYELLKEKEALERQKEAIERQLASL 249
Query: 74 KAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRL 133
+ EL+K E+ E Q L L N+++K E Q AE+ L
Sbjct: 250 EEELEKLTEEISELEKRLEEIEQLLEEL---NKKIKDLGEEEQLRVKEKIGELEAEIASL 306
Query: 134 QEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 193
+ + +L A+ L ++A+ +++ +E + ++ ++E+ + +L + + E+
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
Query: 194 ESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
E ++ E E + E +++K K E+ + L
Sbjct: 367 EDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKREL 408
Score = 42.4 bits (100), Expect = 1e-04
Identities = 45/230 (19%), Positives = 106/230 (46%), Gaps = 14/230 (6%)
Query: 11 ALSSLEEAIGDKEKQMNQLREQRDRADQ---ERQEERDLHERDIAEYKLKLHAYESEVEE 67
LS IG+ EK++ QL ++ ++ + E +E+ E++I K +L E+ +EE
Sbjct: 710 ELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEE 769
Query: 68 LELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTS 127
LE K E E S + + LS+LE E R++ L + R TL
Sbjct: 770 LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKE 829
Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 187
+QE ++ + +++ ++ + E + +++ ++E++E+ + + L+++L
Sbjct: 830 YLEKEIQELQEQ----RIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLG 885
Query: 188 NTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
+ + E + ++ + + + ++ +++K ++ L +A L
Sbjct: 886 DLKKERDELEAQLRELE-------RKIEELEAQIEKKRKRLSELKAKLEA 928
Score = 41.2 bits (97), Expect = 3e-04
Identities = 55/239 (23%), Positives = 105/239 (43%), Gaps = 27/239 (11%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEE----RDLHERDIAEYKLKLHAYESE 64
E L+SLEE + +++++L ++ + +Q +E +DL E + K K+ E+E
Sbjct: 243 ERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAE 302
Query: 65 VEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTL 124
+ LE + + E+ + + +L E E L+ E+E +
Sbjct: 303 IASLE-----RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER-------- 349
Query: 125 STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEK-------SQG 177
D+L E+Y + +L +AEL V + R E K +EK+EK +
Sbjct: 350 ---KRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKR 406
Query: 178 EVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLG 236
E+ RL+ +L+ E+ + + ++ L E++ LE+ K + +LE+ A L
Sbjct: 407 ELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465
Score = 38.9 bits (91), Expect = 0.002
Identities = 41/205 (20%), Positives = 93/205 (45%), Gaps = 14/205 (6%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQERQE---ERDLHERDIAEYKLKLHAYESEV 65
E L+E + + E+ ++ L ++ + E +E + E D+ + + L+ E+ +
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788
Query: 66 EELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLS 125
+ + +AEL K EV R A Q+L+RL LE E L+ E++ Q
Sbjct: 789 SHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQ--------- 839
Query: 126 TSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 185
+ L+E+ ++ + ++ + E + + + ++ ++ + E L+A+
Sbjct: 840 --EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQ 897
Query: 186 LENTQGEMESMKEEYERTQGTLSRL 210
L + ++E ++ + E+ + LS L
Sbjct: 898 LRELERKIEELEAQIEKKRKRLSEL 922
Score = 34.3 bits (79), Expect = 0.054
Identities = 40/208 (19%), Positives = 85/208 (40%), Gaps = 8/208 (3%)
Query: 30 REQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGRSGA 89
+E +R +EE + +I+E + +L E +EEL K K ++ V
Sbjct: 239 KEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNK-KIKDLGEEEQLRVKEKIG 297
Query: 90 DWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQA 149
+ EA L R E ER + E AE+D+L + ++ ++ +
Sbjct: 298 ELEAEIASLERSIAEKERELEDAEERLA-------KLEAEIDKLLAEIEELEREIEEERK 350
Query: 150 ELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSR 209
+ + ++ E + ++ ++E+ E + +L++ + ++E +K E + L R
Sbjct: 351 RRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDR 410
Query: 210 LLSERDKAVLEMDKSKEELERSQATLGE 237
L E + E+ + +A + E
Sbjct: 411 LQEELQRLSEELADLNAAIAGIEAKINE 438
Score = 33.1 bits (76), Expect = 0.14
Identities = 41/180 (22%), Positives = 81/180 (45%), Gaps = 15/180 (8%)
Query: 69 ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELEN---ERLKHELERSQTTFGRTTLS 125
+ K+ EL++ + R + RQ+L RL E ER + L+ + G L
Sbjct: 171 KKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLK 230
Query: 126 TSAELDRLQEKYDKTCADLRRAQAELRV-VQADNERVRSEEKTMQEKVEK----SQGEVY 180
L+R +E ++ A L +L + +R+ E+ ++E +K + E
Sbjct: 231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQL 290
Query: 181 RLKAKLENTQGEMESM-------KEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
R+K K+ + E+ S+ + E E + L++L +E DK + E+++ + E+E +
Sbjct: 291 RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERK 350
Score = 31.6 bits (72), Expect = 0.38
Identities = 41/192 (21%), Positives = 86/192 (44%), Gaps = 13/192 (6%)
Query: 14 SLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEEL--ELG 71
L E + ++++ LR + + D+E E RD +Y+ KL + E+ EL EL
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDEL----KDYREKLEKLKREINELKRELD 409
Query: 72 KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELD 131
+ + EL + + E+ A +++ LE E E E+++ + +A+L
Sbjct: 410 RLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE----QLAADLS 465
Query: 132 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 191
+ +++ + R + EL +Q + ++ + +E+V + LKA ++ G
Sbjct: 466 KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHG 525
Query: 192 ---EMESMKEEY 200
++ S+ E Y
Sbjct: 526 TVAQLGSVGERY 537
Score = 28.9 bits (65), Expect = 3.3
Identities = 49/251 (19%), Positives = 99/251 (39%), Gaps = 23/251 (9%)
Query: 10 GALSSLEEAIGDKEKQMNQLREQRDRADQ-ERQEERDLHERDIAEYKLKLHAYESEVEEL 68
G + I + +L+ R+R + +R+ E E +L + E
Sbjct: 657 GGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR 716
Query: 69 ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSA 128
++G+ + E+++ E + E + LS LE E E +K EL+ +
Sbjct: 717 KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHK 776
Query: 129 ----------------------ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEK 166
EL +L+E+ + A LR + +L + + E + E +
Sbjct: 777 LEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ 836
Query: 167 TMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKE 226
+QE+ + ++ ++ ++EN G+ E ++EE E + L L S E D+ +
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA 896
Query: 227 ELERSQATLGE 237
+L + + E
Sbjct: 897 QLRELERKIEE 907
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 53.9 bits (130), Expect = 2e-08
Identities = 51/239 (21%), Positives = 103/239 (43%), Gaps = 18/239 (7%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQE---ERDLHERDIAEYKLKLHAYES 63
+S+L + + E ++ QL E+ + +E E E + E + E + +L E+
Sbjct: 723 ELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
Query: 64 EVEELE---------LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELER 114
E+EELE L + LD+ E+ + R+RL LE + LE
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
Query: 115 SQTTFGRTTL---STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEK 171
+ + S +AE++ L+E ++ ++L E ++ +RSE + + E+
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902
Query: 172 VEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRL---LSERDKAVLEMDKSKEE 227
+ + + + L+ +LE + ++ ++ E + + L LSE LE ++ E
Sbjct: 903 LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALEN 961
Score = 48.9 bits (117), Expect = 9e-07
Identities = 59/219 (26%), Positives = 111/219 (50%), Gaps = 11/219 (5%)
Query: 24 KQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYESEVEELE--LGKSKAELD 78
++ +LRE+ + +E +EE + ++ E + KL EV ELE + + + EL
Sbjct: 232 LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291
Query: 79 KAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYD 138
ANE+ R + R+RL+ LE + E L+ +LE + + EL L+EK +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELE----SKLDELAEELAELEEKLE 347
Query: 139 KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKE 198
+ +L +AEL ++A+ E + S + ++E++E + +V +L+ ++ + E+E ++
Sbjct: 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA 407
Query: 199 EYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
ER + RL E ++ L + EL+ QA L E
Sbjct: 408 RLERLEDRRERLQQEIEE--LLKKLEEAELKELQAELEE 444
Score = 45.4 bits (108), Expect = 1e-05
Identities = 56/236 (23%), Positives = 106/236 (44%), Gaps = 16/236 (6%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRA-------DQERQEERDLHERDIAEYKLKLH 59
+ L LE+ + + E+Q+ L Q ++A + R+ E L + E + +L
Sbjct: 183 RTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELE 242
Query: 60 AYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTF 119
+ E++E E EL++ E+ E R +S LE E E L+ EL
Sbjct: 243 ELQEELKEAEE-----ELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEI 297
Query: 120 GRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEV 179
R + L+E+ L +A+L +++ + + E ++EK+E+ + E+
Sbjct: 298 SRLE----QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 180 YRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
L+A+LE + E+E ++ E + L L S+ + L++ E+ER +A L
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL 409
Score = 43.9 bits (104), Expect = 4e-05
Identities = 44/227 (19%), Positives = 96/227 (42%), Gaps = 6/227 (2%)
Query: 11 ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
AL+ L + + + E+++ QLR++ + ++ R R E +++ EL
Sbjct: 699 ALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR--LEAEVEQLEERIAQLSKEL 756
Query: 71 GKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAEL 130
+ +AE+++ + + + A + LE + E+LK EL+ AEL
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK----ALREALDELRAEL 812
Query: 131 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 190
L E+ L + + + E + + + + E +E E+ L+ +E +
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELE 872
Query: 191 GEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
E+E++ E + L+ L SE ++ E+ + + + + L E
Sbjct: 873 SELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEE 919
Score = 43.9 bits (104), Expect = 4e-05
Identities = 53/226 (23%), Positives = 87/226 (38%), Gaps = 12/226 (5%)
Query: 15 LEEAIGDKEKQMNQLREQRDRADQERQEERDL---HERDIAEYKLKLHAYESEVEELELG 71
LEE I + EK + +LR++ + ++E ++ R R I+ + L E+EVE+LE
Sbjct: 689 LEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEER 748
Query: 72 KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELD 131
++ + E + E E E ELE + LD
Sbjct: 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE--ELEAQIEQLKEELKALREALD 806
Query: 132 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 191
L+ + L A LR ER + + E +E+ L +E+
Sbjct: 807 ELRAELT----LLNEEAANLRERLESLERRIAATERRLEDLEEQIE---ELSEDIESLAA 859
Query: 192 EMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
E+E ++E E + L LL+ER + + ELE L E
Sbjct: 860 EIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905
Score = 41.2 bits (97), Expect = 3e-04
Identities = 51/217 (23%), Positives = 99/217 (45%), Gaps = 20/217 (9%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQ---ERQEERDLHERDIAEYKLKLHAYESEV 65
+ L +L I E+Q LRE+ ++ E + + + E + E +L E ++
Sbjct: 287 QKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL 346
Query: 66 EEL--ELGKSKAELDKAANEVGRSGADWEAARQRLSRL-----ELEN---------ERLK 109
EEL EL +AEL++ E+ + E ++L L +LE ERL+
Sbjct: 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLE 406
Query: 110 HELERSQTTFGRTTLSTSAELDRLQE-KYDKTCADLRRAQAELRVVQADNERVRSEEKTM 168
LER + R L +L+E + + A+L + EL +Q + ER+ + +
Sbjct: 407 ARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEEL 466
Query: 169 QEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
+E++E+++ + + +L Q ++S++ E +G
Sbjct: 467 REELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
Score = 41.2 bits (97), Expect = 4e-04
Identities = 44/218 (20%), Positives = 91/218 (41%), Gaps = 11/218 (5%)
Query: 20 GDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDK 79
G + + L +R R +E +E+ + E IAE + L E+EELE EL++
Sbjct: 664 GSAKTNSSIL--ERRREIEELEEKIEELEEKIAELEKALAELRKELEELE-----EELEQ 716
Query: 80 AANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDK 139
E+ A R+ L+RLE E E+L+ + + +RL+E ++
Sbjct: 717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEE 776
Query: 140 TCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
+ ++ E +++ + + E + E+ L + N + +ES++
Sbjct: 777 LAEAEAEIEELEAQIEQLKEELKALREALDEL----RAELTLLNEEAANLRERLESLERR 832
Query: 200 YERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
T+ L L + ++ +++ E+E + + E
Sbjct: 833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
Score = 32.7 bits (75), Expect = 0.16
Identities = 29/151 (19%), Positives = 63/151 (41%), Gaps = 3/151 (1%)
Query: 17 EAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAE 76
E++ + +++ E+ + +E +E+ + +A+ +L++ + +E+E LE + E
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413
Query: 77 LDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT---LSTSAELDRL 133
+ + + L L+ E E L+ ELE Q R EL+
Sbjct: 414 DRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEA 473
Query: 134 QEKYDKTCADLRRAQAELRVVQADNERVRSE 164
++ D +L + QA L ++ E +
Sbjct: 474 EQALDAAERELAQLQARLDSLERLQENLEGF 504
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 47.4 bits (113), Expect = 3e-06
Identities = 51/229 (22%), Positives = 97/229 (42%), Gaps = 5/229 (2%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRAD--QERQEERDLHERDIAEYKLKLHAYESEVEELE 69
L LE+ + + EKQ+ +L Q ++A+ QE + E E + KLK E E E E
Sbjct: 188 LERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEE 247
Query: 70 LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTS-- 127
L + + EL++ E+ + + E + L L E E L+ EL + S
Sbjct: 248 LSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLL 307
Query: 128 -AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 186
L+ L+ + ++ L + ++ ++ + E + + +++ + + + L+ KL
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367
Query: 187 ENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
E+E + E L L+E + E+ + E LE L
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERL 416
Score = 47.0 bits (112), Expect = 5e-06
Identities = 46/232 (19%), Positives = 100/232 (43%), Gaps = 5/232 (2%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYESEV 65
+ L L E + + ++++ +L+E+ + + E +E + E ++ E + +L + ++
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKI 332
Query: 66 EEL--ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
E L EL + + L++ + E ++LS L E E L L
Sbjct: 333 EALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL 392
Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
EL+ L+ + + L R L ++ + + + +E + +Q ++E+ E+ L+
Sbjct: 393 AEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELE 452
Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
+LE + ++ ++ E Q L RL E +D+ + E SQ
Sbjct: 453 EQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVR 504
Score = 45.9 bits (109), Expect = 9e-06
Identities = 46/227 (20%), Positives = 102/227 (44%), Gaps = 5/227 (2%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYES 63
S E L L + + E+Q+ +L+ + ++E Q + E ++ E + +L +
Sbjct: 699 SLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQE 758
Query: 64 EVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
+EELE E A + + + + ELE E + E R
Sbjct: 759 RLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA--LEREL 816
Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
S +RL+++ ++ ++ + +L ++ + E + E + ++E++E+ + E L+
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELE 876
Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
+L+ + E E ++EE + L+ L E +K +++ + +LER
Sbjct: 877 DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLER 923
Score = 44.7 bits (106), Expect = 2e-05
Identities = 59/234 (25%), Positives = 113/234 (48%), Gaps = 9/234 (3%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEER---DLHERDIAEYKLKLHAYESEV 65
E L+ LE + E+++ L+ + + +E R + ER + E K +L A E E+
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL 732
Query: 66 EELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
E+L+ L + + EL++ E+ E + L LE +LK E+E +
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792
Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
EL+ L+E+ ++ L + EL ++ ER+ E + ++E++E+ + ++ L+
Sbjct: 793 ----EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848
Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
+LE + E+E +KEE E + L E + E ++ +EEL ++ L E
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAE 902
Score = 42.8 bits (101), Expect = 1e-04
Identities = 53/241 (21%), Positives = 112/241 (46%), Gaps = 12/241 (4%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDL----------HERDIAEYKLKL 58
E L L+E + + EK++ +L+ + + +E +E ++ E +I+ + +L
Sbjct: 252 EEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERL 311
Query: 59 HAYESEVEELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQ 116
E+E+EELE L + K +++ E+ E Q L+ LE E L+ +L
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371
Query: 117 TTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQ 176
+ EL L+ + + +L + E+ ++ ER+ + ++E++++ +
Sbjct: 372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELE 431
Query: 177 GEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLG 236
E+ L+ +LE E+E ++E+ E + L L E + E+ + ++EL +A L
Sbjct: 432 AELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLD 491
Query: 237 E 237
Sbjct: 492 R 492
Score = 42.0 bits (99), Expect = 2e-04
Identities = 49/232 (21%), Positives = 106/232 (45%), Gaps = 11/232 (4%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVE 66
L LEE + E+++ +L+E+ + A++E +E + ++ E + +L + E+
Sbjct: 236 ELRKELEELEEELSRLEEELEELQEELEEAEKEIEE----LKSELEELREELEELQEELL 291
Query: 67 ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLST 126
EL+ + E + + E L LE E LK ++E + +
Sbjct: 292 ELKEEIEELEGEISLLRERL-----EELENELEELEERLEELKEKIEALKEELEER-ETL 345
Query: 127 SAELDRLQEKYDKTCADLRRAQAELRV-VQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 185
EL++L + ++ +L + L ++ E +R E ++ ++ + + E+ LK +
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405
Query: 186 LENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
+E+ + +E + E E + L L +E ++ E+++ EELE + L E
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457
Score = 40.5 bits (95), Expect = 5e-04
Identities = 53/227 (23%), Positives = 117/227 (51%), Gaps = 16/227 (7%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYESEV 65
+ L++LEE + + ++ +L E+ + ++E QE + E ++ + L + E+
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781
Query: 66 EELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
EELE + EL++ E+ + +A + L LE ERL+ E+E +
Sbjct: 782 EELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELE------- 834
Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
E++ L+EK D+ +L + EL ++ + E + +E++ +++++++ + E L+
Sbjct: 835 ----EEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE 890
Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
+L + E+ +KEE E+ + L L ++ ++ +E+ + +EELE
Sbjct: 891 EELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEE 937
Score = 39.7 bits (93), Expect = 9e-04
Identities = 51/226 (22%), Positives = 95/226 (42%), Gaps = 24/226 (10%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRADQE---RQEERDLHERDIAEYKLKLHAYES 63
E L E + E+++ L ++R+R +QE +EE + E + E + +L E
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856
Query: 64 EVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
E+EEL K EL++ E + + + LE E L+ EL +
Sbjct: 857 ELEEL-----KEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE------ 905
Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
E+++L+E+ ++ A L R + EL ++ + E + T++ ++E+ +
Sbjct: 906 -----EIEKLRERLEELEAKLERLEVELPELEEELEEEYED--TLETELEREIERLEEEI 958
Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELE 229
L EEYE + L S+R+ +K E +E
Sbjct: 959 EALGPVN---LRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIE 1001
Score = 38.2 bits (89), Expect = 0.003
Identities = 49/191 (25%), Positives = 91/191 (47%), Gaps = 12/191 (6%)
Query: 55 KLKLHAYESEVEELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHEL 112
K +L E E+ ELE L K + EL NE+ E R++L LE + E LK EL
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725
Query: 113 ---ERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQ-------ADNERVR 162
E EL+ L+E+ ++ L + EL ++ + E +
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785
Query: 163 SEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMD 222
+ + +QE++E+ + E+ + +L+ + E+ES+++ ER + + L E ++ ++D
Sbjct: 786 EKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLD 845
Query: 223 KSKEELERSQA 233
+ +EELE +
Sbjct: 846 ELEEELEELEK 856
Score = 35.8 bits (83), Expect = 0.017
Identities = 37/146 (25%), Positives = 69/146 (47%), Gaps = 4/146 (2%)
Query: 92 EAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
A + EL+ E + EL EL+ L+E+ + +L Q EL
Sbjct: 207 RQAEKAERYQELKAELRELEL----ALLLAKLKELRKELEELEEELSRLEEELEELQEEL 262
Query: 152 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLL 211
+ + E ++SE + ++E++E+ Q E+ LK ++E +GE+ ++E E + L L
Sbjct: 263 EEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELE 322
Query: 212 SERDKAVLEMDKSKEELERSQATLGE 237
++ +++ KEELE + L E
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEE 348
Score = 33.1 bits (76), Expect = 0.13
Identities = 29/124 (23%), Positives = 61/124 (49%)
Query: 114 RSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVE 173
RS R EL L+ + +K +L+ + ELR ++ E +R + + ++ ++E
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719
Query: 174 KSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
+ + E+ L+ +LE Q +E ++EE E + L L ++ E++ +E L + +
Sbjct: 720 ELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE 779
Query: 234 TLGE 237
+ E
Sbjct: 780 EIEE 783
Score = 29.3 bits (66), Expect = 2.0
Identities = 35/162 (21%), Positives = 74/162 (45%), Gaps = 11/162 (6%)
Query: 74 KAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRL 133
K EL + E+ A E + L L+ E L+ LE + L+ L
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQ-----------LEEL 714
Query: 134 QEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 193
+ + ++ +L + EL +Q+ E + E + ++E++E+ Q + L+ +LE+ + +
Sbjct: 715 ERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEAL 774
Query: 194 ESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
+KEE E + L E ++ E+++++ L+ + L
Sbjct: 775 AKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALEREL 816
Score = 28.9 bits (65), Expect = 3.2
Identities = 31/150 (20%), Positives = 66/150 (44%), Gaps = 1/150 (0%)
Query: 87 SGADWEAARQRLSRLELENERLKHELERSQTTFGRTTL-STSAELDRLQEKYDKTCADLR 145
+G + EL+ + +Q L S EL L++ ++ L
Sbjct: 653 TGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLE 712
Query: 146 RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
+ +L ++ + + E + +Q ++E+ + E+ L+ +LE Q +E ++EE E +
Sbjct: 713 ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772
Query: 206 TLSRLLSERDKAVLEMDKSKEELERSQATL 235
L++L E ++ + +EELE + L
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEEL 802
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 40.4 bits (94), Expect = 6e-04
Identities = 45/144 (31%), Positives = 79/144 (54%), Gaps = 6/144 (4%)
Query: 97 RLSRLELENERLKHELERSQTTFGRTTLS----TSAELDRLQEKYDKTCADLRRAQAELR 152
+L + ENE LK ELER Q+ G + S S EL R + A + + +LR
Sbjct: 4 QLENAQRENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLR 63
Query: 153 VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLS 212
V Q +N+ + + +QE++ KSQ E+ RL+ KLE + E ++ + E+++G SR+ +
Sbjct: 64 VTQEENQHAQLTVQALQEEL-KSQRELNRLQQKLEKSGNETGELQ-QLEKSEGQFSRIQA 121
Query: 213 ERDKAVLEMDKSKEELERSQATLG 236
ER++ +E+ ++ LE+ Q +
Sbjct: 122 ERERQAMELFLLRDTLEKLQLRME 145
Score = 30.8 bits (69), Expect = 0.78
Identities = 44/176 (25%), Positives = 79/176 (44%), Gaps = 6/176 (3%)
Query: 61 YESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFG 120
YE E + L K A L A E + E R R + E + L+ ++E Q TF
Sbjct: 350 YELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFR 409
Query: 121 RTTLSTSAELDRLQEKYDKTCADLRRAQAEL------RVVQADNERVRSEEKTMQEKVEK 174
R E +RL+ T D + E R+++ E+ +E+ QE+ E
Sbjct: 410 RKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFET 469
Query: 175 SQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
+ E LK +++N Q ++ + + E + +S+L S + K +++++ ELE+
Sbjct: 470 YKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEK 525
Score = 30.0 bits (67), Expect = 1.2
Identities = 48/212 (22%), Positives = 96/212 (45%), Gaps = 3/212 (1%)
Query: 11 ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
AL LE+A+ +KE+ + +L+EQRDR ++ QEE + ++++ + K ++ + ++ E EL
Sbjct: 434 ALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSEREL 493
Query: 71 GKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAEL 130
+ EL K S + + + +ELE R KHE + R ++
Sbjct: 494 ---QLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESADRG 550
Query: 131 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 190
+ ++ AD A+ E+ + E+ E + + + E+ + + L Q
Sbjct: 551 SAVDAGTSRSRADSAGARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQ 610
Query: 191 GEMESMKEEYERTQGTLSRLLSERDKAVLEMD 222
+ E+ + E+ER L +K+ E +
Sbjct: 611 EKTEATRIEFERKSAELLEEAERLEKSEAEEE 642
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 39.7 bits (93), Expect = 8e-04
Identities = 43/196 (21%), Positives = 78/196 (39%), Gaps = 4/196 (2%)
Query: 7 SSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVE 66
E L+ +EE I + +N+L E ++ +E +L+E + H Y
Sbjct: 108 DIEQQLTLIEEDIEQILEDLNELVESEEKNSEEIDHVLELYEELRRDVLANRHQYGEAAP 167
Query: 67 ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLST 126
ELE E + + S D+ AR+ L E L+ +ER +
Sbjct: 168 ELEKKLENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMERIPSLLAELQTEL 227
Query: 127 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 186
+L L+ Y D++ L V D+ R +E+ ++ +Q E+ + +L
Sbjct: 228 PGQLQDLKAGYR----DMKEEGYHLEHVNIDSRLERLKEQLVENSELLTQLELDEAEEEL 283
Query: 187 ENTQGEMESMKEEYER 202
Q ++ES+ + ER
Sbjct: 284 GLIQEKIESLYDLLER 299
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 38.3 bits (89), Expect = 0.002
Identities = 61/243 (25%), Positives = 111/243 (45%), Gaps = 39/243 (16%)
Query: 8 SEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEE 67
S+ L L I + EK + L + R +A ++ E+ L E++ + K+ + E
Sbjct: 126 SDFQLEDLVGMIQNAEKNILLLNQARLQALEDL--EKILTEKEALQGKIN-------ILE 176
Query: 68 LELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTS 127
+ L ++ A + AA E + + LE + E+L++EL T G S S
Sbjct: 177 MRLSETDARIKLAAQE-----------KIHVEILEEQLEKLRNELLIRGATEGLCVHSLS 225
Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS--QGEVYRLKAK 185
ELD L+E+ D++ +AEL V ERV EK E+S + L++K
Sbjct: 226 KELDVLKEENMLLKDDIQFLKAELIEVAETEERVFKLEK------ERSLLDASLRELESK 279
Query: 186 LENTQGEM--------ESMKEEYERTQGTLSRLLSERDKAVLEMDKSKE---ELERSQAT 234
Q ++ + E+ E Q L R ++ +KA L +D++++ ++++ +A+
Sbjct: 280 FIVAQEDVSKLSPLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEAS 339
Query: 235 LGE 237
L E
Sbjct: 340 LKE 342
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 37.5 bits (87), Expect = 0.005
Identities = 37/239 (15%), Positives = 86/239 (35%), Gaps = 19/239 (7%)
Query: 11 ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
++ L+ + + ++ +L + ++ D ++ L+ + E +
Sbjct: 743 RIAELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLRAAHRRAAEAERQA 802
Query: 71 GKSKAELDKAANEVGRSGADWEAARQRLSR-------------LELENERLKHELERSQT 117
++ EL +AA + + A W+ AR+ L R LE LK + T
Sbjct: 803 ESAERELARAARKAAAAAAAWKQARRELERDAADLDLPTDPDALEAVGLALKRFGDHLHT 862
Query: 118 TFGRT--TLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQE----K 171
+ + + + +D R A + +A+ E +T++E
Sbjct: 863 LEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM 922
Query: 172 VEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
V++ + + +A L + E+ + E + R +R +A +D+ E +
Sbjct: 923 VDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATLDERAEARDH 981
Score = 27.5 bits (61), Expect = 8.1
Identities = 29/132 (21%), Positives = 54/132 (40%), Gaps = 8/132 (6%)
Query: 83 EVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCA 142
+V + + R L RLE L++ L+R + + L Q +YD+
Sbjct: 224 DVADALEQLDEYRDELERLEALERALRNFLQRYRRYARTMLRRRATRLRSAQTQYDQLSR 283
Query: 143 DLRR-------AQAELRVVQADNERVRSEEKTMQEKVEKSQG-EVYRLKAKLENTQGEME 194
DL R A+ E R + A E + E ++ ++E QG Y+ +LE + + E
Sbjct: 284 DLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAE 343
Query: 195 SMKEEYERTQGT 206
+++ +
Sbjct: 344 ALQAAAADARQA 355
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 36.9 bits (86), Expect = 0.007
Identities = 61/242 (25%), Positives = 101/242 (41%), Gaps = 33/242 (13%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRAD------QERQEERDLHERDIAEYKLKLHAYE 62
E L+ L+E I E+Q Q RE RD AD +ER+EE + E +I + + + E
Sbjct: 212 ESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271
Query: 63 SEVEEL---------ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELE 113
E EEL L + + E D E G AD EA R LE +E L+ LE
Sbjct: 272 REREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLE 331
Query: 114 RSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVE 173
+ E + L+E D ++ E +R E ++ ++E
Sbjct: 332 ECRVAAQAHN----EEAESLREDADD--------------LEERAEELREEAAELESELE 373
Query: 174 KSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
+++ V + ++E + E+E ++E + L ++ E D+ +E +A
Sbjct: 374 EAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEA 433
Query: 234 TL 235
TL
Sbjct: 434 TL 435
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 36.5 bits (84), Expect = 0.009
Identities = 24/117 (20%), Positives = 50/117 (42%)
Query: 118 TFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG 177
R S +LD ++ + +L AQ E R + + E RSE + +++ E +
Sbjct: 64 LLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ 123
Query: 178 EVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQAT 234
E+ + L Q E+ + ++ + Q L L +R + + + ++ QA+
Sbjct: 124 ELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQAS 180
Score = 30.0 bits (67), Expect = 1.2
Identities = 36/188 (19%), Positives = 69/188 (36%), Gaps = 5/188 (2%)
Query: 21 DKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKA 80
D Q+ LR + A E++ ER+ A +L+ E E EL ++ L KA
Sbjct: 78 DIRPQLRALRTELGTAQGEKRAAET--EREAARSELQKARQEREAVRQELAAARQNLAKA 135
Query: 81 ANEVGRSGADWEAARQRLSRLELENERL---KHELERSQTTFGRTTLSTSAELDRLQEKY 137
E+ R + + RL L + +L L+ SQ + +++ L+ +
Sbjct: 136 QQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRS 195
Query: 138 DKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK 197
+ + + QA E + Q+ + Q ++ K + E ++
Sbjct: 196 AQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIR 255
Query: 198 EEYERTQG 205
E + Q
Sbjct: 256 ERERQLQR 263
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 35.8 bits (83), Expect = 0.018
Identities = 47/247 (19%), Positives = 85/247 (34%), Gaps = 48/247 (19%)
Query: 9 EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESE---- 64
E L SLE+ I + E+ +N L E++ D+ +E +L DI ++K L E
Sbjct: 87 EENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDI-DFK-YLRGAEGLKLGF 144
Query: 65 ----VEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFG 120
+ +L + EL W A R + + E+E E L+
Sbjct: 145 VAGVINREKLEAFEREL-------------WRACRGYIRQAEIE-EPLEDP--------K 182
Query: 121 RTTLSTSAELDRLQEKYDKTCADLR------RAQAELRVVQADNERVRSEEKTMQEKVEK 174
+T +K K R R EE +E+
Sbjct: 183 KTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRV--LEQ 240
Query: 175 SQGEVYRLKAKLENT----QGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
++ + ++ K+ + ++ K YE TL+ + + E ++LE+
Sbjct: 241 TESHLEKVLVKIADELLAWDEQVSKEKAVYE----TLNLFNYDTKTLIAEGWVPAKDLEK 296
Query: 231 SQATLGE 237
+A L
Sbjct: 297 LKAALEN 303
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 34.6 bits (80), Expect = 0.026
Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 1/94 (1%)
Query: 67 ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLST 126
+ EL + + EL + ++ + + +Q LS LE E ERL+ EL R + +
Sbjct: 72 QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA-IEL 130
Query: 127 SAELDRLQEKYDKTCADLRRAQAELRVVQADNER 160
E L+E+ + + +AE +Q + +R
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQENEQR 164
Score = 31.1 bits (71), Expect = 0.34
Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 10/103 (9%)
Query: 144 LRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK------ 197
L Q EL +Q + ++ + +Q++ ++ + E+ L+A+LE Q E+ +K
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA 127
Query: 198 ----EEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLG 236
EE + L+ L E + E ++ +E +R G
Sbjct: 128 IELDEENRELREELAELKQENEALEAENERLQENEQRRWFLYG 170
Score = 29.2 bits (66), Expect = 1.6
Identities = 23/96 (23%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 23 EKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAAN 82
++++ +L+E+ ++ E + + E K +L E+E+E L+ K A + + +
Sbjct: 72 QQELAELQEELAELQEQLAELQQ----ENQELKQELSTLEAELERLQ--KELARIKQLSA 125
Query: 83 EVGRSGADWEAARQRLSRLELENERLKHELERSQTT 118
+ R+ L+ L+ ENE L+ E ER Q
Sbjct: 126 NAIELDEENRELREELAELKQENEALEAENERLQEN 161
Score = 28.1 bits (63), Expect = 2.9
Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 14/107 (13%)
Query: 92 EAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
+AR+RL L+ E L+ EL Q E L+++ A+L R Q EL
Sbjct: 62 PSARERLPELQQELAELQEELAELQEQLAEL----QQENQELKQELSTLEAELERLQKEL 117
Query: 152 RVVQ----------ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 188
++ +N +R E ++++ E + E RL+ +
Sbjct: 118 ARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
Score = 26.9 bits (60), Expect = 9.1
Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 6/109 (5%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
L L++ + + ++++ +L+EQ QE QE + L+ + E +
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELS------TLEAELERLQKELARIK 121
Query: 72 KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFG 120
+ A + E + +Q LE ENERL+ +R +G
Sbjct: 122 QLSANAIELDEENRELREELAELKQENEALEAENERLQENEQRRWFLYG 170
>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Protein
kinase C and Casein kinase Substrate in Neurons (PACSIN)
proteins, also called Synaptic dynamin-associated
proteins (Syndapins), act as regulators of cytoskeletal
and membrane dynamics. They bind both dynamin and
Wiskott-Aldrich syndrome protein (WASP), and may provide
direct links between the actin cytoskeletal machinery
through WASP and dynamin-dependent endocytosis.
Vetebrates harbor three isoforms with distinct
expression patterns and specific functions. PACSINs
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 34.6 bits (80), Expect = 0.027
Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 8/85 (9%)
Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 187
++++ ++ Y C + AQ + ++D + K +Q+KVEK + EV + K K E
Sbjct: 133 KKVEKAKKAYHAACKAEKSAQKQENNAKSDTSLSPDQVKKLQDKVEKCKQEVSKTKDKYE 192
Query: 188 NTQGE--------MESMKEEYERTQ 204
+ ME M++ +++ Q
Sbjct: 193 KALEDLNKYNPRYMEDMEQVFDKCQ 217
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 35.1 bits (81), Expect = 0.028
Identities = 32/210 (15%), Positives = 82/210 (39%), Gaps = 20/210 (9%)
Query: 36 ADQERQEERDLHERDIAEYKLKLHAYESEVEELE--LGKSKAELDKAANEVGRSGADWEA 93
+ ++ +++IA + K+ + + +LE L + E+ ++ + D +
Sbjct: 32 SAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKK 91
Query: 94 ARQRLSRLELENERLKHELERSQ-------TTFGRTTLSTSAELDRLQEKYDKTCADLRR 146
R++++ L L+ + + R+ + L E ++ +R
Sbjct: 92 LRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRS---VRL 148
Query: 147 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
A + A ER+ + + T ++ ++A++ Q E+ ++ E Q
Sbjct: 149 AIYYGALNPARAERIDALKAT--------LKQLAAVRAEIAAEQAELTTLLSEQRAQQAK 200
Query: 207 LSRLLSERDKAVLEMDKSKEELERSQATLG 236
L++LL ER K + +++ ++ L
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELR 230
Score = 29.7 bits (67), Expect = 1.3
Identities = 42/222 (18%), Positives = 81/222 (36%), Gaps = 23/222 (10%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
+ LE+ + E ++ L Q + ++ R + IA+ +L+A EV+E E
Sbjct: 61 RAKLEKQLKSLETEIASLEAQLIETADDLKKLR----KQIADLNARLNA--LEVQEREQR 114
Query: 72 KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELD 131
+ AE A GR+ +S + + R +G
Sbjct: 115 RRLAEQLAALQRSGRN----PPPALLVSPEDAQRS------VRLAIYYGALN----PARA 160
Query: 132 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 191
+ T L +AE+ QA+ + SE++ Q K+ + E + A+L +
Sbjct: 161 ERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELS 220
Query: 192 EMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
+ EE + L ++ A K++E ++A
Sbjct: 221 ADQKKLEELRANESRLKNEIA---SAEAAAAKAREAAAAAEA 259
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 34.6 bits (80), Expect = 0.038
Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 20/97 (20%)
Query: 142 ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 201
A L +A+A L +A +R++ + Q + +++ + E E
Sbjct: 91 AALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQN-------------ELE 137
Query: 202 RTQ-----GTLSRLLSERDKAVLEMDKSKEELERSQA 233
R G +SR E D+A + ++ L +QA
Sbjct: 138 RRAELAQRGVVSR--EELDRARAALQAAEAALAAAQA 172
Score = 28.4 bits (64), Expect = 3.2
Identities = 33/141 (23%), Positives = 54/141 (38%), Gaps = 15/141 (10%)
Query: 50 DIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLK 109
D +Y+ L E+ + E A+L ++ + A A + ++ +
Sbjct: 85 DPRDYRAALEQAEAALAAAE-----AQLRNLRAQLASAQALIAQAE------AQDLDQAQ 133
Query: 110 HELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQ 169
+ELER R +S ELDR + A L AQA + A E S Q
Sbjct: 134 NELERRAELAQRGVVSRE-ELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGA---Q 189
Query: 170 EKVEKSQGEVYRLKAKLENTQ 190
+V ++ + + K LE T
Sbjct: 190 AQVASAEAALDQAKLDLERTV 210
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 33.7 bits (78), Expect = 0.068
Identities = 35/203 (17%), Positives = 65/203 (32%), Gaps = 32/203 (15%)
Query: 49 RDIAEYKLKLHAYESEVEELELGK----SKAELDKAANEVGRSGADWEAARQRLSRLELE 104
R L + +V L + + EL+K E+ + + LE E
Sbjct: 63 RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQE 122
Query: 105 NERLK------------HELERSQTTFGRTT----LSTSAELDRLQEKYDKT-------- 140
ERL+ + G E D +Y T
Sbjct: 123 IERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVV 182
Query: 141 CADLRRAQAELRVV--QADNERVR-SEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK 197
L+ E+ + ER+ EE T E + + + E+ ++ + E+ E++ +
Sbjct: 183 VVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELA 242
Query: 198 EEYERTQGTLSRLLS-ERDKAVL 219
++Y L L E ++A
Sbjct: 243 KKYLEELLALYEYLEIELERAEA 265
Score = 27.6 bits (62), Expect = 8.0
Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 22/90 (24%)
Query: 11 ALSSLEEAIGDKEKQMNQLREQ-RDRADQERQEERDLHE-----RDIAEYKLK------- 57
+ ++E + + EK+ L E+ ++ A + +E L+E + AE K
Sbjct: 216 LIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKTDKT 275
Query: 58 --LHAY--ESEVEELELGKSKAELDKAANE 83
+ + E V++L K +DKA
Sbjct: 276 FAIEGWVPEDRVKKL-----KELIDKATGG 300
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 33.8 bits (78), Expect = 0.073
Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 31/113 (27%)
Query: 95 RQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVV 154
R+RL++L ENE LK AE +RLQ + E +
Sbjct: 72 RKRLAKLISENEALK------------------AENERLQ-------------KREQSID 100
Query: 155 QADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTL 207
Q + V+SE + + +++E+ + E +L+ ++ Q + + L
Sbjct: 101 QQIQQAVQSETQELTKEIEQLKSERQQLQGLIDQLQRRLAGVLTGPSGGGSDL 153
Score = 30.0 bits (68), Expect = 1.1
Identities = 15/73 (20%), Positives = 42/73 (57%), Gaps = 7/73 (9%)
Query: 142 ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM----ESMK 197
A+++ + L + ++NE +++E + +Q++ E+S +++ +++ E+ E +K
Sbjct: 66 AEVKELRKRLAKLISENEALKAENERLQKR-EQSID--QQIQQAVQSETQELTKEIEQLK 122
Query: 198 EEYERTQGTLSRL 210
E ++ QG + +L
Sbjct: 123 SERQQLQGLIDQL 135
Score = 28.1 bits (63), Expect = 5.0
Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 4/65 (6%)
Query: 177 GEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVL----EMDKSKEELERSQ 232
EV L+ +L E E++K E ER Q + + +AV E+ K E+L+ +
Sbjct: 66 AEVKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSER 125
Query: 233 ATLGE 237
L
Sbjct: 126 QQLQG 130
>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
DevB family. Members of this protein family are found
mostly in the Cyanobacteria, but also in the
Planctomycetes. DevB from Anabaena sp. strain PCC 7120
is partially characterized as a membrane fusion protein
of the DevBCA ABC exporter, probably a glycolipid
exporter, required for heterocyst formation. Most
Cyanobacteria have one member only, but Nostoc sp. PCC
7120 has seven members.
Length = 327
Score = 32.9 bits (75), Expect = 0.13
Identities = 29/110 (26%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 187
AELD + + D+ A L + +A + + +R + + V Q + RL+A+LE
Sbjct: 55 AELDVARTQLDEAKARLAQVRAGAKKGEIAAQRAARAAAKLFKDVAAQQATLNRLEAELE 114
Query: 188 NTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
Q E++ + + G +S S+ D L++ ++EELE + A+ E
Sbjct: 115 TAQREVDRYRSLF--RDGAVSA--SDLDSKALKLRTAEEELEEALASRSE 160
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 32.2 bits (73), Expect = 0.22
Identities = 30/117 (25%), Positives = 53/117 (45%), Gaps = 5/117 (4%)
Query: 88 GADWEAARQRLSRLELENERLKHELE-----RSQTTFGRTTLSTSAELDRLQEKYDKTCA 142
G D E L R EL+N ++ L+ +L+ LQ++ ++
Sbjct: 97 GHDVEHIDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLK 156
Query: 143 DLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
+L +AE VQ +R+ E ++E ++K GEVY LK + + + +E +EE
Sbjct: 157 ELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEE 213
Score = 29.5 bits (66), Expect = 1.4
Identities = 24/99 (24%), Positives = 51/99 (51%)
Query: 103 LENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVR 162
++ E ++ EL+ + G TL E L+E Y++ L Q E + + E +
Sbjct: 103 IDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELE 162
Query: 163 SEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 201
+E + +QE++++ + E RL+ L+ GE+ +K+ ++
Sbjct: 163 AEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWD 201
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 32.3 bits (73), Expect = 0.22
Identities = 44/225 (19%), Positives = 97/225 (43%), Gaps = 12/225 (5%)
Query: 10 GALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELE 69
G L L+ I KE+++ L+ D ++ L ++ I+ + +L E E E
Sbjct: 330 GKLEKLKSEIELKEEEIKALQSNIDELHKQ------LRKQGISTEQFELMNQEREKLTRE 383
Query: 70 LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAE 129
L K + DK V + + + L + + + L + RS++ G +S +
Sbjct: 384 LDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLK 443
Query: 130 LDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEK---TMQEKVEKSQGEVYRLKAKL 186
++ Q + + ++ L + ER+++EE T++E ++ + ++ L L
Sbjct: 444 INIEQLFPKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDINELTQIL 503
Query: 187 ENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERS 231
E + E+ ++E ++ R L + +E++K ++EL
Sbjct: 504 EKLELELSEANSKFELSKEENERELVAQR---IEIEKLEKELNDL 545
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
This model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 31.9 bits (73), Expect = 0.24
Identities = 20/79 (25%), Positives = 32/79 (40%), Gaps = 4/79 (5%)
Query: 131 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 190
D Q A L A+A+L + Q ER + ++ SQ ++ KA +E Q
Sbjct: 60 DDYQLALQAALAQLAAAEAQLELAQRSFERAE----RLVKRNAVSQADLDDAKAAVEAAQ 115
Query: 191 GEMESMKEEYERTQGTLSR 209
++E+ K Q L
Sbjct: 116 ADLEAAKASLASAQLNLRY 134
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 31.2 bits (71), Expect = 0.41
Identities = 13/53 (24%), Positives = 26/53 (49%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESE 64
L L AI + ++L E+ A++ R+E R ++I++ K K+ +
Sbjct: 237 LQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKEISKLKAKVSLLQKL 289
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 31.1 bits (71), Expect = 0.41
Identities = 19/64 (29%), Positives = 32/64 (50%), Gaps = 9/64 (14%)
Query: 41 QEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSR 100
+E R+ +E ++ EY+ + +E+E + LE K A + + G D EA + L
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLE---------KKAKKAIKKGKDEEALAEELLE 118
Query: 101 LELE 104
LE E
Sbjct: 119 LEAE 122
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 30.3 bits (69), Expect = 0.47
Identities = 19/89 (21%), Positives = 46/89 (51%), Gaps = 2/89 (2%)
Query: 147 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
A A+ ++ D ++V SE + ++ + E +L+A+L+ + E++ +++ ++ T
Sbjct: 12 AAAQAKIGVVDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAAT 71
Query: 207 LSRLLSERDKAVLEMDKSKEELERSQATL 235
LS R E+ + ++EL++ Q
Sbjct: 72 LSE--EARKAKQQELQQKQQELQQKQQAA 98
>gnl|CDD|220640 pfam10234, Cluap1, Clusterin-associated protein-1. This protein is
conserved from worms to humans. The protein of 413 amino
acids contains a central coiled-coil domain, possibly
the region that binds to clusterin. Cluap1 expression is
highest in the nucleus and gradually increases during
late S to G2/M phases of the cell cycle and returns to
the basal level in the G0/G1 phases. In addition, it is
upregulated in colon cancer tissues compared to
corresponding non-cancerous mucosa. It thus plays a
crucial role in the life of the cell.
Length = 269
Score = 30.7 bits (70), Expect = 0.51
Identities = 22/70 (31%), Positives = 39/70 (55%)
Query: 160 RVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVL 219
++ K + ++++ Q E+ LK+ N + ++E K+E ERTQ L L S R +
Sbjct: 168 ALKEAIKNLAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFME 227
Query: 220 EMDKSKEELE 229
E +K +EEL+
Sbjct: 228 EYEKLEEELQ 237
Score = 28.1 bits (63), Expect = 3.5
Identities = 18/64 (28%), Positives = 38/64 (59%)
Query: 147 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
A L+ +QA+ + ++S+E ++ K+E+ + E+ R + +L+ Q + EEYE+ +
Sbjct: 176 LAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFMEEYEKLEEE 235
Query: 207 LSRL 210
L +L
Sbjct: 236 LQKL 239
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 31.1 bits (71), Expect = 0.52
Identities = 21/93 (22%), Positives = 36/93 (38%), Gaps = 7/93 (7%)
Query: 11 ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
AL + + + + + Q + +Q+E E AE + K E+++E+L
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL-- 203
Query: 71 GKSKAELDKAANEVGRSGADWEAARQRLSRLEL 103
+ KA + R E Q RLEL
Sbjct: 204 -QEKAAETSQERKQKRK----EITDQAAKRLEL 231
>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
C4-dicarboxylate transport system [Signal transduction
mechanisms].
Length = 603
Score = 31.1 bits (71), Expect = 0.52
Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 2/70 (2%)
Query: 87 SGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRR 146
A W R+R E + + ELER +A L + ++ A LRR
Sbjct: 311 LLALWLRRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRR 370
Query: 147 AQAELRVVQA 156
AQ EL VQA
Sbjct: 371 AQDEL--VQA 378
>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
outer membrane / Intracellular trafficking and
secretion].
Length = 457
Score = 29.8 bits (67), Expect = 1.2
Identities = 13/80 (16%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 143 DLRRAQAELRVVQADNERVRSEEKTMQEKVEK---SQGEVYRLKAKLENTQGEMESMKEE 199
DL AQ +L + + + + +++ + ++ +V + +A+L + + ++ + + +
Sbjct: 162 DLLAAQEQLALAEETLAAAEEQLELAEKRYDAGLATRLDVLQAEAQLASARAQLAAAQAQ 221
Query: 200 YERTQGTLSRLLSERDKAVL 219
+ + L+RLL +L
Sbjct: 222 LAQARNALARLLGLEPGELL 241
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 29.9 bits (67), Expect = 1.2
Identities = 9/41 (21%), Positives = 23/41 (56%)
Query: 13 SSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAE 53
S ++ D++ Q++ ++ ++ D++++EE L E D
Sbjct: 5 SDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAML 45
>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
resistance protein A). [Transport and binding proteins,
Other].
Length = 334
Score = 29.8 bits (67), Expect = 1.3
Identities = 23/96 (23%), Positives = 43/96 (44%), Gaps = 6/96 (6%)
Query: 142 ADLRRAQAELRVVQADNERVRSEEKTMQ--EKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
L AEL + +A+ +T Q V++ Q +V LK KLE + ++ + +
Sbjct: 71 VRLDPTNAELALAKAEANLAALVRQTKQLEITVQQLQAKVESLKIKLEQAREKLLQAELD 130
Query: 200 YERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
R + L R+ E+D +++ L ++A L
Sbjct: 131 LRRRVPLFKKGLISRE----ELDHARKALLSAKAAL 162
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 29.6 bits (67), Expect = 1.3
Identities = 18/93 (19%), Positives = 28/93 (30%), Gaps = 29/93 (31%)
Query: 70 LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAE 129
+G + + + N + + L L ENE LK EL
Sbjct: 43 VGDVVSPVQRVVNAPREFVSGVFESLASLFDLREENEELKKELLE--------------- 87
Query: 130 LDRLQEKYDKTCADLRRAQAELRVVQADNERVR 162
L EL ++A+N R+R
Sbjct: 88 --------------LESRLQELEQLEAENARLR 106
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 29.3 bits (66), Expect = 1.3
Identities = 13/83 (15%), Positives = 40/83 (48%)
Query: 150 ELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSR 209
++ + + +R+ K +++ ++K++ E+ L LE + E+E ++ + + + +
Sbjct: 11 AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQE 70
Query: 210 LLSERDKAVLEMDKSKEELERSQ 232
+ +A ++ K+E E
Sbjct: 71 IRERIKRAEEKLSAVKDERELRA 93
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 29.6 bits (67), Expect = 1.5
Identities = 13/48 (27%), Positives = 27/48 (56%)
Query: 158 NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
E++R +EK ++E + E L +KLE Q ++E +++ E+ +
Sbjct: 323 TEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKS 370
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 29.2 bits (66), Expect = 1.7
Identities = 12/53 (22%), Positives = 26/53 (49%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESE 64
L LE I D + ++L + A+++ ++ R ++I + K +L +S
Sbjct: 234 LQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKLKEQLKLLQSL 286
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 29.5 bits (67), Expect = 1.7
Identities = 17/90 (18%), Positives = 41/90 (45%), Gaps = 5/90 (5%)
Query: 121 RTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE-----KTMQEKVEKS 175
+T + + L+E+ + +L+ + EL ++ +++ + +E++EK
Sbjct: 320 KTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKL 379
Query: 176 QGEVYRLKAKLENTQGEMESMKEEYERTQG 205
+L +LE + E++ +KEE E
Sbjct: 380 LETKEKLSEELEELEEELKELKEELESLYS 409
Score = 27.2 bits (61), Expect = 9.7
Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 159 ERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAV 218
+ + E K ++E++EK + + +L K +E+ E+ T +L E ++
Sbjct: 337 KELEEELKELEEELEKIKKLLKKLPKKARGQLPPE--KREQLEKLLETKEKLSEELEELE 394
Query: 219 LEMDKSKEELERSQA 233
E+ + KEELE +
Sbjct: 395 EELKELKEELESLYS 409
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 29.4 bits (67), Expect = 1.8
Identities = 18/58 (31%), Positives = 22/58 (37%), Gaps = 12/58 (20%)
Query: 57 KLHAYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELER 114
+L A E ELE + EL +LE E ERLK ELE+
Sbjct: 2 RLEALEERNSELEEQIRQLELKL------------RDLEAENEKLERELERLKSELEK 47
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.4 bits (67), Expect = 1.9
Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 9/69 (13%)
Query: 15 LEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSK 74
L E I E+ +L ++ + A+ +E L E E + K + E ++L
Sbjct: 518 LNELIASLEELERELEQKAEEAEALLKEAEKLKE----ELEEKKEKLQEEEDKL-----L 568
Query: 75 AELDKAANE 83
E +K A +
Sbjct: 569 EEAEKEAQQ 577
>gnl|CDD|163494 TIGR03782, Bac_Flav_CT_J, Bacteroides conjugative transposon TraJ
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage. This family is related conjugation system
proteins in the Proteobacteria, including TrbL of
Agrobacterium Ti plasmids and VirB6 [Cellular processes,
DNA transformation].
Length = 322
Score = 28.9 bits (65), Expect = 2.3
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 17/75 (22%)
Query: 26 MNQLREQRDRADQE---RQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAE------ 76
MN+ REQ+D+ + E R E Y + ++ +++EL S +
Sbjct: 101 MNRYREQKDKLEYEAMVRDPET--------AYLVSDEEFDRKLDELGWSPSDMKTMAGMY 152
Query: 77 LDKAANEVGRSGADW 91
+++AA + +S DW
Sbjct: 153 MERAAYNMKKSVRDW 167
>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
Length = 256
Score = 28.6 bits (64), Expect = 2.3
Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 7/40 (17%)
Query: 165 EKTMQEKVEKSQGEVYRLKAKLENT-----QGE--MESMK 197
++ MQEK+ + E+ +++AKL+ T Q E E M+
Sbjct: 86 QRGMQEKMAVMKPELDKIQAKLKVTKDLEKQKEYQKEMME 125
>gnl|CDD|153363 cd07679, F-BAR_PACSIN2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 2 (PACSIN2). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 2 or Syndapin II is expressed ubiquitously and is
involved in the regulation of tubulin polymerization. It
associates with Golgi membranes and forms a complex with
dynamin II which is crucial in promoting vesicle
formation from the trans-Golgi network. PACSIN 2
contains an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 28.9 bits (64), Expect = 2.4
Identities = 26/103 (25%), Positives = 50/103 (48%), Gaps = 8/103 (7%)
Query: 129 ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 188
E++ ++ Y C + + A + +AD + K +Q+KVEK + +V + K K E
Sbjct: 134 EVEAAKKAYHTACKEEKLATSREANSKADPALNPEQLKKLQDKVEKCKQDVLKTKEKYEK 193
Query: 189 TQGE--------MESMKEEYERTQGTLSRLLSERDKAVLEMDK 223
+ E ME+M++ +E+ Q + L + +LE+ K
Sbjct: 194 SLKELDQTTPQYMENMEQVFEQCQQFEEKRLRFFREVLLEVQK 236
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.3 bits (65), Expect = 2.5
Identities = 35/217 (16%), Positives = 92/217 (42%), Gaps = 4/217 (1%)
Query: 21 DKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG--KSKAELD 78
++ K+ ++ +++ + A ++ E + E + K + +E + KA+
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
Query: 79 KAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYD 138
K A E ++ +A + + + + E K E + + AE R++E
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Query: 139 KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKE 198
+ + E + +A+ ++++EE E+ +K ++ + +A+ + E++ +E
Sbjct: 1600 LYEEEKKMKAEEAK--KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657
Query: 199 EYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
E + ++ E K E K++E+ +++ L
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEAL 1694
Score = 28.6 bits (63), Expect = 4.4
Identities = 32/200 (16%), Positives = 85/200 (42%), Gaps = 9/200 (4%)
Query: 2 QAHHCSSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAY 61
+A E + EE K ++ + E + +A++ ++ E + + + + K
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK-----E 1642
Query: 62 ESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGR 121
E ++ E K E +K A+ + + ++ E+E+ E + + +
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702
Query: 122 TTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYR 181
+ EL + + + K +L++A+ E ++ + ++ E+K E+ +K + E +
Sbjct: 1703 ----KAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
Query: 182 LKAKLENTQGEMESMKEEYE 201
+ + + + E +++E E
Sbjct: 1759 IAHLKKEEEKKAEEIRKEKE 1778
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 28.8 bits (65), Expect = 2.7
Identities = 22/102 (21%), Positives = 43/102 (42%), Gaps = 1/102 (0%)
Query: 92 EAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
E R+R++ LELE + L+ E R AEL L+ + A ++ + +
Sbjct: 422 EDLRRRIAALELELDALEREAALGADHDERLAELR-AELAALEAELAALEARWQQEKELV 480
Query: 152 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 193
+ A + ++ + + ++ L+A L + QGE
Sbjct: 481 EAILALRAELEADADAPADDDAALRAQLAELEAALASAQGEE 522
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 28.6 bits (65), Expect = 2.9
Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 17/107 (15%)
Query: 145 RRAQAELRVVQADNERVRSEEKTMQ---------EKVEKSQGEVYRLKAKLENTQGEMES 195
R + E + + NE + E++ +Q E +EK + E+ + + +LE Q E+E
Sbjct: 69 LRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEK 128
Query: 196 MKEEYERTQGT-------LSRLLSERDKAVLEMDKSKEELERSQATL 235
+EE E +S L +E K +L ++K +EE A L
Sbjct: 129 KEEELEELIEEQLQELERISGLTAEEAKEIL-LEKVEEEARHEAAVL 174
>gnl|CDD|222629 pfam14257, DUF4349, Domain of unknown function (DUF4349). This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 282 and
353 amino acids in length. There is a single completely
conserved residue D that may be functionally important.
Length = 262
Score = 28.3 bits (64), Expect = 3.0
Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 128 AELDRLQEKYDK--TCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 175
A +RL ++ T DL + EL VQA+ E + ++ + + ++V S
Sbjct: 146 ASEERLLALLERAGTVEDLLEVERELSRVQAEIESLEAQLRYLDDRVAYS 195
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.6 bits (64), Expect = 3.0
Identities = 14/60 (23%), Positives = 29/60 (48%)
Query: 24 KQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANE 83
K +++ D + +RQE + + + +Y KL + E + E ++KA+ +K E
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410
>gnl|CDD|221380 pfam12017, Tnp_P_element, Transposase protein. Protein in this
family are transposases found in insects. This region is
about 230 amino acids in length and is found associated
with pfam05485.
Length = 236
Score = 28.3 bits (63), Expect = 3.0
Identities = 12/47 (25%), Positives = 27/47 (57%)
Query: 158 NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQ 204
N E K++++K+ + E++ L+ +LE +Q +S+ + + TQ
Sbjct: 13 NHTTLMENKSLRQKIRALEKEMHSLRQQLEESQQLEKSLGKIFTETQ 59
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.9 bits (63), Expect = 3.2
Identities = 18/62 (29%), Positives = 24/62 (38%), Gaps = 7/62 (11%)
Query: 55 KLKLHAYESEVE--ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHEL 112
+K H E + ELE +AEL + E AR R ELE + + E
Sbjct: 48 SIKQHLAELAADLLELEAAAPRAELQAKIARYKK-----EKARYRSEAKELEAKAKEAEA 102
Query: 113 ER 114
E
Sbjct: 103 ES 104
Score = 27.2 bits (61), Expect = 5.9
Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 1/48 (2%)
Query: 22 KEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELE 69
KQ + D + E R + IA YK + Y SE +ELE
Sbjct: 48 SIKQ-HLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELE 94
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 28.2 bits (63), Expect = 3.4
Identities = 23/104 (22%), Positives = 49/104 (47%), Gaps = 4/104 (3%)
Query: 125 STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKA 184
+L LQE+ + + ++EL ++ D + +R K+++E++E + + L
Sbjct: 83 ELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAK 142
Query: 185 KLENTQG----EMESMKEEYERTQGTLSRLLSERDKAVLEMDKS 224
+L + E+E ++EE ER Q L RL + +++
Sbjct: 143 ELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
Score = 27.8 bits (62), Expect = 3.8
Identities = 24/117 (20%), Positives = 51/117 (43%), Gaps = 8/117 (6%)
Query: 90 DWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQA 149
E + LS L+ + L+ ELE +EL+ L+E L+ +
Sbjct: 73 KLEELQGELSELKQQLSELQEELED----LEERIAELESELEDLKEDLQLLRELLKSLEE 128
Query: 150 ELRVVQADNERVRSE----EKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 202
L ++ + + E + ++E+VE+ + E+ RL+ L+ Q ++ ++ E+
Sbjct: 129 RLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQ 185
Score = 27.0 bits (60), Expect = 7.6
Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 5/97 (5%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
L LEE I + E ++ L+E R+ + L ER + + + E+ EL
Sbjct: 95 LEDLEERIAELESELEDLKEDL---QLLRELLKSLEER-LESLEESIKELAKELRELRQ- 149
Query: 72 KSKAELDKAANEVGRSGADWEAARQRLSRLELENERL 108
+ E+++ E+ R + + ++ + L+ E+L
Sbjct: 150 DLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 28.3 bits (63), Expect = 3.9
Identities = 22/100 (22%), Positives = 43/100 (43%), Gaps = 5/100 (5%)
Query: 96 QRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQ 155
+ L+R + R E + + R A L +L+ + A+L+RA L +
Sbjct: 193 ETLAREAEDAARAADEAKTAAAAAAREAAPLKASLRKLERAKARADAELKRADKALAAAK 252
Query: 156 ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMES 195
D + R+EE+ Q+K + E +L+ + + E+
Sbjct: 253 TDEAKARAEER--QQKAAQQAAEA---ATQLDTAKADAEA 287
>gnl|CDD|177920 PLN02281, PLN02281, chlorophyllide a oxygenase.
Length = 536
Score = 28.2 bits (62), Expect = 4.0
Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Query: 161 VRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLE 220
V+ E +QE++ K+ +V+ +A++ ++ M EE + R+++E DK
Sbjct: 126 VKKELAGLQEELSKAHQQVHISEARVSTALDKLAHM-EELVNDRLLPGRVVTELDKPSSS 184
Query: 221 MDKSKEELERSQATLG 236
S EL+R + G
Sbjct: 185 TTASAVELDREKTNTG 200
>gnl|CDD|240301 PTZ00166, PTZ00166, DNA polymerase delta catalytic subunit;
Provisional.
Length = 1054
Score = 28.5 bits (64), Expect = 4.1
Identities = 8/25 (32%), Positives = 13/25 (52%)
Query: 207 LSRLLSERDKAVLEMDKSKEELERS 231
+ L++ R KA EM K+ L +
Sbjct: 617 VEELIAARKKAKKEMKDEKDPLLKK 641
>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
outer membrane].
Length = 372
Score = 28.1 bits (62), Expect = 4.1
Identities = 17/73 (23%), Positives = 27/73 (36%), Gaps = 1/73 (1%)
Query: 89 ADWEAARQRLSRLELE-NERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRA 147
A + A +L+R + +L+R + AELD Q A + A
Sbjct: 105 AALDQAEAQLARAQALLAPAELGDLQREAKLAAEKAAVSQAELDAAQALLRAAEALVEAA 164
Query: 148 QAELRVVQADNER 160
QA L + + E
Sbjct: 165 QAALASAKLNLEY 177
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 28.4 bits (63), Expect = 4.4
Identities = 11/64 (17%), Positives = 29/64 (45%)
Query: 168 MQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEE 227
++E+ ++ GE + ++ +L +G + R + S + + +MD ++
Sbjct: 2309 LREEEKRIDGEAFLVEDRLTLGKGLSSDLMTFKLRRRSYYSLDILRVHGKIADMDTVHKD 2368
Query: 228 LERS 231
+ RS
Sbjct: 2369 VLRS 2372
>gnl|CDD|200547 cd10921, CE4_MJ0505_like, Putative catalytic domain of
uncharacterized protein MJ0505 from Methanocaldococcus
jannaschii and similar proteins. This family contains
an uncharacterized protein MJ0505 from
Methanocaldococcus jannaschii and its prokaryotic
homologs. Although their biochemical properties remain
to be determined, members in this family is composed of
a seven-stranded barrel with a detectable sequence
similarity to the six-stranded barrel rhizobial
NodB-like proteins, which remove N-linked or O-linked
acetyl groups of cell wall polysaccharides and belong to
a larger carbohydrate esterase 4 (CE4) superfamily.
Length = 206
Score = 27.7 bits (62), Expect = 4.9
Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 9/79 (11%)
Query: 46 LHERDIAEYKLKLHAYESEVEELELGKSKAE--LDKAANEVGRSGADWEAARQRL---SR 100
LHE + YK++LH Y+ E K+ AE L+KA + G D L R
Sbjct: 59 LHELEKEGYKVELHGYDHIYHEFNCNKTTAEEKLEKALEILSALGFDNI----TLFLPPR 114
Query: 101 LELENERLKHELERSQTTF 119
+ + LK LE + T
Sbjct: 115 YAISEDSLKVILEHNFTVI 133
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 28.0 bits (63), Expect = 5.1
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 14/76 (18%)
Query: 127 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 186
+AEL RL+++ +K ++ R + +L NE ++ E VEK K KL
Sbjct: 813 AAELARLEKELEKLEKEIDRIEKKL-----SNEGFVAKAPE--EVVEKE-------KEKL 858
Query: 187 ENTQGEMESMKEEYER 202
Q ++ ++E
Sbjct: 859 AEYQVKLAKLEERLAV 874
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 28.1 bits (63), Expect = 5.3
Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 30/178 (16%)
Query: 60 AYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTF 119
A EE + + ++ E+ + + + RLE EN LK ELE
Sbjct: 393 ALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE------ 446
Query: 120 GRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEV 179
EL R EK + RR VR + + +E + +
Sbjct: 447 ---------ELKREIEKLESELERFRR-------------EVRDKVRKDRE-IRARDRRI 483
Query: 180 YRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDK-SKEELERSQATLG 236
RL+ +LE + +E ++ + + LS + V ++K + E +E ++ G
Sbjct: 484 ERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYG 541
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 28.1 bits (63), Expect = 5.5
Identities = 29/160 (18%), Positives = 64/160 (40%), Gaps = 12/160 (7%)
Query: 2 QAHHCSSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAY 61
++ E L +L D +++ +L+++ + E+ + I E K + A
Sbjct: 358 RSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAA 417
Query: 62 ESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGR 121
E + + + +L+ E + LEL RLK L+ + T
Sbjct: 418 IEEDLQALESQLRQQLEAGKLE----------FNEEEYELELRLGRLKQRLDSATATPEE 467
Query: 122 TTLSTSAE--LDRLQEKYDKTCADLRRAQAELRVVQADNE 159
+ L++ QE+ ++ A++ + Q+ELR ++ +
Sbjct: 468 LEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKRRD 507
>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
Length = 568
Score = 27.9 bits (62), Expect = 5.7
Identities = 19/138 (13%), Positives = 45/138 (32%), Gaps = 5/138 (3%)
Query: 86 RSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLR 145
+ + W A R +R E +R +++ ER G + +T +
Sbjct: 157 WNQSAWNA-RNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEA----N 211
Query: 146 RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
R + + + E + + ++ ++ + + A +
Sbjct: 212 RFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAAT 271
Query: 206 TLSRLLSERDKAVLEMDK 223
+ +R + D+A E DK
Sbjct: 272 SANRAKQQADRAKTEADK 289
Score = 27.9 bits (62), Expect = 5.7
Identities = 17/91 (18%), Positives = 30/91 (32%), Gaps = 1/91 (1%)
Query: 28 QLREQRDRADQERQE-ERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGR 86
++ R AD E R D ++ ++E E + KAA
Sbjct: 195 NTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASA 254
Query: 87 SGADWEAARQRLSRLELENERLKHELERSQT 117
+ A A + R K + +R++T
Sbjct: 255 AHASEVNAANSATAAATSANRAKQQADRAKT 285
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 27.6 bits (62), Expect = 5.8
Identities = 32/149 (21%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
Query: 93 AARQRLSRLELENERLKHELERSQTTFGRT-TLSTSAELDRLQEKYDKTCADLRR--AQA 149
A ++L E +R+ E ++ T + L E+ +L+ + ++ + R +
Sbjct: 22 IAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRL 81
Query: 150 ELRVVQAD---NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
E R++Q + + ++ S +K +E +EK + E+ + L+ + E+E + E
Sbjct: 82 ERRLLQREETLDRKMESLDKK-EENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
Query: 207 LSRLLSERDKAVLEMDKSKEELERSQATL 235
+S L E K +L +++ +EE A L
Sbjct: 141 ISGLTQEEAKEIL-LEEVEEEARHEAAKL 168
>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
Length = 718
Score = 27.9 bits (62), Expect = 6.2
Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 6/82 (7%)
Query: 156 ADNERVRSEEKTMQEK-----VEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRL 210
+D+ R + EK + K + V L A + + G M+ + ++ + L
Sbjct: 63 SDDAECRLVCSKLAEKFGKHGLVKPKPSVRSLAAPVRMSDG-MDDSEVAKKKPEPDDGPL 121
Query: 211 LSERDKAVLEMDKSKEELERSQ 232
LSERD A +E K KEE +R
Sbjct: 122 LSERDLAKIERRKRKEERQREV 143
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 27.2 bits (61), Expect = 7.0
Identities = 22/90 (24%), Positives = 39/90 (43%), Gaps = 11/90 (12%)
Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRS--------EEKTMQEKVEKSQGEV 179
AE+ LQ + D+ A++ A E Q + E K +EK++ +Q +
Sbjct: 142 AEIAALQAEIDEAQAEVNAAYQE---AQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARL 198
Query: 180 YRLKAKLENTQGEMESMKEEYERTQGTLSR 209
LKA+L+ ++E+ K ER +
Sbjct: 199 ETLKARLDAAIAQLEAQKAALERNRQAAVA 228
>gnl|CDD|233594 TIGR01844, type_I_sec_TolC, type I secretion outer membrane
protein, TolC family. Members of This model are outer
membrane proteins from the TolC subfamily within the RND
(Resistance-Nodulation-cell Division) efflux systems.
These proteins, unlike the NodT subfamily, appear not to
be lipoproteins. All are believed to participate in type
I protein secretion, an ABC transporter system for
protein secretion without cleavage of a signal sequence,
although they may, like TolC, participate also in the
efflux of smaller molecules as well. This family
includes the well-documented examples TolC (E. coli),
PrtF (Erwinia), and AprF (Pseudomonas aeruginosa)
[Protein fate, Protein and peptide secretion and
trafficking, Transport and binding proteins, Porins].
Length = 415
Score = 27.3 bits (61), Expect = 7.3
Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 5/87 (5%)
Query: 120 GRTTLSTSAELDRLQEKYD--KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG 177
S S+ L Q +D T +R A+A A E +R+ + + + ++
Sbjct: 64 NTDLNSGSSTLTLSQPLFDGGSTWNAVRAAEAA---ALAARETLRATAQDLILRTAEAYM 120
Query: 178 EVYRLKAKLENTQGEMESMKEEYERTQ 204
EV R + L + + ++KE+ + +
Sbjct: 121 EVLRAQEILALAEANLAALKEQLDLAR 147
>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein.
Length = 304
Score = 27.5 bits (61), Expect = 7.4
Identities = 22/97 (22%), Positives = 34/97 (35%), Gaps = 1/97 (1%)
Query: 142 ADLRRAQAELRVVQADNE-RVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEY 200
A L RA+AEL QAD + R+ E+ + + +A
Sbjct: 40 AALDRAEAELAAAQADAQARLARAERDRLQALVAQAAIQSEQEADAARAALREALAASRG 99
Query: 201 ERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
++ L A+ + S + SQATLG
Sbjct: 100 AASKAQLEAARDNLRSALALLPISLGRIGASQATLGA 136
>gnl|CDD|201956 pfam01756, ACOX, Acyl-CoA oxidase. This is a family of Acyl-CoA
oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA
into trans-2- enoyl-CoA.
Length = 186
Score = 26.8 bits (60), Expect = 7.6
Identities = 9/37 (24%), Positives = 19/37 (51%)
Query: 110 HELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRR 146
LE+ F R + ++D+++E+ K A++R
Sbjct: 90 WILEKHSGDFLRFGYLSPDQIDQVREQILKLLAEVRP 126
>gnl|CDD|227320 COG4987, CydC, ABC-type transport system involved in cytochrome bd
biosynthesis, fused ATPase and permease components
[Energy production and conversion / Posttranslational
modification, protein turnover, chaperones].
Length = 573
Score = 27.3 bits (61), Expect = 7.7
Identities = 12/52 (23%), Positives = 17/52 (32%), Gaps = 5/52 (9%)
Query: 105 NERLKHELE--RSQTT---FGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
L RSQ T G+ L D + + T A +AQ +
Sbjct: 187 GAHLAQGRAALRSQFTDWVQGQAELLIFGAEDAYRTALEATEASWLKAQRKQ 238
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 27.3 bits (61), Expect = 8.0
Identities = 32/223 (14%), Positives = 88/223 (39%), Gaps = 28/223 (12%)
Query: 12 LSSLEEAIGDKEKQMNQLREQ--------RDRADQERQEERDLHERDIAEYKLKLHAYES 63
+ +++ I K + + R++ +++ D+ +E + + +I E +L
Sbjct: 190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKA-EIEELTDELLNLVM 248
Query: 64 EVEELE--LGKSKAELDKAANEVG---------RSGADWEAARQRLSRLELENERLKHEL 112
++E+ L K K +++ G Q++S ++K +L
Sbjct: 249 DIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKL 308
Query: 113 ERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKV 172
+ Q + +LD ++ ++ + +L ++ + T+ +K
Sbjct: 309 KELQHSL--------EKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA 360
Query: 173 EKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERD 215
+K + + L+A+ + E+ +++E ++ T S L+ E+
Sbjct: 361 KKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKY 403
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 27.3 bits (61), Expect = 8.8
Identities = 17/79 (21%), Positives = 30/79 (37%), Gaps = 2/79 (2%)
Query: 30 REQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEEL--ELGKSKAELDKAANEVGRS 87
R ++ E + E ++ E K E+ L EL +++ L+ +
Sbjct: 197 RRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSE 256
Query: 88 GADWEAARQRLSRLELENE 106
G D R++L R E E
Sbjct: 257 GGDLFEEREQLERQLKEIE 275
>gnl|CDD|118668 pfam10140, YukC, WXG100 protein secretion system (Wss), protein
YukC. Members of this family of proteins include
predicted membrane proteins homologous to YukC in B.
subtilis. The YukC protein family would participate to
the formation of a translocon required for the secretion
of WXG100 proteins (pfam06013) in monoderm bacteria, the
WXG100 protein secretion system (Wss). This family
includes EssB in Staphylococcus aureus.
Length = 359
Score = 27.2 bits (61), Expect = 8.8
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 24 KQMNQLREQRDRADQERQEERDLHERDIAEYKLKL 58
K Q++ D + ERQE+ D E ++ EY +
Sbjct: 325 KYREQVKNDDDLSGDERQEKLDELEDELDEYWKER 359
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 27.0 bits (60), Expect = 9.1
Identities = 13/105 (12%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
Query: 95 RQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVV 154
+ +RL+ + + + +L+ + + + + + E+ ++ A+L + + +++
Sbjct: 76 NELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNLK-QLL 134
Query: 155 QADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
+ E + + +++++ +S E L +++ GE++ + +E
Sbjct: 135 KPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLLGEIQQLAQE 179
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 26.6 bits (59), Expect = 9.7
Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 3/93 (3%)
Query: 12 LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
S L + + K+ Q +L E++ + ++E E D +L+ E + + L
Sbjct: 92 QSRLNKLLSLKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLK 151
Query: 72 KSKAELDKAANEVGRSGADWEAARQRLSRLELE 104
+ K DK E+ R ++ +Q L ELE
Sbjct: 152 EKKEAEDK---EIARLKSEASKIKQELVDAELE 181
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.305 0.122 0.311
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,407,066
Number of extensions: 1088175
Number of successful extensions: 3280
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2393
Number of HSP's successfully gapped: 786
Length of query: 237
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 143
Effective length of database: 6,768,326
Effective search space: 967870618
Effective search space used: 967870618
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 57 (25.6 bits)