RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10418
(389 letters)
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 64.8 bits (158), Expect = 1e-12
Identities = 39/105 (37%), Positives = 69/105 (65%), Gaps = 5/105 (4%)
Query: 14 VFDIASEIGKEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHIS 73
V+D+A+ I +EFE++ID YG + V ++MP VI LE+L+ LAS+N+ +L+ L
Sbjct: 1 VYDLANSIYQEFERLIDNYGEEVVKNLMPLVIRVLENLDSLASENQELEVELELL----- 55
Query: 74 KLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
+ ++E+L + E +E K+ Q+L+E + L++E K+L K++E
Sbjct: 56 REDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIED 100
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 42.7 bits (101), Expect = 3e-04
Identities = 36/174 (20%), Positives = 75/174 (43%), Gaps = 11/174 (6%)
Query: 202 RNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL-------EHEGSSPMKSQQLSPEID 254
N L+ +L LE + + + L+ E ++L L E ++ + E +
Sbjct: 195 LNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEE 254
Query: 255 I----SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLS 310
+ + LQ+L +++ LR ++ E+E+ ++ L +I RL + LR +
Sbjct: 255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANL 314
Query: 311 QVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
+ Q+ L + + + L + E+A L+++L ++E E L E E
Sbjct: 315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELE 368
Score = 40.8 bits (96), Expect = 0.001
Identities = 56/267 (20%), Positives = 120/267 (44%), Gaps = 30/267 (11%)
Query: 54 LASKNERENTKLQELGDHISKLEHE--KLEREVEEIEESWKE----------ETQQLVEM 101
+ E +LQEL + + +L E +LE E+EE+++ + Q L E
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310
Query: 102 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGY 161
+ L+ + ++L +LE S + + E++ L++L+ +++SL + L+E E
Sbjct: 311 LANLERQLEELEAQLEELESKLDELAEELAELE-EKLEELKEELESLEAE--LEELEAEL 367
Query: 162 RNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM 221
+ ++ + + ++ + ++ L+ RL LE+ R+
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLEL------QIASLNNEIERLEARLERLEDRRE---- 417
Query: 222 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGY 281
RLQ+E ++L K+LE + +K Q E L++L+ +++ L + + +EL
Sbjct: 418 --RLQQEIEELLKKLE---EAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
Query: 282 RNSDVDNLKRQIDRLTSTVRELRRKQR 308
+D +R++ +L + + L R Q
Sbjct: 473 AEQALDAAERELAQLQARLDSLERLQE 499
Score = 40.0 bits (94), Expect = 0.002
Identities = 39/191 (20%), Positives = 78/191 (40%), Gaps = 26/191 (13%)
Query: 203 NTLKTRLNELEEERQCFKM--------VNRLQEENKKLSKRLEHEGSSPM---------- 244
L+ L L E + + L+ + ++L +LE S
Sbjct: 284 EELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELE 343
Query: 245 -KSQQLSPEID--ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVR 301
K ++L E++ + L++L +++ L ++ E++L S V L+ QI L + +
Sbjct: 344 EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIE 403
Query: 302 ELRRKQRLSQVQVRGLIDERADFL-----AALQDHQREVAALKQQLGLAEKENEDLYNSQ 356
L + + + L E + L A L++ Q E+ L+++L ++E E L +
Sbjct: 404 RLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEAL 463
Query: 357 ETTSEEERPVQ 367
E EE +
Sbjct: 464 EELREELEEAE 474
Score = 35.4 bits (82), Expect = 0.051
Identities = 53/303 (17%), Positives = 122/303 (40%), Gaps = 59/303 (19%)
Query: 43 KVIVALEHLECLASKNERENTKLQELGDHIS--KLEHEKLEREVEEIEESWKEETQQLVE 100
+ + LE L + E+ +L+EL IS + + +LE EVE++EE + +++L E
Sbjct: 699 ALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTE 758
Query: 101 MVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELG 160
+ ++E ++L + E + + ++L +I ++L+ ++ +LR+ + EL
Sbjct: 759 LEAEIEELEERLEEAEEELAEAEAEIEELEAQI-----EQLKEELKALREALDELRAEL- 812
Query: 161 YRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHD-RNTLKTRLNELEEERQCF 219
L E + R L++ + +
Sbjct: 813 --------------------------------TLLNEEAANLRERLESLERRIAATERRL 840
Query: 220 KMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKEL 279
+ + EE + + L E ++ + E +L ++++L ++ E+ L
Sbjct: 841 EDLEEQIEELSEDIESLAAE----IEELEELIE-------ELESELEALLNERASLEEAL 889
Query: 280 GYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALK 339
S+++ L ++ L S ELRR+ + ++ A L+ + + L+
Sbjct: 890 ALLRSELEELSEELRELESKRSELRRELEELR-------EKLAQLELRLEGLEVRIDNLQ 942
Query: 340 QQL 342
++L
Sbjct: 943 ERL 945
Score = 33.9 bits (78), Expect = 0.16
Identities = 55/280 (19%), Positives = 112/280 (40%), Gaps = 43/280 (15%)
Query: 76 EHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDI 135
E E+LE ++EE+EE E + L E+ L+E ++L + + + L +
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD--- 734
Query: 136 SVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAEL 195
L +L +V+ L +++ KEL +L + + EL
Sbjct: 735 --LARLEAEVEQLEERIAQLSKELT------------ELEAEIEELEERLEE---AEEEL 777
Query: 196 KEILHDRNTLKTRLNELEEERQCFK-MVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID 254
E + L+ ++ +L+EE + + ++ L+ E L++
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA------------------ 819
Query: 255 ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQV 314
LR +++SL ++ E+ L ++ L I+ L + + EL + ++
Sbjct: 820 ----ANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL 875
Query: 315 RGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYN 354
L++ERA AL + E+ L ++L E + +L
Sbjct: 876 EALLNERASLEEALALLRSELEELSEELRELESKRSELRR 915
Score = 31.6 bits (72), Expect = 0.95
Identities = 30/154 (19%), Positives = 66/154 (42%), Gaps = 7/154 (4%)
Query: 209 LNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSL 268
+LE R+ + + E ++ K LE + + ++L E+ L L +++ L
Sbjct: 178 ERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEEL 237
Query: 269 RDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAAL 328
R+++ ++EL +++ L ++ L + ELR ++V L +E + L
Sbjct: 238 REELEELQEELKEAEEELEELTAELQELEEKLEELR-------LEVSELEEEIEELQKEL 290
Query: 329 QDHQREVAALKQQLGLAEKENEDLYNSQETTSEE 362
E++ L+QQ + + +L E +
Sbjct: 291 YALANEISRLEQQKQILRERLANLERQLEELEAQ 324
Score = 29.6 bits (67), Expect = 3.2
Identities = 24/107 (22%), Positives = 48/107 (44%)
Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVR 315
+ +L + LR ++ E+EL +++ L RQI L + L + + ++
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750
Query: 316 GLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEE 362
L E + A +++ + + +++L AE E E+L E EE
Sbjct: 751 QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEE 797
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 42.4 bits (100), Expect = 3e-04
Identities = 45/238 (18%), Positives = 101/238 (42%), Gaps = 7/238 (2%)
Query: 120 GSSPMKSQQLSPEIDISVLQK----LRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLS 175
G S K L+ + ++ L++ L Q++ L ++++ + EL ++
Sbjct: 654 GGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEE 713
Query: 176 NKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEER-QCFKMVNRLQEENKKLSK 234
+ + EL+++ L+ L ELEEE + + + L+EE + L +
Sbjct: 714 LERQLEELK-RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772
Query: 235 RLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQID 294
L + ++ + L++L +++ ++ E+EL + L+++I+
Sbjct: 773 ALAKLKEEIEELEEKRQALQ-EELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831
Query: 295 RLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDL 352
L + EL K + ++ L E + L++ + E L+ +L E+E E+L
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889
Score = 41.6 bits (98), Expect = 6e-04
Identities = 64/322 (19%), Positives = 137/322 (42%), Gaps = 30/322 (9%)
Query: 59 ERENTKLQELGDHISKLEH--EKLEREVEEIEE----SWKEETQQLVEMVNRLQEENKKL 112
ER L+ L D + +LE EKLER+ E+ E + +L ++ +L+E K+L
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241
Query: 113 SKRLEHEGSSPMKSQQLSPEID--ISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVD--N 168
+ E + ++L E++ +++L+ +++ LR+++ ++EL +++
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301
Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
+I L + ++ ELKE + L EE L++
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE---------LEQL 352
Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 288
+L + E E + L++L ++LR+++ E EL ++++
Sbjct: 353 LAELEEAKEEL-----------EEKLSALLEELEELFEALREELAELEAELAEIRNELEE 401
Query: 289 LKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
LKR+I+ L + L + + +++ L E + L++ E+ L++QL
Sbjct: 402 LKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDR 461
Query: 349 NEDLYNSQETTSEEERPVQGPL 370
++L EE + ++ L
Sbjct: 462 LKELERELAELQEELQRLEKEL 483
Score = 40.5 bits (95), Expect = 0.001
Identities = 59/275 (21%), Positives = 122/275 (44%), Gaps = 23/275 (8%)
Query: 51 LECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENK 110
LE + E ++L+EL + + +L+ E LE ++E E + E L E + L+ E +
Sbjct: 262 LEEAEKEIEELKSELEELREELEELQEELLE--LKEEIEELEGEISLLRERLEELENELE 319
Query: 111 KLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSD 170
+L +RLE K + + + ++ + L +++ L ++ ++EL + S +
Sbjct: 320 ELEERLEEL-----KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL 374
Query: 171 IPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM-VNRLQEEN 229
EL E+ + ++ L EL+ E + + + RL E
Sbjct: 375 EELFEALR--------------EELAELEAELAEIRNELEELKREIESLEERLERLSERL 420
Query: 230 KKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNL 289
+ L + L+ E + ++ Q E L++L Q++ LRD+++ E+EL ++ L
Sbjct: 421 EDLKEELK-ELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRL 479
Query: 290 KRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADF 324
++++ L + + L +QR SQ L +
Sbjct: 480 EKELSSLEARLDRLEAEQRASQGVRAVLEALESGL 514
Score = 40.5 bits (95), Expect = 0.002
Identities = 64/293 (21%), Positives = 125/293 (42%), Gaps = 38/293 (12%)
Query: 72 ISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSP 131
+SK + K E E + E T++ +E + L EE +K ++LE + + Q+L
Sbjct: 167 VSKYKERKEEAERKL------ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKA 220
Query: 132 EIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFT 191
E+ L L ++ LR ++ E+EL +++
Sbjct: 221 ELRELELALLLAKLKELRKELEELEEELSRLEEELEE----------------------L 258
Query: 192 TAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSP 251
EL+E + LK+ L EL EE + LQEE +L + +E E + +
Sbjct: 259 QEELEEAEKEIEELKSELEELREE------LEELQEELLELKEEIE-ELEGEISLLRERL 311
Query: 252 EIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQ 311
E + L++L +++ L++++ ++EL R + ++ L++ + L EL K
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371
Query: 312 VQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
++ L + + L + + E+A ++ +L ++E E L E SE
Sbjct: 372 EELEELFEALRE---ELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421
Score = 35.8 bits (83), Expect = 0.047
Identities = 64/323 (19%), Positives = 140/323 (43%), Gaps = 40/323 (12%)
Query: 35 DAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHISKLEHE-----KLEREVEEIEE 89
+ + + ++ E L+ L ++ L+EL + +LE + + +EE E
Sbjct: 674 EELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELE 733
Query: 90 SWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLR 149
+ ++L E + L+EE ++L +RLE ++ + S E ++ L++ +++ R
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEE-----LEEELESLEEALAKLKEEIEELEEKR 788
Query: 150 DQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRL 209
++ + +EL + + L R E++E+ + L+ +L
Sbjct: 789 QALQEELEELEEELEEAER----RLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844
Query: 210 NELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLR 269
+ELEEE + L++E ++L + LE + + + L++L + + L
Sbjct: 845 DELEEE------LEELEKELEELKEELEELEAEKEELE--------DELKELEEEKEELE 890
Query: 270 DQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQ 329
+++R E EL LK +I++L + EL K +V++ L +E +
Sbjct: 891 EELRELESEL-------AELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTL 943
Query: 330 DHQREVAALKQQLGLAEKENEDL 352
+ + E +++ E+E E L
Sbjct: 944 ETELE-----REIERLEEEIEAL 961
Score = 35.5 bits (82), Expect = 0.051
Identities = 25/124 (20%), Positives = 60/124 (48%), Gaps = 3/124 (2%)
Query: 240 GSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTST 299
G S K L+ + ++ L++ ++ L Q+ E+EL +++ +L+ ++ L
Sbjct: 654 GGSRNKRSSLAQKRELKELEE---ELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQ 710
Query: 300 VRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETT 359
+ EL R+ + ++ L +E + L++ + E+ L+++L ++ E+L E+
Sbjct: 711 LEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL 770
Query: 360 SEEE 363
E
Sbjct: 771 EEAL 774
Score = 29.3 bits (66), Expect = 5.1
Identities = 51/261 (19%), Positives = 107/261 (40%), Gaps = 35/261 (13%)
Query: 54 LASKNERENTKLQELGDHISKLEHEKLE-----REVEEIEESWKEETQQLVEMVNRLQEE 108
L + E K Q L + + +LE E E +E ES ++ ++L + + L+EE
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEE 836
Query: 109 NKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 168
++L ++L+ + E L++L+ +++ L + E EL + +
Sbjct: 837 IEELEEKLD--------ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888
Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
+ R +EL E+ + L+ RL ELE + + RL+ E
Sbjct: 889 LE---------------EELRELESELAELKEEIEKLRERLEELEAKLE------RLEVE 927
Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLK-EKELGYRNSDVD 287
+L + LE E ++++ + + G V+ + + E+ S +
Sbjct: 928 LPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQRE 987
Query: 288 NLKRQIDRLTSTVRELRRKQR 308
+L+ ++L + EL +++R
Sbjct: 988 DLEEAKEKLLEVIEELDKEKR 1008
>gnl|CDD|221118 pfam11461, RILP, Rab interacting lysosomal protein. RILP contains
a domain which contains two coiled-coil regions and is
found mainly in the cytosol. RILP is recruited onto late
endosomal and lysosomal membranes by Rab7 and acts as a
downstream effector of Rab7. This recruitment process is
important for phagosome maturation and fusion with late
endosomes and lysosomes.
Length = 60
Score = 36.5 bits (85), Expect = 0.001
Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 191 TTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQ 247
T EL+E+L +RN LK ++ L+EE +K + L E ++ L +P K +
Sbjct: 1 TLQELREVLQERNELKAKVFLLQEELAYYK--SELLPEERRPGLPLPSTKLAPSKQR 55
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 40.8 bits (96), Expect = 0.001
Identities = 61/330 (18%), Positives = 123/330 (37%), Gaps = 68/330 (20%)
Query: 47 ALEHLECLASKNERENT----KLQELGDHISKLEHE--KLEREVEEIEESWKEETQQLVE 100
A + L K E E + + L +E + LE E+E++ E E ++L E
Sbjct: 210 AERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEE 269
Query: 101 MVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELG 160
+ L+E NKK+ E E +K + E +I+ L + ++ E+ L
Sbjct: 270 IEQLLEELNKKIKDLGEEE-QLRVKEKIGELEAEIA---SLERSIAEKERELEDAEERLA 325
Query: 161 YRNSDVDN--SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQC 218
+++D ++I +L E++E R+ L EL+EE +
Sbjct: 326 KLEAEIDKLLAEIEELER-----------------EIEEERKRRDKLTEEYAELKEELE- 367
Query: 219 FKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKE 278
+ L+E +K+ RD+++ ++
Sbjct: 368 -DLRAELEEVDKEF---------------------------------AETRDELKDYREK 393
Query: 279 LGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAAL 338
L +++ LKR++DRL ++ L + + G+ + + +D E+
Sbjct: 394 LEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQ 453
Query: 339 KQQLGLA----EKENEDLYNSQETTSEEER 364
+ +L K ++LY+ +E E+
Sbjct: 454 EWKLEQLAADLSKYEQELYDLKEEYDRVEK 483
Score = 33.5 bits (77), Expect = 0.22
Identities = 13/89 (14%), Positives = 45/89 (50%)
Query: 255 ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQV 314
S L+++ ++D L ++ +++G +++ L+++ ++L + EL + ++
Sbjct: 694 QSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI 753
Query: 315 RGLIDERADFLAALQDHQREVAALKQQLG 343
+ E + A +++ + ++ L++ L
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEALN 782
Score = 31.6 bits (72), Expect = 0.78
Identities = 57/352 (16%), Positives = 136/352 (38%), Gaps = 81/352 (23%)
Query: 60 RENTKLQELGDHISKLEHE---------KLEREVEEIEESWKEETQQLVEMVNRLQEENK 110
E +LQ L + + L+ E ++E ++E+ + + ++++ E+ +++ +
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQ 730
Query: 111 KLSKRLEHEGSSPMKSQQLSPEI--DISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 168
+ K E L EI S L++L +++ L + + E+ L + + +
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790
Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
S IP++ + L++LEEE ++ RL+E
Sbjct: 791 SRIPEIQAE-------------------------------LSKLEEEVS--RIEARLREI 817
Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 288
+KL++ ++ + L EI Q+L+ Q L++Q + +
Sbjct: 818 EQKLNRLT-------LEKEYLEKEI-----QELQEQRIDLKEQ--------------IKS 851
Query: 289 LKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
++++I+ L EL + + +R L D + + ++ L++++ E +
Sbjct: 852 IEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQ 911
Query: 349 NED-----------LYNSQETTSEEERPVQGPLPYEPDDAPWKKSESGIRKL 389
E L +E SE E P ++ + ++ ++++
Sbjct: 912 IEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRV 963
Score = 29.7 bits (67), Expect = 3.9
Identities = 28/168 (16%), Positives = 64/168 (38%), Gaps = 12/168 (7%)
Query: 205 LKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQK---- 260
+ +L L ER+ E + L K + ++ + E A+++
Sbjct: 196 KRQQLERLRRERE-------KAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLAS 248
Query: 261 LRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTS-TVRELRRKQRLSQVQVRGLID 319
L +++ L +++ EK L ++ L ++I L ++ K + ++ L
Sbjct: 249 LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLER 308
Query: 320 ERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEERPVQ 367
A+ L+D + +A L+ ++ E E+L E + +
Sbjct: 309 SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356
Score = 29.3 bits (66), Expect = 4.9
Identities = 25/150 (16%), Positives = 62/150 (41%), Gaps = 4/150 (2%)
Query: 240 GSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTST 299
GS + L + + LQ+LR +++ L+ ++ + EL + +D L +++ +
Sbjct: 658 GSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRK 717
Query: 300 VRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETT 359
+ E+ ++ + + L + + L ++E+ +K +L E E+L E
Sbjct: 718 IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEEL----EED 773
Query: 360 SEEERPVQGPLPYEPDDAPWKKSESGIRKL 389
+ L + + ++ + KL
Sbjct: 774 LHKLEEALNDLEARLSHSRIPEIQAELSKL 803
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 39.7 bits (93), Expect = 0.003
Identities = 47/238 (19%), Positives = 96/238 (40%), Gaps = 28/238 (11%)
Query: 138 LQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTA---- 193
LQ+ + S + + E++L N++++ + + + R
Sbjct: 609 LQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSL 668
Query: 194 --ELKEILHDR--------NTLKTRLNELEEERQCFKMV---NRLQEENKKLSKRLEHEG 240
+L+ + +R L +L +L E++Q F + + ++L+K EG
Sbjct: 669 KDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEG 728
Query: 241 SSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTST- 299
+ QLS I+ +A + + ++ L+ Q + L + V LKRQI+ L +T
Sbjct: 729 ELDNQLAQLSAAIE-AARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTI 787
Query: 300 ---------VRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
VRE R + + + L +ER + L++ + L+Q+L K+
Sbjct: 788 ERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKD 845
Score = 31.6 bits (72), Expect = 0.80
Identities = 18/110 (16%), Positives = 46/110 (41%), Gaps = 1/110 (0%)
Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVR 315
LQ+ + S + + E++L N++++ KR + +++ R + Q + +
Sbjct: 607 ERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQ 666
Query: 316 GLIDERADFLAALQD-HQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
L D+ +A + + ++ L QL ++ + + + E R
Sbjct: 667 SLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELR 716
Score = 29.3 bits (66), Expect = 4.4
Identities = 37/184 (20%), Positives = 71/184 (38%), Gaps = 17/184 (9%)
Query: 193 AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSK---RLEHEGSSPMKSQQL 249
+ TL+ +L E +E L N KL+ LE +
Sbjct: 288 EAKNRLRQQLRTLEDQLKEARDELN-----QELSAANAKLAADRSELELLEDQKGAFE-- 340
Query: 250 SPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRL 309
+ DI LQ Q+ S+R ++ E L +++R+ +RL + + ++ L
Sbjct: 341 --DADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQ-KIKEQLERDL 397
Query: 310 SQVQVRGLI--DERADFLAALQDH-QREVAALKQQLGLAEKE-NEDLYNSQETTSEEERP 365
+ R +E+ AA+++ Q + L+QQL + E NE+ Y + ++
Sbjct: 398 EKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRLGRLKQR 457
Query: 366 VQGP 369
+
Sbjct: 458 LDSA 461
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 35.4 bits (82), Expect = 0.027
Identities = 22/101 (21%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 267 SLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDE------ 320
S R+++ ++EL ++ L+ Q+ L +EL+++ + ++ L E
Sbjct: 63 SARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQ 122
Query: 321 -RADFLAALQDHQR---EVAALKQQLGLAEKENEDLYNSQE 357
A+ + ++++ E+A LKQ+ E ENE L +++
Sbjct: 123 LSANAIELDEENRELREELAELKQENEALEAENERLQENEQ 163
Score = 28.4 bits (64), Expect = 4.3
Identities = 19/118 (16%), Positives = 46/118 (38%), Gaps = 17/118 (14%)
Query: 204 TLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQL--SPEIDISALQKL 261
L+ L EL+EE LQE+ +L + + + + +++ +++L
Sbjct: 70 ELQQELAELQEEL------AELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL 123
Query: 262 RGQVDSLRDQ-MRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLI 318
L ++ L+E+ + LK++ + L + L+ ++ G +
Sbjct: 124 SANAIELDEENRELREE--------LAELKQENEALEAENERLQENEQRRWFLYGGGV 173
Score = 28.4 bits (64), Expect = 5.3
Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 8/87 (9%)
Query: 76 EHEKLEREVEEIEESWKEETQQLVEMVNR---LQEENKKLSKRLEHEGSSPMKSQQLSPE 132
E +L+ E+ E++E E Q+ E+ L+ E ++L K L + +L E
Sbjct: 74 ELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEE 133
Query: 133 IDISVLQKLRGQVDSLRDQMRLKEKEL 159
++LR ++ L+ + E E
Sbjct: 134 -----NRELREELAELKQENEALEAEN 155
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.8 bits (83), Expect = 0.034
Identities = 25/97 (25%), Positives = 50/97 (51%), Gaps = 18/97 (18%)
Query: 213 EEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQM 272
+ ++ + V RL+EEN +L + LE +L EI +KL +++ R ++
Sbjct: 422 KRIKKLEETVERLEEENSELKRELE----------ELKREI-----EKLESELERFRREV 466
Query: 273 R---LKEKELGYRNSDVDNLKRQIDRLTSTVRELRRK 306
R K++E+ R+ ++ L+++++ V EL RK
Sbjct: 467 RDKVRKDREIRARDRRIERLEKELEEKKKRVEELERK 503
Score = 31.2 bits (71), Expect = 0.99
Identities = 25/98 (25%), Positives = 47/98 (47%), Gaps = 22/98 (22%)
Query: 68 LGDHISKLEHEKLERE-------VEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEG 120
L + +SK++ E+ RE +++ ++L E V RL+EEN +L + LE
Sbjct: 390 LAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE--- 446
Query: 121 SSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKE 158
+L EI +KL +++ R ++R K ++
Sbjct: 447 -------ELKREI-----EKLESELERFRREVRDKVRK 472
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 35.7 bits (83), Expect = 0.047
Identities = 52/254 (20%), Positives = 97/254 (38%), Gaps = 40/254 (15%)
Query: 2 DDEYSSTLTVIDVFDIASEIGKEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERE 61
L + + E + ++ + + + + E L+ L S +
Sbjct: 15 KSYKDEVLEALHELGVV-----HIEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKL 69
Query: 62 NTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGS 121
N L+E +S E+L ++VEE E ++E ++L E ++ L+ E K+L + +E
Sbjct: 70 NP-LREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIE---- 124
Query: 122 SPMKSQQLSP----EIDISVLQKLR------GQV--DSLRDQMRLKEKELGYRNSDVDNS 169
+L P ++D+S+L + G V D L + + E S
Sbjct: 125 ------RLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGY 178
Query: 170 DI------PDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVN 223
+LS++ +L R E +K L E+E+ER+
Sbjct: 179 VYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERE------ 232
Query: 224 RLQEENKKLSKRLE 237
L EE K+L+K+
Sbjct: 233 SLLEELKELAKKYL 246
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 34.7 bits (81), Expect = 0.084
Identities = 22/94 (23%), Positives = 37/94 (39%), Gaps = 25/94 (26%)
Query: 79 KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID---- 134
+ E ++E+ K + +L E V L EE K+L K LE +K++ +
Sbjct: 698 EQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQ-----LKAKLAAAAAGDLLA 752
Query: 135 -------ISVLQK---------LRGQVDSLRDQM 152
+ VL LR D L+D++
Sbjct: 753 QAKEVNGVKVLAAQVEGVDAKALRTLADDLKDKL 786
Score = 29.3 bits (67), Expect = 4.2
Identities = 25/100 (25%), Positives = 36/100 (36%), Gaps = 34/100 (34%)
Query: 193 AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 252
A LKE+ LK + +EL E V L EE K+L K LE +K++ +
Sbjct: 701 ALLKEL---AALLKAKPSELPER------VEALLEELKELEKELEQ-----LKAKLAAAA 746
Query: 253 IDISALQK--------------------LRGQVDSLRDQM 272
Q LR D L+D++
Sbjct: 747 AGDLLAQAKEVNGVKVLAAQVEGVDAKALRTLADDLKDKL 786
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 33.2 bits (76), Expect = 0.11
Identities = 20/85 (23%), Positives = 46/85 (54%)
Query: 258 LQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGL 317
+ +L +++ L++ ++L + L +++L+ I L +RELR+ R ++R
Sbjct: 102 IAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREE 161
Query: 318 IDERADFLAALQDHQREVAALKQQL 342
++ + L LQ+ +E+ +L +QL
Sbjct: 162 LERLQENLQRLQEAIQELQSLLEQL 186
Score = 29.7 bits (67), Expect = 2.0
Identities = 16/96 (16%), Positives = 39/96 (40%)
Query: 275 KEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQRE 334
+ ++L ++ LK+Q+ L + +L + + ++ L ++ L+ +
Sbjct: 70 RPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEER 129
Query: 335 VAALKQQLGLAEKENEDLYNSQETTSEEERPVQGPL 370
+ +L++ + KE +L EE R L
Sbjct: 130 LESLEESIKELAKELRELRQDLREEVEELREELERL 165
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 34.4 bits (80), Expect = 0.12
Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 7/82 (8%)
Query: 79 KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH---EGSSPMKSQQLSPEIDI 135
+LE ++E E +QL + V R EE K+ K +E E + + S+ LS ++
Sbjct: 740 ELEDLLKEAAEILGVPPEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEV 799
Query: 136 S----VLQKLRGQVDSLRDQMR 153
V++++ D LR
Sbjct: 800 GGVKVVVEEVDADEDELRKIAN 821
>gnl|CDD|148779 pfam07362, CcdA, Post-segregation antitoxin CcdA. This family
consists of several Enterobacterial post-segregation
antitoxin CcdA proteins. The F plasmid-carried bacterial
toxin, the CcdB protein, is known to act on DNA gyrase
in two different ways. CcdB poisons the gyrase-DNA
complex, blocking the passage of polymerases and leading
to double-strand breakage of the DNA. Alternatively, in
cells that overexpress CcdB, the A subunit of DNA gyrase
(GyrA) has been found as an inactive complex with CcdB.
Both poisoning and inactivation can be prevented and
reversed in the presence of the F plasmid-encoded
antidote, the CcdA protein.
Length = 72
Score = 31.1 bits (71), Expect = 0.14
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 64 KLQELGDHISKLEHEKLEREVEEIE-ESWKEETQQLVEMVNRLQEEN 109
+ + LG ++S L E+ + E WKEE ++ +E NR EEN
Sbjct: 16 QAKALGINLSATLETALREELRREKAERWKEENREAIEEYNRFIEEN 62
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 33.4 bits (77), Expect = 0.18
Identities = 23/90 (25%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTV---RELRRKQRLSQV 312
S LR Q++ + Q++ E EL + + L++Q++ ++ + R+L+ K +S++
Sbjct: 137 SRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRL 196
Query: 313 QVRGLIDERADFLAALQDHQREVAALKQQL 342
++ L ERA+ L + E+ LK+Q+
Sbjct: 197 ELLELERERAEAQGELGRLEAELEVLKRQI 226
Score = 32.3 bits (74), Expect = 0.42
Identities = 23/98 (23%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 256 SALQKLRGQVDSLRDQMR----LKEKELGYRNS------DVDNLKRQIDRLTSTVRELRR 305
+ LQ LR Q++ + +++ LKEK L R + + ++ RL + + L+R
Sbjct: 165 AQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVLKR 224
Query: 306 KQRLSQVQVRGLIDE-RADFLAALQDHQREVAALKQQL 342
+ Q++ + + R + L L + Q +A L+++L
Sbjct: 225 QIDELQLERQQIEQTFREEVLEELTEAQARLAELRERL 262
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 33.6 bits (77), Expect = 0.18
Identities = 64/323 (19%), Positives = 144/323 (44%), Gaps = 43/323 (13%)
Query: 40 IMPKVIVALEHLECLASKNERENTKLQELGDHISKLEH--EKLEREVEEIEESWKEETQQ 97
+ ++ ALE L+ L + +L EL + +++ E+LE+E+EE+E +E ++
Sbjct: 373 LEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432
Query: 98 ---LVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRL 154
L E +N+L+ + +++ P+ Q+L E + +L+ +++ L +++
Sbjct: 433 IKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSR 492
Query: 155 KEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEE 214
+++E R E++E+ + L+ L EL E
Sbjct: 493 EKEEAELRE------------------------------EIEELEKELRELEEELIELLE 522
Query: 215 ERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRL 274
+ + L+E+ +KL LE K Q + ++ L+ ++ L +++RL
Sbjct: 523 LEE--ALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL 580
Query: 275 KEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQRE 334
R +++ L+ ++ L ++EL ++RLSQ++ E ++ L++ + E
Sbjct: 581 LRT----RKEELEELRERLKELKKKLKEL--EERLSQLEELLQSLELSEAENELEEAEEE 634
Query: 335 VAALKQQLGLAEKENEDLYNSQE 357
+ + ++L L + E L + E
Sbjct: 635 LESELEKLNLQAELEELLQAALE 657
Score = 31.3 bits (71), Expect = 1.0
Identities = 63/306 (20%), Positives = 138/306 (45%), Gaps = 34/306 (11%)
Query: 49 EHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEE--SWKEETQQLVEMVNRLQ 106
+ LA E+ QEL + K E E E+EE+EE S ++E +L E + L+
Sbjct: 448 LMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELE 507
Query: 107 EENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDV 166
+E ++L + L ++ + E + L+ L +++ L+++++L++ + R +
Sbjct: 508 KELRELEEELIELLE-LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLED 566
Query: 167 DNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQ 226
ELKE+L + L+TR ELEE R+ K + +
Sbjct: 567 RLQ------------------------ELKELLEELRLLRTRKEELEELRERLKELKKKL 602
Query: 227 EENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDV 286
+E ++ +LE Q L + L++ +++S +++ L+ + + +
Sbjct: 603 KELEERLSQLEEL------LQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAAL 656
Query: 287 DNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAE 346
+ L+ +++ L + +R ++ ++ Q+ ++E L+ + E+ L ++LG E
Sbjct: 657 EELEEKVEELEAEIRRE-LQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIE 715
Query: 347 KENEDL 352
+ E+L
Sbjct: 716 QLIEEL 721
Score = 29.3 bits (66), Expect = 4.7
Identities = 58/342 (16%), Positives = 137/342 (40%), Gaps = 20/342 (5%)
Query: 23 KEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHISKLEHEKLER 82
+++EK+ + + + K+ L L E L+E + KLE + E+
Sbjct: 167 EKYEKLSELLK-EVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQ 225
Query: 83 EVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLR 142
E EE+E+ + ++L E+ + + ++ LE E + E ++ L++L
Sbjct: 226 EEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIR--EEELRELERLL 283
Query: 143 GQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDR 202
+++ +++ E+E+ +++ + + + + + +
Sbjct: 284 EELEEKIERLEELEREIEELEEELEGLRALLEELEELLEK--------LKSLEERLEKLE 335
Query: 203 NTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLR 262
L+ +ELEE + + +L EE K + E ++ + A+Q+L+
Sbjct: 336 EKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELK 395
Query: 263 GQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERA 322
++ L + ++EL +++ L+R+++ L +++L + + +
Sbjct: 396 EELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE--------SKE 447
Query: 323 DFLAALQDHQREVAALKQQL-GLAEKENEDLYNSQETTSEEE 363
+A L + Q+L EKE +LY + EEE
Sbjct: 448 LMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEE 489
>gnl|CDD|216831 pfam01988, VIT1, VIT family. This family includes the vacuolar
Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
transporter VIT1.
Length = 209
Score = 33.0 bits (76), Expect = 0.19
Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 18/68 (26%)
Query: 52 ECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKK 111
E L+ K+ER+ LE +LERE E+E + E ++L E+ +
Sbjct: 52 EYLSVKSERD-------------LEEAELEREKRELENDPEGEREELAEI-----YRARG 93
Query: 112 LSKRLEHE 119
LS+ L +
Sbjct: 94 LSEELARQ 101
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 33.2 bits (76), Expect = 0.25
Identities = 14/77 (18%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 78 EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID--I 135
E++E+ ++ +++ WK+ + L L+E K+L + + +P++ Q++ E D
Sbjct: 167 EEIEKSLKAMQQKWKKRKKDL-PNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLK 225
Query: 136 SVLQKLRGQVDSLRDQM 152
+ + ++ S ++ +
Sbjct: 226 KEGKADKQKIKSAKNDL 242
>gnl|CDD|114908 pfam06216, RTBV_P46, Rice tungro bacilliform virus P46 protein.
This family consists of several Rice tungro bacilliform
virus P46 proteins. The function of this family is
unknown.
Length = 392
Score = 32.7 bits (74), Expect = 0.33
Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 9/69 (13%)
Query: 48 LEHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQE 107
++HL +S N KL + DHI E E ES+K E ++L + V+ L++
Sbjct: 42 IDHLFTFSSDNTERVRKLHIIADHIYLKEFEI---------ESYKLECEKLNDQVSHLKK 92
Query: 108 ENKKLSKRL 116
+N++L ++
Sbjct: 93 QNEELKAQI 101
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 31.6 bits (72), Expect = 0.53
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 4/70 (5%)
Query: 48 LEHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQE 107
+E C ++N+ E+ +E + E + E +E+ EES + + ++L + N+L+E
Sbjct: 13 MEEDCCKENENKEEDKGKEEDLE----FEEIEKEEIIEDSEESNEVKIEELKDENNKLKE 68
Query: 108 ENKKLSKRLE 117
ENKKL LE
Sbjct: 69 ENKKLENELE 78
>gnl|CDD|226291 COG3768, COG3768, Predicted membrane protein [Function unknown].
Length = 350
Score = 31.6 bits (72), Expect = 0.63
Identities = 28/90 (31%), Positives = 39/90 (43%), Gaps = 20/90 (22%)
Query: 296 LTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDH-----QREVAAL-KQQLGLAE--- 346
+ S VRE RR RL Q Q R D AL H ++ V L QQ G+ E
Sbjct: 113 VGSVVREWRRLVRLRQRQ-----HLRDDARDALHSHGVKAGRKAVCELLAQQAGIPETAP 167
Query: 347 -----KENE-DLYNSQETTSEEERPVQGPL 370
+E+ ++ +++E ER V PL
Sbjct: 168 GRQRWQESLHEIIDARELVELAEREVLPPL 197
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 32.0 bits (73), Expect = 0.64
Identities = 52/283 (18%), Positives = 109/283 (38%), Gaps = 36/283 (12%)
Query: 51 LECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRL----- 105
LE E K + G + E E+LER+++EIE + K QL E+
Sbjct: 239 LEEAQRSLESLEKKFRSEGGDL-FEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297
Query: 106 ---QEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRG------QVDSLRDQM---- 152
++ K + E + +Q+ E D +L+ L V + ++
Sbjct: 298 IPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKEIAAELAEID 357
Query: 153 --RLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLN 210
+ E+ +R S + + + L + +L D EL+E+ + + +++
Sbjct: 358 KPATTDSEIPHRLSGSELTQLEVLIQQVKRELQD--AKSQLLKELRELEEELAEVDKKIS 415
Query: 211 ELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRD 270
+ E Q +++ L E +L + + E + L+ L+ +++LR
Sbjct: 416 TIPSEEQIAQLLEELGEAQNELFRS------------EAEIEELLRQLETLKEAIEALRK 463
Query: 271 QMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQ 313
+ K K+ +++ D+ T++E R K ++Q
Sbjct: 464 TLDEKTKQKI-NAFELERAITIADKAKKTLKEFREKLLERKLQ 505
>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein.
Length = 326
Score = 31.4 bits (71), Expect = 0.68
Identities = 18/83 (21%), Positives = 35/83 (42%)
Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVR 315
SA L V ++ ++ +L + +D+ K ++ L +TV+ L+ ++
Sbjct: 42 SATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQALQTSVDSLSSELA 101
Query: 316 GLIDERADFLAALQDHQREVAAL 338
L + AA+ D Q V A
Sbjct: 102 DLSLTVSAHDAAISDLQTTVHAN 124
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 31.7 bits (73), Expect = 0.68
Identities = 13/48 (27%), Positives = 28/48 (58%)
Query: 258 LQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRR 305
L+ L + L +Q+R E +L ++ + L+R+++RL S + +L+
Sbjct: 3 LEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50
Score = 29.4 bits (67), Expect = 3.7
Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 13/63 (20%)
Query: 64 KLQELGDHISKLEH--EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGS 121
+L+ L + S+LE +LE ++ ++E +L+ E ++L LE S
Sbjct: 2 RLEALEERNSELEEQIRQLELKLRDLEA-----------ENEKLERELERLKSELEKLKS 50
Query: 122 SPM 124
P+
Sbjct: 51 PPL 53
>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
Length = 341
Score = 31.4 bits (72), Expect = 0.83
Identities = 16/56 (28%), Positives = 21/56 (37%), Gaps = 10/56 (17%)
Query: 296 LTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQRE-----VAALKQQLGLAE 346
L VRE RR +RL Q R + A L H L +Q G+ +
Sbjct: 115 LGIVVREWRRLRRLRQRA-----HLRDEARALLHSHDVGEARAFCEKLAKQAGIPQ 165
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 31.6 bits (72), Expect = 0.91
Identities = 53/253 (20%), Positives = 112/253 (44%), Gaps = 32/253 (12%)
Query: 57 KNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRL 116
K E E KL+EL + + +LE + + +EE+E+ EE ++L E + +L+ E K L K L
Sbjct: 490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKA-EEYEKLKEKLIKLKGEIKSLKKEL 548
Query: 117 EHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSN 176
E K +L ++D +++ ++ + +ELG+ + + + +L
Sbjct: 549 EKLEELKKKLAELEKKLD---------ELEEELAELLKELEELGFESVEELEERLKEL-- 597
Query: 177 KAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEE-RQCFKMVNRLQEENKKLSKR 235
+P + ELK+ + + L +LEEE + F+ + ++ ++L K
Sbjct: 598 -------EPFYNEYL--ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKE 648
Query: 236 LEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDR 295
LE +L + ++LR + L ++ EL + +K+ +++
Sbjct: 649 LE----------ELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEK 698
Query: 296 LTSTVRELRRKQR 308
L + E + ++
Sbjct: 699 LKEELEEREKAKK 711
Score = 30.0 bits (68), Expect = 2.7
Identities = 51/227 (22%), Positives = 103/227 (45%), Gaps = 36/227 (15%)
Query: 13 DVFDIASEIGKEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHI 72
++ ++ E KE E+++ + + I ++ E LE L E+E +L+EL + I
Sbjct: 190 NIEELIKEKEKELEEVLRE-----INEISSELPELREELEKL----EKEVKELEELKEEI 240
Query: 73 SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 132
+LE E E + +E+ ++L E + L++E ++L ++++ + ++L +
Sbjct: 241 EELEKELESLEGSK--RKLEEKIRELEERIEELKKEIEELEEKVK-------ELKELKEK 291
Query: 133 IDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN--SDIPDLSNKAVYDLDDPNRPRF 190
+ + KL + D++R EK L +++ I +L K
Sbjct: 292 AEEYI--KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK------------- 336
Query: 191 TTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLE 237
L+E+ L+ RL ELEE + ++ +EE ++L KRL
Sbjct: 337 -EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLT 382
>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
Reviewed.
Length = 110
Score = 29.4 bits (67), Expect = 0.91
Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 10/71 (14%)
Query: 67 ELGDHISKLEH--EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH----EG 120
E+ D + LE L +E+ + K++ +L+E L+ EN KL +RLE E
Sbjct: 5 EIFDALDDLEQNLGVLLKELGAL----KKQLAELLEENTALRLENDKLRERLEELEAEEP 60
Query: 121 SSPMKSQQLSP 131
+ K ++
Sbjct: 61 AKEKKKKEGEG 71
>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 262
Score = 30.9 bits (70), Expect = 0.91
Identities = 21/102 (20%), Positives = 39/102 (38%), Gaps = 2/102 (1%)
Query: 246 SQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRR 305
+ S A S+ D++ E+ + + L++Q+ +L + ELR
Sbjct: 19 AAPASAPAISGAPLSS-AGNGSVEDRVTPLERVQNAHSYRLTQLEQQLRQLQGKIEELRG 77
Query: 306 KQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEK 347
Q L Q Q ++ + + A L + AL Q +G
Sbjct: 78 IQEL-QYQNNQNVERQEENEARLDSLESGRQALAQGIGDQSG 118
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 31.1 bits (71), Expect = 0.97
Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 9/96 (9%)
Query: 197 EILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDIS 256
E+ D LK L ELEEE + L+EE +K+ K L+ P K++ P
Sbjct: 324 EVGVDFPELKEELKELEEE------LKELEEELEKIKKLLK---KLPKKARGQLPPEKRE 374
Query: 257 ALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQ 292
L+KL + L +++ E+EL ++++L +
Sbjct: 375 QLEKLLETKEKLSEELEELEEELKELKEELESLYSE 410
Score = 30.7 bits (70), Expect = 1.3
Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 11/121 (9%)
Query: 42 PKVIVALEHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEM 101
P++ L+ LE + E E K+++L + K +L E E E E ++L E
Sbjct: 330 PELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEE 389
Query: 102 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVD-SLRDQMRLKEKELG 160
+ L+EE K+L + LE L E ISV + + V + +++ ++E+G
Sbjct: 390 LEELEEELKELKEELE----------SLYSEGKISVNKTIYPGVKIHIGNKVLRIKREIG 439
Query: 161 Y 161
Sbjct: 440 P 440
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 31.2 bits (71), Expect = 1.0
Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 8/81 (9%)
Query: 192 TAELKEILHDRNTLKTRLNELEEERQCFKMVN-RLQEENKKLSKRLEHEGSSPMKSQQLS 250
AE+KE L+ RL +L E + K N RLQ+ + + ++++ S +
Sbjct: 65 VAEVKE-------LRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKE 117
Query: 251 PEIDISALQKLRGQVDSLRDQ 271
E S Q+L+G +D L+ +
Sbjct: 118 IEQLKSERQQLQGLIDQLQRR 138
Score = 28.8 bits (65), Expect = 5.4
Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 5/91 (5%)
Query: 82 REVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKL 141
+E+ + E + L RLQ+ + + ++++ S + E S Q+L
Sbjct: 69 KELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQL 128
Query: 142 RGQVDSLRDQMRLKEKELGYRNSDVDNSDIP 172
+G +D L Q RL G SD+P
Sbjct: 129 QGLIDQL--QRRLAGVLTGPSGG---GSDLP 154
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This
family consists of several hypothetical bacterial
sequences. The function of this family is unknown.
Length = 106
Score = 29.6 bits (67), Expect = 1.0
Identities = 19/54 (35%), Positives = 34/54 (62%), Gaps = 2/54 (3%)
Query: 66 QELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHE 119
+EL D +S+LE ++L + + E+ E K++ +L+E L+ EN+ L +RLE
Sbjct: 4 KELFDALSELE-QQLNQLLAELGE-LKQQLAELLEENAELRIENEHLRERLEEL 55
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 31.2 bits (71), Expect = 1.1
Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 10/77 (12%)
Query: 51 LECLASKNERENTKLQELGDHISKLEHEKLE----------REVEEIEESWKEETQQLVE 100
LE K D E LE +EVEE ES ++E +L E
Sbjct: 43 LESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEE 102
Query: 101 MVNRLQEENKKLSKRLE 117
+N L EE L + LE
Sbjct: 103 WLNVLDEEKSFLDENLE 119
Score = 30.4 bits (69), Expect = 1.8
Identities = 24/141 (17%), Positives = 51/141 (36%), Gaps = 16/141 (11%)
Query: 197 EILHDRNTLKTR-LNELEEERQCFKMVN---RLQEENKKLSKR---LEHEGSSPMKSQQL 249
+ L + ++ R LNE Q K VN R E +KL K ++ G +
Sbjct: 2 DALGELGLVQIRDLNEDVSAFQ-RKFVNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGK 60
Query: 250 SPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRL 309
L ++ L E E+ ++++L+++I+ L + L ++
Sbjct: 61 PDVPPSKEFLDLEEEILDL-------EAEIKEVEENLESLEKEINELEEWLNVLDEEKSF 113
Query: 310 SQVQVRGLIDERADFLAALQD 330
+ + E ++ +
Sbjct: 114 LDENLEE-LSELSNLDIDFKY 133
Score = 28.5 bits (64), Expect = 6.8
Identities = 29/115 (25%), Positives = 43/115 (37%), Gaps = 16/115 (13%)
Query: 130 SPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPN--- 186
D+S Q R V+ +R E++L S + IP D+
Sbjct: 14 DLNEDVSAFQ--RKFVNEVRR-CDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFL 70
Query: 187 --RPRFTT--AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLE 237
AE+KE+ + +L+ +NELEE N L EE L + LE
Sbjct: 71 DLEEEILDLEAEIKEVEENLESLEKEINELEEWL------NVLDEEKSFLDENLE 119
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 30.3 bits (69), Expect = 1.3
Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 13/87 (14%)
Query: 74 KLEHEKLEREVEEIEESWKEETQQLVEM---------VNRLQEENKKLSKRLEHEGSSPM 124
K EKL++E+EE+++ E Q+ ++ L EE K+L K L+ +
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELE 127
Query: 125 KSQQLSPEIDISVLQKLRGQVDSLRDQ 151
K ++ PE ++KL+ + ++
Sbjct: 128 KYEKNDPE----RIEKLKEETKVAKEA 150
>gnl|CDD|181627 PRK09053, PRK09053, 3-carboxy-cis,cis-muconate cycloisomerase;
Provisional.
Length = 452
Score = 30.8 bits (70), Expect = 1.3
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 4/47 (8%)
Query: 303 LRRKQRLSQVQVRGLIDE---RADFLAALQDHQREVA-ALKQQLGLA 345
LR +QRL+ ++ R L+ + A LA+L + VA AL +L LA
Sbjct: 176 LRHRQRLAALRPRALVLQFGGAAGTLASLGEQALPVAQALAAELQLA 222
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.8 bits (70), Expect = 1.4
Identities = 39/244 (15%), Positives = 79/244 (32%), Gaps = 17/244 (6%)
Query: 79 KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVL 138
++ ++I E + +L + + L+ E L +L K ++ +++ L
Sbjct: 46 EIAALEKKIREQQDQ-RAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNAR-L 103
Query: 139 QKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEI 198
L Q R+Q R ++L N L + +D R +
Sbjct: 104 NALEVQ---EREQRRRLAEQLAALQRSGRNPPPALLVSP-----EDAQRSVRLAIYYGAL 155
Query: 199 LHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISAL 258
R R++ L+ + V + L E ++QQ +
Sbjct: 156 NPAR---AERIDALKATLKQLAAVRAEIAAEQAELTTLLSE----QRAQQAKLAQLLEER 208
Query: 259 QKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLI 318
+K Q++S + K +EL S + N + + RE + + R
Sbjct: 209 KKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAE 268
Query: 319 DERA 322
+R
Sbjct: 269 AKRT 272
>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 843
Score = 30.7 bits (70), Expect = 1.5
Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 5/48 (10%)
Query: 189 RFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL 236
RFTT ELKE+ + ++ LE E + + L+E+ L
Sbjct: 500 RFTTPELKELEEKLLDAEEKILALEYE-----LFDELREKILAHINEL 542
>gnl|CDD|233690 TIGR02025, BchH, magnesium chelatase, H subunit. This model
represents the H subunit of the magnesium chelatase
complex responsible for magnesium insertion into the
protoporphyrin IX ring in the biosynthesis of both
chlorophyll and bacteriochlorophyll. In
chlorophyll-utilizing species, this gene is known as
ChlH, while in bacteriochlorophyll-utilizing spoecies it
is called BchH. Subunit H is the largest (~140kDa) of
the three subunits (the others being BchD/ChlD and
BchI/ChlI), and is known to bind protoporphyrin IX.
Subunit H is homologous to the CobN subunit of
cobaltochelatase and by anology with that enzyme,
subunit H is believed to also bind the magnesium ion
which is inserted into the ring. In conjunction with the
hydrolysis of ATP by subunits I and D, a conformation
change is believed to happen in subunit H causing the
magnesium ion insertion into the distorted
protoporphyrin ring [Biosynthesis of cofactors,
prosthetic groups, and carriers, Chlorophyll and
bacteriochlorphyll].
Length = 1224
Score = 30.9 bits (70), Expect = 1.6
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 70 DHISKLEHEKLEREVEEIEESWK 92
I ++E+L E IEE W
Sbjct: 490 VRIPVEQYERLYPRTERIEEQWG 512
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.2 bits (68), Expect = 1.6
Identities = 17/99 (17%), Positives = 38/99 (38%)
Query: 264 QVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERAD 323
Q S + ++ +S++D LK++ +RL + EL ++ ++ L +E+
Sbjct: 50 QYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQ 109
Query: 324 FLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEE 362
+ RE + E + L E + +
Sbjct: 110 LENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLNQ 148
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 30.5 bits (69), Expect = 1.9
Identities = 41/273 (15%), Positives = 96/273 (35%), Gaps = 28/273 (10%)
Query: 74 KLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEI 133
+L + + E E E+ E ++L+ ++ K L ++L G P
Sbjct: 189 ELANLRKELRQLEKEKQRLERLRRLLPLLAER----KALEQQLAALGEVI-----DLPPD 239
Query: 134 DISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIP-DLSNKAVYDLDDPNRPRFTT 192
+ ++ R ++ + R + L + L +++D + +L +A ++ ++ R
Sbjct: 240 AVERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAA-IEALHQQRGEY 298
Query: 193 AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 252
++ L DR + +E L ++ +
Sbjct: 299 RNAEQDLPDRE----------------GEIANAREAAAALLAQIGPDADEEAVESLRPSL 342
Query: 253 IDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQV 312
+ +L + ++L ++ L R ++ ++ Q+ L + + L+
Sbjct: 343 AAKETVTELEKRKEALDQALKSARDALEERERELKQVRAQLAALPTVTVPEPLRAALADA 402
Query: 313 QVRGLIDER-ADFLAALQDHQREVAALKQQLGL 344
G ID A + +RE+A +LGL
Sbjct: 403 LRAGDIDAALAALEQEVAVAKRELAQALSRLGL 435
Score = 29.0 bits (65), Expect = 6.0
Identities = 18/103 (17%), Positives = 36/103 (34%), Gaps = 1/103 (0%)
Query: 247 QQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQID-RLTSTVRELRR 305
+ L+ I AL + RG+ + + +E E+ L QI E R
Sbjct: 280 ELLAQAAAIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLR 339
Query: 306 KQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
++ V L + AL+ + + +++L +
Sbjct: 340 PSLAAKETVTELEKRKEALDQALKSARDALEERERELKQVRAQ 382
>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal. This domain is the C-terminal
region of the CASP family of proteins. It is a Golgi
membrane protein which is thought to have a role in
vesicle transport.
Length = 245
Score = 29.7 bits (67), Expect = 2.3
Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 202 RNTLKTRLNELEEE-RQCFKMVNRLQEENKKLSK 234
R+ + R ELEEE R+ + ++ L+ E + L
Sbjct: 88 RDRFRKRNTELEEELRKQNQTISSLRRELESLKA 121
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 2.3
Identities = 34/165 (20%), Positives = 70/165 (42%), Gaps = 23/165 (13%)
Query: 192 TAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL-EHEGSSPMKSQQLS 250
AEL+++ + ++ RL E+ + + RL+E + L + + E + K ++
Sbjct: 481 EAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAE 540
Query: 251 P------EIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVREL- 303
E++ A +K R ++ +E+ NS + LK +I+ L +R L
Sbjct: 541 ELRERAAELEAEAEEK-REAAAEAEEEAEEAREEVAELNSKLAELKERIESL-ERIRTLL 598
Query: 304 -----------RRKQRLSQVQVRGLIDERADFLAALQDHQREVAA 337
R +++ + DER + LA ++ +RE+ A
Sbjct: 599 AAIADAEDEIERLREKREALAELN--DERRERLAEKRERKRELEA 641
Score = 29.6 bits (67), Expect = 3.8
Identities = 64/336 (19%), Positives = 131/336 (38%), Gaps = 53/336 (15%)
Query: 49 EHLECLASKNE--RENTKLQELGDHISKLEHEKLEREVEEIEESWKEE---TQQLVEMVN 103
E LE L + + L + + E+LE EE+ + +E Q E
Sbjct: 286 ERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAE 345
Query: 104 RLQEENKKLSKRLE--HEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGY 161
L+E+ L +R E E ++ ++S+ + R +++ L +++ + G
Sbjct: 346 SLREDADDLEERAEELREEAAELESELEEAREAVE---DRREEIEELEEEIEELRERFGD 402
Query: 162 RNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM 221
D+ N++ L+E+ +R+ L+ R ELE + +
Sbjct: 403 APVDLGNAE----------------------DFLEELREERDELREREAELEATLRTAR- 439
Query: 222 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGY 281
R++E ++ L G P Q + + +++ R +V+ L ++ E+E+
Sbjct: 440 -ERVEE-----AEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEE 493
Query: 282 RNSDVDNLK------RQIDRL---TSTVRELRRKQRLS----QVQVRGLIDERADFLAAL 328
++ + +I+RL + EL ++R + + + L ERA L A
Sbjct: 494 VEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEEL-RERAAELEAE 552
Query: 329 QDHQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
+ +RE AA ++ +E NS+ +E
Sbjct: 553 AEEKREAAAEAEEEAEEAREEVAELNSKLAELKERI 588
>gnl|CDD|131743 TIGR02696, pppGpp_PNP, guanosine pentaphosphate synthetase
I/polynucleotide phosphorylase. Sohlberg, et al present
characterization of two proteins from Streptomyces
coelicolor. The protein in this family was shown to have
poly(A) polymerase activity and may be responsible for
polyadenylating RNA in this species. Reference 2 showed
that a nearly identical plasmid-encoded protein from
Streptomyces antibioticus is a bifunctional enzyme that
acts also as a guanosine pentaphosphate synthetase.
Length = 719
Score = 30.2 bits (68), Expect = 2.4
Identities = 27/103 (26%), Positives = 45/103 (43%), Gaps = 7/103 (6%)
Query: 16 DIASEIGK---EFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHI 72
D+A + K EF D Y D ++ V L +A K ERE L E+ +
Sbjct: 242 DLAEKAAKPTGEFPLFPD-YQDDVYEAVEGAVKDELSAALTIAGKQEREE-ALDEVKALV 299
Query: 73 SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKR 115
+ E+ E +EI +++ T++LV R+ E ++ R
Sbjct: 300 AAKLAEQFEGREKEISAAYRAVTKKLVR--ERVLTEGVRIDGR 340
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 30.0 bits (68), Expect = 2.4
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 65 LQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGS 121
L G+H+ H++L+ V + E+ E Q L EM + L+ NK L + L GS
Sbjct: 216 LDGAGEHLGGRIHDELKSHVLPVLEAIVELGQVLQEMKDGLESVNKTLKE-LREGGS 271
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.2 bits (68), Expect = 2.4
Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 30/133 (22%)
Query: 189 RFTTAELKEILHDRNTLKTRLNELE-------EERQCFKMVNRLQEENKKLSKRLEHEGS 241
+ +LKE +++ L R +EL+ EE+ N LQ+EN+KL +RL
Sbjct: 175 KKALEDLKEAQEEKDELAQRCHELDKQVLLLQEEK------NSLQQENEKLQERLAQLEG 228
Query: 242 SPMKSQQLSPEIDISALQKLRGQVDSL----------RDQMRLKEKELGYRNSDVDNLKR 291
S + QL L+ Q++ L RD R+K +EL ++ L+
Sbjct: 229 SSLGPNQLGS----KKYNLLQSQLEQLQEENFRLEAARDDYRIKCEEL---EKELAELQH 281
Query: 292 QIDRLTSTVRELR 304
+ D LTS E +
Sbjct: 282 RNDELTSLAAESQ 294
>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
Members of this protein family are phage lysis
regulatory protein, including the well-studied protein
LysB (lysis protein B) of Enterobacteria phage P2. For
members of this family, genes are found in phage or in
prophage regions of bacterial genomes, typically near a
phage lysozyme or phage holin.
Length = 135
Score = 28.9 bits (65), Expect = 2.4
Identities = 18/87 (20%), Positives = 36/87 (41%), Gaps = 14/87 (16%)
Query: 266 DSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFL 325
+ R + + L + +++ + Q+ L L ++ +Q Q+R + A
Sbjct: 22 RNARADLERANRVLKAQQAELASKANQLIVLL----ALAKRNEEAQAQLR---QQLAQAR 74
Query: 326 AALQDHQREVAALKQQLGLAEKENEDL 352
A L ++ + LK +ENEDL
Sbjct: 75 ALLAQREQRIERLK-------RENEDL 94
>gnl|CDD|147197 pfam04906, Tweety, Tweety. The tweety (tty) gene has not been
characterized at the protein level. However, it is
thought to form a membrane protein with five potential
membrane-spanning regions. A number of potential
functions have been suggested in.
Length = 406
Score = 30.0 bits (68), Expect = 2.5
Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 1/66 (1%)
Query: 278 ELGYRNSDVDNLKRQIDRLTS-TVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVA 336
+L Y + ++ ID L S T L ++ + +R D+L L+ Q V
Sbjct: 93 QLTYSLRNANHTVSGIDNLVSDTTNSLNETLEQHLTRLEEIFAKRTDYLQTLRFLQGNVG 152
Query: 337 ALKQQL 342
QQL
Sbjct: 153 NAVQQL 158
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 29.8 bits (67), Expect = 2.7
Identities = 61/279 (21%), Positives = 116/279 (41%), Gaps = 31/279 (11%)
Query: 56 SKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKR 115
++NER ++ QEL + LE ++ R +EI+E +K +L++ + L+EEN L K+
Sbjct: 55 AENERLSSLSQELKEENEMLELQRG-RLRDEIKE-YKFREARLLQDYSELEEENISLQKQ 112
Query: 116 LEHEGSSPMKSQQLSPE---IDISVLQK----LRGQVDSLRDQMRLKEKELG------YR 162
+ S ++ Q+ E +I L++ L Q++ + EK+L
Sbjct: 113 V-----SVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARLKEIAEKQLEEALETLKT 167
Query: 163 NSDVDNSDIPDLSNK-AVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM 221
+ N+ +LS+ + D + + LK D N+ E+E
Sbjct: 168 EREQKNALRKELSHHLNLEDFYSLSNLSISLDGLK-FSEDPEASTEPNNDGEDENGHLNG 226
Query: 222 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDS-------LRDQMRL 274
L + K E +P S S E+++S +QKL+ Q++ L ++
Sbjct: 227 GPGLAKSLGTPRKG-ELFPPAPGVSDLFS-ELNLSEIQKLKQQLEQVEREKASLLANLQE 284
Query: 275 KEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQ 313
+ +L + +++RLT + LR Q + Q
Sbjct: 285 LQTQLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQ 323
>gnl|CDD|147523 pfam05384, DegS, Sensor protein DegS. This is small family of
Bacillus DegS proteins. The DegS-DegU two-component
regulatory system of Bacillus subtilis controls various
processes that characterize the transition from the
exponential to the stationary growth phase, including
the induction of extracellular degradative enzymes,
expression of late competence genes and down-regulation
of the sigma D regulon. The family also contains one
sequence from Thermoanaerobacter tengcongensis which is
described as a sensory transduction histidine kinase.
Length = 159
Score = 28.7 bits (65), Expect = 3.0
Identities = 15/44 (34%), Positives = 33/44 (75%), Gaps = 4/44 (9%)
Query: 73 SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRL 116
++ E+E+LE+E+EE+ KEE ++++ V++L+++ ++ KRL
Sbjct: 25 ARQEYERLEQELEEV----KEEVSEVIKEVDKLEKQERRARKRL 64
>gnl|CDD|236123 PRK07883, PRK07883, hypothetical protein; Validated.
Length = 557
Score = 29.9 bits (68), Expect = 3.0
Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 12/55 (21%)
Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLS 310
+ L LR ++D L R +E R DRL + +R L R QRL+
Sbjct: 406 AVLAALRARIDRLAAAERFEEAA------------RLRDRLAALLRALARAQRLA 448
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 29.6 bits (66), Expect = 3.3
Identities = 49/326 (15%), Positives = 114/326 (34%), Gaps = 86/326 (26%)
Query: 59 ERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
T+ L + I E K+ ++++ + E W+ + N + +K +
Sbjct: 274 ANLKTQNDNLYEKIQ--EAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEW--- 328
Query: 119 EGSSPMKSQQLSPEIDI--SVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSN 176
P K ++L EI++ ++ L+ +D L Q+R +
Sbjct: 329 ----PGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQ--------------------- 363
Query: 177 KAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL 236
+T + + + +R L + ++++ ++ KL+K +
Sbjct: 364 ------------GISTEQFELMNQEREKL-------------TRELDKINIQSDKLTKSV 398
Query: 237 EHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDV---------- 286
KS++L + +L+K Q DSL + ++G+ +D
Sbjct: 399 --------KSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLF 450
Query: 287 -------DNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQR----EV 335
+++K+ I L ++E + + + + I + L E+
Sbjct: 451 PKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDINELTQILEKLELEL 510
Query: 336 AALKQQLGLAEKENEDLYNSQETTSE 361
+ + L+++ENE +Q E
Sbjct: 511 SEANSKFELSKEENERELVAQRIEIE 536
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 29.6 bits (66), Expect = 3.4
Identities = 32/177 (18%), Positives = 65/177 (36%), Gaps = 11/177 (6%)
Query: 194 ELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEI 253
E + +R ++ L + +ER+ + L + L+K + ++Q L +
Sbjct: 96 EKRAAETEREAARSELQKARQEREAVR--QELAAARQNLAKAQQELARLTKQAQDLQTRL 153
Query: 254 DISALQ--KLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQ 311
A Q +L Q SL+ + + S V +LK + ++ + L +
Sbjct: 154 KTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRAN--- 210
Query: 312 VQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEERPVQG 368
+ +E A AA Q + + Q ++ + + E E ER +Q
Sbjct: 211 -AAQARTEELARRAAAAQQTAQAIQQRDAQ---ISQKAQQIAARAEQIRERERQLQR 263
>gnl|CDD|226873 COG4467, COG4467, Regulator of replication initiation timing
[Replication, recombination, and repair].
Length = 114
Score = 28.2 bits (63), Expect = 3.5
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 6/55 (10%)
Query: 67 ELGDHISKLEHE--KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHE 119
E+ D + LE + L E+ + K+ LVE L+ EN+KL +RL
Sbjct: 5 EIFDQVDNLEEQLGVLLAELGGL----KQHLGSLVEENTALRLENEKLRERLGEP 55
>gnl|CDD|224365 COG1448, TyrB, Aspartate/tyrosine/aromatic aminotransferase [Amino
acid transport and metabolism].
Length = 396
Score = 29.5 bits (67), Expect = 3.6
Identities = 18/77 (23%), Positives = 28/77 (36%), Gaps = 23/77 (29%)
Query: 85 EEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRG- 143
E+ W++E L EM R+ E + L L+ G + + RG
Sbjct: 301 PELRAEWEQE---LEEMRQRILEMRQALVDALKALG----------APRNFDFIISQRGM 347
Query: 144 ---------QVDSLRDQ 151
QVD LR++
Sbjct: 348 FSYTGLSPEQVDRLREE 364
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 29.2 bits (66), Expect = 3.6
Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 73 SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 132
+ + E E E E + E KEE Q + QE K+L +++E E ++L E
Sbjct: 209 ERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAE------REKLLAE 262
Query: 133 IDISVLQKLRGQVDSLRDQMRLKEKEL 159
+ + KL+ Q + L++ + + + L
Sbjct: 263 QERMLEHKLQEQEELLKEGFKTEAESL 289
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 29.7 bits (67), Expect = 3.9
Identities = 13/62 (20%), Positives = 26/62 (41%), Gaps = 6/62 (9%)
Query: 56 SKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKR 115
S + EN + + + I++L+ ++EE + +N+L E KL +
Sbjct: 916 SSDLIENLEFKT--ELIARLKKLLNNIDLEEG----PSIEYVKLPELNKLHEVESKLKET 969
Query: 116 LE 117
E
Sbjct: 970 SE 971
Score = 28.5 bits (64), Expect = 7.1
Identities = 40/238 (16%), Positives = 80/238 (33%), Gaps = 45/238 (18%)
Query: 135 ISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAE 194
+ L+ + SL+ E E+ + + I +L K A
Sbjct: 884 LQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELI-----------AR 932
Query: 195 LKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID 254
LK++L++ + + E + + +N+L E KL ++ S E +
Sbjct: 933 LKKLLNNIDLEEGPSIEYVKLPE----LNKLHEVESKL--------------KETSEEYE 974
Query: 255 --ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQV 312
+ L + + +++ +KEL + L+ +L E+ Q S++
Sbjct: 975 DLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKI 1034
Query: 313 ---------QVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSE 361
++ L + L Q ALK + +EN L + Q E
Sbjct: 1035 ISSESTELSILKPLQKLKGLLLLENNQLQARYKALKLR-----RENSLLDDKQLYQLE 1087
>gnl|CDD|220597 pfam10139, Virul_Fac, Putative bacterial virulence factor. Members
of this family of prokaryotic proteins include various
putative virulence factor effector proteins. Their exact
function is, as yet, unknown.
Length = 852
Score = 29.3 bits (66), Expect = 4.0
Identities = 21/91 (23%), Positives = 29/91 (31%), Gaps = 8/91 (8%)
Query: 295 RLTSTVRELRRKQRLSQVQVRGLIDERAD-------FLAALQDHQREVAALKQQLGLAEK 347
+L REL + Q DERA+ L ALQ+ L + L
Sbjct: 560 QLDELQRELGENLLANYYQADD-EDERAEKRRIAEQLLRALQERAHRHGELLEGLLPPRD 618
Query: 348 ENEDLYNSQETTSEEERPVQGPLPYEPDDAP 378
+ LY Q+ S + P P
Sbjct: 619 QLRRLYLQQQEFSGGAADLSAPGEVADPPRP 649
>gnl|CDD|236510 PRK09428, pssA, phosphatidylserine synthase; Provisional.
Length = 451
Score = 29.4 bits (67), Expect = 4.1
Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 5/36 (13%)
Query: 174 LSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRL 209
L++ AV LD PNRP + KEI +D + RL
Sbjct: 192 LNSPAVNRLDQPNRP-----KTKEIKNDIRQFRQRL 222
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.4 bits (67), Expect = 4.2
Identities = 32/131 (24%), Positives = 60/131 (45%), Gaps = 26/131 (19%)
Query: 15 FDIASEIGKEFEKIIDQ----YGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGD 70
F+IA +G E II++ G D K+ + LE L + E++ + + L
Sbjct: 491 FEIAKRLGLP-ENIIEEAKKLIGEDKE-----KLNELIASLEELERELEQKAEEAEAL-- 542
Query: 71 HISKLEHEKLEREVEEIEESWKEETQQLV--------EMVNRLQEENKKLSKRL---EHE 119
E EKL+ E+EE +E +EE +L+ + + ++E ++ K L +
Sbjct: 543 ---LKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKG 599
Query: 120 GSSPMKSQQLS 130
G + +K+ +L
Sbjct: 600 GYASVKAHELI 610
>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
protein. This family of transmembrane coiled-coil
containing proteins is conserved from worms to humans.
Its function is unknown.
Length = 387
Score = 29.0 bits (65), Expect = 4.2
Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 15/90 (16%)
Query: 258 LQKLRGQVDSLRD-QMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRG 316
L KL ++ +++ Q RL+E + L+R +T +++E R + + Q+
Sbjct: 207 LDKLLEELREIKEGQSRLEESYERLKEQ----LQRDYQYITQSLQEERYRYERLEEQLND 262
Query: 317 LIDERADFLAALQDHQREVAALKQQLGLAE 346
L + HQ E+A LKQ+L E
Sbjct: 263 L----------TELHQNEIANLKQELASME 282
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 29.3 bits (66), Expect = 4.4
Identities = 12/41 (29%), Positives = 21/41 (51%)
Query: 78 EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
+ E +++E+ + K +L + + R EE K L K LE
Sbjct: 704 NEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELES 744
>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein.
Length = 554
Score = 29.3 bits (65), Expect = 4.5
Identities = 24/101 (23%), Positives = 47/101 (46%), Gaps = 2/101 (1%)
Query: 52 ECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKK 111
+ AS+N + + +L D ++++ +E E+S KE Q+ ++ VN +E +
Sbjct: 286 DEFASENAIALSAIGDLKDDFLAFLEDRIKNLIEMSEDSLKESVQRNIDFVNSGFQELED 345
Query: 112 LSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQM 152
S L+ E + LS + ++ + L+G D L D
Sbjct: 346 FSIGLKEELGG--LKKDLSEQQNLEAEEILQGFNDFLHDSK 384
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of this
family are annotated as being transmembrane proteins
induced by tumour necrosis factor alpha, but no
literature was found to support this.
Length = 330
Score = 28.9 bits (65), Expect = 4.5
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 14/98 (14%)
Query: 79 KLEREVEEIEESWK--EET----QQLVEMVNRLQE----ENKKLSKRLEHEGSSPMK--S 126
+ E E +E+E+ ++ +ET +Q +E V +LQE + KRL + +K
Sbjct: 1 ECEEEWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRL-KQLIVSLKKLK 59
Query: 127 QQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNS 164
+ L+PE D ++++L Q+ ++Q E L +N
Sbjct: 60 KSLTPE-DSELVEQLEEQIKERKNQFFDMESFLPKKNG 96
>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase. This family of
alanyl-tRNA synthetases is limited to the archaea, and
is a subset of those sequences identified by the model
pfam07973 covering the second additional domain (SAD) of
alanyl and threonyl tRNA synthetases.
Length = 902
Score = 29.2 bits (66), Expect = 4.8
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 78 EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
++LE + E + K +QL E V R EE K+ K +E
Sbjct: 743 QELEDLLRESSDILKVPPEQLPETVKRFFEEWKEQRKEIER 783
>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
This family consists of a series of primate specific
nuclear pore complex interacting protein (NPIP)
sequences. The function of this family is unknown but is
well conserved from African apes to humans.
Length = 350
Score = 29.0 bits (64), Expect = 4.8
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 8/60 (13%)
Query: 56 SKNERENTKLQELGDHISKL-----EHEKLEREVEEIEESWK---EETQQLVEMVNRLQE 107
+ +++ N K + +H+ KL EH + ER+V E EE+ K +E EM R Q
Sbjct: 131 NHHDKINGKRKTAKEHLRKLSMKECEHAEKERQVSEAEENGKLDMKEIHTYKEMFQRAQA 190
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 29.1 bits (66), Expect = 5.0
Identities = 9/62 (14%), Positives = 23/62 (37%)
Query: 289 LKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
L++++ L + R++ SQ E + + + L+ QL +++
Sbjct: 147 LQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEK 206
Query: 349 NE 350
Sbjct: 207 AA 208
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 29.1 bits (65), Expect = 5.0
Identities = 68/324 (20%), Positives = 123/324 (37%), Gaps = 22/324 (6%)
Query: 54 LASKNERENTKLQELGDHISKLEHE--KLEREVEEIEESWKEETQQLVEMVNRLQEE--N 109
LA ERE + EL + LE K E E E EE K +L +LQ E +
Sbjct: 98 LAENYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKED 157
Query: 110 KKLSKRLEHEGSSPMKSQ-QLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 168
+ + S+ Q + + L+ L +++ LR+Q+ +KEL
Sbjct: 158 QLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEA-----E 212
Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
+ L+++ D+ + + ELK D +K+ +L + + + + L+EE
Sbjct: 213 KKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREE 272
Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKEL--------- 279
N+KL E + +K + + + +K+R ++ L + E EL
Sbjct: 273 NRKLRSM--KEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKSWKSLLQD 330
Query: 280 -GYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAAL 338
G D+L R+I L + +L+ K + L Q E+ L
Sbjct: 331 IGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILEL 390
Query: 339 KQQLGLAEKENEDLYNSQETTSEE 362
K++L + L ++E
Sbjct: 391 KKKLEALKALVRRLQRRLTLVTKE 414
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 28.8 bits (65), Expect = 5.5
Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 65 LQELGDHISKLEHEKLEREVEEIEESWKEE-TQQLVEMVNRLQEENKKLSKRL 116
L+E GD + E EK+E+ V E++E+ K E +++ LQ+ +KL++ +
Sbjct: 541 LKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEEIKAKTEELQQALQKLAEAM 593
>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
biogenesis, outer membrane].
Length = 284
Score = 28.5 bits (64), Expect = 5.5
Identities = 25/107 (23%), Positives = 46/107 (42%), Gaps = 9/107 (8%)
Query: 54 LASKNERENTKLQELGDHISK--LEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKK 111
+ S + E D + + + L E EE+++ E +QL+E V L+EENK+
Sbjct: 43 VLSPVQSVVAAPFEFVDGVLEFLKSLKDLALENEELKKE-LAELEQLLEEVESLEEENKR 101
Query: 112 LSKRL----EHEGSSPMKSQQLSPEIDISVLQKL--RGQVDSLRDQM 152
L + L P+ ++ +S D + +G D ++ M
Sbjct: 102 LKELLDFKESSSDYDPIAARVISRSPDPWSQTIVIDKGSNDGIKKGM 148
Score = 28.1 bits (63), Expect = 8.7
Identities = 22/88 (25%), Positives = 39/88 (44%), Gaps = 9/88 (10%)
Query: 191 TTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL----EHEGSSPMKS 246
LK++ + LK L ELE+ + V L+EENK+L + L P+ +
Sbjct: 64 FLKSLKDLALENEELKKELAELEQLLE---EVESLEEENKRLKELLDFKESSSDYDPIAA 120
Query: 247 QQLSPEIDISALQKL--RGQVDSLRDQM 272
+ +S D + + +G D ++ M
Sbjct: 121 RVISRSPDPWSQTIVIDKGSNDGIKKGM 148
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 28.7 bits (64), Expect = 7.4
Identities = 30/140 (21%), Positives = 63/140 (45%), Gaps = 11/140 (7%)
Query: 186 NRPRFTTAE-LKEILHDR-------NTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLE 237
N+ R E L++IL ++ N L+ RL+E + + E ++ ++L
Sbjct: 148 NQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKLR 207
Query: 238 HEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMR-LKEKELGYRNSD--VDNLKRQID 294
+E ++ L L L+ + L+D ++ LK + + ++ V L+++
Sbjct: 208 NELLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIEVAETEERVFKLEKERS 267
Query: 295 RLTSTVRELRRKQRLSQVQV 314
L +++REL K ++Q V
Sbjct: 268 LLDASLRELESKFIVAQEDV 287
>gnl|CDD|222347 pfam13729, TraF_2, F plasmid transfer operon, TraF, protein.
Length = 267
Score = 28.0 bits (63), Expect = 7.5
Identities = 18/67 (26%), Positives = 25/67 (37%), Gaps = 7/67 (10%)
Query: 279 LGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAAL 338
+G + SD DNL + D L+ L S V A AL+D + A
Sbjct: 25 VGAQVSDPDNLVDKFDDLSDIYDALSSALDNSDVD-----AAAARLAGALRDLDGDTAYA 79
Query: 339 KQQLGLA 345
Q G+
Sbjct: 80 --QAGVG 84
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 28.0 bits (63), Expect = 8.0
Identities = 40/207 (19%), Positives = 83/207 (40%), Gaps = 32/207 (15%)
Query: 51 LECLASKNERENTKLQELGD---------HISKLEHEKLEREVEEIEESWKEETQQLVEM 101
LE L S+ ER ++L I +L + + + E +L+E+
Sbjct: 64 LERLLSRIERGRASPRDLLRLYDSLEGLKEIRQLLESLDGPLLGLLLKVILEPLLELLEL 123
Query: 102 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDIS--VLQKLRGQVDSLRDQMRLKEKEL 159
+ L ++ L + G + PE+D L++L +++ L + R +EL
Sbjct: 124 LLELLNDDDPL--EVNDGG---LIKDGFDPELDELREKLEELEEELEELLKKER---EEL 175
Query: 160 GYRNSDVDNSDIP----DLSNKAVYDLDD--------PNRPRFTTAELKEILHDRNTLKT 207
G + + + + +++ + N RFTT ELKE+ ++ K
Sbjct: 176 GIDSLKLGYNKVHGYYIEVTKSEAKKVPKDFIRRSSLKNTERFTTPELKELENELLEAKE 235
Query: 208 RLNELEEERQCFKMVNRLQEENKKLSK 234
+ LE+E +++ ++ E ++L
Sbjct: 236 EIERLEKE-ILRELLEKVLEYLEELRA 261
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 27.8 bits (62), Expect = 8.3
Identities = 30/165 (18%), Positives = 65/165 (39%), Gaps = 21/165 (12%)
Query: 54 LASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLS 113
L + ++L + + +L E + ++ E ++ ++ L+E +L
Sbjct: 19 LLDSLKELPAAAEKLSEALQELGKELPDLSNTDL----GEALEKFGKIQKELEEFRDQLE 74
Query: 114 KRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQM-RLKEKELGYRNSDVDNSDIP 172
++LE++ P+K D+ ++K R ++D R +EK R D+S +
Sbjct: 75 QKLENKVIEPLKEYLKK---DLKEIKKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLA 131
Query: 173 DLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQ 217
+ + ++ R L LN+LEE+R+
Sbjct: 132 EAEEE-------------LQEAKEKYEELREELIEDLNKLEEKRK 163
>gnl|CDD|204811 pfam12026, DUF3513, Domain of unknown function (DUF3513). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 192 to 218 amino acids in length. This domain is
found associated with pfam00018, pfam08824. This domain
has a conserved QPP sequence motif.
Length = 208
Score = 27.8 bits (62), Expect = 8.6
Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 3/57 (5%)
Query: 59 ERENTKLQELGDHISKLEHEKLEREVEEIEE---SWKEETQQLVEMVNRLQEENKKL 112
+R +L E G + E+LEREV + E S + L E+++L
Sbjct: 30 QRTQKELLEKGPITQLKQFERLEREVAKPAEADLSGWSPGDLTPPPSSSLSPEDRQL 86
>gnl|CDD|220517 pfam10005, DUF2248, Uncharacterized protein conserved in bacteria
(DUF2248). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 340
Score = 28.0 bits (63), Expect = 8.7
Identities = 9/13 (69%), Positives = 10/13 (76%)
Query: 319 DERADFLAALQDH 331
DER D+ AALQ H
Sbjct: 209 DERQDYGAALQRH 221
>gnl|CDD|131187 TIGR02132, phaR_Bmeg, polyhydroxyalkanoic acid synthase, PhaR
subunit. This model describes a protein, PhaR,
localized to polyhydroxyalkanoic acid (PHA) inclusion
granules in Bacillus cereus and related species. PhaR is
required for PHA biosynthesis along with PhaC and may be
a regulatory subunit.
Length = 189
Score = 27.5 bits (60), Expect = 8.7
Identities = 22/107 (20%), Positives = 42/107 (39%), Gaps = 5/107 (4%)
Query: 261 LRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRK--QRLSQVQVRGLI 318
L +VD + + K EL + LK+ + +L ++ L +K + L ++ +
Sbjct: 84 LEEKVDLIEEFFDDKFDELEAQQEQAPALKKDVTKLKQDIKSLDKKLDKILELLEGQ--- 140
Query: 319 DERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEERP 365
+ D L Q + + Q L EK+ + EE +P
Sbjct: 141 QKTQDELKETIQKQIKTQGEQLQAQLLEKQEALAAKLKAEAKEEAKP 187
>gnl|CDD|152481 pfam12046, DUF3529, Protein of unknown function (DUF3529). This
family of proteins is functionally uncharacterized. This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 176 to 190 amino acids
in length.
Length = 173
Score = 27.6 bits (62), Expect = 9.0
Identities = 12/47 (25%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 304 RRKQRLSQVQVRGLIDERADFLAAL--QDHQREVAALKQQLGLAEKE 348
+R R QV+++ L++ D + L + H+ E+ L ++L L
Sbjct: 128 QRASRPEQVELK-LVEADDDGGSDLRVRAHRDELIELAKELQLKSDG 173
>gnl|CDD|234005 TIGR02774, rexB_recomb, ATP-dependent nuclease subunit B. DNA
repair is accomplished by several different systems in
prokaryotes. Recombinational repair of double-stranded
DNA breaks involves the RecBCD pathway in some lineages,
and AddAB (also called RecAB) in other. The AddA protein
is conserved between the firmicutes and the
alphaproteobacteria, while the partner protein is not.
The partner may be designated AddB, as in Bacillus and
in alphaproteobacteria, or RexB as in Streptococcus and
Lactococcus. Note, however, that RexB proteins lack an
N-terminal GxxGxGK[ST] ATP-binding motif found in
Bacillus subtilis and related species, and this
difference may be important; this model represents
specifically RexB proteins as found in Streptococcus and
Lactococcus [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1076
Score = 28.3 bits (63), Expect = 9.2
Identities = 17/60 (28%), Positives = 22/60 (36%)
Query: 68 LGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQ 127
L ++ L E E E+ EE WK T L + +E L L S SQ
Sbjct: 475 LTKNLQDLATTLSEVEQEKQEEVWKTFTDILEQFATIFGQEKLSLDDFLALLHSGMSLSQ 534
>gnl|CDD|129266 TIGR00162, TIGR00162, TIGR00162 family protein. This model
represents one out of two closely related ortholgous
sets of proteins that, so far, are found only in but are
universal among the Archaea. This ortholog set includes
MJ1210 from Methanococcus jannaschii and AF0525 from
Archaeoglobus fulgidus while excluding MJ0106 and AF1251
[Hypothetical proteins, Conserved].
Length = 188
Score = 27.5 bits (61), Expect = 9.7
Identities = 9/34 (26%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 75 LEHEKLEREVEEIEESWKEETQQLVEMVNRLQEE 108
+ LE VE +EE +E ++++ + ++EE
Sbjct: 138 CKMLSLEVSVEALEER-AKEMEKIIAKIKEMEEE 170
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.129 0.344
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,970,960
Number of extensions: 2013414
Number of successful extensions: 4810
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4108
Number of HSP's successfully gapped: 706
Length of query: 389
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 290
Effective length of database: 6,546,556
Effective search space: 1898501240
Effective search space used: 1898501240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (26.9 bits)