RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy597
(546 letters)
>gnl|CDD|220173 pfam09311, Rab5-bind, Rabaptin-like protein. Members of this
family are predominantly found in Rabaptin and allow for
binding to the GTPase Rab5. This interaction is
necessary and sufficient for Rab5-dependent recruitment
of Rabaptin5 to early endosomal membranes.
Length = 180
Score = 171 bits (436), Expect = 8e-51
Identities = 91/177 (51%), Positives = 115/177 (64%), Gaps = 2/177 (1%)
Query: 77 ELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANS 136
EL L+ +QV A+ LQ +EA +QKLR Q+RRL QEN WLR ELA TQQ+LQ+SEQ
Sbjct: 1 ELQLSFSQVQEAVQDQLQELEASRQKLRAQLRRLQQENQWLRGELALTQQELQASEQEVI 60
Query: 137 QLEEEKKHLEFMSSIRKYDSDIQGDENSGDQKSDKHKQDDPVVDLFPDEDNDDRNNLS-P 195
QL EE KHL+F+ R+ + + ++ K + DP+ +L P+ + + + P
Sbjct: 61 QLPEEVKHLQFLCLQRREELIESRTASEHLEEELKSEI-DPLKELKPNLEEELAAEIELP 119
Query: 196 TPPSQFAQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKY 252
S+ Q + YEIPARLRTLHNLVIQY SQGRYEVAVPLCK ALEDLEK SG
Sbjct: 120 QIQSEKDQLQASLYEIPARLRTLHNLVIQYESQGRYEVAVPLCKAALEDLEKESGHK 176
Score = 30.8 bits (70), Expect = 0.98
Identities = 22/49 (44%), Positives = 29/49 (59%)
Query: 328 GENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDV 376
A TL+NL + Y +G+Y+ A PLCK ALE EK G +HPD+
Sbjct: 132 LYEIPARLRTLHNLVIQYESQGRYEVAVPLCKAALEDLEKESGHKHPDL 180
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 76.3 bits (188), Expect = 3e-17
Identities = 38/79 (48%), Positives = 45/79 (56%), Gaps = 1/79 (1%)
Query: 289 HPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKR 348
HPD+A LN LALV R Y EA LL AL + + LGE+H A LNNLA LY
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARE-LGEDHPETARALNNLARLYLAL 59
Query: 349 GKYKEAEPLCKRALEIREK 367
G Y EA ++AL +RE
Sbjct: 60 GDYDEALEYLEKALALREA 78
Score = 72.8 bits (179), Expect = 4e-16
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 331 HAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQ 390
H +AA LNNLA++ + G Y EA L ++ALE+ LG++HP+ A+ LNNLA L
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELAR-ELGEDHPETARALNNLARLYLAL 59
Query: 391 SKYEEVERYYQRALEIYE 408
Y+E Y ++AL + E
Sbjct: 60 GDYDEALEYLEKALALRE 77
Score = 65.9 bits (161), Expect = 1e-13
Identities = 30/79 (37%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 373 HPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQ 432
HPD+A LNNLAL+ + Y+E ++ALE+ +LG D A+ NNLA YL
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELAR-ELGEDHPETARALNNLARLYLAL 59
Query: 433 GKYKEAEILYKQVLTRAHE 451
G Y EA ++ L
Sbjct: 60 GDYDEALEYLEKALALREA 78
Score = 52.8 bits (127), Expect = 5e-09
Identities = 27/117 (23%), Positives = 38/117 (32%), Gaps = 39/117 (33%)
Query: 415 DSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAK 474
++A NNLA + G Y EA
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEAL----------------------------------- 25
Query: 475 NREKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
+ ++ALE+ +LG D A+ NNLA YL G Y EA ++ L
Sbjct: 26 ---ELLEKALELAR-ELGEDHPETARALNNLARLYLALGDYDEALEYLEKALALREA 78
Score = 47.4 bits (113), Expect = 3e-07
Identities = 29/114 (25%), Positives = 39/114 (34%), Gaps = 37/114 (32%)
Query: 212 PARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKYEAAETLEDCALRSRKESYD 271
P L+NL + G Y+ A+ L ++ALE +
Sbjct: 2 PDLAAALNNLALVLRRLGDYDEALELLEKALELAREL----------------------- 38
Query: 272 IVKQAKVAQILGSGGHDHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREK 325
G DHP+ A LN LA +Y Y EA L AL +RE
Sbjct: 39 --------------GEDHPETARALNNLARLYLALGDYDEALEYLEKALALREA 78
Score = 35.0 bits (81), Expect = 0.009
Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 495 DSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGD 540
++A NNLA + G Y EA L ++ L A RE G +
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELA--RELGEDHPE 44
Score = 27.7 bits (62), Expect = 3.6
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 207 AGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALE 243
G + P R L+NL Y + G Y+ A+ ++AL
Sbjct: 38 LGEDHPETARALNNLARLYLALGDYDEALEYLEKALA 74
>gnl|CDD|205553 pfam13374, TPR_10, Tetratricopeptide repeat.
Length = 42
Score = 61.4 bits (150), Expect = 2e-12
Identities = 23/42 (54%), Positives = 30/42 (71%)
Query: 334 VAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPD 375
AA+LNNLA +G+Y+EAE L + AL IR +VLG +HPD
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGPDHPD 42
Score = 48.3 bits (116), Expect = 6e-08
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 292 VATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENH 331
A LN LA R Q +Y+EA LL +AL IR + LG +H
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGPDH 40
Score = 41.8 bits (99), Expect = 2e-05
Identities = 18/39 (46%), Positives = 21/39 (53%)
Query: 376 VAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPD 414
A LNNLA + Q +YEE E + AL I LGPD
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGPD 39
Score = 38.7 bits (91), Expect = 2e-04
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)
Query: 418 VAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGA 456
A + NNLA QG+Y+EAE L ++ L R G
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAI-RRRVLGP 38
Score = 38.7 bits (91), Expect = 2e-04
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)
Query: 498 VAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGA 536
A + NNLA QG+Y+EAE L ++ L R G
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAI-RRRVLGP 38
Score = 32.5 bits (75), Expect = 0.027
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 215 LRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSG 250
+L+NL +QGRYE A L ++AL + G
Sbjct: 2 AASLNNLARALRAQGRYEEAEELLEEALAIRRRVLG 37
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 60.1 bits (146), Expect = 2e-11
Identities = 36/110 (32%), Positives = 45/110 (40%), Gaps = 16/110 (14%)
Query: 299 LALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLC 358
L +Y Y EA AL + A NLA Y K GKY+EA
Sbjct: 6 LGNLYYKLGDYDEALEYYEKALELDPDN--------ADAYYNLAAAYYKLGKYEEALEDY 57
Query: 359 KRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
++ALE+ PD AK NL L KYEE Y++ALE+
Sbjct: 58 EKALELD--------PDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99
Score = 47.4 bits (113), Expect = 7e-07
Identities = 41/149 (27%), Positives = 57/149 (38%), Gaps = 54/149 (36%)
Query: 378 KQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKE 437
+ L NL L Y+E YY++ALE+ D + A NLA+ Y K GKY+E
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALEL--------DPDNADAYYNLAAAYYKLGKYEE 52
Query: 438 AEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLGPDDSN 497
A + Y++ALE+ D +
Sbjct: 53 AL--------------------------------------EDYEKALEL--------DPD 66
Query: 498 VAKTKNNLASCYLKQGKYKEAEILYKQVL 526
AK NL Y K GKY+EA Y++ L
Sbjct: 67 NAKAYYNLGLAYYKLGKYEEALEAYEKAL 95
Score = 41.2 bits (97), Expect = 1e-04
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 8/78 (10%)
Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRG 349
PD A LA Y KY+EA AL + A NL + Y K G
Sbjct: 31 PDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPD--------NAKAYYNLGLAYYKLG 82
Query: 350 KYKEAEPLCKRALEIREK 367
KY+EA ++ALE+
Sbjct: 83 KYEEALEAYEKALELDPN 100
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 49.3 bits (118), Expect = 1e-06
Identities = 42/149 (28%), Positives = 58/149 (38%), Gaps = 24/149 (16%)
Query: 299 LALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLC 358
LAL Y Q Y +A L AL + A Y K G+ A+
Sbjct: 41 LALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLV--------RAHYYQKLGENDLADESY 92
Query: 359 KRALEIREKVLGKEHPDVAKQLNNL-ALLCQNQSKYEEVERYYQRALEIYELKLGPDDSN 417
++AL P+ LNN A LC Q + EE + ++RAL P
Sbjct: 93 RKAL--------SLAPNNGDVLNNYGAFLCA-QGRPEEAMQQFERALA------DPAYGE 137
Query: 418 VAKTKNNLASCYLKQGKYKEAEILYKQVL 446
+ T NL C LK G++ +AE K+ L
Sbjct: 138 PSDTLENLGLCALKAGQFDQAEEYLKRAL 166
Score = 33.2 bits (76), Expect = 0.27
Identities = 40/198 (20%), Positives = 67/198 (33%), Gaps = 60/198 (30%)
Query: 329 ENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQ 388
+ A LA+ Y ++G Y +A+ ++ALE P A Q
Sbjct: 29 TDRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEH--------DPSYYLAHLVRAHYYQ 80
Query: 389 NQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTR 448
+ + + Y++AL L P++ +V NN + QG+ +EA
Sbjct: 81 KLGENDLADESYRKAL-----SLAPNNGDVL---NNYGAFLCAQGRPEEAM--------- 123
Query: 449 AHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLGPDDSNVAKTKNNLASC 508
+ ++RAL P + T NL C
Sbjct: 124 -----------------------------QQFERALA------DPAYGEPSDTLENLGLC 148
Query: 509 YLKQGKYKEAEILYKQVL 526
LK G++ +AE K+ L
Sbjct: 149 ALKAGQFDQAEEYLKRAL 166
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 49.3 bits (118), Expect = 1e-06
Identities = 48/202 (23%), Positives = 84/202 (41%), Gaps = 28/202 (13%)
Query: 329 ENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQ 388
+ A LA+ Y ++G + A+ +ALE PD LAL Q
Sbjct: 25 TDRNKAAKIRVQLALGYLEQGDLEVAKENLDKALEHD--------PDDYLAYLALALYYQ 76
Query: 389 NQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQ-VLT 447
+ E+ E ++RAL L P++ +V NN + +QGKY++A ++Q +
Sbjct: 77 QLGELEKAEDSFRRAL-----TLNPNNGDV---LNNYGTFLCQQGKYEQAMQQFEQAIED 128
Query: 448 RAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLGPDDSNVAKTKNNLAS 507
+ + + + N + + + K EKY RAL+I D ++ LA
Sbjct: 129 PLYPQPARSLE--NAGLCALKAGDFDK-AEKYLTRALQI--------DPQRPESLLELAE 177
Query: 508 CYLKQGKYKEAEILYKQVLTRA 529
Y +G+YK+A ++
Sbjct: 178 LYYLRGQYKDARAYLERYQQTY 199
Score = 43.5 bits (103), Expect = 1e-04
Identities = 38/160 (23%), Positives = 65/160 (40%), Gaps = 22/160 (13%)
Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRG 349
PD LAL Y+ + ++A + ALT+ LNN ++G
Sbjct: 62 PDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNN--------GDVLNNYGTFLCQQG 113
Query: 350 KYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYEL 409
KY++A ++A+E +P A+ L N L +++ E+Y RAL+I
Sbjct: 114 KYEQAMQQFEQAIE------DPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQI--- 164
Query: 410 KLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRA 449
D ++ LA Y +G+YK+A ++
Sbjct: 165 -----DPQRPESLLELAELYYLRGQYKDARAYLERYQQTY 199
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 43.1 bits (102), Expect = 1e-05
Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 9/73 (12%)
Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLAL-LCQNQSKY 393
A L NL K G Y EA ++ALE+ PD A+ NLAL + Y
Sbjct: 3 AEALKNLGNALFKLGDYDEAIEAYEKALEL--------DPDNAEAYYNLALAYLKLGKDY 54
Query: 394 EEVERYYQRALEI 406
EE ++ALE+
Sbjct: 55 EEALEDLEKALEL 67
Score = 41.5 bits (98), Expect = 3e-05
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 9/73 (12%)
Query: 375 DVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGK 434
D A+ L NL Y+E Y++ALE+ D + A+ NLA YLK GK
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALEL--------DPDNAEAYYNLALAYLKLGK 52
Query: 435 -YKEAEILYKQVL 446
Y+EA ++ L
Sbjct: 53 DYEEALEDLEKAL 65
Score = 38.8 bits (91), Expect = 4e-04
Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 9/77 (11%)
Query: 291 DVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGK 350
D A L L Y EA AL + A NLA+ Y K GK
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDN--------AEAYYNLALAYLKLGK 52
Query: 351 -YKEAEPLCKRALEIRE 366
Y+EA ++ALE+
Sbjct: 53 DYEEALEDLEKALELDP 69
Score = 33.4 bits (77), Expect = 0.029
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 476 REKYYQRALEIYE--LKLGPDDSNVAKTKNNLASCYLKQGK-YKEAEILYKQVL 526
+ Y A+E YE L+L PD+ A+ NLA YLK GK Y+EA ++ L
Sbjct: 15 KLGDYDEAIEAYEKALELDPDN---AEAYYNLALAYLKLGKDYEEALEDLEKAL 65
Score = 29.2 bits (66), Expect = 0.75
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 497 NVAKTKNNLASCYLKQGKYKEAEILYKQVL 526
+ A+ NL + K G Y EA Y++ L
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKAL 30
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 46.2 bits (110), Expect = 4e-05
Identities = 50/223 (22%), Positives = 82/223 (36%), Gaps = 36/223 (16%)
Query: 225 YASQGRYEVAVPLCKQALEDLEKTSGKYEAAETLEDCALRSRKESYDIVKQAKVAQILGS 284
Y Q Y A+ ++AL+ +S A L L S A+ + L +
Sbjct: 713 YLRQKDYPAAIQAYRKALK-RAPSSQ---NAIKLHRALLASGN-------TAEAVKTLEA 761
Query: 285 GGHDHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVL 344
HP+ A + LA +Y Q Y +A + A LNNLA L
Sbjct: 762 WLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDN--------AVVLNNLAWL 813
Query: 345 YGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRAL 404
Y + K A +RAL++ P++ L+ L L + + + ++A+
Sbjct: 814 YLEL-KDPRALEYAERALKLA--------PNIPAILDTLGWLLVEKGEADRALPLLRKAV 864
Query: 405 EIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLT 447
I P+ A + +LA L G+ EA ++L
Sbjct: 865 NI-----APEA---AAIRYHLALALLATGRKAEARKELDKLLN 899
Score = 35.8 bits (83), Expect = 0.059
Identities = 57/275 (20%), Positives = 91/275 (33%), Gaps = 70/275 (25%)
Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIRE------------------------- 324
PD A L +LA Y Y +A L AL ++
Sbjct: 632 PDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKI 691
Query: 325 -KTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNL 383
K+L + H A LY ++ Y A ++AL+ A +L+
Sbjct: 692 AKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKR------APSSQNAIKLHRA 745
Query: 384 ALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYK 443
L N ++ + + + N A + LA YL Q Y +A Y+
Sbjct: 746 LLASGNTAEAVKTLEAWLK-----------THPNDAVLRTALAELYLAQKDYDKAIKHYQ 794
Query: 444 QVLTRAHEREFGACDGDNKPI-----WQDRQKNKAKNREKYYQRALEIYE--LKLGPDDS 496
V+ +A DN + W + RALE E LKL P+
Sbjct: 795 TVVKKAP---------DNAVVLNNLAWLYLELKDP--------RALEYAERALKLAPNIP 837
Query: 497 NVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
+ T L +++G+ A L ++ + A E
Sbjct: 838 AILDT---LGWLLVEKGEADRALPLLRKAVNIAPE 869
Score = 34.3 bits (79), Expect = 0.20
Identities = 33/154 (21%), Positives = 51/154 (33%), Gaps = 28/154 (18%)
Query: 288 DHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGK 347
D A +L + L Y + + A AL I ++L LA L
Sbjct: 120 DDEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSL--------YAKLGLAQLALA 171
Query: 348 RGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIY 407
++ EA L L P L L + E AL Y
Sbjct: 172 ENRFDEARALIDEVLTA--------DPGNVDALLLKGDLLLSLGNIEL-------ALAAY 216
Query: 408 E--LKLGPDDSNVAKTKNNLASCYLKQGKYKEAE 439
+ L P++ V LA+ ++ G+++EAE
Sbjct: 217 RKAIALRPNNIAV---LLALATILIEAGEFEEAE 247
Score = 33.9 bits (78), Expect = 0.25
Identities = 57/269 (21%), Positives = 87/269 (32%), Gaps = 59/269 (21%)
Query: 289 HPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKR 348
+P LA Y + + K+A +LN+A + L
Sbjct: 563 NPQEIEPALALAQYYLGKGQLKKALAILNEAA--------DAAPDSPEAWLMLGRAQLAA 614
Query: 349 GKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
G +A K+ L + PD A L LA Y + +RALE+
Sbjct: 615 GDLNKAVSSFKKLLAL--------QPDSALALLLLADAYAVMKNYAKAITSLKRALELKP 666
Query: 409 -----------LKLGPDDSNVAKT------KNN---------LASCYLKQGKYKEAEILY 442
L L + AK K + YL+Q Y A Y
Sbjct: 667 DNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAY 726
Query: 443 KQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYE--LKLGPDDSNVAK 500
++ A +R + + N A+ A++ E LK P+D A
Sbjct: 727 RK----ALKRAPSSQNAIKLHRALLASGNTAE--------AVKTLEAWLKTHPND---AV 771
Query: 501 TKNNLASCYLKQGKYKEAEILYKQVLTRA 529
+ LA YL Q Y +A Y+ V+ +A
Sbjct: 772 LRTALAELYLAQKDYDKAIKHYQTVVKKA 800
Score = 32.0 bits (73), Expect = 1.1
Identities = 38/151 (25%), Positives = 57/151 (37%), Gaps = 29/151 (19%)
Query: 300 ALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCK 359
A Y +NKYK A L +AL L +Y G Y AE +
Sbjct: 29 AKSYLQKNKYKAAIIQLKNALQKDPNDAEARFL--------LGKIYLALGDYAAAEKELR 80
Query: 360 RALEI---REKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDS 416
+AL + + +VL P +A+ A L Q + +Q+ L+ K DD
Sbjct: 81 KALSLGYPKNQVL----PLLAR-----AYLLQGK---------FQQVLDELPGKTLLDDE 122
Query: 417 NVAKTKNNLASCYLKQGKYKEAEILYKQVLT 447
A+ YL G+ + A+ Y+Q L
Sbjct: 123 GAAELLALRGLAYLGLGQLELAQKSYEQALA 153
Score = 28.9 bits (65), Expect = 9.0
Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 9/91 (9%)
Query: 359 KRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNV 418
+AL +K L K+ PD A N L + + + +++AL I + +
Sbjct: 448 DKALAAAKK-LEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSI--------EPDF 498
Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLTRA 449
NLA +++G +A +++VLT
Sbjct: 499 FPAAANLARIDIQEGNPDDAIQRFEKVLTID 529
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 42.9 bits (99), Expect = 3e-04
Identities = 56/239 (23%), Positives = 81/239 (33%), Gaps = 10/239 (4%)
Query: 295 MLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNL--AVLYGKRGKYK 352
+L LA++ K A L +A + LGE A+ L G
Sbjct: 4 LLLALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLL 63
Query: 353 EAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLG 412
+ + E + E + L NLA N E Y+ ALE+ E L
Sbjct: 64 LLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALA 123
Query: 413 PDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNK 472
D + G Y+EA LY++ L E A
Sbjct: 124 LDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELA--------EALLALGA 175
Query: 473 AKNREKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
Y+ ALE+ E L + + A+ NL YLK GKY+EA Y++ L +
Sbjct: 176 LLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPD 234
Score = 39.8 bits (91), Expect = 0.002
Identities = 57/274 (20%), Positives = 94/274 (34%), Gaps = 16/274 (5%)
Query: 253 EAAETLEDCALRSRKESYDIVKQAKVAQILGSGGHDHPDVATMLNILALVYRDQNKYKEA 312
A L + + E+ +++ L + ++ L
Sbjct: 6 LALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLL 65
Query: 313 ANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKE 372
A L + E A L NLA G EA + ALE+ EK L +
Sbjct: 66 ALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALD 125
Query: 373 HPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQ 432
+ YEE Y++ALE+ P+ + +A+ L +
Sbjct: 126 PDPDLAEALLALGALYELGDYEEALELYEKALELD-----PELNELAEALLALGALLEAL 180
Query: 433 GKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLG 492
G+Y+EA L ++ L + + A + Y+ ALE YE K
Sbjct: 181 GRYEEALELLEKALKLNPDDDAEA----------LLNLGLLYLKLGKYEEALEYYE-KAL 229
Query: 493 PDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVL 526
D + A+ NLA L+ G+Y+EA ++ L
Sbjct: 230 ELDPDNAEALYNLALLLLELGRYEEALEALEKAL 263
Score = 34.8 bits (78), Expect = 0.082
Identities = 55/211 (26%), Positives = 87/211 (41%), Gaps = 21/211 (9%)
Query: 202 AQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKYEAAETLEDC 261
++ +P L NL + + G+YE A+ L ++AL
Sbjct: 82 LEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALY 141
Query: 262 ALRSRKESYDIVKQAKVAQILGSGGHDHPDVATMLNILALVYRDQNKYKEAANLLNDALT 321
L +E+ ++ ++A + ++A L L + +Y+EA LL AL
Sbjct: 142 ELGDYEEALELYEKALEL------DPELNELAEALLALGALLEALGRYEEALELLEKALK 195
Query: 322 IREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLN 381
+ N A L NL +LY K GKY+EA ++ALE+ PD A+ L
Sbjct: 196 L-------NPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELD--------PDNAEALY 240
Query: 382 NLALLCQNQSKYEEVERYYQRALEIYELKLG 412
NLALL +YEE ++ALE+
Sbjct: 241 NLALLLLELGRYEEALEALEKALELDPDLYN 271
Score = 33.7 bits (75), Expect = 0.22
Identities = 49/225 (21%), Positives = 82/225 (36%), Gaps = 24/225 (10%)
Query: 314 NLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEH 373
L AL I + L + A A L L G+ EA L + ALE+
Sbjct: 2 KDLLLALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALEL--------- 52
Query: 374 PDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQG 433
+ LL ++ R + + + N+A+ NL G
Sbjct: 53 ---LPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALG 109
Query: 434 KYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYE--LKL 491
KY+EA L ++ L + + + A Y+ ALE+YE L+L
Sbjct: 110 KYEEALELLEKALALDPDPDLA----------EALLALGALYELGDYEEALELYEKALEL 159
Query: 492 GPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGA 536
P+ + +A+ L + G+Y+EA L ++ L + + A
Sbjct: 160 DPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEA 204
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 36.6 bits (86), Expect = 7e-04
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
A L NL Y K GKY EA ++ALE+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 35.1 bits (82), Expect = 0.003
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
AK L NL KY+E YY++ALE+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNP 32
Score = 32.8 bits (76), Expect = 0.018
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 451
AK NL + YLK GKY EA Y++ L
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 32.8 bits (76), Expect = 0.018
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
AK NL + YLK GKY EA Y++ L
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 25.5 bits (57), Expect = 7.3
Identities = 10/33 (30%), Positives = 11/33 (33%)
Query: 293 ATMLNILALVYRDQNKYKEAANLLNDALTIREK 325
A L L Y KY EA AL +
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 36.5 bits (85), Expect = 0.002
Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 8/67 (11%)
Query: 341 LAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYY 400
LA + G Y EA + AL +P A+ L L Q + E
Sbjct: 3 LARAALRAGDYDEALAALEAAL--------ARYPLAAEALLLLGEALLRQGRLAEAAALL 54
Query: 401 QRALEIY 407
+ AL
Sbjct: 55 RAALAAD 61
Score = 33.4 bits (77), Expect = 0.029
Identities = 17/67 (25%), Positives = 22/67 (32%), Gaps = 8/67 (11%)
Query: 299 LALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLC 358
LA Y EA L AL + A L L ++G+ EA L
Sbjct: 3 LARAALRAGDYDEALAALEAAL--------ARYPLAAEALLLLGEALLRQGRLAEAAALL 54
Query: 359 KRALEIR 365
+ AL
Sbjct: 55 RAALAAD 61
Score = 32.3 bits (74), Expect = 0.065
Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 5/52 (9%)
Query: 400 YQRALEIYE--LKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRA 449
Y AL E L P + L L+QG+ EA L + L
Sbjct: 13 YDEALAALEAALARYPLAAEAL---LLLGEALLRQGRLAEAAALLRAALAAD 61
Score = 32.3 bits (74), Expect = 0.065
Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 5/52 (9%)
Query: 480 YQRALEIYE--LKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRA 529
Y AL E L P + L L+QG+ EA L + L
Sbjct: 13 YDEALAALEAALARYPLAAEAL---LLLGEALLRQGRLAEAAALLRAALAAD 61
Score = 27.6 bits (62), Expect = 3.1
Identities = 12/36 (33%), Positives = 15/36 (41%)
Query: 288 DHPDVATMLNILALVYRDQNKYKEAANLLNDALTIR 323
+P A L +L Q + EAA LL AL
Sbjct: 26 RYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61
>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat.
Length = 36
Score = 35.3 bits (82), Expect = 0.003
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 337 TLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKE 372
L+NL LY K G Y++A L +RAL + + +E
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36
Score = 30.3 bits (69), Expect = 0.17
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 380 LNNLALLCQNQSKYEEVERYYQRALEIYELKLGPD 414
L+NL L + YE+ Y+RAL + + +
Sbjct: 2 LSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36
Score = 29.9 bits (68), Expect = 0.21
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 423 NNLASCYLKQGKYKEAEILYKQVLTRAHERE 453
+NL Y K G Y++A LY++ L A + E
Sbjct: 3 SNLGRLYRKLGDYEKAISLYERALALAKDPE 33
Score = 29.9 bits (68), Expect = 0.21
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 503 NNLASCYLKQGKYKEAEILYKQVLTRAHERE 533
+NL Y K G Y++A LY++ L A + E
Sbjct: 3 SNLGRLYRKLGDYEKAISLYERALALAKDPE 33
Score = 27.2 bits (61), Expect = 1.9
Identities = 9/35 (25%), Positives = 16/35 (45%)
Query: 296 LNILALVYRDQNKYKEAANLLNDALTIREKTLGEN 330
L+ L +YR Y++A +L AL + +
Sbjct: 2 LSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 35.1 bits (82), Expect = 0.003
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
A L NL Y K G Y EA ++ALE+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 33.6 bits (78), Expect = 0.011
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 451
A+ NL + YLK G Y EA Y++ L
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 33.6 bits (78), Expect = 0.011
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
A+ NL + YLK G Y EA Y++ L
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 32.0 bits (74), Expect = 0.036
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEI 406
A+ L NL Y+E YY++ALE+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALEL 30
Score = 25.5 bits (57), Expect = 8.8
Identities = 9/33 (27%), Positives = 10/33 (30%)
Query: 293 ATMLNILALVYRDQNKYKEAANLLNDALTIREK 325
A L L Y Y EA AL +
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 37.3 bits (87), Expect = 0.009
Identities = 12/54 (22%), Positives = 29/54 (53%)
Query: 93 LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
L ++ E +L+ ++ L ++ A L+ E +Q+L + E +L++E ++
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK 121
Score = 33.8 bits (78), Expect = 0.12
Identities = 24/125 (19%), Positives = 57/125 (45%), Gaps = 18/125 (14%)
Query: 29 TNTKTVQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQ-VMT 87
++T + ++ L L+ E + +E+ ++ L ++ E + ++
Sbjct: 59 SDTPSARERLPELQQELAEL--------------QEELAELQEQLAELQQENQELKQELS 104
Query: 88 ALAGHLQNVEAEKQKLRT---QVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKH 144
L L+ ++ E +++ L +EN LR+ELA +Q+ ++ E N +L+E ++
Sbjct: 105 TLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
Query: 145 LEFMS 149
F+
Sbjct: 165 RWFLY 169
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 36.5 bits (85), Expect = 0.037
Identities = 38/195 (19%), Positives = 72/195 (36%), Gaps = 38/195 (19%)
Query: 34 VQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQVMTALAGHL 93
+ Q + +++ + L+ E GG + +E ++ V LE I + QV+ + L
Sbjct: 201 LAQNYQETQDQVSKDLDGAGE-HLGGRIHDELKSHVLPVLEAI---VELGQVLQEMKDGL 256
Query: 94 QNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLEFMSSIRK 153
++V ++LR +L +R L N+ KL ++ S +SS+
Sbjct: 257 ESVNKTLKELREGGSQLRDGLRGVRRRLNNSLNKLCLTQMCAS----------ILSSLSI 306
Query: 154 YDSDIQGDENSGDQKSDKHKQDDPVVDLFPDEDNDDRNNLSPTPPSQFAQQVNAGY---- 209
D D PD + NL + F+ V G
Sbjct: 307 LALDA-------DFNQ------------LPDVT-EILENLEEVVKADFSSIVQEGNSTFE 346
Query: 210 EIPARLRTLHNLVIQ 224
EIP++++ + V+
Sbjct: 347 EIPSKVKNQTSSVVP 361
>gnl|CDD|221837 pfam12895, Apc3, Anaphase-promoting complex, cyclosome, subunit 3.
Apc3, otherwise known as Cdc27, is one of the subunits
of the anaphase-promoting complex or cyclosome. The
anaphase-promoting complex is a multiprotein subunit E3
ubiquitin ligase complex that controls segregation of
chromosomes and exit from mitosis in eukaryotes. The
protein members of this family contain TPR repeats just
as those of Apc7 do, and it appears that these TPR units
bind the C-termini of the APC co-activators CDH1 and
CDC20.
Length = 80
Score = 33.0 bits (76), Expect = 0.049
Identities = 28/99 (28%), Positives = 40/99 (40%), Gaps = 19/99 (19%)
Query: 347 KRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEI 406
+G Y+ A L EK+L + L Q Q Y+RA E+
Sbjct: 1 DQGNYENAIFLA-------EKLLALTPSNEDAYLLAQCYFLQGQ---------YKRAYEL 44
Query: 407 YELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQV 445
KL D+S+ + LA C LK GKY EA + ++
Sbjct: 45 LR-KLKLDNSSGCRYL--LAQCLLKLGKYDEAIAVLEKA 80
Score = 28.0 bits (63), Expect = 2.5
Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 476 REKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQV 525
+ Y+RA E+ KL D+S+ + LA C LK GKY EA + ++
Sbjct: 34 LQGQYKRAYELLR-KLKLDNSSGCRYL--LAQCLLKLGKYDEAIAVLEKA 80
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 35.8 bits (83), Expect = 0.051
Identities = 26/126 (20%), Positives = 45/126 (35%), Gaps = 22/126 (17%)
Query: 71 NSLEIIELGLNEAQVMTAL-----AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQ 125
+S IE + G E+ K LR Q+ ++ + L ELA Q
Sbjct: 105 DSQAAIEFPDDLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQ 164
Query: 126 QKLQSSEQ----ANSQLE-----EEKK------HLEFMSSIRKYDSDIQGDENSGDQKSD 170
+LQ+ Q + +LE +EK LE + ++ E + +
Sbjct: 165 AQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLE--AELEVL 222
Query: 171 KHKQDD 176
K + D+
Sbjct: 223 KRQIDE 228
>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
repeats [Intracellular trafficking and secretion].
Length = 257
Score = 34.7 bits (80), Expect = 0.082
Identities = 30/165 (18%), Positives = 48/165 (29%), Gaps = 30/165 (18%)
Query: 372 EHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLK 431
+P + L + E ++A L P D + N L + +
Sbjct: 95 AYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAAR-----LAPTD---WEAWNLLGAALDQ 146
Query: 432 QGKYKEAEILYKQVLTRAHEREFGACDGDNKP-----IWQDRQKNKAKNREKYYQRALEI 486
G++ EA Y+Q L A + + + + A
Sbjct: 147 LGRFDEARRAYRQALELA--------PNEPSIANNLGMSLLLRGDLEDAE-TLLLPAYL- 196
Query: 487 YELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
DS V NLA QG ++EAE + Q L
Sbjct: 197 ----SPAADSRVR---QNLALVVGLQGDFREAEDIAVQELLSEQA 234
Score = 33.2 bits (76), Expect = 0.26
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 16/125 (12%)
Query: 317 NDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDV 376
D+ + + L + G + EA + ++A + P
Sbjct: 82 ADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLA--------PTD 133
Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYK 436
+ N L +++E R Y++ALE+ N NNL L +G +
Sbjct: 134 WEAWNLLGAALDQLGRFDEARRAYRQALEL--------APNEPSIANNLGMSLLLRGDLE 185
Query: 437 EAEIL 441
+AE L
Sbjct: 186 DAETL 190
>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K. CENP-K is one
of seven new CENP-A-nucleosome distal (CAD) centromere
components (the others being CENP-L, CENP-O, CENP-P,
CENP-Q, CENP-R and CENP-S) that are identified as
assembling on the CENP-A nucleosome associated complex,
NAC. The CENP-A NAC is essential, as disruption of the
complex causes errors of chromosome alignment and
segregation that preclude cell survival despite
continued centromere-derived mitotic checkpoint
signalling. CENP-K is centromere-associated through its
interaction with one or more components of the CENP-A
NAC.
Length = 273
Score = 34.9 bits (80), Expect = 0.095
Identities = 24/128 (18%), Positives = 54/128 (42%), Gaps = 12/128 (9%)
Query: 23 NQEEIMTNTKTVQQGLEALKNEHTSILNSLSETQH----GGPVEEEKRNMVKNSLEIIEL 78
++++ +VQ E LK + L E Q + E +N V
Sbjct: 102 LRQDLEMVLSSVQSKNEKLKEDLEREQQWLDEQQQILDSLNERQSELKNQVVT------- 154
Query: 79 GLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQL 138
+E+++ L + ++ K+KL + + ++E+ L + A+ ++K + ++ + QL
Sbjct: 155 -FSESRIFQELKTKMLRIKEYKEKLLSALGEFLEEHFPLPEAQASIKKKRKGIQEPSVQL 213
Query: 139 EEEKKHLE 146
+ LE
Sbjct: 214 ITLHEILE 221
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 35.0 bits (81), Expect = 0.13
Identities = 24/136 (17%), Positives = 54/136 (39%), Gaps = 1/136 (0%)
Query: 31 TKTVQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQ-VMTAL 89
K ++ LE LK E + L Q E+ + ++ IE +NE +
Sbjct: 387 LKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDK 446
Query: 90 AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLEFMS 149
A ++ E + ++L + + QE L++E +++L ++ ++ E + + E
Sbjct: 447 ALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERV 506
Query: 150 SIRKYDSDIQGDENSG 165
+ ++ G
Sbjct: 507 RGGRAVEEVLKASIQG 522
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 34.3 bits (79), Expect = 0.16
Identities = 30/113 (26%), Positives = 45/113 (39%), Gaps = 15/113 (13%)
Query: 339 NNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVER 398
L L R + + EPL K A EK L K+HP+ L+ L L + +
Sbjct: 296 PRLCRLIP-RLRPGDPEPLIKAA----EKWL-KQHPEDPLLLSTLGRLALKNKLWGKASE 349
Query: 399 YYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 451
+ A LKL P S LA + G+ +EAE + ++ L +
Sbjct: 350 ALEAA-----LKLRPSAS----DYAELADALDQLGEPEEAEQVRREALLLTRQ 393
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 34.2 bits (78), Expect = 0.17
Identities = 29/117 (24%), Positives = 53/117 (45%), Gaps = 9/117 (7%)
Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRG 349
P+ + N + DQNK+ EA + A+ + ++T + V +N L+ +
Sbjct: 464 PEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETKPM-YMNVLPLINKALALFQWKQ 522
Query: 350 KYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEI 406
+ EAE LC++AL I P+ + +A L Q +E + ++RA E+
Sbjct: 523 DFIEAENLCEKALII--------DPECDIAVATMAQLLLQQGDVDEALKLFERAAEL 571
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 29.8 bits (68), Expect = 0.22
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVL 446
A+ NL Y K G Y+EA Y++ L
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKAL 28
Score = 29.8 bits (68), Expect = 0.22
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVL 526
A+ NL Y K G Y+EA Y++ L
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKAL 28
Score = 29.8 bits (68), Expect = 0.24
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
A L NL + Y K G Y+EA ++ALE+
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
Score = 28.6 bits (65), Expect = 0.51
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
A+ L NL L YEE Y++ALE+
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDP 32
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 33.8 bits (78), Expect = 0.23
Identities = 36/157 (22%), Positives = 61/157 (38%), Gaps = 15/157 (9%)
Query: 12 KIESIGKMTVMNQEEIMTNTKTVQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKN 71
KIE K+ M ++ T Q + + T I + L E QH + +
Sbjct: 270 KIEQFQKVIKMYEKG--GVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAID---- 323
Query: 72 SLEIIELGLNEAQV--------MTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELAN 123
LE I NE ++ L + + +K++ + L E +ELA
Sbjct: 324 ELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAK 383
Query: 124 TQQKLQSSEQANSQLEEEKKHLEFMSSIRKYDSDIQG 160
Q +L + S+L +EK H ++ + K DS I+
Sbjct: 384 LQDELDKIVKTKSELVKEKYHRGIVTDLLK-DSGIKA 419
>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
N-terminal. The DAZ gene-product - Deleted in
Azoospermia - and a closely related sequence are
required early in germ-cell development in order to
maintain germ-cell populations. This family is the
N-terminal region that is the only part of the protein
in some fungi and lower metazoa.
Length = 118
Score = 31.5 bits (72), Expect = 0.34
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 110 LVQENAWLRDELANTQQKLQSSEQANSQLEEEKK 143
L Q N+ L + L Q+ + EQ S+ EE K
Sbjct: 78 LTQSNSELEERLQEALQEAEEREQELSKQSEELK 111
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This
family consists of several hypothetical bacterial
sequences. The function of this family is unknown.
Length = 106
Score = 31.1 bits (71), Expect = 0.37
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 89 LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKK 143
L L + AE +L+ Q+ L++ENA LR E + +++L+ EQ + E+ KK
Sbjct: 13 LEQQLNQLLAELGELKQQLAELLEENAELRIENEHLRERLEELEQEQKEKEKGKK 67
>gnl|CDD|140213 PTZ00186, PTZ00186, heat shock 70 kDa precursor protein;
Provisional.
Length = 657
Score = 33.1 bits (75), Expect = 0.37
Identities = 33/149 (22%), Positives = 66/149 (44%), Gaps = 17/149 (11%)
Query: 42 KNEHTSILNSLSETQHGGPVEEEKRNMVKNS----------LEIIELGLN-EAQVMTA-- 88
K++ T +++ T +GG +E+ M+++S E++E+ N E Q+ TA
Sbjct: 509 KDKATGKTQNITITANGGLSKEQIEQMIRDSEQHAEADRVKRELVEVRNNAETQLTTAER 568
Query: 89 -LAGHLQNVEAEKQKLRT---QVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKH 144
L +AEK+ ++T ++R+ ++ +D+LA KLQ + + E ++
Sbjct: 569 QLGEWKYVSDAEKENVKTLVAELRKAMENPNVAKDDLAAATDKLQKAVMECGRTEYQQAA 628
Query: 145 LEFMSSIRKYDSDIQGDENSGDQKSDKHK 173
S Q + Q S++ K
Sbjct: 629 AANSGSSSNSGEQQQQQQQQQQQNSEEKK 657
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 32.6 bits (75), Expect = 0.45
Identities = 23/98 (23%), Positives = 43/98 (43%), Gaps = 3/98 (3%)
Query: 37 GLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQV-MTALAGHLQN 95
LE N+ + +S++ + E +R+ ++ L + A+ LA L +
Sbjct: 70 SLERQGNQ--DLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127
Query: 96 VEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQ 133
+ + QV L Q+ A LR +LA + L +SE+
Sbjct: 128 EKQVSARALAQVELLNQQIAALRRQLAALEAALDASEK 165
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 28.7 bits (65), Expect = 0.55
Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 8/41 (19%)
Query: 399 YYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAE 439
Y++ALE+ P+++ NLA L G+Y EA
Sbjct: 1 LYEKALELD-----PNNAEAY---YNLALLLLNLGQYDEAL 33
Score = 28.7 bits (65), Expect = 0.55
Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 8/41 (19%)
Query: 479 YYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAE 519
Y++ALE+ P+++ NLA L G+Y EA
Sbjct: 1 LYEKALELD-----PNNAEAY---YNLALLLLNLGQYDEAL 33
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 32.8 bits (75), Expect = 0.57
Identities = 38/200 (19%), Positives = 58/200 (29%), Gaps = 55/200 (27%)
Query: 288 DHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGK 347
PD L + + + NK KEA L AL + ++ + NLA K
Sbjct: 335 AQPDNPYYLELAGDILLEANKAKEAIERLKKALALD------PNSPLLQL--NLAQALLK 386
Query: 348 RGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIY 407
GK +EA + R L P+ +A
Sbjct: 387 GGKPQEAIRILNRYL--------FNDPE-----------------DPNGWDLLAQAYA-- 419
Query: 408 ELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIW-- 465
+LG N A+ A Y G+ ++A I + + P W
Sbjct: 420 --ELG----NRAEALLARAEGYALAGRLEQAIIFLMRASQQV---------KLGFPDWAR 464
Query: 466 -QDR--QKNKAKNREKYYQR 482
R Q + K Y +
Sbjct: 465 ADARIDQLRQQNEEFKPYTK 484
>gnl|CDD|236865 PRK11160, PRK11160, cysteine/glutathione ABC transporter
membrane/ATP-binding component; Reviewed.
Length = 574
Score = 32.1 bits (74), Expect = 0.76
Identities = 11/54 (20%), Positives = 25/54 (46%), Gaps = 9/54 (16%)
Query: 94 QNVEAEKQKLRTQVRRLVQENAWL---------RDELANTQQKLQSSEQANSQL 138
Q++ + + R Q+ +Q A L R +L T+Q+ ++++ + L
Sbjct: 190 QDLTHLRAQYRVQLTEWLQGQAELTLFGAEDRYRQQLEQTEQQWLAAQRRQANL 243
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 32.0 bits (73), Expect = 1.0
Identities = 27/117 (23%), Positives = 53/117 (45%), Gaps = 8/117 (6%)
Query: 38 LEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQ--------VMTAL 89
LE L+ E + L E + + ++ LE + L ++E + + AL
Sbjct: 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYAL 293
Query: 90 AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
A + +E +KQ LR ++ L ++ L +L + KL + ++LEE+ + L+
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK 350
>gnl|CDD|223054 PHA03359, PHA03359, UL17 tegument protein; Provisional.
Length = 686
Score = 31.9 bits (73), Expect = 1.1
Identities = 38/163 (23%), Positives = 54/163 (33%), Gaps = 29/163 (17%)
Query: 210 EIPARLRTLHNLVIQYA----SQGRYEVAVP----LCKQALEDLEKTSG----KYEAAET 257
I ++L+ Y RY P LC+Q + + EAA
Sbjct: 136 AIGDAAEEYYDLLFTYDELVPPGTRYNADGPRLAALCRQFCAYVRSHAALPPAAKEAAAH 195
Query: 258 LEDCALRSRKESYDIVKQAK-----VAQILGSGGHDHPDVATMLN-----ILALVYRDQN 307
LE C + + D V A Q+ GG D P + IL+L+ R
Sbjct: 196 LEACLGEDGRSNGDPVLSADAYIDPEEQLTAPGGFDTPAADARIEEDDREILSLIRR--- 252
Query: 308 KYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGK 350
AA ++ +R E AVA+ L A G G
Sbjct: 253 ----AAEVVASRHPVRAAAPREVGGAVASGLRQGARALGAAGA 291
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 31.8 bits (72), Expect = 1.2
Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 5/69 (7%)
Query: 94 QNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQS--SEQANSQLEEEK-KHLEFMSS 150
QN + E + L +++ L ++ + ELA ++KLQS SEQA+S K KHLE
Sbjct: 181 QNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEE--E 238
Query: 151 IRKYDSDIQ 159
+++Y+ D +
Sbjct: 239 LKRYEQDAE 247
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 31.3 bits (71), Expect = 1.5
Identities = 26/114 (22%), Positives = 45/114 (39%), Gaps = 5/114 (4%)
Query: 36 QGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVK---NSLEIIELGLNEAQVMTALAGH 92
+ L AL+N S E + + N EI+E A ++ +
Sbjct: 308 EHLNALRNLQASKEQQDGLDSEKDRGSHEDGDYYEVDINGPEILECKYRVA--VSEVGEL 365
Query: 93 LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
++A K + R E LR+E+ N +KL S E+++ + +E LE
Sbjct: 366 KAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDLE 419
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 30.7 bits (70), Expect = 1.8
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 4/38 (10%)
Query: 108 RRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL 145
L +EN L+ EL + +LQ QLE E L
Sbjct: 72 FDLREENEELKKELLELESRLQE----LEQLEAENARL 105
Score = 29.6 bits (67), Expect = 4.1
Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 5/58 (8%)
Query: 83 AQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEE 140
++G +++ A LR + L +E EL + Q+L+ E N++L E
Sbjct: 55 NAPREFVSGVFESL-ASLFDLREENEELKKELL----ELESRLQELEQLEAENARLRE 107
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 30.8 bits (70), Expect = 1.8
Identities = 13/59 (22%), Positives = 28/59 (47%), Gaps = 1/59 (1%)
Query: 89 LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQA-NSQLEEEKKHLE 146
L ++ + KL ++ L EN L+ + Q++Q + Q+ +L +E + L+
Sbjct: 64 LVAEVKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLK 122
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 30.4 bits (68), Expect = 2.5
Identities = 14/51 (27%), Positives = 30/51 (58%)
Query: 96 VEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
+E + LR + R+ +ENA L+ + +Q Q++++A ++ E+E + E
Sbjct: 283 IEELARGLRADIERVARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQARE 333
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 28.0 bits (63), Expect = 2.6
Identities = 12/44 (27%), Positives = 27/44 (61%)
Query: 102 KLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL 145
+L +++++ V+ A L+ E+ +++ + + N +LEEE K L
Sbjct: 8 ELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKL 51
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 30.0 bits (68), Expect = 2.8
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 2/87 (2%)
Query: 62 EEEKRNMVKNSLEIIELGLN-EAQVMTALAGHLQNVEAEK-QKLRTQVRRLVQENAWLRD 119
+ N +K L L E + LA L + + +K R ++R L + + R
Sbjct: 169 DLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRK 228
Query: 120 ELANTQQKLQSSEQANSQLEEEKKHLE 146
+L QQ+LQ A L +K L
Sbjct: 229 QLEELQQELQELTIAIEALTNKKSELL 255
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 27.7 bits (62), Expect = 2.9
Identities = 14/54 (25%), Positives = 27/54 (50%)
Query: 93 LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
+ + ++ K +++L +N L E+A ++KL+ E+ LEE K E
Sbjct: 21 GEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERLKRAE 74
>gnl|CDD|235966 PRK07207, PRK07207, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 965
Score = 30.3 bits (69), Expect = 2.9
Identities = 29/111 (26%), Positives = 42/111 (37%), Gaps = 22/111 (19%)
Query: 237 LCKQALEDLEKTSGKYEAAETLEDCALRSRKESYDIVKQAKVAQILGSGGHDHPDVATML 296
L ++A E LE + AETL K YD V +V + A +L
Sbjct: 169 LIEEACEGLEAVDAEPILAETL--------KNLYDGVPMDEVYK------------ALIL 208
Query: 297 NILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGK 347
L+ ++ N A LL D T+R + LG T +A Y +
Sbjct: 209 AARTLIEKEPNYSYVTARLLLD--TLRREALGFLGVPEEVTQAEMATRYAE 257
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 29.8 bits (67), Expect = 3.3
Identities = 20/98 (20%), Positives = 39/98 (39%), Gaps = 2/98 (2%)
Query: 61 VEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDE 120
E++R+ K L +E E AL L ++ E+++L ++ L +E+ L E
Sbjct: 40 DAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGE 99
Query: 121 LANTQQKLQS--SEQANSQLEEEKKHLEFMSSIRKYDS 156
L Q++ + +E+ E + S
Sbjct: 100 LVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQS 137
>gnl|CDD|234861 PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional.
Length = 552
Score = 30.0 bits (69), Expect = 3.3
Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 15/49 (30%)
Query: 313 ANLLN-DALTIREKTLGENHAAVAA-------TLNN-------LAVLYG 346
A LL+ D LT+ KTL EN A L+N LA+L G
Sbjct: 330 AGLLHGDCLTVTGKTLAENLADAPDPDQDVIRPLDNPISPTGGLAILKG 378
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 26.7 bits (60), Expect = 3.4
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
A L +Y + G Y+EA+ ++ALE+
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPN 33
Score = 26.3 bits (59), Expect = 4.4
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLT 447
A+ L YL+ G Y+EA+ Y++ L
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALE 29
Score = 26.3 bits (59), Expect = 4.4
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVLT 527
A+ L YL+ G Y+EA+ Y++ L
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALE 29
>gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE;
Provisional.
Length = 656
Score = 30.1 bits (67), Expect = 3.8
Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 3/47 (6%)
Query: 212 PARLRTLHNLVIQYASQGRYEVAVPLCKQALE---DLEKTSGKYEAA 255
+R + G+ E A+PL +Q+L DL Y A
Sbjct: 281 SDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAMYARA 327
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.5 bits (66), Expect = 4.3
Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 9/89 (10%)
Query: 61 VEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRR---LVQENAWL 117
V +E +N ++ GL Q + ++ E K+KL + L++E L
Sbjct: 108 VRKELKN------ALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEEL 161
Query: 118 RDELANTQQKLQSSEQANSQLEEEKKHLE 146
E Q++L+ E NS+LEE K L
Sbjct: 162 EAEYEEVQERLKRLEVENSRLEEMLKKLP 190
>gnl|CDD|235315 PRK04841, PRK04841, transcriptional regulator MalT; Provisional.
Length = 903
Score = 29.9 bits (68), Expect = 4.4
Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 1/65 (1%)
Query: 384 ALLCQNQSKYEEVERYYQRA-LEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILY 442
A L Q+Q +Y EV RA E+ + + D + A+ A + G +EAE L
Sbjct: 416 AWLAQSQHRYSEVNTLLARAEQELKDRNIELDGTLQAEFNALRAQVAINDGDPEEAERLA 475
Query: 443 KQVLT 447
+ L
Sbjct: 476 ELALA 480
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 29.7 bits (67), Expect = 4.6
Identities = 13/71 (18%), Positives = 30/71 (42%)
Query: 81 NEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEE 140
E L+ ++ + +L + + ++ E L ++ ++L + Q N +LEE
Sbjct: 260 TEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEE 319
Query: 141 EKKHLEFMSSI 151
E K + +
Sbjct: 320 EYKIKKRTVEL 330
>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
Members of this protein family are uncharacterized
tetratricopeptide repeat (TPR) proteins invariably found
in heme biosynthesis gene clusters. The absence of any
invariant residues other than Ala argues against this
protein serving as an enzyme per se. The gene symbol
hemY assigned in E. coli is unfortunate in that an
unrelated protein, protoporphyrinogen oxidase (HemG in
E. coli) is designated HemY in Bacillus subtilis
[Unknown function, General].
Length = 367
Score = 29.6 bits (67), Expect = 4.7
Identities = 32/105 (30%), Positives = 47/105 (44%), Gaps = 15/105 (14%)
Query: 340 NLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERY 399
L LYG R + + PL KRA EK L K+HPD A L L LC Q + + + Y
Sbjct: 277 ELLRLYG-RLQPGDPSPLIKRA----EKWL-KKHPDDALLLLALGRLCLRQQLWGKAQSY 330
Query: 400 YQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQ 444
+ + L L P + + LA + + G + A Y++
Sbjct: 331 LEAS-----LSLAPTEEAHLE----LAQLFEQLGDTEAAAQHYRK 366
>gnl|CDD|221956 pfam13174, TPR_6, Tetratricopeptide repeat.
Length = 33
Score = 25.9 bits (58), Expect = 4.7
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 424 NLASCYLKQGKYKEAEILYKQVLTR 448
LA YLK G EA+ +++L R
Sbjct: 5 KLALAYLKLGDTDEAKEALERLLKR 29
Score = 25.9 bits (58), Expect = 4.7
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 504 NLASCYLKQGKYKEAEILYKQVLTR 528
LA YLK G EA+ +++L R
Sbjct: 5 KLALAYLKLGDTDEAKEALERLLKR 29
>gnl|CDD|224631 COG1717, RPL32, Ribosomal protein L32E [Translation, ribosomal
structure and biogenesis].
Length = 133
Score = 28.5 bits (64), Expect = 5.0
Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 19/92 (20%)
Query: 186 DNDDRNNLSPTPPSQFAQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDL 245
D+ R L PP V GY P +R LH G EV V ++DL
Sbjct: 47 DSKMRRKLKGKPP-----MVKIGYRSPKAVRGLH-------PSGYEEVLV----HNVKDL 90
Query: 246 EKTSGKYEAAETLEDCALRSRKESYDIVKQAK 277
EK + +AA R R E I+++A+
Sbjct: 91 EKLDPETQAARIASTVGARKRIE---ILERAR 119
>gnl|CDD|221262 pfam11846, DUF3366, Domain of unknown function (DUF3366). This
domain is functionally uncharacterized. This domain is
found in bacteria. This presumed domain is about 200
amino acids in length.
Length = 193
Score = 28.9 bits (65), Expect = 5.1
Identities = 21/114 (18%), Positives = 30/114 (26%), Gaps = 22/114 (19%)
Query: 425 LASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDR--------------QK 470
Y+ + Y E L + TR N +WQDR
Sbjct: 64 AVLAYMLRD-YHRVERLTRFEGTR-PANITPLRRVTNPLLWQDRFDWADYATALLLGITP 121
Query: 471 NKAKNREKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQ 524
Y++ AL++ K P LA G EA +
Sbjct: 122 QNPALLAPYHEWALKLLRYKPRP------AVYRRLALALALLGDPAEARDWMAE 169
>gnl|CDD|226409 COG3893, COG3893, Inactivated superfamily I helicase [DNA
replication, recombination, and repair].
Length = 697
Score = 29.4 bits (66), Expect = 5.1
Identities = 30/109 (27%), Positives = 42/109 (38%), Gaps = 14/109 (12%)
Query: 236 PLCKQALEDLEKTSGKYEAAETLEDCALRSRKESYDIVKQAKV--AQILGSGGHDHPDVA 293
PL + LE L+ +AA LE ALR K S + + A++LG+ D P
Sbjct: 399 PLARFGLEFLDLR----DAANVLELAALRGGKVSPGYGELPALLEARLLGTSDDDLPPAW 454
Query: 294 TMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLA 342
L + A LL ++ + LG H A L LA
Sbjct: 455 RAG----LDPDN----IALARLLLQRISEAKAPLGAGHFEAIADLLALA 495
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 28.6 bits (64), Expect = 5.2
Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 93 LQNVE---AEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL-EFM 148
LQN++ Q L+ + RL +N L+ +++L+ Q S +EE+ + L + M
Sbjct: 89 LQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTLIDIM 148
Query: 149 SSIRK 153
RK
Sbjct: 149 DRARK 153
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.7 bits (67), Expect = 5.2
Identities = 18/94 (19%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
Query: 61 VEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDE 120
+EE+ + ++ NE Q + L+ +E E ++LR ++ L+++ +
Sbjct: 659 LEEKVEELEAEIRRELQRIENEEQ-LEEKLEELEQLEEELEQLREELEELLKKLGEIEQL 717
Query: 121 LANTQQKLQSSEQANSQLEEEKKHLEFMSSIRKY 154
+ + + E+ +LE+ +K LE + +R+
Sbjct: 718 IEELESRKAELEELKKELEKLEKALELLEELREK 751
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 29.6 bits (66), Expect = 5.5
Identities = 15/58 (25%), Positives = 28/58 (48%)
Query: 89 LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
A Q Q+ Q+ + Q+ A +++ +++LQ E A ++LE+E LE
Sbjct: 222 RAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLE 279
Score = 28.5 bits (63), Expect = 9.7
Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 1/78 (1%)
Query: 70 KNSLEIIELGLNEAQV-MTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKL 128
+ L + A+ A LQ E++ +R ++ Q A + ELA ++
Sbjct: 87 RTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQA 146
Query: 129 QSSEQANSQLEEEKKHLE 146
Q + L E+++ LE
Sbjct: 147 QDLQTRLKTLAEQRRQLE 164
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 28.4 bits (64), Expect = 5.8
Identities = 14/57 (24%), Positives = 29/57 (50%)
Query: 90 AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
+++E +KL ++ RL L+ +L + +++L + QLE++ K LE
Sbjct: 49 LEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLE 105
>gnl|CDD|216192 pfam00920, ILVD_EDD, Dehydratase family.
Length = 521
Score = 29.4 bits (67), Expect = 6.0
Identities = 20/56 (35%), Positives = 25/56 (44%), Gaps = 17/56 (30%)
Query: 308 KYKEAANLLN-DALTIREKTLGEN---------HAAVAATLNN-------LAVLYG 346
K A LL+ DALT+ +TLGEN V L+N LA+L G
Sbjct: 295 KELLDAGLLHGDALTVTGRTLGENLAEAPILDPDGDVIRPLDNPFKPEGGLAILKG 350
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 29.3 bits (66), Expect = 6.8
Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 7/57 (12%)
Query: 89 LAGHLQNVEAEKQKLRTQVRRLV-------QENAWLRDELANTQQKLQSSEQANSQL 138
L Q +A+ +LR + + + A L +LA+ Q Q+ EQ L
Sbjct: 355 LVQRQQANDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQQYQDL 411
>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF. Members of
this protein family are the product of one of seven
genes regularly clustered in operons to encode the
proteins of the tol-pal system, which is critical for
maintaining the integrity of the bacterial outer
membrane. The gene for this periplasmic protein has been
designated orf2 and ybgF. All members of the seed
alignment were from unique tol-pal gene regions from
completed bacterial genomes. The architecture of this
protein is a signal sequence, a low-complexity region
usually rich in Asn and Gln, a well-conserved region
with tandem repeats that resemble the tetratricopeptide
(TPR) repeat, involved in protein-protein interaction.
Length = 117
Score = 27.6 bits (62), Expect = 7.2
Identities = 21/115 (18%), Positives = 39/115 (33%), Gaps = 16/115 (13%)
Query: 337 TLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQ---NQSKY 393
+ A+L K G Y +A + L K++P N L + Q Y
Sbjct: 2 AYYDAALLVLKAGDYADAIQAFQAFL--------KKYPKSTYAPNAHYWLGEAYYAQGDY 53
Query: 394 EEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTR 448
+ + + ++ Y P L + G ++A+ +QV+ R
Sbjct: 54 ADAAKAFLAVVKKY-----PKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKR 103
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 28.7 bits (65), Expect = 7.2
Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 11/72 (15%)
Query: 80 LNEAQVMTALAGHLQN----VEAEKQKLRTQVRRLVQENAWLR-------DELANTQQKL 128
L AQ A A Q + Q+L ++R+L++E LR +AN QQ++
Sbjct: 28 LAAAQQTVAAAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEI 87
Query: 129 QSSEQANSQLEE 140
S +Q Q+E+
Sbjct: 88 ASLQQQIEQIEK 99
>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
biogenesis, outer membrane].
Length = 284
Score = 28.9 bits (65), Expect = 7.8
Identities = 17/51 (33%), Positives = 20/51 (39%), Gaps = 6/51 (11%)
Query: 109 RLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL--EFMSSIRKYDSD 157
L EN L+ ELA +Q L+ LEEE K L D D
Sbjct: 70 DLALENEELKKELAELEQLLEE----VESLEEENKRLKELLDFKESSSDYD 116
>gnl|CDD|184448 PRK14010, PRK14010, potassium-transporting ATPase subunit B;
Provisional.
Length = 673
Score = 28.9 bits (64), Expect = 7.9
Identities = 25/105 (23%), Positives = 38/105 (36%), Gaps = 13/105 (12%)
Query: 195 PTPPSQFAQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKYE- 253
P S F + V A YE T I V L + DL + G+Y
Sbjct: 321 PVKSSSFERLVKAAYESSIADDTPEGRSI-----------VKLAYKQHIDLPQEVGEYIP 369
Query: 254 -AAETLEDCALRSRKESYDIVKQAKVAQILGSGGHDHPDVATMLN 297
AET + +E Y + V ++ +GGH D+ ++
Sbjct: 370 FTAETRMSGVKFTTREVYKGAPNSMVKRVKEAGGHIPVDLDALVK 414
>gnl|CDD|240030 cd04673, Nudix_Hydrolase_15, Members of the Nudix hydrolase
superfamily catalyze the hydrolysis of NUcleoside
DIphosphates linked to other moieties, X. Enzymes
belonging to this superfamily require a divalent cation,
such as Mg2+ or Mn2+, for their activity and contain a
highly conserved 23-residue nudix motif
(GX5EX7REUXEEXGU, where U = I, L or V), which functions
as a metal binding and catalytic site. Substrates of
nudix hydrolases include intact and oxidatively damaged
nucleoside triphosphates, dinucleoside polyphosphates,
nucleotide-sugars and dinucleotide enzymes. These
substrates are metabolites or cell signaling molecules
that require regulation during different stages of the
cell cycle or during periods of stress. In general, the
role of the nudix hydrolase is to sanitize the
nucleotide pools and to maintain cell viability, thereby
serving as surveillance & "house-cleaning" enzymes.
Substrate specificity is used to define families within
the superfamily. Differences in substrate specificity
are determined by the N-terminal extension or by
residues in variable loop regions. Mechanistically,
substrate hydrolysis occurs by a nucleophilic
substitution reaction, with variation in the numbers and
roles of divalent cations required.
Length = 122
Score = 27.6 bits (62), Expect = 8.4
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 2/34 (5%)
Query: 250 GKYEAAETLEDCALRSRKESYDIVKQAKVAQILG 283
GK E ETLE ALR E + A+V ++L
Sbjct: 32 GKVELGETLEQAALRELLEETGLE--AEVGRLLT 63
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 28.8 bits (65), Expect = 8.9
Identities = 15/54 (27%), Positives = 28/54 (51%)
Query: 93 LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
L ++ E L+ Q+ +E A + QQ+L + E ++LEE+++ LE
Sbjct: 144 LHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELE 197
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 28.4 bits (64), Expect = 9.1
Identities = 21/99 (21%), Positives = 43/99 (43%), Gaps = 4/99 (4%)
Query: 62 EEEKRNMVKNSLEIIELGLN-EAQVMTALAGHLQNV-EAEKQKLRTQVRRLVQENAWLRD 119
E E N +K L + L E + + L L++ E + + ++++L+QE
Sbjct: 166 ELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVK 225
Query: 120 ELANTQQKLQSSEQANSQLEEEKKHLEFMSSIRKYDSDI 158
+L +++LQ E L +K E + I + + +
Sbjct: 226 KLEELEEELQELESKIEDLTNKK--SELNTEIAEAEKKL 262
>gnl|CDD|233734 TIGR02120, GspF, type II secretion system protein F. This membrane
protein is a component of the terminal branch complex of
the general secretion pathway (GSP), also known as
the"Type II" secretion pathway. The GSP transports
proteins (generally virulence-associated cell wall
hydrolases) across the outer membrase of the bacterial
cell. Transport across the inner membrane is often, but
not exclusively handled by the Sec system. This model
was constructed from the broader subfamily model,
pfam00482 which includes components of pilin complexes
(PilC) as well as other related genes. GspF is nearly
always gene clustered with other GSP subunits. Some
genes from Xylella and Xanthomonas strains score below
the trusted cutoff due to excessive divergence from the
family such that a sequence from Deinococcus which does
not appear to be GspF scores higher [Protein fate,
Protein and peptide secretion and trafficking].
Length = 399
Score = 28.4 bits (64), Expect = 9.2
Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 6/57 (10%)
Query: 51 SLSE--TQHGGPVEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRT 105
SL++ QH R +V E V+ LA +L+ +A + K+ T
Sbjct: 112 SLADALAQHPRDFPPLYRALVAAG----EASGALDAVLERLADYLEERQALRSKITT 164
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 27.6 bits (62), Expect = 9.5
Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 89 LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLEF 147
L L +E E Q+L Q ++ ++E + +Q Q EQ ++ E+K L F
Sbjct: 25 LQEQLTQLEQELQQLEFQGQKAIKEIR--KQSAQQIEQIQQQVEQERAKRLEQKNQLLF 81
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.128 0.355
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,853,063
Number of extensions: 2604159
Number of successful extensions: 3064
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2881
Number of HSP's successfully gapped: 340
Length of query: 546
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 444
Effective length of database: 6,413,494
Effective search space: 2847591336
Effective search space used: 2847591336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (27.2 bits)