RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10738
(609 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 56.6 bits (137), Expect = 5e-10
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 32/131 (24%)
Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMV 121
LGN Y+ LGDY +A++Y+++ L L + D + NL +GKY+EA+
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNAD------AYYNLAAAYYKLGKYEEALE 55
Query: 122 CCKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIK 181
++ LE+ A YNLG Y+ GK +EA++
Sbjct: 56 DYEKALELDPDNAK------AYYNLGLAYYKLGK--------------------YEEALE 89
Query: 182 YYEQNLALMKE 192
YE+ L L
Sbjct: 90 AYEKALELDPN 100
Score = 54.3 bits (131), Expect = 2e-09
Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 12/108 (11%)
Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASE 261
A NLGN YY LG++ +A+ Y+++ L++ + D A NL ++ LG+Y+ A E
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNAD------AYYNLAAAYYKLGKYEEALE 55
Query: 262 HYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHLRHLII 309
Y++ L L D A+A Y+LG Y L Y A++ + + L +
Sbjct: 56 DYEKALELDPD------NAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
Score = 50.1 bits (120), Expect = 1e-07
Identities = 37/126 (29%), Positives = 60/126 (47%), Gaps = 32/126 (25%)
Query: 142 ALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGR 201
AL NLGN+Y+ G EA++YYE+ AL E++ A
Sbjct: 2 ALLNLGNLYYKLGD--------------------YDEALEYYEK--AL--ELDPDNA--D 35
Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASE 261
A NL YY LG +++A+ +++ L++ + A NLG ++ LG+Y+ A E
Sbjct: 36 AYYNLAAAYYKLGKYEEALEDYEKALELDPDNAK------AYYNLGLAYYKLGKYEEALE 89
Query: 262 HYKRTL 267
Y++ L
Sbjct: 90 AYEKAL 95
Score = 46.6 bits (111), Expect = 1e-06
Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 32/125 (25%)
Query: 105 NLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQD 164
NLGN +G YDEA+ ++ LE+ A YNL Y+ GK
Sbjct: 5 NLGNLYYKLGDYDEALEYYEKALELDP------DNADAYYNLAAAYYKLGK--------- 49
Query: 165 PGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQ 224
+EA++ YE+ L L + A NLG YY LG +++A+ ++
Sbjct: 50 -----------YEEALEDYEKALELDPDNAK------AYYNLGLAYYKLGKYEEALEAYE 92
Query: 225 ERLKI 229
+ L++
Sbjct: 93 KALEL 97
Score = 45.8 bits (109), Expect = 3e-06
Identities = 28/102 (27%), Positives = 52/102 (50%), Gaps = 10/102 (9%)
Query: 28 EGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHKQDL 87
G K GD + +++ A++ D+ + Y L AY+ LG Y++A++ +++ L
Sbjct: 6 LGNLYYKLGDYDEALEYYEKALELDPDN----ADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 88 TLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEI 129
L + AK+ NLG +GKY+EA+ ++ LE+
Sbjct: 62 ELDPDN------AKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
Score = 42.4 bits (100), Expect = 4e-05
Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 12/108 (11%)
Query: 245 NLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHL 304
NLGN + LG+Y A E+Y++ L L D A A Y+L Y L Y A++ +
Sbjct: 5 NLGNLYYKLGDYDEALEYYEKALELDPD------NADAYYNLAAAYYKLGKYEEALEDYE 58
Query: 305 RHLIIAQQLMDRVGEGRACWSLGNAHAARGNHEKALYFATKHLEISKQ 352
+ L + +A ++LG A+ G +E+AL K LE+
Sbjct: 59 KALELDPD------NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
Score = 32.4 bits (74), Expect = 0.16
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 6/60 (10%)
Query: 281 QACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWSLGNAHAARGNHEKAL 340
+A +LGN Y L DY A++Y+ + A +L A ++L A+ G +E+AL
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEK----ALEL--DPDNADAYYNLAAAYYKLGKYEEAL 54
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 54.5 bits (129), Expect = 5e-08
Identities = 57/260 (21%), Positives = 104/260 (40%), Gaps = 41/260 (15%)
Query: 25 LALEGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHK 84
L L L K G + + A++ + L L+ LG LG Y++A++ +
Sbjct: 62 LLLLALALLKLGRLEEALELLEKALEL--ELLPNLAEALLNLGLLLEALGKYEEALELLE 119
Query: 85 QDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKLSEGRALY 144
+ L L + + L +G Y+EA+ ++ LE+ +LN+ AL
Sbjct: 120 KALALDPDPDLAEALLAL-----GALYELGDYEEALELYEKALELDPELNEL---AEALL 171
Query: 145 NLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACG 204
LG + A G+ +EA++ E+ L L + + A
Sbjct: 172 ALGALLEALGR--------------------YEEALELLEKALKLNPDDD-----AEALL 206
Query: 205 NLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASEHYK 264
NLG Y LG +++A+ Y+++ L++ + + NL + LG Y+ A E +
Sbjct: 207 NLGLLYLKLGKYEEALEYYEKALELDPDNAEALY------NLALLLLELGRYEEALEALE 260
Query: 265 RTLVLAQDLGDRAVEAQACY 284
+ L L DL + +
Sbjct: 261 KALELDPDLYNLGLALLLLL 280
Score = 50.6 bits (119), Expect = 9e-07
Identities = 57/254 (22%), Positives = 104/254 (40%), Gaps = 43/254 (16%)
Query: 63 YSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVC 122
L A LG ++A++ ++ A +E A++ NLG L+ +GKY+EA+
Sbjct: 62 LLLLALALLKLGRLEEALELLEK----ALELELLPNLAEALLNLGLLLEALGKYEEALEL 117
Query: 123 CKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKY 182
++ L + + + G +EA++
Sbjct: 118 LEKALALDPDPDLA-----EALLALGALYELGD--------------------YEEALEL 152
Query: 183 YEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRA 242
YE+ L L E+N+ A LG LG +++A+ ++ LK+ + + A
Sbjct: 153 YEKALELDPELNELAEALL---ALGALLEALGRYEEALELLEKALKL-----NPDDDAEA 204
Query: 243 NSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDY 302
NLG ++ LG+Y+ A E+Y++ L L D A+A Y+L L Y A++
Sbjct: 205 LLNLGLLYLKLGKYEEALEYYEKALELDPD------NAEALYNLALLLLELGRYEEALEA 258
Query: 303 HLRHLIIAQQLMDR 316
+ L + L +
Sbjct: 259 LEKALELDPDLYNL 272
Score = 39.8 bits (91), Expect = 0.003
Identities = 43/176 (24%), Positives = 75/176 (42%), Gaps = 17/176 (9%)
Query: 176 LQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGD 235
L+EA++ E+ L L N A NLG LG +++A+ ++ L + +
Sbjct: 75 LEEALELLEKALELELLPNL----AEALLNLGLLLEALGKYEEALELLEKALALDPDPDL 130
Query: 236 KAAERRANSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRD 295
A + LG+Y+ A E Y++ L L +L + A +A +LG L
Sbjct: 131 AEALLALGA-----LYELGDYEEALELYEKALELDPELNELA---EALLALGALLEALGR 182
Query: 296 YPTAIDYHLRHLIIAQQLMDRVGEGRACWSLGNAHAARGNHEKALYFATKHLEISK 351
Y A++ + L + + A +LG + G +E+AL + K LE+
Sbjct: 183 YEEALELLEKALKLNPDD-----DAEALLNLGLLYLKLGKYEEALEYYEKALELDP 233
Score = 33.3 bits (74), Expect = 0.34
Identities = 28/113 (24%), Positives = 50/113 (44%), Gaps = 9/113 (7%)
Query: 24 ELALEGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYH 83
L G L G + + A++ DD LG Y LG Y++A++Y+
Sbjct: 169 ALLALGALLEALGRYEEALELLEKALKLNPDDDA---EALLNLGLLYLKLGKYEEALEYY 225
Query: 84 KQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQLNDK 136
++ L L + A++ NL L +G+Y+EA+ ++ LE+ L +
Sbjct: 226 EKALELDPD------NAEALYNLALLLLELGRYEEALEALEKALELDPDLYNL 272
Score = 31.4 bits (69), Expect = 1.2
Identities = 30/144 (20%), Positives = 52/144 (36%), Gaps = 13/144 (9%)
Query: 210 YYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASEHYKRTLVL 269
LLG +A+ +E L++ + L + + LG + A E ++ L L
Sbjct: 33 LELLGELAEALELLEEALELLPN----SDLAGLLLLLALALLKLGRLEEALELLEKALEL 88
Query: 270 AQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWSLGNA 329
A+A +LG L Y A++ + L + A
Sbjct: 89 ELLPNL----AEALLNLGLLLEALGKYEEALELLEKALALDPDPDLA-----EALLALGA 139
Query: 330 HAARGNHEKALYFATKHLEISKQL 353
G++E+AL K LE+ +L
Sbjct: 140 LYELGDYEEALELYEKALELDPEL 163
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 49.7 bits (119), Expect = 7e-08
Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 1/69 (1%)
Query: 205 NLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAER-RANSNLGNSHIFLGEYQAASEHY 263
NL LG++ +A+ ++ L++ARE G+ E RA +NL ++ LG+Y A E+
Sbjct: 10 NLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYL 69
Query: 264 KRTLVLAQD 272
++ L L +
Sbjct: 70 EKALALREA 78
Score = 49.3 bits (118), Expect = 1e-07
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 57 RTLSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGE-AKSSGNLGNTLKVMGK 115
L+A + L LGDY +A++ ++ L LAR + + E A++ NL +G
Sbjct: 2 PDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGD 61
Query: 116 YDEAMVCCKRHLEISRQ 132
YDEA+ ++ L +
Sbjct: 62 YDEALEYLEKALALREA 78
Score = 44.7 bits (106), Expect = 4e-06
Identities = 20/93 (21%), Positives = 34/93 (36%), Gaps = 21/93 (22%)
Query: 141 RALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQG 200
AL NL V G EA++ E+ L L +E+ + +
Sbjct: 6 AALNNLALVLRRLGD--------------------YDEALELLEKALELARELGEDHPET 45
Query: 201 RAC-GNLGNTYYLLGNFKQAIYYHQERLKIARE 232
NL Y LG++ +A+ Y ++ L +
Sbjct: 46 ARALNNLARLYLALGDYDEALEYLEKALALREA 78
Score = 43.9 bits (104), Expect = 7e-06
Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 21/94 (22%)
Query: 100 AKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQL-NDKLSEGRALYNLGNVYHAKGKAIG 158
A + NL L+ +G YDEA+ ++ LE++R+L D RAL NL +Y A G
Sbjct: 5 AAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGD--- 61
Query: 159 KVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKE 192
EA++Y E+ LAL +
Sbjct: 62 -----------------YDEALEYLEKALALREA 78
Score = 42.7 bits (101), Expect = 2e-05
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 241 RANSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVE-AQACYSLGNTYTLLRDYPTA 299
A +NL LG+Y A E ++ L LA++LG+ E A+A +L Y L DY A
Sbjct: 6 AALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEA 65
Query: 300 IDYHLRHLIIAQQ 312
++Y + L + +
Sbjct: 66 LEYLEKALALREA 78
Score = 40.4 bits (95), Expect = 1e-04
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 280 AQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWS-LGNAHAARGNHEK 338
A A +L L DY A++ + L +A++L + E + L + A G++++
Sbjct: 5 AAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDE 64
Query: 339 ALYFATKHLEISKQ 352
AL + K L + +
Sbjct: 65 ALEYLEKALALREA 78
Score = 37.3 bits (87), Expect = 0.002
Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 5/68 (7%)
Query: 29 GERLCKAGDCRAGVAFFQAAIQ----AGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHK 84
L + GD + + A++ G D T A + L Y LGDY +A++Y +
Sbjct: 12 ALVLRRLGDYDEALELLEKALELARELGEDHPETARA-LNNLARLYLALGDYDEALEYLE 70
Query: 85 QDLTLARN 92
+ L L
Sbjct: 71 KALALREA 78
>gnl|CDD|145376 pfam02188, GoLoco, GoLoco motif.
Length = 23
Score = 45.0 bits (108), Expect = 9e-07
Identities = 16/23 (69%), Positives = 20/23 (86%)
Query: 443 EESFFELLSRFQSERMDDQRCSL 465
+E FF+LL+R QS R+DDQRCSL
Sbjct: 1 DEDFFDLLARAQSSRLDDQRCSL 23
Score = 43.5 bits (104), Expect = 3e-06
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 567 DEDFFSLITRLQSGRMEDQRATV 589
DEDFF L+ R QS R++DQR ++
Sbjct: 1 DEDFFDLLARAQSSRLDDQRCSL 23
Score = 41.5 bits (99), Expect = 1e-05
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 536 DDGFLDMLIRCQGARLEDQRSHL 558
D+ F D+L R Q +RL+DQR L
Sbjct: 1 DEDFFDLLARAQSSRLDDQRCSL 23
Score = 38.5 bits (91), Expect = 2e-04
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 502 DDDLLDMIVGLQSKRMDEQRAAL 524
D+D D++ QS R+D+QR +L
Sbjct: 1 DEDFFDLLARAQSSRLDDQRCSL 23
>gnl|CDD|214645 smart00390, GoLoco, LGN motif, putative GEFs specific for G-alpha
GTPases. GEF specific for Galpha_i proteins.
Length = 23
Score = 40.9 bits (97), Expect = 3e-05
Identities = 15/23 (65%), Positives = 17/23 (73%)
Query: 443 EESFFELLSRFQSERMDDQRCSL 465
+E F+LL R QS RMDDQRC L
Sbjct: 1 DEDLFDLLLRMQSSRMDDQRCEL 23
Score = 40.2 bits (95), Expect = 4e-05
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 567 DEDFFSLITRLQSGRMEDQRATV 589
DED F L+ R+QS RM+DQR +
Sbjct: 1 DEDLFDLLLRMQSSRMDDQRCEL 23
Score = 39.0 bits (92), Expect = 1e-04
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 536 DDGFLDMLIRCQGARLEDQRSHL 558
D+ D+L+R Q +R++DQR L
Sbjct: 1 DEDLFDLLLRMQSSRMDDQRCEL 23
Score = 36.7 bits (86), Expect = 8e-04
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 502 DDDLLDMIVGLQSKRMDEQRAAL 524
D+DL D+++ +QS RMD+QR L
Sbjct: 1 DEDLFDLLLRMQSSRMDDQRCEL 23
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 38.2 bits (90), Expect = 3e-04
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARNM 93
LGNAY LGDY +A++Y+++ L L N
Sbjct: 3 ALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 37.8 bits (89), Expect = 3e-04
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 201 RACGNLGNTYYLLGNFKQAIYYHQERLKIAREF 233
A NLGN Y LG++ +A+ Y+++ L++
Sbjct: 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 34.0 bits (79), Expect = 0.009
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 100 AKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQ 132
A++ NLGN +G YDEA+ ++ LE+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 31.3 bits (72), Expect = 0.066
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 280 AQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQ 312
A+A Y+LGN Y L DY A++Y+ + L +
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 29.0 bits (66), Expect = 0.50
Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 20/49 (40%)
Query: 141 RALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLAL 189
ALYNLGN Y G EA++YYE+ L L
Sbjct: 2 EALYNLGNAYLKLGD--------------------YDEALEYYEKALEL 30
Score = 29.0 bits (66), Expect = 0.54
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 241 RANSNLGNSHIFLGEYQAASEHYKRTLVLAQD 272
A NLGN+++ LG+Y A E+Y++ L L +
Sbjct: 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 29.0 bits (66), Expect = 0.58
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 321 RACWSLGNAHAARGNHEKALYFATKHLEI 349
A ++LGNA+ G++++AL + K LE+
Sbjct: 2 EALYNLGNAYLKLGDYDEALEYYEKALEL 30
>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat.
Length = 36
Score = 37.2 bits (87), Expect = 5e-04
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 63 YSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDK 96
S LG Y LGDY+KA+ +++ L LA++ ED+
Sbjct: 2 LSNLGRLYRKLGDYEKAISLYERALALAKDPEDR 35
Score = 31.4 bits (72), Expect = 0.060
Identities = 12/36 (33%), Positives = 22/36 (61%)
Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKA 237
A NLG Y LG++++AI ++ L +A++ D+
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36
Score = 31.1 bits (71), Expect = 0.11
Identities = 17/36 (47%), Positives = 22/36 (61%)
Query: 242 ANSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRA 277
A SNLG + LG+Y+ A Y+R L LA+D DR
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36
Score = 31.1 bits (71), Expect = 0.11
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 282 ACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQL 313
A +LG Y L DY AI + R L +A+
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKDP 32
Score = 29.1 bits (66), Expect = 0.53
Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 20/54 (37%)
Query: 142 ALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEIND 195
AL NLG +Y G ++AI YE+ LAL K+ D
Sbjct: 1 ALSNLGRLYRKLGD--------------------YEKAISLYERALALAKDPED 34
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 34.7 bits (81), Expect = 0.005
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 100 AKSSGNLGNTLKVMGKYDEAMVCCKRHLEI 129
AK+ NLGN +GKYDEA+ ++ LE+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALEL 30
Score = 34.7 bits (81), Expect = 0.005
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 201 RACGNLGNTYYLLGNFKQAIYYHQERLKIAREF 233
+A NLGN Y LG + +A+ Y+++ L++
Sbjct: 2 KALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 34.3 bits (80), Expect = 0.006
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARN 92
LGNAY LG Y +A++Y+++ L L N
Sbjct: 3 ALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 29.7 bits (68), Expect = 0.25
Identities = 17/49 (34%), Positives = 21/49 (42%), Gaps = 20/49 (40%)
Query: 141 RALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLAL 189
+ALYNLGN Y GK EA++YYE+ L L
Sbjct: 2 KALYNLGNAYLKLGK--------------------YDEALEYYEKALEL 30
Score = 28.9 bits (66), Expect = 0.45
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 245 NLGNSHIFLGEYQAASEHYKRTLVLAQD 272
NLGN+++ LG+Y A E+Y++ L L +
Sbjct: 6 NLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 28.9 bits (66), Expect = 0.50
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 280 AQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQ 312
A+A Y+LGN Y L Y A++Y+ + L +
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 25.5 bits (57), Expect = 8.5
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 326 LGNAHAARGNHEKALYFATKHLEI 349
LGNA+ G +++AL + K LE+
Sbjct: 7 LGNAYLKLGKYDEALEYYEKALEL 30
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 33.8 bits (78), Expect = 0.024
Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 59 LSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGK-YD 117
+ LGNA F LGDY +A++ +++ L L + A++ NL +GK Y+
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPD------NAEAYYNLALAYLKLGKDYE 55
Query: 118 EAMVCCKRHLEIS 130
EA+ ++ LE+
Sbjct: 56 EALEDLEKALELD 68
Score = 33.1 bits (76), Expect = 0.042
Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 7/67 (10%)
Query: 205 NLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGE-YQAASEHY 263
NLGN + LG++ +AI +++ L++ + + A NL +++ LG+ Y+ A E
Sbjct: 8 NLGNALFKLGDYDEAIEAYEKALELDPDNAE------AYYNLALAYLKLGKDYEEALEDL 61
Query: 264 KRTLVLA 270
++ L L
Sbjct: 62 EKALELD 68
Score = 30.7 bits (70), Expect = 0.29
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 23 LELALEGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLG-DYQKAMQ 81
L G L K GD + ++ A++ D+ + Y L AY LG DY++A++
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDN----AEAYYNLALAYLKLGKDYEEALE 59
Query: 82 YHKQDLTLA 90
++ L L
Sbjct: 60 DLEKALELD 68
Score = 28.0 bits (63), Expect = 2.5
Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 27/90 (30%)
Query: 142 ALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGR 201
AL NLGN G EAI+ YE+ L E++ A+
Sbjct: 5 ALKNLGNALFKLGD--------------------YDEAIEAYEKAL----ELDPDNAEAY 40
Query: 202 ACGNLGNTYYLLG-NFKQAIYYHQERLKIA 230
NL Y LG ++++A+ ++ L++
Sbjct: 41 Y--NLALAYLKLGKDYEEALEDLEKALELD 68
>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
Provisional.
Length = 172
Score = 35.4 bits (82), Expect = 0.034
Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 23/101 (22%)
Query: 74 GDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVM-------GKYDEAMVCCKRH 126
G+Y +A++ +++ L L + D+ L M G++D+A+ +
Sbjct: 49 GEYAEALENYEEALKLEEDPNDR----------SYILYNMGIIYASNGEHDKALEYYHQA 98
Query: 127 LEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGE 167
LE LN K AL N+ +YH +G+ + G QD E
Sbjct: 99 LE----LNPKQPS--ALNNIAVIYHKRGEKAEEAGDQDEAE 133
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 30.9 bits (71), Expect = 0.10
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARN 92
LG AY+ LGDY++A++ +++ L L N
Sbjct: 3 ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
Score = 27.1 bits (61), Expect = 2.4
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 201 RACGNLGNTYYLLGNFKQAIYYHQE 225
A NLG YY LG++++A+ +++
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEK 26
Score = 26.7 bits (60), Expect = 3.3
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 241 RANSNLGNSHIFLGEYQAASEHYKRTLVLAQD 272
A NLG ++ LG+Y+ A E Y++ L L +
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 33.5 bits (77), Expect = 0.22
Identities = 45/249 (18%), Positives = 79/249 (31%), Gaps = 42/249 (16%)
Query: 53 TDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKV 112
T D + I QL Y GD + A + + L + A + L +
Sbjct: 24 TTDRNKAAKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYL----AYLA--LALYYQQ 77
Query: 113 MGKYDEAMVCCKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGEYPEEV 172
+G+ ++A + + LN G L N G +GK
Sbjct: 78 LGELEKA----EDSFRRALTLNPN--NGDVLNNYGTFLCQQGKY---------------- 115
Query: 173 KVCLQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLKIARE 232
++A++ +EQ + R+ N G G+F +A Y L+I +
Sbjct: 116 ----EQAMQQFEQAIED----PLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQ 167
Query: 233 FGDK---AAERRANS-NLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGN 288
+ AE ++ +L YQ +L L + + A G
Sbjct: 168 RPESLLELAELYYLRGQYKDARAYLERYQQTYNQTAESLWLGIRIARALGDVAAAQRYGA 227
Query: 289 TYTLLRDYP 297
L + +P
Sbjct: 228 --QLQKLFP 234
>gnl|CDD|151112 pfam10579, Rapsyn_N, Rapsyn N-terminal myristoylation and linker
region. Neuromuscular junction formation relies upon
the clustering of acetylcholine receptors and other
proteins in the muscle membrane. Rapsyn is a peripheral
membrane protein that is selectively concentrated at the
neuromuscular junction and is essential for the
formation of synaptic acetylcholine receptor aggregates.
Acetylcholine receptors fail to aggregate beneath nerve
terminals in mice where rapsyn has been knocked out. The
N-terminal six amino acids of rapsyn are its
myristoylation site, and myristoylation is necessary for
the targeting of the protein to the membrane.
Length = 80
Score = 31.0 bits (70), Expect = 0.28
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 315 DRVGEGRACWSLGNAHAARGNHEKALYFATKHLEISKQLGDP 356
DR G R L AH+ G +++ L FA + +++L DP
Sbjct: 38 DREGRFRVLGCLITAHSEMGKYKEMLEFAVAQIGTARELEDP 79
>gnl|CDD|205553 pfam13374, TPR_10, Tetratricopeptide repeat.
Length = 42
Score = 29.8 bits (68), Expect = 0.29
Identities = 8/31 (25%), Positives = 15/31 (48%)
Query: 61 AIYSQLGNAYFYLGDYQKAMQYHKQDLTLAR 91
A + L A G Y++A + ++ L + R
Sbjct: 3 ASLNNLARALRAQGRYEEAEELLEEALAIRR 33
Score = 27.9 bits (63), Expect = 1.5
Identities = 9/36 (25%), Positives = 14/36 (38%), Gaps = 1/36 (2%)
Query: 201 RACGNLGNTYYLLGNFKQAIYYHQERLKIARE-FGD 235
+ NL G +++A +E L I R G
Sbjct: 3 ASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGP 38
Score = 25.6 bits (57), Expect = 9.1
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 105 NLGNTLKVMGKYDEAMVCCKRHLEISRQL 133
NL L+ G+Y+EA + L I R++
Sbjct: 7 NLARALRAQGRYEEAEELLEEALAIRRRV 35
>gnl|CDD|200469 cd11330, AmyAc_OligoGlu, Alpha amylase catalytic domain found in
oligo-1,6-glucosidase (also called isomaltase;
sucrase-isomaltase; alpha-limit dextrinase) and related
proteins. Oligo-1,6-glucosidase (EC 3.2.1.10)
hydrolyzes the alpha-1,6-glucosidic linkage of
isomalto-oligosaccharides, pannose, and dextran. Unlike
alpha-1,4-glucosidases (EC 3.2.1.20), it fails to
hydrolyze the alpha-1,4-glucosidic bonds of
maltosaccharides. The Alpha-amylase family comprises the
largest family of glycoside hydrolases (GH), with the
majority of enzymes acting on starch, glycogen, and
related oligo- and polysaccharides. These proteins
catalyze the transformation of alpha-1,4 and alpha-1,6
glucosidic linkages with retention of the anomeric
center. The protein is described as having 3 domains: A,
B, C. A is a (beta/alpha) 8-barrel; B is a loop between
the beta 3 strand and alpha 3 helix of A; C is the
C-terminal extension characterized by a Greek key. The
majority of the enzymes have an active site cleft found
between domains A and B where a triad of catalytic
residues (Asp, Glu and Asp) performs catalysis. Other
members of this family have lost the catalytic activity
as in the case of the human 4F2hc, or only have 2
residues that serve as the catalytic nucleophile and the
acid/base, such as Thermus A4 beta-galactosidase with 2
Glu residues (GH42) and human alpha-galactosidase with 2
Asp residues (GH31). The family members are quite
extensive and include: alpha amylase,
maltosyltransferase, cyclodextrin glycotransferase,
maltogenic amylase, neopullulanase, isoamylase,
1,4-alpha-D-glucan maltotetrahydrolase,
4-alpha-glucotransferase, oligo-1,6-glucosidase,
amylosucrase, sucrose phosphorylase, and amylomaltase.
Length = 472
Score = 33.0 bits (76), Expect = 0.43
Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 12/80 (15%)
Query: 261 EHYKRTLVLAQDLGDRAVEAQACYSLGN-------TYTLLRDYPTAIDYHLRHLIIAQQL 313
E+ R LV + D +E A Y+ G ++ LL +A +R + +
Sbjct: 256 EYPGRFLV-GEVSDDDPLEVMAEYTSGGDRLHMAYSFDLLGRPFSA--AVVRDAL--EAF 310
Query: 314 MDRVGEGRACWSLGNAHAAR 333
+G CW+ N R
Sbjct: 311 EAEAPDGWPCWAFSNHDVPR 330
>gnl|CDD|223861 COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function
prediction only].
Length = 292
Score = 31.9 bits (72), Expect = 0.92
Identities = 30/154 (19%), Positives = 52/154 (33%), Gaps = 20/154 (12%)
Query: 210 YYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASEHYKRTLVL 269
+ A+ + L + A ++ G + +Y A + Y++
Sbjct: 11 SLFAAAEENAVSTLGKLLSEGGDPLMAAVALKSALLNGAGSAYPPDYAKALKSYEK---- 66
Query: 270 AQDLGDRAVEAQACYSLG----NTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWS 325
A +LGD A A LG + RD A D+ + G A ++
Sbjct: 67 AAELGD----AAALALLGQMYGAGKGVSRDKTKAADW-------YRCAAAD-GLAEALFN 114
Query: 326 LGNAHAARGNHEKALYFATKHLEISKQLGDPLGQ 359
LG +A L A K+ E + +LG+
Sbjct: 115 LGLMYANGRGVPLDLVKALKYYEKAAKLGNVEAA 148
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 27.4 bits (62), Expect = 1.7
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 62 IYSQLGNAYFYLGDYQKAMQY 82
Y LG Y LGDY++A +Y
Sbjct: 3 AYYLLGQIYLQLGDYEEAKEY 23
>gnl|CDD|234389 TIGR03903, TOMM_kin_cyc, TOMM system kinase/cyclase fusion protein.
This model represents proteins of 1350 in length, in
multiple species of Burkholderia, in Acidovorax avenae
subsp. citrulli AAC00-1 and Delftia acidovorans SPH-1,
and in multiple copies in Sorangium cellulosum, in
genomic neighborhoods that include a
cyclodehydratase/docking scaffold fusion protein
(TIGR03882) and a member of the thiazole/oxazole modified
metabolite (TOMM) precursor family TIGR03795. It has a
kinase domain in the N-terminal 300 amino acids, followed
by a cyclase homology domain, followed by regions without
named domain definitions. It is a probable
bacteriocin-like metabolite biosynthesis protein
[Cellular processes, Toxin production and resistance].
Length = 1266
Score = 30.6 bits (69), Expect = 3.2
Identities = 26/107 (24%), Positives = 41/107 (38%), Gaps = 7/107 (6%)
Query: 47 AAIQAGTDDLRTLSAIYSQLGNAYFYL------GDYQKAMQYHKQDLTLARNMEDKLGEA 100
A +A D LR A G Y A+ + L LA + +++ A
Sbjct: 1157 AGARADIDALRQSGCGLGLTYYASLLAELDAERGRYGAALAIIDECLALAESPDERYYLA 1216
Query: 101 KSSGNLGNTLKVMG-KYDEAMVCCKRHLEISRQLNDKLSEGRALYNL 146
+ G L+ G + A C +R +E++RQ K E +A L
Sbjct: 1217 ELLLKKGRCLRAAGERAVAAEACLRRAVEVARQQGMKRIERKAAAAL 1263
>gnl|CDD|221855 pfam12921, ATP13, Mitochondrial ATPase expression. ATP13 is
necessary for the expression of subunit 9 of
mitochondrial ATPase. The protein has a basic amino
terminal signal sequence that is cleaved upon import
into mitochondria.
Length = 125
Score = 28.8 bits (65), Expect = 3.6
Identities = 9/28 (32%), Positives = 11/28 (39%), Gaps = 4/28 (14%)
Query: 55 DLRTLSAIYSQLGNAYFYLGDYQKAMQY 82
R L AI S +Y Y D A+
Sbjct: 50 SSRLLKAIVS----SYGYNNDISSALSI 73
>gnl|CDD|220735 pfam10408, Ufd2P_core, Ubiquitin elongating factor core. This is
the most conserved part of the core region of Ufd2P
ubiquitin elongating factor or E4, running from helix
alpha-11 to alpha-38. It consists of 31 helices of
variable length connected by loops of variable size
forming a compact unit; the helical packing pattern of
the compact unit consists of five structural repeats
that resemble tandem Armadillo (ARM) repeats. This
domain is involved in ubiquitination as it binds Cdc48p
and escorts ubiquitinated proteins from Cdc48p to the
proteasome for degradation. The core is structurally
similar to the nuclear transporter protein
importin-alpha. The core is associated with the U-box at
the C-terminus, pfam04564, which has ligase activity.
Length = 625
Score = 29.9 bits (68), Expect = 4.3
Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
Query: 66 LGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCK 124
L +LG ++ +K+ L + ++D+L E + S + LK + K E + K
Sbjct: 174 LTLRALHLG-IGPLIEKYKRLLRELKRLQDELEELEQSRSNWAQLKRLEKQLEILSAEK 231
>gnl|CDD|219814 pfam08385, DHC_N1, Dynein heavy chain, N-terminal region 1. Dynein
heavy chains interact with other heavy chains to form
dimers, and with intermediate chain-light chain
complexes to form a basal cargo binding unit. The region
featured in this family includes the sequences
implicated in mediating these interactions. It is
thought to be flexible and not to adopt a rigid
conformation.
Length = 577
Score = 29.6 bits (67), Expect = 5.3
Identities = 21/106 (19%), Positives = 36/106 (33%), Gaps = 24/106 (22%)
Query: 170 EEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAI--------- 220
+K L EA + Y + L ++ ++D + +L L +
Sbjct: 85 RGLKDALNEA-QDYNKFLKDLEPLDDL----LSATDLQKIREALPPIFNHLRLKWSISKY 139
Query: 221 YYHQERL-----KIAREFGDKAAERRANSNLGNSHIFLGEYQAASE 261
YY ER KI+ + D+ S L S + +Y E
Sbjct: 140 YYPLERALVLLEKISNDLIDQL-----KSYLSPSDLMSLDYDEFLE 180
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 29.7 bits (67), Expect = 5.6
Identities = 66/393 (16%), Positives = 126/393 (32%), Gaps = 110/393 (27%)
Query: 34 KAGDCRAGVAFFQAAIQAGTDDLRT---LSAIYSQLGNAYFYLGDYQKAMQYHKQDLT-- 88
+ G+ + F+ + +LR L+ +Y + GN + +KA + + Q++
Sbjct: 511 QEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPA 570
Query: 89 --LARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKLSEGRALYNL 146
LA+ G L L ++ +EA E L GRA
Sbjct: 571 LALAQYYL-------GKGQLKKALAIL---NEAADAAPDSPEAWLML------GRAQLAA 614
Query: 147 GNVYHAKGKAIGKVGQQDPGEYPEEVKVC--------LQEAIKYYEQNLAL--------- 189
G++ A + K+ P + + +AI ++ L L
Sbjct: 615 GDLNKAV-SSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQI 673
Query: 190 ---------------------MKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLK 228
+++ + AA G G+ Y ++ AI +++ LK
Sbjct: 674 GLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFEL--EGDLYLRQKDYPAAIQAYRKALK 731
Query: 229 IA-------------REFGDKA-AERR-------------ANSNLGNSHIFLGEYQAASE 261
A G+ A A + + L ++ +Y A +
Sbjct: 732 RAPSSQNAIKLHRALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIK 791
Query: 262 HYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHLRHLIIA---QQLMDRVG 318
HY+ + A D + V + P A++Y R L +A ++D
Sbjct: 792 HYQTVVKKAPD--NAVVLNNLAWLYLE-----LKDPRALEYAERALKLAPNIPAILD--- 841
Query: 319 EGRACWSLGNAHAARGNHEKALYFATKHLEISK 351
+LG +G ++AL K + I+
Sbjct: 842 ------TLGWLLVEKGEADRALPLLRKAVNIAP 868
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 29.7 bits (67), Expect = 5.7
Identities = 13/91 (14%), Positives = 21/91 (23%), Gaps = 15/91 (16%)
Query: 205 NLGNTYYLLGNFKQAIYYHQERLK-----------IAREFGDKAAERRANSNLGNSHIFL 253
NL G ++AI L +A+ + + A +
Sbjct: 379 NLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARAEGYALA 438
Query: 254 GEYQAASEHYKRTLVLAQDLGDRAVEAQACY 284
G + A R A A
Sbjct: 439 GRLEQAIIFLMR----ASQQVKLGFPDWARA 465
>gnl|CDD|176247 cd08287, FDH_like_ADH3, formaldehyde dehydrogenase (FDH)-like.
This group contains proteins identified as alcohol
dehydrogenases and glutathione-dependant formaldehyde
dehydrogenases (FDH) of the zinc-dependent/medium chain
alcohol dehydrogenase family. The MDR family uses
NAD(H) as a cofactor in the interconversion of alcohols
and aldehydes, or ketones. FDH converts formaldehyde
and NAD to formate and NADH. The initial step in this
process the spontaneous formation of a
S-(hydroxymethyl)glutathione adduct from formaldehyde
and glutathione, followed by FDH-mediated oxidation (and
detoxification) of the adduct to S-formylglutathione.
The medium chain alcohol dehydrogenase family (MDR) has
a NAD(P)(H)-binding domain in a Rossmann fold of a
beta-alpha form. The N-terminal region typically has an
all-beta catalytic domain. These proteins typically form
dimers (typically higher plants, mammals) or tetramers
(yeast, bacteria), and have 2 tightly bound zinc atoms
per subunit.
Length = 345
Score = 29.2 bits (66), Expect = 6.7
Identities = 12/24 (50%), Positives = 16/24 (66%), Gaps = 2/24 (8%)
Query: 219 AIYYHQERLKIAREFG--DKAAER 240
A+ H++R +AREFG D AER
Sbjct: 198 AMSRHEDRQALAREFGATDIVAER 221
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 29.2 bits (65), Expect = 7.1
Identities = 15/90 (16%), Positives = 36/90 (40%), Gaps = 1/90 (1%)
Query: 413 EQLDLIKLTPENKGKQSGSSNSSELSKCDEEESF-FELLSRFQSERMDDQRCSLSARQSE 471
+Q+ + K + N E K + ++F + S+ + +D+ ++
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREP 332
Query: 472 NKENLQRITNTKKSQPSAPRATTIPAHTPG 501
E+LQ+ ++QP++ I + P
Sbjct: 333 VAEDLQKTKPQVEAQPTSLNEDAIDSSNPV 362
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 25.6 bits (57), Expect = 8.2
Identities = 8/38 (21%), Positives = 17/38 (44%), Gaps = 4/38 (10%)
Query: 44 FFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQ 81
++ A++ ++ + Y L LG Y +A+Q
Sbjct: 1 LYEKALELDPNN----AEAYYNLALLLLNLGQYDEALQ 34
>gnl|CDD|193336 pfam12862, Apc5, Anaphase-promoting complex subunit 5. Apc5 is a
subunit of the anaphase-promoting complex/cyclosome
(APC/C) which is a multisubunit ubiquitin ligase that
mediates the proteolysis of cell cycle proteins in
mitosis and G1. Apc5, although it does not harbour a
classical RNA binding domain, Apc5 binds the poly(A)
binding protein (PABP), which directly binds the
internal ribosome entry site (IRES) of growth factor 2
mRNA. PABP was found to enhance IRES-mediated
translation, whereas Apc5 over-expression counteracted
this effect. In addition to its association with the
APC/C complex, Apc5 binds much heavier complexes and
co-sediments with the ribosomal fraction. The N-terminus
of Afi1 serves to stabilise the union between Apc4 and
Apc5, both of which lie towards the bottom-front of the
APC. This region of the Apc5 member proteins carries a
TPR-like motif.
Length = 91
Score = 27.2 bits (61), Expect = 8.2
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKA 237
A NL + + G+ ++AI +E +++ARE D A
Sbjct: 40 ALLNLASLHADFGHNEEAIAAIEEAIRLARENKDTA 75
>gnl|CDD|187766 cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Cas1 is the most universal CRISPR system protein thought
to be involved in spacer integration; Cas1 is
metal-dependent deoxyribonuclease, also binds RNA; Shown
to possess a unique fold consisting of a N-terminal
beta-strand domain and a C-terminal alpha-helical
domain.
Length = 317
Score = 28.7 bits (65), Expect = 8.7
Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 5/71 (7%)
Query: 217 KQAIYY--HQERLKIAREF-GDKAAERRAN-SNLGNSHIFLGEYQAASEHYKRTLVLAQD 272
Q Y ++RL++ARE K +R L A E L A+
Sbjct: 89 AQYEAYLDPEKRLELAREIVRGKIRNQRRVLKRYARDGKELLLALAELEELLEKLDKAKS 148
Query: 273 LGD-RAVEAQA 282
+ + R +E A
Sbjct: 149 IEELRGIEGNA 159
>gnl|CDD|130680 TIGR01619, hyp_HI0040, TIGR01619 family protein. This model
represents a hypothetical equivalog of gamma
proteobacteria, includes HI0040. These sequences do not
have any similarity to known proteins by PSI-BLAST.
Length = 249
Score = 28.4 bits (63), Expect = 9.0
Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 524 LPHLADRENAIPDDGFLDMLIRCQGARLEDQRSHLP 559
+ AD ENA+PD+ D LI Q ++ Q S LP
Sbjct: 42 FDYAADDENALPDESEYDKLIN-QIFKILTQLSALP 76
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 28.5 bits (64), Expect = 9.3
Identities = 48/248 (19%), Positives = 75/248 (30%), Gaps = 76/248 (30%)
Query: 55 DLRTLSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMED----------KLGEAKSSG 104
D + QL Y GDY +A + ++ L + KLGE +
Sbjct: 30 DRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLAD 89
Query: 105 ------------------NLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKL--SEGRALY 144
N G L G+ +EAM + R L D L
Sbjct: 90 ESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQ------QFERALADPAYGEPSDTLE 143
Query: 145 NLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACG 204
NLG G+ +A +Y ++ L L + +
Sbjct: 144 NLGLCALKAGQ--------------------FDQAEEYLKRALELDPQFPPALLE----- 178
Query: 205 NLGNTYYLLGNFKQAIYYHQERL--------------KIAREFGDKAAERRANSNLGNSH 250
L +Y G++ A Y + +IA+ GD+AA +R + L
Sbjct: 179 -LARLHYKAGDYAPARLYLERYQQRGGAQAESLLLGIRIAKRLGDRAAAQRYQAQLQRLF 237
Query: 251 IFLGEYQA 258
+ EYQ
Sbjct: 238 PYSEEYQT 245
>gnl|CDD|237959 PRK15404, PRK15404, leucine ABC transporter subunit
substrate-binding protein LivK; Provisional.
Length = 369
Score = 28.8 bits (65), Expect = 9.3
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 7/34 (20%)
Query: 561 EGSTVPDEDFFSLITRLQSGRMEDQRATVPSVYF 594
EG T D+DF +LI +L+ V VY+
Sbjct: 196 EGITAGDKDFSALIAKLKK-------ENVDFVYY 222
>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
[Carbohydrate transport and metabolism].
Length = 389
Score = 28.9 bits (65), Expect = 9.5
Identities = 21/63 (33%), Positives = 28/63 (44%), Gaps = 10/63 (15%)
Query: 230 AREFGDKAAER-----RANSNLGNSHIFLGEYQAASEHYKRTLVLAQD---LGDRAVEAQ 281
ARE KA + RA+ LG + G+YQ A E +R VL Q+ L +
Sbjct: 199 ARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALER--VLEQNPEYLSEVLEMLY 256
Query: 282 ACY 284
CY
Sbjct: 257 ECY 259
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.132 0.379
Gapped
Lambda K H
0.267 0.0834 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,507,517
Number of extensions: 2989358
Number of successful extensions: 2737
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2666
Number of HSP's successfully gapped: 106
Length of query: 609
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 506
Effective length of database: 6,369,140
Effective search space: 3222784840
Effective search space used: 3222784840
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.5 bits)