RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10738
         (609 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 56.6 bits (137), Expect = 5e-10
 Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 32/131 (24%)

Query: 62  IYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMV 121
               LGN Y+ LGDY +A++Y+++ L L  +  D      +  NL      +GKY+EA+ 
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNAD------AYYNLAAAYYKLGKYEEALE 55

Query: 122 CCKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIK 181
             ++ LE+            A YNLG  Y+  GK                     +EA++
Sbjct: 56  DYEKALELDPDNAK------AYYNLGLAYYKLGK--------------------YEEALE 89

Query: 182 YYEQNLALMKE 192
            YE+ L L   
Sbjct: 90  AYEKALELDPN 100



 Score = 54.3 bits (131), Expect = 2e-09
 Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 12/108 (11%)

Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASE 261
           A  NLGN YY LG++ +A+ Y+++ L++  +  D      A  NL  ++  LG+Y+ A E
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNAD------AYYNLAAAYYKLGKYEEALE 55

Query: 262 HYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHLRHLII 309
            Y++ L L  D       A+A Y+LG  Y  L  Y  A++ + + L +
Sbjct: 56  DYEKALELDPD------NAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97



 Score = 50.1 bits (120), Expect = 1e-07
 Identities = 37/126 (29%), Positives = 60/126 (47%), Gaps = 32/126 (25%)

Query: 142 ALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGR 201
           AL NLGN+Y+  G                       EA++YYE+  AL  E++   A   
Sbjct: 2   ALLNLGNLYYKLGD--------------------YDEALEYYEK--AL--ELDPDNA--D 35

Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASE 261
           A  NL   YY LG +++A+  +++ L++  +         A  NLG ++  LG+Y+ A E
Sbjct: 36  AYYNLAAAYYKLGKYEEALEDYEKALELDPDNAK------AYYNLGLAYYKLGKYEEALE 89

Query: 262 HYKRTL 267
            Y++ L
Sbjct: 90  AYEKAL 95



 Score = 46.6 bits (111), Expect = 1e-06
 Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 32/125 (25%)

Query: 105 NLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQD 164
           NLGN    +G YDEA+   ++ LE+            A YNL   Y+  GK         
Sbjct: 5   NLGNLYYKLGDYDEALEYYEKALELDP------DNADAYYNLAAAYYKLGK--------- 49

Query: 165 PGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQ 224
                       +EA++ YE+ L L  +         A  NLG  YY LG +++A+  ++
Sbjct: 50  -----------YEEALEDYEKALELDPDNAK------AYYNLGLAYYKLGKYEEALEAYE 92

Query: 225 ERLKI 229
           + L++
Sbjct: 93  KALEL 97



 Score = 45.8 bits (109), Expect = 3e-06
 Identities = 28/102 (27%), Positives = 52/102 (50%), Gaps = 10/102 (9%)

Query: 28  EGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHKQDL 87
            G    K GD    + +++ A++   D+    +  Y  L  AY+ LG Y++A++ +++ L
Sbjct: 6   LGNLYYKLGDYDEALEYYEKALELDPDN----ADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 88  TLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEI 129
            L  +       AK+  NLG     +GKY+EA+   ++ LE+
Sbjct: 62  ELDPDN------AKAYYNLGLAYYKLGKYEEALEAYEKALEL 97



 Score = 42.4 bits (100), Expect = 4e-05
 Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 12/108 (11%)

Query: 245 NLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHL 304
           NLGN +  LG+Y  A E+Y++ L L  D       A A Y+L   Y  L  Y  A++ + 
Sbjct: 5   NLGNLYYKLGDYDEALEYYEKALELDPD------NADAYYNLAAAYYKLGKYEEALEDYE 58

Query: 305 RHLIIAQQLMDRVGEGRACWSLGNAHAARGNHEKALYFATKHLEISKQ 352
           + L +           +A ++LG A+   G +E+AL    K LE+   
Sbjct: 59  KALELDPD------NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 32.4 bits (74), Expect = 0.16
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 6/60 (10%)

Query: 281 QACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWSLGNAHAARGNHEKAL 340
           +A  +LGN Y  L DY  A++Y+ +    A +L        A ++L  A+   G +E+AL
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEK----ALEL--DPDNADAYYNLAAAYYKLGKYEEAL 54


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 54.5 bits (129), Expect = 5e-08
 Identities = 57/260 (21%), Positives = 104/260 (40%), Gaps = 41/260 (15%)

Query: 25  LALEGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHK 84
           L L    L K G     +   + A++   + L  L+     LG     LG Y++A++  +
Sbjct: 62  LLLLALALLKLGRLEEALELLEKALEL--ELLPNLAEALLNLGLLLEALGKYEEALELLE 119

Query: 85  QDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKLSEGRALY 144
           + L L  + +               L  +G Y+EA+   ++ LE+  +LN+      AL 
Sbjct: 120 KALALDPDPDLAEALLAL-----GALYELGDYEEALELYEKALELDPELNEL---AEALL 171

Query: 145 NLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACG 204
            LG +  A G+                     +EA++  E+ L L  + +       A  
Sbjct: 172 ALGALLEALGR--------------------YEEALELLEKALKLNPDDD-----AEALL 206

Query: 205 NLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASEHYK 264
           NLG  Y  LG +++A+ Y+++ L++  +  +         NL    + LG Y+ A E  +
Sbjct: 207 NLGLLYLKLGKYEEALEYYEKALELDPDNAEALY------NLALLLLELGRYEEALEALE 260

Query: 265 RTLVLAQDLGDRAVEAQACY 284
           + L L  DL +  +      
Sbjct: 261 KALELDPDLYNLGLALLLLL 280



 Score = 50.6 bits (119), Expect = 9e-07
 Identities = 57/254 (22%), Positives = 104/254 (40%), Gaps = 43/254 (16%)

Query: 63  YSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVC 122
              L  A   LG  ++A++  ++    A  +E     A++  NLG  L+ +GKY+EA+  
Sbjct: 62  LLLLALALLKLGRLEEALELLEK----ALELELLPNLAEALLNLGLLLEALGKYEEALEL 117

Query: 123 CKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKY 182
            ++ L +    +                +  G                      +EA++ 
Sbjct: 118 LEKALALDPDPDLA-----EALLALGALYELGD--------------------YEEALEL 152

Query: 183 YEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRA 242
           YE+ L L  E+N+ A        LG     LG +++A+   ++ LK+     +   +  A
Sbjct: 153 YEKALELDPELNELAEALL---ALGALLEALGRYEEALELLEKALKL-----NPDDDAEA 204

Query: 243 NSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDY 302
             NLG  ++ LG+Y+ A E+Y++ L L  D       A+A Y+L      L  Y  A++ 
Sbjct: 205 LLNLGLLYLKLGKYEEALEYYEKALELDPD------NAEALYNLALLLLELGRYEEALEA 258

Query: 303 HLRHLIIAQQLMDR 316
             + L +   L + 
Sbjct: 259 LEKALELDPDLYNL 272



 Score = 39.8 bits (91), Expect = 0.003
 Identities = 43/176 (24%), Positives = 75/176 (42%), Gaps = 17/176 (9%)

Query: 176 LQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGD 235
           L+EA++  E+ L L    N       A  NLG     LG +++A+   ++ L +  +   
Sbjct: 75  LEEALELLEKALELELLPNL----AEALLNLGLLLEALGKYEEALELLEKALALDPDPDL 130

Query: 236 KAAERRANSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRD 295
             A     +        LG+Y+ A E Y++ L L  +L + A   +A  +LG     L  
Sbjct: 131 AEALLALGA-----LYELGDYEEALELYEKALELDPELNELA---EALLALGALLEALGR 182

Query: 296 YPTAIDYHLRHLIIAQQLMDRVGEGRACWSLGNAHAARGNHEKALYFATKHLEISK 351
           Y  A++   + L +         +  A  +LG  +   G +E+AL +  K LE+  
Sbjct: 183 YEEALELLEKALKLNPDD-----DAEALLNLGLLYLKLGKYEEALEYYEKALELDP 233



 Score = 33.3 bits (74), Expect = 0.34
 Identities = 28/113 (24%), Positives = 50/113 (44%), Gaps = 9/113 (7%)

Query: 24  ELALEGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYH 83
            L   G  L   G     +   + A++   DD          LG  Y  LG Y++A++Y+
Sbjct: 169 ALLALGALLEALGRYEEALELLEKALKLNPDDDA---EALLNLGLLYLKLGKYEEALEYY 225

Query: 84  KQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQLNDK 136
           ++ L L  +       A++  NL   L  +G+Y+EA+   ++ LE+   L + 
Sbjct: 226 EKALELDPD------NAEALYNLALLLLELGRYEEALEALEKALELDPDLYNL 272



 Score = 31.4 bits (69), Expect = 1.2
 Identities = 30/144 (20%), Positives = 52/144 (36%), Gaps = 13/144 (9%)

Query: 210 YYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASEHYKRTLVL 269
             LLG   +A+   +E L++       +        L  + + LG  + A E  ++ L L
Sbjct: 33  LELLGELAEALELLEEALELLPN----SDLAGLLLLLALALLKLGRLEEALELLEKALEL 88

Query: 270 AQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWSLGNA 329
                     A+A  +LG     L  Y  A++   + L +                   A
Sbjct: 89  ELLPNL----AEALLNLGLLLEALGKYEEALELLEKALALDPDPDLA-----EALLALGA 139

Query: 330 HAARGNHEKALYFATKHLEISKQL 353
               G++E+AL    K LE+  +L
Sbjct: 140 LYELGDYEEALELYEKALELDPEL 163


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 49.7 bits (119), Expect = 7e-08
 Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 1/69 (1%)

Query: 205 NLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAER-RANSNLGNSHIFLGEYQAASEHY 263
           NL      LG++ +A+   ++ L++ARE G+   E  RA +NL   ++ LG+Y  A E+ 
Sbjct: 10  NLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYL 69

Query: 264 KRTLVLAQD 272
           ++ L L + 
Sbjct: 70  EKALALREA 78



 Score = 49.3 bits (118), Expect = 1e-07
 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 57  RTLSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGE-AKSSGNLGNTLKVMGK 115
             L+A  + L      LGDY +A++  ++ L LAR + +   E A++  NL      +G 
Sbjct: 2   PDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGD 61

Query: 116 YDEAMVCCKRHLEISRQ 132
           YDEA+   ++ L +   
Sbjct: 62  YDEALEYLEKALALREA 78



 Score = 44.7 bits (106), Expect = 4e-06
 Identities = 20/93 (21%), Positives = 34/93 (36%), Gaps = 21/93 (22%)

Query: 141 RALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQG 200
            AL NL  V    G                       EA++  E+ L L +E+ +   + 
Sbjct: 6   AALNNLALVLRRLGD--------------------YDEALELLEKALELARELGEDHPET 45

Query: 201 RAC-GNLGNTYYLLGNFKQAIYYHQERLKIARE 232
                NL   Y  LG++ +A+ Y ++ L +   
Sbjct: 46  ARALNNLARLYLALGDYDEALEYLEKALALREA 78



 Score = 43.9 bits (104), Expect = 7e-06
 Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 21/94 (22%)

Query: 100 AKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQL-NDKLSEGRALYNLGNVYHAKGKAIG 158
           A +  NL   L+ +G YDEA+   ++ LE++R+L  D     RAL NL  +Y A G    
Sbjct: 5   AAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGD--- 61

Query: 159 KVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKE 192
                              EA++Y E+ LAL + 
Sbjct: 62  -----------------YDEALEYLEKALALREA 78



 Score = 42.7 bits (101), Expect = 2e-05
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 1/73 (1%)

Query: 241 RANSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVE-AQACYSLGNTYTLLRDYPTA 299
            A +NL      LG+Y  A E  ++ L LA++LG+   E A+A  +L   Y  L DY  A
Sbjct: 6   AALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEA 65

Query: 300 IDYHLRHLIIAQQ 312
           ++Y  + L + + 
Sbjct: 66  LEYLEKALALREA 78



 Score = 40.4 bits (95), Expect = 1e-04
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 1/74 (1%)

Query: 280 AQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWS-LGNAHAARGNHEK 338
           A A  +L      L DY  A++   + L +A++L +   E     + L   + A G++++
Sbjct: 5   AAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDE 64

Query: 339 ALYFATKHLEISKQ 352
           AL +  K L + + 
Sbjct: 65  ALEYLEKALALREA 78



 Score = 37.3 bits (87), Expect = 0.002
 Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 5/68 (7%)

Query: 29 GERLCKAGDCRAGVAFFQAAIQ----AGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHK 84
             L + GD    +   + A++     G D   T  A  + L   Y  LGDY +A++Y +
Sbjct: 12 ALVLRRLGDYDEALELLEKALELARELGEDHPETARA-LNNLARLYLALGDYDEALEYLE 70

Query: 85 QDLTLARN 92
          + L L   
Sbjct: 71 KALALREA 78


>gnl|CDD|145376 pfam02188, GoLoco, GoLoco motif. 
          Length = 23

 Score = 45.0 bits (108), Expect = 9e-07
 Identities = 16/23 (69%), Positives = 20/23 (86%)

Query: 443 EESFFELLSRFQSERMDDQRCSL 465
           +E FF+LL+R QS R+DDQRCSL
Sbjct: 1   DEDFFDLLARAQSSRLDDQRCSL 23



 Score = 43.5 bits (104), Expect = 3e-06
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 567 DEDFFSLITRLQSGRMEDQRATV 589
           DEDFF L+ R QS R++DQR ++
Sbjct: 1   DEDFFDLLARAQSSRLDDQRCSL 23



 Score = 41.5 bits (99), Expect = 1e-05
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 536 DDGFLDMLIRCQGARLEDQRSHL 558
           D+ F D+L R Q +RL+DQR  L
Sbjct: 1   DEDFFDLLARAQSSRLDDQRCSL 23



 Score = 38.5 bits (91), Expect = 2e-04
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 502 DDDLLDMIVGLQSKRMDEQRAAL 524
           D+D  D++   QS R+D+QR +L
Sbjct: 1   DEDFFDLLARAQSSRLDDQRCSL 23


>gnl|CDD|214645 smart00390, GoLoco, LGN motif, putative GEFs specific for G-alpha
           GTPases.  GEF specific for Galpha_i proteins.
          Length = 23

 Score = 40.9 bits (97), Expect = 3e-05
 Identities = 15/23 (65%), Positives = 17/23 (73%)

Query: 443 EESFFELLSRFQSERMDDQRCSL 465
           +E  F+LL R QS RMDDQRC L
Sbjct: 1   DEDLFDLLLRMQSSRMDDQRCEL 23



 Score = 40.2 bits (95), Expect = 4e-05
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 567 DEDFFSLITRLQSGRMEDQRATV 589
           DED F L+ R+QS RM+DQR  +
Sbjct: 1   DEDLFDLLLRMQSSRMDDQRCEL 23



 Score = 39.0 bits (92), Expect = 1e-04
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 536 DDGFLDMLIRCQGARLEDQRSHL 558
           D+   D+L+R Q +R++DQR  L
Sbjct: 1   DEDLFDLLLRMQSSRMDDQRCEL 23



 Score = 36.7 bits (86), Expect = 8e-04
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 502 DDDLLDMIVGLQSKRMDEQRAAL 524
           D+DL D+++ +QS RMD+QR  L
Sbjct: 1   DEDLFDLLLRMQSSRMDDQRCEL 23


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
          or more copies in proteins. Contain a minimum of 34
          amino acids each and self-associate via a "knobs and
          holes" mechanism.
          Length = 34

 Score = 38.2 bits (90), Expect = 3e-04
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARNM 93
              LGNAY  LGDY +A++Y+++ L L  N 
Sbjct: 3  ALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 37.8 bits (89), Expect = 3e-04
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 201 RACGNLGNTYYLLGNFKQAIYYHQERLKIAREF 233
            A  NLGN Y  LG++ +A+ Y+++ L++    
Sbjct: 2   EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 34.0 bits (79), Expect = 0.009
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 100 AKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQ 132
           A++  NLGN    +G YDEA+   ++ LE+   
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 31.3 bits (72), Expect = 0.066
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 280 AQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQ 312
           A+A Y+LGN Y  L DY  A++Y+ + L +   
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 29.0 bits (66), Expect = 0.50
 Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 20/49 (40%)

Query: 141 RALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLAL 189
            ALYNLGN Y   G                       EA++YYE+ L L
Sbjct: 2   EALYNLGNAYLKLGD--------------------YDEALEYYEKALEL 30



 Score = 29.0 bits (66), Expect = 0.54
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 241 RANSNLGNSHIFLGEYQAASEHYKRTLVLAQD 272
            A  NLGN+++ LG+Y  A E+Y++ L L  +
Sbjct: 2   EALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 29.0 bits (66), Expect = 0.58
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 321 RACWSLGNAHAARGNHEKALYFATKHLEI 349
            A ++LGNA+   G++++AL +  K LE+
Sbjct: 2   EALYNLGNAYLKLGDYDEALEYYEKALEL 30


>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat. 
          Length = 36

 Score = 37.2 bits (87), Expect = 5e-04
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 63 YSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDK 96
           S LG  Y  LGDY+KA+  +++ L LA++ ED+
Sbjct: 2  LSNLGRLYRKLGDYEKAISLYERALALAKDPEDR 35



 Score = 31.4 bits (72), Expect = 0.060
 Identities = 12/36 (33%), Positives = 22/36 (61%)

Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKA 237
           A  NLG  Y  LG++++AI  ++  L +A++  D+ 
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36



 Score = 31.1 bits (71), Expect = 0.11
 Identities = 17/36 (47%), Positives = 22/36 (61%)

Query: 242 ANSNLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRA 277
           A SNLG  +  LG+Y+ A   Y+R L LA+D  DR 
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36



 Score = 31.1 bits (71), Expect = 0.11
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 282 ACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQL 313
           A  +LG  Y  L DY  AI  + R L +A+  
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKDP 32



 Score = 29.1 bits (66), Expect = 0.53
 Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 20/54 (37%)

Query: 142 ALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEIND 195
           AL NLG +Y   G                      ++AI  YE+ LAL K+  D
Sbjct: 1   ALSNLGRLYRKLGD--------------------YEKAISLYERALALAKDPED 34


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 34.7 bits (81), Expect = 0.005
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 100 AKSSGNLGNTLKVMGKYDEAMVCCKRHLEI 129
           AK+  NLGN    +GKYDEA+   ++ LE+
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALEL 30



 Score = 34.7 bits (81), Expect = 0.005
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 201 RACGNLGNTYYLLGNFKQAIYYHQERLKIAREF 233
           +A  NLGN Y  LG + +A+ Y+++ L++    
Sbjct: 2   KALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 34.3 bits (80), Expect = 0.006
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARN 92
              LGNAY  LG Y +A++Y+++ L L  N
Sbjct: 3  ALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 29.7 bits (68), Expect = 0.25
 Identities = 17/49 (34%), Positives = 21/49 (42%), Gaps = 20/49 (40%)

Query: 141 RALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLAL 189
           +ALYNLGN Y   GK                      EA++YYE+ L L
Sbjct: 2   KALYNLGNAYLKLGK--------------------YDEALEYYEKALEL 30



 Score = 28.9 bits (66), Expect = 0.45
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 245 NLGNSHIFLGEYQAASEHYKRTLVLAQD 272
           NLGN+++ LG+Y  A E+Y++ L L  +
Sbjct: 6   NLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 28.9 bits (66), Expect = 0.50
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 280 AQACYSLGNTYTLLRDYPTAIDYHLRHLIIAQQ 312
           A+A Y+LGN Y  L  Y  A++Y+ + L +   
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 25.5 bits (57), Expect = 8.5
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 326 LGNAHAARGNHEKALYFATKHLEI 349
           LGNA+   G +++AL +  K LE+
Sbjct: 7   LGNAYLKLGKYDEALEYYEKALEL 30


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 33.8 bits (78), Expect = 0.024
 Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 7/73 (9%)

Query: 59  LSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGK-YD 117
            +     LGNA F LGDY +A++ +++ L L  +       A++  NL      +GK Y+
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPD------NAEAYYNLALAYLKLGKDYE 55

Query: 118 EAMVCCKRHLEIS 130
           EA+   ++ LE+ 
Sbjct: 56  EALEDLEKALELD 68



 Score = 33.1 bits (76), Expect = 0.042
 Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 7/67 (10%)

Query: 205 NLGNTYYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGE-YQAASEHY 263
           NLGN  + LG++ +AI  +++ L++  +  +      A  NL  +++ LG+ Y+ A E  
Sbjct: 8   NLGNALFKLGDYDEAIEAYEKALELDPDNAE------AYYNLALAYLKLGKDYEEALEDL 61

Query: 264 KRTLVLA 270
           ++ L L 
Sbjct: 62  EKALELD 68



 Score = 30.7 bits (70), Expect = 0.29
 Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 5/69 (7%)

Query: 23 LELALEGERLCKAGDCRAGVAFFQAAIQAGTDDLRTLSAIYSQLGNAYFYLG-DYQKAMQ 81
            L   G  L K GD    +  ++ A++   D+    +  Y  L  AY  LG DY++A++
Sbjct: 4  EALKNLGNALFKLGDYDEAIEAYEKALELDPDN----AEAYYNLALAYLKLGKDYEEALE 59

Query: 82 YHKQDLTLA 90
            ++ L L 
Sbjct: 60 DLEKALELD 68



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 27/90 (30%)

Query: 142 ALYNLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGR 201
           AL NLGN     G                       EAI+ YE+ L    E++   A+  
Sbjct: 5   ALKNLGNALFKLGD--------------------YDEAIEAYEKAL----ELDPDNAEAY 40

Query: 202 ACGNLGNTYYLLG-NFKQAIYYHQERLKIA 230
              NL   Y  LG ++++A+   ++ L++ 
Sbjct: 41  Y--NLALAYLKLGKDYEEALEDLEKALELD 68


>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
           Provisional.
          Length = 172

 Score = 35.4 bits (82), Expect = 0.034
 Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 23/101 (22%)

Query: 74  GDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVM-------GKYDEAMVCCKRH 126
           G+Y +A++ +++ L L  +  D+             L  M       G++D+A+    + 
Sbjct: 49  GEYAEALENYEEALKLEEDPNDR----------SYILYNMGIIYASNGEHDKALEYYHQA 98

Query: 127 LEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGE 167
           LE    LN K     AL N+  +YH +G+   + G QD  E
Sbjct: 99  LE----LNPKQPS--ALNNIAVIYHKRGEKAEEAGDQDEAE 133


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
          includes outlying Tetratricopeptide-like repeats (TPR)
          that are not matched by pfam00515.
          Length = 34

 Score = 30.9 bits (71), Expect = 0.10
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 62 IYSQLGNAYFYLGDYQKAMQYHKQDLTLARN 92
              LG AY+ LGDY++A++ +++ L L  N
Sbjct: 3  ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33



 Score = 27.1 bits (61), Expect = 2.4
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 201 RACGNLGNTYYLLGNFKQAIYYHQE 225
            A  NLG  YY LG++++A+  +++
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEK 26



 Score = 26.7 bits (60), Expect = 3.3
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 241 RANSNLGNSHIFLGEYQAASEHYKRTLVLAQD 272
            A  NLG ++  LG+Y+ A E Y++ L L  +
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEKALELDPN 33


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 45/249 (18%), Positives = 79/249 (31%), Gaps = 42/249 (16%)

Query: 53  TDDLRTLSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKV 112
           T D    + I  QL   Y   GD + A +   + L    +       A  +  L    + 
Sbjct: 24  TTDRNKAAKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYL----AYLA--LALYYQQ 77

Query: 113 MGKYDEAMVCCKRHLEISRQLNDKLSEGRALYNLGNVYHAKGKAIGKVGQQDPGEYPEEV 172
           +G+ ++A    +     +  LN     G  L N G     +GK                 
Sbjct: 78  LGELEKA----EDSFRRALTLNPN--NGDVLNNYGTFLCQQGKY---------------- 115

Query: 173 KVCLQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLKIARE 232
               ++A++ +EQ +             R+  N G      G+F +A  Y    L+I  +
Sbjct: 116 ----EQAMQQFEQAIED----PLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQ 167

Query: 233 FGDK---AAERRANS-NLGNSHIFLGEYQAASEHYKRTLVLAQDLGDRAVEAQACYSLGN 288
             +     AE         ++  +L  YQ        +L L   +     +  A    G 
Sbjct: 168 RPESLLELAELYYLRGQYKDARAYLERYQQTYNQTAESLWLGIRIARALGDVAAAQRYGA 227

Query: 289 TYTLLRDYP 297
              L + +P
Sbjct: 228 --QLQKLFP 234


>gnl|CDD|151112 pfam10579, Rapsyn_N, Rapsyn N-terminal myristoylation and linker
           region.  Neuromuscular junction formation relies upon
           the clustering of acetylcholine receptors and other
           proteins in the muscle membrane. Rapsyn is a peripheral
           membrane protein that is selectively concentrated at the
           neuromuscular junction and is essential for the
           formation of synaptic acetylcholine receptor aggregates.
           Acetylcholine receptors fail to aggregate beneath nerve
           terminals in mice where rapsyn has been knocked out. The
           N-terminal six amino acids of rapsyn are its
           myristoylation site, and myristoylation is necessary for
           the targeting of the protein to the membrane.
          Length = 80

 Score = 31.0 bits (70), Expect = 0.28
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 315 DRVGEGRACWSLGNAHAARGNHEKALYFATKHLEISKQLGDP 356
           DR G  R    L  AH+  G +++ L FA   +  +++L DP
Sbjct: 38  DREGRFRVLGCLITAHSEMGKYKEMLEFAVAQIGTARELEDP 79


>gnl|CDD|205553 pfam13374, TPR_10, Tetratricopeptide repeat. 
          Length = 42

 Score = 29.8 bits (68), Expect = 0.29
 Identities = 8/31 (25%), Positives = 15/31 (48%)

Query: 61 AIYSQLGNAYFYLGDYQKAMQYHKQDLTLAR 91
          A  + L  A    G Y++A +  ++ L + R
Sbjct: 3  ASLNNLARALRAQGRYEEAEELLEEALAIRR 33



 Score = 27.9 bits (63), Expect = 1.5
 Identities = 9/36 (25%), Positives = 14/36 (38%), Gaps = 1/36 (2%)

Query: 201 RACGNLGNTYYLLGNFKQAIYYHQERLKIARE-FGD 235
            +  NL       G +++A    +E L I R   G 
Sbjct: 3   ASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGP 38



 Score = 25.6 bits (57), Expect = 9.1
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 105 NLGNTLKVMGKYDEAMVCCKRHLEISRQL 133
           NL   L+  G+Y+EA    +  L I R++
Sbjct: 7   NLARALRAQGRYEEAEELLEEALAIRRRV 35


>gnl|CDD|200469 cd11330, AmyAc_OligoGlu, Alpha amylase catalytic domain found in
           oligo-1,6-glucosidase (also called isomaltase;
           sucrase-isomaltase; alpha-limit dextrinase) and related
           proteins.  Oligo-1,6-glucosidase (EC 3.2.1.10)
           hydrolyzes the alpha-1,6-glucosidic linkage of
           isomalto-oligosaccharides, pannose, and dextran. Unlike
           alpha-1,4-glucosidases (EC 3.2.1.20), it fails to
           hydrolyze the alpha-1,4-glucosidic bonds of
           maltosaccharides. The Alpha-amylase family comprises the
           largest family of glycoside hydrolases (GH), with the
           majority of enzymes acting on starch, glycogen, and
           related oligo- and polysaccharides. These proteins
           catalyze the transformation of alpha-1,4 and alpha-1,6
           glucosidic linkages with retention of the anomeric
           center. The protein is described as having 3 domains: A,
           B, C. A is a (beta/alpha) 8-barrel; B is a loop between
           the beta 3 strand and alpha 3 helix of A; C is the
           C-terminal extension characterized by a Greek key. The
           majority of the enzymes have an active site cleft found
           between domains A and B where a triad of catalytic
           residues (Asp, Glu and Asp) performs catalysis. Other
           members of this family have lost the catalytic activity
           as in the case of the human 4F2hc, or only have 2
           residues that serve as the catalytic nucleophile and the
           acid/base, such as Thermus A4 beta-galactosidase with 2
           Glu residues (GH42) and human alpha-galactosidase with 2
           Asp residues (GH31). The family members are quite
           extensive and include: alpha amylase,
           maltosyltransferase, cyclodextrin glycotransferase,
           maltogenic amylase, neopullulanase, isoamylase,
           1,4-alpha-D-glucan maltotetrahydrolase,
           4-alpha-glucotransferase, oligo-1,6-glucosidase,
           amylosucrase, sucrose phosphorylase, and amylomaltase.
          Length = 472

 Score = 33.0 bits (76), Expect = 0.43
 Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 12/80 (15%)

Query: 261 EHYKRTLVLAQDLGDRAVEAQACYSLGN-------TYTLLRDYPTAIDYHLRHLIIAQQL 313
           E+  R LV  +   D  +E  A Y+ G        ++ LL    +A    +R  +  +  
Sbjct: 256 EYPGRFLV-GEVSDDDPLEVMAEYTSGGDRLHMAYSFDLLGRPFSA--AVVRDAL--EAF 310

Query: 314 MDRVGEGRACWSLGNAHAAR 333
                +G  CW+  N    R
Sbjct: 311 EAEAPDGWPCWAFSNHDVPR 330


>gnl|CDD|223861 COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function
           prediction only].
          Length = 292

 Score = 31.9 bits (72), Expect = 0.92
 Identities = 30/154 (19%), Positives = 52/154 (33%), Gaps = 20/154 (12%)

Query: 210 YYLLGNFKQAIYYHQERLKIAREFGDKAAERRANSNLGNSHIFLGEYQAASEHYKRTLVL 269
                  + A+    + L    +    A   ++    G    +  +Y  A + Y++    
Sbjct: 11  SLFAAAEENAVSTLGKLLSEGGDPLMAAVALKSALLNGAGSAYPPDYAKALKSYEK---- 66

Query: 270 AQDLGDRAVEAQACYSLG----NTYTLLRDYPTAIDYHLRHLIIAQQLMDRVGEGRACWS 325
           A +LGD    A A   LG        + RD   A D+        +      G   A ++
Sbjct: 67  AAELGD----AAALALLGQMYGAGKGVSRDKTKAADW-------YRCAAAD-GLAEALFN 114

Query: 326 LGNAHAARGNHEKALYFATKHLEISKQLGDPLGQ 359
           LG  +A        L  A K+ E + +LG+    
Sbjct: 115 LGLMYANGRGVPLDLVKALKYYEKAAKLGNVEAA 148


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 27.4 bits (62), Expect = 1.7
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 62 IYSQLGNAYFYLGDYQKAMQY 82
           Y  LG  Y  LGDY++A +Y
Sbjct: 3  AYYLLGQIYLQLGDYEEAKEY 23


>gnl|CDD|234389 TIGR03903, TOMM_kin_cyc, TOMM system kinase/cyclase fusion protein.
            This model represents proteins of 1350 in length, in
            multiple species of Burkholderia, in Acidovorax avenae
            subsp. citrulli AAC00-1 and Delftia acidovorans SPH-1,
            and in multiple copies in Sorangium cellulosum, in
            genomic neighborhoods that include a
            cyclodehydratase/docking scaffold fusion protein
            (TIGR03882) and a member of the thiazole/oxazole modified
            metabolite (TOMM) precursor family TIGR03795. It has a
            kinase domain in the N-terminal 300 amino acids, followed
            by a cyclase homology domain, followed by regions without
            named domain definitions. It is a probable
            bacteriocin-like metabolite biosynthesis protein
            [Cellular processes, Toxin production and resistance].
          Length = 1266

 Score = 30.6 bits (69), Expect = 3.2
 Identities = 26/107 (24%), Positives = 41/107 (38%), Gaps = 7/107 (6%)

Query: 47   AAIQAGTDDLRTLSAIYSQLGNAYFYL------GDYQKAMQYHKQDLTLARNMEDKLGEA 100
            A  +A  D LR           A          G Y  A+    + L LA + +++   A
Sbjct: 1157 AGARADIDALRQSGCGLGLTYYASLLAELDAERGRYGAALAIIDECLALAESPDERYYLA 1216

Query: 101  KSSGNLGNTLKVMG-KYDEAMVCCKRHLEISRQLNDKLSEGRALYNL 146
            +     G  L+  G +   A  C +R +E++RQ   K  E +A   L
Sbjct: 1217 ELLLKKGRCLRAAGERAVAAEACLRRAVEVARQQGMKRIERKAAAAL 1263


>gnl|CDD|221855 pfam12921, ATP13, Mitochondrial ATPase expression.  ATP13 is
          necessary for the expression of subunit 9 of
          mitochondrial ATPase. The protein has a basic amino
          terminal signal sequence that is cleaved upon import
          into mitochondria.
          Length = 125

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 9/28 (32%), Positives = 11/28 (39%), Gaps = 4/28 (14%)

Query: 55 DLRTLSAIYSQLGNAYFYLGDYQKAMQY 82
            R L AI S    +Y Y  D   A+  
Sbjct: 50 SSRLLKAIVS----SYGYNNDISSALSI 73


>gnl|CDD|220735 pfam10408, Ufd2P_core, Ubiquitin elongating factor core.  This is
           the most conserved part of the core region of Ufd2P
           ubiquitin elongating factor or E4, running from helix
           alpha-11 to alpha-38. It consists of 31 helices of
           variable length connected by loops of variable size
           forming a compact unit; the helical packing pattern of
           the compact unit consists of five structural repeats
           that resemble tandem Armadillo (ARM) repeats. This
           domain is involved in ubiquitination as it binds Cdc48p
           and escorts ubiquitinated proteins from Cdc48p to the
           proteasome for degradation. The core is structurally
           similar to the nuclear transporter protein
           importin-alpha. The core is associated with the U-box at
           the C-terminus, pfam04564, which has ligase activity.
          Length = 625

 Score = 29.9 bits (68), Expect = 4.3
 Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 66  LGNAYFYLGDYQKAMQYHKQDLTLARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCK 124
           L     +LG     ++ +K+ L   + ++D+L E + S +    LK + K  E +   K
Sbjct: 174 LTLRALHLG-IGPLIEKYKRLLRELKRLQDELEELEQSRSNWAQLKRLEKQLEILSAEK 231


>gnl|CDD|219814 pfam08385, DHC_N1, Dynein heavy chain, N-terminal region 1.  Dynein
           heavy chains interact with other heavy chains to form
           dimers, and with intermediate chain-light chain
           complexes to form a basal cargo binding unit. The region
           featured in this family includes the sequences
           implicated in mediating these interactions. It is
           thought to be flexible and not to adopt a rigid
           conformation.
          Length = 577

 Score = 29.6 bits (67), Expect = 5.3
 Identities = 21/106 (19%), Positives = 36/106 (33%), Gaps = 24/106 (22%)

Query: 170 EEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACGNLGNTYYLLGNFKQAI--------- 220
             +K  L EA + Y + L  ++ ++D      +  +L      L      +         
Sbjct: 85  RGLKDALNEA-QDYNKFLKDLEPLDDL----LSATDLQKIREALPPIFNHLRLKWSISKY 139

Query: 221 YYHQERL-----KIAREFGDKAAERRANSNLGNSHIFLGEYQAASE 261
           YY  ER      KI+ +  D+       S L  S +   +Y    E
Sbjct: 140 YYPLERALVLLEKISNDLIDQL-----KSYLSPSDLMSLDYDEFLE 180


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 29.7 bits (67), Expect = 5.6
 Identities = 66/393 (16%), Positives = 126/393 (32%), Gaps = 110/393 (27%)

Query: 34  KAGDCRAGVAFFQAAIQAGTDDLRT---LSAIYSQLGNAYFYLGDYQKAMQYHKQDLT-- 88
           + G+    +  F+  +     +LR    L+ +Y + GN    +   +KA + + Q++   
Sbjct: 511 QEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPA 570

Query: 89  --LARNMEDKLGEAKSSGNLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKLSEGRALYNL 146
             LA+            G L   L ++   +EA        E    L      GRA    
Sbjct: 571 LALAQYYL-------GKGQLKKALAIL---NEAADAAPDSPEAWLML------GRAQLAA 614

Query: 147 GNVYHAKGKAIGKVGQQDPGEYPEEVKVC--------LQEAIKYYEQNLAL--------- 189
           G++  A   +  K+    P      + +           +AI   ++ L L         
Sbjct: 615 GDLNKAV-SSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQI 673

Query: 190 ---------------------MKEINDTAAQGRACGNLGNTYYLLGNFKQAIYYHQERLK 228
                                +++ +  AA G      G+ Y    ++  AI  +++ LK
Sbjct: 674 GLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFEL--EGDLYLRQKDYPAAIQAYRKALK 731

Query: 229 IA-------------REFGDKA-AERR-------------ANSNLGNSHIFLGEYQAASE 261
            A                G+ A A +                + L   ++   +Y  A +
Sbjct: 732 RAPSSQNAIKLHRALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIK 791

Query: 262 HYKRTLVLAQDLGDRAVEAQACYSLGNTYTLLRDYPTAIDYHLRHLIIA---QQLMDRVG 318
           HY+  +  A D  +  V     +            P A++Y  R L +A     ++D   
Sbjct: 792 HYQTVVKKAPD--NAVVLNNLAWLYLE-----LKDPRALEYAERALKLAPNIPAILD--- 841

Query: 319 EGRACWSLGNAHAARGNHEKALYFATKHLEISK 351
                 +LG     +G  ++AL    K + I+ 
Sbjct: 842 ------TLGWLLVEKGEADRALPLLRKAVNIAP 868


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 29.7 bits (67), Expect = 5.7
 Identities = 13/91 (14%), Positives = 21/91 (23%), Gaps = 15/91 (16%)

Query: 205 NLGNTYYLLGNFKQAIYYHQERLK-----------IAREFGDKAAERRANSNLGNSHIFL 253
           NL       G  ++AI      L            +A+ + +      A       +   
Sbjct: 379 NLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARAEGYALA 438

Query: 254 GEYQAASEHYKRTLVLAQDLGDRAVEAQACY 284
           G  + A     R    A           A  
Sbjct: 439 GRLEQAIIFLMR----ASQQVKLGFPDWARA 465


>gnl|CDD|176247 cd08287, FDH_like_ADH3, formaldehyde dehydrogenase (FDH)-like.
           This group contains proteins identified as alcohol
           dehydrogenases and glutathione-dependant formaldehyde
           dehydrogenases (FDH) of the zinc-dependent/medium chain
           alcohol dehydrogenase family.  The MDR family uses
           NAD(H) as a cofactor in the interconversion of alcohols
           and aldehydes, or ketones.  FDH converts formaldehyde
           and NAD to formate and NADH. The initial step in this
           process the spontaneous formation of a
           S-(hydroxymethyl)glutathione adduct from formaldehyde
           and glutathione, followed by FDH-mediated oxidation (and
           detoxification) of the adduct to S-formylglutathione.
           The medium chain alcohol dehydrogenase family (MDR) has
           a NAD(P)(H)-binding domain in a Rossmann fold of a
           beta-alpha form. The N-terminal region typically has an
           all-beta catalytic domain. These proteins typically form
           dimers (typically higher plants, mammals) or tetramers
           (yeast, bacteria), and have 2 tightly bound zinc atoms
           per subunit.
          Length = 345

 Score = 29.2 bits (66), Expect = 6.7
 Identities = 12/24 (50%), Positives = 16/24 (66%), Gaps = 2/24 (8%)

Query: 219 AIYYHQERLKIAREFG--DKAAER 240
           A+  H++R  +AREFG  D  AER
Sbjct: 198 AMSRHEDRQALAREFGATDIVAER 221


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 29.2 bits (65), Expect = 7.1
 Identities = 15/90 (16%), Positives = 36/90 (40%), Gaps = 1/90 (1%)

Query: 413 EQLDLIKLTPENKGKQSGSSNSSELSKCDEEESF-FELLSRFQSERMDDQRCSLSARQSE 471
           +Q+   +     K +     N  E  K  + ++F  +  S+   +  +D+      ++  
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREP 332

Query: 472 NKENLQRITNTKKSQPSAPRATTIPAHTPG 501
             E+LQ+     ++QP++     I +  P 
Sbjct: 333 VAEDLQKTKPQVEAQPTSLNEDAIDSSNPV 362


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 25.6 bits (57), Expect = 8.2
 Identities = 8/38 (21%), Positives = 17/38 (44%), Gaps = 4/38 (10%)

Query: 44 FFQAAIQAGTDDLRTLSAIYSQLGNAYFYLGDYQKAMQ 81
           ++ A++   ++    +  Y  L      LG Y +A+Q
Sbjct: 1  LYEKALELDPNN----AEAYYNLALLLLNLGQYDEALQ 34


>gnl|CDD|193336 pfam12862, Apc5, Anaphase-promoting complex subunit 5.  Apc5 is a
           subunit of the anaphase-promoting complex/cyclosome
           (APC/C) which is a multisubunit ubiquitin ligase that
           mediates the proteolysis of cell cycle proteins in
           mitosis and G1. Apc5, although it does not harbour a
           classical RNA binding domain, Apc5 binds the poly(A)
           binding protein (PABP), which directly binds the
           internal ribosome entry site (IRES) of growth factor 2
           mRNA. PABP was found to enhance IRES-mediated
           translation, whereas Apc5 over-expression counteracted
           this effect. In addition to its association with the
           APC/C complex, Apc5 binds much heavier complexes and
           co-sediments with the ribosomal fraction. The N-terminus
           of Afi1 serves to stabilise the union between Apc4 and
           Apc5, both of which lie towards the bottom-front of the
           APC. This region of the Apc5 member proteins carries a
           TPR-like motif.
          Length = 91

 Score = 27.2 bits (61), Expect = 8.2
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 202 ACGNLGNTYYLLGNFKQAIYYHQERLKIAREFGDKA 237
           A  NL + +   G+ ++AI   +E +++ARE  D A
Sbjct: 40  ALLNLASLHADFGHNEEAIAAIEEAIRLARENKDTA 75


>gnl|CDD|187766 cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1. 
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas1 is the most universal CRISPR system protein thought
           to be involved in spacer integration; Cas1 is
           metal-dependent deoxyribonuclease, also binds RNA; Shown
           to possess a unique fold consisting of a N-terminal
           beta-strand domain and a C-terminal alpha-helical
           domain.
          Length = 317

 Score = 28.7 bits (65), Expect = 8.7
 Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 5/71 (7%)

Query: 217 KQAIYY--HQERLKIAREF-GDKAAERRAN-SNLGNSHIFLGEYQAASEHYKRTLVLAQD 272
            Q   Y   ++RL++ARE    K   +R            L    A  E     L  A+ 
Sbjct: 89  AQYEAYLDPEKRLELAREIVRGKIRNQRRVLKRYARDGKELLLALAELEELLEKLDKAKS 148

Query: 273 LGD-RAVEAQA 282
           + + R +E  A
Sbjct: 149 IEELRGIEGNA 159


>gnl|CDD|130680 TIGR01619, hyp_HI0040, TIGR01619 family protein.  This model
           represents a hypothetical equivalog of gamma
           proteobacteria, includes HI0040. These sequences do not
           have any similarity to known proteins by PSI-BLAST.
          Length = 249

 Score = 28.4 bits (63), Expect = 9.0
 Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 524 LPHLADRENAIPDDGFLDMLIRCQGARLEDQRSHLP 559
             + AD ENA+PD+   D LI  Q  ++  Q S LP
Sbjct: 42  FDYAADDENALPDESEYDKLIN-QIFKILTQLSALP 76


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 28.5 bits (64), Expect = 9.3
 Identities = 48/248 (19%), Positives = 75/248 (30%), Gaps = 76/248 (30%)

Query: 55  DLRTLSAIYSQLGNAYFYLGDYQKAMQYHKQDLTLARNMED----------KLGEAKSSG 104
           D    +    QL   Y   GDY +A +  ++ L    +             KLGE   + 
Sbjct: 30  DRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLAD 89

Query: 105 ------------------NLGNTLKVMGKYDEAMVCCKRHLEISRQLNDKL--SEGRALY 144
                             N G  L   G+ +EAM       +  R L D         L 
Sbjct: 90  ESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQ------QFERALADPAYGEPSDTLE 143

Query: 145 NLGNVYHAKGKAIGKVGQQDPGEYPEEVKVCLQEAIKYYEQNLALMKEINDTAAQGRACG 204
           NLG      G+                      +A +Y ++ L L  +      +     
Sbjct: 144 NLGLCALKAGQ--------------------FDQAEEYLKRALELDPQFPPALLE----- 178

Query: 205 NLGNTYYLLGNFKQAIYYHQERL--------------KIAREFGDKAAERRANSNLGNSH 250
            L   +Y  G++  A  Y +                 +IA+  GD+AA +R  + L    
Sbjct: 179 -LARLHYKAGDYAPARLYLERYQQRGGAQAESLLLGIRIAKRLGDRAAAQRYQAQLQRLF 237

Query: 251 IFLGEYQA 258
            +  EYQ 
Sbjct: 238 PYSEEYQT 245


>gnl|CDD|237959 PRK15404, PRK15404, leucine ABC transporter subunit
           substrate-binding protein LivK; Provisional.
          Length = 369

 Score = 28.8 bits (65), Expect = 9.3
 Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 7/34 (20%)

Query: 561 EGSTVPDEDFFSLITRLQSGRMEDQRATVPSVYF 594
           EG T  D+DF +LI +L+          V  VY+
Sbjct: 196 EGITAGDKDFSALIAKLKK-------ENVDFVYY 222


>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
           [Carbohydrate transport and metabolism].
          Length = 389

 Score = 28.9 bits (65), Expect = 9.5
 Identities = 21/63 (33%), Positives = 28/63 (44%), Gaps = 10/63 (15%)

Query: 230 AREFGDKAAER-----RANSNLGNSHIFLGEYQAASEHYKRTLVLAQD---LGDRAVEAQ 281
           ARE   KA +      RA+  LG   +  G+YQ A E  +R  VL Q+   L +      
Sbjct: 199 ARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALER--VLEQNPEYLSEVLEMLY 256

Query: 282 ACY 284
            CY
Sbjct: 257 ECY 259


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.132    0.379 

Gapped
Lambda     K      H
   0.267   0.0834    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,507,517
Number of extensions: 2989358
Number of successful extensions: 2737
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2666
Number of HSP's successfully gapped: 106
Length of query: 609
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 506
Effective length of database: 6,369,140
Effective search space: 3222784840
Effective search space used: 3222784840
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.5 bits)