RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12594
         (542 letters)



>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score = 55.8 bits (135), Expect = 1e-09
 Identities = 28/119 (23%), Positives = 56/119 (47%), Gaps = 1/119 (0%)

Query: 161 ERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNN 220
           E ++Q G LP LV +L+S   +++  AA+AL  +   + +  ++  +   +P L++ L +
Sbjct: 1   EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60

Query: 221 DEVNVKADAFLCLADIAKDSPSLASTIAD-GATTLAVEHLTYPHGLVQKNAATLLRDMC 278
           ++  V   A   L ++A         + + G     V  L   +  +QKNA   L ++ 
Sbjct: 61  EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119



 Score = 55.4 bits (134), Expect = 2e-09
 Identities = 30/117 (25%), Positives = 58/117 (49%)

Query: 121 LIEDKDLDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQ 180
           +I+   L +++S L   D + +  AA AL  L+  N    + +V+ G LP LVQ+L S  
Sbjct: 3   VIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSED 62

Query: 181 LSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADIA 237
             +  AA +AL  +     +      +   +P L+  L++   +++ +A   L+++A
Sbjct: 63  EEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119



 Score = 44.6 bits (106), Expect = 1e-05
 Identities = 29/120 (24%), Positives = 52/120 (43%), Gaps = 6/120 (5%)

Query: 287 LVVLAGGLDGLLRLIDCNGGYNSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCP 346
            V+ AGGL  L+ L+  +       A  AL  +SA +    + ++E  G+ +L  +L+  
Sbjct: 2   AVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSE 61

Query: 347 HKEQLQVAAAWTLGQIGKHSVDNVTALCEENVLSTLLSKITQTCVSDNLKSKCKNALNIL 406
             E++  AA W L  +     DN   + E   +  L++      +  + +   KNA   L
Sbjct: 62  D-EEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVN-----LLDSSNEDIQKNATGAL 115



 Score = 43.8 bits (104), Expect = 2e-05
 Identities = 22/119 (18%), Positives = 50/119 (42%), Gaps = 1/119 (0%)

Query: 78  QEVLERNLVALVLKDLTKQN-KHYKKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHD 136
           + V++   +  ++  L+  +    ++ A  +    A  ++    ++E   L +++  L  
Sbjct: 1   EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60

Query: 137 FDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIV 195
            D +    A  AL  LA   +     +++ G +P LV +L S    ++  A  AL  + 
Sbjct: 61  EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119



 Score = 32.7 bits (75), Expect = 0.14
 Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 3/102 (2%)

Query: 4   KSVKNSIDHYQRARLCFVQTMSAFADKPSTIHFLKDLNVVDLIVPLLADKVPSIQHAALA 63
             + +S ++ QR     +  +SA  +    I  + +   +  +V LL  +   +  AAL 
Sbjct: 14  SLLSSSDENVQREAAWALSNLSAGNNDN--IQAVVEAGGLPALVQLLKSEDEEVVKAALW 71

Query: 64  TLGKLMCAKHVDIRQEVLERNLVALVLKDLTKQNKHYKKTAL 105
            L  L  A   D +  VLE   V  ++  L   N+  +K A 
Sbjct: 72  ALRNL-AAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNAT 112


>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
           trafficking and secretion].
          Length = 526

 Score = 56.1 bits (135), Expect = 3e-08
 Identities = 52/237 (21%), Positives = 99/237 (41%), Gaps = 5/237 (2%)

Query: 152 LAKDNKHFAERIVQCGALPPLVQILASPQLS-LKHAAAFALDGIVKHSPELAESASDCDV 210
           L+K+     + ++  G +P  V+ +   Q   L+  AA+AL  I   + +  +   D   
Sbjct: 99  LSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDAGA 158

Query: 211 IPLLIKALNNDEVNVKADAFLCLADIAKDSPSLASTIAD-GA--TTLAVEHLTYPHGLVQ 267
           +PL I+ L++ E +V+  A   L +IA DS      +   GA    L +   +  H  + 
Sbjct: 159 VPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSAIHISML 218

Query: 268 KNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCNGGYNSLPAIMALGYMSAQSEQIA 327
           +NA   L ++C+      +   ++  L  L +LI        + A  A+ Y+S    +  
Sbjct: 219 RNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAISYLSDGPNEKI 278

Query: 328 EQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQIGKHSVDNVTALCEENVLSTLLS 384
           + ++++   G L  +L      ++Q  A  ++G I   S D    +     L    S
Sbjct: 279 QAVLDVGIPGRLVELLSHE-SAKIQTPALRSVGNIVTGSDDQTQVIINCGALKAFRS 334



 Score = 51.0 bits (122), Expect = 1e-06
 Identities = 51/260 (19%), Positives = 105/260 (40%), Gaps = 46/260 (17%)

Query: 131 ISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILAS--PQLSLKHAAA 188
           I  L   + D R  A  ALG +A D++   + ++QCGAL PL+ +L S    +S+   A 
Sbjct: 163 IQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSAIHISMLRNAT 222

Query: 189 FALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADIAKDSPSLASTIA 248
           + L  + +      + ++    +P+L K + + +  V  DA   ++ ++         + 
Sbjct: 223 WTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAISYLSDGPNEKIQAVL 282

Query: 249 D-GATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCNGGY 307
           D G     VE L++    +Q                                        
Sbjct: 283 DVGIPGRLVELLSHESAKIQT--------------------------------------- 303

Query: 308 NSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQIGKHSV 367
              PA+ ++G +   S+   + II    + + +++L  P KE ++  A WT+  I   + 
Sbjct: 304 ---PALRSVGNIVTGSDDQTQVIINCGALKAFRSLLSSP-KENIRKEACWTISNITAGNT 359

Query: 368 DNVTALCEENVLSTLLSKIT 387
           + + A+ + N++  L+  ++
Sbjct: 360 EQIQAVIDANLIPPLIHLLS 379



 Score = 42.2 bits (99), Expect = 6e-04
 Identities = 20/67 (29%), Positives = 38/67 (56%)

Query: 163 IVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDE 222
           I+ CGAL     +L+SP+ +++  A + +  I   + E  ++  D ++IP LI  L++ E
Sbjct: 323 IINCGALKAFRSLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAE 382

Query: 223 VNVKADA 229
             +K +A
Sbjct: 383 YKIKKEA 389



 Score = 35.3 bits (81), Expect = 0.087
 Identities = 50/282 (17%), Positives = 108/282 (38%), Gaps = 20/282 (7%)

Query: 35  HFLKDLNVVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALVLKDLT 94
             + D   V L + LL+     ++  A+  LG +        R  VL+   +  +L  L 
Sbjct: 151 KVVVDAGAVPLFIQLLSSTEDDVREQAVWALGNI-AGDSEGCRDYVLQCGALEPLLGLLL 209

Query: 95  KQNKHYK--KTALHMLTCVAQQDERCAGLIEDKDLDSIISSL-------HDFDPDTRANA 145
               H    + A   L+ +      C G     D  +I  +L       +  DP+   +A
Sbjct: 210 SSAIHISMLRNATWTLSNL------CRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDA 263

Query: 146 ALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESA 205
             A+  L+       + ++  G    LV++L+     ++  A  ++  IV  S +  +  
Sbjct: 264 CWAISYLSDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALRSVGNIVTGSDDQTQVI 323

Query: 206 SDCDVIPLLIKALNNDEVNVKADAFLCLADI-AKDSPSLASTIADGATTLAVEHLTYPHG 264
            +C  +      L++ + N++ +A   +++I A ++  + + I        +  L+    
Sbjct: 324 INCGALKAFRSLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEY 383

Query: 265 LVQKNAATLLRDMCKHNLRLANLV---VLAGGLDGLLRLIDC 303
            ++K A   + +     L   +++   V  G +  L  L+D 
Sbjct: 384 KIKKEACWAISNATSGGLNRPDIIRYLVSQGFIKPLCDLLDV 425


>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
           Provisional.
          Length = 2102

 Score = 46.2 bits (110), Expect = 5e-05
 Identities = 50/205 (24%), Positives = 87/205 (42%), Gaps = 12/205 (5%)

Query: 49  LLADKVPSIQHAALATLGKLM-CAKHVDIRQEVLERNLVALVLKDLTKQNKHYKKTALHM 107
           LL +  P  Q  A  TL KL+  A    I Q      L AL+L DL +   H      H+
Sbjct: 538 LLKNGGPKGQEIAAKTLTKLVRTADAATISQ------LTALLLGDLPESKVHVLDVLGHV 591

Query: 108 LTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCG 167
           L+  + +D    G   +  L ++I  L     +T+  AA  L  +    +   E +    
Sbjct: 592 LSVASLEDLVREGSAANDALRTLIQLLSSSKEETQEKAASVLADIFSSRQDLCESLATDE 651

Query: 168 ALPPLVQILASPQLSLKHAAAFALDGI---VKHSPELAESASDCDVIPLLIKALNNDEVN 224
            + P +++L +   ++   +A AL  +   +K + +++ +A   D I  LIK   +  + 
Sbjct: 652 IINPCIKLLTNNTEAVATQSARALAALSRSIKENRKVSYAAE--DAIKPLIKLAKSSSIE 709

Query: 225 VKADAFLCLADIAKDSPSLASTIAD 249
           V   A   LA++  D    A  +A+
Sbjct: 710 VAEQAVCALANLLSDPEVAAEALAE 734



 Score = 43.2 bits (102), Expect = 4e-04
 Identities = 27/97 (27%), Positives = 50/97 (51%), Gaps = 1/97 (1%)

Query: 122 IEDKDLDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASP-Q 180
           +E   +D ++  L   + D +ANAA  L  L    +    +++  GA+  L+++L    +
Sbjct: 186 LEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSISKVLDAGAVKQLLKLLGQGNE 245

Query: 181 LSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKA 217
           +S++  AA AL+ +   S E  ++ +D   IP LI A
Sbjct: 246 VSVRAEAAGALEALSSQSKEAKQAIADAGGIPALINA 282



 Score = 37.4 bits (87), Expect = 0.022
 Identities = 63/250 (25%), Positives = 102/250 (40%), Gaps = 49/250 (19%)

Query: 131 ISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAA-- 188
           +S L       + NAA  LG L K+      +++  G +PPL+ +L S     + AAA  
Sbjct: 64  VSLLRSGTLGAKVNAAAVLGVLCKEED-LRVKVLLGGCIPPLLSLLKSGSAEAQKAAAEA 122

Query: 189 -----------------FALDGIVKHSPELAE-----SASDCDVIPLLIKALNNDEVNVK 226
                            F+ +G+V   P L +     +  D  V  LL  AL N      
Sbjct: 123 IYAVSSGGLSDHVGSKIFSTEGVV---PSLWDQLQPGNKQDKVVEGLLTGALRN------ 173

Query: 227 ADAFLCLADIAKDSPSLASTIADGATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLAN 286
               LC           ++T+  G   + V+ L+  +   Q NAA+LL  +        +
Sbjct: 174 ----LC----GSTDGFWSATLEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSIS 225

Query: 287 LVVLAGGLDGLLRLIDCNGGYNSLP----AIMALGYMSAQSEQIAEQIIELQGVGSLQTI 342
            V+ AG +  LL+L+   G  N +     A  AL  +S+QS++  + I +  G+ +L   
Sbjct: 226 KVLDAGAVKQLLKLL---GQGNEVSVRAEAAGALEALSSQSKEAKQAIADAGGIPALINA 282

Query: 343 LQCPHKEQLQ 352
              P KE +Q
Sbjct: 283 TVAPSKEFMQ 292



 Score = 30.1 bits (68), Expect = 3.9
 Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 2/85 (2%)

Query: 163 IVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDE 222
           I   G +PPLVQ+L +     K  +A  L  +  HS ++         +P L+  L N  
Sbjct: 484 ITAAGGIPPLVQLLETGSQKAKEDSATVLWNLCCHSEDIRACVESAGAVPALLWLLKNGG 543

Query: 223 VNVKADAFLCLADIAKDSPSLASTI 247
              +  A   L  + + + +  +TI
Sbjct: 544 PKGQEIAAKTLTKLVRTADA--ATI 566



 Score = 30.1 bits (68), Expect = 4.1
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 127  LDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHA 186
            ++ +++ L       R +AA AL  L    +H  +  +   A+ PLV++L +   S +HA
Sbjct: 1234 VNQLVAVLRLGSRSARYSAARALQELF-SAEHIRDSELARQAVQPLVEMLNTGSESEQHA 1292

Query: 187  AAFAL 191
            A  AL
Sbjct: 1293 AIGAL 1297


>gnl|CDD|224331 COG1413, COG1413, FOG: HEAT repeat [Energy production and
           conversion].
          Length = 335

 Score = 43.4 bits (102), Expect = 2e-04
 Identities = 57/236 (24%), Positives = 90/236 (38%), Gaps = 33/236 (13%)

Query: 37  LKDLNVVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALV------- 89
           L     V L+  LL+D+ P ++ AA   LG+L   + V    E+LE +    V       
Sbjct: 70  LGSEEAVPLLRELLSDEDPRVRDAAADALGELGDPEAVPPLVELLENDENEGVRAAAARA 129

Query: 90  ------------LKDLTKQNKHYKKTALHMLTCVAQQDERCAGLIEDKDLDSI---ISSL 134
                       L +  +        A      +  +      L E  D ++I   I  L
Sbjct: 130 LGKLGDERALDPLLEALQDEDSGSAAAALDAALLDVRAAAAEALGELGDPEAIPLLIELL 189

Query: 135 HDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALD-- 192
            D D D R  AA ALG L  +N   A+          LV+ L+   L ++ AA  AL   
Sbjct: 190 EDEDADVRRAAASALGQLGSENVEAADL---------LVKALSDESLEVRKAALLALGEI 240

Query: 193 GIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADIAKDSPSLASTIA 248
           G  +    LA++  D DVI  L+ A     +++   A   L  +  ++ ++    A
Sbjct: 241 GDEEAVDALAKALEDEDVILALLAAAALGALDLAEAALPLLLLLIDEANAVRLEAA 296



 Score = 37.2 bits (86), Expect = 0.016
 Identities = 63/289 (21%), Positives = 108/289 (37%), Gaps = 54/289 (18%)

Query: 92  DLTKQNKHYKKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRANAALALGG 151
            +          AL  L  +       A     +  D ++  L D D   R +AA+ALG 
Sbjct: 14  LVLLSYGSEAAAALQALAELDDLILELA----PEAADELLKLLEDEDLLVRLSAAVALGE 69

Query: 152 LAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVI 211
           L       +E      A+P L ++L+     ++ AAA AL G +           D + +
Sbjct: 70  LG------SEE-----AVPLLRELLSDEDPRVRDAAADAL-GELG----------DPEAV 107

Query: 212 PLLIKALNNDE-VNVKADAFLCLADIAKDSPSLASTIA---DGATTLAVEHLTYPHGLVQ 267
           P L++ L NDE   V+A A   L  +  D  +L   +    D  +  A   L      V+
Sbjct: 108 PPLVELLENDENEGVRAAAARALGKL-GDERALDPLLEALQDEDSGSAAAALDAALLDVR 166

Query: 268 KNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCNGGYNSLPAIMALGYMSAQSEQIA 327
             AA  L ++   +     L  L   L+     +          A  ALG + +++ + A
Sbjct: 167 AAAAEALGEL--GDPEAIPL--LIELLEDEDADVRR-------AAASALGQLGSENVEAA 215

Query: 328 EQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQIGKHSVDNVTALCEE 376
           + +++                 +++ AA   LG+IG    + V AL + 
Sbjct: 216 DLLVKALS----------DESLEVRKAALLALGEIG--DEEAVDALAKA 252



 Score = 32.6 bits (74), Expect = 0.44
 Identities = 48/205 (23%), Positives = 82/205 (40%), Gaps = 35/205 (17%)

Query: 42  VVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALVLKDLTKQNKHYK 101
             D ++ LL D+   ++ +A   LG+L             E   V L+ + L+ ++   +
Sbjct: 44  AADELLKLLEDEDLLVRLSAAVALGEL----------GSEEA--VPLLRELLSDEDPRVR 91

Query: 102 KTALHMLTCVAQQDERCAGLIEDKDLDSIISSL-HDFDPDTRANAALALGGLAKDNKHFA 160
             A   L            L + + +  ++  L +D +   RA AA ALG L        
Sbjct: 92  DAAADAL----------GELGDPEAVPPLVELLENDENEGVRAAAARALGKL------GD 135

Query: 161 ERIVQCGALPPLVQILASP-QLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALN 219
           ER     AL PL++ L      S   A   AL  +   + E      D + IPLLI+ L 
Sbjct: 136 ER-----ALDPLLEALQDEDSGSAAAALDAALLDVRAAAAEALGELGDPEAIPLLIELLE 190

Query: 220 NDEVNVKADAFLCLADIAKDSPSLA 244
           +++ +V+  A   L  +  ++   A
Sbjct: 191 DEDADVRRAAASALGQLGSENVEAA 215


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
           HEAT repeats.
          Length = 88

 Score = 36.5 bits (85), Expect = 0.003
 Identities = 29/92 (31%), Positives = 42/92 (45%), Gaps = 22/92 (23%)

Query: 134 LHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDG 193
           L D DP+ RA AA ALG L               ALP L+++L  P   ++ AAA AL G
Sbjct: 9   LSDPDPEVRAAAARALGELGDPE-----------ALPALLELLKDPDPEVRRAAAEAL-G 56

Query: 194 IVKHSPELAESASDCDVIPLLIKALNNDEVNV 225
            +             + +P L++ L +D+  V
Sbjct: 57  KLGDP----------EALPALLELLQDDDDAV 78



 Score = 31.2 bits (71), Expect = 0.23
 Identities = 25/77 (32%), Positives = 36/77 (46%), Gaps = 13/77 (16%)

Query: 117 RCAGLIEDKD-LDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQI 175
           R  G + D + L +++  L D DP+ R  AA ALG L               ALP L+++
Sbjct: 22  RALGELGDPEALPALLELLKDPDPEVRRAAAEALGKLGDPE-----------ALPALLEL 70

Query: 176 LASPQ-LSLKHAAAFAL 191
           L       ++ AAA AL
Sbjct: 71  LQDDDDAVVRAAAASAL 87



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 37 LKDLNVVDLIVPLLADKVPSIQHAALATLGKL 68
          L D   +  ++ LL D  P ++ AA   LGKL
Sbjct: 27 LGDPEALPALLELLKDPDPEVRRAAAEALGKL 58



 Score = 27.3 bits (61), Expect = 5.8
 Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 12/69 (17%)

Query: 169 LPPLVQILA-SPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKA 227
           L  L++ L   P   ++ AAA AL           E     + +P L++ L + +  V+ 
Sbjct: 1   LEALLEALLSDPDPEVRAAAARAL----------GELGDP-EALPALLELLKDPDPEVRR 49

Query: 228 DAFLCLADI 236
            A   L  +
Sbjct: 50  AAAEALGKL 58


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
           related to armadillo/beta-catenin-like repeats (see
           pfam00514). These EZ repeats are found in subunits of
           cyanobacterial phycocyanin lyase and other proteins and
           probably carry out a scaffolding role.
          Length = 55

 Score = 34.7 bits (80), Expect = 0.008
 Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 139 PDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFAL 191
            + R  AALALG LA          V    LP L+ +L      ++ AAA+AL
Sbjct: 1   WEVREAAALALGALAGGGPELLRPAVP-ELLPALLPLLKDDDDEVREAAAWAL 52



 Score = 34.0 bits (78), Expect = 0.011
 Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 182 SLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADI 236
            ++ AAA AL  +    PEL   A   +++P L+  L +D+  V+  A   L  I
Sbjct: 2   EVREAAALALGALAGGGPELLRPAVP-ELLPALLPLLKDDDDEVREAAAWALGRI 55



 Score = 28.6 bits (64), Expect = 1.1
 Identities = 13/51 (25%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 312 AIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQI 362
           A +ALG ++    ++    +  + + +L  +L+    ++++ AAAW LG+I
Sbjct: 7   AALALGALAGGGPELLRPAVP-ELLPALLPLLKDDD-DEVREAAAWALGRI 55



 Score = 28.2 bits (63), Expect = 1.5
 Identities = 13/50 (26%), Positives = 21/50 (42%)

Query: 101 KKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRANAALALG 150
           ++ A   L  +A            + L +++  L D D + R  AA ALG
Sbjct: 4   REAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALG 53


>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat.  Approx. 40
           amino acid repeat. Tandem repeats form super-helix of
           helices that is proposed to mediate interaction of
           beta-catenin with its ligands. CAUTION: This family does
           not contain all known armadillo repeats.
          Length = 41

 Score = 31.7 bits (73), Expect = 0.054
 Identities = 14/36 (38%), Positives = 25/36 (69%)

Query: 156 NKHFAERIVQCGALPPLVQILASPQLSLKHAAAFAL 191
           +    + +++ GA+PPLVQ+L+SP   ++  AA+AL
Sbjct: 1   SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWAL 36



 Score = 28.2 bits (64), Expect = 0.86
 Identities = 9/38 (23%), Positives = 14/38 (36%)

Query: 31 PSTIHFLKDLNVVDLIVPLLADKVPSIQHAALATLGKL 68
          P     + +   V  +V LL+     +Q  A   L  L
Sbjct: 2  PENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNL 39


>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats.  Approx. 40
           amino acid repeat. Tandem repeats form superhelix of
           helices that is proposed to mediate interaction of
           beta-catenin with its ligands. Involved in transducing
           the Wingless/Wnt signal. In plakoglobin arm repeats bind
           alpha-catenin and N-cadherin.
          Length = 41

 Score = 29.3 bits (67), Expect = 0.34
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 163 IVQCGALPPLVQILASPQLSLKHAAAFAL 191
           +V  G LP LV++L S    +   AA+AL
Sbjct: 8   VVDAGGLPALVELLKSEDEEVVKEAAWAL 36


>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
           protein; Provisional.
          Length = 897

 Score = 32.9 bits (75), Expect = 0.46
 Identities = 26/108 (24%), Positives = 42/108 (38%), Gaps = 21/108 (19%)

Query: 127 LDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHA 186
            D++ +   D DP  RA A  AL  L       A               L +    ++  
Sbjct: 777 GDAVRALTGDPDPLVRAAALAALAELGCPPDDVA----------AATAALRASAWQVRQG 826

Query: 187 AAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLA 234
           AA AL G    + ++A        +P L++AL +  ++V+  A L L 
Sbjct: 827 AARALAGA---AADVA--------VPALVEALTDPHLDVRKAAVLALT 863


>gnl|CDD|221393 pfam12044, Metallopep, Putative peptidase family.  This family of
           proteins is functionally uncharacterized. However, it
           does contain an HEXXH motif characteristic of
           metallopeptidases. This protein is found in bacteria and
           eukaryotes. Proteins in this family are typically
           between 625 to 773 amino acids in length.
          Length = 419

 Score = 32.3 bits (74), Expect = 0.68
 Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 4/60 (6%)

Query: 487 VSPHYLED--LINRLDQ--FNPQKSSNLPDCKDSGSGNPSGSAQSLQRSEASTTSTASIY 542
             P   +D    +RLD   F    S  LP    S   + S S   +       TS+A IY
Sbjct: 360 SRPILPKDECTWHRLDALRFRYHPSFRLPVDGLSAKSDDSPSVYPVGNGGVFITSSAGIY 419


>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region.  This family
           consists of the N terminal region of various alpha, beta
           and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
           protein complexes. The adaptor protein (AP) complexes
           are involved in the formation of clathrin-coated pits
           and vesicles. The N-terminal region of the various
           adaptor proteins (APs) is constant by comparison to the
           C-terminal which is variable within members of the AP-2
           family; and it has been proposed that this constant
           region interacts with another uniform component of the
           coated vesicles.
          Length = 522

 Score = 31.8 bits (73), Expect = 0.87
 Identities = 21/122 (17%), Positives = 46/122 (37%), Gaps = 4/122 (3%)

Query: 168 ALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKA 227
             P + ++L      ++  AA A+  + +  P+L        ++P L + L++ +  V +
Sbjct: 114 LAPDIKKLLVDRDPYVRKKAALAILKLYRKDPDLVRD----FLVPELKELLSDKDPGVVS 169

Query: 228 DAFLCLADIAKDSPSLASTIADGATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLANL 287
            A   L +I K+     + +           LT  +  +Q     LL      + R    
Sbjct: 170 AAVALLYEIRKNDRLYLNKLLPLLVRRLCNLLTVCNPWLQVKILRLLTRYAPQDPREPKE 229

Query: 288 VV 289
           ++
Sbjct: 230 LL 231


>gnl|CDD|170047 PRK09687, PRK09687, putative lyase; Provisional.
          Length = 280

 Score = 30.5 bits (69), Expect = 1.8
 Identities = 33/154 (21%), Positives = 49/154 (31%), Gaps = 21/154 (13%)

Query: 33  TIHFLKDLNVVDLIVPLLADKVPSIQHAALATLGKL-MCAKHVD----IRQEVLERNLVA 87
            +      +V  L + L + K P  +      L +L M  +  D    I   +   +  A
Sbjct: 46  VLQLRGGQDVFRLAIELCSSKNPIERDIGADILSQLGMAKRCQDNVFNILNNLALEDKSA 105

Query: 88  LVLKDL-------TKQNKHYKKTALHMLTCVA---QQDERCA---GLIEDKD---LDSII 131
            V            K+N  Y    +      A     + R A    L    D   +  +I
Sbjct: 106 CVRASAINATGHRCKKNPLYSPKIVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLI 165

Query: 132 SSLHDFDPDTRANAALALGGLAKDNKHFAERIVQ 165
           + L D + D R  AA AL     DN    E  V 
Sbjct: 166 NLLKDPNGDVRNWAAFALNSNKYDNPDIREAFVA 199


>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat.  The HEAT repeat family is related
          to armadillo/beta-catenin-like repeats (see pfam00514).
          Length = 31

 Score = 26.3 bits (59), Expect = 3.2
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 42 VVDLIVPLLADKVPSIQHAALATLGKLM 69
          ++ L++ LL D  P ++ AA   LG L 
Sbjct: 1  LLPLLLELLNDPDPEVREAAAEALGALA 28



 Score = 25.6 bits (57), Expect = 6.2
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 210 VIPLLIKALNNDEVNVKADAFLCLADIAK 238
           ++PLL++ LN+ +  V+  A   L  +A+
Sbjct: 1   LLPLLLELLNDPDPEVREAAAEALGALAE 29


>gnl|CDD|129779 TIGR00696, wecG_tagA_cpsF, bacterial polymer biosynthesis proteins,
           WecB/TagA/CpsF family.  The WecG member of this
           superfamily, believed to be
           UDP-N-acetyl-D-mannosaminuronic acid transferase, plays
           a role in enterobacterial common antigen (eca) synthesis
           in Escherichia coli. Another family member, the Bacillus
           subtilis TagA protein, is involved in the biosynthesis
           of the cell wall polymer poly(glycerol phosphate). The
           third family member, CpsF, CMP-N-acetylneuraminic acid
           synthetase has a role in the capsular polysaccharide
           biosynthesis pathway [Cell envelope, Biosynthesis and
           degradation of surface polysaccharides and
           lipopolysaccharides].
          Length = 177

 Score = 29.4 bits (66), Expect = 3.5
 Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 2/105 (1%)

Query: 247 IADGATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCN-- 304
             DG   +    L       +     L+ ++C+   +    + L GG   +L+ +     
Sbjct: 10  TPDGIGVVWGLKLLGYPQQSRVAGPDLMEELCQRAGKEKLPIFLYGGKPDVLQQLKVKLI 69

Query: 305 GGYNSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCPHKE 349
             Y  L  + A G +  +  + A   I   G G +   L CP +E
Sbjct: 70  KEYPKLKIVGAFGPLEPEERKAALAKIARSGAGIVFVGLGCPKQE 114


>gnl|CDD|218626 pfam05536, Neurochondrin, Neurochondrin.  This family contains
           several eukaryotic neurochondrin proteins. Neurochondrin
           induces hydroxyapatite resorptive activity in bone
           marrow cells resistant to bafilomycin A1, an inhibitor
           of macrophage- and osteoclast-mediated resorption.
           Expression of the gene is localised to chondrocyte,
           osteoblast, and osteocyte in the bone and to the
           hippocampus and Purkinje cell layer of cerebellum in the
           brain.
          Length = 542

 Score = 29.7 bits (67), Expect = 4.5
 Identities = 35/161 (21%), Positives = 60/161 (37%), Gaps = 23/161 (14%)

Query: 116 ERCAGLIEDKDLDSIISSL---------HDFDPDTRANAALALGGLAKDNKHFAERIVQC 166
           E+C  L+         ++L          D +  TR     A+G        F +R++  
Sbjct: 8   EKCLSLLRGARDTEKFAALLLVTKLVKAGDINSKTRRLIFEAVG------FKFLDRLLLT 61

Query: 167 GALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNN-DEVNV 225
           G+ P          L +   A F  +      PELA  +   + IP L++ L    ++++
Sbjct: 62  GSGPSGCPRHVYLDLGVTILACFCSE------PELARHSQVVNRIPTLLECLLKSSDLSM 115

Query: 226 KADAFLCLADIAKDSPSLASTIADGA-TTLAVEHLTYPHGL 265
             D + CL+ IA       + +A GA   L   +     GL
Sbjct: 116 VEDCYECLSAIAATPRGAKALLALGAVPALCQAYSNGSFGL 156


>gnl|CDD|128837 smart00567, EZ_HEAT, E-Z type HEAT repeats.  Present in subunits of
           cyanobacterial phycocyanin lyase, and other proteins.
           Probable scaffolding role.
          Length = 30

 Score = 25.8 bits (58), Expect = 4.7
 Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 11/41 (26%)

Query: 181 LSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNND 221
             ++H AAFAL G +             + +P LIKAL ++
Sbjct: 1   PLVRHEAAFAL-GQLGDE----------EAVPALIKALEDE 30


>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain.  This
           family is involved in biosynthesis of guanosine
           nucleotide. Members of this family contain a TIM barrel
           structure. In the inosine monophosphate dehydrogenases 2
           CBS domains pfam00571 are inserted in the TIM barrel.
           This family is a member of the common phosphate binding
           site TIM barrel family.
          Length = 467

 Score = 29.5 bits (67), Expect = 4.8
 Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 38  KDLNVVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALV 89
            D  +V LI     +K     +A+    G+L+    V  R++ LER   ALV
Sbjct: 182 DDGELVGLITRKDIEKARDYPNASKDAQGRLLVGAAVGTREDDLER-AEALV 232


>gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits
           [Intracellular trafficking and secretion].
          Length = 757

 Score = 29.7 bits (67), Expect = 4.8
 Identities = 26/124 (20%), Positives = 51/124 (41%), Gaps = 13/124 (10%)

Query: 116 ERCAGLIEDKDLDSIISSLHDF---DPDTRANAALALGGLAKDNKHFAERIVQCGALPPL 172
           ER A L  +  L ++ +   D    + + R  A   L  L +      E +     + P+
Sbjct: 80  ERYAKLKPELALLAVNTIQKDLQDPNEEIRGFALRTLSLL-RVK----ELLGN--IIDPI 132

Query: 173 VQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLC 232
            ++L  P   ++  AA A+  + +   +L       D   +L + + + +  V A+A   
Sbjct: 133 KKLLTDPHAYVRKTAALAVAKLYRLDKDLYHELGLID---ILKELVADSDPIVIANALAS 189

Query: 233 LADI 236
           LA+I
Sbjct: 190 LAEI 193



 Score = 29.3 bits (66), Expect = 5.9
 Identities = 29/119 (24%), Positives = 48/119 (40%), Gaps = 12/119 (10%)

Query: 85  LVALVLKDLTKQNKHYKKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRAN 144
            V  + KDL   N+  +  AL  L+ +  ++      +    +D I   L D     R  
Sbjct: 93  AVNTIQKDLQDPNEEIRGFALRTLSLLRVKE------LLGNIIDPIKKLLTDPHAYVRKT 146

Query: 145 AALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAE 203
           AALA+  L + +K       + G +  L +++A     +   A  +L  I    PELA 
Sbjct: 147 AALAVAKLYRLDKD---LYHELGLIDILKELVADSDPIVIANALASLAEID---PELAH 199


>gnl|CDD|236718 PRK10574, PRK10574, putative major pilin subunit; Provisional.
          Length = 146

 Score = 27.7 bits (62), Expect = 8.9
 Identities = 14/39 (35%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 302 DCNGGYNSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQ 340
            C+ G N +P+     Y+SA S  +A+ +I L G  SL 
Sbjct: 65  TCDAGSNGIPSPTTSRYVSAMS--VAKGVITLTGQESLN 101


>gnl|CDD|99876 cd06066, H2MP_NAD-link-bidir, Endopeptidases that belong to the
           bidirectional NAD-linked hydrogenase group. This group
           of endopeptidases are highly specific carboxyl-terminal
           protease (HoxW protease) which releases a 24-amino-acid
           peptide from HoxH prior to progression of subunit
           assembly. These bidirectional hydrogenases are
           heteropentamers encoded by the hox (hydrogen oxidation)
           genes, in which complex HoxEFU shows the diaphorase
           activity, and HoxYH constitutes the NiFe-hydrogenase.
          Length = 139

 Score = 27.6 bits (62), Expect = 9.0
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 158 HFAERIVQCGALPPLVQILASPQLSLKHAAAFA 190
             AERI +   L P V++LA  QL+ + A   A
Sbjct: 18  AVAERIEEW--LLPGVEVLAVHQLTPELAEDLA 48


>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
          Length = 307

 Score = 28.5 bits (64), Expect = 9.8
 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 4/46 (8%)

Query: 319 MSAQSEQIAEQ----IIELQGVGSLQTILQCPHKEQLQVAAAWTLG 360
           M+AQ+E +A Q    I  L G+G L  ++Q P    L  AA +T G
Sbjct: 117 MAAQTELMAAQRENGISMLGGIGCLPLLIQMPFFSALYFAAQYTKG 162


>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
          Length = 131

 Score = 27.6 bits (61), Expect = 10.0
 Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 1/35 (2%)

Query: 147 LALGGLAKDNKHFAERIVQCGALPPL-VQILASPQ 180
             L   AK++   A RI++   +  + VQ+ A PQ
Sbjct: 9   DELKDQAKESNKIASRILKIKGIKRIVVQLNAVPQ 43


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.132    0.382 

Gapped
Lambda     K      H
   0.267   0.0885    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,635,861
Number of extensions: 2614693
Number of successful extensions: 2346
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2318
Number of HSP's successfully gapped: 71
Length of query: 542
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 440
Effective length of database: 6,413,494
Effective search space: 2821937360
Effective search space used: 2821937360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.0 bits)