RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2110
         (622 letters)



>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  102 bits (253), Expect = 6e-23
 Identities = 90/392 (22%), Positives = 155/392 (39%), Gaps = 32/392 (8%)

Query: 146 TCAFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKR 205
           + AF+  G LL  G +DG I LWD             +H      L   S       +  
Sbjct: 70  SIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSS------PDGN 123

Query: 206 GTLLAVGCNDGRIVLWDFLTRG-IAKIISAHMHPVCSISWSRNGYKIISAST-DNNVCIW 263
             LLA    DG + LWD  T G + + +  H   V S+++S +G  + S S+ D  + +W
Sbjct: 124 SILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLW 183

Query: 264 DVLTGECEQKYRFPS-PILKVQFHPRNDHMFLVCPMRHAAVLVDTDGDHQIVPIDEDSDL 322
           D+ TG+        + P+  + F P    +           L D      +         
Sbjct: 184 DLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSD 243

Query: 323 SIVASFDRRGQHVVTGNSKGKVLVLSCPAL-ELKASFKASTTAIKSIEFARRGNCLLVNT 381
           S+V+SF   G  + +G+S G + +    +   L  +    ++++ S+ F+  G  L   +
Sbjct: 244 SVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGS 303

Query: 382 GDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCCFSGDGEYVCAGSARQHALY 441
            D  +R+++    L  GK      ++  +  V+        FS DG  + +G +    + 
Sbjct: 304 SDGTVRLWD----LETGKLLSSLTLKGHEGPVS-----SLSFSPDGSLLVSGGSDDGTIR 354

Query: 442 IWEKSVGNLVKILHGTKGELLLDVVWHPVRPIIASISS-GVVSLWAQNQVENWSAFAPDF 500
           +W+   G  +K L G     +L V + P   +++S S+ G V LW               
Sbjct: 355 LWDLRTGKPLKTLEG--HSNVLSVSFSPDGRVVSSGSTDGTVRLW----------DLSTG 402

Query: 501 KELDENVEYDERESEFDISDEDKSVAQGSDDQ 532
             L     +  R +  D S + KS+A GS D 
Sbjct: 403 SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDN 434



 Score = 78.6 bits (192), Expect = 2e-15
 Identities = 101/474 (21%), Positives = 169/474 (35%), Gaps = 99/474 (20%)

Query: 40  GNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCCFSGDG-EYVCA 98
           G  LL  + D  I++++         D   + I+ L+ L + ++ K    S DG   + A
Sbjct: 77  GELLLSGSSDGTIKLWD--------LDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLA 128

Query: 99  GSARQHALYIWEKS-VGNLVKILHGTKGELLLDVVEFDGALDCISLAVTCAFNKRGTLLA 157
            S+    + +W+ S  G L++ L G    +                    AF+  G LLA
Sbjct: 129 SSSLDGTVKLWDLSTPGKLIRTLEGHSESVTS-----------------LAFSPDGKLLA 171

Query: 158 VGCN-DGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGC-ND 215
            G + DG I LWD  T      ++ H                + AF+  G LL     +D
Sbjct: 172 SGSSLDGTIKLWDLRTGKPLSTLAGH-----------TDPVSSLAFSPDGGLLIASGSSD 220

Query: 216 GRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECEQKYR 275
           G I LWD  T  + +   +        S+S +G  + S S+D  + +WD+ +     +  
Sbjct: 221 GTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTL 280

Query: 276 FP--SPILKVQFHPRNDHMFLVCPMRHAAVLVDTDGDHQIVPIDEDSDLSIVASFDRRGQ 333
               S +L V F P                               D  L    S D    
Sbjct: 281 SGHSSSVLSVAFSP-------------------------------DGKLLASGSSDGT-- 307

Query: 334 HVVTGNSKGKVLVLSCPALELKASFKASTTAIKSIEFARRGNCLLVNTG-DRLIRVYNHQ 392
            V   + +   L+ S        + K     + S+ F+  G+ L+     D  IR+++ +
Sbjct: 308 -VRLWDLETGKLLSSL-------TLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLR 359

Query: 393 EVLNAGKDGDPEPIQKLQDLVNKTMWKKCCFSGDGEYVCAGSARQHALYIWEKSVGNLVK 452
                        +  +             FS DG  V +GS     + +W+ S G+L++
Sbjct: 360 TGKPLKTLEGHSNVLSVS------------FSPDGRVVSSGST-DGTVRLWDLSTGSLLR 406

Query: 453 ILHGTKGELLLDVVWHPVRPIIASISS-GVVSLWAQNQVENWSAFAPDFKELDE 505
            L G    +   + + P    +AS SS   + LW         +F+PD K L  
Sbjct: 407 NLDGHTSRVTS-LDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDGKVLAS 459



 Score = 62.8 bits (151), Expect = 2e-10
 Identities = 55/254 (21%), Positives = 103/254 (40%), Gaps = 43/254 (16%)

Query: 40  GNCLLVNTGDRLIRVYNHQEVLNAGK--DGDPEPIQKLQDLVNKTMWKKCCFSGDGEYVC 97
           G+ L   + D  IR+++ +   +  +   G    +                FS DG+ + 
Sbjct: 253 GSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSV------------LSVAFSPDGKLLA 300

Query: 98  AGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVTCAFNKRGTLLA 157
           +GS+    + +W+   G L+           L +   +G +  +S      F+  G+LL 
Sbjct: 301 SGSS-DGTVRLWDLETGKLLS---------SLTLKGHEGPVSSLS------FSPDGSLLV 344

Query: 158 VGC-NDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGCNDG 216
            G  +DG I LWD  T    K +  H            S  ++ +F+  G +++ G  DG
Sbjct: 345 SGGSDDGTIRLWDLRTGKPLKTLEGH------------SNVLSVSFSPDGRVVSSGSTDG 392

Query: 217 RIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECEQKYRF 276
            + LWD  T  + + +  H   V S+ +S +G  + S S+DN + +WD+ T      +  
Sbjct: 393 TVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSP 452

Query: 277 PSPILKVQFHPRND 290
              +L  +    + 
Sbjct: 453 DGKVLASKSSDLSV 466



 Score = 56.3 bits (134), Expect = 3e-08
 Identities = 66/350 (18%), Positives = 133/350 (38%), Gaps = 39/350 (11%)

Query: 231 IISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECE-QKYRF--PSPILKVQFHP 287
           ++  H   + SI++S +G  ++S S+D  + +WD+  GE   +       S + K+    
Sbjct: 60  LLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSS 119

Query: 288 RNDHMFLVCPMRH---AAVLVDTDGDHQIVPIDEDSDLSIVASFDRRGQHVVTGNS-KGK 343
            + +  L+          +   +     I  ++  S+     +F   G+ + +G+S  G 
Sbjct: 120 PDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGT 179

Query: 344 VLVLSCPALELKASFKASTTAIKSIEFARRGNCLLVNTG-DRLIRVYNHQEVLNAGKDGD 402
           + +      +  ++    T  + S+ F+  G  L+ +   D  IR+++    L+ GK   
Sbjct: 180 IKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWD----LSTGK--- 232

Query: 403 PEPIQKLQDLVNKTMWKKCCFSGDGEYVCAGSARQHALYIWE-KSVGNLVKILHGTKGEL 461
                    L   +      FS DG  + A  +    + +W+ +S  +L++ L G     
Sbjct: 233 ----LLRSTLSGHSDSVVSSFSPDGSLL-ASGSSDGTIRLWDLRSSSSLLRTLSGHSSS- 286

Query: 462 LLDVVWHPVRPIIASISS-GVVSLW--------------AQNQVENWSAFAPDFKELDEN 506
           +L V + P   ++AS SS G V LW                    +  +F+PD   L   
Sbjct: 287 VLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSG 346

Query: 507 VEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEV 556
              D     +D+        +  +     L +  +    V +  S+D  V
Sbjct: 347 GSDDGTIRLWDLR--TGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTV 394


>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 89.3 bits (222), Expect = 1e-19
 Identities = 78/315 (24%), Positives = 130/315 (41%), Gaps = 52/315 (16%)

Query: 145 VTC-AFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFN 203
           VTC AF+  G LLA G  DG I +WD  T  + + +  H  P               A +
Sbjct: 12  VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-----------VRDVAAS 60

Query: 204 KRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIW 263
             GT LA G +D  I LWD  T    + ++ H   V S+++S +G  + S+S D  + +W
Sbjct: 61  ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120

Query: 264 DVLTGECEQKYRF-PSPILKVQFHPRNDHMFLVCPMRHAAVLVDTDGDHQIVPIDE---- 318
           DV TG+C    R     +  V F P  D  F+             DG  ++  +      
Sbjct: 121 DVETGKCLTTLRGHTDWVNSVAFSP--DGTFVAS--------SSQDGTIKLWDLRTGKCV 170

Query: 319 ------DSDLSIVASFDRRGQHVVTGNSKGKVLVLSCPALELKASFKASTTAIKSIEFAR 372
                   +++ VA F   G+ +++ +S G + +      +   + +     + S+ F+ 
Sbjct: 171 ATLTGHTGEVNSVA-FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP 229

Query: 373 RGNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQ---DLVNKTMWKKCCFSGDGEY 429
            G  L   + D  IRV++ +           E +Q L    + V         +S DG+ 
Sbjct: 230 DGYLLASGSEDGTIRVWDLR---------TGECVQTLSGHTNSVTS-----LAWSPDGKR 275

Query: 430 VCAGSARQHALYIWE 444
           + +GSA    + IW+
Sbjct: 276 LASGSA-DGTIRIWD 289



 Score = 85.1 bits (211), Expect = 3e-18
 Identities = 52/203 (25%), Positives = 81/203 (39%), Gaps = 32/203 (15%)

Query: 87  CCFSGDGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVT 146
             FS DG  + + S R   + +W+   G  +  L G                      V 
Sbjct: 99  VAFSPDGRILSSSS-RDKTIKVWDVETGKCLTTLRGHTD------------------WVN 139

Query: 147 C-AFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKR 205
             AF+  GT +A    DG I LWD  T      ++ H           ++   + AF+  
Sbjct: 140 SVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGH--------TGEVN---SVAFSPD 188

Query: 206 GTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDV 265
           G  L    +DG I LWD  T      +  H + V S+++S +GY + S S D  + +WD+
Sbjct: 189 GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDL 248

Query: 266 LTGECEQKYR-FPSPILKVQFHP 287
            TGEC Q      + +  + + P
Sbjct: 249 RTGECVQTLSGHTNSVTSLAWSP 271



 Score = 64.7 bits (158), Expect = 2e-11
 Identities = 51/302 (16%), Positives = 100/302 (33%), Gaps = 60/302 (19%)

Query: 230 KIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECEQKYRFPS-PILKVQFHPR 288
           + +  H   V  +++S +G  + + S D  + +WD+ TGE  +  +  + P+  V     
Sbjct: 3   RTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA- 61

Query: 289 NDHMFLVCPMRHAAVLVDTDGDHQIVP----------IDEDSDLSIVAS----------- 327
           +              L D +    +            +    D  I++S           
Sbjct: 62  DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121

Query: 328 ---------------------FDRRGQHVVTGNSKGKVLVLSCPALELKASFKASTTAIK 366
                                F   G  V + +  G + +      +  A+    T  + 
Sbjct: 122 VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVN 181

Query: 367 SIEFARRGNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCC-FSG 425
           S+ F+  G  LL ++ D  I++++                + L  L           FS 
Sbjct: 182 SVAFSPDGEKLLSSSSDGTIKLWDLST------------GKCLGTLRGHENGVNSVAFSP 229

Query: 426 DGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVWHPVRPIIASISS-GVVSL 484
           DG  + +GS     + +W+   G  V+ L G     +  + W P    +AS S+ G + +
Sbjct: 230 DGYLLASGSEDGT-IRVWDLRTGECVQTLSGHTNS-VTSLAWSPDGKRLASGSADGTIRI 287

Query: 485 WA 486
           W 
Sbjct: 288 WD 289



 Score = 52.0 bits (125), Expect = 4e-07
 Identities = 38/137 (27%), Positives = 55/137 (40%), Gaps = 31/137 (22%)

Query: 87  CCFSGDGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVT 146
             FS DGE + + S+      +W+ S G  +  L G +  +                  +
Sbjct: 183 VAFSPDGEKLLSSSSDGTI-KLWDLSTGKCLGTLRGHENGV-----------------NS 224

Query: 147 CAFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTC-AFNKR 205
            AF+  G LLA G  DG I +WD  T    + +S H +             VT  A++  
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------------VTSLAWSPD 272

Query: 206 GTLLAVGCNDGRIVLWD 222
           G  LA G  DG I +WD
Sbjct: 273 GKRLASGSADGTIRIWD 289



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 32/134 (23%), Positives = 56/134 (41%), Gaps = 16/134 (11%)

Query: 354 LKASFKASTTAIKSIEFARRGNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLV 413
           L+ + K  T  +  + F+  G  L   +GD  I+V+            D E  + L+ L 
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVW------------DLETGELLRTLK 48

Query: 414 NKTMWKKCC-FSGDGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVWHPVRP 472
             T   +    S DG Y+ +GS+    + +W+   G  V+ L G     +  V + P   
Sbjct: 49  GHTGPVRDVAASADGTYLASGSS-DKTIRLWDLETGECVRTLTGHTSY-VSSVAFSPDGR 106

Query: 473 IIASISS-GVVSLW 485
           I++S S    + +W
Sbjct: 107 ILSSSSRDKTIKVW 120



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 30/132 (22%), Positives = 47/132 (35%), Gaps = 33/132 (25%)

Query: 40  GNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCC-FSGDGEYVCA 98
           G  LL ++ D  I++++                + L  L           FS DG  + +
Sbjct: 189 GEKLLSSSSDGTIKLWDLST------------GKCLGTLRGHENGVNSVAFSPDGYLLAS 236

Query: 99  GSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVTC-AFNKRGTLLA 157
           GS     + +W+   G  V+ L G                     +VT  A++  G  LA
Sbjct: 237 GSEDGT-IRVWDLRTGECVQTLSGHTN------------------SVTSLAWSPDGKRLA 277

Query: 158 VGCNDGRIVLWD 169
            G  DG I +WD
Sbjct: 278 SGSADGTIRIWD 289


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 45.0 bits (107), Expect = 1e-06
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 230 KIISAHMHPVCSISWSRNGYKIISASTDNNVCIWD 264
           + +  H  PV S+++S +G  + S S D  V +WD
Sbjct: 5   RTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 32.3 bits (74), Expect = 0.034
 Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 144 AVTC-AFNKRGTLLAVGCNDGRIVLWD 169
            VT  AF+  G LLA G +DG + +WD
Sbjct: 13  PVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 32.3 bits (74), Expect = 0.034
 Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 197 AVTC-AFNKRGTLLAVGCNDGRIVLWD 222
            VT  AF+  G LLA G +DG + +WD
Sbjct: 13  PVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 26.2 bits (58), Expect = 5.6
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 353 ELKASFKASTTAIKSIEFARRGNCLLVNTGDRLIRVYN 390
           +L  + K  T  + S+ F+  GN L   + D  +RV++
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 43.8 bits (104), Expect = 3e-06
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 230 KIISAHMHPVCSISWSRNGYKIISASTDNNVCIWD 264
           K +  H  PV S+++S +G  + S S D  + +WD
Sbjct: 6   KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 32.7 bits (75), Expect = 0.030
 Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 144 AVTC-AFNKRGTLLAVGCNDGRIVLWD 169
            VT  AF+  G  LA G +DG I LWD
Sbjct: 14  PVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 32.7 bits (75), Expect = 0.030
 Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 197 AVTC-AFNKRGTLLAVGCNDGRIVLWD 222
            VT  AF+  G  LA G +DG I LWD
Sbjct: 14  PVTSVAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|222072 pfam13360, PQQ_2, PQQ-like domain.  This domain contains several
           repeats of the PQQ repeat.
          Length = 234

 Score = 35.8 bits (83), Expect = 0.040
 Identities = 26/147 (17%), Positives = 43/147 (29%), Gaps = 14/147 (9%)

Query: 204 KRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIW 263
             G + A+    G+ VLW      +          V       +G ++  A+    +   
Sbjct: 1   ADGVVAALDAATGK-VLW---RVDLGGTALGGGVAV-------DGGRLYVATGKGELVAL 49

Query: 264 DVLTGECEQKYRFPSPILKVQFHPRNDHMFLVCPMRHAAVLVDTDGD--HQIVPIDEDSD 321
           D  TG+   +      IL          + +V        L    G              
Sbjct: 50  DAATGKLLWRKDLSGEILGAPTV-AGGVVVVVTADGSLYALDAETGKLLWSYQRSLPPLT 108

Query: 322 LSIVASFDRRGQHVVTGNSKGKVLVLS 348
           L   +S    G  V+ G S GK++ L 
Sbjct: 109 LRGSSSPAIVGDTVIVGFSSGKLVALD 135


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 35.2 bits (81), Expect = 0.095
 Identities = 28/109 (25%), Positives = 40/109 (36%), Gaps = 14/109 (12%)

Query: 498 PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVD 557
               + DE     E ES+ + S ED       D+++E+ + +  S        SSDEE  
Sbjct: 165 ESSDKDDEEESESEDESKSEESAED----DSDDEEEEDSDSEDYSQYDGMLVDSSDEEEG 220

Query: 558 DSKVLEFLPIAPDVEDPE--------DGPGSTSAPVTSSPPSKTTKYKS 598
           +      +    D  + E            S S    SSPPSK  K K 
Sbjct: 221 EEA--PSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKK 267



 Score = 34.4 bits (79), Expect = 0.18
 Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 11/96 (11%)

Query: 502 ELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKV 561
           E D + E +E     D S  D  +   S D++E  E    +     +   SDE   +   
Sbjct: 188 EDDSDDEEEEDSDSEDYSQYDGMLV-DSSDEEEGEEAPSINYNEDTSESESDESDSEIS- 245

Query: 562 LEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYK 597
                 +    +       +S P       KT+   
Sbjct: 246 ---ESRSVSDSE------ESSPPSKKPKEKKTSSTF 272


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 33.5 bits (76), Expect = 0.37
 Identities = 22/68 (32%), Positives = 27/68 (39%), Gaps = 11/68 (16%)

Query: 491 ENWSAFA---PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVA 547
           E WSA     PD       VE D  +SE D   ED      SDD+ +   ID    +   
Sbjct: 689 EIWSALVKSRPD-------VEDDSDDSELD-FAEDDFSDSTSDDEPKLDAIDDEDAKSEG 740

Query: 548 AFCSSDEE 555
           +  S  EE
Sbjct: 741 SQESDQEE 748


>gnl|CDD|204739 pfam11768, DUF3312, Protein of unknown function (DUF3312).  This is
           a eukaryotic family of uncharacterized proteins. This
           family shows similarity to WD40 repeat proteins.
          Length = 544

 Score = 33.2 bits (76), Expect = 0.46
 Identities = 18/81 (22%), Positives = 32/81 (39%), Gaps = 13/81 (16%)

Query: 142 SLAVTCAFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCA 201
           S   +CA +     L +GC D  +VL+D   RG+  +  A + P               A
Sbjct: 259 SQVCSCARSPDEEKLLLGCIDSSLVLYD-EHRGVTLLAQAAVIPHL------------IA 305

Query: 202 FNKRGTLLAVGCNDGRIVLWD 222
           ++    ++ V    G +  +D
Sbjct: 306 WHPDSAIVLVANEQGELQCFD 326


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 32.7 bits (74), Expect = 0.62
 Identities = 24/80 (30%), Positives = 31/80 (38%), Gaps = 16/80 (20%)

Query: 501 KELDENVEYDERESEFDIS---DEDKSVAQGSDDQDEELEIDVTSVEPV----------- 546
           KE +E    +E + E D     DED+      DD+D+E E D  S               
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETD 102

Query: 547 --AAFCSSDEEVDDSKVLEF 564
             A F  SD+E DD     F
Sbjct: 103 NEAGFADSDDESDDGSEYVF 122


>gnl|CDD|234748 PRK00402, PRK00402, 3-isopropylmalate dehydratase large subunit;
           Reviewed.
          Length = 418

 Score = 32.5 bits (75), Expect = 0.71
 Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 15/58 (25%)

Query: 496 FAPDFKELDENVEY--DERESEFDI--SDEDKSVAQGSDDQDEELEIDVTSVEPVAAF 549
           FAPD    ++ +EY  +    ++    SDED    +         EID++ +EP  A 
Sbjct: 226 FAPD----EKTLEYLKERAGRDYKPWKSDEDAEYEE-------VYEIDLSKLEPQVAA 272


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 32.7 bits (75), Expect = 0.76
 Identities = 14/85 (16%), Positives = 27/85 (31%), Gaps = 3/85 (3%)

Query: 518  ISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKVLEFLPIAPDVEDPED- 576
            I++ D +VAQ   +  E +         +     + E V     +   P AP V +    
Sbjct: 931  ITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAE 990

Query: 577  --GPGSTSAPVTSSPPSKTTKYKSF 599
                 +    V  +   + T   + 
Sbjct: 991  VETVTAVEPEVAPAQVPEATVEHNH 1015


>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
           kinases [Cell motility and secretion / Signal
           transduction mechanisms].
          Length = 716

 Score = 32.3 bits (74), Expect = 0.81
 Identities = 14/106 (13%), Positives = 39/106 (36%), Gaps = 1/106 (0%)

Query: 498 PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVD 557
           PD ++L+    ++E E       +++ +    +   E  E+++ +VE  A      E  +
Sbjct: 223 PDLEDLEAEAAFEESEVVLATEQDEELIRDVLELVVEAEELEIAAVELDAELLELAES-E 281

Query: 558 DSKVLEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYKSFDIQL 603
            +        A  + +      +  + + ++P +K     +     
Sbjct: 282 QAADDVLAAQAEPLAEKSAAEAAKLSALEAAPAAKAAAAAAGASSS 327


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 32.3 bits (74), Expect = 0.82
 Identities = 21/96 (21%), Positives = 33/96 (34%), Gaps = 22/96 (22%)

Query: 508 EYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKVLEFLPI 567
           + +E E   D  DE+       DD D E E +   +         DEE +DS        
Sbjct: 322 DEEEEEDGVDDEDEE------DDDDDLEEEEEDVDLS----DEEEDEEDEDS-------- 363

Query: 568 APDVED--PEDGPGSTSAPVTSSPPSKTTKYKSFDI 601
             D ED   E+          S+  +++    +F  
Sbjct: 364 --DDEDDEEEEEEEKEKKKKKSAESTRSELPFTFPC 397


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 10/47 (21%)

Query: 500 FKELDENVEY--------DERESEFDISDEDKSVAQGSDDQDEELEI 538
           F+E +++ E+        +E +S+FD S++D+      D+++ E E+
Sbjct: 34  FEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPE--SDDEEEGEKEL 78


>gnl|CDD|233530 TIGR01686, FkbH, FkbH-like domain.  This model describes a domain
           of a family of proteins of unknown overall function. One
           of these, however, is a modular polyketide synthase 4800
           amino acids in length from Streptomyces avermilitis in
           which this domain is the C-terminal segment. By
           contrast, the FkbH protein from Streptomyces
           hygroscopicus aparently contains only this domain. The
           remaining members of the family all contain an
           additional N-terminal domain of between 200 and 275
           amino acids which show less than 20% identity to one
           another. It seems likely then that these proteins are
           involved in disparate functions, probably the
           biosynthesis of different natural products. For
           instance, the FkbH gene is found in a gene cluster
           believed to be responsible for the biosynthesis of
           unususal "PKS extender units" in the ascomycin pathway.
           This domain is composed of two parts, the first of which
           is a member of subfamily IIIC (TIGR01681) of the
           haloacid dehalogenase (HAD) superfamily of
           aspartate-nucleophile hydrolases. All of the
           characterized enzymes within subfamily III are
           phosphatases, and all of the active site residues
           characteristic of HAD-superfamily phosphatases are
           present in this domain. The C-terminal portion of this
           domain is unique to this family (by BLAST).
          Length = 320

 Score = 31.4 bits (71), Expect = 1.5
 Identities = 12/59 (20%), Positives = 20/59 (33%)

Query: 202 FNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNV 260
             K+G LLA+   +          R    I+ A      SI+W      +   +   N+
Sbjct: 43  LKKQGFLLALASKNDEDDAKKVFERRKDFILQAEDFDARSINWGPKSESLRKIAKKLNL 101


>gnl|CDD|213306 cd05940, FATP_FACS, Fatty acid transport proteins (FATP) play dual
           roles as fatty acid transporters and its activation
           enzymes.  Fatty acid transport protein (FATP) transports
           long-chain or very-long-chain fatty acids across the
           plasma membrane. FATPs also have fatty acid CoA
           synthetase activity, thus playing dual roles as fatty
           acid transporters and its activation enzymes. At least
           five copies of FATPs are identified in mammalian cells.
           This family also includes prokaryotic FATPs. FATPs are
           the key players in the trafficking of exogenous fatty
           acids into the cell and in intracellular fatty acid
           homeostasis.
          Length = 444

 Score = 31.1 bits (71), Expect = 1.7
 Identities = 17/64 (26%), Positives = 27/64 (42%), Gaps = 9/64 (14%)

Query: 161 NDGRIVLWD-FLTR-GIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGCNDGRI 218
           N  R  +WD F  R G+ +I+      EF G+ +         FNK G +  +      +
Sbjct: 204 NGLRPDIWDEFKERFGVPRIV------EFYGSTEGNV-GFINLFNKPGAVGRLPPAAIAV 256

Query: 219 VLWD 222
           V +D
Sbjct: 257 VKYD 260


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 30.8 bits (69), Expect = 2.2
 Identities = 13/39 (33%), Positives = 24/39 (61%)

Query: 501 KELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEID 539
           K++DE+VE    ESE    +E + V    ++ +EEL+++
Sbjct: 144 KKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLE 182


>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
          Length = 601

 Score = 30.7 bits (70), Expect = 2.5
 Identities = 14/75 (18%), Positives = 24/75 (32%), Gaps = 16/75 (21%)

Query: 563 EFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYK----------------SFDIQLEGA 606
             LP+   VE  + G   T  P T   P   T+ +                   + +   
Sbjct: 186 SLLPVPIRVEGEKGGVNETPPPWTRDYPDPETRREALLAYGEELLGFTPLDVIPLDVPSG 245

Query: 607 ACQGIGIILPTGVNL 621
             +G+  +LP  V+ 
Sbjct: 246 GLEGVAYVLPYAVSP 260


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 30.4 bits (68), Expect = 4.1
 Identities = 19/106 (17%), Positives = 33/106 (31%), Gaps = 9/106 (8%)

Query: 504  DENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPV-----AAFCSSDEEVDD 558
            D   + DE E   D+  +D+       D  E         E +           + +V  
Sbjct: 3949 DRQEKEDEEEMSDDVGIDDEI----QPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSK 4004

Query: 559  SKVLEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYKSFDIQLE 604
               LE + +    E+ E+       P+    P +       DIQ +
Sbjct: 4005 DSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQD 4050


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 29.9 bits (67), Expect = 5.3
 Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 5/104 (4%)

Query: 162 DGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGCNDGRIVLW 221
           DG I+ W     G+ + IS  +     G    + + V+   +    L + G  D  + +W
Sbjct: 97  DGTIMGWGIPEEGLTQNISDPI-VHLQGHTKKVGI-VSFHPSAMNVLASAGA-DMVVNVW 153

Query: 222 DFLTRGIAK-IISAHMHPVCSISWSRNGYKIISASTDNNVCIWD 264
           D + RG A  +I  H   + S+ W+ +G  + + S D  + I D
Sbjct: 154 D-VERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIID 196


>gnl|CDD|130410 TIGR01343, hacA_fam, homoaconitate hydratase family protein.  This
           model represents a subfamily of proteins consisting of
           aconitase, homoaconitase, 3-isopropylmalate dehydratase,
           and uncharacterized proteins. The majority of the
           members of this family have been designated as
           3-isopropylmalate dehydratase large subunit (LeuC) in
           microbial genome annotation, but the only characterized
           member is Thermus thermophilus homoaconitase, an enzyme
           of a non-aspartate pathway of Lys biosynthesis.
          Length = 412

 Score = 29.7 bits (67), Expect = 5.4
 Identities = 16/65 (24%), Positives = 32/65 (49%), Gaps = 4/65 (6%)

Query: 507 VEYDERESEF--DISDEDKSVAQGSDDQD--EELEIDVTSVEPVAAFCSSDEEVDDSKVL 562
           +E DE+   +  +   E   V +  +D +  +E+EID + +EPV A   + + V     +
Sbjct: 223 IEPDEKTRAYLKERYKEPFRVYKSDEDAEYAKEVEIDASQLEPVVAAPHNVDNVHPVSEV 282

Query: 563 EFLPI 567
           E + +
Sbjct: 283 EGIEV 287


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 29.2 bits (66), Expect = 5.6
 Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 8/98 (8%)

Query: 501 KELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSK 560
           + L+ + + DE E      DED      SDD+ E   IDV S + + +  S DEE  D  
Sbjct: 90  QGLESDDDDDEEEEWEVEEDED------SDDEGE--WIDVESDKEIESSDSEDEEEKDEA 141

Query: 561 VLEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYKS 598
             +    + +    ED   +       +   K ++  +
Sbjct: 142 AKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELAT 179


>gnl|CDD|179566 PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated.
          Length = 380

 Score = 29.4 bits (66), Expect = 5.8
 Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 6/36 (16%)

Query: 477 ISSGVVSLWAQNQVENWSAFAPDFKELDENVEYDER 512
             +G+  L A    ENW A+   F E D +  Y ER
Sbjct: 12  FVAGIRELMAS---ENWEAY---FAECDRDSVYPER 41


>gnl|CDD|237212 PRK12808, PRK12808, flagellin; Provisional.
          Length = 476

 Score = 29.5 bits (66), Expect = 6.2
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 6/62 (9%)

Query: 488 NQVENWSAFAPDFKELDENVEYDERESEFD---ISDEDKSVA---QGSDDQDEELEIDVT 541
           N  +N +A   +F EL + + Y    ++F+   + + + S+      S D D+++ ID+ 
Sbjct: 102 NTDKNQAALQKEFAELQKQITYIADNTQFNDKNLLNSNSSINIQTLDSSDGDKQIGIDLK 161

Query: 542 SV 543
           S 
Sbjct: 162 SA 163


>gnl|CDD|220430 pfam09829, DUF2057, Uncharacterized protein conserved in bacteria
           (DUF2057).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 189

 Score = 28.4 bits (64), Expect = 7.6
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 9/61 (14%)

Query: 298 MRHAAVLVDTDGDHQIV-----PIDEDSDLSIVASFDRRGQHVVTGNSKGKVLVLSCPAL 352
               + L   DG++QIV       D   D  +V S       +VT ++  + L LS P +
Sbjct: 24  FSGTSSLELPDGENQIVVRYEKLFDSGGDRELVKS----DPIIVTFDASDQDLTLSLPKI 79

Query: 353 E 353
            
Sbjct: 80  R 80


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.5 bits (64), Expect = 7.9
 Identities = 14/49 (28%), Positives = 25/49 (51%)

Query: 498 PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPV 546
            D ++ DE  E +  E + + SDE++   +   + + E EI   S +PV
Sbjct: 82  GDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPSWKPV 130


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 29.0 bits (65), Expect = 8.1
 Identities = 30/102 (29%), Positives = 40/102 (39%), Gaps = 27/102 (26%)

Query: 504 DENVEYDERESEFDISDEDKS-VAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKVL 562
           D N + +    E + SDE +   A+ +D + EE E+D       AA  S D E D+S   
Sbjct: 242 DNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMD-------AAEASEDSESDESD-- 292

Query: 563 EFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTT----KYKSFD 600
                    ED E  PG  + P T   P         YK F 
Sbjct: 293 ---------EDTET-PGEDARPAT---PFTELMEEVDYKVFT 321


>gnl|CDD|225115 COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal
           transduction mechanisms].
          Length = 890

 Score = 29.2 bits (66), Expect = 9.2
 Identities = 11/41 (26%), Positives = 18/41 (43%)

Query: 488 NQVENWSAFAPDFKELDENVEYDERESEFDISDEDKSVAQG 528
           N +EN   +AP   E+  N   +     F + DE   + +G
Sbjct: 782 NLLENALKYAPPGSEIRINAGVERENVVFSVIDEGPGIPEG 822


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.417 

Gapped
Lambda     K      H
   0.267   0.0728    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,402,273
Number of extensions: 3064770
Number of successful extensions: 2850
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2778
Number of HSP's successfully gapped: 53
Length of query: 622
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 519
Effective length of database: 6,369,140
Effective search space: 3305583660
Effective search space used: 3305583660
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.7 bits)