RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10286
         (748 letters)



>gnl|CDD|222983 PHA03098, PHA03098, kelch-like protein; Provisional.
          Length = 534

 Score =  217 bits (555), Expect = 3e-62
 Identities = 113/498 (22%), Positives = 219/498 (43%), Gaps = 25/498 (5%)

Query: 196 DACADFLKKRFHPNNVLDYYVL--FSCRAMEALINFAYSGRVTIHSQNVQSLMVVASFLQ 253
            + +++ KK F  NN  +  +       +   +I + Y+G++ I S NV+ ++ +A++L 
Sbjct: 31  SSSSEYFKKMFK-NNFKENEINLNIDYDSFNEVIKYIYTGKINITSNNVKDILSIANYLI 89

Query: 254 MQKVADACADFLKKRFHPNNVLGIRQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGL 313
           +  + + C +++ K    NN + I +F+    C +L  AA  Y++     +    +FI L
Sbjct: 90  IDFLINLCINYIIKIIDDNNCIDIYRFSFFYGCKKLYSAAYNYIRNNIELIYNDPDFIYL 149

Query: 314 GVNEVNDIVKRSELHLMSEEQVFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLAD 373
             NE+  I+   +L++ SE+ V E +++W+    + +   +  +L  +R+  LS   +  
Sbjct: 150 SKNELIKILSDDKLNVSSEDVVLEIIIKWLTSKKNNKYKDICLILKVLRITFLSEEGIKK 209

Query: 374 RVATEALIRSSHECRDLVDEARDFHLMPERRFLLAGEKTTPRRCNYVMGHIFAVGGLTKA 433
               +  I+     + + ++     +  ++  L    K  PR   +       +     +
Sbjct: 210 LKRWKLRIKKK---KIVFNKRCIKIIYSKKYNL---NKILPRSSTFGSIIYIHIT---MS 260

Query: 434 GDSLSTVEVFDPLVGRWQMAEE--------ETLSNAVISTKSCLTKAGDSLSTVEVFDPL 485
             + + +  + PL     + +           L+N +      + K   S+++V  +D  
Sbjct: 261 IFTYNYITNYSPLSEINTIIDIHYVYCFGSVVLNNVIYFIG-GMNKNNLSVNSVVSYDTK 319

Query: 486 VGRWQMAEAMSMLRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
              W     +   R   GV V  NR+Y  GG   S  L+TVE + P    W +  P+ F 
Sbjct: 320 TKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFP 379

Query: 546 RSAVGAAALNDKLYVCGG-YDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSY 604
           R       +N+ +YV GG       L TVEC+  + ++W     +      G  I  D  
Sbjct: 380 RYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGK 439

Query: 605 VYALGGH---DGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGYDGA 661
           +Y +GG    D + +++ VE Y+P T++WT +  +   R    +   NNKIYV GG    
Sbjct: 440 IYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYE 499

Query: 662 IFLQSVEMYDPITDEWKM 679
            ++  +E+YD  T+ W +
Sbjct: 500 YYINEIEVYDDKTNTWTL 517



 Score =  148 bits (376), Expect = 6e-38
 Identities = 66/238 (27%), Positives = 112/238 (47%), Gaps = 5/238 (2%)

Query: 503 GVAVMKNRLYAFGGYNGS-ERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVC 561
           G  V+ N +Y  GG N +   +++V  +D   + WNKV  + + R   G    N+++YV 
Sbjct: 289 GSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVI 348

Query: 562 GGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGG-HDGLSIFDSV 620
           GG     SLNTVE ++P + +WR    +   R    V+  ++ +Y +GG      +  +V
Sbjct: 349 GGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTV 408

Query: 621 ERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGY---DGAIFLQSVEMYDPITDEW 677
           E +   T++W+   P+            + KIYV GG    D       VE Y+P+T++W
Sbjct: 409 ECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKW 468

Query: 678 KMIASMNVMRSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPMCAHEG 735
             ++S+N  R   +L     K++ +GG      +  +EVYD  T++W          G
Sbjct: 469 TELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWTLFCKFPKVIG 526



 Score =  119 bits (300), Expect = 3e-28
 Identities = 53/216 (24%), Positives = 93/216 (43%), Gaps = 13/216 (6%)

Query: 517 YNGSERLSTVEEFDPVRRVWNKVS---PMCFKRSAVGAAALNDKLYVCGGYDG-VSSLNT 572
                  + +  + P+  +   +      CF     G+  LN+ +Y  GG +    S+N+
Sbjct: 258 TMSIFTYNYITNYSPLSEINTIIDIHYVYCF-----GSVVLNNVIYFIGGMNKNNLSVNS 312

Query: 573 VECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTS 632
           V  Y+     W  V  +   R   GV  F++ +Y +GG       ++VE + P   +W  
Sbjct: 313 VVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWRE 372

Query: 633 VKPMLTKRCRLGVAALNNKIYVCGG-YDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVA 691
             P++  R    V  +NN IYV GG       L++VE +   T++W   + + +      
Sbjct: 373 EPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGC 432

Query: 692 LVANMGKLWAIGGY---DGVSNLPTVEVYDPSTDSW 724
            + + GK++ IGG    D +     VE Y+P T+ W
Sbjct: 433 AIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKW 468



 Score = 89.4 bits (222), Expect = 1e-18
 Identities = 47/196 (23%), Positives = 79/196 (40%), Gaps = 20/196 (10%)

Query: 414 PRRCNYVM---GHIFAVGGLTKAGDSLSTVEVFDPLVGRWQMAEEETL----SNAVISTK 466
           PR+   V      I+ +GG+     SL+TVE + P   +W+  EE  L     N  +   
Sbjct: 332 PRKNPGVTVFNNRIYVIGGI-YNSISLNTVESWKPGESKWR--EEPPLIFPRYNPCVVNV 388

Query: 467 SCL-------TKAGDSLSTVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKNRLYAFGGY-- 517
           + L       +K  + L TVE F     +W     + +            ++Y  GG   
Sbjct: 389 NNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISY 448

Query: 518 -NGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECY 576
            +  +  + VE ++PV   W ++S + F R        N+K+YV GG      +N +E Y
Sbjct: 449 IDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVY 508

Query: 577 EPDKDQWRIVKSMQKH 592
           +   + W +     K 
Sbjct: 509 DDKTNTWTLFCKFPKV 524



 Score = 48.6 bits (116), Expect = 1e-05
 Identities = 26/91 (28%), Positives = 55/91 (60%), Gaps = 4/91 (4%)

Query: 125 IVLAATIPYFQAMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMV 184
           I+L+++  YF+ MF ++    K+ EI +  ID  +   +I ++Y+G++ I S NV+ ++ 
Sbjct: 28  IILSSSSEYFKKMFKNN---FKENEINLN-IDYDSFNEVIKYIYTGKINITSNNVKDILS 83

Query: 185 VASFLQMQKVADACADFLKKRFHPNNVLDYY 215
           +A++L +  + + C +++ K    NN +D Y
Sbjct: 84  IANYLIIDFLINLCINYIIKIIDDNNCIDIY 114


>gnl|CDD|149006 pfam07707, BACK, BTB And C-terminal Kelch.  This domain is found
           associated with pfam00651 and pfam01344. The BACK domain
           is found juxtaposed to the BTB domain; they are
           separated by as little as two residues. This family
           appears to be closely related to the BTB domain (Finn
           RD, personal observation).
          Length = 101

 Score =  128 bits (325), Expect = 3e-35
 Identities = 50/101 (49%), Positives = 71/101 (70%)

Query: 274 VLGIRQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGLGVNEVNDIVKRSELHLMSEE 333
            LGI +FADT  C +L+E A +++ + F EVS S+EF+ L   ++ +++   EL++ SEE
Sbjct: 1   CLGIYRFADTYGCEELAEVALRFILKNFLEVSKSEEFLELSKEQLLELLSSDELNVPSEE 60

Query: 334 QVFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLADR 374
           +VFEAV++WVKH+   R   LP LL+AVRLPLLSP YL D 
Sbjct: 61  EVFEAVIKWVKHDVENRKKHLPELLSAVRLPLLSPEYLLDV 101


>gnl|CDD|197943 smart00875, BACK, BTB And C-terminal Kelch.  The BACK domain is
           found juxtaposed to the BTB domain; they are separated
           by as little as two residues.
          Length = 101

 Score =  120 bits (303), Expect = 2e-32
 Identities = 48/101 (47%), Positives = 70/101 (69%)

Query: 274 VLGIRQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGLGVNEVNDIVKRSELHLMSEE 333
            LGIR+FAD     +L+E A +++ Q F EVS S+EF+ L + ++ +++   +L++ SEE
Sbjct: 1   CLGIRRFADAHGLEELAEKALRFILQNFSEVSSSEEFLELPLEQLLELLSSDDLNVSSEE 60

Query: 334 QVFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLADR 374
           +VFEAV+RWVKH+  +R   LP LL  VRLPLLSP YL + 
Sbjct: 61  EVFEAVLRWVKHDPEKRREHLPELLEHVRLPLLSPDYLLEV 101


>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain.  The BTB (for BR-C, ttk and bab) or
           POZ (for Pox virus and Zinc finger) domain is present
           near the N-terminus of a fraction of zinc finger
           (pfam00096) proteins and in proteins that contain the
           pfam01344 motif such as Kelch and a family of pox virus
           proteins. The BTB/POZ domain mediates homomeric
           dimerisation and in some instances heteromeric
           dimerisation. The structure of the dimerised PLZF
           BTB/POZ domain has been solved and consists of a tightly
           intertwined homodimer. The central scaffolding of the
           protein is made up of a cluster of alpha-helices flanked
           by short beta-sheets at both the top and bottom of the
           molecule. POZ domains from several zinc finger proteins
           have been shown to mediate transcriptional repression
           and to interact with components of histone deacetylase
           co-repressor complexes including N-CoR and SMRT. The POZ
           or BTB domain is also known as BR-C/Ttk or ZiN.
          Length = 101

 Score =  101 bits (253), Expect = 1e-25
 Identities = 42/139 (30%), Positives = 65/139 (46%), Gaps = 38/139 (27%)

Query: 65  MEEIRRQGKLCDVTIKVDDQSFTCHRIVLAATIPYFQAMFTSDMAESKQREITMQGIDAV 124
           + E+R  G+LCDVT+ V D+ F  H+ VLAA  PYF+A+FT +                 
Sbjct: 1   LNELRENGELCDVTLVVGDKEFHAHKAVLAACSPYFKALFTGNKEV-------------- 46

Query: 125 IVLAATIPYFQAMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMV 184
                                   EIT++ +     EAL+ F+Y+G++ I  +NV  L+ 
Sbjct: 47  ------------------------EITLEDVSPEDFEALLEFIYTGKLEITEENVDDLLA 82

Query: 185 VASFLQMQKVADACADFLK 203
           +A  LQ+  + D C +FL 
Sbjct: 83  LADKLQIPALIDKCEEFLI 101



 Score = 55.3 bits (134), Expect = 2e-09
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 197 ACADFLKKRFHPNNVLDYYVL-FSCRAMEALINFAYSGRVTIHSQNVQSLMVVASFLQMQ 255
           AC+ + K  F  N  ++  +   S    EAL+ F Y+G++ I  +NV  L+ +A  LQ+ 
Sbjct: 31  ACSPYFKALFTGNKEVEITLEDVSPEDFEALLEFIYTGKLEITEENVDDLLALADKLQIP 90

Query: 256 KVADACADFLK 266
            + D C +FL 
Sbjct: 91  ALIDKCEEFLI 101


>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac.  Domain
           in Broad-Complex, Tramtrack and Bric a brac. Also known
           as POZ (poxvirus and zinc finger) domain. Known to be a
           protein-protein interaction motif found at the N-termini
           of several C2H2-type transcription factors as well as
           Shaw-type potassium channels. Known structure reveals a
           tightly intertwined dimer formed via interactions
           between N-terminal strand and helix structures. However
           in a subset of BTB/POZ domains, these two secondary
           structures appear to be missing. Be aware SMART predicts
           BTB/POZ domains without the beta1- and alpha1-secondary
           structures.
          Length = 97

 Score = 90.1 bits (224), Expect = 1e-21
 Identities = 38/130 (29%), Positives = 62/130 (47%), Gaps = 34/130 (26%)

Query: 76  DVTIKVDDQSFTCHRIVLAATIPYFQAMFTSDMAESKQREITMQGIDAVIVLAATIPYFQ 135
           DVT+ V  + F  H+ VLAA               S                    PYF+
Sbjct: 1   DVTLVVGGKKFHAHKAVLAA--------------HS--------------------PYFK 26

Query: 136 AMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMVVASFLQMQKVA 195
           A+F+SD  ES + EI +  +      AL+NF+Y+G++ +  +NV+ L+ +A +LQ+  + 
Sbjct: 27  ALFSSDFKESDKSEIYLDDVSPEDFRALLNFLYTGKLDLPEENVEELLELADYLQIPGLV 86

Query: 196 DACADFLKKR 205
           + C +FL K 
Sbjct: 87  ELCEEFLLKL 96



 Score = 54.2 bits (131), Expect = 4e-09
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 197 ACADFLKKRFHPNNVLD-----YYVLFSCRAMEALINFAYSGRVTIHSQNVQSLMVVASF 251
           A + + K  F  +         Y    S     AL+NF Y+G++ +  +NV+ L+ +A +
Sbjct: 20  AHSPYFKALFSSDFKESDKSEIYLDDVSPEDFRALLNFLYTGKLDLPEENVEELLELADY 79

Query: 252 LQMQKVADACADFLKKR 268
           LQ+  + + C +FL K 
Sbjct: 80  LQIPGLVELCEEFLLKL 96


>gnl|CDD|128874 smart00612, Kelch, Kelch domain. 
          Length = 47

 Score = 72.6 bits (179), Expect = 3e-16
 Identities = 22/47 (46%), Positives = 29/47 (61%)

Query: 651 KIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVALVANMG 697
           KIYV GG+DG   L+SVE+YDP T++W  + SM   RS   +    G
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 67.2 bits (165), Expect = 3e-14
 Identities = 21/46 (45%), Positives = 28/46 (60%)

Query: 605 VYALGGHDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNN 650
           +Y +GG DG     SVE YDP+T++WT +  M T R   GVA +N 
Sbjct: 2   IYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 64.5 bits (158), Expect = 2e-13
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 557 KLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFD 602
           K+YV GG+DG   L +VE Y+P+ ++W  + SM   RS  GV   +
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVIN 46



 Score = 57.6 bits (140), Expect = 7e-11
 Identities = 17/47 (36%), Positives = 26/47 (55%)

Query: 510 RLYAFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALND 556
           ++Y  GG++G +RL +VE +DP    W  +  M   RS  G A +N 
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 51.4 bits (124), Expect = 1e-08
 Identities = 20/45 (44%), Positives = 27/45 (60%)

Query: 698 KLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPMCAHEGGVGVGVI 742
           K++ +GG+DG   L +VEVYDP T+ W  +  M     G GV VI
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVI 45



 Score = 41.0 bits (97), Expect = 5e-05
 Identities = 15/37 (40%), Positives = 20/37 (54%)

Query: 473 GDSLSTVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKN 509
           G  L +VEV+DP   +W    +M   RS  GVAV+  
Sbjct: 11  GQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 34.1 bits (79), Expect = 0.012
 Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 423 HIFAVGGLTKAGDSLSTVEVFDPLVGRWQM 452
            I+ VGG    G  L +VEV+DP   +W  
Sbjct: 1   KIYVVGGFD-GGQRLKSVEVYDPETNKWTP 29


>gnl|CDD|201739 pfam01344, Kelch_1, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that the
           Drosophila ring canal kelch protein is related to
           Galactose Oxidase, for which a structure has been
           solved. The kelch motif forms a beta sheet. Several of
           these sheets associate to form a beta propeller
           structure as found in pfam00064, pfam00400 and
           pfam00415.
          Length = 46

 Score = 68.1 bits (167), Expect = 1e-14
 Identities = 24/46 (52%), Positives = 27/46 (58%)

Query: 639 KRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMN 684
            R   GV  L  KIYV GGYDG   L SVE+YDP T+ W  + SM 
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46



 Score = 64.6 bits (158), Expect = 2e-13
 Identities = 21/46 (45%), Positives = 28/46 (60%)

Query: 545 KRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQ 590
            RS  G   L  K+YV GGYDG  SL++VE Y+P+ + W  + SM 
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46



 Score = 61.5 bits (150), Expect = 2e-12
 Identities = 18/45 (40%), Positives = 26/45 (57%)

Query: 592 HRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTSVKPM 636
            RS  GV+     +Y +GG+DG     SVE YDP+T+ W+ +  M
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45



 Score = 58.8 bits (143), Expect = 2e-11
 Identities = 19/44 (43%), Positives = 28/44 (63%)

Query: 499 RSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNKVSPM 542
           RS  GV V+  ++Y  GGY+G + LS+VE +DP    W+K+  M
Sbjct: 2   RSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45



 Score = 56.5 bits (137), Expect = 2e-10
 Identities = 21/45 (46%), Positives = 30/45 (66%)

Query: 686 MRSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPM 730
            RS   +V   GK++ IGGYDG  +L +VEVYDP T++W+ +  M
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45



 Score = 36.1 bits (84), Expect = 0.003
 Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 5/41 (12%)

Query: 415 RRCNYVM----GHIFAVGGLTKAGDSLSTVEVFDPLVGRWQ 451
            R    +    G I+ +GG    G SLS+VEV+DP    W 
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDG-GQSLSSVEVYDPETNTWS 40


>gnl|CDD|165086 PHA02713, PHA02713, hypothetical protein; Provisional.
          Length = 557

 Score = 70.8 bits (173), Expect = 1e-12
 Identities = 43/169 (25%), Positives = 75/169 (44%), Gaps = 19/169 (11%)

Query: 505 AVMKNRLYAFGGYN-GSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGG 563
           A++ N +   GGYN  +  L+ V + +   ++  ++ PM   R     A ++D +Y  GG
Sbjct: 300 AIVDNEIIIAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIGG 359

Query: 564 YDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGG------------- 610
            +G +   T+ECY    D+W+++  M    S+ G+   D Y+Y +GG             
Sbjct: 360 QNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHM 419

Query: 611 -----HDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYV 654
                 +     + V RYD   + W ++    T   R GV +  + IYV
Sbjct: 420 NSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYV 468



 Score = 50.8 bits (121), Expect = 2e-06
 Identities = 22/70 (31%), Positives = 37/70 (52%)

Query: 635 PMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVALVA 694
           PM+  RCR  +A +++ IY  GG +G    +++E Y    D+WKM+  M +  S   +  
Sbjct: 337 PMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCV 396

Query: 695 NMGKLWAIGG 704
               ++ IGG
Sbjct: 397 LDQYIYIIGG 406



 Score = 46.5 bits (110), Expect = 4e-05
 Identities = 36/138 (26%), Positives = 63/138 (45%), Gaps = 1/138 (0%)

Query: 551 AAALNDKLYVCGGYD-GVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALG 609
           +A +++++ + GGY+    SLN V     +      +  M K+R    +   D  +YA+G
Sbjct: 299 SAIVDNEIIIAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIG 358

Query: 610 GHDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEM 669
           G +G ++  ++E Y    D+W  +  M       G+  L+  IY+ GG    I   SV  
Sbjct: 359 GQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHH 418

Query: 670 YDPITDEWKMIASMNVMR 687
            + I  E    +S  V+R
Sbjct: 419 MNSIDMEEDTHSSNKVIR 436



 Score = 36.1 bits (83), Expect = 0.073
 Identities = 26/131 (19%), Positives = 52/131 (39%), Gaps = 36/131 (27%)

Query: 74  LCDVTIKV-DDQSFTCHRIVLAATIPYFQAMFTSDMAESKQREITMQGIDAVIVLAATIP 132
           LCDV I + D +    H+ +LAA   YF+ +FT+ M                        
Sbjct: 25  LCDVIITIGDGEEIKAHKTILAAGSKYFRTLFTTPM------------------------ 60

Query: 133 YFQAMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMVVASFLQMQ 192
                    +       + +Q  D  A++ ++ ++Y+  ++  S NV  ++  A +L + 
Sbjct: 61  ---------IIRDLVTRVNLQMFDKDAVKNIVQYLYNRHIS--SMNVIDVLKCADYLLID 109

Query: 193 KVADACADFLK 203
            +   C  ++K
Sbjct: 110 DLVTDCESYIK 120



 Score = 33.1 bits (75), Expect = 0.63
 Identities = 29/112 (25%), Positives = 48/112 (42%), Gaps = 7/112 (6%)

Query: 623 YDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGYD-GAIFLQSVEMYDPITDEWKMIA 681
           Y+  T E++ +  +         A ++N+I + GGY+     L  V     I  E K+  
Sbjct: 277 YNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLNKVY---KINIENKIHV 333

Query: 682 SMNVM---RSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPM 730
            +  M   R R +L      ++AIGG +G +   T+E Y    D W  +  M
Sbjct: 334 ELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDM 385



 Score = 31.5 bits (71), Expect = 1.9
 Identities = 33/176 (18%), Positives = 66/176 (37%), Gaps = 23/176 (13%)

Query: 478 TVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKNRLYAFGG------YNGSERLSTVE---- 527
           T+E +     +W+M   M +  S  G+ V+   +Y  GG      Y     +++++    
Sbjct: 368 TIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEED 427

Query: 528 --------EFDPVRRVWNKVSPMCFKRSAVGAAALNDKLY-VCGGYDGVSSLNTVECYEP 578
                    +D V  +W  +          G  +  D +Y VC   D  +    +  Y  
Sbjct: 428 THSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNT 487

Query: 579 DK-DQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTSV 633
           +  + W ++ + +   SA   I  D+ +  L  ++   + D+   Y   T EW  +
Sbjct: 488 NTYNGWELITTTESRLSALHTILHDNTIMMLHCYESYMLQDTFNVY---TYEWNHI 540


>gnl|CDD|165153 PHA02790, PHA02790, Kelch-like protein; Provisional.
          Length = 480

 Score = 63.1 bits (153), Expect = 3e-10
 Identities = 57/219 (26%), Positives = 92/219 (42%), Gaps = 23/219 (10%)

Query: 511 LYAFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGGYDGVSSL 570
           +Y  GG+  +E  +     + +   W  + PM   R        N+KLYV GG    +S 
Sbjct: 274 VYLIGGWMNNEIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPNPTS- 332

Query: 571 NTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYD-PKTDE 629
             VE +      W  + S+ K R    V + ++ +Y +GGH   S  D+   Y  P  D+
Sbjct: 333 --VERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGH---SETDTTTEYLLPNHDQ 387

Query: 630 W----TSVKPMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNV 685
           W    ++  P   K C L       ++++ G        ++ E Y   ++ W +I     
Sbjct: 388 WQFGPSTYYPHY-KSCAL---VFGRRLFLVG--------RNAEFYCESSNTWTLIDDPIY 435

Query: 686 MRSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSW 724
            R    L+    KL  IGG+   S + T+EVY+  T SW
Sbjct: 436 PRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSW 474



 Score = 60.1 bits (145), Expect = 3e-09
 Identities = 88/467 (18%), Positives = 183/467 (39%), Gaps = 72/467 (15%)

Query: 222 AMEALINFAYSGRVTIHSQNVQSLMVVASFLQMQKVADACADFLKKRFHPNN-----VLG 276
           ++ +++ ++Y+G+V I S NV +L+  +    ++ +   C +F+ + F         ++G
Sbjct: 73  SLTSIVIYSYTGKVYIDSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEYCVECYMMG 132

Query: 277 IRQFADTLNCLQLSEAADKYVQQYFHEVS--MSDEFIGLGVNEVNDIVKRSELHLMSEEQ 334
           I      L C         ++ ++F E+   + D F  L +     I++  EL++  E+ 
Sbjct: 133 IEYGLSNLLC-----HTKDFIAKHFLELEDDIIDNFDYLSMKL---ILESDELNVPDEDY 184

Query: 335 VFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLADRVATEALIRSSHECRDLVDEA 394
           V + V++W     +     L  +   +R   LSP             R  +  + ++D  
Sbjct: 185 VVDFVIKWYMKRRNRLGNLLLLIKNVIRSNYLSP-------------RGINNVKWILDCT 231

Query: 395 RDFHL--MPERRFLLAGEKTTPRRCNYVMGHIFAVGGLTKAGDSLSTVEVFDPLVGRWQM 452
           + FH    P + +     +  P   + ++  IF +   T  G+ +        L+G W  
Sbjct: 232 KIFHCDKQPRKSYKYPFIEY-PMNMDQII-DIFHMCTSTHVGEVVY-------LIGGWMN 282

Query: 453 AEEETLSNAVISTKSCLTKAGDSLSTVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKNRLY 512
            E   + N  I+  + ++                  W     M+  R         N+LY
Sbjct: 283 NE---IHNNAIAV-NYISN----------------NWIPIPPMNSPRLYASGVPANNKLY 322

Query: 513 AFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGGYDGVSSLNT 572
             GG       ++VE +      W  +  +   R     A++N+ +YV GG+    +  T
Sbjct: 323 VVGGLPNP---TSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETDT--T 377

Query: 573 VECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTS 632
            E   P+ DQW+   S          + F   ++ +G         + E Y   ++ WT 
Sbjct: 378 TEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVG--------RNAEFYCESSNTWTL 429

Query: 633 VKPMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKM 679
           +   +  R    +  ++NK+ + GG+    ++ ++E+Y+  T  W +
Sbjct: 430 IDDPIYPRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSWNI 476



 Score = 36.6 bits (84), Expect = 0.056
 Identities = 22/116 (18%), Positives = 58/116 (50%), Gaps = 6/116 (5%)

Query: 108 MAESKQREITMQGIDAVIVLAATI-----PYFQAMFTSDMAESKQREITM-QGIDAVAME 161
           ++ +K+ +  ++ I   I++ +TI     PYF+        ++K     +   +D  ++ 
Sbjct: 16  LSMTKKFKTIIEAIGGNIIVNSTILKKLSPYFRTHLRQKYTKNKDPVTRVCLDLDIHSLT 75

Query: 162 ALINFVYSGRVTIHSQNVQSLMVVASFLQMQKVADACADFLKKRFHPNNVLDYYVL 217
           +++ + Y+G+V I S NV +L+  +    ++ +   C +F+ + F     ++ Y++
Sbjct: 76  SIVIYSYTGKVYIDSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEYCVECYMM 131


>gnl|CDD|177844 PLN02193, PLN02193, nitrile-specifier protein.
          Length = 470

 Score = 60.4 bits (146), Expect = 2e-09
 Identities = 61/257 (23%), Positives = 100/257 (38%), Gaps = 59/257 (22%)

Query: 484 PLVGRWQMAEAMSM---LRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNKVS 540
            L+G+W   E       LR   G+A + N++Y+FGG           EF P +       
Sbjct: 148 KLLGKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGG-----------EFTPNQ------- 189

Query: 541 PMCFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQK--HRSAGGV 598
                        ++  LYV               ++ +   W I  +     H S  GV
Sbjct: 190 ------------PIDKHLYV---------------FDLETRTWSISPATGDVPHLSCLGV 222

Query: 599 --IAFDSYVYALGGHDGLSIFDSVERYDPKTDEW---TSVKPMLTKRCRLGVAALNNKIY 653
             ++  S +Y  GG D    ++    +D  T+EW   T V+   T R    +AA    +Y
Sbjct: 223 RMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVY 282

Query: 654 VCGGYDGAIFLQSVEMYDPITDEWKMIASMN---VMRSRVALVANMGKLWAIGGYDGVSN 710
           V GG      L++++ Y+ +  +W   ++      +R    L    GK+W + G++G   
Sbjct: 283 VFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGCE- 341

Query: 711 LPTVEVYDPSTDSWAFV 727
           +  V  YDP  D W  V
Sbjct: 342 VDDVHYYDPVQDKWTQV 358



 Score = 58.0 bits (140), Expect = 9e-09
 Identities = 50/190 (26%), Positives = 80/190 (42%), Gaps = 14/190 (7%)

Query: 481 VFDPLVGRWQMAEAMSMLR--SRVGVAVMK--NRLYAFGGYNGSERLSTVEEFDPVRRVW 536
           VFD     W ++ A   +   S +GV ++   + LY FGG + S + +    FD     W
Sbjct: 197 VFDLETRTWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEW 256

Query: 537 NKVSPM---CFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKH- 592
             ++P+      RS    AA  + +YV GG    + L T++ Y     +W    +     
Sbjct: 257 KLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSF 316

Query: 593 --RSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGV---AA 647
             R   G+      V+ + G +G  + D V  YDP  D+WT V+    +     V   AA
Sbjct: 317 SIRGGAGLEVVQGKVWVVYGFNGCEV-DDVHYYDPVQDKWTQVETFGVRPSERSVFASAA 375

Query: 648 LNNKIYVCGG 657
           +   I + GG
Sbjct: 376 VGKHIVIFGG 385



 Score = 51.9 bits (124), Expect = 7e-07
 Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 13/151 (8%)

Query: 482 FDPLVGRWQM---AEAMSMLRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNK 538
           FD     W++    E     RS   +A  +  +Y FGG + + RL T++ ++ V + W  
Sbjct: 249 FDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDKKWFH 308

Query: 539 VSP---MCFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSA 595
            S        R   G   +  K++V  G++G   ++ V  Y+P +D+W  V++     S 
Sbjct: 309 CSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGC-EVDDVHYYDPVQDKWTQVETFGVRPSE 367

Query: 596 GGVIAFDSYVYALGGHDGLSIFDSVERYDPK 626
             V A      A+G H  + IF      DP 
Sbjct: 368 RSVFA----SAAVGKH--IVIFGGEIAMDPL 392



 Score = 35.7 bits (82), Expect = 0.079
 Identities = 38/164 (23%), Positives = 71/164 (43%), Gaps = 16/164 (9%)

Query: 593 RSAGGVIAFDSYVYALGGH--DGLSIFDSVERYDPKTDEWTSVKP----MLTKRCRLGV- 645
           R + G+    + +Y+ GG       I   +  +D +T  W S+ P    +    C LGV 
Sbjct: 166 RCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDLETRTW-SISPATGDVPHLSC-LGVR 223

Query: 646 -AALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMN---VMRSRVALVANMGKLWA 701
             ++ + +YV GG D +        +D  T+EWK++  +      RS  ++ A+   ++ 
Sbjct: 224 MVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYV 283

Query: 702 IGGYDGVSNLPTVEVYDPSTDSWAFVAP---MCAHEGGVGVGVI 742
            GG    + L T++ Y+     W   +      +  GG G+ V+
Sbjct: 284 FGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVV 327


>gnl|CDD|206134 pfam13964, Kelch_6, Kelch motif. 
          Length = 50

 Score = 47.8 bits (114), Expect = 2e-07
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 640 RCRLGVAALNNKIYVCGGY-DGAIFLQSVEMYDPITDEWKMIASMNVMR 687
           R      ++  KIYV GGY +G+     V +YDP T  W+ +  +   R
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50



 Score = 44.7 bits (106), Expect = 2e-06
 Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 1/50 (2%)

Query: 592 HRSAGGVIAFDSYVYALGG-HDGLSIFDSVERYDPKTDEWTSVKPMLTKR 640
            R+    ++    +Y  GG  +G    + V  YDP+T  W  + P+ T R
Sbjct: 1   PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50



 Score = 38.6 bits (90), Expect = 4e-04
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 1/47 (2%)

Query: 546 RSAVGAAALNDKLYVCGGY-DGVSSLNTVECYEPDKDQWRIVKSMQK 591
           R+   A ++  K+YV GGY +G  + N V  Y+P+   W  +  +  
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPT 48



 Score = 38.2 bits (89), Expect = 5e-04
 Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 499 RSRVGVAVMKNRLYAFGGY-NGSERLSTVEEFDPVRRVWNKVSPM 542
           R+      +  ++Y FGGY NGS+  + V  +DP    W K+ P+
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPL 46



 Score = 37.8 bits (88), Expect = 7e-04
 Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 686 MRSRVALVANMGKLWAIGGYDGVSN-LPTVEVYDPSTDSWAFVAPM 730
            R+  + V+  GK++  GGY   S     V VYDP T SW  + P+
Sbjct: 1   PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPL 46



 Score = 35.5 bits (82), Expect = 0.004
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 422 GHIFAVGGLTKAGDSLSTVEVFDPLVGRWQM 452
           G I+  GG +    + + V V+DP  G W+ 
Sbjct: 12  GKIYVFGGYSNGSKASNKVLVYDPETGSWEK 42


>gnl|CDD|237617 PRK14131, PRK14131, N-acetylneuraminic acid mutarotase;
           Provisional.
          Length = 376

 Score = 52.3 bits (126), Expect = 5e-07
 Identities = 41/136 (30%), Positives = 58/136 (42%), Gaps = 21/136 (15%)

Query: 541 PMCFKRSAVGAAAL-NDKLYVCGGYDGVS--SLNTVECYEPDKDQWRIVKSMQKHRSAGG 597
           P+ FK    G  A+ N+ +YV  G  G S   L+      P K  W  + +         
Sbjct: 26  PVPFKN---GTGAIDNNTVYVGLGSAGTSWYKLDL---NAPSK-GWTKIAAFPGGPREQA 78

Query: 598 VIAF-DSYVYALGG-----HDG-LSIFDSVERYDPKTDEWTSVKPMLTKRCRLG---VAA 647
           V AF D  +Y  GG      +G   +FD V +YDPKT+ W  +    +     G   V+ 
Sbjct: 79  VAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKL-DTRSPVGLAGHVAVSL 137

Query: 648 LNNKIYVCGGYDGAIF 663
            N K Y+ GG +  IF
Sbjct: 138 HNGKAYITGGVNKNIF 153



 Score = 37.3 bits (87), Expect = 0.026
 Identities = 21/78 (26%), Positives = 30/78 (38%), Gaps = 16/78 (20%)

Query: 499 RSRVGVAVMKNRLYAFGGYNGSER------LSTVEEFDPVRRVWNKV---SPMCFKRSAV 549
           R +   A +  +LY FGG   +           V ++DP    W K+   SP        
Sbjct: 75  REQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSP----VGLA 130

Query: 550 GAAAL---NDKLYVCGGY 564
           G  A+   N K Y+ GG 
Sbjct: 131 GHVAVSLHNGKAYITGGV 148



 Score = 33.8 bits (78), Expect = 0.27
 Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 12/90 (13%)

Query: 644 GVAAL-NNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASM-NVMRSRVALVANMGKLWA 701
           G  A+ NN +YV  G  G  +   +++  P +  W  IA+     R +       GKL+ 
Sbjct: 32  GTGAIDNNTVYVGLGSAGTSWY-KLDLNAP-SKGWTKIAAFPGGPREQAVAAFIDGKLYV 89

Query: 702 IGGYDGVSN-------LPTVEVYDPSTDSW 724
            GG  G +N          V  YDP T+SW
Sbjct: 90  FGGI-GKTNSEGSPQVFDDVYKYDPKTNSW 118


>gnl|CDD|177814 PLN02153, PLN02153, epithiospecifier protein.
          Length = 341

 Score = 48.8 bits (116), Expect = 5e-06
 Identities = 53/238 (22%), Positives = 97/238 (40%), Gaps = 38/238 (15%)

Query: 499 RSRVGVAVMKNRLYAFGG-YNGSERLST-VEEFDPVRRVWNKVSP-------MCFKRSAV 549
           R   G+AV+ ++LY+FGG    +E +   +  FD     W+            C     V
Sbjct: 23  RCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISCL---GV 79

Query: 550 GAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKH-----RSAGGVIAFDSY 604
              A+  KLY+ GG D     +    Y+  K++W  +  + +      R+   + + +++
Sbjct: 80  RMVAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENH 139

Query: 605 VYALGG--HDGL----SIFDSVERYDPKTDEWTSVK---PMLTKRCRLGVAALNNKIYVC 655
           VY  GG    GL      F ++E Y+    +W  +        KR   G A +  KI+V 
Sbjct: 140 VYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVV 199

Query: 656 GGYDGAIF--------LQSVEMYDPITDEWKMIASMNVMRSRVALVANMGKLWAIGGY 705
            G+  +I           +V+ +DP + +W  + +     S  ++ A+      +G Y
Sbjct: 200 YGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSVFAHA----VVGKY 253



 Score = 43.1 bits (101), Expect = 3e-04
 Identities = 32/151 (21%), Positives = 59/151 (39%), Gaps = 19/151 (12%)

Query: 593 RSAGGVIAFDSYVYALGGH--DGLSIFDSVERYDPKTDEWTSVKP---MLTKRCRLGV-- 645
           R + G+      +Y+ GG       I   +  +D  T  W+       +    C LGV  
Sbjct: 23  RCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISC-LGVRM 81

Query: 646 AALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNVM-----RSRVALVANMGKLW 700
            A+  K+Y+ GG D          YD + +EW  +  ++       R+  ++ ++   ++
Sbjct: 82  VAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVY 141

Query: 701 AIGGYD--GVSNLP----TVEVYDPSTDSWA 725
             GG    G+   P    T+E Y+ +   W 
Sbjct: 142 VFGGVSKGGLMKTPERFRTIEAYNIADGKWV 172



 Score = 38.8 bits (90), Expect = 0.009
 Identities = 50/210 (23%), Positives = 84/210 (40%), Gaps = 37/210 (17%)

Query: 481 VFDPLVGRWQMAEA------MSMLRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRR 534
           VFD     W +A A      +S L  R+ VAV   +LY FGG +     S    +D V+ 
Sbjct: 54  VFDFNTHTWSIAPANGDVPRISCLGVRM-VAV-GTKLYIFGGRDEKREFSDFYSYDTVKN 111

Query: 535 VWNKVSPMCFK-----RSAVGAAALNDKLYVCGGYDGVSSLN------TVECYEPDKDQW 583
            W  ++ +  +     R+    A+  + +YV GG      +       T+E Y     +W
Sbjct: 112 EWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKW 171

Query: 584 RIV----KSMQKHRSAGGVIA---------FDSYVYALGGHDGLSIFDSVERYDPKTDEW 630
             +    ++ +K   AG  +          F + +   G  D  S  ++V+ +DP + +W
Sbjct: 172 VQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYES--NAVQFFDPASGKW 229

Query: 631 TSVKPMLTK---RCRLGVAALNNKIYVCGG 657
           T V+    K   R     A +   I + GG
Sbjct: 230 TEVETTGAKPSARSVFAHAVVGKYIIIFGG 259



 Score = 34.6 bits (79), Expect = 0.19
 Identities = 29/104 (27%), Positives = 44/104 (42%), Gaps = 9/104 (8%)

Query: 640 RCRLGVAALNNKIYVCGG--YDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVA-----L 692
           RC  G+A + +K+Y  GG         + + ++D  T  W  IA  N    R++     +
Sbjct: 23  RCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWS-IAPANGDVPRISCLGVRM 81

Query: 693 VANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPMCAHEGG 736
           VA   KL+  GG D          YD   + W F+  +   EGG
Sbjct: 82  VAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKL-DEEGG 124


>gnl|CDD|116261 pfam07646, Kelch_2, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that Drosophila
           ring canal kelch protein is related to Galactose Oxidase
           for which a structure has been solved. The kelch motif
           forms a beta sheet. Several of these sheets associate to
           form a beta propeller structure as found in pfam00064,
           pfam00400 and pfam00415.
          Length = 48

 Score = 41.6 bits (98), Expect = 3e-05
 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 2/47 (4%)

Query: 499 RSRVGVAVMKNRLYAFGGY--NGSERLSTVEEFDPVRRVWNKVSPMC 543
           R      V+  +LY  GG    G    S +   DP   VW ++  + 
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPALP 48



 Score = 40.8 bits (96), Expect = 6e-05
 Identities = 11/46 (23%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 546 RSAVGAAALNDKLYVCGGY--DGVSSLNTVECYEPDKDQWRIVKSM 589
           R    +  +  KLYV GG    G  S + +   +P+ + W  + ++
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47



 Score = 33.9 bits (78), Expect = 0.018
 Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 2/41 (4%)

Query: 640 RCRLGVAALNNKIYVCGGY--DGAIFLQSVEMYDPITDEWK 678
           R       +  K+YV GG    G +    + + DP T+ W 
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWT 42



 Score = 32.7 bits (75), Expect = 0.046
 Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 2/46 (4%)

Query: 593 RSAGGVIAFDSYVYALGG--HDGLSIFDSVERYDPKTDEWTSVKPM 636
           R     +     +Y +GG    G      +   DP+T+ WT +  +
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47



 Score = 32.0 bits (73), Expect = 0.087
 Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 2/47 (4%)

Query: 687 RSRVALVANMGKLWAIGGY--DGVSNLPTVEVYDPSTDSWAFVAPMC 731
           R   A V   GKL+ +GG    G  +   + V DP T+ W  +  + 
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPALP 48


>gnl|CDD|211835 TIGR03548, mutarot_permut, cyclically-permuted mutarotase family
           protein.  Members of this protein family show
           essentially full-length homology, cyclically permuted,
           to YjhT from Escherichia coli. YjhT was shown to act as
           a mutarotase for sialic acid, and by this ability to be
           able to act as a virulence factor. Members of the YjhT
           family (TIGR03547) and this cyclically-permuted family
           have multiple repeats of the beta-propeller-forming
           Kelch repeat.
          Length = 323

 Score = 44.8 bits (106), Expect = 1e-04
 Identities = 39/170 (22%), Positives = 65/170 (38%), Gaps = 12/170 (7%)

Query: 509 NRLYAFGGYNGSERLSTVEEF---DPVRRVWNK-VSPMCFKRSAVGAAALNDKLYVCGGY 564
           N +Y  GG N SE  S+V      +    +  + +  + F      A   + KLYV GG 
Sbjct: 73  NGIYYIGGSNSSESFSSVYRITLDESKEALIIETLPSLPFAMDNGSATYKDGKLYVGGGN 132

Query: 565 DGVSSLNTVECYEPDKDQWRIVKSMQ-KHRSAGGVIAFDSYVYALGGHDGLSIFDSVERY 623
                 N   C+  +  +W  +       R     +   + +Y  GG D    +    +Y
Sbjct: 133 ANGKPSNKFYCFNLETQEWEELPDFPGAPRVQPVCVKLQNELYVFGGGDN-IAYTDGYKY 191

Query: 624 DPKTDEWTSVKPMLTK---RCRLGVAAL---NNKIYVCGGYDGAIFLQSV 667
            PKT  W  V   L+       LG A++    + +   GG++  +F  +V
Sbjct: 192 SPKTGTWEKVADPLSDGEPISLLGAASIAINESLLLCIGGFNYDVFNDAV 241


>gnl|CDD|234253 TIGR03547, muta_rot_YjhT, mutatrotase, YjhT family.  Members of
           this protein family contain multiple copies of the
           beta-propeller-forming Kelch repeat. All are full-length
           homologs to YjhT of Escherichia coli, which has been
           identified as a mutarotase for sialic acid. This protein
           improves bacterial ability to obtain host sialic acid,
           and thus serves as a virulence factor. Some bacteria
           carry what appears to be a cyclically permuted homolog
           of this protein.
          Length = 346

 Score = 43.9 bits (104), Expect = 2e-04
 Identities = 39/134 (29%), Positives = 56/134 (41%), Gaps = 17/134 (12%)

Query: 541 PMCFKRSAVGAAAL-NDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVI 599
           P+ FK    G  A+  DK+YV  G  G S    ++  +P K  W+ +           V 
Sbjct: 5   PVGFKN---GTGAIIGDKVYVGLGSAGTS-WYKLDLKKPSK-GWQKIADFPGGPRNQAVA 59

Query: 600 AF-DSYVYALGG-----HDG-LSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAAL---N 649
           A  D  +Y  GG      +G   +FD V RYDPK + W  +    +    LG +     N
Sbjct: 60  AAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKL-DTRSPVGLLGASGFSLHN 118

Query: 650 NKIYVCGGYDGAIF 663
            + Y  GG +  IF
Sbjct: 119 GQAYFTGGVNKNIF 132



 Score = 35.8 bits (83), Expect = 0.066
 Identities = 48/206 (23%), Positives = 76/206 (36%), Gaps = 38/206 (18%)

Query: 469 LTKAGDSLSTVEVFDPLVGRWQ-MAEAMSMLRSRVGVAVMKNRLYAFGGY--NGSER--- 522
           L  AG S   +++  P  G WQ +A+     R++   A +  +LY FGG     SE    
Sbjct: 24  LGSAGTSWYKLDLKKPSKG-WQKIADFPGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQ 82

Query: 523 -LSTVEEFDPVRRVWNKV---SPMCFKRSAVGAAAL---NDKLYVCGGYDGVSSLNTVEC 575
               V  +DP +  W K+   SP       +GA+     N + Y  GG +     N  + 
Sbjct: 83  VFDDVYRYDPKKNSWQKLDTRSP----VGLLGASGFSLHNGQAYFTGGVN----KNIFDG 134

Query: 576 YEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIF--DSVERYDPKTDEWTSV 633
           Y  D     +        +       D  + A         F   +V  YDP T++W ++
Sbjct: 135 YFAD-----LSA------ADKDSEPKDKLIAAYFSQPPEDYFWNKNVLSYDPSTNQWRNL 183

Query: 634 --KPMLTKRCRLGVAALNNKIYVCGG 657
              P L       +    NK+ +  G
Sbjct: 184 GENPFLG-TAGSAIVHKGNKLLLING 208



 Score = 33.8 bits (78), Expect = 0.29
 Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 10/89 (11%)

Query: 644 GVAAL-NNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASM-NVMRSRVALVANMGKLWA 701
           G  A+  +K+YV  G  G  + + +++  P    W+ IA      R++    A  GKL+ 
Sbjct: 11  GTGAIIGDKVYVGLGSAGTSWYK-LDLKKPSKG-WQKIADFPGGPRNQAVAAAIDGKLYV 68

Query: 702 IGGY--DGVSNLPTVE----VYDPSTDSW 724
            GG         P V      YDP  +SW
Sbjct: 69  FGGIGKANSEGSPQVFDDVYRYDPKKNSW 97



 Score = 32.7 bits (75), Expect = 0.66
 Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 16/73 (21%)

Query: 646 AALNNKIYVCGGYDGA------IFLQSVEMYDPITDEWKMIASMNVMRSRVALVA----- 694
           AA++ K+YV GG   A           V  YDP  + W+ + +    RS V L+      
Sbjct: 60  AAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDT----RSPVGLLGASGFS 115

Query: 695 -NMGKLWAIGGYD 706
            + G+ +  GG +
Sbjct: 116 LHNGQAYFTGGVN 128


>gnl|CDD|222113 pfam13415, Kelch_3, Galactose oxidase, central domain. 
          Length = 48

 Score = 38.4 bits (90), Expect = 3e-04
 Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 649 NNKIYVCGGYDG-AIFLQSVEMYDPITDEWKMIASM 683
             KIYV GG       L  + +YD  T+ W+ +  +
Sbjct: 1   GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDL 36



 Score = 37.3 bits (87), Expect = 0.001
 Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 1/48 (2%)

Query: 508 KNRLYAFGGYNGS-ERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAAL 554
             ++Y FGG      RL+ +  +D     W K+  +   R+   A  +
Sbjct: 1   GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48



 Score = 31.1 bits (71), Expect = 0.15
 Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 555 NDKLYVCGGYDGVS-SLNTVECYEPDKDQWRIVKSM 589
             K+YV GG       LN +  Y+ D + W  +  +
Sbjct: 1   GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDL 36



 Score = 30.3 bits (69), Expect = 0.29
 Identities = 8/41 (19%), Positives = 14/41 (34%), Gaps = 1/41 (2%)

Query: 602 DSYVYALGGHDGLSIF-DSVERYDPKTDEWTSVKPMLTKRC 641
              +Y  GG        + +  YD  T+ W  +  +   R 
Sbjct: 1   GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRA 41



 Score = 29.9 bits (68), Expect = 0.38
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 1/35 (2%)

Query: 697 GKLWAIGGYDGVSN-LPTVEVYDPSTDSWAFVAPM 730
           GK++  GG       L  + VYD  T++W  +  +
Sbjct: 2   GKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDL 36


>gnl|CDD|225597 COG3055, COG3055, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 381

 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 33/128 (25%), Positives = 46/128 (35%), Gaps = 21/128 (16%)

Query: 550 GAAAL-NDKLYVCGGYDGVS--SLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVY 606
           GA AL  D +YV  G  G +   L+     +P K   +I       R+          +Y
Sbjct: 40  GAGALIGDTVYVGLGSAGTAFYVLDL---KKPGKGWTKIADFPGGARNQAVAAVIGGKLY 96

Query: 607 ALGGH-----DGLSIFDSVERYDPKTDEWT---SVKPML---TKRCRLGVAALNNKIYVC 655
             GG+         +F+   RYDP T+ W    +  P          L       KIY  
Sbjct: 97  VFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLN----GTKIYFF 152

Query: 656 GGYDGAIF 663
           GG +  IF
Sbjct: 153 GGVNQNIF 160



 Score = 36.2 bits (84), Expect = 0.057
 Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 9/88 (10%)

Query: 644 GVAAL-NNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASM-NVMRSRVALVANMGKLWA 701
           G  AL  + +YV  G  G  F   +++  P    W  IA      R++       GKL+ 
Sbjct: 40  GAGALIGDTVYVGLGSAGTAFY-VLDLKKP-GKGWTKIADFPGGARNQAVAAVIGGKLYV 97

Query: 702 IGGY-DGVSNLPTVE----VYDPSTDSW 724
            GGY   VS+ P V      YDPST+SW
Sbjct: 98  FGGYGKSVSSSPQVFNDAYRYDPSTNSW 125



 Score = 35.0 bits (81), Expect = 0.12
 Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 9/69 (13%)

Query: 505 AVMKNRLYAFGGYNGSER-----LSTVEEFDPVRRVWNKV---SPMCFKRSAVGAAALND 556
           AV+  +LY FGGY  S        +    +DP    W+K+   SP      A   +    
Sbjct: 89  AVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLV-GASTFSLNGT 147

Query: 557 KLYVCGGYD 565
           K+Y  GG +
Sbjct: 148 KIYFFGGVN 156



 Score = 34.7 bits (80), Expect = 0.17
 Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)

Query: 646 AALNNKIYVCGGYDGA-----IFLQSVEMYDPITDEWKMIASMNVMRS--RVALVANMGK 698
           A +  K+YV GGY  +             YDP T+ W  + + +            N  K
Sbjct: 89  AVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLNGTK 148

Query: 699 LWAIGGYD 706
           ++  GG +
Sbjct: 149 IYFFGGVN 156


>gnl|CDD|205596 pfam13418, Kelch_4, Galactose oxidase, central domain. 
          Length = 49

 Score = 37.6 bits (88), Expect = 8e-04
 Identities = 13/40 (32%), Positives = 18/40 (45%), Gaps = 1/40 (2%)

Query: 504 VAVMKNRLYAFGGYNGS-ERLSTVEEFDPVRRVWNKVSPM 542
            ++   RLY FGG N     LS V  FD     W ++  +
Sbjct: 8   TSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47



 Score = 30.3 bits (69), Expect = 0.26
 Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 602 DSYVYALGGHDGL-SIFDSVERYDPKTDEWTSVKPM 636
           D  +Y  GG +   S+   V  +D  T+ WT +  +
Sbjct: 12  DGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47



 Score = 29.5 bits (67), Expect = 0.54
 Identities = 9/36 (25%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 649 NNKIYVCGGYDG-AIFLQSVEMYDPITDEWKMIASM 683
           + ++Y+ GG +     L  V ++D  T+ W  + S+
Sbjct: 12  DGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47



 Score = 28.8 bits (65), Expect = 0.97
 Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 697 GKLWAIGGYDG-VSNLPTVEVYDPSTDSWAFVAPM 730
           G+L+  GG +   S L  V V+D ST++W  +  +
Sbjct: 13  GRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47


>gnl|CDD|224531 COG1615, COG1615, Uncharacterized conserved protein [Function
           unknown].
          Length = 885

 Score = 33.2 bits (76), Expect = 0.69
 Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 496 SMLRSRVGVAVMKNRLYAF-GGYNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
           S+     GV  ++N + A    Y+G+      +  DPV   W K+ P  FK
Sbjct: 573 SLPTDANGVNYIRNSVKALVDAYDGTVTFYQSDLNDPVINAWMKIFPGTFK 623


>gnl|CDD|132110 TIGR03066, Gem_osc_para_1, Gemmata obscuriglobus paralogous family
           TIGR03066.  This model represents an uncharacterized
           paralogous family in Gemmata obscuriglobus UQM 2246, a
           member of the Planctomycetes. This family shows sequence
           similarity to TIGR03067, which is also found in Gemmata
           obscuriglobus as well as in a few other species
           [Hypothetical proteins, Conserved].
          Length = 111

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 5/36 (13%)

Query: 425 FAVGGLTKAGDSLSTVEVFDP--LVGRWQMAEEETL 458
            A+ G  +A D     E  D   LVG+W+ ++ +  
Sbjct: 3   LALAGGGRADDK---DEKIDAKKLVGKWETSKTKDD 35


>gnl|CDD|221245 pfam11822, DUF3342, Domain of unknown function (DUF3342).  This
           family of proteins are functionally uncharacterized.
           This family is found in bacteria. This presumed domain
           is typically between 170 to 303 amino acids in length.
           The N-terminal half of this family is a BTB-like domain.
          Length = 302

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 178 NVQSLMVVASFLQMQKVADACADFLKKR 205
           NV S+++ +SFLQM+++ D C  +    
Sbjct: 79  NVVSILLSSSFLQMEQLVDECLMYCHAH 106



 Score = 31.6 bits (72), Expect = 1.4
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 241 NVQSLMVVASFLQMQKVADACADFLKKR 268
           NV S+++ +SFLQM+++ D C  +    
Sbjct: 79  NVVSILLSSSFLQMEQLVDECLMYCHAH 106


>gnl|CDD|235047 PRK02509, PRK02509, hypothetical protein; Provisional.
          Length = 973

 Score = 31.9 bits (73), Expect = 1.6
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 517 YNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
           YNG  +    +  DP+ + W+K+ P  FK
Sbjct: 724 YNGDVQFYVADPNDPLIQTWSKIFPQLFK 752


>gnl|CDD|148056 pfam06223, Phage_tail_T, Minor tail protein T.  Minor tail protein
           T is located at the distal end and is involved in the
           assembly of the initiator complex for tail
           polymerisation.
          Length = 103

 Score = 28.7 bits (64), Expect = 3.9
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 278 RQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGL 313
           R+    ++  +L E AD +V   F +V +  EF  L
Sbjct: 12  RRMLAEMSATELGEWADHFVTHSFSDVLLDAEFATL 47


>gnl|CDD|235525 PRK05595, PRK05595, replicative DNA helicase; Provisional.
          Length = 444

 Score = 30.2 bits (68), Expect = 4.0
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 8/54 (14%)

Query: 111 SKQREITMQ-GIDAVIVLAATIPYFQAMFTSDMAESKQREIT--MQGIDAVAME 161
           SK R + ++ GID ++     I Y Q M     +ES+Q+E++   + I A+A E
Sbjct: 301 SKCRRLKIEHGIDMIL-----IDYLQLMSGGKGSESRQQEVSEISRSIKALAKE 349


>gnl|CDD|132488 TIGR03447, mycothiol_MshC, cysteine--1-D-myo-inosityl
           2-amino-2-deoxy-alpha-D-glucopyranoside ligase.  Members
           of this protein family are MshC,
           l-cysteine:1-D-myo-inosityl
           2-amino-2-deoxy-alpha-D-glucopyranoside ligase, an
           enzyme that uses ATP to ligate a Cys residue to a
           mycothiol precursor molecule, in the second to last step
           in mycothiol biosynthesis. This enzyme shows
           considerable homology to Cys--tRNA ligases, and many
           instances are misannotated as such. Mycothiol is found
           in Mycobacterium tuberculosis, Corynebacterium
           glutamicum, Streptomyces coelicolor, and various other
           members of the Actinobacteria. Mycothiol is an analog to
           glutathione [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Glutathione and analogs].
          Length = 411

 Score = 30.1 bits (68), Expect = 4.6
 Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 11/65 (16%)

Query: 359 AAVRLPLLSPHYLADRVATEALIRSSHECRDL---------VDEARDFHLMPERRFLLAG 409
           AA+RL LL+ HY  DR  T+A++  +                 +A D  L+   R  LA 
Sbjct: 310 AAIRLGLLAGHYRQDRDWTDAVLAEAEARLARWRAALALPDAPDATD--LIARLRQHLAN 367

Query: 410 EKTTP 414
           +  TP
Sbjct: 368 DLDTP 372


>gnl|CDD|217678 pfam03699, UPF0182, Uncharacterized protein family (UPF0182).  This
           family contains uncharacterized integral membrane
           proteins.
          Length = 771

 Score = 30.2 bits (69), Expect = 5.5
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 517 YNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
           Y+G+     V+  DP+ + W K+ P  FK
Sbjct: 603 YDGTVDFYVVDPEDPILKTWQKIFPGLFK 631


>gnl|CDD|200475 cd11336, AmyAc_MTSase, Alpha amylase catalytic domain found in
           maltooligosyl trehalose synthase (MTSase).
           Maltooligosyl trehalose synthase (MTSase) domain. MTSase
           and maltooligosyl trehalose trehalohydrolase (MTHase)
           work together to produce trehalose. MTSase is
           responsible for converting the alpha-1,4-glucosidic
           linkage to an alpha,alpha-1,1-glucosidic linkage at the
           reducing end of the maltooligosaccharide through an
           intramolecular transglucosylation reaction, while MTHase
           hydrolyzes the penultimate alpha-1,4 linkage of the
           reducing end, resulting in the release of trehalose. The
           Alpha-amylase family comprises the largest family of
           glycoside hydrolases (GH), with the majority of enzymes
           acting on starch, glycogen, and related oligo- and
           polysaccharides. These proteins catalyze the
           transformation of alpha-1,4 and alpha-1,6 glucosidic
           linkages with retention of the anomeric center. The
           protein is described as having 3 domains: A, B, C. A is
           a (beta/alpha) 8-barrel; B is a loop between the beta 3
           strand and alpha 3 helix of A; C is the C-terminal
           extension characterized by a Greek key. The majority of
           the enzymes have an active site cleft found between
           domains A and B where a triad of catalytic residues
           (Asp, Glu and Asp) performs catalysis. Other members of
           this family have lost the catalytic activity as in the
           case of the human 4F2hc, or only have 2 residues that
           serve as the catalytic nucleophile and the acid/base,
           such as Thermus A4 beta-galactosidase with 2 Glu
           residues (GH42) and human alpha-galactosidase with 2 Asp
           residues (GH31). The family members are quite extensive
           and include: alpha amylase, maltosyltransferase,
           cyclodextrin glycotransferase, maltogenic amylase,
           neopullulanase, isoamylase, 1,4-alpha-D-glucan
           maltotetrahydrolase, 4-alpha-glucotransferase,
           oligo-1,6-glucosidase, amylosucrase, sucrose
           phosphorylase, and amylomaltase.
          Length = 660

 Score = 29.8 bits (68), Expect = 6.2
 Identities = 31/117 (26%), Positives = 41/117 (35%), Gaps = 35/117 (29%)

Query: 323 KRSE-----LHLMSE--EQVFEAVMRWVKHNASERAPSLPR----------LLAAVRLPL 365
           KR E     L ++SE  E+  EAV RW + NA  R    P           L+ A  L  
Sbjct: 449 KRGEDVRARLAVLSELPEEWAEAVRRWRRLNAPLRTGPAPDPNDEYLLYQTLVGAWPLDG 508

Query: 366 LSPHYLADRVAT-------EALIRSSH---------ECRDLVDEARDFHLMPERRFL 406
            +    A+R+A        EA + +S               VD        P R FL
Sbjct: 509 DALADFAERLAAYMLKALREAKLHTSWTDPDEAYEEAVAAFVDAL--LDPPPSRAFL 563


>gnl|CDD|133321 cd04121, Rab40, Rab GTPase family 40 (Rab40) contains Rab40a,
           Rab40b and Rab40c.  The Rab40 subfamily contains Rab40a,
           Rab40b, and Rab40c, which are all highly homologous. In
           rat, Rab40c is localized to the perinuclear recycling
           compartment (PRC), and is distributed in a
           tissue-specific manor, with high expression in brain,
           heart, kidney, and testis, low expression in lung and
           liver, and no expression in spleen and skeletal muscle.
           Rab40c is highly expressed in differentiated
           oligodendrocytes but minimally expressed in
           oligodendrocyte progenitors, suggesting a role in the
           vesicular transport of myelin components. Unlike most
           other Ras-superfamily proteins, Rab40c was shown to have
           a much lower affinity for GTP, and an affinity for GDP
           that is lower than for GTP. GTPase activating proteins
           (GAPs) interact with GTP-bound Rab and accelerate the
           hydrolysis of GTP to GDP. Guanine nucleotide exchange
           factors (GEFs) interact with GDP-bound Rabs to promote
           the formation of the GTP-bound state. Rabs are further
           regulated by guanine nucleotide dissociation inhibitors
           (GDIs), which facilitate Rab recycling by masking
           C-terminal lipid binding and promoting cytosolic
           localization. Most Rab GTPases contain a lipid
           modification site at the C-terminus, with sequence
           motifs CC, CXC, or CCX. Lipid binding is essential for
           membrane attachment, a key feature of most Rab proteins.
          Length = 189

 Score = 28.7 bits (64), Expect = 7.1
 Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 336 FEAVMRWVKHNASERAPSLPRLLAAVRLPL 365
           F+ + RW+K    E AP +P++L   RL L
Sbjct: 94  FDGIDRWIKE-IDEHAPGVPKILVGNRLHL 122


>gnl|CDD|183518 PRK12418, PRK12418, cysteinyl-tRNA synthetase; Provisional.
          Length = 384

 Score = 29.5 bits (67), Expect = 7.7
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 359 AAVRLPLLSPHYLADRVATEALIRSSHE 386
           AA+RL LL+ HY ADR  T+A++  +  
Sbjct: 283 AAIRLALLAGHYRADREWTDAVLAEAEA 310


>gnl|CDD|129733 TIGR00647, MG103, conserved hypothetical protein.  [Hypothetical
           proteins, Conserved].
          Length = 279

 Score = 29.1 bits (65), Expect = 8.4
 Identities = 29/148 (19%), Positives = 46/148 (31%), Gaps = 23/148 (15%)

Query: 185 VASFLQM--------QKVADACADFLKKRFHPNNVLDY-------YVLFSCRAMEALINF 229
           +A   Q                 + LKK+F  +    +        V FS  A   ++  
Sbjct: 25  LAGLFQNNLKLLFTRHWSFKVQTELLKKQFDVSLEFLFRKKMRLKIVRFSFNADLKILGE 84

Query: 230 AYSGRVTIHSQNVQSLMVVASFLQMQKVADACADFLKKRFH----PNNVLGIRQFADTLN 285
                 TI    V+   ++ +FL    V+D         FH     NN   I Q     N
Sbjct: 85  LLVFDRTISKLLVKKSYMIGAFLSGGSVSDL---EHSSNFHLQISSNNEEQIDQLCKLFN 141

Query: 286 CLQLSEAADKYVQQYFHEVSMSDEFIGL 313
             +L+    + +  Y        EF+ L
Sbjct: 142 EFKLTSKTLQLI-VYLKSYEKICEFLKL 168


>gnl|CDD|238867 cd01829, SGNH_hydrolase_peri2, SGNH_peri2; putative periplasmic
           member of the SGNH-family of hydrolases, a diverse
           family of lipases and esterases. The tertiary fold of
           the enzyme is substantially different from that of the
           alpha/beta hydrolase family and unique among all known
           hydrolases; its active site closely resembles the
           Ser-His-Asp(Glu) triad found in other serine hydrolases.
          Length = 200

 Score = 28.8 bits (65), Expect = 9.2
 Identities = 8/29 (27%), Positives = 10/29 (34%)

Query: 605 VYALGGHDGLSIFDSVERYDPKTDEWTSV 633
           V  LG +D   I D        + EW   
Sbjct: 64  VVFLGANDRQDIRDGDGYLKFGSPEWEEE 92


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.134    0.399 

Gapped
Lambda     K      H
   0.267   0.0714    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,220,656
Number of extensions: 3748712
Number of successful extensions: 3154
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3088
Number of HSP's successfully gapped: 102
Length of query: 748
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 644
Effective length of database: 6,324,786
Effective search space: 4073162184
Effective search space used: 4073162184
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.1 bits)