RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14726
         (712 letters)



>gnl|CDD|222983 PHA03098, PHA03098, kelch-like protein; Provisional.
          Length = 534

 Score =  249 bits (637), Expect = 3e-74
 Identities = 134/534 (25%), Positives = 233/534 (43%), Gaps = 40/534 (7%)

Query: 93  CDATIVL-ADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIP-GVDASVMTSLI 150
           CD +I++        +H+ IL + S+YF+ +F      +  F   EI   +D      +I
Sbjct: 10  CDESIIIVNGGGIIKVHKIILSSSSEYFKKMF------KNNFKENEINLNIDYDSFNEVI 63

Query: 151 EYAYLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIKRLTTPRNCFGILLFSRDHFCK 210
           +Y Y  K++IT +NV  +L   ++  +    +LC ++I ++    NC  I  FS  + CK
Sbjct: 64  KYIYTGKINITSNNVKDILSIANYLIIDFLINLCINYIIKIIDDNNCIDIYRFSFFYGCK 123

Query: 211 KLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDYD 270
           KL      YI  N   +     + +++S+ EL +I+S D+LNV SE+ V E+II+W+   
Sbjct: 124 KLYSAAYNYIRNNIELIYN-DPDFIYLSKNELIKILSDDKLNVSSEDVVLEIIIKWLTSK 182

Query: 271 PENRKKHIVSLMKNIRLGLLETEFF------REKVKHHPYVQSTPECRPIIIETFKFLYD 324
             N+ K I  ++K +R+  L  E        + ++K    V +   C  II      L  
Sbjct: 183 KNNKYKDICLILKVLRITFLSEEGIKKLKRWKLRIKKKKIVFNK-RCIKIIYSKKYNL-- 239

Query: 325 LELIAQIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPM 384
                  +  +P             +  I         +++  Y        +I  +  +
Sbjct: 240 -------NKILPRSS------TFGSIIYIHITMSIFTYNYITNY----SPLSEINTIIDI 282

Query: 385 GPRAYHGTAVIGYNIYVIGGFDGNEY-YNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHH 443
                 G+ V+   IY IGG + N    NS   ++  TK W ++  + + R      V +
Sbjct: 283 HYVYCFGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPGVTVFN 342

Query: 444 DVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQGKIYITGG-FNGH 502
           + IY +GG       N+VE +   E++W    P+   R +     +   IY+ GG     
Sbjct: 343 NRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKND 402

Query: 503 NCLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYVIGGF---NGMSRMCNGEKYN 559
             L + E +    N+W+   P+     G   I +   IYVIGG    + +      E YN
Sbjct: 403 ELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYN 462

Query: 560 PVTKTWSQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWY 613
           PVT  W+++  +  PR N ++ + ++ I+ +GG      I  +E YD+KT+ W 
Sbjct: 463 PVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWT 516



 Score =  103 bits (259), Expect = 4e-23
 Identities = 56/197 (28%), Positives = 85/197 (43%), Gaps = 12/197 (6%)

Query: 334 EVPTPEIARPR------VPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPR 387
               PE+  PR      V +  ++ IGG       + VE++     +W   E+   + PR
Sbjct: 323 WNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKW--REEPPLIFPR 380

Query: 388 AYHGTAVIGYNIYVIGG-FDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHDVI 446
                  +   IYVIGG    +E   +  CF+  T  W + +P+           H   I
Sbjct: 381 YNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKI 440

Query: 447 YAMGGY---NGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQGKIYITGGFNGHN 503
           Y +GG    +  +  N VE YN   N+W+ ++ +N  R +AS      KIY+ GG     
Sbjct: 441 YVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEY 500

Query: 504 CLNSCEVYDPECNQWTL 520
            +N  EVYD + N WTL
Sbjct: 501 YINEIEVYDDKTNTWTL 517



 Score = 67.1 bits (164), Expect = 2e-11
 Identities = 38/174 (21%), Positives = 66/174 (37%), Gaps = 6/174 (3%)

Query: 463 KYNYTENQWSLIAPMNVERSDAS--ATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTL 520
           K  + +    +I       +     ++T    IYI          N    Y P     T+
Sbjct: 221 KIVFNKRCIKIIYSKKYNLNKILPRSSTFGSIIYIHIT-MSIFTYNYITNYSPLSEINTI 279

Query: 521 IEPMRHRRSGVSCIAYHECIYVIGGFNG-MSRMCNGEKYNPVTKTWSQVPDMYNPRSNFA 579
           I+   H       +  +  IY IGG N     + +   Y+  TK+W++VP++  PR N  
Sbjct: 280 ID--IHYVYCFGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPG 337

Query: 580 IEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMNIYRSALSACVIMGL 633
           + V ++ I+ IGG     ++  VE +     +W E   +   R       +  L
Sbjct: 338 VTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNL 391


>gnl|CDD|149006 pfam07707, BACK, BTB And C-terminal Kelch.  This domain is found
           associated with pfam00651 and pfam01344. The BACK domain
           is found juxtaposed to the BTB domain; they are
           separated by as little as two residues. This family
           appears to be closely related to the BTB domain (Finn
           RD, personal observation).
          Length = 101

 Score =  119 bits (302), Expect = 3e-32
 Identities = 45/101 (44%), Positives = 73/101 (72%), Gaps = 1/101 (0%)

Query: 197 CFGILLFSRDHFCKKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSE 256
           C GI  F+  + C++L E   ++IL+NF++V+ +SEE + +S+++L E++SSDELNV SE
Sbjct: 1   CLGIYRFADTYGCEELAEVALRFILKNFLEVS-KSEEFLELSKEQLLELLSSDELNVPSE 59

Query: 257 ETVWELIIRWIDYDPENRKKHIVSLMKNIRLGLLETEFFRE 297
           E V+E +I+W+ +D ENRKKH+  L+  +RL LL  E+  +
Sbjct: 60  EEVFEAVIKWVKHDVENRKKHLPELLSAVRLPLLSPEYLLD 100


>gnl|CDD|197943 smart00875, BACK, BTB And C-terminal Kelch.  The BACK domain is
           found juxtaposed to the BTB domain; they are separated
           by as little as two residues.
          Length = 101

 Score =  111 bits (279), Expect = 3e-29
 Identities = 41/102 (40%), Positives = 70/102 (68%), Gaps = 1/102 (0%)

Query: 197 CFGILLFSRDHFCKKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSE 256
           C GI  F+  H  ++L E   ++IL+NF +V+  SEE + +  ++L E++SSD+LNV SE
Sbjct: 1   CLGIRRFADAHGLEELAEKALRFILQNFSEVSS-SEEFLELPLEQLLELLSSDDLNVSSE 59

Query: 257 ETVWELIIRWIDYDPENRKKHIVSLMKNIRLGLLETEFFREK 298
           E V+E ++RW+ +DPE R++H+  L++++RL LL  ++  E 
Sbjct: 60  EEVFEAVLRWVKHDPEKRREHLPELLEHVRLPLLSPDYLLEV 101


>gnl|CDD|165153 PHA02790, PHA02790, Kelch-like protein; Provisional.
          Length = 480

 Score =  115 bits (290), Expect = 3e-27
 Identities = 118/525 (22%), Positives = 220/525 (41%), Gaps = 82/525 (15%)

Query: 96  TIVLADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIP------GVDASVMTSL 149
           TI+ A   +  ++  IL   S YFR       H ++K++  + P       +D   +TS+
Sbjct: 24  TIIEAIGGNIIVNSTILKKLSPYFRT------HLRQKYTKNKDPVTRVCLDLDIHSLTSI 77

Query: 150 IEYAYLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIKRLTTPRNCFGILLFSRDHFC 209
           + Y+Y  K++I   NV +LL A+    +      C +FI R      C    +   ++  
Sbjct: 78  VIYSYTGKVYIDSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEYCVECYMMGIEYGL 137

Query: 210 KKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDY 269
             L   T+ +I ++F+++  E + I       ++ I+ SDELNV  E+ V + +I+W   
Sbjct: 138 SNLLCHTKDFIAKHFLEL--EDDIIDNFDYLSMKLILESDELNVPDEDYVVDFVIKWY-M 194

Query: 270 DPENRKKHIVSLMKNIRLGLLETEFFREKVKHHPYVQSTPECRPIIIETFKFLYDLELIA 329
              NR  +++ L+KN+    + + +                  P  I   K++ D   I 
Sbjct: 195 KRRNRLGNLLLLIKNV----IRSNYLS----------------PRGINNVKWILDCTKIF 234

Query: 330 QIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPRAY 389
             D         +PR  ++               F+E Y    D+ + I          +
Sbjct: 235 HCD--------KQPRKSYKY-------------PFIE-YPMNMDQIIDI----------F 262

Query: 390 H--GTAVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHDVIY 447
           H   +  +G  +Y+IGG+  NE +N+    N ++  W  I PMN  R Y S    ++ +Y
Sbjct: 263 HMCTSTHVGEVVYLIGGWMNNEIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLY 322

Query: 448 AMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQGKIYITGGFNGHNCLNS 507
            +GG        SVE++ + +  W  +  +   R + +  ++   IY+ GG +  +   +
Sbjct: 323 VVGGLPN---PTSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETD--TT 377

Query: 508 CEVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYVIGGFNGMSRMCNGEKYNPVTKTWSQ 567
            E   P  +QW       +       + +   ++++G         N E Y   + TW+ 
Sbjct: 378 TEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVGR--------NAEFYCESSNTWTL 429

Query: 568 VPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEW 612
           + D   PR N  + ++D+ +  IGGF   + I  +E Y+ +T  W
Sbjct: 430 IDDPIYPRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSW 474


>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain.  The BTB (for BR-C, ttk and bab) or
           POZ (for Pox virus and Zinc finger) domain is present
           near the N-terminus of a fraction of zinc finger
           (pfam00096) proteins and in proteins that contain the
           pfam01344 motif such as Kelch and a family of pox virus
           proteins. The BTB/POZ domain mediates homomeric
           dimerisation and in some instances heteromeric
           dimerisation. The structure of the dimerised PLZF
           BTB/POZ domain has been solved and consists of a tightly
           intertwined homodimer. The central scaffolding of the
           protein is made up of a cluster of alpha-helices flanked
           by short beta-sheets at both the top and bottom of the
           molecule. POZ domains from several zinc finger proteins
           have been shown to mediate transcriptional repression
           and to interact with components of histone deacetylase
           co-repressor complexes including N-CoR and SMRT. The POZ
           or BTB domain is also known as BR-C/Ttk or ZiN.
          Length = 101

 Score = 98.8 bits (247), Expect = 8e-25
 Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 6/107 (5%)

Query: 83  LNEFREKNLLCDATIVLADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIPGVD 142
           LNE RE   LCD T+V   +  F  H+ +L ACS YF+ALFT +         + +  V 
Sbjct: 1   LNELRENGELCDVTLV-VGDKEFHAHKAVLAACSPYFKALFTGNK-----EVEITLEDVS 54

Query: 143 ASVMTSLIEYAYLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIK 189
                +L+E+ Y  KL ITE+NV  LL   D   +    D C +F+ 
Sbjct: 55  PEDFEALLEFIYTGKLEITEENVDDLLALADKLQIPALIDKCEEFLI 101


>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac.  Domain
           in Broad-Complex, Tramtrack and Bric a brac. Also known
           as POZ (poxvirus and zinc finger) domain. Known to be a
           protein-protein interaction motif found at the N-termini
           of several C2H2-type transcription factors as well as
           Shaw-type potassium channels. Known structure reveals a
           tightly intertwined dimer formed via interactions
           between N-terminal strand and helix structures. However
           in a subset of BTB/POZ domains, these two secondary
           structures appear to be missing. Be aware SMART predicts
           BTB/POZ domains without the beta1- and alpha1-secondary
           structures.
          Length = 97

 Score = 84.3 bits (209), Expect = 1e-19
 Identities = 30/99 (30%), Positives = 51/99 (51%), Gaps = 2/99 (2%)

Query: 94  DATIVLADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIPGVDASVMTSLIEYA 153
           D T+V      F  H+ +L A S YF+ALF++   + +K S + +  V      +L+ + 
Sbjct: 1   DVTLV-VGGKKFHAHKAVLAAHSPYFKALFSSDFKESDK-SEIYLDDVSPEDFRALLNFL 58

Query: 154 YLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIKRLT 192
           Y  KL + E+NV  LL   D+  +    +LC +F+ +L 
Sbjct: 59  YTGKLDLPEENVEELLELADYLQIPGLVELCEEFLLKLL 97


>gnl|CDD|128874 smart00612, Kelch, Kelch domain. 
          Length = 47

 Score = 69.9 bits (172), Expect = 3e-15
 Identities = 20/47 (42%), Positives = 29/47 (61%)

Query: 539 CIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDMYNPRSNFAIEVIDD 585
            IYV+GGF+G  R+ + E Y+P T  W+ +P M  PRS   + VI+ 
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 61.0 bits (149), Expect = 3e-12
 Identities = 22/47 (46%), Positives = 27/47 (57%)

Query: 492 KIYITGGFNGHNCLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAYHE 538
           KIY+ GGF+G   L S EVYDPE N+WT +  M   RSG      + 
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 60.3 bits (147), Expect = 7e-12
 Identities = 16/47 (34%), Positives = 25/47 (53%)

Query: 445 VIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQG 491
            IY +GG++G +R  SVE Y+   N+W+ +  M   RS      + G
Sbjct: 1   KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 58.7 bits (143), Expect = 2e-11
 Identities = 15/46 (32%), Positives = 21/46 (45%)

Query: 399 IYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHD 444
           IYV+GGFDG +   S   ++  T  W  +  M   R     AV + 
Sbjct: 2   IYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 56.4 bits (137), Expect = 1e-10
 Identities = 17/46 (36%), Positives = 24/46 (52%)

Query: 587 IFAIGGFNGVTTIYHVECYDEKTDEWYEATDMNIYRSALSACVIMG 632
           I+ +GGF+G   +  VE YD +T++W     M   RS     VI G
Sbjct: 2   IYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47



 Score = 42.5 bits (101), Expect = 1e-05
 Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 350 LFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPM-GPRAYHGTAVIG 396
           ++ +GG+ GG     VE YD   ++W     +  M  PR+ HG AVI 
Sbjct: 2   IYVVGGFDGGQRLKSVEVYDPETNKW---TPLPSMPTPRSGHGVAVIN 46


>gnl|CDD|177844 PLN02193, PLN02193, nitrile-specifier protein.
          Length = 470

 Score = 75.4 bits (185), Expect = 3e-14
 Identities = 50/165 (30%), Positives = 74/165 (44%), Gaps = 20/165 (12%)

Query: 395 IGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMN---FKRCYVSTAVHHDVIYAMGG 451
           IG  +YV GG D +  YN    F+  T  WK + P+      R + S A   + +Y  GG
Sbjct: 227 IGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGG 286

Query: 452 YNGQRRQNSVEKYNYTENQW---SLIAPMNVERSDASATTLQGKIYITGGFNGHNCLNSC 508
            +   R  +++ YN  + +W   S        R  A    +QGK+++  GFNG      C
Sbjct: 287 VSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNG------C 340

Query: 509 EV-----YDPECNQWTLIEPMRHR---RSGVSCIAYHECIYVIGG 545
           EV     YDP  ++WT +E    R   RS  +  A  + I + GG
Sbjct: 341 EVDDVHYYDPVQDKWTQVETFGVRPSERSVFASAAVGKHIVIFGG 385



 Score = 70.8 bits (173), Expect = 8e-13
 Identities = 63/253 (24%), Positives = 95/253 (37%), Gaps = 63/253 (24%)

Query: 374 RWVKIEQV-DPMGPRAYHGTAVIGYNIYVIGG-FDGNEYYNS-CRCFNAVTKVWKEIAPM 430
           +W+K+EQ  +  G R  HG A +G  IY  GG F  N+  +     F+  T+ W  I+P 
Sbjct: 152 KWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDLETRTW-SISPA 210

Query: 431 N--------FKRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERS 482
                         VS       +Y  GG +  R+ N    ++ T N+W L+ P      
Sbjct: 211 TGDVPHLSCLGVRMVSIG---STLYVFGGRDASRQYNGFYSFDTTTNEWKLLTP------ 261

Query: 483 DASATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYV 542
                                                 +E     RS  S  A  E +YV
Sbjct: 262 --------------------------------------VEEGPTPRSFHSMAADEENVYV 283

Query: 543 IGGFNGMSRMCNGEKYNPVTKTW---SQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTI 599
            GG +  +R+   + YN V K W   S   D ++ R    +EV+   ++ + GFNG   +
Sbjct: 284 FGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGC-EV 342

Query: 600 YHVECYDEKTDEW 612
             V  YD   D+W
Sbjct: 343 DDVHYYDPVQDKW 355


>gnl|CDD|201739 pfam01344, Kelch_1, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that the
           Drosophila ring canal kelch protein is related to
           Galactose Oxidase, for which a structure has been
           solved. The kelch motif forms a beta sheet. Several of
           these sheets associate to form a beta propeller
           structure as found in pfam00064, pfam00400 and
           pfam00415.
          Length = 46

 Score = 63.1 bits (154), Expect = 7e-13
 Identities = 22/45 (48%), Positives = 28/45 (62%)

Query: 481 RSDASATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTLIEPMR 525
           RS A    L GKIY+ GG++G   L+S EVYDPE N W+ +  M 
Sbjct: 2   RSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46



 Score = 61.1 bits (149), Expect = 3e-12
 Identities = 16/46 (34%), Positives = 22/46 (47%)

Query: 433 KRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMN 478
            R      V    IY +GGY+G +  +SVE Y+   N WS +  M 
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46



 Score = 60.8 bits (148), Expect = 4e-12
 Identities = 19/45 (42%), Positives = 27/45 (60%)

Query: 527 RRSGVSCIAYHECIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDM 571
            RSG   +     IYVIGG++G   + + E Y+P T TWS++P M
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45



 Score = 58.1 bits (141), Expect = 4e-11
 Identities = 17/46 (36%), Positives = 26/46 (56%)

Query: 386 PRAYHGTAVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMN 431
           PR+  G  V+G  IYVIGG+DG +  +S   ++  T  W ++  M 
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46



 Score = 57.3 bits (139), Expect = 7e-11
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 574 PRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMN 619
           PRS   + V+   I+ IGG++G  ++  VE YD +T+ W +   M 
Sbjct: 1   PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46



 Score = 38.4 bits (90), Expect = 3e-04
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 345 VPHEILFAIGGWSGGSPTDFVETYDTRADRWVKI 378
           V    ++ IGG+ GG     VE YD   + W K+
Sbjct: 9   VLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKL 42


>gnl|CDD|165086 PHA02713, PHA02713, hypothetical protein; Provisional.
          Length = 557

 Score = 67.7 bits (165), Expect = 1e-11
 Identities = 109/561 (19%), Positives = 206/561 (36%), Gaps = 73/561 (13%)

Query: 58  IENAIKNKKNFSSIDRAMSVQAMQALNEFREKNLLCDATIVLADNSHFPIHRNILCACSD 117
           I++   N++  S+I   +            + ++LCD  I + D      H+ IL A S 
Sbjct: 3   IDDIKHNRRVVSNISNLL------------DDDILCDVIITIGDGEEIKAHKTILAAGSK 50

Query: 118 YFRALFTTSLHDQEKFSLVEIPGVDASVMTSLIEYAYLRKLHITEDNVASLLLATDFFCM 177
           YFR LFTT +  ++  + V +   D   + ++++Y Y R  HI+  NV  +L   D+  +
Sbjct: 51  YFRTLFTTPMIIRDLVTRVNLQMFDKDAVKNIVQYLYNR--HISSMNVIDVLKCADYLLI 108

Query: 178 SKAQDLCCDFIKRLTTPRNCFGIL--LFSRDHFCKKLEEWTRKYILRNFVDVARESEEIV 235
                 C  +IK  T    C  +   L+   H    + ++ ++ ++ N   +        
Sbjct: 109 DDLVTDCESYIKDYTNHDTCIYMYHRLYEMSHI--PIVKYIKRMLMSNIPTLITTDAFKK 166

Query: 236 FISEQELEEIISSDELNVKSEETVWELIIRWIDYDPENRKK--------HIVSLMKNIRL 287
            + E   + I ++D + +  E     ++++W++Y+    ++         I +L K  RL
Sbjct: 167 TVFEILFDIISTNDNVYLYREGYKVTILLKWLEYNYITEEQLLCILSCIDIQNLDKKSRL 226

Query: 288 GLLETEFFREKVKHHPYVQSTPECRPIIIETFKFLY--------------------DLEL 327
            L   +          ++    + R II      +                     +  +
Sbjct: 227 LLYSNKTINMYPSCIQFLLDNKQNRNIIPRQLCLVCHDTKYNVCNPCILVYNINTMEYSV 286

Query: 328 IAQIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPR 387
           I+ I   +     A   V +EI+ A GG++  +P+   + Y    +  + +E    +  R
Sbjct: 287 ISTIPNHIIN--YASAIVDNEIIIA-GGYNFNNPS-LNKVYKINIENKIHVELPPMIKNR 342

Query: 388 AYHGTAVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHDVIY 447
                AVI   IY IGG +G     +  C+      WK +  M          V    IY
Sbjct: 343 CRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIY 402

Query: 448 AMGG------YNGQRRQNSVE------------KYNYTENQWSLIAPMNVERSDASATTL 489
            +GG      Y      NS++            +Y+   N W  +             + 
Sbjct: 403 IIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSH 462

Query: 490 QGKIYITGGFNGHNCLNSCEV-YDPEC-NQWTLIEPMRHRRSGVSCIAYHECIYVIGGFN 547
           +  IY+         + +C   Y+    N W LI     R S +  I +   I ++  + 
Sbjct: 463 KDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTILHDNTIMMLHCYE 522

Query: 548 GMSRMCNGEKYNPVTKTWSQV 568
             S M   + +N  T  W+ +
Sbjct: 523 --SYMLQ-DTFNVYTYEWNHI 540



 Score = 58.9 bits (142), Expect = 5e-09
 Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 2/131 (1%)

Query: 502 HNCLNSC-EVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYVIGGFNGMSRMCNG-EKYN 559
           +N  N C  VY+    ++++I  + +     +       I + GG+N  +   N   K N
Sbjct: 267 YNVCNPCILVYNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLNKVYKIN 326

Query: 560 PVTKTWSQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMN 619
              K   ++P M   R  F++ VIDD I+AIGG NG      +ECY    D+W    DM 
Sbjct: 327 IENKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMP 386

Query: 620 IYRSALSACVI 630
           I  S+   CV+
Sbjct: 387 IALSSYGMCVL 397



 Score = 51.9 bits (124), Expect = 9e-07
 Identities = 43/154 (27%), Positives = 70/154 (45%), Gaps = 8/154 (5%)

Query: 464 YNYTENQWSLIAPMNVERSDASATTLQGKIYITGGFNGHN-CLNSCEVYDPECNQWTLIE 522
           YN    ++S+I+ +     + ++  +  +I I GG+N +N  LN  +VY         +E
Sbjct: 277 YNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLN--KVYKINIENKIHVE 334

Query: 523 --PMRHRRSGVSCIAYHECIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDMYNPRSNFAI 580
             PM   R   S     + IY IGG NG +     E Y      W  +PDM    S++ +
Sbjct: 335 LPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGM 394

Query: 581 EVIDDMIFAIGG---FNGVTTIYHVECYDEKTDE 611
            V+D  I+ IGG       T+++H+   D + D 
Sbjct: 395 CVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDT 428


>gnl|CDD|177814 PLN02153, PLN02153, epithiospecifier protein.
          Length = 341

 Score = 64.6 bits (157), Expect = 5e-11
 Identities = 65/255 (25%), Positives = 108/255 (42%), Gaps = 41/255 (16%)

Query: 375 WVKIEQ---VDPMGPRAYHGTAVIGYNIYVIGG-FDGNEYYNS-CRCFNAVTKVWKEIAP 429
           W+K+EQ     P GPR  HG AV+G  +Y  GG    NE+ +     F+  T  W  IAP
Sbjct: 9   WIKVEQKGGKGP-GPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTW-SIAP 66

Query: 430 MNFKRCYVST------AVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVE--- 480
            N     +S       AV    +Y  GG + +R  +    Y+  +N+W+ +  ++ E   
Sbjct: 67  ANGDVPRISCLGVRMVAVGTK-LYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGP 125

Query: 481 --RSDASATTLQGKIYITGGFNGHNCLN------SCEVYDPECNQWT-LIEPMRH--RRS 529
             R+  S  + +  +Y+ GG +    +       + E Y+    +W  L +P  +  +R 
Sbjct: 126 EARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRG 185

Query: 530 GVSCIAYHECIYVIGGFNGMSRMCNGEK---------YNPVTKTWSQVPDM---YNPRSN 577
           G         I+V+ GF   S +  G+          ++P +  W++V       + RS 
Sbjct: 186 GAGFAVVQGKIWVVYGF-ATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSV 244

Query: 578 FAIEVIDDMIFAIGG 592
           FA  V+   I   GG
Sbjct: 245 FAHAVVGKYIIIFGG 259


>gnl|CDD|211835 TIGR03548, mutarot_permut, cyclically-permuted mutarotase family
           protein.  Members of this protein family show
           essentially full-length homology, cyclically permuted,
           to YjhT from Escherichia coli. YjhT was shown to act as
           a mutarotase for sialic acid, and by this ability to be
           able to act as a virulence factor. Members of the YjhT
           family (TIGR03547) and this cyclically-permuted family
           have multiple repeats of the beta-propeller-forming
           Kelch repeat.
          Length = 323

 Score = 55.5 bits (134), Expect = 3e-08
 Identities = 47/184 (25%), Positives = 81/184 (44%), Gaps = 14/184 (7%)

Query: 423 VWKEIAPMNFKRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSL----IAPMN 478
            W +   + +   Y ++    + IY +GG N     +SV +    E++ +L    +  + 
Sbjct: 52  KWVKAGQLPYAAAYGASISTENGIYYIGGSNSSESFSSVYRITLDESKEALIIETLPSLP 111

Query: 479 VERSDASATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTLIE--PMRHRRSGVSCIAY 536
               + SAT   GK+Y+ GG       N    ++ E  +W  +   P   R   V C+  
Sbjct: 112 FAMDNGSATYKDGKLYVGGGNANGKPSNKFYCFNLETQEWEELPDFPGAPRVQPV-CVKL 170

Query: 537 HECIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDMY---NPRS---NFAIEVIDDMIFAI 590
              +YV GG + ++   +G KY+P T TW +V D      P S     +I + + ++  I
Sbjct: 171 QNELYVFGGGDNIAYT-DGYKYSPKTGTWEKVADPLSDGEPISLLGAASIAINESLLLCI 229

Query: 591 GGFN 594
           GGFN
Sbjct: 230 GGFN 233



 Score = 34.7 bits (80), Expect = 0.15
 Identities = 35/173 (20%), Positives = 60/173 (34%), Gaps = 27/173 (15%)

Query: 345 VPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPRAY---HGTAV-IGYNIY 400
             + I +  G  S  S +     Y    D   +   ++ +    +   +G+A      +Y
Sbjct: 71  TENGIYYIGGSNSSESFSS---VYRITLDESKEALIIETLPSLPFAMDNGSATYKDGKLY 127

Query: 401 VIGGFDGNEYYNSCRCFNAVTKVWKEIA--PMNFKRCYVSTAVHHDVIYAMGGYNGQRRQ 458
           V GG    +  N   CFN  T+ W+E+   P   +   V   + +++    GG N     
Sbjct: 128 VGGGNANGKPSNKFYCFNLETQEWEELPDFPGAPRVQPVCVKLQNELYVFGGGDN----I 183

Query: 459 NSVEKYNYT--ENQWSLIAPMNVERSDASATTLQG---------KIYITGGFN 500
              + Y Y+     W     +    SD    +L G          +   GGFN
Sbjct: 184 AYTDGYKYSPKTGTWE---KVADPLSDGEPISLLGAASIAINESLLLCIGGFN 233


>gnl|CDD|206134 pfam13964, Kelch_6, Kelch motif. 
          Length = 50

 Score = 48.6 bits (116), Expect = 9e-08
 Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 386 PRAYHGTAVIGYNIYVIGGF-DGNEYYNSCRCFNAVTKVWKEIAPMNFKR 434
           PR  H    +G  IYV GG+ +G++  N    ++  T  W+++ P+   R
Sbjct: 1   PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50



 Score = 48.6 bits (116), Expect = 1e-07
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 528 RSGVSCIAYHECIYVIGGFNGMSRMCN-GEKYNPVTKTWSQVPDMYNPR 575
           R+G S ++    IYV GG++  S+  N    Y+P T +W ++P +  PR
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50



 Score = 45.9 bits (109), Expect = 9e-07
 Identities = 20/49 (40%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 481 RSDASATTLQGKIYITGGF-NGHNCLNSCEVYDPECNQWTLIEPMRHRR 528
           R+  SA ++ GKIY+ GG+ NG    N   VYDPE   W  + P+   R
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50



 Score = 42.1 bits (99), Expect = 2e-05
 Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 1/49 (2%)

Query: 434 RCYVSTAVHHDVIYAMGGY-NGQRRQNSVEKYNYTENQWSLIAPMNVER 481
           R   S       IY  GGY NG +  N V  Y+     W  + P+   R
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50



 Score = 37.8 bits (88), Expect = 6e-04
 Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 1/50 (2%)

Query: 574 PRSNFAIEVIDDMIFAIGGFNGVTTIYH-VECYDEKTDEWYEATDMNIYR 622
           PR+  +   +   I+  GG++  +   + V  YD +T  W +   +   R
Sbjct: 1   PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50


>gnl|CDD|222113 pfam13415, Kelch_3, Galactose oxidase, central domain. 
          Length = 48

 Score = 40.7 bits (96), Expect = 5e-05
 Identities = 15/47 (31%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 491 GKIYITGGFNGHN-CLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAY 536
           GKIY+ GG       LN   VYD + N W  +  +   R+G +    
Sbjct: 2   GKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48



 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 540 IYVIGGFNGMSRMCNG-EKYNPVTKTWSQVPDMYNPRSNFAI 580
           IYV GG        N    Y+  T TW ++ D+  PR+  A 
Sbjct: 4   IYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAA 45



 Score = 37.7 bits (88), Expect = 6e-04
 Identities = 10/46 (21%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 398 NIYVIGGFDG-NEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVH 442
            IYV GG        N    ++  T  W+++  +   R   +  V 
Sbjct: 3   KIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48



 Score = 31.1 bits (71), Expect = 0.16
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 1/48 (2%)

Query: 584 DDMIFAIGGFNGVTTIYH-VECYDEKTDEWYEATDMNIYRSALSACVI 630
              I+  GG     T  + +  YD  T+ W +  D+   R+  +A VI
Sbjct: 1   GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48


>gnl|CDD|205596 pfam13418, Kelch_4, Galactose oxidase, central domain. 
          Length = 49

 Score = 40.3 bits (95), Expect = 7e-05
 Identities = 13/46 (28%), Positives = 26/46 (56%), Gaps = 2/46 (4%)

Query: 481 RSDASATTLQ-GKIYITGGFNG-HNCLNSCEVYDPECNQWTLIEPM 524
           R+  ++T++  G++Y+ GG N   + L+   V+D   N WT +  +
Sbjct: 2   RAYHTSTSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47



 Score = 39.6 bits (93), Expect = 1e-04
 Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 2/47 (4%)

Query: 386 PRAYHGTAVIGYN-IYVIGGFDGN-EYYNSCRCFNAVTKVWKEIAPM 430
           PRAYH +  IG   +Y+ GG + +    +    F+  T  W  +  +
Sbjct: 1   PRAYHTSTSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47



 Score = 31.8 bits (73), Expect = 0.073
 Identities = 9/44 (20%), Positives = 20/44 (45%), Gaps = 1/44 (2%)

Query: 435 CYVSTAVHHDVIYAMGGYNGQ-RRQNSVEKYNYTENQWSLIAPM 477
            + ST++    +Y  GG N      + V  ++ + N W+ +  +
Sbjct: 4   YHTSTSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47



 Score = 31.5 bits (72), Expect = 0.10
 Identities = 9/33 (27%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 540 IYVIGGFNG-MSRMCNGEKYNPVTKTWSQVPDM 571
           +Y+ GG N   S + +   ++  T TW+++P +
Sbjct: 15  LYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47


>gnl|CDD|237617 PRK14131, PRK14131, N-acetylneuraminic acid mutarotase;
           Provisional.
          Length = 376

 Score = 43.5 bits (103), Expect = 3e-04
 Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 22/127 (17%)

Query: 391 GT-AVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNF---KRCYVSTAVHHDVI 446
           GT A+    +YV  G  G  +Y      NA +K W +IA   F    R     A     +
Sbjct: 32  GTGAIDNNTVYVGLGSAGTSWYK--LDLNAPSKGWTKIA--AFPGGPREQAVAAFIDGKL 87

Query: 447 YAMGGYNGQ-------RRQNSVEKYNYTENQWS---LIAPMNVERSDASATTLQG-KIYI 495
           Y  GG  G+       +  + V KY+   N W      +P+ +  +   A +L   K YI
Sbjct: 88  YVFGG-IGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGL--AGHVAVSLHNGKAYI 144

Query: 496 TGGFNGH 502
           TGG N +
Sbjct: 145 TGGVNKN 151



 Score = 43.5 bits (103), Expect = 3e-04
 Identities = 25/93 (26%), Positives = 35/93 (37%), Gaps = 11/93 (11%)

Query: 465 NYTENQWSLIAPM-NVERSDASATTLQGKIYITGGFNGHN------CLNSCEVYDPECNQ 517
           N     W+ IA      R  A A  + GK+Y+ GG    N        +    YDP+ N 
Sbjct: 58  NAPSKGWTKIAAFPGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNS 117

Query: 518 WTLIEPMRHRRS--GVSCIAYHEC-IYVIGGFN 547
           W  +   R      G   ++ H    Y+ GG N
Sbjct: 118 WQKL-DTRSPVGLAGHVAVSLHNGKAYITGGVN 149



 Score = 38.8 bits (91), Expect = 0.008
 Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 11/89 (12%)

Query: 374 RWVKIEQVDPMGPRAYHGTAVIGYNIYVIGGF-----DG-NEYYNSCRCFNAVTKVWKEI 427
            W KI    P GPR     A I   +YV GG      +G  + ++    ++  T  W+++
Sbjct: 63  GWTKIAAF-PGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKL 121

Query: 428 ---APMNFKRCYVSTAVHHDVIYAMGGYN 453
              +P+     +V+ ++H+   Y  GG N
Sbjct: 122 DTRSPVGLA-GHVAVSLHNGKAYITGGVN 149



 Score = 31.5 bits (72), Expect = 1.6
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 11/69 (15%)

Query: 350 LFAIGGW----SGGSPTDF--VETYDTRADRWVKIEQVDPMGPRAYHGTAVIGYNIYVIG 403
           L+  GG     S GSP  F  V  YD + + W K++   P+G   +   ++     Y+ G
Sbjct: 87  LYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITG 146

Query: 404 G-----FDG 407
           G     FDG
Sbjct: 147 GVNKNIFDG 155


>gnl|CDD|234253 TIGR03547, muta_rot_YjhT, mutatrotase, YjhT family.  Members of
           this protein family contain multiple copies of the
           beta-propeller-forming Kelch repeat. All are full-length
           homologs to YjhT of Escherichia coli, which has been
           identified as a mutarotase for sialic acid. This protein
           improves bacterial ability to obtain host sialic acid,
           and thus serves as a virulence factor. Some bacteria
           carry what appears to be a cyclically permuted homolog
           of this protein.
          Length = 346

 Score = 43.1 bits (102), Expect = 3e-04
 Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 18/126 (14%)

Query: 390 HGT-AVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNF---KRCYVSTAVHHDV 445
           +GT A+IG  +YV  G  G  +Y         +K W++IA  +F    R     A     
Sbjct: 10  NGTGAIIGDKVYVGLGSAGTSWYK--LDLKKPSKGWQKIA--DFPGGPRNQAVAAAIDGK 65

Query: 446 IYAMGGYNGQ-------RRQNSVEKYNYTENQWS-LIAPMNVERSDASATTLQ-GKIYIT 496
           +Y  GG  G+       +  + V +Y+  +N W  L     V    AS  +L  G+ Y T
Sbjct: 66  LYVFGG-IGKANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFT 124

Query: 497 GGFNGH 502
           GG N +
Sbjct: 125 GGVNKN 130



 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 26/97 (26%), Positives = 35/97 (36%), Gaps = 19/97 (19%)

Query: 465 NYTENQWSLIAPM-NVERSDASATTLQGKIYITGGFNGHN-------CLNSCEVYDPECN 516
                 W  IA      R+ A A  + GK+Y+ GG  G           +    YDP+ N
Sbjct: 37  KKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGG-IGKANSEGSPQVFDDVYRYDPKKN 95

Query: 517 QWTLIEPMRHRRS-----GVSCIAYHEC-IYVIGGFN 547
            W  ++     RS     G S  + H    Y  GG N
Sbjct: 96  SWQKLD----TRSPVGLLGASGFSLHNGQAYFTGGVN 128



 Score = 35.8 bits (83), Expect = 0.060
 Identities = 32/153 (20%), Positives = 53/153 (34%), Gaps = 24/153 (15%)

Query: 375 WVKIEQVDPMGPRAYHGTAVIGYNIYVIGGFDGNE------YYNSCRCFNAVTKVWKEI- 427
           W KI    P GPR     A I   +YV GG            ++    ++     W+++ 
Sbjct: 43  WQKIADF-PGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLD 101

Query: 428 --APMNFKRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDAS 485
             +P+         ++H+   Y  GG N    +N  + Y + +   S     + E  D  
Sbjct: 102 TRSPVGLLG-ASGFSLHNGQAYFTGGVN----KNIFDGY-FAD--LSAADK-DSEPKDKL 152

Query: 486 ATTLQGKIYITGGFNGHNCLNSCEVYDPECNQW 518
                 +      +N      +   YDP  NQW
Sbjct: 153 IAAYFSQPPEDYFWN-----KNVLSYDPSTNQW 180


>gnl|CDD|225597 COG3055, COG3055, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 381

 Score = 40.4 bits (95), Expect = 0.003
 Identities = 23/79 (29%), Positives = 30/79 (37%), Gaps = 15/79 (18%)

Query: 480 ERSDASATTLQGKIYITGGFNGHN-----CLNSCEVYDPECNQWTLIEPMRHRRS----- 529
            R+ A A  + GK+Y+ GG+           N    YDP  N W         RS     
Sbjct: 82  ARNQAVAAVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWH----KLDTRSPTGLV 137

Query: 530 GVSCIAYHEC-IYVIGGFN 547
           G S  + +   IY  GG N
Sbjct: 138 GASTFSLNGTKIYFFGGVN 156



 Score = 30.8 bits (70), Expect = 2.3
 Identities = 28/108 (25%), Positives = 40/108 (37%), Gaps = 14/108 (12%)

Query: 355 GWSGGSPTDFVETYDTRADRWVKIEQVDPMGPRAYHGTAVIGYNIYVIGGFDGNE----- 409
           G +G +   +V         W KI    P G R     AVIG  +YV GG+  +      
Sbjct: 54  GSAGTA--FYVLDLKKPGKGWTKIADF-PGGARNQAVAAVIGGKLYVFGGYGKSVSSSPQ 110

Query: 410 YYNSCRCFNAVTKVWKE---IAPMNFKRCYVSTAVHHDV-IYAMGGYN 453
            +N    ++  T  W +    +P        ST   +   IY  GG N
Sbjct: 111 VFNDAYRYDPSTNSWHKLDTRSPTGLVG--ASTFSLNGTKIYFFGGVN 156


>gnl|CDD|116261 pfam07646, Kelch_2, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that Drosophila
           ring canal kelch protein is related to Galactose Oxidase
           for which a structure has been solved. The kelch motif
           forms a beta sheet. Several of these sheets associate to
           form a beta propeller structure as found in pfam00064,
           pfam00400 and pfam00415.
          Length = 48

 Score = 35.0 bits (81), Expect = 0.007
 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 481 RSDASATTLQGKIYITGGFNGHNCLNSCEV--YDPECNQWTLIEPM 524
           R   ++  + GK+Y+ GG  G   L+S ++   DPE N WT +  +
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47



 Score = 33.5 bits (77), Expect = 0.021
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 2/47 (4%)

Query: 386 PRAYHGTAVIGYNIYVIGGFDGN--EYYNSCRCFNAVTKVWKEIAPM 430
           PR  H + V+G  +YV+GG  G      +     +  T VW E+  +
Sbjct: 1   PRYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47



 Score = 33.5 bits (77), Expect = 0.023
 Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 527 RRSGVSCIAYHECIYVIGGFNGMSRMCNGEK---YNPVTKTWSQVPDM 571
            R   + +     +YV+GG  G+    +       +P T  W+++P +
Sbjct: 1   PRYPHASVVVGGKLYVVGGSTGLG-DLSSSDLWVLDPETNVWTELPAL 47



 Score = 32.3 bits (74), Expect = 0.052
 Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 2/47 (4%)

Query: 574 PRSNFAIEVIDDMIFAIGGF--NGVTTIYHVECYDEKTDEWYEATDM 618
           PR   A  V+   ++ +GG    G  +   +   D +T+ W E   +
Sbjct: 1   PRYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47



 Score = 28.1 bits (63), Expect = 1.6
 Identities = 8/46 (17%), Positives = 18/46 (39%), Gaps = 2/46 (4%)

Query: 434 RCYVSTAVHHDVIYAMGGY--NGQRRQNSVEKYNYTENQWSLIAPM 477
           R   ++ V    +Y +GG    G    + +   +   N W+ +  +
Sbjct: 2   RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47


>gnl|CDD|222419 pfam13854, Kelch_5, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that the
           Drosophila ring canal kelch protein is related to
           Galactose Oxidase for which a structure has been solved.
           The kelch motif forms a beta sheet. Several of these
           sheets associate to form a beta propeller structure as
           found in pfam00064, pfam00400 and pfam00415.
          Length = 42

 Score = 33.7 bits (78), Expect = 0.015
 Identities = 10/29 (34%), Positives = 15/29 (51%), Gaps = 2/29 (6%)

Query: 386 PRAYHGTAVIGYNIYVIGGFDG--NEYYN 412
           PRA H   V+G  IY+ GG+     +  +
Sbjct: 4   PRANHCAVVVGGEIYLYGGYTSGNGQSSD 32



 Score = 28.7 bits (65), Expect = 0.74
 Identities = 7/27 (25%), Positives = 11/27 (40%)

Query: 574 PRSNFAIEVIDDMIFAIGGFNGVTTIY 600
           PR+N    V+   I+  GG+       
Sbjct: 4   PRANHCAVVVGGEIYLYGGYTSGNGQS 30


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 35.9 bits (84), Expect = 0.053
 Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 6/48 (12%)

Query: 242 LEEIISSDELNVKSEETVWELIIRWIDY--DPENRKKHIVSLMKNIRL 287
           L EII    LNVK  E   ELI + ++    P+ +KK      K++R+
Sbjct: 174 LNEIIE-KGLNVKQTE---ELIKKLLEKPEKPKKKKKRRKGFSKDVRI 217


>gnl|CDD|205932 pfam13758, Prefoldin_3, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 97

 Score = 32.3 bits (74), Expect = 0.17
 Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 9/71 (12%)

Query: 216 TRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDYDPENRK 275
           TR+ +LR    + R+    + ++E+E++EI+   +   ++ + V +LI R IDY  +N  
Sbjct: 29  TREDLLR----IGRDFGGSL-VNEKEIKEILGEKQGITRTRQQVVDLISRRIDYVKQN-- 81

Query: 276 KHIVSLMKNIR 286
             I +L K + 
Sbjct: 82  --IATLEKRLE 90


>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin.  The members of this family are
           enzymes that cleave peptides. These proteases require
           zinc for catalysis.
          Length = 159

 Score = 31.8 bits (73), Expect = 0.54
 Identities = 13/37 (35%), Positives = 17/37 (45%)

Query: 337 TPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRAD 373
           TP+++R  V   I  A   WS  +P  F E     AD
Sbjct: 15  TPDLSRDEVRRAIRRAFKVWSEVTPLTFTEVPTGTAD 51


>gnl|CDD|170033 PRK09644, PRK09644, RNA polymerase sigma factor SigM; Provisional.
          Length = 165

 Score = 31.4 bits (71), Expect = 0.97
 Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 2/79 (2%)

Query: 200 ILLFSRDHFCKKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETV 259
           I L   D+  +K++ W  K     F+D  R+ +++ F+   E+E I +         +  
Sbjct: 41  IYLEDYDN--QKVKPWLFKVAYHTFIDFVRKEKKVSFVGTDEIEAIQAESTEEYVVAKNS 98

Query: 260 WELIIRWIDYDPENRKKHI 278
           +E +I+ I   P    + I
Sbjct: 99  YEKLIQIIHTLPVIEAQAI 117


>gnl|CDD|236567 PRK09559, PRK09559, putative global regulator; Reviewed.
          Length = 327

 Score = 31.2 bits (71), Expect = 1.8
 Identities = 19/74 (25%), Positives = 33/74 (44%), Gaps = 2/74 (2%)

Query: 98  VLADNSHFPIHRNILCACSDYFRALFTTS--LHDQEKFSLVEIPGVDASVMTSLIEYAYL 155
           V AD S     +++L A  D    +++        + F+ +E   V  + +T L +YA  
Sbjct: 45  VTADVSQLTEDQHLLAAHCDAKGKMWSNLRLFRRGDGFAWIERRSVRENQLTELKKYAVF 104

Query: 156 RKLHITEDNVASLL 169
            K+ I  D+   LL
Sbjct: 105 SKVTIAPDDERVLL 118


>gnl|CDD|222785 PHA00368, PHA00368, internal virion protein D.
          Length = 1315

 Score = 30.9 bits (70), Expect = 2.8
 Identities = 24/111 (21%), Positives = 39/111 (35%), Gaps = 20/111 (18%)

Query: 571 MYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMNIYRSALSACVI 630
            YN   +   + + D  F+ GG             +    E Y    + I R  L A + 
Sbjct: 564 TYNDLYDAMEDAMKDPEFSTGGAKMSR--------EAARQEIYRRVALAIERPELQANLT 615

Query: 631 MGLPNVYDYIHQHRDSLMEEKRQKLLAIEGRQRQNQSEEGNSSQQSVLPSS 681
                V D + +H D     KR+ +        +N +  GN+   S+ P S
Sbjct: 616 KAERKVMDILKEHFDL----KREMM--------ENPAIFGNTKAVSIFPGS 654


>gnl|CDD|181899 PRK09485, mmuM, homocysteine methyltransferase; Provisional.
          Length = 304

 Score = 30.6 bits (70), Expect = 3.1
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query: 555 GEKYNPVTKTWSQVPD 570
           GE Y+ VTKTW    D
Sbjct: 250 GEVYDAVTKTWHGPAD 265


>gnl|CDD|223185 COG0107, HisF, Imidazoleglycerol-phosphate synthase [Amino acid
           transport and metabolism].
          Length = 256

 Score = 30.2 bits (69), Expect = 3.3
 Identities = 18/52 (34%), Positives = 22/52 (42%), Gaps = 14/52 (26%)

Query: 323 YDLELIAQIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETY-DTRAD 373
           YDLEL   +   V  P IA            GG   G P  FVE + + +AD
Sbjct: 186 YDLELTRAVREAVNIPVIAS-----------GG--AGKPEHFVEAFTEGKAD 224


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 29.8 bits (67), Expect = 4.3
 Identities = 20/96 (20%), Positives = 44/96 (45%), Gaps = 10/96 (10%)

Query: 211 KLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDYD 270
           KL  W +    ++  ++ +E+E+     ++EL +          ++E + +L+ R+ + +
Sbjct: 35  KLTPWQQLATKQDVEELRKETEQR----QKELADEKLEVRKQKATKEDL-KLLQRFQEEE 89

Query: 271 PENRKKHIVSLMKNI-----RLGLLETEFFREKVKH 301
               K+ I  L   I     R G+L  + FR+ V+ 
Sbjct: 90  FRATKEDIKRLETIITGLGARWGILSEDAFRQGVRE 125


>gnl|CDD|220221 pfam09402, MSC, Man1-Src1p-C-terminal domain.  MAN1 is an integral
           protein of the inner nuclear membrane which binds to
           chromatin associated proteins and plays a role in
           nuclear organisation. The C terminal nucleoplasmic
           region forms a DNA binding winged helix and binds to
           Smad. This C-terminal tail is also found in S.
           cerevisiae and is thought to consist of three conserved
           helices followed by two downstream strands.
          Length = 326

 Score = 29.7 bits (67), Expect = 5.2
 Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 10/91 (10%)

Query: 210 KKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIIS---SDELNVKSEETVWELIIRW 266
           ++L +   K+          E +    ISE EL E++S   S  +N +  E +W+  I+ 
Sbjct: 104 EELRKRNAKHE-------CGEDDLSPGISENELYELLSEKKSPWINEEEFEDLWKAAIKE 156

Query: 267 IDYDPENRKKHIVSLMKNIRLGLLETEFFRE 297
           +  +PE   +    L KN        +  R 
Sbjct: 157 LKENPEVVIRPEPILSKNGDGSTETQKTLRS 187


>gnl|CDD|240082 cd04731, HisF, The cyclase subunit of imidazoleglycerol phosphate
           synthase (HisF). Imidazole glycerol phosphate synthase
           (IGPS) catalyzes the fifth step of histidine
           biosynthesis, the formation of the imidazole ring. IGPS
           converts
           N1-(5'-phosphoribulosyl)-formimino-5-aminoimidazole-4-
           carboxamide ribonucleotide (PRFAR) to imidazole glycerol
           phosphate (ImGP) and 5'-(5-aminoimidazole-4-carboxamide)
           ribonucleotide (AICAR). This conversion involves two
           tightly coupled reactions in distinct active sites of
           IGPS. The two catalytic domains can be fused, like in
           fungi and plants, or peformed by a heterodimer
           (HisH-glutaminase and HisF-cyclase), like in bacteria.
          Length = 243

 Score = 29.4 bits (67), Expect = 5.4
 Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 17/46 (36%)

Query: 323 YDLELIAQIDGEVPTPEIARPRVPHEILFAIGGWSGG--SPTDFVE 366
           YDLELI  +   V  P IA               SGG   P  FVE
Sbjct: 180 YDLELIRAVSSAVNIPVIA---------------SGGAGKPEHFVE 210


>gnl|CDD|129873 TIGR00791, gntP, gluconate transporter.  This family includes known
           gluconate transporters of E. coli and Bacillus species
           as well as an idonate transporter from E. coli
           [Transport and binding proteins, Carbohydrates, organic
           alcohols, and acids].
          Length = 440

 Score = 29.2 bits (66), Expect = 7.5
 Identities = 10/36 (27%), Positives = 15/36 (41%)

Query: 115 CSDYFRALFTTSLHDQEKFSLVEIPGVDASVMTSLI 150
                      SL  Q+ FS  E+P    S+ T+L+
Sbjct: 197 LGRIVPPSPEASLGSQKTFSEEEMPSFGISLFTALL 232


>gnl|CDD|236761 PRK10795, PRK10795, penicillin-binding protein 2; Provisional.
          Length = 634

 Score = 29.4 bits (66), Expect = 7.9
 Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 42  EDMASRLDLTEDDMTHIENAIKNKKNFSSIDRAMSVQAMQ----ALNEFR 87
           + + S +DLT+DD+          + F+SI    ++  +Q    A+N++R
Sbjct: 104 DALRSVVDLTDDDIAAFRKERARSRRFTSIPVKTNLTEVQVARFAVNQYR 153


>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
          Length = 349

 Score = 29.1 bits (65), Expect = 8.0
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 668 EEGNSSQQSVLPSSNSNVSSSDG 690
           E  +SS  S   S +S  SSSDG
Sbjct: 265 EGRSSSSSSRSSSDSSTSSSSDG 287


>gnl|CDD|217114 pfam02574, S-methyl_trans, Homocysteine S-methyltransferase.  This
           is a family of related homocysteine S-methyltransferases
           enzymes: 5-methyltetrahydrofolate--homocysteine
           S-methyltransferases also known EC:2.1.1.13;
           Betaine--homocysteine S-methyltransferase (vitamin B12
           dependent), EC:2.1.1.5; and Homocysteine
           S-methyltransferase, EC:2.1.1.10.
          Length = 303

 Score = 28.9 bits (65), Expect = 8.8
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 1/48 (2%)

Query: 555 GEKYNPVTKTWSQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHV 602
           GE Y+   K +   PD   P      E +      IGG  G TT  H+
Sbjct: 248 GEPYDAGKKEYDGTPDELAPSYWSLKEFVASGARIIGGCCG-TTPDHI 294


>gnl|CDD|153336 cd07652, F-BAR_Rgd1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Saccharomyces cerevisiae 
           Rho GTPase activating protein Rgd1 and similar proteins.
            F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization.
           Saccharomyces cerevisiae Rgd1 is a GTPase activating
           protein (GAP) with activity towards Rho3p and Rho4p,
           which are involved in bud growth and cytokinesis,
           respectively. At low pH, S. cerevisiae Rgd1 is required
           for cell survival and the activation of the protein
           kinase C pathway, which is important in cell integrity
           and the maintenance of cell shape. It contains an
           N-terminal F-BAR domain and a C-terminal Rho GAP domain.
           The F-BAR domain of S. cerevisiae Rgd1 binds to
           phosphoinositides and plays an important role in the
           localization of the protein to the bud tip/neck during
           the cell cycle. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 234

 Score = 28.5 bits (64), Expect = 9.9
 Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 2/42 (4%)

Query: 52  EDDMTHIENAIKNKKNF-SSIDRAMSVQAMQALNEFREKNLL 92
           E    H   A+K+  +    ID A+ +Q  Q      E  +L
Sbjct: 194 ELLSRHRPEAVKDLFDLILEIDAALRLQ-YQKYALPNELLVL 234


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.135    0.417 

Gapped
Lambda     K      H
   0.267   0.0714    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,708,738
Number of extensions: 3605118
Number of successful extensions: 3281
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3203
Number of HSP's successfully gapped: 93
Length of query: 712
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 608
Effective length of database: 6,324,786
Effective search space: 3845469888
Effective search space used: 3845469888
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.1 bits)