RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8877
         (180 letters)



>gnl|CDD|222419 pfam13854, Kelch_5, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that the
           Drosophila ring canal kelch protein is related to
           Galactose Oxidase for which a structure has been solved.
           The kelch motif forms a beta sheet. Several of these
           sheets associate to form a beta propeller structure as
           found in pfam00064, pfam00400 and pfam00415.
          Length = 42

 Score = 44.5 bits (106), Expect = 4e-07
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 106 VPVGRRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTR 145
           +PV R +H A V G  +Y++GGY     +  +D+Y     
Sbjct: 1   LPVPRANHCAVVVGGEIYLYGGYTSGNGQSSDDVYVLSLP 40



 Score = 24.9 bits (55), Expect = 5.0
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 11 PRRVNHAAVLNDNKIFTFGGY 31
            R NH AV+   +I+ +GGY
Sbjct: 3  VPRANHCAVVVGGEIYLYGGY 23



 Score = 24.9 bits (55), Expect = 5.7
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 47 PPSYRDFHTANIIDGRMYIWGGR 69
           P  R  H A ++ G +Y++GG 
Sbjct: 1  LPVPRANHCAVVVGGEIYLYGGY 23


>gnl|CDD|177844 PLN02193, PLN02193, nitrile-specifier protein.
          Length = 470

 Score = 42.6 bits (100), Expect = 4e-05
 Identities = 40/140 (28%), Positives = 56/140 (40%), Gaps = 16/140 (11%)

Query: 44  DGEPPSYRDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTT 103
            GE P  R  H    +  ++Y +GG         + ++     L   DL+  TW     T
Sbjct: 159 KGEGPGLRCSHGIAQVGNKIYSFGGE-------FTPNQPIDKHLYVFDLETRTWSISPAT 211

Query: 104 GSVP----VGRRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPHGAPP 159
           G VP    +G R  S    G  LY+FGG +      YN  Y + T   EW  + P    P
Sbjct: 212 GDVPHLSCLGVRMVSI---GSTLYVFGGRDA--SRQYNGFYSFDTTTNEWKLLTPVEEGP 266

Query: 160 TKRRRQSCIIKGDTLIMFGG 179
           T R   S     + + +FGG
Sbjct: 267 TPRSFHSMAADEENVYVFGG 286



 Score = 33.8 bits (77), Expect = 0.035
 Identities = 29/134 (21%), Positives = 55/134 (41%), Gaps = 12/134 (8%)

Query: 46  EPPSYRDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGS 105
           E P+ R FH+    +  +Y++GG    + L           L   ++    W   +T G 
Sbjct: 264 EGPTPRSFHSMAADEENVYVFGGVSATARL---------KTLDSYNIVDKKWFHCSTPGD 314

Query: 106 VPVGRRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPHGAPPTKRRRQ 165
               R      V    +++  G+NG      +D++ Y   + +W +V   G  P++R   
Sbjct: 315 SFSIRGGAGLEVVQGKVWVVYGFNGC---EVDDVHYYDPVQDKWTQVETFGVRPSERSVF 371

Query: 166 SCIIKGDTLIMFGG 179
           +    G  +++FGG
Sbjct: 372 ASAAVGKHIVIFGG 385



 Score = 26.5 bits (58), Expect = 9.7
 Identities = 24/87 (27%), Positives = 37/87 (42%), Gaps = 7/87 (8%)

Query: 97  WIRPNTTGSVPVGRRSHSAFVYGDGLYIFGGY---NGLVEEHYNDIYRYCTRRQEWARVI 153
           WI+    G  P  R SH     G+ +Y FGG    N  +++H   +Y +    + W+   
Sbjct: 153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKH---LYVFDLETRTWSISP 209

Query: 154 PHGAPPTKRRRQSCIIK-GDTLIMFGG 179
             G  P        ++  G TL +FGG
Sbjct: 210 ATGDVPHLSCLGVRMVSIGSTLYVFGG 236


>gnl|CDD|201739 pfam01344, Kelch_1, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that the
           Drosophila ring canal kelch protein is related to
           Galactose Oxidase, for which a structure has been
           solved. The kelch motif forms a beta sheet. Several of
           these sheets associate to form a beta propeller
           structure as found in pfam00064, pfam00400 and
           pfam00415.
          Length = 46

 Score = 36.1 bits (84), Expect = 4e-04
 Identities = 10/43 (23%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 110 RRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARV 152
           R      V G  +Y+ GGY+G   +  + +  Y      W+++
Sbjct: 2   RSGAGVVVLGGKIYVIGGYDG--GQSLSSVEVYDPETNTWSKL 42



 Score = 28.0 bits (63), Expect = 0.45
 Identities = 8/24 (33%), Positives = 11/24 (45%)

Query: 12 RRVNHAAVLNDNKIFTFGGYCSGE 35
           R     V+   KI+  GGY  G+
Sbjct: 1  PRSGAGVVVLGGKIYVIGGYDGGQ 24


>gnl|CDD|225597 COG3055, COG3055, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 381

 Score = 39.3 bits (92), Expect = 6e-04
 Identities = 23/80 (28%), Positives = 33/80 (41%), Gaps = 7/80 (8%)

Query: 51  RDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGR 110
           R+   A +I G++Y++GG G       S S     D    D    +W + +T    P G 
Sbjct: 83  RNQAVAAVIGGKLYVFGGYGKSV----SSSPQVFNDAYRYDPSTNSWHKLDTRS--PTGL 136

Query: 111 RSHSAFVYGDG-LYIFGGYN 129
              S F      +Y FGG N
Sbjct: 137 VGASTFSLNGTKIYFFGGVN 156



 Score = 30.8 bits (70), Expect = 0.33
 Identities = 23/79 (29%), Positives = 32/79 (40%), Gaps = 9/79 (11%)

Query: 107 PVGRRSHSAFVYGDG-LYIFGGY----NGLVEEHYNDIYRYCTRRQEWARVIPHGAPPTK 161
           P G R+ +      G LY+FGGY    +   +  +ND YRY      W ++      PT 
Sbjct: 79  PGGARNQAVAAVIGGKLYVFGGYGKSVSSSPQV-FNDAYRYDPSTNSWHKLDTRS--PTG 135

Query: 162 RRRQSCIIKGDT-LIMFGG 179
               S      T +  FGG
Sbjct: 136 LVGASTFSLNGTKIYFFGG 154


>gnl|CDD|237617 PRK14131, PRK14131, N-acetylneuraminic acid mutarotase;
           Provisional.
          Length = 376

 Score = 38.8 bits (91), Expect = 8e-04
 Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 6/80 (7%)

Query: 51  RDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGR 110
           R+   A  IDG++Y++GG G  +S    GS     D+   D K  +W + +T    PVG 
Sbjct: 75  REQAVAAFIDGKLYVFGGIGKTNS---EGSPQVFDDVYKYDPKTNSWQKLDTRS--PVGL 129

Query: 111 RSHSAFVY-GDGLYIFGGYN 129
             H A        YI GG N
Sbjct: 130 AGHVAVSLHNGKAYITGGVN 149



 Score = 26.9 bits (60), Expect = 6.0
 Identities = 15/51 (29%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 107 PVGRRSHSAFVYGDG-LYIFGGYNGLVEEH----YNDIYRYCTRRQEWARV 152
           P G R  +   + DG LY+FGG      E     ++D+Y+Y  +   W ++
Sbjct: 71  PGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKL 121


>gnl|CDD|206134 pfam13964, Kelch_6, Kelch motif. 
          Length = 50

 Score = 34.7 bits (80), Expect = 0.002
 Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%)

Query: 110 RRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPHGAPPTKRR 163
           R  HSA   G  +Y+FGGY+    +  N +  Y      W        P    R
Sbjct: 2   RTGHSAVSVGGKIYVFGGYSNGS-KASNKVLVYDPETGSWE----KLPPLPTPR 50



 Score = 26.3 bits (58), Expect = 2.3
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 13 RVNHAAVLNDNKIFTFGGYCSGEDYVKR 40
          R  H+AV    KI+ FGGY +G     +
Sbjct: 2  RTGHSAVSVGGKIYVFGGYSNGSKASNK 29


>gnl|CDD|234253 TIGR03547, muta_rot_YjhT, mutatrotase, YjhT family.  Members of
           this protein family contain multiple copies of the
           beta-propeller-forming Kelch repeat. All are full-length
           homologs to YjhT of Escherichia coli, which has been
           identified as a mutarotase for sialic acid. This protein
           improves bacterial ability to obtain host sialic acid,
           and thus serves as a virulence factor. Some bacteria
           carry what appears to be a cyclically permuted homolog
           of this protein.
          Length = 346

 Score = 36.9 bits (86), Expect = 0.003
 Identities = 27/80 (33%), Positives = 38/80 (47%), Gaps = 6/80 (7%)

Query: 51  RDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGR 110
           R+   A  IDG++Y++GG G  +S    GS     D+   D K  +W + +T    PVG 
Sbjct: 54  RNQAVAAAIDGKLYVFGGIGKANS---EGSPQVFDDVYRYDPKKNSWQKLDTR--SPVGL 108

Query: 111 RSHSAFVYGDG-LYIFGGYN 129
              S F   +G  Y  GG N
Sbjct: 109 LGASGFSLHNGQAYFTGGVN 128


>gnl|CDD|205596 pfam13418, Kelch_4, Galactose oxidase, central domain. 
          Length = 49

 Score = 33.0 bits (76), Expect = 0.008
 Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 2/45 (4%)

Query: 109 GRRSHSAF-VYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARV 152
            R  H++  +    LY+FGG N       +D++ +      W R+
Sbjct: 1   PRAYHTSTSIGDGRLYLFGGENED-GSVLSDVWVFDLSTNTWTRL 44



 Score = 28.4 bits (64), Expect = 0.39
 Identities = 11/28 (39%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 51 RDFHTANII-DGRMYIWGGRGDESSLYH 77
          R +HT+  I DGR+Y++GG  ++ S+  
Sbjct: 2  RAYHTSTSIGDGRLYLFGGENEDGSVLS 29


>gnl|CDD|222113 pfam13415, Kelch_3, Galactose oxidase, central domain. 
          Length = 48

 Score = 32.3 bits (74), Expect = 0.013
 Identities = 12/45 (26%), Positives = 18/45 (40%), Gaps = 4/45 (8%)

Query: 119 GDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPHGAPPTKRR 163
           G  +Y+FGG         ND++ Y      W ++   G  P  R 
Sbjct: 1   GGKIYVFGGLGDG-GTRLNDLWVYDLDTNTWEKL---GDLPGPRA 41



 Score = 26.9 bits (60), Expect = 1.0
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 11/59 (18%)

Query: 60  DGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGRRSHSAFVY 118
            G++Y++GG GD  +  +    VY      LD   +  +     G +P  R  H+A V 
Sbjct: 1   GGKIYVFGGLGDGGTRLND-LWVYD-----LDTNTWEKL-----GDLPGPRAGHAATVI 48


>gnl|CDD|177814 PLN02153, PLN02153, epithiospecifier protein.
          Length = 341

 Score = 34.6 bits (79), Expect = 0.020
 Identities = 29/89 (32%), Positives = 36/89 (40%), Gaps = 10/89 (11%)

Query: 97  WIRPNTTGSV-PVGRRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPH 155
           WI+    G   P  R SH   V GD LY FGG     E    D+Y +      W+    +
Sbjct: 9   WIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPAN 68

Query: 156 GAPPTKRRRQSC-----IIKGDTLIMFGG 179
           G  P    R SC     +  G  L +FGG
Sbjct: 69  GDVP----RISCLGVRMVAVGTKLYIFGG 93



 Score = 33.8 bits (77), Expect = 0.038
 Identities = 30/114 (26%), Positives = 45/114 (39%), Gaps = 10/114 (8%)

Query: 37  YVKRRPMDGEPPSYRDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFT 96
           ++K     G+ P  R  H   ++  ++Y +GG           +E    DL   D    T
Sbjct: 9   WIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGE-------LKPNEHIDKDLYVFDFNTHT 61

Query: 97  WIRPNTTGSVP-VGRRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEW 149
           W      G VP +          G  LYIFGG +   +  ++D Y Y T + EW
Sbjct: 62  WSIAPANGDVPRISCLGVRMVAVGTKLYIFGGRDE--KREFSDFYSYDTVKNEW 113



 Score = 33.8 bits (77), Expect = 0.038
 Identities = 36/144 (25%), Positives = 52/144 (36%), Gaps = 32/144 (22%)

Query: 9   GGPR-RVNHAAVLNDNKIFTFGGYCSGEDYVKRRPMDGEPPSYRDFHTANIIDGRMY--- 64
           GGP  R  H+   ++N ++ FGG       V +  +   P  +R     NI DG+     
Sbjct: 123 GGPEARTFHSMASDENHVYVFGG-------VSKGGLMKTPERFRTIEAYNIADGKWVQLP 175

Query: 65  --------------------IWGGRGDESSLYHSGSEVYCPDLV-YLDLKYFTWIRPNTT 103
                               IW   G  +S+   G   Y  + V + D     W    TT
Sbjct: 176 DPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETT 235

Query: 104 GSVPVGRRSHSAFVYGDGLYIFGG 127
           G+ P  R   +  V G  + IFGG
Sbjct: 236 GAKPSARSVFAHAVVGKYIIIFGG 259


>gnl|CDD|116261 pfam07646, Kelch_2, Kelch motif.  The kelch motif was initially
           discovered in Kelch. In this protein there are six
           copies of the motif. It has been shown that Drosophila
           ring canal kelch protein is related to Galactose Oxidase
           for which a structure has been solved. The kelch motif
           forms a beta sheet. Several of these sheets associate to
           form a beta propeller structure as found in pfam00064,
           pfam00400 and pfam00415.
          Length = 48

 Score = 29.6 bits (67), Expect = 0.10
 Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 1/44 (2%)

Query: 110 RRSHSAFVYGDG-LYIFGGYNGLVEEHYNDIYRYCTRRQEWARV 152
            R   A V   G LY+ GG  GL +   +D++        W  +
Sbjct: 1   PRYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTEL 44



 Score = 26.9 bits (60), Expect = 1.2
 Identities = 9/40 (22%), Positives = 15/40 (37%)

Query: 12 RRVNHAAVLNDNKIFTFGGYCSGEDYVKRRPMDGEPPSYR 51
           R  HA+V+   K++  GG     D         +P +  
Sbjct: 1  PRYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNV 40



 Score = 25.0 bits (55), Expect = 4.9
 Identities = 8/30 (26%), Positives = 14/30 (46%)

Query: 54 HTANIIDGRMYIWGGRGDESSLYHSGSEVY 83
          H + ++ G++Y+ GG      L  S   V 
Sbjct: 5  HASVVVGGKLYVVGGSTGLGDLSSSDLWVL 34


>gnl|CDD|222983 PHA03098, PHA03098, kelch-like protein; Provisional.
          Length = 534

 Score = 30.9 bits (70), Expect = 0.40
 Identities = 33/180 (18%), Positives = 60/180 (33%), Gaps = 30/180 (16%)

Query: 13  RVNHAAVLNDNKIFTFGG------------YCSGEDYVKRRPMDGEPPSYRDFHTANIID 60
           R N    + +N+I+  GG            +  GE   +  P    P   R       ++
Sbjct: 333 RKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFP---RYNPCVVNVN 389

Query: 61  GRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGRRSHSAFVYGD 120
             +Y+ GG      L  +  E +        L    W + +     P+      A  +  
Sbjct: 390 NLIYVIGGISKNDELLKT-VECF-------SLNTNKWSKGSPL---PISHYGGCAIYHDG 438

Query: 121 GLYIFGGYNGL-VEEHYNDIYRYCTRRQEWARVIPHGAPPTKRRRQSCIIKGDTLIMFGG 179
            +Y+ GG + +   + YN +  Y     +W  +     P   R   S  I  + + + GG
Sbjct: 439 KIYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFP---RINASLCIFNNKIYVVGG 495


>gnl|CDD|237370 PRK13380, PRK13380, glycine cleavage system protein H; Provisional.
          Length = 144

 Score = 28.0 bits (63), Expect = 1.6
 Identities = 12/48 (25%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 77  HSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGRRSHSAFVYGDGLYI 124
            +G E+  P  +   +K   W+R    G+V VG   ++  + GD +++
Sbjct: 4   ENGCEI--PSELRYTIKNHEWLRLEGDGTVTVGITDYAQTMAGDVVFV 49


>gnl|CDD|215414 PLN02772, PLN02772, guanylate kinase.
          Length = 398

 Score = 28.3 bits (63), Expect = 2.3
 Identities = 17/78 (21%), Positives = 30/78 (38%), Gaps = 8/78 (10%)

Query: 44  DGEPPSYRDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTT 103
           +G     ++  T+  I  + Y+ GG  + ++L      V       LD     W+ P   
Sbjct: 18  NGFGVKPKNRETSVTIGDKTYVIGGNHEGNTL---SIGVQI-----LDKITNNWVSPIVL 69

Query: 104 GSVPVGRRSHSAFVYGDG 121
           G+ P   + +SA V    
Sbjct: 70  GTGPKPCKGYSAVVLNKD 87


>gnl|CDD|128874 smart00612, Kelch, Kelch domain. 
          Length = 47

 Score = 26.0 bits (58), Expect = 2.6
 Identities = 8/43 (18%), Positives = 16/43 (37%), Gaps = 7/43 (16%)

Query: 122 LYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPHGAPPTKRRR 164
           +Y+ GG++G   +    +  Y     +W  +     P     R
Sbjct: 2   IYVVGGFDG--GQRLKSVEVYDPETNKWTPL-----PSMPTPR 37


>gnl|CDD|183569 PRK12517, PRK12517, RNA polymerase sigma factor; Provisional.
          Length = 188

 Score = 27.3 bits (61), Expect = 3.5
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query: 128 YNGLVEEHYNDIYRY 142
           Y  LV+  + DIYRY
Sbjct: 25  YEALVKALHADIYRY 39


>gnl|CDD|217361 pfam03089, RAG2, Recombination activating protein 2.  V-D-J
           recombination is the combinatorial process by which the
           huge range of immunoglobulin and T cell binding
           specificity is generated from a limited amount of
           genetic material. This process is synergistically
           activated by RAG1 and RAG2 in developing lymphocytes.
           Defects in RAG2 in humans are a cause of severe combined
           immunodeficiency B cell negative and Omenn syndrome.
          Length = 337

 Score = 27.7 bits (62), Expect = 4.1
 Identities = 29/101 (28%), Positives = 44/101 (43%), Gaps = 24/101 (23%)

Query: 45  GEPPSYRDFHTANIIDGR----MYIWGGRGDESSLYHSG--------SEVYCPDLVYL-D 91
           G+ P  R  HT N++  R      ++GGR    S    G        S V CP  V+L D
Sbjct: 82  GDVPEARYGHTINVVHSRGKTACVLFGGR----SYMPPGQRTTENWNSVVDCPPQVFLID 137

Query: 92  LKY---FTWIRPNTTGSVPVGRRSHSAFVYGDGLYIFGGYN 129
           L++    ++I P        G+  H +    D +YI GG++
Sbjct: 138 LEFGCCTSYILPELQD----GQSFHVSLARNDTVYILGGHS 174


>gnl|CDD|187825 cd09694, Csm6_III-A, CRISPR/Cas system-associated protein Csm6.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Protein of this family often fused to HTH domain;
           loosely associated with CRISPR/Cas systems.
          Length = 181

 Score = 26.7 bits (59), Expect = 5.1
 Identities = 12/51 (23%), Positives = 14/51 (27%)

Query: 109 GRRSHSAFVYGDGLYIFGGYNGLVEEHYNDIYRYCTRRQEWARVIPHGAPP 159
           G R     V G G     G +GL  E   D+                G  P
Sbjct: 85  GARQQVKLVPGLGRDFNYGLSGLSLERMADVLVQLAEGNVGIVTFTGGFKP 135


>gnl|CDD|221616 pfam12516, DUF3719, Protein of unknown function (DUF3719).  This
           domain family is found in eukaryotes, and is
           approximately 70 amino acids in length. There is a
           conserved HLR sequence motif. There are two completely
           conserved residues (W and H) that may be functionally
           important.
          Length = 68

 Score = 25.5 bits (56), Expect = 5.3
 Identities = 11/45 (24%), Positives = 12/45 (26%), Gaps = 10/45 (22%)

Query: 128 YNGLVEEHYNDIYRYCTRRQEWARVIPH-------GAPPTKRRRQ 165
           Y G        +   C   QEWA   PH          P     Q
Sbjct: 3   YEGKDSGQTQGLQEEC---QEWAASFPHLRILGRQLVIPQDEGFQ 44


>gnl|CDD|241369 cd13215, PH-GRAM1_AGT26, Autophagy-related protein 26/Sterol
           3-beta-glucosyltransferase Pleckstrin homology (PH)
           domain, repeat 1.  ATG26 (also called
           UGT51/UDP-glycosyltransferase 51), a member of the
           glycosyltransferase 28 family, resulting in the
           biosynthesis of sterol glucoside. ATG26 in decane
           metabolism and autophagy. There are 32 known
           autophagy-related (ATG) proteins, 17 are components of
           the core autophagic machinery essential for all
           autophagy-related pathways and 15 are the additional
           components required only for certain pathways or
           species. The core autophagic machinery includes 1) the
           ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and
           ATG27), 2) the phosphatidylinositol 3-kinase complex
           (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the
           ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7,
           ATG8, ATG10, ATG12, and ATG16). Less is known about how
           the core machinery is adapted or modulated with
           additional components to accommodate the nonselective
           sequestration of bulk cytosol (autophagosome formation)
           or selective sequestration of specific cargos (Cvt
           vesicle, pexophagosome, or bacteria-containing
           autophagosome formation). The pexophagosome-specific
           additions include the ATG30-ATG11-ATG17
           receptor-adaptors complex, the coiled-coil protein
           ATG25, and the sterol glucosyltransferase ATG26. ATG26
           is necessary for the degradation of medium peroxisomes.
           It contains 2 GRAM domains and a single PH domain. PH
           domains are only found in eukaryotes. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. PH domains also have
           diverse functions. They are often involved in targeting
           proteins to the plasma membrane, but few display strong
           specificity in lipid binding. Any specificity is usually
           determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 130

 Score = 26.4 bits (59), Expect = 6.4
 Identities = 10/37 (27%), Positives = 16/37 (43%), Gaps = 1/37 (2%)

Query: 69  RGDESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGS 105
           +GD  S Y S +++Y P    +DL+Y           
Sbjct: 53  KGDVLSWYRSSTDLYFPHGT-IDLRYGISAEITDKDK 88


>gnl|CDD|177609 PHA03372, PHA03372, DNA packaging terminase subunit 1; Provisional.
          Length = 668

 Score = 27.1 bits (60), Expect = 7.1
 Identities = 13/43 (30%), Positives = 18/43 (41%), Gaps = 4/43 (9%)

Query: 77  HSGSEVYCPDLVYLDLKYFTWIRPNTTGSVPVGRRSHSAFVYG 119
             G  +Y    VYLD  + +  R + TG   VG       +YG
Sbjct: 450 FLGKTLY----VYLDPAFTSNRRASGTGIAAVGTYRDQYIIYG 488


>gnl|CDD|239001 cd02046, hsp47, Heat shock protein 47 (Hsp47), also called
           colligin, because of its collagen binding ability, is a
           chaperone specific for procollagen. It has been shown to
           be essential for collagen biosynthesis, but its exact
           function is still unclear. Hsp47 is a non-inhibitory
           member of the SERPIN superfamily and corresponds to
           clade H.
          Length = 366

 Score = 26.4 bits (58), Expect = 8.3
 Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 4/63 (6%)

Query: 51  RDFHTANIIDGRMYIWGGRGD--ESSLYHSGSEVYCPDLVYLDLKYFTWIRPNTTGSVP- 107
           +D + +N+       W   G+  +  +Y    E+  P L Y D  +   ++ N T S+  
Sbjct: 304 KDLYLSNVFHAAALEWDTEGNPFDPDIY-GREEMRNPKLFYADHPFIFLVKDNKTNSILF 362

Query: 108 VGR 110
           +GR
Sbjct: 363 IGR 365


>gnl|CDD|233756 TIGR02166, dmsA_ynfE, anaerobic dimethyl sulfoxide reductase, A
           subunit, DmsA/YnfE family.  Members of this family
           include known and probable dimethyl sulfoxide reductase
           (DMSO reductase) A chains. In E. coli, dmsA encodes the
           canonical anaerobic DMSO reductase A chain. The paralog
           ynfE, as part of ynfFGH expressed from a multicopy
           plasmid, could complement a dmsABC deletion, suggesting
           a similar function and some overlap in specificity,
           although YnfE could not substitute for DmsA in a mixed
           complex.
          Length = 797

 Score = 26.7 bits (59), Expect = 9.0
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 4/42 (9%)

Query: 49  SYRDFHTANIIDGRMYIWGGRGDESSLYHSGSEVYCPDLVYL 90
            Y  + TA I +   Y +G   D SSL     ++    LV +
Sbjct: 183 QYGSYSTAQINEAMPYTYGISADGSSL----DDIENSKLVVM 220


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.143    0.478 

Gapped
Lambda     K      H
   0.267   0.0802    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,829,109
Number of extensions: 905767
Number of successful extensions: 720
Number of sequences better than 10.0: 1
Number of HSP's gapped: 704
Number of HSP's successfully gapped: 47
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.2 bits)