RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17757
         (225 letters)



>gnl|CDD|224276 COG1357, COG1357, Pentapeptide repeats containing protein [Function
           unknown].
          Length = 238

 Score = 78.8 bits (194), Expect = 7e-18
 Identities = 49/131 (37%), Positives = 63/131 (48%), Gaps = 9/131 (6%)

Query: 91  LSRRDVINALILTPI-TAELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACS------ 143
           LS  D+ NA +     +      G +L G++L   +L   N            +      
Sbjct: 61  LSGADLSNANLRGADLSKGADLSGADLEGANLRGANLSGANLSGANLSNATRANLSGADL 120

Query: 144 --VNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANL 201
              NL  ANL G+N++G NL  A L NANL   DL  A L+GADL   +LSG+DL  ANL
Sbjct: 121 SGANLSNANLTGANLSGANLNGANLSNANLSGADLSGADLSGADLSGANLSGADLRGANL 180

Query: 202 RGANLKDAALE 212
            GANL  A L 
Sbjct: 181 SGANLSGADLR 191



 Score = 74.9 bits (184), Expect = 2e-16
 Identities = 40/101 (39%), Positives = 60/101 (59%)

Query: 111 FQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNAN 170
             G +L+G++LS  +L   N            + NL GA+L G++++G +L  A L  A+
Sbjct: 115 LSGADLSGANLSNANLTGANLSGANLNGANLSNANLSGADLSGADLSGADLSGANLSGAD 174

Query: 171 LQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
           L+  +L  A L+GADL N +LSG+DL  A+L GANL +A L
Sbjct: 175 LRGANLSGANLSGADLRNANLSGADLSGADLTGANLVNANL 215



 Score = 74.1 bits (182), Expect = 4e-16
 Identities = 47/119 (39%), Positives = 59/119 (49%), Gaps = 18/119 (15%)

Query: 109 LRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACS-------------VNLKGANLEGSN 155
           L   G NL+G+DLS  +LR  +    KG  L                  NL GANL  + 
Sbjct: 54  LDLSGANLSGADLSNANLRGADL--SKGADLSGADLEGANLRGANLSGANLSGANLSNAT 111

Query: 156 MA---GVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
            A   G +L  A L NANL   +L  A L GA+L N +LSG+DL  A+L GA+L  A L
Sbjct: 112 RANLSGADLSGANLSNANLTGANLSGANLNGANLSNANLSGADLSGADLSGADLSGANL 170



 Score = 65.3 bits (159), Expect = 6e-13
 Identities = 43/105 (40%), Positives = 57/105 (54%), Gaps = 10/105 (9%)

Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
                NL+G+DLS  DL   +              NL GA+L G+N++G NL  A L+NA
Sbjct: 144 NLSNANLSGADLSGADLSGADLS----------GANLSGADLRGANLSGANLSGADLRNA 193

Query: 170 NLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAALELM 214
           NL   DL  A L GA+L N +L G++L  ANL GA+L  A L  +
Sbjct: 194 NLSGADLSGADLTGANLVNANLGGANLFGANLLGADLSGATLGDV 238



 Score = 54.9 bits (132), Expect = 3e-09
 Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 12/103 (11%)

Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
           R QG+    ++L   DL+    +           +N + ANL G +++G NL  A L NA
Sbjct: 18  RLQGIRELKANLLGADLQGSELQ--------LGDLNFENANLSGLDLSGANLSGADLSNA 69

Query: 170 NLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAALE 212
           NL+  DL      GADL   DL G++L  ANL GANL  A L 
Sbjct: 70  NLRGADLS----KGADLSGADLEGANLRGANLSGANLSGANLS 108



 Score = 41.4 bits (97), Expect = 1e-04
 Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 3/69 (4%)

Query: 142 CSVNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANL 201
            S  L+G     +N+ G +L         L + +   A L+G DL   +LSG+DL  ANL
Sbjct: 15  ESSRLQGIRELKANLLGADL---QGSELQLGDLNFENANLSGLDLSGANLSGADLSNANL 71

Query: 202 RGANLKDAA 210
           RGA+L   A
Sbjct: 72  RGADLSKGA 80



 Score = 35.2 bits (81), Expect = 0.013
 Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 142 CSVNLKGANLEGSNMAGVNLRVATLKNANLQ--NCDLRAAVLAGADLENCDLSGSDLHEA 199
           C + L   +LE S + G+    A L  A+LQ     L       A+L   DLSG++L  A
Sbjct: 5   CEIFLADLDLESSRLQGIRELKANLLGADLQGSELQLGDLNFENANLSGLDLSGANLSGA 64

Query: 200 NLRGANLKDAAL 211
           +L  ANL+ A L
Sbjct: 65  DLSNANLRGADL 76



 Score = 29.8 bits (67), Expect = 0.88
 Identities = 26/84 (30%), Positives = 35/84 (41%), Gaps = 15/84 (17%)

Query: 91  LSRRDVINALILTPITAELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGAN 150
           LS  D+  A +     +    +  NL+G+DLS  DL                  NL  AN
Sbjct: 170 LSGADLRGANLSGANLSGADLRNANLSGADLSGADLTGA---------------NLVNAN 214

Query: 151 LEGSNMAGVNLRVATLKNANLQNC 174
           L G+N+ G NL  A L  A L + 
Sbjct: 215 LGGANLFGANLLGADLSGATLGDV 238


>gnl|CDD|185119 PRK15197, PRK15197, secreted effector protein PipB; Provisional.
          Length = 291

 Score = 68.0 bits (166), Expect = 1e-13
 Identities = 47/121 (38%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 101 ILTPITAELRFQGVNLA-----GSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSN 155
           ++T     L  +GVNLA     G DLS +D  N +F+     TL   +VNL GANL  +N
Sbjct: 147 LITE-DGFLNLRGVNLAHKDFQGEDLSDIDASNADFR---ETTLS--NVNLVGANLCCAN 200

Query: 156 MAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAALELML 215
           +  VNL  + +  ANL + DL  A ++G +L    L GSDL +  L GA L   AL L  
Sbjct: 201 LHAVNLMGSNMTKANLTHADLTCANMSGVNLTAAILFGSDLTDTKLNGAKLDKIALTLAK 260

Query: 216 T 216
            
Sbjct: 261 A 261



 Score = 41.5 bits (97), Expect = 1e-04
 Identities = 25/87 (28%), Positives = 39/87 (44%), Gaps = 10/87 (11%)

Query: 130 NFKVRKGLTLPACSVNLKGANL-----EGSNMAGVNLRVATLKNANLQNCDLRAAVLAGA 184
           N +    L      +NL+G NL     +G +++ ++   A  +   L N +L  A L  A
Sbjct: 140 NIQQPDSLITEDGFLNLRGVNLAHKDFQGEDLSDIDASNADFRETTLSNVNLVGANLCCA 199

Query: 185 DLENCDLSGSDLHEANLRGANLKDAAL 211
           +L   +L GS     N+  ANL  A L
Sbjct: 200 NLHAVNLMGS-----NMTKANLTHADL 221


>gnl|CDD|202161 pfam02214, K_tetra, K+ channel tetramerisation domain.  The
          N-terminal, cytoplasmic tetramerisation domain (T1) of
          voltage-gated K+ channels encodes molecular
          determinants for subfamily-specific assembly of
          alpha-subunits into functional tetrameric channels. It
          is distantly related to the BTB/POZ domain pfam00651.
          Length = 92

 Score = 63.3 bits (155), Expect = 2e-13
 Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 16 MNPGDVDSTGAYLIDRSPTYFEPILNYLR-NGQLVLDPNINPEGVLEEARFFGIESLV 72
                D T  Y  DRSP +FE ILN+ R  G+L     +  +  LEE  F+G+  L 
Sbjct: 31 RCDFYDDDTNEYFFDRSPKHFETILNFYRTGGKLHRPEEVCLDSFLEELEFYGLGELA 88


>gnl|CDD|109845 pfam00805, Pentapeptide, Pentapeptide repeats (8 copies).  These
           repeats are found in many cyanobacterial proteins. The
           repeats were first identified in hglK. The function of
           these repeats is unknown. The structure of this repeat
           has been predicted to be a beta-helix. The repeat can be
           approximately described as A(D/N)LXX, where X can be any
           amino acid.
          Length = 40

 Score = 53.5 bits (130), Expect = 3e-10
 Identities = 22/40 (55%), Positives = 26/40 (65%)

Query: 169 ANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKD 208
           ANL   +LR A L GADL   DL+G++L  ANL GANL  
Sbjct: 1   ANLSGANLRGADLRGADLSGADLTGANLSGANLSGANLSG 40



 Score = 48.9 bits (118), Expect = 2e-08
 Identities = 21/36 (58%), Positives = 23/36 (63%)

Query: 174 CDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDA 209
            +L  A L GADL   DLSG+DL  ANL GANL  A
Sbjct: 1   ANLSGANLRGADLRGADLSGADLTGANLSGANLSGA 36



 Score = 45.1 bits (108), Expect = 4e-07
 Identities = 19/40 (47%), Positives = 27/40 (67%)

Query: 154 SNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSG 193
           +N++G NLR A L+ A+L   DL  A L+GA+L   +LSG
Sbjct: 1   ANLSGANLRGADLRGADLSGADLTGANLSGANLSGANLSG 40



 Score = 43.5 bits (104), Expect = 1e-06
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query: 160 NLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDL 196
           NL  A L+ A+L+  DL  A L GA+L   +LSG++L
Sbjct: 2   NLSGANLRGADLRGADLSGADLTGANLSGANLSGANL 38



 Score = 43.5 bits (104), Expect = 1e-06
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 179 AVLAGADLENCDLSGSDLHEANLRGANLKDAALE 212
           A L+GA+L   DL G+DL  A+L GANL  A L 
Sbjct: 1   ANLSGANLRGADLRGADLSGADLTGANLSGANLS 34



 Score = 40.4 bits (96), Expect = 2e-05
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 144 VNLKGANLEGSNMAGVNLRVATLKNANLQNCDLR 177
            NL+GA+L G++++G +L  A L  ANL   +L 
Sbjct: 6   ANLRGADLRGADLSGADLTGANLSGANLSGANLS 39



 Score = 35.8 bits (84), Expect = 8e-04
 Identities = 17/48 (35%), Positives = 22/48 (45%), Gaps = 10/48 (20%)

Query: 111 FQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAG 158
             G NL G+DL   DL   +              NL GANL G+N++G
Sbjct: 3   LSGANLRGADLRGADLSGADLT----------GANLSGANLSGANLSG 40



 Score = 31.6 bits (73), Expect = 0.024
 Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 10/48 (20%)

Query: 115 NLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLR 162
           NL+G++L   DLR  +              +L GANL G+N++G NL 
Sbjct: 2   NLSGANLRGADLRGADL----------SGADLTGANLSGANLSGANLS 39


>gnl|CDD|205777 pfam13599, Pentapeptide_4, Pentapeptide repeats (9 copies). 
          Length = 77

 Score = 54.6 bits (132), Expect = 3e-10
 Identities = 20/69 (28%), Positives = 31/69 (44%)

Query: 144 VNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRG 203
            + +  + E  ++  VN   ATLKN N  NC L     +  DL + D + +DL  A    
Sbjct: 1   ADFRNCSFENCDLKNVNFSNATLKNCNFSNCKLFNVNFSDCDLRDVDFTEADLKNAKFSN 60

Query: 204 ANLKDAALE 212
            +LK+    
Sbjct: 61  CDLKNVNFS 69



 Score = 50.4 bits (121), Expect = 1e-08
 Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 11/87 (12%)

Query: 125 DLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGA 184
           D RN +F+          + +LK  N   + +   N     L N N  +CDLR      A
Sbjct: 2   DFRNCSFE----------NCDLKNVNFSNATLKNCNFSNCKLFNVNFSDCDLRDVDFTEA 51

Query: 185 DLENCDLSGSDLHEANLRGANLKDAAL 211
           DL+N   S  DL   N  G +LKD   
Sbjct: 52  DLKNAKFSNCDLKNVNFSG-SLKDCDF 77



 Score = 46.9 bits (112), Expect = 2e-07
 Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 11/82 (13%)

Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
            F+  +L   + S   L+N NF           +  L   N    ++  V+   A LKNA
Sbjct: 7   SFENCDLKNVNFSNATLKNCNFS----------NCKLFNVNFSDCDLRDVDFTEADLKNA 56

Query: 170 NLQNCDLRAAVLAGADLENCDL 191
              NCDL+    +G  L++CD 
Sbjct: 57  KFSNCDLKNVNFSG-SLKDCDF 77



 Score = 46.5 bits (111), Expect = 3e-07
 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 15/85 (17%)

Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
            F+  +    DL      N+NF           +  LK  N     +  VN     L++ 
Sbjct: 2   DFRNCSFENCDLK-----NVNF----------SNATLKNCNFSNCKLFNVNFSDCDLRDV 46

Query: 170 NLQNCDLRAAVLAGADLENCDLSGS 194
           +    DL+ A  +  DL+N + SGS
Sbjct: 47  DFTEADLKNAKFSNCDLKNVNFSGS 71



 Score = 26.5 bits (59), Expect = 3.7
 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 10/47 (21%)

Query: 108 ELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGS 154
            + F   +L   D +  DL+N  F           + +LK  N  GS
Sbjct: 35  NVNFSDCDLRDVDFTEADLKNAKFS----------NCDLKNVNFSGS 71


>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac.
          Domain in Broad-Complex, Tramtrack and Bric a brac.
          Also known as POZ (poxvirus and zinc finger) domain.
          Known to be a protein-protein interaction motif found
          at the N-termini of several C2H2-type transcription
          factors as well as Shaw-type potassium channels. Known
          structure reveals a tightly intertwined dimer formed
          via interactions between N-terminal strand and helix
          structures. However in a subset of BTB/POZ domains,
          these two secondary structures appear to be missing. Be
          aware SMART predicts BTB/POZ domains without the beta1-
          and alpha1-secondary structures.
          Length = 97

 Score = 52.7 bits (127), Expect = 2e-09
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 1/67 (1%)

Query: 17 NPGDVDSTGAYLIDRSPTYFEPILNYLRNGQLVLDPNINPEGVLEEARFFGIESLVPQLM 76
          +  + D +  YL D SP  F  +LN+L  G+L L    N E +LE A +  I  LV    
Sbjct: 32 DFKESDKSEIYLDDVSPEDFRALLNFLYTGKLDLPEE-NVEELLELADYLQIPGLVELCE 90

Query: 77 EIIMSRE 83
          E ++   
Sbjct: 91 EFLLKLL 97


>gnl|CDD|185118 PRK15196, PRK15196, secreted effector protein PipB2; Provisional.
          Length = 350

 Score = 51.9 bits (124), Expect = 5e-08
 Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 11/107 (10%)

Query: 113 GVNLAGSDLSRLDLRNINFKVRKGLTLPACSV---NLKGANLEGSNMAGVNLRVATLKNA 169
           G NL G+ L   D    NFK   G  L   S+   N K A LE S M G  L  A L  A
Sbjct: 176 GANLEGAVLFMADCEGANFK---GANLSGTSLGDSNFKNACLEDSIMCGATLDHANLTGA 232

Query: 170 NLQN-----CDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
           NLQ+     C +     +GA++++ +LSG+ L  A++ GA L+ A +
Sbjct: 233 NLQHASLLGCSMIECNCSGANMDHTNLSGATLIRADMSGATLQGATI 279



 Score = 51.2 bits (122), Expect = 1e-07
 Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 30/131 (22%)

Query: 111 FQGVNLAGSDL-----------------SRLDLRNINFKVRKGLTLPACSV---NLKGAN 150
           F+G NL+G+ L                 + LD  N+     +  +L  CS+   N  GAN
Sbjct: 194 FKGANLSGTSLGDSNFKNACLEDSIMCGATLDHANLTGANLQHASLLGCSMIECNCSGAN 253

Query: 151 LEGSNMAG-----VNLRVATLKNANLQNCDLRAAVLAGADLE-----NCDLSGSDLHEAN 200
           ++ +N++G      ++  ATL+ A +    +  AVL  A+L      + +L G+DL EAN
Sbjct: 254 MDHTNLSGATLIRADMSGATLQGATIMAAIMEGAVLTRANLRKASFISTNLDGADLAEAN 313

Query: 201 LRGANLKDAAL 211
           L     KD  L
Sbjct: 314 LNNTCFKDCTL 324



 Score = 48.5 bits (115), Expect = 9e-07
 Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 17/106 (16%)

Query: 115 NLAGSDLSRLDLRNINFKVRKGLTLPACS-VNLKGANLEGSNMAGVNLRV---------- 163
           NL  +DLS  +L          L +  C   N KGANL G+++   N +           
Sbjct: 168 NLCDADLSGANLEG------AVLFMADCEGANFKGANLSGTSLGDSNFKNACLEDSIMCG 221

Query: 164 ATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDA 209
           ATL +ANL   +L+ A L G  +  C+ SG+++   NL GA L  A
Sbjct: 222 ATLDHANLTGANLQHASLLGCSMIECNCSGANMDHTNLSGATLIRA 267



 Score = 48.1 bits (114), Expect = 1e-06
 Identities = 32/106 (30%), Positives = 55/106 (51%), Gaps = 20/106 (18%)

Query: 113 GVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGS----------NMAGVNLR 162
           G+NL G+ L+  +L   N           C  +L GANLEG+          N  G NL 
Sbjct: 151 GMNLKGAVLTGANLTAENL----------CDADLSGANLEGAVLFMADCEGANFKGANLS 200

Query: 163 VATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKD 208
             +L ++N +N  L  +++ GA L++ +L+G++L  A+L G ++ +
Sbjct: 201 GTSLGDSNFKNACLEDSIMCGATLDHANLTGANLQHASLLGCSMIE 246



 Score = 47.3 bits (112), Expect = 2e-06
 Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 3/82 (3%)

Query: 134 RKGLTLPACSVNLK---GANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCD 190
           R   +LP  SV L    G NL+G+ + G NL    L +A+L   +L  AVL  AD E  +
Sbjct: 134 RCEFSLPQDSVILTAQGGMNLKGAVLTGANLTAENLCDADLSGANLEGAVLFMADCEGAN 193

Query: 191 LSGSDLHEANLRGANLKDAALE 212
             G++L   +L  +N K+A LE
Sbjct: 194 FKGANLSGTSLGDSNFKNACLE 215



 Score = 43.1 bits (101), Expect = 6e-05
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 5/72 (6%)

Query: 145 NLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLS-----GSDLHEA 199
           NL  A+L G+N+ G  L +A  + AN +  +L    L  ++ +N  L      G+ L  A
Sbjct: 168 NLCDADLSGANLEGAVLFMADCEGANFKGANLSGTSLGDSNFKNACLEDSIMCGATLDHA 227

Query: 200 NLRGANLKDAAL 211
           NL GANL+ A+L
Sbjct: 228 NLTGANLQHASL 239



 Score = 37.3 bits (86), Expect = 0.004
 Identities = 22/87 (25%), Positives = 37/87 (42%), Gaps = 10/87 (11%)

Query: 115 NLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNANLQNC 174
           NL+G+ L R D+     +             +  A +EG+ +   NLR A+  + NL   
Sbjct: 258 NLSGATLIRADMSGATLQ----------GATIMAAIMEGAVLTRANLRKASFISTNLDGA 307

Query: 175 DLRAAVLAGADLENCDLSGSDLHEANL 201
           DL  A L     ++C L+     +A +
Sbjct: 308 DLAEANLNNTCFKDCTLTHLRTEDATM 334


>gnl|CDD|185275 PRK15377, PRK15377, E3 ubiquitin-protein ligase SopA; Provisional.
          Length = 782

 Score = 48.5 bits (115), Expect = 1e-06
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 114 VNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNANLQN 173
           +  A  D S  D RNINF        P+  +   G  ++G N +      + + + +   
Sbjct: 190 LTYADRDFSNADFRNINFSK----INPSGFMLRDGDIIKGFNFSNSKFTYSDISHLHFDE 245

Query: 174 CDLRAAVLAGADLENCDLSGSDLHEANLRGA 204
           C    + L+     N   S SD++EA+L+ +
Sbjct: 246 CRFTYSTLSDVVCSNTKFSNSDMNEASLKYS 276


>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain.  The BTB (for BR-C, ttk and bab) or
           POZ (for Pox virus and Zinc finger) domain is present
           near the N-terminus of a fraction of zinc finger
           (pfam00096) proteins and in proteins that contain the
           pfam01344 motif such as Kelch and a family of pox virus
           proteins. The BTB/POZ domain mediates homomeric
           dimerisation and in some instances heteromeric
           dimerisation. The structure of the dimerised PLZF
           BTB/POZ domain has been solved and consists of a tightly
           intertwined homodimer. The central scaffolding of the
           protein is made up of a cluster of alpha-helices flanked
           by short beta-sheets at both the top and bottom of the
           molecule. POZ domains from several zinc finger proteins
           have been shown to mediate transcriptional repression
           and to interact with components of histone deacetylase
           co-repressor complexes including N-CoR and SMRT. The POZ
           or BTB domain is also known as BR-C/Ttk or ZiN.
          Length = 101

 Score = 31.4 bits (72), Expect = 0.090
 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 1/53 (1%)

Query: 27  YLIDRSPTYFEPILNYLRNGQLVLDPNINPEGVLEEARFFGIESLVPQLMEII 79
            L D SP  FE +L ++  G+L +    N + +L  A    I +L+ +  E +
Sbjct: 49  TLEDVSPEDFEALLEFIYTGKLEITE-ENVDDLLALADKLQIPALIDKCEEFL 100


>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
           permease components [Defense mechanisms].
          Length = 567

 Score = 32.4 bits (74), Expect = 0.18
 Identities = 11/75 (14%), Positives = 26/75 (34%), Gaps = 1/75 (1%)

Query: 65  FFGIESLVPQLMEIIMSRERSRDMMPLSRRDVINALILTPITAELRFQGVNLAGSDLSRL 124
              +  +V  L     + ER  +++            L      + F+ V+       + 
Sbjct: 285 ILQLGEVVSLLQRASAAAERLFELLDEEPEVEDPPDPLKDTIGSIEFENVSF-SYPGKKP 343

Query: 125 DLRNINFKVRKGLTL 139
            L++I+F +  G  +
Sbjct: 344 VLKDISFSIEPGEKV 358


>gnl|CDD|177853 PLN02202, PLN02202, carbonate dehydratase.
          Length = 284

 Score = 32.0 bits (72), Expect = 0.20
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 6/45 (13%)

Query: 108 ELRFQGVNLAGSDLSRLDLRNINFKVRK------GLTLPACSVNL 146
           E RF+G + A  ++ ++D R+I  K RK       LT P CS N+
Sbjct: 176 EERFKGNHTAQVEVGKIDTRHIERKTRKYFRYIGSLTTPPCSENV 220


>gnl|CDD|200495 cd09295, Sema, The Sema domain, a protein interacting module, of
           semaphorins and plexins.  Both semaphorins and plexins
           have a Sema domain on their N-termini. Plexins function
           as receptors for the semaphorins. Evolutionarily,
           plexins may be the ancestor of semaphorins. Semaphorins
           are regulatory molecules in the development of the
           nervous system and in axonal guidance. They also play
           important roles in other biological processes, such as
           angiogenesis, immune regulation, respiration systems,
           and cancer. Semaphorins can be divided into 7 classes.
           Vertebrates have members in classes 3-7, whereas classes
           1 and 2 are known only in invertebrates. Class 2 and 3
           semaphorins are secreted; classes 1 and 4 through 6 are
           transmembrane proteins; and class 7 is membrane
           associated via glycosylphosphatidylinositol (GPI)
           linkage. Plexins are a large family of transmembrane
           proteins, which are divided into four types (A-D)
           according to sequence similarity. In vertebrates, type A
           plexins serve as co-receptors for neuropilins to mediate
           the signalling of class 3 semaphorins. Plexins serve as
           direct receptors for several other members of the
           semaphorin family: class 6 semaphorins signal through
           type A plexins and class 4 semaphorins through type B
           plexins. This family also includes the MET and RON
           receptor tyrosine kinases. The Sema domain is located at
           the N-terminus and contains four disulfide bonds formed
           by eight conserved cysteine residues. It serves to
           recognize and bind receptors.
          Length = 392

 Score = 30.6 bits (69), Expect = 0.55
 Identities = 19/69 (27%), Positives = 26/69 (37%), Gaps = 6/69 (8%)

Query: 101 ILTPITAELRFQ--GVNLA--GSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNM 156
           +L            G N A       RLD+     KVR     P C ++ K +N+ G N+
Sbjct: 69  VLQQKGDLDILAVCGSNAAQPSCGSYRLDVLVELGKVRWPSGRPRCPIDNKHSNM-GVNV 127

Query: 157 AGVNLRVAT 165
               L  AT
Sbjct: 128 DSK-LYSAT 135


>gnl|CDD|221494 pfam12257, DUF3608, Protein of unknown function (DUF3608).  This
           domain family is found in eukaryotes, and is
           approximately 280 amino acids in length. The family is
           found in association with pfam00610.
          Length = 281

 Score = 29.7 bits (67), Expect = 1.2
 Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 10/83 (12%)

Query: 95  DVINALILTPITAELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGS 154
           D +N +    I   LR +  N     +      +    V +G  LP     +K   LE  
Sbjct: 154 DQVNIIHWDEIMETLRKEFANFKRDLMLNKTSEDKGKSVIRGRFLPV----IKSNILEAI 209

Query: 155 NMAGVNLRVATLKNANLQNCDLR 177
           N+A       TL N   +  DL+
Sbjct: 210 NLA------TTLVNDPFRQTDLK 226


>gnl|CDD|181028 PRK07559, PRK07559, 2'-deoxycytidine 5'-triphosphate deaminase;
           Provisional.
          Length = 365

 Score = 28.7 bits (65), Expect = 2.3
 Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 4/34 (11%)

Query: 20  DVDSTGAYLIDRSPTYFEPILNYLRNGQLVLDPN 53
           DVD  GA+ +     ++EP+     +G+L+LDP 
Sbjct: 224 DVDRIGAHDVLD---FWEPLYAR-GDGELILDPG 253


>gnl|CDD|233117 TIGR00759, aceE, pyruvate dehydrogenase E1 component, homodimeric
           type.  Most members of this family are pyruvate
           dehydrogenase complex, E1 component. Note: this family
           was classified as subfamily rather than equivalog
           because it includes a counterexample from Pseudomonas
           putida, MdeB, that is active as an E1 component of an
           alpha-ketoglutarate dehydrogenase complex rather than a
           pyruvate dehydrogase complex. The second pyruvate
           dehydrogenase complex E1 protein from Alcaligenes
           eutrophus, PdhE, complements an aceE mutant of E. coli
           but is not part of a pyruvate dehydrogenase complex
           operon, is more similar to the Pseudomonas putida MdeB
           than to E. coli AceE, and may have also have a different
           primary specificity.
          Length = 885

 Score = 29.0 bits (65), Expect = 2.4
 Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 3/33 (9%)

Query: 46  GQLVLDPNINPEGV---LEEARFFGIESLVPQL 75
            +L+ D  I    V    +EAR FG+E L  Q+
Sbjct: 500 NKLLKDKEIGKRIVPIVPDEARTFGMEGLFRQI 532


>gnl|CDD|182045 PRK09718, PRK09718, hypothetical protein; Validated.
          Length = 512

 Score = 28.9 bits (64), Expect = 2.8
 Identities = 25/107 (23%), Positives = 39/107 (36%), Gaps = 15/107 (14%)

Query: 120 DLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKN----------- 168
           DLS LDL + +FK      +   S NL+    E   +   N   A + N           
Sbjct: 91  DLSDLDLTSCHFKGDVISKVSFLSSNLQHVTFECKEIGDCNFTTAIVDNVIFKCRRLHNV 150

Query: 169 ----ANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
               A+ +  D    +L   D     L+ S+  E  +R +N  +  L
Sbjct: 151 IFIKASGEYVDFSKNILDTVDFSQSQLTHSNFRECQIRNSNFDNCYL 197


>gnl|CDD|220128 pfam09157, TruB-C_2, Pseudouridine synthase II TruB, C-terminal. 
          Members of this family adopt a secondary structure
          consisting of a four-stranded beta sheet and one alpha
          helix. They are predominantly RNA-binding domains,
          mostly found in Pseudouridine synthase II TruB.
          Length = 57

 Score = 26.0 bits (58), Expect = 3.4
 Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 4/31 (12%)

Query: 42 YLRNGQLVLDPNINPEGVL----EEARFFGI 68
              GQ V       EG++    EE RF GI
Sbjct: 12 RFLQGQPVRVDGAPAEGLVRVYGEEGRFLGI 42


>gnl|CDD|100001 cd04955, GT1_like_6, This family is most closely related to the GT1
           family of glycosyltransferases. Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. This group of glycosyltransferases
           is most closely related to the previously defined
           glycosyltransferase family 1 (GT1). The members of this
           family may transfer UDP, ADP, GDP, or CMP linked sugars.
           The diverse enzymatic activities among members of this
           family reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found in
           certain bacteria and Archaea.
          Length = 363

 Score = 28.3 bits (64), Expect = 3.6
 Identities = 11/53 (20%), Positives = 21/53 (39%), Gaps = 1/53 (1%)

Query: 48  LVLDPNINPEGVLEEARFFGIESLVPQLM-EIIMSRERSRDMMPLSRRDVINA 99
           L  D   N E + ++A +F +   +  L+ E+    E    M   +R  +   
Sbjct: 296 LASDNPFNREVLGDKAIYFKVGDDLASLLEELEADPEEVSAMAKAARERIREK 348


>gnl|CDD|226443 COG3933, COG3933, Transcriptional antiterminator [Transcription].
          Length = 470

 Score = 28.1 bits (63), Expect = 3.7
 Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 28  LIDR-SPTYFEPILNYLRNGQLVLDPNINPEGVLEEARFFGIESLVPQLMEIIMSRERSR 86
           L+D  S T F  I++      + + PN++   VLE  R   +   + Q+ + I       
Sbjct: 171 LVDMGSLTSFGSIISEEFGIPVKVIPNVSTSMVLEAGRKISLGLNLEQIYQSIQKDFPCY 230

Query: 87  DMMPLSRRDVINALILTPITAE 108
              P   ++   AL+ T IT E
Sbjct: 231 WCTPWPAKNKKKALVTTCITGE 252


>gnl|CDD|235339 PRK05084, xerS, site-specific tyrosine recombinase XerS; Reviewed.
          Length = 357

 Score = 28.0 bits (63), Expect = 4.1
 Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 17/50 (34%)

Query: 94  RDV-INALILTPITAELRFQGVNLAGSDLSRLDLRNINFK------VRKG 136
           RD+ I ALIL          G  L  S+L  LDL ++N K       RKG
Sbjct: 196 RDLAIIALIL----------GSGLRVSELVNLDLSDLNLKQMTIDVTRKG 235


>gnl|CDD|163637 cd07394, MPP_Vps29, Homo sapiens Vps29 and related proteins,
           metallophosphatase domain.  Vps29 (vacuolar sorting
           protein 29), also known as vacuolar membrane protein
           Pep11, is a subunit of the retromer complex which is
           responsible for the retrieval of mannose-6-phosphate
           receptors (MPRs) from the endosomes for retrograde
           transport back to the Golgi. Vps29 has a phosphoesterase
           fold that acts as a protein interaction scaffold for
           retromer complex assembly as well as a phosphatase with
           specificity for the cytoplasmic tail of the MPR.  The
           retromer includes the following 5 subunits: Vps35,
           Vps26, Vps29, and a dimer of the sorting nexins Vps5
           (Snx1), and Vps17 (Snx2).  Vps29 belongs to the
           metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets. This domain is thought to allow for
           productive metal coordination.
          Length = 178

 Score = 27.2 bits (61), Expect = 5.1
 Identities = 9/17 (52%), Positives = 12/17 (70%), Gaps = 2/17 (11%)

Query: 11  EGGYFMNPGDVDSTGAY 27
           EG +F+NPG    TGA+
Sbjct: 124 EGKFFINPGSA--TGAF 138


>gnl|CDD|234514 TIGR04231, seadorna_VP5, seadornavirus VP5 protein.  This protein
          family occurs in the seadornavirus virus group, with
          designations VP5 in Banna virus, and VP6 in Kadipiro
          virus and Liao ning virus. The function is unassigned.
          Length = 505

 Score = 28.0 bits (62), Expect = 5.3
 Identities = 12/42 (28%), Positives = 16/42 (38%)

Query: 4  KMFAEGSEGGYFMNPGDVDSTGAYLIDRSPTYFEPILNYLRN 45
          K F E  E    +   +  S  A+L   S   F  + N  RN
Sbjct: 19 KFFDEKEEKDVILTASEFKSNNAFLFKDSGNLFLGLENIFRN 60


>gnl|CDD|236500 PRK09405, aceE, pyruvate dehydrogenase subunit E1; Reviewed.
          Length = 891

 Score = 27.8 bits (63), Expect = 5.4
 Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 47  QLVLDPNINPEGVL---EEARFFGIESLVPQL 75
            L+ D  I    V    +EAR FG+E L  Q+
Sbjct: 507 ILLKDKEIGKRIVPIIPDEARTFGMEGLFRQI 538


>gnl|CDD|222805 PHA00527, PHA00527, hypothetical protein.
          Length = 129

 Score = 26.9 bits (59), Expect = 5.9
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 13  GYFMNPGDVDSTGAYLIDRSPTYFEP 38
           G    PGD + T  Y++DR  ++F P
Sbjct: 93  GVTTEPGDANETYCYMLDRMFSHFLP 118


>gnl|CDD|237267 PRK13012, PRK13012, 2-oxoacid dehydrogenase subunit E1;
           Provisional.
          Length = 896

 Score = 27.2 bits (61), Expect = 8.3
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 46  GQLVLDPNINPEGV---LEEARFFGIESLVPQL 75
           G L+ D  + P  V    +EAR FG+ +L  Q+
Sbjct: 514 GNLLKDKALGPRIVPIVADEARTFGMANLFRQV 546


>gnl|CDD|224857 COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism].
          Length = 289

 Score = 26.9 bits (60), Expect = 9.2
 Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 11/37 (29%)

Query: 10 SEGGYFMNPGDVDSTGAYLIDRSPTYFEPILNYLRNG 46
          S   YF+ PGD +    Y ++R           LR+G
Sbjct: 59 SLHSYFLRPGDPEQPIIYDVER-----------LRDG 84


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 27.3 bits (60), Expect = 9.5
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 68   IESLVPQLMEIIMSRERSRDM 88
            IESL   L+EII SR + R+ 
Sbjct: 1241 IESLAHALVEIIKSRSQQRNF 1261


>gnl|CDD|223971 COG1041, COG1041, Predicted DNA modification methylase [DNA
           replication, recombination, and repair].
          Length = 347

 Score = 27.0 bits (60), Expect = 9.8
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 7/45 (15%)

Query: 32  SPTYFEPI-------LNYLRNGQLVLDPNINPEGVLEEARFFGIE 69
            P   +P        L  ++ G+LVLDP     G+L EA   G  
Sbjct: 177 RPGSMDPRLARAMVNLARVKRGELVLDPFCGTGGILIEAGLMGAR 221


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.385 

Gapped
Lambda     K      H
   0.267   0.0732    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,532,966
Number of extensions: 1119026
Number of successful extensions: 1113
Number of sequences better than 10.0: 1
Number of HSP's gapped: 978
Number of HSP's successfully gapped: 67
Length of query: 225
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 132
Effective length of database: 6,812,680
Effective search space: 899273760
Effective search space used: 899273760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.7 bits)