RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3106
         (433 letters)



>gnl|CDD|238532 cd01099, PAN_AP_HGF, Subfamily of PAN/APPLE-like domains; present
           in N-terminal (N) domains of plasminogen/hepatocyte
           growth factor proteins, and various proteins found in
           Bilateria, such as leech anti-platelet proteins.
           PAN/APPLE domains fulfill diverse biological functions
           by mediating protein-protein or protein-carbohydrate
           interactions.
          Length = 80

 Score = 67.9 bits (166), Expect = 2e-14
 Identities = 27/82 (32%), Positives = 38/82 (46%), Gaps = 2/82 (2%)

Query: 30  ERAWCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMD 89
              + F  V    L+  VK   T  S  +C   CL E +F CRS NYN  + EC +SD D
Sbjct: 1   LNDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKSKECILSDED 60

Query: 90  RITVAGHGAFEETKDIEYLENN 111
           R++           +++Y EN 
Sbjct: 61  RMS--SGVKLLYDSNVDYYENK 80



 Score = 44.4 bits (105), Expect = 4e-06
 Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 4/75 (5%)

Query: 119 LCEFKKLAGRILKTVDSVYQ--DVASIEECRELCLNS-PYRCHSYDYGDTGDLVCRLSHH 175
           L +FK +       V  V     VAS+EEC   CL    + C S++Y +     C LS  
Sbjct: 1   LNDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNY-NYKSKECILSDE 59

Query: 176 SRATLVGIQEYRNTV 190
            R +      Y + V
Sbjct: 60  DRMSSGVKLLYDSNV 74


>gnl|CDD|215662 pfam00024, PAN_1, PAN domain.  The PAN domain contains a conserved
           core of three disulphide bridges. In some members of the
           family there is an additional fourth disulphide bridge
           the links the N and C termini of the domain. The domain
           is found in diverse proteins, in some they mediate
           protein-protein interactions, in others they mediate
           protein-carbohydrate interactions.
          Length = 78

 Score = 59.9 bits (145), Expect = 1e-11
 Identities = 22/78 (28%), Positives = 33/78 (42%), Gaps = 2/78 (2%)

Query: 35  FDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVA 94
           F+RV G SL G    V    S  +C  +C  E   +CRS  YN  + EC +      +  
Sbjct: 3   FERVPGTSLSGTDVSVVDVPSAEECAALCTNEPR-VCRSFTYNPKSKECLLKSSSSGS-L 60

Query: 95  GHGAFEETKDIEYLENNC 112
                   + ++Y E +C
Sbjct: 61  PPRLKRSDQGVDYYEKSC 78



 Score = 54.5 bits (131), Expect = 8e-10
 Identities = 22/61 (36%), Positives = 28/61 (45%), Gaps = 1/61 (1%)

Query: 120 CEFKKLAGRILKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHSRAT 179
           C F+++ G  L   D    DV S EEC  LC N P  C S+ Y +     C L   S  +
Sbjct: 1   CSFERVPGTSLSGTDVSVVDVPSAEECAALCTNEPRVCRSFTY-NPKSKECLLKSSSSGS 59

Query: 180 L 180
           L
Sbjct: 60  L 60


>gnl|CDD|215716 pfam00100, Zona_pellucida, Zona pellucida-like domain. 
          Length = 252

 Score = 58.3 bits (141), Expect = 1e-09
 Identities = 40/157 (25%), Positives = 56/157 (35%), Gaps = 14/157 (8%)

Query: 182 GIQEYRNTVGIEVGNR---NTVIQHHDTIITSSDLGLAVTCQYDLTNKTVSNEVDLGVQG 238
           G    R   GI   N       +     I   SD  L V+C Y   + +VS+     V  
Sbjct: 53  GTTRQRTGDGIVYENEIVSQPSVSVGPPITRDSDRRLPVSCSYTSRSVSVSSVA---VLP 109

Query: 239 DVKSALTEEVVVDSPNVAMKI-TDRSGADVKP-SAEVGDPLALRFEILDPNS--PYEIFV 294
            V           S   ++++ TD S     P + E+GDPL +   +       P  +F+
Sbjct: 110 TVPPVSPSPSGEGSLTFSLRLYTDDSYTSDYPVTIELGDPLYVEATVSVLPRSDPLVLFL 169

Query: 295 RELVAMDGVD---SSEIVLIDSNGCPTDHFIMGPLYK 328
               A  G D   S    LI+ NGCP D      L  
Sbjct: 170 DSCWATPGPDPDSSPRYDLIE-NGCPVDGDSTSTLSH 205


>gnl|CDD|214680 smart00473, PAN_AP, divergent subfamily of APPLE domains.
          Apple-like domains present in Plasminogen, C. elegans
          hypothetical ORFs and the extracellular portion of
          plant receptor-like protein kinases. Predicted to
          possess protein- and/or carbohydrate-binding functions.
          Length = 78

 Score = 50.3 bits (120), Expect = 3e-08
 Identities = 19/54 (35%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 34 CFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSD 87
          CF R+    L GF + V +  S  +C   CL   +  CRS  YNN T  C +  
Sbjct: 5  CFVRLPNTKLPGFSRIVISVASLEECASKCL-NSNCSCRSFTYNNGTKGCLLWS 57



 Score = 39.1 bits (91), Expect = 2e-04
 Identities = 20/56 (35%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 122 FKKLAGRILKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHSR 177
           F +L    L     +   VAS+EEC   CLNS   C S+ Y + G   C L   S 
Sbjct: 6   FVRLPNTKLPGFSRIVISVASLEECASKCLNSNCSCRSFTY-NNGTKGCLLWSESS 60


>gnl|CDD|214579 smart00241, ZP, Zona pellucida (ZP) domain.  ZP proteins are
           responsible for sperm-adhesion fo the zona pellucida. ZP
           domains are also present in multidomain transmembrane
           proteins such as glycoprotein GP2, uromodulin and
           TGF-beta receptor type III (betaglycan).
          Length = 252

 Score = 47.4 bits (113), Expect = 5e-06
 Identities = 37/149 (24%), Positives = 51/149 (34%), Gaps = 27/149 (18%)

Query: 186 YRNTVGIEVGNRNTVIQHHDTIITSSD-LGLAVTCQYDLTNKTVSNEVDLGVQGDVKSAL 244
           Y NT+         V   H   IT  D       C Y    K   N        DV +  
Sbjct: 67  YSNTL--------VVSPFHPGFITRDDRAAYHFQCFYPENEKVSLN-------LDVSTIP 111

Query: 245 TEEVVVDS---PNVAMKI-TDRSGADVKPSA--EVGDPLALRFEILDPNSP-YEIFVREL 297
             E+   S      + ++  D S      SA   +GDP+   +E    + P   + V   
Sbjct: 112 PTELSSVSEGPLTCSYRLYKDDSFGSPYQSADYVLGDPVYHEWECDGADDPPLGLLVDNC 171

Query: 298 VAMDGVDSSEIV---LIDSNGCPTDHFIM 323
            A  G D S      +ID NGCP D ++ 
Sbjct: 172 YATPGPDPSSGPKYFIID-NGCPVDGYLD 199


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
           Provisional.
          Length = 522

 Score = 42.4 bits (100), Expect = 4e-04
 Identities = 16/31 (51%), Positives = 22/31 (70%)

Query: 403 QIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            + + NN ++N NNNNNNNN  N+NN N+N 
Sbjct: 7   HLSVHNNADDNYNNNNNNNNQINSNNPNNNG 37



 Score = 41.6 bits (98), Expect = 7e-04
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
            ++N NNNNNNNN  N+NN NNN N
Sbjct: 14  ADDNYNNNNNNNNQINSNNPNNNGN 38



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 403 QIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
               ++ + +NN ++N NNNNNNNN  N N 
Sbjct: 2   VNLREHLSVHNNADDNYNNNNNNNNQINSNN 32



 Score = 40.1 bits (94), Expect = 0.002
 Identities = 17/26 (65%), Positives = 19/26 (73%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHNK 433
            NNNNNNNN  N+NN NNN NN  +K
Sbjct: 18  YNNNNNNNNQINSNNPNNNGNNQASK 43



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           ++N NNNNNNNN  N+NN NNN +N
Sbjct: 15  DDNYNNNNNNNNQINSNNPNNNGNN 39



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           +N NNNNNNNN  N+NN NNN N+ 
Sbjct: 16  DNYNNNNNNNNQINSNNPNNNGNNQ 40



 Score = 38.6 bits (90), Expect = 0.006
 Identities = 17/26 (65%), Positives = 18/26 (69%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
            N NNNNNNNN  N+NN NNN NN  
Sbjct: 16  DNYNNNNNNNNQINSNNPNNNGNNQA 41



 Score = 35.5 bits (82), Expect = 0.056
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            NNNNNN  N+NN NNN NN  +   +
Sbjct: 20  NNNNNNNQINSNNPNNNGNNQASKLPR 46



 Score = 34.7 bits (80), Expect = 0.092
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            NNNN  N+NN NNN NN  +     K
Sbjct: 22  NNNNNQINSNNPNNNGNNQASKLPRGK 48



 Score = 32.8 bits (75), Expect = 0.35
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           N N   + + +NN ++N NNNNN+N
Sbjct: 1   NVNLREHLSVHNNADDNYNNNNNNN 25



 Score = 29.3 bits (66), Expect = 4.3
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 410 NNNNNNNNNNNNNNNNNNNNNHNK 433
           N N   + + +NN ++N NNN+N 
Sbjct: 1   NVNLREHLSVHNNADDNYNNNNNN 24


>gnl|CDD|214408 MTH00036, ATP8, ATP synthase F0 subunit 8; Validated.
          Length = 54

 Score = 35.4 bits (82), Expect = 0.003
 Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 384 VNFLRYYDSLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNN 429
            NFL     L+           L NN  + N +N+++N +++NNNN
Sbjct: 11  FNFL-----LAWSLVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51



 Score = 28.1 bits (63), Expect = 1.0
 Identities = 6/22 (27%), Positives = 15/22 (68%)

Query: 411 NNNNNNNNNNNNNNNNNNNNHN 432
             NN  + N +N+++N ++++N
Sbjct: 28  LLNNWPSTNASNSSSNTSSSNN 49



 Score = 27.3 bits (61), Expect = 2.6
 Identities = 6/22 (27%), Positives = 13/22 (59%)

Query: 412 NNNNNNNNNNNNNNNNNNNHNK 433
             NN  + N +N+++N ++ N 
Sbjct: 28  LLNNWPSTNASNSSSNTSSSNN 49


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
           short domain found in bacterial type II/III secretory
           system proteins. The architecture of these proteins
           suggest that this family may be functionally analogous
           to pfam03958.
          Length = 95

 Score = 34.2 bits (79), Expect = 0.017
 Identities = 3/24 (12%), Positives = 20/24 (83%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNN 430
            + +N++++++N++N  +++++++
Sbjct: 28  SSGSNSSSSSSNSSNGGSSSSSSS 51



 Score = 33.1 bits (76), Expect = 0.043
 Identities = 3/23 (13%), Positives = 19/23 (82%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNN 430
            +N++++++N++N  +++++++ 
Sbjct: 30  GSNSSSSSSNSSNGGSSSSSSSG 52



 Score = 33.1 bits (76), Expect = 0.050
 Identities = 3/24 (12%), Positives = 20/24 (83%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNN 430
            +N++++++N++N  +++++++ +
Sbjct: 30  GSNSSSSSSNSSNGGSSSSSSSGD 53



 Score = 32.3 bits (74), Expect = 0.100
 Identities = 3/25 (12%), Positives = 21/25 (84%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           N++++++N++N  +++++++ ++ +
Sbjct: 32  NSSSSSSNSSNGGSSSSSSSGDSSS 56



 Score = 31.9 bits (73), Expect = 0.11
 Identities = 3/24 (12%), Positives = 19/24 (79%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNN 430
             +++ +N++++++N++N  ++++
Sbjct: 25  SVSSSGSNSSSSSSNSSNGGSSSS 48



 Score = 31.9 bits (73), Expect = 0.11
 Identities = 3/23 (13%), Positives = 20/23 (86%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNN 430
           +++ +N++++++N++N  +++++
Sbjct: 27  SSSGSNSSSSSSNSSNGGSSSSS 49



 Score = 30.8 bits (70), Expect = 0.31
 Identities = 2/23 (8%), Positives = 19/23 (82%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNN 430
           ++++N++N  +++++++ ++++ 
Sbjct: 35  SSSSNSSNGGSSSSSSSGDSSSG 57



 Score = 27.3 bits (61), Expect = 5.2
 Identities = 3/22 (13%), Positives = 17/22 (77%)

Query: 411 NNNNNNNNNNNNNNNNNNNNHN 432
            +++ +N++++++N++N  + +
Sbjct: 26  VSSSGSNSSSSSSNSSNGGSSS 47


>gnl|CDD|205936 pfam13763, DUF4167, Domain of unknown function (DUF4167). 
          Length = 80

 Score = 33.4 bits (77), Expect = 0.028
 Identities = 10/26 (38%), Positives = 12/26 (46%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHNK 433
              NNNN NNN +  N  N   + N 
Sbjct: 1   RGRNNNNRNNNRSGGNPRNRVFDSNG 26



 Score = 33.1 bits (76), Expect = 0.033
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNH 431
             NNNN NNN +  N  N   ++N 
Sbjct: 2   GRNNNNRNNNRSGGNPRNRVFDSNG 26



 Score = 31.1 bits (71), Expect = 0.20
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNH 431
           +  NNNN NNN +  N  N   +++
Sbjct: 1   RGRNNNNRNNNRSGGNPRNRVFDSN 25



 Score = 30.7 bits (70), Expect = 0.23
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            NNN NNN +  N  N   ++N    K
Sbjct: 4   NNNNRNNNRSGGNPRNRVFDSNGPDVK 30



 Score = 27.3 bits (61), Expect = 3.6
 Identities = 7/26 (26%), Positives = 11/26 (42%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
            N NNN +  N  N   ++N  +   
Sbjct: 6   NNRNNNRSGGNPRNRVFDSNGPDVKV 31


>gnl|CDD|152107 pfam11671, Apis_Csd, Complementary sex determiner protein.  This
           family of proteins represents the complementary sex
           determiner in the honeybee. In the honeybee, the
           mechanism of sex determination depends on the csd gene
           which produces an SR-type protein. Males are homozygous
           while females are homozygous for the csd gene.
           Heterozygosity generates an active protein which
           initiates female development.
          Length = 146

 Score = 34.7 bits (79), Expect = 0.036
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 411 NNNNNNNNNNNNNNNNNNNNHNK 433
           + +N +NN N NNN NN NN+NK
Sbjct: 51  SLSNLSNNYNYNNNYNNYNNYNK 73



 Score = 34.3 bits (78), Expect = 0.038
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 406 LQNNNNNNNNNNNNNNNNNNNNNNNH 431
           + + +N +NN N NNN NN NN N H
Sbjct: 49  ISSLSNLSNNYNYNNNYNNYNNYNKH 74



 Score = 32.0 bits (72), Expect = 0.25
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 403 QIFLQNNNNNNNNNNNNNNNNNNNNNNNH 431
           +I    +N +NN N NNN NN NN N ++
Sbjct: 47  KIISSLSNLSNNYNYNNNYNNYNNYNKHY 75



 Score = 32.0 bits (72), Expect = 0.29
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           +N +NN N NNN NN NN N + +N
Sbjct: 53  SNLSNNYNYNNNYNNYNNYNKHYYN 77



 Score = 29.3 bits (65), Expect = 1.9
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 406 LQNNNNNNNNNNNNNNNNNNNNNNNH 431
           L NN N NNN NN NN N +  N N+
Sbjct: 55  LSNNYNYNNNYNNYNNYNKHYYNINY 80



 Score = 29.3 bits (65), Expect = 2.0
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
            +NN N NNN NN NN N +  N N
Sbjct: 55  LSNNYNYNNNYNNYNNYNKHYYNIN 79


>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
           Provisional.
          Length = 1466

 Score = 36.2 bits (83), Expect = 0.039
 Identities = 16/24 (66%), Positives = 20/24 (83%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNN 430
           ++N  NNN NN ++NNNNNNNNNN
Sbjct: 672 KDNKENNNKNNKDDNNNNNNNNNN 695



 Score = 36.2 bits (83), Expect = 0.042
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 401 PLQIFLQNNNNNNNNNNNNNNNNNNNNNNN 430
           P +   +NNN NN ++NNNNNNNNNN  NN
Sbjct: 670 PTKDNKENNNKNNKDDNNNNNNNNNNKINN 699



 Score = 35.8 bits (82), Expect = 0.054
 Identities = 17/25 (68%), Positives = 19/25 (76%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           N  NNN NN ++NNNNNNNNNN  N
Sbjct: 674 NKENNNKNNKDDNNNNNNNNNNKIN 698



 Score = 35.4 bits (81), Expect = 0.060
 Identities = 16/24 (66%), Positives = 20/24 (83%)

Query: 410 NNNNNNNNNNNNNNNNNNNNNHNK 433
           +N  NNN NN ++NNNNNNNN+NK
Sbjct: 673 DNKENNNKNNKDDNNNNNNNNNNK 696



 Score = 35.0 bits (80), Expect = 0.096
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
             NNN NN ++NNNNNNNNNN  +N
Sbjct: 675 KENNNKNNKDDNNNNNNNNNNKINN 699



 Score = 34.6 bits (79), Expect = 0.13
 Identities = 15/23 (65%), Positives = 19/23 (82%)

Query: 409 NNNNNNNNNNNNNNNNNNNNNNH 431
           +N  NNN NN ++NNNNNNNNN+
Sbjct: 673 DNKENNNKNNKDDNNNNNNNNNN 695



 Score = 33.1 bits (75), Expect = 0.32
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNH 431
           + NNN NN ++NNNNNNNNNN  N+
Sbjct: 675 KENNNKNNKDDNNNNNNNNNNKINN 699



 Score = 31.2 bits (70), Expect = 1.4
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNH 431
             NN ++NNNNNNNNNN  NN  ++
Sbjct: 679 NKNNKDDNNNNNNNNNNKINNAGSY 703


>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
          Length = 536

 Score = 35.1 bits (80), Expect = 0.079
 Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 4/36 (11%)

Query: 401 PLQIFLQNNNNN----NNNNNNNNNNNNNNNNNNHN 432
           P+ ++ Q  N N    N NNNN N +NNN N +N+N
Sbjct: 79  PVTLYQQQPNANDSYPNGNNNNPNGDNNNPNGSNNN 114



 Score = 31.7 bits (71), Expect = 0.91
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 400 SPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHN 432
           +P     Q   N N++  N NNNN N +NNN N
Sbjct: 77  APPVTLYQQQPNANDSYPNGNNNNPNGDNNNPN 109


>gnl|CDD|197324 cd09090, INPP5c_ScInp51p-like, Catalytic inositol polyphosphate
           5-phosphatase (INPP5c) domain of Saccharomyces
           cerevisiae Inp51p, Inp52p, and Inp53p, and related
           proteins.  This subfamily contains the INPP5c domain of
           three Saccharomyces cerevisiae synaptojanin-like
           inositol polyphosphate 5-phosphatases (INP51, INP52, and
           INP53), Schizosaccharomyces pombe synaptojanin
           (SPsynaptojanin), and related proteins. It belongs to a
           family of Mg2+-dependent inositol polyphosphate
           5-phosphatases, which hydrolyze the 5-phosphate from the
           inositol ring of various 5-position phosphorylated
           phosphoinositides (PIs) and inositol phosphates (IPs),
           and to the large EEP
           (exonuclease/endonuclease/phosphatase) superfamily that
           contains functionally diverse enzymes that share a
           common catalytic mechanism of cleaving phosphodiester
           bonds. In addition to this INPP5c domain, these proteins
           have an N-terminal catalytic Sac1-like domain (found in
           other proteins including the phophoinositide phosphatase
           Sac1p), and a C-terminal  proline-rich domain (PRD). The
           Sac1 domain allows Inp52p and Inp53p to recognize and
           dephosphorylate a wider range of substrates including
           PI3P, PI4P, and PI(3,5)P2. The Sac1 domain of Inp51p is
           non-functional. Disruption of any two of INP51, INP52,
           and INP53, in S. cerevisiae leads to abnormal vacuolar
           and plasma membrane morphology. During hyperosmotic
           stress, Inp52p and Inp53p localize at actin patches,
           where they may facilitate the hydrolysis of PI(4,5)P2,
           and consequently promote actin rearrangement to regulate
           cell growth. SPsynaptojanin is also active against a
           range of soluble and lipid inositol phosphates,
           including I(1,4,5)P3, I(1,3,4,5)P4, I(1,4,5,6)P4,
           PI(4,5)P2, and PIP3. Transformation of S. cerevisiae
           with a plasmid expressing the SPsynaptojanin
           5-phosphatase domain rescues inp51/inp52/inp53
           triple-mutant strains.
          Length = 291

 Score = 34.2 bits (79), Expect = 0.11
 Identities = 19/78 (24%), Positives = 32/78 (41%), Gaps = 6/78 (7%)

Query: 159 SYDYGDTGDLVCRLSHHSRATLVGIQE-YRNTVGIEVG---NRNTVIQHHDTIITSSDLG 214
            +DYGDT    C ++ H  A L   +E   +   I  G   +R   I+ HD +I   D  
Sbjct: 127 RFDYGDTS--FCFVTSHLAAGLTNYEERNNDYKTIARGLRFSRGRTIKDHDHVIWLGDFN 184

Query: 215 LAVTCQYDLTNKTVSNEV 232
             ++   +   + + N  
Sbjct: 185 YRISLTNEDVRRFILNGK 202


>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
           (Spore_YhcN_YlaJ).  This entry contains YhcN and YlaJ,
           which are predicted lipoproteins that have been detected
           as spore proteins but not vegetative proteins in
           Bacillus subtilis. Both appear to be expressed under
           control of the RNA polymerase sigma-G factor. The
           YlaJ-like members of this family have a low-complexity,
           strongly acidic, 40-residue C-terminal domain.
          Length = 169

 Score = 32.3 bits (74), Expect = 0.25
 Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 406 LQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           + +N+ N NN NN NN NNNN      K
Sbjct: 23  VTDNDGNTNNVNNRNNRNNNNGYGRLTK 50



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 13/31 (41%), Positives = 14/31 (45%)

Query: 403 QIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           Q    N+ N NN NN NN NNNN        
Sbjct: 21  QNVTDNDGNTNNVNNRNNRNNNNGYGRLTKN 51



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 12/25 (48%), Positives = 13/25 (52%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
            N NN NN NN NNNN       +N
Sbjct: 28  GNTNNVNNRNNRNNNNGYGRLTKNN 52



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 10/25 (40%), Positives = 10/25 (40%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           N NN NNNN       NN    N  
Sbjct: 35  NRNNRNNNNGYGRLTKNNVTGENGY 59



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 10/25 (40%), Positives = 11/25 (44%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           NN NNNN       NN    N  +N
Sbjct: 37  NNRNNNNGYGRLTKNNVTGENGYNN 61



 Score = 28.5 bits (64), Expect = 5.5
 Identities = 12/27 (44%), Positives = 13/27 (48%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            NN NN NN NNNN       NN   +
Sbjct: 30  TNNVNNRNNRNNNNGYGRLTKNNVTGE 56



 Score = 28.1 bits (63), Expect = 6.9
 Identities = 11/25 (44%), Positives = 11/25 (44%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
            NN NN NNNN       NN    N
Sbjct: 33  VNNRNNRNNNNGYGRLTKNNVTGEN 57



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 12/26 (46%), Positives = 12/26 (46%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
             NN NN NN NNNN       NN  
Sbjct: 29  NTNNVNNRNNRNNNNGYGRLTKNNVT 54


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 33.2 bits (75), Expect = 0.27
 Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 6/32 (18%)

Query: 408 NNNNNN------NNNNNNNNNNNNNNNNNHNK 433
           N NN N      N N N N+N N N+N N N+
Sbjct: 324 NTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQ 355



 Score = 31.7 bits (71), Expect = 0.73
 Identities = 14/31 (45%), Positives = 15/31 (48%), Gaps = 6/31 (19%)

Query: 408 NNNNNNNNNN------NNNNNNNNNNNNNHN 432
           N N N NN N      N N N N+N N N N
Sbjct: 320 NTNTNTNNTNTSTPSKNTNTNTNSNTNTNSN 350



 Score = 31.7 bits (71), Expect = 0.92
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
            N N+N N N+N N N  ++NNN N
Sbjct: 339 TNTNSNTNTNSNTNANQGSSNNNSN 363



 Score = 30.5 bits (68), Expect = 1.7
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 409 NNNNNNNNNNNNNNNNNNNNNNHN 432
           N N N N+N N N+N N N  + N
Sbjct: 336 NTNTNTNSNTNTNSNTNANQGSSN 359



 Score = 30.5 bits (68), Expect = 1.8
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           N N N+N N N+N N N  ++NN++
Sbjct: 338 NTNTNSNTNTNSNTNANQGSSNNNS 362



 Score = 30.2 bits (67), Expect = 2.7
 Identities = 14/31 (45%), Positives = 15/31 (48%), Gaps = 6/31 (19%)

Query: 408 NNNNNNNNNNNN------NNNNNNNNNNNHN 432
           N N N N NN N      N N N N+N N N
Sbjct: 318 NTNTNTNTNNTNTSTPSKNTNTNTNSNTNTN 348


>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1).  This
           family consists of the yeast mitochondrial ribosomal
           proteins VAR1. Mitochondria possess their own ribosomes
           responsible for the synthesis of a small number of
           proteins encoded by the mitochondrial genome. In yeast
           the two ribosomal RNAs and a single ribosomal protein,
           VAR1, are products of mitochondrial genes, and the
           remaining approximately 80 ribosomal proteins are
           encoded in the nucleus. VAR1 along with 15S rRNA are
           necessary for the formation of mature 37S subunits.
          Length = 350

 Score = 32.6 bits (74), Expect = 0.34
 Identities = 15/28 (53%), Positives = 19/28 (67%)

Query: 405 FLQNNNNNNNNNNNNNNNNNNNNNNNHN 432
           ++ N NNNNN   NN  NN+NNN N +N
Sbjct: 229 YINNINNNNNIKYNNMINNSNNNLNINN 256



 Score = 32.6 bits (74), Expect = 0.42
 Identities = 15/25 (60%), Positives = 17/25 (68%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           NNNN   NN  NN+NNN N NN +N
Sbjct: 235 NNNNIKYNNMINNSNNNLNINNIYN 259



 Score = 32.2 bits (73), Expect = 0.46
 Identities = 15/26 (57%), Positives = 16/26 (61%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHNK 433
           NNNNN   NN  NN+NNN N NN   
Sbjct: 234 NNNNNIKYNNMINNSNNNLNINNIYN 259



 Score = 31.5 bits (71), Expect = 0.94
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 406 LQNNNNNNNNNNNNNNNNNNNNNNN 430
           + N+NNN N NN  NN N N  + +
Sbjct: 245 INNSNNNLNINNIYNNLNINKISMD 269



 Score = 31.1 bits (70), Expect = 1.0
 Identities = 14/25 (56%), Positives = 16/25 (64%)

Query: 409 NNNNNNNNNNNNNNNNNNNNNNHNK 433
           NN NNNNN   NN  NN+NNN +  
Sbjct: 231 NNINNNNNIKYNNMINNSNNNLNIN 255



 Score = 31.1 bits (70), Expect = 1.3
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 409 NNNNNNNNNNNNNNNNNNNNNNHN 432
           NN+NNN N NN  NN N N  + +
Sbjct: 246 NNSNNNLNINNIYNNLNINKISMD 269



 Score = 30.7 bits (69), Expect = 1.5
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 404 IFLQNNNNNNNNNNNNNNNNNNNNNN 429
           +   +NNN N NN  NN N N  + +
Sbjct: 244 MINNSNNNLNINNIYNNLNINKISMD 269



 Score = 29.9 bits (67), Expect = 2.6
 Identities = 22/64 (34%), Positives = 30/64 (46%), Gaps = 7/64 (10%)

Query: 370 IIIYGDINDTNGESVNFLRYYDSLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNN 429
           I+  G +N     S+       SLS        +++L+NN N N   N   NN+N  NNN
Sbjct: 281 ILFKGRLNKNISRSIKTRLLNGSLSN-------KLYLKNNINKNYKLNYIPNNHNIYNNN 333

Query: 430 NHNK 433
           N NK
Sbjct: 334 NVNK 337



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 14/25 (56%), Positives = 15/25 (60%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           NN  NN+NNN N NN  NN N N  
Sbjct: 242 NNMINNSNNNLNINNIYNNLNINKI 266



 Score = 29.5 bits (66), Expect = 3.3
 Identities = 13/28 (46%), Positives = 15/28 (53%)

Query: 406 LQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
             N  NN+NNN N NN  NN N N  + 
Sbjct: 241 YNNMINNSNNNLNINNIYNNLNINKISM 268



 Score = 28.0 bits (62), Expect = 9.3
 Identities = 16/29 (55%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 406 LQNNNNNNNNNNNNNNNNN--NNNNNNHN 432
           +  N  NN NNNNN   NN  NN+NNN N
Sbjct: 225 ISMNYINNINNNNNIKYNNMINNSNNNLN 253


>gnl|CDD|221195 pfam11737, DUF3300, Protein of unknown function (DUF3300).  This
           hypothetical bacterial gene product has a long
           hydrophobic segment and is thus likely to be a membrane
           protein.
          Length = 230

 Score = 32.2 bits (74), Expect = 0.40
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHN 432
           +N+NN  N NN+   ++      HN
Sbjct: 206 HNHNNRINRNNDRVVSSGGQRWQHN 230



 Score = 29.2 bits (66), Expect = 3.6
 Identities = 7/24 (29%), Positives = 13/24 (54%)

Query: 410 NNNNNNNNNNNNNNNNNNNNNHNK 433
           N+N+NN  N NN+   ++     +
Sbjct: 205 NHNHNNRINRNNDRVVSSGGQRWQ 228


>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 628

 Score = 32.3 bits (74), Expect = 0.52
 Identities = 15/38 (39%), Positives = 21/38 (55%)

Query: 393 LSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNN 430
           L + FF S L     +N +   NNNNN+  N NN+N +
Sbjct: 21  LISIFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFS 58



 Score = 30.3 bits (69), Expect = 2.0
 Identities = 13/39 (33%), Positives = 18/39 (46%)

Query: 392 SLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNN 430
            L    F S     +  +N +   NNNNN+  N NN+N 
Sbjct: 19  ILLISIFFSSNLPNISTSNASLGGNNNNNSIINYNNSNF 57



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 392 SLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           +LS    L     F  N  N + +N +   NNNNN+  N+N 
Sbjct: 13  NLSIFIILLISIFFSSNLPNISTSNASLGGNNNNNSIINYNN 54



 Score = 28.4 bits (64), Expect = 8.1
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 392 SLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNN 426
             S    +S     L  NNNNN+  N NN+N + +
Sbjct: 26  FSSNLPNISTSNASLGGNNNNNSIINYNNSNFSIS 60


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
           Provisional.
          Length = 2722

 Score = 32.5 bits (74), Expect = 0.53
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 398 FLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNN 430
           F      F    N   ++NNNNNN NN++ N  
Sbjct: 37  FQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRK 69



 Score = 31.7 bits (72), Expect = 0.84
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 401 PLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
             Q    N  N  N   ++NNNNNN NN++ N+
Sbjct: 36  NFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINR 68



 Score = 30.2 bits (68), Expect = 2.8
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 409 NNNNNNNNNNNNNNNNNNNNNNHN 432
           NNNNNN NN++ N   +N N    
Sbjct: 55  NNNNNNYNNSSINRKKDNKNYMKR 78



 Score = 28.6 bits (64), Expect = 9.0
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNH 431
           NNNNNN NN++ N   +N N    
Sbjct: 55  NNNNNNYNNSSINRKKDNKNYMKR 78


>gnl|CDD|218858 pfam06024, DUF912, Nucleopolyhedrovirus protein of unknown function
           (DUF912).  This family consists of several
           Nucleopolyhedrovirus proteins of unknown function.
          Length = 101

 Score = 30.0 bits (68), Expect = 0.76
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 401 PLQIFLQNNNNNNNNNNNNNNNNNNNNNNN 430
           PL++ L ++N++NN   N NN  +    N+
Sbjct: 18  PLKLALNDDNDDNNAPINANNTQSVYPKND 47



 Score = 28.4 bits (64), Expect = 2.8
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 410 NNNNNNNNNNNNNNNNNNNNNHNK 433
           N++N++NN   N NN  +    N 
Sbjct: 24  NDDNDDNNAPINANNTQSVYPKND 47


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 31.6 bits (72), Expect = 0.83
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHNK 433
           N N    NN +N NNN N NN    K
Sbjct: 217 NINKEEKNNGSNVNNNGNKNNKEEQK 242



 Score = 31.2 bits (71), Expect = 1.2
 Identities = 10/26 (38%), Positives = 11/26 (42%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHNK 433
           N    NN +N NNN N NN       
Sbjct: 219 NKEEKNNGSNVNNNGNKNNKEEQKGN 244



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 402 LQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           L   +++   N N    NN +N NNN N +NK
Sbjct: 207 LDTLIKSKEININKEEKNNGSNVNNNGNKNNK 238



 Score = 30.1 bits (68), Expect = 2.8
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 401 PLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHN 432
             +I +     NN +N NNN N NN      N
Sbjct: 213 SKEININKEEKNNGSNVNNNGNKNNKEEQKGN 244



 Score = 28.5 bits (64), Expect = 7.6
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHNK 433
               N N    NN +N NNN N N+ +
Sbjct: 213 SKEININKEEKNNGSNVNNNGNKNNKE 239


>gnl|CDD|238533 cd01100, APPLE_Factor_XI_like, Subfamily of PAN/APPLE-like
          domains; present in plasma prekallikrein/coagulation
          factor XI, microneme antigen proteins, and a few
          prokaryotic proteins. PAN/APPLE domains fulfill diverse
          biological functions by mediating protein-protein or
          protein-carbohydrate interactions.
          Length = 73

 Score = 28.9 bits (65), Expect = 0.96
 Identities = 12/52 (23%), Positives = 18/52 (34%), Gaps = 2/52 (3%)

Query: 34 CFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHM 85
          CF +       G       A+S   C   C    D  C +  YN  + +C +
Sbjct: 5  CFRQGSNVDFRGGDLSTVFASSAEQCQAACTA--DPGCLAFTYNTKSKKCFL 54


>gnl|CDD|188873 cd09213, Luminal_IRE1_like, The Luminal domain, a dimerization
           domain, of Inositol-requiring protein 1-like proteins.
           The Luminal domain is a dimerization domain present in
           Inositol-requiring protein 1 (IRE1), eukaryotic
           translation Initiation Factor 2-Alpha Kinase 3
           (EIF2AK3), and similar proteins. IRE1 and EIF2AK3 are
           serine/threonine protein kinases (STKs) and are type I
           transmembrane proteins that are localized in the
           endoplasmic reticulum (ER). They are kinase receptors
           that are activated through the release of BiP, a
           chaperone bound to their luminal domains under
           unstressed conditions. This results in dimerization
           through their luminal domains, allowing
           trans-autophosphorylation of their kinase domains and
           activation. They play roles in the signaling of the
           unfolded protein response (UPR), which is activated when
           protein misfolding is detected in the ER in order to
           decrease the synthesis of new proteins and increase the
           capacity of the ER to cope with the stress. IRE1, also
           called Endoplasmic reticulum (ER)-to-nucleus signaling
           protein (or ERN), contains an endoribonuclease domain in
           its cytoplasmic side and acts as an ER stress sensor. It
           is the oldest and most conserved component of the UPR in
           eukaryotes. Its activation results in the cleavage of
           its mRNA substrate, HAC1 in yeast and Xbp1 in metazoans,
           promoting a splicing event that enables translation into
           a transcription factor which activates the UPR. EIF2AK3,
           also called PKR-like Endoplasmic Reticulum Kinase
           (PERK), phosphorylates the alpha subunit of eIF-2,
           resulting in the downregulation of protein synthesis. It
           functions as the central regulator of translational
           control during the UPR pathway. In addition to the eIF-2
           alpha subunit, EIF2AK3 also phosphorylates Nrf2, a
           leucine zipper transcription factor which regulates
           cellular redox status and promotes cell survival during
           the UPR.
          Length = 312

 Score = 30.9 bits (70), Expect = 1.1
 Identities = 4/39 (10%), Positives = 11/39 (28%), Gaps = 4/39 (10%)

Query: 399 LSPLQIFLQ----NNNNNNNNNNNNNNNNNNNNNNNHNK 433
               Q+F       + +  +  +N     N + +     
Sbjct: 271 HENGQLFALSSPSKSEDKESAISNILEGENTSPSALEGV 309


>gnl|CDD|222130 pfam13439, Glyco_transf_4, Glycosyltransferase Family 4. 
          Length = 171

 Score = 30.1 bits (68), Expect = 1.4
 Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 4/35 (11%)

Query: 287 NSPYEIFVRELVAMDGVDSSEIVLIDSNGCPTDHF 321
            S     V + +A  GV   +I +I  NG  T+ F
Sbjct: 140 VSEE---VADELARYGVPEEKIRVI-PNGVDTERF 170


>gnl|CDD|227698 COG5411, COG5411, Phosphatidylinositol 5-phosphate phosphatase
           [Signal transduction mechanisms].
          Length = 460

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 23/78 (29%), Positives = 33/78 (42%), Gaps = 10/78 (12%)

Query: 159 SYDYGDTGDLVCRLSHHSRATLVGIQE----YRNTVGIEVGNRNTVIQHHDTIITSSDLG 214
            ++Y  T    C ++ H  A +  I+E    YR+       +R   I  HDTI    DL 
Sbjct: 159 RFNYERTS--FCFVNSHLAAGVNNIEERIFDYRSIASNICFSRGLRIYDHDTIFWLGDLN 216

Query: 215 LAVTCQYDLTNKTVSNEV 232
             VT     TN+ V  E+
Sbjct: 217 YRVT----STNEEVRPEI 230


>gnl|CDD|151365 pfam10918, DUF2718, Protein of unknown function (DUF2718).  This
           viral family of proteins has no known function.
          Length = 140

 Score = 29.2 bits (65), Expect = 2.0
 Identities = 14/67 (20%), Positives = 21/67 (31%), Gaps = 11/67 (16%)

Query: 357 SSIQNIQILTMHEIIIYGDINDTNGESVNFLRYYDSLSTCFFLSPLQIFLQNNNNNNNNN 416
           S I N + L + E      I++ N   V     +D      F+     F +         
Sbjct: 64  SPIPNEKPLKLKEQAEKPPISERNNNGV-----FD------FMKIPNPFKRYYEYCYPYP 112

Query: 417 NNNNNNN 423
           N N  N 
Sbjct: 113 NKNAKNQ 119


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 30.3 bits (68), Expect = 2.7
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 401 PLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           P  I  +NN ++N+NN++ +++N+  +N   N+
Sbjct: 312 PKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNR 344


>gnl|CDD|150787 pfam10154, DUF2362, Uncharacterized conserved protein (DUF2362).
           This is a family of proteins conserved from nematodes to
           humans. The function is not known.
          Length = 501

 Score = 29.8 bits (67), Expect = 3.3
 Identities = 6/22 (27%), Positives = 10/22 (45%)

Query: 412 NNNNNNNNNNNNNNNNNNNHNK 433
            + + NN    +N N N+   K
Sbjct: 154 TDQDVNNLAAQHNENQNSIEKK 175



 Score = 28.6 bits (64), Expect = 7.2
 Identities = 7/23 (30%), Positives = 10/23 (43%)

Query: 401 PLQIFLQNNNNNNNNNNNNNNNN 423
            L   L + + NN    +N N N
Sbjct: 148 ELGKSLTDQDVNNLAAQHNENQN 170


>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
          Length = 143

 Score = 28.4 bits (64), Expect = 3.5
 Identities = 6/22 (27%), Positives = 8/22 (36%)

Query: 404 IFLQNNNNNNNNNNNNNNNNNN 425
           +FL     N   N  +N   N 
Sbjct: 2   LFLLKKMYNKKKNRKSNIKENK 23


>gnl|CDD|214445 MTH00166, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 160

 Score = 28.6 bits (65), Expect = 3.5
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 397 FFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            FL  L + L N   N+  +N NNN N   N+ + +K
Sbjct: 85  LFLLMLLLMLNNFIKNSEMSNFNNNFNFFENSFSLSK 121


>gnl|CDD|235152 PRK03705, PRK03705, glycogen debranching enzyme; Provisional.
          Length = 658

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 6/51 (11%)

Query: 383 SVNFLRYYDSLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           S+N +  +D     F L     F  N  +N  N   N +  NNN +NNH K
Sbjct: 435 SINLVTAHDG----FTLRDCVCF--NQKHNEANGEENRDGTNNNYSNNHGK 479


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 29.4 bits (66), Expect = 3.7
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            + NNNN +NN N +  N ++ N   K
Sbjct: 58  PSENNNNKSNNENKDEENPSSTNPEKK 84


>gnl|CDD|214411 MTH00044, ND4, NADH dehydrogenase subunit 4; Validated.
          Length = 458

 Score = 29.6 bits (67), Expect = 4.0
 Identities = 17/73 (23%), Positives = 32/73 (43%), Gaps = 11/73 (15%)

Query: 357 SSIQNIQILTMHEIIIYGDINDTNGESVNFLRYY---DSLST-----CFFLSPLQIFLQN 408
            SI    ++++  + +    N  +  S + L      DSLS+       +L+PL +    
Sbjct: 26  ISISQSSLISLLSLFLLN--NHWS-SSWHNLSLSLATDSLSSPLIILSCWLAPLSLLASI 82

Query: 409 NNNNNNNNNNNNN 421
           N+ +NN+ NN   
Sbjct: 83  NHLSNNSPNNQRT 95


>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
           family.  YhcN and YlaJ are predicted lipoproteins that
           have been detected as spore proteins but not vegetative
           proteins in Bacillus subtilis. Both appear to be
           expressed under control of the RNA polymerase sigma-G
           factor. The YlaJ-like members of this family have a
           low-complexity, strongly acidic 40-residue C-terminal
           domain that is not included in the seed alignment for
           this model. A portion of the low-complexity region
           between the lipoprotein signal sequence and the main
           conserved region of the protein family was also excised
           from the seed alignment [Cellular processes, Sporulation
           and germination].
          Length = 158

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 12/42 (28%), Positives = 15/42 (35%)

Query: 392 SLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            L     L+      + N  NNN    N  + NNN  N  N 
Sbjct: 5   ILLLLLVLTGCTNAQKKNPPNNNVGETNVMSKNNNGMNTTND 46


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 9/26 (34%), Positives = 11/26 (42%)

Query: 408 NNNNNNNNNNNNNNNNNNNNNNNHNK 433
               +N N +   N   NN NN  NK
Sbjct: 116 FKKKSNKNGSVEKNECINNKNNVINK 141



 Score = 28.5 bits (64), Expect = 6.3
 Identities = 10/30 (33%), Positives = 13/30 (43%)

Query: 404 IFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            F + +N N +   N   NN NN  N   K
Sbjct: 115 YFKKKSNKNGSVEKNECINNKNNVINKKIK 144


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 7/36 (19%), Positives = 12/36 (33%)

Query: 398 FLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
            L  L   +  + +   +  N +   N N N   N 
Sbjct: 342 SLPSLLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNT 377


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 29.1 bits (66), Expect = 5.9
 Identities = 0/26 (0%), Positives = 4/26 (15%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
           +  +        +  +          
Sbjct: 210 EQGDRREERGRRDGGDRRGRRRRRDR 235



 Score = 29.1 bits (66), Expect = 6.2
 Identities = 0/26 (0%), Positives = 5/26 (19%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
           + +     +        +  +     
Sbjct: 205 RRDRREQGDRREERGRRDGGDRRGRR 230



 Score = 28.7 bits (65), Expect = 7.0
 Identities = 0/26 (0%), Positives = 5/26 (19%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
            + +  +     +        +    
Sbjct: 201 DDRDRRDRREQGDRREERGRRDGGDR 226



 Score = 28.7 bits (65), Expect = 7.9
 Identities = 1/26 (3%), Positives = 5/26 (19%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
           Q +        +  +        +  
Sbjct: 211 QGDRREERGRRDGGDRRGRRRRRDRR 236



 Score = 28.7 bits (65), Expect = 8.1
 Identities = 0/25 (0%), Positives = 6/25 (24%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNH 431
           ++  +     +        +  +  
Sbjct: 203 RDRRDRREQGDRREERGRRDGGDRR 227


>gnl|CDD|215096 PLN00192, PLN00192, aldehyde oxidase.
          Length = 1344

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 7/48 (14%)

Query: 389 YYDSLSTCF---FLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHNK 433
           Y  SL+  F   FLSPL      +N  ++N   +  +N   N + H+ 
Sbjct: 513 YRSSLAVGFLFDFLSPLI----ESNAKSSNGWLDGGSNTKQNPDQHDD 556


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 28.3 bits (63), Expect = 6.9
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
            N   N   +N+ + N NNN   N+N
Sbjct: 110 DNPQANAMQDNSFHENFNNNYPGNYN 135


>gnl|CDD|219312 pfam07150, DUF1390, Protein of unknown function (DUF1390).  This
           family consists of several Paramecium bursaria chlorella
           virus 1 (PBCV-1) proteins of around 250 residues in
           length. The function of this family is unknown.
          Length = 229

 Score = 28.1 bits (63), Expect = 7.4
 Identities = 10/24 (41%), Positives = 10/24 (41%)

Query: 409 NNNNNNNNNNNNNNNNNNNNNNHN 432
           N N    NNN NN N   N  N  
Sbjct: 58  NINTGVVNNNVNNINKTINTKNIT 81



 Score = 28.1 bits (63), Expect = 7.5
 Identities = 10/26 (38%), Positives = 10/26 (38%)

Query: 407 QNNNNNNNNNNNNNNNNNNNNNNNHN 432
             N    NNN NN N   N  N   N
Sbjct: 58  NINTGVVNNNVNNINKTINTKNITIN 83


>gnl|CDD|202525 pfam03069, FmdA_AmdA, Acetamidase/Formamidase family.  This family
           includes amidohydrolases of formamide EC:3.5.1.49 and
           acetamide. Methylophilus methylotrophus fmdA forms a
           homotrimer suggesting all the members of this family
           also do.
          Length = 316

 Score = 28.6 bits (64), Expect = 7.5
 Identities = 18/80 (22%), Positives = 26/80 (32%), Gaps = 16/80 (20%)

Query: 262 RSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVAMDGVDSSEIVL--IDSNGCPTD 319
           R   D+ P+AEV     +R E LD             A+ G   +               
Sbjct: 7   RWHPDIPPAAEVKPGEIVRVETLD-------------ALGGQIKNNDSADDYKKIDLSVV 53

Query: 320 HFIMGPLY-KAADTGKFLNC 338
           H + GP+  + A  G  L  
Sbjct: 54  HPLTGPIEVEGAKPGDLLVV 73


>gnl|CDD|226445 COG3935, DnaD, Putative primosome component and related proteins
           [DNA replication, recombination, and repair].
          Length = 246

 Score = 28.2 bits (63), Expect = 8.8
 Identities = 13/64 (20%), Positives = 19/64 (29%), Gaps = 6/64 (9%)

Query: 376 INDTNGESVNFLRYYDSLSTCFFLSPLQIFLQNNNNNN------NNNNNNNNNNNNNNNN 429
           I D N    N         T   +     + Q+N+           NN +  +N   N N
Sbjct: 65  IKDVNKTIRNLTGRGLLQGTQEEIDNTIKYEQDNSPPLWKLLQLFVNNKSLEDNELKNQN 124

Query: 430 NHNK 433
              K
Sbjct: 125 EDKK 128


>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
           Mitochondrial-encoded NADH-quinone
           oxidoreductase/respiratory complex I, the second domain
           of the Nad11/75-kDa subunit of some protists.
           NADH-quinone oxidoreductase is the first
           energy-transducting complex in the respiratory chain and
           functions as a redox pump that uses the redox energy to
           translocate H+ ions across the membrane, resulting in a
           significant contribution to energy production. The nad11
           gene codes for the largest (75-kDa) subunit of the
           mitochondrial NADH-quinone oxidoreductase, it
           constitutes the electron input part of the enzyme, or
           the so-called NADH dehydrogenase fragment. The Nad11
           subunit is made of two domains: the first contains three
           binding sites for FeS clusters (the fer2 domain), the
           second domain (this CD), is of unknown function or, as
           postulated, has lost an ancestral formate dehydrogenase
           activity that became redundant during the evolution of
           the complex I enzyme. Although only vestigial sequence
           evidence remains of a molybdopterin binding site, this
           protein domain belongs to the molybdopterin_binding
           (MopB) superfamily of proteins.
          Length = 366

 Score = 28.1 bits (63), Expect = 9.1
 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 3/41 (7%)

Query: 392 SLSTCFFLSPLQIFLQNNNNNNNNNNNNNNNNNNNNNNNHN 432
            L T F+    +  L    + N N+NN   NNN  N +  N
Sbjct: 99  DLETLFYY---KKLLNKLGSLNTNSNNFLENNNYFNLDLEN 136


>gnl|CDD|223072 PHA03393, odv-e66, occlusion-derived virus envelope protein E66;
           Provisional.
          Length = 682

 Score = 28.3 bits (64), Expect = 9.8
 Identities = 10/27 (37%), Positives = 11/27 (40%)

Query: 404 IFLQNNNNNNNNNNNNNNNNNNNNNNN 430
           I L  +NNNNN  N  N   N      
Sbjct: 18  ILLWQSNNNNNTPNPPNPPPNPPPPPP 44


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.134    0.407 

Gapped
Lambda     K      H
   0.267   0.0649    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,952,889
Number of extensions: 1997722
Number of successful extensions: 3915
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3239
Number of HSP's successfully gapped: 213
Length of query: 433
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 333
Effective length of database: 6,502,202
Effective search space: 2165233266
Effective search space used: 2165233266
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.1 bits)