RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10158
         (1266 letters)



>gnl|CDD|215860 pfam00328, His_Phos_2, Histidine phosphatase superfamily (branch
           2).  The histidine phosphatase superfamily is so named
           because catalysis centres on a conserved His residue
           that is transiently phosphorylated during the catalytic
           cycle. Other conserved residues contribute to a
           'phosphate pocket' and interact with the phospho group
           of substrate before, during and after its transfer to
           the His residue. Structure and sequence analyses show
           that different families contribute different additional
           residues to the 'phosphate pocket' and, more
           surprisingly, differ in the position, in sequence and in
           three dimensions, of a catalytically essential acidic
           residue. The superfamily may be divided into two main
           branches.The smaller branch 2 contains predominantly
           eukaryotic proteins. The catalytic functions in members
           include phytase, glucose-1-phosphatase and multiple
           inositol polyphosphate phosphatase. The in vivo roles of
           the mammalian acid phosphatases in branch 2 are not
           fully understood, although activity against
           lysophosphatidic acid and tyrosine-phosphorylated
           proteins has been demonstrated.
          Length = 327

 Score =  147 bits (374), Expect = 6e-39
 Identities = 79/392 (20%), Positives = 124/392 (31%), Gaps = 104/392 (26%)

Query: 513 EPSLVLILKWGGELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASD 572
              +   L WG  LTP GR QA ELGR  R  Y G       LL        ++ I +SD
Sbjct: 40  LNKVQWPLGWGQ-LTPRGRAQAFELGRYLRQRYVG-------LLPDGYP-PSEVYIRSSD 90

Query: 573 EGRVQMTAAAFAKGLLALEGELTPILVQMVKSANTNGLLDNDSDASKHQNIVKAKLHDLL 632
             R   +A AF  GL   EG                   D D+D    Q I    L ++ 
Sbjct: 91  SNRTLASAQAFLAGLFPPEG-------------------DIDNDLLDWQPIPVVTLPEVK 131

Query: 633 QRDRTFTPEDRDKVNPCNATSINIAMDFVKNPVQCCKRIHELIHVLQHIIQKKLEDVKCK 692
           +          D    C       A D      +    + +L+ V    I K+L  +   
Sbjct: 132 KALANLLLLGYD---SCP------AFDESLVEKRVDPELAKLLAVYLEPIAKRLSQLLPG 182

Query: 693 ESSLYHGESWELMGRRWSKIEKDFCMKNYKYDISKIPDIYDCIKYDLQHNQHTVQFDQAE 752
           E++L            W+ ++      N   D S  PD++    YD  H +        E
Sbjct: 183 ETNL-------TGLDVWALLDLCLFETN-ASDNSPFPDLFTG--YDALHLE---YLSDLE 229

Query: 753 ELYLNAKYMADIVIPQEYGMTMSEKLTISQGICVPLLKKIRADLQRNVEESEENVNRLNP 812
           E Y  +                     I   +   LLK++   L  +             
Sbjct: 230 EYYGLSGI------------GPELAKLIGGPLLNELLKRLTNALVPDQT----------- 266

Query: 813 QYSHGVSSPGRHVRTRLYFTSESHIHSLLTVLRYGGLTESVHMNDEQWMRAMEYVSMVSE 872
                       ++  LYF  ++ I  LL+ L   GL + +                   
Sbjct: 267 ---------RLDLKLYLYFAHDTTIAPLLSAL---GLFDQLPGLSSGE------------ 302

Query: 873 LNYMSQVVIMLYEDPTKDPTSDERFHIELHFS 904
             Y +++V  L++D        + F++ L ++
Sbjct: 303 PPYAARLVFELWQD-------GKEFYVRLLYN 327



 Score = 50.5 bits (121), Expect = 3e-06
 Identities = 13/41 (31%), Positives = 19/41 (46%)

Query: 385 ELRCVVAVIRHGDRTPKQKMKVEVRHPKFFEIFYKYGGQND 425
           EL  V  V RHGDRTP + +  +     F  +  K+    +
Sbjct: 1   ELEFVQVVSRHGDRTPTKSLPKDPEAWPFKLLSEKHNDFLN 41


>gnl|CDD|132717 cd07061, HP_HAP_like, Histidine phosphatase domain found in
           histidine acid phosphatases and phytases; contains a His
           residue which is phosphorylated during the reaction.
           Catalytic domain of HAP (histidine acid phosphatases)
           and phytases (myo-inositol hexakisphosphate
           phosphohydrolases). The conserved catalytic core of this
           domain contains a His residue which is phosphorylated in
           the reaction. Functions in this subgroup include roles
           in metabolism, signaling, or regulation, for example
           Escherichia coli glucose-1-phosphatase functions to
           scavenge glucose from glucose-1-phosphate and the
           signaling molecules inositol 1,3,4,5,6-pentakisphosphate
           (InsP5) and inositol hexakisphosphate (InsP6) are in
           vivo substrates for eukaryotic multiple inositol
           polyphosphate phosphatase 1 (Minpp1). Phytases scavenge
           phosphate from extracellular sources and are added to
           animal feed while prostatic acid phosphatase (PAP) has
           been used for many years as a serum marker for prostate
           cancer. Recently PAP has been shown in mouse models to
           suppress pain by functioning as an
           ecto-5prime-nucleotidase. In vivo it dephosphorylates
           extracellular adenosine monophosphate (AMP) generating
           adenosine,and leading to the activation of A1-adenosine
           receptors in dorsal spinal cord.
          Length = 242

 Score = 84.0 bits (208), Expect = 8e-18
 Identities = 36/90 (40%), Positives = 44/90 (48%), Gaps = 8/90 (8%)

Query: 524 GELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAF 583
           GELTP GR QA ELGR FR  Y         LL LHS  R DL I +SD  R   +A AF
Sbjct: 17  GELTPFGRQQAFELGRYFRQRYGE-------LLLLHSYNRSDLYIRSSDSQRTLQSAQAF 69

Query: 584 AKGLLALEG-ELTPILVQMVKSANTNGLLD 612
             GL   +G +   +     +  + + L D
Sbjct: 70  LAGLFPPDGWQPIAVHTIPEEEDDVSNLFD 99



 Score = 50.5 bits (121), Expect = 1e-06
 Identities = 32/182 (17%), Positives = 58/182 (31%), Gaps = 34/182 (18%)

Query: 724 DISKIPDIYDCIKYDLQHNQHTVQFDQ---AEELYLNAKYMADIVIPQEYGMTMSEKLTI 780
           +   + +++D   Y+     ++  F      EE            +   YG      L  
Sbjct: 90  EEDDVSNLFDLCAYETVAKGYSAPFCDLFTEEEWVKLEYLN---DLKFYYGYGPGNPLAR 146

Query: 781 SQGICVPLLKKIRADLQRNVEESEENVNRLNPQYSHGVSSPGRHVRTRLYFTSESHIHSL 840
           +QG   PLL ++              + RL     +G S        R  +   SH  ++
Sbjct: 147 AQGS--PLLNEL--------------LARLT----NGPSGSQTFPLDRKLYLYFSHDTTI 186

Query: 841 LTVLRYGGLTESVHMNDEQWMRAMEYVSMVSELNYMSQVVIMLYEDP--TKDPTSDERFH 898
           L +L   GL +        ++R        SE +Y      +++E      D  S  R  
Sbjct: 187 LPLLTALGLFDFAEPLPPDFLRG------FSESDYPPFAARLVFELWRCPGDGESYVRVL 240

Query: 899 IE 900
           + 
Sbjct: 241 VN 242



 Score = 38.1 bits (89), Expect = 0.018
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query: 385 ELRCVVAVIRHGDRTP 400
           EL  V  + RHGDR P
Sbjct: 1   ELEQVQVLSRHGDRYP 16


>gnl|CDD|132716 cd07040, HP, Histidine phosphatase domain found in a functionally
           diverse set of proteins, mostly phosphatases; contains a
           His residue which is phosphorylated during the reaction.
            Catalytic domain of a functionally diverse set of
           proteins, most of which are phosphatases. The conserved
           catalytic core of this domain contains a His residue
           which is phosphorylated in the reaction. This set of
           proteins includes cofactor-dependent and
           cofactor-independent phosphoglycerate mutases (dPGM, and
           BPGM respectively), fructose-2,6-bisphosphatase
           (F26BP)ase, Sts-1, SixA, histidine acid phosphatases,
           phytases, and related proteins. Functions include roles
           in metabolism, signaling, or regulation, for example
           F26BPase affects glycolysis and gluconeogenesis through
           controlling the concentration of F26BP; BPGM controls
           the concentration of 2,3-BPG (the main allosteric
           effector of hemoglobin in human blood cells); human
           Sts-1 is a T-cell regulator; Escherichia coli Six A
           participates in the ArcB-dependent His-to-Asp
           phosphorelay signaling system; phytases scavenge
           phosphate from extracellular sources. Deficiency and
           mutation in many of the human members result in disease,
           for example erythrocyte BPGM deficiency is a disease
           associated with a decrease in the concentration of
           2,3-BPG. Clinical applications include the use of
           prostatic acid phosphatase (PAP) as a serum marker for
           prostate cancer. Agricultural applications include the
           addition of phytases to animal feed.
          Length = 153

 Score = 56.3 bits (136), Expect = 4e-09
 Identities = 20/66 (30%), Positives = 24/66 (36%), Gaps = 17/66 (25%)

Query: 522 WGGELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAA 581
             G LT  GR QA ELG+  R  Y                     +IY+S   R   TA 
Sbjct: 22  GDGPLTEKGRQQARELGKALRERYI-----------------KFDRIYSSPLKRAIQTAE 64

Query: 582 AFAKGL 587
              +GL
Sbjct: 65  IILEGL 70


>gnl|CDD|132718 cd07067, HP_PGM_like, Histidine phosphatase domain found in
           phosphoglycerate mutases and related proteins, mostly
           phosphatases; contains a His residue which is
           phosphorylated during the reaction.  Subgroup of the
           catalytic domain of a functionally diverse set of
           proteins, most of which are phosphatases. The conserved
           catalytic core of this domain contains a His residue
           which is phosphorylated in the reaction. This subgroup
           contains cofactor-dependent and cofactor-independent
           phosphoglycerate mutases (dPGM, and BPGM respectively),
           fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, and
           related proteins. Functions include roles in metabolism,
           signaling, or regulation, for example, F26BPase affects
           glycolysis and gluconeogenesis through controlling the
           concentration of F26BP; BPGM controls the concentration
           of 2,3-BPG (the main allosteric effector of hemoglobin
           in human blood cells); human Sts-1 is a T-cell
           regulator; Escherichia coli Six A participates in the
           ArcB-dependent His-to-Asp phosphorelay signaling system.
           Deficiency and mutation in many of the human members
           result in disease, for example erythrocyte BPGM
           deficiency is a disease associated with a decrease in
           the concentration of 2,3-BPG.
          Length = 153

 Score = 41.5 bits (98), Expect = 6e-04
 Identities = 21/74 (28%), Positives = 26/74 (35%), Gaps = 17/74 (22%)

Query: 525 ELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFA 584
            LT  GR QA  LG+                 RL        +IY+S   R   TA    
Sbjct: 25  PLTEKGREQARALGK-----------------RLKELGIKFDRIYSSPLKRAIQTAEIIL 67

Query: 585 KGLLALEGELTPIL 598
           + L  L  E+ P L
Sbjct: 68  EELPGLPVEVDPRL 81


>gnl|CDD|214859 smart00855, PGAM, Phosphoglycerate mutase family.  Phosphoglycerate
           mutase (PGAM) and bisphosphoglycerate mutase (BPGM) are
           structurally related enzymes that catalyse reactions
           involving the transfer of phospho groups between the
           three carbon atoms of phosphoglycerate... Both enzymes
           can catalyse three different reactions with different
           specificities, the isomerization of 2-phosphoglycerate
           (2-PGA) to 3-phosphoglycerate (3-PGA) with
           2,3-diphosphoglycerate (2,3-DPG) as the primer of the
           reaction, the synthesis of 2,3-DPG from 1,3-DPG with
           3-PGA as a primer and the degradation of 2,3-DPG to
           3-PGA (phosphatase activity). In mammals, PGAM is a
           dimeric protein with two isoforms, the M (muscle) and B
           (brain) forms. In yeast, PGAM is a tetrameric protein.
          Length = 158

 Score = 37.4 bits (87), Expect = 0.016
 Identities = 21/60 (35%), Positives = 24/60 (40%), Gaps = 16/60 (26%)

Query: 525 ELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFA 584
            LT  GR QAE LGR               LL      R D  +Y+S   R + TA A A
Sbjct: 25  PLTELGRAQAEALGR---------------LLASLLLPRFD-VVYSSPLKRARQTAEALA 68


>gnl|CDD|233121 TIGR00768, rimK_fam, alpha-L-glutamate ligases, RimK family.  This
           family, related to bacterial glutathione synthetases,
           contains at least two different alpha-L-glutamate
           ligases. One is RimK, as in E. coli, which adds
           additional Glu residues to the native Glu-Glu C-terminus
           of ribosomal protein S6, but not to Lys-Glu mutants.
           Most species with a member of this subfamily lack an S6
           homolog ending in Glu-Glu, however. Members in
           Methanococcus jannaschii act instead as a
           tetrahydromethanopterin:alpha-l-glutamate ligase
           (MJ0620) and a gamma-F420-2:alpha-l-glutamate ligase
           (MJ1001).
          Length = 277

 Score = 37.3 bits (87), Expect = 0.035
 Identities = 59/296 (19%), Positives = 100/296 (33%), Gaps = 57/296 (19%)

Query: 61  KSQSKPMKEILTRLE-EFEFIKM----IVFSEETIQKPVDEWPIVDCLISFHSKGFPLEK 115
           +   K +KE    L  +++ +      + F+E    + + E  +V   I    +G     
Sbjct: 10  RLDEKMLKEAAEELGIDYKVVTPPAIPLTFNEG--PRELAELDVVIVRIVSMFRGL---A 64

Query: 116 AIKYANLRKPFVINNLNMQYDIQDRRKVYALLEKEGIEIPRYAVLDRESPDPVKHELVES 175
             +Y       VIN+ +   +  D+     LL K G+  PR  +               +
Sbjct: 65  VARYLESLGVPVINSSDAILNAGDKFLTSQLLAKAGLPQPRTGL---------------A 109

Query: 176 EDHVEVNGIV--FNKPFVEKPVSAEDHNIYIYYPTSAGGGSQRLFRKIGSRSSVYSPESR 233
               E   ++     P V KPV             S G        K  + + +   E  
Sbjct: 110 GSPEEALKLIEEIGFPVVLKPVFG-----------SWGRLVSLARDKQAAETLLEHFEQL 158

Query: 234 VRKSGSFIYEDFMPTDGT-DVKVYTVGPDYAHAEARKSP-------ALDGKVERDSEGKE 285
                 F  ++++   G  D++V+ VG +   A  R +        A  GK E       
Sbjct: 159 NGPQNLFYVQEYIKKPGGRDIRVFVVGDEVIAAIYRITSGHWRTNLARGGKAE------- 211

Query: 286 IRYPVILSNAEKLISRKVCLAFKQTVCGFDLLRANGKSF-VCDVNGFSFVKNSNKY 340
              P  L+   + ++ K   A    V G DLL +  +   V +VN     KNS K 
Sbjct: 212 ---PCPLTEEIEELAIKAAKALGLDVVGIDLLESEDRGLLVNEVNPNPEFKNSVKT 264


>gnl|CDD|215845 pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch
           1).  The histidine phosphatase superfamily is so named
           because catalysis centres on a conserved His residue
           that is transiently phosphorylated during the catalytic
           cycle. Other conserved residues contribute to a
           'phosphate pocket' and interact with the phospho group
           of substrate before, during and after its transfer to
           the His residue. Structure and sequence analyses show
           that different families contribute different additional
           residues to the 'phosphate pocket' and, more
           surprisingly, differ in the position, in sequence and in
           three dimensions, of a catalytically essential acidic
           residue. The superfamily may be divided into two main
           branches. The larger branch 1 contains a wide variety of
           catalytic functions, the best known being fructose
           2,6-bisphosphatase (found in a bifunctional protein with
           2-phosphofructokinase) and cofactor-dependent
           phosphoglycerate mutase. The latter is an unusual
           example of a mutase activity in the superfamily: the
           vast majority of members appear to be phosphatases. The
           bacterial regulatory protein phosphatase SixA is also in
           branch 1 and has a minimal, and possible ancestral-like
           structure, lacking the large domain insertions that
           contribute to binding of small molecules in branch 1
           members.
          Length = 154

 Score = 35.1 bits (81), Expect = 0.072
 Identities = 17/62 (27%), Positives = 22/62 (35%), Gaps = 19/62 (30%)

Query: 526 LTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFAK 585
           LT  GR QA  LG+  +                      D +IY+S   R   TA   A+
Sbjct: 25  LTELGREQARALGKRLK------------------GIPFD-RIYSSPLLRAIQTAEILAE 65

Query: 586 GL 587
            L
Sbjct: 66  AL 67


>gnl|CDD|117020 pfam08443, RimK, RimK-like ATP-grasp domain.  This ATP-grasp domain
           is found in the ribosomal S6 modification enzyme RimK.
          Length = 190

 Score = 35.5 bits (82), Expect = 0.076
 Identities = 45/214 (21%), Positives = 71/214 (33%), Gaps = 51/214 (23%)

Query: 139 DRRKVYALLEKEGIEIPRYAVLDRESPDPVKHE-LVESEDHVEVNGIVFNKPFVEKPVSA 197
           D+ K + LL K GI +P   +    +  P   E  +E     ++ G     P V K V  
Sbjct: 3   DKAKSHQLLAKHGIPVPNTGL----AWSPEDAEKFIE-----QIKGF----PVVVKSVF- 48

Query: 198 EDHNIYIYYPTSAGGGSQ-RLFRKIGSRSSVYSPESRVRKSGSFIY-EDFMPTDGT-DVK 254
                          GSQ           S+       +   + I  ++F+   G  D++
Sbjct: 49  ---------------GSQGIGVFLAEDEQSLEQLLEAFKWLKNQILVQEFIAEAGNRDIR 93

Query: 255 VYTVGPDYA--------HAEARKSPALDGKVERDSEGKEIRYPVILSNAEKLISRKVCLA 306
              VG +            + R +    G  E          P  LS  E+ I+ K   A
Sbjct: 94  CLVVGGEVVGAIHRQSNEGDFRTNLHRGGVAE----------PYQLSQEEEEIAIKAAQA 143

Query: 307 FKQTVCGFDLLRANGKSFVCDVNGFSFVKNSNKY 340
               V G D+ R+     VC+VN    +K   + 
Sbjct: 144 MGLDVLGVDIFRSKRGLLVCEVNSSPGLKGIERT 177


>gnl|CDD|223483 COG0406, phoE, Broad specificity phosphatase PhoE and related
           phosphatases [General function prediction only].
          Length = 208

 Score = 35.5 bits (82), Expect = 0.099
 Identities = 19/63 (30%), Positives = 21/63 (33%), Gaps = 17/63 (26%)

Query: 525 ELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFA 584
            LT  GR QAE L                 L      F     IY+S   R Q TA   A
Sbjct: 28  PLTEEGRAQAEALAER--------------LAARDIGFDA---IYSSPLKRAQQTAEPLA 70

Query: 585 KGL 587
           + L
Sbjct: 71  EEL 73


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 36.7 bits (84), Expect = 0.11
 Identities = 36/144 (25%), Positives = 57/144 (39%), Gaps = 10/144 (6%)

Query: 892  TSDERFHIELHFSPG----VNCCVQKNLPPGPGFRPHSRNDQKK-NLPRIDQEDTEFYST 946
            T D   H+ L+ SPG          +  P  P  R    ++Q+  ++     E       
Sbjct: 730  TPDRAAHLLLYGSPGGGDGAEPSAAQESPANPWPRAPPCDEQEPLSVSPYGPEPDRPPDD 789

Query: 947  DAEDNTGSSKSTSDQDSPTSAEGPSVDQSKGKFVLSQPIPITVKDLKRKNSVGDPCPSIV 1006
            D E   G  + +S+  +    EG +  ++ G   L   +P  V +  R N +  PCP  +
Sbjct: 790  DFETRKGLKRKSSEDHADPIPEGNATKKTCGLQGLPDSLPPAVPETDRDNPLLPPCP--I 847

Query: 1007 APEG---HPYRRPRSPNEEQRSRS 1027
             PEG    P   P+ P E Q  +S
Sbjct: 848  TPEGPPCPPREEPQQPQEPQEPQS 871


>gnl|CDD|178268 PLN02662, PLN02662, cinnamyl-alcohol dehydrogenase family protein.
          Length = 322

 Score = 32.0 bits (73), Expect = 2.2
 Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 3/27 (11%)

Query: 352 ILRELAPTLHIPWSVPFQLDDPPFVPT 378
           IL EL PTL +P       DD P+VPT
Sbjct: 263 ILHELYPTLQLPEKCA---DDKPYVPT 286


>gnl|CDD|162743 TIGR02172, Fb_sc_TIGR02172, Fibrobacter succinogenes paralogous
           family TIGR02172.  This model describes a paralogous
           family of five proteins, likely to be enzymes, in the
           rumen bacterium Fibrobacter succinogenes S85. Members
           show homology to proteins described by pfam01636, a
           phosphotransferase enzyme family associated with
           resistance to aminoglycoside antibiotics. However,
           members of this family score below the current trusted
           and noise cutoffs for pfam01636.
          Length = 226

 Score = 30.9 bits (70), Expect = 3.0
 Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 670 RIHELIHVLQHIIQKKLEDVKCKESSLYHGESWELMGRRWSKIEKDFCMKNYKYD----I 725
           R+ E+  +   +  KKL   KC  S+    +S++   R++ + EKDF  K+YK      I
Sbjct: 92  RLEEIAKIFAEM-AKKLHSTKCDTSTF---QSYKEKIRKFIE-EKDFVPKDYKEKARAFI 146

Query: 726 SKIPDIYDCIKYDLQ 740
            ++PD   C+  D Q
Sbjct: 147 KEVPDTSTCLHGDFQ 161


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
            translational repressor [Translation, ribosomal structure
            and biogenesis].
          Length = 777

 Score = 31.6 bits (72), Expect = 3.6
 Identities = 26/142 (18%), Positives = 50/142 (35%), Gaps = 23/142 (16%)

Query: 1095 NLRNMIPSTTNVTALDGFGGVPS-IRPLETL-HNALSLKHLDNFLGKMIRVIPSKTPASS 1152
            + +   PS+T    L      P+   P+ +   ++ +          +   I   T +SS
Sbjct: 19   HSKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKI---TSSSS 75

Query: 1153 PPKYPSTPIEHSLSGARLGWSGPPSFVSESGPSS---------------PNALSDYFFKS 1197
              + PS     ++S +    SG  S + E   SS                +        S
Sbjct: 76   SRRKPSGSWSVAISSST---SGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSS 132

Query: 1198 GLNTSDSCSSEFSSSRSSVPSS 1219
               +S + SS F+S++  +P+ 
Sbjct: 133  VTLSSSTASSMFNSNKLPLPNP 154


>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).
           This family consists of a number of hypothetical
           proteins from Arabidopsis thaliana and Oryza sativa. The
           function of this family is unknown.
          Length = 766

 Score = 31.2 bits (70), Expect = 4.1
 Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 5/60 (8%)

Query: 457 PEIEEKQGKLEQLKGVLEMYGH--FSGINRKVQMKYQPKGRPRGSSSDEEEEDVCKPKEP 514
            E+ E    ++ L G++   G+   SG+ R+ + K  P+GR R  +   EE +V  P  P
Sbjct: 677 HEVTED---IQMLGGLMRATGYSWMSGLTRRGRRKGGPRGRRRSVTIINEEVEVSPPPAP 733


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 31.3 bits (71), Expect = 4.3
 Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 9/82 (10%)

Query: 427 HVKLKKPKQLQEVLDIARMLLTEIENNSADPEIEEKQGKLEQLKGVLEMYGHFSGINRKV 486
            +K +K +++Q++L+     +    NN         +G+L+ L    E++ HF+  ++  
Sbjct: 67  ELKKQKKQEIQKILEQQNAAIDADMNNKG-------KGRLKYLLQQTEIFAHFAKGDQSA 119

Query: 487 QMKYQPKGRPRGSSSD-EEEED 507
             K + KGR R +S   EEEED
Sbjct: 120 SAK-KAKGRGRHASKLTEEEED 140


>gnl|CDD|223267 COG0189, RimK, Glutathione synthase/Ribosomal protein S6
           modification enzyme (glutaminyl transferase) [Coenzyme
           metabolism / Translation, ribosomal structure and
           biogenesis].
          Length = 318

 Score = 30.8 bits (70), Expect = 4.6
 Identities = 50/274 (18%), Positives = 85/274 (31%), Gaps = 51/274 (18%)

Query: 71  LTRLEEFEFIKMIVFSEETIQKPVDEWPIVDCLISFHSKGFPLEKAIKYANL---RKPFV 127
           L RL E   +  + +     +       + D +I    K  P + A ++  L   +   V
Sbjct: 51  LARLVEVGEVIGLHYELIEEEDLSLLDEL-DVIIM--RKDPPFDFATRFLRLAERKGVPV 107

Query: 128 INNLNMQYDIQDRRKVYALLEKEGIEIPRYAVLDRESPDPVKHELVESEDHVEVNGIVFN 187
           IN+       +++     LL K GI +P   +     PD    E V              
Sbjct: 108 INDPQSIRRCRNKLYTTQLLAKAGIPVPPTLIT--RDPDEAA-EFVAE---------HLG 155

Query: 188 KPFVEKPVSAEDHNIYIYYPTSAGGGSQRLFRKIGSRSSVYSPESRVRKSGSFIYEDFMP 247
            P V KP+              +GG    L                       I ++++P
Sbjct: 156 FPVVLKPLD------------GSGGRGVFLVEDADPELLSLLETLTQEGRKLIIVQEYIP 203

Query: 248 TDGTDVKVY-----TVGPDYA------HAEARKSPALDGKVERDSEGKEIRYPVILSNAE 296
               D +        V   YA        + R + A  G+ E          P  L+  E
Sbjct: 204 KAKRDDRRVLVGGGEVVAIYALARIPASGDFRSNLARGGRAE----------PCELTEEE 253

Query: 297 KLISRKVCLAFKQTVCGFDLLRANGKSFVCDVNG 330
           + ++ K   A    + G D++      +V +VN 
Sbjct: 254 EELAVKAAPALGLGLVGVDIIEDKDGLYVTEVNV 287


>gnl|CDD|219602 pfam07841, DM4_12, DM4/DM12 family.  This family contains sequences
            derived from hypothetical proteins expressed by two
            insect species, D. melanogaster and A. gambiae. The
            region in question is approximately 115 amino acid
            residues long and contains four highly- conserved
            cysteine residues.
          Length = 83

 Score = 28.4 bits (64), Expect = 4.8
 Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 1125 HNALSLKHLDNFLGKMIRVI--PSKTPASSPPKY 1156
               L L   +  LG+++R++  PS++    PP+Y
Sbjct: 30   AAQLPLDEHNGLLGELLRIVFTPSESEDELPPEY 63


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
            related proteins that is plant specific.
          Length = 313

 Score = 30.7 bits (69), Expect = 4.9
 Identities = 17/76 (22%), Positives = 26/76 (34%), Gaps = 1/76 (1%)

Query: 1145 PSKTPASSPPKYPSTPIEHSLSGARLGWSGPPSFVSESGPSSPNALSDYFFKSGLNTSDS 1204
             + +P SS  +  S+    + + +R      P+    S   + N  S   F        S
Sbjct: 13   DASSPRSSSRRRLSSSFLSTSASSRPRRLNAPAS-PPSSSPARNTSSSSSFGLSKQRPSS 71

Query: 1205 CSSEFSSSRSSVPSSE 1220
             S    SSR   PS  
Sbjct: 72   LSRGRLSSRFVSPSRG 87


>gnl|CDD|232869 TIGR00197, yjeF_nterm, yjeF N-terminal region.  The protein region
           corresponding to This model shows no clear homology to
           any protein of known function. This model is built on
           yeast protein YNL200C and the N-terminal regions of E.
           coli yjeF and its orthologs in various species. The
           C-terminal region of yjeF and its orthologs shows
           similarity to hydroxyethylthiazole kinase (thiM) and
           other enzymes involved in thiamine biosynthesis. Yeast
           YKL151C and B. subtilis yxkO match the yjeF C-terminal
           domain but lack this region [Unknown function, General].
          Length = 205

 Score = 30.1 bits (68), Expect = 5.6
 Identities = 14/80 (17%), Positives = 33/80 (41%), Gaps = 8/80 (10%)

Query: 55  VCAMAKKSQSKPMKEILTRLEEFEFIKMIVFS--EETIQKPVDEWPIVDCLISFHSKGFP 112
           V  + K+ + +  ++    +   + +K+   S  E  + KP D   I+D ++    KG  
Sbjct: 75  VFLLKKEERIEFTEQ---AVVNLKALKVGGISIDEGNLVKPEDCDVIIDAILGTGVKGPL 131

Query: 113 LE---KAIKYANLRKPFVIN 129
            E     ++  N     +++
Sbjct: 132 REPFKTIVEEINELPAPIVS 151


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
            fungi is controlled by the loci that determine the mating
            type of an individual, and only individuals with
            differing mating types can mate. Basidiomycete fungi have
            evolved a unique mating system, termed tetrapolar or
            bifactorial incompatibility, in which mating type is
            determined by two unlinked loci; compatibility at both
            loci is required for mating to occur. The multi-allelic
            tetrapolar mating system is considered to be a novel
            innovation that could have only evolved once, and is thus
            unique to the mushroom fungi. This domain is C-terminal
            to the homeodomain transcription factor region.
          Length = 418

 Score = 30.5 bits (69), Expect = 5.9
 Identities = 28/124 (22%), Positives = 43/124 (34%), Gaps = 21/124 (16%)

Query: 1102 STTNVTALDGFGGVPSIRPLETLHNALSLKHLDNFLGKM---IRVIPSKTPASSPPKYPS 1158
            ST  V AL       S  PL+      S    + +   +      +    P+SS      
Sbjct: 288  STFAVPALTSSSVDQSATPLD---QGFSNFGSNMYSEPLNPTNDSLLYGLPSSSSLYANR 344

Query: 1159 T--PIEHSLSGARLGWSGPPSFVSESGPSSPNALSDYFFKSGLNTSDSCSSEFSSSRSSV 1216
            T  P   S S + L +S   +  S S  +S + L+              +   S S  ++
Sbjct: 345  TIFPAWASTSVSPLDFSTLFNQPSPSPMASQSILAP-------------AQPTSPSPVAL 391

Query: 1217 PSSE 1220
            PSSE
Sbjct: 392  PSSE 395


>gnl|CDD|234304 TIGR03671, cca_archaeal, CCA-adding enzyme. 
          Length = 408

 Score = 30.3 bits (69), Expect = 7.9
 Identities = 21/72 (29%), Positives = 25/72 (34%), Gaps = 19/72 (26%)

Query: 140 RRKVYALLEKEGIEIPRYAV-------------LDRESPDPVKHE--LVESEDHV----E 180
            R +  LLE+EG E+ RY V                E P    H    V   DH     E
Sbjct: 304 GRSLVKLLEREGFEVLRYGVWADENTCYLLLELESAELPRVKLHVGPPVWVRDHAEKFIE 363

Query: 181 VNGIVFNKPFVE 192
               V   PF+E
Sbjct: 364 KYEDVDYGPFIE 375


>gnl|CDD|222432 pfam13882, Bravo_FIGEY, C-terminal domain of Fibronectin type III.
            This is the very C-terminal region of neural adhesion
            molecule L1 proteins that are also known as Bravo or
            NrCAM. It lies upstream of the IG and Fn3 domains and has
            the highly conserved motif FIGEY. The function is not
            known.
          Length = 118

 Score = 28.4 bits (64), Expect = 8.9
 Identities = 10/49 (20%), Positives = 18/49 (36%), Gaps = 1/49 (2%)

Query: 993  KRKNSVGDPCPSIVAPEG-HPYRRPRSPNEEQRSRSYDQQHQRPKGASK 1040
            +++++ GDP    +  EG   Y RP       R        +    +S 
Sbjct: 40   EKEDAPGDPEDQPMKDEGFGEYSRPEDNKPLPRGSQQSLDGEIGDDSST 88


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.134    0.395 

Gapped
Lambda     K      H
   0.267   0.0657    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 64,369,744
Number of extensions: 6384977
Number of successful extensions: 5410
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5397
Number of HSP's successfully gapped: 36
Length of query: 1266
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1158
Effective length of database: 6,147,370
Effective search space: 7118654460
Effective search space used: 7118654460
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.0 bits)