RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6546
         (294 letters)



>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 34.7 bits (79), Expect = 0.050
 Identities = 38/199 (19%), Positives = 63/199 (31%), Gaps = 36/199 (18%)

Query: 94  LTRHPCPQCSRDFATVKILKTHLKT-----HEGIIYPCPE--CDKTFLSTSGLREHRKVH 146
                  QC+  F+    L  HL++          + CP   C K F     L+ H  +H
Sbjct: 287 SLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLH 346

Query: 147 AGVRYICH---ACPREFSSEATMKAHVASHNGVQHACDQCDRTFSCMPYLRQHQKLHSGI 203
             +            +FS     +                         L+Q++ L +  
Sbjct: 347 TSISPAKEKLLNSSSKFSPLLNNEPPQ---------------------SLQQYKDLKNDK 385

Query: 204 TYSC--PSCSKSFNRKDRYRMHLKLH---QGVRHTCKECSKEFSNEYNLKNHMVLHDVRA 258
                  SC ++F R     +H+  H   +        CSK F+  YNL  H  +H   A
Sbjct: 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA 445

Query: 259 YVCELCARQFTRVDHFNHH 277
            +     + F R    ++H
Sbjct: 446 PLLCSILKSFRRDLDLSNH 464


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 34.9 bits (80), Expect = 0.052
 Identities = 13/30 (43%), Positives = 14/30 (46%), Gaps = 1/30 (3%)

Query: 7   PTTNNPSTTNTTPTPTTQTSSTTPTRHPCP 36
            TT+   TT TTPT T     TT T  P  
Sbjct: 716 TTTSEDPTTTTTPTTTG-PEETTETAEPTT 744



 Score = 34.5 bits (79), Expect = 0.067
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 6   SPTTNNPSTTNTTPTPTTQTSSTTPT 31
              T   + T T P  TT+T+  T T
Sbjct: 720 EDPTTTTTPTTTGPEETTETAEPTTT 745



 Score = 31.8 bits (72), Expect = 0.51
 Identities = 11/26 (42%), Positives = 12/26 (46%)

Query: 6   SPTTNNPSTTNTTPTPTTQTSSTTPT 31
           +PTT  P  T  T  PTT T   T  
Sbjct: 727 TPTTTGPEETTETAEPTTTTEEPTEE 752



 Score = 31.4 bits (71), Expect = 0.66
 Identities = 7/26 (26%), Positives = 11/26 (42%)

Query: 6   SPTTNNPSTTNTTPTPTTQTSSTTPT 31
           + T    +T    PT TT+  +   T
Sbjct: 729 TTTGPEETTETAEPTTTTEEPTEETT 754


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 31.4 bits (71), Expect = 0.18
 Identities = 23/80 (28%), Positives = 36/80 (45%), Gaps = 8/80 (10%)

Query: 67  KTFLSTLTGTPQGARGRAVYLSRMSTTLTRHP--CPQCSRDFATVKILKTHLK--THEGI 122
           +  + TL   PQ     + YL ++ T+    P  CP C   F++   LK H++   H  +
Sbjct: 43  RAVVKTLIYNPQ-LLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV 101

Query: 123 IYPCPECDKTFLSTSGLREH 142
              CP C K F +T    +H
Sbjct: 102 ---CPVCGKEFRNTDSTLDH 118



 Score = 26.8 bits (59), Expect = 7.2
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 5/40 (12%)

Query: 35  CPQCSRDFATVKILKTHLK--THEGIIYPCPECDKTFLST 72
           CP C   F++   LK H++   H  +   CP C K F +T
Sbjct: 76  CPLCLMPFSSSVSLKQHIRYTEHSKV---CPVCGKEFRNT 112


>gnl|CDD|223609 COG0535, COG0535, Predicted Fe-S oxidoreductases [General function
           prediction only].
          Length = 347

 Score = 32.7 bits (74), Expect = 0.21
 Identities = 19/99 (19%), Positives = 32/99 (32%), Gaps = 18/99 (18%)

Query: 118 THEGIIYPCPECDKTFLSTSGLREHRKVHAGVRYICHACPREFSSEATMKAHVASHNGVQ 177
             +G +YPCP     FL   G+RE          + +   RE                ++
Sbjct: 266 DPDGEVYPCP-----FLPELGVREESFKEIWEESLLNKL-RERD------------ELLE 307

Query: 178 HACDQCDRTFSCMPYLRQHQKLHSGITYSCPSCSKSFNR 216
            +C +C+    C     +  +L   I  + P C      
Sbjct: 308 GSCGKCEYREYCGGCRARAYELTGDIYGADPFCPLVKKS 346



 Score = 27.3 bits (60), Expect = 10.0
 Identities = 18/87 (20%), Positives = 24/87 (27%), Gaps = 11/87 (12%)

Query: 54  THEGIIYPCPECDKTFLSTLTGTPQGARGRAVYLS-----RMSTTLTRHPCPQCSRDFAT 108
             +G +YPCP     FL  L G  + +       S     R    L    C +C      
Sbjct: 266 DPDGEVYPCP-----FLPEL-GVREESFKEIWEESLLNKLRERDELLEGSCGKCEYREYC 319

Query: 109 VKILKTHLKTHEGIIYPCPECDKTFLS 135
                   +    I    P C     S
Sbjct: 320 GGCRARAYELTGDIYGADPFCPLVKKS 346


>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
           This domain family is found in archaea and eukaryotes,
           and is approximately 30 amino acids in length. The
           mammalian members of this group occur multiple times
           along the protein, joined by flexible linkers, and are
           referred to as JAZ - dsRNA-binding ZF protein -
           zinc-fingers. The JAZ proteins are expressed in all
           tissues tested and localise in the nucleus, particularly
           the nucleolus. JAZ preferentially binds to
           double-stranded (ds) RNA or RNA/DNA hybrids rather than
           DNA. In addition to binding double-stranded RNA, these
           zinc-fingers are required for nucleolar localisation.
          Length = 27

 Score = 29.1 bits (66), Expect = 0.23
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query: 231 RHTCKECSKEFSNEYNLKNHM 251
           +  C  C K F +E  L+NH+
Sbjct: 1   QFYCVACDKYFKSENALENHL 21



 Score = 27.2 bits (61), Expect = 1.1
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 150 RYICHACPREFSSEATMKAHVAS 172
           ++ C AC + F SE  ++ H+ S
Sbjct: 1   QFYCVACDKYFKSENALENHLKS 23


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 32.5 bits (74), Expect = 0.25
 Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 1/29 (3%)

Query: 6   SPTTNNPSTTNTTPTPTTQTSSTTPTRHP 34
             TT++PS T  T +PT  ++STT T  P
Sbjct: 814 QTTTSSPSPT-QTTSPTQTSTSTTTTTSP 841



 Score = 31.8 bits (72), Expect = 0.43
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 5   LSPTTNNPSTTNTTPTPTTQTSSTTPTRHPCPQ 37
           +   T +   T T     T TSS +PT+   P 
Sbjct: 797 ILIETTSVVITKTVTQTQTTTSSPSPTQTTSPT 829



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 3   FLLSPTTNNPSTTNTTPTPTTQTSSTTPT 31
            +++ T     TT ++P+PT  TS T  +
Sbjct: 804 VVITKTVTQTQTTTSSPSPTQTTSPTQTS 832


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 28.2 bits (63), Expect = 0.39
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 124 YPCPECDKTFLSTSGLREHRKVH 146
           Y CPEC K F S S LREH + H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.8 bits (62), Expect = 0.51
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 232 HTCKECSKEFSNEYNLKNHMVLH 254
           + C EC K F ++  L+ HM  H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.7 bits (59), Expect = 1.5
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 205 YSCPSCSKSFNRKDRYRMHLKLH 227
           Y CP C K F  K   R H++ H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|132198 TIGR03154, sulfolob_CbsA, cytochrome b558/566, subunit A.  Members
           of this protein family are CbsA, one subunit of a highly
           glycosylated, heterodimeric, mono-heme cytochrome
           b558/566, found in Sulfolobus acidocaldarius and several
           other members of the Sulfolobales, a branch of the
           Crenarchaeota.
          Length = 465

 Score = 31.5 bits (71), Expect = 0.55
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 4   LLSPTTNNPSTTNTTPTPTTQTSSTTPT 31
           L++     P++T T+ +P+T TSS  P+
Sbjct: 412 LVTTPPTPPTSTTTSTSPSTTTSSAIPS 439


>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger. 
          Length = 27

 Score = 27.6 bits (62), Expect = 0.68
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 123 IYPCPECDKTFLSTSGLREHRKVH 146
           ++ C  C KTF S   L  H+K H
Sbjct: 1   VHTCGVCGKTFSSLQALGGHKKSH 24


>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048).  This
          presumed domain is functionally uncharacterized. This
          domain family is found in eukaryotes, and is typically
          between 228 and 257 amino acids in length.
          Length = 242

 Score = 30.5 bits (69), Expect = 0.76
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 4  LLSPTTNNPSTTNTTPTPTTQTSSTTPTRHP 34
          LLSP +  P +   +   +T +  + P+RHP
Sbjct: 7  LLSPNSGPPFSPLASSGGSTDSDGSRPSRHP 37


>gnl|CDD|216786 pfam01921, tRNA-synt_1f, tRNA synthetases class I (K).  This family
           includes only lysyl tRNA synthetases from prokaryotes.
          Length = 357

 Score = 30.7 bits (70), Expect = 0.80
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 2/42 (4%)

Query: 35  CPQCSRDFATVKILKTHLKTHEGIIYPCPECDKTFLSTLTGT 76
           CP+C R   T  +++   +    I Y C EC       +TG 
Sbjct: 174 CPKCGRV-LTTPVVEYDAE-GGTITYKCDECGHEGEVDITGG 213


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 31.3 bits (70), Expect = 0.83
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 5    LSPTTNNPSTTNTTPTPTTQTSSTTPTRHPCP 36
            LS  ++ P+ T  TP P++ +SS+  +    P
Sbjct: 1228 LSAASSPPAATTPTPPPSSSSSSSAQSISTSP 1259


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 26.9 bits (60), Expect = 0.98
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 233 TCKECSKEFSNEYNLKNHMVLH 254
            C +C K FS + NLK H+  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 25.4 bits (56), Expect = 3.9
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 206 SCPSCSKSFNRKDRYRMHLKLH 227
            CP C KSF+RK   + HL+ H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 25.0 bits (55), Expect = 4.9
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 126 CPECDKTFLSTSGLREHRKVH 146
           CP+C K+F   S L+ H + H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22


>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies).  This
           family contains two copies of a C2H2-like zinc finger
           domain.
          Length = 100

 Score = 28.8 bits (65), Expect = 1.2
 Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 9/79 (11%)

Query: 99  CPQCSRDFATVKILKTHLKTHEGIIYPCPECDKTFLSTS-GLRE--HRKVHAGVRYICHA 155
           C  C+    TV+    H+    G   P    ++ +L    GL      K+H G    C  
Sbjct: 2   CLFCNHTSDTVEENLEHMFKSHGFFIP----EREYLVDLEGLLNYLREKIHEG--NECLY 55

Query: 156 CPREFSSEATMKAHVASHN 174
           C ++F S   ++ H+    
Sbjct: 56  CGKQFKSLEALRQHMRDKG 74



 Score = 26.5 bits (59), Expect = 7.7
 Identities = 10/30 (33%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 222 MHLKLHQGVRHTCKECSKEFSNEYNLKNHM 251
           +  K+H+G  + C  C K+F +   L+ HM
Sbjct: 43  LREKIHEG--NECLYCGKQFKSLEALRQHM 70


>gnl|CDD|214426 MTH00123, ATP8, ATP synthase F0 subunit 8; Provisional.
          Length = 54

 Score = 27.5 bits (62), Expect = 1.4
 Identities = 11/22 (50%), Positives = 12/22 (54%)

Query: 4  LLSPTTNNPSTTNTTPTPTTQT 25
          L    TNNP+ TNTT   T  T
Sbjct: 29 LSFTPTNNPTKTNTTTNTTPWT 50



 Score = 27.2 bits (61), Expect = 1.8
 Identities = 7/29 (24%), Positives = 13/29 (44%)

Query: 3  FLLSPTTNNPSTTNTTPTPTTQTSSTTPT 31
           ++ P   + + TN      T T++T  T
Sbjct: 22 LIIQPKILSFTPTNNPTKTNTTTNTTPWT 50


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 30.1 bits (67), Expect = 1.4
 Identities = 25/88 (28%), Positives = 38/88 (43%), Gaps = 10/88 (11%)

Query: 116 LKTHEGIIYPCP--ECDKTFLSTSGLREHRKVHAGVRYICHACPREFSSEATMKAHVASH 173
           LK  +G  Y CP   C+K + + +GL+ H   H       H  P    S   M    A  
Sbjct: 342 LKVKDGKPYKCPVEGCNKKYKNQNGLKYHML-HGHQNQKLHENP----SPEKMNIFSAKD 396

Query: 174 NGVQHACDQCDRTFSCMPYLRQHQKLHS 201
               + C+ CD+ +  +  L+ H+K HS
Sbjct: 397 K--PYRCEVCDKRYKNLNGLKYHRK-HS 421



 Score = 29.7 bits (66), Expect = 1.9
 Identities = 20/89 (22%), Positives = 41/89 (46%), Gaps = 9/89 (10%)

Query: 197 QKLHSGITYSCP--SCSKSFNRKDRYRMHLKLHQGVRHTCKECSKEFSNEYNLKNHMVLH 254
            K+  G  Y CP   C+K +  ++  + H+      +   +  S E  N ++ K+     
Sbjct: 342 LKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKD----- 396

Query: 255 DVRAYVCELCARQFTRVDHFNHHRRLQHD 283
             + Y CE+C +++  ++   +HR+  HD
Sbjct: 397 --KPYRCEVCDKRYKNLNGLKYHRKHSHD 423


>gnl|CDD|213276 cd05908, A_NRPS_MycA_like, The adenylation domain of nonribosomal
           peptide synthetases (NRPS) similar to mycosubtilin
           synthase subunit A (MycA).  The adenylation (A) domain
           of NRPS recognizes a specific amino acid or hydroxy acid
           and activates it as (amino)-acyl adenylate by hydrolysis
           of ATP. The activated acyl moiety then forms thioester
           to the enzyme-bound cofactor phosphopantetheine of a
           peptidyl carrier protein domain. This family includes
           NRPS similar to mycosubtilin synthase subunit A (MycA).
           Mycosubtilin, which is characterized by a beta-amino
           fatty acid moiety linked to the circular heptapeptide
           Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin
           family of lipopeptide antibiotics. The mycosubtilin
           synthase subunit A (MycA) combines functional domains
           derived from peptide synthetases, amino transferases,
           and fatty acid synthases. Nonribosomal peptide
           synthetases are large multifunction enzymes that
           synthesize many therapeutically useful peptides. NRPS
           has a distinct modular structure in which each module is
           responsible for the recognition, activation, and, in
           some cases, modification of a single amino acid residue
           of the final peptide product. The modules can be
           subdivided into domains that catalyze specific
           biochemical reactions.
          Length = 499

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 12/43 (27%), Positives = 17/43 (39%)

Query: 35  CPQCSRDFATVKILKTHLKTHEGIIYPCPECDKTFLSTLTGTP 77
            P+    F TV + + HLK    II   P+          G+P
Sbjct: 277 FPKPGEPFKTVCLDRNHLKVGVKIIEIDPKNSDAISFVEVGSP 319



 Score = 29.5 bits (67), Expect = 2.3
 Identities = 10/30 (33%), Positives = 14/30 (46%)

Query: 99  CPQCSRDFATVKILKTHLKTHEGIIYPCPE 128
            P+    F TV + + HLK    II   P+
Sbjct: 277 FPKPGEPFKTVCLDRNHLKVGVKIIEIDPK 306


>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3.  The
          reovirus inner capsid protein lambda-1 displays
          nucleoside triphosphate phosphohydrolase (NTPase),
          RNA-5'-triphosphatase (RTPase), and RNA helicase
          activity and may play a role in the transcription of
          the virus genome, the unwinding or reannealing of
          double-stranded RNA during RNA synthesis. The RTPase
          activity constitutes the first step in the capping of
          RNA, resulting in a 5'-diphosphorylated RNA
          plus-strand. lambda1 is an Orthoreovirus core protein,
          VP3 is the homologous core protein in Aquareoviruses.
          Length = 1166

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 8/37 (21%), Positives = 19/37 (51%)

Query: 18 TPTPTTQTSSTTPTRHPCPQCSRDFATVKILKTHLKT 54
          +P  + + +   P+ + C  C  +F+++  L  H +T
Sbjct: 59 SPPQSKKATPINPSSYVCNVCMAEFSSMDQLAEHQRT 95



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 256 VRAYVCELCARQFTRVDHFNHHRRLQH 282
             +YVC +C  +F+ +D    H+R  H
Sbjct: 71  PSSYVCNVCMAEFSSMDQLAEHQRTTH 97



 Score = 28.2 bits (63), Expect = 7.0
 Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 170 VASHNGVQHACDQCDRTFSCMPYLRQHQKL-HS 201
               N   + C+ C   FS M  L +HQ+  HS
Sbjct: 66  ATPINPSSYVCNVCMAEFSSMDQLAEHQRTTHS 98


>gnl|CDD|239922 cd04476, RPA1_DBD_C, RPA1_DBD_C: A subfamily of OB folds
           corresponding to the C-terminal OB fold, the
           ssDNA-binding domain (DBD)-C, of human RPA1 (also called
           RPA70). RPA1 is the large subunit of Replication protein
           A (RPA). RPA is a nuclear ssDNA-binding protein (SSB)
           which appears to be involved in all aspects of DNA
           metabolism including replication, recombination, and
           repair. RPA also mediates specific interactions of
           various nuclear proteins. In animals, plants, and fungi,
           RPA is a heterotrimer with subunits of 70KDa (RPA1),
           32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C,
           RPA1 contains three other OB folds: DBD-A, DBD-B, and
           RPA1N. The major DNA binding activity of RPA is
           associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is
           involved in DNA binding and trimerization. It contains
           two structural insertions not found to date in other
           OB-folds: a zinc ribbon and a three-helix bundle. RPA1
           DBD-C also contains a Cys4-type zinc-binding motif,
           which plays a role in the ssDNA binding function of this
           domain. It appears that zinc itself may not be required
           for ssDNA binding.
          Length = 166

 Score = 28.8 bits (65), Expect = 1.9
 Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 11/49 (22%)

Query: 179 ACDQCDRTFSCMPYLRQHQKLHSGITYSCPSCSKSFNR-KDRYRMHLKL 226
           AC  C++                  TY C  C+KS    + RY + L +
Sbjct: 36  ACPGCNKKV----------VEEGNGTYRCEKCNKSVPNPEYRYILSLNV 74


>gnl|CDD|225380 COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein
          involved in phosphonate metabolism [Inorganic ion
          transport and metabolism].
          Length = 112

 Score = 28.1 bits (63), Expect = 2.0
 Identities = 9/35 (25%), Positives = 15/35 (42%), Gaps = 10/35 (28%)

Query: 31 TRHPCPQCSRDFATVKILKTHLKTHEGIIYPCPEC 65
          +  PCP+C+ ++             +G    CPEC
Sbjct: 2  SLPPCPKCNSEYTY----------EDGGQLICPEC 26



 Score = 28.1 bits (63), Expect = 2.0
 Identities = 9/35 (25%), Positives = 15/35 (42%), Gaps = 10/35 (28%)

Query: 95  TRHPCPQCSRDFATVKILKTHLKTHEGIIYPCPEC 129
           +  PCP+C+ ++             +G    CPEC
Sbjct: 2   SLPPCPKCNSEYTY----------EDGGQLICPEC 26


>gnl|CDD|203307 pfam05693, Glycogen_syn, Glycogen synthase.  This family consists
           of the eukaryotic glycogen synthase proteins GYS1, GYS2
           and GYS3. Glycogen synthase (GS) is the enzyme
           responsible for the synthesis of -1,4-linked glucose
           chains in glycogen. It is the rate limiting enzyme in
           the synthesis of the polysaccharide, and its activity is
           highly regulated through phosphorylation at multiple
           sites and also by allosteric effectors, mainly glucose
           6-phosphate (G6P).
          Length = 633

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 9/41 (21%), Positives = 12/41 (29%)

Query: 1   DHFLLSPTTNNPSTTNTTPTPTTQTSSTTPTRHPCPQCSRD 41
           D F               P P +   S +P+  P P  S  
Sbjct: 589 DQFKQEVEETISDNEFKIPRPASVPPSPSPSDAPSPHSSDA 629


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 25.8 bits (57), Expect = 2.7
 Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 1/25 (4%)

Query: 48 LKTHLKTHEGI-IYPCPECDKTFLS 71
          L+ H++TH G   Y CP C K+F S
Sbjct: 2  LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 25.8 bits (57), Expect = 2.7
 Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 1/25 (4%)

Query: 112 LKTHLKTHEGI-IYPCPECDKTFLS 135
           L+ H++TH G   Y CP C K+F S
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 25.8 bits (57), Expect = 3.2
 Identities = 12/26 (46%), Positives = 16/26 (61%), Gaps = 1/26 (3%)

Query: 192 YLRQHQKLHSGIT-YSCPSCSKSFNR 216
            LR+H + H+G   Y CP C KSF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 28.9 bits (64), Expect = 2.8
 Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 9/89 (10%)

Query: 7   PTTNNPSTTNTTPTPTTQTSSTTPTRHPCPQCSRDFATVKILKTHLKTHEGIIYPCPECD 66
           P T++P+      + +   +  +P R P  Q           K    +      P PE  
Sbjct: 122 PRTSSPARALLASSGSQHKTQKSPVRIPFMQ--------NPAKPPPLSKNASSRPRPEPG 173

Query: 67  KTFLSTLTGTPQGARGRAVYLSRMSTTLT 95
               + + G P GARG  + L RM++  +
Sbjct: 174 SRGRAGMNGGP-GARGSRLELVRMASAKS 201


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 5   LSPTTNNPSTTNTTPTPTTQTSSTTPTRHP 34
           L+ T+++PS   TT + +T+TS T  T   
Sbjct: 80  LTTTSSSPSNDTTTASTSTKTSPTVSTTVT 109


>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is typically
          between 46 to 94 amino acids in length. This domain is
          found associated with pfam01462, pfam00560.
          Length = 122

 Score = 27.6 bits (61), Expect = 3.8
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 4  LLSPTTNNPSTTNTTPTPTTQTSSTT 29
          ++ PTT   +TT TT  PTT T  TT
Sbjct: 43 IICPTTTTTTTTTTTTMPTTTTLPTT 68


>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein.  This small,
           uncommon, poorly conserved protein is found primarily in
           the Firmicutes. It features are pair of CxxC motifs
           separated by about 20 amino acids, followed by a highly
           hydrophobic region of about 45 amino acids. It has no
           conserved gene neighborhood, and its function is
           unknown.
          Length = 94

 Score = 26.9 bits (60), Expect = 4.0
 Identities = 8/45 (17%), Positives = 21/45 (46%), Gaps = 4/45 (8%)

Query: 206 SCPSCSKSFNRKDRYRMHLKLHQGVRHTCKECSKE--FSNEYNLK 248
            C +C++ F+ K+  +    L++ ++  C  C  +   + +   K
Sbjct: 2   ICKNCNEKFSYKELLKSLFSLYRPIK--CPNCGTKQYLTAKSRKK 44


>gnl|CDD|221093 pfam11359, gpUL132, Glycoprotein UL132.  Glycoprotein UL132 is a
          low-abundance structural component of Human
          cytomegalovirus (HCMV). The function of this protein is
          not fully understood.
          Length = 235

 Score = 27.7 bits (61), Expect = 6.1
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 5  LSPTTNNPSTTNTTPTPTTQTSSTTPT 31
          ++ +T  P+ T  T T T  TS+TT  
Sbjct: 1  MTSSTTTPANTTATVTVTVATSNTTSV 27


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 27.6 bits (60), Expect = 6.8
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 7   PTTNNPSTTNTTPTPTTQTSSTTPT 31
           PTT+N ST N T   T Q  + T  
Sbjct: 92  PTTSNASTINVTTKVTAQNITATEA 116


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 24.9 bits (54), Expect = 7.0
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 205 YSCPSCSKSFNRKDRYRMHLKLH 227
           + CP C KSF+ KD  + HL+ H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 24.5 bits (53), Expect = 7.3
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 124 YPCPECDKTFLSTSGLREHRKVH 146
           + CP C K+F S   L+ H + H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 24.1 bits (52), Expect = 9.9
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 232 HTCKECSKEFSNEYNLKNHMVLH 254
             C  C K FS++  LK H+  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
           modification].
          Length = 470

 Score = 28.1 bits (62), Expect = 7.3
 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)

Query: 207 CPSCSKSFNRKDRYRMHLKLHQGVRHTCKECSKEFS---NEYNLKNHM-VLHDVRAYVCE 262
           C  C + F+R   +  HL+   G RH CKE   +     +EY L  ++  L D  +    
Sbjct: 241 CVKCGREFSRSKVFEYHLE---GKRH-CKEGQGKEEFVYSEYVLHRYLKYLGDPVSETRS 296

Query: 263 LCARQFT 269
           L  R   
Sbjct: 297 LVLRSLA 303



 Score = 28.1 bits (62), Expect = 7.4
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 35  CPQCSRDFATVKILKTHLK 53
           C +C R+F+  K+ + HL+
Sbjct: 241 CVKCGREFSRSKVFEYHLE 259



 Score = 28.1 bits (62), Expect = 7.4
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 99  CPQCSRDFATVKILKTHLK 117
           C +C R+F+  K+ + HL+
Sbjct: 241 CVKCGREFSRSKVFEYHLE 259


>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 129

 Score = 27.0 bits (60), Expect = 7.9
 Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 4/40 (10%)

Query: 207 CPSCSKSFNRKDRYRMHLKLHQGVRHTCKECSKEFSNEYN 246
           CP C KS N   +     + HQ  R+ CK C   F+ E  
Sbjct: 33  CPRC-KSSNVV-KIGGIRRGHQ--RYKCKSCGSTFTVETG 68


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 27.8 bits (61), Expect = 7.9
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 12  PSTTNTTPTPTTQTSSTTPTRHP 34
           P+  NTT + TT T++TT T  P
Sbjct: 676 PAAANTTTSSTTTTTTTTTTAAP 698


>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family.  All
           proteins in this family for which functions are known
           are 3'-5' DNA-DNA helicases. These proteins are used for
           recombination, recombinational repair, and possibly
           maintenance of chromosome stability. This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 470

 Score = 27.8 bits (62), Expect = 7.9
 Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 187 FSCMPYLRQHQKLHSGITYSCPSCSKS 213
              + ++R+  K  SGI Y CPS  KS
Sbjct: 214 EDLLRFIRKEFKGKSGIIY-CPSRKKS 239


>gnl|CDD|227443 COG5112, UFD2, U1-like Zn-finger-containing protein [General
           function prediction only].
          Length = 126

 Score = 26.6 bits (58), Expect = 9.1
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 94  LTRHPCPQCSRDFATVKILKTHLK 117
           L +H C +C+R F T K L  H K
Sbjct: 53  LGQHYCIECARYFITEKALMEHKK 76


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 27.4 bits (61), Expect = 9.5
 Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 3/29 (10%)

Query: 259 YVCELCARQFTRVDHFNHHRR---LQHDP 284
           Y+CE C + +        HR+   LQH P
Sbjct: 159 YICEFCLKYYGSQTSLVRHRKKCSLQHPP 187


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
           N-terminal.  This is the eukaryotic DNA polymerase alpha
           subunit B N-terminal domain which is involved in complex
           formation. Also see pfam04058.
          Length = 239

 Score = 27.4 bits (61), Expect = 9.5
 Identities = 8/32 (25%), Positives = 14/32 (43%)

Query: 2   HFLLSPTTNNPSTTNTTPTPTTQTSSTTPTRH 33
               SP + +PS   +TP+P +   S+     
Sbjct: 126 SLFFSPASFSPSAAPSTPSPNSAKFSSRSNPG 157


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.132    0.431 

Gapped
Lambda     K      H
   0.267   0.0720    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,948,626
Number of extensions: 1200566
Number of successful extensions: 1934
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1864
Number of HSP's successfully gapped: 147
Length of query: 294
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 198
Effective length of database: 6,679,618
Effective search space: 1322564364
Effective search space used: 1322564364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 59 (26.5 bits)