RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18165
         (259 letters)



>gnl|CDD|150843 pfam10232, Med8, Mediator of RNA polymerase II transcription
           complex subunit 8.  Arc32, or Med8, is one of the
           subunits of the Mediator complex of RNA polymerase II.
           The region conserved contains two alpha helices
           putatively necessary for binding to other subunits
           within the core of the Mediator complex. The N-terminus
           of Med8 binds to the essential core Head part of
           Mediator and the C-terminus hinges to Med18 on the
           non-essential part of the Head that also includes Med20.
          Length = 226

 Score =  177 bits (449), Expect = 3e-55
 Identities = 99/265 (37%), Positives = 134/265 (50%), Gaps = 45/265 (16%)

Query: 1   MQR-EEKQLESSLESVVMKLNEIKSQLVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAI 59
           MQR  EKQL  SLE+V  +L ++K  L  ++ K++ + +  +WPTF D FA+LS Q  ++
Sbjct: 1   MQRTSEKQLVDSLEAVRNRLAQLKQSLTSLIDKLEQEYELPSWPTFQDQFAILSSQLVSL 60

Query: 60  AKVLSHDKTPPIRNLTVLPLILNPEREEELLRLTEGRVPAFTHDLVPDYLRTKQEPDVEH 119
            K+L+H+K   +RN  V PL L P R++E               LV   LRTK +P+VE 
Sbjct: 61  TKILAHEK-DLLRNTVVYPLPLFPTRDQE--------------PLVTTLLRTKPDPEVEE 105

Query: 120 KLIQLEHKAASLSYDAAQKQA-----ATFNKVITHVLDIVSKAREEWESEAGARGGTGQT 174
            + Q   KA + + D A KQ         NK +++ LD V+K REEWE E+  R    QT
Sbjct: 106 WMKQARQKAENSTKDTAAKQEVEIDILQQNKEVSNWLDKVTKLREEWEFESSHRVEIEQT 165

Query: 175 SSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPM 234
           S   DT+ LVAAV  G+GL                       R   P  G  M   GQP 
Sbjct: 166 SEEEDTNQLVAAVKDGKGL-------------------RPAIRQASPMGGKQMSQAGQP- 205

Query: 235 GMMGKNPSGIKTNIKAAAAIHPYSR 259
              GK PS I T IK + ++HPY R
Sbjct: 206 ---GKMPSDILTFIK-SGSVHPYQR 226


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 34.1 bits (78), Expect = 0.043
 Identities = 19/58 (32%), Positives = 20/58 (34%), Gaps = 1/58 (1%)

Query: 186 AVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
           A G          G G PVG G      G G P GP  G   G GG P G      + 
Sbjct: 22  APGQPAQQGYANQGMGGPVGGGGG-PGAGGGAPGGPVGGGGGGSGGPPGGGEVAGEAE 78



 Score = 29.9 bits (67), Expect = 1.0
 Identities = 20/50 (40%), Positives = 20/50 (40%), Gaps = 4/50 (8%)

Query: 198 PGPGAPVGPGM----MVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
           P PG P   G     M  P G G   G G G P GP G   G  G  P G
Sbjct: 21  PAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPGG 70



 Score = 27.9 bits (62), Expect = 5.5
 Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
           PG G   G G  +  GG G     G   P G GG   G MG    G
Sbjct: 185 PGGGGYGGGGGGMGGGGGGGMGEMGGMGPQGGGG-GYGGMGAPGQG 229


>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein
           [RNA processing and modification].
          Length = 654

 Score = 33.5 bits (76), Expect = 0.10
 Identities = 26/62 (41%), Positives = 33/62 (53%), Gaps = 9/62 (14%)

Query: 201 GAPVGPGMMVAPGGPGRPVGPGPGMPMG----PGGQPM-GMMGKNPSGIKTNIKAAAAIH 255
           GA  G G  ++P  P   +  GPG PMG    PGG P  GMMG  PSG+  N    +A+H
Sbjct: 574 GAEEG-GGNISPHVPAGFMAAGPGAPMGAFGYPGGIPFQGMMGSGPSGMPAN---GSAMH 629

Query: 256 PY 257
            +
Sbjct: 630 SH 631


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 32.9 bits (75), Expect = 0.15
 Identities = 16/63 (25%), Positives = 20/63 (31%), Gaps = 7/63 (11%)

Query: 189 MGRGLKMIMPGPGAPVGPGMMVAPGG----PGRPVGPGPGMPMGPGGQPMGMMGKNPSGI 244
             R L M  P  GA   P      G      G+P+G      M     P G     P+G+
Sbjct: 381 RMRQLPMGSPMGGAMGQP-PYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGG--PLRPNGL 437

Query: 245 KTN 247
              
Sbjct: 438 APM 440



 Score = 28.6 bits (64), Expect = 4.1
 Identities = 18/60 (30%), Positives = 21/60 (35%), Gaps = 7/60 (11%)

Query: 190 GRGLKMIMPG--PGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQP----MGMMGKNPSG 243
           G+G +    G   G P    MM  P GPG P+ P    PM     P         K P  
Sbjct: 402 GQGPQQQFNGQPLGWPRMS-MMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQ 460


>gnl|CDD|178035 PLN02414, PLN02414, glycine dehydrogenase (decarboxylating).
          Length = 993

 Score = 32.8 bits (75), Expect = 0.19
 Identities = 22/61 (36%), Positives = 28/61 (45%), Gaps = 8/61 (13%)

Query: 199 GPG-APVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIKTNIKAAAAIHPY 257
           GPG  P+G    +AP  P  PV P  G+P     QP+G +   P G       +A I P 
Sbjct: 739 GPGMGPIGVKKHLAPFLPSHPVVPTGGIPRPEKTQPLGTISAAPWG-------SALILPI 791

Query: 258 S 258
           S
Sbjct: 792 S 792


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 32.3 bits (73), Expect = 0.22
 Identities = 20/48 (41%), Positives = 20/48 (41%), Gaps = 6/48 (12%)

Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPG----PGMPMGPGGQPMGMMGKNP 241
           P PG    P MM     PG    PG    PG PMGP  Q     G NP
Sbjct: 215 PRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQ--RPGFNP 260



 Score = 30.4 bits (68), Expect = 0.89
 Identities = 16/46 (34%), Positives = 16/46 (34%), Gaps = 5/46 (10%)

Query: 197 MPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPS 242
           M  PG    PG    PG P      GP M   PG  P   M   P 
Sbjct: 229 MNRPGFRPQPGGFNHPGTP-----MGPNMQQRPGFNPNQGMNPPPH 269


>gnl|CDD|182724 PRK10780, PRK10780, periplasmic chaperone; Provisional.
          Length = 165

 Score = 31.4 bits (71), Expect = 0.29
 Identities = 13/47 (27%), Positives = 20/47 (42%)

Query: 110 RTKQEPDVEHKLIQLEHKAASLSYDAAQKQAATFNKVITHVLDIVSK 156
           RTK E DV  +      KA +   D  ++     NK++T +   V  
Sbjct: 85  RTKLEKDVMAQRQTFSQKAQAFEQDRRRRSNEERNKILTRIQTAVKS 131


>gnl|CDD|224213 COG1294, AppB, Cytochrome bd-type quinol oxidase, subunit 2 [Energy
           production and conversion].
          Length = 346

 Score = 30.7 bits (70), Expect = 0.77
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 26  LVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVL 63
           L G+  +++     L++   L+ FALL G    +  VL
Sbjct: 144 LQGVPIELNGGYAGLSFDQLLNPFALLCGLGLVLMYVL 181


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.4 bits (68), Expect = 1.0
 Identities = 23/103 (22%), Positives = 38/103 (36%), Gaps = 10/103 (9%)

Query: 158 REEWESEAGARGGTGQTSSLTDTHSLVAAVGMG--------RGLKMIMPGPGAP-VGPGM 208
           +   +++ G    +G T+S TD ++  +  G+G                  G P +G   
Sbjct: 137 KGSKKTQPGVSTSSGSTTSGTDLNTKQSQTGLGASGSHAQQDPAVSQSGVVGVPGLGVPG 196

Query: 209 MVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIKTNIKAA 251
           +  PGG G    PG G+    G  P   +G         I A+
Sbjct: 197 VGVPGGGGAGALPGVGVG-RAGVSPGVGVGGLGGVPGVGILAS 238


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 14/46 (30%), Positives = 15/46 (32%)

Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
           PG G    PG   APG P     PG        G P   +   P  
Sbjct: 394 PGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDA 439


>gnl|CDD|218323 pfam04910, Tcf25, Transcriptional repressor TCF25.  Members of this
           family are transcriptional repressors. They may act by
           increasing histone deacetylase activity at promoter
           regions.
          Length = 342

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 12/50 (24%), Positives = 23/50 (46%), Gaps = 2/50 (4%)

Query: 29  MLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVLSHDKTPPIRNLTVLP 78
           +L  +D + D L     +D +AL S ++  +            +NL++LP
Sbjct: 125 LLLSLDPEEDPLGALLLIDYYALRSKEYEWLIDFAESPLA--YKNLSLLP 172


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 29.2 bits (65), Expect = 2.4
 Identities = 18/76 (23%), Positives = 26/76 (34%), Gaps = 8/76 (10%)

Query: 11  SLESVVMKLNEIKSQLVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVLSHDKTPP 70
           S +     L+ + S+   M FK+    D+       DN AL      A  K L       
Sbjct: 162 SEDETASALD-LVSKKGKMAFKLHQIDDAE-----HDNKAL--WFQHAQKKYLFTSNEKE 213

Query: 71  IRNLTVLPLILNPERE 86
           +  LT   +    E E
Sbjct: 214 LDELTKSMIPKIREGE 229


>gnl|CDD|240095 cd04747, OYE_like_5_FMN, Old yellow enzyme (OYE)-related FMN
           binding domain, group 5.  Each monomer of OYE contains
           FMN as a non-covalently bound cofactor, uses NADPH as a
           reducing agent with oxygens, quinones, and
           alpha,beta-unsaturated aldehydes and ketones, and can
           act as electron acceptors in the catalytic reaction.
           Other members of OYE family include trimethylamine
           dehydrogenase, 2,4-dienoyl-CoA reductase, enoate
           reductase, pentaerythriol tetranitrate reductase,
           xenobiotic reductase, and morphinone reductase.
          Length = 361

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 187 VGMGRGLKMIMPGPGAPVGPGMMVAPGGP-GRP 218
           VG  R L         P+ P  +V PG P GR 
Sbjct: 101 VGAMRKLGTPPFPDVPPLSPSGLVGPGKPVGRE 133


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 28.7 bits (65), Expect = 2.5
 Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 2/52 (3%)

Query: 118 EHKLIQLEHKAASLSYDAAQKQAATFNK-VITHVLDIVSKAREEWESEAGAR 168
           E K+   E +A  +  + A+K+A    K  +    + + K R E E E   R
Sbjct: 26  EAKISSAEEEAKQI-IEEAKKEAEALKKEALLEAKEEIHKLRAEAERELKER 76


>gnl|CDD|221405 pfam12069, DUF3549, Protein of unknown function (DUF3549).  This
           family of proteins is functionally uncharacterized. This
           protein is found in bacteria. Proteins in this family
           are about 340 amino acids in length. This protein has a
           conserved LDE sequence motif.
          Length = 342

 Score = 29.1 bits (66), Expect = 2.5
 Identities = 14/47 (29%), Positives = 20/47 (42%), Gaps = 5/47 (10%)

Query: 45  FLDNFALLSGQ--FSAIAKVLSHDKTPPIRNLTVLPLILNPEREEEL 89
           FL+  A    Q  F+ +   L     P +R   +L L+ NP R   L
Sbjct: 290 FLEALAQTQDQALFNQLFADLVA--IPELRP-QLLMLLRNPNRSPAL 333


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 28.8 bits (64), Expect = 3.2
 Identities = 11/27 (40%), Positives = 12/27 (44%)

Query: 212 PGGPGRPVGPGPGMPMGPGGQPMGMMG 238
            G    P+GPGPG PMG      G   
Sbjct: 85  RGPQMGPMGPGPGRPMGQQMGGPGTAS 111



 Score = 28.0 bits (62), Expect = 6.1
 Identities = 24/97 (24%), Positives = 37/97 (38%), Gaps = 7/97 (7%)

Query: 145 KVITHVLDIVSKAREEWESEAGARGGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPV 204
           ++I H  D+  KA ++   +    GG G    +    +L      G  +  + PGPG P+
Sbjct: 45  RLIIHFRDMSKKAAQQQVLQ----GGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPM 100

Query: 205 GPGM--MVAPGGPGRPVGPGPGMPMGPGGQ-PMGMMG 238
           G  M          + +     MPMG  G  P  M  
Sbjct: 101 GQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQMSR 137


>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182. 
          Length = 238

 Score = 28.3 bits (63), Expect = 3.4
 Identities = 20/75 (26%), Positives = 26/75 (34%), Gaps = 5/75 (6%)

Query: 169 GGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMG 228
           GG G   +L    SL +      GL  I+ G   P       +  G   PVG G G  + 
Sbjct: 47  GGGGNNPNLASLSSLTSQ-----GLGKILSGLQPPPLGNGGGSGAGGPGPVGGGGGPGVA 101

Query: 229 PGGQPMGMMGKNPSG 243
           P         + PS 
Sbjct: 102 PNNIQPNAQAQQPST 116


>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region.  The region
           described in this family is found towards the N-terminus
           of various eukaryotic fork head/HNF-3-related
           transcription factors (which contain the pfam00250
           domain). These proteins play key roles in embryogenesis,
           maintenance of differentiated cell states, and
           tumorigenesis.
          Length = 137

 Score = 27.6 bits (61), Expect = 3.8
 Identities = 10/58 (17%), Positives = 16/58 (27%), Gaps = 1/58 (1%)

Query: 188 GMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGP-GMPMGPGGQPMGMMGKNPSGI 244
             G G      G       G  ++P G    +GP P G               + + +
Sbjct: 64  SPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRASSQNNL 121



 Score = 27.6 bits (61), Expect = 4.2
 Identities = 13/77 (16%), Positives = 18/77 (23%), Gaps = 2/77 (2%)

Query: 163 SEAGARGGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPG 222
             AG     G +SS  +  S            M M   G  + P   +   GP      G
Sbjct: 45  GYAGPGAMNGMSSSSMNGMSPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGG 104

Query: 223 PGM--PMGPGGQPMGMM 237
                           +
Sbjct: 105 SLSPNMSMSRASSQNNL 121



 Score = 27.2 bits (60), Expect = 5.9
 Identities = 14/63 (22%), Positives = 17/63 (26%), Gaps = 6/63 (9%)

Query: 186 AVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIK 245
                    M M G   P     M +    G   G G           MGMMG +  G  
Sbjct: 32  TSQGSANSSMNMSGYAGPGAMNGMSSSSMNGMSPGYGGAGSP------MGMMGMSSMGTS 85

Query: 246 TNI 248
            + 
Sbjct: 86  LSP 88


>gnl|CDD|215271 PLN02491, PLN02491, carotenoid 9,10(9',10')-cleavage dioxygenase.
          Length = 545

 Score = 28.7 bits (64), Expect = 3.9
 Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 30/76 (39%)

Query: 12 LESVVMKLNEIKSQLVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVLSHDKTPPI 71
          LE +++KL    SQ              L++         LSG F+ +      D+TPP+
Sbjct: 34 LEKLIVKLMHDASQ-------------PLHY---------LSGNFAPV-----RDETPPV 66

Query: 72 RNLTV---LPLILNPE 84
          ++L V   LP  LN E
Sbjct: 67 KDLPVRGHLPECLNGE 82


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 28.1 bits (63), Expect = 4.3
 Identities = 16/56 (28%), Positives = 16/56 (28%), Gaps = 1/56 (1%)

Query: 199 GPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPG-GQPMGMMGKNPSGIKTNIKAAAA 253
           G   P      V P  P    G G G P   G GQ  G     P          AA
Sbjct: 90  GAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPGSFLGGAAQTAA 145


>gnl|CDD|212017 cd11627, HR1_Ste20-like, Protein kinase C-related kinase homology
          region 1 (HR1) Rho-binding domain of
          Schizosaccharomyces pombe Ste20-like proteins.  This
          group is composed of predominantly uncharacterized
          fungal proteins, which contain two known domains: HR1
          at the N-terminal region and REM (Ras exchanger motif)
          at the C-terminal region. One member protein from
          Schizosaccharomyces pombe is named Ste16 while its gene
          is called ste20 (a target of rapamycin complex 2
          subunit). It is a subunit in the protein kinase TOR
          complexes in fission yeast. The REM domain is usually
          found in nucleotide exchange factors for Ras-like small
          GTPases. HR1 domains are anti-parallel coiled-coil
          (ACC) domains that bind small GTPases from the Rho
          family.
          Length = 71

 Score = 26.2 bits (58), Expect = 4.3
 Identities = 10/26 (38%), Positives = 15/26 (57%), Gaps = 2/26 (7%)

Query: 1  MQREEKQLESSLESVVMKLNEIKSQL 26
           QR   ++ES L S   K+ ++ SQL
Sbjct: 41 DQR--ARVESELNSSNRKIAQLTSQL 64


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 19/49 (38%), Positives = 25/49 (51%), Gaps = 7/49 (14%)

Query: 138 KQAATFNKVITHVLDIVSKAREEWESEAG---ARGGTGQTSSLTDTHSL 183
           KQ    N +I+H+  I+ K    +E EA    AR G G   SL DT +L
Sbjct: 174 KQIPQ-NSIISHLKTILEKEGVSYEPEALEILARSGNG---SLRDTLTL 218


>gnl|CDD|215113 PLN02168, PLN02168, copper ion binding / pectinesterase.
          Length = 545

 Score = 28.0 bits (62), Expect = 5.4
 Identities = 12/49 (24%), Positives = 19/49 (38%), Gaps = 3/49 (6%)

Query: 205 GPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIKTNIKAAAA 253
           G  ++  P  P  PVGP    P+ P         +    I+ ++   AA
Sbjct: 291 GVALIRYPNSPLDPVGPL---PLAPALHDYFSSVEQALSIRMDLNVGAA 336


>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763).  This
           eukaryotic family of proteins has no known function.
          Length = 91

 Score = 26.6 bits (59), Expect = 5.8
 Identities = 12/32 (37%), Positives = 13/32 (40%)

Query: 212 PGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
            GG G P  PG G  MG  G   G      +G
Sbjct: 59  DGGRGGPGPPGGGRRMGRIGGGGGPSRPPMAG 90


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 27.5 bits (61), Expect = 6.0
 Identities = 11/38 (28%), Positives = 11/38 (28%), Gaps = 2/38 (5%)

Query: 200 PGAPVGPGMMVAPGGPG--RPVGPGPGMPMGPGGQPMG 235
           P  P  P       G     P  P   MP G    PM 
Sbjct: 132 PLQPPKPDPGGLAPGLPGYPPQTPQALMPYGQPRPPMM 169


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 28.0 bits (62), Expect = 6.9
 Identities = 19/61 (31%), Positives = 24/61 (39%), Gaps = 1/61 (1%)

Query: 200  PGAPVGPGMMVAPGGPGRPVGP-GPGMPMGPGGQPMGMMGKNPSGIKTNIKAAAAIHPYS 258
            P  P+ P     P  P  P GP  P +P+G    P G + + P       K AA   P  
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883

Query: 259  R 259
            R
Sbjct: 2884 R 2884


>gnl|CDD|182670 PRK10715, flk, flagella biosynthesis regulator; Provisional.
          Length = 335

 Score = 27.7 bits (62), Expect = 7.2
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 213 GGPGRPVGPGPGMPMGPGGQPMGM 236
           G PG+P G G   P   G QP+  
Sbjct: 6   GPPGQPPGGGRNPPSTAGDQPLSP 29


>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
           Members of this family belong to the collagen
           superfamily. Collagens are generally extracellular
           structural proteins involved in formation of connective
           tissue structure. The alignment contains 20 copies of
           the G-X-Y repeat that forms a triple helix. The first
           position of the repeat is glycine, the second and third
           positions can be any residue but are frequently proline
           and hydroxyproline. Collagens are post translationally
           modified by proline hydroxylase to form the
           hydroxyproline residues. Defective hydroxylation is the
           cause of scurvy. Some members of the collagen
           superfamily are not involved in connective tissue
           structure but share the same triple helical structure.
          Length = 60

 Score = 25.1 bits (56), Expect = 8.0
 Identities = 18/34 (52%), Positives = 18/34 (52%)

Query: 200 PGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQP 233
           PG P  PG    PG PG P  PGP  P GP G P
Sbjct: 6   PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 39


>gnl|CDD|197548 smart00157, PRP, Major prion protein.  The prion protein is a major
           component of scrapie-associated fibrils in
           Creutzfeldt-Jakob disease, kuru, Gerstmann-Straussler
           syndrome and bovine spongiform encephalopathy.
          Length = 218

 Score = 27.1 bits (60), Expect = 8.3
 Identities = 21/47 (44%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPG-GQPMGMMGKNPSG 243
           PG G+P G       GG G+P G G G P G G GQP G     P G
Sbjct: 18  PGQGSPGGNRYPPQGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHG 64


>gnl|CDD|153333 cd07649, F-BAR_GAS7, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Growth Arrest Specific
           protein 7.  F-BAR domains are dimerization modules that
           bind and bend membranes and are found in proteins
           involved in membrane dynamics and actin reorganization.
           Growth Arrest Specific protein 7 (GAS7) is mainly
           expressed in the brain and is required for neurite
           outgrowth. It may also play a role in the protection and
           migration of embryonic stem cells. Treatment-related
           acute myeloid leukemia (AML) has been reported resulting
           from mixed-lineage leukemia (MLL)-GAS7 translocations as
           a complication of primary cancer treatment. GAS7
           contains an N-terminal SH3 domain, followed by a WW
           domain, and a central F-BAR domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 233

 Score = 27.3 bits (60), Expect = 8.6
 Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 111 TKQEPDVEHKLIQLEHKAASLSYDAAQK---QAATFNKVITHVLDIVSKAREEWESE 164
            +++ D+E K  QLE K ++ + +  +K   ++      +   +D+ ++A+ +W  E
Sbjct: 131 LERQKDLEGKTQQLEIKLSNKTEEDIKKARRKSTQAGDDLMRCVDLYNQAQSKWFEE 187


>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
          Length = 466

 Score = 27.2 bits (60), Expect = 8.9
 Identities = 22/74 (29%), Positives = 27/74 (36%), Gaps = 16/74 (21%)

Query: 186 AVGMGRGLKMIM----PGPGAPVGPGMMVA-------PGGPGRP-VGPGP----GMPMGP 229
           A G+G GL  ++     G GAP G     +       PG    P   P P    G P GP
Sbjct: 5   AAGLGLGLWSLLWLLCAGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGP 64

Query: 230 GGQPMGMMGKNPSG 243
              P     K+P  
Sbjct: 65  ASPPKSGPPKSPPA 78


>gnl|CDD|163682 cd07944, DRE_TIM_HOA_like, 4-hydroxy-2-oxovalerate aldolase-like,
           N-terminal catalytic TIM barrel domain.  This family of
           bacterial enzymes is sequence-similar to
           4-hydroxy-2-oxovalerate aldolase (HOA) but its exact
           function is unknown.  This family includes the
           Bacteroides vulgatus Bvu_2661 protein and belongs to the
           DRE-TIM metallolyase superfamily.  DRE-TIM metallolyases
           include 2-isopropylmalate synthase (IPMS),
           alpha-isopropylmalate synthase (LeuA),
           3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate
           synthase, citramalate synthase, 4-hydroxy-2-oxovalerate
           aldolase, re-citrate synthase, transcarboxylase 5S,
           pyruvate carboxylase, AksA, and FrbC.  These members all
           share a conserved  triose-phosphate isomerase (TIM)
           barrel domain consisting of a core beta(8)-alpha(8)
           motif with the eight parallel beta strands forming an
           enclosed barrel surrounded by eight alpha helices.  The
           domain has a catalytic center containing a divalent
           cation-binding site formed by a cluster of invariant
           residues that cap the core of the barrel.  In addition,
           the catalytic site includes three invariant residues -
           an aspartate (D), an arginine (R), and a glutamate (E) -
           which is the basis for the domain name "DRE-TIM".
          Length = 266

 Score = 27.1 bits (61), Expect = 8.9
 Identities = 4/27 (14%), Positives = 9/27 (33%)

Query: 232 QPMGMMGKNPSGIKTNIKAAAAIHPYS 258
             M + G +   +   ++    I P  
Sbjct: 128 NLMAISGYSDEELLELLELVNEIKPDV 154


>gnl|CDD|232988 TIGR00468, pheS, phenylalanyl-tRNA synthetase, alpha subunit.  Most
           phenylalanyl-tRNA synthetases are heterodimeric, with 2
           alpha (pheS) and 2 beta (pheT) subunits. This model
           describes the alpha subunit, which shows some similarity
           to class II aminoacyl-tRNA ligases. Mitochondrial
           phenylalanyl-tRNA synthetase is a single polypeptide
           chain, active as a monomer, and similar to this chain
           rather than to the beta chain, but excluded from this
           model. An interesting feature of the alignment of all
           sequences captured by this model is a deep split between
           non-spirochete bacterial examples and all other
           examples; supporting this split is a relative deletion
           of about 50 residues in the former set between two
           motifs well conserved throughout the alignment [Protein
           synthesis, tRNA aminoacylation].
          Length = 293

 Score = 27.3 bits (61), Expect = 9.1
 Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 21/84 (25%)

Query: 10  SSLESVVMKLNEIKSQLVGMLFKID---------HDRDSLNWP----------TF-LDNF 49
            SL  +   ++EI+   +G+ F  +          + D+LN P          TF + + 
Sbjct: 69  GSLHPLTRVIDEIRDIFLGLGFTEETGPEVETDFWNFDALNIPQDHPARDMQDTFYIKDR 128

Query: 50  ALLSGQFSAI-AKVLSHDKTPPIR 72
            LL    +A+  + +   + PPIR
Sbjct: 129 LLLRTHTTAVQLRTMEEQEKPPIR 152


>gnl|CDD|221598 pfam12484, PE_PPE_C, Polymorphic PE/PPE proteins C terminal.  This
           domain family is found in bacteria, and is approximately
           90 amino acids in length. The family is found in
           association with pfam00823. There is a conserved SVP
           sequence motif. There is a single completely conserved
           residue W that may be functionally important. The
           proteins in this family are PE/PPE proteins implicated
           in immunostimulation and virulence.
          Length = 81

 Score = 25.8 bits (57), Expect = 9.1
 Identities = 9/32 (28%), Positives = 9/32 (28%)

Query: 200 PGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGG 231
             A   PG  VA           PGMP     
Sbjct: 29  AAAAALPGAGVAAAAEAAAGAVLPGMPGAAAA 60


>gnl|CDD|173848 cd01159, NcnH, Naphthocyclinone hydroxylase.  Naphthocyclinone is
           an aromatic polyketide and an antibiotic, which is
           active against Gram-positive bacteria.  Polyketides are
           secondary metabolites, which have important biological
           functions such as antitumor, immunosupressive or
           antibiotic activities. NcnH is a hydroxylase involved in
           the biosynthesis of naphthocyclinone and possibly other
           polyketides.
          Length = 370

 Score = 27.3 bits (61), Expect = 9.7
 Identities = 15/58 (25%), Positives = 21/58 (36%), Gaps = 1/58 (1%)

Query: 165 AGARGGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPG 222
           A A G     +S+  THS + A       + +       +  G   APGG    V  G
Sbjct: 63  AEACGSAAWVASIVATHSRMLAAFPPEAQEEVWGDGPDTLLAGS-YAPGGRAERVDGG 119


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.134    0.390 

Gapped
Lambda     K      H
   0.267   0.0613    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,381,983
Number of extensions: 1310507
Number of successful extensions: 1295
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1175
Number of HSP's successfully gapped: 91
Length of query: 259
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 164
Effective length of database: 6,723,972
Effective search space: 1102731408
Effective search space used: 1102731408
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 58 (26.4 bits)