RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9966
         (1101 letters)



>gnl|CDD|224157 COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold
           involved in RNA processing [Translation, ribosomal
           structure and biogenesis].
          Length = 427

 Score =  224 bits (573), Expect = 2e-64
 Identities = 97/284 (34%), Positives = 136/284 (47%), Gaps = 8/284 (2%)

Query: 1   MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
              ++V    I+YTGD     DR L  A +  C  D+LI ESTY   +  ++   ER F+
Sbjct: 148 AILLEVDGGRILYTGDVKRRKDRLLNGAELPPC-IDVLIVESTYGDRLHPNRDEVERRFI 206

Query: 61  KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
           + V   ++RGG VLIP FALGRAQEL ++L           PIY    +   A  Y K  
Sbjct: 207 ESVKAALERGGTVLIPAFALGRAQELLLILRELGFAG--DYPIYVDGPIARVALAYAKYP 264

Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
           I      + K    R     +  R   +  ID  GP VV A PGML  G S    K    
Sbjct: 265 IGLDLPDLLKVAESR-FRFVESRRNSMREGID-KGPAVVLAAPGMLKGGRSRYYLKHLLS 322

Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
            E N +++PG+  +GT+G  +L G   +       I VK  +E + FSAHAD   +++ I
Sbjct: 323 DEKNWVLLPGYQAEGTLGRVLLEGGTSVH-IKGIEIKVKARVEELDFSAHADGDELLEFI 381

Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEF--NLDCFMPANGESC 282
           +   P  V+LVHGE      L+ ++ +E     +  +PANGE  
Sbjct: 382 KDISPPKVVLVHGEPEYGAALRARLLEELIGIRELELPANGEEY 425


>gnl|CDD|188365 TIGR03675, arCOG00543, arCOG00543 universal archaeal
           KH-domain/beta-lactamase-domain protein.  This family of
           proteins is universal in the archaea and consistsof an
           N-terminal type-1 KH-domain (pfam00013) a central
           beta-lactamase-domain (pfam00753) with a C-terminal
           motif associated with RNA metabolism (pfam07521).
           KH-domains are associated with RNA-binding, so taken
           together, this protein is a likely metal-dependent
           RNAase. This family was defined in as arCOG01782.
          Length = 630

 Score =  179 bits (457), Expect = 3e-47
 Identities = 93/287 (32%), Positives = 154/287 (53%), Gaps = 17/287 (5%)

Query: 11  IVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDFLKKVHECVD 68
           IVYTGD+     R L  A     R + LI ESTY      + S+   E++ +K V+E + 
Sbjct: 341 IVYTGDFKYEKTRLLDPAVNKFPRVETLIMESTYGGRDDYQPSREEAEKELIKVVNETIK 400

Query: 69  RGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTNQK 127
           RGGKVLIPVFA+GRAQE+ ++LE    +  + + P+Y   G+  +AT  +  +  + N++
Sbjct: 401 RGGKVLIPVFAVGRAQEVMLVLEEAMRKGLIPEVPVYLD-GMIWEATAIHTAYPEYLNKE 459

Query: 128 IRKTFVQR--NMFD---FKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
           +R+       N F    F  +   D+    I++  P ++ AT GML+ G  +  FK  AP
Sbjct: 460 LRERIFHEGENPFLSEIFHRVEGSDERREIIESDEPAIILATSGMLNGGPVVEYFKLLAP 519

Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYMS-FSAHADAKGI 236
              N L+  G+  +GT+G ++ SG +++   +    + I + M +E +  FS H+D + +
Sbjct: 520 DPRNSLVFVGYQAEGTLGRRIQSGWREIPLTDDGKTETIKINMEVETIEGFSGHSDRRQL 579

Query: 237 MQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
           M  ++  +PK   +LL HGE SK + L   I ++F ++   P N E+
Sbjct: 580 MNYVRRMQPKPEKILLNHGEPSKCLDLASSIYKKFKIETIAPKNLET 626


>gnl|CDD|224696 COG1782, COG1782, Predicted metal-dependent RNase, consists of a
           metallo-beta-lactamase domain and an RNA-binding KH
           domain [General function prediction only].
          Length = 637

 Score =  179 bits (455), Expect = 5e-47
 Identities = 95/292 (32%), Positives = 160/292 (54%), Gaps = 18/292 (6%)

Query: 7   GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDFLKKVH 64
           G  +IVYTGD+     R L  A     R + LI ESTY     ++  +   E++ +K ++
Sbjct: 343 GLYNIVYTGDFKFEKTRLLEPANNKFPRVETLIMESTYGGRDDVQPPREEAEKELIKVIN 402

Query: 65  ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
           + + RGGKVLIPVFA+GR+QE+ I+LE    +  + + P+Y   G+  +AT  +  +  +
Sbjct: 403 DTLKRGGKVLIPVFAVGRSQEVMIVLEEAMRKGLIPEVPVYLD-GMIWEATAIHTAYPEY 461

Query: 124 TNQKIRKTFVQR--NMF---DFKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSLIIFK 176
            N+++R+       N F    FK +   D+    I++  P ++ AT GML+ G  +  FK
Sbjct: 462 LNKELRERIFHEGENPFLSEIFKRVEGSDERQEIIESDEPAIILATSGMLNGGPVVEYFK 521

Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYMS-FSAHA 231
             AP   N LI  G+  +GT+G ++ SG K++    +    ++I V M +E +  FS H+
Sbjct: 522 HLAPDPKNTLIFVGYQAEGTLGRRIQSGAKEIPIPGEDGKTEVIKVNMEVETIEGFSGHS 581

Query: 232 DAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
           D + +M+ ++   PK   +LL HGE SK + L   I+++F ++ + P N E+
Sbjct: 582 DRRQLMKYVRRMNPKPEKILLNHGEPSKCLDLASSIRRKFKIETYAPKNLET 633


>gnl|CDD|214983 smart01027, Beta-Casp, Beta-Casp domain.  The beta-CASP domain is
           found C terminal to the beta-lactamase domain in
           pre-mRNA 3'-end-processing endonuclease. The active site
           of this enzyme is located at the interface of these two
           domains.
          Length = 126

 Score =  139 bits (354), Expect = 1e-38
 Identities = 47/126 (37%), Positives = 66/126 (52%), Gaps = 7/126 (5%)

Query: 83  AQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQ-RNMFDF 140
            QEL ++LE  W    L   PIY    +  +AT  YK +  W + +IRK F Q RN FDF
Sbjct: 1   TQELLLILEELWREGELPNVPIYLDSPMAARATEIYKSYPEWMSDEIRKRFEQGRNPFDF 60

Query: 141 KHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQG 195
           K+++        K   D  GP V+ A+ GML  G S    K+ AP   N +I+ G+  +G
Sbjct: 61  KNLKFVKSLEESKRLNDYKGPKVIIASSGMLTGGRSRHYLKRLAPDPRNTVILTGYQAEG 120

Query: 196 TIGHKV 201
           T+G K+
Sbjct: 121 TLGRKL 126


>gnl|CDD|220932 pfam10996, Beta-Casp, Beta-Casp domain.  The beta-CASP domain is
           found C terminal to the beta-lactamase domain in
           pre-mRNA 3'-end-processing endonuclease. The active site
           of this enzyme is located at the interface of these two
           domains.
          Length = 124

 Score =  115 bits (291), Expect = 3e-30
 Identities = 44/125 (35%), Positives = 62/125 (49%), Gaps = 7/125 (5%)

Query: 83  AQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFK 141
           AQEL +LLE  W     +  PIY    +  KAT  YK +  W +    K F+ R  F F 
Sbjct: 1   AQELLLLLEELWREGIPKDVPIYLDSPMAIKATEIYKSYPEWLSDSA-KEFLLRGAFPFD 59

Query: 142 HIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGT 196
           +++        K   +  GP V+ A+ GML  G SL   KK AP   N +I+ G+  +GT
Sbjct: 60  NLKFVRSVDESKELNNLGGPKVIIASSGMLEGGRSLHYLKKLAPDPKNTVILTGYQAEGT 119

Query: 197 IGHKV 201
           +G K+
Sbjct: 120 LGRKL 124


>gnl|CDD|203663 pfam07521, RMMBL, RNA-metabolising metallo-beta-lactamase.  The
           metallo-beta-lactamase fold contains five sequence
           motifs. The first four motifs are found in pfam00753 and
           are common to all metallo-beta-lactamases. The fifth
           motif appears to be specific to function. This entry
           represents the fifth motif from metallo-beta-lactamases
           involved in RNA metabolism.
          Length = 43

 Score = 59.0 bits (144), Expect = 2e-11
 Identities = 17/43 (39%), Positives = 30/43 (69%)

Query: 214 QIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEAS 256
           ++I VK  +E + FS HAD + +++L++  +P+ V+LVHGE  
Sbjct: 1   EVIPVKARVETLHFSGHADQEELLELLKLLKPEKVILVHGEPR 43


>gnl|CDD|222313 pfam13679, Methyltransf_32, Methyltransferase domain.  This family
           appears to be a methyltransferase domain.
          Length = 143

 Score = 57.2 bits (139), Expect = 1e-09
 Identities = 27/102 (26%), Positives = 36/102 (35%), Gaps = 17/102 (16%)

Query: 779 KRKSKAPVVSI-KDEELVVCKDKCKQI---------THFVTPDSDISSILSQAYPQDSLH 828
           KRK    VV I    ELV   +K               F+       + +S   P     
Sbjct: 49  KRKYGVRVVGIDARAELV---EKANARAQKLGFNKRMSFLEGTIA-GATVSARLPDPVD- 103

Query: 829 NVCIMGLHTCGDLSGTALRLFTKSSLQCLVQVGCCYHLLEEE 870
              +  LH C   +  ALR       + +V V CCYH L E+
Sbjct: 104 --VVTALHACDTATDDALRFALCKQAKAIVLVPCCYHELAEQ 143



 Score = 43.3 bits (103), Expect = 8e-05
 Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 9/77 (11%)

Query: 669 KKSYEVQVMSQVVAAVTNSCDSSH----IIDLGGGQGYLSTILALQ-----HGKKTLSLD 719
           KK ++V+ +++ V  +            ++D G G+GYL  IL        +G + + +D
Sbjct: 1   KKLHQVERLAEFVEPLIKEVLDKTGCITVVDHGAGKGYLGFILYYLLFKRKYGVRVVGID 60

Query: 720 YNQVNTHGAAVRSKKLE 736
                   A  R++KL 
Sbjct: 61  ARAELVEKANARAQKLG 77


>gnl|CDD|200373 TIGR04122, Xnuc_lig_assoc, putative exonuclease, DNA
           ligase-associated.  Members of this protein family
           frequently are found annotated as a putative exonuclease
           involved in mRNA processing. This protein is found,
           exclusively in bacteria, associated with three other
           proteins: an ATP-dependent DNA ligase, a helicase, and
           putative phosphoesterase.
          Length = 326

 Score = 41.8 bits (99), Expect = 0.001
 Identities = 34/126 (26%), Positives = 51/126 (40%), Gaps = 24/126 (19%)

Query: 3   QVKV--GNQSIVYTGDYNMTPDRHLGAAWIDKCRP------DLLITESTYATTI--RDSK 52
           QV++  G +  V +GDY   PD          C P      D  ITE+T+   +      
Sbjct: 93  QVRLEYGGEVWVVSGDYKRAPDP--------TCAPFEPVPCDTFITEATFGLPVFRWPPP 144

Query: 53  RCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEK 112
                  L    +  + G   L+  +ALG+AQ +  LL   ++R     PIY   G  E 
Sbjct: 145 AEEIAKILAWWAQNPEAGRASLLGAYALGKAQRVLALLRAAYDR-----PIYLH-GAVEN 198

Query: 113 ATNYYK 118
            T+ Y+
Sbjct: 199 LTDLYR 204


>gnl|CDD|202525 pfam03069, FmdA_AmdA, Acetamidase/Formamidase family.  This family
           includes amidohydrolases of formamide EC:3.5.1.49 and
           acetamide. Methylophilus methylotrophus fmdA forms a
           homotrimer suggesting all the members of this family
           also do.
          Length = 316

 Score = 40.5 bits (95), Expect = 0.004
 Identities = 21/100 (21%), Positives = 34/100 (34%), Gaps = 19/100 (19%)

Query: 274 FMPANGESCFVQTDMKISIDVSVNLLKEEAVKY----NSEPPNPLKERQIHGVLVIKDSS 329
               +GE CF   +M   + V V ++K              P P++ R    ++V     
Sbjct: 189 ASQGDGEVCFCAIEMSGEVTVKVEVIKGGLEWPGLETPPFDPGPVEPRFYDWLIV----- 243

Query: 330 ISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLEL 369
              + VDEA KE            +  I+       T+E 
Sbjct: 244 -EGISVDEAIKEH---------YLDATIAYRRACLNTIEY 273


>gnl|CDD|217904 pfam04113, Gpi16, Gpi16 subunit, GPI transamidase component.  GPI
           (glycosyl phosphatidyl inositol) transamidase is a
           multi-protein complex. Gpi16, Gpi8 and Gaa1 for a
           sub-complex of the GPI transamidase. GPI transamidase
           that adds glycosylphosphatidylinositols (GPIs) to newly
           synthesised proteins. Gpi16 is an essential
           N-glycosylated transmembrane glycoprotein. Gpi16 is
           largely found on the lumenal side of the ER. It has a
           single C-terminal transmembrane domain and a small
           C-terminal, cytosolic extension with an ER retrieval
           motif.
          Length = 559

 Score = 37.4 bits (87), Expect = 0.039
 Identities = 36/237 (15%), Positives = 81/237 (34%), Gaps = 47/237 (19%)

Query: 267 QEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLV-I 325
           Q   +D     + +    +  M+ ++DV +++ + +  +   +   P         L   
Sbjct: 211 QSMAIDVRPVCDDDQGECKLQMEQTVDVVLDVERSKRPRLKPDTQKPCHSDDTCFPLGDT 270

Query: 326 KDSSISLMD-----VDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQD 380
            + S SL +     +  +C     S+  ++   +  ++ S    +        + ++  D
Sbjct: 271 SEVSWSLSELFGRPLKGSCPLADSSKVCLQVPESRSVNVSPGAREVKG-TDGFIRTFDLD 329

Query: 381 KSQEYKIQLTE----SSLSIDSVLLSVER--IDDKNKRVFVTWPNQDEEVGKIVLHVLKS 434
            S  + ++L      + ++ D  +L  ER       +R            G + +     
Sbjct: 330 NSTPFDLKLPWPESTTVVTSDQPVLKAERSLTGHGQER------------GGLRIIF--- 374

Query: 435 MSNTQGQP-----FEFYPYTDTVFVLSSSRYVQGTYYLHTYLTSVTKHQKETMELIK 486
            +N          FE  P+           +V+   YLHT    +   QK T +LI+
Sbjct: 375 -TNPSSSTVEFIYFEILPW-----------FVRV--YLHTLQIKIDGQQKPTKDLIE 417


>gnl|CDD|100107 cd02440, AdoMet_MTases, S-adenosylmethionine-dependent
           methyltransferases (SAM or AdoMet-MTase), class I;
           AdoMet-MTases are enzymes that use
           S-adenosyl-L-methionine (SAM or AdoMet) as a substrate
           for methyltransfer, creating the product
           S-adenosyl-L-homocysteine (AdoHcy). There are at least
           five structurally distinct families of AdoMet-MTases,
           class I being the largest and most diverse. Within this
           class enzymes can be classified by different substrate
           specificities (small molecules, lipids, nucleic acids,
           etc.) and different target atoms for methylation
           (nitrogen, oxygen, carbon, sulfur, etc.).
          Length = 107

 Score = 33.2 bits (76), Expect = 0.16
 Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 9/70 (12%)

Query: 693 IIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE---------DSEKESK 743
           ++DLG G G L+  LA   G +   +D + V    A   +  L          D+E+   
Sbjct: 2   VLDLGCGTGALALALASGPGARVTGVDISPVALELARKAAAALLADNVEVLKGDAEELPP 61

Query: 744 GPFQSYAGVI 753
              +S+  +I
Sbjct: 62  EADESFDVII 71


>gnl|CDD|218968 pfam06276, FhuF, Ferric iron reductase FhuF-like transporter.  This
            family consists of several bacterial ferric iron
            reductase protein (FhuF) sequences. FhuF is involved in
            the reduction of ferric iron in cytoplasmic ferrioxamine
            B. This family also includes the IucA and IucC proteins.
          Length = 163

 Score = 32.0 bits (73), Expect = 0.83
 Identities = 20/99 (20%), Positives = 31/99 (31%), Gaps = 25/99 (25%)

Query: 1004 YFLKTALAPVIEALIVLDRVL--------------------YLREQQKPQIISNLFLSLL 1043
             +L   L P++  L+     L                    YLR+ Q  ++  +  L L 
Sbjct: 6    RYLGLLLPPLLHLLLAYGIALEAHGQNVLLVLDEDGRPVRLYLRDFQGVRLSPDPLLELP 65

Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEAL 1082
                      L    +   E +  YY     L P+I AL
Sbjct: 66   EAEKSS----LLYDDEELRERIF-YYLFVNHLRPLIAAL 99


>gnl|CDD|129037 smart00801, dDENN, Domain always found downstream of DENN domain,
           found in a variety of signalling proteins.  The dDENN
           domain is part of the tripartite DENN domain. It is
           always found downstream of the DENN domain itself, which
           is found in a variety of signalling proteins involved in
           Rab-mediated processes or regulation of MAPKs signalling
           pathways. The DENN domain is always encircled on both
           sides by more divergent domains, called uDENN (for
           upstream DENN) and dDENN (for downstream DENN). The
           function of the DENN domain remains to date unclear,
           although it appears to represent a good candidate for a
           GTP/GDP exchange activity.
          Length = 69

 Score = 29.2 bits (66), Expect = 1.9
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 125 NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVF 160
           N +IR+ F++  +  F   R F +     PGP++ F
Sbjct: 1   NDEIREAFLRFFVNLFGGYRNFLRELRKEPGPVITF 36


>gnl|CDD|221897 pfam13017, Maelstrom, piRNA pathway germ-plasm component.
           Maelstrom is a germ-plasm component protein, that is
           shown to be functionally involved in the piRNA pathway.
           It is conserved throughout Eukaryota, though it appears
           to have been lost from all examined teleost fish
           species. The domain architecture shows that it is
           coupled with several DNA- and RNA- related domains such
           as HMG box, SR-25-like and HDAC_interact domains.
           Sequence analysis and fold recognition have found a
           distant similarity between Maelstrom domain and the DnaQ
           3'-5' exonuclease family with the RNase H fold
           (Exonuc_X-T, pfam00929); notably, that the Maelstrom
           domains from basal eukaryotes contain the conserved
           3'-5' exonuclease active site residues
           (Asp-Glu-Asp-His-Asp, DEDHD). However, the animal and
           some amoeba maelstrom contain another set of conserved
           residues (Glu-His-His-Cys-His-Cys, EHHCHC). This
           evolutionary link together with structural examinations
           leads to the hypothesis that Maelstrom domains may have
           a potential nuclease-transposase activity or RNA-binding
           ability that may be implicated in piRNA biogenesis. A
           protein function evolution mode, namely "active site
           switch", has been proposed, in which the amoeba
           Maelstrom domains are the possible evolutionary
           intermediates due to their harbouring of the specific
           characteristics of both 3'-5' exonuclease and Maelstrom
           domains.
          Length = 213

 Score = 30.8 bits (70), Expect = 2.7
 Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 2/67 (2%)

Query: 218 VKMAIEYMSFSAHADAKGIMQLIQY--CEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFM 275
           VK  ++Y++  A    +  +   QY     K  +L +  + K+   K       N D + 
Sbjct: 99  VKSCLKYLASDAGESERIKVYDFQYLFFVLKKEVLDYAGSIKVFPSKHITDALLNKDFYE 158

Query: 276 PANGESC 282
             +G SC
Sbjct: 159 YTSGISC 165


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination, and
            repair].
          Length = 1187

 Score = 31.2 bits (71), Expect = 3.7
 Identities = 15/81 (18%), Positives = 22/81 (27%)

Query: 927  TLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEV 986
                  R L      S          + GRL         Y    + KLK   +VD    
Sbjct: 425  EESDPKRLLYLLSALSDREARSRLEGLAGRLRRIIRRLSPYSLLKLMKLKKRPDVDRHLA 484

Query: 987  TRLFNSHQREYEYLQIYYFLK 1007
              +    +   E L+    +K
Sbjct: 485  EEVRTLAEFVRELLKDEERVK 505


>gnl|CDD|220645 pfam10239, DUF2465, Protein of unknown function (DUF2465).  FAM98A
           and B proteins are found from worms to humans but their
           function is unknown. This entry is of a family of
           proteins that is rich in glycines.
          Length = 318

 Score = 30.2 bits (68), Expect = 5.3
 Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 10/85 (11%)

Query: 567 RLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNS 626
           RL S  +   ++ +  L S+LQA    ++V   K       K + E +  S V   +   
Sbjct: 89  RLKSKED--CLLLLLFLSSELQA---ARIVLKKKPE-----KMALEEEKCSPVFQELKLI 138

Query: 627 CDSSHIIDLGSTITVDELFSKLQAR 651
           C +  +    + IT+ ELFS ++ +
Sbjct: 139 CATLGLPKPPADITLSELFSGIEKK 163


>gnl|CDD|173809 cd07483, Peptidases_S8_Subtilisin_Novo-like, Peptidase S8 family
           domain in Subtilisin_Novo-like proteins.  Subtilisins
           are a group of alkaline proteinases originating from
           different strains of Bacillus subtilis.  Novo is one of
           the strains that produced enzymes belonging to this
           group.  The enzymes obtained from the Novo and BPN'
           strains are identical.  The Carlsburg and Novo
           subtilisins are thought to have arisen from a common
           ancestral protein.  They have similar peptidase and
           esterase activities, pH profiles, catalyze
           transesterification reactions, and are both inhibited by
           diispropyl fluorophosphate, though they differ in 85
           positions in the amino acid sequence.  Members of the
           peptidases S8 and S35 clan include endopeptidases,
           exopeptidases and also a tripeptidyl-peptidase. The S8
           family has an Asp/His/Ser catalytic triad similar to
           that found in trypsin-like proteases, but do not share
           their three-dimensional structure and are not homologous
           to trypsin. The S53 family contains a catalytic triad
           Glu/Asp/Ser with an additional acidic residue Asp in the
           oxyanion hole, similar to that of subtilisin.. The
           stability of these enzymes may be enhanced by calcium,
           some members have been shown to bind up to 4 ions via
           binding sites with different affinity. Some members of
           this clan contain disulfide bonds. These enzymes can be
           intra- and extracellular, some function at extreme
           temperatures and pH values.
          Length = 291

 Score = 30.0 bits (68), Expect = 5.9
 Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 12/48 (25%)

Query: 217 DVKMAIEY----------MSFSAH--ADAKGIMQLIQYCEPKNVLLVH 252
           D+  AI Y          MSF      + + +   I+Y E K VL+VH
Sbjct: 129 DIANAIRYAVDNGAKVINMSFGKSFSPNKEWVDDAIKYAESKGVLIVH 176


>gnl|CDD|223668 COG0595, COG0595, mRNA degradation ribonucleases J1/J2
           (metallo-beta-lactamase superfamily) [Translation,
           ribosomal structure and biogenesis; Replication,
           recombination and repair].
          Length = 555

 Score = 30.3 bits (69), Expect = 7.4
 Identities = 17/50 (34%), Positives = 22/50 (44%), Gaps = 7/50 (14%)

Query: 4   VKVGNQSIVYTGDYNMTPDRHLGA-------AWIDKCRPDLLITESTYAT 46
           +K    +IVYTGD+   P    G        A I K     LI++ST A 
Sbjct: 154 IKTPEGNIVYTGDFKFDPTPVDGEPTDLARLAEIGKEGVLALISDSTNAE 203


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 30.1 bits (68), Expect = 7.5
 Identities = 18/107 (16%), Positives = 39/107 (36%), Gaps = 13/107 (12%)

Query: 484 LIKNTIDEMIEFITPNIACVNTHTV-HFMVEKLWET--TIPKAILSQVNNSPSFDSLLEE 540
            + N ++++ E  +       +       V  L +   ++P        N   FDSL+ +
Sbjct: 476 ALGNLLNKLDELWSKVSDAKESSISSSRTVLSLLDELPSLPDHD----QNITEFDSLISQ 531

Query: 541 FWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKL 587
                Q   +  +S +    Q           + ES  ++D+++ KL
Sbjct: 532 LSSHYQSEDLTFSSLINFLNQLIHVL------SEESRTSLDDIYDKL 572


>gnl|CDD|224370 COG1453, COG1453, Predicted oxidoreductases of the aldo/keto
            reductase family [General function prediction only].
          Length = 391

 Score = 29.6 bits (67), Expect = 9.7
 Identities = 34/148 (22%), Positives = 49/148 (33%), Gaps = 42/148 (28%)

Query: 931  FYRALLEKYLRSKITINDEEPKVVGRLATK-CSNFVEYVHRAVDKLKLDLEVDDEEVTRL 989
            F    L+   R K+            LATK  S  V+               D E++ R+
Sbjct: 65   FLGKALKDGYREKVK-----------LATKLPSWPVK---------------DREDMERI 98

Query: 990  FNSHQREYEYLQIYYF-------LKTALAPVIEALIVLDRVLYLREQQKPQIISNLFLSL 1042
            FN    + E L   Y        L T     IE L V D   +L + +    I N   S 
Sbjct: 99   FN---EQLEKLGTDYIDYYLIHGLNTETWEKIERLGVFD---FLEKAKAEGKIRNAGFS- 151

Query: 1043 LTQVDDEEVTRLFDSHQREYEYLQIYYF 1070
                  E    + D++  ++  LQ  Y 
Sbjct: 152  -FHGSTEVFKEIVDAYPWDFVQLQYNYI 178


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.134    0.387 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 54,659,579
Number of extensions: 5367131
Number of successful extensions: 4483
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4460
Number of HSP's successfully gapped: 34
Length of query: 1101
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 994
Effective length of database: 6,191,724
Effective search space: 6154573656
Effective search space used: 6154573656
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.3 bits)