RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14776
         (1022 letters)



>gnl|CDD|238333 cd00593, RIBOc, RIBOc. Ribonuclease III C terminal domain. This
           group consists of eukaryotic, bacterial and archeal
           ribonuclease III (RNAse III) proteins. RNAse III is a
           double stranded RNA-specific endonuclease. Prokaryotic
           RNAse III is important in post-transcriptional control
           of mRNA stability and translational efficiency. It is
           involved in the processing of ribosomal RNA precursors.
           Prokaryotic RNAse III also plays a role in the
           maturation of tRNA precursors and in the processing of
           phage and plasmid transcripts. Eukaryotic RNase III's
           participate (through direct cleavage) in rRNA
           processing, in processing of small nucleolar RNAs
           (snoRNAs) and snRNA's (components of the spliceosome).
           In eukaryotes RNase III or RNaseIII like enzymes such as
           Dicer are involved in RNAi (RNA interference) and miRNA
           (micro-RNA) gene silencing.
          Length = 133

 Score =  144 bits (366), Expect = 2e-40
 Identities = 47/120 (39%), Positives = 69/120 (57%)

Query: 812 RKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLA 871
              A E    +NERLEFLGDAV+E + + +LF  FPD+ EG L   R+A+V N+ LA LA
Sbjct: 10  PSYANEHGRFNNERLEFLGDAVLELVVTEYLFKKFPDLSEGDLTRLRSALVSNETLARLA 69

Query: 872 KRIGLEDFMLYAHGSDLCHEVELKHAMANCFEALMGALFLDGGIDVADQVFSSVLYKDNE 931
           + +GL  ++    G +          +A+ FEAL+GA++LDGG + A +    +L    E
Sbjct: 70  RELGLGKYLRLGKGEEKSGGRLRPKILADVFEALIGAIYLDGGFEAARKFLLRLLGPLIE 129



 Score = 56.9 bits (138), Expect = 1e-09
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 981  RLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
             LL    T  S    +    +N+RLEFLGD VL+L+ ++YL+
Sbjct: 1    SLLLEALTHPSYANEHGRF-NNERLEFLGDAVLELVVTEYLF 41


>gnl|CDD|197778 smart00535, RIBOc, Ribonuclease III family. 
          Length = 129

 Score =  143 bits (364), Expect = 4e-40
 Identities = 44/109 (40%), Positives = 71/109 (65%)

Query: 818 SNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLE 877
           +   HNERLEFLGDAV+E + + +L+  +PD+ EG L+  R+A+V N+ LA LAK++GL 
Sbjct: 14  NEHEHNERLEFLGDAVLELVVTEYLYKKYPDLSEGDLSRLRSALVSNETLARLAKKLGLG 73

Query: 878 DFMLYAHGSDLCHEVELKHAMANCFEALMGALFLDGGIDVADQVFSSVL 926
           +F+    G  +    +    +A+ FEAL+GA++LD G++ A +    +L
Sbjct: 74  EFIRLGRGEAISGGRDKPKILADVFEALIGAIYLDSGLEAAREFIRDLL 122



 Score = 52.6 bits (127), Expect = 4e-08
 Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 3/42 (7%)

Query: 981  RLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
             LL R  T  S    +     N+RLEFLGD VL+L+ ++YLY
Sbjct: 1    SLLLRALTHASYSNEH---EHNERLEFLGDAVLELVVTEYLY 39



 Score = 29.5 bits (67), Expect = 3.9
 Identities = 9/18 (50%), Positives = 9/18 (50%)

Query: 750 LLQLALTHPSYRENFGTN 767
           LL  ALTH SY      N
Sbjct: 2   LLLRALTHASYSNEHEHN 19


>gnl|CDD|234633 PRK00102, rnc, ribonuclease III; Reviewed.
          Length = 229

 Score =  134 bits (341), Expect = 1e-35
 Identities = 57/203 (28%), Positives = 86/203 (42%), Gaps = 67/203 (33%)

Query: 733 SLNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDRRIHF 792
            L  L+ ++ Y F++  LL  ALTH SY                                
Sbjct: 4   DLEELQKKLGYTFKDPELLIQALTHRSYANENKGLKH----------------------- 40

Query: 793 QNTRKRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEG 852
                                         NERLEFLGDAV+E + S +LF  FPD++EG
Sbjct: 41  ------------------------------NERLEFLGDAVLELVVSEYLFKRFPDLDEG 70

Query: 853 GLATYRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKH-------AMANCFEAL 905
            L+  RAA+V+ + LA +A+ +GL +++L   G       E K         +A+ FEAL
Sbjct: 71  DLSKLRAALVREESLAEIARELGLGEYLLLGKG-------EEKSGGRRRPSILADAFEAL 123

Query: 906 MGALFLDGGIDVADQVFSSVLYK 928
           +GA++LD G++ A +    +   
Sbjct: 124 IGAIYLDQGLEAARKFILRLFEP 146



 Score = 65.9 bits (162), Expect = 7e-12
 Identities = 26/59 (44%), Positives = 34/59 (57%)

Query: 964  KQLQYLEYSSGVIFNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
            K L+ L+   G  F    LL +  T RS    N  L  N+RLEFLGD VL+L+ S+YL+
Sbjct: 3    KDLEELQKKLGYTFKDPELLIQALTHRSYANENKGLKHNERLEFLGDAVLELVVSEYLF 61


>gnl|CDD|223644 COG0571, Rnc, dsRNA-specific ribonuclease [Transcription].
          Length = 235

 Score =  125 bits (316), Expect = 3e-32
 Identities = 54/215 (25%), Positives = 90/215 (41%), Gaps = 57/215 (26%)

Query: 729 RFHRSLNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDR 788
              + L  LE ++ Y F+++ LL+ ALTH SY                            
Sbjct: 3   MMIKKLEALEKKLGYTFKDKELLEQALTHRSYANEHKAVE-------------------- 42

Query: 789 RIHFQNTRKRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPD 848
                                            +NERLEFLGDAV+  + + +LF  +P+
Sbjct: 43  ---------------------------------NNERLEFLGDAVLGLVVAEYLFKKYPN 69

Query: 849 IEEGGLATYRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKHAMANCFEALMGA 908
           + EG L+  RAA+V  + LA +A+ +GL D++    G +       +  +A+ FEAL+GA
Sbjct: 70  LPEGELSKLRAALVSEESLAEIARELGLGDYLRLGKGEEKSGGRRRESILADAFEALIGA 129

Query: 909 LFLDGGIDVADQVFSSVLYKDNEVRFKQWMYYPQH 943
           ++LD G++ A +    +       R ++     Q 
Sbjct: 130 IYLDSGLEAARKFILKLFLP----RLEEIDAGDQF 160



 Score = 65.7 bits (161), Expect = 9e-12
 Identities = 24/59 (40%), Positives = 35/59 (59%)

Query: 964  KQLQYLEYSSGVIFNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
            K+L+ LE   G  F    LL +  T RS    +  + +N+RLEFLGD VL L+ ++YL+
Sbjct: 6    KKLEALEKKLGYTFKDKELLEQALTHRSYANEHKAVENNERLEFLGDAVLGLVVAEYLF 64


>gnl|CDD|233772 TIGR02191, RNaseIII, ribonuclease III, bacterial.  This family
           consists of bacterial examples of ribonuclease III. This
           enzyme cleaves double-stranded rRNA. It is involved in
           processing ribosomal RNA precursors. It is found even in
           minimal genones such as Mycoplasma genitalium and
           Buchnera aphidicola, and in some cases has been shown to
           be an essential gene. These bacterial proteins contain a
           double-stranded RNA binding motif (pfam00035) and a
           ribonuclease III domain (pfam00636). Eukaryotic homologs
           tend to be much longer proteins with additional domains,
           localized to the nucleus, and not included in this
           family [Transcription, RNA processing].
          Length = 220

 Score =  123 bits (310), Expect = 1e-31
 Identities = 60/192 (31%), Positives = 88/192 (45%), Gaps = 52/192 (27%)

Query: 737 LEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDRRIHFQNTR 796
           LE R+ YKF+N+ LL+ ALTH SY      N  H                          
Sbjct: 1   LEKRLGYKFKNKELLEQALTHSSYA-----NEHHKGVK---------------------- 33

Query: 797 KRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLAT 856
                                    +NERLEFLGDAV+  + + +LF  FPD+ EG L+ 
Sbjct: 34  -------------------------NNERLEFLGDAVLGLVVAEYLFKNFPDLSEGELSK 68

Query: 857 YRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKHAMANCFEALMGALFLDGGID 916
            RAA+V  + LA +A+ +GL  F+L   G +       +  +A+ FEAL+GA++LD G++
Sbjct: 69  LRAALVSEESLAEVARELGLGKFLLLGKGEEKSGGRRRESILADAFEALIGAIYLDSGLE 128

Query: 917 VADQVFSSVLYK 928
            A +    +L  
Sbjct: 129 AARKFILKLLIP 140



 Score = 53.4 bits (129), Expect = 1e-07
 Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 8/62 (12%)

Query: 962  LLKQLQYLEYSSGVIFNHIRLLARVFTTRSMGY-TNLTLGSNQRLEFLGDTVLQLITSDY 1020
            L K+L       G  F +  LL +  T  S     +  + +N+RLEFLGD VL L+ ++Y
Sbjct: 1    LEKRL-------GYKFKNKELLEQALTHSSYANEHHKGVKNNERLEFLGDAVLGLVVAEY 53

Query: 1021 LY 1022
            L+
Sbjct: 54   LF 55


>gnl|CDD|216036 pfam00636, Ribonuclease_3, Ribonuclease III domain. 
          Length = 91

 Score =  110 bits (278), Expect = 5e-29
 Identities = 42/91 (46%), Positives = 60/91 (65%)

Query: 824 ERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLEDFMLYA 883
           ERLEFLGDAV+E   S +L+  FPD+ EG L   R+A+V N+ LA LA+++GL+ ++   
Sbjct: 1   ERLEFLGDAVLELYVSEYLYEKFPDLSEGELHRLRSALVSNEALAKLARKLGLDKYLREG 60

Query: 884 HGSDLCHEVELKHAMANCFEALMGALFLDGG 914
            G  +         +A+ FEAL+GAL+LDGG
Sbjct: 61  EGDIVKRASVKDKVLADAFEALIGALYLDGG 91



 Score = 39.5 bits (93), Expect = 6e-04
 Identities = 14/20 (70%), Positives = 17/20 (85%)

Query: 1003 QRLEFLGDTVLQLITSDYLY 1022
            +RLEFLGD VL+L  S+YLY
Sbjct: 1    ERLEFLGDAVLELYVSEYLY 20


>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
          Length = 413

 Score = 74.6 bits (183), Expect = 6e-14
 Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 14/108 (12%)

Query: 821 THNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLEDFM 880
           THNERLEFLGD+V+    +  LF  F  ++EG L+  RA +V+ Q L  +A+ + + + +
Sbjct: 32  THNERLEFLGDSVLNCAVAALLFQRFGKLDEGDLSRVRANLVKQQSLYEIAQALNISEGL 91

Query: 881 LYAHGSDLCHEVELKHA-------MANCFEALMGALFLDGGIDVADQV 921
               G       EL+         +A+ FEA++GA+FLDGG + A  V
Sbjct: 92  RLGEG-------ELRSGGFRRPSILADAFEAIIGAVFLDGGFEAAQGV 132



 Score = 36.0 bits (83), Expect = 0.10
 Identities = 18/48 (37%), Positives = 28/48 (58%)

Query: 733 SLNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGV 780
            L+ LE R++Y+F+N  LL+ ALTH S+        +   +S+ NC V
Sbjct: 2   PLSQLESRLRYEFRNAELLRQALTHRSHSATHNERLEFLGDSVLNCAV 49



 Score = 35.6 bits (82), Expect = 0.13
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 12/55 (21%)

Query: 959  QSPLLKQLQYLEYSSGVIFNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVL 1013
             S L  +L+Y        F +  LL +  T RS   T+     N+RLEFLGD+VL
Sbjct: 3    LSQLESRLRYE-------FRNAELLRQALTHRSHSATH-----NERLEFLGDSVL 45


>gnl|CDD|171444 PRK12371, PRK12371, ribonuclease III; Reviewed.
          Length = 235

 Score = 70.9 bits (174), Expect = 2e-13
 Identities = 51/185 (27%), Positives = 75/185 (40%), Gaps = 61/185 (32%)

Query: 737 LEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDRRIHFQNTR 796
           LE+R  ++F N+  L+ ALTH S                     R  + G+         
Sbjct: 15  LEERTGHRFANKERLERALTHSS--------------------ARASKQGNY-------- 46

Query: 797 KRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLAT 856
                                      ERLEFLGD V+    +  LF  FPD  EG L+ 
Sbjct: 47  ---------------------------ERLEFLGDRVLGLCVAEMLFEAFPDASEGELSV 79

Query: 857 YRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKHAM---ANCFEALMGALFLDG 913
               +V  +  A +A  IGL D +    GSD+  ++  K  +   A+  EAL+ A++LDG
Sbjct: 80  RLNQLVNAETCAAIADEIGLHDLI--RTGSDV-KKLTGKRLLNVRADVVEALIAAIYLDG 136

Query: 914 GIDVA 918
           G++ A
Sbjct: 137 GLEAA 141



 Score = 33.9 bits (78), Expect = 0.34
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 1000 GSNQRLEFLGDTVLQLITSDYLY 1022
            G+ +RLEFLGD VL L  ++ L+
Sbjct: 44   GNYERLEFLGDRVLGLCVAEMLF 66


>gnl|CDD|173181 PRK14718, PRK14718, ribonuclease III; Provisional.
          Length = 467

 Score = 73.3 bits (179), Expect = 2e-13
 Identities = 39/101 (38%), Positives = 58/101 (57%)

Query: 821 THNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLEDFM 880
           THNERLEFLGD+V+    +  LF  F  ++EG L+  RA +V+ Q L  +A+ + + D +
Sbjct: 32  THNERLEFLGDSVLNCAVAALLFQRFGKLDEGDLSRVRANLVKQQSLYEIAQALNISDGL 91

Query: 881 LYAHGSDLCHEVELKHAMANCFEALMGALFLDGGIDVADQV 921
               G            +A+ FEA++GA+FLDGG + A  V
Sbjct: 92  RLGEGELRSGGFRRPSILADAFEAIIGAVFLDGGFEAAQGV 132



 Score = 35.2 bits (80), Expect = 0.18
 Identities = 18/47 (38%), Positives = 28/47 (59%)

Query: 734 LNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGV 780
           L+ LE R++Y+F+N  LL+ ALTH S+        +   +S+ NC V
Sbjct: 3   LSQLESRLRYEFRNAELLRQALTHRSHSATHNERLEFLGDSVLNCAV 49



 Score = 30.9 bits (69), Expect = 3.5
 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 5/46 (10%)

Query: 977  FNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
            F +  LL +  T RS   T+     N+RLEFLGD+VL    +  L+
Sbjct: 14   FRNAELLRQALTHRSHSATH-----NERLEFLGDSVLNCAVAALLF 54


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 45.7 bits (108), Expect = 1e-04
 Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
           SR   R   S +R  R  SR ++     R  +S+S   R+      RRSR + R  S+S 
Sbjct: 32  SRFRDRHRRSRERSYREDSRPRDR----RRYDSRS--PRSLRYSSVRRSRDRPRRRSRSV 85

Query: 199 SEVPRQACARRSRSRSRILSPKRS 222
             + +    RR R RS     ++ 
Sbjct: 86  RSIEQH--RRRLRDRSPSNQWRKD 107



 Score = 42.6 bits (100), Expect = 0.001
 Identities = 28/95 (29%), Positives = 35/95 (36%), Gaps = 1/95 (1%)

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
           SR   RD  S + R RS  RS+      RS            + +R  SRS       S 
Sbjct: 12  SRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSV 71

Query: 199 SEVPRQACARRSRSRSRILSPKRSGEALSNLDKWR 233
               R    RRSRS   I   +R     S  ++WR
Sbjct: 72  RR-SRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWR 105



 Score = 36.8 bits (85), Expect = 0.068
 Identities = 23/103 (22%), Positives = 32/103 (31%), Gaps = 1/103 (0%)

Query: 144 RDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPR 203
            D E  + R R   RS         + S+       +  +  R  S+ R   +  S  PR
Sbjct: 5   PDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPR 64

Query: 204 QACARRSRSRSRILSPKRSGEALSNLDKWRMNNCSSEREMTEK 246
            +    S  RSR    +RS    S     R     S      K
Sbjct: 65  -SLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRK 106



 Score = 33.3 bits (76), Expect = 0.81
 Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 4/53 (7%)

Query: 139 SRSMHRDDESNQRRNRSS----SRSKNNGYSHRSENSKSSPTRTETNLKRRRS 187
           SRS      S+ RR+R      SRS  +   HR      SP+       ++RS
Sbjct: 60  SRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRS 112



 Score = 32.2 bits (73), Expect = 1.6
 Identities = 16/65 (24%), Positives = 23/65 (35%), Gaps = 4/65 (6%)

Query: 140 RSMHRDDESNQRRNRSSSRSKNNGYSH----RSENSKSSPTRTETNLKRRRSRSKSRSES 195
           RS   D     RR   S   ++  YS     R    + S +       RRR R +S S  
Sbjct: 44  RSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQ 103

Query: 196 KSRSE 200
             + +
Sbjct: 104 WRKDD 108


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 39.9 bits (93), Expect = 0.001
 Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 5/87 (5%)

Query: 138 MSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENS---KSSPTRTETNLKRRRSRSKSRSE 194
           M RS  R    ++RR RS SR +      RS +    +   +R+ +  + RRSRS  R  
Sbjct: 1   MGRSRSRSPRRSRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHR 60

Query: 195 SKSRSEVPRQACARRSRSRSRILSPKR 221
           S+SRS   R+   R     +R   PK+
Sbjct: 61  SRSRSPSRRRDRKRERDKDAR--EPKK 85



 Score = 36.4 bits (84), Expect = 0.029
 Identities = 27/82 (32%), Positives = 35/82 (42%), Gaps = 18/82 (21%)

Query: 142 MHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSES-KSRSE 200
           M R    + RR+R   RS++     R                R RSRS+ R    +SRS 
Sbjct: 1   MGRSRSRSPRRSRRRGRSRSRDRRERR---------------RERSRSRERDRRRRSRSR 45

Query: 201 VPRQACARRSRSRSRILSPKRS 222
            P +  +RRSRS  R  S  RS
Sbjct: 46  SPHR--SRRSRSPRRHRSRSRS 65



 Score = 34.1 bits (78), Expect = 0.16
 Identities = 24/78 (30%), Positives = 33/78 (42%), Gaps = 2/78 (2%)

Query: 139 SRSMHRD-DESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKS 197
           SR   R      + R R  SRS+      RS +     +R   + +R RSRS+S S  + 
Sbjct: 12  SRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRD 71

Query: 198 RSEVPRQACARRSRSRSR 215
           R    R   AR  + R R
Sbjct: 72  RKR-ERDKDAREPKKRER 88


>gnl|CDD|219061 pfam06495, Transformer, Fruit fly transformer protein.  This family
           consists of transformer proteins from several Drosophila
           species and also from Ceratitis capitata (Mediterranean
           fruit fly). The transformer locus (tra) produces an RNA
           processing protein that alternatively splices the
           doublesex pre-mRNA in the sex determination hierarchy of
           Drosophila melanogaster.
          Length = 182

 Score = 38.9 bits (90), Expect = 0.006
 Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 2/67 (2%)

Query: 150 QRRNRSSSRSKNNGYSHRSEN-SKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACAR 208
           ++R   S+R   +    R+ + S+S      +  +R RSRS+SR+ S SR      +  R
Sbjct: 40  RQRKTQSTRPTTSHRGRRTRSRSRSQSAERNSCQRRHRSRSRSRNRSDSRHRST-SSTER 98

Query: 209 RSRSRSR 215
           R RSRSR
Sbjct: 99  RRRSRSR 105



 Score = 38.5 bits (89), Expect = 0.008
 Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 3/51 (5%)

Query: 172 KSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRS 222
           K+  TR  T+ + RR+RS+SRS+S  R+   R+    RSRSRSR  S  R 
Sbjct: 43  KTQSTRPTTSHRGRRTRSRSRSQSAERNSCQRR---HRSRSRSRNRSDSRH 90



 Score = 35.8 bits (82), Expect = 0.055
 Identities = 26/54 (48%), Positives = 34/54 (62%), Gaps = 8/54 (14%)

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSR 192
           SRS   +  S QRR+RS SRS+N     RS++   S + TE   +RRRSRS+SR
Sbjct: 62  SRSQSAERNSCQRRHRSRSRSRN-----RSDSRHRSTSSTE---RRRRSRSRSR 107



 Score = 29.2 bits (65), Expect = 7.5
 Identities = 18/54 (33%), Positives = 21/54 (38%), Gaps = 1/54 (1%)

Query: 166 HRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSP 219
            R   S    T       R RSRS+S +E  S     R     R+RS SR  S 
Sbjct: 41  QRKTQSTRPTTSHRGRRTRSRSRSQS-AERNSCQRRHRSRSRSRNRSDSRHRST 93


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 39.4 bits (92), Expect = 0.012
 Identities = 37/223 (16%), Positives = 59/223 (26%), Gaps = 38/223 (17%)

Query: 8   SSHSSRGSCSFSSPMSEDVKPNIQQMALNEPSSSALQFNTVDVKQPPPGFSQSCN--HKI 65
           S  +S    +      E+        A + P +            PP   S         
Sbjct: 164 SDAASSRQAALPLSSPEETAR-----APSSPPA------EPPPSTPPAAASPRPPRRSSP 212

Query: 66  LVSKSFD--CKTGTTQVDNLPSTNVSNQSNPITPQTSTGRTNQYNRWTICPAPMCQISPT 123
           + + +       G +  D+  +++  + S+  +                CP P       
Sbjct: 213 ISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENE-------CPLP------- 258

Query: 124 QLHLKHPLAGQTNIMSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSS----PTRTE 179
                 P               +  + R   +SS S     S     S       P+   
Sbjct: 259 -----RPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPR 313

Query: 180 TNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRS 222
            +     SR  S S + S SE  R A      S SR  SP R 
Sbjct: 314 ASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP 356



 Score = 34.8 bits (80), Expect = 0.27
 Identities = 17/77 (22%), Positives = 25/77 (32%)

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
           S S     ES+     SSS S          +   SP+ +        S  + R      
Sbjct: 315 SSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRA 374

Query: 199 SEVPRQACARRSRSRSR 215
              P  +  R +R R+R
Sbjct: 375 PSSPAASAGRPTRRRAR 391



 Score = 31.7 bits (72), Expect = 2.4
 Identities = 17/95 (17%), Positives = 23/95 (24%), Gaps = 9/95 (9%)

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
           S S      +      S SRS +           SSP +     +   S + S      R
Sbjct: 330 SSSSESSRGAAVSPGPSPSRSPSPS-RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRR 388

Query: 199 --------SEVPRQACARRSRSRSRILSPKRSGEA 225
                       R A  R    R R         +
Sbjct: 389 RARAAVAGRARRRDATGRFPAGRPRPSPLDAGAAS 423



 Score = 30.5 bits (69), Expect = 5.6
 Identities = 19/75 (25%), Positives = 27/75 (36%), Gaps = 5/75 (6%)

Query: 148 SNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACA 207
           S+   +R SS S  +  S  S  +  SP  + +     RS S SR    +    PR+   
Sbjct: 316 SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPS-----RSPSPSRPPPPADPSSPRKRPR 370

Query: 208 RRSRSRSRILSPKRS 222
                 S   S  R 
Sbjct: 371 PSRAPSSPAASAGRP 385


>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
           hydrophilic C-term.  This domain is a hydrophilic region
           found at the C-terminus of plant and metazoan
           pre-mRNA-splicing factor 38 proteins. The function is
           not known.
          Length = 97

 Score = 35.1 bits (81), Expect = 0.025
 Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 144 RDDESNQRRNRSSSRSKNNG-YSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVP 202
            ++E ++   R + R  + G  S R    + S  R  +  +RRR R + R+  + R +  
Sbjct: 17  EEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRD 76

Query: 203 RQACAR-RSRSRSR 215
           R    R RSRSRSR
Sbjct: 77  RDRYDRSRSRSRSR 90



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 16/62 (25%), Positives = 22/62 (35%)

Query: 138 MSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKS 197
             RS  R      RR + S + +           +    R      R RSRS+SRS  + 
Sbjct: 36  GRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRDRDRYDRSRSRSRSRSRDRR 95

Query: 198 RS 199
           R 
Sbjct: 96  RR 97


>gnl|CDD|237620 PRK14136, recX, recombination regulator RecX; Provisional.
          Length = 309

 Score = 35.0 bits (80), Expect = 0.16
 Identities = 26/97 (26%), Positives = 39/97 (40%), Gaps = 5/97 (5%)

Query: 137 IMSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESK 196
            +S    RD  +   R   +  +  +  + R+E +    TRT  + +R R  +       
Sbjct: 81  RVSGVGVRDAGAPGGRAADARAANLSSAAKRAEAAGDVYTRTSQHPRRTRRAAGPFHSDS 140

Query: 197 SRSEVPRQACARRSRSRSRILSPKRS--GEALSNLDK 231
           S S       A RSR+ SR   P RS  G AL  L +
Sbjct: 141 SPSASSEDDGAARSRASSR---PARSLKGRALGYLSR 174


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 33.6 bits (77), Expect = 0.16
 Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 5/77 (6%)

Query: 286 ECTQKTRTICDEFESVIQNFVSVKESKPQFDYTANRKKIKRT----NRKKIKRKCSHCAS 341
           EC QKT  +    + + Q     KE +        ++K  R     + KKI  K    +S
Sbjct: 47  EC-QKTSQLIHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105

Query: 342 NPTVSSGSESSSDSSSG 358
           + + SS S SSS S S 
Sbjct: 106 SSSSSSSSSSSSSSESS 122


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 34.5 bits (79), Expect = 0.27
 Identities = 21/72 (29%), Positives = 26/72 (36%), Gaps = 2/72 (2%)

Query: 144 RDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPR 203
           RD E  + RN +    K    S R   S+    R       R  R +SRS S +R    R
Sbjct: 3   RDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNR--YYR 60

Query: 204 QACARRSRSRSR 215
               R  R   R
Sbjct: 61  PRGDRSYRRDDR 72



 Score = 31.4 bits (71), Expect = 2.4
 Identities = 21/77 (27%), Positives = 29/77 (37%), Gaps = 7/77 (9%)

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
            R   R+D     + R  SR +    S   + S+    R     +R RSRS+S +    R
Sbjct: 6   ERGRLRNDTRRSDKGRERSRRR----SRSRDRSRRRRDRDYYRGRRGRSRSRSPN-RYYR 60

Query: 199 SEVPRQACARRSRSRSR 215
               R    RR   RS 
Sbjct: 61  PRGDRS--YRRDDRRSG 75


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 34.2 bits (78), Expect = 0.32
 Identities = 27/111 (24%), Positives = 44/111 (39%), Gaps = 12/111 (10%)

Query: 121 SPTQLHLKHPLAGQTNIMSRSMHRDDESNQRRNRSSSRSKNNGYSHRS----ENSKSSPT 176
           +P+      P        ++       +    NR+ +    +  S  S     ++   P 
Sbjct: 259 TPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRAGTPDDGSAISEDSAGGSHHTMRRPP 318

Query: 177 RTETNLKR--RRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRSGEA 225
            + +  +R  RR+RS+SRS S+SRS   R    RR R R     P R  E+
Sbjct: 319 HSTSGERRGRRRNRSESRSRSRSRSGSRRY---RRRRGRG---VPGRRSES 363


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
           is a family of proteins that seem to be involved,
           directly or indirectly, in the salt sensitivity of some
           cellular functions in yeast. These proteins also
           interact with the splicing factor Msl1p.
          Length = 189

 Score = 33.0 bits (75), Expect = 0.42
 Identities = 25/109 (22%), Positives = 41/109 (37%), Gaps = 7/109 (6%)

Query: 145 DDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEV--- 201
           DDE+++     S     +  +    N  SS    +   KR ++ S S + S   +     
Sbjct: 2   DDETHKYAQPHSQ-QNTDELNTVIHNDSSSSKSKKKKKKRSKATSPSHNASDQSTNNVMS 60

Query: 202 -PRQACARRSRSRSRILSPKRSGEALSNLDKWRMNNCSS--EREMTEKF 247
            P    AR         S +    A+ N  + R+ N S+  ERE   +F
Sbjct: 61  TPSAILARPQPLSYPFGSQQSQQNAVKNSKEKRIWNTSTQEERENIREF 109


>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966).  Family of
           plant proteins with unknown function.
          Length = 308

 Score = 33.5 bits (77), Expect = 0.51
 Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 12/113 (10%)

Query: 123 TQLHLKHPLAGQTNIMSRSMHRDDES----NQRRNRSSSRSKNNGYSHRSENSKSSPTRT 178
           +++       G     S    +++      N+R N+S S S    Y    +  + +   T
Sbjct: 86  SEILDSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEY-KVYKAEEPADAST 144

Query: 179 ETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRSGEALSNLDK 231
           +T+ +R R  S++ S   SR E+   + +    S S       S E L +L K
Sbjct: 145 QTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSS-------SPETLESLIK 190


>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal
           domain. 
          Length = 379

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 35/227 (15%), Positives = 56/227 (24%), Gaps = 36/227 (15%)

Query: 18  FSSPMSEDVKPNIQQMALNEPSSSALQFNTVDVKQPPPGFSQSCNHKILVSKSFDCKTGT 77
             SP S + +  +Q              +     Q     S+S N +     +   +   
Sbjct: 174 KGSPYSWEQEYLLQHGGQQHSHLQRHG-DEPFGAQSSGILSRSENRRTRNLANNTSRKSD 232

Query: 78  TQVDNLPSTNVSNQSNPITPQTSTGRTNQYNRWTICPAPMCQISPTQLHLKHPLAGQTNI 137
           T     P      Q + +  Q                         Q  L H   G++  
Sbjct: 233 TSRSVGPVRQSQIQRSRLGLQ-----------------------ANQGKLAHGQQGRSGS 269

Query: 138 MSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETN-----LKR-------R 185
           +    H           SSS    N  S  S     S  R          +R        
Sbjct: 270 IRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHSSSGHAV 329

Query: 186 RSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRSGEALSNLDKW 232
             RS       S++  P  +C       S+  S       ++ L+ W
Sbjct: 330 ELRSIPGGSVSSQNAGPLLSCWWLQFRNSKPCSDYCLHHIVNLLEDW 376


>gnl|CDD|109943 pfam00906, Hepatitis_core, Hepatitis core antigen.  The core
           antigen of hepatitis viruses possesses a carboxyl
           terminus rich in arginine. On this basis it was
           predicted that the core antigen would bind DNA. There is
           some experimental evidence to support this.
          Length = 182

 Score = 31.7 bits (72), Expect = 1.3
 Identities = 17/49 (34%), Positives = 22/49 (44%)

Query: 166 HRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRS 214
           +R  N+    T  ET + RRR RS  R     R    +    RRS+S S
Sbjct: 131 YRPPNAPILSTLPETIVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSPS 179


>gnl|CDD|241416 cd13262, PH_RasSynGAP-like, Synaptic Ras-GTPase activating protein
           family Pleckstrin homology (PH) domain.  The RasSynGAP
           family is composed of members: DAB2IP, nGAP, and SynGAP.
           Neuronal growth-associated proteins (nGAPs) are growth
           cone markers found in multiple types of neurons. There
           are many nGAPs including Cap1 (Adenylate
           cyclase-associated protein 1), Capzb (Capping protein
           (actin filament) muscle Z-line, beta), Clptm1 (Cleft lip
           and palate associated transmembrane protein 1), Cotl1
           (Coactosin-like 1), Crmp1 (Collapsin response mediator
           protein 1), Cyfip1 (Cytoplasmic FMR1 interacting protein
           1), Fabp7 (Fatty acid binding protein 7, brain), Farp2
           (FERM, RhoGEF and pleckstrin domain protein 2), Gap43
           (Growth associated protein 43), Gnao1 (Guanine
           nucleotide binding protein (G protein), alpha activating
           activity polypeptide O), Gnai2 (Guanine nucleotide
           binding protein (G protein), alpha inhibiting 2), Pacs1
           (Phosphofurin acidic cluster sorting protein 1), Rtn1
           (Reticulon 1), Sept2 (Septin 2), Snap25
           (Synaptosomal-associated protein 25), Strap
           (Serine/threonine kinase receptor associated protein),
           Stx7 (Syntaxin 7), and Tmod2 (Tropomodulin 2). SynGAP, a
           neuronal Ras-GAP, has been shown display both Ras-GAP
           activity and Ras-related protein (Rap)-GAP activity.
           Saccharomyces cerevisiae Bud2 and GAP1 members CAPRI
           (Ca2+-promoted Ras inactivator) and RASAL
           (Ras-GTPase-activating-like protein) also possess this
           dual activity. Human DOC-2/DAB2-interacting protein
           (DAB2IP) is encoded by a tumor suppressor gene and a
           newly recognized member of the Ras-GTPase-activating
           family. DAB2IP is a critical component of many signal
           transduction pathways mediated by Ras and tumor necrosis
           factors including apoptosis pathways, and it is involved
           in the formation of many types of tumors. DAB2IP
           participates in regulation of gene expression and
           pluripotency of cells. It has been reported that DAB2IP
           was expressed in different tumor tissues. Little
           information is available concerning the expression
           levels of DAB2IP in normal tissues and cells, however,
           and no studies of its expression patterns during the
           development of human embryos have been reported. DAB2IP
           was expressed primarily in cell cytoplasm throughout the
           fetal development. The expression levels varied among
           tissues and different gestational ages. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 124

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 4/61 (6%)

Query: 170 NSKSSPTRTETNLKRRRSRSKSRSESKSRS----EVPRQACARRSRSRSRILSPKRSGEA 225
             K S +RT++    R SR+ S  + K R        R+   R SRS   +LS   +   
Sbjct: 1   RLKPSISRTKSQPNVRLSRNSSFKQKKLRRRKKRTTAREGSLRSSRSHESLLSSSSAAMM 60

Query: 226 L 226
           L
Sbjct: 61  L 61


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 31.1 bits (70), Expect = 1.9
 Identities = 20/68 (29%), Positives = 24/68 (35%), Gaps = 10/68 (14%)

Query: 165 SHRSENSKSS-PTRTETNLKRR---------RSRSKSRSESKSRSEVPRQACARRSRSRS 214
           S  SE S+SS   R++    R             S S SE  SRS   R    R    RS
Sbjct: 68  SQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSGGRRAPSERS 127

Query: 215 RILSPKRS 222
                + S
Sbjct: 128 GPPPSEGS 135


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 32.1 bits (72), Expect = 2.3
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 337  SHCASNPTVSSGSESSSDSSSGEEDAEEDN 366
            S  +S+ +  S S SSSDSSS EED +E N
Sbjct: 1253 SSSSSSSSSCSSSSSSSDSSSSEEDGDEKN 1282


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 31.7 bits (72), Expect = 2.3
 Identities = 17/150 (11%), Positives = 49/150 (32%), Gaps = 11/150 (7%)

Query: 80  VDNLPSTNVSNQSNPITPQTSTGRTNQYNRWTICPAPMCQISPTQLHL-KHPLAGQTNIM 138
             NLP TN++          +   T               +      +  +     +   
Sbjct: 96  YKNLPQTNINQLLTKNKYDDNYSLTTLIQ----------NLFNLNSDISDYEQPRNSEKS 145

Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
           +   +++ +S+ + +  +  SK +   ++   S ++   + +N +    +    ++S S+
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQ 205

Query: 199 SEVPRQACARRSRSRSRILSPKRSGEALSN 228
                 A  + S   ++ +S       L  
Sbjct: 206 PASDDTANQKSSSKDNQSMSDSALDSILDQ 235


>gnl|CDD|223046 PHA03328, PHA03328, nuclear egress lamina protein UL31;
           Provisional.
          Length = 316

 Score = 31.2 bits (71), Expect = 2.6
 Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 165 SHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKR 221
           + R  +S    +R      RR  R  SR  S+ RS   R+  +RRS +R   L+   
Sbjct: 5   ALRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRS---RRRSSRRSSTRRAELADTE 58


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 31.2 bits (71), Expect = 2.8
 Identities = 17/97 (17%), Positives = 35/97 (36%), Gaps = 16/97 (16%)

Query: 143 HRDDESNQRRN------RSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESK 196
           + +    + R+      RS+SR  + G S R+ +   + +  +  +    +        K
Sbjct: 148 YIEGFRGRSRSSSRSSSRSNSRGPSRG-SSRNNSRNRNSSSPDDLVAAVLAALAKLGFGK 206

Query: 197 SRSEVPRQACARRSRSRSRILSPKRSGEALSNLDKWR 233
            +S          S+  SR+     +  A   L+K R
Sbjct: 207 QKS---------SSKKPSRVTKKSAAEAAKKQLNKPR 234


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 393 and 442 amino
           acids in length.
          Length = 301

 Score = 30.8 bits (70), Expect = 3.4
 Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 9/67 (13%)

Query: 157 SRSKNNGYSHRSENSKSSPTRT--ETNLKRRRS-------RSKSRSESKSRSEVPRQACA 207
           S S++   S   +  KSS      +T LK + S        ++S   SKS +  PR+  +
Sbjct: 57  SHSRSPSRSRLHKRKKSSRRSPMSDTLLKSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSS 116

Query: 208 RRSRSRS 214
              R R+
Sbjct: 117 SAHRRRN 123



 Score = 30.4 bits (69), Expect = 5.2
 Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 8/71 (11%)

Query: 145 DDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQ 204
           D E    R R  S       S R  +  S+P+     L RR SRS SR    SR+++   
Sbjct: 164 DFEREAAREREHSSR---RASRRGRSGYSTPSA---ALSRRGSRSASR--RGSRADLSMT 215

Query: 205 ACARRSRSRSR 215
               R      
Sbjct: 216 PLEARRADAED 226


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 8/55 (14%)

Query: 322 KKIKRTNRKKIKRKCSHCASNPTVSSGSESSSDSSSGEEDAEEDNWMMELQLKQN 376
           KK K+   KK  +K            GS+SS +SS  EE+ EED     +++ + 
Sbjct: 96  KKKKKKKAKKGNKK--------EEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142


>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
           and toxic compound extrusion (MATE)-like proteins.
           Escherichia coli MurJ (MviN) has been identified as
           essential for murein biosynthesis. It has been suggested
           that MurJ functions as the peptidoglycan lipid II
           flippase which is involved in translocation of
           lipid-anchored peptidoglycan precursors across the
           cytoplasmic membrane, though results obtained in
           Bacillus subtilis seem to indicate that its MurJ
           homologs are not essential for growth. Some MviN family
           members (e.g. in Mycobacterium tuberculosis) possess an
           extended C-terminal region that contains an
           intracellular pseudo-kinase domain and an extracellular
           domain resembling carbohydrate-binding proteins.
           Proteins from the MATE family are involved in exporting
           metabolites across the cell membrane and are often
           responsible for multidrug resistance (MDR).
          Length = 420

 Score = 30.1 bits (69), Expect = 6.6
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 566 FTSSFVPVISEKLAKSKDD 584
            +++FVPV SE L K  ++
Sbjct: 50  LSAAFVPVFSEYLEKEGEE 68


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 30.4 bits (69), Expect = 6.8
 Identities = 21/86 (24%), Positives = 35/86 (40%), Gaps = 15/86 (17%)

Query: 140 RSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSES---- 195
           +   R    N RR+R+  R   +    R E  ++     E N + RR   +  +E+    
Sbjct: 603 QQDRRKPRQNNRRDRNERRDTRDN-RTRREGRENR----EENRRNRRQAQQQTAETRESQ 657

Query: 196 ------KSRSEVPRQACARRSRSRSR 215
                 K+R++  +Q   RR R R R
Sbjct: 658 QAEVTEKARTQDEQQQAPRRERQRRR 683


>gnl|CDD|152169 pfam11733, NP1-WLL, Non-capsid protein NP1.  This family is the
           non-capsid protein NP1 of the ssDNA, Parvovirinae virus
           Bocavirus of cattle and humans.
          Length = 98

 Score = 27.8 bits (61), Expect = 8.8
 Identities = 24/96 (25%), Positives = 37/96 (38%)

Query: 159 SKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILS 218
           S N    HRS   K SP R E     + +  +SRS S  R    R + +       + LS
Sbjct: 3   SGNMKDKHRSYKRKGSPERGERKRHWQTTHHRSRSRSPIRHRGERGSGSYHQEHPIKHLS 62

Query: 219 PKRSGEALSNLDKWRMNNCSSEREMTEKFNTLAAME 254
              + +    + K R +    + + T  FN  +A  
Sbjct: 63  SCTASKTSDKVLKTRESTSGKKTKHTIPFNVFSAHR 98


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 30.0 bits (67), Expect = 9.4
 Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 6/126 (4%)

Query: 135 TNIMSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSE 194
           T      +HR++ S+   N  +S S + G +  S  + SSP + +T  K       + + 
Sbjct: 309 TGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTTK------HTNNT 362

Query: 195 SKSRSEVPRQACARRSRSRSRILSPKRSGEALSNLDKWRMNNCSSEREMTEKFNTLAAME 254
             ++    +         R      KRS     +  K R  +  SE         +A + 
Sbjct: 363 KNNKYNKTKTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEKSEYIKEFTMEEVAKLT 422

Query: 255 ADQIIE 260
            D  I+
Sbjct: 423 KDTTIK 428


>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
           finger [Chromatin structure and dynamics].
          Length = 271

 Score = 29.5 bits (66), Expect = 9.7
 Identities = 16/74 (21%), Positives = 25/74 (33%), Gaps = 4/74 (5%)

Query: 144 RDDESNQRRNRSSSRSKNNGYSHRSENSKSS----PTRTETNLKRRRSRSKSRSESKSRS 199
            D  S   RN  SS ++ +   HRS  S         +   N+   + RS   S  +  S
Sbjct: 123 HDAVSRLERNSYSSAARRSSGEHRSAASSQGSRHTKLKKRKNIHNLKRRSPELSSKREVS 182

Query: 200 EVPRQACARRSRSR 213
                     + +R
Sbjct: 183 FTLESPSVPDTATR 196


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.131    0.390 

Gapped
Lambda     K      H
   0.267   0.0777    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,939,620
Number of extensions: 4957323
Number of successful extensions: 4748
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4514
Number of HSP's successfully gapped: 96
Length of query: 1022
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 915
Effective length of database: 6,191,724
Effective search space: 5665427460
Effective search space used: 5665427460
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.3 bits)