RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14776
(1022 letters)
>gnl|CDD|238333 cd00593, RIBOc, RIBOc. Ribonuclease III C terminal domain. This
group consists of eukaryotic, bacterial and archeal
ribonuclease III (RNAse III) proteins. RNAse III is a
double stranded RNA-specific endonuclease. Prokaryotic
RNAse III is important in post-transcriptional control
of mRNA stability and translational efficiency. It is
involved in the processing of ribosomal RNA precursors.
Prokaryotic RNAse III also plays a role in the
maturation of tRNA precursors and in the processing of
phage and plasmid transcripts. Eukaryotic RNase III's
participate (through direct cleavage) in rRNA
processing, in processing of small nucleolar RNAs
(snoRNAs) and snRNA's (components of the spliceosome).
In eukaryotes RNase III or RNaseIII like enzymes such as
Dicer are involved in RNAi (RNA interference) and miRNA
(micro-RNA) gene silencing.
Length = 133
Score = 144 bits (366), Expect = 2e-40
Identities = 47/120 (39%), Positives = 69/120 (57%)
Query: 812 RKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLA 871
A E +NERLEFLGDAV+E + + +LF FPD+ EG L R+A+V N+ LA LA
Sbjct: 10 PSYANEHGRFNNERLEFLGDAVLELVVTEYLFKKFPDLSEGDLTRLRSALVSNETLARLA 69
Query: 872 KRIGLEDFMLYAHGSDLCHEVELKHAMANCFEALMGALFLDGGIDVADQVFSSVLYKDNE 931
+ +GL ++ G + +A+ FEAL+GA++LDGG + A + +L E
Sbjct: 70 RELGLGKYLRLGKGEEKSGGRLRPKILADVFEALIGAIYLDGGFEAARKFLLRLLGPLIE 129
Score = 56.9 bits (138), Expect = 1e-09
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 981 RLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
LL T S + +N+RLEFLGD VL+L+ ++YL+
Sbjct: 1 SLLLEALTHPSYANEHGRF-NNERLEFLGDAVLELVVTEYLF 41
>gnl|CDD|197778 smart00535, RIBOc, Ribonuclease III family.
Length = 129
Score = 143 bits (364), Expect = 4e-40
Identities = 44/109 (40%), Positives = 71/109 (65%)
Query: 818 SNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLE 877
+ HNERLEFLGDAV+E + + +L+ +PD+ EG L+ R+A+V N+ LA LAK++GL
Sbjct: 14 NEHEHNERLEFLGDAVLELVVTEYLYKKYPDLSEGDLSRLRSALVSNETLARLAKKLGLG 73
Query: 878 DFMLYAHGSDLCHEVELKHAMANCFEALMGALFLDGGIDVADQVFSSVL 926
+F+ G + + +A+ FEAL+GA++LD G++ A + +L
Sbjct: 74 EFIRLGRGEAISGGRDKPKILADVFEALIGAIYLDSGLEAAREFIRDLL 122
Score = 52.6 bits (127), Expect = 4e-08
Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 981 RLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
LL R T S + N+RLEFLGD VL+L+ ++YLY
Sbjct: 1 SLLLRALTHASYSNEH---EHNERLEFLGDAVLELVVTEYLY 39
Score = 29.5 bits (67), Expect = 3.9
Identities = 9/18 (50%), Positives = 9/18 (50%)
Query: 750 LLQLALTHPSYRENFGTN 767
LL ALTH SY N
Sbjct: 2 LLLRALTHASYSNEHEHN 19
>gnl|CDD|234633 PRK00102, rnc, ribonuclease III; Reviewed.
Length = 229
Score = 134 bits (341), Expect = 1e-35
Identities = 57/203 (28%), Positives = 86/203 (42%), Gaps = 67/203 (33%)
Query: 733 SLNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDRRIHF 792
L L+ ++ Y F++ LL ALTH SY
Sbjct: 4 DLEELQKKLGYTFKDPELLIQALTHRSYANENKGLKH----------------------- 40
Query: 793 QNTRKRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEG 852
NERLEFLGDAV+E + S +LF FPD++EG
Sbjct: 41 ------------------------------NERLEFLGDAVLELVVSEYLFKRFPDLDEG 70
Query: 853 GLATYRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKH-------AMANCFEAL 905
L+ RAA+V+ + LA +A+ +GL +++L G E K +A+ FEAL
Sbjct: 71 DLSKLRAALVREESLAEIARELGLGEYLLLGKG-------EEKSGGRRRPSILADAFEAL 123
Query: 906 MGALFLDGGIDVADQVFSSVLYK 928
+GA++LD G++ A + +
Sbjct: 124 IGAIYLDQGLEAARKFILRLFEP 146
Score = 65.9 bits (162), Expect = 7e-12
Identities = 26/59 (44%), Positives = 34/59 (57%)
Query: 964 KQLQYLEYSSGVIFNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
K L+ L+ G F LL + T RS N L N+RLEFLGD VL+L+ S+YL+
Sbjct: 3 KDLEELQKKLGYTFKDPELLIQALTHRSYANENKGLKHNERLEFLGDAVLELVVSEYLF 61
>gnl|CDD|223644 COG0571, Rnc, dsRNA-specific ribonuclease [Transcription].
Length = 235
Score = 125 bits (316), Expect = 3e-32
Identities = 54/215 (25%), Positives = 90/215 (41%), Gaps = 57/215 (26%)
Query: 729 RFHRSLNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDR 788
+ L LE ++ Y F+++ LL+ ALTH SY
Sbjct: 3 MMIKKLEALEKKLGYTFKDKELLEQALTHRSYANEHKAVE-------------------- 42
Query: 789 RIHFQNTRKRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPD 848
+NERLEFLGDAV+ + + +LF +P+
Sbjct: 43 ---------------------------------NNERLEFLGDAVLGLVVAEYLFKKYPN 69
Query: 849 IEEGGLATYRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKHAMANCFEALMGA 908
+ EG L+ RAA+V + LA +A+ +GL D++ G + + +A+ FEAL+GA
Sbjct: 70 LPEGELSKLRAALVSEESLAEIARELGLGDYLRLGKGEEKSGGRRRESILADAFEALIGA 129
Query: 909 LFLDGGIDVADQVFSSVLYKDNEVRFKQWMYYPQH 943
++LD G++ A + + R ++ Q
Sbjct: 130 IYLDSGLEAARKFILKLFLP----RLEEIDAGDQF 160
Score = 65.7 bits (161), Expect = 9e-12
Identities = 24/59 (40%), Positives = 35/59 (59%)
Query: 964 KQLQYLEYSSGVIFNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
K+L+ LE G F LL + T RS + + +N+RLEFLGD VL L+ ++YL+
Sbjct: 6 KKLEALEKKLGYTFKDKELLEQALTHRSYANEHKAVENNERLEFLGDAVLGLVVAEYLF 64
>gnl|CDD|233772 TIGR02191, RNaseIII, ribonuclease III, bacterial. This family
consists of bacterial examples of ribonuclease III. This
enzyme cleaves double-stranded rRNA. It is involved in
processing ribosomal RNA precursors. It is found even in
minimal genones such as Mycoplasma genitalium and
Buchnera aphidicola, and in some cases has been shown to
be an essential gene. These bacterial proteins contain a
double-stranded RNA binding motif (pfam00035) and a
ribonuclease III domain (pfam00636). Eukaryotic homologs
tend to be much longer proteins with additional domains,
localized to the nucleus, and not included in this
family [Transcription, RNA processing].
Length = 220
Score = 123 bits (310), Expect = 1e-31
Identities = 60/192 (31%), Positives = 88/192 (45%), Gaps = 52/192 (27%)
Query: 737 LEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDRRIHFQNTR 796
LE R+ YKF+N+ LL+ ALTH SY N H
Sbjct: 1 LEKRLGYKFKNKELLEQALTHSSYA-----NEHHKGVK---------------------- 33
Query: 797 KRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLAT 856
+NERLEFLGDAV+ + + +LF FPD+ EG L+
Sbjct: 34 -------------------------NNERLEFLGDAVLGLVVAEYLFKNFPDLSEGELSK 68
Query: 857 YRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKHAMANCFEALMGALFLDGGID 916
RAA+V + LA +A+ +GL F+L G + + +A+ FEAL+GA++LD G++
Sbjct: 69 LRAALVSEESLAEVARELGLGKFLLLGKGEEKSGGRRRESILADAFEALIGAIYLDSGLE 128
Query: 917 VADQVFSSVLYK 928
A + +L
Sbjct: 129 AARKFILKLLIP 140
Score = 53.4 bits (129), Expect = 1e-07
Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 8/62 (12%)
Query: 962 LLKQLQYLEYSSGVIFNHIRLLARVFTTRSMGY-TNLTLGSNQRLEFLGDTVLQLITSDY 1020
L K+L G F + LL + T S + + +N+RLEFLGD VL L+ ++Y
Sbjct: 1 LEKRL-------GYKFKNKELLEQALTHSSYANEHHKGVKNNERLEFLGDAVLGLVVAEY 53
Query: 1021 LY 1022
L+
Sbjct: 54 LF 55
>gnl|CDD|216036 pfam00636, Ribonuclease_3, Ribonuclease III domain.
Length = 91
Score = 110 bits (278), Expect = 5e-29
Identities = 42/91 (46%), Positives = 60/91 (65%)
Query: 824 ERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLEDFMLYA 883
ERLEFLGDAV+E S +L+ FPD+ EG L R+A+V N+ LA LA+++GL+ ++
Sbjct: 1 ERLEFLGDAVLELYVSEYLYEKFPDLSEGELHRLRSALVSNEALAKLARKLGLDKYLREG 60
Query: 884 HGSDLCHEVELKHAMANCFEALMGALFLDGG 914
G + +A+ FEAL+GAL+LDGG
Sbjct: 61 EGDIVKRASVKDKVLADAFEALIGALYLDGG 91
Score = 39.5 bits (93), Expect = 6e-04
Identities = 14/20 (70%), Positives = 17/20 (85%)
Query: 1003 QRLEFLGDTVLQLITSDYLY 1022
+RLEFLGD VL+L S+YLY
Sbjct: 1 ERLEFLGDAVLELYVSEYLY 20
>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
Length = 413
Score = 74.6 bits (183), Expect = 6e-14
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 14/108 (12%)
Query: 821 THNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLEDFM 880
THNERLEFLGD+V+ + LF F ++EG L+ RA +V+ Q L +A+ + + + +
Sbjct: 32 THNERLEFLGDSVLNCAVAALLFQRFGKLDEGDLSRVRANLVKQQSLYEIAQALNISEGL 91
Query: 881 LYAHGSDLCHEVELKHA-------MANCFEALMGALFLDGGIDVADQV 921
G EL+ +A+ FEA++GA+FLDGG + A V
Sbjct: 92 RLGEG-------ELRSGGFRRPSILADAFEAIIGAVFLDGGFEAAQGV 132
Score = 36.0 bits (83), Expect = 0.10
Identities = 18/48 (37%), Positives = 28/48 (58%)
Query: 733 SLNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGV 780
L+ LE R++Y+F+N LL+ ALTH S+ + +S+ NC V
Sbjct: 2 PLSQLESRLRYEFRNAELLRQALTHRSHSATHNERLEFLGDSVLNCAV 49
Score = 35.6 bits (82), Expect = 0.13
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 12/55 (21%)
Query: 959 QSPLLKQLQYLEYSSGVIFNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVL 1013
S L +L+Y F + LL + T RS T+ N+RLEFLGD+VL
Sbjct: 3 LSQLESRLRYE-------FRNAELLRQALTHRSHSATH-----NERLEFLGDSVL 45
>gnl|CDD|171444 PRK12371, PRK12371, ribonuclease III; Reviewed.
Length = 235
Score = 70.9 bits (174), Expect = 2e-13
Identities = 51/185 (27%), Positives = 75/185 (40%), Gaps = 61/185 (32%)
Query: 737 LEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGVRQPEYGDRRIHFQNTR 796
LE+R ++F N+ L+ ALTH S R + G+
Sbjct: 15 LEERTGHRFANKERLERALTHSS--------------------ARASKQGNY-------- 46
Query: 797 KRGINTLINIMSKLGRKVATESNITHNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLAT 856
ERLEFLGD V+ + LF FPD EG L+
Sbjct: 47 ---------------------------ERLEFLGDRVLGLCVAEMLFEAFPDASEGELSV 79
Query: 857 YRAAIVQNQHLAHLAKRIGLEDFMLYAHGSDLCHEVELKHAM---ANCFEALMGALFLDG 913
+V + A +A IGL D + GSD+ ++ K + A+ EAL+ A++LDG
Sbjct: 80 RLNQLVNAETCAAIADEIGLHDLI--RTGSDV-KKLTGKRLLNVRADVVEALIAAIYLDG 136
Query: 914 GIDVA 918
G++ A
Sbjct: 137 GLEAA 141
Score = 33.9 bits (78), Expect = 0.34
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 1000 GSNQRLEFLGDTVLQLITSDYLY 1022
G+ +RLEFLGD VL L ++ L+
Sbjct: 44 GNYERLEFLGDRVLGLCVAEMLF 66
>gnl|CDD|173181 PRK14718, PRK14718, ribonuclease III; Provisional.
Length = 467
Score = 73.3 bits (179), Expect = 2e-13
Identities = 39/101 (38%), Positives = 58/101 (57%)
Query: 821 THNERLEFLGDAVVEFITSIHLFHVFPDIEEGGLATYRAAIVQNQHLAHLAKRIGLEDFM 880
THNERLEFLGD+V+ + LF F ++EG L+ RA +V+ Q L +A+ + + D +
Sbjct: 32 THNERLEFLGDSVLNCAVAALLFQRFGKLDEGDLSRVRANLVKQQSLYEIAQALNISDGL 91
Query: 881 LYAHGSDLCHEVELKHAMANCFEALMGALFLDGGIDVADQV 921
G +A+ FEA++GA+FLDGG + A V
Sbjct: 92 RLGEGELRSGGFRRPSILADAFEAIIGAVFLDGGFEAAQGV 132
Score = 35.2 bits (80), Expect = 0.18
Identities = 18/47 (38%), Positives = 28/47 (59%)
Query: 734 LNVLEDRIQYKFQNRYLLQLALTHPSYRENFGTNPDHARNSLSNCGV 780
L+ LE R++Y+F+N LL+ ALTH S+ + +S+ NC V
Sbjct: 3 LSQLESRLRYEFRNAELLRQALTHRSHSATHNERLEFLGDSVLNCAV 49
Score = 30.9 bits (69), Expect = 3.5
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 977 FNHIRLLARVFTTRSMGYTNLTLGSNQRLEFLGDTVLQLITSDYLY 1022
F + LL + T RS T+ N+RLEFLGD+VL + L+
Sbjct: 14 FRNAELLRQALTHRSHSATH-----NERLEFLGDSVLNCAVAALLF 54
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 45.7 bits (108), Expect = 1e-04
Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 8/84 (9%)
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
SR R S +R R SR ++ R +S+S R+ RRSR + R S+S
Sbjct: 32 SRFRDRHRRSRERSYREDSRPRDR----RRYDSRS--PRSLRYSSVRRSRDRPRRRSRSV 85
Query: 199 SEVPRQACARRSRSRSRILSPKRS 222
+ + RR R RS ++
Sbjct: 86 RSIEQH--RRRLRDRSPSNQWRKD 107
Score = 42.6 bits (100), Expect = 0.001
Identities = 28/95 (29%), Positives = 35/95 (36%), Gaps = 1/95 (1%)
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
SR RD S + R RS RS+ RS + +R SRS S
Sbjct: 12 SRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSV 71
Query: 199 SEVPRQACARRSRSRSRILSPKRSGEALSNLDKWR 233
R RRSRS I +R S ++WR
Sbjct: 72 RR-SRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWR 105
Score = 36.8 bits (85), Expect = 0.068
Identities = 23/103 (22%), Positives = 32/103 (31%), Gaps = 1/103 (0%)
Query: 144 RDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPR 203
D E + R R RS + S+ + + R S+ R + S PR
Sbjct: 5 PDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPR 64
Query: 204 QACARRSRSRSRILSPKRSGEALSNLDKWRMNNCSSEREMTEK 246
+ S RSR +RS S R S K
Sbjct: 65 -SLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRK 106
Score = 33.3 bits (76), Expect = 0.81
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 4/53 (7%)
Query: 139 SRSMHRDDESNQRRNRSS----SRSKNNGYSHRSENSKSSPTRTETNLKRRRS 187
SRS S+ RR+R SRS + HR SP+ ++RS
Sbjct: 60 SRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRS 112
Score = 32.2 bits (73), Expect = 1.6
Identities = 16/65 (24%), Positives = 23/65 (35%), Gaps = 4/65 (6%)
Query: 140 RSMHRDDESNQRRNRSSSRSKNNGYSH----RSENSKSSPTRTETNLKRRRSRSKSRSES 195
RS D RR S ++ YS R + S + RRR R +S S
Sbjct: 44 RSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQ 103
Query: 196 KSRSE 200
+ +
Sbjct: 104 WRKDD 108
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 39.9 bits (93), Expect = 0.001
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 138 MSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENS---KSSPTRTETNLKRRRSRSKSRSE 194
M RS R ++RR RS SR + RS + + +R+ + + RRSRS R
Sbjct: 1 MGRSRSRSPRRSRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHR 60
Query: 195 SKSRSEVPRQACARRSRSRSRILSPKR 221
S+SRS R+ R +R PK+
Sbjct: 61 SRSRSPSRRRDRKRERDKDAR--EPKK 85
Score = 36.4 bits (84), Expect = 0.029
Identities = 27/82 (32%), Positives = 35/82 (42%), Gaps = 18/82 (21%)
Query: 142 MHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSES-KSRSE 200
M R + RR+R RS++ R R RSRS+ R +SRS
Sbjct: 1 MGRSRSRSPRRSRRRGRSRSRDRRERR---------------RERSRSRERDRRRRSRSR 45
Query: 201 VPRQACARRSRSRSRILSPKRS 222
P + +RRSRS R S RS
Sbjct: 46 SPHR--SRRSRSPRRHRSRSRS 65
Score = 34.1 bits (78), Expect = 0.16
Identities = 24/78 (30%), Positives = 33/78 (42%), Gaps = 2/78 (2%)
Query: 139 SRSMHRD-DESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKS 197
SR R + R R SRS+ RS + +R + +R RSRS+S S +
Sbjct: 12 SRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRD 71
Query: 198 RSEVPRQACARRSRSRSR 215
R R AR + R R
Sbjct: 72 RKR-ERDKDAREPKKRER 88
>gnl|CDD|219061 pfam06495, Transformer, Fruit fly transformer protein. This family
consists of transformer proteins from several Drosophila
species and also from Ceratitis capitata (Mediterranean
fruit fly). The transformer locus (tra) produces an RNA
processing protein that alternatively splices the
doublesex pre-mRNA in the sex determination hierarchy of
Drosophila melanogaster.
Length = 182
Score = 38.9 bits (90), Expect = 0.006
Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Query: 150 QRRNRSSSRSKNNGYSHRSEN-SKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACAR 208
++R S+R + R+ + S+S + +R RSRS+SR+ S SR + R
Sbjct: 40 RQRKTQSTRPTTSHRGRRTRSRSRSQSAERNSCQRRHRSRSRSRNRSDSRHRST-SSTER 98
Query: 209 RSRSRSR 215
R RSRSR
Sbjct: 99 RRRSRSR 105
Score = 38.5 bits (89), Expect = 0.008
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 172 KSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRS 222
K+ TR T+ + RR+RS+SRS+S R+ R+ RSRSRSR S R
Sbjct: 43 KTQSTRPTTSHRGRRTRSRSRSQSAERNSCQRR---HRSRSRSRNRSDSRH 90
Score = 35.8 bits (82), Expect = 0.055
Identities = 26/54 (48%), Positives = 34/54 (62%), Gaps = 8/54 (14%)
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSR 192
SRS + S QRR+RS SRS+N RS++ S + TE +RRRSRS+SR
Sbjct: 62 SRSQSAERNSCQRRHRSRSRSRN-----RSDSRHRSTSSTE---RRRRSRSRSR 107
Score = 29.2 bits (65), Expect = 7.5
Identities = 18/54 (33%), Positives = 21/54 (38%), Gaps = 1/54 (1%)
Query: 166 HRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSP 219
R S T R RSRS+S +E S R R+RS SR S
Sbjct: 41 QRKTQSTRPTTSHRGRRTRSRSRSQS-AERNSCQRRHRSRSRSRNRSDSRHRST 93
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 39.4 bits (92), Expect = 0.012
Identities = 37/223 (16%), Positives = 59/223 (26%), Gaps = 38/223 (17%)
Query: 8 SSHSSRGSCSFSSPMSEDVKPNIQQMALNEPSSSALQFNTVDVKQPPPGFSQSCN--HKI 65
S +S + E+ A + P + PP S
Sbjct: 164 SDAASSRQAALPLSSPEETAR-----APSSPPA------EPPPSTPPAAASPRPPRRSSP 212
Query: 66 LVSKSFD--CKTGTTQVDNLPSTNVSNQSNPITPQTSTGRTNQYNRWTICPAPMCQISPT 123
+ + + G + D+ +++ + S+ + CP P
Sbjct: 213 ISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENE-------CPLP------- 258
Query: 124 QLHLKHPLAGQTNIMSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSS----PTRTE 179
P + + R +SS S S S P+
Sbjct: 259 -----RPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPR 313
Query: 180 TNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRS 222
+ SR S S + S SE R A S SR SP R
Sbjct: 314 ASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP 356
Score = 34.8 bits (80), Expect = 0.27
Identities = 17/77 (22%), Positives = 25/77 (32%)
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
S S ES+ SSS S + SP+ + S + R
Sbjct: 315 SSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRA 374
Query: 199 SEVPRQACARRSRSRSR 215
P + R +R R+R
Sbjct: 375 PSSPAASAGRPTRRRAR 391
Score = 31.7 bits (72), Expect = 2.4
Identities = 17/95 (17%), Positives = 23/95 (24%), Gaps = 9/95 (9%)
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
S S + S SRS + SSP + + S + S R
Sbjct: 330 SSSSESSRGAAVSPGPSPSRSPSPS-RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRR 388
Query: 199 --------SEVPRQACARRSRSRSRILSPKRSGEA 225
R A R R R +
Sbjct: 389 RARAAVAGRARRRDATGRFPAGRPRPSPLDAGAAS 423
Score = 30.5 bits (69), Expect = 5.6
Identities = 19/75 (25%), Positives = 27/75 (36%), Gaps = 5/75 (6%)
Query: 148 SNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACA 207
S+ +R SS S + S S + SP + + RS S SR + PR+
Sbjct: 316 SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPS-----RSPSPSRPPPPADPSSPRKRPR 370
Query: 208 RRSRSRSRILSPKRS 222
S S R
Sbjct: 371 PSRAPSSPAASAGRP 385
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic region
found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 35.1 bits (81), Expect = 0.025
Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 2/74 (2%)
Query: 144 RDDESNQRRNRSSSRSKNNG-YSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVP 202
++E ++ R + R + G S R + S R + +RRR R + R+ + R +
Sbjct: 17 EEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRD 76
Query: 203 RQACAR-RSRSRSR 215
R R RSRSRSR
Sbjct: 77 RDRYDRSRSRSRSR 90
Score = 29.4 bits (66), Expect = 2.3
Identities = 16/62 (25%), Positives = 22/62 (35%)
Query: 138 MSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKS 197
RS R RR + S + + + R R RSRS+SRS +
Sbjct: 36 GRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRDRDRYDRSRSRSRSRSRDRR 95
Query: 198 RS 199
R
Sbjct: 96 RR 97
>gnl|CDD|237620 PRK14136, recX, recombination regulator RecX; Provisional.
Length = 309
Score = 35.0 bits (80), Expect = 0.16
Identities = 26/97 (26%), Positives = 39/97 (40%), Gaps = 5/97 (5%)
Query: 137 IMSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESK 196
+S RD + R + + + + R+E + TRT + +R R +
Sbjct: 81 RVSGVGVRDAGAPGGRAADARAANLSSAAKRAEAAGDVYTRTSQHPRRTRRAAGPFHSDS 140
Query: 197 SRSEVPRQACARRSRSRSRILSPKRS--GEALSNLDK 231
S S A RSR+ SR P RS G AL L +
Sbjct: 141 SPSASSEDDGAARSRASSR---PARSLKGRALGYLSR 174
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 33.6 bits (77), Expect = 0.16
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 5/77 (6%)
Query: 286 ECTQKTRTICDEFESVIQNFVSVKESKPQFDYTANRKKIKRT----NRKKIKRKCSHCAS 341
EC QKT + + + Q KE + ++K R + KKI K +S
Sbjct: 47 EC-QKTSQLIHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105
Query: 342 NPTVSSGSESSSDSSSG 358
+ + SS S SSS S S
Sbjct: 106 SSSSSSSSSSSSSSESS 122
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 34.5 bits (79), Expect = 0.27
Identities = 21/72 (29%), Positives = 26/72 (36%), Gaps = 2/72 (2%)
Query: 144 RDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPR 203
RD E + RN + K S R S+ R R R +SRS S +R R
Sbjct: 3 RDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNR--YYR 60
Query: 204 QACARRSRSRSR 215
R R R
Sbjct: 61 PRGDRSYRRDDR 72
Score = 31.4 bits (71), Expect = 2.4
Identities = 21/77 (27%), Positives = 29/77 (37%), Gaps = 7/77 (9%)
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
R R+D + R SR + S + S+ R +R RSRS+S + R
Sbjct: 6 ERGRLRNDTRRSDKGRERSRRR----SRSRDRSRRRRDRDYYRGRRGRSRSRSPN-RYYR 60
Query: 199 SEVPRQACARRSRSRSR 215
R RR RS
Sbjct: 61 PRGDRS--YRRDDRRSG 75
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 34.2 bits (78), Expect = 0.32
Identities = 27/111 (24%), Positives = 44/111 (39%), Gaps = 12/111 (10%)
Query: 121 SPTQLHLKHPLAGQTNIMSRSMHRDDESNQRRNRSSSRSKNNGYSHRS----ENSKSSPT 176
+P+ P ++ + NR+ + + S S ++ P
Sbjct: 259 TPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRAGTPDDGSAISEDSAGGSHHTMRRPP 318
Query: 177 RTETNLKR--RRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRSGEA 225
+ + +R RR+RS+SRS S+SRS R RR R R P R E+
Sbjct: 319 HSTSGERRGRRRNRSESRSRSRSRSGSRRY---RRRRGRG---VPGRRSES 363
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 33.0 bits (75), Expect = 0.42
Identities = 25/109 (22%), Positives = 41/109 (37%), Gaps = 7/109 (6%)
Query: 145 DDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEV--- 201
DDE+++ S + + N SS + KR ++ S S + S +
Sbjct: 2 DDETHKYAQPHSQ-QNTDELNTVIHNDSSSSKSKKKKKKRSKATSPSHNASDQSTNNVMS 60
Query: 202 -PRQACARRSRSRSRILSPKRSGEALSNLDKWRMNNCSS--EREMTEKF 247
P AR S + A+ N + R+ N S+ ERE +F
Sbjct: 61 TPSAILARPQPLSYPFGSQQSQQNAVKNSKEKRIWNTSTQEERENIREF 109
>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966). Family of
plant proteins with unknown function.
Length = 308
Score = 33.5 bits (77), Expect = 0.51
Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 12/113 (10%)
Query: 123 TQLHLKHPLAGQTNIMSRSMHRDDES----NQRRNRSSSRSKNNGYSHRSENSKSSPTRT 178
+++ G S +++ N+R N+S S S Y + + + T
Sbjct: 86 SEILDSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEY-KVYKAEEPADAST 144
Query: 179 ETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRSGEALSNLDK 231
+T+ +R R S++ S SR E+ + + S S S E L +L K
Sbjct: 145 QTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSS-------SPETLESLIK 190
>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal
domain.
Length = 379
Score = 32.5 bits (74), Expect = 1.1
Identities = 35/227 (15%), Positives = 56/227 (24%), Gaps = 36/227 (15%)
Query: 18 FSSPMSEDVKPNIQQMALNEPSSSALQFNTVDVKQPPPGFSQSCNHKILVSKSFDCKTGT 77
SP S + + +Q + Q S+S N + + +
Sbjct: 174 KGSPYSWEQEYLLQHGGQQHSHLQRHG-DEPFGAQSSGILSRSENRRTRNLANNTSRKSD 232
Query: 78 TQVDNLPSTNVSNQSNPITPQTSTGRTNQYNRWTICPAPMCQISPTQLHLKHPLAGQTNI 137
T P Q + + Q Q L H G++
Sbjct: 233 TSRSVGPVRQSQIQRSRLGLQ-----------------------ANQGKLAHGQQGRSGS 269
Query: 138 MSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETN-----LKR-------R 185
+ H SSS N S S S R +R
Sbjct: 270 IRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHSSSGHAV 329
Query: 186 RSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKRSGEALSNLDKW 232
RS S++ P +C S+ S ++ L+ W
Sbjct: 330 ELRSIPGGSVSSQNAGPLLSCWWLQFRNSKPCSDYCLHHIVNLLEDW 376
>gnl|CDD|109943 pfam00906, Hepatitis_core, Hepatitis core antigen. The core
antigen of hepatitis viruses possesses a carboxyl
terminus rich in arginine. On this basis it was
predicted that the core antigen would bind DNA. There is
some experimental evidence to support this.
Length = 182
Score = 31.7 bits (72), Expect = 1.3
Identities = 17/49 (34%), Positives = 22/49 (44%)
Query: 166 HRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRS 214
+R N+ T ET + RRR RS R R + RRS+S S
Sbjct: 131 YRPPNAPILSTLPETIVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSPS 179
>gnl|CDD|241416 cd13262, PH_RasSynGAP-like, Synaptic Ras-GTPase activating protein
family Pleckstrin homology (PH) domain. The RasSynGAP
family is composed of members: DAB2IP, nGAP, and SynGAP.
Neuronal growth-associated proteins (nGAPs) are growth
cone markers found in multiple types of neurons. There
are many nGAPs including Cap1 (Adenylate
cyclase-associated protein 1), Capzb (Capping protein
(actin filament) muscle Z-line, beta), Clptm1 (Cleft lip
and palate associated transmembrane protein 1), Cotl1
(Coactosin-like 1), Crmp1 (Collapsin response mediator
protein 1), Cyfip1 (Cytoplasmic FMR1 interacting protein
1), Fabp7 (Fatty acid binding protein 7, brain), Farp2
(FERM, RhoGEF and pleckstrin domain protein 2), Gap43
(Growth associated protein 43), Gnao1 (Guanine
nucleotide binding protein (G protein), alpha activating
activity polypeptide O), Gnai2 (Guanine nucleotide
binding protein (G protein), alpha inhibiting 2), Pacs1
(Phosphofurin acidic cluster sorting protein 1), Rtn1
(Reticulon 1), Sept2 (Septin 2), Snap25
(Synaptosomal-associated protein 25), Strap
(Serine/threonine kinase receptor associated protein),
Stx7 (Syntaxin 7), and Tmod2 (Tropomodulin 2). SynGAP, a
neuronal Ras-GAP, has been shown display both Ras-GAP
activity and Ras-related protein (Rap)-GAP activity.
Saccharomyces cerevisiae Bud2 and GAP1 members CAPRI
(Ca2+-promoted Ras inactivator) and RASAL
(Ras-GTPase-activating-like protein) also possess this
dual activity. Human DOC-2/DAB2-interacting protein
(DAB2IP) is encoded by a tumor suppressor gene and a
newly recognized member of the Ras-GTPase-activating
family. DAB2IP is a critical component of many signal
transduction pathways mediated by Ras and tumor necrosis
factors including apoptosis pathways, and it is involved
in the formation of many types of tumors. DAB2IP
participates in regulation of gene expression and
pluripotency of cells. It has been reported that DAB2IP
was expressed in different tumor tissues. Little
information is available concerning the expression
levels of DAB2IP in normal tissues and cells, however,
and no studies of its expression patterns during the
development of human embryos have been reported. DAB2IP
was expressed primarily in cell cytoplasm throughout the
fetal development. The expression levels varied among
tissues and different gestational ages. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 124
Score = 30.4 bits (69), Expect = 1.7
Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 4/61 (6%)
Query: 170 NSKSSPTRTETNLKRRRSRSKSRSESKSRS----EVPRQACARRSRSRSRILSPKRSGEA 225
K S +RT++ R SR+ S + K R R+ R SRS +LS +
Sbjct: 1 RLKPSISRTKSQPNVRLSRNSSFKQKKLRRRKKRTTAREGSLRSSRSHESLLSSSSAAMM 60
Query: 226 L 226
L
Sbjct: 61 L 61
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 31.1 bits (70), Expect = 1.9
Identities = 20/68 (29%), Positives = 24/68 (35%), Gaps = 10/68 (14%)
Query: 165 SHRSENSKSS-PTRTETNLKRR---------RSRSKSRSESKSRSEVPRQACARRSRSRS 214
S SE S+SS R++ R S S SE SRS R R RS
Sbjct: 68 SQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSGGRRAPSERS 127
Query: 215 RILSPKRS 222
+ S
Sbjct: 128 GPPPSEGS 135
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 32.1 bits (72), Expect = 2.3
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 337 SHCASNPTVSSGSESSSDSSSGEEDAEEDN 366
S +S+ + S S SSSDSSS EED +E N
Sbjct: 1253 SSSSSSSSSCSSSSSSSDSSSSEEDGDEKN 1282
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 31.7 bits (72), Expect = 2.3
Identities = 17/150 (11%), Positives = 49/150 (32%), Gaps = 11/150 (7%)
Query: 80 VDNLPSTNVSNQSNPITPQTSTGRTNQYNRWTICPAPMCQISPTQLHL-KHPLAGQTNIM 138
NLP TN++ + T + + + +
Sbjct: 96 YKNLPQTNINQLLTKNKYDDNYSLTTLIQ----------NLFNLNSDISDYEQPRNSEKS 145
Query: 139 SRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSR 198
+ +++ +S+ + + + SK + ++ S ++ + +N + + ++S S+
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQ 205
Query: 199 SEVPRQACARRSRSRSRILSPKRSGEALSN 228
A + S ++ +S L
Sbjct: 206 PASDDTANQKSSSKDNQSMSDSALDSILDQ 235
>gnl|CDD|223046 PHA03328, PHA03328, nuclear egress lamina protein UL31;
Provisional.
Length = 316
Score = 31.2 bits (71), Expect = 2.6
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 3/57 (5%)
Query: 165 SHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILSPKR 221
+ R +S +R RR R SR S+ RS R+ +RRS +R L+
Sbjct: 5 ALRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRS---RRRSSRRSSTRRAELADTE 58
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 31.2 bits (71), Expect = 2.8
Identities = 17/97 (17%), Positives = 35/97 (36%), Gaps = 16/97 (16%)
Query: 143 HRDDESNQRRN------RSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESK 196
+ + + R+ RS+SR + G S R+ + + + + + + K
Sbjct: 148 YIEGFRGRSRSSSRSSSRSNSRGPSRG-SSRNNSRNRNSSSPDDLVAAVLAALAKLGFGK 206
Query: 197 SRSEVPRQACARRSRSRSRILSPKRSGEALSNLDKWR 233
+S S+ SR+ + A L+K R
Sbjct: 207 QKS---------SSKKPSRVTKKSAAEAAKKQLNKPR 234
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 30.8 bits (70), Expect = 3.4
Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 9/67 (13%)
Query: 157 SRSKNNGYSHRSENSKSSPTRT--ETNLKRRRS-------RSKSRSESKSRSEVPRQACA 207
S S++ S + KSS +T LK + S ++S SKS + PR+ +
Sbjct: 57 SHSRSPSRSRLHKRKKSSRRSPMSDTLLKSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSS 116
Query: 208 RRSRSRS 214
R R+
Sbjct: 117 SAHRRRN 123
Score = 30.4 bits (69), Expect = 5.2
Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 8/71 (11%)
Query: 145 DDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQ 204
D E R R S S R + S+P+ L RR SRS SR SR+++
Sbjct: 164 DFEREAAREREHSSR---RASRRGRSGYSTPSA---ALSRRGSRSASR--RGSRADLSMT 215
Query: 205 ACARRSRSRSR 215
R
Sbjct: 216 PLEARRADAED 226
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 29.6 bits (67), Expect = 3.7
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 8/55 (14%)
Query: 322 KKIKRTNRKKIKRKCSHCASNPTVSSGSESSSDSSSGEEDAEEDNWMMELQLKQN 376
KK K+ KK +K GS+SS +SS EE+ EED +++ +
Sbjct: 96 KKKKKKKAKKGNKK--------EEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
and toxic compound extrusion (MATE)-like proteins.
Escherichia coli MurJ (MviN) has been identified as
essential for murein biosynthesis. It has been suggested
that MurJ functions as the peptidoglycan lipid II
flippase which is involved in translocation of
lipid-anchored peptidoglycan precursors across the
cytoplasmic membrane, though results obtained in
Bacillus subtilis seem to indicate that its MurJ
homologs are not essential for growth. Some MviN family
members (e.g. in Mycobacterium tuberculosis) possess an
extended C-terminal region that contains an
intracellular pseudo-kinase domain and an extracellular
domain resembling carbohydrate-binding proteins.
Proteins from the MATE family are involved in exporting
metabolites across the cell membrane and are often
responsible for multidrug resistance (MDR).
Length = 420
Score = 30.1 bits (69), Expect = 6.6
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 566 FTSSFVPVISEKLAKSKDD 584
+++FVPV SE L K ++
Sbjct: 50 LSAAFVPVFSEYLEKEGEE 68
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 30.4 bits (69), Expect = 6.8
Identities = 21/86 (24%), Positives = 35/86 (40%), Gaps = 15/86 (17%)
Query: 140 RSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSES---- 195
+ R N RR+R+ R + R E ++ E N + RR + +E+
Sbjct: 603 QQDRRKPRQNNRRDRNERRDTRDN-RTRREGRENR----EENRRNRRQAQQQTAETRESQ 657
Query: 196 ------KSRSEVPRQACARRSRSRSR 215
K+R++ +Q RR R R R
Sbjct: 658 QAEVTEKARTQDEQQQAPRRERQRRR 683
>gnl|CDD|152169 pfam11733, NP1-WLL, Non-capsid protein NP1. This family is the
non-capsid protein NP1 of the ssDNA, Parvovirinae virus
Bocavirus of cattle and humans.
Length = 98
Score = 27.8 bits (61), Expect = 8.8
Identities = 24/96 (25%), Positives = 37/96 (38%)
Query: 159 SKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSESKSRSEVPRQACARRSRSRSRILS 218
S N HRS K SP R E + + +SRS S R R + + + LS
Sbjct: 3 SGNMKDKHRSYKRKGSPERGERKRHWQTTHHRSRSRSPIRHRGERGSGSYHQEHPIKHLS 62
Query: 219 PKRSGEALSNLDKWRMNNCSSEREMTEKFNTLAAME 254
+ + + K R + + + T FN +A
Sbjct: 63 SCTASKTSDKVLKTRESTSGKKTKHTIPFNVFSAHR 98
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 30.0 bits (67), Expect = 9.4
Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 6/126 (4%)
Query: 135 TNIMSRSMHRDDESNQRRNRSSSRSKNNGYSHRSENSKSSPTRTETNLKRRRSRSKSRSE 194
T +HR++ S+ N +S S + G + S + SSP + +T K + +
Sbjct: 309 TGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTTK------HTNNT 362
Query: 195 SKSRSEVPRQACARRSRSRSRILSPKRSGEALSNLDKWRMNNCSSEREMTEKFNTLAAME 254
++ + R KRS + K R + SE +A +
Sbjct: 363 KNNKYNKTKTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEKSEYIKEFTMEEVAKLT 422
Query: 255 ADQIIE 260
D I+
Sbjct: 423 KDTTIK 428
>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
finger [Chromatin structure and dynamics].
Length = 271
Score = 29.5 bits (66), Expect = 9.7
Identities = 16/74 (21%), Positives = 25/74 (33%), Gaps = 4/74 (5%)
Query: 144 RDDESNQRRNRSSSRSKNNGYSHRSENSKSS----PTRTETNLKRRRSRSKSRSESKSRS 199
D S RN SS ++ + HRS S + N+ + RS S + S
Sbjct: 123 HDAVSRLERNSYSSAARRSSGEHRSAASSQGSRHTKLKKRKNIHNLKRRSPELSSKREVS 182
Query: 200 EVPRQACARRSRSR 213
+ +R
Sbjct: 183 FTLESPSVPDTATR 196
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.131 0.390
Gapped
Lambda K H
0.267 0.0777 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,939,620
Number of extensions: 4957323
Number of successful extensions: 4748
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4514
Number of HSP's successfully gapped: 96
Length of query: 1022
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 915
Effective length of database: 6,191,724
Effective search space: 5665427460
Effective search space used: 5665427460
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.3 bits)