RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2838
(458 letters)
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 43.8 bits (103), Expect = 2e-04
Identities = 30/122 (24%), Positives = 43/122 (35%), Gaps = 16/122 (13%)
Query: 9 PSPAPSTTSRRDSFPPPHPLGQNGFDPTGQVKSEYNYYQLSSDEKFEILKSISASFTPGY 68
P P P R P +P+G V + + + + + P
Sbjct: 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPG--RVAVPRFRVPQPAPSR 2986
Query: 69 GGFLPNTPPLGHHYLSPSGLPGKAEYLEIHFCFVYTWVKTEPN-----QTNHPPSETEDS 123
+TPPL H L S + A L +H +T+P QT PP +TEDS
Sbjct: 2987 EAPASSTPPLTGHSL--SRVSSWASSLALH-------EETDPPPVSLKQTLWPPDDTEDS 3037
Query: 124 DD 125
D
Sbjct: 3038 DA 3039
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 36.9 bits (85), Expect = 0.022
Identities = 17/36 (47%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
G GG D G+ ++ + ++ EEEEE E EEEEEE
Sbjct: 854 GGGGSDG--GDSEEEEEEEEEEEEEEEEEEEEEEEE 887
Score = 33.4 bits (76), Expect = 0.31
Identities = 15/25 (60%), Positives = 16/25 (64%)
Query: 430 GDDIRGDGDQGEEEEESEMEEEEEE 454
G D G D G+ EEE E EEEEEE
Sbjct: 851 GVDGGGGSDGGDSEEEEEEEEEEEE 875
Score = 31.1 bits (70), Expect = 1.6
Identities = 15/36 (41%), Positives = 21/36 (58%)
Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
G+ GGD ++ + ++ EEEEE E EEE EE
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 30.0 bits (67), Expect = 3.1
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEEAGFE 458
+GG + ++ + ++ EEEEE E EE EE E
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 29.6 bits (66), Expect = 4.5
Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 9/93 (9%)
Query: 370 EAHPGCEYKKRGRGRPKLEVPGGGR----MVGNVVGGSGVGGGQRMVENQRMLGNGGGDQ 425
EA E + +G + E+P + G + G+ + + G +
Sbjct: 666 EAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGE---TEAEEVEHEGETE 722
Query: 426 VRGNGDDIRGDGDQGEEEEESEMEEEEEEAGFE 458
G D+ G+ + GEE EE E E E E G
Sbjct: 723 AEGTEDE--GEIETGEEGEEVEDEGEGEAEGKH 753
>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies). This
family contains two copies of a C2H2-like zinc finger
domain.
Length = 100
Score = 33.8 bits (78), Expect = 0.034
Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 8/85 (9%)
Query: 211 CDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKS-VHEGIKYKCEE 269
C FC+ T + HM +H F + L+++++ +HEG +C
Sbjct: 2 CLFCNHTSDTVEENLEHMFKSHG---FFIPE-REYLVDLEGLLNYLREKIHEG--NECLY 55
Query: 270 CTKTFAQKQNLLSH-VQAIHMGIKY 293
C K F + L H H I Y
Sbjct: 56 CGKQFKSLEALRQHMRDKGHCKIPY 80
Score = 26.8 bits (60), Expect = 9.3
Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 13/81 (16%)
Query: 294 TCEVCSKSYNQVTNLLSHMTTVHQ----AQKYKCADCGRLFAQEHTLRTHTLSVHSGVRL 349
C C+ + + V L HM H ++Y D L + LR +H G
Sbjct: 1 DCLFCNHTSDTVEENLEHMFKSHGFFIPEREY-LVDLEGLL---NYLR---EKIHEG--N 51
Query: 350 TCRTCGLFCDSKRALTAHVRE 370
C CG S AL H+R+
Sbjct: 52 ECLYCGKQFKSLEALRQHMRD 72
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 35.8 bits (82), Expect = 0.047
Identities = 20/159 (12%), Positives = 43/159 (27%), Gaps = 15/159 (9%)
Query: 197 TYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHI 256
T NS+H + D + + N + + + + S P+ S
Sbjct: 161 TPQSNSLHPPLPANSLSKDPSSNLSLLI--SSNVSTSIPSSSENSPLSSSYSIPSSSSDQ 218
Query: 257 KSVHEGIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYN--------QVTNL 308
+ ++ + S + S+S Q ++
Sbjct: 219 NLENSSSSLPLTTNSQLSPKSLLSQSPSSLSS---SDSSSSASESPRSSLPTASSQSSSP 275
Query: 309 LSHMTTVHQAQ--KYKCADCGRLFAQEHTLRTHTLSVHS 345
++ + K C F++ L H SV+
Sbjct: 276 NESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNH 314
Score = 30.0 bits (67), Expect = 2.8
Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 5/76 (6%)
Query: 186 CGKSFTLKQNLTYHVNSIHLGVRI---KCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHC 242
C ++F NL+ H HL R K C K+F NLI H N C
Sbjct: 394 CIRNFKRDSNLSLH-IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLC-SI 451
Query: 243 VKSFNQKPNLISHIKS 258
+KSF + +L +H K
Sbjct: 452 LKSFRRDLDLSNHGKD 467
Score = 29.7 bits (66), Expect = 4.2
Identities = 23/107 (21%), Positives = 36/107 (33%), Gaps = 2/107 (1%)
Query: 154 SQFEKTFQNYSMTTTGHRGGKGPPVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDF 213
S T + C C SF+ ++LT H+ S +C +
Sbjct: 7 QSSSSNNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSY 66
Query: 214 --CDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKS 258
CDK+F+ L H+ + HN + N K + S S
Sbjct: 67 SGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLSSS 113
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 30.0 bits (68), Expect = 0.13
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 210 KCDFCDKTFAHKPNLIAHMNS 230
KC C K+F+ K NL H+ +
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRT 21
Score = 28.1 bits (63), Expect = 0.67
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 238 KCDHCVKSFNQKPNLISHIKS 258
KC C KSF++K NL H+++
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRT 21
Score = 27.3 bits (61), Expect = 1.3
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 183 CDQCGKSFTLKQNLTYHVNS 202
C CGKSF+ K NL H+ +
Sbjct: 2 CPDCGKSFSRKSNLKRHLRT 21
Score = 26.9 bits (60), Expect = 1.9
Identities = 10/23 (43%), Positives = 14/23 (60%), Gaps = 1/23 (4%)
Query: 266 KCEECTKTFAQKQNLLSHVQAIH 288
KC +C K+F++K NL H H
Sbjct: 1 KCPDCGKSFSRKSNLKRH-LRTH 22
Score = 25.0 bits (55), Expect = 7.1
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 294 TCEVCSKSYNQVTNLLSHMTT 314
C C KS+++ +NL H+ T
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRT 21
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 29.9 bits (67), Expect = 0.15
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 237 FKCDHCVKSFNQKPNLISHIKSVH 260
FKC C KSF+ K L H++ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 29.1 bits (65), Expect = 0.31
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 210 KCDFCDKTFAHKPNLIAHMNSAH 232
KC C K+F+ K L H+ H
Sbjct: 2 KCPLCGKSFSSKDALKRHLRKHH 24
Score = 28.4 bits (63), Expect = 0.55
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 181 FPCDQCGKSFTLKQNLTYHVNSIH 204
F C CGKSF+ K L H+ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 28.4 bits (63), Expect = 0.63
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 265 YKCEECTKTFAQKQNLLSHVQAIH 288
+KC C K+F+ K L H++ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 27.6 bits (61), Expect = 0.89
Identities = 7/24 (29%), Positives = 12/24 (50%)
Query: 293 YTCEVCSKSYNQVTNLLSHMTTVH 316
+ C +C KS++ L H+ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 25.7 bits (56), Expect = 4.5
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 321 YKCADCGRLFAQEHTLRTHTLSVH 344
+KC CG+ F+ + L+ H H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 31.4 bits (72), Expect = 0.15
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 441 EEEEESEMEEEEEEAGF 457
E++EE E EEE+++ GF
Sbjct: 69 EKKEEEEEEEEDDDMGF 85
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 28.6 bits (64), Expect = 0.40
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 265 YKCEECTKTFAQKQNLLSHVQ 285
Y+C EC K F K L H++
Sbjct: 1 YRCPECGKVFKSKSALREHMR 21
Score = 28.2 bits (63), Expect = 0.68
Identities = 7/22 (31%), Positives = 12/22 (54%)
Query: 237 FKCDHCVKSFNQKPNLISHIKS 258
++C C K F K L H+++
Sbjct: 1 YRCPECGKVFKSKSALREHMRT 22
Score = 27.0 bits (60), Expect = 1.4
Identities = 8/19 (42%), Positives = 9/19 (47%)
Query: 210 KCDFCDKTFAHKPNLIAHM 228
+C C K F K L HM
Sbjct: 2 RCPECGKVFKSKSALREHM 20
Score = 27.0 bits (60), Expect = 1.6
Identities = 8/22 (36%), Positives = 10/22 (45%)
Query: 293 YTCEVCSKSYNQVTNLLSHMTT 314
Y C C K + + L HM T
Sbjct: 1 YRCPECGKVFKSKSALREHMRT 22
Score = 26.7 bits (59), Expect = 2.3
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 181 FPCDQCGKSFTLKQNLTYHV 200
+ C +CGK F K L H+
Sbjct: 1 YRCPECGKVFKSKSALREHM 20
>gnl|CDD|132031 TIGR02986, restrict_Alw26I, type II restriction endonuclease,
Alw26I/Eco31I/Esp3I family. Members of this family are
type II restriction endonucleases of the
Alw26I/Eco31I/Esp3I family. Characterized specificities
of three members are GGTCTC, CGTCTC, and the shared
subsequence GTCTC [DNA metabolism,
Restriction/modification].
Length = 424
Score = 32.5 bits (74), Expect = 0.46
Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 6/51 (11%)
Query: 177 PVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAH 227
P PC CGK+ +L Y + + R+K F D+TF K LI
Sbjct: 79 PTGVKPCQTCGKTM----SLGYSYPNHNFAKRLKKTFPDETF--KDQLIED 123
>gnl|CDD|128906 smart00659, RPOLCX, RNA polymerase subunit CX. present in RNA
polymerase I, II and III.
Length = 44
Score = 28.9 bits (65), Expect = 0.50
Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 11/36 (30%)
Query: 320 KYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCG 355
Y C +CGR + VR CR CG
Sbjct: 2 IYICGECGREN---------EIKSKDVVR--CRECG 26
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 30.3 bits (69), Expect = 0.61
Identities = 10/20 (50%), Positives = 12/20 (60%)
Query: 439 QGEEEEESEMEEEEEEAGFE 458
+ EEEEE E +EE EE
Sbjct: 80 KKEEEEEEEEKEESEEEAAA 99
Score = 29.5 bits (67), Expect = 1.0
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 438 DQGEEEEESEMEEEEEEAG 456
+ EEEEE E EE EEEA
Sbjct: 80 KKEEEEEEEEKEESEEEAA 98
Score = 29.2 bits (66), Expect = 1.4
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 435 GDGDQGEEEEESEMEEEEEE 454
EE++ E EEEEE+
Sbjct: 71 AAAAAAAEEKKEEEEEEEEK 90
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 28.1 bits (63), Expect = 0.69
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 251 NLISHIKSVHEGIKYKCEECTKTFAQ 276
NL H+++ YKC C K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 27.4 bits (61), Expect = 1.1
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 307 NLLSHMTTVHQAQKYKCADCGRLFAQ 332
NL HM T + YKC CG+ F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|140324 PTZ00303, PTZ00303, phosphatidylinositol kinase; Provisional.
Length = 1374
Score = 32.4 bits (73), Expect = 0.70
Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 13/57 (22%)
Query: 323 CADCGRLFAQEHTLRTHTLSVHSGVRLT-CRTCGL----FCDSKRALTAHVREAHPG 374
C CGR F +LS G R CR+CG+ FC +KRA + + A PG
Sbjct: 463 CPSCGRAFI--------SLSRPLGTRAHHCRSCGIRLCVFCITKRAHYSFAKLAKPG 511
>gnl|CDD|202452 pfam02892, zf-BED, BED zinc finger.
Length = 43
Score = 28.4 bits (64), Expect = 0.78
Identities = 6/28 (21%), Positives = 11/28 (39%), Gaps = 3/28 (10%)
Query: 208 RIKCDFCDKTFAHKP---NLIAHMNSAH 232
+ +C +C K + NL H+
Sbjct: 15 KARCKYCGKILSGGGGTSNLKRHLVRKC 42
>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger.
Length = 27
Score = 28.0 bits (63), Expect = 0.83
Identities = 8/21 (38%), Positives = 9/21 (42%)
Query: 210 KCDFCDKTFAHKPNLIAHMNS 230
C C KTF+ L H S
Sbjct: 3 TCGVCGKTFSSLQALGGHKKS 23
Score = 26.0 bits (58), Expect = 4.3
Identities = 7/26 (26%), Positives = 13/26 (50%)
Query: 293 YTCEVCSKSYNQVTNLLSHMTTVHQA 318
+TC VC K+++ + L H +
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSHCSL 27
>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type. This is a zinc-finger
domain with the CxxCx(12)Hx(6)H motif, found in multiple
copies in a wide range of proteins from plants to
metazoans. Some member proteins, particularly those from
plants, are annotated as being RNA-binding.
Length = 25
Score = 27.5 bits (62), Expect = 0.94
Identities = 6/20 (30%), Positives = 11/20 (55%)
Query: 211 CDFCDKTFAHKPNLIAHMNS 230
C+ C+ TF + L +H+
Sbjct: 3 CELCNVTFTSESQLKSHLRG 22
Score = 25.2 bits (56), Expect = 8.2
Identities = 6/20 (30%), Positives = 13/20 (65%)
Query: 293 YTCEVCSKSYNQVTNLLSHM 312
+ CE+C+ ++ + L SH+
Sbjct: 1 FYCELCNVTFTSESQLKSHL 20
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 31.8 bits (73), Expect = 0.98
Identities = 18/77 (23%), Positives = 27/77 (35%), Gaps = 3/77 (3%)
Query: 380 RGRGRPKLEVPGGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGGDQ---VRGNGDDIRGD 436
+G+G+P + G G G GG + E Q+ L + G Q + G G +
Sbjct: 631 QGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEA 690
Query: 437 GDQGEEEEESEMEEEEE 453
M EE
Sbjct: 691 ARDALGRAGRAMGGAEE 707
Score = 30.3 bits (69), Expect = 2.8
Identities = 9/49 (18%), Positives = 15/49 (30%)
Query: 407 GGQRMVENQRMLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEEA 455
G Q+ + Q G G Q G + + E ++A
Sbjct: 620 GEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQA 668
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 29.6 bits (67), Expect = 1.0
Identities = 12/16 (75%), Positives = 12/16 (75%)
Query: 441 EEEEESEMEEEEEEAG 456
EEEEE E EE EEEA
Sbjct: 82 EEEEEEEEEESEEEAM 97
>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 129
Score = 29.7 bits (67), Expect = 1.2
Identities = 14/47 (29%), Positives = 18/47 (38%), Gaps = 6/47 (12%)
Query: 286 AIHMGIKYTCEVC--SKSYNQVTNLLSHMTTVHQAQKYKCADCGRLF 330
A + ++ T C KS N V Q+YKC CG F
Sbjct: 21 AYAIRMQITKVNCPRCKSSNVVKIGGI----RRGHQRYKCKSCGSTF 63
Score = 28.5 bits (64), Expect = 3.2
Identities = 15/82 (18%), Positives = 24/82 (29%), Gaps = 13/82 (15%)
Query: 195 NLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLIS 254
+ Y + ++ C C + K I H R+KC C +F +
Sbjct: 19 DAAYAIRMQI--TKVNCPRCKSSNVVKIGGI---RRGH--QRYKCKSCGSTFTVE----- 66
Query: 255 HIKSVHEGIKYKCEECTKTFAQ 276
S YK + T
Sbjct: 67 -TGSPLSKALYKIKLQAVTLYM 87
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 30.0 bits (68), Expect = 1.3
Identities = 7/22 (31%), Positives = 13/22 (59%)
Query: 431 DDIRGDGDQGEEEEESEMEEEE 452
+++ D D +EE E E + E+
Sbjct: 8 EELEDDIDDLDEEAEEEEDGED 29
Score = 28.8 bits (65), Expect = 2.8
Identities = 7/28 (25%), Positives = 16/28 (57%)
Query: 431 DDIRGDGDQGEEEEESEMEEEEEEAGFE 458
D+ + + ++ ++ + E EEEE G +
Sbjct: 2 DEPDEEEELEDDIDDLDEEAEEEEDGED 29
>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 402
Score = 30.9 bits (70), Expect = 1.5
Identities = 13/73 (17%), Positives = 24/73 (32%), Gaps = 11/73 (15%)
Query: 235 VRFKCDHC----VKSFNQKPNLISHIK------SVHEGI-KYKCEECTKTFAQKQNLLSH 283
+ +C C ++ + I + + +YKC C K FA+K + +
Sbjct: 37 RKHRCPECGQRTIRRHGWRIRKIQDLPLFEVPVYLFLRKRRYKCCRCGKRFAEKVDFVDR 96
Query: 284 VQAIHMGIKYTCE 296
Q I
Sbjct: 97 AQRITRRWAQAAA 109
>gnl|CDD|235148 PRK03655, PRK03655, putative ion channel protein; Provisional.
Length = 414
Score = 30.9 bits (70), Expect = 1.7
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 64 FTPGYGGFLPNTPPLGHHYLSPSGLPG 90
F+PG+ G P T PL + PS LPG
Sbjct: 76 FSPGHAGPDPATEPLIGAPVPPSALPG 102
>gnl|CDD|235987 PRK07314, PRK07314, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 411
Score = 30.1 bits (69), Expect = 2.3
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 399 VVGGSGVGGGQRMVENQRMLGNGGGDQV 426
V+ GSG+GG + + E L G +V
Sbjct: 101 VIIGSGIGGLETIEEQHITLLEKGPRRV 128
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 29.1 bits (65), Expect = 2.5
Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 2/52 (3%)
Query: 237 FKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHVQAIH 288
+ C C+ F+ +L HI+ C C K F + L HV H
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRYTEHSKV--CPVCGKEFRNTDSTLDHVCKKH 123
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.0 bits (68), Expect = 2.6
Identities = 13/26 (50%), Positives = 15/26 (57%)
Query: 431 DDIRGDGDQGEEEEESEMEEEEEEAG 456
D+ D+ EEEEE E EE E E G
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEG 373
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 29.8 bits (67), Expect = 2.9
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 425 QVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
+VR + D + ++GEE+EE E EEE EE
Sbjct: 72 EVRISEDGEEEEVEEGEEDEEEEGEEESEE 101
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.4 bits (68), Expect = 2.9
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 424 DQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
D + DD D D +EE++ E EEEEEE
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
>gnl|CDD|224907 COG1996, RPC10, DNA-directed RNA polymerase, subunit RPC10
(contains C4-type Zn-finger) [Transcription].
Length = 49
Score = 26.9 bits (60), Expect = 2.9
Identities = 12/36 (33%), Positives = 14/36 (38%), Gaps = 10/36 (27%)
Query: 320 KYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCG 355
+YKCA CGR G+R C CG
Sbjct: 6 EYKCARCGREVEL--------DQETRGIR--CPYCG 31
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 28.7 bits (64), Expect = 3.3
Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 3/41 (7%)
Query: 417 MLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEE---EEEE 454
M+GN D G + EEE+ E++E E EE
Sbjct: 8 MVGNQLKDVKGMLGGKEDEGDESDAEEEDEEIQEALREAEE 48
>gnl|CDD|225566 COG3022, COG3022, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 253
Score = 29.6 bits (67), Expect = 3.3
Identities = 8/28 (28%), Positives = 15/28 (53%)
Query: 116 PPSETEDSDDPLGESMVVEPEYTPVTQI 143
P++T D + PL + + +PE +Q
Sbjct: 6 SPAKTLDEESPLSTTELTQPELLDYSQE 33
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 29.6 bits (67), Expect = 3.5
Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 8/97 (8%)
Query: 362 RALTAHVREAHPG-CEYKKRGRGRPKLEVPGGGRMVGNVVGGSGVGGG---QRMVENQRM 417
R+L + RE +P + K RG+ G + G G+ G ++ E +R
Sbjct: 25 RSLISLYREVNPELLKKKDRGKPASMGLEAGEKKGYGEEEVVDGIPGLELLEKWKEEER- 83
Query: 418 LGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
+Q G D D ++ E EE E ++E E
Sbjct: 84 -KKKEAEQ--GLESDDDDDEEEEWEVEEDEDSDDEGE 117
>gnl|CDD|214746 smart00614, ZnF_BED, BED zinc finger. DNA-binding domain in
chromatin-boundary-element-binding proteins and
transposases.
Length = 50
Score = 26.6 bits (59), Expect = 3.6
Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 5/34 (14%)
Query: 204 HLGVRIKCDFCDKTFAHKP-----NLIAHMNSAH 232
+ R KC +C K + NL H+ H
Sbjct: 14 NGKQRAKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 28.4 bits (64), Expect = 3.7
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 423 GDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
G + + +D D ++ EEEE+ E +++E+E
Sbjct: 103 GQHLVASEEDESDDDEEDEEEEDDEEDDDEDE 134
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 29.4 bits (66), Expect = 4.2
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 422 GGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
G D + + D+ GD DQ + E+SE EE E
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREESE 255
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 29.3 bits (66), Expect = 4.2
Identities = 13/47 (27%), Positives = 18/47 (38%), Gaps = 9/47 (19%)
Query: 408 GQRMVENQRMLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
GQ + L G DD + E E+SE E ++EE
Sbjct: 196 GQAL--GLSALRTPGE-----PDDDATVETS--SESEDSESESDDEE 233
>gnl|CDD|177288 PHA00616, PHA00616, hypothetical protein.
Length = 44
Score = 26.3 bits (58), Expect = 4.4
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 321 YKCADCGRLFAQEHTLRTHTLSVHSGVRLT 350
Y+C CG +F ++ + H LSVH +LT
Sbjct: 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLT 31
Score = 26.3 bits (58), Expect = 4.5
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 265 YKCEECTKTFAQKQNLLSHVQAIHMGIKYTCE 296
Y+C C F +K+ ++ H+ ++H K T E
Sbjct: 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33
>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor and similar
proteins. Ig2_FGFR_like: domain similar to the second
immunoglobulin (Ig)-like domain of fibroblast growth
factor (FGF) receptor. FGF receptors bind FGF signaling
polypeptides. FGFs participate in multiple processes
such as morphogenesis, development, and angiogenesis.
FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
-2, -3, -4). Receptor diversity is controlled by
alternative splicing producing splice variants with
different ligand binding characteristics and different
expression patterns. FGFRs have an extracellular region
comprised of three Ig-like domains, a single
transmembrane helix, and an intracellular tyrosine
kinase domain. Ligand binding and specificity reside in
the Ig-like domains 2 and 3, and the linker region that
connects these two. FGFR activation and signaling depend
on FGF-induced dimerization, a process involving cell
surface heparin or heparin sulfate proteoglycans. This
group also contains fibroblast growth factor (FGF)
receptor_like-1(FGFRL1). FGFRL1 does not have a protein
tyrosine kinase domain at its C terminus; neither does
its cytoplasmic domain appear to interact with a
signaling partner. It has been suggested that FGFRL1 may
not have any direct signaling function, but instead acts
as a decoy receptor trapping FGFs and preventing them
from binding other receptors.
Length = 85
Score = 27.3 bits (61), Expect = 4.5
Identities = 23/72 (31%), Positives = 28/72 (38%), Gaps = 6/72 (8%)
Query: 227 HMNSAHNGVRFKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNL-LSHVQ 285
H A + VR K C S N +P I+ +K K K +K L L V
Sbjct: 4 HAVPAGSTVRLK---CPASGNPRPT-ITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVV 59
Query: 286 AIHMGIKYTCEV 297
G KYTC V
Sbjct: 60 PSDSG-KYTCIV 70
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 27.2 bits (61), Expect = 4.6
Identities = 8/28 (28%), Positives = 18/28 (64%), Gaps = 5/28 (17%)
Query: 430 GDDIRGDGDQGEEEEESEMEEEEEEAGF 457
G+D+ + E+EE+ E +++++ GF
Sbjct: 54 GEDL-----ESEDEEDEEEDDDDDMDGF 76
>gnl|CDD|220556 pfam10082, DUF2320, Uncharacterized protein conserved in bacteria
(DUF2320). This domain, found in various hypothetical
bacterial proteins, has no known function.
Length = 380
Score = 29.2 bits (66), Expect = 4.9
Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 9/60 (15%)
Query: 119 ETEDSDDPLGESMVVEPEYTPVTQIGGTAGLE--FNRSQ-------FEKTFQNYSMTTTG 169
+ ED P ++ +G +AG+E F R + T+Q+ +T
Sbjct: 106 DHEDRGSPDAVGGGLQKYPVKYHTLGASAGVERDFGRLRVSTGGGVDRTTYQDARLTDGT 165
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 27.6 bits (61), Expect = 5.1
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 423 GDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
G Q + + + D+ +EEEE E EEEE E
Sbjct: 43 GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGE 74
>gnl|CDD|227561 COG5236, COG5236, Uncharacterized conserved protein, contains RING
Zn-finger [General function prediction only].
Length = 493
Score = 29.2 bits (65), Expect = 5.1
Identities = 36/163 (22%), Positives = 50/163 (30%), Gaps = 36/163 (22%)
Query: 180 KFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFC---DKTFAH------KPNLIAHMNS 230
K P +C + + L H + H + C C K F + L H N
Sbjct: 153 KCPKSKCHRRCGSLKELKKHYKAQH--GFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNG 210
Query: 231 AHNGVRFK----CDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKT-------FAQKQN 279
FK C C F L H + HE C C F ++
Sbjct: 211 GLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYED 266
Query: 280 LLSHVQAIHMGIKYTCEVCSKSYNQV------TNLLSHMTTVH 316
L +H + H Y C + + T LL H+T H
Sbjct: 267 LEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHLTRFH 305
>gnl|CDD|221105 pfam11397, GlcNAc, Glycosyltransferase (GlcNAc). GlcNAc is an
enzyme that carries out the first glycosylation step of
hydroxylated Skp1, a ubiquitous eukaryotic protein, in
the cytoplasm.
Length = 321
Score = 28.8 bits (65), Expect = 5.2
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 1/32 (3%)
Query: 55 EILKSISASFTPGYGGFLPNTPPLGHHYLSPS 86
EIL S A +T GY + P+ L H Y P
Sbjct: 217 EILYSARA-WTHGYDLYHPHRLLLFHLYTRPK 247
>gnl|CDD|222729 pfam14391, DUF4421, Domain of unknown function (DUF4421). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 336 and 370 amino
acids in length.
Length = 306
Score = 28.8 bits (65), Expect = 5.3
Identities = 14/66 (21%), Positives = 21/66 (31%), Gaps = 18/66 (27%)
Query: 39 VKSEYNYYQLSSD-------------EKFEILKSISASFTPGYGGFLPNTPPLGHH-YLS 84
Y Y++L D KF+ +K S + GYG + L+
Sbjct: 167 PGISYYYHKLEFDSSLLPSLIQKTLSLKFQKVKYFDISISAGYGY----NWVFAKNWLLT 222
Query: 85 PSGLPG 90
S P
Sbjct: 223 GSLSPA 228
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 27.6 bits (61), Expect = 5.4
Identities = 10/39 (25%), Positives = 16/39 (41%)
Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEEAGF 457
G + +EE++ E EEEE++ GF
Sbjct: 70 GVAAAAAPAAGAATAGAKAEAKKEEKKEEEEEEEDDLGF 108
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 28.8 bits (64), Expect = 5.5
Identities = 29/152 (19%), Positives = 50/152 (32%), Gaps = 18/152 (11%)
Query: 315 VHQAQKYKCADCGRL-----------FAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRA 363
V + + +KC C RL +H R L C CG +
Sbjct: 166 VLENENFKCMSCNRLGQYSCLRCKVCCCNDHV-RRKGFKYDKDKNLPCPKCGYPTTETKD 224
Query: 364 LTAHVREAHPGCEYKKRGRGRPKLEVPGGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGG 423
L+ R ++ ++ RG + G GR N S G + V + G
Sbjct: 225 LSMSTRSY----KFGRQTRGFRSDDDQGYGRYKQN--LSSNKYGDFKGVNYGYESDDDEG 278
Query: 424 DQVRGNGDDIRGDGDQGEEEEESEMEEEEEEA 455
++ GD D E+ +++ +E+
Sbjct: 279 SSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDC 310
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 28.6 bits (64), Expect = 5.5
Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 2/49 (4%)
Query: 406 GGGQRMVENQRMLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
+ + E ML + + D+ D EEEEE E E+E+ +
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDE--EDEKDEEEEEEEEEEDEDFD 190
Score = 28.2 bits (63), Expect = 7.7
Identities = 6/31 (19%), Positives = 18/31 (58%), Gaps = 2/31 (6%)
Query: 424 DQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
D + D + ++ EEEE+ + ++++++
Sbjct: 167 DVDEEDEKD--EEEEEEEEEEDEDFDDDDDD 195
>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
This domain family is found in archaea and eukaryotes,
and is approximately 30 amino acids in length. The
mammalian members of this group occur multiple times
along the protein, joined by flexible linkers, and are
referred to as JAZ - dsRNA-binding ZF protein -
zinc-fingers. The JAZ proteins are expressed in all
tissues tested and localise in the nucleus, particularly
the nucleolus. JAZ preferentially binds to
double-stranded (ds) RNA or RNA/DNA hybrids rather than
DNA. In addition to binding double-stranded RNA, these
zinc-fingers are required for nucleolar localisation.
Length = 27
Score = 25.6 bits (57), Expect = 5.8
Identities = 8/20 (40%), Positives = 10/20 (50%)
Query: 211 CDFCDKTFAHKPNLIAHMNS 230
C CDK F + L H+ S
Sbjct: 4 CVACDKYFKSENALENHLKS 23
>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
Length = 629
Score = 29.0 bits (65), Expect = 5.8
Identities = 13/45 (28%), Positives = 17/45 (37%)
Query: 391 GGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGGDQVRGNGDDIRG 435
G R G GG GG+ +R G G G + G + R
Sbjct: 569 GERRGGGRGFGGERREGGRNFSGERREGGRGDGRRFSGERREGRA 613
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 28.9 bits (64), Expect = 5.9
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 437 GDQGEEEEESEMEEEEEEAGFE 458
D+ +E+EE E EEEEEEA E
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVE 153
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 28.9 bits (65), Expect = 6.1
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 431 DDIRGDGDQGEEEEESEMEEEEEEAG 456
+D + D EEE+E E EEE+E+ G
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEG 407
>gnl|CDD|217969 pfam04216, FdhE, Protein involved in formate dehydrogenase
formation. The function of these proteins is unknown.
They may possibly be involved in the formation of
formate dehydrogenase.
Length = 283
Score = 28.5 bits (64), Expect = 6.3
Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 7/46 (15%)
Query: 207 VRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFK-CDHC---VKSFNQ 248
VR+KC C T K + GVR + CD C +K +Q
Sbjct: 204 VRVKCTNCGST---KGLAYWSLEEGEPGVRAETCDTCHSYLKILDQ 246
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.8 bits (65), Expect = 6.7
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 431 DDIRGDGDQGEEEEESEMEEEEEE 454
D D + + ++E + EEEEEE
Sbjct: 352 SDEEEDEEDEDSDDEDDEEEEEEE 375
>gnl|CDD|222704 pfam14353, CpXC, CpXC protein. This presumed domain is
functionally uncharacterized. This domain is found in
bacteria and archaea, and is typically between 122 and
134 amino acids in length. It contains four conserved
cysteines forming two CpXC motifs.
Length = 128
Score = 27.3 bits (61), Expect = 9.2
Identities = 8/20 (40%), Positives = 8/20 (40%)
Query: 180 KFPCDQCGKSFTLKQNLTYH 199
F C CG F L YH
Sbjct: 38 TFTCPSCGHEFRLDYPFLYH 57
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 28.4 bits (63), Expect = 9.4
Identities = 13/89 (14%), Positives = 19/89 (21%), Gaps = 10/89 (11%)
Query: 370 EAHPGCEYKKRGRGRPKLEVPGGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGGDQVRGN 429
E + P+ + PG + E Q
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPS----- 224
Query: 430 GDDIRGDGDQGEEEEESEMEEEEEEAGFE 458
+ Q E E+ E E E F
Sbjct: 225 -----PNTQQAVEHEDEPTEPEREGPPFP 248
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 27.3 bits (61), Expect = 10.0
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 435 GDGDQGEEEEESEMEEEEEEAGFE 458
D ++ E E++E E++E GF
Sbjct: 83 NDDEEDAETEDTEDVEDDEWEGFP 106
>gnl|CDD|188824 cd09440, LIM1_SF3, The first Lim domain of pollen specific protein
SF3. The first Lim domain of pollen specific protein
SF3: SF3 is a Lim protein that is found exclusively in
mature plant pollen grains. It contains two LIM domains.
The exact function of SF3 is unknown. It may be a
transcription factor required for the expression of late
pollen genes. It is possible that SF3 protein is
involved in controlling pollen-specific processes such
as male gamete maturation, pollen tube formation, or
even fertilization. As in other LIM domains, this domain
family is 50-60 amino acids in size and shares two
characteristic zinc finger motifs. The two zinc fingers
contain eight conserved residues, mostly cysteines and
histidines, which coordinately bond to two zinc atoms.
LIM domains function as adaptors or scaffolds to support
the assembly of multimeric protein.
Length = 63
Score = 25.9 bits (57), Expect = 10.0
Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 210 KCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHC 242
KC CDKT + + ++ ++ F+C HC
Sbjct: 4 KCKACDKT-VYLVDQLSADGVVYHKSCFRCSHC 35
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.134 0.417
Gapped
Lambda K H
0.267 0.0707 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,752,952
Number of extensions: 2129715
Number of successful extensions: 3559
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3406
Number of HSP's successfully gapped: 120
Length of query: 458
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 358
Effective length of database: 6,502,202
Effective search space: 2327788316
Effective search space used: 2327788316
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.3 bits)