RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2838
         (458 letters)



>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 30/122 (24%), Positives = 43/122 (35%), Gaps = 16/122 (13%)

Query: 9    PSPAPSTTSRRDSFPPPHPLGQNGFDPTGQVKSEYNYYQLSSDEKFEILKSISASFTPGY 68
            P P P    R      P        +P+G V   +    +    +  + +       P  
Sbjct: 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPG--RVAVPRFRVPQPAPSR 2986

Query: 69   GGFLPNTPPLGHHYLSPSGLPGKAEYLEIHFCFVYTWVKTEPN-----QTNHPPSETEDS 123
                 +TPPL  H L  S +   A  L +H        +T+P      QT  PP +TEDS
Sbjct: 2987 EAPASSTPPLTGHSL--SRVSSWASSLALH-------EETDPPPVSLKQTLWPPDDTEDS 3037

Query: 124  DD 125
            D 
Sbjct: 3038 DA 3039


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 36.9 bits (85), Expect = 0.022
 Identities = 17/36 (47%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           G GG D   G+ ++   + ++ EEEEE E EEEEEE
Sbjct: 854 GGGGSDG--GDSEEEEEEEEEEEEEEEEEEEEEEEE 887



 Score = 33.4 bits (76), Expect = 0.31
 Identities = 15/25 (60%), Positives = 16/25 (64%)

Query: 430 GDDIRGDGDQGEEEEESEMEEEEEE 454
           G D  G  D G+ EEE E EEEEEE
Sbjct: 851 GVDGGGGSDGGDSEEEEEEEEEEEE 875



 Score = 31.1 bits (70), Expect = 1.6
 Identities = 15/36 (41%), Positives = 21/36 (58%)

Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           G+ GGD      ++   + ++ EEEEE E EEE EE
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 30.0 bits (67), Expect = 3.1
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEEAGFE 458
            +GG  +     ++   + ++ EEEEE E EE EE    E
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 29.6 bits (66), Expect = 4.5
 Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 9/93 (9%)

Query: 370 EAHPGCEYKKRGRGRPKLEVPGGGR----MVGNVVGGSGVGGGQRMVENQRMLGNGGGDQ 425
           EA    E + +G    + E+P   +      G +        G+        + + G  +
Sbjct: 666 EAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGE---TEAEEVEHEGETE 722

Query: 426 VRGNGDDIRGDGDQGEEEEESEMEEEEEEAGFE 458
             G  D+  G+ + GEE EE E E E E  G  
Sbjct: 723 AEGTEDE--GEIETGEEGEEVEDEGEGEAEGKH 753


>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies).  This
           family contains two copies of a C2H2-like zinc finger
           domain.
          Length = 100

 Score = 33.8 bits (78), Expect = 0.034
 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 8/85 (9%)

Query: 211 CDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKS-VHEGIKYKCEE 269
           C FC+ T       + HM  +H    F      +       L+++++  +HEG   +C  
Sbjct: 2   CLFCNHTSDTVEENLEHMFKSHG---FFIPE-REYLVDLEGLLNYLREKIHEG--NECLY 55

Query: 270 CTKTFAQKQNLLSH-VQAIHMGIKY 293
           C K F   + L  H     H  I Y
Sbjct: 56  CGKQFKSLEALRQHMRDKGHCKIPY 80



 Score = 26.8 bits (60), Expect = 9.3
 Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 13/81 (16%)

Query: 294 TCEVCSKSYNQVTNLLSHMTTVHQ----AQKYKCADCGRLFAQEHTLRTHTLSVHSGVRL 349
            C  C+ + + V   L HM   H      ++Y   D   L    + LR     +H G   
Sbjct: 1   DCLFCNHTSDTVEENLEHMFKSHGFFIPEREY-LVDLEGLL---NYLR---EKIHEG--N 51

Query: 350 TCRTCGLFCDSKRALTAHVRE 370
            C  CG    S  AL  H+R+
Sbjct: 52  ECLYCGKQFKSLEALRQHMRD 72


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 35.8 bits (82), Expect = 0.047
 Identities = 20/159 (12%), Positives = 43/159 (27%), Gaps = 15/159 (9%)

Query: 197 TYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHI 256
           T   NS+H  +       D +      +    N + +      +  + S    P+  S  
Sbjct: 161 TPQSNSLHPPLPANSLSKDPSSNLSLLI--SSNVSTSIPSSSENSPLSSSYSIPSSSSDQ 218

Query: 257 KSVHEGIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYN--------QVTNL 308
              +          ++   +     S           +    S+S          Q ++ 
Sbjct: 219 NLENSSSSLPLTTNSQLSPKSLLSQSPSSLSS---SDSSSSASESPRSSLPTASSQSSSP 275

Query: 309 LSHMTTVHQAQ--KYKCADCGRLFAQEHTLRTHTLSVHS 345
               ++  +      K   C   F++   L  H  SV+ 
Sbjct: 276 NESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNH 314



 Score = 30.0 bits (67), Expect = 2.8
 Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 186 CGKSFTLKQNLTYHVNSIHLGVRI---KCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHC 242
           C ++F    NL+ H    HL  R    K   C K+F    NLI H     N     C   
Sbjct: 394 CIRNFKRDSNLSLH-IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLC-SI 451

Query: 243 VKSFNQKPNLISHIKS 258
           +KSF +  +L +H K 
Sbjct: 452 LKSFRRDLDLSNHGKD 467



 Score = 29.7 bits (66), Expect = 4.2
 Identities = 23/107 (21%), Positives = 36/107 (33%), Gaps = 2/107 (1%)

Query: 154 SQFEKTFQNYSMTTTGHRGGKGPPVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDF 213
                     S T            +   C  C  SF+  ++LT H+ S       +C +
Sbjct: 7   QSSSSNNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSY 66

Query: 214 --CDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKS 258
             CDK+F+    L  H+ + HN         +   N K +  S   S
Sbjct: 67  SGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLSSS 113


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 30.0 bits (68), Expect = 0.13
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 210 KCDFCDKTFAHKPNLIAHMNS 230
           KC  C K+F+ K NL  H+ +
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRT 21



 Score = 28.1 bits (63), Expect = 0.67
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 238 KCDHCVKSFNQKPNLISHIKS 258
           KC  C KSF++K NL  H+++
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRT 21



 Score = 27.3 bits (61), Expect = 1.3
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 183 CDQCGKSFTLKQNLTYHVNS 202
           C  CGKSF+ K NL  H+ +
Sbjct: 2   CPDCGKSFSRKSNLKRHLRT 21



 Score = 26.9 bits (60), Expect = 1.9
 Identities = 10/23 (43%), Positives = 14/23 (60%), Gaps = 1/23 (4%)

Query: 266 KCEECTKTFAQKQNLLSHVQAIH 288
           KC +C K+F++K NL  H    H
Sbjct: 1   KCPDCGKSFSRKSNLKRH-LRTH 22



 Score = 25.0 bits (55), Expect = 7.1
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 294 TCEVCSKSYNQVTNLLSHMTT 314
            C  C KS+++ +NL  H+ T
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRT 21


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 29.9 bits (67), Expect = 0.15
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 237 FKCDHCVKSFNQKPNLISHIKSVH 260
           FKC  C KSF+ K  L  H++  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 29.1 bits (65), Expect = 0.31
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 210 KCDFCDKTFAHKPNLIAHMNSAH 232
           KC  C K+F+ K  L  H+   H
Sbjct: 2   KCPLCGKSFSSKDALKRHLRKHH 24



 Score = 28.4 bits (63), Expect = 0.55
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 181 FPCDQCGKSFTLKQNLTYHVNSIH 204
           F C  CGKSF+ K  L  H+   H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 28.4 bits (63), Expect = 0.63
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 265 YKCEECTKTFAQKQNLLSHVQAIH 288
           +KC  C K+F+ K  L  H++  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 27.6 bits (61), Expect = 0.89
 Identities = 7/24 (29%), Positives = 12/24 (50%)

Query: 293 YTCEVCSKSYNQVTNLLSHMTTVH 316
           + C +C KS++    L  H+   H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 25.7 bits (56), Expect = 4.5
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 321 YKCADCGRLFAQEHTLRTHTLSVH 344
           +KC  CG+ F+ +  L+ H    H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 31.4 bits (72), Expect = 0.15
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 441 EEEEESEMEEEEEEAGF 457
           E++EE E EEE+++ GF
Sbjct: 69  EKKEEEEEEEEDDDMGF 85


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 28.6 bits (64), Expect = 0.40
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 265 YKCEECTKTFAQKQNLLSHVQ 285
           Y+C EC K F  K  L  H++
Sbjct: 1   YRCPECGKVFKSKSALREHMR 21



 Score = 28.2 bits (63), Expect = 0.68
 Identities = 7/22 (31%), Positives = 12/22 (54%)

Query: 237 FKCDHCVKSFNQKPNLISHIKS 258
           ++C  C K F  K  L  H+++
Sbjct: 1   YRCPECGKVFKSKSALREHMRT 22



 Score = 27.0 bits (60), Expect = 1.4
 Identities = 8/19 (42%), Positives = 9/19 (47%)

Query: 210 KCDFCDKTFAHKPNLIAHM 228
           +C  C K F  K  L  HM
Sbjct: 2   RCPECGKVFKSKSALREHM 20



 Score = 27.0 bits (60), Expect = 1.6
 Identities = 8/22 (36%), Positives = 10/22 (45%)

Query: 293 YTCEVCSKSYNQVTNLLSHMTT 314
           Y C  C K +   + L  HM T
Sbjct: 1   YRCPECGKVFKSKSALREHMRT 22



 Score = 26.7 bits (59), Expect = 2.3
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 181 FPCDQCGKSFTLKQNLTYHV 200
           + C +CGK F  K  L  H+
Sbjct: 1   YRCPECGKVFKSKSALREHM 20


>gnl|CDD|132031 TIGR02986, restrict_Alw26I, type II restriction endonuclease,
           Alw26I/Eco31I/Esp3I family.  Members of this family are
           type II restriction endonucleases of the
           Alw26I/Eco31I/Esp3I family. Characterized specificities
           of three members are GGTCTC, CGTCTC, and the shared
           subsequence GTCTC [DNA metabolism,
           Restriction/modification].
          Length = 424

 Score = 32.5 bits (74), Expect = 0.46
 Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 6/51 (11%)

Query: 177 PVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAH 227
           P    PC  CGK+     +L Y   + +   R+K  F D+TF  K  LI  
Sbjct: 79  PTGVKPCQTCGKTM----SLGYSYPNHNFAKRLKKTFPDETF--KDQLIED 123


>gnl|CDD|128906 smart00659, RPOLCX, RNA polymerase subunit CX.  present in RNA
           polymerase I, II and III.
          Length = 44

 Score = 28.9 bits (65), Expect = 0.50
 Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 11/36 (30%)

Query: 320 KYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCG 355
            Y C +CGR            +     VR  CR CG
Sbjct: 2   IYICGECGREN---------EIKSKDVVR--CRECG 26


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 30.3 bits (69), Expect = 0.61
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 439 QGEEEEESEMEEEEEEAGFE 458
           + EEEEE E +EE EE    
Sbjct: 80  KKEEEEEEEEKEESEEEAAA 99



 Score = 29.5 bits (67), Expect = 1.0
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 438 DQGEEEEESEMEEEEEEAG 456
            + EEEEE E EE EEEA 
Sbjct: 80  KKEEEEEEEEKEESEEEAA 98



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 435 GDGDQGEEEEESEMEEEEEE 454
                  EE++ E EEEEE+
Sbjct: 71  AAAAAAAEEKKEEEEEEEEK 90


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 28.1 bits (63), Expect = 0.69
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 251 NLISHIKSVHEGIKYKCEECTKTFAQ 276
           NL  H+++      YKC  C K+F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 27.4 bits (61), Expect = 1.1
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 307 NLLSHMTTVHQAQKYKCADCGRLFAQ 332
           NL  HM T    + YKC  CG+ F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|140324 PTZ00303, PTZ00303, phosphatidylinositol kinase; Provisional.
          Length = 1374

 Score = 32.4 bits (73), Expect = 0.70
 Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 13/57 (22%)

Query: 323 CADCGRLFAQEHTLRTHTLSVHSGVRLT-CRTCGL----FCDSKRALTAHVREAHPG 374
           C  CGR F         +LS   G R   CR+CG+    FC +KRA  +  + A PG
Sbjct: 463 CPSCGRAFI--------SLSRPLGTRAHHCRSCGIRLCVFCITKRAHYSFAKLAKPG 511


>gnl|CDD|202452 pfam02892, zf-BED, BED zinc finger. 
          Length = 43

 Score = 28.4 bits (64), Expect = 0.78
 Identities = 6/28 (21%), Positives = 11/28 (39%), Gaps = 3/28 (10%)

Query: 208 RIKCDFCDKTFAHKP---NLIAHMNSAH 232
           + +C +C K  +      NL  H+    
Sbjct: 15  KARCKYCGKILSGGGGTSNLKRHLVRKC 42


>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger. 
          Length = 27

 Score = 28.0 bits (63), Expect = 0.83
 Identities = 8/21 (38%), Positives = 9/21 (42%)

Query: 210 KCDFCDKTFAHKPNLIAHMNS 230
            C  C KTF+    L  H  S
Sbjct: 3   TCGVCGKTFSSLQALGGHKKS 23



 Score = 26.0 bits (58), Expect = 4.3
 Identities = 7/26 (26%), Positives = 13/26 (50%)

Query: 293 YTCEVCSKSYNQVTNLLSHMTTVHQA 318
           +TC VC K+++ +  L  H  +    
Sbjct: 2   HTCGVCGKTFSSLQALGGHKKSHCSL 27


>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type.  This is a zinc-finger
           domain with the CxxCx(12)Hx(6)H motif, found in multiple
           copies in a wide range of proteins from plants to
           metazoans. Some member proteins, particularly those from
           plants, are annotated as being RNA-binding.
          Length = 25

 Score = 27.5 bits (62), Expect = 0.94
 Identities = 6/20 (30%), Positives = 11/20 (55%)

Query: 211 CDFCDKTFAHKPNLIAHMNS 230
           C+ C+ TF  +  L +H+  
Sbjct: 3   CELCNVTFTSESQLKSHLRG 22



 Score = 25.2 bits (56), Expect = 8.2
 Identities = 6/20 (30%), Positives = 13/20 (65%)

Query: 293 YTCEVCSKSYNQVTNLLSHM 312
           + CE+C+ ++   + L SH+
Sbjct: 1   FYCELCNVTFTSESQLKSHL 20


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 31.8 bits (73), Expect = 0.98
 Identities = 18/77 (23%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 380 RGRGRPKLEVPGGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGGDQ---VRGNGDDIRGD 436
           +G+G+P  +   G        G  G GG   + E Q+ L +  G Q   + G G +    
Sbjct: 631 QGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEA 690

Query: 437 GDQGEEEEESEMEEEEE 453
                      M   EE
Sbjct: 691 ARDALGRAGRAMGGAEE 707



 Score = 30.3 bits (69), Expect = 2.8
 Identities = 9/49 (18%), Positives = 15/49 (30%)

Query: 407 GGQRMVENQRMLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEEA 455
           G Q+  + Q   G G   Q    G   +        +       E ++A
Sbjct: 620 GEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQA 668


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 12/16 (75%), Positives = 12/16 (75%)

Query: 441 EEEEESEMEEEEEEAG 456
           EEEEE E EE EEEA 
Sbjct: 82  EEEEEEEEEESEEEAM 97


>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 129

 Score = 29.7 bits (67), Expect = 1.2
 Identities = 14/47 (29%), Positives = 18/47 (38%), Gaps = 6/47 (12%)

Query: 286 AIHMGIKYTCEVC--SKSYNQVTNLLSHMTTVHQAQKYKCADCGRLF 330
           A  + ++ T   C   KS N V             Q+YKC  CG  F
Sbjct: 21  AYAIRMQITKVNCPRCKSSNVVKIGGI----RRGHQRYKCKSCGSTF 63



 Score = 28.5 bits (64), Expect = 3.2
 Identities = 15/82 (18%), Positives = 24/82 (29%), Gaps = 13/82 (15%)

Query: 195 NLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLIS 254
           +  Y +       ++ C  C  +   K   I      H   R+KC  C  +F  +     
Sbjct: 19  DAAYAIRMQI--TKVNCPRCKSSNVVKIGGI---RRGH--QRYKCKSCGSTFTVE----- 66

Query: 255 HIKSVHEGIKYKCEECTKTFAQ 276
              S      YK +    T   
Sbjct: 67  -TGSPLSKALYKIKLQAVTLYM 87


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 7/22 (31%), Positives = 13/22 (59%)

Query: 431 DDIRGDGDQGEEEEESEMEEEE 452
           +++  D D  +EE E E + E+
Sbjct: 8   EELEDDIDDLDEEAEEEEDGED 29



 Score = 28.8 bits (65), Expect = 2.8
 Identities = 7/28 (25%), Positives = 16/28 (57%)

Query: 431 DDIRGDGDQGEEEEESEMEEEEEEAGFE 458
           D+   + +  ++ ++ + E EEEE G +
Sbjct: 2   DEPDEEEELEDDIDDLDEEAEEEEDGED 29


>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 402

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 13/73 (17%), Positives = 24/73 (32%), Gaps = 11/73 (15%)

Query: 235 VRFKCDHC----VKSFNQKPNLISHIK------SVHEGI-KYKCEECTKTFAQKQNLLSH 283
            + +C  C    ++    +   I  +        +     +YKC  C K FA+K + +  
Sbjct: 37  RKHRCPECGQRTIRRHGWRIRKIQDLPLFEVPVYLFLRKRRYKCCRCGKRFAEKVDFVDR 96

Query: 284 VQAIHMGIKYTCE 296
            Q I         
Sbjct: 97  AQRITRRWAQAAA 109


>gnl|CDD|235148 PRK03655, PRK03655, putative ion channel protein; Provisional.
          Length = 414

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 64  FTPGYGGFLPNTPPLGHHYLSPSGLPG 90
           F+PG+ G  P T PL    + PS LPG
Sbjct: 76  FSPGHAGPDPATEPLIGAPVPPSALPG 102


>gnl|CDD|235987 PRK07314, PRK07314, 3-oxoacyl-(acyl carrier protein) synthase II;
           Reviewed.
          Length = 411

 Score = 30.1 bits (69), Expect = 2.3
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 399 VVGGSGVGGGQRMVENQRMLGNGGGDQV 426
           V+ GSG+GG + + E    L   G  +V
Sbjct: 101 VIIGSGIGGLETIEEQHITLLEKGPRRV 128


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 29.1 bits (65), Expect = 2.5
 Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 237 FKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHVQAIH 288
           + C  C+  F+   +L  HI+         C  C K F    + L HV   H
Sbjct: 74  YVCPLCLMPFSSSVSLKQHIRYTEHSKV--CPVCGKEFRNTDSTLDHVCKKH 123


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.0 bits (68), Expect = 2.6
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 431 DDIRGDGDQGEEEEESEMEEEEEEAG 456
           D+     D+ EEEEE E EE E E G
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEG 373


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 425 QVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           +VR + D    + ++GEE+EE E EEE EE
Sbjct: 72  EVRISEDGEEEEVEEGEEDEEEEGEEESEE 101


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.4 bits (68), Expect = 2.9
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 424 DQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           D    + DD   D D  +EE++ E EEEEEE
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179


>gnl|CDD|224907 COG1996, RPC10, DNA-directed RNA polymerase, subunit RPC10
           (contains C4-type Zn-finger) [Transcription].
          Length = 49

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 12/36 (33%), Positives = 14/36 (38%), Gaps = 10/36 (27%)

Query: 320 KYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCG 355
           +YKCA CGR                 G+R  C  CG
Sbjct: 6   EYKCARCGREVEL--------DQETRGIR--CPYCG 31


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 28.7 bits (64), Expect = 3.3
 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 3/41 (7%)

Query: 417 MLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEE---EEEE 454
           M+GN   D     G       +   EEE+ E++E   E EE
Sbjct: 8   MVGNQLKDVKGMLGGKEDEGDESDAEEEDEEIQEALREAEE 48


>gnl|CDD|225566 COG3022, COG3022, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 253

 Score = 29.6 bits (67), Expect = 3.3
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 116 PPSETEDSDDPLGESMVVEPEYTPVTQI 143
            P++T D + PL  + + +PE    +Q 
Sbjct: 6   SPAKTLDEESPLSTTELTQPELLDYSQE 33


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 29.6 bits (67), Expect = 3.5
 Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 8/97 (8%)

Query: 362 RALTAHVREAHPG-CEYKKRGRGRPKLEVPGGGRMVGNVVGGSGVGGG---QRMVENQRM 417
           R+L +  RE +P   + K RG+        G  +  G      G+ G    ++  E +R 
Sbjct: 25  RSLISLYREVNPELLKKKDRGKPASMGLEAGEKKGYGEEEVVDGIPGLELLEKWKEEER- 83

Query: 418 LGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
                 +Q  G   D   D ++  E EE E  ++E E
Sbjct: 84  -KKKEAEQ--GLESDDDDDEEEEWEVEEDEDSDDEGE 117


>gnl|CDD|214746 smart00614, ZnF_BED, BED zinc finger.  DNA-binding domain in
           chromatin-boundary-element-binding proteins and
           transposases.
          Length = 50

 Score = 26.6 bits (59), Expect = 3.6
 Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 5/34 (14%)

Query: 204 HLGVRIKCDFCDKTFAHKP-----NLIAHMNSAH 232
           +   R KC +C K  +        NL  H+   H
Sbjct: 14  NGKQRAKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 28.4 bits (64), Expect = 3.7
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 423 GDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           G  +  + +D   D ++ EEEE+ E +++E+E
Sbjct: 103 GQHLVASEEDESDDDEEDEEEEDDEEDDDEDE 134


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 29.4 bits (66), Expect = 4.2
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 422 GGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           G D +  + D+  GD DQ +  E+SE   EE E
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREESE 255


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3). 
           This family contains EBNA-3A, -3B, and -3C which are
           latent infection nuclear proteins important for
           Epstein-Barr virus (EBV)-induced B-cell immortalisation
           and the immune response to EBV infection.
          Length = 254

 Score = 29.3 bits (66), Expect = 4.2
 Identities = 13/47 (27%), Positives = 18/47 (38%), Gaps = 9/47 (19%)

Query: 408 GQRMVENQRMLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           GQ +      L   G        DD   +     E E+SE E ++EE
Sbjct: 196 GQAL--GLSALRTPGE-----PDDDATVETS--SESEDSESESDDEE 233


>gnl|CDD|177288 PHA00616, PHA00616, hypothetical protein.
          Length = 44

 Score = 26.3 bits (58), Expect = 4.4
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 321 YKCADCGRLFAQEHTLRTHTLSVHSGVRLT 350
           Y+C  CG +F ++  +  H LSVH   +LT
Sbjct: 2   YQCLRCGGIFRKKKEVIEHLLSVHKQNKLT 31



 Score = 26.3 bits (58), Expect = 4.5
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 265 YKCEECTKTFAQKQNLLSHVQAIHMGIKYTCE 296
           Y+C  C   F +K+ ++ H+ ++H   K T E
Sbjct: 2   YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33


>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor and similar
           proteins.  Ig2_FGFR_like: domain similar to the second
           immunoglobulin (Ig)-like domain of fibroblast growth
           factor (FGF) receptor. FGF receptors bind FGF signaling
           polypeptides. FGFs participate in multiple processes
           such as morphogenesis, development, and angiogenesis.
           FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
           -2, -3, -4). Receptor diversity is controlled by
           alternative splicing producing splice variants with
           different ligand binding characteristics and different
           expression patterns. FGFRs have an extracellular region
           comprised of three Ig-like domains, a single
           transmembrane helix, and an intracellular tyrosine
           kinase domain. Ligand binding and specificity reside in
           the Ig-like domains 2 and 3, and the linker region that
           connects these two. FGFR activation and signaling depend
           on FGF-induced dimerization, a process involving cell
           surface heparin or heparin sulfate proteoglycans. This
           group also contains fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 does not have a protein
           tyrosine kinase domain at its C terminus; neither does
           its cytoplasmic domain appear to interact with a
           signaling partner. It has been suggested that FGFRL1 may
           not have any direct signaling function, but instead acts
           as a decoy receptor trapping FGFs and preventing them
           from binding other receptors.
          Length = 85

 Score = 27.3 bits (61), Expect = 4.5
 Identities = 23/72 (31%), Positives = 28/72 (38%), Gaps = 6/72 (8%)

Query: 227 HMNSAHNGVRFKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNL-LSHVQ 285
           H   A + VR K   C  S N +P  I+ +K      K       K   +K  L L  V 
Sbjct: 4   HAVPAGSTVRLK---CPASGNPRPT-ITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVV 59

Query: 286 AIHMGIKYTCEV 297
               G KYTC V
Sbjct: 60  PSDSG-KYTCIV 70


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 27.2 bits (61), Expect = 4.6
 Identities = 8/28 (28%), Positives = 18/28 (64%), Gaps = 5/28 (17%)

Query: 430 GDDIRGDGDQGEEEEESEMEEEEEEAGF 457
           G+D+     + E+EE+ E +++++  GF
Sbjct: 54  GEDL-----ESEDEEDEEEDDDDDMDGF 76


>gnl|CDD|220556 pfam10082, DUF2320, Uncharacterized protein conserved in bacteria
           (DUF2320).  This domain, found in various hypothetical
           bacterial proteins, has no known function.
          Length = 380

 Score = 29.2 bits (66), Expect = 4.9
 Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 9/60 (15%)

Query: 119 ETEDSDDPLGESMVVEPEYTPVTQIGGTAGLE--FNRSQ-------FEKTFQNYSMTTTG 169
           + ED   P      ++        +G +AG+E  F R +          T+Q+  +T   
Sbjct: 106 DHEDRGSPDAVGGGLQKYPVKYHTLGASAGVERDFGRLRVSTGGGVDRTTYQDARLTDGT 165


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 27.6 bits (61), Expect = 5.1
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 423 GDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           G Q   +  +   + D+ +EEEE E EEEE E
Sbjct: 43  GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGE 74


>gnl|CDD|227561 COG5236, COG5236, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 493

 Score = 29.2 bits (65), Expect = 5.1
 Identities = 36/163 (22%), Positives = 50/163 (30%), Gaps = 36/163 (22%)

Query: 180 KFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFC---DKTFAH------KPNLIAHMNS 230
           K P  +C +     + L  H  + H    + C  C    K F +         L  H N 
Sbjct: 153 KCPKSKCHRRCGSLKELKKHYKAQH--GFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNG 210

Query: 231 AHNGVRFK----CDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKT-------FAQKQN 279
                 FK    C  C   F     L  H +  HE     C  C          F   ++
Sbjct: 211 GLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYED 266

Query: 280 LLSHVQAIHMGIKYTCEVCSKSYNQV------TNLLSHMTTVH 316
           L +H +  H    Y C   +    +       T LL H+T  H
Sbjct: 267 LEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHLTRFH 305


>gnl|CDD|221105 pfam11397, GlcNAc, Glycosyltransferase (GlcNAc).  GlcNAc is an
           enzyme that carries out the first glycosylation step of
           hydroxylated Skp1, a ubiquitous eukaryotic protein, in
           the cytoplasm.
          Length = 321

 Score = 28.8 bits (65), Expect = 5.2
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 1/32 (3%)

Query: 55  EILKSISASFTPGYGGFLPNTPPLGHHYLSPS 86
           EIL S  A +T GY  + P+   L H Y  P 
Sbjct: 217 EILYSARA-WTHGYDLYHPHRLLLFHLYTRPK 247


>gnl|CDD|222729 pfam14391, DUF4421, Domain of unknown function (DUF4421).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 336 and 370 amino
           acids in length.
          Length = 306

 Score = 28.8 bits (65), Expect = 5.3
 Identities = 14/66 (21%), Positives = 21/66 (31%), Gaps = 18/66 (27%)

Query: 39  VKSEYNYYQLSSD-------------EKFEILKSISASFTPGYGGFLPNTPPLGHH-YLS 84
               Y Y++L  D              KF+ +K    S + GYG           +  L+
Sbjct: 167 PGISYYYHKLEFDSSLLPSLIQKTLSLKFQKVKYFDISISAGYGY----NWVFAKNWLLT 222

Query: 85  PSGLPG 90
            S  P 
Sbjct: 223 GSLSPA 228


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 27.6 bits (61), Expect = 5.4
 Identities = 10/39 (25%), Positives = 16/39 (41%)

Query: 419 GNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEEAGF 457
           G                  +  +EE++ E EEEE++ GF
Sbjct: 70  GVAAAAAPAAGAATAGAKAEAKKEEKKEEEEEEEDDLGF 108


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 28.8 bits (64), Expect = 5.5
 Identities = 29/152 (19%), Positives = 50/152 (32%), Gaps = 18/152 (11%)

Query: 315 VHQAQKYKCADCGRL-----------FAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRA 363
           V + + +KC  C RL              +H  R           L C  CG      + 
Sbjct: 166 VLENENFKCMSCNRLGQYSCLRCKVCCCNDHV-RRKGFKYDKDKNLPCPKCGYPTTETKD 224

Query: 364 LTAHVREAHPGCEYKKRGRGRPKLEVPGGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGG 423
           L+   R      ++ ++ RG    +  G GR   N    S   G  + V       +  G
Sbjct: 225 LSMSTRSY----KFGRQTRGFRSDDDQGYGRYKQN--LSSNKYGDFKGVNYGYESDDDEG 278

Query: 424 DQVRGNGDDIRGDGDQGEEEEESEMEEEEEEA 455
                  ++  GD D  E+ +++     +E+ 
Sbjct: 279 SSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDC 310


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 2/49 (4%)

Query: 406 GGGQRMVENQRMLGNGGGDQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
              + + E   ML     +    + D+   D    EEEEE E E+E+ +
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDE--EDEKDEEEEEEEEEEDEDFD 190



 Score = 28.2 bits (63), Expect = 7.7
 Identities = 6/31 (19%), Positives = 18/31 (58%), Gaps = 2/31 (6%)

Query: 424 DQVRGNGDDIRGDGDQGEEEEESEMEEEEEE 454
           D    +  D   + ++ EEEE+ + ++++++
Sbjct: 167 DVDEEDEKD--EEEEEEEEEEDEDFDDDDDD 195


>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
           This domain family is found in archaea and eukaryotes,
           and is approximately 30 amino acids in length. The
           mammalian members of this group occur multiple times
           along the protein, joined by flexible linkers, and are
           referred to as JAZ - dsRNA-binding ZF protein -
           zinc-fingers. The JAZ proteins are expressed in all
           tissues tested and localise in the nucleus, particularly
           the nucleolus. JAZ preferentially binds to
           double-stranded (ds) RNA or RNA/DNA hybrids rather than
           DNA. In addition to binding double-stranded RNA, these
           zinc-fingers are required for nucleolar localisation.
          Length = 27

 Score = 25.6 bits (57), Expect = 5.8
 Identities = 8/20 (40%), Positives = 10/20 (50%)

Query: 211 CDFCDKTFAHKPNLIAHMNS 230
           C  CDK F  +  L  H+ S
Sbjct: 4   CVACDKYFKSENALENHLKS 23


>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
          Length = 629

 Score = 29.0 bits (65), Expect = 5.8
 Identities = 13/45 (28%), Positives = 17/45 (37%)

Query: 391 GGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGGDQVRGNGDDIRG 435
           G  R  G   GG    GG+     +R  G G G +  G   + R 
Sbjct: 569 GERRGGGRGFGGERREGGRNFSGERREGGRGDGRRFSGERREGRA 613


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 28.9 bits (64), Expect = 5.9
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 437 GDQGEEEEESEMEEEEEEAGFE 458
            D+ +E+EE E EEEEEEA  E
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVE 153


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 28.9 bits (65), Expect = 6.1
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 431 DDIRGDGDQGEEEEESEMEEEEEEAG 456
           +D   + D  EEE+E E EEE+E+ G
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEG 407


>gnl|CDD|217969 pfam04216, FdhE, Protein involved in formate dehydrogenase
           formation.  The function of these proteins is unknown.
           They may possibly be involved in the formation of
           formate dehydrogenase.
          Length = 283

 Score = 28.5 bits (64), Expect = 6.3
 Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 7/46 (15%)

Query: 207 VRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFK-CDHC---VKSFNQ 248
           VR+KC  C  T   K      +     GVR + CD C   +K  +Q
Sbjct: 204 VRVKCTNCGST---KGLAYWSLEEGEPGVRAETCDTCHSYLKILDQ 246


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.8 bits (65), Expect = 6.7
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 431 DDIRGDGDQGEEEEESEMEEEEEE 454
            D   D +  + ++E + EEEEEE
Sbjct: 352 SDEEEDEEDEDSDDEDDEEEEEEE 375


>gnl|CDD|222704 pfam14353, CpXC, CpXC protein.  This presumed domain is
           functionally uncharacterized. This domain is found in
           bacteria and archaea, and is typically between 122 and
           134 amino acids in length. It contains four conserved
           cysteines forming two CpXC motifs.
          Length = 128

 Score = 27.3 bits (61), Expect = 9.2
 Identities = 8/20 (40%), Positives = 8/20 (40%)

Query: 180 KFPCDQCGKSFTLKQNLTYH 199
            F C  CG  F L     YH
Sbjct: 38  TFTCPSCGHEFRLDYPFLYH 57


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 28.4 bits (63), Expect = 9.4
 Identities = 13/89 (14%), Positives = 19/89 (21%), Gaps = 10/89 (11%)

Query: 370 EAHPGCEYKKRGRGRPKLEVPGGGRMVGNVVGGSGVGGGQRMVENQRMLGNGGGDQVRGN 429
                 E  +     P+ + PG  +                  E Q              
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPS----- 224

Query: 430 GDDIRGDGDQGEEEEESEMEEEEEEAGFE 458
                 +  Q  E E+   E E E   F 
Sbjct: 225 -----PNTQQAVEHEDEPTEPEREGPPFP 248


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 27.3 bits (61), Expect = 10.0
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 435 GDGDQGEEEEESEMEEEEEEAGFE 458
            D ++  E E++E  E++E  GF 
Sbjct: 83  NDDEEDAETEDTEDVEDDEWEGFP 106


>gnl|CDD|188824 cd09440, LIM1_SF3, The first Lim domain of pollen specific protein
           SF3.  The first Lim domain of pollen specific protein
           SF3: SF3 is a Lim protein that is found exclusively in
           mature plant pollen grains. It contains two LIM domains.
           The exact function of SF3 is unknown. It may be a
           transcription factor required for the expression of late
           pollen genes. It is possible that SF3 protein is
           involved in controlling pollen-specific processes such
           as male gamete maturation, pollen tube formation, or
           even fertilization. As in other LIM domains, this domain
           family is 50-60 amino acids in size and shares two
           characteristic zinc finger motifs. The two zinc fingers
           contain eight conserved residues, mostly cysteines and
           histidines, which coordinately bond to two zinc atoms.
           LIM domains function as adaptors or scaffolds to support
           the assembly of multimeric protein.
          Length = 63

 Score = 25.9 bits (57), Expect = 10.0
 Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 210 KCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHC 242
           KC  CDKT  +  + ++     ++   F+C HC
Sbjct: 4   KCKACDKT-VYLVDQLSADGVVYHKSCFRCSHC 35


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.134    0.417 

Gapped
Lambda     K      H
   0.267   0.0707    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,752,952
Number of extensions: 2129715
Number of successful extensions: 3559
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3406
Number of HSP's successfully gapped: 120
Length of query: 458
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 358
Effective length of database: 6,502,202
Effective search space: 2327788316
Effective search space used: 2327788316
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.3 bits)