RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14676
         (1827 letters)



>gnl|CDD|219941 pfam08623, TIP120, TATA-binding protein interacting (TIP20).  TIP120
            (also known as cullin-associated and
            neddylation-dissociated protein 1) is a TATA binding
            protein interacting protein that enhances transcription.
          Length = 169

 Score =  210 bits (537), Expect = 1e-62
 Identities = 86/163 (52%), Positives = 117/163 (71%), Gaps = 6/163 (3%)

Query: 1598 PHIHILLPPVI----IIKTLIREVEMGPFKHTVDDGLDIRKAAFECMYTLLDTCLDRVDM 1653
            P ++ LLP +     + K LIREV+MGPFKH VDDGL++RK A+E MYTLLD+C  R+D+
Sbjct: 5    PLLNELLPALYSETKVKKELIREVQMGPFKHVVDDGLELRKLAYETMYTLLDSCFSRLDI 64

Query: 1654 FEFLNHLESGLRDHYDIKMLTYLMLARLAQMTPAALLQRLERIVEPLRNTVTLKVKANSV 1713
             EFL+ +E+GL D +DIKML  LML++LA + P  +L RL+R+ EPLR T++ KVK N+V
Sbjct: 65   PEFLDRIEAGLSDEHDIKMLALLMLSKLAVLAPEEVLARLDRLAEPLRKTLSKKVKENAV 124

Query: 1714 KQEYEKQDELKRSAMRAVAAL-LTIQDADKNPAL-NDFVATIK 1754
            KQE EKQ EL RS +R V +L L +  A  +P + N+FV  +K
Sbjct: 125  KQEIEKQQELIRSVLRLVVSLNLKLPSAQDSPVVWNEFVEQVK 167


>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region.  This family
            consists of the N terminal region of various alpha, beta
            and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
            protein complexes. The adaptor protein (AP) complexes are
            involved in the formation of clathrin-coated pits and
            vesicles. The N-terminal region of the various adaptor
            proteins (APs) is constant by comparison to the
            C-terminal which is variable within members of the AP-2
            family; and it has been proposed that this constant
            region interacts with another uniform component of the
            coated vesicles.
          Length = 522

 Score = 47.6 bits (114), Expect = 5e-05
 Identities = 72/440 (16%), Positives = 151/440 (34%), Gaps = 88/440 (20%)

Query: 1085 FLLSPVIASVGDNFYKITAEALLVLQQFVKVIRPLTSPSSYDFA-------PFTSDIYQC 1137
            FL+  +   + D    + + A+ +L +  K  R   +                 +   Q 
Sbjct: 151  FLVPELKELLSDKDPGVVSAAVALLYEIRKNDRLYLNKLLPLLVRRLCNLLTVCNPWLQV 210

Query: 1138 TLVRLKAADIDQEVKERAISCMGQIICNLGDYLKSELPVCLPILLDRLRNEITRLTTVKA 1197
             ++RL      Q+ +E       +++ ++ + L++              N       VK 
Sbjct: 211  KILRLLTRYAPQDPREPK-----ELLEDILNLLQN-------------SNNAVLYEAVKT 252

Query: 1198 LTKIAASPLRIDLRPILTDGIPILGSFLRKNQRALKLSTLLLLDTLLVNYPSAISTEMLN 1257
            +  +   P       ++   +  LG  L      L+   L  L+ +L  +P A    + +
Sbjct: 253  IIHLDPEP------ELIVLAVNALGRLLSSPDENLRYVALRNLNKILEKHPPA----VQH 302

Query: 1258 KVTVELAPLVSESDLHIAQLTLTLLTSIAKIQPNALVEPAAASILPQIGRHTDLTSVADL 1317
               + +  L ++ D+ I    L LL  +         E     I+ ++ ++  ++ +AD 
Sbjct: 303  LDLI-IFCLKTDDDISIRLRALDLLYKLVD-------ESNVKEIVKELLKY--VSEIAD- 351

Query: 1318 KQVILNAFTPTSEEVKSAASYTLGSIAVGNLQ---QYVPFVLKEIENQPKR-QYLLLHSL 1373
                         E K      +G +A          +  +L+ +          ++  +
Sbjct: 352  ------------PEFKIKLVKAIGRLAEKFPTDAEWCIDVLLELLSLAGSYVVDEIVEVI 399

Query: 1374 KEIISWQSTSTSGVAQLQQFVPAIWAQLFLHCECQEEGTRNVVAECLGK---LTLIDPSN 1430
            ++II           +L++++     +L    + +    R      LG+   L    PS+
Sbjct: 400  RDII-------RKYPELREYILEHLCELL--EDIESPEARAAALWILGEYGELIPNSPSD 450

Query: 1431 LLPRLQESLKSNSALMRTTVLTA-VKFTISDQMTSSDKDFRFMATNDLMSELQKDSIKLD 1489
            LL  + E     S  +R  +LTA VK ++    T  D+       N ++  +   + +  
Sbjct: 451  LLRSILEVFVLESLKVRLALLTALVKLSL----TFPDE----EVQNLIVQVVLSLATQDS 502

Query: 1490 DDSE---RKVVKMLLRLLED 1506
             D E   R V    LRLL  
Sbjct: 503  SDLELRDRAV--EYLRLLSL 520



 Score = 32.6 bits (75), Expect = 2.5
 Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 5/69 (7%)

Query: 638 IPILGSFLRKNQRALKLSTLLLLDTLLVNYPSAISTEMLNKVTVELAPLVSESDLHIAQL 697
           +  LG  L      L+   L  L+ +L  +P A    + +   + +  L ++ D+ I   
Sbjct: 267 VNALGRLLSSPDENLRYVALRNLNKILEKHPPA----VQHLDLI-IFCLKTDDDISIRLR 321

Query: 698 TLTLLTSIA 706
            L LL  + 
Sbjct: 322 ALDLLYKLV 330


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 42.6 bits (101), Expect = 7e-05
 Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 13/46 (28%)

Query: 391 DVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 436
           D  S AEW+EEE        EG     ED E E E+D E  DDDDM
Sbjct: 41  DYDSDAEWEEEE--------EG-----EDLESEDEEDEEEDDDDDM 73



 Score = 36.8 bits (86), Expect = 0.008
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 14/46 (30%)

Query: 214 NYNYEDEEEGLGGEGGGEGMDTEEEDGED---EGEDDNEYSDDDDM 256
           +Y+Y+ + E             EEE+GED   E E+D E  DDDDM
Sbjct: 39  DYDYDSDAEW-----------EEEEEGEDLESEDEEDEEEDDDDDM 73



 Score = 31.4 bits (72), Expect = 0.54
 Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 4/38 (10%)

Query: 211 YDPNYNYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDN 248
           YD +  +E+EEEG   E   E    E+E+ +D+ + D 
Sbjct: 42  YDSDAEWEEEEEGEDLESEDE----EDEEEDDDDDMDG 75


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
           related to armadillo/beta-catenin-like repeats (see
           pfam00514). These EZ repeats are found in subunits of
           cyanobacterial phycocyanin lyase and other proteins and
           probably carry out a scaffolding role.
          Length = 55

 Score = 38.2 bits (89), Expect = 0.001
 Identities = 16/54 (29%), Positives = 20/54 (37%)

Query: 53  SVQLEALDILSDLLSRFGALLPSFHFVILTALLPQLSSPRQAVRKRTIVALSHL 106
            V+  A   L  L      LL      +L ALLP L      VR+    AL  +
Sbjct: 2   EVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55



 Score = 29.7 bits (67), Expect = 1.3
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 1150 EVKERAISCMGQIICNLGDYLKSELPVCLPILLDRLR--NEITRLTTVKALTKI 1201
            EV+E A   +G +     + L+  +P  LP LL  L+  ++  R     AL +I
Sbjct: 2    EVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55



 Score = 28.6 bits (64), Expect = 4.1
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 258 WKVRRAAAKCLEAIIATRRDLIGDMYRLVSPPLIARFKEREENVK 302
           W+VR AAA  L A+     +L+      + P L+   K+ ++ V+
Sbjct: 1   WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVR 45



 Score = 28.6 bits (64), Expect = 4.1
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 438 WKVRRAAAKCLEAIIATRRDLIGDMYRLVSPPLIARFKEREENVK 482
           W+VR AAA  L A+     +L+      + P L+   K+ ++ V+
Sbjct: 1   WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVR 45


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 41.5 bits (97), Expect = 0.006
 Identities = 14/38 (36%), Positives = 28/38 (73%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           + +E+G+ G GG +G D+EEE+ E+E E++ E  ++++
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883



 Score = 41.1 bits (96), Expect = 0.006
 Identities = 14/37 (37%), Positives = 27/37 (72%)

Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
            +E+G+ G GG +G D+EEE+ E+E E++ E  ++++
Sbjct: 847 QDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883


>gnl|CDD|182354 PRK10281, PRK10281, hypothetical protein; Provisional.
          Length = 299

 Score = 39.3 bits (92), Expect = 0.016
 Identities = 30/102 (29%), Positives = 42/102 (41%), Gaps = 9/102 (8%)

Query: 1193 TTVKALTKIAASPLRI-----DLRPILTDGIPILGSFLRKNQRALKLSTL-LLLDTLLVN 1246
             TV   T     P+ I     D R  +T G P     L    RA  ++ L L  D +L  
Sbjct: 94   CTVWQTTLAGILPVDIEKENDDYRISMTQGTPEFEPPLEGETRAAIINALGLTEDDILPG 153

Query: 1247 YPSAISTEMLNKVTVELAPLVSESDLHIAQLTLTLLTSIAKI 1288
             P  +++   +KV +   PL  E DL      L  LT+I+K 
Sbjct: 154  LPIQVASTGHSKVMI---PLKPEVDLDALSPNLAALTAISKQ 192



 Score = 35.8 bits (83), Expect = 0.17
 Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 4/81 (4%)

Query: 629 DLRPILTDGIPILGSFLRKNQRALKLSTL-LLLDTLLVNYPSAISTEMLNKVTVELAPLV 687
           D R  +T G P     L    RA  ++ L L  D +L   P  +++   +KV +   PL 
Sbjct: 115 DYRISMTQGTPEFEPPLEGETRAAIINALGLTEDDILPGLPIQVASTGHSKVMI---PLK 171

Query: 688 SESDLHIAQLTLTLLTSIAKI 708
            E DL      L  LT+I+K 
Sbjct: 172 PEVDLDALSPNLAALTAISKQ 192


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 38.1 bits (89), Expect = 0.037
 Identities = 14/50 (28%), Positives = 24/50 (48%)

Query: 389 IPDVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSW 438
           IP +  + +W EEE    E        +++D E+E E + +   DD+  W
Sbjct: 69  IPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEW 118



 Score = 32.7 bits (75), Expect = 1.6
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 217 YEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
            + +EE    +   +G++++++D E+E  +  E  D DD
Sbjct: 76  EKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDD 114


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 35.1 bits (81), Expect = 0.21
 Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 8/47 (17%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYS--------DDDDM 256
            DEE+    E   E  + +E+  +D+ +DD++Y+        +DDD 
Sbjct: 168 VDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDY 214



 Score = 34.7 bits (80), Expect = 0.31
 Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 8/46 (17%)

Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYS--------DDDDM 436
           DEE+    E   E  + +E+  +D+ +DD++Y+        +DDD 
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDY 214



 Score = 33.6 bits (77), Expect = 0.74
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 220 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           E E    +   E  + EEE+ E+E ED ++  DDDD
Sbjct: 162 ELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 33.6 bits (77), Expect = 0.74
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           E E    +   E  + EEE+ E+E ED ++  DDDD
Sbjct: 162 ELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 32.4 bits (74), Expect = 1.4
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 256
           +  +E    +   E    EEE+ E+E ED++   DDDD 
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196



 Score = 32.4 bits (74), Expect = 1.5
 Identities = 14/40 (35%), Positives = 20/40 (50%)

Query: 397 EWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 436
           E   +E    +   E    EEE+ E+E ED++   DDDD 
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196



 Score = 32.4 bits (74), Expect = 1.5
 Identities = 14/38 (36%), Positives = 23/38 (60%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           E + + L  E   E  + +EE+ E+E E+D ++ DDDD
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 32.4 bits (74), Expect = 1.8
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           E E+     E   E  + EEE+ ED  +DD++  DDDD
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD--DDDD 199



 Score = 32.0 bits (73), Expect = 2.3
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 231 EGMDTEEEDGEDEGEDDNEYSDDDDMS 257
           +  D ++E+ E+E E+++E  DDDD  
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDD 195



 Score = 32.0 bits (73), Expect = 2.3
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 411 EGMDTEEEDGEDEGEDDNEYSDDDDMS 437
           +  D ++E+ E+E E+++E  DDDD  
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDD 195



 Score = 31.3 bits (71), Expect = 3.9
 Identities = 15/36 (41%), Positives = 21/36 (58%)

Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           E E +  E   +  + EEE+ EDE  DD++  DDDD
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDD 199


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 34.6 bits (80), Expect = 0.40
 Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 5/93 (5%)

Query: 345 PSLVKALQCQLREKSIRTRQDCLALLRELVQTLPGCLAPSLPALIPDVTSVAEWDEEEGL 404
           P  V    C  ++K +      L  ++E+++   G          P+V    E D EE L
Sbjct: 227 PQYVIVTTCSDKDKGMEIIGAALEAIKEVIKKKGGDFKVKGE---PEVVGGDEEDLEELL 283

Query: 405 GG--EGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
               E   E   +E ED ++E ED+ E  DDD+
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316


>gnl|CDD|204198 pfam09324, DUF1981, Domain of unknown function (DUF1981).  Members of
            this family of functionally uncharacterized domains are
            found in various plant and yeast protein transport
            proteins.
          Length = 86

 Score = 31.8 bits (73), Expect = 0.49
 Identities = 16/53 (30%), Positives = 21/53 (39%), Gaps = 9/53 (16%)

Query: 1129 PFTSDIYQCTLVRLKAADIDQEVKERAISCMGQIICNLGDYLKSELPVCLPIL 1181
            PF   I             + EVKE  + C+ QII +  D LKS       +L
Sbjct: 21   PF-EYIMSNNQ--------NVEVKELVLECISQIILSRADNLKSGWKTIFGVL 64


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 34.7 bits (80), Expect = 0.60
 Identities = 25/114 (21%), Positives = 39/114 (34%), Gaps = 6/114 (5%)

Query: 645 LRKNQRALKLSTLLLLDTLLVNYPSAISTEMLNKVTVELAPLVSESDLHIAQLTLTLLTS 704
               Q    L  L       VN       E   +     A L+ +      QL+  L   
Sbjct: 22  AESAQLEAALQLLQEA----VNSKRQEEAEPAAEEAELQAELIQQELAINDQLSQALNQQ 77

Query: 705 IAKIQPNALVEPAAASILPQVLQIAKSPLLQGAALTSMLEFFKSL--VLAELPG 756
             ++   A  +   A++L Q+LQ +++   Q A L   L   + L   L  LP 
Sbjct: 78  TERLNALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRILLQQLGPLPE 131


>gnl|CDD|218375 pfam05004, IFRD, Interferon-related developmental regulator (IFRD).
            Interferon-related developmental regulator (IFRD1) is
           the human homologue of the rat early response protein
           PC4 and its murine homologue TIS7. The exact function of
           IFRD1 is unknown but it has been shown that PC4 is
           necessary to muscle differentiation and that it might
           have a role in signal transduction. This family also
           contains IFRD2 and its murine equivalent SKMc15 which
           are highly expressed soon after gastrulation and in the
           hepatic primordium, suggesting an involvement in early
           hematopoiesis.
          Length = 309

 Score = 34.0 bits (78), Expect = 0.69
 Identities = 24/86 (27%), Positives = 37/86 (43%), Gaps = 8/86 (9%)

Query: 519 LLSTQVPSLVKALQCQLREKSIRTRQDCLALLRELMTSSDKDFRFMATNDLMSEL----- 573
           +L+T  P L   L     +  I   +  LALL EL    ++DF       L+S+L     
Sbjct: 225 ILATHQPKLPTLLSSDDLDVRIAAGE-TLALLFELARDEEEDFVEEDMEALLSQLRQLAT 283

Query: 574 --QKDSIKLDDDSERKVVKMLLRLLE 597
              K   K D  ++R   + +LR +E
Sbjct: 284 DSSKYRAKKDRRAQRSTFRDVLRFVE 309


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 34.0 bits (78), Expect = 0.86
 Identities = 35/228 (15%), Positives = 71/228 (31%), Gaps = 29/228 (12%)

Query: 223 GLGGEGGGEGMDTEEEDGEDE-GEDDNEYSDDDDMSWKVRRAAAKCLEAIIATRRDLIG- 280
           GL      E  +T  E  + E  +++    ++D    K ++   +  E+ +    + +  
Sbjct: 148 GLADSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFEALNV 207

Query: 281 ---DMYRLVSP---PLIARFKEREENVKSDIFHAYMALLKGTRTF--SPLTPSDPNQMDQ 332
              D+   V P      A F   +E     +   ++++    R    S   P + +  D 
Sbjct: 208 KKLDLEFAVDPLFKKTSADF---DEGGAKGLLLNHLSIDSEGRLVFDSSEDPEELSSEDD 264

Query: 333 EDSPLCLLSTQVPSLVKALQCQLREKSIRTRQDCLALLRELVQTLPGCLAPSLPALIPDV 392
           E      +          L   L ++ I         L     T      PS     P +
Sbjct: 265 EFPDDGEIDLTPLKFFPLLDR-LEKRPICPS------LAGFQLT-----DPSGSLDDPFL 312

Query: 393 TSVAEWDEEEGLGGEGGGEGMDTEEE----DGEDEGEDDNEYSDDDDM 436
            +  + + E+           + ++     D   E EDD +   +D  
Sbjct: 313 KAPEDTENEDESDDNADDFDDNDKKGDVGFDINAEVEDDPDEDAEDGG 360


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 31.5 bits (71), Expect = 1.0
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 8/58 (13%)

Query: 213 PNYNYEDEEEGLGGEGGGEGMDTEEED------GEDEGEDDNEYSDDDDMSWKVRRAA 264
           P    E+EE G   + G + M+ EEE        E EGE++    +++      +RAA
Sbjct: 33  PANGNENEENG--AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAA 88



 Score = 30.3 bits (68), Expect = 2.3
 Identities = 11/38 (28%), Positives = 18/38 (47%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           E +E+    EG GE  + EEE+  +         D++D
Sbjct: 56  EVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEED 93



 Score = 29.9 bits (67), Expect = 4.3
 Identities = 12/45 (26%), Positives = 23/45 (51%)

Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWKVRRAA 444
            +EG       E +D ++E+ E EGE++    +++      +RAA
Sbjct: 44  AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAA 88



 Score = 29.5 bits (66), Expect = 4.6
 Identities = 12/45 (26%), Positives = 19/45 (42%)

Query: 391 DVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           D     E  +E+    EG GE  + EEE+  +         D++D
Sbjct: 49  DEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEED 93



 Score = 29.5 bits (66), Expect = 5.2
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           E E  G + G + M+ EEE  ED+ E++ E  +++ 
Sbjct: 38  ENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEG 73


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 33.8 bits (78), Expect = 1.0
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 217 YEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 257
            E+EE+G+  E   +  D  EE+ ED    D E  ++D+ S
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363



 Score = 33.4 bits (77), Expect = 1.5
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 3/40 (7%)

Query: 216 NYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
            +E +++   G G GE  + EEEDG D   D++E  DDDD
Sbjct: 305 EFEPDDDDNFGLGQGEEDEEEEEDGVD---DEDEEDDDDD 341



 Score = 33.0 bits (76), Expect = 1.7
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 4/39 (10%)

Query: 397 EWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           E D+++  G   G E  + EE+  +DE E+D    DDDD
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEED----DDDD 341



 Score = 31.9 bits (73), Expect = 4.5
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 437
           +EE+G+  E   +  D  EE+ ED    D E  ++D+ S
Sbjct: 325 EEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363


>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 20/102 (19%), Positives = 33/102 (32%), Gaps = 10/102 (9%)

Query: 519 LLSTQVPSLVKALQCQLREKSIRT--------RQDCLALLRELMTSSDKDFRFMATNDLM 570
           LLS+   ++ +     L   S               L  L +L+ S D++    A    +
Sbjct: 15  LLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAAL-WAL 73

Query: 571 SELQKDSIKLDDDSERK-VVKMLLRLLEDKNGEVQNLAVKCL 611
             L                V  L+ LL+  N ++Q  A   L
Sbjct: 74  RNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGAL 115


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 33.0 bits (75), Expect = 1.2
 Identities = 16/32 (50%), Positives = 17/32 (53%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNE 249
           EDEE G    G G     EEE+ E EG DD E
Sbjct: 192 EDEEVGSDSYGEGNRELNEEEEEEAEGSDDGE 223



 Score = 32.7 bits (74), Expect = 1.7
 Identities = 16/33 (48%), Positives = 17/33 (51%)

Query: 397 EWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNE 429
           E DEE G    G G     EEE+ E EG DD E
Sbjct: 191 EEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGE 223



 Score = 31.1 bits (70), Expect = 4.8
 Identities = 14/70 (20%), Positives = 24/70 (34%), Gaps = 6/70 (8%)

Query: 200 SITELCLTYITYDPNYNYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWK 259
            +T   + +   + N      +  +  E   E    EE DG +E ED+       D   +
Sbjct: 150 RVTRFNIVWDNDEDNDEAPPAQPDVDNE---EEERLEESDGREEEEDE---EVGSDSYGE 203

Query: 260 VRRAAAKCLE 269
             R   +  E
Sbjct: 204 GNRELNEEEE 213


>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain.  This putative domain is found
           to be the most conserved region in mediator of
           replication checkpoint protein 1.
          Length = 145

 Score = 32.0 bits (73), Expect = 1.3
 Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 10/53 (18%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGED----DNEYSDDDDMSWKVRRAAAK 266
           EDE  GLGG       D E+ D +D+ E     D+EYS +   + ++    A+
Sbjct: 18  EDEWAGLGG------ADDEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAE 64



 Score = 30.0 bits (68), Expect = 6.5
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 10/52 (19%)

Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGED----DNEYSDDDDMSWKVRRAAAK 446
           DE  GLGG       D E+ D +D+ E     D+EYS +   + ++    A+
Sbjct: 19  DEWAGLGG------ADDEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAE 64


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 33.0 bits (75), Expect = 1.5
 Identities = 10/37 (27%), Positives = 14/37 (37%), Gaps = 1/37 (2%)

Query: 219 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           ++     G   G   D +E        D+ E  DDDD
Sbjct: 259 NKYGDFKGVNYGYESDDDEGS-SSNDYDEEEDGDDDD 294



 Score = 33.0 bits (75), Expect = 1.5
 Identities = 10/37 (27%), Positives = 14/37 (37%), Gaps = 1/37 (2%)

Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           ++     G   G   D +E        D+ E  DDDD
Sbjct: 259 NKYGDFKGVNYGYESDDDEGS-SSNDYDEEEDGDDDD 294


>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
          Length = 233

 Score = 32.1 bits (73), Expect = 2.1
 Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)

Query: 399 DEEEGLGGEGGGEGMDTEEED-GEDEGEDDNEYSDDD 434
           DEE+G            +EE+  ED+ E+  E  D D
Sbjct: 196 DEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232



 Score = 30.5 bits (69), Expect = 7.3
 Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 1/38 (2%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDE-GEDDNEYSDDD 254
           EDEE+G            +EE+ +++  E+  E  D D
Sbjct: 195 EDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 32.7 bits (74), Expect = 2.1
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 224 LGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           LG E   E M+ E++D ED+ +DD+E  DDDD
Sbjct: 41  LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDD 72



 Score = 32.7 bits (74), Expect = 2.1
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 404 LGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           LG E   E M+ E++D ED+ +DD+E  DDDD
Sbjct: 41  LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDD 72



 Score = 31.9 bits (72), Expect = 4.0
 Identities = 13/40 (32%), Positives = 24/40 (60%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 257
           E EEE +  E   E  D +++D +++ +DD++  DD+D  
Sbjct: 44  EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDED 83



 Score = 30.3 bits (68), Expect = 9.8
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 437
           EEE +  E   E  D +++D +++ +DD++  DD+D  
Sbjct: 46  EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDED 83


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
           HEAT repeats.
          Length = 88

 Score = 30.0 bits (68), Expect = 2.3
 Identities = 19/65 (29%), Positives = 30/65 (46%), Gaps = 11/65 (16%)

Query: 548 ALLRELMTSSDKDFRFMATNDLMSELQKDSIKLDDDSERKVVKMLLRLLEDKNGEVQNLA 607
           ALL  L++  D + R  A    + EL           + + +  LL LL+D + EV+  A
Sbjct: 3   ALLEALLSDPDPEVRAAAAR-ALGELG----------DPEALPALLELLKDPDPEVRRAA 51

Query: 608 VKCLG 612
            + LG
Sbjct: 52  AEALG 56


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 32.6 bits (75), Expect = 2.3
 Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 3/42 (7%)

Query: 219 DEEEGLGGEGGGEGM---DTEEEDGEDEGEDDNEYSDDDDMS 257
            +++G   E GG G    + E E  + E E ++ +    D+S
Sbjct: 2   ADDKGTDPEEGGSGWFLVEAEAECSDGEDELEDLFDTGSDIS 43



 Score = 32.6 bits (75), Expect = 2.3
 Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 3/42 (7%)

Query: 399 DEEEGLGGEGGGEGM---DTEEEDGEDEGEDDNEYSDDDDMS 437
            +++G   E GG G    + E E  + E E ++ +    D+S
Sbjct: 2   ADDKGTDPEEGGSGWFLVEAEAECSDGEDELEDLFDTGSDIS 43


>gnl|CDD|197630 smart00288, VHS, Domain present in VPS-27, Hrs and STAM.  Unpublished
            observations. Domain of unknown function.
          Length = 133

 Score = 30.7 bits (70), Expect = 2.9
 Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 1/50 (2%)

Query: 1484 DSIKLDDDSERKVVKMLLRLLEDKNGEVQNLAVKCLGPLV-NKVKEFQVE 1532
            D I    D  +  V++L + L +KN  V  LA+  L   V N   +F +E
Sbjct: 26   DLINSTPDGPKDAVRLLKKRLNNKNPHVALLALTLLDACVKNCGSKFHLE 75


>gnl|CDD|219458 pfam07539, DRIM, Down-regulated in metastasis.  These eukaryotic
            proteins include DRIM (Down-Regulated In Metastasis),
            which is differentially expressed in metastatic and
            non-metastatic human breast carcinoma cells. It is
            believed to be involved in processing of non-coding RNA.
          Length = 140

 Score = 30.6 bits (70), Expect = 3.3
 Identities = 15/67 (22%), Positives = 31/67 (46%), Gaps = 9/67 (13%)

Query: 1495 KVVKMLLRLLEDKNGEVQNLAVKCL----GPLVNKVKEFQVETIVDALCSN-----MVSD 1545
            ++ ++LL LL  ++ EVQ LA+ C+     P +   ++     + D    +     ++S 
Sbjct: 17   ELYELLLTLLSSRDPEVQKLALDCILTYKNPAITPYRDNLKNLLDDKTFRDELTTFLLSS 76

Query: 1546 KEQLRDI 1552
             E + D 
Sbjct: 77   DEVIEDE 83


>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat.  The HEAT repeat family is related to
           armadillo/beta-catenin-like repeats (see pfam00514).
          Length = 31

 Score = 27.5 bits (62), Expect = 4.2
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 588 VVKMLLRLLEDKNGEVQNLAVKCLGPLFKAL 618
           ++ +LL LL D + EV+  A + LG L + L
Sbjct: 1   LLPLLLELLNDPDPEVREAAAEALGALAEVL 31



 Score = 27.5 bits (62), Expect = 5.6
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 1496 VVKMLLRLLEDKNGEVQNLAVKCLGPLVNKV 1526
            ++ +LL LL D + EV+  A + LG L   +
Sbjct: 1    LLPLLLELLNDPDPEVREAAAEALGALAEVL 31


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 29.7 bits (66), Expect = 4.3
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 225 GGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
           G E      D EE+D +++ EDD+E  D+DD
Sbjct: 8   GEEDSDSDSDEEEDDDDEDEEDDDEDDDEDD 38



 Score = 29.7 bits (66), Expect = 4.3
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 405 GGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
           G E      D EE+D +++ EDD+E  D+DD
Sbjct: 8   GEEDSDSDSDEEEDDDDEDEEDDDEDDDEDD 38


>gnl|CDD|237614 PRK14113, PRK14113, urease accessory protein UreE; Provisional.
          Length = 152

 Score = 30.3 bits (68), Expect = 5.1
 Identities = 28/129 (21%), Positives = 53/129 (41%), Gaps = 25/129 (19%)

Query: 1484 DSIKLDDDSERKVVKMLLRLLEDKNGEVQNLAVKCLGPLVNKVKEFQVETIVDALCSNMV 1543
            DSI LD D+ +K     ++ + D   E+        G  + +     V  I+   C  ++
Sbjct: 17   DSITLDHDTRKKG---RIKSVTDNGEEI--------GVFLERGHPLLVGEILKTECGKII 65

Query: 1544 SDKEQLRDISSIGLKTVIS---VVYCLGDRNSSSNMK-----------IDTLAFTHTLLT 1589
              K +  D+++   +  ++     Y LG+R++   +            ++ LA  + L  
Sbjct: 66   QVKGKAEDVATASAEDWLAFSKACYHLGNRHTKLQIGERWVRFKPDHVLEDLAENYGLSV 125

Query: 1590 THPPAVFHP 1598
             H PAVF P
Sbjct: 126  NHEPAVFEP 134


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 31.3 bits (71), Expect = 5.3
 Identities = 12/51 (23%), Positives = 22/51 (43%), Gaps = 6/51 (11%)

Query: 391 DVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEY------SDDDD 435
           D     E + E+    E   E    +EE+ + + ED ++Y      S D++
Sbjct: 168 DKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEE 218


>gnl|CDD|222974 PHA03082, PHA03082, DNA-dependent RNA polymerase subunit;
          Provisional.
          Length = 63

 Score = 28.2 bits (63), Expect = 5.9
 Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 9  RDISSIGLKTVISELPLGSNALA--ANVCRRITGKLSTAIEKQEDVSVQLEALDI 61
          RD+S    + +I +  L    L    N C R+  KLST IE Q +++VQ   LDI
Sbjct: 12 RDLSEERYRLLIKKKSL-KKVLRTVKNSCCRL--KLSTQIEPQRNLTVQ-PLLDI 62


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 15/55 (27%), Positives = 22/55 (40%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWKVRRAAAKCLEAII 272
           E+EEE    +   +  D +E   E E EDD  +   D    K     A+ L   +
Sbjct: 11  EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYL 65


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 31.0 bits (71), Expect = 6.9
 Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 36/151 (23%)

Query: 303 SDIFHAYMALLKGTRTFSP----------LTPSDPNQMDQEDSPLCLLSTQVPSLVKALQ 352
           SD       ++     +SP              D   +D+E+        +  ++V+AL 
Sbjct: 253 SDPVENARIIINELEKYSPKLAEKPRWLVFNKID--LLDEEE-----AEERAKAIVEALG 305

Query: 353 CQLREKSIR--TRQDCLALLRELVQTLPGCLAPSLPALIPDVTSVAE-----WDEEEGLG 405
            +     I   +      L  +L+  +        P    +     E     WD+     
Sbjct: 306 WEGPVYLISAASGLGVKELCWDLMTFIE-----ENPREEAEEAEAPEKVEFMWDDYHR-- 358

Query: 406 GEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 436
                E ++  EE+ +D+ +DD +  DD+ +
Sbjct: 359 -----EQLEEVEEEDDDDWDDDWDEDDDEGV 384


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 31.0 bits (70), Expect = 7.7
 Identities = 14/46 (30%), Positives = 20/46 (43%), Gaps = 4/46 (8%)

Query: 216 NYEDEEEGLGGEGGGEGMDTEEEDGED----EGEDDNEYSDDDDMS 257
           N ED E G     G +  + +E +  D    EGE D   + +D  S
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSES 288


>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
            modification].
          Length = 975

 Score = 31.1 bits (70), Expect = 8.1
 Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 5/163 (3%)

Query: 1129 PFTSDIYQCTLVRLKAADIDQEVKERAISCMGQIICNLGD-YLKSELPVCLPILLDRLRN 1187
            P  S I    L  L++   D  V+ RA   MG +   L       EL     IL + L  
Sbjct: 600  PHLSMIVSTILKLLRSKPPD--VRIRAADLMGSLAKVLKACGETKELAKLGNILYENLGE 657

Query: 1188 EITRL--TTVKALTKIAASPLRIDLRPILTDGIPILGSFLRKNQRALKLSTLLLLDTLLV 1245
            +   +  + +KA+  I +      ++P ++  +P L   LR   + +  +T+ L+ T+ +
Sbjct: 658  DYPEVLGSILKAICSIYSVHRFRSMQPPISGILPSLTPILRNKHQKVVANTIALVGTICM 717

Query: 1246 NYPSAISTEMLNKVTVELAPLVSESDLHIAQLTLTLLTSIAKI 1288
            N P  I      ++  EL   +   +  I +        I++ 
Sbjct: 718  NSPEYIGVREWMRICFELVDSLKSWNKEIRRNATETFGCISRA 760


>gnl|CDD|215718 pfam00103, Hormone_1, Somatotropin hormone family. 
          Length = 198

 Score = 30.0 bits (68), Expect = 8.1
 Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 9/45 (20%)

Query: 1460 DQMTSSDKDFRFMATNDLMSELQKDSIKLDDDSERKVVKMLLRLL 1504
              + SSD+D R  A  +L S L++DS K+D           L+LL
Sbjct: 153  GSLKSSDEDRRLFAFYNLFSCLRRDSHKVD---------TYLKLL 188


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 30.7 bits (69), Expect = 8.3
 Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 4/44 (9%)

Query: 218 EDEEEGLGGEGGGEGMDTEE----EDGEDEGEDDNEYSDDDDMS 257
           E EE+G G   G EG   +E    +   + GE++   SD DD+ 
Sbjct: 224 EQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLP 267


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 30.9 bits (70), Expect = 8.4
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 2/48 (4%)

Query: 212 DPNYNYEDEEEGL--GGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 257
           +   N E++EE L       G   D +E   EDE ED+ +  + +  +
Sbjct: 6   NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPA 53


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 29.2 bits (66), Expect = 9.4
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 230 GEGMDTEEEDGEDEGEDDNEYSDDDD 255
               D  ++D EDE E+D+E  DD+D
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDED 133



 Score = 29.2 bits (66), Expect = 9.4
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 410 GEGMDTEEEDGEDEGEDDNEYSDDDD 435
               D  ++D EDE E+D+E  DD+D
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDED 133


>gnl|CDD|200601 cd10979, CE4_PuuE_like, Putative catalytic domain of
           uncharacterized prokaryotic polysaccharide deacetylases
           similar to bacterial PuuE allantoinases.  The family
           includes a group of uncharacterized prokaryotic
           polysaccharide deacetylases (DCAs) that show high
           sequence similarity to the catalytic domain of bacterial
           PuuE (purine utilization E) allantoinases. PuuE
           allantoinase specifically catalyzes the hydrolysis of
           (S)-allantoin into allantoic acid. It functions as a
           homotetramer. Its monomer is composed of a 7-stranded
           barrel with detectable sequence similarity to the
           6-stranded barrel NodB homology domain of DCA-like
           proteins in the CE4 superfamily, which removes N-linked
           or O-linked acetyl groups from cell wall
           polysaccharides. PuuE allantoinase appears to be
           metal-independent and acts on a small substrate
           molecule, which is distinct from the common feature of
           DCAs which are normally metal ion dependent and
           recognize multimeric substrates.
          Length = 281

 Score = 30.3 bits (69), Expect = 9.4
 Identities = 8/44 (18%), Positives = 12/44 (27%)

Query: 143 AAICRQAGHRFAEHIERVMPLILKYSREEDDELREFCLQACEAF 186
               R+ G  F  H      L       ++ E+    L   E  
Sbjct: 95  IEAIRERGWEFIAHGISNSTLHAGLDEAQEREVIAESLDRIEKA 138



 Score = 30.3 bits (69), Expect = 9.4
 Identities = 8/44 (18%), Positives = 12/44 (27%)

Query: 881 AAICRQAGHRFAEHIERVMPLILKYSREEDDELREFCLQACEAF 924
               R+ G  F  H      L       ++ E+    L   E  
Sbjct: 95  IEAIRERGWEFIAHGISNSTLHAGLDEAQEREVIAESLDRIEKA 138


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 29.2 bits (66), Expect = 9.7
 Identities = 20/66 (30%), Positives = 27/66 (40%), Gaps = 19/66 (28%)

Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNE-----------YS----DDD----DMSW 258
           ++EEE        +    EEEDGED   D+ E           Y     DD+    ++S 
Sbjct: 5   DEEEELEDDIDDLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDYEELSA 64

Query: 259 KVRRAA 264
             RRAA
Sbjct: 65  AARRAA 70


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.5 bits (70), Expect = 9.8
 Identities = 7/56 (12%), Positives = 20/56 (35%)

Query: 216 NYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWKVRRAAAKCLEAI 271
            + D          G  ++  ++D ++E E+D           ++     +  +A+
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKAL 227


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.133    0.376 

Gapped
Lambda     K      H
   0.267   0.0737    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 91,384,114
Number of extensions: 9236771
Number of successful extensions: 11260
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10835
Number of HSP's successfully gapped: 136
Length of query: 1827
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1716
Effective length of database: 6,014,308
Effective search space: 10320552528
Effective search space used: 10320552528
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 66 (29.2 bits)