RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17955
         (547 letters)



>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 43.2 bits (101), Expect = 3e-04
 Identities = 57/333 (17%), Positives = 130/333 (39%), Gaps = 8/333 (2%)

Query: 63   EKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQMSGYKRMRREHQAA 122
             KK   E   + +  +K  EA KK E     +  K++  + +  E+        ++   A
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475

Query: 123  LLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLT 182
              K EE  K +  + K+   K+     ++ +   +K      +E  KKA      E+   
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535

Query: 183  KEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQ--DERRRAAGPSESIASSLTQGATE 240
             +   + +  +K  E  K +E K ++E+ K E ++  +E +  A      A    +   E
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595

Query: 241  ASRGRPDEARLCAD---RKIIRQRAGKELNKRQQQLEQAHSMLLRHH--EKTQELEYRQQ 295
                  +E +       +K    +   E  K+ ++ ++    L +    EK +  E ++ 
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655

Query: 296  KSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVEC 355
            +  + ++  +  ++ + +    E+     E E K   AL+ + +       L + +  E 
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715

Query: 356  HHLKE-RLHYELEILMAYQSKNKMQAEAQQLED 387
               +E +   E   + A ++K + + + ++ E+
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748



 Score = 38.2 bits (88), Expect = 0.012
 Identities = 76/440 (17%), Positives = 173/440 (39%), Gaps = 32/440 (7%)

Query: 63   EKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQMSGYKRM--RREHQ 120
            E++   E I + +  R    A ++   KA + R   EL + E  ++    K+   +++  
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305

Query: 121  AALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKK 180
             A  K EE  K +  + K+   K+     ++ + E +K      +E E  A    + E+K
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365

Query: 181  LTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERR----RAAGPSESIASSLTQ 236
                   +++  +K   A KK E K   +  K++  +D+++    + A  ++  A    +
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKK 1425

Query: 237  GATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQK 296
             A E  +   DEA+  A+      +   E  K+ ++ ++A     +  E  +  E +  K
Sbjct: 1426 KAEEKKKA--DEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK--K 1477

Query: 297  SVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECH 356
                 ++    ++   E   + D   ++ +  KK    +  ++ K    + DE+++ E  
Sbjct: 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK----KADEAKKAEEA 1533

Query: 357  HLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIR 416
               +      E   A + K     +A++L+     ++A  E K   E +     + E  +
Sbjct: 1534 KKADEAKKAEEKKKADELK-----KAEELKKAEEKKKA-EEAKKAEEDKNMALRKAEEAK 1587

Query: 417  LLHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFL 476
               E +  E+    +E  +M         + +++   E ++        ++   + +Q  
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMK--------AEEAKKAEEAKIKAEELKKAEEEKKKVEQLK 1639

Query: 477  GERQERIRLLHERQERELES 496
             +  E  +   E ++ E E+
Sbjct: 1640 KKEAEEKKKAEELKKAEEEN 1659



 Score = 34.3 bits (78), Expect = 0.18
 Identities = 64/316 (20%), Positives = 123/316 (38%), Gaps = 15/316 (4%)

Query: 50   QKEHMQAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQM 109
             KE  +A   T       +E  A++    K  EA K +E K   E   R+  +    E  
Sbjct: 1083 AKEDNRADEATEEAFGKAEE--AKKTETGKAEEARKAEEAKKKAED-ARKAEEARKAEDA 1139

Query: 110  SGYKRMRREHQAALL----KLEEKCKIEMDQHKSLLDKEYETLLQQFSR---ELEKLTLK 162
               +  R+   A  +    K E+  K E  +      K       +  R   EL K    
Sbjct: 1140 RKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDA 1199

Query: 163  HTSELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKE--YKASKERWKRELSQDER 220
              +E  +KA+     E+    E   + +  +K  EA K  E   KA +ER   E+ + E 
Sbjct: 1200 RKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259

Query: 221  RRAAGPSESIASSLTQGATEASRGRPDEARLCAD--RKIIRQRAGKELNKRQQQLEQAHS 278
             R A  +   A+   + A +A   +  E +  AD  +K   ++   E  K+ ++ ++A  
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319

Query: 279  MLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQ 338
               +  E  ++ +  ++K+  + +  +  +  + E    E      + E  +K   E K+
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKA-EAEAAADEAEAAEEKAEAAEKKKEEAKK 1378

Query: 339  QPKSLKIRLDESQEVE 354
            +  + K + +E ++ +
Sbjct: 1379 KADAAKKKAEEKKKAD 1394


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 42.6 bits (100), Expect = 6e-04
 Identities = 62/396 (15%), Positives = 136/396 (34%), Gaps = 35/396 (8%)

Query: 49  QQKEHMQAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKK----EYKASKERWKRELSQEE 104
           Q  +     +     +   +E  A     RK  E  KK     E  A       EL  +E
Sbjct: 141 QGGKVEIIAMMKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQE 200

Query: 105 MHEQMSGYKRMRREHQAALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHT 164
           +  +    K +        L+LEE+  + +D  K    +E   LLQ+  R+ ++      
Sbjct: 201 LKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLN--EERIDLLQELLRDEQEEIESSK 258

Query: 165 SELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAA 224
            ELEK+ ++     K+  +E   ++  + ++   AK++E   S+         D+  +  
Sbjct: 259 QELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLK 318

Query: 225 GPSESIASSLTQGATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHH 284
              + +                             ++  KEL K ++++E+    L    
Sbjct: 319 ESEKEL-----------------------------KKLEKELKKEKEEIEELEKELKELE 349

Query: 285 EKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLK 344
            K +  E  +++      + +   +        E     S  +LK++      ++ K  K
Sbjct: 350 IKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAK 409

Query: 345 IRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMET 404
           + L+ S++ E    +E+      +    +S    Q +  + ++ +  +   L +      
Sbjct: 410 LLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELK 469

Query: 405 QQFLGERQERIRLLHERQERELESFDQESNRMGFRY 440
           +     ++ ++  L E+ E  L     E        
Sbjct: 470 KSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESK 505


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 37.4 bits (87), Expect = 0.024
 Identities = 55/343 (16%), Positives = 131/343 (38%), Gaps = 25/343 (7%)

Query: 162 KHTSELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERR 221
           +   ELE++     +  +KL +E+ + ++  R + +  ++   +  +   + E  + E  
Sbjct: 667 RELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELA 726

Query: 222 RAAGPSESIASSLTQGATEASRGRPDEARLCADRKIIRQRAG---KELNKRQQQLEQAHS 278
                 E + S L +   E      +   L    + + +      + L K ++++E+   
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEE 786

Query: 279 MLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALE--- 335
                 E+ +ELE  ++      R D + R+ ++    +E      E+  ++   LE   
Sbjct: 787 KRQALQEELEELE--EELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844

Query: 336 --LKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRR 393
             L+++ + L+  L+E +E       E+   E E+    + K +++ E ++LE       
Sbjct: 845 DELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES----EL 900

Query: 394 ALLEQKMDMETQQFLGERQERIRLLHERQERELESFDQESNRMGFRYRQILVSSQSRVEL 453
           A L++++         E+        E +   LE    E         +  + ++  +E 
Sbjct: 901 AELKEEI---------EKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETE--LER 949

Query: 454 EDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELES 496
           E             +++   +   E +ER   L  ++E   E+
Sbjct: 950 EIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEA 992



 Score = 29.7 bits (67), Expect = 5.0
 Identities = 48/252 (19%), Positives = 105/252 (41%), Gaps = 20/252 (7%)

Query: 264 KELNKRQQQLEQAHSMLLRHHEKTQEL----------EYRQQKSVHSLREDQVTR----- 308
           +EL K+ ++LE+      R+ E   EL          + ++ +      E++++R     
Sbjct: 196 EELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEEL 255

Query: 309 -QHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLD-ESQEVECHHLKERL-HYE 365
            + Q EL   E  +   + EL++      + Q + L+++ + E  E E   L+ERL   E
Sbjct: 256 EELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELE 315

Query: 366 LEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQ--QFLGERQERIRLLHERQE 423
            E+    +   +++ + + L++ +  R  LLE+   +  +  +   E +E++  L E  E
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375

Query: 424 RELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFLGERQERI 483
              E+  +E   +     +I    +      + +  R   L ++++   ++      E  
Sbjct: 376 ELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELE 435

Query: 484 RLLHERQERELE 495
            L  E +E   E
Sbjct: 436 ELQTELEELNEE 447


>gnl|CDD|219320 pfam07172, GRP, Glycine rich protein family.  This family of
          proteins includes several glycine rich proteins as well
          as two nodulins 16 and 24. The family also contains
          proteins that are induced in response to various
          stresses.
          Length = 91

 Score = 33.2 bits (76), Expect = 0.061
 Identities = 10/28 (35%), Positives = 11/28 (39%)

Query: 9  RHQGAHHSGAGGGSNNVGGGGGGGAPNN 36
             G +  G GGG    G  GGGG    
Sbjct: 47 GGGGGYGGGGGGGYGGGGYYGGGGGYGG 74



 Score = 29.7 bits (67), Expect = 0.86
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 9  RHQGAHHSGAGGGSNNVGGGGGGG 32
           + G  + G GGG    GGG  GG
Sbjct: 59 GYGGGGYYGGGGGYGGGGGGYPGG 82


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.8 bits (83), Expect = 0.069
 Identities = 42/299 (14%), Positives = 103/299 (34%), Gaps = 19/299 (6%)

Query: 142 DKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKK 201
            +E E L ++     EK+       L +  K     E++L +     ++  R++      
Sbjct: 676 RREIEELEEKIEELEEKIAELEK-ALAELRKELEELEEELEQLRKELEELSRQISAL--- 731

Query: 202 KEYKASKERWKRELSQDERRRAAGPSESIASSLTQGATEASRGRPDEARLCADRKIIRQR 261
                ++   + E  ++   + +     + + + +          + A   A+ +     
Sbjct: 732 -RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE----- 785

Query: 262 AGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYM 321
              EL  + +QL++    L    ++ +       +   +LRE   + + +   T +    
Sbjct: 786 ---ELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842

Query: 322 HRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAE 381
              + E   +    L  + + L+  ++E +      L ER   E  + +      ++  E
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902

Query: 382 AQQLEDRVSVRRALLEQ------KMDMETQQFLGERQERIRLLHERQERELESFDQESN 434
            ++LE + S  R  LE+      ++++  +            L E     LE  +   N
Sbjct: 903 LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALEN 961


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 35.1 bits (82), Expect = 0.095
 Identities = 27/106 (25%), Positives = 50/106 (47%), Gaps = 14/106 (13%)

Query: 326 QELKKKHALELKQQPKSLKIRLD---ESQEVECHHLKERLHYELEILMAYQSKNKMQAEA 382
           + +KK+  LE K++   L+   +     +  E   L++RL  + E L      ++     
Sbjct: 52  EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENL------DRKLELL 105

Query: 383 QQLEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELES 428
           ++ E+ +  +   LEQK     QQ L +++E +  L E Q +ELE 
Sbjct: 106 EKREEELEKKEKELEQK-----QQELEKKEEELEELIEEQLQELER 146



 Score = 32.4 bits (75), Expect = 0.62
 Identities = 37/181 (20%), Positives = 71/181 (39%), Gaps = 31/181 (17%)

Query: 255 RKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLRE--DQVTRQHQT 312
           +KI   +  +   + ++ LE+A           +E     ++ +H LR   ++  R+ + 
Sbjct: 26  KKIAEAKIKEAEEEAKRILEEAK---KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82

Query: 313 ELTNQEDYMHRSEQEL--------KKKHALELKQQPKSLKIRLDESQEVECHHLKERLHY 364
           EL   E  + + E+ L        K++  LE K++    K +  E +E E   L E    
Sbjct: 83  ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142

Query: 365 ELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQER 424
           ELE +           EA          + +L +K++ E +    E    I+ + E  + 
Sbjct: 143 ELERISGLT-----AEEA----------KEILLEKVEEEARH---EAAVLIKEIEEEAKE 184

Query: 425 E 425
           E
Sbjct: 185 E 185



 Score = 30.1 bits (69), Expect = 3.2
 Identities = 26/124 (20%), Positives = 55/124 (44%), Gaps = 13/124 (10%)

Query: 64  KKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQMSGYKRMRREHQAAL 123
           +KL K ++ +++   +  E  +K+E +  K+  + E  Q+E+ ++    + +  E     
Sbjct: 85  QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ-- 142

Query: 124 LKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTK 183
            +LE    +  ++ K +       LL++   E          E+E++AK       K  K
Sbjct: 143 -ELERISGLTAEEAKEI-------LLEKVEEEARHEAAVLIKEIEEEAKEEA---DKKAK 191

Query: 184 EIIA 187
           EI+A
Sbjct: 192 EILA 195


>gnl|CDD|204270 pfam09579, Spore_YtfJ, Sporulation protein YtfJ (Spore_YtfJ).
          Proteins in this family are encoded by bacterial
          genomes if, and only if, the species is capable of
          endospore formation. YtfJ was confirmed in spores of B.
          subtilis; it appears to be expressed in the forespore
          under control of SigF.
          Length = 81

 Score = 32.1 bits (74), Expect = 0.10
 Identities = 11/24 (45%), Positives = 11/24 (45%), Gaps = 7/24 (29%)

Query: 17 GAGGGS-------NNVGGGGGGGA 33
          GAGGG           GGGGG G 
Sbjct: 19 GAGGGEGKDKKGKTGFGGGGGAGV 42


>gnl|CDD|185612 PTZ00430, PTZ00430, glucose-6-phosphate isomerase; Provisional.
          Length = 552

 Score = 34.2 bits (79), Expect = 0.17
 Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 10/90 (11%)

Query: 99  ELSQEEMHEQMSGYKR-MRREHQAALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRE-L 156
           +L   + ++ +      +++ H   LLK EE+ K         L KE++ +    SR+ L
Sbjct: 2   DLESLKSYKNLLSLAEKLKKVHLRDLLKDEERNK--------SLIKEFKGVTLDLSRQRL 53

Query: 157 EKLTLKHTSELEKKAKLNTSCEKKLTKEII 186
           ++ TLK   EL ++AKL    +     E I
Sbjct: 54  DEETLKLLIELAEEAKLKEKIKDMFNGEKI 83


>gnl|CDD|217958 pfam04197, Birna_RdRp, Birnavirus RNA dependent RNA polymerase
           (VP1).  Birnaviruses are dsRNA viruses. This family
           corresponds to the RNA dependent RNA polymerase. This
           protein is also known as VP1. All of the birnavirus VP1
           proteins contain conserved RdRp motifs that reside in
           the catalytic "palm" domain of all classes of
           polymerases. However, the birnavirus RdRps lack the
           highly conserved Gly-Asp-Asp (GDD) sequence, a component
           of the proposed catalytic site of this enzyme family
           that exists in the conserved motif VI of the palm domain
           of other RdRps.
          Length = 855

 Score = 33.7 bits (77), Expect = 0.31
 Identities = 37/158 (23%), Positives = 58/158 (36%), Gaps = 20/158 (12%)

Query: 149 LQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTKEIIARQ---DCDRKVFEAAKKKEYK 205
           + +F  +LE L+LK     +  A LNT    K     + +       +KV  A K   YK
Sbjct: 667 MSEFGEDLEGLSLKEPVTNQTLADLNTPLPSK--DPKVRKPTPRSGLKKVSNALKSGTYK 724

Query: 206 ASKERWKRELSQDERRRAAGPSESIASSLTQGATEASRGRPDEAR--LCADR----KIIR 259
             K    R+L+   + R     ++ A      A  A++ +P  +     A R     +  
Sbjct: 725 NPKSAVGRKLTARAKSRINDDPQAEAL-----ADLATKLKPASSTVDNWASRTEELSVQL 779

Query: 260 QRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKS 297
           Q   K     + QLE+    L    E  Q  +    KS
Sbjct: 780 QLLLKANTLAKSQLEETREAL----EVVQSDKIITGKS 813


>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763).  This
          eukaryotic family of proteins has no known function.
          Length = 91

 Score = 30.8 bits (70), Expect = 0.37
 Identities = 9/20 (45%), Positives = 11/20 (55%)

Query: 17 GAGGGSNNVGGGGGGGAPNN 36
          G  GG   +G  GGGG P+ 
Sbjct: 66 GPPGGGRRMGRIGGGGGPSR 85


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 32.9 bits (76), Expect = 0.43
 Identities = 19/124 (15%), Positives = 55/124 (44%), Gaps = 10/124 (8%)

Query: 249 ARLCADRKIIRQRAGKELNKRQQQLEQAHSML------LRHHEKTQELEYRQQKSVHSLR 302
            RL   ++ + +   + L +++Q+L+Q    L          ++ Q L+  QQ+   +L 
Sbjct: 267 QRLQQLQQRLARAMRRRLEQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQQRLQRALE 326

Query: 303 EDQVTRQHQTELTNQEDYMHRSEQELK--KKHALELKQQ-PKSLKIRLDESQEVECHHLK 359
                 + + E  +Q       ++ ++  ++   +L+Q+  ++++ +L   ++     L 
Sbjct: 327 RRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQ-RLEALA 385

Query: 360 ERLH 363
           ++L 
Sbjct: 386 QQLE 389



 Score = 29.0 bits (66), Expect = 6.4
 Identities = 26/139 (18%), Positives = 54/139 (38%), Gaps = 14/139 (10%)

Query: 285 EKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPK--- 341
           E  Q L+  QQ+   ++R     ++ + +         R    LK +    L  Q +   
Sbjct: 264 ELLQRLQQLQQRLARAMRRRLEQKRQRLD-----QLARR----LKFQSPERLLAQQQQRL 314

Query: 342 -SLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKM 400
             L+ RL  + E      K+RL    + L     + +++   Q+LE      R  + +++
Sbjct: 315 DRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQL 374

Query: 401 DMETQQFLGERQERIRLLH 419
             + Q+ L    +++  L 
Sbjct: 375 KRKRQR-LEALAQQLEALS 392


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 32.7 bits (75), Expect = 0.49
 Identities = 25/121 (20%), Positives = 51/121 (42%), Gaps = 7/121 (5%)

Query: 299 HSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHL 358
            +  E     +   EL  +E+     E+ LK+K    L +  + L  RL+  +      L
Sbjct: 167 SAKEELDQLSKKLAELKAEEE--EELERALKEKREELLSKLEEELLARLESKEAALEKQL 224

Query: 359 K---ERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGER--QE 413
           +   ER   EL      + + +++ +A+  E ++    AL   ++  E  + + E+  +E
Sbjct: 225 RLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEE 284

Query: 414 R 414
           R
Sbjct: 285 R 285


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 32.8 bits (75), Expect = 0.50
 Identities = 79/425 (18%), Positives = 164/425 (38%), Gaps = 58/425 (13%)

Query: 85  KKKEYKASKERWKRELSQEEMHEQMSGYKRMRREHQAALLKLEEKCKIEMDQHKSLLDKE 144
            ++  +  +         EE+ E++   +    + +  L KLE + + E+ + K+ L K 
Sbjct: 300 IEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELE-ELAEEKNELAKL 358

Query: 145 YETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTK--EIIAR-QDCDRKVFEAAKK 201
            E  L++    LE+L  +    LE+  +L  + ++   +  E+ A  ++   ++ E  K+
Sbjct: 359 LEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKE 418

Query: 202 KEYKASKERWKRELSQDERRRAAGPSESIASSLTQGATEASRGRPDEARLCADRKIIRQR 261
            E        +REL + E        +       +       G  ++  +C         
Sbjct: 419 LEEL------ERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCG-------- 464

Query: 262 AGKELNKRQQQLEQAHSMLLRHHEKTQ--ELEYRQQKSVHSLREDQVTRQHQTELTNQED 319
                    Q+L + H   L    + +  ELE    +        +   + + EL   E+
Sbjct: 465 ---------QELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEE 515

Query: 320 YMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQ 379
            +    +  +     EL+++ + L+  L+E +      LKE+L              +++
Sbjct: 516 ELIELLELEEALKE-ELEEKLEKLENLLEELE-----ELKEKLQ-----------LQQLK 558

Query: 380 AEAQQLEDRVSVRRALLEQKMDMET-QQFLGERQERIRLLHERQERELESFDQESNRMGF 438
            E +QLEDR+   + LLE+   + T ++ L E +ER++ L ++ +   E   Q    +  
Sbjct: 559 EELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELL-- 616

Query: 439 RYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELESFD 498
                    QS    E    +  A  E + ++E      E +E ++   E  E ++E  +
Sbjct: 617 ---------QSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667

Query: 499 QESNR 503
            E  R
Sbjct: 668 AEIRR 672



 Score = 31.3 bits (71), Expect = 1.5
 Identities = 44/246 (17%), Positives = 90/246 (36%), Gaps = 9/246 (3%)

Query: 113 KRMRREHQAALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAK 172
           + + RE + A L+ E +   E+++    L++E   LL+      E+L  K         +
Sbjct: 488 EELSREKEEAELREEIE---ELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEE 544

Query: 173 LNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAAGPS---ES 229
           L    EK   +++           +  K+   +    R ++E  ++ R R        + 
Sbjct: 545 LEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE 604

Query: 230 IASSLTQGATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQE 289
           +   L+Q           EA    +       +  E    Q +LE+     L   E+  E
Sbjct: 605 LEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVE 664

Query: 290 --LEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQ-ELKKKHALELKQQPKSLKIR 346
                 +++      E+Q+  + +     +E+     E+ E   K   E++Q  + L+ R
Sbjct: 665 ELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESR 724

Query: 347 LDESQE 352
             E +E
Sbjct: 725 KAELEE 730


>gnl|CDD|233909 TIGR02520, pilus_B_mal_scr, type IVB pilus formation outer membrane
           protein, R64 PilN family.  Several related protein
           families encode outer membrane pore proteins for type II
           secretion, type III secretion, and type IV pilus
           formation. This protein family appears to encode a
           secretin for pilus formation, although it is quite
           different from PilQ. Members include the PilN
           lipoprotein of the plasmid R64 thin pilus, a type IV
           pilus. Scoring between the trusted and noise cutoffs are
           examples of bundle-forming pilus B (bfpB) [Cell
           envelope, Surface structures, Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 497

 Score = 32.6 bits (74), Expect = 0.56
 Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 16  SGAGGGSNNVGGGGGGGAPNNFATIRTTSIVTKQQKEHMQAKLNTS 61
           S AG GS++ GG G  G+  + A    +S+    Q+  +++ L++S
Sbjct: 178 STAGSGSSSSGGSGNSGSTQSTAVKLESSVHNDIQQS-IKSMLSSS 222


>gnl|CDD|221784 pfam12810, Gly_rich, Glycine rich protein.  This family of proteins
           is greatly expanded in Trichomonas vaginalis. The
           proteins are composed of several glycine rich motifs
           interspersed through the sequence. Although many
           proteins have been annotated by similarity in the family
           these annotations given the biased composition of the
           sequences these are unlikely to be functionally
           relevant.
          Length = 248

 Score = 31.8 bits (73), Expect = 0.74
 Identities = 13/38 (34%), Positives = 14/38 (36%), Gaps = 12/38 (31%)

Query: 7   SSRHQGAHHSGA------------GGGSNNVGGGGGGG 32
           SS   G    G             GGG  +  GGGGGG
Sbjct: 156 SSGGNGGFGYGGSGNGGGGGGGYFGGGGGHYAGGGGGG 193


>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
          Length = 1634

 Score = 32.2 bits (73), Expect = 0.88
 Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 376  NKMQAEAQQLEDRVSVRRALLE-QKMDMETQQFLGERQERIRLLHERQERELE 427
             ++    Q LE    V+R L E Q   +E Q+ + ERQ R+  L  +    LE
Sbjct: 1578 KEVDQLQQSLERLAEVQRELAEGQVKVIEGQKQMAERQSRLSQLENKILGVLE 1630


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 31.9 bits (72), Expect = 1.1
 Identities = 41/253 (16%), Positives = 90/253 (35%), Gaps = 30/253 (11%)

Query: 246 PDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLREDQ 305
           P       +RK + ++  K L +  QQ +Q+H+ L +  E  +E   +QQ         +
Sbjct: 211 PCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIE 270

Query: 306 VTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYE 365
             R  +  L   ++                + +  K+  +        +     +R+H E
Sbjct: 271 ELRAQEAVLEETQE---------------RINRARKAAPLAAHIKAVTQIEQQAQRIHTE 315

Query: 366 LEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERE 425
           L+  M  ++K  M+            R A ++Q+  +E Q+ L +      +        
Sbjct: 316 LQSKMRSRAKLLMK------------RAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEV 363

Query: 426 LESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQ---QFLGERQER 482
             S  + S +     + I    Q +  L  ++      L+     +     +    R  +
Sbjct: 364 ATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQ 423

Query: 483 IRLLHERQERELE 495
            +L H ++++EL+
Sbjct: 424 GQLAHAKKQQELQ 436


>gnl|CDD|130587 TIGR01524, ATPase-IIIB_Mg, magnesium-translocating P-type ATPase.
           This model describes the magnesium translocating P-type
           ATPase found in a limited number of bacterial species
           and best described in Salmonella typhimurium, which
           contains two isoforms. These transporters are active in
           low external Mg2+ concentrations and pump the ion into
           the cytoplasm. The magnesium ATPases have been
           classified as type IIIB by a phylogenetic analysis
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 867

 Score = 31.4 bits (71), Expect = 1.4
 Identities = 23/102 (22%), Positives = 46/102 (45%), Gaps = 11/102 (10%)

Query: 415 IRLLHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRAL----LEQKMDM 470
           +  L E   R+  S  ++ + + F + +  +S    V +E+R  V R +    +E+ + +
Sbjct: 391 LAKLDESAARQTASRWKKVDEIPFDFDRRRLS----VVVENRAEVTRLICKGAVEEMLTV 446

Query: 471 ETQQFLGERQERIRLLHERQERELESFDQESNRMGFSALAIA 512
            T +        +  L E ++ EL+    E NR G   +A+A
Sbjct: 447 CTHK---RFGGAVVTLSESEKSELQDMTAEMNRQGIRVIAVA 485


>gnl|CDD|236474 PRK09350, PRK09350, poxB regulator PoxA; Provisional.
          Length = 306

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 312 TELTNQEDYMHRSEQELKKKHALELKQQP 340
            ELT+  +   R EQ+ +K+ A  L QQP
Sbjct: 239 HELTDAREQRQRFEQDNRKRAARGLPQQP 267


>gnl|CDD|227244 COG4907, COG4907, Predicted membrane protein [Function unknown].
          Length = 595

 Score = 31.1 bits (70), Expect = 2.0
 Identities = 11/26 (42%), Positives = 12/26 (46%)

Query: 7   SSRHQGAHHSGAGGGSNNVGGGGGGG 32
           SSR   +   G G      GGGGGG 
Sbjct: 569 SSRRSSSSGGGGGFSGGGSGGGGGGA 594


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
          short domain found in bacterial type II/III secretory
          system proteins. The architecture of these proteins
          suggest that this family may be functionally analogous
          to pfam03958.
          Length = 95

 Score = 28.8 bits (65), Expect = 2.0
 Identities = 7/40 (17%), Positives = 12/40 (30%)

Query: 5  RDSSRHQGAHHSGAGGGSNNVGGGGGGGAPNNFATIRTTS 44
            SS    +  S +   +         G  ++   I T S
Sbjct: 25 SVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRITTES 64


>gnl|CDD|131941 TIGR02895, spore_sigI, RNA polymerase sigma-I factor.  Members of
           this sigma factor protein family are strictly limited to
           endospore-forming species in the Firmicutes lineage of
           bacteria, but are not universally present among such
           species. Sigma-I was shown to be induced by heat shock
           (PMID:11157964) in Bacillus subtilis and is suggested by
           its phylogenetic profile to be connected to the program
           of sporulation (PMID:16311624) [Transcription,
           Transcription factors, Cellular processes, Sporulation
           and germination].
          Length = 218

 Score = 29.7 bits (67), Expect = 2.9
 Identities = 23/90 (25%), Positives = 39/90 (43%), Gaps = 9/90 (10%)

Query: 317 QEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHY--------ELEI 368
            E+Y +  E E ++   LE K+  K   I   E  +V   H   R           E E 
Sbjct: 107 MEEYRNEIENENRRLEILEYKKLLKQFGIEFVELVKVSPKHRDTRKKAIKIAKVIVENEE 166

Query: 369 LMAYQSKNKMQAEAQQLEDRVSVRRALLEQ 398
           L+ Y  + K +   +++E+RV + R  +E+
Sbjct: 167 LLEYLIRKK-KLPIKEIEERVRISRKTIER 195


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 30.1 bits (67), Expect = 3.2
 Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 14/83 (16%)

Query: 346 RLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQ 405
              E  E E   LKERL   +  L  Y+ +   Q +  Q              K+  E Q
Sbjct: 363 TQAEKYEQEIARLKERLRVSVRRLEEYERRLLGQEQQMQ--------------KLLQEYQ 408

Query: 406 QFLGERQERIRLLHERQERELES 428
             L + +ER+R   E ++ +++S
Sbjct: 409 ARLEDSEERLRRQQEEKDSQMKS 431



 Score = 29.3 bits (65), Expect = 6.9
 Identities = 29/113 (25%), Positives = 50/113 (44%), Gaps = 11/113 (9%)

Query: 410 ERQERIRLLHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMD 469
           E  E     +  +  + E ++QE  R+  R R   VS +   E E R+  +   + QK+ 
Sbjct: 349 EDIEYETEENREEGTQAEKYEQEIARLKERLR---VSVRRLEEYERRLLGQEQQM-QKLL 404

Query: 470 METQQFLGERQERIRLLHERQERELESFDQESNRMGFSALAIAEASKESYPDM 522
            E Q  L + +ER+R       R+ E  D +   +    +A+ E  K+ + DM
Sbjct: 405 QEYQARLEDSEERLR-------RQQEEKDSQMKSIISRLMAVEEELKKDHADM 450


>gnl|CDD|235903 PRK06991, PRK06991, ferredoxin; Provisional.
          Length = 270

 Score = 29.8 bits (67), Expect = 3.4
 Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 17/85 (20%)

Query: 198 AAKKKEYKASKERWKRELSQDERR---RAAGPSESIASSLTQGATEASRGRPDEARLCAD 254
            A +  + A + R +RE    E R   RAA  + + A+  +  A  A+     EA+    
Sbjct: 153 DAARARHDARQARLRREREAAEARAAARAAASAAAAAAEASAAAAPAADD--AEAK---K 207

Query: 255 RKIIR---QRAGKELNKRQQQLEQA 276
           R II    +RA      R+++ E A
Sbjct: 208 RAIIAAALERA------RKKKEELA 226


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 30.3 bits (68), Expect = 3.5
 Identities = 54/321 (16%), Positives = 131/321 (40%), Gaps = 24/321 (7%)

Query: 66  LTKEIIARQDCDRKVFEAAKKKEYKASK-----ERWKRELSQEEMHEQ-MSGYKRMRREH 119
           +  +I   +  +  ++E  ++    + K     E+W+   S    +E  ++  K+  +E 
Sbjct: 269 INTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEW 328

Query: 120 QAALLKLEEKCKIEMDQHKSLLDKE----YETLLQQFSRELEKLTLKHTSELEKKAKLNT 175
              L KL+ + +++ ++ K+L         +   Q  S E  +L  +   +L ++     
Sbjct: 329 PGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKIN 388

Query: 176 SCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAAGPSE---SIAS 232
               KLTK + +R+   + +F++ +K   +          S+ +       S    +I  
Sbjct: 389 IQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQ 448

Query: 233 SLTQGATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEY 292
              +G+      +     L      I++R   E NK     E   ++    +E TQ LE 
Sbjct: 449 LFPKGSGINESIKKSILEL---NDEIQERIKTEENKSITLEEDIKNLKHDINELTQILEK 505

Query: 293 RQQKSVHSLREDQVTRQHQTELTNQEDY-MHRSEQELKKKH------ALELKQQPKSLKI 345
            + +   +  + +++++        +   + + E+EL   +       L+ +Q  +S +I
Sbjct: 506 LELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLNLLSKTSILDAEQLVQSTEI 565

Query: 346 RLDESQEVECHHLKERLHYEL 366
           +LDE + V+ +  + ++H ++
Sbjct: 566 KLDELK-VDLNRKRYKIHKQV 585


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 30.2 bits (68), Expect = 3.6
 Identities = 33/130 (25%), Positives = 63/130 (48%), Gaps = 5/130 (3%)

Query: 300 SLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDE--SQEVECHH 357
           S  E     Q   E  N+ +    ++  L  +  LEL+QQ + L+  L E  S+  +   
Sbjct: 478 SENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSL 537

Query: 358 LKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIRL 417
           LK +L   LE L   ++  ++Q + +Q+E+    +   L +K+  E +  L ++ E +R 
Sbjct: 538 LKSKLEEHLEQLH--EANEELQKKREQIEELEPDQDQNLSRKIA-ELEAALQKKDEDMRA 594

Query: 418 LHERQERELE 427
           + ER ++ +E
Sbjct: 595 MEERYKKYVE 604


>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
           family 4 of glycosyl hydrolases [Carbohydrate transport
           and metabolism].
          Length = 442

 Score = 29.5 bits (67), Expect = 4.4
 Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 310 HQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLD---ESQEVECH 356
            +   T   + M R ++  +     ELK++P+ L+ R+     S E   +
Sbjct: 275 IEKFGTRANEVMKREKELFELYKKPELKEKPEELEKRIGAGKYSSEYASN 324


>gnl|CDD|221737 pfam12726, SEN1_N, SEN1 N terminal.  This domain is found at the N
           terminal of the helicase SEN1. SEN1 is a Pol II
           termination factor for noncoding RNA genes. The N
           terminal of SEN1, unlike the C terminal, is not required
           for growth.
          Length = 724

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 123 LLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEK 169
           L K   K K E+   K+  +K  + L +  S+ LE+L+    SEL  
Sbjct: 469 LEKFPPKGKDELTAEKTQFNKIVQQLTELISQILERLSDFDPSELSS 515


>gnl|CDD|161783 TIGR00237, xseA, exodeoxyribonuclease VII, large subunit.  This
           family consist of exodeoxyribonuclease VII, large
           subunit XseA which catalyses exonucleolytic cleavage in
           either the 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. Exonuclease VII consists of
           one large subunit and four small subunits [DNA
           metabolism, Degradation of DNA].
          Length = 432

 Score = 29.8 bits (67), Expect = 4.6
 Identities = 21/113 (18%), Positives = 44/113 (38%), Gaps = 1/113 (0%)

Query: 327 ELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLE 386
           E+   +  EL Q+     +RL  + +   H  K RL   +  L     +NK+  +  Q E
Sbjct: 251 EIVSPNQDELLQRLDGFNVRLHRAFDTLLHQKKARLEQLVASLQRQHPQNKLALQQLQFE 310

Query: 387 DRVSVRRALLEQKMDMETQQFLGERQERIRLL-HERQERELESFDQESNRMGF 438
                ++A L ++++   Q+     +   +     +  R     +Q + R+  
Sbjct: 311 KLEKRKQAALNKQLERTRQKKTRLTKRLTQTNPSPQILRAQTRTEQLNRRLNA 363


>gnl|CDD|223219 COG0141, HisD, Histidinol dehydrogenase [Amino acid transport and
           metabolism].
          Length = 425

 Score = 29.5 bits (67), Expect = 4.7
 Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 1/47 (2%)

Query: 398 QKMDMETQQFLGERQERIRLLHERQ-ERELESFDQESNRMGFRYRQI 443
           Q++D E ++ L    E I   HE Q  ++     +    +G R+R I
Sbjct: 71  QRLDPEVKEALEVAAENIEAFHEAQLPKDWLVETEPGVVLGQRWRPI 117


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This
           family consist of exonuclease VII, large subunit
           EC:3.1.11.6 This enzyme catalyzes exonucleolytic
           cleavage in either 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. This exonuclease VII enzyme
           is composed of one large subunit and 4 small ones.
          Length = 295

 Score = 29.2 bits (66), Expect = 5.1
 Identities = 23/118 (19%), Positives = 44/118 (37%), Gaps = 11/118 (9%)

Query: 332 HALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSV 391
              EL Q+ + L+ RL  + +      ++RL   L   +   S+  ++   ++L +    
Sbjct: 144 DRTELLQKLEGLEQRLSRALKNRLEKEQDRL-NLLRERLKSLSRRLLEQHEERLAE---- 198

Query: 392 RRALLEQKMDMETQQFLGERQERIRLLHERQERELESFDQESNRMGFRYRQILVSSQS 449
               L  ++    Q  L  +Q   RL      RELE      N++        + + S
Sbjct: 199 ----LRDRLISSIQNLLSRKQS--RLERLILNRELEKNSLLENKLATANLTAQLKALS 250


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 29.3 bits (66), Expect = 5.8
 Identities = 55/266 (20%), Positives = 101/266 (37%), Gaps = 17/266 (6%)

Query: 245 RPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLRED 304
           RPDE R   D         K   +  Q   +A   L    +K QEL  R       L E 
Sbjct: 142 RPDEQRQLLDTFAGANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLDFLQFQLEEL 201

Query: 305 QVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERL-H 363
           +     + +L   ED    +EQ+ +  +  +L++  ++    L    +V+   L E L  
Sbjct: 202 E-----EADLQPGEDEALEAEQQ-RLSNLEKLRELSQNALAALRGDVDVQEGSLLEGLGE 255

Query: 364 YELEILMAYQSK-NKMQAEAQQLEDRV-----SVRRALLEQKMDMETQQFLGERQERIRL 417
            +L +         ++  +       V      ++  L E + D E    + ER  +I+ 
Sbjct: 256 AQLALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERLAQIKR 315

Query: 418 LHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFLG 477
           L  +    +E   + + ++     Q+  S +S   LE+ V      LE+++D        
Sbjct: 316 LKRKYGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDK----LEEELDKAAVALSL 371

Query: 478 ERQERIRLLHERQERELESFDQESNR 503
            R++    L +R E+EL++   E   
Sbjct: 372 IRRKAAERLAKRVEQELKALAMEKAE 397


>gnl|CDD|234722 PRK00310, rpsC, 30S ribosomal protein S3; Reviewed.
          Length = 232

 Score = 29.0 bits (66), Expect = 5.9
 Identities = 11/13 (84%), Positives = 11/13 (84%)

Query: 382 AQQLEDRVSVRRA 394
           AQQLE RVS RRA
Sbjct: 121 AQQLERRVSFRRA 133


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 28.1 bits (63), Expect = 6.0
 Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 6/129 (4%)

Query: 293 RQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQE 352
            Q K    L   Q+ ++H+ EL   E    R+ ++L+++   EL++ PK ++      Q+
Sbjct: 7   EQLKDRFFLERTQLLKRHEKELEQLERQQKRTIEKLEQRQTQELRRLPKRIRA----EQK 62

Query: 353 VECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQ 412
                 KE L  E + L   Q   K+    +Q + R+   +   EQK   + ++FL +++
Sbjct: 63  TRLKMFKESLKIEKKELK--QEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQEREFLAKQE 120

Query: 413 ERIRLLHER 421
           E +    ++
Sbjct: 121 ENLEEALQQ 129


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.9 bits (65), Expect = 6.9
 Identities = 25/121 (20%), Positives = 53/121 (43%)

Query: 177 CEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAAGPSESIASSLTQ 236
            +KKL K +  +Q  +R+      +   + +KE+++    Q  ++ A   +        +
Sbjct: 100 RQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAE 159

Query: 237 GATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQK 296
            A+ +  G     R  +  +  ++    EL K +QQ ++      +  +K QE E R+QK
Sbjct: 160 SASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219

Query: 297 S 297
           +
Sbjct: 220 A 220


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 28.6 bits (64), Expect = 6.9
 Identities = 12/61 (19%), Positives = 26/61 (42%)

Query: 259 RQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQE 318
           +QR        +Q+ ++A     R+ +K Q LE ++   +   +E    ++       +E
Sbjct: 38  QQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEARE 97

Query: 319 D 319
           D
Sbjct: 98  D 98


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 29.3 bits (66), Expect = 7.1
 Identities = 44/233 (18%), Positives = 79/233 (33%), Gaps = 16/233 (6%)

Query: 277 HSMLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALEL 336
           H   L H E  +E E   ++       +       + L+   DY+   +    +   L  
Sbjct: 26  HDFGLVHLEDLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLFILP- 84

Query: 337 KQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALL 396
            ++ + L+  L   +EV     K     E       +  +++  E + LED +     L 
Sbjct: 85  -EEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPL- 142

Query: 397 EQKMDMETQQFLGERQERIRL-LHERQEREL--ESFDQESNRMGFRYRQ---ILVSSQSR 450
              +D +     G +   +RL L  R++ E      + E    G        I+V+  + 
Sbjct: 143 -AYLDFDLSLLRGLKFLLVRLGLVRREKLEALVGVIEDEVALYGENVEASVVIVVAHGA- 200

Query: 451 VELEDRVS-VRRALLEQKMDMETQQFLGERQERIRLLHERQE--RELESFDQE 500
            E  D+VS +   L  +  ++              L     E   ELES   E
Sbjct: 201 -EDLDKVSKILNELGFELYEVPEFDGGPSELISE-LEEVIAEIQDELESLRSE 251


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.0 bits (66), Expect = 7.4
 Identities = 29/140 (20%), Positives = 58/140 (41%), Gaps = 17/140 (12%)

Query: 168 EKKAKLNTSCEKKLTKEIIARQDCDRKVFEA-AKKKEYKASKERWKRELSQDERRRAAGP 226
           E K KLN         E    Q    K  EA A  KE +  KE  + +  + +       
Sbjct: 513 EDKEKLNELIASLEELERELEQ----KAEEAEALLKEAEKLKEELEEKKEKLQEEE---- 564

Query: 227 SESIASSLTQGATEASRGRPDEARLCAD---RKIIRQRAGKELNKRQQQLEQAHSMLLRH 283
            + +     + A +A +    EA+  AD   +++ + + G   + +  +L +A   L + 
Sbjct: 565 -DKLLEEAEKEAQQAIK----EAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKA 619

Query: 284 HEKTQELEYRQQKSVHSLRE 303
           +EK ++ + +Q++    L+ 
Sbjct: 620 NEKKEKKKKKQKEKQEELKV 639


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 29.3 bits (65), Expect = 7.5
 Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 2/83 (2%)

Query: 448 QSRVELEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERE-LESFDQES-NRMG 505
           + R+E  +R    R   E+   +E ++   ER ER RL  +R ER+ L+  ++E  +R+ 
Sbjct: 457 KKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLE 516

Query: 506 FSALAIAEASKESYPDMDGSLSG 528
              L  A  +      M+  LS 
Sbjct: 517 RDRLEKARRNSYFLKGMENGLSA 539


>gnl|CDD|216179 pfam00898, Orbi_VP2, Orbivirus outer capsid protein VP2.  VP2 acts
           as an anchor for VP1 and VP3. VP2 contains a
           non-specific DNA and RNA binding domain in the
           N-terminus.
          Length = 946

 Score = 28.9 bits (65), Expect = 8.4
 Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 6/69 (8%)

Query: 125 KLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTKE 184
           K  E CKI     + +LD E E       ++   L+++   +L++K +        + + 
Sbjct: 260 KALELCKILSAIGRKMLDTEEE------PKDEMDLSVRFQFKLDEKFRRTDQERVNIFEV 313

Query: 185 IIARQDCDR 193
              R D DR
Sbjct: 314 GGHRTDEDR 322


>gnl|CDD|236092 PRK07772, PRK07772, single-stranded DNA-binding protein;
           Provisional.
          Length = 186

 Score = 28.1 bits (63), Expect = 8.6
 Identities = 12/29 (41%), Positives = 12/29 (41%)

Query: 7   SSRHQGAHHSGAGGGSNNVGGGGGGGAPN 35
                G    G GGG    GGGGGGG   
Sbjct: 122 RGGGGGGGGGGFGGGGGGSGGGGGGGGGG 150


>gnl|CDD|233515 TIGR01659, sex-lethal, sex-lethal family splicing factor.  This
           model describes the sex-lethal family of splicing
           factors found in Dipteran insects. The sex-lethal
           phenotype, however, may be limited to the Melanogasters
           and closely related species. In Drosophila the protein
           acts as an inhibitor of splicing. This subfamily is most
           closely related to the ELAV/HUD subfamily of splicing
           factors (TIGR01661).
          Length = 346

 Score = 28.4 bits (63), Expect = 8.9
 Identities = 9/25 (36%), Positives = 10/25 (40%)

Query: 12  GAHHSGAGGGSNNVGGGGGGGAPNN 36
           G  + G  G  N    GG G  P N
Sbjct: 288 GHGNMGNMGHGNMGMAGGSGMNPPN 312


>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
           recombination, and repair].
          Length = 440

 Score = 28.8 bits (65), Expect = 9.3
 Identities = 25/126 (19%), Positives = 49/126 (38%), Gaps = 18/126 (14%)

Query: 335 ELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQS-------KNKMQAEAQQLED 387
           EL QQ   L+ RL          L+  L  + + L            +  +  + Q+L++
Sbjct: 265 ELLQQLDQLQRRLH-------RALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDE 317

Query: 388 RVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELESFDQESNRMGFRYRQILVSS 447
                R  LE ++ ++ Q+     Q     L+ + +R+ +   Q   R+    R+ L   
Sbjct: 318 LAIRLRRALENQLALKKQRLERLTQR----LNPQIQRQQQRLQQLERRLDKALRRQLKRK 373

Query: 448 QSRVEL 453
           + R+E 
Sbjct: 374 RERLEA 379


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.125    0.338 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,914,402
Number of extensions: 2679235
Number of successful extensions: 4394
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3903
Number of HSP's successfully gapped: 497
Length of query: 547
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 445
Effective length of database: 6,413,494
Effective search space: 2854004830
Effective search space used: 2854004830
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (27.4 bits)