RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6603
         (724 letters)



>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score =  469 bits (1210), Expect = e-159
 Identities = 171/332 (51%), Positives = 233/332 (70%), Gaps = 5/332 (1%)

Query: 168 FEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKH 227
           +EHSELLLY++ +I++SGDLEEA++HL+  ++QI D+L V E    L LKLG+  EA   
Sbjct: 1   YEHSELLLYKNSLIEESGDLEEALEHLEEKEKQIVDRLAVMEMRADLLLKLGRKEEAEAT 60

Query: 228 YESLIERNQENTLYYNKLVEAKQLT----NNDDIFQLLTHYISKYPKATVPKRLSLNYVS 283
           Y +L++RN EN  YY  L +A  L     + + +  L      KYP++  P+RL L+++ 
Sbjct: 61  YRALLDRNPENYDYYEGLQKALGLEISSGDIEALLALYDELAEKYPRSDAPRRLPLDFLE 120

Query: 284 GDQFRTEIDKYLRHGFHKGVPPLFVNLRSLYSNTEKCKIIQDLVESYVEALSKTGHFSLQ 343
           GD+F+   D YLR    KGVP LF +L+SLYS++EK  I+++LVE YV +L  +G FS  
Sbjct: 121 GDEFKELADAYLRRMLRKGVPSLFADLKSLYSDSEKVDILEELVEGYVSSLKSSGSFSGS 180

Query: 344 DEG-EVEPASALLWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHA 402
           D   E EP S LLW  ++LAQHYD+LG   KAL YI+ AI+HTPTL+EL++TK RI KHA
Sbjct: 181 DSNSEKEPPSTLLWTLYFLAQHYDYLGSRDKALEYIDKAIEHTPTLVELYMTKARILKHA 240

Query: 403 GDVLEAYKWLDEAQSLDTADRYINSKCAKYMLRANLIKEAEETCSKFTREGVSAMENLNE 462
           GD+ +A + ++EA+SLD ADRYINSKCAKYMLRAN  +EAE+T S FTR+G   + +LN+
Sbjct: 241 GDLQKAAEAMEEARSLDLADRYINSKCAKYMLRANENEEAEKTASLFTRDGTGPLADLND 300

Query: 463 MQCMWFQTECALAYQRLGRWGDTLKKCHEVDR 494
           MQCMWF TE   AYQR G++G  LK+ H V++
Sbjct: 301 MQCMWFLTEDGEAYQRQGKYGLALKRFHAVEK 332



 Score =  204 bits (521), Expect = 8e-58
 Identities = 64/103 (62%), Positives = 82/103 (79%)

Query: 610 REGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHEVDRHFSEIIEDQFDFHTYC 669
           R+G   + +LN+MQCMWF TE   AYQR G++G  LK+ H V++HF EI EDQFDFH+YC
Sbjct: 289 RDGTGPLADLNDMQCMWFLTEDGEAYQRQGKYGLALKRFHAVEKHFDEIQEDQFDFHSYC 348

Query: 670 MRKMTLRSYVRLLRLEDVLRSHPFYYTAARCAIQVYLRLHDRP 712
           +RKMTLR+YV +LR ED LRSHPFY+ AA  AI++YL+LHD+P
Sbjct: 349 LRKMTLRAYVDMLRWEDRLRSHPFYFKAALGAIEIYLKLHDKP 391


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 46.6 bits (111), Expect = 5e-05
 Identities = 105/592 (17%), Positives = 197/592 (33%), Gaps = 82/592 (13%)

Query: 42  GRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRD 101
            R +EA   +   L  D  +     + G L  S    + A+  YR A+    +NI ++  
Sbjct: 173 NRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLA 232

Query: 102 LSLLQIQMRDLEGYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQ 161
           L+ + I+  + E   +    L    P    +    A+      +++ A       R+T Q
Sbjct: 233 LATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKKNYEDA-------RETLQ 285

Query: 162 VTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQY 221
                  E+   LL         G+LE+A ++L++  +   +         +++L+LG+ 
Sbjct: 286 DALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRV 345

Query: 222 NEAMKHYESLIERNQENTLYYN----KLVEAKQLTNNDDIFQLLTHYISKYPKATVPKRL 277
           +EA+      +  + ++    +      +         +     T    +   A     +
Sbjct: 346 DEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGI 405

Query: 278 S-------------------LNYVSGDQFRTEIDKYLRHG-FHKGV-------------P 304
           S                   L+   G      I  YLR G F K +              
Sbjct: 406 SKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNA 465

Query: 305 PLFVNLRSLYSNTEKCKIIQDLVESYVEALS-KTGHFSLQDEGEVEPASALLWVYHYLAQ 363
            L   L ++Y        +    E++ +ALS +   F         PA+A L        
Sbjct: 466 SLHNLLGAIYLGKGD---LAKAREAFEKALSIEPDFF---------PAAANLARIDIQEG 513

Query: 364 HYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLDTADR 423
           + D       A+      +   P  +   +    +Y   G+  EA  WL++A  L+  + 
Sbjct: 514 NPD------DAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEI 567

Query: 424 YINSKCAKYMLRANLIKEAE---ETCSKFTREGVSAMENLNEMQCMWFQTECALA-YQRL 479
                 A+Y L    +K+A       +    +   A   L   Q        A++ +++L
Sbjct: 568 EPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKL 627

Query: 480 -GRWGDT------LKKCHEVDRKCYEHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTL 532
                D+      L   + V +       Y   +   K+ L      ++ E        L
Sbjct: 628 LALQPDSALALLLLADAYAVMKN------YAKAITSLKRALELKP--DNTEAQIGLAQLL 679

Query: 533 NCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALK 584
               R E A +  +   +   K+ + + + G L    K Y  AI+ YR ALK
Sbjct: 680 LAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALK 731



 Score = 45.1 bits (107), Expect = 1e-04
 Identities = 123/616 (19%), Positives = 215/616 (34%), Gaps = 81/616 (13%)

Query: 3   YEHKQYKNGLKFAKQIL-TNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKS 61
           ++ K Y++  +  +  L + P   E+   L + G +   LG  E+AY+Y+ + L+    S
Sbjct: 272 FQKKNYEDARETLQDALKSAP---EYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNS 328

Query: 62  HVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQ 121
           H    +   +Q    + DEAI     AL  + D+   +  L    + + D E   E   +
Sbjct: 329 HQARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAK 388

Query: 122 LFMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVI 181
              L P   A+     ++     D   A   +       Q+    +   ++LLL  S + 
Sbjct: 389 ATELDPENAAARTQLGISKLSQGDPSEA---IADLETAAQL--DPELGRADLLLILSYL- 442

Query: 182 QDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQEN-TL 240
             SG  ++A+    + +++  D  ++    GA+ L  G   +A + +E  +    +    
Sbjct: 443 -RSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPA 501

Query: 241 YYNKLVEAKQLTNNDDIFQLLTHYISKYPKATVPKRLSLNYVSGDQFRTEIDKYLRHGFH 300
             N      Q  N DD  Q     ++  PK  +   L+L  +           YLR G  
Sbjct: 502 AANLARIDIQEGNPDDAIQRFEKVLTIDPK-NLRAILALAGL-----------YLRTG-- 547

Query: 301 KGVPPLFVNLRSLYSNTEKCKIIQDLVESYVEALSKTGHFSLQDEGEVEPASALLWVYHY 360
                          N E             EA++     +  +  E+EPA AL      
Sbjct: 548 ---------------NEE-------------EAVAWLEKAAELNPQEIEPALAL------ 573

Query: 361 LAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLDT 420
            AQ+Y   G   KAL  +N A D  P   E ++  GR    AGD+ +A     +  +L  
Sbjct: 574 -AQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQP 632

Query: 421 ADRYINSKCAKYMLRANLIKEAEETCSKFTREGVSAMENLNEMQCMWFQTECALAYQRLG 480
                    A          +A  +     +  +    +  E Q    Q   A       
Sbjct: 633 DSALALLLLADAYAVMKNYAKAITS----LKRALELKPDNTEAQIGLAQLLLA-----AK 683

Query: 481 RWGDTLKKCHEVDRKCYEHKQ----------YKNGLKFAKQILTNPKYAEHGETLAMK-G 529
           R     K    + ++  +              +     A Q              A+K  
Sbjct: 684 RTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLH 743

Query: 530 LTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 589
             L   G   EA + +   L+      V       L  + K YD+AIK Y+  +K   DN
Sbjct: 744 RALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDN 803

Query: 590 IQIMRDLSLLQIQMRD 605
             ++ +L+ L ++++D
Sbjct: 804 AVVLNNLAWLYLELKD 819



 Score = 40.1 bits (94), Expect = 0.005
 Identities = 98/587 (16%), Positives = 188/587 (32%), Gaps = 94/587 (16%)

Query: 76  KKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQLFMLRPTQRASWIG 135
            + + A K Y  AL  +  ++     L+ L +     +  R    ++    P    + + 
Sbjct: 139 GQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLL 198

Query: 136 FAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLD 195
                  L + ++A   L A+RK   +       +  +LL  + ++ ++G+ EEA KH D
Sbjct: 199 KGDLLLSLGNIELA---LAAYRKAIAL----RPNNIAVLLALATILIEAGEFEEAEKHAD 251

Query: 196 RFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQE--NTLYYNKLVEAKQLTN 253
              ++  +          +  +   Y +A +  +  ++   E    L      E  QL N
Sbjct: 252 ALLKKAPNSPLAHYLKALVDFQKKNYEDARETLQDALKSAPEYLPALLLAGASEY-QLGN 310

Query: 254 NDDIFQLLTHYISKYPKATVPKR------LSLNYVSGDQFRTEIDKYLRHGFHKGVPPLF 307
            +  +Q L   +   P +   +R      L L  V  D+    +                
Sbjct: 311 LEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRV--DEAIATLS--------------- 353

Query: 308 VNLRSLYSNTEKCKIIQDLVESYVEA-LSKTGHFSLQDEGEVEPASA------------- 353
               +L  + +    +  L E+Y+     +     L    E++P +A             
Sbjct: 354 ---PALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLSQ 410

Query: 354 -------------------LLWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVT 394
                              L      L   Y   G   KAL          P    L   
Sbjct: 411 GDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNL 470

Query: 395 KGRIYKHAGDVLEAYKWLDEAQSLDTADRYINSKCAKYMLRANLIKEAEETCSKFTR--- 451
            G IY   GD+ +A +  ++A S++       +  A+  ++     +A +   K      
Sbjct: 471 LGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDP 530

Query: 452 EGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHEVDRKCYE-----------HK 500
           + + A+  L  +       E A+A+         L+K  E++ +  E             
Sbjct: 531 KNLRAILALAGLYLRTGNEEEAVAW---------LEKAAELNPQEIEPALALAQYYLGKG 581

Query: 501 QYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWH 560
           Q K  L    +        +  E   M G      G   +A    ++ L     S +   
Sbjct: 582 QLKKALAILNEAADA--APDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALL 639

Query: 561 VYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLE 607
           +        K Y +AI   + AL+ + DN +    L+ L +  +  E
Sbjct: 640 LLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTE 686



 Score = 33.5 bits (77), Expect = 0.45
 Identities = 27/115 (23%), Positives = 45/115 (39%), Gaps = 5/115 (4%)

Query: 42  GRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRD 101
           G   EA + +   L+      V       L  + K YD+AIK Y+  +K   DN  ++ +
Sbjct: 750 GNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNN 809

Query: 102 LSLLQIQMRDLE--GYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACNILE 154
           L+ L ++++D     Y E   +   L P   A             + D A  +L 
Sbjct: 810 LAWLYLELKDPRALEYAERALK---LAPNIPAILDTLGWLLVEKGEADRALPLLR 861



 Score = 31.6 bits (72), Expect = 1.6
 Identities = 52/240 (21%), Positives = 83/240 (34%), Gaps = 38/240 (15%)

Query: 381 AIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLDTADRYINSKCAKYMLRAN--- 437
           A+   P   E     G+IY   GD   A K L +A SL      +    A+  L      
Sbjct: 48  ALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQ 107

Query: 438 -LIKEAEETCSKFTREGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHEVDRKC 496
            ++ E     +    EG + +  L             LAY  LG+         E+ +K 
Sbjct: 108 QVLDELPGK-TLLDDEGAAELLALR-----------GLAYLGLGQ--------LELAQKS 147

Query: 497 YEHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSH 556
           YE     +           P+       LA   L  N   R +EA   +   L  D  + 
Sbjct: 148 YEQALAID-----------PRSLYAKLGLAQLALAEN---RFDEARALIDEVLTADPGNV 193

Query: 557 VCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYREGVSAM 616
               + G L  S    + A+  YR A+    +NI ++  L+ + I+  + E   +   A+
Sbjct: 194 DALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEAEKHADAL 253


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 22/85 (25%), Positives = 36/85 (42%)

Query: 34  KGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEH 93
            G     LG  +EA EY  + L  D  +   ++          KY+EA++ Y  AL+ + 
Sbjct: 6   LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDP 65

Query: 94  DNIQIMRDLSLLQIQMRDLEGYRET 118
           DN +   +L L   ++   E   E 
Sbjct: 66  DNAKAYYNLGLAYYKLGKYEEALEA 90



 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 22/84 (26%), Positives = 36/84 (42%)

Query: 528 KGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEH 587
            G     LG  +EA EY  + L  D  +   ++          KY+EA++ Y  AL+ + 
Sbjct: 6   LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDP 65

Query: 588 DNIQIMRDLSLLQIQMRDLEGYRE 611
           DN +   +L L   ++   E   E
Sbjct: 66  DNAKAYYNLGLAYYKLGKYEEALE 89



 Score = 38.1 bits (89), Expect = 0.002
 Identities = 18/63 (28%), Positives = 27/63 (42%)

Query: 357 VYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQ 416
            Y+ LA  Y  LG   +AL     A++  P   + +   G  Y   G   EA +  ++A 
Sbjct: 36  AYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKAL 95

Query: 417 SLD 419
            LD
Sbjct: 96  ELD 98



 Score = 36.6 bits (85), Expect = 0.006
 Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 1  KCYEHKQYKNGLKFAKQILT-NPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDL 59
            Y+   Y   L++ ++ L  +P  A+              LG+ EEA E   + L  D 
Sbjct: 9  LYYKLGDYDEALEYYEKALELDPDNAD---AYYNLAAAYYKLGKYEEALEDYEKALELDP 65

Query: 60 KSHVCWHVYGLLQRSDKKYDEAIKCYRNALK 90
           +   ++  GL      KY+EA++ Y  AL+
Sbjct: 66 DNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 36.6 bits (85), Expect = 0.006
 Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 495 KCYEHKQYKNGLKFAKQILT-NPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDL 553
             Y+   Y   L++ ++ L  +P  A+              LG+ EEA E   + L  D 
Sbjct: 9   LYYKLGDYDEALEYYEKALELDPDNAD---AYYNLAAAYYKLGKYEEALEDYEKALELDP 65

Query: 554 KSHVCWHVYGLLQRSDKKYDEAIKCYRNALK 584
            +   ++  GL      KY+EA++ Y  AL+
Sbjct: 66  DNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 35.8 bits (83), Expect = 0.009
 Identities = 21/95 (22%), Positives = 32/95 (33%)

Query: 357 VYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQ 416
               L   Y  LGD  +AL Y   A++  P   + +      Y   G   EA +  ++A 
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 417 SLDTADRYINSKCAKYMLRANLIKEAEETCSKFTR 451
            LD  +            +    +EA E   K   
Sbjct: 62  ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 32.4 bits (74), Expect = 0.16
 Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 2/93 (2%)

Query: 172 ELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESL 231
           E LL    +    GD +EA+++ ++  E   D         A   KLG+Y EA++ YE  
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60

Query: 232 IERNQENTLYYNKLVEAKQLTNNDDIFQLLTHY 264
           +E + +N   Y  L             + L  Y
Sbjct: 61  LELDPDNAKAYYNL--GLAYYKLGKYEEALEAY 91



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 24/131 (18%), Positives = 42/131 (32%), Gaps = 41/131 (31%)

Query: 69  GLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQLFMLRPT 128
           G L      YDEA++ Y  AL+                                  L P 
Sbjct: 7   GNLYYKLGDYDEALEYYEKALE----------------------------------LDPD 32

Query: 129 QRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLE 188
              ++   A AY+ L  ++ A   LE + K  ++        ++      +     G  E
Sbjct: 33  NADAYYNLAAAYYKLGKYEEA---LEDYEKALELDPDN----AKAYYNLGLAYYKLGKYE 85

Query: 189 EAVKHLDRFKE 199
           EA++  ++  E
Sbjct: 86  EALEAYEKALE 96



 Score = 28.1 bits (63), Expect = 5.0
 Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 7/103 (6%)

Query: 133 WIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVK 192
            +     Y+ L D+D A   LE + K  ++      ++++     +      G  EEA++
Sbjct: 3   LLNLGNLYYKLGDYDEA---LEYYEKALELDP----DNADAYYNLAAAYYKLGKYEEALE 55

Query: 193 HLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERN 235
             ++  E   D        G    KLG+Y EA++ YE  +E +
Sbjct: 56  DYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 43.9 bits (104), Expect = 2e-04
 Identities = 23/130 (17%), Positives = 47/130 (36%), Gaps = 7/130 (5%)

Query: 127 PTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGD 186
               A+  G A+  +L   +D A  +L+     Q     Y     ++LL           
Sbjct: 303 RGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEA-------NK 355

Query: 187 LEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQENTLYYNKLV 246
            +EA++ L +      +   ++       LK G+  EA++     +  + E+   ++ L 
Sbjct: 356 AKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLA 415

Query: 247 EAKQLTNNDD 256
           +A     N  
Sbjct: 416 QAYAELGNRA 425



 Score = 30.1 bits (68), Expect = 4.3
 Identities = 27/186 (14%), Positives = 49/186 (26%), Gaps = 11/186 (5%)

Query: 15  AKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRS 74
           A+Q        +  +    +                     R+        +   L    
Sbjct: 260 AEQ-SPPYNKLDSPDFQLARARIRAKYEALPNQQAADLLAKRSKRGGLAAQYGRALQTYL 318

Query: 75  DKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQLFMLRPTQRASWI 134
             +YDEA+K  +  +  + DN   +     + ++    +   E   +   L P      +
Sbjct: 319 AGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQL 378

Query: 135 GFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHL 194
             A A         A  IL          N Y F   E      ++ Q   +L    + L
Sbjct: 379 NLAQALLKGGKPQEAIRIL----------NRYLFNDPEDPNGWDLLAQAYAELGNRAEAL 428

Query: 195 DRFKEQ 200
               E 
Sbjct: 429 LARAEG 434


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 400

 Score = 38.1 bits (89), Expect = 0.015
 Identities = 32/182 (17%), Positives = 68/182 (37%), Gaps = 11/182 (6%)

Query: 94  DNIQIMRDLSL--LQIQMRDLEGYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACN 151
            +  +  +L+   L +  RD    RE   QL  + P           AY  L  +     
Sbjct: 149 GDDTLAVELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLA 208

Query: 152 ILEAFRKTQQVTNS----YDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTV 207
           IL   RK   +++      + +  E LL Q+    D    E           ++ +   +
Sbjct: 209 ILPKLRKAGLLSDEEAARLEQQAWEGLLQQAR---DDNGSEGLKTWWKNQPRKLRNDPEL 265

Query: 208 EETYGALKLKLGQYNEAMKHYESLIERNQENTLYYNKLVEAKQLTNNDDIFQLLTHYISK 267
              Y    ++LG ++EA +  E  ++R  +  L   +L+   +  + + + +    ++ +
Sbjct: 266 VVAYAERLIRLGDHDEAQEIIEDALKRQWDPRLC--RLIPRLRPGDPEPLIKAAEKWLKQ 323

Query: 268 YP 269
           +P
Sbjct: 324 HP 325


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 37.5 bits (85), Expect = 0.018
 Identities = 52/235 (22%), Positives = 91/235 (38%), Gaps = 6/235 (2%)

Query: 6   KQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSH--V 63
            +    L+  ++ L     ++    L +  L L  LGR EEA E + + L  +L  +   
Sbjct: 37  GELAEALELLEEALELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAE 96

Query: 64  CWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSL-LQIQMRDLEGYRETRYQL 122
                GLL  +  KY+EA++    AL  + D       L+L    ++ D E   E   + 
Sbjct: 97  ALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKA 156

Query: 123 FMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQ 182
             L P              L    +      EA    ++       + +E LL   ++  
Sbjct: 157 LELDPELNEL---AEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYL 213

Query: 183 DSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQE 237
             G  EEA+++ ++  E   D          L L+LG+Y EA++  E  +E + +
Sbjct: 214 KLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPD 268



 Score = 30.2 bits (66), Expect = 4.3
 Identities = 42/219 (19%), Positives = 74/219 (33%), Gaps = 7/219 (3%)

Query: 432 YMLRANLIKEAEETCSKFTREGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHE 491
             +    + +           G++ +E L E+       E AL           L     
Sbjct: 8   LAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLLAL 67

Query: 492 VDRKCYEHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRN 551
              K     + +  L+  ++ L         E L   GL L  LG+ EEA E + + L  
Sbjct: 68  ALLKL---GRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALAL 124

Query: 552 DLKSHVCWHVYGL-LQRSDKKYDEAIKCYRNALK---WEHDNIQIMRDLSLLQIQMRDLE 607
           D    +   +  L        Y+EA++ Y  AL+     ++  + +  L  L   +   E
Sbjct: 125 DPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYE 184

Query: 608 GYREGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLK 646
              E +     LN            L Y +LG++ + L+
Sbjct: 185 EALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALE 223


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 33.8 bits (78), Expect = 0.029
 Identities = 13/49 (26%), Positives = 22/49 (44%)

Query: 63  VCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRD 111
                 G        YDEAI+ Y  AL+ + DN +   +L+L  +++  
Sbjct: 4   EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGK 52



 Score = 33.8 bits (78), Expect = 0.029
 Identities = 13/49 (26%), Positives = 22/49 (44%)

Query: 557 VCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRD 605
                 G        YDEAI+ Y  AL+ + DN +   +L+L  +++  
Sbjct: 4   EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGK 52



 Score = 33.4 bits (77), Expect = 0.035
 Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 1/64 (1%)

Query: 357 VYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAG-DVLEAYKWLDEA 415
               L      LGD  +A+     A++  P   E +      Y   G D  EA + L++A
Sbjct: 5   ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKA 64

Query: 416 QSLD 419
             LD
Sbjct: 65  LELD 68



 Score = 30.4 bits (69), Expect = 0.42
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 29 ETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGL-LQRSDKKYDEAIKCYRN 87
          E L   G  L  LG  +EA E   + L  D  +   ++   L   +  K Y+EA++    
Sbjct: 4  EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEK 63

Query: 88 ALK 90
          AL+
Sbjct: 64 ALE 66



 Score = 30.4 bits (69), Expect = 0.42
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 523 ETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGL-LQRSDKKYDEAIKCYRN 581
           E L   G  L  LG  +EA E   + L  D  +   ++   L   +  K Y+EA++    
Sbjct: 4   EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEK 63

Query: 582 ALK 584
           AL+
Sbjct: 64  ALE 66



 Score = 28.4 bits (64), Expect = 2.0
 Identities = 15/44 (34%), Positives = 21/44 (47%)

Query: 212 GALKLKLGQYNEAMKHYESLIERNQENTLYYNKLVEAKQLTNND 255
           G    KLG Y+EA++ YE  +E + +N   Y  L  A      D
Sbjct: 10  GNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKD 53


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 33.9 bits (78), Expect = 0.032
 Identities = 20/75 (26%), Positives = 31/75 (41%), Gaps = 7/75 (9%)

Query: 354 LLWVYHYLAQHYDHLGDTMKALNYINAAI-------DHTPTLIELFVTKGRIYKHAGDVL 406
           L    + LA     LGD  +AL  +  A+       +  P          R+Y   GD  
Sbjct: 4   LAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYD 63

Query: 407 EAYKWLDEAQSLDTA 421
           EA ++L++A +L  A
Sbjct: 64  EALEYLEKALALREA 78



 Score = 28.1 bits (63), Expect = 3.8
 Identities = 16/72 (22%), Positives = 32/72 (44%), Gaps = 7/72 (9%)

Query: 173 LLLYQSMVIQDSGDLEEAVKHLDR-------FKEQIHDKLTVEETYGALKLKLGQYNEAM 225
            L   ++V++  GD +EA++ L++         E   +          L L LG Y+EA+
Sbjct: 7   ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66

Query: 226 KHYESLIERNQE 237
           ++ E  +   + 
Sbjct: 67  EYLEKALALREA 78



 Score = 26.9 bits (60), Expect = 8.9
 Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)

Query: 30 TLAMKGLTLNCLGRKEEAYEYVRRGL-------RNDLKSHVCWHVYGLLQRSDKKYDEAI 82
           L    L L  LG  +EA E + + L        +  ++    +    L  +   YDEA+
Sbjct: 7  ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66

Query: 83 KCYRNALK 90
          +    AL 
Sbjct: 67 EYLEKALA 74



 Score = 26.9 bits (60), Expect = 8.9
 Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)

Query: 524 TLAMKGLTLNCLGRKEEAYEYVRRGL-------RNDLKSHVCWHVYGLLQRSDKKYDEAI 576
            L    L L  LG  +EA E + + L        +  ++    +    L  +   YDEA+
Sbjct: 7   ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66

Query: 577 KCYRNALK 584
           +    AL 
Sbjct: 67  EYLEKALA 74


>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
           subunit; Provisional.
          Length = 987

 Score = 36.8 bits (85), Expect = 0.052
 Identities = 15/62 (24%), Positives = 26/62 (41%), Gaps = 1/62 (1%)

Query: 358 YHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQS 417
           Y +L       G    ALN +  +++  P+    +V +  IY+   +V  A   L  A  
Sbjct: 579 YWWLHAQRYIPGQPELALNDLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALE 637

Query: 418 LD 419
           L+
Sbjct: 638 LE 639


>gnl|CDD|215244 PLN02445, PLN02445, anthranilate synthase component I.
          Length = 523

 Score = 35.4 bits (82), Expect = 0.10
 Identities = 23/99 (23%), Positives = 47/99 (47%), Gaps = 12/99 (12%)

Query: 168 FEHSELLLYQSMVIQDSG--DLEEA----VKHLDRFKEQIHDKLTVEETYGALKLKLGQY 221
           F+H E   Y    ++      +EEA    +K L+    ++ D    + + G++KL   Q+
Sbjct: 165 FDHVEKKAYVIHWVRLDRYSSVEEAYEDGMKRLEALVSRLQDINPPKLSPGSVKLSTNQF 224

Query: 222 NEAMKHYESLIERNQENTLYYNKLVEAKQLTNNDDIFQL 260
             +++  +S +   +    Y N +++AK+     DIFQ+
Sbjct: 225 GPSLE--KSNMTSEE----YKNAVLQAKEHILAGDIFQI 257


>gnl|CDD|220676 pfam10288, DUF2392, Protein of unknown function (DUF2392).  This is
           a family of proteins conserved from plants to humans.
           The function is not known. It carries a characteristic
           GRG sequence motif.
          Length = 104

 Score = 32.2 bits (74), Expect = 0.18
 Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 285 DQFRTEIDKYLRHGFHKGVPPLFVNLRSLY-SNTEKCKIIQDLVESYVEAL 334
           D    EI+ Y      K +PPL +N      S   K   I +L E Y + L
Sbjct: 39  DLLLKEIEAYCSL---KNLPPLLINSLIQIASKLSKNMSINELTEQYFDNL 86


>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
           unknown].
          Length = 565

 Score = 34.8 bits (80), Expect = 0.20
 Identities = 34/123 (27%), Positives = 48/123 (39%), Gaps = 19/123 (15%)

Query: 309 NLRSLYSNTEKCKI--IQDLVESYVEALSKTGHFSLQDE----GEVEPASALLWVYHYLA 362
           N+ S Y   +   I  I+  + S   A  K+    L+DE    G VE A    ++ H L 
Sbjct: 51  NIISFYQENQLPTIRSIRIFLISMKIAFEKSISAILEDEPLSNGYVEEA----FLKHELP 106

Query: 363 QHYDHLGDTM----KALNYINAAIDHTPTLIEL---FVTK--GRIYKHAGDVLEAYKWLD 413
             YD   + M    KA+  I + + H   L +L             KHA D LE    LD
Sbjct: 107 NGYDRYEEIMEEQKKAIKDILSDVSHILDLGKLDTEDFKDEVDSARKHAKDTLEKLYKLD 166

Query: 414 EAQ 416
           +  
Sbjct: 167 QEG 169


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 34.2 bits (78), Expect = 0.27
 Identities = 43/233 (18%), Positives = 88/233 (37%), Gaps = 21/233 (9%)

Query: 162 VTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQY 221
           VT SY        + ++ +  + GD ++A +  D+  +   +   +      L    G++
Sbjct: 364 VTQSY--------IKRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEF 415

Query: 222 NEAMKHYESLIERNQENTLYYNKL-VEAKQLTNNDDIFQLLTHYISKYPKA-TVPKRLSL 279
            +A K Y+  I+ + +    + +L V   +  +              +P+A  V      
Sbjct: 416 AQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGE 475

Query: 280 NYVSGDQFRTEIDKY-LRHGFHKGVPPLFVNLRSLYSNTEKCKIIQDLVESYVEALSKTG 338
             +  ++F   I+K+       K   P+++N+  L +   K   +    + ++EA     
Sbjct: 476 LLLDQNKFDEAIEKFDTAIELEKETKPMYMNVLPLIN---KALALFQWKQDFIEA-ENLC 531

Query: 339 HFSLQDEGEVEPASALLWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIEL 391
             +L  + E + A A       +AQ     GD  +AL     A +   T  EL
Sbjct: 532 EKALIIDPECDIAVAT------MAQLLLQQGDVDEALKLFERAAELARTEGEL 578


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 29.7 bits (68), Expect = 0.32
 Identities = 9/22 (40%), Positives = 12/22 (54%)

Query: 69 GLLQRSDKKYDEAIKCYRNALK 90
          G       KYDEA++ Y  AL+
Sbjct: 8  GNAYLKLGKYDEALEYYEKALE 29



 Score = 29.7 bits (68), Expect = 0.32
 Identities = 9/22 (40%), Positives = 12/22 (54%)

Query: 563 GLLQRSDKKYDEAIKCYRNALK 584
           G       KYDEA++ Y  AL+
Sbjct: 8   GNAYLKLGKYDEALEYYEKALE 29



 Score = 27.0 bits (61), Expect = 3.1
 Identities = 13/27 (48%), Positives = 19/27 (70%)

Query: 212 GALKLKLGQYNEAMKHYESLIERNQEN 238
           G   LKLG+Y+EA+++YE  +E N  N
Sbjct: 8   GNAYLKLGKYDEALEYYEKALELNPNN 34


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 30.0 bits (68), Expect = 0.53
 Identities = 17/59 (28%), Positives = 24/59 (40%)

Query: 361 LAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLD 419
           LA+     GD  +AL  + AA+   P   E  +  G      G + EA   L  A + D
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61


>gnl|CDD|198356 cd07778, FGGY_L-RBK_like, L-ribulokinase-like proteins; a subfamily
           of the FGGY family of carbohydrate kinases.  This
           subfamily is composed of a group of putative bacterial
           L-ribulokinases (RBK; EC 2.7.1.16) and similar proteins.
           L-RBK catalyzes the MgATP-dependent phosphorylation of a
           variety of sugar substrates. Members of this subfamily
           belong to the FGGY family of carbohydrate kinases, the
           monomers of which contain two large domains, which are
           separated by a deep cleft that forms the active site.
           This model includes both the N-terminal domain, which
           adopts a ribonuclease H-like fold, and the structurally
           related C-terminal domain.
          Length = 466

 Score = 32.3 bits (74), Expect = 0.94
 Identities = 31/125 (24%), Positives = 49/125 (39%), Gaps = 27/125 (21%)

Query: 105 LQIQMRDLEGYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTN 164
           LQ    DL    + +Y +     T   + IG +   H    FD    +L  FR T + T 
Sbjct: 56  LQACYADLANDVKEKYGV---TLTTIGA-IGISAMMHGYLAFDKNGKLLVPFR-TWRNTI 110

Query: 165 SYDF--EHSELL------------LYQSMVIQDSGDLEEAVKHLDRFKE---QIHDKLTV 207
           + +   E +EL             LYQ+++     + EE VK +         +H +LT 
Sbjct: 111 TEEAAAELTELFGFNIPQRWSIAHLYQAIL-----NKEEHVKEVAFLTTLAGYVHWQLTG 165

Query: 208 EETYG 212
           E+  G
Sbjct: 166 EKVLG 170


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
          includes outlying Tetratricopeptide-like repeats (TPR)
          that are not matched by pfam00515.
          Length = 34

 Score = 27.9 bits (63), Expect = 1.4
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 65 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 95
           +  GL       Y+EA++ Y  AL+ + +N
Sbjct: 4  LYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 27.9 bits (63), Expect = 1.4
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 559 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 589
            +  GL       Y+EA++ Y  AL+ + +N
Sbjct: 4   LYNLGLAYYKLGDYEEALEAYEKALELDPNN 34


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
          or more copies in proteins. Contain a minimum of 34
          amino acids each and self-associate via a "knobs and
          holes" mechanism.
          Length = 34

 Score = 27.8 bits (63), Expect = 1.5
 Identities = 8/22 (36%), Positives = 11/22 (50%)

Query: 69 GLLQRSDKKYDEAIKCYRNALK 90
          G        YDEA++ Y  AL+
Sbjct: 8  GNAYLKLGDYDEALEYYEKALE 29



 Score = 27.8 bits (63), Expect = 1.5
 Identities = 8/22 (36%), Positives = 11/22 (50%)

Query: 563 GLLQRSDKKYDEAIKCYRNALK 584
           G        YDEA++ Y  AL+
Sbjct: 8   GNAYLKLGDYDEALEYYEKALE 29



 Score = 27.0 bits (61), Expect = 2.6
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 212 GALKLKLGQYNEAMKHYESLIERNQEN 238
           G   LKLG Y+EA+++YE  +E +  N
Sbjct: 8   GNAYLKLGDYDEALEYYEKALELDPNN 34


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 27.8 bits (63), Expect = 1.6
 Identities = 8/31 (25%), Positives = 17/31 (54%)

Query: 65 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 95
          +++ G +      Y+EA + Y  AL+ + +N
Sbjct: 4  YYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34



 Score = 27.8 bits (63), Expect = 1.6
 Identities = 8/31 (25%), Positives = 17/31 (54%)

Query: 559 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 589
           +++ G +      Y+EA + Y  AL+ + +N
Sbjct: 4   YYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34


>gnl|CDD|203842 pfam08032, SpoU_sub_bind, RNA 2'-O ribose methyltransferase
           substrate binding.  This domain is a RNA 2'-O ribose
           methyltransferase substrate binding domain.
          Length = 74

 Score = 28.4 bits (64), Expect = 2.8
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 378 INAAIDHTPTLIELFVTKGRIYKHAGDVLEAYK 410
           +  A+   P + +L+VT+GR      ++L+  K
Sbjct: 8   VEEALKSGPEIEKLYVTEGRRDPRLKELLKLAK 40


>gnl|CDD|223260 COG0182, COG0182, Predicted translation initiation factor 2B
           subunit, eIF-2B alpha/beta/delta family [Translation,
           ribosomal structure and biogenesis].
          Length = 346

 Score = 30.7 bits (70), Expect = 3.0
 Identities = 17/99 (17%), Positives = 32/99 (32%), Gaps = 23/99 (23%)

Query: 180 VIQDSGDLEEAVKHLDRFKEQIHD--------KLTVEETYGALK-------LKLGQYNEA 224
             ++    EE ++ L++  E +             ++    A K        K     EA
Sbjct: 66  ESKNDSKGEEFIEALEKAAETLKSTRPTAVNLFWALDRMLNAAKEAIEVKEPKESILQEA 125

Query: 225 MKHYESLIERNQENTLYYNKLVEAKQLTNNDDIFQLLTH 263
            +  E  +E N+       +L+         D   +LTH
Sbjct: 126 EEIAEEDLEANRAIGENGAELLP--------DGDTVLTH 156


>gnl|CDD|215450 PLN02837, PLN02837, threonine-tRNA ligase.
          Length = 614

 Score = 30.6 bits (69), Expect = 3.7
 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 8/59 (13%)

Query: 151 NILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEE 209
           +I+E   K     + YD      LLY   V +   DL +   HLD +KE ++D++ +E+
Sbjct: 251 HIIEDSWKKMHFEHGYD------LLYTPHVAK--ADLWKTSGHLDFYKENMYDQMDIED 301


>gnl|CDD|100115 cd05804, StaR_like, StaR_like; a well-conserved protein found in
           bacteria, plants, and animals. A family member from
           Streptomyces toyocaensis, StaR is part of a gene cluster
           involved in the biosynthesis of glycopeptide antibiotics
           (GPAs), specifically A47934. It has been speculated that
           StaR could be a flavoprotein hydroxylating a tyrosine
           sidechain. Some family members have been annotated as
           proteins containing tetratricopeptide (TPR) repeats,
           which may at least indicate mostly alpha-helical
           secondary structure.
          Length = 355

 Score = 29.9 bits (68), Expect = 4.5
 Identities = 31/127 (24%), Positives = 49/127 (38%), Gaps = 33/127 (25%)

Query: 7   QYKNGLKFAKQILT-NPK--YAEHGETLA----MKGLTLNCLGRKEEAYEYVRRGLR--- 56
           QY    + A++ L  NP   +A H   +A    M+G       R +E   ++        
Sbjct: 129 QYDRAEEAARRALELNPDDAWAVH--AVAHVLEMQG-------RFKEGIAFMESWRDTWD 179

Query: 57  --NDLKSHVCWHV-YGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDL----SLLQIQM 109
             + L+ H  WH+    L+R D  Y+ A+  Y   +           DL    SLL    
Sbjct: 180 CSSMLRGHNWWHLALFYLERGD--YEAALAIYDTHI-APSAESDPALDLLDAASLL---W 233

Query: 110 R-DLEGY 115
           R +L G+
Sbjct: 234 RLELAGH 240



 Score = 29.9 bits (68), Expect = 4.5
 Identities = 31/127 (24%), Positives = 49/127 (38%), Gaps = 33/127 (25%)

Query: 501 QYKNGLKFAKQILT-NPK--YAEHGETLA----MKGLTLNCLGRKEEAYEYVRRGLR--- 550
           QY    + A++ L  NP   +A H   +A    M+G       R +E   ++        
Sbjct: 129 QYDRAEEAARRALELNPDDAWAVH--AVAHVLEMQG-------RFKEGIAFMESWRDTWD 179

Query: 551 --NDLKSHVCWHV-YGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDL----SLLQIQM 603
             + L+ H  WH+    L+R D  Y+ A+  Y   +           DL    SLL    
Sbjct: 180 CSSMLRGHNWWHLALFYLERGD--YEAALAIYDTHI-APSAESDPALDLLDAASLL---W 233

Query: 604 R-DLEGY 609
           R +L G+
Sbjct: 234 RLELAGH 240


>gnl|CDD|237283 PRK13106, ubiA, prenyltransferase; Reviewed.
          Length = 300

 Score = 29.7 bits (67), Expect = 4.7
 Identities = 10/32 (31%), Positives = 13/32 (40%)

Query: 671 RKMTLRSYVRLLRLEDVLRSHPFYYTAARCAI 702
            +      +R LR+E    S P  Y  A  AI
Sbjct: 11  TRSKFYIILRFLRIEQTFFSLPMAYLGAFVAI 42


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 26.7 bits (59), Expect = 5.0
 Identities = 11/44 (25%), Positives = 17/44 (38%)

Query: 355 LWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRI 398
                 LA+    LGD  +AL  +  A+   P   E  +   R+
Sbjct: 1   PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44


>gnl|CDD|221837 pfam12895, Apc3, Anaphase-promoting complex, cyclosome, subunit
          3.  Apc3, otherwise known as Cdc27, is one of the
          subunits of the anaphase-promoting complex or
          cyclosome. The anaphase-promoting complex is a
          multiprotein subunit E3 ubiquitin ligase complex that
          controls segregation of chromosomes and exit from
          mitosis in eukaryotes. The protein members of this
          family contain TPR repeats just as those of Apc7 do,
          and it appears that these TPR units bind the C-termini
          of the APC co-activators CDH1 and CDC20.
          Length = 80

 Score = 27.6 bits (62), Expect = 6.0
 Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 5/85 (5%)

Query: 4  EHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHV 63
          +   Y+N +  A+++L      E    LA         G+ + AYE +R+   +   S  
Sbjct: 1  DQGNYENAIFLAEKLLALTPSNEDAYLLA---QCYFLQGQYKRAYELLRKLKLD--NSSG 55

Query: 64 CWHVYGLLQRSDKKYDEAIKCYRNA 88
          C ++         KYDEAI     A
Sbjct: 56 CRYLLAQCLLKLGKYDEAIAVLEKA 80



 Score = 27.6 bits (62), Expect = 6.0
 Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 5/85 (5%)

Query: 498 EHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHV 557
           +   Y+N +  A+++L      E    LA         G+ + AYE +R+   +   S  
Sbjct: 1   DQGNYENAIFLAEKLLALTPSNEDAYLLA---QCYFLQGQYKRAYELLRKLKLD--NSSG 55

Query: 558 CWHVYGLLQRSDKKYDEAIKCYRNA 582
           C ++         KYDEAI     A
Sbjct: 56  CRYLLAQCLLKLGKYDEAIAVLEKA 80


>gnl|CDD|151696 pfam11254, DUF3053, Protein of unknown function (DUF3053).  Some
           members in this family of proteins are annotated as the
           membrane protein YiaF. No function is currently known.
          Length = 229

 Score = 28.9 bits (65), Expect = 7.1
 Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 19/99 (19%)

Query: 125 LRPTQRASWIGFAMAYHLLHDFD------MACNILEAFRKTQQVTNSYDFEHSELLLYQS 178
           L   Q+ ++  +A  Y +L DF       +  ++L A  +   + ++ D           
Sbjct: 49  LTADQKKAFGPYADDYAILTDFSQQMDQAVDGSLLPALDQIGSIRSAQD----------- 97

Query: 179 MVIQDSGDLEEAVKHLDRFKEQIHD-KLTVEETYGALKL 216
             +    DL ++   L+   +Q+ + K   +  + ALK 
Sbjct: 98  -YLTKRDDLRQSRGGLNLLGQQLQNAKAQADAAHSALKQ 135


>gnl|CDD|147082 pfam04740, LXG, LXG domain of WXG superfamily.  This domain is
           present is the N-terminal region of a group of
           polymorphic toxin proteins in bacteria. It is predicted
           to use Type VII secretion pathway to mediate export of
           bacterial toxins.
          Length = 204

 Score = 28.8 bits (65), Expect = 8.5
 Identities = 21/124 (16%), Positives = 43/124 (34%), Gaps = 19/124 (15%)

Query: 309 NLRSLYSNTEK--CKIIQDLVESYVEALSKTGHFSLQDEGEVEPAS-ALL---WVYHYLA 362
            +++ Y +        I + ++ Y+E L K   F       VE +S   +   ++ H L 
Sbjct: 50  AIKAFYQDLHLPVLDSILEFIDEYIEFLEKIKSFLSD----VESSSNGYIDEEFLEHELK 105

Query: 363 QHYDHLGDTMKAL-NYINAAIDHTPTLIELFVTKGRIY--------KHAGDVLEAYKWLD 413
                  +  + L + IN+ +     ++ L       +        K   D LE     D
Sbjct: 106 NGLKKAEEKTEELKDAINSILASISDIVHLPKFSTSAFQEQLDKAKKKRNDTLEKLYEFD 165

Query: 414 EAQS 417
           +  +
Sbjct: 166 QELT 169


>gnl|CDD|218497 pfam05205, COMPASS-Shg1, COMPASS (Complex proteins associated with
           Set1p) component shg1.  The Shg1 subunit is one of the
           eight subunits of the COMPASS complex, complex
           associated with SET1, conserved in yeasts and in other
           eukaryotes up to humans. It is associated with the
           region of the Set1 protein that is N-terminal to the
           C-terminus, ie Set1-560-900. The function of Shg1 seems
           to be to slightly inhibit histone 3 lysine 4 (H3K4) di-
           and tri-methylation, and it is a pioneer protein. The
           COMPASS complex functions to methylate the fourth lysine
           of Histone 3 and for silencing of genes close to the
           telomeres of chromosomes.
          Length = 106

 Score = 27.6 bits (62), Expect = 9.2
 Identities = 10/45 (22%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 193 HLDRFKEQIHDKLTVEETYGALKLKLGQYNEAM-KHYESLIERNQ 236
           H D+ ++ I ++    ET   L L++ +  E   +    L+ +N+
Sbjct: 12  HFDKLRKDILEEFEKSETEQNLTLRVQELVETEVEKDPQLLSKNR 56


>gnl|CDD|221398 pfam12056, DUF3537, Protein of unknown function (DUF3537).  This
           family of transmembrane proteins are functionally
           uncharacterized. This protein is found in eukaryotes.
           Proteins in this family are typically between 427 to 453
           amino acids in length.
          Length = 398

 Score = 28.9 bits (65), Expect = 9.6
 Identities = 10/45 (22%), Positives = 19/45 (42%)

Query: 463 MQCMWFQTECALAYQRLGRWGDTLKKCHEVDRKCYEHKQYKNGLK 507
           + C+ F+  C L   RL  +    +   +V+    EH + +  L 
Sbjct: 176 LVCVLFRLICHLQILRLDDFAKLFEGESDVESVLKEHLRIRRQLS 220


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.135    0.414 

Gapped
Lambda     K      H
   0.267   0.0748    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,562,733
Number of extensions: 3599180
Number of successful extensions: 3422
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3376
Number of HSP's successfully gapped: 101
Length of query: 724
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 620
Effective length of database: 6,324,786
Effective search space: 3921367320
Effective search space used: 3921367320
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.0 bits)