RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16067
         (1386 letters)



>gnl|CDD|188655 cd09823, peroxinectin_like, peroxinectin_like animal heme
            peroxidases.  Peroxinectin is an arthropod protein that
            plays a role in invertebrate immunity mechanisms.
            Specifically, peroxinectins are secreted as cell-adhesive
            and opsonic peroxidases. The immunity mechanism appears
            to involve an interaction between peroxinectin and a
            transmembrane receptor of the integrin family. Human
            myeloperoxidase, which is included in this wider family,
            has also been reported to interact with integrins.
          Length = 378

 Score =  272 bits (698), Expect = 2e-81
 Identities = 142/485 (29%), Positives = 198/485 (40%), Gaps = 136/485 (28%)

Query: 693  QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPTGDPSNCVNASSNIC 752
            Q+ Q T F+D S +YGS  ++AR+L               RTF                 
Sbjct: 3    QLNQVTSFLDGSQVYGSSEEEARKL---------------RTF----------------- 30

Query: 753  YDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQH 812
               G ++  +      +    L F N   +       S                  CF  
Sbjct: 31   -KGGLLKTQR-----RNGRELLPFSNNPTDD--CSLSSAGKP--------------CF-- 66

Query: 813  IVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARR 872
                     L GD RVN+Q  L    T++LR HN +A++L + NP W   DE ++QEAR+
Sbjct: 67   ---------LAGDGRVNEQPGLTSMHTLFLREHNRIADELKKLNPHWD--DERLFQEARK 115

Query: 873  LVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGE 932
            +VI   QHI YNE+LPIL+G    +  GL   +SGYF GYDP V+PS L +FAA AFR  
Sbjct: 116  IVIAQMQHITYNEFLPILLGRELMEKFGLYLLTSGYFNGYDPNVDPSILNEFAAAAFR-- 173

Query: 933  SYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQEL 992
                          +                        HSL+PGT + ++E       +
Sbjct: 174  --------------FG-----------------------HSLVPGTFERLDENYRPQGSV 196

Query: 993  PFSSTMQHPGDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVD 1052
                   +P  + E     D  LRGL  Q  Q  DRF T++     F  G        +D
Sbjct: 197  NLHDLFFNPDRLYEE-GGLDPLLRGLATQPAQKVDRFFTDELTTHFFFRGGNPFG---LD 252

Query: 1053 LVGLTL------GFF----YLWYCGG-----FLLQ--------VTRLSQVYRGVNDIDYY 1089
            L  L +      G      Y  +CG      F           + +L ++Y+ V+DID Y
Sbjct: 253  LAALNIQRGRDHGLPGYNDYREFCGLPRATTFDDLLGIMSPETIQKLRRLYKSVDDIDLY 312

Query: 1090 IGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSLT 1149
            +GGL E  + G L  P+F C+  E F R + GDR++YE      P +FTP  QL      
Sbjct: 313  VGGLSEKPVPGGLVGPTFACIIGEQFRRLRRGDRFWYENGGQ--PSSFTP-AQLNEIRKV 369

Query: 1150 SLASL 1154
            SLA +
Sbjct: 370  SLARI 374



 Score =  158 bits (401), Expect = 7e-42
 Identities = 60/130 (46%), Positives = 77/130 (59%), Gaps = 3/130 (2%)

Query: 277 RGEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRG 336
              FG DLAA +IQRGRD+GL  Y DYRE CGL    +F DL   M  E + +L ++Y+ 
Sbjct: 246 GNPFGLDLAALNIQRGRDHGLPGYNDYREFCGLPRATTFDDLLGIMSPETIQKLRRLYKS 305

Query: 337 VNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQ 396
           V+DID Y+GGL E  + G L  P+F C+  E F R + GDR++YE      P +FTP  Q
Sbjct: 306 VDDIDLYVGGLSEKPVPGGLVGPTFACIIGEQFRRLRRGDRFWYENGGQ--PSSFTP-AQ 362

Query: 397 LTLTSLTSLA 406
           L      SLA
Sbjct: 363 LNEIRKVSLA 372



 Score = 30.6 bits (70), Expect = 6.6
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 540 EAQLDAIKKVFLSQIIC 556
            AQL+ I+KV L++IIC
Sbjct: 360 PAQLNEIRKVSLARIIC 376


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score =  250 bits (639), Expect = 3e-70
 Identities = 109/157 (69%), Positives = 138/157 (87%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
            ++P+KE KTLT+ D+GIGMTKADLVNNLGTIA+SGTKAFMEAL+AG D+SMIGQFGVGFY
Sbjct: 65   VVPDKENKTLTVEDNGIGMTKADLVNNLGTIARSGTKAFMEALEAGGDMSMIGQFGVGFY 124

Query: 1287 SAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIE 1346
            SAYLVAD+VTVTSK+N DE Y+WESSAGG+FTI       + RGT+I LH+KEDQ EY+E
Sbjct: 125  SAYLVADRVTVTSKNNSDESYVWESSAGGTFTITSTPESDMKRGTRITLHLKEDQMEYLE 184

Query: 1347 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEE 1383
             +++KE++KKHS+FIGY I+L+VEK  EKE+++++EE
Sbjct: 185  PRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEE 221



 Score =  138 bits (349), Expect = 2e-33
 Identities = 63/121 (52%), Positives = 88/121 (72%), Gaps = 6/121 (4%)

Query: 434 MTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIV 493
           +TVTSK+N DE Y+WESSAGG+FTI       + RGT+I LH+KEDQ EY+E +++KE++
Sbjct: 133 VTVTSKNNSDESYVWESSAGGTFTITSTPESDMKRGTRITLHLKEDQMEYLEPRRLKELI 192

Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE-----DEAQLDAIKK 548
           KKHS+FIGY I+L+VEK  EKE++ DE+EE  ++  E+ + PK+E     DE +    KK
Sbjct: 193 KKHSEFIGYDIELMVEKTTEKEVT-DEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKK 251

Query: 549 V 549
           V
Sbjct: 252 V 252



 Score = 63.5 bits (154), Expect = 6e-10
 Identities = 28/40 (70%), Positives = 35/40 (87%)

Query: 1   IVETLRQKADADKNDKAVKDLVNLLFETSLLSSGFTLEDP 40
           I++ LR++  AD+NDKAVKDLV LLF+TSLL+SGF LEDP
Sbjct: 614 IIKELRRRVGADENDKAVKDLVFLLFDTSLLTSGFQLEDP 653


>gnl|CDD|217366 pfam03098, An_peroxidase, Animal haem peroxidase. 
          Length = 521

 Score =  227 bits (580), Expect = 5e-64
 Identities = 111/357 (31%), Positives = 163/357 (45%), Gaps = 79/357 (22%)

Query: 822  LIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHI 881
            L GD RVN+   L    T++LR HN +A++L   NP WS  DE ++QEAR +VI  +Q I
Sbjct: 204  LAGDSRVNENPGLTALHTLFLREHNRIADELKALNPHWS--DEKLFQEARLIVIAQYQKI 261

Query: 882  VYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLS 941
             YNE+LP L+G    +   L                                        
Sbjct: 262  TYNEYLPALLGPDLMRANWLLL-------------------------------------- 283

Query: 942  PTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQELPFSSTMQHP 1001
                 Y+ GYDP V+PS   +FA  A+R  HSLIP  ++ ++E R    E+P   T  +P
Sbjct: 284  -----YYTGYDPNVDPSISNEFATAAYRFGHSLIPPGLERLDEFRTIAPEIPLHDTFFNP 338

Query: 1002 GDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVDLVGLT---- 1057
              ILE  +  D  LRGL +Q  +  D  ++++  + LF  G  +     +DL  L     
Sbjct: 339  SRILE--EGIDPLLRGLASQPAELLDNSLSDELRNRLF--GPRNFPGSGLDLAALNIQRG 394

Query: 1058 ----LGFFYLW--YCG-----------GFLLQ---VTRLSQVYRGVNDIDYYIGGLLESV 1097
                L  +  +  +CG             +       +L ++Y   +DID ++GGL E  
Sbjct: 395  RDHGLPPYNEYRRFCGLKPATSFEDLTDEIGDEELAEKLKELYGDPDDIDLWVGGLAEKP 454

Query: 1098 IRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSLTSLASL 1154
            + G L  P+F C+ AE F R + GDR++YE     NPG FT  +QL     T+LA +
Sbjct: 455  VPGGLVGPTFACIIAEQFLRLRDGDRFWYE-----NPGVFTG-EQLEEIRKTTLARV 505



 Score =  176 bits (449), Expect = 6e-47
 Identities = 76/242 (31%), Positives = 108/242 (44%), Gaps = 55/242 (22%)

Query: 597 PKCCSDSGDRVRSTSGNCRPILIPSNDDFYSQFNRRCLEYKRSLAINCSVGALQPDKICP 656
             CC+D+          C PI +P  D ++S F R CL + RS           P     
Sbjct: 92  SPCCADN-------HPECFPIPVPPGDPYFSPFGR-CLPFFRSA----------PACGTG 133

Query: 657 GRESNPRPSAYKADVLATKLIRLKNFFFTDVETCTFQIVQSTHFIDVSFMYGSHPDKARE 716
              + PR                             QI Q T ++D+S +YGS  ++A +
Sbjct: 134 PSCNLPRE----------------------------QINQLTSYLDLSQVYGSSEEEADK 165

Query: 717 LRSLSRGKLRV----REVNGRTFMPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTV 772
           LR+   GKL+V       NG+  +P   P      S   C+ AGD RVN+   L    T+
Sbjct: 166 LRTFKDGKLKVNGEFPPPNGKGLLPAPPPGPSGCLS---CFLAGDSRVNENPGLTALHTL 222

Query: 773 WLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPILIGDIRVNQQL 832
           +LR HN +A++L   NP WS  DE ++QEAR +VI  +Q I YNE+LP L+G   +    
Sbjct: 223 FLREHNRIADELKALNPHWS--DEKLFQEARLIVIAQYQKITYNEYLPALLGPDLMRANW 280

Query: 833 DL 834
            L
Sbjct: 281 LL 282



 Score =  141 bits (358), Expect = 4e-35
 Identities = 59/156 (37%), Positives = 83/156 (53%), Gaps = 17/156 (10%)

Query: 253 FSQLVAHDITLNPQESTAVPTRGLRGEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTV 312
            S  + + +        +            DLAA +IQRGRD+GL PY +YR  CGL   
Sbjct: 365 LSDELRNRLFGPRNFPGSGL----------DLAALNIQRGRDHGLPPYNEYRRFCGLKPA 414

Query: 313 RSFGDLNDFMDSEQV-TRLSQVYRGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYR 371
            SF DL D +  E++  +L ++Y   +DID ++GGL E  + G L  P+F C+ AE F R
Sbjct: 415 TSFEDLTDEIGDEELAEKLKELYGDPDDIDLWVGGLAEKPVPGGLVGPTFACIIAEQFLR 474

Query: 372 YKYGDRYFYEGPPATNPGAFTPGKQLTLTSLTSLAS 407
            + GDR++YE     NPG FT  +QL     T+LA 
Sbjct: 475 LRDGDRFWYE-----NPGVFTG-EQLEEIRKTTLAR 504



 Score =  102 bits (255), Expect = 3e-22
 Identities = 41/92 (44%), Positives = 54/92 (58%), Gaps = 5/92 (5%)

Query: 179 YRTLDGTCNNLRRPWWGAVDTAHVRLMRPYYEDGVSSYRVSQVDASPLPNARVLSRVFLP 238
           YR+ DG+CNNL+ P WGA  T   RL+ P YEDGVS+ R S V+ SPLP+ RV+S   L 
Sbjct: 1   YRSADGSCNNLQNPLWGAAGTPFTRLLPPVYEDGVSTPRGS-VNGSPLPSPRVVSNKLLA 59

Query: 239 DRNISDVH----TRMYLEFSQLVAHDITLNPQ 266
            +N         T M  ++ Q + HD+T  P 
Sbjct: 60  RKNGFIPPDPGLTLMLAQWGQFIDHDLTFTPH 91


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score =  226 bits (580), Expect = 5e-63
 Identities = 78/151 (51%), Positives = 105/151 (69%), Gaps = 6/151 (3%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQ--AGADISMIGQFGVG 1284
            I  +KE +TLTI D+GIGMT+ +++ NLGTIAKSGTK F+E L+     D  +IGQFGVG
Sbjct: 66   ISFDKEARTLTISDNGIGMTREEVIENLGTIAKSGTKEFLEKLKGDQKKDSQLIGQFGVG 125

Query: 1285 FYSAYLVADKVTVTSKH--NDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQA 1342
            FYSA++VADKVTV ++      E   WES   G +TI+    E   RGT+I LH+KED+ 
Sbjct: 126  FYSAFMVADKVTVITRSAGPAAEAVRWESDGEGEYTIEE--IEKEERGTEITLHLKEDED 183

Query: 1343 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKER 1373
            E+++E +I+ I+KK+S FI  PIKL  E+E 
Sbjct: 184  EFLDEWRIRSIIKKYSDFIPVPIKLEKEEEE 214



 Score = 97.5 bits (244), Expect = 1e-20
 Identities = 37/103 (35%), Positives = 57/103 (55%), Gaps = 16/103 (15%)

Query: 435 TVTSKH--NDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEI 492
           TV ++      E   WES   G +TI+    E   RGT+I LH+KED+ E+++E +I+ I
Sbjct: 137 TVITRSAGPAAEAVRWESDGEGEYTIEE--IEKEERGTEITLHLKEDEDEFLDEWRIRSI 194

Query: 493 VKKHSQFIGYPIKLLVEKERE------------KELSEDEEEE 523
           +KK+S FI  PIKL  E+E               E++++E +E
Sbjct: 195 IKKYSDFIPVPIKLEKEEEETINSASALWTRSKSEITDEEYKE 237



 Score = 38.2 bits (90), Expect = 0.031
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 7   QKADADKNDKAVKDLVNLLFETSLLSSGFTLEDP 40
           +K   + ++   KDL  LL++ +LL+ G +LEDP
Sbjct: 564 KKLADEADEAKFKDLAELLYDQALLAEGGSLEDP 597


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
            modification, protein turnover, chaperones].
          Length = 623

 Score =  224 bits (572), Expect = 6e-62
 Identities = 84/158 (53%), Positives = 117/158 (74%), Gaps = 3/158 (1%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEAL-QAGADISMIGQFGVGF 1285
            I  +K+ KTLTI D+GIGMTK +++ NLGTIAKSGTK F+E+L +   D  +IGQFGVGF
Sbjct: 67   ISFDKDNKTLTISDNGIGMTKDEVIENLGTIAKSGTKEFLESLSEDQKDSDLIGQFGVGF 126

Query: 1286 YSAYLVADKVTVTSKHND-DEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEY 1344
            YSA++VADKVTV ++    DE Y WES   G +T++  + EP  RGT+I LH+KE++ E+
Sbjct: 127  YSAFMVADKVTVITRSAGEDEAYHWESDGEGEYTVEDIDKEPR-RGTEITLHLKEEEDEF 185

Query: 1345 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEE 1382
            +EE +++EIVKK+S  I YPI +  EKE+++E+ E E 
Sbjct: 186  LEEWRLREIVKKYSDHIAYPIYIEGEKEKDEEVIEWET 223



 Score =  104 bits (262), Expect = 9e-23
 Identities = 42/107 (39%), Positives = 65/107 (60%), Gaps = 10/107 (9%)

Query: 435 TVTSKHND-DEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIV 493
           TV ++    DE Y WES   G +T++  + EP  RGT+I LH+KE++ E++EE +++EIV
Sbjct: 137 TVITRSAGEDEAYHWESDGEGEYTVEDIDKEPR-RGTEITLHLKEEEDEFLEEWRLREIV 195

Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEE--------EEKKEEEKEED 532
           KK+S  I YPI +  EKE+++E+ E E            K E  +E+
Sbjct: 196 KKYSDHIAYPIYIEGEKEKDEEVIEWETINKAKALWTRNKSEITDEE 242



 Score = 40.3 bits (95), Expect = 0.008
 Identities = 14/34 (41%), Positives = 24/34 (70%)

Query: 7   QKADADKNDKAVKDLVNLLFETSLLSSGFTLEDP 40
           +K  + +++ +V DLV LL++ +LL+ G  LEDP
Sbjct: 574 KKLASLEDEASVADLVELLYDQALLAEGGPLEDP 607


>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
          Length = 814

 Score =  180 bits (458), Expect = 2e-46
 Identities = 90/156 (57%), Positives = 116/156 (74%), Gaps = 2/156 (1%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEAL-QAGADISMIGQFGVGF 1285
            I  NKE+  L+I D+GIGMTK DL+NNLGTIAKSGT  F+EA+ ++G D+S+IGQFGVGF
Sbjct: 128  ISANKEKNILSITDTGIGMTKEDLINNLGTIAKSGTSNFLEAISKSGGDMSLIGQFGVGF 187

Query: 1286 YSAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPD-NSEPLGRGTKIVLHMKEDQAEY 1344
            YSA+LVADKV V +K+N+DEQYIWES+A   FTI  D     L RGT+I LH+KED    
Sbjct: 188  YSAFLVADKVIVYTKNNNDEQYIWESTADAKFTIYKDPRGSTLKRGTRISLHLKEDATNL 247

Query: 1345 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSED 1380
            + +KK+ +++ K+SQFI YPI LL E    +E+  D
Sbjct: 248  MNDKKLVDLISKYSQFIQYPIYLLHENVYTEEVLAD 283



 Score = 82.0 bits (202), Expect = 1e-15
 Identities = 42/89 (47%), Positives = 58/89 (65%), Gaps = 1/89 (1%)

Query: 436 VTSKHNDDEQYIWESSAGGSFTIKPD-NSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVK 494
           V +K+N+DEQYIWES+A   FTI  D     L RGT+I LH+KED    + +KK+ +++ 
Sbjct: 199 VYTKNNNDEQYIWESTADAKFTIYKDPRGSTLKRGTRISLHLKEDATNLMNDKKLVDLIS 258

Query: 495 KHSQFIGYPIKLLVEKEREKELSEDEEEE 523
           K+SQFI YPI LL E    +E+  D  +E
Sbjct: 259 KYSQFIQYPIYLLHENVYTEEVLADIAKE 287


>gnl|CDD|188647 cd05396, An_peroxidase_like, Animal heme peroxidases and related
            proteins.  A diverse family of enzymes, which includes
            prostaglandin G/H synthase, thyroid peroxidase,
            myeloperoxidase, linoleate diol synthase,
            lactoperoxidase, peroxinectin, peroxidasin, and others.
            Despite its name, this family is not restricted to
            metazoans: members are found in fungi, plants, and
            bacteria as well.
          Length = 370

 Score =  162 bits (413), Expect = 1e-43
 Identities = 109/480 (22%), Positives = 162/480 (33%), Gaps = 141/480 (29%)

Query: 693  QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPTGDPSNCVNASSNIC 752
            Q+   T ++D S +YGS+PD AR LR+   G L+                N V   S   
Sbjct: 1    QLNARTPYLDGSSIYGSNPDVARALRTFKGGLLKT---------------NEVKGPSY-- 43

Query: 753  YDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQH 812
                              T  L F+N          P                   CF  
Sbjct: 44   -----------------GTELLPFNNPNPSMGTIGLPP----------------TRCF-- 68

Query: 813  IVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARR 872
                     + GD RVN+ L L    T++LR HN +A++L +++P W   DE +YQEAR 
Sbjct: 69   ---------IAGDPRVNENLLLLAVHTLFLREHNRLADRLKKEHPEWD--DERLYQEARL 117

Query: 873  LVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGE 932
            +VI  +Q I YNE+LP ++G+                                       
Sbjct: 118  IVIAQYQLITYNEYLPAILGKFT------------------------------------- 140

Query: 933  SYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPG--TIKLVNERRVTNQ 990
                     P         DP+V P  L++F   A+R  HSL+P        N +     
Sbjct: 141  --------DPRDDLVLLFPDPDVVPYVLSEFFTAAYRFGHSLVPEGVDRIDENGQPKEIP 192

Query: 991  ELPFSSTMQHPGDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVI 1050
            ++P      +    +      D  LRG   Q        I +   D +F      LE V 
Sbjct: 193  DVPLKDFFFNTSRSILSDTGLDPLLRGFLRQPAG----LIDQNVDDVMF--LFGPLEGVG 246

Query: 1051 VDLVGLTL---------------GFFYLWYCGGFLLQVT-------RLSQVYRGVNDIDY 1088
            +DL  L +                F  L         +        +L+++Y   +D+D 
Sbjct: 247  LDLAALNIQRGRDLGLPSYNEVRRFIGL-KPPTSFQDILTDPELAKKLAELYGDPDDVDL 305

Query: 1089 YIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSL 1148
            ++GGLLE  +           +  E F R   GDR++Y        G     +   L SL
Sbjct: 306  WVGGLLEKKVPPARLGELLATIILEQFKRLVDGDRFYYVNYN--PFGKSGKEELEKLISL 363



 Score = 99.4 bits (248), Expect = 5e-22
 Identities = 47/172 (27%), Positives = 68/172 (39%), Gaps = 14/172 (8%)

Query: 233 SRVFLPDRNISDVHTRMYLEFSQLVAHDITLNPQESTAVPTRGLRGEFGD--DLAARDIQ 290
           SR  L D  +  +   +     Q           ++            G   DLAA +IQ
Sbjct: 204 SRSILSDTGLDPL---LRGFLRQPA-----GLIDQNVDDVMFLFGPLEGVGLDLAALNIQ 255

Query: 291 RGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRGVNDIDYYIGGLLES 350
           RGRD GL  Y + R   GL    SF D+    D E   +L+++Y   +D+D ++GGLLE 
Sbjct: 256 RGRDLGLPSYNEVRRFIGLKPPTSFQDILT--DPELAKKLAELYGDPDDVDLWVGGLLEK 313

Query: 351 VIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSL 402
            +           +  E F R   GDR++Y        G     +   L SL
Sbjct: 314 KVPPARLGELLATIILEQFKRLVDGDRFYYVNYN--PFGKSGKEELEKLISL 363


>gnl|CDD|188658 cd09826, peroxidasin_like, Animal heme peroxidase domain of
            peroxidasin and related proteins.  Peroxidasin is a
            secreted heme peroxidase which is involved in hydrogen
            peroxide metabolism and peroxidative reactions in the
            cardiovascular system. The domain co-occurs with
            extracellular matrix domains and may play a role in the
            formation of the extracellular matrix.
          Length = 440

 Score =  159 bits (404), Expect = 1e-41
 Identities = 131/498 (26%), Positives = 194/498 (38%), Gaps = 159/498 (31%)

Query: 693  QIVQSTHFIDVSFMYGSHPDKARELRSLS--RGKLRV--REVNGRTFMP--TGDPSNCVN 746
            QI Q T +ID S +YGS  ++A ELR L+  RG LRV      G+  +P     P +C  
Sbjct: 39   QINQLTSYIDASNVYGSSDEEALELRDLASDRGLLRVGIVSEAGKPLLPFERDSPMDCRR 98

Query: 747  ASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLV 806
              +                                     ++P                 
Sbjct: 99   DPN-------------------------------------ESP----------------- 104

Query: 807  IGCFQHIVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELV 866
            I CF           L GD R N+QL L    T+WLR HN +A +L++ NP W    E +
Sbjct: 105  IPCF-----------LAGDHRANEQLGLTSMHTLWLREHNRIASELLELNPHWD--GETI 151

Query: 867  YQEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAA 926
            Y E R++V    QHI Y+ WLP ++G                  G +             
Sbjct: 152  YHETRKIVGAQMQHITYSHWLPKILGP----------------VGME------------- 182

Query: 927  GAFRGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERR 986
                               G ++GY+P VNPS   +FA  AFR  H+LI   +  ++E  
Sbjct: 183  -----------------MLGEYRGYNPNVNPSIANEFATAAFRFGHTLINPILFRLDEDF 225

Query: 987  VTNQE--LPFSSTMQHPGDILEVMDVFDSTLRGL--TNQLQQYQDRFITEQTIDFLFRVG 1042
                E  LP       P  ++    + D  LRGL  T    +  D+ +  +  + LF + 
Sbjct: 226  QPIPEGHLPLHKAFFAPYRLVNEGGI-DPLLRGLFATAAKDRVPDQLLNTELTEKLFEMA 284

Query: 1043 SLSLEKVIVDLVGLT--------LGFFYLW--YCGGFLLQ--------------VTRLSQ 1078
                 +V +DL  L         L  +  +  +C   + +                +L +
Sbjct: 285  ----HEVALDLAALNIQRGRDHGLPGYNDYRKFCNLSVAETFEDLKNEIKNDDVREKLKR 340

Query: 1079 VYRGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFT 1138
            +Y    +ID ++GG+LE ++ G    P+  C+ AE F R + GDR++YE     NPG F+
Sbjct: 341  LYGHPGNIDLFVGGILEDLLPGARVGPTLACLLAEQFRRLRDGDRFWYE-----NPGVFS 395

Query: 1139 PGKQLTLTSLTSLAS-LC 1155
            P  QLT    TSLA  LC
Sbjct: 396  P-AQLTQIKKTSLARVLC 412



 Score =  117 bits (296), Expect = 1e-27
 Identities = 55/126 (43%), Positives = 77/126 (61%), Gaps = 7/126 (5%)

Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDL-NDFMDSEQVTRLSQVYRGVNDID 341
           DLAA +IQRGRD+GL  Y DYR+ C LS   +F DL N+  + +   +L ++Y    +ID
Sbjct: 290 DLAALNIQRGRDHGLPGYNDYRKFCNLSVAETFEDLKNEIKNDDVREKLKRLYGHPGNID 349

Query: 342 YYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTS 401
            ++GG+LE ++ G    P+  C+ AE F R + GDR++YE     NPG F+P  QLT   
Sbjct: 350 LFVGGILEDLLPGARVGPTLACLLAEQFRRLRDGDRFWYE-----NPGVFSP-AQLTQIK 403

Query: 402 LTSLAS 407
            TSLA 
Sbjct: 404 KTSLAR 409



 Score = 37.7 bits (88), Expect = 0.048
 Identities = 14/45 (31%), Positives = 22/45 (48%)

Query: 540 EAQLDAIKKVFLSQIICIGSDRIVKMQLQSFYMPSERQFNVTLSD 584
            AQL  IKK  L++++C   D I ++Q   F +P      V+   
Sbjct: 396 PAQLTQIKKTSLARVLCDNGDNITRVQEDVFLVPGNPHGYVSCES 440


>gnl|CDD|188654 cd09822, peroxinectin_like_bacterial, Uncharacterized family of heme
            peroxidases, mostly bacterial.  Animal heme peroxidases
            are diverse family of enzymes which are not restricted to
            animals. Members are also found in metazoans, fungi, and
            plants, and also in bacteria - like most members of this
            family of uncharacterized proteins.
          Length = 420

 Score =  146 bits (371), Expect = 1e-37
 Identities = 95/343 (27%), Positives = 135/343 (39%), Gaps = 98/343 (28%)

Query: 822  LIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHI 881
            L GD+R N+   L    T+++R HN +A++L ++NPS S  DE +YQ AR +VI   Q I
Sbjct: 114  LAGDVRANENPGLTALHTLFVREHNRLADELARRNPSLS--DEEIYQAARAIVIAEIQAI 171

Query: 882  VYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLS 941
             YNE+LP L+GE+                              A  A+            
Sbjct: 172  TYNEFLPALLGEN------------------------------ALPAYS----------- 190

Query: 942  PTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQELPFSSTMQHP 1001
                    GYD  VNP    +F+  A+R  HS++   +   +E       L       +P
Sbjct: 191  --------GYDETVNPGISNEFSTAAYRFGHSMLSSELLRGDEDGTEATSLALRDAFFNP 242

Query: 1002 GDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVDLVGLT---- 1057
             ++ E     D  LRGL +Q+ Q  D FI +   +FLF            DL  L     
Sbjct: 243  DELEENG--IDPLLRGLASQVAQEIDTFIVDDVRNFLFGPPGAGG----FDLAALNIQRG 296

Query: 1058 --------------LGFFYLWYCGGFLLQVT-------------RLSQVYRGVNDIDYYI 1090
                          LG          L  VT             RL+ VY  V+ ID ++
Sbjct: 297  RDHGLPSYNQLREALG----------LPAVTSFSDITSDPDLAARLASVYGDVDQIDLWV 346

Query: 1091 GGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATN 1133
            GGL E  + G L   +F  + A+ F R + GDR+FYE      
Sbjct: 347  GGLAEDHVNGGLVGETFSTIIADQFTRLRDGDRFFYENDDLLL 389



 Score =  123 bits (310), Expect = 1e-29
 Identities = 53/137 (38%), Positives = 77/137 (56%), Gaps = 7/137 (5%)

Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPTGDPSNCVNASSNIC 752
           QI   T +ID S +YGS  ++A  LRS   GKL+    N    +P  +     N +  + 
Sbjct: 50  QINAITAYIDGSNVYGSDEERADALRSFGGGKLKTSVANAGDLLPFNEA-GLPNDNGGVP 108

Query: 753 YD----AGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIG 808
            D    AGD+R N+   L    T+++R HN +A++L ++NPS S  DE +YQ AR +VI 
Sbjct: 109 ADDLFLAGDVRANENPGLTALHTLFVREHNRLADELARRNPSLS--DEEIYQAARAIVIA 166

Query: 809 CFQHIVYNEWLPILIGD 825
             Q I YNE+LP L+G+
Sbjct: 167 EIQAITYNEFLPALLGE 183



 Score = 97.8 bits (244), Expect = 3e-21
 Identities = 48/113 (42%), Positives = 62/113 (54%), Gaps = 2/113 (1%)

Query: 275 GLRGEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVY 334
           G  G  G DLAA +IQRGRD+GL  Y   RE  GL  V SF D+    D +   RL+ VY
Sbjct: 279 GPPGAGGFDLAALNIQRGRDHGLPSYNQLREALGLPAVTSFSDITS--DPDLAARLASVY 336

Query: 335 RGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATN 387
             V+ ID ++GGL E  + G L   +F  + A+ F R + GDR+FYE      
Sbjct: 337 GDVDQIDLWVGGLAEDHVNGGLVGETFSTIIADQFTRLRDGDRFFYENDDLLL 389


>gnl|CDD|188657 cd09825, thyroid_peroxidase, Thyroid peroxidase (TPO).  TPO is a
            member of the animal heme peroxidase family, which is
            expressed in the thyroid and involved in the processing
            of iodine and iodine compounds. Specifically, TPO
            oxidizes iodide via hydrogen peroxide to form active
            iodine, which is then, for example, incorporated into the
            tyrosine residues of thyroglobulin to yield mono- and
            di-iodotyrosines.
          Length = 565

 Score =  140 bits (355), Expect = 2e-34
 Identities = 138/584 (23%), Positives = 201/584 (34%), Gaps = 208/584 (35%)

Query: 614  CRPILIPSNDDFYSQFNRRCLEYKRSLAINCSVGALQPDKICPG-RESNPRPSAYKADVL 672
            C PI +PS D       R CL + RS A+ C  G      +      +NPR         
Sbjct: 103  CFPIQLPSEDPRILG--RACLPFFRSSAV-CGTGDTST--LFGNLSLANPRE-------- 149

Query: 673  ATKLIRLKNFFFTDVETCTFQIVQSTHFIDVSFMYGSHPDKARELRSLSR--GKLRVRE- 729
                                QI   T FID S +YGS    AR LR LS   G LRV   
Sbjct: 150  --------------------QINGLTSFIDASTVYGSTLALARSLRDLSSDDGLLRVNSK 189

Query: 730  --VNGRTFMPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQ 787
               +GR ++P                                         +  E++   
Sbjct: 190  FDDSGRDYLP-----------------------------------------FQPEEVSSC 208

Query: 788  NPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNY 847
            NP  +  +           + CF           L GD R ++ L L  S T+WLR HN 
Sbjct: 209  NPDPNGGE----------RVPCF-----------LAGDGRASEVLTLTASHTLWLREHNR 247

Query: 848  VAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSG 907
            +A  L   NP W    E +YQEAR++V    Q I + +++P ++G          P +  
Sbjct: 248  LARALKSINPHWDG--EQIYQEARKIVGALHQIITFRDYIPKILG----------PEA-- 293

Query: 908  YFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGA 967
             F  Y                                 GY++GYDP VNP+    F+  A
Sbjct: 294  -FDQY--------------------------------GGYYEGYDPTVNPTVSNVFSTAA 320

Query: 968  FRGLHSLIPGTIKLVNERRVTNQELPFSSTMQHPGDILEVMDVF------------DSTL 1015
            FR  H+ I  T++ ++E         F      P   L + D F            D  +
Sbjct: 321  FRFGHATIHPTVRRLDEN--------FQEHPVLPN--LALHDAFFSPWRLVREGGLDPVI 370

Query: 1016 RGLTNQLQ--QYQDRFITEQTIDFLF-RVGSLSLEKVIVDLVGLT--------LGFFYLW 1064
            RGL          D  + E+  + LF    S +L     DL  L         L  +  W
Sbjct: 371  RGLIGGPAKLVTPDDLMNEELTEKLFVLSNSSTL-----DLASLNLQRGRDHGLPGYNDW 425

Query: 1065 --YCGGFLLQ--------------VTRLSQVYRGVNDIDYYIGGLLESVIRGTLSSPSFR 1108
              +CG   L                 ++  +Y+  ++ID ++GGL E  + G  + P F 
Sbjct: 426  REFCGLPRLATPADLATAIADQAVADKILDLYKHPDNIDVWLGGLAEDFLPGARTGPLFA 485

Query: 1109 CVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSLTSLA 1152
            C+  +     + GDR+++E     N   FT   Q       SL+
Sbjct: 486  CLIGKQMKALRDGDRFWWE-----NSNVFTDA-QRRELRKHSLS 523



 Score = 90.2 bits (224), Expect = 3e-18
 Identities = 41/125 (32%), Positives = 67/125 (53%), Gaps = 7/125 (5%)

Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVT-RLSQVYRGVNDID 341
           DLA+ ++QRGRD+GL  Y D+RE CGL  + +  DL   +  + V  ++  +Y+  ++ID
Sbjct: 405 DLASLNLQRGRDHGLPGYNDWREFCGLPRLATPADLATAIADQAVADKILDLYKHPDNID 464

Query: 342 YYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTS 401
            ++GGL E  + G  + P F C+  +     + GDR+++E     N   FT   Q     
Sbjct: 465 VWLGGLAEDFLPGARTGPLFACLIGKQMKALRDGDRFWWE-----NSNVFTDA-QRRELR 518

Query: 402 LTSLA 406
             SL+
Sbjct: 519 KHSLS 523



 Score = 30.5 bits (69), Expect = 8.4
 Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 8/89 (8%)

Query: 195 GAVDTAHVRLMRPYYEDGVSSYR----VSQVDASPLPNARVLSRVFLPDRNIS----DVH 246
           GA +T   R + P YEDG S           +   LP+ R +S   +   + +    D++
Sbjct: 1   GASNTPLARWLPPIYEDGFSEPVGWNKERLYNGFTLPSVREVSNKIMRTSSTAVTPDDLY 60

Query: 247 TRMYLEFSQLVAHDITLNPQESTAVPTRG 275
           + M   + Q + HDI   PQ  +     G
Sbjct: 61  SHMLTVWGQYIDHDIDFTPQSVSRTMFIG 89


>gnl|CDD|188656 cd09824, myeloperoxidase_like, Myeloperoxidases, eosinophil
            peroxidases, and lactoperoxidases.  This well conserved
            family of animal heme peroxidases contains members with
            somewhat diverse functions. Myeloperoxidases are
            lysosomal proteins found in azurophilic granules of
            neutrophils and the lysosomes of monocytes. They are
            involved in the formation of microbicidal agents upon
            activation of activated neutrophils (neutrophils
            undergoing respiratory bursts as a result of
            phagocytosis), by catalyzing the conversion of hydrogen
            peroxide to hypochlorous acid. As a heme protein,
            myeloperoxidase is responsible for the greenish tint of
            pus, which is rich in neutrophils. Eosinophil peroxidases
            are haloperoxidases as well, preferring bromide over
            chloride. Expressed by eosinophil granulocytes, they are
            involved in attacking multicellular parasites and play
            roles in various inflammatory diseases such as asthma.
            The haloperoxidase lactoperoxidase is secreted from
            mucosal glands and provides antibacterial activity by
            oxidizing a variety of substrates such as bromide or
            chloride in the presence of hydrogen peroxide.
          Length = 411

 Score =  120 bits (302), Expect = 1e-28
 Identities = 118/495 (23%), Positives = 182/495 (36%), Gaps = 152/495 (30%)

Query: 693  QIVQSTHFIDVSFMYGSHPDKARELRSLS--RGKLRVRE---VNGRTFMPTGDPSNCVNA 747
            QI   T F+D S +YGS P  A+ LR+L+   G L V +    NG   +P          
Sbjct: 14   QINALTSFVDASMVYGSEPSLAKXLRNLTNQLGLLAVNQRFTDNGLALLP---------- 63

Query: 748  SSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVI 807
                         N   D    +       N  A                         I
Sbjct: 64   -----------FENLHNDPCALR-------NTSAN------------------------I 81

Query: 808  GCFQHIVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVY 867
             CF           L GD RV++   LA   T+ LR HN +A +L + NP W    E +Y
Sbjct: 82   PCF-----------LAGDTRVSENPGLAALHTLLLREHNRLARELHRLNPHWDG--ETLY 128

Query: 868  QEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAG 927
            QEAR++V    Q I Y ++LP+++GE     +           GY+  V+P     F   
Sbjct: 129  QEARKIVGAMVQIITYRDYLPLILGEDAAARLPPYR-------GYNESVDPRIANVFTTA 181

Query: 928  AFRGESYTKDVGLSPTSSGYFQGYDP-EVNPSTLADFAAGAFRGLHSLIPGTIKLVNERR 986
              RG        + P      + Y P   NP             LH     + +++ E  
Sbjct: 182  FRRGH-----TTVQPFVFRLDENYQPHPPNPQV----------PLHKAFFASWRIIRE-- 224

Query: 987  VTNQELPFSSTMQHPGDILEVMDVFDSTLRGL--TNQLQQYQDRFITEQTIDFLFRVGSL 1044
                           G I       D  LRGL  T      Q++ + ++  + LF+    
Sbjct: 225  ---------------GGI-------DPILRGLMATPAKLNNQNQMLVDELRERLFQ---- 258

Query: 1045 SLEKVIVDLVGL--------TLGFFYLW--YCGGFLLQ-VTRLSQV-------------Y 1080
              +++ +DL  L         L  +  W  +CG    Q +  L+ V             Y
Sbjct: 259  QTKRMGLDLAALNLQRGRDHGLPGYNAWRRFCGLSQPQNLAELAAVLNNTVLARKLLDLY 318

Query: 1081 RGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPG 1140
               ++ID +IGG+ E ++ G    P   C+ +  F R + GDR+++E     NPG FT  
Sbjct: 319  GTPDNIDIWIGGVAEPLVPGGRVGPLLACLISRQFRRIRDGDRFWWE-----NPGVFTEE 373

Query: 1141 KQLTLTSLTSLASLC 1155
            ++ +L S++    +C
Sbjct: 374  QRESLRSVSLSRIIC 388



 Score = 80.9 bits (200), Expect = 1e-15
 Identities = 44/126 (34%), Positives = 73/126 (57%), Gaps = 6/126 (4%)

Query: 281 GDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTR-LSQVYRGVND 339
           G DLAA ++QRGRD+GL  Y  +R  CGLS  ++  +L   +++  + R L  +Y   ++
Sbjct: 264 GLDLAALNLQRGRDHGLPGYNAWRRFCGLSQPQNLAELAAVLNNTVLARKLLDLYGTPDN 323

Query: 340 IDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTL 399
           ID +IGG+ E ++ G    P   C+ +  F R + GDR+++E     NPG FT  ++ +L
Sbjct: 324 IDIWIGGVAEPLVPGGRVGPLLACLISRQFRRIRDGDRFWWE-----NPGVFTEEQRESL 378

Query: 400 TSLTSL 405
            S++  
Sbjct: 379 RSVSLS 384


>gnl|CDD|188652 cd09820, dual_peroxidase_like, Dual oxidase and related animal heme
            peroxidases.  Animal heme peroxidases of the dual-oxidase
            like subfamily play vital roles in the innate mucosal
            immunity of gut epithelia. They provide reactive oxygen
            species which help control infection.
          Length = 558

 Score =  113 bits (285), Expect = 1e-25
 Identities = 88/355 (24%), Positives = 144/355 (40%), Gaps = 100/355 (28%)

Query: 824  GDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVY 883
            G+ R N+   L     +W R+HNY+A+++ +++P WS  DE ++QEAR+ VI  +Q+IV+
Sbjct: 204  GNPRGNENPFLLTFGILWFRYHNYLAQRIAREHPDWS--DEDIFQEARKWVIATYQNIVF 261

Query: 884  NEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLSPT 943
             EWLP L+G +                                       YT        
Sbjct: 262  YEWLPALLGTNVPP------------------------------------YT-------- 277

Query: 944  SSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQELPFSSTMQH--- 1000
                  GY P V+P    +F A AFR  H+L+P  +   N +    +E+  +S       
Sbjct: 278  ------GYKPHVDPGISHEFQAAAFRFGHTLVPPGVYRRNRQC-NFREVLTTSGGSPALR 330

Query: 1001 -------PGDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVDL 1053
                     + L   D+ D  L G+ +Q+ + +D  I E   D+LF  G L   +   DL
Sbjct: 331  LCNTYWNSQEPLLKSDI-DELLLGMASQIAEREDNIIVEDLRDYLF--GPLEFSR--RDL 385

Query: 1054 VGLT-----------------------LGFFYLWYCGGFLLQ---VTRLSQVYRG-VNDI 1086
            + L                           +       F      + RL+++Y   ++ +
Sbjct: 386  MALNIQRGRDHGLPDYNTAREAFGLPPRTTWSDINPDLFKKDPELLERLAELYGNDLSKL 445

Query: 1087 DYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGK 1141
            D Y+GG+LES  +G      FR +  + F R + GDR+++E       G FT  +
Sbjct: 446  DLYVGGMLES--KGGGPGELFRAIILDQFQRLRDGDRFWFEN---VKNGLFTAEE 495



 Score =  105 bits (264), Expect = 3e-23
 Identities = 49/142 (34%), Positives = 75/142 (52%), Gaps = 12/142 (8%)

Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLR-------VREVNGRTFM--PTGDPSN 743
           Q+ + T +ID S +YGS    +  LRS S G+L         R    R  +  P   PS 
Sbjct: 133 QLNEVTSWIDGSSIYGSSKAWSDALRSFSGGRLASGDDGGFPRRNTNRLPLANPP-PPSY 191

Query: 744 CVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEAR 803
                    +  G+ R N+   L     +W R+HNY+A+++ +++P WS  DE ++QEAR
Sbjct: 192 HGTRGPERLFKLGNPRGNENPFLLTFGILWFRYHNYLAQRIAREHPDWS--DEDIFQEAR 249

Query: 804 RLVIGCFQHIVYNEWLPILIGD 825
           + VI  +Q+IV+ EWLP L+G 
Sbjct: 250 KWVIATYQNIVFYEWLPALLGT 271



 Score = 78.1 bits (193), Expect = 1e-14
 Identities = 42/117 (35%), Positives = 64/117 (54%), Gaps = 9/117 (7%)

Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLND---FMDSEQVTRLSQVYRG-VN 338
           DL A +IQRGRD+GL  Y   RE  GL    ++ D+N      D E + RL+++Y   ++
Sbjct: 384 DLMALNIQRGRDHGLPDYNTAREAFGLPPRTTWSDINPDLFKKDPELLERLAELYGNDLS 443

Query: 339 DIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGK 395
            +D Y+GG+LES  +G      FR +  + F R + GDR+++E       G FT  +
Sbjct: 444 KLDLYVGGMLES--KGGGPGELFRAIILDQFQRLRDGDRFWFEN---VKNGLFTAEE 495



 Score = 56.5 bits (137), Expect = 7e-08
 Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 14/85 (16%)

Query: 183 DGTCNNLRRPWWGAVDTAHVRLMRPYYEDGVSSYRVSQVDASPLPNARVLSRVF------ 236
           DG  NNL  P WGA D+   R +  +Y DGV  Y  S  +    PN R LS +       
Sbjct: 2   DGWYNNLAHPEWGAADSRLTRRLPAHYSDGV--YAPSGEE---RPNPRSLSNLLMKGESG 56

Query: 237 LPDRNISDVHTRMYLEFSQLVAHDI 261
           LP        T + + F Q V  +I
Sbjct: 57  LPSTRNR---TALLVFFGQHVVSEI 78


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 97.1 bits (242), Expect = 1e-20
 Identities = 38/62 (61%), Positives = 49/62 (79%), Gaps = 2/62 (3%)

Query: 482 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE--EKEEDKTPKLED 539
           EY+EE++IKE+VKKHS+FI YPI L VEKE EKE+ ++EEEE+KEE  E+EE  T K E+
Sbjct: 1   EYLEERRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEE 60

Query: 540 EA 541
             
Sbjct: 61  VD 62



 Score = 79.0 bits (195), Expect = 6e-15
 Identities = 28/41 (68%), Positives = 36/41 (87%)

Query: 1343 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEE 1383
            EY+EE++IKE+VKKHS+FI YPI L VEKE EKE+ ++EEE
Sbjct: 1    EYLEERRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEE 41



 Score = 73.3 bits (180), Expect = 4e-13
 Identities = 31/64 (48%), Positives = 43/64 (67%), Gaps = 9/64 (14%)

Query: 1   IVETLRQKADADKNDKAVKDLVNLLFETSLLSSGFTLEDPQ---------VELTLSGRPD 51
           I++ LR++ +ADKNDK VKDL  LL+ET+LL+SGF+LEDP+         ++L LS   D
Sbjct: 439 IIKELRKRVEADKNDKTVKDLARLLYETALLTSGFSLEDPKAFASRIYRMIKLGLSIDED 498

Query: 52  RSPE 55
              E
Sbjct: 499 EEVE 502


>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
          Length = 601

 Score = 89.6 bits (223), Expect = 4e-18
 Identities = 43/147 (29%), Positives = 79/147 (53%), Gaps = 4/147 (2%)

Query: 1228 IPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYS 1287
            + +    TL + D+GIG+T+ ++   L TI +S  +   E L    +   +GQFG+G  S
Sbjct: 57   LTDAGGGTLIVEDNGIGLTEEEVHEFLATIGRSSKRD--ENLGFARN-DFLGQFGIGLLS 113

Query: 1288 AYLVADKVTVTSKHNDDEQYI-WESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIE 1346
             +LVAD++ V S+   D   + W   A G+++++   +E    GT + L  + D  E++E
Sbjct: 114  CFLVADEIVVVSRSAKDGPAVEWRGKADGTYSVRKLETERAEPGTTVYLRPRPDAEEWLE 173

Query: 1347 EKKIKEIVKKHSQFIGYPIKLLVEKER 1373
             + ++E+ KK+   +  PI++  EK  
Sbjct: 174  RETVEELAKKYGSLLPVPIRVEGEKGG 200



 Score = 42.6 bits (101), Expect = 0.002
 Identities = 17/65 (26%), Positives = 35/65 (53%)

Query: 448 WESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLL 507
           W   A G+++++   +E    GT + L  + D  E++E + ++E+ KK+   +  PI++ 
Sbjct: 136 WRGKADGTYSVRKLETERAEPGTTVYLRPRPDAEEWLERETVEELAKKYGSLLPVPIRVE 195

Query: 508 VEKER 512
            EK  
Sbjct: 196 GEKGG 200


>gnl|CDD|188648 cd09816, prostaglandin_endoperoxide_synthase, Animal prostaglandin
           endoperoxide synthase and related bacterial proteins.
           Animal prostaglandin endoperoxide synthases, including
           prostaglandin H2 synthase and a set of similar bacterial
           proteins which may function as cyclooxygenases.
           Prostaglandin H2 synthase catalyzes the synthesis of
           prostaglandin H2 from arachidonic acid. In two reaction
           steps, arachidonic acid is converted to Prostaglandin
           G2, a peroxide (cyclooxygenase activity) and
           subsequently converted to the end product via the
           enzyme's peroxidase activity. Prostaglandin H2 synthase
           is the target of aspirin and other non-steroid
           anti-inflammatory drugs such as ibuprofen, which block
           the substrate's access to the active site and may
           acetylate a conserved serine residue. In humans and
           other mammals, prostaglandin H2 synthase (PGHS), also
           called cyclooxygenase (COX) is present as at least two
           isozymes, PGHS-1 (or COX-1) and PGHS-2 (or COX-2),
           respectively. PGHS-1 is expressed constitutively in most
           mammalian cells, while the expression of PGHS-2 is
           induced via inflammation response in endothelial cells,
           activated macrophages, and others. COX-3 is a splice
           variant of COX-1.
          Length = 490

 Score = 70.0 bits (172), Expect = 4e-12
 Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 18/138 (13%)

Query: 697 STHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPT-------------GDPSN 743
           S H ID+S +YG    +   LR    GKL+ + +NG  + P                P  
Sbjct: 127 SNHGIDLSQIYGLTEARTHALRLFKDGKLKSQMINGEEYPPYLFEDGGVKMEFPPLVPPL 186

Query: 744 CVNASSNI---CYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQ 800
               +       +  G  R N    L +  T+WLR HN V + L +++P W   DE ++Q
Sbjct: 187 GDELTPEREAKLFAVGHERFNLTPGLFMLNTIWLREHNRVCDILKKEHPDWD--DERLFQ 244

Query: 801 EARRLVIGCFQHIVYNEW 818
            AR ++IG    IV  ++
Sbjct: 245 TARNILIGELIKIVIEDY 262



 Score = 51.5 bits (124), Expect = 3e-06
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 4/77 (5%)

Query: 275 GLR--GEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQ 332
           GLR    F   +  R I++GR   L  + DYR+  GL    SF +L    D E    L +
Sbjct: 349 GLRNTPPFLLPVEVRSIEQGRKLRLASFNDYRKRFGLPPYTSFEELTG--DPEVAAELEE 406

Query: 333 VYRGVNDIDYYIGGLLE 349
           +Y  V+ +++Y+G   E
Sbjct: 407 LYGDVDAVEFYVGLFAE 423



 Score = 50.3 bits (121), Expect = 5e-06
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 2/64 (3%)

Query: 823 IGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIV 882
           +G  R N    L +  T+WLR HN V + L +++P W   DE ++Q AR ++IG    IV
Sbjct: 201 VGHERFNLTPGLFMLNTIWLREHNRVCDILKKEHPDWD--DERLFQTARNILIGELIKIV 258

Query: 883 YNEW 886
             ++
Sbjct: 259 IEDY 262


>gnl|CDD|188650 cd09818, PIOX_like, Animal heme oxidases similar to plant
           pathogen-inducible oxygenases.  This is a diverse family
           of oxygenases related to the animal heme peroxidases,
           with members from plants, animals, and bacteria. The
           plant pathogen-inducible oxygenases (PIOX) oxygenate
           fatty acids into 2R-hydroperoxides. They may be involved
           in the hypersensitive reaction, rapid and localized cell
           death induced by infection with pathogens, and the
           rapidly induced expression of PIOX may be caused by the
           oxidative burst that occurs in the process of cell
           death.
          Length = 484

 Score = 65.0 bits (159), Expect = 2e-10
 Identities = 38/162 (23%), Positives = 70/162 (43%), Gaps = 19/162 (11%)

Query: 698 THFIDVSFMYGSHPDKARELRSLSR-GKLRVREVNGRTFMPTGDPSNCVNASSNICYDAG 756
           TH+ D S +YGS  +  + LR+    GKL++        +P  D    +  +       G
Sbjct: 91  THWWDGSQIYGSTEEAQKRLRTFPPDGKLKLDA---DGLLPV-DEHTGLPLT-------G 139

Query: 757 DIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVYN 816
               N  + L++  T+++R HN + + L ++ P WS  DE ++ +AR +       I   
Sbjct: 140 FND-NWWVGLSLLHTLFVREHNAICDALRKEYPDWS--DEQLFDKARLVNAALMAKIHTV 196

Query: 817 EWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPS 858
           EW P ++     +  L++A+    W      +   L +   S
Sbjct: 197 EWTPAILA----HPTLEIAMRANWWGLLGERLKRVLGRDGTS 234



 Score = 57.3 bits (139), Expect = 3e-08
 Identities = 38/99 (38%), Positives = 49/99 (49%), Gaps = 5/99 (5%)

Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRG-VNDID 341
           DLAA DI R R+ G+  Y ++R L  L   +SF DL    D E    L +VY G V  +D
Sbjct: 341 DLAAIDILRDRERGVPRYNEFRRLLHLPPAKSFEDLTG--DEEVAAELREVYGGDVEKVD 398

Query: 342 YYIGGLLESVIRG-TLSSPSFRCVFAEAFYRYKYGDRYF 379
             +G L E +  G   S  +FR     A  R K  DR+F
Sbjct: 399 LLVGLLAEPLPPGFGFSDTAFRIFILMASRRLK-SDRFF 436



 Score = 40.3 bits (95), Expect = 0.007
 Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 10/82 (12%)

Query: 818 WLP-------ILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEA 870
            LP        L G    N  + L++  T+++R HN + + L ++ P WS  DE ++ +A
Sbjct: 126 LLPVDEHTGLPLTGFND-NWWVGLSLLHTLFVREHNAICDALRKEYPDWS--DEQLFDKA 182

Query: 871 RRLVIGCFQHIVYNEWLPILIG 892
           R +       I   EW P ++ 
Sbjct: 183 RLVNAALMAKIHTVEWTPAILA 204



 Score = 32.3 bits (74), Expect = 1.9
 Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 180 RTLDGTCNNLRRPWWGAVDTAHVRLMRPYYEDGVSSYRVSQVDASPLPNARVLSRVFL 237
           RT DG+ N+L  P  G+V T   R +         +      D    PN RV+SR  L
Sbjct: 1   RTADGSYNDLDNPSMGSVGTRFGRNVPL------DATFPEDKDELLTPNPRVISRRLL 52


>gnl|CDD|188653 cd09821, An_peroxidase_bacterial_2, Uncharacterized bacterial
           family of heme peroxidases.  Animal heme peroxidases are
           diverse family of enzymes which are not restricted to
           metazoans; members are also found in fungi, and plants,
           and in bacteria - like this family of uncharacterized
           proteins.
          Length = 570

 Score = 60.1 bits (146), Expect = 6e-09
 Identities = 49/178 (27%), Positives = 74/178 (41%), Gaps = 41/178 (23%)

Query: 824 GDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQ----------------NPSWSNRDELVY 867
           GD RVN+ + L    TV+ R HN + +++                   N +W    E ++
Sbjct: 181 GDGRVNENIGLTAVHTVFHREHNRLVDQIKDTLLQSADLAFANEAGGNNLAWDG--ERLF 238

Query: 868 QEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAG 927
           Q AR      +QH+V+ E+   +         G+     G F GY+PE+NPS  A+FA  
Sbjct: 239 QAARFANEMQYQHLVFEEFARRIQP-------GIDG--FGSFNGYNPEINPSISAEFAHA 289

Query: 928 AFR-GES-YTKDVGL--------SPTSSGYFQGY--DPEVNPSTL-ADFAAGAF-RGL 971
            +R G S  T+ V               G    +       P+TL A+  AGA  RG+
Sbjct: 290 VYRFGHSMLTETVTRIGPDADEGLDNQVGLIDAFLNPVAFLPATLYAEEGAGAILRGM 347



 Score = 42.8 bits (101), Expect = 0.001
 Identities = 44/186 (23%), Positives = 64/186 (34%), Gaps = 59/186 (31%)

Query: 696 QSTHFIDVSFMYGSHPDKARELR-SLSRGKLRVREVNGRT---------FM--------- 736
            +T F+D +  YGSH      LR     G    R + G T         F+         
Sbjct: 86  VTTPFVDQNQTYGSHASHQVFLREYDGDGVATGRLLEGATGGSARTGHAFLDDIAHNAAP 145

Query: 737 ----------PTGDPSNCVNASSNICYD---------AGDIRVNQQLDLAVSQTVWLRFH 777
                     PT DP       S   YD         AGD RVN+ + L    TV+ R H
Sbjct: 146 KGGLGSLRDNPTEDPPGPGAPGS---YDNELLDAHFVAGDGRVNENIGLTAVHTVFHREH 202

Query: 778 NYVAEKLIQQ----------------NPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPI 821
           N + +++                   N +W    E ++Q AR      +QH+V+ E+   
Sbjct: 203 NRLVDQIKDTLLQSADLAFANEAGGNNLAWDG--ERLFQAARFANEMQYQHLVFEEFARR 260

Query: 822 LIGDIR 827
           +   I 
Sbjct: 261 IQPGID 266



 Score = 39.7 bits (93), Expect = 0.013
 Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 336 GVNDIDYYIGGLLES-VIRGTLSSPSFRCVFAEAFYRYKYGDRYFY 380
           G++++D ++GGL E  V  G +   +F  VF E   R + GDR++Y
Sbjct: 482 GLDNVDLWVGGLAEKQVPFGGMLGSTFNFVFEEQMDRLQDGDRFYY 527



 Score = 39.7 bits (93), Expect = 0.013
 Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 1082 GVNDIDYYIGGLLES-VIRGTLSSPSFRCVFAEAFYRYKYGDRYFY 1126
            G++++D ++GGL E  V  G +   +F  VF E   R + GDR++Y
Sbjct: 482  GLDNVDLWVGGLAEKQVPFGGMLGSTFNFVFEEQMDRLQDGDRFYY 527


>gnl|CDD|188651 cd09819, An_peroxidase_bacterial_1, Uncharacterized bacterial
           family of heme peroxidases.  Animal heme peroxidases are
           diverse family of enzymes which are not restricted to
           metazoans; members are also found in fungi, and plants,
           and in bacteria - like this family of uncharacterized
           proteins.
          Length = 465

 Score = 57.7 bits (140), Expect = 2e-08
 Identities = 30/109 (27%), Positives = 52/109 (47%), Gaps = 7/109 (6%)

Query: 822 LIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHI 881
           LIGD R ++ L +A     +LRFHN V + L             +++EARRLV   +Q +
Sbjct: 145 LIGDPRNDENLIVAQLHLAFLRFHNAVVDALRAHGTPGD----ELFEEARRLVRWHYQWL 200

Query: 882 VYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFR 930
           V N++LP +       DV  +     +++ +        + +F+  A+R
Sbjct: 201 VLNDFLPRICDPDVVDDVLANG--RRFYRFFREGKPFMPV-EFSVAAYR 246



 Score = 50.8 bits (122), Expect = 4e-06
 Identities = 38/136 (27%), Positives = 54/136 (39%), Gaps = 33/136 (24%)

Query: 706 MYGSHPDKARELRSLS----RGKLRVREVNGRTFMPTGDPSNCVNASSNICYDAGDIRVN 761
           +YG  PD +  L   +      KLRV    GR   P G      +        A D+  N
Sbjct: 92  VYGGGPDGSPYLYDQATPNDGAKLRV----GRE-SPGGPGGLPGD-------GARDLPRN 139

Query: 762 QQLD-----------LAVSQT--VWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIG 808
            Q             L V+Q    +LRFHN V + L             +++EARRLV  
Sbjct: 140 GQGTALIGDPRNDENLIVAQLHLAFLRFHNAVVDALRAHGTPGD----ELFEEARRLVRW 195

Query: 809 CFQHIVYNEWLPILIG 824
            +Q +V N++LP +  
Sbjct: 196 HYQWLVLNDFLPRICD 211


>gnl|CDD|217081 pfam02518, HATPase_c, Histidine kinase-, DNA gyrase B-, and
            HSP90-like ATPase.  This family represents the
            structurally related ATPase domains of histidine kinase,
            DNA gyrase B and HSP90.
          Length = 111

 Score = 52.7 bits (127), Expect = 4e-08
 Identities = 21/114 (18%), Positives = 37/114 (32%), Gaps = 33/114 (28%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
            +  +     +T+ D+GIG+   DL            K F    +  +    +G  G+G  
Sbjct: 31   LERDGGRLRITVEDNGIGIPPEDL-----------PKIFEPFFRTDSSSRKVGGTGLGLS 79

Query: 1287 SAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKED 1340
                + +                    GG+ T+    SEP G GT     +  +
Sbjct: 80   IVRKLVELH------------------GGTITV---ESEP-GGGTTFTFTLPLE 111


>gnl|CDD|222246 pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and
            HSP90-like ATPase.  This family represents, additionally,
            the structurally related ATPase domains of histidine
            kinase, DNA gyrase B and HSP90.
          Length = 134

 Score = 48.8 bits (117), Expect = 1e-06
 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 13/82 (15%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNL--GTIAKSGTKAFMEALQAGADISMIGQFGVG 1284
            I P++ E  ++I D+G GM+  +L N L  G  +K G +          D + +G+ G+G
Sbjct: 26   IDPDRGEDGISIEDNGGGMSYEELRNALKLGRSSKEGER----------DSTTLGRKGIG 75

Query: 1285 -FYSAYLVADKVTVTSKHNDDE 1305
               ++  +  K+TVTSK   + 
Sbjct: 76   MKLASLSLGRKLTVTSKKEGES 97


>gnl|CDD|214643 smart00387, HATPase_c, Histidine kinase-like ATPases.  Histidine
            kinase-, DNA gyrase B-, phytochrome-like ATPases.
          Length = 111

 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 16/96 (16%), Positives = 29/96 (30%), Gaps = 17/96 (17%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
            +  + +   +T+ D+G G+   DL            K F    +       IG  G+G  
Sbjct: 31   LERDGDHVEITVEDNGPGIPPEDL-----------EKIFEPFFRTDKRSRKIGGTGLGLS 79

Query: 1287 SAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPD 1322
                + +          +         G +FTI   
Sbjct: 80   IVKKLVELHGGEISVESEPG------GGTTFTITLP 109


>gnl|CDD|238030 cd00075, HATPase_c, Histidine kinase-like ATPases; This family
            includes several ATP-binding proteins for example:
            histidine kinase, DNA gyrase B, topoisomerases, heat
            shock protein HSP90, phytochrome-like ATPases and DNA
            mismatch repair proteins.
          Length = 103

 Score = 39.6 bits (93), Expect = 0.001
 Identities = 10/80 (12%), Positives = 25/80 (31%), Gaps = 16/80 (20%)

Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
            +  + +   + + D+G G+ + DL       +                    G  G+G  
Sbjct: 27   VERDGDHLEIRVEDNGPGIPEEDLERIFERFSDGSRS------------RKGGGTGLGLS 74

Query: 1287 SAYLVAD----KVTVTSKHN 1302
                + +    ++ V S+  
Sbjct: 75   IVKKLVELHGGRIEVESEPG 94


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 7/62 (11%)

Query: 485 EEKKIKEIVKKHSQFIG----YPIK--LLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 538
           E KK      K  +  G    +P K  +L E E +K   E+E+E +KE  K E +    +
Sbjct: 20  EAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAE-RAEARK 78

Query: 539 DE 540
             
Sbjct: 79  RG 80


>gnl|CDD|177921 PLN02283, PLN02283, alpha-dioxygenase.
          Length = 633

 Score = 40.9 bits (96), Expect = 0.005
 Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 23/133 (17%)

Query: 679 LKNFFF---TDVETCTFQIVQS-----THFIDVSFMYGSHPDKARELRSLSRGKLRVREV 730
           LK+F F    +V T +  I        T + D S +YGS+    R +R+   GKL++ E 
Sbjct: 184 LKSFKFYKTKEVPTGSPDIKTGSLNIRTPWWDGSVIYGSNEKGLRRVRTFKDGKLKISED 243

Query: 731 NGRTFMPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPS 790
                   G P             +GD+R N    +++ Q ++++ HN V + L ++ P 
Sbjct: 244 GLLLHDEDGIPI------------SGDVR-NSWAGVSLLQALFVKEHNAVCDALKEEYPD 290

Query: 791 WSNRDELVYQEAR 803
           +   DE +Y+ AR
Sbjct: 291 FD--DEELYRHAR 301



 Score = 37.8 bits (88), Expect = 0.043
 Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 5/99 (5%)

Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRG-VNDID 341
           D+AA +I R R+ G+  Y ++R    +  +  + DL D  D E +  L +VY   V  +D
Sbjct: 472 DMAALEIYRDRERGVARYNEFRRNLLMIPISKWEDLTD--DEEAIEVLREVYGDDVEKLD 529

Query: 342 YYIGGLLESVIRG-TLSSPSFRCVFAEAFYRYKYGDRYF 379
             +G + E  I+G  +S  +F      A  R +  DR+F
Sbjct: 530 LLVGLMAEKKIKGFAISETAFFIFLLMASRRLE-ADRFF 567


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 38.0 bits (89), Expect = 0.034
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 5/73 (6%)

Query: 470 TKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 529
           TK +  + E       EKK +E  K+  +      K   E+E EKE  E+E+EE++EE +
Sbjct: 409 TKKIKKIVEK-----AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 530 EEDKTPKLEDEAQ 542
           EE +  + + + Q
Sbjct: 464 EEKEEEEEKKKKQ 476



 Score = 35.7 bits (83), Expect = 0.20
 Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 10/75 (13%)

Query: 474 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
           L + E++ E++   K               IK +VEK  +K   E +E++KK    ++ +
Sbjct: 392 LELTEEEIEFLTGSKKATKK----------IKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441

Query: 534 TPKLEDEAQLDAIKK 548
             + E++ + +  K+
Sbjct: 442 EEEEEEKEKKEEEKE 456


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 38.3 bits (90), Expect = 0.039
 Identities = 19/90 (21%), Positives = 48/90 (53%), Gaps = 2/90 (2%)

Query: 462 NSEPLGRGTKIVLHMKEDQA-EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDE 520
             +    G K+++ +  +   +++++K + +++K++ +F G+ I  +  +  + +   ++
Sbjct: 116 KQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKF-GFGILKIDFEIDDSKEELEK 174

Query: 521 EEEKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
            E +KEEE E+     LE   +L+A KK  
Sbjct: 175 FEAQKEEEDEKLAKEALEAMKKLEAEKKKQ 204


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 34.8 bits (80), Expect = 0.050
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 511 EREKELSEDEEEEKKEEEKEEDKTPKLE 538
             E++  E EEE+KKEEEKEE++   L 
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEEALA 100



 Score = 32.1 bits (73), Expect = 0.47
 Identities = 11/24 (45%), Positives = 20/24 (83%)

Query: 518 EDEEEEKKEEEKEEDKTPKLEDEA 541
           E++ EEK+EE+K+E++  + E+EA
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 31.7 bits (72), Expect = 0.65
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
           E++ E++  E ++EE+KEEE+EE 
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 30.9 bits (70), Expect = 1.4
 Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 2/51 (3%)

Query: 484 IEEKKIKEIVKKHSQFI--GYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
           +EE  I E +KK +       P         EK   ++EE++K+EE++EE+
Sbjct: 45  LEEVNIDEAIKKAAVAAAAAAPAAAAAAAAEEKAEEKEEEKKKEEEKEEEE 95


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 34.7 bits (80), Expect = 0.069
 Identities = 16/55 (29%), Positives = 21/55 (38%), Gaps = 6/55 (10%)

Query: 484 IEEKKIKEIVKK-HSQFIGYPI-----KLLVEKEREKELSEDEEEEKKEEEKEED 532
           +E   I E++K                      E   E  E EEEEK+EE +EE 
Sbjct: 45  LEGVDIDEVIKNAAEAPAAAAAAGAAAAAAAGAEAAAEADEAEEEEKEEEAEEES 99


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 35.4 bits (82), Expect = 0.090
 Identities = 28/109 (25%), Positives = 60/109 (55%), Gaps = 14/109 (12%)

Query: 435 TVTSKHNDDE------QYIWESSAGGSFTIKP--DNSEPLGRGTKIVLH-MKEDQAEYIE 485
            V ++   +E      +YI E  +  S ++K      +P   G K+++    E +A +++
Sbjct: 51  EVENQTTTEELLQDYWEYIVEELSKKSPSLKSLLKKQKPELEGNKLIIKVPNEIEANFLK 110

Query: 486 EKKIKEIVKKHSQFIGYP-IKLLVE---KEREKELSEDEEEEKKEEEKE 530
           +KK+ ++++++ +F G+  +K+ VE    E  +E  E+ EE+K+EEE +
Sbjct: 111 KKKLDKLLEEYIKF-GFGKLKIDVEVDEDESSEEEIEEFEEQKEEEEAK 158


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 33.2 bits (76), Expect = 0.11
 Identities = 14/65 (21%), Positives = 33/65 (50%), Gaps = 11/65 (16%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
           M+      IEE+K+ EI +           +++E E  ++  +DE+++  +++ ++D   
Sbjct: 24  MEARTEALIEEEKV-EIARS----------VMIEGEEPEDDDDDEDDDDDDDKDDKDDDD 72

Query: 536 KLEDE 540
             +DE
Sbjct: 73  DDDDE 77


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 36.5 bits (84), Expect = 0.12
 Identities = 15/27 (55%), Positives = 23/27 (85%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTP 535
           E+E E+E  E+EEEE++EEE+EE++ P
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEP 893



 Score = 35.7 bits (82), Expect = 0.23
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDK 533
           E+E E+E  E+EEEE++EEE+EE++
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 35.0 bits (80), Expect = 0.33
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 540
           E+E E+E  E+EEEE++EEE+EE+    L  E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 35.0 bits (80), Expect = 0.38
 Identities = 15/36 (41%), Positives = 26/36 (72%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
           + E E+E  E+EEEE++EEE+EE++  + E+   L+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 36.0 bits (84), Expect = 0.12
 Identities = 15/79 (18%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 475 HMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 534
             ++++ E    +K +    +H + +    + + EKE  +    +EE E+ E  +EE+  
Sbjct: 268 QAEDEELEQENAEKRRMKRLEHRREL---EQQIEEKEERRAAEREEELEEGERLREEEA- 323

Query: 535 PKLEDEAQLDAIKKVFLSQ 553
              E +A+++  ++  L +
Sbjct: 324 ---ERQARIEEERQRLLKE 339



 Score = 34.1 bits (79), Expect = 0.49
 Identities = 18/57 (31%), Positives = 33/57 (57%), Gaps = 7/57 (12%)

Query: 475 HMKEDQAEYIEEKKI--KEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 529
            + +++   IEEKK    E  ++  +     I  ++E+ER K L+E+EE E+K +E+
Sbjct: 17  KVNKERDAQIEEKKRIKAEEKEEERR-----IDEMMEEERLKALAEEEERERKRKEE 68



 Score = 32.6 bits (75), Expect = 1.3
 Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 11/81 (13%)

Query: 474 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
             M E      EE + +   K+  Q      KL  E +   E   + +EE+KE E+EE+ 
Sbjct: 99  EQMDEIIERIQEEDEAEAQEKREKQ-----KKLREEIDEFNEERIERKEEEKEREREEEL 153

Query: 534 ------TPKLEDEAQLDAIKK 548
                   K E E + +A ++
Sbjct: 154 KILEYQREKAEREEEREAERR 174



 Score = 32.2 bits (74), Expect = 1.8
 Identities = 16/76 (21%), Positives = 40/76 (52%), Gaps = 9/76 (11%)

Query: 475 HMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE--D 532
            +K    E   E+K KE  ++    +    + + E+E+ ++    EE E++ +E+E+  +
Sbjct: 51  RLKALAEEEERERKRKEERREGRAVL---QEQIEEREKRRQ----EEYEERLQEREQMDE 103

Query: 533 KTPKLEDEAQLDAIKK 548
              ++++E + +A +K
Sbjct: 104 IIERIQEEDEAEAQEK 119



 Score = 31.0 bits (71), Expect = 4.3
 Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 8/59 (13%)

Query: 476 MKEDQAEYIEEK-KIKEIVKKHSQFIGYPIKLL--VEKEREKELSEDEEEEKKEEEKEE 531
           ++E+  E+ EE+ + KE  K+  +      + L  +E +REK   E+E E ++ E KEE
Sbjct: 126 LREEIDEFNEERIERKEEEKERER-----EEELKILEYQREKAEREEEREAERRERKEE 179



 Score = 30.3 bits (69), Expect = 6.9
 Identities = 17/93 (18%), Positives = 40/93 (43%), Gaps = 12/93 (12%)

Query: 471 KIVLHMKEDQ---AEYIEEKKIKEIVK-KHSQFIGYPIKLLVEKEREKELSED-----EE 521
           KI+ + +E      E   E++ ++  K +    +        E E E+E  ++      +
Sbjct: 154 KILEYQREKAEREEEREAERRERKEEKEREVARLR---AQQEEAEDEREELDELRADLYQ 210

Query: 522 EEKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQI 554
           EE + +E++++K    +   Q   +++    QI
Sbjct: 211 EEYERKERQKEKEEAEKRRRQKQELQRAREEQI 243



 Score = 30.3 bits (69), Expect = 7.1
 Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 15/71 (21%)

Query: 476 MKEDQAEYIEEKKIKEIVKKH--SQFIGYPIKLLVEKEREKELSED-----------EEE 522
            +ED+AE  +EK+ K+   +    +F    I+   E+E+E+E  E+           E E
Sbjct: 109 QEEDEAE-AQEKREKQKKLREEIDEFNEERIER-KEEEKEREREEELKILEYQREKAERE 166

Query: 523 EKKEEEKEEDK 533
           E++E E+ E K
Sbjct: 167 EEREAERRERK 177


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 34.5 bits (80), Expect = 0.13
 Identities = 24/94 (25%), Positives = 37/94 (39%), Gaps = 19/94 (20%)

Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE-----KEED 532
           ED A   EE+  K +              L E ER+ E  EDE++  K E        E+
Sbjct: 59  EDIAWLKEERDDKTVS-------------LNEAERKAEREEDEKKRLKRENERRKALGEE 105

Query: 533 KTPKLEDEAQLDAIKKVFLSQIICIGSDRIVKMQ 566
               L+D  +       +L +   I  D ++K+Q
Sbjct: 106 PLKSLDDLPKDKEEPDAYLDEAANILLD-LIKLQ 138


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 34.6 bits (79), Expect = 0.14
 Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
           MKE   E    K+ KE  K  SQ          +K++++++SE + + KK+EEK   K P
Sbjct: 1   MKETSYEEALAKQRKESGKTQSQ--------KSDKKKKEKVSEKKGKSKKKEEKPNGKIP 52

Query: 536 KLEDEAQLDAIKKVFLSQ 553
           + E   ++  ++ +   +
Sbjct: 53  EHEPNQEVTEVEVIIEKE 70


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 35.1 bits (81), Expect = 0.15
 Identities = 21/97 (21%), Positives = 47/97 (48%), Gaps = 17/97 (17%)

Query: 474 LHMKEDQ-------AEYIEEKK--------IKEIVKKHSQFIGYPIKLLVEKEREKELSE 518
            H+++         AEY E KK        I+++ K++ +   +  K    K+ +K+  +
Sbjct: 41  SHLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKK-KKDKDK 99

Query: 519 DEEEEKKEEEKEEDKTPKLEDEAQ-LDAIKKVFLSQI 554
           D++++KK+++ E+    + ED+ + L       LS +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTL 136


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 33.5 bits (77), Expect = 0.15
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
               E+E  E+EEEE++EEE EE+
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 32.3 bits (74), Expect = 0.40
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
               E+E  E+EEEE++EE +EE 
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 13/29 (44%), Positives = 19/29 (65%), Gaps = 4/29 (13%)

Query: 513 EKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
                E+EEEE++EEE+EE+     E+EA
Sbjct: 72  AAAAEEEEEEEEEEEEEEEES----EEEA 96



 Score = 30.0 bits (68), Expect = 2.1
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 518 EDEEEEKKEEEKEEDKTPKLEDEAQ 542
             EEEE++EEE+EE++    E+   
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEAMA 98


>gnl|CDD|220763 pfam10453, NUFIP1, Nuclear fragile X mental retardation-interacting
           protein 1 (NUFIP1).  Proteins in this family have been
           implicated in the assembly of the large subunit of the
           ribosome and in telomere maintenance. Some proteins in
           this family contain a CCCH zinc finger. This family
           contains a protein called human fragile X mental
           retardation-interacting protein 1, which is known to
           bind RNA and is phosphorylated upon DNA damage.
          Length = 57

 Score = 32.0 bits (73), Expect = 0.16
 Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 16/62 (25%)

Query: 468 RGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE 527
            GT I L   E+ A++IEE+K     K       YP K  +EK+++      EEE  K  
Sbjct: 12  PGTSIKLQTPEEIAKWIEERK-----KN------YPTKANIEKKKKA-----EEERIKRG 55

Query: 528 EK 529
           E 
Sbjct: 56  EL 57



 Score = 31.2 bits (71), Expect = 0.34
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 12/57 (21%)

Query: 1329 RGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVE-KEREKELSEDEEEV 1384
             GT I L   E+ A++IEE+K     K       YP K  +E K++ +E      E+
Sbjct: 12   PGTSIKLQTPEEIAKWIEERK-----KN------YPTKANIEKKKKAEEERIKRGEL 57


>gnl|CDD|188649 cd09817, linoleate_diol_synthase_like, Linoleate (8R)-dioxygenase
           and related enzymes.  These fungal enzymes, related to
           animal heme peroxidases, catalyze the oxygenation of
           linoleate and similar targets. Linoleate
           (8R)-dioxygenase, also called linoleate:oxygen
           7S,8S-oxidoreductase, generates
           (9Z,12Z)-(7S,8S)-dihydroxyoctadeca-9,12-dienoate as a
           product. Other members are 5,8-linoleate dioxygenase
           (LDS, ppoA) and linoleate 10R-dioxygenase (ppoC),
           involved in the biosynthesis of oxylipins.
          Length = 550

 Score = 35.8 bits (83), Expect = 0.17
 Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 2/62 (3%)

Query: 288 DIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRGVNDIDYYIGGL 347
            I + R++ +    ++R+  GL    +F D+N   D E    L  +Y   ++++ Y G +
Sbjct: 379 GILQAREWNVATLNEFRKFFGLKPYETFEDIN--SDPEVAEALELLYGHPDNVELYPGLV 436

Query: 348 LE 349
            E
Sbjct: 437 AE 438



 Score = 35.4 bits (82), Expect = 0.24
 Identities = 37/160 (23%), Positives = 62/160 (38%), Gaps = 55/160 (34%)

Query: 677 IRLKNFFFTDVETCTFQIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTF- 735
           I + + F TD       I  ++ ++D+S +YGS+ ++  ++R++  GKL+       TF 
Sbjct: 102 IIIHDIFRTDHRDMN--INNTSSYLDLSPLYGSNQEEQNKVRTMKDGKLKPD-----TFS 154

Query: 736 ------MPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQN- 788
                  P   P  C      + ++                    RFHNYV E+L Q N 
Sbjct: 155 DKRLLGQP---PGVCALL---VMFN--------------------RFHNYVVEQLAQINE 188

Query: 789 --------------PSWSNRDELVYQEARRLVIGCFQHIV 814
                               DE ++Q AR +  G + +IV
Sbjct: 189 GGRFTPPGDKLDSSAKEEKLDEDLFQTARLITCGLYINIV 228



 Score = 33.8 bits (78), Expect = 0.73
 Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 15/55 (27%)

Query: 843 RFHNYVAEKLIQQN---------------PSWSNRDELVYQEARRLVIGCFQHIV 882
           RFHNYV E+L Q N                     DE ++Q AR +  G + +IV
Sbjct: 174 RFHNYVVEQLAQINEGGRFTPPGDKLDSSAKEEKLDEDLFQTARLITCGLYINIV 228


>gnl|CDD|212510 cd11724, TP_methylase_like, Uncharacterized subfamily of S-AdoMet
           dependent tetrapyrrole methylases.  TP-methylases use
           S-AdoMet (S-adenosyl-L-methionine or SAM) in the
           methylation of diverse substrates. Most members catalyze
           various methylation steps in cobalamin (vitamin B12)
           biosynthesis, other members like Diphthine synthase and
           Ribosomal RNA small subunit methyltransferase I (RsmI)
           act on other substrates. The function of this subfamily
           is not known.
          Length = 255

 Score = 35.4 bits (82), Expect = 0.18
 Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 9/83 (10%)

Query: 472 IVLHMKEDQ---AEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE 528
           I+   KE     A+Y++ K + E   K   F  Y  K     E EK   E  EEEKK + 
Sbjct: 29  IIFCPKEIVKRFADYLKGKPVLEDPWKL--FWHYGGKPWENLEAEKSPREAFEEEKKRQR 86

Query: 529 KEEDKTPKLEDEAQLDAIKKVFL 551
           +E  K  +   EA L   K V L
Sbjct: 87  EEFVKVIR---EA-LAQGKNVAL 105


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 34.0 bits (78), Expect = 0.26
 Identities = 21/108 (19%), Positives = 43/108 (39%), Gaps = 14/108 (12%)

Query: 434 MTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIV 493
           ++   + + D       S   +FT K +         ++V  + +   + ++EK    ++
Sbjct: 69  LSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVDEKNGAALI 128

Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE--KEEDKTPKLED 539
           ++            ++K R+K   E +E  KK +E  KEE K     D
Sbjct: 129 EE------------LQKHRDKLKKEQKELLKKLDELEKEEKKKIWSVD 164


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 33.0 bits (76), Expect = 0.28
 Identities = 12/80 (15%), Positives = 33/80 (41%), Gaps = 14/80 (17%)

Query: 477 KEDQAEYIEE-------KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 529
            E++  + ++       K+  +  KK  +          E E+  ++S +E+ E + E+ 
Sbjct: 2   PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKS-------EAEKLAKMSAEEKAEYELEKL 54

Query: 530 EEDKTPKLEDEAQLDAIKKV 549
           E++      + A+ +   + 
Sbjct: 55  EKELEELEAELARRELKAEA 74


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 33.4 bits (77), Expect = 0.30
 Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 19/88 (21%)

Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE------EKEEDKTPKLEDE 540
           K  K+++K   +          EKE E+E   + EE  +EE      EKE  K  K E  
Sbjct: 78  KDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL-KREKR 136

Query: 541 AQLDAIKKVFLSQIICIGSDRIVKMQLQ 568
            + +  +K             I+K Q++
Sbjct: 137 RENERKQK------------EILKEQMK 152


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 34.8 bits (80), Expect = 0.33
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
           E+ER+K+L +  EE +K+   E+ +  +LE  A  +   K
Sbjct: 66  EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAK 105



 Score = 31.7 bits (72), Expect = 2.6
 Identities = 21/76 (27%), Positives = 30/76 (39%), Gaps = 16/76 (21%)

Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSED----EEEEKKEEEKEED 532
           K+ QAE  + K+  E   K             E E EK+  E+     EEE K +   E 
Sbjct: 117 KQKQAEEAKAKQAAEAKAK------------AEAEAEKKAKEEAKKQAEEEAKAKAAAEA 164

Query: 533 KTPKLEDEAQLDAIKK 548
           K    E + + +A  K
Sbjct: 165 KKKAAEAKKKAEAEAK 180


>gnl|CDD|184940 PRK14977, PRK14977, bifunctional DNA-directed RNA polymerase A'/A''
            subunit; Provisional.
          Length = 1321

 Score = 35.0 bits (80), Expect = 0.35
 Identities = 29/125 (23%), Positives = 56/125 (44%), Gaps = 8/125 (6%)

Query: 459  KPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQ-FIGYPIKLLVEKEREKELS 517
            K D+ E       I     ED+ +   + +I+E+ K++++ F     KLL +     EL 
Sbjct: 896  KLDHGEAFNLERIIEKQKIEDRGKGASKDEIEELAKEYTKTFNANLPKLLADAIHGAELK 955

Query: 518  EDEEEEKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQIICIGSDRIVKMQLQSFYMPSERQ 577
            EDE E    E KE        ++A+++  + + +     I  +   +M L++F+    + 
Sbjct: 956  EDELEAICAEGKE------GFEKAKVEPGQAIGIISAQSIA-EPGTQMTLRTFHAAGIKA 1008

Query: 578  FNVTL 582
             +VT 
Sbjct: 1009 MDVTH 1013


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 35.2 bits (81), Expect = 0.35
 Identities = 17/57 (29%), Positives = 34/57 (59%), Gaps = 6/57 (10%)

Query: 504 IKLLVEKEREKELS-----EDEEEEKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQII 555
           +K L EK+ EK+       + EE +++ EE+E+    +L ++  ++A KK F+ ++I
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK-FIKEVI 331


>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1.  This model
           describes peptide chain release factor 1 (PrfA, RF-1),
           and excludes the related peptide chain release factor 2
           (PrfB, RF-2). RF-1 helps recognize and terminate
           translation at UAA and UAG stop codons. The
           mitochondrial release factors are prfA-like, although
           not included above the trusted cutoff for this model.
           RF-1 does not have a translational frameshift [Protein
           synthesis, Translation factors].
          Length = 360

 Score = 34.3 bits (79), Expect = 0.41
 Identities = 17/74 (22%), Positives = 37/74 (50%), Gaps = 13/74 (17%)

Query: 479 DQAEYI----EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE--- 531
           DQ +      E  +++EIV  + ++     +   + +  KE+ E+ + E +E  KEE   
Sbjct: 31  DQDKLRKLSKEYSQLEEIVDCYREYQ----QAQEDIKEAKEILEESDPEMREMAKEELEE 86

Query: 532 --DKTPKLEDEAQL 543
             +K  +LE++ ++
Sbjct: 87  LEEKIEELEEQLKV 100


>gnl|CDD|224581 COG1667, COG1667, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 254

 Score = 34.1 bits (78), Expect = 0.42
 Identities = 22/89 (24%), Positives = 35/89 (39%), Gaps = 14/89 (15%)

Query: 473 VLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
           VL +   + E   E +  +  ++      Y I++   +E E+ L E    +   EE EE 
Sbjct: 113 VLSVPITEEEVENEMEELKSSEREELLEKYGIRVPEIEEIERILDEAVGSDYFSEEIEEP 172

Query: 533 -----------KTPKLEDEAQLDAIKKVF 550
                      K P   DE  ++ I KVF
Sbjct: 173 SREELLKKLGIKEP---DENWVETILKVF 198


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 32.0 bits (73), Expect = 0.46
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
           ++E E+ + E  EE ++  E+ E++  KL  E  L+  ++
Sbjct: 27  KEEAEQIIKEAREEAREIIEEAEEEAEKLAQE-ILEEARE 65


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 34.4 bits (79), Expect = 0.48
 Identities = 15/57 (26%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKER--EKELSEDEEEEKKEEEKEEDKTPKLEDE 540
            KK+++  KK  +           + +  E++L ++E E+K++E+ +EDK  + +DE
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431


>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
          Length = 131

 Score = 32.6 bits (74), Expect = 0.49
 Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKE---REKELSEDEEEEKKEEEKEEDKTPKLEDE 540
           K I E ++K+S F+   +K         +E+E   +++EEK+E+ +E+ +   +E+E
Sbjct: 74  KLIAEFLEKYSDFLNEYVKFTPRNNNAIQEEEDDMEQQEEKEEKGREKGRKKNVEEE 130


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 33.6 bits (77), Expect = 0.50
 Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEE---EEKKEEEKEED 532
            K+ + E +EE   KE   K     G    L  E+  ++E+ ED E   E K EE K+E+
Sbjct: 5   CKDAKHENMEEDCCKENENKEED-KGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63

Query: 533 KTPKLEDEA---QLDAIK 547
              K E++    +L+A+K
Sbjct: 64  NKLKEENKKLENELEALK 81



 Score = 30.1 bits (68), Expect = 7.1
 Identities = 10/40 (25%), Positives = 23/40 (57%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
           EKE +    E+ EE+  +E + +++    E++ + + I+K
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEK 41


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 31.8 bits (73), Expect = 0.51
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
               E++  E+EEEE+KEE +EE 
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEA 97



 Score = 31.1 bits (71), Expect = 1.1
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 518 EDEEEEKKEEEKEEDKTPKLEDEA 541
               EEKKEEE+EE++  + E+EA
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEA 97



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 518 EDEEEEKKEEEKEEDKTPKLEDEA 541
              EE+K+EEE+EE+K    E+ A
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAA 98


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 33.1 bits (76), Expect = 0.53
 Identities = 15/58 (25%), Positives = 34/58 (58%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
            +E + E  EE+K  E  ++  +     ++   EK++E+E  ++ EE+ ++E++E +K
Sbjct: 20  QREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77



 Score = 30.4 bits (69), Expect = 4.2
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
            +ER+K   + E E K+EEE EE++  K E+E + +  ++
Sbjct: 28  REERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 33.6 bits (77), Expect = 0.56
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
           KE EKE +  E++E K + ++ED+  + E+E 
Sbjct: 66  KEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97


>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL.  All proteins in
            this family for which the functions are known are
            involved in the process of generalized mismatch repair.
            This family is based on the phylogenomic analysis of JA
            Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
            metabolism, DNA replication, recombination, and repair].
          Length = 312

 Score = 33.8 bits (78), Expect = 0.64
 Identities = 23/142 (16%), Positives = 52/142 (36%), Gaps = 45/142 (31%)

Query: 1238 IIDSGIGMTKADLV-----------NNLGTIAKSGTKAFM-EALQAGADISMIGQFGVGF 1285
            + D+G G+ K DL             +   + +  T  F  EAL + + +S         
Sbjct: 56   VSDNGSGIDKEDLPLACERHATSKIQSFEDLERIETLGFRGEALASISSVS--------- 106

Query: 1286 YSAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLH--------- 1336
                    ++T+T+K +  +   +++   G        + P   GT + +          
Sbjct: 107  --------RLTITTKTSAADGLAYQALLEGGMIESIKPA-PRPVGTTVEVRDLFYNLPVR 157

Query: 1337 ---MKEDQAEYIEEKKIKEIVK 1355
               +K  + E+   +KI ++++
Sbjct: 158  RKFLKSPKKEF---RKILDVLQ 176


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 33.7 bits (77), Expect = 0.71
 Identities = 16/34 (47%), Positives = 27/34 (79%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
           E+E+EKE  ++E+++KKE+ KEE K  K ++EA+
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 31.0 bits (70), Expect = 5.4
 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 5/69 (7%)

Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKK-----EEEKEEDKTPKLED 539
           EEKK K+   K       P +   EK   KE  +++E++ +     EEEK+ ++      
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSR 174

Query: 540 EAQLDAIKK 548
             +    K 
Sbjct: 175 PKKPPKKKP 183


>gnl|CDD|204404 pfam10145, PhageMin_Tail, Phage-related minor tail protein.  Members
            of this family are found in putative phage tail tape
            measure proteins.
          Length = 201

 Score = 33.0 bits (76), Expect = 0.77
 Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 3/59 (5%)

Query: 1244 GMTKADLVNNLGTIAKSGTKAFMEALQAGADI--SMIGQFGVGFYSAYLVADKVTVTSK 1300
            G++  +L+       K  T A    L   ADI  + +  FG+    A   AD +   + 
Sbjct: 30   GLSGKELLGFTPAALKLAT-ATGADLAEAADIAAAAMNAFGLSADDAERAADVLAYAAN 87


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 34.0 bits (77), Expect = 0.84
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 5/72 (6%)

Query: 471  KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 530
            K V  +K+ +AE   EKK  E +KK  +     IK   E ++ +E  +  EE KK EE E
Sbjct: 1633 KKVEQLKKKEAE---EKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEEAKKAEEDE 1687

Query: 531  EDKTPKLEDEAQ 542
            +     L+ EA+
Sbjct: 1688 KKAAEALKKEAE 1699



 Score = 32.4 bits (73), Expect = 2.0
 Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 10/82 (12%)

Query: 477  KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREK----ELSEDEEEEKK------E 526
            K ++A+  EE +I+E++K + +      +   + E  K    EL + EEE+KK      +
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641

Query: 527  EEKEEDKTPKLEDEAQLDAIKK 548
            E +E+ K  +L+   + + IK 
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKA 1663



 Score = 31.6 bits (71), Expect = 3.9
 Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 13/81 (16%)

Query: 476  MKEDQAEYIEEKKIK-EIVKKHSQFIGYPIKLLVEKEREKELSEDE--EEEKKEEE---- 528
            MK ++A+  EE KIK E +KK  +      K  VE+ ++KE  E +  EE KK EE    
Sbjct: 1607 MKAEEAKKAEEAKIKAEELKKAEE-----EKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661

Query: 529  -KEEDKTPKLEDEAQLDAIKK 548
               E+     ED+ + +  KK
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKK 1682



 Score = 31.3 bits (70), Expect = 5.2
 Identities = 17/72 (23%), Positives = 36/72 (50%)

Query: 477  KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 536
              ++A+  EE++  E ++K  +             + +E  + +E +K EE+K+ D+  K
Sbjct: 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKK 1297

Query: 537  LEDEAQLDAIKK 548
             E++ + D  KK
Sbjct: 1298 AEEKKKADEAKK 1309



 Score = 31.3 bits (70), Expect = 5.6
 Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 485  EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
            EEKK  E +KK  +      +   +KE E++  + EE +K EEEK++    K E+E + +
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEA-KKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAE 1771

Query: 545  AIKK 548
             I+K
Sbjct: 1772 EIRK 1775



 Score = 30.5 bits (68), Expect = 8.9
 Identities = 16/64 (25%), Positives = 26/64 (40%), Gaps = 2/64 (3%)

Query: 485  EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
            EE K  +  KK ++          +K  E +  +  E  K E E   D+    E++A+  
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAK--KAAEAAKAEAEAAADEAEAAEEKAEAA 1369

Query: 545  AIKK 548
              KK
Sbjct: 1370 EKKK 1373


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 32.5 bits (74), Expect = 0.84
 Identities = 21/95 (22%), Positives = 40/95 (42%), Gaps = 16/95 (16%)

Query: 448 WESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLL 507
           WE   G      P+      +  + +L     +   I     ++I+K++           
Sbjct: 2   WEFGGGA-----PEKKPEEEKKDENLL-----EHVKITSWDKEDIIKENEDVKDEK---- 47

Query: 508 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
             +E ++E  E++EEE +E E  ED+   +EDE +
Sbjct: 48  --QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEE 80



 Score = 31.8 bits (72), Expect = 1.3
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 13/68 (19%)

Query: 473 VLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
           V   K++  E  EE+  +EI +              + E E+E+ EDEEEE+ E+E++  
Sbjct: 43  VKDEKQEDDEEEEEEDEEEIEEPE------------DIEDEEEIVEDEEEEE-EDEEDNV 89

Query: 533 KTPKLEDE 540
               +E +
Sbjct: 90  DLKDIEKK 97


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 30.4 bits (69), Expect = 0.84
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 3/38 (7%)

Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLE---DEAQLD 544
           K  E++  E+EE  K+EE KE +K    E   +E++LD
Sbjct: 30  KAEEEKREEEEEARKREERKEREKNKSFEELLNESELD 67


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.5 bits (77), Expect = 0.87
 Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 477 KEDQAEYIE-EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
           KE + E  E  ++I EI  +  +      KL  E +  +EL E+ EE +KE E  E    
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255

Query: 536 KLE 538
           KLE
Sbjct: 256 KLE 258



 Score = 32.3 bits (74), Expect = 2.3
 Identities = 18/73 (24%), Positives = 40/73 (54%), Gaps = 15/73 (20%)

Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLED------ 539
           E+++KE+   +++++    +L   K+ EKEL  +E+E KK EE+ +    +L +      
Sbjct: 591 EERLKELEPFYNEYL----EL---KDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643

Query: 540 --EAQLDAIKKVF 550
               +L+ ++K +
Sbjct: 644 ELRKELEELEKKY 656



 Score = 32.0 bits (73), Expect = 2.9
 Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)

Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIK------LLVEKEREKELSEDEEEEKKEEEKE 530
           KE++ E ++ KK+KE+ K+  +                E ER K+       EK E+E E
Sbjct: 336 KEERLEELK-KKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELE 394

Query: 531 EDKTPKLEDEAQLDAIK 547
           E +  K E E ++  I 
Sbjct: 395 ELEKAKEEIEEEISKIT 411



 Score = 31.6 bits (72), Expect = 4.4
 Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 14/83 (16%)

Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKER-EKELSEDEEE-----------EKK 525
           E++ + ++E  +KE  +++ +   +  + L E    EK LS  EEE           E+K
Sbjct: 279 EEKVKELKE--LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336

Query: 526 EEEKEEDKTPKLEDEAQLDAIKK 548
           EE  EE K    E E +L+ +++
Sbjct: 337 EERLEELKKKLKELEKRLEELEE 359


>gnl|CDD|237762 PRK14598, PRK14598, peptide deformylase; Provisional.
          Length = 187

 Score = 32.4 bits (74), Expect = 0.90
 Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 3  ETLRQKADADKN-DKAVKDLVNLLFETSLLSSGFTLEDPQVELTL 46
          E LRQKA   K  D  +++L+  +FE+   +SG  L  PQV  +L
Sbjct: 11 EVLRQKAKPLKGVDAEIEELIASMFESMYNASGIGLAAPQVGRSL 55


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 32.7 bits (75), Expect = 1.1
 Identities = 26/107 (24%), Positives = 50/107 (46%), Gaps = 18/107 (16%)

Query: 471 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 530
           K++          I ++K+K    +  +F     K+L  KE+E+   E+EE+ ++  E+ 
Sbjct: 3   KVIFKRSN----VINKEKLKTHEIQKYRF-----KVLSIKEKERLREEEEEQVEQLREEA 53

Query: 531 EDKTPKLEDEAQLDAIKKV---------FLSQIICIGSDRIVKMQLQ 568
            D+  ++ +EA+ DA + V          L +     S  I K+Q+Q
Sbjct: 54  NDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQ 100



 Score = 31.1 bits (71), Expect = 3.7
 Identities = 20/56 (35%), Positives = 27/56 (48%), Gaps = 7/56 (12%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE 531
             E++A+    KK  E     S  I    KL ++ ERE+E  E+E E   EE K E
Sbjct: 74  AAEEEAKEELLKKTDEA----SSII---EKLQMQIEREQEEWEEELERLIEEAKAE 122


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 31.1 bits (70), Expect = 1.1
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 5/38 (13%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAI 546
           E E E+E+ ED+EEE+ E E+EE      E+E + +  
Sbjct: 50  EMEEEEEVDEDDEEEEGEGEEEEG-----EEEEETEGA 82



 Score = 28.8 bits (64), Expect = 7.4
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
            +E + E+ E+EE ++ +EE+E +   +  +E +
Sbjct: 44  AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEE 77


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 29.2 bits (66), Expect = 1.1
 Identities = 10/27 (37%), Positives = 19/27 (70%)

Query: 507 LVEKEREKELSEDEEEEKKEEEKEEDK 533
           L E+E+ ++  E+E+EE+ EE+ E  +
Sbjct: 20  LCEEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|222896 PHA02592, 52, DNA topisomerase II medium subunit; Provisional.
          Length = 439

 Score = 32.7 bits (75), Expect = 1.3
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 470 TKIVLHMKEDQAEYIEEKKIKEIVKKH-SQFIGYPIKLLVEKEREKELSEDEEEEKKEEE 528
            +IVL  K   +E  EE    +   ++  + +   I  +   EREK L ++ EE +KE E
Sbjct: 362 GEIVLQGKTR-SELTEELSDNKDFGEYADKLVAMNIYSMTSDEREK-LQKEAEELEKEHE 419

Query: 529 KEEDKTPKLEDEAQLDAIKK 548
             +  T K E    L+ +KK
Sbjct: 420 YWKKTTAKKEYIKDLEELKK 439


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 33.0 bits (75), Expect = 1.4
 Identities = 24/130 (18%), Positives = 40/130 (30%), Gaps = 22/130 (16%)

Query: 434 MTVTSKHNDDEQY----IWESSAGGSFTIKPDNSEP----------LGRGTKIVLHMKED 479
                  +D E Y      +      F    D  +                K       D
Sbjct: 212 EAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGD 271

Query: 480 QAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLED 539
            AE  +++  KE VKK +            +E ++E  E E+++ +EE  E        D
Sbjct: 272 DAELEDDEPDKEAVKKEAD--------SKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLD 323

Query: 540 EAQLDAIKKV 549
           E  L+ +   
Sbjct: 324 EPVLEGVDLE 333


>gnl|CDD|237576 PRK13979, PRK13979, DNA topoisomerase IV subunit A; Provisional.
          Length = 957

 Score = 32.8 bits (75), Expect = 1.5
 Identities = 31/128 (24%), Positives = 53/128 (41%), Gaps = 26/128 (20%)

Query: 1227 IIPNKEEKTLTIIDSGIGMT-KADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGV-- 1283
            I  ++E+  L I + G+ +  K++ VN +G IA SG             IS+  +  V  
Sbjct: 853  IKEHEEKNILLITEKGMAIRFKSEAVNPMGKIA-SGVTG----------ISLKDEDKVIY 901

Query: 1284 GFYSAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAE 1343
            G         K+ + SK  D  +   +        IK  N    GRG  ++  +  D   
Sbjct: 902  GSIIDENDTRKIKLKSKAKDKGEIEIKD-------IKLQNRA--GRGNSLMPLLLND--- 949

Query: 1344 YIEEKKIK 1351
            +I++ +IK
Sbjct: 950  FIKDVEIK 957


>gnl|CDD|215009 smart01069, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37.
          Length = 93

 Score = 30.4 bits (69), Expect = 1.5
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 9/60 (15%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVE---KEREKELSEDEEEEKKEEEKEED 532
           M  ++AEY  E+ I+      S   G P  +  E   KE +++   +EE EK++EE+E+D
Sbjct: 40  MPVEEAEYHLERCIE------SGLWGVPNAIEDETEFKELQEQYEVEEEAEKEDEEEEDD 93


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 32.4 bits (74), Expect = 1.6
 Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 511 EREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
           +R KE    E E + +EEK+  K  KL   A+L  I K
Sbjct: 282 QRNKEKRRKELEREAKEEKQLKK--KLAQLARLKEIAK 317



 Score = 31.2 bits (71), Expect = 3.7
 Identities = 16/60 (26%), Positives = 29/60 (48%)

Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 536
           K  + E +EEKK++++  + S+       LL E + + E   D+E   +  E E +   K
Sbjct: 212 KRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINK 271



 Score = 30.8 bits (70), Expect = 5.0
 Identities = 10/59 (16%), Positives = 24/59 (40%)

Query: 484 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
            E+K+ +    +  +      +     E  + L E+ +++ +EE  +E      E E +
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYE 267


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 32.2 bits (74), Expect = 1.6
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 12/58 (20%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
           +K  + E  EE + K+  KK             E+   K      EE++K EEKE  K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKK------------EEREAKLAKLSPEEQRKLEEKERKK 318



 Score = 30.3 bits (69), Expect = 8.0
 Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 508 VEKEREKELSED---EEEEKKEEEKEEDKTPKLEDEAQL 543
           V+K RE+E  +     EEE++EE +E+ +  K E+    
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAK 299


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 32.5 bits (74), Expect = 1.6
 Identities = 16/71 (22%), Positives = 34/71 (47%), Gaps = 11/71 (15%)

Query: 484 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQL 543
           + EK++ E+ K+  + +          E+++E +++EE  K E EK+         E  L
Sbjct: 118 LLEKELDELSKELQKQLQ---NTAEIIEKKRENNKNEERLKFENEKKL--------EESL 166

Query: 544 DAIKKVFLSQI 554
           +  ++ F  Q+
Sbjct: 167 ELEREKFEEQL 177


>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
          Length = 637

 Score = 32.7 bits (75), Expect = 1.7
 Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 8/56 (14%)

Query: 486 EKKIKEIVKKHSQFIG--YPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLED 539
           EKKI E ++++  F+G    + ++      +EL  D E  ++EEE E+ K   +E+
Sbjct: 22  EKKINEKIEENGGFLGEEAALMMVA-----QELGIDIEYGQEEEE-EDFKISDIEE 71


>gnl|CDD|179537 PRK03095, prsA, peptidylprolyl isomerase; Reviewed.
          Length = 287

 Score = 32.3 bits (73), Expect = 1.8
 Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKL 537
           ED A  +++ ++ E VK  SQF  + IK+   KE EK   + + + KKE  +++ +  + 
Sbjct: 192 EDAAYKLKKDEVSEPVK--SQFGYHIIKVTDIKEPEKSFEQSKADIKKELVQKKAQDGEF 249

Query: 538 EDEAQLDAIKK 548
            ++  +  IKK
Sbjct: 250 MNDLMMKEIKK 260


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 31.7 bits (72), Expect = 1.8
 Identities = 28/103 (27%), Positives = 40/103 (38%), Gaps = 10/103 (9%)

Query: 456 FTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKE 515
           + ++P   E      K  +  K D+ E  EEK+  +   K  +          EKE+E  
Sbjct: 14  YIVQPKLEEAKANMQKTFIIKKADEKE--EEKENSDEHVKSKEEEQKIEYEEAEKEKEAG 71

Query: 516 LSEDE--------EEEKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
             E E        EE  +EEEKEE+     + E      KK F
Sbjct: 72  EPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPF 114



 Score = 31.3 bits (71), Expect = 2.0
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 483 YIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
           YI + K++E  K + Q   + IK   EKE EKE S++  + K+EE+K E +  + E EA 
Sbjct: 14  YIVQPKLEE-AKANMQKT-FIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAG 71

Query: 543 LDAIKKVFLSQI 554
               + +   Q 
Sbjct: 72  EPEREDIAEQQE 83


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 32.1 bits (74), Expect = 2.0
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 513 EKELSEDEEEEKKEEEKEED 532
            +E  E+EEEE++EE  EE+
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320



 Score = 30.6 bits (70), Expect = 5.4
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 507 LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
           L E    +  +   EEE++EEE+EE++ P  E+ A
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAA 322


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 30.1 bits (68), Expect = 2.0
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 22/77 (28%)

Query: 471 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 530
           KI+   KE+ A+ IEE    EI+KK              +E  ++L E    +KK+E +E
Sbjct: 28  KIIRKAKEE-AKKIEE----EIIKK-------------AEEEAQKLIE----KKKKEGEE 65

Query: 531 EDKTPKLEDEAQLDAIK 547
           E K    E E +++ +K
Sbjct: 66  EAKKILEEGEKEIEELK 82


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 30.3 bits (69), Expect = 2.0
 Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 8/47 (17%)

Query: 509 EKEREKELSEDE----EEEKKE--EEKEEDKTPKLEDEAQLDAIKKV 549
           EK+++K+ SE E    ++EKKE  E+++ D+  K  D A L AIKK 
Sbjct: 40  EKKKKKKKSEGEFFAEKKEKKEVSEQRKADQ--KAVDAALLAAIKKH 84


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 31.9 bits (73), Expect = 2.1
 Identities = 16/71 (22%), Positives = 29/71 (40%), Gaps = 13/71 (18%)

Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK-TP 535
           +E      + K  K +  K           + EKE+ K   E++   + EE+  ED+   
Sbjct: 47  EEKAKVAAKAKAKKALKAK-----------IEEKEKAKREKEEKGLRELEEDTPEDELAE 95

Query: 536 KL-EDEAQLDA 545
           KL   + Q ++
Sbjct: 96  KLRLRKLQEES 106


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.0 bits (71), Expect = 2.1
 Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 3/46 (6%)

Query: 505 KLLVEKEREKELSEDEEEEKK-EEEKEEDKTPKLEDEAQLDAIKKV 549
           K   +K++E E  E+++++    EE++ED+  K  D A L AIKKV
Sbjct: 85  KKKKKKKKEGEFFEEKKKKYVVSEERKEDQ--KAVDAALLAAIKKV 128


>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2.  This entry is the
           first approximately 250 residues of cortactin-binding
           protein 2. In addition to being a positional candidate
           for autism this protein is expressed at highest levels
           in the brain in humans. The human protein has six
           associated ankyrin repeat domains pfam00023 towards the
           C-terminus which act as protein-protein interaction
           domains.
          Length = 193

 Score = 31.4 bits (71), Expect = 2.4
 Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 17/93 (18%)

Query: 465 PLGRGTKIVLHMKEDQ---------AEYIEEKKIKEIVKKHSQFIGYPIK-----LLVEK 510
           P+    K++   KE Q         AE  + K + E+ ++  + I Y  K      L+E+
Sbjct: 83  PMSELDKVMAKQKETQRRMLAQLLAAEKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQ 142

Query: 511 ERE--KELSEDEEEEKKEEEKEEDKT-PKLEDE 540
           ERE  K+L E E+ ++ ++E+E  K    LE+E
Sbjct: 143 ERERLKKLLEQEKSQQAKKEQEHRKLLATLEEE 175


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.9 bits (73), Expect = 2.5
 Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 19/101 (18%)

Query: 442 DDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIG 501
           DDE+  WE           D           V   KE ++   E+++ K+   K      
Sbjct: 98  DDEEEEWEVEE--------DEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKK----- 144

Query: 502 YPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
                  +++ ++ELSE++EEE  EEE+ E +  K  + A 
Sbjct: 145 ------AKEDSDEELSEEDEEEAAEEEEAEAEKEKASELAT 179


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 32.0 bits (73), Expect = 2.6
 Identities = 23/87 (26%), Positives = 35/87 (40%), Gaps = 6/87 (6%)

Query: 474 LHMKEDQAEYIEEKK------IKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE 527
           L   E   E +EEK        +EI +   +  G    L   +E  ++L   EE  +K E
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLE 335

Query: 528 EKEEDKTPKLEDEAQLDAIKKVFLSQI 554
           EK E    +LE+ A+        L + 
Sbjct: 336 EKLEKLESELEELAEEKNELAKLLEER 362



 Score = 31.7 bits (72), Expect = 4.0
 Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 8/69 (11%)

Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDA 545
           +KK+KE+ ++ SQ        L E  +  ELSE E E ++ EE+ E +  KL  +A+L+ 
Sbjct: 599 KKKLKELEERLSQ--------LEELLQSLELSEAENELEEAEEELESELEKLNLQAELEE 650

Query: 546 IKKVFLSQI 554
           + +  L ++
Sbjct: 651 LLQAALEEL 659



 Score = 30.5 bits (69), Expect = 8.2
 Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 6/78 (7%)

Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKL 537
           E Q   + E     +     +         +++E+E+E  E E E  +E   E      L
Sbjct: 191 EGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAE------L 244

Query: 538 EDEAQLDAIKKVFLSQII 555
           E+E +     K  L +I 
Sbjct: 245 EEEKERLEELKARLLEIE 262


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 31.3 bits (71), Expect = 2.6
 Identities = 12/37 (32%), Positives = 26/37 (70%)

Query: 504 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 540
            KL + +++ KEL  ++ +E+ E+++EE++  + EDE
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187



 Score = 30.5 bits (69), Expect = 4.7
 Identities = 14/41 (34%), Positives = 27/41 (65%)

Query: 504 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
           + +L +K +E E  + +EE++K+EE+EE++  + ED    D
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193


>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p.  In Saccharomyces
           cerevisiae, Sec2p is a GDP/GTP exchange factor for
           Sec4p, which is required for vesicular transport at the
           post-Golgi stage of yeast secretion.
          Length = 90

 Score = 29.6 bits (67), Expect = 2.7
 Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 513 EKELSEDEEEEKKEEEKE-EDKTPKLEDEAQ 542
           EK+     E+EKK+ E+E ED T  L DEA 
Sbjct: 6   EKKKRLRAEKEKKKLEQEVEDLTASLFDEAN 36


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 31.9 bits (73), Expect = 2.8
 Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 13/68 (19%)

Query: 508 VEKEREKELSEDEEEEKKEEEKEEDKTPK--------LEDEAQLDAIKKVFLSQIICIGS 559
            + E + +   D+E+E+K+E KE +K            + EA   A K   L+      +
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTA-----T 209

Query: 560 DRIVKMQL 567
              VK  L
Sbjct: 210 ADPVKAYL 217


>gnl|CDD|215961 pfam00510, COX3, Cytochrome c oxidase subunit III. 
          Length = 258

 Score = 31.5 bits (72), Expect = 2.9
 Identities = 21/62 (33%), Positives = 26/62 (41%), Gaps = 13/62 (20%)

Query: 947  YFQGYDPEVNPSTLAD-------FAAGAFRGLHSLIPGTIKL-VNERRVTNQELPFSSTM 998
              Q Y+    P T++D       F A  F GLH +I GTI L V   R+ N       T 
Sbjct: 172  GLQAYEYYEAPFTISDGVYGSTFFMATGFHGLHVII-GTIFLLVCLLRLLNYHF----TS 226

Query: 999  QH 1000
             H
Sbjct: 227  NH 228


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 19/100 (19%), Positives = 43/100 (43%), Gaps = 6/100 (6%)

Query: 455 SFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGY------PIKLLV 508
           S  +     + + +    ++ + E Q    E  K  + +     ++G         K   
Sbjct: 95  SRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKE 154

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
           +KER +EL ++ EE + E++  E    + E + +L+ +K+
Sbjct: 155 KKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKE 194


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.8 bits (72), Expect = 2.9
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
           E+E E++ SE+EE E +EEE+E +     E+E +
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 31.4 bits (71), Expect = 4.2
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 508 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
           VE+E E+E  E+EEE++ EEE+ ED+  + E EA 
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEAD 479


>gnl|CDD|225572 COG3028, COG3028, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 187

 Score = 30.8 bits (70), Expect = 3.0
 Identities = 23/82 (28%), Positives = 38/82 (46%), Gaps = 16/82 (19%)

Query: 1293 DKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIV---------LHMKEDQAE 1343
            D V    + ++D++ IW S +     IK D       G ++V         + + ED  E
Sbjct: 10   DDVPGDDEEDEDDEIIWVSKS----QIKRDAEALQDLGEELVDLTKAALAKIPLDEDLLE 65

Query: 1344 YIEE-KKIK-EIVKK-HSQFIG 1362
             IE  ++IK EI ++   Q+IG
Sbjct: 66   AIELAQRIKSEIARRRQLQYIG 87


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 30.0 bits (68), Expect = 3.4
 Identities = 12/54 (22%), Positives = 30/54 (55%), Gaps = 14/54 (25%)

Query: 484 IEEKKI-KEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 536
           +EE+KI K+++K+ ++             ++K+  + ++++KK+  K+  K  K
Sbjct: 95  VEEQKIVKQVLKEKAK-------------QKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|237761 PRK14597, PRK14597, peptide deformylase; Provisional.
          Length = 166

 Score = 30.5 bits (69), Expect = 3.5
 Identities = 14/43 (32%), Positives = 20/43 (46%), Gaps = 1/43 (2%)

Query: 5  LRQKADADKN-DKAVKDLVNLLFETSLLSSGFTLEDPQVELTL 46
          LR+KA      D  +K  +  + ET     G  L  PQV ++L
Sbjct: 11 LRKKAKPVTKFDDNLKKTIEEMIETMYHYDGVGLAAPQVGISL 53


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 30.2 bits (69), Expect = 3.5
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 8/50 (16%)

Query: 505 KLLVEKEREKELSEDEEEEKKEEEKE---EDKTPKLE---DEAQLDAIKK 548
           K L+   +  E  E  E+ KKEEEKE    ++ P L      A+L  + K
Sbjct: 5   KSLLLL-KAAEELE-FEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCK 52


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 31.2 bits (71), Expect = 3.5
 Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 6/68 (8%)

Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSE-----DEEEEKKEEEKEEDKTPKLEDE 540
           E+K+  + K   +     I++ + KER   L +      EE EK E+E E+ K      E
Sbjct: 79  EEKLSAV-KDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLE 137

Query: 541 AQLDAIKK 548
             L   + 
Sbjct: 138 KNLAEAEA 145


>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1).  This family
            consists of the yeast mitochondrial ribosomal proteins
            VAR1. Mitochondria possess their own ribosomes
            responsible for the synthesis of a small number of
            proteins encoded by the mitochondrial genome. In yeast
            the two ribosomal RNAs and a single ribosomal protein,
            VAR1, are products of mitochondrial genes, and the
            remaining approximately 80 ribosomal proteins are encoded
            in the nucleus. VAR1 along with 15S rRNA are necessary
            for the formation of mature 37S subunits.
          Length = 350

 Score = 31.5 bits (71), Expect = 3.6
 Identities = 20/74 (27%), Positives = 30/74 (40%), Gaps = 17/74 (22%)

Query: 1196 VANVNNNNN-------NNNNNNITNVYFFLFPGFGQDPIIPNKEEKTLTIIDSGIGMT-- 1246
            + N+NNNNN       NN+NNN+     +       + +  NK    L +    IG +  
Sbjct: 230  INNINNNNNIKYNNMINNSNNNLNINNIY-------NNLNINKISMDLLMNKYLIGWSIL 282

Query: 1247 -KADLVNNLGTIAK 1259
             K  L  N+    K
Sbjct: 283  FKGRLNKNISRSIK 296


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 31.6 bits (72), Expect = 3.7
 Identities = 17/74 (22%), Positives = 29/74 (39%), Gaps = 3/74 (4%)

Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSE--DEEEEKKEEEKEEDKT 534
            E+  E +EE K ++I     +       L   ++   EL E  +E EEK     EE + 
Sbjct: 318 LEELEERLEELK-EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376

Query: 535 PKLEDEAQLDAIKK 548
                  +L  ++ 
Sbjct: 377 LFEALREELAELEA 390


>gnl|CDD|221446 pfam12158, DUF3592, Protein of unknown function (DUF3592).  This
            family of proteins is functionally uncharacterized.This
            family of proteins is found in bacteria, archaea,
            eukaryotes and viruses. Proteins in this family are
            typically between 150 and 242 amino acids in length.
          Length = 134

 Score = 30.1 bits (68), Expect = 3.8
 Identities = 18/95 (18%), Positives = 35/95 (36%), Gaps = 12/95 (12%)

Query: 1053 LVGLTLGFFYLWYCGGFLLQVTRLSQVYRGVNDIDYYIGGLLESVIRG--TLSSPSFRCV 1110
            ++ L +G   L      LLQ  R               G +++S  R      + ++  V
Sbjct: 9    VLFLLIGLVLLVVGVRLLLQAQRSRTWPE-------TPGTVVDSESRDSDGRRATTYYPV 61

Query: 1111 FAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTL 1145
                 +RY  G R  +     +NP  +  G+++ +
Sbjct: 62   VE---FRYDDGRRIRFRSGSGSNPPGYRVGERVPV 93


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 30.0 bits (68), Expect = 4.2
 Identities = 12/32 (37%), Positives = 24/32 (75%), Gaps = 2/32 (6%)

Query: 507 LVEKEREKELSEDEEEEKKEE--EKEEDKTPK 536
           L++++ +KE  ++E ++K+EE   K+E+KT K
Sbjct: 54  LMDEKWKKETEDEEFQQKREEKKRKDEEKTAK 85


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 29.2 bits (66), Expect = 4.3
 Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 6/41 (14%)

Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLED-EAQLDAIKKV 549
           KER+     +EE+ ++EEE +  K  K E+ E +L+ IKKV
Sbjct: 1   KERK-----EEEKAQREEELKRLKNLKREEIEEKLEKIKKV 36


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.1 bits (68), Expect = 4.3
 Identities = 14/42 (33%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 501 GYPIKLLVEK-EREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
           GY I L  E+ E+E++    E EEK+  + +++K  K E + 
Sbjct: 85  GYSINLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKV 126


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 31.2 bits (71), Expect = 4.7
 Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 2/45 (4%)

Query: 501 GYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDA 545
           GYP+   + K +E+E     E+E  EEE+  + T   +   +L+ 
Sbjct: 387 GYPLAEALSKVKEEE--RPREKEGTEEEERREITVYEKRIKKLEE 429


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.8 bits (70), Expect = 4.9
 Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 477 KEDQAEYIEEKKIKEIVKK-HSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
           K  +   +  + I ++ +K  S+F     KL  E + +K+  + EE++KK+ E+ E++  
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIE 353

Query: 536 KLEDEAQ 542
           KLE +A 
Sbjct: 354 KLEVQAT 360


>gnl|CDD|224389 COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate
            transport and metabolism].
          Length = 397

 Score = 30.8 bits (70), Expect = 5.2
 Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 4/64 (6%)

Query: 1242 GIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYSAYL-VADKVTVTSK 1300
            G+ ++    +  +     S       AL+AG DI ++       Y AYL V + V ++  
Sbjct: 243  GVVISDDLSMKAIAAAHGSAADRAEAALKAGVDIVLVCN---ELYEAYLVVLELVGLSEA 299

Query: 1301 HNDD 1304
              DD
Sbjct: 300  RLDD 303


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 30.3 bits (69), Expect = 5.2
 Identities = 19/41 (46%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 511 EREKELSEDEEE-EKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
           + E+  SE E+E +K EEEKEE +    E EA+L+AI+K  
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKRE 157


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 28.9 bits (65), Expect = 5.2
 Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 12/33 (36%)

Query: 510 KEREKELSEDEEEEKKE------------EEKE 530
           K REKEL +++E E++             EEKE
Sbjct: 44  KAREKELKDEKEAERQRRIQAIKERRAAKEEKE 76


>gnl|CDD|226175 COG3649, COG3649, CRISPR system related protein [Defense
           mechanisms].
          Length = 283

 Score = 30.7 bits (69), Expect = 5.3
 Identities = 20/48 (41%), Positives = 25/48 (52%)

Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
           K+HS  +GY    +VEKE E EL  D  E K   E   DK  ++E E 
Sbjct: 225 KEHSSKLGYLSSAVVEKELESELKLDVTELKDVYESLLDKIERIEYEV 272


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 29.6 bits (67), Expect = 5.4
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPK 536
           + + E E  ED+EE+  E+E EE+++P 
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPV 142


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 31.1 bits (70), Expect = 5.4
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 507  LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
            LV KE + +  ED++ ++KE+E+E      ++DE Q D
Sbjct: 3935 LVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPD 3972


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 31.0 bits (70), Expect = 5.4
 Identities = 18/77 (23%), Positives = 30/77 (38%), Gaps = 5/77 (6%)

Query: 459 KPDNSEPLGRGTKIVLHMKE---DQAEYIEEKKIKEIVKKHSQFIGYPIKLLV-EKEREK 514
           K    E            KE      EY +E KI+ ++KK         ++   E+    
Sbjct: 481 KAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEVI 540

Query: 515 ELSEDEEEEK-KEEEKE 530
           ++++ E EE+   E KE
Sbjct: 541 QVNKIETEEEVLSELKE 557


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 30.2 bits (68), Expect = 5.5
 Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 13/113 (11%)

Query: 488 KIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE-----EDKTPKLEDEAQ 542
            IKEI  K  +      KL  E + E+E  E EE+  KE +KE     E+K   +E+  Q
Sbjct: 2   GIKEIRNKTDELE----KLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQ 57

Query: 543 LDAIKKVFLSQIICIGSDRIVKM----QLQSFYMPSERQFNVTLSDNLHLDLL 591
           + A      ++I    S+   +M    ++   Y P +   N      L L+ L
Sbjct: 58  IHADINDMETEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLEEL 110


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 29.6 bits (67), Expect = 5.5
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPK 536
            ++ EK+L E+E + + EE KE ++  K
Sbjct: 98  ARDEEKKLDEEEAKRQHEEAKEREREKK 125


>gnl|CDD|235593 PRK05758, PRK05758, F0F1 ATP synthase subunit delta; Validated.
          Length = 177

 Score = 30.1 bits (69), Expect = 5.7
 Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 10/75 (13%)

Query: 485 EEKK--IKEIVKKHSQFIGYPIKLLVEKEREKELSE--DE-EEEKKEEEKEEDKT----P 535
           E+KK  +  + K  S+++   +K+L E  R   L E  ++ E  + E E   D       
Sbjct: 56  EQKKKLLAAVFKSLSEYVQNFLKVLAENGRLALLPEILEQFEALRAEHENIVDAEVTSAF 115

Query: 536 KLEDEAQLDAIKKVF 550
            L +E QLD +K   
Sbjct: 116 PLSEE-QLDKLKAAL 129


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
            Provisional.
          Length = 522

 Score = 30.8 bits (70), Expect = 5.8
 Identities = 16/45 (35%), Positives = 18/45 (40%)

Query: 1198 NVNNNNNNNNNNNITNVYFFLFPGFGQDPIIPNKEEKTLTIIDSG 1242
            N NNNNNNNN  N  N          + P    K+E     ID  
Sbjct: 17   NYNNNNNNNNQINSNNPNNNGNNQASKLPRGKKKQENPFNKIDEV 61


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.9 bits (71), Expect = 5.9
 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 17/77 (22%)

Query: 480 QAEYIEEKKI----KEIVKKHSQFIGYPIKLLVEKE---REKELSEDEEEEKKEEEKEED 532
           +AE I+++ +    +EI K  ++F         EKE   R  EL + E+   ++EE  + 
Sbjct: 50  EAEAIKKEALLEAKEEIHKLRNEF---------EKELRERRNELQKLEKRLLQKEENLDR 100

Query: 533 KTPKLED-EAQLDAIKK 548
           K   LE  E +L+  +K
Sbjct: 101 KLELLEKREEELEKKEK 117


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 28.4 bits (64), Expect = 6.0
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
           EK+ E+E  ++E  E+ ++E+E  +   L    
Sbjct: 54  EKKEEREEDKEELIERIKKEEETFEDVDLGSAG 86


>gnl|CDD|220069 pfam08915, tRNA-Thr_ED, Archaea-specific editing domain of
           threonyl-tRNA synthetase.  Archaea-specific editing
           domain of threonyl-tRNA synthetase, with marked
           structural similarity to D-amino acids deacylases found
           in eubacteria and eukaryotes. This domain can bind
           D-amino acids, and ensures high fidelity during
           translation. It is especially responsible for removing
           incorrectly attached serine from tRNA-Thr. The domain
           forms a fold that can be be defined as two layers of
           beta-sheets (a three-stranded sheet and a five-stranded
           sheet), with two alpha-helices located adjacent to the
           five-stranded sheet.
          Length = 137

 Score = 29.5 bits (67), Expect = 6.2
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 23/80 (28%)

Query: 496 HSQFIGYPIKLLVEKEREKELSEDEEEEKKEE------------EKEEDKTPKLEDEAQL 543
           H+    Y       + ++K L   EEEE KE             EK +++ P+   E   
Sbjct: 7   HADRFEY-------EPKKKTLKIAEEEEPKEGRLEEALVVFTAVEKGDEENPEEVIEKAA 59

Query: 544 DAIKKVFLSQIICIGSDRIV 563
           + I KV  S+   + + R+V
Sbjct: 60  EEILKVA-SK---VKATRVV 75


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 29.9 bits (68), Expect = 6.4
 Identities = 17/50 (34%), Positives = 32/50 (64%), Gaps = 2/50 (4%)

Query: 486 EKKIKEIVKKHSQFIGYPIK--LLVEKEREKELSEDEEEEKKEEEKEEDK 533
            K+ K I+KK S+    PI      EKE++K +S++E++  KEE+++ ++
Sbjct: 72  TKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 29.7 bits (67), Expect = 6.6
 Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 4/60 (6%)

Query: 474 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
           L +K ++    E+K I EI   + Q      K L   E +K+++  + +EK E+E+ +++
Sbjct: 24  LQVKPNR---YEKKDI-EINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ 79



 Score = 29.3 bits (66), Expect = 8.3
 Identities = 10/61 (16%), Positives = 22/61 (36%), Gaps = 2/61 (3%)

Query: 512 REKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKKV--FLSQIICIGSDRIVKMQLQS 569
               L E+ E +K+    EE K    +   +    +++   L       ++ +   + Q 
Sbjct: 39  NTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQL 98

Query: 570 F 570
           F
Sbjct: 99  F 99


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 29.2 bits (66), Expect = 6.7
 Identities = 11/24 (45%), Positives = 21/24 (87%)

Query: 508 VEKEREKELSEDEEEEKKEEEKEE 531
           +E++RE++  E EE+E+KEE+++E
Sbjct: 14  IEEKREEKEREKEEKERKEEKEKE 37


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 29.9 bits (68), Expect = 6.8
 Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 12/60 (20%)

Query: 507 LVEKEREKELSEDEEEE----KKEEEKEEDKTPK--------LEDEAQLDAIKKVFLSQI 554
           L  K    +  E+E  E    +  E   E K            ED+  L+  ++  L+++
Sbjct: 13  LPPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEM 72


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 30.3 bits (69), Expect = 6.8
 Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 16/91 (17%)

Query: 462 NSEPLGRGTKI--VLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSED 519
           N  P  +G K   VL       E++  K+  E     +       + L  KE+  E +E 
Sbjct: 165 NQVP-RKGVKAEEVLQ------EFLNSKEAVEEAILQTD------QALTAKEKAIE-AER 210

Query: 520 EEEEKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
            + E  E E+E  +  + E+E  ++A ++ +
Sbjct: 211 AKAEAAEAEQELLREKQKEEEQMMEAQERSY 241


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 30.9 bits (70), Expect = 6.9
 Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 9/81 (11%)

Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAI 546
           K + E+VKK+    G P K+++E  RE   ++  +  KK ++K ED   +    A    +
Sbjct: 501 KVVNELVKKY----GPPDKIVIEMARENGTNKGTKNRKKRQKKNEDNIKEAAS-ALEQKL 555

Query: 547 KKVFLSQIICIGSDRIVKMQL 567
           K++ L     I S  I+K++L
Sbjct: 556 KELPLD----IKSKNILKLRL 572


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 30.7 bits (69), Expect = 6.9
 Identities = 18/94 (19%), Positives = 42/94 (44%), Gaps = 6/94 (6%)

Query: 477 KEDQAEYIEEKKIKEIVKKHSQFI----GYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
           +E + +    KK+ E  +  ++ I       ++ L  KE+ K+  E  + ++K E +EE+
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 533 KTPKLEDEAQLDAIKKVFLSQIICIGSDRIVKMQ 566
                 D  +L+  +   L +++    + I   +
Sbjct: 227 LL--YLDYLKLNEERIDLLQELLRDEQEEIESSK 258


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 30.1 bits (68), Expect = 7.1
 Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 13/74 (17%)

Query: 484 IEEKKIKEIVKK-HSQFIG--------YPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 534
           I+  K K++    +S+F G           K   EKE      E E+ +KKEE + + K 
Sbjct: 43  IDAAKFKQMAVAFNSEFAGGTGVKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKK- 101

Query: 535 PKLEDEAQLDAIKK 548
              ++  +L A++K
Sbjct: 102 ---KEMEELKAVQK 112


>gnl|CDD|234083 TIGR03002, outer_YhbN_LptA, lipopolysaccharide transport periplasmic
            protein LptA.  Members of this protein family include
            LptA (previously called YhbN). It was shown to be an
            essential protein in E. coli, implicated in cell envelope
            integrity, and to play a role in the delivery of LPS to
            the outer leaflet of the outer membrane. It works with
            LptB (formerly yhbG), a homolog of ABC transporter
            ATP-binding proteins, encoded by an adjacent gene.
            Numerous homologs in other Proteobacteria are found in a
            conserved location near lipopolysaccharide inner core
            biosynthesis genes. This family is related to organic
            solvent tolerance protein (OstA), though distantly [Cell
            envelope, Biosynthesis and degradation of surface
            polysaccharides and lipopolysaccharides, Transport and
            binding proteins, Other].
          Length = 142

 Score = 29.1 bits (66), Expect = 7.1
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 7/65 (10%)

Query: 1292 ADKVTVTSKHNDDEQYIWESSAGGS---FTIKPDNSEPL-GRGTKIVLHMKEDQAEYIEE 1347
            ADKV VT       +     +A G    F  K DN + + G+  +I     +D+      
Sbjct: 43   ADKVVVTRNDAGGIEKA---TATGKPATFRQKLDNGKEIEGQANRIEYDPAKDEVVLTGN 99

Query: 1348 KKIKE 1352
             ++K+
Sbjct: 100  ARLKQ 104


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 30.5 bits (69), Expect = 7.2
 Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 2/58 (3%)

Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
           E  K+ E  K  S+F+GY   +  + E E +    EE EKK++ K++ +  K+   + 
Sbjct: 319 EIPKLDE--KSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSN 374


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.6 bits (69), Expect = 7.4
 Identities = 16/68 (23%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 483 YIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEE-EKKEEEKEEDKTPKLEDEA 541
           +   KK++  +KK   F     + L+ + +  E     ++  + EEE E  KT  L+  A
Sbjct: 3   HHGVKKLRRALKKAKGFE---RQKLIRRIKNAEKKSKNDKLARIEEEIEALKTLDLDQLA 59

Query: 542 QLDAIKKV 549
           +     K+
Sbjct: 60  KKYLFSKL 67


>gnl|CDD|169396 PRK08349, PRK08349, hypothetical protein; Validated.
          Length = 198

 Score = 29.7 bits (67), Expect = 7.4
 Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 466 LGRGTKIV-LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLV 508
           L RG ++  +H ++D+    +E+K++E+V++  +  G  +K  V
Sbjct: 21  LRRGVEVYPVHFRQDE---KKEEKVRELVERLQELHGGKLKDPV 61



 Score = 29.7 bits (67), Expect = 7.4
 Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 1327 LGRGTKIV-LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLV 1369
            L RG ++  +H ++D+    +E+K++E+V++  +  G  +K  V
Sbjct: 21   LRRGVEVYPVHFRQDE---KKEEKVRELVERLQELHGGKLKDPV 61


>gnl|CDD|235305 PRK04527, PRK04527, argininosuccinate synthase; Provisional.
          Length = 400

 Score = 30.2 bits (68), Expect = 7.6
 Identities = 25/87 (28%), Positives = 37/87 (42%), Gaps = 21/87 (24%)

Query: 62  PIRQT-CQHEDNRVYERQANPEREPEVRGYGTRYRQSSYSSSYGYNRNSRESREYGRTET 120
           PIR+   +H   R YE++   ER     G+G R +Q +Y+     N N       G T +
Sbjct: 142 PIREIQKEHTQTRAYEQKYLEER-----GFGVRAKQKAYT----INENL-----LGVTMS 187

Query: 121 SGSRETYGRPDPSSGRPDQASGLCKPR 147
            G  + +  P       + A G C PR
Sbjct: 188 GGEIDRWEAPG------EGARGWCAPR 208


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 30.4 bits (68), Expect = 7.6
 Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 8/46 (17%)

Query: 511 EREKELSEDEEEEKKEE------EKEEDK--TPKLEDEAQLDAIKK 548
           ER K L E+E+  K+EE      E+EE +    ++E      A K+
Sbjct: 225 ERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270


>gnl|CDD|236603 PRK09640, PRK09640, RNA polymerase sigma factor SigX; Reviewed.
          Length = 188

 Score = 29.9 bits (67), Expect = 7.7
 Identities = 14/36 (38%), Positives = 16/36 (44%)

Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDA 545
           KER K    D       EE  E+K PK E+   LD 
Sbjct: 95  KERRKRRLMDALSLDPLEEASEEKAPKPEERGGLDR 130


>gnl|CDD|224418 COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl
           hydrolases [Carbohydrate transport and metabolism].
          Length = 772

 Score = 30.4 bits (69), Expect = 7.8
 Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 8/60 (13%)

Query: 446 YIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEY--IEEKKIKEIVKKHSQFIGYP 503
              ++SA GSF +           + +   ++  Q +Y  I     K++++K++   G P
Sbjct: 205 LFVDNSAYGSFDVG------SEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKP 258



 Score = 30.4 bits (69), Expect = 7.8
 Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 8/60 (13%)

Query: 1307 YIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEY--IEEKKIKEIVKKHSQFIGYP 1364
               ++SA GSF +           + +   ++  Q +Y  I     K++++K++   G P
Sbjct: 205  LFVDNSAYGSFDVG------SEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKP 258


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 29.7 bits (67), Expect = 7.9
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 504 IKLLVEKEREKELSEDEEEEKKEEEKEED 532
            +L + +++E+E S +EEEE  EEE EE+
Sbjct: 69  ARLELLEQQEEEESAEEEEEDPEEENEEE 97


>gnl|CDD|232942 TIGR00366, TIGR00366, TIGR00366 family protein.  [Hypothetical
           proteins, Conserved].
          Length = 438

 Score = 30.2 bits (68), Expect = 8.0
 Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 510 KEREKELSEDEEEEKKEEEKEEDKTP--KLEDEAQL 543
           K  + +L + E+EE+K+ +K+ED T   KLE+   L
Sbjct: 216 KSIDPKLLKKEKEEEKKLKKKEDATIAEKLENSRVL 251


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 29.9 bits (68), Expect = 8.3
 Identities = 13/34 (38%), Positives = 21/34 (61%), Gaps = 3/34 (8%)

Query: 503 PIKLLVEKERE--KELSEDEEEEKKEE-EKEEDK 533
           PI    E E+E  K +S++E++  KEE +K E+ 
Sbjct: 90  PIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 29.2 bits (66), Expect = 8.4
 Identities = 11/48 (22%), Positives = 21/48 (43%)

Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 534
            KIK ++      I   I+   E +++    + + E + +  KEE K 
Sbjct: 28  DKIKAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKK 75


>gnl|CDD|177121 MTH00046, CYTB, cytochrome b; Validated.
          Length = 355

 Score = 30.3 bits (69), Expect = 8.4
 Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 19/76 (25%)

Query: 1050 IVDLVGLTLGFFYLWYCGGFLLQVTRLSQVYRGVNDIDYYIGGLLESVIRGTLSSPSFRC 1109
            ++DL        Y W C GF+L V  + QV  GV          L S++    S  SF C
Sbjct: 9    LIDL-PTNYSLNYYW-CSGFVLSVFMVIQVLTGV----------LLSLLYVADSLCSFFC 56

Query: 1110 VFA---EAFY----RY 1118
            V +   ++F+    RY
Sbjct: 57   VMSLSNDSFFTWCVRY 72


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
           ribosomal structure and biogenesis].
          Length = 363

 Score = 30.3 bits (69), Expect = 8.8
 Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 12/73 (16%)

Query: 479 DQAEYI----EEKKIKEIVKKHSQFIGY-----PIKLLVEKEREKELSEDEEEEKKEEEK 529
           D  EY     E  +++ IV+K+ ++          K ++ +E++ E+ E  EEE KE E 
Sbjct: 31  DPDEYRKLSKEYAELEPIVEKYREYKKAQEDLEDAKEMLAEEKDPEMREMAEEEIKELEA 90

Query: 530 EEDKTPKLEDEAQ 542
              K  +LE+E +
Sbjct: 91  ---KIEELEEELK 100


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 30.3 bits (68), Expect = 8.9
 Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 1/69 (1%)

Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
            E + K++  +H         L  E E+E++    E   K  E +E+ +  + E+  Q  
Sbjct: 662 LELRAKDLEAEHKSLQEKISSLEDELEKERQNH-QELIAKCRELEEKIERAEQEENMQKL 720

Query: 545 AIKKVFLSQ 553
             ++  + Q
Sbjct: 721 DEEEQKIKQ 729


>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F.  This family represents the
            oligoendopeptidase F clade of the family of larger M3 or
            thimet (for thiol-dependent metallopeptidase)
            oligopeptidase family. Lactococcus lactis PepF hydrolyzed
            peptides of 7 and 17 amino acids with fairly broad
            specificity. The homolog of lactococcal PepF in group B
            Streptococcus was named PepB (PMID:8757883), with the
            name difference reflecting a difference in species of
            origin rather activity; substrate profiles were quite
            similar. Differences in substrate specificity should be
            expected in other species. The gene is duplicated in
            Lactococcus lactis on the plasmid that bears it. A
            shortened second copy is found in Bacillus subtilis
            [Protein fate, Degradation of proteins, peptides, and
            glycopeptides].
          Length = 591

 Score = 30.3 bits (69), Expect = 9.0
 Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 1345 IEEKKIKEIVKKHSQFIGYPIKL-LVEKEREKELSEDEEEV 1384
            IEEK IKE +K   +   Y   L  + +++   LSE+ E++
Sbjct: 112  IEEKIIKEWLKDPEELADYKRALEEIFRDKPHILSEEVEKL 152


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.0 bits (68), Expect = 9.6
 Identities = 12/34 (35%), Positives = 23/34 (67%)

Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQL 543
              +K+  E++E+E++EEE+EE + P+ E+   L
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 29.6 bits (67), Expect = 9.7
 Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 10/63 (15%)

Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIK----LLVE----KEREKELSEDEEEEKKEE 527
           + ED+A  IEEK+        ++  G  +K    LL +    KE++K+ S+ E +E+KE+
Sbjct: 111 LDEDKAAEIEEKEKWTKALAKAE--GVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEK 168

Query: 528 EKE 530
            ++
Sbjct: 169 VEK 171


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 30.2 bits (69), Expect = 9.9
 Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 11/51 (21%)

Query: 319 NDFMDS-------EQVTRLSQVY-RGVNDI---DYYIGGLLESVIRGTLSS 358
            DF+DS       EQV  L + Y R V  I    Y I  +  + IRG L+S
Sbjct: 611 EDFVDSIIDRRLFEQVKDLKRAYERPVLIIEGDLYTIRNIHPNAIRGALAS 661


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 30.0 bits (68), Expect = 9.9
 Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 4/53 (7%)

Query: 503 PIKLLVEKEREKELSEDEEE--EKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQ 553
             +L  EKE ++ L E +     KK+EEK++ +  KLE +A++    + F  Q
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE-KAKIPP-AEFFKRQ 600


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.403 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 71,564,958
Number of extensions: 7210772
Number of successful extensions: 11647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10705
Number of HSP's successfully gapped: 325
Length of query: 1386
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1277
Effective length of database: 6,103,016
Effective search space: 7793551432
Effective search space used: 7793551432
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (28.9 bits)