RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15682
         (224 letters)



>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score =  238 bits (609), Expect = 9e-75
 Identities = 126/221 (57%), Positives = 166/221 (75%), Gaps = 18/221 (8%)

Query: 1   ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
           A DKIRY+SLTDPS L     L I+++P+KE KTLT+ D+G                IGM
Sbjct: 40  ACDKIRYQSLTDPSVLGESPRLCIRVVPDKENKTLTVEDNG----------------IGM 83

Query: 61  TKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDNE 120
           TKADLVNNLGT A+SGTKA MEAL+AG D+SMIGQFGVGFYSAY+VAD+VTVTSK+N +E
Sbjct: 84  TKADLVNNLGTIARSGTKAFMEALEAGGDMSMIGQFGVGFYSAYLVADRVTVTSKNNSDE 143

Query: 121 QYIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPI 180
            Y+WESSAGG+FTI       + RGT+I LH+KEDQ EY+E +++KE++KKHS+FIGY I
Sbjct: 144 SYVWESSAGGTFTITSTPESDMKRGTRITLHLKEDQMEYLEPRRLKELIKKHSEFIGYDI 203

Query: 181 KLLVEKEREKELSEDEEEE--KKEEEKEEDKTPKLEDEGED 219
           +L+VEK  EKE+++++EE+  K +E+ EE K  ++++  E 
Sbjct: 204 ELMVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEG 244


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score =  217 bits (555), Expect = 3e-67
 Identities = 88/192 (45%), Positives = 123/192 (64%), Gaps = 22/192 (11%)

Query: 1   ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
           A+DK+R+E+LTDP+  E + +L I+I  +KE +TLTI D                +GIGM
Sbjct: 41  AIDKLRFEALTDPALYEGDGDLKIRISFDKEARTLTISD----------------NGIGM 84

Query: 61  TKADLVNNLGTTAKSGTKASMEALQ--AGADISMIGQFGVGFYSAYMVADKVTVTSKH-- 116
           T+ +++ NLGT AKSGTK  +E L+     D  +IGQFGVGFYSA+MVADKVTV ++   
Sbjct: 85  TREEVIENLGTIAKSGTKEFLEKLKGDQKKDSQLIGQFGVGFYSAFMVADKVTVITRSAG 144

Query: 117 NDNEQYIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFI 176
              E   WES   G +TI+    +   RGT+I LH+KED+ E+++E +I+ I+KK+S FI
Sbjct: 145 PAAEAVRWESDGEGEYTIEEIEKEE--RGTEITLHLKEDEDEFLDEWRIRSIIKKYSDFI 202

Query: 177 GYPIKLLVEKER 188
             PIKL  E+E 
Sbjct: 203 PVPIKLEKEEEE 214


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 623

 Score =  213 bits (544), Expect = 1e-65
 Identities = 95/218 (43%), Positives = 138/218 (63%), Gaps = 27/218 (12%)

Query: 1   ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
           A+DK+R+E+L+DP   E + +L I+I  +K+ KTLTI D+G                IGM
Sbjct: 42  AIDKLRFEALSDPELGEGDSDLRIRISFDKDNKTLTISDNG----------------IGM 85

Query: 61  TKADLVNNLGTTAKSGTKASMEAL-QAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDN 119
           TK +++ NLGT AKSGTK  +E+L +   D  +IGQFGVGFYSA+MVADKVTV ++    
Sbjct: 86  TKDEVIENLGTIAKSGTKEFLESLSEDQKDSDLIGQFGVGFYSAFMVADKVTVITRSAGE 145

Query: 120 -EQYIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGY 178
            E Y WES   G +T++  + +P  RGT+I LH+KE++ E++EE +++EIVKK+S  I Y
Sbjct: 146 DEAYHWESDGEGEYTVEDIDKEPR-RGTEITLHLKEEEDEFLEEWRLREIVKKYSDHIAY 204

Query: 179 PIKLLVEKEREKELSEDEE--------EEKKEEEKEED 208
           PI +  EKE+++E+ E E            K E  +E+
Sbjct: 205 PIYIEGEKEKDEEVIEWETINKAKALWTRNKSEITDEE 242


>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
          Length = 814

 Score =  163 bits (414), Expect = 2e-46
 Identities = 101/201 (50%), Positives = 134/201 (66%), Gaps = 18/201 (8%)

Query: 1   ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
           AL+KIR+ SL+D S L  EK L I+I  NKE+  L+I D+G                IGM
Sbjct: 103 ALEKIRFLSLSDESVLGEEKKLEIRISANKEKNILSITDTG----------------IGM 146

Query: 61  TKADLVNNLGTTAKSGTKASMEAL-QAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDN 119
           TK DL+NNLGT AKSGT   +EA+ ++G D+S+IGQFGVGFYSA++VADKV V +K+N++
Sbjct: 147 TKEDLINNLGTIAKSGTSNFLEAISKSGGDMSLIGQFGVGFYSAFLVADKVIVYTKNNND 206

Query: 120 EQYIWESSAGGSFTIKPD-NSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGY 178
           EQYIWES+A   FTI  D     L RGT+I LH+KED    + +KK+ +++ K+SQFI Y
Sbjct: 207 EQYIWESTADAKFTIYKDPRGSTLKRGTRISLHLKEDATNLMNDKKLVDLISKYSQFIQY 266

Query: 179 PIKLLVEKEREKELSEDEEEE 199
           PI LL E    +E+  D  +E
Sbjct: 267 PIYLLHENVYTEEVLADIAKE 287


>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
          Length = 601

 Score = 88.5 bits (220), Expect = 3e-20
 Identities = 39/148 (26%), Positives = 78/148 (52%), Gaps = 4/148 (2%)

Query: 42  FIPNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFY 101
            + +    TL + D+GIG+T+ ++   L T  +S  +   E L    +   +GQFG+G  
Sbjct: 56  ELTDAGGGTLIVEDNGIGLTEEEVHEFLATIGRSSKRD--ENLGFARN-DFLGQFGIGLL 112

Query: 102 SAYMVADKVTVTSKHNDNEQYI-WESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYI 160
           S ++VAD++ V S+   +   + W   A G+++++   ++    GT + L  + D  E++
Sbjct: 113 SCFLVADEIVVVSRSAKDGPAVEWRGKADGTYSVRKLETERAEPGTTVYLRPRPDAEEWL 172

Query: 161 EEKKIKEIVKKHSQFIGYPIKLLVEKER 188
           E + ++E+ KK+   +  PI++  EK  
Sbjct: 173 ERETVEELAKKYGSLLPVPIRVEGEKGG 200


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 82.9 bits (205), Expect = 2e-18
 Identities = 39/66 (59%), Positives = 52/66 (78%), Gaps = 2/66 (3%)

Query: 158 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE--EKEEDKTPKLED 215
           EY+EE++IKE+VKKHS+FI YPI L VEKE EKE+ ++EEEE+KEE  E+EE  T K E+
Sbjct: 1   EYLEERRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEE 60

Query: 216 EGEDAD 221
             E+ +
Sbjct: 61  VDEEEE 66


>gnl|CDD|217081 pfam02518, HATPase_c, Histidine kinase-, DNA gyrase B-, and
           HSP90-like ATPase.  This family represents the
           structurally related ATPase domains of histidine kinase,
           DNA gyrase B and HSP90.
          Length = 111

 Score = 44.6 bits (106), Expect = 2e-06
 Identities = 16/95 (16%), Positives = 28/95 (29%), Gaps = 19/95 (20%)

Query: 44  PNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSA 103
            +     +T+ D+GIG+   DL            K      +  +    +G  G+G    
Sbjct: 33  RDGGRLRITVEDNGIGIPPEDL-----------PKIFEPFFRTDSSSRKVGGTGLGLSIV 81

Query: 104 YMVADKVTVTSKHNDNEQYIWESSAGGS-FTIKPD 137
             + +    T               GG+ FT    
Sbjct: 82  RKLVELHGGTITVESEPG-------GGTTFTFTLP 109


>gnl|CDD|222246 pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and
           HSP90-like ATPase.  This family represents,
           additionally, the structurally related ATPase domains of
           histidine kinase, DNA gyrase B and HSP90.
          Length = 134

 Score = 45.0 bits (107), Expect = 3e-06
 Identities = 23/82 (28%), Positives = 41/82 (50%), Gaps = 13/82 (15%)

Query: 42  FIPNKEEKTLTIIDSGIGMTKADLVNNL--GTTAKSGTKASMEALQAGADISMIGQFGVG 99
             P++ E  ++I D+G GM+  +L N L  G ++K G +          D + +G+ G+G
Sbjct: 26  IDPDRGEDGISIEDNGGGMSYEELRNALKLGRSSKEGER----------DSTTLGRKGIG 75

Query: 100 -FYSAYMVADKVTVTSKHNDNE 120
              ++  +  K+TVTSK     
Sbjct: 76  MKLASLSLGRKLTVTSKKEGES 97


>gnl|CDD|214643 smart00387, HATPase_c, Histidine kinase-like ATPases.  Histidine
           kinase-, DNA gyrase B-, phytochrome-like ATPases.
          Length = 111

 Score = 38.8 bits (91), Expect = 3e-04
 Identities = 21/114 (18%), Positives = 33/114 (28%), Gaps = 22/114 (19%)

Query: 24  IKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEA 83
           IK  P     T+T+   G     +   +T+ D+G G+   DL            K     
Sbjct: 18  IKYTPEGGRITVTLERDG-----DHVEITVEDNGPGIPPEDL-----------EKIFEPF 61

Query: 84  LQAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDNEQYIWESSAGGSFTIKPD 137
            +       IG  G+G      + +                    G +FTI   
Sbjct: 62  FRTDKRSRKIGGTGLGLSIVKKLVELHGGEISVESEPG------GGTTFTITLP 109


>gnl|CDD|238030 cd00075, HATPase_c, Histidine kinase-like ATPases; This family
           includes several ATP-binding proteins for example:
           histidine kinase, DNA gyrase B, topoisomerases, heat
           shock protein HSP90, phytochrome-like ATPases and DNA
           mismatch repair proteins.
          Length = 103

 Score = 38.4 bits (90), Expect = 3e-04
 Identities = 12/78 (15%), Positives = 26/78 (33%), Gaps = 16/78 (20%)

Query: 44  PNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSA 103
            + +   + + D+G G+ + DL                E    G+     G  G+G    
Sbjct: 29  RDGDHLEIRVEDNGPGIPEEDLERIF------------ERFSDGSRSRKGGGTGLGLSIV 76

Query: 104 YMVAD----KVTVTSKHN 117
             + +    ++ V S+  
Sbjct: 77  KKLVELHGGRIEVESEPG 94


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 34.2 bits (78), Expect = 0.050
 Identities = 15/27 (55%), Positives = 23/27 (85%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTP 211
           E+E E+E  E+EEEE++EEE+EE++ P
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEP 893



 Score = 33.8 bits (77), Expect = 0.071
 Identities = 15/31 (48%), Positives = 25/31 (80%)

Query: 193 SEDEEEEKKEEEKEEDKTPKLEDEGEDADCP 223
           SE+EEEE++EEE+EE++  + E+E E+ + P
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENEEP 893



 Score = 33.4 bits (76), Expect = 0.092
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDK 209
           E+E E+E  E+EEEE++EEE+EE++
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 33.0 bits (75), Expect = 0.13
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
           E+E E+E  E+EEEE++EEE+EE+    L  E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 32.3 bits (73), Expect = 0.20
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 189 EKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
           + E  E+EEEE++EEE+EE++  + E+E E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 31.9 bits (72), Expect = 0.28
 Identities = 14/30 (46%), Positives = 24/30 (80%)

Query: 187 EREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
           + E+E  E+EEEE++EEE+EE++  + E+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 31.9 bits (72), Expect = 0.33
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDK 209
           E+E E+E  E+EEEE++EEE+EE++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEE 888



 Score = 30.0 bits (67), Expect = 1.1
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 189 EKELSEDEEEEKKEEEKEEDKTPKLEDEGEDADCP 223
           E+E  E+EEEE++EEE+EE++  +  +E    + P
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWP 899



 Score = 29.2 bits (65), Expect = 1.9
 Identities = 13/38 (34%), Positives = 25/38 (65%)

Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           V+     +  + EEEE++EEE+EE++  + E+E E+ +
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.7 bits (78), Expect = 0.057
 Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 12/76 (15%)

Query: 146 TKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 205
           TK +  + E       EKK +E  K+         K     ++++E  E+E+E+K+EE++
Sbjct: 409 TKKIKKIVEK-----AEKKREEEKKEKK-------KKAFAGKKKEEEEEEEKEKKEEEKE 456

Query: 206 EEDKTPKLEDEGEDAD 221
           EE++  + E E E+  
Sbjct: 457 EEEEEAEEEKEEEEEK 472



 Score = 32.2 bits (74), Expect = 0.18
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 11/74 (14%)

Query: 150 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDE----EEEKKEEEK 205
           L + E++ E++   K      K        I    EK+RE+E  E +      +KKEEE+
Sbjct: 392 LELTEEEIEFLTGSKKATKKIKK-------IVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444

Query: 206 EEDKTPKLEDEGED 219
           EE+K  K E++ E+
Sbjct: 445 EEEKEKKEEEKEEE 458


>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL.  All proteins in
           this family for which the functions are known are
           involved in the process of generalized mismatch repair.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 312

 Score = 33.0 bits (76), Expect = 0.11
 Identities = 26/137 (18%), Positives = 54/137 (39%), Gaps = 35/137 (25%)

Query: 53  IIDSGIGMTKADLVNNL--GTTAKSGTKASMEALQAGADISMIGQFGVGF-----YSAYM 105
           + D+G G+ K DL        T+K         +Q+  D+  I   G  F      S   
Sbjct: 56  VSDNGSGIDKEDLPLACERHATSK---------IQSFEDLERIETLG--FRGEALASISS 104

Query: 106 VADKVTVTSKHNDNEQYIWESSAGGSFTIKPDNSQPLGRGTKIVLH------------MK 153
           V+ ++T+T+K +  +   +++   G        + P   GT + +             +K
Sbjct: 105 VS-RLTITTKTSAADGLAYQALLEGGMIESIKPA-PRPVGTTVEVRDLFYNLPVRRKFLK 162

Query: 154 EDQAEYIEEKKIKEIVK 170
             + E+   +KI ++++
Sbjct: 163 SPKKEF---RKILDVLQ 176


>gnl|CDD|212510 cd11724, TP_methylase_like, Uncharacterized subfamily of S-AdoMet
           dependent tetrapyrrole methylases.  TP-methylases use
           S-AdoMet (S-adenosyl-L-methionine or SAM) in the
           methylation of diverse substrates. Most members catalyze
           various methylation steps in cobalamin (vitamin B12)
           biosynthesis, other members like Diphthine synthase and
           Ribosomal RNA small subunit methyltransferase I (RsmI)
           act on other substrates. The function of this subfamily
           is not known.
          Length = 255

 Score = 32.3 bits (74), Expect = 0.15
 Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 5/68 (7%)

Query: 148 IVLHMKEDQ---AEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE 204
           I+   KE     A+Y++ K + E   K   F  Y  K     E EK   E  EEEKK + 
Sbjct: 29  IIFCPKEIVKRFADYLKGKPVLEDPWKL--FWHYGGKPWENLEAEKSPREAFEEEKKRQR 86

Query: 205 KEEDKTPK 212
           +E  K  +
Sbjct: 87  EEFVKVIR 94


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 31.3 bits (71), Expect = 0.31
 Identities = 15/42 (35%), Positives = 29/42 (69%)

Query: 180 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           + +L +K +E E  + +EE++K+EE+EE++  + ED  +D D
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 28.6 bits (64), Expect = 2.1
 Identities = 14/42 (33%), Positives = 28/42 (66%)

Query: 180 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
            KL + +++ KEL  ++ +E+ E+++EE++  + EDE  D D
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDD 192


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 31.3 bits (72), Expect = 0.43
 Identities = 13/71 (18%), Positives = 40/71 (56%), Gaps = 2/71 (2%)

Query: 138 NSQPLGRGTKIVLHMKEDQA-EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDE 196
             +    G K+++ +  +   +++++K + +++K++ +F G+ I  +  +  + +   ++
Sbjct: 116 KQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKF-GFGILKIDFEIDDSKEELEK 174

Query: 197 EEEKKEEEKEE 207
            E +KEEE E+
Sbjct: 175 FEAQKEEEDEK 185


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.5 bits (69), Expect = 0.49
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 186 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           KE EKE +  E++E K + ++ED+  + E+E ED +
Sbjct: 66  KEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.8 bits (70), Expect = 0.49
 Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 5/45 (11%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKL-----EDEGEDADCPA 224
           ++E E+E  E+EEEE++ EE E ++ P L     ED     D PA
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAPLPNDDPA 392



 Score = 27.7 bits (62), Expect = 6.2
 Identities = 11/29 (37%), Positives = 21/29 (72%)

Query: 186 KEREKELSEDEEEEKKEEEKEEDKTPKLE 214
              +K+  E++E+E++EEE+EE + P+ E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPE 371


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.7 bits (70), Expect = 0.54
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
            + E + +   D+E+E+K+E KE +K    +D   D D
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDED 192


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 30.8 bits (70), Expect = 0.55
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
             E E+E  E+EE  ++EEE+EE++    E+E
Sbjct: 302 PPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333


>gnl|CDD|224418 COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl
           hydrolases [Carbohydrate transport and metabolism].
          Length = 772

 Score = 30.4 bits (69), Expect = 0.79
 Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 8/60 (13%)

Query: 122 YIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEY--IEEKKIKEIVKKHSQFIGYP 179
              ++SA GSF +           + +   ++  Q +Y  I     K++++K++   G P
Sbjct: 205 LFVDNSAYGSFDVG------SEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKP 258


>gnl|CDD|232942 TIGR00366, TIGR00366, TIGR00366 family protein.  [Hypothetical
           proteins, Conserved].
          Length = 438

 Score = 30.2 bits (68), Expect = 0.98
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 2/32 (6%)

Query: 186 KEREKELSEDEEEEKKEEEKEEDKTP--KLED 215
           K  + +L + E+EE+K+ +K+ED T   KLE+
Sbjct: 216 KSIDPKLLKKEKEEEKKLKKKEDATIAEKLEN 247


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 30.0 bits (67), Expect = 1.0
 Identities = 14/41 (34%), Positives = 24/41 (58%)

Query: 181 KLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           KL  E E E    ED++EE  +++ +ED+    +D+ ED +
Sbjct: 40  KLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDE 80


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 30.0 bits (67), Expect = 1.0
 Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 3/37 (8%)

Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPK---LEDEG 217
            E++ ++E  + E EE+ EEE+ E   P+   L+ EG
Sbjct: 87  EEEDNDRECPDTEAEEEDEEEEIEAPDPEVNPLDAEG 123


>gnl|CDD|233256 TIGR01063, gyrA, DNA gyrase, A subunit.  This model describes the
           common type II DNA topoisomerase (DNA gyrase). Two
           apparently independently arising families, one in the
           Proteobacteria and one in Gram-positive lineages, are
           both designated toposisomerase IV [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 800

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 22/75 (29%), Positives = 40/75 (53%)

Query: 147 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 206
           + +L M+  +   +E +K++E  K+  + I     +L  +ER  E+  +E EE KE+  +
Sbjct: 414 QAILDMRLQRLTGLEREKLQEEYKELLELIADLEDILASEERVLEIIREELEEIKEQFGD 473

Query: 207 EDKTPKLEDEGEDAD 221
             +T  + DE ED D
Sbjct: 474 PRRTEIVYDESEDID 488


>gnl|CDD|235949 PRK07159, PRK07159, F0F1 ATP synthase subunit C; Validated.
          Length = 100

 Score = 28.3 bits (63), Expect = 1.1
 Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 2/67 (2%)

Query: 61  TKADLVNNLGTTAKSGTKASMEALQA--GADISMIGQFGVGFYSAYMVADKVTVTSKHND 118
               ++ N  T ++     S     A  GA ++MIG  GVG    Y     V   +++ +
Sbjct: 7   FVGQILQNTQTVSQKVAADSSGLKAAYIGAGLAMIGVIGVGLGQGYAFGKAVEAIARNPE 66

Query: 119 NEQYIWE 125
            ++ +++
Sbjct: 67  AQKQVFK 73


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.4 bits (66), Expect = 1.5
 Identities = 15/60 (25%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 162 EKKIKEIVKKHSQFIGYPIKLLVEKER--EKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
            KK+++  KK  +           + +  E++L ++E E+K++E+ +EDK  + +DE + 
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|234752 PRK00413, thrS, threonyl-tRNA synthetase; Reviewed.
          Length = 638

 Score = 29.3 bits (67), Expect = 1.9
 Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 9/61 (14%)

Query: 162 EKKIKEIVKKHSQFIGYPI-KLLVEKEREKELSEDEEEEKKEE---EKEEDKTPKLEDEG 217
           EK++KEI+K+      YPI + +V +E   EL +D  E  K E   E  ED+   L  +G
Sbjct: 121 EKEMKEIIKE-----NYPIEREVVSREEAIELFKDRGEPYKVELIEEIPEDEEISLYRQG 175

Query: 218 E 218
           E
Sbjct: 176 E 176


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 28.8 bits (65), Expect = 2.3
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 12/58 (20%)

Query: 152 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 209
           +K  + E  EE + K+  KK             E+   K      EE++K EEKE  K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKK------------EEREAKLAKLSPEEQRKLEEKERKK 318


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 199 to 238 amino acids in length. This domain is
           found associated with pfam06512, pfam00520. This domain
           has a conserved ADD sequence motif.
          Length = 222

 Score = 28.6 bits (64), Expect = 2.4
 Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 186 KEREKELSEDEEEEK--KEEEKEEDKTPKLE 214
           KER     ++++EE     EE + +K PK E
Sbjct: 12  KERRNRNDKNKKEEHSIGSEEGDSEKEPKSE 42


>gnl|CDD|227391 COG5058, LAG1, Protein transporter of the TRAM (translocating
           chain-associating membrane) superfamily, longevity
           assurance factor [Intracellular trafficking and
           secretion].
          Length = 395

 Score = 28.7 bits (64), Expect = 2.4
 Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 2/35 (5%)

Query: 176 IGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 210
           + Y  +++ E E + E S+DE EE+ + E  EDK 
Sbjct: 362 VAY--RVIWEGELKDERSDDESEEESDLESSEDKN 394


>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1.  This model
           describes peptide chain release factor 1 (PrfA, RF-1),
           and excludes the related peptide chain release factor 2
           (PrfB, RF-2). RF-1 helps recognize and terminate
           translation at UAA and UAG stop codons. The
           mitochondrial release factors are prfA-like, although
           not included above the trusted cutoff for this model.
           RF-1 does not have a translational frameshift [Protein
           synthesis, Translation factors].
          Length = 360

 Score = 28.5 bits (64), Expect = 2.6
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 13/71 (18%)

Query: 155 DQAEYI----EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE--- 207
           DQ +      E  +++EIV  + ++     +   + +  KE+ E+ + E +E  KEE   
Sbjct: 31  DQDKLRKLSKEYSQLEEIVDCYREYQ----QAQEDIKEAKEILEESDPEMREMAKEELEE 86

Query: 208 --DKTPKLEDE 216
             +K  +LE++
Sbjct: 87  LEEKIEELEEQ 97


>gnl|CDD|237538 PRK13876, PRK13876, conjugal transfer coupling protein TraG;
           Provisional.
          Length = 663

 Score = 28.7 bits (65), Expect = 2.8
 Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDA 220
              R+ EL E EE   +    E +    L+D+ +D 
Sbjct: 601 GIRRQPELPEHEEIVPEPRPPENEFDL-LDDDDDDD 635


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 28.8 bits (65), Expect = 2.9
 Identities = 14/65 (21%), Positives = 26/65 (40%)

Query: 155 DQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 214
           D  E+I  K IK    + + +    I  L     E+   E  EE ++ + ++      ++
Sbjct: 800 DLEEFIAVKGIKAKGNRLTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDID 859

Query: 215 DEGED 219
           DE   
Sbjct: 860 DEDTG 864


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 27.7 bits (62), Expect = 3.0
 Identities = 12/54 (22%), Positives = 30/54 (55%), Gaps = 14/54 (25%)

Query: 160 IEEKKI-KEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 212
           +EE+KI K+++K+ ++             ++K+  + ++++KK+  K+  K  K
Sbjct: 95  VEEQKIVKQVLKEKAK-------------QKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|133143 cd05476, pepsin_A_like_plant, Chroloplast Nucleoids DNA-binding
           Protease and Nucellin, pepsin-like aspartic proteases
           from plants.  This family contains pepsin like aspartic
           proteases from plants including Chloroplast Nucleoids
           DNA-binding Protease and Nucellin. Chloroplast Nucleoids
           DNA-binding Protease catalyzes the degradation of
           ribulose-1,5-bisphosphate carboxylase/oxygenase
           (Rubisco) in senescent leaves of tobacco and Nucellins
           are important regulators of nucellar cell's progressive
           degradation after ovule fertilization. Structurally,
           aspartic proteases are bilobal enzymes, each lobe
           contributing a catalytic Asp residue, with an extended
           active site cleft localized between the two lobes of the
           molecule. The N- and C-terminal domains, although
           structurally related by a 2-fold axis, have only limited
           sequence homology except the vicinity of the active
           site. This suggests that the enzymes evolved by an
           ancient duplication event.  The enzymes specifically
           cleave bonds in peptides which have at least six
           residues in length with hydrophobic residues in both the
           P1 and P1' positions. The active site is located at the
           groove formed by the two lobes, with an extended loop
           projecting over the cleft to form an 11-residue flap,
           which encloses substrates and inhibitors in the active
           site. Specificity is determined by nearest-neighbor
           hydrophobic residues surrounding the catalytic
           aspartates, and by three residues in the flap.  The
           enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
          Length = 265

 Score = 28.4 bits (64), Expect = 3.0
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 32  EKTLTIIDSGFIPNKEEKTLTIIDSG 57
            K L I  S F  + +    TIIDSG
Sbjct: 159 GKRLPIPPSVFAIDSDGSGGTIIDSG 184


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 27.5 bits (61), Expect = 3.0
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 187 EREKELSEDEEEEKKEEEKEEDKTPKLE 214
             E++  E EEE+KKEEEKEE++   L 
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEEALA 100


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 28.5 bits (63), Expect = 3.0
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 181 KLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGE 218
           KL  ++E   E ++ E E+K  EE+E +K  + E E E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERE 613



 Score = 28.1 bits (62), Expect = 4.5
 Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 3/32 (9%)

Query: 181 KLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 212
           K   E+EREK   E E+E ++E E+E ++  K
Sbjct: 595 KAREEREREK---EKEKEREREREREAERAAK 623


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 28.8 bits (64), Expect = 3.2
 Identities = 11/41 (26%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 185 EKEREKELSEDEEE-EKKEEEKEEDKTPKLEDEGEDADCPA 224
           +K +EK  ++ E + E    E E ++    + E +  D PA
Sbjct: 799 QKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPA 839


>gnl|CDD|224536 COG1621, SacC, Beta-fructosidases (levanase/invertase)
           [Carbohydrate transport and metabolism].
          Length = 486

 Score = 28.5 bits (64), Expect = 3.3
 Identities = 13/50 (26%), Positives = 17/50 (34%), Gaps = 10/50 (20%)

Query: 99  GFYSAYMVADK------VTVTSKHNDN----EQYIWESSAGGSFTIKPDN 138
           G YS   V D        T   + ++      Q I  S  GG+F     N
Sbjct: 100 GCYSGSAVVDDGNLSLFYTGNVRDSNGIRQQTQCIAYSEDGGTFEKYSGN 149


>gnl|CDD|111553 pfam02667, SCFA_trans, Short chain fatty acid transporter.  This
           family consists of two sequences annotated as short
           chain fatty acid transporters, however, there are no
           references giving details of experimental
           characterisation of this function.
          Length = 453

 Score = 28.2 bits (63), Expect = 3.3
 Identities = 11/32 (34%), Positives = 21/32 (65%), Gaps = 2/32 (6%)

Query: 186 KEREKELSEDEEEEKKEEEKEEDKTP--KLED 215
           K  + +L + E+EE+K+ + ++D T   KLE+
Sbjct: 219 KSIDPKLLKKEKEEEKKLKSKKDATIAEKLEN 250


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.5 bits (64), Expect = 3.3
 Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 153 KEDQAEYIEEKKIKEIVKK-HSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 211
           K  +   +  + I ++ +K  S+F     KL  E + +K+  + EE++KK+ E+ E++  
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIE 353

Query: 212 KLEDEGEDAD 221
           KLE +  D +
Sbjct: 354 KLEVQATDKE 363


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 28.3 bits (63), Expect = 3.5
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGE 218
           E+E E++ SE+EE E +EEE+E +     E+E E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 28.3 bits (63), Expect = 4.0
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           E+E   E  E+EEEE++EEE+E ++    ++E E+  
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476



 Score = 28.0 bits (62), Expect = 5.1
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDADCP 223
           E+  E+E  E+EEEE++E+E EE++    E+E E     
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN 480



 Score = 28.0 bits (62), Expect = 5.2
 Identities = 11/35 (31%), Positives = 24/35 (68%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
           E+E E+E  E+E+E ++EE ++E++  ++E +   
Sbjct: 448 EEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGS 482



 Score = 28.0 bits (62), Expect = 5.2
 Identities = 18/38 (47%), Positives = 27/38 (71%), Gaps = 3/38 (7%)

Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           VE+E E+E  E+EEE++ EEE+ ED+    E+E  +AD
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEE---EEEEVEAD 479


>gnl|CDD|233817 TIGR02304, aden_form_hyp, putative adenylate-forming enzyme.
           Members of this family form a distinct clade within a
           larger family of proteins that also includes coenzyme
           F390 synthetase, an enzyme known in Methanobacterium
           thermoautotrophicum and a few other methanogenic
           archaea. That enzyme adenylates coenzyme F420 to F390, a
           reversible process, during oxygen stress. Other
           informative homologies include domains of the
           non-ribosomal peptide synthetases involved in activation
           by adenylation. The family defined by this model is
           likely to be of an adenylate-forming enzyme related to
           but distinct from coenzyme F390 synthetase.
          Length = 430

 Score = 28.3 bits (63), Expect = 3.5
 Identities = 15/81 (18%), Positives = 25/81 (30%), Gaps = 3/81 (3%)

Query: 47  EEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSAYMV 106
                 ++D  + M   D +N  G         +M + +       +G   VG  S    
Sbjct: 53  PFNQWPMMDKALMMEHFDELNTAGLKKDEALDCAMRSEKTRDFKPCVGNISVGLSSGTSG 112

Query: 107 ADKVTVTSKHNDNEQYIWESS 127
              + V S     EQ +W   
Sbjct: 113 RRGLFVVSPE---EQQMWAGG 130


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 28.6 bits (63), Expect = 3.6
 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 2/60 (3%)

Query: 161  EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDA 220
            EEKK  E +KK  +     IK   E ++ +E  +  EE KK EE E+     L+ E E+A
Sbjct: 1644 EEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 27.9 bits (63), Expect = 3.8
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 189 EKELSEDEEEEKKEEEKEED 208
            +E  E+EEEE++EE  EE+
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320



 Score = 27.5 bits (62), Expect = 6.3
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 183 LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGE 218
           L E    +  +   EEE++EEE+EE++ P  E+   
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 28.3 bits (63), Expect = 4.1
 Identities = 17/77 (22%), Positives = 30/77 (38%), Gaps = 5/77 (6%)

Query: 135 KPDNSQPLGRGTKIVLHMKE---DQAEYIEEKKIKEIVKKHSQFIGYPIKLLV-EKEREK 190
           K    +            KE      EY +E KI+ ++KK         ++   E+    
Sbjct: 481 KAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEVI 540

Query: 191 ELSEDEEEEK-KEEEKE 206
           ++++ E EE+   E KE
Sbjct: 541 QVNKIETEEEVLSELKE 557


>gnl|CDD|236740 PRK10712, PRK10712, PTS system fructose-specific transporter
          subunits IIBC; Provisional.
          Length = 563

 Score = 28.2 bits (63), Expect = 4.3
 Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 49 KTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMI 93
          KTL IID+ +G  +A +   L   A +  KA +E +    D  + 
Sbjct: 2  KTLLIIDANLGQARAYMAKTLLGAAAA--KAGLEIIDNPNDAELA 44


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 185 EKEREKELSEDEEEEKKEEEKEED 208
               E+E  E+EEEE++EEE EE+
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 25.8 bits (57), Expect = 9.2
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 185 EKEREKELSEDEEEEKKEEEKEED 208
               E+E  E+EEEE++EE +EE 
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEA 96


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 28.1 bits (62), Expect = 4.4
 Identities = 11/33 (33%), Positives = 24/33 (72%)

Query: 187 EREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
           + +++  +D+EE+ +EEE+EE++    +DE E+
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 27.9 bits (63), Expect = 4.6
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 8/68 (11%)

Query: 162 EKKIKEIVK--------KHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKL 213
           EKK+KE +K        K  +          EK+ +  L +  + + KEEE+EE+K  K 
Sbjct: 493 EKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKEEKD 552

Query: 214 EDEGEDAD 221
           ++  ED  
Sbjct: 553 KETEEDEP 560


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 28.0 bits (62), Expect = 4.6
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 183  LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
            LV KE + +  ED++ ++KE+E+E      ++DE + 
Sbjct: 3935 LVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQP 3971


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 28.0 bits (63), Expect = 5.2
 Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 2/55 (3%)

Query: 158 EYIEEKKIKEIVKKHSQFIGYPIKLLVE-KEREKELSEDEEEEKKEEEKEEDKTP 211
           EY + K    ++K  +   G  ++     KE  ++   +  E +K EEKEE+  P
Sbjct: 828 EYKKSKGYVLVIKLENGG-GLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEELNP 881


>gnl|CDD|226175 COG3649, COG3649, CRISPR system related protein [Defense
           mechanisms].
          Length = 283

 Score = 27.6 bits (61), Expect = 5.3
 Identities = 20/47 (42%), Positives = 25/47 (53%)

Query: 170 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
           K+HS  +GY    +VEKE E EL  D  E K   E   DK  ++E E
Sbjct: 225 KEHSSKLGYLSSAVVEKELESELKLDVTELKDVYESLLDKIERIEYE 271


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 26.5 bits (59), Expect = 5.6
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 194 EDEEEEKKEEEKEEDKTPKLEDEGE 218
              EE+K+EEE+EE+K    E+   
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.1 bits (58), Expect = 7.9
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 185 EKEREKELSEDEEEEKKEEEKEED 208
               E++  E+EEEE+KEE +EE 
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEA 97


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 26.1 bits (58), Expect = 5.7
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 7/62 (11%)

Query: 161 EEKKIKEIVKKHSQFIGYPI------KLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 214
           E KK      K  +  G P       ++L E E +K   E+E+E +KE  K E +    +
Sbjct: 20  EAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAE-RAEARK 78

Query: 215 DE 216
             
Sbjct: 79  RG 80


>gnl|CDD|169396 PRK08349, PRK08349, hypothetical protein; Validated.
          Length = 198

 Score = 27.4 bits (61), Expect = 5.7
 Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 142 LGRGTKIV-LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLV 184
           L RG ++  +H ++D+    +E+K++E+V++  +  G  +K  V
Sbjct: 21  LRRGVEVYPVHFRQDE---KKEEKVRELVERLQELHGGKLKDPV 61


>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
          Length = 131

 Score = 26.8 bits (59), Expect = 5.9
 Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 163 KKIKEIVKKHSQFIGYPIKLLVEKE---REKELSEDEEEEKKEEEKEEDKTPKLEDE 216
           K I E ++K+S F+   +K         +E+E   +++EEK+E+ +E+ +   +E+E
Sbjct: 74  KLIAEFLEKYSDFLNEYVKFTPRNNNAIQEEEDDMEQQEEKEEKGREKGRKKNVEEE 130


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 27.7 bits (62), Expect = 6.0
 Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 2/38 (5%)

Query: 179 PIKLLVEKEREKELSEDEEE--EKKEEEKEEDKTPKLE 214
             +L  EKE ++ L E +     KK+EEK++ +  KLE
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587


>gnl|CDD|236873 PRK11186, PRK11186, carboxy-terminal protease; Provisional.
          Length = 667

 Score = 27.6 bits (62), Expect = 6.1
 Identities = 23/76 (30%), Positives = 29/76 (38%), Gaps = 18/76 (23%)

Query: 153 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKK---EEEKEEDK 209
            ED A Y  EK    +              L   EREKE  ED+ +      E  K E K
Sbjct: 583 NEDIARYKAEKDKNIVS-------------LNYAEREKENDEDDAKRLARLNERFKREGK 629

Query: 210 TP--KLEDEGEDADCP 223
            P   L+D  +D + P
Sbjct: 630 KPLKSLDDLPKDYEEP 645


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 27.5 bits (62), Expect = 6.1
 Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 2/59 (3%)

Query: 163 KKIKEIVKKHSQFIGYPIKLLVEKEREK--ELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
            K KEI+    + +     L +E++RE+  EL+ +  EE++E+++E+   P L +  + 
Sbjct: 42  SKAKEIIPIVKEVVEEVNSLSLEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKG 100


>gnl|CDD|119380 cd06418, GH25_BacA-like, BacA is a bacterial lysin from
           Enterococcus faecalis that degrades bacterial cell walls
           by catalyzing the hydrolysis of 1,4-beta-linkages
           between N-acetylmuramic acid and N-acetyl-D-glucosamine
           residues.  BacA is homologous to the YbfG and YkuG
           lysins of Bacillus subtilis. BacA has a C-terminal
           catalytic glycosyl hydrolase family 25 (GH25) domain and
           an N-terminal peptidoglycan-binding domain comprised of
           three alpha helices which is similar to a domain found
           in matrixins.
          Length = 212

 Score = 27.3 bits (61), Expect = 6.2
 Identities = 6/48 (12%), Positives = 13/48 (27%), Gaps = 2/48 (4%)

Query: 96  FGVGFYSAYMVADKVTVTSKHNDNEQYIWESSAGGSFTIKPDNSQPLG 143
           + +G Y +  V  +V             + +     F+       P  
Sbjct: 142 YRIGIYGSRNVCSRVLA--AAAGLAVVSFVADMSTGFSGNLGYPLPKD 187


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 27.2 bits (61), Expect = 6.2
 Identities = 17/50 (34%), Positives = 32/50 (64%), Gaps = 2/50 (4%)

Query: 162 EKKIKEIVKKHSQFIGYPIK--LLVEKEREKELSEDEEEEKKEEEKEEDK 209
            K+ K I+KK S+    PI      EKE++K +S++E++  KEE+++ ++
Sbjct: 72  TKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 27.0 bits (60), Expect = 6.3
 Identities = 20/87 (22%), Positives = 44/87 (50%), Gaps = 16/87 (18%)

Query: 150 LHMKEDQ-------AEYIEEKK--------IKEIVKKHSQFIGYPIKLLVEKEREKELSE 194
            H+++         AEY E KK        I+++ K++ +   +  K    K+ +K+  +
Sbjct: 41  SHLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKK-KKDKDK 99

Query: 195 DEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           D++++KK+++ E+    + ED+ ED  
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLT 126


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 27.4 bits (61), Expect = 6.9
 Identities = 11/60 (18%), Positives = 24/60 (40%)

Query: 160 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
            E+K+ +    +  +      +     E  + L E+ +++ +EE  +E      E E E 
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP 268


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 27.7 bits (62), Expect = 7.1
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 165 IKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 214
           I E++K    F       + +K+  K L   EE+ KKEE+KE++K  + E
Sbjct: 633 ISELMKIKQIFR----NWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 26.4 bits (59), Expect = 7.3
 Identities = 20/71 (28%), Positives = 28/71 (39%), Gaps = 18/71 (25%)

Query: 154 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE-----KEED 208
           ED A   EE+  K +              L E ER+ E  EDE++  K E        E+
Sbjct: 59  EDIAWLKEERDDKTVS-------------LNEAERKAEREEDEKKRLKRENERRKALGEE 105

Query: 209 KTPKLEDEGED 219
               L+D  +D
Sbjct: 106 PLKSLDDLPKD 116


>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
          Length = 694

 Score = 27.4 bits (61), Expect = 7.6
 Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 6/46 (13%)

Query: 175 FIGY--PIKLLVEKEREKELSEDEEEEKKEEE----KEEDKTPKLE 214
           F+ Y  P +       E    + EEE  + E     KE+ +T ++E
Sbjct: 318 FVAYTIPRRRAARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 27.1 bits (60), Expect = 8.2
 Identities = 15/42 (35%), Positives = 21/42 (50%), Gaps = 5/42 (11%)

Query: 185 EKEREKELSEDEEEEKK-----EEEKEEDKTPKLEDEGEDAD 221
           E E + E ++D+E+E +     E E E D     EDE ED  
Sbjct: 650 EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDA 691


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 27.2 bits (60), Expect = 8.4
 Identities = 15/35 (42%), Positives = 27/35 (77%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
           E+E+EKE  ++E+++KKE+ KEE K  K ++E ++
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 27.4 bits (61), Expect = 8.6
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
             ER+ E  EDEEEE++E+E E       +DE  + D
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEED 421


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 25.9 bits (56), Expect = 9.2
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 189 EKELSEDEEEEKKEEEKEEDKTPKLEDEGEDADCPA 224
           E   S+ +EEE  ++E EED     +++ ++   P+
Sbjct: 10  EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPS 45


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 27.1 bits (60), Expect = 9.2
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
           +++E E+ L  ++ E  +E+ +  +  P  EDE ED D
Sbjct: 398 LQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDED 435


>gnl|CDD|234083 TIGR03002, outer_YhbN_LptA, lipopolysaccharide transport
           periplasmic protein LptA.  Members of this protein
           family include LptA (previously called YhbN). It was
           shown to be an essential protein in E. coli, implicated
           in cell envelope integrity, and to play a role in the
           delivery of LPS to the outer leaflet of the outer
           membrane. It works with LptB (formerly yhbG), a homolog
           of ABC transporter ATP-binding proteins, encoded by an
           adjacent gene. Numerous homologs in other Proteobacteria
           are found in a conserved location near
           lipopolysaccharide inner core biosynthesis genes. This
           family is related to organic solvent tolerance protein
           (OstA), though distantly [Cell envelope, Biosynthesis
           and degradation of surface polysaccharides and
           lipopolysaccharides, Transport and binding proteins,
           Other].
          Length = 142

 Score = 26.4 bits (59), Expect = 9.4
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 7/65 (10%)

Query: 107 ADKVTVTSKHNDNEQYIWESSAGGS---FTIKPDNSQPL-GRGTKIVLHMKEDQAEYIEE 162
           ADKV VT       +     +A G    F  K DN + + G+  +I     +D+      
Sbjct: 43  ADKVVVTRNDAGGIEKA---TATGKPATFRQKLDNGKEIEGQANRIEYDPAKDEVVLTGN 99

Query: 163 KKIKE 167
            ++K+
Sbjct: 100 ARLKQ 104


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 25.7 bits (57), Expect = 9.6
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query: 194 EDEEEEKKEEEKEED 208
           E ++EEKKEEE+EE 
Sbjct: 80  EAKKEEKKEEEEEES 94


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 24.6 bits (54), Expect = 9.7
 Identities = 10/27 (37%), Positives = 19/27 (70%)

Query: 183 LVEKEREKELSEDEEEEKKEEEKEEDK 209
           L E+E+ ++  E+E+EE+ EE+ E  +
Sbjct: 20  LCEEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 26.4 bits (59), Expect = 10.0
 Identities = 13/34 (38%), Positives = 21/34 (61%), Gaps = 3/34 (8%)

Query: 179 PIKLLVEKERE--KELSEDEEEEKKEE-EKEEDK 209
           PI    E E+E  K +S++E++  KEE +K E+ 
Sbjct: 90  PIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.128    0.345 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,230,695
Number of extensions: 1080447
Number of successful extensions: 4751
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3635
Number of HSP's successfully gapped: 592
Length of query: 224
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 131
Effective length of database: 6,812,680
Effective search space: 892461080
Effective search space used: 892461080
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 57 (25.8 bits)