RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9550
         (271 letters)



>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 44.5 bits (106), Expect = 9e-06
 Identities = 20/88 (22%), Positives = 44/88 (50%), Gaps = 7/88 (7%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN--SLSEQEKRERRTMERKLSEMEV 240
            K   ++ + E  +L+  L K E+EL + +    K   +LSE+ ++ +   +++L + + 
Sbjct: 33  GKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAK---QQELQQKQQ 89

Query: 241 ELKELDQFKKENQRLKDENGALIRVISK 268
           EL++  Q     Q L+ +   L++ I  
Sbjct: 90  ELQQKQQ--AAQQELQQKQQELLQPIYD 115


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 44.1 bits (105), Expect = 9e-06
 Identities = 19/98 (19%), Positives = 47/98 (47%), Gaps = 11/98 (11%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN--SLSEQEKRERRTMERKLSEME 239
             K   ++ + E  + +  L K E+EL ++K    K+  +LSE  + ++   E++L +  
Sbjct: 15  AGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKK---EKELQKKV 71

Query: 240 VELKEL-DQFKKENQRLKDEN-----GALIRVISKLSK 271
            E +    + +++ Q+ + E        + + I +++K
Sbjct: 72  QEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAK 109


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 45.5 bits (108), Expect = 2e-05
 Identities = 43/177 (24%), Positives = 63/177 (35%), Gaps = 6/177 (3%)

Query: 4   STPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTS- 62
           STPP  +S R   R   + +S+ SP      S +    +S    +SS  S G  +G  + 
Sbjct: 196 STPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGAS-SSDSSSSESSGCGWGPENE 254

Query: 63  ---SGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSAN 119
                   +TLP+     S  +  S    P+S  SS    +    P  P    + +    
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314

Query: 120 YTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPN 176
            +     R S S ST+S +     A      S   + S SR         S RK P 
Sbjct: 315 SSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP-PPADPSSPRKRPR 370



 Score = 37.8 bits (88), Expect = 0.005
 Identities = 40/153 (26%), Positives = 53/153 (34%), Gaps = 10/153 (6%)

Query: 2   SYSTPPRRSSGRTT-SRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGG 60
           S S+ PR  S   + S P S  + S    S + SS     SSST   +SS  S G+    
Sbjct: 286 SSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSST--SSSSESSRGA---A 340

Query: 61  TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRP-RRSKSSTRSAN 119
            S G      PS  SRP   +  S  +P    R S AP++      RP RR   +  +  
Sbjct: 341 VSPGPSPSRSPSP-SRPPPPADPS--SPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGR 397

Query: 120 YTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESG 152
                      +G           A    Y   
Sbjct: 398 ARRRDATGRFPAGRPRPSPLDAGAASGAFYARY 430



 Score = 35.9 bits (83), Expect = 0.021
 Identities = 31/142 (21%), Positives = 47/142 (33%), Gaps = 3/142 (2%)

Query: 5   TPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSG 64
           T    +SG      R   +SS S       S S   S  +    SS  +  S+     S 
Sbjct: 267 TRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS-SPGSGPAPSSPRASSSSSSSRESS 325

Query: 65  YGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEG 124
               +  S  SR +  SP    +   S      P     P  RPR S++ +  A     G
Sbjct: 326 SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPA--ASAG 383

Query: 125 GGRMSRSGSTTSLNSLRSEAPS 146
                R+ +  +  + R +A  
Sbjct: 384 RPTRRRARAAVAGRARRRDATG 405



 Score = 31.7 bits (72), Expect = 0.49
 Identities = 36/167 (21%), Positives = 50/167 (29%), Gaps = 7/167 (4%)

Query: 1   MSYSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGG 60
            S S       G     P    +    P +R   +      SS P   SS  S       
Sbjct: 239 SSSSESSGCGWGPENECPLPRPAPITLP-TRIWEASGWNGPSSRPGPASSSSSPRERSPS 297

Query: 61  TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSY------APTTTPLPPVRPRRSKSS 114
            S    G     S  R S SS  S  +  SS  SS       A +  P P   P  S+  
Sbjct: 298 PSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357

Query: 115 TRSANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRS 161
             +   +P    R SR+ S+ + ++ R           G       +
Sbjct: 358 PPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDAT 404



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 34/152 (22%), Positives = 50/152 (32%), Gaps = 6/152 (3%)

Query: 1   MSYSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGG 60
               TPP  SS           S  PSP    LS     + S  P   +S P+ G++   
Sbjct: 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPD-LSEMLRPVGSPGPPPAASPPAAGASPAA 161

Query: 61  TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANY 120
            +S           + P  S   +  AP S    +  P +TP     PR  + S+  +  
Sbjct: 162 VASDAAS---SRQAALPLSSPEETARAPSSP--PAEPPPSTPPAAASPRPPRRSSPISAS 216

Query: 121 TPEGGGRMSRSGSTTSLNSLRSEAPSEGYESG 152
                    RS +  +  S    + SE    G
Sbjct: 217 ASSPAPAPGRSAADDAGASSSDSSSSESSGCG 248


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 44.1 bits (105), Expect = 2e-05
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEE---LNQVKATYSKNSLSEQEKRERRTM-ERKL 235
           E   +KL  E   E  RL E L K EEE   L +    Y K+  S +  + R    E++L
Sbjct: 43  EEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKEL 102

Query: 236 SEMEVELKELDQ-FKK---ENQRLKDENGALIR 264
             ++ E + L+Q F+K   E   L D+  A I+
Sbjct: 103 KNLKWESEVLEQRFEKVERERDELYDKFEAAIQ 135


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 43.5 bits (103), Expect = 4e-05
 Identities = 22/87 (25%), Positives = 39/87 (44%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
            +K  ++++AEL  L + L   E EL  ++   S+     QE RER     +      + 
Sbjct: 29  IRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDE 88

Query: 243 KELDQFKKENQRLKDENGALIRVISKL 269
           +EL     E Q  K+   +L   +++L
Sbjct: 89  RELRALNIEIQIAKERINSLEDELAEL 115



 Score = 36.2 bits (84), Expect = 0.010
 Identities = 18/81 (22%), Positives = 43/81 (53%), Gaps = 4/81 (4%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS---EQEKRERRTMERKLSEM 238
           D +    + ++E+  +RE + + EE+L+ VK      +L+   +  K    ++E +L+E+
Sbjct: 56  DLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAEL 115

Query: 239 EVELKEL-DQFKKENQRLKDE 258
             E+++L  + +   +RL+  
Sbjct: 116 MEEIEKLEKEIEDLKERLERL 136



 Score = 29.3 bits (66), Expect = 2.0
 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
           +  E+ + E+  L+E L + E+ L + +A      L E+    R   +   S+ E   ++
Sbjct: 117 EEIEKLEKEIEDLKERLERLEKNLAEAEA-----RLEEEVAEIREEGQELSSKREELKEK 171

Query: 245 LDQ---FKKENQRLKDENGALIRV 265
           LD     + E  R   +   ++ +
Sbjct: 172 LDPELLSEYERIRKNKKGVGVVPL 195


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 40.7 bits (96), Expect = 5e-04
 Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 10/79 (12%)

Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNS----------LSEQEKRERRTMERKLSEMEV 240
           + EL  L E L + EEEL ++K    K              EQ ++   T E+   E+E 
Sbjct: 333 KEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEE 392

Query: 241 ELKELDQFKKENQRLKDEN 259
             +EL + K+E + L  E 
Sbjct: 393 LEEELKELKEELESLYSEG 411


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 40.1 bits (94), Expect = 0.001
 Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 3/84 (3%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEKRERRTMERKLSEME 239
           + ++  EE Q EL  L+E + + E E++ ++      +N L E E+R     E+  +  E
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337

Query: 240 VELKELDQFKKENQRLKDENGALI 263
            EL+E +   +E ++L  E     
Sbjct: 338 -ELEERETLLEELEQLLAELEEAK 360



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 167 SGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA--TYSKNSLSEQE 224
           +GGS  K       L  K+  +E + EL+ L   L K EEEL  +K      ++ L E  
Sbjct: 653 TGGSRNK----RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEE-L 707

Query: 225 KRERRTMERKLSEMEVELKELD----QFKKENQRLKDENGALIRVISKLSK 271
           +R+   +ER+L E++ EL  L+    Q +   + L++E   L   + +L +
Sbjct: 708 RRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQE 758



 Score = 38.5 bits (90), Expect = 0.003
 Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 6/97 (6%)

Query: 179 GELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQ-EKRER-RTMERKLS 236
           G   YK+  EE++ +L R  E L + E+ L +++    K  L  Q EK ER + ++ +L 
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEK--LERQAEKAERYQELKAELR 223

Query: 237 EMEVEL--KELDQFKKENQRLKDENGALIRVISKLSK 271
           E+E+ L   +L + +KE + L++E   L   + +L +
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQE 260



 Score = 36.2 bits (84), Expect = 0.015
 Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 12/105 (11%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATY------------SKNSLSEQE 224
           E    + ++  EE Q  L  L E L   EE L ++K                   L E+ 
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802

Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
           +   R ++    E+E   +  ++ ++E + L++E   L   + +L
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847



 Score = 35.8 bits (83), Expect = 0.022
 Identities = 17/87 (19%), Positives = 35/87 (40%), Gaps = 8/87 (9%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
               EE++ EL      L +  EEL +        +L E+       +    +E+E   +
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFE--------ALREELAELEAELAEIRNELEELKR 404

Query: 244 ELDQFKKENQRLKDENGALIRVISKLS 270
           E++  ++  +RL +    L   + +L 
Sbjct: 405 EIESLEERLERLSERLEDLKEELKELE 431



 Score = 33.9 bits (78), Expect = 0.087
 Identities = 27/121 (22%), Positives = 50/121 (41%), Gaps = 11/121 (9%)

Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA 213
           + L               R+L +           E  + E+  L E + + EE+L++++ 
Sbjct: 796 EELEEELEEAERRLDALERELESLEQRR------ERLEQEIEELEEEIEELEEKLDELEE 849

Query: 214 TYSKNSLSEQE-KRERRTMERKLSEMEVELK----ELDQFKKENQRLKDENGALIRVISK 268
              +     +E K E   +E +  E+E ELK    E ++ ++E + L+ E   L   I K
Sbjct: 850 ELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEK 909

Query: 269 L 269
           L
Sbjct: 910 L 910



 Score = 32.4 bits (74), Expect = 0.25
 Identities = 23/99 (23%), Positives = 46/99 (46%), Gaps = 9/99 (9%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER--------RTMER 233
           + +K  EE + ELSRL E L + +EEL + +    +     +E RE           ++ 
Sbjct: 236 ELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKE 295

Query: 234 KLSEMEVELKEL-DQFKKENQRLKDENGALIRVISKLSK 271
           ++ E+E E+  L ++ ++    L++    L  +  K+  
Sbjct: 296 EIEELEGEISLLRERLEELENELEELEERLEELKEKIEA 334



 Score = 31.2 bits (71), Expect = 0.60
 Identities = 30/99 (30%), Positives = 48/99 (48%), Gaps = 3/99 (3%)

Query: 173 KLPNENGELDY-KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM 231
           +L     EL+  +   EE + EL  L E   + EEEL ++++  ++     ++ RER   
Sbjct: 857 ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916

Query: 232 -ERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
            E KL  +EVEL EL + + E +        L R I +L
Sbjct: 917 LEAKLERLEVELPEL-EEELEEEYEDTLETELEREIERL 954



 Score = 30.5 bits (69), Expect = 1.0
 Identities = 23/104 (22%), Positives = 42/104 (40%), Gaps = 3/104 (2%)

Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRE 227
             R         D K+  +E +AEL  L+  L +  EEL +++      ++ L E E+  
Sbjct: 410 EERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL 469

Query: 228 RRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
               E +L  +E EL  L+      +  +  +  +  V+  L  
Sbjct: 470 AELQE-ELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALES 512



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 18/82 (21%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
           +      ++  EE +A+L RL   L + EEEL        +++L  + +RE   +E ++ 
Sbjct: 904 KEEIEKLRERLEELEAKLERLEVELPELEEELE----EEYEDTLETELEREIERLEEEIE 959

Query: 237 EME-VELKELDQFKKENQRLKD 257
            +  V L+ ++++++  +R ++
Sbjct: 960 ALGPVNLRAIEEYEEVEERYEE 981



 Score = 29.3 bits (66), Expect = 2.7
 Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 2/84 (2%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
            E + EL  L+  +   EE L ++           +E      +E   +E+E   +EL++
Sbjct: 393 AEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE--LEELQTELEELNEELEE 450

Query: 248 FKKENQRLKDENGALIRVISKLSK 271
            +++ + L+D    L R +++L +
Sbjct: 451 LEEQLEELRDRLKELERELAELQE 474


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 38.5 bits (90), Expect = 0.001
 Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 1/85 (1%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK-ELD 246
            E Q EL+ L+E L++ ++E  ++K   S      +  ++     ++LS   +EL  E  
Sbjct: 76  AELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENR 135

Query: 247 QFKKENQRLKDENGALIRVISKLSK 271
           + ++E   LK EN AL     +L +
Sbjct: 136 ELREELAELKQENEALEAENERLQE 160



 Score = 35.0 bits (81), Expect = 0.018
 Identities = 24/87 (27%), Positives = 48/87 (55%), Gaps = 8/87 (9%)

Query: 173 KLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTME 232
           +L  +  EL  ++  +E + ELS L   L + ++EL ++K   S N++  +   E R + 
Sbjct: 84  ELQEQLAEL--QQENQELKQELSTLEAELERLQKELARIKQL-SANAI--ELDEENRELR 138

Query: 233 RKLSEMEVELKELDQFKKENQRLKDEN 259
            +L+E++ E + L+    EN+RL++  
Sbjct: 139 EELAELKQENEALEA---ENERLQENE 162


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 18/103 (17%)

Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYS-------KNSLSEQ 223
            +++ N   EL       E +A +  L E L K EE LN ++A  S       +  LS+ 
Sbjct: 750 EQEIENVKSELK------ELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKL 803

Query: 224 EKRERRTMERKLSEMEVELKELDQFK----KENQRLKDENGAL 262
           E+ E   +E +L E+E +L  L   K    KE Q L+++   L
Sbjct: 804 EE-EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL 845



 Score = 35.8 bits (83), Expect = 0.021
 Identities = 24/101 (23%), Positives = 45/101 (44%), Gaps = 19/101 (18%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEKR-------------ERRTME 232
            E + EL    E L+K E E++++ A     +  + E+ KR             E   + 
Sbjct: 311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370

Query: 233 RKLSEMEVELK----ELDQFKKENQRLKDENGALIRVISKL 269
            +L E++ E      EL  ++++ ++LK E   L R + +L
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411



 Score = 34.3 bits (79), Expect = 0.073
 Identities = 28/103 (27%), Positives = 49/103 (47%), Gaps = 10/103 (9%)

Query: 169 GSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK-NSLSEQEKRE 227
           G SR      G L  +    E Q    RLRE L   + EL+ +++   +  +  ++  +E
Sbjct: 657 GGSRAP--RGGILFSRSEPAELQ----RLRERLEGLKRELSSLQSELRRIENRLDELSQE 710

Query: 228 RRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLS 270
                RK+ E+E   KE++Q ++E ++LK+    L   +S L 
Sbjct: 711 LSDASRKIGEIE---KEIEQLEQEEEKLKERLEELEEDLSSLE 750



 Score = 33.9 bits (78), Expect = 0.10
 Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 4/102 (3%)

Query: 172 RKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT- 230
             L  E  E+D  K + E++ EL   RE L K + E+N++K    +     Q   E    
Sbjct: 367 EDLRAELEEVD--KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424

Query: 231 MERKLSEMEVELKELDQFKKENQ-RLKDENGALIRVISKLSK 271
           +   ++ +E ++ EL++ K++    +K +   L ++ + LSK
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSK 466



 Score = 32.0 bits (73), Expect = 0.34
 Identities = 20/76 (26%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELD- 246
           E    +   L E L + E  L  +++           K+ER  +E +L E+E +++EL+ 
Sbjct: 857 ENLNGKKEELEEELEELEAALRDLESRLGD------LKKERDELEAQLRELERKIEELEA 910

Query: 247 QFKKENQRLKDENGAL 262
           Q +K+ +RL +    L
Sbjct: 911 QIEKKRKRLSELKAKL 926



 Score = 31.2 bits (71), Expect = 0.60
 Identities = 17/77 (22%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
           ++ + +L +L   LSK E+EL  +K  Y +        +E   ++R+L+E E + +  ++
Sbjct: 451 KKQEWKLEQLAADLSKYEQELYDLKEEYDRVE------KELSKLQRELAEAEAQARASEE 504

Query: 248 FKKENQRLKDENGALIR 264
             +  + +++   A I+
Sbjct: 505 RVRGGRAVEEVLKASIQ 521



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 3/56 (5%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSE 237
            Y++   + + E  R+ + LSK + EL + +A       SE+  R  R +E  L  
Sbjct: 466 KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ---ARASEERVRGGRAVEEVLKA 518



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 19/95 (20%), Positives = 41/95 (43%), Gaps = 13/95 (13%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE-------RRTMERKLSEMEV 240
           E     +   R+ L +   E    KA   +  L E+ + E       +  +ER+   +E 
Sbjct: 187 ERLDLIIDEKRQQLERLRRERE--KAERYQALLKEKREYEGYELLKEKEALERQKEAIER 244

Query: 241 EL----KELDQFKKENQRLKDENGALIRVISKLSK 271
           +L    +EL++  +E   L+     + +++ +L+K
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNK 279



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 29/112 (25%), Positives = 51/112 (45%), Gaps = 14/112 (12%)

Query: 136 SLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELS 195
            L+SL+SE          +N     S+  S++     ++  E  +L+ ++  E+ +  L 
Sbjct: 689 ELSSLQSELRRI------ENRLDELSQELSDASRKIGEIEKEIEQLEQEE--EKLKERLE 740

Query: 196 RLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
            L E LS  E+E+  VK+      L E E R    +E  L ++E  L +L+ 
Sbjct: 741 ELEEDLSSLEQEIENVKS-----ELKELEARIEE-LEEDLHKLEEALNDLEA 786



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 16/72 (22%), Positives = 37/72 (51%), Gaps = 1/72 (1%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNS-LSEQEKRERRTMERKLSEMEVEL 242
           +   +E   ELS     + + E+E+ Q++    K     E+ + +  ++E+++  ++ EL
Sbjct: 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSEL 760

Query: 243 KELDQFKKENQR 254
           KEL+   +E + 
Sbjct: 761 KELEARIEELEE 772



 Score = 27.7 bits (62), Expect = 9.4
 Identities = 17/75 (22%), Positives = 28/75 (37%), Gaps = 4/75 (5%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSE----QEKRERRTMERKLSEMEVELK 243
           E+   E+S L + L + E+ L ++          E    +EK      E    E  +  K
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313

Query: 244 ELDQFKKENQRLKDE 258
           E +    E +  K E
Sbjct: 314 ERELEDAEERLAKLE 328


>gnl|CDD|217372 pfam03105, SPX, SPX domain.  We have named this region the SPX
           domain after (SYG1, Pho81 and XPR1). This 180 residue
           length domain is found at the amino terminus of a
           variety of proteins. In the yeast protein SYG1, the
           N-terminus directly binds to the G- protein beta subunit
           and inhibits transduction of the mating pheromone
           signal. This finding suggests that all the members of
           this family are involved in G-protein associated signal
           transduction. The N-termini of several proteins involved
           in the regulation of phosphate transport, including the
           putative phosphate level sensors PHO81 from
           Saccharomyces cerevisiae and NUC-2 from Neurospora
           crassa, are also members of this family. The SPX domain
           of S. cerevisiae low-affinity phosphate transporters
           Pho87 and Pho90 auto-regulates uptake and prevents
           efflux. This SPX dependent inhibition is mediated by the
           physical interaction with Spl2 NUC-2 contains several
           ankyrin repeats pfam00023. Several members of this
           family are annotated as XPR1 proteins: the xenotropic
           and polytropic retrovirus receptor confers
           susceptibility to infection with murine leukaemia
           viruses (MLV). The similarity between SYG1, phosphate
           regulators and XPR1 sequences has been previously noted,
           as has the additional similarity to several predicted
           proteins, of unknown function, from Drosophila
           melanogaster, Arabidopsis thaliana, Caenorhabditis
           elegans, Schizosaccharomyces pombe, and Saccharomyces
           cerevisiae. In addition, given the similarities between
           XPR1 and SYG1 and phosphate regulatory proteins, it has
           been proposed that XPR1 might be involved in G-protein
           associated signal transduction and may itself function
           as a phosphate sensor.
          Length = 176

 Score = 37.4 bits (87), Expect = 0.003
 Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 6/76 (7%)

Query: 178 NGELD-----YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTME 232
           + EL+     YK+   E    L  L + L + EE  ++  +  S  S     K     ++
Sbjct: 70  DKELEKVNSFYKEKEAELLERLEELEKQLEELEERRDETSSAKSSPSDKNLLK-AFEELK 128

Query: 233 RKLSEMEVELKELDQF 248
           + L E+  EL+ L  +
Sbjct: 129 KALLELYRELQLLKSY 144


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 38.1 bits (89), Expect = 0.004
 Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
            + YEE   EL  + + LS+ EEE+N ++    +    E+   E   +++KL E+E  L+
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE---LKKKLKELEKRLE 355

Query: 244 ELDQFKKENQRLK 256
           EL++  +  +  K
Sbjct: 356 ELEERHELYEEAK 368



 Score = 35.4 bits (82), Expect = 0.031
 Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 6/83 (7%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKR----ERRTMERKL 235
           D +K  E  + EL +L E L K  EEL + +      +  L E EK+    E   +  + 
Sbjct: 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEY 668

Query: 236 SEMEVELKELDQFKKENQRLKDE 258
            E+  EL  L    +E ++ ++E
Sbjct: 669 LELSRELAGLRAELEELEKRREE 691



 Score = 33.9 bits (78), Expect = 0.082
 Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 14/103 (13%)

Query: 172 RKLPNENGELDYKKLYEESQAELSRLRET---LSKTEEELNQVKATYSKNSLSEQEKRER 228
           RKL  E  EL+ K L +ES  EL +L+E    L + EE+L +    Y+   L E++  E 
Sbjct: 476 RKLRKELRELE-KVLKKES--ELIKLKELAEQLKELEEKLKK----YNLEEL-EKKAEEY 527

Query: 229 RTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
             ++ KL +++ E+K L   KKE ++L++    L  +  KL +
Sbjct: 528 EKLKEKLIKLKGEIKSL---KKELEKLEELKKKLAELEKKLDE 567



 Score = 33.5 bits (77), Expect = 0.13
 Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 6/91 (6%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA-----TYSKNSLSEQEKRERRTM 231
           EN   +  ++ +E +  + RL + + +TE     +K            ++E         
Sbjct: 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPEL- 219

Query: 232 ERKLSEMEVELKELDQFKKENQRLKDENGAL 262
             +L ++E E+KEL++ K+E + L+ E  +L
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEKELESL 250



 Score = 31.6 bits (72), Expect = 0.55
 Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 11/71 (15%)

Query: 193 ELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER------RTMERKLSEMEVELKELD 246
           EL  LRE L K E+E+ +++    K  + E EK         R +E K+ E+E  ++EL 
Sbjct: 215 ELPELREELEKLEKEVKELEEL--KEEIEELEKELESLEGSKRKLEEKIRELEERIEEL- 271

Query: 247 QFKKENQRLKD 257
             KKE + L++
Sbjct: 272 --KKEIEELEE 280



 Score = 31.6 bits (72), Expect = 0.56
 Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
           ++ Y E   EL+ LR  L + E+   ++K T  K    ++E  ER   +++L ++E  L+
Sbjct: 665 REEYLELSRELAGLRAELEELEKRREEIKKTLEK---LKEELEEREKAKKELEKLEKALE 721

Query: 244 ELDQFKKENQRLKDE 258
            +++ +++ ++ K  
Sbjct: 722 RVEELREKVKKYKAL 736



 Score = 31.6 bits (72), Expect = 0.56
 Identities = 27/100 (27%), Positives = 52/100 (52%), Gaps = 8/100 (8%)

Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK--NSLSEQEK-RERRTMERKLS 236
           E   K+L EE  AEL R+ + L + EE+  +++    +    L ++ +  + + +  +L 
Sbjct: 447 EEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLK 506

Query: 237 EMEVELK-----ELDQFKKENQRLKDENGALIRVISKLSK 271
           E+E +LK     EL++  +E ++LK++   L   I  L K
Sbjct: 507 ELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKK 546



 Score = 30.8 bits (70), Expect = 0.79
 Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 12/86 (13%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR----ERRTMERKLS--- 236
            + YE+ + +L +L+  +   ++EL +++    K +  E++      E   + ++L    
Sbjct: 524 AEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELG 583

Query: 237 -----EMEVELKELDQFKKENQRLKD 257
                E+E  LKEL+ F  E   LKD
Sbjct: 584 FESVEELEERLKELEPFYNEYLELKD 609



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 24/97 (24%), Positives = 43/97 (44%), Gaps = 14/97 (14%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTME--------RKL 235
           +K  E  +    +L E + + EE + ++K       + E E++ +   E         KL
Sbjct: 244 EKELESLEGSKRKLEEKIRELEERIEELKK-----EIEELEEKVKELKELKEKAEEYIKL 298

Query: 236 SEMEVE-LKELDQFKKENQRLKDENGALIRVISKLSK 271
           SE   E L EL + +K   RL++E   +   I +L +
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEE 335



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 11/107 (10%)

Query: 171 SRKLPNENGELD----YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR 226
           S +LP    EL+      K  EE + E+  L + L   E    +++       + E E+R
Sbjct: 213 SSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE-----KIRELEER 267

Query: 227 --ERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
             E +    +L E   ELKEL +  +E  +L +     +  + ++ K
Sbjct: 268 IEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEK 314



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 3/87 (3%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNS---LSEQEKRERRTMERKLSEMEVELKE 244
           +++  EL+   + L +  +EL +++  YS+     L E+     R +    +E+E   K 
Sbjct: 629 DKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688

Query: 245 LDQFKKENQRLKDENGALIRVISKLSK 271
            ++ KK  ++LK+E     +   +L K
Sbjct: 689 REEIKKTLEKLKEELEEREKAKKELEK 715



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 6/84 (7%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE 241
           +Y     E +  LSRL E ++  EE + +++    K    E+ K++ + +E++L E+E  
Sbjct: 304 EYLDELREIEKRLSRLEEEINGIEERIKELE---EKEERLEELKKKLKELEKRLEELEER 360

Query: 242 LKELD---QFKKENQRLKDENGAL 262
            +  +     K+E +RLK     L
Sbjct: 361 HELYEEAKAKKEELERLKKRLTGL 384



 Score = 28.5 bits (64), Expect = 4.8
 Identities = 20/88 (22%), Positives = 39/88 (44%), Gaps = 8/88 (9%)

Query: 184 KKLYEESQAELSRLRETLSKTEE------ELNQVKATYSKNSLSEQEKRERRTMERKLSE 237
           KK  +E +  L  L E     EE      EL ++K   +  +  E+ ++E   +E+   E
Sbjct: 344 KKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP-EKLEKELEELEKAKEE 402

Query: 238 MEVELKELDQFKKE-NQRLKDENGALIR 264
           +E E+ ++     E  + +K+   A+  
Sbjct: 403 IEEEISKITARIGELKKEIKELKKAIEE 430


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 38.3 bits (90), Expect = 0.004
 Identities = 18/62 (29%), Positives = 35/62 (56%), Gaps = 9/62 (14%)

Query: 198 RETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKD 257
           ++ + + +E+LN++ A     SL E E+     +E+K  E E  LKE ++ K+E +  K+
Sbjct: 508 KKLIGEDKEKLNELIA-----SLEELERE----LEQKAEEAEALLKEAEKLKEELEEKKE 558

Query: 258 EN 259
           + 
Sbjct: 559 KL 560



 Score = 29.0 bits (66), Expect = 3.0
 Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 3/80 (3%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRERRTMERKLSEMEVELK-E 244
           EE + EL +  E      +E  ++K      K  L E+E +     E++  +   E K E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585

Query: 245 LDQFKKENQRLKDENGALIR 264
            D+  KE ++L+    A ++
Sbjct: 586 ADEIIKELRQLQKGGYASVK 605


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 36.8 bits (86), Expect = 0.005
 Identities = 24/95 (25%), Positives = 47/95 (49%), Gaps = 13/95 (13%)

Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVK------ATYSKNSLSEQEKRERR 229
            EN  L+ +K+ EE   EL RL+   + T   L  +K          +    E +++E R
Sbjct: 49  IENQALN-EKI-EERNKELKRLKLKYTNTVHILTHLKEKLHMLEEELERLKKELKEKEER 106

Query: 230 TME--RKLSEMEVELKELDQFKKENQRLKDENGAL 262
             +  ++L +++    E D+ +K+N++L+ + G L
Sbjct: 107 LAKLRKELYKVK---LERDKLRKQNKKLRQQGGLL 138



 Score = 34.5 bits (80), Expect = 0.026
 Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM-ERKLSEMEVELKEL 245
           +E+ Q E   L E + +  +EL ++K  Y+         +E+  M E +L  ++ ELKE 
Sbjct: 44  FEQLQIENQALNEKIEERNKELKRLKLKYTNTVHILTHLKEKLHMLEEELERLKKELKEK 103

Query: 246 DQ----FKKENQRLKDENGALIRVISKLSK 271
           ++     +KE  ++K E   L +   KL +
Sbjct: 104 EERLAKLRKELYKVKLERDKLRKQNKKLRQ 133



 Score = 28.7 bits (65), Expect = 2.2
 Identities = 13/77 (16%), Positives = 33/77 (42%), Gaps = 15/77 (19%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE-- 241
           KK  +E +  L++LR+ L K + E ++++    K             + ++   + V   
Sbjct: 97  KKELKEKEERLAKLRKELYKVKLERDKLRKQNKK-------------LRQQGGLLHVPAL 143

Query: 242 LKELDQFKKENQRLKDE 258
           L + D+  ++ +  +  
Sbjct: 144 LYDYDKTVEKVEEKRKS 160


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 37.2 bits (87), Expect = 0.006
 Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 6/86 (6%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERK----LSEM 238
            +++ EE +AE    RE   K  EE+++      +    E+EK   R  E K      E 
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNE--ERIERKEEEKEREREEELKILEYQREK 162

Query: 239 EVELKELDQFKKENQRLKDENGALIR 264
               +E +  ++E +  K+   A +R
Sbjct: 163 AEREEEREAERRERKEEKEREVARLR 188



 Score = 29.1 bits (66), Expect = 2.8
 Identities = 13/71 (18%), Positives = 27/71 (38%), Gaps = 4/71 (5%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
           +E   E    +E   + E E       Y      E+ +RE      +    E + +E+ +
Sbjct: 131 DEFNEERIERKEEEKEREREEELKILEY----QREKAEREEEREAERRERKEEKEREVAR 186

Query: 248 FKKENQRLKDE 258
            + + +  +DE
Sbjct: 187 LRAQQEEAEDE 197



 Score = 28.3 bits (64), Expect = 3.8
 Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 10/78 (12%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER-------RTMERKLSEMEV 240
           E   A L   +E      EEL++++A        E E++ER           R+  E++ 
Sbjct: 181 EREVARLRAQQEEAEDEREELDELRA---DLYQEEYERKERQKEKEEAEKRRRQKQELQR 237

Query: 241 ELKELDQFKKENQRLKDE 258
             +E  + K+E  + +  
Sbjct: 238 AREEQIEEKEERLQEERA 255


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 37.5 bits (88), Expect = 0.006
 Identities = 24/93 (25%), Positives = 48/93 (51%), Gaps = 12/93 (12%)

Query: 183 YKKLYE-ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR---TMERKLSEM 238
            KK    E++ E+ +LR   ++ E+EL +      +N L + EKR  +    ++RKL  +
Sbjct: 54  IKKEALLEAKEEIHKLR---NEFEKELRE-----RRNELQKLEKRLLQKEENLDRKLELL 105

Query: 239 EVELKELDQFKKENQRLKDENGALIRVISKLSK 271
           E   +EL++ +KE ++ + E       + +L +
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELIE 138



 Score = 31.3 bits (72), Expect = 0.61
 Identities = 23/95 (24%), Positives = 46/95 (48%), Gaps = 13/95 (13%)

Query: 181 LDYKK-LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
           LD K  L E+ + EL +  + L + ++EL + +    +    + ++ ER      +S + 
Sbjct: 98  LDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER------ISGLT 151

Query: 240 V-ELKE--LDQFKKENQRLKDENGALIRVISKLSK 271
             E KE  L++ ++E    + E   LI+ I + +K
Sbjct: 152 AEEAKEILLEKVEEE---ARHEAAVLIKEIEEEAK 183


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 37.0 bits (86), Expect = 0.007
 Identities = 12/50 (24%), Positives = 26/50 (52%)

Query: 209 NQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
             V          +    E   ++++L+E+E  L+E++  ++EN+RLK+ 
Sbjct: 56  EFVDGVLEFLKSLKDLALENEELKKELAELEQLLEEVESLEEENKRLKEL 105


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 36.9 bits (86), Expect = 0.007
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 12/81 (14%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKN--SLSEQEKRER--------RTMERKLSE 237
           E ++AE   LRE   + E+ +   + +Y ++   L E+ + ER        R +E KL E
Sbjct: 214 EAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQE 273

Query: 238 MEVELKELDQFKKENQRLKDE 258
            E  LKE   FK E + L+ E
Sbjct: 274 QEELLKE--GFKTEAESLQKE 292


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 36.1 bits (84), Expect = 0.007
 Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 10/80 (12%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
           K   E+ + EL  L++ +++ + ++ ++K          +E  ER  +  +L ++E ELK
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKK-------GREETEERTELLEELKQLEKELK 120

Query: 244 ELDQFKKENQRLKDENGALI 263
           +L   K E ++ +  +   I
Sbjct: 121 KL---KAELEKYEKNDPERI 137


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 35.3 bits (82), Expect = 0.008
 Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 4/91 (4%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMER--KLSEMEVE 241
           ++ +E  +  L +  E L K EEEL +    + K  L E E + RR  ++  +  ++  E
Sbjct: 20  REEFERREELLKQREEELEKKEEELQESLIKFDK-FLKENEAKRRRAEKKAEEEKKLRKE 78

Query: 242 L-KELDQFKKENQRLKDENGALIRVISKLSK 271
             +E+ + K E + LK E   L   + +   
Sbjct: 79  KEEEIKELKAELEELKAEIEKLEEKLEEYQP 109



 Score = 34.9 bits (81), Expect = 0.011
 Identities = 18/85 (21%), Positives = 41/85 (48%), Gaps = 11/85 (12%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS--------EQEKRERRTMER 233
           ++++  E  +     L +   + +E L +      +N           E+EK+ R+  E 
Sbjct: 22  EFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEE 81

Query: 234 KLSEMEVELKELDQFKKENQRLKDE 258
           ++ E++ EL+EL   K E ++L+++
Sbjct: 82  EIKELKAELEEL---KAEIEKLEEK 103


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 36.0 bits (84), Expect = 0.008
 Identities = 30/93 (32%), Positives = 40/93 (43%), Gaps = 24/93 (25%)

Query: 183 YKKLYEES---------QAE--LSRLRETLSKTEEE----LNQVKATYSKNSLSEQEKR- 226
           Y+ LYE S         QAE   S L + + K EEE      +V     +  L   EKR 
Sbjct: 100 YQTLYESSVAFGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAEL--EAKLEAIEKRE 157

Query: 227 -ERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
            E R +E K    E+        KK+NQ+LK +
Sbjct: 158 EEERQIEEKRHADEI-----AFLKKQNQQLKSQ 185


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 36.8 bits (86), Expect = 0.010
 Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 4/84 (4%)

Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSE---MEVELKEL 245
           E   EL  L + + + E+EL Q+KA     +  ++EK++ +   +K           K L
Sbjct: 366 EYDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVL 425

Query: 246 DQFKKEN-QRLKDENGALIRVISK 268
            +  +++ + LK+  G ++  +  
Sbjct: 426 KEATRQDLELLKNVWGEILESLKA 449


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 37.0 bits (86), Expect = 0.010
 Identities = 21/101 (20%), Positives = 46/101 (45%), Gaps = 5/101 (4%)

Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
             K+     E+  +    E++ +L    E L + EEEL Q++    +  L ++     + 
Sbjct: 660 EEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEE--LLKKLGEIEQL 717

Query: 231 MER---KLSEMEVELKELDQFKKENQRLKDENGALIRVISK 268
           +E    + +E+E   KEL++ +K  + L++    L +   +
Sbjct: 718 IEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLR 758



 Score = 34.0 bits (78), Expect = 0.075
 Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 15/106 (14%)

Query: 180 ELDYKKLYEESQAELSRLRE---TLSKTEEELNQVKATYSKNSLSEQEKRER-----RTM 231
           E+  ++  EE + E+  L E    L + +E L ++KA   +    E E  +      R +
Sbjct: 220 EIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELREL 279

Query: 232 ERKLSEMEVELKELD-------QFKKENQRLKDENGALIRVISKLS 270
           ER L E+E +++ L+       + ++E + L+     L  ++ KL 
Sbjct: 280 ERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLK 325



 Score = 33.6 bits (77), Expect = 0.12
 Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 4/78 (5%)

Query: 182 DYKKLYEESQAE-LSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEV 240
           + ++L E+ Q + L      L    +EL ++          ++E  E R    +L E++ 
Sbjct: 544 ELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELR---ERLKELKK 600

Query: 241 ELKELDQFKKENQRLKDE 258
           +LKEL++   + + L   
Sbjct: 601 KLKELEERLSQLEELLQS 618



 Score = 33.6 bits (77), Expect = 0.12
 Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 7/90 (7%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQ----VKATYSKNSLSEQEKRER---RTMERK 234
           + ++L EE + ++ RL E   + EE   +             L E+ K        +E K
Sbjct: 278 ELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEK 337

Query: 235 LSEMEVELKELDQFKKENQRLKDENGALIR 264
           L ++E EL+EL + K E  +L +E    + 
Sbjct: 338 LEKLESELEELAEEKNELAKLLEERLKELE 367



 Score = 31.3 bits (71), Expect = 0.55
 Identities = 20/106 (18%), Positives = 40/106 (37%), Gaps = 14/106 (13%)

Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEEL--------------NQVKATYSKNSLSEQEK 225
            L+  +   E +     L   L K   +                +V+   ++     Q  
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677

Query: 226 RERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
                +E KL E+E   +EL+Q ++E + L  + G + ++I +L  
Sbjct: 678 ENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELES 723



 Score = 30.9 bits (70), Expect = 0.92
 Identities = 18/80 (22%), Positives = 41/80 (51%), Gaps = 3/80 (3%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR---ERRTMERKLSEM 238
           + ++  +E + +L  L E LS+ EE L  ++ + ++N L E E+    E   +  +    
Sbjct: 590 ELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELE 649

Query: 239 EVELKELDQFKKENQRLKDE 258
           E+    L++ +++ + L+ E
Sbjct: 650 ELLQAALEELEEKVEELEAE 669



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 23/106 (21%), Positives = 43/106 (40%), Gaps = 17/106 (16%)

Query: 183 YKKLYEESQAELSRL--------------RETLSKTEEELNQVKATY---SKNSLSEQEK 225
            ++  EE + EL  L              +E L +  E+L  +        +    +Q K
Sbjct: 499 LREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLK 558

Query: 226 RERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
            E R +E +L E++  L+EL   +   + L++    L  +  KL +
Sbjct: 559 EELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE 604



 Score = 29.0 bits (65), Expect = 3.7
 Identities = 21/77 (27%), Positives = 45/77 (58%), Gaps = 1/77 (1%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER-RTMERKLSEMEVE 241
            ++  EE + EL +  E L + EE + ++K   ++ S + +E +E    +E++L E+E E
Sbjct: 366 LEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERE 425

Query: 242 LKELDQFKKENQRLKDE 258
           L+EL++  K+ +   ++
Sbjct: 426 LEELEEEIKKLEEQINQ 442



 Score = 28.2 bits (63), Expect = 6.4
 Identities = 30/148 (20%), Positives = 56/148 (37%), Gaps = 30/148 (20%)

Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDY-KKLYEESQAELSRLRETLSKTEEELNQVK 212
             L  +   +  E    S  L     EL+  +K  EE + EL  L E + K EE++NQ++
Sbjct: 385 KQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE 444

Query: 213 AT------------------------YSKNSLSEQEK-----RERRTMERKLSEMEVELK 243
           +                         + K  L   E       E  + E++ +E+  E++
Sbjct: 445 SKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIE 504

Query: 244 ELDQFKKENQRLKDENGALIRVISKLSK 271
           EL++  +E +    E   L   + +  +
Sbjct: 505 ELEKELRELEEELIELLELEEALKEELE 532


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 34.6 bits (80), Expect = 0.014
 Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 2/86 (2%)

Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSK-NSLSEQEKRERRTMERKLSEMEVELK-E 244
           YE    + +   E L    ++LN++K   ++  + +E  + E    E    E +  L+ E
Sbjct: 47  YERELVKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDE 106

Query: 245 LDQFKKENQRLKDENGALIRVISKLS 270
           L + +K  + L ++N  L   I  LS
Sbjct: 107 LSELEKRIEELNEQNKLLHDQIELLS 132



 Score = 31.1 bits (71), Expect = 0.24
 Identities = 20/75 (26%), Positives = 40/75 (53%), Gaps = 12/75 (16%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
            +K   E + E+++L+      + EL++     ++ S  EQ+K     +E +LSE+E  +
Sbjct: 64  LRKQLNELKKEIAQLKAEAESAQAELSE-----AEESWEEQKKM----LEDELSELEKRI 114

Query: 243 KELDQFKKENQRLKD 257
           +EL    ++N+ L D
Sbjct: 115 EEL---NEQNKLLHD 126


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 36.2 bits (83), Expect = 0.017
 Identities = 27/104 (25%), Positives = 55/104 (52%), Gaps = 8/104 (7%)

Query: 164  NSESGGSSRKLPNENGELDYKK-LYEESQAELSRLRETLSKTEEELNQVKA---TYSKNS 219
            N +     +K+ N   ++D KK +Y+E +  L+ + E + K +  L +VK    +Y KN 
Sbjct: 1165 NDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAE-IEKDKTSLEEVKGINLSYGKNL 1223

Query: 220  ---LSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENG 260
                 E+   E++  E  +  ME  +++LD+ K+++  +++E G
Sbjct: 1224 GKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMG 1267


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 35.8 bits (83), Expect = 0.017
 Identities = 24/107 (22%), Positives = 30/107 (28%), Gaps = 14/107 (13%)

Query: 2   SYSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGT 61
           S+       +  + S P S     PS  S      S   SS  P      PS   +    
Sbjct: 171 SFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA-----PSSFQSDT-- 223

Query: 62  SSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRP 108
                    PS  S  + S P    APP        P +T  P    
Sbjct: 224 -------PPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPS 263


>gnl|CDD|129323 TIGR00219, mreC, rod shape-determining protein MreC.  MreC (murein
           formation C) is involved in the rod shape determination
           in E. coli, and more generally in cell shape
           determination of bacteria whether or not they are
           rod-shaped. Cells defective in MreC are round. Species
           with MreC include many of the Proteobacteria,
           Gram-positives, and spirochetes [Cell envelope,
           Biosynthesis and degradation of murein sacculus and
           peptidoglycan].
          Length = 283

 Score = 35.6 bits (82), Expect = 0.019
 Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 9/70 (12%)

Query: 195 SRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQR 254
           +R RE      E L  V     +N       + R+ + +K  ++E+  + L Q   EN R
Sbjct: 52  NRPREVFDGISENLKDVNNLEYEN------YKLRQELLKKNQQLEILTQNLKQ---ENVR 102

Query: 255 LKDENGALIR 264
           L++   + + 
Sbjct: 103 LRELLNSPLS 112


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.8 bits (83), Expect = 0.023
 Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 7/81 (8%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE--- 244
            E + EL  L+  + K E EL + +          ++ RE R  +R++  +E EL+E   
Sbjct: 439 SELKRELEELKREIEKLESELERFRREVRD---KVRKDREIRARDRRIERLEKELEEKKK 495

Query: 245 -LDQFKKENQRLKDENGALIR 264
            +++ +++   L+      + 
Sbjct: 496 RVEELERKLAELRKMRKLELS 516



 Score = 28.5 bits (64), Expect = 4.4
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 7/73 (9%)

Query: 197 LRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLK 256
           L E LSK +EE  + +         E+E+RE    E+++ ++E  ++ L   ++EN  LK
Sbjct: 390 LAEALSKVKEE-ERPREKEGT---EEEERREITVYEKRIKKLEETVERL---EEENSELK 442

Query: 257 DENGALIRVISKL 269
            E   L R I KL
Sbjct: 443 RELEELKREIEKL 455


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 35.0 bits (81), Expect = 0.025
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLKDENG 260
           + E   ++++L E+E  L+EL+Q + EN RL++   
Sbjct: 75  REENEELKKELLELESRLQELEQLEAENARLRELLN 110


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 35.3 bits (82), Expect = 0.035
 Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 6/90 (6%)

Query: 175 PNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA---TYSKNSLSEQEKRERRT- 230
             E    +  +       +LS   + L     +LN ++      S  SL E  K      
Sbjct: 36  LKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEEL 95

Query: 231 --MERKLSEMEVELKELDQFKKENQRLKDE 258
             +E+++ E+E E+ EL+   KE ++  + 
Sbjct: 96  EKIEKEIKELEEEISELENEIKELEQEIER 125



 Score = 29.5 bits (67), Expect = 2.0
 Identities = 15/85 (17%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 165 SESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE 224
                  R+   +      ++L ++ + EL ++ + + + EEE++++     +N + E E
Sbjct: 66  LPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISEL-----ENEIKELE 120

Query: 225 KRERRTMERKLSEMEVELKELDQFK 249
           +   R         +++L  L  FK
Sbjct: 121 QEIERLE--PWGNFDLDLSLLLGFK 143



 Score = 29.1 bits (66), Expect = 3.0
 Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 15/97 (15%)

Query: 172 RKLPNENGELDYKK----LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE 227
           +KL  E  EL+ +     L  E + EL  + +      EEL ++   Y +  L+  E  E
Sbjct: 198 KKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLE 257

Query: 228 RRTMERKLSEMEVELKELDQF--------KKENQRLK 256
              +E + +E   +  + D+         +   ++LK
Sbjct: 258 ---IELERAEALSKFLKTDKTFAIEGWVPEDRVKKLK 291


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 34.8 bits (80), Expect = 0.035
 Identities = 16/74 (21%), Positives = 28/74 (37%), Gaps = 2/74 (2%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLS--EQEKRERRTMERKLSEMEVELKEL 245
              + E   L     K  E   Q +    +       ++ +E   +E++L E+  EL++ 
Sbjct: 74  INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133

Query: 246 DQFKKENQRLKDEN 259
            Q   E    K EN
Sbjct: 134 LQNTAEIIEKKREN 147



 Score = 27.9 bits (62), Expect = 7.1
 Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 11/84 (13%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEK---------RERRTMERK 234
            +  E    E  +  E L K  +EL++      +N+    EK         R +   E+K
Sbjct: 102 LEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKK 161

Query: 235 LSE-MEVE-LKELDQFKKENQRLK 256
           L E +E+E  K  +Q  + N  L+
Sbjct: 162 LEESLELEREKFEEQLHEANLDLE 185



 Score = 27.5 bits (61), Expect = 7.9
 Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQF 248
           ++ AE+   +   +K EE L        + SL    + ER   E +L E  ++L+  +  
Sbjct: 135 QNTAEIIEKKRENNKNEERLKFENEKKLEESL----ELEREKFEEQLHEANLDLEFKENE 190

Query: 249 K 249
           +
Sbjct: 191 E 191


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.0 bits (81), Expect = 0.038
 Identities = 32/136 (23%), Positives = 53/136 (38%), Gaps = 16/136 (11%)

Query: 127 RMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGEL----- 181
            +SR  S    +  R EA  E  E     LS   + + +E      +L     EL     
Sbjct: 723 ELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782

Query: 182 ---DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLSE 237
              + +   E+ + EL  LRE L +   EL  +              RER   +ER+++ 
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEA-------ANLRERLESLERRIAA 835

Query: 238 MEVELKELDQFKKENQ 253
            E  L++L++  +E  
Sbjct: 836 TERRLEDLEEQIEELS 851



 Score = 34.6 bits (80), Expect = 0.051
 Identities = 19/83 (22%), Positives = 34/83 (40%), Gaps = 4/83 (4%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATY-SKNSLSEQEKRERRTMERKLSEMEVELKELD 246
           E   AE+  L E + + E EL  +     S        + E   +  +L E+E +  EL 
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914

Query: 247 QFKKENQRLKDENGALIRVISKL 269
           +   E + L+++   L   +  L
Sbjct: 915 R---ELEELREKLAQLELRLEGL 934



 Score = 31.6 bits (72), Expect = 0.55
 Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 7/65 (10%)

Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT----MERKLSEMEV 240
                 +  L+ LR  L +  EEL +++   SK S   +E  E R     +E +L  +EV
Sbjct: 880 NERASLEEALALLRSELEELSEELRELE---SKRSELRRELEELREKLAQLELRLEGLEV 936

Query: 241 ELKEL 245
            +  L
Sbjct: 937 RIDNL 941



 Score = 31.2 bits (71), Expect = 0.63
 Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 9/92 (9%)

Query: 187 YEESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEK------RERRTMERKLSEM 238
             E +  L+ LR+ L + EEEL Q++         +S   K       E   +E +++++
Sbjct: 693 IAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQL 752

Query: 239 EVELKELDQFKKE-NQRLKDENGALIRVISKL 269
             EL EL+   +E  +RL++    L    +++
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEI 784



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 23/107 (21%), Positives = 45/107 (42%), Gaps = 8/107 (7%)

Query: 173 KLPNENGEL-DYKKLYEESQAELSRLRETLSKTEEELNQVKAT--YSKNSLSEQEKRERR 229
           +L     EL + +   EE + +L  LR  +++ E ++  +       +  L   E R  R
Sbjct: 359 ELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRER 418

Query: 230 TMERKLSEM----EVELKELD-QFKKENQRLKDENGALIRVISKLSK 271
             +     +    E ELKEL  + ++  + L++    L R+   L +
Sbjct: 419 LQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE 465



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 28/89 (31%), Positives = 43/89 (48%), Gaps = 16/89 (17%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
                EE + EL  L+E L + EEEL             E+   E + +E KL E+ +E+
Sbjct: 230 LVLRLEELREELEELQEELKEAEEEL-------------EELTAELQELEEKLEELRLEV 276

Query: 243 KELDQFKKENQRLKDENGALIRVISKLSK 271
            EL+   +E + L+ E  AL   IS+L +
Sbjct: 277 SELE---EEIEELQKELYALANEISRLEQ 302



 Score = 28.9 bits (65), Expect = 3.5
 Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 2/111 (1%)

Query: 160 RSRVNSESGGSSRKLPNENGELD-YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN 218
              +  E   +  +L     EL   ++  EE + E+S L E + + ++EL  +    S+ 
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300

Query: 219 SLSEQEKRERR-TMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISK 268
              +Q  RER   +ER+L E+E +L+EL+    E      E    +  + +
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351


>gnl|CDD|225199 COG2317, COG2317, Zn-dependent carboxypeptidase [Amino acid
           transport and metabolism].
          Length = 497

 Score = 35.0 bits (81), Expect = 0.041
 Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM----EVELKE 244
           E+ AELS LR  L  + E    ++    +  LSE E    R ++R+  +     E  +KE
Sbjct: 45  EAIAELSVLRHELLTSPEFAELLEKAEEEKDLSEIEAGVVRELKREYEKAKKIPEELVKE 104

Query: 245 L 245
            
Sbjct: 105 Y 105


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 35.0 bits (81), Expect = 0.042
 Identities = 27/118 (22%), Positives = 42/118 (35%), Gaps = 16/118 (13%)

Query: 163 VNSESGGSSRKLPNENGELD--YKKL-YEESQAELSRLRETLSKTE--------EELNQV 211
           +N +     RK  NE    D   +KL   ES  ++ +L   L  T         +E   +
Sbjct: 15  LNEDVSAFQRKFVNEVRRCDEVERKLRKLES--KIKKLGIPLKDTGGKPDVPPSKEFLDL 72

Query: 212 KATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
           +         E E +E       L +   EL+E      E +   DEN   +  +S L
Sbjct: 73  EEEILD---LEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNL 127


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 33.8 bits (78), Expect = 0.044
 Identities = 20/96 (20%), Positives = 44/96 (45%), Gaps = 7/96 (7%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSE-QEKRE--RRTMER 233
            +   + ++L EE+QA L+   + L +  E+ +++     K +    +E +      +ER
Sbjct: 43  ADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELER 102

Query: 234 KLSEMEVELK-ELDQFKKENQRLKDENGALIRVISK 268
                E E++ E ++  +E   L+ E   L   I++
Sbjct: 103 IKEAAEAEIEAEKERALEE---LRAEVAELAVAIAE 135


>gnl|CDD|237833 PRK14845, PRK14845, translation initiation factor IF-2;
           Provisional.
          Length = 1049

 Score = 34.9 bits (80), Expect = 0.048
 Identities = 37/134 (27%), Positives = 49/134 (36%), Gaps = 17/134 (12%)

Query: 136 SLNSLRSEAPSEGYE-----SGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEES 190
           SL  LR    S+ Y      SG  NL   R  +     G S K   E  E   +K  +  
Sbjct: 265 SLLLLRFGIVSKIYRSTLIISGKRNLENFRKYI-----GFSVKEKAEALEKIIEKSKKSE 319

Query: 191 ----QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELD 246
                 EL RLR     T  EL+     YSK    E    E   +   L+ +E   K LD
Sbjct: 320 RYPINEELKRLRLLFGFTRNELSSNIPFYSKYESEEAPSYE--ILMEILNSIERGSKNLD 377

Query: 247 -QFKKENQRLKDEN 259
            +      +++D N
Sbjct: 378 KKIAVLEGKIRDHN 391


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 34.4 bits (80), Expect = 0.049
 Identities = 16/64 (25%), Positives = 34/64 (53%), Gaps = 3/64 (4%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
           +K  +     L   +E   + +EE+++VK +Y   +L+E E    R +E++L  +E +  
Sbjct: 309 EKNSDTLPDFLEHAKEQNKELKEEIDRVKQSY---TLNESELESVRQLEKQLESLEKQYD 365

Query: 244 ELDQ 247
           E+ +
Sbjct: 366 EITE 369



 Score = 28.6 bits (65), Expect = 4.2
 Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 24/100 (24%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEKRER-----RTMERK 234
             ++  E+   EL  L E+  K  EE+ Q+K  Y   + SL     R         +E++
Sbjct: 116 LIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLL--ANRFSFGPALDELEKQ 173

Query: 235 LSEMEVELKELDQFKKEN------------QRLKDENGAL 262
           L  +E   +E  QF +               +L++E  AL
Sbjct: 174 LENLE---EEFSQFVELTESGDYVEAREILDQLEEELAAL 210


>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
           protein. 
          Length = 226

 Score = 33.9 bits (78), Expect = 0.057
 Identities = 24/90 (26%), Positives = 36/90 (40%)

Query: 28  PQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHA 87
           P S   +SYS G +SS   +  S  S  S+    S+   GL+  S  S  + S   S+  
Sbjct: 108 PNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASSRTSSWVR 167

Query: 88  PPSSYRSSYAPTTTPLPPVRPRRSKSSTRS 117
             +S    + P       V P  S +S+  
Sbjct: 168 SQNSNLEPFMPGALQTAWVTPPSSTASSSG 197


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 34.1 bits (78), Expect = 0.057
 Identities = 37/117 (31%), Positives = 51/117 (43%), Gaps = 5/117 (4%)

Query: 6   PPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGY 65
            PR SS R  S      S+S  P  R L++ +S  SSS     SS  S+G +    SS  
Sbjct: 16  SPRSSSRRRLSSSFLSTSASSRP--RRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLS 73

Query: 66  GGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTP 122
            G  L S +  PS+ SP +  +   S  ++         P R RR+ SS  S+   P
Sbjct: 74  RGR-LSSRFVSPSRGSPSAAASLNGSLATASTS--GSSSPSRSRRTTSSDLSSGNGP 127


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 34.0 bits (79), Expect = 0.061
 Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 17/60 (28%)

Query: 222 EQEKRERRTMERKLSEMEVE----LKELDQFKKENQRLK-------------DENGALIR 264
            + + + R +E KL ++E E     +EL++ K E ++LK             D+   +++
Sbjct: 11  SELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKSPPLIVATVLEVLDDGRVVVK 70


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 33.9 bits (78), Expect = 0.077
 Identities = 16/55 (29%), Positives = 23/55 (41%)

Query: 69  TLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPE 123
           +  S  S PS +SPR     PS+    + P+  PLP +      S T     T +
Sbjct: 19  SSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQ 73


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 34.0 bits (78), Expect = 0.081
 Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 8/66 (12%)

Query: 196 RLRETLSKTEEELNQVKATYSKNSLSEQEKR---ERRTMERKLSEMEVELKELDQFKKEN 252
           + +    + +EE+N +        LS  EK    +   +E+ L E E   KEL+Q  +E 
Sbjct: 501 QAKTFYGEFKEEINVLIE-----KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEEL 555

Query: 253 QRLKDE 258
           +  +  
Sbjct: 556 KERERN 561



 Score = 32.5 bits (74), Expect = 0.28
 Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 5/92 (5%)

Query: 170 SSRKLPNENGELDYK-----KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE 224
             ++L  +N  L+       KL +E + E+  L+E     + EL +      K    E E
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVE 582

Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLK 256
              R   E+K+ + +      D  K +  + K
Sbjct: 583 SIIRELKEKKIHKAKEIKSIEDLVKLKETKQK 614



 Score = 30.6 bits (69), Expect = 1.0
 Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYS---KNSLSEQEKRERRTMERKLSEM 238
             +K  E+    L +L +   K ++EL Q         +N   E EK  +  ++    E+
Sbjct: 522 ALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEV 581

Query: 239 EVELKELDQFKKEN 252
           E  ++EL + KK +
Sbjct: 582 ESIIREL-KEKKIH 594


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 33.9 bits (78), Expect = 0.098
 Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 1/83 (1%)

Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR 229
             R+L  E  E+D K     S+ ++++L E L + + EL + +A   +     +  +E  
Sbjct: 399 ELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAI 458

Query: 230 TMERKLSEMEVELKELDQFKKEN 252
              RK  + + + K  + F+ E 
Sbjct: 459 EALRKTLDEKTKQKI-NAFELER 480


>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737).  This
           family consists of several uncharacterized mammalian
           proteins of unknown function.
          Length = 187

 Score = 32.8 bits (75), Expect = 0.099
 Identities = 39/150 (26%), Positives = 62/150 (41%), Gaps = 12/150 (8%)

Query: 127 RMSRSGSTTSLNSLRSEAP------SEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGE 180
           RM      +S               SE        L  +       S  ++  L +   +
Sbjct: 22  RMKEPSQPSSKPQPSPGFGPSSVPESELRRLIAGELKGALEDAKRPSEETAGGLQSSEVK 81

Query: 181 LDYKKLYEESQA----ELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
            D  K YE+ QA    EL+R+ +   +  EE         K S +EQE+++ + + R+L 
Sbjct: 82  EDLLKRYEQEQAIVQEELARIAKREREAAEEQLSRAVLREKAS-AEQERQKAKHLARQLE 140

Query: 237 EMEVELKELDQFKKEN-QRLKDENGALIRV 265
           E E ELK LD F KE   RL+++N    ++
Sbjct: 141 EKEAELKRLDAFYKEQLARLEEKNSEFYKL 170


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 32.8 bits (75), Expect = 0.10
 Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 24/92 (26%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
            +  EE Q ELS L++ LS+ +EEL                      +E +++E+E EL+
Sbjct: 71  PQKLEELQGELSELKQQLSELQEELED--------------------LEERIAELESELE 110

Query: 244 EL----DQFKKENQRLKDENGALIRVISKLSK 271
           +L       ++  + L++   +L   I +L+K
Sbjct: 111 DLKEDLQLLRELLKSLEERLESLEESIKELAK 142


>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
            Mg-chelatases [Coenzyme metabolism].
          Length = 1388

 Score = 33.9 bits (78), Expect = 0.10
 Identities = 19/76 (25%), Positives = 29/76 (38%), Gaps = 2/76 (2%)

Query: 97   APTTTPLPPVRPRRSKSSTRSANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNL 156
              + TP  P     +  ST S+  +    G  + SG+ ++  SL +     G   G    
Sbjct: 1294 PASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSLGAAGSVTGAGEG--YE 1351

Query: 157  STSRSRVNSESGGSSR 172
             T  +   SES G S 
Sbjct: 1352 MTKEAVSGSESTGMSF 1367


>gnl|CDD|220133 pfam09177, Syntaxin-6_N, Syntaxin 6, N-terminal.  Members of this
           family, which are found in the amino terminus of various
           SNARE proteins, adopt a structure consisting of an
           antiparallel three-helix bundle. Their exact function
           has not been determined, though it is known that they
           regulate the SNARE motif, as well as mediate various
           protein-protein interactions involved in
           membrane-transport.
          Length = 96

 Score = 31.1 bits (71), Expect = 0.14
 Identities = 16/74 (21%), Positives = 36/74 (48%), Gaps = 4/74 (5%)

Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR 229
           S  ++ +++   +  +L +E +  L  +   L   EE ++ V+A   K  L E+E   RR
Sbjct: 23  SWLRIRSDSNSPELDELTKELRNALESIEWDLEDLEEAVSVVEANPEKFGLDEEEISRRR 82

Query: 230 ----TMERKLSEME 239
                +  +++E++
Sbjct: 83  RFVAEIRNQINEVK 96


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 31.5 bits (72), Expect = 0.14
 Identities = 17/61 (27%), Positives = 34/61 (55%), Gaps = 8/61 (13%)

Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
            L  E QA++    + +++ E+EL + +A        + +K E++  +RKL+E + EL+E
Sbjct: 63  GLRAERQAKVEEKEQEVAEREQELKEAQAK------GDADKIEKK--QRKLAEAQAELQE 114

Query: 245 L 245
            
Sbjct: 115 A 115



 Score = 27.6 bits (62), Expect = 2.9
 Identities = 20/81 (24%), Positives = 40/81 (49%), Gaps = 12/81 (14%)

Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
           +++Y K +  +Q  ++ L       E+ L +V+A  +   L  + + +    E++++E E
Sbjct: 32  QIEYAKAHG-NQHRIAGL-------EKALKEVQAHCTDAGLRAERQAKVEEKEQEVAERE 83

Query: 240 VELKEL----DQFKKENQRLK 256
            ELKE     D  K E ++ K
Sbjct: 84  QELKEAQAKGDADKIEKKQRK 104


>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
           family.  Included in this family of heterogeneous
           ribonucleoproteins are PTB (polypyrimidine tract binding
           protein ) and hnRNP-L. These proteins contain four RNA
           recognition motifs (rrm: pfam00067).
          Length = 481

 Score = 33.2 bits (76), Expect = 0.14
 Identities = 15/57 (26%), Positives = 21/57 (36%)

Query: 50  SYPSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPV 106
            +  Y S  G  +   GG  +   +  PS+  P    AP +   SSY P       V
Sbjct: 221 GHDGYSSHGGPLAPLAGGDRMGPPHGPPSRYRPAYEAAPLAPAISSYGPAGGGPGSV 277


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 32.6 bits (74), Expect = 0.16
 Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 8/76 (10%)

Query: 181 LDYKKLYEESQAELSRLRETLSKTEEELNQV----KATYSKNSLSEQEKRER----RTME 232
            + K+  EE Q E   L + L + E E  +V    K    +NS  E+  ++       ++
Sbjct: 138 EELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLK 197

Query: 233 RKLSEMEVELKELDQF 248
           ++  E+E  ++  ++ 
Sbjct: 198 KRWDELEPGVELPEEE 213


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 33.0 bits (76), Expect = 0.18
 Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
             KL  E + EL   R  L + E  L Q + T  +   S  +K E   +E+K  E+  + 
Sbjct: 60  VHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEE--NLEKKEKELSNKE 117

Query: 243 KELDQFKKENQRLKDENGALIRVISKLS 270
           K LD+ ++E + L  E    +  IS L+
Sbjct: 118 KNLDEKEEELEELIAEQREELERISGLT 145


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 31.9 bits (73), Expect = 0.18
 Identities = 25/101 (24%), Positives = 44/101 (43%), Gaps = 18/101 (17%)

Query: 162 RVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETL--SKTEEELNQVKATYSKNS 219
           R +S SG  +     +N    YK L +  + E+  L + L  +K  EE  ++K T    S
Sbjct: 37  RFDSLSGKLNLDKFRKN----YKFLDDYREKEIEELEKALKKTKDSEEKEELKRTL--QS 90

Query: 220 LSEQEK-RERRTMERKLSEMEVELKELDQFKKENQRLKDEN 259
           +  + K  + +  ER++         L + KK+ + L  E 
Sbjct: 91  MKSRLKTLKNKDREREI---------LKEHKKQEKELIKEG 122


>gnl|CDD|184697 PRK14475, PRK14475, F0F1 ATP synthase subunit B; Provisional.
          Length = 167

 Score = 31.8 bits (72), Expect = 0.18
 Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 13/74 (17%)

Query: 180 ELD-YKKLYEESQAELSRLR-----------ETLSKTEEELNQVKATYSKNSLSEQEKRE 227
           ELD  ++L EE+QA L+ ++             L+  + +  +++A  +K  L EQ KR 
Sbjct: 49  ELDEAQRLREEAQALLADVKAEREEAERQAAAMLAAAKADARRMEAE-AKEKLEEQIKRR 107

Query: 228 RRTMERKLSEMEVE 241
               ERK+++ E +
Sbjct: 108 AEMAERKIAQAEAQ 121


>gnl|CDD|143630 cd07892, PolyPPase_VTC2-3_like, Polyphosphate(polyP) polymerase
           domain of yeast vacuolar transport chaperone (VTC)
           proteins VTC-2, and -3 , and similar proteins.
           Saccharomyces cerevisiae VTC-1, -2, -3, and -4 comprise
           the membrane-integral VTC complex. VTC-2, -3, and -4
           contain polyP polymerase domains. S. cerevisiae
           VTC-2,and -3 belong to this subgroup. For VTC4 it has
           been shown that this domain generates polyP from ATP by
           a phosphotransfer reaction releasing ADP. This activity
           is metal ion-dependent. The ATP gamma phosphate may be
           cleaved and then transferred to an acceptor phosphate to
           form polyP. PolyP is ubiquitous. In prokaryotes, it is a
           store of phosphate and energy. In eukaryotes, polyPs
           have roles in  bone calcification, and osmoregulation,
           and in phosphate transport in the symbiosis of
           mycorrhizal fungi and plants. This subgroup belongs to
           the CYTH/triphosphate tunnel metalloenzyme (TTM)-like
           superfamily, whose enzymes have a unique active site
           located within an eight-stranded beta barrel.
          Length = 303

 Score = 32.3 bits (74), Expect = 0.19
 Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 24/101 (23%)

Query: 176 NENGELDYKKLYEESQAELSRLR-------------------ETLSKTEEELNQVKATYS 216
           N N +L   KL + ++A   RLR                   E  S  EE+  Q+K  Y 
Sbjct: 63  NPNFDLYNDKLLKLNEAPTLRLRWTGKLSDKPDIFVEKKTFDENTSSFEEDKLQLKEKYI 122

Query: 217 KNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKD 257
              +  + K      E+KL +ME    +L+  KK+ + ++D
Sbjct: 123 NGFIFGKYK-----FEKKLQKMEKRGADLENLKKDVENIQD 158


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 32.3 bits (74), Expect = 0.20
 Identities = 17/70 (24%), Positives = 30/70 (42%), Gaps = 13/70 (18%)

Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKL-------SEMEVELK 243
             EL R +E L K  +E+           L E E  E + +E K+       SE+  E+ 
Sbjct: 203 PTELDRAKEKLKKLLQEI-----MIKVKKLEELE-EELQELESKIEDLTNKKSELNTEIA 256

Query: 244 ELDQFKKENQ 253
           E ++  ++ +
Sbjct: 257 EAEKKLEQCR 266


>gnl|CDD|225939 COG3404, COG3404, Methenyl tetrahydrofolate cyclohydrolase [Amino
           acid transport and metabolism].
          Length = 208

 Score = 31.6 bits (72), Expect = 0.32
 Identities = 17/54 (31%), Positives = 27/54 (50%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
           E+ + + K++ EE Q   + L   + K EE  N + A Y     +E+EK  RR 
Sbjct: 53  EDYDDEMKEILEELQKIKAELLALIDKDEEAFNLIMAAYKLPKSTEEEKAARRK 106


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 30.7 bits (70), Expect = 0.33
 Identities = 14/74 (18%), Positives = 31/74 (41%), Gaps = 4/74 (5%)

Query: 190 SQAELSRLRETLSKTEEELNQVKAT--YSKNSLSEQEKR--ERRTMERKLSEMEVELKEL 245
           +  E+ +         E+  + K +       +S +EK   E   +E++L E+E EL   
Sbjct: 9   TDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARR 68

Query: 246 DQFKKENQRLKDEN 259
           +   +  + L ++ 
Sbjct: 69  ELKAEAKKMLSEKG 82


>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
          Length = 420

 Score = 31.7 bits (72), Expect = 0.33
 Identities = 26/91 (28%), Positives = 33/91 (36%), Gaps = 5/91 (5%)

Query: 10  SSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGT-----SSG 64
            SGRT S       +  S  +   SSY +  + ST    S      ST GG+     S+ 
Sbjct: 329 RSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGGSTY 388

Query: 65  YGGLTLPSSYSRPSQSSPRSYHAPPSSYRSS 95
            G  T   S    S S   SY  P  +   S
Sbjct: 389 GGSSTFDGSSRGSSDSFGVSYFGPQQTVGFS 419


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 30.9 bits (70), Expect = 0.34
 Identities = 14/78 (17%), Positives = 34/78 (43%), Gaps = 3/78 (3%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATY---SKNSLSEQEKRERRTMERKLSEMEV 240
           +K YEE+     + R+ L +    L          +  + +  ++  ++  ER  ++ E 
Sbjct: 56  RKQYEEAIELAKKQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNES 115

Query: 241 ELKELDQFKKENQRLKDE 258
             K  ++ +KE ++L+  
Sbjct: 116 LQKRNEELEKELEKLRQR 133



 Score = 28.2 bits (63), Expect = 3.0
 Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 12/62 (19%)

Query: 222 EQEKRERRTMERKLSEMEVE------------LKELDQFKKENQRLKDENGALIRVISKL 269
           E  K++R+ ++R+   + ++                   +KEN+RLK++N +L +   +L
Sbjct: 64  ELAKKQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNESLQKRNEEL 123

Query: 270 SK 271
            K
Sbjct: 124 EK 125


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 31.3 bits (71), Expect = 0.37
 Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 5/83 (6%)

Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELD 246
            E   A+   + E   K  E   +      K    + E  E    E      E    +++
Sbjct: 3   KECKDAKHENMEEDCCKENENKEE-----DKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57

Query: 247 QFKKENQRLKDENGALIRVISKL 269
           + K EN +LK+EN  L   +  L
Sbjct: 58  ELKDENNKLKEENKKLENELEAL 80


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 31.9 bits (72), Expect = 0.43
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 4/80 (5%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
            +   E+ Q  + RL++TL   +EE+ ++        LS + + E      ++ E E + 
Sbjct: 133 LRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERLGRIREAEDQQ 192

Query: 243 KEL----DQFKKENQRLKDE 258
             L    +Q +KE+  L++E
Sbjct: 193 SHLEVLLEQKEKEHMMLREE 212



 Score = 30.4 bits (68), Expect = 1.0
 Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 1/75 (1%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
           E +Q E   L++ L + + +L     +  K   S + KRER   + + + + V  ++L  
Sbjct: 6   ENAQRENELLKKELERKQSKLGSSMNSI-KTFWSPELKRERALRKDEAARITVWKEQLRV 64

Query: 248 FKKENQRLKDENGAL 262
            ++ENQ  +    AL
Sbjct: 65  TQEENQHAQLTVQAL 79


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 30.9 bits (70), Expect = 0.47
 Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 184 KKLYEESQ-AELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM 238
            +L  E Q A+L +L+E   K ++EL +         + E + +E+   ER+ +E+
Sbjct: 43  TELAREKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQEREKTEI 98


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 31.4 bits (72), Expect = 0.47
 Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 6/80 (7%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS---EQEKRERRTMERKLSEME 239
           YK   EE Q   +R  + L   +EE+ +++      SL    +  K+++ ++ER+L+E+E
Sbjct: 193 YKSKLEELQQAAARNGDALRSAKEEITELRRQI--QSLEIELQSLKKQKASLERQLAELE 250

Query: 240 VEL-KELDQFKKENQRLKDE 258
                EL  ++     L++E
Sbjct: 251 ERYELELADYQDTISELEEE 270


>gnl|CDD|215532 PLN02982, PLN02982, galactinol-raffinose
           galactosyltransferase/ghydrolase, hydrolyzing O-glycosyl
           compounds.
          Length = 865

 Score = 31.3 bits (71), Expect = 0.52
 Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQV-----KATYSKNSLSEQEK 225
            + +  E+ E   KK  E    +LS     + + ++EL+ +     K+  S++  S   K
Sbjct: 327 YKAIEREHAEKARKKAIESGVTDLSEFDAKIKQLKKELDAMFDGEEKSVSSESESSGSCK 386

Query: 226 RERRTMERKLSEMEVELKELD 246
                M+    ++  + K LD
Sbjct: 387 VSGSGMKAFTRDLRTKFKGLD 407


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 31.6 bits (71), Expect = 0.52
 Identities = 23/92 (25%), Positives = 31/92 (33%), Gaps = 7/92 (7%)

Query: 52  PSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLP------- 104
              GS   G +SG            P  ++P +   P S+  ++ AP+  P P       
Sbjct: 380 APSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDA 439

Query: 105 PVRPRRSKSSTRSANYTPEGGGRMSRSGSTTS 136
           P  P RS    R A   PE         S  S
Sbjct: 440 PPAPPRSGIPPRPAPRMPEASPVPGAPDSVAS 471


>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
           protein.  This family of transmembrane coiled-coil
           containing proteins is conserved from worms to humans.
           Its function is unknown.
          Length = 387

 Score = 31.3 bits (71), Expect = 0.56
 Identities = 34/169 (20%), Positives = 67/169 (39%), Gaps = 28/169 (16%)

Query: 122 PEGGGR-MSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGE 180
           PE   R +S S +        S+  SE          +S +  ++ +  +S   P     
Sbjct: 150 PEENPRNLSGSATLVLSPKYGSDDSSE---------CSSATSSSAGANSNSGSGPGSAVA 200

Query: 181 LDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN----SLSEQEKRERRT-MERKL 235
                  ++   EL  ++E  S+ EE   ++K    ++    + S QE+R R   +E +L
Sbjct: 201 QQQSLNLDKLLEELREIKEGQSRLEESYERLKEQLQRDYQYITQSLQEERYRYERLEEQL 260

Query: 236 SE-MEVELKELDQFKKE------------NQRLKDENGALIRVISKLSK 271
           ++  E+   E+   K+E             +R +D   A+    +++SK
Sbjct: 261 NDLTELHQNEIANLKQELASMEEKVAYQSYERARDIQEAMESCQTRISK 309


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 31.1 bits (71), Expect = 0.58
 Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 6/71 (8%)

Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE-KRERRTMERK 234
             +GE   K+++E    E  R  +          +++    +    EQ    ER+ +  K
Sbjct: 296 KNSGEPSLKEIHEARLNEEERELKKKFTE-----KIREKEKRLEELEQNLIEERKELNSK 350

Query: 235 LSEMEVELKEL 245
           L E++ +L++L
Sbjct: 351 LEEIQKKLEDL 361



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 10/89 (11%)

Query: 184 KKLYEESQAE-LSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
             LYE  + E LS L+ +   + +E+++ +    +  L ++   + R  E++L E+E  L
Sbjct: 281 NLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNL 340

Query: 243 KELDQFKKENQRLKDENGALIRVISKLSK 271
            E           K+ N  L  +  KL  
Sbjct: 341 IE---------ERKELNSKLEEIQKKLED 360


>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
           transduction.  This family is a four helix bundle that
           operates as a ubiquitous sensory module in prokaryotic
           signal-transduction. The 4HB_MCP is always found between
           two predicted transmembrane helices indicating that it
           detects only extracellular signals. In many cases the
           domain is associated with a cytoplasmic HAMP domain
           suggesting that most proteins carrying the bundle might
           share the mechanism of transmembrane signalling which is
           well-characterized in E coli chemoreceptors.
          Length = 181

 Score = 30.3 bits (69), Expect = 0.64
 Identities = 14/76 (18%), Positives = 31/76 (40%), Gaps = 2/76 (2%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
            + +     AE   L + + +   E++++   Y K  L+E+EK+     + +L       
Sbjct: 66  LELILTTDPAERDELLKDIEELRAEIDKLLKKYEKTILTEEEKKLFNEFKEQLKAYRKVR 125

Query: 243 KELDQFKKENQRLKDE 258
            ++    K  +   DE
Sbjct: 126 NKVLDLAKAGK--NDE 139


>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 233

 Score = 30.6 bits (69), Expect = 0.70
 Identities = 23/67 (34%), Positives = 27/67 (40%), Gaps = 1/67 (1%)

Query: 59  GGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYA-PTTTPLPPVRPRRSKSSTRS 117
           G   S       P   + P+ + P    A P S+RSS A PTT P P     RS S    
Sbjct: 82  GSFLSNAFKWGTPQEPAPPANAPPPKEPAAPPSWRSSPAGPTTQPSPAAANTRSSSFLGG 141

Query: 118 ANYTPEG 124
           A  T  G
Sbjct: 142 ALQTAAG 148


>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family.  This family of
           eukaryotic proteins includes 5' nucleotidase enzymes,
           such as purine 5'-nucleotidase EC:3.1.3.5.
          Length = 448

 Score = 31.1 bits (71), Expect = 0.71
 Identities = 11/59 (18%), Positives = 22/59 (37%)

Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM 238
           E++     +    EL RL   L +  + L+      +      +   E R + R + E+
Sbjct: 329 EIEIWNTEQYRFEELQRLDILLERLYDHLDVHAELSTLRPDLSELDAEIRELRRAMKEL 387


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 30.7 bits (70), Expect = 0.71
 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 4/62 (6%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE----KRERRTMERKLSEMEVELK 243
              QA+L  LR+ L    EEL   +    K  +S  E    +RER   + +L  +E EL+
Sbjct: 161 AGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELE 220

Query: 244 EL 245
            L
Sbjct: 221 VL 222


>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 28.3 bits (64), Expect = 0.75
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 10/59 (16%)

Query: 221 SEQEKRERRTM----------ERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
            E EKR RR            ERK +E+E   ++++Q + EN+RLK E   L R + KL
Sbjct: 1   EEDEKRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKL 59



 Score = 26.4 bits (59), Expect = 3.6
 Identities = 15/52 (28%), Positives = 30/52 (57%), Gaps = 9/52 (17%)

Query: 217 KNSLSEQEKRERRT-----MERKLSEMEVE----LKELDQFKKENQRLKDEN 259
           +N  + +  RER+      +ERK+ ++E E     KE+++ ++E ++LK E 
Sbjct: 12  RNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL 63


>gnl|CDD|214378 CHL00160, rpl9, ribosomal protein L9; Provisional.
          Length = 153

 Score = 29.7 bits (67), Expect = 0.77
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 3/76 (3%)

Query: 197 LRETLSK--TEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQR 254
           L+E + K      + +VK+ Y++N L   +  +  T    L + ++  K LD   KE + 
Sbjct: 10  LKENIQKLGKSGTVIKVKSGYARNYLIPNKMAKVATQGS-LKQQKMYQKILDLKLKEAKE 68

Query: 255 LKDENGALIRVISKLS 270
              +   L+  I K S
Sbjct: 69  KCLKVKQLLEEIQKFS 84


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 30.2 bits (69), Expect = 0.78
 Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 6/90 (6%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSK--NSLSEQEKRERRTMERKLSEMEVE 241
            KL  E++ EL   R  L + E+ L Q + T  +   SL   EK+E  ++E K  E+   
Sbjct: 63  HKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESL---EKKEE-SLEEKEKELAAR 118

Query: 242 LKELDQFKKENQRLKDENGALIRVISKLSK 271
            ++L++ ++E + L +E    +  IS L+ 
Sbjct: 119 QQQLEEKEEELEELIEEQQQELERISGLTA 148


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This
           family consists of several hypothetical bacterial
           sequences. The function of this family is unknown.
          Length = 106

 Score = 29.2 bits (66), Expect = 0.79
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 10/48 (20%)

Query: 231 MERKLSEMEVELKELDQFKK-------ENQRLKDENGALIRVISKLSK 271
           +E++L+++   L EL + K+       EN  L+ EN  L   + +L +
Sbjct: 13  LEQQLNQL---LAELGELKQQLAELLEENAELRIENEHLRERLEELEQ 57


>gnl|CDD|221833 pfam12889, DUF3829, Protein of unknown function (DUF3829).  This is
           a small family of proteins from several bacterial
           species, whose function is not known. It may, however,
           be related to the GvpL_GvpF family of proteins,
           pfam06386.
          Length = 277

 Score = 30.5 bits (69), Expect = 0.83
 Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 4/81 (4%)

Query: 184 KKLYEESQAELSRLRETLSK----TEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
           K   E    +L  L++   K    T E L + +   +  S  +  K++   ++  L+E+E
Sbjct: 141 KINDERLLEQLEELKKEGKKIAYYTLELLLEAEQLLNLFSSKDVAKKDAADLDAALAELE 200

Query: 240 VELKELDQFKKENQRLKDENG 260
             L++L +  KE+   +  + 
Sbjct: 201 KALEDLKKLTKEDGDEEKASS 221


>gnl|CDD|238303 cd00542, Ntn_PVA, Penicillin V acylase (PVA), also known as
           conjugated bile salt acid hydrolase (CBAH), catalyzes
           the hydrolysis of penicillin V to yield 6-amino
           penicillanic acid (6-APA), an important key intermediate
           of semisynthetic penicillins.  PVA has an N-terminal
           nucleophilic cysteine, as do other members of the Ntn
           hydrolase family to which PVA belongs.  This
           nucleophilic cysteine is exposed by post-translational
           prossessing of the PVA precursor. PVA forms a
           homotetramer.
          Length = 303

 Score = 30.6 bits (70), Expect = 0.84
 Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 4/47 (8%)

Query: 32  NLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPS 78
           NL +Y   LS   P    +    G        G G L LP  Y+ PS
Sbjct: 184 NLRNYI-NLSPEPP---KNVKLGGVNLTAFGQGSGTLGLPGDYTPPS 226


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 30.8 bits (69), Expect = 0.85
 Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 1/94 (1%)

Query: 162 RVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS 221
           + ++E   S R+   +  E+  + L E+     +RLRE LS TE  LN+ +  ++K  +S
Sbjct: 552 QESTEEAASLRQELTQQQEVYGQALQEKVAEVETRLREQLSDTERRLNEARREHAKAVVS 611

Query: 222 EQEKRERRTMERKLSEMEVELKELDQFKKENQRL 255
            ++ + +   E++ ++    L+E +  K+E QRL
Sbjct: 612 LRQIQRQAAQEKERNQELRRLQE-EARKEEGQRL 644


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 30.1 bits (68), Expect = 0.91
 Identities = 14/78 (17%), Positives = 33/78 (42%), Gaps = 2/78 (2%)

Query: 181 LDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEV 240
           L  K    +   E   L + +++ E    ++ +  ++      E + +  + ++    E 
Sbjct: 99  LSLKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEK--KEA 156

Query: 241 ELKELDQFKKENQRLKDE 258
           E KE+ + K E  ++K E
Sbjct: 157 EDKEIARLKSEASKIKQE 174


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 30.8 bits (70), Expect = 0.98
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 4/87 (4%)

Query: 172 RKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE--RR 229
           R+L  E  EL  +   EE++A L      + + + EL+Q+++   +    ++E+ E  R+
Sbjct: 464 RRLEAEKDEL--QAALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEFENTRK 521

Query: 230 TMERKLSEMEVELKELDQFKKENQRLK 256
             +R +  ++  L+   + K E  RLK
Sbjct: 522 NHQRAIESLQATLEAEAKGKAEASRLK 548



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 32/133 (24%), Positives = 53/133 (39%), Gaps = 21/133 (15%)

Query: 147 EGYESGGDNLSTSRSRVNSESGGSSRKLPNE------------NGELDYKKL---YEESQ 191
           E  E+   NLS  +S++ S+   + R L  E              E D   L    EE  
Sbjct: 225 EEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEES 284

Query: 192 AELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE--RRTMERKLSEMEVELKEL---- 245
              + L   LSK   E+ Q ++ +        E+ E  ++ + +K+SE+E   +      
Sbjct: 285 EAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKLNQKISELEEAAEAANAKC 344

Query: 246 DQFKKENQRLKDE 258
           D  +K   RL+ E
Sbjct: 345 DSLEKTKSRLQSE 357



 Score = 27.7 bits (62), Expect = 8.4
 Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 14/93 (15%)

Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQA-ELSRLRETLSKTEEELNQVK 212
           D L    + + S+      +L +EN +L  +    E+Q   LS+L+  L    EE  +  
Sbjct: 197 DELQRQLNDLTSQKS----RLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKR-- 250

Query: 213 ATYSKNSLSEQEKRERRTMERKLSEMEVELKEL 245
                    E+E RER  ++ +L ++E +L  L
Sbjct: 251 -------SLEEESRERANLQAQLRQLEHDLDSL 276


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 30.7 bits (69), Expect = 0.99
 Identities = 10/50 (20%), Positives = 14/50 (28%)

Query: 77   PSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGG 126
            P  + PR+        R    P+  P     P     S   A   P+   
Sbjct: 2593 PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 30.4 bits (68), Expect = 1.1
 Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 14/100 (14%)

Query: 173 KLPNENGELDYKKL-YEESQAELSRLRE-TLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
           KL NE   L  +K   E+  +EL    E     T+E+LN          L    +R  R 
Sbjct: 266 KLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLND---------LYHNHQRTVRE 316

Query: 231 MERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLS 270
            ER+L + +   +EL++  KE + L  E   L+    +L 
Sbjct: 317 KERELVDCQ---RELEKLNKERRLLNQEKTELLVEQGRLQ 353



 Score = 29.2 bits (65), Expect = 3.1
 Identities = 17/104 (16%), Positives = 39/104 (37%), Gaps = 9/104 (8%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
           E      +++ +  + EL  L+  L + E  L+++    ++    +  K++      +L 
Sbjct: 230 EAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELE 289

Query: 237 EMEV--------ELKELDQFKKENQRLKDENGALI-RVISKLSK 271
                       +L +L    +   R K+       R + KL+K
Sbjct: 290 LKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNK 333


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 30.2 bits (69), Expect = 1.1
 Identities = 21/103 (20%), Positives = 44/103 (42%), Gaps = 30/103 (29%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRERRT--------M 231
             ++  E+   EL+ L+E+  K  +E+ ++K  Y   + +L  +               +
Sbjct: 112 TIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAK-----SFSYGPALDEL 166

Query: 232 ERKLSEMEVELKELDQFKKEN------------QRLKDENGAL 262
           E++L E+E   +E +QF +               +L++E  AL
Sbjct: 167 EKQLDELE---EEFEQFVELTESGDYLEAREVLLKLEEETDAL 206



 Score = 30.2 bits (69), Expect = 1.2
 Identities = 16/64 (25%), Positives = 34/64 (53%), Gaps = 3/64 (4%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
           +K  ++    L   RE   +   EL++++ +Y+   L+E E    R +E++L E+E +  
Sbjct: 305 EKNIDKLTDFLEHAREQNKQLLLELDRLQQSYT---LNEDELETVRELEKQLEELEAQYD 361

Query: 244 ELDQ 247
           +L +
Sbjct: 362 QLVE 365


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 28/163 (17%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 20  SLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPSQ 79
                S + +S   S  S+ +++S P +        S     S         +    P  
Sbjct: 109 QFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQL 168

Query: 80  SSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGGRMSRSGSTTSLNS 139
           +      +P SSY+S      + L PV    ++S+ RS+ +     G   +   TT    
Sbjct: 169 TPSNKPASPSSSYQSPSYS--SSLGPVNSSGNRSNLRSSPWALRSSG--DKKDITTDEKY 224

Query: 140 LRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELD 182
           L         E+    +   +  + S +G ++      N    
Sbjct: 225 L---------ETFLAEVDEEQHMITSSAGKNATPPETINSFGS 258


>gnl|CDD|197361 cd09803, UBAN, polyubiquitin binding domain of NEMO and related
           proteins.  NEMO (NF-kappaB essential modulator) is a
           regulatory subunit of the kinase complex IKK, which is
           involved in the activation of NF-kappaB via
           phosporylation of inhibitory IkappaBs. This mechanism
           requires the binding of NEMO to ubiquinated substrates.
           Binding is achieved via the UBAN motif (ubiquitin
           binding in ABIN and NEMO), which is described in this
           model. This region of NEMO has also been named CoZi (for
           coiled-coil 2 and leucine zipper). ABINs (A20-binding
           inhibitors of NF-kappaB) are sensors for ubiquitin that
           are involved in regulation of apoptosis, ABIN-1 is
           presumed to inhibit signalling via the NF-kappaB route.
           The UBAN motif is also found in optineurin, the product
           of a gene associated with glaucoma, which has been
           characterized as a negative regulator of NF-kappaB as
           well.
          Length = 87

 Score = 28.5 bits (64), Expect = 1.1
 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 13/73 (17%)

Query: 191 QAELSRLRETLSKTEEELNQVKA------TYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
           Q ++  L+E +++ E +L  +         Y  +  +E+  RE      KL + + +L E
Sbjct: 21  QEDIDELKEEIAQQEADLETIPVLKAQAEIYKSDFEAERAARE------KLHQEKEQLAE 74

Query: 245 -LDQFKKENQRLK 256
            L+  ++ENQ LK
Sbjct: 75  QLEYLQRENQELK 87


>gnl|CDD|153307 cd07623, BAR_SNX1_2, The Bin/Amphiphysin/Rvs (BAR) domain of
           Sorting Nexins 1 and 2.  BAR domains are dimerization,
           lipid binding and curvature sensing modules found in
           many different proteins with diverse functions. Sorting
           nexins (SNXs) are Phox homology (PX) domain containing
           proteins that are involved in regulating membrane
           traffic and protein sorting in the endosomal system.
           SNXs differ from each other in their lipid-binding
           specificity, subcellular localization and specific
           function in the endocytic pathway. A subset of SNXs also
           contain BAR domains. The PX-BAR structural unit
           determines the specific membrane targeting of SNXs. This
           subfamily consists of SNX1, SNX2, and similar proteins.
           SNX1 and SNX2 are components of the retromer complex, a
           membrane coat multimeric complex required for endosomal
           retrieval of lysosomal hydrolase receptors to the Golgi.
           The retromer consists of a cargo-recognition subcomplex
           and a subcomplex formed by a dimer of sorting nexins
           (SNX1 and/or SNX2), which ensures efficient cargo
           sorting by facilitating proper membrane localization of
           the cargo-recognition subcomplex. BAR domains form
           dimers that bind to membranes, induce membrane bending
           and curvature, and may also be involved in
           protein-protein interactions.
          Length = 224

 Score = 29.9 bits (68), Expect = 1.1
 Identities = 20/81 (24%), Positives = 41/81 (50%), Gaps = 14/81 (17%)

Query: 190 SQAELSRLRETLSKTE-----EELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK- 243
           +Q  L++ RE  +K E     ++L+Q      +  + E E +  R  +++  E+   +K 
Sbjct: 128 AQQTLTKKREAKAKLELSGRTDKLDQA-----QQEIKEWEAKVDRG-QKEFEEISKTIKK 181

Query: 244 ELDQFKKENQRLKDENGALIR 264
           E+++F  E  R+KD    +I+
Sbjct: 182 EIERF--EKNRVKDFKDIIIK 200


>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
            Members of this protein family are phage lysis
           regulatory protein, including the well-studied protein
           LysB (lysis protein B) of Enterobacteria phage P2. For
           members of this family, genes are found in phage or in
           prophage regions of bacterial genomes, typically near a
           phage lysozyme or phage holin.
          Length = 135

 Score = 29.3 bits (66), Expect = 1.2
 Identities = 17/80 (21%), Positives = 26/80 (32%), Gaps = 5/80 (6%)

Query: 190 SQAELSRLRETLSKTEEELNQVKATYSK-----NSLSEQEKRERRTMERKLSEMEVELKE 244
               L   R  L +    L   +A  +        L    KR      +   ++      
Sbjct: 17  QSQRLRNARADLERANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARAL 76

Query: 245 LDQFKKENQRLKDENGALIR 264
           L Q ++  +RLK EN  L R
Sbjct: 77  LAQREQRIERLKRENEDLRR 96


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 29.2 bits (66), Expect = 1.2
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 3/53 (5%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQV---KATYSKNSLSEQEKRERRTME 232
            ++     Q E+   RE L +   EL  +   K    K   +E+ KRE++ M+
Sbjct: 80  QQQELALLQQEVEAKRERLLEARRELKALEKLKEKKQKEYRAEEAKREQKEMD 132


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 17/83 (20%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 172 RKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM 231
           R+L +E    D  +L E+++ EL  L   +S+  ++L +++    + +++ +        
Sbjct: 195 RQLADELNLCDPLEL-EKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTN---- 249

Query: 232 ERKLSEMEVELKELDQFKKENQR 254
             K SE+  E+ E ++ ++E + 
Sbjct: 250 --KKSELLEEIAEAEKIREECRG 270


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 17/60 (28%), Positives = 37/60 (61%), Gaps = 7/60 (11%)

Query: 192 AELSRLRETLSKTEEELNQVKATYS-KNSLS---EQEKRERRTMERKLSEMEVELKELDQ 247
           AEL+RL + L K E+E+++++   S +  ++   E+   + +    KL+E +V+L +L++
Sbjct: 814 AELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEK---EKLAEYQVKLAKLEE 870


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 13/81 (16%), Positives = 29/81 (35%), Gaps = 9/81 (11%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
              Y       +   + L  T ++L  V+A  +          E+  +   LSE   +  
Sbjct: 149 AIYYGALNPARAERIDALKATLKQLAAVRAEIA---------AEQAELTTLLSEQRAQQA 199

Query: 244 ELDQFKKENQRLKDENGALIR 264
           +L Q  +E ++   +  + + 
Sbjct: 200 KLAQLLEERKKTLAQLNSELS 220


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 29.2 bits (66), Expect = 1.3
 Identities = 9/38 (23%), Positives = 18/38 (47%), Gaps = 5/38 (13%)

Query: 215 YSKNSLSEQEKRERR-----TMERKLSEMEVELKELDQ 247
           Y + S  E +K E        +E ++ E+E+E++    
Sbjct: 96  YQRQSRKEAKKEEELQQELEELEARVGELELEIERQKA 133


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 30.3 bits (68), Expect = 1.3
 Identities = 38/173 (21%), Positives = 69/173 (39%), Gaps = 10/173 (5%)

Query: 17  RPRSLYS-----SSPSP---QSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGL 68
           RP+ L S     SS  P    SR+LS+    + SS P  +SS           + G    
Sbjct: 43  RPQGLPSLWAAVSSQQPIKLASRDLSNDEMMMVSSDPPKSSSEMEGEMLAPQATVGRDEA 102

Query: 69  TLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGGR- 127
           T   +         R+    P++ +++Y+PT      V+     + +R+ N+     GR 
Sbjct: 103 TPSIAMENTPSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPATPSRALNHYISTSGRQ 162

Query: 128 MSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSR-VNSESGGSSRKLPNENG 179
             +S +      ++S +P++  E       +   R VNS +  +   +P  +G
Sbjct: 163 RVKSYTPKPRGEVKSSSPTQTREKVRKYTPSPLGRMVNSYAPSTFMTMPRSHG 215


>gnl|CDD|225256 COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic
           ion transport and metabolism].
          Length = 299

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 9/58 (15%)

Query: 45  PYLNSSYPS---------YGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYR 93
           PY+   YP+          G + GG  S Y GL  P  +      S   +  P  +  
Sbjct: 164 PYVEERYPTSADADGRVLAGDSLGGLVSLYAGLRHPERFGHVLSQSGSFWWTPLDTQP 221


>gnl|CDD|183704 PRK12726, PRK12726, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 407

 Score = 30.1 bits (67), Expect = 1.4
 Identities = 19/70 (27%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 178 NGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSE 237
           +G +D+++ +++S  +     + L K +E+L ++ A  ++ + S     ER+T E +LS 
Sbjct: 63  DGRMDFQETFQKSVVKNRPKTDDLLK-KEKLLEMLAAGAELAQSTPLLEERKTQEEELSA 121

Query: 238 MEVELKELDQ 247
           M +EL  L++
Sbjct: 122 MRLELAALNR 131


>gnl|CDD|226371 COG3853, TelA, Uncharacterized protein involved in tellurite
           resistance [Inorganic ion transport and metabolism].
          Length = 386

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 16/56 (28%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 199 ETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE-LDQFKKENQ 253
           ETL KT++ +  V A    + + E+ +++R    R+L ++E E+K+ L   K+   
Sbjct: 331 ETLKKTQQSV--VAAIEETSQIQEEGRKKRADTIRELDKLEEEIKKKLILLKQNIS 384


>gnl|CDD|218351 pfam04961, FTCD_C, Formiminotransferase-cyclodeaminase.  Members of
           this family are thought to be Formiminotransferase-
           cyclodeaminase enzymes EC:4.3.1.4. This domain is found
           in the C-terminus of the bifunctional animal members of
           the family.
          Length = 176

 Score = 29.1 bits (66), Expect = 1.6
 Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 13/59 (22%)

Query: 184 KKLYEESQAELSRLRETLSKTEEEL-----------NQVKATY--SKNSLSEQEKRERR 229
           KK YE+ + E+  + E   +  EEL           N V A Y   K +  E+  RE  
Sbjct: 44  KKKYEDVEEEMKEILEKAEELREELLALADEDAEAFNAVMAAYKLPKETEEEKAAREAA 102


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 30.0 bits (68), Expect = 1.6
 Identities = 27/83 (32%), Positives = 45/83 (54%), Gaps = 10/83 (12%)

Query: 180 ELDYKKLYEESQAELSRLR-ETLSKTEEEL-NQVKATYSKNSLSEQEKRE-RRTMER--K 234
           EL+  +L   ++ EL+RLR E   K  ++L   ++A   +N L+ Q +RE  R +E    
Sbjct: 191 ELELAQLSANNRQELARLRSELAKKRSQQLDAYLQAL--RNQLNSQRQREAERALESTEL 248

Query: 235 LSEMEVELKE--LDQFKKENQRL 255
           L+E   +L +  + QFK  N+ L
Sbjct: 249 LAEQSGDLPKSIVAQFKI-NREL 270


>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
           Provisional.
          Length = 653

 Score = 30.0 bits (67), Expect = 1.7
 Identities = 21/114 (18%), Positives = 38/114 (33%), Gaps = 10/114 (8%)

Query: 71  PSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRS----ANYTPEGG- 125
            SS+   S++   S        R+          P   RR+K++       A  T     
Sbjct: 59  KSSFEVVSETDSGSEAEAERGRRAGMGGRNKATKP--SRRNKTTQCRPTSLALATAATMP 116

Query: 126 --GRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNE 177
                 +S   +S  S+ S    E   +  ++     S  ++ES  S  +  +E
Sbjct: 117 ATPSSGKSPKVSSPPSIPS-LSEEDEGAERNSGGDDSSHTDNESTQSQPEADDE 169


>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper. 
          Length = 54

 Score = 27.2 bits (61), Expect = 1.7
 Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 5/46 (10%)

Query: 217 KNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENGAL 262
           +N+ + +  RE++    +  E+E  +KEL   ++EN +L+ +   L
Sbjct: 11  RNNEAARRSREKKKQREE--ELEERVKEL---EEENAQLRQKVEQL 51


>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
           unknown].
          Length = 543

 Score = 29.7 bits (67), Expect = 1.8
 Identities = 13/71 (18%), Positives = 24/71 (33%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
            E    L  L E +S  EE L ++          + E ++      K S +         
Sbjct: 406 PEIVERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNTKIT 465

Query: 248 FKKENQRLKDE 258
            +   +++K E
Sbjct: 466 LRNSIEKIKAE 476


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 15/80 (18%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLSEMEV 240
            Y     + Q  + R +  L +TEE    +KA           K ++R  + +K +    
Sbjct: 814 SYLACIIKLQKTIKREK-KLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQ 872

Query: 241 ELKELDQFKKENQRLKDENG 260
             + ++  +++ Q LK +  
Sbjct: 873 SAQRVELAERQLQELKIDVK 892



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 8/101 (7%)

Query: 175 PNENGELDYKKLYEESQAELSR----LRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
             +      KKL E  + E S     L +   ++ +   +      K ++  Q  +    
Sbjct: 821 KLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLL-KKETIYLQSAQRVEL 879

Query: 231 MERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
            ER+L E+++++K +   K  N  L+ E   +I +   LS 
Sbjct: 880 AERQLQELKIDVKSISSLKLVNLELESE---IIELKKSLSS 917


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.8 bits (67), Expect = 1.8
 Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 4/73 (5%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
           ++ Q  L    + L    ++L +VK        SE+ + + +  + K  E E + KE   
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDA---SEEAEAKAKEEKLKQEENEKKQKEQAD 421

Query: 248 FKKENQRLKDENG 260
             KE +R KDE  
Sbjct: 422 EDKE-KRQKDERK 433


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
           (LMP2) protein.  This family consists of several
           Gammaherpesvirus latent membrane protein (LMP2)
           proteins. Epstein-Barr virus is a human Gammaherpesvirus
           that infects and establishes latency in B lymphocytes in
           vivo. The latent membrane protein 2 (LMP2) gene is
           expressed in latently infected B cells and encodes two
           protein isoforms, LMP2A and LMP2B, that are identical
           except for an additional N-terminal 119 aa cytoplasmic
           domain which is present in the LMP2A isoform. LMP2A is
           thought to play a key role in either the establishment
           or the maintenance of latency and/or the reactivation of
           productive infection from the latent state. The
           significance of LMP2B and its role in pathogenesis
           remain unclear.
          Length = 489

 Score = 29.8 bits (67), Expect = 1.8
 Identities = 18/59 (30%), Positives = 21/59 (35%), Gaps = 3/59 (5%)

Query: 71  PSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGGRMS 129
           P S+  P      S    PSS+ SS+     P PPV       S R   Y    G R  
Sbjct: 14  PRSHGGPDGDEGDSNPYYPSSFGSSW---DRPGPPVPEDYDAPSHRPPPYGGSNGDRHG 69



 Score = 29.4 bits (66), Expect = 1.9
 Identities = 25/103 (24%), Positives = 32/103 (31%), Gaps = 26/103 (25%)

Query: 23  SSSPSPQSRNLSSYSSGLSSSTPYLNSSY-------PSYGSTYGGTSSGYGGLTL--PSS 73
            S+P   S    S+ S      P +   Y       P YG + G    GY  L    PS 
Sbjct: 26  DSNPYYPS----SFGSSWDRPGPPVPEDYDAPSHRPPPYGGSNGDRHGGYQPLGQQDPSL 81

Query: 74  YSR----------PSQSSPR---SYHAPPSSYRSSYAPTTTPL 103
           Y+           P   SPR   S H       +   P   P+
Sbjct: 82  YAGLGQNGGGGLPPPPYSPRDQGSEHVYEEPRDARMVPPWLPV 124


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 29.7 bits (66), Expect = 1.9
 Identities = 28/117 (23%), Positives = 52/117 (44%), Gaps = 13/117 (11%)

Query: 36  YSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSY-HAPPSSYRS 94
           Y  GL +  P   +S P+  +    T+        PS+   PS S P  +  +PP++   
Sbjct: 557 YIPGLFAGNPGSTNSTPTGSAASSNTT---FSSDSPSTVVAPSTSPPAGHLGSPPATPSK 613

Query: 95  SYAPTTTP------LPPVRPRRSKSSTRSA---NYTPEGGGRMSRSGSTTSLNSLRS 142
             +P+T+P       P   P   +SS + A     +PE   +++ + S+ S+ S+ +
Sbjct: 614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMST 670


>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related.  This family
           represents a region within kinesin-related proteins from
           higher plants. Many family members also contain the
           pfam00225 domain. Kinesins are ATP-driven microtubule
           motor proteins that produce directed force. Some family
           members are associated with the phragmoplast, a
           structure composed mainly of microtubules that executes
           cytokinesis in higher plants.
          Length = 488

 Score = 29.5 bits (66), Expect = 2.1
 Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 1/73 (1%)

Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL-KEL 245
           Y E Q + + L E   +  E +  VK   +K  +     R    +  +LS +  E  KE 
Sbjct: 346 YAELQEKYNDLLEKHRRIMEGIEDVKKAAAKAGVKGAGSRFANALAAELSALRAEREKER 405

Query: 246 DQFKKENQRLKDE 258
           +  KKEN+ L+ +
Sbjct: 406 ELLKKENKSLQIQ 418



 Score = 29.1 bits (65), Expect = 3.1
 Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 1/84 (1%)

Query: 186 LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKEL 245
           + EE  + ++  +   S  E     +K T     + E+ +  +      + E EV L+E+
Sbjct: 173 IDEEFLSLMNEHKLLKSDYENHPEVLKETIELKRIQEELESFKNFY-FDMGEKEVLLEEI 231

Query: 246 DQFKKENQRLKDENGALIRVISKL 269
              K + Q +   + +  R  S L
Sbjct: 232 QDLKSQLQCMLSSSKSTARTRSSL 255


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 30/137 (21%), Positives = 52/137 (37%), Gaps = 10/137 (7%)

Query: 107 RPRRSKSSTRSANYTPEGGGRM---SRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRV 163
                + + R      E   RM   +RS        L  +A     E     +   R + 
Sbjct: 295 IAEVVQEAVRKMESVAEEKMRMLKKARSALDACDRELEEKAR----EVSELKMERQRKKP 350

Query: 164 NSESGGSSRKLPNENGELDYKKLYEESQAELSRL-RETLSKTEEELNQVKATYSKNSLSE 222
             +   S  +L     ++ ++   +E++ E  RL R  L+KTE+   +  + Y K  LSE
Sbjct: 351 QIDELESIVRLKQAEADM-FQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSE 409

Query: 223 QEKRERRTMERKLSEME 239
            E+  +   E +L   E
Sbjct: 410 AEEERQYLFE-ELKLQE 425


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 29.2 bits (66), Expect = 2.1
 Identities = 10/64 (15%), Positives = 25/64 (39%), Gaps = 3/64 (4%)

Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
            ++L       ++K L +  +    + RET    E   ++ +    K    E+ K+++  
Sbjct: 208 YKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEK---KRKSKEEIKKKKPK 264

Query: 231 MERK 234
             + 
Sbjct: 265 ESKG 268


>gnl|CDD|222465 pfam13942, Lipoprotein_20, YfhG lipoprotein.  This family includes
           the YfhG protein from E. coli. Members of this family
           have an N-terminal lipoprotein attachment site. The
           members of this family are functionally uncharacterized.
          Length = 180

 Score = 28.8 bits (65), Expect = 2.2
 Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 12/55 (21%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLS 236
           Y++L + S +EL RLRE  ++ + EL                K E  T +ER+LS
Sbjct: 121 YQRLQQSSDSELDRLREQQARLQYELETTT-----------RKLENLTDIERQLS 164


>gnl|CDD|151681 pfam11239, DUF3040, Protein of unknown function (DUF3040).  Some
           members in this family of proteins with unknown function
           are annotated as membrane proteins however this cannot
           be confirmed.
          Length = 82

 Score = 27.6 bits (62), Expect = 2.3
 Identities = 8/19 (42%), Positives = 10/19 (52%)

Query: 219 SLSEQEKRERRTMERKLSE 237
            LSE E+R    +ER L  
Sbjct: 2   PLSEHEQRRLEEIERALYA 20


>gnl|CDD|206035 pfam13864, Enkurin, Calmodulin-binding.  This is a family of
           apparent calmodulin-binding proteins found at high
           levels in the testis and vomeronasal organ and at lower
           levels in certain other tissues. Enkurin is a scaffold
           protein that binds PI3 kinase to sperm transient
           receptor potential (canonical) (TRPC) channels. The
           mammalian transient receptor potential (canonical)
           channels are the primary candidates for the Ca(2+) entry
           pathway activated by the hormones, growth factors, and
           neurotransmitters that exert their effect through
           activation of PLC. Calmodulin binds to the C-terminus of
           all TRPC channels, and dissociation of calmodulin from
           TRPC4 results in profound activation of the channel.
          Length = 98

 Score = 27.9 bits (63), Expect = 2.3
 Identities = 18/68 (26%), Positives = 36/68 (52%), Gaps = 4/68 (5%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQ--VKATYSKNSLSEQEKRERRTMERKLSEMEVE 241
                 S+ E   L   L K  +ELN+   K     ++ S++ ++E   +E++L+E+E +
Sbjct: 32  PGHRLLSEEERLELLNGLKKNWDELNKEYQKLPVVIDTPSKKRRKEE--LEKELAEIEED 89

Query: 242 LKELDQFK 249
           +K+L + K
Sbjct: 90  IKKLSRPK 97


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 29.5 bits (66), Expect = 2.3
 Identities = 27/101 (26%), Positives = 40/101 (39%), Gaps = 19/101 (18%)

Query: 150 ESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKK---LYEESQA---ELSRLRETLSK 203
           ES G N   +RSR +     S+     +N +   K+   L     A   E   L+E LSK
Sbjct: 224 ESLGSNFVDNRSRRSPSPASSAFHSSPQNLQKAQKENEFLTARLLAMEDENKTLKEALSK 283

Query: 204 TEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
              EL   +  Y+K +              KLS +E +L+ 
Sbjct: 284 RNSELQFSRNMYAKTA-------------SKLSVLEAQLES 311


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 29.3 bits (66), Expect = 2.4
 Identities = 14/61 (22%), Positives = 33/61 (54%), Gaps = 3/61 (4%)

Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
           K + + Q EL  LR+  +K  +E+ ++K        + +   E + ++++++E E E++E
Sbjct: 40  KEWRQRQFELDSLRKEFNKLNKEVAKLKI---AKEDATELIAETKELKKEITEKEAEVQE 96

Query: 245 L 245
            
Sbjct: 97  A 97


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 29.4 bits (67), Expect = 2.4
 Identities = 21/64 (32%), Positives = 33/64 (51%), Gaps = 13/64 (20%)

Query: 195 SRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQ- 253
           S+ +E +   +E + +V      NSLS +E+RER      L E+  EL E ++ KKE + 
Sbjct: 42  SKAKEIIPIVKEVVEEV------NSLSLEEQRER------LEELAPELLEEEEEKKEEKK 89

Query: 254 RLKD 257
            L  
Sbjct: 90  GLPP 93


>gnl|CDD|131935 TIGR02889, spore_YpeB, germination protein YpeB.  Members of this
           family are YpeB, a protein usually encoded with the
           putative spore-cortex-lytic enzyme SleB and required,
           together with SleB, for normal germination. This family
           is retricted to endospore-forming species in the
           Firmicutes lineage of bacteria, and found in all such
           species to date except Clostridium perfringens. The
           matching phenotypes of mutants in SleB (called a lytic
           transglycosylase) and YpeB suggests that YpeB is
           necessary to allow SleB to function [Cellular processes,
           Sporulation and germination].
          Length = 435

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 14/80 (17%), Positives = 29/80 (36%), Gaps = 6/80 (7%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSL------SEQEKRERRT 230
              + + K L ++    L+ L     K E +L +V+    +  +            E + 
Sbjct: 113 SVKDAEGKSLSDKEYKTLTTLYNQAVKLENQLRKVQNIVMQGGVRWGEIRKLYSGDEAQM 172

Query: 231 MERKLSEMEVELKELDQFKK 250
            E  L++ +   K + QF  
Sbjct: 173 PEAILNDFKDVEKNVTQFPT 192


>gnl|CDD|226655 COG4192, COG4192, Signal transduction histidine kinase regulating
           phosphoglycerate transport system [Signal transduction
           mechanisms].
          Length = 673

 Score = 29.1 bits (65), Expect = 2.6
 Identities = 13/51 (25%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 193 ELSRLRETLSKTEEELN-QVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
           EL R+ + L    + +N + ++   +N   + E  ER+  E  L + + EL
Sbjct: 392 ELGRIAQLLRLLLQRVNEKTQSLRERNQELQAEVEERQEAEAHLKKTQGEL 442


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 29.0 bits (65), Expect = 2.7
 Identities = 27/112 (24%), Positives = 55/112 (49%), Gaps = 12/112 (10%)

Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK--------NSLS 221
           SS +    +GE   + L +ES ++ +   + + + + EL Q++A  S         +SLS
Sbjct: 5   SSHRKVAADGEEREESLLQESASKEAYYLQRILELQAELKQLRAELSNVQAENERLSSLS 64

Query: 222 EQEKRERRTMERKLSEMEVELKELD----QFKKENQRLKDENGALIRVISKL 269
           ++ K E   +E +   +  E+KE      +  ++   L++EN +L + +S L
Sbjct: 65  QELKEENEMLELQRGRLRDEIKEYKFREARLLQDYSELEEENISLQKQVSVL 116


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 28.9 bits (65), Expect = 2.7
 Identities = 15/79 (18%), Positives = 38/79 (48%), Gaps = 4/79 (5%)

Query: 184 KKLYEESQAELSRLRETLSKTEE---ELNQVKATYSKNSL-SEQEKRERRTMERKLSEME 239
           ++L  E+Q     + +   + +E   E +++   + + S   ++   E R ++ +L E+E
Sbjct: 182 QELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELE 241

Query: 240 VELKELDQFKKENQRLKDE 258
            ++K L   +K  +R +  
Sbjct: 242 KKIKALRAKEKAAKRREKR 260


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 29/100 (29%), Positives = 52/100 (52%), Gaps = 6/100 (6%)

Query: 162 RVNSESGGSSRKLPNENGELDYKKLYE---ESQAELSRLRETLSKTEEELNQVKATYSKN 218
           RV S+  GS  +L  +  E + K L+E     ++EL+ L E + + EE+  Q + T  + 
Sbjct: 180 RVLSDQRGSLDQLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEA 239

Query: 219 SLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
               +E  ERR    +L  +E E+++L +   E +R ++E
Sbjct: 240 DEVLEEHEERRE---ELETLEAEIEDLRETIAETEREREE 276



 Score = 28.9 bits (65), Expect = 3.5
 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 7/58 (12%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR-ERRTMERKLSEMEVELKE 244
           EE +AEL  L E + + EE L + +       L E E R ER    R+  E  +  + 
Sbjct: 478 EELEAELEDLEEEVEEVEERLERAE------DLVEAEDRIERLEERREDLEELIAERR 529



 Score = 28.5 bits (64), Expect = 4.3
 Identities = 20/76 (26%), Positives = 38/76 (50%), Gaps = 15/76 (19%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLSEMEVELKELD 246
           EE++ +  R  E L + EE+L               E RE R  ++ ++  +E EL+EL+
Sbjct: 649 EEAREDKERAEEYLEQVEEKL--------------DELREERDDLQAEIGAVENELEELE 694

Query: 247 QFKKENQRLKDENGAL 262
           + ++  + L++   AL
Sbjct: 695 ELRERREALENRVEAL 710



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 10/94 (10%)

Query: 173 KLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK--------NSLSEQE 224
           +L  E  EL+     EE++  +   RE + + EEE+ +++  +              E+ 
Sbjct: 360 ELREEAAELE--SELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEEL 417

Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
           + ER  +  + +E+E  L+   +  +E + L + 
Sbjct: 418 REERDELREREAELEATLRTARERVEEAEALLEA 451


>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H).  This family
           consists of several eukaryotic centromere protein H
           (CENP-H) sequences. Macromolecular
           centromere-kinetochore complex plays a critical role in
           sister chromatid separation, but its complete protein
           composition as well as its precise dynamic function
           during mitosis has not yet been clearly determined.
           CENP-H contains a coiled-coil structure and a nuclear
           localisation signal. CENP-H is specifically and
           constitutively localised in kinetochores throughout the
           cell cycle. CENP-H may play a role in kinetochore
           organisation and function throughout the cell cycle.
           This the C-terminus of the region, which is conserved
           from fungi to humans.
          Length = 106

 Score = 27.7 bits (62), Expect = 2.7
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 232 ERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
                      +EL   +KE  +LK +N  L   + +L+K
Sbjct: 6   LNLQQASRETREELSDLEKERLQLKQKNVELALELLELTK 45


>gnl|CDD|131264 TIGR02209, ftsL_broad, cell division protein FtsL.  This model
           represents FtsL, both forms similar to that in E. coli
           and similar to that in B. subtilis. FtsL is one of the
           later proteins active in cell division septum formation.
           FtsL is small, low in complexity, and highly divergent.
           The scope of this model is broader than that of the
           pfam04999.3 for FtsL, as this one includes FtsL from
           Bacillus subtilis and related species [Cellular
           processes, Cell division].
          Length = 85

 Score = 27.2 bits (61), Expect = 2.9
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 244 ELDQFKKENQRLKDENGALIRVISKLSK 271
           EL + + E  +L+ E   L   +++LS+
Sbjct: 32  ELQKLQLEIDKLQKEWRDLQLEVAELSR 59


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 17/95 (17%), Positives = 36/95 (37%), Gaps = 10/95 (10%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNS-LSEQEKRERRTMERK- 234
              +   ++L E+ +     L E L + E+ L++V+     NS         +  +    
Sbjct: 500 GEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLA 559

Query: 235 -------LSEMEVELKELDQFKKENQRLKDENGAL 262
                    E +VE  +  +F K+ ++   + GA 
Sbjct: 560 HMGLPVPAEEAKVEAVDEVKFLKK-KKGILDAGAF 593


>gnl|CDD|237616 PRK14127, PRK14127, cell division protein GpsB; Provisional.
          Length = 109

 Score = 27.7 bits (62), Expect = 3.0
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 242 LKELDQFKKENQRLKDENGALIRVISKLSK 271
           +K+ + F+KE + L+ EN  L   + +L+K
Sbjct: 36  IKDYEAFQKEIEELQQENARLKAQVDELTK 65


>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
           complex, found in both eukaryotes and archaea, that
           binds and stabilizes newly synthesized polypeptides
           allowing them to fold correctly.  The complex contains
           two alpha and four beta subunits, the two subunits being
           evolutionarily related. In archaea, there is usually
           only one gene for each subunit while in eukaryotes there
           two or more paralogous genes encoding each subunit
           adding heterogeneity to the structure of the hexamer.
           The structure of the complex consists of a double beta
           barrel assembly with six protruding coiled-coils.
          Length = 129

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 5/37 (13%), Positives = 22/37 (59%)

Query: 222 EQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
           ++   + + ++++L  ++ +L++L+    E ++ K+ 
Sbjct: 2   QELAAQLQQLQQQLEALQQQLQKLEAQLTEYEKAKET 38


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 29.3 bits (65), Expect = 3.0
 Identities = 47/237 (19%), Positives = 87/237 (36%), Gaps = 36/237 (15%)

Query: 3   YSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSY--PSYGSTYGG 60
           YS P   + G   +      S++P         Y++  +S+TPY N     P Y +    
Sbjct: 414 YSNPGNSNPGYNNAP----NSNTP---------YNNPPNSNTPYSNPPNSNPPYSNL--- 457

Query: 61  TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSS---YRSSYAPTTTPLPPVRPRRSKSSTRS 117
                        YS    S+    +APPSS   + S+Y          +P  +  +   
Sbjct: 458 ------------PYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQ 505

Query: 118 ANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNE 177
                  G   +  G+  +     S AP  G  +   + +    R +   GG++R+   +
Sbjct: 506 PAANNFHGAAGNSVGNPFASRPFGS-APYGGNAATTADPNGIAKREDHPEGGTNRQKYEQ 564

Query: 178 NGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERK 234
           + E   +    E+ +E     E   K EE  + +K T ++  +++  +    T E+K
Sbjct: 565 SDEESVESSSSENSSE--NENEVTDKGEEIYSLLKKTINRIDMNKIPRPIINTQEKK 619


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 29.1 bits (66), Expect = 3.1
 Identities = 24/123 (19%), Positives = 46/123 (37%), Gaps = 25/123 (20%)

Query: 143 EAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRL----- 197
           E P     +    + T + ++  E     +           + L E  Q EL  L     
Sbjct: 138 EDPENLLHALQQEVLTLKQQL--ELQAREKA--------QSQALAEAQQQELVALEGLAA 187

Query: 198 --RETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRL 255
              E   + E +L Q++   ++ S   ++KR+  T +   +   +EL E     +E + L
Sbjct: 188 ELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQ---AAKRLELSE-----EETRIL 239

Query: 256 KDE 258
            D+
Sbjct: 240 IDQ 242


>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
          Length = 174

 Score = 28.4 bits (64), Expect = 3.2
 Identities = 22/81 (27%), Positives = 41/81 (50%), Gaps = 10/81 (12%)

Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEK-----RERRTMERKLSEMEVELK 243
           +++AE  R  E   +  E++ ++K T + +  +EQE+     R R   E  +++ E +L 
Sbjct: 95  KARAEAIRA-EIEKQAIEDMARLKQTAAADLSAEQERVIAQLR-REAAELAIAKAESQLP 152

Query: 244 -ELDQFKKENQRLKDENGALI 263
             LD+     QRL D + A +
Sbjct: 153 GRLDE--DAQQRLIDRSIANL 171


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 29.1 bits (65), Expect = 3.2
 Identities = 31/117 (26%), Positives = 45/117 (38%), Gaps = 6/117 (5%)

Query: 3   YSTPPRRSSGRTTSRPRSL-YSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGT 61
           +S   R SS  T     +   SSS +  S    S  S   S+    NSS  S+ S  G  
Sbjct: 296 HSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSV 355

Query: 62  SSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSA 118
            S     ++  + S+  ++S  S  A  S   +S       +    P  +K ST SA
Sbjct: 356 PSS----SVSDNESKQKRASKSSSGARDSKKDASGMSANGTVENCIP-ENKISTPSA 407


>gnl|CDD|223022 PHA03248, PHA03248, DNA packaging tegument protein UL25;
           Provisional.
          Length = 583

 Score = 28.8 bits (65), Expect = 3.4
 Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 3/32 (9%)

Query: 111 SKSSTRSANYTPEGGGRMSRSGSTTSLNSLRS 142
              + RS +Y     GRMSR+  TT++ SL+S
Sbjct: 194 RPLAARSIDY---RDGRMSRTFMTTAITSLQS 222


>gnl|CDD|224227 COG1308, EGD2, Transcription factor homologous to NACalpha-BTF3
           [Transcription].
          Length = 122

 Score = 27.7 bits (62), Expect = 3.4
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 228 RRTMERKLSEMEVELKELDQFKKENQRLKDEN 259
            R +++ + +M ++++ELD  ++   +LKD  
Sbjct: 8   PRKLKKLMKQMGIDVEELDGVERVIIKLKDTE 39


>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
           (MRP).  This model describes multi drug
           resistance-associated protein (MRP) in eukaryotes. The
           multidrug resistance-associated protein is an integral
           membrane protein that causes multidrug resistance when
           overexpressed in mammalian cells. It belongs to ABC
           transporter superfamily. The protein topology and
           function was experimentally demonstrated by epitope
           tagging and immunofluorescence. Insertion of tags in the
           critical regions associated with drug efflux, abrogated
           its function. The C-terminal domain seem to highly
           conserved [Transport and binding proteins, Other].
          Length = 1522

 Score = 29.1 bits (65), Expect = 3.4
 Identities = 32/139 (23%), Positives = 48/139 (34%), Gaps = 27/139 (19%)

Query: 115 TRSANYTPE-------GGGRMSRSGSTTSL--------NSLRSEAPSEGYESGGDNLSTS 159
           T   +Y P+        GG++S  GS   L          LR+ AP E            
Sbjct: 819 THGISYLPQVDVIIVMSGGKISEMGSYQELLQRDGAFAEFLRTYAPDE------QQGHLE 872

Query: 160 RSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLR---ETLSK---TEEELNQVKA 213
            S     SG        ENG L    + ++ Q +LS         S+   +  EL + +A
Sbjct: 873 DSWTALVSGEGKEAKLIENGMLVTDVVGKQLQRQLSASSSDSGDQSRHHGSSAELQKAEA 932

Query: 214 TYSKNSLSEQEKRERRTME 232
                 L E +K +   +E
Sbjct: 933 KEETWKLMEADKAQTGQVE 951


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 28.5 bits (64), Expect = 3.5
 Identities = 17/72 (23%), Positives = 30/72 (41%), Gaps = 7/72 (9%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
           KK     +A+L +L+        ++ ++     +  L E E       +R   E+   LK
Sbjct: 123 KKDLASKKAQLEKLKAAPGIKPAKVEEL-----EEELEEAESALEEARKR-YEEISERLK 176

Query: 244 -ELDQFKKENQR 254
            EL +F +E  R
Sbjct: 177 EELKRFHEERAR 188


>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
          Length = 635

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL- 242
           KKL  + Q EL +L + L   E E+  ++A  +      Q   + + +   L++ E EL 
Sbjct: 560 KKLSYKLQRELEQLPQLLEDLEAEIEALQAQVADADFFSQPHEQTQKVLADLADAEQELE 619

Query: 243 ------KELDQFKKEN 252
                 +EL+  K   
Sbjct: 620 VAFERWEELEALKNGG 635



 Score = 28.4 bits (64), Expect = 4.4
 Identities = 18/58 (31%), Positives = 30/58 (51%), Gaps = 3/58 (5%)

Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNS---LSEQEKRERRTMERKLSEMEVELKEL 245
           QA+   L++   K +EE    KA   K S   LS + +RE   + + L ++E E++ L
Sbjct: 530 QAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAEIEAL 587


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 28.8 bits (64), Expect = 3.5
 Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 7/94 (7%)

Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRER 228
           S +L +   E+           EL   +  L K  EE   ++      K  L E+EK+E 
Sbjct: 238 SAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQEL-EEEKKEL 296

Query: 229 RTMERKLSEMEVEL----KELDQFKKENQRLKDE 258
             +  K  E E        EL++ K E +    E
Sbjct: 297 EELREKEGEAEEAASSLEAELNRTKSEKESKARE 330


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 27.6 bits (62), Expect = 3.6
 Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 7/46 (15%)

Query: 223 QEKRERRTMERKLSEMEVE------LKELDQFKKENQRLKDENGAL 262
             K  ++ M R     EV+      +K+ +   KEN+ LK+E   L
Sbjct: 7   LNKEFKKKM-RGYDPDEVDEFLDQVIKDYEALYKENEELKEEIERL 51



 Score = 26.9 bits (60), Expect = 8.0
 Identities = 23/103 (22%), Positives = 39/103 (37%), Gaps = 32/103 (31%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELN---------------------QVKATYSK--- 217
           DY+ LY+E++     L+E + + EE+L                      +VKA   K   
Sbjct: 33  DYEALYKENE----ELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAE 88

Query: 218 NSLSEQEKRERRTME---RKLSEMEVELKEL-DQFKKENQRLK 256
             + E E +  R +     +  ++  E  EL  Q +    R K
Sbjct: 89  LIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFRTRFK 131


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 28.5 bits (64), Expect = 3.7
 Identities = 21/83 (25%), Positives = 41/83 (49%), Gaps = 6/83 (7%)

Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQF 248
           E +     L ++LSK +    +++    +  L+E  K E + +E+ L +++   K  +Q 
Sbjct: 2   EVKKAARLLNQSLSKLDTPEEKLETLIKR--LAELLK-EMKRLEKDLKKLQ---KRGEQL 55

Query: 249 KKENQRLKDENGALIRVISKLSK 271
           +KE  +L+ E+   I   SKL  
Sbjct: 56  QKEKDKLQSEHSKAILARSKLES 78


>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
           and chromosome partitioning].
          Length = 212

 Score = 28.2 bits (63), Expect = 3.7
 Identities = 15/102 (14%), Positives = 33/102 (32%), Gaps = 2/102 (1%)

Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR 229
           + R L +     + + L  ++Q E  RLR  +        Q        S  ++ +    
Sbjct: 112 ADRYLEDAKIAQELEDLKRQAQVERQRLRSDIEAQLASAKQEDWDEILRSTVDEVEAANE 171

Query: 230 TMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
             ER     + +   L    + +  +  +   L   +S   +
Sbjct: 172 EAERLADAAQADADRL--RDECDIYVDTKLAELETRLSGTRR 211


>gnl|CDD|220373 pfam09738, DUF2051, Double stranded RNA binding protein (DUF2051). 
           This is a novel protein identified as interacting with
           the leucine-rich repeat domain of human flightless-I,
           FliI protein.
          Length = 294

 Score = 28.3 bits (63), Expect = 4.0
 Identities = 26/129 (20%), Positives = 53/129 (41%), Gaps = 18/129 (13%)

Query: 147 EGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEE 206
           +       + S      NS  G SSR+   +           +++A +  L+++L + EE
Sbjct: 37  DFDRKDSSSTSAVSIATNSLGGSSSRRSSGDTSSS------VDTEASIRELKDSLIEVEE 90

Query: 207 ELNQVKATYS-----KNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENGA 261
           ++ +   + +     K +L      E   ++ KL E+E   + L + ++E +    +  A
Sbjct: 91  KVKKAMVSNAQLDNEKTNL----MYEVDLLKDKLEELE---ESLAELQREFEEKTRDLEA 143

Query: 262 LIRVISKLS 270
           L R I  L 
Sbjct: 144 LKRAIDILQ 152


>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
           Reviewed.
          Length = 110

 Score = 27.1 bits (61), Expect = 4.0
 Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 16/57 (28%)

Query: 231 MERK-----LSEME----VELKELDQFKK-------ENQRLKDENGALIRVISKLSK 271
           M++K     L ++E    V LKEL   KK       EN  L+ EN  L   + +L  
Sbjct: 1   MDKKEIFDALDDLEQNLGVLLKELGALKKQLAELLEENTALRLENDKLRERLEELEA 57


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 27.2 bits (61), Expect = 4.0
 Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 12/59 (20%)

Query: 197 LRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRL 255
           +++   + +EEL + K T  K           +T+E++L ++E   KEL++ K+E  +L
Sbjct: 56  VKQDKEEVKEELEERKETLEKEI---------KTLEKQLEKLE---KELEELKEELYKL 102


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 28.2 bits (63), Expect = 4.1
 Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 9/82 (10%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
             +Q EL    ET  + EE+L Q +    +  L E++  E     R+L E      E   
Sbjct: 22  RRAQKELEEYEETALELEEKLKQEE---EEAQLLEKKADELEEENRRLEE------EAAA 72

Query: 248 FKKENQRLKDENGALIRVISKL 269
            ++E +RL+ E       ++KL
Sbjct: 73  SEEERERLEAEVDEATAEVAKL 94


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
           This model represents the MFP (membrane fusion protein)
           component of the RND family of transporters. RND refers
           to Resistance, Nodulation, and cell Division. It is, in
           part, a subfamily of pfam00529 (Pfam release 7.5) but
           hits substantial numbers of proteins missed by that
           model. The related HlyD secretion protein, for which
           pfam00529 is named, is outside the scope of this model.
           Attributed functions imply outward transport. These
           functions include nodulation, acriflavin resistance,
           heavy metal efflux, and multidrug resistance proteins.
           Most members of this family are found in Gram-negative
           bacteria. The proposed function of MFP proteins is to
           bring the inner and outer membranes together and enable
           transport to the outside of the outer membrane. Note,
           however, that a few members of this family are found in
           Gram-positive bacteria, where there is no outer membrane
           [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 28.4 bits (64), Expect = 4.1
 Identities = 10/71 (14%), Positives = 29/71 (40%), Gaps = 1/71 (1%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE-KRERRTMERKLSEMEV 240
           DY+   + + A+L+     L   +    + +    +N++S+ +    +  +E   +++E 
Sbjct: 61  DYQLALQAALAQLAAAEAQLELAQRSFERAERLVKRNAVSQADLDDAKAAVEAAQADLEA 120

Query: 241 ELKELDQFKKE 251
               L   +  
Sbjct: 121 AKASLASAQLN 131


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 28.6 bits (64), Expect = 4.2
 Identities = 24/85 (28%), Positives = 33/85 (38%), Gaps = 6/85 (7%)

Query: 76  RPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPR---RSKSSTRSANYTPEGGGRMSRSG 132
           R   SSPRS   P +    S    TT +PP       RS + + SAN +        RSG
Sbjct: 189 RRPSSSPRSLSNPTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSG 248

Query: 133 ST---TSLNSLRSEAPSEGYESGGD 154
            +   T  +   S  P  G +   +
Sbjct: 249 ESFRSTPTSGSSSINPVSGLDEAEE 273


>gnl|CDD|236745 PRK10722, PRK10722, hypothetical protein; Provisional.
          Length = 247

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 12/55 (21%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLS 236
           Y+KL + S +EL  LR+   + + +L                K E  T +ER+LS
Sbjct: 167 YQKLQQSSDSELDALRQQQQRLQYQLELT-----------TRKLENLTDIERQLS 210


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 188 EESQAELSRLRETLSKTEEELNQVK----ATYSK-NSLSEQEKRERRTMERKLSEMEVEL 242
            + + E  R RE L + EE L Q +    A   K ++L  Q +   + +  +  E+E   
Sbjct: 66  NQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELE 125

Query: 243 KELDQFKKE 251
           K+LD     
Sbjct: 126 KQLDNELYR 134


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 7/37 (18%), Positives = 22/37 (59%)

Query: 222 EQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
           ++EK  +   E++  +  V+ +E ++  +E ++ K++
Sbjct: 25  KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 28.4 bits (63), Expect = 4.6
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 4/91 (4%)

Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA 213
            NL    + +         +L   N + +  K  EE++ EL   R  + K E+ELN +  
Sbjct: 490 KNLKHDINELTQILEKLELELSEANSKFELSK--EENERELVAQRIEIEKLEKELNDLNL 547

Query: 214 TYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
             SK S+ + E+  + T E KL E++V+L  
Sbjct: 548 L-SKTSILDAEQLVQST-EIKLDELKVDLNR 576


>gnl|CDD|216525 pfam01486, K-box, K-box region.  The K-box region is commonly found
           associated with SRF-type transcription factors see
           pfam00319. The K-box is a possible coiled-coil
           structure. Possible role in multimer formation.
          Length = 100

 Score = 26.7 bits (60), Expect = 4.7
 Identities = 20/85 (23%), Positives = 43/85 (50%), Gaps = 9/85 (10%)

Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR--ERRTMERKLS-----EME 239
           YE  Q E ++L++     + E   +       SLS +E +  E++ +E  L      + E
Sbjct: 14  YENLQQEAAKLKKQNENLQREQRHLLGE-DLGSLSLKELQQLEQQ-LENGLKHIRSRKNE 71

Query: 240 VELKELDQFKKENQRLKDENGALIR 264
           + L ++++ +K+ + L++EN  L +
Sbjct: 72  LLLNQIEELQKKERELQEENRNLRQ 96


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 27.1 bits (60), Expect = 4.9
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERK-LSEMEVELKELD 246
           EE++  + + +E   K EEE+ + KA      L E++K+E     +K L E E E++EL 
Sbjct: 24  EEAKKIIRKAKEEAKKIEEEIIK-KAEEEAQKLIEKKKKEGEEEAKKILEEGEKEIEELK 82

Query: 247 QFKKEN 252
              +EN
Sbjct: 83  VKAEEN 88


>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353). 
           Members of this family of uncharacterized proteins have
           no known function.
          Length = 319

 Score = 28.3 bits (63), Expect = 4.9
 Identities = 26/133 (19%), Positives = 50/133 (37%), Gaps = 15/133 (11%)

Query: 141 RSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGEL-----DYKKLYEESQAELS 195
           +SE   E   +   ++         +      K   EN +L     + ++   E Q ++ 
Sbjct: 44  KSERELELVAACQGDMKIKPGTKRQDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIK 103

Query: 196 RLRETLSKTEEELNQVKATYSKNS----------LSEQEKRERRTMERKLSEMEVELKEL 245
            LR+T+++   E        S             L E+ K+    +   L E E  L E 
Sbjct: 104 LLRQTIAQQRVEFMGGGVRESFEERNDLVSQLEKLREKCKQLESDLRSLLDEKEEVLMER 163

Query: 246 DQFKKENQRLKDE 258
           D+++++  RL +E
Sbjct: 164 DEYQRKAHRLNNE 176


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 13/72 (18%), Positives = 25/72 (34%), Gaps = 2/72 (2%)

Query: 182 DYKKLYEESQ--AELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
                    Q   EL+     L   +  L Q +      S+    K    + +  L +M+
Sbjct: 627 LQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQ 686

Query: 240 VELKELDQFKKE 251
            E ++L  +K+ 
Sbjct: 687 SEKEQLTYWKEM 698


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 27.7 bits (62), Expect = 5.1
 Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 2/80 (2%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
           + G     KL EE QA   + RE   +  EE  +++         E+E  E R  +++  
Sbjct: 2   KIGAKKRAKL-EEKQAR-RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59

Query: 237 EMEVELKELDQFKKENQRLK 256
           E +   ++  + ++E ++LK
Sbjct: 60  ERKEREEQARKEQEEYEKLK 79


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 28.1 bits (62), Expect = 5.1
 Identities = 28/123 (22%), Positives = 38/123 (30%), Gaps = 10/123 (8%)

Query: 4   STPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSS 63
           ST P  + G TTS       S+P PQ+    S   G S S  +  S+     ST G    
Sbjct: 215 STEPVGTQGTTTS-------SNPEPQTEPPPS-QRGPSGSPQHPPSTTSQDQSTTGDGQE 266

Query: 64  GYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPL--PPVRPRRSKSSTRSANYT 121
                  P + S        +   P +  + +  PT  P            SS       
Sbjct: 267 HTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQAN 326

Query: 122 PEG 124
           P  
Sbjct: 327 PTT 329


>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
           secretion / Intracellular trafficking and secretion /
           Posttranslational modification, protein turnover,
           chaperones].
          Length = 148

 Score = 27.3 bits (61), Expect = 5.3
 Identities = 15/68 (22%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKK 250
           Q  +S+L   + + + +L++++    +     QEK+       KL E +     L++ ++
Sbjct: 70  QQFISQLEVAIDQQQSQLSKLRKQVEQKREIWQEKQIELKALEKLKERQKTEFLLEENRR 129

Query: 251 ENQRLKDE 258
           E Q++ DE
Sbjct: 130 E-QKIMDE 136


>gnl|CDD|176962 CHL00019, atpF, ATP synthase CF0 B subunit.
          Length = 184

 Score = 27.5 bits (62), Expect = 5.4
 Identities = 23/97 (23%), Positives = 45/97 (46%), Gaps = 19/97 (19%)

Query: 186 LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKEL 245
             EE+  +L + R  L + E E ++++     N  SE E RE+  +   +++ + +L+ L
Sbjct: 70  RREEAIEKLEKARARLRQAELEADEIR----VNGYSEIE-REKENL---INQAKEDLERL 121

Query: 246 DQFKKEN-----QRLKDE------NGALIRVISKLSK 271
           + +K E      QR  ++        AL R +  L+ 
Sbjct: 122 ENYKNETIRFEQQRAINQVRQQVFQLALQRALGTLNS 158


>gnl|CDD|227569 COG5244, NIP100, Dynactin complex subunit involved in mitotic
           spindle partitioning in anaphase B [Cell division and
           chromosome partitioning].
          Length = 669

 Score = 28.1 bits (62), Expect = 5.4
 Identities = 17/98 (17%), Positives = 30/98 (30%), Gaps = 22/98 (22%)

Query: 165 SESGGSSRKLPNENGELDYK-----KLYEESQAELSRLRETLSKTEEELNQVKATYSKNS 219
            E       L   NG LD          E  + +L  L                  S + 
Sbjct: 211 CERSNIHDVLFLVNGILDGVIDELNGELERLRRQLVSLM-----------------SSHG 253

Query: 220 LSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKD 257
           +  +E    +    K   +E+++  L +   +N+ LK 
Sbjct: 254 IEVEENSRLKATLEKFQSLELKVNTLQEELYQNKLLKK 291



 Score = 27.7 bits (61), Expect = 8.9
 Identities = 42/183 (22%), Positives = 61/183 (33%), Gaps = 27/183 (14%)

Query: 106 VRPRRSKSSTRSANYTPEGGGRMSRSGSTTSLNSLRSEA---PSEGYESGGDNLSTSRSR 162
           +RP        +A Y    GG +  S        ++ E         ES    L  +   
Sbjct: 64  IRPDDDSLLNGNAAYEKIKGGLVCESKGMDKDGEIKQENHEDRIHFEESKIRRLEETIEA 123

Query: 163 VNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSE 222
           + S       +L  EN ELD   L          LRE +S  E ELN+  +  S + L E
Sbjct: 124 LKSTEKEEIVELRRENEELDKINL---------SLRERISSEEPELNKDGSKLSYDELKE 174

Query: 223 QEKRERRTMERKLSEMEVELKELDQFKKENQR--------------LKDENGALIRVISK 268
             +  R      + E+  ++ E        QR              L   NG L  VI +
Sbjct: 175 FVEESRV-QVYDMVELVSDISETLNRNGSIQRSSVRECERSNIHDVLFLVNGILDGVIDE 233

Query: 269 LSK 271
           L+ 
Sbjct: 234 LNG 236


>gnl|CDD|227453 COG5124, COG5124, Protein predicted to be involved in meiotic
           recombination [Cell division and chromosome partitioning
           / General function prediction only].
          Length = 209

 Score = 27.6 bits (61), Expect = 5.5
 Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 5/66 (7%)

Query: 199 ETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL-----KELDQFKKENQ 253
           E L K  +E+ Q  ATY +    E+  R ++  E + +     L     KE D+ KK+  
Sbjct: 85  ELLKKKIQEVKQDIATYKEEIDKEKATRRKKFTEGQKNYNREALLEKRKKEQDEIKKKLN 144

Query: 254 RLKDEN 259
            L+   
Sbjct: 145 SLQKIE 150


>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 571 and 1310 amino
           acids in length. There are two conserved sequence
           motifs: LPA and ELH. EVC2 is implicated in Ellis van
           Creveld chondrodysplastic dwarfism in humans. Mutations
           in this protein can give rise to this congenital
           condition. LIMBIN is a protein which shares around 80%
           sequence homology with EVC2 and it is implicated in a
           similar condition in bovine chondrodysplastic dwarfism.
          Length = 429

 Score = 27.9 bits (62), Expect = 6.0
 Identities = 19/85 (22%), Positives = 29/85 (34%), Gaps = 6/85 (7%)

Query: 161 SRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSL 220
           +R   E+         E  E   K+  E    E S L +TL   E+E  Q      +   
Sbjct: 230 TRKKMEAQHQREMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQRSLLLQQEED 289

Query: 221 SEQEKRERRTMERKLSEMEVELKEL 245
             +  R+    +R      VEL  +
Sbjct: 290 FAKAHRQLAVFQR------VELHSI 308


>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
          Length = 796

 Score = 28.3 bits (63), Expect = 6.1
 Identities = 21/100 (21%), Positives = 32/100 (32%), Gaps = 20/100 (20%)

Query: 55  GSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSS 114
           G +     S     +L SS  + S  S RS     S+ RS            R    K S
Sbjct: 1   GRSSSTMVSAAASTSLSSSRPQLSSFSSRSP---QSATRSP-----------RASSIKCS 46

Query: 115 TRSANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGD 154
             ++  +       + S S + + +      S    SGG 
Sbjct: 47  ASASASSS------ATSSSASLVANGAVALLSASAISGGG 80


>gnl|CDD|233245 TIGR01042, V-ATPase_V1_A, V-type (H+)-ATPase V1, A subunit.  This
           models eukaryotic vacuolar (H+)-ATPase that is
           responsible for acidifying cellular compartments. This
           enzyme shares extensive sequence similarity with
           archaeal ATP synthase [Transport and binding proteins,
           Cations and iron carrying compounds].
          Length = 591

 Score = 28.2 bits (63), Expect = 6.1
 Identities = 14/43 (32%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEK 225
           Y+K Y E     ++ +E L + EE+LN++     K++L+E +K
Sbjct: 448 YEKFYPEFVPLRTKAKEILQE-EEDLNEIVQLVGKDALAETDK 489


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 28.1 bits (62), Expect = 6.2
 Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 6/57 (10%)

Query: 7   PRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSS 63
           P +  G        L SSS        +S  SG  S++ +  S+     +T  G +S
Sbjct: 132 PGKPKGNQGE---GLASSSDGKSK---ASAKSGSKSASKHGESNSSDESATDSGKAS 182


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.0 bits (63), Expect = 6.5
 Identities = 14/82 (17%), Positives = 33/82 (40%), Gaps = 6/82 (7%)

Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE 241
             K+  E+ + +L++    L + + EL  +K    + +         R +E +L++   +
Sbjct: 77  RQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQ 136

Query: 242 LKELDQFKKENQRLKDENGALI 263
           L+           L + N  L+
Sbjct: 137 LQNA------QNDLAEYNSQLV 152


>gnl|CDD|218657 pfam05615, THOC7, Tho complex subunit 7.  The Tho complex is
           involved in transcription elongation and mRNA export
           from the nucleus.
          Length = 132

 Score = 26.9 bits (60), Expect = 6.5
 Identities = 8/40 (20%), Positives = 20/40 (50%)

Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATY 215
           NE    +Y++  EE +  +   ++ + + ++EL + K   
Sbjct: 65  NERERENYEQEKEEIETTIENAKDEIEELKKELEEAKRIR 104


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 27.8 bits (62), Expect = 6.6
 Identities = 15/45 (33%), Positives = 20/45 (44%)

Query: 23  SSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGG 67
           S   +  S   S  S   S S    ++S  S  S+   TSSG+GG
Sbjct: 269 SRRNNNSSYGRSYGSGSPSYSPSSSSNSSSSSSSSSTRTSSGFGG 313


>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
           Fer (Fes related) tyrosine kinases.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Fes (feline sarcoma), also called
           Fps (Fujinami poultry sarcoma), and Fer (Fes related)
           are cytoplasmic (or nonreceptor) tyrosine kinases that
           play roles in haematopoiesis, inflammation and immunity,
           growth factor signaling, cytoskeletal regulation, cell
           migration and adhesion, and the regulation of cell-cell
           interactions. Although Fes and Fer show redundancy in
           their biological functions, they show differences in
           their expression patterns. Fer is ubiquitously expressed
           while Fes is expressed predominantly in myeloid and
           endothelial cells. Fes and Fer contain an N-terminal
           F-BAR domain, an SH2 domain, and a C-terminal catalytic
           kinase domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules. The
           F-BAR domain of Fes is critical in its role in
           microtubule nucleation and bundling.
          Length = 237

 Score = 27.3 bits (61), Expect = 6.9
 Identities = 18/71 (25%), Positives = 35/71 (49%), Gaps = 5/71 (7%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
           KK Y+E   E  ++ E   K  +E+ ++K+ Y K  L E  K  +   E  + +     +
Sbjct: 107 KKAYQE---ERQQIDEQYKKLTDEVEKLKSEYQK--LLEDYKAAKSKFEEAVVKGGRGGR 161

Query: 244 ELDQFKKENQR 254
           +LD+ + + Q+
Sbjct: 162 KLDKARDKYQK 172


>gnl|CDD|149741 pfam08775, ParB, ParB family.  ParB is a component of the par
           system which mediates accurate DNA partition during cell
           division. It recognises A-box and B-box DNA motifs. ParB
           forms an asymmetric dimer with 2 extended
           helix-turn-helix (HTH) motifs that bind to A-boxes. The
           HTH motifs emanate from a beta sheet coiled coil DNA
           binding module. Both DNA binding elements are free to
           rotate around a flexible linker, this enables them to
           bind to complex arrays of A- and B-box elements on
           adjacent DNA arms of the looped partition site.
          Length = 127

 Score = 26.9 bits (60), Expect = 7.0
 Identities = 15/79 (18%), Positives = 36/79 (45%), Gaps = 7/79 (8%)

Query: 182 DYK---KLYEESQAELSRLRETLSKTEEELNQVKATYS----KNSLSEQEKRERRTMERK 234
           DYK   K+ E+ + +   L E +    EE+  + +  S    K  + +   +E + ++ K
Sbjct: 11  DYKLLLKVQEQLEKKNISLDELIDNVSEEIKDIDSELSPDEQKKEIMKLITKELKLLKDK 70

Query: 235 LSEMEVELKELDQFKKENQ 253
             + +  +  L +F  +++
Sbjct: 71  PVKDKAVVTPLAEFDDKDK 89


>gnl|CDD|227369 COG5036, COG5036, SPX domain-containing protein involved in
           vacuolar polyphosphate accumulation [Inorganic ion
           transport and metabolism].
          Length = 509

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 19/97 (19%), Positives = 46/97 (47%), Gaps = 4/97 (4%)

Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM-ERK 234
           NEN +L   +L +   AE  RLR     + +++   + T+ ++   E   + R  + E+ 
Sbjct: 242 NENFDLYNGRLEKLEGAEALRLRWYGKLSPKDIFVERKTHREDWTGESSFKARFQLKEKF 301

Query: 235 LSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
           +++    +K    F+++  +++ E G     + +L +
Sbjct: 302 VNKF---IKGDYTFEEQFAKVRKEGGTTAEDLERLER 335


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 27.7 bits (62), Expect = 7.6
 Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 6/89 (6%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE-- 241
            +L  E Q   + + E L++   ELNQ         L E  ++ R  +E+++ E   E  
Sbjct: 101 AQLSSEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHE-SAEER 159

Query: 242 ---LKELDQFKKENQRLKDENGALIRVIS 267
              L+E+D+   E Q+L  E G L   + 
Sbjct: 160 STLLEEIDRLLGEIQQLAQEAGNLTAALK 188


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 27.6 bits (62), Expect = 7.7
 Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 6/35 (17%)

Query: 228 RRTMERKLSEMEVELKE-LDQFKKEN-----QRLK 256
           R  +E+ +  ++ EL+E L  F+K N     QRL+
Sbjct: 253 RERLEQAIESIKEELEERLKFFEKNNKLLEAQRLE 287


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 27.6 bits (62), Expect = 7.7
 Identities = 13/64 (20%), Positives = 26/64 (40%), Gaps = 6/64 (9%)

Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKK 250
           Q  ++ LR +LS  E E ++++A  ++ + +          E +  E+  EL    Q   
Sbjct: 80  QDSVANLRASLSAAEAERSRLQALLAELAGAGAA------AEGRAGELAQELDSEKQVSA 133

Query: 251 ENQR 254
               
Sbjct: 134 RALA 137


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 27.7 bits (62), Expect = 7.8
 Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 6/77 (7%)

Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSE--QEKRERRTMERKLSEMEVE 241
            K   E++  L +L + LSK  E++   +  +  N L +  QE RE R   R LS+++ +
Sbjct: 229 SKSLPETKQYLKKLSQDLSKALEKI-SSREKHLNNQLEQLVQEYREAR---RTLSQVQEK 284

Query: 242 LKELDQFKKENQRLKDE 258
             +  Q   E  R  +E
Sbjct: 285 YNQASQGVSELTRELNE 301



 Score = 27.4 bits (61), Expect = 8.0
 Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 16/81 (19%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE--- 244
           E+   E    R TLS+ +E+ NQ            +  RE   +  +L +++ E++E   
Sbjct: 265 EQLVQEYREARRTLSQVQEKYNQASQG------VSELTRELNEISEELEQVKQEMEERGA 318

Query: 245 -------LDQFKKENQRLKDE 258
                  L + K+   +LK+E
Sbjct: 319 SMSDGSPLVKIKQAITKLKEE 339


>gnl|CDD|213400 cd12193, B_zip1, basic leucine zipper DNA-binding and
           multimerization region of GCN4 and related proteins.
           Basic leucine zipper (bZIP) transcription factors act in
           networks of homo- and hetero-dimers in the regulation in
           a diverse set of cellular pathways. Classical leucine
           zippers have alpha helices with leucine residues 7 amino
           acids apart, which stabilize dimerization with a
           parallel leucine zipper domain. Dimerization creates a
           pair of basic regions that bind DNA and undergo
           conformational change. GCN4 was identified in
           Saccharomyces cerevisiae from mutations in a deficiency
           in activation with the general amino acid control
           pathway. GCN4 encodes a trans-activator of amino acid
           biosynthetic genes containing 2 acidic activation
           domains and a C-terminal bZIP domain, comprised of a
           basic alpha-helical DNA-binding region and a coiled-coil
           dimerization region.
          Length = 54

 Score = 25.3 bits (56), Expect = 7.8
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 2/48 (4%)

Query: 217 KNSLSEQEKRE--RRTMERKLSEMEVELKELDQFKKENQRLKDENGAL 262
             +      RE  RR+  RK+  M+    ++++   EN  L+DE   L
Sbjct: 2   PRARKRARNREAARRSRARKMERMKQLEDKVEELTSENSDLEDEVDRL 49


>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
           subunit A'' fusion protein; Provisional.
          Length = 509

 Score = 27.9 bits (62), Expect = 7.8
 Identities = 21/90 (23%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEE----LNQVKATYSKNSLSEQEKRERRTME 232
           EN  + Y   +E     ++    TL K E      ++++    + N + +++K   +T+ 
Sbjct: 17  ENVPVYYAVDFE-VPKNITDGYITLKKKEFTYHVIISKIAPYSNSNGIIKKKKPVLKTVL 75

Query: 233 RKLSEMEVELKELDQFKKENQRLKDENGAL 262
              SE ++E  +L +FK+   R+ DEN + 
Sbjct: 76  EIESEEKIEAIDLMEFKRLFGRILDENMSF 105


>gnl|CDD|216102 pfam00756, Esterase, Putative esterase.  This family contains
           Esterase D. However it is not clear if all members of
           the family have the same function. This family is
           related to the pfam00135 family.
          Length = 245

 Score = 27.4 bits (61), Expect = 8.1
 Identities = 12/52 (23%), Positives = 21/52 (40%), Gaps = 7/52 (13%)

Query: 45  PYLNSSYPSY-------GSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPP 89
           P L++++P+        G + GG  + Y  L  P  +   S  SP    +  
Sbjct: 98  PLLDANFPTAPDGRALAGQSMGGLGALYLALKYPDLFGSVSSFSPILNPSNS 149


>gnl|CDD|191268 pfam05405, Mt_ATP-synt_B, Mitochondrial ATP synthase B chain
           precursor (ATP-synt_B).  The Fo sector of the ATP
           synthase is a membrane bound complex which mediates
           proton transport. It is composed of nine different
           polypeptide subunits (a, b, c, d, e, f, g F6, A6L).
          Length = 163

 Score = 26.9 bits (60), Expect = 8.3
 Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 6/56 (10%)

Query: 197 LRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKEN 252
           L + + K ++ELNQ +       L E+  +ER    +KL  +  E K L +  KE 
Sbjct: 42  LDKRIEKIQDELNQSRN------LHEKALKERIEYVKKLQSLVEETKVLFEVSKET 91


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 27.5 bits (61), Expect = 8.5
 Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 6/78 (7%)

Query: 194 LSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQ 253
            + L+  L + +EE  +++A        +  + +   +E++L+E++    EL     E+Q
Sbjct: 241 YNLLQSQLEQLQEENFRLEAA------RDDYRIKCEELEKELAELQHRNDELTSLAAESQ 294

Query: 254 RLKDENGALIRVISKLSK 271
            LKDE   L     K  K
Sbjct: 295 ALKDEIDVLRESSDKAKK 312


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 27.3 bits (61), Expect = 8.7
 Identities = 23/103 (22%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 171 SRKLPNENGELDYKKLYEESQAELS----------RLRETLSKTEEELNQVKATYSKNSL 220
            RKL   N +L  + L     +               ++ LS+ EE L   +   SK  +
Sbjct: 3   DRKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEE-LQAKRNELSK-QI 60

Query: 221 SEQEKRERRTME---RKLSEMEVELKELD-QFKKENQRLKDEN 259
            + + +++  +E   ++L E++ EL EL    K     L+D+ 
Sbjct: 61  GKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKL 103


>gnl|CDD|206365 pfam14197, Cep57_CLD_2, Centrosome localisation domain of PPC89.
           The N-terminal region of the fission yeast spindle pole
           body protein PPC89 has low similarity to the human Cep57
           protein. The CLD or centrosome localisation domain of
           Cep57 and PPC89 is found at the N-terminus. This region
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57
           and PPC89.
          Length = 69

 Score = 25.3 bits (56), Expect = 8.7
 Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 10/68 (14%)

Query: 196 RLRETLSKTEEELNQVKATYSKNSLSEQEK----RERRTMERKLSEMEVELKELDQFKKE 251
           RL          L+       K ++ E E     RER +  R+L    V   E  + K E
Sbjct: 2   RLEAENLTLRNRLDSAT---RKVAVHEIELKRLNRERDSALRQLG---VAYLECQELKAE 55

Query: 252 NQRLKDEN 259
           N+ L+ E 
Sbjct: 56  NEALRKEL 63


>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
           N-terminal.  The DAZ gene-product - Deleted in
           Azoospermia - and a closely related sequence are
           required early in germ-cell development in order to
           maintain germ-cell populations. This family is the
           N-terminal region that is the only part of the protein
           in some fungi and lower metazoa.
          Length = 118

 Score = 26.5 bits (59), Expect = 8.8
 Identities = 11/46 (23%), Positives = 22/46 (47%)

Query: 214 TYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDEN 259
            +S+  L++        ++  L E E   +EL +  +E + LK E+
Sbjct: 72  LHSQECLTQSNSELEERLQEALQEAEEREQELSKQSEELKSLKRES 117


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 27.3 bits (61), Expect = 9.0
 Identities = 13/43 (30%), Positives = 26/43 (60%)

Query: 229 RTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
           + ++RKL E+E E KE +  +K N+++ ++ G  +  + K  K
Sbjct: 87  QGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMK 129


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 27.5 bits (61), Expect = 9.3
 Identities = 16/70 (22%), Positives = 32/70 (45%), Gaps = 5/70 (7%)

Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKK 250
           Q  L++L+  +    +E  Q  A  ++     +EK ++    + + E E  +K+L     
Sbjct: 439 QKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKDL----- 493

Query: 251 ENQRLKDENG 260
           E QR + + G
Sbjct: 494 EAQRAQLQAG 503


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 27.4 bits (60), Expect = 9.4
 Identities = 44/234 (18%), Positives = 69/234 (29%), Gaps = 18/234 (7%)

Query: 23  SSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSY-SRPSQSS 81
           S S +  S +  S S   S S  + NS    +       +     L       + P Q+S
Sbjct: 196 SGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQLSLPDRQHQPALPRQNS 255

Query: 82  PRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGG---RMSRSGSTTSLN 138
                                +PP       S     +      G   R   S S     
Sbjct: 256 AGPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPRQQSSSSKGDSP 315

Query: 139 SLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGE-LDYKKLYEESQAELSRL 197
            LR  A      S  +  +  R+            +  E  E  +     E+ + E++RL
Sbjct: 316 ELRPAAGHLQQPSPVNMSALERTAAWVLMNSQYEDIEYETEENREEGTQAEKYEQEIARL 375

Query: 198 RETLS-------------KTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM 238
           +E L                +E+  Q      +  L + E+R RR  E K S+M
Sbjct: 376 KERLRVSVRRLEEYERRLLGQEQQMQKLLQEYQARLEDSEERLRRQQEEKDSQM 429


>gnl|CDD|177739 PLN00128, PLN00128, Succinate dehydrogenase [ubiquinone]
          flavoprotein subunit.
          Length = 635

 Score = 27.5 bits (61), Expect = 9.5
 Identities = 18/60 (30%), Positives = 21/60 (35%), Gaps = 1/60 (1%)

Query: 12 GRTTSRPRSLYSSSPSPQSRNLSSYSS-GLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTL 70
                 R L SSS S    + S  ++     ST    SSY     TY     G GG  L
Sbjct: 4  RCVARGLRLLASSSASSSLASASLRTALSRFFSTGGGRSSYTIVDHTYDAVVVGAGGAGL 63


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 27.0 bits (59), Expect = 9.5
 Identities = 18/71 (25%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
           E+++ E   L E   + E E  + K    ++   ++ +RE +T+E +  + E E+ E D+
Sbjct: 129 EKARQEREELEE---RMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDE 185

Query: 248 FKKENQRLKDE 258
            + E+   K E
Sbjct: 186 SETESDDDKTE 196


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.300    0.118    0.312 

Gapped
Lambda     K      H
   0.267   0.0630    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,910,933
Number of extensions: 1162772
Number of successful extensions: 2694
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2349
Number of HSP's successfully gapped: 614
Length of query: 271
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 176
Effective length of database: 6,723,972
Effective search space: 1183419072
Effective search space used: 1183419072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 58 (26.3 bits)