RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11675
         (431 letters)



>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 55.5 bits (134), Expect = 3e-08
 Identities = 25/89 (28%), Positives = 29/89 (32%), Gaps = 8/89 (8%)

Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAH--IPPPQIQHIPPPIQH-----QPVQNIHPSIQ 400
             P     PQ     QQ   PPPQP      PPQ Q  PPP           Q  +  + 
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297

Query: 401 -HIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
              Q  L   +QQ   Q      +  L Q
Sbjct: 298 PPQQPQLLPLVQQPQGQQRGPQFREQLVQ 326



 Score = 53.2 bits (128), Expect = 2e-07
 Identities = 25/92 (27%), Positives = 26/92 (28%), Gaps = 9/92 (9%)

Query: 349 IPLQHITPQVQLH-QQQQQQPPPQ---PAHIPPPQIQHIPPPIQHQ--PVQNIHPSIQHI 402
            PL    PQ     QQ Q     Q   P    PPQ Q  PP  Q Q  P     P     
Sbjct: 230 PPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLP 289

Query: 403 QQN---LQQNIQQNLQQNIQQNIQHLLQQNMH 431
           Q     L    Q  L   +QQ           
Sbjct: 290 QGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFR 321



 Score = 52.1 bits (125), Expect = 4e-07
 Identities = 21/85 (24%), Positives = 24/85 (28%), Gaps = 1/85 (1%)

Query: 346 NMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS-IQHIQQ 404
               P Q   PQ    +Q    P P   H    Q Q   P     P Q   P  +     
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPP 241

Query: 405 NLQQNIQQNLQQNIQQNIQHLLQQN 429
            LQQ     L Q +        QQ 
Sbjct: 242 PLQQPQFPGLSQQMPPPPPQPPQQQ 266



 Score = 48.2 bits (115), Expect = 5e-06
 Identities = 20/82 (24%), Positives = 21/82 (25%), Gaps = 2/82 (2%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
           P Q    Q      Q    PP P  +P        P       Q   P  Q  QQ  Q  
Sbjct: 211 PEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPP 270

Query: 410 IQQNLQ--QNIQQNIQHLLQQN 429
             Q     QN       L Q  
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQ 292



 Score = 46.3 bits (110), Expect = 2e-05
 Identities = 23/91 (25%), Positives = 29/91 (31%), Gaps = 13/91 (14%)

Query: 351 LQHITPQVQLHQQQQQQPPPQ----PAHIPPPQI---QHIPPPIQHQ-PVQNIHPSI--- 399
           L+ +  Q+Q  QQ  Q P P     P  +PP Q    Q  PP      P           
Sbjct: 156 LEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQ 215

Query: 400 --QHIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
             Q +    Q   Q  L   + Q    L Q 
Sbjct: 216 PQQFLPAPSQAPAQPPLPPQLPQQPPPLQQP 246



 Score = 46.3 bits (110), Expect = 2e-05
 Identities = 15/96 (15%), Positives = 21/96 (21%), Gaps = 9/96 (9%)

Query: 337 HGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQH---------IPPPI 387
             V+               P  Q     Q          P   +           +PP +
Sbjct: 177 QQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQL 236

Query: 388 QHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQ 423
             QP     P    + Q +     Q  QQ  Q    
Sbjct: 237 PQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQP 272



 Score = 46.3 bits (110), Expect = 3e-05
 Identities = 20/88 (22%), Positives = 20/88 (22%), Gaps = 9/88 (10%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQP-----AHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ- 403
                  Q         Q P QP        P    Q  PPP Q    Q   P  Q    
Sbjct: 218 QFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPP 277

Query: 404 -QNLQQNIQQNLQQNIQQNIQHLLQQNM 430
            QN         Q   Q       QQ  
Sbjct: 278 PQNQPTPHPGLPQG--QNAPLPPPQQPQ 303



 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 19/77 (24%), Positives = 25/77 (32%), Gaps = 7/77 (9%)

Query: 348 NIPLQHITPQVQ--LHQQQQQQPPPQPAHIPPPQ-----IQHIPPPIQHQPVQNIHPSIQ 400
            +P     P  Q     Q Q QPPPQ    P P         +PPP Q Q +  +     
Sbjct: 254 QMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQG 313

Query: 401 HIQQNLQQNIQQNLQQN 417
             +    +     L Q 
Sbjct: 314 QQRGPQFREQLVQLSQQ 330



 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 19/76 (25%), Positives = 22/76 (28%), Gaps = 7/76 (9%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPA-HIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
                 P+    QQ    P   PA    PPQ+   PPP+Q              QQ    
Sbjct: 205 QPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFP------GLSQQMPPP 258

Query: 409 NIQQNLQQNIQQNIQH 424
             Q   QQ      Q 
Sbjct: 259 PPQPPQQQQQPPQPQA 274



 Score = 45.1 bits (107), Expect = 5e-05
 Identities = 20/86 (23%), Positives = 22/86 (25%), Gaps = 7/86 (8%)

Query: 350 PLQHITPQVQLHQQQQQQ-----PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQ 404
             + + PQ  L    Q       PP  P   PP Q    P   Q  P     P  Q  Q 
Sbjct: 210 HPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQP 269

Query: 405 NLQQNIQ--QNLQQNIQQNIQHLLQQ 428
              Q     QN         Q     
Sbjct: 270 PQPQAQPPPQNQPTPHPGLPQGQNAP 295



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 13/76 (17%), Positives = 23/76 (30%), Gaps = 3/76 (3%)

Query: 349 IPLQHITPQVQLH---QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQN 405
            P     PQ Q        Q Q  P P    P  +  +  P   Q        +  + Q 
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQ 330

Query: 406 LQQNIQQNLQQNIQQN 421
            ++ + Q   +  ++ 
Sbjct: 331 QREALSQEEAKRAKRR 346



 Score = 35.9 bits (83), Expect = 0.046
 Identities = 19/90 (21%), Positives = 24/90 (26%), Gaps = 14/90 (15%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPP-------------PQIQHIP-PPIQHQPVQNIHPSIQH 401
           P+ Q  Q  + QP PQ                    Q   +P PP Q  P          
Sbjct: 132 PEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAF 191

Query: 402 IQQNLQQNIQQNLQQNIQQNIQHLLQQNMH 431
            QQ   +      Q       Q   QQ + 
Sbjct: 192 PQQGPPEQPPGYPQPPQGHPEQVQPQQFLP 221



 Score = 29.7 bits (67), Expect = 3.5
 Identities = 15/61 (24%), Positives = 19/61 (31%), Gaps = 5/61 (8%)

Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQ 427
             P+P   PP       P  Q QP       +   +   Q   +Q   Q  Q   Q L Q
Sbjct: 128 TAPKPEPQPPQA-----PESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQ 182

Query: 428 Q 428
            
Sbjct: 183 G 183



 Score = 29.0 bits (65), Expect = 5.4
 Identities = 7/61 (11%), Positives = 14/61 (22%), Gaps = 2/61 (3%)

Query: 350 PLQHITPQVQLHQQQQQ--QPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
             Q     +   QQ Q              PQ +     +  Q  + +        +   
Sbjct: 288 LPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEAKRAKRRH 347

Query: 408 Q 408
           +
Sbjct: 348 K 348


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 46.9 bits (111), Expect = 1e-05
 Identities = 23/83 (27%), Positives = 27/83 (32%), Gaps = 5/83 (6%)

Query: 352 QHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
           Q   P     QQ Q   Q          P   Q   PP   QP          +QQ  Q 
Sbjct: 159 QSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQ--QPPQMGQPGMPGGGGQGQMQQQGQP 216

Query: 409 NIQQNLQQNIQQNIQHLLQQNMH 431
             QQ     +QQ +Q+  QQ M 
Sbjct: 217 GGQQQQNPQMQQQLQNQQQQQMD 239



 Score = 38.8 bits (90), Expect = 0.005
 Identities = 15/67 (22%), Positives = 17/67 (25%)

Query: 351 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 410
            Q             QQ PPQ      P         Q             +QQ LQ   
Sbjct: 175 GQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQ 234

Query: 411 QQNLQQN 417
           QQ + Q 
Sbjct: 235 QQQMDQQ 241



 Score = 38.4 bits (89), Expect = 0.008
 Identities = 19/82 (23%), Positives = 29/82 (35%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
            L  +  Q+Q      Q  P QP   PP +   +P   Q      +    Q ++Q   +N
Sbjct: 288 QLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRN 347

Query: 410 IQQNLQQNIQQNIQHLLQQNMH 431
           ++   Q   QQ  Q       H
Sbjct: 348 MRGQQQTQQQQQQQGGNHPAAH 369



 Score = 36.9 bits (85), Expect = 0.020
 Identities = 15/80 (18%), Positives = 21/80 (26%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
           P Q   P +     Q Q            Q   +   +Q+Q  Q +           Q  
Sbjct: 193 PPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMG 252

Query: 410 IQQNLQQNIQQNIQHLLQQN 429
            QQ  Q  +Q       Q  
Sbjct: 253 QQQQGQGGMQPQQMQGGQMQ 272



 Score = 36.9 bits (85), Expect = 0.024
 Identities = 19/81 (23%), Positives = 25/81 (30%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
            +Q +    Q    Q   PPPQ     P   Q +   +     Q +        +  QQ 
Sbjct: 295 QMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQT 354

Query: 410 IQQNLQQNIQQNIQHLLQQNM 430
            QQ  QQ       H  Q N 
Sbjct: 355 QQQQQQQGGNHPAAHQQQMNQ 375



 Score = 36.5 bits (84), Expect = 0.029
 Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 11/89 (12%)

Query: 352 QHITPQVQLHQQQ-QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP----------SIQ 400
           Q    QV + QQ  QQQP      + P Q+Q +P   Q  P Q + P            Q
Sbjct: 267 QGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQ 326

Query: 401 HIQQNLQQNIQQNLQQNIQQNIQHLLQQN 429
            +QQ +    QQ L+Q   +N++   Q  
Sbjct: 327 AVQQGVMSAGQQQLKQMKLRNMRGQQQTQ 355



 Score = 35.0 bits (80), Expect = 0.082
 Identities = 19/75 (25%), Positives = 25/75 (33%), Gaps = 6/75 (8%)

Query: 356 PQVQLHQQQQ--QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 413
            Q Q+ QQQQ      PQ       Q+    P  Q  P Q    S   +  N  Q +   
Sbjct: 247 AQAQMGQQQQGQGGMQPQQMQGGQMQV----PMQQQPPQQQPQQSQLGMLPNQMQQMPGG 302

Query: 414 LQQNIQQNIQHLLQQ 428
            Q    Q +    Q+
Sbjct: 303 GQGGPGQPMGPPPQR 317



 Score = 33.1 bits (75), Expect = 0.33
 Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 4/69 (5%)

Query: 347 MNIPLQHITPQVQLHQQQQQQPPP--QPAHIPPPQI--QHIPPPIQHQPVQNIHPSIQHI 402
              P+  +     + Q  Q   P    P   PP  +    IP P    P  +   S    
Sbjct: 413 SPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPA 472

Query: 403 QQNLQQNIQ 411
            Q   Q   
Sbjct: 473 SQRTIQQDM 481



 Score = 32.7 bits (74), Expect = 0.43
 Identities = 19/83 (22%), Positives = 27/83 (32%), Gaps = 4/83 (4%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAH-IPPPQIQHIPPPIQHQPVQNIH---PSIQHIQQN 405
             Q+   Q QL  QQQQQ   Q        Q+            Q +      +   QQ 
Sbjct: 220 QQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQP 279

Query: 406 LQQNIQQNLQQNIQQNIQHLLQQ 428
            QQ  QQ+    +   +Q +   
Sbjct: 280 PQQQPQQSQLGMLPNQMQQMPGG 302



 Score = 30.7 bits (69), Expect = 1.6
 Identities = 21/89 (23%), Positives = 24/89 (26%), Gaps = 6/89 (6%)

Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPP------PIQHQPVQNIHPSIQH 401
               Q   P  Q  Q  Q Q   Q         Q  P         Q Q    + P    
Sbjct: 208 GQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQ 267

Query: 402 IQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
             Q      QQ  QQ  QQ+   +L   M
Sbjct: 268 GGQMQVPMQQQPPQQQPQQSQLGMLPNQM 296



 Score = 30.7 bits (69), Expect = 1.6
 Identities = 18/74 (24%), Positives = 21/74 (28%), Gaps = 6/74 (8%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
              Q     QQQPP       P          Q QP        Q     +QQ +Q   Q
Sbjct: 182 QGQQGP-VGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQ-----QQQNPQMQQQLQNQQQ 235

Query: 416 QNIQQNIQHLLQQN 429
           Q + Q       Q 
Sbjct: 236 QQMDQQQGPADAQA 249


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 44.7 bits (105), Expect = 1e-04
 Identities = 25/82 (30%), Positives = 29/82 (35%), Gaps = 2/82 (2%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
           P Q + PQ Q   QQ QQP         PQ    P P   QP Q + P  Q+ Q      
Sbjct: 768 PQQPVAPQPQ--YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 825

Query: 410 IQQNLQQNIQQNIQHLLQQNMH 431
            Q   QQ  Q          +H
Sbjct: 826 PQPQYQQPQQPVAPQPQDTLLH 847



 Score = 40.5 bits (94), Expect = 0.002
 Identities = 20/55 (36%), Positives = 23/55 (41%), Gaps = 7/55 (12%)

Query: 350 PLQHITPQVQLHQQQQQQPP------PQPAHIPPPQIQHIPPPIQHQPVQN-IHP 397
           P Q + PQ Q  Q QQ   P      PQ    P PQ Q    P+  QP    +HP
Sbjct: 794 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848



 Score = 38.9 bits (90), Expect = 0.005
 Identities = 22/69 (31%), Positives = 27/69 (39%), Gaps = 2/69 (2%)

Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 414
           TP V+  QQ QQ   PQ  +  P Q   + P  Q+Q  Q         QQ  Q    Q  
Sbjct: 746 TPIVEPVQQPQQPVAPQQQYQQPQQ--PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ 803

Query: 415 QQNIQQNIQ 423
            Q  QQ + 
Sbjct: 804 YQQPQQPVA 812



 Score = 37.8 bits (87), Expect = 0.014
 Identities = 23/71 (32%), Positives = 28/71 (39%), Gaps = 8/71 (11%)

Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQI-----QHIPPPIQH-QPVQNIHPSIQHIQQNLQQ 408
            PQ  +  QQQ Q P QP    P        Q + P  Q+ QP Q + P  Q+ Q     
Sbjct: 754 QPQQPVAPQQQYQQPQQPV--APQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811

Query: 409 NIQQNLQQNIQ 419
             Q   QQ  Q
Sbjct: 812 APQPQYQQPQQ 822



 Score = 33.5 bits (76), Expect = 0.28
 Identities = 21/112 (18%), Positives = 35/112 (31%), Gaps = 5/112 (4%)

Query: 292 NKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIPL 351
           N+   Q  Q Q   Y  +  +P      A +  Q      +       +  N+       
Sbjct: 394 NEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQ 453

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
               PQ      Q +Q   QP       +   P P++ QPV    P ++  +
Sbjct: 454 STFAPQST---YQTEQTYQQP--AAQEPLYQQPQPVEQQPVVEPEPVVEETK 500



 Score = 30.1 bits (67), Expect = 2.6
 Identities = 13/59 (22%), Positives = 19/59 (32%)

Query: 354 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQ 412
           I P  + + QQ Q   P   +  P Q    P    + P         +     +Q  QQ
Sbjct: 373 IAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQ 431


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 19/56 (33%), Positives = 22/56 (39%), Gaps = 2/56 (3%)

Query: 350 PLQHITPQVQLHQQQ--QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
           P QH  P +    QQ  Q QP   P    P Q Q    PI   P Q   P +  +Q
Sbjct: 84  PTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139



 Score = 36.3 bits (84), Expect = 0.012
 Identities = 22/76 (28%), Positives = 23/76 (30%), Gaps = 11/76 (14%)

Query: 360 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSI-----------QHIQQNLQQ 408
           LH Q        P         HIP     QPV    P +           QH Q NL Q
Sbjct: 35  LHHQIIPVSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQ 94

Query: 409 NIQQNLQQNIQQNIQH 424
             QQ  Q    Q  Q 
Sbjct: 95  PAQQPFQPQPLQPPQP 110



 Score = 32.1 bits (73), Expect = 0.29
 Identities = 17/63 (26%), Positives = 22/63 (34%), Gaps = 8/63 (12%)

Query: 337 HGVITSNSANMNIPLQHITP------QVQLHQQQQQQPPPQPAHIPPPQIQHIPP--PIQ 388
           H +  +     N+P     P      Q    QQ  Q  PP     P P    +PP  P+Q
Sbjct: 80  HSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139

Query: 389 HQP 391
             P
Sbjct: 140 PLP 142



 Score = 31.7 bits (72), Expect = 0.43
 Identities = 18/88 (20%), Positives = 24/88 (27%), Gaps = 11/88 (12%)

Query: 339 VITSNSANMNIPLQHITPQVQLHQQQQQQPP--PQPAHIP--------PPQIQHIPPPIQ 388
           V   +     +   H  P +   Q    Q P  P P            P   Q    P Q
Sbjct: 42  VSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQ 101

Query: 389 HQPVQNIHPSIQHIQQNLQQNIQQNLQQ 416
            QP+Q      Q +Q     +    L  
Sbjct: 102 PQPLQPP-QPQQPMQPQPPVHPIPPLPP 128


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 17/97 (17%), Positives = 27/97 (27%), Gaps = 8/97 (8%)

Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
                 + S   N P     P         Q  P Q   +  P +    P          
Sbjct: 84  QQHNTPTGSPAANPPATFALP-AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQ 142

Query: 396 HPSIQHIQQNLQQN-------IQQNLQQNIQQNIQHL 425
             ++Q +QQ             QQ+ Q+N +  +Q  
Sbjct: 143 QRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQ 179



 Score = 37.8 bits (88), Expect = 0.008
 Identities = 21/114 (18%), Positives = 26/114 (22%), Gaps = 19/114 (16%)

Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPP------------QPAHIPPPQ---- 379
                  +        Q   P  Q    Q  Q  P             PA  PP      
Sbjct: 45  AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP 104

Query: 380 IQHIPPPIQHQPVQN---IHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
                P IQ +P Q      P +           Q   Q+ +QQ  Q       
Sbjct: 105 AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPAS 158



 Score = 33.6 bits (77), Expect = 0.16
 Identities = 18/69 (26%), Positives = 22/69 (31%), Gaps = 4/69 (5%)

Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
           V  ++T N AN  +      P  Q   QQ QQ    PA    P  Q          +Q  
Sbjct: 124 VPVMVTQNPANSPLD----QPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQ 179

Query: 396 HPSIQHIQQ 404
                  QQ
Sbjct: 180 PNGETPPQQ 188



 Score = 33.6 bits (77), Expect = 0.18
 Identities = 19/97 (19%), Positives = 31/97 (31%), Gaps = 8/97 (8%)

Query: 338 GVITSNSANM-NIPLQHITPQVQLHQQQQQQPPP---QPAHIPPPQIQHIPPPIQHQPVQ 393
            V    + N  N PL     Q  L Q QQ+   P   Q         ++    +Q QP  
Sbjct: 123 QVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNG 182

Query: 394 NIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
              P     Q +   + +      +++    L Q+  
Sbjct: 183 ETPPQ----QTDGAGDDESEALVRLREADGTLEQRIK 215



 Score = 32.8 bits (75), Expect = 0.36
 Identities = 21/99 (21%), Positives = 23/99 (23%), Gaps = 23/99 (23%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQ----------PVQNI---HPSIQHI 402
           PQ      Q  QP PQP      Q        QH           P        P+   I
Sbjct: 53  PQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTI 112

Query: 403 QQNLQQNIQQNL----------QQNIQQNIQHLLQQNMH 431
           Q    Q     +              Q   Q  LQQ   
Sbjct: 113 QTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQ 151


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 39.2 bits (91), Expect = 0.003
 Identities = 27/82 (32%), Positives = 28/82 (34%), Gaps = 3/82 (3%)

Query: 346 NMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHI 402
               P  H  P     QQ Q   QQP PQ     PP  Q  PPP Q    Q   P  Q  
Sbjct: 89  QSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVP 148

Query: 403 QQNLQQNIQQNLQQNIQQNIQH 424
           QQ   Q+  Q  Q       Q 
Sbjct: 149 QQQQYQSPPQQPQYQQNPPPQA 170



 Score = 37.2 bits (86), Expect = 0.012
 Identities = 17/51 (33%), Positives = 20/51 (39%), Gaps = 2/51 (3%)

Query: 350 PLQHITPQVQLHQQQQQQPP--PQPAHIPPPQIQHIPPPIQHQPVQNIHPS 398
             Q   P     QQQ Q PP  PQ    PPPQ Q  P      P ++ +  
Sbjct: 138 AQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQP 188



 Score = 36.8 bits (85), Expect = 0.018
 Identities = 24/86 (27%), Positives = 24/86 (27%), Gaps = 3/86 (3%)

Query: 346 NMNIPLQHITPQVQLHQQQQQ--QPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
               P      Q Q     Q   Q PPQ     P Q      P    P Q   P  Q  Q
Sbjct: 78  APQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQ 137

Query: 404 -QNLQQNIQQNLQQNIQQNIQHLLQQ 428
            Q  Q   Q   QQ  Q   Q    Q
Sbjct: 138 AQQPQPPPQVPQQQQYQSPPQQPQYQ 163



 Score = 33.4 bits (76), Expect = 0.19
 Identities = 14/71 (19%), Positives = 19/71 (26%), Gaps = 3/71 (4%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
                         Q P P        PP  Q+ P  +  Q VQ++       Q+     
Sbjct: 66  LQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQY-PSQLPPQQVQSVPQQPTPQQEPYYPP 124

Query: 410 IQQNLQQNIQQ 420
             Q      QQ
Sbjct: 125 PSQPQPPPAQQ 135



 Score = 33.4 bits (76), Expect = 0.20
 Identities = 18/47 (38%), Positives = 19/47 (40%)

Query: 363 QQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
           QQ Q   PQP    P Q Q+  PP Q Q  QN  P  Q   Q     
Sbjct: 134 QQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY 180



 Score = 31.8 bits (72), Expect = 0.66
 Identities = 22/77 (28%), Positives = 27/77 (35%), Gaps = 4/77 (5%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
            +Q +  Q    Q+    PP QP   P   PQ Q   PP Q    Q      Q  Q   Q
Sbjct: 106 QVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQY--Q 163

Query: 408 QNIQQNLQQNIQQNIQH 424
           QN     Q   Q +  +
Sbjct: 164 QNPPPQAQSAPQVSGLY 180



 Score = 31.4 bits (71), Expect = 0.78
 Identities = 20/66 (30%), Positives = 24/66 (36%), Gaps = 3/66 (4%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 414
            QV  H+          A +PP P  Q   P  Q Q      PS Q+  Q   Q +Q   
Sbjct: 54  EQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQS--QAPPSHQYPSQLPPQQVQSVP 111

Query: 415 QQNIQQ 420
           QQ   Q
Sbjct: 112 QQPTPQ 117



 Score = 28.4 bits (63), Expect = 8.8
 Identities = 13/49 (26%), Positives = 16/49 (32%), Gaps = 2/49 (4%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 398
           P Q          Q QQ PPPQ    P     +  P       Q+  P+
Sbjct: 148 PQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY--PEESPYQPQSYPPN 194


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 38.8 bits (90), Expect = 0.007
 Identities = 17/45 (37%), Positives = 18/45 (40%), Gaps = 3/45 (6%)

Query: 356  PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 400
            P V    +    PP QP   P PQ    PPP Q QP     P  Q
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQ---APPPPQPQPQPPPPPQPQ 2930



 Score = 35.3 bits (81), Expect = 0.069
 Identities = 14/32 (43%), Positives = 14/32 (43%)

Query: 355  TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 386
             PQ     Q Q QPPP P   PPP     P P
Sbjct: 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941



 Score = 35.3 bits (81), Expect = 0.079
 Identities = 15/44 (34%), Positives = 15/44 (34%), Gaps = 5/44 (11%)

Query: 355  TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 398
             PQ Q     Q QPPP P   P P     P      P     PS
Sbjct: 2918 QPQPQPPPPPQPQPPPPPPPRPQP-----PLAPTTDPAGAGEPS 2956



 Score = 34.9 bits (80), Expect = 0.090
 Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 355  TPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQP 391
            + +       Q + PPQP   PPPQ Q   PPP Q QP
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931



 Score = 34.5 bits (79), Expect = 0.13
 Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 1/37 (2%)

Query: 356  PQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQP 391
            P  Q  +  Q Q PP P   P P     P PP    P
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 32.6 bits (74), Expect = 0.45
 Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 1/51 (1%)

Query: 350  PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 400
             L    P+     Q    P PQP   PPPQ Q  PPP   +P   + P+  
Sbjct: 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTD 2948



 Score = 31.8 bits (72), Expect = 0.80
 Identities = 8/37 (21%), Positives = 9/37 (24%)

Query: 350  PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 386
            P     P      Q    PPP+P     P        
Sbjct: 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953



 Score = 31.4 bits (71), Expect = 1.2
 Identities = 9/38 (23%), Positives = 14/38 (36%)

Query: 356  PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
            P V+   +       +   +PP Q +  P P    P Q
Sbjct: 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQ 2918


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 38.3 bits (90), Expect = 0.008
 Identities = 12/60 (20%), Positives = 18/60 (30%), Gaps = 1/60 (1%)

Query: 362 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
           QQ Q             Q Q      Q Q  Q    S+   QQ L+  + +  +  +   
Sbjct: 625 QQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGR-QRGGLPGM 683



 Score = 33.3 bits (77), Expect = 0.31
 Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 6/69 (8%)

Query: 362 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
           Q+ +QQ           Q        Q Q         Q  Q +L +  QQ L+  + + 
Sbjct: 618 QRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAER-QQALRDELGR- 675

Query: 422 IQHLLQQNM 430
                +  +
Sbjct: 676 ----QRGGL 680


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 37.5 bits (86), Expect = 0.013
 Identities = 9/52 (17%), Positives = 35/52 (67%), Gaps = 3/52 (5%)

Query: 379 QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
           Q +++P  ++     NI    +  ++N++++ ++N+++N+++N++  +++N+
Sbjct: 931 QNENVPEHLKEHAEANIE---EDAEENVEEDAEENVEENVEENVEENVEENV 979



 Score = 34.8 bits (79), Expect = 0.094
 Identities = 16/112 (14%), Positives = 53/112 (47%), Gaps = 26/112 (23%)

Query: 336  VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
            + G I      +N  L++    V  H ++  +     A+I     +     ++    +N+
Sbjct: 913  LSGNIAHEINLINKELKNQNENVPEHLKEHAE-----ANIE----EDAEENVEEDAEENV 963

Query: 396  HPSIQ-----------------HIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
              +++                 ++++N+++N+++N+++NI++N++  +++N+
Sbjct: 964  EENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENI 1015



 Score = 31.7 bits (71), Expect = 1.0
 Identities = 9/43 (20%), Positives = 30/43 (69%)

Query: 381  QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQ 423
            ++I   I+    +N+  +++ I++N+++N+++N ++N ++N +
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAE 1116



 Score = 30.5 bits (68), Expect = 2.4
 Identities = 9/38 (23%), Positives = 27/38 (71%)

Query: 393  QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
            +NI  +I+   +   + I++N+++NI++N++  +++N+
Sbjct: 1055 ENIEENIEEYDEENVEEIEENIEENIEENVEENVEENV 1092



 Score = 30.1 bits (67), Expect = 2.5
 Identities = 7/31 (22%), Positives = 28/31 (90%)

Query: 400  QHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
            +++++N+++N+++N+++NI++N++   ++N+
Sbjct: 997  ENVEENIEENVEENVEENIEENVEEYDEENV 1027



 Score = 30.1 bits (67), Expect = 3.2
 Identities = 7/28 (25%), Positives = 26/28 (92%)

Query: 398  SIQHIQQNLQQNIQQNLQQNIQQNIQHL 425
            +++ I++N+++NI++N+++N+++N++ +
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEI 1095



 Score = 28.6 bits (63), Expect = 7.8
 Identities = 7/37 (18%), Positives = 25/37 (67%)

Query: 393  QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQN 429
            +N+    +++++N+++N ++N ++N ++N +    +N
Sbjct: 1090 ENVEEIEENVEENVEENAEENAEENAEENAEEYDDEN 1126



 Score = 28.6 bits (63), Expect = 8.2
 Identities = 8/51 (15%), Positives = 31/51 (60%)

Query: 379  QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQN 429
             ++ I   I+    +N+  +++   + +++N+++N+++N ++N +   ++N
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 36.6 bits (85), Expect = 0.014
 Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 343 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQPVQN 394
           N     +P   +  Q Q  QQQ      QP  + PP+    P  P    PV+ 
Sbjct: 108 NEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRT 160



 Score = 36.6 bits (85), Expect = 0.014
 Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 1/43 (2%)

Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP 397
            P+  +  QQQ Q    PA    PQ    PP     PVQ   P
Sbjct: 114 VPRSTVQIQQQAQQQQPPATTAQPQ-PVTPPRQTTAPVQPQTP 155



 Score = 35.0 bits (81), Expect = 0.056
 Identities = 11/43 (25%), Positives = 11/43 (25%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQN 394
           Q   P     Q Q   PP Q      PQ           PV  
Sbjct: 126 QQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168



 Score = 33.5 bits (77), Expect = 0.15
 Identities = 27/109 (24%), Positives = 37/109 (33%), Gaps = 16/109 (14%)

Query: 325 QRNTYIKHLEIVH-GVIT----SNSANMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIP 376
           +R  YIK LE    GV T    S    +N P Q    Q QL +Q Q   +Q P Q + +P
Sbjct: 47  ERWRYIKELENRQIGVPTPTEPSAGGEVNSPTQLTDEQRQLLEQMQADMRQQPTQLSEVP 106

Query: 377 -----PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQ 420
                P   +     +Q Q             Q       +     +Q 
Sbjct: 107 YNEQTPQVPR---STVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQP 152



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 10/44 (22%), Positives = 10/44 (22%)

Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQP 391
            I  Q    Q      Q Q   P      P Q Q   P      
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPA 163



 Score = 29.6 bits (67), Expect = 3.0
 Identities = 9/39 (23%), Positives = 12/39 (30%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ 388
           P         +   +Q   P QP    P + Q   P  Q
Sbjct: 130 PPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168



 Score = 28.5 bits (64), Expect = 6.0
 Identities = 9/37 (24%), Positives = 11/37 (29%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 386
           P     P+      Q Q P P       P  Q +  P
Sbjct: 137 PQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 36.6 bits (85), Expect = 0.018
 Identities = 15/59 (25%), Positives = 19/59 (32%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
           P     P  Q   +Q  Q P  P   P PQ  H  P    Q  Q   P      + + +
Sbjct: 126 PQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAE 184



 Score = 33.9 bits (78), Expect = 0.13
 Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 8/76 (10%)

Query: 353 HITPQVQLHQQQQQQPP-----PQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
                 +   Q Q QPP     P+     PP+   +PP    +P Q     +Q  Q   Q
Sbjct: 79  QEHEAARPSPQHQYQPPYASAQPRQPVQQPPEA-QVPPQHAPRPAQPAPQPVQ--QPAYQ 135

Query: 408 QNIQQNLQQNIQQNIQ 423
              +Q LQQ +   + 
Sbjct: 136 PQPEQPLQQPVSPQVA 151



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 12/58 (20%), Positives = 18/58 (31%), Gaps = 8/58 (13%)

Query: 350 PLQHITPQVQ---LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQ 404
             Q ++PQV           QP  Q      P         Q +PV    P +   ++
Sbjct: 142 LQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPV-----AAPQPEPVAEPAPVMDKPKR 194



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 21/59 (35%), Positives = 25/59 (42%), Gaps = 2/59 (3%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
           P QH     Q   Q  QQP  QP    P Q Q + P +   P Q +H + Q  QQ  Q 
Sbjct: 114 PPQHAPRPAQPAPQPVQQPAYQPQPEQPLQ-QPVSPQVAPAP-QPVHSAPQPAQQAFQP 170



 Score = 28.8 bits (65), Expect = 5.9
 Identities = 19/79 (24%), Positives = 26/79 (32%), Gaps = 8/79 (10%)

Query: 358 VQLHQQQQQQPPPQPAHIPPPQIQH-IPPPIQH-QPVQNIHPSIQ------HIQQNLQQN 409
           V++H+        Q      P  QH   PP    QP Q +    +      H  +  Q  
Sbjct: 66  VRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPA 125

Query: 410 IQQNLQQNIQQNIQHLLQQ 428
            Q   Q   Q   +  LQQ
Sbjct: 126 PQPVQQPAYQPQPEQPLQQ 144


>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein.  The frequency clock
           protein, is the central component of the frq-based
           circadian negative feedback loop, regulates various
           aspects of the circadian clock in Neurospora crassa.
           This protein has been shown to interact with itself via
           a coiled-coil.
          Length = 989

 Score = 36.9 bits (85), Expect = 0.021
 Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 5/65 (7%)

Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP----IQHQP 391
           V    ++ + +++  L  +  Q Q   Q     PP    I  P      P     I  Q 
Sbjct: 316 VRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPP-SNFITNPFSSVPEPLREARILPQE 374

Query: 392 VQNIH 396
            QN H
Sbjct: 375 GQNPH 379


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 36.6 bits (84), Expect = 0.021
 Identities = 22/77 (28%), Positives = 29/77 (37%), Gaps = 20/77 (25%)

Query: 107 NSVHKDVKKYNCE--ECDKYYLSQGELDSHVQTSHRGI------------------KPYV 146
               KD K Y C    C+K Y +Q  L  H+   H+                    KPY 
Sbjct: 341 MLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYR 400

Query: 147 CEICSKAFTQQYNLKHH 163
           CE+C K +     LK+H
Sbjct: 401 CEVCDKRYKNLNGLKYH 417



 Score = 36.6 bits (84), Expect = 0.022
 Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 13/87 (14%)

Query: 141 GIKPYVCEI--CSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQHSFVHGG 198
             KPY C +  C+K +  Q  LK+H +LHG            +         + +     
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYH-MLHG---------HQNQKLHENPSPEKMNIFSAK 395

Query: 199 EKPFKCSYCPKAFSHKGHLNEHLYTHS 225
           +KP++C  C K + +   L  H   HS
Sbjct: 396 DKPYRCEVCDKRYKNLNGLKYHR-KHS 421


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 36.2 bits (83), Expect = 0.030
 Identities = 17/64 (26%), Positives = 20/64 (31%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
           P     P VQ   QQ     P       PQ Q + P  Q  P        Q  Q + +  
Sbjct: 201 PKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSR 260

Query: 410 IQQN 413
            Q N
Sbjct: 261 EQGN 264



 Score = 33.9 bits (77), Expect = 0.14
 Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 1/74 (1%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
           +H        Q   Q P  QP  +  P  Q    P Q        P+ Q+  Q   Q  Q
Sbjct: 196 RHRHAPKPTQQPTVQNPAQQPT-VQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254

Query: 412 QNLQQNIQQNIQHL 425
            + +   Q N + L
Sbjct: 255 GHKRSREQGNQEFL 268



 Score = 33.5 bits (76), Expect = 0.19
 Identities = 25/74 (33%), Positives = 28/74 (37%), Gaps = 14/74 (18%)

Query: 368 PPPQPAHIPPPQI------------QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
           PP  P   P P +            +H P P Q   VQN  P+ Q   QN  Q  QQ  Q
Sbjct: 173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQN--PAQQPTVQNPAQQPQQQPQ 230

Query: 416 QNIQQNIQHLLQQN 429
           Q   Q  Q    QN
Sbjct: 231 QQPVQPAQQPTPQN 244



 Score = 33.5 bits (76), Expect = 0.22
 Identities = 20/56 (35%), Positives = 23/56 (41%), Gaps = 3/56 (5%)

Query: 374 HIPPPQIQ-HIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
           H P P  Q  +  P Q   VQN  P+ Q  QQ  QQ +Q   Q   Q   Q   Q 
Sbjct: 199 HAPKPTQQPTVQNPAQQPTVQN--PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQT 252



 Score = 29.6 bits (66), Expect = 3.5
 Identities = 12/34 (35%), Positives = 13/34 (38%)

Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ 381
           N   Q      Q   Q  QQP PQ     PPQ +
Sbjct: 220 NPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253



 Score = 29.3 bits (65), Expect = 4.4
 Identities = 16/44 (36%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query: 348 NIPLQHITPQVQLHQQQQQQ-PPPQPAHIPPPQIQHIPPPIQHQ 390
           N   Q          QQQ Q  P QPA  P PQ     PP   Q
Sbjct: 211 NPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 35.8 bits (82), Expect = 0.034
 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 354 ITPQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHP 397
           I P  +++ Q+  QP        P PQ+Q   P     P Q I+P
Sbjct: 130 IDPTEEVNTQEPTQPAGVNVANNPQPQVQ---PQFGPNPQQRINP 171



 Score = 30.0 bits (67), Expect = 2.1
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 11/71 (15%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
           + +  Q+Q   +Q +Q   Q   I P +  +   P Q        P+  ++  N Q  +Q
Sbjct: 110 EQMAEQLQRISEQNEQ---QAIEIDPTEEVNTQEPTQ--------PAGVNVANNPQPQVQ 158

Query: 412 QNLQQNIQQNI 422
                N QQ I
Sbjct: 159 PQFGPNPQQRI 169


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 35.9 bits (83), Expect = 0.041
 Identities = 29/164 (17%), Positives = 45/164 (27%), Gaps = 20/164 (12%)

Query: 266 CFSTKSELNR-----HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYTHSKV---KPYNCG 317
           CFS   E NR     H ++ GG KP    +  +   ++  LQ+  +   +    +     
Sbjct: 332 CFSNPEEANRAVTEMHGRMLGG-KPLYVALAQRKEQRRAHLQD-QFMQLQPRMRQLPMGS 389

Query: 318 ICAKSFVQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPA-HIP 376
               +  Q   Y +                       TP       +     P  A   P
Sbjct: 390 PMGGAMGQPPYYGQG---PQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446

Query: 377 PPQIQ-----HIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
               Q         P+ + P     P  Q + Q  Q    Q  Q
Sbjct: 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQP-QSTASQGGQ 489


>gnl|CDD|220871 pfam10759, DUF2587, Protein of unknown function (DUF2587).  This is
           a bacterial family of proteins with no known function.
          Length = 168

 Score = 34.7 bits (80), Expect = 0.042
 Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 6/57 (10%)

Query: 337 HGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
           HG+ T+      +  Q +  + QL Q +++  PP     PP Q Q           Q
Sbjct: 116 HGIQTA------LFAQQMAARAQLEQMRRRALPPGVGIAPPGQPQGARGGPPPGTGQ 166


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 30.8 bits (70), Expect = 0.064
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 188 HLVQHSFVHGGEKPFKCSYCPKAFSH 213
           +L +H   H GEKP+KC  C K+FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 30.8 bits (70), Expect = 0.076
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 272 ELNRHAQIHGGVKPFACTVCNKSFLQ 297
            L RH + H G KP+ C VC KSF  
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 30.1 bits (68), Expect = 0.13
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 301 LQNHLYTHSKVKPYNCGICAKSFVQ 325
           L+ H+ TH+  KPY C +C KSF  
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 27.7 bits (62), Expect = 0.78
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 216 HLNEHLYTHSENKCFQCKYCPKTFT 240
           +L  H+ TH+  K ++C  C K+F+
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFS 25



 Score = 27.7 bits (62), Expect = 1.0
 Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 130 ELDSHVQTSHRGIKPYVCEICSKAFTQ 156
            L  H++T H G KPY C +C K+F+ 
Sbjct: 1   NLRRHMRT-HTGEKPYKCPVCGKSFSS 26



 Score = 26.2 bits (58), Expect = 3.5
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 244 HLKAHTNKHEGVLPHTCLECSKCFST 269
           +L+ H   H G  P+ C  C K FS+
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 25.0 bits (55), Expect = 7.9
 Identities = 8/26 (30%), Positives = 10/26 (38%)

Query: 73 EFNEHIMKHEKEKLFSCNFCLKPFLR 98
              H+  H  EK + C  C K F  
Sbjct: 1  NLRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 35.3 bits (81), Expect = 0.070
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 6/41 (14%)

Query: 367 QPPPQPAHIP----PPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
           QP   PA IP    PP IQ++ PP    P Q I P IQHI+
Sbjct: 232 QPSAPPASIPAPPIPPVIQYVAPP-PVPPPQPIIP-IQHIR 270



 Score = 29.5 bits (66), Expect = 3.6
 Identities = 12/40 (30%), Positives = 13/40 (32%), Gaps = 1/40 (2%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQH 389
           P Q   P   +            A  P P  Q I  PIQH
Sbjct: 230 PAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPI-IPIQH 268


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 32.5 bits (74), Expect = 0.14
 Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 1/38 (2%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
           PQ Q+ Q     P P    +P  Q+  + P   H P+ 
Sbjct: 20  PQAQMPQASAPYPGPSMY-LPMAQVMAVGPQSSHPPMA 56



 Score = 29.8 bits (67), Expect = 1.2
 Identities = 11/60 (18%), Positives = 17/60 (28%), Gaps = 7/60 (11%)

Query: 356 PQVQLHQQQQQQPPP-----QPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ--HIQQNLQQ 408
              +L+Q     PP        A  P P +      +     Q+ HP +    I      
Sbjct: 7   AYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPV 66



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 12/49 (24%), Positives = 16/49 (32%), Gaps = 2/49 (4%)

Query: 344 SANMN-IPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQ 390
            A M      +  P + L   Q     PQ +H P        PPP+   
Sbjct: 21  QAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69


>gnl|CDD|213398 cd12191, gal11_coact, gall11 coactivator domain.  Gall11/MED15 acts
           in the general regulation of GAL structural genes and is
           required for full expression for several genes in this
           pathway, including GALs 1,7, and 10 in Saccharomyces
           cerevisiae. GAL11 function is dependent on GCN4
           functionality and binds GCN4 in a degenerate manner with
           multiple orientations found at the GCN4-Gal11 interface.
          Length = 90

 Score = 31.6 bits (72), Expect = 0.15
 Identities = 26/89 (29%), Positives = 34/89 (38%), Gaps = 23/89 (25%)

Query: 356 PQVQLHQQQQQQPPPQPAH---------IPPP---QIQHIPPPI-----------QHQPV 392
            Q Q  QQQQQQ  PQ            IPP    +I ++PP +           Q+   
Sbjct: 2   QQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAKIPNLPPEVKTWRQLKAWLQQNNIP 61

Query: 393 QNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
                 I+ +QQ   Q + Q  QQ  QQ 
Sbjct: 62  PQAMEQIKEVQQTHFQLLMQRRQQQQQQQ 90


>gnl|CDD|188742 cd09356, LIM2_TRIP6, The second LIM domain of Thyroid
          receptor-interacting protein 6 (TRIP6).  The second LIM
          domain of Thyroid receptor-interacting protein 6
          (TRIP6): TRIP6 is a member of the zyxin LIM protein
          family and contains three LIM zinc-binding domains at
          the C-terminal. TRIP6 protein localizes to focal
          adhesion sites and along actin stress fibers.
          Recruitment of this protein to the plasma membrane
          occurs in a lysophosphatidic acid (LPA)-dependent
          manner. TRIP6 recruits a number of molecules involved
          in actin assembly, cell motility, survival and
          transcriptional control. The function of TRIP6 in cell
          motility is regulated by Src-dependent phosphorylation
          at a Tyr residue. The phosphorylation activates the
          coupling to the Crk SH2 domain, which is required for
          the function of TRIP6 in promoting lysophosphatidic
          acid (LPA)-induced cell migration. TRIP6 can shuttle to
          the nucleus to serve as a coactivator of AP-1 and
          NF-kappaB transcriptional factors. Moreover, TRIP6 can
          form a ternary complex with the NHERF2 PDZ protein and
          LPA2 receptor to regulate LPA-induced activation of ERK
          and AKT, rendering cells resistant to chemotherapy.
          Recent evidence shows that TRIP6 antagonizes
          Fas-Induced apoptosis by enhancing the antiapoptotic
          effect of LPA in cells. As in other LIM domains, this
          domain family is 50-60 amino acids in size and shares
          two characteristic zinc finger motifs. The two zinc
          fingers contain eight conserved residues, mostly
          cysteines and histidines, which coordinately bond to
          two zinc atoms. LIM domains function as adaptors or
          scaffolds to support the assembly of multimeric
          protein.
          Length = 53

 Score = 30.6 bits (69), Expect = 0.15
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 35 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 69
          C +C K   +++L    K  H   FTC VC RS +
Sbjct: 1  CSVCSKPIMERILRATGKAYHPHCFTCVVCHRSLD 35


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 33.5 bits (77), Expect = 0.23
 Identities = 14/96 (14%), Positives = 30/96 (31%), Gaps = 11/96 (11%)

Query: 342 SNSANMNIP-LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS-- 398
                     L+     +Q     ++Q   +PA     + +     IQ +   N   S  
Sbjct: 17  RELLTAESAQLEAALQLLQEAVNSKRQEEAEPA---AEEAELQAELIQQELAINDQLSQA 73

Query: 399 ----IQHIQQNLQQNIQ-QNLQQNIQQNIQHLLQQN 429
                + +      + Q  NL   + Q+ + + +Q 
Sbjct: 74  LNQQTERLNALASDDRQLANLLLQLLQSSRTIREQI 109


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 31.8 bits (73), Expect = 0.27
 Identities = 10/45 (22%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 385 PPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ-QNIQQNIQHLLQQ 428
           PP  + P  +I P +  ++Q  +  I ++ + Q + ++I  L ++
Sbjct: 23  PPADYTPYGDISPFLPELRQKHKARIAKDPEFQYLNEDIAWLKEE 67


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 33.1 bits (75), Expect = 0.28
 Identities = 42/164 (25%), Positives = 58/164 (35%), Gaps = 17/164 (10%)

Query: 200 KPFKCSYCPKAFSHKGHLNEHLYTHS----ENKCFQCKY--CPKTFTLRKHLKAHTNKHE 253
            P K   C  +FS    L  HL + +      K F C Y  C K F+    LK H   H 
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347

Query: 254 GVLPHTC--LECSKCFSTKSELNRHAQIHG-----GVKPFACTV--CNKSFLQKTQLQNH 304
            + P     L  S  FS          +         K        C ++F + + L  H
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLH 407

Query: 305 LYTHSKVKPYNC--GICAKSFVQRNTYIKHLEIVHGVITSNSAN 346
           + TH   +PYNC    C+KSF +    I H +I         + 
Sbjct: 408 IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSI 451



 Score = 33.1 bits (75), Expect = 0.31
 Identities = 36/261 (13%), Positives = 67/261 (25%), Gaps = 30/261 (11%)

Query: 171 NSLFRCDQCGKAFSRKGHLVQHSFVHGGEKPFKCSYCPKAFSHKGHLNEHLYTHSENKCF 230
                C  C  +FSR  HL +H   H GEKP +CSY     S    L             
Sbjct: 31  PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLE------------ 78

Query: 231 QCKYCPKTFTLRKHLKAHTNKHEGVLPHTCLECSKCFSTKSELNRHAQIHGGVKPFACTV 290
                     L +HL+ H N    +   +        S  S  +  +           + 
Sbjct: 79  ----------LSRHLRTHHNNPSDLNSKS---LPLSNSKASSSSLSSSSSNSNDNNLLSS 125

Query: 291 CNKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIP 350
            +     +      L + S ++         S V           +     S   + N+ 
Sbjct: 126 HSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLS 185

Query: 351 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 410
           L   +          +  P   ++  P          Q+    +    +    Q   +++
Sbjct: 186 LLISSNVSTSIPSSSENSPLSSSYSIPSSSSD-----QNLENSSSSLPLTTNSQLSPKSL 240

Query: 411 QQNLQQNIQQNIQHLLQQNMH 431
                 ++  +          
Sbjct: 241 LSQSPSSLSSSDSSSSASESP 261



 Score = 30.4 bits (68), Expect = 1.9
 Identities = 48/202 (23%), Positives = 67/202 (33%), Gaps = 19/202 (9%)

Query: 151 SKAFTQQYNLKHHLLL-HGDGNSLFRCDQCGKAFSRKGHLVQH--SFVHGGE--KPFKC- 204
             A +Q  +            +   +  QC  +FSR   L +H  S  H GE  KPF C 
Sbjct: 266 PTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCP 325

Query: 205 -SYCPKAFSHKGHLNEHLYTHSENKCFQCKYCPKTFTLRKHLKA-------HTNKHEGVL 256
            S C K FS    L  H+  H+     + K    +      L              +   
Sbjct: 326 YSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDK 385

Query: 257 PHTCLECSKCFSTKSELNR--HAQIHGGVKP--FACTVCNKSFLQKTQLQNHLYTHSKVK 312
               L  S   + K + N   H   H   +P       C+KSF +   L  H   H+   
Sbjct: 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA 445

Query: 313 PYNCGICAKSFVQRNTYIKHLE 334
           P  C I  KSF +      H +
Sbjct: 446 PLLCSIL-KSFRRDLDLSNHGK 466


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 32.7 bits (75), Expect = 0.28
 Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 1/58 (1%)

Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
           VH VI   +    I +  + P       Q  QPPP+P   P P+ + IP P +  PV 
Sbjct: 35  VHQVIELPAPAQPISVTMVAPADL-EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 91


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 32.3 bits (73), Expect = 0.47
 Identities = 20/80 (25%), Positives = 26/80 (32%), Gaps = 14/80 (17%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
           Q +  Q Q  Q  +QQ   Q +       Q      Q  PV              Q   +
Sbjct: 151 QTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPV--------------QAQPR 196

Query: 412 QNLQQNIQQNIQHLLQQNMH 431
           Q+   + QQ  Q LLQ   H
Sbjct: 197 QSKPASTQQPYQDLLQTPAH 216



 Score = 29.3 bits (65), Expect = 4.6
 Identities = 20/72 (27%), Positives = 27/72 (37%), Gaps = 2/72 (2%)

Query: 351 LQHITPQVQLHQQQQ-QQPPPQPAHIPPPQIQHI-PPPIQHQPVQNIHPSIQHIQQNLQQ 408
           LQ      QL +QQ+  Q           Q +     P+Q QP Q+   S Q   Q+L Q
Sbjct: 153 LQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQ 212

Query: 409 NIQQNLQQNIQQ 420
                  Q+  Q
Sbjct: 213 TPAHTTAQSKPQ 224


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 32.3 bits (74), Expect = 0.48
 Identities = 10/47 (21%), Positives = 13/47 (27%)

Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 414
            P  P+  P P     PP  Q  P+    P+               L
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 15/58 (25%), Positives = 20/58 (34%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
             Q  TP           PPP PA  PPP +Q +PP    +P      +       + 
Sbjct: 218 SFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 28.5 bits (64), Expect = 0.50
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 203 KCSYCPKAFSHKGHLNEHLYTH 224
           KC  C K+FS K +L  HL TH
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.1 bits (63), Expect = 0.64
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 146 VCEICSKAFTQQYNLKHHLLLH 167
            C  C K+F+++ NLK HL  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 27.7 bits (62), Expect = 0.88
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 231 QCKYCPKTFTLRKHLKAHTNKH 252
           +C  C K+F+ + +LK H   H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 26.9 bits (60), Expect = 1.4
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 259 TCLECSKCFSTKSELNRHAQIH 280
            C +C K FS KS L RH + H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 25.8 bits (57), Expect = 4.2
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 175 RCDQCGKAFSRKGHLVQHSFVH 196
           +C  CGK+FSRK +L +H   H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 25.0 bits (55), Expect = 7.5
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 288 CTVCNKSFLQKTQLQNHLYTH 308
           C  C KSF +K+ L+ HL TH
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 30.5 bits (69), Expect = 0.50
 Identities = 17/57 (29%), Positives = 18/57 (31%), Gaps = 4/57 (7%)

Query: 346 NMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPP----PIQHQPVQNIHPS 398
               P Q+   Q    Q   QQ PP P   P  Q  H  P    P   Q  Q   P 
Sbjct: 7   TNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPP 63



 Score = 28.5 bits (64), Expect = 2.5
 Identities = 20/73 (27%), Positives = 24/73 (32%), Gaps = 8/73 (10%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
                 Q QQQQP  Q      P  Q +PPP    P Q        +    Q   +Q+ Q
Sbjct: 7   TNQYPPQNQQQQPYQQ-----QPYHQQMPPPPYSPPQQ---QQGHFMPPQPQPYPKQSPQ 58

Query: 416 QNIQQNIQHLLQQ 428
           Q         L Q
Sbjct: 59  QQQPPQFSSFLSQ 71


>gnl|CDD|235654 PRK05957, PRK05957, aspartate aminotransferase; Provisional.
          Length = 389

 Score = 32.0 bits (73), Expect = 0.61
 Identities = 15/41 (36%), Positives = 19/41 (46%)

Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
           PPP  A          P   ++Q VQ I P ++ I Q LQQ
Sbjct: 40  PPPPEAIEALNNFLANPENHKYQAVQGIPPLLEAITQKLQQ 80


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 28.2 bits (63), Expect = 0.67
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 202 FKCSYCPKAFSHKGHLNEHLYTH 224
           ++C  C K F  K  L EH+ TH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 2.7
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 258 HTCLECSKCFSTKSELNRHAQIH 280
           + C EC K F +KS L  H + H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 2.7
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 145 YVCEICSKAFTQQYNLKHHLLLH 167
           Y C  C K F  +  L+ H+  H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 2.8
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 286 FACTVCNKSFLQKTQLQNHLYTH 308
           + C  C K F  K+ L+ H+ TH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 3.2
 Identities = 6/23 (26%), Positives = 10/23 (43%)

Query: 59 FTCEVCMRSFNDRKEFNEHIMKH 81
          + C  C + F  +    EH+  H
Sbjct: 1  YRCPECGKVFKSKSALREHMRTH 23



 Score = 25.1 bits (55), Expect = 7.9
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 174 FRCDQCGKAFSRKGHLVQHSFVH 196
           +RC +CGK F  K  L +H   H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 24.7 bits (54), Expect = 8.8
 Identities = 7/23 (30%), Positives = 11/23 (47%)

Query: 230 FQCKYCPKTFTLRKHLKAHTNKH 252
           ++C  C K F  +  L+ H   H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 30.7 bits (69), Expect = 0.82
 Identities = 12/36 (33%), Positives = 15/36 (41%)

Query: 343 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPP 378
            + N + P Q      Q  QQQ +  P QPA  P  
Sbjct: 126 GNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161


>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
          Length = 332

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 14/89 (15%), Positives = 30/89 (33%), Gaps = 13/89 (14%)

Query: 348 NIPLQHITPQVQLHQQQQQQP-----PPQPAHIPPPQ-IQHIPPPIQHQPVQNIHPSIQH 401
            +P Q     +Q H+ +  QP                 I+    P    P++ I+ S + 
Sbjct: 104 TLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIEETQSPNV--PIEGINSSSEQ 161

Query: 402 IQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
           ++  L +   +     I  +  H ++   
Sbjct: 162 LRVELAELAAE-----IYSDASHRVELAK 185



 Score = 30.5 bits (69), Expect = 1.8
 Identities = 7/76 (9%), Positives = 20/76 (26%), Gaps = 4/76 (5%)

Query: 324 VQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHI 383
            Q N     +E            +N   +    +V+L +   +        +   +   +
Sbjct: 133 TQTNVASMTIEETQSPNVPI-EGINSSSEQ--LRVELAELAAEIYSDASHRVELAK-NFM 188

Query: 384 PPPIQHQPVQNIHPSI 399
            P  + +       + 
Sbjct: 189 EPQAETEAQPEATTNF 204


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 11/70 (15%), Positives = 13/70 (18%), Gaps = 4/70 (5%)

Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQ-IQHIPPPIQHQPVQNIHP--SIQHIQQNLQQNIQ 411
           TP            P  P            P            P         N      
Sbjct: 378 TPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437

Query: 412 Q-NLQQNIQQ 420
             NL++  QQ
Sbjct: 438 SLNLEELWQQ 447


>gnl|CDD|114474 pfam05750, Rubella_Capsid, Rubella capsid protein.  Rubella virus
           is an enveloped positive-strand RNA virus of the family
           Togaviridae. Virions are composed of three structural
           proteins: a capsid and two membrane-spanning
           glycoproteins, E2 and E1. During virus assembly, the
           capsid interacts with genomic RNA to form nucleocapsids.
           It has been discovered that capsid phosphorylation
           serves to negatively regulate binding of viral genomic
           RNA. This may delay the initiation of nucleocapsid
           assembly until sufficient amounts of virus glycoproteins
           accumulate at the budding site and/or prevent
           non-specific binding to cellular RNA when levels of
           genomic RNA are low. It follows that at a late stage in
           replication, the capsid may undergo dephosphorylation
           before nucleocapsid assembly occurs.
          Length = 300

 Score = 31.0 bits (69), Expect = 1.2
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ 388
           P  +  + + Q P P+P+  PP Q Q  PP +Q
Sbjct: 80  PPEERQESRSQTPAPKPSRAPPQQPQ--PPRMQ 110


>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
          Length = 94

 Score = 29.3 bits (65), Expect = 1.2
 Identities = 14/44 (31%), Positives = 16/44 (36%)

Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
            PP P   PPP      P    +PV    P I+  Q    Q  Q
Sbjct: 16  LPPAPVPPPPPAPPAPVPEPTVKPVNAEAPKIREAQSKRGQKQQ 59


>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein.  The Polycystin
           Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
           huge protein of 4303aas. Its repeated leucine-rich (LRR)
           segment is found in many proteins. It contains 16
           polycystic kidney disease (PKD) domains, one
           LDL-receptor class A domain, one C-type lectin family
           domain, and 16-18 putative TMSs in positions between
           residues 2200 and 4100. Polycystin-L has been shown to
           be a cation (Na+, K+ and Ca2+) channel that is activated
           by Ca2+. Two members of the PCC family (polycystin 1 and
           2) are mutated in autosomal dominant polycystic kidney
           disease, and polycystin-L is deleted in mice with renal
           and retinal defects. Note: this model is restricted to
           the amino half for technical reasons.
          Length = 2740

 Score = 31.6 bits (71), Expect = 1.2
 Identities = 13/44 (29%), Positives = 17/44 (38%)

Query: 360 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
                   PPP  A   P  ++HI P +   P+Q  H  I   Q
Sbjct: 153 ESLCSGPPPPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQ 196


>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
          Length = 230

 Score = 30.5 bits (68), Expect = 1.3
 Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 5/43 (11%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQ---HIPPPIQHQP 391
            H  PQ      +   PPP P   P P  Q   H  PP+  QP
Sbjct: 40  HHPNPQED--PDKNPSPPPDPPLTPRPPAQPNGHNKPPVTKQP 80


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.8 bits (69), Expect = 1.7
 Identities = 23/84 (27%), Positives = 25/84 (29%), Gaps = 4/84 (4%)

Query: 349 IPLQHITPQVQLHQQQQQQPP-PQPAHIPPPQIQHIPP---PIQHQPVQNIHPSIQHIQQ 404
           +P  H   Q Q   QQ  QPP P   H         PP    +Q  PV   HPS    Q 
Sbjct: 237 LPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQP 296

Query: 405 NLQQNIQQNLQQNIQQNIQHLLQQ 428
                 Q        Q   H    
Sbjct: 297 FGLAQSQVPPLPLPSQAQPHSHTP 320



 Score = 29.7 bits (66), Expect = 4.4
 Identities = 15/50 (30%), Positives = 19/50 (38%)

Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQH 401
            H          Q QQPP +    P P + HI PP      Q  + S +H
Sbjct: 315 PHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKH 364


>gnl|CDD|181818 PRK09393, ftrA, transcriptional activator FtrA; Provisional.
          Length = 322

 Score = 30.3 bits (69), Expect = 1.9
 Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 4/39 (10%)

Query: 375 IPP----PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
           +PP     Q Q +P P+  +    + P I  ++ +L + 
Sbjct: 196 VPPHRDGGQAQFVPRPVASRESDRLGPLIDWMRAHLAEP 234


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 30.2 bits (69), Expect = 2.1
 Identities = 15/91 (16%), Positives = 38/91 (41%), Gaps = 14/91 (15%)

Query: 351 LQHITPQVQLHQQQQ-------------QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP 397
           L+  +P+  L QQQQ             ++             Q +      + ++    
Sbjct: 298 LKFQSPERLLAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQ 357

Query: 398 SIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
            ++ ++Q L++ +++ L++  +Q ++ L QQ
Sbjct: 358 RLEQLEQRLRRAMRRQLKRK-RQRLEALAQQ 387


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 30.3 bits (68), Expect = 2.2
 Identities = 8/24 (33%), Positives = 10/24 (41%)

Query: 361 HQQQQQQPPPQPAHIPPPQIQHIP 384
              QQQQ P Q   I    +Q + 
Sbjct: 430 PHNQQQQEPLQNPDIYTKDLQRLT 453


>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
          Length = 542

 Score = 30.2 bits (69), Expect = 2.4
 Identities = 6/18 (33%), Positives = 11/18 (61%)

Query: 403 QQNLQQNIQQNLQQNIQQ 420
           Q   +Q +++ LQQ I+ 
Sbjct: 235 QLKYKQQVEKRLQQRIES 252



 Score = 29.5 bits (67), Expect = 3.6
 Identities = 6/21 (28%), Positives = 13/21 (61%)

Query: 407 QQNIQQNLQQNIQQNIQHLLQ 427
           Q   +Q +++ +QQ I+ +L 
Sbjct: 235 QLKYKQQVEKRLQQRIESILA 255


>gnl|CDD|221266 pfam11851, DUF3371, Domain of unknown function (DUF3371).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 125 to 142 amino acids in length.
          Length = 131

 Score = 28.9 bits (65), Expect = 2.5
 Identities = 11/66 (16%), Positives = 20/66 (30%), Gaps = 9/66 (13%)

Query: 336 VHGVITSNSANMNIPLQHITPQVQLH---QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPV 392
            HG+       +   L   T  +      Q+ Q +    P  + P   Q   PP     +
Sbjct: 5   AHGL------PVVASLGLCTADLMARVIKQEPQLEDNQCPQDLYPHSSQPDLPPPTTLDL 58

Query: 393 QNIHPS 398
            +   +
Sbjct: 59  NDGTIT 64


>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score = 30.1 bits (68), Expect = 2.5
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 85  KLFSCNFCLKPFLRKDLLEAH 105
           KL+ C FCLK   RK+ L+ H
Sbjct: 197 KLYFCEFCLKFMKRKEQLQRH 217


>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631).  The
           members of this family are sequences derived from a
           group of hypothetical proteins expressed by certain
           bacterial species. The region concerned is approximately
           440 amino acid residues in length.
          Length = 729

 Score = 29.9 bits (68), Expect = 2.7
 Identities = 13/72 (18%), Positives = 26/72 (36%), Gaps = 5/72 (6%)

Query: 357 QVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQ 416
           Q  L Q +     P       P       P+  + +     S+Q  QQ     + +    
Sbjct: 266 QGLLAQARSSSAAPAAPAGQAPGSA---QPLSTRELLAALSSLQ--QQAPALALGEPADF 320

Query: 417 NIQQNIQHLLQQ 428
           ++++ +Q LL +
Sbjct: 321 DLRELLQQLLSR 332


>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3.  The
           reovirus inner capsid protein lambda-1 displays
           nucleoside triphosphate phosphohydrolase (NTPase),
           RNA-5'-triphosphatase (RTPase), and RNA helicase
           activity and may play a role in the transcription of the
           virus genome, the unwinding or reannealing of
           double-stranded RNA during RNA synthesis. The RTPase
           activity constitutes the first step in the capping of
           RNA, resulting in a 5'-diphosphorylated RNA plus-strand.
           lambda1 is an Orthoreovirus core protein, VP3 is the
           homologous core protein in Aquareoviruses.
          Length = 1166

 Score = 30.1 bits (68), Expect = 2.7
 Identities = 10/35 (28%), Positives = 11/35 (31%), Gaps = 1/35 (2%)

Query: 276 HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYT-HS 309
                     + C VC   F    QL  H  T HS
Sbjct: 64  KKATPINPSSYVCNVCMAEFSSMDQLAEHQRTTHS 98


>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
          Length = 84

 Score = 28.0 bits (62), Expect = 2.8
 Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 7/41 (17%)

Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
           TP+ Q  Q QQQ  P Q   I  PQ    P   QH   Q +
Sbjct: 51  TPEQQQQQPQQQIAPNQ---IEAPQ----PNFQQHYQGQQV 84


>gnl|CDD|192241 pfam09280, XPC-binding, XPC-binding domain.  Members of this family
           adopt a structure consisting of four alpha helices,
           arranged in an array. They bind specifically and
           directly to the xeroderma pigmentosum group C protein
           (XPC) to initiate nucleotide excision repair.
          Length = 59

 Score = 27.2 bits (61), Expect = 3.2
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 5/39 (12%)

Query: 396 HPSIQHIQQNLQQN---IQQNLQQNIQQNIQ--HLLQQN 429
           +P  Q ++Q +QQN   +   LQQ  Q N Q   L+QQN
Sbjct: 7   NPQFQQLRQMVQQNPELLPPLLQQLGQSNPQLLQLIQQN 45


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.0 bits (67), Expect = 3.2
 Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 5/61 (8%)

Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ---HQPVQNIHPSIQHIQQNLQQNIQ 411
            P  Q     QQ+P   P   PPPQ    P  +Q           P+ Q ++Q L   ++
Sbjct: 777 QPPPQAPPAPQQRPRGAPTPQPPPQA--GPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834

Query: 412 Q 412
           +
Sbjct: 835 R 835


>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
           Provisional.
          Length = 177

 Score = 29.0 bits (65), Expect = 3.5
 Identities = 7/28 (25%), Positives = 10/28 (35%)

Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPP 377
           P            + QQ  P  P++ PP
Sbjct: 141 PQGGNQFSGGAQSRPQQSAPAAPSNEPP 168


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 28.3 bits (63), Expect = 3.6
 Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 140 RGIKPYVCEICSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQH 192
           + + PYVC +C   F+   +LK H+           C  CGK F      + H
Sbjct: 69  KAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV---CPVCGKEFRNTDSTLDH 118



 Score = 28.3 bits (63), Expect = 4.1
 Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 2/52 (3%)

Query: 59  FTCEVCMRSFNDRKEFNEHIMKHEKEKLFSCNFCLKPFLRKDLLEAHVNSVH 110
           + C +C+  F+      +HI   E  K+  C  C K F   D    HV   H
Sbjct: 74  YVCPLCLMPFSSSVSLKQHIRYTEHSKV--CPVCGKEFRNTDSTLDHVCKKH 123



 Score = 27.5 bits (61), Expect = 6.3
 Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 6/62 (9%)

Query: 283 VKPFACTVCNKSFLQKTQLQNHL-YT-HSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVI 340
           V P+ C +C   F     L+ H+ YT HSKV    C +C K F   ++ + H+   H + 
Sbjct: 71  VSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV----CPVCGKEFRNTDSTLDHVCKKHNIC 126

Query: 341 TS 342
            S
Sbjct: 127 VS 128


>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
           C2H2 Zn-finger [General function prediction only].
          Length = 65

 Score = 27.3 bits (60), Expect = 3.6
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 169 DGNSLFRCDQCGKAFSRKGHLVQH 192
           DG    RC +CG  F R+   ++H
Sbjct: 13  DGEEFLRCPRCGMVFRRRKDYIRH 36


>gnl|CDD|188739 cd09353, LIM2_Zyxin, The second LIM domain of Zyxin.  The second
          LIM domain of Zyxin: Zyxin exhibits three copies of the
          LIM domain, an extensive proline-rich domain and a
          nuclear export signal.  Localized at sites of
          cellsubstratum adhesion in fibroblasts, Zyxin interacts
          with alpha-actinin, members of the cysteine-rich
          protein (CRP) family, proteins that display Src
          homology 3 (SH3) domains and Ena/VASP family members.
          Zyxin and its partners have been implicated in the
          spatial control of actin filament assembly as well as
          in pathways important for cell differentiation. In
          addition to its functions at focal adhesion plaques,
          recent work has shown that zyxin moves from the sites
          of cell contacts to the nucleus, where it directly
          participates in the regulation of gene expression. As
          in other LIM domains, this domain family is 50-60 amino
          acids in size and shares two characteristic zinc finger
          motifs. The two zinc fingers contain eight conserved
          residues, mostly cysteines and histidines, which
          coordinately bond to two zinc atoms. LIM domains
          function as adaptors o r scaffolds to support the
          assembly of multimeric protein.
          Length = 60

 Score = 26.8 bits (59), Expect = 3.7
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 35 CELCYKMFSDKLLYEKHKMVHRQVFTCEVC 64
          C +C +  +D++L    K  H Q FTC VC
Sbjct: 1  CAVCDQKITDRMLKATGKSYHPQCFTCVVC 30


>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
           ABF1 is a sequence-specific DNA binding protein involved
           in transcription activation, gene silencing and
           initiation of DNA replication. ABF1 is known to remodel
           chromatin, and it is proposed that it mediates its
           effects on transcription and gene expression by
           modifying local chromatin architecture. These functions
           require a conserved stretch of 20 amino acids in the
           C-terminal region of ABF1 (amino acids 639 to 662 S.
           cerevisiae). The N-terminal two thirds of the protein
           are necessary for DNA binding, and the N-terminus (amino
           acids 9 to 91 in S. cerevisiae) is thought to contain a
           novel zinc-finger motif which may stabilise the protein
           structure.
          Length = 508

 Score = 29.7 bits (66), Expect = 3.8
 Identities = 23/84 (27%), Positives = 30/84 (35%), Gaps = 13/84 (15%)

Query: 349 IPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
           +P  H       HQ Q    P  PAH             QHQ  +      QH  Q+ QQ
Sbjct: 280 LPNAHQISGADHHQHQHHHHPSIPAHH------------QHQLPEGHQHDHQHHHQHQQQ 327

Query: 409 NIQ-QNLQQNIQQNIQHLLQQNMH 431
           ++Q Q   Q    N  H+ Q  + 
Sbjct: 328 DLQHQEQHQQHVDNDHHVYQDRIG 351


>gnl|CDD|234764 PRK00442, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 27.5 bits (61), Expect = 4.0
 Identities = 9/28 (32%), Positives = 12/28 (42%)

Query: 369 PPQPAHIPPPQIQHIPPPIQHQPVQNIH 396
             +PA  P P  Q +PP     P+   H
Sbjct: 49  EDKPAEQPAPPAQPVPPVQNTAPLNQPH 76


>gnl|CDD|188740 cd09354, LIM2_LPP, The second LIM domain of lipoma preferred
          partner (LPP).  The second LIM domain of lipoma
          preferred partner (LPP): LPP is a member of the zyxin
          LIM protein family and contains three LIM zinc-binding
          domains at the C-terminal and proline-rich region at
          the N-terminal.  LPP initially identified as the most
          frequent translocation partner of HMGA2 (High Mobility
          Group A2) in a subgroup of benign tumors of adipose
          tissue (lipomas). It was also shown to be rearranged in
          a number of other soft tissues, as well as in a case of
          acute monoblastic leukemia. In addition to its
          involvement in tumors, LPP was inedited as a smooth
          muscle restricted LIM protein that plays an important
          role in SMC migration. LPP is localized at sites of
          cell adhesion, cell-cell contacts and transiently in
          the nucleus. In nucleus, it acts as a coactivator for
          the ETS domain transcription factor PEA3. In addition
          to PEA3, it interacts with alpha-actinin,vasodilator
          stimulated phosphoprotein (VASP),Palladin, and Scrib.
          The  LIM domains are the main focal adhesion targeting
          elements and that the proline- rich region, which
          harbors binding sites for alpha-actinin and
          vasodilator- stimulated phosphoprotein (VASP), has a
          weak targeting capacity. As in other LIM domains, this
          domain family is 50-60 amino acids in size and shares
          two characteristic zinc finger motifs. The two zinc
          fingers contain eight conserved residues, mostly
          cysteines and histidines, which coordinately bond to
          two zinc atoms. LIM domains function as adaptors or
          scaffolds to support the assembly of multimeric
          protein.
          Length = 60

 Score = 26.7 bits (59), Expect = 4.1
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 35 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 69
          C +C K   D++L    K  H Q FTC VC +S +
Sbjct: 1  CSVCSKPILDRILRATGKPYHPQCFTCVVCGKSLD 35


>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding.  The Creb binding domain
           assumes a structure comprising of three alpha-helices
           which pack in a bundle, exposing a hydrophobic groove
           between alpha-1 and alpha-3 within which complimentary
           domains found in the protein 'activator for thyroid
           hormone and retinoid receptors' (ACTR) can dock. Docking
           of these domains is required for the recruitment of RNA
           polymerase II and the basal transcription machinery.
          Length = 104

 Score = 27.7 bits (61), Expect = 4.7
 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 2/79 (2%)

Query: 355 TPQVQLHQQQQQQPP-PQPAHIPPPQI-QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQ 412
            PQ  L QQQQ QP  P+P      Q     P P   QP  +      +  Q+L + ++ 
Sbjct: 6   WPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLVQPGISRGIVSPNALQDLLRTLKS 65

Query: 413 NLQQNIQQNIQHLLQQNMH 431
                 QQ + ++L+ N  
Sbjct: 66  PSSPQQQQQVLNILKSNPQ 84


>gnl|CDD|219971 pfam08690, GET2, GET complex subunit GET2.  This family corresponds
           to the GET complex subunit GET2. The GET complex is
           involved in the retrieval of ER resident proteins from
           the Golgi.
          Length = 298

 Score = 29.0 bits (65), Expect = 5.1
 Identities = 16/60 (26%), Positives = 22/60 (36%), Gaps = 7/60 (11%)

Query: 369 PPQPAHIP-PPQIQHI------PPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
               AH    P+IQ I      P      P +NI P ++  QQ  +   Q N   N   +
Sbjct: 56  STTSAHDQSTPEIQDILEAIDPPKDESESPAENIDPEVEMFQQLAKMQQQGNGSDNPPAD 115


>gnl|CDD|201555 pfam01023, S_100, S-100/ICaBP type calcium binding domain.  The
           S-100 domain is a subfamily of the EF-hand calcium
           binding proteins.
          Length = 44

 Score = 26.0 bits (58), Expect = 5.5
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 102 LEAHVNSVHKDVKKYNCEECDKYYLSQGELDSHVQT 137
           LE  + ++     +Y+ +E DK  LS+ EL   ++ 
Sbjct: 1   LEKAIETIINVFHQYSGKEGDKDTLSKKELKELLEK 36


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 25.3 bits (55), Expect = 6.3
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 202 FKCSYCPKAFSHKGHLNEHLYTH 224
           FKC  C K+FS K  L  HL  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 25.3 bits (55), Expect = 7.5
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 230 FQCKYCPKTFTLRKHLKAHTNKH 252
           F+C  C K+F+ +  LK H  KH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|218161 pfam04589, RFX1_trans_act, RFX1 transcription activation region.
           The RFX family is a family of winged-helix DNA binding
           proteins. RFX1 is a regulatory factor essential for
           expression of MHC class II genes. This region is to
           found N terminal to the RFX DNA binding region
           (pfam02257) in some mammalian RFX proteins, and is
           thought to activate transcription when associated with
           DNA. Deletion analysis has identified the region 233-351
           in human RFX1 as being required for maximal activation.
          Length = 150

 Score = 28.0 bits (62), Expect = 6.4
 Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 2/66 (3%)

Query: 349 IPLQ-HITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
           + LQ  +  Q  +   QQ+      +      +Q +P     Q  Q +   +QH+     
Sbjct: 14  VALQTSVPAQAPVPASQQRSVVQATSQTKGGPVQQLPVHRVQQVPQQV-QQVQHVYPAQV 72

Query: 408 QNIQQN 413
           Q ++  
Sbjct: 73  QYVEGG 78


>gnl|CDD|237481 PRK13723, PRK13723, conjugal transfer pilus assembly protein TraH;
           Provisional.
          Length = 451

 Score = 28.6 bits (64), Expect = 6.6
 Identities = 5/42 (11%), Positives = 19/42 (45%), Gaps = 5/42 (11%)

Query: 392 VQNIHPSIQHIQQNLQQ-----NIQQNLQQNIQQNIQHLLQQ 428
           + ++  ++   Q+ +        +QQ+    + + + ++ QQ
Sbjct: 393 MDHLRENLNQAQRQIAAFQSQVQVQQDALLVVDRQMSYMRQQ 434


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.7 bits (65), Expect = 6.8
 Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 403 QQNLQQNIQQNLQQ--NIQQNIQHLLQQNM 430
           Q NL  N+Q N QQ  ++  ++Q  L Q +
Sbjct: 426 QLNLAINLQLNQQQLLSVSDSLQSTLTQQI 455


>gnl|CDD|217789 pfam03915, AIP3, Actin interacting protein 3. 
          Length = 424

 Score = 28.8 bits (65), Expect = 6.8
 Identities = 14/75 (18%), Positives = 27/75 (36%), Gaps = 2/75 (2%)

Query: 354 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ--QNIQ 411
           I  ++   QQ   +   +PA  PPP                    +Q +++ L   +   
Sbjct: 108 ILQRLSELQQNTAKEVARPAAAPPPSSLPNSSSSPDSINPAGLKELQSLRRELAVLRQTH 167

Query: 412 QNLQQNIQQNIQHLL 426
            + +  IQ +I  +L
Sbjct: 168 SDFKSEIQASISSIL 182


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 28.7 bits (64), Expect = 7.1
 Identities = 9/23 (39%), Positives = 10/23 (43%)

Query: 349 IPLQHITPQVQLHQQQQQQPPPQ 371
           I L  + PQ    Q Q QQ   Q
Sbjct: 300 IQLNFVIPQQAQQQGQGQQQQAQ 322


>gnl|CDD|182696 PRK10748, PRK10748, flavin mononucleotide phosphatase; Provisional.
          Length = 238

 Score = 28.2 bits (63), Expect = 7.6
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 392 VQNIHPSIQHIQQNLQQNIQQNLQQ 416
           VQN HP+++  Q    Q ++Q L++
Sbjct: 39  VQNYHPALRSFQNEDLQRLRQALRE 63


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 27.9 bits (62), Expect = 8.9
 Identities = 12/51 (23%), Positives = 18/51 (35%), Gaps = 2/51 (3%)

Query: 340 ITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQ 390
               + N N P  +  P    +  Q Q    Q A+    Q Q  PP  ++ 
Sbjct: 121 SFHENFNNNYPGNYNNPSQDPYMNQAQS-YNQNAYAKENQ-QAQPPKYENS 169


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.134    0.433 

Gapped
Lambda     K      H
   0.267   0.0704    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,374,955
Number of extensions: 1848299
Number of successful extensions: 5416
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4650
Number of HSP's successfully gapped: 346
Length of query: 431
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 331
Effective length of database: 6,502,202
Effective search space: 2152228862
Effective search space used: 2152228862
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 60 (26.9 bits)