RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11693
         (579 letters)



>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 62.9 bits (153), Expect = 3e-10
 Identities = 31/123 (25%), Positives = 41/123 (33%), Gaps = 13/123 (10%)

Query: 463 NIPLQHITPQVQLHQQQQQQPPPQPAH--IPPPQIQHIPPPIQHQPVQ---------NIH 511
             P     PQ     QQ   PPPQP      PPQ Q  PPP                 + 
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297

Query: 512 PSIQHIQQNLQQNIQQNLQQNIQQNIQ-HLLQQNMHVANSSHHMSQQRMHQLYAL-HHDG 569
           P  Q     L Q  Q   +    +     L QQ     +       +R H++ +L  ++G
Sbjct: 298 PPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEAKRAKRRHKIVSLSKYNG 357

Query: 570 YMT 572
            MT
Sbjct: 358 LMT 360



 Score = 54.8 bits (132), Expect = 9e-08
 Identities = 23/120 (19%), Positives = 29/120 (24%), Gaps = 1/120 (0%)

Query: 461 NMNIPLQHITPQVQLHQQ-QQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQ 519
               P Q   PQ    +Q      PPQ         Q +P P Q      + P +     
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPP 241

Query: 520 NLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGYMTATASIMQ 579
            LQQ     L Q +        QQ             Q     +     G         Q
Sbjct: 242 PLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQ 301



 Score = 49.8 bits (119), Expect = 3e-06
 Identities = 21/110 (19%), Positives = 26/110 (23%), Gaps = 8/110 (7%)

Query: 466 LQHITPQVQLHQQQQQQPPPQ----PAHIPPPQI---QHIPPPIQHQ-PVQNIHPSIQHI 517
           L+ +  Q+Q  QQ  Q P P     P  +PP Q    Q  PP      P        Q  
Sbjct: 156 LEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQ 215

Query: 518 QQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHH 567
            Q       Q   Q             +          Q           
Sbjct: 216 PQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQ 265



 Score = 49.0 bits (117), Expect = 6e-06
 Identities = 27/112 (24%), Positives = 30/112 (26%), Gaps = 17/112 (15%)

Query: 465 PLQHITPQVQLHQQQQQQ-----PPPQPAHIPPPQIQHI--------PPPIQHQPVQNIH 511
             + + PQ  L    Q       PP  P   PP Q            PPP Q    Q   
Sbjct: 210 HPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQP 269

Query: 512 PSIQHIQ--QNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
           P  Q     QN         Q   Q       QQ   +        QQR  Q
Sbjct: 270 PQPQAQPPPQNQPTPHPGLPQG--QNAPLPPPQQPQLLPLVQQPQGQQRGPQ 319



 Score = 37.1 bits (86), Expect = 0.029
 Identities = 20/105 (19%), Positives = 26/105 (24%), Gaps = 17/105 (16%)

Query: 471 PQVQLHQQQQQQPPPQPAHIPP-------------PQIQHIP-PPIQHQPVQNIHPSIQH 516
           P+ Q  Q  + QP PQ                    Q   +P PP Q  P          
Sbjct: 132 PEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQ-AA 190

Query: 517 IQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
             Q           Q  Q + + +  Q    A S      Q    
Sbjct: 191 FPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQ--APAQPPLP 233



 Score = 29.7 bits (67), Expect = 5.2
 Identities = 17/100 (17%), Positives = 24/100 (24%), Gaps = 20/100 (20%)

Query: 477 QQQQQQPPPQ----PAHIP-----------PPQIQHIPPPIQHQPVQNIHPSIQHIQQNL 521
                Q   +    P +IP            P+ +  PP           P  Q   Q +
Sbjct: 99  TFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPPQAPES-----QPQPQTPAQKM 153

Query: 522 QQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
               +   Q   +Q    L Q    V        Q    Q
Sbjct: 154 LSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQ 193



 Score = 29.0 bits (65), Expect = 7.9
 Identities = 8/67 (11%), Positives = 16/67 (23%), Gaps = 1/67 (1%)

Query: 457 SNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQH 516
           +    +        P  Q  Q       PQ      PQ +     +  Q  + +      
Sbjct: 283 TPHPGLPQGQNAPLPPPQQPQLLPLVQQPQG-QQRGPQFREQLVQLSQQQREALSQEEAK 341

Query: 517 IQQNLQQ 523
             +   +
Sbjct: 342 RAKRRHK 348


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 51.2 bits (122), Expect = 1e-06
 Identities = 26/99 (26%), Positives = 32/99 (32%), Gaps = 6/99 (6%)

Query: 467 QHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
           Q   P     QQ Q   Q          P   Q   PP   QP          +QQ  Q 
Sbjct: 159 QSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQ--QPPQMGQPGMPGGGGQGQMQQQGQP 216

Query: 524 NIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
             QQ     +QQ +Q+  QQ M         +Q +M Q 
Sbjct: 217 GGQQQQNPQMQQQLQNQQQQQMD-QQQGPADAQAQMGQQ 254



 Score = 47.3 bits (112), Expect = 2e-05
 Identities = 18/99 (18%), Positives = 21/99 (21%), Gaps = 2/99 (2%)

Query: 466 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
            Q             QQ PPQ      P         Q             +QQ LQ   
Sbjct: 175 GQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQ 234

Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYA 564
           QQ + Q                      M  Q+M     
Sbjct: 235 QQQMDQQQGPADAQAQMGQQQQGQGG--MQPQQMQGGQM 271



 Score = 41.1 bits (96), Expect = 0.002
 Identities = 22/93 (23%), Positives = 29/93 (31%), Gaps = 6/93 (6%)

Query: 471 PQVQLHQQQQ--QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 528
            Q Q+ QQQQ      PQ       Q+    P  Q  P Q    S   +  N  Q +   
Sbjct: 247 AQAQMGQQQQGQGGMQPQQMQGGQMQV----PMQQQPPQQQPQQSQLGMLPNQMQQMPGG 302

Query: 529 LQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
            Q    Q +    Q+   V      + Q  M  
Sbjct: 303 GQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSA 335



 Score = 40.4 bits (94), Expect = 0.003
 Identities = 22/92 (23%), Positives = 30/92 (32%), Gaps = 1/92 (1%)

Query: 467 QHITPQVQLHQQQ-QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
           Q    QV + QQ  QQQP      + P Q+Q +P   Q  P Q + P  Q      Q   
Sbjct: 267 QGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQ 326

Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQ 557
                       Q    +  ++        QQ
Sbjct: 327 AVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQ 358



 Score = 39.6 bits (92), Expect = 0.005
 Identities = 20/94 (21%), Positives = 32/94 (34%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
            L  +  Q+Q      Q  P QP   PP +   +P   Q      +    Q ++Q   +N
Sbjct: 288 QLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRN 347

Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQR 558
           ++   Q   QQ  Q       H    +  + Q  
Sbjct: 348 MRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGG 381



 Score = 37.7 bits (87), Expect = 0.019
 Identities = 20/97 (20%), Positives = 28/97 (28%), Gaps = 7/97 (7%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           P Q   P +     Q Q            Q   +   +Q+Q  Q +       QQ    +
Sbjct: 193 PPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMD------QQQGPAD 246

Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
            Q  + Q  Q     +  Q M        M QQ   Q
Sbjct: 247 AQAQMGQQQQGQ-GGMQPQQMQGGQMQVPMQQQPPQQ 282



 Score = 33.4 bits (76), Expect = 0.35
 Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 4/69 (5%)

Query: 462 MNIPLQHITPQVQLHQQQQQQPPP--QPAHIPPPQI--QHIPPPIQHQPVQNIHPSIQHI 517
              P+  +     + Q  Q   P    P   PP  +    IP P    P  +   S    
Sbjct: 413 SPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPA 472

Query: 518 QQNLQQNIQ 526
            Q   Q   
Sbjct: 473 SQRTIQQDM 481



 Score = 33.4 bits (76), Expect = 0.39
 Identities = 23/97 (23%), Positives = 30/97 (30%), Gaps = 7/97 (7%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAH-IPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
             Q+   Q QL  QQQQQ   Q        Q+            Q +       Q  +  
Sbjct: 220 QQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQ----GGQMQVPM 275

Query: 524 NIQQNLQQNIQQN-IQHLLQQNMHVANSSHHMSQQRM 559
             QQ  QQ  QQ+ +  L  Q   +         Q M
Sbjct: 276 Q-QQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPM 311



 Score = 29.2 bits (65), Expect = 7.0
 Identities = 19/109 (17%), Positives = 24/109 (22%), Gaps = 24/109 (22%)

Query: 472 QVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP-SIQHIQQNLQQNI----- 525
           +V   Q   Q           PQ Q       +Q          Q    N  Q       
Sbjct: 137 RVGTMQPGGQAGGMMQQSSGQPQSQQ-----PNQMGPQQGQAQGQAGGMNQGQQGPVGQQ 191

Query: 526 -------------QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
                            Q   Q       QQN  +     +  QQ+M Q
Sbjct: 192 QPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQ 240



 Score = 29.2 bits (65), Expect = 8.1
 Identities = 21/103 (20%), Positives = 26/103 (25%), Gaps = 1/103 (0%)

Query: 460 ANMNIPLQHITPQVQLHQQ-QQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
             M         Q Q     QQQQ P     +   Q Q +         Q      Q  Q
Sbjct: 199 PGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQ 258

Query: 519 QNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
             +Q    Q  Q  +    Q   QQ             Q+M  
Sbjct: 259 GGMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPG 301


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 44.3 bits (104), Expect = 2e-04
 Identities = 25/82 (30%), Positives = 29/82 (35%), Gaps = 2/82 (2%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           P Q + PQ Q   QQ QQP         PQ    P P   QP Q + P  Q+ Q      
Sbjct: 768 PQQPVAPQPQ--YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 825

Query: 525 IQQNLQQNIQQNIQHLLQQNMH 546
            Q   QQ  Q          +H
Sbjct: 826 PQPQYQQPQQPVAPQPQDTLLH 847



 Score = 40.1 bits (93), Expect = 0.003
 Identities = 25/98 (25%), Positives = 31/98 (31%), Gaps = 3/98 (3%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           P Q + PQ Q  Q QQ   P      P    Q + P  Q+Q  Q         QQ  Q  
Sbjct: 755 PQQPVAPQQQYQQPQQPVAPQPQYQQPQ---QPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811

Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
             Q   Q  QQ +    Q        +       +H L
Sbjct: 812 APQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPL 849



 Score = 38.9 bits (90), Expect = 0.008
 Identities = 27/92 (29%), Positives = 36/92 (39%), Gaps = 6/92 (6%)

Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQH-QPVQNIHPSIQHIQQN---LQQNI 525
           TP V+  QQ QQ   PQ  +  P Q   + P  Q+ QP Q + P  Q+ Q       Q  
Sbjct: 746 TPIVEPVQQPQQPVAPQQQYQQPQQ--PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ 803

Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQ 557
            Q  QQ +    Q+   Q        +   QQ
Sbjct: 804 YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ 835



 Score = 33.5 bits (76), Expect = 0.37
 Identities = 16/95 (16%), Positives = 25/95 (26%), Gaps = 2/95 (2%)

Query: 469 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 528
           I P  + + QQ Q   P   +  P Q    P    + P         +     +Q  QQ 
Sbjct: 373 IAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQ- 431

Query: 529 LQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLY 563
                    Q +              + Q  +Q  
Sbjct: 432 -PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465



 Score = 33.1 bits (75), Expect = 0.54
 Identities = 21/112 (18%), Positives = 35/112 (31%), Gaps = 5/112 (4%)

Query: 407 NKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIPL 466
           N+   Q  Q Q   Y  +  +P      A +  Q      +       +  N+       
Sbjct: 394 NEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQ 453

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
               PQ      Q +Q   QP       +   P P++ QPV    P ++  +
Sbjct: 454 STFAPQST---YQTEQTYQQP--AAQEPLYQQPQPVEQQPVVEPEPVVEETK 500


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 42.8 bits (101), Expect = 3e-04
 Identities = 23/127 (18%), Positives = 30/127 (23%), Gaps = 19/127 (14%)

Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPP------------QPAHIPPPQ---- 494
                  +        Q   P  Q    Q  Q  P             PA  PP      
Sbjct: 45  AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP 104

Query: 495 IQHIPPPIQHQPVQN---IHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSS 551
                P IQ +P Q      P +           Q   Q+ +QQ  Q           S 
Sbjct: 105 AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQ 164

Query: 552 HHMSQQR 558
              +Q+ 
Sbjct: 165 QQSAQKN 171



 Score = 36.3 bits (84), Expect = 0.040
 Identities = 17/113 (15%), Positives = 22/113 (19%), Gaps = 19/113 (16%)

Query: 479 QQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNI----- 533
              Q PP  A +P P  Q  P               Q                       
Sbjct: 51  PSPQAPPPVAQLPQPLPQPPPTQALQALPAG---DQQQHNTPTGSPAANPPATFALPAGP 107

Query: 534 -QQNIQ----HLLQQNMHVANS-----SHHMSQQRMHQLYALHHDGYMTATAS 576
               IQ     L    + V  +     S      +   L  L    Y    + 
Sbjct: 108 AGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQR-YGAPASG 159



 Score = 33.9 bits (78), Expect = 0.20
 Identities = 18/69 (26%), Positives = 22/69 (31%), Gaps = 4/69 (5%)

Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 510
           V  ++T N AN  +      P  Q   QQ QQ    PA    P  Q          +Q  
Sbjct: 124 VPVMVTQNPANSPLD----QPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQ 179

Query: 511 HPSIQHIQQ 519
                  QQ
Sbjct: 180 PNGETPPQQ 188



 Score = 32.0 bits (73), Expect = 0.75
 Identities = 17/107 (15%), Positives = 28/107 (26%), Gaps = 7/107 (6%)

Query: 457 SNSANMNIPL-QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 515
                + +P+     P      Q  QQ   Q              P Q Q  Q       
Sbjct: 117 GQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDE--- 173

Query: 516 HIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
             Q   Q N +   QQ          +  + +  +   + +QR+   
Sbjct: 174 -SQLQQQPNGETPPQQ-TDGAGDDESEALVRLREADGTL-EQRIKGA 217


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 42.8 bits (100), Expect = 4e-04
 Identities = 38/278 (13%), Positives = 71/278 (25%), Gaps = 30/278 (10%)

Query: 286 NSLFRCDQCGKAFSRKGHLVQHSFVHGGEKPFKCSYCPKAFSHKGHLNEHLYTHSENKCF 345
                C  C  +FSR  HL +H   H GEKP +CSY     S    L             
Sbjct: 31  PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLE------------ 78

Query: 346 QCKYCPKTFTLRKHLKAHTNKHEGVLPHTCLECSKCFSTKSELNRHAQIHGGVKPFACTV 405
                     L +HL+ H N    +   +        S  S  +  +           + 
Sbjct: 79  ----------LSRHLRTHHNNPSDLNSKS---LPLSNSKASSSSLSSSSSNSNDNNLLSS 125

Query: 406 CNKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIP 465
            +     +      L + S ++         S V           +     S   + N+ 
Sbjct: 126 HSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLS 185

Query: 466 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
           L   +          +  P   ++  P          Q+    +    +    Q   +++
Sbjct: 186 LLISSNVSTSIPSSSENSPLSSSYSIPSSSSD-----QNLENSSSSLPLTTNSQLSPKSL 240

Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLY 563
                 ++  +           ++     SQ       
Sbjct: 241 LSQSPSSLSSSDSSSSASESPRSSLPTASSQSSSPNES 278



 Score = 37.0 bits (85), Expect = 0.023
 Identities = 48/202 (23%), Positives = 67/202 (33%), Gaps = 19/202 (9%)

Query: 266 SKAFTQQYNLKHHLLL-HGDGNSLFRCDQCGKAFSRKGHLVQH--SFVHGGE--KPFKC- 319
             A +Q  +            +   +  QC  +FSR   L +H  S  H GE  KPF C 
Sbjct: 266 PTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCP 325

Query: 320 -SYCPKAFSHKGHLNEHLYTHSENKCFQCKYCPKTFTLRKHLKA-------HTNKHEGVL 371
            S C K FS    L  H+  H+     + K    +      L              +   
Sbjct: 326 YSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDK 385

Query: 372 PHTCLECSKCFSTKSELNR--HAQIHGGVKP--FACTVCNKSFLQKTQLQNHLYTHSKVK 427
               L  S   + K + N   H   H   +P       C+KSF +   L  H   H+   
Sbjct: 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA 445

Query: 428 PYNCGICAKSFVQRNTYIKHLE 449
           P  C I  KSF +      H +
Sbjct: 446 PLLCSIL-KSFRRDLDLSNHGK 466


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 40.5 bits (95), Expect = 5e-04
 Identities = 19/56 (33%), Positives = 22/56 (39%), Gaps = 2/56 (3%)

Query: 465 PLQHITPQVQLHQQQ--QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
           P QH  P +    QQ  Q QP   P    P Q Q    PI   P Q   P +  +Q
Sbjct: 84  PTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139



 Score = 37.5 bits (87), Expect = 0.007
 Identities = 22/76 (28%), Positives = 23/76 (30%), Gaps = 11/76 (14%)

Query: 475 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSI-----------QHIQQNLQQ 523
           LH Q        P         HIP     QPV    P +           QH Q NL Q
Sbjct: 35  LHHQIIPVSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQ 94

Query: 524 NIQQNLQQNIQQNIQH 539
             QQ  Q    Q  Q 
Sbjct: 95  PAQQPFQPQPLQPPQP 110



 Score = 33.2 bits (76), Expect = 0.16
 Identities = 17/63 (26%), Positives = 22/63 (34%), Gaps = 8/63 (12%)

Query: 452 HGVITSNSANMNIPLQHITP------QVQLHQQQQQQPPPQPAHIPPPQIQHIPP--PIQ 503
           H +  +     N+P     P      Q    QQ  Q  PP     P P    +PP  P+Q
Sbjct: 80  HSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139

Query: 504 HQP 506
             P
Sbjct: 140 PLP 142



 Score = 33.2 bits (76), Expect = 0.20
 Identities = 19/88 (21%), Positives = 26/88 (29%), Gaps = 11/88 (12%)

Query: 454 VITSNSANMNIPLQHITPQVQLHQQQQQQPP--PQPAHIPPPQIQH----IPPPIQH--- 504
           V   +     +   H  P +   Q    Q P  P P        QH    +P P Q    
Sbjct: 42  VSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQ 101

Query: 505 -QPVQNIHPSIQHIQQNLQQNIQQNLQQ 531
            QP+Q      Q +Q     +    L  
Sbjct: 102 PQPLQPP-QPQQPMQPQPPVHPIPPLPP 128


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 39.9 bits (93), Expect = 0.003
 Identities = 27/82 (32%), Positives = 28/82 (34%), Gaps = 3/82 (3%)

Query: 461 NMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHI 517
               P  H  P     QQ Q   QQP PQ     PP  Q  PPP Q    Q   P  Q  
Sbjct: 89  QSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVP 148

Query: 518 QQNLQQNIQQNLQQNIQQNIQH 539
           QQ   Q+  Q  Q       Q 
Sbjct: 149 QQQQYQSPPQQPQYQQNPPPQA 170



 Score = 37.6 bits (87), Expect = 0.013
 Identities = 24/86 (27%), Positives = 24/86 (27%), Gaps = 3/86 (3%)

Query: 461 NMNIPLQHITPQVQLHQQQQQ--QPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
               P      Q Q     Q   Q PPQ     P Q      P    P Q   P  Q  Q
Sbjct: 78  APQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQ 137

Query: 519 -QNLQQNIQQNLQQNIQQNIQHLLQQ 543
            Q  Q   Q   QQ  Q   Q    Q
Sbjct: 138 AQQPQPPPQVPQQQQYQSPPQQPQYQ 163



 Score = 34.5 bits (79), Expect = 0.14
 Identities = 14/71 (19%), Positives = 19/71 (26%), Gaps = 3/71 (4%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
                         Q P P        PP  Q+ P  +  Q VQ++       Q+     
Sbjct: 66  LQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQY-PSQLPPQQVQSVPQQPTPQQEPYYPP 124

Query: 525 IQQNLQQNIQQ 535
             Q      QQ
Sbjct: 125 PSQPQPPPAQQ 135



 Score = 34.1 bits (78), Expect = 0.18
 Identities = 18/47 (38%), Positives = 19/47 (40%)

Query: 478 QQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           QQ Q   PQP    P Q Q+  PP Q Q  QN  P  Q   Q     
Sbjct: 134 QQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY 180



 Score = 33.0 bits (75), Expect = 0.46
 Identities = 23/92 (25%), Positives = 30/92 (32%), Gaps = 7/92 (7%)

Query: 471 PQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 529
            QV  H+          A +PP P  Q   P  Q Q      PS Q+  Q   Q +Q   
Sbjct: 54  EQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQS--QAPPSHQYPSQLPPQQVQSVP 111

Query: 530 QQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
           QQ   Q   +        +      +QQ   Q
Sbjct: 112 QQPTPQQEPYYPPP----SQPQPPPAQQPQAQ 139



 Score = 32.6 bits (74), Expect = 0.51
 Identities = 22/77 (28%), Positives = 27/77 (35%), Gaps = 4/77 (5%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 522
            +Q +  Q    Q+    PP QP   P   PQ Q   PP Q    Q      Q  Q   Q
Sbjct: 106 QVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQY--Q 163

Query: 523 QNIQQNLQQNIQQNIQH 539
           QN     Q   Q +  +
Sbjct: 164 QNPPPQAQSAPQVSGLY 180



 Score = 31.8 bits (72), Expect = 1.0
 Identities = 18/57 (31%), Positives = 21/57 (36%), Gaps = 5/57 (8%)

Query: 462 MNIPLQHITPQVQLHQQQQQQ---PP--PQPAHIPPPQIQHIPPPIQHQPVQNIHPS 513
                Q   PQ      QQQQ   PP  PQ    PPPQ Q  P      P ++ +  
Sbjct: 132 PAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQP 188



 Score = 28.8 bits (64), Expect = 7.6
 Identities = 13/49 (26%), Positives = 16/49 (32%), Gaps = 2/49 (4%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 513
           P Q          Q QQ PPPQ    P     +  P       Q+  P+
Sbjct: 148 PQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY--PEESPYQPQSYPPN 194


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 38.2 bits (88), Expect = 0.011
 Identities = 22/77 (28%), Positives = 33/77 (42%), Gaps = 7/77 (9%)

Query: 209 NSVHKDVKKYNCE--ECDKYYLSQGGVGCVFLY-YLSQGELDSHVQTSHRGI----KPYV 261
               KD K Y C    C+K Y +Q G+    L+ + +Q   ++             KPY 
Sbjct: 341 MLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYR 400

Query: 262 CEICSKAFTQQYNLKHH 278
           CE+C K +     LK+H
Sbjct: 401 CEVCDKRYKNLNGLKYH 417



 Score = 38.2 bits (88), Expect = 0.012
 Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 13/87 (14%)

Query: 256 GIKPYVCEI--CSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQHSFVHGG 313
             KPY C +  C+K +  Q  LK+H +LHG            +         + +     
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYH-MLHG---------HQNQKLHENPSPEKMNIFSAK 395

Query: 314 EKPFKCSYCPKAFSHKGHLNEHLYTHS 340
           +KP++C  C K + +   L  H   HS
Sbjct: 396 DKPYRCEVCDKRYKNLNGLKYHR-KHS 421


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 38.4 bits (89), Expect = 0.012
 Identities = 17/45 (37%), Positives = 18/45 (40%), Gaps = 3/45 (6%)

Query: 471  PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 515
            P V    +    PP QP   P PQ    PPP Q QP     P  Q
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQ---APPPPQPQPQPPPPPQPQ 2930



 Score = 35.3 bits (81), Expect = 0.12
 Identities = 14/32 (43%), Positives = 14/32 (43%)

Query: 470  TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 501
             PQ     Q Q QPPP P   PPP     P P
Sbjct: 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941



 Score = 34.9 bits (80), Expect = 0.14
 Identities = 15/44 (34%), Positives = 15/44 (34%), Gaps = 5/44 (11%)

Query: 470  TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 513
             PQ Q     Q QPPP P   P P     P      P     PS
Sbjct: 2918 QPQPQPPPPPQPQPPPPPPPRPQP-----PLAPTTDPAGAGEPS 2956



 Score = 34.9 bits (80), Expect = 0.16
 Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 470  TPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQP 506
            + +       Q + PPQP   PPPQ Q   PPP Q QP
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931



 Score = 34.5 bits (79), Expect = 0.22
 Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 1/37 (2%)

Query: 471  PQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQP 506
            P  Q  +  Q Q PP P   P P     P PP    P
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 32.6 bits (74), Expect = 0.77
 Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 1/51 (1%)

Query: 465  PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 515
             L    P+     Q    P PQP   PPPQ Q  PPP   +P   + P+  
Sbjct: 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTD 2948



 Score = 31.8 bits (72), Expect = 1.3
 Identities = 8/37 (21%), Positives = 9/37 (24%)

Query: 465  PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 501
            P     P      Q    PPP+P     P        
Sbjct: 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 37.3 bits (87), Expect = 0.015
 Identities = 21/92 (22%), Positives = 34/92 (36%), Gaps = 7/92 (7%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           P     P  Q   +Q  Q P  P   P PQ  H  P    Q  Q   P      + + + 
Sbjct: 126 PQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEP 185

Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQ 556
                +   ++ +  ++   M+VA  +HH S+
Sbjct: 186 APVMDKPKRKEAV--IV---MNVA--AHHGSE 210



 Score = 33.9 bits (78), Expect = 0.22
 Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 8/76 (10%)

Query: 468 HITPQVQLHQQQQQQPP-----PQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 522
                 +   Q Q QPP     P+     PP+   +PP    +P Q     +Q  Q   Q
Sbjct: 79  QEHEAARPSPQHQYQPPYASAQPRQPVQQPPEA-QVPPQHAPRPAQPAPQPVQ--QPAYQ 135

Query: 523 QNIQQNLQQNIQQNIQ 538
              +Q LQQ +   + 
Sbjct: 136 PQPEQPLQQPVSPQVA 151



 Score = 28.5 bits (64), Expect = 9.7
 Identities = 19/79 (24%), Positives = 26/79 (32%), Gaps = 8/79 (10%)

Query: 473 VQLHQQQQQQPPPQPAHIPPPQIQH-IPPPIQH-QPVQNIHPSIQ------HIQQNLQQN 524
           V++H+        Q      P  QH   PP    QP Q +    +      H  +  Q  
Sbjct: 66  VRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPA 125

Query: 525 IQQNLQQNIQQNIQHLLQQ 543
            Q   Q   Q   +  LQQ
Sbjct: 126 PQPVQQPAYQPQPEQPLQQ 144


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 37.8 bits (87), Expect = 0.018
 Identities = 9/52 (17%), Positives = 35/52 (67%), Gaps = 3/52 (5%)

Query: 494 QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
           Q +++P  ++     NI    +  ++N++++ ++N+++N+++N++  +++N+
Sbjct: 931 QNENVPEHLKEHAEANIE---EDAEENVEEDAEENVEENVEENVEENVEENV 979



 Score = 35.1 bits (80), Expect = 0.12
 Identities = 16/112 (14%), Positives = 53/112 (47%), Gaps = 26/112 (23%)

Query: 451  VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 510
            + G I      +N  L++    V  H ++  +     A+I     +     ++    +N+
Sbjct: 913  LSGNIAHEINLINKELKNQNENVPEHLKEHAE-----ANIE----EDAEENVEEDAEENV 963

Query: 511  HPSIQ-----------------HIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
              +++                 ++++N+++N+++N+++NI++N++  +++N+
Sbjct: 964  EENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENI 1015



 Score = 31.7 bits (71), Expect = 1.4
 Identities = 9/43 (20%), Positives = 30/43 (69%)

Query: 496  QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQ 538
            ++I   I+    +N+  +++ I++N+++N+++N ++N ++N +
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAE 1116



 Score = 30.5 bits (68), Expect = 3.2
 Identities = 9/38 (23%), Positives = 27/38 (71%)

Query: 508  QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
            +NI  +I+   +   + I++N+++NI++N++  +++N+
Sbjct: 1055 ENIEENIEEYDEENVEEIEENIEENIEENVEENVEENV 1092



 Score = 30.5 bits (68), Expect = 3.4
 Identities = 7/31 (22%), Positives = 28/31 (90%)

Query: 515  QHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
            +++++N+++N+++N+++NI++N++   ++N+
Sbjct: 997  ENVEENIEENVEENVEENIEENVEEYDEENV 1027



 Score = 30.1 bits (67), Expect = 4.3
 Identities = 7/28 (25%), Positives = 26/28 (92%)

Query: 513  SIQHIQQNLQQNIQQNLQQNIQQNIQHL 540
            +++ I++N+++NI++N+++N+++N++ +
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEI 1095



 Score = 29.4 bits (65), Expect = 6.2
 Identities = 8/45 (17%), Positives = 27/45 (60%)

Query: 508  QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSH 552
            +N+    +++++N+++N ++N ++N ++N +    +N    N  +
Sbjct: 1090 ENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEEY 1134



 Score = 29.0 bits (64), Expect = 9.0
 Identities = 8/56 (14%), Positives = 32/56 (57%)

Query: 494  QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVAN 549
             ++ I   I+    +N+  +++   + +++N+++N+++N ++N +   ++N    +
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYD 1123


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 37.6 bits (88), Expect = 0.018
 Identities = 12/60 (20%), Positives = 18/60 (30%), Gaps = 1/60 (1%)

Query: 477 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
           QQ Q             Q Q      Q Q  Q    S+   QQ L+  + +  +  +   
Sbjct: 625 QQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGR-QRGGLPGM 683



 Score = 32.6 bits (75), Expect = 0.70
 Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 6/69 (8%)

Query: 477 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
           Q+ +QQ           Q        Q Q         Q  Q +L +  QQ L+  + + 
Sbjct: 618 QRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAER-QQALRDELGR- 675

Query: 537 IQHLLQQNM 545
                +  +
Sbjct: 676 ----QRGGL 680


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 36.6 bits (85), Expect = 0.021
 Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 458 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQPVQN 509
           N     +P   +  Q Q  QQQ      QP  + PP+    P  P    PV+ 
Sbjct: 108 NEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRT 160



 Score = 36.6 bits (85), Expect = 0.021
 Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 1/43 (2%)

Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP 512
            P+  +  QQQ Q    PA    PQ    PP     PVQ   P
Sbjct: 114 VPRSTVQIQQQAQQQQPPATTAQPQ-PVTPPRQTTAPVQPQTP 155



 Score = 35.0 bits (81), Expect = 0.083
 Identities = 11/43 (25%), Positives = 11/43 (25%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQN 509
           Q   P     Q Q   PP Q      PQ           PV  
Sbjct: 126 QQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 30/108 (27%), Positives = 38/108 (35%), Gaps = 18/108 (16%)

Query: 440 QRNTYIKHLEIVH-GVIT----SNSANMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIP 491
           +R  YIK LE    GV T    S    +N P Q    Q QL +Q Q   +Q P Q + +P
Sbjct: 47  ERWRYIKELENRQIGVPTPTEPSAGGEVNSPTQLTDEQRQLLEQMQADMRQQPTQLSEVP 106

Query: 492 PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQH 539
                        Q  Q    ++Q  QQ  QQ       Q        
Sbjct: 107 ----------YNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPR 144



 Score = 29.6 bits (67), Expect = 3.8
 Identities = 10/44 (22%), Positives = 10/44 (22%)

Query: 463 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQP 506
            I  Q    Q      Q Q   P      P Q Q   P      
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPA 163



 Score = 29.6 bits (67), Expect = 4.3
 Identities = 9/39 (23%), Positives = 12/39 (30%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ 503
           P         +   +Q   P QP    P + Q   P  Q
Sbjct: 130 PPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168



 Score = 28.5 bits (64), Expect = 8.6
 Identities = 9/37 (24%), Positives = 11/37 (29%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 501
           P     P+      Q Q P P       P  Q +  P
Sbjct: 137 PQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 32.4 bits (74), Expect = 0.026
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 303 HLVQHSFVHGGEKPFKCSYCPKAFSH 328
           +L +H   H GEKP+KC  C K+FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 32.0 bits (73), Expect = 0.031
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 387 ELNRHAQIHGGVKPFACTVCNKSFLQ 412
            L RH + H G KP+ C VC KSF  
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 31.6 bits (72), Expect = 0.054
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 416 LQNHLYTHSKVKPYNCGICAKSFVQ 440
           L+ H+ TH+  KPY C +C KSF  
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 29.3 bits (66), Expect = 0.36
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 331 HLNEHLYTHSENKCFQCKYCPKTFT 355
           +L  H+ TH+  K ++C  C K+F+
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFS 25



 Score = 28.9 bits (65), Expect = 0.39
 Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 245 ELDSHVQTSHRGIKPYVCEICSKAFTQ 271
            L  H++T H G KPY C +C K+F+ 
Sbjct: 1   NLRRHMRT-HTGEKPYKCPVCGKSFSS 26



 Score = 27.4 bits (61), Expect = 1.8
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 359 HLKAHTNKHEGVLPHTCLECSKCFST 384
           +L+ H   H G  P+ C  C K FS+
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 26.2 bits (58), Expect = 4.1
 Identities = 8/26 (30%), Positives = 10/26 (38%)

Query: 175 EFNEHIMKHEKEKLFSCNFCLKPFLR 200
               H+  H  EK + C  C K F  
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein.  The frequency clock
           protein, is the central component of the frq-based
           circadian negative feedback loop, regulates various
           aspects of the circadian clock in Neurospora crassa.
           This protein has been shown to interact with itself via
           a coiled-coil.
          Length = 989

 Score = 37.3 bits (86), Expect = 0.027
 Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 5/65 (7%)

Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP----IQHQP 506
           V    ++ + +++  L  +  Q Q   Q     PP    I  P      P     I  Q 
Sbjct: 316 VRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPP-SNFITNPFSSVPEPLREARILPQE 374

Query: 507 VQNIH 511
            QN H
Sbjct: 375 GQNPH 379


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 36.2 bits (83), Expect = 0.034
 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 469 ITPQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHP 512
           I P  +++ Q+  QP        P PQ+Q   P     P Q I+P
Sbjct: 130 IDPTEEVNTQEPTQPAGVNVANNPQPQVQ---PQFGPNPQQRINP 171



 Score = 30.4 bits (68), Expect = 2.1
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 11/71 (15%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 526
           + +  Q+Q   +Q +Q   Q   I P +  +   P Q        P+  ++  N Q  +Q
Sbjct: 110 EQMAEQLQRISEQNEQ---QAIEIDPTEEVNTQEPTQ--------PAGVNVANNPQPQVQ 158

Query: 527 QNLQQNIQQNI 537
                N QQ I
Sbjct: 159 PQFGPNPQQRI 169


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 36.2 bits (83), Expect = 0.040
 Identities = 17/64 (26%), Positives = 20/64 (31%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           P     P VQ   QQ     P       PQ Q + P  Q  P        Q  Q + +  
Sbjct: 201 PKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSR 260

Query: 525 IQQN 528
            Q N
Sbjct: 261 EQGN 264



 Score = 34.3 bits (78), Expect = 0.18
 Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 1/74 (1%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 526
           +H        Q   Q P  QP  +  P  Q    P Q        P+ Q+  Q   Q  Q
Sbjct: 196 RHRHAPKPTQQPTVQNPAQQPT-VQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254

Query: 527 QNLQQNIQQNIQHL 540
            + +   Q N + L
Sbjct: 255 GHKRSREQGNQEFL 268



 Score = 33.9 bits (77), Expect = 0.23
 Identities = 25/74 (33%), Positives = 28/74 (37%), Gaps = 14/74 (18%)

Query: 483 PPPQPAHIPPPQI------------QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 530
           PP  P   P P +            +H P P Q   VQN  P+ Q   QN  Q  QQ  Q
Sbjct: 173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQN--PAQQPTVQNPAQQPQQQPQ 230

Query: 531 QNIQQNIQHLLQQN 544
           Q   Q  Q    QN
Sbjct: 231 QQPVQPAQQPTPQN 244



 Score = 33.9 bits (77), Expect = 0.26
 Identities = 23/75 (30%), Positives = 29/75 (38%), Gaps = 3/75 (4%)

Query: 489 HIPPPQIQ-HIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHV 547
           H P P  Q  +  P Q   VQN  P+ Q  QQ  QQ +Q   Q   Q   Q   Q     
Sbjct: 199 HAPKPTQQPTVQNPAQQPTVQN--PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGH 256

Query: 548 ANSSHHMSQQRMHQL 562
             S    +Q+ +  L
Sbjct: 257 KRSREQGNQEFLKML 271



 Score = 29.6 bits (66), Expect = 4.4
 Identities = 12/34 (35%), Positives = 13/34 (38%)

Query: 463 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ 496
           N   Q      Q   Q  QQP PQ     PPQ +
Sbjct: 220 NPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253



 Score = 29.6 bits (66), Expect = 5.5
 Identities = 16/44 (36%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query: 463 NIPLQHITPQVQLHQQQQQQ-PPPQPAHIPPPQIQHIPPPIQHQ 505
           N   Q          QQQ Q  P QPA  P PQ     PP   Q
Sbjct: 211 NPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254


>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
           ABF1 is a sequence-specific DNA binding protein involved
           in transcription activation, gene silencing and
           initiation of DNA replication. ABF1 is known to remodel
           chromatin, and it is proposed that it mediates its
           effects on transcription and gene expression by
           modifying local chromatin architecture. These functions
           require a conserved stretch of 20 amino acids in the
           C-terminal region of ABF1 (amino acids 639 to 662 S.
           cerevisiae). The N-terminal two thirds of the protein
           are necessary for DNA binding, and the N-terminus (amino
           acids 9 to 91 in S. cerevisiae) is thought to contain a
           novel zinc-finger motif which may stabilise the protein
           structure.
          Length = 508

 Score = 36.3 bits (83), Expect = 0.047
 Identities = 25/72 (34%), Positives = 33/72 (45%), Gaps = 5/72 (6%)

Query: 493 PQIQHIPPPIQHQPVQNIHPSI----QHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVA 548
           P    I     HQ   + HPSI    QH      Q+  Q+  Q+ QQ++QH  Q   HV 
Sbjct: 281 PNAHQISGADHHQHQHHHHPSIPAHHQHQLPEGHQHDHQHHHQHQQQDLQHQEQHQQHVD 340

Query: 549 NSSHHMSQQRMH 560
           N  HH+ Q R+ 
Sbjct: 341 N-DHHVYQDRIG 351


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 35.9 bits (83), Expect = 0.054
 Identities = 30/204 (14%), Positives = 50/204 (24%), Gaps = 42/204 (20%)

Query: 381 CFSTKSELNR-----HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYTHSKVKPYNCGICA 435
           CFS   E NR     H ++ GG KP    +  +   ++  LQ+                 
Sbjct: 332 CFSNPEEANRAVTEMHGRMLGG-KPLYVALAQRKEQRRAHLQD----------------- 373

Query: 436 KSFVQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQI 495
               Q        ++   +      +         P      QQQ    P          
Sbjct: 374 ----QFM------QLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP 423

Query: 496 QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMS 555
             + P    +P      +         QN     Q+   Q + +           S  +S
Sbjct: 424 TPMGPGGPLRPNGLAPMNAVRAPSRNAQNA---AQKPPMQPVMYPPNYQ------SLPLS 474

Query: 556 QQRMHQLYALHHDGYMTATASIMQ 579
           Q            G     A ++ 
Sbjct: 475 QDLPQPQSTASQGGQNKKLAQVLA 498


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 35.7 bits (82), Expect = 0.063
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 6/41 (14%)

Query: 482 QPPPQPAHIP----PPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
           QP   PA IP    PP IQ++ PP    P Q I P IQHI+
Sbjct: 232 QPSAPPASIPAPPIPPVIQYVAPP-PVPPPQPIIP-IQHIR 270



 Score = 30.3 bits (68), Expect = 3.7
 Identities = 12/40 (30%), Positives = 13/40 (32%), Gaps = 1/40 (2%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQH 504
           P Q   P   +            A  P P  Q I  PIQH
Sbjct: 230 PAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPI-IPIQH 268


>gnl|CDD|188742 cd09356, LIM2_TRIP6, The second LIM domain of Thyroid
           receptor-interacting protein 6 (TRIP6).  The second LIM
           domain of Thyroid receptor-interacting protein 6
           (TRIP6): TRIP6 is a member of the zyxin LIM protein
           family and contains three LIM zinc-binding domains at
           the C-terminal. TRIP6 protein localizes to focal
           adhesion sites and along actin stress fibers.
           Recruitment of this protein to the plasma membrane
           occurs in a lysophosphatidic acid (LPA)-dependent
           manner. TRIP6 recruits a number of molecules involved in
           actin assembly, cell motility, survival and
           transcriptional control. The function of TRIP6 in cell
           motility is regulated by Src-dependent phosphorylation
           at a Tyr residue. The phosphorylation activates the
           coupling to the Crk SH2 domain, which is required for
           the function of TRIP6 in promoting lysophosphatidic acid
           (LPA)-induced cell migration. TRIP6 can shuttle to the
           nucleus to serve as a coactivator of AP-1 and NF-kappaB
           transcriptional factors. Moreover, TRIP6 can form a
           ternary complex with the NHERF2 PDZ protein and LPA2
           receptor to regulate LPA-induced activation of ERK and
           AKT, rendering cells resistant to chemotherapy. Recent
           evidence shows that TRIP6 antagonizes Fas-Induced
           apoptosis by enhancing the antiapoptotic effect of LPA
           in cells. As in other LIM domains, this domain family is
           50-60 amino acids in size and shares two characteristic
           zinc finger motifs. The two zinc fingers contain eight
           conserved residues, mostly cysteines and histidines,
           which coordinately bond to two zinc atoms. LIM domains
           function as adaptors or scaffolds to support the
           assembly of multimeric protein.
          Length = 53

 Score = 31.8 bits (72), Expect = 0.075
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 137 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 171
           C +C K   +++L    K  H   FTC VC RS +
Sbjct: 1   CSVCSKPIMERILRATGKAYHPHCFTCVVCHRSLD 35


>gnl|CDD|220871 pfam10759, DUF2587, Protein of unknown function (DUF2587).  This is
           a bacterial family of proteins with no known function.
          Length = 168

 Score = 34.3 bits (79), Expect = 0.083
 Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 6/57 (10%)

Query: 452 HGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 508
           HG+ T+      +  Q +  + QL Q +++  PP     PP Q Q           Q
Sbjct: 116 HGIQTA------LFAQQMAARAQLEQMRRRALPPGVGIAPPGQPQGARGGPPPGTGQ 166


>gnl|CDD|213398 cd12191, gal11_coact, gall11 coactivator domain.  Gall11/MED15 acts
           in the general regulation of GAL structural genes and is
           required for full expression for several genes in this
           pathway, including GALs 1,7, and 10 in Saccharomyces
           cerevisiae. GAL11 function is dependent on GCN4
           functionality and binds GCN4 in a degenerate manner with
           multiple orientations found at the GCN4-Gal11 interface.
          Length = 90

 Score = 32.3 bits (74), Expect = 0.10
 Identities = 26/89 (29%), Positives = 34/89 (38%), Gaps = 23/89 (25%)

Query: 471 PQVQLHQQQQQQPPPQPAH---------IPPP---QIQHIPPPI-----------QHQPV 507
            Q Q  QQQQQQ  PQ            IPP    +I ++PP +           Q+   
Sbjct: 2   QQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAKIPNLPPEVKTWRQLKAWLQQNNIP 61

Query: 508 QNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
                 I+ +QQ   Q + Q  QQ  QQ 
Sbjct: 62  PQAMEQIKEVQQTHFQLLMQRRQQQQQQQ 90



 Score = 29.6 bits (67), Expect = 0.89
 Identities = 24/97 (24%), Positives = 38/97 (39%), Gaps = 22/97 (22%)

Query: 478 QQQQQPPPQPAHIPPPQIQH---------IPPPIQHQPVQNIHPSI---QHIQQNLQQNI 525
            QQQQ P Q      PQ            IPP +  + + N+ P +   + ++  LQQN 
Sbjct: 1   PQQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAK-IPNLPPEVKTWRQLKAWLQQN- 58

Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
             N+     + I+ +  Q  H       + Q+R  Q 
Sbjct: 59  --NIPPQAMEQIKEV--QQTHF----QLLMQRRQQQQ 87


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 30.0 bits (68), Expect = 0.17
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 318 KCSYCPKAFSHKGHLNEHLYTH 339
           KC  C K+FS K +L  HL TH
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.6 bits (67), Expect = 0.22
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 261 VCEICSKAFTQQYNLKHHLLLH 282
            C  C K+F+++ NLK HL  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.2 bits (66), Expect = 0.31
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 346 QCKYCPKTFTLRKHLKAHTNKH 367
           +C  C K+F+ + +LK H   H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.5 bits (64), Expect = 0.52
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 374 TCLECSKCFSTKSELNRHAQIH 395
            C +C K FS KS L RH + H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 27.3 bits (61), Expect = 1.6
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 290 RCDQCGKAFSRKGHLVQHSFVH 311
           +C  CGK+FSRK +L +H   H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 26.5 bits (59), Expect = 3.1
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 403 CTVCNKSFLQKTQLQNHLYTH 423
           C  C KSF +K+ L+ HL TH
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 25.8 bits (57), Expect = 5.8
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 108 TCEVCQQAFTRNADLKVHSMVH 129
            C  C ++F+R ++LK H   H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22


>gnl|CDD|225387 COG2831, FhaC, Hemolysin activation/secretion protein
           [Intracellular trafficking and secretion].
          Length = 554

 Score = 34.2 bits (79), Expect = 0.17
 Identities = 21/111 (18%), Positives = 41/111 (36%), Gaps = 9/111 (8%)

Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ--HIQQ-NLQQNIQ 526
            P     +QQQ +   +   +P P ++    P    P  +         I +  L+ N  
Sbjct: 27  NPAEGNIRQQQDERARREELLPAPDVRL-EQPDPAAPAVDAPDETPCFPINRIELEGNTL 85

Query: 527 QNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGYMTATASI 577
            +    +Q  +   L + + +A+ +   S   +  LY     GY+T  A +
Sbjct: 86  LDA-LELQAILAPYLGRCLGLADLNQLAS--ALTNLYI--DRGYVTTRALL 131


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 32.5 bits (74), Expect = 0.21
 Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 1/38 (2%)

Query: 471 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 508
           PQ Q+ Q     P P    +P  Q+  + P   H P+ 
Sbjct: 20  PQAQMPQASAPYPGPSMY-LPMAQVMAVGPQSSHPPMA 56



 Score = 29.8 bits (67), Expect = 1.8
 Identities = 11/60 (18%), Positives = 17/60 (28%), Gaps = 7/60 (11%)

Query: 471 PQVQLHQQQQQQPPP-----QPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ--HIQQNLQQ 523
              +L+Q     PP        A  P P +      +     Q+ HP +    I      
Sbjct: 7   AYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPV 66



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 12/49 (24%), Positives = 16/49 (32%), Gaps = 2/49 (4%)

Query: 459 SANMN-IPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQ 505
            A M      +  P + L   Q     PQ +H P        PPP+   
Sbjct: 21  QAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 29.7 bits (67), Expect = 0.24
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 317 FKCSYCPKAFSHKGHLNEHLYTH 339
           ++C  C K F  K  L EH+ TH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.8 bits (62), Expect = 1.1
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 373 HTCLECSKCFSTKSELNRHAQIH 395
           + C EC K F +KS L  H + H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.8 bits (62), Expect = 1.1
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 260 YVCEICSKAFTQQYNLKHHLLLH 282
           Y C  C K F  +  L+ H+  H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.8 bits (62), Expect = 1.1
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 401 FACTVCNKSFLQKTQLQNHLYTH 423
           + C  C K F  K+ L+ H+ TH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.4 bits (61), Expect = 1.3
 Identities = 6/23 (26%), Positives = 10/23 (43%)

Query: 161 FTCEVCMRSFNDRKEFNEHIMKH 183
           + C  C + F  +    EH+  H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 3.4
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 289 FRCDQCGKAFSRKGHLVQHSFVH 311
           +RC +CGK F  K  L +H   H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 3.8
 Identities = 7/23 (30%), Positives = 11/23 (47%)

Query: 345 FQCKYCPKTFTLRKHLKAHTNKH 367
           ++C  C K F  +  L+ H   H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|227561 COG5236, COG5236, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 493

 Score = 33.8 bits (77), Expect = 0.26
 Identities = 30/163 (18%), Positives = 54/163 (33%), Gaps = 34/163 (20%)

Query: 133 QLHKCELCYKMFSD---KLLYEKHKMV------HRQVFTCEVCMRSFNDRKEFNEHIMKH 183
              KC        +       +   ++      +++ F  E+ +          +H    
Sbjct: 155 PKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFR---SSTLRDHKNGG 211

Query: 184 EKEKLFS----CNFCLKPFLRKDLLEAHVNSVHKDVKKYNCEECDKYYLSQGGVGCV-FL 238
            +E+ F     C FC   F   D L  H    H+      C  CD        VG + + 
Sbjct: 212 LEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE-----ACHICDM-------VGPIRYQ 259

Query: 239 YYLSQGELDSHVQTSHRGIKPYVCEICSKAFTQQYNLKHHLLL 281
           Y+ S  +L++H + +H     Y C   +    + Y   +H  L
Sbjct: 260 YFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHTEL 297


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 31.2 bits (71), Expect = 0.37
 Identities = 17/57 (29%), Positives = 18/57 (31%), Gaps = 4/57 (7%)

Query: 461 NMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPP----PIQHQPVQNIHPS 513
               P Q+   Q    Q   QQ PP P   P  Q  H  P    P   Q  Q   P 
Sbjct: 7   TNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPP 63



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 20/73 (27%), Positives = 24/73 (32%), Gaps = 8/73 (10%)

Query: 471 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 530
                 Q QQQQP  Q      P  Q +PPP    P Q        +    Q   +Q+ Q
Sbjct: 7   TNQYPPQNQQQQPYQQ-----QPYHQQMPPPPYSPPQQ---QQGHFMPPQPQPYPKQSPQ 58

Query: 531 QNIQQNIQHLLQQ 543
           Q         L Q
Sbjct: 59  QQQPPQFSSFLSQ 71


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 32.7 bits (74), Expect = 0.46
 Identities = 22/91 (24%), Positives = 30/91 (32%), Gaps = 16/91 (17%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 526
           Q +  Q Q  Q  +QQ   Q +       Q      Q  PV              Q   +
Sbjct: 151 QTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPV--------------QAQPR 196

Query: 527 QNLQQNIQQNIQHLLQQNMHVANSSHHMSQQ 557
           Q+   + QQ  Q LLQ   H   ++    QQ
Sbjct: 197 QSKPASTQQPYQDLLQTPAH--TTAQSKPQQ 225



 Score = 29.3 bits (65), Expect = 6.4
 Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 2/60 (3%)

Query: 466 LQHITPQVQLHQQQQ-QQPPPQPAHIPPPQIQHI-PPPIQHQPVQNIHPSIQHIQQNLQQ 523
           LQ      QL +QQ+  Q           Q +     P+Q QP Q+   S Q   Q+L Q
Sbjct: 153 LQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQ 212


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 32.9 bits (76), Expect = 0.55
 Identities = 21/101 (20%), Positives = 44/101 (43%), Gaps = 13/101 (12%)

Query: 466 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
           L+  +P+  L QQQQ+             +  +   +Q    + +  + Q +++  Q+  
Sbjct: 298 LKFQSPERLLAQQQQR-------------LDRLQQRLQRALERRLRLAKQRLERLSQRLQ 344

Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALH 566
           QQN Q+ I++  Q L Q    +  +     +++  +L AL 
Sbjct: 345 QQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQRLEALA 385


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 31.4 bits (72), Expect = 0.56
 Identities = 10/45 (22%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 500 PPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ-QNIQQNIQHLLQQ 543
           PP  + P  +I P +  ++Q  +  I ++ + Q + ++I  L ++
Sbjct: 23  PPADYTPYGDISPFLPELRQKHKARIAKDPEFQYLNEDIAWLKEE 67


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 32.0 bits (73), Expect = 0.65
 Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 1/58 (1%)

Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 508
           VH VI   +    I +  + P       Q  QPPP+P   P P+ + IP P +  PV 
Sbjct: 35  VHQVIELPAPAQPISVTMVAPADL-EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 91


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 32.0 bits (73), Expect = 0.72
 Identities = 10/47 (21%), Positives = 13/47 (27%)

Query: 483 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 529
            P  P+  P P     PP  Q  P+    P+               L
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276



 Score = 30.4 bits (69), Expect = 2.2
 Identities = 15/58 (25%), Positives = 20/58 (34%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 522
             Q  TP           PPP PA  PPP +Q +PP    +P      +       + 
Sbjct: 218 SFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 32.2 bits (74), Expect = 0.85
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 68  QSKEKKAKKTKVPMVQVLEYVEQSKEKKAKKTKV 101
           ++K    KKT  P  +VL    + KEK+ +K KV
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412


>gnl|CDD|234905 PRK01122, PRK01122, potassium-transporting ATPase subunit B;
           Provisional.
          Length = 679

 Score = 32.1 bits (74), Expect = 0.87
 Identities = 13/27 (48%), Positives = 16/27 (59%), Gaps = 2/27 (7%)

Query: 9   EIYLLILLLTLTIVLVLPVVSVELLPM 35
           EI L ILL  LTI+ +L V +  L P 
Sbjct: 215 EIALTILLAGLTIIFLLVVAT--LPPF 239


>gnl|CDD|216066 pfam00695, vMSA, Major surface antigen from hepadnavirus. 
          Length = 364

 Score = 31.9 bits (72), Expect = 0.88
 Identities = 14/20 (70%), Positives = 16/20 (80%)

Query: 6   RRFEIYLLILLLTLTIVLVL 25
           RRF I+LLILLL L  +LVL
Sbjct: 223 RRFIIFLLILLLALGFLLVL 242


>gnl|CDD|235654 PRK05957, PRK05957, aspartate aminotransferase; Provisional.
          Length = 389

 Score = 32.0 bits (73), Expect = 0.90
 Identities = 15/41 (36%), Positives = 19/41 (46%)

Query: 483 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
           PPP  A          P   ++Q VQ I P ++ I Q LQQ
Sbjct: 40  PPPPEAIEALNNFLANPENHKYQAVQGIPPLLEAITQKLQQ 80


>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
          Length = 230

 Score = 31.2 bits (70), Expect = 1.1
 Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 5/43 (11%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQ---HIPPPIQHQP 506
            H  PQ      +   PPP P   P P  Q   H  PP+  QP
Sbjct: 40  HHPNPQED--PDKNPSPPPDPPLTPRPPAQPNGHNKPPVTKQP 80


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 31.1 bits (70), Expect = 1.1
 Identities = 12/36 (33%), Positives = 15/36 (41%)

Query: 458 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPP 493
            + N + P Q      Q  QQQ +  P QPA  P  
Sbjct: 126 GNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161


>gnl|CDD|225126 COG2216, KdpB, High-affinity K+ transport system, ATPase chain B
           [Inorganic ion transport and metabolism].
          Length = 681

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 12/27 (44%), Positives = 16/27 (59%), Gaps = 2/27 (7%)

Query: 9   EIYLLILLLTLTIVLVLPVVSVELLPM 35
           EI L ILL  LT++ +L V +  L P 
Sbjct: 215 EIALTILLSGLTLIFLLAVAT--LYPF 239


>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
          Length = 332

 Score = 31.6 bits (72), Expect = 1.2
 Identities = 14/89 (15%), Positives = 30/89 (33%), Gaps = 13/89 (14%)

Query: 463 NIPLQHITPQVQLHQQQQQQP-----PPQPAHIPPPQ-IQHIPPPIQHQPVQNIHPSIQH 516
            +P Q     +Q H+ +  QP                 I+    P    P++ I+ S + 
Sbjct: 104 TLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIEETQSPNV--PIEGINSSSEQ 161

Query: 517 IQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
           ++  L +   +     I  +  H ++   
Sbjct: 162 LRVELAELAAE-----IYSDASHRVELAK 185



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 7/76 (9%), Positives = 20/76 (26%), Gaps = 4/76 (5%)

Query: 439 VQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHI 498
            Q N     +E            +N   +    +V+L +   +        +   +   +
Sbjct: 133 TQTNVASMTIEETQSPNVPI-EGINSSSEQ--LRVELAELAAEIYSDASHRVELAK-NFM 188

Query: 499 PPPIQHQPVQNIHPSI 514
            P  + +       + 
Sbjct: 189 EPQAETEAQPEATTNF 204


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.6 bits (71), Expect = 1.3
 Identities = 23/84 (27%), Positives = 25/84 (29%), Gaps = 4/84 (4%)

Query: 464 IPLQHITPQVQLHQQQQQQPP-PQPAHIPPPQIQHIPP---PIQHQPVQNIHPSIQHIQQ 519
           +P  H   Q Q   QQ  QPP P   H         PP    +Q  PV   HPS    Q 
Sbjct: 237 LPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQP 296

Query: 520 NLQQNIQQNLQQNIQQNIQHLLQQ 543
                 Q        Q   H    
Sbjct: 297 FGLAQSQVPPLPLPSQAQPHSHTP 320



 Score = 30.0 bits (67), Expect = 4.2
 Identities = 15/50 (30%), Positives = 19/50 (38%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQH 516
            H          Q QQPP +    P P + HI PP      Q  + S +H
Sbjct: 315 PHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKH 364


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 31.1 bits (70), Expect = 1.4
 Identities = 13/69 (18%), Positives = 21/69 (30%), Gaps = 16/69 (23%)

Query: 38  SKEKKAKKTKVPMVQVADFVSGVWSLEYVEQSKEKKAKKTKVPMVQVLEYVEQSKEKKAK 97
           ++EKK    K                    +   K  K  K    +  +   + K+ K+ 
Sbjct: 13  TEEKKPAAKKAT----------------TSKETAKTKKTAKTTSTKAAKKAAKVKKTKSV 56

Query: 98  KTKVPMVQV 106
           KT    V V
Sbjct: 57  KTTTKKVTV 65



 Score = 28.8 bits (64), Expect = 7.0
 Identities = 18/89 (20%), Positives = 27/89 (30%), Gaps = 13/89 (14%)

Query: 34  PMQQSKEKKAKKTKVPMVQVADFVSGVWSLEYVEQSKEKKAKKTKVPMVQVLEYVEQSKE 93
              +   K  K  K    + A             + K+ K+ KT    V V    +  K 
Sbjct: 24  TTSKETAKTKKTAKTTSTKAAK---------KAAKVKKTKSVKTTTKKVTV----KFEKT 70

Query: 94  KKAKKTKVPMVQVFTCEVCQQAFTRNADL 122
           +  KK  V    V    V  + F  +  L
Sbjct: 71  ESVKKESVAKKTVKKEAVSAEVFEASNKL 99


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.5 bits (72), Expect = 1.4
 Identities = 11/70 (15%), Positives = 13/70 (18%), Gaps = 4/70 (5%)

Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQ-IQHIPPPIQHQPVQNIHP--SIQHIQQNLQQNIQ 526
           TP            P  P            P            P         N      
Sbjct: 378 TPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437

Query: 527 Q-NLQQNIQQ 535
             NL++  QQ
Sbjct: 438 SLNLEELWQQ 447


>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein.  The Polycystin
           Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
           huge protein of 4303aas. Its repeated leucine-rich (LRR)
           segment is found in many proteins. It contains 16
           polycystic kidney disease (PKD) domains, one
           LDL-receptor class A domain, one C-type lectin family
           domain, and 16-18 putative TMSs in positions between
           residues 2200 and 4100. Polycystin-L has been shown to
           be a cation (Na+, K+ and Ca2+) channel that is activated
           by Ca2+. Two members of the PCC family (polycystin 1 and
           2) are mutated in autosomal dominant polycystic kidney
           disease, and polycystin-L is deleted in mice with renal
           and retinal defects. Note: this model is restricted to
           the amino half for technical reasons.
          Length = 2740

 Score = 31.6 bits (71), Expect = 1.6
 Identities = 13/44 (29%), Positives = 17/44 (38%)

Query: 475 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
                   PPP  A   P  ++HI P +   P+Q  H  I   Q
Sbjct: 153 ESLCSGPPPPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQ 196


>gnl|CDD|216927 pfam02203, TarH, Tar ligand binding domain homologue. 
          Length = 146

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 6/29 (20%), Positives = 13/29 (44%)

Query: 530 QQNIQQNIQHLLQQNMHVANSSHHMSQQR 558
              + +   + LQQ   +A++   + Q R
Sbjct: 12  NDALDEVYTNSLQQQAALADAWVLLLQAR 40


>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF.
          Sequences in this family of proteins are members of the
          chain length determinant family (pfam02706) which
          includes the wzc protein from E.coli. This family of
          proteins are homologous to the EpsF protein of the
          methanolan biosynthesis operon of Methylobacillus
          species strain 12S. The distribution of this protein
          appears to be restricted to a subset of
          exopolysaccharide operons containing a syntenic
          grouping of genes including a variant of the EpsH
          exosortase protein. Exosortase has been proposed to be
          involved in the targetting and processing of proteins
          containing the PEP-CTERM domain to the
          exopolysaccharide layer.
          Length = 444

 Score = 30.9 bits (70), Expect = 1.8
 Identities = 19/60 (31%), Positives = 29/60 (48%), Gaps = 8/60 (13%)

Query: 1  MVVLGRRFEIYLLILLLTLTIVLVLPVVSVELLPMQQSKEKKAKKTKVPMVQVADFVSGV 60
          +++L  R+ I L  LL+T+T   V     V LL     KE  A  + V   +  D V+G+
Sbjct: 7  LLILKARYWIVLFTLLITVTTTAV-----VSLL---LPKEYTATTSVVLDYKGPDPVTGM 58


>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3.  The
           C.elegans Notch pathway, involved in the control of
           growth, differentiation and patterning in animal
           development, relies on either of the receptors GLP-1 or
           LIN-12. Both these receptors promote signalling by the
           recruitment of LAG-3 to target promoters, where it then
           acts as a transcriptional activator. LAG-3 works as a
           ternary complex together with the DNA binding protein,
           LAG-1.
          Length = 476

 Score = 31.1 bits (69), Expect = 1.9
 Identities = 27/95 (28%), Positives = 38/95 (40%), Gaps = 1/95 (1%)

Query: 478 QQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
           Q    P   P HI    Q Q+    +Q Q ++      Q  QQ + Q+ QQ  Q++ QQ 
Sbjct: 310 QSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQ 369

Query: 537 IQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGYM 571
           +    QQ MH     H M+        A  H  Y+
Sbjct: 370 MLLQQQQQMHQLQQHHQMNGGGQFATQAHQHAAYL 404



 Score = 29.5 bits (65), Expect = 5.3
 Identities = 26/104 (25%), Positives = 36/104 (34%), Gaps = 6/104 (5%)

Query: 473 VQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQP------VQNIHPSIQHIQQNLQQNIQ 526
           +  HQQQQQQ   Q   +   Q Q       HQ           H    ++QQ     +Q
Sbjct: 354 MHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNGGGQFATQAHQHAAYLQQMQHMRLQ 413

Query: 527 QNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGY 570
           + +Q   QQ   H   Q  H   + H      +   Y  H  G+
Sbjct: 414 EQIQHQQQQAQHHQQAQQQHQQPAQHGQMGYGIPNGYPAHMHGH 457



 Score = 29.2 bits (64), Expect = 6.8
 Identities = 21/93 (22%), Positives = 30/93 (32%)

Query: 469 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 528
                +   +      PQ A  P    QHI    Q Q    +    +   Q ++Q  QQ 
Sbjct: 293 EGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQ 352

Query: 529 LQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
           +    QQ  Q   QQ   +      M Q + H 
Sbjct: 353 IMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHH 385


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 27.2 bits (60), Expect = 2.0
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 317 FKCSYCPKAFSHKGHLNEHLYTH 339
           FKC  C K+FS K  L  HL  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.8 bits (59), Expect = 2.4
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 345 FQCKYCPKTFTLRKHLKAHTNKH 367
           F+C  C K+F+ +  LK H  KH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 25.7 bits (56), Expect = 6.5
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 401 FACTVCNKSFLQKTQLQNHLYTH 423
           F C +C KSF  K  L+ HL  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 25.3 bits (55), Expect = 7.8
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 260 YVCEICSKAFTQQYNLKHHLLLH 282
           + C +C K+F+ +  LK HL  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 25.3 bits (55), Expect = 8.0
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 161 FTCEVCMRSFNDRKEFNEHIMKH 183
           F C +C +SF+ +     H+ KH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|188739 cd09353, LIM2_Zyxin, The second LIM domain of Zyxin.  The second
           LIM domain of Zyxin: Zyxin exhibits three copies of the
           LIM domain, an extensive proline-rich domain and a
           nuclear export signal.  Localized at sites of
           cellsubstratum adhesion in fibroblasts, Zyxin interacts
           with alpha-actinin, members of the cysteine-rich protein
           (CRP) family, proteins that display Src homology 3 (SH3)
           domains and Ena/VASP family members. Zyxin and its
           partners have been implicated in the spatial control of
           actin filament assembly as well as in pathways important
           for cell differentiation. In addition to its functions
           at focal adhesion plaques, recent work has shown that
           zyxin moves from the sites of cell contacts to the
           nucleus, where it directly participates in the
           regulation of gene expression. As in other LIM domains,
           this domain family is 50-60 amino acids in size and
           shares two characteristic zinc finger motifs. The two
           zinc fingers contain eight conserved residues, mostly
           cysteines and histidines, which coordinately bond to two
           zinc atoms. LIM domains function as adaptors o r
           scaffolds to support the assembly of multimeric protein.
          Length = 60

 Score = 28.0 bits (62), Expect = 2.1
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 137 CELCYKMFSDKLLYEKHKMVHRQVFTCEVC 166
           C +C +  +D++L    K  H Q FTC VC
Sbjct: 1   CAVCDQKITDRMLKATGKSYHPQCFTCVVC 30


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 30.7 bits (69), Expect = 2.1
 Identities = 8/24 (33%), Positives = 10/24 (41%)

Query: 476 HQQQQQQPPPQPAHIPPPQIQHIP 499
              QQQQ P Q   I    +Q + 
Sbjct: 430 PHNQQQQEPLQNPDIYTKDLQRLT 453


>gnl|CDD|181567 PRK08861, PRK08861, cystathionine gamma-synthase; Provisional.
          Length = 388

 Score = 30.6 bits (69), Expect = 2.1
 Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 13/86 (15%)

Query: 59  GVWSLEYVEQSKEKK-----AKKTKV--------PMVQVLEYVEQSKEKKAKKTKVPMVQ 105
           G + +++V+QS         AKK K+        P+V+V++  E  ++ KA    V +  
Sbjct: 116 GDFKVQFVDQSDAAALDAALAKKPKLILLETPSNPLVRVVDIAELCQKAKAVGALVAVDN 175

Query: 106 VFTCEVCQQAFTRNADLKVHSMVHKI 131
            F   V Q+     AD  +HS    I
Sbjct: 176 TFLTPVLQKPLELGADFVIHSTTKYI 201


>gnl|CDD|188740 cd09354, LIM2_LPP, The second LIM domain of lipoma preferred
           partner (LPP).  The second LIM domain of lipoma
           preferred partner (LPP): LPP is a member of the zyxin
           LIM protein family and contains three LIM zinc-binding
           domains at the C-terminal and proline-rich region at the
           N-terminal.  LPP initially identified as the most
           frequent translocation partner of HMGA2 (High Mobility
           Group A2) in a subgroup of benign tumors of adipose
           tissue (lipomas). It was also shown to be rearranged in
           a number of other soft tissues, as well as in a case of
           acute monoblastic leukemia. In addition to its
           involvement in tumors, LPP was inedited as a smooth
           muscle restricted LIM protein that plays an important
           role in SMC migration. LPP is localized at sites of cell
           adhesion, cell-cell contacts and transiently in the
           nucleus. In nucleus, it acts as a coactivator for the
           ETS domain transcription factor PEA3. In addition to
           PEA3, it interacts with alpha-actinin,vasodilator
           stimulated phosphoprotein (VASP),Palladin, and Scrib.
           The  LIM domains are the main focal adhesion targeting
           elements and that the proline- rich region, which
           harbors binding sites for alpha-actinin and vasodilator-
           stimulated phosphoprotein (VASP), has a weak targeting
           capacity. As in other LIM domains, this domain family is
           50-60 amino acids in size and shares two characteristic
           zinc finger motifs. The two zinc fingers contain eight
           conserved residues, mostly cysteines and histidines,
           which coordinately bond to two zinc atoms. LIM domains
           function as adaptors or scaffolds to support the
           assembly of multimeric protein.
          Length = 60

 Score = 27.9 bits (62), Expect = 2.4
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 137 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 171
           C +C K   D++L    K  H Q FTC VC +S +
Sbjct: 1   CSVCSKPILDRILRATGKPYHPQCFTCVVCGKSLD 35


>gnl|CDD|221266 pfam11851, DUF3371, Domain of unknown function (DUF3371).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 125 to 142 amino acids in length.
          Length = 131

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 11/66 (16%), Positives = 20/66 (30%), Gaps = 9/66 (13%)

Query: 451 VHGVITSNSANMNIPLQHITPQVQLH---QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPV 507
            HG+       +   L   T  +      Q+ Q +    P  + P   Q   PP     +
Sbjct: 5   AHGL------PVVASLGLCTADLMARVIKQEPQLEDNQCPQDLYPHSSQPDLPPPTTLDL 58

Query: 508 QNIHPS 513
            +   +
Sbjct: 59  NDGTIT 64


>gnl|CDD|192241 pfam09280, XPC-binding, XPC-binding domain.  Members of this family
           adopt a structure consisting of four alpha helices,
           arranged in an array. They bind specifically and
           directly to the xeroderma pigmentosum group C protein
           (XPC) to initiate nucleotide excision repair.
          Length = 59

 Score = 27.6 bits (62), Expect = 2.8
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 5/39 (12%)

Query: 511 HPSIQHIQQNLQQN---IQQNLQQNIQQNIQ--HLLQQN 544
           +P  Q ++Q +QQN   +   LQQ  Q N Q   L+QQN
Sbjct: 7   NPQFQQLRQMVQQNPELLPPLLQQLGQSNPQLLQLIQQN 45


>gnl|CDD|181818 PRK09393, ftrA, transcriptional activator FtrA; Provisional.
          Length = 322

 Score = 30.3 bits (69), Expect = 3.1
 Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 4/39 (10%)

Query: 490 IPP----PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           +PP     Q Q +P P+  +    + P I  ++ +L + 
Sbjct: 196 VPPHRDGGQAQFVPRPVASRESDRLGPLIDWMRAHLAEP 234


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 28.7 bits (64), Expect = 3.6
 Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 255 RGIKPYVCEICSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQH 307
           + + PYVC +C   F+   +LK H+           C  CGK F      + H
Sbjct: 69  KAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV---CPVCGKEFRNTDSTLDH 118



 Score = 28.7 bits (64), Expect = 4.1
 Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 2/52 (3%)

Query: 161 FTCEVCMRSFNDRKEFNEHIMKHEKEKLFSCNFCLKPFLRKDLLEAHVNSVH 212
           + C +C+  F+      +HI   E  K+  C  C K F   D    HV   H
Sbjct: 74  YVCPLCLMPFSSSVSLKQHIRYTEHSKV--CPVCGKEFRNTDSTLDHVCKKH 123



 Score = 28.3 bits (63), Expect = 5.8
 Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 6/62 (9%)

Query: 398 VKPFACTVCNKSFLQKTQLQNHL-YT-HSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVI 455
           V P+ C +C   F     L+ H+ YT HSKV    C +C K F   ++ + H+   H + 
Sbjct: 71  VSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV----CPVCGKEFRNTDSTLDHVCKKHNIC 126

Query: 456 TS 457
            S
Sbjct: 127 VS 128


>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score = 30.1 bits (68), Expect = 3.6
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 187 KLFSCNFCLKPFLRKDLLEAH 207
           KL+ C FCLK   RK+ L+ H
Sbjct: 197 KLYFCEFCLKFMKRKEQLQRH 217


>gnl|CDD|116585 pfam07975, C1_4, TFIIH C1-like domain.  The carboxyl-terminal
           region of TFIIH is essential for transcription activity.
           This regions binds three zinc atoms through two
           independent domain. The first contains a C4 zinc finger
           motif, whereas the second is characterized by a
           CX(2)CX(2-4)FCADCD motif. The solution structure of the
           second C-terminal domain revealed homology with the
           regulatory domain of protein kinase C (pfam00130).
          Length = 48

 Score = 27.0 bits (60), Expect = 3.8
 Identities = 11/47 (23%), Positives = 17/47 (36%), Gaps = 6/47 (12%)

Query: 94  KKAKKTKVPMVQVFTCEVCQQAFTRNADLKVHSMVHKIAQLHKCELC 140
           +K          V+ C  C+  F  + D+ +H        LH C  C
Sbjct: 7   QKLPDGINKTSSVYRCPKCKNVFCLDCDVFIHE------SLHNCPGC 47


>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
          Length = 84

 Score = 28.0 bits (62), Expect = 3.9
 Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 7/41 (17%)

Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 510
           TP+ Q  Q QQQ  P Q   I  PQ    P   QH   Q +
Sbjct: 51  TPEQQQQQPQQQIAPNQ---IEAPQ----PNFQQHYQGQQV 84


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.0 bits (67), Expect = 4.0
 Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 5/61 (8%)

Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ---HQPVQNIHPSIQHIQQNLQQNIQ 526
            P  Q     QQ+P   P   PPPQ    P  +Q           P+ Q ++Q L   ++
Sbjct: 777 QPPPQAPPAPQQRPRGAPTPQPPPQA--GPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834

Query: 527 Q 527
           +
Sbjct: 835 R 835


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 29.1 bits (65), Expect = 4.0
 Identities = 25/91 (27%), Positives = 30/91 (32%), Gaps = 8/91 (8%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ---HQPVQNIHPSIQHIQQNLQQ 523
             I P      QQQQ   P+   + P     I PP Q     P  +    +   Q +LQ 
Sbjct: 48  HPIIPLSPQMPQQQQSAHPK---LTPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQP 104

Query: 524 NIQQNLQQNIQQNIQHLLQQNMHVANSSHHM 554
             Q  LQ    QN Q   QQ  H        
Sbjct: 105 PAQHPLQPTYGQNPQP--QQPTHTQPPVQPQ 133



 Score = 28.3 bits (63), Expect = 8.1
 Identities = 16/62 (25%), Positives = 21/62 (33%), Gaps = 1/62 (1%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
           P+  +     +     QQP  QP    P Q  +   P   QP     P +Q  Q    Q 
Sbjct: 82  PMMPVPGHHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQPTHTQPP-VQPQQPADPQP 140

Query: 525 IQ 526
            Q
Sbjct: 141 GQ 142


>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3.  The
           reovirus inner capsid protein lambda-1 displays
           nucleoside triphosphate phosphohydrolase (NTPase),
           RNA-5'-triphosphatase (RTPase), and RNA helicase
           activity and may play a role in the transcription of the
           virus genome, the unwinding or reannealing of
           double-stranded RNA during RNA synthesis. The RTPase
           activity constitutes the first step in the capping of
           RNA, resulting in a 5'-diphosphorylated RNA plus-strand.
           lambda1 is an Orthoreovirus core protein, VP3 is the
           homologous core protein in Aquareoviruses.
          Length = 1166

 Score = 30.1 bits (68), Expect = 4.1
 Identities = 10/35 (28%), Positives = 11/35 (31%), Gaps = 1/35 (2%)

Query: 391 HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYT-HS 424
                     + C VC   F    QL  H  T HS
Sbjct: 64  KKATPINPSSYVCNVCMAEFSSMDQLAEHQRTTHS 98


>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
           C2H2 Zn-finger [General function prediction only].
          Length = 65

 Score = 27.3 bits (60), Expect = 4.3
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 284 DGNSLFRCDQCGKAFSRKGHLVQH 307
           DG    RC +CG  F R+   ++H
Sbjct: 13  DGEEFLRCPRCGMVFRRRKDYIRH 36


>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding.  The Creb binding domain
           assumes a structure comprising of three alpha-helices
           which pack in a bundle, exposing a hydrophobic groove
           between alpha-1 and alpha-3 within which complimentary
           domains found in the protein 'activator for thyroid
           hormone and retinoid receptors' (ACTR) can dock. Docking
           of these domains is required for the recruitment of RNA
           polymerase II and the basal transcription machinery.
          Length = 104

 Score = 28.1 bits (62), Expect = 5.0
 Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 2/80 (2%)

Query: 470 TPQVQLHQQQQQQPP-PQPAHIPPPQI-QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQ 527
            PQ  L QQQQ QP  P+P      Q     P P   QP  +      +  Q+L + ++ 
Sbjct: 6   WPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLVQPGISRGIVSPNALQDLLRTLKS 65

Query: 528 NLQQNIQQNIQHLLQQNMHV 547
                 QQ + ++L+ N  +
Sbjct: 66  PSSPQQQQQVLNILKSNPQL 85


>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
           Provisional.
          Length = 177

 Score = 28.7 bits (64), Expect = 5.7
 Identities = 7/28 (25%), Positives = 10/28 (35%)

Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPP 492
           P            + QQ  P  P++ PP
Sbjct: 141 PQGGNQFSGGAQSRPQQSAPAAPSNEPP 168


>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
          Length = 542

 Score = 29.5 bits (67), Expect = 6.3
 Identities = 6/18 (33%), Positives = 11/18 (61%)

Query: 518 QQNLQQNIQQNLQQNIQQ 535
           Q   +Q +++ LQQ I+ 
Sbjct: 235 QLKYKQQVEKRLQQRIES 252


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 29.4 bits (66), Expect = 6.5
 Identities = 12/67 (17%), Positives = 18/67 (26%), Gaps = 3/67 (4%)

Query: 467 QHITPQVQLHQQQQQQPPPQPAHIP---PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
              TP           P   P+  P    P        I    V ++ PS     Q   +
Sbjct: 420 PAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAE 479

Query: 524 NIQQNLQ 530
               N++
Sbjct: 480 QATGNIK 486


>gnl|CDD|182696 PRK10748, PRK10748, flavin mononucleotide phosphatase; Provisional.
          Length = 238

 Score = 28.5 bits (64), Expect = 9.5
 Identities = 12/54 (22%), Positives = 25/54 (46%), Gaps = 3/54 (5%)

Query: 507 VQNIHPSIQHIQQNLQQNIQQNLQQ---NIQQNIQHLLQQNMHVANSSHHMSQQ 557
           VQN HP+++  Q    Q ++Q L++    I  ++     + +  A     +S +
Sbjct: 39  VQNYHPALRSFQNEDLQRLRQALREAEPEIYHDVTRWRWRAIEQAMLDAGLSAE 92


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 27.9 bits (62), Expect = 9.8
 Identities = 12/51 (23%), Positives = 18/51 (35%), Gaps = 2/51 (3%)

Query: 455 ITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQ 505
               + N N P  +  P    +  Q Q    Q A+    Q Q  PP  ++ 
Sbjct: 121 SFHENFNNNYPGNYNNPSQDPYMNQAQS-YNQNAYAKENQ-QAQPPKYENS 169


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.133    0.414 

Gapped
Lambda     K      H
   0.267   0.0704    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,445,280
Number of extensions: 2514350
Number of successful extensions: 5463
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4663
Number of HSP's successfully gapped: 286
Length of query: 579
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 477
Effective length of database: 6,413,494
Effective search space: 3059236638
Effective search space used: 3059236638
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.7 bits)