RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14280
         (1891 letters)



>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination,
           and repair / Cell division and chromosome partitioning /
           Translation].
          Length = 953

 Score =  640 bits (1651), Expect = 0.0
 Identities = 319/889 (35%), Positives = 451/889 (50%), Gaps = 166/889 (18%)

Query: 1   MGVPKFFRYISERYPCLSEVVREYQIPEFDNLYLDMNGIIHNCSHPNDNDPHFRITEKQI 60
           MGVP FFR++SERYP + +++ E QIPEFDNLYLDMNGI+HNC+HPND  P    TE+++
Sbjct: 1   MGVPSFFRWLSERYPKIIQLIEEKQIPEFDNLYLDMNGILHNCTHPNDGSP--PETEEEM 58

Query: 61  FQDIFHYIEILFRMIQPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEKGE 120
           ++ +F YI+ +   I+P+KL +MA+DGVAPRAKMNQQR RRFRSAK+A     KA+  GE
Sbjct: 59  YKAVFEYIDHILLKIRPRKLLYMAVDGVAPRAKMNQQRARRFRSAKDASAAALKAEPNGE 118

Query: 121 RLPTEKRFDFITLKFCFPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEKG 180
            +P EK                                              ++   E  
Sbjct: 119 EIPEEK----------------------------------------------DEIGNEID 132

Query: 181 ERLPTEKRFDSNCITPGTPFMCRLHQQLRYFVKLKITTDKLWKDTLSLWVWKDVCVILSG 240
                +K+FDSNCITPGTPFM RL + LRY++  K+++D           W+++ +I SG
Sbjct: 133 TIDVEKKKFDSNCITPGTPFMERLAKVLRYYIHCKLSSDP---------EWRNLRIIFSG 183

Query: 241 HETPGEGEHKIMDYIRYMKSKPDYNPNMRHCLYGLDADLIMLGLCTHEPYFSLLREEVKF 300
           H  PGEGEHKIM++IR  K++P YNPN RHC+YGLDADLIMLGL THEP+F +LRE+V F
Sbjct: 184 HLVPGEGEHKIMNFIRSQKAQPSYNPNTRHCIYGLDADLIMLGLSTHEPHFLILREDVFF 243

Query: 301 ---NTKSNKGGSQIVT---------PEETKFFLLHLSLMRDYLELEFGKLKEPGVLSFPF 348
              + +  K      T              F+LLH+SL+R+YLE EF   +EP  L F F
Sbjct: 244 GSKSRRKRKCTKCGRTGHSDEECKVLTHQPFYLLHISLLREYLEREF---REPT-LPFTF 299

Query: 349 SIENIIDDWVLMGFLIGNDFIPNLPDLHIINGALPLLYKVYQEVLP------TLDGYIN- 401
            +E I+DDW+ + F +GNDF+P+LP L I  GA+  L +++++ LP      T DG IN 
Sbjct: 300 DLERILDDWIFLCFFVGNDFLPHLPCLDIREGAIETLTEIWKKSLPHMKGYITCDGVINL 359

Query: 402 ---EGGYLNLKRFEKFLTK------------LASFDFEQFREKYAD--------IQYLEG 438
              E     L  FE  + K            L  F   + R++            + L G
Sbjct: 360 ARLEVILAILGSFEDDIFKKDHIQEERKNESLERFSLRKERKEGLKGMPRVVYEQKKLIG 419

Query: 439 KMGKKFSDVFHKEKS----KEMDPDLAALIKATDD------------------------D 470
            +     D   ++KS     E   D  AL K  D                          
Sbjct: 420 SIKPTLMDQLQEKKSPDLPDEEFIDTLALPKDLDMKNHELFLKRFANDLGLSISKAIKSK 479

Query: 471 YDDGTEEVLNYANYEEYYYSDEDE----------------------EWRTNFFSEFGQHK 508
            +   E  +   + +E     E E                      E        F   K
Sbjct: 480 GNYSLEMDIASDSPDEDEEEFESEVDSIRKIPDKYVNIIVEEEEENETEKTVNLRFPGWK 539

Query: 509 TDYYHYKLDYAKVTNEVLRDQAECYIRAIQWNLHYYYNGCVSWSWYYPHHYAPYISDIKD 568
             YY  KL +   + E +RD A+ Y+  +QW L YYY GC SW WYYP+HYAP  +D+  
Sbjct: 540 ERYYTSKLHFTTDSEEKIRDMAKEYVEGLQWVLSYYYRGCPSWDWYYPYHYAPLAADLSK 599

Query: 569 FSHLNLKYEMGEPFLPFQQLLAVLPAASKELLPEPYQNLMTLENSPILEYYPEEFATDLN 628
            S  ++K+E+G PF PF+QL+AVLPA SK L+PE ++ LM  E SPI+++YPEEF  D+N
Sbjct: 600 LSDNDIKFELGTPFRPFEQLMAVLPARSKNLVPEGFRPLMDDEKSPIIDFYPEEFKLDMN 659

Query: 629 GKKQDWEAVVLIPFIDEVKLIKAMDAVEHLLTDEERSRNRHGPMLVYTYS-NVNLGTALA 687
           GK   W+AVVL+PFIDE +L+ A+      L++EER RN  G  L+++ +   +L     
Sbjct: 660 GKTASWQAVVLLPFIDERRLLSAVAVKYPTLSEEERKRNLRGLDLLFSSNKKSDL----- 714

Query: 688 PDYFPAINENHAVEKPLYRDDIYVHPNKLRKGLCQDVNLKLYVTHFPTLKHVQFTHKIEN 747
            + F  +  +   +K      I +   +   GL   V L       P L  +     +  
Sbjct: 715 SELFKDL-YSKCKQKE----YITMCSKESPYGLFGTVKLGAEGL-APNLLSLCPISFLSY 768

Query: 748 AKVKVFQQPSRNESII-IVQDKVQVPSLKQVADQILGKTIFINYPHLIE 795
             + VF + S+N+S   +++D     + K +  +   K I + +P+L E
Sbjct: 769 PGLMVFLEYSKNQSARLVIEDPKSTVTNKSIVLRGFIKPINVLWPYLRE 817


>gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus.  This family
           aligns residues towards the N-terminus of several
           proteins with multiple functions. The members of this
           family all appear to possess 5'-3' exonuclease activity
           EC:3.1.11.-. Thus, the aligned region may be necessary
           for 5' to 3' exonuclease function. The family also
           contains several Xrn1 and Xrn2 proteins. The 5'-3'
           exoribonucleases Xrn1p and Xrn2p/Rat1p function in the
           degradation and processing of several classes of RNA in
           Saccharomyces cerevisiae. Xrn1p is the main enzyme
           catalyzing cytoplasmic mRNA degradation in multiple
           decay pathways, whereas Xrn2p/Rat1p functions in the
           processing of rRNAs and small nucleolar RNAs (snoRNAs)
           in the nucleus.
          Length = 237

 Score =  382 bits (983), Expect = e-122
 Identities = 153/299 (51%), Positives = 188/299 (62%), Gaps = 63/299 (21%)

Query: 1   MGVPKFFRYISERYPCLSEVVREYQIP-EFDNLYLDMNGIIHNCSHPNDNDPHFRITEKQ 59
           MGVPKFFR++SERYP + + + E+QIP EFDNLYLDMNGIIH CSHP D    F  TE +
Sbjct: 1   MGVPKFFRWLSERYPKIIQPIIEHQIPDEFDNLYLDMNGIIHPCSHPEDK--PFPETEDE 58

Query: 60  IFQDIFHYIEILFRMIQPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEKG 119
           +F  IF YI+ LF +++P+KL +MAIDGVAPRAKMNQQR RRFR+AK+A+  E +A+E  
Sbjct: 59  MFVAIFEYIDRLFNIVRPRKLLYMAIDGVAPRAKMNQQRSRRFRAAKDAKEKEAEAEENR 118

Query: 120 ERLPTEKRFDFITLKFCFPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEK 179
           E L TE                                                   K +
Sbjct: 119 EELETEGIK---------------------------------------------LPEKVE 133

Query: 180 GERLPTEKRFDSNCITPGTPFMCRLHQQLRYFVKLKITTDKLWKDTLSLWVWKDVCVILS 239
            E      +FDSNCITPGTPFM RL + LRY++K K+ TD           W+++ VILS
Sbjct: 134 KE------KFDSNCITPGTPFMARLAKALRYYIKKKLNTDPE---------WQNLTVILS 178

Query: 240 GHETPGEGEHKIMDYIRYMKSKPDYNPNMRHCLYGLDADLIMLGLCTHEPYFSLLREEV 298
             E PGEGEHKIMD+IR  +++PDY+PN RHCLYGLDADLIMLGL THEP+FS+LREEV
Sbjct: 179 DAEVPGEGEHKIMDFIRSQRAQPDYDPNTRHCLYGLDADLIMLGLATHEPHFSVLREEV 237


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
            family represents a conserved region approximately 350
            residues long within a number of plant proteins of
            unknown function.
          Length = 357

 Score = 47.2 bits (112), Expect = 5e-05
 Identities = 30/134 (22%), Positives = 43/134 (32%), Gaps = 6/134 (4%)

Query: 1610 KEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQ 1669
             E+ Q  +      +    PP P+Q+A+   ++      Q         +    +    Q
Sbjct: 31   HEEAQSSEAHSFHVDSTKQPPAPEQVAK--HELADAPLQQVNAALPPAPAPQSPQPDQQQ 88

Query: 1670 PRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKP 1729
                P     +  Y   +P QQ Q        QQ  YYP      PPP QQ +  Q   P
Sbjct: 89   QSQAP----PSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPP 144

Query: 1730 PHPILNASSQPVPP 1743
            P        Q  P 
Sbjct: 145  PQVPQQQQYQSPPQ 158



 Score = 40.7 bits (95), Expect = 0.006
 Identities = 28/143 (19%), Positives = 34/143 (23%), Gaps = 19/143 (13%)

Query: 1616 QQTVEPEQ----NKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRS--TNAQ 1669
            +Q   PEQ         P      A P        +P          S            
Sbjct: 48   KQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQV 107

Query: 1670 PRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY-------YPQQVAYPPPPYQQAR 1722
                     Q   Y     Y    Q Q    QQ            PQQ  Y  PP QQ +
Sbjct: 108  QSVPQQPTPQQEPY-----YPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPP-QQPQ 161

Query: 1723 HPQYSKPPHPILNASSQPVPPDT 1745
            + Q   P        S   P ++
Sbjct: 162  YQQNPPPQAQSAPQVSGLYPEES 184



 Score = 40.3 bits (94), Expect = 0.008
 Identities = 37/148 (25%), Positives = 46/148 (31%), Gaps = 24/148 (16%)

Query: 1613 LQQQQTVEPEQ-NKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPR 1671
            LQQ     P     Q+  P   Q +Q         +  P  V+         +     P 
Sbjct: 66   LQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPP 125

Query: 1672 G--IPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYP--PPPYQQAR----- 1722
                P    Q     P  P Q  QQ QY    Q P   PQQ  Y   PPP  Q+      
Sbjct: 126  SQPQPPPAQQPQAQQPQPPPQVPQQQQY----QSP---PQQPQYQQNPPPQAQSAPQVSG 178

Query: 1723 -------HPQYSKPPHPILNASSQPVPP 1743
                   +   S PP+  L +S    PP
Sbjct: 179  LYPEESPYQPQSYPPNEPLPSSMAMQPP 206



 Score = 33.8 bits (77), Expect = 0.98
 Identities = 28/138 (20%), Positives = 39/138 (28%), Gaps = 15/138 (10%)

Query: 1615 QQQTVEP-EQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
            Q     P +Q  Q+ P  P     P        + Q  P        +      + P   
Sbjct: 142  QPPPQVPQQQQYQSPPQQPQYQQNPP------PQAQSAPQVSGLYPEESPYQPQSYPPNE 195

Query: 1674 PILGSQAAM--YNPSVPYQQ------QQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
            P+  S A    Y+ + P QQ         Y Y     +P         PPP   Q  +  
Sbjct: 196  PLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGY 255

Query: 1726 YSKPPHPILNASSQPVPP 1743
               PP    + S     P
Sbjct: 256  SGPPPSKGNHGSVASYAP 273


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
            of this family are necessary for accurate chromosome
            transmission during cell division.
          Length = 804

 Score = 45.5 bits (108), Expect = 3e-04
 Identities = 25/128 (19%), Positives = 32/128 (25%), Gaps = 16/128 (12%)

Query: 1613 LQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRG 1672
              QQ   E       +P  P    +       L  P              + +    P  
Sbjct: 191  FPQQGPPEQPPG---YPQPPQGHPEQVQPQQFLPAPSQ------------APAQPPLPPQ 235

Query: 1673 IPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHP 1732
            +P                QQ         QQ    PQ  A PPP  Q   HP   +  + 
Sbjct: 236  LP-QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNA 294

Query: 1733 ILNASSQP 1740
             L    QP
Sbjct: 295  PLPPPQQP 302



 Score = 44.0 bits (104), Expect = 8e-04
 Identities = 25/142 (17%), Positives = 38/142 (26%), Gaps = 5/142 (3%)

Query: 1603 EVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGP-VRGHTNSSD 1661
            +++ +E  E   QQ+   P+  +      P  +            P+  P          
Sbjct: 152  KMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHP 211

Query: 1662 LSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQA 1721
                            +Q  +  P    QQ    Q           P     P PP QQ 
Sbjct: 212  EQVQPQQFLPAPSQAPAQPPL--PPQLPQQPPPLQQPQFPGLSQQMPPPP--PQPPQQQQ 267

Query: 1722 RHPQYSKPPHPILNASSQPVPP 1743
            + PQ    P P    +  P  P
Sbjct: 268  QPPQPQAQPPPQNQPTPHPGLP 289



 Score = 41.3 bits (97), Expect = 0.006
 Identities = 17/85 (20%), Positives = 24/85 (28%), Gaps = 5/85 (5%)

Query: 1668 AQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
            AQ +               +P     + Q  + QQ P   P     PP  + +   PQ  
Sbjct: 161  AQLQQRQQAPQLPQPPQQVLPQGMPPR-QAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQF 219

Query: 1728 KPPHPILNASS---QPVPPDTTEDL 1749
              P P    +     P  P     L
Sbjct: 220  -LPAPSQAPAQPPLPPQLPQQPPPL 243



 Score = 40.9 bits (96), Expect = 0.008
 Identities = 22/137 (16%), Positives = 32/137 (23%), Gaps = 19/137 (13%)

Query: 1614 QQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
            Q  ++    Q         +++           +    P +               PR  
Sbjct: 138  QAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQ--------GMPPRQA 189

Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYY-------PQQVAYPPPPYQQARHPQY 1726
                       P  P    Q  Q +  Q QP  +       P Q   PP   QQ    Q 
Sbjct: 190  AF----PQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQ 245

Query: 1727 SKPPHPILNASSQPVPP 1743
             + P         P  P
Sbjct: 246  PQFPGLSQQMPPPPPQP 262



 Score = 36.3 bits (84), Expect = 0.19
 Identities = 25/118 (21%), Positives = 32/118 (27%), Gaps = 23/118 (19%)

Query: 1611 EDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQP 1670
              LQQ Q   P  ++Q  PP P    Q              P +              QP
Sbjct: 241  PPLQQPQF--PGLSQQMPPPPPQPPQQQQQPP---QPQAQPPPQN-------------QP 282

Query: 1671 RGIPILG-SQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
               P L   Q A   P    Q     Q    QQ+   + +Q         Q +    S
Sbjct: 283  TPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQ----LVQLSQQQREALS 336



 Score = 32.8 bits (75), Expect = 2.2
 Identities = 21/138 (15%), Positives = 32/138 (23%), Gaps = 8/138 (5%)

Query: 1611 EDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQP 1670
                 + +     N+ A               TS   P      G+  S+D     + QP
Sbjct: 65   GGETAKLSAAVRYNQNAPGAPSVGPDSDLSQKTSTFSPCQ---SGYEASTDPEYIPDLQP 121

Query: 1671 R-----GIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
                    P    Q      S P  Q    +    ++      Q+   P  P    +   
Sbjct: 122  DPSLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLP 181

Query: 1726 YSKPPHPILNASSQPVPP 1743
               PP         P   
Sbjct: 182  QGMPPRQAAFPQQGPPEQ 199


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 45.1 bits (106), Expect = 4e-04
 Identities = 37/135 (27%), Positives = 45/135 (33%), Gaps = 26/135 (19%)

Query: 1619 VEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPR-GIPILG 1677
            VEP Q  Q       Q  QP          QP             +    QP+  +    
Sbjct: 749  VEPVQQPQQPVAPQQQYQQP---------QQPVA----------PQPQYQQPQQPVAPQP 789

Query: 1678 SQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPP----HPI 1733
                   P  P  Q QQ Q     Q     PQQ   P P YQQ + P   +P     HP+
Sbjct: 790  QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPL 849

Query: 1734 L--NASSQPVPPDTT 1746
            L  N  S+P+   TT
Sbjct: 850  LMRNGDSRPLHKPTT 864



 Score = 43.5 bits (102), Expect = 0.001
 Identities = 31/132 (23%), Positives = 42/132 (31%), Gaps = 10/132 (7%)

Query: 1617 QTVEP--EQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIP 1674
            QT EP      + +P    Q AQP         P   PV+        +    AQ     
Sbjct: 367  QTGEPVIAPAPEGYPQQS-QYAQPAVQYNE---PLQQPVQPQQPYYAPAAEQPAQQPYYA 422

Query: 1675 ILGSQAAMYNPSVPYQQQ--QQYQYYYNQQQPMYYPQQVAYPPPPYQQ--ARHPQYSKPP 1730
                Q A      P  +Q      +   +QQ  + PQ        YQQ  A+ P Y +P 
Sbjct: 423  PAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQ 482

Query: 1731 HPILNASSQPVP 1742
                    +P P
Sbjct: 483  PVEQQPVVEPEP 494



 Score = 43.1 bits (101), Expect = 0.002
 Identities = 22/71 (30%), Positives = 24/71 (33%)

Query: 1679 QAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASS 1738
            Q     P  P   Q QYQ       P    QQ   P  P  Q + PQ    P P      
Sbjct: 775  QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ 834

Query: 1739 QPVPPDTTEDL 1749
            QPV P   + L
Sbjct: 835  QPVAPQPQDTL 845



 Score = 41.2 bits (96), Expect = 0.007
 Identities = 29/140 (20%), Positives = 42/140 (30%), Gaps = 7/140 (5%)

Query: 1614 QQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
            QQ Q  +P          P Q  QP     +    Q                    P   
Sbjct: 382  QQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQ- 440

Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPI 1733
            P+ G+          +  Q  YQ     QQP    Q+  Y  P   +    Q    P P+
Sbjct: 441  PVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPA--AQEPLYQQPQPVE---QQPVVEPEPV 495

Query: 1734 LNASSQPVPPDTT-EDLEER 1752
            +  +    PP    E++EE+
Sbjct: 496  VEETKPARPPLYYFEEVEEK 515



 Score = 40.1 bits (93), Expect = 0.013
 Identities = 34/138 (24%), Positives = 42/138 (30%), Gaps = 7/138 (5%)

Query: 1615 QQQTVEPEQNKQAFPPHPDQLA-QPGDDVTS------LIRPQPGPVRGHTNSSDLSRSTN 1667
            QQQ    EQ   A P   D     P   +        L  P   PV+         +   
Sbjct: 707  QQQRYSGEQPAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQ 766

Query: 1668 AQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
               + +           P  P  Q QQ Q     Q     PQQ   P P YQQ + P   
Sbjct: 767  QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP 826

Query: 1728 KPPHPILNASSQPVPPDT 1745
            +P +        P P DT
Sbjct: 827  QPQYQQPQQPVAPQPQDT 844



 Score = 32.7 bits (74), Expect = 2.2
 Identities = 32/139 (23%), Positives = 42/139 (30%), Gaps = 11/139 (7%)

Query: 1619 VEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGS 1678
            VEP              AQP    T   +P PGP  G    +        Q +       
Sbjct: 337  VEPVTQTPPVASVDVPPAQP----TVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQ 392

Query: 1679 QAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS-KPPHPILNAS 1737
                     P QQ  Q Q  Y         QQ  Y P P Q A+ P Y+  P  P+   +
Sbjct: 393  YNE------PLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNA 446

Query: 1738 SQPVPPDTTEDLEERADSY 1756
             Q     +T   +    + 
Sbjct: 447  WQAEEQQSTFAPQSTYQTE 465


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
            includes the B. subtilis YppG protein, which is
            functionally uncharacterized. This family of proteins is
            found in bacteria. Proteins in this family are typically
            between 115 and 181 amino acids in length. There are two
            completely conserved residues (F and G) that may be
            functionally important.
          Length = 110

 Score = 40.9 bits (96), Expect = 6e-04
 Identities = 17/62 (27%), Positives = 19/62 (30%), Gaps = 3/62 (4%)

Query: 1682 MYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPV 1741
              N +    Q QQ Q Y  Q      P     P  P QQ +       P P    S Q  
Sbjct: 4    QQNTNQYPPQNQQQQPYQQQPYHQQMPPP---PYSPPQQQQGHFMPPQPQPYPKQSPQQQ 60

Query: 1742 PP 1743
             P
Sbjct: 61   QP 62



 Score = 34.7 bits (80), Expect = 0.080
 Identities = 15/49 (30%), Positives = 17/49 (34%), Gaps = 4/49 (8%)

Query: 1697 YYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPP--HPILNASSQPVPP 1743
              Y Q    Y PQ       PYQQ  + Q   PP   P        +PP
Sbjct: 1    NPYQQNTNQYPPQN--QQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPP 47



 Score = 32.4 bits (74), Expect = 0.55
 Identities = 13/46 (28%), Positives = 15/46 (32%)

Query: 1682 MYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
               P         Y     QQ     PQ   YP    QQ + PQ+S
Sbjct: 21   QQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFS 66



 Score = 32.4 bits (74), Expect = 0.63
 Identities = 20/65 (30%), Positives = 24/65 (36%), Gaps = 5/65 (7%)

Query: 1678 SQAAMYNPSVPYQQQ--QQYQYYYNQQQPMYYP--QQVAYPPPPYQQARHPQYSKPPHPI 1733
             Q     P    QQQ  QQ  Y+     P Y P  QQ  +  PP  Q  +P+ S      
Sbjct: 4    QQNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQP-YPKQSPQQQQP 62

Query: 1734 LNASS 1738
               SS
Sbjct: 63   PQFSS 67


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
            non-fungal.  The approx. 70 residue Med15 domain of the
            ARC-Mediator co-activator is a three-helix bundle with
            marked similarity to the KIX domain. The sterol
            regulatory element binding protein (SREBP) family of
            transcription activators use the ARC105 subunit to
            activate target genes in the regulation of cholesterol
            and fatty acid homeostasis. In addition, Med15 is a
            critical transducer of gene activation signals that
            control early metazoan development.
          Length = 768

 Score = 42.7 bits (100), Expect = 0.002
 Identities = 29/130 (22%), Positives = 37/130 (28%), Gaps = 19/130 (14%)

Query: 1614 QQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
            QQQQ      +  A     +Q    G  + +L        +G           N   +G 
Sbjct: 356  QQQQQQGG-NHPAAHQQQMNQQVGQGGQMVALGYLNIQGNQGGLG-------ANPMQQGQ 407

Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY-YPQQVAYPPPPYQQARHPQYSKPPHP 1732
            P +     M +PS P  Q Q  Q      QP    P      PP            P  P
Sbjct: 408  PGM-----MSSPS-PVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQS----VSGGMIPSPP 457

Query: 1733 ILNASSQPVP 1742
             L  S  P  
Sbjct: 458  ALMPSPSPQM 467



 Score = 33.8 bits (77), Expect = 1.1
 Identities = 24/122 (19%), Positives = 29/122 (23%), Gaps = 8/122 (6%)

Query: 1604 VMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQ---PGPVRGHTNSS 1660
            +M     +   QQ      Q  QA              V     PQ   PG   G     
Sbjct: 150  MMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQG- 208

Query: 1661 DLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQ-QQYQYYYNQQQPMYYPQQVAYPPPPYQ 1719
               +       G     +           QQQ  Q Q   + Q  M   QQ      P Q
Sbjct: 209  ---QMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQ 265

Query: 1720 QA 1721
              
Sbjct: 266  MQ 267



 Score = 32.7 bits (74), Expect = 2.6
 Identities = 22/141 (15%), Positives = 36/141 (25%), Gaps = 14/141 (9%)

Query: 1613 LQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRG 1672
                   + +Q  Q       Q  Q    + +  + Q    +G  ++         Q +G
Sbjct: 202  PGGGGQGQMQQQGQPGGQ-QQQNPQMQQQLQNQQQQQMDQQQGPADAQ-AQMGQQQQGQG 259

Query: 1673 IPILGSQAAMYNPSVPYQQQQQYQYYYNQ-----QQPMYY-------PQQVAYPPPPYQQ 1720
                             QQ  Q Q   +Q      Q           P Q   PPP    
Sbjct: 260  GMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPG 319

Query: 1721 ARHPQYSKPPHPILNASSQPV 1741
            A           +++A  Q +
Sbjct: 320  AVPQGGQAVQQGVMSAGQQQL 340



 Score = 31.9 bits (72), Expect = 4.8
 Identities = 21/117 (17%), Positives = 30/117 (25%), Gaps = 10/117 (8%)

Query: 1614 QQQQTVEPEQNKQAFPPHPD-QLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRG 1672
            QQ Q  + +   Q  PP    Q +Q G     + +   G   G              P+ 
Sbjct: 264  QQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQP------MGPPPQR 317

Query: 1673 IPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKP 1729
               +          V    QQQ       +      QQ        Q   HP   + 
Sbjct: 318  PGAVPQGGQAVQQGVMSAGQQQL---KQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQ 371



 Score = 30.7 bits (69), Expect = 9.8
 Identities = 42/204 (20%), Positives = 64/204 (31%), Gaps = 23/204 (11%)

Query: 1538 MVQQSPHLLEKNKELNTPSGGELSFEEKHSMA-AKSKNETTPTIK---TEMRKIISSSDY 1593
            +V +    L KN + +T +  E+   E H  A A++K+E    +       R +   +  
Sbjct: 3    VVSKIEEALRKNGQTHTKNAREM---ENHVFAKARTKDEYLGLVARLIIHFRDMSKKAAQ 59

Query: 1594 QGLEEEEEEEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTS-------LI 1646
            Q + +  +     I   ++L  Q T  P+       P      Q G   T+        +
Sbjct: 60   QQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNV 119

Query: 1647 RPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY 1706
            R Q            +SR    QP      G QA         Q Q Q     NQ  P  
Sbjct: 120  RGQMPMGAAGMGPHQMSRVGTMQP------GGQAGGMMQQSSGQPQSQQP---NQMGPQQ 170

Query: 1707 YPQQVAYPPPPYQQARHPQYSKPP 1730
               Q         Q       +PP
Sbjct: 171  GQAQGQAGGMNQGQQGPVGQQQPP 194


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the protein
            product of the dentatorubral-pallidoluysian atrophy
            (DRPLA) gene. DRPLA OMIM:125370 is a progressive
            neurodegenerative disorder. It is caused by the expansion
            of a CAG repeat in the DRPLA gene on chromosome 12p. This
            results in an extended polyglutamine region in
            atrophin-1, that is thought to confer toxicity to the
            protein, possibly through altering its interactions with
            other proteins. The expansion of a CAG repeat is also the
            underlying defect in six other neurodegenerative
            disorders, including Huntington's disease. One
            interaction of expanded polyglutamine repeats that is
            thought to be pathogenic is that with the short glutamine
            repeat in the transcriptional coactivator CREB binding
            protein, CBP. This interaction draws CBP away from its
            usual nuclear location to the expanded polyglutamine
            repeat protein aggregates that are characteristic of the
            polyglutamine neurodegenerative disorders. This
            interferes with CBP-mediated transcription and causes
            cytotoxicity.
          Length = 979

 Score = 41.2 bits (96), Expect = 0.006
 Identities = 53/233 (22%), Positives = 89/233 (38%), Gaps = 19/233 (8%)

Query: 1520 SSSNDTEAENTFKRAISAMVQQSPHLLEKNKELNTPSGGELSFEEKHSMAAKSKNETTPT 1579
            +SSND++AE+T K+    + +++   L+  K        +    E+ + A KSK +    
Sbjct: 52   TSSNDSKAEST-KKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVT-AKKSKTQ---- 105

Query: 1580 IKTEMRKIISSSDYQGLEEEEEEEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPG 1639
               E+ +  S S+ +G  E E E      + E+         + N+ + P  P       
Sbjct: 106  ---ELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNES 162

Query: 1640 DDVTS----LIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQY 1695
            D  +S    L++PQ  P       + L+ S          +  Q +    + P  Q QQ 
Sbjct: 163  DSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSP-IAAQPAPQPQQP 221

Query: 1696 QYYYNQQQPMYYPQQVAYPPPPYQ----QARHPQYSKPPHPILNAS-SQPVPP 1743
                    P  +PQ++  P PP Q      + PQ   P      +S   P PP
Sbjct: 222  SPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPP 274


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
            role in the biomineralisation of teeth.  They seem to
            regulate formation of crystallites during the secretory
            stage of tooth enamel development and are thought to play
            a major role in the structural organisation and
            mineralisation of developing enamel. The extracellular
            matrix of the developing enamel comprises two major
            classes of protein: the hydrophobic amelogenins and the
            acidic enamelins. Circular dichroism studies of porcine
            amelogenin have shown that the protein consists of 3
            discrete folding units: the N-terminal region appears to
            contain beta-strand structures, while the C-terminal
            region displays characteristics of a random coil
            conformation. Subsequent studies on the bovine protein
            have indicated the amelogenin structure to contain a
            repetitive beta-turn segment and a "beta-spiral" between
            Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich
            region. The beta-spiral offers a probable site for
            interactions with Ca2+ ions. Muatations in the human
            amelogenin gene (AMGX) cause X-linked hypoplastic
            amelogenesis imperfecta, a disease characterised by
            defective enamel. A 9bp deletion in exon 2 of AMGX
            results in the loss of codons for Ile5, Leu6, Phe7 and
            Ala8, and replacement by a new threonine codon,
            disrupting the 16-residue (Met1-Ala16) amelogenin signal
            peptide.
          Length = 165

 Score = 38.2 bits (89), Expect = 0.015
 Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 5/61 (8%)

Query: 1683 YNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVP 1742
            + P++P   QQ +Q    Q      P Q    P P      P   +PP P +    QP+P
Sbjct: 88   HQPNLPQPAQQPFQPQPLQPPQPQQPMQ----PQPPVHPIPPLPPQPPLPPM-FPMQPLP 142

Query: 1743 P 1743
            P
Sbjct: 143  P 143



 Score = 34.8 bits (80), Expect = 0.17
 Identities = 21/75 (28%), Positives = 25/75 (33%), Gaps = 10/75 (13%)

Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQ-QPMYYPQQV--AYPPPPYQQARHPQYSKPP 1730
             +  +Q    N   P QQ  Q Q     Q Q    PQ      PP P Q    P +   P
Sbjct: 81   SMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQP 140

Query: 1731 HPILNASSQPVPPDT 1745
             P       P+ PD 
Sbjct: 141  LP-------PLLPDL 148



 Score = 30.9 bits (70), Expect = 3.7
 Identities = 20/74 (27%), Positives = 23/74 (31%), Gaps = 4/74 (5%)

Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQ---QQPMY-YPQQVAYPPPPYQQARHPQYSKP 1729
            P    Q   + P +P QQ    Q        Q  M          P P QQ   PQ  +P
Sbjct: 48   PTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQP 107

Query: 1730 PHPILNASSQPVPP 1743
            P P      QP   
Sbjct: 108  PQPQQPMQPQPPVH 121


>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo red.
             RCR proteins are ER membrane proteins that regulate
            chitin deposition in fungal cell walls. Although chitin,
            a linear polymer of beta-1,4-linked N-acetylglucosamine,
            constitutes only 2% of the cell wall it plays a vital
            role in the overall protection of the cell wall against
            stress, noxious chemicals and osmotic pressure changes.
            Congo red is a cell wall-disrupting benzidine-type dye
            extensively used in many cell wall mutant studies that
            specifically targets chitin in yeast cells and inhibits
            growth. RCR proteins render the yeasts resistant to Congo
            red by diminishing the content of chitin in the cell
            wall. RCR proteins are probably regulating chitin
            synthase III interact directly with ubiquitin ligase
            Rsp5, and the VPEY motif is necessary for this, via
            interaction with the WW domains of Rsp5.
          Length = 124

 Score = 37.1 bits (86), Expect = 0.015
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 6/48 (12%)

Query: 1700 NQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTE 1747
               QP+Y    +A PP   Q  +  QY+  P        Q VPP T  
Sbjct: 29   RGLQPIYGTAWLA-PPSYRQSQQ--QYNNTPQG---RPPQYVPPYTET 70



 Score = 36.3 bits (84), Expect = 0.030
 Identities = 26/88 (29%), Positives = 31/88 (35%), Gaps = 11/88 (12%)

Query: 1674 PILGSQAAMYNPSVPYQQQQQY----QYYYNQQQPMYYPQQVAYPPPPY--QQARHPQYS 1727
            PI G+ A +  PS   Q QQQY    Q    Q  P Y           Y  Q   HP   
Sbjct: 33   PIYGT-AWLAPPS-YRQSQQQYNNTPQGRPPQYVPPYTETANENDLGYYDGQGEFHPNPK 90

Query: 1728 KPPH---PILNASSQPVPPDTTEDLEER 1752
             P     P  NA  +     TT+D  + 
Sbjct: 91   APAIELQPPPNAYERGTRSPTTDDEYQP 118


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
            2, 3, 4 family.  These eukaryotic proteins recognize the
            poly-A of mRNA and consists of four tandem RNA
            recognition domains at the N-terminus (rrm: pfam00076)
            followed by a PABP-specific domain (pfam00658) at the
            C-terminus. The protein is involved in the transport of
            mRNA's from the nucleus to the cytoplasm. There are four
            paralogs in Homo sapiens which are expressed in testis
            (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
            broadly expressed (SP:P11940_PABP1) and of unknown tissue
            range (SP:Q15097_PABP2).
          Length = 562

 Score = 37.5 bits (87), Expect = 0.074
 Identities = 17/91 (18%), Positives = 23/91 (25%), Gaps = 16/91 (17%)

Query: 1670 PRGIPILGSQAA-MYNPSVPYQQQQQYQYYYNQQQPMYYPQQ-------VAYPPPP---- 1717
            P G P+ G+     Y    P QQ       + +   M  P             P      
Sbjct: 386  PMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRA 445

Query: 1718 ---YQQARHPQYSKPPHPIL-NASSQPVPPD 1744
                 Q    +    P     N  S P+  D
Sbjct: 446  PSRNAQNAAQKPPMQPVMYPPNYQSLPLSQD 476


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 36.9 bits (86), Expect = 0.088
 Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%)

Query: 1647 RPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY 1706
               P   + H  ++  S     QP        Q     P      Q   +      QP+ 
Sbjct: 72   NHAPANAQEH-EAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQ 130

Query: 1707 YPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTT 1746
             P     P  P QQ   PQ +  P P     S P P    
Sbjct: 131  QPAYQPQPEQPLQQPVSPQVAPAPQP---VHSAPQPAQQA 167



 Score = 33.9 bits (78), Expect = 0.75
 Identities = 24/128 (18%), Positives = 36/128 (28%), Gaps = 23/128 (17%)

Query: 1620 EPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQ 1679
               Q  +A  P P    QP          QP         + +      +P       + 
Sbjct: 76   ANAQEHEAARPSPQHQYQP-----PYASAQPRQPVQQPPEAQVPPQHAPRP----AQPAP 126

Query: 1680 AAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS----KPPHPILN 1735
              +  P   YQ Q +        QP+  P      P P      PQ +    +P  P+  
Sbjct: 127  QPVQQP--AYQPQPE--------QPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAA 176

Query: 1736 ASSQPVPP 1743
               +PV  
Sbjct: 177  PQPEPVAE 184


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 614

 Score = 36.7 bits (85), Expect = 0.14
 Identities = 18/88 (20%), Positives = 26/88 (29%), Gaps = 15/88 (17%)

Query: 1661 DLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPY-- 1718
            D S     +    P+    AA   PS                     P Q +    P   
Sbjct: 368  DASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPS-------------PSQSSAAAQPSAP 414

Query: 1719 QQARHPQYSKPPHPILNASSQPVPPDTT 1746
            Q A  P  + P   +   ++ PV P +T
Sbjct: 415  QSATQPAGTPPTVSVDPPAAVPVNPPST 442


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 36.3 bits (84), Expect = 0.14
 Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 7/84 (8%)

Query: 1805 AKKADSVEICARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQTDKN 1864
            A KA + +I       + T++   AP    A+ V  +E     AA+P   + E +   K 
Sbjct: 208  ASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPVPPSE-----AARPKSADAETNAALKT 262

Query: 1865 --TAAAAEKKEKKAPRIAAKFKTP 1886
              TA  A  K  KAP       T 
Sbjct: 263  PATAPKAAAKNAKAPEAQPVSGTA 286


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 35.0 bits (81), Expect = 0.28
 Identities = 13/67 (19%), Positives = 16/67 (23%), Gaps = 9/67 (13%)

Query: 1685 PSVPY----QQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQP 1740
              VPY     Q  +      QQ     P      P P    R         P+   +  P
Sbjct: 103  SEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTT-----APVQPQTPAP 157

Query: 1741 VPPDTTE 1747
            V      
Sbjct: 158  VRTQPAA 164



 Score = 30.8 bits (70), Expect = 6.5
 Identities = 17/76 (22%), Positives = 22/76 (28%), Gaps = 3/76 (3%)

Query: 1683 YN---PSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQ 1739
            YN   P VP    Q  Q    QQ P    Q     PP    A     +  P     A+  
Sbjct: 107  YNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPV 166

Query: 1740 PVPPDTTEDLEERADS 1755
                +  +   E+   
Sbjct: 167  TQAVEAPKVEAEKEKE 182


>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein; Provisional.
          Length = 177

 Score = 34.1 bits (78), Expect = 0.36
 Identities = 14/57 (24%), Positives = 20/57 (35%), Gaps = 1/57 (1%)

Query: 1675 ILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQ-QVAYPPPPYQQARHPQYSKPP 1730
            +LG +     P+      QQ   +   QQP    Q        P Q A     ++PP
Sbjct: 112  MLGGRQGGGAPAGGNIGGQQQGGWGQPQQPQGGNQFSGGAQSRPQQSAPAAPSNEPP 168


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
            Provisional.
          Length = 617

 Score = 34.8 bits (80), Expect = 0.45
 Identities = 26/121 (21%), Positives = 35/121 (28%), Gaps = 19/121 (15%)

Query: 1618 TVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILG 1677
            TV+P     A  P P            L RP   P  G+       R             
Sbjct: 87   TVDP----SAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPR------Q 136

Query: 1678 SQAAMYNPSVPYQQQQQY--------QYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKP 1729
             Q     P+ P  QQ+            Y  QQQ + +P +  Y  P    A   +  + 
Sbjct: 137  DQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPA-SYAPEQERDRE 195

Query: 1730 P 1730
            P
Sbjct: 196  P 196


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
            family of proteins are found in large numbers in the
            Trichomonas vaginalis proteome. The function of this
            protein is unknown.
          Length = 422

 Score = 34.6 bits (79), Expect = 0.46
 Identities = 24/102 (23%), Positives = 36/102 (35%), Gaps = 10/102 (9%)

Query: 1688 PYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTE 1747
            P   QQ       QQ  +  P Q     P  Q  +  Q   P +P   A   P      +
Sbjct: 201  PKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNP---AQQPPQTEQGHK 257

Query: 1748 DLEERADSYLAKLLRLTRAHNLGAPDYTFSMDASKLYVATVR 1789
               E+ +    K+L+     + G PD   ++D S  Y   +R
Sbjct: 258  RSREQGNQEFLKMLK----EDYGYPD---TIDFSDRYKEAIR 292



 Score = 33.5 bits (76), Expect = 1.2
 Identities = 22/97 (22%), Positives = 31/97 (31%), Gaps = 11/97 (11%)

Query: 1629 PPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVP 1688
            PP+P + A           P PG  +  T+S         +P   P + + A       P
Sbjct: 173  PPNPPREA-----------PAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNP 221

Query: 1689 YQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
             QQ QQ       Q       Q     PP  +  H +
Sbjct: 222  AQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKR 258


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
            characterized by several short PY and PT-like motifs of
            the PPPPY form. These appear to bind directly to the WW
            domains of WWP1 and WWP2 and other such diverse proteins
            as dystrophin and YAP (Yes-associated protein). This is
            the WW-domain binding protein WWbp via PY and PY_like
            motifs. The presence of a phosphotyrosine residue in the
            pWBP-1 peptide abolishes WW domain binding which suggests
            a potential regulatory role for tyrosine phosphorylation
            in modulating WW domain-ligand interactions. Given the
            likelihood that WWP1 and WWP2 function as E3
            ubiquitin-protein ligases, it is possible that initial
            substrate-specific recognition occurs via WW
            domain-substrate protein interaction followed by
            ubiquitin transfer and subsequent proteolysis. This
            domain lies just downstream of the GRAM (pfam02893) in
            many members.
          Length = 111

 Score = 32.8 bits (75), Expect = 0.48
 Identities = 15/46 (32%), Positives = 15/46 (32%), Gaps = 2/46 (4%)

Query: 1704 PMYYPQQVAYPPPPYQQARHPQYSKPPHP--ILNASSQPVPPDTTE 1747
              Y       PPPPY  A    Y  PP P     A  Q  PP    
Sbjct: 63   GQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPG 108



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 11/68 (16%), Positives = 17/68 (25%)

Query: 1259 QPQRTTFYKENAPLVNSNLQVTSPYHVKVSRCNSQQSSVSQSPSSTMISQQSSPFNVKPT 1318
            QP          P    +L+    Y    +                +      P +    
Sbjct: 38   QPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMA 97

Query: 1319 EKTEPPPP 1326
            +  EPPPP
Sbjct: 98   DPQEPPPP 105


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family consists
            of several Mycoplasma species specific Cytadhesin P32 and
            P30 proteins. P30 has been found to be membrane
            associated and localised on the tip organelle. It is
            thought that it is important in cytadherence and
            virulence.
          Length = 279

 Score = 33.9 bits (77), Expect = 0.63
 Identities = 29/151 (19%), Positives = 50/151 (33%), Gaps = 21/151 (13%)

Query: 1594 QGLEEEEEEEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPV 1653
            Q + E+ E++ + I+  E++  Q+  +P     A  P P Q+            P P   
Sbjct: 117  QRISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQP-QVQPQ-------FGPNPQQ- 167

Query: 1654 RGHTNSSDLSRSTNAQPR-GIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVA 1712
            R +          N   R G     +Q   + P +P  Q +     +N    M  P +  
Sbjct: 168  RINPQRFGFPMQPNMGMRPGF----NQMPPHMPGMPPNQMRP---GFNPMPGM--PPRPG 218

Query: 1713 YPPPPYQQARHPQYSKPPHPILNASSQPVPP 1743
            +   P       +    P P     + P  P
Sbjct: 219  FNQNPNMMPNMNRPGFRPQP--GGFNHPGTP 247


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 34.3 bits (78), Expect = 0.74
 Identities = 25/97 (25%), Positives = 29/97 (29%), Gaps = 7/97 (7%)

Query: 1651 GPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQ 1710
               R    +     S   QP  +  +  Q  M  P  P QQ      +      M     
Sbjct: 591  RLARLRAEAQPYQASVEVQPPQLTQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGV 650

Query: 1711 VAYPPP----PYQQARHPQYSKPPHPILNASSQPVPP 1743
             A  P     P QQ   P     P   L AS  PVPP
Sbjct: 651  PAMQPQYFDLPLQQ---PISQGAPLAPLRASMGPVPP 684



 Score = 32.0 bits (72), Expect = 3.9
 Identities = 22/109 (20%), Positives = 31/109 (28%), Gaps = 18/109 (16%)

Query: 1645 LIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQP 1704
            L+R      R        S  T   PR             P V   + +  Q    +   
Sbjct: 391  LMRAGKLTERAREALEKASEPTYGTPR-------------PPVEKPRPEVPQSL--ETAT 435

Query: 1705 MYYPQQVAYPPPPYQQARHP---QYSKPPHPILNASSQPVPPDTTEDLE 1750
             +   QV  PPP +     P   Q+S  P P+      P+      D  
Sbjct: 436  SHGSAQVPEPPPVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQL 484


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 33.7 bits (77), Expect = 1.2
 Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 14/99 (14%)

Query: 1288 SRCNSQQSSVSQSPSSTMISQQSSPFNVKPTEKTEPPPPHK------LNQVNTSSAPKEK 1341
            S   S   S S+S SS+  S+++ P     + + EPP  +K      LN+VN       +
Sbjct: 405  SGSESSSGSDSESESSSSDSEENEPPRT-ASPEPEPPSTNKWQLDNWLNKVNPHKVSPAE 463

Query: 1342 PRTSE-------QNQIKSNGNEQKVIINKEERCIFSNAT 1373
              +S        + + K   +  +     +E    S++ 
Sbjct: 464  SVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSSSK 502


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ associated
            protein 2 has a highly conserved sequence throughout
            evolution including a conserved polyproline region and
            several SH2/SH3 binding sites. It occurs as a single copy
            gene with a four-exon organisation and is located on
            chromosome 12. It encodes a ubiquitously expressed
            protein and binds to DAZ and DAZL1 through DAZ repeats.
          Length = 136

 Score = 32.1 bits (73), Expect = 1.2
 Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 2/33 (6%)

Query: 1713 YP--PPPYQQARHPQYSKPPHPILNASSQPVPP 1743
            YP  PP Y +   P Y  PP   +  +S P P 
Sbjct: 1    YPDAPPAYSELYQPSYVVPPQAQMPQASAPYPG 33



 Score = 29.4 bits (66), Expect = 7.8
 Identities = 18/70 (25%), Positives = 22/70 (31%), Gaps = 7/70 (10%)

Query: 1679 QAAMYNPS--VPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNA 1736
             + +Y PS  VP Q Q            MY P        P        Y    +PI  A
Sbjct: 8    YSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAY----YPI-GA 62

Query: 1737 SSQPVPPDTT 1746
                 PP +T
Sbjct: 63   PPPVYPPGST 72


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
            partitioning].
          Length = 324

 Score = 33.3 bits (76), Expect = 1.2
 Identities = 20/106 (18%), Positives = 32/106 (30%), Gaps = 4/106 (3%)

Query: 1651 GPVRGHTNSSDLSRSTNAQP-RGIPILGSQA--AMYNPSVPYQQQQQYQYYYNQQQPMYY 1707
            G VR    +     +   +  R  P    Q   A     +P  Q  Q        QP   
Sbjct: 62   GEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQP 121

Query: 1708 PQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERA 1753
                  PP   +Q   PQ +  P P+ ++++      T +      
Sbjct: 122  ALDQEQPPEEARQPVLPQEAPAPQPV-HSAAPQPAVQTVQPAVPEQ 166



 Score = 32.5 bits (74), Expect = 1.9
 Identities = 32/226 (14%), Positives = 53/226 (23%), Gaps = 32/226 (14%)

Query: 1581 KTEMRKIISSSDYQGLEEEEE-EEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPG 1639
               M+       Y  + ++E   EV ++   E  Q  Q  E  +     P H  Q     
Sbjct: 40   LKRMKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAAR---QSPQHQYQPEYAS 96

Query: 1640 DDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYY 1699
              +   +   P            S          P L  +        P   Q+      
Sbjct: 97   AQIKIPVPQPPQI----------SDPPAHPQPTQPALDQEQPPEEARQPVLPQEA-PAPQ 145

Query: 1700 NQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERADSYLAK 1759
                    P      P   +Q    Q  +   P      +   P   + +          
Sbjct: 146  PVHSAAPQPAVQTVQPAVPEQQ--VQPEEVVEPA----PEVKRPPRKDTV---------I 190

Query: 1760 LLRLTRAHN--LGAPDYTFSMDASKLYVATVRICHPHSDIVVRGKP 1803
            +L +   H           S+  S      + I H H  +   G  
Sbjct: 191  ILNVAAHHESEFNGEKLLQSIQQSGFIFGDMNIFHRHLSLSGSGPV 236


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 33.7 bits (77), Expect = 1.3
 Identities = 36/238 (15%), Positives = 68/238 (28%), Gaps = 23/238 (9%)

Query: 1431 KEVEEITNGTQTITMNDLYQSANKRSENGFNIPSISDLIKQNNITAKTDYPTTSTFPEVK 1490
             E + I+N               K  EN                        TS   ++K
Sbjct: 2438 NETKRISNVDAFTNKILQDIDNEKNKENN---------NMNAEKIDDLIENVTSHNEKIK 2488

Query: 1491 YTNSPSDNGGRTATPDTSFTTKLTNLFNRSSSNDTEAENTFKRAISAMVQQSPHLLEKNK 1550
                  ++  R          KL N    +++N+  +        + ++ +    + K  
Sbjct: 2489 SELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLN 2548

Query: 1551 ELNTPSGGELSFEEKHSMAAKSKNETTPTIKTEMRKIISSSDYQGLEEEEEEEVMIIEIK 1610
            EL +    E+   E   +    K            KI  S   +   E E +E    E +
Sbjct: 2549 ELLSYIDNEIKELENEKLKLLEKA-----------KIEESRKERERIESETQEDNTDEEQ 2597

Query: 1611 EDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNA 1668
             + QQQ+ ++ E+ ++A   +  +        T             +N    S   N 
Sbjct: 2598 INRQQQERLQKEEEQKA---YSQERLNREVSGTDDTNKNHNTGHDESNYGRYSNKRND 2652


>gnl|CDD|216209 pfam00946, Mononeg_RNA_pol, Mononegavirales RNA dependent RNA
            polymerase.  Members of the Mononegavirales including the
            Paramyxoviridae, like other non-segmented negative strand
            RNA viruses, have an RNA-dependent RNA polymerase
            composed of two subunits, a large protein L and a
            phosphoprotein P. This is a protein family of the L
            protein. The L protein confers the RNA polymerase
            activity on the complex. The P protein acts as a
            transcription factor.
          Length = 1065

 Score = 32.6 bits (75), Expect = 2.4
 Identities = 33/152 (21%), Positives = 48/152 (31%), Gaps = 13/152 (8%)

Query: 1443 ITMNDLYQSANKR--SENGFNIPSISDLIKQNNITAKTDYPTTSTFPEVKYTNSPSDNGG 1500
              +ND     N R     G  +P I  L    N   K       +  E K    PS N  
Sbjct: 26   YLLNDKTLLRNIRWDRIRGEELPLIEHLRTLQNWLNKW----LLSLGEYKVVPYPSINPE 81

Query: 1501 RTATPDTSFTTKLTNLFNRSSSNDTEAENTFKRAISAMVQQSPHLLEKNKELNTPSGGEL 1560
                 D   +++L  L       + E      +    +      L  K+K   T      
Sbjct: 82   LLRIVDPHDSSQLEELLKLV---NRELSIISSKVEDFLDDLGYKLKGKDKLFLTSLAPLC 138

Query: 1561 -SFEEKHSMAAKSKN---ETTPTIKTEMRKII 1588
             S  + H++          T  TIKTE+R +I
Sbjct: 139  NSILDLHAVMLILNWYDFSTWFTIKTELRSLI 170


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
            Provisional.
          Length = 376

 Score = 32.2 bits (73), Expect = 2.5
 Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 1292 SQQSSVSQSPSSTMISQQSSPFNVKPTE-KTEPPPPHKLNQVNTSSAPKEKPRTSE 1346
              +     +P STMI+  + PF+  P +   +PP   +  Q     + + +P  +E
Sbjct: 52   RSKKENEPAPPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNE 107


>gnl|CDD|235903 PRK06991, PRK06991, ferredoxin; Provisional.
          Length = 270

 Score = 32.1 bits (73), Expect = 2.5
 Identities = 19/81 (23%), Positives = 26/81 (32%), Gaps = 4/81 (4%)

Query: 1815 ARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQNTEVD--QTDKNT--AAAAE 1870
            AR    +             A+    A    A A   A      D  +  K    AAA E
Sbjct: 157  ARHDARQARLRREREAAEARAAARAAASAAAAAAEASAAAAPAADDAEAKKRAIIAAALE 216

Query: 1871 KKEKKAPRIAAKFKTPLNLKN 1891
            +  KK   +AA+   P N + 
Sbjct: 217  RARKKKEELAAQGAGPKNTEG 237


>gnl|CDD|185514 PTZ00212, PTZ00212, T-complex protein 1 subunit beta; Provisional.
          Length = 533

 Score = 32.3 bits (74), Expect = 2.9
 Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 16/63 (25%)

Query: 743 HKIENAKVKVFQQPSRNESIIIVQDKVQVPSLKQVA--------------DQIL--GKTI 786
            ++EN K+ V   P   + I I   KV+V S+++VA              D+IL  G  +
Sbjct: 232 KRLENCKILVANTPMDTDKIKIYGAKVKVDSMEKVAEIEAAEKEKMKNKVDKILAHGCNV 291

Query: 787 FIN 789
           FIN
Sbjct: 292 FIN 294


>gnl|CDD|233569 TIGR01776, TonB-tbp-lbp, TonB-dependent lactoferrin and transferrin
            receptors.  This family of TonB-dependent receptors are
            responsible for import of iron from the mammalian iron
            carriers lactoferrin and transferrin across the outer
            membrane. These receptors are found only in bacteria
            which can infect mammals such as Moraxella, Mannheimia,
            Neisseria, Actinobacillus, Pasteurella, Haemophilus and
            Histophilus species [Transport and binding proteins,
            Cations and iron carrying compounds, Transport and
            binding proteins, Porins].
          Length = 932

 Score = 32.5 bits (74), Expect = 2.9
 Identities = 24/88 (27%), Positives = 35/88 (39%), Gaps = 13/88 (14%)

Query: 1685 PSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPD 1744
             S+      QY Y Y+ Q       QV      ++Q     Y+   + +L     P P  
Sbjct: 272  ASLDCAPCAQYGYVYDNQLSDEEKAQVLASRYRHEQVSAKDYTG-ENRLL-----PNP-- 323

Query: 1745 TTEDLEERADSYLAKL-LRLTRAHNLGA 1771
                L+ R+DS+LAKL   L   H +G 
Sbjct: 324  ----LDYRSDSWLAKLGYHLNNRHYVGG 347


>gnl|CDD|189016 cd09609, M3B_PepF_4, Peptidase family M3B Oligopeptidase F (PepF). 
           Peptidase family M3B Oligopeptidase F (PepF;
           Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
           includes oligoendopeptidase F from Lactococcus lactis.
           This enzyme hydrolyzes peptides containing between 7 and
           17 amino acids with fairly broad specificity. The PepF
           gene is duplicated in L. lactis on the plasmid that
           bears it, while a shortened second copy is found in
           Bacillus subtilis. Most bacterial PepFs are cytoplasmic
           endopeptidases; however, the PepF Bacillus
           amyloliquefaciens oligopeptidase is a secreted protein
           and may facilitate the process of sporulation.
           Specifically, the yjbG gene encoding the homolog of the
           PepF1 and PepF2 oligoendopeptidases of Lactococcus
           lactis has been identified in Bacillus subtilis as an
           inhibitor of sporulation initiation when over expressed
           from a multicopy plasmid.
          Length = 586

 Score = 32.2 bits (74), Expect = 2.9
 Identities = 37/147 (25%), Positives = 51/147 (34%), Gaps = 41/147 (27%)

Query: 407 NLKRFEKFLTKLASFDFEQFREKYADIQ-YLEGKMGKKFSDVFHKEK---SKEMDPDLAA 462
            L  FE  L  L     E+  E+  +   YLE        D+  K+    S E++  LAA
Sbjct: 101 KLSFFESELLALDEKILEEVIEELPEYAAYLE--------DLLRKKPHTLSPEVEEALAA 152

Query: 463 L-------------IKATDDDYD----DGTEEVLNYANYEEYYYSDEDEEWRTNFFSEFG 505
           L              K  D  ++    DG    L++  +E  Y  D D E R   F  F 
Sbjct: 153 LSPVLDAPYNIYETTKLADMTFEDFEVDGKTYPLSFVLFENEYEYDPDTEVRRAAFKSF- 211

Query: 506 QHKT--DYYH-----YKLDYAKVTNEV 525
             KT   Y +     Y     +V  E 
Sbjct: 212 -SKTLRKYQNTTAATY---ATQVQKEK 234


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
            transcription factor-type molecule that physically
            interacts with immature forms of the Wnt receptor
            Frizzled and the FGF receptor within the endoplasmic
            reticulum to inhibit their post-translational maturation
            and trafficking to the cell surface.
          Length = 177

 Score = 31.3 bits (71), Expect = 3.0
 Identities = 20/73 (27%), Positives = 24/73 (32%), Gaps = 4/73 (5%)

Query: 1653 VRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVA 1712
             R        + ST  Q   +P   S      PS P  Q Q Y     Q      P  + 
Sbjct: 107  CRPQRPVMTRATSTTVQTTPLPQPPS----TAPSYPGPQYQGYHPMPPQPGMPAPPYSLQ 162

Query: 1713 YPPPPYQQARHPQ 1725
            YPPP   Q + P 
Sbjct: 163  YPPPGLLQPQGPP 175


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 32.6 bits (74), Expect = 3.1
 Identities = 24/127 (18%), Positives = 39/127 (30%), Gaps = 12/127 (9%)

Query: 1621 PEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPV--RGHTNSSDLSRSTNAQPRGIPILGS 1678
            P   +   PP P     P   +   + P  G V  R  + S     +  A+P    +  +
Sbjct: 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPG-GDVRRRPPSRSPAAKPAAPARPPVRRL--A 2887

Query: 1679 QAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASS 1738
            + A+   +  +           ++ P         P P       PQ   PP P      
Sbjct: 2888 RPAVSRSTESFALPPDQP----ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP-- 2941

Query: 1739 QPVPPDT 1745
             P+ P T
Sbjct: 2942 -PLAPTT 2947


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.9 bits (72), Expect = 3.5
 Identities = 21/104 (20%), Positives = 38/104 (36%), Gaps = 11/104 (10%)

Query: 1546 LEKNKELNTPSGGELSFEEKHSMAAKSKNETTPTIKTEMRKIISSSDYQGLEEEEEEEVM 1605
            LE+ KE   P+  + S  E      +       T++ E          +  EE EE E +
Sbjct: 78   LERQKEFK-PTSTDQSLSEPSRRMQEDSGAENETVEEEE----KEESREEREEVEETEGV 132

Query: 1606 IIEIKEDL------QQQQTVEPEQNKQAFPPHPDQLAQPGDDVT 1643
                +++        Q++  EPE  ++  P         G+ +T
Sbjct: 133  TKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMT 176


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 32.2 bits (74), Expect = 3.5
 Identities = 15/60 (25%), Positives = 17/60 (28%)

Query: 1827 PTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIAAKFKTP 1886
              AP   +A+    A    A  A  AP             AA  K    A   AA    P
Sbjct: 41   TAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPP 100



 Score = 31.4 bits (72), Expect = 6.0
 Identities = 16/64 (25%), Positives = 19/64 (29%), Gaps = 4/64 (6%)

Query: 1827 PTAPNTKSASDVTRAE----TKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIAAK 1882
            PTA    +A+  +        K   A  PAP            AAAA      A   AA 
Sbjct: 44   PTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAA 103

Query: 1883 FKTP 1886
                
Sbjct: 104  AAAA 107


>gnl|CDD|165164 PHA02813, PHA02813, hypothetical protein; Provisional.
          Length = 354

 Score = 31.8 bits (72), Expect = 3.7
 Identities = 26/96 (27%), Positives = 41/96 (42%), Gaps = 21/96 (21%)

Query: 1445 MNDLYQSANKRSENGFNIPS-ISDLIKQNNIT----AKTDYPTTST------------FP 1487
            ++D+++   K+    F  P  ISD+I  N IT     K D+                 + 
Sbjct: 66   LDDIFKVIRKKLLLSFEFPQKISDIILDNTITLIKYEKGDFFNNHRDFIHFKSKNCYCYH 125

Query: 1488 EVKYTNSPSDNGGRTA---TPDTSFTTKLTNLFNRS 1520
             V Y N+ S  GG T      +T F+TK   LF+++
Sbjct: 126  LVLYLNNTS-KGGNTNIHIKDNTIFSTKNDVLFDKT 160


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 620

 Score = 31.9 bits (73), Expect = 3.9
 Identities = 29/175 (16%), Positives = 51/175 (29%), Gaps = 20/175 (11%)

Query: 1714 PPPPYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERADSYLAKL-LRLTRAHNLGAP 1772
            PPP       P  ++P  P     +       + +LEE     LAKL L  TR   L   
Sbjct: 406  PPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEELWQQILAKLELPSTRML-LSQQ 464

Query: 1773 DYTFSMDASKLYVATVRICHPHSDIVVRGKPCAKKADSVEICA--RRAFNELTQIY---- 1826
                S+D+++   A + +      +V        ++    +     +      ++     
Sbjct: 465  AELVSLDSNR---AVIAVSPNWLGMV--------QSRKPLLEQAFAKVLGRSIKLNLESQ 513

Query: 1827 -PTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIA 1880
              +A NT       +          P PQ T                +  +   A
Sbjct: 514  SGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPA 568


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein; Provisional.
          Length = 172

 Score = 31.1 bits (70), Expect = 3.9
 Identities = 19/49 (38%), Positives = 22/49 (44%), Gaps = 1/49 (2%)

Query: 1669 QPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPP 1717
            +P+G    G     YN S P QQ Q+ Q    QQQ    PQQ A  P  
Sbjct: 114  RPQGDDQGGQGGGNYNQSAPRQQAQRPQ-QAAQQQSRPAPQQPAPQPAA 161


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
            domain family is found in eukaryotes, and is typically
            between 125 and 136 amino acids in length.
          Length = 133

 Score = 30.2 bits (68), Expect = 4.4
 Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 10/75 (13%)

Query: 1558 GELSFEEKHSMAAKSKNETTPTIKTEMRKIISSSDYQGLEEEEEEEVMIIEIKEDLQQQQ 1617
            G+ S  ++   AA + N + P          S + +   EEEEEEE       E+ +QQQ
Sbjct: 61   GDSSTGDQSPAAAAAHNSSPPEGPAGT---TSGTGHTRQEEEEEEE-------ENEKQQQ 110

Query: 1618 TVEPEQNKQAFPPHP 1632
            + E +  +     HP
Sbjct: 111  SDEAQVQQHQQHEHP 125


>gnl|CDD|213343 cd05395, RasGAP_RASA4, Ras-GTPase Activating Domain of RASA4.  Ras
           GTPase activating-like 4 protein (RASAL4), also known as
           Ca2+ -promoted Ras inactivator (CAPRI), is a member of
           the GAP1 family. Members of the GAP1 family are
           characterized by a conserved domain structure comprising
           N-terminal tandem C2 domains, a highly conserved central
           RasGAP domain, and a C-terminal pleckstrin-homology
           domain that is associated with a Bruton's tyrosine
           kinase motif. RASAL4, like RASAL, is a cytosolic protein
           that undergoes a rapid translocation to the plasma
           membrane in response to a receptor-mediated elevation in
           the concentration of intracellular free Ca2+ ([Ca2+]i).
           However, unlike RASAL, RASAL4 does not sense
           oscillations in [Ca2+]i.
          Length = 287

 Score = 31.4 bits (71), Expect = 4.7
 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 418 LASFDFEQFREKYADIQYLEGKMGKKFSDVFHKEKSKEMDP 458
           LAS   E F  K A +QYL   +G   + VF ++K  E+DP
Sbjct: 84  LASKSMESFL-KVAGMQYLHSVLGPTINRVFEEKKYVELDP 123


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 31.0 bits (70), Expect = 6.0
 Identities = 19/87 (21%), Positives = 31/87 (35%), Gaps = 14/87 (16%)

Query: 1801 GKPCAKKADSVEICARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQN----- 1855
             K  AK   +     + +             + +     +   KTAV+A+ A +      
Sbjct: 199  AKKAAKTPAAKSGAKKAS---------AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAA 249

Query: 1856 TEVDQTDKNTAAAAEKKEKKAPRIAAK 1882
             +  +T K     A K  KKA + AAK
Sbjct: 250  KKAKKTAKKALKKAAKAVKKAAKKAAK 276


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 30.8 bits (69), Expect = 6.8
 Identities = 22/76 (28%), Positives = 28/76 (36%), Gaps = 8/76 (10%)

Query: 1670 PRGIPILGSQ---AAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQY 1726
            P G+  L +    AA   P      Q Q      Q      PQQ+A P    Q    P  
Sbjct: 9    PPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQPGQPGA---PQQLAIPTQ--QPQPVPTS 63

Query: 1727 SKPPHPILNASSQPVP 1742
            +  PH +  A +QP P
Sbjct: 64   AMTPHVVQQAPAQPAP 79


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 31.2 bits (70), Expect = 6.8
 Identities = 37/225 (16%), Positives = 78/225 (34%), Gaps = 31/225 (13%)

Query: 1274 NSNLQVTSPYHVKVSRCNSQQSSVSQSPSSTMISQQSSPFNV-KPTEKTEPPPPHKLNQV 1332
            ++   V   Y       ++QQS++   P    ++      N  +P E+ E   PH  +  
Sbjct: 103  DNRAHVGRAYGAGACEQSNQQSNI---PIGDPVNHLRGHPNFGEPRERAEDAAPHAQHNH 159

Query: 1333 NTSSAPKEKPRTSEQNQIKSNGNEQKVIINKEERCIFSNATYSIPTTQNSELFSTGGMSF 1392
            +  +     P   + +Q        +++ + +     +    + P+ ++      G    
Sbjct: 160  SGKTNGDNPPTGGQYHQSGGTSRNHQMMDSNKNCPADALFNETNPSGEHKRNSIDG---- 215

Query: 1393 FGYPNLLDIPENKNKLLTGEITANKMETKPEKATCPQDKEVEEIT----NGTQTITM--- 1445
                   DIP +        I +   E   +K    Q K+    T    N    +T+   
Sbjct: 216  -------DIPSDI------YIDSQPNEGDVQKTNPWQGKQGNSATSPPANENNAVTLSCS 262

Query: 1446 NDLYQSANKRSENGFNIPSISDL---IKQNNITAKTDYPTTSTFP 1487
            ND  + A+  +E+G+     S++      +NI    + P  +T  
Sbjct: 263  NDQQRGASSAAESGYAHHRGSNIASHTPNDNIMHAANNPLNNTND 307


>gnl|CDD|172947 PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional.
          Length = 175

 Score = 30.2 bits (68), Expect = 6.9
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 1528 ENTFKRAISAMVQQSPHLLEKNKELNTPSGGELS--FEEKHSMAAKSKNETTPTIKTEMR 1585
            +++  RA SA   ++  +L KN+EL   +  E      E    A K + E T    TE +
Sbjct: 55   QSSIDRAHSAK-DEAEAILRKNRELLAKADAEADKIIREGKEYAEKLRAEITEKAHTEAK 113

Query: 1586 KIISSSDYQGLEEEEEEE 1603
            K+I+S+     +EE E+E
Sbjct: 114  KMIASA-----KEEIEQE 126


>gnl|CDD|219927 pfam08601, PAP1, Transcription factor PAP1.  The transcription factor
            Pap1 regulates antioxidant-gene transcription in response
            to H2O2. This region is cysteine rich. Alkylation of
            cysteine residues following treatment with a cysteine
            alkylating agent can mask the accessibility of the
            nuclear exporter Crm1, triggering nuclear accumulation
            and Pap1 dependent transcriptional expression.
          Length = 344

 Score = 30.7 bits (69), Expect = 7.2
 Identities = 18/114 (15%), Positives = 37/114 (32%), Gaps = 5/114 (4%)

Query: 1281 SPYHVKVSR-CNSQQSSVSQSPSSTMISQQSSPF-NVKPTEKTEPPPPHKLNQVNTSSAP 1338
              +  K+   C ++   + +   +   S  SSP   +  + ++    P+      TS + 
Sbjct: 27   PKFCGKLPGACGTKNCPIPKLAKN---SSVSSPVPGLLNSTESNVSSPNNNPNGYTSPSS 83

Query: 1339 KEKPRTSEQNQIKSNGNEQKVIINKEERCIFSNATYSIPTTQNSELFSTGGMSF 1392
                  S    +  + N      N       S   Y+     +S+  S+G   F
Sbjct: 84   AAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGF 137


>gnl|CDD|169051 PRK07667, PRK07667, uridine kinase; Provisional.
          Length = 193

 Score = 30.1 bits (68), Expect = 7.5
 Identities = 20/59 (33%), Positives = 27/59 (45%), Gaps = 12/59 (20%)

Query: 464 IKATDDDYDDGTEEVLNYANYEEYYYSDEDEEW-RTNFFSEFGQHKTD----YYHYKLD 517
           I   +  Y  G EE      + EYYY   D EW R  FF +  Q++T     +YH + D
Sbjct: 57  IVERNKRYHTGFEE------WYEYYYLQWDIEWLRQKFFRKL-QNETKLTLPFYHDETD 108


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
            biomineralisation. They seem to regulate the formation of
            crystallites during the secretory stage of tooth enamel
            development. thought to play a major role in the
            structural organisation and mineralisation of developing
            enamel. They are found in the extracellular matrix.
            Mutations in X-chromosomal amelogenin can cause
            Amelogenesis imperfecta.
          Length = 174

 Score = 29.9 bits (67), Expect = 7.5
 Identities = 23/76 (30%), Positives = 27/76 (35%), Gaps = 11/76 (14%)

Query: 1673 IPILGSQAAM-----YNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
            + IL  Q  M     ++P VP   QQ   +     Q    P     P P  QQ  H Q  
Sbjct: 74   LLILPPQQPMMPVPGHHPMVPMTGQQ--PHLQPPAQHPLQPTYGQNPQP--QQPTHTQ-- 127

Query: 1728 KPPHPILNASSQPVPP 1743
             P  P   A  QP  P
Sbjct: 128  PPVQPQQPADPQPGQP 143


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
            presumed domain is functionally uncharacterized. This
            domain family is found in bacteria and eukaryotes, and is
            typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 31.0 bits (70), Expect = 7.8
 Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 2/68 (2%)

Query: 1281 SPYHVKVSRCNSQQSSVSQSPSSTMISQQSSPFNVKPTEKTEPPPPHKLNQVNTSSAPKE 1340
             P           +    Q  +S+M +++SS    KP E  +P  P K   ++T+    +
Sbjct: 224  IPDLPSSRDSEKTKPEKPQQETSSMDTEKSS--APKPRETLDPKSPEKAPPIDTTEEELK 281

Query: 1341 KPRTSEQN 1348
             P  S + 
Sbjct: 282  SPEASPKE 289


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 30.3 bits (69), Expect = 8.1
 Identities = 13/50 (26%), Positives = 19/50 (38%)

Query: 1841 AETKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIAAKFKTPLNLK 1890
             E K  V AQ A Q  +  +       A  ++ K  P+   K + P   K
Sbjct: 104  EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153


>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
            protein. 
          Length = 314

 Score = 30.6 bits (69), Expect = 8.2
 Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 2/96 (2%)

Query: 1267 KENAPLVNSNLQVTSPYHVKVSR-CNSQQSSVSQSPSSTMISQQSSPFNVKPTEKTEPPP 1325
            K+ +    +NL  +S   + +     + + SV +   +T+     S        +   PP
Sbjct: 68   KKTSKKSTANLASSSYQKIVIQLNRENLKKSVPRPSDTTLSCANLSSLREPSPGQPPAPP 127

Query: 1326 PHKLNQVNTSSAPKEKPRTSEQNQIKSNGNEQKVII 1361
            P  L+  N +  P +K   S      + G+ ++VI+
Sbjct: 128  PSVLSGKNANCIPSQKNAPSIAIT-STGGSPRRVIV 162


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein; Provisional.
          Length = 182

 Score = 29.8 bits (67), Expect = 8.3
 Identities = 10/46 (21%), Positives = 13/46 (28%), Gaps = 3/46 (6%)

Query: 1678 SQAAMYNPSVPYQQQQQYQYYYNQQ---QPMYYPQQVAYPPPPYQQ 1720
             +    N    Y    Q  Y    Q   Q  Y  +     PP Y+ 
Sbjct: 123  HENFNNNYPGNYNNPSQDPYMNQAQSYNQNAYAKENQQAQPPKYEN 168


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 30.7 bits (69), Expect = 8.3
 Identities = 16/111 (14%), Positives = 27/111 (24%), Gaps = 12/111 (10%)

Query: 1665 STNAQPRGIPILGSQAAMYNPS--VPYQQQQQYQYYYNQQQPMYYPQQV------AYPPP 1716
               A    + ++ +      P+     Q+     Y    + PM     V           
Sbjct: 358  RVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVS 417

Query: 1717 PYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERADSYLAKLLRLTRAH 1767
            PY     P  S  P P+     QP        +     + +         H
Sbjct: 418  PYNPQS-PGTSYGPEPVGPVPPQPT---NPYVMPISMANMVYPGHPQEHGH 464


>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2.  This family
            contains the bacterial histone H1-like nucleoprotein HC2
            (approximately 200 residues long), which seems to be
            found mostly in Chlamydia. HC2 functions in DNA
            condensation, although it has been suggested that it also
            has other roles.
          Length = 187

 Score = 30.1 bits (67), Expect = 8.7
 Identities = 19/85 (22%), Positives = 27/85 (31%), Gaps = 2/85 (2%)

Query: 1802 KPCAKKADSVEICARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQT 1861
            KP AKKA + +   R+   +      T      A+    A  K A     A +       
Sbjct: 22   KPAAKKAAAKKTVVRKVAAKKPAARKTVAKKTVAAKKPAA--KKAAKKAVAKKVVAKKPV 79

Query: 1862 DKNTAAAAEKKEKKAPRIAAKFKTP 1886
             K   A     +K A +     KT 
Sbjct: 80   AKKAVAKKATAKKVAAKKVVAKKTV 104


>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
          Length = 84

 Score = 28.4 bits (63), Expect = 9.1
 Identities = 14/44 (31%), Positives = 17/44 (38%), Gaps = 6/44 (13%)

Query: 1682 MYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
            M       +QQQQ      Q Q    P Q+  P P +QQ    Q
Sbjct: 45   MSKDDQTPEQQQQ------QPQQQIAPNQIEAPQPNFQQHYQGQ 82


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.133    0.392 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 96,526,810
Number of extensions: 9655777
Number of successful extensions: 10046
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9597
Number of HSP's successfully gapped: 142
Length of query: 1891
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1780
Effective length of database: 6,014,308
Effective search space: 10705468240
Effective search space used: 10705468240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 66 (29.4 bits)