RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12290
         (530 letters)



>gnl|CDD|238046 cd00094, HX, Hemopexin-like repeats.; Hemopexin is a heme-binding
           protein that transports heme to the liver.
           Hemopexin-like repeats occur in vitronectin and some
           matrix metalloproteinases family (matrixins). The HX
           repeats of some matrixins bind tissue inhibitor of
           metalloproteinases (TIMPs). This CD contains 4 instances
           of the repeat.
          Length = 194

 Score =  192 bits (490), Expect = 1e-58
 Identities = 81/198 (40%), Positives = 120/198 (60%), Gaps = 6/198 (3%)

Query: 232 PDTCDT-SYDAIAVIRTEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPHNLTHIDAV 290
           PD CD  S+DA+  +R E++ FK R+ WR+   G  PG P LI  FW  LP   + +DA 
Sbjct: 1   PDACDPLSFDAVTTLRGELYFFKGRYFWRL-SPGKPPGSPFLISSFWPSLP---SPVDAA 56

Query: 291 YQRID-NHIAFFIGRQYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLF 349
           ++R D   I FF G +Y+++ G  L PGYP+P++ LG P ++++IDAA+ W  N KTY F
Sbjct: 57  FERPDTGKIYFFKGDKYWVYTGKNLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFF 116

Query: 350 SGTRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQWKDGATYFFKGKGFWKFKDVS 409
            G +YW+ D++T + +P YP+ +  ++ GV   +D AF+W DG  YFFKG  +W+F   S
Sbjct: 117 KGDKYWRYDEKTQKMDPGYPKLIETDFPGVPDKVDAAFRWLDGYYYFFKGDQYWRFDPRS 176

Query: 410 MRVEKEKPSPSAQFWMKC 427
             V    P   +  W+ C
Sbjct: 177 KEVRVGYPLKISSDWLGC 194


>gnl|CDD|214524 smart00120, HX, Hemopexin-like repeats.  Hemopexin is a
           heme-binding protein that transports heme to the liver.
           Hemopexin-like repeats occur in vitronectin and some
           matrix metalloproteinases family (matrixins). The HX
           repeats of some matrixins bind tissue inhibitor of
           metalloproteinases (TIMPs).
          Length = 45

 Score = 45.7 bits (109), Expect = 6e-07
 Identities = 17/47 (36%), Positives = 27/47 (57%), Gaps = 3/47 (6%)

Query: 334 IDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVG 380
           IDAA     + KTY F G +YW+ D +  R +P YP+ ++  + G+ 
Sbjct: 1   IDAAFELR-DGKTYFFKGDKYWRFDPK--RVDPGYPKLISSFFPGLP 44



 Score = 42.2 bits (100), Expect = 1e-05
 Identities = 17/44 (38%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 287 IDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYPRPLTAL--GLP 328
           IDA ++  D    FF G +Y+ F+  R+ PGYP+ +++   GLP
Sbjct: 1   IDAAFELRDGKTYFFKGDKYWRFDPKRVDPGYPKLISSFFPGLP 44



 Score = 41.5 bits (98), Expect = 2e-05
 Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 240 DAIAVIR-TEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPH 282
           DA   +R  + + FK    WR   K V PGYP LI  F+  LP 
Sbjct: 2   DAAFELRDGKTYFFKGDKYWRFDPKRVDPGYPKLISSFFPGLPC 45



 Score = 41.1 bits (97), Expect = 3e-05
 Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 383 IDDAFQWKDGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFW 424
           ID AF+ +DG TYFFKG  +W+F     RV+   P   + F+
Sbjct: 1   IDAAFELRDGKTYFFKGDKYWRFDPK--RVDPGYPKLISSFF 40


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 49.6 bits (119), Expect = 2e-06
 Identities = 24/107 (22%), Positives = 38/107 (35%), Gaps = 5/107 (4%)

Query: 117 VPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPE----TPQDSPRREPSLRQPNQ-YPE 171
           V R  H   +  EH     SP+ +  P + +  P      P ++        +P Q  P+
Sbjct: 68  VHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQ 127

Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
              +P     P  P Q P +P+  PA  P  +  +P         P 
Sbjct: 128 PVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPV 174



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 22/102 (21%), Positives = 31/102 (30%), Gaps = 8/102 (7%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHP------TPESPRREPPPRHPNQYPETPQDSPR 158
            P      R+P    P           P        + P  +P P  P Q P +PQ +P 
Sbjct: 94  PPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPA 153

Query: 159 REPSLRQPNQYPET--PHEPRRDSPPRHPNQFPETPRKPPAR 198
            +P    P    +   P EP     P    +      KP  +
Sbjct: 154 PQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKRK 195



 Score = 37.7 bits (88), Expect = 0.012
 Identities = 23/114 (20%), Positives = 34/114 (29%), Gaps = 20/114 (17%)

Query: 94  EEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETP 153
           E ++    P++   P   + +P     + P          PE+         P Q    P
Sbjct: 80  EHEAARPSPQHQYQPPYASAQPR-QPVQQP----------PEAQVPPQHAPRPAQPAPQP 128

Query: 154 QDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
              P  +P   QP Q P +P      +P   P         P         AEP
Sbjct: 129 VQQPAYQPQPEQPLQQPVSPQ----VAPAPQPVH-----SAPQPAQQAFQPAEP 173


>gnl|CDD|215676 pfam00045, Hemopexin, Hemopexin.  Hemopexin is a heme-binding
           protein that transports heme to the liver.
           Hemopexin-like repeats occur in vitronectin and some
           matrix metallopeptidases family (matrixins). The HX
           repeats of some matrixins bind tissue inhibitor of
           metallopeptidases (TIMPs).
          Length = 45

 Score = 44.1 bits (105), Expect = 3e-06
 Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 287 IDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYPRPLTAL--GLP 328
           IDA ++  D    FF GR+Y+ F+  R+ PGYP+ ++    GLP
Sbjct: 1   IDAAFEDRDGKTYFFKGRKYWRFDPQRVEPGYPKLISDFWPGLP 44



 Score = 43.3 bits (103), Expect = 5e-06
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 3/47 (6%)

Query: 334 IDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVG 380
           IDAA       KTY F G +YW+ D +  R EP YP+ +++ W G+ 
Sbjct: 1   IDAAFEDRDG-KTYFFKGRKYWRFDPQ--RVEPGYPKLISDFWPGLP 44



 Score = 43.3 bits (103), Expect = 5e-06
 Identities = 19/44 (43%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 240 DAIAVIRT-EVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPH 282
           DA    R  + + FK R  WR   + V PGYP LI  FW  LP 
Sbjct: 2   DAAFEDRDGKTYFFKGRKYWRFDPQRVEPGYPKLISDFWPGLPC 45



 Score = 41.0 bits (97), Expect = 3e-05
 Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 383 IDDAFQWKDGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFW 424
           ID AF+ +DG TYFFKG+ +W+F     RVE   P   + FW
Sbjct: 1   IDAAFEDRDGKTYFFKGRKYWRFDPQ--RVEPGYPKLISDFW 40


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 49.9 bits (119), Expect = 3e-06
 Identities = 31/143 (21%), Positives = 38/143 (26%), Gaps = 10/143 (6%)

Query: 98   WAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPE---TPQ 154
            W      +         P    P  P    P     P +P   P P  P+        P 
Sbjct: 2802 WDPADPPAAVLAPAAALPPAASPAGPLP--PPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859

Query: 155  DSPRREPSLRQPNQYPETPHEPR-----RDSPPRHPNQFPETPRKPPARYPDIARAEPRT 209
               RR P  R P   P  P  P      R +  R    F   P +P       A   P+ 
Sbjct: 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919

Query: 210  TPTTTTPPATRRGESNPGPSDKP 232
             P    PP  +     P     P
Sbjct: 2920 QPQPPPPPQPQPPPPPPPRPQPP 2942



 Score = 44.9 bits (106), Expect = 1e-04
 Identities = 32/137 (23%), Positives = 40/137 (29%), Gaps = 16/137 (11%)

Query: 102  PRYSPSPDLTTRRPEV-----PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
             R   +      RP V     P     T         PE P +   P  P Q    P   
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPP-QPQPQPPPP 2926

Query: 157  PRREPSLRQPNQYPETPHEPRRD----SPPRHPNQFPETPRKPPARYPDIAR-----AEP 207
            P+ +P    P + P+ P  P  D      P      P      P R           A  
Sbjct: 2927 PQPQPPPPPPPR-PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

Query: 208  RTTPTTTTPPATRRGES 224
            R  P ++TPP T    S
Sbjct: 2986 REAPASSTPPLTGHSLS 3002



 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 32/142 (22%), Positives = 40/142 (28%), Gaps = 11/142 (7%)

Query: 102  PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPR-REPPPRHPNQYPETPQDS---- 156
            PR         R P  P P  P  H P+  P   SP   EP P  P   P   +      
Sbjct: 2598 PRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPA 2657

Query: 157  ------PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTT 210
                  PRR   L +  Q    P  PRR +              PP             +
Sbjct: 2658 PGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVS 2717

Query: 211  PTTTTPPATRRGESNPGPSDKP 232
             T   P      +++P     P
Sbjct: 2718 ATPLPPGPAAARQASPALPAAP 2739



 Score = 43.0 bits (101), Expect = 4e-04
 Identities = 29/140 (20%), Positives = 36/140 (25%), Gaps = 26/140 (18%)

Query: 105  SPSPDLTTR---RPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
            +P  D+  R   R    +P  P        P      R    R    +   P D P R P
Sbjct: 2857 APGGDVRRRPPSRSPAAKPAAPA------RPPVRRLARPAVSRSTESFAL-PPDQPERPP 2909

Query: 162  SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP----- 216
                  Q P  P    +  PP  P   P  P +P               P+   P     
Sbjct: 2910 ----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965

Query: 217  -------PATRRGESNPGPS 229
                      R     P PS
Sbjct: 2966 ALVPGRVAVPRFRVPQPAPS 2985



 Score = 41.1 bits (96), Expect = 0.002
 Identities = 30/133 (22%), Positives = 36/133 (27%), Gaps = 18/133 (13%)

Query: 105  SPSPDLTTRRPE-VPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSL 163
              +PD +   P   PRP  P        P        PP     + P   +  PR     
Sbjct: 2561 PAAPDRSVPPPRPAPRPSEPAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPA-- 2612

Query: 164  RQPNQYPETPHEPRRDSPPRHPNQF----PETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
              P+  P   H P    P   P          P  PP   P    A  R +     P   
Sbjct: 2613 -PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR----PRRA 2667

Query: 220  RRGESNPGPSDKP 232
            RR       S  P
Sbjct: 2668 RRLGRAAQASSPP 2680



 Score = 40.3 bits (94), Expect = 0.003
 Identities = 35/137 (25%), Positives = 43/137 (31%), Gaps = 8/137 (5%)

Query: 102  PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPR---HPNQYPETPQDSPR 158
               + SP    RR              +  P P +P  EP P         P  P  + +
Sbjct: 2673 AAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP--EPAPHALVSATPLPPGPAAARQ 2730

Query: 159  REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAE--PRTTPTTTTP 216
              P+L      P  P  P     P  P + P T   PPA  P  A A   PR        
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRPAVA 2789

Query: 217  PATRRGESNPGPSDKPD 233
              +   ES P P D  D
Sbjct: 2790 SLSESRESLPSPWDPAD 2806



 Score = 38.8 bits (90), Expect = 0.009
 Identities = 26/134 (19%), Positives = 32/134 (23%), Gaps = 3/134 (2%)

Query: 102  PRYSPSPDLTTRRPEVPRPRHPTH-HHPEHHPTPESPRREPPPR--HPNQYPETPQDSPR 158
            P   P P+     P   R   P             SP + P  R   P     T    P 
Sbjct: 2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702

Query: 159  REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
              P   +P  +      P    P       P  P  P         A P        PP 
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762

Query: 219  TRRGESNPGPSDKP 232
            T    +   P+   
Sbjct: 2763 TAGPPAPAPPAAPA 2776



 Score = 37.6 bits (87), Expect = 0.021
 Identities = 26/110 (23%), Positives = 30/110 (27%), Gaps = 4/110 (3%)

Query: 123  PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
            P    P   P     R  PPPR   +  E    S  R P     +  P  P +  R  P 
Sbjct: 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD-RGDPR 2609

Query: 183  RHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
                  P +P  P    PD     P        P           P D P
Sbjct: 2610 GPA---PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP 2656



 Score = 36.1 bits (83), Expect = 0.052
 Identities = 35/144 (24%), Positives = 39/144 (27%), Gaps = 16/144 (11%)

Query: 77  GESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPES 136
           G   P R    +   R    +     R     D T     VP         P   PTP  
Sbjct: 371 GRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASV------PTPAPTPVP 424

Query: 137 PRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHP------NQFPE 190
               PPP  P    E   D     P  RQP      P     D   R         + PE
Sbjct: 425 ASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPE 484

Query: 191 TPRKPPA----RYPDIARAEPRTT 210
            P    A    R+PD A    R  
Sbjct: 485 PPGADLAELLGRHPDTAGTVVRLA 508



 Score = 35.7 bits (82), Expect = 0.081
 Identities = 34/145 (23%), Positives = 39/145 (26%), Gaps = 17/145 (11%)

Query: 102  PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
            P  +P P         P P  P          P +P     P  P   P  P    R   
Sbjct: 2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP-ATPGGPARPARPPT 2762

Query: 162  SLRQPNQYPETPHEPRRDSPPRHPNQ-------------FPETPRKPPARYPDIARAEPR 208
            +   P   P  P  P    P R                  P  P  PPA     A A P 
Sbjct: 2763 TAGPPA--PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820

Query: 209  T-TPTTTTPPATRRGESNPGPSDKP 232
              +P    PP T    + P P   P
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGP 2845



 Score = 35.7 bits (82), Expect = 0.082
 Identities = 29/136 (21%), Positives = 34/136 (25%), Gaps = 15/136 (11%)

Query: 105 SPSPDLTTRRPE-VPRPRHPTHHHP---------EHHPTPESPRREPPPRHPNQYPETPQ 154
           SP P      P   P+ R PT   P          HHP     R   P R          
Sbjct: 338 SPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHP----KRASLPTRKRRSARHAAT 393

Query: 155 DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTT 214
              R      Q       P      +P   P   P  P  P     +    +    P   
Sbjct: 394 PFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPS-AEPGSDDGPAPPPER 452

Query: 215 TPPATRRGESNPGPSD 230
            PPA     +   P D
Sbjct: 453 QPPAPATEPAPDDPDD 468



 Score = 33.4 bits (76), Expect = 0.36
 Identities = 26/112 (23%), Positives = 30/112 (26%), Gaps = 6/112 (5%)

Query: 116  EVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHE 175
                 R PT   P     P +P   PP R       +  +S    PS   P   P     
Sbjct: 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA 2813

Query: 176  PRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
            P   +    P   P  P  PP          P   P    P     G   PG
Sbjct: 2814 P---AAALPPAASPAGPLPPP---TSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859



 Score = 31.8 bits (72), Expect = 1.2
 Identities = 32/130 (24%), Positives = 35/130 (26%), Gaps = 12/130 (9%)

Query: 105  SPSPDLTTRRPEVPR-PRHP-THHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPS 162
            SP+       P VP  P  P     P   PT   P    PP  P   P      PRR   
Sbjct: 2732 SPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP------PRRLTR 2785

Query: 163  LRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRG 222
                +        P     P  P   P     P A  P  A       P T+  P     
Sbjct: 2786 PAVASLSESRESLP----SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841

Query: 223  ESNPGPSDKP 232
               P P   P
Sbjct: 2842 PPGPPPPSLP 2851


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 46.7 bits (111), Expect = 2e-05
 Identities = 31/140 (22%), Positives = 38/140 (27%), Gaps = 6/140 (4%)

Query: 99  AVIPRYSPSPDLTTRRPEVPRP-RHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSP 157
           A  P   P     T  PE+PRP R P   +         P      + P   P  P    
Sbjct: 92  AGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQ 151

Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
           R EP             + R   PPR P   P +      R       EP          
Sbjct: 152 RPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERD-----REPYDAGRPEYDQ 206

Query: 218 ATRRGESNPGPSDKPDTCDT 237
             R  +      D+P    T
Sbjct: 207 RRRDYDHPRPDWDRPRRDRT 226



 Score = 39.0 bits (91), Expect = 0.005
 Identities = 32/149 (21%), Positives = 37/149 (24%), Gaps = 21/149 (14%)

Query: 99  AVIPRYSPSPDLTTRRPEVP----RPRHPTHHHPEHHPTPESPRREPPPRHPNQYP--ET 152
              P       L T RP  P    RP              +  R   PPR P   P    
Sbjct: 128 DRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYA 187

Query: 153 PQDSPRREPSLRQPNQYP-------------ETPHEPRRDSPPRHPNQFPETPRKPPARY 199
           P+    REP      +Y              + P   R D P   P         P    
Sbjct: 188 PEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPE 247

Query: 200 PDIARAEPRTTPTTTTPPATRRGESNPGP 228
            D A   P          A  +    PGP
Sbjct: 248 RDDAPVVPIRPSAPGPLAA--QPAPAPGP 274



 Score = 37.9 bits (88), Expect = 0.015
 Identities = 22/100 (22%), Positives = 27/100 (27%), Gaps = 8/100 (8%)

Query: 133 TPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETP 192
            P +    PPP H  +  E     P R P       Y     + R    PR   Q P   
Sbjct: 89  DPSAGEPAPPPPHARRTSEPELPRPGRRPY----EGYGGPRADDRPPGLPRQD-QLPTAR 143

Query: 193 RKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
              PA      R EP   P        ++      P    
Sbjct: 144 PAYPAYQQ---RPEPGAWPRAADDYGWQQQRLGFPPRAPY 180


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 44.3 bits (104), Expect = 1e-04
 Identities = 33/122 (27%), Positives = 52/122 (42%), Gaps = 25/122 (20%)

Query: 112 TRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPN--QY 169
           +++PE P+        P+H   PE P++   PR   Q P  P+ SP+    L  P   + 
Sbjct: 576 SKKPEFPK-------DPKHPKDPEEPKKPKRPRSA-QRPTRPK-SPKLPELLDIPKSPKR 626

Query: 170 PETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
           PE+P  P+R  PP    Q P +P +P          E      +  PP + +   +P   
Sbjct: 627 PESPKSPKRPPPP----QRPSSPERP----------EGPKIIKSPKPPKSPKPPFDPKFK 672

Query: 230 DK 231
           +K
Sbjct: 673 EK 674



 Score = 38.5 bits (89), Expect = 0.010
 Identities = 41/180 (22%), Positives = 59/180 (32%), Gaps = 26/180 (14%)

Query: 85  RTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVP-RPRHPTH-HHPEHHPTPESP----R 138
           +T  + D   E         +P    TT RP  P  PR       P   P  E P     
Sbjct: 690 KTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEF 749

Query: 139 REPPPRHPNQYPETPQDSPR--------REPSLRQPNQYPETPHEPRRDSPPRHPNQFPE 190
             PP      + ETP D+P         +E  +      P+   + R DSP  H +    
Sbjct: 750 FTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK-RPDSPSEHED---- 804

Query: 191 TPRKPPARYPDIARAEPRTT--PTTTTPPATRRGESNPGPSDKPDTC--DTSYDAIAVIR 246
              KPP  +P + +   R      +TT   +  G      S K        S+D +  + 
Sbjct: 805 ---KPPGDHPSLPKKRHRLDGLALSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVE 861



 Score = 38.1 bits (88), Expect = 0.013
 Identities = 22/125 (17%), Positives = 43/125 (34%), Gaps = 1/125 (0%)

Query: 108 PDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPN 167
           P+ +   P+ P  +       E     + P+    P    +     +  P +E    +  
Sbjct: 514 PEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIP 573

Query: 168 QYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
              + P  P+    P+ P + P+ P++P +         P+       P + +R ES   
Sbjct: 574 TLSKKPEFPKDPKHPKDPEE-PKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKS 632

Query: 228 PSDKP 232
           P   P
Sbjct: 633 PKRPP 637



 Score = 28.9 bits (64), Expect = 8.4
 Identities = 27/138 (19%), Positives = 47/138 (34%), Gaps = 8/138 (5%)

Query: 101 IPRYSPSPDLTT--RRPEVP-RPRHPTH-HHPEHHPTPESPRREPPPRHPN---QYPETP 153
           IP+    P+     +RP  P RP  P     P+   +P+ P+   PP  P    ++ +  
Sbjct: 620 IPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDY 679

Query: 154 QDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTT 213
            D+  +    +      E+     +++ P  P     TPR  P + P      P      
Sbjct: 680 LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPR-DEEFPFEPIGD 738

Query: 214 TTPPATRRGESNPGPSDK 231
                    E    P ++
Sbjct: 739 PDAEQPDDIEFFTPPEEE 756


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 43.9 bits (103), Expect = 2e-04
 Identities = 34/130 (26%), Positives = 42/130 (32%), Gaps = 9/130 (6%)

Query: 102 PRYSPSPDLTT--RRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRR 159
           P+   +P   T  +   +P    PT  +            +PPPR P     TP   P  
Sbjct: 655 PQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAP-----TPMRPPAA 709

Query: 160 EPSLRQPNQYPETPHEPRRDSPPR-HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
            P   Q          P   +P R  P        +PPA  P  AR  P   P    PPA
Sbjct: 710 PPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR-PPAAAPGRARPPA 768

Query: 219 TRRGESNPGP 228
              G   P P
Sbjct: 769 AAPGAPTPQP 778



 Score = 43.5 bits (102), Expect = 2e-04
 Identities = 36/148 (24%), Positives = 43/148 (29%), Gaps = 8/148 (5%)

Query: 95  EKSWAVIPRYSPSPDLTTRRPEVPRPRHP-THHHPEHHPTPESPRREPP-PRHPNQYPET 152
           + +W  I      P  T     +P    P T   P   PTP  P   PP           
Sbjct: 663 KPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATG 722

Query: 153 PQDSPRREPSLRQPNQYPETPHEPRRDSPPR-HPNQFPETPRKPPARYPDIARAEPRTTP 211
               P   P   +P         P   +P R  P        +PPA  P      P   P
Sbjct: 723 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAP----GAPTPQP 778

Query: 212 TTTTPPATR-RGESNPGPSDKPDTCDTS 238
               PPA + R    P P   P    TS
Sbjct: 779 PPQAPPAPQQRPRGAPTPQPPPQAGPTS 806



 Score = 40.1 bits (93), Expect = 0.003
 Identities = 38/149 (25%), Positives = 46/149 (30%), Gaps = 20/149 (13%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTP-ESPRREPPPRHPN--QYPETP-QDSP 157
           PR  P P      P  P    P   +    PTP + P+ E  P  P   Q    P Q SP
Sbjct: 621 PRQWPMP--LRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSP 678

Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIAR------------- 204
               ++      P T   P R   P  P   P    + PA     AR             
Sbjct: 679 TGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPA 738

Query: 205 -AEPRTTPTTTTPPATRRGESNPGPSDKP 232
            A  R  P    P   R   + PG +  P
Sbjct: 739 AAPGRARPPAAAPGRARPPAAAPGRARPP 767



 Score = 32.3 bits (73), Expect = 0.78
 Identities = 19/70 (27%), Positives = 24/70 (34%), Gaps = 3/70 (4%)

Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
           P    P    P     P +    P P+ P Q P  PQ  PR  P+   P   P+      
Sbjct: 751 PGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPT---PQPPPQAGPTSM 807

Query: 178 RDSPPRHPNQ 187
           +  P   P Q
Sbjct: 808 QLMPRAAPGQ 817


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 40.4 bits (94), Expect = 0.002
 Identities = 21/86 (24%), Positives = 35/86 (40%), Gaps = 3/86 (3%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
           PR +P+P L          RH     P   PT ++P ++P  ++P Q    PQ  P+++P
Sbjct: 177 PREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQ---QPQQQPQQQP 233

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQ 187
                   P+ P +    +   H   
Sbjct: 234 VQPAQQPTPQNPAQQPPQTEQGHKRS 259



 Score = 34.3 bits (78), Expect = 0.17
 Identities = 15/87 (17%), Positives = 26/87 (29%), Gaps = 10/87 (11%)

Query: 134 PESPRREPPP----------RHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR 183
           P  PR  P P           H +++   P   P  +   +QP          ++     
Sbjct: 174 PNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQP 233

Query: 184 HPNQFPETPRKPPARYPDIARAEPRTT 210
                  TP+ P  + P   +   R+ 
Sbjct: 234 VQPAQQPTPQNPAQQPPQTEQGHKRSR 260



 Score = 34.3 bits (78), Expect = 0.18
 Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 8/71 (11%)

Query: 103 RYSPSPDLTTRRPEVPRP-RHPTHHHPEHHPTPESPRREPPPRH----PNQYPETPQDSP 157
           R++P P   T++P V  P + PT  +P   P  +  ++   P       N   + PQ   
Sbjct: 198 RHAPKP---TQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254

Query: 158 RREPSLRQPNQ 168
             + S  Q NQ
Sbjct: 255 GHKRSREQGNQ 265


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 40.8 bits (95), Expect = 0.002
 Identities = 23/131 (17%), Positives = 41/131 (31%), Gaps = 6/131 (4%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP------ETPQD 155
           P  +P+P+   ++ +  +P    +   +    P+ P   P    P Q P      E P  
Sbjct: 371 PVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQ 430

Query: 156 SPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTT 215
            P   P+  QP        E ++ +         E   + PA    + +           
Sbjct: 431 QPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVV 490

Query: 216 PPATRRGESNP 226
            P     E+ P
Sbjct: 491 EPEPVVEETKP 501



 Score = 40.5 bits (94), Expect = 0.002
 Identities = 34/153 (22%), Positives = 52/153 (33%), Gaps = 9/153 (5%)

Query: 104 YSPSPDLTTRRPEVP------RPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSP 157
           +SP   L    P  P       P              + P++   P+   Q P+ P    
Sbjct: 729 FSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQ 788

Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPD-IARAEPRTTPTTTTP 216
            +    +QP        +P++   P+   Q P+ P  P  +Y        P+   T   P
Sbjct: 789 PQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848

Query: 217 PATRRGESNPGPSDKPDTCDTSYDAIAVIRTEV 249
              R G+S   P  KP T   S D +    +EV
Sbjct: 849 LLMRNGDSR--PLHKPTTPLPSLDLLTPPPSEV 879



 Score = 35.1 bits (80), Expect = 0.10
 Identities = 21/100 (21%), Positives = 34/100 (34%), Gaps = 1/100 (1%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQ-YPETPQDSPRRE 160
           P Y+P+ +   ++P            P + P PE P      +   Q     PQ + + E
Sbjct: 406 PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465

Query: 161 PSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYP 200
            + +QP        +P+           P      PAR P
Sbjct: 466 QTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPP 505



 Score = 34.3 bits (78), Expect = 0.20
 Identities = 21/121 (17%), Positives = 38/121 (31%), Gaps = 12/121 (9%)

Query: 80  QPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPE-VPRPRHPTHHHPEHHPTPESPR 138
           QP + +        E+ +    P Y+P+P+   ++P   P P  P   +       +S  
Sbjct: 399 QPVQPQQPYYAPAAEQPAQQ--PYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF 456

Query: 139 REPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPAR 198
                    Q          ++P   Q   Y +     ++      P      P +PP  
Sbjct: 457 APQSTYQTEQ--------TYQQP-AAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLY 507

Query: 199 Y 199
           Y
Sbjct: 508 Y 508



 Score = 32.4 bits (73), Expect = 0.84
 Identities = 22/121 (18%), Positives = 34/121 (28%), Gaps = 7/121 (5%)

Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQ 165
            S D+   +P V     P     E       P   P P    Q  +  Q + +    L+Q
Sbjct: 347 ASVDVPPAQPTVAWQPVPGPQTGE-------PVIAPAPEGYPQQSQYAQPAVQYNEPLQQ 399

Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESN 225
           P Q  +  + P  + P + P   P   +     Y   A  +P                  
Sbjct: 400 PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ 459

Query: 226 P 226
            
Sbjct: 460 S 460



 Score = 29.7 bits (66), Expect = 5.8
 Identities = 23/99 (23%), Positives = 38/99 (38%), Gaps = 9/99 (9%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREP-PPRHPNQYPETPQDSPRRE 160
           P+   +P    ++P+ P    P +  P+    P+   ++P  P  P    + PQ     +
Sbjct: 768 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 827

Query: 161 PSLRQPNQYP-ETPHEP------RRDSPPRHPNQFPETP 192
           P  +QP Q     P +        R+   R P   P TP
Sbjct: 828 PQYQQPQQPVAPQPQDTLLHPLLMRNGDSR-PLHKPTTP 865


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 39.9 bits (93), Expect = 0.003
 Identities = 24/128 (18%), Positives = 39/128 (30%), Gaps = 6/128 (4%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
           P+   S        + P    P+   P     P++ + +PPP+ P Q            P
Sbjct: 104 PQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQS------PP 157

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
              Q  Q P    +        +P + P  P+  P   P  +    +   +   P     
Sbjct: 158 QQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFY 217

Query: 222 GESNPGPS 229
           G   P P 
Sbjct: 218 GPPQPSPY 225



 Score = 34.9 bits (80), Expect = 0.083
 Identities = 26/138 (18%), Positives = 45/138 (32%), Gaps = 7/138 (5%)

Query: 64  LSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHP 123
           LS    +  E         ++        K E + A      P   +    P  P P+ P
Sbjct: 29  LSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADA------PLQQVNAALPPAPAPQSP 82

Query: 124 THHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR 183
                +    P S +               Q +P++EP    P+Q    P +  +   P+
Sbjct: 83  QPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQ 142

Query: 184 HPNQFPETPR-KPPARYP 200
            P Q P+  + + P + P
Sbjct: 143 PPPQVPQQQQYQSPPQQP 160


>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
           This family consists of several Neisseria meningitidis
           TspB virulence factor proteins.
          Length = 502

 Score = 40.0 bits (93), Expect = 0.003
 Identities = 35/102 (34%), Positives = 40/102 (39%), Gaps = 22/102 (21%)

Query: 100 VIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTP-ESPRREPPPRHPNQYPETPQDSPR 158
           VIPR    PDLT    E P         P    +P E+P   P PR      E P   P 
Sbjct: 313 VIPR----PDLTPGSAEAP------EAQPLPEVSPAENPANNPNPR------ENPGTRPN 356

Query: 159 REPSL-RQPNQYPETPHEP--RRDSP--PRHPNQFPETPRKP 195
            EP     P+  P+T  +P  R DSP  P  PN      RK 
Sbjct: 357 PEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKERKE 398


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 39.6 bits (93), Expect = 0.004
 Identities = 16/117 (13%), Positives = 24/117 (20%), Gaps = 6/117 (5%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
               +  + PP   P         +    P L +  +      E                
Sbjct: 56  AAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAP----AAP 111

Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAIAVIRTE 248
             + PA   + A AE           A  R              D     I  I  +
Sbjct: 112 EAQAPAAPAERAAAENAARRLARAAAAAPR--PRVPADAAAAVADAVKARIERIVND 166



 Score = 33.0 bits (76), Expect = 0.45
 Identities = 18/121 (14%), Positives = 26/121 (21%), Gaps = 9/121 (7%)

Query: 99  AVIPRYSPSPDLTTRRPEVPRP--RHPTHHHPEHHPTP---ESPRREPPPRHP-NQYPET 152
           A+ P  + +P      P         P     E  P P   E  +R    R         
Sbjct: 50  ALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAE--PAPWLVEHAKRLTAQREQLVARAAA 107

Query: 153 PQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPT 212
           P     + P+        E     R         +         A    +     R    
Sbjct: 108 PAAPEAQAPAAPAERAAAENA-ARRLARAAAAAPRPRVPADAAAAVADAVKARIERIVND 166

Query: 213 T 213
           T
Sbjct: 167 T 167


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 38.9 bits (91), Expect = 0.004
 Identities = 22/106 (20%), Positives = 27/106 (25%), Gaps = 8/106 (7%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRRE--PSLRQPNQYPETPHEPRRDSPPRHPNQ-- 187
                P    P   P+  P  P      E   S    +  P  P   + D+PP  P    
Sbjct: 174 GEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSL-PPAPSSFQSDTPPPSPESPT 232

Query: 188 ---FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSD 230
               P  P  PP              PT  +  AT          D
Sbjct: 233 NPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278



 Score = 32.7 bits (75), Expect = 0.46
 Identities = 16/105 (15%), Positives = 26/105 (24%), Gaps = 14/105 (13%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
           P +   P+  +   +   P  P+    +  P        P P      P  P        
Sbjct: 195 PSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPL 254

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAE 206
           S  +P         P   + P                   IA+A+
Sbjct: 255 STAKPTP-------PSASATPAPIGGITLDDD-------AIAKAQ 285



 Score = 32.0 bits (73), Expect = 0.78
 Identities = 22/103 (21%), Positives = 28/103 (27%), Gaps = 9/103 (8%)

Query: 132 PTPESPRRE-------PPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
           P P  P  E             N +P    D     PS   P+  P  P  P     P  
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPS-DPPSSSPGVPSFPSPPEDPSS 206

Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
           P+     P  P +   D     P +    + PP        P 
Sbjct: 207 PSDSSLPPA-PSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPV 248



 Score = 30.4 bits (69), Expect = 2.2
 Identities = 18/100 (18%), Positives = 24/100 (24%), Gaps = 1/100 (1%)

Query: 104 YSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSL 163
              +   +    + P         P     P SP     P  P+ +           P+ 
Sbjct: 174 GEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTN 233

Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIA 203
             P   P  P  P     P      P TP    A    I 
Sbjct: 234 PSPPPGPAAPPPPPVQQVPPLSTAKP-TPPSASATPAPIG 272


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 39.8 bits (93), Expect = 0.005
 Identities = 24/136 (17%), Positives = 31/136 (22%), Gaps = 10/136 (7%)

Query: 101 IPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE 160
               + S    T       P  P        P P SP   PP   P   P +P       
Sbjct: 78  EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP------- 130

Query: 161 PSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARA---EPRTTPTTTTPP 217
                    P     P   + P      P       A     A        T    ++PP
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190

Query: 218 ATRRGESNPGPSDKPD 233
           A     + P  +    
Sbjct: 191 AEPPPSTPPAAASPRP 206



 Score = 38.6 bits (90), Expect = 0.009
 Identities = 20/137 (14%), Positives = 36/137 (26%), Gaps = 10/137 (7%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
            R  P+   ++ R   P P   +        +P +       R  +    +      R  
Sbjct: 280 SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGA 339

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDI------ARAEPRTTPTTTT 215
           ++        +P      S P  P       ++P              R   R       
Sbjct: 340 AVS----PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA 395

Query: 216 PPATRRGESNPGPSDKP 232
             A RR  +   P+ +P
Sbjct: 396 GRARRRDATGRFPAGRP 412



 Score = 35.9 bits (83), Expect = 0.069
 Identities = 25/133 (18%), Positives = 28/133 (21%), Gaps = 9/133 (6%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
                L    P                 TP+       P  P      P    R   S +
Sbjct: 797 RRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPP-PARSSESSK 855

Query: 165 QPNQYPETP---HEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
                          RR   P  P   P     P A     A A P        P    R
Sbjct: 856 SKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKA-----AAAAPPAGAPAPRPRPAPR 910

Query: 222 GESNPGPSDKPDT 234
            +  P P   PD 
Sbjct: 911 VKLGPMPPGGPDP 923



 Score = 35.1 bits (81), Expect = 0.11
 Identities = 30/129 (23%), Positives = 35/129 (27%), Gaps = 13/129 (10%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
           S S       P     R P+   P     P SPR+ P P      P      P R  +  
Sbjct: 333 SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392

Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
                       RRD+  R P        +P     D   A          P  T  GE 
Sbjct: 393 AVA-----GRARRRDATGRFPAG------RPRPSPLDAGAASGA--FYARYPLLTPSGEP 439

Query: 225 NPGPSDKPD 233
            PG    P 
Sbjct: 440 WPGSPPPPP 448



 Score = 31.7 bits (72), Expect = 1.4
 Identities = 18/130 (13%), Positives = 29/130 (22%), Gaps = 22/130 (16%)

Query: 112 TRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP---------------ETPQDS 156
                 P  R          P P   R        +                  E P   
Sbjct: 200 AAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPR 259

Query: 157 PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
           P       +  +     + P     P            P  R P  + + P + P  ++P
Sbjct: 260 PAPITLPTRIWE-ASGWNGPSSRPGPASS------SSSPRERSPSPSPSSPGSGPAPSSP 312

Query: 217 PATRRGESNP 226
            A+    S+ 
Sbjct: 313 RASSSSSSSR 322



 Score = 31.3 bits (71), Expect = 1.6
 Identities = 20/139 (14%), Positives = 32/139 (23%), Gaps = 11/139 (7%)

Query: 95  EKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQ 154
           E             +   RRP   R   P         +  + +R+     P+   E+  
Sbjct: 779 EPQRGAGSSPPVRAEAAFRRPGRLRRSGPA----ADAASRTASKRKSRSHTPDGGSESS- 833

Query: 155 DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTT 214
              R   +  +P      P     +S    P       R    R                
Sbjct: 834 GPARPPGAAARP------PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAA 887

Query: 215 TPPATRRGESNPGPSDKPD 233
            P A         P+ +P 
Sbjct: 888 PPKAAAAAPPAGAPAPRPR 906



 Score = 30.5 bits (69), Expect = 2.9
 Identities = 18/119 (15%), Positives = 27/119 (22%), Gaps = 2/119 (1%)

Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
            +  +        T  +         P   P     +       R    +  +   P   
Sbjct: 41  GQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPA-- 98

Query: 180 SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
           SP R  +  P  P  P    P    A P  +P        R   S   P         +
Sbjct: 99  SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157



 Score = 29.8 bits (67), Expect = 5.4
 Identities = 24/130 (18%), Positives = 34/130 (26%), Gaps = 7/130 (5%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
             SP+ T R P  P    P    P    +P  PRR  P         +P  +P R  +  
Sbjct: 176 LSSPEETARAPSSPPAEPPPST-PPAAASPRPPRRSSPISAS---ASSPAPAPGRSAADD 231

Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
                 ++             N   E P   PA      R    +     +        S
Sbjct: 232 AGASSSDSSSSESSGCGWGPEN---ECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288

Query: 225 NPGPSDKPDT 234
           +      P  
Sbjct: 289 SSPRERSPSP 298



 Score = 29.0 bits (65), Expect = 7.5
 Identities = 28/172 (16%), Positives = 40/172 (23%), Gaps = 21/172 (12%)

Query: 78  ESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESP 137
           E+     R+            +     SP+P   +     P    P    P   P     
Sbjct: 78  EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEM 137

Query: 138 RREPPPRHPN--QYPETPQDSPRREPS--------------LRQPNQYPETPHEPRRDSP 181
            R      P     P     SP    S                +  + P +P      S 
Sbjct: 138 LRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197

Query: 182 PRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPD 233
           P           +PP R   I+ +     P      A   G S+   S    
Sbjct: 198 PPAA-----ASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES 244


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 38.0 bits (88), Expect = 0.010
 Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 11/83 (13%)

Query: 140 EPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH---EPRRDSPPRHPNQFPETPRKPP 196
           EP    P +     + + + + + R     P       EP++      P    +T  KP 
Sbjct: 126 EPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKK------PQATAKTEPKPV 179

Query: 197 ARYPDIARAEPRTTPTTTTPPAT 219
           A+ P   R EP     +T  PA 
Sbjct: 180 AQTPK--RTEPAAPVASTKAPAA 200



 Score = 34.5 bits (79), Expect = 0.12
 Identities = 20/97 (20%), Positives = 33/97 (34%), Gaps = 9/97 (9%)

Query: 133 TPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETP 192
           + ++ + +   R     P   Q     EP  ++P    +T  +P   +P R     P   
Sbjct: 139 SRQTAKTQTAERPATTRPARKQAVI--EP--KKPQATAKTEPKPVAQTPKRTEPAAPVAS 194

Query: 193 RKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
            K PA     A + P    T TT P      +    +
Sbjct: 195 TKAPA-----ATSTPAPKETATTAPVQTASPAQTTAT 226


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 37.4 bits (87), Expect = 0.019
 Identities = 27/124 (21%), Positives = 42/124 (33%), Gaps = 15/124 (12%)

Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRH-----PNQYPETPQDSPRREP---SLRQPNQY 169
            +         +  P    PR+   P+      P  YP+ PQ  P +      L  P+Q 
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226

Query: 170 PETPHEPRRDSPPRHPN-----QFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
           P  P  P     P+ P      QFP   ++ P   P   + + +       PP   +   
Sbjct: 227 PAQPPLPP--QLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTP 284

Query: 225 NPGP 228
           +PG 
Sbjct: 285 HPGL 288



 Score = 34.4 bits (79), Expect = 0.15
 Identities = 20/107 (18%), Positives = 32/107 (29%), Gaps = 11/107 (10%)

Query: 106 PSPDLTTRRPEVPRPRHPTH----HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
             P L+ + P  P P+ P        P+  P P++     P     Q    P     +  
Sbjct: 247 QFPGLSQQMPPPP-PQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLL 305

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPR 208
            L Q  Q  +   + R         Q      +      +  RA+ R
Sbjct: 306 PLVQQPQGQQRGPQFREQLVQLSQQQ------REALSQEEAKRAKRR 346



 Score = 33.2 bits (76), Expect = 0.35
 Identities = 19/83 (22%), Positives = 31/83 (37%)

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
           PE  +P+       +    P  P + P    P Q P+ P  S +  P   QP Q  + P 
Sbjct: 211 PEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPP 270

Query: 175 EPRRDSPPRHPNQFPETPRKPPA 197
           +P+   PP++         +   
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQN 293



 Score = 33.2 bits (76), Expect = 0.40
 Identities = 30/140 (21%), Positives = 49/140 (35%), Gaps = 16/140 (11%)

Query: 97  SWAVIPRYSPSPDLTTRRPEVPR--PRHPTHHHPE-------HHPTPESPRREPPPRHPN 147
              + PR +  P      PE P   P+ P  H  +         P+    +   PP+ P 
Sbjct: 181 PQGMPPRQAAFP--QQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQ 238

Query: 148 QYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
           Q P   Q       S + P   P+ P + ++  PP+   Q P  P+  P  +P + + + 
Sbjct: 239 Q-PPPLQQPQFPGLSQQMPPPPPQPPQQQQQ--PPQPQAQPP--PQNQPTPHPGLPQGQN 293

Query: 208 RTTPTTTTPPATRRGESNPG 227
              P    P      +   G
Sbjct: 294 APLPPPQQPQLLPLVQQPQG 313



 Score = 31.3 bits (71), Expect = 1.4
 Identities = 14/90 (15%), Positives = 28/90 (31%), Gaps = 6/90 (6%)

Query: 118 PRPRHPTHHHPEHHPTPESPRR--EPPPRHPNQYPETPQDS----PRREPSLRQPNQYPE 171
           P          +    P   ++   PPP+ P Q  + PQ      P+ +P+        +
Sbjct: 233 PPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQ 292

Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPD 201
               P    P   P       ++   ++ +
Sbjct: 293 NAPLPPPQQPQLLPLVQQPQGQQRGPQFRE 322


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 37.2 bits (86), Expect = 0.021
 Identities = 31/165 (18%), Positives = 54/165 (32%), Gaps = 14/165 (8%)

Query: 66  PRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTH 125
              +++KE ++ E +          DR+EEK    +   S  P    ++    + + P  
Sbjct: 144 KEKEKEKEKKVEEPR----------DREEEKKRERVRAKS-RPKKPPKKKPPNKKKEPPE 192

Query: 126 HHPEHHPTPESPRREPP-PRHPNQYPETPQDSPRREPSLRQPNQYPETPH-EPRRDSPPR 183
              +     E+ + +P  P    +  +   D   RE +     +       E  R S   
Sbjct: 193 EEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSS 252

Query: 184 HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGP 228
                P      P       R E R   T+  PP+ R   + P P
Sbjct: 253 LKKPDPSPSMASPETRESSKRTETRPR-TSLRPPSARPASARPAP 296



 Score = 35.6 bits (82), Expect = 0.063
 Identities = 31/186 (16%), Positives = 59/186 (31%), Gaps = 28/186 (15%)

Query: 67  RNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHH 126
           + Q K+E +  + +P  E      DRK ++                ++ E PR R     
Sbjct: 110 KEQVKEEKKKKKEKPKEEPK----DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE-EEK 164

Query: 127 HPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETP-----------HE 175
             E       P++ P  + PN+  E P++  +R+ +       PE P             
Sbjct: 165 KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDG 224

Query: 176 PRRDSPPRHPNQF------------PETPRKPPARYPDIARAEPRTTPTTTTPPATRRGE 223
             R++      +               + +KP       +     ++  T T P T    
Sbjct: 225 KDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRP 284

Query: 224 SNPGPS 229
            +  P+
Sbjct: 285 PSARPA 290



 Score = 32.9 bits (75), Expect = 0.48
 Identities = 25/160 (15%), Positives = 51/160 (31%), Gaps = 13/160 (8%)

Query: 65  SPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPT 124
            P+N+  KE+   + Q   E+      +++ K         P  +   +RP   + +   
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKK---KEKPKEEP--KDRKPKEEAKEKRPPKEKEKE-- 149

Query: 125 HHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
               +     E   RE   +      ++    P ++    +  + P    + R+ +    
Sbjct: 150 ----KEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE-PPEEEKQRQAAREA- 203

Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
               PE P     R  +    + R T T+       R  S
Sbjct: 204 VKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSS 243



 Score = 29.9 bits (67), Expect = 3.5
 Identities = 18/103 (17%), Positives = 24/103 (23%), Gaps = 7/103 (6%)

Query: 62  EVLSPRNQRKKED-------RIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRR 114
           E      + K+ED            +    R  S   R+   S              TR 
Sbjct: 210 EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSP 157
                   P          P S R  PP     +     QD+ 
Sbjct: 270 SSKRTETRPRTSLRPPSARPASARPAPPRVKRKEIVTVLQDAQ 312


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 37.0 bits (85), Expect = 0.030
 Identities = 40/161 (24%), Positives = 51/161 (31%), Gaps = 35/161 (21%)

Query: 61  TEVLSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRP 120
           TE LS     KK+           RT+S   R      AV P                  
Sbjct: 572 TEALSLVVTPKKKRHKVTDTRITPRTVS---RILALRDAVGPAAG--------------- 613

Query: 121 RHPTHHHPEHHPTPESPR---REPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
                HH   HP+  S       PP RHP   P T        PSL Q   +   P +  
Sbjct: 614 ----THHQPLHPSSLSASMGFHPPPFRHPFPLPLT---VAIPNPSLHQSEVFMGYPFQSP 666

Query: 178 R-----DSPPRHPNQFPETPRKPPARYPDIARA--EPRTTP 211
                  SPP      P+ PR   + +P +  A   P ++P
Sbjct: 667 HLGAPSGSPPGKDRDSPDLPRPTTSLHPKLLSAHHHPGSSP 707


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 36.6 bits (84), Expect = 0.033
 Identities = 39/186 (20%), Positives = 56/186 (30%), Gaps = 35/186 (18%)

Query: 82  FRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPE------ 135
            R + +S C  KE K+     + +  P       EV  P +P+   P+  P  +      
Sbjct: 157 LRGKDVSCC--KEPKTAVTTSKTTSWPT------EVSHPTYPSQVTPQSQPATQGHQTAT 208

Query: 136 -----------------SPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPE---TPHE 175
                            +     P   P      P  SP+  PS    +Q        H 
Sbjct: 209 ANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHT 268

Query: 176 PRRDSPPRHPN-QFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDT 234
            RR +PP   N + P +   PP          P   PT TT   +    S+P       T
Sbjct: 269 QRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328

Query: 235 CDTSYD 240
                D
Sbjct: 329 TQNLVD 334


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 36.0 bits (83), Expect = 0.040
 Identities = 11/51 (21%), Positives = 14/51 (27%), Gaps = 1/51 (1%)

Query: 112 TRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQ-DSPRREP 161
            +    P         P    T     +   P       E P+ D PRRE 
Sbjct: 15  EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRER 65



 Score = 34.5 bits (79), Expect = 0.15
 Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 7/75 (9%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR-----HPN 186
           P P SP   P P  P +       +    P+   P        +PRR+  P+        
Sbjct: 19  PAPPSPAAAPAPPPPAKTAAPA--TKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKLE 76

Query: 187 QFPETPRKPPARYPD 201
            F   P++   R+ D
Sbjct: 77  DFVVEPQEGKTRFHD 91



 Score = 32.6 bits (74), Expect = 0.54
 Identities = 14/63 (22%), Positives = 23/63 (36%), Gaps = 2/63 (3%)

Query: 135 ESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH--EPRRDSPPRHPNQFPETP 192
           E+ +  P P  P   P  P  +    P+ +        P   +P++D P R     P + 
Sbjct: 13  EAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASL 72

Query: 193 RKP 195
            K 
Sbjct: 73  WKL 75



 Score = 29.5 bits (66), Expect = 4.6
 Identities = 10/43 (23%), Positives = 13/43 (30%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPR 144
              +P P   T  P                P  + PRRE  P+
Sbjct: 26  AAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPK 68


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 35.8 bits (83), Expect = 0.052
 Identities = 20/128 (15%), Positives = 28/128 (21%), Gaps = 13/128 (10%)

Query: 130 HHPTPESPRREP-PPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQF 188
             P   +P   P         P     +    P+       P         +P       
Sbjct: 382 ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAP-----PAAAPPAPVAAPAAAAP-- 434

Query: 189 PETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAIAVIRTE 248
              P   PA          +  P T   P   R    P  +        +  A  +  TE
Sbjct: 435 AAAPAAAPAAVALAPAPPAQAAPETVAIP--VRVAPEPAVASAAPAPAAAPAAARLTPTE 492

Query: 249 VFVFKDRW 256
                D W
Sbjct: 493 EG---DVW 497



 Score = 34.3 bits (79), Expect = 0.19
 Identities = 21/131 (16%), Positives = 25/131 (19%), Gaps = 8/131 (6%)

Query: 99  AVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPR 158
           A  P    +P           P       P     P +    P        P        
Sbjct: 372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAA------PPAAAPPAP 425

Query: 159 REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARY-PDIARAEPRTTPTTTTPP 217
                                  P  P Q        P R  P+ A A     P    P 
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAA-PA 484

Query: 218 ATRRGESNPGP 228
           A R   +  G 
Sbjct: 485 AARLTPTEEGD 495


>gnl|CDD|215479 PLN02887, PLN02887, hydrolase family protein.
          Length = 580

 Score = 36.0 bits (83), Expect = 0.054
 Identities = 20/99 (20%), Positives = 28/99 (28%), Gaps = 12/99 (12%)

Query: 134 PESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEP---RRDSPPRHPNQFPE 190
           P   R   P R P  +P   ++ P     L QP+    +P      R D      + F +
Sbjct: 3   PMLSRVSLPLRLPRLFPAAAREMPPAARLLLQPSSLSFSPKTSNFNRLDVTSTEFSSFRK 62

Query: 191 -----TPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
                   KP  R      A  R            + E 
Sbjct: 63  GFHAFRSTKPRGR----VWAAHRGYRKIRRRAPPPKKEQ 97


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 35.2 bits (81), Expect = 0.059
 Identities = 26/140 (18%), Positives = 44/140 (31%), Gaps = 14/140 (10%)

Query: 76  IGESQPFRERTLSICDRK---EEKSWAVIPRYSPSPDLTTRRPEVPRP-------RHPTH 125
             +   + ER ++   R     E++ A++ R +    L      +P         +   H
Sbjct: 51  SEKKAAYAERDMAFLQRDAWIAERNNALMERDNKFAALQPVENSLPPGCGILPGTKGIHH 110

Query: 126 HHPEHHPTPESPRREPPPRHPNQYPETPQDSP-RREPSLRQPNQYPETPHEPRRDSP--- 181
               HHP            H  + P      P  +E    +  Q P+ P  P+   P   
Sbjct: 111 LQMLHHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170

Query: 182 PRHPNQFPETPRKPPARYPD 201
               N+  + P   P   PD
Sbjct: 171 GSVSNRSVKMPGIDPRSKPD 190


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 35.7 bits (83), Expect = 0.060
 Identities = 17/94 (18%), Positives = 20/94 (21%)

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
              P P       P   P P  P               P  +P  +P+         T  
Sbjct: 420 AAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAA 479

Query: 175 EPRRDSPPRHPNQFPETPRKPPARYPDIARAEPR 208
                     P   P  P  P A       A  R
Sbjct: 480 PAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLR 513



 Score = 35.0 bits (81), Expect = 0.11
 Identities = 24/136 (17%), Positives = 36/136 (26%), Gaps = 7/136 (5%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRR-- 159
           P    +P              P        P   +P   P         +     P +  
Sbjct: 615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674

Query: 160 --EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
              P+   P   P  P  P   +P +     P     PPA   D   A+P       + P
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQ---PAPAPAATPPAGQADDPAAQPPQAAQGASAP 731

Query: 218 ATRRGESNPGPSDKPD 233
           +    +  P P +  D
Sbjct: 732 SPAADDPVPLPPEPDD 747



 Score = 35.0 bits (81), Expect = 0.12
 Identities = 18/117 (15%), Positives = 22/117 (18%)

Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQ 165
                    P    P  P        P  ++      P    Q    P  +      L  
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPP 743

Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRG 222
               P  P       PP         P   P   P     E       +     RR 
Sbjct: 744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800



 Score = 34.2 bits (79), Expect = 0.20
 Identities = 26/139 (18%), Positives = 37/139 (26%), Gaps = 14/139 (10%)

Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPR------REPPPRHPNQYPETPQDSPRR 159
           P+       P V        HHP+H   P++         +     P   P  P  +   
Sbjct: 636 PAEASAAPAPGVA----APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA 691

Query: 160 EPSLRQPNQ----YPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTT 215
            P+   P Q       TP   + D P   P Q  +    P     D     P        
Sbjct: 692 APAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDP 751

Query: 216 PPATRRGESNPGPSDKPDT 234
             A  +    P P+     
Sbjct: 752 AGAPAQPPPPPAPAPAAAP 770



 Score = 33.8 bits (78), Expect = 0.26
 Identities = 22/124 (17%), Positives = 30/124 (24%), Gaps = 6/124 (4%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
            P+P  +    E  RP       P     P +P        P +    P           
Sbjct: 600 PPAPASSGPPEEAARPA-----APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHP 654

Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
           +    P+   +     P +     P  P   PA     A A          P AT     
Sbjct: 655 KHVAVPD-ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQ 713

Query: 225 NPGP 228
              P
Sbjct: 714 ADDP 717



 Score = 33.8 bits (78), Expect = 0.26
 Identities = 20/106 (18%), Positives = 28/106 (26%), Gaps = 7/106 (6%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPR---REPSLRQPNQYPETPHEPRRDSPPRHPNQF 188
           P    P   P            Q +P      P+ +  +   + P   +  S P      
Sbjct: 678 PAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737

Query: 189 ----PETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSD 230
               P  P  PP      A+  P   P     PA     S P   +
Sbjct: 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783



 Score = 32.7 bits (75), Expect = 0.54
 Identities = 16/119 (13%), Positives = 21/119 (17%), Gaps = 13/119 (10%)

Query: 132 PTPESPR--------REPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR 183
           P                P P    Q    P  +P                  P      +
Sbjct: 406 PAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465

Query: 184 HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAI 242
                P                 P   P     PA     + P  +D   T    +  I
Sbjct: 466 -----PAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEI 519



 Score = 31.5 bits (72), Expect = 1.3
 Identities = 14/84 (16%), Positives = 17/84 (20%)

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
                        P+  P P      P P         P   P   PS +        P 
Sbjct: 416 APAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPE 475

Query: 175 EPRRDSPPRHPNQFPETPRKPPAR 198
                +P       P      PA 
Sbjct: 476 PTAAPAPAPPAAPAPAAAPAAPAA 499



 Score = 31.1 bits (71), Expect = 1.9
 Identities = 19/125 (15%), Positives = 26/125 (20%), Gaps = 11/125 (8%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHH-HPEHHPTPESPRREPPPR--HPNQYPETPQDSPR--R 159
           +P+    +     P          P     P       P     P   P +P  +     
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452

Query: 160 EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
            PS          P      +P       P     P A  P    A P        P   
Sbjct: 453 APSPPPAAAPSAQPAPAPAAAPEPTAAPAPA---PPAAPAPA---AAPAAPAAPAAPAGA 506

Query: 220 RRGES 224
               +
Sbjct: 507 DDAAT 511



 Score = 30.7 bits (70), Expect = 2.4
 Identities = 15/129 (11%), Positives = 25/129 (19%), Gaps = 1/129 (0%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
           P  +  P           P            +         P H  ++   P  S   + 
Sbjct: 609 PEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
              +          P               P +P         A     P    P A + 
Sbjct: 669 WPAKAGGAAPAAPPPAPAPAAPAAPA-GAAPAQPAPAPAATPPAGQADDPAAQPPQAAQG 727

Query: 222 GESNPGPSD 230
             +    +D
Sbjct: 728 ASAPSPAAD 736


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
           virus readthrough protein. This is generated via a
           readthrough of open reading frame 3 a coat protein
           allowing transcription of open reading frame 5 to give
           an extended coat protein with a large c-terminal
           addition or read through domain. The readthrough protein
           is thought to play a role in the circulative aphid
           transmission of potato leaf roll virus. Also in the
           family is open reading frame 6 from beet western yellows
           virus and potato leaf roll virus both luteovirus and an
           unknown protein from cucurbit aphid-borne yellows virus
           a closterovirus.
          Length = 460

 Score = 35.5 bits (82), Expect = 0.061
 Identities = 30/151 (19%), Positives = 41/151 (27%), Gaps = 11/151 (7%)

Query: 112 TRRPEVPRPRHPTH---HHPEHHPTP-ESPRREPPPRHPNQYPETPQDSPRREPSLRQPN 167
            RRP  PRP H            P P E        + P Q PE+  D+     SL   +
Sbjct: 244 VRRP--PRPGHLLVVSTPELVLPPEPDEQSSERQTFKTPPQ-PESSSDAENGLVSLVDED 300

Query: 168 QYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
              E   +   D+PP   +          A  P++  A          P         P 
Sbjct: 301 DKEEVSRDSESDAPPDDTDLTRALAEYEAA-APEVPDAARTVLQGKEQPSPDPVESPGP- 358

Query: 228 PSDKPDTCDTSYDAIAVIRTEVFVFKDRWMW 258
             D       S +          V + R   
Sbjct: 359 --DLTPGYPKSDEVAGTYLGGGSVAEGRDPL 387


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 34.8 bits (80), Expect = 0.12
 Identities = 17/97 (17%), Positives = 23/97 (23%), Gaps = 9/97 (9%)

Query: 133 TPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETP 192
            P    +   P      P  P  +P   P        P  P EP R++        P   
Sbjct: 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIP--PKEPVRETAT------PPPV 412

Query: 193 RKPPARYP-DIARAEPRTTPTTTTPPATRRGESNPGP 228
              P   P                P   +   + P P
Sbjct: 413 PPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449



 Score = 31.7 bits (72), Expect = 1.1
 Identities = 16/78 (20%), Positives = 19/78 (24%), Gaps = 1/78 (1%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
           +PSP   T  P          + P   P  E+    P P  P   P             R
Sbjct: 375 APSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHT-PESAPKLTR 433

Query: 165 QPNQYPETPHEPRRDSPP 182
                 E P       P 
Sbjct: 434 AAIPVDEKPKYTPPAPPK 451



 Score = 31.7 bits (72), Expect = 1.2
 Identities = 17/81 (20%), Positives = 23/81 (28%), Gaps = 9/81 (11%)

Query: 150 PETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRT 209
           P  P  +    P    P            + PP+ P +   TP   P R        P  
Sbjct: 368 PAKPT-AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR--------PVA 418

Query: 210 TPTTTTPPATRRGESNPGPSD 230
            P   TP +  +      P D
Sbjct: 419 PPVPHTPESAPKLTRAAIPVD 439


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 34.9 bits (80), Expect = 0.12
 Identities = 21/106 (19%), Positives = 24/106 (22%), Gaps = 5/106 (4%)

Query: 132 PTPESPRREPPPRHPNQYPET--PQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFP 189
           P   +P    PP  P   P       +    P+ R P               P       
Sbjct: 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451

Query: 190 ETPRKPPARYPDIARAEPRTTPTTTTPPATRR---GESNPGPSDKP 232
             P   PA     A A PR           R        P   D P
Sbjct: 452 PAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP 497



 Score = 34.1 bits (78), Expect = 0.18
 Identities = 24/134 (17%), Positives = 30/134 (22%), Gaps = 9/134 (6%)

Query: 99  AVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPR 158
           A  P  + +P    R      PR              +P R  P   P    + P     
Sbjct: 449 APAPAPAAAPAAAARPA-AAGPRPVA------AAAAAAPARAAPAAAPAPADDDPPPWEE 501

Query: 159 REPSLRQPNQYPETPHEPRRD--SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
             P    P               S P      P+   +  A  P  A A      T    
Sbjct: 502 LPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVV 561

Query: 217 PATRRGESNPGPSD 230
                  S  G  D
Sbjct: 562 APRPPRASASGLPD 575



 Score = 32.5 bits (74), Expect = 0.55
 Identities = 27/150 (18%), Positives = 33/150 (22%), Gaps = 14/150 (9%)

Query: 97  SWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
           + A  P   P+P         P P  P            +P      R     P     +
Sbjct: 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPA------APAAAPAAAAAARAVAAAPARRSPA 429

Query: 157 PRREPSLRQ-----PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTP 211
           P    + RQ     P   P     P                    A     ARA P   P
Sbjct: 430 PEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAP 489

Query: 212 TTTTPPATRRGESNPG--PSDKPDTCDTSY 239
                      E  P    S  P   D + 
Sbjct: 490 -APADDDPPPWEELPPEFASPAPAQPDAAP 518


>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP.  This family of
           genes is always found in type III secretion operons,
           althought its function in the processes of secretion and
           virulence is unclear. Hpa stands for Hrp-associated
           gene, where Hrp stands for hypersensitivity response and
           virulence.
          Length = 201

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 23/107 (21%), Positives = 31/107 (28%), Gaps = 23/107 (21%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
               +    P P         P    RR   L Q  +     + P               
Sbjct: 1   IIQGARLARPAP-------ADPARPARRRTPLAQLRRRDALAYAP--------------P 39

Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
           PR  P    D  R EPR     + PP     +++  P+  PD  D S
Sbjct: 40  PRPEPPPPCDEDRPEPRADTRASDPPPEAPTDAD--PAQPPDDPDAS 84


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 34.5 bits (79), Expect = 0.14
 Identities = 24/114 (21%), Positives = 31/114 (27%), Gaps = 8/114 (7%)

Query: 134 PESPRREPPPRHPNQY----PETPQDSPRREPS----LRQPNQYPETPHEPRRDSPPRHP 185
            E     PP    +      P  P  +PR  P+       P   P T   P     P   
Sbjct: 168 AEGTLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTT 227

Query: 186 NQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSY 239
              P T    P+      +A        T  P T  G   P  +  P    + Y
Sbjct: 228 TSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASRY 281



 Score = 31.8 bits (72), Expect = 0.81
 Identities = 19/92 (20%), Positives = 23/92 (25%), Gaps = 3/92 (3%)

Query: 108 PDLTTRRPEV-PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQP 166
           P L    P + P              T  SP   P P      P T   +P    +  Q 
Sbjct: 188 PALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQA 247

Query: 167 NQYPETPHEPRRDSPPRHPNQFPETPRKPPAR 198
              PE    P    P     + P     P   
Sbjct: 248 GTTPEAEGTP--APPTPGGGEAPPANATPAPE 277


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 33.4 bits (76), Expect = 0.16
 Identities = 33/123 (26%), Positives = 43/123 (34%), Gaps = 14/123 (11%)

Query: 110 LTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQY 169
           L T  P  P    PT  HP   P P      P P HP Q  +T +     +P      + 
Sbjct: 45  LETGYPFCP----PTTPHPSSQPPPCP----PSPGHPPQTNDTHEKDLALQPPPGGKKKE 96

Query: 170 PETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
            +     +       P+Q PE   +     P+    +P   P  T PP    GE   GPS
Sbjct: 97  KKKKETEKPAQGGEKPDQGPEAKGEGEGHEPE----DP--PPEDTPPPPGGEGEVEGGPS 150

Query: 230 DKP 232
             P
Sbjct: 151 PGP 153


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 34.4 bits (79), Expect = 0.18
 Identities = 16/118 (13%), Positives = 29/118 (24%), Gaps = 1/118 (0%)

Query: 111 TTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHP-NQYPETPQDSPRREPSLRQPNQY 169
           T+    VPRP        +   +  + R  PP   P          SPR       P   
Sbjct: 68  TSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFL 127

Query: 170 PETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
            +              +   E   +     P  + +          P    + +++  
Sbjct: 128 YQVNFPVILAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKHKTHKR 185



 Score = 30.1 bits (68), Expect = 3.3
 Identities = 27/186 (14%), Positives = 41/186 (22%), Gaps = 23/186 (12%)

Query: 66  PRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVI-PRYSPSPDLTTRR-PEVPRPRHP 123
           PR Q  + +             S  +  + +    + P  S S   +       P     
Sbjct: 5   PRGQPGRWN---YDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETG 61

Query: 124 THHHPEHHPTPESPRREPPPRHPNQYPETP--QDSPRREPSLRQPNQ----------YPE 171
           +            PR   PPR P Q  + P    + R  P    P               
Sbjct: 62  SGGGVATSTIYTVPR---PPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLG 118

Query: 172 TPHEPRRDS--PPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
                 R            P       +        E    P + +      G+  P   
Sbjct: 119 ADSTSPRFLYQVNFPVILAPIGESNSSSEELS-EEEEHSRPPPSESLKVKNGGKVYPKGF 177

Query: 230 DKPDTC 235
            K  T 
Sbjct: 178 SKHKTH 183


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 34.1 bits (78), Expect = 0.19
 Identities = 25/84 (29%), Positives = 34/84 (40%), Gaps = 13/84 (15%)

Query: 97  SWAVIPRYSPSPDLTTRRPEVPRP---RHPTHHHPEH-------HPTPES---PRREPPP 143
              V P  + SPD+  + PE   P   RHP    PE         P+P S   P ++PP 
Sbjct: 377 DGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPV 436

Query: 144 RHPNQYPETPQDSPRREPSLRQPN 167
                 P +PQ S  R  +  +P 
Sbjct: 437 ARSAPLPPSPQASAPRNVASGKPG 460



 Score = 29.5 bits (66), Expect = 5.1
 Identities = 16/76 (21%), Positives = 21/76 (27%), Gaps = 7/76 (9%)

Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
           P          PE    +P        PE P   P   PS       P +   P +  P 
Sbjct: 384 PAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPS-------PASAPTPEQQPPV 436

Query: 183 RHPNQFPETPRKPPAR 198
                 P +P+    R
Sbjct: 437 ARSAPLPPSPQASAPR 452


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 34.1 bits (78), Expect = 0.19
 Identities = 17/100 (17%), Positives = 26/100 (26%), Gaps = 11/100 (11%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
            TP   R    P        +         S + P    +TP     D     P      
Sbjct: 269 QTPMPERSWQTPAQTPARRISTPM-TEEIKSWQTP---LQTPAMYSSDYQAPKPEPIYTW 324

Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDK 231
                 R+P             ++ P +    S+ GP+ K
Sbjct: 325 EELLRERFPS-------DLFAISSLPDSDSEASDSGPTRK 357


>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
          Length = 2033

 Score = 34.4 bits (78), Expect = 0.19
 Identities = 34/137 (24%), Positives = 45/137 (32%), Gaps = 16/137 (11%)

Query: 105  SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
            S S   ++     P  R      P   P+P  PRR P  R  +      +D P   P   
Sbjct: 1818 SSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRSRSGR-RRERDRPSANPFRW 1876

Query: 165  QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
             P Q     H P   +P   P    + P           R  P  TP++ T   +R    
Sbjct: 1877 APRQRSRADHSPDGTAPGDAPLNLEDGP----------GRGRPIWTPSSATTLPSRS--- 1923

Query: 225  NPGPSDKPDTCDTSYDA 241
              GP D  D  +T   A
Sbjct: 1924 --GPEDSVDETETEDSA 1938


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 34.0 bits (78), Expect = 0.21
 Identities = 19/87 (21%), Positives = 22/87 (25%), Gaps = 11/87 (12%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
           P+       PP   P    +      R E  +R     P           P     F   
Sbjct: 346 PSEGVAAVAPPAPAPADLTQRLN---RLEKEVRSLRSAPTAAATAAGAPLP----DFDPR 398

Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPA 218
           PR PPA       A     P    P A
Sbjct: 399 PRGPPA----PEPARSAEAPPLVAPAA 421


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 33.5 bits (76), Expect = 0.22
 Identities = 21/101 (20%), Positives = 29/101 (28%), Gaps = 7/101 (6%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
           P     P      P +P    P    P  +P P  P R    ++PN  P   +   R +P
Sbjct: 180 PNMGMRPGFNQMPPHMPG-MPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQP 238

Query: 162 S-LRQPNQYPETPHEPRRDSPPR-----HPNQFPETPRKPP 196
                P        + R    P       P+        PP
Sbjct: 239 GGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFPP 279


>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region.  SelP is the
           only known eukaryotic selenoprotein that contains
           multiple selenocysteine (Sec) residues, and accounts for
           more than 50% of the selenium content of rat and human
           plasma. It is thought to be glycosylated. SelP may have
           antioxidant properties. It can attach to epithelial
           cells, and may protect vascular endothelial cells
           against peroxynitrite toxicity. The high selenium
           content of SelP suggests that it may be involved in
           selenium intercellular transport or storage. The
           promoter structure of bovine SelP suggest that it may be
           involved in countering heavy metal intoxication, and may
           also have a developmental function. The N-terminal
           region of SelP can exist independently of the C terminal
           region. Zebrafish selenoprotein Pb lacks the C terminal
           Sec-rich region, and a protein encoded by the rat SelP
           gene and lacking this region has also been reported.
           N-terminal region contains a conserved SecxxCys motif,
           which is similar to the CysxxCys found in thioredoxins.
           It is speculated that the N terminal region may adopt a
           thioredoxin fold and catalyze redox reactions. The
           N-terminal region also contains a His-rich region, which
           is thought to mediate heparin binding. Binding to
           heparan proteoglycans could account for the membrane
           binding properties of SelP. The function of the
           bacterial members of this family is uncharcterised.
          Length = 238

 Score = 33.3 bits (76), Expect = 0.23
 Identities = 15/70 (21%), Positives = 23/70 (32%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
                +T      P    P   HP HH   E         H + +P +   S  ++P   
Sbjct: 163 DECKKVTLATVNKPVEAEPRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQPDPD 222

Query: 165 QPNQYPETPH 174
           +P + P   H
Sbjct: 223 KPTEPPSGLH 232


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 33.8 bits (77), Expect = 0.29
 Identities = 27/114 (23%), Positives = 37/114 (32%), Gaps = 5/114 (4%)

Query: 132 PTPESPRRE-PPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH---EPRRDSPPRHPNQ 187
           P   +PRR+  PP  P   P +     RR  + R  +  PE        RR      P  
Sbjct: 431 PGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPP 490

Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDA 241
            P     P   Y  +    PR  P       T R +  P  +  P+  +  Y  
Sbjct: 491 EPAAAPSPATYYTRMGGGPPRLPPRNRATE-TLRPDWGPPAAAPPEQMEDPYLE 543


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 33.2 bits (76), Expect = 0.32
 Identities = 12/44 (27%), Positives = 15/44 (34%), Gaps = 3/44 (6%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHE 175
           P P+ P   PPP      PE   D P +           + P E
Sbjct: 269 PEPQPP--PPPPPPEPPEPEEEPDEPDQTDPDDGEET-DQIPEE 309



 Score = 29.4 bits (66), Expect = 6.0
 Identities = 11/50 (22%), Positives = 17/50 (34%), Gaps = 2/50 (4%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPE 151
           PR +  P+   + P  P P  P     E     ++          +Q PE
Sbjct: 261 PRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTD--PDDGEETDQIPE 308


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 33.4 bits (77), Expect = 0.36
 Identities = 15/77 (19%), Positives = 22/77 (28%), Gaps = 3/77 (3%)

Query: 126 HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHP 185
           +  +  P P+     P P  P   P      P   P    P     + + P +      P
Sbjct: 519 NTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPP--PPPTATQASSNAPAQIPADSSP 576

Query: 186 NQ-FPETPRKPPARYPD 201
               PE P   P +   
Sbjct: 577 PPPIPEEPTPSPTKDSS 593



 Score = 30.7 bits (70), Expect = 2.1
 Identities = 22/79 (27%), Positives = 26/79 (32%), Gaps = 1/79 (1%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
           S S    T  P    P  P    P   PT  +P   PPP  P     +     +      
Sbjct: 516 SASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSS 575

Query: 165 QPNQYPETP-HEPRRDSPP 182
            P   PE P   P +DS P
Sbjct: 576 PPPPIPEEPTPSPTKDSSP 594


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 33.1 bits (75), Expect = 0.44
 Identities = 16/62 (25%), Positives = 18/62 (29%), Gaps = 6/62 (9%)

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
           P VP P  P    P+       P    PP            SP    S R   Q   +P 
Sbjct: 432 PSVPSPGGPGSQPPQSVSGGMIP---SPPALMPSPSPQMSQSP---ASQRTIQQDMVSPG 485

Query: 175 EP 176
            P
Sbjct: 486 GP 487


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 32.5 bits (74), Expect = 0.49
 Identities = 23/134 (17%), Positives = 37/134 (27%), Gaps = 4/134 (2%)

Query: 103 RYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPS 162
             S   D                   ++     +   E   + P    +    S + +  
Sbjct: 43  MKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQIKIP 102

Query: 163 LRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPP----ARYPDIARAEPRTTPTTTTPPA 218
           + QP Q  + P  P+   P     Q PE  R+P     A  P    +        T  PA
Sbjct: 103 VPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQPA 162

Query: 219 TRRGESNPGPSDKP 232
               +  P    +P
Sbjct: 163 VPEQQVQPEEVVEP 176



 Score = 32.1 bits (73), Expect = 0.71
 Identities = 17/89 (19%), Positives = 27/89 (30%)

Query: 107 SPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQP 166
           S  +    P+ P+   P  H     P  +  +     R P    E P   P    + +  
Sbjct: 96  SAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPA 155

Query: 167 NQYPETPHEPRRDSPPRHPNQFPETPRKP 195
            Q  +     ++  P       PE  R P
Sbjct: 156 VQTVQPAVPEQQVQPEEVVEPAPEVKRPP 184


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 32.9 bits (75), Expect = 0.50
 Identities = 24/136 (17%), Positives = 33/136 (24%), Gaps = 16/136 (11%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
           PT  S   E P        E  + SP     LR P             + P         
Sbjct: 445 PTLASIPDEKPS--NISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTL 502

Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPAT---------RRGESNPGPSDKPDTC-DTSYDA 241
              PP       R    T PT + P +              S   P   P          
Sbjct: 503 SSIPPLSP----RQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAG 558

Query: 242 IAVIRTEVFVFKDRWM 257
           ++  R+   +  +R  
Sbjct: 559 LSFNRSVSDIEGERIG 574


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 32.7 bits (74), Expect = 0.55
 Identities = 21/80 (26%), Positives = 28/80 (35%), Gaps = 3/80 (3%)

Query: 98  WAVIPRYSPSPDLTTRRPEV-PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
             +    S +P  T   P V        HH     P P  P    P      +PE P  S
Sbjct: 367 RGLEKNPSTAPS-TPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAPSPS 425

Query: 157 PRREPSLRQPNQYPETPHEP 176
           P   P   QP+ +P+  + P
Sbjct: 426 PSALPP-GQPDLHPKAEYPP 444


>gnl|CDD|239805 cd04278, ZnMc_MMP, Zinc-dependent metalloprotease, matrix
           metalloproteinase (MMP) sub-family. MMPs are responsible
           for a great deal of pericellular proteolysis of
           extracellular matrix and cell surface molecules, playing
           crucial roles in morphogenesis, cell fate specification,
           cell migration, tissue repair, tumorigenesis, gain or
           loss of tissue-specific functions, and apoptosis. In
           many instances, they are anchored to cell membranes via
           trans-membrane domains, and their activity is controlled
           via TIMPs (tissue inhibitors of metalloproteinases).
          Length = 157

 Score = 31.4 bits (72), Expect = 0.55
 Identities = 8/16 (50%), Positives = 12/16 (75%)

Query: 6   WELPEDDKMAIQSLYG 21
           ++L +DD   IQ+LYG
Sbjct: 142 FKLSQDDIRGIQALYG 157


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 32.7 bits (75), Expect = 0.58
 Identities = 19/92 (20%), Positives = 25/92 (27%), Gaps = 7/92 (7%)

Query: 173 PHEPRR----DSPPRHPNQFPETPR--KPPARYPDIARAEPRTTPTTTTPPATRRGESNP 226
           PH  RR        +      E P    P    PD    +   +     P      E  P
Sbjct: 306 PHRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEP 365

Query: 227 GPSDKPDTCDTSYDAIAVI-RTEVFVFKDRWM 257
            P ++ D   +S D    I R  V        
Sbjct: 366 DPQEEADGQGSSTDPAGDIFRIRVLAPPQARA 397



 Score = 30.4 bits (69), Expect = 2.7
 Identities = 12/93 (12%), Positives = 25/93 (26%), Gaps = 3/93 (3%)

Query: 138 RREPPPRHPNQYPETPQDSPRR---EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRK 194
           RR+P  +   +  +  ++ P     +P      +       PR    P + ++    P++
Sbjct: 310 RRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQE 369

Query: 195 PPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
                                 P   R     G
Sbjct: 370 EADGQGSSTDPAGDIFRIRVLAPPQARARGASG 402


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 32.4 bits (73), Expect = 0.58
 Identities = 23/120 (19%), Positives = 36/120 (30%), Gaps = 12/120 (10%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHH--PTPESPRREPPPRHPNQYPETPQDSPRR 159
           P  +P    +        P       P+    PTP +  +  P   P+Q      D P  
Sbjct: 42  PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD-PAV 100

Query: 160 EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
            P L    + P+   E    +   H     +      ++ PD A        T  +PP  
Sbjct: 101 APQLAAAPK-PD-AAEAFTSAAQAHEAPA-DAGTSAASKKPDPA------AHTQHSPPPF 151


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 32.4 bits (73), Expect = 0.59
 Identities = 34/159 (21%), Positives = 45/159 (28%), Gaps = 26/159 (16%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
              SPSP  ++ R          HH P             P   P  +P +P    +   
Sbjct: 116 LFSSPSPQDSSPRLSA----FTQHHRPVITGHSGISASPHPTPSPLHFPTSPILPQQPSS 171

Query: 162 SLRQPN-QYPETPH---------------------EPRRDSPPRHPNQFPETPRKPPARY 199
                  +YP   H                     +P      + PN F  TP   P   
Sbjct: 172 YFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPP 231

Query: 200 PDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
           P +AR  P   P T  P  +  G +    S    T  TS
Sbjct: 232 PPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTPSTS 270


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 31.4 bits (71), Expect = 0.59
 Identities = 25/102 (24%), Positives = 37/102 (36%), Gaps = 11/102 (10%)

Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS-------PR 158
           P   L+ + P+  +  HP      HH     P ++P    P  +P  P          P 
Sbjct: 49  PIIPLSPQMPQQQQSAHPKL--TPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQPPA 106

Query: 159 REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYP 200
           + P      Q P+ P +P    PP  P Q P  P+     +P
Sbjct: 107 QHPLQPTYGQNPQ-PQQPTHTQPPVQPQQ-PADPQPGQPMFP 146



 Score = 31.4 bits (71), Expect = 0.68
 Identities = 23/78 (29%), Positives = 35/78 (44%), Gaps = 6/78 (7%)

Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
           P+ P    P HHP      ++P  + P Q+P  P     + P  +QP    + P +P++ 
Sbjct: 79  PQQPMMPVPGHHPMVPMTGQQPHLQPPAQHPLQPT--YGQNPQPQQPTH-TQPPVQPQQP 135

Query: 180 SPPRHPNQ--FPETPRKP 195
           + P  P Q  FP  P  P
Sbjct: 136 ADP-QPGQPMFPMQPLPP 152


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.7 bits (74), Expect = 0.60
 Identities = 38/130 (29%), Positives = 47/130 (36%), Gaps = 17/130 (13%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
           SP P + +  P  P  + P    P HHP    P   PPP       +  Q  P + P L 
Sbjct: 372 SPFPQMPSNLPPPPALK-PLSSLPTHHP----PSAHPPPLQLMPQSQPLQSVPAQPPVLT 426

Query: 165 Q----PNQYPETPHEPRRDSPPRHP-NQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
           Q    P +    PH      PP+ P  Q P T    PA         P  +  T+TP A 
Sbjct: 427 QSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPA-------IGPPPSLPTSTPAAP 479

Query: 220 RRGESNPGPS 229
            R  S   P 
Sbjct: 480 PRASSGSQPP 489



 Score = 32.4 bits (73), Expect = 0.77
 Identities = 30/133 (22%), Positives = 41/133 (30%), Gaps = 5/133 (3%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
               P        P +P   H     P     P S   +P     +Q P  P  S  +  
Sbjct: 260 SSRHPQSSHHGPGPPMP---HALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPH 316

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
           S   P+Q    P +P R+ P       P     P    P +     +  P    P    +
Sbjct: 317 SHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQ 376

Query: 222 GESN--PGPSDKP 232
             SN  P P+ KP
Sbjct: 377 MPSNLPPPPALKP 389



 Score = 32.0 bits (72), Expect = 0.94
 Identities = 19/108 (17%), Positives = 30/108 (27%), Gaps = 7/108 (6%)

Query: 122 HPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSP 181
            P        P   +     PP  P+     PQ SP    + +   Q  +        +P
Sbjct: 174 QPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSP---IAAQPAPQPQQPSPLSLISAP 230

Query: 182 PRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
             HP + P     P         ++    P   +    +     PGP 
Sbjct: 231 SLHPQRLPS----PHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPP 274



 Score = 29.7 bits (66), Expect = 4.4
 Identities = 27/110 (24%), Positives = 46/110 (41%), Gaps = 9/110 (8%)

Query: 116 EVPRPRHPTHHHPEHHPTPESPRREPP----PRH---PNQYPETPQDSPRREPSLRQPNQ 168
           E P P  P+  H +  PT   P+        P H   P+ +P+ P + P   P+L+  + 
Sbjct: 334 EQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPP-PPALKPLSS 392

Query: 169 YPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
            P T H P    PP       +  +  PA+ P + +++      +T P +
Sbjct: 393 LP-THHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHS 441


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 31.8 bits (72), Expect = 0.61
 Identities = 28/148 (18%), Positives = 37/148 (25%), Gaps = 13/148 (8%)

Query: 91  DRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP 150
           D       A  P  +          E      P H   E    PE    EP P       
Sbjct: 30  DAAGPHDDAAEPVLTDQIVPG---AEQAASAAPVHAAREATADPEFVAVEPVPTPHVPAV 86

Query: 151 ETPQDSP-RREPSLRQ--PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
             P D+    EP        +       P   +      Q P       A   D   A P
Sbjct: 87  ALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPA----GATAADAGDAAP 142

Query: 208 RTTP---TTTTPPATRRGESNPGPSDKP 232
             TP      +PPA  +  ++   +   
Sbjct: 143 DATPPAAGDASPPAAAQAAASAAAALTD 170


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 32.2 bits (73), Expect = 0.65
 Identities = 28/144 (19%), Positives = 40/144 (27%), Gaps = 35/144 (24%)

Query: 101 IPRYSPSPDLTTRRPEVP----RPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
           +P  S  P      P VP            +P + P P+ P +EP     +   E P+D 
Sbjct: 354 VPDESNVPPNP---PNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSN-KEVPEDV 409

Query: 157 PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
           P            PE   +   + P +  N+                       P    P
Sbjct: 410 PME----------PEDDRDNNFNEPKKPENKGDGQNE-----------------PVIPKP 442

Query: 217 PATRRGESNPGPSDKPDTCDTSYD 240
               R +SN      P     S D
Sbjct: 443 LDNERDQSNKNKQVNPGNRHNSED 466



 Score = 29.2 bits (65), Expect = 6.1
 Identities = 27/129 (20%), Positives = 39/129 (30%), Gaps = 20/129 (15%)

Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHH---PTPESPRREPPPRHPNQYPETPQDSPRREPS 162
           P+P   T   + PRP       P  +     P++P+   PP      P   +D       
Sbjct: 292 PAPVPPTPEDDNPRPTDDEFAVPNFNEGLDVPDNPQDPVPP------PNEGKDG------ 339

Query: 163 LRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRG 222
               N   E    P  D  P   N  P  P   P        ++    P    P    + 
Sbjct: 340 ----NPNEENLFPPGDDEVPDESNVPP-NPPNVPGGSNSEFSSDVENPPNPPNPDIPEQE 394

Query: 223 ESNPGPSDK 231
            + P  S+K
Sbjct: 395 PNIPEDSNK 403


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 32.5 bits (74), Expect = 0.65
 Identities = 18/115 (15%), Positives = 30/115 (26%), Gaps = 7/115 (6%)

Query: 113 RRPEVPRPRHPTH-HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPE 171
           R+  +  P        P +   P+      P   P          P       +PN    
Sbjct: 383 RQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL---RPNGLA- 438

Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNP 226
                   +P R+     + P   P  YP   ++ P +           +G  N 
Sbjct: 439 --PMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNK 491


>gnl|CDD|173550 PTZ00357, PTZ00357, methyltransferase; Provisional.
          Length = 1072

 Score = 32.4 bits (73), Expect = 0.79
 Identities = 26/117 (22%), Positives = 42/117 (35%), Gaps = 9/117 (7%)

Query: 136 SPRREPPPRHPNQYPETP------QDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFP 189
           +P    P  HP+  P+ P      +D    +      + YP+    P R +PP + +   
Sbjct: 428 APHGVAPQEHPHYRPDAPGTTLAEEDVVEAQEYEHPNHGYPDEDGAPLRRTPPTYHDGES 487

Query: 190 ETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAIAVIR 246
           E  +   A Y      +  T   T+    +    +  G        DTSY+A A  R
Sbjct: 488 EGSQ---ADYSLFDWYDSVTELETSHEADSAVAAAKGGSKGSGSASDTSYEAAADCR 541


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 31.6 bits (72), Expect = 0.82
 Identities = 31/143 (21%), Positives = 49/143 (34%), Gaps = 27/143 (18%)

Query: 93  KEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHP---TPE---------SPRRE 140
           K E+ W  I       +L  R+  VP P  P+     + P   T E         +  R+
Sbjct: 44  KPEERWRYIK------ELENRQIGVPTPTEPSAGGEVNSPTQLTDEQRQLLEQMQADMRQ 97

Query: 141 PPPRHP-----NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKP 195
            P +        Q P+ P+ + + +   +Q    P T  +P+  +PPR     P  P+ P
Sbjct: 98  QPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQP-PATTAQPQPVTPPRQTTA-PVQPQTP 155

Query: 196 PARYPDIARAEPRTTPTTTTPPA 218
                    A P T         
Sbjct: 156 APVRT--QPAAPVTQAVEAPKVE 176



 Score = 31.6 bits (72), Expect = 0.93
 Identities = 24/96 (25%), Positives = 35/96 (36%), Gaps = 8/96 (8%)

Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPET--PRKPPARYPDIARAEPRTTPTTTTPPATRR 221
           +QP Q  E P+  +    PR   Q  +    ++PPA     A+ +P T P  TT P   +
Sbjct: 97  QQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPAT---TAQPQPVTPPRQTTAPVQPQ 153

Query: 222 GESNPGPSDKPDTCDTSYDAIAVIRTEVFVFKDRWM 257
               P P         +    A         + RWM
Sbjct: 154 ---TPAPVRTQPAAPVTQAVEAPKVEAEKEKEQRWM 186



 Score = 30.0 bits (68), Expect = 2.8
 Identities = 17/81 (20%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 128 PEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
           P +  TP+ PR     +   Q  + P  + + +P    P +    P +P+  +P R    
Sbjct: 106 PYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQP--VTPPRQTTAPVQPQTPAPVRTQPA 163

Query: 188 FPETPRKPPARYPDIARAEPR 208
            P T      +       E R
Sbjct: 164 APVTQAVEAPKVEAEKEKEQR 184


>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin.  NC10
           stands for Non-helical region 10 and is taken from human
           COL15A1. A mutation in this region in human COL18A1 is
           associated with an increased risk of prostrate cancer.
           This domain is cleaved from the precursor and forms
           endostatin. Endostatin is a key tumour suppressor and
           has been used highly successfully to treat cancer. It is
           a potent angiogenesis inhibitor. Endostatin also binds a
           zinc ion near the N-terminus; this is likely to be of
           structural rather than functional importance according
           to.
          Length = 291

 Score = 31.6 bits (72), Expect = 0.83
 Identities = 13/85 (15%), Positives = 20/85 (23%), Gaps = 13/85 (15%)

Query: 96  KSWAVIP--RYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETP 153
             W  +      P P  T  +P            P   P    P   P P+  +      
Sbjct: 42  NGWRKVLLGELVPLPGTTATQPPPVV------LTPWSDPRLPDPPHLPDPQTHSATAHRN 95

Query: 154 QDSPRREPSLRQPNQYPETPHEPRR 178
                  P L  P +      + + 
Sbjct: 96  P-----HPPLNSPARIAHVHQDFQP 115



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 12/58 (20%), Positives = 20/58 (34%)

Query: 129 EHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPN 186
           E  P P +   +PPP     + +     P   P  +  +        P  +SP R  +
Sbjct: 51  ELVPLPGTTATQPPPVVLTPWSDPRLPDPPHLPDPQTHSATAHRNPHPPLNSPARIAH 108


>gnl|CDD|234706 PRK00269, zipA, cell division protein ZipA; Reviewed.
          Length = 293

 Score = 31.7 bits (72), Expect = 0.87
 Identities = 25/121 (20%), Positives = 31/121 (25%), Gaps = 14/121 (11%)

Query: 102 PRYSPSPDLTTRRPEVPRPRH--PTHHHPE--HHPTPESPRREPPPRHPNQYPETPQDSP 157
             YS  PD      E+  P     TH  P+   H  P    R    R   +  +  +   
Sbjct: 41  RSYSNLPDEEEGSAELLGPARVLDTHKEPQLDEHDLPSMSARPRERRRDTKTAKQQKRGR 100

Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
             EP   Q             D     P  F +               EPR  P    PP
Sbjct: 101 GSEP---QQGDLNL-------DLDEVEPALFSDRDDDFTPDKRKSKGREPRIEPPKELPP 150

Query: 218 A 218
            
Sbjct: 151 V 151


>gnl|CDD|132911 cd06844, STAS, Sulphate Transporter and Anti-Sigma factor
           antagonist domain found in the C-terminal region of
           sulphate transporters as well as in bacterial and
           archaeal proteins involved in the regulation of sigma
           factors.  The STAS (Sulphate Transporter and Anti-Sigma
           factor antagonist) domain is found in the C-terminal
           region of sulphate transporters as well as in bacterial
           and archaeal proteins involved in the regulation of
           sigma factors, like anti-anti-sigma factors and
           "stressosome" components. The sigma factor regulators
           are involved in protein-protein interaction which is
           regulated by phosphorylation.
          Length = 100

 Score = 29.8 bits (67), Expect = 0.89
 Identities = 21/78 (26%), Positives = 26/78 (33%), Gaps = 12/78 (15%)

Query: 275 QFWSELPHNLTHIDAVYQRIDNHIAFFIGRQYFL-FEGIRLLPGYPRPLTALGLPASLER 333
           QF  EL HN+T            I   I    F+   G  +L    R   A+G    L  
Sbjct: 25  QFKEELLHNIT------NVAGKTIVIDISALEFMDSSGTGVLLERSRLAEAVGGQFVLTG 78

Query: 334 IDAAMVWGYNSKTYLFSG 351
           I  A+       T   SG
Sbjct: 79  ISPAV-----RITLTESG 91


>gnl|CDD|214832 smart00817, Amelin, Ameloblastin precursor (Amelin).  This family
           consists of several mammalian Ameloblastin precursor
           (Amelin) proteins. Matrix proteins of tooth enamel
           consist mainly of amelogenin but also of non-amelogenin
           proteins, which, although their volumetric percentage is
           low, have an important role in enamel mineralisation.
           One of the non-amelogenin proteins is ameloblastin, also
           known as amelin and sheathlin. Ameloblastin (AMBN) is
           one of the enamel sheath proteins which is though to
           have a role in determining the prismatic structure of
           growing enamel crystals.
          Length = 411

 Score = 31.8 bits (72), Expect = 0.90
 Identities = 25/92 (27%), Positives = 37/92 (40%), Gaps = 9/92 (9%)

Query: 127 HPEHHPTPESPRREP--PPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
            P  H T +     P  PP  P+Q    PQ  P  +P L QP   P     P+++ P   
Sbjct: 80  RPREHETQQYEYSLPVHPPPLPSQPSLQPQ-QPGLKPFL-QPTALPTNQATPQKNGP--- 134

Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
             Q P    +PP +  ++    P+  P+   P
Sbjct: 135 --QPPMHLGQPPLQQAELPMIPPQVAPSDKPP 164


>gnl|CDD|218429 pfam05097, DUF688, Protein of unknown function (DUF688).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana.
          Length = 446

 Score = 31.7 bits (71), Expect = 0.91
 Identities = 24/97 (24%), Positives = 36/97 (37%), Gaps = 2/97 (2%)

Query: 109 DLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQ 168
           +L      V R         E   T  +  R   P  P    E     P REP    P +
Sbjct: 7   NLNAPLLSVRRVPSTPESSSESENTKTTSTRSIKPSCPVTKSEIDSG-PVREPG-SVPFK 64

Query: 169 YPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARA 205
           + +TP +P+ +S P+       TP+ PP R   +  +
Sbjct: 65  WEQTPGKPKDESKPQIQEHPDFTPKLPPGRSRTVPVS 101


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 32.1 bits (72), Expect = 0.93
 Identities = 35/134 (26%), Positives = 51/134 (38%), Gaps = 17/134 (12%)

Query: 90  CDRKEEKS---WAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREP----- 141
           CD +E  S   +   P   P  D  TR+      R  +  H +  P   + ++       
Sbjct: 768 CDEQEPLSVSPYGPEPDRPPDDDFETRKG---LKRKSSEDHADPIPEGNATKKTCGLQGL 824

Query: 142 PPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPD 201
           P   P   PET +D+P   P    P   P  P E      P+ P Q P+ P+ P     +
Sbjct: 825 PDSLPPAVPETDRDNPLLPPCPITPEGPPCPPREE-----PQQP-QEPQEPQSPSFHISE 878

Query: 202 IARAEPRTTPTTTT 215
           I  A   +TP + T
Sbjct: 879 IGEALFHSTPVSPT 892


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 31.9 bits (72), Expect = 0.97
 Identities = 14/93 (15%), Positives = 17/93 (18%), Gaps = 14/93 (15%)

Query: 132 PTPESPRREPPPRHP----NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
              +      P  H      Q P+ P   P   P+      YP  P            N 
Sbjct: 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNP 421

Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATR 220
                   P              P    P    
Sbjct: 422 QSPGTSYGP----------EPVGPVPPQPTNPY 444


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 31.1 bits (70), Expect = 1.0
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 180 SPPRHPN-QFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDT 234
           SPP   + + P TP   P   P+      ++T ++T P AT        P+  PDT
Sbjct: 34  SPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDT 89



 Score = 28.4 bits (63), Expect = 7.8
 Identities = 20/115 (17%), Positives = 33/115 (28%), Gaps = 13/115 (11%)

Query: 137 PRREPPPRHPNQ--------YPETPQDSPRREPS----LRQPNQYPETPHEPRRDSPPR- 183
           P   P  +HP Q           T +     EP+         Q  +    P   S  + 
Sbjct: 51  PDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQT 110

Query: 184 HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
                 +   +  +R             TTTT   +  G+     S +P +  T+
Sbjct: 111 TQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSATT 165


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 31.6 bits (72), Expect = 1.0
 Identities = 31/130 (23%), Positives = 45/130 (34%), Gaps = 21/130 (16%)

Query: 107 SPDLTTRRPEVPRPRHPTHHHPE-----HHPTPESPR--REPPPRHPNQYP--------- 150
           S +L    P+          HP      H P P  P+  R+ P   P + P         
Sbjct: 37  SSNLPLDHPKGTSSFPQLLRHPSLRLRLHPPPPPCPQNPRDFPSLQPLELPLHHSLLRHV 96

Query: 151 -ETPQDS--PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
            ET + S  P  +P L +  Q P   H  R  S P H +   +   +  AR  D+  +  
Sbjct: 97  HETLKVSQAPGEQPKLPRAPQLP--SHPRRHRSLPHHLHHSSKHRNRLHARRSDVLPSLS 154

Query: 208 RTTPTTTTPP 217
            + P    P 
Sbjct: 155 NSRPVHELPK 164



 Score = 30.1 bits (68), Expect = 3.2
 Identities = 27/123 (21%), Positives = 36/123 (29%), Gaps = 17/123 (13%)

Query: 107 SPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQP 166
           SP+L  R  ++P P             P  P +                  R  PS+  P
Sbjct: 351 SPNL--RSSQLPPP--TKRRLRL---LPVPPPKVQALPLT-----ALAPLVRHSPSIPLP 398

Query: 167 NQYPETPHEPRRDSPPRH--PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
           +     P      S   H  P      PR      P  + + P   P T  PPA+    S
Sbjct: 399 HPPSALPSHVGASSSKHHRLPPSVLPGPRLSS---PSPSPSLPTRRPGTPPPPASPPTPS 455

Query: 225 NPG 227
            P 
Sbjct: 456 PPS 458


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 31.5 bits (71), Expect = 1.1
 Identities = 27/140 (19%), Positives = 37/140 (26%), Gaps = 21/140 (15%)

Query: 107 SPDLTTRRPEVP---RPRHPTHHHPEH-----HPTP-------ESPRREPPPRHPNQYPE 151
            P    RR       R +     H        +P         E P    P R P+  PE
Sbjct: 39  EPADRARRGAGQARGRSQAERRFHHYDEARADYPYYTGSSSEDERPADPRPSRRPHAQPE 98

Query: 152 TPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTP 211
                P R        + P       R +    P   P        R    A A+     
Sbjct: 99  ASGPGPARGA------RGPAGSRGRGRRAESPSPRDPPNPKGASAPRGRKSACADSAALL 152

Query: 212 TTTTPPATRRGESNPGPSDK 231
               P A +R ++  G + K
Sbjct: 153 DAPAPAAPKRQKTPAGLARK 172


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 21/112 (18%), Positives = 32/112 (28%), Gaps = 8/112 (7%)

Query: 128 PEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
           P   P PE    E  P          +     EP+  +P + P+   +P++   P+   +
Sbjct: 52  PTEEPQPEPEPPEEQP------KPPTEPETPPEPTPPKPKEKPKPEKKPKK-PKPKPKPK 104

Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSY 239
               P+  P   P               P    R  S    S        SY
Sbjct: 105 PKPKPKVKPQPKPKK-PPSKTAAKAPAAPNQPARPPSAASASGAATGPSASY 155



 Score = 28.6 bits (64), Expect = 7.4
 Identities = 24/110 (21%), Positives = 36/110 (32%), Gaps = 12/110 (10%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETP----QDSPRRE 160
            P+   T   P  P+P+      PE  P    P+ +P P+   +    P      S    
Sbjct: 69  PPTEPETPPEPTPPKPKEKPK--PEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126

Query: 161 PSLRQPNQY--PETPHEPRRDSPPRHPNQFPETPRKP---PARYPDIARA 205
            +   PNQ   P +       +       +    R+      RYP  ARA
Sbjct: 127 KAPAAPNQPARPPSAASASGAATGP-SASYLSGLRRAIRRAPRYPAQARA 175


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 31.3 bits (71), Expect = 1.4
 Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 8/92 (8%)

Query: 91  DRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP 150
            +  E +    P+    PD      E    +    +  E +P+  +P ++P P    +  
Sbjct: 40  IKPNENT----PKIPKKPD-NKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEI 94

Query: 151 ETPQDSPRR---EPSLRQPNQYPETPHEPRRD 179
           E P+D P++   +P   QPN          + 
Sbjct: 95  EKPKDEPKKPDKKPQADQPNNVHADQPNNNKV 126


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 31.4 bits (71), Expect = 1.4
 Identities = 28/168 (16%), Positives = 40/168 (23%), Gaps = 21/168 (12%)

Query: 63  VLSPRNQRKKEDRIGESQPFRERTLS--------ICDRKEEKSWAVIPRYSPSPDLTTRR 114
           + SP                 ER           +      +S AV P +       +R 
Sbjct: 286 LSSPAPDSSSFSSPSGESGLEEREAEEPILASDEVAKEPAGESPAVSPSFEREKSEKSRH 345

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS-------PRREPSLRQPN 167
              P+ R  +     +   P+  R  P        P  P+D        P  E S R   
Sbjct: 346 ESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDESEIAVKPPTEESSRRPEE 405

Query: 168 QYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTT 215
           +    P E   +  P                 P         TP   T
Sbjct: 406 EKHRFPSEDVWEDSPSSLQDTATVST------PSNPPPRASETPEQET 447


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 31.0 bits (70), Expect = 1.4
 Identities = 19/82 (23%), Positives = 24/82 (29%), Gaps = 1/82 (1%)

Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQY-PETPHEPRRDSP 181
           P   + +  P P  P        P   P  P  +    P    PN   P  P +P    P
Sbjct: 34  PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPP 93

Query: 182 PRHPNQFPETPRKPPARYPDIA 203
           P      P  P   P R  +  
Sbjct: 94  PPVDPNAPPPPAPEPGRIDNAV 115


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 264

 Score = 31.0 bits (70), Expect = 1.5
 Identities = 26/128 (20%), Positives = 38/128 (29%), Gaps = 18/128 (14%)

Query: 103 RYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESP----------RREPPPRHPNQYPET 152
            +S    L  R+  VP+P  P         TPE            + +P           
Sbjct: 63  VWSYIKALEDRQIGVPQPTEPAAVKDAERLTPEQRQLLEQMEVDQKAQPTQLGEQPEQAR 122

Query: 153 PQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRK-PPARYPDIARAEPRTTP 211
            ++ PR +    Q  Q   T  +      P  P   PE P+   PA  P+     P+   
Sbjct: 123 IEEQPRTQS---QKAQSQATTVQT----QPVKPKPRPEKPQPVAPAPAPEPVEKAPKAEA 175

Query: 212 TTTTPPAT 219
                P  
Sbjct: 176 APPPKPKA 183


>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
           (NESP55).  This family consists of several mammalian
           neuroendocrine-specific golgi protein P55 (NESP55)
           sequences. NESP55 is a novel member of the chromogranin
           family and is a soluble, acidic, heat-stable secretory
           protein that is expressed exclusively in endocrine and
           nervous tissues, although less widely than
           chromogranins.
          Length = 261

 Score = 30.6 bits (68), Expect = 1.6
 Identities = 29/112 (25%), Positives = 41/112 (36%), Gaps = 18/112 (16%)

Query: 128 PEHHPTPESPRREPPPRHPNQYP-----ETPQDSPRREPSLRQPNQYPETPHEPRR---D 179
           PE  P P  P+     +   +       ++   SPRR P   Q  +      EP R   D
Sbjct: 152 PEDEPGPVVPKGATFHQSLTERLHALKLQSADASPRRAPPSTQEPESAREGEEPERGPLD 211

Query: 180 SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDK 231
             PR P +  E  ++   + P   R +P+        PA RR  S   P  K
Sbjct: 212 KDPRDPEEEEE-EKEEEKQQPH--RCKPKK-------PARRRDPSPESPPKK 253



 Score = 28.7 bits (63), Expect = 7.9
 Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 3/75 (4%)

Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
           PR   P+   PE     E P R P  + P    E  ++   +E   +QP++        R
Sbjct: 186 PRRAPPSTQEPESAREGEEPERGPLDKDPRDPEEEEEE---KEEEKQQPHRCKPKKPARR 242

Query: 178 RDSPPRHPNQFPETP 192
           RD  P  P +    P
Sbjct: 243 RDPSPESPPKKGAIP 257


>gnl|CDD|233777 TIGR02205, septum_zipA, cell division protein ZipA.  This model
           represents the full length of bacterial cell division
           protein ZipA. The N-terminal hydrophobic stretch is an
           uncleaved signal-anchor sequence. This is followed by an
           unconserved, variable length, low complexity region, and
           then a conserved C-terminal region of about 140 amino
           acids (see pfam04354) that interacts with the
           tubulin-like cell division protein FtsZ [Cellular
           processes, Cell division].
          Length = 284

 Score = 30.6 bits (69), Expect = 1.7
 Identities = 15/128 (11%), Positives = 25/128 (19%), Gaps = 21/128 (16%)

Query: 92  RKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPE--------------SP 137
            K       +    P+ +   +    P  R     HP   P  +                
Sbjct: 34  DKAPLDRMKLKSDDPTSEEMVQPDNSPNTRVERGEHPIPQPREQHLPSISELVAYQRDKS 93

Query: 138 RREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHE-PRRDSPPRHPNQFPETP--RK 194
             +     P Q    P       P      Q    P              +  +      
Sbjct: 94  VDDEEASIPMQ----PTQQQYDMPQPNNVAQQTVEPRVAKSLPEASPQEEEVGKNLEVTA 149

Query: 195 PPARYPDI 202
           PP +   +
Sbjct: 150 PPKQKDKV 157


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.0 bits (71), Expect = 1.8
 Identities = 21/128 (16%), Positives = 35/128 (27%), Gaps = 7/128 (5%)

Query: 99  AVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPP---PRHPNQYPETPQD 155
           A   + + +P      P+ P    P    P+  P    P         R   Q  +    
Sbjct: 378 AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK 437

Query: 156 SPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARY----PDIARAEPRTTP 211
           + + EP+     +   +  E      P          +K   R+    P   + EP  TP
Sbjct: 438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATP 497

Query: 212 TTTTPPAT 219
                   
Sbjct: 498 KALKKALE 505


>gnl|CDD|191194 pfam05111, Amelin, Ameloblastin precursor (Amelin).  This family
           consists of several mammalian Ameloblastin precursor
           (Amelin) proteins. Matrix proteins of tooth enamel
           consist mainly of amelogenin but also of non-amelogenin
           proteins, which, although their volumetric percentage is
           low, have an important role in enamel mineralisation.
           One of the non-amelogenin proteins is ameloblastin, also
           known as amelin and sheathlin. Ameloblastin (AMBN) is
           one of the enamel sheath proteins which is though to
           have a role in determining the prismatic structure of
           growing enamel crystals.
          Length = 410

 Score = 30.7 bits (69), Expect = 1.9
 Identities = 23/89 (25%), Positives = 35/89 (39%), Gaps = 3/89 (3%)

Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
           P  H  + +         P P  P+  P+ P   P  +P+     Q  +TP +     PP
Sbjct: 96  PREHETQQYEYSVPVHPPPLPSQPSLQPQQPGLKPFLQPTAATAIQ--DTPQKA-GPQPP 152

Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTP 211
            HP Q P    + P  +  +A +E   TP
Sbjct: 153 MHPGQLPLQEAELPEVHEQVAPSEKPPTP 181


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 30.8 bits (69), Expect = 2.0
 Identities = 8/57 (14%), Positives = 10/57 (17%), Gaps = 7/57 (12%)

Query: 140 EPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPP 196
             P       P      P  +P+          PH       P         P    
Sbjct: 208 AEPAEEEAPAPSEAGSEPAPDPA-------ARAPHAAPDPPAPAPAPAKTAAPAAAA 257



 Score = 28.8 bits (64), Expect = 9.0
 Identities = 9/55 (16%), Positives = 15/55 (27%), Gaps = 2/55 (3%)

Query: 166 PNQYPETPHEPRRDSPPRH-PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
           P +  E       ++     P+     P   P      A A  +T       P +
Sbjct: 207 PAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAP-APAPAKTAAPAAAAPVS 260


>gnl|CDD|220828 pfam10630, DUF2476, Protein of unknown function (DUF2476).  This is
           a family of proteins of unknown function. The family is
           rich in proline residues.
          Length = 255

 Score = 30.6 bits (68), Expect = 2.0
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 6/71 (8%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPR----REPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
            +P    + P P  P+  P     SPR     + S+ +P   P++P +P   SP  +P +
Sbjct: 182 FSPYRDGQVPGPWSPSASPVAESCSPRFIWSLKGSMLKP--LPDSPLQPLPPSPSPNPQE 239

Query: 188 FPETPRKPPAR 198
               P +PP +
Sbjct: 240 QSPLPTRPPCK 250


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 30.4 bits (69), Expect = 2.2
 Identities = 12/81 (14%), Positives = 21/81 (25%), Gaps = 2/81 (2%)

Query: 135 ESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRK 194
           E P         ++  ++    P+         Q P  P +P         +   ++   
Sbjct: 173 EVPDDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPNPQQP--MPSASSESATSKSAST 230

Query: 195 PPARYPDIARAEPRTTPTTTT 215
                P      PR  P  T 
Sbjct: 231 SRESSPQPQSPPPRRVPAPTV 251


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 30.7 bits (69), Expect = 2.2
 Identities = 33/151 (21%), Positives = 48/151 (31%), Gaps = 10/151 (6%)

Query: 84  ERTLSICDRKEEKSWAVIPR----YSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRR 139
           E      + +  +  AV+PR    Y+   DL  + P  P P  P+               
Sbjct: 350 EAPSPPIEEEPPQPKAVVPRPLSPYTAYEDL--KPPTSPIPTPPSSSPAS--SKSVDAVA 405

Query: 140 EPPPRHPNQYPETPQDSPRREP-SLRQPNQYPETPHEPRRD-SPPRHPNQFPETPRKPPA 197
           +P        P +  + P  EP  +      P +P+    D  PP  P+    T   P  
Sbjct: 406 KPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSV 465

Query: 198 RYPDIARAEPRTTPTTTTPPATRRGESNPGP 228
                  A P T P T    A     +N  P
Sbjct: 466 SSTSSVPAVPDTAPATAATDAAAPPPANMRP 496


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 30.5 bits (69), Expect = 2.3
 Identities = 20/130 (15%), Positives = 33/130 (25%), Gaps = 16/130 (12%)

Query: 103 RYSPSPDLTTRRPEVPRPRHPTHHH----PEHHPTPESPRREPPPRHPNQYPETPQDSPR 158
           + S  P    +     R                 T +  R +  P    +  E  Q    
Sbjct: 159 QSSRLPPRMAKHEAQERLTKKKKRGQKKSKYKKKTFKPKRAKSIP-DFEKLHEKFQKQLA 217

Query: 159 R-----EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTT 213
                  P++ +P  + E+     R    +      E   KP        R  P +T   
Sbjct: 218 EKKKSKRPTVPEPFNFQESHKSSSRTYLDQENISAGEENLKP------TRRKLPPSTKKW 271

Query: 214 TTPPATRRGE 223
            +     R E
Sbjct: 272 ESLVKFLRTE 281


>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
           family.  The immune response of the purple sea urchin
           appears to be more complex than previously believed in
           that it uses immune-related gene families homologous to
           vertebrate Toll-like and NOD/NALP-like receptor families
           as well as C-type lectins and a rudimentary complement
           system. In addition, the species also produces this
           unusual family of mRNAs, also known as 185/333, which is
           strongly upregulated in response to pathogen challenge.
          Length = 312

 Score = 30.4 bits (68), Expect = 2.3
 Identities = 33/153 (21%), Positives = 45/153 (29%), Gaps = 22/153 (14%)

Query: 78  ESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTP--- 134
           E QPF +R  S     EE       R  P            R           + T    
Sbjct: 159 EEQPFGQRNES----SEEDG-----RPHPHHHRHHGHNHFGRKPFGGRPFGRRNHTEGHQ 209

Query: 135 -ESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD---------SPPRH 184
             +   + P RH N+  +  QD P  E    + N +   P   R           SP R 
Sbjct: 210 GHNETGDHPHRHHNKTGDGDQDRPMFEMRPFRFNPFGRKPFGDRPFDRRNGTEEGSPRRD 269

Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
            ++ P   R          +  P T   TT+ P
Sbjct: 270 GHRRPYGNRGRWGENESEEKEHPTTESVTTSSP 302


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 30.5 bits (69), Expect = 2.5
 Identities = 12/68 (17%), Positives = 13/68 (19%), Gaps = 2/68 (2%)

Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
            PS       P  P    P    P     P  P    P   P          P R     
Sbjct: 384 PPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPA--AAAPPARSADPA 441

Query: 165 QPNQYPET 172
                 + 
Sbjct: 442 AAASAGDR 449



 Score = 29.3 bits (66), Expect = 5.5
 Identities = 15/63 (23%), Positives = 18/63 (28%), Gaps = 2/63 (3%)

Query: 170 PETPHEPRRDSPPRHPNQFPET--PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
           P  P        P  P   P    P  PPA     A A P   P      A     ++P 
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPA 441

Query: 228 PSD 230
            + 
Sbjct: 442 AAA 444



 Score = 28.9 bits (65), Expect = 7.7
 Identities = 20/107 (18%), Positives = 23/107 (21%), Gaps = 22/107 (20%)

Query: 98  WAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPR---HPNQYPETPQ 154
           W      +P+       P VP             P P  P    PP     P        
Sbjct: 389 WGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAASAGD 448

Query: 155 -----------DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPE 190
                        P    SL Q +        P   S       FPE
Sbjct: 449 RWRAFVAFVKGKKPALGASLEQGS--------PLGVSAGLLEIGFPE 487


>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin.  The members of this family are
           enzymes that cleave peptides. These proteases require
           zinc for catalysis.
          Length = 159

 Score = 29.5 bits (67), Expect = 2.6
 Identities = 9/15 (60%), Positives = 10/15 (66%)

Query: 7   ELPEDDKMAIQSLYG 21
            L +DD   IQSLYG
Sbjct: 145 RLDQDDIKGIQSLYG 159


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 28.5 bits (64), Expect = 3.0
 Identities = 15/56 (26%), Positives = 23/56 (41%)

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYP 170
           P   + + P    P H   P  P   P  +  +  P  PQ  P++ P  +QP Q+ 
Sbjct: 11  PPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFS 66


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 29.4 bits (66), Expect = 3.2
 Identities = 21/77 (27%), Positives = 28/77 (36%), Gaps = 4/77 (5%)

Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
           P+ P    P  H    +   +P    P Q P  PQ     +P   Q    P+ P  P   
Sbjct: 69  PQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPL---QPPQPQQPMQPQPPVHPIPP 125

Query: 180 SPPRHPNQFPETPRKPP 196
            PP+ P   P  P +P 
Sbjct: 126 LPPQPPLP-PMFPMQPL 141



 Score = 29.0 bits (65), Expect = 4.0
 Identities = 25/86 (29%), Positives = 33/86 (38%), Gaps = 4/86 (4%)

Query: 101 IPRYSPSPDLTTRRPEVPRPRH----PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
           IP       +  ++P +P P      PT HH  + P P     +P P  P Q  +  Q  
Sbjct: 58  IPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQ 117

Query: 157 PRREPSLRQPNQYPETPHEPRRDSPP 182
           P   P    P Q P  P  P +  PP
Sbjct: 118 PPVHPIPPLPPQPPLPPMFPMQPLPP 143



 Score = 28.6 bits (64), Expect = 4.9
 Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 16/92 (17%)

Query: 123 PTHHHPEHHPTPESPRREP-PPRHP-----NQYPETPQDSPRREPSLRQPNQYPETP--- 173
           PTH    HH  P  P ++P  P+ P      Q+  TP  +   +P+L QP Q P  P   
Sbjct: 48  PTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTP--TQHHQPNLPQPAQQPFQPQPL 105

Query: 174 -----HEPRRDSPPRHPNQFPETPRKPPARYP 200
                 +P +  PP HP          P  +P
Sbjct: 106 QPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFP 137


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 20/95 (21%), Positives = 28/95 (29%), Gaps = 3/95 (3%)

Query: 141 PPPRHPNQY--PETPQDSPRREPSLR-QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPA 197
              R P Q+  P   Q +   +PS     +  P       + S P+   Q   TP     
Sbjct: 370 SGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSV 429

Query: 198 RYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
             P      P +T      PA  + E     S   
Sbjct: 430 DPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVS 464


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 29.6 bits (66), Expect = 3.7
 Identities = 17/43 (39%), Positives = 23/43 (53%), Gaps = 4/43 (9%)

Query: 180 SPPRHPNQFPETPR-KPPARYPDIARAE-PRTTPTTTTPPATR 220
           S P  P+   E P+ KP A  P   + E P+ TP  TTP +T+
Sbjct: 43  STP--PSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTK 83



 Score = 29.2 bits (65), Expect = 5.9
 Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 5/47 (10%)

Query: 190 ETPRKPPARYPDIARAEPRTTPTTTTPPATR--RGESNPGPSDKPDT 234
           E P+  P   P      P++ P  TTPP+T+    +  P  +    T
Sbjct: 39  EAPQSTP---PSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSST 82


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 29.3 bits (66), Expect = 3.7
 Identities = 9/57 (15%), Positives = 14/57 (24%), Gaps = 8/57 (14%)

Query: 111 TTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPN 167
           +T+          T   P    T E+              ETP +      +    N
Sbjct: 97  STKTSPTVSTTVTTTTSPSETDTEEATTTVS--------TETPTEGGSSAATDPSKN 145


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 30.0 bits (67), Expect = 3.8
 Identities = 23/103 (22%), Positives = 28/103 (27%), Gaps = 15/103 (14%)

Query: 104 YSPSPDLTTRRPEVPRPRHPTHHHPEHHP-TPESPRREPPPRHPNQYPETPQDSPRREPS 162
           Y P P  +   P  P         PE  P    +P RE         P     +   EP 
Sbjct: 264 YEPPPAPSGGSPA-PPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPK 322

Query: 163 LR---------QPNQYPETPHEPRRDSPPRHPNQFPETPRKPP 196
                      +P  +P          PP  P   P  PR  P
Sbjct: 323 PGPPRPAPDADRPEGWPSLEAITF---PPPTPAT-PAVPRARP 361


>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP).  This entry
           represents proteins encoded by genes which are always
           found in type III secretion operons, although their
           function in the processes of secretion and virulence is
           unclear. Hpa stands for Hrp-associated gene, where Hrp
           stands for hypersensitivity response and virulence. see
           also PMID:18584024.
          Length = 185

 Score = 29.0 bits (65), Expect = 4.0
 Identities = 17/98 (17%), Positives = 23/98 (23%), Gaps = 7/98 (7%)

Query: 155 DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTT 214
                  + R  +          R +    P   P  P  P           PR  P   
Sbjct: 3   AGAAPRAARRSFDYAR-----LMRRAARAGPPGTPA-PPGPAEDAHPEFPERPRDAPAPP 56

Query: 215 TPPATRRGESNPGPS-DKPDTCDTSYDAIAVIRTEVFV 251
            PP    G+ +P P  D          A      +  V
Sbjct: 57  APPRATDGDRDPQPLADALAEAMARAAAARFAAGQRMV 94


>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
          Length = 346

 Score = 29.7 bits (67), Expect = 4.1
 Identities = 21/84 (25%), Positives = 29/84 (34%), Gaps = 2/84 (2%)

Query: 91  DRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREP-PPRHPNQY 149
           D  + KS  V   Y  S D + +R   PR        P      +  RR     R+    
Sbjct: 3   DELKGKSGKVKVMYVRSDDDSDKRTHNPRTGKGGGR-PSGKSRADGGRRPARDDRNSQSR 61

Query: 150 PETPQDSPRREPSLRQPNQYPETP 173
               +DSP R  S    ++ PE  
Sbjct: 62  DRKWEDSPWRTVSRAPGDETPEKA 85


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 29.6 bits (66), Expect = 4.1
 Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 5/81 (6%)

Query: 94  EEKSWAVIPRYSPSPDLTTRRPEVPRPRH-PTHHHPEHHPTPESPRREPPPRHPNQYPET 152
           +E   A      P P    R P  P   +      P      E+P    PPR     P+ 
Sbjct: 191 DEPDAAAQRPPQPGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAP---SPPRTDTLDPDP 247

Query: 153 PQDSPRREPSLRQPNQYPETP 173
               P R    R P+  P +P
Sbjct: 248 AIAGPSR-AVNRTPSPRPSSP 267



 Score = 29.6 bits (66), Expect = 4.8
 Identities = 36/195 (18%), Positives = 55/195 (28%), Gaps = 19/195 (9%)

Query: 57  TNLSTEVLSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPE 116
           T +      P   R  +    +  P   R   +   +       + +  PS        E
Sbjct: 122 TPIPCFAEVPVFPRPYQSSGDDDGPSTSRGSGVARVRPTVIQHRVDKTRPS------DYE 175

Query: 117 VPRPRHPTHHHPEHHPTPE---SPRREPPPRHPNQYPETP---------QDSPRREPSLR 164
             RPR     +P     P+       +P P   N+ P TP                    
Sbjct: 176 NHRPRPFAMANPSWVDEPDAAAQRPPQPGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAP 235

Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPE-TPRKPPARYPDIARAEPRTTPTTTTPPATRRGE 223
            P +      +P    P R  N+ P   P  PP    +   A+P    T       R G 
Sbjct: 236 SPPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRAGT 295

Query: 224 SNPGPSDKPDTCDTS 238
            + G +   D+   S
Sbjct: 296 PDDGSAISEDSAGGS 310


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 29.7 bits (66), Expect = 4.1
 Identities = 24/110 (21%), Positives = 40/110 (36%), Gaps = 10/110 (9%)

Query: 128 PEHHPTPESPRREPPP-----RHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
           P +     +P ++ PP       P + P   Q   R      +P++  E    PR  +P 
Sbjct: 14  PANRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATP- 72

Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
             P +  +TP    ++    ++  PR  P  T      R    PGP +  
Sbjct: 73  --PARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAG--RNSILPGPGNSL 118


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 29.8 bits (67), Expect = 4.3
 Identities = 26/122 (21%), Positives = 37/122 (30%), Gaps = 19/122 (15%)

Query: 113 RRPEVPRPRHPTHHHPEHHPTPE-----SPRREPPPRHPNQYPETPQDSP----RREPSL 163
           RR + PR    +      +P+PE     S  R   P            SP    RR  S 
Sbjct: 53  RRRQTPRQSRRSKRAAHAYPSPERSPALSSERLLSPSPSVLDLSPVLASPQTGKRRRSSS 112

Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYP--------DIARAEPRTTPTTTT 215
              ++        R  S     +  P  P  P A  P        +++ A     PT + 
Sbjct: 113 PSDDEDEAERPSKRPRSDSISSSSSPAKP--PEACLPSPAASTQDELSEASAAPLPTPSL 170

Query: 216 PP 217
            P
Sbjct: 171 SP 172


>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
           Provisional.
          Length = 599

 Score = 29.8 bits (67), Expect = 4.8
 Identities = 18/88 (20%), Positives = 27/88 (30%), Gaps = 14/88 (15%)

Query: 111 TTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHP---NQYPETPQDSPRRE-----PS 162
             + PE  R   P    P+   T +  +R+ P        Q  E  Q+ P  E     P 
Sbjct: 347 VAQEPERARDDAPNQVVPDKEKTKKPRKRKRPRGRSTVAEQEVEVTQEHPPAESRNALPD 406

Query: 163 LRQ------PNQYPETPHEPRRDSPPRH 184
           +          Q   T HE    +  + 
Sbjct: 407 MNHVDAQGMEYQITGTSHENDTVTALQA 434


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 28.3 bits (63), Expect = 4.9
 Identities = 28/121 (23%), Positives = 35/121 (28%), Gaps = 13/121 (10%)

Query: 65  SPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPT 124
           S R +RKK +R        ER     +R      A    Y    D T   P  P      
Sbjct: 24  SQRRERKKAER--------ER-----ERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRY 70

Query: 125 HHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
           + +  + P P      PPP     Y       P       Q   YP  P +      PR 
Sbjct: 71  YPNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRPRR 130

Query: 185 P 185
            
Sbjct: 131 A 131


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 29.3 bits (66), Expect = 4.9
 Identities = 13/69 (18%), Positives = 22/69 (31%), Gaps = 7/69 (10%)

Query: 128 PEHHPTPESPRREPPPRHPNQYPET---PQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
           P H  +  SP    PP   +  P T   P  S    P          +P   +R+   + 
Sbjct: 310 PRHLSSDSSP---SPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKR-KREGDKKQ 365

Query: 185 PNQFPETPR 193
             +  +  +
Sbjct: 366 KKKKSKKLK 374


>gnl|CDD|173181 PRK14718, PRK14718, ribonuclease III; Provisional.
          Length = 467

 Score = 29.4 bits (65), Expect = 4.9
 Identities = 31/156 (19%), Positives = 45/156 (28%), Gaps = 19/156 (12%)

Query: 70  RKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRP-----EVPRPRHPT 124
           R  + + GE++    R  +          A +P      D   R P      V R  H  
Sbjct: 299 RADDAKAGETKAGEARASADAKAGAHTHAAAMPAVPEQADDAARSPATTPVAVIRAAHVE 358

Query: 125 HHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE-------------PSLRQPNQYPE 171
           H   +  P    P  +P     ++ PE   D P  E                +      E
Sbjct: 359 HGLDKGEPRASKPAEKPAAA-TDKPPEKASDKPSPEKTSEKTPDKSHEKQLDKSSEPVAE 417

Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
              +   D P        ET  +PP      + AEP
Sbjct: 418 KALDKTADKPDAAARLPAETADRPPRARDASSSAEP 453


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 29.3 bits (65), Expect = 5.0
 Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 2/49 (4%)

Query: 150 PETPQDSPRREPSLRQPNQYPETP--HEPRRDSPPRHPNQFPETPRKPP 196
           PET    P      +Q  +    P   +P+  S P  P Q  + P+ PP
Sbjct: 182 PETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAKEPQLPP 230



 Score = 28.6 bits (63), Expect = 8.6
 Identities = 18/75 (24%), Positives = 27/75 (36%), Gaps = 11/75 (14%)

Query: 134 PESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPR 193
           PE+   +P P      P   +  P+ E +        E P  P    PP+   + P+ P 
Sbjct: 182 PETTDPKPSP------PAEDKQKPKVESA-----PVAEKPKAPSSPPPPKQSAKEPQLPP 230

Query: 194 KPPARYPDIARAEPR 208
           K   R   + R   R
Sbjct: 231 KERERRVPMTRLRKR 245



 Score = 28.6 bits (63), Expect = 9.9
 Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 6/75 (8%)

Query: 96  KSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHP--EHHPTPESPRREPPPRHPNQYPETP 153
           KS     + +PS  +       P P       P  E  P  E P+    P  P Q  + P
Sbjct: 167 KSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAKEP 226

Query: 154 QDSP----RREPSLR 164
           Q  P    RR P  R
Sbjct: 227 QLPPKERERRVPMTR 241


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 29.3 bits (66), Expect = 5.2
 Identities = 29/112 (25%), Positives = 37/112 (33%), Gaps = 22/112 (19%)

Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQF-PE 190
           PTP   ++E      N +   P     R P+L       E+  E R         Q+  E
Sbjct: 93  PTPVETQQEEKA--VNPFTPQPGQREERRPTL-------ESEEEWRARLKREQEEQYLRE 143

Query: 191 TPRKPPARYP------------DIARAEPRTTPTTTTPPATRRGESNPGPSD 230
             R+  AR              DI + E RT  T  T   T    S  G SD
Sbjct: 144 RQRQRMARLQANAAAYDSPLAVDIGKVEKRTADTNDTATQTTNATSTAGASD 195


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 29.1 bits (65), Expect = 5.8
 Identities = 13/66 (19%), Positives = 18/66 (27%), Gaps = 1/66 (1%)

Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDK 231
            P      S P      P +P    +  P  A  E ++    +         S   PS  
Sbjct: 85  APPAATPTSAPTPTPSPPASPASGMSAAPASAV-EEKSPSEESATATAPESPSTSVPSSG 143

Query: 232 PDTCDT 237
            D   T
Sbjct: 144 SDAAST 149


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 29.3 bits (65), Expect = 6.1
 Identities = 25/125 (20%), Positives = 36/125 (28%), Gaps = 15/125 (12%)

Query: 77  GESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPES 136
           G + P+  R      R     W+  P +        + P  PRP H            + 
Sbjct: 792 GYAGPWGLRAQHPRYRHSWAYWSQYPGHGH-----PQGPWAPRPPHLPPQWDGSAGHGQD 846

Query: 137 PRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKP- 195
              + P       P   Q S   +    Q      +   P   SP       P  P +P 
Sbjct: 847 QVSQFPHLQSETGPPRLQLSQVPQLPYSQT---LVSSSAPSWSSPQ------PRAPIRPI 897

Query: 196 PARYP 200
           P R+P
Sbjct: 898 PTRFP 902


>gnl|CDD|114645 pfam05934, MCLC, Mid-1-related chloride channel (MCLC).  This
           family consists of several mid-1-related chloride
           channels. mid-1-related chloride channel (MCLC) proteins
           function as a chloride channel when incorporated in the
           planar lipid bilayer.
          Length = 577

 Score = 29.3 bits (65), Expect = 6.2
 Identities = 20/139 (14%), Positives = 36/139 (25%), Gaps = 10/139 (7%)

Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQ 165
           P       R E+ R           + +PE  +                   R  P +  
Sbjct: 435 PYAKTYEGRREILRDNDVDLRFQTGNKSPEILKAFD----------LDDADAREHPPVVP 484

Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESN 225
            ++ P    +P+     +      E  ++       I  +       T  P A  + +  
Sbjct: 485 SHKSPTADADPKDTGGIKGEGTGEELSQEDHQIAKPIKESGNDERGNTEGPEAAEKAQLK 544

Query: 226 PGPSDKPDTCDTSYDAIAV 244
              +  PD   T   A  V
Sbjct: 545 SEAAGSPDQGSTYSPARGV 563


>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
           Members of this family include high molecular weight
           subunits of glutenin. This group of gluten proteins is
           thought to be largely responsible for the elastic
           properties of gluten, and hence, doughs. Indeed,
           glutenin high molecular weight subunits are classified
           as elastomeric proteins, because the glutenin network
           can withstand significant deformations without breaking,
           and return to the original conformation when the stress
           is removed. Elastomeric proteins differ considerably in
           amino acid sequence, but they are all polymers whose
           subunits consist of elastomeric domains, composed of
           repeated motifs, and non-elastic domains that mediate
           cross-linking between the subunits. The elastomeric
           domain motifs are all rich in glycine residues in
           addition to other hydrophobic residues. High molecular
           weight glutenin subunits have an extensive central
           elastomeric domain, flanked by two terminal non-elastic
           domains that form disulphide cross-links. The central
           elastomeric domain is characterized by the following
           three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
           possesses overlapping beta-turns within and between the
           repeated motifs, and assumes a regular helical secondary
           structure with a diameter of approx. 1.9 nm and a pitch
           of approx. 1.5 nm.
          Length = 779

 Score = 29.3 bits (64), Expect = 6.6
 Identities = 14/75 (18%), Positives = 27/75 (36%)

Query: 126 HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHP 185
           H+      P   ++    + P Q  +  Q    ++P+  Q    P    +P +  P  +P
Sbjct: 244 HYLASPQQPGQGQQPGQLQQPAQGQQPEQGQQGQQPAQGQQGHQPAQGQQPGQGQPGHYP 303

Query: 186 NQFPETPRKPPARYP 200
               +  +  P  YP
Sbjct: 304 ASPQQPGQGQPGHYP 318


>gnl|CDD|206085 pfam13914, Phostensin, Phostensin PP1-binding and SH3-binding
           region.  Phostensin has been identified as a PP1
           regulatory protein binding PP1 at the KISF motif. The
           domain also appears to carry an incomplete incomplete
           SH3-binding domain PxRxP further upstream. It is likely
           that Phostensin targets PP1 to the F-actin cytoskeleton.
           Phostensin binds to actin and decreases the elongation
           and depolymerisation rates of actin filament pointed
           ends.
          Length = 132

 Score = 28.1 bits (62), Expect = 6.7
 Identities = 17/83 (20%), Positives = 23/83 (27%), Gaps = 2/83 (2%)

Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
           P+  +P          HP     +     R  N +   P+  P   P   Q N   ET  
Sbjct: 2   PQAAKPPGSPLPSRHLHPAKPGHQSGLQRRGGNTFTVVPKRKPGSLPHGSQAN--GETAP 59

Query: 175 EPRRDSPPRHPNQFPETPRKPPA 197
           +P  D             R P  
Sbjct: 60  QPAEDEEAISLLGPTGKKRYPTV 82


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 28.7 bits (64), Expect = 6.7
 Identities = 20/94 (21%), Positives = 24/94 (25%), Gaps = 3/94 (3%)

Query: 147 NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQF---PETPRKPPARYPDIA 203
               E    S     +    +  P    E    S P   +      E    PPA      
Sbjct: 79  GDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARD 138

Query: 204 RAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDT 237
              P  T    TP   R     P  S +P T  T
Sbjct: 139 GPTPDPTAQPATPDERRSPRQRPPVSGEPPTPST 172


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 27.9 bits (62), Expect = 6.8
 Identities = 13/53 (24%), Positives = 16/53 (30%), Gaps = 3/53 (5%)

Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
           P   P  P      +PP   +  P  P  PP   P +  A     P     P 
Sbjct: 81  PLTPPAPPEPV---TPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPP 130


>gnl|CDD|132858 cd07219, Pat_PNPLA1, Patatin-like phospholipase domain containing
           protein 1.  Members of this family share a patatin
           domain, initially discovered in potato tubers. Some
           members of PNPLA1 subfamily do not have the lipase
           consensus sequence Gly-X-Ser-X-Gly which is essential
           for hydrolase activity.  This family includes PNPLA1
           from Homo sapiens and Gallus gallus. Currently, there is
           no literature available on the physiological role,
           structure, or enzymatic activity of PNPLA1. It is
           expressed in various human tissues in low mRNA levels.
          Length = 382

 Score = 29.1 bits (65), Expect = 6.8
 Identities = 23/107 (21%), Positives = 33/107 (30%), Gaps = 10/107 (9%)

Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
           P    P       S   E   +               +P L  P   PE+  E   +SP 
Sbjct: 286 PQRSQPSLQDGQASL--EESWQPHLARAPKGDGRGLHDPPLSPPLAAPESTAEWVVESPV 343

Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
             P    E+    P    D++   P + P   + P+     S PG S
Sbjct: 344 SSPASPLESSPSLPGSLTDLS---PASLPAVHSLPS-----STPGLS 382


>gnl|CDD|220233 pfam09422, WTX, WTX protein.  The WTX protein is found to be
           inactivated in one third of Wilms tumours. The WTX
           protein is functionally uncharacterized.
          Length = 467

 Score = 29.2 bits (65), Expect = 6.9
 Identities = 21/131 (16%), Positives = 31/131 (23%), Gaps = 15/131 (11%)

Query: 116 EVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE----PSLRQP----- 166
           E  +        P          +E  PR   +    PQD+   +     +L  P     
Sbjct: 103 EEAKEPEGGLILPSSLTASLECVKEETPRAAREENAPPQDADGGKVSPASALEAPGTSCV 162

Query: 167 ---NQYPETPHEPRRDSPPRHPN-QFPETPRKPPARYPDIARAEPRTT-PTTTTPPATRR 221
              +           + P           P  P      +A     T      T P    
Sbjct: 163 ECGDPAKPGSESSAFEDPGPGLGADSLCQPLTPETGEKVVAEDASITGDVPVKTVPPVDS 222

Query: 222 GESNPGPSDKP 232
              +P PSD  
Sbjct: 223 STVDP-PSDPS 232


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.0 bits (64), Expect = 7.7
 Identities = 23/114 (20%), Positives = 31/114 (27%), Gaps = 6/114 (5%)

Query: 104 YSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSL 163
           +  +PD T     V      T  H E H T   P     P        +  D     P+ 
Sbjct: 422 FHKAPDTTKSVIFVY-----TLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTG 476

Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPE-TPRKPPARYPDIARAEPRTTPTTTTP 216
              +  PE      R +        P      P A  P   +       T+ TP
Sbjct: 477 TTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTP 530


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 28.9 bits (64), Expect = 7.9
 Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 11/92 (11%)

Query: 152 TPQDSPRREPSLRQPNQYPETPHEP------RRDSPPRHPNQFPETPRKPPARYPDIARA 205
           T Q +  R+    +  + P T  E        + S P+   Q        PA   + ++ 
Sbjct: 62  TTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQT 121

Query: 206 EPR-TTPTT--TTPPATRRGESNPGPSDKPDT 234
               TTP T  TTPP+T      P  S K DT
Sbjct: 122 TTESTTPKTKVTTPPST--NTPQPMQSTKSDT 151


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 29.0 bits (65), Expect = 7.9
 Identities = 13/84 (15%), Positives = 26/84 (30%)

Query: 93  KEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPET 152
           KEE + + +P  + +     +  E+ +                  R  PP   P+  P  
Sbjct: 161 KEEGTSSKMPATALASAALFKDDEIRQEVDAARSDQASQSRLSRSRGNPPAIPPDAAPRQ 220

Query: 153 PQDSPRREPSLRQPNQYPETPHEP 176
           P  +          ++  E   +P
Sbjct: 221 PMLTRSAGGRFEGEDENLERNLQP 244


>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
           1).  Dystroglycan is one of the dystrophin-associated
           glycoproteins, which is encoded by a 5.5 kb transcript
           in human. The protein product is cleaved into two
           non-covalently associated subunits, [alpha] (N-terminal)
           and [beta] (C-terminal). In skeletal muscle the
           dystroglycan complex works as a transmembrane linkage
           between the extracellular matrix and the cytoskeleton.
           [alpha]-dystroglycan is extracellular and binds to
           merosin ([alpha]-2 laminin) in the basement membrane,
           while [beta]-dystroglycan is a transmembrane protein and
           binds to dystrophin, which is a large rod-like
           cytoskeletal protein, absent in Duchenne muscular
           dystrophy patients. Dystrophin binds to intracellular
           actin cables. In this way, the dystroglycan complex,
           which links the extracellular matrix to the
           intracellular actin cables, is thought to provide
           structural integrity in muscle tissues. The dystroglycan
           complex is also known to serve as an agrin receptor in
           muscle, where it may regulate agrin-induced
           acetylcholine receptor clustering at the neuromuscular
           junction. There is also evidence which suggests the
           function of dystroglycan as a part of the signal
           transduction pathway because it is shown that Grb2, a
           mediator of the Ras-related signal pathway, can interact
           with the cytoplasmic domain of dystroglycan. In general,
           aberrant expression of dystrophin-associated protein
           complex underlies the pathogenesis of Duchenne muscular
           dystrophy, Becker muscular dystrophy and severe
           childhood autosomal recessive muscular dystrophy.
           Interestingly, no genetic disease has been described for
           either [alpha]- or [beta]-dystroglycan. Dystroglycan is
           widely distributed in non-muscle tissues as well as in
           muscle tissues. During epithelial morphogenesis of
           kidney, the dystroglycan complex is shown to act as a
           receptor for the basement membrane. Dystroglycan
           expression in mouse brain and neural retina has also
           been reported. However, the physiological role of
           dystroglycan in non-muscle tissues has remained unclear.
          Length = 290

 Score = 28.4 bits (63), Expect = 8.2
 Identities = 30/125 (24%), Positives = 45/125 (36%), Gaps = 12/125 (9%)

Query: 67  RNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHH 126
           R +RK +  + +   F ++ + I    E       P  S    L   +P +P P +P  +
Sbjct: 171 RKKRKGKLTLEDQATFIKKGVPIIFADELDDSKPPPSSSMPLILKEEKPPLPPPEYPNQN 230

Query: 127 HPEHHPTPE------SPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDS 180
            PE  P  +      +P R+  P  P   P  P  +P      R  N        P R  
Sbjct: 231 VPETTPLNQDLLGEYTPLRDEDPNAPPYQPPPPFTTPMEGKGSRPKNM------TPYRSP 284

Query: 181 PPRHP 185
           PP  P
Sbjct: 285 PPYVP 289


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 28.0 bits (61), Expect = 8.2
 Identities = 13/59 (22%), Positives = 17/59 (28%)

Query: 124 THHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
           T    E     E    E     P    E  ++    E    +P   PE   EP  +  P
Sbjct: 58  TDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEP 116



 Score = 27.6 bits (60), Expect = 9.0
 Identities = 11/59 (18%), Positives = 14/59 (23%)

Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEP 176
             P       PE     E    E     P       ++         +P   PE   EP
Sbjct: 62  DEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEP 120


>gnl|CDD|216263 pfam01044, Vinculin, Vinculin family. 
          Length = 850

 Score = 28.9 bits (65), Expect = 8.3
 Identities = 17/62 (27%), Positives = 20/62 (32%), Gaps = 6/62 (9%)

Query: 142 PPRHP---NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPAR 198
           P R     N+ P         E   R+       P  P    PP  P   PET R+   R
Sbjct: 607 PIRQAVLMNRSPPELPTDSEFE---REDEPADPRPPPPVETEPPPRPPPPPETDREEMRR 663

Query: 199 YP 200
            P
Sbjct: 664 LP 665


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 28.7 bits (64), Expect = 8.3
 Identities = 20/75 (26%), Positives = 28/75 (37%), Gaps = 7/75 (9%)

Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
           R++ +L QP      P  P     P  P      P  PPA  P  +       PT  +PP
Sbjct: 136 RKKEALPQP-----PPPAPVVMMQP--PPPHAMPPASPPAAQPAPSAPASSPPPTPASPP 188

Query: 218 ATRRGESNPGPSDKP 232
             +  +S+  P   P
Sbjct: 189 PAKAPKSSHPPLKSP 203


>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
          Length = 287

 Score = 28.7 bits (63), Expect = 8.3
 Identities = 20/135 (14%), Positives = 29/135 (21%), Gaps = 7/135 (5%)

Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
            R   +   + R+    R        P    +        P RH  Q+            
Sbjct: 31  HRRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASSRSSHN- 89

Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
             R         H  R      H     +  R     Y    ++          P  T R
Sbjct: 90  --RGSTSSSHNHHAHRGPHQSAHRRSKHDAVRDT---YQPCPQSPETDLYKGRLPGETER 144

Query: 222 GESNPGP-SDKPDTC 235
               P    D P+  
Sbjct: 145 HYETPDHIYDVPEDV 159


>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 451

 Score = 28.6 bits (64), Expect = 8.6
 Identities = 9/60 (15%), Positives = 14/60 (23%), Gaps = 9/60 (15%)

Query: 132 PTPESPRREPPP---RHPNQYPETPQDSPRREPSLRQ------PNQYPETPHEPRRDSPP 182
           P     R E  P   +  N         P+    L Q       +   +     +    P
Sbjct: 365 PAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASPQTKKETIHQEVPSP 424


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 29.0 bits (65), Expect = 8.8
 Identities = 22/118 (18%), Positives = 31/118 (26%), Gaps = 7/118 (5%)

Query: 117 VPRPRHPTHHHPEHHPTPESPR-REPPPRHPNQYPET-PQDSPRREPSLR--QPNQYPET 172
            P         P+++   E+     P P        T       R   +R    +     
Sbjct: 635 FPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRA 694

Query: 173 PHEPRRDSPPRHPNQFPETPRKPPARYPD---IARAEPRTTPTTTTPPATRRGESNPG 227
           P  PR   PP      P  P  P A  P     +     T P    PP   + +    
Sbjct: 695 PQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGAEN 752


>gnl|CDD|234080 TIGR02987, met_A_Alw26, type II restriction m6 adenine DNA
           methyltransferase, Alw26I/Eco31I/Esp3I family.  Members
           of this family are the m6-adenine DNA methyltransferase
           protein, or domain of a fusion protein that also carries
           m5 cytosine methyltransferase activity, of type II
           restriction systems of the Alw26I/Eco31I/Esp3I family. A
           methyltransferase of this family is alway accompanied by
           a type II restriction endonuclease from the
           Alw26I/Eco31I/Esp3I family (TIGR02986) and by an
           adenine-specific modification methyltransferase. Members
           of this family are unusual in that regions of similarity
           to homologs outside this family are circularly permuted
           [DNA metabolism, Restriction/modification].
          Length = 524

 Score = 28.6 bits (64), Expect = 9.2
 Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 2/75 (2%)

Query: 41  IEAEFKWFDGGN-STICTNLSTEVLSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWA 99
            + E K F G N +T   +L++   S + +    D+    +      + +   K      
Sbjct: 225 FQEEAKLFSGVNQATSIIHLNSGSKSKKIELIFYDKSLSEKGDNITYVDLNIIKSMNLSI 284

Query: 100 VIPRYSPSPDLTTRR 114
            IP+ S +  LT  +
Sbjct: 285 PIPQGS-NHLLTILK 298


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 16/101 (15%)

Query: 128 PEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
           PE  P PE P+  P      +    P+  P+ +P  +   Q       P+R+  P  P  
Sbjct: 76  PEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQ-------PKREVKPVEP-- 126

Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGP 228
                 +P + + + A A P ++  T          S+ GP
Sbjct: 127 ------RPASPFENTAPARPTSSTATAAASKPVTSVSS-GP 160


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 28.7 bits (64), Expect = 9.4
 Identities = 20/132 (15%), Positives = 27/132 (20%), Gaps = 8/132 (6%)

Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
           P  R          P   +                   + R E     P   P T     
Sbjct: 383 PGARAAAAVGASAVP-AVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAP-PATADRGD 440

Query: 178 RDSPPRHPNQFPETPRKPPARYPDIARAEPRT------TPTTTTPPATRRGESNPGPSDK 231
             +    P       R       D   A+P         P +  PP      +    +  
Sbjct: 441 DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPS 500

Query: 232 PDTCDTSYDAIA 243
             T     DA A
Sbjct: 501 AATPAAVPDARA 512


>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component
           [Chromatin structure and dynamics].
          Length = 1163

 Score = 28.7 bits (64), Expect = 9.6
 Identities = 21/115 (18%), Positives = 34/115 (29%), Gaps = 4/115 (3%)

Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
           P  P H +P       S ++  P  HP+   +   ++  +       N     P     +
Sbjct: 7   PVDPPHDNPGDKVMTGSLKQISPKYHPSMSRDHDVNTSLQNSEPWPDNGSARKPSY-SPE 65

Query: 180 SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGE-SNPGPSDKPD 233
           S   +    P   R    RY +       +       P+T  G  SN      P 
Sbjct: 66  SMSSYGG--PMDERSGTRRYYEDVEQILSSLAGDNPDPSTDHGSASNQNMERSPP 118


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 28.7 bits (64), Expect = 10.0
 Identities = 18/75 (24%), Positives = 26/75 (34%), Gaps = 6/75 (8%)

Query: 120 PRHP----THHHPEHHPTPESPRREPPPRHPNQYPE-TPQDSPRREPSLRQPNQYPETPH 174
           PR+     T    E  P P S + +P  R      E  P        S  +P+       
Sbjct: 7   PRNAILRETSSGEEQSPNPSSHKSKPSSRKLKSSKENAPPPDLNSLTSDLKPDHRS-ASA 65

Query: 175 EPRRDSPPRHPNQFP 189
           + +   PPR P+  P
Sbjct: 66  KLKSPLPPRPPSSNP 80


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.448 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,580,410
Number of extensions: 2809071
Number of successful extensions: 4503
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3537
Number of HSP's successfully gapped: 392
Length of query: 530
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 429
Effective length of database: 6,457,848
Effective search space: 2770416792
Effective search space used: 2770416792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)