RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12968
         (402 letters)



>gnl|CDD|239088 cd02395, SF1_like-KH, Splicing factor 1 (SF1) K homology
           RNA-binding domain (KH). Splicing factor 1 (SF1)
           specifically recognizes the intron branch point sequence
           (BPS) UACUAAC in the pre-mRNA transcripts during
           spliceosome assembly. We show that the KH-QUA2 region of
           SF1 defines an enlarged KH (hnRNP K) fold which is
           necessary and sufficient for BPS binding. KH binds
           single-stranded RNA or DNA. It is found in a wide
           variety of proteins including ribosomal proteins,
           transcription factors and post-transcriptional modifiers
           of mRNA.
          Length = 120

 Score =  162 bits (411), Expect = 4e-49
 Identities = 61/113 (53%), Positives = 83/113 (73%), Gaps = 3/113 (2%)

Query: 74  KVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGDP 133
           KV +PVK++P++NFVG +LGP+GN+L++L++ET  K+SI GKGSM+D  KEEELR    P
Sbjct: 3   KVYIPVKQYPKYNFVGLILGPRGNTLKQLEKETGAKISIRGKGSMKDGKKEEELRG---P 59

Query: 134 KNAHLVEDLHVQVSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREM 186
           K AHL E LHV ++A  PP EA A+   A+ EL K  I   NDE+++EQ RE+
Sbjct: 60  KYAHLNEPLHVLITAETPPEEALAKAVEAIEELLKPAIEGGNDELKREQLREL 112


>gnl|CDD|227503 COG5176, MSL5, Splicing factor (branch point binding protein) [RNA
           processing and modification].
          Length = 269

 Score = 55.0 bits (132), Expect = 2e-08
 Identities = 25/75 (33%), Positives = 45/75 (60%), Gaps = 4/75 (5%)

Query: 74  KVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGDP 133
           K+ +PV+E+P  NFVG L+GP+G++L++L+  +  K++I G GS+    KE ++  +   
Sbjct: 151 KIYIPVQEYPESNFVGLLIGPRGSTLKQLERISRAKIAIRGSGSV----KEGKISSDTPE 206

Query: 134 KNAHLVEDLHVQVSA 148
              +    LH  + A
Sbjct: 207 SLKNAEAVLHCLIEA 221


>gnl|CDD|197652 smart00322, KH, K homology RNA-binding domain. 
          Length = 68

 Score = 47.7 bits (114), Expect = 2e-07
 Identities = 21/72 (29%), Positives = 39/72 (54%), Gaps = 11/72 (15%)

Query: 71  VKVKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKE 130
           V ++VL+P       + VG ++G  G+++++++EET  K+ I G GS     +  E+   
Sbjct: 3   VTIEVLIPA------DKVGLIIGKGGSTIKKIEEETGVKIDIPGPGS---EERVVEIT-- 51

Query: 131 GDPKNAHLVEDL 142
           G P+N     +L
Sbjct: 52  GPPENVEKAAEL 63


>gnl|CDD|238053 cd00105, KH-I, K homology RNA-binding domain, type I.  KH binds
           single-stranded RNA or DNA. It is found in a wide
           variety of proteins including ribosomal proteins,
           transcription factors and post-transcriptional modifiers
           of mRNA. There are two different KH domains that belong
           to different protein folds, but they share a single KH
           motif. The KH motif is folded into a beta alpha alpha
           beta unit. In addition to the core, type II KH domains
           (e.g. ribosomal protein S3) include N-terminal extension
           and type I KH domains (e.g. hnRNP K) contain C-terminal
           extension.
          Length = 64

 Score = 45.6 bits (109), Expect = 8e-07
 Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 9/70 (12%)

Query: 73  VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGD 132
            +VLVP         VG+++G  G++++ ++EET  K+ I   G     S+E  +   G 
Sbjct: 2   ERVLVPSS------LVGRIIGKGGSTIKEIREETGAKIKIPDSG---SGSEERIVTITGT 52

Query: 133 PKNAHLVEDL 142
           P+     ++L
Sbjct: 53  PEAVEKAKEL 62


>gnl|CDD|215657 pfam00013, KH_1, KH domain.  KH motifs bind RNA in vitro.
           Autoantibodies to Nova, a KH domain protein, cause
           paraneoplastic opsoclonus ataxia.
          Length = 59

 Score = 36.0 bits (84), Expect = 0.002
 Identities = 11/48 (22%), Positives = 24/48 (50%), Gaps = 6/48 (12%)

Query: 72  KVKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMR 119
             ++L+P         VG+++G  G++++ ++EET  K+ I       
Sbjct: 1   TERILIPPD------KVGRIIGKGGSNIKEIREETGVKIRIPDDRDDT 42


>gnl|CDD|221895 pfam13014, KH_3, KH domain.  KH motifs bind RNA in vitro. This
           RNA-binding domain is required for the efficient
           anchoring of ASH1-mRNA to the distal tip of the daughter
           cell. ASH1 is a specific repressor of transcription that
           localizes asymmetrically to the daughter cell nucleus.
           RNA localisation is a widespread mechanism for achieving
           localised protein synthesis.
          Length = 42

 Score = 35.6 bits (83), Expect = 0.002
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 87  FVGKLLGPKGNSLRRLQEETMTKMSI 112
            VG ++G  G +++ ++EET  K+ I
Sbjct: 1   LVGAIIGKGGETIKEIREETGAKIQI 26


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 40.2 bits (94), Expect = 0.002
 Identities = 43/178 (24%), Positives = 56/178 (31%), Gaps = 16/178 (8%)

Query: 145 QVSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREMGYVHHEPRPQMRHPRPPP 204
             +  A PA A A  A A +  R+   P+     RQ   R  G            P P P
Sbjct: 403 PAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG----------GAPAPAP 452

Query: 205 HMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAPSA 264
              AA PA    A P      P           P  A+     +  D      E+  P  
Sbjct: 453 APAAA-PAAA--ARPAAAGPRPV---AAAAAAAPARAAPAAAPAPADDDPPPWEELPPEF 506

Query: 265 GVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSR 322
                  P AA A +   +  DPA  DP  +        +A+    AA A  P  + R
Sbjct: 507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564



 Score = 32.2 bits (73), Expect = 0.53
 Identities = 38/180 (21%), Positives = 54/180 (30%), Gaps = 13/180 (7%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
           P P    P       AA PA    AP     A           P P A +  +  S    
Sbjct: 385 PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444

Query: 253 ARAAMEKPAPSAGVYEYEDPHAAHAYYET-TASYDPAMYDPYASTAGDYYDT-------- 303
             A    PAP+A       P AA        A+  PA   P A+ A    D         
Sbjct: 445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPP 504

Query: 304 ---SASYYETAAPAPLPSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVNA 360
              S +  +    AP    +               E  + + + + AP  ++++E VV  
Sbjct: 505 EFASPAPAQP-DAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563



 Score = 29.1 bits (65), Expect = 5.3
 Identities = 29/125 (23%), Positives = 33/125 (26%), Gaps = 5/125 (4%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
           P      P   P   AA PA    AP      A P             A+     S    
Sbjct: 374 PATAAAAPVAQPAPAAAAPAA--AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE 431

Query: 253 ARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAA 312
           A AA  + +            A  A     A+  PA   P    A       A     AA
Sbjct: 432 ALAAARQASARGPGGAPAPAPAPAA--APAAAARPAAAGPRPVAAAA-AAAPARAAPAAA 488

Query: 313 PAPLP 317
           PAP  
Sbjct: 489 PAPAD 493


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 23/73 (31%), Positives = 25/73 (34%), Gaps = 10/73 (13%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
           P      P P      A P P    P      A P   RP   P PPAA+          
Sbjct: 385 PSAAWGAPTPAA---PAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAA-------PP 434

Query: 253 ARAAMEKPAPSAG 265
           AR+A    A SAG
Sbjct: 435 ARSADPAAAASAG 447


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 39.5 bits (92), Expect = 0.003
 Identities = 36/134 (26%), Positives = 45/134 (33%), Gaps = 4/134 (2%)

Query: 192  EPRPQMRHPRPPPH-MVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSIL 250
            +P P    P P PH +V+A P P   A  R    A P    PP  P  PA   G      
Sbjct: 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPAR 2759

Query: 251  DRARAAMEKPAPSAGVYEYEDPHA---AHAYYETTASYDPAMYDPYASTAGDYYDTSASY 307
                A    PAP A             A A    +    P+ +DP    A      +A  
Sbjct: 2760 PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819

Query: 308  YETAAPAPLPSTSS 321
               +   PLP  +S
Sbjct: 2820 PAASPAGPLPPPTS 2833



 Score = 34.1 bits (78), Expect = 0.17
 Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 4/48 (8%)

Query: 193  PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
            P  Q   P  P      +P P    PP+ Q   PP    PP P PP A
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP----PPRPQPPLA 2944



 Score = 33.8 bits (77), Expect = 0.22
 Identities = 21/71 (29%), Positives = 23/71 (32%), Gaps = 3/71 (4%)

Query: 193  PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
              P            AA PA  L  P   Q  APP    PP PPPP     G V    D 
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP---PPPGPPPPSLPLGGSVAPGGDV 2862

Query: 253  ARAAMEKPAPS 263
             R    +   +
Sbjct: 2863 RRRPPSRSPAA 2873



 Score = 32.2 bits (73), Expect = 0.57
 Identities = 34/132 (25%), Positives = 40/132 (30%), Gaps = 12/132 (9%)

Query: 194  RPQMRHPRPPPHMVA--ARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGK--VLSI 249
            RP+ R  RP    +   A P P    PP     AP  +V     PP PAA+R     L  
Sbjct: 2682 RPRRRAARPTVGSLTSLADPPP----PPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737

Query: 250  LDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYE 309
                 A    PA   G      P              PA   P   T        AS  E
Sbjct: 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP----AVASLSE 2793

Query: 310  TAAPAPLPSTSS 321
            +    P P   +
Sbjct: 2794 SRESLPSPWDPA 2805



 Score = 31.4 bits (71), Expect = 0.98
 Identities = 32/135 (23%), Positives = 36/135 (26%), Gaps = 15/135 (11%)

Query: 193  PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
            P P   H   PP      P+P    P        P   RP   P P   SR +    L R
Sbjct: 2616 PLPPDTHAPDPP---PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGR 2672

Query: 253  ARAAMEKP---------APSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDT 303
            A  A   P              +    DP        T      A+        G     
Sbjct: 2673 AAQASSPPQRPRRRAARPTVGSLTSLADPPPPP---PTPEPAPHALVSATPLPPGPAAAR 2729

Query: 304  SASYYETAAPAPLPS 318
             AS    AAPAP   
Sbjct: 2730 QASPALPAAPAPPAV 2744



 Score = 31.1 bits (70), Expect = 1.3
 Identities = 31/149 (20%), Positives = 40/149 (26%), Gaps = 12/149 (8%)

Query: 200  PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
            P PP          G   P R    A P    PP  P      R    ++   + +    
Sbjct: 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798

Query: 260  PAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPST 319
            P+P        DP    A     A+  P    P             S   TA P P    
Sbjct: 2799 PSPW-------DPADPPAAVLAPAAALPPAASPAGPLP-----PPTSAQPTAPPPPPGPP 2846

Query: 320  SSRTGGSGWKSSSGYAEHHSSSRSKSSAP 348
                   G  +  G       SRS ++ P
Sbjct: 2847 PPSLPLGGSVAPGGDVRRRPPSRSPAAKP 2875



 Score = 31.1 bits (70), Expect = 1.5
 Identities = 18/53 (33%), Positives = 20/53 (37%), Gaps = 4/53 (7%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAA----PPHMVRPPHPPPPPAA 241
           P        PPP   AA P      P  +  AA    P  +  PP PPPP  A
Sbjct: 269 PETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPA 321



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 19/88 (21%), Positives = 22/88 (25%)

Query: 200  PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
            P P P    A   P     P      P      P P  P        +S   RAR     
Sbjct: 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA 2673

Query: 260  PAPSAGVYEYEDPHAAHAYYETTASYDP 287
               S+         A       T+  DP
Sbjct: 2674 AQASSPPQRPRRRAARPTVGSLTSLADP 2701



 Score = 29.5 bits (66), Expect = 4.7
 Identities = 25/91 (27%), Positives = 33/91 (36%), Gaps = 10/91 (10%)

Query: 181  EQFREMGYVHHEPRPQMRHP-----RPPPHMVAARPAPGLLAPPRMQHAAP----PHMVR 231
            E+ RE        R +         RP P     R AP +LA  R          P   +
Sbjct: 998  EEARERAEAARAARERFFQKLQGVLRPLPDFGGLRAAPAVLATLRADLPGGWTDLPDAAQ 1057

Query: 232  PPHPPPPPAASRGKVLSILDRARAAMEKPAP 262
               PP   AA R  +  +L + R A+E P P
Sbjct: 1058 AA-PPEVRAALRADLWGLLGQYREALEHPTP 1087



 Score = 28.8 bits (64), Expect = 7.2
 Identities = 11/50 (22%), Positives = 14/50 (28%)

Query: 192  EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
            +P+        P      +P P    PPR Q    P         P  A 
Sbjct: 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV 2959


>gnl|CDD|239089 cd02396, PCBP_like_KH, K homology RNA-binding domain, PCBP_like.
           Members of this group possess KH domains in a tandem
           arrangement. Most members, similar to the poly(C)
           binding proteins (PCBPs) and Nova, containing three KH
           domains, with the first and second domains, which are
           represented here, in tandem arrangement, followed by a
           large spacer region, with the third domain near the
           C-terminal end of the protein. The poly(C) binding
           proteins (PCBPs) can be divided into two groups, hnRNPs
           K/J and the alphaCPs, which share a triple KH domain
           configuration and  poly(C) binding specificity. They
           play roles in mRNA stabilization, translational
           activation, and translational silencing. Nova-1 and
           Nova-2 are nuclear RNA-binding proteins that regulate
           splicing. This group also contains plant proteins that
           seem to have two tandem repeat arrrangements, like Hen4,
           a protein that plays a role in  AGAMOUS (AG) pre-mRNA
           processing and important step in plant development. In
           general, KH binds single-stranded RNA or DNA. It is
           found in a wide variety of proteins including ribosomal
           proteins, transcription factors and post-transcriptional
           modifiers of mRNA.
          Length = 65

 Score = 35.1 bits (82), Expect = 0.005
 Identities = 10/43 (23%), Positives = 24/43 (55%), Gaps = 6/43 (13%)

Query: 73  VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGK 115
           +++LVP  +       G ++G  G++++ ++EET  K+ +   
Sbjct: 2   LRLLVPSSQ------AGSIIGKGGSTIKEIREETGAKIRVSKS 38


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 37.8 bits (88), Expect = 0.009
 Identities = 33/136 (24%), Positives = 41/136 (30%), Gaps = 14/136 (10%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPP---RMQHAAPPHMVRPPHPPPPPAASRGKVLS 248
                    + P    AA PA   +A         AAP      P  PP  A     V +
Sbjct: 370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP-APVAA 428

Query: 249 ILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYY 308
               A AA    AP+A       P  A    ET A       +P  ++A           
Sbjct: 429 PAAAAPAAAPAAAPAAVALAPAPPAQAAP--ETVAIPVRVAPEPAVASAAP--------A 478

Query: 309 ETAAPAPLPSTSSRTG 324
             AAPA    T +  G
Sbjct: 479 PAAAPAAARLTPTEEG 494


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 37.8 bits (88), Expect = 0.010
 Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 1/49 (2%)

Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
              +     P+PP       P P    PP+ Q    P + +  + P PP
Sbjct: 251 LSQQMPPPPPQPPQQQ-QQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPP 298



 Score = 34.7 bits (80), Expect = 0.092
 Identities = 11/64 (17%), Positives = 13/64 (20%), Gaps = 4/64 (6%)

Query: 180 QEQFREMGYVHHEPRPQMRHPR----PPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHP 235
             Q  +       P  Q   P+      P      P          Q    P       P
Sbjct: 172 LPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPP 231

Query: 236 PPPP 239
            PP 
Sbjct: 232 LPPQ 235



 Score = 30.9 bits (70), Expect = 1.3
 Identities = 17/63 (26%), Positives = 22/63 (34%), Gaps = 4/63 (6%)

Query: 179 RQEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
           RQ  F + G      +P   +P+PP            L  P    A PP   + P  PPP
Sbjct: 187 RQAAFPQQGP---PEQPPG-YPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPP 242

Query: 239 PAA 241
              
Sbjct: 243 LQQ 245



 Score = 29.7 bits (67), Expect = 3.1
 Identities = 15/62 (24%), Positives = 18/62 (29%), Gaps = 2/62 (3%)

Query: 180 QEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
           Q        V   P+  +  P   P      P      PP  Q   P    + P PPP P
Sbjct: 205 QPPQGHPEQVQ--PQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQP 262

Query: 240 AA 241
             
Sbjct: 263 PQ 264



 Score = 29.4 bits (66), Expect = 4.0
 Identities = 12/52 (23%), Positives = 14/52 (26%), Gaps = 4/52 (7%)

Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAAS 242
               P +    PPP        P     P    A PP   +P   P  P   
Sbjct: 245 QPQFPGLSQQMPPPPP----QPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQ 292



 Score = 29.0 bits (65), Expect = 5.6
 Identities = 14/70 (20%), Positives = 18/70 (25%), Gaps = 5/70 (7%)

Query: 194 RPQMRHPRPPPHMVAARP-APGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
           R Q      PP  V  +   P   A P+      P    P  P   P   +         
Sbjct: 166 RQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGY-PQPPQGHPEQVQP---QQFLP 221

Query: 253 ARAAMEKPAP 262
           A +      P
Sbjct: 222 APSQAPAQPP 231



 Score = 28.2 bits (63), Expect = 8.7
 Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 5/49 (10%)

Query: 195 PQMRHPRPPPH-----MVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
           P ++ P+ P           +P      PP+ Q   PP     PHP  P
Sbjct: 241 PPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLP 289


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 35.7 bits (83), Expect = 0.040
 Identities = 17/73 (23%), Positives = 18/73 (24%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
             P      P      A  AP   A P    A  P     P P PP  A           
Sbjct: 397 AAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSP 456

Query: 253 ARAAMEKPAPSAG 265
             AA     P+  
Sbjct: 457 PPAAAPSAQPAPA 469



 Score = 31.9 bits (73), Expect = 0.72
 Identities = 21/81 (25%), Positives = 24/81 (29%), Gaps = 1/81 (1%)

Query: 184 REMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASR 243
           R +G       P    P       AA PAP   AP      AP      P P P PA + 
Sbjct: 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA-APQPAPAPAPAPAP 441

Query: 244 GKVLSILDRARAAMEKPAPSA 264
                      A    PA + 
Sbjct: 442 PSPAGNAPAGGAPSPPPAAAP 462



 Score = 31.5 bits (72), Expect = 1.0
 Identities = 23/132 (17%), Positives = 32/132 (24%), Gaps = 2/132 (1%)

Query: 207 VAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAME--KPAPSA 264
               PA         + AAP     P  P P  AA+     S       A     P   A
Sbjct: 599 GPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA 658

Query: 265 GVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTG 324
                +      A     A   P      A+ A       A      A  P    +    
Sbjct: 659 VPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA 718

Query: 325 GSGWKSSSGYAE 336
               +++ G + 
Sbjct: 719 AQPPQAAQGASA 730



 Score = 30.3 bits (69), Expect = 1.9
 Identities = 17/75 (22%), Positives = 20/75 (26%), Gaps = 6/75 (8%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
              P      P     +  PA    A P    AA P     P P PP A +         
Sbjct: 439 PAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA------A 492

Query: 252 RARAAMEKPAPSAGV 266
              A     AP+   
Sbjct: 493 APAAPAAPAAPAGAD 507



 Score = 30.3 bits (69), Expect = 2.1
 Identities = 18/77 (23%), Positives = 21/77 (27%), Gaps = 6/77 (7%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
                    P P   A   A         Q A  P    P   PP PA +     +    
Sbjct: 401 AAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA---PAPAPPSPAGNAPAGGAPSPP 457

Query: 253 ARAAMEK---PAPSAGV 266
             AA      PAP+A  
Sbjct: 458 PAAAPSAQPAPAPAAAP 474



 Score = 30.0 bits (68), Expect = 3.1
 Identities = 18/116 (15%), Positives = 28/116 (24%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
             P     AA  AP     P    A        P P P      G+      +   A + 
Sbjct: 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQG 727

Query: 260 PAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAP 315
            +  +   +   P           +  PA   P  + A      +A      +   
Sbjct: 728 ASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 29/156 (18%), Positives = 35/156 (22%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
             P P   AA PA    AP     A   H      P            +      A    
Sbjct: 625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684

Query: 260 PAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPST 319
           PAP+A         A  A             DP A         SA       P PLP  
Sbjct: 685 PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPE 744

Query: 320 SSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSE 355
                      +        +  +  +A    S   
Sbjct: 745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780



 Score = 29.6 bits (67), Expect = 4.1
 Identities = 27/133 (20%), Positives = 36/133 (27%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSI 249
                        P H+     + G    P     A P    P   P  PAA  G   + 
Sbjct: 641 AAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700

Query: 250 LDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYE 309
              A AA      +          A  A   + A+ DP    P      D     A    
Sbjct: 701 PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPP 760

Query: 310 TAAPAPLPSTSSR 322
             APAP  + ++ 
Sbjct: 761 PPAPAPAAAPAAA 773



 Score = 29.2 bits (66), Expect = 4.5
 Identities = 25/125 (20%), Positives = 35/125 (28%), Gaps = 18/125 (14%)

Query: 198 RHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAM 257
           R  R    +  A  A    A      AA P     P    P AA+          A AA 
Sbjct: 377 RLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAA--------PAPAAA 428

Query: 258 EKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLP 317
            +PAP         P  A A    + + +       +           +    AAP P  
Sbjct: 429 PQPAP--------APAPAPA--PPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTA 478

Query: 318 STSSR 322
           + +  
Sbjct: 479 APAPA 483



 Score = 28.8 bits (65), Expect = 6.4
 Identities = 21/90 (23%), Positives = 22/90 (24%), Gaps = 11/90 (12%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRP---PHPPPPPAASRGKVLS 248
            P P       P    A  P       P     +PP    P   P P P  A        
Sbjct: 422 APAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPT---- 477

Query: 249 ILDRARAAMEKPAPSAGVYEYEDPHAAHAY 278
               A  A   PA  A       P A  A 
Sbjct: 478 ----AAPAPAPPAAPAPAAAPAAPAAPAAP 503



 Score = 28.4 bits (64), Expect = 8.3
 Identities = 28/135 (20%), Positives = 36/135 (26%), Gaps = 5/135 (3%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
            R + R          A  AP   A       AP          P PAA+          
Sbjct: 379 ERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA-----P 433

Query: 253 ARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAA 312
           A A    P   AG        +       +A   PA       TA       A+    AA
Sbjct: 434 APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA 493

Query: 313 PAPLPSTSSRTGGSG 327
           PA   + ++  G   
Sbjct: 494 PAAPAAPAAPAGADD 508



 Score = 28.4 bits (64), Expect = 9.5
 Identities = 15/74 (20%), Positives = 18/74 (24%), Gaps = 7/74 (9%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
            P          P   AA        P      AP       + P   A S         
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSP-------P 457

Query: 252 RARAAMEKPAPSAG 265
            A A   +PAP+  
Sbjct: 458 PAAAPSAQPAPAPA 471


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 35.6 bits (82), Expect = 0.045
 Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 7/75 (9%)

Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAP----PHMVRPPHPPPPPAASRGKVLSI 249
           RP    P   P   AA   P     P  + A P    P  V PP P  P +A +    +I
Sbjct: 380 RPT-PAPSTRPKAAAAANIPP--KEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAI 436

Query: 250 LDRARAAMEKPAPSA 264
               +     PAP  
Sbjct: 437 PVDEKPKYTPPAPPK 451



 Score = 34.0 bits (78), Expect = 0.13
 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGL-LAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
           EP  +   P P P    A P P    + P++  AA P   +P + PP P     K L
Sbjct: 401 EPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKAL 457



 Score = 30.2 bits (68), Expect = 2.5
 Identities = 13/73 (17%), Positives = 18/73 (24%), Gaps = 6/73 (8%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
                      PP       A     PPR    APP     PH P          + + +
Sbjct: 387 TRPKAAAAANIPPKEPVRETATPPPVPPRP--VAPP----VPHTPESAPKLTRAAIPVDE 440

Query: 252 RARAAMEKPAPSA 264
           + +     P    
Sbjct: 441 KPKYTPPAPPKEE 453



 Score = 28.2 bits (63), Expect = 8.1
 Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 2/73 (2%)

Query: 193 PRPQMRHPRPP-PHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
           P PQ   P    P  V   PAP            PP         PPP   R  V   + 
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR-PVAPPVP 422

Query: 252 RARAAMEKPAPSA 264
               +  K   +A
Sbjct: 423 HTPESAPKLTRAA 435


>gnl|CDD|145536 pfam02444, HEV_ORF1, Hepatitis E virus ORF-2 (Putative capsid
           protein).  The Hepatitis E virus (HEV) genome is a
           single-stranded, positive-sense RNA molecule of
           approximately 7.5 kb. Three open reading frames (ORF)
           were identified within the HEV genome: ORF1 encodes
           non-structural proteins, ORF2 encodes the putative
           structural protein(s), and ORF3 encodes a protein of
           unknown function. ORF2 contains a consensus signal
           peptide sequence at its amino terminus and a capsid-like
           region with a high content of basic amino acids similar
           to that seen with other virus capsid proteins.
          Length = 114

 Score = 33.5 bits (76), Expect = 0.052
 Identities = 18/49 (36%), Positives = 19/49 (38%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
             P    P P P M   RP   L    R  H AP  + RP  PP PP  
Sbjct: 57  QSPIFIQPTPSPPMSPLRPGLDLAFANRPAHLAPLGVTRPSAPPLPPVV 105


>gnl|CDD|226193 COG3667, PcoB, Uncharacterized protein involved in copper
           resistance [Inorganic ion transport and metabolism].
          Length = 321

 Score = 34.9 bits (80), Expect = 0.069
 Identities = 24/74 (32%), Positives = 27/74 (36%), Gaps = 3/74 (4%)

Query: 190 HHEP--RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
            H P   P    P    H  +  P P + AP  M H A PHM   P P P   A+     
Sbjct: 36  EHAPMDAPHPAMPGMDHHAHSKMPGPEMAAPQ-MDHGAMPHMDHAPPPIPTQHAAERSRS 94

Query: 248 SILDRARAAMEKPA 261
                AR A   PA
Sbjct: 95  PASAAARVAAFPPA 108


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 34.8 bits (80), Expect = 0.077
 Identities = 31/142 (21%), Positives = 42/142 (29%), Gaps = 19/142 (13%)

Query: 128 RKEGDPKNAHLVEDLHVQVSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREMG 187
           R    P ++   +   V VS   P  E   R+ YA         P + + + Q      G
Sbjct: 850 RISTSPSSSTFPQLSRVLVSVREPRTEGGVRVGYA---------PTLAEALDQVFGPGTG 900

Query: 188 YVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
            V   P          P   A  PAP        Q   PP   +PP  PP         +
Sbjct: 901 RVATAPGGD---AASAPPPGAGPPAP-------PQAVPPPRTTQPPAAPPRGPDVPPAAV 950

Query: 248 SILDRARAAMEKPAPSAGVYEY 269
           + L    A +     S     Y
Sbjct: 951 AELRETLADLRSAQRSGDFTAY 972


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 34.8 bits (80), Expect = 0.088
 Identities = 40/221 (18%), Positives = 54/221 (24%), Gaps = 31/221 (14%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPR---MQHAAPPHMVRPPHPPPPPAASRGKV 246
              P P       PP       +P     P    M          P   PP   AS   V
Sbjct: 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAV 162

Query: 247 LSILDRARAAME-----------KPAPSAGVYEYEDPHAA----HAYYETTASY----DP 287
            S    +R A               +P A       P AA           ++      P
Sbjct: 163 ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAP 222

Query: 288 AMYDPYASTAGDYYDTSASYY--------ETAAPAPLPSTSSRTGGSGWKSSSGYAEHHS 339
           A     A  AG     S+S          E   P P P+         W++S        
Sbjct: 223 APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAP-ITLPTRIWEASGWNGPSSR 281

Query: 340 SSRSKSSAPIRSSSSEKVVNASLVGLQGAQSTEYGYEDRKR 380
              + SS+  R  S     ++   G   +           R
Sbjct: 282 PGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR 322



 Score = 32.8 bits (75), Expect = 0.40
 Identities = 34/169 (20%), Positives = 44/169 (26%), Gaps = 29/169 (17%)

Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSIL 250
            E         P   +    PA     P R     PP        PPPP           
Sbjct: 77  TEAPANESRSTPTWSLSTLAPAS----PAREGSPTPPG--PSSPDPPPPTPPPASPPPSP 130

Query: 251 DRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYET 310
               + M +P  S G      P AA A     AS                   +AS  + 
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVAS------------------DAASSRQA 172

Query: 311 AAPAPLPSTSSRTGGS-----GWKSSSGYAEHHSSSRSKSSAPIRSSSS 354
           A P   P  ++R   S        +    A      RS   +   SS +
Sbjct: 173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA 221



 Score = 31.7 bits (72), Expect = 0.78
 Identities = 36/159 (22%), Positives = 49/159 (30%), Gaps = 9/159 (5%)

Query: 203 PPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAP 262
           P     A  +P    PP    AA       P     P ++     +      AA +  A 
Sbjct: 179 PEETARAPSSPPAEPPPSTPPAAASP---RPPRRSSPISASASSPAPAPGRSAADDAGAS 235

Query: 263 SAGVYEYEDPHAAHAYYETTASYDPAM------YDPYASTAGDYYDTSASYYETAAPAPL 316
           S+     E                PA           +   G       +   ++     
Sbjct: 236 SSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERS 295

Query: 317 PSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSE 355
           PS S  + GSG   SS  A   SSS  +SS+   SSSSE
Sbjct: 296 PSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE 334


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 34.1 bits (79), Expect = 0.15
 Identities = 18/85 (21%), Positives = 27/85 (31%), Gaps = 8/85 (9%)

Query: 158 RIAYALSELRKYIIPDVNDEIRQEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLA 217
           R+AYA ++L     P  ++ +R+   +E                  P   AA  APG  A
Sbjct: 370 RLAYA-ADL-----PTPDEALRR--LQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAA 421

Query: 218 PPRMQHAAPPHMVRPPHPPPPPAAS 242
                      +   P      AA 
Sbjct: 422 AAPAAGGPAAALAAVPDAAAAAAAP 446


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 33.3 bits (76), Expect = 0.16
 Identities = 24/76 (31%), Positives = 28/76 (36%), Gaps = 4/76 (5%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSI 249
           + +P P    P PPP   AA PAP    PP    A P      P+  PPP  +       
Sbjct: 38  NADPAP----PPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPP 93

Query: 250 LDRARAAMEKPAPSAG 265
                 A   PAP  G
Sbjct: 94  PPVDPNAPPPPAPEPG 109



 Score = 29.9 bits (67), Expect = 2.6
 Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKV 246
           P  PP      P      PP   +A PP  V  P+ PPPPA   G++
Sbjct: 66  PAAPPAPQPDDPNAAPPPPPADPNAPPPPPV-DPNAPPPPAPEPGRI 111


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 33.5 bits (77), Expect = 0.18
 Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 5/104 (4%)

Query: 172 PDVNDEIRQEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVR 231
              ++  R     +    +   +P+    +PP   V  + AP    P       P +  +
Sbjct: 78  AQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQ 137

Query: 232 PPHPPPPPAASRGKVLSILDRARAAMEKPAPSAGVYEYEDPHAA 275
           P  P   P +       +    +     P P+   ++  +P AA
Sbjct: 138 PEQPLQQPVSP-----QVAPAPQPVHSAPQPAQQAFQPAEPVAA 176



 Score = 28.5 bits (64), Expect = 6.1
 Identities = 15/53 (28%), Positives = 18/53 (33%), Gaps = 4/53 (7%)

Query: 195 PQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRP-PHPPPPPAASRGKV 246
           P      P P  V + P P   A    Q A P    +P P   P P   + K 
Sbjct: 145 PVSPQVAPAPQPVHSAPQP---AQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194



 Score = 28.1 bits (63), Expect = 8.2
 Identities = 14/44 (31%), Positives = 18/44 (40%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPP 233
              P PQ  H  P P   A +PA  + AP     A P  ++  P
Sbjct: 149 QVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKP 192


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 33.6 bits (77), Expect = 0.18
 Identities = 20/78 (25%), Positives = 22/78 (28%), Gaps = 5/78 (6%)

Query: 192 EPRPQMRHP-RPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSIL 250
             R     P RP  H  A R        PR           PP P  P AA      S L
Sbjct: 674 PSRVARGDPVRPTAHHAALRAPQ----APRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSL 729

Query: 251 DRARAAMEKPAPSAGVYE 268
             +  A  +P P      
Sbjct: 730 IASPTAPPEPEPPGAEQA 747



 Score = 31.3 bits (71), Expect = 1.2
 Identities = 19/90 (21%), Positives = 19/90 (21%), Gaps = 9/90 (10%)

Query: 193 PRPQMRHPRPPPHMVAA-RPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
             P       P H     R A G    P   HAA     R P  P P     G       
Sbjct: 658 GVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAAL----RAPQAPRPGGPPGGGGGLPPP 713

Query: 252 RARAAMEKPAPSAGVYEYED----PHAAHA 277
               A   PAP                   
Sbjct: 714 PDLPAAAGPAPCGSSLIASPTAPPEPEPPG 743


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 32.9 bits (75), Expect = 0.32
 Identities = 31/151 (20%), Positives = 35/151 (23%), Gaps = 20/151 (13%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAP---------------PRMQHAAPPHMVRP-PHPP 236
           P    R   P       RP  G   P               P  + A P +  RP P   
Sbjct: 99  PPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAW 158

Query: 237 PPPAASRGKVLSILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYAST 296
           P  A   G     L     A   P  S   Y  E       Y      YD      Y   
Sbjct: 159 PRAADDYGWQQQRLGFPPRA---PYASPASYAPEQERDREPYDAGRPEYDQR-RRDYDHP 214

Query: 297 AGDYYDTSASYYETAAPAPLPSTSSRTGGSG 327
             D+        +   P P      R G   
Sbjct: 215 RPDWDRPRRDRTDRPEPPPGAGHVHRGGPGP 245


>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin.  NC10
           stands for Non-helical region 10 and is taken from human
           COL15A1. A mutation in this region in human COL18A1 is
           associated with an increased risk of prostrate cancer.
           This domain is cleaved from the precursor and forms
           endostatin. Endostatin is a key tumour suppressor and
           has been used highly successfully to treat cancer. It is
           a potent angiogenesis inhibitor. Endostatin also binds a
           zinc ion near the N-terminus; this is likely to be of
           structural rather than functional importance according
           to.
          Length = 291

 Score = 32.0 bits (73), Expect = 0.48
 Identities = 17/73 (23%), Positives = 24/73 (32%), Gaps = 2/73 (2%)

Query: 183 FREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPP--HMVRPPHPPPPPA 240
           +R++      P P     +PPP ++     P L  PP +           R PHPP    
Sbjct: 44  WRKVLLGELVPLPGTTATQPPPVVLTPWSDPRLPDPPHLPDPQTHSATAHRNPHPPLNSP 103

Query: 241 ASRGKVLSILDRA 253
           A    V       
Sbjct: 104 ARIAHVHQDFQPV 116


>gnl|CDD|172502 PRK14001, PRK14001, potassium-transporting ATPase subunit C;
           Provisional.
          Length = 189

 Score = 31.6 bits (71), Expect = 0.49
 Identities = 35/140 (25%), Positives = 58/140 (41%), Gaps = 29/140 (20%)

Query: 274 AAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTGGSGWKSSSG 333
           +AH   + TA+     + P  S+AGD YD +AS            + S  G +  K  + 
Sbjct: 53  SAHIGQQFTAA---KYFHPRPSSAGDGYDAAAS------------SGSNLGPTNEKLLAA 97

Query: 334 YAEHHSSSRSKSSAP---------IRSSSSE-----KVVNASLVGLQGAQSTEYGYEDRK 379
            AE  ++ R +++ P         +  S S       VVNA L   + AQ+        +
Sbjct: 98  VAERVTAYRKENNLPADTLVPVDAVTGSGSGLDPAISVVNAKLQAPRVAQARNISIRQVE 157

Query: 380 RRLDPYSDVESKKFVGSRVV 399
           R ++ ++D     F+G R V
Sbjct: 158 RLIEDHTDARGLGFLGERAV 177


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 32.1 bits (73), Expect = 0.53
 Identities = 27/121 (22%), Positives = 39/121 (32%), Gaps = 15/121 (12%)

Query: 202 PPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPA 261
                  A  AP   A      AA      PP  P PPA +        DR   A +  A
Sbjct: 401 AVTGAAGAALAPKAAAA-----AAATRAEAPPAAPAPPATA--------DRGDDAADGDA 447

Query: 262 PSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSS 321
           P     +     +A +  +   +  PA     ++ A D    +A      A AP  +T +
Sbjct: 448 PVPA--KANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPA 505

Query: 322 R 322
            
Sbjct: 506 A 506



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 29/148 (19%), Positives = 42/148 (28%), Gaps = 6/148 (4%)

Query: 206 MVAARPAPGL-LAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAPSA 264
           M+A  PA     AP     A     V  P      A     V ++     AA    AP A
Sbjct: 355 MLAFEPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKA 414

Query: 265 GV----YEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTS 320
                    E P AA A   T    D A     A         +++          P   
Sbjct: 415 AAAAAATRAEAPPAAPAPPATADRGDDAADGD-APVPAKANARASADSRCDERDAQPPAD 473

Query: 321 SRTGGSGWKSSSGYAEHHSSSRSKSSAP 348
           S +  +    +   A    + R+ + + 
Sbjct: 474 SGSASAPASDAPPDAAFEPAPRAAAPSA 501



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 19/124 (15%), Positives = 27/124 (21%), Gaps = 4/124 (3%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
            R    + A               A             P AA+     +    A  A   
Sbjct: 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--AAAATRAEAPPAAPA 431

Query: 260 PAPSAGVYE--YEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLP 317
           P  +A   +   +      A     AS D    +  A    D    SA   +    A   
Sbjct: 432 PPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFE 491

Query: 318 STSS 321
               
Sbjct: 492 PAPR 495



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 25/114 (21%), Positives = 35/114 (30%), Gaps = 3/114 (2%)

Query: 202 PPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPA 261
            P  +  A PAPG  A   +  +A P +           A +    +   RA A    PA
Sbjct: 372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPA 431

Query: 262 PSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAP 315
           P A     +D     A     A+   A  D             +     +APA 
Sbjct: 432 PPATADRGDDAADGDAPVPAKAN-ARASADSRCDERDAQPPADSG--SASAPAS 482


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 31.9 bits (73), Expect = 0.67
 Identities = 24/115 (20%), Positives = 31/115 (26%), Gaps = 6/115 (5%)

Query: 146 VSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREMGYVHHEPRPQMRHPR-PPP 204
           ++ALAPPA A        +                    E      E   ++   R    
Sbjct: 48  LAALAPPAAAAPA-----AAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLV 102

Query: 205 HMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
              AA  AP   AP      A              AA R +V +    A A   K
Sbjct: 103 ARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVK 157



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 17/105 (16%), Positives = 27/105 (25%), Gaps = 1/105 (0%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGL-LAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
           P      P   P  V+   AP    AP  ++HA      R        A +  +  +   
Sbjct: 59  PAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAA 118

Query: 252 RARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYAST 296
            A  A  + A                  +  A+   A+       
Sbjct: 119 PAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVKARIERI 163



 Score = 28.8 bits (65), Expect = 6.1
 Identities = 14/69 (20%), Positives = 20/69 (28%), Gaps = 2/69 (2%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHM--VRPPHPPPPPAASRGKVL 247
             +   +   P  P     A PA    A    +  A       RP  P    AA    V 
Sbjct: 98  REQLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVK 157

Query: 248 SILDRARAA 256
           + ++R    
Sbjct: 158 ARIERIVND 166


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 31.4 bits (71), Expect = 0.69
 Identities = 23/85 (27%), Positives = 27/85 (31%), Gaps = 4/85 (4%)

Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRA 253
           R + R P PP      RP P   +P       PP    P   PPP  A+         R 
Sbjct: 3   RARSRSPSPPRRPSPPRPTP-PRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRG 61

Query: 254 RAA---MEKPAPSAGVYEYEDPHAA 275
             A       AP       +D  AA
Sbjct: 62  CPAGVTFSSSAPPRPPLGLDDAPAA 86


>gnl|CDD|239087 cd02394, vigilin_like_KH, K homology RNA-binding
           domain_vigilin_like.  The vigilin family is a large and
           extended family of multiple KH-domain proteins,
           including vigilin, also called high density lipoprotein
           binding protien (HBP), fungal Scp160 and bicaudal-C.
           Yeast Scp160p has been shown to bind RNA and to
           associate with both soluble and membrane-bound
           polyribosomes as a mRNP component. Bicaudal-C is a
           RNA-binding molecule believed to function in embryonic
           development at the post-transcriptional level. In
           general, KH binds single-stranded RNA or DNA. It is
           found in a wide variety of proteins including ribosomal
           proteins, transcription factors and post-transcriptional
           modifiers of mRNA.
          Length = 62

 Score = 28.6 bits (65), Expect = 0.71
 Identities = 16/69 (23%), Positives = 31/69 (44%), Gaps = 11/69 (15%)

Query: 73  VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGD 132
            +V +P K H        ++G KG+++R++ EET  K+           SK + +   G 
Sbjct: 2   EEVEIPKKLHRF------IIGKKGSNIRKIMEETGVKIRF-----PDPGSKSDTITITGP 50

Query: 133 PKNAHLVED 141
            +N    ++
Sbjct: 51  KENVEKAKE 59


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.8 bits (73), Expect = 0.79
 Identities = 27/140 (19%), Positives = 39/140 (27%), Gaps = 24/140 (17%)

Query: 194 RPQMRHPRPPPHMVAARPAPGLLAP--PRMQHAAPPHMVRPPHPPPPPAAS--------- 242
            P    P P     +A PA    A   P    A P     PP P   P  +         
Sbjct: 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETT 419

Query: 243 ------RGKVLSILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYD-PYAS 295
                 R ++       +A   +PA ++           ++  E  AS  PA      A 
Sbjct: 420 SQLLAARQQLQRAQGATKAKKSEPAAAS------RARPVNSALERLASVRPAPSALEKAP 473

Query: 296 TAGDYYDTSASYYETAAPAP 315
              + Y   A+        P
Sbjct: 474 AKKEAYRWKATNPVEVKKEP 493



 Score = 28.7 bits (65), Expect = 6.6
 Identities = 20/89 (22%), Positives = 30/89 (33%), Gaps = 7/89 (7%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
           P P    P+  P +        LLA  R Q        +     P  A+    V S L+R
Sbjct: 400 PPPPASAPQQAPAVPLPETTSQLLAA-RQQLQRAQGATKAKKSEPAAASRARPVNSALER 458

Query: 253 ------ARAAMEKPAPSAGVYEYEDPHAA 275
                 A +A+EK       Y ++  +  
Sbjct: 459 LASVRPAPSALEKAPAKKEAYRWKATNPV 487


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 31.4 bits (71), Expect = 0.97
 Identities = 14/54 (25%), Positives = 16/54 (29%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKV 246
           P P      PPP   AA              A  P   +P     P  AS  K+
Sbjct: 22  PSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKL 75


>gnl|CDD|218233 pfam04732, Filament_head, Intermediate filament head (DNA binding)
           region.  This family represents the N-terminal head
           region of intermediate filaments. Intermediate filament
           heads bind DNA. Vimentin heads are able to alter nuclear
           architecture and chromatin distribution, and the
           liberation of heads by HIV-1 protease liberates may play
           an important role in HIV-1 associated cytopathogenesis
           and carcinogenesis. Phosphorylation of the head region
           can affect filament stability. The head has been shown
           to interaction with the rod domain of the same protein.
          Length = 89

 Score = 28.6 bits (64), Expect = 1.5
 Identities = 18/54 (33%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 303 TSASYYETAAPAPLPSTSSRTGGSG--WKSSSGYAEHHSSSRSKSSAPIRSSSS 354
           +++SY  T       S+S  +GGS     SSSG+     S  S SS      SS
Sbjct: 2   STSSYRRTFGDPTRSSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSLKSS 55



 Score = 27.1 bits (60), Expect = 5.6
 Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 3/57 (5%)

Query: 304 SASYYETAAPAPLPSTSSRTGGSGW---KSSSGYAEHHSSSRSKSSAPIRSSSSEKV 357
           S+S   +   +   S+SS           SSS      SS R +SS+   S SS+ +
Sbjct: 16  SSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSLKSSYRKRSSSAPPSLSSDSL 72



 Score = 26.7 bits (59), Expect = 6.9
 Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 2/49 (4%)

Query: 318 STSS--RTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVNASLVG 364
           STSS  RT G   +SSS  +   SS RS SS+  RS S  +  ++S   
Sbjct: 2   STSSYRRTFGDPTRSSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSP 50


>gnl|CDD|236234 PRK08312, PRK08312, putative indolepyruvate oxidoreductase subunit
           B; Reviewed.
          Length = 510

 Score = 30.7 bits (70), Expect = 1.6
 Identities = 15/63 (23%), Positives = 22/63 (34%), Gaps = 6/63 (9%)

Query: 224 AAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAPS---AGVY---EYEDPHAAHA 277
           AAPP         P P        ++L R  A     A +    G+    +Y+D   A  
Sbjct: 217 AAPPPPAAAAPTAPRPRPPALAARALLARVEAEFPAAARAMLLEGLRRLVDYQDAAYAAE 276

Query: 278 YYE 280
           Y +
Sbjct: 277 YLD 279


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
          Length = 767

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 14/53 (26%), Positives = 19/53 (35%), Gaps = 1/53 (1%)

Query: 207 VAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
             A PAP  L    +  A     +  P   P P A  G +   L   R  +E+
Sbjct: 57  AYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAI-GDLRGELQSMRGMLER 108


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 9/48 (18%), Positives = 10/48 (20%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
           P P        P    +   P    P           V PP   P   
Sbjct: 400 PSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAV 447


>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP.  This family of
           genes is always found in type III secretion operons,
           althought its function in the processes of secretion and
           virulence is unclear. Hpa stands for Hrp-associated
           gene, where Hrp stands for hypersensitivity response and
           virulence.
          Length = 201

 Score = 29.9 bits (67), Expect = 1.7
 Identities = 15/74 (20%), Positives = 18/74 (24%), Gaps = 7/74 (9%)

Query: 176 DEIRQEQFREMGYVHHEPRPQMR-------HPRPPPHMVAARPAPGLLAPPRMQHAAPPH 228
                 +         + R +          P PPP     RP P             P 
Sbjct: 11  APADPARPARRRTPLAQLRRRDALAYAPPPRPEPPPPCDEDRPEPRADTRASDPPPEAPT 70

Query: 229 MVRPPHPPPPPAAS 242
              P  PP  P AS
Sbjct: 71  DADPAQPPDDPDAS 84


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 30.2 bits (68), Expect = 1.8
 Identities = 16/46 (34%), Positives = 17/46 (36%), Gaps = 3/46 (6%)

Query: 197 MRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAAS 242
           M  P PP  M  A P     AP     + PP    P  PPP  A  
Sbjct: 152 MMQPPPPHAMPPASPPAAQPAPSAPASSPPP---TPASPPPAKAPK 194


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.4 bits (68), Expect = 1.8
 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 1/87 (1%)

Query: 274 AAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTGGSGWKSSSG 333
           +A A  E   S D   +    + AG       +  E  A +     S  +  SG KS+S 
Sbjct: 106 SAAAILEEDDSKDDMEFKANPNEAGKPGKPKGNQGEGLASSS-DGKSKASAKSGSKSASK 164

Query: 334 YAEHHSSSRSKSSAPIRSSSSEKVVNA 360
           + E +SS  S + +   S+S   +V A
Sbjct: 165 HGESNSSDESATDSGKASASVAGIVGA 191


>gnl|CDD|236128 PRK07908, PRK07908, hypothetical protein; Provisional.
          Length = 349

 Score = 30.4 bits (69), Expect = 1.9
 Identities = 20/62 (32%), Positives = 23/62 (37%), Gaps = 2/62 (3%)

Query: 118 MRDRSKEEELRKEGDPKNAHLVEDLHVQVSALAPPAEAHARIAYALSELRKYIIPDVNDE 177
             D S    LR  GD      + D  V V    PP     R+A  L +L  Y  P   DE
Sbjct: 3   SSDTSPLAALRHHGDQDAGPGLLDFAVNVRHDTPPEWLRERLAARLGDLAAY--PSTEDE 60

Query: 178 IR 179
            R
Sbjct: 61  RR 62


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 30.3 bits (68), Expect = 1.9
 Identities = 23/91 (25%), Positives = 26/91 (28%), Gaps = 7/91 (7%)

Query: 192 EPRPQMRHPRPPPHMV----AARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
           +P   +  PR  P  V      RP P   A P       P     P     PA S     
Sbjct: 187 DPALPLSAPRLGPADVFVPATPRPTPRTTASPET--TPTPSTTTSPPSTTIPAPSTTIAA 244

Query: 248 SILDRARAAMEKPAPSA-GVYEYEDPHAAHA 277
                   A   PAP   G  E    +A  A
Sbjct: 245 PQAGTTPEAEGTPAPPTPGGGEAPPANATPA 275


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 29.9 bits (67), Expect = 2.1
 Identities = 31/150 (20%), Positives = 38/150 (25%), Gaps = 26/150 (17%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPA--------PGLLAPPRMQHAAPPHMVRPP----HPPP 237
           H  P          PH  AA P             A     HAA      P      P P
Sbjct: 20  HPVPARSSSADAAGPHDDAAEPVLTDQIVPGAEQAASAAPVHAAREATADPEFVAVEPVP 79

Query: 238 PP-------------AASRGKVLSILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTAS 284
            P              A  G    ++    AAM+ P PSA   +      A A       
Sbjct: 80  TPHVPAVALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADAGD 139

Query: 285 YDPAMYDPYASTAGDYYDTSASYYETAAPA 314
             P    P A+        + +    AA  
Sbjct: 140 AAPD-ATPPAAGDASPPAAAQAAASAAAAL 168


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 14/49 (28%), Positives = 16/49 (32%), Gaps = 2/49 (4%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
                   P  PP    A   P   AP R Q AAP  + +    P   A
Sbjct: 131 PATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAP--VTQAVEAPKVEA 177



 Score = 28.1 bits (63), Expect = 7.7
 Identities = 12/50 (24%), Positives = 14/50 (28%), Gaps = 1/50 (2%)

Query: 193 PRPQM-RHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
           PR  +    +       A  A      P  Q  AP     P      PAA
Sbjct: 115 PRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAA 164


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 29.4 bits (66), Expect = 2.2
 Identities = 17/47 (36%), Positives = 22/47 (46%), Gaps = 6/47 (12%)

Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPP 237
             P PQ +   P P      P PG+ APP      PP +++P  PPP
Sbjct: 136 SYPGPQYQGYHPMP------PQPGMPAPPYSLQYPPPGLLQPQGPPP 176



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 12/83 (14%), Positives = 17/83 (20%), Gaps = 15/83 (18%)

Query: 199 HPRPPPHMVAARPAPGLLAPPRMQHAAPP-HMVRPPHPPPPPAASRGKVLSILDR--ARA 255
                P M  A           +Q    P      P  P P       +           
Sbjct: 107 CRPQRPVMTRATSTT-------VQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPY 159

Query: 256 AMEKPAPSAGVYEYEDPHAAHAY 278
           +++ P P         P     Y
Sbjct: 160 SLQYPPPGLL-----QPQGPPPY 177


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 30.3 bits (68), Expect = 2.3
 Identities = 14/54 (25%), Positives = 18/54 (33%), Gaps = 11/54 (20%)

Query: 189 VHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAAS 242
           V   P      P  P    A+ PAP    PP         +++   PPP P   
Sbjct: 219 VSFNPFLPGPSPAQPSAPPASIPAP--PIPP---------VIQYVAPPPVPPPQ 261


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 30.2 bits (68), Expect = 2.3
 Identities = 14/67 (20%), Positives = 18/67 (26%), Gaps = 8/67 (11%)

Query: 195 PQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPP----HMVRPPHPPPPPAASRGKVLSIL 250
             M   R P              PP      PP      +    P P   AS+G     L
Sbjct: 438 APMNAVRAPSRNAQNAAQK----PPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKL 493

Query: 251 DRARAAM 257
            +  A+ 
Sbjct: 494 AQVLASA 500



 Score = 30.2 bits (68), Expect = 2.4
 Identities = 14/63 (22%), Positives = 18/63 (28%)

Query: 180 QEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
           Q   + +G+      P    P  P       P   + AP R    A       P   PP 
Sbjct: 408 QFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPN 467

Query: 240 AAS 242
             S
Sbjct: 468 YQS 470


>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
          Length = 427

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 15/72 (20%), Positives = 25/72 (34%), Gaps = 10/72 (13%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA-------ASRGKVLSILDR 252
           P          P PG   P     A  P     P P    A       A+   +L+++ +
Sbjct: 6   PFSAGSGGFVPPNPGDRTPAAAPAAGAP---FQPRPGRGAASGLNPLVAAANPLLNLIPQ 62

Query: 253 ARAAMEKPAPSA 264
            R+ +  P P++
Sbjct: 63  IRSTVHHPDPAS 74


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 29.0 bits (65), Expect = 2.6
 Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 4/50 (8%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
           HH+P       +P        P P       MQ   P H + P  P PP 
Sbjct: 87  HHQPNLPQPAQQPFQPQPLQPPQP----QQPMQPQPPVHPIPPLPPQPPL 132



 Score = 28.6 bits (64), Expect = 4.1
 Identities = 11/49 (22%), Positives = 18/49 (36%)

Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
             +   Q +  +PP      +P P +   P +    P   + P  P PP
Sbjct: 95  PAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPP 143



 Score = 27.8 bits (62), Expect = 7.6
 Identities = 13/49 (26%), Positives = 17/49 (34%)

Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
              +PQ   P  P   +  +P    + P   Q   PP     P PP  P
Sbjct: 98  QPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLP 146


>gnl|CDD|100009 cd06152, YjgF_YER057c_UK114_like_4, YjgF, YER057c, and UK114
          belong to a large family of proteins present in
          bacteria, archaea, and eukaryotes with no definitive
          function.  The conserved domain is similar in structure
          to chorismate mutase but there is no sequence
          similarity and no functional connection. Members of
          this family have been implicated in isoleucine (Yeo7,
          Ibm1, aldR) and purine (YjgF) biosynthesis, as well as
          threonine anaerobic degradation (tdcF) and
          mitochondrial DNA maintenance (Ibm1). This domain
          homotrimerizes forming a distinct intersubunit cavity
          that may serve as a small molecule binding site.
          Length = 114

 Score = 28.4 bits (64), Expect = 2.9
 Identities = 7/23 (30%), Positives = 13/23 (56%)

Query: 34 HSYTLKLLDDEINKAMASNKKKY 56
          +SY + + ++E    M  N KK+
Sbjct: 62 NSYHVDIKNEEAFGLMVENFKKW 84


>gnl|CDD|234012 TIGR02784, addA_alphas, double-strand break repair helicase AddA,
           alphaproteobacterial type.  AddAB, also called RexAB,
           substitutes for RecBCD in several bacterial lineages.
           These DNA recombination proteins act before synapse and
           are particularly important for DNA repair of
           double-stranded breaks by homologous recombination. The
           term AddAB is used broadly, with AddA homologous between
           the alphaproteobacteria (as modeled here) and the
           Firmicutes, while the partner AddB proteins show no
           strong homology across the two groups of species [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 1135

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 16/75 (21%), Positives = 22/75 (29%), Gaps = 8/75 (10%)

Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
           +   ++R  R     V         A       A P  +R P  P  PA  R    S   
Sbjct: 901 KAEWRLRFTRRDWDPVGLPVE----AAQTDTLEALPDWLRAP-APAEPALPRPLAPS--- 952

Query: 252 RARAAMEKPAPSAGV 266
               A++   P   V
Sbjct: 953 GLGGAIDSALPGEAV 967


>gnl|CDD|233909 TIGR02520, pilus_B_mal_scr, type IVB pilus formation outer membrane
           protein, R64 PilN family.  Several related protein
           families encode outer membrane pore proteins for type II
           secretion, type III secretion, and type IV pilus
           formation. This protein family appears to encode a
           secretin for pilus formation, although it is quite
           different from PilQ. Members include the PilN
           lipoprotein of the plasmid R64 thin pilus, a type IV
           pilus. Scoring between the trusted and noise cutoffs are
           examples of bundle-forming pilus B (bfpB) [Cell
           envelope, Surface structures, Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 497

 Score = 29.9 bits (67), Expect = 3.1
 Identities = 35/164 (21%), Positives = 49/164 (29%), Gaps = 41/164 (25%)

Query: 208 AARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRG------------KVLSILDRARA 255
           AAR  P +L         P    RPP P    A S               +    +    
Sbjct: 75  AARAIPAVLP-------RPDEGGRPPLPSGLGATSTQSEGPGGTAGLITDISWEGEPLGG 127

Query: 256 AMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAP 315
            +++     G+    +  A   +Y      D   +   A              +TA  + 
Sbjct: 128 LLDQVTSRLGLSWKYENGAVRIFY-----TDTRTFQVKALPG-----------DTALNSS 171

Query: 316 LPSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVN 359
           + STSS T GSG  SS G      S  S S+          V N
Sbjct: 172 VTSTSSSTAGSGSSSSGG------SGNSGSTQSTAVKLESSVHN 209


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 29.7 bits (66), Expect = 3.2
 Identities = 31/135 (22%), Positives = 45/135 (33%), Gaps = 19/135 (14%)

Query: 189 VHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLS 248
           +H     Q   P P PH  A+R     +AP       P          P PAAS      
Sbjct: 31  LHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKP-----DLAQAPTPAASEK---- 81

Query: 249 ILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYY 308
             D A      PAP        DP  A    +  A+  P   + + S A  +   + +  
Sbjct: 82  -FDPA------PAPHQAASRAPDPAVAP---QLAAAPKPDAAEAFTSAAQAHEAPADAGT 131

Query: 309 ETAAPAPLPSTSSRT 323
             A+  P P+  ++ 
Sbjct: 132 SAASKKPDPAAHTQH 146


>gnl|CDD|214981 smart01025, BEN, The BEN domain is found in diverse animal
           proteins.  Proteins containing BEN domains are
           BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C,
           the chordopoxvirus virosomal protein E5R and several
           proteins of polydnaviruses. Computational analysis
           suggests that the BEN domain mediates protein-DNA and
           protein-protein interactions during chromatin
           organisation and transcription.
          Length = 80

 Score = 27.3 bits (61), Expect = 3.5
 Identities = 18/78 (23%), Positives = 29/78 (37%), Gaps = 15/78 (19%)

Query: 104 EETMTKMSILGKGSMRDRSKEEELRKEGDPKNAHLVEDLHVQVSALAPPAEAHARIAYAL 163
            ET+   S+ GK S    S     +K  DP     V+ +   +    P  +        L
Sbjct: 14  RETLANSSVTGKRSRTGGSG----KKPLDPNK---VQAIREYLEERFPSVD--------L 58

Query: 164 SELRKYIIPDVNDEIRQE 181
            E  K  I  +N++ R+ 
Sbjct: 59  DEDWKVCIQAINNKCRRL 76


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 29.5 bits (66), Expect = 3.6
 Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 5/87 (5%)

Query: 206 MVAARPAPGLLAPPRMQHAAPPHMVRPPHPP-----PPPAASRGKVLSILDRARAAMEKP 260
           ++++R  PG  AP R     PP   RP   P          +R      +D   A    P
Sbjct: 424 LLSSRQPPGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLP 483

Query: 261 APSAGVYEYEDPHAAHAYYETTASYDP 287
           A +A   E     +   YY       P
Sbjct: 484 AGAAPPPEPAAAPSPATYYTRMGGGPP 510


>gnl|CDD|131743 TIGR02696, pppGpp_PNP, guanosine pentaphosphate synthetase
           I/polynucleotide phosphorylase.  Sohlberg, et al present
           characterization of two proteins from Streptomyces
           coelicolor. The protein in this family was shown to have
           poly(A) polymerase activity and may be responsible for
           polyadenylating RNA in this species. Reference 2 showed
           that a nearly identical plasmid-encoded protein from
           Streptomyces antibioticus is a bifunctional enzyme that
           acts also as a guanosine pentaphosphate synthetase.
          Length = 719

 Score = 29.4 bits (66), Expect = 4.1
 Identities = 14/46 (30%), Positives = 29/46 (63%), Gaps = 6/46 (13%)

Query: 73  VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSM 118
           + V +PV +      +G+++GPKG  + ++Q+ET  ++SI   G++
Sbjct: 580 ITVKIPVDK------IGEVIGPKGKMINQIQDETGAEISIEDDGTV 619


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 29.2 bits (65), Expect = 4.5
 Identities = 16/58 (27%), Positives = 19/58 (32%), Gaps = 3/58 (5%)

Query: 184 REMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
           RE G     P P  R           +P P   AP   +    P +     PPP PA 
Sbjct: 299 RETGGEGEGPEPAGRDGAAGGE---PKPGPPRPAPDADRPEGWPSLEAITFPPPTPAT 353


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 29.3 bits (65), Expect = 4.6
 Identities = 11/53 (20%), Positives = 16/53 (30%), Gaps = 3/53 (5%)

Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASR 243
           H P+P     +P     A +P     A    Q      +     P P   A +
Sbjct: 199 HAPKPT---QQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQ 248


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 29.4 bits (66), Expect = 4.6
 Identities = 21/89 (23%), Positives = 26/89 (29%), Gaps = 22/89 (24%)

Query: 209 ARPAPGLLAPPRMQHAAPPHMVRPP-------------HPPPPPAASRGKVLSILDRARA 255
           A  A G +A    Q AAP      P              PP PPA +         +   
Sbjct: 40  AEGAAGPVAKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPA---------QEGE 90

Query: 256 AMEKPAPSAGVYEYEDPHAAHAYYETTAS 284
           A     PSA       P  A     + A+
Sbjct: 91  APAAEQPSAVPAPSAAPAPAEPVEPSLAA 119


>gnl|CDD|220243 pfam09440, eIF3_N, eIF3 subunit 6 N terminal domain.  This is the N
           terminal domain of subunit 6 translation initiation
           factor eIF3.
          Length = 133

 Score = 28.0 bits (63), Expect = 4.7
 Identities = 16/57 (28%), Positives = 25/57 (43%), Gaps = 4/57 (7%)

Query: 1   MDIERELEDGNDTKANEYVKT---LLKEKMTLDSDSHSYTLKLLDDEINKAMASNKK 54
            D+ +EL   ++    E       +LKE   L  +    T  L D E+ K + S+KK
Sbjct: 51  ADLRKELY-PDEEMPAELAAKRQEVLKELQELQEEVAPITKLLEDPEVIKNLRSDKK 106


>gnl|CDD|235357 PRK05177, minC, septum formation inhibitor; Reviewed.
          Length = 239

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 16/47 (34%), Positives = 20/47 (42%)

Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
           RP +  P  PP +V  RPA  +  P     A PP    P  P   P+
Sbjct: 87  RPSLLGPGMPPALVGGRPAGDVEIPEEEPAAPPPAPAAPEAPAAVPS 133


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 28.7 bits (64), Expect = 5.4
 Identities = 14/44 (31%), Positives = 17/44 (38%), Gaps = 8/44 (18%)

Query: 200 PRPPPHMVAA-----RPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
           P PP  M+A       PAP  + P      A    V+P  PP  
Sbjct: 59  PAPPSTMIATNTKPFHPAPIDVPPDP---PAAQEAVQPTAPPSA 99


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.0 bits (64), Expect = 5.6
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 303 TSASYYETAAPAPLPSTSSRTGGSGWKSSSGY 334
            S ++  T +P P P T+S+  G G  S+S Y
Sbjct: 638 PSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRY 669


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 28.9 bits (64), Expect = 5.6
 Identities = 24/83 (28%), Positives = 29/83 (34%), Gaps = 5/83 (6%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQH-AAPPHMVRPPHPPP----PPAASRGKVLSILDRAR 254
               P     R  P   AP R +  AA P   RPP   P    PPAA+ G+         
Sbjct: 713 RAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772

Query: 255 AAMEKPAPSAGVYEYEDPHAAHA 277
           A   +P P A     + P  A  
Sbjct: 773 APTPQPPPQAPPAPQQRPRGAPT 795


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 28.7 bits (63), Expect = 6.0
 Identities = 16/49 (32%), Positives = 16/49 (32%), Gaps = 1/49 (2%)

Query: 193 PRPQMRHPRPP-PHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
           P P      P  P      PAP    PP      PP  V P    P PA
Sbjct: 91  PAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPA 139


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 29.1 bits (66), Expect = 6.2
 Identities = 18/64 (28%), Positives = 19/64 (29%), Gaps = 10/64 (15%)

Query: 201 RPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKP 260
            P         A    A      AAP         P PPAA           A AA  KP
Sbjct: 37  GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAA----------AAPAAPPKP 86

Query: 261 APSA 264
           A +A
Sbjct: 87  AAAA 90


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 28.5 bits (64), Expect = 6.4
 Identities = 12/38 (31%), Positives = 15/38 (39%), Gaps = 5/38 (13%)

Query: 317 PSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSS 354
             +      S   SSS       SS S SS+  R+SS 
Sbjct: 278 GRSYGSGSPSYSPSSSSN-----SSSSSSSSSTRTSSG 310


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 27.9 bits (62), Expect = 6.6
 Identities = 12/40 (30%), Positives = 14/40 (35%), Gaps = 8/40 (20%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
           PRPPP    +  A              P+M  P  P  PP
Sbjct: 40  PRPPPPYFNSSVASS--------PQPHPYMWGPQQPMMPP 71


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 28.8 bits (64), Expect = 6.8
 Identities = 17/79 (21%), Positives = 27/79 (34%), Gaps = 2/79 (2%)

Query: 150 APPAEAHARIAYALSELRKYIIPDVNDEIRQEQFR--EMGYVHHEPRPQMRHPRPPPHMV 207
             P +A    A A        +       + +  +  + G+   +  PQ+  P   P +V
Sbjct: 178 RAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDLDDALPQILPPAALPPIV 237

Query: 208 AARPAPGLLAPPRMQHAAP 226
            A  AP  LA       AP
Sbjct: 238 VAPAAPAALAAVAAAAPAP 256


>gnl|CDD|197247 cd09149, PLDc_vPLD5_2, Putative catalytic domain, repeat 2, of
          inactive veterbrate phospholipase PLD5.  Putative
          catalytic domain, repeat 2, of inactive veterbrate
          phospholipases D5 (PLD5, EC 3.1.4.4), homologs of the
          vaccinia virus protein K4 encoded by the HindIII K4L
          gene. Vertebrate PLD5 has been assigned to the PLD
          superfamily, since it shows high sequence similarity to
          other human homologs of protein K4, which contain two
          copies of the conserved HKD motif (H-x-K-x(4)-D, where
          x represents any amino acid residue). However, due to
          the lack of functionally important histidine and lysine
          residues in the HKD motif, vetebrate PLD5 has been
          characterized as an inactive PLD.
          Length = 188

 Score = 27.9 bits (62), Expect = 7.1
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 51 SNKKKYDSPYLDLFREKPIL--VKVKVLVPV--KEHP-RFNFVGKL 91
          S  ++Y S      RE  +L  V+V++L+    K  P  FNFV  L
Sbjct: 46 SYARRYWSRIDSKIREALVLRSVRVRLLISFWRKTDPLTFNFVSSL 91


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 28.4 bits (63), Expect = 7.3
 Identities = 12/42 (28%), Positives = 13/42 (30%)

Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
           P  P    A  P+     P     A  PH    P  P P  A
Sbjct: 207 PAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPA 248


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 28.0 bits (62), Expect = 7.4
 Identities = 14/48 (29%), Positives = 15/48 (31%)

Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
           RPQ             + AP   A    Q A       P  P P PAA
Sbjct: 114 RPQGDDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161


>gnl|CDD|178608 PLN03046, PLN03046, D-glycerate 3-kinase; Provisional.
          Length = 460

 Score = 28.0 bits (62), Expect = 9.3
 Identities = 14/99 (14%), Positives = 26/99 (26%), Gaps = 7/99 (7%)

Query: 272 PHAAHAYYETTAS-------YDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTG 324
           P    +      +            Y P +    D    S S Y ++  +   +  S  G
Sbjct: 28  PIPCASAAAFNFNPISNPFNLKRRPYSPSSHKFNDRVAASCSSYPSSKLSSRKTHLSCPG 87

Query: 325 GSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVNASLV 363
                       H  ++ + +S    S  +      S V
Sbjct: 88  CGCSWIQDNSMVHDYATTAATSKRCSSLPTSSPALVSSV 126


>gnl|CDD|106738 PRK13797, PRK13797, putative bifunctional allantoicase/OHCU
           decarboxylase; Provisional.
          Length = 516

 Score = 28.0 bits (62), Expect = 9.6
 Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 231 RPPHPPPPPAASRGKVLSILD--RARAAMEKPA 261
           RP   P PP ++R  V+S+    R +AAM++ A
Sbjct: 412 RPTQAPAPPTSARATVVSLDAPRREQAAMDQAA 444


>gnl|CDD|181337 PRK08266, PRK08266, hypothetical protein; Provisional.
          Length = 542

 Score = 28.1 bits (63), Expect = 9.7
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 7/58 (12%)

Query: 206 MVAARPAPGLLAPP------RMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAM 257
           M++ RP P  L  P      R   AA P +   P P P P A      +++  A+  M
Sbjct: 153 MLSGRPRPVALEMPWDVFGQRAPVAAAPPLRPAPPPAPDPDAI-AAAAALIAAAKNPM 209


>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
           activation/proliferation-associated protein-1 C term.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 343 and 708 amino
           acids in length. This family is the C terminal region of
           caprin-1. Caprin-1 is a protein involved in regulating
           cellular proliferation. In mutated phenotypes, the G1
           phase of the cell cycle is greatly lengthened, impairing
           normal proliferation. The C terminal region of caprin-1
           contains RGG motifs which are characteristic of RNA
           binding domains. It is possible that caprin-1 functions
           through an RNA binding mechanism.
          Length = 319

 Score = 28.0 bits (62), Expect = 9.9
 Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 9/53 (16%)

Query: 189 VHHEPRPQMRHPRPPPHMVAARPAPG---LLAPPRMQHAAPPHMVRPPHPPPP 238
           VH E R         P  V  +P P    +++P    + + P + +P H   P
Sbjct: 45  VHSESRL------SQPSAVPVQPEPTQVPMVSPTSEGYTSSPPLYQPSHTAEP 91


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 27.6 bits (61), Expect = 9.9
 Identities = 19/58 (32%), Positives = 22/58 (37%), Gaps = 10/58 (17%)

Query: 193 PRPQMRHPRPPPHMVAARPAPGL--------LAPPRMQHAAPPHMVRPPHPPPPPAAS 242
             P   +PR PP      P PG           PP +    PP  + PP  PPPP  S
Sbjct: 163 GPPSPWNPRFPP--GPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPMS 218


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.129    0.371 

Gapped
Lambda     K      H
   0.267   0.0703    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,925,085
Number of extensions: 2065459
Number of successful extensions: 3608
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2766
Number of HSP's successfully gapped: 328
Length of query: 402
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 303
Effective length of database: 6,546,556
Effective search space: 1983606468
Effective search space used: 1983606468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 60 (26.9 bits)