RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11399
         (865 letters)



>gnl|CDD|240015 cd04657, Piwi_ago-like, Piwi_ago-like: PIWI domain, Argonaute-like
           subfamily. Argonaute is the central component of the
           RNA-induced silencing complex (RISC) and related
           complexes. The PIWI domain is the C-terminal portion of
           Argonaute and consists of two subdomains, one of which
           provides the 5' anchoring of the guide RNA and the
           other, the catalytic site for slicing.
          Length = 426

 Score =  322 bits (827), Expect = e-102
 Identities = 127/279 (45%), Positives = 167/279 (59%), Gaps = 30/279 (10%)

Query: 448 YVHEFGLTISNSMMEVRGRVLPPPKLQYGGRMSSITAQVNSMSSQGLFASKQQALPNSGV 507
           Y+ EFG+++S  M+ V GRVLPPPKL+YG                      +   P +G 
Sbjct: 2   YLKEFGISVSKEMITVPGRVLPPPKLKYGDS-------------------SKTVPPRNGS 42

Query: 508 WDMRGKQFYTGVEIRVWAIACFAPQRTVRED--SLRSFTQQLQKISNDAGMPIIGQPCFC 565
           W++RGK+F  G  IR WA+  FA  R  RE+   LR+F  QL K    AG+ I       
Sbjct: 43  WNLRGKKFLEGGPIRSWAVLNFAGPRRSREERADLRNFVDQLVKTVIGAGINIT-----T 97

Query: 566 KYATGPDQVEPMFRYLK-STFQALQLVVVVLPGK-TPVYAEVKRVGDTVLGMATQCVQAK 623
             A+   +VE +F  LK +  +  QLV+V+LP K + +Y  +KR+ DT LG+ TQCV AK
Sbjct: 98  AIASVEGRVEELFAKLKQAKGEGPQLVLVILPKKDSDIYGRIKRLADTELGIHTQCVLAK 157

Query: 624 NVNK-TSPQTLSNLCLKINVKLGGINSILVPSIRPKVFNEPVIFMGADVTHPPAGDN-KK 681
            V K  +PQ  +N+ LKIN+KLGGIN  L P IRP +  EP + +GADVTHP  GD    
Sbjct: 158 KVTKKGNPQYFANVALKINLKLGGINHSLEPDIRPLLTKEPTMVLGADVTHPSPGDPAGA 217

Query: 682 PSIAAVVGSMDAHPSRYAATVRVQQHRQEIIQELSSMVR 720
           PSIAAVV S+D H ++Y A+VR+Q HRQEII +L SMVR
Sbjct: 218 PSIAAVVASVDWHLAQYPASVRLQSHRQEIIDDLESMVR 256



 Score =  224 bits (573), Expect = 3e-65
 Identities = 74/104 (71%), Positives = 83/104 (79%)

Query: 720 RLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNH 779
           R F  D+ +  GK+GN+P GT VD GITHP EFDFYLCSH GIQGT+RP+HYHVLWD+  
Sbjct: 323 RFFPTDEDDADGKNGNVPPGTVVDRGITHPREFDFYLCSHAGIQGTARPTHYHVLWDEIG 382

Query: 780 FDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVAFRARYHL 823
           F ADELQ LTY LC+TY RCTRSVSIP PAYYAHL A RAR +L
Sbjct: 383 FTADELQTLTYNLCYTYARCTRSVSIPPPAYYAHLAAARARCYL 426


>gnl|CDD|215631 PLN03202, PLN03202, protein argonaute; Provisional.
          Length = 900

 Score =  317 bits (814), Expect = 8e-95
 Identities = 229/789 (29%), Positives = 361/789 (45%), Gaps = 136/789 (17%)

Query: 39  PGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRPIVLRA 98
              PP PV PP  VP        I      +P +P  LPM     R   G +G+ I L  
Sbjct: 3   ALPPPPPVVPPNVVP--------IKLEPTKKPSKPKRLPM----ARRGFGSKGQKIQLLT 50

Query: 99  NHFQISM--PRGFVHHYEIQIQ-PDKCP---RKVNREIIETMVHAYSKIFGALKPVFDGR 152
           NHF++S+  P G   HY + +   D  P   + + R++I+ +   YS         +DG 
Sbjct: 51  NHFKVSVNNPDGHFFHYSVSLTYEDGRPVDGKGIGRKVIDKVQETYSSDLAGKDFAYDGE 110

Query: 153 NNLYTRDPLPIGNDRMELEVTL--------------------PGEGKDRV--------FR 184
            +L+T   LP   +++E  V L                     G  + R         F+
Sbjct: 111 KSLFTVGALP--QNKLEFTVVLEDVSSNRNNGNGSPVGNGSPNGGDRKRSRRPYQSKTFK 168

Query: 185 VAIKWQAQVSLFALEEALEGRTRQIPFDAILALDVVMR-HLPSMTYTPVGRSFFSTPDGY 243
           V I + A++ + A+  AL G+  +   DA+  LD+++R H        V +SFF      
Sbjct: 169 VEISFAAKIPMQAIANALRGQESENSQDALRVLDIILRQHAAKQGCLLVRQSFFHNDPKN 228

Query: 244 YHPLGGGREVWFGFHQSVRPSQWKMMLNIDVSATAFYKSQPVIEFMCEVLDIRDVNEQRK 303
           +  LGGG     GFH S R +Q  + LNIDVS T   +  PV++F+    ++RD  +   
Sbjct: 229 FVDLGGGVLGCRGFHSSFRTTQGGLSLNIDVSTTMIVQPGPVVDFLIANQNVRDPFQ--- 285

Query: 304 PLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRRPAQMQSFPLQLENG-----QT 358
              D  + K  + +K L+++++       +Y++  ++ +P + Q+F L+  NG     +T
Sbjct: 286 --IDWSKAK--RMLKNLRVKVSPSNQ---EYKITGLSEKPCKEQTFSLKQRNGNGNEVET 338

Query: 359 VECTVAKYFLDKYKMKLRFP-HLPCLQVGQEHKHTYLPLEVCNIVAGQRCIKKLTDMQTS 417
           VE TV  YF+    ++LR+   LPC+ VG+  + TY P+E+C++V+ QR  K L+ +Q S
Sbjct: 339 VEITVYDYFVKHRGIELRYSGDLPCINVGKPKRPTYFPIELCSLVSLQRYTKALSTLQRS 398

Query: 418 TMIKATARSAPDREREIQSLVKRADFNNDAYVHEFGLTISNSMMEVRGRVLPPPKLQYGG 477
           ++++ + +   +R + +   +K ++++ D  +   G++IS+   +V GRVLP PKL+ G 
Sbjct: 399 SLVEKSRQKPQERMKVLTDALKSSNYDADPMLRSCGISISSQFTQVEGRVLPAPKLKVG- 457

Query: 478 RMSSITAQVNSMSSQGLFASKQQALPNSGVWDMRGKQFYTGVEIRVWAIACFAPQRTVRE 537
                       + +  F       P +G W+   K+     +I  WA+  F    + R 
Sbjct: 458 ------------NGEDFF-------PRNGRWNFNNKKLVEPTKIERWAVVNF----SARC 494

Query: 538 DSLRSFTQQLQKISNDAGMPIIG-------QPCFCKYATGPDQVEPMFRYLKSTFQAL-Q 589
           D  R   + L K     G+ I          P F + A  P +VE MF  ++S      Q
Sbjct: 495 DI-RHLVRDLIKCGEMKGINIEPPFDVFEENPQF-RRAPPPVRVEKMFEQIQSKLPGPPQ 552

Query: 590 LVVVVLPGK--TPVYAEVKRVGDTVLGMATQCVQAKNVNKTSPQTLSNLCLKINVKLGGI 647
            ++ +LP +  + +Y   K+   +  G+ TQC+    VN    Q L+N+ LKIN KLGG+
Sbjct: 553 FLLCILPERKNSDIYGPWKKKNLSEFGIVTQCIAPTRVND---QYLTNVLLKINAKLGGL 609

Query: 648 NSILV----PSIRPKVFNEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAHP--SRYAAT 701
           NS+L     PSI P V   P I +G DV+H   G +  PSIAAVV S    P  SRY A+
Sbjct: 610 NSLLAIEHSPSI-PLVSKVPTIILGMDVSHGSPGQSDVPSIAAVVSSRQ-WPLISRYRAS 667

Query: 702 VRVQQHRQEIIQELSSMVRLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQG 761
           VR Q  + E+I  L                     P G   D GI      DFY  S   
Sbjct: 668 VRTQSPKVEMIDSLFK-------------------PVGDKDDDGIIRELLLDFYTSS--- 705

Query: 762 IQGTSRPSH 770
             G  +P  
Sbjct: 706 --GKRKPEQ 712



 Score =  118 bits (298), Expect = 3e-27
 Identities = 45/88 (51%), Positives = 59/88 (67%)

Query: 729 QSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNHFDADELQCL 788
           Q+G   N+P GT VD  I HP   DFY+C+H G+ GT+RP+HYHVL D+  F AD+LQ L
Sbjct: 766 QAGSPDNVPPGTVVDNKICHPRNNDFYMCAHAGMIGTTRPTHYHVLLDEIGFSADDLQEL 825

Query: 789 TYQLCHTYVRCTRSVSIPAPAYYAHLVA 816
            + L + Y R T ++S+ AP  YAHL A
Sbjct: 826 VHSLSYVYQRSTTAISVVAPVCYAHLAA 853


>gnl|CDD|214930 smart00950, Piwi, This domain is found in the protein Piwi and its
           relatives.  The function of this domain is the dsRNA
           guided hydrolysis of ssRNA. Determination of the crystal
           structure of Argonaute reveals that PIWI is an RNase H
           domain, and identifies Argonaute as Slicer, the enzyme
           that cleaves mRNA in the RNAi RISC complex.. In
           addition, Mg+2 dependence and production of 3'-OH and 5'
           phosphate products are shared characteristics of RNaseH
           and RISC. The PIWI domain core has a tertiary structure
           belonging to the RNase H family of enzymes. RNase H fold
           proteins all have a five-stranded mixed beta-sheet
           surrounded by helices. By analogy to RNase H enzymes
           which cleave single-stranded RNA guided by the DNA
           strand in an RNA/DNA hybrid, the PIWI domain can be
           inferred to cleave single-stranded RNA, for example
           mRNA, guided by double stranded siRNA.
          Length = 301

 Score =  258 bits (662), Expect = 1e-79
 Identities = 118/308 (38%), Positives = 153/308 (49%), Gaps = 80/308 (25%)

Query: 590 LVVVVLPG--KTPVYAEVKRVGDTVLGMATQCVQAKNVNK-----TSPQTLSNLCLKINV 642
           L+VV+LPG  KT +Y E+K+  +T LG+ TQCVQAK ++K        Q L+N+ LKIN 
Sbjct: 1   LIVVILPGEKKTDLYHEIKKYLETKLGVPTQCVQAKTLDKVSKRRKLKQYLTNVALKINA 60

Query: 643 KLGGINSILVPSIRPKVFNEPVIFMGADVTHPPAGDNK--KPSIAAVVGS-MDAHPSRYA 699
           KLGGIN +L     P +  +P + +G DV+HP AG      PS+AA V S      + Y 
Sbjct: 61  KLGGINWVLDV---PPIPLKPTLIIGIDVSHPSAGKGGSVAPSVAAFVASGNYLSGNFYQ 117

Query: 700 ATVRVQ--------------------------------------QHRQEIIQELSSM--- 718
           A VR Q                                      Q +Q +  E+ ++   
Sbjct: 118 AFVREQGSRQLKEILREALKKYYKSNRKRLPDRIVVYRDGVSEGQFKQVLEYEVKAIKKA 177

Query: 719 ----------------------VRLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYL 756
                                  R F  D         N+P GT VD  IT P  +DFYL
Sbjct: 178 CKELGPDYKPKLTVIVVQKRHHTRFFPEDG----NGRVNVPPGTVVDSVITSPEWYDFYL 233

Query: 757 CSHQGIQGTSRPSHYHVLWDDNHFDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVA 816
            SH G+QGT+RP+HY VL+D+ + D DELQ LTY+LCH Y R TR VS+PAP YYAHL+A
Sbjct: 234 VSHAGLQGTARPTHYTVLYDEGNLDPDELQRLTYKLCHLYYRSTRPVSLPAPVYYAHLLA 293

Query: 817 FRARYHLV 824
            RAR  L 
Sbjct: 294 KRARQLLH 301


>gnl|CDD|239208 cd02826, Piwi-like, Piwi-like: PIWI domain. Domain found in
           proteins involved in RNA silencing. RNA silencing refers
           to a group of related gene-silencing mechanisms mediated
           by short RNA molecules, including siRNAs, miRNAs, and
           heterochromatin-related guide RNAs. The central
           component of the RNA-induced silencing complex (RISC)
           and related complexes is Argonaute. The PIWI domain is
           the C-terminal portion of Argonaute and consists of two
           subdomains, one of which provides the 5' anchoring of
           the guide RNA and the other, the catalytic site for
           slicing. This domain is also found in closely related
           proteins, including the Piwi subfamily, where it is
           believed to perform a crucial role in germline cells,
           via a similar mechanism.
          Length = 393

 Score =  225 bits (575), Expect = 6e-66
 Identities = 108/434 (24%), Positives = 165/434 (38%), Gaps = 116/434 (26%)

Query: 461 MEVRGRVLPPPKLQYGGRMSSITAQVNSMSSQGLFASKQQALPNSGVWDMRGKQFYTGVE 520
           + ++GRVLP P+         I  +   + + G F    +      V   R ++      
Sbjct: 3   LILKGRVLPKPQ---------ILFKNKFLRNIGPFEKPAKITNPVAVIAFRNEEVDD--L 51

Query: 521 IRVWAIACFAPQRTVREDSLRSFTQQLQKISNDAGMPIIGQPCFCKYATGPDQVEPMFRY 580
           ++  A AC      ++E  + S+ +                          +  + +   
Sbjct: 52  VKRLADACRQLGMKIKEIPIVSWIED-----------------------LNNSFKDLKSV 88

Query: 581 LKSTFQA-LQLVVVVLPGK-TPVYAEVKRVGDTVLGMATQCVQAKNVNKTS--PQTLSNL 636
            K+  +A +QLV+ +L  K  P++ E+KR+      + +Q +Q K   K     QTL NL
Sbjct: 89  FKNAIKAGVQLVIFILKEKKPPLHDEIKRLEAKS-DIPSQVIQLKTAKKMRRLKQTLDNL 147

Query: 637 CLKINVKLGGINSILVPSIRPKVFNEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAH-- 694
             K+N KLGGIN IL     P    +  IF+G DV+HP           + VG       
Sbjct: 148 LRKVNSKLGGINYILDS---PVKLFKSDIFIGFDVSHPDRR--TVNGGPSAVGFAANLSN 202

Query: 695 -----------PSRYAATV--------------------------------------RVQ 705
                      PSR                                           RV+
Sbjct: 203 HTFLGGFLYVQPSREVKLQDLGEVIKKCLDGFKKSTGEGLPEKIVIYRDGVSEGEFKRVK 262

Query: 706 QHRQEIIQELSSM------------------VRLFCADKKEQSGKSGNIPAGTTVDIGIT 747
           +  +EII+E   +                   R F  +K    G   N   GT VD  IT
Sbjct: 263 EEVEEIIKEACEIEESYRPKLVIIVVQKRHNTRFFPNEKN---GGVQNPEPGTVVDHTIT 319

Query: 748 HPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNHFDADELQCLTYQLCHTYVRCTRSVSIPA 807
            P   +FYL SH   QGT +P+ Y V+++D ++  +EL+ LTY LC T+      +S+PA
Sbjct: 320 SPGLSEFYLASHVARQGTVKPTKYTVVFNDKNWSLNELEILTYILCLTHQNVYSPISLPA 379

Query: 808 PAYYAHLVAFRARY 821
           P YYAH +A R R 
Sbjct: 380 PLYYAHKLAKRGRN 393


>gnl|CDD|216915 pfam02171, Piwi, Piwi domain.  This domain is found in the protein
           Piwi and its relatives. The function of this domain is
           the dsRNA guided hydrolysis of ssRNA. Determination of
           the crystal structure of Argonaute reveals that PIWI is
           an RNase H domain, and identifies Argonaute as Slicer,
           the enzyme that cleaves mRNA in the RNAi RISC complex.
           In addition, Mg+2 dependence and production of 3'-OH and
           5' phosphate products are shared characteristics of
           RNaseH and RISC. The PIWI domain core has a tertiary
           structure belonging to the RNase H family of enzymes.
           RNase H fold proteins all have a five-stranded mixed
           beta-sheet surrounded by helices. By analogy to RNase H
           enzymes which cleave single-stranded RNA guided by the
           DNA strand in an RNA/DNA hybrid, the PIWI domain can be
           inferred to cleave single-stranded RNA, for example
           mRNA, guided by double stranded siRNA.
          Length = 296

 Score =  154 bits (392), Expect = 7e-42
 Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 3/105 (2%)

Query: 720 RLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNH 779
           R F + K      + N P GT VD  IT P  +DFYLCSH G QGT +P+ Y VL+D+  
Sbjct: 195 RFFASGK---RDGAQNPPPGTVVDDVITSPEYYDFYLCSHAGRQGTVKPTKYTVLYDEIG 251

Query: 780 FDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVAFRARYHLV 824
              +ELQ LTY+LC+ Y R  R VS+PAP YYAH +A R R +L 
Sbjct: 252 LSPEELQQLTYKLCYMYQRVFRPVSLPAPVYYAHKLAKRGRNNLK 296



 Score =  129 bits (327), Expect = 3e-33
 Identities = 54/133 (40%), Positives = 79/133 (59%), Gaps = 7/133 (5%)

Query: 590 LVVVVLPG-KTPVYAEVKRVGDTVLGMATQCVQAKNV-NKTSPQTLSNLCLKINVKLGGI 647
           L+VV+LP      Y  +K+  +T LG+ TQC++ K    +T PQTL+N+ LKINVKLGG+
Sbjct: 1   LIVVILPDKNKDNYHSIKKYLETELGIPTQCIRLKTALKRTKPQTLTNVLLKINVKLGGL 60

Query: 648 NSILVPSIRPKVFNEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAHPSRYAATVRVQQH 707
           N   +  I PK+     + +G D++H   G +  PS+   V SMD HP +YA  VR Q  
Sbjct: 61  N-YWIVEIPPKI----DVIIGFDISHGTGGTDDNPSVVGFVASMDKHPQKYAGGVRYQAS 115

Query: 708 RQEIIQELSSMVR 720
            QE+I+ L  ++ 
Sbjct: 116 GQELIEPLKEIIL 128


>gnl|CDD|239212 cd02846, PAZ_argonaute_like, PAZ domain, argonaute_like subfamily.
           Argonaute is part of the RNA-induced silencing complex
           (RISC), and is an endonuclease that plays a key role in
           the RNA interference pathway. The PAZ domain has been
           named after the proteins Piwi,Argonaut, and Zwille. PAZ
           is found in two families of proteins that are essential
           components of RNA-mediated gene-silencing pathways,
           including RNA interference, the Piwi and Dicer families.
           PAZ functions as a nucleic acid binding domain, with a
           strong preference for single-stranded nucleic acids (RNA
           or DNA) or RNA duplexes with single-stranded 3'
           overhangs. It has been suggested that the PAZ domain
           provides a unique mode for the recognition of the two
           3'-terminal nucleotides in single-stranded nucleic acids
           and buries the 3' OH group, and that it might recognize
           characteristic 3' overhangs in siRNAs within RISC
           (RNA-induced silencing) and other complexes.
          Length = 114

 Score =  146 bits (370), Expect = 4e-41
 Identities = 54/121 (44%), Positives = 77/121 (63%), Gaps = 7/121 (5%)

Query: 282 SQPVIEFMCEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTR 341
           +QPVIEF+ E L           L+D+ R K  K +KGLK+E+TH G   RKY++  ++ 
Sbjct: 1   AQPVIEFLKEFLGFDTPL----GLSDNDRRKLKKALKGLKVEVTHRGNTNRKYKIKGLSA 56

Query: 342 RPAQMQSFPLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCLQVGQEHKHTYLPLEVCNI 401
            PA  Q+F L        E +VA YF +KY ++L++P+LPCLQVG++ K  YLP+E+CNI
Sbjct: 57  EPASQQTFEL---KDGEKEISVADYFKEKYNIRLKYPNLPCLQVGRKGKPNYLPMELCNI 113

Query: 402 V 402
           V
Sbjct: 114 V 114


>gnl|CDD|216914 pfam02170, PAZ, PAZ domain.  This domain is named PAZ after the
           proteins Piwi Argonaut and Zwille. This domain is found
           in two families of proteins that are involved in
           post-transcriptional gene silencing. These are the Piwi
           family and the Dicer family, that includes the Carpel
           factory protein. The function of the domains is unknown
           but has been suggested to mediate complex formation
           between proteins of the Piwi and Dicer families by
           hetero-dimerisation. The three-dimensional structure of
           this domain has been solved. The PAZ domain is composed
           of two subdomains. One subdomain is similar to the OB
           fold, albeit with a different topology. The OB-fold is
           well known as a single-stranded nucleic acid binding
           fold. The second subdomain is composed of a beta-hairpin
           followed by an alpha-helix. The PAZ domains shows
           low-affinity nucleic acid binding and appears to
           interact with the 3' ends of single-stranded regions of
           RNA in the cleft between the two subdomains. PAZ can
           bind the characteristic two-base 3' overhangs of siRNAs,
           indicating that although PAZ may not be a primary
           nucleic acid binding site in Dicer or RISC, it may
           contribute to the specific and productive incorporation
           of siRNAs and miRNAs into the RNAi pathway.
          Length = 114

 Score =  137 bits (346), Expect = 5e-38
 Identities = 53/127 (41%), Positives = 75/127 (59%), Gaps = 13/127 (10%)

Query: 283 QPVIEFMCEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRR 342
           QPV++F+       D+  QR PL D  R KFTK +KGL +E T+     R YR+  +T  
Sbjct: 1   QPVLDFL------TDILTQRTPLGDDFREKFTKALKGLIVETTYN---NRTYRIDGITWD 51

Query: 343 PAQMQSFPLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCLQVGQEHKHTYLPLEVCNIV 402
           P    +FPL+       E +VA+YF +KY + L++P+LP L VG++ K  YLP E+C I 
Sbjct: 52  PTPNSTFPLK----DGGEISVAEYFKEKYNITLKYPNLPLLVVGRKKKPNYLPPELCFIT 107

Query: 403 AGQRCIK 409
            GQR +K
Sbjct: 108 GGQRYMK 114


>gnl|CDD|239207 cd02825, PAZ, PAZ domain, named PAZ after the proteins Piwi
           Argonaut and Zwille. PAZ is found in two families of
           proteins that are essential components of RNA-mediated
           gene-silencing pathways, including RNA interference, the
           piwi and Dicer families. PAZ functions as a nucleic-acid
           binding domain, with a strong preference for
           single-stranded nucleic acids (RNA or DNA) or RNA
           duplexes with single-stranded 3' overhangs. It has been
           suggested that the PAZ domain provides a unique mode for
           the recognition of the two 3'-terminal nucleotides in
           single-stranded nucleic acids and buries the 3' OH
           group, and that it might recognize characteristic 3'
           overhangs in siRNAs within RISC (RNA-induced silencing)
           and other complexes. This parent model also contains
           structures of an archaeal PAZ domain.
          Length = 115

 Score =  115 bits (289), Expect = 3e-30
 Identities = 48/123 (39%), Positives = 63/123 (51%), Gaps = 12/123 (9%)

Query: 283 QPVIEFMCEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRR 342
            PVIE MC+    R+++    PL DS R +FTKE+KGLK+E TH   + R YR    TR 
Sbjct: 2   DPVIETMCKFPKDREIDT---PLLDSPREEFTKELKGLKVEDTHNP-LNRVYRPDGETRL 57

Query: 343 PAQMQSFPLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCLQVGQEHK---HTYLPLEVC 399
            A     P QL++    E T A YF ++Y + L   + P L V    K      LP E+C
Sbjct: 58  KA-----PSQLKHSDGKEITFADYFKERYNLTLTDLNQPLLIVKFSSKKSYSILLPPELC 112

Query: 400 NIV 402
            I 
Sbjct: 113 VIT 115


>gnl|CDD|240016 cd04658, Piwi_piwi-like_Euk, Piwi_piwi-like_Euk: PIWI domain,
           Piwi-like subfamily found in eukaryotes. This domain is
           found in Piwi and closely related proteins, where it is
           believed to perform a crucial role in germline cells,
           via RNA silencing. RNA silencing refers to a group of
           related gene-silencing mechanisms mediated by short RNA
           molecules, including siRNAs, miRNAs, and
           heterochromatin-related guide RNAs. The mechanism in
           Piwi is believed to be similar to that in Argonaute, the
           central component of the RNA-induced silencing complex
           (RISC). The PIWI domain is the C-terminal portion of
           Argonaute and consists of two subdomains, one of which
           provides the 5' anchoring of the guide RNA and the
           other, the catalytic site for slicing.
          Length = 448

 Score =  105 bits (264), Expect = 7e-24
 Identities = 43/98 (43%), Positives = 54/98 (55%), Gaps = 4/98 (4%)

Query: 720 RLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNH 779
           R F     +      N P GT VD  IT P  +DF+L S    QGT  P+HY+VL+D   
Sbjct: 351 RFF----NQGGNNFSNPPPGTVVDSEITKPEWYDFFLVSQSVRQGTVTPTHYNVLYDTTG 406

Query: 780 FDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVAF 817
              D LQ LTY+LCH Y   + S+ +PAP  YAH +AF
Sbjct: 407 LKPDHLQRLTYKLCHLYYNWSGSIRVPAPCQYAHKLAF 444



 Score = 95.4 bits (238), Expect = 1e-20
 Identities = 73/312 (23%), Positives = 136/312 (43%), Gaps = 49/312 (15%)

Query: 418 TMIKATARSAPDREREIQSLVKRADFNNDAYVHE----FGLTISNSMMEVRGRVLPPPKL 473
            + + T  +  +R   I+  ++R     +  V E    +G+ + ++ ++++GRVLPP ++
Sbjct: 5   ELAEHTKLNPKERYDTIRQFIQR--IQKNPSVQELLKKWGIELDSNPLKIQGRVLPPEQI 62

Query: 474 QYGGRMSSITAQVNSMSSQGLFASKQQALPNSGVWDMRGKQFYTGVEIRVWAIACFAPQR 533
             G                    +   A       ++R +  Y  V +  W +    P R
Sbjct: 63  IMGNVFVY---------------ANSNADWKR---EIRNQPLYDAVNLNNWVL--IYPSR 102

Query: 534 TVREDSLRSFTQQLQKISNDAGMPIIGQPCFCKYATGPDQVEPMFRYLKSTFQAL-QLVV 592
             RE    SF Q L++++   G+ I   P   K     D++E   R LK  F++  QLVV
Sbjct: 103 DQRE--AESFLQTLKQVAGPMGIQIS-PPKIIKVKD--DRIETYIRALKDAFRSDPQLVV 157

Query: 593 VVLPG-KTPVYAEVKRVGDTVLGMATQCVQAKNVNKTSPQTLSNLCLKI----NVKLGGI 647
           ++LPG K  +Y  +K+       + +Q + ++ + K   + L ++  KI    N KLGGI
Sbjct: 158 IILPGNKKDLYDAIKKFCCVECPVPSQVITSRTLKK--KKNLRSIASKIALQINAKLGGI 215

Query: 648 NSILVP-SIRPKVF-NEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAHPSRYAATVRVQ 705
                P ++    F  +  + +G DV H      K  S+   V S++   +++ +    Q
Sbjct: 216 -----PWTVEIPPFILKNTMIVGIDVYHDTITKKK--SVVGFVASLNKSITKWFSKYISQ 268

Query: 706 -QHRQEIIQELS 716
            + ++EII  L 
Sbjct: 269 VRGQEEIIDSLG 280


>gnl|CDD|219976 pfam08699, DUF1785, Domain of unknown function (DUF1785).  This
           region is found in argonaute proteins and often
           co-occurs with pfam02179 and pfam02171.
          Length = 52

 Score = 89.5 bits (223), Expect = 5e-22
 Identities = 32/53 (60%), Positives = 36/53 (67%), Gaps = 1/53 (1%)

Query: 230 TPVGRSFFSTPDGYYHPLGGGREVWFGFHQSVRPSQWKMMLNIDVSATAFYKS 282
             VGRSFFS   G+   LGGG E W GF QSVRP+Q  + LN+DVS TAFYK 
Sbjct: 1   IAVGRSFFSISFGH-RDLGGGLEAWRGFFQSVRPTQGGLSLNVDVSTTAFYKP 52


>gnl|CDD|198017 smart00949, PAZ, This domain is named PAZ after the proteins Piwi
           Argonaut and Zwille.  This domain is found in two
           families of proteins that are involved in
           post-transcriptional gene silencing. These are the Piwi
           family and the Dicer family, that includes the Carpel
           factory protein. The function of the domains is unknown
           but has been suggested to mediate complex formation
           between proteins of the Piwi and Dicer families by
           hetero-dimerisation. The three-dimensional structure of
           this domain has been solved. The PAZ domain is composed
           of two subdomains. One subdomain is similar to the OB
           fold, albeit with a different topology. The OB-fold is
           well known as a single-stranded nucleic acid binding
           fold. The second subdomain is composed of a beta-hairpin
           followed by an alpha-helix. The PAZ domains shows
           low-affinity nucleic acid binding and appears to
           interact with the 3' ends of single-stranded regions of
           RNA in the cleft between the two subdomains. PAZ can
           bind the characteristic two-base 3' overhangs of siRNAs,
           indicating that although PAZ may not be a primary
           nucleic acid binding site in Dicer or RISC, it may
           contribute to the specific and productive incorporation
           of siRNAs and miRNAs into the RNAi pathway.
          Length = 138

 Score = 54.6 bits (132), Expect = 8e-09
 Identities = 29/145 (20%), Positives = 54/145 (37%), Gaps = 18/145 (12%)

Query: 290 CEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRRPAQMQSF 349
             VLD        +    + + +  K++KGL +   +     + YR+ ++    A   +F
Sbjct: 1   ETVLDFM-RQLPSQGNRSNFQDRCAKDLKGLIVLTRY---NNKTYRIDDIDWNLAPKSTF 56

Query: 350 PLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCL--------QVGQEHKHTYLPLEVCNI 401
                +G   E T  +Y+  KY + +R P+ P L            + +   LP E+C I
Sbjct: 57  EKS--DGS--EITFVEYYKQKYNITIRDPNQPLLVSRPKRRRNQNGKGEPVLLPPELCFI 112

Query: 402 VA-GQRCIKKLTDM-QTSTMIKATA 424
                R  K    M   +   + + 
Sbjct: 113 TGLTDRMRKDFMLMKSIADRTRLSP 137


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 50.3 bits (120), Expect = 2e-06
 Identities = 26/59 (44%), Positives = 26/59 (44%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGR 89
           A AP P AP  PP P APP   P  P A    PP  P  P  PP  P    P  P  GR
Sbjct: 52  AAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGR 110



 Score = 44.9 bits (106), Expect = 9e-05
 Identities = 21/82 (25%), Positives = 25/82 (30%), Gaps = 6/82 (7%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
                A  AP  P  PP     P           PP P  P   P    P  PN      
Sbjct: 48  PSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPP--PPPVDPNAPPPPA 105

Query: 93  PIVLRANH----FQISMPRGFV 110
           P   R ++    F   +P G+V
Sbjct: 106 PEPGRIDNAVGGFSYVVPAGWV 127



 Score = 32.2 bits (73), Expect = 1.1
 Identities = 13/39 (33%), Positives = 14/39 (35%)

Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
          P    A P P  PP +  A   A    PP  PP     P
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAP 72



 Score = 31.0 bits (70), Expect = 2.3
 Identities = 15/33 (45%), Positives = 15/33 (45%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP 63
           A  PPP  P APP P   P A P P    G I 
Sbjct: 80  APPPPPADPNAPPPPPVDPNAPPPPAPEPGRID 112



 Score = 31.0 bits (70), Expect = 2.8
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPP 64
           A  PPP  P APP P   PG +    G    + P
Sbjct: 90  APPPPPVDPNAPPPPAPEPGRIDNAVGGFSYVVP 123


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 43.8 bits (103), Expect = 5e-04
 Identities = 16/76 (21%), Positives = 20/76 (26%)

Query: 18  GNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
             + F  G        P    P + P P   P    APP     +P  +P     P   P
Sbjct: 391 AATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPP 450

Query: 78  MFNCPRRPNLGREGRP 93
               P          P
Sbjct: 451 ERQPPAPATEPAPDDP 466



 Score = 38.4 bits (89), Expect = 0.019
 Identities = 17/63 (26%), Positives = 19/63 (30%), Gaps = 2/63 (3%)

Query: 31   AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
            AG P P  P AP     PP  +  P  A         P P  P + P         L   
Sbjct: 2764 AGPPAPAPPAAPAAG--PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821

Query: 91   GRP 93
              P
Sbjct: 2822 ASP 2824



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 16/63 (25%), Positives = 19/63 (30%), Gaps = 2/63 (3%)

Query: 31   AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
              +P P    AP  P   P      P          P +P   P     + PRR    R 
Sbjct: 2613 PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR--RL 2670

Query: 91   GRP 93
            GR 
Sbjct: 2671 GRA 2673



 Score = 35.7 bits (82), Expect = 0.13
 Identities = 17/42 (40%), Positives = 18/42 (42%)

Query: 33   APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
            A PP  P  PP P APP   P P       P   PP P +P 
Sbjct: 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940



 Score = 34.9 bits (80), Expect = 0.25
 Identities = 15/46 (32%), Positives = 17/46 (36%)

Query: 32   GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
                P  P APP P   P   P P       PP +P  P+ P   P
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949



 Score = 34.9 bits (80), Expect = 0.26
 Identities = 16/60 (26%), Positives = 18/60 (30%), Gaps = 9/60 (15%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPG---------AMGLIPPNQPPQPVQPPELP 77
             + A   P  A GA   P  P  A P             A+   P   P  P  PP  P
Sbjct: 261 VGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAP 320



 Score = 33.8 bits (77), Expect = 0.53
 Identities = 19/51 (37%), Positives = 22/51 (43%), Gaps = 8/51 (15%)

Query: 31   AGAPPPGAPGAPPGP-----VAPPGAVPAPPG-AMGLIP--PNQPPQPVQP 73
            AG PPP  P A P       V PP   P P   A+      P+ PPQ  +P
Sbjct: 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARP 2598



 Score = 33.4 bits (76), Expect = 0.67
 Identities = 13/70 (18%), Positives = 14/70 (20%)

Query: 4    PFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP 63
            P               +             P   A  A PG  A P   P   G     P
Sbjct: 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771

Query: 64   PNQPPQPVQP 73
            P  P      
Sbjct: 2772 PAAPAAGPPR 2781



 Score = 33.4 bits (76), Expect = 0.69
 Identities = 18/54 (33%), Positives = 18/54 (33%), Gaps = 3/54 (5%)

Query: 23   TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPG--AMGLIPPNQPPQPVQPP 74
            T G  T  A  PPP     P        A P PPG  A     P  P  P  P 
Sbjct: 2691 TVGSLTSLADPPPPPPTPEPAPHALVS-ATPLPPGPAAARQASPALPAAPAPPA 2743



 Score = 33.0 bits (75), Expect = 0.96
 Identities = 15/44 (34%), Positives = 15/44 (34%)

Query: 34   PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
            P        P    PP   P PP      PP  PP   QPP  P
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945



 Score = 33.0 bits (75), Expect = 0.99
 Identities = 21/63 (33%), Positives = 25/63 (39%), Gaps = 7/63 (11%)

Query: 35   PPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRPI 94
             P  P  PP P  P G   AP G +   PP++ P             P RP + R  RP 
Sbjct: 2838 APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA-------APARPPVRRLARPA 2890

Query: 95   VLR 97
            V R
Sbjct: 2891 VSR 2893



 Score = 32.6 bits (74), Expect = 1.1
 Identities = 17/43 (39%), Positives = 17/43 (39%), Gaps = 4/43 (9%)

Query: 33   APPPGAPG--APPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
            AP         PPGP A   A PA P A    PP  P  P  P
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAP--APPAVPAGPATP 2751



 Score = 32.6 bits (74), Expect = 1.3
 Identities = 15/49 (30%), Positives = 15/49 (30%), Gaps = 5/49 (10%)

Query: 31   AGAPPPGAPGAPPGPVAP-----PGAVPAPPGAMGLIPPNQPPQPVQPP 74
            A    P AP  P  P  P     P     PP   G   P  P  P   P
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779



 Score = 32.6 bits (74), Expect = 1.3
 Identities = 23/69 (33%), Positives = 24/69 (34%), Gaps = 15/69 (21%)

Query: 34  PPPGAPGAPPGPVAPP-------GAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
           P P    APP P  P           PAPP      P  QPP P   P     +   R  
Sbjct: 420 PTPVPASAPPPPATPLPSAEPGSDDGPAPP------PERQPPAPATEPAPDDPDDATRKA 473

Query: 87  LG--REGRP 93
           L   RE RP
Sbjct: 474 LDALRERRP 482



 Score = 32.2 bits (73), Expect = 1.5
 Identities = 23/79 (29%), Positives = 26/79 (32%), Gaps = 21/79 (26%)

Query: 31   AGAPPPGAP--------------GAPPGPVAPPGAVPAPPGAM-------GLIPPNQPPQ 69
            A  PP                   +P  P  PP AV AP  A+       G +PP    Q
Sbjct: 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835

Query: 70   PVQPPELPMFNCPRRPNLG 88
            P  PP  P    P  P  G
Sbjct: 2836 PTAPPPPPGPPPPSLPLGG 2854



 Score = 31.8 bits (72), Expect = 1.8
 Identities = 15/63 (23%), Positives = 19/63 (30%)

Query: 31   AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
            + A  P AP  PP       AV     +  L P      P      P    P+ P   + 
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929

Query: 91   GRP 93
              P
Sbjct: 2930 QPP 2932



 Score = 31.8 bits (72), Expect = 2.0
 Identities = 19/69 (27%), Positives = 23/69 (33%), Gaps = 7/69 (10%)

Query: 32   GAPP----PGAPGAPPGP---VAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
             APP    P AP    G     APP  +P    A    PP+  P   +P   P    P  
Sbjct: 2590 DAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649

Query: 85   PNLGREGRP 93
                 +  P
Sbjct: 2650 ERPRDDPAP 2658



 Score = 31.8 bits (72), Expect = 2.3
 Identities = 15/61 (24%), Positives = 18/61 (29%), Gaps = 12/61 (19%)

Query: 39   PGAPPG------PVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
            P APP       PV   G    P       PP+  P     P+ P  +     N      
Sbjct: 2589 PDAPPQSARPRAPVDDRGDPRGPA------PPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642

Query: 93   P 93
            P
Sbjct: 2643 P 2643



 Score = 31.4 bits (71), Expect = 2.6
 Identities = 20/61 (32%), Positives = 21/61 (34%), Gaps = 2/61 (3%)

Query: 31   AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
               P P  P  PP  V  P    A P A     P  PP   QP   P    P  P+L   
Sbjct: 2796 ESLPSPWDPADPPAAVLAPA--AALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853

Query: 91   G 91
            G
Sbjct: 2854 G 2854



 Score = 30.7 bits (69), Expect = 4.5
 Identities = 17/56 (30%), Positives = 19/56 (33%), Gaps = 1/56 (1%)

Query: 31   AGAPPPGAPGAPPGPVAPPGAV-PAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
            AG   PG P  P  P    G   PAPP A    PP +  +P              P
Sbjct: 2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801



 Score = 30.7 bits (69), Expect = 4.7
 Identities = 13/61 (21%), Positives = 16/61 (26%), Gaps = 17/61 (27%)

Query: 34   PPPGAPGAPPGPVAPPGAVPAPPGAMG-----------------LIPPNQPPQPVQPPEL 76
            PPP  P  P  P   P     P GA+                   +P   P +       
Sbjct: 2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASST 2993

Query: 77   P 77
            P
Sbjct: 2994 P 2994



 Score = 30.7 bits (69), Expect = 4.9
 Identities = 17/78 (21%), Positives = 21/78 (26%), Gaps = 5/78 (6%)

Query: 33   APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
            A P  +       + P      P           PPQP   P  P    P  P   R   
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQP-----QAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941

Query: 93   PIVLRANHFQISMPRGFV 110
            P+    +      P G V
Sbjct: 2942 PLAPTTDPAGAGEPSGAV 2959


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 38.5 bits (90), Expect = 0.002
 Identities = 19/44 (43%), Positives = 21/44 (47%), Gaps = 5/44 (11%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           PPP  P APP    PP   PA P +     P  PP P + P LP
Sbjct: 64  PPPSEPAAPPPDPEPPVPGPAGPPS-----PLAPPAPARKPPLP 102



 Score = 35.8 bits (83), Expect = 0.016
 Identities = 16/56 (28%), Positives = 21/56 (37%), Gaps = 4/56 (7%)

Query: 26 GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLI----PPNQPPQPVQPPELP 77
          G + C   P    P  PP    P   V   P  +  +    PP++P  P   PE P
Sbjct: 24 GFSSCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPP 79



 Score = 32.7 bits (75), Expect = 0.23
 Identities = 14/37 (37%), Positives = 15/37 (40%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQ 69
           APPP      PGP  PP  +  P  A     P   PQ
Sbjct: 71  APPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRPQ 107



 Score = 31.2 bits (71), Expect = 0.60
 Identities = 13/51 (25%), Positives = 15/51 (29%)

Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
          G         PP        P     A P P     +  P  PP P+ PP 
Sbjct: 44 GDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPA 94



 Score = 29.6 bits (67), Expect = 2.1
 Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 2/55 (3%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
           G P P     PP        V  PP      PP+  P    P   P    P  P
Sbjct: 43 VGDPRPPVVDTPPP--VSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAP 95



 Score = 28.1 bits (63), Expect = 7.1
 Identities = 11/26 (42%), Positives = 12/26 (46%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPP 56
              PP   P  PP P+APP     PP
Sbjct: 75  DPEPPVPGPAGPPSPLAPPAPARKPP 100



 Score = 28.1 bits (63), Expect = 8.3
 Identities = 10/25 (40%), Positives = 11/25 (44%)

Query: 32  GAPPPGAPGAPPGPVAPPGAVPAPP 56
             P P  P +P  P AP    P PP
Sbjct: 79  PVPGPAGPPSPLAPPAPARKPPLPP 103


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 41.1 bits (97), Expect = 0.002
 Identities = 14/55 (25%), Positives = 14/55 (25%), Gaps = 4/55 (7%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAV----PAPPGAMGLIPPNQPPQPVQPPE 75
            G      A  P A  A P     P A      A P       P   P P   P 
Sbjct: 388 AGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPP 442



 Score = 41.1 bits (97), Expect = 0.003
 Identities = 15/45 (33%), Positives = 15/45 (33%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
              P P  P  PP P   P   P PP       P   P P  P E
Sbjct: 737 DPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781



 Score = 36.9 bits (86), Expect = 0.054
 Identities = 15/48 (31%), Positives = 15/48 (31%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
             A    AP A P P   P   PAPP   G  P    P P        
Sbjct: 417 PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464



 Score = 35.3 bits (82), Expect = 0.15
 Identities = 15/61 (24%), Positives = 15/61 (24%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
                 A A  P    A P   A P    AP  A    P   PP P           P  
Sbjct: 399 PSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458

Query: 85  P 85
            
Sbjct: 459 A 459



 Score = 34.2 bits (79), Expect = 0.31
 Identities = 16/44 (36%), Positives = 16/44 (36%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           A  PP  AP AP  P     A PAP  A             QPP
Sbjct: 679 AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP 722



 Score = 34.2 bits (79), Expect = 0.40
 Identities = 16/47 (34%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A   P  AP   P P AP  A P+P G         PP    P   P
Sbjct: 421 AAPAPAAAPQPAPAP-APAPAPPSPAGNAPAGGAPSPPPAAAPSAQP 466



 Score = 33.4 bits (77), Expect = 0.58
 Identities = 14/45 (31%), Positives = 16/45 (35%), Gaps = 4/45 (8%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
            PP   GAP  P  PP   PA   A        P  P +  E+  
Sbjct: 747 DPPDPAGAPAQPPPPPAPAPAAAPA----AAPPPSPPSEEEEMAE 787



 Score = 33.4 bits (77), Expect = 0.64
 Identities = 9/53 (16%), Positives = 10/53 (18%), Gaps = 2/53 (3%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP--PQPVQPPE 75
                     P  A  +           PA         P     PQP   P 
Sbjct: 384 RLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA 436



 Score = 33.0 bits (76), Expect = 0.71
 Identities = 15/50 (30%), Positives = 15/50 (30%), Gaps = 6/50 (12%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
            G    AG   P AP   P P AP     A P          P     PP
Sbjct: 667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP------AQPAPAPAATPP 710



 Score = 32.7 bits (75), Expect = 0.99
 Identities = 13/47 (27%), Positives = 13/47 (27%), Gaps = 1/47 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
             A P   P   P     P A PAP        P   P     P  P
Sbjct: 458 PAAAPSAQPAPAPAAAPEPTAAPAPAPPAA-PAPAAAPAAPAAPAAP 503



 Score = 32.7 bits (75), Expect = 1.0
 Identities = 16/48 (33%), Positives = 16/48 (33%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
            GAP P    AP    AP  A    P A     P   P P   P  P 
Sbjct: 451 GGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498



 Score = 32.7 bits (75), Expect = 1.1
 Identities = 12/44 (27%), Positives = 13/44 (29%), Gaps = 1/44 (2%)

Query: 33  APPPGAPGAP-PGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
           AP      AP P P   P +      A G   P     P   P 
Sbjct: 424 APAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467



 Score = 31.5 bits (72), Expect = 2.3
 Identities = 13/43 (30%), Positives = 13/43 (30%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
                 P AP  P AP    PA   A        P   V  PE
Sbjct: 609 PEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPE 651



 Score = 31.5 bits (72), Expect = 2.6
 Identities = 12/45 (26%), Positives = 14/45 (31%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
             AP P AP AP G      A             +   QP Q  +
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQ 726



 Score = 31.1 bits (71), Expect = 2.8
 Identities = 13/47 (27%), Positives = 14/47 (29%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A A    +P     P A P   PA        P   PP    P   P
Sbjct: 448 APAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAP 494



 Score = 31.1 bits (71), Expect = 2.9
 Identities = 11/54 (20%), Positives = 11/54 (20%), Gaps = 5/54 (9%)

Query: 22  FTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
                      A   G P           A PA P A     P  P  P     
Sbjct: 584 VEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAA-----PAAPAAPAPAGA 632



 Score = 31.1 bits (71), Expect = 3.1
 Identities = 19/54 (35%), Positives = 19/54 (35%), Gaps = 1/54 (1%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP-PQPVQPPELPMFNCPRRP 85
           AP P    APP P     A  AP       P  QP P P   PE      P  P
Sbjct: 432 APAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPP 485



 Score = 31.1 bits (71), Expect = 3.2
 Identities = 13/45 (28%), Positives = 14/45 (31%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
               PP   G  P   AP     A P A     P   P+P   P 
Sbjct: 437 PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481



 Score = 30.3 bits (69), Expect = 5.1
 Identities = 14/49 (28%), Positives = 14/49 (28%), Gaps = 4/49 (8%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
                 A A P GA  A P P                 P  QPPQ  Q 
Sbjct: 683 PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD----PAAQPPQAAQG 727



 Score = 30.0 bits (68), Expect = 6.6
 Identities = 13/55 (23%), Positives = 14/55 (25%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
           A   P  AP   P P +P G  PA              QP   P           
Sbjct: 427 AAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481



 Score = 30.0 bits (68), Expect = 6.9
 Identities = 12/47 (25%), Positives = 12/47 (25%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A AP P    AP      P       GA    P   P     P    
Sbjct: 426 AAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAA 472



 Score = 30.0 bits (68), Expect = 7.2
 Identities = 14/51 (27%), Positives = 19/51 (37%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
           P P    A P P   P    AP  A    PP++  +  +     M +  RR
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRR 799



 Score = 29.6 bits (67), Expect = 8.1
 Identities = 13/28 (46%), Positives = 13/28 (46%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMG 60
           AP P A  AP    A P A  AP GA  
Sbjct: 481 APAPPAAPAPAAAPAAPAAPAAPAGADD 508


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 35.5 bits (82), Expect = 0.019
 Identities = 17/44 (38%), Positives = 17/44 (38%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           P  G   APP    PP   PAPPG     PP        P E P
Sbjct: 60  PAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPP 103



 Score = 32.4 bits (74), Expect = 0.26
 Identities = 15/45 (33%), Positives = 16/45 (35%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A PP     PP   APPG  P PP        +    P   P  P
Sbjct: 66  AAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPP 110


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 38.0 bits (89), Expect = 0.019
 Identities = 26/140 (18%), Positives = 39/140 (27%), Gaps = 9/140 (6%)

Query: 30  CAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP-------PELPMFNCP 82
            A A     P A P  V+ P A  A P    +    +     +        P  P    P
Sbjct: 57  AAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAP 116

Query: 83  RRPNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQPDKCPRKVNREIIETMVHAYSKIF 142
             P           R      + PR  V         D    ++ R + +T++     + 
Sbjct: 117 AAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVKARIERIVNDTVMQELRSLR 176

Query: 143 GALKPVFDG--RNNLYTRDP 160
           G L+             RDP
Sbjct: 177 GMLEEQLASLAWGERQRRDP 196



 Score = 31.1 bits (71), Expect = 2.7
 Identities = 15/64 (23%), Positives = 18/64 (28%), Gaps = 2/64 (3%)

Query: 36  PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRPIV 95
           P A  AP     PP A PA             P  V+  +       R   + R   P  
Sbjct: 53  PPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLT--AQREQLVARAAAPAA 110

Query: 96  LRAN 99
             A 
Sbjct: 111 PEAQ 114


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 38.2 bits (89), Expect = 0.020
 Identities = 14/45 (31%), Positives = 15/45 (33%)

Query: 32  GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL 76
              PPG  G  P P   P A    P    LI     P   +PP  
Sbjct: 700 PGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGA 744



 Score = 29.4 bits (66), Expect = 9.6
 Identities = 16/62 (25%), Positives = 19/62 (30%), Gaps = 3/62 (4%)

Query: 35  PPGAPGAPPGPVAPPGAVPA---PPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREG 91
             G  G P  P   P  +PA    P  +    P +P         P    P  P  G  G
Sbjct: 650 RGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGG 709

Query: 92  RP 93
            P
Sbjct: 710 LP 711


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 38.1 bits (88), Expect = 0.024
 Identities = 18/54 (33%), Positives = 19/54 (35%), Gaps = 4/54 (7%)

Query: 28  TKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP----PQPVQPPELP 77
                 PP  APG    P A PG    P  A G   P       P P  PP+ P
Sbjct: 730 APGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAP 783



 Score = 37.7 bits (87), Expect = 0.033
 Identities = 19/54 (35%), Positives = 21/54 (38%), Gaps = 4/54 (7%)

Query: 28  TKCAGAPPPGAPGAPPGPVAPPGA-VPAPPGAMGLIPPNQP---PQPVQPPELP 77
                 PP  APG    P A PGA  P PP      P  +P   P P  PP+  
Sbjct: 750 APGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803



 Score = 37.4 bits (86), Expect = 0.035
 Identities = 22/62 (35%), Positives = 23/62 (37%), Gaps = 2/62 (3%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-PPQPVQPPE-LPMFNCPRRPNLGREG 91
           PPP AP     P APPG    P  A G   P    P   +PP   P    P     GR  
Sbjct: 696 PPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 755

Query: 92  RP 93
            P
Sbjct: 756 PP 757



 Score = 35.8 bits (82), Expect = 0.12
 Identities = 18/61 (29%), Positives = 19/61 (31%), Gaps = 3/61 (4%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-PPQPVQPPELPMFNCPRRPNLGREGR 92
            P  A G    P A PG    P  A G   P    P   +PP         RP     G 
Sbjct: 716 RPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA--PGRARPPAAAPGA 773

Query: 93  P 93
           P
Sbjct: 774 P 774



 Score = 32.3 bits (73), Expect = 1.4
 Identities = 29/89 (32%), Positives = 32/89 (35%), Gaps = 18/89 (20%)

Query: 23  TCGGQTKCAGAPPP-----GAPGAP---PGPVAPPGAVPAPPGAMGLIPPNQ-PPQPVQ- 72
             G     A AP        APGAP   P P APP     P GA    PP Q  P  +Q 
Sbjct: 750 APGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQL 809

Query: 73  -PPELPMFNCPRRPNLG-------REGRP 93
            P   P    P +  L        + GRP
Sbjct: 810 MPRAAPGQQGPTKQILRQLLTGGVKRGRP 838


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 37.8 bits (88), Expect = 0.024
 Identities = 20/94 (21%), Positives = 30/94 (31%), Gaps = 2/94 (2%)

Query: 26  GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
            Q+  A  P        P    P  +V  PP A+ + PP+  PQ V+P +          
Sbjct: 403 SQSSAAAQPSAPQSATQPAGTPPTVSVD-PPAAVPVNPPSTAPQAVRPAQFKEEKKIPVS 461

Query: 86  NLGREGRPIVLRANHFQISMPRGFVHHYEIQIQP 119
            +   G    LR    +     G +       Q 
Sbjct: 462 KVSSLGPS-TLRPIQEKAEQATGNIKEAPTGTQK 494


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 37.6 bits (88), Expect = 0.031
 Identities = 23/93 (24%), Positives = 27/93 (29%), Gaps = 15/93 (16%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN---- 86
            GAP   A  A PG  A   A   P  A+  +P         P          R N    
Sbjct: 405 GGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVRLNSFED 464

Query: 87  ---LGREGRPIVLRANHFQISMPRGFVH--HYE 114
              L  E R I L+A           V    +E
Sbjct: 465 IVALAEEKRDIKLKA------AVERDVRLVRFE 491



 Score = 32.2 bits (74), Expect = 1.5
 Identities = 11/63 (17%), Positives = 13/63 (20%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
             +P  G  G P G  AP     A         P           +P             
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPA 450

Query: 91  GRP 93
             P
Sbjct: 451 AAP 453



 Score = 31.0 bits (71), Expect = 3.2
 Identities = 7/46 (15%), Positives = 7/46 (15%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL 76
              PP    G    P            A        P        L
Sbjct: 388 QEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAAL 433



 Score = 30.6 bits (70), Expect = 4.0
 Identities = 13/53 (24%), Positives = 15/53 (28%), Gaps = 1/53 (1%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
                 A A P  A  AP     P  A+ A P A         P     P + 
Sbjct: 407 APGAPAAAAAPGAAAAAPAAG-GPAAALAAVPDAAAAAAAPPAPAAAPQPAVR 458


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 37.4 bits (87), Expect = 0.035
 Identities = 14/47 (29%), Positives = 14/47 (29%)

Query: 28  TKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
            K    P   AP A P   A     PA   A     P  PP    P 
Sbjct: 378 KKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPA 424



 Score = 35.5 bits (82), Expect = 0.14
 Identities = 16/47 (34%), Positives = 16/47 (34%), Gaps = 2/47 (4%)

Query: 33  APPPGA--PGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A P  A    A P   APP  V AP  A     P   P  V     P
Sbjct: 405 AAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAP 451



 Score = 34.7 bits (80), Expect = 0.21
 Identities = 14/50 (28%), Positives = 15/50 (30%)

Query: 26  GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
                  APP  AP AP    A      AP  A   +     P     PE
Sbjct: 409 AAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPE 458



 Score = 32.0 bits (73), Expect = 1.5
 Identities = 16/53 (30%), Positives = 20/53 (37%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
           A  P AP A P   AP  A  A P A+ L P        +   +P+   P   
Sbjct: 419 AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPA 471



 Score = 32.0 bits (73), Expect = 1.5
 Identities = 14/53 (26%), Positives = 14/53 (26%), Gaps = 5/53 (9%)

Query: 31  AGAPPPG-----APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
           A A  P      A      P A P A  A   A    P      P  PP    
Sbjct: 370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAP 422



 Score = 31.2 bits (71), Expect = 3.1
 Identities = 16/45 (35%), Positives = 17/45 (37%), Gaps = 3/45 (6%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
             A P     A P P A P A  + P A    P   PP PV  P 
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAA---PPAAAPPAPVAAPA 430


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 37.4 bits (87), Expect = 0.035
 Identities = 32/116 (27%), Positives = 39/116 (33%), Gaps = 22/116 (18%)

Query: 23  TCGGQTKCAGAPPPGAPGAPPGPVAPP--GAVPAPPGAMGLIPPNQPPQPVQPPELPMFN 80
                +   GAP P AP APP   APP   A PA P A    P   PP    PP      
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADP 440

Query: 81  CPR-----------------RPNLG---REGRPIVLRANHFQISMPRGFVHHYEIQ 116
                               +P LG    +G P+ + A   +I  P G      +Q
Sbjct: 441 AAAASAGDRWRAFVAFVKGKKPALGASLEQGSPLGVSAGLLEIGFPEGSFELSAMQ 496


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 36.8 bits (85), Expect = 0.054
 Identities = 21/75 (28%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 4   PFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP 63
             P  P  S+ L    S        + +  P    P  P  P APP ++PAPP    +  
Sbjct: 193 SLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPS-PAQPSAPPASIPAPPIPPVIQY 251

Query: 64  PNQPPQPVQPPELPM 78
              PP P   P +P+
Sbjct: 252 VAPPPVPPPQPIIPI 266



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 6/51 (11%)

Query: 35  PPGAPGAPP-GPVAPPGAVPAPPGAMGLIPPNQ-----PPQPVQPPELPMF 79
           PP +  APP  PV    A P  P    +IP           P  P ++P++
Sbjct: 236 PPASIPAPPIPPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPANPRDIPLW 286


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 36.8 bits (85), Expect = 0.059
 Identities = 13/47 (27%), Positives = 14/47 (29%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A  P    P       A   A  AP  A      + PP    PPE  
Sbjct: 461 AARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507



 Score = 34.5 bits (79), Expect = 0.29
 Identities = 12/44 (27%), Positives = 12/44 (27%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
               P  AP A P   A P A    P A           P   P
Sbjct: 446 GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAP 489



 Score = 34.1 bits (78), Expect = 0.42
 Identities = 18/57 (31%), Positives = 19/57 (33%), Gaps = 10/57 (17%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP-------PQPVQPP---ELP 77
           A AP P    AP     P  A P P  A     P +        P    PP   ELP
Sbjct: 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP 503



 Score = 33.3 bits (76), Expect = 0.68
 Identities = 14/50 (28%), Positives = 14/50 (28%), Gaps = 3/50 (6%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAP---PGAMGLIPPNQPPQPVQPPELP 77
           A       PG  P P   P A PA    P A G  P             P
Sbjct: 436 ARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485



 Score = 32.9 bits (75), Expect = 0.92
 Identities = 19/67 (28%), Positives = 21/67 (31%), Gaps = 2/67 (2%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
           Q   A A P  A  AP  P A P A  AP  A         P    P    +    +   
Sbjct: 384 QPAPAAAAPAAAAPAPAAPPAAPAA--APAAAAAARAVAAAPARRSPAPEALAAARQASA 441

Query: 87  LGREGRP 93
            G  G P
Sbjct: 442 RGPGGAP 448



 Score = 31.0 bits (70), Expect = 3.0
 Identities = 13/68 (19%), Positives = 16/68 (23%), Gaps = 1/68 (1%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIP-PNQPPQPVQPPELPMFNCPRRPNLGREG 91
            P     G     +  P           L P P   P P           PR P     G
Sbjct: 513 QPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572

Query: 92  RPIVLRAN 99
            P +   +
Sbjct: 573 LPDMFDGD 580



 Score = 31.0 bits (70), Expect = 3.2
 Identities = 12/48 (25%), Positives = 13/48 (27%), Gaps = 1/48 (2%)

Query: 31  AGAPPPGA-PGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
             +P P A   A       PG  PAP  A    P          P   
Sbjct: 425 RRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPV 472



 Score = 31.0 bits (70), Expect = 3.3
 Identities = 10/48 (20%), Positives = 11/48 (22%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           Q    G     AP   P       A PA  G   +            P
Sbjct: 438 QASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485



 Score = 29.5 bits (66), Expect = 9.6
 Identities = 16/57 (28%), Positives = 16/57 (28%), Gaps = 1/57 (1%)

Query: 22  FTCGGQTKCAGAPPP-GAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           F  G     AG      AP A P P A   A  AP  A     P   P         
Sbjct: 363 FRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAV 419


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 36.7 bits (85), Expect = 0.059
 Identities = 18/61 (29%), Positives = 22/61 (36%), Gaps = 3/61 (4%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRP 93
           PP   P   PG      A   PP    L+P  Q PQ  Q    P F   +   L ++ R 
Sbjct: 277 PPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRG--PQFR-EQLVQLSQQQRE 333

Query: 94  I 94
            
Sbjct: 334 A 334



 Score = 35.5 bits (82), Expect = 0.13
 Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 9/55 (16%)

Query: 32  GAPPPGAPGAPPGPVAPPGAVPAPPGAMGL---------IPPNQPPQPVQPPELP 77
           G PP  A     GP   P   P PP               P   P QP  PP+LP
Sbjct: 183 GMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLP 237



 Score = 33.2 bits (76), Expect = 0.68
 Identities = 20/113 (17%), Positives = 31/113 (27%), Gaps = 7/113 (6%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP----PQPVQP--PELPMFNCPRR 84
               P  +   PP P  PP     PP      PP       P   Q     LP    P+ 
Sbjct: 245 QPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQL 304

Query: 85  PNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQPDKCPRKVNREIIETMVHA 137
             L ++ +         +  +        E   Q +    K   +I+    + 
Sbjct: 305 LPLVQQPQGQQRGPQFREQLVQLSQQQR-EALSQEEAKRAKRRHKIVSLSKYN 356



 Score = 32.4 bits (74), Expect = 1.4
 Identities = 12/42 (28%), Positives = 13/42 (30%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           AP       P  P  P    P        +    PP P QPP
Sbjct: 222 APSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPP 263



 Score = 31.7 bits (72), Expect = 2.4
 Identities = 20/58 (34%), Positives = 23/58 (39%), Gaps = 4/58 (6%)

Query: 31  AGAPPPGAPGAPPGP---VAPPGAVPAPPGAMGLIP-PNQPPQPVQPPELPMFNCPRR 84
               PPG P  P G    V P   +PAP  A    P P Q PQ   P + P F    +
Sbjct: 196 PPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQ 253



 Score = 31.3 bits (71), Expect = 2.5
 Identities = 13/63 (20%), Positives = 16/63 (25%), Gaps = 2/63 (3%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL-PMFNCPRRPNLGREG 91
             P       P  + P  A     G     PP  P  P   PE          P+     
Sbjct: 171 QLPQPPQQVLPQGMPPRQAAFPQQGPPE-QPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQ 229

Query: 92  RPI 94
            P+
Sbjct: 230 PPL 232



 Score = 30.9 bits (70), Expect = 3.6
 Identities = 15/95 (15%), Positives = 20/95 (21%), Gaps = 8/95 (8%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQ--PVQPPELPMFNCPRR 84
           Q +     P      PP  V P G  P         PP QPP                + 
Sbjct: 162 QLQQRQQAP--QLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQF 219

Query: 85  PNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQP 119
                               +P+      + Q   
Sbjct: 220 LP----APSQAPAQPPLPPQLPQQPPPLQQPQFPG 250



 Score = 29.7 bits (67), Expect = 8.2
 Identities = 13/55 (23%), Positives = 17/55 (30%), Gaps = 2/55 (3%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ--PPQPVQPPELPMFNCPRRPN 86
             P  P   P    P      P  +  + PP    P Q  QPP+      P+   
Sbjct: 228 AQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP 282


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 36.0 bits (83), Expect = 0.086
 Identities = 23/67 (34%), Positives = 24/67 (35%), Gaps = 11/67 (16%)

Query: 19  NSIFTCGGQTKCAGAPPPGAPGAP-PGPVA---PP---GAVPAP----PGAMGLIPPNQP 67
           N +   G       A PPG  G P PG      PP    AVP P    PG     PP   
Sbjct: 126 NPVTATGEPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQVT 185

Query: 68  PQPVQPP 74
            Q V  P
Sbjct: 186 YQSVPVP 192


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 36.0 bits (83), Expect = 0.10
 Identities = 25/120 (20%), Positives = 33/120 (27%), Gaps = 17/120 (14%)

Query: 2   AIPFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGL 61
           A P P   D++  LN              A A   GAP     P       P P      
Sbjct: 354 APPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPAR---- 409

Query: 62  IPPNQPPQPVQPPELPMFNCPRRPNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQPDK 121
               + P  V P   P     R  +        VL A   Q+   R F+      ++   
Sbjct: 410 --SAEAPPLVAPAAAPAGLALRWRD--------VLAALKMQL---RAFLREARPHVEEGY 456


>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
          Length = 94

 Score = 33.2 bits (75), Expect = 0.12
 Identities = 17/40 (42%), Positives = 18/40 (45%), Gaps = 7/40 (17%)

Query: 37 GAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL 76
           AP  PP P AP    P PP      PP  PP PV  P +
Sbjct: 5  RAP-KPPEPPAPLPPAPVPP------PPPAPPAPVPEPTV 37



 Score = 28.6 bits (63), Expect = 3.7
 Identities = 12/26 (46%), Positives = 12/26 (46%)

Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGA 58
          AP P  P APP PV  P   P    A
Sbjct: 19 APVPPPPPAPPAPVPEPTVKPVNAEA 44



 Score = 28.6 bits (63), Expect = 3.7
 Identities = 14/23 (60%), Positives = 14/23 (60%), Gaps = 1/23 (4%)

Query: 33 APPPGAPGAPPGPVAPPGAVPAP 55
          AP P AP  PP P APP  VP P
Sbjct: 14 APLPPAP-VPPPPPAPPAPVPEP 35


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
          modification].
          Length = 2365

 Score = 35.7 bits (82), Expect = 0.14
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 48 PPGAVPAPPGAMGLIPPNQPPQPVQP 73
          PPG  P PP   G  PP+QPP P  P
Sbjct: 5  PPGNPPPPPPPPGFEPPSQPPPPPPP 30


>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597).  This
          family of proteins is found in bacteria, eukaryotes and
          viruses. Proteins in this family are typically between
          126 and 281 amino acids in length. The function of this
          domain is unknown. The structure of this domain has
          been found to contain five helices with a long flexible
          loop between helices one and two.
          Length = 124

 Score = 33.5 bits (77), Expect = 0.15
 Identities = 12/40 (30%), Positives = 14/40 (35%), Gaps = 5/40 (12%)

Query: 19 NSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGA 58
          N+IF      + A A P  A      P A   A PA    
Sbjct: 7  NAIFG-----EAAAAAPAPAAAPATAPAAAAAAAPAATPP 41



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 10/34 (29%), Positives = 10/34 (29%)

Query: 38 APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPV 71
          A  A P P A P   PA   A            V
Sbjct: 13 AAAAAPAPAAAPATAPAAAAAAAPAATPPAAASV 46



 Score = 29.2 bits (66), Expect = 4.1
 Identities = 6/25 (24%), Positives = 6/25 (24%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAP 55
          A AP      A     A      A 
Sbjct: 21 AAAPATAPAAAAAAAPAATPPAAAS 45


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 35.6 bits (83), Expect = 0.15
 Identities = 14/49 (28%), Positives = 14/49 (28%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
               A A P  AP A   P APP    A   A     P        P  
Sbjct: 62  PAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAA 110



 Score = 34.1 bits (79), Expect = 0.35
 Identities = 13/45 (28%), Positives = 14/45 (31%), Gaps = 1/45 (2%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
          A A    A      P A   A PAP       P   PP+P     
Sbjct: 48 AAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA-APPKPAAAAA 91



 Score = 33.7 bits (78), Expect = 0.55
 Identities = 13/45 (28%), Positives = 13/45 (28%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
              P   AP APP P A   A  AP           P       E
Sbjct: 72  PAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116



 Score = 33.3 bits (77), Expect = 0.71
 Identities = 11/47 (23%), Positives = 11/47 (23%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
             A P     A P P  P  A PA P                P    
Sbjct: 58  PAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAA 104



 Score = 32.9 bits (76), Expect = 0.91
 Identities = 12/50 (24%), Positives = 12/50 (24%)

Query: 28  TKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
              A A    A  AP  P A   A P  P A           P       
Sbjct: 58  PAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107



 Score = 32.9 bits (76), Expect = 1.0
 Identities = 12/44 (27%), Positives = 12/44 (27%), Gaps = 4/44 (9%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
            A    A  A     AP    PA P       P     P  PP
Sbjct: 45 TAAAAAAAAAASAPAAAPAAKAPAAPAP----APPAAAAPAAPP 84



 Score = 31.8 bits (73), Expect = 1.9
 Identities = 12/51 (23%), Positives = 13/51 (25%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           +   A AP P A  AP  P  P  A  A               P       
Sbjct: 65  KAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115



 Score = 31.0 bits (71), Expect = 4.0
 Identities = 12/46 (26%), Positives = 13/46 (28%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQ 72
                 A  P AP  P    A   A  APP A     P       +
Sbjct: 71  APAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116



 Score = 29.9 bits (68), Expect = 8.5
 Identities = 11/39 (28%), Positives = 13/39 (33%)

Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
          PG+  AP    A   A  + P A         P P  P 
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPA 76


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 33.6 bits (77), Expect = 0.16
 Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 31  AGAPPPGAPGAPP--GPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           A +P P  P  PP   P  PP    A P      PP+ PPQP + P
Sbjct: 96  AQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141



 Score = 33.3 bits (76), Expect = 0.22
 Identities = 18/50 (36%), Positives = 19/50 (38%), Gaps = 5/50 (10%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-----PPQPVQPPELP 77
            PP     AP  P  PP + PA P A     P       P  P QPP  P
Sbjct: 92  TPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141



 Score = 32.9 bits (75), Expect = 0.25
 Identities = 16/44 (36%), Positives = 17/44 (38%), Gaps = 3/44 (6%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           P P  P  PP   +P  AVP PP       P  PP P      P
Sbjct: 85  PAPPEPVTPPTAQSPAPAVPTPPPTST---PAVPPAPAAAVPAP 125


>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP).  This entry
          represents proteins encoded by genes which are always
          found in type III secretion operons, although their
          function in the processes of secretion and virulence is
          unclear. Hpa stands for Hrp-associated gene, where Hrp
          stands for hypersensitivity response and virulence. see
          also PMID:18584024.
          Length = 185

 Score = 34.0 bits (78), Expect = 0.17
 Identities = 14/43 (32%), Positives = 17/43 (39%)

Query: 30 CAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQ 72
             APP  A  A P     P   PAPP        ++ PQP+ 
Sbjct: 30 GTPAPPGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPLA 72


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 34.7 bits (80), Expect = 0.18
 Identities = 14/42 (33%), Positives = 19/42 (45%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
             PP +P +P  P  PPG    PP  +  +PP    +P  P 
Sbjct: 222 DTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPS 263



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 17/60 (28%), Positives = 18/60 (30%), Gaps = 5/60 (8%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAV-----PAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
             +  PG P  P  P  P         PAP       PP  P  P  P   P    P  P
Sbjct: 187 PPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPP 246



 Score = 30.8 bits (70), Expect = 2.6
 Identities = 9/56 (16%), Positives = 10/56 (17%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
           A          P      P     P         + PP P           P  P 
Sbjct: 177 ADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT 232


>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
          Members of this family belong to the collagen
          superfamily. Collagens are generally extracellular
          structural proteins involved in formation of connective
          tissue structure. The alignment contains 20 copies of
          the G-X-Y repeat that forms a triple helix. The first
          position of the repeat is glycine, the second and third
          positions can be any residue but are frequently proline
          and hydroxyproline. Collagens are post translationally
          modified by proline hydroxylase to form the
          hydroxyproline residues. Defective hydroxylation is the
          cause of scurvy. Some members of the collagen
          superfamily are not involved in connective tissue
          structure but share the same triple helical structure.
          Length = 60

 Score = 31.3 bits (72), Expect = 0.21
 Identities = 26/50 (52%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 31 AGAP-PPGAPGAPPGPVAPPGAV--PAPPGAMGLIPPNQPPQPVQPPELP 77
           G P PPG PG PPGP  PPG    P PPG  G   P  PP P  PP  P
Sbjct: 3  PGPPGPPGPPG-PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 51



 Score = 27.5 bits (62), Expect = 5.3
 Identities = 17/29 (58%), Positives = 17/29 (58%), Gaps = 4/29 (13%)

Query: 31 AGAP-PPGAPGAP--PGPVAPPGAVPAPP 56
           G P PPG PG P  PGP   PGA P PP
Sbjct: 33 PGPPGPPGPPGPPGPPGPPGAPGA-PGPP 60


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
          domain family is found in eukaryotes, and is typically
          between 125 and 136 amino acids in length.
          Length = 133

 Score = 32.9 bits (75), Expect = 0.21
 Identities = 9/34 (26%), Positives = 11/34 (32%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPP 64
           G     +  APP    P    P  P A G+   
Sbjct: 8  GGVSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSS 41



 Score = 28.2 bits (63), Expect = 8.3
 Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 2/40 (5%)

Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPPGA--VPAPPGAMG 60
          T G  +  +   PP  PG  P     P A  V +  G+ G
Sbjct: 7  TGGVSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSSAGSSG 46


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 33.9 bits (78), Expect = 0.23
 Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 1/51 (1%)

Query: 17  FGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP 67
           F + +F  GG  +   A P   P APP         P+  GA       QP
Sbjct: 82  FLSGMF-GGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQP 131



 Score = 30.4 bits (69), Expect = 3.1
 Identities = 10/43 (23%), Positives = 11/43 (25%)

Query: 32  GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           G   P  P A P    P        G+ G        QP    
Sbjct: 88  GGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQ 130


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 34.8 bits (80), Expect = 0.26
 Identities = 15/45 (33%), Positives = 16/45 (35%)

Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
          A       AP    A P    A   A   +PP  PP P Q  E P
Sbjct: 48 AKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAP 92


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 34.4 bits (79), Expect = 0.28
 Identities = 14/54 (25%), Positives = 14/54 (25%), Gaps = 7/54 (12%)

Query: 31  AGAPPPGAPGAPPG--PVAPPGAVPAP-----PGAMGLIPPNQPPQPVQPPELP 77
           A  P     GAP G  P    GAVPAP             P             
Sbjct: 357 AFEPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAAL 410



 Score = 32.9 bits (75), Expect = 0.86
 Identities = 17/51 (33%), Positives = 19/51 (37%), Gaps = 1/51 (1%)

Query: 26  GQTKCAGAPPPGAPGAPPGPVAPPGAVPAP-PGAMGLIPPNQPPQPVQPPE 75
           G    A A P  AP A P P AP  AV  P P A        P    +  +
Sbjct: 568 GARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ 618



 Score = 30.6 bits (69), Expect = 4.3
 Identities = 21/76 (27%), Positives = 23/76 (30%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
           AP   AP A      P    PA            PP P   P  P    P     G    
Sbjct: 493 APRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAAA 552

Query: 93  PIVLRANHFQISMPRG 108
             VLR    ++S  RG
Sbjct: 553 LDVLRNAGMRVSSDRG 568



 Score = 29.8 bits (67), Expect = 7.6
 Identities = 13/48 (27%), Positives = 13/48 (27%), Gaps = 5/48 (10%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQ 72
           G     A     GA GA   P A   A      A     P   P P  
Sbjct: 392 GASAVPAVTAVTGAAGAALAPKAAAAAAATRAEA-----PPAAPAPPA 434


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 34.3 bits (78), Expect = 0.30
 Identities = 22/69 (31%), Positives = 28/69 (40%), Gaps = 4/69 (5%)

Query: 32  GAPPPGAPGA-PPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL-PMFNCPRRPNLGR 89
              P  AP    PGP+  P   P PP  + +     PP  V P E+ P F    R  + R
Sbjct: 359 PPLPQAAPAVVKPGPMEIP--TPVPPPGLAIPSLVAPPGLVAPTEINPSFLASPRKKMKR 416

Query: 90  EGRPIVLRA 98
           E  P+   A
Sbjct: 417 EKLPVTFGA 425



 Score = 31.2 bits (70), Expect = 2.6
 Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 3/45 (6%)

Query: 37  GAPGAPPGPVAPPGAVPA-PPGAMGLIPPN--QPPQPVQPPELPM 78
           G              VP  P  A  ++ P   + P PV PP L +
Sbjct: 343 GNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAI 387


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 34.1 bits (78), Expect = 0.31
 Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 7/59 (11%)

Query: 29 KCAGAPPPGAPGAPPG--PVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
          K     P  AP +P       PP    AP           P       E P  + PRR 
Sbjct: 11 KGEAEQPAPAPPSPAAAPAPPPPAKTAAPA-----TKAAAPAAAAPRAEKPKKDKPRRE 64



 Score = 29.9 bits (67), Expect = 7.7
 Identities = 9/50 (18%), Positives = 12/50 (24%)

Query: 44 GPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRP 93
          G    P   P  P A    PP                   R    ++ +P
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKP 61


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 33.2 bits (76), Expect = 0.35
 Identities = 20/70 (28%), Positives = 22/70 (31%), Gaps = 6/70 (8%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPG-----AMGLIPPNQPPQPVQPPELPMFNCPRRPNL 87
           A P   P  PP P  P G  P  PG        L+P  QP  P+          P     
Sbjct: 126 ALPEIKPLQPPKP-DPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQYG 184

Query: 88  GREGRPIVLR 97
           G  G     R
Sbjct: 185 GGRGNYDEFR 194


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
          carrier protein subunit; Validated.
          Length = 130

 Score = 32.1 bits (73), Expect = 0.43
 Identities = 13/44 (29%), Positives = 14/44 (31%), Gaps = 5/44 (11%)

Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
          GAP   A   P  P + P  VP            Q PQP     
Sbjct: 22 GAPAQAAA--PAQPASTPVPVPTEASPQ---VEAQAPQPAAAAG 60


>gnl|CDD|165142 PHA02777, PHA02777, major capsid L1 protein; Provisional.
          Length = 555

 Score = 33.9 bits (77), Expect = 0.43
 Identities = 18/83 (21%), Positives = 24/83 (28%), Gaps = 10/83 (12%)

Query: 25  GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
           GG +  AGA     P  P G + P   +P    A  L    +       P  P+    R 
Sbjct: 65  GGTSLGAGAIGSARPSIPAGAIGPGDIIPIDAEAGALAEEIEL-----LPMAPISGRARD 119

Query: 85  PNLGREGRPIVLRANHFQISMPR 107
           P                 I +P 
Sbjct: 120 PF-----GEGFGEEPFDPIGIPP 137


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 33.6 bits (77), Expect = 0.44
 Identities = 22/72 (30%), Positives = 25/72 (34%), Gaps = 6/72 (8%)

Query: 31  AGAPPPG----APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
           A AP P     AP   P   A     P  P      PP  PP+PV PP          P 
Sbjct: 373 AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHT--PESAPK 430

Query: 87  LGREGRPIVLRA 98
           L R   P+  + 
Sbjct: 431 LTRAAIPVDEKP 442



 Score = 32.5 bits (74), Expect = 1.00
 Identities = 15/48 (31%), Positives = 16/48 (33%), Gaps = 3/48 (6%)

Query: 33  APPPGAPGAP-PGPVAP-PGAVPAPPGAMGLIPPNQPPQP-VQPPELP 77
           AP P  P A  P PV P P     P  A     P + P      P   
Sbjct: 365 APQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPV 412


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 33.8 bits (77), Expect = 0.44
 Identities = 30/133 (22%), Positives = 42/133 (31%), Gaps = 16/133 (12%)

Query: 36  PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE--GRP 93
           P     PP P   PG   +   +    PPN PP P  P + P         +  +    P
Sbjct: 355 PDESNVPPNPPNVPGGSNSEFSSDVENPPN-PPNPDIPEQEPNIPEDSNKEVPEDVPMEP 413

Query: 94  IVLRANHFQISMPRGFVHHYEIQIQPDKCPRKVNREIIETMVHAYSKIFGALKPVFDG-R 152
              R N+F    P+          +P+      N  +I   +          K V  G R
Sbjct: 414 EDDRDNNFNE--PK----------KPENKGDGQNEPVIPKPLDNERDQSNKNKQVNPGNR 461

Query: 153 NNLYTRDPLPIGN 165
           +N   R   P G 
Sbjct: 462 HNSEDRYTRPHGR 474


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 33.3 bits (76), Expect = 0.45
 Identities = 17/52 (32%), Positives = 20/52 (38%), Gaps = 6/52 (11%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP-PQPVQPPE--LPMFNCP 82
           PPP     P    +PP A PAP       PP    P P + P+   P    P
Sbjct: 155 PPPPHAMPPA---SPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSP 203



 Score = 32.1 bits (73), Expect = 1.1
 Identities = 15/44 (34%), Positives = 15/44 (34%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           PPP AP     P  P    PA P A    P      P   P  P
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASP 187


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 33.3 bits (76), Expect = 0.47
 Identities = 10/53 (18%), Positives = 14/53 (26%), Gaps = 8/53 (15%)

Query: 28  TKCAGAPPPGAPGAPPGPVA------PPGAVPAPPGAMGLIPPNQPPQPVQPP 74
                APP   P + P P          G   AP  A+     +   +     
Sbjct: 80  GTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAV--EEKSPSEESATAT 130


>gnl|CDD|218646 pfam05590, DUF769, Xylella fastidiosa protein of unknown function
           (DUF769).  This family consists of several
           uncharacterized hypothetical proteins of unknown
           function from Xylella fastidiosa, the organism that
           causes Pierce's disease in plants.
          Length = 255

 Score = 33.3 bits (76), Expect = 0.49
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 5/45 (11%)

Query: 225 PSMTYTPVGRSFFSTPDGYYHPL-----GGGREVWFGFHQSVRPS 264
            S  Y P G +    PDG + PL      GG ++ FG   +   S
Sbjct: 100 TSNEYQPDGDTVLRNPDGKFDPLCTAEFVGGNDIQFGIRSAASES 144


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 33.1 bits (76), Expect = 0.49
 Identities = 13/42 (30%), Positives = 17/42 (40%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
           PP GA  A     A   ++     A    P    P PV+PP+
Sbjct: 67  PPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPK 108



 Score = 31.5 bits (72), Expect = 1.6
 Identities = 15/45 (33%), Positives = 17/45 (37%), Gaps = 4/45 (8%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           APP       P PV PP   P         P  +P Q V+ P  P
Sbjct: 91  APPNTPVEPEPAPVEPPKPKPVEK----PKPKPKPQQKVEAPPAP 131


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 32.2 bits (73), Expect = 0.52
 Identities = 23/62 (37%), Positives = 24/62 (38%), Gaps = 7/62 (11%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPP-----GAMGLIPPNQPP--QPVQPPELPMFNCPRRPN 86
           P PG  G PP P   PG    PP     G  G+ PP  P    P  PP  P    P  P 
Sbjct: 89  PLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148

Query: 87  LG 88
            G
Sbjct: 149 FG 150



 Score = 31.8 bits (72), Expect = 0.74
 Identities = 22/47 (46%), Positives = 23/47 (48%), Gaps = 5/47 (10%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A  PPP  P  P GP  PP   P  PGA G+ PP  PP    PP  P
Sbjct: 107 AAVPPP--PPLPGGPGVPPPP-PPFPGAPGIPPP--PPGMGSPPPPP 148



 Score = 29.1 bits (65), Expect = 5.6
 Identities = 17/37 (45%), Positives = 17/37 (45%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
           PPP  PGAP  P  PPG    PP   G   P  P  P
Sbjct: 124 PPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 33.3 bits (77), Expect = 0.55
 Identities = 13/43 (30%), Positives = 15/43 (34%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
           A P   P  PP   AP  +  A       + P Q P    PP 
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPA 404



 Score = 32.9 bits (76), Expect = 0.91
 Identities = 11/48 (22%), Positives = 14/48 (29%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
              PP  A  A              P     +PP     P Q P +P+
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPL 415



 Score = 32.5 bits (75), Expect = 1.1
 Identities = 15/47 (31%), Positives = 16/47 (34%), Gaps = 1/47 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A A    A  AP   VAPP A   PP      P   P  P+      
Sbjct: 376 APAASAQATAAPTAAVAPPQAPAVPP-PPASAPQQAPAVPLPETTSQ 421



 Score = 30.6 bits (70), Expect = 4.2
 Identities = 12/44 (27%), Positives = 13/44 (29%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           A  P P  P     P A   A  AP  A+        P P    
Sbjct: 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASA 406


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.5 bits (76), Expect = 0.56
 Identities = 16/63 (25%), Positives = 23/63 (36%), Gaps = 3/63 (4%)

Query: 23  TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-PPQP--VQPPELPMF 79
           T   Q+    AP    P +      PP       G + L  P+  PPQP  +   ++P  
Sbjct: 248 TASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPL 307

Query: 80  NCP 82
             P
Sbjct: 308 PLP 310



 Score = 32.0 bits (72), Expect = 1.9
 Identities = 22/71 (30%), Positives = 26/71 (36%), Gaps = 12/71 (16%)

Query: 35  PPGAPGAPPGPVAPPGAVPAPP--GAMGLIPPNQPPQPV----------QPPELPMFNCP 82
           PPGA  AP  P   P A   PP    +   P  QP QP            P  LP  + P
Sbjct: 184 PPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPP 243

Query: 83  RRPNLGREGRP 93
            +P    +  P
Sbjct: 244 LQPQTASQQSP 254



 Score = 29.7 bits (66), Expect = 7.9
 Identities = 13/45 (28%), Positives = 16/45 (35%), Gaps = 1/45 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGL-IPPNQPPQPVQPP 74
           A  PPP  P + P       +   PPG+          P P  PP
Sbjct: 464 AIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPP 508


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing
          protein (DUF2360).  This is the conserved 140 amino
          acid region of a family of proteins conserved from
          nematodes to humans. One C. elegans member is annotated
          as a Daf-16-dependent longevity protein 1 but this
          could not be confirmed. The function is unknown.
          Length = 147

 Score = 32.0 bits (73), Expect = 0.59
 Identities = 11/44 (25%), Positives = 12/44 (27%), Gaps = 1/44 (2%)

Query: 35 PPGAPGAPPGPVAPP-GAVPAPPGAMGLIPPNQPPQPVQPPELP 77
                 PP   A   G  P PP A        PP+    P   
Sbjct: 53 VTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAE 96



 Score = 29.7 bits (67), Expect = 3.8
 Identities = 14/53 (26%), Positives = 17/53 (32%), Gaps = 6/53 (11%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPR 83
            G PPP  P        PP   PA P      P    P+      + +   PR
Sbjct: 68  NGGPPPPPPARAEAASPPPPEAPAEP------PAEPEPEAPAENTVTVAKDPR 114


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF;
          Provisional.
          Length = 767

 Score = 33.2 bits (76), Expect = 0.60
 Identities = 13/48 (27%), Positives = 14/48 (29%), Gaps = 3/48 (6%)

Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
                        APP P   P A+ AP  A   I     P  V  P
Sbjct: 45 DEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIA---APAAVPAP 89


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score = 33.3 bits (76), Expect = 0.68
 Identities = 18/75 (24%), Positives = 21/75 (28%), Gaps = 5/75 (6%)

Query: 21  IFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP-ELPMF 79
             +  G T      P  A  A   P A      A P A       Q P P Q   + P  
Sbjct: 188 TLSVAGSTPATAPAPASAQPAAQSPAATQPEPAAAPAA----AKAQAPAPQQAGTQNPAK 243

Query: 80  NCPRRPNLGREGRPI 94
                P + R  R  
Sbjct: 244 VDHAAPAVRRLAREF 258


>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
          Length = 466

 Score = 33.0 bits (75), Expect = 0.77
 Identities = 21/79 (26%), Positives = 24/79 (30%), Gaps = 5/79 (6%)

Query: 24 CGGQTKCAGA---PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP--M 78
          C G     GA            PG    P   P P    G       P    PP+ P   
Sbjct: 20 CAGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPAP 79

Query: 79 FNCPRRPNLGREGRPIVLR 97
          F C R   L R G  + +R
Sbjct: 80 FRCKRPDVLARYGSRVQIR 98


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 32.9 bits (75), Expect = 0.83
 Identities = 17/31 (54%), Positives = 17/31 (54%), Gaps = 3/31 (9%)

Query: 31  AGAPPPGA-PGAPPGPVAPP--GAVPAPPGA 58
           A APPPGA P APP  V PP     PA P  
Sbjct: 911 ASAPPPGAGPPAPPQAVPPPRTTQPPAAPPR 941



 Score = 32.9 bits (75), Expect = 0.93
 Identities = 17/38 (44%), Positives = 19/38 (50%), Gaps = 4/38 (10%)

Query: 38  APGAPPGPVAPPGAV-PAPPGAMGLIPPNQPPQPVQPP 74
           APG       PPGA  PAPP A   +PP +  QP   P
Sbjct: 905 APGGDAASAPPPGAGPPAPPQA---VPPPRTTQPPAAP 939



 Score = 30.2 bits (68), Expect = 5.4
 Identities = 17/45 (37%), Positives = 17/45 (37%), Gaps = 4/45 (8%)

Query: 39  PGAPPGPVA-PPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCP 82
           PG   G VA  PG   A     G  PP  PPQ V PP        
Sbjct: 897 PGT--GRVATAPGGDAASAPPPGAGPP-APPQAVPPPRTTQPPAA 938


>gnl|CDD|183659 PRK12659, PRK12659, putative monovalent cation/H+ antiporter
          subunit C; Reviewed.
          Length = 117

 Score = 30.8 bits (70), Expect = 0.86
 Identities = 17/31 (54%), Positives = 18/31 (58%), Gaps = 4/31 (12%)

Query: 21 IFTCGGQTKCAGAPP--PGAPGAPPGPVAPP 49
          IFT G  T+  GAPP  P    AP GPVA P
Sbjct: 43 IFTVGRLTR--GAPPLVPEGATAPAGPVADP 71


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 32.7 bits (74), Expect = 0.97
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           P PG PG+ P      G +P+PP  M    P+  PQ  Q P
Sbjct: 435 PSPGGPGSQPPQSVSGGMIPSPPALM----PSPSPQMSQSP 471


>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 547

 Score = 32.5 bits (75), Expect = 1.00
 Identities = 15/44 (34%), Positives = 15/44 (34%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           A AP   A  A   P A   A PAP  A    P    P     P
Sbjct: 196 AAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAP 239



 Score = 31.7 bits (73), Expect = 1.7
 Identities = 10/44 (22%), Positives = 10/44 (22%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
             A  P A  AP        A  AP  A           P    
Sbjct: 194 VAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAA 237


>gnl|CDD|217814 pfam03957, Jun, Jun-like transcription factor. 
          Length = 236

 Score = 31.9 bits (72), Expect = 1.0
 Identities = 19/80 (23%), Positives = 20/80 (25%), Gaps = 23/80 (28%)

Query: 23  TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPP-----------------GAMGL---- 61
                       P  A  A PG +   G  P PP                 GAM      
Sbjct: 139 NSAAAAASGAGGPTAAAMAGPGGLFAGGLAPEPPVYANLSTFNPGATATAYGAMTEGFAA 198

Query: 62  --IPPNQPPQPVQPPELPMF 79
              P   PP  V PP L   
Sbjct: 199 PPHPHANPPGAVGPPRLQAL 218


>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
           Provisional.
          Length = 464

 Score = 32.6 bits (75), Expect = 1.1
 Identities = 15/48 (31%), Positives = 17/48 (35%), Gaps = 1/48 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP-PELP 77
           AGA P  A  A     A   A  A   A     P  P   V   P++P
Sbjct: 87  AGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPDIP 134


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 32.6 bits (74), Expect = 1.1
 Identities = 21/51 (41%), Positives = 22/51 (43%), Gaps = 7/51 (13%)

Query: 33  APPPGAPGAP--PGPVAP--PGAVPA--PPGAMGLIPPNQPPQPVQPPELP 77
           AP P  P     PGP  P  PGA PA  P  A    P  QPP   +   LP
Sbjct: 394 APEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVA-RSAPLP 443


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 18/71 (25%), Positives = 23/71 (32%), Gaps = 6/71 (8%)

Query: 29  KCAGAPPPGAPGAPPGPVAPPGAVPA----PPGAMGL--IPPNQPPQPVQPPELPMFNCP 82
           +  G+P P    +PP   A P AV +       A      P      P  PP  P  + P
Sbjct: 139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP 198

Query: 83  RRPNLGREGRP 93
                 R  R 
Sbjct: 199 PAAASPRPPRR 209



 Score = 31.7 bits (72), Expect = 2.4
 Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 1/49 (2%)

Query: 31  AGAPPPGAPGAPPG-PVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
            G+P P  P +P   P  PP A P P  A  L    +P     PP    
Sbjct: 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAAS 151



 Score = 30.5 bits (69), Expect = 5.5
 Identities = 11/37 (29%), Positives = 11/37 (29%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
           PPP  P A P P   P             PP     P
Sbjct: 117 PPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPP 153



 Score = 30.1 bits (68), Expect = 7.4
 Identities = 13/45 (28%), Positives = 14/45 (31%), Gaps = 1/45 (2%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP-PNQPPQP 70
                GA  P    A   P   P   P P   + L P P   P P
Sbjct: 879 PRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
          (LMP2) protein.  This family consists of several
          Gammaherpesvirus latent membrane protein (LMP2)
          proteins. Epstein-Barr virus is a human
          Gammaherpesvirus that infects and establishes latency
          in B lymphocytes in vivo. The latent membrane protein 2
          (LMP2) gene is expressed in latently infected B cells
          and encodes two protein isoforms, LMP2A and LMP2B, that
          are identical except for an additional N-terminal 119
          aa cytoplasmic domain which is present in the LMP2A
          isoform. LMP2A is thought to play a key role in either
          the establishment or the maintenance of latency and/or
          the reactivation of productive infection from the
          latent state. The significance of LMP2B and its role in
          pathogenesis remain unclear.
          Length = 489

 Score = 32.5 bits (74), Expect = 1.2
 Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 4/52 (7%)

Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP-PELPMFNCP 82
          GA  P + G P G      + P  P + G    ++P  PV    + P    P
Sbjct: 10 GAGGPRSHGGPDG--DEGDSNPYYPSSFGS-SWDRPGPPVPEDYDAPSHRPP 58


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 32.0 bits (73), Expect = 1.4
 Identities = 11/30 (36%), Positives = 11/30 (36%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMG 60
             A    AP  PP P A P   PA  G   
Sbjct: 294 QPAAAAAAPAPPPNPPATPPEPPARRGRGS 323


>gnl|CDD|236131 PRK07920, PRK07920, lipid A biosynthesis lauroyl acyltransferase;
           Provisional.
          Length = 298

 Score = 31.8 bits (73), Expect = 1.4
 Identities = 14/37 (37%), Positives = 16/37 (43%), Gaps = 10/37 (27%)

Query: 501 ALPNSGVWDMRGKQFYTGVEIRVWAIACFAPQRTVRE 537
           ALP+SG WDM G           W +    P  TV E
Sbjct: 113 ALPHSGNWDMAG----------AWLVQHHGPFTTVAE 139


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 32.1 bits (73), Expect = 1.5
 Identities = 12/45 (26%), Positives = 15/45 (33%), Gaps = 5/45 (11%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
           P P  PG P  P         P  A+     N      +PP  P+
Sbjct: 423 PTPMGPGGPLRPNGLA-----PMNAVRAPSRNAQNAAQKPPMQPV 462



 Score = 31.3 bits (71), Expect = 2.6
 Identities = 15/52 (28%), Positives = 16/52 (30%), Gaps = 6/52 (11%)

Query: 25  GGQTKCAGAPPPGAPGAPPG------PVAPPGAVPAPPGAMGLIPPNQPPQP 70
           GG  +  G  P  A  AP           P   V  PP    L      PQP
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQP 480


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 31.4 bits (70), Expect = 1.7
 Identities = 23/66 (34%), Positives = 28/66 (42%), Gaps = 17/66 (25%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAP--PGAMGLIPPNQP---PQP------------VQPPEL 76
           PP  AP  PP  V P  A PAP  P +    PP  P   P+P            + PP+ 
Sbjct: 117 PPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHNQLPPPDY 176

Query: 77  PMFNCP 82
           P  +CP
Sbjct: 177 PAASCP 182



 Score = 30.2 bits (67), Expect = 4.5
 Identities = 20/52 (38%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAP-PGAMGLIPPNQPPQPVQP-PELPMFNCP 82
           AP P  P   P   AP    PAP P       P  PP  V P P  P   CP
Sbjct: 90  APAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACP 141


>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
          Length = 226

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 12/40 (30%), Positives = 15/40 (37%), Gaps = 4/40 (10%)

Query: 35  PPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           P  AP     P     A P P  A    P  + P+P + P
Sbjct: 179 PLSAPDDEASPQPKSLATPPPVAA----PSRRTPRPRRKP 214


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 31.7 bits (71), Expect = 1.8
 Identities = 14/47 (29%), Positives = 19/47 (40%)

Query: 43  PGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGR 89
           PG +     VPAP  A    PP  P      P  P+ +  +   LG+
Sbjct: 100 PGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQPAILGQ 146



 Score = 29.4 bits (65), Expect = 8.7
 Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 6/43 (13%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
           A AP P AP   P    P    PA        P    P  VQP
Sbjct: 105 AAAPVP-APAPAPAAAPPAAPAPAAD-----TPAAPIPDAVQP 141


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 31.0 bits (70), Expect = 2.0
 Identities = 17/69 (24%), Positives = 18/69 (26%), Gaps = 10/69 (14%)

Query: 25 GGQTKCAGAPPPG--APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCP 82
          GG     G P     A     GPV   G   A  GA G         PV          P
Sbjct: 17 GGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPG--------GPVGGGGGGSGGPP 68

Query: 83 RRPNLGREG 91
              +  E 
Sbjct: 69 GGGEVAGEA 77


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 30.2 bits (68), Expect = 2.1
 Identities = 23/56 (41%), Positives = 25/56 (44%), Gaps = 7/56 (12%)

Query: 34  PPPG-APGAPPGPVAP---PGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
           PPPG  P  PP    P   P   P PPGA+   PP   P P  P + P    PRR 
Sbjct: 79  PPPGSTP-VPPPGPQPGYNPADYPPPPGAV--PPPQNYPYPPGPGQDPYAPRPRRA 131


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 31.3 bits (71), Expect = 2.1
 Identities = 16/60 (26%), Positives = 21/60 (35%), Gaps = 2/60 (3%)

Query: 21  IFTCGGQTKCAGAPPPG--APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
           I   G + +     PP     G PP P  P      PP  + L  P  P Q V+  +   
Sbjct: 44  IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAKDQP 103


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 31.6 bits (71), Expect = 2.4
 Identities = 20/76 (26%), Positives = 23/76 (30%), Gaps = 14/76 (18%)

Query: 25  GGQTKCAGAPPPGAPGA----PPGPVAPPGAVPA-----PPGAMGLIPPNQPPQPV---- 71
                 + A  P + GA     PG   P G  PA        A     P+  P P     
Sbjct: 378 ASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWD 437

Query: 72  -QPPELPMFNCPRRPN 86
             PP  P    P RP 
Sbjct: 438 DAPPAPPRSGIPPRPA 453



 Score = 30.4 bits (68), Expect = 4.4
 Identities = 17/49 (34%), Positives = 19/49 (38%)

Query: 26  GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           G T  + AP   AP A P P  P    P  P   G+ P   P  P   P
Sbjct: 413 GMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASP 461


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 31.1 bits (70), Expect = 2.8
 Identities = 14/70 (20%), Positives = 14/70 (20%), Gaps = 2/70 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAV--PAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLG 88
           A   P     A P     P     P              PQ    PE      P  P  G
Sbjct: 206 ATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGG 265

Query: 89  REGRPIVLRA 98
                    A
Sbjct: 266 EAPPANATPA 275


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 10/26 (38%), Positives = 10/26 (38%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPP 56
            G P   A  A  G  APP     PP
Sbjct: 510 TGKPAAAASPAGGGANAPPAKPVKPP 535



 Score = 29.8 bits (67), Expect = 7.6
 Identities = 12/30 (40%), Positives = 12/30 (40%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMG 60
           A AP   AP     P   P  V AP  A G
Sbjct: 399 APAPSAAAPEPKHQPAPEPRPVLAPTPASG 428


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.1 bits (71), Expect = 2.9
 Identities = 16/47 (34%), Positives = 17/47 (36%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A APPP  P  PP         PA   A    PP  P +P   P   
Sbjct: 545 ATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKD 591



 Score = 30.7 bits (70), Expect = 4.3
 Identities = 11/52 (21%), Positives = 15/52 (28%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
            T     P   AP   P P  P     +      +   + PP P+     P 
Sbjct: 536 PTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587


>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein.  Members
          of this family confer resistance to the metalloid
          element tellurium and its salts.
          Length = 98

 Score = 28.9 bits (65), Expect = 3.0
 Identities = 9/22 (40%), Positives = 9/22 (40%)

Query: 36 PGAPGAPPGPVAPPGAVPAPPG 57
          P AP  PP P  P  A P    
Sbjct: 1  PAAPVPPPAPAPPAPAPPPAAP 22


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 30.7 bits (68), Expect = 3.1
 Identities = 15/48 (31%), Positives = 17/48 (35%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
           A A P  A  AP    APP    APP      P      P +    P+
Sbjct: 302 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349



 Score = 30.7 bits (68), Expect = 3.1
 Identities = 15/47 (31%), Positives = 16/47 (34%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A APP  A   P    APP    APP      P      P +    P
Sbjct: 267 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAP 313



 Score = 30.3 bits (67), Expect = 5.2
 Identities = 15/47 (31%), Positives = 16/47 (34%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A APP  A  AP    A P    APP      P      P +    P
Sbjct: 232 AAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPP 278



 Score = 30.3 bits (67), Expect = 5.2
 Identities = 14/47 (29%), Positives = 15/47 (31%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A A P  A   P    APP    APP      P      P +    P
Sbjct: 246 AAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPP 292



 Score = 29.9 bits (66), Expect = 5.6
 Identities = 15/47 (31%), Positives = 16/47 (34%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A A P  A  AP    APP    APP      P      P +    P
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPP 285



 Score = 29.9 bits (66), Expect = 5.6
 Identities = 15/47 (31%), Positives = 16/47 (34%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A APP  A   P    APP    APP      P      P +    P
Sbjct: 253 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAP 299



 Score = 29.5 bits (65), Expect = 7.9
 Identities = 15/47 (31%), Positives = 16/47 (34%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           A APP  A   P    APP    APP      P      P +    P
Sbjct: 260 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAP 306


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 1/48 (2%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
              PP       P P   P     P   +  + P QP  P+  P+LP+
Sbjct: 104 PLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPL-LPDLPL 150


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 30.5 bits (69), Expect = 3.3
 Identities = 14/47 (29%), Positives = 15/47 (31%), Gaps = 1/47 (2%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPP-NQPPQPVQPPELPM 78
           A P  A    P P      VPA P            P PV+ P   M
Sbjct: 123 ADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSM 169


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 20/81 (24%), Positives = 24/81 (29%), Gaps = 7/81 (8%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP------NL 87
            P   P  PP P  PP     P       P +       P EL MF+             
Sbjct: 268 EPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEEL-MFDAVEADLPDNILAT 326

Query: 88  GREGRPIVLRANHFQISMPRG 108
            +  +    RA   Q S  RG
Sbjct: 327 LQTVQRRRGRAGGEQKSNHRG 347


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 29.9 bits (67), Expect = 3.8
 Identities = 14/45 (31%), Positives = 15/45 (33%), Gaps = 9/45 (20%)

Query: 32  GAPPPGAPGA---------PPGPVAPPGAVPAPPGAMGLIPPNQP 67
           G PPP  P           PP P       P  PG +G IP    
Sbjct: 155 GTPPPHLPYDVDPKTGIPRPPHPPDISPFYPLSPGGVGQIPHPLG 199


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 30.6 bits (69), Expect = 4.2
 Identities = 12/38 (31%), Positives = 15/38 (39%)

Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
           P P A    P   AP     A P A G  P ++P + 
Sbjct: 18 PPAPAASAPAPAAAAPAPVAAAAPAAAGPRPDDEPFKA 55


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 30.7 bits (69), Expect = 4.4
 Identities = 12/43 (27%), Positives = 14/43 (32%), Gaps = 2/43 (4%)

Query: 32  GAPPPGAPGAPPGPVAPPGAVPAP--PGAMGLIPPNQPPQPVQ 72
              P G P + P         P P   G  GL+ P  P  P  
Sbjct: 384 PQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGT 426


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 30.6 bits (69), Expect = 4.5
 Identities = 16/55 (29%), Positives = 22/55 (40%), Gaps = 6/55 (10%)

Query: 29  KCAGAPPPGAP------GAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
               A P   P      G    PVAP  +    P A G  P ++PP+ +  PE+ 
Sbjct: 123 YGKQAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVK 177


>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
          Length = 423

 Score = 30.4 bits (69), Expect = 4.7
 Identities = 12/54 (22%), Positives = 15/54 (27%), Gaps = 1/54 (1%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAP-PGAVPAPPGAMGLIPPNQPPQPVQPPELPMF 79
           +   AG   P          A    A PA       +P  QP +     EL   
Sbjct: 191 EVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIAD 244


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 30.0 bits (67), Expect = 4.7
 Identities = 10/40 (25%), Positives = 10/40 (25%), Gaps = 4/40 (10%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
             A  PG      GP    G             P  PP P
Sbjct: 97  PEASGPGPARGARGPAGSRGRGRRAESP----SPRDPPNP 132


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 29.8 bits (66), Expect = 5.0
 Identities = 18/62 (29%), Positives = 22/62 (35%), Gaps = 4/62 (6%)

Query: 23  TCGGQTKCAGAPP----PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
               +TK   AP     P +  A P     P A  AP        P  P + V+P E P 
Sbjct: 163 AAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEAPSSTAKASAPAAPAEEVKPSEAPA 222

Query: 79  FN 80
            N
Sbjct: 223 AN 224


>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 28.2 bits (63), Expect = 5.3
 Identities = 10/43 (23%), Positives = 11/43 (25%), Gaps = 7/43 (16%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
          A A       A P PV      PA           Q     +P
Sbjct: 56 AAAAQAPYQVATPTPVQSQRVDPAA-------ASGQDSTEARP 91


>gnl|CDD|215643 PLN03241, PLN03241, magnesium chelatase subunit H; Provisional.
          Length = 1315

 Score = 30.2 bits (68), Expect = 5.4
 Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 3/40 (7%)

Query: 41  APPGPVAPPGAVPAPPG---AMGLIPPNQPPQPVQPPELP 77
           APP  V PP   P P     A+GL  P++  Q     E P
Sbjct: 257 APPSTVTPPSYAPPPLVETPALGLYHPDRERQQAPYFESP 296


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 29.9 bits (67), Expect = 5.5
 Identities = 14/53 (26%), Positives = 15/53 (28%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
            PPP  P  PP           PP          PPQ  Q  + P       P
Sbjct: 122 YPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAP 174



 Score = 29.5 bits (66), Expect = 7.9
 Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 5/76 (6%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP-----MFNCPRRPNLG 88
           PPP A  AP      P   P  P +     P      +QPP         F  P +P+  
Sbjct: 166 PPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPY 225

Query: 89  REGRPIVLRANHFQIS 104
             G P     + F   
Sbjct: 226 MYGGPGGRPNSGFPSG 241


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
          associated protein 2 has a highly conserved sequence
          throughout evolution including a conserved polyproline
          region and several SH2/SH3 binding sites. It occurs as
          a single copy gene with a four-exon organisation and is
          located on chromosome 12. It encodes a ubiquitously
          expressed protein and binds to DAZ and DAZL1 through
          DAZ repeats.
          Length = 136

 Score = 29.0 bits (65), Expect = 5.8
 Identities = 15/49 (30%), Positives = 16/49 (32%), Gaps = 6/49 (12%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-----PPQPVQPP 74
          A  P   AP  P   +  P A     G     PP        P PV PP
Sbjct: 22 AQMPQASAP-YPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69


>gnl|CDD|193426 pfam12953, DUF3842, Domain of unknown function (DUF3842).  This
           short protein is found mainly in firmicute bacteria. It
           is functionally uncharacterized.
          Length = 131

 Score = 28.6 bits (65), Expect = 6.0
 Identities = 18/51 (35%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 414 MQTSTMIKATARSAPDREREIQSLVKRADFNNDAYVHEFGLTISNSMM-EV 463
           + TS M+KA A      E  I    K+AD      V   G+ I+NSM+ E+
Sbjct: 38  IATSAMLKAGANRGATGENAIVVNAKKADI----IVGPIGIVIANSMLGEI 84


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
          includes the B. subtilis YppG protein, which is
          functionally uncharacterized. This family of proteins
          is found in bacteria. Proteins in this family are
          typically between 115 and 181 amino acids in length.
          There are two completely conserved residues (F and G)
          that may be functionally important.
          Length = 110

 Score = 28.5 bits (64), Expect = 6.1
 Identities = 15/47 (31%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 35 PPGAPGAPPGPVAPPGAVPAPPGAMGLIP-PNQPPQPVQPPELPMFN 80
           P     PP P +PP             P P Q PQ  QPP+   F 
Sbjct: 23 QPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFSSFL 69


>gnl|CDD|234228 TIGR03483, FtsZ_alphas_C, cell division protein FtsZ,
          alphaProteobacterial C-terminal extension.  This model
          describes a domain found as a C-terminal extension to
          the cell division protein FtsZ in many but not all
          members of the alphaProteobacteria [Cellular processes,
          Cell division].
          Length = 121

 Score = 28.7 bits (64), Expect = 6.1
 Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 5/53 (9%)

Query: 41 APPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRP 93
          AP   + P    P  P A       +       P+  ++  PRR  L  +GRP
Sbjct: 50 APAAHLEPAPRAPQAPPAHPYNENRRAF----APQASIY-APRRGGLDDQGRP 97


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 29.7 bits (67), Expect = 6.3
 Identities = 16/44 (36%), Positives = 16/44 (36%), Gaps = 6/44 (13%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
          A   PP A   PP PV  P   P P      IP      PV  P
Sbjct: 56 ADLEPPQAVQPPPEPVVEPEPEPEP------IPEPPKEAPVVIP 93


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 30.1 bits (68), Expect = 6.4
 Identities = 14/28 (50%), Positives = 14/28 (50%), Gaps = 1/28 (3%)

Query: 31  AGAPPPGAPGAPPGPVAPPGAVPAPPGA 58
            G  P G PG  PG   P GA PA  GA
Sbjct: 617 GGGMPGGMPGGMPG-GMPGGAGPAGAGA 643


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 29.6 bits (66), Expect = 6.4
 Identities = 20/51 (39%), Positives = 20/51 (39%), Gaps = 5/51 (9%)

Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMG---LIPPNQPPQPVQPPELPM 78
          A  PP  AP   P PV  P A   P        L  P Q PQPV  P   M
Sbjct: 17 ANVPPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPV--PTSAM 65



 Score = 29.3 bits (65), Expect = 8.0
 Identities = 15/36 (41%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 36  PGAPGAPPGPVAPPGAVPAPPGAMGLIPP--NQPPQ 69
             AP A P P APP A  A P A+ + PP    P  
Sbjct: 71  QQAP-AQPAPAAPPAAGAALPEALEVPPPPAFTPNG 105


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 29.9 bits (67), Expect = 6.8
 Identities = 24/91 (26%), Positives = 31/91 (34%), Gaps = 27/91 (29%)

Query: 25  GGQTKCAGAPPPGAP--------------GAPPGPVAPPGAV--PAPPGAMGLIPPNQPP 68
           GG++K    P P +                 PP P A   AV   APP A      N+P 
Sbjct: 50  GGRSKKENEPAPPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNEP- 108

Query: 69  QPVQPPELPMF-------NCPRRPNLGREGR 92
              +P E P+F          +R   G   R
Sbjct: 109 ---RPEETPIFAYSSGDQGGSKRAGHGDTDR 136


>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
          Length = 136

 Score = 28.4 bits (64), Expect = 7.3
 Identities = 17/50 (34%), Positives = 21/50 (42%), Gaps = 9/50 (18%)

Query: 19  NSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPP 68
           +S+FT  G     G   P          +P  A PAPP A  L P  +PP
Sbjct: 90  DSLFT--GNFDQTGNGRPTPS-------SPDAAGPAPPDATPLAPGERPP 130


>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182. 
          Length = 238

 Score = 29.5 bits (66), Expect = 7.4
 Identities = 15/42 (35%), Positives = 15/42 (35%), Gaps = 4/42 (9%)

Query: 34  PPPGAPGAPPGPVAPP--GAVPAPPGAMGLIPPNQPPQPVQP 73
           PPP   G   G   P   G    P  A   I PN   Q  QP
Sbjct: 75  PPPLGNGGGSGAGGPGPVGGGGGPGVAPNNIQPNA--QAQQP 114


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 29.0 bits (65), Expect = 7.5
 Identities = 16/52 (30%), Positives = 20/52 (38%), Gaps = 1/52 (1%)

Query: 23  TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
           T   Q        PG       P+ P   +PAPP ++   PP    QP  PP
Sbjct: 125 TPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQY-PPPGLLQPQGPP 175


>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family. 
          Length = 296

 Score = 29.4 bits (65), Expect = 7.5
 Identities = 11/44 (25%), Positives = 12/44 (27%), Gaps = 4/44 (9%)

Query: 34  PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
           P   A  +     AP     A P       P  PPQ       P
Sbjct: 251 PAEEAGASSAAQEAPAAEQEAAPAE----EPAAPPQEACSESSP 290


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 30.1 bits (67), Expect = 7.6
 Identities = 17/54 (31%), Positives = 21/54 (38%), Gaps = 3/54 (5%)

Query: 33  APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPP-QPVQPPELPMFNCPRRP 85
             P      P  PVAP      P   +   P  Q P QPV P   P +  P++P
Sbjct: 772 VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ--PQYQQPQQP 823


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl
          carrier protein subunit; Validated.
          Length = 155

 Score = 28.6 bits (65), Expect = 8.4
 Identities = 16/51 (31%), Positives = 16/51 (31%), Gaps = 5/51 (9%)

Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL-----PM 78
          A  P AP A     AP  A PA   A    P   P       E      PM
Sbjct: 36 AAAPVAPVAQQAAAAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPM 86


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 29.7 bits (67), Expect = 8.6
 Identities = 18/45 (40%), Positives = 22/45 (48%), Gaps = 4/45 (8%)

Query: 63   PPNQPPQPVQPPELPMFNCPRRPNLGREGRPIVLRANHFQISMPR 107
            P +  P PVQ   L       R    REG P+VLR   F+I+ PR
Sbjct: 1296 PTSAAPHPVQAGSLVFEGVQMRY---REGLPLVLRGVSFRIA-PR 1336


>gnl|CDD|221440 pfam12144, Med12-PQL, Eukaryotic Mediator 12 catenin-binding
          domain.  This domain is found in eukaryotes, and is
          typically between 325 and 354 amino acids in length.
          Both development and carcinogenesis are driven by
          signal transduction within the canonical
          Wnt/beta-catenin pathway through both programmed and
          unprogrammed changes in gene transcription.
          Beta-catenin physically and functionally targets this
          PQL (proline-, glutamine-, leucine-rich) region of the
          Med12 subunit of Mediator to activate transcription.
          The beta-catenin transactivation domain binds directly
          to isolated Med12 and intact Mediator both in vitro and
          in vivo, and Mediator is recruited to Wnt-responsive
          genes in a beta-catenin-dependent manner.
          Length = 204

 Score = 28.8 bits (64), Expect = 8.8
 Identities = 17/48 (35%), Positives = 20/48 (41%), Gaps = 8/48 (16%)

Query: 47 APPGAVPAPPGAMGLIPPNQP--------PQPVQPPELPMFNCPRRPN 86
          AP G +P    AM +   NQP          P +P   PM   P RPN
Sbjct: 8  APYGRLPYGQQAMNMYTQNQPLPPGGPGLEPPYRPARNPMNKMPVRPN 55


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
           overall function of the full-length Med25 is efficiently
           to coordinate the transcriptional activation of RAR/RXR
           (retinoic acid receptor/retinoic X receptor) in higher
           eukaryotic cells. Human Med25 consists of several
           domains with different binding properties, the
           N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
           from residues 229-381, a PTOV(B) or ACID domain from
           395-545, an SD2 domain from residues 564-645 and a
           C-terminal NR box-containing domain (646-650) from
           646-747. This The function of the SD domains is unclear.
          Length = 168

 Score = 28.6 bits (63), Expect = 9.0
 Identities = 19/50 (38%), Positives = 21/50 (42%), Gaps = 9/50 (18%)

Query: 35  PPGAPGAPPGPVAP---------PGAVPAPPGAMGLIPPNQPPQPVQPPE 75
           PPG PGAP    A          PG+  AP      +PP QP QP   P 
Sbjct: 104 PPGPPGAPKPSPASQLSLVTTVSPGSGLAPVLTQQQVPPQQPQQPSMVPT 153



 Score = 28.6 bits (63), Expect = 9.7
 Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 4/48 (8%)

Query: 36 PGAPGAPPGPVAPPGAVPAPP----GAMGLIPPNQPPQPVQPPELPMF 79
          P   G+ PGP+     V  PP        L  P  P  PV PP++ + 
Sbjct: 2  PVGGGSVPGPLQSKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQMQVP 49


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 29.2 bits (65), Expect = 9.4
 Identities = 20/55 (36%), Positives = 21/55 (38%), Gaps = 8/55 (14%)

Query: 32  GAPPPGAPGAPPGPVAPP--------GAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
           G P P  P  PPGP  PP             PP   G  PP     P+ PP  PM
Sbjct: 163 GPPSPWNPRFPPGPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 29.6 bits (66), Expect = 9.7
 Identities = 12/52 (23%), Positives = 16/52 (30%), Gaps = 1/52 (1%)

Query: 27  QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
           + +    P P  P  PP   +                P Q    VQPP+L  
Sbjct: 565 RVRERWRPAPWTP-NPPRSPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQ 615


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 29.5 bits (66), Expect = 9.8
 Identities = 13/50 (26%), Positives = 15/50 (30%), Gaps = 12/50 (24%)

Query: 4   PFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVP 53
           P   TP   +D                AGAP   AP   P P   P  + 
Sbjct: 168 PPGGTPPPDDDEGDE------------AGAPATPAPPLHPPPAPHPHPIA 205


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.136    0.418 

Gapped
Lambda     K      H
   0.267   0.0717    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 44,348,042
Number of extensions: 4362570
Number of successful extensions: 6910
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5806
Number of HSP's successfully gapped: 422
Length of query: 865
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 760
Effective length of database: 6,280,432
Effective search space: 4773128320
Effective search space used: 4773128320
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.1 bits)