RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1605
         (718 letters)



>gnl|CDD|215784 pfam00200, Disintegrin, Disintegrin. 
          Length = 76

 Score =  116 bits (294), Expect = 2e-31
 Identities = 45/78 (57%), Positives = 53/78 (67%), Gaps = 2/78 (2%)

Query: 86  EDGEECDCGLEDSCKNACCNATTCMLNVNATCATGSCCNLETCQPHTAGRQCRAADRECD 145
           E+GEECDCG  + C+N CC+ATTC L   A CATG CC+   C+   AG  CR A  ECD
Sbjct: 1   EEGEECDCGSPEECQNPCCDATTCKLKPGAQCATGPCCD--QCKFKPAGTVCRPASGECD 58

Query: 146 LPEFCTGDSEFCPSDVFK 163
           LPE+CTG S  CP DV+K
Sbjct: 59  LPEYCTGQSAECPPDVYK 76


>gnl|CDD|214490 smart00050, DISIN, Homologues of snake disintegrins.  Snake
           disintegrins inhibit the binding of ligands to integrin
           receptors. They contain a 'RGD' sequence, identical to
           the recognition site of many adhesion proteins.
           Molecules containing both disintegrin and
           metalloprotease domains are known as ADAMs.
          Length = 75

 Score =  105 bits (264), Expect = 2e-27
 Identities = 37/77 (48%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 86  EDGEECDCGLEDSCKNACCNATTCMLNVNATCATGSCCNLETCQPHTAGRQCRAADRECD 145
           E+GEECDCG    C + CC+  TC L   A CA+G CC  + C+   AG  CR +  ECD
Sbjct: 1   EEGEECDCGSPKECTDPCCDPATCKLKPGAQCASGPCC--DNCKFKPAGTLCRPSVDECD 58

Query: 146 LPEFCTGDSEFCPSDVF 162
           LPE+C G S  CP D +
Sbjct: 59  LPEYCNGTSADCPPDPY 75


>gnl|CDD|219879 pfam08516, ADAM_CR, ADAM cysteine-rich.  ADAMs are
           membrane-anchored proteases that proteolytically modify
           cell surface and extracellular matrix (ECM) in order to
           alter cell behaviour. It has been shown that the
           cysteine-rich domain of ADAM13 regulates the protein's
           metalloprotease activity.
          Length = 117

 Score = 78.5 bits (194), Expect = 2e-17
 Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 5/72 (6%)

Query: 165 DGETCKGGEAFCYEGSCRTHSDQCLLLWGPSASSSDKRCF-DLNTSGNRHGNCGYYKPNM 223
           DG  C+  + +CY G C T  +QC  L+GP A  +   C+ +LN+ G+R GNCG  K N 
Sbjct: 1   DGHPCQNNQGYCYNGRCPTRDNQCQDLFGPGAKVAPDSCYEELNSKGDRFGNCG--KENG 58

Query: 224 TYAKCEED--KC 233
           TY  CE    KC
Sbjct: 59  TYIPCEPQDVKC 70


>gnl|CDD|214743 smart00608, ACR, ADAM Cysteine-Rich Domain. 
          Length = 137

 Score = 77.8 bits (192), Expect = 6e-17
 Identities = 31/73 (42%), Positives = 41/73 (56%), Gaps = 5/73 (6%)

Query: 164 MDGETCKGGEAFCYEGSCRTHSDQCLLLWGPSASSSDKRCFD-LNTSGNRHGNCGYYKPN 222
            DG  C  G+ +CY G C T  +QC  L+GP A  +   C++ LNT G+R GNCG  + N
Sbjct: 1   QDGTPCDNGQGYCYNGRCPTRDNQCQALFGPGAKVAPDSCYEELNTKGDRFGNCG--REN 58

Query: 223 MTYAKCEED--KC 233
            TY  C  +  KC
Sbjct: 59  GTYIPCAPEDVKC 71


>gnl|CDD|239797 cd04269, ZnMc_adamalysin_II_like, Zinc-dependent metalloprotease;
           adamalysin_II_like subfamily. Adamalysin II is a snake
           venom zinc endopeptidase. This subfamily contains other
           snake venom metalloproteinases, as well as
           membrane-anchored metalloproteases belonging to the ADAM
           family. ADAMs (A Disintegrin And Metalloprotease) are
           glycoproteins, which play roles in cell signaling, cell
           fusion, and cell-cell interactions.
          Length = 194

 Score = 55.7 bits (135), Expect = 7e-09
 Identities = 17/25 (68%), Positives = 18/25 (72%)

Query: 47  GHNLGLEHDTTECTCPSDRCIMAPS 71
           GHNLG+EHD   CTC    CIMAPS
Sbjct: 140 GHNLGMEHDDGGCTCGRSTCIMAPS 164


>gnl|CDD|216491 pfam01421, Reprolysin, Reprolysin (M12B) family zinc
           metalloprotease.  The members of this family are enzymes
           that cleave peptides. These proteases require zinc for
           catalysis. Members of this family are also known as
           adamalysins. Most members of this family are snake venom
           endopeptidases, but there are also some mammalian
           proteins such as human ADAM8 and fertilin. Fertilin and
           closely related proteins appear to not have some active
           site residues and may not be active enzymes.
          Length = 198

 Score = 50.3 bits (121), Expect = 5e-07
 Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 6/35 (17%)

Query: 47  GHNLGLEHDTT-ECTCPSDRCIMAP-----SSRLF 75
           GHNLG+ HD    CTC    CIM P       + F
Sbjct: 140 GHNLGMTHDDIDGCTCGGGGCIMNPVASSSPGKKF 174


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 43.8 bits (103), Expect = 4e-04
 Identities = 57/304 (18%), Positives = 89/304 (29%), Gaps = 27/304 (8%)

Query: 285  SVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANK 344
            +  LP   A  +   P +P+      + A P  P    R +        T  P  P    
Sbjct: 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPA 2773

Query: 345  GPNTSPYAR---PLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVAT 401
             P   P  R   P ++S   S  +  +  +  D   A        P + + +  LPP  +
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833

Query: 402  ESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEISE 461
                 P     P     + P G ++        R  +R  +     PA         ++ 
Sbjct: 2834 AQPTAPPPPPGPP--PPSLPLGGSVAPGGDVRRRPPSRSPA---AKPAAPARPPVRRLAR 2888

Query: 462  PIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGSQRPSQPPP 521
            P                + T+S        +RP  P       +         +P  PPP
Sbjct: 2889 PAV-------------SRSTESFALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPP 2934

Query: 522  PAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQNPSAGQAK 581
            P P     L    +PAG  + PSG    P       G+   P      P     +   + 
Sbjct: 2935 PPPRPQPPLAPTTDPAGAGE-PSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASST 2993

Query: 582  TPGT 585
             P T
Sbjct: 2994 PPLT 2997



 Score = 43.0 bits (101), Expect = 7e-04
 Identities = 58/323 (17%), Positives = 89/323 (27%), Gaps = 64/323 (19%)

Query: 297  PTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELP--PANKGPNTSPYARP 354
            P PP  P  A    + A P  P  +    A  A  +A  PP +P  PA  G    P   P
Sbjct: 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761

Query: 355  LISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPV 414
              + P                            A   +  T P VA+ S +         
Sbjct: 2762 TTAGP------------------PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803

Query: 415  SSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEISEPIPLNNLRDNEDK 474
             ++  A   +    +            +  L PP +           P P          
Sbjct: 2804 PADPPAAVLAPAAAL------PPAASPAGPLPPPTSAQPTAPPPPPGPPP---------- 2847

Query: 475  VDMVKRTQSMRETDVNFKRPNLPAFGSMR-----AKRPTSLAGSQRPSQPPPPAPSGIQS 529
                               P+LP  GS+       +RP S + + +P+ P  P    +  
Sbjct: 2848 -------------------PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888

Query: 530  LPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQG 589
                ++    A  P   +  P  Q     Q +        P    P   + + P  P   
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948

Query: 590  LLGPGQGLGY----QNPAVPPGK 608
              G G+  G        A+ PG+
Sbjct: 2949 PAGAGEPSGAVPQPWLGALVPGR 2971



 Score = 39.5 bits (92), Expect = 0.008
 Identities = 65/345 (18%), Positives = 101/345 (29%), Gaps = 44/345 (12%)

Query: 292  AAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSA-----------------DFANFSAT 334
              P+ P  P    D  + P R AP+    +V   A                 D  +    
Sbjct: 2552 PPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP 2611

Query: 335  LPPE-LPPANKGPNTSPYA-RPLISSPILSDTTSQTVRELIDAKNAES----TRQTTRPA 388
             PP  LPP    P+  P +  P  + P      +    E      A       R+  R  
Sbjct: 2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG 2671

Query: 389  SVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPP 448
              A++S+ PP      A    T+  ++S ++ P                   ++ V   P
Sbjct: 2672 RAAQASS-PPQRPRRRAAR-PTVGSLTSLADPPPPP---------PTPEPAPHALVSATP 2720

Query: 449  ATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPT 508
               G A + + S  +P             V    +         RP   A G      P 
Sbjct: 2721 LPPGPAAARQASPALPAAPA------PPAVPAGPATPGGPARPARPPTTA-GPPAPAPPA 2773

Query: 509  SLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKT-PSGIE 567
            + A         P   S  +S     +P   A  P+ + +       +A  A   P    
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833

Query: 568  SLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPA-VPPGKPKT 611
            + P    P  G        G G + PG  +  + P+  P  KP  
Sbjct: 2834 AQPTAPPPPPGPPPPSLPLG-GSVAPGGDVRRRPPSRSPAAKPAA 2877



 Score = 35.3 bits (81), Expect = 0.15
 Identities = 62/339 (18%), Positives = 92/339 (27%), Gaps = 30/339 (8%)

Query: 293  APVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYA 352
            AP +   P+ P D +  P   AP  P      + D    S +     P  +  P   P  
Sbjct: 2591 APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE 2650

Query: 353  RPLISSPILSDTTSQTVRELIDAKNAESTRQTTRP-------ASVAKSSTLPPVATESNA 405
            RP         +  +  R L  A  A S  Q  R         S+   +  PP       
Sbjct: 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPT--- 2707

Query: 406  LPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTR------------RNSKVLTPPATQGS 453
             P    H + S +  P G    R  S     +                ++   PP T G 
Sbjct: 2708 -PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP 2766

Query: 454  ARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTS-LAG 512
                  + P      R     V       S+ E+  +   P  PA        P + L  
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAV------ASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820

Query: 513  SQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGY 572
            +  P+ P PP  S   + P           P G    PG        +++P+   + P  
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR 2880

Query: 573  QNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKT 611
                           +    P            P  P+ 
Sbjct: 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919


>gnl|CDD|239801 cd04273, ZnMc_ADAMTS_like, Zinc-dependent metalloprotease,
           ADAMTS_like subgroup. ADAMs (A Disintegrin And
           Metalloprotease) are glycoproteins, which play roles in
           cell signaling, cell fusion, and cell-cell interactions.
           This particular subfamily represents domain
           architectures that combine ADAM-like metalloproteinases
           with thrombospondin type-1 repeats. ADAMTS (a
           disintegrin and metalloproteinase with thrombospondin
           motifs) proteinases are inhibited by TIMPs (tissue
           inhibitors of metalloproteinases), and they play roles
           in coagulation, angiogenesis, development and
           progression of arthritis. They hydrolyze the von
           Willebrand factor precursor and various components of
           the extracellular matrix.
          Length = 207

 Score = 36.8 bits (86), Expect = 0.019
 Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 3/28 (10%)

Query: 47  GHNLGLEHDTTECTCPSDR---CIMAPS 71
           GH LG+ HD    +C  +     IM+P+
Sbjct: 149 GHVLGMPHDGDGNSCGPEGKDGHIMSPT 176


>gnl|CDD|239795 cd04267, ZnMc_ADAM_like, Zinc-dependent metalloprotease, ADAM_like
           or reprolysin_like subgroup. The adamalysin_like or ADAM
           family of metalloproteases contains proteolytic domains
           from snake venoms, proteases from the mammalian
           reproductive tract, and the tumor necrosis factor alpha
           convertase, TACE. ADAMs (A Disintegrin And
           Metalloprotease) are glycoproteins, which play roles in
           cell signaling, cell fusion, and cell-cell interactions.
          Length = 192

 Score = 36.6 bits (85), Expect = 0.021
 Identities = 12/30 (40%), Positives = 13/30 (43%), Gaps = 4/30 (13%)

Query: 47  GHNLGLEHDTTECTCPS----DRCIMAPSS 72
           GHNLG EHD  +            IMAP  
Sbjct: 142 GHNLGAEHDGGDELAFECDGGGNYIMAPVD 171


>gnl|CDD|224825 COG1913, COG1913, Predicted Zn-dependent proteases [General
           function prediction only].
          Length = 181

 Score = 35.8 bits (83), Expect = 0.031
 Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 6/30 (20%)

Query: 47  GHNLGLEHDTTECTCPSDRCIMAPSSRLFD 76
           GH LGL H      CP+ RC+M  S+ L D
Sbjct: 133 GHLLGLSH------CPNPRCVMNFSNSLRD 156


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 36.0 bits (83), Expect = 0.081
 Identities = 44/253 (17%), Positives = 71/253 (28%), Gaps = 25/253 (9%)

Query: 280 KSSDMSVKLPTRAAPVKPTPPKVP----SDAQITPIRAAPKIPATSVRLSADFANFSATL 335
            S   SV        +K   P++     S    +P ++   +       S   A  S   
Sbjct: 514 SSDAESVVS------IKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASPSTAG-SPRP 566

Query: 336 PPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSST 395
            P L     G    P+A         +      V   +  K + S++  T P S  K +T
Sbjct: 567 KPSLGKFVIGT--DPFAFANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPGSDLKPAT 624

Query: 396 LPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSAR 455
           L      S+ +  +     SS+   P G         +  ++        T    + +  
Sbjct: 625 LNGKTPSSSLVGAARNAGASSKVKIPSGLGGFTSPISLLESALEDVLTSATSTPVKKNDP 684

Query: 456 SLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNF--KRPNLPAFGSMRAKRPTSLAGS 513
            L             N +K      + S  +   NF       P  G  + K P S A  
Sbjct: 685 YLW----------DTNGEKAGGGTESASTTDVFQNFAGLNKKTPVGGPFQPKPPLSRALD 734

Query: 514 QRPSQPPPPAPSG 526
              S        G
Sbjct: 735 SASSPGGSGGKPG 747


>gnl|CDD|213029 cd11375, Peptidase_M54, Peptidase family M54, also called
           archaemetzincins or archaelysins.  Peptidase M54
           (archaemetzincin or archaelysin) is a zinc-dependent
           aminopeptidase that contains the consensus zinc-binding
           sequence HEXXHXXGXXH/D and a conserved Met residue at
           the active site, and is thus classified as a metzincin.
           Archaemetzincins, first identified in archaea, are also
           found in bacteria and eukaryotes, including two human
           members, archaemetzincin-1 and -2 (AMZ1 and AMZ2). AMZ1
           is mainly found in the liver and heart while AMZ2 is
           primarily expressed in testis and heart; both have been
           reported to degrade synthetic substrates and peptides.
           The Peptidase M54 family contains an extended metzincin
           concensus sequence of HEXXHXXGX3CX4CXMX17CXXC such that
           a second zinc ion is bound to four cysteines, thus
           resembling a zinc finger. Phylogenetic analysis of this
           family reveals a complex evolutionary process involving
           a series of lateral gene transfer, gene loss and genetic
           duplication events.
          Length = 173

 Score = 34.2 bits (79), Expect = 0.096
 Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 14/42 (33%)

Query: 47  GHNLGLEHDTTECTCPSDRCIMAPSSRLFDS--------PVC 80
           GH  GL+H      CP   C+M  S+ L ++        PVC
Sbjct: 132 GHLFGLDH------CPYYACVMNFSNSLEETDRKPPYLCPVC 167


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 35.2 bits (81), Expect = 0.15
 Identities = 28/143 (19%), Positives = 45/143 (31%), Gaps = 6/143 (4%)

Query: 285 SVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPAT--SVRLSADFANFSATLPPELPPA 342
           S + P  A    P P   P+ A          + A   +    A  A   A    + PP 
Sbjct: 440 SARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW 499

Query: 343 NKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATE 402
            + P       P       +   ++++ +   A + +   +T  PA  A  +     ATE
Sbjct: 500 EELPPEFASPAPAQPDAAPAGWVAESIPDPATA-DPDDAFETLAPAPAAAPAPRAAAATE 558

Query: 403 SNALPGSTLHPVSSESNAPKGST 425
               P     P +S S  P    
Sbjct: 559 PVVAPRP---PRASASGLPDMFD 578



 Score = 29.8 bits (67), Expect = 5.5
 Identities = 23/125 (18%), Positives = 33/125 (26%), Gaps = 10/125 (8%)

Query: 297 PTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLI 356
           P        AQ  P  AAP         +A  A  +    P   PA      +  A P  
Sbjct: 374 PATAAAAPVAQPAPAAAAPA--------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPAR 425

Query: 357 SSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSS 416
            SP          R+        +      PA+   ++  P  A        +   P  +
Sbjct: 426 RSPAPE--ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA 483

Query: 417 ESNAP 421
              A 
Sbjct: 484 APAAA 488


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 34.7 bits (80), Expect = 0.17
 Identities = 19/106 (17%), Positives = 25/106 (23%), Gaps = 3/106 (2%)

Query: 504 AKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTP 563
           A +P         SQP P  P+  Q +   +    Q +       LP        Q   P
Sbjct: 129 APKPEPQPPQAPESQPQPQTPA--QKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPP 186

Query: 564 SGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
               + P    P           G       Q         P   P
Sbjct: 187 RQA-AFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPP 231



 Score = 33.2 bits (76), Expect = 0.52
 Identities = 22/118 (18%), Positives = 28/118 (23%), Gaps = 4/118 (3%)

Query: 494 PNLPAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQ 553
            +       +     S A +Q P   PP  P     L   Q P    + P      P  Q
Sbjct: 209 GHPEQVQPQQFLPAPSQAPAQPP--LPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQ 266

Query: 554 NHSAGQAKTPSGIESLPGYQNPSAGQAKT--PGTPGQGLLGPGQGLGYQNPAVPPGKP 609
                    P        +     GQ     P    Q L    Q  G Q       + 
Sbjct: 267 QQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQL 324



 Score = 31.3 bits (71), Expect = 2.4
 Identities = 23/101 (22%), Positives = 26/101 (25%), Gaps = 8/101 (7%)

Query: 513 SQRPSQPPPPAPS-GIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPG 571
            Q P  P PP         P       Q          PGY     G  +     + LP 
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQP----PGYPQPPQGHPEQVQPQQFLPA 222

Query: 572 YQNPSAG---QAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
                A      + P  P         GL  Q P  PP  P
Sbjct: 223 PSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPP 263



 Score = 30.1 bits (68), Expect = 4.8
 Identities = 22/111 (19%), Positives = 26/111 (23%), Gaps = 16/111 (14%)

Query: 516 PSQPPPPAPSGIQSLPGYQNPAG-----------QAKTPSGIQSLPGYQNHSAGQAKTPS 564
           P Q PP  P G    P                      P     LP              
Sbjct: 192 PQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGL 251

Query: 565 GIESLPGYQNPSAGQAKTPGT----PGQGLLGPGQGLGYQ-NPAVPPGKPK 610
             +  P    P   Q + P      P Q    P  GL    N  +PP +  
Sbjct: 252 SQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQP 302


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 34.1 bits (78), Expect = 0.31
 Identities = 50/289 (17%), Positives = 75/289 (25%), Gaps = 44/289 (15%)

Query: 292 AAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNT-SP 350
            AP    P +V          A P   A +       A  + T       A   P   + 
Sbjct: 366 GAPGGGVPARVA--------GAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAA 417

Query: 351 YARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGST 410
            A     +P  +     T     DA + ++       A  +  S       +  A  GS 
Sbjct: 418 AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSA 477

Query: 411 LHPVSSE-----------SNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEI 459
             P S             + AP  +T   +    +  +  R             AR    
Sbjct: 478 SAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTP 537

Query: 460 SEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGSQRPSQP 519
           +   P          +D V R   MR                + + R    A + +P+  
Sbjct: 538 AAAAPAARAGGAAAALD-VLRNAGMR----------------VSSDRGARAAAAAKPAAA 580

Query: 520 PPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIES 568
           P  AP      P     A Q  TP    +         G A+     ES
Sbjct: 581 PAAAPK-----PAAPRVAVQVPTPRARAATG--DAPPNGAARAEQAAES 622



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 27/136 (19%), Positives = 37/136 (27%), Gaps = 5/136 (3%)

Query: 275 AAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSAT 334
           AA   K++  +      A P  P PP             AP     + R SAD       
Sbjct: 408 AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467

Query: 335 LPPELPPANK---GPNTSPYAR--PLISSPILSDTTSQTVRELIDAKNAESTRQTTRPAS 389
             P     +      +  P A   P   +   S  T   V +      A         A 
Sbjct: 468 AQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAP 527

Query: 390 VAKSSTLPPVATESNA 405
            A  +  P  A  + A
Sbjct: 528 PAPEARPPTPAAAAPA 543


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 32.9 bits (75), Expect = 0.41
 Identities = 28/95 (29%), Positives = 34/95 (35%), Gaps = 2/95 (2%)

Query: 260 QRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPA 319
           Q+  S  E    N  A  T     M +     A PV    P V + A      AAPK PA
Sbjct: 97  QKATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAA--AAPKAPA 154

Query: 320 TSVRLSADFANFSATLPPELPPANKGPNTSPYARP 354
                S   A  +A  P E   A K   T+   +P
Sbjct: 155 KPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKP 189


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 33.5 bits (76), Expect = 0.54
 Identities = 57/268 (21%), Positives = 90/268 (33%), Gaps = 43/268 (16%)

Query: 336 PPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSST 395
           PP++ P+++GP   P A P + +P    +T   V         +    +T P   AK   
Sbjct: 552 PPKVSPSDRGP---PKASPPVMAP---PSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKD 605

Query: 396 LPPVATESNALPGST---------LHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLT 446
            PP +      P S+         +     E    + ST  +  SF    + R  S +  
Sbjct: 606 GPPASGPHEKQPPSSAPRDMAPSVVRMFLRERLLEQ-STGPKPKSFWEMRAGRDGSGIQQ 664

Query: 447 PPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRP-NLPAFGSMRAK 505
            P+++    + + + P P             +     +   D    +P       SM   
Sbjct: 665 EPSSRRQP-ATQSTPPRP-----------SWLPSVFVLPSVDAGRAQPSEESHLSSMSPT 712

Query: 506 RPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSG 565
           +P S    ++P    P  P  +   P       QA  PS      G++   A QA  P  
Sbjct: 713 QPIS--HEEQPRYEDPDDPLDLSLHPD------QAPPPSHQAPYSGHEEPQAQQAPYPGY 764

Query: 566 IESLP------GYQNPSAGQAKTPGTPG 587
            E  P      GYQ P A   +    PG
Sbjct: 765 WEPRPPQAPYLGYQEPQAQGVQVSSYPG 792


>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain.  This family contains EGF
          domains found in a variety of extracellular proteins.
          Length = 31

 Score = 28.9 bits (65), Expect = 0.56
 Identities = 8/26 (30%), Positives = 11/26 (42%), Gaps = 1/26 (3%)

Query: 18 NCHDQGVCNSR-GHCHCHPGFAPPYC 42
           C+ +G C    G C C  G+    C
Sbjct: 6  ICNGRGTCVRPCGKCVCDSGYQGATC 31


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 33.2 bits (75), Expect = 0.63
 Identities = 52/337 (15%), Positives = 105/337 (31%), Gaps = 22/337 (6%)

Query: 261 RLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPAT 320
           R  ++     + ++  K   ++   + + T           VP+   + P    P +   
Sbjct: 408 RTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTA 467

Query: 321 SVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAES 380
                      S+TLP +  P ++  + +P A     +    + TS T ++  D     +
Sbjct: 468 DPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSD--TPNA 525

Query: 381 TRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRR 440
           T  T     V  ++T PP  T S     +   P  +E +    +    +TS  S  ++  
Sbjct: 526 TSPTPIVIGVTTTATSPPTGTTS---VPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAV 582

Query: 441 NSKVLTPPATQGSARSL--EISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPA 498
            +      ++  S +      S   P +N       +     T     T+     P++P+
Sbjct: 583 TTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEET---PSVPS 639

Query: 499 FGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAG 558
              +    P            P P  +   S PG  + +         + +P   N ++ 
Sbjct: 640 TTHVSTLSPG-----------PGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPN-PNATSP 687

Query: 559 QAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQ 595
            A +                 + T  T G  L+    
Sbjct: 688 SAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTS 724



 Score = 30.9 bits (69), Expect = 3.2
 Identities = 36/209 (17%), Positives = 60/209 (28%), Gaps = 7/209 (3%)

Query: 243 VAKSTQNHSNSTSGGRGQRLLSSGE--GQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPP 300
           V  +      ST         +SG   G   + L   T  +S  +   P   +P      
Sbjct: 449 VPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVT- 507

Query: 301 KVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPI 360
             P+    T  + +    ATS          +AT PP           +   +    SP+
Sbjct: 508 -TPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPT---GTTSVPNATSPQVTEESPV 563

Query: 361 LSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNA 420
            +  T           +A +T Q    +S        P ++ S     ST       S  
Sbjct: 564 NNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAH 623

Query: 421 PKGSTLNRITSFMSRNSTRRNSKVLTPPA 449
           P G       +    ++T  ++    P  
Sbjct: 624 PTGGENITEETPSVPSTTHVSTLSPGPGP 652



 Score = 30.1 bits (67), Expect = 5.0
 Identities = 38/208 (18%), Positives = 70/208 (33%), Gaps = 12/208 (5%)

Query: 244 AKSTQNHSNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVP 303
           A +T  H   +S    Q  + S    +    +  T  +  ++   PT    +    P VP
Sbjct: 581 AVTTGQHGTGSSPTSQQPGIPSS--SHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVP 638

Query: 304 SDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSD 363
           S   ++ +   P  P T+ ++S    + ++  P E+      PN      P  +SP    
Sbjct: 639 STTHVSTLSPGPG-PGTTSQVSGPGNSSTSRYPGEVHVTEGMPN------PNATSPSAPS 691

Query: 364 TTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKG 423
                V  +       ++       S   +ST P     +     +T +  ++       
Sbjct: 692 GQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFR---TTPYNATTYLPPSTS 748

Query: 424 STLNRITSFMSRNSTRRNSKVLTPPATQ 451
           S L    +F S   T + + V  PP   
Sbjct: 749 SKLRPRWTFTSPPVTTKQATVPVPPTQH 776


>gnl|CDD|238124 cd00203, ZnMc, Zinc-dependent metalloprotease. This super-family of
           metalloproteases contains two major branches, the
           astacin-like proteases and the
           adamalysin/reprolysin-like proteases. Both branches have
           wide phylogenetic distribution, and contain
           sub-families, which are involved in vertebrate
           development and disease.
          Length = 167

 Score = 31.7 bits (72), Expect = 0.71
 Identities = 7/23 (30%), Positives = 7/23 (30%)

Query: 47  GHNLGLEHDTTECTCPSDRCIMA 69
           GH LG  HD           I  
Sbjct: 105 GHALGFYHDHDRKDRDDYPTIDD 127


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 28.8 bits (65), Expect = 0.76
 Identities = 11/28 (39%), Positives = 14/28 (50%), Gaps = 3/28 (10%)

Query: 19 CHDQGVC-NSRG--HCHCHPGFAPPYCE 43
          C + G C N+ G   C C PG+    CE
Sbjct: 11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 32.7 bits (74), Expect = 0.92
 Identities = 33/174 (18%), Positives = 57/174 (32%), Gaps = 12/174 (6%)

Query: 447 PPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMR--ETDVNFKRPNLPAFGSMRA 504
            P T    +      PIP+  LR      +++      +  + ++   +P     G +  
Sbjct: 615 IPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPY 674

Query: 505 K-RPTSLAGSQRPS------QPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSA 557
           +  PT               QPPP AP+    +     P G+A+ P+           + 
Sbjct: 675 QPSPTGANTMLPIQWAPGTMQPPPRAPT---PMRPPAAPPGRAQRPAAATGRARPPAAAP 731

Query: 558 GQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKT 611
           G+A+ P+           + G+A+ P         P    G   P  PP  P  
Sbjct: 732 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA 785



 Score = 30.0 bits (67), Expect = 5.7
 Identities = 32/117 (27%), Positives = 37/117 (31%), Gaps = 12/117 (10%)

Query: 497 PAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHS 556
           PA    RA+RP +  G  RP    P       + PG   P   A  P   +        +
Sbjct: 707 PAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP--AAAPGRARPPAAAPGRA 764

Query: 557 AGQAKTPSGIESLPGYQNPSAGQ-----AKTPGTPGQGL-----LGPGQGLGYQNPA 603
              A  P      P  Q P A Q     A TP  P Q       L P    G Q P 
Sbjct: 765 RPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPT 821


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 26/132 (19%), Positives = 34/132 (25%), Gaps = 4/132 (3%)

Query: 518 QPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQNPSA 577
            P  P P      P  Q P  QA           +   +   A  P    +LP       
Sbjct: 52  SPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT 111

Query: 578 GQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKTYDDCLNLLSESMAALANVDHNPNDNI 637
            Q + PG      +        QNPA  P         L  L +   A A+         
Sbjct: 112 IQTE-PGQLYPVQVPVMVT---QNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQS 167

Query: 638 YAVIEESPATSD 649
               +ES     
Sbjct: 168 AQKNDESQLQQQ 179


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 32.3 bits (74), Expect = 1.1
 Identities = 23/113 (20%), Positives = 31/113 (27%), Gaps = 2/113 (1%)

Query: 497 PAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHS 556
           PA G+   + P + A S  P +   PA     + P    PAG A  P+   + P      
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPG--V 647

Query: 557 AGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
           A     P  +             AK  G                 PA      
Sbjct: 648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700



 Score = 31.1 bits (71), Expect = 2.8
 Identities = 17/117 (14%), Positives = 28/117 (23%), Gaps = 5/117 (4%)

Query: 503 RAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKT 562
           +A      A    P+   P AP+G         PA                  + G +  
Sbjct: 672 KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731

Query: 563 PS----GIESLPGYQNPSAG-QAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKTYDD 614
                  +   P   +P     A     P               P+ P  + +  +D
Sbjct: 732 SPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788


>gnl|CDD|191923 pfam07998, Peptidase_M54, Peptidase family M54.  This is a family
           of metallopeptidases. Two human proteins have been
           reported to degrade synthetic substrates and peptides.
          Length = 176

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 11/30 (36%), Positives = 14/30 (46%), Gaps = 6/30 (20%)

Query: 47  GHNLGLEHDTTECTCPSDRCIMAPSSRLFD 76
           GH  GL H      C +  C+M  S+ L D
Sbjct: 135 GHTYGLSH------CNNTDCVMNFSNSLKD 158


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 51/357 (14%), Positives = 90/357 (25%), Gaps = 10/357 (2%)

Query: 255 SGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAA 314
           SG +GQ +  S E     ++A             P      +    +  S    +    A
Sbjct: 37  SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLA 96

Query: 315 PKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELID 374
           P  PA     +    +     PP  PPA+  P+ +P    ++                  
Sbjct: 97  PASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156

Query: 375 AKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMS 434
           A  A            A   + P     + + P +   P +  + A              
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPI--- 213

Query: 435 RNSTRRNSKVLTPPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRP 494
             S   +S    P  +         S+     +        +     +    T       
Sbjct: 214 --SASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWE 271

Query: 495 NLPAFGSMRAKRPTSLAGSQRPSQPP--PPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGY 552
                G      P S + S R   P   P +P    +    +  +  + +     S    
Sbjct: 272 ASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS 331

Query: 553 QNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
            + S+  A    G                 P +P +    P       +PA   G+P
Sbjct: 332 SSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK---RPRPSRAPSSPAASAGRP 385


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 32.3 bits (73), Expect = 1.1
 Identities = 29/153 (18%), Positives = 48/153 (31%), Gaps = 18/153 (11%)

Query: 273 LLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIP----ATSVRLSADF 328
           ++AA   +      +    AAPV+      P+   +    A    P    A S      F
Sbjct: 108 MIAAMALRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLF 167

Query: 329 ANFSATLPPELPPANKGPNTSPYARPLISSPILSDT-TSQTVRELIDAKNAESTRQTTRP 387
           A+F  T P    P       +P   P I++ + +    +Q   E +D    +        
Sbjct: 168 ADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDLDDA---- 223

Query: 388 ASVAKSSTLPPVATESNALPGSTLHPVSSESNA 420
                     P      ALP   + P +  + A
Sbjct: 224 ---------LPQILPPAALPPIVVAPAAPAALA 247



 Score = 29.2 bits (65), Expect = 8.4
 Identities = 27/155 (17%), Positives = 49/155 (31%), Gaps = 1/155 (0%)

Query: 275 AAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSAT 334
           A +T +S   +       AP  PT P  P  A +     A       V  +A+    +  
Sbjct: 54  ALETARSDTPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA 113

Query: 335 LPPELPPANKGPNTSPYARPLISSPILSDTT-SQTVRELIDAKNAESTRQTTRPASVAKS 393
           L   +    + P  +P     I SP       +  VR     ++A S       A    +
Sbjct: 114 LRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTT 173

Query: 394 STLPPVATESNALPGSTLHPVSSESNAPKGSTLNR 428
           + +P    ++  +      P  + + A   +    
Sbjct: 174 APVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQD 208


>gnl|CDD|237325 PRK13267, PRK13267, archaemetzincin-like protein; Reviewed.
          Length = 179

 Score = 31.1 bits (71), Expect = 1.1
 Identities = 11/22 (50%), Positives = 13/22 (59%), Gaps = 6/22 (27%)

Query: 47  GHNLGLEHDTTECTCPSDRCIM 68
           GH LGLEH      C + RC+M
Sbjct: 134 GHTLGLEH------CDNPRCVM 149


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 32.3 bits (73), Expect = 1.4
 Identities = 15/51 (29%), Positives = 20/51 (39%), Gaps = 6/51 (11%)

Query: 507 PTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSA 557
           P    G + PSQPPPP P      PG        K  S +  + G+  +  
Sbjct: 12  PPPPPGFEPPSQPPPPPP------PGVNVKKRSRKQLSIVGDILGHSGNPI 56


>gnl|CDD|222320 pfam13688, Peptidase_M84, Metallo-peptidase family M84. 
          Length = 193

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 12/38 (31%)

Query: 47  GHNLGLEHD------TTEC-----TCPSD-RCIMAPSS 72
           GHN G  HD      +  C     TCP+  R IM PSS
Sbjct: 148 GHNFGAVHDCDSSTESQCCPLSSSTCPAGGRYIMNPSS 185


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 31.7 bits (72), Expect = 1.5
 Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 2/94 (2%)

Query: 305 DAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDT 364
           +A + P+ A      T+   S      + +  P+   A   P   P        P+    
Sbjct: 357 EALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416

Query: 365 TSQTVRELIDAKNAESTRQTTRPASVAKSSTLPP 398
            +  V    ++  A    +   P       T P 
Sbjct: 417 VAPPVPHTPES--APKLTRAAIPVDEKPKYTPPA 448


>gnl|CDD|200169 TIGR02232, myxo_disulf_rpt, Myxococcus cysteine-rich repeat.  This
           model represents a sequence region shared between
           several proteins of Myxococcus xanthus DK 1622 and some
           eukaryotic proteins that include human pappalysin-1
           (SP|Q13219). The region of about 40 amino acids contains
           several conserved Cys residues presumed to form
           disulfide bonds. The region appears in up to 13 repeats
           in Myxococcus.
          Length = 38

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 3/30 (10%)

Query: 77  SPVCGNGFVEDGEECDCG---LEDSCKNAC 103
           +P CG+G +E GEECD G     D C   C
Sbjct: 1   APTCGDGIIEPGEECDDGNTTSGDGCSATC 30


>gnl|CDD|215474 PLN02877, PLN02877, alpha-amylase/limit dextrinase.
          Length = 970

 Score = 31.7 bits (72), Expect = 1.5
 Identities = 17/98 (17%), Positives = 28/98 (28%), Gaps = 6/98 (6%)

Query: 283 DMSVKLPTRAAPVKP----TPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPE 338
              +  P+ A  + P      P+  S A +   R      ATS+   A          P 
Sbjct: 1   MALLLTPSPAVHLAPAPCVARPRASSAAFVLRSRWLRSRGATSISFVAHRGRNFRVRIPS 60

Query: 339 LPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAK 376
                     S  +         S  TS    +L+ ++
Sbjct: 61  ATSLAMATRPSSASSFFTLEL--SVPTSLFQDDLLYSR 96


>gnl|CDD|221242 pfam11816, DUF3337, Domain of unknown function (DUF3337).  This
           family of proteins are functionally uncharacterized.
           This family is only found in eukaryotes. This presumed
           domain is typically between 285 to 342 amino acids in
           length.
          Length = 320

 Score = 31.0 bits (70), Expect = 1.9
 Identities = 30/159 (18%), Positives = 43/159 (27%), Gaps = 13/159 (8%)

Query: 280 KSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPA-----------TSVRLSADF 328
           K S  SV L    +      P   S+ +      +   P            +S      F
Sbjct: 13  KDSSGSVTLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSKEGKLSSSLFGKKF 72

Query: 329 ANFSATLPP-ELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRP 387
              S+ L           P  S  A  L SS    D        L+ A           P
Sbjct: 73  RMSSSLLKKCGAASTEGKPQKSEKAIDLKSSKAEKDPEINLGGLLLRALLEYWKELKCNP 132

Query: 388 ASVAKSSTLPPVATESNALPGSTLHPVS-SESNAPKGST 425
             +  S+ LP +  E+  L       +  SE +   G  
Sbjct: 133 RVLVFSTFLPSLDNETPYLKLPPDTTIIISEESPDLGGG 171


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 31.1 bits (70), Expect = 2.1
 Identities = 27/110 (24%), Positives = 33/110 (30%), Gaps = 13/110 (11%)

Query: 507 PTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTP--------SGIQSLPGYQNHSAG 558
           P      Q  +Q P P P   Q    YQ+P  Q +          S  Q    Y   S  
Sbjct: 128 PQPPPAQQPQAQQPQPPPQVPQQ-QQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPY 186

Query: 559 QAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGL--GYQNPAVPP 606
           Q ++    E LP           +   P Q   GP Q     Y  P   P
Sbjct: 187 QPQSYPPNEPLPSSMAMQ--PPYSGAPPSQQFYGPPQPSPYMYGGPGGRP 234


>gnl|CDD|132975 cd06644, STKc_STK10_LOK, Catalytic domain of the Protein
           Serine/Threonine Kinase, STK10 or Lymphocyte-oriented
           kinase.  Serine/threonine kinases (STKs), STK10
           subfamily, catalytic (c) domain. STKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The
           STK10 subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. Other names for STK10 include
           lymphocyte-oriented kinase (LOK) and Xenopus polo-like
           kinase kinase 1 (xPlkk1). STK10 is highly expressed in
           lymphocytes and is responsible in regulating leukocyte
           function associated antigen (LFA-1)-mediated lymphocyte
           adhesion. It plays a role in regulating the CD28
           responsive element in T cells, and may also function as
           a regulator of polo-like kinase 1 (Plk1), a protein
           which is overexpressed in multiple tumor types.
          Length = 292

 Score = 31.2 bits (70), Expect = 2.1
 Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 14/85 (16%)

Query: 303 PSDAQITPIRAAPKIP-------ATSVRLSADFANFSATLPPELPPANKGPNTSPYARPL 355
           P   ++ P+R   KI        +   + S +F +F  T        +K P T P A  L
Sbjct: 213 PPHHELNPMRVLLKIAKSEPPTLSQPSKWSMEFRDFLKT------ALDKHPETRPSAAQL 266

Query: 356 ISSPILSDTTS-QTVRELIDAKNAE 379
           +  P +S  TS + +REL+    AE
Sbjct: 267 LEHPFVSSVTSNRPLRELVAEAKAE 291


>gnl|CDD|222468 pfam13948, DUF4215, Domain of unknown function (DUF4215).  The
           function of this family is unknown.
          Length = 47

 Score = 27.8 bits (62), Expect = 2.1
 Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 3/27 (11%)

Query: 78  PVCGNGFVEDGEECDCG---LEDSCKN 101
            +CG+G +   E+CD G     D C N
Sbjct: 19  SICGDGIIVGNEQCDDGNNVSFDGCYN 45


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 30.9 bits (70), Expect = 2.7
 Identities = 25/144 (17%), Positives = 34/144 (23%), Gaps = 11/144 (7%)

Query: 493 RPNLPAFGS--MRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLP 550
            P   A G      + P      Q P   P  +       PG          P      P
Sbjct: 389 SPMGGAMGQPPYYGQGPQQQFNGQ-PLGWPRMSMMPTPMGPGGP-LRPNGLAPMNAVRAP 446

Query: 551 GYQNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPK 610
                +A Q      +   P YQ+    Q            G  + L     +  P   K
Sbjct: 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506

Query: 611 TYDDCLNLLSESMAALANVDHNPN 634
                  +L E +  L      P 
Sbjct: 507 ------QVLGERLFPLV-EAIEPA 523


>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein.  This domain is
           found in the aflatoxin regulatory protein (AflR) which
           is involved in the regulation of the biosynthesis of
           aflatoxin in the fungal genus Aspergillus. It occurs
           together with the fungal Zn(2)-Cys(6) binuclear cluster
           domain (pfam00172).
          Length = 275

 Score = 30.3 bits (68), Expect = 3.0
 Identities = 33/127 (25%), Positives = 50/127 (39%), Gaps = 9/127 (7%)

Query: 293 APVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYA 352
           +   P PP     + I P  A P  P  +   S  F++ S    PEL  +   PN S   
Sbjct: 23  SSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQSPPAEPELWGSILSPNASNQD 82

Query: 353 RPLISSPILSDT------TSQTVRELIDAKNAESTRQTTRPASVAK---SSTLPPVATES 403
           +  +SS +  +T       S +   L D  +A+   + T   SVA    SS +  +   S
Sbjct: 83  QGDLSSLLSVNTDFGQLFASLSPSPLFDGNDADLHAEATGELSVADLEVSSPMQDLFLTS 142

Query: 404 NALPGST 410
              P S+
Sbjct: 143 ALSPPSS 149


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 30.7 bits (69), Expect = 3.2
 Identities = 28/98 (28%), Positives = 36/98 (36%), Gaps = 11/98 (11%)

Query: 494 PNLPAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQ 553
           P  P     R     S A +  P Q PP A S    LP    P+ QA  P  + S     
Sbjct: 408 PAKPEAPGARPAELPSPASAPTPEQQPPVARSA--PLP----PSPQASAPRNVASGKPGV 461

Query: 554 NHSAGQAK----TPSGIESLPGYQNPSAGQAKTPGTPG 587
           +  + Q K    T +G    P  Q  S+  A+T    G
Sbjct: 462 DLGSWQGKFMNFTRNGSRKQP-VQASSSDAAQTGVFEG 498


>gnl|CDD|214543 smart00180, EGF_Lam, Laminin-type epidermal growth factor-like
          domai. 
          Length = 46

 Score = 27.3 bits (61), Expect = 3.4
 Identities = 9/31 (29%), Positives = 11/31 (35%), Gaps = 1/31 (3%)

Query: 14 DCPFNCHDQGVCNSR-GHCHCHPGFAPPYCE 43
          DC       G C+   G C C P      C+
Sbjct: 2  DCDPGGSASGTCDPDTGQCECKPNVTGRRCD 32


>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V).
          This family is like pfam00008 but has 8 conserved
          cysteines instead of six.
          Length = 49

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 9/31 (29%), Positives = 14/31 (45%), Gaps = 5/31 (16%)

Query: 18 NCHDQG----VCNSR-GHCHCHPGFAPPYCE 43
          +C+  G     C+   G C C PG    +C+
Sbjct: 2  DCNPHGSLSDTCDPETGQCLCKPGVTGRHCD 32


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.5 bits (69), Expect = 3.7
 Identities = 20/114 (17%), Positives = 27/114 (23%), Gaps = 6/114 (5%)

Query: 495 NLPAFGSMRAKRPTSLAGSQRPSQPPPP---APSGIQSLPGY--QNPAGQAKTPSGIQSL 549
              A G    K+      +Q  + P P    A S   S      Q  A Q+ T       
Sbjct: 366 GDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTP- 424

Query: 550 PGYQNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPA 603
           P                 +    +     + K         LGP      Q  A
Sbjct: 425 PTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 564

 Score = 30.2 bits (68), Expect = 4.0
 Identities = 30/174 (17%), Positives = 52/174 (29%), Gaps = 12/174 (6%)

Query: 251 SNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITP 310
           S+S S          G           T ++   +    +R+    P+P    S    + 
Sbjct: 148 SSSKSASIDSSPTVIGPRPRSFSELNLTDRT--PAKVRSSRSELGAPSPSGGTSCPS-SS 204

Query: 311 IRAAPKIPATSVRLSADFANFSATLPPELPPANK--GPNTSPYARPLIS-SPILSDTTSQ 367
                 I +  +R SA      A L     P ++     +SP AR   + SP  S    +
Sbjct: 205 GGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARSSSAKSPFKSSIQRK 264

Query: 368 TVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAP 421
             + L       S      P   +KSS       + +     +     ++ N  
Sbjct: 265 ATKALSKLSLRAS------PKDTSKSSKSEVAPPKKSEAKVPSSSKKWTDGNVS 312


>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
           Provisional.
          Length = 408

 Score = 29.8 bits (67), Expect = 5.1
 Identities = 18/123 (14%), Positives = 40/123 (32%), Gaps = 1/123 (0%)

Query: 251 SNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITP 310
           +++ S        +       +    +TH  S    +    +A +    P         P
Sbjct: 160 ADAPSTVLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPGTPAQP 219

Query: 311 IRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVR 370
           +            + +  +  +A   P + P    P     A P++S+P+ S    Q++ 
Sbjct: 220 LTPLAAEAQAKAEVISTPSPVTAAASPTITPHQTQP-LPTAAAPVLSAPLGSHEWQQSLS 278

Query: 371 ELI 373
           + I
Sbjct: 279 QHI 281


>gnl|CDD|200561 cd10935, CE4_WalW, Putative catalytic domain of
          lipopolysaccharide biosynthesis protein WalW and its
          bacterial homologs.  This family corresponds to a group
          of uncharacterized lipopolysaccharide biosynthesis
          protein WalW found in bacteria. Although their
          biochemical properties remain to be determined, members
          of this family is composed of a seven-stranded barrel
          with detectable sequence similarity to the six-stranded
          barrel rhizobial NodB-like proteins, which remove
          N-linked or O-linked acetyl groups from cell wall
          polysaccharides and belong to the larger carbohydrate
          esterase 4 (CE4) superfamily.
          Length = 295

 Score = 29.9 bits (68), Expect = 5.1
 Identities = 13/43 (30%), Positives = 15/43 (34%), Gaps = 4/43 (9%)

Query: 11 PVADCPFNCHDQGVCNSRGHC----HCHPGFAPPYCEYPGGHN 49
          PVA  P           RG      H HP   PP+ E    +N
Sbjct: 52 PVASDPAAVAILARALDRGRAEIGAHLHPWVTPPFDELTDPYN 94


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 30.2 bits (68), Expect = 5.2
 Identities = 17/88 (19%), Positives = 27/88 (30%), Gaps = 9/88 (10%)

Query: 557 AGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKTYDDCL 616
            G+  T  G +       P       P                Q PA PP  P      +
Sbjct: 899 TGRVATAPGGD--AASAPPPGAGPPAPPQAVPPPRTT------QPPAAPPRGPDVPPAAV 950

Query: 617 NLLSESMAALANVDHNPNDNIY-AVIEE 643
             L E++A L +   + +   Y A ++ 
Sbjct: 951 AELRETLADLRSAQRSGDFTAYGAALDR 978


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 29.4 bits (66), Expect = 5.3
 Identities = 28/92 (30%), Positives = 35/92 (38%), Gaps = 33/92 (35%)

Query: 472 EDKVDMVKRTQSMRETDVNF--------KRPNLPAFGSMRAKRPTSLAGSQRPSQPPPPA 523
            DKV+ V++       +V F        KRP LP                 +P QPP P 
Sbjct: 102 GDKVEEVRK-------EVEFFNNYLTDAKRPALP---------------EIKPLQPPKPD 139

Query: 524 PSGI-QSLPGYQNPAGQAKTPSGI--QSLPGY 552
           P G+   LPGY     QA  P G     + GY
Sbjct: 140 PGGLAPGLPGYPPQTPQALMPYGQPRPPMMGY 171


>gnl|CDD|215078 PLN00150, PLN00150, potassium ion transporter family protein;
           Provisional.
          Length = 779

 Score = 29.8 bits (67), Expect = 5.4
 Identities = 26/139 (18%), Positives = 42/139 (30%), Gaps = 18/139 (12%)

Query: 317 IPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAK 376
           I   S R  +D  + +A+  PE          S                 Q+      ++
Sbjct: 639 IEIESFREQSDLESMAASWTPEELMGEGNSVGSGL----------FTQYDQSDINFATSQ 688

Query: 377 NAESTRQTTRPASVAKSSTLPPVATESNA-------LPGSTLHPVSSESNAPKGSTLNRI 429
             +         S   S T   VA  +         + G+       +S   K   +N I
Sbjct: 689 EWKRPSSQEDSVSGHSSDTQDEVAFLNKCKEAGVVYILGNNDVKARKDSGFFKKVIINYI 748

Query: 430 TSFMSRNSTRRNSKVLTPP 448
            +F+ R S R +  VL  P
Sbjct: 749 YTFLRRIS-RDSEVVLNIP 766


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 30.1 bits (68), Expect = 5.5
 Identities = 22/90 (24%), Positives = 33/90 (36%), Gaps = 13/90 (14%)

Query: 289 PTRAAPVKPTPPKVPSDAQI---TPIRAAPKIPATSVRLSADFANFSATLPPELPPANKG 345
              AA     P   P+   +    P +AAP+  A  VR++           P +  A   
Sbjct: 428 APAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP---------EPAVASAAPA 478

Query: 346 PNTSPYARPLISSPILSDTTSQTVRELIDA 375
           P  +P A  L  +    D    TV++L  A
Sbjct: 479 PAAAPAAARLTPTE-EGDVWHATVQQLAAA 507



 Score = 29.3 bits (66), Expect = 7.8
 Identities = 22/112 (19%), Positives = 35/112 (31%), Gaps = 9/112 (8%)

Query: 289 PTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNT 348
              A    P  P+  + A      AAP   A +    A     +A+ P   P A      
Sbjct: 373 AAPAEKKTPARPEAAAPA------AAPVAQAAAAPAPAAAPAAAASAPAAPPAA---APP 423

Query: 349 SPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVA 400
           +P A P  ++P  +   +     L  A  A++  +T               A
Sbjct: 424 APVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASA 475


>gnl|CDD|215613 PLN03174, PLN03174, Chalcone-flavanone isomerase-related;
           Provisional.
          Length = 278

 Score = 29.4 bits (66), Expect = 5.8
 Identities = 18/73 (24%), Positives = 28/73 (38%), Gaps = 5/73 (6%)

Query: 299 PPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISS 358
           PP+ PS        AA      +   +A  A  +A+  P  P      N    + P  +S
Sbjct: 13  PPRAPSFFAAAAAVAA-----AAAAAAAAAAAIAASRNPPHPFLQNALNFHNSSSPPWAS 67

Query: 359 PILSDTTSQTVRE 371
             L+D +  +V E
Sbjct: 68  ISLADPSPPSVVE 80


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.0 bits (67), Expect = 5.8
 Identities = 85/393 (21%), Positives = 127/393 (32%), Gaps = 55/393 (13%)

Query: 249 NHSNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSD-MSVKLPTRA-APVKPTPPKVPSDA 306
           +  NS S G G+     GEG++ +  +     SSD   +    R+ +P  P+P    SD+
Sbjct: 108 SRPNSPSEGEGE---GEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDS 164

Query: 307 ------QITPIRAAPKIPATSVRLSADFANF----SATLPPELPPANKGPNTSPYARP-- 354
                 Q+   +  P I        A  A      +  +PP+  P    P   P      
Sbjct: 165 DSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPL 224

Query: 355 -LISSPILSDTTSQT-----VRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPG 408
            LIS+P L      +       +    ++ +    ++R    +     PP+       P 
Sbjct: 225 SLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPV 284

Query: 409 STLHPVSS----ESNAPKGSTLNRITSFMSRNSTRRNSKVLT-----------PPATQGS 453
              HP S+       A        + S    +S    S+              PPA    
Sbjct: 285 FLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMP 344

Query: 454 ARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGS 513
                 + PIP    + ++                +    P  PA      K  +SL   
Sbjct: 345 HIKPPPTTPIPQLPNQSHK---HPPHLQGPSPFPQMPSNLPPPPAL-----KPLSSLPTH 396

Query: 514 QRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQ 573
             PS  PPP     QS P  Q+   Q    +  QSLP   +         SG+ S P  Q
Sbjct: 397 HPPSAHPPPLQLMPQSQP-LQSVPAQPPVLTQSQSLPPKASTHP-----HSGLHSGPP-Q 449

Query: 574 NPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPP 606
           +P A    T G      +GP   L    PA PP
Sbjct: 450 SPFAQHPFTSGGLPA--IGPPPSLPTSTPAAPP 480


>gnl|CDD|238012 cd00055, EGF_Lam, Laminin-type epidermal growth factor-like
          domain; laminins are the major noncollagenous
          components of basement membranes that mediate cell
          adhesion, growth migration, and differentiation; the
          laminin-type epidermal growth factor-like module occurs
          in tandem arrays; the domain contains 4 disulfide bonds
          (loops a-d) the first three resemble epidermal growth
          factor (EGF); the number of copies of this domain in
          the different forms of laminins is highly variable
          ranging from 3 up to 22 copies.
          Length = 50

 Score = 26.5 bits (59), Expect = 6.1
 Identities = 8/31 (25%), Positives = 12/31 (38%), Gaps = 5/31 (16%)

Query: 18 NCHDQG----VCNSR-GHCHCHPGFAPPYCE 43
          +C+  G     C+   G C C P      C+
Sbjct: 3  DCNGHGSLSGQCDPGTGQCECKPNTTGRRCD 33


>gnl|CDD|219938 pfam08618, Opi1, Transcription factor Opi1.  Opi1 is a leucine
           zipper containing yeast transcription factor that
           negatively regulates phospholipid biosynthesis. It
           represses the expression of several UAS(INO) cis acting
           element containing genes and its activity is mediated by
           phosphorylations catalyzed by protein kinase A, protein
           kinase C and casein kinase II.
          Length = 387

 Score = 29.7 bits (66), Expect = 6.3
 Identities = 24/168 (14%), Positives = 47/168 (27%), Gaps = 17/168 (10%)

Query: 409 STLHPVSSESNA-PKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEISEPIPLNN 467
            +  PV+S   A  + + +     +      R      T P+          S  +  N 
Sbjct: 40  RSAIPVASTVAAVGRATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNG 99

Query: 468 -----------LRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGSQRP 516
                      +    D +   +   ++   DV  +   +       A    S  G+   
Sbjct: 100 APPLIQKSSEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRGA--- 156

Query: 517 SQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPS 564
             P   +     +LPGY +      + S +   P     + G  + P 
Sbjct: 157 --PVRTSALSTSTLPGYDDHRSPRYSSSPVPQQPQTAVTANGGPRPPQ 202


>gnl|CDD|218745 pfam05783, DLIC, Dynein light intermediate chain (DLIC).  This
           family consists of several eukaryotic dynein light
           intermediate chain proteins. The light intermediate
           chains (LICs) of cytoplasmic dynein consist of multiple
           isoforms, which undergo post-translational modification
           to produce a large number of species. DLIC1 is known to
           be involved in assembly, organisation, and function of
           centrosomes and mitotic spindles when bound to
           pericentrin. DLIC2 is a subunit of cytoplasmic dynein 2
           that may play a role in maintaining Golgi organisation
           by binding cytoplasmic dynein 2 to its Golgi-associated
           cargo.
          Length = 490

 Score = 29.8 bits (67), Expect = 6.4
 Identities = 30/118 (25%), Positives = 44/118 (37%), Gaps = 15/118 (12%)

Query: 501 SMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSL----PGYQNHS 556
           S+ AK+P   A   R  + P  +P G         P   A   S + S+    P  +  +
Sbjct: 359 SLLAKQP---ATPTRGVESPARSPGGSPRTTNRSGPRNVASV-SPMTSVKKIDPNMKPGA 414

Query: 557 AGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPG-KPKTYD 613
           A +    +   SL       + +  +PG+PG G    G G   Q  A   G KP   D
Sbjct: 415 ASEGVLANFFNSL------LSKKTGSPGSPGGGGSPAGTGTNTQGTAKKSGQKPVLTD 466


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 29.7 bits (66), Expect = 7.1
 Identities = 51/246 (20%), Positives = 77/246 (31%), Gaps = 32/246 (13%)

Query: 276 AKTHKSSDMSV-----------KLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRL 324
           AK HK S +             K P    P +P  PK P  AQ      +PK+P      
Sbjct: 564 AKEHKPSKIPTLSKKPEFPKDPKHPKD--PEEPKKPKRPRSAQRPTRPKSPKLPELLDIP 621

Query: 325 SADFANFSATLP-----------PELPPANKGPNTS-PYARPLISSPILSDTTSQTVREL 372
            +     S   P           PE P   K   +  P   P    P       +   + 
Sbjct: 622 KSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSP--KPPFDPKFKEKFYDDY 679

Query: 373 IDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSF 432
           +DA       +TT     +  S L     E+   P +T  P+  +    +      I   
Sbjct: 680 LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739

Query: 433 MSRNSTRRNSKVLTPPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFK 492
            +      + +  TPP  +      E     PL ++   E K + +       E D   K
Sbjct: 740 DAEQPD--DIEFFTPPEEE-RTFFHETPADTPLPDILAEEFKEEDI--HAETGEPDEAMK 794

Query: 493 RPNLPA 498
           RP+ P+
Sbjct: 795 RPDSPS 800



 Score = 29.7 bits (66), Expect = 8.1
 Identities = 29/140 (20%), Positives = 47/140 (33%), Gaps = 21/140 (15%)

Query: 270 NYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFA 329
           +Y   AAK+ ++    V   +  + +K T P+ P     TP    PK+P           
Sbjct: 678 DYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDE-------- 729

Query: 330 NFSATLPPELPPANKGPNTSPYARPLISSPI----------LSDTTSQTVREL-IDAKNA 378
                  P   P  + P+   +  P                L D  ++  +E  I A+  
Sbjct: 730 --EFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETG 787

Query: 379 ESTRQTTRPASVAKSSTLPP 398
           E      RP S ++    PP
Sbjct: 788 EPDEAMKRPDSPSEHEDKPP 807


>gnl|CDD|235722 PRK06164, PRK06164, acyl-CoA synthetase; Validated.
          Length = 540

 Score = 29.3 bits (66), Expect = 7.4
 Identities = 12/50 (24%), Positives = 15/50 (30%)

Query: 507 PTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHS 556
           P   A  Q  + P P  P+         +      T SG  S P    H 
Sbjct: 153 PAPGARVQLFALPDPAPPAAAGERAADPDAGALLFTTSGTTSGPKLVLHR 202


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 29.2 bits (65), Expect = 7.5
 Identities = 21/132 (15%), Positives = 42/132 (31%), Gaps = 8/132 (6%)

Query: 336 PPELPPANKGPNTSPYARPLISSPILSDTTSQT-VRELIDAKNAESTRQTTRPASVAKSS 394
           PP     +  P  +  +R +  +P    ++    + E  +A+            + A + 
Sbjct: 237 PPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRAGTP 296

Query: 395 TLPPVATESNA--LPGSTLHPVSSESNAPKGSTLNRITS-----FMSRNSTRRNSKVLTP 447
                 +E +A     +   P  S S   +G   NR  S       S +   R  +    
Sbjct: 297 DDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSESRSRSRSRSGSRRYRRRRGRGV 356

Query: 448 PATQGSARSLEI 459
           P  +  +R   +
Sbjct: 357 PGRRSESRQDTV 368


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 29.5 bits (67), Expect = 7.9
 Identities = 19/77 (24%), Positives = 26/77 (33%)

Query: 287 KLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGP 346
           ++   +AP  PTP   PS        +APK    +   S   A  S  +P    P    P
Sbjct: 367 EIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSP 426

Query: 347 NTSPYARPLISSPILSD 363
                A     S  L +
Sbjct: 427 TPPANAANAPPSLNLEE 443


>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
           (MRP).  This model describes multi drug
           resistance-associated protein (MRP) in eukaryotes. The
           multidrug resistance-associated protein is an integral
           membrane protein that causes multidrug resistance when
           overexpressed in mammalian cells. It belongs to ABC
           transporter superfamily. The protein topology and
           function was experimentally demonstrated by epitope
           tagging and immunofluorescence. Insertion of tags in the
           critical regions associated with drug efflux, abrogated
           its function. The C-terminal domain seem to highly
           conserved [Transport and binding proteins, Other].
          Length = 1522

 Score = 29.5 bits (66), Expect = 8.7
 Identities = 15/60 (25%), Positives = 23/60 (38%), Gaps = 4/60 (6%)

Query: 351 YARPLISSPILS----DTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNAL 406
           Y +PL  S + S    DT+   V  L++    E  +   +P S       P     S+ L
Sbjct: 226 YRQPLEESDLWSLNKEDTSEMVVPVLVENWKKECKKTRKQPVSAVYGKKDPSKPKGSSQL 285


>gnl|CDD|219253 pfam06986, TraN, Type-1V conjugative transfer system mating pair
           stabilisation.  TraN is a large cysteine-rich outer
           membrane protein involved in the mating-pair
           stabilisation (adhesin) component of the F-type
           conjugative plasmid transfer system. TraN is believed to
           interact with the core type IV secretion system
           apparatus through the TraV protein.
          Length = 243

 Score = 28.9 bits (65), Expect = 8.9
 Identities = 18/74 (24%), Positives = 22/74 (29%), Gaps = 6/74 (8%)

Query: 118 ATGSCCNLETCQPHTAGRQCRAADRECDLPE---FCTGDSEF--CPSDVFKMDGETCKGG 172
            T  C +  TC    +   C      C        C   S    C   V       C GG
Sbjct: 33  RTYQCQSNNTCAALESQPGCSFTRSTCLEGSDEGVCLQWSVTYDCEQRVCSTTQYVC-GG 91

Query: 173 EAFCYEGSCRTHSD 186
           + FC +G C T   
Sbjct: 92  DVFCLDGDCLTPES 105


>gnl|CDD|233595 TIGR01845, outer_NodT, efflux transporter, outer membrane factor
           (OMF) lipoprotein, NodT family.  Members of This model
           comprise a subfamily of the Outer Membrane Factor (TCDB
           1.B.17) porins. OMF proteins operate in conjunction with
           a primary transporter of the RND, MFS, ABC, or PET
           systems, and a MFP (membrane fusion protein) to tranport
           substrates across membranes. The complex thus formed
           allows transport (export) of various solutes (heavy
           metal cations; drugs, oligosaccharides, proteins, etc.)
           across the two envelopes of the Gram-negative bacterial
           cell envelope in a single energy-coupled step. Current
           data suggest that the OMF (and not the MFP) is largely
           responsible for the formation of both the trans-outer
           membrane and trans-periplasmic channels. The roles
           played by the MFP have yet to be determined [Cellular
           processes, Detoxification, Transport and binding
           proteins, Porins].
          Length = 454

 Score = 29.3 bits (66), Expect = 9.1
 Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 1/59 (1%)

Query: 272 NLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFAN 330
           N LAA   K     +    R   +   PP +P       +R  P I A   RL+A  A 
Sbjct: 236 NALAALLGKGPSRGLA-IARPLLLDQLPPDLPLSLPSDLLRRRPDIRAAERRLAAANAQ 293


>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
           protein. 
          Length = 314

 Score = 28.7 bits (64), Expect = 9.9
 Identities = 30/159 (18%), Positives = 49/159 (30%), Gaps = 9/159 (5%)

Query: 380 STRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTR 439
           S   T+R   + +          +  L   T   VSS  NA     L R +  +   S +
Sbjct: 6   SLSPTSRKNHLFE-----DKDGGAPQLDHYT--SVSSSKNAKSEKNLKRHSIIIPALSWK 58

Query: 440 RNSKVLTPP--ATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLP 497
           R     +     ++ S  +L  S    +    + E+    V R      +  N      P
Sbjct: 59  RLVASGSKKKKTSKKSTANLASSSYQKIVIQLNRENLKKSVPRPSDTTLSCANLSSLREP 118

Query: 498 AFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNP 536
           + G   A  P+ L+G      P       I       +P
Sbjct: 119 SPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSP 157


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.128    0.390 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,079,005
Number of extensions: 3115737
Number of successful extensions: 2660
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2457
Number of HSP's successfully gapped: 190
Length of query: 718
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 614
Effective length of database: 6,324,786
Effective search space: 3883418604
Effective search space used: 3883418604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.1 bits)