RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17877
         (1262 letters)



>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 60.7 bits (147), Expect = 5e-09
 Identities = 107/487 (21%), Positives = 144/487 (29%), Gaps = 45/487 (9%)

Query: 683  PEVRTNEFLNPAALTTSSVPPSRDKPTS----LPPIITDRPLDNPHLPNILT-------- 730
             E R       A       PP  D P +     P I+ D P+  P  P +LT        
Sbjct: 2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEEL 2543

Query: 731  HSTPGESPEPYLPTSTLSSLGNIA--------RDSTPNVIAKEFSSRDKPTSLPPIITDR 782
             S     P P LP +   +  + +        R S P V ++       P S  P     
Sbjct: 2544 ASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVD 2603

Query: 783  PLDNPHLPNILTYSTPGESPEPHLPTSTLSSLGNIARDLTPNVIAKEFNTIASSGLPEYH 842
               +P  P   +   P ++  P  P  + S   N      P  +                
Sbjct: 2604 DRGDPRGPAPPS-PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV- 2661

Query: 843  VESKPTVTPIDLIITQSTSPTERQTSSHITSTTLTKVSEYSSTNFPNIPPIQNFPKLSKD 902
              S+P          Q++SP +R        T    V   +S   P  PP    P     
Sbjct: 2662 --SRPRRARRLGRAAQASSPPQRPRRRAARPT----VGSLTSLADPPPPPPTPEPAPHAL 2715

Query: 903  LSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSN 962
            +S    P  P       P       PP     P  P  P+     PP  +  P  P    
Sbjct: 2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTA-GPPAPAPPA 2773

Query: 963  FNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQN 1022
                 PP     P +        +LP   +       V      LPP ++    +PP  +
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833

Query: 1023 FNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQN 1082
               T PP        PP     P+LP   +  P    +       P  +   P  PPV+ 
Sbjct: 2834 AQPTAPP--------PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR 2885

Query: 1083 FNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPN 1142
                    S  +  LPP Q   P  PP     P  PP     P  PP     P  PP P 
Sbjct: 2886 LARPAVSRSTESFALPPDQ---PERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPR 2938

Query: 1143 FNPTLPP 1149
              P L P
Sbjct: 2939 PQPPLAP 2945



 Score = 57.6 bits (139), Expect = 5e-08
 Identities = 96/496 (19%), Positives = 143/496 (28%), Gaps = 62/496 (12%)

Query: 678  TFPTLPEVRTNEFLNPAALTTSSVPPSRDKPTSLPPIITDRPLDNPHL-PNILTHSTPGE 736
             FP  P  R      PA         +   P    P   D P     L P IL     GE
Sbjct: 2473 LFPGAPVYR-----RPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGE 2527

Query: 737  SPEPYLPTSTLSSLGNIARDSTPNVIAKEFSSRDKPTSLPP----------IITDRPLDN 786
               P + T  +  L  +A D           + D P  LPP          +   RP   
Sbjct: 2528 PVHPRMLT-WIRGLEELASDD----------AGDPPPPLPPAAPPAAPDRSVPPPRPAPR 2576

Query: 787  PHLPNILTYST-PGESPEPHLPTSTLSSLGNIARDLTPNVI---AKEFNTIASSGLPEYH 842
            P  P + + +  P   P+   P + +   G+      P+ +       +    S  P  +
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636

Query: 843  VESKPTVTPIDLIITQSTSPTERQTSSHITSTTLTKVSEYSSTNFPNIPPIQNFPKLSKD 902
                     +         P   + S    +  L + ++ SS      PP +   + ++ 
Sbjct: 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS------PPQRPRRRAAR- 2689

Query: 903  LSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSN 962
                  PT+  L ++  PP   P   P    +  +   P     LPP P+          
Sbjct: 2690 ------PTVGSLTSLADPPP--PPPTPEPAPHALVSATP-----LPPGPAAARQASPALP 2736

Query: 963  FNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQN 1022
              P  P +    P  P          P     P  P         PP     P +  +  
Sbjct: 2737 AAPAPPAVPA-GPATPGGPARPAR--PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793

Query: 1023 FNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQN 1082
               +LP   +       V      LPP ++    +PP  +  PT PP     P   P   
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP-----PPPGPPPP 2848

Query: 1083 FNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPN 1142
               +LP   +  P     +      P  K   P  PPV+            +  LPP   
Sbjct: 2849 ---SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905

Query: 1143 FNPTLPPIHINPTFPP 1158
              P  P     P   P
Sbjct: 2906 ERPPQPQAPPPPQPQP 2921



 Score = 42.6 bits (100), Expect = 0.002
 Identities = 63/340 (18%), Positives = 94/340 (27%), Gaps = 49/340 (14%)

Query: 893  IQNFPKLSKDLSFDFNPTLPPLQNINLPPSFNPSFPP-------------MSNFYPTLPP 939
            I+   +L+ D + D  P LPP      P + + S PP                  P  PP
Sbjct: 2537 IRGLEELASDDAGDPPPPLPPAA---PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPP 2593

Query: 940  MPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPP 999
                    P  P  +   P        LPP ++     PP  +     P         PP
Sbjct: 2594 QS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649

Query: 1000 VQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPP 1059
             +  +   P   +      P +          +  P  P      PT+  +++     PP
Sbjct: 2650 ERPRDDPAPGRVS-----RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP 2704

Query: 1060 KENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFP--PVKNINPTF 1117
                 P    + +  P  P         P          P     P  P  P     P  
Sbjct: 2705 PPTPEPAPHALVSATPLPPGPAAARQASPA-----LPAAPAPPAVPAGPATPGGPARPAR 2759

Query: 1118 PPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPKKNFNVISSQQNFDLSVSS 1177
            PP        PP     P  P  P      PP             ++  S+++       
Sbjct: 2760 PPT----TAGPP----APAPPAAP---AAGPP----RRLTRPAVASLSESRESLPSPWDP 2804

Query: 1178 AFPSNSNWLNNMTPTLPSLSLTTPTVFPLPSRTPTSPPAT 1217
            A P  +  +      LP  +     + P  S  PT+PP  
Sbjct: 2805 ADPPAA--VLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842



 Score = 38.0 bits (88), Expect = 0.040
 Identities = 79/452 (17%), Positives = 124/452 (27%), Gaps = 71/452 (15%)

Query: 693  PAALTTSSVPPSRDKPTSLPPIITD-RPLDNPHLPNILTHSTPGESPEPYLPTSTLSSLG 751
                  +S PP R +  +  P +     L +P  P      TP  +P   +  + L    
Sbjct: 2670 LGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP----PTPEPAPHALVSATPLPPGP 2725

Query: 752  NIARDSTPNVIAKEFSSRDKPTSLPPIITDRPLDNPHLPNILTYSTPGESPEPHLPTSTL 811
              AR ++P + A   +    P              P        + P     P  P +  
Sbjct: 2726 AAARQASPALPAAP-APPAVPAGPATPGGPARPARPPTT-----AGPPAPAPPAAPAAG- 2778

Query: 812  SSLGNIARDLTPNVIAKEFNTIASSGLPEYHVESKPTVTPIDLIITQSTSPTERQ---TS 868
                   R LT   +A    +  S   P    +    V      +  + SP       TS
Sbjct: 2779 -----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833

Query: 869  SHITSTTLTK--VSEYSSTNFPNIP--PIQNFPKLSKDLSFDFNPTLPPLQNINLPP--- 921
            +  T+                   P   ++  P      +    P  PP++ +  P    
Sbjct: 2834 AQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR 2893

Query: 922  ---SF-NPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPP------MS 971
               SF  P   P     P  PP P      PP P   P  P      P L P        
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953

Query: 972  NFNPTIP--------------PKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNF---- 1013
              +  +P              P+       P  +    + PP     H+L  +S++    
Sbjct: 2954 EPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP--LTGHSLSRVSSWASSL 3011

Query: 1014 ----NPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPP 1069
                    PPV    +  PP    +     + + +     +   +P  P      P  P 
Sbjct: 3012 ALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPE-----PHDPF 3066

Query: 1070 VQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQ 1101
                +   P          P S F P  PP+ 
Sbjct: 3067 A---HEPDPATPEAGARESPSSQFGP--PPLS 3093


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
            Transcription initiation factor IIA (TFIIA) is a
            heterotrimer, the three subunits being known as alpha,
            beta, and gamma, in order of molecular weight. The N and
            C-terminal domains of the gamma subunit are represented
            in pfam02268 and pfam02751, respectively. This family
            represents the precursor that yields both the alpha and
            beta subunits. The TFIIA heterotrimer is an essential
            general transcription initiation factor for the
            expression of genes transcribed by RNA polymerase II.
            Together with TFIID, TFIIA binds to the promoter region;
            this is the first step in the formation of a
            pre-initiation complex (PIC). Binding of the rest of the
            transcription machinery follows this step. After
            initiation, the PIC does not completely dissociate from
            the promoter. Some components, including TFIIA, remain
            attached and re-initiate a subsequent round of
            transcription.
          Length = 332

 Score = 48.2 bits (115), Expect = 2e-05
 Identities = 25/136 (18%), Positives = 35/136 (25%), Gaps = 4/136 (2%)

Query: 978  PPKENFNPTLPPVQN-INPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLP-PVQNFNP 1035
            P  +   P     Q    P                +  PT  P  N  +T   P     P
Sbjct: 51   PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGP 110

Query: 1036 TL--PPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNF 1093
            T+   P + +   +P M   NP   P +         Q       P      +    +  
Sbjct: 111  TIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQK 170

Query: 1094 NPTLPPVQNFNPTFPP 1109
            N      Q  N   PP
Sbjct: 171  NDESQLQQQPNGETPP 186



 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 25/144 (17%), Positives = 34/144 (23%), Gaps = 9/144 (6%)

Query: 936  TLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINP 995
              P  PS     P      P             P  +      P  +     PP     P
Sbjct: 46   EFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGS-PAANPPATFALP 104

Query: 996  T--------FPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTL 1047
                       P Q +   +P M   NP   P+          Q       P     P+ 
Sbjct: 105  AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQ 164

Query: 1048 PPMSNFNPTIPPKENFNPTLPPVQ 1071
               +  N     ++  N   PP Q
Sbjct: 165  QQSAQKNDESQLQQQPNGETPPQQ 188



 Score = 38.2 bits (89), Expect = 0.022
 Identities = 21/142 (14%), Positives = 27/142 (19%), Gaps = 5/142 (3%)

Query: 1014 NPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNI 1073
                PPV      LP                      +      PP     P  P    I
Sbjct: 53   PQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTI 112

Query: 1074 NPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNF 1133
                 P Q +   +P M   NP   P+          +       P     P+    Q  
Sbjct: 113  QT--EPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPS---QQQS 167

Query: 1134 NPTLPPVPNFNPTLPPIHINPT 1155
                                 T
Sbjct: 168  AQKNDESQLQQQPNGETPPQQT 189


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
            of this family are necessary for accurate chromosome
            transmission during cell division.
          Length = 804

 Score = 48.2 bits (115), Expect = 3e-05
 Identities = 28/155 (18%), Positives = 34/155 (21%), Gaps = 1/155 (0%)

Query: 998  PPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTI 1057
                      P         P    F    PP Q      PP  +     P      P+ 
Sbjct: 166  RQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQ 225

Query: 1058 PPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTF 1117
             P +   P   P Q      P     +  +PP     P            PP +N     
Sbjct: 226  APAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPH 285

Query: 1118 P-PVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIH 1151
            P   Q  N   PP Q                 P  
Sbjct: 286  PGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQF 320



 Score = 35.1 bits (81), Expect = 0.25
 Identities = 22/146 (15%), Positives = 32/146 (21%), Gaps = 1/146 (0%)

Query: 1064 NPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNF 1123
                P         PP Q   +  PP  +     P      P+  P +   P   P Q  
Sbjct: 182  QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPP 241

Query: 1124 NPTFPPIQNFNPTLPPVPNFNPTLPPIH-INPTFPPKKNFNVISSQQNFDLSVSSAFPSN 1182
                P     +  +PP P   P            PP +N            +     P  
Sbjct: 242  PLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQ 301

Query: 1183 SNWLNNMTPTLPSLSLTTPTVFPLPS 1208
               L  +                +  
Sbjct: 302  PQLLPLVQQPQGQQRGPQFREQLVQL 327


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
            family represents a conserved region approximately 350
            residues long within a number of plant proteins of
            unknown function.
          Length = 357

 Score = 41.5 bits (97), Expect = 0.002
 Identities = 45/264 (17%), Positives = 74/264 (28%), Gaps = 27/264 (10%)

Query: 924  NPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENF 983
             P+   ++       P+   N  LPP P+     P         PP   +   +PP++  
Sbjct: 50   PPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQ-QQSQAPPSHQYPSQLPPQQVQ 108

Query: 984  NPTLPPVQNINPTFPPVQNFNHT---LPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPV 1040
            +    P     P +PP           P      P     Q      PP Q      PP 
Sbjct: 109  SVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPP 168

Query: 1041 ENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPV 1100
            +    + P +S   P   P +    + PP + +  +      ++   P    + P  P  
Sbjct: 169  Q--AQSAPQVSGLYPEESPYQ--PQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSP 224

Query: 1101 QNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPKK 1160
              +           P   P   F     P  +            P+        ++ P  
Sbjct: 225  YMYG---------GPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNHGSVASYAP-- 273

Query: 1161 NFNVISSQQNFDLSVSSAFPSNSN 1184
                    Q    S S+A+PS   
Sbjct: 274  --------QGSSQSYSTAYPSLPA 289



 Score = 38.8 bits (90), Expect = 0.018
 Identities = 43/246 (17%), Positives = 70/246 (28%), Gaps = 18/246 (7%)

Query: 909  PTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLP 968
            P  P  Q+           PP   +   LPP    ++   P P   P +P  S   P  P
Sbjct: 74   PPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPS--QPQPP 131

Query: 969  PMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLP 1028
            P        P      P   P Q    + P    +    PP +   P +  +    S   
Sbjct: 132  PAQQPQAQQPQP----PPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQ 187

Query: 1029 PVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNI-NPTFPPVQNFNHTL 1087
            P      + PP E    ++     ++   P ++ + P  P       P   P   F    
Sbjct: 188  P-----QSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQ 242

Query: 1088 PPMSNFNPTLPPVQNFNPTFPPVKNINPTFPP--VQNFNPTFPPIQNFNPTLPPVPNFNP 1145
             P  +            P+     ++    P    Q+++  +P      P    +P   P
Sbjct: 243  QPPPSQGQEGYGYSGPPPSKGNHGSVASYAPQGSSQSYSTAYPS----LPAATVLPQALP 298

Query: 1146 TLPPIH 1151
                  
Sbjct: 299  MSSAPM 304



 Score = 29.9 bits (67), Expect = 8.2
 Identities = 40/172 (23%), Positives = 50/172 (29%), Gaps = 15/172 (8%)

Query: 1044 NPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNF 1103
             P    ++       P +  N  LPP        P  Q  +   PP   +   LPP Q  
Sbjct: 50   PPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQA-PPSHQYPSQLPPQQVQ 108

Query: 1104 NPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPKKNFN 1163
            +    P     P +PP     P  PP Q      P  P   P        P  P      
Sbjct: 109  SVPQQPTPQQEPYYPPPSQ--PQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQP------ 160

Query: 1164 VISSQQNFDLSVSSAFPSNSNWLNNMTPTLPSLSLTTPTVFPLPSRTPTSPP 1215
                QQN      SA   +  +        P    + P   PLPS     PP
Sbjct: 161  --QYQQNPPPQAQSAPQVSGLY----PEESPYQPQSYPPNEPLPSSMAMQPP 206


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the protein
            product of the dentatorubral-pallidoluysian atrophy
            (DRPLA) gene. DRPLA OMIM:125370 is a progressive
            neurodegenerative disorder. It is caused by the expansion
            of a CAG repeat in the DRPLA gene on chromosome 12p. This
            results in an extended polyglutamine region in
            atrophin-1, that is thought to confer toxicity to the
            protein, possibly through altering its interactions with
            other proteins. The expansion of a CAG repeat is also the
            underlying defect in six other neurodegenerative
            disorders, including Huntington's disease. One
            interaction of expanded polyglutamine repeats that is
            thought to be pathogenic is that with the short glutamine
            repeat in the transcriptional coactivator CREB binding
            protein, CBP. This interaction draws CBP away from its
            usual nuclear location to the expanded polyglutamine
            repeat protein aggregates that are characteristic of the
            polyglutamine neurodegenerative disorders. This
            interferes with CBP-mediated transcription and causes
            cytotoxicity.
          Length = 979

 Score = 38.5 bits (89), Expect = 0.024
 Identities = 70/322 (21%), Positives = 106/322 (32%), Gaps = 32/322 (9%)

Query: 909  PTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLP 968
            P  PP   +    +  PS PP +     +PP  S     P      PS  ++ +     P
Sbjct: 175  PQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHP 234

Query: 969  PMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIP------PVQN 1022
                  P+  P     P  P   +     PP  +  H         P +P      PV  
Sbjct: 235  QRL---PSPHP-----PLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFL 286

Query: 1023 FNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENF-NPTLPPVQNINPTFPPVQ 1081
             + +  P Q F   L   +     LP  +  +   PP ++   P  PP +      PP  
Sbjct: 287  QHPSSNPPQPF--GLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQ---PLPPAP 341

Query: 1082 NFNHTLPPMSNFNPTLPPVQNFNPTF----PPVKNINPTFPPVQNFNPTFPPIQNFNPTL 1137
            +  H  PP +   P LP   + +P       P   +    PP     P      +  P+ 
Sbjct: 342  SMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSA 401

Query: 1138 PPVP-NFNPTLPPIHINPTFPPKKNFNVISSQQNFDLSVSSAFPSNSNWLNNMTP--TLP 1194
             P P    P   P+   P  PP     V++  Q+     S+   S  +     +P    P
Sbjct: 402  HPPPLQLMPQSQPLQSVPAQPP-----VLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHP 456

Query: 1195 SLSLTTPTVFPLPSRTPTSPPA 1216
              S   P + P PS   ++P A
Sbjct: 457  FTSGGLPAIGPPPSLPTSTPAA 478



 Score = 32.7 bits (74), Expect = 1.6
 Identities = 64/344 (18%), Positives = 104/344 (30%), Gaps = 41/344 (11%)

Query: 767  SSRDKPTSLPPIITDRPLDNPHLPNILTYST----PGESPEPHLPTSTLSSLGNIARDLT 822
            S++  P    PI             +   S     P   P PH P    ++     +   
Sbjct: 199  SAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPA 258

Query: 823  PNVIAKEFNTIASSGLPEYHVESKPTVTPIDLIITQSTSPTERQTSSHITSTTLTKVSEY 882
            P+         +S   P   +       P+ L    S  P   Q      S         
Sbjct: 259  PSSRH----PQSSHHGPGPPMPHALQQGPVFLQHPSSNPP---QPFGLAQSQVPPLPLPS 311

Query: 883  SSTNFPNIPPIQNFPKLSKDLSFDFNPTLPPLQNINLPPSFN-PSFPPMSNFYPTL---- 937
             +    + PP Q+  +  +       P  P + +I  PP+   P  P  S+ +P      
Sbjct: 312  QAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGP 371

Query: 938  PPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTF 997
             P P     LPP P+  P     ++  P+  P         P     P   P+Q++ P  
Sbjct: 372  SPFPQMPSNLPPPPALKPLSSLPTHHPPSAHP---------PPLQLMPQSQPLQSV-PAQ 421

Query: 998  PPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTI 1057
            PPV   + +LPP ++ +P          +      F     P     P+LP         
Sbjct: 422  PPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPT-------- 473

Query: 1058 PPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQ 1101
                +     P     +    P  +   +    +   P LPP+Q
Sbjct: 474  ----STPAAPPRA---SSGSQPPGSALPSSGGCAGPGPPLPPIQ 510



 Score = 31.2 bits (70), Expect = 4.9
 Identities = 65/332 (19%), Positives = 102/332 (30%), Gaps = 33/332 (9%)

Query: 693  PAALTTSSVPPSRDKPTSLPPIITDRPLDNPHLPNILTHST----PGESPEPYLPTSTLS 748
             A       P ++  P    PI             +   S     P   P P+ P    +
Sbjct: 189  LAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQT 248

Query: 749  SLGNIARDSTPNVIAKEFSSRDKPTSLP------PIITDRPLDNPHLPNILTYSTPGESP 802
            +     +   P+    + S       +P      P+    P  NP  P  L  S     P
Sbjct: 249  ASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLP 308

Query: 803  EPHLPTSTLSSLGNIARDLTPNVIAKEFNTIASSGLPEYHVESKPTVTPIDLIITQS--- 859
             P       S        L P    +E     +  +P  H++  PT TPI  +  QS   
Sbjct: 309  LPS-QAQPHSHTPPSQSALQPQQPPREQPLPPAPSMP--HIKPPPT-TPIPQLPNQSHKH 364

Query: 860  ------TSPTERQTSSHITSTTLTKVSEYSSTNFPNI--PPIQNFPKLSKDLSFDFNPTL 911
                   SP  +  S+      L  +S   + + P+   PP+Q  P   +       P  
Sbjct: 365  PPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMP---QSQPLQSVPAQ 421

Query: 912  PPL--QNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPP 969
            PP+  Q+ +LPP  +       +  P   P      T   +P+  P     ++     P 
Sbjct: 422  PPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPR 481

Query: 970  MSNFNPTIPPKENFNPTLPPVQNINPTFPPVQ 1001
             S       P  +  P+        P  PP+Q
Sbjct: 482  AS---SGSQPPGSALPSSGGCAGPGPPLPPIQ 510


>gnl|CDD|173288 PRK14827, PRK14827, undecaprenyl pyrophosphate synthase; Provisional.
          Length = 296

 Score = 36.9 bits (85), Expect = 0.055
 Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 7/55 (12%)

Query: 1105 PTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPK 1159
            P  PP  +  PTFP    +   FP        LPP P   P  PP H +    P+
Sbjct: 14   PQLPPAPDDYPTFPDKSTWPVVFP-------ELPPAPYGGPCRPPQHTSKAAAPR 61



 Score = 30.7 bits (69), Expect = 5.3
 Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 947 LPPVPSFNPSFPTMSNFN---PTLPPMSNFNPTIPPK 980
           LPP P   P+FP  S +    P LPP     P  PP+
Sbjct: 16  LPPAPDDYPTFPDKSTWPVVFPELPPAPYGGPCRPPQ 52


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is a
            positive regulator of Vps4. Vps4 is an ATPase that is
            required in the multivesicular body (MVB) sorting pathway
            to dissociate the endosomal sorting complex required for
            transport (ESCRT). Vta1 promotes correct assembly of Vps4
            and stimulates its ATPase activity through its conserved
            Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 36.6 bits (85), Expect = 0.067
 Identities = 27/137 (19%), Positives = 40/137 (29%), Gaps = 26/137 (18%)

Query: 900  SKDLSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPT 959
            + D SF      P   + + PPS +P  P   +           +L         P  P 
Sbjct: 167  NSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSL---------PPAP- 216

Query: 960  MSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPP 1019
                        +  P   P+   NP+ PP         P       +PP+S   PT P 
Sbjct: 217  --------SSFQSDTPPPSPESPTNPSPPP-----GPAAPPPPPVQQVPPLSTAKPTPPS 263

Query: 1020 VQNFNSTLPPVQNFNPT 1036
                 +T  P+      
Sbjct: 264  AS---ATPAPIGGITLD 277



 Score = 35.0 bits (81), Expect = 0.22
 Identities = 26/126 (20%), Positives = 38/126 (30%), Gaps = 8/126 (6%)

Query: 1031 QNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPM 1090
                P     E+ +       N  P             P  + +P  P   +        
Sbjct: 149  NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDP-PSSSPGVPSFPSPPEDPSSP 207

Query: 1091 SNFNPTLPPVQNFNPTFPPVKNINPTFP--PVQNFNPTFPPIQNFNPTLPPVPNFNPTLP 1148
            S+ +    P  +F    PP    +PT P  P     P  PP+Q     +PP+    PT P
Sbjct: 208  SDSSLPPAP-SSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQ----QVPPLSTAKPTPP 262

Query: 1149 PIHINP 1154
                 P
Sbjct: 263  SASATP 268



 Score = 34.3 bits (79), Expect = 0.32
 Identities = 29/136 (21%), Positives = 45/136 (33%), Gaps = 17/136 (12%)

Query: 921  PSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPK 980
               + +     N +P     P+   +    PS +P  P+  +        S+ +    P 
Sbjct: 159  EDADVATTNSDNSFPGEDADPASA-SPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPS 217

Query: 981  ENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPV 1040
             +F    PP    +PT P         PP     P  PPVQ     +PP+    PT P  
Sbjct: 218  -SFQSDTPPPSPESPTNPS--------PPPGPAAPPPPPVQ----QVPPLSTAKPTPPSA 264

Query: 1041 ENFNPTLPPMSNFNPT 1056
               + T  P+      
Sbjct: 265  ---SATPAPIGGITLD 277



 Score = 33.5 bits (77), Expect = 0.68
 Identities = 22/116 (18%), Positives = 31/116 (26%), Gaps = 10/116 (8%)

Query: 968  PPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTL 1027
                N  P             P  + +P  P   +        S+ +    P  +F S  
Sbjct: 166  TNSDNSFPGEDADPASASPSDP-PSSSPGVPSFPSPPEDPSSPSDSSLPPAP-SSFQSDT 223

Query: 1028 PPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPK-ENFNPTLPPVQNINPTFPPVQN 1082
            PP    +PT       NP+ PP     P  P +      T  P        P    
Sbjct: 224  PPPSPESPT-------NPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIG 272



 Score = 32.0 bits (73), Expect = 1.8
 Identities = 29/101 (28%), Positives = 31/101 (30%), Gaps = 15/101 (14%)

Query: 883 SSTNFPNIPPIQNFPKLSKDLSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLP-PMP 941
           SS   P+ P     P    D       +LPP      P SF    PP S   PT P P P
Sbjct: 190 SSPGVPSFPSPPEDPSSPSD------SSLPPA-----PSSFQSDTPPPSPESPTNPSPPP 238

Query: 942 SFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKEN 982
                 PP     P   T     PT P  S     I     
Sbjct: 239 GPAAPPPPPVQQVPPLSTA---KPTPPSASATPAPIGGITL 276


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
           trypanosomal proteins resemble vertebrate mucins. The
           protein consists of three regions. The N and C terminii
           are conserved between all members of the family, whereas
           the central region is not well conserved and contains a
           large number of threonine residues which can be
           glycosylated. Indirect evidence suggested that these
           genes might encode the core protein of parasite mucins,
           glycoproteins that were proposed to be involved in the
           interaction with, and invasion of, mammalian host cells.
           This family contains an N-terminal signal peptide.
          Length = 143

 Score = 33.3 bits (75), Expect = 0.27
 Identities = 26/57 (45%), Positives = 27/57 (47%), Gaps = 1/57 (1%)

Query: 246 TGTCILPSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNT-EAPTT 301
           T T   P TT+TTT T T T   TTT    T    TTTT  T        T EAPTT
Sbjct: 48  TTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTT 104


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 35.0 bits (80), Expect = 0.28
 Identities = 51/245 (20%), Positives = 65/245 (26%), Gaps = 32/245 (13%)

Query: 912  PPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMS 971
            PP    ++P +  P   PM      L P+P   L + P+      FPT            
Sbjct: 608  PPTTQSHIPETSAPRQWPMP-----LRPIPMRPLRMQPITFNVLVFPT------------ 650

Query: 972  NFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQ 1031
               P  PP+    P  P       T      +    P  +  N  +P      +  PP +
Sbjct: 651  ---PHQPPQVEITPYKP-----TWTQIGHIPYQ---PSPTGANTMLPIQWAPGTMQPPPR 699

Query: 1032 NFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMS 1091
               P  PP         P +      PP        PP        PP        PP +
Sbjct: 700  APTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 759

Query: 1092 NFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVP-NFNPTLPPI 1150
                  PP        P      P  PP     P   P     P   P      P   P 
Sbjct: 760  APGRARPPAAAPGAPTPQPP---PQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPG 816

Query: 1151 HINPT 1155
               PT
Sbjct: 817  QQGPT 821


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family consists
            of several Mycoplasma species specific Cytadhesin P32 and
            P30 proteins. P30 has been found to be membrane
            associated and localised on the tip organelle. It is
            thought that it is important in cytadherence and
            virulence.
          Length = 279

 Score = 33.5 bits (76), Expect = 0.61
 Identities = 33/140 (23%), Positives = 40/140 (28%), Gaps = 22/140 (15%)

Query: 913  PLQNINLPPSFNPSFPPMSNFYPTLPPM---PSFNLTLPPVPSFNPSFPTMSNFNPTLPP 969
            P       P+      P    +P  P M   P FN   P +P   P     +   P   P
Sbjct: 155  PQVQPQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPP-----NQMRPGFNP 209

Query: 970  MSNF--NPTIPPKENFNPTLPPVQNINPTFPPVQN-FNHTLPPMSNFNPTIPPVQNFNST 1026
            M      P      N  P +       P F P    FNH   P +   P +     FN  
Sbjct: 210  MPGMPPRPGFNQNPNMMPNMN-----RPGFRPQPGGFNH---PGTPMGPNMQQRPGFNPN 261

Query: 1027 LPPVQNFNPT-LPPVENFNP 1045
                 N  P    P   F P
Sbjct: 262  --QGMNPPPHMAGPRAGFPP 279



 Score = 30.0 bits (67), Expect = 7.5
 Identities = 29/109 (26%), Positives = 36/109 (33%), Gaps = 19/109 (17%)

Query: 905 FDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFN 964
           F   P +      N  P   P  PP +   P   PMP     +PP P FN +   M N N
Sbjct: 176 FPMQPNMGMRPGFNQMPPHMPGMPP-NQMRPGFNPMPG----MPPRPGFNQNPNMMPNMN 230

Query: 965 -PTLPPMSN--------FNPTIPPKENFNPTL-----PPVQNINPTFPP 999
            P   P             P +  +  FNP       P +      FPP
Sbjct: 231 RPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFPP 279


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 33.5 bits (77), Expect = 0.71
 Identities = 12/90 (13%), Positives = 37/90 (41%), Gaps = 4/90 (4%)

Query: 591 STPGTKTKVIDKLNIMT--SDLEKSLKSVVSSVMNELNTSNILPTTTPRNSFYNVINSLD 648
                 T V  +        +L ++L+ ++  ++ E  +  ++      + F ++I+S+ 
Sbjct: 96  EAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELV--SIFNDLIDSIK 153

Query: 649 KTNDNARFDDELTSRQRDIHSKIKNIGEIT 678
           + N     +  + S + ++    K + E+ 
Sbjct: 154 EDNLKDDLESLIASAKEELDQLSKKLAELK 183


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 33.7 bits (77), Expect = 0.85
 Identities = 18/60 (30%), Positives = 22/60 (36%), Gaps = 3/60 (5%)

Query: 252 PSTTSTTTETPTPTRIETTTLAPNT---VAAVTTTTFATSKEPLKPNTEAPTTLPTIPSV 308
              TST  E  TP   + T  A NT   VAA T     +S E  + +        T  S 
Sbjct: 35  TPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSE 94



 Score = 33.3 bits (76), Expect = 1.0
 Identities = 10/47 (21%), Positives = 18/47 (38%), Gaps = 3/47 (6%)

Query: 252 PSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNTEA 298
           P   +T  E P+ +    T    +  +  T TT + ++      TE 
Sbjct: 62  PVAATTAAEAPSSS---ETAETSDPTSEATDTTTSEARTVTPAATET 105


>gnl|CDD|217186 pfam02686, Glu-tRNAGln, Glu-tRNAGln amidotransferase C subunit.
           This is a family of Glu-tRNAGln amidotransferase C
           subunits. The Glu-tRNA Gln amidotransferase enzyme
           itself is an important translational fidelity mechanism
           replacing incorrectly charged Glu-tRNAGln with the
           correct Gln-tRANGln via transmidation of the misacylated
           Glu-tRNAGln. This activity supplements the lack of
           glutaminyl-tRNA synthetase activity in gram-positive
           eubacterteria, cyanobacteria, Archaea, and organelles.
          Length = 72

 Score = 30.1 bits (69), Expect = 1.1
 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 2/36 (5%)

Query: 601 DKLNIMTSDLEKSLKSVVSSVMNELNTSNILPTTTP 636
           ++L   T  L K L  V    +NE++T  + PT+ P
Sbjct: 4   EELEKFTKQLNKILDYV--EQLNEVDTEGVEPTSHP 37


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 32.5 bits (74), Expect = 1.6
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 246 TGTCILPSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATS 288
             T ++ + T T T+T T +   T T +P   +  TTTT + S
Sbjct: 800 ETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPS 842



 Score = 32.1 bits (73), Expect = 2.2
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 252 PSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFAT 287
            +TTS+ + T T +  +T+T    T +   TTT   
Sbjct: 814 QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849



 Score = 31.8 bits (72), Expect = 3.4
 Identities = 15/55 (27%), Positives = 19/55 (34%), Gaps = 7/55 (12%)

Query: 256 STTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNTEAPTTLPTIPSVLT 310
              T +   T+  T T    +  + T TT  T        T   TT  T PS  T
Sbjct: 798 LIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQ-------TSTSTTTTTSPSQTT 845


>gnl|CDD|238384 cd00755, YgdL_like, Family of activating enzymes (E1) of
           ubiquitin-like proteins related to the E.coli
           hypothetical protein ygdL. The common reaction mechanism
           catalyzed by E1-like enzymes begins with a nucleophilic
           attack of the C-terminal carboxylate of the
           ubiquitin-like substrate, on the alpha-phosphate of an
           ATP molecule bound at the active site of the activating
           enzymes, leading to the formation of a high-energy
           acyladenylate intermediate and subsequently to the
           formation of a thiocarboxylate at the C termini of the
           substrate. The exact function of this family is unknown.
          Length = 231

 Score = 31.4 bits (72), Expect = 2.4
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 1/49 (2%)

Query: 411 PQVEVDVPNSFTTRKSPVDDLIIREIAPIVDAIPSKIPKFEMGITSRTH 459
           P+ EVD    F T  +  +DL+  +   +VDAI S   K  +    R  
Sbjct: 79  PECEVDAVEEFLTPDN-SEDLLGGDPDFVVDAIDSIRAKVALIAYCRKR 126


>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit.  This
           model describes the bacterial oxaloacetate decarboxylase
           alpha subunit and its equivalents in archaea. The
           oxaloacetate decarboxylase Na+ pump is the paradigm of
           the family of Na+ transport decarboxylases that present
           in bacteria and archaea. It a multi subunit enzyme
           consisting of a peripheral alpha-subunit and integral
           membrane subunits beta and gamma. The energy released by
           the decarboxylation reaction of oxaloacetate is coupled
           to Na+ ion pumping across the membrane [Transport and
           binding proteins, Cations and iron carrying compounds,
           Energy metabolism, Other].
          Length = 582

 Score = 32.1 bits (73), Expect = 2.5
 Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 4/33 (12%)

Query: 162 YGHPAAETSVYILDVVHREEPFDSEINLEYYLS 194
             HP  ET V  L    R   +D+ +++E  L 
Sbjct: 232 TSHPPTETMVAAL----RGTGYDTGLDIELLLE 260


>gnl|CDD|188622 cd09902, H3TH_MKT1, H3TH domain of Mkt1: A global regulator of mRNAs
            encoding mitochondrial proteins and eukaryotic homologs. 
            The Mkt1 gene product interacts with the Poly(A)-binding
            protein associated factor, Pbp1, and is present at the 3'
            end of RNA transcripts during translation. The Mkt1-Pbp1
            complex is involved in the post-transcriptional
            regulation of HO endonuclease expression. Mkt1 and
            eukaryotic homologs are atypical members of the
            structure-specific, 5' nuclease family. Conical members
            of this family possess a PIN (PilT N terminus) domain
            with a helical arch/clamp region/I domain (not included
            here) and inserted within the PIN domain is an atypical
            helix-hairpin-helix-2 (HhH2)-like region. This atypical
            HhH2 region, the H3TH (helix-3-turn-helix) domain, has an
            extended loop with at least three turns between the first
            two helices, and only three of the four helices appear to
            be conserved.  Although Mkt1 appears to possess both a
            PIN and H3TH domain, the Mkt1 PIN domain lacks several of
            the active site residues necessary to bind essential
            divalent metal ion cofactors (Mg2+/Mn2+) required for
            nuclease activity in this family. Also, Mkt1 lacks the
            glycine-rich loop in the H3TH domain which is proposed to
            facilitate duplex DNA binding.
          Length = 81

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 2/45 (4%)

Query: 995  PTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPP 1039
            PTFPP+Q+     P   NF   +  V+   S +   Q F  T   
Sbjct: 16   PTFPPLQDSPFPKPQ--NFRDALDMVKQARSGISVCQQFPDTPSV 58


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 31.4 bits (71), Expect = 3.6
 Identities = 27/84 (32%), Positives = 33/84 (39%), Gaps = 8/84 (9%)

Query: 912 PPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMS 971
           PP     LPP  +   P  S+      P  SFN  LP      PS P  S   P +PP+ 
Sbjct: 190 PPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIPPVI 249

Query: 972 NFNPTIPPKENFNPTLPPVQNINP 995
                + P     P +PP Q I P
Sbjct: 250 ---QYVAP-----PPVPPPQPIIP 265



 Score = 30.3 bits (68), Expect = 7.5
 Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 3/82 (3%)

Query: 937  LPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPT 996
             PP PS    LPP  S     P+ S+      P  +FNP +P      P+ PP     P 
Sbjct: 187  QPPPPSSLPGLPPGSS--SLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPP 244

Query: 997  FPPVQNFNHTLPPMSNFNPTIP 1018
             PPV  +    PP+    P IP
Sbjct: 245  IPPVIQYVAP-PPVPPPQPIIP 265


>gnl|CDD|233031 TIGR00577, fpg, formamidopyrimidine-DNA glycosylase (fpg).  All
           proteins in the FPG family with known functions are
           FAPY-DNA glycosylases that function in base excision
           repair. Homologous to endonuclease VIII (nei). This
           family is based on the phylogenomic analysis of JA Eisen
           (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 272

 Score = 30.7 bits (70), Expect = 4.3
 Identities = 13/39 (33%), Positives = 18/39 (46%)

Query: 391 LPELTTPRPVLNGEIVNKVLPQVEVDVPNSFTTRKSPVD 429
           LPE+ T R  L   ++ K +  VEV + N       P D
Sbjct: 3   LPEVETVRRGLEPLVLGKTIKSVEVVLRNPVLRPAGPED 41


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2); Provisional.
          Length = 576

 Score = 30.7 bits (69), Expect = 5.8
 Identities = 44/179 (24%), Positives = 60/179 (33%), Gaps = 22/179 (12%)

Query: 949  PVPSFNPSFPTMSNFNPTLP--PMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHT 1006
            PVP+  P  P   N  PT     + NFN  +   +N    +PP        P  +N    
Sbjct: 290  PVPAPVPPTPEDDNPRPTDDEFAVPNFNEGLDVPDNPQDPVPPPNEGKDGNPNEENL--- 346

Query: 1007 LPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIP-----PKE 1061
             PP  +  P    V       PP     P     E  +    P +  NP IP       E
Sbjct: 347  FPPGDDEVPDESNV-------PPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPE 399

Query: 1062 NFNPTLPPVQNINPTFPPVQNFNHTLPPMSNF----NPTLP-PVQNFNPTFPPVKNINP 1115
            + N  +P    + P      NFN    P +       P +P P+ N        K +NP
Sbjct: 400  DSNKEVPEDVPMEPEDDRDNNFNEPKKPENKGDGQNEPVIPKPLDNERDQSNKNKQVNP 458


>gnl|CDD|215947 pfam00493, MCM, MCM2/3/5 family. 
          Length = 327

 Score = 30.2 bits (69), Expect = 6.3
 Identities = 9/28 (32%), Positives = 13/28 (46%), Gaps = 6/28 (21%)

Query: 907 FNPTLPPLQNINLPPSFNPSFPPMSNFY 934
           ++P     +NINLPP        +S F 
Sbjct: 178 YDPKKSVAENINLPPPL------LSRFD 199


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 30.4 bits (68), Expect = 6.6
 Identities = 16/89 (17%), Positives = 24/89 (26%), Gaps = 11/89 (12%)

Query: 916 NINLPPSFNPSFPPMSNFYPTL--PPMPSFNLTLP-PVPSFNPSFPTMSNFNPTLPP--M 970
              L P       P S+    +    + S        VP    + P +      + P  M
Sbjct: 321 AAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAV------VKPGPM 374

Query: 971 SNFNPTIPPKENFNPTLPPVQNINPTFPP 999
               P  PP       + P   + PT   
Sbjct: 375 EIPTPVPPPGLAIPSLVAPPGLVAPTEIN 403


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 29.7 bits (67), Expect = 8.5
 Identities = 18/61 (29%), Positives = 24/61 (39%), Gaps = 5/61 (8%)

Query: 247 GTCILPSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNTEAPTTLPTIP 306
                 ++TST T     T + TTT    T     TTT +T        TE  ++  T P
Sbjct: 88  SNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTET-----PTEGGSSAATDP 142

Query: 307 S 307
           S
Sbjct: 143 S 143


>gnl|CDD|203376 pfam06016, Reovirus_L2, Reovirus core-spike protein lambda-2 (L2).
            This family consists of several Reovirus core-spike
            protein lambda-2 (L2) sequences. The reovirus L2 genome
            segment encodes the core spike protein lambda-2, which
            mediates enzymatic reactions in 5' capping of the viral
            plus-strand transcripts.
          Length = 1290

 Score = 30.2 bits (68), Expect = 8.9
 Identities = 14/53 (26%), Positives = 18/53 (33%), Gaps = 5/53 (9%)

Query: 1180 PSNSNWLNNMTPTLP---SLSLTT--PTVFPLPSRTPTSPPATFTLISTTQRD 1227
             +N  W   +TP+ P    + L         L S    SP  T  L    Q D
Sbjct: 1084 ATNGEWTLTITPSDPGILDVQLVGSNANPVSLGSFVIDSPDVTIVLAFPAQLD 1136


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 620

 Score = 29.9 bits (68), Expect = 9.3
 Identities = 14/67 (20%), Positives = 17/67 (25%), Gaps = 2/67 (2%)

Query: 1015 PTIPPVQNFNSTLPPVQ--NFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQN 1072
            P          T P  Q     P   P           SN    IP   +  P +P    
Sbjct: 526  PPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPT 585

Query: 1073 INPTFPP 1079
             +PT   
Sbjct: 586  PSPTKDS 592


>gnl|CDD|218382 pfam05013, FGase, N-formylglutamate amidohydrolase.
           Formylglutamate amidohydrolase (FGase) catalyzes the
           terminal reaction in the five-step pathway for histidine
           utilisation in Pseudomonas putida. By this action,
           N-formyl-L-glutamate (FG) is hydrolysed to produce
           L-glutamate plus formate.
          Length = 221

 Score = 29.5 bits (67), Expect = 9.5
 Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 501 RPFSGAPSYTPTDSNPHVIPTEI--KTIFKGGPTVSEAA 537
           RP   AP   P  S   +IPT    + +++ G + +E  
Sbjct: 66  RPPDDAPPAMPGQSGTGLIPTTFGGRPLYRRGLSAAERE 104


>gnl|CDD|215479 PLN02887, PLN02887, hydrolase family protein.
          Length = 580

 Score = 29.8 bits (67), Expect = 9.9
 Identities = 14/57 (24%), Positives = 18/57 (31%), Gaps = 7/57 (12%)

Query: 909 PTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNP 965
             +PP   + L PS     P  SNF           +T     SF   F    +  P
Sbjct: 23  REMPPAARLLLQPSSLSFSPKTSNFNRLD-------VTSTEFSSFRKGFHAFRSTKP 72


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.132    0.396 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 64,802,271
Number of extensions: 6362370
Number of successful extensions: 5532
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5015
Number of HSP's successfully gapped: 220
Length of query: 1262
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1154
Effective length of database: 6,147,370
Effective search space: 7094064980
Effective search space used: 7094064980
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (29.0 bits)