RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17877
(1262 letters)
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 60.7 bits (147), Expect = 5e-09
Identities = 107/487 (21%), Positives = 144/487 (29%), Gaps = 45/487 (9%)
Query: 683 PEVRTNEFLNPAALTTSSVPPSRDKPTS----LPPIITDRPLDNPHLPNILT-------- 730
E R A PP D P + P I+ D P+ P P +LT
Sbjct: 2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEEL 2543
Query: 731 HSTPGESPEPYLPTSTLSSLGNIA--------RDSTPNVIAKEFSSRDKPTSLPPIITDR 782
S P P LP + + + + R S P V ++ P S P
Sbjct: 2544 ASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVD 2603
Query: 783 PLDNPHLPNILTYSTPGESPEPHLPTSTLSSLGNIARDLTPNVIAKEFNTIASSGLPEYH 842
+P P + P ++ P P + S N P +
Sbjct: 2604 DRGDPRGPAPPS-PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV- 2661
Query: 843 VESKPTVTPIDLIITQSTSPTERQTSSHITSTTLTKVSEYSSTNFPNIPPIQNFPKLSKD 902
S+P Q++SP +R T V +S P PP P
Sbjct: 2662 --SRPRRARRLGRAAQASSPPQRPRRRAARPT----VGSLTSLADPPPPPPTPEPAPHAL 2715
Query: 903 LSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSN 962
+S P P P PP P P P+ PP + P P
Sbjct: 2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTA-GPPAPAPPA 2773
Query: 963 FNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQN 1022
PP P + +LP + V LPP ++ +PP +
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
Query: 1023 FNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQN 1082
T PP PP P+LP + P + P + P PPV+
Sbjct: 2834 AQPTAPP--------PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR 2885
Query: 1083 FNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPN 1142
S + LPP Q P PP P PP P PP P PP P
Sbjct: 2886 LARPAVSRSTESFALPPDQ---PERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPR 2938
Query: 1143 FNPTLPP 1149
P L P
Sbjct: 2939 PQPPLAP 2945
Score = 57.6 bits (139), Expect = 5e-08
Identities = 96/496 (19%), Positives = 143/496 (28%), Gaps = 62/496 (12%)
Query: 678 TFPTLPEVRTNEFLNPAALTTSSVPPSRDKPTSLPPIITDRPLDNPHL-PNILTHSTPGE 736
FP P R PA + P P D P L P IL GE
Sbjct: 2473 LFPGAPVYR-----RPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGE 2527
Query: 737 SPEPYLPTSTLSSLGNIARDSTPNVIAKEFSSRDKPTSLPP----------IITDRPLDN 786
P + T + L +A D + D P LPP + RP
Sbjct: 2528 PVHPRMLT-WIRGLEELASDD----------AGDPPPPLPPAAPPAAPDRSVPPPRPAPR 2576
Query: 787 PHLPNILTYST-PGESPEPHLPTSTLSSLGNIARDLTPNVI---AKEFNTIASSGLPEYH 842
P P + + + P P+ P + + G+ P+ + + S P +
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
Query: 843 VESKPTVTPIDLIITQSTSPTERQTSSHITSTTLTKVSEYSSTNFPNIPPIQNFPKLSKD 902
+ P + S + L + ++ SS PP + + ++
Sbjct: 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS------PPQRPRRRAAR- 2689
Query: 903 LSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSN 962
PT+ L ++ PP P P + + P LPP P+
Sbjct: 2690 ------PTVGSLTSLADPPP--PPPTPEPAPHALVSATP-----LPPGPAAARQASPALP 2736
Query: 963 FNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQN 1022
P P + P P P P P PP P + +
Sbjct: 2737 AAPAPPAVPA-GPATPGGPARPAR--PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
Query: 1023 FNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQN 1082
+LP + V LPP ++ +PP + PT PP P P
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP-----PPPGPPPP 2848
Query: 1083 FNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPN 1142
+LP + P + P K P PPV+ + LPP
Sbjct: 2849 ---SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
Query: 1143 FNPTLPPIHINPTFPP 1158
P P P P
Sbjct: 2906 ERPPQPQAPPPPQPQP 2921
Score = 42.6 bits (100), Expect = 0.002
Identities = 63/340 (18%), Positives = 94/340 (27%), Gaps = 49/340 (14%)
Query: 893 IQNFPKLSKDLSFDFNPTLPPLQNINLPPSFNPSFPP-------------MSNFYPTLPP 939
I+ +L+ D + D P LPP P + + S PP P PP
Sbjct: 2537 IRGLEELASDDAGDPPPPLPPAA---PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPP 2593
Query: 940 MPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPP 999
P P + P LPP ++ PP + P PP
Sbjct: 2594 QS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649
Query: 1000 VQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPP 1059
+ + P + P + + P P PT+ +++ PP
Sbjct: 2650 ERPRDDPAPGRVS-----RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP 2704
Query: 1060 KENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFP--PVKNINPTF 1117
P + + P P P P P P P P
Sbjct: 2705 PPTPEPAPHALVSATPLPPGPAAARQASPA-----LPAAPAPPAVPAGPATPGGPARPAR 2759
Query: 1118 PPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPKKNFNVISSQQNFDLSVSS 1177
PP PP P P P PP ++ S+++
Sbjct: 2760 PPT----TAGPP----APAPPAAP---AAGPP----RRLTRPAVASLSESRESLPSPWDP 2804
Query: 1178 AFPSNSNWLNNMTPTLPSLSLTTPTVFPLPSRTPTSPPAT 1217
A P + + LP + + P S PT+PP
Sbjct: 2805 ADPPAA--VLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
Score = 38.0 bits (88), Expect = 0.040
Identities = 79/452 (17%), Positives = 124/452 (27%), Gaps = 71/452 (15%)
Query: 693 PAALTTSSVPPSRDKPTSLPPIITD-RPLDNPHLPNILTHSTPGESPEPYLPTSTLSSLG 751
+S PP R + + P + L +P P TP +P + + L
Sbjct: 2670 LGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP----PTPEPAPHALVSATPLPPGP 2725
Query: 752 NIARDSTPNVIAKEFSSRDKPTSLPPIITDRPLDNPHLPNILTYSTPGESPEPHLPTSTL 811
AR ++P + A + P P + P P P +
Sbjct: 2726 AAARQASPALPAAP-APPAVPAGPATPGGPARPARPPTT-----AGPPAPAPPAAPAAG- 2778
Query: 812 SSLGNIARDLTPNVIAKEFNTIASSGLPEYHVESKPTVTPIDLIITQSTSPTERQ---TS 868
R LT +A + S P + V + + SP TS
Sbjct: 2779 -----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
Query: 869 SHITSTTLTK--VSEYSSTNFPNIP--PIQNFPKLSKDLSFDFNPTLPPLQNINLPP--- 921
+ T+ P ++ P + P PP++ + P
Sbjct: 2834 AQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR 2893
Query: 922 ---SF-NPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPP------MS 971
SF P P P PP P PP P P P P L P
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
Query: 972 NFNPTIP--------------PKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNF---- 1013
+ +P P+ P + + PP H+L +S++
Sbjct: 2954 EPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP--LTGHSLSRVSSWASSL 3011
Query: 1014 ----NPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPP 1069
PPV + PP + + + + + +P P P P
Sbjct: 3012 ALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPE-----PHDPF 3066
Query: 1070 VQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQ 1101
+ P P S F P PP+
Sbjct: 3067 A---HEPDPATPEAGARESPSSQFGP--PPLS 3093
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 48.2 bits (115), Expect = 2e-05
Identities = 25/136 (18%), Positives = 35/136 (25%), Gaps = 4/136 (2%)
Query: 978 PPKENFNPTLPPVQN-INPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLP-PVQNFNP 1035
P + P Q P + PT P N +T P P
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGP 110
Query: 1036 TL--PPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNF 1093
T+ P + + +P M NP P + Q P + +
Sbjct: 111 TIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQK 170
Query: 1094 NPTLPPVQNFNPTFPP 1109
N Q N PP
Sbjct: 171 NDESQLQQQPNGETPP 186
Score = 45.5 bits (108), Expect = 1e-04
Identities = 25/144 (17%), Positives = 34/144 (23%), Gaps = 9/144 (6%)
Query: 936 TLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINP 995
P PS P P P + P + PP P
Sbjct: 46 EFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGS-PAANPPATFALP 104
Query: 996 T--------FPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTL 1047
P Q + +P M NP P+ Q P P+
Sbjct: 105 AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQ 164
Query: 1048 PPMSNFNPTIPPKENFNPTLPPVQ 1071
+ N ++ N PP Q
Sbjct: 165 QQSAQKNDESQLQQQPNGETPPQQ 188
Score = 38.2 bits (89), Expect = 0.022
Identities = 21/142 (14%), Positives = 27/142 (19%), Gaps = 5/142 (3%)
Query: 1014 NPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNI 1073
PPV LP + PP P P I
Sbjct: 53 PQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTI 112
Query: 1074 NPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNF 1133
P Q + +P M NP P+ + P P+ Q
Sbjct: 113 QT--EPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPS---QQQS 167
Query: 1134 NPTLPPVPNFNPTLPPIHINPT 1155
T
Sbjct: 168 AQKNDESQLQQQPNGETPPQQT 189
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 48.2 bits (115), Expect = 3e-05
Identities = 28/155 (18%), Positives = 34/155 (21%), Gaps = 1/155 (0%)
Query: 998 PPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTI 1057
P P F PP Q PP + P P+
Sbjct: 166 RQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQ 225
Query: 1058 PPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTF 1117
P + P P Q P + +PP P PP +N
Sbjct: 226 APAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPH 285
Query: 1118 P-PVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIH 1151
P Q N PP Q P
Sbjct: 286 PGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQF 320
Score = 35.1 bits (81), Expect = 0.25
Identities = 22/146 (15%), Positives = 32/146 (21%), Gaps = 1/146 (0%)
Query: 1064 NPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNFNPTFPPVKNINPTFPPVQNF 1123
P PP Q + PP + P P+ P + P P Q
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPP 241
Query: 1124 NPTFPPIQNFNPTLPPVPNFNPTLPPIH-INPTFPPKKNFNVISSQQNFDLSVSSAFPSN 1182
P + +PP P P PP +N + P
Sbjct: 242 PLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQ 301
Query: 1183 SNWLNNMTPTLPSLSLTTPTVFPLPS 1208
L + +
Sbjct: 302 PQLLPLVQQPQGQQRGPQFREQLVQL 327
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 41.5 bits (97), Expect = 0.002
Identities = 45/264 (17%), Positives = 74/264 (28%), Gaps = 27/264 (10%)
Query: 924 NPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENF 983
P+ ++ P+ N LPP P+ P PP + +PP++
Sbjct: 50 PPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQ-QQSQAPPSHQYPSQLPPQQVQ 108
Query: 984 NPTLPPVQNINPTFPPVQNFNHT---LPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPV 1040
+ P P +PP P P Q PP Q PP
Sbjct: 109 SVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPP 168
Query: 1041 ENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPV 1100
+ + P +S P P + + PP + + + ++ P + P P
Sbjct: 169 Q--AQSAPQVSGLYPEESPYQ--PQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSP 224
Query: 1101 QNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPKK 1160
+ P P F P + P+ ++ P
Sbjct: 225 YMYG---------GPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNHGSVASYAP-- 273
Query: 1161 NFNVISSQQNFDLSVSSAFPSNSN 1184
Q S S+A+PS
Sbjct: 274 --------QGSSQSYSTAYPSLPA 289
Score = 38.8 bits (90), Expect = 0.018
Identities = 43/246 (17%), Positives = 70/246 (28%), Gaps = 18/246 (7%)
Query: 909 PTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLP 968
P P Q+ PP + LPP ++ P P P +P S P P
Sbjct: 74 PPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPS--QPQPP 131
Query: 969 PMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLP 1028
P P P P Q + P + PP + P + + S
Sbjct: 132 PAQQPQAQQPQP----PPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQ 187
Query: 1029 PVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNI-NPTFPPVQNFNHTL 1087
P + PP E ++ ++ P ++ + P P P P F
Sbjct: 188 P-----QSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQ 242
Query: 1088 PPMSNFNPTLPPVQNFNPTFPPVKNINPTFPP--VQNFNPTFPPIQNFNPTLPPVPNFNP 1145
P + P+ ++ P Q+++ +P P +P P
Sbjct: 243 QPPPSQGQEGYGYSGPPPSKGNHGSVASYAPQGSSQSYSTAYPS----LPAATVLPQALP 298
Query: 1146 TLPPIH 1151
Sbjct: 299 MSSAPM 304
Score = 29.9 bits (67), Expect = 8.2
Identities = 40/172 (23%), Positives = 50/172 (29%), Gaps = 15/172 (8%)
Query: 1044 NPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQNF 1103
P ++ P + N LPP P Q + PP + LPP Q
Sbjct: 50 PPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQA-PPSHQYPSQLPPQQVQ 108
Query: 1104 NPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPKKNFN 1163
+ P P +PP P PP Q P P P P P
Sbjct: 109 SVPQQPTPQQEPYYPPPSQ--PQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQP------ 160
Query: 1164 VISSQQNFDLSVSSAFPSNSNWLNNMTPTLPSLSLTTPTVFPLPSRTPTSPP 1215
QQN SA + + P + P PLPS PP
Sbjct: 161 --QYQQNPPPQAQSAPQVSGLY----PEESPYQPQSYPPNEPLPSSMAMQPP 206
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the protein
product of the dentatorubral-pallidoluysian atrophy
(DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the expansion
of a CAG repeat in the DRPLA gene on chromosome 12p. This
results in an extended polyglutamine region in
atrophin-1, that is thought to confer toxicity to the
protein, possibly through altering its interactions with
other proteins. The expansion of a CAG repeat is also the
underlying defect in six other neurodegenerative
disorders, including Huntington's disease. One
interaction of expanded polyglutamine repeats that is
thought to be pathogenic is that with the short glutamine
repeat in the transcriptional coactivator CREB binding
protein, CBP. This interaction draws CBP away from its
usual nuclear location to the expanded polyglutamine
repeat protein aggregates that are characteristic of the
polyglutamine neurodegenerative disorders. This
interferes with CBP-mediated transcription and causes
cytotoxicity.
Length = 979
Score = 38.5 bits (89), Expect = 0.024
Identities = 70/322 (21%), Positives = 106/322 (32%), Gaps = 32/322 (9%)
Query: 909 PTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLP 968
P PP + + PS PP + +PP S P PS ++ + P
Sbjct: 175 PQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHP 234
Query: 969 PMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIP------PVQN 1022
P+ P P P + PP + H P +P PV
Sbjct: 235 QRL---PSPHP-----PLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFL 286
Query: 1023 FNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPKENF-NPTLPPVQNINPTFPPVQ 1081
+ + P Q F L + LP + + PP ++ P PP + PP
Sbjct: 287 QHPSSNPPQPF--GLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQ---PLPPAP 341
Query: 1082 NFNHTLPPMSNFNPTLPPVQNFNPTF----PPVKNINPTFPPVQNFNPTFPPIQNFNPTL 1137
+ H PP + P LP + +P P + PP P + P+
Sbjct: 342 SMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSA 401
Query: 1138 PPVP-NFNPTLPPIHINPTFPPKKNFNVISSQQNFDLSVSSAFPSNSNWLNNMTP--TLP 1194
P P P P+ P PP V++ Q+ S+ S + +P P
Sbjct: 402 HPPPLQLMPQSQPLQSVPAQPP-----VLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHP 456
Query: 1195 SLSLTTPTVFPLPSRTPTSPPA 1216
S P + P PS ++P A
Sbjct: 457 FTSGGLPAIGPPPSLPTSTPAA 478
Score = 32.7 bits (74), Expect = 1.6
Identities = 64/344 (18%), Positives = 104/344 (30%), Gaps = 41/344 (11%)
Query: 767 SSRDKPTSLPPIITDRPLDNPHLPNILTYST----PGESPEPHLPTSTLSSLGNIARDLT 822
S++ P PI + S P P PH P ++ +
Sbjct: 199 SAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPA 258
Query: 823 PNVIAKEFNTIASSGLPEYHVESKPTVTPIDLIITQSTSPTERQTSSHITSTTLTKVSEY 882
P+ +S P + P+ L S P Q S
Sbjct: 259 PSSRH----PQSSHHGPGPPMPHALQQGPVFLQHPSSNPP---QPFGLAQSQVPPLPLPS 311
Query: 883 SSTNFPNIPPIQNFPKLSKDLSFDFNPTLPPLQNINLPPSFN-PSFPPMSNFYPTL---- 937
+ + PP Q+ + + P P + +I PP+ P P S+ +P
Sbjct: 312 QAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGP 371
Query: 938 PPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTF 997
P P LPP P+ P ++ P+ P P P P+Q++ P
Sbjct: 372 SPFPQMPSNLPPPPALKPLSSLPTHHPPSAHP---------PPLQLMPQSQPLQSV-PAQ 421
Query: 998 PPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTI 1057
PPV + +LPP ++ +P + F P P+LP
Sbjct: 422 PPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPT-------- 473
Query: 1058 PPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTLPPVQ 1101
+ P + P + + + P LPP+Q
Sbjct: 474 ----STPAAPPRA---SSGSQPPGSALPSSGGCAGPGPPLPPIQ 510
Score = 31.2 bits (70), Expect = 4.9
Identities = 65/332 (19%), Positives = 102/332 (30%), Gaps = 33/332 (9%)
Query: 693 PAALTTSSVPPSRDKPTSLPPIITDRPLDNPHLPNILTHST----PGESPEPYLPTSTLS 748
A P ++ P PI + S P P P+ P +
Sbjct: 189 LAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQT 248
Query: 749 SLGNIARDSTPNVIAKEFSSRDKPTSLP------PIITDRPLDNPHLPNILTYSTPGESP 802
+ + P+ + S +P P+ P NP P L S P
Sbjct: 249 ASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLP 308
Query: 803 EPHLPTSTLSSLGNIARDLTPNVIAKEFNTIASSGLPEYHVESKPTVTPIDLIITQS--- 859
P S L P +E + +P H++ PT TPI + QS
Sbjct: 309 LPS-QAQPHSHTPPSQSALQPQQPPREQPLPPAPSMP--HIKPPPT-TPIPQLPNQSHKH 364
Query: 860 ------TSPTERQTSSHITSTTLTKVSEYSSTNFPNI--PPIQNFPKLSKDLSFDFNPTL 911
SP + S+ L +S + + P+ PP+Q P + P
Sbjct: 365 PPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMP---QSQPLQSVPAQ 421
Query: 912 PPL--QNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPP 969
PP+ Q+ +LPP + + P P T +P+ P ++ P
Sbjct: 422 PPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPR 481
Query: 970 MSNFNPTIPPKENFNPTLPPVQNINPTFPPVQ 1001
S P + P+ P PP+Q
Sbjct: 482 AS---SGSQPPGSALPSSGGCAGPGPPLPPIQ 510
>gnl|CDD|173288 PRK14827, PRK14827, undecaprenyl pyrophosphate synthase; Provisional.
Length = 296
Score = 36.9 bits (85), Expect = 0.055
Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 7/55 (12%)
Query: 1105 PTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVPNFNPTLPPIHINPTFPPK 1159
P PP + PTFP + FP LPP P P PP H + P+
Sbjct: 14 PQLPPAPDDYPTFPDKSTWPVVFP-------ELPPAPYGGPCRPPQHTSKAAAPR 61
Score = 30.7 bits (69), Expect = 5.3
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 947 LPPVPSFNPSFPTMSNFN---PTLPPMSNFNPTIPPK 980
LPP P P+FP S + P LPP P PP+
Sbjct: 16 LPPAPDDYPTFPDKSTWPVVFPELPPAPYGGPCRPPQ 52
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is a
positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting pathway
to dissociate the endosomal sorting complex required for
transport (ESCRT). Vta1 promotes correct assembly of Vps4
and stimulates its ATPase activity through its conserved
Vta1/SBP1/LIP5 region.
Length = 315
Score = 36.6 bits (85), Expect = 0.067
Identities = 27/137 (19%), Positives = 40/137 (29%), Gaps = 26/137 (18%)
Query: 900 SKDLSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPT 959
+ D SF P + + PPS +P P + +L P P
Sbjct: 167 NSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSL---------PPAP- 216
Query: 960 MSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPP 1019
+ P P+ NP+ PP P +PP+S PT P
Sbjct: 217 --------SSFQSDTPPPSPESPTNPSPPP-----GPAAPPPPPVQQVPPLSTAKPTPPS 263
Query: 1020 VQNFNSTLPPVQNFNPT 1036
+T P+
Sbjct: 264 AS---ATPAPIGGITLD 277
Score = 35.0 bits (81), Expect = 0.22
Identities = 26/126 (20%), Positives = 38/126 (30%), Gaps = 8/126 (6%)
Query: 1031 QNFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPM 1090
P E+ + N P P + +P P +
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDP-PSSSPGVPSFPSPPEDPSSP 207
Query: 1091 SNFNPTLPPVQNFNPTFPPVKNINPTFP--PVQNFNPTFPPIQNFNPTLPPVPNFNPTLP 1148
S+ + P +F PP +PT P P P PP+Q +PP+ PT P
Sbjct: 208 SDSSLPPAP-SSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQ----QVPPLSTAKPTPP 262
Query: 1149 PIHINP 1154
P
Sbjct: 263 SASATP 268
Score = 34.3 bits (79), Expect = 0.32
Identities = 29/136 (21%), Positives = 45/136 (33%), Gaps = 17/136 (12%)
Query: 921 PSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPK 980
+ + N +P P+ + PS +P P+ + S+ + P
Sbjct: 159 EDADVATTNSDNSFPGEDADPASA-SPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPS 217
Query: 981 ENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPV 1040
+F PP +PT P PP P PPVQ +PP+ PT P
Sbjct: 218 -SFQSDTPPPSPESPTNPS--------PPPGPAAPPPPPVQ----QVPPLSTAKPTPPSA 264
Query: 1041 ENFNPTLPPMSNFNPT 1056
+ T P+
Sbjct: 265 ---SATPAPIGGITLD 277
Score = 33.5 bits (77), Expect = 0.68
Identities = 22/116 (18%), Positives = 31/116 (26%), Gaps = 10/116 (8%)
Query: 968 PPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTL 1027
N P P + +P P + S+ + P +F S
Sbjct: 166 TNSDNSFPGEDADPASASPSDP-PSSSPGVPSFPSPPEDPSSPSDSSLPPAP-SSFQSDT 223
Query: 1028 PPVQNFNPTLPPVENFNPTLPPMSNFNPTIPPK-ENFNPTLPPVQNINPTFPPVQN 1082
PP +PT NP+ PP P P + T P P
Sbjct: 224 PPPSPESPT-------NPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIG 272
Score = 32.0 bits (73), Expect = 1.8
Identities = 29/101 (28%), Positives = 31/101 (30%), Gaps = 15/101 (14%)
Query: 883 SSTNFPNIPPIQNFPKLSKDLSFDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLP-PMP 941
SS P+ P P D +LPP P SF PP S PT P P P
Sbjct: 190 SSPGVPSFPSPPEDPSSPSD------SSLPPA-----PSSFQSDTPPPSPESPTNPSPPP 238
Query: 942 SFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKEN 982
PP P T PT P S I
Sbjct: 239 GPAAPPPPPVQQVPPLSTA---KPTPPSASATPAPIGGITL 276
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 33.3 bits (75), Expect = 0.27
Identities = 26/57 (45%), Positives = 27/57 (47%), Gaps = 1/57 (1%)
Query: 246 TGTCILPSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNT-EAPTT 301
T T P TT+TTT T T T TTT T TTTT T T EAPTT
Sbjct: 48 TTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTT 104
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 35.0 bits (80), Expect = 0.28
Identities = 51/245 (20%), Positives = 65/245 (26%), Gaps = 32/245 (13%)
Query: 912 PPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMS 971
PP ++P + P PM L P+P L + P+ FPT
Sbjct: 608 PPTTQSHIPETSAPRQWPMP-----LRPIPMRPLRMQPITFNVLVFPT------------ 650
Query: 972 NFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQ 1031
P PP+ P P T + P + N +P + PP +
Sbjct: 651 ---PHQPPQVEITPYKP-----TWTQIGHIPYQ---PSPTGANTMLPIQWAPGTMQPPPR 699
Query: 1032 NFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHTLPPMS 1091
P PP P + PP PP PP PP +
Sbjct: 700 APTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 759
Query: 1092 NFNPTLPPVQNFNPTFPPVKNINPTFPPVQNFNPTFPPIQNFNPTLPPVP-NFNPTLPPI 1150
PP P P PP P P P P P P
Sbjct: 760 APGRARPPAAAPGAPTPQPP---PQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPG 816
Query: 1151 HINPT 1155
PT
Sbjct: 817 QQGPT 821
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family consists
of several Mycoplasma species specific Cytadhesin P32 and
P30 proteins. P30 has been found to be membrane
associated and localised on the tip organelle. It is
thought that it is important in cytadherence and
virulence.
Length = 279
Score = 33.5 bits (76), Expect = 0.61
Identities = 33/140 (23%), Positives = 40/140 (28%), Gaps = 22/140 (15%)
Query: 913 PLQNINLPPSFNPSFPPMSNFYPTLPPM---PSFNLTLPPVPSFNPSFPTMSNFNPTLPP 969
P P+ P +P P M P FN P +P P + P P
Sbjct: 155 PQVQPQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPP-----NQMRPGFNP 209
Query: 970 MSNF--NPTIPPKENFNPTLPPVQNINPTFPPVQN-FNHTLPPMSNFNPTIPPVQNFNST 1026
M P N P + P F P FNH P + P + FN
Sbjct: 210 MPGMPPRPGFNQNPNMMPNMN-----RPGFRPQPGGFNH---PGTPMGPNMQQRPGFNPN 261
Query: 1027 LPPVQNFNPT-LPPVENFNP 1045
N P P F P
Sbjct: 262 --QGMNPPPHMAGPRAGFPP 279
Score = 30.0 bits (67), Expect = 7.5
Identities = 29/109 (26%), Positives = 36/109 (33%), Gaps = 19/109 (17%)
Query: 905 FDFNPTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFN 964
F P + N P P PP + P PMP +PP P FN + M N N
Sbjct: 176 FPMQPNMGMRPGFNQMPPHMPGMPP-NQMRPGFNPMPG----MPPRPGFNQNPNMMPNMN 230
Query: 965 -PTLPPMSN--------FNPTIPPKENFNPTL-----PPVQNINPTFPP 999
P P P + + FNP P + FPP
Sbjct: 231 RPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFPP 279
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 33.5 bits (77), Expect = 0.71
Identities = 12/90 (13%), Positives = 37/90 (41%), Gaps = 4/90 (4%)
Query: 591 STPGTKTKVIDKLNIMT--SDLEKSLKSVVSSVMNELNTSNILPTTTPRNSFYNVINSLD 648
T V + +L ++L+ ++ ++ E + ++ + F ++I+S+
Sbjct: 96 EAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELV--SIFNDLIDSIK 153
Query: 649 KTNDNARFDDELTSRQRDIHSKIKNIGEIT 678
+ N + + S + ++ K + E+
Sbjct: 154 EDNLKDDLESLIASAKEELDQLSKKLAELK 183
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 33.7 bits (77), Expect = 0.85
Identities = 18/60 (30%), Positives = 22/60 (36%), Gaps = 3/60 (5%)
Query: 252 PSTTSTTTETPTPTRIETTTLAPNT---VAAVTTTTFATSKEPLKPNTEAPTTLPTIPSV 308
TST E TP + T A NT VAA T +S E + + T S
Sbjct: 35 TPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSE 94
Score = 33.3 bits (76), Expect = 1.0
Identities = 10/47 (21%), Positives = 18/47 (38%), Gaps = 3/47 (6%)
Query: 252 PSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNTEA 298
P +T E P+ + T + + T TT + ++ TE
Sbjct: 62 PVAATTAAEAPSSS---ETAETSDPTSEATDTTTSEARTVTPAATET 105
>gnl|CDD|217186 pfam02686, Glu-tRNAGln, Glu-tRNAGln amidotransferase C subunit.
This is a family of Glu-tRNAGln amidotransferase C
subunits. The Glu-tRNA Gln amidotransferase enzyme
itself is an important translational fidelity mechanism
replacing incorrectly charged Glu-tRNAGln with the
correct Gln-tRANGln via transmidation of the misacylated
Glu-tRNAGln. This activity supplements the lack of
glutaminyl-tRNA synthetase activity in gram-positive
eubacterteria, cyanobacteria, Archaea, and organelles.
Length = 72
Score = 30.1 bits (69), Expect = 1.1
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 2/36 (5%)
Query: 601 DKLNIMTSDLEKSLKSVVSSVMNELNTSNILPTTTP 636
++L T L K L V +NE++T + PT+ P
Sbjct: 4 EELEKFTKQLNKILDYV--EQLNEVDTEGVEPTSHP 37
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 32.5 bits (74), Expect = 1.6
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 246 TGTCILPSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATS 288
T ++ + T T T+T T + T T +P + TTTT + S
Sbjct: 800 ETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPS 842
Score = 32.1 bits (73), Expect = 2.2
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 252 PSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFAT 287
+TTS+ + T T + +T+T T + TTT
Sbjct: 814 QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849
Score = 31.8 bits (72), Expect = 3.4
Identities = 15/55 (27%), Positives = 19/55 (34%), Gaps = 7/55 (12%)
Query: 256 STTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNTEAPTTLPTIPSVLT 310
T + T+ T T + + T TT T T TT T PS T
Sbjct: 798 LIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQ-------TSTSTTTTTSPSQTT 845
>gnl|CDD|238384 cd00755, YgdL_like, Family of activating enzymes (E1) of
ubiquitin-like proteins related to the E.coli
hypothetical protein ygdL. The common reaction mechanism
catalyzed by E1-like enzymes begins with a nucleophilic
attack of the C-terminal carboxylate of the
ubiquitin-like substrate, on the alpha-phosphate of an
ATP molecule bound at the active site of the activating
enzymes, leading to the formation of a high-energy
acyladenylate intermediate and subsequently to the
formation of a thiocarboxylate at the C termini of the
substrate. The exact function of this family is unknown.
Length = 231
Score = 31.4 bits (72), Expect = 2.4
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 1/49 (2%)
Query: 411 PQVEVDVPNSFTTRKSPVDDLIIREIAPIVDAIPSKIPKFEMGITSRTH 459
P+ EVD F T + +DL+ + +VDAI S K + R
Sbjct: 79 PECEVDAVEEFLTPDN-SEDLLGGDPDFVVDAIDSIRAKVALIAYCRKR 126
>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit. This
model describes the bacterial oxaloacetate decarboxylase
alpha subunit and its equivalents in archaea. The
oxaloacetate decarboxylase Na+ pump is the paradigm of
the family of Na+ transport decarboxylases that present
in bacteria and archaea. It a multi subunit enzyme
consisting of a peripheral alpha-subunit and integral
membrane subunits beta and gamma. The energy released by
the decarboxylation reaction of oxaloacetate is coupled
to Na+ ion pumping across the membrane [Transport and
binding proteins, Cations and iron carrying compounds,
Energy metabolism, Other].
Length = 582
Score = 32.1 bits (73), Expect = 2.5
Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 4/33 (12%)
Query: 162 YGHPAAETSVYILDVVHREEPFDSEINLEYYLS 194
HP ET V L R +D+ +++E L
Sbjct: 232 TSHPPTETMVAAL----RGTGYDTGLDIELLLE 260
>gnl|CDD|188622 cd09902, H3TH_MKT1, H3TH domain of Mkt1: A global regulator of mRNAs
encoding mitochondrial proteins and eukaryotic homologs.
The Mkt1 gene product interacts with the Poly(A)-binding
protein associated factor, Pbp1, and is present at the 3'
end of RNA transcripts during translation. The Mkt1-Pbp1
complex is involved in the post-transcriptional
regulation of HO endonuclease expression. Mkt1 and
eukaryotic homologs are atypical members of the
structure-specific, 5' nuclease family. Conical members
of this family possess a PIN (PilT N terminus) domain
with a helical arch/clamp region/I domain (not included
here) and inserted within the PIN domain is an atypical
helix-hairpin-helix-2 (HhH2)-like region. This atypical
HhH2 region, the H3TH (helix-3-turn-helix) domain, has an
extended loop with at least three turns between the first
two helices, and only three of the four helices appear to
be conserved. Although Mkt1 appears to possess both a
PIN and H3TH domain, the Mkt1 PIN domain lacks several of
the active site residues necessary to bind essential
divalent metal ion cofactors (Mg2+/Mn2+) required for
nuclease activity in this family. Also, Mkt1 lacks the
glycine-rich loop in the H3TH domain which is proposed to
facilitate duplex DNA binding.
Length = 81
Score = 28.8 bits (65), Expect = 3.5
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 2/45 (4%)
Query: 995 PTFPPVQNFNHTLPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPP 1039
PTFPP+Q+ P NF + V+ S + Q F T
Sbjct: 16 PTFPPLQDSPFPKPQ--NFRDALDMVKQARSGISVCQQFPDTPSV 58
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 31.4 bits (71), Expect = 3.6
Identities = 27/84 (32%), Positives = 33/84 (39%), Gaps = 8/84 (9%)
Query: 912 PPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMS 971
PP LPP + P S+ P SFN LP PS P S P +PP+
Sbjct: 190 PPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIPPVI 249
Query: 972 NFNPTIPPKENFNPTLPPVQNINP 995
+ P P +PP Q I P
Sbjct: 250 ---QYVAP-----PPVPPPQPIIP 265
Score = 30.3 bits (68), Expect = 7.5
Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 3/82 (3%)
Query: 937 LPPMPSFNLTLPPVPSFNPSFPTMSNFNPTLPPMSNFNPTIPPKENFNPTLPPVQNINPT 996
PP PS LPP S P+ S+ P +FNP +P P+ PP P
Sbjct: 187 QPPPPSSLPGLPPGSS--SLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPP 244
Query: 997 FPPVQNFNHTLPPMSNFNPTIP 1018
PPV + PP+ P IP
Sbjct: 245 IPPVIQYVAP-PPVPPPQPIIP 265
>gnl|CDD|233031 TIGR00577, fpg, formamidopyrimidine-DNA glycosylase (fpg). All
proteins in the FPG family with known functions are
FAPY-DNA glycosylases that function in base excision
repair. Homologous to endonuclease VIII (nei). This
family is based on the phylogenomic analysis of JA Eisen
(1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 272
Score = 30.7 bits (70), Expect = 4.3
Identities = 13/39 (33%), Positives = 18/39 (46%)
Query: 391 LPELTTPRPVLNGEIVNKVLPQVEVDVPNSFTTRKSPVD 429
LPE+ T R L ++ K + VEV + N P D
Sbjct: 3 LPEVETVRRGLEPLVLGKTIKSVEVVLRNPVLRPAGPED 41
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2); Provisional.
Length = 576
Score = 30.7 bits (69), Expect = 5.8
Identities = 44/179 (24%), Positives = 60/179 (33%), Gaps = 22/179 (12%)
Query: 949 PVPSFNPSFPTMSNFNPTLP--PMSNFNPTIPPKENFNPTLPPVQNINPTFPPVQNFNHT 1006
PVP+ P P N PT + NFN + +N +PP P +N
Sbjct: 290 PVPAPVPPTPEDDNPRPTDDEFAVPNFNEGLDVPDNPQDPVPPPNEGKDGNPNEENL--- 346
Query: 1007 LPPMSNFNPTIPPVQNFNSTLPPVQNFNPTLPPVENFNPTLPPMSNFNPTIP-----PKE 1061
PP + P V PP P E + P + NP IP E
Sbjct: 347 FPPGDDEVPDESNV-------PPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPE 399
Query: 1062 NFNPTLPPVQNINPTFPPVQNFNHTLPPMSNF----NPTLP-PVQNFNPTFPPVKNINP 1115
+ N +P + P NFN P + P +P P+ N K +NP
Sbjct: 400 DSNKEVPEDVPMEPEDDRDNNFNEPKKPENKGDGQNEPVIPKPLDNERDQSNKNKQVNP 458
>gnl|CDD|215947 pfam00493, MCM, MCM2/3/5 family.
Length = 327
Score = 30.2 bits (69), Expect = 6.3
Identities = 9/28 (32%), Positives = 13/28 (46%), Gaps = 6/28 (21%)
Query: 907 FNPTLPPLQNINLPPSFNPSFPPMSNFY 934
++P +NINLPP +S F
Sbjct: 178 YDPKKSVAENINLPPPL------LSRFD 199
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 30.4 bits (68), Expect = 6.6
Identities = 16/89 (17%), Positives = 24/89 (26%), Gaps = 11/89 (12%)
Query: 916 NINLPPSFNPSFPPMSNFYPTL--PPMPSFNLTLP-PVPSFNPSFPTMSNFNPTLPP--M 970
L P P S+ + + S VP + P + + P M
Sbjct: 321 AAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAV------VKPGPM 374
Query: 971 SNFNPTIPPKENFNPTLPPVQNINPTFPP 999
P PP + P + PT
Sbjct: 375 EIPTPVPPPGLAIPSLVAPPGLVAPTEIN 403
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 29.7 bits (67), Expect = 8.5
Identities = 18/61 (29%), Positives = 24/61 (39%), Gaps = 5/61 (8%)
Query: 247 GTCILPSTTSTTTETPTPTRIETTTLAPNTVAAVTTTTFATSKEPLKPNTEAPTTLPTIP 306
++TST T T + TTT T TTT +T TE ++ T P
Sbjct: 88 SNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTET-----PTEGGSSAATDP 142
Query: 307 S 307
S
Sbjct: 143 S 143
>gnl|CDD|203376 pfam06016, Reovirus_L2, Reovirus core-spike protein lambda-2 (L2).
This family consists of several Reovirus core-spike
protein lambda-2 (L2) sequences. The reovirus L2 genome
segment encodes the core spike protein lambda-2, which
mediates enzymatic reactions in 5' capping of the viral
plus-strand transcripts.
Length = 1290
Score = 30.2 bits (68), Expect = 8.9
Identities = 14/53 (26%), Positives = 18/53 (33%), Gaps = 5/53 (9%)
Query: 1180 PSNSNWLNNMTPTLP---SLSLTT--PTVFPLPSRTPTSPPATFTLISTTQRD 1227
+N W +TP+ P + L L S SP T L Q D
Sbjct: 1084 ATNGEWTLTITPSDPGILDVQLVGSNANPVSLGSFVIDSPDVTIVLAFPAQLD 1136
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.9 bits (68), Expect = 9.3
Identities = 14/67 (20%), Positives = 17/67 (25%), Gaps = 2/67 (2%)
Query: 1015 PTIPPVQNFNSTLPPVQ--NFNPTLPPVENFNPTLPPMSNFNPTIPPKENFNPTLPPVQN 1072
P T P Q P P SN IP + P +P
Sbjct: 526 PPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPT 585
Query: 1073 INPTFPP 1079
+PT
Sbjct: 586 PSPTKDS 592
>gnl|CDD|218382 pfam05013, FGase, N-formylglutamate amidohydrolase.
Formylglutamate amidohydrolase (FGase) catalyzes the
terminal reaction in the five-step pathway for histidine
utilisation in Pseudomonas putida. By this action,
N-formyl-L-glutamate (FG) is hydrolysed to produce
L-glutamate plus formate.
Length = 221
Score = 29.5 bits (67), Expect = 9.5
Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Query: 501 RPFSGAPSYTPTDSNPHVIPTEI--KTIFKGGPTVSEAA 537
RP AP P S +IPT + +++ G + +E
Sbjct: 66 RPPDDAPPAMPGQSGTGLIPTTFGGRPLYRRGLSAAERE 104
>gnl|CDD|215479 PLN02887, PLN02887, hydrolase family protein.
Length = 580
Score = 29.8 bits (67), Expect = 9.9
Identities = 14/57 (24%), Positives = 18/57 (31%), Gaps = 7/57 (12%)
Query: 909 PTLPPLQNINLPPSFNPSFPPMSNFYPTLPPMPSFNLTLPPVPSFNPSFPTMSNFNP 965
+PP + L PS P SNF +T SF F + P
Sbjct: 23 REMPPAARLLLQPSSLSFSPKTSNFNRLD-------VTSTEFSSFRKGFHAFRSTKP 72
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.132 0.396
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 64,802,271
Number of extensions: 6362370
Number of successful extensions: 5532
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5015
Number of HSP's successfully gapped: 220
Length of query: 1262
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1154
Effective length of database: 6,147,370
Effective search space: 7094064980
Effective search space used: 7094064980
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (29.0 bits)