RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16259
(896 letters)
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 51.3 bits (123), Expect = 1e-06
Identities = 29/185 (15%), Positives = 52/185 (28%), Gaps = 9/185 (4%)
Query: 43 GVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHIT 102
GV+ SP P P L P + + ++P T
Sbjct: 43 GVAEFPWDPSPQAPPPVAQLP--QPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPAT 100
Query: 103 QHTGPGIFQSP--PPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVL 160
G + + P + + ++ + Q + Q +
Sbjct: 101 FALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQ 160
Query: 161 SPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTS--QSSQQQSDLIKTNGTGDVNPDEF 218
PS Q + D+ S +Q P + QQ D + S+ L + +GT +
Sbjct: 161 LPSQQQSAQKNDE---SQLQQQPNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGA 217
Query: 219 NQLLA 223
A
Sbjct: 218 EGGGA 222
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 48.9 bits (116), Expect = 9e-06
Identities = 83/428 (19%), Positives = 135/428 (31%), Gaps = 49/428 (11%)
Query: 377 SIDRPYCCSVCMKTFARKEHLENHTRCHTGETPYKCQY--CGKNFSRKEHMVNHVRKHTG 434
+ RP C C +F+R EHL H R HTGE P +C Y C K+FSR + H+R H
Sbjct: 29 NAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88
Query: 435 ETPHRCEICKKAFTRK-------------------EHYVNHVMWHTGETPHVCQVC--GK 473
K + + P + +
Sbjct: 89 NPSDLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRN 148
Query: 474 KYTRKEHLANHMRSHTNDTPFKCEICTKSFTRKEHFTNHIMW--HTGETPHRCDFCSKTF 531
+ ++ +N + S + + I T + +
Sbjct: 149 NPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSPLSSS 208
Query: 532 TRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYCAKSFTRKE 591
+ S ++ + + + DS + +S
Sbjct: 209 YSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTA 268
Query: 592 HLVNHVRQHTGE-------TPYQCQYCPKAFSRKDHLVNHVRQ--HTGES--PHKC--SY 638
+ + P + + C +FSR L H+R H+GES P C S
Sbjct: 269 SSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSL 328
Query: 639 CKQSFTRKDHLNNHVRLHTGESPHKCHF--CSKSFTRKEHLNNHIRLHTGESPHKCD--- 693
C + F+R D L H+ LHT SP K S F+ + L +
Sbjct: 329 CGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSE 388
Query: 694 ----FCLKTFTRKEHLTNHLKQHAGETEHVCN--ICSKPFLKKEHLVNHSRSHTGERPFA 747
C++ F R +L+ H+ H + C CSK F + +L+ H + HT P
Sbjct: 389 TLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLL 448
Query: 748 CTDCGKSF 755
C+
Sbjct: 449 CSILKSFR 456
Score = 42.8 bits (100), Expect = 6e-04
Identities = 59/325 (18%), Positives = 101/325 (31%), Gaps = 22/325 (6%)
Query: 520 TPHRCDFCSKTFTRKEHLLNHVRQHTGESPHKCNY--CAKSFTRKEHLVNHVRQHTGDSP 577
P C C+ +F+R EHL H+R HTGE P +C+Y C KSF+R L H+R H +
Sbjct: 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91
Query: 578 HKCNYCAKS--FTRKEHLVNHVRQHTGETPYQCQYCPKAFSRKDHLVNHVRQHTGESPHK 635
+ ++ S +D + + + +
Sbjct: 92 DLNSKSLPLSNSKASSSSLSSSS-SNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNP 150
Query: 636 CSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFC 695
S N+ P + SK + L + T +
Sbjct: 151 LPGNNSSSVNTPQSNSL------HPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSP 204
Query: 696 LKTFTRKEHLTNHLKQHAGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSF 755
L + ++ + S L + L++ S S + +
Sbjct: 205 LSSSYSIPSSSSDQNLENSSSSLPLTTNS--QLSPKSLLSQSPSSLSSSDSSSSASESPR 262
Query: 756 PLKGNLLFHQRSHRKQNGEVLER---PFKCDVCPKDFTCKGHLVSHLRSHL----NERPH 808
S + + + P K C F+ L HLRS + +P
Sbjct: 263 SSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPF 322
Query: 809 VCPH--CSKSFVEHTCMIKHIRKNH 831
CP+ C K F + + +HI +
Sbjct: 323 SCPYSLCGKLFSRNDALKRHILLHT 347
Score = 40.8 bits (95), Expect = 0.003
Identities = 77/429 (17%), Positives = 134/429 (31%), Gaps = 33/429 (7%)
Query: 436 TPHRCEICKKAFTRKEHYVNHVMWHTGETPHVCQV--CGKKYTRKEHLANHMRSHTNDTP 493
P C C +F+R EH H+ HTGE P C C K ++R L+ H+R+H N+
Sbjct: 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91
Query: 494 FKCEIC---TKSFTRKEHFTNHIMWHTGETPHRCDFCSKTFTRKEHLLNHVRQHTGESPH 550
+ S ++ S + ++ L + + +
Sbjct: 92 DLNSKSLPLSNSKASSSSLSSSSSNSNDNNL--LSSHSLPPSSRDPQLPDLLSISNLRNN 149
Query: 551 KCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQY 610
S + +S K S ++ +
Sbjct: 150 PLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSS------SENS 203
Query: 611 CPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKD------HLNNHVRLHTGESPHKC 664
+ + S + Q + L++ + +
Sbjct: 204 PLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRS 263
Query: 665 HFCSKSFTRKEHLNNHIRLHTGES-PHKCDFCLKTFTRKEHLTNHLKQH----AGETEHV 719
+ S + G S P K C +F+R LT HL+
Sbjct: 264 SLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFS 323
Query: 720 C--NICSKPFLKKEHLVNHSRSHTGERPFAC--TDCGKSFPLKGNLLFHQRSHRKQNGEV 775
C ++C K F + + L H HT P + F N Q S ++
Sbjct: 324 CPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQ-SLQQYKDLK 382
Query: 776 LERPFKCDV--CPKDFTCKGHLVSHLRSHLNERPHVC--PHCSKSFVEHTCMIKHIRKNH 831
++ + C ++F +L H+ +HL+ RP+ C P CSKSF H +I H + +
Sbjct: 383 NDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHT 442
Query: 832 TLANVNVSP 840
A + S
Sbjct: 443 NHAPLLCSI 451
Score = 37.4 bits (86), Expect = 0.032
Identities = 52/180 (28%), Positives = 74/180 (41%), Gaps = 18/180 (10%)
Query: 520 TPHRCDFCSKTFTRKEHLLNHVRQ--HTGES--PHKC--NYCAKSFTRKEHLVNHVRQHT 573
P + C+ +F+R L H+R H+GES P C + C K F+R + L H+ HT
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347
Query: 574 GDSPHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQY---------CPKAFSRKDHLVNH 624
SP K S L N Q + C + F R +L H
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLH 407
Query: 625 VRQHTGESPH--KCSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIR 682
+ H P+ K C +SF R +L H ++HT +P KSF R L+NH +
Sbjct: 408 IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL-LCSILKSFRRDLDLSNHGK 466
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 46.3 bits (110), Expect = 7e-05
Identities = 39/204 (19%), Positives = 51/204 (25%), Gaps = 35/204 (17%)
Query: 6 ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
L + A + PP PP F P + P +
Sbjct: 155 SLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQ------AAFPQQGPPEQPPGYPQPPQ 208
Query: 66 TSLTSPHQ---MPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP-----TS 117
+P PS PP P Q P + Q PG+ Q PP
Sbjct: 209 GHPEQVQPQQFLPAPSQAPAQPPLPPQL----PQQPPPLQQPQFPGLSQQMPPPPPQPPQ 264
Query: 118 HISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHS 177
+ P P + P + G G N Q Q
Sbjct: 265 QQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQP--------------QLLPLVQQ 310
Query: 178 TVQSPPGQQQQYQQMDTSQSSQQQ 201
G Q + Q + Q SQQQ
Sbjct: 311 PQGQQRGPQFREQLV---QLSQQQ 331
Score = 39.8 bits (93), Expect = 0.007
Identities = 33/191 (17%), Positives = 42/191 (21%), Gaps = 26/191 (13%)
Query: 53 PSQPSPQHGHHTLTSL-------TSPHQMP-PPSPIATSPPPPSASDKHDSQSSPHITQH 104
SQP PQ + SL Q P P P P Q
Sbjct: 141 ESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMP------PRQAAFPQQ 194
Query: 105 TGPGI---FQSPPPTSHISKIPSPSSSSPSSSSQLM----QQGGGGGSNGQTHLHIQNIG 157
P + PP P +PS + Q Q +
Sbjct: 195 GPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQ 254
Query: 158 AVLSPSY-GQHCIQQDDTQHST----VQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGD 212
P Q Q Q +P Q Q Q Q + G
Sbjct: 255 MPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQ 314
Query: 213 VNPDEFNQLLA 223
+F + L
Sbjct: 315 QRGPQFREQLV 325
Score = 39.0 bits (91), Expect = 0.014
Identities = 21/157 (13%), Positives = 30/157 (19%), Gaps = 9/157 (5%)
Query: 48 FISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGP 107
+ + +P PQ P P + S +
Sbjct: 124 SLWGTAPKPEPQP---------PQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQ 174
Query: 108 GIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQH 167
Q P + P P Q G Q + +
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPP 234
Query: 168 CIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDL 204
+ Q Q P QQ QQQ
Sbjct: 235 QLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQ 271
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 45.7 bits (108), Expect = 1e-04
Identities = 31/142 (21%), Positives = 45/142 (31%), Gaps = 12/142 (8%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
P Q + P PPPP R P +P T G S + P P G
Sbjct: 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG--EPSGAVPQPWLG 2965
Query: 62 HHTLTSLTSPH-QMPPPSPIATSPPPPSASDKHDSQS-----SPHITQHTGPGIFQSPPP 115
+ P ++P P+P +P + S S + + H PPP
Sbjct: 2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEET----DPPP 3021
Query: 116 TSHISKIPSPSSSSPSSSSQLM 137
S + P + S + L
Sbjct: 3022 VSLKQTLWPPDDTEDSDADSLF 3043
Score = 44.2 bits (104), Expect = 4e-04
Identities = 23/134 (17%), Positives = 38/134 (28%), Gaps = 14/134 (10%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEP--------PSPNTGGVSGGFISTSPSQPSPQHGH 62
P + PP P+++R + P P+ G + G + P+
Sbjct: 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT---- 2763
Query: 63 HTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP-PTSHISK 121
+P P P P AS +S P P P +
Sbjct: 2764 -AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822
Query: 122 IPSPSSSSPSSSSQ 135
P+ P+S+
Sbjct: 2823 SPAGPLPPPTSAQP 2836
Score = 41.5 bits (97), Expect = 0.003
Identities = 27/138 (19%), Positives = 38/138 (27%), Gaps = 14/138 (10%)
Query: 8 SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVS--GGFISTSPSQPSPQH----G 61
+ PA + T PPP P P G V+ G PS+
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGP-PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPA 2879
Query: 62 HHTLTSLTSPHQMPPPSPIA----TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
+ L P A PP Q P P Q PPP
Sbjct: 2880 RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP---QPPPPPP 2936
Query: 118 HISKIPSPSSSSPSSSSQ 135
+ P ++ P+ + +
Sbjct: 2937 PRPQPPLAPTTDPAGAGE 2954
Score = 41.1 bits (96), Expect = 0.004
Identities = 24/109 (22%), Positives = 33/109 (30%), Gaps = 5/109 (4%)
Query: 25 PPPPSSSRGSEPP--SPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
PP + R PP +P + + P P PQ + P+P +
Sbjct: 2559 APPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAP-PQSAR-PRAPVDDRGDPRGPAPPSP 2616
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPS 131
PP A D SP + PPP P+P S
Sbjct: 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP-RDDPAPGRVSRP 2664
Score = 39.5 bits (92), Expect = 0.010
Identities = 26/125 (20%), Positives = 36/125 (28%), Gaps = 14/125 (11%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPP--SPNTGGVSGG---FISTSPSQPSPQHGHHTL 65
PA+++ PPP P S G S+ P +P + T+
Sbjct: 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV 2692
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
SLTS PPP P P GP + P + P
Sbjct: 2693 GSLTSLADPPPPPPTPEPAPHA---------LVSATPLPPGPAAARQASPALPAAPAPPA 2743
Query: 126 SSSSP 130
+ P
Sbjct: 2744 VPAGP 2748
Score = 38.8 bits (90), Expect = 0.019
Identities = 26/111 (23%), Positives = 37/111 (33%), Gaps = 20/111 (18%)
Query: 26 PPPSSSRGSEP--PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
P+ R S +P G GG T P+ P P + P P P P +
Sbjct: 379 SLPTRKRRSARHAATPFARG-PGGDDQTRPAAPVPA---------SVPTPAPTPVPASAP 428
Query: 84 PPP--PSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
PPP P S + S P P + PP + P ++ +
Sbjct: 429 PPPATPLPSAEPGSDDGP------APPPERQPPAPATEPAPDDPDDATRKA 473
Score = 37.2 bits (86), Expect = 0.051
Identities = 26/137 (18%), Positives = 43/137 (31%), Gaps = 16/137 (11%)
Query: 19 SMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
S+ PS ++PP+ + + SP+ P P T+ P PPPS
Sbjct: 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
Query: 79 -----------PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP-----PPTSHISKI 122
+ PP S + K + + P + + P + +S PP
Sbjct: 2850 LPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
Query: 123 PSPSSSSPSSSSQLMQQ 139
+ P Q
Sbjct: 2910 QPQAPPPPQPQPQPPPP 2926
Score = 36.8 bits (85), Expect = 0.070
Identities = 26/126 (20%), Positives = 36/126 (28%), Gaps = 16/126 (12%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
+T P P ++ G P+P +G P + + +
Sbjct: 2743 AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG----PPRRLTRPAVASLSESRESL 2798
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
P P P A P +A S + P PPPTS P P P
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAGP------------LPPPTSAQPTAPPPPPGPP 2846
Query: 131 SSSSQL 136
S L
Sbjct: 2847 PPSLPL 2852
Score = 36.5 bits (84), Expect = 0.085
Identities = 23/135 (17%), Positives = 35/135 (25%), Gaps = 21/135 (15%)
Query: 10 FPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
S P P PPSP+ P +
Sbjct: 2608 PRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661
Query: 70 SPHQ---MPPPSPIATSPP----PPSASDKHDSQSS---PHITQHTGPGIFQSPPPTSHI 119
S + + A+SPP +A S +S P P P P + +
Sbjct: 2662 SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP-----PPPPTPEPAPHALV 2716
Query: 120 SKIPSPSSSSPSSSS 134
S P P + + +
Sbjct: 2717 SATPLPPGPAAARQA 2731
Score = 36.1 bits (83), Expect = 0.12
Identities = 24/118 (20%), Positives = 36/118 (30%), Gaps = 9/118 (7%)
Query: 25 PPPPSSSRGSEPPS-PNTGGVSGGFISTSPSQPSPQHGHHTLTSLT-------SPHQMPP 76
S + + T G P P+P+ H L S T + Q P
Sbjct: 2674 AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP 2733
Query: 77 PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
P A +PP A + T G PP + + P + P+ +S
Sbjct: 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG-PPRRLTRPAVAS 2790
Score = 36.1 bits (83), Expect = 0.12
Identities = 25/121 (20%), Positives = 37/121 (30%), Gaps = 11/121 (9%)
Query: 24 TPPPPSSSRG--SEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS--P 79
PP P+ + PP T + S PSP ++ +P PP+ P
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP 2824
Query: 80 IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP-------PPTSHISKIPSPSSSSPSS 132
PPP SA G +P PP+ + P+ + P
Sbjct: 2825 AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
Query: 133 S 133
Sbjct: 2885 R 2885
Score = 35.3 bits (81), Expect = 0.21
Identities = 23/133 (17%), Positives = 39/133 (29%), Gaps = 15/133 (11%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPP-SPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
PA+ + ++ P PP+ G P P + P P+P
Sbjct: 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAA----G 2778
Query: 70 SPHQMPPPS--------PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK 121
P ++ P+ SP P+ + + P PP ++ +
Sbjct: 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA 2838
Query: 122 IPSPSSSSPSSSS 134
P P P S
Sbjct: 2839 PPPPPGPPPPSLP 2851
Score = 34.1 bits (78), Expect = 0.48
Identities = 25/106 (23%), Positives = 38/106 (35%), Gaps = 17/106 (16%)
Query: 12 ASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSP 71
+S +S ++ +T PPP S + + P +T + S S+ S P
Sbjct: 3005 SSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSD-------LEALDP 3057
Query: 72 HQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
P P A P P + ++ SP G PPP S
Sbjct: 3058 LPPEPHDPFAHEPDPATPEA--GARESPS--SQFG------PPPLS 3093
Score = 32.2 bits (73), Expect = 1.5
Identities = 23/117 (19%), Positives = 36/117 (30%), Gaps = 19/117 (16%)
Query: 23 DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP----- 77
D PPPP EP +S +P P P +L + P
Sbjct: 2700 DPPPPP---PTPEPAPHAL-------VSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
Query: 78 SPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS----KIPSPSSSSP 130
+P + P + +P GP + P + +S +PSP +
Sbjct: 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPAD 2806
Score = 31.8 bits (72), Expect = 2.1
Identities = 26/118 (22%), Positives = 37/118 (31%), Gaps = 25/118 (21%)
Query: 27 PPSSSRGSEPPSPNTGGVSGGFISTSPSQPS----PQHGHHTLTSLTSPHQMPPPSPIAT 82
P ++RG+ P P + P+ P G +P +P P
Sbjct: 268 APETARGATGPPP----------PPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPD--- 314
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSH----ISKIPSPSSSSPSSSSQL 136
PPP A D + + P P P H K P+ + PSS L
Sbjct: 315 PPPPAPAGDAEEEDDEDGAMEVVSP----LPRPRQHYPLGFPKRRRPTWTPPSSLEDL 368
Score = 31.1 bits (70), Expect = 3.3
Identities = 24/120 (20%), Positives = 34/120 (28%), Gaps = 12/120 (10%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH-----HTLTSLTSPHQMPPPSPI 80
P P +P +P I P H L L S PP P+
Sbjct: 2496 PDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPL 2555
Query: 81 ATSPPPPSASDKHDSQSSPHITQHTGPGIFQ-----SPPPTSHISKIPSPSSSSPSSSSQ 135
+ PP A D+ P + + P + PP S + P P +
Sbjct: 2556 PPAAPPA-APDRSVPPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAP 2613
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 44.3 bits (105), Expect = 2e-04
Identities = 35/126 (27%), Positives = 45/126 (35%), Gaps = 23/126 (18%)
Query: 17 STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
S E P S S+PPS + G PS PSP + + + P P
Sbjct: 171 SFPGEDADPASASP---SDPPSSSPG---------VPSFPSPPEDPSSPSDSSLP---PA 215
Query: 77 PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
PS + PPPS + S P PG PPP S + +P S+S
Sbjct: 216 PSSFQSDTPPPSPESPT-NPSPP-------PGPAAPPPPPVQQVPPLSTAKPTPPSASAT 267
Query: 137 MQQGGG 142
GG
Sbjct: 268 PAPIGG 273
Score = 43.1 bits (102), Expect = 4e-04
Identities = 26/111 (23%), Positives = 33/111 (29%), Gaps = 12/111 (10%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
PPSSS G S S+ P PS S SP PP A P
Sbjct: 185 SDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPP 244
Query: 85 PPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQ 135
PPP S + P +PP S + + ++
Sbjct: 245 PPPVQQVPPLSTAKP------------TPPSASATPAPIGGITLDDDAIAK 283
Score = 40.8 bits (96), Expect = 0.002
Identities = 29/130 (22%), Positives = 43/130 (33%), Gaps = 28/130 (21%)
Query: 6 ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
E + S + P+S+ S+PPS + G PS PSP +
Sbjct: 157 EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPG---------VPSFPSPPEDPSSP 207
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
+ + P P PS + PPPS + SPPP P
Sbjct: 208 SDSSLP---PAPSSFQSDTPPPSPESPTN----------------PSPPPGPAAPPPPPV 248
Query: 126 SSSSPSSSSQ 135
P S+++
Sbjct: 249 QQVPPLSTAK 258
Score = 37.4 bits (87), Expect = 0.024
Identities = 27/159 (16%), Positives = 35/159 (22%), Gaps = 25/159 (15%)
Query: 24 TPPPPSSSRGSEPPSPNTGG-VSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
P P E + S + P+ SP P SP
Sbjct: 146 EDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPAS----------ASPSDPPSSSPGVP 195
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
S P P S SS P P+S S P PS S ++ G
Sbjct: 196 SFPSPPEDPSSPSDSSL------------PPAPSSFQSDTPPPSP--ESPTNPSPPPGPA 241
Query: 143 GGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQS 181
A +P +
Sbjct: 242 APPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDDA 280
Score = 33.1 bits (76), Expect = 0.56
Identities = 19/88 (21%), Positives = 28/88 (31%), Gaps = 7/88 (7%)
Query: 9 QFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHT-LTS 67
FP+ + +S + PP SS S+ P P+ T+PS P
Sbjct: 196 SFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPES------PTNPSPPPGPAAPPPPPVQ 249
Query: 68 LTSPHQMPPPSPIATSPPPPSASDKHDS 95
P P+P + S P
Sbjct: 250 QVPPLSTAKPTPPSASATPAPIGGITLD 277
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 44.7 bits (106), Expect = 2e-04
Identities = 26/116 (22%), Positives = 39/116 (33%), Gaps = 5/116 (4%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFIS--TSPSQPSPQHGHHTLTSLTSPHQ---MPPPS 78
PP R + SP++ IS P +P G +S S + PS
Sbjct: 149 GPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLESPS 208
Query: 79 PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
+ + P S+SS + I P S S +P+S S S +
Sbjct: 209 NLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRSTPTSGSSSINP 264
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 42.9 bits (101), Expect = 7e-04
Identities = 32/147 (21%), Positives = 46/147 (31%), Gaps = 30/147 (20%)
Query: 15 KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH----HTLTSLTS 70
KTST ME PP S+ S P +P+T S +PS + SL S
Sbjct: 452 KTSTVME----PPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEASLIS 507
Query: 71 P-----HQMPPPSPIATSPPPPSAS---------------DKHDSQSSPHITQHTGPGIF 110
H+ P + T P P+AS D S I +
Sbjct: 508 KSTGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSV 567
Query: 111 --QSPPPTSHISKIPSPSSSSPSSSSQ 135
+ S+PS++ +
Sbjct: 568 PLVRVFDIHLRASTTKGRHSTPSTNEK 594
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 41.3 bits (97), Expect = 0.003
Identities = 21/133 (15%), Positives = 32/133 (24%), Gaps = 12/133 (9%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTS-PSQPSPQHGHHTLTSLT 69
PAS +S E P PSS PS S S S S
Sbjct: 284 PASSSSSPR-ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSS--------E 334
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS 129
S + S P + + S + P + + + +
Sbjct: 335 SSRGAAVSPGPSPSRSPSPSRPPPPADPSS--PRKRPRPSRAPSSPAASAGRPTRRRARA 392
Query: 130 PSSSSQLMQQGGG 142
+ + G
Sbjct: 393 AVAGRARRRDATG 405
Score = 40.9 bits (96), Expect = 0.003
Identities = 27/117 (23%), Positives = 40/117 (34%), Gaps = 16/117 (13%)
Query: 23 DTPPPPSSSRGSEP-----PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP 77
+ P P+SS S PSP++ G SG S+ + S + +S TS
Sbjct: 279 SSRPGPASSSSSPRERSPSPSPSSPG-SGPAPSSPRASSSSSSSRESSSSSTSS-SSESS 336
Query: 78 SPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
A SP P + S+ P P +PSS + S+
Sbjct: 337 RGAAVSPGPSPSRSPSPSRPPP---------PADPSSPRKRPRPSRAPSSPAASAGR 384
Score = 40.5 bits (95), Expect = 0.005
Identities = 19/127 (14%), Positives = 39/127 (30%), Gaps = 8/127 (6%)
Query: 10 FPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQH--GHHTLTS 67
+ + PP P + + + + +P+ P+ + +S
Sbjct: 54 TVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSS 113
Query: 68 LTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
P PPP+ SP P + S + P S + + S ++
Sbjct: 114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPP-----AAGASP-AAVASDAA 167
Query: 128 SSPSSSS 134
SS ++
Sbjct: 168 SSRQAAL 174
Score = 40.2 bits (94), Expect = 0.007
Identities = 27/139 (19%), Positives = 35/139 (25%), Gaps = 28/139 (20%)
Query: 9 QFPASHKTSTSM-EVDTPPPPSSSRGSEPPSPN--------TGGVSGGFISTSPSQPSPQ 59
+ PPPP+ S PPSP G G + SP
Sbjct: 99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158
Query: 60 HGHHTLTSLTSPHQMPP---PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPT 116
+ +S P P A +P P A P P
Sbjct: 159 PAAVASDAASSRQAALPLSSPEETARAPSSPPAE----------------PPPSTPPAAA 202
Query: 117 SHISKIPSPSSSSPSSSSQ 135
S S S+ +SS
Sbjct: 203 SPRPPRRSSPISASASSPA 221
Score = 39.4 bits (92), Expect = 0.009
Identities = 16/130 (12%), Positives = 37/130 (28%), Gaps = 4/130 (3%)
Query: 6 ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
E + P ++ +G + T+ + S H
Sbjct: 770 EALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGG 829
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
+ + P + P + A PP S+ ++ G + PP P
Sbjct: 830 SESSGPARPPGAA--ARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEP--RARPGA 885
Query: 126 SSSSPSSSSQ 135
++ ++++
Sbjct: 886 AAPPKAAAAA 895
Score = 38.2 bits (89), Expect = 0.026
Identities = 27/114 (23%), Positives = 36/114 (31%), Gaps = 14/114 (12%)
Query: 22 VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
+ P + G PS G S S PSP + P P + +
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSS-SSPRERSPSPSPSS----PGSGPAPSSPRASSS 317
Query: 82 TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS-SSPSSSS 134
+S S+S S S PG P P+ PSPS P+ S
Sbjct: 318 SSSSRESSSSSTSSSSESSRGAAVSPG----PSPSRS----PSPSRPPPPADPS 363
Score = 36.7 bits (85), Expect = 0.065
Identities = 9/67 (13%), Positives = 17/67 (25%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
PP + S P + + S +P+ + + P
Sbjct: 354 SRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413
Query: 85 PPPSASD 91
P P +
Sbjct: 414 PSPLDAG 420
Score = 36.3 bits (84), Expect = 0.096
Identities = 22/157 (14%), Positives = 37/157 (23%), Gaps = 28/157 (17%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
++ S P + + S + S + T+ + SP
Sbjct: 137 MLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE----ETARAPSSPPAE 192
Query: 62 HHTLTS--LTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHIT----------------- 102
T SP SPI+ S P+ + + +
Sbjct: 193 PPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPE 252
Query: 103 -----QHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
P + + PS SSSS
Sbjct: 253 NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSS 289
Score = 35.9 bits (83), Expect = 0.10
Identities = 35/143 (24%), Positives = 47/143 (32%), Gaps = 18/143 (12%)
Query: 11 PASHKTSTSMEVDTPPPP------SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHT 64
PA ST +P PP S+S S P+P S+S S S G
Sbjct: 190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGW 249
Query: 65 LTSLTSPHQMPPPSPIATSP------------PPPSASDKHDSQSSPHITQHTGPGIFQS 112
P P P + T P P++S + SP + +
Sbjct: 250 GPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP 309
Query: 113 PPPTSHISKIPSPSSSSPSSSSQ 135
P + S S SSS S+SS
Sbjct: 310 SSPRASSSSSSSRESSSSSTSSS 332
Score = 33.6 bits (77), Expect = 0.53
Identities = 28/131 (21%), Positives = 38/131 (29%), Gaps = 9/131 (6%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFISTS--PSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
P P ++ G +G S G + + P PPP P +
Sbjct: 22 PRPPATPGDAADDLLSG--SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEA 79
Query: 84 PPPPSASDKHDSQS-----SPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQ 138
P S S S S SP P SP P SP S S++++
Sbjct: 80 PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLR 139
Query: 139 QGGGGGSNGQT 149
G G
Sbjct: 140 PVGSPGPPPAA 150
Score = 33.6 bits (77), Expect = 0.59
Identities = 24/106 (22%), Positives = 33/106 (31%), Gaps = 9/106 (8%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
PPP SS S+ GG + G +P + PP+
Sbjct: 844 RPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAP 903
Query: 84 PPPPSASDKHDSQSSPHITQHTGP---GIFQSPPPTSHISKIPSPS 126
P P+ K P GP G F+ PP + PS +
Sbjct: 904 RPRPAPRVK-LGPMPPG-----GPDPRGGFRRVPPGDLHTPAPSAA 943
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 35.8 bits (83), Expect = 0.003
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 648 HLNNHVRLHTGESPHKCHFCSKSFTR 673
+L H+R HTGE P+KC C KSF+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 35.4 bits (82), Expect = 0.004
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 676 HLNNHIRLHTGESPHKCDFCLKTFTR 701
+L H+R HTGE P+KC C K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 35.4 bits (82), Expect = 0.004
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 480 HLANHMRSHTNDTPFKCEICTKSFTR 505
+L HMR+HT + P+KC +C KSF+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 35.1 bits (81), Expect = 0.005
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 592 HLVNHVRQHTGETPYQCQYCPKAFSR 617
+L H+R HTGE PY+C C K+FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 34.3 bits (79), Expect = 0.008
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 732 HLVNHSRSHTGERPFACTDCGKSF 755
+L H R+HTGE+P+ C CGKSF
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSF 24
Score = 34.3 bits (79), Expect = 0.008
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 536 HLLNHVRQHTGESPHKCNYCAKSFTR 561
+L H+R HTGE P+KC C KSF+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 34.3 bits (79), Expect = 0.009
Identities = 16/26 (61%), Positives = 18/26 (69%)
Query: 396 HLENHTRCHTGETPYKCQYCGKNFSR 421
+L H R HTGE PYKC CGK+FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 33.9 bits (78), Expect = 0.010
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 424 HMVNHVRKHTGETPHRCEICKKAFTR 449
++ H+R HTGE P++C +C K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 33.1 bits (76), Expect = 0.019
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 794 HLVSHLRSHLNERPHVCPHCSKSFV 818
+L H+R+H E+P+ CP C KSF
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
Score = 33.1 bits (76), Expect = 0.020
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 620 HLVNHVRQHTGESPHKCSYCKQSFTR 645
+L H+R HTGE P+KC C +SF+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 33.1 bits (76), Expect = 0.025
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 564 HLVNHVRQHTGDSPHKCNYCAKSFTR 589
+L H+R HTG+ P+KC C KSF+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 32.4 bits (74), Expect = 0.037
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 508 HFTNHIMWHTGETPHRCDFCSKTFTR 533
+ H+ HTGE P++C C K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 32.4 bits (74), Expect = 0.039
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 368 SLQRHTKRHSIDRPYCCSVCMKTFAR 393
+L+RH + H+ ++PY C VC K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 30.8 bits (70), Expect = 0.15
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 315 VHRRYHTERKPFTCHVCGKAF 335
H R HT KP+ C VCGK+F
Sbjct: 4 RHMRTHTGEKPYKCPVCGKSF 24
Score = 28.5 bits (64), Expect = 0.89
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 452 HYVNHVMWHTGETPHVCQVCGKKYTR 477
+ H+ HTGE P+ C VCGK ++
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 27.4 bits (61), Expect = 2.7
Identities = 13/32 (40%), Positives = 17/32 (53%), Gaps = 6/32 (18%)
Query: 760 NLLFHQRSHRKQNGEVLERPFKCDVCPKDFTC 791
NL H R+H E+P+KC VC K F+
Sbjct: 1 NLRRHMRTHTG------EKPYKCPVCGKSFSS 26
Score = 27.0 bits (60), Expect = 3.4
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 704 HLTNHLKQHAGETEHVCNICSKPF 727
+L H++ H GE + C +C K F
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSF 24
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 40.7 bits (96), Expect = 0.003
Identities = 24/98 (24%), Positives = 30/98 (30%), Gaps = 6/98 (6%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
I+ L S S S TPPPP S P+P + +P P P
Sbjct: 505 SIKLNLE----SQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTAT 560
Query: 62 HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSP 99
+S SP P P+ S DS
Sbjct: 561 Q--ASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEE 596
Score = 36.9 bits (86), Expect = 0.051
Identities = 26/108 (24%), Positives = 34/108 (31%), Gaps = 24/108 (22%)
Query: 29 SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
S GS + T P Q SP + T P P + +PPPP
Sbjct: 511 ESQSGSASNTAKT---------PPPPQKSPPP-----PAPTPPLPQPTATAPPPTPPPP- 555
Query: 89 ASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
P TQ + Q P +S IP + SP+ S
Sbjct: 556 ---------PPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSP 594
Score = 31.1 bits (71), Expect = 3.3
Identities = 19/95 (20%), Positives = 31/95 (32%), Gaps = 14/95 (14%)
Query: 48 FISTSPSQPSPQHGHHTLTSLTSPHQMPPPSP-IATSPPPPSASDKHDSQSSPHITQHTG 106
FIS + +P + P + P +P + PS S P
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIP------- 416
Query: 107 PGIFQSPPPTSHISKIPSPSSSSPSSS--SQLMQQ 139
PT P+ ++++P S +L QQ
Sbjct: 417 ----VPAEPTEPSPTPPANAANAPPSLNLEELWQQ 447
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 40.0 bits (94), Expect = 0.006
Identities = 17/136 (12%), Positives = 34/136 (25%), Gaps = 7/136 (5%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSP-NTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
PA + E P P++ P+P + + H ++
Sbjct: 601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP 660
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS 129
P P+A + G +P + P+ +
Sbjct: 661 DASDGGDGWPAKAGGAAPAA----PPPAPAPAAPAAPAG--AAPAQPAPAPAATPPAGQA 714
Query: 130 PSSSSQLMQQGGGGGS 145
++Q Q G +
Sbjct: 715 DDPAAQPPQAAQGASA 730
Score = 39.2 bits (92), Expect = 0.012
Identities = 14/114 (12%), Positives = 36/114 (31%), Gaps = 2/114 (1%)
Query: 22 VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
PS++ + +P + + + P+ + P + A
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAP--AAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPA 450
Query: 82 TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQ 135
P P + +Q +P P +P P + + +P++ + ++
Sbjct: 451 GGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
Score = 38.0 bits (89), Expect = 0.027
Identities = 22/129 (17%), Positives = 35/129 (27%), Gaps = 4/129 (3%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
H P+ + G+ P +P + + + P+ T
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712
Query: 71 PHQMPPPSP----IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPS 126
P P S P P+A D P Q PPP + +
Sbjct: 713 QADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAA 772
Query: 127 SSSPSSSSQ 135
+ PS S+
Sbjct: 773 APPPSPPSE 781
Score = 36.5 bits (85), Expect = 0.076
Identities = 9/105 (8%), Positives = 19/105 (18%), Gaps = 5/105 (4%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPP 85
++ + ++ P + P P
Sbjct: 638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA 697
Query: 86 PPSASDKHDSQSSP-----HITQHTGPGIFQSPPPTSHISKIPSP 125
P + + Q S P + +P P
Sbjct: 698 PAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742
Score = 32.7 bits (75), Expect = 1.00
Identities = 21/128 (16%), Positives = 40/128 (31%), Gaps = 18/128 (14%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFIS--TSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
P P ++ G PP+P + +P+ P+ P P+ A +
Sbjct: 590 PAPGAAGGEGPPAP-ASSGPPEEAARPAAPAAPAAP-------------AAPAPAGAAAA 635
Query: 84 PPPPSASDKHDSQSSPHITQHTG-PGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
P SA+ + H +H P + +P++ P+ +
Sbjct: 636 PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPA-APAAPA 694
Query: 143 GGSNGQTH 150
G + Q
Sbjct: 695 GAAPAQPA 702
Score = 31.5 bits (72), Expect = 2.8
Identities = 14/78 (17%), Positives = 22/78 (28%), Gaps = 4/78 (5%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL---TSLTSPHQMPPPSPIA 81
P + ++PP G S + P P + P P P+P A
Sbjct: 710 PAGQADDPAAQPPQAAQGA-SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768
Query: 82 TSPPPPSASDKHDSQSSP 99
P S + +
Sbjct: 769 APAAAPPPSPPSEEEEMA 786
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 38.5 bits (89), Expect = 0.016
Identities = 48/230 (20%), Positives = 71/230 (30%), Gaps = 43/230 (18%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG--HHTLTSLTSP----HQMPPPS 78
PP + G + P VS P QH H+ + H P +
Sbjct: 768 RPPQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWA 827
Query: 79 PIATSPPP---PSASDKHDSQSS-PHITQHTGPGIFQ----------------------S 112
P PP SA D S PH+ TGP Q S
Sbjct: 828 PRPPHLPPQWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYSQTLVSSSAPSWSS 887
Query: 113 PPPTSHISKIPS--PSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQ 170
P P + I IP+ P P S + G S+G + + Y Q
Sbjct: 888 PQPRAPIRPIPTRFPPPPMPLQDSMAV----GCDSSGTACPSM-----PFASDYSQGAFT 938
Query: 171 QDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGDVNPDEFNQ 220
D T + P ++ + SQ++ + +++ N V P + Q
Sbjct: 939 PLDINAQTPKRPRVEESSHGPARCSQATTEAQEILSDNSEISVFPKDAKQ 988
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 38.5 bits (89), Expect = 0.016
Identities = 34/115 (29%), Positives = 41/115 (35%), Gaps = 16/115 (13%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP-----PPSPI 80
PSS+ PP P S PSQ P S P Q P PP+P
Sbjct: 287 QHPSSN----PPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPS 342
Query: 81 AT--SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
PPP + + +QS H GP F P S +P P + P SS
Sbjct: 343 MPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMP-----SNLPPPPALKPLSS 392
Score = 37.0 bits (85), Expect = 0.048
Identities = 27/141 (19%), Positives = 38/141 (26%), Gaps = 3/141 (2%)
Query: 7 LSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
L+ S P ++ +P P+ + PSP T
Sbjct: 189 LAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQT 248
Query: 67 SLTSPHQMPPPSPIATSPP--PPSASDKHDSQSSPHITQHTGPGIFQSPPPT-SHISKIP 123
+ Q P PS P H Q P QH Q S + +P
Sbjct: 249 ASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLP 308
Query: 124 SPSSSSPSSSSQLMQQGGGGG 144
PS + P S + Q
Sbjct: 309 LPSQAQPHSHTPPSQSALQPQ 329
Score = 37.0 bits (85), Expect = 0.058
Identities = 28/124 (22%), Positives = 39/124 (31%), Gaps = 5/124 (4%)
Query: 11 PASHKTSTSMEVDTPP--PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSL 68
P S + + P PP ++ P + G S P P QH TS
Sbjct: 404 PPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQH---PFTSG 460
Query: 69 TSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSS 128
P PPPS ++P P + + G P P I + P +
Sbjct: 461 GLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAE 520
Query: 129 SPSS 132
P S
Sbjct: 521 EPES 524
Score = 36.6 bits (84), Expect = 0.074
Identities = 37/139 (26%), Positives = 52/139 (37%), Gaps = 17/139 (12%)
Query: 25 PPPPSSSRGSEPPSPNTGGV--SGGFISTSPSQPSPQHGHHTLTSLTS--PHQMP-PPSP 79
PP + + + PP+P+ V G I+ P+ Q +L S S P ++P P P
Sbjct: 184 PPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPP 243
Query: 80 I------ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
+ SP PP+ S +H QSS H P Q P + PSS+ P
Sbjct: 244 LQPQTASQQSPQPPAPSSRH-PQSSHHGPGPPMPHALQQGPV-----FLQHPSSNPPQPF 297
Query: 134 SQLMQQGGGGGSNGQTHLH 152
Q Q H
Sbjct: 298 GLAQSQVPPLPLPSQAQPH 316
Score = 33.5 bits (76), Expect = 0.61
Identities = 32/134 (23%), Positives = 42/134 (31%), Gaps = 26/134 (19%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTS---LTSPHQMPP-PSP 79
+P PP + + SP S +S P P H L P PP P
Sbjct: 239 SPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFG 298
Query: 80 IATSPPPPSASDKHDSQSSPHITQHTGPGIFQ-------------------SPPPTSHIS 120
+A S PP SQ+ PH Q PPPT+ I
Sbjct: 299 LAQSQVPPLPL---PSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIP 355
Query: 121 KIPSPSSSSPSSSS 134
++P+ S P
Sbjct: 356 QLPNQSHKHPPHLQ 369
Score = 32.4 bits (73), Expect = 1.4
Identities = 30/110 (27%), Positives = 38/110 (34%), Gaps = 19/110 (17%)
Query: 50 STSPSQPSPQHG---------HHTLTSLTSPHQMPPP--SPIATSPPPPSASDKHDSQSS 98
S+SPS PSPQ L P PP + ++PPP ++ Q S
Sbjct: 149 SSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGS 208
Query: 99 PHITQHTGPGIFQSPPPTSHISKIPSPS---SSSPSSSSQLMQQGGGGGS 145
P Q P S +S I +PS PS L Q S
Sbjct: 209 PIAAQPAPQ-----PQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQS 253
Score = 32.4 bits (73), Expect = 1.6
Identities = 27/126 (21%), Positives = 38/126 (30%), Gaps = 12/126 (9%)
Query: 19 SMEVDTPPPP---------SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
M + PPPP + S P P + + P+QP +L
Sbjct: 376 QMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKA 435
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQ-HTGPGIFQSPPPTSHISKIPSPSSS 128
S H P + P P A S P I + P + PP + P S+
Sbjct: 436 STH--PHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSAL 493
Query: 129 SPSSSS 134
S
Sbjct: 494 PSSGGC 499
Score = 30.0 bits (67), Expect = 6.3
Identities = 30/125 (24%), Positives = 42/125 (33%), Gaps = 9/125 (7%)
Query: 81 ATSPPPPSASDKH---DSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLM 137
++SP PS D DS + + Q GP Q PP + P P+ S+ + Q
Sbjct: 149 SSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQ-- 206
Query: 138 QQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQS 197
G Q Q + S Q+ + H +Q QQ Q S
Sbjct: 207 ----GSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSR 262
Query: 198 SQQQS 202
Q S
Sbjct: 263 HPQSS 267
Score = 30.0 bits (67), Expect = 7.0
Identities = 26/139 (18%), Positives = 41/139 (29%), Gaps = 10/139 (7%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
SHK ++ +P P S PP+ +S+ P+ P L +
Sbjct: 359 NQSHKHPPHLQGPSPFPQMPSNLPPPPALKP-------LSSLPTHHPPSAHPPPLQLMPQ 411
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
+ PP + S ++S H G QSP + P+ P
Sbjct: 412 SQPLQSV---PAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPP 468
Query: 131 SSSSQLMQQGGGGGSNGQT 149
S S+G
Sbjct: 469 PSLPTSTPAAPPRASSGSQ 487
Score = 30.0 bits (67), Expect = 7.1
Identities = 29/133 (21%), Positives = 43/133 (32%), Gaps = 10/133 (7%)
Query: 1 MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQH 60
MP L PA T + + PP +S+ P++G SG S P
Sbjct: 409 MPQSQPLQSVPAQPPVLT--QSQSLPPKAST------HPHSGLHSGPPQSPFAQHPFTSG 460
Query: 61 GHHTLTSLTSPHQMPPPSPIATSPP--PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSH 118
G + S P +P S PP ++ + I + P +
Sbjct: 461 GLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAE 520
Query: 119 ISKIPSPSSSSPS 131
+ P P SPS
Sbjct: 521 EPESPPPPPRSPS 533
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 38.4 bits (89), Expect = 0.020
Identities = 33/166 (19%), Positives = 43/166 (25%), Gaps = 10/166 (6%)
Query: 37 PSPNTGGVSGGFISTSPSQPSPQH--GHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHD 94
T G G + P +P Q G T ++L + P+ + P
Sbjct: 80 TGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQ-MSRV 138
Query: 95 SQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQ 154
P Q G S P S P + M QG G Q
Sbjct: 139 GTMQPG-GQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQ------Q 191
Query: 155 NIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQ 200
+ P Q Q Q Q QQ +Q QQ
Sbjct: 192 QPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQ 237
Score = 38.1 bits (88), Expect = 0.023
Identities = 23/71 (32%), Positives = 27/71 (38%), Gaps = 8/71 (11%)
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK-IPSPSSS 128
P M PSP+ S P + GPG S PP S IPSP +
Sbjct: 407 QPGMMSSPSPVPQVQTNQSMP----QPPQPSVPSPGGPG---SQPPQSVSGGMIPSPPAL 459
Query: 129 SPSSSSQLMQQ 139
PS S Q+ Q
Sbjct: 460 MPSPSPQMSQS 470
Score = 35.7 bits (82), Expect = 0.12
Identities = 34/198 (17%), Positives = 44/198 (22%), Gaps = 28/198 (14%)
Query: 18 TSMEVDTPPPPSSSRGSEPPSPNTGGVSGG---FISTSPSQPSPQHGHHTLTSLTSPHQ- 73
T P G P G G + + + G + PHQ
Sbjct: 80 TGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNVRGQMPMG----AAGMGPHQM 135
Query: 74 -----MPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSS 128
M P S + + Q G Q+ +
Sbjct: 136 SRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMGPQQ--GQAQGQAGGM--------NQGQQ 185
Query: 129 SPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQ-----QDDTQHSTVQSPP 183
P Q Q G G G +Q G +Q Q Q Q P
Sbjct: 186 GPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPA 245
Query: 184 GQQQQYQQMDTSQSSQQQ 201
Q Q Q Q Q
Sbjct: 246 DAQAQMGQQQQGQGGMQP 263
Score = 34.6 bits (79), Expect = 0.27
Identities = 31/177 (17%), Positives = 40/177 (22%), Gaps = 21/177 (11%)
Query: 53 PSQPSPQHGH----HTLTSLTS-PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGP 107
G + L +LT + P P+ P P Q GP
Sbjct: 60 QQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMGQ------------QMGGP 107
Query: 108 G----IFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPS 163
G + QS + + S MQ GG G Q
Sbjct: 108 GTASNLLQSLNVRGQMPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMG 167
Query: 164 YGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGDVNPDEFNQ 220
Q Q + Q P QQQ QM + N
Sbjct: 168 PQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNP 224
Score = 33.4 bits (76), Expect = 0.65
Identities = 35/176 (19%), Positives = 44/176 (25%), Gaps = 14/176 (7%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
SS + PN G G Q Q G Q PP
Sbjct: 148 GGMMQQSSGQPQSQQPNQMGPQQG-------QAQGQAGGMNQGQQGPVGQQQPPQMGQPG 200
Query: 84 PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGG 143
P + Q P Q P + Q + + P+ + M Q G
Sbjct: 201 MPGGGGQGQMQQQGQPGGQQQQNPQMQQQL---QNQQQQQMDQQQGPADAQAQMGQQQQG 257
Query: 144 GSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQ 199
G G + P Q QQ Q S + P Q QQ Q
Sbjct: 258 --QGGMQPQQMQGGQMQVPMQQQPPQQQ--PQQSQLGMLPNQMQQMPGGGQGGPGQ 309
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 37.8 bits (87), Expect = 0.025
Identities = 32/129 (24%), Positives = 47/129 (36%), Gaps = 6/129 (4%)
Query: 10 FPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
HKT+ + PP S+ P +G +G ST P SP + T
Sbjct: 440 HVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNA 499
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS----KIPSP 125
+ +P ATSP SD ++ S I G + PPT S P
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPTPIV--IGVTTTATSPPTGTTSVPNATSPQV 557
Query: 126 SSSSPSSSS 134
+ SP +++
Sbjct: 558 TEESPVNNT 566
Score = 35.1 bits (80), Expect = 0.21
Identities = 33/164 (20%), Positives = 51/164 (31%), Gaps = 22/164 (13%)
Query: 8 SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPS----QPSPQHGHH 63
+ + + E ++ + PS T V+ G T S QP H
Sbjct: 546 TTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSH 605
Query: 64 TLTSLTSPHQMPPP-------SPIATSPPPPSASDKHDSQSSP-----HITQHTGPGIFQ 111
+ S P T P S H S SP +Q +GPG
Sbjct: 606 STPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSS 665
Query: 112 SP--PPTSHISKI-PSP---SSSSPSSSSQLMQQGGGGGSNGQT 149
+ P H+++ P+P S S+PS + G +
Sbjct: 666 TSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANS 709
Score = 29.8 bits (66), Expect = 7.9
Identities = 34/140 (24%), Positives = 53/140 (37%), Gaps = 20/140 (14%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTG---GVSGGFISTSPSQPSP 58
P S++P + M P P+++ S P T S G + S ++ +
Sbjct: 661 PGNSSTSRYPGEVHVTEGM-----PNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETS 715
Query: 59 QHGHHTLTSLTSPHQMPPPSP-IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
TS + +P AT+ PPS S S+ P T F SPP T+
Sbjct: 716 GSTLMASTSPHTNEGAFRTTPYNATTYLPPSTS----SKLRPRWT-------FTSPPVTT 764
Query: 118 HISKIPSPSSSSPSSSSQLM 137
+ +P P + P S+ M
Sbjct: 765 KQATVPVPPTQHPDHSNLSM 784
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 37.7 bits (87), Expect = 0.030
Identities = 32/181 (17%), Positives = 54/181 (29%), Gaps = 10/181 (5%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
P + + PP + ++ P+ TG + ++P +
Sbjct: 691 PGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAA 750
Query: 71 PHQMPPPSPIATSPPPPSASD-----KHDSQSSPHITQ--HTGPGIFQSP--PPTSHISK 121
P + PP+ PP+A+ + Q+ P Q P P PTS
Sbjct: 751 PGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLM 810
Query: 122 IPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQS 181
+ + L Q GG G+ L + + G T VQ+
Sbjct: 811 PRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAA-GPTPSPGSGTSDKIVQA 869
Query: 182 P 182
P
Sbjct: 870 P 870
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 37.6 bits (87), Expect = 0.035
Identities = 36/134 (26%), Positives = 47/134 (35%), Gaps = 40/134 (29%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP------- 76
TP PS EPP P + P SP + L TSP PP
Sbjct: 348 TPEAPSPPIEEEPPQP---------KAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASS 398
Query: 77 --------PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--------PPTSHIS 120
P+ P P SAS+ + + + + T P SP PPTS
Sbjct: 399 KSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRP---LSPYARYEDLKPPTS--- 452
Query: 121 KIPSPSSSSPSSSS 134
PSP++ + S S
Sbjct: 453 --PSPTAPTGVSPS 464
Score = 34.5 bits (79), Expect = 0.28
Identities = 25/124 (20%), Positives = 41/124 (33%), Gaps = 22/124 (17%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
S +S DT P +++ + PP N +S + P+ + +
Sbjct: 464 SVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAP 523
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--------PPTSHISKI 122
++PP A ++H +Q P SP PPTS
Sbjct: 524 SSTNEVVKVGNSAPPTALADEQHHAQPKPRPL---------SPYTMYEDLKPPTS----- 569
Query: 123 PSPS 126
P+PS
Sbjct: 570 PTPS 573
Score = 31.0 bits (70), Expect = 3.0
Identities = 15/81 (18%), Positives = 28/81 (34%), Gaps = 5/81 (6%)
Query: 53 PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS 112
P+ PSP S++S P P +A+ + ++ + ++
Sbjct: 450 PTSPSPTAPTGVSPSVSS----TSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDD 505
Query: 113 -PPPTSHISKIPSPSSSSPSS 132
PPTS P + S+
Sbjct: 506 LKPPTSPSPAAPVGKVAPSST 526
Score = 31.0 bits (70), Expect = 3.1
Identities = 29/111 (26%), Positives = 39/111 (35%), Gaps = 37/111 (33%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPP 85
PP S ++ P P + T P P + P + PP P A P
Sbjct: 328 VPPKESDAADGPKP---------VPTKPVTPEAP---------SPPIEEEPPQPKAVVPR 369
Query: 86 PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
P S ++ PPTS IP+P SSSP+SS +
Sbjct: 370 PLSPYTAYEDLK----------------PPTS---PIPTPPSSSPASSKSV 401
Score = 31.0 bits (70), Expect = 3.3
Identities = 21/108 (19%), Positives = 32/108 (29%), Gaps = 7/108 (6%)
Query: 24 TPPPPSSSRGSEPPS-PNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
T P S S S P + +T + P P + PP SP
Sbjct: 456 TAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPA 515
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
+P A S+ + + + H ++ P P SP
Sbjct: 516 APVGKVA-----PSSTNEVVKVGNSAPPTALADEQHHAQ-PKPRPLSP 557
>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
Length = 808
Score = 37.3 bits (86), Expect = 0.042
Identities = 28/107 (26%), Positives = 35/107 (32%), Gaps = 20/107 (18%)
Query: 20 MEVDTPPPPSSSRG-----SEPPSPNTGGVSGGFISTSPSQPS------PQHGHHTLTSL 68
ME +T P + SE SP T +S SPSQ Q+ T + L
Sbjct: 1 METETKEPEDPADNVNDVVSEASSPETD------LSLSPSQSEQNIENDGQNSPETQSPL 54
Query: 69 TSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP 115
T P P P SD +SS +PPP
Sbjct: 55 TELQPSPLPPNTTLDAP---VSDSQGDESSSEQQPQNPNSTEPAPPP 98
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 37.0 bits (86), Expect = 0.043
Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 5/80 (6%)
Query: 51 TSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIF 110
+S S P+ + H +S +S Q PPS + P S +P I P
Sbjct: 1 SSSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPS-ASSLLHTPSIL----PLPK 55
Query: 111 QSPPPTSHISKIPSPSSSSP 130
S P ++ P+ ++ +P
Sbjct: 56 LSSPSPPSVTLPPAATTQTP 75
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 34.2 bits (79), Expect = 0.046
Identities = 12/52 (23%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 239 TSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITLDIKADL 290
TS T G S+S + +S+ + ++S +++ ++ T IT + ++D
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSS--SSSGDSSSGTRITTESESDF 68
Score = 28.4 bits (64), Expect = 4.3
Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 4/65 (6%)
Query: 226 EVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITLD 285
+ G +SV + + S+ G S S++ +S + S ++ ++S TT D
Sbjct: 12 KRSGSSNTSVTSGSVSSSG--SNSSSSSSNSSNGGSSSSSSSG--DSSSGTRITTESESD 67
Query: 286 IKADL 290
DL
Sbjct: 68 FWGDL 72
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 34.9 bits (80), Expect = 0.051
Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 3/53 (5%)
Query: 381 PYCCSVCMKTFARKEHLENHTRCHTGETPYKCQYCGKNFSRKEHMVNHV-RKH 432
PY C +C+ F+ L+ H R E C CGK F + ++HV +KH
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIR--YTEHSKVCPVCGKEFRNTDSTLDHVCKKH 123
Score = 33.3 bits (76), Expect = 0.15
Identities = 16/50 (32%), Positives = 20/50 (40%), Gaps = 2/50 (4%)
Query: 604 TPYQCQYCPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKDHLNNHV 653
+PY C C FS L H+R E C C + F D +HV
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119
Score = 32.9 bits (75), Expect = 0.22
Identities = 20/61 (32%), Positives = 27/61 (44%), Gaps = 5/61 (8%)
Query: 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHIRKNHTLANVNV 838
P+ C +C F+ L H+R E VCP C K F + H+ K H N+ V
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIR--YTEHSKVCPVCGKEFRNTDSTLDHVCKKH---NICV 127
Query: 839 S 839
S
Sbjct: 128 S 128
Score = 32.5 bits (74), Expect = 0.31
Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 6/51 (11%)
Query: 688 SPHKCDFCLKTFTRKEHLTNHLKQHAGETEH--VCNICSKPFLKKEHLVNH 736
SP+ C CL F + LKQH TEH VC +C K F + ++H
Sbjct: 72 SPYVCPLCLMPF----SSSVSLKQHIRYTEHSKVCPVCGKEFRNTDSTLDH 118
Score = 31.4 bits (71), Expect = 0.80
Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 6/52 (11%)
Query: 548 SPHKCNYCAKSFTRKEHLVNHVR--QHTGDSPHKCNYCAKSFTRKEHLVNHV 597
SP+ C C F+ L H+R +H+ C C K F + ++HV
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRYTEHS----KVCPVCGKEFRNTDSTLDHV 119
Score = 31.4 bits (71), Expect = 0.82
Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 2/50 (4%)
Query: 492 TPFKCEICTKSFTRKEHFTNHIMWHTGETPHRCDFCSKTFTRKEHLLNHV 541
+P+ C +C F+ HI + E C C K F + L+HV
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119
Score = 31.0 bits (70), Expect = 0.89
Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 2/50 (4%)
Query: 576 SPHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQYCPKAFSRKDHLVNHV 625
SP+ C C F+ L H+R C C K F D ++HV
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119
Score = 30.6 bits (69), Expect = 1.4
Identities = 20/65 (30%), Positives = 26/65 (40%), Gaps = 11/65 (16%)
Query: 660 SPHKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFCLKTFTRKEHLTNHL-KQHAGETEH 718
SP+ C C F+ L HIR E C C K F + +H+ K+H
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHVCKKH------ 123
Query: 719 VCNIC 723
NIC
Sbjct: 124 --NIC 126
Score = 30.2 bits (68), Expect = 1.8
Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 2/50 (4%)
Query: 520 TPHRCDFCSKTFTRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHV 569
+P+ C C F+ L H+R E C C K F + ++HV
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119
Score = 29.8 bits (67), Expect = 2.9
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 2/50 (4%)
Query: 408 TPYKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYVNHV 457
+PY C C FS + H+R E C +C K F + ++HV
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIR--YTEHSKVCPVCGKEFRNTDSTLDHV 119
Score = 29.1 bits (65), Expect = 5.3
Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 2/47 (4%)
Query: 298 HTCDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSH 344
+ C LC F + H RY K C VCGK F + H
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDH 118
Score = 28.3 bits (63), Expect = 9.2
Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 2/50 (4%)
Query: 632 SPHKCSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHI 681
SP+ C C F+ L H+R E C C K F + +H+
Sbjct: 72 SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 36.3 bits (83), Expect = 0.064
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 5/79 (6%)
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS-----HIS 120
+ T+P + PP S PP S S S H SPPP S ++
Sbjct: 18 SQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLA 77
Query: 121 KIPSPSSSSPSSSSQLMQQ 139
K PS SSS S+ SQ + +
Sbjct: 78 KTPSSSSSQTSTPSQPLPR 96
Score = 30.1 bits (67), Expect = 5.6
Identities = 29/106 (27%), Positives = 41/106 (38%), Gaps = 5/106 (4%)
Query: 28 PSSSRGSEPPSPNTGGVSGGFI-STSPSQPSP-QHGHHTLTSLTSPHQMPPPSPIATSPP 85
P + S+ +P+ G S P PSP Q +L P ++ SP S
Sbjct: 12 PGPANRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSAT 71
Query: 86 PPSASDKHDSQSSPHITQHTGPGIFQSPPPTS---HISKIPSPSSS 128
PP+ K S SS + + P P PT S +P P +S
Sbjct: 72 PPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNS 117
>gnl|CDD|225751 COG3210, FhaB, Large exoproteins involved in heme utilization or
adhesion [Intracellular trafficking and secretion].
Length = 1013
Score = 36.2 bits (83), Expect = 0.088
Identities = 26/202 (12%), Positives = 51/202 (25%), Gaps = 16/202 (7%)
Query: 95 SQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQ 154
+ + + Q I I S + + Q GQ +
Sbjct: 269 NGGTVGLPSTNAGLSLQVSASGGVIGDINLGSGNLGDLVALGGQPIVAYLKGGQIGGQVN 328
Query: 155 NIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQ-QSDLIKTNGTGDV 213
NIG L+ + + + + + G +
Sbjct: 329 NIGVQLASDKWDYKQEGLTGGGAAIIANAVTVLTGTAGAGLIGGGAIAGTGALAGLGGTL 388
Query: 214 NPDEFNQLLAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNG---------VS 264
+ A G L + +S G S +GD+ + NG +
Sbjct: 389 ANAASLVIAA-----GNLTINAGALLASQ-GSVSLINTKGDIGATGGNGGLKELGAQSLL 442
Query: 265 GTNSVITNNSTENNTTSITLDI 286
NS +++ +N T +
Sbjct: 443 LNNSGGRSSTVKNLTVAAATAG 464
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This
family contains bacterial fibronectin-attachment
proteins (FAP). Family members are rich in alanine and
proline, are approximately 300 long, and seem to be
restricted to mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 35.6 bits (82), Expect = 0.10
Identities = 18/68 (26%), Positives = 23/68 (33%), Gaps = 12/68 (17%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
PPPP S + P+P + QP P+ PPP P +
Sbjct: 42 APPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDD------------PNAAPPPPPADPN 89
Query: 84 PPPPSASD 91
PPP D
Sbjct: 90 APPPPPVD 97
Score = 30.2 bits (68), Expect = 5.1
Identities = 17/60 (28%), Positives = 23/60 (38%)
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
T+ P PPP A + P P+A ++P Q P PPP + P P
Sbjct: 36 TANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPP 95
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 31.2 bits (71), Expect = 0.11
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 663 KCHFCSKSFTRKEHLNNHIRLH 684
KC C KSF+RK +L H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 30.4 bits (69), Expect = 0.18
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 467 VCQVCGKKYTRKEHLANHMRSH 488
C CGK ++RK +L H+R+H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 29.6 bits (67), Expect = 0.31
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 411 KCQYCGKNFSRKEHMVNHVRKH 432
KC CGK+FSRK ++ H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 29.6 bits (67), Expect = 0.35
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 635 KCSYCKQSFTRKDHLNNHVRLH 656
KC C +SF+RK +L H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 29.2 bits (66), Expect = 0.45
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 607 QCQYCPKAFSRKDHLVNHVRQH 628
+C C K+FSRK +L H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 29.2 bits (66), Expect = 0.55
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 495 KCEICTKSFTRKEHFTNHIMWH 516
KC C KSF+RK + H+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 28.5 bits (64), Expect = 1.00
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 691 KCDFCLKTFTRKEHLTNHLKQH 712
KC C K+F+RK +L HL+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 28.5 bits (64), Expect = 1.1
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 551 KCNYCAKSFTRKEHLVNHVRQH 572
KC C KSF+RK +L H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 28.5 bits (64), Expect = 1.1
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 579 KCNYCAKSFTRKEHLVNHVRQH 600
KC C KSF+RK +L H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 28.1 bits (63), Expect = 1.3
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 523 RCDFCSKTFTRKEHLLNHVRQH 544
+C C K+F+RK +L H+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 27.3 bits (61), Expect = 2.2
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 748 CTDCGKSFPLKGNLLFHQRSH 768
C DCGKSF K NL H R+H
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
Score = 26.9 bits (60), Expect = 3.5
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 439 RCEICKKAFTRKEHYVNHVMWH 460
+C C K+F+RK + H+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
activation/proliferation-associated protein-1 C term.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 343 and 708 amino
acids in length. This family is the C terminal region of
caprin-1. Caprin-1 is a protein involved in regulating
cellular proliferation. In mutated phenotypes, the G1
phase of the cell cycle is greatly lengthened, impairing
normal proliferation. The C terminal region of caprin-1
contains RGG motifs which are characteristic of RNA
binding domains. It is possible that caprin-1 functions
through an RNA binding mechanism.
Length = 319
Score = 35.3 bits (81), Expect = 0.12
Identities = 48/198 (24%), Positives = 70/198 (35%), Gaps = 38/198 (19%)
Query: 25 PPPPSSSRGSEPPS----------PNTGGVSGGFISTSP-------SQPSPQHG--HHTL 65
PP S SR S+P + P S G+ S+ P ++P PQ
Sbjct: 43 PPVHSESRLSQPSAVPVQPEPTQVPMVSPTSEGYTSSPPLYQPSHTAEPRPQTDPIDPIQ 102
Query: 66 TSLT-SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPS 124
S++ + Q P S + + P S I + P FQS +++ P
Sbjct: 103 ASMSLNSEQTPTSSSLPAASQPQVFQTGSKPLHSSGINVNAAP--FQSMQTVFNMNA-PV 159
Query: 125 PSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPG 184
P + P S L QQ S Q S QH ++Q + Q +Q+ G
Sbjct: 160 PPVNEPES---LKQQNQYQASYNQGF-----------SSQPQHPVEQTELQQEQLQTVVG 205
Query: 185 Q-QQQYQQMDTSQSSQQQ 201
Q QQ + S QQ
Sbjct: 206 SFHSQDQQAHQAASGHQQ 223
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 35.6 bits (82), Expect = 0.13
Identities = 18/113 (15%), Positives = 28/113 (24%), Gaps = 6/113 (5%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
P P +S S P Q H P + P
Sbjct: 175 PPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRD-YDHPRPDWDRPRRDRTDRP---E 230
Query: 84 PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
PPP + P I P ++ P+P+ +++L
Sbjct: 231 PPPGAGHVHRGGPGPPERDDAPVVPI--RPSAPGPLAAQPAPAPGPGEPTARL 281
Score = 35.2 bits (81), Expect = 0.20
Identities = 27/172 (15%), Positives = 42/172 (24%), Gaps = 50/172 (29%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
PPPP + R SEP P G +P +G P +
Sbjct: 97 PPPPHARRTSEPELPRPGR-----------RPYEGYGGP---RADDRPPGLPRQDQLPTA 142
Query: 85 PPPSASDKHDSQSSPHITQHTGPGIFQSPP--------PTSHISKIPSPSSSSPSSSSQL 136
P + P Q PG + + P S +S + +
Sbjct: 143 RP----------AYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQER 192
Query: 137 MQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQ 188
++ G P Y Q D + P + +
Sbjct: 193 DREPYDAG----------------RPEYDQRRRDYDHPRPD--WDRPRRDRT 226
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 35.4 bits (81), Expect = 0.14
Identities = 27/131 (20%), Positives = 40/131 (30%), Gaps = 8/131 (6%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRG----SEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
H+T+T+ + + P ++G S P S S SP P T
Sbjct: 201 TQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQST 260
Query: 67 SLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPS 126
+ +PP S S ++P T +P PT+ SP
Sbjct: 261 TGDGQEHTQRR----KTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPP 316
Query: 127 SSSPSSSSQLM 137
SSP
Sbjct: 317 HSSPPGVQANP 327
Score = 30.4 bits (68), Expect = 4.7
Identities = 31/191 (16%), Positives = 53/191 (27%), Gaps = 12/191 (6%)
Query: 13 SHKTSTSMEVDTPPPPS--SSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
S TS EV P PS + + + + +S++ + +
Sbjct: 175 SKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSS----NP 230
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS--HISKIPSPSSS 128
Q PP S PS S +H ++ TG G + + S SP S+
Sbjct: 231 EPQTEPPP----SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHST 286
Query: 129 SPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQ 188
+ + Q+ G + SP Q + PP
Sbjct: 287 ATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSI 346
Query: 189 YQQMDTSQSSQ 199
+ +
Sbjct: 347 CYGVGIYNEAL 357
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 34.9 bits (80), Expect = 0.14
Identities = 22/130 (16%), Positives = 32/130 (24%), Gaps = 7/130 (5%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
S TP PP S I T+ +P+ H L S
Sbjct: 143 HGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPT-----HPLKSFAV 197
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
P P S P PS + + S + + PS+
Sbjct: 198 P--ALPASMSTIEPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLPVTVPSAPDV 255
Query: 131 SSSSQLMQQG 140
+ M +
Sbjct: 256 LETGGKMLED 265
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 35.7 bits (82), Expect = 0.15
Identities = 21/137 (15%), Positives = 33/137 (24%), Gaps = 16/137 (11%)
Query: 31 SRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS----PIATSP-- 84
S P S S S+ + + T+ P PS I T P
Sbjct: 522 SIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASPSTAGSPRPKPSLGKFVIGTDPFA 581
Query: 85 -PPPSASDKHDSQSSPHITQHTGPGIFQSPPPT---------SHISKIPSPSSSSPSSSS 134
+ + + G S P T + K PS S + ++
Sbjct: 582 FANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPGSDLKPATLNGKTPSSSLVGAARNA 641
Query: 135 QLMQQGGGGGSNGQTHL 151
+ G
Sbjct: 642 GASSKVKIPSGLGGFTS 658
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 34.6 bits (80), Expect = 0.15
Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 1/96 (1%)
Query: 21 EVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI 80
E D P + + ++PP V G + P+ P + PP P
Sbjct: 52 EPDMMPAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPP-NTPVEPEPAPVEPPKPK 110
Query: 81 ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPT 116
P P + ++ P P + + PT
Sbjct: 111 PVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 33.8 bits (77), Expect = 0.15
Identities = 32/105 (30%), Positives = 38/105 (36%), Gaps = 9/105 (8%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI---- 80
PPPP + PP P GV+G I P P L +PPP P+
Sbjct: 38 PPPPLPGGAAIPPPPPLPGVAG--IPPPPPLPGAT-AIPPPPPLPGAAGIPPPPPLPGGA 94
Query: 81 ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
PPPP P + GPG+ PPP IP P
Sbjct: 95 GIPPPPPPLPGGAAVPPPPPLP--GGPGVPPPPPPFPGAPGIPPP 137
Score = 30.3 bits (68), Expect = 2.7
Identities = 20/64 (31%), Positives = 21/64 (32%), Gaps = 15/64 (23%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
PPP PP P GG G P P +PPP P SP
Sbjct: 100 PPPLPGGAAVPPPPPLPGG--PGVPPPPPPFPGAP-------------GIPPPPPGMGSP 144
Query: 85 PPPS 88
PPP
Sbjct: 145 PPPP 148
Score = 29.9 bits (67), Expect = 3.9
Identities = 32/111 (28%), Positives = 36/111 (32%), Gaps = 9/111 (8%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI---A 81
PPPP PP P G G I P P G L +PPP P+ A
Sbjct: 2 PPPPLPGGVGIPPPPPLPG--GVCIPPPPPLPGGT-GIPPPPPLPGGAAIPPPPPLPGVA 58
Query: 82 TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
PPPP P + G PPP + IP P P
Sbjct: 59 GIPPPPPLPGATAIPPPPPL---PGAAGIPPPPPLPGGAGIPPPPPPLPGG 106
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 35.3 bits (81), Expect = 0.16
Identities = 31/129 (24%), Positives = 45/129 (34%), Gaps = 12/129 (9%)
Query: 21 EVDTPPPPSSSRGSEPPSPNTGGV---SGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP 77
E T PPS + PS N SG S+S S+ S + +S + + PP
Sbjct: 371 EQATEKPPSRNTPPSAPSSNPEPAASSSGSSSSSSGSESSSGSDSESESSSSDSEENEPP 430
Query: 78 SPIATSPPPPSAS----DKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
+ P PPS + D ++ +PH S PP + SS
Sbjct: 431 RTASPEPEPPSTNKWQLDNWLNKVNPHKVSPAESV--SSNPPIKQPMEK---EGKVKSSG 485
Query: 134 SQLMQQGGG 142
SQ +
Sbjct: 486 SQYHPESKE 494
Score = 31.8 bits (72), Expect = 1.8
Identities = 21/93 (22%), Positives = 32/93 (34%), Gaps = 11/93 (11%)
Query: 62 HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK 121
+ + P P SP P + S S H + HT P P+ +IS+
Sbjct: 69 SNQPHLVGIPKNSVPQSPNNKGKPSFFGDQRGRSIPSSHQSTHTSA---PGPAPSGNISQ 125
Query: 122 IPSPS--------SSSPSSSSQLMQQGGGGGSN 146
S + P S+ Q+ G GS+
Sbjct: 126 SQKRSQPRSQWTKAGHPPSTVSSSQRHGQQGSS 158
Score = 30.3 bits (68), Expect = 7.0
Identities = 23/114 (20%), Positives = 33/114 (28%), Gaps = 5/114 (4%)
Query: 26 PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP-----PSPI 80
S R E + + SQ S Q+ +L S +S Q P P
Sbjct: 171 CRSKSPRDLEVNVQSPERPPDSLVLLPSSQHSVQNFPPSLMSKSSAMQQKPTAYVRPMDG 230
Query: 81 ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
P S K + S P + ++K+ PS S S
Sbjct: 231 QDQAPDESPELKSSIEESYGQQSFGKTMDELKSPAKAKLTKLKIPSQPVEQSYS 284
>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
Length = 55
Score = 31.6 bits (71), Expect = 0.16
Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 410 YKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYV 454
Y+C CG+ + +++ M+ H+RKH T + CK+ R Y+
Sbjct: 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48
Score = 27.3 bits (60), Expect = 5.7
Identities = 7/24 (29%), Positives = 15/24 (62%)
Query: 468 CQVCGKKYTRKEHLANHMRSHTND 491
C +CG+ Y +++ + H+R H +
Sbjct: 8 CPICGEIYIKRKSMITHLRKHNTN 31
Score = 26.9 bits (59), Expect = 6.7
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 810 CPHCSKSFVEHTCMIKHIRKNHT 832
CP C + +++ MI H+RK++T
Sbjct: 8 CPICGEIYIKRKSMITHLRKHNT 30
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 35.0 bits (80), Expect = 0.17
Identities = 37/170 (21%), Positives = 50/170 (29%), Gaps = 26/170 (15%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQ------ 55
PI C ++ P + S D P S G P + T PS
Sbjct: 123 PIPC-FAEVPVFPRPYQSSGDDDGPSTSRGSGVARVRPTV--IQHRVDKTRPSDYENHRP 179
Query: 56 -----PSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS----------ASDKHDSQSSPH 100
+P + P Q P PS SP P+ + SP
Sbjct: 180 RPFAMANPSWVDEPDAAAQRPPQ-PGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAPSPP 238
Query: 101 ITQHTGPGIFQSPPPTSHI-SKIPSPSSSSPSSSSQLMQQGGGGGSNGQT 149
T P + P + + P PSS P + Q G+N QT
Sbjct: 239 RTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQT 288
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 35.3 bits (81), Expect = 0.18
Identities = 20/66 (30%), Positives = 22/66 (33%), Gaps = 13/66 (19%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
P P R P P G PSP S +P Q PP + A P
Sbjct: 397 PDLPQPDRHPGPAKPEAPGAR------PAELPSPA-------SAPTPEQQPPVARSAPLP 443
Query: 85 PPPSAS 90
P P AS
Sbjct: 444 PSPQAS 449
Score = 33.0 bits (75), Expect = 0.85
Identities = 22/112 (19%), Positives = 31/112 (27%), Gaps = 18/112 (16%)
Query: 55 QPSPQHGHHTLTSLTSPHQMPPPSPIATSPP--PPSASDKHDSQSSPHITQHTGPGIFQS 112
PSP P P P P P + + SP + P Q
Sbjct: 381 APSPAGSPDVKKKAPEP---DLPQPDRHPGPAKPEAPGARPAELPSPA----SAPTPEQQ 433
Query: 113 PPPTSHISKIPSPSSSSPSSSSQL---------MQQGGGGGSNGQTHLHIQN 155
PP PSP +S+P + + + NG +Q
Sbjct: 434 PPVARSAPLPPSPQASAPRNVASGKPGVDLGSWQGKFMNFTRNGSRKQPVQA 485
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 35.1 bits (81), Expect = 0.20
Identities = 29/148 (19%), Positives = 42/148 (28%), Gaps = 37/148 (25%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
Q L A + ++ P +S+ S P + S+P G
Sbjct: 95 ERQLGLLGVKA-KDSQFTVVSQAKKSPPASKTSTPMNT--------------SEPL-VPG 138
Query: 62 HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK 121
H + + SP + PS SP T P Q P S
Sbjct: 139 HSSFSD--SPSRSASPSRK----------------FSPSSTIQQSP---QLTPSNKPASP 177
Query: 122 IPSPSSSSPSSSSQLMQQGGGGGSNGQT 149
S S S SSS + G + +
Sbjct: 178 SSSYQSPSYSSSLGPVNSSGNRSNLRSS 205
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 34.7 bits (79), Expect = 0.22
Identities = 22/125 (17%), Positives = 43/125 (34%), Gaps = 15/125 (12%)
Query: 16 TSTSMEVDTPPPPSSSRGSEPPSPNTGGVS-----GGFISTSPSQPSPQHGHHTLTSLTS 70
T ++ V P S+ S +P++ + ++T + + ++
Sbjct: 295 THGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVL 354
Query: 71 P--------HQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKI 122
P +P P+ P P S ++ S + + GP P S+I
Sbjct: 355 PSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIP-- 412
Query: 123 PSPSS 127
SP+S
Sbjct: 413 VSPTS 417
Score = 29.7 bits (66), Expect = 8.6
Identities = 16/95 (16%), Positives = 26/95 (27%)
Query: 51 TSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIF 110
T P T +S +P PSP + S S+ ++
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSD 357
Query: 111 QSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGS 145
+ P T + + S+ QQ G
Sbjct: 358 VTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNI 392
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 34.9 bits (81), Expect = 0.22
Identities = 15/72 (20%), Positives = 18/72 (25%), Gaps = 23/72 (31%)
Query: 137 MQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQ 196
Q+G G GQ G Q Q GQQQ Q
Sbjct: 617 AQRGEQQGQQGQ----------------GG---QGQGQPGQQGQQGQGQQQGQQGQGGQG 657
Query: 197 SS----QQQSDL 204
++Q L
Sbjct: 658 GQGSLAERQQAL 669
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 34.4 bits (79), Expect = 0.23
Identities = 29/127 (22%), Positives = 43/127 (33%), Gaps = 13/127 (10%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGG--FISTSPSQPSPQHGHHTLTSL 68
S K T + + PPP S S+ PS SG S+ S+ + TS
Sbjct: 195 TNSFKEVTPVGLMRTPPPGS--HSKSPS-----KSGIPDLPSSRDSEKTKPEKPQQETSS 247
Query: 69 TSPHQMPPPSPIATSPP-PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
+ P P T P P + D+ + +S ++ PS S
Sbjct: 248 MDTEKSSAPKPRETLDPKSPEKAPPIDTTEEE--LKSPEASPKESEEASARKR-SPSLLS 304
Query: 128 SSPSSSS 134
SP + S
Sbjct: 305 PSPKAES 311
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 30.1 bits (68), Expect = 0.24
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 662 HKCHFCSKSFTRKEHLNNHIRLH 684
++C C K F K L H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 29.7 bits (67), Expect = 0.39
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 410 YKCQYCGKNFSRKEHMVNHVRKH 432
Y+C CGK F K + H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 29.4 bits (66), Expect = 0.45
Identities = 10/23 (43%), Positives = 12/23 (52%)
Query: 522 HRCDFCSKTFTRKEHLLNHVRQH 544
+RC C K F K L H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 29.4 bits (66), Expect = 0.47
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 466 HVCQVCGKKYTRKEHLANHMRSH 488
+ C CGK + K L HMR+H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 29.4 bits (66), Expect = 0.48
Identities = 10/23 (43%), Positives = 12/23 (52%)
Query: 606 YQCQYCPKAFSRKDHLVNHVRQH 628
Y+C C K F K L H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 29.0 bits (65), Expect = 0.66
Identities = 8/23 (34%), Positives = 12/23 (52%)
Query: 634 HKCSYCKQSFTRKDHLNNHVRLH 656
++C C + F K L H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 28.6 bits (64), Expect = 0.81
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 550 HKCNYCAKSFTRKEHLVNHVRQH 572
++C C K F K L H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 28.6 bits (64), Expect = 0.81
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 578 HKCNYCAKSFTRKEHLVNHVRQH 600
++C C K F K L H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 27.8 bits (62), Expect = 1.4
Identities = 8/23 (34%), Positives = 12/23 (52%)
Query: 690 HKCDFCLKTFTRKEHLTNHLKQH 712
++C C K F K L H++ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 5.3
Identities = 10/23 (43%), Positives = 10/23 (43%)
Query: 382 YCCSVCMKTFARKEHLENHTRCH 404
Y C C K F K L H R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 5.8
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 298 HTCDLCGKFFPYRYQMIVHRRYH 320
+ C CGK F + + H R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 25.9 bits (57), Expect = 6.8
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 780 FKCDVCPKDFTCKGHLVSHLRSH 802
++C C K F K L H+R+H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 25.9 bits (57), Expect = 7.4
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 354 YTCAVCFHVFANEESLQRHTKRH 376
Y C C VF ++ +L+ H + H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 34.9 bits (80), Expect = 0.24
Identities = 33/140 (23%), Positives = 44/140 (31%), Gaps = 14/140 (10%)
Query: 12 ASHKTSTSMEVDTPP-PPSSSRGSEPPSPNTGGVSGGFISTSPSQP--------SPQHGH 62
A+ TS+ D P+S S PS G SP Q S HG
Sbjct: 98 ATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGT 157
Query: 63 HTLTSLTSPHQMPPPSPIATSPPP-PSASDKHDSQSSPHITQHTG----PGIFQSPPPTS 117
H T + +P A S PS + SPH Q F P
Sbjct: 158 HPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRK 217
Query: 118 HISKIPSPSSSSPSSSSQLM 137
+ + S P SS+ ++
Sbjct: 218 NSRHSSNSMPSFPHSSTAVL 237
Score = 31.0 bits (70), Expect = 3.6
Identities = 29/117 (24%), Positives = 40/117 (34%), Gaps = 11/117 (9%)
Query: 28 PSSSRGSEPPSPNT---GGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
PSSS P G G + + P G ++ L SPH S TS
Sbjct: 150 PSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSM-QLPSPHFRQKFSSSDTSN 208
Query: 85 PPPSASDKHDSQ-------SSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
S + +S+ S PH + S IS +PSSS+ + S
Sbjct: 209 GFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMS 265
>gnl|CDD|177663 PLN00026, PLN00026, aquaporin NIP; Provisional.
Length = 298
Score = 34.5 bits (79), Expect = 0.24
Identities = 15/70 (21%), Positives = 21/70 (30%), Gaps = 7/70 (10%)
Query: 27 PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT------SLTSPHQMPPPSPI 80
+ S P +P T G GG + S +G L + + P
Sbjct: 1 MDNEEVPSAPSTPATPGTPGGPLFGGFRVDSLSYGDRKLLPSCKCFPVEALEWGRLP-TC 59
Query: 81 ATSPPPPSAS 90
PPP S
Sbjct: 60 FCDLPPPDVS 69
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 34.7 bits (80), Expect = 0.27
Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 18/143 (12%)
Query: 15 KTSTSMEVDTPP--PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT--SLTS 70
K+S + TPP P +S SP+ S F + + S ++
Sbjct: 638 KSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSA 697
Query: 71 PHQMP------------PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQ--SPPPT 116
P P P + + P +A++ + S+S + + Q + P
Sbjct: 698 PAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQ 757
Query: 117 SHISKIPSPSSSSPSSSSQLMQQ 139
S+++ + +S +Q
Sbjct: 758 VQAEAQSPASTTALTQTSSEVQD 780
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 34.0 bits (78), Expect = 0.30
Identities = 18/62 (29%), Positives = 24/62 (38%), Gaps = 11/62 (17%)
Query: 73 QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
Q PPP+P+ PPP PH P Q P S P+P+S P+
Sbjct: 143 QPPPPAPVVMMQPPP-----------PHAMPPASPPAAQPAPSAPASSPPPTPASPPPAK 191
Query: 133 SS 134
+
Sbjct: 192 AP 193
Score = 33.7 bits (77), Expect = 0.39
Identities = 19/65 (29%), Positives = 22/65 (33%), Gaps = 25/65 (38%)
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS 129
PH MPP SP A P P + + SPPPT SP +
Sbjct: 157 PPHAMPPASPPAAQPAPSAPA--------------------SSPPPTP-----ASPPPAK 191
Query: 130 PSSSS 134
SS
Sbjct: 192 APKSS 196
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 34.1 bits (78), Expect = 0.35
Identities = 23/118 (19%), Positives = 38/118 (32%), Gaps = 12/118 (10%)
Query: 25 PPPPSSSRGSEPPSPNTGGVS------GGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
P+ + + P+P F S +P+QP S+ P P
Sbjct: 477 AAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDD 536
Query: 79 PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
T P P+A+ +P T P + PP S P+ +++L
Sbjct: 537 AFETLAPAPAAA------PAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARL 588
Score = 30.6 bits (69), Expect = 4.6
Identities = 17/115 (14%), Positives = 39/115 (33%), Gaps = 15/115 (13%)
Query: 27 PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPP 86
P S G+ P + V+ + + + + +P P +P A +
Sbjct: 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAP-------APAAPPAAPAAAPAAAAAAR 417
Query: 87 PSASDKHDSQSSPHIT------QHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQ 135
A+ ++ SP Q + G +P P + P+ ++ ++ +
Sbjct: 418 AVAAA--PARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPR 470
>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 100
amino acids in length. This domain is found associated
with pfam00096. This domain has two conserved sequence
motifs: YPSPV and PSP.
Length = 100
Score = 31.7 bits (71), Expect = 0.36
Identities = 32/112 (28%), Positives = 46/112 (41%), Gaps = 16/112 (14%)
Query: 22 VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
D P S+ SPN S +S+ PS + + TS +SP PSP+
Sbjct: 1 ADKAAPVST------ASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVH 54
Query: 82 TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
TS P PS + T + FQ+ TS S + + S SSP ++
Sbjct: 55 TSFPSPSIA----------TTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTT 96
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 34.0 bits (78), Expect = 0.38
Identities = 22/128 (17%), Positives = 30/128 (23%), Gaps = 11/128 (8%)
Query: 21 EVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSL----------TS 70
P + S SP G F + +P TS T
Sbjct: 3 GKPRGQPGRWNYDSSDESP-EGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETG 61
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
S I T P PP ++ + P PP +
Sbjct: 62 SGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGADS 121
Query: 131 SSSSQLMQ 138
+S L Q
Sbjct: 122 TSPRFLYQ 129
Score = 30.5 bits (69), Expect = 4.4
Identities = 14/69 (20%), Positives = 21/69 (30%), Gaps = 4/69 (5%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
TP PP RG+ P S F+ Q + + S + +
Sbjct: 102 TPVPPGGFRGASSPRLGADSTSPRFLY----QVNFPVILAPIGESNSSSEELSEEEEHSR 157
Query: 84 PPPPSASDK 92
PPP +
Sbjct: 158 PPPSESLKV 166
>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein. This small,
uncommon, poorly conserved protein is found primarily in
the Firmicutes. It features are pair of CxxC motifs
separated by about 20 amino acids, followed by a highly
hydrophobic region of about 45 amino acids. It has no
conserved gene neighborhood, and its function is
unknown.
Length = 94
Score = 31.2 bits (71), Expect = 0.44
Identities = 12/50 (24%), Positives = 20/50 (40%), Gaps = 7/50 (14%)
Query: 466 HVCQVCGKKYTRKEHLANHMRSHTNDTPFKCEICTKSFTRKEHFTNHIMW 515
+C+ C +K++ KE L + P KC C K++ T
Sbjct: 1 PICKNCNEKFSYKELLKSLFSL---YRPIKCPNCGT----KQYLTAKSRK 43
Score = 29.6 bits (67), Expect = 1.6
Identities = 10/42 (23%), Positives = 17/42 (40%), Gaps = 3/42 (7%)
Query: 438 HRCEICKKAFTRKEHYVNHVMWHTGETPHVCQVCGKKYTRKE 479
C+ C + F+ KE + ++ + P C CG K
Sbjct: 1 PICKNCNEKFSYKELLKS--LF-SLYRPIKCPNCGTKQYLTA 39
Score = 28.1 bits (63), Expect = 6.4
Identities = 10/45 (22%), Positives = 17/45 (37%), Gaps = 8/45 (17%)
Query: 411 KCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYVN 455
C+ C + FS KE + + P +C C + Y+
Sbjct: 2 ICKNCNEKFSYKELLKSLFSL---YRPIKCPNCG-----TKQYLT 38
>gnl|CDD|223965 COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75
kD subunit (chain G) [Energy production and conversion].
Length = 693
Score = 33.9 bits (78), Expect = 0.45
Identities = 29/145 (20%), Positives = 43/145 (29%), Gaps = 42/145 (28%)
Query: 705 LTNHLKQHAGETEHVCNIC-------------------SKPFLKKEHLVNHS----RSHT 741
L NH C +C S+ K + +
Sbjct: 91 LINH--------PLDCPVCDKGGECELQDLAVKYGVSHSRYRETKRTHRDKDLGPLVKYD 142
Query: 742 GERPFACTDC---GKSFPLKGNLLFHQRSHRKQNGEVLERPFK------C-DVCPKD-FT 790
R CT C K L +R + G L++P + C DVCP T
Sbjct: 143 MNRCILCTRCVRFCKEIAGTHELGVIKRGENSEIGTYLDQPLESELCGNCVDVCPVGALT 202
Query: 791 CKGHLVSHLRSHLNERPHVCPHCSK 815
K + + L + P +C HC+
Sbjct: 203 SKPFAFTARKWELKKTPSICVHCAV 227
>gnl|CDD|221463 pfam12211, LMWSLP_N, Low molecular weight S layer protein N
terminal. This family of proteins is found in bacteria.
Proteins in this family are typically between 328 and
381 amino acids in length. There is a conserved LGDG
sequence motif. Clostridial species have a layer of
surface proteins surrounding their membrane. This layer
is comprised of a high molecular weight protein and a
low molecular weight protein. This domain is the N
terminal domain of the low molecular weight protein. It
is a structural domain.
Length = 253
Score = 33.2 bits (76), Expect = 0.45
Identities = 11/53 (20%), Positives = 20/53 (37%)
Query: 239 TSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITLDIKADLD 291
T ST G+ + AAD + ++ T+++ + L I A
Sbjct: 112 TGSTAGIVKAKDSSSTKTVAADAPLKVSDVFTFAVVTDSDKVGLKLTITAKAS 164
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 33.2 bits (76), Expect = 0.60
Identities = 23/146 (15%), Positives = 43/146 (29%), Gaps = 18/146 (12%)
Query: 1 MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPS----QP 56
P+ L FP +D P P + S S + ++ + S +
Sbjct: 255 TPLDISLFNFPLIPLLPPE-ALDLPAPTAVSSSSSTFA--VPALTSSSVDQSATPLDQGF 311
Query: 57 S-PQHGHHT----LTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGP--GI 109
S ++ T+ + + +P S S + +S +
Sbjct: 312 SNFGSNMYSEPLNPTNDSLLYGLPSSS----SLYANRTIFPAWASTSVSPLDFSTLFNQP 367
Query: 110 FQSPPPTSHISKIPSPSSSSPSSSSQ 135
SP + I P+S SP +
Sbjct: 368 SPSPMASQSILAPAQPTSPSPVALPS 393
Score = 30.2 bits (68), Expect = 6.3
Identities = 22/121 (18%), Positives = 36/121 (29%), Gaps = 1/121 (0%)
Query: 8 SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTS 67
S +S + P P++S E + + +S +
Sbjct: 129 SDSISSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTAPSGKRKRR 188
Query: 68 LTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
L+ Q+P P TS P + SD ++ + SP IP P S
Sbjct: 189 LSDGFQLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGD-IPPPVS 247
Query: 128 S 128
Sbjct: 248 V 248
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 33.2 bits (76), Expect = 0.62
Identities = 18/102 (17%), Positives = 28/102 (27%), Gaps = 23/102 (22%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
P P +P SP +P+P T + +PP P+ +
Sbjct: 361 VPVPAPQPAKPTAAAP------------SPVRPTP--APSTRPKAAAAANIPPKEPVRET 406
Query: 84 PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
PP + + PH P + IP
Sbjct: 407 ATPPPVPPRPVAPPVPHT---------PESAPKLTRAAIPVD 439
Score = 32.5 bits (74), Expect = 1.1
Identities = 17/100 (17%), Positives = 25/100 (25%), Gaps = 13/100 (13%)
Query: 16 TSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP 75
+ T PS R + PS + I
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETA-----------TPPPV 412
Query: 76 PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP 115
PP P+A P S ++++ I P PP
Sbjct: 413 PPRPVAPPVPHTPESAPKLTRAA--IPVDEKPKYTPPAPP 450
Score = 30.5 bits (69), Expect = 4.7
Identities = 18/82 (21%), Positives = 22/82 (26%), Gaps = 9/82 (10%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
PA + PP R + P P +P P LT
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP------VAPPVPHTPESAPKLTRAAI 436
Query: 71 PHQMPPPSPIATSPPPPSASDK 92
P P T P PP +K
Sbjct: 437 P---VDEKPKYTPPAPPKEEEK 455
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 33.4 bits (76), Expect = 0.63
Identities = 32/132 (24%), Positives = 50/132 (37%), Gaps = 15/132 (11%)
Query: 6 ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
+L + S K +E P P+ + + P S +P+ PS + T
Sbjct: 62 DLEKMVLSIKEKKDLEPLALPMPNLAPITPPSS-----------EPTPAPPSSES---TA 107
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKI-PS 124
T P+Q S +TS +D S + + T + P TS + PS
Sbjct: 108 TRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPS 167
Query: 125 PSSSSPSSSSQL 136
SSS S++QL
Sbjct: 168 HISSSYRSTAQL 179
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 33.2 bits (76), Expect = 0.72
Identities = 41/251 (16%), Positives = 73/251 (29%), Gaps = 36/251 (14%)
Query: 54 SQPSPQHGHHTLTSL--TSPHQMPP-------PSPIATSPPPPSASDKHDSQSSPHITQH 104
S+ Q H+L S + +MP P I +PP QSS T
Sbjct: 32 SESPTQRASHSLASEGKKNRKKMPKVFQKSSAPRQIQAAPPQALNPTAAAPQSSRGPTLR 91
Query: 105 TGPGIFQSPPPTSHI--SKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSP 162
+ + T S + S + M+ G V+ P
Sbjct: 92 ELLALPEDDGETQAPESSPSARRLTRSEGVARHEMEDLAG--------------RPVVKP 137
Query: 163 SYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGD-VNPDEFNQL 221
+ Q + S+ + PP +++ ++ + L K + V+ +Q
Sbjct: 138 DADRQLRQDILNKSSSSRRPPVSKEEGTSSKMPATALASAALFKDDEIRQEVDAARSDQA 197
Query: 222 LAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTS 281
+ + + Q + +A G + + N +
Sbjct: 198 SQ--------SRLSRSRGNPPAIPPDAAPRQPMLTRSAGGRFEGEDENLERNLQPQSP-- 247
Query: 282 ITLDIKADLDL 292
ITLD K LD
Sbjct: 248 ITLDKKGKLDF 258
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 32.7 bits (74), Expect = 0.73
Identities = 30/129 (23%), Positives = 39/129 (30%), Gaps = 16/129 (12%)
Query: 24 TPPPPSSSR-------GSEPPSPNTGGVSGGFISTSPSQPSP------QHGHHTLTSLTS 70
+P PSSS G F S SP SP QH +T +
Sbjct: 86 SPSSPSSSSANWHEVEGGMSSPTMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHSG 145
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS--- 127
P P+P P + S PH P + P + + PSS
Sbjct: 146 ISASPHPTPSPLHFPTSPILPQQPSSYFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQQA 205
Query: 128 SSPSSSSQL 136
P+ S Q
Sbjct: 206 GQPNGSGQG 214
Score = 31.2 bits (70), Expect = 2.1
Identities = 19/73 (26%), Positives = 26/73 (35%), Gaps = 4/73 (5%)
Query: 27 PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPP 86
PSS + +P G V F+ T P P + P MP P TS
Sbjct: 199 DPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPPPP----MARPVPLPMPDTKPPTTSTEG 254
Query: 87 PSASDKHDSQSSP 99
+ S + S+P
Sbjct: 255 GATSPTSPTYSTP 267
>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family.
Length = 424
Score = 32.8 bits (75), Expect = 0.89
Identities = 18/71 (25%), Positives = 24/71 (33%), Gaps = 1/71 (1%)
Query: 29 SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
SS++ S G TS S H +S S H + P P + PPP
Sbjct: 136 SSAQRSSAQHLFYIIPGGPSWRTSIKPFSSSHYGSHNSSAYSDH-LRPLQPYSGELPPPL 194
Query: 89 ASDKHDSQSSP 99
+ S S
Sbjct: 195 GPNWQASNSKK 205
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 32.7 bits (75), Expect = 0.89
Identities = 22/125 (17%), Positives = 38/125 (30%), Gaps = 34/125 (27%)
Query: 17 STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
T P P + + +G +++S +P TL
Sbjct: 205 GTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPT----TL----------A 250
Query: 77 PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
P + S P P+A S P T+ + P +++S S+S
Sbjct: 251 PGVLEGSEPTPTAP--------------------SSAPATAPAAAAPQAAATSSSASGNF 290
Query: 137 MQQGG 141
+ Q G
Sbjct: 291 VVQVG 295
>gnl|CDD|132765 cd07303, Porin3, Eukaryotic porin family that forms channels in the
mitochondrial outer membrane. The porin family 3
contains two sub-families that play vital roles in the
mitochondrial outer membrane, a translocase for unfolded
pre-proteins (Tom40) and the voltage-dependent anion
channel (VDAC) that regulates the flux of mostly anionic
metabolites through the outer mitochondrial membrane.
Length = 274
Score = 32.2 bits (73), Expect = 0.93
Identities = 24/127 (18%), Positives = 41/127 (32%), Gaps = 9/127 (7%)
Query: 235 VVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENN----TTSITLDIKADL 290
T +T D S S +S + N+ + N T +++ D+
Sbjct: 66 TFTEKWNTDNTLGLEITVEDQLSRGLK--STFDSSFSPNTGKKNAKIKTGYKRINLGCDV 123
Query: 291 DLPVMG---NHTCDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSHGKC 347
D + G L + + YQM+ + F +FQA +
Sbjct: 124 DFDIAGPLIRGALVLGYEGWLAGYQMVFETVSRVTQSNFAVGYKTDYNEFQAHTNVNDGT 183
Query: 348 HLGGSMY 354
GGS+Y
Sbjct: 184 EFGGSIY 190
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 32.6 bits (74), Expect = 0.94
Identities = 34/206 (16%), Positives = 49/206 (23%), Gaps = 29/206 (14%)
Query: 6 ELSQFPASHKTSTSMEVDT------PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQ 59
+LS A + S VD+ P + ++ P + QP Q
Sbjct: 28 QLSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQ 87
Query: 60 HG-----HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP 114
H S P Q+ T P Q P Q P
Sbjct: 88 QQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQA----QQPQP 143
Query: 115 PTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDT 174
P P S Q Q Q+ + + L P + Q
Sbjct: 144 PPQV------PQQQQYQSPPQQPQYQQNPPPQAQSAPQV----SGLYPEESPYQPQ---- 189
Query: 175 QHSTVQSPPGQQQQYQQMDTSQSSQQ 200
+ + P + SQQ
Sbjct: 190 SYPPNEPLPSSMAMQPPYSGAPPSQQ 215
Score = 29.9 bits (67), Expect = 6.4
Identities = 26/153 (16%), Positives = 38/153 (24%), Gaps = 26/153 (16%)
Query: 49 ISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPG 108
+S +Q S H H ++ P P + + P
Sbjct: 29 LSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVN-----AALPPAPA----- 78
Query: 109 IFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHC 168
QSP P S +P S Q +Q++ +P +
Sbjct: 79 -PQSPQPD------QQQQSQAPPSHQYPSQL---------PPQQVQSVPQQPTPQQEPYY 122
Query: 169 IQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQ 201
Q Q P QQ Q Q Q
Sbjct: 123 PPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQS 155
Score = 29.9 bits (67), Expect = 6.5
Identities = 30/170 (17%), Positives = 36/170 (21%), Gaps = 27/170 (15%)
Query: 35 EPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPP--PPSASDK 92
E + + P P H P A PP P +
Sbjct: 32 EEAQSSEAHSFHVDSTKQPPAPEQVAKHEL-------ADAPLQQVNAALPPAPAPQSPQP 84
Query: 93 HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLH 152
Q S H P S P + +P + Q
Sbjct: 85 DQQQQSQAPPSHQYP----SQLPPQQVQSVPQQPTPQQEPYYPPPSQ--------PQPPP 132
Query: 153 IQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQS 202
Q A Q Q QSPP Q Q Q S Q
Sbjct: 133 AQQPQAQQPQPPPQV------PQQQQYQSPPQQPQYQQNPPPQAQSAPQV 176
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 31.2 bits (71), Expect = 0.95
Identities = 12/70 (17%), Positives = 19/70 (27%), Gaps = 7/70 (10%)
Query: 15 KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQM 74
+ T PP + + G PP P + + S P P+
Sbjct: 51 EDVTVQTTPPPPASAITNGGPPPPPPAR-------AEAASPPPPEAPAEPPAEPEPEAPA 103
Query: 75 PPPSPIATSP 84
+A P
Sbjct: 104 ENTVTVAKDP 113
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 32.4 bits (74), Expect = 0.97
Identities = 25/107 (23%), Positives = 32/107 (29%), Gaps = 9/107 (8%)
Query: 14 HKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSL---TS 70
++ VD P S S P V +P QP P + TS TS
Sbjct: 176 DDYGANLAVDEP---EQSTMSRP-QEVKQSVPAQQAPPNPQQPMPSASSESATSKSASTS 231
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
P P P D ++ IT +FQ P S
Sbjct: 232 RESSPQPQSPPPRRVPAPTVDSTAKEAE-QITYQAK-RLFQEYPERS 276
Score = 30.1 bits (68), Expect = 5.9
Identities = 13/67 (19%), Positives = 24/67 (35%), Gaps = 7/67 (10%)
Query: 73 QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
+P P ++ + S T QSPPP ++P+P+ S
Sbjct: 201 SVPAQQAPPNPQQPMPSASSESATSKSASTSRESSPQPQSPPP----RRVPAPTVDST-- 254
Query: 133 SSQLMQQ 139
++ +Q
Sbjct: 255 -AKEAEQ 260
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 32.9 bits (76), Expect = 0.98
Identities = 13/99 (13%), Positives = 24/99 (24%), Gaps = 4/99 (4%)
Query: 35 EPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHD 94
EP P + + + P P +P P S
Sbjct: 367 EPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ----L 422
Query: 95 SQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
+ + + G + P + P S+ +S
Sbjct: 423 LAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLAS 461
Score = 29.8 bits (68), Expect = 6.7
Identities = 13/81 (16%), Positives = 20/81 (24%), Gaps = 6/81 (7%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSP-SQPSPQHGHHTLTSLT 69
++ T+ PP + +P +TS Q L
Sbjct: 379 ASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ-----LQRAQ 433
Query: 70 SPHQMPPPSPIATSPPPPSAS 90
+ P A S P S
Sbjct: 434 GATKAKKSEPAAASRARPVNS 454
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 32.8 bits (74), Expect = 1.0
Identities = 25/119 (21%), Positives = 38/119 (31%), Gaps = 6/119 (5%)
Query: 16 TSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP 75
TS + + P P S P P S +Q +PHQ
Sbjct: 33 TSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAA 92
Query: 76 PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--PPTSHISKIPSPSSSSPSS 132
+P P +A+ K D + ++P TS SK P P++ + S
Sbjct: 93 SRAPDPAVAPQLAAAPKPD----AAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHS 147
Score = 30.1 bits (67), Expect = 7.1
Identities = 19/121 (15%), Positives = 29/121 (23%), Gaps = 6/121 (4%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
PA + P +S S P P+ Q +
Sbjct: 42 PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVA 101
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
P P P A +A + + P P +H P P + +
Sbjct: 102 PQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAAS------KKPDPAAHTQHSPPPFAYTR 155
Query: 131 S 131
S
Sbjct: 156 S 156
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 32.5 bits (74), Expect = 1.1
Identities = 28/102 (27%), Positives = 39/102 (38%), Gaps = 8/102 (7%)
Query: 35 EPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHD 94
EP SP+ S S S QP +G L S ++ + P +PP S +
Sbjct: 90 EPQSPSKELNS----SCSQKQPPYPYGEKCLYSYSAYDRKPASGFKPPTPPSTPCSPVNP 145
Query: 95 SQSSPHITQHTGPGIFQSPPPTS---HISKIPSPSSSSPSSS 133
++ + Q +GP SPP P SSP SS
Sbjct: 146 QETVRQL-QPSGPLSNSSPPSPHTPLPNQSPLPPPMSSPDSS 186
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 32.4 bits (74), Expect = 1.1
Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 1/64 (1%)
Query: 76 PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP-PTSHISKIPSPSSSSPSSSS 134
P SPP P+ S S SP T G+ +P P + +P P+SS+P +
Sbjct: 247 HPHTTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLLPNPRLPRASRGHLPPPTSSAPPRPN 306
Query: 135 QLMQ 138
++
Sbjct: 307 GGLR 310
>gnl|CDD|237887 PRK15046, PRK15046, 2-aminoethylphosphonate ABC transporter
substrate-binding protein; Provisional.
Length = 349
Score = 32.4 bits (74), Expect = 1.1
Identities = 15/53 (28%), Positives = 19/53 (35%), Gaps = 13/53 (24%)
Query: 222 LAYHEVFGKLQSSVVTHTSSTLGLAST------STAQGD--MN--SAADNGVS 264
Y KLQ++ V + ST L A GD MN A G +
Sbjct: 192 FDY---LAKLQANNVGPSKSTGKLTPLVSKGEIYVANGDLQMNLAQAEHGGPN 241
>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
protein.
Length = 226
Score = 31.9 bits (73), Expect = 1.1
Identities = 27/111 (24%), Positives = 41/111 (36%), Gaps = 16/111 (14%)
Query: 29 SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
+ +R P S T SGGF S+ P + S S S + S S
Sbjct: 101 NGTRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSS-SYSSSSAS 159
Query: 89 A--SD---KHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
+ S +S P + PG Q T+ ++ PS ++SS + S
Sbjct: 160 SRTSSWVRSQNSNLEPFM-----PGALQ----TAWVTP-PSSTASSSGTVS 200
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 30.4 bits (69), Expect = 1.1
Identities = 16/76 (21%), Positives = 18/76 (23%), Gaps = 4/76 (5%)
Query: 12 ASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSP 71
A S E PPP + EP + P G T P
Sbjct: 34 AQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPG----VYPTPP 89
Query: 72 HQMPPPSPIATSPPPP 87
PPPP
Sbjct: 90 PPNSGYMADPQEPPPP 105
Score = 30.1 bits (68), Expect = 1.6
Identities = 13/78 (16%), Positives = 20/78 (25%), Gaps = 13/78 (16%)
Query: 6 ELSQFPASHKTSTSMEVDTPPPPSSSR----GSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
+S+ + + P P PP P G + + P
Sbjct: 39 PVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMAD 98
Query: 62 HHTLTSLTSPHQMPPPSP 79
P + PPP P
Sbjct: 99 ---------PQEPPPPYP 107
Score = 28.5 bits (64), Expect = 6.2
Identities = 9/45 (20%), Positives = 13/45 (28%), Gaps = 6/45 (13%)
Query: 73 QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
P P P P PG++ +PPP +
Sbjct: 54 VHLEPLPAYGQYAAPPPY------GPPPPYYPAPPGVYPTPPPPN 92
>gnl|CDD|238156 cd00253, PL_Passenger_AT, Pertactin-like passenger domains
(virulence factors) of autotransporter proteins of the
type V secretion system. Autotransporters are proteins
used by Gram-negative bacteria to transport proteins
across their outer membranes. The C-terminal (beta)
domain of autotransporters forms a pore in the outer
membrane through which the N-terminal passenger domain
is transported. Following transport, the passenger
domain is generally cleaved by an outer membrane
protease with the passenger domain either remaining in
contact with the surface via a noncovalent interaction
with the beta domain or cleaved to release a soluble
protein. These proteins are highly diverse and perform a
variety of functions that promote virulence, including
catalyzing proteolysis, serving as an adhesin, mediating
actin-promoted motility, or serving as a cytotoxin.
Proteins in this family share similarity in the
C-terminal region of the passenger domain as seen in the
pertactin structure P.69, a Bordetella pertussis
agglutinogen responsible for human pertussis. The P.69
protein consists of a 16-stranded parallel beta-helix
with a V-shaped cross-section, and is one of the largest
beta-helix known to date.
Length = 186
Score = 31.6 bits (72), Expect = 1.1
Identities = 11/58 (18%), Positives = 23/58 (39%), Gaps = 2/58 (3%)
Query: 227 VFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITL 284
G + +T S LG +NS + ++G ++V N + N+ ++
Sbjct: 16 TLGGGGALTLTGNSVWLGTLQAGGGTVSLNSNSLWTLTGDSTV--NGNLTNSGGTVDF 71
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 32.6 bits (74), Expect = 1.1
Identities = 26/159 (16%), Positives = 45/159 (28%), Gaps = 23/159 (14%)
Query: 27 PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPH-QMPPPSPIATSPP 85
P + S ++++ Q+ + S + Q P
Sbjct: 202 PKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKD 261
Query: 86 PPSASDK--HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS-------------- 129
+ + H + H ++H + Q+ + S PSSSS
Sbjct: 262 HHHSHNHQHHSIGINNHHSKHADSKL-QTIEVIENHSNKSRPSSSSTNGSKETTSNSSSA 320
Query: 130 -----PSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPS 163
S SS+ + SN H G+V S S
Sbjct: 321 AAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSS 359
>gnl|CDD|218693 pfam05687, DUF822, Plant protein of unknown function (DUF822).
This family consists of the N terminal regions of
several plant proteins of unknown function.
Length = 151
Score = 30.9 bits (70), Expect = 1.3
Identities = 16/60 (26%), Positives = 24/60 (40%), Gaps = 4/60 (6%)
Query: 32 RGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASD 91
+G +P G S S + S S ++ SP SP ++S P PS+ D
Sbjct: 71 KGCKPLERMEGAGS----SATASPCSSYQLSPVSSAFPSPVPSYSASPASSSFPSPSSLD 126
>gnl|CDD|216908 pfam02159, Oest_recep, Oestrogen receptor.
Length = 139
Score = 31.1 bits (70), Expect = 1.3
Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 13/120 (10%)
Query: 31 SRGSEPPSPNTGGVSGGFISTSPS--QPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
+E +P S G+ P G +L S+ PPSP+ P P
Sbjct: 22 GAPTETSAPVYSSASLGYYPAPEDVHGPPSLGGLQSLGSV-------PPSPLMFLPSSPQ 74
Query: 89 ASD--KHDSQSSPH--ITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGG 144
S Q P+ + + + ++ + S + + + S ++G GG
Sbjct: 75 LSPFLHPHGQQVPYYLENEPSPAAVRETSSTAFYRSSVDNRRQAGRERCSAAEERGSGGM 134
>gnl|CDD|240610 cd12813, LbR-like, Left-handed beta-roll, including virulence
factors and various other proteins. This family
contains a variety of protein domains with a left-handed
beta-roll structure including cell surface adhesion
proteins, bacterial virulence factors, and ice-binding
proteins, and other activities. UspA1 Head And Neck
Domain and YadA of Yersinia are part of a class of
pathogenicity factors that act as cell surface adhesion
molecules, in which N-terminal head and neck domains
extend from the bacterial outer membrane. The UspA1 head
domain of Moraxella catarrhalis, is formed from trimeric
beta-rolls of 14-16 amino acid repeats. The UspA1 head
domain connects to a neck region of large extended,
charged loops that maybe be ligand binding, which is in
turn connected to an extended coiled coil domain that
tethers the head and neck region to the cell surface via
a transmembrane region. The collagen-binding domain
virulence factor YadA an adhesion proteins of several
Yersinia species, and related cell surface proteins. The
collagen-binding portion is found in the hydrophobic
N-terminal region. YadA forms a matrix on the bacterial
outer membrane, which mediates binding to collagen and
epithelial cells. YadA inhibits the
complement-activating pathway with the coating of the
cell surface with factor H, which impedes C3b molecules.
The ice-binding protein of the grass Lolium perenne
(LpIBP) discourages the recrystallization of ice.
Ice-binding proteins produced by organisms to prevent
the growing of ice are termed to anti-freeze proteins.
LpIBP consists of an unusual left-handed beta roll.
Ice-binding is mediated by a flat beta-sheet on one side
of the helix. These domains form a left handed beta roll
made up of a series of short repeated elements. .
Length = 99
Score = 30.2 bits (68), Expect = 1.3
Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 1/59 (1%)
Query: 227 VFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSV-ITNNSTENNTTSITL 284
V G +S S+ +G A+T+T + S + V+ N+V NN+ + + S L
Sbjct: 26 VIGGDNNSASGSNSTAVGGANTATGSNAVASGTNAIVTDDNAVASGNNNLASGSNSTAL 84
>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2.
Length = 466
Score = 31.9 bits (73), Expect = 1.5
Identities = 27/114 (23%), Positives = 41/114 (35%), Gaps = 7/114 (6%)
Query: 17 STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
S+ +E P PP GS P + + + +P+ + H + T +P P
Sbjct: 108 SSIIESGAPIPPIPGDGSGFPITTSSTTTPAILDVTPTTRT---VHVSRTQYNNPLFTDP 164
Query: 77 PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
PP P+ H S I H+ I S + P P SS+P
Sbjct: 165 SV---LQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTT-PPPISSTP 214
>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
Length = 702
Score = 32.3 bits (73), Expect = 1.6
Identities = 34/123 (27%), Positives = 46/123 (37%), Gaps = 35/123 (28%)
Query: 28 PSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPP 87
PS S+G+ P GG S ++S S L S Q PPPS S
Sbjct: 145 PSRSQGTRP----AGGTSAVAATSSSSH------------LVSQ-QTPPPSLRGVSSGYR 187
Query: 88 SASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNG 147
S+ + + + G+F P SP SPS SS+L+ G GS
Sbjct: 188 SSVEYNACRMK---------GVFV---PA------SSPYDVSPSQSSELVVVMGDRGSQN 229
Query: 148 QTH 150
+ H
Sbjct: 230 ENH 232
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 28.0 bits (62), Expect = 1.6
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 410 YKCQYCGKNFSRKEHMVNHVRKH 432
+KC CGK+FS K+ + H+RKH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 27.6 bits (61), Expect = 1.8
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 606 YQCQYCPKAFSRKDHLVNHVRQH 628
++C C K+FS KD L H+R+H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 27.6 bits (61), Expect = 1.9
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 522 HRCDFCSKTFTRKEHLLNHVRQH 544
+C C K+F+ K+ L H+R+H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 27.6 bits (61), Expect = 2.1
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 662 HKCHFCSKSFTRKEHLNNHIRLH 684
KC C KSF+ K+ L H+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 26.4 bits (58), Expect = 5.2
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 466 HVCQVCGKKYTRKEHLANHMRSH 488
C +CGK ++ K+ L H+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 26.1 bits (57), Expect = 6.1
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 634 HKCSYCKQSFTRKDHLNNHVRLH 656
KC C +SF+ KD L H+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 26.1 bits (57), Expect = 7.0
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 780 FKCDVCPKDFTCKGHLVSHLRSH 802
FKC +C K F+ K L HLR H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 26.1 bits (57), Expect = 7.1
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 550 HKCNYCAKSFTRKEHLVNHVRQH 572
KC C KSF+ K+ L H+R+H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 26.1 bits (57), Expect = 7.1
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 578 HKCNYCAKSFTRKEHLVNHVRQH 600
KC C KSF+ K+ L H+R+H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 25.7 bits (56), Expect = 9.7
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 494 FKCEICTKSFTRKEHFTNHIMWH 516
FKC +C KSF+ K+ H+ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 25.7 bits (56), Expect = 10.0
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 690 HKCDFCLKTFTRKEHLTNHLKQH 712
KC C K+F+ K+ L HL++H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 30.6 bits (69), Expect = 1.7
Identities = 20/76 (26%), Positives = 25/76 (32%), Gaps = 20/76 (26%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
TPP P EP +P T + T P +P PP+P A
Sbjct: 83 TPPAPP-----EPVTPPTAQSPAPAVPTPPPTSTP---------------AVPPAPAAAV 122
Query: 84 PPPPSASDKHDSQSSP 99
P P +A D P
Sbjct: 123 PAPAAAPPPSDPPQPP 138
>gnl|CDD|188717 cd09331, LIM1_PINCH, The first LIM domain of protein PINCH. The
first LIM domain of paxillin: Paxillin is an adaptor
protein, which recruits key components of the
signal-transduction machinery to specific sub-cellular
locations to respond to environmental changes rapidly.
The C-terminal region of paxillin contains four LIM
domains which target paxillin to focal adhesions,
presumably through a direct association with the
cytoplasmic tail of beta-integrin. The N-terminal of
paxillin is leucine-rich LD-motifs. Paxillin is found at
the interface between the plasma membrane and the actin
cytoskeleton. The binding partners of paxillin are
diverse and include protein tyrosine kinases, such as
Src and FAK, structural proteins, such as vinculin and
actopaxin, and regulators of actin organization.
Paxillin recruits these proteins to their function sites
to control the dynamic changes in cell adhesion,
cytoskeletal reorganization and gene expression. LIM
domains are 50-60 amino acids in size and share two
characteristic zinc finger motifs. The two zinc fingers
contain eight conserved residues, mostly cysteines and
histidines, which coordinately bond to two zinc atoms.
LIM domains function as adaptors or scaffolds to support
the assembly of multimeric protein complexes.
Length = 59
Score = 28.8 bits (65), Expect = 1.7
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 720 CNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRK 770
C C + F E +VN + E+ F C C + FP + LF++ RK
Sbjct: 1 CERCREGFEPDEKIVNSNGELYHEQCFVCAQCFQPFP---DGLFYEFEGRK 48
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 31.4 bits (71), Expect = 1.7
Identities = 15/95 (15%), Positives = 26/95 (27%), Gaps = 14/95 (14%)
Query: 24 TPPPPSSSR-GSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
T PP ++ + V ++ P + +P P P P+
Sbjct: 68 TQPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAGV---------PVAAQTPKPVKP 118
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
PP+ + P P + PT
Sbjct: 119 PKQPPAGAVPAKPTPKPEPK----PVAEPAAAPTG 149
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 31.4 bits (70), Expect = 1.8
Identities = 22/119 (18%), Positives = 39/119 (32%), Gaps = 3/119 (2%)
Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
P + + P+P + P P S Q PP P+ T
Sbjct: 97 ACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTP 156
Query: 84 PPPPSA--SDKHDSQSSPHITQHTGPGIFQSPPPTSHI-SKIPSPSSSSPSSSSQLMQQ 139
P P+A H+ P + P I +P + + +IP + + L+++
Sbjct: 157 KPAPAAKPIFLHNQLPPPDYPAASCPTIETAPAASPVLEPRIPDKIIDADNDDKDLIKK 215
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 32.0 bits (72), Expect = 1.8
Identities = 28/141 (19%), Positives = 41/141 (29%), Gaps = 4/141 (2%)
Query: 1 MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEP-PSPNTGGVSGGFISTSPSQPSPQ 59
P+Q + Q A + + P P + +P ++ P PQ
Sbjct: 750 EPVQ-QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ 808
Query: 60 HGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--PPTS 117
P Q P P P P A D+ P + ++ P P S
Sbjct: 809 QPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPS 868
Query: 118 HISKIPSPSSSSPSSSSQLMQ 138
P PS P + L Q
Sbjct: 869 LDLLTPPPSEVEPVDTFALEQ 889
Score = 30.8 bits (69), Expect = 4.7
Identities = 24/114 (21%), Positives = 29/114 (25%), Gaps = 7/114 (6%)
Query: 94 DSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHI 153
D P T P P +P Q QQ +
Sbjct: 738 DGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQ-----PVAPQPQYQ 792
Query: 154 QNIGAVLSPSYGQHCIQ--QDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLI 205
Q V Q Q Q+ Q P Q QYQQ + Q Q L+
Sbjct: 793 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLL 846
>gnl|CDD|221395 pfam12048, DUF3530, Protein of unknown function (DUF3530). This
family of proteins is functionally uncharacterized. This
protein is found in bacteria. Proteins in this family
are typically between 272 to 336 amino acids in length.
These proteins are distantly related to alpa/beta
hydrolases so they may act as enzymes.
Length = 294
Score = 31.6 bits (72), Expect = 1.9
Identities = 12/43 (27%), Positives = 13/43 (30%), Gaps = 1/43 (2%)
Query: 58 PQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPH 100
P G TL SLT P P + SSP
Sbjct: 107 PDAGWATL-SLTPPDPPSAAPPNRPTEAEEVTKAGAKQLSSPS 148
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 31.8 bits (72), Expect = 2.0
Identities = 22/73 (30%), Positives = 29/73 (39%), Gaps = 9/73 (12%)
Query: 24 TPPPPSSSRGS-------EPPSPNTGGVSGGFISTSP--SQPSPQHGHHTLTSLTSPHQM 74
PPPPSS G P + +T G +S +P PSP S+ +P
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIP 246
Query: 75 PPPSPIATSPPPP 87
P +A P PP
Sbjct: 247 PVIQYVAPPPVPP 259
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 31.6 bits (71), Expect = 2.0
Identities = 28/111 (25%), Positives = 40/111 (36%), Gaps = 17/111 (15%)
Query: 29 SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH-----HTLTSLTSPHQMPPPSPIATS 83
S+ GS S T +PS SP GH T + + SP PP S + +
Sbjct: 571 STPTGSAASSNTTFSSDSPSTVVAPST-SPPAGHLGSPPATPSKIVSPSTSPPASHLGSP 629
Query: 84 PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
PS+ + +S ++ P S I + SS S S S
Sbjct: 630 STTPSSPESSIKVAST-----------ETASPESSIKVASTESSVSMVSMS 669
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 31.5 bits (72), Expect = 2.1
Identities = 20/110 (18%), Positives = 25/110 (22%), Gaps = 13/110 (11%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQP-SPQHGHHTLTSLT 69
H+ + PP +S P P PQH
Sbjct: 78 AQEHEAARPSPQHQYQPPYASAQPRQPVQQ-----------PPEAQVPPQHAPRPAQPAP 126
Query: 70 SPHQMPPPSPIATSPPPPSASDKHDSQSSP-HITQHTGPGIFQSPPPTSH 118
P Q P P P S + P H FQ P +
Sbjct: 127 QPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAA 176
>gnl|CDD|236170 PRK08166, PRK08166, NADH dehydrogenase subunit G; Validated.
Length = 791
Score = 31.9 bits (73), Expect = 2.2
Identities = 18/56 (32%), Positives = 23/56 (41%), Gaps = 16/56 (28%)
Query: 769 RKQNGEVLERPFK---CDVCPKD-FTCKGHLVSHLRSHLNER------PHVCPHCS 814
R ++G LE F +VCP FT K H N + P +C HCS
Sbjct: 183 RPEDG-TLESEFSGNLVEVCPTGVFTDKTH-----SERYNRKWDMQFAPSICQHCS 232
>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827). This
family contains the human KIAA1549 protein which has
been found to be fused fused to BRAF gene in many cases
of pilocytic astrocytomas. The fusion is due mainly to a
tandem duplication of 2 Mb at 7q34. Although nothing is
known about the function of KIAA1549 protein, the BRAF
protein is a well characterized oncoprotein. It is a
serine/threonine protein kinase which is implicated in
MAP/ERK signalling, a critical pathway for the
regulation of cell division, differentiation and
secretion.
Length = 684
Score = 31.4 bits (71), Expect = 2.2
Identities = 27/116 (23%), Positives = 38/116 (32%), Gaps = 30/116 (25%)
Query: 23 DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
P S GS S + G ST+PS + Q
Sbjct: 383 RKSPSDGDSEGSSVISNRSSREKSGRPSTTPSVTAQQKPTK--------------EEGRK 428
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQ 138
P PPS +D+ S +S IF+ H+ ++ PSS SS +Q
Sbjct: 429 KPAPPSGTDEQLSSAS----------IFE------HVDRLSRPSSDPYDRSSGKIQ 468
>gnl|CDD|150310 pfam09602, PhaP_Bmeg, Polyhydroxyalkanoic acid inclusion protein
(PhaP_Bmeg). This entry describes a protein found in
polyhydroxyalkanoic acid (PHA) gene regions and
incorporated into PHA inclusions in Bacillus cereus and
Bacillus megaterium. The role of the protein may include
amino acid storage.
Length = 168
Score = 30.5 bits (68), Expect = 2.3
Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 4/86 (4%)
Query: 170 QQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIK--TNGTGDVNPDEFNQLL-AYHE 226
QQ D H ++ + + +Q ++ Q D ++ G+ D+ + HE
Sbjct: 41 QQQDALHKLTKAVDELEAEDKQF-KAEFEDLQQDTVENLRKTAGNALADQIEEWQEKTHE 99
Query: 227 VFGKLQSSVVTHTSSTLGLASTSTAQ 252
LQ + S+L + + Q
Sbjct: 100 ALAHLQELFFNPSKSSLSILKQAQEQ 125
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 30.5 bits (69), Expect = 2.4
Identities = 18/90 (20%), Positives = 29/90 (32%), Gaps = 19/90 (21%)
Query: 1 MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQH 60
M I L+ + K P SSS+G+ P+ G + +++P P
Sbjct: 122 MRIYKALNTYELVVKAPAE---YYSPKHSSSQGTSTTRPSDGSATPN--TSAPPTPGNPA 176
Query: 61 GHHTLTSLTSPHQMPPPSPIATSPPPPSAS 90
P P A +P +S
Sbjct: 177 AQ-------------PEKP-AETPKGNGSS 192
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 31.2 bits (70), Expect = 2.4
Identities = 19/81 (23%), Positives = 24/81 (29%), Gaps = 1/81 (1%)
Query: 36 PPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDS 95
PP+P + G T S +H H + Q P P +P
Sbjct: 173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQ 232
Query: 96 QSSPHITQHTGPGIFQSPPPT 116
P Q T Q PP T
Sbjct: 233 PVQPA-QQPTPQNPAQQPPQT 252
Score = 30.0 bits (67), Expect = 5.8
Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 8/58 (13%)
Query: 145 SNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQS 202
S+G H H +P Q Q+ Q TVQ+P Q QQ Q Q +QQ +
Sbjct: 192 SHGHRHRH--------APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPT 241
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 31.5 bits (71), Expect = 2.5
Identities = 17/100 (17%), Positives = 34/100 (34%), Gaps = 6/100 (6%)
Query: 7 LSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
+ +FP + S ++ + +T G I T +Q S Q +L
Sbjct: 555 VVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQASAQPAGLSLP 614
Query: 67 ------SLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPH 100
+ ++P PPP P P +++ ++
Sbjct: 615 APAVPVNASTPASTPPPLAPQEPPQPGTSAPSLETSLPQQ 654
Score = 31.5 bits (71), Expect = 2.5
Identities = 13/90 (14%), Positives = 22/90 (24%), Gaps = 6/90 (6%)
Query: 52 SPSQPSPQHGHHTLTSLTSPHQMP-----PPSPIATSPPPPSASDKHDSQSSPHITQHTG 106
+ + H P + P+ + PP D
Sbjct: 365 KVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSP-YNPQS 423
Query: 107 PGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
PG P P + P+ P S + +
Sbjct: 424 PGTSYGPEPVGPVPPQPTNPYVMPISMANM 453
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 31.6 bits (71), Expect = 2.5
Identities = 35/163 (21%), Positives = 61/163 (37%), Gaps = 33/163 (20%)
Query: 37 PSPNTGGVSGGFISTSPSQP------SP--QHGHHTLT---SLTSPHQM----PPPSPIA 81
T + +++++P P +P + + L+ S+T+ + P+P
Sbjct: 1127 FGAGTVASAFLYMNSTPQSPVLALLLAPYISYKFNALSLGFSITADAAIFSLFGIPAPQL 1186
Query: 82 TSPPPPSASDKHDSQS----SPHITQHTGPGIFQ----------SPPPTSHISKIPSPSS 127
S P+ S + P I +G F + P + + P PSS
Sbjct: 1187 LSSYIPTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATTPTPPPSS 1246
Query: 128 SSPSSSSQLMQQGGGGGS--NGQTHLHIQNI--GAVLSPSYGQ 166
SS SS+ + G NG T +HI + A+ +P GQ
Sbjct: 1247 SSSSSAQSISTSPGQIQIVLNGSTTIHINFLFFPALSTPKIGQ 1289
Score = 30.5 bits (68), Expect = 4.9
Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 9/74 (12%)
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQ---SPPPTSHISKI 122
L PP+ +PPP S+S S S+ I+ T PG Q + T HI+ +
Sbjct: 1224 AQLQLSAASSPPAATTPTPPPSSSS----SSSAQSIS--TSPGQIQIVLNGSTTIHINFL 1277
Query: 123 PSPSSSSPSSSSQL 136
P+ S+P L
Sbjct: 1278 FFPALSTPKIGQIL 1291
>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate oxidase.
Length = 544
Score = 31.4 bits (71), Expect = 2.5
Identities = 20/79 (25%), Positives = 28/79 (35%), Gaps = 2/79 (2%)
Query: 73 QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
Q P PI SPPPP +S S S G P + +K P +
Sbjct: 15 QSLLPLPI--SPPPPHSSSLSSSPSPTQRFLTPSQGSRLPPRRRALCTKSQDPRWRRAMA 72
Query: 133 SSQLMQQGGGGGSNGQTHL 151
S ++ GS ++L
Sbjct: 73 SLAVIPNMQDSGSPPLSYL 91
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 31.5 bits (72), Expect = 2.5
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 5/43 (11%)
Query: 163 SYGQHCIQQDDTQHSTVQSPPGQ----QQQYQQMDTSQSSQQQ 201
+G + Q + S +QS P Q +Q YQQ +Q +Q
Sbjct: 915 QHGNA-LAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQ 956
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 30.9 bits (70), Expect = 2.6
Identities = 19/109 (17%), Positives = 33/109 (30%), Gaps = 4/109 (3%)
Query: 93 HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLH 152
+ S S I+ + I S ++ P +PSSS G +
Sbjct: 1 NISNSLSSISSGSSAPIGTSALRGTNTPAAAKP---APSSSEASNAGSGSSEQKAKLKGQ 57
Query: 153 IQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQ 201
S + + +D + + P + Q SQ Q+Q
Sbjct: 58 ASTAAGSASQELQKQASESND-EEVVGEEEPEKDQAEAGKQLSQKQQEQ 105
>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
Members of this family include high molecular weight
subunits of glutenin. This group of gluten proteins is
thought to be largely responsible for the elastic
properties of gluten, and hence, doughs. Indeed,
glutenin high molecular weight subunits are classified
as elastomeric proteins, because the glutenin network
can withstand significant deformations without breaking,
and return to the original conformation when the stress
is removed. Elastomeric proteins differ considerably in
amino acid sequence, but they are all polymers whose
subunits consist of elastomeric domains, composed of
repeated motifs, and non-elastic domains that mediate
cross-linking between the subunits. The elastomeric
domain motifs are all rich in glycine residues in
addition to other hydrophobic residues. High molecular
weight glutenin subunits have an extensive central
elastomeric domain, flanked by two terminal non-elastic
domains that form disulphide cross-links. The central
elastomeric domain is characterized by the following
three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
possesses overlapping beta-turns within and between the
repeated motifs, and assumes a regular helical secondary
structure with a diameter of approx. 1.9 nm and a pitch
of approx. 1.5 nm.
Length = 779
Score = 31.6 bits (70), Expect = 2.6
Identities = 39/177 (22%), Positives = 48/177 (27%), Gaps = 11/177 (6%)
Query: 27 PPSSSRGSEPPSPNTGGV-SGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI---AT 82
+G +P G G+ TS QP H SL P Q P + A
Sbjct: 441 LQQPGQGQQPGQWQQPGQGQPGYYPTSLLQPGQGQPGHDPASLQQPGQGQQPGQLQQPAQ 500
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
P + Q + Q P Q + P + + QQ G
Sbjct: 501 GQPGQQLAQGQQGQQPAQVQQGQQPAQGQQGQQLGQGQQGQQPGQGQHPAQGEQGQQPGQ 560
Query: 143 GGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQ 199
G Q G P GQ Q S PGQ QQ Q Q
Sbjct: 561 GQQGQQP-------GQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQWQQPGQGQ 610
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 31.0 bits (70), Expect = 2.6
Identities = 21/111 (18%), Positives = 32/111 (28%), Gaps = 29/111 (26%)
Query: 50 STSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGI 109
+P +P TS +P PP SP + P+++ + S S
Sbjct: 83 KVAPPAATP-------TSAPTPTPSPPASPASGMSAAPASAVEEKSPS------------ 123
Query: 110 FQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG---GGSNGQTHLHIQNIG 157
+ +P S S S S G T I +G
Sbjct: 124 -EESATA------TAPESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMG 167
Score = 30.2 bits (68), Expect = 5.5
Identities = 16/79 (20%), Positives = 25/79 (31%), Gaps = 14/79 (17%)
Query: 13 SHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPH 72
S + + +V P +S P+P + P+ P+ S
Sbjct: 75 SKPKTGTGKVAPPAATPTS----APTPTP---------SPPASPASGMSAAP-ASAVEEK 120
Query: 73 QMPPPSPIATSPPPPSASD 91
S AT+P PS S
Sbjct: 121 SPSEESATATAPESPSTSV 139
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 31.4 bits (72), Expect = 2.8
Identities = 8/77 (10%), Positives = 22/77 (28%), Gaps = 10/77 (12%)
Query: 23 DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
T P +++ + + +P+ P + PP P A
Sbjct: 40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPA----------AAAPAAPPKPAAA 89
Query: 83 SPPPPSASDKHDSQSSP 99
+ + + + ++
Sbjct: 90 AAAAAAPAAPPAAAAAA 106
>gnl|CDD|219902 pfam08558, TRF, Telomere repeat binding factor (TRF). Telomere
repeat binding factor (TRF) family proteins are
important for the regulation of telomere stability. The
two related human TRF proteins hTRF1 and hTRF2 form
homodimers and bind directly to telomeric TTAGGG repeats
via the myb DNA binding domain pfam00249 at the carboxy
terminus. TRF1 is implicated in telomere length
regulation and TRF2 in telomere protection. Other
telomere complex associated proteins are recruited
through their interaction with either TRF1 or TRF2. The
fission yeast protein Taz1p (telomere-associated in
Schizosaccharomyces pombe) has similarity to both hTRF1
and hTRF2 and may perform the dual functions of TRF1 and
TRF2 at fission yeast telomeres. This domain is composed
of multiple alpha helices arranged in a solenoid
conformation similar to TPR repeats. The fungal members
have now also been found to carry two double strand
telomeric repeat binding factors.
Length = 239
Score = 30.8 bits (70), Expect = 2.9
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 519 ETPHRCDFCSKTFTRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHVRQHTG 574
TP DF + R+E LLN +R+ P NY +SF ++ L++++ ++ G
Sbjct: 172 LTPSETDFVERCKKRREKLLNIIREKDDAHPLLQNYSWESFLKE--LLSYLEKNLG 225
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 30.6 bits (69), Expect = 2.9
Identities = 21/85 (24%), Positives = 25/85 (29%), Gaps = 10/85 (11%)
Query: 53 PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS 112
S +P HG H L +L P + S P SA PH QS
Sbjct: 26 NSLAAPNHGPHDLRALMGTLLHYPGGGMDASGRPRSAMSGPMGGGHPH----------QS 75
Query: 113 PPPTSHISKIPSPSSSSPSSSSQLM 137
P + P QLM
Sbjct: 76 MPAYMMFNPSSKPQPFMLVPGPQLM 100
>gnl|CDD|185230 PRK15330, PRK15330, cell invasion protein SipD; Provisional.
Length = 343
Score = 30.9 bits (69), Expect = 3.0
Identities = 17/61 (27%), Positives = 25/61 (40%)
Query: 75 PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
P P +A P PSAS+ + P T+H G I + I + S+P S
Sbjct: 11 PHPGIVAERPQTPSASEHVEIAVVPSTTEHRGTDIISLSQAATKIQQAQQTLQSTPPISE 70
Query: 135 Q 135
+
Sbjct: 71 E 71
>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 402
Score = 30.9 bits (70), Expect = 3.0
Identities = 10/67 (14%), Positives = 20/67 (29%), Gaps = 7/67 (10%)
Query: 802 HLNERPHVCPHCSKSFVEHTCMIKHIRKNHTLANVNVSPMNSVGTQHNMLPHTMSSIPLS 861
L +R + C C K F E + + T + + +
Sbjct: 72 FLRKRRYKCCRCGKRFAEKVDFVDRA-QRITRRWAQAAA------IALIKGKAFTDTAEL 124
Query: 862 HSMSTIT 868
+ +S +T
Sbjct: 125 NLISRLT 131
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 30.7 bits (69), Expect = 3.0
Identities = 16/100 (16%), Positives = 38/100 (38%), Gaps = 2/100 (2%)
Query: 37 PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQ 96
P+P+ V+ + +P++P H + + Q P PS +A P + +
Sbjct: 79 PTPHVPAVALPGDTDAPAEPGA--APHVVAERAAAMQAPLPSALAADDPQAPPAGATAAD 136
Query: 97 SSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
+ T P + PP + + + ++ + + +
Sbjct: 137 AGDAAPDATPPAAGDASPPAAAQAAASAAAALTDLDAQHI 176
Score = 29.5 bits (66), Expect = 7.6
Identities = 17/111 (15%), Positives = 31/111 (27%), Gaps = 10/111 (9%)
Query: 25 PPPPSSSRGSEPPSP--------NTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
P P SS V G + S + + +P
Sbjct: 21 PVPARSSSADAAGPHDDAAEPVLTDQIVPGAEQAASAAPVHAAREATADPEFVAVEPVPT 80
Query: 77 PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
P A + P + + ++PH+ Q+P P++ + P
Sbjct: 81 PHVPAVALPGDTDAPAEPG-AAPHVVAERAAA-MQAPLPSALAADDPQAPP 129
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 31.1 bits (70), Expect = 3.0
Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 6/81 (7%)
Query: 53 PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS 112
P+ P P T + SP P PS TSPP + + ++P +
Sbjct: 205 PATPRP-----TPRTTASPETTPTPST-TTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA 258
Query: 113 PPPTSHISKIPSPSSSSPSSS 133
PP P+ ++ +P +S
Sbjct: 259 PPTPGGGEAPPANATPAPEAS 279
Score = 29.5 bits (66), Expect = 9.0
Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 13/68 (19%)
Query: 23 DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
+T P PS++ PPS S + +PQ G T+ + PP+P
Sbjct: 219 ETTPTPSTTT--SPPSTTIPAPS-------TTIAAPQAG----TTPEAEGTPAPPTPGGG 265
Query: 83 SPPPPSAS 90
PP +A+
Sbjct: 266 EAPPANAT 273
>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
protein.
Length = 342
Score = 30.9 bits (70), Expect = 3.1
Identities = 12/80 (15%), Positives = 21/80 (26%)
Query: 15 KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQM 74
T P S PP ++ +G ++ Q + S +
Sbjct: 82 STEDKALQFGPSFQRESSEDSPPPSSSTYSAGTRTVSTQGQSAYLSDPKRRPSSEASESE 141
Query: 75 PPPSPIATSPPPPSASDKHD 94
+ PPP + D
Sbjct: 142 TVAFDESDLPPPDPWRCRRD 161
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 30.9 bits (69), Expect = 3.1
Identities = 24/97 (24%), Positives = 43/97 (44%), Gaps = 17/97 (17%)
Query: 12 ASHKTSTSMEVDTPPP---------PSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH 62
A+ KT + + PPP P+ + + PS + G G F++ P+ PS
Sbjct: 46 ATAKTRVARQ--NPPPNPLGQLMNWPALPQDFQLPSMDLGPQVGSFLAQLPAMPSMP--- 100
Query: 63 HTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSP 99
L +P +P P+P + PP + + D+ ++P
Sbjct: 101 -GLLGAAAP--VPAPAPAPAAAPPAAPAPAADTPAAP 134
>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
Length = 796
Score = 31.0 bits (70), Expect = 3.1
Identities = 18/105 (17%), Positives = 28/105 (26%), Gaps = 10/105 (9%)
Query: 46 GGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHT 105
G ST S + + S AT P S+ K + +S + +
Sbjct: 1 GRSSSTMVSAAASTSLS-SSRPQLSSFS-SRSPQSATRSPRASSI-KCSASASASSSATS 57
Query: 106 GPGIFQSPPPTSHISK---IPSPSSSSPSSSSQLMQQGGGGGSNG 147
+ + + S GGGGG G
Sbjct: 58 SS----ASLVANGAVALLSASAISGGGGGGGGGFGGFGGGGGGGG 98
>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
PatA/PatG family. This model describes a protease
domain associated with the maturation of various members
of the cyanobactin family of ribosomally produced,
heavily modified bioactive metabolites. Members include
the PatA protein and C-terminal domain of the PatG
protein of Prochloron didemni, TenA and a region of TenG
from Nostoc spongiaeforme var. tenue, etc.
Length = 602
Score = 31.2 bits (71), Expect = 3.2
Identities = 13/94 (13%), Positives = 27/94 (28%), Gaps = 3/94 (3%)
Query: 47 GFISTSPSQPSPQHGHHTLTSLTS-PHQMPP-PSPIATSPPPPSASDKHDS-QSSPHITQ 103
G +++ G ++ S +PP + P A+ + ++ I
Sbjct: 234 GLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAEPAPVSIPVAAPGEGATPAAAQIEL 293
Query: 104 HTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLM 137
G P T + S S+ +
Sbjct: 294 SAGVLPNAISPATPVRPASNGVTPSQAPSAEPQL 327
>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
Provisional.
Length = 450
Score = 30.9 bits (70), Expect = 3.2
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 692 CDFCLKTFTRKEHLTNHLK 710
C+FCLK RKE L H+K
Sbjct: 201 CEFCLKFMKRKEQLQRHMK 219
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 31.0 bits (70), Expect = 3.2
Identities = 15/81 (18%), Positives = 24/81 (29%), Gaps = 4/81 (4%)
Query: 54 SQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP 113
S +P S M +P A D Q+ +T + +
Sbjct: 276 SWQTPAQTPARRIST----PMTEEIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELLRER 331
Query: 114 PPTSHISKIPSPSSSSPSSSS 134
P+ + P S S +S S
Sbjct: 332 FPSDLFAISSLPDSDSEASDS 352
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 31.1 bits (70), Expect = 3.2
Identities = 17/117 (14%), Positives = 29/117 (24%), Gaps = 16/117 (13%)
Query: 2 PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
P++ PA+ + + V T P + P F++T+P +P
Sbjct: 129 PVRAASIPSPAAQALAHAAAVRTAPRQEHA---LSAVPEQLFAD--FLTTAPVPRAPVQA 183
Query: 62 HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQ---SSPHITQHTGPGIFQSPPP 115
P P P + A+ D P
Sbjct: 184 P--------VVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDLDDALPQILPPAA 232
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.9 bits (70), Expect = 3.5
Identities = 23/117 (19%), Positives = 35/117 (29%), Gaps = 26/117 (22%)
Query: 8 SQFPASH-KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
+ P H K + P P +++ S PS ++ + PS P
Sbjct: 372 GRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSS-------AAAQPSAPQSATQPAG-- 422
Query: 67 SLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS--PPPTSHISK 121
+P + P A PPS Q P F+ P S +S
Sbjct: 423 ---TPPTVSVDPPAAVPVNPPST-----------APQAVRPAQFKEEKKIPVSKVSS 465
>gnl|CDD|147099 pfam04771, CAV_VP3, Chicken anaemia virus VP-3 protein. This
protein is found in the nucleus of infected cells and
may act as a transcriptional regulator. It induces
apoptosis, and is also known as apoptin.
Length = 121
Score = 29.4 bits (65), Expect = 3.5
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 15 KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQP 56
K + D P PPS R +P + I+T+PS+P
Sbjct: 71 KNVPDLRTDQPKPPSKKRSCDPSEYRVSELKESLITTTPSRP 112
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 31.2 bits (70), Expect = 3.5
Identities = 18/72 (25%), Positives = 23/72 (31%), Gaps = 11/72 (15%)
Query: 53 PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTG------ 106
PS S G H P + P P P+ + P PS P + H G
Sbjct: 621 PSSLSASMGFHP-----PPFRHPFPLPLTVAIPNPSLHQSEVFMGYPFQSPHLGAPSGSP 675
Query: 107 PGIFQSPPPTSH 118
PG + P
Sbjct: 676 PGKDRDSPDLPR 687
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 30.1 bits (68), Expect = 3.5
Identities = 17/80 (21%), Positives = 29/80 (36%), Gaps = 2/80 (2%)
Query: 30 SSRGSEP-PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
S G+ P ++ V+ + S + T T+ TS P S T+ PS
Sbjct: 56 SGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPS 115
Query: 89 ASDKHD-SQSSPHITQHTGP 107
+D + + + T G
Sbjct: 116 ETDTEEATTTVSTETPTEGG 135
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 30.5 bits (69), Expect = 3.6
Identities = 31/143 (21%), Positives = 37/143 (25%), Gaps = 42/143 (29%)
Query: 19 SMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
+ R GG GG S P P PPP
Sbjct: 175 KFFLSCGGVRGGPRPERAGYGGGGGGGGGGGGGGGSGPGP----------------PPPG 218
Query: 79 PIATSPPPPSASDKHDSQSSPHIT----QHTGPGIF-------------------QSPPP 115
++ PPP S T GPG + S
Sbjct: 219 FKSSFPPPYGPGAGPSSGYGSGGTRSGQGGWGPGFWTGLGAGGALGYLFGSRRNNNSSYG 278
Query: 116 TSHISKIPS---PSSSSPSSSSQ 135
S+ S PS SSS+ SSSS
Sbjct: 279 RSYGSGSPSYSPSSSSNSSSSSS 301
Score = 30.1 bits (68), Expect = 5.5
Identities = 16/69 (23%), Positives = 19/69 (27%), Gaps = 7/69 (10%)
Query: 83 SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIP-------SPSSSSPSSSSQ 135
P + G G PPP S P PSS S ++
Sbjct: 184 RGGPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGPSSGYGSGGTR 243
Query: 136 LMQQGGGGG 144
Q G G G
Sbjct: 244 SGQGGWGPG 252
>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
C2H2 Zn-finger [General function prediction only].
Length = 65
Score = 28.0 bits (62), Expect = 3.8
Identities = 14/45 (31%), Positives = 20/45 (44%)
Query: 406 GETPYKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRK 450
GE +C CG F R++ + HV K G R + K +K
Sbjct: 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHGWLFGRGKPKGKRLKKK 58
Score = 27.7 bits (61), Expect = 5.6
Identities = 12/29 (41%), Positives = 15/29 (51%)
Query: 429 VRKHTGETPHRCEICKKAFTRKEHYVNHV 457
VR GE RC C F R++ Y+ HV
Sbjct: 9 VRDRDGEEFLRCPRCGMVFRRRKDYIRHV 37
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 29.8 bits (67), Expect = 3.8
Identities = 15/61 (24%), Positives = 19/61 (31%), Gaps = 1/61 (1%)
Query: 55 QPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP 114
+ TS T P P P +T+P P + P P Q PP
Sbjct: 107 CRPQRPVMTRATSTTVQTT-PLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPP 165
Query: 115 P 115
P
Sbjct: 166 P 166
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 30.2 bits (68), Expect = 3.9
Identities = 16/62 (25%), Positives = 22/62 (35%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
P + TS + E T PP +++ P T + SP Q P G S
Sbjct: 115 PEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPD 174
Query: 71 PH 72
H
Sbjct: 175 AH 176
Score = 30.2 bits (68), Expect = 4.0
Identities = 18/93 (19%), Positives = 30/93 (32%)
Query: 50 STSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGI 109
S + SQ SP + Q PP +TS +A+D + ++
Sbjct: 88 SDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQ 147
Query: 110 FQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
+P + P S P+ S+ G
Sbjct: 148 PATPDERRSPRQRPPVSGEPPTPSTPDAHVAGT 180
>gnl|CDD|219744 pfam08204, V-set_CD47, CD47 immunoglobulin-like domain. This
family represents the CD47 leukocyte antigen V-set like
Ig domain.
Length = 93
Score = 28.6 bits (64), Expect = 4.0
Identities = 11/40 (27%), Positives = 18/40 (45%), Gaps = 4/40 (10%)
Query: 267 NSVITNNSTENNTTSITLDIKADLDLPVMGNHTCDLCGKF 306
N I + NN +++ D L GN+TC++ F
Sbjct: 43 NKDIFIFNGTNNISTLKFDFSDALS----GNYTCEVTELF 78
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 30.4 bits (69), Expect = 4.0
Identities = 12/84 (14%), Positives = 28/84 (33%), Gaps = 3/84 (3%)
Query: 8 SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTS 67
++ + S ++ T P+++ P T S + P+ +
Sbjct: 152 AELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAV---DPQQNA 208
Query: 68 LTSPHQMPPPSPIATSPPPPSASD 91
+ +P Q + +P P+ D
Sbjct: 209 VVAPSQANVDTAATPAPAAPATPD 232
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 30.4 bits (68), Expect = 4.1
Identities = 14/136 (10%), Positives = 38/136 (27%), Gaps = 13/136 (9%)
Query: 22 VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
+S+ ++ + + +TS + +P + + P+
Sbjct: 120 NQLGNAGASASITKTSNGSDAA------TTSSTANTPAAA-----KVLKANAASAPNTTT 168
Query: 82 TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS--KIPSPSSSSPSSSSQLMQQ 139
+ + + ++ T P + T + K + + + +S+
Sbjct: 169 GVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMG 228
Query: 140 GGGGGSNGQTHLHIQN 155
G N I N
Sbjct: 229 SLGVPQNTSMLAQIPN 244
Score = 30.4 bits (68), Expect = 5.2
Identities = 33/258 (12%), Positives = 71/258 (27%), Gaps = 55/258 (21%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
+ S S+ + +++ S +P V + + S P+ G + + +
Sbjct: 122 LGNAGASASITKTSNGSDAATTSSTANTPAAAKVLK---ANAASAPNTTTGVGSAATTAA 178
Query: 71 PHQMPPPSPIATSPPP------------------PSASDKHDSQS----SPHITQHTGPG 108
+P T P SA + + +
Sbjct: 179 ISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSM 238
Query: 109 IFQSPPPTSHISKIP--SPSSS--------SPSSSSQLMQQ-------------GGGGGS 145
+ Q P PT + + SP+++ SP + M G
Sbjct: 239 LAQIPNPTPLMQLLNGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQ 298
Query: 146 NGQTHLHIQNIGAVLSPSYGQHCIQQDDTQH--STVQSPPGQQQQYQQMDTSQSSQQQSD 203
+ + ++ +PS + + + + Q Q Q QQ+
Sbjct: 299 SRENSMNQGM-----TPSASMINLNNITPANILNMSLNLAFDLNQNQTPQQLQPQNQQNM 353
Query: 204 LIKTNGTGDVNPDEFNQL 221
+ N + + + N L
Sbjct: 354 NMGMNDVNNFDLLDLNNL 371
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 30.8 bits (69), Expect = 4.2
Identities = 39/195 (20%), Positives = 61/195 (31%), Gaps = 20/195 (10%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
P R + P P G S SQ +P MP P P+ P
Sbjct: 485 PGVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGV--------AFAPV-MPQPMPVEPVP 535
Query: 85 PPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGG 144
P A ++ + P I GPG + + + P S SQ+ +
Sbjct: 536 VPTVALERPVCPAPPLIAM-QGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLAR 594
Query: 145 SNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDL 204
+ + ++ V P TQ S Q + QQM Q +D+
Sbjct: 595 LRAEAQPYQASV-EVQPPQL---------TQVSPQQPMEYPLEPEQQMFPGSPFSQVADV 644
Query: 205 IKTNGTGDVNPDEFN 219
++ G + P F+
Sbjct: 645 MRAGGVPAMQPQYFD 659
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 30.5 bits (69), Expect = 4.5
Identities = 14/81 (17%), Positives = 25/81 (30%), Gaps = 13/81 (16%)
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP-PTSHISKIPS 124
P+P A + PPP+A +P + + P P P
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAA--------APPVPPAAPARPAAARPAPAP----APP 428
Query: 125 PSSSSPSSSSQLMQQGGGGGS 145
+++ P+ S+ G
Sbjct: 429 AAAAPPARSADPAAAASAGDR 449
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 30.5 bits (69), Expect = 4.6
Identities = 35/213 (16%), Positives = 55/213 (25%), Gaps = 18/213 (8%)
Query: 6 ELSQFPAS--HKTSTSMEVDTPPPPSSSRGSEPPSPNTGG---VSGGFISTSPSQPSPQH 60
LSQF H + + P PSS+R + S + TS +P
Sbjct: 209 PLSQFHGFTPHPSLPQPQSPLKPSPSSARPQQSESFSDVWPASTQSPREETSAEPLAPAS 268
Query: 61 GHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS 120
S Q+ SP A + S S P PP ++
Sbjct: 269 PSSRRPSTAQEEQIACSSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQPTFLPPNTNKK 328
Query: 121 KIPS-------------PSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQH 167
+ S + G +GQ+ + + L+ +
Sbjct: 329 AKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQSPPSNSPLLSSLADTPSGA 388
Query: 168 CIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQ 200
Q V + +Q T SS
Sbjct: 389 AHQPASLLPPAVVQQQLEDASDKQPPTPGSSLV 421
>gnl|CDD|240323 PTZ00233, PTZ00233, variable surface protein Vir18; Provisional.
Length = 509
Score = 30.3 bits (68), Expect = 4.7
Identities = 24/107 (22%), Positives = 49/107 (45%), Gaps = 4/107 (3%)
Query: 172 DDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGDVNPDEFNQLLAYHEV--FG 229
D S ++ P + + + D + SQ Q+D ++N V + + + + +
Sbjct: 135 DTKSQSKLRPVPSKAKSLEIKDPQEQSQNQADAQESNKESVVLQPQSDSMPSPSSIGTED 194
Query: 230 KLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTE 276
K S+V H S+T G+ T T Q +N++ D+ + +S + +E
Sbjct: 195 KEPQSIVNHHSTTSGMGETQTQQ--LNASGDSPIRELDSSAGDPPSE 239
>gnl|CDD|177547 PHA03164, PHA03164, hypothetical protein; Provisional.
Length = 88
Score = 28.4 bits (63), Expect = 4.8
Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 16/54 (29%)
Query: 85 PPPSASDK-----HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
PPPS S+ D Q SPH TS + +P P S + +
Sbjct: 7 PPPSYSEVLMMDVMDGQVSPHQEN-----------DTSFVECLPPPQISRTAWN 49
>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region.
Length = 57
Score = 27.4 bits (61), Expect = 4.9
Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 9/51 (17%)
Query: 75 PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
PPP SPPPP S + S P +++SPPP ++ K P P
Sbjct: 15 PPPPYYYKSPPPPVKSPVYKSPP---------PPVYKSPPPPKYVYKSPPP 56
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 29.8 bits (67), Expect = 5.0
Identities = 18/78 (23%), Positives = 26/78 (33%), Gaps = 5/78 (6%)
Query: 71 PHQMPPPSPIATSPPPPSASDK-HDSQSSPHITQHTGPGIFQSPP---PTSHISKIPSPS 126
P PPS T P P+ ++ DS + T P + PP P + PS
Sbjct: 143 PAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDT-PVTLELPPAPQPPPPVVPQPSTM 201
Query: 127 SSSPSSSSQLMQQGGGGG 144
S + + G
Sbjct: 202 VVHRRSRIKRTRSSGWQA 219
>gnl|CDD|188836 cd09452, LIM1_Enigma, The first LIM domain of Enigma. The first
LIM domain of Enigma: Enigma was initially characterized
in humans as a protein containing three LIM domains at
the C-terminus and a PDZ domain at N-terminus. The
third LIM domain specifically interacts with the insulin
receptor and the second LIM domain interacts with the
receptor tyrosine kinase Ret and the adaptor protein
APS. Thus Enigma is implicated in signal transduction
processes such as mitogenic activity, insulin related
actin organization, and glucose metabolism. Enigma is
expressed in multiple tissues, such as skeletal muscle,
heart, bone and brain. LIM domains are 50-60 amino
acids in size and share two characteristic zinc finger
motifs. The two zinc fingers contain eight conserved
residues, mostly cysteines and histidines, which
coordinately bond to two zinc atoms. LIM domains
function as adaptors or scaffolds to support the
assembly of multimeric protein complexes.
Length = 52
Score = 27.1 bits (60), Expect = 5.2
Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Query: 300 CDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFD 336
C C K RY + + R YH E FTC C K D
Sbjct: 1 CAQCNKIIRGRYLVALGRSYHPEE--FTCSQCKKVLD 35
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 30.4 bits (68), Expect = 5.3
Identities = 13/60 (21%), Positives = 21/60 (35%), Gaps = 2/60 (3%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGF-ISTSPSQPSPQHGHHTLTSLT 69
+ ST+ TP P+ + S SP G + + S H T++T
Sbjct: 34 IENSDNSTAEMQSTPATPTHTT-SNLTSPFGTGTDNSTNANGTESTTQASQPHSHETTIT 92
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 30.1 bits (67), Expect = 5.5
Identities = 24/126 (19%), Positives = 35/126 (27%), Gaps = 13/126 (10%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
+SH S S + S+S G E S F S P QH ++
Sbjct: 204 ASSH--SNSEDCSLSAGFSNSGGEE-----FTRRSTDFTRRQLSLPDRQHQPALPRQNSA 256
Query: 71 PHQMPPPSPIATSPP------PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPS 124
Q P PPS S+ S ++ G +S P
Sbjct: 257 GPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPRQQSSSSKGDSPE 316
Query: 125 PSSSSP 130
++
Sbjct: 317 LRPAAG 322
Score = 30.1 bits (67), Expect = 6.2
Identities = 21/129 (16%), Positives = 35/129 (27%), Gaps = 11/129 (8%)
Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
P S + + E + S+ S G + + + + SL
Sbjct: 189 PVSPRGDSGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQ-----LSLPD 243
Query: 71 PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
P P SA + + G +S S+ SS P
Sbjct: 244 RQHQP------ALPRQNSAGPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWP 297
Query: 131 SSSSQLMQQ 139
S ++ QQ
Sbjct: 298 QSGARPRQQ 306
Score = 30.1 bits (67), Expect = 6.3
Identities = 28/161 (17%), Positives = 39/161 (24%), Gaps = 35/161 (21%)
Query: 66 TSLTSPH--QMPPPSPIATSPPPPSAS----------DKHDSQSSPHITQ---------- 103
+L +P Q P +P PPS S D SS T+
Sbjct: 6 AALANPTPVQQQSRRPHERNPSPPSVSCGLSSGLQKIMMDDLDSSVDFTRLPSPTPENKD 65
Query: 104 ----HTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL---------MQQGGGGGSNGQTH 150
SP +S S+I P P+ L Q
Sbjct: 66 LFFVTRPSLAQSSPAYSSSYSEINEPELGMPNGGRSLSLVDLQDAASGQVHSSPMGLPGA 125
Query: 151 LHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQ 191
L ++ S GQ I +P +
Sbjct: 126 LGLRRASVPWSARSGQSSIAGGLRLSQPQSAPTVRLLHAAP 166
>gnl|CDD|148271 pfam06566, Chon_Sulph_att, Chondroitin sulphate attachment domain.
This family represents the chondroitin sulphate
attachment domain of vertebrate neural transmembrane
proteoglycans that contain EGF modules. Evidence has
been accumulated to support the idea that neural
proteoglycans are involved in various cellular events
including mitogenesis, differentiation, axonal outgrowth
and synaptogenesis. This domain contains several
potential sites of chondroitin sulphate attachment, as
well as potential sites of N-linked glycosylation.
Length = 253
Score = 30.0 bits (67), Expect = 5.5
Identities = 17/111 (15%), Positives = 31/111 (27%), Gaps = 2/111 (1%)
Query: 19 SMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
S E P G E + T S + + G L + P
Sbjct: 50 SEEAPVGPGVGPEEGLEASAAVTPTAWLEASSPGLGGVTAEAGSGDSQGLPATLPT-PDE 108
Query: 79 PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS-KIPSPSSS 128
+ S P + + ++ + P P + P S + + S+
Sbjct: 109 ALGNSNPSLALPEATEASNPPSPGPGDKPSLLPELPKESPVEVWLNLGGST 159
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 30.5 bits (69), Expect = 5.5
Identities = 25/127 (19%), Positives = 32/127 (25%), Gaps = 29/127 (22%)
Query: 16 TSTSMEVDTPPP--------PSSSRGSEPPSPNTGGV--SGGFISTSPSQPSPQHGHHTL 65
T M + PP P +RG E P G G T+ +
Sbjct: 626 TVGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRP 685
Query: 66 TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
T H +P A P P G G+ P + P
Sbjct: 686 T----AHHAALRAPQAPRPGGPPGG---------------GGGLPPPPDLPAAAGPAPCG 726
Query: 126 SSSSPSS 132
SS S
Sbjct: 727 SSLIASP 733
Score = 30.2 bits (68), Expect = 6.9
Identities = 19/99 (19%), Positives = 27/99 (27%), Gaps = 14/99 (14%)
Query: 19 SMEVDTPPPPSSSRGSE-PPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP 77
M T P +RG P+ + + +PQ PPP
Sbjct: 666 GMPAHTARPSRVARGDPVRPTAHHA-----------ALRAPQAPRPGGPPGGGGGLPPPP 714
Query: 78 SPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPT 116
A + P P S S + PG Q+
Sbjct: 715 DLPAAAGPAPCGSSL--IASPTAPPEPEPPGAEQADGAE 751
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 30.2 bits (68), Expect = 5.9
Identities = 22/96 (22%), Positives = 32/96 (33%), Gaps = 8/96 (8%)
Query: 15 KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQM 74
TS + P + ++P +P S P + +PHQ
Sbjct: 199 LTSAQLT-TAQPDDAPGTPAQPLTPLAAEAQAKAEVISTPSPVT----AAASPTITPHQT 253
Query: 75 PPPSPIATSPPPPSASDKHDSQS--SPHITQHTGPG 108
P P A +P + H+ Q S HIT T G
Sbjct: 254 -QPLPTAAAPVLSAPLGSHEWQQSLSQHITLFTRQG 288
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 30.2 bits (68), Expect = 6.1
Identities = 15/111 (13%), Positives = 33/111 (29%), Gaps = 8/111 (7%)
Query: 52 SPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQ 111
+P P + L + + + P + + PP P+
Sbjct: 424 TPMGPGGPLRPNGLAPM-NAVRAPSRNAQNAAQKPPMQ----PVMYPPNYQSLPLSQDLP 478
Query: 112 SPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSP 162
P T+ + +S++ MQ+ G + ++ I L+
Sbjct: 479 QPQSTASQGGQNKKLAQVLASATPQMQKQVLG---ERLFPLVEAIEPALAA 526
Score = 29.8 bits (67), Expect = 6.9
Identities = 26/148 (17%), Positives = 42/148 (28%), Gaps = 14/148 (9%)
Query: 64 TLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIP 123
L Q SP + P+ Q P + P
Sbjct: 370 HLQDQFMQLQPRMRQLPMGSPMGGAMGQP------PYYGQG--PQQQFNGQPLGWPRMSM 421
Query: 124 SPSSSSPSSSSQ-----LMQQGGGGGSNGQTHLHIQNIGAVL-SPSYGQHCIQQDDTQHS 177
P+ P + M N Q + V+ P+Y + QD Q
Sbjct: 422 MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQ 481
Query: 178 TVQSPPGQQQQYQQMDTSQSSQQQSDLI 205
+ S GQ ++ Q+ S + Q Q ++
Sbjct: 482 STASQGGQNKKLAQVLASATPQMQKQVL 509
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 30.1 bits (67), Expect = 6.2
Identities = 23/94 (24%), Positives = 32/94 (34%), Gaps = 21/94 (22%)
Query: 462 GETPHVCQV--CGKKYTRKEHLANHMR-SHTND--TPFKCEICTKSFTRKEHFTNHIMWH 516
P+ C V C KKY + L HM H N F+ K+
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKD--------- 396
Query: 517 TGETPHRCDFCSKTFTRKEHLLNHVRQHTGESPH 550
P+RC+ C K + LN ++ H S
Sbjct: 397 ---KPYRCEVCDKRYKN----LNGLKYHRKHSHD 423
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 30.2 bits (67), Expect = 6.2
Identities = 26/130 (20%), Positives = 47/130 (36%)
Query: 17 STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
+T + + P S S + S + PS SPQ + + SP + P
Sbjct: 12 ATQAKAEKPEEDSESSSEDSDSEEEMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKGAP 71
Query: 77 PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
P + P + + + +++SS + G + TS P +S +S +
Sbjct: 72 PVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPASTV 131
Query: 137 MQQGGGGGSN 146
G G+N
Sbjct: 132 TPGPSGKGAN 141
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 29.9 bits (67), Expect = 6.4
Identities = 13/41 (31%), Positives = 16/41 (39%), Gaps = 4/41 (9%)
Query: 75 PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP 115
PP P PP + +SP T + PG PPP
Sbjct: 10 SPPRR----PSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 29.6 bits (66), Expect = 6.8
Identities = 16/73 (21%), Positives = 31/73 (42%), Gaps = 9/73 (12%)
Query: 61 GHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS 120
G T T+ + P + + P ++ QS P+ T +PP T +
Sbjct: 19 GVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNAT---------TPPSTKVEA 69
Query: 121 KIPSPSSSSPSSS 133
+P++++PSS+
Sbjct: 70 PQQTPNATTPSST 82
>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
Provisional.
Length = 170
Score = 28.9 bits (65), Expect = 7.1
Identities = 8/62 (12%), Positives = 16/62 (25%)
Query: 75 PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
+ +S P ++S S + + + +PS S
Sbjct: 1 SSSASSGSSAPAVNSSPSSSSSGKSSNSSKSASSAVNPASVVAGYEPLPSFRDVPNSEKQ 60
Query: 135 QL 136
L
Sbjct: 61 NL 62
>gnl|CDD|150725 pfam10083, DUF2321, Uncharacterized protein conserved in bacteria
(DUF2321). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 158
Score = 28.7 bits (64), Expect = 7.3
Identities = 20/68 (29%), Positives = 26/68 (38%), Gaps = 6/68 (8%)
Query: 423 EHMVNHVRKHTGETPHRCEICKKAFTRKEHYVNHVMW--HTGETPHVCQVCGKKY---TR 477
E M N K T C C R +YV+ V+ + P C CGK + R
Sbjct: 25 ELMENFCSKCGAATITSCPHCSTPI-RGFYYVDGVLGLGSDYDAPSYCHNCGKPFPWTKR 83
Query: 478 KEHLANHM 485
K AN +
Sbjct: 84 KLEAANEL 91
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 29.1 bits (65), Expect = 7.3
Identities = 17/82 (20%), Positives = 25/82 (30%), Gaps = 3/82 (3%)
Query: 69 TSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSS 128
T PP +P +S PPP P + PPP + +
Sbjct: 47 TGYPFCPPTTPHPSSQPPPCPP---SPGHPPQTNDTHEKDLALQPPPGGKKKEKKKKETE 103
Query: 129 SPSSSSQLMQQGGGGGSNGQTH 150
P+ + QG G+ H
Sbjct: 104 KPAQGGEKPDQGPEAKGEGEGH 125
>gnl|CDD|173376 PTZ00082, PTZ00082, L-lactate dehydrogenase; Provisional.
Length = 321
Score = 29.7 bits (67), Expect = 7.7
Identities = 10/32 (31%), Positives = 18/32 (56%), Gaps = 5/32 (15%)
Query: 609 QYCPKAF-----SRKDHLVNHVRQHTGESPHK 635
+YCP AF + D +V +++H+G +K
Sbjct: 117 KYCPNAFVIVITNPLDVMVKLLQEHSGLPKNK 148
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 29.3 bits (66), Expect = 7.9
Identities = 16/111 (14%), Positives = 33/111 (29%), Gaps = 8/111 (7%)
Query: 15 KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT----SLTS 70
+ T + T +E ++ +P+ + SL S
Sbjct: 65 RVRTPASIKTSKRLIEVPEAEESLLDSYTTPSDKGGMLRILSTPELPKRKRSFSASSLES 124
Query: 71 PHQMPPPSP----IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
P P+ A S P P+++ + + + P + Q+P
Sbjct: 125 PSLFFSPASFSPSAAPSTPSPNSAKFSSRSNPGEVVETLNPHLGQTPEGGG 175
>gnl|CDD|218994 pfam06333, Med13_C, Mediator complex subunit 13 C-terminal.
Mediator is a large complex of up to 33 proteins that is
conserved from plants through fungi to humans - the
number and representation of individual subunits varying
with species. It is arranged into four different
sections, a core, a head, a tail and a kinase-activity
part, and the number of subunits within each of these is
what varies with species. Overall, Mediator regulates
the transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Med13 is part of the
ancillary kinase module, together with Med12, CDK8 and
CycC, which in yeast is implicated in transcriptional
repression, though most of this activity is likely
attributable to the CDK8 kinase. The large Med12 and
Med13 proteins are required for specific developmental
processes in Drosophila, zebrafish, and Caenorhabditis
elegans but their biochemical functions are not
understood.
Length = 406
Score = 29.6 bits (67), Expect = 7.9
Identities = 20/103 (19%), Positives = 29/103 (28%), Gaps = 7/103 (6%)
Query: 16 TSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP 75
T + D PP S ++ + + S PSP + T +
Sbjct: 227 TDPKLVPD-PPTQPIPFSRFNTSNSS---TPQSTPCTLSFPSPDQFGNAPTPPLGENGND 282
Query: 76 PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSH 118
P PP S S+ I T I P S+
Sbjct: 283 NAPP---VTPPGSPSESDPDSILVDIDDETWGVILSHRLPLSN 322
>gnl|CDD|205996 pfam13825, Paramyxo_PNT, Paramyxovirus structural protein V/P
N-terminus. This family consists of several
Paramyxoviridae structural protein P and V sequences.
From a structural point of view, P is the
best-characterized protein of the replicative complex. P
is organised into two moieties that are functionally and
structurally distinct: a C-terminal moiety (PCT) and an
N-terminal moiety (PNT). PCT is the most conserved in
sequence and contains all regions required for virus
transcription, whereas PNT, which is poorly conserved,
provides several additional functions required for
replication. P protein plays a crucial role in the
enzyme by positioning L onto the N/RNA template through
an interaction with the C-terminal domain of N. Without
P, L is not functional. The N, P, and L proteins of SeV
and measles and mumps viruses are functionally
equivalent. However, sequence identity between proteins
from these viruses is limited, and the viruses have been
placed in different genera (Respirovirus, Morbilivirus,
and Rubulavirus, respectively). SeV P protein (568 aa)
is a modular protein with distinct functional domains.
The N-terminal part of P (PNT) is a chaperone for N and
prevents it from binding to non-viral RNA in the
infected cell.
Length = 309
Score = 29.4 bits (66), Expect = 8.0
Identities = 22/94 (23%), Positives = 28/94 (29%), Gaps = 10/94 (10%)
Query: 25 PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS-PHQMPPPSPIATS 83
PP P RG P G S G T + T Q S +S
Sbjct: 210 PPIPDVKRGDPSCKPIKKGTEERSAS---------SGTETESLSTGGATQSALKSTWGSS 260
Query: 84 PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
P SA + S S+ + Q + P S
Sbjct: 261 EPNASAGNVRQSASNAKMIQKCKQESGTTASPRS 294
>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
secretion system, contains a FHA domain [Intracellular
trafficking, secretion, and vesicular transport;
Signal transduction mechanisms].
Length = 430
Score = 29.4 bits (66), Expect = 8.9
Identities = 26/126 (20%), Positives = 32/126 (25%), Gaps = 24/126 (19%)
Query: 13 SHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPH 72
S + DT P + +T P P T P
Sbjct: 105 SRAEPDESDPDTGSP-------QSAVSSTDMDDDFLADIEPEPRLPPGL-------TDPL 150
Query: 73 QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
Q+ P P S P SA QS I Q G S S +P S S+
Sbjct: 151 QLADPVPNGFSADPLSA-----LQSESLIAQPDPAGGAPSISRNS-----EAPLSDPASA 200
Query: 133 SSQLMQ 138
Sbjct: 201 GGIDTP 206
>gnl|CDD|171610 PRK12615, PRK12615, galactose-6-phosphate isomerase subunit LacB;
Reviewed.
Length = 171
Score = 28.8 bits (64), Expect = 9.1
Identities = 24/95 (25%), Positives = 41/95 (43%), Gaps = 8/95 (8%)
Query: 202 SDLIKTNGTGDVNPDEFNQLLAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADN 261
SD +K+ G ++ ++ ++ +FGK V + + LG+ T G N A N
Sbjct: 19 SDFLKSKGYDVIDCGTYDHTRTHYPIFGKKVGEAVVNGQADLGVCICGTGVGINN--AVN 76
Query: 262 GVSGTNSVITNNSTENNTTSITLDIKADLDLPVMG 296
V G S + + T L K +L+ V+G
Sbjct: 77 KVPGIRSALVRDMT------TALYAKEELNANVIG 105
>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type. This is a zinc-finger
domain with the CxxCx(12)Hx(6)H motif, found in multiple
copies in a wide range of proteins from plants to
metazoans. Some member proteins, particularly those from
plants, are annotated as being RNA-binding.
Length = 25
Score = 25.5 bits (57), Expect = 9.2
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 780 FKCDVCPKDFTCKGHLVSHLRSH 802
F C++C FT + L SHLR
Sbjct: 1 FYCELCNVTFTSESQLKSHLRGK 23
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 29.7 bits (66), Expect = 9.6
Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 6/73 (8%)
Query: 221 LLAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNST-ENNT 279
+ + + L++ S+ S ST+ NS N + TNS + NST +N+
Sbjct: 263 VAEWEAILAALEAFANGSASANSTSNSNSTSNSTTNS---NSTTTTNSTTSTNSTSSSNS 319
Query: 280 TSI--TLDIKADL 290
++I +DI A+
Sbjct: 320 STIAGCIDIAANF 332
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 29.0 bits (65), Expect = 9.9
Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 10/58 (17%)
Query: 68 LTSPHQMPPPSPIATSPPPPSASDKH----------DSQSSPHITQHTGPGIFQSPPP 115
++ PP P+ P PS + I +H P FQ PPP
Sbjct: 121 ISPATSPVPPVPLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPSFQQPPP 178
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.130 0.417
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 42,341,700
Number of extensions: 3839931
Number of successful extensions: 6990
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5911
Number of HSP's successfully gapped: 546
Length of query: 896
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 790
Effective length of database: 6,236,078
Effective search space: 4926501620
Effective search space used: 4926501620
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.6 bits)