RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7068
(487 letters)
>gnl|CDD|197477 smart00027, EH, Eps15 homology domain. Pair of EF hand motifs that
recognise proteins containing Asn-Pro-Phe (NPF)
sequences.
Length = 96
Score = 124 bits (314), Expect = 1e-34
Identities = 45/96 (46%), Positives = 63/96 (65%)
Query: 273 EWAVPHASKLKYTQLFNTTDRTRSGFLSGPQARNIMVATGLSQGILAQIWNLADMDSDGQ 332
WA+ K KY Q+F + D+ + G ++G QA+ I++ +GL Q +LA+IWNLAD+D+DG+
Sbjct: 1 PWAISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLKSGLPQTLLAKIWNLADIDNDGE 60
Query: 333 LSCDEFVLAMHLCDLAKGGEKIPVPLPIDMIPPAFR 368
L DEF LAMHL G IP LP +IPP+ R
Sbjct: 61 LDKDEFALAMHLIYRKLNGYPIPASLPPSLIPPSKR 96
Score = 103 bits (260), Expect = 6e-27
Identities = 39/93 (41%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 385 PWNVSRHERTRFETHFQAMKP-VNGIVTGDQVKGFLMQSGLPVATLGQIWSLADTDQDGK 443
PW +S ++ ++E F+++ +G VTG Q K L++SGLP L +IW+LAD D DG+
Sbjct: 1 PWAISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLKSGLPQTLLAKIWNLADIDNDGE 60
Query: 444 MNINEFVIACKLISNKLRGFDIPPTLPVSLMQS 476
++ +EF +A LI KL G+ IP +LP SL+
Sbjct: 61 LDKDEFALAMHLIYRKLNGYPIPASLPPSLIPP 93
>gnl|CDD|238009 cd00052, EH, Eps15 homology domain; found in proteins implicated in
endocytosis, vesicle transport, and signal transduction.
The alignment contains a pair of EF-hand motifs,
typically one of them is canonical and binds to Ca2+,
while the other may not bind to Ca2+. A hydrophobic
binding pocket is formed by residues from both EF-hand
motifs. The EH domain binds to proteins containing NPF
(class I), [WF]W or SWG (class II), or H[TS]F (class
III) sequence motifs.
Length = 67
Score = 99 bits (250), Expect = 6e-26
Identities = 31/66 (46%), Positives = 43/66 (65%)
Query: 284 YTQLFNTTDRTRSGFLSGPQARNIMVATGLSQGILAQIWNLADMDSDGQLSCDEFVLAMH 343
Y Q+F + D G +SG +AR + +GL + +LAQIW+LAD D DG+L +EF +AMH
Sbjct: 1 YDQIFRSLDPDGDGLISGDEARPFLGKSGLPRSVLAQIWDLADTDKDGKLDKEEFAIAMH 60
Query: 344 LCDLAK 349
L LA
Sbjct: 61 LIALAL 66
Score = 88.0 bits (219), Expect = 1e-21
Identities = 30/67 (44%), Positives = 46/67 (68%), Gaps = 1/67 (1%)
Query: 396 FETHFQAMKPVN-GIVTGDQVKGFLMQSGLPVATLGQIWSLADTDQDGKMNINEFVIACK 454
++ F+++ P G+++GD+ + FL +SGLP + L QIW LADTD+DGK++ EF IA
Sbjct: 1 YDQIFRSLDPDGDGLISGDEARPFLGKSGLPRSVLAQIWDLADTDKDGKLDKEEFAIAMH 60
Query: 455 LISNKLR 461
LI+ L
Sbjct: 61 LIALALN 67
>gnl|CDD|193239 pfam12763, efhand_3, Cytoskeletal-regulatory complex EF hand. This
is an efhand family from the N-terminal of actin
cytoskeleton-regulatory complex END3 and similar
proteins from fungi and closely related species.
Length = 104
Score = 76.3 bits (188), Expect = 4e-17
Identities = 34/73 (46%), Positives = 43/73 (58%), Gaps = 2/73 (2%)
Query: 400 FQAMKPVNGIVTGDQVKGFLMQSGLPVATLGQIWSLADTDQDGKMNINEFVIACKLISNK 459
F +KP N +TGDQV L S LP L +IW LAD D DGK++ EF IA +LI +
Sbjct: 16 FSGLKPENNKLTGDQVSPVLKNSRLPDDQLAKIWDLADIDSDGKLDFEEFCIAMRLIFDL 75
Query: 460 LRGF--DIPPTLP 470
+ G D+P LP
Sbjct: 76 VNGNIADVPDELP 88
Score = 60.5 bits (147), Expect = 2e-11
Identities = 31/84 (36%), Positives = 48/84 (57%), Gaps = 3/84 (3%)
Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATGLSQGILAQIWNLADMDSDGQLSCDEFVLAM 342
KY ++F+ + L+G Q ++ + L LA+IW+LAD+DSDG+L +EF +AM
Sbjct: 11 KYWEIFSGLK-PENNKLTGDQVSPVLKNSRLPDDQLAKIWDLADIDSDGKLDFEEFCIAM 69
Query: 343 HLC-DLAKGGEK-IPVPLPIDMIP 364
L DL G +P LP ++P
Sbjct: 70 RLIFDLVNGNIADVPDELPDWLVP 93
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 41.1 bits (96), Expect = 0.001
Identities = 25/78 (32%), Positives = 31/78 (39%), Gaps = 4/78 (5%)
Query: 146 PLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKL 205
L++ PP P + L+ PPG SP P A +PP VPA
Sbjct: 2694 SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL--PAAPAPPAVPAG--PA 2749
Query: 206 TPPGPLTPDKPPAPVESP 223
TP GP P +PP P
Sbjct: 2750 TPGGPARPARPPTTAGPP 2767
Score = 38.0 bits (88), Expect = 0.012
Identities = 17/72 (23%), Positives = 22/72 (30%), Gaps = 1/72 (1%)
Query: 153 SPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGP-L 211
PP T PP+ P P P + + S SP + + P L
Sbjct: 2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
Query: 212 TPDKPPAPVESP 223
P PA P
Sbjct: 2819 PPAASPAGPLPP 2830
Score = 35.3 bits (81), Expect = 0.099
Identities = 23/99 (23%), Positives = 33/99 (33%), Gaps = 23/99 (23%)
Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPG--------AT 193
P+ SP + PP+S P P P + S+ PG +
Sbjct: 2812 LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP--PPPSLPLGGSVAPGGDVRRRPPSR 2869
Query: 194 SPPNVPAA-------------ITKLTPPGPLTPDKPPAP 219
SP PAA +++ T L PD+P P
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
Score = 35.3 bits (81), Expect = 0.100
Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 4/95 (4%)
Query: 140 GGSGPSPLSTSIVSPPNTVAPP--SSTTPLIPGITSP--PGTLQGGITSPVSIIPGATSP 195
G P+P + PP + P +S + + SP P + +P + +P A SP
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP 2824
Query: 196 PNVPAAITKLTPPGPLTPDKPPAPVESPWGAFING 230
T P P P PP P G+ G
Sbjct: 2825 AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
Score = 34.9 bits (80), Expect = 0.13
Identities = 24/102 (23%), Positives = 32/102 (31%), Gaps = 12/102 (11%)
Query: 137 VSGGGSGPSPLSTSIVSPPNT---VAPPSSTTPLIPGITSPPGTLQGGITSPVSI---IP 190
VS P P + SP P P PG + P P P
Sbjct: 2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
Query: 191 GATSPPNVP------AAITKLTPPGPLTPDKPPAPVESPWGA 226
A P + + ++ + P P P PPA V +P A
Sbjct: 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817
Score = 32.6 bits (74), Expect = 0.56
Identities = 21/89 (23%), Positives = 25/89 (28%), Gaps = 3/89 (3%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
P + P V+ + + L P P Q P P P
Sbjct: 2879 ARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAP-PPPQPQPQPPPPPQPQPPPPP 2935
Query: 204 KLTPPGPLTPDKPPAPVESPWGAFINGWL 232
P PL P PA P GA WL
Sbjct: 2936 PPRPQPPLAPTTDPAGAGEPSGAVPQPWL 2964
Score = 31.4 bits (71), Expect = 1.3
Identities = 25/85 (29%), Positives = 32/85 (37%), Gaps = 11/85 (12%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPA--- 200
P TS P+ APP S P P G +G +P S +P T P+ P
Sbjct: 2578 SEPAVTSRARRPD--APPQSARPRAPV--DDRGDPRG--PAPPSPLPPDTHAPDPPPPSP 2631
Query: 201 --AITKLTPPGPLTPDKPPAPVESP 223
A + P P T P P + P
Sbjct: 2632 SPAANEPDPHPPPTVPPPERPRDDP 2656
Score = 31.1 bits (70), Expect = 1.8
Identities = 23/82 (28%), Positives = 30/82 (36%), Gaps = 12/82 (14%)
Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
P L ++ PP A ++ P +P +PP G T PG + P P
Sbjct: 2710 PAPHALVSATPLPPGPAAARQAS-PALPAAPAPPAVPAGPAT------PGGPARPARPP- 2761
Query: 202 ITKLTPPGPLTPDKPPAPVESP 223
T GP P P AP P
Sbjct: 2762 ----TTAGPPAPAPPAAPAAGP 2779
Score = 29.1 bits (65), Expect = 7.2
Identities = 15/70 (21%), Positives = 19/70 (27%), Gaps = 1/70 (1%)
Query: 154 PPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN-VPAAITKLTPPGPLT 212
P + P+ +P GG P S P P + PP P T
Sbjct: 374 HPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPAT 433
Query: 213 PDKPPAPVES 222
P P
Sbjct: 434 PLPSAEPGSD 443
>gnl|CDD|200946 pfam00036, efhand, EF hand. The EF-hands can be divided into two
classes: signaling proteins and buffering/transport
proteins. The first group is the largest and includes
the most well-known members of the family such as
calmodulin, troponin C and S100B. These proteins
typically undergo a calcium-dependent conformational
change which opens a target binding site. The latter
group is represented by calbindin D9k and do not undergo
calcium dependent conformational changes.
Length = 29
Score = 35.5 bits (83), Expect = 0.002
Identities = 8/26 (30%), Positives = 13/26 (50%)
Query: 431 QIWSLADTDQDGKMNINEFVIACKLI 456
+ + D D DGK++ EF K +
Sbjct: 4 EAFKEFDKDGDGKISFEEFKELLKKL 29
Score = 35.1 bits (82), Expect = 0.002
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 318 LAQIWNLADMDSDGQLSCDEFVLAMH 343
L + + D D DG++S +EF +
Sbjct: 2 LKEAFKEFDKDGDGKISFEEFKELLK 27
>gnl|CDD|238008 cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily
of calcium sensors and calcium signal modulators; most
examples in this alignment model have 2 active canonical
EF hands. Ca2+ binding induces a conformational change
in the EF-hand motif, leading to the activation or
inactivation of target proteins. EF-hands tend to occur
in pairs or higher copy numbers.
Length = 63
Score = 35.6 bits (83), Expect = 0.004
Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATG--LSQGILAQIWNLADMDSDGQLSCDEFVL 340
+ + F D+ G +S + + + + G LS+ + ++ D D DG++ +EF+
Sbjct: 1 ELREAFRLFDKDGDGTISADELKAALKSLGEGLSEEEIDEMIREVDKDGDGKIDFEEFLE 60
Query: 341 AMH 343
M
Sbjct: 61 LMA 63
Score = 35.2 bits (82), Expect = 0.005
Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 407 NGIVTGDQVKGFLMQSGLPV--ATLGQIWSLADTDQDGKMNINEFV 450
+G ++ D++K L G + + ++ D D DGK++ EF+
Sbjct: 14 DGTISADELKAALKSLGEGLSEEEIDEMIREVDKDGDGKIDFEEFL 59
>gnl|CDD|185504 PTZ00184, PTZ00184, calmodulin; Provisional.
Length = 149
Score = 37.0 bits (86), Expect = 0.005
Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATG--LSQGILAQIWNLADMDSDGQLSCDEFVL 340
+ + F DR +GF+S + R++M G L+ + ++ AD+D DGQ++ +EFV
Sbjct: 85 EIKEAFKVFDRDGNGFISAAELRHVMTNLGEKLTDEEVDEMIREADVDGDGQINYEEFVK 144
Query: 341 AM 342
M
Sbjct: 145 MM 146
>gnl|CDD|197492 smart00054, EFh, EF-hand, calcium binding motif. EF-hands are
calcium-binding motifs that occur at least in pairs.
Links between disease states and genes encoding
EF-hands, particularly the S100 subclass, are emerging.
Each motif consists of a 12 residue loop flanked on
either side by a 12 residue alpha-helix. EF-hands
undergo a conformational change unpon binding calcium
ions.
Length = 29
Score = 32.7 bits (76), Expect = 0.018
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 431 QIWSLADTDQDGKMNINEFVIACKLI 456
+ + L D D DGK++ EF K +
Sbjct: 4 EAFRLFDKDGDGKIDFEEFKDLLKAL 29
Score = 29.7 bits (68), Expect = 0.22
Identities = 8/25 (32%), Positives = 14/25 (56%)
Query: 318 LAQIWNLADMDSDGQLSCDEFVLAM 342
L + + L D D DG++ +EF +
Sbjct: 2 LKEAFRLFDKDGDGKIDFEEFKDLL 26
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 36.9 bits (86), Expect = 0.024
Identities = 15/77 (19%), Positives = 21/77 (27%)
Query: 147 LSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLT 206
L + S NT P P P Q T+P P ++
Sbjct: 510 LESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQ 569
Query: 207 PPGPLTPDKPPAPVESP 223
P +P P +P
Sbjct: 570 IPADSSPPPPIPEEPTP 586
Score = 31.5 bits (72), Expect = 1.0
Identities = 16/81 (19%), Positives = 22/81 (27%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
S ++ + T AP S P ++P SP P
Sbjct: 365 ISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEP 424
Query: 204 KLTPPGPLTPDKPPAPVESPW 224
TPP P +E W
Sbjct: 425 SPTPPANAANAPPSLNLEELW 445
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 36.8 bits (85), Expect = 0.024
Identities = 23/94 (24%), Positives = 33/94 (35%), Gaps = 5/94 (5%)
Query: 138 SGGGSGPSPLSTSIVS--PPNTVAPPSST-TPLIPGITSPPGTLQGGITSPVSIIPGATS 194
SGGG+GP+ + + V+ P AP ++ P P V+ P S
Sbjct: 368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRS 427
Query: 195 PPNVPAAITKLTPP--GPLTPDKPPAPVESPWGA 226
P A + P PAP +P A
Sbjct: 428 PAPEALAAARQASARGPGGAPAPAPAPAAAPAAA 461
Score = 31.8 bits (72), Expect = 1.0
Identities = 17/93 (18%), Positives = 27/93 (29%), Gaps = 14/93 (15%)
Query: 139 GGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNV 198
G G P+P +P P ++ + + P +P
Sbjct: 443 GPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA--------------PARAAPAAA 488
Query: 199 PAAITKLTPPGPLTPDKPPAPVESPWGAFINGW 231
PA PP P + +P + A GW
Sbjct: 489 PAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
Score = 30.6 bits (69), Expect = 2.1
Identities = 14/86 (16%), Positives = 24/86 (27%), Gaps = 3/86 (3%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
P+ + + + A ++ P + Q P A +P PAA
Sbjct: 403 PAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
Query: 204 KLTPPGPLTP---DKPPAPVESPWGA 226
+ GP +P A
Sbjct: 463 RPAAAGPRPVAAAAAAAPARAAPAAA 488
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 36.2 bits (83), Expect = 0.040
Identities = 25/90 (27%), Positives = 35/90 (38%), Gaps = 1/90 (1%)
Query: 135 LIVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATS 194
L GS S + S S T + S +T + P + P G L +P I+ +TS
Sbjct: 561 LFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTS 620
Query: 195 PPNVPAAITKLTPPGPLTPDKPPA-PVESP 223
PP TP P + K + SP
Sbjct: 621 PPASHLGSPSTTPSSPESSIKVASTETASP 650
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 36.3 bits (84), Expect = 0.040
Identities = 18/84 (21%), Positives = 29/84 (34%), Gaps = 5/84 (5%)
Query: 136 IVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSP 195
+ + + P P + + SP +P S+ P T G VS+ P A P
Sbjct: 382 VFTQPAAAPQPSAAAAASP----SPSQSSAAAQPSAP-QSATQPAGTPPTVSVDPPAAVP 436
Query: 196 PNVPAAITKLTPPGPLTPDKPPAP 219
N P+ + P +K
Sbjct: 437 VNPPSTAPQAVRPAQFKEEKKIPV 460
Score = 33.6 bits (77), Expect = 0.26
Identities = 22/85 (25%), Positives = 33/85 (38%), Gaps = 7/85 (8%)
Query: 127 TIQAKGGYLIVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPV 186
Q + + PSP +S + P AP S+T P T P ++ PV
Sbjct: 383 FTQPAAAPQPSAAAAASPSPSQSSAAAQP--SAPQSATQ---PAGTPPTVSVDPPAAVPV 437
Query: 187 SIIPGATSPPNVPAAITKLTPPGPL 211
+ P +T+P V A K P+
Sbjct: 438 N--PPSTAPQAVRPAQFKEEKKIPV 460
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 36.2 bits (83), Expect = 0.042
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 3/79 (3%)
Query: 141 GSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITS-PVSIIPGATSPPNVP 199
G+ P+ T + P T AP ++ +P +P +PP + GI P +P A+ P P
Sbjct: 407 GTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAP 466
Query: 200 AAITKL--TPPGPLTPDKP 216
++ PP P
Sbjct: 467 DSVASASDAPPTLGDPSDT 485
Score = 32.7 bits (74), Expect = 0.46
Identities = 25/92 (27%), Positives = 33/92 (35%), Gaps = 16/92 (17%)
Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
SGGG+ S + + ++T P PG P GT +P S P +P
Sbjct: 374 SGGGASAPSGSAA----EGPASGGAATIP-TPGTQGPQGTAPAAGMTPSSAAPATPAPSA 428
Query: 198 VPAAITKLTPPGPLTP--DKPPAPVESPWGAF 227
P+ P P D PPAP S
Sbjct: 429 APS---------PRVPWDDAPPAPPRSGIPPR 451
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 36.0 bits (83), Expect = 0.046
Identities = 9/85 (10%), Positives = 15/85 (17%)
Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
+ P A ++T P P + N A+
Sbjct: 399 VTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARAS 458
Query: 202 ITKLTPPGPLTPDKPPAPVESPWGA 226
P +P
Sbjct: 459 ADSRCDERDAQPPADSGSASAPASD 483
>gnl|CDD|222177 pfam13499, EF_hand_5, EF-hand domain pair.
Length = 60
Score = 32.3 bits (74), Expect = 0.046
Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 6/57 (10%)
Query: 292 DRTRSGFLSGPQARNIMVATGLSQG------ILAQIWNLADMDSDGQLSCDEFVLAM 342
D+ G++ + R ++ A GL ++ +N D D DG++S +EF+ AM
Sbjct: 4 DKDGDGYIDVEELRKLLKALGLKLTDEEVEELIEADFNEIDKDGDGRISFEEFLEAM 60
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 35.7 bits (82), Expect = 0.052
Identities = 20/86 (23%), Positives = 32/86 (37%)
Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
GG G P + P + P ++ L P +S PG ++ + + + P+
Sbjct: 175 IGGRGGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPS 234
Query: 198 VPAAITKLTPPGPLTPDKPPAPVESP 223
P A P P+ P PV P
Sbjct: 235 APPASIPAPPIPPVIQYVAPPPVPPP 260
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 34.2 bits (79), Expect = 0.082
Identities = 15/45 (33%), Positives = 16/45 (35%)
Query: 170 GITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPD 214
G S P GG P AT P V GPLTP+
Sbjct: 14 GGFSVPAFAYGGEPEPEVPQTEATDPSEVVVETVTEEEGGPLTPE 58
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 34.0 bits (78), Expect = 0.12
Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 8/87 (9%)
Query: 140 GGSGPSPLSTSIVSPPNTVAP---PSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP 196
G P+P ++ P + P P L+P PP + P P
Sbjct: 111 GEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPS-----YEETIKPGPAPVEEP 165
Query: 197 NVPAAITKLTPPGPLTPDKPPAPVESP 223
AI P+T + PPAP P
Sbjct: 166 VDSMAIAVPAIDTPVTLELPPAPQPPP 192
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 34.3 bits (79), Expect = 0.16
Identities = 15/84 (17%), Positives = 25/84 (29%), Gaps = 1/84 (1%)
Query: 141 GSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP-NVP 199
P+ + S P + P+ + P + V++ P +
Sbjct: 400 APAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET 459
Query: 200 AAITKLTPPGPLTPDKPPAPVESP 223
AI P P PAP +P
Sbjct: 460 VAIPVRVAPEPAVASAAPAPAAAP 483
Score = 31.2 bits (71), Expect = 1.5
Identities = 14/85 (16%), Positives = 20/85 (23%), Gaps = 5/85 (5%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
+ +P A + S P P A +P PAA
Sbjct: 383 RPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA-- 440
Query: 204 KLTPPGPLTPDKPPAPVESPWGAFI 228
P + P +P I
Sbjct: 441 ---APAAVALAPAPPAQAAPETVAI 462
>gnl|CDD|227455 COG5126, FRQ1, Ca2+-binding protein (EF-Hand superfamily) [Signal
transduction mechanisms / Cytoskeleton / Cell division
and chromosome partitioning / General function
prediction only].
Length = 160
Score = 32.7 bits (75), Expect = 0.20
Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 2/59 (3%)
Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATG--LSQGILAQIWNLADMDSDGQLSCDEFV 339
+ + F D+ G++S + R ++ + G LS + ++ D D DG++ +EF
Sbjct: 93 ELREAFKLFDKDHDGYISIGELRRVLKSLGERLSDEEVEKLLKEYDEDGDGEIDYEEFK 151
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 31.6 bits (72), Expect = 0.27
Identities = 23/90 (25%), Positives = 28/90 (31%), Gaps = 15/90 (16%)
Query: 137 VSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP 196
SG S SP ++ P V P P PVS + P
Sbjct: 22 TSGFSSCFSPPESAHPDDPPPVGDPRPPVVDTP--------------PPVSAVWVLPPPS 67
Query: 197 NVPAAITKLTPPGPLTPDKPPAPVESPWGA 226
A PP P P PP+P+ P A
Sbjct: 68 EPAAPPPDPEPPVP-GPAGPPSPLAPPAPA 96
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 33.6 bits (77), Expect = 0.28
Identities = 19/90 (21%), Positives = 25/90 (27%), Gaps = 14/90 (15%)
Query: 139 GGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNV 198
G P + +P APP + P +P P + P P
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPAR---------PAAARPAPAPAPPA 429
Query: 199 PAAITKLTPPGPLTPDKPPAPVESPWGAFI 228
AA PP A W AF+
Sbjct: 430 AAA-----PPARSADPAAAASAGDRWRAFV 454
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 33.4 bits (77), Expect = 0.29
Identities = 23/148 (15%), Positives = 35/148 (23%), Gaps = 5/148 (3%)
Query: 79 STPGSSCSASHTSATNGPCHPSLTSPTHWYLQGKKYHTHVFTAGQFCMTIQAKGGYLIVS 138
+ P + + + + P+ S H A
Sbjct: 616 AAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675
Query: 139 GGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNV 198
+ P P A P+ P G P + P A +
Sbjct: 676 AAPAAPPPAPAPAAPAAPAGAAPAQPAP-----APAATPPAGQADDPAAQPPQAAQGASA 730
Query: 199 PAAITKLTPPGPLTPDKPPAPVESPWGA 226
P+ P P PD PP P +P
Sbjct: 731 PSPAADDPVPLPPEPDDPPDPAGAPAQP 758
Score = 31.1 bits (71), Expect = 1.5
Identities = 20/102 (19%), Positives = 27/102 (26%), Gaps = 13/102 (12%)
Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
G P S P P + P P +P G + + PG +P +
Sbjct: 593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEH 652
Query: 198 VPAAI-------------TKLTPPGPLTPDKPPAPVESPWGA 226
P + K P P PAP A
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPA 694
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 31.7 bits (72), Expect = 0.95
Identities = 12/77 (15%), Positives = 21/77 (27%), Gaps = 5/77 (6%)
Query: 152 VSPPNTVAPPSSTTPLI---PGITSPPGTLQGGITSPVSIIPGATSPPNVPA--AITKLT 206
V P P ++ + P ++ P P + +PP VP +
Sbjct: 363 VPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVP 422
Query: 207 PPGPLTPDKPPAPVESP 223
P A +
Sbjct: 423 HTPESAPKLTRAAIPVD 439
Score = 30.9 bits (70), Expect = 1.9
Identities = 17/68 (25%), Positives = 22/68 (32%), Gaps = 1/68 (1%)
Query: 157 TVAPPSSTTPLIPGITSP-PGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDK 215
+ P + P P +P P ++ A PP P T PP P P
Sbjct: 359 LLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVA 418
Query: 216 PPAPVESP 223
PP P
Sbjct: 419 PPVPHTPE 426
Score = 30.2 bits (68), Expect = 3.3
Identities = 25/91 (27%), Positives = 32/91 (35%), Gaps = 7/91 (7%)
Query: 143 GPSPLSTSIVSPPNTVAPPSSTTPLIPGITSP-PGTLQGGITSPVSIIPGATSPPNVPAA 201
P+ + S V P T AP + P + PV P A P+ P +
Sbjct: 370 KPTAAAPSPVRP--TPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPES 427
Query: 202 ITKLTP---PGPLTPD-KPPAPVESPWGAFI 228
KLT P P PPAP + A I
Sbjct: 428 APKLTRAAIPVDEKPKYTPPAPPKEEEKALI 458
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 31.8 bits (73), Expect = 1.1
Identities = 12/72 (16%), Positives = 20/72 (27%), Gaps = 1/72 (1%)
Query: 153 SPPNTVAPPSSTTPLIPGITSPPGTL-QGGITSPVSIIPGATSPPNVPAAITKLTPPGPL 211
P +T AP ++ ++P +P P A +P P
Sbjct: 37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAP 96
Query: 212 TPDKPPAPVESP 223
A +P
Sbjct: 97 AAPPAAAAAAAP 108
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 31.2 bits (70), Expect = 1.4
Identities = 18/85 (21%), Positives = 30/85 (35%), Gaps = 2/85 (2%)
Query: 137 VSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP 196
+ PS ++T + A S+ ++P + PGT+ PV++ P S
Sbjct: 323 IPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTT 382
Query: 197 NVPAAITKLTPPGPLTPDK--PPAP 219
+ T P P AP
Sbjct: 383 ETQQSSTGNITSTANGPTTSLPAAP 407
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.0 bits (71), Expect = 1.5
Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 9/80 (11%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
P+ + + +P VAPP + P ++P +P +P TS + AA
Sbjct: 377 PAASAQATAAPTAAVAPPQAPAVPPPPASAPQ-------QAPAVPLPETTSQ--LLAARQ 427
Query: 204 KLTPPGPLTPDKPPAPVESP 223
+L T K P +
Sbjct: 428 QLQRAQGATKAKKSEPAAAS 447
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 30.7 bits (69), Expect = 1.7
Identities = 17/71 (23%), Positives = 21/71 (29%), Gaps = 2/71 (2%)
Query: 143 GPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAI 202
+ + P T PS+TT P T+ P P A P P
Sbjct: 206 ATPRPTPRTTASPETTPTPSTTTS--PPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPG 263
Query: 203 TKLTPPGPLTP 213
PP TP
Sbjct: 264 GGEAPPANATP 274
Score = 29.9 bits (67), Expect = 3.4
Identities = 15/71 (21%), Positives = 22/71 (30%)
Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
P P + SP T P ++T+P I +P T+ G +PP
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEA 267
Query: 204 KLTPPGPLTPD 214
P
Sbjct: 268 PPANATPAPEA 278
Score = 28.4 bits (63), Expect = 9.4
Identities = 18/60 (30%), Positives = 23/60 (38%), Gaps = 6/60 (10%)
Query: 160 PPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDKPPAP 219
P+ T P T P T + P + IP ++ P A T G TP PP P
Sbjct: 209 RPTPRTTASPETTPTPST---TTSPPSTTIPAPSTTIAAPQAGTTPEAEG--TPA-PPTP 262
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 30.7 bits (69), Expect = 1.8
Identities = 21/83 (25%), Positives = 31/83 (37%), Gaps = 7/83 (8%)
Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGI-TSPPGTLQGGITSPVSIIPGATSPPNVPA 200
+ PSP + P V+P S+T +P + + P T +P SP V
Sbjct: 451 TSPSP------TAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYD 504
Query: 201 AITKLTPPGPLTPDKPPAPVESP 223
+ T P P P AP +
Sbjct: 505 DLKPPTSPSPAAPVGKVAPSSTN 527
>gnl|CDD|225207 COG2326, COG2326, Uncharacterized conserved protein [Function
unknown].
Length = 270
Score = 30.4 bits (69), Expect = 2.0
Identities = 8/26 (30%), Positives = 12/26 (46%), Gaps = 2/26 (7%)
Query: 99 PSLTSPTHWYLQGKKYHTHVFTAGQF 124
P+ WY Q +Y H+ AG+
Sbjct: 111 PTDRERGQWYFQ--RYVAHLPAAGEI 134
>gnl|CDD|218471 pfam05157, T2SE_Nter, Type II secretion system (T2SS), protein E,
N-terminal domain. This domain is found at the
N-terminus of members of the Type II secretion system
protein E. Proteins in this subfamily are typically
involved in Type 4 pilus biogenesis , though some are
involved in other processes; for instance aggregation in
Myxococcus xanthus. The structure of this domain is now
known.
Length = 110
Score = 28.8 bits (65), Expect = 2.3
Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 13/71 (18%)
Query: 357 PLPIDMIPPAFRRQRQ--------NSVTLAANVAMDPWNVSRHERTRFETHFQAMKPVNG 408
P + ++P F R+ ++ +A DP ++ + RF T + ++PV
Sbjct: 35 PELLRLLPREFARRHLVLPLREDDGTLVVAVA---DPLDLELLDELRFLTGRKRVEPV-- 89
Query: 409 IVTGDQVKGFL 419
+ T ++ L
Sbjct: 90 LATPSDIRRAL 100
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 30.4 bits (69), Expect = 2.4
Identities = 15/89 (16%), Positives = 22/89 (24%), Gaps = 13/89 (14%)
Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
S PL TS + P T P++ + P Q + P
Sbjct: 155 SQNSGQSVPLDTSTTTDPATTPAPAAPVD------TTPTNSQTPAVAT-------APAPA 201
Query: 198 VPAAITKLTPPGPLTPDKPPAPVESPWGA 226
V + P D P +
Sbjct: 202 VDPQQNAVVAPSQANVDTAATPAPAAPAT 230
Score = 30.0 bits (68), Expect = 2.5
Identities = 19/85 (22%), Positives = 27/85 (31%), Gaps = 2/85 (2%)
Query: 141 GSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPA 200
+ P +T + P P +S TP + +P Q S T+ PA
Sbjct: 167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPA 226
Query: 201 AITKLTPPGPLTPDK--PPAPVESP 223
A PL D+ P P
Sbjct: 227 APATPDGAAPLPTDQAGVSTPAADP 251
>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
biogenesis, outer membrane].
Length = 760
Score = 30.6 bits (69), Expect = 2.4
Identities = 19/96 (19%), Positives = 29/96 (30%), Gaps = 8/96 (8%)
Query: 125 CMTIQAKGGYLIVSGGGSGPSPLSTSIVSPPN-----TVAPPSSTTPLIPGITSPPGTLQ 179
+ G + G+ PL V + +P S + P P L
Sbjct: 2 SYSRGTIDGAATATMKGANVQPLFALTVIILLFPTGASASPHSVSVPKGDSNVEFPPILS 61
Query: 180 GGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDK 215
G I +P S P P + GP+ +K
Sbjct: 62 GTI-NPPSTFRSVQHPV--PPPGVRSDSTGPIQTNK 94
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 30.0 bits (68), Expect = 2.4
Identities = 20/73 (27%), Positives = 28/73 (38%), Gaps = 5/73 (6%)
Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
S P +S S +P S T P + PPG + P +T+ P P+A
Sbjct: 210 SSLPPAPSSFQSDTPPPSPESPTNP-----SPPPGPAAPPPPPVQQVPPLSTAKPTPPSA 264
Query: 202 ITKLTPPGPLTPD 214
P G +T D
Sbjct: 265 SATPAPIGGITLD 277
>gnl|CDD|177641 PHA03413, PHA03413, putative internal core protein; Provisional.
Length = 1304
Score = 30.4 bits (68), Expect = 2.6
Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 10/103 (9%)
Query: 313 LSQGILA-QIWNLADMDSDGQLSCDEFVLAMHLCDLAKGGEKIPVPLPIDMIPPAFRRQR 371
L+ GI+A ++ N+ D++ + +EF L + D A P + PP + +
Sbjct: 1077 LNAGIIADELENMFKGDAEAFKALEEFGLTPEIADQAIAANAAN---PGALFPPDLQME- 1132
Query: 372 QNSVTLAANVAMDPWNVSRHERTRFETHFQAMKPVNGIVTGDQ 414
+ + AMD ++ ++ RT +HF P+ I+ G Q
Sbjct: 1133 ---IEIVGTRAMD--SLMQNIRTGETSHFAQFNPIGKIIVGYQ 1170
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.4 bits (68), Expect = 2.8
Identities = 21/70 (30%), Positives = 27/70 (38%), Gaps = 4/70 (5%)
Query: 137 VSGGGSGPSPLSTSIVSPP--NTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATS 194
+ G G S S S P T P+S+ P PG P + + +T I G
Sbjct: 265 LPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPG--GPGSSSRNAVTRSTDSISGPIP 322
Query: 195 PPNVPAAITK 204
P P AIT
Sbjct: 323 SPGAPRAITG 332
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 30.1 bits (68), Expect = 2.9
Identities = 10/87 (11%), Positives = 18/87 (20%), Gaps = 3/87 (3%)
Query: 140 GGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVP 199
S + S + P S+ P P + + +
Sbjct: 89 TWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP---PPSPAPDLSEMLRPVGSPG 145
Query: 200 AAITKLTPPGPLTPDKPPAPVESPWGA 226
P +P + S A
Sbjct: 146 PPPAASPPAAGASPAAVASDAASSRQA 172
>gnl|CDD|220472 pfam09924, DUF2156, Uncharacterized conserved protein (DUF2156).
This domain, found in various hypothetical prokaryotic
proteins, has no known function.
Length = 246
Score = 29.5 bits (67), Expect = 3.3
Identities = 12/47 (25%), Positives = 18/47 (38%)
Query: 98 HPSLTSPTHWYLQGKKYHTHVFTAGQFCMTIQAKGGYLIVSGGGSGP 144
+ S S + L K + G + + KGG L+V G G
Sbjct: 3 YGSENSFANLALWRDKSYYFYAPDGDAFIAYRVKGGSLVVLGDPVGD 49
>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein. This domain is
found in the aflatoxin regulatory protein (AflR) which
is involved in the regulation of the biosynthesis of
aflatoxin in the fungal genus Aspergillus. It occurs
together with the fungal Zn(2)-Cys(6) binuclear cluster
domain (pfam00172).
Length = 275
Score = 29.5 bits (66), Expect = 3.5
Identities = 31/121 (25%), Positives = 50/121 (41%), Gaps = 21/121 (17%)
Query: 159 APPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPD---- 214
P ++++P IP T+ T PV P + PP T TP G +P
Sbjct: 4 TPNTASSPTIPANTTANTTSSSHPQPPVQSGPSSIQPPVA----TPHTPNGTSSPSPKFS 59
Query: 215 --KPPAPVESPWGAFINGWLSTKEKDLEVNNSNLVDLKKTLANMMSDSEKLYNVLGGPPL 272
PPA E WG+ ++ S +++ DL +L ++ +D +L+ L PL
Sbjct: 60 HQSPPAEPEL-WGSILSPNASNQDQG---------DL-SSLLSVNTDFGQLFASLSPSPL 108
Query: 273 E 273
Sbjct: 109 F 109
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 29.4 bits (65), Expect = 3.7
Identities = 24/90 (26%), Positives = 29/90 (32%), Gaps = 10/90 (11%)
Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLI--------PGITSPPGTLQGGITSPVSIIPGAT 193
SG SPL+ S A P+ P PP T P ++ P
Sbjct: 77 SGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPAT--APTCPPPAVCPAPA 134
Query: 194 SPPNVPAAITKLTPPGPLTPDKPPAPVESP 223
P T+ PP P P PAP P
Sbjct: 135 RPAPACPPSTRQCPPAPPLPTPKPAPAAKP 164
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 29.3 bits (66), Expect = 3.8
Identities = 20/90 (22%), Positives = 27/90 (30%), Gaps = 10/90 (11%)
Query: 137 VSGGGSGPSPLSTSIVSPPNTVA--PPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATS 194
SG GSG +P P T A P +TT P + + + VS T+
Sbjct: 53 YSGSGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT 112
Query: 195 PP------NVPAAI--TKLTPPGPLTPDKP 216
P + T G P
Sbjct: 113 SPSETDTEEATTTVSTETPTEGGSSAATDP 142
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 29.4 bits (66), Expect = 4.1
Identities = 16/83 (19%), Positives = 26/83 (31%), Gaps = 1/83 (1%)
Query: 141 GSGPSPLSTSIVSPPNTVAPPS-STTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVP 199
G S + V T P S S +P GI P + T P +S
Sbjct: 193 GRTNSFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEK 252
Query: 200 AAITKLTPPGPLTPDKPPAPVES 222
++ K + P+++
Sbjct: 253 SSAPKPRETLDPKSPEKAPPIDT 275
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 29.6 bits (66), Expect = 4.2
Identities = 21/82 (25%), Positives = 26/82 (31%), Gaps = 7/82 (8%)
Query: 143 GPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAI 202
P+P P + P T I I P P+ PG PP P A
Sbjct: 648 FPTPHQP----PQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPP--PRAP 701
Query: 203 TKLTPP-GPLTPDKPPAPVESP 223
T + PP P + PA
Sbjct: 702 TPMRPPAAPPGRAQRPAAATGR 723
Score = 29.6 bits (66), Expect = 4.7
Identities = 22/81 (27%), Positives = 29/81 (35%), Gaps = 4/81 (4%)
Query: 146 PLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKL 205
P + + + P AP + P + P G P + PG PP
Sbjct: 717 PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTP 776
Query: 206 TPPGPLTPDKPPAPVESPWGA 226
PP P PPAP + P GA
Sbjct: 777 QPP----PQAPPAPQQRPRGA 793
>gnl|CDD|178165 PLN02550, PLN02550, threonine dehydratase.
Length = 591
Score = 29.5 bits (66), Expect = 4.4
Identities = 37/139 (26%), Positives = 59/139 (42%), Gaps = 10/139 (7%)
Query: 156 NTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDK 215
++V P++ +PL I SP + G S IP A A T + P P P
Sbjct: 2 SSVGLPTAGSPLRSHIGSPSKPVVGSTPFSRSRIPAAVD----SADETSMAP--PPPPSP 55
Query: 216 PPAPVESPWG-AFINGWL-STKEKDLEVNNSNLVDLKKTLANMMSDSEKLYNVLGGPPLE 273
P SP + G+L + E+ E N ++ + + L N++ S K+Y+V PL+
Sbjct: 56 LPLLKVSPNSLQYPAGYLGAVPERTNEAENGSIPEAMEYLTNIL--SAKVYDVAIESPLQ 113
Query: 274 WAVPHASKLKYTQLFNTTD 292
A + +L L D
Sbjct: 114 LAKKLSERLGVKVLLKRED 132
>gnl|CDD|163669 cd08162, MPP_PhoA_N, Synechococcus sp. strain PCC 7942 PhoA and
related proteins, N-terminal metallophosphatase domain.
Synechococcus sp. strain PCC 7942 PhoA is a large
atypical alkaline phosphatase. It is known to be
transported across the inner cytoplasmic membrane and
into the periplasmic space. In vivo inactivation of the
gene encoding PhoA leads to a loss of extracellular,
phosphate-regulated phosphatase activity, but does not
appear to affect the cells capacity for phosphate
uptake. PhoA may play a role in scavenging phosphate
during growth of Synechococcus sp. strain PCC 7942 in
its natural environment. PhoA belongs to a domain
family which includes the bacterial enzyme UshA and
several other related enzymes including SoxB, CpdB,
YhcR, and CD73. All members have a similar domain
architecture which includes an N-terminal
metallophosphatase domain and a C-terminal nucleotidase
domain. The N-terminal metallophosphatase domain
belongs to a large superfamily of distantly related
metallophosphatases (MPPs) that includes:
Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
debranching enzymes, YfcE-like phosphodiesterases,
purple acid phosphatases (PAPs), YbbF-like
UDP-2,3-diacylglucosamine hydrolases, and acid
sphingomyelinases (ASMases). MPPs are functionally
diverse, but all share a conserved domain with an active
site consisting of two metal ions (usually manganese,
iron, or zinc) coordinated with octahedral geometry by a
cage of histidine, aspartate, and asparagine residues.
The conserved domain is a double beta-sheet sandwich
with a di-metal active site made up of residues located
at the C-terminal side of the sheets. This domain is
thought to allow for productive metal coordination.
Length = 313
Score = 28.8 bits (65), Expect = 5.5
Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 2/46 (4%)
Query: 164 TTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPG 209
TTP +P I+SP G + + + + +I LT G
Sbjct: 164 TTPTLPSISSPGGVT--VNPDDTADMQALAAVEQIQPSIDALTAQG 207
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.4 bits (65), Expect = 6.2
Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 7/69 (10%)
Query: 149 TSIVSPPNTVAPPSSTTPLIPGITSP---PGTLQGGITSPVSIIPG----ATSPPNVPAA 201
TS + A + P TSP + TSP I+ G ATSPP +
Sbjct: 489 TSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTS 548
Query: 202 ITKLTPPGP 210
+ T P
Sbjct: 549 VPNATSPQV 557
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 28.7 bits (64), Expect = 6.5
Identities = 26/91 (28%), Positives = 31/91 (34%), Gaps = 20/91 (21%)
Query: 153 SPPNTVAPPSSTTPLIPGIT---SPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPG 209
SPP +PP T P P + +PP G P PG + P P + P G
Sbjct: 10 SPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPP----PGRAAGPAAPRRRPRGCPAG 65
Query: 210 -----------PLTPDKPPAPVESP--WGAF 227
PL D PA P W F
Sbjct: 66 VTFSSSAPPRPPLGLDDAPAATPPPLDWTEF 96
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 29.0 bits (65), Expect = 6.6
Identities = 14/99 (14%), Positives = 23/99 (23%), Gaps = 6/99 (6%)
Query: 130 AKGGYLIVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSII 189
G + P+P + N + + P +
Sbjct: 342 LGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAG-----APLPDFD 396
Query: 190 PGATSPPNVPAAITKLTPPGPLTPDKPPAPVESPWGAFI 228
P PP P P + P PA + W +
Sbjct: 397 PRPRGPPA-PEPARSAEAPPLVAPAAAPAGLALRWRDVL 434
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 28.9 bits (65), Expect = 7.7
Identities = 20/107 (18%), Positives = 32/107 (29%), Gaps = 22/107 (20%)
Query: 20 IGQTSFEDKRKEN---YEKGQAE-LER--RRKALELPHQSSFNRALCATTPHHETSSSA- 72
I + ++D RKEN +E+ E LE+ RR+A E +S N +
Sbjct: 672 IARYGYKDVRKENHQAFEQLLIESLEKFIRREAQERALESDGNDDTDDEDSVTSSRVLIA 731
Query: 73 ---------------HNPGWDSTPGSSCSASHTSATNGPCHPSLTSP 104
+ P +S S + S
Sbjct: 732 PNGSVYSLGVPLLADYRLTSKPIPEASTSEEVSPVLPSSSMSSDEDQ 778
>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus. This family includes the
NUT protein. The gene encoding for NUT protein (Nuclear
Testis protein) is found fused to BRD3 or BRD4 genes, in
some aggressive types of carcinoma, due to chromosomal
translocations. Proteins of the BRD family contain two
bromodomains that bind transcriptionally active
chromatin through associations with acetylated histones
H3 and H4. Such proteins are crucial for the regulation
of cell cycle progression. On the other hand, little is
known about NUT protein. NUT is known to have a Nuclear
Export Sequence (NES) as well as a Nuclear Localization
Signal (NLS), both located towards the C-terminal end of
the protein. A fused NUT-GFP protein showed either
cytoplasmic or nuclear localization, suggesting that it
is subject to nuclear/cytoplasmic shuttling. Consistent
with this possibility, treatment with leptomycin B an
inhibitor of CRM1-dependent nuclear export resulted in
re-distribution of NUT-GFP to the nucleus. Inspection of
NUT revealed a C-terminal sequence similar to known
nuclear export sequences (NES) which are often regulated
by phosphorylation.
Length = 328
Score = 28.4 bits (63), Expect = 8.8
Identities = 27/99 (27%), Positives = 36/99 (36%), Gaps = 10/99 (10%)
Query: 135 LIVSGGGSGPSPLSTS--IVSPPNTVAP--PSSTTPLI----PGITSPPGTLQGGITSPV 186
L+ GGG G S IV P PS T L+ P + PGT GG+ P
Sbjct: 63 LVTEGGGPGLSGAGAGNVIVQVKTEGGPVEPSQTQNLVLTQTPLNWNAPGTPCGGVEGPP 122
Query: 187 SIIPGATSPPNVPAAITKLTPP--GPLTPDKPPAPVESP 223
+ + +PA ++ P G P P P
Sbjct: 123 PFVTASNVKTILPARAVGVSQPQEGTWILGLPLPPPPPP 161
>gnl|CDD|235577 PRK05718, PRK05718,
keto-hydroxyglutarate-aldolase/keto-deoxy-
phosphogluconate aldolase; Provisional.
Length = 212
Score = 27.9 bits (63), Expect = 9.7
Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
Query: 169 PGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
PG+T P L+ P+ +IPG ++P +
Sbjct: 94 PGLT--PPLLKAAQEGPIPLIPGVSTPSELMLG 124
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.132 0.403
Gapped
Lambda K H
0.267 0.0908 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,745,555
Number of extensions: 2359214
Number of successful extensions: 2603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2350
Number of HSP's successfully gapped: 189
Length of query: 487
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 386
Effective length of database: 6,457,848
Effective search space: 2492729328
Effective search space used: 2492729328
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 61 (27.0 bits)