RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14673
(912 letters)
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
SFB2/subunit SFB3 [Intracellular trafficking and
secretion].
Length = 861
Score = 389 bits (1001), Expect = e-121
Identities = 222/620 (35%), Positives = 346/620 (55%), Gaps = 17/620 (2%)
Query: 285 GDPYRRPEIQSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQL 344
D Y RPE++S V++LA +Y++R PP P +Y+FL+DVS A ++G + +L L
Sbjct: 248 SDRYSRPELKSGVVDFLAPKEYSLRQPP-PPVYVFLIDVSFEAIKNGLVKAAIRAILENL 306
Query: 345 KSMPG-DRRTSIAIITYDSAVHFYSLAEGQTQPSQMILTDIDDIFLPSPEN-ILVNLSEC 402
+P D RT IAII +DS++HF+ L+ +I++D+D+ FLP P ++ L C
Sbjct: 307 DQIPNFDPRTKIAIICFDSSLHFFKLSPD-LDEQMLIVSDLDEPFLPFPSGLFVLPLKSC 365
Query: 403 YDMIVDLLKQLPGKFKDSLDSGSALGPALQAAFKLLQPTGGRITIFQTCLPNRGPGALQS 462
+I LL ++P F+D+ +ALGPAL+AA L+ TGG+I +F + LPN G G LQ
Sbjct: 366 KQIIETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLSTLPNMGIGKLQL 425
Query: 463 RENPNQRSNDDPPHMNPATDFYKKIALECTSFYIAVDLFILNSQYVDLATLSGVSKFSSG 522
RE+ ++ FYK+ A+EC+ I+VDLF+ + Y+D+ATLS + +++ G
Sbjct: 426 REDKESS------LLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGG 479
Query: 523 CIHHIPLYSGKNMSHVYQLDRMFSRYLTRKIGFESVMRIRCTKGMSIHTFHGNFFVRSTD 582
+ P +S + +L +L+ +IG+E+VMR+RC+ G+ + +F+GNFF RS+D
Sbjct: 480 QTYFYPNFSATRPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSD 539
Query: 583 LLALPNVNPDAGYGIQISIDENLTHIQYACFQVAVLYTSSKGDRRIRVHTLCLPVVSNIN 642
L A + D ++ SIDE L FQVA+LYT + G+RRIRV L LP S+I
Sbjct: 540 LCAFSTMPRDTSLLVEFSIDEKLMT-SDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIR 598
Query: 643 DVLVGADQACIAGLLAKMAVDRSLQHSLSDAREAFLNVVCDVLSTYKMTQAGHLTGSLLA 702
+V ADQ IA +LAK A ++L SL +AR + D+L YK T + L
Sbjct: 599 EVYASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILKAYKKELVKSNTSTQLP 658
Query: 703 -PSSLKLLPLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSIYPELYPIHT 761
P++LKLLPL +LALLK AFR G ST D R+ A+ + LPL LM +IYP LY +H
Sbjct: 659 LPANLKLLPLLMLALLKSSAFRSG-STPSDIRISALNRLTSLPLKQLMRNIYPTLYALHD 717
Query: 762 LDQDPTIEWNELQVPIPRILQLSAERLQATGTYLLNMPDLIIILVRHGTSPAICSGLFGF 821
+ +E + +P + ++ L++ G YL++ I + P++ LFG
Sbjct: 718 M-PIEAGLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGV 776
Query: 822 QSVAQMPETLFELPQLETQLSLKLNSFIS--YLNDDKAYVSPVKVIKDSSPD-RMEFYDK 878
S++ +P F LP + + ++ + I +D + + V V P R+ F+
Sbjct: 777 DSLSDIPSGKFTLPPTGNEFNERVRNIIGELRSVNDDSTLPLVLVRGGGDPSLRLWFFST 836
Query: 879 LIEDKNAGGMAYYEFLLRIR 898
L+EDK +Y ++L +
Sbjct: 837 LVEDKTLNIPSYLDYLQILH 856
Score = 30.9 bits (70), Expect = 3.4
Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 7/59 (11%)
Query: 102 SQQTFRMPPNTYSQ-------SSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIP 153
SQ + P SQ S K+A + + + QQQ ++ I
Sbjct: 2 SQHKKGVYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQID 60
>gnl|CDD|238756 cd01479, Sec24-like, Sec24-like: Protein and membrane traffic in
eukaryotes is mediated by at least in part by the
budding and fusion of intracellular transport vesicles
that selectively carry cargo proteins and lipids from
donor to acceptor organelles. The two main classes of
vesicular carriers within the endocytic and the
biosynthetic pathways are COP- and clathrin-coated
vesicles. Formation of COPII vesicles requires the
ordered assembly of the coat built from several
cytosolic components GTPase Sar1, complexes of
Sec23-Sec24 and Sec13-Sec31. The process is initiated by
the conversion of GDP to GTP by the GTPase Sar1 which
then recruits the heterodimeric complex of Sec23 and
Sec24. This heterodimeric complex generates the
pre-budding complex. The final step leading to membrane
deformation and budding of COPII-coated vesicles is
carried by the heterodimeric complex Sec13-Sec31. The
members of this CD belong to the Sec23-like family. Sec
24 is very similar to Sec23. The Sec23 and Sec24
polypeptides fold into five distinct domains: a
beta-barrel, a zinc finger, a vWA or trunk, an all
helical region and a carboxy Gelsolin domain. The
members of this subgroup carry a partial MIDAS motif and
have the overall Para-Rossmann type fold that is
characteristic of this superfamily.
Length = 244
Score = 311 bits (800), Expect = e-100
Identities = 120/244 (49%), Positives = 168/244 (68%), Gaps = 2/244 (0%)
Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDR-RTSIAIITYDSAVHFYSLA 370
PQPA+Y+FL+DVS A +SG L T CE LL+ L ++PGD RT + IT+DS +HF++L
Sbjct: 1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDDPRTRVGFITFDSTLHFFNLK 60
Query: 371 EGQTQPSQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKDSLDSGSALGPA 430
QP M+++D+DD FLP P+ +LVNL E +I DLL Q+P F+D+ ++ SALGPA
Sbjct: 61 SSLEQPQMMVVSDLDDPFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPA 120
Query: 431 LQAAFKLLQPTGGRITIFQTCLPNRGPGALQSRENPNQRSNDD-PPHMNPATDFYKKIAL 489
LQAAF LL+ TGG+I +FQ+ LP G G L+SRE+P S D + P TDFYKK+AL
Sbjct: 121 LQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSREDPKLLSTDKEKQLLQPQTDFYKKLAL 180
Query: 490 ECTSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMSHVYQLDRMFSRYL 549
EC I+VDLF+ ++QYVD+ATL +S+ + G +++ P ++ + V +L +RYL
Sbjct: 181 ECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFNFSAPNDVEKLVNELARYL 240
Query: 550 TRKI 553
TRKI
Sbjct: 241 TRKI 244
>gnl|CDD|238745 cd01468, trunk_domain, trunk domain. COPII-coated vesicles carry
proteins from the endoplasmic reticulum to the Golgi
complex. This vesicular transport can be reconstituted
by using three cytosolic components containing five
proteins: the small GTPase Sar1p, the Sec23p/24p
complex, and the Sec13p/Sec31p complex. This domain is
known as the trunk domain and has an alpha/beta vWA fold
and forms the dimer interface. Some members of this
family possess a partial MIDAS motif that is a
characteristic feature of most vWA domain proteins.
Length = 239
Score = 255 bits (654), Expect = 7e-79
Identities = 103/230 (44%), Positives = 140/230 (60%), Gaps = 5/230 (2%)
Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLAE 371
PQP +++F++DVS A + G L + E LL L +PGD R + +ITYDS VHFY+L+
Sbjct: 1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSS 60
Query: 372 GQTQPSQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKD--SLDSGSALGP 429
QP +++D+ D+FLP P+ LV LSEC +I DLL+QLP F + LGP
Sbjct: 61 DLAQPKMYVVSDLKDVFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGP 120
Query: 430 ALQAAFKLLQPT--GGRITIFQTCLPNRGPGALQSRENPNQ-RSNDDPPHMNPATDFYKK 486
ALQAAF LL+ T GGRI +FQ LP GPG L+SRE+ RS+D+ + PAT FYK
Sbjct: 121 ALQAAFLLLKGTFAGGRIIVFQGGLPTVGPGKLKSREDKEPIRSHDEAQLLKPATKFYKS 180
Query: 487 IALECTSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMS 536
+A EC I VDLF + YVD+ATL ++K + G ++ + N
Sbjct: 181 LAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDG 230
>gnl|CDD|147125 pfam04811, Sec23_trunk, Sec23/Sec24 trunk domain. COPII-coated
vesicles carry proteins from the endoplasmic reticulum
to the Golgi complex. This vesicular transport can be
reconstituted by using three cytosolic components
containing five proteins: the small GTPase Sar1p, the
Sec23p/24p complex, and the Sec13p/Sec31p complex. This
domain is known as the trunk domain and has an
alpha/beta vWA fold and forms the dimer interface.
Length = 241
Score = 238 bits (609), Expect = 2e-72
Identities = 94/243 (38%), Positives = 140/243 (57%), Gaps = 7/243 (2%)
Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLAE 371
PQP ++LF++DVS A +SG L + E LL L +PGD R + IT+DS VHF++L+
Sbjct: 1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGDPRALVGFITFDSTVHFFNLSS 60
Query: 372 GQTQPSQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKDSLDSGSALGPAL 431
QP ++++D+ D+FLP P+ LV LSEC ++ DLL++LP F + LGPAL
Sbjct: 61 SLRQPKMLVVSDLQDMFLPLPDRFLVPLSECRFVLEDLLEELPRMFPVTKRPERCLGPAL 120
Query: 432 QAAFKLLQ--PTGGRITIFQTCLPNRGP-GALQSRENPNQR--SNDDPPHMNPATDFYKK 486
QAA LL+ TGG+I +FQ LP GP G L+SR + + + + A FYK
Sbjct: 121 QAAVLLLKAAFTGGKIMLFQGGLPTVGPGGKLKSRLDESHHDTDKEKAKLVKKADKFYKS 180
Query: 487 IALECTSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMSHVYQLDRMFS 546
+A EC + +VDLF + YVD+A L +S+ + G ++ P + + ++ D
Sbjct: 181 LAKECVAQGHSVDLFAFSLDYVDVAELGCLSRLTGGQVYLYPSFQADVDTSKFKQD--LQ 238
Query: 547 RYL 549
RY
Sbjct: 239 RYF 241
>gnl|CDD|219707 pfam08033, Sec23_BS, Sec23/Sec24 beta-sandwich domain.
Length = 86
Score = 117 bits (295), Expect = 2e-31
Identities = 41/86 (47%), Positives = 55/86 (63%), Gaps = 1/86 (1%)
Query: 554 GFESVMRIRCTKGMSIHTFHGNFFVRS-TDLLALPNVNPDAGYGIQISIDENLTHIQYAC 612
GF +V+R+R +KG+ + F GNFF RS D LP+++PD Y + IDE L + A
Sbjct: 1 GFNAVLRVRTSKGLKVSGFIGNFFSRSSGDTWKLPSLDPDTSYAFEFDIDEPLDSGKQAY 60
Query: 613 FQVAVLYTSSKGDRRIRVHTLCLPVV 638
Q A+LYT S G+RRIRV T+ LPV
Sbjct: 61 IQFALLYTHSSGERRIRVTTVALPVT 86
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 132 bits (332), Expect = 3e-31
Identities = 126/512 (24%), Positives = 247/512 (48%), Gaps = 41/512 (8%)
Query: 314 PAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLAEGQ 373
P ++F+++ S A + YT+ E + ++++ + T IAIIT++S+++FY G+
Sbjct: 952 PPYFVFVVECSYNAIYNNITYTILEGIRYAVQNVKCPQ-TKIAIITFNSSIYFYHCKGGK 1010
Query: 374 TQPSQ-------------MILTDIDDIFLPSP-ENILVNLSECYDMIVDLLKQLPGKFKD 419
+ ++++D+DD FLP P E++ E D I L+ +
Sbjct: 1011 GVSGEEGDGGGGSGNHQVIVMSDVDDPFLPLPLEDLFFGCVEEIDKINTLIDTIKSVSTT 1070
Query: 420 SLDSGSALGPALQAAFKLLQPTGG--RITIFQTCLPNRGPGALQSRENPNQRSNDDPPHM 477
GS AL+ A +L+ G I +F T PN G GA++ + Q + +
Sbjct: 1071 MQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNCGIGAIKELKKDLQENFLEVKQ- 1129
Query: 478 NPATDFYKKIALECTSFYIAVDLFILNSQYVDLA--TLSGVSKFSSGCIHHIPLYS-GKN 534
FY + L+ +F I+VD+FI++S V + +L V++ + G I + + K+
Sbjct: 1130 ---KIFYDSLLLDLYAFNISVDIFIISSNNVRVCVPSLQYVAQNTGGKILFVENFLWQKD 1186
Query: 535 MSHVYQLDRMFSRYLTRKIGFESVMRIRCTKGMSIHTF---HGNF-FVRSTDLLALPNVN 590
+Y + + I + +++R + MS+ + NF + S D + +P +
Sbjct: 1187 YKEIYM--NIMDTLTSEDIAYCCELKLRYSHHMSVKKLFCCNNNFNSIISVDTIKIPKIR 1244
Query: 591 PDAGYGIQISIDENLTHIQYACFQVAVLYTSSKGDRRIRVHTLCLPVVSNINDVLVGADQ 650
D + ++ + + FQ A +YT+ GDR +R+HT + + S+++ V D
Sbjct: 1245 HDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLTSSLSTVFRYTDA 1304
Query: 651 ACIAGLLAKMAVDRSLQHSLSDAREAFLNVVCDVLSTYKMTQAGHL-TGSLLAPSSLKLL 709
+ +L K L + + + ++ + +L +Y++ A +G L+ P +LKLL
Sbjct: 1305 EALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDTLKLL 1362
Query: 710 PLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSIYPELYPIHTLDQDPTIE 769
PL+ +LLKH + + D +++++ ++ +P+ S +L +YP +Y IH + T E
Sbjct: 1363 PLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIHI--KGKTNE 1418
Query: 770 WNELQVP----IPRILQLSAERLQATGTYLLN 797
+ + V IP+ + SAE++ + G YLL+
Sbjct: 1419 IDSMDVDDDLFIPKTIPSSAEKIYSNGIYLLD 1450
>gnl|CDD|218277 pfam04815, Sec23_helical, Sec23/Sec24 helical domain. COPII-coated
vesicles carry proteins from the endoplasmic reticulum
to the Golgi complex. This vesicular transport can be
reconstituted by using three cytosolic components
containing five proteins: the small GTPase Sar1p, the
Sec23p/24p complex, and the Sec13p/Sec31p complex. This
domain is composed of five alpha helices.
Length = 103
Score = 99.1 bits (248), Expect = 8e-25
Identities = 47/104 (45%), Positives = 64/104 (61%), Gaps = 2/104 (1%)
Query: 650 QACIAGLLAKMAVDRSLQHSLSDAREAFLNVVCDVLSTY-KMTQAGHLTGSLLAPSSLKL 708
Q IA LLAK AV+++L SL DAR + D+L+ Y K + +G L+ P SLKL
Sbjct: 1 QEAIAVLLAKKAVEKALTSSLKDARRWLDKKLVDILAAYRKYCASSSSSGQLILPESLKL 60
Query: 709 LPLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSI 752
LPLY+LALLK PA R G + D+R +A + LP+ +L+L I
Sbjct: 61 LPLYMLALLKSPALR-GGNVSPDERAYARCLLLSLPVENLLLMI 103
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 63.6 bits (155), Expect = 3e-10
Identities = 40/202 (19%), Positives = 50/202 (24%), Gaps = 7/202 (3%)
Query: 28 SNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPV 87
P P P + Q + + Q P
Sbjct: 135 QPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQ-----PPQQVLPQGMPPRQA 189
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQ--QQQQQQ 145
Q P +Q + Q P Q AP P L QQ QQ Q
Sbjct: 190 AFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFP 249
Query: 146 QQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQ 205
QQ P P P QQ Q N P G +P Q
Sbjct: 250 GLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQ 309
Query: 206 QQQQQQQQQQQQQQTLNSQFSN 227
Q Q QQ+ Q ++Q +
Sbjct: 310 QPQGQQRGPQFREQLVQLSQQQ 331
Score = 62.5 bits (152), Expect = 8e-10
Identities = 40/197 (20%), Positives = 53/197 (26%), Gaps = 21/197 (10%)
Query: 28 SNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPV 87
P PP L +P ++ P Q P
Sbjct: 165 QRQQAPQLPQPPQQVLPQGMPPRQAAFPQ------QGPPEQPPGYPQPPQ---GHPEQVQ 215
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
PQQF Q + +PP + P P QQ Q
Sbjct: 216 PQQFLPAPSQ----APAQPPLPPQ---LPQQPPPLQQPQFPGL-----SQQMPPPPPQPP 263
Query: 148 QQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQ 207
QQQ P P QN P N +PP QP P V +++Q
Sbjct: 264 QQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQ 323
Query: 208 QQQQQQQQQQQQTLNSQ 224
Q QQQ++ +
Sbjct: 324 LVQLSQQQREALSQEEA 340
Score = 47.1 bits (112), Expect = 4e-05
Identities = 33/236 (13%), Positives = 45/236 (19%), Gaps = 59/236 (25%)
Query: 71 NRFQAPGATINSLSTPVPQQFQRPQQQNA----------------NHSQQTFRMPPNTYS 114
+ S T Q + + + + P S
Sbjct: 83 GAPSVGPDSDLSQKTSTFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPPQAPES 142
Query: 115 QSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI---------PNIPPLPNSQQNS 165
Q P L+ + Q QQ+QQ Q Q P
Sbjct: 143 QPQPQTP---AQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQ 199
Query: 166 SNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPS-------LNQAGYQQQQQQQQQQQQQQ 218
P + QV P+ PA P Q QQ
Sbjct: 200 PPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPP 259
Query: 219 QTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPATKVEPPPIRLPQNLPEIHEQ 274
Q Q PPP P P + +
Sbjct: 260 PQPPQQQQQPPQPQAQ------------------------PPPQNQPTPHPGLPQG 291
Score = 43.6 bits (103), Expect = 5e-04
Identities = 27/149 (18%), Positives = 41/149 (27%), Gaps = 14/149 (9%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAG 195
Q + + IP++ P P+ + +P Q P P+
Sbjct: 100 FSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPP-----QAPESQPQPQTPAQKML 154
Query: 196 YPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVN---------QGMNKMWGYEPHDLL 246
+A QQ+QQ Q Q QQ L + +
Sbjct: 155 SLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQV 214
Query: 247 NTRDILPATKVEPPPIRLPQNLPEIHEQF 275
+ LPA P LP LP+
Sbjct: 215 QPQQFLPAPSQAPAQPPLPPQLPQQPPPL 243
Score = 37.1 bits (86), Expect = 0.055
Identities = 27/130 (20%), Positives = 35/130 (26%), Gaps = 4/130 (3%)
Query: 21 MQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATI 80
P + A PP+ PQ P Q P S M PP + Q
Sbjct: 213 QVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQ--PP 270
Query: 81 NSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQ--QQ 138
+ P PQ P P P P L Q QQ
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQ 330
Query: 139 QQQQQQQQQQ 148
Q++ Q++
Sbjct: 331 QREALSQEEA 340
Score = 32.8 bits (75), Expect = 1.1
Identities = 22/130 (16%), Positives = 32/130 (24%), Gaps = 20/130 (15%)
Query: 22 QVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATIN 81
Q P P Q PP+ +S P + P P
Sbjct: 225 QAPAQPPLPPQLPQQPPPL-QQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP- 282
Query: 82 SLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQ 141
+ +PQ P P Q + P Q ++Q
Sbjct: 283 TPHPGLPQGQNAPLPPPQQPQLLPLVQQP----QGQQRGP--------------QFREQL 324
Query: 142 QQQQQQQQQP 151
Q QQQ++
Sbjct: 325 VQLSQQQREA 334
>gnl|CDD|215083 PLN00162, PLN00162, transport protein sec23; Provisional.
Length = 761
Score = 54.9 bits (133), Expect = 1e-07
Identities = 54/250 (21%), Positives = 86/250 (34%), Gaps = 49/250 (19%)
Query: 294 QSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRT 353
Q TVEY T P P +++F++D I + G L + + L P +
Sbjct: 106 QYTTVEY--TLPPGSGGAPSPPVFVFVVDTCMIEEELGALKSALLQAIALL---PENAL- 159
Query: 354 SIAIITYDSAVHFYSLAEG-----------------------------QTQPSQMILTDI 384
+ +IT+ + VH + L + I
Sbjct: 160 -VGLITFGTHVHVHELGFSECSKSYVFRGNKEVSKDQILEQLGLGGKKRRPAGGGIAGAR 218
Query: 385 DDIFLPSPENILVNLSEC---YDMIVDLLKQLPGKFKDSLDSGSALGPALQAAFKLLQP- 440
D + L+ SEC + ++ L++ P G AL A LL
Sbjct: 219 DGLSSSGVNRFLLPASECEFTLNSALEELQKDPWPVPPGHRPARCTGAALSVAAGLLGAC 278
Query: 441 ---TGGRITIFQTCLPNRGPGALQSRENPNQ-RS-----NDDPPHMNPATDFYKKIALEC 491
TG RI F GPGA+ S++ RS D P+ A FY+ +A +
Sbjct: 279 VPGTGARIMAFVGGPCTEGPGAIVSKDLSEPIRSHKDLDKDAAPYYKKAVKFYEGLAKQL 338
Query: 492 TSFYIAVDLF 501
+ +D+F
Sbjct: 339 VAQGHVLDVF 348
Score = 32.6 bits (75), Expect = 1.1
Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 613 FQVAVLYTSSKGDRRIRVHTLCLPVV--SNINDVLVGADQACIAGLLAKMAVDR 664
Q Y S G R+RV T+ V S+ +++ G DQ A ++A++A +
Sbjct: 480 LQFLTRYQHSNGQTRLRVTTVTRRWVEGSSSEELVAGFDQEAAAVVMARLASHK 533
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 52.3 bits (125), Expect = 9e-07
Identities = 41/201 (20%), Positives = 55/201 (27%), Gaps = 21/201 (10%)
Query: 52 SSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQ-------------QQN 98
++ Q L M P+ + T P+ RP Q+
Sbjct: 57 AAQQQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQS 116
Query: 99 ANHSQQ----TFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPN 154
N Q M P+ S+ P + +S Q QQ Q QQ Q
Sbjct: 117 LNVRGQMPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQ 176
Query: 155 IPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQ 214
+ QQ Q + P G + Q Q QQQ Q
Sbjct: 177 AGGMNQGQQGPVGQQQPPQMG----QPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQN 232
Query: 215 QQQQQTLNSQFSNLSVNQGMN 235
QQQQQ Q + Q
Sbjct: 233 QQQQQMDQQQGPADAQAQMGQ 253
Score = 51.9 bits (124), Expect = 1e-06
Identities = 56/258 (21%), Positives = 67/258 (25%), Gaps = 45/258 (17%)
Query: 21 MQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAP---- 76
M + P Q S G Q + SS Q N M P + QA
Sbjct: 123 MPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQ 182
Query: 77 GATINSLSTPVPQQFQ--RPQQQNANHSQQTFRMPPNTYSQSSKTAPFNN---------G 125
G PQ Q P QQ + N
Sbjct: 183 GQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQ 242
Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPN------IPPLPNSQQNSSNFQPS-------- 171
A + QQQQ Q Q QQ Q PP QQ+ P+
Sbjct: 243 GPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGG 302
Query: 172 --NNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQ--------------QQQQQQQQQQ 215
+ PP+ A + AG Q Q QQQQQQQ
Sbjct: 303 GQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQG 362
Query: 216 QQQQTLNSQFSNLSVNQG 233
+ Q N V QG
Sbjct: 363 GNHPAAHQQQMNQQVGQG 380
Score = 49.2 bits (117), Expect = 8e-06
Identities = 51/264 (19%), Positives = 66/264 (25%), Gaps = 36/264 (13%)
Query: 15 GSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQ 74
G A +MQ QP+ P Q Q N Q P+G +
Sbjct: 145 GQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGG 204
Query: 75 APGATINSLSTPVPQQFQRPQQQNANHSQQTFRM-----PPNTYSQSSKTAPFNNGLTPL 129
+ P QQ Q PQ Q +QQ +M P + +Q + G+ P
Sbjct: 205 GGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQ 264
Query: 130 TNASLYQQQQQQQQQQQQQQQPIP---------------------NIPPLPNSQQNSSNF 168
Q QQQ QQQ Q + P P
Sbjct: 265 QMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQG 324
Query: 169 QPS----------NNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQ 218
+ L R Q + A QQ QQ Q Q
Sbjct: 325 GQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMV 384
Query: 219 QTLNSQFSNLSVNQGMNKMWGYEP 242
G N M +P
Sbjct: 385 ALGYLNIQGNQGGLGANPMQQGQP 408
Score = 48.5 bits (115), Expect = 2e-05
Identities = 57/304 (18%), Positives = 78/304 (25%), Gaps = 5/304 (1%)
Query: 30 PSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQ 89
Q G S L + V Q P+G++ M + PG +
Sbjct: 100 MGQQMGGPGTASNLLQSLNVRG---QMPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSG 156
Query: 90 QFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQ 149
Q + QQ N QQ + P P Q Q QQQ
Sbjct: 157 Q-PQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPG-MPGGGGQGQMQQQG 214
Query: 150 QPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQ 209
QP P QQ N Q +AQ G G G Q Q
Sbjct: 215 QPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVP 274
Query: 210 QQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPATKVEPPPIRLPQNLP 269
QQQ QQQ + + Q M P + P + +
Sbjct: 275 MQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMS 334
Query: 270 EIHEQFLYDRNTEMFGDPYRRPEIQSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQ 329
+Q + M G + + Q + A + + L I
Sbjct: 335 AGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQGN 394
Query: 330 SGYL 333
G L
Sbjct: 395 QGGL 398
Score = 43.5 bits (102), Expect = 6e-04
Identities = 42/232 (18%), Positives = 60/232 (25%), Gaps = 12/232 (5%)
Query: 2 QPLVSNKQGPNSYGSASNVMQVPLIASN-PSQPSGAAPPISGLQPQVPVSNSSLQTPLGS 60
QP +Q P N Q + P+ Q + +
Sbjct: 215 QPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQ-GGQMQV 273
Query: 61 SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQR-PQQQNANHSQQTFRMPPNTYS--QSS 117
+PP + G N + +P Q P Q Q+ +P + Q
Sbjct: 274 PMQQQPPQQQPQQSQLGMLPNQMQQ-MPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGV 332
Query: 118 KTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI-----PNIPPLPNSQQNSSNFQPSN 172
+A QQQ QQQQQQQ P N Q + +
Sbjct: 333 MSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQ 392
Query: 173 NLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
+ P Q G + + P Q Q Q SQ
Sbjct: 393 GNQGGLGANP-MQQGQPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQ 443
Score = 38.8 bits (90), Expect = 0.014
Identities = 41/229 (17%), Positives = 58/229 (25%), Gaps = 9/229 (3%)
Query: 1 MQPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQP--QVPVSNSSLQTPL 58
M + G MQVP+ P Q + Q+P
Sbjct: 251 MGQQQQGQGGMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQP 310
Query: 59 GSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRP-----QQQNANHSQQTFRMPPNTY 113
RP V QA + S +Q + QQ QQ P
Sbjct: 311 MGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQ 370
Query: 114 SQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNN 173
Q ++ + L ++ Q QQ QP P P Q ++ P
Sbjct: 371 QQMNQQVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGMMSSPSPVPQVQTNQSMPQPP 430
Query: 174 LYNNIQ-VPPRAQPGNY-PSVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
+ P +QP P Q Q Q+ Q
Sbjct: 431 QPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQ 479
>gnl|CDD|227380 COG5047, SEC23, Vesicle coat complex COPII, subunit SEC23
[Intracellular trafficking and secretion].
Length = 755
Score = 51.4 bits (123), Expect = 2e-06
Identities = 81/443 (18%), Positives = 158/443 (35%), Gaps = 88/443 (19%)
Query: 291 PEIQSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGD 350
QS+T+EY + P ++ F++D + L + L+ L +P +
Sbjct: 104 LLPQSSTIEYT-----LSKPVILPPVFFFVVDACCDEEELTALK---DSLIVSLSLLPPE 155
Query: 351 RRTSIAIITYDSAVHFYSLA----------EGQTQPSQMILTD----------------I 384
+ +ITY +++ + L G + ++ L + I
Sbjct: 156 --ALVGLITYGTSIQVHELNAENHRRSYVFSGNKEYTKENLQELLALSKPTKSGGFESKI 213
Query: 385 DDIFLPSPENILVNLSECYDMIVDLLKQL-------PGKFKDSLDSGSALGPALQAAFKL 437
I + L+ +C ++++L+QL P + +GSAL A +
Sbjct: 214 SGIGQFASSRFLLPTQQCEFKLLNILEQLQPDPWPVPAGKRPLRCTGSALNIASSLLEQC 273
Query: 438 LQPTGGRITIFQTCLPNRGPGALQSRE------NPNQRSNDDPPHMNPATDFYKKIALEC 491
G I +F GPG + S E + + +D H AT FYK +A
Sbjct: 274 FPNAGCHIVLFAGGPCTVGPGTVVSTELKEPMRSHHDIESDSAQHSKKATKFYKGLAERV 333
Query: 492 TSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMSHVYQLDRM---FSRY 548
+ A+D+F + + + ++ + G + +S + F R
Sbjct: 334 ANQGHALDIFAGCLDQIGIMEMEPLTTSTGGAL---------VLSDSFTTSIFKQSFQRI 384
Query: 549 LTR------KIGFESVMRIRCTKGMSIHTFHGN---------------FFVRSTDLLALP 587
R K+GF + M ++ +K + I G+ + +T+ +
Sbjct: 385 FNRDSEGYLKMGFNANMEVKTSKNLKIKGLIGHAVSVKKKANNISDSEIGIGATNSWKMA 444
Query: 588 NVNPDAGYGI--QISIDENLTHIQYACF---QVAVLYTSSKGDRRIRVHTLCLPVVSNIN 642
+++P + Y + +I++ Q Q Y S G RIRV T+
Sbjct: 445 SLSPKSNYALYFEIALGAASGSAQRPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDGGL 504
Query: 643 DVLVGA-DQACIAGLLAKMAVDR 664
+ + DQ A +A++A +
Sbjct: 505 PKINRSFDQEAAAVFMARIAAFK 527
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 47.7 bits (113), Expect = 2e-05
Identities = 52/257 (20%), Positives = 83/257 (32%), Gaps = 36/257 (14%)
Query: 6 SNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQV-----PVSNSSLQTPLGS 60
SN + S +N A+ + + A+ P + +S ++ TP +
Sbjct: 135 SNGSDAATTSSTANTPA----AAKVLKANAASAPNTTTGVGSAATTAAISATTATTPTTT 190
Query: 61 SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQS---- 116
R P T + A S Q Q + Q M + +
Sbjct: 191 QKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIPNPTPLMQ 250
Query: 117 --SKTAPFNNGLTPLTNASLYQQ-QQQQQQQQQQQQQPIPNIPPLPNSQQNSS---NFQP 170
+ +P N +PL N S + Q Q Q P N + N + +S P
Sbjct: 251 LLNGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENSMNQGMTP 310
Query: 171 SNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSV 230
S ++ N + P LN + Q Q QQ Q N Q ++
Sbjct: 311 SASMINLNNITPAN-------------ILNMSLNLAFDLNQNQTPQQLQPQNQQ----NM 353
Query: 231 NQGMNKMWGYEPHDLLN 247
N GMN + ++ DL N
Sbjct: 354 NMGMNDVNNFDLLDLNN 370
>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3. The
C.elegans Notch pathway, involved in the control of
growth, differentiation and patterning in animal
development, relies on either of the receptors GLP-1 or
LIN-12. Both these receptors promote signalling by the
recruitment of LAG-3 to target promoters, where it then
acts as a transcriptional activator. LAG-3 works as a
ternary complex together with the DNA binding protein,
LAG-1.
Length = 476
Score = 45.7 bits (107), Expect = 9e-05
Identities = 36/143 (25%), Positives = 51/143 (35%), Gaps = 7/143 (4%)
Query: 84 STPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTN-----ASLYQQQ 138
P + + P + + Q+ P ++ A N + L + QQ+
Sbjct: 290 MEPEGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRIEQQR 349
Query: 139 QQQ--QQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGY 196
QQQ Q QQQQQQ L QQ Q + + Q +A +
Sbjct: 350 QQQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNGGGQFATQAHQHAAYLQQMQH 409
Query: 197 PSLNQAGYQQQQQQQQQQQQQQQ 219
L + QQQQ Q QQ QQQ
Sbjct: 410 MRLQEQIQHQQQQAQHHQQAQQQ 432
Score = 31.1 bits (69), Expect = 3.5
Identities = 38/190 (20%), Positives = 58/190 (30%), Gaps = 14/190 (7%)
Query: 10 GPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLV 69
GP++Y + L A P + +P +G + P + + +
Sbjct: 276 GPDTY----DTFLAELDAMEPEGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNK 331
Query: 70 TNRFQAPGATINSLSTPVPQQF--QRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNG-- 125
Q + + QQ Q QQQ H QQ + + NG
Sbjct: 332 MRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNGGG 391
Query: 126 --LTPLTNASLYQQQQQ----QQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ 179
T + Y QQ Q Q+Q Q QQQQ + QQ + + Q + N
Sbjct: 392 QFATQAHQHAAYLQQMQHMRLQEQIQHQQQQAQHHQQAQQQHQQPAQHGQMGYGIPNGYP 451
Query: 180 VPPRAQPGNY 189
Y
Sbjct: 452 AHMHGHAPAY 461
Score = 29.9 bits (66), Expect = 6.7
Identities = 17/39 (43%), Positives = 21/39 (53%)
Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWG 239
Q Q QQQQQQ+ QQQQ L Q + Q ++M G
Sbjct: 351 QQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNG 389
>gnl|CDD|238755 cd01478, Sec23-like, Sec23-like: Protein and membrane traffic in
eukaryotes is mediated by at least in part by the
budding and fusion of intracellular transport vesicles
that selectively carry cargo proteins and lipids from
donor to acceptor organelles. The two main classes of
vesicular carriers within the endocytic and the
biosynthetic pathways are COP- and clathrin-coated
vesicles. Formation of COPII vesicles requires the
ordered assembly of the coat built from several
cytosolic components GTPase Sar1, complexes of
Sec23-Sec24 and Sec13-Sec31. The process is initiated by
the conversion of GDP to GTP by the GTPase Sar1 which
then recruits the heterodimeric complex of Sec23 and
Sec24. This heterodimeric complex generates the
pre-budding complex. The final step leading to membrane
deformation and budding of COPII-coated vesicles is
carried by the heterodimeric complex Sec13-Sec31. The
members of this CD belong to the Sec23-like family. Sec
23 is very similar to Sec24. The Sec23 and Sec24
polypeptides fold into five distinct domains: a
beta-barrel, a zinc finger, a vWA or trunk, an all
helical region and a carboxy Gelsolin domain. The
members of this subgroup lack the consensus MIDAS motif
but have the overall Para-Rossmann type fold that is
characteristic of this superfamily.
Length = 267
Score = 45.1 bits (107), Expect = 9e-05
Identities = 55/236 (23%), Positives = 89/236 (37%), Gaps = 55/236 (23%)
Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLA- 370
P ++LF++D + L E L+ L +P + + +IT+ + V + L
Sbjct: 1 TSPPVFLFVVDTCMDEEELDALK---ESLIMSLSLLPPN--ALVGLITFGTMVQVHELGF 55
Query: 371 ---------EGQTQPSQMILTDIDDIFLPS-------------------PENILVNLSEC 402
G + + D+ + P+ L+ +S+C
Sbjct: 56 EECSKSYVFRGNKDYTAKQIQDMLGLGGPAMRPSASQHPGAGNPLPSAAASRFLLPVSQC 115
Query: 403 YDMIVDLLKQL-------PGKFKDSLDSGSALGPA---LQAAFKLLQPTGGRITIFQTCL 452
+ DLL+QL P + +G AL A L+A F TG RI +F
Sbjct: 116 EFTLTDLLEQLQPDPWPVPAGHRPLRCTGVALSIAVGLLEACFP---NTGARIMLFAGGP 172
Query: 453 PNRGPGALQSRE--NPNQRS-----NDDPPHMNPATDFYKKIALECTSFYIAVDLF 501
GPGA+ S E +P RS D+ + A FY +A + AVD+F
Sbjct: 173 CTVGPGAVVSTELKDP-IRSHHDIDKDNAKYYKKAVKFYDSLAKRLAANGHAVDIF 227
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 45.1 bits (107), Expect = 1e-04
Identities = 25/147 (17%), Positives = 37/147 (25%), Gaps = 8/147 (5%)
Query: 81 NSLSTPVPQQFQRPQQQNANHSQQTFRMP-PNTYSQSSKTAPFNNGLTPLTNASLYQQQQ 139
PV Q Q Q + Q ++ + + N T A
Sbjct: 53 PQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPA-----GP 107
Query: 140 QQQQQQQQQQQPIPNIPPLPNSQQ--NSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYP 197
Q + Q P P+ +Q NS QP+ P +
Sbjct: 108 AGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQS 167
Query: 198 SLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
+ Q QQQ + QQ
Sbjct: 168 AQKNDESQLQQQPNGETPPQQTDGAGD 194
Score = 44.7 bits (106), Expect = 1e-04
Identities = 23/143 (16%), Positives = 36/143 (25%), Gaps = 14/143 (9%)
Query: 93 RPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI 152
P Q Q + P + A P + + +
Sbjct: 50 DPSPQAPPPVAQLPQPLPQPPPTQALQAL------PAGDQQQHNTPTGSPAANPPATFAL 103
Query: 153 PNIPPLPNSQQNSSNF--------QPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGY 204
P P P Q N + + P + + G P+ Q
Sbjct: 104 PAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPS 163
Query: 205 QQQQQQQQQQQQQQQTLNSQFSN 227
QQQ Q+ + Q QQ N +
Sbjct: 164 QQQSAQKNDESQLQQQPNGETPP 186
Score = 44.0 bits (104), Expect = 2e-04
Identities = 36/208 (17%), Positives = 52/208 (25%), Gaps = 34/208 (16%)
Query: 27 ASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTP 86
A P PS APP PQ Q P +TP
Sbjct: 45 AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQ---------------ALPAGDQQQHNTP 89
Query: 87 VPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQ 146
A + TF +P + +T P + + Q Q
Sbjct: 90 --------TGSPAANPPATFALPAGPAGPTIQTEPGQLYPVQVPV-MVTQNPANSPLDQP 140
Query: 147 QQQ-QPIPNIPPL--PNSQQNSSNFQPSNNLY-------NNIQVPPRAQPGNYPSVPAGY 196
QQ P S Q S Q + N + PP+ G
Sbjct: 141 AQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDESEAL 200
Query: 197 PSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
L +A +Q+ + + + Q
Sbjct: 201 VRLREADGTLEQRIKGAEGGGAMKVLKQ 228
Score = 41.7 bits (98), Expect = 0.001
Identities = 23/127 (18%), Positives = 31/127 (24%), Gaps = 6/127 (4%)
Query: 128 PLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPG 187
P A L Q Q Q Q P + + + P+ P Q
Sbjct: 56 PPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQTE 115
Query: 188 NYPSVPAGYPSLN--QAGYQQQQQQQQ----QQQQQQQTLNSQFSNLSVNQGMNKMWGYE 241
P P + Q Q QQ QQ+ + S Q K +
Sbjct: 116 PGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQ 175
Query: 242 PHDLLNT 248
N
Sbjct: 176 LQQQPNG 182
Score = 37.0 bits (86), Expect = 0.038
Identities = 31/228 (13%), Positives = 53/228 (23%), Gaps = 57/228 (25%)
Query: 2 QPLVSNKQGPNSY-----GSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQT 56
L P A + Q +P+ A + +QT
Sbjct: 60 AQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT-----IQT 114
Query: 57 PLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQS 116
G ++ P++ + A QQ Q+ A+ Q
Sbjct: 115 EPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPAS-------------GQL 161
Query: 117 SKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYN 176
QQQ Q+ + Q Q PN P Q + + S L
Sbjct: 162 P------------------SQQQSAQKNDESQLQQQPN-GETPPQQTDGAGDDESEALVR 202
Query: 177 NIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
+ +L Q + + +Q +
Sbjct: 203 LREAD---------------GTLEQRIKGAEGGGAMKVLKQPKKQAKS 235
Score = 31.3 bits (71), Expect = 2.2
Identities = 25/158 (15%), Positives = 42/158 (26%), Gaps = 14/158 (8%)
Query: 1 MQPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGS 60
P S P + + P I + P Q P+ Q +NS L P
Sbjct: 87 NTPTGSPAANPPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQ---NPANSPLDQP--- 140
Query: 61 SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPN----TYSQS 116
+ + R+ AP S P QQ + ++ Q PP
Sbjct: 141 AQQRALQQLQQRYGAPA----SGQLPSQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDE 196
Query: 117 SKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPN 154
S+ + + + +Q +
Sbjct: 197 SEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAK 234
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 45.8 bits (108), Expect = 1e-04
Identities = 41/141 (29%), Positives = 54/141 (38%), Gaps = 12/141 (8%)
Query: 121 PFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQV 180
P P+ YQQ QQ Q Q QQP + P P QQ P Y Q
Sbjct: 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQ 809
Query: 181 PPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGY 240
P QP Y Q Q Q QQ QQ Q ++ L + G ++ +
Sbjct: 810 PVAPQP--------QYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPL-H 860
Query: 241 EPHDLLNTRDIL--PATKVEP 259
+P L + D+L P ++VEP
Sbjct: 861 KPTTPLPSLDLLTPPPSEVEP 881
Score = 40.1 bits (93), Expect = 0.006
Identities = 24/89 (26%), Positives = 32/89 (35%), Gaps = 1/89 (1%)
Query: 86 PVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQ 145
PV Q Q Q Q Q ++ P + + + P QQ Q Q
Sbjct: 758 PVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY 817
Query: 146 QQQQQPIPNIPPLPNSQQNSSNFQPSNNL 174
QQ QQP+ P QQ + QP + L
Sbjct: 818 QQPQQPVAPQPQYQQPQQPVAP-QPQDTL 845
Score = 39.3 bits (91), Expect = 0.011
Identities = 39/189 (20%), Positives = 56/189 (29%), Gaps = 23/189 (12%)
Query: 24 PLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAP-----GA 78
P+IA P G QP V N LQ P+ P Q P
Sbjct: 371 PVIAP---APEGYPQQSQYAQPAVQ-YNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPE 426
Query: 79 TINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQ 138
P Q +Q P + Q+ +T P LYQQ
Sbjct: 427 QPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQ-----QPAAQEPLYQQP 481
Query: 139 QQQQQQQQQQQQPI-----PNIPPL----PNSQQNSSNFQPSNNLYNNIQVPPRAQPGNY 189
Q +QQ + +P+ P PPL ++ + + Y I P +
Sbjct: 482 QPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIK 541
Query: 190 PSVPAGYPS 198
S+ A +
Sbjct: 542 SSLKAPSVA 550
Score = 34.7 bits (79), Expect = 0.28
Identities = 27/88 (30%), Positives = 33/88 (37%), Gaps = 4/88 (4%)
Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
A L +Q Q QQQ+ +QP P + F P L ++ P P P
Sbjct: 697 AELARQFAQTQQQRYSGEQPAGANP----FSLDDFEFSPMKALLDDGPHEPLFTPIVEPV 752
Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
P Q YQQ QQ Q Q QQ
Sbjct: 753 QQPQQPVAPQQQYQQPQQPVAPQPQYQQ 780
Score = 34.3 bits (78), Expect = 0.39
Identities = 25/129 (19%), Positives = 41/129 (31%), Gaps = 3/129 (2%)
Query: 135 YQQQQQQQQQQQQQQQPIPN-IPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVP 193
Y QQ Q Q Q +P+ + P ++ Y P QP P+
Sbjct: 380 YPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPE 439
Query: 194 AGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILP 253
P A ++QQ Q QT + + + E ++ ++
Sbjct: 440 Q--PVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVE 497
Query: 254 ATKVEPPPI 262
TK PP+
Sbjct: 498 ETKPARPPL 506
Score = 31.2 bits (70), Expect = 3.1
Identities = 28/97 (28%), Positives = 30/97 (30%), Gaps = 17/97 (17%)
Query: 128 PLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPG 187
PL + QQ QQ QQQ P P P Q QP V P+ Q
Sbjct: 743 PLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQ----QPQQ------PVAPQPQYQ 792
Query: 188 NYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
A P Q QQ QQ Q Q Q
Sbjct: 793 QPQQPVAPQP-------QYQQPQQPVAPQPQYQQPQQ 822
Score = 30.4 bits (68), Expect = 5.2
Identities = 27/136 (19%), Positives = 38/136 (27%), Gaps = 4/136 (2%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNS-QQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
Q Q Q P I P P Q S QP+ +Q P + Q Y
Sbjct: 354 AQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAE 413
Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPA 254
Q Y +Q QQ Q + Q + + P T
Sbjct: 414 QPA---QQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQ 470
Query: 255 TKVEPPPIRLPQNLPE 270
+ P + PQ + +
Sbjct: 471 PAAQEPLYQQPQPVEQ 486
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 45.3 bits (108), Expect = 1e-04
Identities = 27/98 (27%), Positives = 36/98 (36%), Gaps = 6/98 (6%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ-PGNYPSVPA 194
Q Q Q Q + QQ + + QQ S+ + ++Q AQ
Sbjct: 574 QVTQGGQGGQSEMQQAMEGLGETLREQQ-----GLSDETFRDLQEQFNAQRGEQQGQQGQ 628
Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQ 232
G Q G Q QQ Q QQQ QQ Q +L+ Q
Sbjct: 629 GGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQ 666
Score = 43.4 bits (103), Expect = 5e-04
Identities = 19/99 (19%), Positives = 28/99 (28%), Gaps = 21/99 (21%)
Query: 135 YQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
++ Q+Q Q+ +QQ Q Q Q + Q G
Sbjct: 608 FRDLQEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQG--------QGQQQGQQG------- 652
Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQG 233
Q G Q ++QQ + L Q L G
Sbjct: 653 ------QGGQGGQGSLAERQQALRDELGRQRGGLPGMGG 685
Score = 41.8 bits (99), Expect = 0.002
Identities = 26/94 (27%), Positives = 33/94 (35%), Gaps = 16/94 (17%)
Query: 134 LYQQQQQQQQQQQQQQQPIPNIPPLPNS---QQNSSNFQPSNNLYNNIQVPPRAQPGNYP 190
L Q Q Q + QQ + L + QQ S+ + ++Q AQ G
Sbjct: 573 LQVTQGGQGGQSEMQQA----MEGLGETLREQQ-----GLSDETFRDLQEQFNAQRGEQQ 623
Query: 191 SVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
G G Q QQ QQ Q QQQ Q
Sbjct: 624 ----GQQGQGGQGQGQPGQQGQQGQGQQQGQQGQ 653
Score = 32.2 bits (74), Expect = 1.5
Identities = 20/126 (15%), Positives = 33/126 (26%), Gaps = 54/126 (42%)
Query: 134 LYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNS---------------------------S 166
L +Q Q+ QQ Q NS + +
Sbjct: 512 LAEQAQRNPQQGDQPPDQ-------GNSMELTQQDLQRMMDRIEELMESGRRAEAQQLLE 564
Query: 167 NFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAG-------------YQQQQQQQQQ 213
Q + N+QV Q G ++ G ++ Q+Q
Sbjct: 565 QLQ---QMMENLQVTQGGQGGQ----SEMQQAMEGLGETLREQQGLSDETFRDLQEQFNA 617
Query: 214 QQQQQQ 219
Q+ +QQ
Sbjct: 618 QRGEQQ 623
Score = 29.5 bits (67), Expect = 9.5
Identities = 10/15 (66%), Positives = 10/15 (66%)
Query: 136 QQQQQQQQQQQQQQQ 150
QQQ QQQ Q QQQ
Sbjct: 740 AQQQGQQQGGQGQQQ 754
Score = 29.5 bits (67), Expect = 9.5
Identities = 10/15 (66%), Positives = 10/15 (66%)
Query: 205 QQQQQQQQQQQQQQQ 219
QQQ QQQ Q QQQ
Sbjct: 740 AQQQGQQQGGQGQQQ 754
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 43.0 bits (101), Expect = 9e-04
Identities = 34/194 (17%), Positives = 56/194 (28%), Gaps = 4/194 (2%)
Query: 11 PNSYGSASNVMQVPLIASNPSQ-PSGAAPPISGLQPQVPVSNSS-LQTPLGSSNMMRPPL 68
P + P A P+ P+G PP + QP P L + P
Sbjct: 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860
Query: 69 VTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTP 128
R + + P +R + + S ++F +PP+ + +
Sbjct: 2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ 2920
Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNN--LYNNIQVPPRAQP 186
Q Q + Q P P + + + QP + + VP P
Sbjct: 2921 PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVP 2980
Query: 187 GNYPSVPAGYPSLN 200
PS A S
Sbjct: 2981 QPAPSREAPASSTP 2994
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 36.2 bits (84), Expect = 0.012
Identities = 35/121 (28%), Positives = 39/121 (32%), Gaps = 38/121 (31%)
Query: 124 NGLTPLTNASLYQQQQQQ--QQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVP 181
N TN Q QQQQ QQQ QQ P P P P
Sbjct: 1 NPYQQNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSP-----------------------P 37
Query: 182 PRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN----LSVNQGMNKM 237
+ Q P P YP Q QQQQ Q + SQF N N+ MN +
Sbjct: 38 QQQQGHFMPPQPQPYPK---------QSPQQQQPPQFSSFLSQFKNSDGTFDFNKMMNTV 88
Query: 238 W 238
Sbjct: 89 G 89
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 38.8 bits (90), Expect = 0.012
Identities = 36/144 (25%), Positives = 43/144 (29%), Gaps = 22/144 (15%)
Query: 89 QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASL---YQQQQQQQQQQ 145
Q Q + HS + + NA+L Q Q QQ
Sbjct: 28 QLSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQ 87
Query: 146 QQQQQP-------------IPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSV 192
QQ Q P + ++P P QQ PS Q P QP P V
Sbjct: 88 QQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQV 147
Query: 193 PAGYPSLNQAGYQQQQQQQQQQQQ 216
P Q YQ QQ Q QQ
Sbjct: 148 P------QQQQYQSPPQQPQYQQN 165
Score = 34.9 bits (80), Expect = 0.20
Identities = 40/197 (20%), Positives = 55/197 (27%), Gaps = 18/197 (9%)
Query: 27 ASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTP 86
A + S PP + ++ PL N PP + P S + P
Sbjct: 39 AHSFHVDSTKQPPAPEQVAKHELA----DAPLQQVNAALPPAPAPQSPQPDQQQQSQAPP 94
Query: 87 VPQQ-FQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLT---------NASLYQ 136
Q Q P QQ + QQ Y S+ P YQ
Sbjct: 95 SHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQ 154
Query: 137 QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGY 196
QQ Q QQ + P + S +QP + Y + P + P +G
Sbjct: 155 SPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQS--YPPNEPLPSSMAMQPP--YSGA 210
Query: 197 PSLNQAGYQQQQQQQQQ 213
P Q Q
Sbjct: 211 PPSQQFYGPPQPSPYMY 227
Score = 32.6 bits (74), Expect = 1.0
Identities = 40/216 (18%), Positives = 51/216 (23%), Gaps = 17/216 (7%)
Query: 18 SNVMQVPLIASNPSQPSGAAPPISGLQPQVPVS--NSSLQTPLGSSNMMRPPLVTNRFQA 75
N P A QP + Q P +Q+ + P Q
Sbjct: 69 VNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQP 128
Query: 76 PGATINSLSTPVPQ-QFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASL 134
PQ Q PQQQ Q +++AP +GL P +
Sbjct: 129 QPPPAQQPQAQQPQPPPQVPQQQQYQSPPQ-QPQYQQNPPPQAQSAPQVSGLYP-EESPY 186
Query: 135 YQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNN------------IQVPP 182
Q + P P SQQ QPS +Y Q PP
Sbjct: 187 QPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPP 246
Query: 183 RAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQ 218
Y N Q Q
Sbjct: 247 SQGQEGYGYSGPPPSKGNHGSVASYAPQGSSQSYST 282
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 38.6 bits (90), Expect = 0.013
Identities = 27/169 (15%), Positives = 38/169 (22%), Gaps = 30/169 (17%)
Query: 63 MMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPF 122
MR + + PQQ Q MP
Sbjct: 381 RMRQLPMGSPMG-GAMGQPPYYGQGPQQQFNGQPLGWPRMSM---MPTPMGPGGPLRP-- 434
Query: 123 NNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPP 182
NGL P+ + + Q Q+ P+ + PN Q
Sbjct: 435 -NGLAPMNA----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPL-------------SQD 476
Query: 183 RAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVN 231
QP + Q Q Q Q+Q L + L
Sbjct: 477 LPQP------QSTASQGGQNKKLAQVLASATPQMQKQVLGERLFPLVEA 519
Score = 37.5 bits (87), Expect = 0.030
Identities = 22/136 (16%), Positives = 34/136 (25%), Gaps = 12/136 (8%)
Query: 111 NTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI------PNIPPLPNSQQN 164
+ + Q +P+ A Q QQQ QP+ P+
Sbjct: 373 DQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
Query: 165 SSNFQPSNNL-----YNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
N N N + P Y SL + Q Q Q Q +
Sbjct: 433 RPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPP-NYQSLPLSQDLPQPQSTASQGGQNK 491
Query: 220 TLNSQFSNLSVNQGMN 235
L ++ +
Sbjct: 492 KLAQVLASATPQMQKQ 507
Score = 34.8 bits (80), Expect = 0.20
Identities = 18/92 (19%), Positives = 24/92 (26%), Gaps = 4/92 (4%)
Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPL---PNSQQNSSNFQPSNNLYNNIQVP-PRAQPG 187
A L Q Q Q + +Q P + P Q L P
Sbjct: 369 AHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP 428
Query: 188 NYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
P P G +N + Q Q+ Q
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQ 460
Score = 33.6 bits (77), Expect = 0.49
Identities = 29/125 (23%), Positives = 38/125 (30%), Gaps = 18/125 (14%)
Query: 29 NPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVP 88
+P + PP G PQ + L P S MM P+ P N L+
Sbjct: 389 SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMS--MMPTPMGPGGPLRP----NGLAPMNA 442
Query: 89 QQFQRPQQQNA--NHSQQTFRMPPNTYSQSS-------KTAPFNNGLTPLTN---ASLYQ 136
+ QNA Q PPN S ++ G AS
Sbjct: 443 VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATP 502
Query: 137 QQQQQ 141
Q Q+Q
Sbjct: 503 QMQKQ 507
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 38.5 bits (89), Expect = 0.017
Identities = 23/112 (20%), Positives = 39/112 (34%), Gaps = 21/112 (18%)
Query: 108 MPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSN 167
+PPN + AP T++ ++ + + QQ Q P + N Q
Sbjct: 172 IPPN----PPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQ---- 223
Query: 168 FQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
P+ QP P PA P+ Q Q +Q ++ ++Q
Sbjct: 224 -------------QPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQ 262
Score = 35.0 bits (80), Expect = 0.17
Identities = 16/60 (26%), Positives = 19/60 (31%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAG 195
QQ Q QQ P P QQ N Q PP+ + G+ S G
Sbjct: 204 TQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQG 263
Score = 32.7 bits (74), Expect = 0.87
Identities = 15/63 (23%), Positives = 24/63 (38%), Gaps = 1/63 (1%)
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
P+ Q+P QN Q T + P Q + P P Q Q +Q ++
Sbjct: 201 PKPTQQPTVQNPAQ-QPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRS 259
Query: 148 QQQ 150
++Q
Sbjct: 260 REQ 262
>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
Members of this family include high molecular weight
subunits of glutenin. This group of gluten proteins is
thought to be largely responsible for the elastic
properties of gluten, and hence, doughs. Indeed,
glutenin high molecular weight subunits are classified
as elastomeric proteins, because the glutenin network
can withstand significant deformations without breaking,
and return to the original conformation when the stress
is removed. Elastomeric proteins differ considerably in
amino acid sequence, but they are all polymers whose
subunits consist of elastomeric domains, composed of
repeated motifs, and non-elastic domains that mediate
cross-linking between the subunits. The elastomeric
domain motifs are all rich in glycine residues in
addition to other hydrophobic residues. High molecular
weight glutenin subunits have an extensive central
elastomeric domain, flanked by two terminal non-elastic
domains that form disulphide cross-links. The central
elastomeric domain is characterized by the following
three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
possesses overlapping beta-turns within and between the
repeated motifs, and assumes a regular helical secondary
structure with a diameter of approx. 1.9 nm and a pitch
of approx. 1.5 nm.
Length = 779
Score = 38.5 bits (88), Expect = 0.019
Identities = 37/138 (26%), Positives = 48/138 (34%), Gaps = 2/138 (1%)
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
P Q Q+ Q H + + +P G Q QQ +Q QQ
Sbjct: 216 PGQLQQSGQGQPGHYPTSLQQLGQGQQGHYLASPQQPGQGQQPGQLQQPAQGQQPEQGQQ 275
Query: 148 QQQPIPNIP-PLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQ 206
QQP P Q QP + + Q P + QPG+YP+ + G Q
Sbjct: 276 GQQPAQGQQGHQPAQGQQPGQGQPGH-YPASPQQPGQGQPGHYPASSQQPTQSQEPGQGQ 334
Query: 207 QQQQQQQQQQQQQTLNSQ 224
Q QQ Q QQ Q Q
Sbjct: 335 QGQQVGQGQQAQIPAQGQ 352
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 37.3 bits (87), Expect = 0.024
Identities = 30/118 (25%), Positives = 39/118 (33%), Gaps = 11/118 (9%)
Query: 77 GATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQ 136
G +NS + +Q Q +Q A+ QQ ++ Y N TP S
Sbjct: 71 GGEVNSPTQLTDEQRQLLEQMQADMRQQPTQLSEVPY----------NEQTPQVPRST-V 119
Query: 137 QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
Q QQQ QQQQ P P Q + + P Q P V A
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEA 177
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 38.1 bits (88), Expect = 0.028
Identities = 49/182 (26%), Positives = 64/182 (35%), Gaps = 12/182 (6%)
Query: 12 NSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQT--PLGSSNMMRPPLV 69
N S S+ Q L P PS PP + L P P S Q P GS +P
Sbjct: 160 NESDSDSSAQQQLLQPQGP--PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ 217
Query: 70 TNRFQAPGATINSLSTPVPQQFQRP----QQQNANHSQQTFRMPPNTYSQSSKTAPFNNG 125
Q ++ S + PQ+ P Q Q A+ P + + QSS P
Sbjct: 218 PQ--QPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPM 275
Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPNIP-PLPNSQQNSSNFQPSNNLYNNIQVPPRA 184
L ++ Q Q P PLP+ Q S+ PS + Q PPR
Sbjct: 276 PHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQ-PPRE 334
Query: 185 QP 186
QP
Sbjct: 335 QP 336
>gnl|CDD|172294 PRK13756, PRK13756, tetracycline repressor protein TetR;
Provisional.
Length = 205
Score = 36.4 bits (84), Expect = 0.040
Identities = 24/113 (21%), Positives = 51/113 (45%), Gaps = 12/113 (10%)
Query: 47 VPVSNSSLQTPLGSSNM-MRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQT 105
+P+ S Q L ++ R L++ R GA ++ + P +Q++ + Q A QQ
Sbjct: 68 LPLEGESWQDFLRNNAKSFRCALLSYR---DGAKVHLGTRPTEKQYETLENQLAFLCQQG 124
Query: 106 FRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPL 158
F + Y+ S+ + T + + Q+ Q +++++ ++PPL
Sbjct: 125 FSLENALYALSA--------VGHFTLGCVLEDQEHQVAKEERETPTTDSMPPL 169
>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
Length = 332
Score = 36.6 bits (85), Expect = 0.044
Identities = 23/137 (16%), Positives = 41/137 (29%), Gaps = 12/137 (8%)
Query: 91 FQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQ 150
F R + + S + P QS P T S + + + Q QQ+ +Q
Sbjct: 40 FTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEES---ESENEVQIQQEVEQ 96
Query: 151 PIPNIP-PLPNSQQNSSNFQPSNNLYNNIQVPPRAQP--GNYPSVPAGYPSLNQAGYQQ- 206
+ I LPN + + Q P P N S+ +
Sbjct: 97 SVDEIKITLPNQEPAYYMQNHRSEPIQPTQ-PQYQSPTQTNVASMTIEETQSPNVPIEGI 155
Query: 207 ----QQQQQQQQQQQQQ 219
+Q + + + +
Sbjct: 156 NSSSEQLRVELAELAAE 172
Score = 35.5 bits (82), Expect = 0.10
Identities = 23/145 (15%), Positives = 44/145 (30%), Gaps = 3/145 (2%)
Query: 74 QAPGATINSLSTPVPQQFQRPQ--QQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTN 131
Q S P FQ+PQ + + + Q + + + T P N
Sbjct: 57 QPNTLNPQSYVETTPPPFQQPQTEESESENEVQIQQEVEQSVDEIKITLP-NQEPAYYMQ 115
Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
+ Q Q Q Q Q + +Q + + N+ ++V S
Sbjct: 116 NHRSEPIQPTQPQYQSPTQTNVASMTIEETQSPNVPIEGINSSSEQLRVELAELAAEIYS 175
Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQ 216
+ L + + Q + + Q +
Sbjct: 176 DASHRVELAKNFMEPQAETEAQPEA 200
Score = 32.4 bits (74), Expect = 1.0
Identities = 26/181 (14%), Positives = 47/181 (25%), Gaps = 9/181 (4%)
Query: 6 SNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMR 65
SN++ + Y S +N S P A P + L PQ V + +
Sbjct: 25 SNRREKSQYFSNANTFTRTRPPSRPISNEEADQPNT-LNPQSYVETTPPPFQQPQTEESE 83
Query: 66 PPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNG 125
Q +++ + +P Q QN P + A
Sbjct: 84 SENEVQIQQEVEQSVDEIKITLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIE 143
Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLP--------NSQQNSSNFQPSNNLYNN 177
T N + +Q + + + I N + + + N
Sbjct: 144 ETQSPNVPIEGINSSSEQLRVELAELAAEIYSDASHRVELAKNFMEPQAETEAQPEATTN 203
Query: 178 I 178
Sbjct: 204 F 204
Score = 29.3 bits (66), Expect = 8.9
Identities = 23/169 (13%), Positives = 45/169 (26%), Gaps = 16/169 (9%)
Query: 111 NTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQP 170
NT++++ + + +L Q + QQ Q Q
Sbjct: 38 NTFTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEESESENEVQIQ----QE 93
Query: 171 SNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNL-- 228
+ I++ P+ Y N Q Q Q Q Q + S
Sbjct: 94 VEQSVDEIKI-------TLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIEETQ 146
Query: 229 SVNQGMNKMWGYEPHDLLNTRDILP-ATKVEPPPIRLPQNL--PEIHEQ 274
S N + + + ++ + L +N P+ +
Sbjct: 147 SPNVPIEGINSSSEQLRVELAELAAEIYSDASHRVELAKNFMEPQAETE 195
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 37.2 bits (86), Expect = 0.045
Identities = 35/149 (23%), Positives = 43/149 (28%), Gaps = 29/149 (19%)
Query: 77 GATINSLS---TPVPQQFQRPQQQNANHSQQ-TFRMPPNTYSQSSKTAPFNNGLTPLTNA 132
G +I L P R + Q SQ R N QSS+
Sbjct: 432 GQSIRFLPQSDQQRPVSRGRGRGQRGPRSQPQNQRRQQNRGRQSSQPPR----------- 480
Query: 133 SLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPG-NYPS 191
QQQ + Q Q+Q Q NL P R G
Sbjct: 481 ---QQQNRSNQNNQRQS----------QGPNQGPRGQGGYNLRPRTYQPQRYGGGQGRRW 527
Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
P Y Q QQQQ Q + + Q +T
Sbjct: 528 NPNPYRQSGQGRSQQQQPQPEARGNQSRT 556
>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
box-containing transcription factors [Transcription].
Length = 412
Score = 36.5 bits (84), Expect = 0.063
Identities = 30/221 (13%), Positives = 49/221 (22%), Gaps = 44/221 (19%)
Query: 28 SNPSQPSGAAPPISGLQPQVPVSNS---SLQTPLGSSNMMRPPLVTNRFQA--------- 75
SN S P S P N+ S Q P Q
Sbjct: 160 SNAPSDSSEEPSSSASFSVDPNDNNPMGSFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDH 219
Query: 76 -PGATINSLSTPVPQQFQRPQQQNANHSQQTFRMP---------------PNTYSQSSKT 119
N+ + P +F P+ +++ + P + T
Sbjct: 220 PTVPHSNTNNGRPPAKFMIPELHSSHSTLDLPSDFISDSGFPNQSSTSIFPLDSAIIQIT 279
Query: 120 APFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ 179
P P N + +N N P+ + NN
Sbjct: 280 PPHLPNNPPQENRHELYSNDSSMVSET-------------PPPKNLPNGSPNQSPLNN-- 324
Query: 180 VPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
R P + S+ + + +Q T
Sbjct: 325 -LSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTT 364
Score = 33.5 bits (76), Expect = 0.59
Identities = 31/193 (16%), Positives = 50/193 (25%), Gaps = 22/193 (11%)
Query: 2 QPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSS 61
Q S K P S +N + P P S + + S Q SS
Sbjct: 211 QQHSSRKDHPTVPHSNTNNGRPPAKFMIPELHSSH-STLDLPSDFISDSGFPNQ----SS 265
Query: 62 NMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAP 121
+ P P N+ + ++ S+ P N + S +P
Sbjct: 266 TSIFPLDSAIIQITPPHLPNNPPQENRHELY--SNDSSMVSETP--PPKNLPNGSPNQSP 321
Query: 122 FNNGLTPLTNAS-------------LYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNF 168
NN + + +Q PN + S+
Sbjct: 322 LNNLSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTTQPNSGLHTEASTAPSST 381
Query: 169 QPSNNLYNNIQVP 181
P++ L N Q
Sbjct: 382 IPADPLKNAAQTN 394
>gnl|CDD|220774 pfam10477, EIF4E-T, Nucleocytoplasmic shuttling protein for mRNA
cap-binding EIF4E. EIF4E-T is the transporter protein
for shuttling the mRNA cap-binding protein EIF4E
protein, targeting it for nuclear import. EIF4E-T
contains several key binding domains including two
functional leucine-rich NESs (nuclear export signals)
between residues 438-447 and 613-638 in the human
protein. The other two binding domains are an
EIF4E-binding site, between residues 27-42 in Q9EST3,
and a bipartite NLS (nuclear localisation signals)
between 194-211, and these lie in family EIF4E-T_N.
EIF4E is the eukaryotic translation initiation factor 4E
that is the rate-limiting factor for cap-dependent
translation initiation.
Length = 520
Score = 35.7 bits (81), Expect = 0.12
Identities = 37/188 (19%), Positives = 56/188 (29%), Gaps = 18/188 (9%)
Query: 22 QVPLIASNPSQPSGAAPPISGLQPQ----VPVSNSSLQTPLGSSNMMRPPLVTNRFQAPG 77
Q+P +AS P Q PI+G P N++ + L MM N A
Sbjct: 326 QMPKLASRPQQ--DLINPIAGSLGPGGDPPPDQNNAFKELLD---MMAQNRANNNLVAQQ 380
Query: 78 ATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPL-------T 130
N P+P Q+ R+ S S +N TP T
Sbjct: 381 QQYNV--PPMPTSSHVFLQKRLEIQHLIQRLHCGDVSHSFLEKFLDNPSTPAATKDVIAT 438
Query: 131 NASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYP 190
+ + QQ + P P S + + + + P G P
Sbjct: 439 LLQMLPKNQQSEVVDGDPNIATQQSPLQPQSVHQHYSQELHSQNTARHNISPVISHGTSP 498
Query: 191 SVPAGYPS 198
P+ P+
Sbjct: 499 LSPSFTPT 506
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 35.1 bits (81), Expect = 0.13
Identities = 18/106 (16%), Positives = 31/106 (29%), Gaps = 13/106 (12%)
Query: 127 TPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ--------PSNNLYNNI 178
L++ S + P P +S+ +++
Sbjct: 4 NSLSSISSGSSAPIGTSALRGTNTPAAAKPAPSSSEASNAGSGSSEQKAKLKGQASTAAG 63
Query: 179 QVPPRAQPGNYPSVPAG-----YPSLNQAGYQQQQQQQQQQQQQQQ 219
Q S P +QA +Q Q+QQ+QQQQ+
Sbjct: 64 SASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109
Score = 30.9 bits (70), Expect = 2.4
Identities = 20/125 (16%), Positives = 42/125 (33%), Gaps = 16/125 (12%)
Query: 28 SNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPV 87
S S SG++ PI + + ++ + SS N+ S
Sbjct: 5 SLSSISSGSSAPIGTSALRGTNTPAAAKPAPSSS----------------EASNAGSGSS 48
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
Q+ + Q + + + + + + Q Q+QQ+QQQQ
Sbjct: 49 EQKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQ 108
Query: 148 QQQPI 152
+++ +
Sbjct: 109 EEKQV 113
Score = 29.4 bits (66), Expect = 7.7
Identities = 19/138 (13%), Positives = 39/138 (28%), Gaps = 27/138 (19%)
Query: 80 INSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQ 139
I++ + + P +A T S + N G + + Q
Sbjct: 2 ISNSLSSISSGSSAPIGTSALRGTNTPAA---AKPAPSSSEASNAGSGSSEQKAKLKGQA 58
Query: 140 QQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSL 199
Q+ + ++ + P + Q
Sbjct: 59 STAAGSASQEL-------------QKQASESNDEEVVGEEEPEKDQAEA----------- 94
Query: 200 NQAGYQQQQQQQQQQQQQ 217
+ Q+QQ+QQQQ+++Q
Sbjct: 95 GKQLSQKQQEQQQQEEKQ 112
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 34.6 bits (80), Expect = 0.16
Identities = 18/88 (20%), Positives = 25/88 (28%), Gaps = 14/88 (15%)
Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
A+L Q + ++ + Q Q P Q+S F L P P+
Sbjct: 55 AALKQANARIEELEAQAQHP---------QSQSSGGF-----LSGMFGGGAPRPPPAAPA 100
Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
V P QQ QQ
Sbjct: 101 VQPPAPPARPGWGSGGPSQQGAGQQPGY 128
Score = 33.1 bits (76), Expect = 0.44
Identities = 18/86 (20%), Positives = 23/86 (26%), Gaps = 13/86 (15%)
Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
+ ++ + Q Q Q Q + S PP A P P
Sbjct: 58 KQANARIEELEAQAQHPQS-----------QSSGGFL--SGMFGGGAPRPPPAAPAVQPP 104
Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQ 217
P P G QQ QQ Q
Sbjct: 105 APPARPGWGSGGPSQQGAGQQPGYAQ 130
>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
ABF1 is a sequence-specific DNA binding protein involved
in transcription activation, gene silencing and
initiation of DNA replication. ABF1 is known to remodel
chromatin, and it is proposed that it mediates its
effects on transcription and gene expression by
modifying local chromatin architecture. These functions
require a conserved stretch of 20 amino acids in the
C-terminal region of ABF1 (amino acids 639 to 662 S.
cerevisiae). The N-terminal two thirds of the protein
are necessary for DNA binding, and the N-terminus (amino
acids 9 to 91 in S. cerevisiae) is thought to contain a
novel zinc-finger motif which may stabilise the protein
structure.
Length = 508
Score = 35.1 bits (80), Expect = 0.17
Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 158 LPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQ 217
LPN+ Q S + +++ +P Q +P G+ +Q +Q QQQ Q Q+Q
Sbjct: 280 LPNAHQISGADHHQHQHHHHPSIPAHHQH----QLPEGHQHDHQHHHQHQQQDLQHQEQH 335
Query: 218 QQTLNS 223
QQ +++
Sbjct: 336 QQHVDN 341
Score = 30.9 bits (69), Expect = 3.3
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 185 QPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
Q ++PS+PA + G+Q Q Q QQQ Q
Sbjct: 295 QHHHHPSIPAHHQHQLPEGHQHDHQHHHQHQQQDLQHQEQ 334
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 35.1 bits (81), Expect = 0.20
Identities = 22/114 (19%), Positives = 38/114 (33%), Gaps = 7/114 (6%)
Query: 2 QPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQP-QVPVSNSSLQTPLGS 60
P S + S S+ P + +QP+G P +S P VPV P +
Sbjct: 389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPV-----NPPSTA 443
Query: 61 SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYS 114
+RP + P + ++SL + Q +Q + +
Sbjct: 444 PQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG-NIKEAPTGTQKEI 496
Score = 31.3 bits (71), Expect = 2.5
Identities = 23/143 (16%), Positives = 37/143 (25%), Gaps = 29/143 (20%)
Query: 39 PISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQN 98
P + QP + S + Q+ A S Q P +
Sbjct: 386 PAAAPQPSAAAAASPSPS-----------------QSSAAAQPSAPQSATQPAGTPPTVS 428
Query: 99 ANHSQQTFRMPPNTYSQSSKTAPFNN------GLTPLTNASLYQQQQQQQQQQQQQQQPI 152
+ PP+T Q+ + A F S + Q++ +Q
Sbjct: 429 VDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG----- 483
Query: 153 PNIPPLPNSQQNSSNFQPSNNLY 175
NI P Q + Y
Sbjct: 484 -NIKEAPTGTQKEIFTEEDLQYY 505
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 33.9 bits (78), Expect = 0.31
Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 14/85 (16%)
Query: 137 QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA-- 194
++ +Q+ QQ +Q P +SS + P R N A
Sbjct: 135 EEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSA--------KPER----NVSQEEAKK 182
Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQ 219
+QQQQ++++++++Q+
Sbjct: 183 RLQEWELKKLKQQQQKREEERRKQR 207
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 34.0 bits (78), Expect = 0.39
Identities = 27/187 (14%), Positives = 50/187 (26%), Gaps = 23/187 (12%)
Query: 33 PSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQ 92
S A P G+Q VP S+ S + + + +P + P+Q
Sbjct: 285 SSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEG 344
Query: 93 RPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI 152
T ++ + P +G +P +N+ L Q +
Sbjct: 345 GAAVSQN--------HDGGT-VRAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASL 395
Query: 153 PNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQ 212
+ +++S+ QP PG+ VP + Q
Sbjct: 396 LPPAVVQQQLEDASDKQPP-------------TPGSSL-VPQPDEQELGPSVMALLDRDQ 441
Query: 213 QQQQQQQ 219
Sbjct: 442 GILALIL 448
Score = 32.1 bits (73), Expect = 1.7
Identities = 26/180 (14%), Positives = 46/180 (25%), Gaps = 27/180 (15%)
Query: 54 LQTPLGSSNMMRPPLVTNRFQAPGATINSLST----PVPQQFQRPQQQNANHSQQTFRMP 109
LQ P PL ++F + RPQQ +
Sbjct: 196 LQNPEEILETRASPL--SQFHGFTPHPSLPQPQSPLKPSPSSARPQQSESFS-------- 245
Query: 110 PNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ 169
+ + S+++ PL + ++ Q++Q + P S Q
Sbjct: 246 -DVWPASTQSPREETSAEPL---APASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYVPQ 301
Query: 170 PSNNLYNNIQVPPRAQPGNYP---------SVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
S++ + P QP P + Q G Q +
Sbjct: 302 SSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPR 361
>gnl|CDD|180234 PRK05751, PRK05751, preprotein translocase subunit SecB; Validated.
Length = 156
Score = 32.5 bits (75), Expect = 0.45
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 6/33 (18%)
Query: 194 AGYPSLN------QAGYQQQQQQQQQQQQQQQT 220
G+P LN A Y+Q+ QQ QQ + Q
Sbjct: 123 GGFPPLNLAPINFDALYEQRLQQAQQAAAEAQE 155
Score = 32.1 bits (74), Expect = 0.53
Identities = 12/31 (38%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 121 PFNNGLTPLTNASLYQQQQQQQQQQQQQQQP 151
P N L P+ +LY+Q+ QQ QQ + Q
Sbjct: 127 PLN--LAPINFDALYEQRLQQAQQAAAEAQE 155
>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
[Transcription].
Length = 610
Score = 34.0 bits (78), Expect = 0.46
Identities = 30/186 (16%), Positives = 56/186 (30%), Gaps = 25/186 (13%)
Query: 7 NKQGPNSYGSASNVMQVPLIASNPSQPSGAAP----PISGLQPQVPVSNSSLQTPLGSSN 62
N + P SA +V Q + N S A S + Q ++ + + G SN
Sbjct: 426 NPRSPKKSPSAHSVHQKLSLHVNDLYQSPATSDIASSSSQVNSQPEFISTQIHSSKGVSN 485
Query: 63 MMRPPLVTNRFQAPG-------------ATINSLSTPVPQQFQRPQQQNANHSQQTFRMP 109
+ + + + G INS S Q+ Q + N
Sbjct: 486 VDLTEQDSQKEASKGNFLDDSGSLSPNTNEINSFSLNTTDS-QQKQSPSHNAPTNNSLNE 544
Query: 110 PNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPN-------IPPLPNSQ 162
+ + +S+T N+ ++ Q ++ Q P + PN
Sbjct: 545 MASKNSNSQTQASNSNENVAAVKAILDASAQMEKPYDLSQAATPTKATESASVRQAPNPP 604
Query: 163 QNSSNF 168
+ +
Sbjct: 605 PHQNLV 610
>gnl|CDD|227911 COG5624, TAF61, Transcription initiation factor TFIID, subunit
TAF12 (also component of histone acetyltransferase SAGA)
[Transcription].
Length = 505
Score = 33.9 bits (77), Expect = 0.47
Identities = 14/67 (20%), Positives = 21/67 (31%), Gaps = 3/67 (4%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQ-NSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
+ Q QQ Q+Q P L ++ +S P + + PG VP
Sbjct: 230 LVDRYPQFQQGQKQVLSPQQRFLHGMERYEASGMPPPAEWAGSNGL--HVLPGRREEVPR 287
Query: 195 GYPSLNQ 201
G
Sbjct: 288 GIFRCPS 294
Score = 32.7 bits (74), Expect = 0.87
Identities = 6/36 (16%), Positives = 11/36 (30%)
Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNK 236
G +++ + Q QQ GM +
Sbjct: 222 HDGRSKRRLVDRYPQFQQGQKQVLSPQQRFLHGMER 257
Score = 30.0 bits (67), Expect = 6.2
Identities = 15/91 (16%), Positives = 29/91 (31%), Gaps = 8/91 (8%)
Query: 89 QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQ 148
+ Q+ QQ F Y S G+ P + ++++
Sbjct: 234 YPQFQQGQKQVLSPQQRFLHGMERYEAS--------GMPPPAEWAGSNGLHVLPGRREEV 285
Query: 149 QQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ 179
+ I P +S+ ++ N + NN Q
Sbjct: 286 PRGIFRCPSPESSRGEPTHLDYRNGMANNAQ 316
>gnl|CDD|213398 cd12191, gal11_coact, gall11 coactivator domain. Gall11/MED15 acts
in the general regulation of GAL structural genes and is
required for full expression for several genes in this
pathway, including GALs 1,7, and 10 in Saccharomyces
cerevisiae. GAL11 function is dependent on GCN4
functionality and binds GCN4 in a degenerate manner with
multiple orientations found at the GCN4-Gal11 interface.
Length = 90
Score = 31.2 bits (71), Expect = 0.49
Identities = 28/102 (27%), Positives = 36/102 (35%), Gaps = 27/102 (26%)
Query: 136 QQQQQQQQQQQQQQQPIPN----------IPP-----LPNSQQNSSNFQPSNNLYNNIQV 180
QQQQQ QQQQQQQ P IPP +PN ++ +
Sbjct: 1 PQQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAKIPNLPPEVKTWRQLKAWLQQNNI 60
Query: 181 PPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLN 222
PP+A + Q Q Q++QQQQ
Sbjct: 61 PPQAME------------QIKEVQQTHFQLLMQRRQQQQQQQ 90
>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
Length = 110
Score = 31.4 bits (72), Expect = 0.52
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQ 224
Q Q+QQQ Q++Q++ Q +Q
Sbjct: 15 QKQAQEQQQAQEEQEEAQAQAEAQ 38
Score = 28.3 bits (64), Expect = 7.4
Identities = 7/15 (46%), Positives = 12/15 (80%)
Query: 136 QQQQQQQQQQQQQQQ 150
+QQQ Q++Q++ Q Q
Sbjct: 20 EQQQAQEEQEEAQAQ 34
Score = 28.3 bits (64), Expect = 7.7
Identities = 7/16 (43%), Positives = 12/16 (75%)
Query: 135 YQQQQQQQQQQQQQQQ 150
Q+QQQ Q++Q++ Q
Sbjct: 18 AQEQQQAQEEQEEAQA 33
Score = 27.9 bits (63), Expect = 9.0
Identities = 8/15 (53%), Positives = 13/15 (86%)
Query: 136 QQQQQQQQQQQQQQQ 150
Q+Q Q+QQQ Q++Q+
Sbjct: 15 QKQAQEQQQAQEEQE 29
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 33.1 bits (75), Expect = 0.61
Identities = 19/89 (21%), Positives = 29/89 (32%), Gaps = 1/89 (1%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAG 195
Q+Q +Q Q +QQP + +P ++Q Q + Q Q S
Sbjct: 118 QRQLLEQMQADMRQQPT-QLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTE 176
Query: 196 YPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
Q Q Q Q +Q + Q
Sbjct: 177 QSWQQQTRTSQAAPVQAQPRQSKPASTQQ 205
Score = 32.0 bits (72), Expect = 1.6
Identities = 29/127 (22%), Positives = 43/127 (33%), Gaps = 12/127 (9%)
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
P+Q Q+Q Q R P + P+N TP +Q+QQ Q+Q+Q
Sbjct: 111 PEQLTPEQRQLLEQMQADMRQQPTQLVE----VPWNEQ-TP-------EQRQQTLQRQRQ 158
Query: 148 QQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQ 207
QQ S+ ++Q V + + S Y L Q
Sbjct: 159 AQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTT 218
Query: 208 QQQQQQQ 214
Q + QQ
Sbjct: 219 AQSKPQQ 225
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 33.3 bits (76), Expect = 0.63
Identities = 24/72 (33%), Positives = 34/72 (47%), Gaps = 8/72 (11%)
Query: 154 NIP--PLPNS---QQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQ 208
++P PLPNS +Q + Q N++ ++ + G + A LN QQ Q
Sbjct: 255 DVPDTPLPNSASVEQIQNKMQELNDVLEELR---ESFDGYIGNAFANQIQLNFVIPQQAQ 311
Query: 209 QQQQQQQQQQQT 220
QQ Q QQQQ Q
Sbjct: 312 QQGQGQQQQAQA 323
Score = 32.5 bits (74), Expect = 1.0
Identities = 11/15 (73%), Positives = 11/15 (73%)
Query: 136 QQQQQQQQQQQQQQQ 150
QQ QQQ Q QQQQ Q
Sbjct: 308 QQAQQQGQGQQQQAQ 322
Score = 31.8 bits (72), Expect = 2.0
Identities = 11/16 (68%), Positives = 11/16 (68%)
Query: 205 QQQQQQQQQQQQQQQT 220
Q QQQ Q QQQQ Q T
Sbjct: 309 QAQQQGQGQQQQAQAT 324
Score = 31.8 bits (72), Expect = 2.1
Identities = 11/19 (57%), Positives = 12/19 (63%)
Query: 206 QQQQQQQQQQQQQQTLNSQ 224
QQ QQQ Q QQQQ +Q
Sbjct: 308 QQAQQQGQGQQQQAQATAQ 326
Score = 30.6 bits (69), Expect = 3.9
Identities = 10/14 (71%), Positives = 10/14 (71%)
Query: 138 QQQQQQQQQQQQQP 151
QQ QQQ Q QQQQ
Sbjct: 308 QQAQQQGQGQQQQA 321
>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein. The frequency clock
protein, is the central component of the frq-based
circadian negative feedback loop, regulates various
aspects of the circadian clock in Neurospora crassa.
This protein has been shown to interact with itself via
a coiled-coil.
Length = 989
Score = 33.4 bits (76), Expect = 0.65
Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 7/49 (14%)
Query: 114 SQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQ 162
+QS+ T + L P+ Q QQQ+ Q QP N P S
Sbjct: 319 TQSTLTPSVDAALPPV-------QAQQQEGTQMAPPQPPSNFITNPFSS 360
Score = 31.1 bits (70), Expect = 3.0
Identities = 14/52 (26%), Positives = 19/52 (36%), Gaps = 4/52 (7%)
Query: 183 RAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGM 234
R Q PSV A P + QQQ+ Q Q + SV + +
Sbjct: 318 RTQSTLTPSVDAALPPVQA----QQQEGTQMAPPQPPSNFITNPFSSVPEPL 365
>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
Length = 84
Score = 30.7 bits (69), Expect = 0.68
Identities = 16/34 (47%), Positives = 18/34 (52%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ 169
Q +QQQQQ QQQ P P PN QQ+ Q
Sbjct: 50 QTPEQQQQQPQQQIAPNQIEAPQPNFQQHYQGQQ 83
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 32.7 bits (75), Expect = 0.75
Identities = 22/129 (17%), Positives = 33/129 (25%), Gaps = 16/129 (12%)
Query: 88 PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
PQ Q P ++ P + QQ Q Q +Q
Sbjct: 82 EAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQP 141
Query: 148 QQQPI-PNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQ 206
QQP+ P + P P P AQ P+ P P
Sbjct: 142 LQQPVSPQVAPAPQPVH---------------SAPQPAQQAFQPAEPVAAPQPEPVAEPA 186
Query: 207 QQQQQQQQQ 215
+ +++
Sbjct: 187 PVMDKPKRK 195
>gnl|CDD|220882 pfam10818, DUF2547, Protein of unknown function (DUF2547). This
bacterial family of proteins has no known function.
Length = 90
Score = 30.5 bits (69), Expect = 0.85
Identities = 14/68 (20%), Positives = 17/68 (25%), Gaps = 11/68 (16%)
Query: 91 FQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQ 150
F P+ Q NT + + QQ QQQQ Q
Sbjct: 16 FALPEAQQLQAQNANNENEQNTVINQ-----------QIAQTAQIAQQVQQQQLFIALSQ 64
Query: 151 PIPNIPPL 158
I P
Sbjct: 65 KPLQIAPH 72
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 32.8 bits (75), Expect = 1.1
Identities = 23/148 (15%), Positives = 43/148 (29%), Gaps = 7/148 (4%)
Query: 9 QGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQ----PQVPVSNSSLQTPLGSSNMM 64
Q P S SA + + +A + + P + Q ++ SS S
Sbjct: 104 QAPESSPSARRLTRSEGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKEE 163
Query: 65 RPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNN 124
A + Q + + + R N + AP
Sbjct: 164 GTSSKMPATALASAALFKDDEIR--QEVDAARSDQASQSRLSRSRGNPPAIPPDAAPRQP 221
Query: 125 GLTPLTNASLYQQQQQQQQQQQQQQQPI 152
LT + ++ + + ++ Q Q PI
Sbjct: 222 MLTR-SAGGRFEGEDENLERNLQPQSPI 248
>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
protein. This family represents an accessory protein
that works with the bacteriocin maturation and ABC
transport secretion protein described by TIGR01193
[Transport and binding proteins, Other].
Length = 457
Score = 32.4 bits (74), Expect = 1.1
Identities = 28/118 (23%), Positives = 41/118 (34%), Gaps = 9/118 (7%)
Query: 112 TYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPS 171
Q + + N T A L QQ + Q+ Q Q I ++ +NF P
Sbjct: 155 ETQQQNDKSQTQNEAAEKTKAQLDQQISKTDQKLQDYQALKNAI----SNGTKVANFNPY 210
Query: 172 NNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLS 229
+LY N Q AQ S + QQQ Q Q+ + Q + L+
Sbjct: 211 QSLYENYQ----AQ-LKSASDKDQKNQVKSTILATIQQQIDQLQKSIASYQVQKAGLT 263
Score = 30.1 bits (68), Expect = 6.4
Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 10/108 (9%)
Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSN------NLYNNIQ 179
L N + Q+Q +QQ + ++ L S +N N P++ NL+N
Sbjct: 87 LVVYDNGNEENQKQLLEQQLDNLKDQKKSLDTLKQSIENGRNQFPTDDSFGYRNLFNGYL 146
Query: 180 VPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN 227
AQ + S +Q + ++ + Q QQ + + +
Sbjct: 147 ----AQVESLTSETQQQNDKSQTQNEAAEKTKAQLDQQISKTDQKLQD 190
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 32.4 bits (74), Expect = 1.3
Identities = 31/114 (27%), Positives = 40/114 (35%), Gaps = 27/114 (23%)
Query: 123 NNGLTPLTNAS----LYQQQQQQQQQQQQQQQPIP--NIPP---LPNSQQNSSNFQPSNN 173
NN T L AS +YQ+ + Q P P PP + NF +
Sbjct: 723 NNQPTKLYGASGAMTIYQRYLENQT-------PTPLNLTPPEDIVDMGVDYDGNFVCGSG 775
Query: 174 LYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN 227
+ V + P SL Q QQQ Q QQQ QQQ + S+
Sbjct: 776 GMRVLPV--------WTDDPQ---SLCQQSEMQQQPSQPQQQPQQQPAEQKDSD 818
>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
Length = 732
Score = 32.2 bits (73), Expect = 1.4
Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 6/99 (6%)
Query: 130 TNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNY 189
T A QQ QQ QQ QQ QQ P P + + + ++ P+ + +++ P +
Sbjct: 617 TQAEQPQQPQQPQQPQQPQQPQQPVSPVINDKKSDAGVNVPAGGIEQELKMKPEEEMEQQ 676
Query: 190 --PSVPAGYPSLNQAGYQQQQQQQ----QQQQQQQQTLN 222
P + ++ A Y+ QQ+ QQQ Q+++ +N
Sbjct: 677 LPPGISESGEVVDMAAYEAWQQENHPDIQQQMQRREEVN 715
>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722). This
eukaryotic family of proteins has no known function.
Length = 373
Score = 32.2 bits (73), Expect = 1.4
Identities = 34/172 (19%), Positives = 58/172 (33%), Gaps = 18/172 (10%)
Query: 29 NPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVP 88
P Q A P+ + P S+S L R P R A A + +L+TP
Sbjct: 135 APEQNGTNASPVPPSSYKFPPSSSGLTPRHTVLPTHRRPNSPARIGA--AAVANLATPTT 192
Query: 89 ---------QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQ 139
R ++ + Q+ +P NT + + + T + N Q
Sbjct: 193 PYKEESLGASSPLRRKKFGSQLHQRNMSLPSNTPTSGNTNSNIPKPATSVLNFKPSPAQP 252
Query: 140 QQQQ---QQQQQQQPIPN----IPPLPNSQQNSSNFQPSNNLYNNIQVPPRA 184
+Q Q Q+ + + I P SQQ S++ + I + +
Sbjct: 253 LHKQSKSAPQPSQESMTSFQHIIQWKPESQQKKHRRTKSSSSFGVIDLNSIS 304
>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1. Members of this protein
family are the major adhesin of the Mycoplasma terminal
organelle. The protein is called adhesin P1, cytadhesin
P1, P140, attachment protein, and MgPa, with locus names
MG191 in Mycoplasma genitalium and MPN141 in M.
pneumoniae. A conserved C-terminal region is shared by
additional paralogs in M. pneumoniae and M.
gallisepticum, as well as by the member of this family
[Cell envelope, Surface structures, Cellular processes,
Pathogenesis].
Length = 1425
Score = 32.4 bits (73), Expect = 1.5
Identities = 44/215 (20%), Positives = 70/215 (32%), Gaps = 21/215 (9%)
Query: 10 GPNSYGSASNVMQ--VPLIASNPSQPSGAAPPISG--LQPQVPVSN------SSLQTPLG 59
G N S + + VP I + + AP S SN L + L
Sbjct: 753 GTNLDSSVLDQVLDYVPWIGNGYRYGNNTAPQTSAGSSSGLPTFSNITVGTKQQLASTLA 812
Query: 60 SSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNAN-----HSQQTFRMPPNTYS 114
N M + G + S +F Q Q+ S+Q
Sbjct: 813 GENTMITGGSPQLWTGAGWRNDKASDENHTKFTSAQGQSGTSAGNPRSKQDNISKSGDSQ 872
Query: 115 QSSKTAPFNNGLTPLT---NASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSN-FQP 170
+++ LTP NA + + Q+ Q + + NIP L N+ N++N FQ
Sbjct: 873 EATNYTNLPPNLTPTADWLNALTFTNKNNDQRDQLFLRGLLGNIPVLVNTSGNNTNKFQA 932
Query: 171 SNNLYNNIQVPPRAQPGNYPSVPAGY-PSLNQAGY 204
++ + P + V Y +LN Y
Sbjct: 933 TDQKWTYTYQVGGNLPT-WGEVNGLYNTALNITYY 966
>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182.
Length = 238
Score = 31.4 bits (71), Expect = 1.6
Identities = 40/204 (19%), Positives = 57/204 (27%), Gaps = 11/204 (5%)
Query: 26 IASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSS-NMMRPPLVTNRFQAPGATINSLS 84
+ P P S ++L LG N A+++SL+
Sbjct: 5 MGFAGPFGGDRFPSGGSSVNSPPFSQNNLPNNLGGGGGGPGGGGGGNN--PNLASLSSLT 62
Query: 85 TPVPQQF---QRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQ 141
+ + +P P AP N QQ +
Sbjct: 63 SQGLGKILSGLQPPPLGNGGGSGAGGPGPVGGGGGPGVAPNNIQPNAQAQQPSTQQLRML 122
Query: 142 QQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ-----PGNYPSVPAGY 196
QQ Q Q P + N N L NI+V +AQ GN V
Sbjct: 123 VQQIQMAVQKGYLNPQILNQPLAPQTLLLLNQLLQNIKVLQQAQQSLQRGGNASGVEPVQ 182
Query: 197 PSLNQAGYQQQQQQQQQQQQQQQT 220
++ +QQ Q Q Q QQ
Sbjct: 183 LNVEINKLKQQISQLQNQINAQQA 206
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 31.7 bits (72), Expect = 1.7
Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 4/68 (5%)
Query: 87 VPQQFQRPQQQNANHS----QQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQ 142
V +Q+ R QQQ + Q+ + K A L L L Q+Q++Q
Sbjct: 60 VVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119
Query: 143 QQQQQQQQ 150
++ +Q
Sbjct: 120 AEEAAKQA 127
>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
Length = 238
Score = 31.2 bits (71), Expect = 1.9
Identities = 17/89 (19%), Positives = 32/89 (35%), Gaps = 2/89 (2%)
Query: 139 QQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPS 198
+Q + + I NS ++S N + + S A +
Sbjct: 2 TPEQDPLEVKLAVISESEAEDNSPESSEEVTEQEAELTNPEGDAAEAESSPDSGSAASET 61
Query: 199 L--NQAGYQQQQQQQQQQQQQQQTLNSQF 225
N A Q +Q+ + +Q+ + LNSQ+
Sbjct: 62 AADNAARLAQLEQELESLKQELEELNSQY 90
>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain. This
domain is believed to bind double-stranded DNA of 20
bases length.
Length = 105
Score = 29.8 bits (68), Expect = 1.9
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 200 NQAGYQQQQQQQQQQQQQQQ 219
Q G ++QQQQ++QQQQ++
Sbjct: 11 QQQGGLEEQQQQEEQQQQEE 30
Score = 29.1 bits (66), Expect = 3.7
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 200 NQAGYQQQQQQQQQQQQQQQ 219
Q G ++QQQQ++QQQQ++
Sbjct: 12 QQGGLEEQQQQEEQQQQEEA 31
Score = 28.3 bits (64), Expect = 5.7
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 201 QAGYQQQQQQQQQQQQQQQTLNS 223
+ QQ++QQQQ++ Q+Q L
Sbjct: 17 EEQQQQEEQQQQEEAQKQAILRQ 39
Score = 28.3 bits (64), Expect = 6.0
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 205 QQQQQQQQQQQQQQQTLNSQF 225
QQQ++QQQQ++ Q+Q + Q
Sbjct: 20 QQQEEQQQQEEAQKQAILRQI 40
>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo
red. RCR proteins are ER membrane proteins that
regulate chitin deposition in fungal cell walls.
Although chitin, a linear polymer of beta-1,4-linked
N-acetylglucosamine, constitutes only 2% of the cell
wall it plays a vital role in the overall protection of
the cell wall against stress, noxious chemicals and
osmotic pressure changes. Congo red is a cell
wall-disrupting benzidine-type dye extensively used in
many cell wall mutant studies that specifically targets
chitin in yeast cells and inhibits growth. RCR proteins
render the yeasts resistant to Congo red by diminishing
the content of chitin in the cell wall. RCR proteins are
probably regulating chitin synthase III interact
directly with ubiquitin ligase Rsp5, and the VPEY motif
is necessary for this, via interaction with the WW
domains of Rsp5.
Length = 124
Score = 30.1 bits (68), Expect = 1.9
Identities = 14/87 (16%), Positives = 26/87 (29%), Gaps = 11/87 (12%)
Query: 146 QQQQQPIPNI----PPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVP-----AGY 196
++ QPI PP Q N P VPP + N + +
Sbjct: 28 RRGLQPIYGTAWLAPPSYRQSQQQYNNTPQGRPPQ--YVPPYTETANENDLGYYDGQGEF 85
Query: 197 PSLNQAGYQQQQQQQQQQQQQQQTLNS 223
+A + Q ++ ++ +
Sbjct: 86 HPNPKAPAIELQPPPNAYERGTRSPTT 112
Score = 29.0 bits (65), Expect = 5.6
Identities = 14/83 (16%), Positives = 22/83 (26%), Gaps = 11/83 (13%)
Query: 123 NNGLTPLTNASLYQ---QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNF--------QPS 171
GL P+ + +Q QQQ Q P P N ++
Sbjct: 28 RRGLQPIYGTAWLAPPSYRQSQQQYNNTPQGRPPQYVPPYTETANENDLGYYDGQGEFHP 87
Query: 172 NNLYNNIQVPPRAQPGNYPSVPA 194
N I++ P +
Sbjct: 88 NPKAPAIELQPPPNAYERGTRSP 110
>gnl|CDD|227823 COG5536, BET4, Protein prenyltransferase, alpha subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 328
Score = 31.4 bits (71), Expect = 1.9
Identities = 18/86 (20%), Positives = 28/86 (32%), Gaps = 9/86 (10%)
Query: 237 MWGYEPHDLLNTRDILPATKVEPPPIRLPQNLPEIHEQFLYDRNTEMFG-DPYRRPEIQS 295
M G + DL I+ ++ + +NLP +H + R Q
Sbjct: 246 MIGEKVEDLGKYIVIINGKEL---DLGPKENLPCLHSLLELEFLCHAEKALLTERDIEQK 302
Query: 296 ATVEYLATADYTVRDPPQPAMYLFLL 321
A VE DP + +Y L
Sbjct: 303 ALVELAIKV-----DPARRNLYSTLH 323
>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
kinases [Cell motility and secretion / Signal
transduction mechanisms].
Length = 716
Score = 31.9 bits (73), Expect = 2.0
Identities = 17/104 (16%), Positives = 39/104 (37%), Gaps = 24/104 (23%)
Query: 709 LPLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSIYPELYPIHTLDQDPTI 768
LPL LA+++ R+ ++A +PL S+ ++ + I ++ +
Sbjct: 571 LPL-TLAIIQ------ALLVRVGGEVYA------IPLASVEETLRIDPEDIKPVEGQEVL 617
Query: 769 EWNELQVPIPR---ILQLSAERLQATGTYLLNMPDLIIILVRHG 809
+P+ R +L + + +++VR G
Sbjct: 618 RLRGEVLPLVRLGELLNVPPASPEPE--------QGPVVIVRSG 653
>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 31.0 bits (70), Expect = 2.1
Identities = 12/59 (20%), Positives = 21/59 (35%), Gaps = 1/59 (1%)
Query: 169 QPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN 227
+ N N +Q + N + P Y + +Q Y Q Q Q ++ +Q
Sbjct: 108 KSDNPQANAMQDNSFHENFNN-NYPGNYNNPSQDPYMNQAQSYNQNAYAKENQQAQPPK 165
Score = 30.2 bits (68), Expect = 3.1
Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 5/66 (7%)
Query: 154 NIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQ 213
+ P N+ Q++S + NN Y P P + S NQ Y ++ QQ Q
Sbjct: 109 SDNPQANAMQDNSFHENFNNNYPGNYNNPSQDPYMNQA-----QSYNQNAYAKENQQAQP 163
Query: 214 QQQQQQ 219
+ +
Sbjct: 164 PKYENS 169
>gnl|CDD|227549 COG5224, HAP2, CCAAT-binding factor, subunit B [Transcription].
Length = 248
Score = 31.3 bits (70), Expect = 2.2
Identities = 17/86 (19%), Positives = 21/86 (24%), Gaps = 11/86 (12%)
Query: 157 PLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPS-----------LNQAGYQ 205
P P + +S +N NNI P + S P S L Q
Sbjct: 61 PAPINSATASMSPANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSN 120
Query: 206 QQQQQQQQQQQQQQTLNSQFSNLSVN 231
Q S N S
Sbjct: 121 TSNLMSSGSQLGSFATQSTNGNNSTT 146
>gnl|CDD|214621 smart00327, VWA, von Willebrand factor (vWF) type A domain. VWA
domains in extracellular eukaryotic proteins mediate
adhesion via metal ion-dependent adhesion sites (MIDAS).
Intracellular VWA domains and homologues in prokaryotes
have recently been identified. The proposed VWA domains
in integrin beta subunits have recently been
substantiated using sequence-based methods.
Length = 175
Score = 30.5 bits (69), Expect = 2.3
Identities = 24/123 (19%), Positives = 39/123 (31%), Gaps = 35/123 (28%)
Query: 319 FLLDVSQ-IAAQSGYLYTVCEVLLNQLKSMP-GDRRTSIAIITYDSAVHFYSLAEGQTQP 376
FLLD S + E +L ++ + G + ++T+
Sbjct: 4 FLLDGSGSMGGN--RFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDAR----------- 50
Query: 377 SQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKDSLDSGSALGPALQAAFK 436
+N S D +++ L L K L G+ LG ALQ A +
Sbjct: 51 ----------------VLFPLNDSRSKDALLEALASLSYK----LGGGTNLGAALQYALE 90
Query: 437 LLQ 439
L
Sbjct: 91 NLF 93
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 31.6 bits (71), Expect = 2.6
Identities = 34/198 (17%), Positives = 64/198 (32%), Gaps = 12/198 (6%)
Query: 5 VSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMM 64
V K GP S ++P ++ P P P +P+ P S Q P +
Sbjct: 557 VGKKPGPAKEHKPS---KIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPK 613
Query: 65 RPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNN 124
P L+ P + S P++ PQ+ ++ + ++ + S PF+
Sbjct: 614 LPELL----DIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFD- 668
Query: 125 GLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPP-R 183
P Y + ++ + + + + + P +PP
Sbjct: 669 ---PKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKL 725
Query: 184 AQPGNYPSVPAGYPSLNQ 201
+ +P P G P Q
Sbjct: 726 PRDEEFPFEPIGDPDAEQ 743
>gnl|CDD|185081 PRK15127, PRK15127, multidrug efflux system protein AcrB;
Provisional.
Length = 1049
Score = 31.4 bits (71), Expect = 2.6
Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 24/126 (19%)
Query: 52 SSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPN 111
+ L+ P+ + PP VT PGA ++ V Q +QN N M N
Sbjct: 26 AILKLPVAQYPTIAPPAVTISASYPGADAKTVQDTVTQVI----EQNMNGIDNLMYMSSN 81
Query: 112 TYSQSSK--TAPFNNGLTPLTNASLYQQQQQQQQQ-------QQQQQQPIPNIPPLPNSQ 162
+ S + T F +G T+A + Q Q Q + Q Q+ QQQ + + +
Sbjct: 82 SDSTGTVQITLTFESG----TDADIAQVQVQNKLQLAMPLLPQEVQQQGV-------SVE 130
Query: 163 QNSSNF 168
++SS+F
Sbjct: 131 KSSSSF 136
>gnl|CDD|220724 pfam10380, CRF1, Transcription factor CRF1. CRF1 is a
transcription factor that co-represses ribosomal genes
with FHL1 via the TOR signalling pathway and protein
kinase A.
Length = 114
Score = 29.7 bits (67), Expect = 2.8
Identities = 7/41 (17%), Positives = 16/41 (39%)
Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ 169
L+ SLY Q + ++ + Q I + + ++
Sbjct: 57 LSTRSLYPLITQNNKSEEPRLQSITQTDASEENDVSDASDD 97
>gnl|CDD|114726 pfam06020, Roughex, Drosophila roughex protein. This family
consists of several roughex (RUX) proteins specific to
Drosophila species. Roughex can influence the
intracellular distribution of cyclin A and is therefore
defined as a distinct and specialised cell cycle
inhibitor for cyclin A-dependent kinase activity. Rux is
though to regulate the metaphase to anaphase transition
during development.
Length = 334
Score = 31.1 bits (70), Expect = 2.8
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 136 QQQQQQQQQQQQQQQPIPNIP---PLPNSQQNSSNFQPSNNLY--NNIQVPPRAQPG 187
+Q+Q QQ ++ + NIP P P N S++ P N +VPP+ PG
Sbjct: 272 EQEQPAPQQAEETAREAVNIPVDLPTPLETTNFSSYTPRKRQQTTNGNEVPPKRTPG 328
>gnl|CDD|220771 pfam10471, APC_CDC26, Anaphase-promoting complex APC subunit 1.
The anaphase-promoting complex (APC) or cyclosome is a
cell cycle-regulated ubiquitin-protein ligase that
regulates important events in mitosis such as the
initiation of anaphase and exit from telophase. The APC,
in conjunction with other enzymes, assembles
multi-ubiquitin chains on a variety of regulatory
proteins thereby targeting them for proteolysis by the
26S proteasome. CDC26 is one of the nine or so subunits
identified within APC but its exact function is not
known. The APC/C becomes active at the
metaphase/anaphase transition and remains active during
G1 phase. One mechanism linked to activation of the
APC/C is phosphorylation. The yeast APC/C is composed of
at least 13 subunits, but the function of many of the
subunits is unknown. Hcn1 is the smallest subunit of the
S. pombe APC/C, and is found to be essential for cell
viability, APC/C integrity, and proper APC/C regulation.
In addition, Hcn1 phosphorylation indicates a specific
role for the phosphorylation of this subunit late in the
cell cycle.
Length = 78
Score = 28.6 bits (64), Expect = 2.8
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 3/48 (6%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPR 183
++Q+ QQQQ QQQ N + +S P+++ Q R
Sbjct: 24 KEQKLQQQQNAQQQSVSSN---SAAGETAASTDSPNDSSKPLPQDSAR 68
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class
3 flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 29.3 bits (66), Expect = 3.0
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 205 QQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNK 236
Q QQQQ ++ QQQQQ+ + + +G+N+
Sbjct: 69 QLQQQQSEEAQQQQQSSIEDSPSQTSAEGVNR 100
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 31.2 bits (70), Expect = 3.2
Identities = 33/174 (18%), Positives = 53/174 (30%), Gaps = 18/174 (10%)
Query: 22 QVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATIN 81
Q P + P +P+ QP +P G++ M+ Q P
Sbjct: 653 QPPQVEITPYKPTWTQIGHIPYQP----------SPTGANTMLPIQWAPGTMQPPPRAPT 702
Query: 82 SLSTPV--PQQFQRPQQQNAN----HSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLY 135
+ P P + QRP + PP ++ G A+
Sbjct: 703 PMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 762
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNS--SNFQPSNNLYNNIQVPPRAQPG 187
+ + QP P PP P + + P ++Q+ PRA PG
Sbjct: 763 RARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPG 816
>gnl|CDD|238163 cd00261, AAI_SS, AAI_SS: Alpha-Amylase Inhibitors (AAIs) and Seed
Storage (SS) Protein subfamily; composed of cereal-type
AAIs and SS proteins. They are mainly present in the
seeds of a variety of plants. AAIs play an important
role in the natural defenses of plants against insects
and pathogens such as fungi, bacteria and viruses. AAIs
impede the digestion of plant starch and proteins by
inhibiting digestive alpha-amylases and proteinases.
Also included in this subfamily are SS proteins such as
2S albumin, gamma-gliadin, napin, and prolamin. These
AAIs and SS proteins are also known allergens in humans.
Length = 110
Score = 29.3 bits (66), Expect = 3.3
Identities = 26/94 (27%), Positives = 32/94 (34%), Gaps = 22/94 (23%)
Query: 135 YQQQQQQQQQQQ---------QQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ 185
Q QQQ QQ QQ + P P Q L Q+P + +
Sbjct: 3 QCQPGQQQPQQPLNSCREYLRQQCSGVGGPPVWPQQSCEVLRQQCCQQLA---QIPEQCR 59
Query: 186 PGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
+L Q QQQQQQQ+QQQ
Sbjct: 60 ----------CEALRQMVQGVIQQQQQQQEQQQG 83
Score = 28.1 bits (63), Expect = 7.3
Identities = 12/17 (70%), Positives = 12/17 (70%)
Query: 207 QQQQQQQQQQQQQTLNS 223
QQQ Q QQQ QQ LNS
Sbjct: 1 QQQCQPGQQQPQQPLNS 17
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 30.3 bits (68), Expect = 3.3
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 136 QQQQQQQQQQQQQQQPIPNIP 156
QQ Q+ QQ QQQ +P P P
Sbjct: 135 QQAQRPQQAAQQQSRPAPQQP 155
Score = 28.7 bits (64), Expect = 9.5
Identities = 14/48 (29%), Positives = 17/48 (35%), Gaps = 3/48 (6%)
Query: 110 PNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQ---QPIPN 154
P Q + N P A QQ QQQ + QQ QP +
Sbjct: 115 PQGDDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAAD 162
>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
Length = 246
Score = 30.3 bits (69), Expect = 3.5
Identities = 9/27 (33%), Positives = 12/27 (44%)
Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQFSN 227
+ G + Q Q Q Q Q L +QF
Sbjct: 98 EQGLAEGQAQAAPQAAQLQALAAQFQE 124
Score = 29.1 bits (66), Expect = 8.5
Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 5/33 (15%)
Query: 201 QAGYQQ-----QQQQQQQQQQQQQTLNSQFSNL 228
+AGYQ+ +Q + Q Q +Q L
Sbjct: 86 EAGYQEGLAQGLEQGLAEGQAQAAPQAAQLQAL 118
>gnl|CDD|215106 PLN00212, PLN00212, glutelin; Provisional.
Length = 493
Score = 30.6 bits (69), Expect = 3.9
Identities = 15/41 (36%), Positives = 19/41 (46%)
Query: 124 NGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQN 164
NGL L QQ+Q QQQQQ+ QQ + + N
Sbjct: 261 NGLQLLQPTLTQQQEQAQQQQQRLYQQVQYQQSQQTSGRWN 301
>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
Provisional.
Length = 134
Score = 29.5 bits (66), Expect = 4.0
Identities = 10/43 (23%), Positives = 23/43 (53%), Gaps = 7/43 (16%)
Query: 205 QQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLN 247
++++QQQ++ ++Q+ +N+ + K G P +LL
Sbjct: 42 EEEEQQQRELAERQEKINTWLE-------LMKADGINPEELLG 77
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 30.0 bits (67), Expect = 4.2
Identities = 13/54 (24%), Positives = 26/54 (48%), Gaps = 3/54 (5%)
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ---VPPRAQP 186
+Q++ ++ + Q ++Q P+ P Q QP+NN ++ P R +P
Sbjct: 160 KQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEEGLSPDRTRP 213
Score = 30.0 bits (67), Expect = 5.0
Identities = 10/38 (26%), Positives = 18/38 (47%)
Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSS 166
L + + ++Q+Q QQ Q Q P PN+ + +
Sbjct: 164 LEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKA 201
>gnl|CDD|221872 pfam12954, DUF3843, Protein of unknown function (DUF3843). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 325
Score = 30.4 bits (69), Expect = 4.3
Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 6/58 (10%)
Query: 219 QTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPATKVEPPPIRLPQNLPEIHEQFL 276
L F +N G + ++ L RD+L PPI + LPE +E+FL
Sbjct: 140 GLLEEAFETAPINDGPSSLF------LHEGRDLLKMPFTPLPPIEPEEKLPEDYEKFL 191
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 30.7 bits (70), Expect = 4.6
Identities = 30/114 (26%), Positives = 40/114 (35%), Gaps = 17/114 (14%)
Query: 127 TPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNN---LYNNIQVPPR 183
L+ SL Q + + Q Q Q ++ NSQ S QP LY N Q R
Sbjct: 116 ETLSTLSLRQLESRLAQTLDQLQNAQNDLAEY-NSQLVSLQTQPERAQAALYANSQ---R 171
Query: 184 AQP------GNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTL---NSQFSNL 228
Q G A PS + Q +Q Q Q+ N+Q +L
Sbjct: 172 LQQIRNLLKGGKVGGKALRPSQ-RVLLQAEQALLNAQNDLQRKSLEGNTQLQDL 224
>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like;
Provisional.
Length = 1195
Score = 30.6 bits (69), Expect = 4.7
Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 2/95 (2%)
Query: 38 PPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQ 97
P + Q + N S Q SSN + L + + + A + + Q PQ +
Sbjct: 26 PQTNWSQHAKALENISSQRKFLSSNFLYS-LESQKDREMAARSINCAPVNVQTLASPQIE 84
Query: 98 NANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNA 132
A H+ + + N Y + KT P +NG T L++
Sbjct: 85 KAWHALSSLSIN-NNYLRPGKTPPIDNGSTDLSSD 118
>gnl|CDD|214832 smart00817, Amelin, Ameloblastin precursor (Amelin). This family
consists of several mammalian Ameloblastin precursor
(Amelin) proteins. Matrix proteins of tooth enamel
consist mainly of amelogenin but also of non-amelogenin
proteins, which, although their volumetric percentage is
low, have an important role in enamel mineralisation.
One of the non-amelogenin proteins is ameloblastin, also
known as amelin and sheathlin. Ameloblastin (AMBN) is
one of the enamel sheath proteins which is though to
have a role in determining the prismatic structure of
growing enamel crystals.
Length = 411
Score = 30.2 bits (68), Expect = 4.9
Identities = 40/157 (25%), Positives = 56/157 (35%), Gaps = 22/157 (14%)
Query: 74 QAPGAT-INSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFN----NGLTP 128
Q PG + SLS +Q Q N P + YS+ PFN +GL P
Sbjct: 22 QQPGTPGMASLSLETMRQLGSLQGLN----------PLSQYSRFGFGKPFNSLWLHGLLP 71
Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ--- 185
S + + ++ + QQ + +P PP SQ + QP + P Q
Sbjct: 72 --PHSSFPWLRPREHETQQYEYSLPVHPPPLPSQPSLQPQQPGLKPFLQPTALPTNQATP 129
Query: 186 --PGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
G P + G P L QA Q + QT
Sbjct: 130 QKNGPQPPMHLGQPPLQQAELPMIPPQVAPSDKPPQT 166
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 30.1 bits (68), Expect = 5.4
Identities = 10/35 (28%), Positives = 15/35 (42%), Gaps = 4/35 (11%)
Query: 200 NQAGYQQ----QQQQQQQQQQQQQTLNSQFSNLSV 230
N A Q Q+ Q+ Q+Q+ + L Q L
Sbjct: 69 NDALANQLTALQKAQESQKQELEGILKQQAKALDQ 103
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 30.0 bits (68), Expect = 5.8
Identities = 22/98 (22%), Positives = 27/98 (27%), Gaps = 8/98 (8%)
Query: 6 SNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMR 65
S P+S + P S+PS S P S P S S +N
Sbjct: 183 SPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPES------PTNPSP 236
Query: 66 PPLVTNRFQAPGATINSLST--PVPQQFQRPQQQNANH 101
PP P + LST P P
Sbjct: 237 PPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGI 274
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 28.4 bits (64), Expect = 6.5
Identities = 8/46 (17%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 204 YQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTR 249
+ QQ+ +Q +++ + L Q G +L N +
Sbjct: 7 LAEAQQELEQAEEKLEELQQYRLEYR-QQLSGSGQGISAAELRNYQ 51
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 30.0 bits (68), Expect = 7.0
Identities = 9/20 (45%), Positives = 13/20 (65%), Gaps = 3/20 (15%)
Query: 205 QQQQQQQQ---QQQQQQQTL 221
++Q QQ+ +QQ QQQ L
Sbjct: 191 EKQSQQKTLLYEQQAQQQKL 210
>gnl|CDD|221416 pfam12090, Spt20, Spt20 family. This presumed domain is found in
the Spt20 proteins from both human and yeast. The Spt20
protein is part of the SAGA complex which is a large
cmplex mediating histone deacetylation. Yeast Spt20 has
been shown to play a role in structural integrity of the
SAGA complex as as no intact SAGA could be purified in
spt20 deletion strains.
Length = 191
Score = 29.3 bits (66), Expect = 7.1
Identities = 10/41 (24%), Positives = 15/41 (36%)
Query: 130 TNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQP 170
S Q Q Q Q+QQ ++ P N Q+ +
Sbjct: 87 PQQSQQQAQAQSQEQQDKENSPAQNQQADGQKQKPVKKPRV 127
>gnl|CDD|150393 pfam09712, PHA_synth_III_E, Poly(R)-hydroxyalkanoic acid synthase
subunit (PHA_synth_III_E). This entry represents the
PhaE subunit of the heterodimeric class (class III) of
polymerase for poly(R)-hydroxyalkanoic acids (PHAs),
carbon and energy storage polymers of many bacteria. The
most common PHA is polyhydroxybutyrate but about 150
different constituent hydroxyalkanoic acids (HAs) have
been identified in various species.
Length = 293
Score = 29.8 bits (67), Expect = 7.1
Identities = 35/154 (22%), Positives = 48/154 (31%), Gaps = 18/154 (11%)
Query: 83 LSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQ 142
+S P Q Q F Q + A L Q Q Q
Sbjct: 45 MSPAAPDQTQAAMDALMEQGGNFFGTAEQFLRQFEEQNA----------AHLEQMQAQFA 94
Query: 143 QQQQQQQQPIPNIP-----PLPNSQQ--NSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA- 194
PL Q+ +S + P + L N R+Q + S+PA
Sbjct: 95 GSLDTGDTTSDRAAQAWQLPLDGWQRMFSSLSPMPGDGLRNMPHQQFRSQLDPFLSMPAL 154
Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNL 228
GY +QA YQ + Q + QQ Q Q+S
Sbjct: 155 GYTREHQAQYQALIRAQMEYQQALQEYTGQYSQA 188
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 30.0 bits (67), Expect = 7.2
Identities = 33/139 (23%), Positives = 42/139 (30%), Gaps = 16/139 (11%)
Query: 35 GAAPPISGLQPQVPVSNSSLQTPLGSSNM-------MRPPLVTNRFQAPGATINSLSTP- 86
G PP+ QPQ + G+S M PLV R+ GAT++ P
Sbjct: 680 GPVPPVPATQPQYFDIPLTEPINQGASAAHFLPQQPMEGPLVPERWMFQGATLSQSVRPG 739
Query: 87 --VPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQ 144
Q F P Q NH Q P+ A Q Q Q
Sbjct: 740 VAQSQYFDLPLTQPINHGAPAAHFLH----QPPMEGPWVPEQWMFQGAPPSQGTDVVQHQ 795
Query: 145 QQQQQQP--IPNIPPLPNS 161
+ N P +P S
Sbjct: 796 LDALGYVLHVLNHPGVPVS 814
>gnl|CDD|220844 pfam10675, DUF2489, Protein of unknown function (DUF2489). This is
a bacterial family of uncharacterized proteins.
Length = 131
Score = 28.7 bits (65), Expect = 7.5
Identities = 5/17 (29%), Positives = 12/17 (70%)
Query: 134 LYQQQQQQQQQQQQQQQ 150
L +Q++ Q++ Q + +Q
Sbjct: 15 LKKQKELQEKAQAEAKQ 31
>gnl|CDD|226036 COG3505, VirD4, Type IV secretory pathway, VirD4 components
[Intracellular trafficking and secretion].
Length = 596
Score = 29.7 bits (67), Expect = 7.7
Identities = 20/113 (17%), Positives = 33/113 (29%), Gaps = 15/113 (13%)
Query: 295 SATVEYLATADYTVRDP---PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDR 351
+ ++ L +RD + + L + A + L Q S D
Sbjct: 271 TQAIKGLEGVFGRLRDILSSTELSSLAMLTRILLGDALPW-----VKAALGQFFSNKTDE 325
Query: 352 RTSIAIITYDSAVHFYSLAE--GQTQPSQMILTDI-----DDIFLPSPENILV 397
TS + T SA+ T S + D+ F P+ I
Sbjct: 326 ETSGVLSTASSALTLLMNPLVAAATSGSDFDIRDLRKKKTTVYFGLDPDRISR 378
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 28.0 bits (63), Expect = 7.8
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 200 NQAGYQQQQQQQQQQQQQQQTLNSQFSNL 228
+QQ QQQ Q QQ + L +Q +
Sbjct: 3 LINKFQQLQQQLQLLAQQIKQLETQLKEI 31
>gnl|CDD|151422 pfam10975, DUF2802, Protein of unknown function (DUF2802). This
bacterial family of proteins has no known function.
Length = 70
Score = 27.1 bits (61), Expect = 7.8
Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 205 QQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKM 237
+ +QQ ++ Q +Q++ + ++ K+
Sbjct: 9 ELEQQLKELQDRQEELEQQDPQSRLYSRAA-KL 40
>gnl|CDD|223555 COG0479, FrdB, Succinate dehydrogenase/fumarate reductase, Fe-S
protein subunit [Energy production and conversion].
Length = 234
Score = 29.2 bits (66), Expect = 8.3
Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 3/35 (8%)
Query: 848 FISYLNDDKAYVSP---VKVIKDSSPDRMEFYDKL 879
+ L + + P VI+D D EFY+KL
Sbjct: 76 LMKDLEEGVITIEPLPNFPVIRDLVVDMEEFYEKL 110
>gnl|CDD|184029 PRK13399, PRK13399, fructose-1,6-bisphosphate aldolase;
Provisional.
Length = 347
Score = 29.4 bits (66), Expect = 8.4
Identities = 23/59 (38%), Positives = 25/59 (42%), Gaps = 12/59 (20%)
Query: 778 PRILQLSA-ERLQATGTYLLNM--------PDLIIIL-VRHGTSPAICSGLF--GFQSV 824
P ILQ S R A L +M PD+ I L HG SPA C GF SV
Sbjct: 44 PVILQASRGARKYAGDAMLRHMVLAAAEMYPDIPICLHQDHGNSPATCQSAIRSGFTSV 102
>gnl|CDD|233516 TIGR01661, ELAV_HUD_SF, ELAV/HuD family splicing factor. This
model describes the ELAV/HuD subfamily of splicing
factors found in metazoa. HuD stands for the human
paraneoplastic encephalomyelitis antigen D of which
there are 4 variants in human. ELAV stnds for the
Drosophila Embryonic lethal abnormal visual protein.
ELAV-like splicing factors are also known in human as
HuB (ELAV-like protein 2), HuC (ELAV-like protein 3,
Paraneoplastic cerebellar degeneration-associated
antigen) and HuR (ELAV-like protein 1). These genes are
most closely related to the sex-lethal subfamily of
splicing factors found in Dipteran insects (TIGR01659).
These proteins contain 3 RNA-recognition motifs (rrm:
pfam00076).
Length = 352
Score = 29.5 bits (66), Expect = 9.0
Identities = 29/143 (20%), Positives = 43/143 (30%), Gaps = 33/143 (23%)
Query: 23 VPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINS 82
+ N + PSG PI+ V +N+ + +S + L Q P T
Sbjct: 146 RAIKTLNGTTPSGCTEPIT-----VKFANNPSSS---NSKGLLSQLEA--VQNPQTTRVP 195
Query: 83 LSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQ 142
LST + P +H+ FR ++ QQQQ
Sbjct: 196 LSTILTAAGIGP----MHHAAARFRPSAGDFTA-------------------VLAHQQQQ 232
Query: 143 QQQQQQQQPIPNIPPLPNSQQNS 165
QQ PP + Q
Sbjct: 233 HAVAQQHAAQRASPPATDGQTAG 255
>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
Length = 780
Score = 29.8 bits (67), Expect = 9.3
Identities = 31/206 (15%), Positives = 51/206 (24%), Gaps = 35/206 (16%)
Query: 18 SNVMQVPLIASNPSQPS-GAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAP 76
S Q + +Q + G S S SSLQ+ L S + A
Sbjct: 304 SLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSLQSMLALSK-------SLDLTAE 356
Query: 77 GATINSLSTPVP-QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLY 135
GAT+++L P Q Q+ A S+ +L
Sbjct: 357 GATVDALGAPDGVQWLDESQKTLATLSELLS---------------TGIDGVSEGLDALE 401
Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQN-----------SSNFQPSNNLYNNIQVPPRA 184
Q Q + + + + I + N L + A
Sbjct: 402 QASAQLAKSLAKLKTAVAQIAASIAQLLPGASEVLKTLKSKGLDKLLNQLNGALAKGSNA 461
Query: 185 QPGNYPSVPAGYPSLNQAGYQQQQQQ 210
+ S+ A +
Sbjct: 462 LVQGLSDANDSFRSITSAQLKAGLNT 487
>gnl|CDD|119430 cd05170, PIKKc_SMG1, Suppressor of morphogenetic effect on
genitalia-1 (SMG-1), catalytic domain; The SMG-1
catalytic domain subfamily is part of a larger
superfamily that includes the catalytic domains of other
kinases such as the typical serine/threonine/tyrosine
protein kinases (PKs), aminoglycoside
phosphotransferase, choline kinase, and RIO kinases.
SMG-1 is a member of the phosphoinositide
3-kinase-related protein kinase (PIKK) subfamily. PIKKs
have intrinsic serine/threonine kinase activity and are
distinguished from other PKs by their unique catalytic
domain, similar to that of lipid PI3K, and their large
molecular weight (240-470 kDa). In addition to its
catalytic domain, SMG-1 contains a FATC (FRAP, ATM and
TRRAP, C-terminal) domain at the C-terminus. SMG-1 plays
a critical role in the mRNA surveillance mechanism known
as non-sense mediated mRNA decay (NMD). NMD protects the
cells from the accumulation of aberrant mRNAs with
premature termination codons (PTCs) generated by genome
mutations and by errors during transcription and
splicing. SMG-1 phosphorylates Upf1, another central
component of NMD, at the C-terminus upon recognition of
PTCs. The phosphorylation/dephosphorylation cycle of
Upf1 is essential for promoting NMD.
Length = 307
Score = 29.3 bits (66), Expect = 9.4
Identities = 19/61 (31%), Positives = 22/61 (36%), Gaps = 20/61 (32%)
Query: 125 GLTPLTNASLY-------QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNN 177
G TPL LY Q Q+ Q Q IP I P +PS+ YN
Sbjct: 90 GATPLFG--LYKRWQQREAVLQAQKSQVGYQNPQIPGIVP-----------RPSDLFYNK 136
Query: 178 I 178
I
Sbjct: 137 I 137
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.133 0.389
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 45,949,362
Number of extensions: 4482886
Number of successful extensions: 11411
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8540
Number of HSP's successfully gapped: 367
Length of query: 912
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 806
Effective length of database: 6,236,078
Effective search space: 5026278868
Effective search space used: 5026278868
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.6 bits)