RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy18120
(392 letters)
>gnl|CDD|128456 smart00151, SWIB, SWI complex, BAF60b domains.
Length = 77
Score = 110 bits (277), Expect = 4e-30
Identities = 32/77 (41%), Positives = 39/77 (50%)
Query: 300 QPLQFKLDPRLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCP 359
+ L P LA++LG TR II LW+YIK H LQD + I CD EQIF
Sbjct: 1 ITKKVTLSPELAKVLGAPEMTRTEIIKRLWEYIKEHNLQDPQNKREILCDSKLEQIFGKD 60
Query: 360 RMKFAEIPQRLNPLLHP 376
RM E+ + L P L
Sbjct: 61 RMDMFEMNKLLTPHLIK 77
>gnl|CDD|145386 pfam02201, SWIB, SWIB/MDM2 domain. This family includes the SWIB
domain and the MDM2 domain. The p53-associated protein
(MDM2) is an inhibitor of the p53 tumour suppressor gene
binding the transactivation domain and down regulating
the ability of p53 to activate transcription. This
family contains the p53 binding domain of MDM2.
Length = 76
Score = 97.7 bits (244), Expect = 2e-25
Identities = 27/75 (36%), Positives = 38/75 (50%)
Query: 300 QPLQFKLDPRLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCP 359
F L P LA+ LG +R ++ LWQYIK H LQD + I CD+ + IF
Sbjct: 1 LTKPFPLSPDLAKFLGAGELSRTEVVKKLWQYIKEHNLQDPKNKRIILCDEKLKSIFGGD 60
Query: 360 RMKFAEIPQRLNPLL 374
R+ F E+ + L+
Sbjct: 61 RVGFFEMSKLLSSHF 75
>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
chromatin remodeling [Chromatin structure and dynamics].
Length = 237
Score = 70.2 bits (172), Expect = 8e-14
Identities = 48/186 (25%), Positives = 82/186 (44%), Gaps = 12/186 (6%)
Query: 197 LLEDSKNDPNKSGEGEEGSVASWELRVEGRLLEDSKNDPNKVKRKFSSFFKSLVIE---- 252
L+ + D N + + + W +R+EGR ND ++ KF S + +
Sbjct: 14 WLQLDERDTNNEKDVGKLLFSEWTVRLEGRF--KDNNDL--IRDKFDSLAEEPRVLRKEK 69
Query: 253 ---LDKDLYGPDNHLVEWHRTPTTQET-DGFQVKRPGDKNVRCTILLLLDYQPLQFKLDP 308
K D E P+++ET +G + DK + + + + KL P
Sbjct: 70 YNITRKTTGKNDLPKEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVKLSP 129
Query: 309 RLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCPRMKFAEIPQ 368
+LA +LG+ TRP + LW+YIK H LQD + + I CD +++ + E+ +
Sbjct: 130 KLAAILGLEPGTRPEAVKKLWKYIKKHNLQDPNNKRLILCDSKLKKVLGSDPIDMFELTK 189
Query: 369 RLNPLL 374
L+P L
Sbjct: 190 PLSPHL 195
>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
Length = 987
Score = 44.9 bits (106), Expect = 6e-05
Identities = 19/57 (33%), Positives = 29/57 (50%)
Query: 310 LARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCPRMKFAEI 366
LA ++G RP +I LW YIK + LQD ++ IN D +F ++ E+
Sbjct: 922 LAAVIGAEPVARPEVIKKLWDYIKANNLQDPADKRAINADAKLRPVFGKDQVTMFEL 978
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 44.4 bits (105), Expect = 9e-05
Identities = 15/68 (22%), Positives = 29/68 (42%)
Query: 304 FKLDPRLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCPRMKF 363
+ P LA ++G R +W YIK H LQ ++ I D + + +
Sbjct: 788 YTPSPALAAMIGAEPVGRGEATKKVWDYIKEHGLQSPENKKLIIPDSKLQGVIGPDPIDM 847
Query: 364 AEIPQRLN 371
++ ++L+
Sbjct: 848 FQLSKKLS 855
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 38.4 bits (89), Expect = 0.008
Identities = 29/103 (28%), Positives = 37/103 (35%), Gaps = 16/103 (15%)
Query: 7 GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG--PRPGGPGVPPNQQPPYT 64
G +Q P P+ P A P P TP P GP PP +QPP
Sbjct: 398 GPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAP 457
Query: 65 GMRPSGPVNPNIANKRPSDA----RPPNNLKNDYQHGPPGPGP 103
P+ +P+ A ++ DA RPP PPG
Sbjct: 458 ATEPAPD-DPDDATRKALDALRERRPPE---------PPGADL 490
Score = 37.6 bits (87), Expect = 0.012
Identities = 25/90 (27%), Positives = 28/90 (31%), Gaps = 14/90 (15%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGP----RPGGPGVPPNQQPPYTGMRPSG-PVN 73
PPP PP P P P P P R P PP P + G P
Sbjct: 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRG 2610
Query: 74 PNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
P + P D P+ PP P P
Sbjct: 2611 PAPPSPLPPDTHAPD---------PPPPSP 2631
Score = 36.5 bits (84), Expect = 0.030
Identities = 20/91 (21%), Positives = 23/91 (25%), Gaps = 14/91 (15%)
Query: 17 RYPPPSGPPMRYGNQNMAMQPRP----GFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPV 72
PPP P +P P TP PPGP P P P+GP
Sbjct: 2700 DPPPPPPTP----------EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
Query: 73 NPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
P + P P
Sbjct: 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPP 2780
Score = 32.2 bits (73), Expect = 0.52
Identities = 28/95 (29%), Positives = 35/95 (36%), Gaps = 16/95 (16%)
Query: 8 SQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMR 67
S++T L P PP QP+ P PP P+P P P Q PP R
Sbjct: 2892 SRSTESFALPPDQPERPP----------QPQA---PPPPQPQPQPPPPPQPQPPPPPPPR 2938
Query: 68 PSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPG 102
P P+ P P+ A P+ G PG
Sbjct: 2939 PQPPLAP---TTDPAGAGEPSGAVPQPWLGALVPG 2970
Score = 32.2 bits (73), Expect = 0.54
Identities = 23/93 (24%), Positives = 28/93 (30%), Gaps = 9/93 (9%)
Query: 14 APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
PPPS P N+ PP RP P P R
Sbjct: 2622 HAPDPPPPSPSP--AANE----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
Query: 74 PNIANKRPSD--ARPP-NNLKNDYQHGPPGPGP 103
+ +RP ARP +L + PP P P
Sbjct: 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
Score = 31.8 bits (72), Expect = 0.75
Identities = 18/88 (20%), Positives = 24/88 (27%), Gaps = 1/88 (1%)
Query: 15 PLRYPPPSGPPMRYGNQNMAMQPRPGFTPTP-PGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
PP QP+P P P P P P P P P T +G +
Sbjct: 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
Query: 74 PNIANKRPSDARPPNNLKNDYQHGPPGP 101
+ P ++ P P
Sbjct: 2957 GAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
Score = 31.8 bits (72), Expect = 0.76
Identities = 19/91 (20%), Positives = 19/91 (20%), Gaps = 6/91 (6%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPV 72
P PP QP P P PP P P P P G
Sbjct: 2905 PERPPQPQAPPPPQP------QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA 2958
Query: 73 NPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
P R Q P P
Sbjct: 2959 VPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
Score = 30.7 bits (69), Expect = 1.6
Identities = 26/117 (22%), Positives = 34/117 (29%), Gaps = 3/117 (2%)
Query: 11 TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGP-RPGGPGVPPNQQPPYTGMRPS 69
+ + P+ PP+R + + F P P RP P PP QP P
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
Query: 70 GPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPES 126
P P RP P G P L + R VP+
Sbjct: 2928 QPQPPPPPPPRP-QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV-PRFRVPQP 2982
Score = 29.9 bits (67), Expect = 3.0
Identities = 25/95 (26%), Positives = 31/95 (32%), Gaps = 15/95 (15%)
Query: 11 TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGP--GVPPNQQPPYTGMRP 68
T+ P PPP GPP P P PGG PP++ P P
Sbjct: 2832 TSAQPTAPPPPPGPP-------------PPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
Query: 69 SGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
+ P +A S + L D PP P
Sbjct: 2879 ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQA 2913
Score = 29.5 bits (66), Expect = 3.9
Identities = 18/69 (26%), Positives = 23/69 (33%), Gaps = 3/69 (4%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
PP+ A P P P +P P PP PP + G V P
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGP-LPPPTSAQPTAPPPPPGPPPPSLPL--GGSVAPGGDV 2862
Query: 79 KRPSDARPP 87
+R +R P
Sbjct: 2863 RRRPPSRSP 2871
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 37.4 bits (86), Expect = 0.009
Identities = 29/91 (31%), Positives = 34/91 (37%), Gaps = 11/91 (12%)
Query: 19 PPPSGPPMRYG-NQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQ----PPYTGM--RPSGP 71
P P R+G M RPGF PP PG+PPNQ P GM RP
Sbjct: 165 PQQRINPQRFGFPMQPNMGMRPGFNQMPPH----MPGMPPNQMRPGFNPMPGMPPRPGFN 220
Query: 72 VNPNIANKRPSDARPPNNLKNDYQHGPPGPG 102
NPN+ P ++ P GP
Sbjct: 221 QNPNMMPNMNRPGFRPQPGGFNHPGTPMGPN 251
Score = 33.9 bits (77), Expect = 0.11
Identities = 27/85 (31%), Positives = 28/85 (32%), Gaps = 9/85 (10%)
Query: 12 NQAPLRYPPPSGPPMRYG-NQNMAMQPRPGFTPTP--------PGPRPGGPGVPPNQQPP 62
NQ P P MR G N M PRPGF P PG RP G P
Sbjct: 189 NQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPM 248
Query: 63 YTGMRPSGPVNPNIANKRPSDARPP 87
M+ NPN P P
Sbjct: 249 GPNMQQRPGFNPNQGMNPPPHMAGP 273
Score = 31.2 bits (70), Expect = 0.91
Identities = 23/73 (31%), Positives = 25/73 (34%), Gaps = 11/73 (15%)
Query: 1 MSQRFQGSQNTNQAPLRY-----PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGV 55
M R +QN N P P P G MQ RPGF P G P
Sbjct: 213 MPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQ-GMNP----- 266
Query: 56 PPNQQPPYTGMRP 68
PP+ P G P
Sbjct: 267 PPHMAGPRAGFPP 279
Score = 29.3 bits (65), Expect = 3.8
Identities = 16/65 (24%), Positives = 22/65 (33%), Gaps = 1/65 (1%)
Query: 25 PMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDA 84
P N QP P P+ P PN Q R P+ PN+ + +
Sbjct: 132 PTEEVNTQEPTQPAGVNVANNPQPQVQ-PQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQ 190
Query: 85 RPPNN 89
PP+
Sbjct: 191 MPPHM 195
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 36.7 bits (85), Expect = 0.018
Identities = 29/140 (20%), Positives = 38/140 (27%), Gaps = 12/140 (8%)
Query: 3 QRFQGSQNTNQAPLRYPPP--------SGPPMRYGNQNMAMQPRP-GFTPTPPGPRPGGP 53
Q Q +R P PP +P G+ P P GP
Sbjct: 369 AHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP 428
Query: 54 GVP--PNQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP-IKKKKKL 110
G P PN P +R N A K P P P +
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGG 488
Query: 111 ADKILPQKVRDLVPESQAYM 130
+K L Q + P+ Q +
Sbjct: 489 QNKKLAQVLASATPQMQKQV 508
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 35.6 bits (82), Expect = 0.032
Identities = 20/78 (25%), Positives = 24/78 (30%), Gaps = 8/78 (10%)
Query: 10 NTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPS 69
N + AP PP + A P P PP P PN PP P+
Sbjct: 38 NADPAPPPPPPSTAAA--------APAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPN 89
Query: 70 GPVNPNIANKRPSDARPP 87
P P + P P
Sbjct: 90 APPPPPVDPNAPPPPAPE 107
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 35.9 bits (83), Expect = 0.043
Identities = 34/160 (21%), Positives = 49/160 (30%), Gaps = 12/160 (7%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPY----TGMRP 68
+ P YP P + PP P PP QQP + M P
Sbjct: 198 EQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPP 257
Query: 69 SGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPESQA 128
P P + P P P PG + + ++ LV + Q
Sbjct: 258 PPPQPPQQQQQPPQPQAQPPPQ----NQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQG 313
Query: 129 YMDLLAFERKLDSTIMRKRLDI-QEALKRPMKQKRKLRIF 167
F +L ++R + QE KR KR+ +I
Sbjct: 314 QQRGPQFREQLVQLSQQQREALSQEEAKR---AKRRHKIV 350
Score = 31.3 bits (71), Expect = 0.92
Identities = 15/69 (21%), Positives = 20/69 (28%), Gaps = 2/69 (2%)
Query: 6 QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
Q + Q P + P PP + +P P PP P P
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPS-- 224
Query: 66 MRPSGPVNP 74
P+ P P
Sbjct: 225 QAPAQPPLP 233
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human
Gammaherpesvirus that infects and establishes latency
in B lymphocytes in vivo. The latent membrane protein 2
(LMP2) gene is expressed in latently infected B cells
and encodes two protein isoforms, LMP2A and LMP2B, that
are identical except for an additional N-terminal 119
aa cytoplasmic domain which is present in the LMP2A
isoform. LMP2A is thought to play a key role in either
the establishment or the maintenance of latency and/or
the reactivation of productive infection from the
latent state. The significance of LMP2B and its role in
pathogenesis remain unclear.
Length = 489
Score = 35.6 bits (82), Expect = 0.044
Identities = 15/56 (26%), Positives = 17/56 (30%), Gaps = 2/56 (3%)
Query: 33 MAMQP-RPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPP 87
+ MQP G P G G G P G P P + RPP
Sbjct: 4 LEMQPLGAG-GPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPP 58
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 35.0 bits (80), Expect = 0.060
Identities = 16/50 (32%), Positives = 19/50 (38%)
Query: 15 PLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYT 64
P + P P + QN A QP+ P P P QQPP T
Sbjct: 203 PTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQT 252
Score = 28.1 bits (62), Expect = 8.2
Identities = 15/88 (17%), Positives = 22/88 (25%), Gaps = 5/88 (5%)
Query: 15 PLRYPPPSGPPMRYGNQN-----MAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPS 69
P R P G P + + + A +P T P +P Q
Sbjct: 176 PPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQ 235
Query: 70 GPVNPNIANKRPSDARPPNNLKNDYQHG 97
P N + K + G
Sbjct: 236 PAQQPTPQNPAQQPPQTEQGHKRSREQG 263
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 34.1 bits (78), Expect = 0.083
Identities = 21/70 (30%), Positives = 25/70 (35%), Gaps = 4/70 (5%)
Query: 3 QRFQGSQNTNQAPLRYPPPSGPPMR-YGNQNMAMQPRPGFTPTPPGPRPGGP---GVPPN 58
Q + + N YP P P + Y NQ M G P G PGGP G +
Sbjct: 5 QNYPQAMNPQMGGGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGS 64
Query: 59 QQPPYTGMRP 68
PP G
Sbjct: 65 GGPPGGGEVA 74
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 34.7 bits (79), Expect = 0.093
Identities = 20/82 (24%), Positives = 29/82 (35%), Gaps = 3/82 (3%)
Query: 6 QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
Q +Q AP P +G + Q P+ + +P P QQP
Sbjct: 427 QPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE-PLYQQPQPVE 485
Query: 66 MRPSGPVNPNIANKRPSDARPP 87
+P P + +P ARPP
Sbjct: 486 QQPVVEPEPVVEETKP--ARPP 505
Score = 29.7 bits (66), Expect = 3.9
Identities = 21/113 (18%), Positives = 30/113 (26%), Gaps = 1/113 (0%)
Query: 15 PLRYPPPSG-PPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
P+ P P G P Q P P P P Q PY P P
Sbjct: 371 PVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQ 430
Query: 74 PNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPES 126
P N + + Q P + ++ + Q+ P+
Sbjct: 431 QPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQP 483
Score = 28.5 bits (63), Expect = 7.2
Identities = 25/98 (25%), Positives = 36/98 (36%), Gaps = 16/98 (16%)
Query: 3 QRFQGSQNTNQAPLRYPPPS---GPPMRYGNQNMAMQPRPGFT----PTPPGPRPGGPGV 55
Q++Q Q +Y P P +Y + P+P + P P P+ P
Sbjct: 763 QQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ 822
Query: 56 PPNQQPPYTGMRPSGPVNPNIANK-------RPSDARP 86
P QP Y +P PV P + R D+RP
Sbjct: 823 PVAPQPQY--QQPQQPVAPQPQDTLLHPLLMRNGDSRP 858
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 34.8 bits (80), Expect = 0.11
Identities = 18/85 (21%), Positives = 23/85 (27%), Gaps = 4/85 (4%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
PPP N++ + T P P G PP P P P +
Sbjct: 71 PPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDP--PPPTPPPASPPP 128
Query: 79 KRPSDARPPNNLKNDYQHGPPGPGP 103
D L+ GPP
Sbjct: 129 SPAPDLSEM--LRPVGSPGPPPAAS 151
Score = 30.9 bits (70), Expect = 1.5
Identities = 16/86 (18%), Positives = 20/86 (23%), Gaps = 5/86 (5%)
Query: 2 SQRFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQP 61
+ + P R P + P P G R G +P
Sbjct: 822 RSHTPDGGSESSGPARPPGAAARPPP-----ARSSESSKSKPAAAGGRARGKNGRRRPRP 876
Query: 62 PYTGMRPSGPVNPNIANKRPSDARPP 87
P RP P A P P
Sbjct: 877 PEPRARPGAAAPPKAAAAAPPAGAPA 902
Score = 29.8 bits (67), Expect = 3.5
Identities = 19/93 (20%), Positives = 21/93 (22%)
Query: 11 TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSG 70
+ A R P + A P P P P G
Sbjct: 813 SRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRR 872
Query: 71 PVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
P RP A PP G P P P
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRP 905
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 34.3 bits (78), Expect = 0.12
Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 7/121 (5%)
Query: 9 QNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRP 68
Q+ A P P+GP +R +++ P F P P P P P VP P RP
Sbjct: 489 QDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPVEP-VP---VPTVALERP 544
Query: 69 SGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPESQA 128
P P IA + P + ++ ++ P P P + +++ + ++R E+Q
Sbjct: 545 VCPAPPLIAMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLARLR---AEAQP 601
Query: 129 Y 129
Y
Sbjct: 602 Y 602
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 34.2 bits (78), Expect = 0.14
Identities = 12/53 (22%), Positives = 14/53 (26%), Gaps = 1/53 (1%)
Query: 14 APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGM 66
A P GP P+PG P R GG P +
Sbjct: 209 AGQSMYQPPGPYPNAMVGRQPFYPQPG-AVAGPPKRRGGHKAPRGHRASQAAG 260
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 32.0 bits (73), Expect = 0.17
Identities = 14/72 (19%), Positives = 19/72 (26%)
Query: 16 LRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPN 75
+ P Y A P G PP P G+ P+ P +
Sbjct: 34 AQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNS 93
Query: 76 IANKRPSDARPP 87
P + PP
Sbjct: 94 GYMADPQEPPPP 105
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 33.9 bits (77), Expect = 0.17
Identities = 23/100 (23%), Positives = 28/100 (28%), Gaps = 11/100 (11%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTP---------PGPRPGGPGVPPNQQPPY 63
+AP PP+ PP R A P PG P +PP
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 758
Query: 64 TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
+ P P A P+ PP Q P P
Sbjct: 759 AAPGRARP--PAAAPGAPTPQPPPQAPPAPQQRPRGAPTP 796
Score = 30.0 bits (67), Expect = 2.8
Identities = 21/104 (20%), Positives = 28/104 (26%), Gaps = 9/104 (8%)
Query: 6 QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
+ + P P + PP +P P P P PG P Q PP
Sbjct: 728 AAAPGRARPPAAAPGRARPP---AAAPGRARP-PAAAPGRARPPAAAPGAPTPQPPP--- 780
Query: 66 MRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKK 109
+ P P+ PP Q P + K
Sbjct: 781 --QAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTK 822
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.0 bits (73), Expect = 0.42
Identities = 27/135 (20%), Positives = 39/135 (28%), Gaps = 14/135 (10%)
Query: 4 RFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG--PRPGGPGVPPNQQP 61
+ G+Q P PP P GP P Q
Sbjct: 57 PPVAQLPQPLPQPPPTQAL-QALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQTE 115
Query: 62 P---YTGMRPSG-PVNP-NIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILP 116
P Y P NP N +P+ R L+ Y G P G + +++ A K
Sbjct: 116 PGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY--GAPASGQLPSQQQSAQKNDE 173
Query: 117 QKVRDL----VPESQ 127
+++ P Q
Sbjct: 174 SQLQQQPNGETPPQQ 188
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 31.8 bits (72), Expect = 0.43
Identities = 13/47 (27%), Positives = 15/47 (31%), Gaps = 4/47 (8%)
Query: 12 NQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPN 58
Q P+G P Q RPG P P +PPN
Sbjct: 110 TQYQGHAGAPAGHPG----GGGPQQFRPGAGQPPGMQHMPAPALPPN 152
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 32.5 bits (74), Expect = 0.48
Identities = 21/85 (24%), Positives = 22/85 (25%), Gaps = 19/85 (22%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
P PP R T P PRPG P P P +
Sbjct: 94 EPAPPPPH---------ARR---TSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPT 141
Query: 79 KRPSDARPPNNLKNDYQHGPPGPGP 103
RP A P Q PG P
Sbjct: 142 ARP--AYP-----AYQQRPEPGAWP 159
Score = 29.0 bits (65), Expect = 5.2
Identities = 18/77 (23%), Positives = 23/77 (29%), Gaps = 3/77 (3%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPV 72
+ R P + RP P GGPG P P +RPS P
Sbjct: 203 EYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPG 262
Query: 73 NPNIANKRPSDARPPNN 89
+P+ A P
Sbjct: 263 PL---AAQPAPAPGPGE 276
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 32.0 bits (73), Expect = 0.49
Identities = 20/108 (18%), Positives = 31/108 (28%), Gaps = 6/108 (5%)
Query: 7 GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPP---Y 63
S + +P PS P + ++ P P + P P P N PP
Sbjct: 184 PSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP--SSFQSDTPPPSPESPTNPSPPPGPA 241
Query: 64 TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLA 111
P P ++ +P+ I K +K A
Sbjct: 242 APPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT-LDDDAIAKAQKHA 288
Score = 31.2 bits (71), Expect = 0.86
Identities = 20/100 (20%), Positives = 26/100 (26%), Gaps = 17/100 (17%)
Query: 19 PPPSGPPMRYGNQNMA---MQPRPGFTPTPPGPRPGGPGVPPNQQP-------------- 61
P GPP+ +++ F P P PP+ P
Sbjct: 147 DPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSS 206
Query: 62 PYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGP 101
P P P + PS P N PP P
Sbjct: 207 PSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPP 246
Score = 30.0 bits (68), Expect = 2.0
Identities = 17/83 (20%), Positives = 23/83 (27%), Gaps = 1/83 (1%)
Query: 20 PPSGPPMRYGNQNMAMQPRPGFTPTPPGPR-PGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
P S P + + + P P P P P + Q P P NP+
Sbjct: 179 PASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPP 238
Query: 79 KRPSDARPPNNLKNDYQHGPPGP 101
+ PP P P
Sbjct: 239 GPAAPPPPPVQQVPPLSTAKPTP 261
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 32.4 bits (73), Expect = 0.52
Identities = 26/96 (27%), Positives = 36/96 (37%), Gaps = 14/96 (14%)
Query: 6 QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQP--RPGFTPTPPGPRPGGPGVPPNQQPPY 63
Q + + A P S P Y N + P P + TP PPN PPY
Sbjct: 404 QSNAGFSNAGYSNPGNSNPG--YNNAPNSNTPYNNPPNSNTPYSN-------PPNSNPPY 454
Query: 64 TGMRPSGP--VNPNIANKRPSDARP-PNNLKNDYQH 96
+ + S N ++N PS A+ + YQH
Sbjct: 455 SNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQH 490
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 32.1 bits (73), Expect = 0.54
Identities = 21/89 (23%), Positives = 30/89 (33%), Gaps = 17/89 (19%)
Query: 18 YPPPS---GPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNP 74
P G P+R + A++ P PRPGGP PP + + P
Sbjct: 672 ARPSRVARGDPVRPTAHHAALRA-------PQAPRPGGPPGGGGGLPPPPDLPAAAGPAP 724
Query: 75 NIANKRPSDARPPNNLKNDYQHGPPGPGP 103
++ S PP + PPG
Sbjct: 725 CGSSLIASPTAPP-------EPEPPGAEQ 746
Score = 31.7 bits (72), Expect = 0.76
Identities = 20/88 (22%), Positives = 23/88 (26%), Gaps = 10/88 (11%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
P R R G P P P G+P + P R PV P +
Sbjct: 637 PVFKTALPRPDYN------RGGEAGGPGVPGPVPVGMPAHTARPSRVAR-GDPVRPTAHH 689
Query: 79 KR---PSDARPPNNLKNDYQHGPPGPGP 103
P RP PP P
Sbjct: 690 AALRAPQAPRPGGPPGGGGGLPPPPDLP 717
Score = 31.3 bits (71), Expect = 1.0
Identities = 15/79 (18%), Positives = 18/79 (22%), Gaps = 16/79 (20%)
Query: 10 NTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPS 69
+ A LR P P PG P P P P +
Sbjct: 686 TAHHAALRAPQAPRPG-----------GPPGGGGGLPPPPDLPAAAGP---APCGSSLIA 731
Query: 70 GPVNPNIANKRPSDARPPN 88
P P P A +
Sbjct: 732 SPTAP--PEPEPPGAEQAD 748
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 31.1 bits (70), Expect = 1.2
Identities = 24/102 (23%), Positives = 29/102 (28%), Gaps = 7/102 (6%)
Query: 2 SQRFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQP 61
Q QG N P M + P PP P PG P +Q P
Sbjct: 391 IQGNQGGLGANPMQ-----QGQPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPP 445
Query: 62 PY--TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGP 101
GM PS P + + S + P GP
Sbjct: 446 QSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQDMVSPGGP 487
>gnl|CDD|219914 pfam08577, PI31_Prot_C, PI31 proteasome regulator. PI31 is a
cellular regulator of proteasome formation and of
proteasome-mediated antigen processing.
Length = 68
Score = 28.1 bits (63), Expect = 1.6
Identities = 20/67 (29%), Positives = 22/67 (32%), Gaps = 3/67 (4%)
Query: 7 GSQNTNQAPLRYPPPSGPPMRYGNQNMA--MQPRPGFTPTPPGPRPGGPGVPPNQQPPYT 64
G ++ L P P P N M P P PR GG PP PP
Sbjct: 3 GDRDLYPPGLGPPDPFDPLPAPLGGNGQGGMIFDP-NRPGFGPPRGGGGDGPPPGVPPGA 61
Query: 65 GMRPSGP 71
P GP
Sbjct: 62 RYDPFGP 68
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 30.8 bits (69), Expect = 1.7
Identities = 25/89 (28%), Positives = 33/89 (37%), Gaps = 6/89 (6%)
Query: 44 TPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQH---GPPG 100
TPP P G PP PP +GP + P D PP+ GPP
Sbjct: 551 TPPKVSPSDRG-PPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPA 609
Query: 101 PGPIKKKKKLA--DKILPQKVRDLVPESQ 127
GP +K+ + + P VR + E
Sbjct: 610 SGPHEKQPPSSAPRDMAPSVVRMFLRERL 638
>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
family. Included in this family of heterogeneous
ribonucleoproteins are PTB (polypyrimidine tract binding
protein ) and hnRNP-L. These proteins contain four RNA
recognition motifs (rrm: pfam00067).
Length = 481
Score = 30.2 bits (68), Expect = 1.9
Identities = 17/87 (19%), Positives = 25/87 (28%), Gaps = 1/87 (1%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
P G +Q + P G P G GP + +
Sbjct: 191 PDLPGRRDPGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMGPPHGPPSR 250
Query: 79 KRPS-DARPPNNLKNDYQHGPPGPGPI 104
RP+ +A P + Y GPG +
Sbjct: 251 YRPAYEAAPLAPAISSYGPAGGGPGSV 277
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 30.3 bits (69), Expect = 1.9
Identities = 26/110 (23%), Positives = 32/110 (29%), Gaps = 19/110 (17%)
Query: 6 QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
Q +N A PP PP P P T T P P P P PP +
Sbjct: 513 QSGSASNTAKTPPPPQKSPP-----PPAPTPPLPQPTATAPPPTP--PPPPPTATQASSN 565
Query: 66 MRPSGPVNPNIANKRPS--DARPPNNLKNDYQHGPPGPGPIKKK-KKLAD 112
P + + P P + + I K K LAD
Sbjct: 566 APAQIPADSSPPPPIPEEPTPSPTKDSSPE---------EIDKAAKNLAD 606
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 29.6 bits (67), Expect = 2.0
Identities = 12/56 (21%), Positives = 17/56 (30%), Gaps = 5/56 (8%)
Query: 1 MSQRFQGSQNTNQAPLRYPPPSGPPMR--YGNQNMAMQPRPGFTPTPPGPRPGGPG 54
+S F G P PP R +G+ + G P +PG
Sbjct: 83 LSGMFGGGAPRPPPAAPAVQPPAPPARPGWGSGG---PSQQGAGQQPGYAQPGPGS 135
>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
This family consists of several Neisseria meningitidis
TspB virulence factor proteins.
Length = 502
Score = 30.3 bits (68), Expect = 2.0
Identities = 21/79 (26%), Positives = 23/79 (29%), Gaps = 11/79 (13%)
Query: 12 NQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTP---PGPRPGGPGVPPNQQPPYTGMRP 68
PL P+ P N RP P P P P G P G RP
Sbjct: 328 EAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQP--------GTRP 379
Query: 69 SGPVNPNIANKRPSDARPP 87
P P+ N R R
Sbjct: 380 DSPAVPDRPNGRHRKERKE 398
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.2 bits (68), Expect = 2.2
Identities = 20/71 (28%), Positives = 25/71 (35%), Gaps = 6/71 (8%)
Query: 19 PPPSGPPMRYGNQNMAMQP--RPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNI 76
P PS P N+ + R TP P PRP P VP + R + PV+
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD--- 439
Query: 77 ANKRPSDARPP 87
K P
Sbjct: 440 -EKPKYTPPAP 449
>gnl|CDD|223631 COG0557, VacB, Exoribonuclease R [Transcription].
Length = 706
Score = 30.1 bits (68), Expect = 2.3
Identities = 20/98 (20%), Positives = 31/98 (31%), Gaps = 8/98 (8%)
Query: 113 KILPQKVRDLVPESQAYMDLLAFERKLDSTIMRKRLDIQEALKRPMKQKRKLRIFISNTF 172
++LP R E A + +L ER + + R P ++ I I
Sbjct: 114 ELLPSDKRGRFKE-AAVVRIL--ERANSALVGELRPSQGIGRVLPDDKRLPFLIAIPPEQ 170
Query: 173 YPAKESGEGEEGSVASWELR---VEGRLLED--SKNDP 205
P E G + W G ++E DP
Sbjct: 171 APGAEEGHLVVVEITRWPDTSRPPFGEVVEVLGDLGDP 208
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 30.0 bits (67), Expect = 2.4
Identities = 27/133 (20%), Positives = 41/133 (30%), Gaps = 11/133 (8%)
Query: 8 SQNTNQAPLRYPPPSGPPMRYGN-QNMAMQPRPGFTPTPPGPR-PGGPGVPPNQQPPYTG 65
SQ ++PP + + R P R P PP
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET 298
Query: 66 MRPS-GPVNPNIANKRPSDARPP--------NNLKNDYQHGPPGPGPIKKKKKLADKILP 116
RP+ P + P + PP NL + + PP P I + ++ LP
Sbjct: 299 GRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSICYGVGIYNEALP 358
Query: 117 QKVRDLVPESQAY 129
+ +VP Y
Sbjct: 359 RGCDIVVPLCSTY 371
>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family.
Length = 424
Score = 29.4 bits (66), Expect = 3.4
Identities = 15/79 (18%), Positives = 24/79 (30%), Gaps = 9/79 (11%)
Query: 4 RFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPY 63
R +Q ++ L Y P GP R ++ + PY
Sbjct: 134 RESSAQRSSAQHLFYIIPGGPSWR---TSIKPFSSSHYGSHN----SSAYSDHLRPLQPY 186
Query: 64 TGM--RPSGPVNPNIANKR 80
+G P GP +K+
Sbjct: 187 SGELPPPLGPNWQASNSKK 205
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 29.6 bits (66), Expect = 3.5
Identities = 30/134 (22%), Positives = 43/134 (32%), Gaps = 16/134 (11%)
Query: 37 PRPGFTPTPPGP----RPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKN 92
T GP RP P P P + M PV + P +
Sbjct: 367 AVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGT 426
Query: 93 DYQHGPPGPGPIKKKKKLADKILPQKVRDLVPESQAYMDLLAFERKLDSTIMRKR-LDIQ 151
Y P GP P + P + P S A M ++ RKR +++
Sbjct: 427 SYGPEPVGPVPPQ----------PTNPYVM-PISMANMVYPGHPQEHGHERKRKRGGELK 475
Query: 152 EALKRPMKQKRKLR 165
E L +K +KL+
Sbjct: 476 EELIETLKLVKKLK 489
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 29.4 bits (66), Expect = 3.6
Identities = 10/33 (30%), Positives = 11/33 (33%)
Query: 39 PGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGP 71
PG P GP PP PP +P
Sbjct: 906 PGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAA 938
Score = 29.1 bits (65), Expect = 4.9
Identities = 11/35 (31%), Positives = 13/35 (37%)
Query: 37 PRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGP 71
G + P P G P P PP T P+ P
Sbjct: 905 APGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAP 939
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 29.3 bits (66), Expect = 3.8
Identities = 17/63 (26%), Positives = 18/63 (28%), Gaps = 11/63 (17%)
Query: 42 TPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPN-IANKRPSDARPPNNLKNDYQHGPPG 100
P PP G P PP P P P A RP+ A P P
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAP----------PAA 430
Query: 101 PGP 103
P
Sbjct: 431 AAP 433
>gnl|CDD|205967 pfam13794, MiaE_2, tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE)-like.
Length = 185
Score = 28.7 bits (64), Expect = 4.1
Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 5/67 (7%)
Query: 184 GSVASWELRVEGRLLEDSKNDPNKSGEGEEGSVASWELR----VEGRLLEDSKNDPNKVK 239
G++A EL RL ED++ P +A E R + R L D
Sbjct: 12 GALAYGELSAFERLAEDARYAPTLHDRAVLARMAVAEFRHYELLSDR-LAARGIDAEDAM 70
Query: 240 RKFSSFF 246
F +
Sbjct: 71 LPFVAAV 77
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 29.2 bits (66), Expect = 4.8
Identities = 20/92 (21%), Positives = 22/92 (23%), Gaps = 1/92 (1%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAM-QPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGP 71
AP PP P A QP P TPP + P P Q
Sbjct: 676 AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAA 735
Query: 72 VNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
+P P D P P
Sbjct: 736 DDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPA 767
Score = 28.8 bits (65), Expect = 6.2
Identities = 14/90 (15%), Positives = 15/90 (16%), Gaps = 8/90 (8%)
Query: 14 APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
P + P A P P P P P
Sbjct: 389 GGAGAPAAAAPSAAAAAPA-AAPAPAAAAPAAAAAPAPAAAPQPAPAPA-PAPAPPSPAG 446
Query: 74 PNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
A PS P P P
Sbjct: 447 NAPAGGAPSPPPAAA------PSAQPAPAP 470
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 28.9 bits (64), Expect = 4.9
Identities = 19/77 (24%), Positives = 26/77 (33%), Gaps = 7/77 (9%)
Query: 11 TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSG 70
+QA R P P+ P ++ +P PTP P P+Q P
Sbjct: 46 PHQAASRAPDPAVAPTSAASR----KPDLAQAPTPAASEKFDPAPAPHQA---ASRAPDP 98
Query: 71 PVNPNIANKRPSDARPP 87
V P +A DA
Sbjct: 99 AVAPQLAAAPKPDAAEA 115
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 28.3 bits (63), Expect = 5.0
Identities = 25/94 (26%), Positives = 31/94 (32%), Gaps = 13/94 (13%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGG----PGVPPNQQPPYTGMRPSGPVNP 74
P P +G Q M P P PGG P +PP P PS V
Sbjct: 53 SSPQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPG 112
Query: 75 NIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKK 108
+ +DA+ +N G IKK K
Sbjct: 113 STPLSMETDAKSSDNKDK---------GSIKKSK 137
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 29.0 bits (65), Expect = 5.2
Identities = 17/64 (26%), Positives = 31/64 (48%)
Query: 98 PPGPGPIKKKKKLADKILPQKVRDLVPESQAYMDLLAFERKLDSTIMRKRLDIQEALKRP 157
PP P IK+KKK K+ + P ++ + LL ER D ++ + + ++P
Sbjct: 26 PPPPHEIKRKKKRKGPDAASKLPKVTPNTKCRLRLLKLERIKDYLLLEEEFITNQEAQKP 85
Query: 158 MKQK 161
++K
Sbjct: 86 AQEK 89
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 28.8 bits (65), Expect = 5.4
Identities = 21/83 (25%), Positives = 25/83 (30%), Gaps = 7/83 (8%)
Query: 17 RYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNI 76
+Y PP + QP P PRP P P QQP Y P P
Sbjct: 91 QYQPPYASAQ---PRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQ----PQPEQPLQ 143
Query: 77 ANKRPSDARPPNNLKNDYQHGPP 99
P A P + + Q
Sbjct: 144 QPVSPQVAPAPQPVHSAPQPAQQ 166
>gnl|CDD|215915 pfam00429, TLV_coat, ENV polyprotein (coat polyprotein).
Length = 560
Score = 29.0 bits (65), Expect = 5.8
Identities = 12/69 (17%), Positives = 17/69 (24%), Gaps = 4/69 (5%)
Query: 7 GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG---PRPGGPGVPPNQ-QPP 62
G Q + P P + + Q TPP G V P + +
Sbjct: 46 GGQVSWSPPCTPPQSVCSGCPDLSAYLTDQSLWPPCVTPPKRTPHANGSFYVCPGECRTR 105
Query: 63 YTGMRPSGP 71
G
Sbjct: 106 ANRRHCGGY 114
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 28.7 bits (64), Expect = 6.2
Identities = 21/106 (19%), Positives = 28/106 (26%), Gaps = 7/106 (6%)
Query: 4 RFQGSQNTNQAPLRYPPPSG-----PPMRYGNQNMAMQPRPGFTPT-PPGPRPGGPGVPP 57
R Q + R G PP + N++ R P P + G P
Sbjct: 453 RGQRGPRSQPQNQRRQQNRGRQSSQPPRQQQNRSNQNNQRQSQGPNQGPRGQGGYNLRPR 512
Query: 58 NQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
QP G NPN ++ R G P
Sbjct: 513 TYQPQRYGGGQGRRWNPNPY-RQSGQGRSQQQQPQPEARGNQSRTP 557
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 28.8 bits (64), Expect = 6.3
Identities = 15/30 (50%), Positives = 15/30 (50%), Gaps = 5/30 (16%)
Query: 33 MAMQPRPGFTPTPPGPRPGGPGVPPNQQPP 62
MA P PG P PP P PG P QPP
Sbjct: 1 MASLP-PGNPPPPPPP----PGFEPPSQPP 25
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 28.0 bits (62), Expect = 6.5
Identities = 13/37 (35%), Positives = 13/37 (35%), Gaps = 8/37 (21%)
Query: 21 PSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPP 57
G P Y P PP P PG PG PP
Sbjct: 151 APGVPPPYNP--------PMLMMKPPPPSPGPPGAPP 179
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 28.7 bits (64), Expect = 6.7
Identities = 11/54 (20%), Positives = 13/54 (24%)
Query: 34 AMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPP 87
A QP P P P P + P +P R P
Sbjct: 14 AEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKP 67
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 28.2 bits (63), Expect = 6.9
Identities = 16/49 (32%), Positives = 17/49 (34%), Gaps = 2/49 (4%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPP--GPRPGGPGVPPNQ 59
Q P P P + P P PP G GGP PPNQ
Sbjct: 134 QPPKPDPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQ 182
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 28.6 bits (64), Expect = 7.1
Identities = 24/133 (18%), Positives = 34/133 (25%), Gaps = 17/133 (12%)
Query: 15 PLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNP 74
P+ P + P + P P+ P PP + P V
Sbjct: 381 PVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQ---PAGTPPTVSVDPPAAVPV 437
Query: 75 NIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKIL-PQK-----VRDLVPESQA 128
N + P RP Q P+ K L L P + + E+
Sbjct: 438 NPPSTAPQAVRPA-------QFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPT 490
Query: 129 YMDLLAF-ERKLD 140
F E L
Sbjct: 491 GTQKEIFTEEDLQ 503
>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component
[Chromatin structure and dynamics].
Length = 1163
Score = 28.7 bits (64), Expect = 7.4
Identities = 19/95 (20%), Positives = 27/95 (28%), Gaps = 12/95 (12%)
Query: 8 SQNTNQAPLRYPPPSGPPMRYGNQN---MAMQPRPGFTPTPPGPRPGGPGV-PPNQQPPY 63
N + L P GP Q+ +A T P P P P+ P
Sbjct: 201 LPQPNGSRLHVTTPQGPLSSPPVQSSYYVAPCNHDQRTSHPTLPSDSQPEPSAPSHMPSD 260
Query: 64 TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGP 98
+ N I N +K +Q+ P
Sbjct: 261 ARGKHQVDFNQAIIFV--------NKVKVRFQNNP 287
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 28.5 bits (63), Expect = 7.7
Identities = 26/97 (26%), Positives = 33/97 (34%), Gaps = 13/97 (13%)
Query: 13 QAPLRYPPPSGPPMRYGNQNMAMQPRP------GFTPTPPGPRPGGPGVPPNQQPPYTGM 66
P S PP +G + P P + TPP P PP +QP
Sbjct: 282 GPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQP----- 336
Query: 67 RPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
P P P+I K P P ++H P GP
Sbjct: 337 LPPAPSMPHI--KPPPTTPIPQLPNQSHKHPPHLQGP 371
>gnl|CDD|173839 cd01150, AXO, Peroxisomal acyl-CoA oxidase. Peroxisomal acyl-CoA
oxidases (AXO) catalyze the first set in the peroxisomal
fatty acid beta-oxidation, the alpha,beta
dehydrogenation of the corresponding trans-enoyl-CoA by
FAD, which becomes reduced. In a second oxidative
half-reaction, the reduced FAD is reoxidized by
molecular oxygen. AXO is generally a homodimer, but it
has been reported to form a different type of oligomer
in yeast. There are several subtypes of AXO's, based on
substrate specificity. Palmitoyl-CoA oxidase acts on
straight-chain fatty acids and prostanoids; whereas, the
closely related Trihydroxycoprostanoly-CoA oxidase has
the greatest activity for 2-methyl branched side chains
of bile precursors. Pristanoyl-CoA oxidase, acts on
2-methyl branched fatty acids. AXO has an additional
domain, C-terminal to the region with similarity to
acyl-CoA dehydrogenases, which is included in this
alignment.
Length = 610
Score = 28.4 bits (64), Expect = 8.0
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 296 LLDYQPLQFKLDPRLARLLGVHTQTR 321
+LDYQ Q++L P+LA H +
Sbjct: 328 ILDYQLQQYRLFPQLAAAYAFHFAAK 353
>gnl|CDD|215026 smart01104, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
The C-terminal domain of the transcription elongation
factor protein Spt5 is necessary for binding to Spt4 to
form the functional complex that regulates early
transcription elongation by RNA polymerase II. The
complex may be involved in pre-mRNA processing through
its association with mRNA capping enzymes. This CTD
domain carries a regular nonapeptide repeat that can be
present in up to 18 copies, as in S. pombe. The repeat
has a characteristic TPA motif.
Length = 121
Score = 27.1 bits (60), Expect = 8.3
Identities = 12/48 (25%), Positives = 14/48 (29%), Gaps = 1/48 (2%)
Query: 7 GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG-PRPGGP 53
G + S G A P G TP+ G PGG
Sbjct: 67 GGASAWGNKSSEGSASSWAAGPGGAYGAPTPGYGGTPSAYGPATPGGG 114
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 28.0 bits (62), Expect = 8.6
Identities = 13/70 (18%), Positives = 15/70 (21%), Gaps = 2/70 (2%)
Query: 19 PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRP--SGPVNPNI 76
P P R T P P Q T P P
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGG 264
Query: 77 ANKRPSDARP 86
P++A P
Sbjct: 265 GEAPPANATP 274
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 28.3 bits (63), Expect = 8.7
Identities = 12/48 (25%), Positives = 16/48 (33%), Gaps = 4/48 (8%)
Query: 14 APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQP 61
AP +G P+ + +PR P P P V P P
Sbjct: 380 APTAAATAAGAPL----PDFDPRPRGPPAPEPARSAEAPPLVAPAAAP 423
>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
[Energy production and conversion].
Length = 769
Score = 28.1 bits (63), Expect = 9.3
Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 9/56 (16%)
Query: 221 LRVEGRLLEDSKNDPNKVKRKFSSFFKSLVIELD------KDLYGPDNHLVEWHRT 270
+R +GRL+ + + K +F +IEL+ ++++GP H+V + R
Sbjct: 440 MRSKGRLVHQAAAPNSLQK---GTFVAPTLIELENLDELQREVFGPVLHVVRYKRD 492
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 28.1 bits (62), Expect = 9.6
Identities = 12/63 (19%), Positives = 18/63 (28%)
Query: 27 RYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARP 86
G+ N A P P P + P+ P P P+ A P
Sbjct: 199 IIGDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAP 258
Query: 87 PNN 89
++
Sbjct: 259 VSS 261
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.138 0.421
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,943,795
Number of extensions: 2075022
Number of successful extensions: 2892
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2454
Number of HSP's successfully gapped: 229
Length of query: 392
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 293
Effective length of database: 6,546,556
Effective search space: 1918140908
Effective search space used: 1918140908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.1 bits)