RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15122
(927 letters)
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 203 bits (518), Expect = 7e-58
Identities = 77/201 (38%), Positives = 124/201 (61%), Gaps = 2/201 (0%)
Query: 591 PEMYTDCHKNIINGKQACEEVKQSKKLAKILELILLMGNYMNSGSRNGGAFGFEINFLTK 650
E + ++ + A EE+++S+K K+LELIL +GNYMNSG+R G A GF+++ L K
Sbjct: 173 EEEVEELKPSLETLEAASEELRESRKFKKLLELILALGNYMNSGTRRGNAKGFKLSSLLK 232
Query: 651 LSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVIQNSIRQMENNI 710
LS TK +NKTTLLHYLV I +K P+ L F EL HV++AA+V + ++ ++++E +
Sbjct: 233 LSDTKSTDNKTTLLHYLVKIIREKLPDLLDFSSELSHVEKAAKVDLEQLEKDVKELEKGL 292
Query: 711 KNLETDIQNCKQAPVANENDKFLEIMEPFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTF 770
K LE +++ + +DKF+E M+ F +E +K+ L ++ K M L+ +L E++
Sbjct: 293 KKLERELELSALDE--HPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKELTEYFGE 350
Query: 771 DKNIYTLEEFFTDIKTFKDSF 791
D + EEFF ++ F F
Sbjct: 351 DPKETSPEEFFKILRDFLRMF 371
>gnl|CDD|219000 pfam06367, Drf_FH3, Diaphanous FH3 Domain. This region is found in
the Formin-like and and diaphanous proteins.
Length = 197
Score = 189 bits (482), Expect = 3e-55
Identities = 88/223 (39%), Positives = 129/223 (57%), Gaps = 28/223 (12%)
Query: 267 DGHDKVIKAITMSGEL-KGKERFQPVVQGLMVKGN-NEALRTACLQLINAIVATPDDLEF 324
GH+KV++A E+ + + RF+ +V L N N + A +Q INA+V +P+DL F
Sbjct: 1 GGHEKVLEATLNFKEVCRERGRFRSLVGALDSSENDNVEYKVATMQFINALVNSPEDLNF 60
Query: 325 RLHLRNEIMRVGLYDLLDALEKDASEDVSVQLKVFIEHKEEDYYEFIQRFDNVRMEIDDV 384
RLHLR+E R+GL +LD L + +E + QL++F E++EED E ++RFD+V +++DD
Sbjct: 61 RLHLRSEFTRLGLDRILDKLRELENEKLDDQLQIFEENREEDVEELLERFDDVNVDLDDP 120
Query: 385 NDCFETVRNMVMDSACEPYLLSILQHLLFIRDDQNMVMDSACEPYLLSILQHLLFIRDDQ 444
+ FE ++ N V D+ EP+ LSILQHLL IRDD+
Sbjct: 121 VELFELLK--------------------------NKVKDTEAEPHFLSILQHLLLIRDDE 154
Query: 445 YVRLAYYKLLEECVSQIVLHRGGCDPDFRSSRRFQLDVQPLVE 487
Y+KLLEE VSQIVLHR DPDF + +D+ L++
Sbjct: 155 EELPQYWKLLEELVSQIVLHRTKPDPDFDERKNLNIDINRLLD 197
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 177 bits (452), Expect = 9e-49
Identities = 88/264 (33%), Positives = 127/264 (48%), Gaps = 45/264 (17%)
Query: 591 PEMYTDCHKNIINGKQACEEVKQSKKLAKILELILLMGNYMNSGSRNGGAFGFEINFLTK 650
E D I + ACEE+++SKK K+LELIL +GNYMN GSR G A+GF+++ L K
Sbjct: 173 EEEVEDLKPQIEKVEAACEELRESKKFRKLLELILAIGNYMNGGSRRGQAYGFKLSSLLK 232
Query: 651 LSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVIQNSIRQMENNI 710
LS K +NKTTLLH+LV I +K+ +
Sbjct: 233 LSDVKSADNKTTLLHFLVKIIRKKY----------------------------------L 258
Query: 711 KNLETDIQNCKQAPVANENDKFLEIMEPFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTF 770
L N +DKF+E+M+PF K ++K L ++ T + L E+Y
Sbjct: 259 GGLS---------DPENLDDKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLVEYYGE 309
Query: 771 DKNIYTLEEFFTDIKTFKDSFYQAWQENIKLREAEEKSIRVREAREKAENEK-KDKAARK 829
D + EEFF D F F +A +ENIK E EE+ + + +E E E+ + +
Sbjct: 310 DPKDTSPEEFFKDFNEFLKEFSKAAEENIKKEEEEEERRK-KLVKETTEYEQSSSRQKER 368
Query: 830 KALIDMTTDQTQQGVMDSLLEALQ 853
+D ++ GV+DSLLE L
Sbjct: 369 NPSMDFEVERDFLGVLDSLLEELG 392
>gnl|CDD|219001 pfam06371, Drf_GBD, Diaphanous GTPase-binding Domain. This domain
is bound to by GTP-attached Rho proteins, leading to
activation of the Drf protein.
Length = 187
Score = 140 bits (354), Expect = 6e-38
Identities = 55/195 (28%), Positives = 102/195 (52%), Gaps = 14/195 (7%)
Query: 75 LEKLNPEQLNQKFEDMLNDMNLSDEKKEPLRRQPLANKKKMLLMHYKGT------VTSYE 128
L+K + +++ + FE+++ +MNL +EK+ P+R +P+ K ++++ + S
Sbjct: 1 LQKPDDDEILKLFEELMEEMNLPEEKRRPMRAKPIEKKWQLVVQYLLTNFQGESLSKSLS 60
Query: 129 NKSKFDKPIEYIQYLSQPELSVNKMYSCIESLRIALTNHPLSWVNEFVLQDNKNFRKYPI 188
++ P Y++ L + C+ESLR+AL P+SWV F+ + + I
Sbjct: 61 KSNETGSPEYYVKKLKDGSIDQK----CLESLRVALRTQPVSWVKRFIGAEGLS-ALLNI 115
Query: 189 AFLYPRFPSRSRNDSRYDRVQYECVRCIRAIMNNTVGLKQMFGQKEALTIVARSLDPNKP 248
R S S D +YE ++C++A+MNN G+ + G E + ++ARSLD +P
Sbjct: 116 LSKIVRKKSESDEDLD---REYEILKCLKALMNNKFGIDHVLGHPEVILLLARSLDSPRP 172
Query: 249 TVMLEAVKVLAAVCL 263
A+++L A+CL
Sbjct: 173 KTRKLALELLTALCL 187
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 61.5 bits (149), Expect = 5e-11
Identities = 31/49 (63%), Positives = 32/49 (65%), Gaps = 8/49 (16%)
Query: 551 AAGIPTGP-----KGGPPPPPPPPGGMG-PPPPPMPGMPG--PPPPPMP 591
AAGIP P G PPPPPP PGG PPPPP+PG PG PPPPP P
Sbjct: 81 AAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFP 129
Score = 60.3 bits (146), Expect = 2e-10
Identities = 29/51 (56%), Positives = 29/51 (56%), Gaps = 8/51 (15%)
Query: 551 AAGIPTGPKGGP-----PPPPPPPGGMG--PPPPPMPGMPG-PPPPPMPEM 593
AGIP P P PPPPP PGG G PPPPP PG PG PPPPP
Sbjct: 93 GAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGS 143
Score = 58.8 bits (142), Expect = 5e-10
Identities = 25/44 (56%), Positives = 26/44 (59%), Gaps = 7/44 (15%)
Query: 554 IPTGPKGGPPPPPPPPGGMG-PPPPPMPGMPGPPP-----PPMP 591
+P GP G PPPPPP PG G PPPPP G P PPP P P
Sbjct: 115 LPGGP-GVPPPPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAP 157
Score = 58.0 bits (140), Expect = 9e-10
Identities = 27/48 (56%), Positives = 29/48 (60%), Gaps = 7/48 (14%)
Query: 551 AAGIPTGP----KGGPPPPPPPPGGMG--PPPPPMPGMPG-PPPPPMP 591
A IP P G PPPPP PGG G PPPPP+PG PPPPP+P
Sbjct: 69 ATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLP 116
Score = 56.9 bits (137), Expect = 2e-09
Identities = 26/46 (56%), Positives = 28/46 (60%), Gaps = 6/46 (13%)
Query: 552 AGIPTGPK----GGPPPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
A IP P G PPPPP PG PPPPP+PG G PPPPP+P
Sbjct: 46 AAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91
Score = 56.5 bits (136), Expect = 4e-09
Identities = 27/48 (56%), Positives = 29/48 (60%), Gaps = 7/48 (14%)
Query: 551 AAGIPTGP----KGGPPPPPPPPGGMG-PPPPPMPGMPG--PPPPPMP 591
AGIP P PPPPP PG G PPPPP+PG G PPPPP+P
Sbjct: 57 VAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLP 104
Score = 56.1 bits (135), Expect = 4e-09
Identities = 23/40 (57%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 555 PTGPKGGPPPPPPPPGGMG-PPPPPMPGMPG-PPPPPMPE 592
P PPPPP PGG G PPPPP+PG PPPPP+P
Sbjct: 17 PLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPG 56
Score = 55.7 bits (134), Expect = 6e-09
Identities = 25/46 (54%), Positives = 27/46 (58%), Gaps = 6/46 (13%)
Query: 552 AGIPTGPK----GGPPPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
GIP P PPPPP PG G PPPPP+PG PPPPP+P
Sbjct: 34 TGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79
Score = 55.3 bits (133), Expect = 8e-09
Identities = 27/46 (58%), Positives = 30/46 (65%), Gaps = 6/46 (13%)
Query: 552 AGIPTGPK--GGP--PPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
IP P GG PPPPP PGG PPPPP+PG+ G PPPPP+P
Sbjct: 22 VCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67
Score = 52.6 bits (126), Expect = 8e-08
Identities = 24/40 (60%), Positives = 26/40 (65%), Gaps = 3/40 (7%)
Query: 555 PTGPKGG--PPPPPPPPGGMGPPPPPMPGMPG-PPPPPMP 591
P P G PPPPP P G PPPPP+PG G PPPPP+P
Sbjct: 4 PPLPGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43
Score = 51.5 bits (123), Expect = 2e-07
Identities = 22/31 (70%), Positives = 25/31 (80%), Gaps = 2/31 (6%)
Query: 563 PPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
PPPPP PGG+G PPPPP+PG PPPPP+P
Sbjct: 1 PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLP 31
Score = 42.2 bits (99), Expect = 2e-04
Identities = 15/30 (50%), Positives = 16/30 (53%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
A GIP P G PPPPP G P P +P
Sbjct: 131 APGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 52.2 bits (125), Expect = 5e-07
Identities = 15/37 (40%), Positives = 15/37 (40%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
T P PPPP PP P P P PPP P
Sbjct: 49 STAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85
Score = 49.1 bits (117), Expect = 5e-06
Identities = 14/34 (41%), Positives = 14/34 (41%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P PPPP P PP P P PP P P
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQP 74
Score = 46.4 bits (110), Expect = 3e-05
Identities = 14/37 (37%), Positives = 14/37 (37%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
PPPPPP P P P PPPPP
Sbjct: 51 AAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPAD 87
Score = 46.0 bits (109), Expect = 5e-05
Identities = 20/45 (44%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 551 AAGIPTGPKGGPPP---PPPPPGGMGPPPPPMPGMP-GPPPPPMP 591
AA P P PPP P P P PPP P P PPPPP+
Sbjct: 53 AAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVD 97
Score = 45.7 bits (108), Expect = 7e-05
Identities = 18/46 (39%), Positives = 19/46 (41%), Gaps = 1/46 (2%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPP-PPPMPGMPGPPPPPMPE 592
P P+ P PPP P PPP P P PPPP PE
Sbjct: 62 PPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPE 107
Score = 43.7 bits (103), Expect = 3e-04
Identities = 15/42 (35%), Positives = 15/42 (35%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A P P P P PPPPP P P P P P
Sbjct: 36 TANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDP 77
Score = 43.3 bits (102), Expect = 4e-04
Identities = 14/44 (31%), Positives = 14/44 (31%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A A P P PP P PP P PPP P
Sbjct: 52 AAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPP 95
Score = 31.0 bits (70), Expect = 2.5
Identities = 13/34 (38%), Positives = 13/34 (38%), Gaps = 2/34 (5%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG 581
P P PPPP P PPP P PG
Sbjct: 78 NAAPPPPPADPNAPPPPPVDPNA--PPPPAPEPG 109
Score = 30.2 bits (68), Expect = 4.1
Identities = 12/30 (40%), Positives = 12/30 (40%), Gaps = 4/30 (13%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
P P PPPP P P P P P P
Sbjct: 83 PPPADPNAPPPP----PVDPNAPPPPAPEP 108
>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
Members of this family belong to the collagen
superfamily. Collagens are generally extracellular
structural proteins involved in formation of connective
tissue structure. The alignment contains 20 copies of
the G-X-Y repeat that forms a triple helix. The first
position of the repeat is glycine, the second and third
positions can be any residue but are frequently proline
and hydroxyproline. Collagens are post translationally
modified by proline hydroxylase to form the
hydroxyproline residues. Defective hydroxylation is the
cause of scurvy. Some members of the collagen
superfamily are not involved in connective tissue
structure but share the same triple helical structure.
Length = 60
Score = 45.6 bits (109), Expect = 2e-06
Identities = 25/38 (65%), Positives = 25/38 (65%), Gaps = 3/38 (7%)
Query: 555 PTGPKG--GPPPPPPPPGGMGPPPPP-MPGMPGPPPPP 589
P GP G GPP PP PPG GPP PP PG PGPP PP
Sbjct: 2 PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 39
Score = 45.2 bits (108), Expect = 2e-06
Identities = 25/38 (65%), Positives = 25/38 (65%), Gaps = 3/38 (7%)
Query: 555 PTGPKG--GPPPPPPPPGGMGPP-PPPMPGMPGPPPPP 589
P GP G GPP PP PPG GPP PP PG PGPP PP
Sbjct: 11 PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 48
Score = 45.2 bits (108), Expect = 3e-06
Identities = 24/41 (58%), Positives = 24/41 (58%), Gaps = 5/41 (12%)
Query: 551 AAGIPTGPKG--GPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
G P GP G GPP PP PPG GPP P PG PG P PP
Sbjct: 23 PPGPP-GPPGPPGPPGPPGPPGPPGPPGP--PGAPGAPGPP 60
Score = 41.7 bits (99), Expect = 4e-05
Identities = 24/39 (61%), Positives = 24/39 (61%), Gaps = 5/39 (12%)
Query: 557 GPKGGPPPPPPPPGGMGPP----PPPMPGMPGPPPPPMP 591
GP G PP PP PPG GPP PP PG PGPP PP P
Sbjct: 1 GPPG-PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGP 38
Score = 40.9 bits (97), Expect = 8e-05
Identities = 25/38 (65%), Positives = 25/38 (65%), Gaps = 3/38 (7%)
Query: 555 PTGPKG--GPPPPPPPPGGMGPPPPP-MPGMPGPPPPP 589
P GP G GPP PP PPG GPP PP PG PGPP PP
Sbjct: 5 PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 42
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 44.3 bits (105), Expect = 2e-05
Identities = 20/47 (42%), Positives = 21/47 (44%), Gaps = 7/47 (14%)
Query: 552 AGIPTGPKGGPPPP-------PPPPGGMGPPPPPMPGMPGPPPPPMP 591
G P P PPP PPP PPP P P +PGP PP P
Sbjct: 43 VGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSP 89
Score = 40.4 bits (95), Expect = 5e-04
Identities = 14/44 (31%), Positives = 16/44 (36%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
+P + PPP P P GP PP P P P P
Sbjct: 57 VSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPP 100
Score = 37.0 bits (86), Expect = 0.008
Identities = 16/40 (40%), Positives = 16/40 (40%), Gaps = 5/40 (12%)
Query: 555 PTGPKGGPPPPPPPPGGMGP-----PPPPMPGMPGPPPPP 589
P P PP PG GP PP P P PPP P
Sbjct: 67 SEPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106
Score = 35.0 bits (81), Expect = 0.038
Identities = 15/38 (39%), Positives = 15/38 (39%), Gaps = 4/38 (10%)
Query: 558 PKGGPPPPP----PPPGGMGPPPPPMPGMPGPPPPPMP 591
P G P PP PPP PP PPP P P
Sbjct: 41 PPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEP 78
Score = 34.3 bits (79), Expect = 0.058
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
PPP P + PPP+ + PPP P
Sbjct: 40 PPPVGDPRPPVVDTPPPVSAVWVLPPPSEP 69
Score = 33.9 bits (78), Expect = 0.093
Identities = 14/47 (29%), Positives = 17/47 (36%), Gaps = 5/47 (10%)
Query: 551 AAGI---PTGPKGGPPPPPPPPGGMGPPPP--PMPGMPGPPPPPMPE 592
+G + P+ P PPP G PP P P PP E
Sbjct: 22 TSGFSSCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSE 68
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 46.5 bits (110), Expect = 7e-05
Identities = 19/31 (61%), Positives = 19/31 (61%), Gaps = 5/31 (16%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
P G PPPPPPPP G PP P P PPPP
Sbjct: 5 PPGNPPPPPPPP-GFEPPSQPPP----PPPP 30
Score = 38.0 bits (88), Expect = 0.026
Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 4/42 (9%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNI 601
PP PPPP PPPP PPPPP P +
Sbjct: 3 SLPPGNPPPP----PPPPGFEPPSQPPPPPPPGVNVKKRSRK 40
>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
Length = 94
Score = 40.9 bits (95), Expect = 2e-04
Identities = 17/36 (47%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
I PK PP P PP P PPP P P P P P
Sbjct: 3 IFRAPKPPEPPAPLPPA---PVPPPPPAPPAPVPEP 35
Score = 39.0 bits (90), Expect = 0.001
Identities = 15/32 (46%), Positives = 15/32 (46%), Gaps = 4/32 (12%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P PP PP P PP P P PP PP P
Sbjct: 4 FRAPKPPEPP----APLPPAPVPPPPPAPPAP 31
Score = 31.2 bits (70), Expect = 0.52
Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 4/38 (10%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P PP P PP PPP P +P P P+
Sbjct: 9 PPEPPAPLPPAPVPP----PPPAPPAPVPEPTVKPVNA 42
>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region.
Length = 57
Score = 39.8 bits (93), Expect = 2e-04
Identities = 15/41 (36%), Positives = 16/41 (39%), Gaps = 6/41 (14%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP------PPP 589
P PPPP PPPP+ P PP PPP
Sbjct: 16 PPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPP 56
Score = 39.0 bits (91), Expect = 4e-04
Identities = 15/34 (44%), Positives = 15/34 (44%), Gaps = 7/34 (20%)
Query: 562 PPPPP-------PPPGGMGPPPPPMPGMPGPPPP 588
PPPP PPP PPPP PPPP
Sbjct: 24 PPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPPP 57
Score = 38.6 bits (90), Expect = 5e-04
Identities = 15/33 (45%), Positives = 16/33 (48%), Gaps = 5/33 (15%)
Query: 562 PPP-----PPPPPGGMGPPPPPMPGMPGPPPPP 589
PPP PPPP PPPP + PPPP
Sbjct: 7 PPPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPP 39
Score = 38.2 bits (89), Expect = 7e-04
Identities = 19/45 (42%), Positives = 20/45 (44%), Gaps = 11/45 (24%)
Query: 562 PPPP-----PPPPGGMGPPPPPMPGM--PGPPPP----PMPEMYT 595
PPPP PPPP PPPP+ PPPP P P Y
Sbjct: 6 PPPPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYV 50
Score = 37.1 bits (86), Expect = 0.002
Identities = 12/37 (32%), Positives = 13/37 (35%), Gaps = 4/37 (10%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
PPPP PPP PPPP +Y
Sbjct: 1 YHYKSPPPPVKQY----SPPPPYYYKSPPPPVKSPVY 33
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 41.2 bits (97), Expect = 2e-04
Identities = 13/50 (26%), Positives = 15/50 (30%), Gaps = 1/50 (2%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
+G P P P G PPP P P P P +Y
Sbjct: 38 QPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAP-PGVYP 86
Score = 40.5 bits (95), Expect = 4e-04
Identities = 20/50 (40%), Positives = 21/50 (42%), Gaps = 6/50 (12%)
Query: 549 QVAAGIPTGPK--GGPPPPPPPPGGMGPPPPPM----PGMPGPPPPPMPE 592
A G P G PPP P P G+ P PPP P PPPP P
Sbjct: 59 LPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPG 108
Score = 33.5 bits (77), Expect = 0.12
Identities = 12/55 (21%), Positives = 18/55 (32%), Gaps = 1/55 (1%)
Query: 541 NRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
++ + + P + G PPP + P P PPP P Y
Sbjct: 25 QAMERLAERAQRAQPVSRESGYYPPPGAYVHL-EPLPAYGQYAAPPPYGPPPPYY 78
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 44.0 bits (104), Expect = 3e-04
Identities = 20/86 (23%), Positives = 24/86 (27%), Gaps = 3/86 (3%)
Query: 510 EAIMLRQEAEAKLVQAQK---TLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPP 566
E +L+ A + LE L G P + A P P PP
Sbjct: 352 EMALLKMATLAPGAPVSELLDRLEALERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPA 411
Query: 567 PPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P P PP
Sbjct: 412 APARPAAARPAPAPAPPAAAAPPARS 437
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 43.1 bits (101), Expect = 4e-04
Identities = 22/45 (48%), Positives = 22/45 (48%), Gaps = 16/45 (35%)
Query: 562 PPPPPPPPGGMG---------PP-----PPPMPGMPG--PPPPPM 590
PP PPPPP G G PP PPP P P PPPPPM
Sbjct: 173 PPGPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217
Score = 33.8 bits (77), Expect = 0.33
Identities = 16/29 (55%), Positives = 16/29 (55%), Gaps = 6/29 (20%)
Query: 557 GPKGGPP-----PPPPPPGG-MGPPPPPM 579
P G PP PPP P G M PPPPPM
Sbjct: 189 KPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217
Score = 29.2 bits (65), Expect = 9.5
Identities = 12/20 (60%), Positives = 12/20 (60%)
Query: 572 MGPPPPPMPGMPGPPPPPMP 591
MGPP P P P PPPP P
Sbjct: 162 MGPPSPWNPRFPPGPPPPPP 181
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 41.0 bits (96), Expect = 4e-04
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG 581
G+ +G PP P PG PP P PG
Sbjct: 5 DGTGGVSSGSSAPAPPAGPGPGPNAPPAPAAPG 37
Score = 31.3 bits (71), Expect = 0.94
Identities = 12/36 (33%), Positives = 12/36 (33%), Gaps = 1/36 (2%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP 586
G TG P PP G P P P P P
Sbjct: 2 TGGDGTGGVSSGSSAPAPPAG-PGPGPNAPPAPAAP 36
Score = 30.5 bits (69), Expect = 1.5
Identities = 9/37 (24%), Positives = 9/37 (24%), Gaps = 2/37 (5%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
TG G P P P PP P
Sbjct: 1 STGGDGTGGVSSGSSAPAPPAGPGPG--PNAPPAPAA 35
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 43.6 bits (103), Expect = 5e-04
Identities = 16/46 (34%), Positives = 16/46 (34%), Gaps = 4/46 (8%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMPE 592
G P G G PPPP P P P P P PP E
Sbjct: 700 PGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAE 745
Score = 37.9 bits (88), Expect = 0.030
Identities = 12/30 (40%), Positives = 12/30 (40%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P P PPGG G PPP P P
Sbjct: 697 APRPGGPPGGGGGLPPPPDLPAAAGPAPCG 726
Score = 37.5 bits (87), Expect = 0.039
Identities = 17/65 (26%), Positives = 20/65 (30%), Gaps = 8/65 (12%)
Query: 536 RPVEKNRLDEVK-AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP-------P 587
RP R D V+ A + P PP GG+ PPP P
Sbjct: 673 RPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIAS 732
Query: 588 PPMPE 592
P P
Sbjct: 733 PTAPP 737
Score = 32.1 bits (73), Expect = 1.8
Identities = 11/43 (25%), Positives = 11/43 (25%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P P PP P PG E
Sbjct: 709 GLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGAE 751
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 43.8 bits (103), Expect = 6e-04
Identities = 17/37 (45%), Positives = 17/37 (45%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P PP P PP P PP P P PPPPP P
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 43.4 bits (102), Expect = 8e-04
Identities = 26/94 (27%), Positives = 29/94 (30%), Gaps = 25/94 (26%)
Query: 528 TLEDLSSGR--------PVEKNR-LDEVKAQVAAGI-------PTGPKGGPPPPPPPPGG 571
+LEDLS+GR P K R A G P P P P P
Sbjct: 364 SLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPV 423
Query: 572 M--------GPPPPPMPGMP-GPPPPPMPEMYTD 596
P P PG GP PPP +
Sbjct: 424 PASAPPPPATPLPSAEPGSDDGPAPPPERQPPAP 457
Score = 43.0 bits (101), Expect = 9e-04
Identities = 16/38 (42%), Positives = 17/38 (44%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P PPP PPPP P P PP P +
Sbjct: 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
Score = 41.8 bits (98), Expect = 0.002
Identities = 19/39 (48%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPG-MPGPPPPPMPE 592
P P+ P P PPP P PPP P P PPPPP P+
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
Score = 41.5 bits (97), Expect = 0.003
Identities = 19/45 (42%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A+ A T PP P P PPPP P P PPPPP P+
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQP-QPQPPPPPQPQ 2930
Score = 40.3 bits (94), Expect = 0.006
Identities = 15/40 (37%), Positives = 15/40 (37%), Gaps = 1/40 (2%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
V A P P P P P GP PPP P P P
Sbjct: 423 VPASAPPPPAT-PLPSAEPGSDDGPAPPPERQPPAPATEP 461
Score = 39.5 bits (92), Expect = 0.009
Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 1/43 (2%)
Query: 551 AAGIPTGPKGGPPPP-PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A G GP PP P PP P PPP P P
Sbjct: 2600 APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
Score = 38.8 bits (90), Expect = 0.018
Identities = 21/62 (33%), Positives = 24/62 (38%), Gaps = 12/62 (19%)
Query: 536 RPVE-----KNRLDEVKAQVAAG-IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
RP E + R + Q A P +G P P PP P PP P PPPP
Sbjct: 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPS-----PLPPDTHAP-DPPPP 2629
Query: 590 MP 591
P
Sbjct: 2630 SP 2631
Score = 38.8 bits (90), Expect = 0.020
Identities = 13/45 (28%), Positives = 16/45 (35%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
A P +G P G GPPPPP P P ++
Sbjct: 254 APAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVW 298
Score = 38.0 bits (88), Expect = 0.027
Identities = 20/48 (41%), Positives = 22/48 (45%), Gaps = 3/48 (6%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPP-PPPGGMGPPPPPMPGMPGPPPPPMP 591
+ A V A P P P PPP P PP P PGPPPP +P
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP--PGPPPPSLP 2851
Score = 38.0 bits (88), Expect = 0.029
Identities = 14/47 (29%), Positives = 15/47 (31%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDC 597
P PP P G + PP P P PPP P P
Sbjct: 2807 PPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853
Score = 37.6 bits (87), Expect = 0.036
Identities = 18/44 (40%), Positives = 19/44 (43%), Gaps = 2/44 (4%)
Query: 548 AQVAAGIPTGP--KGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A +P GP GGP P PP GPP P P P PP
Sbjct: 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
Score = 37.2 bits (86), Expect = 0.054
Identities = 15/32 (46%), Positives = 15/32 (46%), Gaps = 5/32 (15%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
G PPPP PP PP P PPP P P
Sbjct: 2549 GDPPPPLPP-----AAPPAAPDRSVPPPRPAP 2575
Score = 37.2 bits (86), Expect = 0.058
Identities = 13/30 (43%), Positives = 14/30 (46%), Gaps = 4/30 (13%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
PPPPPP P P P + P PP P
Sbjct: 2701 PPPPPPT----PEPAPHALVSATPLPPGPA 2726
Score = 36.5 bits (84), Expect = 0.088
Identities = 12/27 (44%), Positives = 13/27 (48%), Gaps = 2/27 (7%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPP 588
PPPPPP P P + P PP P
Sbjct: 2701 PPPPPPTPEP--APHALVSATPLPPGP 2725
Score = 36.1 bits (83), Expect = 0.12
Identities = 14/42 (33%), Positives = 14/42 (33%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A P P P PPPP P P PPP P
Sbjct: 2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653
Score = 35.7 bits (82), Expect = 0.14
Identities = 13/34 (38%), Positives = 13/34 (38%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
P P PPPPPP PP P G P
Sbjct: 2922 QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
Score = 35.7 bits (82), Expect = 0.16
Identities = 26/96 (27%), Positives = 39/96 (40%), Gaps = 3/96 (3%)
Query: 485 LVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLD 544
L E L E+S+ D + E ++ R +A A + ++ + R V + LD
Sbjct: 665 LSEWLLERSRAGPDTLFAPATPTHPEPLLQRVQALAGFARGEEIRAA-AEDREV-RGALD 722
Query: 545 EVKAQV-AAGIPTGPKGGPPPPPPPPGGMGPPPPPM 579
+ V A GP P P P G P PPP+
Sbjct: 723 ALARGVDAVARRAGPLTVAPAPAAPGGQGAPRPPPL 758
Score = 35.3 bits (81), Expect = 0.18
Identities = 18/54 (33%), Positives = 21/54 (38%)
Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
P + D A +A P P P PPP P PP P P PP P+
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852
Score = 35.3 bits (81), Expect = 0.19
Identities = 26/108 (24%), Positives = 31/108 (28%), Gaps = 41/108 (37%)
Query: 525 AQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPP--PPPPGGMGP-------- 574
L +L G PV + + A P GGPP P PP P + P
Sbjct: 2466 LSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPV 2525
Query: 575 -------------------------PPPPMPGMPGPP------PPPMP 591
PPPP+P P PPP P
Sbjct: 2526 GEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRP 2573
Score = 35.3 bits (81), Expect = 0.19
Identities = 12/38 (31%), Positives = 13/38 (34%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P PP P PPP P PP+ P P
Sbjct: 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
Score = 35.3 bits (81), Expect = 0.22
Identities = 15/46 (32%), Positives = 16/46 (34%), Gaps = 5/46 (10%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPP----MPGMPGPPP-PPMP 591
A +P GP P P PP P PG P P PP
Sbjct: 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT 2763
Score = 34.5 bits (79), Expect = 0.37
Identities = 18/64 (28%), Positives = 21/64 (32%), Gaps = 13/64 (20%)
Query: 546 VKAQVAAGIPTGPKGG-PPPPPPPPGGMGPPPPPMPG------------MPGPPPPPMPE 592
V + A P +G PPPPP P G +P PP PP P
Sbjct: 260 VVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA 319
Query: 593 MYTD 596
D
Sbjct: 320 PAGD 323
Score = 34.1 bits (78), Expect = 0.48
Identities = 10/34 (29%), Positives = 11/34 (32%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P P P P P PP P P +P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALP 2736
Score = 33.8 bits (77), Expect = 0.55
Identities = 10/39 (25%), Positives = 13/39 (33%), Gaps = 1/39 (2%)
Query: 555 PTGP-KGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P G + PP P P PP+ + P E
Sbjct: 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896
Score = 33.8 bits (77), Expect = 0.58
Identities = 12/37 (32%), Positives = 12/37 (32%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P P P PPPPP PP P P
Sbjct: 2921 PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
Score = 33.8 bits (77), Expect = 0.59
Identities = 15/48 (31%), Positives = 15/48 (31%), Gaps = 6/48 (12%)
Query: 548 AQVAAGIPTGPKGGPPPPPPP-----PGGMGPPP-PPMPGMPGPPPPP 589
A P PP P PGG P PP P P PP
Sbjct: 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP 2772
Score = 33.8 bits (77), Expect = 0.66
Identities = 12/43 (27%), Positives = 13/43 (30%), Gaps = 8/43 (18%)
Query: 555 PTGPKGGPPPPPPPPGGMGPP--------PPPMPGMPGPPPPP 589
P P P P PP P +P P PP P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP 2745
Score = 33.4 bits (76), Expect = 0.67
Identities = 15/65 (23%), Positives = 20/65 (30%), Gaps = 8/65 (12%)
Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP-----PPPPMPGMPG 584
D + V + R + A + P P P +G PPP P P
Sbjct: 2653 RDDPAPGRVSRPR--RARRLGRAAQASSPPQRPRRRAARPT-VGSLTSLADPPPPPPTPE 2709
Query: 585 PPPPP 589
P P
Sbjct: 2710 PAPHA 2714
Score = 33.4 bits (76), Expect = 0.84
Identities = 14/40 (35%), Positives = 14/40 (35%), Gaps = 2/40 (5%)
Query: 555 PTGPKGGPP--PPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P PP P PP P P PP P PE
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE 2650
Score = 33.0 bits (75), Expect = 1.1
Identities = 11/52 (21%), Positives = 13/52 (25%), Gaps = 11/52 (21%)
Query: 548 AQVAAGIPTGPK-------GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
++ A P P P PP P PP P P
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----PPQPQAPP 2915
Score = 32.6 bits (74), Expect = 1.3
Identities = 23/68 (33%), Positives = 23/68 (33%), Gaps = 6/68 (8%)
Query: 523 VQAQKTLEDLSSGRPVEKNRLD-----EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPP 577
A L D G PV L E A AG P P P PP P PPP
Sbjct: 2514 RLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPP-PPLPPAAPPAAPDRSVPPPR 2572
Query: 578 PMPGMPGP 585
P P P
Sbjct: 2573 PAPRPSEP 2580
Score = 32.6 bits (74), Expect = 1.4
Identities = 9/31 (29%), Positives = 10/31 (32%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGP 585
P P P PP P P +P P
Sbjct: 2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
Score = 32.2 bits (73), Expect = 1.7
Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 1/47 (2%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPPPPMP 591
V+ +V P P PPP G G P G GPPPPP
Sbjct: 238 VERRVVISHPLRGDIAAPAPPPVVGEGADRAPETARGATGPPPPPEA 284
Score = 32.2 bits (73), Expect = 1.9
Identities = 12/45 (26%), Positives = 13/45 (28%), Gaps = 4/45 (8%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMP 591
A P P GPP P P P P PP +
Sbjct: 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVL 2812
Score = 31.4 bits (71), Expect = 3.2
Identities = 13/41 (31%), Positives = 14/41 (34%), Gaps = 3/41 (7%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P GP P PP G P PP P PP +
Sbjct: 2747 GPATPGGPAR-PARPPTTAGPPAPAPPAAP--AAGPPRRLT 2784
Score = 31.4 bits (71), Expect = 3.4
Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 7/47 (14%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG--MPGPP----PPPMP 591
A +P P PPPPP G PP P+ G PG PP
Sbjct: 2825 AGPLPPPTSAQPTAPPPPP-GPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
Score = 31.1 bits (70), Expect = 4.2
Identities = 14/43 (32%), Positives = 15/43 (34%), Gaps = 6/43 (13%)
Query: 555 PTGPKGGPPPPPPPPGGMGP------PPPPMPGMPGPPPPPMP 591
P P G PPP P G + P PP P P P
Sbjct: 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881
Score = 30.3 bits (68), Expect = 6.2
Identities = 10/57 (17%), Positives = 12/57 (21%), Gaps = 2/57 (3%)
Query: 537 PVEKNRLDEVKAQVAAGIPTGPKG--GPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P R P P PP + +P P P P
Sbjct: 2751 PGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
Score = 29.9 bits (67), Expect = 7.7
Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 7/51 (13%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGM-------GPPPPPMPGMPGPPPPPMP 591
++ + P P P P + GP PPP P PPPP
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
Score = 29.9 bits (67), Expect = 8.3
Identities = 14/49 (28%), Positives = 16/49 (32%), Gaps = 11/49 (22%)
Query: 555 PTGPK--------GGPPPP--PPPPGG-MGPPPPPMPGMPGPPPPPMPE 592
P+ P PP P P G P P P P PP P+
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
Score = 29.9 bits (67), Expect = 8.7
Identities = 16/46 (34%), Positives = 16/46 (34%), Gaps = 5/46 (10%)
Query: 551 AAGIPTGPKGGPPP-PPPPPGG----MGPPPPPMPGMPGPPPPPMP 591
AA P P PP P G GP P P PP P P
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771
Score = 29.9 bits (67), Expect = 9.5
Identities = 10/41 (24%), Positives = 14/41 (34%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
++ +P+ PP PP P P PPP
Sbjct: 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPT 2832
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 43.0 bits (101), Expect = 6e-04
Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 10/60 (16%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPP----------GGMGPPPPPMPGMPGPPPPPMPEM 593
+ A+ + GP+ P PP P G P PP+ P P P P+ E+
Sbjct: 148 NSTAARAVVFLVVGPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPPPAPHPHPIAEV 207
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 41.4 bits (97), Expect = 7e-04
Identities = 14/32 (43%), Positives = 15/32 (46%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
G PPPP + P P P M GP P MP
Sbjct: 39 GPRPPPPYFNSSVASSPQPHPYMWGPQQPMMP 70
Score = 35.6 bits (82), Expect = 0.068
Identities = 18/46 (39%), Positives = 19/46 (41%), Gaps = 10/46 (21%)
Query: 557 GPKGGPPP--------PPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
GP+ PP P P P GP P MP P PPP MY
Sbjct: 39 GPRPPPPYFNSSVASSPQPHPYMWGPQQPMMP--PYGTPPPYAAMY 82
Score = 32.1 bits (73), Expect = 0.76
Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 7/58 (12%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPP--PG-----GMGPPPPPMPGMPGPPPPPMPEMY 594
+E K ++ + + P P P G PPPP P P P M+
Sbjct: 5 EEGKPSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMW 62
Score = 32.1 bits (73), Expect = 0.90
Identities = 26/96 (27%), Positives = 34/96 (35%), Gaps = 15/96 (15%)
Query: 558 PKGGPPPPPP--PPGGMG--PPPPPMPGMPGPPPPPMPE------MYTDCHKNIINGKQA 607
P G PPP PPGG+ P PP P P E + + + K
Sbjct: 71 PYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPGSTPLSMETDAKSSDNKDK 130
Query: 608 CEEVKQSKKLAKILELILLMGNYMN----SGSRNGG 639
+K+SK L L + N + S NGG
Sbjct: 131 G-SIKKSKGSDGSLGLAMSGKNGESGKASGSSANGG 165
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 40.9 bits (96), Expect = 8e-04
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
P P+ PP P P M P PP P P PP PP+P M+
Sbjct: 99 PFQPQPLQPPQPQQP--MQPQPPVHPIPPLPPQPPLPPMF 136
Score = 39.8 bits (93), Expect = 0.002
Identities = 14/31 (45%), Positives = 15/31 (48%), Gaps = 2/31 (6%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
PP P PP PP PPM M P PP+
Sbjct: 118 PPVHPIPPLPPQPPLPPMFPMQ--PLPPLLP 146
Score = 39.0 bits (91), Expect = 0.004
Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 5/40 (12%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
+ P P PP PP PP PPM M P PP +P++
Sbjct: 114 MQPQPPVHPIPPLPPQ----PPLPPMFPMQ-PLPPLLPDL 148
Score = 38.6 bits (90), Expect = 0.004
Identities = 15/39 (38%), Positives = 15/39 (38%), Gaps = 4/39 (10%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P P P P P P P P P PP PPM M
Sbjct: 104 PLQPPQPQQPMQPQP----PVHPIPPLPPQPPLPPMFPM 138
Score = 38.2 bits (89), Expect = 0.007
Identities = 15/45 (33%), Positives = 18/45 (40%), Gaps = 1/45 (2%)
Query: 555 PTGPKGGPPPPPPPPGGMGPP-PPPMPGMPGPPPPPMPEMYTDCH 598
P+ P P PP PP PP P P P P+P + D
Sbjct: 105 LQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDLP 149
Score = 32.5 bits (74), Expect = 0.51
Identities = 12/29 (41%), Positives = 14/29 (48%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
P PP PP M P PP +P +P P
Sbjct: 128 PQPPLPPMFPMQPLPPLLPDLPLEAWPAT 156
Score = 32.5 bits (74), Expect = 0.56
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P P P + PP P P P PP P+P +
Sbjct: 95 PAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPL 126
Score = 31.3 bits (71), Expect = 1.4
Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 1/38 (2%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPM-PGMPGPPPPPMP 591
P P+ P P P P PM P P P PP+P
Sbjct: 90 PNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLP 127
Score = 29.4 bits (66), Expect = 4.8
Identities = 11/31 (35%), Positives = 13/31 (41%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P P P P PP P P P PP+ +
Sbjct: 93 PQPAQQPFQPQPLQPPQPQQPMQPQPPVHPI 123
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 43.2 bits (102), Expect = 8e-04
Identities = 16/43 (37%), Positives = 16/43 (37%), Gaps = 4/43 (9%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPP----MPGMPGPPPPPMPEM 593
P PPPP PPP P P P M G P PP
Sbjct: 109 PGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAAS 151
Score = 41.7 bits (98), Expect = 0.002
Identities = 14/39 (35%), Positives = 14/39 (35%), Gaps = 3/39 (7%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A PP P P PPP P P P P P P
Sbjct: 97 PASPAREGSPTPPGPSSPDP---PPPTPPPASPPPSPAP 132
Score = 40.5 bits (95), Expect = 0.005
Identities = 15/47 (31%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCH 598
+ P GP P PPPP P PPP P P + P
Sbjct: 105 SPTPPGP-SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA 150
Score = 37.5 bits (87), Expect = 0.037
Identities = 15/38 (39%), Positives = 15/38 (39%), Gaps = 3/38 (7%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
PT P PPP P P G PGPPP P
Sbjct: 119 PTPPPASPPPSPAPD---LSEMLRPVGSPGPPPAASPP 153
Score = 36.7 bits (85), Expect = 0.066
Identities = 17/62 (27%), Positives = 19/62 (30%), Gaps = 12/62 (19%)
Query: 535 GRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP-----P 589
GR KN + P P+ P PP PP P P P P
Sbjct: 863 GRARGKNGRRRPR-------PPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGP 915
Query: 590 MP 591
MP
Sbjct: 916 MP 917
Score = 35.5 bits (82), Expect = 0.17
Identities = 14/61 (22%), Positives = 17/61 (27%)
Query: 531 DLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
+ P + + K A G G G P PP P P PP
Sbjct: 841 AAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGA 900
Query: 591 P 591
P
Sbjct: 901 P 901
Score = 34.8 bits (80), Expect = 0.30
Identities = 13/39 (33%), Positives = 14/39 (35%), Gaps = 2/39 (5%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP--GPPP 587
AA P PP P P P + MP GP P
Sbjct: 885 AAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923
Score = 34.8 bits (80), Expect = 0.31
Identities = 10/41 (24%), Positives = 11/41 (26%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A + GP P P P P P P P
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378
Score = 34.4 bits (79), Expect = 0.37
Identities = 11/50 (22%), Positives = 14/50 (28%), Gaps = 1/50 (2%)
Query: 541 NRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP-PPMPGMPGPPPPP 589
L + + + GP P P P P PG P P
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP 312
Score = 33.2 bits (76), Expect = 0.76
Identities = 11/52 (21%), Positives = 13/52 (25%)
Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
+ A A G + PP P G PP P P
Sbjct: 855 KSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPR 906
Score = 33.2 bits (76), Expect = 0.83
Identities = 11/41 (26%), Positives = 12/41 (29%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
A + P P PP P PP P P P
Sbjct: 172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSP 212
Score = 33.2 bits (76), Expect = 0.88
Identities = 11/41 (26%), Positives = 11/41 (26%), Gaps = 4/41 (9%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMP----GPPPPPMP 591
PPP P P PP P P P P
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224
Score = 32.8 bits (75), Expect = 1.0
Identities = 10/57 (17%), Positives = 15/57 (26%), Gaps = 1/57 (1%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP-PPPMPGMPGPPPPPMP 591
P+ + + ++ P P P P P PG P P
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSS 311
Score = 32.8 bits (75), Expect = 1.0
Identities = 9/59 (15%), Positives = 15/59 (25%), Gaps = 1/59 (1%)
Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPK-GGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
S + ++ ++ + G P P P P P P P
Sbjct: 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPR 366
Score = 32.5 bits (74), Expect = 1.3
Identities = 15/55 (27%), Positives = 15/55 (27%), Gaps = 15/55 (27%)
Query: 552 AGIPTGPKGGPPPPPPP---------------PGGMGPPPPPMPGMPGPPPPPMP 591
PTGP GP P P P P P P PP P
Sbjct: 65 FEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP 119
Score = 30.9 bits (70), Expect = 3.8
Identities = 14/57 (24%), Positives = 17/57 (29%), Gaps = 9/57 (15%)
Query: 545 EVKAQVAAGIPTGPKGGPP---------PPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A VA+ + + P P PP P PP P PP P
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPI 213
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 41.0 bits (96), Expect = 0.001
Identities = 15/49 (30%), Positives = 16/49 (32%), Gaps = 6/49 (12%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGM------GPPPPPMPGMPGPPPPPMPEM 593
A+G P GP P M P P P M P P M M
Sbjct: 55 ASGRPRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASM 103
Score = 38.3 bits (89), Expect = 0.009
Identities = 14/38 (36%), Positives = 14/38 (36%), Gaps = 1/38 (2%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGP-PPPPMPGMPGPPPPP 589
G P G P P G PP M MP P PP
Sbjct: 114 GHAGAPAGHPGGGGPQQFRPGAGQPPGMQHMPAPALPP 151
Score = 30.2 bits (68), Expect = 4.1
Identities = 14/50 (28%), Positives = 16/50 (32%), Gaps = 1/50 (2%)
Query: 566 PPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHK-NIINGKQACEEVKQS 614
P G G P PP M M N+I+ EEV S
Sbjct: 117 GAPAGHPGGGGPQQFRPGAGQPPGMQHMPAPALPPNVIDTDLIDEEVLMS 166
Score = 29.1 bits (65), Expect = 8.4
Identities = 12/43 (27%), Positives = 12/43 (27%), Gaps = 4/43 (9%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMPEM 593
G P G MG P P M P P P M
Sbjct: 50 GGGMDASGRPRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFM 92
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 40.6 bits (95), Expect = 0.002
Identities = 23/66 (34%), Positives = 30/66 (45%), Gaps = 16/66 (24%)
Query: 542 RLDEVKAQVA--------AGIPTGPKGGPPPPP-PPPGGMGPPPP------PMPGMP-GP 585
+++EV+ +V A P P+ P PP P PGG+ P P P MP G
Sbjct: 104 KVEEVRKEVEFFNNYLTDAKRPALPEIKPLQPPKPDPGGLAPGLPGYPPQTPQALMPYGQ 163
Query: 586 PPPPMP 591
P PPM
Sbjct: 164 PRPPMM 169
Score = 36.7 bits (85), Expect = 0.032
Identities = 17/41 (41%), Positives = 18/41 (43%), Gaps = 2/41 (4%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP--GPPPPPMP 591
G+ G G PP P G P PPM G GPP PP
Sbjct: 142 GLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQ 182
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 41.8 bits (98), Expect = 0.002
Identities = 20/56 (35%), Positives = 22/56 (39%), Gaps = 4/56 (7%)
Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP--GPPPPPMPEMYTD 596
LD+V + T P G PPP G GPP PP P PP P D
Sbjct: 891 LDQVFGPGTGRVATAPGGDAASAPPP--GAGPPAPPQAVPPPRTTQPPAAPPRGPD 944
Score = 33.3 bits (76), Expect = 0.85
Identities = 14/46 (30%), Positives = 17/46 (36%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
A+ P G PP PP PP P P PP + E+
Sbjct: 908 GDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAEL 953
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 39.1 bits (91), Expect = 0.002
Identities = 11/37 (29%), Positives = 13/37 (35%), Gaps = 5/37 (13%)
Query: 562 PPPPPP-----PPGGMGPPPPPMPGMPGPPPPPMPEM 593
P PP P + PP MP P P P +
Sbjct: 2 PDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYL 38
Score = 33.7 bits (77), Expect = 0.14
Identities = 17/51 (33%), Positives = 18/51 (35%), Gaps = 5/51 (9%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMP-GPPPPPMP 591
V Q + P GP P M P PPM P G PPP P
Sbjct: 18 VPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYP 68
Score = 32.9 bits (75), Expect = 0.27
Identities = 8/36 (22%), Positives = 10/36 (27%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P PP P P P P +P +
Sbjct: 14 PSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQ 49
Score = 31.7 bits (72), Expect = 0.64
Identities = 10/38 (26%), Positives = 10/38 (26%), Gaps = 5/38 (13%)
Query: 562 PPPPPPPPGGMGPPPPPM-----PGMPGPPPPPMPEMY 594
PP P P P M M P P M
Sbjct: 19 PPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMA 56
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 39.6 bits (92), Expect = 0.003
Identities = 20/54 (37%), Positives = 24/54 (44%), Gaps = 5/54 (9%)
Query: 534 SGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
++ L + + G P G PPP PP M PPPP PG PG PP
Sbjct: 131 PSEGSVRSSLSHPSSHSSYGAP-----GVPPPYNPPMLMMKPPPPSPGPPGAPP 179
Score = 37.3 bits (86), Expect = 0.017
Identities = 14/40 (35%), Positives = 17/40 (42%), Gaps = 1/40 (2%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP-PPPMPEM 593
+ G P PPP P P PGPP PP+ E+
Sbjct: 144 SSHSSYGAPGVPPPYNPPMLMMKPPPPSPGPPGAPPVREL 183
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 38.5 bits (90), Expect = 0.004
Identities = 27/94 (28%), Positives = 32/94 (34%), Gaps = 19/94 (20%)
Query: 506 AKLEEAIMLRQEAEAKLV------QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPK 559
+KL EA Q+ E L + LED++
Sbjct: 21 SKLVEAERSLQQLETTLSLLEAKLASIPGLEDVTVQTTP------------PPPASAITN 68
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
GGPPPPPP PPPP PP P PE
Sbjct: 69 GGPPPPPPARAEAASPPPPEAPAE-PPAEPEPEA 101
Score = 33.5 bits (77), Expect = 0.18
Identities = 24/79 (30%), Positives = 32/79 (40%), Gaps = 6/79 (7%)
Query: 517 EAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP 576
E+KLV+A+++L+ L + L E K G+ PPPP G PP
Sbjct: 18 VCESKLVEAERSLQQLET-----TLSLLEAKLASIPGLEDVTVQTTPPPPASAITNGGPP 72
Query: 577 PPMPGMPGPPPPPMPEMYT 595
PP P PP P
Sbjct: 73 PPPPARAEAASPP-PPEAP 90
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 39.9 bits (93), Expect = 0.006
Identities = 11/51 (21%), Positives = 18/51 (35%), Gaps = 3/51 (5%)
Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
R++ ++ ++ IP P P P P P P P P+
Sbjct: 375 RVERIETRLDMSIPANLLHNAPQAAPAP---SAAAPEPKHQPAPEPRPVLA 422
Score = 31.8 bits (72), Expect = 2.0
Identities = 11/40 (27%), Positives = 12/40 (30%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
A AA P+ P P P P P G P
Sbjct: 391 LLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEP 430
Score = 30.2 bits (68), Expect = 5.7
Identities = 11/33 (33%), Positives = 11/33 (33%), Gaps = 4/33 (12%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
G P P GG PP P PPP
Sbjct: 511 GKPAAAASPAGGGANAPPAKP----VKPPPSCL 539
>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal.
Length = 313
Score = 39.0 bits (91), Expect = 0.007
Identities = 12/30 (40%), Positives = 14/30 (46%), Gaps = 4/30 (13%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
+ +P P PPPPP PPPPP
Sbjct: 219 KSALPAVSSSAPSAPPPPP----PPPPPSV 244
Score = 35.2 bits (81), Expect = 0.15
Identities = 12/37 (32%), Positives = 14/37 (37%), Gaps = 4/37 (10%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
A+ A + PPPPPPP PPP
Sbjct: 214 PVAAAKSALPAVSSSAPSAPPPPPPP----PPPSVPT 246
Score = 30.9 bits (70), Expect = 3.0
Identities = 13/39 (33%), Positives = 14/39 (35%), Gaps = 6/39 (15%)
Query: 555 PTGPKGGPPP--PPPPPGGMGPPPPPMPGMPGPPPPPMP 591
GP P PPPP P PPPP +P
Sbjct: 211 KKGPVAAAKSALPAVSSSAPSAPPPP----PPPPPPSVP 245
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 39.3 bits (92), Expect = 0.011
Identities = 16/77 (20%), Positives = 18/77 (23%), Gaps = 5/77 (6%)
Query: 518 AEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP-- 575
AEA +KT + P A A P P PP P
Sbjct: 370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP--AAAASAPAAPPAAAPPAPVA 427
Query: 576 -PPPMPGMPGPPPPPMP 591
P P P
Sbjct: 428 APAAAAPAAAPAAAPAA 444
Score = 33.5 bits (77), Expect = 0.61
Identities = 14/57 (24%), Positives = 15/57 (26%)
Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P EK +A A P P P P P P P P P
Sbjct: 375 PAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAA 431
Score = 31.2 bits (71), Expect = 2.8
Identities = 10/58 (17%), Positives = 11/58 (18%), Gaps = 2/58 (3%)
Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPP--PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
V A P+ P P P P P P E
Sbjct: 436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEE 493
Score = 30.8 bits (70), Expect = 3.7
Identities = 10/81 (12%), Positives = 15/81 (18%), Gaps = 8/81 (9%)
Query: 505 SAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPP 564
A + A A L + + V+ P+
Sbjct: 423 PAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVA--------PEPAVAS 474
Query: 565 PPPPPGGMGPPPPPMPGMPGP 585
P P P G
Sbjct: 475 AAPAPAAAPAAARLTPTEEGD 495
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 38.2 bits (89), Expect = 0.014
Identities = 34/120 (28%), Positives = 49/120 (40%), Gaps = 20/120 (16%)
Query: 478 FQLDVQPLVEHLA-EKSKTEEDRRVED-----LSAKLEEAIMLRQEAEAKLVQAQKTLED 531
+L QP V +L E+ + + + + +SA +E LR + AK+V AQ E
Sbjct: 92 MKLSRQPSVPNLRQEEPRLQSLKPPNEEALQKISALEDELAFLRAQI-AKIVAAQ---EQ 147
Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
+S + + V + P PP P PPPPP P PPPPP
Sbjct: 148 SNSTTSDLLSSDESVPSSSTTSFP------ISPPTEEPVLEVPPPPPPP----PPPPPPS 197
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 38.3 bits (89), Expect = 0.018
Identities = 25/105 (23%), Positives = 38/105 (36%), Gaps = 17/105 (16%)
Query: 489 LAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKA 548
+ K + ++ RR+E L L ++ +A ++G PV +
Sbjct: 94 VLNKQRGDDQRRIEKLGQDNAA---LAEQVKALGANPV-----TATGEPVPQM------P 139
Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM---PGMPGPPPPPM 590
G P+ G P PP G PPP PG PPP +
Sbjct: 140 ASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQV 184
Score = 36.7 bits (85), Expect = 0.061
Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 17/92 (18%)
Query: 517 EAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAA-----------GIPTGPKGGPPPP 565
+ + + + ++ L+ L+ R ++ R++++ AA + + P P
Sbjct: 80 QMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATGEPVPQMP 139
Query: 566 PPPPGGMGPPPPPMPGMPGPP------PPPMP 591
PPG G P P + PP PPP
Sbjct: 140 ASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTA 171
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 38.0 bits (88), Expect = 0.018
Identities = 11/41 (26%), Positives = 11/41 (26%), Gaps = 7/41 (17%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPM-------PGMPGPPPP 588
PT PP P P G P PP P
Sbjct: 222 PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262
Score = 37.6 bits (87), Expect = 0.030
Identities = 8/42 (19%), Positives = 9/42 (21%), Gaps = 3/42 (7%)
Query: 553 GIPTGPKGGPPPP---PPPPGGMGPPPPPMPGMPGPPPPPMP 591
P P P P PP +P P
Sbjct: 206 ATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQA 247
Score = 33.0 bits (75), Expect = 0.78
Identities = 9/35 (25%), Positives = 9/35 (25%), Gaps = 4/35 (11%)
Query: 562 PPPPPPPPGGMGPP----PPPMPGMPGPPPPPMPE 592
P P P P P P P P P
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPS 239
Score = 30.3 bits (68), Expect = 5.6
Identities = 11/47 (23%), Positives = 12/47 (25%), Gaps = 5/47 (10%)
Query: 551 AAGIPTGPKGGPPPP-----PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
+ P P P G PP P P P PE
Sbjct: 231 PSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE 277
>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP). This entry
represents proteins encoded by genes which are always
found in type III secretion operons, although their
function in the processes of secretion and virulence is
unclear. Hpa stands for Hrp-associated gene, where Hrp
stands for hypersensitivity response and virulence. see
also PMID:18584024.
Length = 185
Score = 37.1 bits (86), Expect = 0.020
Identities = 13/35 (37%), Positives = 13/35 (37%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
GP G P PP P P P PP PP
Sbjct: 25 RAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPP 59
Score = 35.1 bits (81), Expect = 0.078
Identities = 14/34 (41%), Positives = 14/34 (41%), Gaps = 2/34 (5%)
Query: 561 GPPPPPPPPGGMGPPPPPMPGMPG--PPPPPMPE 592
GPP P PPG P P P P PP P
Sbjct: 27 GPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPPR 60
Score = 29.3 bits (66), Expect = 5.6
Identities = 8/35 (22%), Positives = 8/35 (22%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P P P P PP P P
Sbjct: 37 PAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPL 71
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 38.0 bits (88), Expect = 0.023
Identities = 46/211 (21%), Positives = 78/211 (36%), Gaps = 51/211 (24%)
Query: 411 LLFIRDDQ-NMVMDSACEPYLLSILQHLLFIRDDQYVRLAYY-KLLEECVSQIVLHRGGC 468
L +I D+Q V D+ E + +L+ F+ D+ Y + KL+E + +VL+
Sbjct: 254 LNYIDDEQFFDVTDAIAENDAVKMLEVARFVIDNGYDEQDFLEKLIEHFRNFLVLY---- 309
Query: 469 DPDFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKT 528
+ RS+R L E+ ++ ++R + + L A +++ L++ Q
Sbjct: 310 --NLRSTR------------LIERPESVKERYQQS-AGGLTPAAVMQMTD--FLMKTQGE 352
Query: 529 LE---------DLSSGRPVEKNRLD-EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPP 578
L+ +L+ R +E R D V A K P P P GP P
Sbjct: 353 LKFQFEYQFRFELALLRLIELVRNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPE 412
Query: 579 MPGM------------------PGPPPPPMP 591
PG P P+P
Sbjct: 413 APGARPAELPSPASAPTPEQQPPVARSAPLP 443
>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
Length = 230
Score = 37.0 bits (85), Expect = 0.023
Identities = 20/67 (29%), Positives = 24/67 (35%), Gaps = 16/67 (23%)
Query: 558 PKGGPPPPPPPPGGMGPPPP-------PMPGM---------PGPPPPPMPEMYTDCHKNI 601
G P PPP GG G PP P G P PP P P+ + +
Sbjct: 133 LHGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPPAPEPKPHNGDATHG 192
Query: 602 INGKQAC 608
+ G AC
Sbjct: 193 LLGTVAC 199
Score = 32.8 bits (74), Expect = 0.60
Identities = 15/41 (36%), Positives = 16/41 (39%), Gaps = 11/41 (26%)
Query: 556 TGPKGGPPPPPPPPGGMG-----PPPPPMPGMPGPPPPPMP 591
P+GG PP G G PPP P P P P P
Sbjct: 150 EPPEGGEESQPPLGEGEGAVEGHPPPLP------PAPEPKP 184
Score = 31.6 bits (71), Expect = 1.6
Identities = 25/93 (26%), Positives = 32/93 (34%), Gaps = 25/93 (26%)
Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAG--------IPTGPKGGPPPPP--------- 566
QA + RP ++++ DE + P G P PP
Sbjct: 90 QAPLAADADDDPRPGKRSKADEHGPAPGRAALAPFKLDLDQDPLHGDPDPPPGATGGQGE 149
Query: 567 -PPPGGMGPPPPP------MPGMPGPPPPPMPE 592
PP GG PP + G P PP PP PE
Sbjct: 150 EPPEGGEESQPPLGEGEGAVEGHP-PPLPPAPE 181
>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763). This
eukaryotic family of proteins has no known function.
Length = 91
Score = 34.7 bits (80), Expect = 0.032
Identities = 17/50 (34%), Positives = 19/50 (38%), Gaps = 9/50 (18%)
Query: 538 VEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPP------GGMGPPPPPMPG 581
V K K + G +GGP PP GG GP PPM G
Sbjct: 44 VTKRYR---KNSSSGKRWDGGRGGPGPPGGGRRMGRIGGGGGPSRPPMAG 90
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 37.0 bits (85), Expect = 0.035
Identities = 19/41 (46%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
AG P G G P P M PPPP P M P P PMP+
Sbjct: 205 AGQPNGSGQGKVPNHFLPTPMLAPPPP-PPMARPVPLPMPD 244
Score = 32.0 bits (72), Expect = 1.5
Identities = 13/44 (29%), Positives = 15/44 (34%), Gaps = 18/44 (40%)
Query: 555 PTGPKGGPP------------------PPPPPPGGMGPPPPPMP 580
P+ + G P PPPPP P P PMP
Sbjct: 200 PSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPPPPMARPVPLPMP 243
Score = 30.0 bits (67), Expect = 6.1
Identities = 12/36 (33%), Positives = 13/36 (36%), Gaps = 3/36 (8%)
Query: 558 PKGGPPPPPPPPGGMGPPP---PPMPGMPGPPPPPM 590
P G G P P P + PPPPPM
Sbjct: 199 DPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPPPPM 234
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 35.6 bits (82), Expect = 0.035
Identities = 18/47 (38%), Positives = 20/47 (42%), Gaps = 7/47 (14%)
Query: 555 PTGPKGGPPPPPPPPGGMGPP--PPPMPGMPGP-----PPPPMPEMY 594
P P P PPP P G P PPP +P P PP P + Y
Sbjct: 78 PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPY 124
Score = 29.5 bits (66), Expect = 4.5
Identities = 23/82 (28%), Positives = 27/82 (32%), Gaps = 19/82 (23%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPP-------------PPPGGMGPPPPPM--- 579
R E+ R D + P P PP PP PPP G P PPP
Sbjct: 34 RERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQP 93
Query: 580 ---PGMPGPPPPPMPEMYTDCH 598
P PPP +P +
Sbjct: 94 GYNPADYPPPPGAVPPPQNYPY 115
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 34.7 bits (80), Expect = 0.037
Identities = 10/30 (33%), Positives = 10/30 (33%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
PPPP PP P P P P
Sbjct: 30 PPPPYSPPQQQQGHFMPPQPQPYPKQSPQQ 59
Score = 32.0 bits (73), Expect = 0.37
Identities = 12/44 (27%), Positives = 13/44 (29%), Gaps = 6/44 (13%)
Query: 555 PTGPKGGPPPP---PPPPGGMGPPP---PPMPGMPGPPPPPMPE 592
P+ P P M PPP P PP P P
Sbjct: 9 QYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPY 52
Score = 28.9 bits (65), Expect = 3.6
Identities = 9/37 (24%), Positives = 11/37 (29%), Gaps = 3/37 (8%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
P +G PP P P P P P +
Sbjct: 36 PPQQQQGHFMPPQPQPY---PKQSPQQQQPPQFSSFL 69
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 36.7 bits (85), Expect = 0.038
Identities = 15/30 (50%), Positives = 15/30 (50%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P PPPP P M PPPP P PP P
Sbjct: 142 PQPPPPAPVVMMQPPPPHAMPPASPPAAQP 171
Score = 34.4 bits (79), Expect = 0.19
Identities = 14/44 (31%), Positives = 14/44 (31%), Gaps = 4/44 (9%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A V P P PP PP P P PP P P
Sbjct: 148 APVVMMQPPPPHAMPPASPPAAQPAPSAPASSP----PPTPASP 187
Score = 34.0 bits (78), Expect = 0.25
Identities = 11/36 (30%), Positives = 12/36 (33%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCH 598
PP PP P P P P PP + H
Sbjct: 162 PPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSH 197
Score = 30.6 bits (69), Expect = 3.8
Identities = 11/34 (32%), Positives = 11/34 (32%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P PP PP P P P P P P
Sbjct: 156 PPPHAMPPASPPAAQPAPSAPASSPPPTPASPPP 189
Score = 30.2 bits (68), Expect = 4.4
Identities = 12/38 (31%), Positives = 12/38 (31%), Gaps = 1/38 (2%)
Query: 555 PTGPKGGPPPPPP-PPGGMGPPPPPMPGMPGPPPPPMP 591
P P PP P P P P PPP P
Sbjct: 156 PPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAP 193
Score = 29.8 bits (67), Expect = 5.4
Identities = 15/51 (29%), Positives = 15/51 (29%), Gaps = 16/51 (31%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPP------------PMPGM----PGPPPPP 589
P P PPP PPP PM G P P PP
Sbjct: 168 AAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSPMAGTFYRSPAPGEPP 218
>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
Reviewed.
Length = 275
Score = 36.8 bits (86), Expect = 0.039
Identities = 18/50 (36%), Positives = 19/50 (38%), Gaps = 15/50 (30%)
Query: 558 PKGGP---PPPPPPPG-GMG-----------PPPPPMPGMPGPPPPPMPE 592
P G P PPPPG G G PP P PPPP+P
Sbjct: 217 PAGSPECEDQAPPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPP 266
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 37.4 bits (87), Expect = 0.042
Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 1/48 (2%)
Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
L+EV+AQ+ P+ PP P GM P P P PP
Sbjct: 156 LEEVEAQLQ-QRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPG 202
Score = 34.0 bits (78), Expect = 0.46
Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 3/46 (6%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
+Q A P P P PPP P MP PPP P +
Sbjct: 224 SQAPAQPPLPP---QLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQ 266
Score = 33.6 bits (77), Expect = 0.55
Identities = 11/50 (22%), Positives = 13/50 (26%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIINGKQA 607
P+G PP P P PP P P + Q
Sbjct: 181 PQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQP 230
Score = 32.4 bits (74), Expect = 1.2
Identities = 10/37 (27%), Positives = 13/37 (35%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P+ P P P PP+P PPP+
Sbjct: 208 QGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQ 244
Score = 32.4 bits (74), Expect = 1.2
Identities = 19/84 (22%), Positives = 23/84 (27%), Gaps = 3/84 (3%)
Query: 508 LEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPP 567
LEE Q+ + Q + L G P + Q P G P P
Sbjct: 156 LEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAF---PQQGPPEQPPGYPQPPQGHPE 212
Query: 568 PPGGMGPPPPPMPGMPGPPPPPMP 591
P P PP PP
Sbjct: 213 QVQPQQFLPAPSQAPAQPPLPPQL 236
Score = 32.0 bits (73), Expect = 1.8
Identities = 16/61 (26%), Positives = 19/61 (31%), Gaps = 13/61 (21%)
Query: 553 GIPTGPKGGPPPPP------------PPPGGMGPPPPPMP-GMPGPPPPPMPEMYTDCHK 599
G+ PP PP PPP P P +P G P PPP +
Sbjct: 250 GLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQ 309
Query: 600 N 600
Sbjct: 310 Q 310
Score = 30.1 bits (68), Expect = 6.6
Identities = 12/41 (29%), Positives = 12/41 (29%), Gaps = 5/41 (12%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP-----PPPM 590
P P P PPP P PP PPP
Sbjct: 239 QPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQ 279
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 37.0 bits (86), Expect = 0.047
Identities = 8/42 (19%), Positives = 9/42 (21%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A P + P P PPP P
Sbjct: 267 DAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNP 308
>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
Length = 767
Score = 37.1 bits (86), Expect = 0.048
Identities = 13/47 (27%), Positives = 14/47 (29%), Gaps = 1/47 (2%)
Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
LDE VAA P P P P + P P P
Sbjct: 44 LDEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIA-APAAVPAP 89
Score = 30.2 bits (68), Expect = 6.0
Identities = 10/42 (23%), Positives = 14/42 (33%), Gaps = 1/42 (2%)
Query: 551 AAGIPTGPKGGPPP-PPPPPGGMGPPPPPMPGMPGPPPPPMP 591
+ + PP P P P + P P + P P P
Sbjct: 48 LGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAP 89
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 37.3 bits (87), Expect = 0.049
Identities = 12/51 (23%), Positives = 13/51 (25%), Gaps = 2/51 (3%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM--PGMPGPPPPPMPEM 593
A + P P P P P P P P P P P
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPS 443
Score = 36.9 bits (86), Expect = 0.057
Identities = 9/45 (20%), Positives = 9/45 (20%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A P P PP P P P P
Sbjct: 464 AQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
Score = 36.9 bits (86), Expect = 0.061
Identities = 15/78 (19%), Positives = 16/78 (20%), Gaps = 1/78 (1%)
Query: 515 RQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP 574
A + A S AA P P P P P
Sbjct: 424 APAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ-PAPAPAAAPEPTAAPAP 482
Query: 575 PPPPMPGMPGPPPPPMPE 592
PP P P P
Sbjct: 483 APPAAPAPAAAPAAPAAP 500
Score = 36.1 bits (84), Expect = 0.089
Identities = 10/45 (22%), Positives = 11/45 (24%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
AA P+ P P P P P P P
Sbjct: 392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA 436
Score = 36.1 bits (84), Expect = 0.11
Identities = 14/37 (37%), Positives = 14/37 (37%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
G P PPPP P P P PP EM D
Sbjct: 752 AGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
Score = 35.7 bits (83), Expect = 0.12
Identities = 16/48 (33%), Positives = 17/48 (35%), Gaps = 4/48 (8%)
Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMPGPPPPPMPE 592
QV A + P PP P GPP P P P P P P
Sbjct: 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA 630
Score = 35.7 bits (83), Expect = 0.14
Identities = 11/42 (26%), Positives = 12/42 (28%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A+ P P P P P P P P PP
Sbjct: 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712
Score = 35.3 bits (82), Expect = 0.17
Identities = 12/40 (30%), Positives = 13/40 (32%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P G PPPPP P P P P P +
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
Score = 35.3 bits (82), Expect = 0.18
Identities = 5/42 (11%), Positives = 6/42 (14%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P P+
Sbjct: 389 GGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQ 430
Score = 34.6 bits (80), Expect = 0.28
Identities = 8/51 (15%), Positives = 12/51 (23%), Gaps = 5/51 (9%)
Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
RL+ ++ + G P P P P P
Sbjct: 377 RLE----RLERRLGVAGGAGAPAAAAPS-AAAAAPAAAPAPAAAAPAAAAA 422
Score = 34.2 bits (79), Expect = 0.34
Identities = 13/38 (34%), Positives = 13/38 (34%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P PP P P P PP PP E
Sbjct: 745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE 782
Score = 33.4 bits (77), Expect = 0.58
Identities = 13/45 (28%), Positives = 14/45 (31%), Gaps = 2/45 (4%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
+P P PP P G PPPP P P P
Sbjct: 736 DDPVP--LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
Score = 33.4 bits (77), Expect = 0.74
Identities = 11/44 (25%), Positives = 12/44 (27%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
+A P P PP P P P P P P
Sbjct: 726 QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769
Score = 33.0 bits (76), Expect = 0.79
Identities = 13/46 (28%), Positives = 13/46 (28%), Gaps = 1/46 (2%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
A A P P P P P P P P P PP
Sbjct: 403 AAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA-PAPAPPSPAGN 447
Score = 33.0 bits (76), Expect = 0.96
Identities = 10/62 (16%), Positives = 12/62 (19%)
Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
E ++ P K G G P PP P P
Sbjct: 638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA 697
Query: 590 MP 591
Sbjct: 698 PA 699
Score = 33.0 bits (76), Expect = 0.98
Identities = 11/49 (22%), Positives = 11/49 (22%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
D A P PP P P P P P P
Sbjct: 664 DGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712
Score = 32.3 bits (74), Expect = 1.6
Identities = 11/53 (20%), Positives = 12/53 (22%), Gaps = 4/53 (7%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP----PPPMPGMPGPPPPPMPEM 593
+ P GG P PPP P P P P
Sbjct: 436 APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAP 488
Score = 31.9 bits (73), Expect = 1.8
Identities = 12/45 (26%), Positives = 12/45 (26%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A AA P P P P P P P P P
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAP 449
Score = 31.9 bits (73), Expect = 2.1
Identities = 13/38 (34%), Positives = 13/38 (34%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P P PP P P PPP P
Sbjct: 742 PPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
Score = 31.9 bits (73), Expect = 2.2
Identities = 10/38 (26%), Positives = 10/38 (26%), Gaps = 1/38 (2%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P P G P P PE
Sbjct: 615 PAAPAAPAAPAAPAPAG-AAAAPAEASAAPAPGVAAPE 651
Score = 31.5 bits (72), Expect = 2.3
Identities = 12/51 (23%), Positives = 13/51 (25%), Gaps = 4/51 (7%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP----GMPGPPPPPMPEM 593
A G P+ P P P P P P P P P
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA 496
Score = 31.5 bits (72), Expect = 2.5
Identities = 11/47 (23%), Positives = 11/47 (23%), Gaps = 5/47 (10%)
Query: 551 AAGIPTGPKGGPPPPPP-----PPGGMGPPPPPMPGMPGPPPPPMPE 592
A GP PP P P P P G P
Sbjct: 593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEA 639
Score = 31.5 bits (72), Expect = 2.8
Identities = 14/50 (28%), Positives = 15/50 (30%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
A A P+ P P P P P G P PPP P
Sbjct: 415 AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464
Score = 31.1 bits (71), Expect = 3.5
Identities = 10/46 (21%), Positives = 11/46 (23%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
A AA P P P P P G P +
Sbjct: 676 AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721
Score = 31.1 bits (71), Expect = 3.6
Identities = 12/43 (27%), Positives = 13/43 (30%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A P P P P PP G P P P P +
Sbjct: 423 PAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465
Score = 30.7 bits (70), Expect = 4.0
Identities = 12/45 (26%), Positives = 12/45 (26%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
A A P P P P P PP G P P
Sbjct: 413 AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPP 457
Score = 30.7 bits (70), Expect = 4.1
Identities = 10/44 (22%), Positives = 10/44 (22%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A A P P P P PP P P
Sbjct: 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQ 723
Score = 30.7 bits (70), Expect = 4.2
Identities = 14/74 (18%), Positives = 15/74 (20%), Gaps = 1/74 (1%)
Query: 519 EAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPP 578
+ SSG P E R A A P G P P
Sbjct: 591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA-GAAAAPAEASAAPAPGVAA 649
Query: 579 MPGMPGPPPPPMPE 592
P P
Sbjct: 650 PEHHPKHVAVPDAS 663
Score = 30.3 bits (69), Expect = 5.5
Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 3/55 (5%)
Query: 545 EVKAQVAAGIPTGPKGGP---PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
AQ A P G P PP G P P PP P+ D
Sbjct: 696 AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPD 750
Score = 30.3 bits (69), Expect = 5.6
Identities = 11/43 (25%), Positives = 11/43 (25%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
A P P G P P P PP PP
Sbjct: 683 PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725
Score = 30.3 bits (69), Expect = 6.4
Identities = 9/42 (21%), Positives = 10/42 (23%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
+ G P P P P P P P P
Sbjct: 662 ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703
Score = 29.6 bits (67), Expect = 9.1
Identities = 12/51 (23%), Positives = 14/51 (27%), Gaps = 1/51 (1%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP-PPPPMPGMPGPPPPPMPEM 593
D+ A P P P P P PP M P M +
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDE 796
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 37.1 bits (86), Expect = 0.051
Identities = 24/124 (19%), Positives = 44/124 (35%), Gaps = 32/124 (25%)
Query: 493 SKTEEDRRVEDLS------AKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVE-----KN 541
K E +R V++ + +A+ + +A++ + Q +K L++L P + +
Sbjct: 540 PKDEVERMVQEAEKFAKEDKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPADVKEKVEA 599
Query: 542 RLDEVKAQVAAGIPTGPK---------------------GGPPPPPPPPGGMGPPPPPMP 580
+L E+K +A+G K G P P G G
Sbjct: 600 KLKELKDAIASGSTQKMKDAMAALNQEVMQIGQSLYNQPGAGGAGPAPGGEAGSSSSSSS 659
Query: 581 GMPG 584
G G
Sbjct: 660 GGDG 663
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 35.2 bits (81), Expect = 0.052
Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 4/51 (7%)
Query: 545 EVKAQVAAGIPTGPKGGP---PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
EV+A+ GI P P P P P P P P P P E
Sbjct: 35 EVEAK-GLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGE 84
Score = 29.4 bits (66), Expect = 5.3
Identities = 14/46 (30%), Positives = 15/46 (32%), Gaps = 1/46 (2%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
V+ QV P P P P P P P P PMP
Sbjct: 48 VQEQVPTPAPA-PAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMP 92
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.0 bits (85), Expect = 0.053
Identities = 14/51 (27%), Positives = 16/51 (31%), Gaps = 8/51 (15%)
Query: 555 PTGPKGGPPPPPPP-----PGG---MGPPPPPMPGMPGPPPPPMPEMYTDC 597
+G GPP P GG +GPPP P PP
Sbjct: 440 HSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPG 490
Score = 34.3 bits (78), Expect = 0.36
Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 2/40 (5%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPP--PPMPGMPGPPPPPMP 591
+P G P PPP P PP P+ P P P
Sbjct: 183 VPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPS 222
Score = 32.7 bits (74), Expect = 1.1
Identities = 12/52 (23%), Positives = 14/52 (26%), Gaps = 12/52 (23%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGM------------PGPPPPPMPEM 593
G PPP P PP G P PP+P +
Sbjct: 458 TSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPI 509
Score = 32.0 bits (72), Expect = 2.1
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKN 600
+AQ + P P PP + PP P MP + PP P+P++ HK+
Sbjct: 312 QAQPHSHTPPSQSALQPQQPPREQPL-PPAPSMPHIKPPPTTPIPQLPNQSHKH 364
Score = 30.8 bits (69), Expect = 3.8
Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 11/47 (23%)
Query: 555 PTGPKGGPPPPPP-----PPGGMGPP----PPPMPGMPG--PPPPPM 590
P+ P PPP P PP P P P MP PPPP +
Sbjct: 341 PSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPAL 387
Score = 30.8 bits (69), Expect = 4.3
Identities = 16/69 (23%), Positives = 21/69 (30%), Gaps = 10/69 (14%)
Query: 535 GRPVEKNRLDEVKAQVAAGIPTGPKGGPP--PPPPPP-----GGMGPPPPPMPGMPGPPP 587
G P+ + + + + P P P P PP P PP P P
Sbjct: 207 GSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQS 266
Query: 588 P---PMPEM 593
P P M
Sbjct: 267 SHHGPGPPM 275
Score = 30.0 bits (67), Expect = 6.7
Identities = 15/45 (33%), Positives = 15/45 (33%), Gaps = 4/45 (8%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMPGPPPPPMP 591
A G P P PP PP P G GP PP P
Sbjct: 464 AIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPP 508
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 35.8 bits (82), Expect = 0.057
Identities = 18/43 (41%), Positives = 19/43 (44%), Gaps = 4/43 (9%)
Query: 549 QVAAGIPTGPKGGP---PPPPPPPGGMGPPPPPMPGMPGPPPP 588
V G +GP P P PGG G P PP P PGP P
Sbjct: 159 AVRPGDLSGPGSRPTSGPVAGGLPGG-GAPAPPTPPRPGPTDP 200
Score = 30.0 bits (67), Expect = 4.9
Identities = 37/141 (26%), Positives = 51/141 (36%), Gaps = 23/141 (16%)
Query: 471 DFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLE 530
DF + R LD L E+ EDR + D +LEEA A A L+
Sbjct: 89 DFLIAHRRYLD-----PALGERLDEAEDR-LADQEEQLEEA-----AANACLLSGDGDPA 137
Query: 531 D--LSSGRP-----VEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
D +S + V+ +G + P GP P GG PP P
Sbjct: 138 DGWMSPEDSDLLIMWQLGSAPAVRPGDLSGPGSRPTSGPVAGGLPGGGAPAPPT-----P 192
Query: 584 GPPPPPMPEMYTDCHKNIING 604
P P P +DC ++ +G
Sbjct: 193 PRPGPTDPASESDCGRDSRDG 213
>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1. The
overall function of the full-length Med25 is efficiently
to coordinate the transcriptional activation of RAR/RXR
(retinoic acid receptor/retinoic X receptor) in higher
eukaryotic cells. Human Med25 consists of several
domains with different binding properties, the
N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
from residues 229-381, a PTOV(B) or ACID domain from
395-545, an SD2 domain from residues 564-645 and a
C-terminal NR box-containing domain (646-650) from
646-747. This The function of the SD domains is unclear.
Length = 168
Score = 35.2 bits (80), Expect = 0.059
Identities = 21/75 (28%), Positives = 25/75 (33%), Gaps = 8/75 (10%)
Query: 523 VQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGM 582
AQK +E + + KNR + A I P P P PPG G P P
Sbjct: 59 DAAQKAVEAAKNQKQGLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKPSPASQ 118
Query: 583 --------PGPPPPP 589
PG P
Sbjct: 119 LSLVTTVSPGSGLAP 133
Score = 30.2 bits (67), Expect = 3.2
Identities = 16/73 (21%), Positives = 23/73 (31%), Gaps = 12/73 (16%)
Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP---------PPPMPGM 582
L + + ++ K Q + P P +GPP PP PG
Sbjct: 54 LHAAHDAAQKAVEAAKNQKQG---LKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGA 110
Query: 583 PGPPPPPMPEMYT 595
P P P + T
Sbjct: 111 PKPSPASQLSLVT 123
>gnl|CDD|223010 PHA03215, PHA03215, nuclear protein UL24; Provisional.
Length = 262
Score = 36.0 bits (83), Expect = 0.063
Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 8/78 (10%)
Query: 513 MLRQEAEAKLVQAQKTLEDLSS-GRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGG 571
++ Q+ + +TL+ LS+ P+E G G + P G
Sbjct: 165 LVPQKITGDVTATVRTLQGLSTYAVPLEP-------RTTRRGRRRGARRPRSRRRPRGGA 217
Query: 572 MGPPPPPMPGMPGPPPPP 589
GPP P P P PP
Sbjct: 218 AGPPVPAEPAAPERPPAD 235
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 130
Score = 34.8 bits (80), Expect = 0.063
Identities = 14/52 (26%), Positives = 17/52 (32%), Gaps = 3/52 (5%)
Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP---PPPMP 591
++E+ A A P P P P P P P P PMP
Sbjct: 18 MEEIGAPAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPMP 69
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 36.6 bits (84), Expect = 0.064
Identities = 21/73 (28%), Positives = 23/73 (31%), Gaps = 2/73 (2%)
Query: 519 EAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPP 578
A +AQ+ RP A G P P PP G PP
Sbjct: 708 AAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP 767
Query: 579 M--PGMPGPPPPP 589
PG P P PPP
Sbjct: 768 AAAPGAPTPQPPP 780
Score = 33.9 bits (77), Expect = 0.51
Identities = 19/53 (35%), Positives = 19/53 (35%), Gaps = 12/53 (22%)
Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMG-----PPP--PPMP-----GMPGPPPPP 589
A G P P PP G PPP PP P G P P PPP
Sbjct: 748 AAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPP 800
Score = 32.7 bits (74), Expect = 1.2
Identities = 16/45 (35%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Query: 558 PKGGPPPPPPP---PGGMGPPPPPMPGMPGPPPPPMPEMYTDCHK 599
P+G P P PPP P M P PG GP + ++ T K
Sbjct: 790 PRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834
Score = 32.0 bits (72), Expect = 2.0
Identities = 11/39 (28%), Positives = 11/39 (28%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A P P PP P P PG PP
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPP 737
Score = 31.2 bits (70), Expect = 3.2
Identities = 13/41 (31%), Positives = 13/41 (31%), Gaps = 1/41 (2%)
Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
Q A G P P P PP G P G PP
Sbjct: 688 QWAPGTMQPPPRAPTPMRPPAAPPGRAQRPA-AATGRARPP 727
>gnl|CDD|148844 pfam07469, DUF1518, Domain of unknown function (DUF1518). This
domain, which is usually found tandemly repeated, is
found various receptor co-activating proteins.
Length = 56
Score = 32.5 bits (74), Expect = 0.068
Identities = 16/45 (35%), Positives = 17/45 (37%), Gaps = 1/45 (2%)
Query: 550 VAAGIPTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPPPPMPEM 593
G P P+ P P PP GMG P P P P PM
Sbjct: 4 GQMGNPRMPQAPPQQFPYPPNYGMGQQPDPAFTSPFSPQSPMMSP 48
Score = 29.8 bits (67), Expect = 0.61
Identities = 10/35 (28%), Positives = 10/35 (28%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P G P P P P M GP P
Sbjct: 22 PPNYGMGQQPDPAFTSPFSPQSPMMSPRMGPSQSP 56
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 36.5 bits (84), Expect = 0.077
Identities = 21/71 (29%), Positives = 28/71 (39%), Gaps = 5/71 (7%)
Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPP---PPPPPPGGMGPPPPPMP 580
Q +SS PV + + ++ Q P+ P G P PP GGM P PP +
Sbjct: 403 MQQGQPGMMSSPSPVPQVQTNQSMPQ--PPQPSVPSPGGPGSQPPQSVSGGMIPSPPALM 460
Query: 581 GMPGPPPPPMP 591
P P P
Sbjct: 461 PSPSPQMSQSP 471
Score = 30.4 bits (68), Expect = 5.6
Identities = 12/48 (25%), Positives = 13/48 (27%), Gaps = 4/48 (8%)
Query: 548 AQVA-AGIPTGPKGGPPPP---PPPPGGMGPPPPPMPGMPGPPPPPMP 591
QV P + P M PG P PPP P
Sbjct: 271 MQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRP 318
>gnl|CDD|219914 pfam08577, PI31_Prot_C, PI31 proteasome regulator. PI31 is a
cellular regulator of proteasome formation and of
proteasome-mediated antigen processing.
Length = 68
Score = 32.7 bits (75), Expect = 0.080
Identities = 15/50 (30%), Positives = 16/50 (32%), Gaps = 10/50 (20%)
Query: 552 AGIPTGPKGGP---PPPPPPPGGM-------GPPPPPMPGMPGPPPPPMP 591
G+ P P GGM G PP G GPPP P
Sbjct: 10 PGLGPPDPFDPLPAPLGGNGQGGMIFDPNRPGFGPPRGGGGDGPPPGVPP 59
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 36.0 bits (83), Expect = 0.084
Identities = 8/45 (17%), Positives = 10/45 (22%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
Q A P+ PPPP P +
Sbjct: 15 EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKD 59
Score = 32.6 bits (74), Expect = 1.2
Identities = 9/48 (18%), Positives = 9/48 (18%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
E A P PP P P P P
Sbjct: 14 AEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKP 61
Score = 30.3 bits (68), Expect = 5.8
Identities = 10/57 (17%), Positives = 11/57 (19%), Gaps = 4/57 (7%)
Query: 539 EKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMP 591
E + AA P P P P P P P
Sbjct: 13 EAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKP 69
Score = 30.3 bits (68), Expect = 6.2
Identities = 11/35 (31%), Positives = 11/35 (31%)
Query: 557 GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
G P P PP P PPPP P
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAP 46
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 36.4 bits (84), Expect = 0.089
Identities = 18/111 (16%), Positives = 24/111 (21%), Gaps = 1/111 (0%)
Query: 481 DVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEK 540
QP A + A A + A + E L++ R
Sbjct: 382 VAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
Query: 541 NRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A A P P P + P P P P
Sbjct: 442 RGPGGAPAPAPAPAAA-PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP 491
Score = 34.5 bits (79), Expect = 0.32
Identities = 12/50 (24%), Positives = 13/50 (26%), Gaps = 4/50 (8%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP----PPPMPEM 593
A AA P P P P P PPP E+
Sbjct: 453 APAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEEL 502
Score = 32.9 bits (75), Expect = 1.0
Identities = 7/55 (12%), Positives = 8/55 (14%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
A A PP P + P P
Sbjct: 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430
Score = 32.5 bits (74), Expect = 1.3
Identities = 17/73 (23%), Positives = 20/73 (27%)
Query: 515 RQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP 574
R A L A++ G P A A GP+ P P
Sbjct: 426 RSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485
Query: 575 PPPPMPGMPGPPP 587
P P PPP
Sbjct: 486 AAAPAPADDDPPP 498
Score = 30.6 bits (69), Expect = 4.5
Identities = 10/39 (25%), Positives = 10/39 (25%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
GP G P P P P P P P
Sbjct: 441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479
Score = 30.2 bits (68), Expect = 6.1
Identities = 9/36 (25%), Positives = 9/36 (25%)
Query: 556 TGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P P P P P P P P P
Sbjct: 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAA 414
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 35.8 bits (82), Expect = 0.100
Identities = 13/48 (27%), Positives = 13/48 (27%), Gaps = 16/48 (33%)
Query: 555 PTGPKGGPP-PPPPPPGGMGPP---------------PPPMPGMPGPP 586
P G G P P P P PPP P P P
Sbjct: 310 PAGRDGAAGGEPKPGPPRPAPDADRPEGWPSLEAITFPPPTPATPAVP 357
Score = 35.0 bits (80), Expect = 0.19
Identities = 12/54 (22%), Positives = 16/54 (29%), Gaps = 1/54 (1%)
Query: 535 GRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
G + + + + A G P P G P PG P P P
Sbjct: 279 GDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDG-AAGGEPKPGPPRPAPD 331
Score = 34.6 bits (79), Expect = 0.25
Identities = 16/72 (22%), Positives = 22/72 (30%), Gaps = 12/72 (16%)
Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKG----GPPP--PPPPPGGMGPPP------P 577
+D +P D + G GP+ G P P P P P
Sbjct: 280 DDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRPAPDADRPEGWP 339
Query: 578 PMPGMPGPPPPP 589
+ + PPP P
Sbjct: 340 SLEAITFPPPTP 351
Score = 30.4 bits (68), Expect = 4.9
Identities = 13/30 (43%), Positives = 13/30 (43%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
PP G PPP P G P PP PE
Sbjct: 255 PPYFEESKGYEPPPAPSGGSPAPPGDDRPE 284
Score = 30.4 bits (68), Expect = 5.7
Identities = 12/43 (27%), Positives = 13/43 (30%), Gaps = 8/43 (18%)
Query: 554 IPTGPKGGPPPPP--------PPPGGMGPPPPPMPGMPGPPPP 588
P P GG P PP P PG + P P
Sbjct: 266 PPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGP 308
>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding. This region
tends to appear at the N-terminus of proteins also
containing DNA-binding HMG (high mobility group) boxes
(pfam00505) and appears to bind the armadillo repeat of
CTNNB1 (beta-catenin), forming a stable complex.
Signaling by Wnt through TCF/LCF is involved in
developmental patterning, induction of neural tissues,
cell fate decisions and stem cell differentiation.
Isoforms of HMG T-cell factors lacking the N-terminal
CTNNB1-binding domain cannot fulfill their role as
transcriptional activators in T-cell differentiation.
Length = 200
Score = 34.9 bits (80), Expect = 0.11
Identities = 24/121 (19%), Positives = 37/121 (30%), Gaps = 16/121 (13%)
Query: 484 PLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRL 543
P E +KS+ ++ED+S K ++ + + L +G
Sbjct: 70 PRYESYRDKSR----EKLEDVSHKHQDMGYYKGPPYSGYPFLMLPNPYLPNGSLSPLPPS 125
Query: 544 DEVKAQVAAGIPTGPKGGPP---------PPPPPPGGMGPPPPPMPGMPGPPPPP-MPEM 593
V P P P PP + P G+P PP PP +
Sbjct: 126 SNKVPVVQP--PHHVHPLTPLITYSNEHFSPGTPPPHLPYDVDPKTGIPRPPHPPDISPF 183
Query: 594 Y 594
Y
Sbjct: 184 Y 184
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 35.0 bits (81), Expect = 0.11
Identities = 21/98 (21%), Positives = 32/98 (32%), Gaps = 10/98 (10%)
Query: 499 RRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDL---SSGRPVEKNR--LDEV--KAQVA 551
RR D L + I++++ A L QA +E+L + + + L +
Sbjct: 37 RRQPDAPYYLVQTILVQEAA---LKQANARIEELEAQAQHPQSQSSGGFLSGMFGGGAPR 93
Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P PP P G G P G P
Sbjct: 94 PPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQP 131
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 35.8 bits (82), Expect = 0.11
Identities = 16/42 (38%), Positives = 19/42 (45%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
++ P P P P P PP PP G+P P P MPE
Sbjct: 417 SSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPE 458
Score = 32.0 bits (72), Expect = 1.8
Identities = 18/42 (42%), Positives = 18/42 (42%), Gaps = 3/42 (7%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A AA P P PP PP G PP P P MP P P
Sbjct: 425 APSAAPSPRVPWDDAPPAPPRSGI---PPRPAPRMPEASPVP 463
>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
C4-dicarboxylate transport system [Signal transduction
mechanisms].
Length = 603
Score = 35.8 bits (83), Expect = 0.12
Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 12/87 (13%)
Query: 471 DFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLS---AKLEEAIMLRQEAEAKLVQAQK 527
+ RR ++ + + + R DL+ A+L+ I R++AEA L +AQ
Sbjct: 314 LWLRRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQD 373
Query: 528 TLEDLSSGRPVEKNRLDEVKAQVAAGI 554
L V+ +L Q++AGI
Sbjct: 374 EL--------VQAGKL-AALGQMSAGI 391
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 35.7 bits (82), Expect = 0.12
Identities = 15/39 (38%), Positives = 18/39 (46%), Gaps = 1/39 (2%)
Query: 555 PTGPKGGPPPPPPPPGGM-GPPPPPMPGMPGPPPPPMPE 592
P P P P PP + PP PP+ PPP P P+
Sbjct: 223 PFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQ 261
Score = 31.8 bits (72), Expect = 2.0
Identities = 16/70 (22%), Positives = 22/70 (31%), Gaps = 1/70 (1%)
Query: 522 LVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG 581
L +L +S P + + P P P P PP P
Sbjct: 197 LPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPP 256
Query: 582 MPGPPPPPMP 591
+P PP P +P
Sbjct: 257 VP-PPQPIIP 265
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 35.6 bits (83), Expect = 0.12
Identities = 12/47 (25%), Positives = 15/47 (31%)
Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
DE ++ G P+ GG PP P PG P
Sbjct: 380 PDEALRRLQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAA 426
Score = 35.3 bits (82), Expect = 0.19
Identities = 13/49 (26%), Positives = 14/49 (28%), Gaps = 1/49 (2%)
Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
RL E G GP GG P P P G P +
Sbjct: 386 RLQEGPPSPGGG-GGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAAL 433
Score = 34.5 bits (80), Expect = 0.33
Identities = 14/51 (27%), Positives = 15/51 (29%), Gaps = 10/51 (19%)
Query: 551 AAGIPT----------GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
AA +PT GP PPGG G P P P
Sbjct: 374 AADLPTPDEALRRLQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAP 424
Score = 32.6 bits (75), Expect = 1.2
Identities = 10/42 (23%), Positives = 12/42 (28%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
G P G G P P P G P +P+
Sbjct: 397 GGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPD 438
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 35.6 bits (82), Expect = 0.13
Identities = 12/28 (42%), Positives = 12/28 (42%), Gaps = 4/28 (14%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P PPPPP PPP P P P
Sbjct: 270 EPQPPPPP----PPPEPPEPEEEPDEPD 293
Score = 35.2 bits (81), Expect = 0.19
Identities = 12/38 (31%), Positives = 13/38 (34%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P PPP PP P P G +PE
Sbjct: 271 PQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
Score = 34.8 bits (80), Expect = 0.23
Identities = 13/31 (41%), Positives = 14/31 (45%), Gaps = 4/31 (12%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P PPP PPPPP P P P +
Sbjct: 268 EPEPQPPP----PPPPPEPPEPEEEPDEPDQ 294
Score = 34.4 bits (79), Expect = 0.28
Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 4/46 (8%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIIN 603
P+ P PPPPPP PP PP P P + I
Sbjct: 267 PEPEPQPPPPPP----PPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 34.7 bits (80), Expect = 0.18
Identities = 13/37 (35%), Positives = 13/37 (35%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
G G GG P PPP G PP G P
Sbjct: 200 GGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGPSSG 236
Score = 33.9 bits (78), Expect = 0.30
Identities = 15/36 (41%), Positives = 15/36 (41%), Gaps = 4/36 (11%)
Query: 557 GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
G GG P PPPPG PPP GP P
Sbjct: 205 GGGGGSGPGPPPPGFKSSFPPP----YGPGAGPSSG 236
Score = 32.0 bits (73), Expect = 1.4
Identities = 14/41 (34%), Positives = 14/41 (34%), Gaps = 1/41 (2%)
Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPP-PMPGMPGPPPPPMP 591
AG G GG G GPPPP P P P
Sbjct: 192 AGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAG 232
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 35.2 bits (81), Expect = 0.18
Identities = 13/41 (31%), Positives = 14/41 (34%)
Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
+ P PP P PPP P PP P PE
Sbjct: 386 STRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPE 426
Score = 33.2 bits (76), Expect = 0.64
Identities = 26/111 (23%), Positives = 41/111 (36%), Gaps = 14/111 (12%)
Query: 494 KTEEDRRVEDLSAKLEEAIMLRQEAE----AKLVQAQKTLEDL-------SSGR-PVEKN 541
+ DR + DL+A +E L++ ++ L + K L S G+ P+E
Sbjct: 297 NSGADRSLLDLTA--DEKAALQKVSQIANLEALTKWVKAFSQLDFQLRTTSYGQLPLELA 354
Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
++ + V A P P P P P P PP P+ E
Sbjct: 355 VIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRE 405
Score = 33.2 bits (76), Expect = 0.74
Identities = 13/48 (27%), Positives = 13/48 (27%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
K AA P P P P PP PPP P
Sbjct: 369 AKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416
Score = 32.5 bits (74), Expect = 1.1
Identities = 10/35 (28%), Positives = 10/35 (28%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P P PPP P PP P P
Sbjct: 399 PKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433
Score = 32.5 bits (74), Expect = 1.3
Identities = 10/46 (21%), Positives = 14/46 (30%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A+ A P+ + P P P PP + PP
Sbjct: 367 QPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPV 412
Score = 30.9 bits (70), Expect = 3.2
Identities = 12/53 (22%), Positives = 12/53 (22%)
Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P E R V P P P P P P PP
Sbjct: 399 PKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPK 451
Score = 30.5 bits (69), Expect = 4.5
Identities = 18/63 (28%), Positives = 19/63 (30%), Gaps = 17/63 (26%)
Query: 547 KAQVAAGIPTGP---KGGPPPPPPPPGGMGPPPPPMPGMP--------------GPPPPP 589
KA AA IP + PPP PP P P P PP P
Sbjct: 390 KAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449
Query: 590 MPE 592
E
Sbjct: 450 PKE 452
>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus. This family includes the
NUT protein. The gene encoding for NUT protein (Nuclear
Testis protein) is found fused to BRD3 or BRD4 genes, in
some aggressive types of carcinoma, due to chromosomal
translocations. Proteins of the BRD family contain two
bromodomains that bind transcriptionally active
chromatin through associations with acetylated histones
H3 and H4. Such proteins are crucial for the regulation
of cell cycle progression. On the other hand, little is
known about NUT protein. NUT is known to have a Nuclear
Export Sequence (NES) as well as a Nuclear Localization
Signal (NLS), both located towards the C-terminal end of
the protein. A fused NUT-GFP protein showed either
cytoplasmic or nuclear localization, suggesting that it
is subject to nuclear/cytoplasmic shuttling. Consistent
with this possibility, treatment with leptomycin B an
inhibitor of CRM1-dependent nuclear export resulted in
re-distribution of NUT-GFP to the nucleus. Inspection of
NUT revealed a C-terminal sequence similar to known
nuclear export sequences (NES) which are often regulated
by phosphorylation.
Length = 328
Score = 34.9 bits (80), Expect = 0.19
Identities = 12/32 (37%), Positives = 12/32 (37%), Gaps = 1/32 (3%)
Query: 556 TGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
T G P PPPPPP P P P
Sbjct: 148 TWILGLPLPPPPPPVAQLVPIVS-LENAWPGP 178
Score = 29.2 bits (65), Expect = 9.8
Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 1/30 (3%)
Query: 562 PPPPPPPPGGM-GPPPPPMPGMPGPPPPPM 590
PP P PP PPPP + P P +
Sbjct: 25 PPAPQPPDQPFWEPPPPLVTAGFSPGNPLV 54
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 35.2 bits (81), Expect = 0.19
Identities = 15/60 (25%), Positives = 15/60 (25%), Gaps = 12/60 (20%)
Query: 548 AQVAAGIPTGPKGGPPPPPP-----PPGGMGPPPPPMPGMPG-------PPPPPMPEMYT 595
Q G P G P P P G P P PP P MY
Sbjct: 406 QQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYP 465
>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 165 and 237 amino acids in length.
CD99L2 and CD99 are involved in trans-endothelial
migration of neutrophils in vitro and in the recruitment
of neutrophils into inflamed peritoneum.
Length = 154
Score = 33.6 bits (77), Expect = 0.19
Identities = 10/44 (22%), Positives = 10/44 (22%), Gaps = 3/44 (6%)
Query: 552 AGIPTGPKGGPPP---PPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P G GG P P P P P
Sbjct: 19 PATPKKPSKGDDDFDLEDALDGGNDGGAGPKPKPRPQPDPGNPG 62
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 33.3 bits (76), Expect = 0.19
Identities = 12/37 (32%), Positives = 14/37 (37%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P P P P P PPP P +P P +P
Sbjct: 87 PPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVP 123
Score = 33.3 bits (76), Expect = 0.20
Identities = 21/66 (31%), Positives = 27/66 (40%), Gaps = 1/66 (1%)
Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
+A+K L L+ P E ++ A +PT P P PP P P P P
Sbjct: 74 EARKILAPLTPPAPPEPVTPPTAQSP-APAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPS 132
Query: 584 GPPPPP 589
PP PP
Sbjct: 133 DPPQPP 138
Score = 31.7 bits (72), Expect = 0.67
Identities = 13/41 (31%), Positives = 13/41 (31%), Gaps = 3/41 (7%)
Query: 555 PTGPKGGPPPPP---PPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P PP P PP P P P P
Sbjct: 90 PVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPP 130
Score = 31.0 bits (70), Expect = 1.2
Identities = 12/29 (41%), Positives = 13/29 (44%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
PP PP P P P +P PPP P
Sbjct: 84 PPAPPEPVTPPTAQSPAPAVPTPPPTSTP 112
Score = 29.8 bits (67), Expect = 3.3
Identities = 13/38 (34%), Positives = 14/38 (36%), Gaps = 1/38 (2%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P+ PP P P PPP P PP P
Sbjct: 85 PAPPEPVTPPTAQSPAPAVPTPPP-TSTPAVPPAPAAA 121
>gnl|CDD|115029 pfam06345, Drf_DAD, DRF Autoregulatory Domain. This motif is found
in Diaphanous-related formins. It binds the N-terminal
GTPase-binding domain; this link is broken when
GTP-bound Rho binds to the GBD and activates the
protein. The addition of DAD to mammalian cells induces
actin filament formation, stabilises microtubules, and
activates serum-response mediated transcription.
Length = 15
Score = 30.4 bits (68), Expect = 0.21
Identities = 12/13 (92%), Positives = 13/13 (100%)
Query: 843 GVMDSLLEALQTG 855
GVMDSLLEALQ+G
Sbjct: 1 GVMDSLLEALQSG 13
>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut. This
domain is often found at the very N-terminal of
argonaut-like proteins.
Length = 102
Score = 32.6 bits (74), Expect = 0.22
Identities = 14/44 (31%), Positives = 17/44 (38%), Gaps = 9/44 (20%)
Query: 560 GGPP---------PPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
GGPP PP G G GPP P +PE++
Sbjct: 1 GGPPEYQGRGRGGPPQQGGRGGGGGGRGGGSTGGPPRPSVPELH 44
Score = 30.7 bits (69), Expect = 0.79
Identities = 12/47 (25%), Positives = 13/47 (27%), Gaps = 10/47 (21%)
Query: 553 GIPTGPKGGPPPPPPP----------PGGMGPPPPPMPGMPGPPPPP 589
G G GGPP P P + P P PP
Sbjct: 26 GRGGGSTGGPPRPSVPELHQATQAPYQAAVSTQPAPSEASSSSQPPE 72
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 34.1 bits (78), Expect = 0.22
Identities = 13/38 (34%), Positives = 13/38 (34%), Gaps = 1/38 (2%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGM-GPPPPPMPGMPGPPP 587
A GP GG P G GP G GPP
Sbjct: 32 ANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPG 69
Score = 32.9 bits (75), Expect = 0.50
Identities = 10/40 (25%), Positives = 10/40 (25%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A G P P GG G P GP
Sbjct: 21 PAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGG 60
Score = 32.5 bits (74), Expect = 0.85
Identities = 12/41 (29%), Positives = 12/41 (29%), Gaps = 5/41 (12%)
Query: 553 GIPTGPKGGPPPPP-----PPPGGMGPPPPPMPGMPGPPPP 588
G P G GGP P GG G P G
Sbjct: 37 GGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPGGGEVAGEA 77
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 34.3 bits (78), Expect = 0.23
Identities = 35/134 (26%), Positives = 47/134 (35%), Gaps = 34/134 (25%)
Query: 491 EKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQV 550
EK EE R E ++ +L+ I + E +A + + + +P N + + QV
Sbjct: 99 EKRLLEEKERQEQMAEQLQR-ISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQPQV 157
Query: 551 AAGIPTGPK----------GGPPPP-----------PPPPGGMGPPP-----PPMPGM-- 582
GP G P P PP GM P PMPGM
Sbjct: 158 Q--PQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPP 215
Query: 583 -PGPPPPP--MPEM 593
PG P MP M
Sbjct: 216 RPGFNQNPNMMPNM 229
Score = 31.6 bits (71), Expect = 1.9
Identities = 14/42 (33%), Positives = 16/42 (38%), Gaps = 2/42 (4%)
Query: 555 PTGPKGGPPPPP--PPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
P + G P P PP G P MP M P P P +
Sbjct: 200 PNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGF 241
Score = 31.2 bits (70), Expect = 2.2
Identities = 14/34 (41%), Positives = 14/34 (41%), Gaps = 3/34 (8%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGPPP---PPMPGMP 583
G P GP P P GM PPP P G P
Sbjct: 245 GTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFP 278
Score = 29.3 bits (65), Expect = 8.7
Identities = 13/41 (31%), Positives = 13/41 (31%), Gaps = 2/41 (4%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGPPP--PPMPGMPGPPPPPMP 591
G P G P P M P P GM PP P
Sbjct: 233 GFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGP 273
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.1 bits (81), Expect = 0.23
Identities = 23/78 (29%), Positives = 37/78 (47%), Gaps = 6/78 (7%)
Query: 486 VEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGR-----PVEK 540
+ ++ DRR+E L +LEE +E E KL + +K + SG+ VEK
Sbjct: 468 DKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEK 527
Query: 541 NRLDEV-KAQVAAGIPTG 557
L+ + +A+ GI G
Sbjct: 528 LTLEAIEEAEEEYGIKEG 545
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 35.0 bits (81), Expect = 0.23
Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 8/66 (12%)
Query: 493 SKTEEDRRVEDLSA------KLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEV 546
S+ E +R V++ A K +E I R A++ QA+KTL++ P E+ +++
Sbjct: 499 SEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEK--EKI 556
Query: 547 KAQVAA 552
+ VA
Sbjct: 557 EKAVAE 562
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 34.8 bits (80), Expect = 0.25
Identities = 15/51 (29%), Positives = 16/51 (31%), Gaps = 7/51 (13%)
Query: 549 QVAAGIPTGPKGGPPPPPPP-------PGGMGPPPPPMPGMPGPPPPPMPE 592
+ I P G P PPPP P P P G GP P
Sbjct: 81 PIRIAITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPP 131
>gnl|CDD|219753 pfam08226, DUF1720, Domain of unknown function (DUF1720). This
domain is found in different combinations with cortical
patch components EF hand, SH3 and ENTH and is therefore
likely to be involved in cytoskeletal processes. This
family contains many hypothetical proteins.
Length = 73
Score = 31.3 bits (71), Expect = 0.29
Identities = 9/44 (20%), Positives = 12/44 (27%), Gaps = 5/44 (11%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPP-----PPMPGMPGPPPPPMPEM 593
TG + P G + P P P G P+
Sbjct: 5 QTGYQPPQQQQPQQQGPLQPQPTGFMQPQPTGFGQQQQGLQPQQ 48
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.4 bits (80), Expect = 0.31
Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 10/82 (12%)
Query: 472 FRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLE----EAIMLRQEAEAKLVQAQK 527
F ++R L ++E A+K E+ ++ +L A LE E +EAEA L +A+K
Sbjct: 491 FEIAKRLGLPEN-IIEE-AKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEK 548
Query: 528 TLEDLSSGRPVEKNRLDEVKAQ 549
E+L +K +L E + +
Sbjct: 549 LKEELEE----KKEKLQEEEDK 566
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 32.6 bits (73), Expect = 0.37
Identities = 9/49 (18%), Positives = 10/49 (20%), Gaps = 1/49 (2%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG-MPGPPPPPMPE 592
E +V A G P P P PE
Sbjct: 45 EKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPE 93
Score = 32.6 bits (73), Expect = 0.38
Identities = 5/35 (14%), Positives = 6/35 (17%), Gaps = 1/35 (2%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
+ P P P PE
Sbjct: 68 EEPEPEEEGEEEPEPEEEGEEEPE-PEETGEEEPE 101
Score = 30.7 bits (68), Expect = 1.9
Identities = 11/42 (26%), Positives = 12/42 (28%), Gaps = 4/42 (9%)
Query: 555 PTGP-KGGPPPPPPP---PGGMGPPPPPMPGMPGPPPPPMPE 592
G + P P G P P P P P PE
Sbjct: 74 EEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPE 115
Score = 29.2 bits (64), Expect = 6.3
Identities = 12/38 (31%), Positives = 13/38 (34%), Gaps = 1/38 (2%)
Query: 556 TGPKGGPPPPP-PPPGGMGPPPPPMPGMPGPPPPPMPE 592
P+ P P G P P P P P P PE
Sbjct: 80 PEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPE 117
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 33.5 bits (77), Expect = 0.40
Identities = 14/65 (21%), Positives = 17/65 (26%)
Query: 527 KTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP 586
+ ED N A A+ P+ P P P P P P P
Sbjct: 156 EEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA 215
Query: 587 PPPMP 591
P
Sbjct: 216 PSSFQ 220
Score = 32.3 bits (74), Expect = 0.96
Identities = 14/41 (34%), Positives = 17/41 (41%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
+ +P P PPP P P PG PPPPP+
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPV 248
Score = 32.3 bits (74), Expect = 1.1
Identities = 13/42 (30%), Positives = 14/42 (33%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P+ P PP P PPP P PPP P
Sbjct: 200 PPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241
Score = 32.0 bits (73), Expect = 1.4
Identities = 10/33 (30%), Positives = 10/33 (30%), Gaps = 2/33 (6%)
Query: 562 PPPPPPPPGGMGPP--PPPMPGMPGPPPPPMPE 592
PP P PP P PP P P
Sbjct: 212 LPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPP 244
Score = 31.6 bits (72), Expect = 2.0
Identities = 13/73 (17%), Positives = 19/73 (26%)
Query: 517 EAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP 576
E + A ++ G + ++ P P P PP
Sbjct: 156 EEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA 215
Query: 577 PPMPGMPGPPPPP 589
P PPP P
Sbjct: 216 PSSFQSDTPPPSP 228
Score = 30.0 bits (68), Expect = 5.3
Identities = 17/87 (19%), Positives = 25/87 (28%), Gaps = 10/87 (11%)
Query: 520 AKLVQAQKTLEDLSSGRPVEKNRLDEV--KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPP 577
A++ +A K ED + G P+++ D P P P P P
Sbjct: 136 ARIHKALKEGEDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVP 195
Query: 578 PMPG--------MPGPPPPPMPEMYTD 596
P PP +D
Sbjct: 196 SFPSPPEDPSSPSDSSLPPAPSSFQSD 222
Score = 29.7 bits (67), Expect = 6.9
Identities = 8/35 (22%), Positives = 8/35 (22%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
PP P PP P P P
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAP 243
Score = 29.3 bits (66), Expect = 9.5
Identities = 10/37 (27%), Positives = 12/37 (32%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
P+ G P P PP P +P P P
Sbjct: 188 PSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTP 224
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 34.3 bits (78), Expect = 0.41
Identities = 14/38 (36%), Positives = 14/38 (36%), Gaps = 4/38 (10%)
Query: 558 PKGGPPPPP--PPPGGMGPPPP--PMPGMPGPPPPPMP 591
P PPP P G P P PG P PP P
Sbjct: 736 PDQAPPPSHQAPYSGHEEPQAQQAPYPGYWEPRPPQAP 773
>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
Length = 248
Score = 33.4 bits (76), Expect = 0.41
Identities = 13/23 (56%), Positives = 13/23 (56%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPG 584
PPPPPPPP P PP PG
Sbjct: 120 PPPPPPPPARAEPAPPVARPAPG 142
Score = 32.6 bits (74), Expect = 0.66
Identities = 13/28 (46%), Positives = 13/28 (46%), Gaps = 5/28 (17%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
PPPPPPP PP P P P P
Sbjct: 119 PPPPPPP-----PPARAEPAPPVARPAP 141
Score = 30.3 bits (68), Expect = 3.8
Identities = 12/27 (44%), Positives = 12/27 (44%), Gaps = 4/27 (14%)
Query: 565 PPPPPGGMGPPPPPMPGMPGPPPPPMP 591
PPPPP PPPP P P P
Sbjct: 119 PPPPP----PPPPARAEPAPPVARPAP 141
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 34.1 bits (79), Expect = 0.43
Identities = 11/43 (25%), Positives = 11/43 (25%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
A A K P P PP P PP P
Sbjct: 53 AAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA 95
Score = 32.9 bits (76), Expect = 0.90
Identities = 10/46 (21%), Positives = 10/46 (21%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
AA P P P PP P P P
Sbjct: 64 AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPA 109
Score = 32.6 bits (75), Expect = 1.3
Identities = 17/71 (23%), Positives = 19/71 (26%), Gaps = 5/71 (7%)
Query: 526 QKTLEDLSSGRP-----VEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
Q+ L D +S P A AA P P P P P
Sbjct: 16 QQYLADPNSVDPSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP 75
Query: 581 GMPGPPPPPMP 591
P PP P
Sbjct: 76 AAAAPAAPPKP 86
Score = 32.2 bits (74), Expect = 1.7
Identities = 11/61 (18%), Positives = 12/61 (19%), Gaps = 1/61 (1%)
Query: 529 LEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
D G A A+ P P P P P P P
Sbjct: 33 FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK-PAAAAA 91
Query: 589 P 589
Sbjct: 92 A 92
Score = 31.0 bits (71), Expect = 4.0
Identities = 11/41 (26%), Positives = 11/41 (26%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
AA P P PP PP P P P
Sbjct: 59 AAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAP 99
Score = 30.6 bits (70), Expect = 4.5
Identities = 8/44 (18%), Positives = 8/44 (18%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A AA P P P P P
Sbjct: 49 AAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92
Score = 30.6 bits (70), Expect = 5.1
Identities = 8/45 (17%), Positives = 9/45 (20%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
A+ A P P PP P P
Sbjct: 63 AAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 33.6 bits (76), Expect = 0.45
Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 1/36 (2%)
Query: 551 AAGIPT-GPKGGPPPPPPPPGGMGPPPPPMPGMPGP 585
AA +P P PP P P P+P P
Sbjct: 106 AAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQP 141
Score = 33.2 bits (75), Expect = 0.72
Identities = 9/30 (30%), Positives = 11/30 (36%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P P P PP P P+P+
Sbjct: 108 PVPAPAPAPAAAPPAAPAPAADTPAAPIPD 137
>gnl|CDD|227934 COG5647, COG5647, Cullin, a subunit of E3 ubiquitin ligase
[Posttranslational modification, protein turnover,
chaperones].
Length = 773
Score = 34.0 bits (78), Expect = 0.46
Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 3/49 (6%)
Query: 7 LEEQENMLEKLNPE---QLNQKFEDMLNDMNLSDEKKEPLRRQPLANKK 52
+ + M+ L + K E M D++LS E E + P + K
Sbjct: 482 AQAELKMISMLKKVCGQEFTSKLEGMFRDISLSSEFTEAFQHSPQSYNK 530
Score = 33.2 bits (76), Expect = 0.79
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 1/54 (1%)
Query: 61 GTVTSYEN-KKQENMLEKLNPEQLNQKFEDMLNDMNLSDEKKEPLRRQPLANKK 113
G S + K +ML+K+ ++ K E M D++LS E E + P + K
Sbjct: 477 GRSASAQAELKMISMLKKVCGQEFTSKLEGMFRDISLSSEFTEAFQHSPQSYNK 530
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 33.8 bits (78), Expect = 0.47
Identities = 13/39 (33%), Positives = 16/39 (41%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P P+ PPPPPPPP PP + E+
Sbjct: 262 PQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
Score = 31.9 bits (73), Expect = 1.8
Identities = 12/28 (42%), Positives = 12/28 (42%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P PP PP PPPPP PP
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDE 286
Score = 31.5 bits (72), Expect = 2.1
Identities = 14/30 (46%), Positives = 14/30 (46%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P PP PP PPPPP P PP E
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEE 288
Score = 31.1 bits (71), Expect = 3.0
Identities = 14/43 (32%), Positives = 16/43 (37%), Gaps = 5/43 (11%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
V A A +P P+ PPPPP PPP P
Sbjct: 250 VLAPRATRLPAPPQPPEEEPPPPP-----PPPEDDDDPPEDEE 287
Score = 30.0 bits (68), Expect = 6.9
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 6/51 (11%)
Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
++E +AA + P+ P PP P PPPPP PPP +
Sbjct: 237 AVEEEDLALAARLVLAPRATRLPAPPQPPEEEPPPPP------PPPEDDDD 281
>gnl|CDD|223006 PHA03179, PHA03179, UL43 envelope protein; Provisional.
Length = 387
Score = 33.7 bits (77), Expect = 0.47
Identities = 9/33 (27%), Positives = 9/33 (27%), Gaps = 5/33 (15%)
Query: 557 GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
G PP P P P PPP
Sbjct: 202 VIGAGESLSRRPPED-----AERPTAPDPEPPP 229
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 33.4 bits (76), Expect = 0.49
Identities = 14/41 (34%), Positives = 14/41 (34%), Gaps = 4/41 (9%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGP----PPPPMPEMY 594
PP P P M PP P PP P P MY
Sbjct: 187 QPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMY 227
Score = 33.0 bits (75), Expect = 0.73
Identities = 13/39 (33%), Positives = 14/39 (35%), Gaps = 2/39 (5%)
Query: 555 PTGPKGGPPPPP--PPPGGMGPPPPPMPGMPGPPPPPMP 591
+ P P PPP PPP P P PPP
Sbjct: 109 SVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQV 147
Score = 30.7 bits (69), Expect = 4.2
Identities = 15/45 (33%), Positives = 16/45 (35%), Gaps = 2/45 (4%)
Query: 555 PTGPKGGPPPPPPP--PGGMGPPPPPMPGMPGPPPPPMPEMYTDC 597
P+ GPP P P G G P P PPP E Y
Sbjct: 212 PSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYS 256
Score = 30.3 bits (68), Expect = 5.4
Identities = 11/41 (26%), Positives = 12/41 (29%), Gaps = 4/41 (9%)
Query: 555 PTGPKGGPPPPPPPPGGMGPP----PPPMPGMPGPPPPPMP 591
P + P PPP PP P PPP
Sbjct: 131 PPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQ 171
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 33.2 bits (76), Expect = 0.51
Identities = 10/36 (27%), Positives = 11/36 (30%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
P P PP P P P P P P+
Sbjct: 66 QPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKP 101
Score = 29.0 bits (65), Expect = 9.4
Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%)
Query: 534 SGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
V + +AA + P P P P PP P PP P P P PP P+
Sbjct: 28 QEDFVGIELVPLAVFLLAAKVLEAPTEEPQPEPEPPE-EQPKPPTEPETPPEPTPPKPK 85
>gnl|CDD|227003 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
RnfC [Energy production and conversion].
Length = 529
Score = 33.5 bits (77), Expect = 0.51
Identities = 21/107 (19%), Positives = 42/107 (39%), Gaps = 1/107 (0%)
Query: 784 IKTFKDSFYQAWQENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQQG 843
++ F+ + + +L++AEE R + E EK ++ R K
Sbjct: 423 VQYFRQEKAEILAQRQELKKAEEAKTRFEARTARLEREKAERIPRMKKAAVKRLSFVTAA 482
Query: 844 VMDSLLEALQTGRPKKTGSSIKSVGCPSHS-ALQTGSAFTREQRRKR 889
++D++ AL+ + KK + S + +G RE R +
Sbjct: 483 LIDAIAAALERVKEKKAYAKQPSAALGAPILPSNSGLVAAREARAYQ 529
>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
Length = 136
Score = 31.9 bits (73), Expect = 0.51
Identities = 9/30 (30%), Positives = 9/30 (30%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
G P P P G PP PP
Sbjct: 101 GNGRPTPSSPDAAGPAPPDATPLAPGERPP 130
Score = 30.4 bits (69), Expect = 1.8
Identities = 10/34 (29%), Positives = 10/34 (29%), Gaps = 4/34 (11%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
G P P PP P PG PP
Sbjct: 101 GNGRPTPSSPDAAGP----APPDATPLAPGERPP 130
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human Gammaherpesvirus
that infects and establishes latency in B lymphocytes in
vivo. The latent membrane protein 2 (LMP2) gene is
expressed in latently infected B cells and encodes two
protein isoforms, LMP2A and LMP2B, that are identical
except for an additional N-terminal 119 aa cytoplasmic
domain which is present in the LMP2A isoform. LMP2A is
thought to play a key role in either the establishment
or the maintenance of latency and/or the reactivation of
productive infection from the latent state. The
significance of LMP2B and its role in pathogenesis
remain unclear.
Length = 489
Score = 33.7 bits (77), Expect = 0.51
Identities = 14/42 (33%), Positives = 15/42 (35%), Gaps = 5/42 (11%)
Query: 553 GIPTGPKGGPPPPPP-----PPGGMGPPPPPMPGMPGPPPPP 589
G P G +G P P GPP P P PPP
Sbjct: 18 GGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPP 59
Score = 32.5 bits (74), Expect = 1.0
Identities = 14/48 (29%), Positives = 14/48 (29%), Gaps = 11/48 (22%)
Query: 555 PTGPKGGPPPPPPPPGGMG-----PP-----PPPMPGMPGPPPPPMPE 592
P G GGP P G G P PG P P P
Sbjct: 8 PLGA-GGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPS 54
Score = 30.6 bits (69), Expect = 4.7
Identities = 9/31 (29%), Positives = 9/31 (29%), Gaps = 3/31 (9%)
Query: 561 GPPPPPPPPGGMGPPPP---PMPGMPGPPPP 588
GPP P PPP G P
Sbjct: 43 GPPVPEDYDAPSHRPPPYGGSNGDRHGGYQP 73
>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
Length = 1250
Score = 33.9 bits (77), Expect = 0.52
Identities = 18/43 (41%), Positives = 22/43 (51%), Gaps = 5/43 (11%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIINGK 605
PP PP P P P P P P P P P PE T ++ ++N K
Sbjct: 920 PPSPPDP---DPTPDPDP-TPDPDPTPDPEP-TPAYQPVLNAK 957
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 33.7 bits (77), Expect = 0.52
Identities = 12/46 (26%), Positives = 13/46 (28%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
K +A PPPP P P P P PP
Sbjct: 201 KPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSG 246
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 33.7 bits (77), Expect = 0.52
Identities = 13/55 (23%), Positives = 14/55 (25%), Gaps = 3/55 (5%)
Query: 537 PVEKNRL--DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P + R D V P P P P G P P P P
Sbjct: 317 PDTEGREGDDLAPTAVCTDAGLLP-DTPLLPDAPEGSSDPVVPVAAATPVDASLP 370
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 33.5 bits (77), Expect = 0.55
Identities = 14/36 (38%), Positives = 15/36 (41%), Gaps = 5/36 (13%)
Query: 562 PPPPPPP----PGGMGPPP-PPMPGMPGPPPPPMPE 592
P P P P PG +GP P P P PP P
Sbjct: 227 PLPSPQPVRHSPGILGPSPLHPHTTRPSPPRPAFSR 262
Score = 30.8 bits (70), Expect = 3.5
Identities = 14/40 (35%), Positives = 15/40 (37%), Gaps = 5/40 (12%)
Query: 557 GPKGGPPPPPPP--PG---GMGPPPPPMPGMPGPPPPPMP 591
PKG P P + PPPPP P P P P
Sbjct: 44 HPKGTSSFPQLLRHPSLRLRLHPPPPPCPQNPRDFPSLQP 83
>gnl|CDD|144655 pfam01141, Gag_p12, Gag polyprotein, inner coat protein p12. The
retroviral p12 is a virion structural protein. p12 is
proline rich. The function carried out by p12 in
assembly and replication is unknown. p12 is associated
with pathogenicity of the virus.
Length = 85
Score = 31.0 bits (70), Expect = 0.58
Identities = 10/30 (33%), Positives = 10/30 (33%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
P GP P PP G P P P
Sbjct: 34 PYRGPGPSPPAGEGDEEEAATTSEAPDPSP 63
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 33.4 bits (76), Expect = 0.59
Identities = 13/59 (22%), Positives = 16/59 (27%), Gaps = 4/59 (6%)
Query: 538 VEKNRLDEVKAQVA----AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
++ EV +A A P P G P P P P P P
Sbjct: 185 APEDDTVEVGTVLAIIGDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAP 243
Score = 31.5 bits (71), Expect = 2.1
Identities = 12/52 (23%), Positives = 15/52 (28%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
E +A + + P P P P P P P P P D
Sbjct: 212 EEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGD 263
Score = 30.8 bits (69), Expect = 4.7
Identities = 13/54 (24%), Positives = 16/54 (29%), Gaps = 4/54 (7%)
Query: 539 EKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
+ N A+ A P+ P P P P PP P P P
Sbjct: 202 DANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAP----APAPAKTA 251
>gnl|CDD|216325 pfam01140, Gag_MA, Matrix protein (MA), p15. The matrix protein,
p15, is encoded by the gag gene. MA is involved in
pathogenicity.
Length = 129
Score = 31.7 bits (72), Expect = 0.62
Identities = 15/38 (39%), Positives = 17/38 (44%), Gaps = 5/38 (13%)
Query: 562 PPP--PP--PPPGGMGPPPPPMPGMPGPPPPPMP-EMY 594
PPP P PP + P P P P P PP P +Y
Sbjct: 92 PPPWVKPFVDPPKVLLPSSTPKPVSPSPSAPPRPSSLY 129
>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP. This family of
genes is always found in type III secretion operons,
althought its function in the processes of secretion and
virulence is unclear. Hpa stands for Hrp-associated
gene, where Hrp stands for hypersensitivity response and
virulence.
Length = 201
Score = 32.6 bits (74), Expect = 0.64
Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 1/55 (1%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPPPP 589
P R AQ+ P P PPPP P P PPP
Sbjct: 14 DPARPARRRTPLAQLRRRDALAYAPPPRPEPPPPCDEDRPEPRADTRASDPPPEA 68
Score = 31.8 bits (72), Expect = 1.0
Identities = 12/42 (28%), Positives = 12/42 (28%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
PP P PPP P P PPP D
Sbjct: 32 DALAYAPPPRPEPPPPCDEDRPEPRADTRASDPPPEAPTDAD 73
>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668). Members
in this family of proteins are annotated as Cysteine and
tyrosine-rich protein 1, however currently no function
is known.
Length = 154
Score = 32.1 bits (73), Expect = 0.68
Identities = 18/64 (28%), Positives = 20/64 (31%), Gaps = 17/64 (26%)
Query: 538 VEKNRLDEVKAQVAAGIPTGPKGGPPPP-------------PPPPGGMGPPPPPMPGMPG 584
++ NR V AA I PPP PPP P P
Sbjct: 88 MKNNRGTRVGVIRAAHINAISYPMAPPPYTYDHEMEYPTDLPPPYS----PAPQASAQRS 143
Query: 585 PPPP 588
PPPP
Sbjct: 144 PPPP 147
Score = 31.0 bits (70), Expect = 1.3
Identities = 11/24 (45%), Positives = 11/24 (45%), Gaps = 2/24 (8%)
Query: 562 PPP--PPPPPGGMGPPPPPMPGMP 583
PPP P P PPPP PG
Sbjct: 129 PPPYSPAPQASAQRSPPPPYPGNS 152
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 32.2 bits (73), Expect = 0.69
Identities = 13/41 (31%), Positives = 16/41 (39%)
Query: 556 TGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
P+ P PP P P PG P P P+P + D
Sbjct: 116 QNPQPQQPTHTQPPVQPQQPADPQPGQPMFPMQPLPPLVPD 156
>gnl|CDD|220403 pfam09788, Tmemb_55A, Transmembrane protein 55A. Members of this
family catalyze the hydrolysis of the 4-position
phosphate of phosphatidylinositol 4,5-bisphosphate, in
the reaction: 1-phosphatidyl-myo-inositol
4,5-bisphosphate + H(2)O =
1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Length = 258
Score = 32.9 bits (75), Expect = 0.71
Identities = 11/37 (29%), Positives = 14/37 (37%)
Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP 586
G+ G +G PP PP P P + PP
Sbjct: 17 GGTGVSPGDEGAGPPSKPPYDQNFAPFPVLLSELPPP 53
>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
Provisional.
Length = 589
Score = 33.2 bits (76), Expect = 0.73
Identities = 30/129 (23%), Positives = 43/129 (33%), Gaps = 22/129 (17%)
Query: 478 FQLDVQPLVEHLAEKSK--TEED-RRVEDLSA-----KLEEAIM-------LRQEAEAKL 522
+QL + P+ E EED RR+ D SA E+ + L +E +
Sbjct: 8 YQLPIVPIYEGHVRNILRCPEEDLRRLRDDSALRLRRYREDLLRDRLLRRRLGEELDDLQ 67
Query: 523 VQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGM 582
+ Q EDL S R+ E +A + G GG G +
Sbjct: 68 KRLQTECEDLRS-------RVSEAEALLLHDASGGEGGGATNGGEVNVDGGADRTWLAQS 120
Query: 583 PGPPPPPMP 591
P P P
Sbjct: 121 PERPADGGP 129
>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein. Members of
this family confer resistance to the metalloid element
tellurium and its salts.
Length = 98
Score = 30.8 bits (70), Expect = 0.73
Identities = 11/26 (42%), Positives = 11/26 (42%), Gaps = 3/26 (11%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPP 587
P P PPP P PP P PP
Sbjct: 1 PAAPVPPP---APAPPAPAPPPAAPP 23
>gnl|CDD|223054 PHA03359, PHA03359, UL17 tegument protein; Provisional.
Length = 686
Score = 33.1 bits (76), Expect = 0.75
Identities = 32/159 (20%), Positives = 50/159 (31%), Gaps = 49/159 (30%)
Query: 454 LEECVSQIVLHRGGCDPDFRSSRRFQLDVQPLV------EHLAEKSKTEEDRRVEDLSAK 507
LE C+ + R DP + + + L A+ E+DR
Sbjct: 196 LEACLGE--DGRSNGDPVLSADAYIDPE-EQLTAPGGFDTPAADARIEEDDR-------- 244
Query: 508 LEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKG------- 560
I+ L++ + E ++S PV EV VA+G+ G +
Sbjct: 245 ---EIL-------SLIR--RAAEVVASRHPVRAAAPREVGGAVASGLRQGARALGAAGAA 292
Query: 561 GPPPPPP----------PPGG---MGPPPPPMPGMPGPP 586
G PPG +G P +P GP
Sbjct: 293 GAASGAADADAVLAGLEPPGKGRFVGSGNPRVPDPEGPV 331
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 32.9 bits (75), Expect = 0.76
Identities = 21/100 (21%), Positives = 33/100 (33%), Gaps = 7/100 (7%)
Query: 492 KSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVA 551
+ K E + E A+ EA Q A + ++ + SS + + + A
Sbjct: 320 RVKHPEAGKEEPDGARNAEAKEPAQPATST--SSKGS----SSAQNKDSGSTGPGSSLAA 373
Query: 552 AGIPT-GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
A G PP P P + P PP P+
Sbjct: 374 ASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPL 413
>gnl|CDD|238560 cd01140, FatB, Siderophore binding protein FatB. These proteins
have been shown to function as ABC-type initial
receptors in the siderophore-mediated iron uptake in
some eubacterial species. They belong to the TroA
superfamily of periplasmic metal binding proteins that
share a distinct fold and ligand binding mechanism. A
typical TroA protein is comprised of two globular
subdomains connected by a single helix and can bind
their ligands in the cleft between these domains.
Length = 270
Score = 32.6 bits (75), Expect = 0.77
Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 1/78 (1%)
Query: 762 GDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQENIKLREAEEKSIRVREAREKAENE 821
G LAE Y K I + D+K + +S Q + K+ EE++ + + + E
Sbjct: 80 GRLAEKYDELKKIAPTIDLGADLKNYLESVKQNIETLGKIFGKEEEAKELVAEIDASIAE 139
Query: 822 KKDKAARK-KALIDMTTD 838
K A K KAL+ +
Sbjct: 140 AKSAAKGKKKALVVLVNG 157
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 32.9 bits (75), Expect = 0.79
Identities = 12/37 (32%), Positives = 12/37 (32%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
PP P PP P P PP PP P
Sbjct: 4 ARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGP 40
Score = 30.2 bits (68), Expect = 4.5
Identities = 12/49 (24%), Positives = 16/49 (32%), Gaps = 8/49 (16%)
Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
++G + R G P G PP PP G+ P P
Sbjct: 48 RAAGPAAPRRRPR--------GCPAGVTFSSSAPPRPPLGLDDAPAATP 88
Score = 29.1 bits (65), Expect = 9.8
Identities = 14/38 (36%), Positives = 14/38 (36%), Gaps = 6/38 (15%)
Query: 558 PKGGPPPPPPPPGGMGPP------PPPMPGMPGPPPPP 589
P P PP P P PP P PG PPP
Sbjct: 9 PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46
>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
Length = 537
Score = 33.0 bits (75), Expect = 0.88
Identities = 9/35 (25%), Positives = 11/35 (31%), Gaps = 4/35 (11%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P P PPPP P P + + E
Sbjct: 79 PIVTPLPPPPSS----PSLPSSLLLDHFRNRSLSE 109
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 33.0 bits (75), Expect = 0.89
Identities = 28/98 (28%), Positives = 33/98 (33%), Gaps = 23/98 (23%)
Query: 507 KLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGI--PTGPKGGPPP 564
K+ E I E A L ++ L + S R K AA P K P
Sbjct: 301 KVVEVI---AETTAPLTPMEELLAKIPSQRV-------PPKESDAADGPKPVPTKPVTPE 350
Query: 565 PPPPPGGMGPPPP----PMPGMP-------GPPPPPMP 591
P PP PP P P P P PP P+P
Sbjct: 351 APSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIP 388
>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
family. Included in this family of heterogeneous
ribonucleoproteins are PTB (polypyrimidine tract binding
protein ) and hnRNP-L. These proteins contain four RNA
recognition motifs (rrm: pfam00067).
Length = 481
Score = 32.9 bits (75), Expect = 0.96
Identities = 15/83 (18%), Positives = 18/83 (21%), Gaps = 17/83 (20%)
Query: 528 TLEDLSSGRPVEKNR--------LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM 579
T DL R ++ L GGP P MGPP P
Sbjct: 189 TNPDLPGRRDPGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMGPPHGPP 248
Query: 580 ---------PGMPGPPPPPMPEM 593
+ P
Sbjct: 249 SRYRPAYEAAPLAPAISSYGPAG 271
>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
protease mycosin. Members of this family are
subtilisin-related serine proteases, found strictly in
the Actinobacteria and associated with type VII
secretion operons. The designation mycosin is used for
members from Mycobacterium [Protein fate, Protein and
peptide secretion and trafficking, Protein fate, Protein
modification and repair].
Length = 350
Score = 32.3 bits (74), Expect = 1.1
Identities = 10/29 (34%), Positives = 13/29 (44%)
Query: 564 PPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
PP + P P P + P PPP P+
Sbjct: 286 LPPEDGRPLRPAPAPARPVAAPAPPPPPD 314
Score = 32.3 bits (74), Expect = 1.2
Identities = 17/42 (40%), Positives = 19/42 (45%), Gaps = 9/42 (21%)
Query: 550 VAA--GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
VAA G P+ G P P P P P+ P PPPPP
Sbjct: 279 VAALTGEL-PPEDGRPLRPAP-----APARPVAA-PAPPPPP 313
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 31.9 bits (73), Expect = 1.1
Identities = 26/124 (20%), Positives = 52/124 (41%), Gaps = 18/124 (14%)
Query: 742 EVRQKITLLSNMSKNMMTLYGDLAE-------------FYTFDKNIYTLEEFFTDIKTFK 788
E R TL S N++ G ++E D+ + L E T +KT+
Sbjct: 82 ESRDHSTLSDIYSNNLVQRLGQMSEDLQRISKKCREIGSQLHDELLRVLNELQTAMKTYH 141
Query: 789 DSFYQAWQENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQQGVMDSL 848
++ KL+EAE++ E +E++ +K +++ K I+ ++ Q ++
Sbjct: 142 TYHAESKSAERKLKEAEKQ----EEKQEQSPEKKLERSRSSK-KIEKEVEKRQAKYSEAK 196
Query: 849 LEAL 852
L+
Sbjct: 197 LKCT 200
>gnl|CDD|202620 pfam03411, Peptidase_M74, Penicillin-insensitive murein
endopeptidase.
Length = 240
Score = 32.2 bits (73), Expect = 1.1
Identities = 12/45 (26%), Positives = 14/45 (31%), Gaps = 8/45 (17%)
Query: 555 PTGPKGGPPPPPPPPGGMGP-------PPPPMPGMPGP-PPPPMP 591
+ P P G G PP P P PPP+P
Sbjct: 184 ADSLECEDQPLVPSGDGCGAELQSWFEPPKPGTTKPEKKTPPPLP 228
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 32.3 bits (73), Expect = 1.2
Identities = 9/42 (21%), Positives = 9/42 (21%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHK 599
P P P P P P P HK
Sbjct: 216 PTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHK 257
Score = 30.4 bits (68), Expect = 4.5
Identities = 10/44 (22%), Positives = 10/44 (22%)
Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
Q A PT P P P PP E
Sbjct: 210 QNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 32.5 bits (74), Expect = 1.3
Identities = 27/138 (19%), Positives = 50/138 (36%), Gaps = 29/138 (21%)
Query: 476 RRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAK-------------LEEAIMLRQEAEAKL 522
R + D+ +V AEK K E++ E + AK L++ + + +++
Sbjct: 511 RLSKADIDRMVNE-AEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDSDK 569
Query: 523 VQAQKTLED----LSSGRPVEKN----RLDEVKA-------QVAAGIPTGPKGGPPPPPP 567
+K +++ L + EK + EV++ ++ G GG P P
Sbjct: 570 ATIEKAIDEALEWLEKNQLAEKEEFEHKQKEVESVCNPIMTKMYQAAGGGMPGGMPGGMP 629
Query: 568 PPGGMGPPPPPMPGMPGP 585
G P GP
Sbjct: 630 GGMPGGAGPAGAGASSGP 647
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 31.5 bits (71), Expect = 1.3
Identities = 20/43 (46%), Positives = 21/43 (48%), Gaps = 5/43 (11%)
Query: 551 AAGIPTGPKGGPPPP---PPPPGGMGP-PPPPMPGMPGPPPPP 589
A G G + PPP PPPPGG G P PG PGP P
Sbjct: 118 AKGEGEGHEPEDPPPEDTPPPPGGEGEVEGGPSPG-PGPGPLD 159
Score = 29.1 bits (65), Expect = 7.3
Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 3/37 (8%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPP---MPGMPGPPPP 588
PT P PPP PP PP + PPP
Sbjct: 54 PTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLALQPPP 90
Score = 28.8 bits (64), Expect = 9.8
Identities = 12/28 (42%), Positives = 12/28 (42%)
Query: 561 GPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
G P PP PPP P PG PP
Sbjct: 48 GYPFCPPTTPHPSSQPPPCPPSPGHPPQ 75
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 32.1 bits (73), Expect = 1.3
Identities = 23/111 (20%), Positives = 36/111 (32%), Gaps = 4/111 (3%)
Query: 483 QPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNR 542
Q EH A + + + E SA+++ + + Q T L +P E+ R
Sbjct: 74 QFTQEHEAARQSPQHQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEAR 133
Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
+ A P P P + P P P P PE+
Sbjct: 134 QPVL--PQEAPAPQPVHSAAPQPAVQT--VQPAVPEQQVQPEEVVEPAPEV 180
>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter. The Epithelial Na+
Channel (ENaC) Family (TC 1.A.06)The ENaC family
consists of sodium channels from animals and has no
recognizable homologues in other eukaryotes or bacteria.
The vertebrate ENaC proteins from epithelial cells
cluster tightly together on the phylogenetic tree:
voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins,
including the degenerins, are distantly related to the
vertebrate proteins as well as to each other. At least
some ofthese proteins form part of a mechano-transducing
complex for touch sensitivity. Other members of the ENaC
family, the acid-sensing ion channels, ASIC1-3,are homo-
or hetero-oligomeric neuronal H+-gated channels that
mediate pain sensation in response to tissue acidosis.
The homologous Helix aspersa(FMRF-amide)-activated Na+
channel is the first peptide neurotransmitter-gated
ionotropic receptor to be sequenced.Mammalian ENaC is
important for the maintenance of Na+ balance and the
regulation of blood pressure. Three homologous ENaC
subunits, a, b and g, havebeen shown to assemble to form
the highly Na+-selective channel.This model is designed
from the vertebrate members of the ENaC family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 595
Score = 32.4 bits (74), Expect = 1.4
Identities = 11/34 (32%), Positives = 13/34 (38%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
T P PP P + P +PG PPP
Sbjct: 557 DTPPSLQLDDPPTFPSALPLPHASGLSLPGTPPP 590
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 31.7 bits (72), Expect = 1.6
Identities = 11/38 (28%), Positives = 15/38 (39%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
I + PP + PP+P +P PP P P
Sbjct: 44 IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPP 81
Score = 31.7 bits (72), Expect = 1.7
Identities = 14/57 (24%), Positives = 21/57 (36%), Gaps = 2/57 (3%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
++ + Q + + G PP P PP + PP P G P P + E
Sbjct: 44 IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPP--LPPPLPVDLGAPVLPDQQVEE 98
>gnl|CDD|227547 COG5222, COG5222, Uncharacterized conserved protein, contains RING
Zn-finger [General function prediction only].
Length = 427
Score = 32.0 bits (72), Expect = 1.7
Identities = 16/40 (40%), Positives = 16/40 (40%), Gaps = 2/40 (5%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
A P P P P G PP PM MP P PPM
Sbjct: 375 ATAEPAFKSAMAIPMPSMPHVQGFPPFPM--MPLPQMPPM 412
>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
Provisional.
Length = 464
Score = 31.8 bits (73), Expect = 1.7
Identities = 10/49 (20%), Positives = 11/49 (22%), Gaps = 2/49 (4%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
D A AA P P P P P P+
Sbjct: 86 DAGAAPAAA--AEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPD 132
Score = 31.4 bits (72), Expect = 2.4
Identities = 7/48 (14%), Positives = 10/48 (20%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
E + A + P P P P P +
Sbjct: 82 ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAA 129
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 31.3 bits (71), Expect = 1.7
Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 19/100 (19%)
Query: 503 DLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRL---------DEVKAQV-AA 552
+ + +EA LR EA+L ++T+ D S P R +E+K+ + A
Sbjct: 45 EFRRQFDEA--LR---EAELDDVRQTISDARSLNPRTSLRQAMNPLRQAGNEIKSDLQKA 99
Query: 553 GIPTGPKGGPPPPPP----PPGGMGPPPPPMPGMPGPPPP 588
+ P P P G+ PP +P P
Sbjct: 100 TSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAV 139
>gnl|CDD|146151 pfam03363, Herpes_LP, Herpesvirus leader protein.
Length = 177
Score = 31.0 bits (70), Expect = 1.9
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKG----GPPPPPPPPGGMGPPPP 577
R V + L E + +V +G P+GP+G P P P P G+GP P
Sbjct: 45 RRVRRRVLLEEEEEVVSGPPSGPRGDPSEAPGPSRPGPPGLGPEGP 90
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 31.9 bits (72), Expect = 2.0
Identities = 12/28 (42%), Positives = 12/28 (42%), Gaps = 1/28 (3%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
PP P P PPPPP PG P
Sbjct: 428 RQPPGAPAPRRDNDPPPPPRA-RPGSTP 454
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 30.9 bits (70), Expect = 2.0
Identities = 14/30 (46%), Positives = 14/30 (46%), Gaps = 5/30 (16%)
Query: 564 PPPPPPGGMGPP-----PPPMPGMPGPPPP 588
P PP PG PP PPP P PPP
Sbjct: 147 PMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176
>gnl|CDD|216534 pfam01499, Herpes_UL25, Herpesvirus UL25 family. The herpesvirus
UL25 gene product is a virion component involved in
virus penetration and capsid assembly. The product of
the UL25 gene is required for packaging but not cleavage
of replicated viral DNA. This family includes a number
of herpesvirus proteins: EHV-1 36, EBV BVRF1, HCMV UL77,
ILTV ORF2, and VZV gene 34.
Length = 538
Score = 31.6 bits (72), Expect = 2.0
Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 14/116 (12%)
Query: 470 PDFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTL 529
D S + A + +T ++ + + L+ E + Q
Sbjct: 21 CDEPSLFETRPR-------FALRLRTRAEQLRLAKA----KKRALKAELDNLEQLHQTRS 69
Query: 530 EDLSSG-RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPG 584
++ + RP+E N+L+EV A + T + P P P G+ G
Sbjct: 70 AEIDADLRPIE-NQLEEV-ADPLDLLETAARAAEPADAAQPDATAGEAAPGEGVGG 123
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 31.0 bits (70), Expect = 2.1
Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 17/81 (20%)
Query: 860 TGSSIKSVGCPSHSALQTGSAFTREQRRKRQN-----------------DRPMGAERRAQ 902
GSSI+S+ + Q+ + ++RK D MG R
Sbjct: 99 PGSSIQSLPSTTGVRPQSSAENANSEKRKLLVIRVQTALVVFGYYGGIIDGIMGEATRVA 158
Query: 903 LNRSRSRNGIVITRELSNEVL 923
LNR + NG+ +T + + L
Sbjct: 159 LNRYQMDNGLPVTGTIDTQTL 179
>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597). This
family of proteins is found in bacteria, eukaryotes and
viruses. Proteins in this family are typically between
126 and 281 amino acids in length. The function of this
domain is unknown. The structure of this domain has been
found to contain five helices with a long flexible loop
between helices one and two.
Length = 124
Score = 30.0 bits (68), Expect = 2.1
Identities = 6/31 (19%), Positives = 7/31 (22%)
Query: 559 KGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
+ P P P P PP
Sbjct: 12 EAAAAAPAPAAAPATAPAAAAAAAPAATPPA 42
Score = 29.6 bits (67), Expect = 3.4
Identities = 5/27 (18%), Positives = 5/27 (18%)
Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPP 588
P P P PP
Sbjct: 17 APAPAAAPATAPAAAAAAAPAATPPAA 43
>gnl|CDD|221172 pfam11701, UNC45-central, Myosin-binding striated muscle assembly
central. The UNC-45 or small muscle protein 1 of
C.elegans is expressed in two forms from different
genomic positions in mammals, as a general tissue
protein UNC-45a and a specific form Unc-45b expressed
only in striated and skeletal muscle. All members carry
up to three amino-terminal tetratricopeptide repeat
(TPR) domains towards their N-terminal, a UCS domain at
the C-terminal that contains a number of Arm repeats
pfam00514 and this central region of approximately 400
residues. Both the general form and the muscle form of
UNC-45 function in myotube formation through cell
fusion. Myofibril formation requires both GC and SM
UNC-45, consistent with the fact that the cytoskeleton
is necessary for the development and maintenance of
organised myofibrils. The S. pombe Rng3p, is crucial for
cell shape, normal actin cytoskeleton, and contractile
ring assembly, and is essential for assembly of the
myosin II-containing progenitors of the contractile
ring. Widespread defects in the cytoskeleton are found
in null mutants of all three fungal proteins. Mammalian
Unc45 is found to act as a specific chaperone during the
folding of myosin and the assembly of striated muscle by
forming a stable complex with the general chaperone
Hsp90. The exact function of this central region is not
known.
Length = 155
Score = 30.3 bits (69), Expect = 2.2
Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 10/63 (15%)
Query: 252 LEAVKVLAAVCLIPPDGHDKVIKAITMSGELKGKERFQPVVQGLMVKGNNEALRTACLQL 311
+ AV L A+ PPD V + +S E F + L+ + L+ A L+L
Sbjct: 61 INAVSALTALFPGPPD----VGATLFLS------EGFLESLLPLVARSERRKLQLAALEL 110
Query: 312 INA 314
++A
Sbjct: 111 LSA 113
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 31.1 bits (70), Expect = 2.2
Identities = 18/85 (21%), Positives = 24/85 (28%), Gaps = 10/85 (11%)
Query: 515 RQEAEAKLVQAQKTLED--LSSGRPVEKNR-LDEVKAQVAAGIPTGPKGGPPPPPPPPGG 571
R +AE + + D +G E R D ++ P GP P G
Sbjct: 54 RSQAERRFHHYDEARADYPYYTGSSSEDERPADPRPSRRPHAQPEASGPGPARGARGPAG 113
Query: 572 MG-------PPPPPMPGMPGPPPPP 589
P P P P P
Sbjct: 114 SRGRGRRAESPSPRDPPNPKGASAP 138
>gnl|CDD|214545 smart00182, CULLIN, Cullin.
Length = 143
Score = 30.4 bits (69), Expect = 2.3
Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 7/52 (13%)
Query: 8 EEQENMLEKLNPE---QLNQKFEDMLNDMNLS----DEKKEPLRRQPLANKK 52
+ +ENM+ KL E + K E M D++LS K+ L P A
Sbjct: 29 DAEENMITKLKQECGYEFTSKLERMFRDISLSKDLNQSFKDMLENNPSAKPI 80
Score = 29.6 bits (67), Expect = 3.9
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 7/51 (13%)
Query: 70 KQENMLEKLNPE---QLNQKFEDMLNDMNLS----DEKKEPLRRQPLANKK 113
+ENM+ KL E + K E M D++LS K+ L P A
Sbjct: 30 AEENMITKLKQECGYEFTSKLERMFRDISLSKDLNQSFKDMLENNPSAKPI 80
>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis,
outer membrane].
Length = 284
Score = 31.3 bits (71), Expect = 2.4
Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 9/46 (19%)
Query: 555 PTGPKGGPPPPPPPPG-GMGP-------PPPPMPGMPGPP-PPPMP 591
P G PPG G G PPPP P P PPP+P
Sbjct: 222 PAGSLECEDQALVPPGDGCGKELASWFEPPPPGTTKPEPKAPPPLP 267
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 31.0 bits (69), Expect = 2.4
Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 4/42 (9%)
Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMPGPPPPP 589
A PT P P P P PP PP P +P P P P
Sbjct: 119 ATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAP 160
Score = 30.2 bits (67), Expect = 4.2
Identities = 17/47 (36%), Positives = 18/47 (38%), Gaps = 11/47 (23%)
Query: 556 TGPKGGPPPPP------PPPGGMGPPPPPMPGMPG-----PPPPPMP 591
T P P PP PPP P P P P PP PP+P
Sbjct: 108 TCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLP 154
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.4 bits (72), Expect = 2.4
Identities = 13/57 (22%), Positives = 14/57 (24%)
Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P EV Q AA + P P PPP P P
Sbjct: 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLP 416
>gnl|CDD|219056 pfam06485, DUF1092, Protein of unknown function (DUF1092). This
family consists of several hypothetical proteins of
unknown function all from photosynthetic organisms
including plants and cyanobacteria.
Length = 270
Score = 31.1 bits (71), Expect = 2.5
Identities = 13/27 (48%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 566 PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
P PG M PPP+ PPP P+PE
Sbjct: 109 PQEPGYMALAPPPVALDK-PPPQPLPE 134
>gnl|CDD|220871 pfam10759, DUF2587, Protein of unknown function (DUF2587). This is
a bacterial family of proteins with no known function.
Length = 168
Score = 30.5 bits (69), Expect = 2.5
Identities = 10/27 (37%), Positives = 11/27 (40%)
Query: 568 PPGGMGPPPPPMPGMPGPPPPPMPEMY 594
P G+G PP P PPP Y
Sbjct: 141 LPPGVGIAPPGQPQGARGGPPPGTGQY 167
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 31.3 bits (71), Expect = 2.5
Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
EV++ +A G P P P PP P P PP +
Sbjct: 373 EVRSLRSAPTAAATAAGAPLPDFDPRP-RGPPAPEPARSAEAPPLVA 418
Score = 30.2 bits (68), Expect = 6.4
Identities = 15/73 (20%), Positives = 17/73 (23%), Gaps = 3/73 (4%)
Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
A + +EK A AA G P P P P P P
Sbjct: 356 PAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGA---PLPDFDPRPRGPPAPEPARSAE 412
Query: 584 GPPPPPMPEMYTD 596
PP
Sbjct: 413 APPLVAPAAAPAG 425
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.3 bits (71), Expect = 2.6
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 793 QAWQENIKLREAEEKSIRVREAREKAENEKKDKA-ARKKA 831
QA +E + A E + EA++KAE E K KA A+ KA
Sbjct: 151 QAEEEAKA-KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKA 189
>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region. The region
described in this family is found towards the N-terminus
of various eukaryotic fork head/HNF-3-related
transcription factors (which contain the pfam00250
domain). These proteins play key roles in embryogenesis,
maintenance of differentiated cell states, and
tumorigenesis.
Length = 137
Score = 29.9 bits (67), Expect = 2.6
Identities = 14/48 (29%), Positives = 14/48 (29%), Gaps = 4/48 (8%)
Query: 551 AAGIPTGPKGGPPPPPP--PPGGMGP--PPPPMPGMPGPPPPPMPEMY 594
AG P G G P G MG P P G P M
Sbjct: 69 GAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRAS 116
Score = 29.6 bits (66), Expect = 4.3
Identities = 6/43 (13%), Positives = 7/43 (16%)
Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
+ P G M GM M
Sbjct: 32 TSQGSANSSMNMSGYAGPGAMNGMSSSSMNGMSPGYGGAGSPM 74
>gnl|CDD|99790 cd06193, siderophore_interacting, Siderophore interacting proteins
share the domain structure of the ferredoxin reductase
like family. Siderophores are produced in various
bacteria (and some plants) to extract iron from hosts.
Binding constants are high, so iron can be pilfered from
transferrin and lactoferrin for bacterial uptake,
contributing to pathogen virulence. Ferredoxin reductase
(FNR), an FAD and NAD(P) binding protein, was intially
identified as a chloroplast reductase activity,
catalyzing the electron transfer from reduced
iron-sulfur protein ferredoxin to NADP+ as the final
step in the electron transport mechanism of photosystem
I. FNR transfers electrons from reduced ferredoxin to
FAD (forming FADH2 via a semiquinone intermediate) and
then transfers a hydride ion to convert NADP+ to NADPH.
FNR has since been shown to utilize a variety of
electron acceptors and donors and has a variety of
physiological functions including nitrogen assimilation,
dinitrogen fixation, steroid hydroxylation, fatty acid
metabolism, oxygenase activity, and methane assimilation
in a variety of organisms. FNR has an NAD(P)-binding
sub-domain of the alpha/beta class and a discrete
(usually N-terminal) flavin sub-domain which vary in
orientation with respect to the NAD(P) binding domain.
The N-terminal moeity may contain a flavin prosthetic
group (as in flavoenzymes) or use flavin as a substrate.
Because flavins such as FAD can exist in oxidized,
semiquinone (one-electron reduced), or fully reduced
hydroquinone forms, FNR can interact with one and two
electron carriers. FNR has a strong preference for
NADP(H) vs NAD(H).
Length = 235
Score = 30.7 bits (70), Expect = 2.6
Identities = 12/31 (38%), Positives = 13/31 (41%), Gaps = 2/31 (6%)
Query: 567 PPPGGMGPPPPPMPGMPGPPPPPMPEM--YT 595
P PG P P + PP P P M YT
Sbjct: 38 PDPGQAPPVLPVLGRRRWPPEEPRPVMRTYT 68
>gnl|CDD|188647 cd05396, An_peroxidase_like, Animal heme peroxidases and related
proteins. A diverse family of enzymes, which includes
prostaglandin G/H synthase, thyroid peroxidase,
myeloperoxidase, linoleate diol synthase,
lactoperoxidase, peroxinectin, peroxidasin, and others.
Despite its name, this family is not restricted to
metazoans: members are found in fungi, plants, and
bacteria as well.
Length = 370
Score = 31.2 bits (71), Expect = 2.7
Identities = 14/60 (23%), Positives = 20/60 (33%), Gaps = 4/60 (6%)
Query: 755 KNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQA----WQENIKLREAEEKSIR 810
K + F D Y L EFFT F S EN + +E + ++
Sbjct: 138 KFTDPRDDLVLLFPDPDVVPYVLSEFFTAAYRFGHSLVPEGVDRIDENGQPKEIPDVPLK 197
>gnl|CDD|214009 cd12810, Esterase_713_like-3, Uncharacterized enzymes similar to
novel bacterial esterase that cleaves esters on
halogenated cyclic compounds. This family contains
uncharacterized proteins similar to a novel bacterial
esterase (Alcaligenes esterase 713) with the alpha/beta
hydrolase fold but does not contain the GXSXXG
pentapeptide around the active site serine residue as
commonly seen in other enzymes of this class. Esterase
713 shows negligible sequence homology to other esterase
and lipase enzymes. It is active as a dimer and cleaves
esters on halogenated cyclic compounds though its
natural substrate is unknown.
Length = 328
Score = 31.0 bits (71), Expect = 2.8
Identities = 24/77 (31%), Positives = 25/77 (32%), Gaps = 28/77 (36%)
Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIIN 603
D VKA VA P GGP PP PPPP P GP P +
Sbjct: 197 DNVKAIVA----YEPGGGPFPPG------EEPPPPPPSAFGPLGPVGVPLE--------- 237
Query: 604 GKQACEEVKQSKKLAKI 620
KL KI
Sbjct: 238 ---------DFLKLTKI 245
>gnl|CDD|177663 PLN00026, PLN00026, aquaporin NIP; Provisional.
Length = 298
Score = 31.0 bits (70), Expect = 2.9
Identities = 7/23 (30%), Positives = 8/23 (34%)
Query: 568 PPGGMGPPPPPMPGMPGPPPPPM 590
P P P PG P P+
Sbjct: 1 MDNEEVPSAPSTPATPGTPGGPL 23
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough protein
is thought to play a role in the circulative aphid
transmission of potato leaf roll virus. Also in the
family is open reading frame 6 from beet western yellows
virus and potato leaf roll virus both luteovirus and an
unknown protein from cucurbit aphid-borne yellows virus
a closterovirus.
Length = 460
Score = 31.2 bits (71), Expect = 3.1
Identities = 14/24 (58%), Positives = 14/24 (58%), Gaps = 3/24 (12%)
Query: 557 GPKGGPPPPPPPPGGMGPPPPPMP 580
GPK GP P PPPP P P P P
Sbjct: 6 GPKPGPTPTPPPP---APTPEPTP 26
Score = 30.5 bits (69), Expect = 4.8
Identities = 14/33 (42%), Positives = 14/33 (42%), Gaps = 7/33 (21%)
Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
G P P P P P PPP P P P P P
Sbjct: 2 DGEPGPKPGP---TPTPPP----PAPTPEPTPA 27
>gnl|CDD|233515 TIGR01659, sex-lethal, sex-lethal family splicing factor. This
model describes the sex-lethal family of splicing
factors found in Dipteran insects. The sex-lethal
phenotype, however, may be limited to the Melanogasters
and closely related species. In Drosophila the protein
acts as an inhibitor of splicing. This subfamily is most
closely related to the ELAV/HUD subfamily of splicing
factors (TIGR01661).
Length = 346
Score = 30.8 bits (69), Expect = 3.2
Identities = 8/21 (38%), Positives = 8/21 (38%)
Query: 553 GIPTGPKGGPPPPPPPPGGMG 573
G G PP PP MG
Sbjct: 298 GNMGMAGGSGMNPPNPPAFMG 318
>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
Length = 427
Score = 30.8 bits (70), Expect = 3.4
Identities = 11/34 (32%), Positives = 11/34 (32%)
Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
G PP PG P P G P P P
Sbjct: 8 SAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGRG 41
Score = 30.4 bits (69), Expect = 5.0
Identities = 8/36 (22%), Positives = 8/36 (22%)
Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
P G PP P P P P
Sbjct: 5 DPFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGR 40
Score = 30.4 bits (69), Expect = 5.0
Identities = 8/32 (25%), Positives = 8/32 (25%), Gaps = 4/32 (12%)
Query: 564 PPPPPPGGMGPPPP----PMPGMPGPPPPPMP 591
P GG PP P P P
Sbjct: 6 PFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPR 37
Score = 29.2 bits (66), Expect = 9.7
Identities = 8/36 (22%), Positives = 8/36 (22%), Gaps = 4/36 (11%)
Query: 565 PPPPPGGMGPPPPP----MPGMPGPPPPPMPEMYTD 596
P G PP P P P P
Sbjct: 6 PFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGRG 41
>gnl|CDD|227444 COG5113, UFD2, Ubiquitin fusion degradation protein 2
[Posttranslational modification, protein turnover,
chaperones].
Length = 929
Score = 31.1 bits (70), Expect = 3.4
Identities = 29/154 (18%), Positives = 53/154 (34%), Gaps = 18/154 (11%)
Query: 402 PYLLSILQHLLFIRDDQNMVMDSACEPYLLSILQH-LLFIRDDQYVRLAYYKLLEECVSQ 460
P L L + L + D+ M M+S + I + +F R +A+Y +E
Sbjct: 549 PMLNRKLDYYLSLGRDE-MRMES--RSIIHDIFKEGKVFSRWLLPALMAFYIEIESTGQS 605
Query: 461 IVLHRGGCDPDFRSSR-------RFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIM 513
+ D + R F+ E L+ T V+ + L +
Sbjct: 606 TQFY------DKFNIRFIICMMKDFEYKQPSYSEGLSSIKDTNLPFFVKFDAKMLNDLTR 659
Query: 514 LRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVK 547
L EA +LV+ ++ L + N + +
Sbjct: 660 LLDEALKELVEEHN-IQSLLADAISNSNISERIG 692
>gnl|CDD|206165 pfam13995, YebF, YebF-like protein. The YebF-like protein family
appears to be a group of colicin immunity proteins. As
well as YebF the family includes cmi, the colicin M
immunity protein. This domain family is found in
bacteria, and is approximately 80 amino acids in length.
The alignment contains two conserved cysteine residues
that form a disulphide bond in the solved structure.
Length = 89
Score = 28.8 bits (65), Expect = 3.5
Identities = 20/58 (34%), Positives = 27/58 (46%), Gaps = 10/58 (17%)
Query: 674 KFPECLKFGDELLHVDRAARVSTDVIQNSIRQMENNIKNLETDIQNCKQAPVANENDK 731
KFP C D L + AA+V D +QN I + +++ K L T PVA N
Sbjct: 4 KFPSC----DGLTYEQVAAQVKRDFLQNRIPRWDDDKKLLGTAK------PVAWVNVN 51
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 31.2 bits (70), Expect = 3.5
Identities = 14/57 (24%), Positives = 17/57 (29%), Gaps = 17/57 (29%)
Query: 553 GIPTGPKGGPPPPPPPPGGMGPP-----------------PPPMPGMPGPPPPPMPE 592
G+ + P P P G + P P PMP P P P E
Sbjct: 486 GVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALE 542
>gnl|CDD|217044 pfam02451, Nodulin, Nodulin. Nodulin is a plant protein of unknown
function. It is induced during nodulation in legume
roots after rhizobium infection.
Length = 177
Score = 30.0 bits (67), Expect = 3.6
Identities = 26/116 (22%), Positives = 38/116 (32%), Gaps = 7/116 (6%)
Query: 475 SRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEE---AIMLRQEAEAKLVQAQKTLED 531
S +F+ V H+AE T + + A L E L + +L D
Sbjct: 6 SPKFKKFVTHCTSHVAETCSTTDPSHGSEALHNQGPLGLAFCLFDSMEKCLADHKASLVD 65
Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
LS P ++LD ++ I P PP P + P P P
Sbjct: 66 LSLFFPRRSSKLDPHQSL-PGSIQLPP---IPPKSPFEPSISRVIPWPPIQPRDQG 117
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 31.2 bits (71), Expect = 3.6
Identities = 29/213 (13%), Positives = 74/213 (34%), Gaps = 4/213 (1%)
Query: 633 SGSRNGGAFGFEINFLTKLSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAA 692
G GG+ + L + +++E K L + +E+ E K +EL
Sbjct: 658 GGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL 717
Query: 693 RVSTDVIQNSIRQMENNIKNLETDIQNCKQ--APVANENDKFLEIMEPFAKEVRQKITLL 750
R + + I + ++ LE +++ ++ A ++ E + +E + + + L
Sbjct: 718 RKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777
Query: 751 SNMSKNMMTLYGDLAEFYT-FDKNIYTLEEFFTDIKTFKDSFYQAWQENIKL-REAEEKS 808
+ + L + + L+E ++ + + L R
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837
Query: 809 IRVREAREKAENEKKDKAARKKALIDMTTDQTQ 841
R+ + E+ E +D + + ++ +
Sbjct: 838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
Score = 30.4 bits (69), Expect = 5.1
Identities = 13/50 (26%), Positives = 27/50 (54%)
Query: 501 VEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQV 550
+E L A+LEE +E E++L + ++ LE L S + ++ + ++
Sbjct: 353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEI 402
Score = 29.6 bits (67), Expect = 9.1
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 4/60 (6%)
Query: 490 AEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQ 549
AE+ E +++L KLEE + E E ++ + QK L L++ E +RL++ K
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN----EISRLEQQKQI 306
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 30.4 bits (69), Expect = 3.6
Identities = 15/46 (32%), Positives = 18/46 (39%), Gaps = 1/46 (2%)
Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM-PGMPGPPPPPMPE 592
V AG P P PP + P P P+ P P P P P+
Sbjct: 73 EAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPK 118
Score = 30.0 bits (68), Expect = 4.5
Identities = 16/48 (33%), Positives = 18/48 (37%), Gaps = 1/48 (2%)
Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
V+A AA P PP P P PP P P P P P+
Sbjct: 74 AVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKP-KPVEKPKPKPK 120
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 30.5 bits (69), Expect = 3.7
Identities = 8/34 (23%), Positives = 10/34 (29%), Gaps = 4/34 (11%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
P P PP + P P P+ M
Sbjct: 143 PAAPEPP----SYEETIKPGPAPVEEPVDSMAIA 172
>gnl|CDD|152747 pfam12312, NeA_P2, Nepovirus subgroup A polyprotein. This family
of proteins is found in viruses. Proteins in this family
are typically between 259 and 1110 amino acids in
length. The family is found in association with
pfam03688, pfam03689, pfam03391. This family is one of
the polyproteins expressed by Nepoviruses in subgroup A.
Length = 175
Score = 29.9 bits (67), Expect = 3.9
Identities = 12/19 (63%), Positives = 12/19 (63%), Gaps = 2/19 (10%)
Query: 559 KGGP--PPPPPPPGGMGPP 575
GGP PPPPPPP PP
Sbjct: 113 PGGPCLPPPPPPPPIQKPP 131
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 30.8 bits (69), Expect = 3.9
Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 3/111 (2%)
Query: 476 RRFQLDVQPLVEHLAE---KSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDL 532
+R +L+ + + E K++ E R L A+ L E +A L + L
Sbjct: 305 QRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKE 364
Query: 533 SSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
+ E +L A + + T K P PP MGPPP P P P
Sbjct: 365 LEAKKREAEQLRMELAIRISALDTCIKAKSLPAMPPSRPMGPPPNPPPIDP 415
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 30.8 bits (69), Expect = 4.1
Identities = 22/69 (31%), Positives = 25/69 (36%), Gaps = 4/69 (5%)
Query: 523 VQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGM 582
AQ+ S P LD K+ P PK PPPP P P P +
Sbjct: 601 RSAQRPTRPKSPKLP---ELLDIPKSPKRPESPKSPKR-PPPPQRPSSPERPEGPKIIKS 656
Query: 583 PGPPPPPMP 591
P PP P P
Sbjct: 657 PKPPKSPKP 665
>gnl|CDD|99780 cd06183, cyt_b5_reduct_like, Cytochrome b5 reductase catalyzes the
reduction of 2 molecules of cytochrome b5 using NADH as
an electron donor. Like ferredoxin reductases, these
proteins have an N-terminal FAD binding subdomain and a
C-terminal NADH binding subdomain, separated by a cleft,
which accepts FAD. The NADH-binding moiety interacts
with part of the FAD and resembles a Rossmann fold.
However, NAD is bound differently than in canonical
Rossmann fold proteins. Nitrate reductases,
flavoproteins similar to pyridine nucleotide cytochrome
reductases, catalyze the reduction of nitrate to
nitrite. The enzyme can be divided into three functional
fragments that bind the cofactors molybdopterin,
heme-iron, and FAD/NADH.
Length = 234
Score = 30.2 bits (69), Expect = 4.1
Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 10/59 (16%)
Query: 306 TACLQLINAIVATPDDLEFRLHL------RNEIMRVGLYDLLDALEKDASEDVSVQLKV 358
T LQLI AI+ P+D ++ L +I+ L + LD L K + V +
Sbjct: 117 TPMLQLIRAILKDPEDKT-KISLLYANRTEEDIL---LREELDELAKKHPDRFKVHYVL 171
>gnl|CDD|184753 PRK14581, hmsF, outer membrane N-deacetylase; Provisional.
Length = 672
Score = 30.7 bits (69), Expect = 4.1
Identities = 26/107 (24%), Positives = 43/107 (40%), Gaps = 17/107 (15%)
Query: 736 MEPFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAW 795
+ PF EVRQ+I + +Y D+A D IY + +D F A
Sbjct: 435 LSPFNPEVRQRI----------IDIYRDMAYSAPIDGIIYHDDAVMSD-------FEDAS 477
Query: 796 QENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQQ 842
+ I+ E + R+ E ++ + K LID T + T++
Sbjct: 478 PDAIRAYEKAGFPGSITTIRQDPEMMQRWTRYKSKYLIDFTNELTRE 524
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 30.2 bits (68), Expect = 4.1
Identities = 15/48 (31%), Positives = 16/48 (33%), Gaps = 1/48 (2%)
Query: 546 VKAQVAAGIPTGPK-GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
V Q A P G P P + PP P G P P PE
Sbjct: 89 VDPQPVAQPPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPE 136
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 30.7 bits (70), Expect = 4.2
Identities = 21/67 (31%), Positives = 24/67 (35%), Gaps = 7/67 (10%)
Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIINGKQACEEV--K 612
T P PPPPPP P PPPP+PE K + E K
Sbjct: 545 ATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPE-----EPTPSPTKDSSPEEIDK 599
Query: 613 QSKKLAK 619
+K LA
Sbjct: 600 AAKNLAD 606
Score = 29.5 bits (67), Expect = 8.9
Identities = 15/42 (35%), Positives = 16/42 (38%)
Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
+ A P K PPP P PP P P P PPP
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPT 558
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 30.6 bits (69), Expect = 4.3
Identities = 27/133 (20%), Positives = 41/133 (30%), Gaps = 32/133 (24%)
Query: 477 RFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGR 536
+F++D++P K K E E+ A+ KL+ + K L D
Sbjct: 12 KFKIDMEPDETVKELKEKIEA-----------EQGKDAYPVAQQKLIYSGKILSDDK--- 57
Query: 537 PVEKNRLDE-------------VKAQVAAGIPT---GPKGGPPPPPPPPGGMGPPPPP-- 578
V++ ++ E +VA T P P PP P GM P
Sbjct: 58 TVKEYKIKEKDFVVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAV 117
Query: 579 MPGMPGPPPPPMP 591
P
Sbjct: 118 EEKSPSEESATAT 130
>gnl|CDD|139164 PRK12700, flgH, flagellar basal body L-ring protein; Reviewed.
Length = 230
Score = 30.0 bits (67), Expect = 4.3
Identities = 10/16 (62%), Positives = 10/16 (62%)
Query: 556 TGPKGGPPPPPPPPGG 571
TGP PPPPPP P
Sbjct: 32 TGPLTAPPPPPPQPSA 47
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.5 bits (69), Expect = 4.3
Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 15/89 (16%)
Query: 803 EAEEKSIRVREAREKAENEKKDKAAR-------------KKALIDMTTDQTQQGVMDSLL 849
EAE K EA++KA E K AA+ + +D L
Sbjct: 204 EAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLF 263
Query: 850 EALQTGR--PKKTGSSIKSVGCPSHSALQ 876
L +G+ PK G + + + +
Sbjct: 264 GGLDSGKNAPKTGGGAKGNGAQGAGAGNG 292
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 30.7 bits (70), Expect = 4.4
Identities = 22/104 (21%), Positives = 40/104 (38%), Gaps = 3/104 (2%)
Query: 738 PFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQE 797
K R+ + LS + K+ L LA F + EF T K + ++
Sbjct: 362 SKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEE---EIEFLTGSKKATKKIKKIVEK 418
Query: 798 NIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQ 841
K RE E+K + + K + E++++ KK ++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 30.5 bits (70), Expect = 4.6
Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 8/66 (12%)
Query: 493 SKTEEDRRVEDLSA------KLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEV 546
S E +R V+D A K +E + R +A++ + Q +KTL++L G V + +++
Sbjct: 501 SDEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKEL--GDKVPADEKEKI 558
Query: 547 KAQVAA 552
+A +
Sbjct: 559 EAAIKE 564
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 30.4 bits (69), Expect = 4.6
Identities = 16/65 (24%), Positives = 27/65 (41%)
Query: 488 HLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVK 547
A+ D L+A + +QE E L Q K L+ + + +LDE++
Sbjct: 60 QQAQNQTATNDALANQLTALQKAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQ 119
Query: 548 AQVAA 552
+VA
Sbjct: 120 QKVAT 124
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 30.4 bits (69), Expect = 4.6
Identities = 36/184 (19%), Positives = 59/184 (32%), Gaps = 36/184 (19%)
Query: 638 GGAFGFEINFLTKLSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTD 697
GGA I T K+ + L+D + K L ++
Sbjct: 432 GGA---RIKG-TIELPFKE------VCEKLLDKDKNKPFIKLILLSNNSQEKAVLKL--- 478
Query: 698 VIQNSIRQMENNIKNLETDIQNCKQAPVANE---------NDKFLEIMEPFAKEVRQKI- 747
I++ + IK ETD ++ + + K I E F + +
Sbjct: 479 --LQKIKKNNDFIKEFETDALKLQEILEKVDSKSEKLEKISAKIDNIKELFDESKMSILQ 536
Query: 748 -TLLSNMSKNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQENI-KLREAE 805
L + + +A Y NI E+F K + + + W E I EA+
Sbjct: 537 EILQPALHHIELN----IARIYKL--NIKDKEDFL--NKLYIKA-HLEWLEIIYGKLEAQ 587
Query: 806 EKSI 809
KSI
Sbjct: 588 IKSI 591
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 29.9 bits (68), Expect = 4.7
Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 16/71 (22%)
Query: 473 RSSRRFQLDVQPLVEHLAEKSKTEEDR--------------RVEDLSAKLEEAIMLRQEA 518
RS R ++ L E AEK+ E +R ++E L A+L+EA + +EA
Sbjct: 54 RSEERLATALEKLEE--AEKAADESERGRKVLENRSLKDDEKMEQLEAQLKEAKEIAEEA 111
Query: 519 EAKLVQAQKTL 529
+ K + + L
Sbjct: 112 DRKYEEVARKL 122
>gnl|CDD|236522 PRK09450, cyaA, adenylate cyclase; Provisional.
Length = 830
Score = 30.7 bits (70), Expect = 4.8
Identities = 17/80 (21%), Positives = 30/80 (37%), Gaps = 6/80 (7%)
Query: 659 NKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVI---QNSIRQMENNIKNLET 715
N+ LH+ EQ + LK +H D A S +V Q+ + ++ L +
Sbjct: 605 NEVRTLHF---EGEQALLDALKTILGKMHQDAAPPPSVEVFCYSQHLRGLIRTRVQQLVS 661
Query: 716 DIQNCKQAPVANENDKFLEI 735
+ + E K L +
Sbjct: 662 ECIELRLGSTRQEPGKALRV 681
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 30.5 bits (69), Expect = 5.0
Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 22/201 (10%)
Query: 644 EINFLTKLSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVIQNSI 703
+++ L K + L L + +E +EL + + + + + +
Sbjct: 171 KLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEEL 230
Query: 704 RQMENNIKNLETDIQNCKQ------APVANENDKFLEIMEPFAKEVRQKITLLS---NMS 754
Q ++ +++ K+ A + LE ++ +E+R+ LL
Sbjct: 231 EQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKI 290
Query: 755 KNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQENIKLREAEEKSIRVREA 814
+ + L ++ E E + + + KL+ EE+ ++ E
Sbjct: 291 ERLEELEREIEELEE-------ELEGLRALLEELEELLE------KLKSLEERLEKLEEK 337
Query: 815 REKAENEKKDKAARKKALIDM 835
EK E+E ++ A K L +
Sbjct: 338 LEKLESELEELAEEKNELAKL 358
>gnl|CDD|219944 pfam08633, Rox3, Rox3 mediator complex subunit. The mediator
complex is part of the RNA polymerase II holoenzyme.
Rox3 is a subunit of the mediator complex.
Length = 210
Score = 29.8 bits (67), Expect = 5.1
Identities = 12/68 (17%), Positives = 16/68 (23%), Gaps = 7/68 (10%)
Query: 522 LVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPP-------PPGGMGP 574
L+ ++ G + V Q P G P G G
Sbjct: 97 LLSELSRARTMARGTVPNMDWCSSVLGQFERSYPNGFLNSAGFAVPDLAFDLDGTQGNGR 156
Query: 575 PPPPMPGM 582
P PG
Sbjct: 157 PKGGNPGA 164
>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p. In Saccharomyces
cerevisiae, Sec2p is a GDP/GTP exchange factor for
Sec4p, which is required for vesicular transport at the
post-Golgi stage of yeast secretion.
Length = 90
Score = 28.1 bits (63), Expect = 5.8
Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 8/51 (15%)
Query: 489 LAEKSKTEEDRRVEDLSAKL--------EEAIMLRQEAEAKLVQAQKTLED 531
AEK K + ++ VEDL+A L +A R+ E K + ++ L++
Sbjct: 12 RAEKEKKKLEQEVEDLTASLFDEANKMVADARREREAVEIKNEKLEEQLKE 62
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 30.2 bits (68), Expect = 5.9
Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 2/63 (3%)
Query: 490 AEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQ 549
A+ + R L A L + + A+A+L +A K L ++ K R +E + +
Sbjct: 209 AKTAAAAAAREAAPLKASLRKLERAKARADAELKRADKAL--AAAKTDEAKARAEERQQK 266
Query: 550 VAA 552
A
Sbjct: 267 AAQ 269
>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 211
Score = 29.4 bits (66), Expect = 6.0
Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%)
Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPG 570
++ P E + LDE A AA + + P P P P P
Sbjct: 88 DEPRRTEPQEPDPLDESPAS-AAPVASAPAPAPSPQSPKPA 127
>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
Provisional.
Length = 445
Score = 30.2 bits (68), Expect = 6.1
Identities = 19/72 (26%), Positives = 27/72 (37%), Gaps = 10/72 (13%)
Query: 520 AKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAG---IPTGPKGGPPPPPPPP--GGMGP 574
K Q + + DL+ E + + VK Q + + T PPPPPPPP
Sbjct: 90 PKDAQTLRLVVDLT-----ENGKTEAVKRQNGSNYTVVFTINADVPPPPPPPPVVAKRVE 144
Query: 575 PPPPMPGMPGPP 586
P + P
Sbjct: 145 TPAVVAPRVSEP 156
>gnl|CDD|216175 pfam00888, Cullin, Cullin family.
Length = 603
Score = 30.3 bits (68), Expect = 6.2
Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 3/31 (9%)
Query: 11 ENMLEKL---NPEQLNQKFEDMLNDMNLSDE 38
+ M+EKL Q K E M D++LS E
Sbjct: 413 KKMIEKLKQECGYQFTSKLERMFKDISLSKE 443
Score = 30.3 bits (68), Expect = 6.2
Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 3/31 (9%)
Query: 72 ENMLEKL---NPEQLNQKFEDMLNDMNLSDE 99
+ M+EKL Q K E M D++LS E
Sbjct: 413 KKMIEKLKQECGYQFTSKLERMFKDISLSKE 443
>gnl|CDD|236098 PRK07789, PRK07789, acetolactate synthase 1 catalytic subunit;
Validated.
Length = 612
Score = 29.9 bits (68), Expect = 6.8
Identities = 10/32 (31%), Positives = 10/32 (31%)
Query: 564 PPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
P PP P PP P PE T
Sbjct: 1 PSPPTPAAAASAAPPPAAPAARPRIVAPERMT 32
>gnl|CDD|183198 PRK11560, PRK11560, phosphoethanolamine transferase; Provisional.
Length = 558
Score = 30.0 bits (68), Expect = 6.9
Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 8/57 (14%)
Query: 91 LNDMNLSDEKKEPLRRQPLANKKKMLLMHYKGTVTSYENKSKFDKPIEYIQYLSQPE 147
++DM L DE K+ L R P + K ++++H KG+ +Y + P + +Y QPE
Sbjct: 369 VDDMLLVDEMKQSLGRNP--DGKHLIILHTKGSHYNYTQRY----PRSFARY--QPE 417
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.0 bits (68), Expect = 7.1
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 479 QLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSG--- 535
LD+Q L ++TE R++ L+ A+ +++AE +LVQA LE+
Sbjct: 587 SLDLQRLDVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAE 646
Query: 536 --RPVEKNRLDEVKAQVA 551
+++ RLD + Q
Sbjct: 647 ARTALKQARLDLQRLQNE 664
>gnl|CDD|234730 PRK00339, minC, septum formation inhibitor; Reviewed.
Length = 249
Score = 29.4 bits (66), Expect = 7.1
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 2/51 (3%)
Query: 538 VEKNRLDEVKAQVAAGIPTGPKGGPPPPP--PPPGGMGPPPPPMPGMPGPP 586
+ +R++++ A +AA +P P G P P P P P P P P
Sbjct: 88 IRASRIEDIAAAIAADLPVLPPSGARERPLEPSPAEAEPEPAKAPEKPPEP 138
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 28.9 bits (65), Expect = 7.2
Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 3/71 (4%)
Query: 67 ENKKQENMLEKLNPEQLNQKFEDMLNDMNLSDEK---KEPLRRQPLANKKKMLLMHYKGT 123
E ++ E M EK E +++F+ + DE+ K+ +RQ KKK KG
Sbjct: 48 EYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGN 107
Query: 124 VTSYENKSKFD 134
+ SK
Sbjct: 108 KKEEKEGSKSS 118
>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form. The
Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
2.A.36) The CPA1 family is a large family of proteins
derived from Gram-positive and Gram-negative bacteria,
blue green bacteria, yeast, plants and animals.
Transporters from eukaryotes have been functionally
characterized, and all of these catalyze Na+:H+
exchange. Their primary physiological functions may be
in (1) cytoplasmic pH regulation, extruding the H+
generated during metabolism, and (2) salt tolerance (in
plants), due to Na+ uptake into vacuoles. This model is
specific for the bacterial members of this family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 525
Score = 29.9 bits (67), Expect = 7.2
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 500 RVEDLSAKLEEAIMLR-QEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQV 550
V + S + E I+ R A + K +E L PV + L E+ A++
Sbjct: 408 FVSEHSERELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDARI 459
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 30.0 bits (68), Expect = 7.4
Identities = 15/45 (33%), Positives = 24/45 (53%)
Query: 490 AEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSS 534
AEK K++ V+DL A+L++ + AE K Q + L +L
Sbjct: 150 AEKEKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQV 194
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 30.0 bits (67), Expect = 7.8
Identities = 12/40 (30%), Positives = 13/40 (32%)
Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGP 585
A V G P PPP P + PP P P
Sbjct: 365 APAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEINP 404
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 29.3 bits (66), Expect = 7.8
Identities = 12/30 (40%), Positives = 14/30 (46%), Gaps = 5/30 (16%)
Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
PP PPP P+ P P P P+PE
Sbjct: 59 EPPQAVQ----PPPEPVV-EPEPEPEPIPE 83
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 30.0 bits (67), Expect = 7.8
Identities = 28/186 (15%), Positives = 50/186 (26%), Gaps = 36/186 (19%)
Query: 470 PDFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTL 529
S ++ + L + L K +E+ ++ ++ + + +
Sbjct: 286 GSSDESPEWKTFYEALADQLNNLYKLLRTIYKHKDETVIEQYLIEGRKLFSTINGLKAHN 345
Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGP------------PPPPPPPGGMGPPPP 577
E L + +R+ A+VA GP P P M PP
Sbjct: 346 EILKTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPP 405
Query: 578 ------------------------PMPGMPGPPPPPMPEMYTDCHKNIINGKQACEEVKQ 613
P P P PP P P + N++ E +
Sbjct: 406 VPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQEHGHE 465
Query: 614 SKKLAK 619
K+
Sbjct: 466 RKRKRG 471
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 29.3 bits (66), Expect = 7.8
Identities = 11/45 (24%), Positives = 19/45 (42%)
Query: 797 ENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQ 841
E IK AEEK E + E ++++ L++ + D
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKAREEEILTGNPLLNTSGDFKV 196
>gnl|CDD|171770 PRK12851, groEL, chaperonin GroEL; Reviewed.
Length = 541
Score = 29.7 bits (67), Expect = 8.2
Identities = 10/16 (62%), Positives = 10/16 (62%)
Query: 558 PKGGPPPPPPPPGGMG 573
PK P PP PP GGM
Sbjct: 525 PKKEPAPPAPPGGGMD 540
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 29.6 bits (66), Expect = 8.3
Identities = 11/32 (34%), Positives = 12/32 (37%), Gaps = 2/32 (6%)
Query: 558 PKGGPPPPPPPPGGMGP--PPPPMPGMPGPPP 587
+ PP P P M P P PG PP
Sbjct: 210 GQSMYQPPGPYPNAMVGRQPFYPQPGAVAGPP 241
>gnl|CDD|222366 pfam13764, E3_UbLigase_R4, E3 ubiquitin-protein ligase UBR4. This
is a family of E## ubiquitin ligase enzymes.
Length = 794
Score = 30.0 bits (68), Expect = 8.3
Identities = 12/45 (26%), Positives = 21/45 (46%), Gaps = 2/45 (4%)
Query: 801 LREAEEKSIRVREAREKAENEKKDKA--ARKKALIDMTTDQTQQG 843
L E E + ++ E R++ EK+ A R+K L + + G
Sbjct: 400 LAEKEGVAKKIDEVRDETRAEKRRLAMAMREKQLQALGMRTSSGG 444
>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 233
Score = 29.1 bits (65), Expect = 8.5
Identities = 29/122 (23%), Positives = 42/122 (34%), Gaps = 20/122 (16%)
Query: 485 LVEHLAEKSKTEEDRRVEDLSAK-----------LEEAIMLRQEAEAKLVQAQKTLED-- 531
L L + E D + E L A+ L + ++++++A K K LE
Sbjct: 12 LFHRLKKAEANERDPQAEALIAEAVAKQPDAAYYLAQRVLIQEQALKKASTQIKELEKRI 71
Query: 532 --LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
L +G + L A T + PP PPP PP GP P
Sbjct: 72 AILQAGEAGSGSFLSN-----AFKWGTPQEPAPPANAPPPKEPAAPPSWRSSPAGPTTQP 126
Query: 590 MP 591
P
Sbjct: 127 SP 128
>gnl|CDD|184062 PRK13455, PRK13455, F0F1 ATP synthase subunit B; Provisional.
Length = 184
Score = 29.0 bits (65), Expect = 8.6
Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 9/38 (23%)
Query: 498 DRRVEDLSAKLEEAIMLRQEAE---------AKLVQAQ 526
D+R E + ++LEEA LR+EA+ + VQ Q
Sbjct: 57 DKRAEGIRSELEEARALREEAQTLLASYERKQREVQEQ 94
>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
1). Dystroglycan is one of the dystrophin-associated
glycoproteins, which is encoded by a 5.5 kb transcript
in human. The protein product is cleaved into two
non-covalently associated subunits, [alpha] (N-terminal)
and [beta] (C-terminal). In skeletal muscle the
dystroglycan complex works as a transmembrane linkage
between the extracellular matrix and the cytoskeleton.
[alpha]-dystroglycan is extracellular and binds to
merosin ([alpha]-2 laminin) in the basement membrane,
while [beta]-dystroglycan is a transmembrane protein and
binds to dystrophin, which is a large rod-like
cytoskeletal protein, absent in Duchenne muscular
dystrophy patients. Dystrophin binds to intracellular
actin cables. In this way, the dystroglycan complex,
which links the extracellular matrix to the
intracellular actin cables, is thought to provide
structural integrity in muscle tissues. The dystroglycan
complex is also known to serve as an agrin receptor in
muscle, where it may regulate agrin-induced
acetylcholine receptor clustering at the neuromuscular
junction. There is also evidence which suggests the
function of dystroglycan as a part of the signal
transduction pathway because it is shown that Grb2, a
mediator of the Ras-related signal pathway, can interact
with the cytoplasmic domain of dystroglycan. In general,
aberrant expression of dystrophin-associated protein
complex underlies the pathogenesis of Duchenne muscular
dystrophy, Becker muscular dystrophy and severe
childhood autosomal recessive muscular dystrophy.
Interestingly, no genetic disease has been described for
either [alpha]- or [beta]-dystroglycan. Dystroglycan is
widely distributed in non-muscle tissues as well as in
muscle tissues. During epithelial morphogenesis of
kidney, the dystroglycan complex is shown to act as a
receptor for the basement membrane. Dystroglycan
expression in mouse brain and neural retina has also
been reported. However, the physiological role of
dystroglycan in non-muscle tissues has remained unclear.
Length = 290
Score = 29.6 bits (66), Expect = 8.6
Identities = 16/38 (42%), Positives = 16/38 (42%), Gaps = 3/38 (7%)
Query: 555 PTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPP--PP 589
P P PPPP P G G P M PPP PP
Sbjct: 253 PNAPPYQPPPPFTTPMEGKGSRPKNMTPYRSPPPYVPP 290
>gnl|CDD|221441 pfam12145, Med12-LCEWAV, Eukaryotic Mediator 12 subunit domain.
This domain is found in eukaryotes, and is typically
between 325 and 354 amino acids in length. The function
of this particular region of the Mediator subunit Med12
is not known, but there is a conserved sequence motif:
LCEWAV, from which the name derives.
Length = 471
Score = 29.4 bits (66), Expect = 9.5
Identities = 11/34 (32%), Positives = 13/34 (38%), Gaps = 2/34 (5%)
Query: 543 LDEVKAQVAAGIPT--GPKGGPPPPPPPPGGMGP 574
L + KA + GI T G G PP P
Sbjct: 25 LSDTKAHTSHGIKTQDGTNGPTTSSGTPPTSSSP 58
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.134 0.380
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,715,060
Number of extensions: 4834122
Number of successful extensions: 13587
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11184
Number of HSP's successfully gapped: 708
Length of query: 927
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 821
Effective length of database: 6,236,078
Effective search space: 5119820038
Effective search space used: 5119820038
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.6 bits)