RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4663
(966 letters)
>gnl|CDD|214539 smart00147, RasGEF, Guanine nucleotide exchange factor for Ras-like
small GTPases.
Length = 242
Score = 240 bits (614), Expect = 6e-73
Identities = 102/249 (40%), Positives = 153/249 (61%), Gaps = 13/249 (5%)
Query: 439 ILVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKEEKSP-NLLKIIKHTTNFTRW 497
+L+L P E+A QLTLL+ E +R + PSE++GS W K +K+ SP NL I+ + W
Sbjct: 1 LLLLDPKELAEQLTLLDFELFRKIDPSELLGSVWGKRSKKSPSPLNLEAFIRRFNEVSNW 60
Query: 498 LEKTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALP 557
+ I++ ++R ++S+ I++ +LNNFN ++AIVSAL S+ + RLK T++ LP
Sbjct: 61 VATEILKQTTPKDRAELLSKFIQVAKHCRELNNFNSLMAIVSALSSSPISRLKKTWEKLP 120
Query: 558 ARLENALVELREL--GNDHFRKYQERLRSIN-PPCVPFFGMYLTNILHIEEGNPDYLPNS 614
++ + EL EL +++ Y+E L S N PPC+PF G+ L ++ I+EGNPD+L N
Sbjct: 121 SKYKKLFEELEELLSPERNYKNYREALSSCNLPPCIPFLGVLLKDLTFIDEGNPDFLEN- 179
Query: 615 NEELINFSKRRRVAEITGEIQQFQNSPYCLSVEPR-IRRFLENLTPFDNWKEIDISNYLY 673
L+NF KRR++AEI EI+Q Q+ PY L I+ L+ L D+ E LY
Sbjct: 180 --GLVNFEKRRQIAEILREIRQLQSQPYNLRPNRSDIQSLLQQL--LDHLDEE---EELY 232
Query: 674 EQSLKIEPR 682
+ SLKIEPR
Sbjct: 233 QLSLKIEPR 241
>gnl|CDD|238087 cd00155, RasGEF, Guanine nucleotide exchange factor for Ras-like
small GTPases. Small GTP-binding proteins of the Ras
superfamily function as molecular switches in
fundamental events such as signal transduction,
cytoskeleton dynamics and intracellular trafficking.
Guanine-nucleotide-exchange factors (GEFs) positively
regulate these GTP-binding proteins in response to a
variety of signals. GEFs catalyze the dissociation of
GDP from the inactive GTP-binding proteins. GTP can then
bind and induce structural changes that allow
interaction with effectors.
Length = 237
Score = 239 bits (611), Expect = 1e-72
Identities = 98/245 (40%), Positives = 152/245 (62%), Gaps = 13/245 (5%)
Query: 439 ILVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKE-EKSPNLLKIIKHTTNFTRW 497
L L P E+A QLTLL+ E +R ++P E++GS W+K +K SPNL + I+ N + W
Sbjct: 1 FLSLDPKELAEQLTLLDFELFRKIEPFELLGSLWSKKDKNIHLSPNLERFIERFNNLSNW 60
Query: 498 LEKTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALP 557
+ I+ N ++R ++S+ I++ +LNNFN ++AIVSAL S+ + RLK T++ L
Sbjct: 61 VASEILLCTNPKKRARLLSKFIQVAKHCRELNNFNSLMAIVSALSSSPISRLKKTWEVLS 120
Query: 558 ARLENALVELRELG--NDHFRKYQERLRSI--NPPCVPFFGMYLTNILHIEEGNPDYLPN 613
++L+ EL EL + +F+ Y++ L+S+ NPPCVPF G+YL ++ + EGNPD+L
Sbjct: 121 SKLKKLFEELEELVDPSRNFKNYRKLLKSVGPNPPCVPFLGVYLKDLTFLHEGNPDFLEG 180
Query: 614 SNEELINFSKRRRVAEITGEIQQFQNSPYCLSVEPRIRRFLENLTPFDNWKEIDISNYLY 673
L+NF KRR++AEI EI+Q Q++ Y L+ + I FL L + I + LY
Sbjct: 181 ---NLVNFEKRRKIAEILREIRQLQSNSYELNRDEDILAFLWKL-----LELILNEDELY 232
Query: 674 EQSLK 678
E SL+
Sbjct: 233 ELSLE 237
>gnl|CDD|216025 pfam00617, RasGEF, RasGEF domain. Guanine nucleotide exchange
factor for Ras-like small GTPases.
Length = 185
Score = 210 bits (538), Expect = 5e-63
Identities = 80/189 (42%), Positives = 122/189 (64%), Gaps = 6/189 (3%)
Query: 440 LVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKEEKSPNLLKIIKHTTNFTRWLE 499
L L P+E+ARQLTL+ESE + A+ P E++ SAW K + + SPN+ I +RW+
Sbjct: 1 LDLDPLELARQLTLIESELFSAIDPRELLDSAWGKKDSK-LSPNIEAFISRFNQLSRWVA 59
Query: 500 KTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALPAR 559
I+ E+L++R ++ + I+I +LNNFN ++AI+S L S+ + RLK T++ + +
Sbjct: 60 TEILSEEDLKKRAKVIKKFIKIAQHCRELNNFNSLMAIISGLNSSPISRLKKTWELVSKK 119
Query: 560 LENALVELREL--GNDHFRKYQERLRSINPPCVPFFGMYLTNILHIEEGNPDYLPNSNEE 617
+ L EL EL + +F+ Y+E L+ NPPC+PF G+YL ++ IEEGNPD+L N
Sbjct: 120 YKKLLEELEELMSPSKNFKNYREALKKANPPCIPFLGLYLKDLTFIEEGNPDFLDN---G 176
Query: 618 LINFSKRRR 626
LINF KRR+
Sbjct: 177 LINFEKRRK 185
>gnl|CDD|241292 cd01261, PH_SOS, Son of Sevenless (SOS) Pleckstrin homology (PH)
domain. SOS is a Ras guanine nucleotide exchange
factor. SOS is thought to transmit signals from
activated receptor tyrosine kinases to the Ras signaling
pathway. SOS contains a histone domain, Dbl-homology
(DH), a PH domain, Rem domain, Cdc25 domain, and a Grb2
binding domain. The SOS PH domain binds to
phosphatidylinositol-4,5-bisphosphate (PIP2) and
phosphatidic acid (PA). SOS is dependent on Ras binding
to the allosteric site via its histone domain for both a
lower level of activity (Ras GDP) and maximal activity
(Ras GTP). The DH domain blocks the allosteric Ras
binding site in SOS. The PH domain is closely associated
with the DH domain and the action of the DH-PH unit
gates a reciprocal interaction between Ras and SOS. The
C-terminal proline-rich domain of SOS binds to the
adapter protein Grb2 which localizes the Sos protein to
the plasma membrane and diminishes the negative effect
of the C-terminal domain on the guanine nucleotide
exchange activity of the CDC25-homology domain of SOS.
PH domains have diverse functions, but in general are
involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 109
Score = 160 bits (406), Expect = 4e-46
Identities = 65/117 (55%), Positives = 76/117 (64%), Gaps = 8/117 (6%)
Query: 108 QCCNEFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEY 167
QCCNEF E TL K+ G STER FLFDGLLLLCK N T+ + P EY
Sbjct: 1 QCCNEFIMEGTLTKV--GAGKRSTERHLFLFDGLLLLCKSNQSSTASGQSKP------EY 52
Query: 168 RLKERFFIRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIMLN 224
RLKE+ FIR++EI DL D+D+ KNAF+IVPR IL A S E+KNNWMA L+ L
Sbjct: 53 RLKEKIFIRKVEINDLEDTDELKNAFEIVPRDEPSYILFAKSAEEKNNWMAALVTLQ 109
>gnl|CDD|214571 smart00229, RasGEFN, Guanine nucleotide exchange factor for
Ras-like GTPases; N-terminal motif. A subset of guanine
nucleotide exchange factor for Ras-like small GTPases
appear to possess this domain N-terminal to the RasGef
(Cdc25-like) domain. The recent crystal structureof Sos
shows that this domain is alpha-helical and plays a
"purely structural role" (Nature 394, 337-343).
Length = 127
Score = 128 bits (325), Expect = 7e-35
Identities = 52/132 (39%), Positives = 72/132 (54%), Gaps = 7/132 (5%)
Query: 275 GVPLIKGATLYKLVERLTYHIY-ADPAFVRTFLTTYRSFCTPLNFLQLLIERFSIPEFQG 333
LIKG TL L+E LT ADP+FV TFL TYRSF T LQLL+ R++ +
Sbjct: 1 DGGLIKGGTLEALIEHLTEAFDKADPSFVETFLLTYRSFITTQELLQLLLYRYNAIPPES 60
Query: 334 TVREDLKKFRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLEQLKLFLETINGKSMRK 393
V K + V+ RVLN+LR WV++++ DFE D L+ L FLE ++ +
Sbjct: 61 WV------EEKVNPRRVKNRVLNILRTWVENYWEDFEDDPKLISFLLEFLELVDDEKYPG 114
Query: 394 WVDSVSKVIFRK 405
V S+ ++ R
Sbjct: 115 LVTSLLNLLRRL 126
>gnl|CDD|100121 cd06224, REM, Guanine nucleotide exchange factor for Ras-like
GTPases; N-terminal domain (RasGef_N), also called REM
domain (Ras exchanger motif). This domain is common in
nucleotide exchange factors for Ras-like small GTPases
and is typically found immediately N-terminal to the
RasGef (Cdc25-like) domain. REM contacts the GTPase and
is assumed to participate in the catalytic activity of
the exchange factor. Proteins with the REM domain
include Sos1 and Sos2, which relay signals from
tyrosine-kinase mediated signalling to Ras, RasGRP1-4,
RasGRF1,2, CNrasGEF, and RAP-specific nucleotide
exchange factors, to name a few.
Length = 122
Score = 117 bits (294), Expect = 7e-31
Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 6/123 (4%)
Query: 283 TLYKLVERLTYH-IYADPAFVRTFLTTYRSFCTPLNFLQLLIERFSIPEFQGTVREDLKK 341
TL L+E LT DP+FV TFL TYRSF TP L+ LIER+ I + +
Sbjct: 1 TLEALIEHLTSTFDMPDPSFVSTFLLTYRSFTTPTELLEKLIERYEIAPPENL---EYND 57
Query: 342 FRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLEQLKLFLETIN--GKSMRKWVDSVS 399
+ K+ +P++ RVLNVLR WV+++ YDF D LLE L+ FL + G +++ +
Sbjct: 58 WDKKKSKPIRLRVLNVLRTWVENYPYDFFDDEELLELLEEFLNRLVQEGALLQELKKLLR 117
Query: 400 KVI 402
K++
Sbjct: 118 KLL 120
>gnl|CDD|216026 pfam00618, RasGEF_N, RasGEF N-terminal motif. A subset of guanine
nucleotide exchange factor for Ras-like small GTPases
appear to possess this motif/domain N-terminal to the
RasGef (Cdc25-like) domain.
Length = 94
Score = 99.4 bits (248), Expect = 6e-25
Identities = 44/105 (41%), Positives = 58/105 (55%), Gaps = 13/105 (12%)
Query: 279 IKGATLYKLVERLTYH-IYADPAFVRTFLTTYRSFCTPLNFLQLLIERFSIPEFQGTVRE 337
+KG TL LVE LT D +F+ TFL TYRSF TP L+LL+ERF P + +E
Sbjct: 2 VKGGTLEALVEHLTSTDDMLDDSFLETFLLTYRSFTTPGELLELLLERFLSPPLGLSYQE 61
Query: 338 DLKKFRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLEQLKLF 382
++ RVLNVLR W++++ DF D LL L+ F
Sbjct: 62 ------------IRIRVLNVLRKWIENYNSDFVDDPELLSLLEEF 94
>gnl|CDD|214574 smart00233, PH, Pleckstrin homology domain. Domain commonly found
in eukaryotic signalling proteins. The domain family
possesses multiple functions including the abilities to
bind inositol phosphates, and various proteins. PH
domains have been found to possess inserted domains
(such as in PLC gamma, syntrophins) and to be inserted
within other domains. Mutations in Brutons tyrosine
kinase (Btk) within its PH domain cause X-linked
agammaglobulinaemia (XLA) in patients. Point mutations
cluster into the positively charged end of the molecule
around the predicted binding site for
phosphatidylinositol lipids.
Length = 102
Score = 59.1 bits (143), Expect = 1e-10
Identities = 25/110 (22%), Positives = 41/110 (37%), Gaps = 15/110 (13%)
Query: 113 FFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKER 172
+E L K + G K +R LF+ LL K + S Y+ K
Sbjct: 1 VIKEGWLYKKSGGGKKSWKKRYFVLFNSTLLYYKSKKDKKS-------------YKPKGS 47
Query: 173 FFIRRIEILDLPDSD--DYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
+ + + PD D + F+I ++L A S E++ W+ L
Sbjct: 48 IDLSGCTVREAPDPDSSKKPHCFEIKTSDRKTLLLQAESEEEREKWVEAL 97
>gnl|CDD|241258 cd01224, PH_Collybistin_ASEF, Collybistin/APC-stimulated guanine
nucleotide exchange factor pleckstrin homology (PH)
domain. Collybistin (also called PEM2) is homologous to
the Dbl proteins ASEF (also called ARHGEF4/RhoGEF4) and
SPATA13 (Spermatogenesis-associated protein 13; also
called ASEF2). It activates CDC42 specifically and not
any other Rho-family GTPases. Collybistin consists of an
SH3 domain, followed by a RhoGEF/DH and PH domain. In
Dbl proteins, the DH and PH domains catalyze the
exchange of GDP for GTP in Rho GTPases, allowing them to
signal to downstream effectors. It induces submembrane
clustering of the receptor-associated peripheral
membrane protein gephyrin, which is thought to form a
scaffold underneath the postsynaptic membrane linking
receptors to the cytoskeleton. It also acts as a tumor
suppressor that links adenomatous polyposis coli (APC)
protein, a negative regulator of the Wnt signaling
pathway and promotes the phosphorylation and degradation
of beta-catenin, to Cdc42. Autoinhibition of collybistin
is accomplished by the binding of its SH3 domain with
both the RhoGEF and PH domains to block access of Cdc42
to the GTPase-binding site. Inactivation promotes cancer
progression. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 139
Score = 57.6 bits (140), Expect = 9e-10
Identities = 36/137 (26%), Positives = 56/137 (40%), Gaps = 21/137 (15%)
Query: 88 IEKTNELSKTVDGWTSVDIGQCCNEFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKP 147
+EK +++GW D+ +E L K++ K + ER FLFD L+ CK
Sbjct: 4 LEKLAAWQSSIEGWEGEDLLDRSSELIHSGELSKISSKGKAQ--ERTFFLFDHQLVYCKK 61
Query: 148 NSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPD-SDDY-----KNAFQIVPRCGS 201
+ + K R + +EI DL D D KNA++I +
Sbjct: 62 DLLKRDN------------LIYKGRIDLDNMEIEDLEDGKDKDSGVTVKNAWKIYNTSKN 109
Query: 202 PV-ILIANSPEDKNNWM 217
+L A S E+K W+
Sbjct: 110 KWYLLCAKSAEEKQRWL 126
>gnl|CDD|215766 pfam00169, PH, PH domain. PH stands for pleckstrin homology.
Length = 101
Score = 51.7 bits (124), Expect = 3e-08
Identities = 28/112 (25%), Positives = 44/112 (39%), Gaps = 16/112 (14%)
Query: 114 FREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERF 173
+E LLK G + +R LFDG+LL K + K R K
Sbjct: 2 IKEGWLLKKGSGGRKSWKKRYFVLFDGVLLYYKDSKKS--------------SSRPKGSI 47
Query: 174 FIRRIEILDLPDSD--DYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIML 223
+ ++ +PDS+ KN F+I +L A S E++ W+ +
Sbjct: 48 PLSGCQVTKVPDSEDGKRKNCFEIRTGDRETFLLQAESEEERKEWVKAIRSA 99
>gnl|CDD|241540 cd13389, PH1_FGD5_FGD6, FYVE, RhoGEF and PH domain
containing/faciogenital dysplasia proteins 5 and 6
Pleckstrin Homology (PH) domain. FGD5 regulates
promotes angiogenesis of vascular endothelial growth
factor (VEGF) in vascular endothelial cells, including
network formation, permeability, directional movement,
and proliferation. The specific function of FGD6 is
unknown. In general, FGDs have a RhoGEF (DH) domain,
followed by a PH domain, a FYVE domain and a C-terminal
PH domain. All FGDs are guanine nucleotide exchange
factors that activate the Rho GTPase Cdc42, an important
regulator of membrane trafficking. The RhoGEF domain is
responsible for GEF catalytic activity, while the PH
domain is involved in intracellular targeting of the DH
domain. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 125
Score = 51.5 bits (124), Expect = 9e-08
Identities = 30/109 (27%), Positives = 51/109 (46%), Gaps = 17/109 (15%)
Query: 112 EFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKE 171
+E L+K++ K E R FLF+ +LL P+ G+Y+LK
Sbjct: 13 VLIKEGELMKVS-RK--EMQPRYLFLFNDVLLYT------------TPVQSGVGKYKLKN 57
Query: 172 RFFIRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
+ +++ LP+ ++Y N FQI S L A+S E+++ W+ L
Sbjct: 58 ELPLSGMKVS-LPEDEEYSNEFQIESTKRS-FTLSASSAEERDEWVKAL 104
>gnl|CDD|241231 cd00821, PH, Pleckstrin homology (PH) domain. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 92
Score = 45.4 bits (107), Expect = 5e-06
Identities = 23/105 (21%), Positives = 36/105 (34%), Gaps = 15/105 (14%)
Query: 115 REDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFF 174
+E L K +R LFD +LL K + + K
Sbjct: 1 KEGWLKKRGGKGLKSWKKRWFVLFDDVLLYYKSKKDSS--------------KKPKGLIP 46
Query: 175 IRRIEILDLPDSDDYKNAFQIV-PRCGSPVILIANSPEDKNNWMA 218
+ ++L S N F++V P G L A S E++ W+
Sbjct: 47 LSDGLEVELVSSSGKPNCFELVTPDRGRTYYLQAESEEEREEWLE 91
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 49.0 bits (116), Expect = 1e-05
Identities = 45/194 (23%), Positives = 73/194 (37%), Gaps = 20/194 (10%)
Query: 720 TVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTA--FS 777
T TG +S L E D +P + T T N+ +P + SP + T+ +
Sbjct: 470 TSGTPTGTTSSTLPE-DTSPTSRTTSATPNAT----SPTPAVTTPNATSPTTQKTSDTPN 524
Query: 778 MFSTQSVQSLASSTITLRPGSSSGPEFS-----GEDSPGLMGSTFDGLYSPPPLPPRTNR 832
S + ++T T P ++ + E+SP + + + S P +
Sbjct: 525 ATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESP-VNNTNTPVVTSAPSVLTSAVT 583
Query: 833 RREV-----STSDQSVHSRDISPPPLPPRRETRPSLSS--PPGYASSPDPSPSSPGRFHV 885
+ TS Q P T P L+S P G + + +PS P HV
Sbjct: 584 TGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHV 643
Query: 886 SSPSPSKPSGSTSR 899
S+ SP G+TS+
Sbjct: 644 STLSPGPGPGTTSQ 657
Score = 36.7 bits (84), Expect = 0.072
Identities = 41/213 (19%), Positives = 68/213 (31%), Gaps = 23/213 (10%)
Query: 718 ALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFS 777
+V T P+ + S + T +T + V SP S
Sbjct: 546 TTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTT----GQHGTGSSPTSQQPGIP 601
Query: 778 MFSTQSVQSLASSTITLRPGSSSGP---EFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRR 834
S + +S +ST T +S+ P E E++P + +T SP P P T++
Sbjct: 602 SSSHSTPRS--NSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVS 659
Query: 835 EVSTSDQS-----VHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPS 889
S S VH + P P+ +SP + P+ ++ +
Sbjct: 660 GPGNSSTSRYPGEVHVTEGMPN---------PNATSPSAPSGQKTAVPTVTSTGGKANST 710
Query: 890 PSKPSGSTSRLPLLPPKPFSFGKQVPMFQFNAP 922
+ SGST P + P
Sbjct: 711 TKETSGSTLMASTSPHTNEGAFRTTPYNATTYL 743
Score = 35.1 bits (80), Expect = 0.21
Identities = 45/227 (19%), Positives = 68/227 (29%), Gaps = 38/227 (16%)
Query: 707 SKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNAN-- 764
S+ A N + T T SP ++ +P T A +
Sbjct: 490 SRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSV 549
Query: 765 --------GSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGST 816
+SP + + + S SV + A +T GSS + G S
Sbjct: 550 PNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPR 609
Query: 817 FDGLYSPPPL----PPRTNRRREVS---TSDQSVHSRDISPPPLPPRRETRPSLSSPPGY 869
+ + P L P E + S V + P P
Sbjct: 610 SNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGP------------GTTSQ 657
Query: 870 ASSPDPSPSS--PGRFHVS-------SPSPSKPSGSTSRLPLLPPKP 907
S P S +S PG HV+ + SPS PSG + +P +
Sbjct: 658 VSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704
Score = 30.5 bits (68), Expect = 5.9
Identities = 27/138 (19%), Positives = 44/138 (31%), Gaps = 18/138 (13%)
Query: 717 PALTVNKITGPKNSPLSESDGTPQ-TPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTA 775
T ++++GP NS S G T P NA +P TA
Sbjct: 651 GPGTTSQVSGPGNSSTSRYPGEVHVTEGMPN--------------PNATSPSAPSGQKTA 696
Query: 776 FSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPP-LPPRTNRRR 834
++ ++ +++ T GS+ S + G +T + P R R
Sbjct: 697 VPTVTSTGGKANSTTKET--SGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPR 754
Query: 835 EVSTSDQSVHSRDISPPP 852
TS + P P
Sbjct: 755 WTFTSPPVTTKQATVPVP 772
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 47.9 bits (114), Expect = 3e-05
Identities = 42/218 (19%), Positives = 60/218 (27%), Gaps = 13/218 (5%)
Query: 701 SPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTV--FAPI 758
PG +A N +L+ P G PPT AP
Sbjct: 74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPD 133
Query: 759 MLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGS---SSGPEFSGEDSPGLMGS 815
+ SPG A + S ++AS + R + SS E +P +
Sbjct: 134 LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEE--TARAPSSPPA 191
Query: 816 TFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
P PR RR ++ S SP P P R + +S +SS
Sbjct: 192 EPPPSTPPAAASPRPPRRSSPISASAS------SPAPAPGRSAADDAGASSSDSSSSESS 245
Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQ 913
P P+ + T S
Sbjct: 246 GCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283
Score = 46.7 bits (111), Expect = 6e-05
Identities = 34/181 (18%), Positives = 53/181 (29%), Gaps = 12/181 (6%)
Query: 727 PKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQS 786
+SP +E + P + + + SP + + S
Sbjct: 185 APSSPPAEPPPSTPPAAASPRPPRRSSPISA-------SASSPAPAPGRSAADD--AGAS 235
Query: 787 LASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSR 846
+ S+ + G GPE + P + + S R
Sbjct: 236 SSDSSSSESSGCGWGPE---NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPR 292
Query: 847 DISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPK 906
+ SP P P + P+ SSP +SS SS SS S + S P P
Sbjct: 293 ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPS 352
Query: 907 P 907
P
Sbjct: 353 P 353
Score = 42.1 bits (99), Expect = 0.002
Identities = 32/217 (14%), Positives = 62/217 (28%), Gaps = 9/217 (4%)
Query: 691 PRRWPNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANS 750
PRR +S + + A ++ A + + + S + PL P +
Sbjct: 207 PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE--SSGCGWGPENECPLPRPAPIT 264
Query: 751 EFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSP 810
T N S+ + S++ + S + + + S
Sbjct: 265 LPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRES 324
Query: 811 GLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYA 870
++ S + +S PP P RP S P
Sbjct: 325 SSSSTSSSSESSRGAAVSPG------PSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378
Query: 871 SSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
++ P+ R + ++ +T R P P+P
Sbjct: 379 AASAGRPTRR-RARAAVAGRARRRDATGRFPAGRPRP 414
Score = 41.7 bits (98), Expect = 0.002
Identities = 30/142 (21%), Positives = 50/142 (35%), Gaps = 11/142 (7%)
Query: 764 NGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSP 823
+ S + S+ + S+ S + +SS+ + SS G S SP S S
Sbjct: 301 SSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSP-----SR 355
Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
PP P + R+ ++ S S P RR R +++ + P+ R
Sbjct: 356 PPPPADPSSPRKRPRPSRAPSSPAASAGR-PTRRRARAAVAGRARRRDATGRFPAGRPR- 413
Query: 884 HVSSPSPSKPSGSTSRLPLLPP 905
PSP ++ P
Sbjct: 414 ----PSPLDAGAASGAFYARYP 431
Score = 38.6 bits (90), Expect = 0.022
Identities = 30/174 (17%), Positives = 49/174 (28%), Gaps = 20/174 (11%)
Query: 756 APIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDS------ 809
P L+ A P ++ LR + S S
Sbjct: 764 VPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRS 823
Query: 810 --PGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQS------VHSRDISPPPLPPRRETRP 861
P + P R S+ + ++ P PP RP
Sbjct: 824 HTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARP 883
Query: 862 SLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
++PP A++ P+ + R P+P G +P P P ++VP
Sbjct: 884 GAAAPPKAAAAAPPAGAPAPRPR---PAPRVKLG---PMPPGGPDPRGGFRRVP 931
Score = 38.2 bits (89), Expect = 0.025
Identities = 33/159 (20%), Positives = 52/159 (32%), Gaps = 11/159 (6%)
Query: 725 TGPKNSPLSESDGTPQ--TPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQ 782
+S E +P +P + P +S S+++ S+ S ST+ S S++
Sbjct: 284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRAS------SSSSSSRESSSSSTSSSSESSR 337
Query: 783 SVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQS 842
+ + P S P + SP SP R RRR +
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGR 397
Query: 843 VHSRDISPP---PLPPRRETRPSLSSPPGYASSPDPSPS 878
RD + P +S YA P +PS
Sbjct: 398 ARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS 436
Score = 36.3 bits (84), Expect = 0.099
Identities = 22/118 (18%), Positives = 29/118 (24%), Gaps = 8/118 (6%)
Query: 795 RPGSSSGPEFSGEDSPGLMG----STFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISP 850
G + G F+ PPP P E + + ++P
Sbjct: 39 SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAP 97
Query: 851 -PPLPPRRETRPSLSSPPGYASSPDP--SPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
P T P SSP +P P P SP P G
Sbjct: 98 ASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAA 155
Score = 31.7 bits (72), Expect = 2.3
Identities = 22/139 (15%), Positives = 37/139 (26%), Gaps = 10/139 (7%)
Query: 788 ASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVS----TSDQSV 843
A E + L S + R +R +S +
Sbjct: 778 AEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPAR 837
Query: 844 HSRDISPPPLPPRRETRPSLSSPPGYASSPD-----PSPSSPGRFH-VSSPSPSKPSGST 897
+ PP E+ S + G + P P P ++P + +
Sbjct: 838 PPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPP 897
Query: 898 SRLPLLPPKPFSFGKQVPM 916
+ P P+P K PM
Sbjct: 898 AGAPAPRPRPAPRVKLGPM 916
Score = 30.9 bits (70), Expect = 4.0
Identities = 22/117 (18%), Positives = 31/117 (26%), Gaps = 8/117 (6%)
Query: 783 SVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQS 842
+V +A + R +GP S +S L P + R T
Sbjct: 52 AVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGP 111
Query: 843 VHSRDISPPPLPPRRETRPSL--SSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGST 897
S P PP S S P + P S S P+ +
Sbjct: 112 ------SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAV 162
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 44.2 bits (104), Expect = 5e-04
Identities = 27/112 (24%), Positives = 36/112 (32%), Gaps = 2/112 (1%)
Query: 796 PGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPP 855
P S P + D P P R +R R ++ SPP P
Sbjct: 2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARR--LGRAAQASSPPQRPR 2684
Query: 856 RRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
RR RP++ S A P P P+ H + P G + P P
Sbjct: 2685 RRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALP 2736
Score = 42.6 bits (100), Expect = 0.001
Identities = 45/224 (20%), Positives = 60/224 (26%), Gaps = 28/224 (12%)
Query: 695 PNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTV 754
P P AR A P +P + P LT P S +
Sbjct: 2741 PPAVPAGPATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRRLTRPAVAS-LSE 2793
Query: 755 FAPIMLSNANGSQSPGSVSTAFSMF-STQSVQSLASSTITLRPGSSSGPEFSGEDSPGLM 813
+ S + + P +V + S + +P + P S L
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853
Query: 814 GSTFDG--------LYSPPPLP-----PRTNR--RREVSTSDQSVHSRDISPPPLP-PRR 857
GS G SP P P R R VS S +S P P P+
Sbjct: 2854 GSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQA 2913
Query: 858 ETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLP 901
P P P P P P R P P+ +
Sbjct: 2914 PPPPQPQPQPPPPPQPQPPPPPPPR----PQPPLAPTTDPAGAG 2953
Score = 40.7 bits (95), Expect = 0.005
Identities = 25/92 (27%), Positives = 31/92 (33%), Gaps = 11/92 (11%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPP---RRETRPSLSSPPGYASSP-DPSPS 878
PPPLPP + D+SV +P P P R RP +P D
Sbjct: 2552 PPPLPPAA----PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD 2607
Query: 879 SPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSF 910
G + PSP P P P P +
Sbjct: 2608 PRGP---APPSPLPPDTHAPDPPPPSPSPAAN 2636
Score = 39.5 bits (92), Expect = 0.010
Identities = 32/177 (18%), Positives = 46/177 (25%), Gaps = 13/177 (7%)
Query: 738 TPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPG 797
P P P P P + + + + + +T
Sbjct: 2735 LPAAPAPPAVPA------GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
Query: 798 SSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRR 857
+S SP + +P P S PPP PP
Sbjct: 2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
Query: 858 ETRPSLSSPPG-------YASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
S PG + SP P++P R V + S ST L P +P
Sbjct: 2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
Score = 38.8 bits (90), Expect = 0.017
Identities = 24/84 (28%), Positives = 30/84 (35%), Gaps = 8/84 (9%)
Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
P P T+R R QS R PR P S P +PDP P SP
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPP--SPLPPDTHAPDPPPPSP--- 2631
Query: 884 HVSSPSPSKPSGSTSRLPLLPPKP 907
SP+ ++P P +P
Sbjct: 2632 ---SPAANEPDPHPPPTVPPPERP 2652
Score = 38.4 bits (89), Expect = 0.026
Identities = 23/95 (24%), Positives = 34/95 (35%), Gaps = 16/95 (16%)
Query: 822 SPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPG 881
P P + PPP PPR + P L+ A + +PS + P
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ--PPLAPTTDPAGAGEPSGAVPQ 2961
Query: 882 --------------RFHVSSPSPSKPSGSTSRLPL 902
RF V P+PS+ + ++S PL
Sbjct: 2962 PWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPL 2996
Score = 36.8 bits (85), Expect = 0.062
Identities = 40/209 (19%), Positives = 60/209 (28%), Gaps = 41/209 (19%)
Query: 691 PRRWPNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANS 750
P+R + + S +A P T P+ +P + TP P P A
Sbjct: 2680 PQRPRRRAARPT--VGSLTSLADPPPPPPT------PEPAPHALVSATP-LPPGPAAARQ 2730
Query: 751 EF--TVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGED 808
AP + G +PG + A T P + + P
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARP------------ARPPTTAGPPAPAPPA----- 2773
Query: 809 SPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPG 868
+P PPR R V++ +S S P P ++ P
Sbjct: 2774 -------------APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
Query: 869 YASSPDPSPSSPGRFHVSSPSPSKPSGST 897
AS P P + P P P +
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
Score = 33.8 bits (77), Expect = 0.68
Identities = 28/125 (22%), Positives = 41/125 (32%), Gaps = 16/125 (12%)
Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNR-------RREVSTSDQSVHSRD 847
RP + P+ + +P G P PLPP T+ +
Sbjct: 2588 RPDAP--PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
Query: 848 ISPPPLP-----PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPL 902
+ PP P P R +RP + G A+ P P R ++ S + P
Sbjct: 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR--RAARPTVGSLTSLADPPP 2703
Query: 903 LPPKP 907
PP P
Sbjct: 2704 PPPTP 2708
Score = 33.0 bits (75), Expect = 1.0
Identities = 22/87 (25%), Positives = 29/87 (33%), Gaps = 2/87 (2%)
Query: 822 SPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPG 881
SPP P R R V + P P P + PPG A++ SP+ P
Sbjct: 2678 SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737
Query: 882 RFHVSSPSPSKPSG-STSRLPLLPPKP 907
P+ P+ P PP
Sbjct: 2738 A-PAPPAVPAGPATPGGPARPARPPTT 2763
Score = 32.2 bits (73), Expect = 1.7
Identities = 22/92 (23%), Positives = 32/92 (34%), Gaps = 8/92 (8%)
Query: 819 GLYSPPPLPPRTNRRREVSTSDQSV---HSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
G + P T +RR + D P P + P+ + P AS+P P
Sbjct: 371 GRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAP-P 429
Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
P++P + S P G PP P
Sbjct: 430 PPATP----LPSAEPGSDDGPAPPPERQPPAP 457
Score = 31.4 bits (71), Expect = 3.0
Identities = 40/168 (23%), Positives = 58/168 (34%), Gaps = 15/168 (8%)
Query: 727 PKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQ--SV 784
P+ P PQ PL P T + + + G+ PG V+ S
Sbjct: 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR 2986
Query: 785 QSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
++ ASST L S S S S L T PPP+ + ++ + D +
Sbjct: 2987 EAPASSTPPLTGHSLSRVS-SWASSLALHEET-----DPPPV----SLKQTLWPPDDTED 3036
Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSP---DPSPSSPGRFHVSSPS 889
S S R +L P P +P P++P SPS
Sbjct: 3037 SDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPS 3084
Score = 31.1 bits (70), Expect = 4.2
Identities = 21/86 (24%), Positives = 28/86 (32%), Gaps = 15/86 (17%)
Query: 832 RRREVSTSDQSV-------HSRDISPPPLPPRRETRPS---LSSPPGYASSPDPSPSSPG 881
RRR T S+ H + P RR R + + PG P+ P
Sbjct: 354 RRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413
Query: 882 RFHVSSPSPSKPSGSTSRLPLLPPKP 907
S P+P+ S P P P
Sbjct: 414 ----SVPTPAPTPVPASA-PPPPATP 434
Score = 30.7 bits (69), Expect = 5.1
Identities = 31/120 (25%), Positives = 40/120 (33%), Gaps = 16/120 (13%)
Query: 796 PGSSSGPEFSGEDSPGLMGSTFDGLYSPP---PLPPRT---NRRREVSTSDQSVHSRDIS 849
PG P+ +P + L P P+ PR R E SD +
Sbjct: 2498 PGGGGPPDPDAPPAPSRLAPAI--LPDEPVGEPVHPRMLTWIRGLEELASDDAGDP---- 2551
Query: 850 PPPLPPR-RETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPF 908
PPPLPP P S PP P P PS P + + P + R P+
Sbjct: 2552 PPPLPPAAPPAAPDRSVPP---PRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDP 2608
>gnl|CDD|241253 cd01218, PH_Phafin2-like, Phafin2 (also called EAPF, FLJ13187,
ZFYVE18 or PLEKHF2) Pleckstrin Homology (PH) domain.
Phafin2 is differentially expressed in the liver cancer
cell and regulates the structure and function of the
endosomes through Rab5-dependent processes. Phafin2
modulates the cell's response to extracellular
stimulation by modulating the receptor density on the
cell surface. Phafin2 contains a PH domain and a FYVE
domain. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 123
Score = 40.3 bits (95), Expect = 6e-04
Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 178 IEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
++I DL D+ + KN +QI+ S V+ A + +K+ WM +
Sbjct: 79 VKIEDLEDTGELKNGWQIISPKKSFVV-YAATATEKSEWMDHI 120
>gnl|CDD|241263 cd01229, PH_Ect2, Epithelial cell transforming 2 (Ect2) pleckstrin
homology (PH) domain. Ect2, a mammalian ortholog of
Drosophila pebble, plays a role in neuronal
differentiation and brain development. Pebble and Ect2
have been identified as Rho-family guanine nucleotide
exchange factors (GEF) that mediate activation of Rho
during cytokinesis, but are proposed to play slightly
different roles. PH domains have diverse functions, but
in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 180
Score = 39.9 bits (94), Expect = 0.002
Identities = 30/128 (23%), Positives = 55/128 (42%), Gaps = 27/128 (21%)
Query: 120 LKLNIGKKTESTERVC-FLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKE-RFFIRR 177
++L K+ + + FLF LL +CK KR+SV TA L+E + +
Sbjct: 49 VELGDSLKSGRGDSLALFLFSDLLEICK---KRSSVKGTAKSPRGSTGSGLRESKKKYKH 105
Query: 178 IE---------ILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIMLN---T 225
I+ ++D+ +++D +N F ++ R SP + + +L+
Sbjct: 106 IKLMPLSNIKRVIDITETEDCQNVFALLFR----------SPTELKEKLYSFQILDEETD 155
Query: 226 KSMFERTL 233
K F +TL
Sbjct: 156 KEKFLKTL 163
>gnl|CDD|241539 cd13388, PH1_FGD4, FYVE, RhoGEF and PH domain
containing/faciogenital dysplasia protein 4 pleckstrin
homology (PH), N-terminal domain. In general, FGDs have
a RhoGEF (DH) domain, followed by an N-terminal PH
domain, a FYVE domain and a C-terminal PH domain. All
FGDs are guanine nucleotide exchange factors that
activates the Rho GTPase Cdc42, an important regulator
of membrane trafficking. The RhoGEF domain is
responsible for GEF catalytic activity, while the
N-terminal PH domain is involved in intracellular
targeting of the DH domain. FGD4 is one of the genes
associated with Charcot-Marie-Tooth neuropathy type 4
(CMT4), a group of progressive motor and sensory axonal
and demyelinating neuropathies that are distinguished
from other forms of CMT by autosomal recessive
inheritance. Those affected have distal muscle weakness
and atrophy associated with sensory loss and,
frequently, pes cavus foot deformity. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 92
Score = 38.2 bits (89), Expect = 0.002
Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 16/100 (16%)
Query: 120 LKLNIGKKTEST-ERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRI 178
L I ++ T ER FLF+ LLLLC S R +Y+++ F I +
Sbjct: 5 KLLKISSRSGDTQERHLFLFNDLLLLC---SPRLIP--------VGSKYKVRALFGISGM 53
Query: 179 EILDLPDSDDYKNAFQIVPRCGSPVI-LIANSPEDKNNWM 217
++L+ D+ + +N F + R + L + ++K W+
Sbjct: 54 QVLE-GDNLETENTFYV--RGKQRCVELCTGTQKEKEEWI 90
>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
protein.
Length = 342
Score = 39.8 bits (93), Expect = 0.006
Identities = 32/150 (21%), Positives = 44/150 (29%), Gaps = 13/150 (8%)
Query: 727 PKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQS 786
K P S ++ F F + + + S T +VQ
Sbjct: 8 KKRKPSSIGISDDESATDSEAEPKGFREFQSVGVQVEDEKLQRRRF--KRSSSVTANVQ- 64
Query: 787 LASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTN----RRREVSTSDQS 842
A + PG S ED G +F S PP ++ R VST QS
Sbjct: 65 -ADLELEGFPGLPVVRHVSTEDKALQFGPSFQRESSEDSPPPSSSTYSAGTRTVSTQGQS 123
Query: 843 VHSRDISPPPLPPRRET-----RPSLSSPP 867
+ D P E+ S PP
Sbjct: 124 AYLSDPKRRPSSEASESETVAFDESDLPPP 153
>gnl|CDD|241482 cd13328, PH1_FDG_family, FYVE, RhoGEF and PH domain
containing/faciogenital dysplasia family proteins
pleckstrin homology (PH), N-terminal domain. In
general, FGDs have a RhoGEF (DH) domain, followed by an
N-terminal PH domain, a FYVE domain and a C-terminal PH
domain. All FGDs are guanine nucleotide exchange factors
that activates the Rho GTPase Cdc42, an important
regulator of membrane trafficking. The RhoGEF domain is
responsible for GEF catalytic activity, while the
N-terminal PH domain is involved in intracellular
targeting of the DH domain. Mutations in the FGD1 gene
are responsible for the X-linked disorder known as
faciogenital dysplasia (FGDY). PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 91
Score = 36.7 bits (85), Expect = 0.006
Identities = 27/102 (26%), Positives = 46/102 (45%), Gaps = 14/102 (13%)
Query: 115 REDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFF 174
+E +LKL K S ER FLF+ +LL C P L +Y ++ R
Sbjct: 1 KEGHILKL-SAKNGTSQERHLFLFNDMLLYCVPK-----------LRLIGQKYGVRARMD 48
Query: 175 IRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNW 216
+ + + + D+ + F I + S + L A++ E+K+ W
Sbjct: 49 VDGMNVQVVKG-DEVPHTFYIEGKQKS-LELQASTAEEKDAW 88
>gnl|CDD|241397 cd13243, PH_PLEKHG1_G2_G3, Pleckstrin homology domain-containing
family G members 1, 2, and 3 pleckstrin homology (PH)
domain. PLEKHG1 (also called ARHGEF41), PLEKHG2 (also
called ARHGEF42 or CLG/common-site lymphoma/leukemia
guanine nucleotide exchange factor2), and PLEKHG3 (also
called ARHGEF43) have RhoGEF DH/double-homology domains
in tandem with a PH domain which is involved in
phospholipid binding. They function as a guanine
nucleotide exchange factor (GEF) and are involved in the
regulation of Rho protein signal transduction. Mutations
in PLEKHG1 have been associated panic disorder (PD), an
anxiety disorder characterized by panic attacks and
anticipatory anxiety. PH domains have diverse functions,
but in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 147
Score = 38.1 bits (89), Expect = 0.006
Identities = 36/154 (23%), Positives = 56/154 (36%), Gaps = 26/154 (16%)
Query: 74 TGIRAY-NRLKR-QQAIEKTNELSKTVDGWTSVDIGQCCNEFFREDTLLKLNIGKKTEST 131
T + + N +KR + + E+ +DGW D+ E E + G K E
Sbjct: 12 TQVAWHINDMKRKHEHAVRLQEIQSLLDGWLGPDL-TTYGELVLEGSFRVQ--GAKNE-- 66
Query: 132 ERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPDSDDYKN 191
R+ FLFD +LL+ K KR K + +++ D
Sbjct: 67 -RLLFLFDKMLLITK---KRE-----------DDHLVYKTHIMCSNLMLVESIPKD--PL 109
Query: 192 AFQIVPRCGSPV--ILIANSPEDKNNWMADLIML 223
+FQ++P L A + E K W L L
Sbjct: 110 SFQVLPFDNPKAQYTLQAKNLEQKRLWTQHLKRL 143
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 39.0 bits (91), Expect = 0.010
Identities = 29/111 (26%), Positives = 40/111 (36%), Gaps = 13/111 (11%)
Query: 796 PGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLP-PRTNRRREVSTSDQSVHSRDISPPPLP 854
P SS S S L P L P+T +RR S+ S + P
Sbjct: 78 PALSSERLLSPSPSV---------LDLSPVLASPQTGKRRRSSS--PSDDEDEAERPSKR 126
Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
PR ++ S SSP + PSP + S + + P + S P P
Sbjct: 127 PRSDSISSSSSPAKPPEACLPSP-AASTQDELSEASAAPLPTPSLSPPHTP 176
Score = 39.0 bits (91), Expect = 0.012
Identities = 27/115 (23%), Positives = 39/115 (33%), Gaps = 7/115 (6%)
Query: 786 SLASSTITLRPGSSSGPEFSGEDSPGLMG-STFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
S +SST + +SS + S +P G S F PL P + + S
Sbjct: 284 SSSSSTFAVPALTSSSVDQSA--TPLDQGFSNFGSNMYSEPLNPTND---SLLYGLPSSS 338
Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSPDPSP-SSPGRFHVSSPSPSKPSGSTS 898
S + P T S + P PSP +S + P+ P S
Sbjct: 339 SLYANRTIFPAWASTSVSPLDFSTLFNQPSPSPMASQSILAPAQPTSPSPVALPS 393
Score = 32.8 bits (75), Expect = 0.89
Identities = 35/189 (18%), Positives = 52/189 (27%), Gaps = 29/189 (15%)
Query: 730 SPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLAS 789
SP + + L+P + + + SP + S S+ S +
Sbjct: 73 SPERSPALSSERLLSPSPSVLDLSPVL----------ASPQTGKRRRS--SSPSDDEDEA 120
Query: 790 STITLRPGSSSGPEFSGEDSPGLMG------STFDGLYSPPP-------LPPRTNRRREV 836
+ RP S S S P ST D L L P
Sbjct: 121 ERPSKRPRSDSISSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTA 180
Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGS 896
+ + D P P R +T SS P S P P ++ + S
Sbjct: 181 PSGKRKRRLSDGFQLPAPKRPQT----SSRPQTVSDPLPLHATTDWDTWFQATVSSSPSL 236
Query: 897 TSRLPLLPP 905
+ PP
Sbjct: 237 LLTGDIPPP 245
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A splice
variant, E1--E4, exists but neither the function of E4
or E1--E4 is known.
Length = 95
Score = 35.7 bits (83), Expect = 0.016
Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 1/39 (2%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRP 861
PP P + RRR S SD S + S PP P P
Sbjct: 30 PPWAPKKRCRRRLESDSD-SSSTTSPSSPPTPQPTTECP 67
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 38.8 bits (91), Expect = 0.017
Identities = 29/183 (15%), Positives = 46/183 (25%), Gaps = 35/183 (19%)
Query: 725 TGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSV 784
GP S P P A + +P + A + S
Sbjct: 598 EGPPAPASSGPPEEAARPAAPAAPA-------------APAAPAPAGAAAAPAEASAAPA 644
Query: 785 QSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
+A+ + + G+ P G P P +
Sbjct: 645 PGVAAPEHHPKHVAVPDASDGGDGWPAKAG-------GAAPAAP--------PPAPAPAA 689
Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLP 904
+ P + P+ + P G A P P + S PS + LP
Sbjct: 690 PAAPAGAA-PAQPAPAPAATPPAGQADDPAAQPPQAAQ------GASAPSPAADDPVPLP 742
Query: 905 PKP 907
P+P
Sbjct: 743 PEP 745
Score = 32.3 bits (74), Expect = 1.6
Identities = 22/166 (13%), Positives = 38/166 (22%), Gaps = 19/166 (11%)
Query: 733 SESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTI 792
+ P A + ++ G P A + A +
Sbjct: 634 AAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAP 693
Query: 793 TLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPP 852
+ P + G + PP + + PP
Sbjct: 694 AGAAPAQPAPAPAATPPAGQADDP----AAQPPQAAQGASAP------SPAADDPVPLPP 743
Query: 853 LPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTS 898
P PP A +P P P ++P+ + P S
Sbjct: 744 EPD---------DPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 38.0 bits (88), Expect = 0.019
Identities = 38/222 (17%), Positives = 62/222 (27%), Gaps = 36/222 (16%)
Query: 681 PRYAKLRKDIPRRWPNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQ 740
+P P L + A+ R T + + P +
Sbjct: 181 SADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST 240
Query: 741 TPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTA-----FSMFSTQSVQ-SLASSTITL 794
T P + P + G P + + A + + TQ +Q ++ +S I
Sbjct: 241 TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASRYELTVTQIIQIAIPASIIAC 300
Query: 795 -------------------------RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPR 829
RP S P S + + D L PP PR
Sbjct: 301 VFLGSCACCLHRRCRRRRRRPARIYRPPSPVAPSISAVNEAA-LARLGDELKRHPPESPR 359
Query: 830 TNRRREVSTSDQSVH--SRDISPPPLP--PRRETRPSLSSPP 867
++RR T S+ S + P + R RP +
Sbjct: 360 RSKRRSSQTMVPSLTAISEESEAPAVVELSRSPRRPGGPTAR 401
Score = 31.1 bits (70), Expect = 2.8
Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 5/57 (8%)
Query: 851 PPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
P PR P + P +SP PS + P S + + P T+ P P
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSP-PSTTIPA----PSTTIAAPQAGTTPEAEGTPAP 259
Score = 31.1 bits (70), Expect = 3.5
Identities = 18/79 (22%), Positives = 28/79 (35%), Gaps = 3/79 (3%)
Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
P PRT E + + + S + P P P + P +P P G
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGE- 266
Query: 884 HVSSPSPSKPSGSTSRLPL 902
+ P+ + P+ SR L
Sbjct: 267 --APPANATPAPEASRYEL 283
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 37.3 bits (86), Expect = 0.038
Identities = 36/160 (22%), Positives = 55/160 (34%), Gaps = 18/160 (11%)
Query: 781 TQSVQSLASSTITLRPGSSSGPEFSGEDSPG----LMGSTFDGLYSPPPLPPRTNRRREV 836
TQ Q+ A++ L G + + S + G P PP T + +
Sbjct: 201 TQGHQT-ATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQS 259
Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSSPP-------GYASSPDPS---PSSPGRFHVS 886
+T D H++ PP R + S ++PP +P P+ S H S
Sbjct: 260 TTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSS 319
Query: 887 SPSPSK-PSGSTSRL--PLLPPKPFSFGKQVPMFQFNAPG 923
P P+ L PPKP S V ++ P
Sbjct: 320 PPGVQANPTTQNLVDCKELDPPKPNSICYGVGIYNEALPR 359
>gnl|CDD|241308 cd10572, PH_RhoGEF3_XPLN, Rho guanine nucleotide exchange factor 3
Pleckstrin homology (PH) domain. RhoGEF3/XPLN, a Rho
family GEF, preferentially stimulates guanine nucleotide
exchange on RhoA and RhoB, but not RhoC, RhoG, Rac1, or
Cdc42 in vitro. It also possesses transforming activity.
RhoGEF3/XPLN contains a tandem Dbl homology and PH
domain, but lacks homology with other known functional
domains or motifs. It is expressed in the brain,
skeletal muscle, heart, kidney, platelets, and
macrophage and neuronal cell lines. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 143
Score = 35.4 bits (82), Expect = 0.040
Identities = 30/133 (22%), Positives = 51/133 (38%), Gaps = 30/133 (22%)
Query: 104 VDIGQCCNEFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGN 163
+D Q LL K T+ FLF+ +L+L +P ++
Sbjct: 15 LDEKQKDPLIDESRALLCHGELKNKRGTKLHVFLFEDVLVLTRPVTR-----------NE 63
Query: 164 QGEYRL-KERFFIRRIEILDLPD---------------SDDYKNAFQIVPRCGSP---VI 204
Q Y++ ++ ++ + + DLPD +D KN F++ + GS
Sbjct: 64 QLCYQVYRQPIPVQDLVLEDLPDGEVRLGGSFRGAFSNNDKAKNFFRVSFKDGSKGQSHT 123
Query: 205 LIANSPEDKNNWM 217
L AN DK W+
Sbjct: 124 LQANDEFDKQQWL 136
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 36.9 bits (85), Expect = 0.049
Identities = 25/98 (25%), Positives = 33/98 (33%), Gaps = 20/98 (20%)
Query: 823 PPPLPPRTNRRRE------VSTSDQSV-HSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
P P P NR V D V +R +P P PR +T PDP
Sbjct: 201 PQPGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAPSP--PRTDT-----------LDPDP 247
Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQ 913
+ + P R +PSP S +P +G
Sbjct: 248 AIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTN 285
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 34.3 bits (79), Expect = 0.061
Identities = 34/118 (28%), Positives = 43/118 (36%), Gaps = 24/118 (20%)
Query: 790 STITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDIS 849
ST+ R S+SG FS SP + + P P R V T
Sbjct: 13 STLYTRTWSTSG--FSSCFSP------PESAHPDDPPPVGDPRPPVVDT----------- 53
Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
PPP+ P S P ++P P P P PSP P + PL PP+P
Sbjct: 54 PPPVSAVW-VLPPPSEP----AAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106
>gnl|CDD|241271 cd01238, PH_Btk, Bruton's tyrosine kinase pleckstrin homology (PH)
domain. Btk is a member of the Tec family of
cytoplasmic protein tyrosine kinases that includes BMX,
IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a
role in the maturation of B cells. Tec proteins general
have an N-terminal PH domain, followed by a Tek homology
(TH) domain, a SH3 domain, a SH2 domain and a kinase
domain. The Btk PH domain binds phosphatidylinositol
3,4,5-trisphosphate and responds to signalling via
phosphatidylinositol 3-kinase. The PH domain is also
involved in membrane anchoring which is confirmed by the
discovery of a mutation of a critical arginine residue
in the BTK PH domain. This results in severe human
immunodeficiency known as X-linked agammaglobulinemia
(XLA) in humans and a related disorder is mice.PH
domains have diverse functions, but in general are
involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 140
Score = 34.9 bits (81), Expect = 0.063
Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 175 IRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
I+ +E + S + K FQ+V + + A S ED++ W+ L
Sbjct: 54 IKCVEEVKDDASFERKYPFQVV-YDDYTLYVFAPSEEDRDEWILAL 98
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 36.2 bits (84), Expect = 0.068
Identities = 19/115 (16%), Positives = 28/115 (24%), Gaps = 7/115 (6%)
Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
+S P + + P P S P
Sbjct: 166 TNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPE------DPSSPSDSSLPPAPSSF 219
Query: 855 PRRETRPSLSSPPGYASSPDP-SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPF 908
PS SP + P P +P P V S +KP+ ++ P
Sbjct: 220 QSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGI 274
Score = 32.3 bits (74), Expect = 1.00
Identities = 21/104 (20%), Positives = 29/104 (27%), Gaps = 1/104 (0%)
Query: 806 GEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSS 865
E+ +T + S P S PP P + SL
Sbjct: 155 DEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP 214
Query: 866 PPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFS 909
P S P P SP SP P + + +PP +
Sbjct: 215 APSSFQSDTP-PPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTA 257
Score = 30.8 bits (70), Expect = 3.1
Identities = 22/87 (25%), Positives = 26/87 (29%), Gaps = 11/87 (12%)
Query: 850 PPPLPPRRE------TRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGST-----S 898
P P PP E + S DP+ +SP SSP
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207
Query: 899 RLPLLPPKPFSFGKQVPMFQFNAPGND 925
LPP P SF P +P N
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNP 234
Score = 30.8 bits (70), Expect = 3.4
Identities = 20/88 (22%), Positives = 24/88 (27%), Gaps = 7/88 (7%)
Query: 835 EVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPS 894
S S S P P P SS P+PSS P +
Sbjct: 179 PASASPSDPPSSSPGVPSFPS-----PPEDPSSPSDSSLPPAPSSFQSDT--PPPSPESP 231
Query: 895 GSTSRLPLLPPKPFSFGKQVPMFQFNAP 922
+ S P P +QVP P
Sbjct: 232 TNPSPPPGPAAPPPPPVQQVPPLSTAKP 259
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 36.7 bits (85), Expect = 0.076
Identities = 36/197 (18%), Positives = 53/197 (26%), Gaps = 34/197 (17%)
Query: 732 LSESDGTPQTPLTPP----TANSEFTVFAPIMLSNANGSQSPGSVSTAFSM--------- 778
+ DG + L P N V P L++ + V A +
Sbjct: 561 TDDGDGNVREHLPPTREFAEGNVTVRVTRPTGLASGASNAEDPEVRRARTRGARALAQAR 620
Query: 779 -FSTQSVQSL-------ASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRT 830
F +V + T RP + G E G PG + + P R
Sbjct: 621 TFGRATVGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARG 680
Query: 831 NRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSP 890
+ R + H+ P PR G PD ++ SS
Sbjct: 681 DPVRPTA-----HHAA--LRAPQAPRPG--GPPGGGGGLPPPPDLPAAAGPAPCGSSLIA 731
Query: 891 SKPSGSTSRLPLLPPKP 907
S T+ PP
Sbjct: 732 S----PTAPPEPEPPGA 744
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 36.0 bits (83), Expect = 0.081
Identities = 31/110 (28%), Positives = 40/110 (36%), Gaps = 13/110 (11%)
Query: 794 LRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNR----RREVSTSDQSV--HSRD 847
L G S G F + S D L P R NR R S S+ ++ +++
Sbjct: 32 LPSGLSRGNSFLSNRNSDAAPSGTDSLSGRPA--SRLNREPSSRPGSSHSEATIVRQAKE 89
Query: 848 ISPPPLPPRRET-----RPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSK 892
P PP RPSL S P S+ + R PSPSK
Sbjct: 90 GERPATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPSDLPPSPSK 139
Score = 32.5 bits (74), Expect = 1.1
Identities = 34/144 (23%), Positives = 45/144 (31%), Gaps = 12/144 (8%)
Query: 768 SPGSVSTAFSMFST---QSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPP 824
S V+ M + +S + S I P S E + + P S+ D S
Sbjct: 197 SFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRD-SEKTKPEKPQQETSSMDTEKSSA 255
Query: 825 PLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFH 884
P P T + + I + S A PS SP
Sbjct: 256 PKPRETLDPKSPEKAPP------IDTTEEELKSPEASPKESEEASARKRSPSLLSPSP-K 308
Query: 885 VSSPSPSKPSGSTSRLPLLP-PKP 907
SP P G + R PL P PKP
Sbjct: 309 AESPKPLASPGKSPRDPLSPRPKP 332
>gnl|CDD|220233 pfam09422, WTX, WTX protein. The WTX protein is found to be
inactivated in one third of Wilms tumours. The WTX
protein is functionally uncharacterized.
Length = 467
Score = 35.3 bits (81), Expect = 0.15
Identities = 16/58 (27%), Positives = 21/58 (36%), Gaps = 3/58 (5%)
Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
P R E P + G S P + +PG V P+KP +S P P
Sbjct: 128 ETPRAAREENAPPQDADGGKVS-PASALEAPGTSCVECGDPAKPGSESSA--FEDPGP 182
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 35.1 bits (81), Expect = 0.17
Identities = 23/96 (23%), Positives = 30/96 (31%), Gaps = 25/96 (26%)
Query: 808 DSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPP 867
SPG++G P PL P T +R P P R + P P
Sbjct: 236 HSPGILG--------PSPLHPHT--------------TRPSPPRPAFSRSPSSPLSPLPR 273
Query: 868 GYASS---PDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
P+P R H+ P+ S P L
Sbjct: 274 PSTRRGLLPNPRLPRASRGHLPPPTSSAPPRPNGGL 309
Score = 33.1 bits (76), Expect = 0.74
Identities = 24/92 (26%), Positives = 30/92 (32%), Gaps = 4/92 (4%)
Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
P+PP + ++ V P P PP +S + P PS R
Sbjct: 370 LPVPPPKVQALPLTALAPLVRHSPSIPLPHPPSALPSHVGASSSKHHRLP-PSVLPGPRL 428
Query: 884 HVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
SPSPS P T R PP P
Sbjct: 429 SSPSPSPSLP---TRRPGTPPPPASPPTPSPP 457
>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
Length = 445
Score = 35.1 bits (81), Expect = 0.19
Identities = 29/112 (25%), Positives = 34/112 (30%), Gaps = 18/112 (16%)
Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
GSSS +S +DSP L PP LPP+ RR VS ++
Sbjct: 3 GRGSSSDSPYSSDDSPSL---------EPPELPPKKKGRRRVSPVEEEE--------EEE 45
Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGS-TSRLPLLPP 905
S PP S S K S R PL P
Sbjct: 46 EAEVVAVGFSYPPVRISRGKDGKRPVRPLKEEKDSEKKASTEAAVRNPLSDP 97
Score = 30.1 bits (68), Expect = 6.2
Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 1/79 (1%)
Query: 832 RRREVSTSD-QSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSP 890
R S S S S + PP LPP+++ R +S + + G +
Sbjct: 3 GRGSSSDSPYSSDDSPSLEPPELPPKKKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRIS 62
Query: 891 SKPSGSTSRLPLLPPKPFS 909
G PL K
Sbjct: 63 RGKDGKRPVRPLKEEKDSE 81
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 35.1 bits (80), Expect = 0.19
Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 2/97 (2%)
Query: 804 FSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSL 863
F+G +PG ST G + ++ V S + + PP P + PS
Sbjct: 562 FAG--NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST 619
Query: 864 SSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
S P + SP +PSSP + + + S+ ++
Sbjct: 620 SPPASHLGSPSTTPSSPESSIKVASTETASPESSIKV 656
Score = 30.4 bits (68), Expect = 5.5
Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 5/55 (9%)
Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFS 909
P PS S P G+ SP +PS + SPS S P+ P P S
Sbjct: 589 PSTVVAPSTSPPAGHLGSPPATPSK-----IVSPSTSPPASHLGSPSTTPSSPES 638
Score = 30.4 bits (68), Expect = 6.2
Identities = 24/112 (21%), Positives = 35/112 (31%), Gaps = 12/112 (10%)
Query: 788 ASSTIT--LRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHS 845
+ST T +++ S S G PP P + + S +
Sbjct: 569 TNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSK-----IVSPSTSPPA 623
Query: 846 RDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGST 897
+ P P S S AS+ SP S + + S S S ST
Sbjct: 624 SHLGSPSTTP-----SSPESSIKVASTETASPESSIKVASTESSVSMVSMST 670
>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
Length = 496
Score = 34.9 bits (80), Expect = 0.23
Identities = 23/97 (23%), Positives = 32/97 (32%), Gaps = 5/97 (5%)
Query: 824 PPLPP--RTNRRREVSTSDQSVHSRDI---SPPPLPPRRETRPSLSSPPGYASSPDPSPS 878
P L + RRE + S +R P + +RP S G A P PS
Sbjct: 8 PHLHHSYSSLDRREKTCLRSSQKTRRFPKPKASLHPSIKRSRPGRCSTNGAAVPESPKPS 67
Query: 879 SPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
GR + SP K + R + G +
Sbjct: 68 RRGRKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEED 104
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 34.8 bits (79), Expect = 0.25
Identities = 54/203 (26%), Positives = 82/203 (40%), Gaps = 27/203 (13%)
Query: 712 ARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGS---QS 768
+R+ P+L+ + + P+ +P+ S + P+AN +AP S A+ S +
Sbjct: 1836 SRSATPSLSPSP-SPPRRAPVDRSRSGRRRERDRPSANP--FRWAPRQRSRADHSPDGTA 1892
Query: 769 PGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPP---- 824
PG + S+ T P S SGPE S +++ T D +PP
Sbjct: 1893 PGDAPLNLEDGPGRGRPIWTPSSATTLP-SRSGPEDSVDET-----ETEDS--APPARLA 1944
Query: 825 PLPPRTNRRREVSTSDQSVHSR---DISPPPLPPRRETRPSLSSP--PGYASSPDPSPSS 879
P P T+R + S+ +S SPP L R RPS P P + S+
Sbjct: 1945 PSPLETSRAEDSEDSEYPEYSNPRLGKSPPALKSREARRPSSKQPRRPSSGKNGHTDVSA 2004
Query: 880 PGRFHVSSPSPSKPSGSTSRLPL 902
F + P+P GS R PL
Sbjct: 2005 ASAFFLGRPAP----GSCGRRPL 2023
Score = 31.4 bits (70), Expect = 3.1
Identities = 21/54 (38%), Positives = 24/54 (44%), Gaps = 1/54 (1%)
Query: 827 PPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSP 880
P R +R S+S S S D SP RR+ RPS P S P PS P
Sbjct: 1557 PQRPGGKRPRSSSSSSSASHDRSPSSSSRRRDGRPSSRRRPSRRMSARP-PSRP 1609
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 34.5 bits (79), Expect = 0.31
Identities = 26/91 (28%), Positives = 37/91 (40%), Gaps = 18/91 (19%)
Query: 824 PPLPPRTNRR--REVSTSDQSVHSRDISPPPLPPRR---ETRPSLSSP------------ 866
P +PPR RR R VS + S ++ PP P+R T PSL
Sbjct: 1 PKVPPRPRRRGDRSVSPNPDSFAPSPLNEPPDVPQRPPSVTLPSLGEEGAEYEALEEAEL 60
Query: 867 -PGYASSPDPSPSSPGRFHVSSPSPSKPSGS 896
+ S+P + + + +P PS PS S
Sbjct: 61 SDSHHSTPAQTRNVGEDLKLHAPKPSLPSSS 91
>gnl|CDD|241398 cd13244, PH_PLEKHG5_G6, Pleckstrin homology domain-containing
family G member 5 and 6 pleckstrin homology (PH) domain.
PLEKHG5 has a RhoGEF DH/double-homology domain in
tandem with a PH domain which is involved in
phospholipid binding. PLEKHG5 activates the nuclear
factor kappa B (NFKB1) signaling pathway. Mutations in
PLEKHG5 are associated with autosomal recessive distal
spinal muscular atrophy. PLEKHG6 (also called MyoGEF)
has no known function to date. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 154
Score = 33.0 bits (76), Expect = 0.31
Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 3/39 (7%)
Query: 113 FFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKR 151
D LK K S + CFLF +LL+CKP K+
Sbjct: 49 LLEGDLKLK---EGKGSSVDVHCFLFTDMLLICKPVKKK 84
>gnl|CDD|241479 cd13325, PH_unc89, unc89 pleckstrin homology (PH) domain. unc89 is
a myofibrillar protein. unc89-B the largest isoform is
composed of 53 immunoglobulin (Ig) domains, 2 Fn3
domains, a triplet of SH3, DH and PH domains at its
N-terminus, and 2 protein kinase domains (PK1 and PK2)
at its C-terminus. unc-89 mutants display
disorganization of muscle A-bands, and usually lack
M-lines. The COOH-terminal region of obscurin, the human
homolog of unc89, interacts via two specific Ig-like
domains with the NH(2)-terminal Z-disk region of titin,
a protein that connects the Z line to the M line in the
sarcomere and contributes to the contraction of striated
muscle. obscurin is also thought to be involved in
Ca2+/calmodulin via its IQ domains, as well as G
protein-coupled signal transduction in the sarcomere via
its RhoGEF/DH domain. The DH-PH region of OBSCN and
unc89, the C. elegans homolog, has exchange activity for
RhoA and Rho-1 respectively, but not for the small
GTPases homologous to Cdc42 or Rac. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 113
Score = 31.9 bits (73), Expect = 0.42
Identities = 21/98 (21%), Positives = 44/98 (44%), Gaps = 16/98 (16%)
Query: 126 KKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPD 185
+ ++ ER FLF +L+ K +R S ++ + LK+ + + + PD
Sbjct: 20 GEGKAKERYLFLFKSRILITK--VRRIS--------EDRSVFILKDIIRLPEVNVKQHPD 69
Query: 186 SDDYKNAFQIVPRC---GSPVILIANSPEDKNNWMADL 220
+ F++ P+ G P+ A+ E K+ W+ ++
Sbjct: 70 D---ERTFELQPKPAFKGYPIDFKAHKDEIKDAWLNEI 104
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 33.7 bits (77), Expect = 0.42
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 1/71 (1%)
Query: 846 RDISPPPLPPRRETRPSLSSPPGYASSPDPS-PSSPGRFHVSSPSPSKPSGSTSRLPLLP 904
R S P PPRR + P + P +SP+ + PS PG P + + R P
Sbjct: 3 RARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGC 62
Query: 905 PKPFSFGKQVP 915
P +F P
Sbjct: 63 PAGVTFSSSAP 73
Score = 30.6 bits (69), Expect = 3.4
Identities = 23/87 (26%), Positives = 30/87 (34%), Gaps = 6/87 (6%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSS-PG 881
PP P R + D S SPP P E P ++ P ++P P P
Sbjct: 10 SPPRRPSPPRPTPPRSPDASPEETPPSPP--GPGAEPPPGRAAGP---AAPRRRPRGCPA 64
Query: 882 RFHVSSPSPSKPSGSTSRLPLLPPKPF 908
SS +P +P P P P
Sbjct: 65 GVTFSSSAPPRPPLGLDDAPAATPPPL 91
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an exchange
factor for Cdc42 and an effector of Ras1, the homolog of
the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
exchange factor for Cdc42, a member of the Rho family of
Ras-like proteins. Cdc42 then activates the Shk1/Orb2
protein kinase. Scd1 interacts with Klp5 and Klp6
kinesins to mediate cytokinesis. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 32.6 bits (75), Expect = 0.45
Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 11/67 (16%)
Query: 121 KLNIGKKTESTER--VCFLFDGLLLLCK------PNSKRTSVSVTAPLGGNQ-GEYRLKE 171
+ K + +ER +LF+ +LL CK S S S ++ + G +LK
Sbjct: 36 TFTVRK--DDSEREYHVYLFERILLCCKEVKKKKKKSSLKSKSSSSSKKKKKKGPLQLKG 93
Query: 172 RFFIRRI 178
R FI I
Sbjct: 94 RIFISNI 100
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 33.4 bits (76), Expect = 0.47
Identities = 33/164 (20%), Positives = 48/164 (29%), Gaps = 31/164 (18%)
Query: 766 SQSPGSVSTAF---SMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYS 822
SQ P V +A ++ + Q L S +FS P T LY
Sbjct: 89 SQPPQPVVSAHPIHTLDNPQYPGILPSPRC-----GYLHHQFSLRPMP-FSTLTVQRLYQ 142
Query: 823 PPPLPPRTNRRREV---------STSDQSVHSRDISPP-PLPPRRETRP----SLSSPPG 868
R+ + + S P R T P ++ + P
Sbjct: 143 HGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPA 202
Query: 869 YASSPDPSPSSPGRFHVSSPSPSKPSGST--------SRLPLLP 904
S+ +P S P+ K S T +R PLLP
Sbjct: 203 SMSTIEPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLP 246
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 33.4 bits (76), Expect = 0.48
Identities = 37/150 (24%), Positives = 48/150 (32%), Gaps = 18/150 (12%)
Query: 768 SPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLP 827
S S ST+ S +S S+ L +SS P + SPP
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPA------------SPPSSS 51
Query: 828 PR--TNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHV 885
P T+ S Q S R PS SP AS S ++
Sbjct: 52 PARNTSSSSSFGLSKQRPSSL---SRGRLSSRFVSPSRGSPSAAASLN-GSLATASTSGS 107
Query: 886 SSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
SSPS S+ + S+ P SF V
Sbjct: 108 SSPSRSRRTTSSDLSSGNGPSVLSFMADVK 137
Score = 30.3 bits (68), Expect = 5.6
Identities = 18/82 (21%), Positives = 25/82 (30%), Gaps = 4/82 (4%)
Query: 834 REVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKP 893
R S S S SP RR + LS+ +P+SP SS
Sbjct: 1 RAASVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASP----PSSSPARNT 56
Query: 894 SGSTSRLPLLPPKPFSFGKQVP 915
S S+S ++
Sbjct: 57 SSSSSFGLSKQRPSSLSRGRLS 78
Score = 29.5 bits (66), Expect = 9.0
Identities = 28/113 (24%), Positives = 33/113 (29%), Gaps = 14/113 (12%)
Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
R S S SG+ S S + R + + SPP
Sbjct: 1 RAASVSSGSTSGDASSPRSSSRR---RLSSSFLSTSASSRPRRLNAPA------SPPSSS 51
Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
P R T S SS PSS R +SS S GS S L
Sbjct: 52 PARNTS-SSSS----FGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSL 99
>gnl|CDD|144068 pfam00336, DNA_pol_viral_C, DNA polymerase (viral) C-terminal
domain.
Length = 245
Score = 33.0 bits (75), Expect = 0.58
Identities = 16/37 (43%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Query: 852 PLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSP 888
PLP R RPS YA+SP P P R +SP
Sbjct: 200 PLP-RLRFRPSTGRTSLYAASPPVPPHRPVRVAFASP 235
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 33.4 bits (76), Expect = 0.62
Identities = 34/169 (20%), Positives = 50/169 (29%), Gaps = 22/169 (13%)
Query: 728 KNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSL 787
S ++ Q PP + + A L+N N Q S + Q L
Sbjct: 484 AASTYGQTSRERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDESTNPIGKTSLR--YQEL 541
Query: 788 ASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRD 847
ED P LPP + S ++Q V +
Sbjct: 542 --------------TPVQEEDEP-----EDQTDDDDSSLPPLESDDDPGSDNEQGVDLTE 582
Query: 848 ISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGS 896
++PP P R+ + P + DP+ S S SKPS
Sbjct: 583 VAPPA-PVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSDILRSNSKPSAP 630
>gnl|CDD|241423 cd13269, PH_alsin, Alsin Pleckstrin homology (PH) domain. The ALS2
gene encodes alsin, a GEF, that has dual specificity for
Rac1 and Rab5 GTPases. Alsin mutations in the form of
truncated proteins are responsible for motor function
disorders including juvenile-onset amyotrophic lateral
sclerosis, familial juvenile primary lateral sclerosis,
and infantile-onset ascending hereditary spastic
paralysis. The alsin protein is widely expressed in the
developing CNS including neurons of the cerebral cortex,
brain stem, spinal cord, and cerebellum. Alsin contains
a regulator of chromosome condensation 1 (RCC1) domain,
a Rho guanine nucleotide exchanging factor (RhoGEF)
domain, a PH domain, a Membrane Occupation and
Recognition Nexus (MORN), a vacuolar protein sorting 9
(Vps9) domain, and a Dbl homology (DH) domain. Alsin
interacts with Rab5 through its Vps9 domain and through
this interaction modulates early endosome fusion and
trafficking. The GEF activity of alsin towards Rab5 is
regulated by Rac1 function. The GEF activity of alsin
for Rac1 occurs via its DH domain and this interaction
plays a role in promoting spinal motor neuron survival
via multiple Rac-dependent signaling pathways. PH
domains have diverse functions, but in general are
involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 106
Score = 31.2 bits (71), Expect = 0.65
Identities = 12/36 (33%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 182 DLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWM 217
+PD D +NA +I S L+A++P++K W+
Sbjct: 59 PIPDEDSGQNALKITTPEES-FTLVASTPQEKAEWL 93
>gnl|CDD|241401 cd13247, BAR-PH_APPL, Adaptor protein containing PH domain, PTB
domain, and Leucine zipper motif Bin1/amphiphysin/Rvs167
(BAR)-Pleckstrin homology (PH) domain. APPL (also
called DCC-interacting protein (DIP)-13alpha) interacts
with oncoprotein serine/threonine kinase AKT2, tumor
suppressor protein DCC (deleted in colorectal cancer),
Rab5, GIPC (GAIP-interacting protein, C terminus), human
follicle-stimulating hormone receptor (FSHR), and the
adiponectin receptors AdipoR1 and AdipoR2. There are two
isoforms of human APPL: APPL1 and APPL2, which share
about 50% sequence identity. APPL has a BAR and a PH
domain near its N terminus, and the two domains are
thought to function as a unit (BAR-PH domain).
C-terminal to this is a PTB domain. Lipid binding assays
show that the BAR, PH, and PTB domains can bind
phospholipids. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 125
Score = 31.6 bits (72), Expect = 0.69
Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 185 DSDDYKNAFQIV-PRCGSPVILIANSPEDKNNWMA 218
D +D + FQI P ++L A S +D+ W+A
Sbjct: 83 DCEDRRFVFQITSPDGKKAIVLQAESEKDREEWIA 117
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 0.78
Identities = 29/103 (28%), Positives = 45/103 (43%), Gaps = 4/103 (3%)
Query: 483 NLLKIIKHTTNFTRWLEKTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALG 542
NL ++IK LEK I TEN+EE I + E+ VL ++N + L +
Sbjct: 166 NLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEK--ELEEVLREINEISSELPELREEL 223
Query: 543 SASVYRLKCTFQALPARLENALVELRELGNDHFRKYQERLRSI 585
+K + L +E EL L RK +E++R +
Sbjct: 224 EKLEKEVK-ELEELKEEIEELEKELESL-EGSKRKLEEKIREL 264
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 33.2 bits (76), Expect = 0.80
Identities = 28/131 (21%), Positives = 36/131 (27%), Gaps = 18/131 (13%)
Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH---------S 845
+PG + + S E G FD L P + S D ++
Sbjct: 8 QPGRWN-YDSSDESPEGSRDENFDA-ERDDFLTPLGSTSEATSEDDDDLYPPRETGSGGG 65
Query: 846 RDISPPPLPPRRETRPSL------SSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSR 899
S PR P S P P P+P PG F +S STS
Sbjct: 66 VATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGAD-STSP 124
Query: 900 LPLLPPKPFSF 910
L
Sbjct: 125 RFLYQVNFPVI 135
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 33.1 bits (75), Expect = 0.89
Identities = 32/127 (25%), Positives = 44/127 (34%), Gaps = 19/127 (14%)
Query: 796 PGSSSGPEFSGEDSPGLM--------GSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRD 847
PG G E EDS G T +G P P + ++ ++ T S+
Sbjct: 524 PGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPT-----LSKK 578
Query: 848 ISPP--PLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSS--PSPSKPSGSTS-RLPL 902
P P P+ P P A P P SP + SP +P S + P
Sbjct: 579 PEFPKDPKHPKDPEEPKKPKRPRSAQRP-TRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
Query: 903 LPPKPFS 909
P +P S
Sbjct: 638 PPQRPSS 644
Score = 30.4 bits (68), Expect = 6.5
Identities = 44/250 (17%), Positives = 65/250 (26%), Gaps = 43/250 (17%)
Query: 687 RKDIPRRWPNLSLKSPGIKASKN----YMARNNLPALTVNKITGPKNSPLSESDGTPQTP 742
++ P + P LS K K K+ + + + T PK+ L E P++P
Sbjct: 565 KEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSP 624
Query: 743 LTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQ----------------SVQS 786
P + S P S+ + P + + S + +
Sbjct: 625 KRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAA 684
Query: 787 LASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSR 846
+ T T S E P G+ F PP PR D
Sbjct: 685 KSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQP 744
Query: 847 DISPPPLPPRRET---------------------RPSLSSPPGYASSPDPSPSSPGRFHV 885
D PP E + + G P SP
Sbjct: 745 DDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHED 804
Query: 886 SSPS--PSKP 893
P PS P
Sbjct: 805 KPPGDHPSLP 814
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 32.9 bits (75), Expect = 0.99
Identities = 29/126 (23%), Positives = 45/126 (35%), Gaps = 22/126 (17%)
Query: 802 PEFSGEDSPGLMGSTFDGLYSP----PP------LPPRTNRR--REVSTSDQSVHSRDIS 849
P E++P L+ S Y P PP LP R T+ +
Sbjct: 282 PGLGFEEAPELLWSFLGQFYGPQSTIPPRIVVPWLPDTEGREGDDLAPTAVCTDAGLLPD 341
Query: 850 PPPLPPRRE-----TRPSLSSPPGYASSP----DPSPSSPGRFHVSSPSPSKPSGSTSRL 900
P LP E P ++ P AS P +P+S + P+ ++P+ + + L
Sbjct: 342 TPLLPDAPEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAAT-L 400
Query: 901 PLLPPK 906
PK
Sbjct: 401 AGAAPK 406
>gnl|CDD|241419 cd13265, PH_evt, Evectin Pleckstrin homology (PH) domain. There
are 2 members of the evectin family (also called
pleckstrin homology domain containing, family B): evt-1
(also called PLEKHB1) and evt-2 (also called PLEKHB2).
evt-1 is specific to the nervous system, where it is
expressed in photoreceptors and myelinating glia. evt-2
is widely expressed in both neural and nonneural
tissues. Evectins possess a single N-terminal PH domain
and a C-terminal hydrophobic region. evt-1 is thought to
function as a mediator of post-Golgi trafficking in
cells that produce large membrane-rich organelles. It is
a candidate gene for the inherited human retinopathy
autosomal dominant familial exudative vitreoretinopathy
and a susceptibility gene for multiple sclerosis. evt-2
is essential for retrograde endosomal membrane transport
from the plasma membrane (PM) to the Golgi. Two membrane
trafficking pathways pass through recycling endosomes: a
recycling pathway and a retrograde pathway that links
the PM to the Golgi/ER. Its PH domain that is unique in
that it specifically recognizes phosphatidylserine (PS),
but not polyphosphoinositides. PS is an anionic
phospholipid class in eukaryotic biomembranes, is highly
enriched in the PM, and plays key roles in various
physiological processes such as the coagulation cascade,
recruitment and activation of signaling molecules, and
clearance of apoptotic cells. PH domains are only found
in eukaryotes. They share little sequence conservation,
but all have a common fold, which is electrostatically
polarized. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 108
Score = 30.7 bits (70), Expect = 0.99
Identities = 13/40 (32%), Positives = 17/40 (42%)
Query: 181 LDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
+ P+ QIV R G +IL A S +D W L
Sbjct: 65 VQPPEGRSRDCLLQIVTRDGKRLILCAESADDALAWKLAL 104
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 33.0 bits (75), Expect = 1.0
Identities = 37/249 (14%), Positives = 60/249 (24%), Gaps = 53/249 (21%)
Query: 714 NNLPALTVNKITGPKNSPLSESDGTPQTPLTP--------PTANSEFTVFAPIMLSN--- 762
N+ + T + PL+ +D T P + + + P + N
Sbjct: 378 NSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAVNDER 437
Query: 763 ANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGS------------------SSGPEF 804
+ S G + S P ++ PE
Sbjct: 438 SIAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLDDAKRPEVTATPES 497
Query: 805 SGEDSPG---------------------LMGSTFDGLYSPPPLPPRTNRRREVSTSDQSV 843
SG DS G L SPPP P +RR + +
Sbjct: 498 SGSDSEGGASGREDETSSDAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLAS 557
Query: 844 HSRDISPPPLPPRRE---TRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
S SP P P + + + + + V + T
Sbjct: 558 PSTAGSPRPKPSLGKFVIGTDPFAFANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPG 617
Query: 901 PLLPPKPFS 909
L P +
Sbjct: 618 SDLKPATLN 626
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.7 bits (74), Expect = 1.1
Identities = 33/142 (23%), Positives = 43/142 (30%), Gaps = 20/142 (14%)
Query: 782 QSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQ 841
Q +Q +I + PG++ P PP P + Q
Sbjct: 171 QLLQPQGPPSIQVPPGAALAPSAPPPTPSAQA--------VPPQGSPIAAQ--PAPQPQQ 220
Query: 842 SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSS----PSPSKPSGST 897
IS P L P+R P P AS P P +P H S P P P
Sbjct: 221 PSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQ 280
Query: 898 SRLPLL------PPKPFSFGKQ 913
L PP+PF +
Sbjct: 281 QGPVFLQHPSSNPPQPFGLAQS 302
Score = 30.4 bits (68), Expect = 5.7
Identities = 41/191 (21%), Positives = 52/191 (27%), Gaps = 25/191 (13%)
Query: 727 PKNSPLSESDGTPQTPLTPPTA-NSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQ 785
P P + S +PQ P S P M G V +
Sbjct: 242 PPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHAL----QQGPVFLQHPSSNPPQPF 297
Query: 786 SLASSTITLRPG-SSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
LA S + P S + P S + P PPR +
Sbjct: 298 GLAQSQVPPLPLPSQAQPHSHTPPSQSALQ---------PQQPPREQPLPPAPSMPH--- 345
Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKP--SGSTSRLPL 902
I PPP P + P + P P P P ++ P KP S T P
Sbjct: 346 ---IKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPS--NLPPPPALKPLSSLPTHHPPS 400
Query: 903 LPPKPFSFGKQ 913
P P Q
Sbjct: 401 AHPPPLQLMPQ 411
Score = 29.7 bits (66), Expect = 9.1
Identities = 33/193 (17%), Positives = 46/193 (23%), Gaps = 36/193 (18%)
Query: 732 LSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASST 791
L + G P + P A + SP + A Q S
Sbjct: 172 LLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ--PQQPSPLSLISA 229
Query: 792 ITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPP 851
+L P P PPL P+T ++ P
Sbjct: 230 PSLHPQRLPSP--------------------HPPLQPQTASQQSPQ-------------P 256
Query: 852 PLPPRRETRPSLSSP-PGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSF 910
P P R + S P P + P + P P + S LP +
Sbjct: 257 PAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPH 316
Query: 911 GKQVPMFQFNAPG 923
P P
Sbjct: 317 SHTPPSQSALQPQ 329
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 30.4 bits (69), Expect = 1.2
Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 7/58 (12%)
Query: 848 ISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
+ PLP ++PP Y P P+ PG + + P P+ + + P PP
Sbjct: 54 VHLEPLPAY----GQYAAPPPYGPPPPYYPAPPGVYP-TPPPPNSGYMADPQEP--PP 104
>gnl|CDD|219118 pfam06640, P_C, P protein C-terminus. This family represents the
C-terminus of plant P proteins. The maize P gene is a
transcriptional regulator of genes encoding enzymes for
flavonoid biosynthesis in the pathway leading to the
production of a red phlobaphene pigment, and P proteins
are homologous to the DNA-binding domain of myb-like
transcription factors. All members of this family
contain the pfam00249 domain.
Length = 209
Score = 32.0 bits (72), Expect = 1.2
Identities = 24/82 (29%), Positives = 29/82 (35%), Gaps = 12/82 (14%)
Query: 830 TNRRREVST---SDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVS 886
++RR T S +S SR P E ASSP S R V
Sbjct: 12 ADKRRGGRTPGRSPKSSASRTKQADADQPGGEAA----GDAAAASSP--RHSDGARSAVV 65
Query: 887 SPSPSKP---SGSTSRLPLLPP 905
P P++P SGST P
Sbjct: 66 DPGPNQPNSSSGSTGTAEEGPC 87
>gnl|CDD|233411 TIGR01431, adm_rel, adenosine deaminase-related growth factor.
Members of this family have been described as secreted
proteins with growth factor activity and regions of
adenosine deaminase homology in insects, mollusks, and
vertebrates.
Length = 479
Score = 32.5 bits (74), Expect = 1.3
Identities = 22/107 (20%), Positives = 40/107 (37%), Gaps = 11/107 (10%)
Query: 318 FLQLLIERFSIPEFQGTVREDLKKFRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLE 377
F L P F+ L++F ++ VQ ++ R + Y+ H E
Sbjct: 182 FFTLSGLLHYAPVFRDYYFRALEEFYEDNVQYMELRS-RLFP------LYELSGTHHDEE 234
Query: 378 QLKLFLETINGKSMRKWVDSV-SKVIF---RKTDPLELQKQIKFAFD 420
+ + K + + D + K+I+ R D E+ + IK A
Sbjct: 235 WSVKTYKEVTEKFVEEHPDFIGIKIIYSDLRSKDVEEIAEYIKMAMG 281
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 31.8 bits (72), Expect = 1.5
Identities = 20/93 (21%), Positives = 30/93 (32%), Gaps = 6/93 (6%)
Query: 825 PLPPRTNRRREVSTSDQ-SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPS--PSSPG 881
P P +E+ + + S D P P+ + P A +P P P P
Sbjct: 67 PTQPPEGVAQEIQDAGDAAAASVD--PQPVAQPPVESTP-AGVPVAAQTPKPVKPPKQPP 123
Query: 882 RFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQV 914
V + KP P P +F Q+
Sbjct: 124 AGAVPAKPTPKPEPKPVAEPAAAPTGQAFVVQL 156
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 32.1 bits (73), Expect = 1.6
Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 11/79 (13%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGR 882
P + R + S + + + PP P RET PP P P
Sbjct: 370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETAT----PPPVPPRPVAPP----- 420
Query: 883 FHVSSPSPSKPSGSTSRLP 901
V S P + + +P
Sbjct: 421 --VPHTPESAPKLTRAAIP 437
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 32.1 bits (73), Expect = 1.7
Identities = 18/100 (18%), Positives = 22/100 (22%), Gaps = 16/100 (16%)
Query: 816 TFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
T D P PP RR TS+ + P P P
Sbjct: 87 TVDPSAGEPAPPPPHARR----TSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTA 142
Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
P+ P P P +G Q
Sbjct: 143 RPAYPAYQQRPEPGA------------WPRAADDYGWQQQ 170
Score = 31.3 bits (71), Expect = 3.0
Identities = 18/120 (15%), Positives = 26/120 (21%), Gaps = 7/120 (5%)
Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
PG+ +G Y+ P R R
Sbjct: 154 EPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAG----RPEYDQRRR 209
Query: 855 PRRETRPSLSSPPGYASS-PDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQ 913
RP P + P+P P + P + P+ P P Q
Sbjct: 210 DYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVV--PIRPSAPGPLAAQ 267
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 32.0 bits (72), Expect = 1.7
Identities = 43/234 (18%), Positives = 74/234 (31%), Gaps = 24/234 (10%)
Query: 738 TPQTPLTPPTANSEFTVFAPIMLSNANGS-QSPGSVSTAFSMFSTQSVQSLASSTITLRP 796
TP L P S A + + A + +V+ A + + SS++
Sbjct: 283 TPPDALLQPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPSSSLPTDI 342
Query: 797 GSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPR 856
G+ ++ S PPLP + P P+PP
Sbjct: 343 GN-----------KAVVSSAKKEAEEVPPLPQ--------AAPAVVKPGPMEIPTPVPPP 383
Query: 857 RETRPSLSSPPGYASSPDPSPS---SPGRFHVSSPSPSKPSGSTSRLPLL-PPKPFSFGK 912
PSL +PPG + + +PS SP + P L P K +
Sbjct: 384 GLAIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGALDDTLAWKEPSKEDQTSE 443
Query: 913 QVPMFQFNAPGNDKWAMHEAKEAIEKSKKKISCLVLPVEKIHYLLSKDVLQYSG 966
M A+ K+ EK +++ ++ + L S++ + G
Sbjct: 444 DGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSEDASLASQEGMSIRG 497
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 32.0 bits (72), Expect = 1.7
Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 6/86 (6%)
Query: 831 NRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSP 890
+ +S S+ + S+ +P + +PS PP S+P +PS
Sbjct: 160 TKVAIISKSEDAA-SQVTPSQKIPETTDPKPS---PPAEDKQKPKVESAPVAEKPKAPSS 215
Query: 891 SKPSGSTSRLPLLPPKPFSFGKQVPM 916
P +++ P LPPK ++VPM
Sbjct: 216 PPPPKQSAKEPQLPPKERE--RRVPM 239
>gnl|CDD|215361 PLN02673, PLN02673, quinolinate synthetase A.
Length = 724
Score = 31.9 bits (72), Expect = 1.7
Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 15/79 (18%)
Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSS---PPG-----------YASSPDPSPSSPGR 882
TS S + P P P R T P+ S + PD +PS+
Sbjct: 5 PTSSSSSSFLSLLPNPSPNFRTTHPNFGSQRRIGTINPLFKSFKCIQSPPPDSAPSNASP 64
Query: 883 FHVSSPSPSKPSGSTSRLP 901
F S+ + S PS +T +P
Sbjct: 65 FSCSAVAFS-PSQTTELVP 82
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 31.8 bits (72), Expect = 1.8
Identities = 24/63 (38%), Positives = 27/63 (42%), Gaps = 5/63 (7%)
Query: 799 SSGPEFSGEDSPGLMGSTFDGL--YSPPPLPPRTNRRREVST--SDQSVHSRDISPPPLP 854
S G DSP GS F L P P P RR+++T SD RD S P P
Sbjct: 478 SDPDRLPGTDSPPKRGSGFQILSGTKPDPWSPGARSRRDLTTFRSDDPDRYRD-SQAPDP 536
Query: 855 PRR 857
P R
Sbjct: 537 PHR 539
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 32.0 bits (72), Expect = 1.9
Identities = 23/110 (20%), Positives = 36/110 (32%), Gaps = 8/110 (7%)
Query: 821 YSPPPLPPRTNRRREVSTSDQSVHSR---DISPPPLPPRRETRPSLSS-----PPGYASS 872
Y P P T + + R + PP PP R RP+ ++ P
Sbjct: 674 YQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGR 733
Query: 873 PDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMFQFNAP 922
P ++PGR + +P + + P + G P AP
Sbjct: 734 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAP 783
Score = 29.6 bits (66), Expect = 9.7
Identities = 19/108 (17%), Positives = 34/108 (31%), Gaps = 2/108 (1%)
Query: 798 SSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRR 857
+SS P ++ P S + P T+ R+ + + R + P+
Sbjct: 586 ASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNV 645
Query: 858 ETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
P+ PP +P + PSP+ +T P
Sbjct: 646 LVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGA--NTMLPIQWAP 691
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 31.4 bits (71), Expect = 2.0
Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 1/67 (1%)
Query: 842 SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSS-PGRFHVSSPSPSKPSGSTSRL 900
+ + D +PPP PP + P P P+ P + + P P +
Sbjct: 35 ATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPP 94
Query: 901 PLLPPKP 907
P+ P P
Sbjct: 95 PVDPNAP 101
>gnl|CDD|241472 cd13318, PH_IQSEC, IQ motif and SEC7 domain-containing protein
family Pleckstrin homology domain. The IQSEC (also
called BRAG/Brefeldin A-resistant Arf-gunanine
nucleotide exchange factor) family are a subset of Arf
GEFs that have been shown to activate Arf6, which acts
in the endocytic pathway to control the trafficking of a
subset of cargo proteins including integrins and have
key roles in the function and organization of distinct
excitatory and inhibitory synapses in the retina. The
family consists of 3 members: IQSEC1 (also called
BRAG2/GEP100), IQSEC2 (also called BRAG1), and IQSEC3
(also called SynArfGEF, BRAG3, or KIAA1110). IQSEC1
interacts with clathrin and modulates cell adhesion by
regulating integrin surface expression and in addition
to Arf6, it also activates the class II Arfs, Arf4 and
Arf5. Mutations in IQSEC2 cause non-syndromic X-linked
intellectual disability as well as reduced activation of
Arf substrates (Arf1, Arf6). IQSEC3 regulates Arf6 at
inhibitory synapses and associates with the
dystrophin-associated glycoprotein complex and S-SCAM.
These members contains a IQ domain that may bind
calmodulin, a PH domain that is thought to mediate
membrane localization by binding of phosphoinositides,
and a SEC7 domain that can promote GEF activity on ARF.
PH domains have diverse functions, but in general are
involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 128
Score = 30.3 bits (69), Expect = 2.0
Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 13/91 (14%)
Query: 132 ERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPDSDDYKN 191
+R FLF+ LL++ K SK+ S SVT Y ++ F + +++L L ++ Y +
Sbjct: 27 QREVFLFNDLLVVTKIFSKKKS-SVT---------YTFRQSFPLLGMQVL-LFENSYYPH 75
Query: 192 AFQIVPRCGSPVILIANSP--EDKNNWMADL 220
++ + V++ N+P D+ ++ DL
Sbjct: 76 GIRLTSPLDNKVLITFNAPNESDRKKFVEDL 106
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 31.7 bits (72), Expect = 2.3
Identities = 19/56 (33%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSK 892
S+S + PP P E SSP AS P PSS R +SP K
Sbjct: 203 SSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRI-FASPLARK 257
>gnl|CDD|223652 COG0579, COG0579, Predicted dehydrogenase [General function
prediction only].
Length = 429
Score = 31.5 bits (72), Expect = 2.4
Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 10/70 (14%)
Query: 645 SVEPRIRRFLENLTPFDNWKEIDISNYLYEQSLKIEPRYAKLRKDIP-----RRWPNLS- 698
SV R + +L K ++NY + + EPR L IP + N++
Sbjct: 297 SVRKANSRGMPDL----GIKNNVLANYAGIRPILKEPRLPALDFIIPEAKDEDWFINVAG 352
Query: 699 LKSPGIKASK 708
++S G+ A
Sbjct: 353 IRSQGLTADP 362
>gnl|CDD|220815 pfam10577, UPF0560, Uncharacterized protein family UPF0560. This
family of proteins has no known function.
Length = 805
Score = 31.8 bits (72), Expect = 2.4
Identities = 22/122 (18%), Positives = 38/122 (31%), Gaps = 18/122 (14%)
Query: 812 LMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPG--- 868
++ S + P L + R+ ++S++ + + LP E+ P +S G
Sbjct: 341 MVSSAGEADMHTPMLKSAFSSSRDFTSSEELLAHKAEDKSQLPQSGESFPLKASRSGEQK 400
Query: 869 ----------YASSPDPSPSSPGRFHVSSPSPSK-----PSGSTSRLPLLPPKPFSFGKQ 913
Y SS H S + + S P P P K+
Sbjct: 401 EEYSKLETEEYRRGYGTVESSSLENHRDSFGLANQNHYSAPPTVSIQPPSGPVPPKPEKE 460
Query: 914 VP 915
V
Sbjct: 461 VL 462
>gnl|CDD|223034 PHA03293, PHA03293, deoxyribonuclease; Provisional.
Length = 523
Score = 31.6 bits (72), Expect = 2.6
Identities = 25/111 (22%), Positives = 38/111 (34%), Gaps = 29/111 (26%)
Query: 530 NFNGVLAIVSALGSASVYRLKCTFQAL--------PARLENALVELRELGNDHFRKYQER 581
+ +GVLA + + +KC + ++ L+ R R +
Sbjct: 257 DPHGVLAPHPTQTTLEFFEIKCRAKYAFPPDDGSPLSQAYERLLARRTAAA--LRAF--- 311
Query: 582 LRSINPPCVPFFGMYLTNILHIEEGNPDYLPNSNEELINFS---KRRRVAE 629
LRSI P V +F PD +P + E LI K R A
Sbjct: 312 LRSIRNPGVEYFA-------------PDRVPGAAEALITCDDAWKPGRAAG 349
>gnl|CDD|237138 PRK12563, PRK12563, sulfate adenylyltransferase subunit 2;
Provisional.
Length = 312
Score = 31.3 bits (71), Expect = 2.6
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 659 PFDNWKEIDISNYLYEQSLKIEPRYAKLRKDIPRR 693
P NW E+D+ Y+ + + + P Y R+ + R
Sbjct: 195 PLSNWTELDVWQYIAREKIPLVPLYFAKRRPVVER 229
>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 843
Score = 31.5 bits (72), Expect = 2.7
Identities = 33/175 (18%), Positives = 59/175 (33%), Gaps = 22/175 (12%)
Query: 473 TKSNKEEKSPNLLKIIKHT-TNF-----TRWLEKTIVETENLEERIAIVS---RAIEIMM 523
+ K +L ++ T T RWL + +++ E +EER+ V E+
Sbjct: 274 SNLRGNGKKGSLFWLLDRTVTPMGSRLLKRWLLQPLLDKEEIEERLDAVEELKDNPELRE 333
Query: 524 VLND-LNNFNGVLAIVS--ALGSAS---VYRLKCTFQALPARLENALVELRELGNDHFRK 577
L + L + ++S +LG AS + RL+ + + +P + E +
Sbjct: 334 KLREMLKKVPDLERLLSRLSLGRASPRDLLRLRDSLEKIPEIFKLLSSLKSESDLLLLLE 393
Query: 578 YQERLRSINPPCVPFFGMYLTNI-LHIEEGNPDYLPNS--NEELINFSKRRRVAE 629
E L + T I D + N EL A+
Sbjct: 394 DIESL----DYLAELLELLETAINEDPPLAVRDGIIKEGYNIELDELRDLLNNAK 444
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 31.3 bits (71), Expect = 2.7
Identities = 12/48 (25%), Positives = 21/48 (43%)
Query: 846 RDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKP 893
R PP P+ + P A++P P++P R + P+P+
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPA 426
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 31.2 bits (70), Expect = 2.9
Identities = 29/134 (21%), Positives = 44/134 (32%), Gaps = 39/134 (29%)
Query: 765 GSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPP 824
G+Q P + A M + + + T P ++ P +D+P
Sbjct: 401 GTQGPQGTAPAAGMTPSSAAPA------TPAPSAAPSPRVPWDDAP-------------- 440
Query: 825 PLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFH 884
P PPR+ I P P P E P +P AS+ D P+
Sbjct: 441 PAPPRSG----------------IPPRPAPRMPEASPVPGAPDSVASASDAPPTLGD--- 481
Query: 885 VSSPSPSKPSGSTS 898
S + PSG +
Sbjct: 482 PSDTAEHTPSGPRT 495
>gnl|CDD|184628 PRK14331, PRK14331, (dimethylallyl)adenosine tRNA
methylthiotransferase; Provisional.
Length = 437
Score = 31.3 bits (71), Expect = 3.0
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 8/60 (13%)
Query: 192 AFQIVPRCGSPVILIAN--SPEDKNNWMADLIMLNTKSMFER------TLDSILLDEEKK 243
+F+ PR G+P + E K M L+ L + F++ T+ +L++EEK+
Sbjct: 331 SFKYSPRPGTPAAYMEGQEPDEVKTKRMNRLLELQKEITFKKALSYEGTVQEVLVEEEKE 390
>gnl|CDD|241259 cd01225, PH_Cool_Pix, Cloned out of library/PAK-interactive
exchange factor pleckstrin homology (PH) domain. There
are two forms of Pix proteins: alpha Pix (also called
Rho guanine nucleotide exchange factor (GEF) 6/90Cool-2)
and beta Pix (GEF7/p85Cool-1). betaPix contains an
N-terminal SH3 domain, a RhoGEF/DH domain, a PH domain,
a GIT1 binding domain (GBD), and a C-terminal
coiled-coil (CC) domain. alphaPix differs in that it
contains a calponin homology (CH) domain, which
interacts with beta-parvin, N-terminal to the SH3
domain. alphaPix is an exchange factor for Rac1 and
Cdc42 and mediates Pak activation on cell adhesion to
fibronectin. Mutations in alphaPix can cause X-linked
mental retardation. alphaPix also interacts with
Huntington's disease protein (htt), and enhances the
aggregation of mutant htt (muthtt) by facilitating
SDS-soluble muthtt-muthtt interactions. The DH-PH domain
of a Pix was required for its binding to htt. In the
majority of Rho GEF proteins, the DH-PH domain is
responsible for the exchange activity. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 100
Score = 29.2 bits (66), Expect = 3.1
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 183 LPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWM 217
L D++ K+AF+I +++I NS D+ W+
Sbjct: 57 LEDTEGIKHAFEISGPMIERIVVICNSSNDQQEWI 91
>gnl|CDD|147993 pfam06126, Herpes_LAMP2, Herpesvirus Latent membrane protein 2.
Family of Kaposi's sarcoma-associated herpesvirus (HHV8)
latent membrane protein.
Length = 510
Score = 31.2 bits (70), Expect = 3.1
Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 15/57 (26%)
Query: 832 RRREVSTSDQSVHSRD--------------ISPPPLPPR-RETRPSLSSPPGYASSP 873
RR + T D ++H+ ISPPPLPP R LS A SP
Sbjct: 369 RRLPIYTPDDNLHAHAGRICPDVNHLANNLISPPPLPPFFRIHSRILSHTTDMALSP 425
>gnl|CDD|218233 pfam04732, Filament_head, Intermediate filament head (DNA binding)
region. This family represents the N-terminal head
region of intermediate filaments. Intermediate filament
heads bind DNA. Vimentin heads are able to alter nuclear
architecture and chromatin distribution, and the
liberation of heads by HIV-1 protease liberates may play
an important role in HIV-1 associated cytopathogenesis
and carcinogenesis. Phosphorylation of the head region
can affect filament stability. The head has been shown
to interaction with the rod domain of the same protein.
Length = 89
Score = 29.0 bits (65), Expect = 3.2
Identities = 19/53 (35%), Positives = 21/53 (39%), Gaps = 1/53 (1%)
Query: 854 PPR-RETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
P R +R S S +SS S R SSPSPS S R PP
Sbjct: 13 PTRSSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSLKSSYRKRSSSAPP 65
>gnl|CDD|132626 TIGR03587, Pse_Me-ase, pseudaminic acid biosynthesis-associated
methylase. Members of this small clade are
methyltransferases of the pfam08241 family and are
observed within operons for the biosynthesis of
pseudaminic acid, a component of exopolysaccharide and
flagellin glycosyl modifications. Notable among these
genomes is Pseudomonas fluorescens PfO-1. Possibly one
of the two hydroxyl groups of pseudaminic acid, at
positions 4 and 8 is converted to a methoxy group by
this enzyme.
Length = 204
Score = 30.5 bits (69), Expect = 3.3
Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 554 QALPARLENALVELRELGNDHFRKYQERLRSIN-----PPCVPFFGMYLTN--ILHIEEG 606
+ L VE+ E + + Y + I P FF + LT ++HI
Sbjct: 62 KRLLPFKHIYGVEINEYAVEKAKAYLPNINIIQGSLFDPFKDNFFDLVLTKGVLIHI--- 118
Query: 607 NPDYLPNSNEELINFSKR 624
NPD LP + EL S R
Sbjct: 119 NPDNLPTAYRELYRCSNR 136
>gnl|CDD|241255 cd01220, PH1_FARP1-like, FERM, RhoGEF and pleckstrin
domain-containing protein 1 and related proteins
Pleckstrin Homology (PH) domain, repeat 1. Members here
include FARP1 (also called Chondrocyte-derived
ezrin-like protein; PH domain-containing family C member
2), FARP2 (also called FIR/FERM domain including RhoGEF;
FGD1-related Cdc42-GEF/FRG), and FARP6 (also called Zinc
finger FYVE domain-containing protein 24). They are
members of the Dbl family guanine nucleotide exchange
factors (GEFs) which are upstream positive regulators of
Rho GTPases. Little is known about FARP1 and FARP6,
though FARP1 has increased expression in differentiated
chondrocytes. FARP2 is thought to regulate neurite
remodeling by mediating the signaling pathways from
membrane proteins to Rac. It is found in brain, lung,
and testis, as well as embryonic hippocampal and
cortical neurons. FARP1 and FARP2 are composed of a
N-terminal FERM domain, a proline-rich (PR) domain,
Dbl-homology (DH), and two C-terminal PH domains. FARP6
is composed of Dbl-homology (DH), and two C-terminal PH
domains separated by a FYVE domain. This hierarchy
contains the first PH repeat. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 123
Score = 29.6 bits (67), Expect = 3.4
Identities = 29/112 (25%), Positives = 51/112 (45%), Gaps = 18/112 (16%)
Query: 112 EFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKE 171
EF RE LLKL+ K +R+ FLF +L+ TS T+ L ++++
Sbjct: 21 EFIREGCLLKLS---KKGLQQRMFFLFSDVLIY-------TSRGPTSTL-----QFKVHG 65
Query: 172 RFFIRRIEILDLPDSDDYKNAFQIVPRCGSPVILI-ANSPEDKNNWMADLIM 222
+ +R + + + + F I G+ + + A+S E+K W+ DL
Sbjct: 66 QLPLRGLMVEESEPEMGVPHCFTI--YGGNRALTVAASSEEEKERWLEDLSR 115
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 30.6 bits (69), Expect = 3.4
Identities = 14/55 (25%), Positives = 17/55 (30%), Gaps = 7/55 (12%)
Query: 849 SPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLL 903
PP P P S P + P+ S P + SP S P L
Sbjct: 153 MQPPPPH---AMPPASPPAAQPAPSAPASSPPP----TPASPPPAKAPKSSHPPL 200
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 31.0 bits (70), Expect = 3.6
Identities = 43/181 (23%), Positives = 60/181 (33%), Gaps = 28/181 (15%)
Query: 733 SESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTI 792
SE Q PP+ N+ + + A+ S S S S S +SS
Sbjct: 365 SEDSDEEQATEKPPSRNTPPSAPSSNPEPAASSSGSSSSSSG-----------SESSS-- 411
Query: 793 TLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRR------EVSTSDQSVHSR 846
GS S E S DS SP P PP TN+ + +V+ S
Sbjct: 412 ----GSDSESESSSSDSEENEPPRTA---SPEPEPPSTNKWQLDNWLNKVNPHKVSPAES 464
Query: 847 DISPPPL--PPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLP 904
S PP+ P +E + S + S +P P S + + K S P
Sbjct: 465 VSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSP 524
Query: 905 P 905
Sbjct: 525 A 525
>gnl|CDD|235765 PRK06274, PRK06274, indolepyruvate oxidoreductase subunit B;
Reviewed.
Length = 197
Score = 30.0 bits (68), Expect = 3.9
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 7/64 (10%)
Query: 439 ILVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKEEKSPNLLKIIK------HTT 492
+L L P E+AR L L+ V I + T +++ +K K
Sbjct: 71 LLALEPAEVARNLHFLKKGGKIIVNAYAIHPAT-TVGSEKYDPEKEIKFAKEKICDVICI 129
Query: 493 NFTR 496
+FT+
Sbjct: 130 DFTK 133
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 31.0 bits (70), Expect = 3.9
Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 9/76 (11%)
Query: 848 ISPPPLPPRRETRPSLSSPPGYASSP-DPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPK 906
PPP S S P +S+P + P + PSP++PS + +P PP
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIP-APPI 245
Query: 907 PFSFGKQVPMFQFNAP 922
P P+ Q+ AP
Sbjct: 246 P-------PVIQYVAP 254
>gnl|CDD|227950 COG5663, COG5663, Uncharacterized conserved protein [Function
unknown].
Length = 194
Score = 30.2 bits (68), Expect = 4.1
Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 181 LDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIMLNTKSMFERTLDSIL 237
+DL D + NA QI G PVILI NSP ++ ++I N + +DS L
Sbjct: 135 IDLFFEDSHDNAGQIAKNAGIPVILI-NSPYNRKPAAKNIIRANNWAEAYEWVDSRL 190
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 30.8 bits (69), Expect = 4.1
Identities = 18/90 (20%), Positives = 25/90 (27%), Gaps = 2/90 (2%)
Query: 791 TITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISP 850
L +S+GP+ D P G PP L + S + S
Sbjct: 247 QPALPRQNSAGPQ-RRVDQPSPPGGGSHRGRIPPSLL-SSLPSEGSMLSSEWPQSGARPR 304
Query: 851 PPLPPRRETRPSLSSPPGYASSPDPSPSSP 880
+ P L G+ P P S
Sbjct: 305 QQSSSSKGDSPELRPAAGHLQQPSPVNMSA 334
>gnl|CDD|223328 COG0250, NusG, Transcription antiterminator [Transcription].
Length = 178
Score = 30.0 bits (68), Expect = 4.1
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)
Query: 473 TKSNKEEKSPNLLKIIKHTTNFTRWLEKTIVETENLEE----RIAIVSRA-----IEIMM 523
T S +E+K L+ + + +V TE + E R IV R + + M
Sbjct: 10 TYSGQEKKVKENLERKAELLGMEDLIFEVLVPTEEVVEVKGKRKVIVERKLFPGYVLVEM 69
Query: 524 VLND-----LNNFNGVLAIVSALGS 543
+ D + N GV V + G+
Sbjct: 70 DMTDEAWHLVRNTPGVTGFVGSGGA 94
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 30.4 bits (69), Expect = 4.2
Identities = 25/137 (18%), Positives = 36/137 (26%), Gaps = 7/137 (5%)
Query: 788 ASSTITLRPGSSSGPEFSGEDSPGLMGST----FDGLYSPPPLPPRTNRRREVSTSDQSV 843
T S E + L T G+ P R+R S S
Sbjct: 65 RVRTPASIKTSKRLIEVPEAEESLLDSYTTPSDKGGMLRILSTPELPKRKRSFSASSLES 124
Query: 844 HSRDISPPPLPPRRETRPSLSSPPGYASSPDPSP--SSPGRFHVSSPSPSKPSGSTSRLP 901
S SP P + ++S +P + +P P S ++
Sbjct: 125 PSLFFSPASFSPSAAPSTPSPNSAKFSSRSNPGEVVETLNPHLGQTPEGGGPD-SDPKVK 183
Query: 902 LLPPKPFSFGKQVPMFQ 918
L K MFQ
Sbjct: 184 LSANFDAKKYKYRTMFQ 200
>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
family. Included in this family of heterogeneous
ribonucleoproteins are PTB (polypyrimidine tract binding
protein ) and hnRNP-L. These proteins contain four RNA
recognition motifs (rrm: pfam00067).
Length = 481
Score = 30.6 bits (69), Expect = 4.3
Identities = 17/93 (18%), Positives = 22/93 (23%), Gaps = 11/93 (11%)
Query: 802 PEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRP 861
P+ G PGL + + P S S PL P
Sbjct: 191 PDLPGRRDPGLDQTHRQRQPALLGQHPS-------SYGHDGYSSHGGPLAPLAGGDRMGP 243
Query: 862 SLSSP----PGYASSPDPSPSSPGRFHVSSPSP 890
P P Y ++P S P
Sbjct: 244 PHGPPSRYRPAYEAAPLAPAISSYGPAGGGPGS 276
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 30.7 bits (69), Expect = 5.2
Identities = 15/51 (29%), Positives = 18/51 (35%), Gaps = 10/51 (19%)
Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
PPP PP PPG+ P P P +V S + S L
Sbjct: 9 PPPPPP----------PPGFEPPSQPPPPPPPGVNVKKRSRKQLSIVGDIL 49
>gnl|CDD|241537 cd13386, PH1_FGD2, FYVE, RhoGEF and PH domain
containing/faciogenital dysplasia protein 2 pleckstrin
homology (PH), N-terminal domain. In general, FGDs have
a RhoGEF (DH) domain, followed by an N-terminal PH
domain, a FYVE domain and a C-terminal PH domain. All
FGDs are guanine nucleotide exchange factors that
activates the Rho GTPase Cdc42, an important regulator
of membrane trafficking. The RhoGEF domain is
responsible for GEF catalytic activity, while the
N-terminal PH domain is involved in intracellular
targeting of the DH domain. Not much is known about
FGD2. FGD1 is the best characterized member of the
group with mutations here leading to the X-linked
disorder known as faciogenital dysplasia (FGDY). PH
domains have diverse functions, but in general are
involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 107
Score = 28.8 bits (64), Expect = 5.4
Identities = 23/103 (22%), Positives = 52/103 (50%), Gaps = 14/103 (13%)
Query: 115 REDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFF 174
+E + K++ ++ + E+ FLF+ +LL C P + V A E++++ R
Sbjct: 2 KEGPIQKISF-RRNSTMEKYLFLFNNMLLYCVPK----VIQVGA-------EFQVRTRID 49
Query: 175 IRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWM 217
+ +++ +L D+ ++ ++F +V + L A S E+ W+
Sbjct: 50 VAGMKVRELMDA-EFPHSF-LVSGKQRTLELQARSQEEMIAWI 90
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.5 bits (69), Expect = 5.5
Identities = 18/121 (14%), Positives = 32/121 (26%), Gaps = 12/121 (9%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGR 882
P P + + S + P P+ T+P+ + P P P +P
Sbjct: 388 AAPQPS------AAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNP-- 439
Query: 883 FHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMFQFNAPGNDKWAMHEAKEAIEKSKKK 942
+ + P + A KEA ++K+
Sbjct: 440 ----PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKE 495
Query: 943 I 943
I
Sbjct: 496 I 496
Score = 29.7 bits (67), Expect = 9.4
Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 7/68 (10%)
Query: 847 DISPPPLPPRRETRPSLSSP-----PGYASSPDPSPSSPGRFHVSS--PSPSKPSGSTSR 899
D + P++ +P + P P A++ PSPS S S ++P+G+
Sbjct: 367 DDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426
Query: 900 LPLLPPKP 907
+ + PP
Sbjct: 427 VSVDPPAA 434
>gnl|CDD|215430 PLN02800, PLN02800, imidazoleglycerol-phosphate dehydratase.
Length = 261
Score = 29.8 bits (67), Expect = 5.6
Identities = 10/61 (16%), Positives = 18/61 (29%), Gaps = 4/61 (6%)
Query: 822 SPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPG 881
+ L P+ + + V S S P S ++ S ++ G
Sbjct: 9 AAQLLRPKLSFIDLLPRRAAIVSSPSSSLPRFLRME----SQLRQSISCAASSSSSNALG 64
Query: 882 R 882
R
Sbjct: 65 R 65
>gnl|CDD|220966 pfam11072, DUF2859, Protein of unknown function (DUF2859). This is
a bacterial family of uncharacterized proteins.
Length = 142
Score = 29.2 bits (66), Expect = 5.8
Identities = 14/59 (23%), Positives = 14/59 (23%), Gaps = 2/59 (3%)
Query: 849 SPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSP--SPSKPSGSTSRLPLLPP 905
P P P SP V SP SP LP L P
Sbjct: 6 GVSAAPYYEALNPQPDEPQPSPPSPPAGALEAAMLPVRSPRLSPGVVERRALNLPGLQP 64
>gnl|CDD|225979 COG3448, COG3448, CBS-domain-containing membrane protein [Signal
transduction mechanisms].
Length = 382
Score = 30.2 bits (68), Expect = 6.3
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 515 VSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALPARLENALV 565
VS AI M L L+ +G +A+ + LG +V+RL F P N+L+
Sbjct: 113 VSLAIGAMFALRCLHPPSGAVALTAVLGGPAVHRLGYNFVLWPV-ALNSLI 162
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 29.9 bits (67), Expect = 6.5
Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 6/55 (10%)
Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSS------PGRFHVSSPSPSKPSGSTS 898
PP P P+ S P AS +P+S P ++ +P PS S
Sbjct: 86 PPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVP 140
>gnl|CDD|114448 pfam05722, Ustilago_mating, Ustilago B locus mating-type protein.
This family consists of several Ustilago mating-type
proteins. The b locus of the phytopathogenic fungus
Ustilago maydis encodes a multiallelic recognition
function that controls the ability of the fungus to form
a dikaryon and complete the sexual stage of the life
cycle. The b locus has at least 25 alleles and any
combination of two different alleles, brought together
by mating between haploid cells, allows the fungus to
cause disease and undergo sexual development within the
plant.
Length = 239
Score = 29.8 bits (66), Expect = 6.7
Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 7/92 (7%)
Query: 827 PPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSP--SSPGRFH 884
PP+ R V+T+ + +R +P P ++ +P S P S+ D S S+P
Sbjct: 95 PPKPGMARAVTTAAKRHPARKTAPAAKPKPKKAKPRASKTPSIDSTLDSSALESTP---E 151
Query: 885 VSSPSPSKPSGST--SRLPLLPPKPFSFGKQV 914
+S+ S + S S+ S L PF G +
Sbjct: 152 LSACSTADNSFSSFCSNGSLGHHDPFQDGNDI 183
>gnl|CDD|220819 pfam10599, Nup_retrotrp_bd, Retro-transposon transporting motif.
This is the highly conserved C-terminal motif
GRKIxxxxxRRKx of nucleoporins that plays a critical and
unique role in the nuclear import of retro-transposons
in both yeasts and higher organisms. It would appear
that the arginine residues at positions 2 and 9-10
constitute a bipartite nuclear localisation signal, with
two basic peptide motifs separated by an interchangeable
spacer sequence, that is crucial for the
retro-transposon activity.
Length = 101
Score = 28.2 bits (62), Expect = 6.8
Identities = 9/67 (13%), Positives = 21/67 (31%)
Query: 740 QTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSS 799
+ P T+++ T F + + + S S + + + + +
Sbjct: 24 GSNNQPHTSSTPSTPFTFGASPSTPANAASPSGSNPPAFSGSPASAPNQFNVGSNNSNMF 83
Query: 800 SGPEFSG 806
S SG
Sbjct: 84 SSSSVSG 90
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 30.1 bits (68), Expect = 7.2
Identities = 11/64 (17%), Positives = 26/64 (40%)
Query: 698 SLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAP 757
+ P I++S+ + V P+ + S+ + + PP+++ F++
Sbjct: 665 QSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPPSSSQAFSLSDL 724
Query: 758 IMLS 761
M S
Sbjct: 725 PMQS 728
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.9 bits (67), Expect = 7.3
Identities = 17/64 (26%), Positives = 22/64 (34%), Gaps = 5/64 (7%)
Query: 849 SPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSK---PSG--STSRLPLL 903
SP P ++ P P P +PG PSP+ P +R L
Sbjct: 383 SPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPL 442
Query: 904 PPKP 907
PP P
Sbjct: 443 PPSP 446
>gnl|CDD|184643 PRK14354, glmU, bifunctional N-acetylglucosamine-1-phosphate
uridyltransferase/glucosamine-1-phosphate
acetyltransferase; Provisional.
Length = 458
Score = 29.8 bits (68), Expect = 7.4
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 10/51 (19%)
Query: 263 ANIILEEKDNHTGV-----PLIKGATLYKLVERLTYHIYADPAFVRTFLTT 308
A L +K+ T V PLI TL L++ +H + T LT
Sbjct: 85 AEEFLADKEGTTLVICGDTPLITAETLKNLID---FHE--EHKAAATILTA 130
>gnl|CDD|219741 pfam08193, INO80_Ies4, INO80 complex subunit Ies4. The INO80
ATPase is a member of the SNF2 family of ATPases and
functions as an integral component of a multisubunit
ATP-dependent chromatin remodelling complex. This family
of proteins corresponds to the fungal Ies4 subunit of
INO80.
Length = 228
Score = 29.6 bits (66), Expect = 7.8
Identities = 22/89 (24%), Positives = 29/89 (32%), Gaps = 6/89 (6%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPL------PPRRETRPSLSSPPGYASSPDPS 876
P TN R S + + +SP L P +E PS S A P PS
Sbjct: 2 APASKATTNGRSSRSGPSKLIVVLKLSPELLRAFLAPSPVKEDEPSDSPASSAADPPVPS 61
Query: 877 PSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
S+P+ + T R P
Sbjct: 62 SVDNASDAASTPAAGTSATDTPRRKGGPG 90
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 29.8 bits (67), Expect = 7.8
Identities = 29/182 (15%), Positives = 51/182 (28%), Gaps = 19/182 (10%)
Query: 701 SPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIML 760
+PG +A+ A V G +P + + PP A AP
Sbjct: 382 APGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAP------APPAT 435
Query: 761 SNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGL 820
++ + G + S S +P + SG +
Sbjct: 436 ADRGDDAADGDAPVPAKANARASADSRCDER-DAQPPADSGSASAPASD----------- 483
Query: 821 YSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSP 880
+PP R ++ D P R + + + P A P P+ ++P
Sbjct: 484 -APPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAP 542
Query: 881 GR 882
Sbjct: 543 AA 544
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 30.0 bits (67), Expect = 8.3
Identities = 25/143 (17%), Positives = 41/143 (28%), Gaps = 22/143 (15%)
Query: 782 QSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQ 841
+ Q + + G + + + SP P+P ++Q
Sbjct: 373 MNQQVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGMMSSPSPVPQ--------VQTNQ 424
Query: 842 SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLP 901
S+ P P P P P + PSP + + SPSP S+
Sbjct: 425 SMPQ---PPQPSVPSPGG-PGSQPPQSVSGGMIPSPPA----LMPSPSPQMSQSPASQ-- 474
Query: 902 LLPPKPFSFGKQVPMFQFNAPGN 924
+ P+ N PG
Sbjct: 475 -RTIQQDMVSPGGPL---NTPGQ 493
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 29.7 bits (67), Expect = 8.3
Identities = 18/99 (18%), Positives = 29/99 (29%), Gaps = 8/99 (8%)
Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISP-----PPLPPRRETRPSLSSPPGYASSPDPSP 877
P + R SD+ R ++P P PR + P+ P P
Sbjct: 159 AVTASPSSMIARNTPISDRL-RPRSVTPTRGRRPSSSPRSLSNPTTLESPSNLQVTTDVP 217
Query: 878 SSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPM 916
P + +S S + S + + P S
Sbjct: 218 --PPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST 254
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 30.1 bits (67), Expect = 8.6
Identities = 16/101 (15%), Positives = 29/101 (28%), Gaps = 6/101 (5%)
Query: 820 LYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPP--GYASSPDPSP 877
L++P P + ++ + P P + +P P Y P
Sbjct: 744 LFTPIVEPVQQPQQPVAPQQQYQQPQQ----PVAPQPQYQQPQQPVAPQPQYQQPQQPVA 799
Query: 878 SSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMFQ 918
P P +P + P+ P + +Q Q
Sbjct: 800 PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 840
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 28.8 bits (64), Expect = 9.0
Identities = 27/97 (27%), Positives = 31/97 (31%), Gaps = 7/97 (7%)
Query: 819 GLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPS 878
+ PPPLP T + I PPP P P PG + P P P
Sbjct: 23 CIPPPPPLPGGTG----IPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIP-PPPP 77
Query: 879 SPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
PG + P P G P PP P P
Sbjct: 78 LPGAAGIPPPPPL--PGGAGIPPPPPPLPGGAAVPPP 112
Score = 28.8 bits (64), Expect = 9.4
Identities = 25/67 (37%), Positives = 26/67 (38%), Gaps = 6/67 (8%)
Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLL-----P 904
PPPLP P PG A+ P P P PG V P P P P P
Sbjct: 87 PPPLPGGAGIPPPPPPLPGGAAVP-PPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPP 145
Query: 905 PKPFSFG 911
P PF FG
Sbjct: 146 PPPFGFG 152
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 30.0 bits (67), Expect = 9.1
Identities = 51/203 (25%), Positives = 67/203 (33%), Gaps = 42/203 (20%)
Query: 708 KNYMARNNLPA---LTVNKITGPKNSPLSESDGTPQ-TPLTPPTANSEFTVFAPIMLSNA 763
++ + +N L + N T + TPQ T L A SE V IM +
Sbjct: 275 RSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGS- 333
Query: 764 NGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSP 823
S + STA +++ S T +S E +P S
Sbjct: 334 --SPAETKASTA-----AWKIRNPLSRTSAPAVRIASATFRGLEKNP-----------ST 375
Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
P P T R R V T+ Q H + P P A PSPS
Sbjct: 376 APSTPATPRVRAVLTT-QVHHCVVVKPAP-----------------AVPTTPSPSLTTAL 417
Query: 884 HVSSPSPSKPSGSTSRLPLLPPK 906
+PSPS PS P L PK
Sbjct: 418 FPEAPSPS-PSALPPGQPDLHPK 439
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 29.5 bits (66), Expect = 9.5
Identities = 35/176 (19%), Positives = 55/176 (31%), Gaps = 44/176 (25%)
Query: 742 PLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSG 801
PL P N P S S +T + Q+++S S++ T + S
Sbjct: 78 PLALPMPN--LAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCNDSE 135
Query: 802 PEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRP 861
S P +++SD S S P R +
Sbjct: 136 QS------------------STPN----------LNSSDTSTSSSGALPSTSVVRGFSPS 167
Query: 862 SLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMF 917
+SS Y S+ + +PSP+K + T+ KP K+ MF
Sbjct: 168 HISS--SYRSTAQLNK---------APSPTKSAEPTAA---PQAKPELPKKKQAMF 209
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.404
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,408,231
Number of extensions: 5079593
Number of successful extensions: 6418
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5884
Number of HSP's successfully gapped: 260
Length of query: 966
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 860
Effective length of database: 6,236,078
Effective search space: 5363027080
Effective search space used: 5363027080
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)