RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16398
(537 letters)
>gnl|CDD|219498 pfam07651, ANTH, ANTH domain. AP180 is an endocytotic accessory
proteins that has been implicated in the formation of
clathrin-coated pits. The domain is involved in
phosphatidylinositol 4,5-bisphosphate binding and is a
universal adaptor for nucleation of clathrin coats.
Length = 278
Score = 246 bits (629), Expect = 5e-78
Identities = 86/279 (30%), Positives = 136/279 (48%), Gaps = 13/279 (4%)
Query: 19 GQGLAKAVCKATTEELIGPKKKHLDYLLHCTNEANVSIPDLATLLIERTQNSSS-VVVFK 77
L AV KAT+ + P KK + + + L L R + + VV K
Sbjct: 1 DSDLEVAVVKATSHD-EVPPKKKHVREILVGTSSPAKVAALFWALSRRLPLTRNWVVALK 59
Query: 78 ALITIHHLMCYGNERFTQYLASSNCSLQLGNFLDK---SGIHGYDMTPFIHRYSRYINEK 134
ALI +H L+ G+ Q L + + + S +D FI Y++Y++E+
Sbjct: 60 ALILVHKLLREGHPSVLQELLRARRRISSLLRISSFDDSMSLTWDYGAFIRAYAKYLDER 119
Query: 135 SLSYRTVAFD--FCKVKRG-----KEDGLLRTMPANKLLKSLPVLQSQIDALLEFDCNNS 187
+R + D F +V+ G + TM LL +P LQ +DALL+ +
Sbjct: 120 LDFHRKLPRDPTFERVEYGSLRAVGDPNSRYTMSMEDLLDIIPKLQKLLDALLKCKPTGN 179
Query: 188 DLRNGVINGAFMLLFRDLIRLFAGYNDSIINLLEKYFDMNKKQCKEALDLYKKFLIRMDR 247
L N I A +LL ++ L+ N+ IINLLEK+F+M+K AL +YK+F+ + +R
Sbjct: 180 ALTNECIIAALILLIKESFGLYGAINEGIINLLEKFFEMSKPDADAALGIYKRFVSQFER 239
Query: 248 VAEFLKVAENVGIDKG-DIPDLTKAPSSLLEALEQHLAT 285
+ EF +V +N+G + +IP L P +LL+ALE+HL
Sbjct: 240 LKEFYEVCKNLGYFRSLEIPKLPHIPPNLLDALEEHLRD 278
>gnl|CDD|239622 cd03564, ANTH_AP180_CALM, ANTH domain family; composed of adaptor
protein 180 (AP180), clathrin assembly lymphoid myeloid
leukemia protein (CALM) and similar proteins. A set of
proteins previously designated as harboring an ENTH
domain in fact contains a highly similar, yet unique
module referred to as an AP180 N-terminal homology
(ANTH) domain. AP180 and CALM play important roles in
clathrin-mediated endocytosis. AP180 is a brain-specific
clathrin-binding protein which stimulates clathrin
assembly during the recycling of synaptic vesicles. The
ANTH domain is structurally similar to the VHS domain
and is composed of a superhelix of eight alpha helices.
ANTH domains bind both inositol phospholipids and
proteins, and contribute to the nucleation and formation
of clathrin coats on membranes. ANTH-bearing proteins
have recently been shown to function with adaptor
protein-1 and GGA adaptors at the trans-Golgi network,
which suggests that the ANTH domain is a universal
component of the machinery for clathrin-mediated
membrane budding.
Length = 117
Score = 157 bits (399), Expect = 3e-46
Identities = 48/117 (41%), Positives = 67/117 (57%)
Query: 22 LAKAVCKATTEELIGPKKKHLDYLLHCTNEANVSIPDLATLLIERTQNSSSVVVFKALIT 81
L KAV KAT+ + PK KH+ ++ T+ + SIP A+ L R + + VVV KALI
Sbjct: 1 LEKAVKKATSHDEAPPKDKHVRKIIAGTSSSPASIPSFASALSRRLLDRNWVVVLKALIL 60
Query: 82 IHHLMCYGNERFTQYLASSNCSLQLGNFLDKSGIHGYDMTPFIHRYSRYINEKSLSY 138
+H L+ G+ F Q L S L L NFLDKS GY + FI Y+RY++E+ +
Sbjct: 61 LHRLLREGHPSFLQELLSRRGWLNLSNFLDKSSSLGYGYSAFIRAYARYLDERLSFH 117
>gnl|CDD|214594 smart00273, ENTH, Epsin N-terminal homology (ENTH) domain.
Length = 127
Score = 125 bits (316), Expect = 3e-34
Identities = 38/127 (29%), Positives = 62/127 (48%), Gaps = 3/127 (2%)
Query: 21 GLAKAVCKATTEELIGPKKKHLDYLLHCTNEANVSIPDLATLLIERTQNSSS-VVVFKAL 79
L V KAT + GPK KHL ++ T+ S ++ +L R ++ + VV+KAL
Sbjct: 2 DLEVKVRKATNNDEWGPKGKHLREIIQGTHNEKSSFAEIMAVLWRRLNDTKNWRVVYKAL 61
Query: 80 ITIHHLMCYGNERFT-QYLASSNCSLQLGNFLDKSGIHGYDMTPFIHRYSRYINEKSLSY 138
I +H+L+ G+ R + L + N L L +F D G D I Y++Y+ E+
Sbjct: 62 ILLHYLLRNGSPRVILEALRNRNRILNLSDFQDIDS-RGKDQGANIRTYAKYLLERLEDD 120
Query: 139 RTVAFDF 145
R + +
Sbjct: 121 RRLKEER 127
>gnl|CDD|238118 cd00197, VHS_ENTH_ANTH, VHS, ENTH and ANTH domain superfamily;
composed of proteins containing a VHS, ENTH or ANTH
domain. The VHS domain is present in Vps27 (Vacuolar
Protein Sorting), Hrs (Hepatocyte growth
factor-regulated tyrosine kinase substrate) and STAM
(Signal Transducing Adaptor Molecule). It is located at
the N-termini of proteins involved in intracellular
membrane trafficking. The epsin N-terminal homology
(ENTH) domain is an evolutionarily conserved protein
module found primarily in proteins that participate in
clathrin-mediated endocytosis. A set of proteins
previously designated as harboring an ENTH domain in
fact contains a highly similar, yet unique module
referred to as an AP180 N-terminal homology (ANTH)
domain. VHS, ENTH and ANTH domains are structurally
similar and are composed of a superhelix of eight alpha
helices. ENTH adnd ANTH (E/ANTH) domains bind both
inositol phospholipids and proteins and contribute to
the nucleation and formation of clathrin coats on
membranes. ENTH domains also function in the development
of membrane curvature through lipid remodeling during
the formation of clathrin-coated vesicles. E/ANTH
domain-bearing proteins have recently been shown to
function with adaptor protein-1 and GGA adaptors at the
trans-Golgi network, which suggests that E/ANTH domains
are universal components of the machinery for
clathrin-mediated membrane budding.
Length = 115
Score = 101 bits (253), Expect = 1e-25
Identities = 33/114 (28%), Positives = 57/114 (50%), Gaps = 1/114 (0%)
Query: 24 KAVCKATTEELIGPKKKHLDYLLHCTNEANVSIPDLATLLIERTQNSSSVVVFKALITIH 83
K V KAT+ E +GP + + NE NV + + +R N + VV KAL +
Sbjct: 3 KTVEKATSNENMGPDWPLIMEICDLINETNVGPKEAVDAIKKRINNKNPHVVLKALTLLE 62
Query: 84 HLMCYGNERFTQYLASSNCSLQLGNFLDKSGIHGYDMTPFIHRYSRYINEKSLS 137
+ + ERF Q +AS++ +++L F DKS + G D++ + + + + S
Sbjct: 63 YCVKNCGERFHQEVASNDFAVELLKF-DKSKLLGDDVSTNVREKAIELVQLWAS 115
>gnl|CDD|216488 pfam01417, ENTH, ENTH domain. The ENTH (Epsin N-terminal homology)
domain is found in proteins involved in endocytosis and
cytoskeletal machinery. The function of the ENTH domain
is unknown.
Length = 124
Score = 63.3 bits (155), Expect = 3e-12
Identities = 23/114 (20%), Positives = 45/114 (39%), Gaps = 7/114 (6%)
Query: 22 LAKAVCKATTEELIGPKKKHLDYLLHCTNEANVSIPDLATLLIER--TQNSSSVVVFKAL 79
V +AT + GP + + T + V ++ +L +R + V+KAL
Sbjct: 4 TELKVREATNNDPWGPSGTLMAEIARLTYDY-VEFFEIMEVLWKRLNDSGKNWRHVYKAL 62
Query: 80 ITIHHLMCYGNERFTQYLASSNCSL--QLGNFLDKSGIHGYDMTPFIHRYSRYI 131
+ +L+ G+ER L + + L +F G D + + ++ I
Sbjct: 63 TLLEYLLKNGSERVVDDL-REHIYIIRTLRDFHYIDE-DGKDQGINVRKKAKQI 114
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 39.1 bits (91), Expect = 0.006
Identities = 36/189 (19%), Positives = 62/189 (32%), Gaps = 12/189 (6%)
Query: 316 SSAFSAVSGADLEESLKKQALAEEEAILNQYKAKVSSPTSSMPSNAASTNPFLASPTQPI 375
++A AV+ A S +ALA + +P + + A+ A+ +P+
Sbjct: 413 AAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPV 472
Query: 376 VDLFGSAPASVEPVQTSKPSDDLLQLGNPFVDDFTGAAAPGSQPPVNNAWTSNGFNMNTT 435
+APA P P+DD +F + AP W + T
Sbjct: 473 AAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA-SPAPAQPDAAPAGWVAESIPDPAT 531
Query: 436 SVQQDSFVSDKAFSSVFGDLDPSKTMPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQ- 494
+ D+F + P+ P + RP S + + F
Sbjct: 532 ADPDDAFETLAP--------APAAAPAPRAAAATEPVVAPRPPRASASG--LPDMFDGDW 581
Query: 495 PVGAPRMPM 503
P A R+P+
Sbjct: 582 PALAARLPV 590
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 35.6 bits (82), Expect = 0.066
Identities = 23/78 (29%), Positives = 34/78 (43%), Gaps = 12/78 (15%)
Query: 457 PSKTMPPPSTVPGALGAN-YRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGS 515
+ +P S + GA+G Y G F QP+G PRM MMP+ G G
Sbjct: 381 RMRQLPMGSPMGGAMGQPPYYGQGPQQQ-------FNGQPLGWPRMSMMPTPMG---PGG 430
Query: 516 PRQVPSGQPPVASTQLDP 533
P + P+G P+ + +
Sbjct: 431 PLR-PNGLAPMNAVRAPS 447
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 34.1 bits (78), Expect = 0.17
Identities = 19/112 (16%), Positives = 27/112 (24%), Gaps = 1/112 (0%)
Query: 410 TGAAAPGSQPPVNNAWTSNGFNMNTTSVQQDSFVSDKAFSSVFGDLDPSKTMPPPSTVPG 469
+ V A + S QD + + G T P PG
Sbjct: 155 NTTPVADAPSTVLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPG 214
Query: 470 ALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGSPRQVPS 521
P+ + PV A P + PL + V S
Sbjct: 215 TPAQPLTPLAAEAQA-KAEVISTPSPVTAAASPTITPHQTQPLPTAAAPVLS 265
>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein
[RNA processing and modification].
Length = 654
Score = 33.9 bits (77), Expect = 0.24
Identities = 43/178 (24%), Positives = 60/178 (33%), Gaps = 31/178 (17%)
Query: 360 NAASTNPFLASPTQPIVD-LFGSAPASVEPVQTSKPSDDLLQLGNPFVDDFTGAAAPGSQ 418
T P + + F +++ QT L +GN A PG
Sbjct: 463 KPYFTAPTWLNTIEESYKTFFPDEDTAIQEAQTRFQQRQLNSMGN---------AVPGMN 513
Query: 419 PPV-NNAWTSNGFNMNTTSVQQDSFVSDKAFSSVFGDL----DPSKTMPPPSTVPGALGA 473
P + N GF M S + ++ A S+ + P P P +P +G+
Sbjct: 514 PAMGMNMGGMMGFPMGGPSASPNPMMNGFAAGSMGMYMPFQPQPMFYHPSPQMMP-VMGS 572
Query: 474 NYRPMGQSMT----PFYTDNAFPAQPVGAPRMPMMPSSGGTP---LMGSPRQVPSGQP 524
N G P A P P+GA P GG P +MGS PSG P
Sbjct: 573 NGAEEGGGNISPHVPAGFMAAGPGAPMGAFGYP-----GGIPFQGMMGSG---PSGMP 622
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 32.2 bits (73), Expect = 0.39
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 495 PVGAPRMPMMPSSGGTPLMGSPRQVPSGQPPVA 527
+ P+ PMMP G P++ Q P QPP
Sbjct: 75 LILPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQ 107
>gnl|CDD|202094 pfam02025, IL5, Interleukin 5.
Length = 112
Score = 30.6 bits (69), Expect = 0.60
Identities = 23/67 (34%), Positives = 33/67 (49%), Gaps = 16/67 (23%)
Query: 189 LRNGVING-AFMLLFRDLIRLFAGYNDSIINLLEKYFDMNKKQCKEALDLYKKFLIRMDR 247
L+N + G A LF++L +L++KY D KK+C E K+FL D
Sbjct: 53 LKNQTVQGGAVEKLFQNL------------SLIKKYIDRQKKKCGEERRRVKQFL---DY 97
Query: 248 VAEFLKV 254
+ EFL V
Sbjct: 98 LQEFLGV 104
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 31.7 bits (72), Expect = 0.68
Identities = 11/57 (19%), Positives = 15/57 (26%), Gaps = 1/57 (1%)
Query: 462 PPPSTVPGALGANYRPMGQSMTPFYTD-NAFPAQPVGAPRMPMMPSSGGTPLMGSPR 517
P P + L Q++ P+ G P P GG R
Sbjct: 138 PDPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQYGGGRGNYDEFR 194
>gnl|CDD|233646 TIGR01936, nqrA, NADH:ubiquinone oxidoreductase,
Na(+)-translocating, A subunit. This model represents
the NqrA subunit of the six-protein, Na(+)-pumping
NADH-quinone reductase of a number of marine and
pathogenic Gram-negative bacteria. This oxidoreductase
complex functions primarily as a sodium ion pump
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 447
Score = 31.7 bits (72), Expect = 1.1
Identities = 22/81 (27%), Positives = 34/81 (41%), Gaps = 7/81 (8%)
Query: 306 KSGVNSFSTTSSA-FSAVSGADLEESLKKQALAEEEAILNQYKAKVSSPTSSMPS---NA 361
K G F+T + +S ++E+L + L AI + V++P SS A
Sbjct: 94 KDGDGEFATFPTYDLQTLSAEQIKENLLESGLWT--AIKQRPFDIVATPDSSPRDIFITA 151
Query: 362 ASTNPFLASPTQPIVDLFGSA 382
A T P P ++ F SA
Sbjct: 152 ADTRPLAPDPEV-VLAEFSSA 171
>gnl|CDD|224542 COG1627, COG1627, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 419
Score = 31.4 bits (71), Expect = 1.4
Identities = 28/121 (23%), Positives = 46/121 (38%), Gaps = 11/121 (9%)
Query: 240 KFLIRMDRVAEFLKVAENVGI---DKGDIPDLTKAPSSLLEALEQHLATLEGKKSAAGTP 296
L R V L + V I G + + + +A E + ++
Sbjct: 259 LVLPRDSVVGGLLSPGDKVTIYAKSNGTKEIVPEGRVDAVLVAASSIAVYESEDKSS--- 315
Query: 297 TQASHRNDVKSGVNSFSTTSSAFSAVSGADLEESLKKQAL-----AEEEAILNQYKAKVS 351
Q+++ N G N S++SS+ S DL E LK A A+ +A L +Y K+
Sbjct: 316 QQSTYTNYNPLGENQSSSSSSSASYSYSVDLREILKAIAAGRLDPAKIKAELGKYGWKLL 375
Query: 352 S 352
Sbjct: 376 D 376
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 31.7 bits (72), Expect = 1.4
Identities = 26/130 (20%), Positives = 36/130 (27%), Gaps = 10/130 (7%)
Query: 396 DDLLQLGNPFVDDFTGAAAPGSQPPVNNAWTSNGFNMNTTSVQQDSFVSDKAFSSVFGDL 455
DL++ + F A + S G ++ ++ V + D
Sbjct: 8 YDLIEAAAEGGEFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTV---VAGAAACDR 64
Query: 456 DPSKTMPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGS 515
T PPP PG T G+P P PSS P
Sbjct: 65 FEPPTGPPPG--PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP-GPSSPDPP---- 117
Query: 516 PRQVPSGQPP 525
P P PP
Sbjct: 118 PPTPPPASPP 127
>gnl|CDD|239628 cd03572, ENTH_epsin_related, ENTH domain, Epsin Related family;
composed of hypothetical proteins containing an
ENTH-like domain. The epsin N-terminal homology (ENTH)
domain is an evolutionarily conserved protein module
found primarily in proteins that participate in
clathrin-mediated endocytosis. A set of proteins
previously designated as harboring an ENTH domain in
fact contains a highly similar, yet unique module
referred to as an AP180 N-terminal homology (ANTH)
domain. ENTH and ANTH (E/ANTH) domains are structurally
similar to the VHS domain and are composed of a
superhelix of eight alpha helices. E/ANTH domains bind
both inositol phospholipids and proteins and contribute
to the nucleation and formation of clathrin coats on
membranes. ENTH domains also function in the
development of membrane curvature through lipid
remodeling during the formation of clathrin-coated
vesicles. E/ANTH-bearing proteins have recently been
shown to function with adaptor protein-1 and GGA
adaptors at the trans-Golgi network, which suggests
that E/ANTH domains are universal components of the
machinery for clathrin-mediated membrane budding.
Length = 122
Score = 29.6 bits (67), Expect = 1.5
Identities = 15/40 (37%), Positives = 20/40 (50%)
Query: 58 DLATLLIERTQNSSSVVVFKALITIHHLMCYGNERFTQYL 97
+L L++R + SS V K L I HL GN F + L
Sbjct: 38 ELLEYLLKRLKRSSPHVKLKVLKIIKHLCEKGNSDFKREL 77
>gnl|CDD|130302 TIGR01235, pyruv_carbox, pyruvate carboxylase. This enzyme plays a
role in gluconeogensis but not glycolysis [Energy
metabolism, Glycolysis/gluconeogenesis].
Length = 1143
Score = 31.0 bits (70), Expect = 2.1
Identities = 36/122 (29%), Positives = 52/122 (42%), Gaps = 18/122 (14%)
Query: 188 DLRNGVINGAFMLLFRDLIRL-FAGYNDSIINLLEKYFDMNKKQCK-------EALDLYK 239
DLR GV N F +L R + + Y D+++ KYF Q ++L+ +
Sbjct: 597 DLRKGVPNILFQMLLRGANGVGYTNYPDNVV----KYFVKQAAQGGIDIFRVFDSLNWVE 652
Query: 240 KFLIRMDRVAEFLKVAENVGIDKGDIPDLTKAP------SSLLEALEQHLATLEGKKSAA 293
+ MD VAE KV E GDI D + ++L LE+ A + G K A
Sbjct: 653 NMRVGMDAVAEAGKVVEAAICYTGDILDPARPKYDLKYYTNLAVELEKAGAHILGIKDMA 712
Query: 294 GT 295
G
Sbjct: 713 GL 714
>gnl|CDD|226911 COG4536, CorB, Putative Mg2+ and Co2+ transporter CorB [Inorganic
ion transport and metabolism].
Length = 423
Score = 30.7 bits (70), Expect = 2.1
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 3/68 (4%)
Query: 15 HSLAGQGLAKAVCKATTEELIGPKKKHLDYLLHCTNEANVSIPDLATLLIERTQNSSSVV 74
LA QG A E+L+ + + +L N N+ LAT+L R + V
Sbjct: 35 RHLAKQGNRGA---KRVEKLLEKPDRLIGTILIGNNLVNILASALATILGIRLYGDAGVA 91
Query: 75 VFKALITI 82
+ ++T
Sbjct: 92 IATGVLTF 99
>gnl|CDD|182985 PRK11130, moaD, molybdopterin synthase small subunit; Provisional.
Length = 81
Score = 28.4 bits (64), Expect = 2.3
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 4/45 (8%)
Query: 245 MDRVAEFLKVAENVGIDKGDIPDLTKAPSSLLEALEQHLATLEGK 289
M +V F +V E VG D ++ A +EAL QHLA +
Sbjct: 1 MIKVLFFAQVRELVGTDALELA----ADFPTVEALRQHLAQKGDR 41
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 30.7 bits (69), Expect = 2.5
Identities = 19/77 (24%), Positives = 26/77 (33%), Gaps = 14/77 (18%)
Query: 457 PSKTMPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGSP 516
P +++PPP P RP ++T P Q PR P+ G P
Sbjct: 2564 PDRSVPPPRPAP-------RPSEPAVTSRARRPDAPPQS-ARPRAPVDDR--GDP----R 2609
Query: 517 RQVPSGQPPVASTQLDP 533
P P + DP
Sbjct: 2610 GPAPPSPLPPDTHAPDP 2626
Score = 28.8 bits (64), Expect = 9.7
Identities = 33/183 (18%), Positives = 48/183 (26%), Gaps = 18/183 (9%)
Query: 353 PTSSMPSNAASTNPFLASPTQPIVDLFGSAPASVEPVQTSKPSDDLLQLGNPFVDDFTGA 412
+ + P A P L A AS+ + S PS P+ A
Sbjct: 2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS--------PWDPADPPA 2809
Query: 413 AAPGSQPPVNNAWTSNGFNMNTTSVQQDSFVSDKAFSSVFGDLD-------PSKTMPPPS 465
A + A + G TS Q + L + PP
Sbjct: 2810 AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSR 2869
Query: 466 TVPGALGANYRPMGQSMT---PFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGSPRQVPSG 522
+ A RP + + + +F P R P + P P
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
Query: 523 QPP 525
QPP
Sbjct: 2930 QPP 2932
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 30.1 bits (68), Expect = 2.8
Identities = 16/80 (20%), Positives = 22/80 (27%), Gaps = 3/80 (3%)
Query: 449 SSVFGDLDPSKTMPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMP---MMP 505
L P T PG L P+ + P + PAQ ++ P
Sbjct: 97 PPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAP 156
Query: 506 SSGGTPLMGSPRQVPSGQPP 525
+SG P Q
Sbjct: 157 ASGQLPSQQQSAQKNDESQL 176
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.4 bits (68), Expect = 3.1
Identities = 20/65 (30%), Positives = 24/65 (36%), Gaps = 8/65 (12%)
Query: 461 MPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGSPRQVP 520
PP+ PGA P P A +P GAP P +G T + PR P
Sbjct: 764 ARPPAAAPGAPT----PQPPPQAP----PAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAP 815
Query: 521 SGQPP 525
Q P
Sbjct: 816 GQQGP 820
Score = 29.3 bits (65), Expect = 6.9
Identities = 15/71 (21%), Positives = 21/71 (29%), Gaps = 6/71 (8%)
Query: 461 MPPPSTVPGALGANYRPMGQSMTPFYTDNAFP--AQPVGAPRMPMMPSSGGTPLMGSPRQ 518
M PP+ PG + + A P A+P A P + P
Sbjct: 704 MRPPAAPPGRA----QRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 759
Query: 519 VPSGQPPVAST 529
P P A+
Sbjct: 760 APGRARPPAAA 770
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 30.5 bits (69), Expect = 3.3
Identities = 39/250 (15%), Positives = 67/250 (26%), Gaps = 18/250 (7%)
Query: 297 TQASHRNDVKSGVNSFSTTSSAFSAVSGADLEESLKKQALAEEEAILNQYKAKVSSPTSS 356
+A+ +G+ S + + ++ S A S K+ + + + A +
Sbjct: 16 PEATPSAGAPTGLQQSSESPTQRASHSLA----SEGKKNRKKMPKVFQKSSAPRQIQAAP 71
Query: 357 MPSNAASTNPFLASPTQPIVDLFGSAPASVEPVQTSKPSDDLLQLGNPFVDDFTGAAAPG 416
+ + +S + +L + P Q + S +L
Sbjct: 72 PQALNPTAAAPQSSRGPTLRELL-ALPEDDGETQAPESSPSARRLTRS-EGVARHEMEDL 129
Query: 417 SQPPVNNAWTSNGFNMNTTSVQQDSFVSDKAFSSVFGDLDPSKTMPPPSTVPG------- 469
+ PV + + + S S G S
Sbjct: 130 AGRPVVKP--DADRQLRQDILNKSSSSRRPPVSKEEGTSSKMPATALASAALFKDDEIRQ 187
Query: 470 -ALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSS-GGTPLMGSPRQVPSGQPPVA 527
A QS N P APR PM+ S GG + QP +
Sbjct: 188 EVDAARSDQASQSRLSRSRGNPPAIPPDAAPRQPMLTRSAGGRFEGEDENLERNLQPQ-S 246
Query: 528 STQLDPFGSL 537
LD G L
Sbjct: 247 PITLDKKGKL 256
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 29.0 bits (65), Expect = 3.6
Identities = 12/51 (23%), Positives = 20/51 (39%)
Query: 477 PMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMGSPRQVPSGQPPVA 527
P ++ P + PAQ P+ P+MP G + + P+ P
Sbjct: 47 PPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQ 97
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 28.1 bits (63), Expect = 3.6
Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 2/42 (4%)
Query: 201 LFRDLIRLFAGYNDSIINLLEKYFDMNKKQCKEALDLYKKFL 242
+ + L + NL Y+ + K +EAL+ Y+K L
Sbjct: 56 DYEKALELDPDNAKAYYNLGLAYYKLGKY--EEALEAYEKAL 95
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 29.7 bits (67), Expect = 4.5
Identities = 30/157 (19%), Positives = 53/157 (33%), Gaps = 35/157 (22%)
Query: 250 EFLKVAENVGIDKGDIPDLTKAPSSLLEALEQHLATLEGKKSAAGTPTQASHRNDVKSGV 309
+ K+ ++ G + D+P+ S L+ + + + L+ SH V +
Sbjct: 198 KVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLE-----QTESHLEKVLVKI 252
Query: 310 ---------------------NSFSTTSSAFSA---VSGADLEESLKK--QALAEEEAIL 343
N F+ + A V DLE+ LK + E +
Sbjct: 253 ADELLAWDEQVSKEKAVYETLNLFNYDTKTLIAEGWVPAKDLEK-LKAALENATEGSGSV 311
Query: 344 NQYKAKVSSPTSSMPSNAASTNPFLASPTQPIVDLFG 380
+ + T+ P N F +P Q IVD +G
Sbjct: 312 PSIENDI--ETNEEPPTYLKNNKF-TAPFQMIVDAYG 345
>gnl|CDD|201213 pfam00406, ADK, Adenylate kinase.
Length = 186
Score = 28.8 bits (65), Expect = 4.6
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 154 DGLLRTMPANKLLKSLPVLQSQIDALLEFDCNNSDL 189
DG RT+P + L+ + ++D ++EFD + L
Sbjct: 80 DGFPRTVPQAEALEEMLEYGIKLDYVIEFDVPDEVL 115
>gnl|CDD|218745 pfam05783, DLIC, Dynein light intermediate chain (DLIC). This
family consists of several eukaryotic dynein light
intermediate chain proteins. The light intermediate
chains (LICs) of cytoplasmic dynein consist of multiple
isoforms, which undergo post-translational modification
to produce a large number of species. DLIC1 is known to
be involved in assembly, organisation, and function of
centrosomes and mitotic spindles when bound to
pericentrin. DLIC2 is a subunit of cytoplasmic dynein 2
that may play a role in maintaining Golgi organisation
by binding cytoplasmic dynein 2 to its Golgi-associated
cargo.
Length = 490
Score = 29.4 bits (66), Expect = 5.0
Identities = 26/147 (17%), Positives = 49/147 (33%), Gaps = 29/147 (19%)
Query: 336 LAEEEAILNQYKAKVSSPTSSMPSNAASTNPFLASPTQPIVDLFGSAPASVEPVQTSKPS 395
L +++++L + A + S P+ + +P + + P ASV P+ + K
Sbjct: 354 LMKQQSLLAKQPATPTRGVES-PARSPGGSPRTTNRSGP------RNVASVSPMTSVKKI 406
Query: 396 DDLLQLGNP-------FVDDF------------TGAAAPGSQPPVNNAWTSNGFNMNTTS 436
D ++ G F + G + G+ +G T
Sbjct: 407 DPNMKPGAASEGVLANFFNSLLSKKTGSPGSPGGGGSPAGTGTNTQGTAKKSGQKPVLTD 466
Query: 437 VQQDSFVSDKAFSSVFGDLDPSKTMPP 463
VQ + D+ DLDP+
Sbjct: 467 VQAE---LDRMSRKPDKDLDPNSQTEC 490
>gnl|CDD|234647 PRK00118, PRK00118, putative DNA-binding protein; Validated.
Length = 104
Score = 27.9 bits (63), Expect = 5.0
Identities = 13/44 (29%), Positives = 20/44 (45%), Gaps = 8/44 (18%)
Query: 219 LLEKYFDMNKKQCKEALDLYKKFLIRMDRVAEFLKVAENVGIDK 262
LLE Y +E L LY+KF+ R + + + E D+
Sbjct: 60 LLEDY--------EEKLHLYEKFIERNELFDKIAYLKEKYPDDE 95
>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
secretion system, contains a FHA domain [Intracellular
trafficking, secretion, and vesicular transport;
Signal transduction mechanisms].
Length = 430
Score = 29.4 bits (66), Expect = 5.2
Identities = 21/81 (25%), Positives = 31/81 (38%), Gaps = 8/81 (9%)
Query: 454 DLDPSKTMPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLM 513
D++P +PP T P L P G S P +A ++ + A +GG P +
Sbjct: 135 DIEPEPRLPPGLTDPLQLADPV-PNGFSADPL---SALQSESLIAQP----DPAGGAPSI 186
Query: 514 GSPRQVPSGQPPVASTQLDPF 534
+ P P A PF
Sbjct: 187 SRNSEAPLSDPASAGGIDTPF 207
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 28.9 bits (64), Expect = 6.4
Identities = 17/71 (23%), Positives = 18/71 (25%), Gaps = 6/71 (8%)
Query: 461 MPPPSTVPG---ALGANYRP---MGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTPLMG 514
MPP PG G RP +M P F QP G G
Sbjct: 198 MPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPG 257
Query: 515 SPRQVPSGQPP 525
PP
Sbjct: 258 FNPNQGMNPPP 268
>gnl|CDD|224640 COG1726, NqrA, Na+-transporting NADH:ubiquinone oxidoreductase,
subunit NqrA [Energy production and conversion].
Length = 447
Score = 28.9 bits (65), Expect = 6.7
Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 4/37 (10%)
Query: 349 KVSSPTSSMPS---NAASTNPFLASPTQPIVDLFGSA 382
KV +P S+ + NA TNP A P + ++ F A
Sbjct: 138 KVPAPDSTPRAIFVNAMDTNPLAADP-EVVIAEFSEA 173
>gnl|CDD|165315 PHA03016, PHA03016, hypothetical protein; Provisional.
Length = 441
Score = 28.8 bits (64), Expect = 7.3
Identities = 20/94 (21%), Positives = 45/94 (47%), Gaps = 20/94 (21%)
Query: 152 KEDGLLRTMPANKLLKSLPVLQSQIDALLEFDCNNSDLRNGVINGAFMLLFRDLIRLFAG 211
+ ++ P + K+L IDA++ N+++ G+ + +
Sbjct: 357 IDTKIIDICPKKYIKKALKKAIKIIDAII-----NNNIEAGINDKEY------------E 399
Query: 212 YNDSIINLLEKYFDMNKKQC---KEALDLYKKFL 242
Y+++I++ + F++NK +C K+ DLYK++L
Sbjct: 400 YDENILDKIANEFNLNKIECDKIKKNKDLYKRYL 433
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 29.0 bits (65), Expect = 7.6
Identities = 17/77 (22%), Positives = 21/77 (27%), Gaps = 2/77 (2%)
Query: 462 PPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGA-PRMPMMPSSGGTPLMGSPRQVP 520
PPP P G + + P P QP P G P + P
Sbjct: 240 PPPLQQPQFPGLSQQMPPPPPQP-PQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPP 298
Query: 521 SGQPPVASTQLDPFGSL 537
QP + P G
Sbjct: 299 PQQPQLLPLVQQPQGQQ 315
>gnl|CDD|233007 TIGR00522, dph5, diphthine synthase. Alternate name: diphthamide
biosynthesis S-adenosylmethionine-dependent
methyltransferase. This protein participates in the
modification of a specific His of elongation factor 2 of
eukarotes and Archaea to diphthamide. The protein was
characterized in Saccharomyces cerevisiae and designated
DPH5 [Protein fate, Protein modification and repair].
Length = 257
Score = 28.6 bits (64), Expect = 7.7
Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 3/34 (8%)
Query: 465 STVPGALG-ANYRPMGQSMT-PFYTDNAFPAQPV 496
S V G G Y+ G++ T F+TDN P P
Sbjct: 116 SAVCGLTGLQLYK-FGKTATIVFFTDNYRPQTPY 148
>gnl|CDD|149487 pfam08440, Poty_PP, Potyviridae polyprotein. This domain is found
in polyproteins of the viral Potyviridae taxon.
Length = 274
Score = 28.7 bits (65), Expect = 8.1
Identities = 20/87 (22%), Positives = 30/87 (34%), Gaps = 22/87 (25%)
Query: 163 NKLLKSLPVLQSQIDALLEFDCNNSDLRNGVINGAFMLLFRDLIRLFAGYNDSIINLLEK 222
+ +++ VLQ LLEF N D N I + LE
Sbjct: 207 DHTEENISVLQQAKAQLLEFSNLNGDGSNEEILRDY-------------------GALET 247
Query: 223 YFDMNKKQCKEALDL---YKKFLIRMD 246
+K++ + L L + K LI D
Sbjct: 248 VQHQSKEEVAKHLGLKGKWNKSLITKD 274
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 28.4 bits (63), Expect = 9.7
Identities = 15/55 (27%), Positives = 17/55 (30%), Gaps = 2/55 (3%)
Query: 457 PSKTMPPPSTVPGALGANYRPMGQSMTPFYTDNAFPAQPVGAPRMPMMPSSGGTP 511
PS T PPST A TP P P G P ++ P
Sbjct: 224 PSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPP--ANATPAP 276
>gnl|CDD|143356 cd07851, STKc_p38, Catalytic domain of the Serine/Threonine Kinase,
p38 Mitogen-Activated Protein Kinase. Serine/Threonine
Kinases (STKs), p38 subfamily, catalytic (c) domain.
STKs catalyze the transfer of the gamma-phosphoryl group
from ATP to serine/threonine residues on protein
substrates. The p38 subfamily is part of a larger
superfamily that includes the catalytic domains of other
protein STKs, protein tyrosine kinases, RIO kinases,
aminoglycoside phosphotransferase, choline kinase, and
phosphoinositide 3-kinase. p38 kinases are
mitogen-activated protein kinases (MAPKs), serving as
important mediators of cellular responses to
extracellular signals. They function in the regulation
of the cell cycle, cell development, cell
differentiation, senescence, tumorigenesis, apoptosis,
pain development and pain progression, and immune
responses. p38 kinases are activated by the MAPK kinases
MKK3 and MKK6, which in turn are activated by upstream
MAPK kinase kinases including TAK1, ASK1, and MLK3, in
response to cellular stresses or inflammatory cytokines.
p38 substrates include other protein kinases and factors
that regulate transcription, nuclear export, mRNA
stability and translation. p38 kinases are drug targets
for the inflammatory diseases psoriasis, rheumatoid
arthritis, and chronic pulmonary disease. Vertebrates
contain four isoforms of p38, named alpha, beta, gamma,
and delta, which show varying substrate specificity and
expression patterns. p38alpha and p38beta are
ubiquitously expressed, p38gamma is predominantly found
in skeletal muscle, and p38delta is found in the heart,
lung, testis, pancreas, and small intestine.
Length = 343
Score = 28.4 bits (64), Expect = 9.7
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 203 RDLIRLFAGYNDSIINLLEKYFDMNKKQ---CKEAL 235
+D +F+G N I+LLEK ++ + EAL
Sbjct: 261 KDFKEVFSGANPLAIDLLEKMLVLDPDKRITAAEAL 296
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.131 0.376
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,926,719
Number of extensions: 2595885
Number of successful extensions: 2599
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2572
Number of HSP's successfully gapped: 71
Length of query: 537
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 435
Effective length of database: 6,413,494
Effective search space: 2789869890
Effective search space used: 2789869890
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.4 bits)