RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy18165
(259 letters)
>gnl|CDD|150843 pfam10232, Med8, Mediator of RNA polymerase II transcription
complex subunit 8. Arc32, or Med8, is one of the
subunits of the Mediator complex of RNA polymerase II.
The region conserved contains two alpha helices
putatively necessary for binding to other subunits
within the core of the Mediator complex. The N-terminus
of Med8 binds to the essential core Head part of
Mediator and the C-terminus hinges to Med18 on the
non-essential part of the Head that also includes Med20.
Length = 226
Score = 177 bits (449), Expect = 3e-55
Identities = 99/265 (37%), Positives = 134/265 (50%), Gaps = 45/265 (16%)
Query: 1 MQR-EEKQLESSLESVVMKLNEIKSQLVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAI 59
MQR EKQL SLE+V +L ++K L ++ K++ + + +WPTF D FA+LS Q ++
Sbjct: 1 MQRTSEKQLVDSLEAVRNRLAQLKQSLTSLIDKLEQEYELPSWPTFQDQFAILSSQLVSL 60
Query: 60 AKVLSHDKTPPIRNLTVLPLILNPEREEELLRLTEGRVPAFTHDLVPDYLRTKQEPDVEH 119
K+L+H+K +RN V PL L P R++E LV LRTK +P+VE
Sbjct: 61 TKILAHEK-DLLRNTVVYPLPLFPTRDQE--------------PLVTTLLRTKPDPEVEE 105
Query: 120 KLIQLEHKAASLSYDAAQKQA-----ATFNKVITHVLDIVSKAREEWESEAGARGGTGQT 174
+ Q KA + + D A KQ NK +++ LD V+K REEWE E+ R QT
Sbjct: 106 WMKQARQKAENSTKDTAAKQEVEIDILQQNKEVSNWLDKVTKLREEWEFESSHRVEIEQT 165
Query: 175 SSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPM 234
S DT+ LVAAV G+GL R P G M GQP
Sbjct: 166 SEEEDTNQLVAAVKDGKGL-------------------RPAIRQASPMGGKQMSQAGQP- 205
Query: 235 GMMGKNPSGIKTNIKAAAAIHPYSR 259
GK PS I T IK + ++HPY R
Sbjct: 206 ---GKMPSDILTFIK-SGSVHPYQR 226
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 34.1 bits (78), Expect = 0.043
Identities = 19/58 (32%), Positives = 20/58 (34%), Gaps = 1/58 (1%)
Query: 186 AVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
A G G G PVG G G G P GP G G GG P G +
Sbjct: 22 APGQPAQQGYANQGMGGPVGGGGG-PGAGGGAPGGPVGGGGGGSGGPPGGGEVAGEAE 78
Score = 29.9 bits (67), Expect = 1.0
Identities = 20/50 (40%), Positives = 20/50 (40%), Gaps = 4/50 (8%)
Query: 198 PGPGAPVGPGM----MVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
P PG P G M P G G G G G P GP G G G P G
Sbjct: 21 PAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPGG 70
Score = 27.9 bits (62), Expect = 5.5
Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 1/46 (2%)
Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
PG G G G + GG G G P G GG G MG G
Sbjct: 185 PGGGGYGGGGGGMGGGGGGGMGEMGGMGPQGGGG-GYGGMGAPGQG 229
>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein
[RNA processing and modification].
Length = 654
Score = 33.5 bits (76), Expect = 0.10
Identities = 26/62 (41%), Positives = 33/62 (53%), Gaps = 9/62 (14%)
Query: 201 GAPVGPGMMVAPGGPGRPVGPGPGMPMG----PGGQPM-GMMGKNPSGIKTNIKAAAAIH 255
GA G G ++P P + GPG PMG PGG P GMMG PSG+ N +A+H
Sbjct: 574 GAEEG-GGNISPHVPAGFMAAGPGAPMGAFGYPGGIPFQGMMGSGPSGMPAN---GSAMH 629
Query: 256 PY 257
+
Sbjct: 630 SH 631
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 32.9 bits (75), Expect = 0.15
Identities = 16/63 (25%), Positives = 20/63 (31%), Gaps = 7/63 (11%)
Query: 189 MGRGLKMIMPGPGAPVGPGMMVAPGG----PGRPVGPGPGMPMGPGGQPMGMMGKNPSGI 244
R L M P GA P G G+P+G M P G P+G+
Sbjct: 381 RMRQLPMGSPMGGAMGQP-PYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGG--PLRPNGL 437
Query: 245 KTN 247
Sbjct: 438 APM 440
Score = 28.6 bits (64), Expect = 4.1
Identities = 18/60 (30%), Positives = 21/60 (35%), Gaps = 7/60 (11%)
Query: 190 GRGLKMIMPG--PGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQP----MGMMGKNPSG 243
G+G + G G P MM P GPG P+ P PM P K P
Sbjct: 402 GQGPQQQFNGQPLGWPRMS-MMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQ 460
>gnl|CDD|178035 PLN02414, PLN02414, glycine dehydrogenase (decarboxylating).
Length = 993
Score = 32.8 bits (75), Expect = 0.19
Identities = 22/61 (36%), Positives = 28/61 (45%), Gaps = 8/61 (13%)
Query: 199 GPG-APVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIKTNIKAAAAIHPY 257
GPG P+G +AP P PV P G+P QP+G + P G +A I P
Sbjct: 739 GPGMGPIGVKKHLAPFLPSHPVVPTGGIPRPEKTQPLGTISAAPWG-------SALILPI 791
Query: 258 S 258
S
Sbjct: 792 S 792
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 32.3 bits (73), Expect = 0.22
Identities = 20/48 (41%), Positives = 20/48 (41%), Gaps = 6/48 (12%)
Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPG----PGMPMGPGGQPMGMMGKNP 241
P PG P MM PG PG PG PMGP Q G NP
Sbjct: 215 PRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQ--RPGFNP 260
Score = 30.4 bits (68), Expect = 0.89
Identities = 16/46 (34%), Positives = 16/46 (34%), Gaps = 5/46 (10%)
Query: 197 MPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPS 242
M PG PG PG P GP M PG P M P
Sbjct: 229 MNRPGFRPQPGGFNHPGTP-----MGPNMQQRPGFNPNQGMNPPPH 269
>gnl|CDD|182724 PRK10780, PRK10780, periplasmic chaperone; Provisional.
Length = 165
Score = 31.4 bits (71), Expect = 0.29
Identities = 13/47 (27%), Positives = 20/47 (42%)
Query: 110 RTKQEPDVEHKLIQLEHKAASLSYDAAQKQAATFNKVITHVLDIVSK 156
RTK E DV + KA + D ++ NK++T + V
Sbjct: 85 RTKLEKDVMAQRQTFSQKAQAFEQDRRRRSNEERNKILTRIQTAVKS 131
>gnl|CDD|224213 COG1294, AppB, Cytochrome bd-type quinol oxidase, subunit 2 [Energy
production and conversion].
Length = 346
Score = 30.7 bits (70), Expect = 0.77
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 26 LVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVL 63
L G+ +++ L++ L+ FALL G + VL
Sbjct: 144 LQGVPIELNGGYAGLSFDQLLNPFALLCGLGLVLMYVL 181
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.4 bits (68), Expect = 1.0
Identities = 23/103 (22%), Positives = 38/103 (36%), Gaps = 10/103 (9%)
Query: 158 REEWESEAGARGGTGQTSSLTDTHSLVAAVGMG--------RGLKMIMPGPGAP-VGPGM 208
+ +++ G +G T+S TD ++ + G+G G P +G
Sbjct: 137 KGSKKTQPGVSTSSGSTTSGTDLNTKQSQTGLGASGSHAQQDPAVSQSGVVGVPGLGVPG 196
Query: 209 MVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIKTNIKAA 251
+ PGG G PG G+ G P +G I A+
Sbjct: 197 VGVPGGGGAGALPGVGVG-RAGVSPGVGVGGLGGVPGVGILAS 238
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 30.3 bits (69), Expect = 1.0
Identities = 14/46 (30%), Positives = 15/46 (32%)
Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
PG G PG APG P PG G P + P
Sbjct: 394 PGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDA 439
>gnl|CDD|218323 pfam04910, Tcf25, Transcriptional repressor TCF25. Members of this
family are transcriptional repressors. They may act by
increasing histone deacetylase activity at promoter
regions.
Length = 342
Score = 30.0 bits (68), Expect = 1.1
Identities = 12/50 (24%), Positives = 23/50 (46%), Gaps = 2/50 (4%)
Query: 29 MLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVLSHDKTPPIRNLTVLP 78
+L +D + D L +D +AL S ++ + +NL++LP
Sbjct: 125 LLLSLDPEEDPLGALLLIDYYALRSKEYEWLIDFAESPLA--YKNLSLLP 172
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 29.2 bits (65), Expect = 2.4
Identities = 18/76 (23%), Positives = 26/76 (34%), Gaps = 8/76 (10%)
Query: 11 SLESVVMKLNEIKSQLVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVLSHDKTPP 70
S + L+ + S+ M FK+ D+ DN AL A K L
Sbjct: 162 SEDETASALD-LVSKKGKMAFKLHQIDDAE-----HDNKAL--WFQHAQKKYLFTSNEKE 213
Query: 71 IRNLTVLPLILNPERE 86
+ LT + E E
Sbjct: 214 LDELTKSMIPKIREGE 229
>gnl|CDD|240095 cd04747, OYE_like_5_FMN, Old yellow enzyme (OYE)-related FMN
binding domain, group 5. Each monomer of OYE contains
FMN as a non-covalently bound cofactor, uses NADPH as a
reducing agent with oxygens, quinones, and
alpha,beta-unsaturated aldehydes and ketones, and can
act as electron acceptors in the catalytic reaction.
Other members of OYE family include trimethylamine
dehydrogenase, 2,4-dienoyl-CoA reductase, enoate
reductase, pentaerythriol tetranitrate reductase,
xenobiotic reductase, and morphinone reductase.
Length = 361
Score = 29.2 bits (66), Expect = 2.4
Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 1/33 (3%)
Query: 187 VGMGRGLKMIMPGPGAPVGPGMMVAPGGP-GRP 218
VG R L P+ P +V PG P GR
Sbjct: 101 VGAMRKLGTPPFPDVPPLSPSGLVGPGKPVGRE 133
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.7 bits (65), Expect = 2.5
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 2/52 (3%)
Query: 118 EHKLIQLEHKAASLSYDAAQKQAATFNK-VITHVLDIVSKAREEWESEAGAR 168
E K+ E +A + + A+K+A K + + + K R E E E R
Sbjct: 26 EAKISSAEEEAKQI-IEEAKKEAEALKKEALLEAKEEIHKLRAEAERELKER 76
>gnl|CDD|221405 pfam12069, DUF3549, Protein of unknown function (DUF3549). This
family of proteins is functionally uncharacterized. This
protein is found in bacteria. Proteins in this family
are about 340 amino acids in length. This protein has a
conserved LDE sequence motif.
Length = 342
Score = 29.1 bits (66), Expect = 2.5
Identities = 14/47 (29%), Positives = 20/47 (42%), Gaps = 5/47 (10%)
Query: 45 FLDNFALLSGQ--FSAIAKVLSHDKTPPIRNLTVLPLILNPEREEEL 89
FL+ A Q F+ + L P +R +L L+ NP R L
Sbjct: 290 FLEALAQTQDQALFNQLFADLVA--IPELRP-QLLMLLRNPNRSPAL 333
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 28.8 bits (64), Expect = 3.2
Identities = 11/27 (40%), Positives = 12/27 (44%)
Query: 212 PGGPGRPVGPGPGMPMGPGGQPMGMMG 238
G P+GPGPG PMG G
Sbjct: 85 RGPQMGPMGPGPGRPMGQQMGGPGTAS 111
Score = 28.0 bits (62), Expect = 6.1
Identities = 24/97 (24%), Positives = 37/97 (38%), Gaps = 7/97 (7%)
Query: 145 KVITHVLDIVSKAREEWESEAGARGGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPV 204
++I H D+ KA ++ + GG G + +L G + + PGPG P+
Sbjct: 45 RLIIHFRDMSKKAAQQQVLQ----GGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPM 100
Query: 205 GPGM--MVAPGGPGRPVGPGPGMPMGPGGQ-PMGMMG 238
G M + + MPMG G P M
Sbjct: 101 GQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQMSR 137
>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182.
Length = 238
Score = 28.3 bits (63), Expect = 3.4
Identities = 20/75 (26%), Positives = 26/75 (34%), Gaps = 5/75 (6%)
Query: 169 GGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMG 228
GG G +L SL + GL I+ G P + G PVG G G +
Sbjct: 47 GGGGNNPNLASLSSLTSQ-----GLGKILSGLQPPPLGNGGGSGAGGPGPVGGGGGPGVA 101
Query: 229 PGGQPMGMMGKNPSG 243
P + PS
Sbjct: 102 PNNIQPNAQAQQPST 116
>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region. The region
described in this family is found towards the N-terminus
of various eukaryotic fork head/HNF-3-related
transcription factors (which contain the pfam00250
domain). These proteins play key roles in embryogenesis,
maintenance of differentiated cell states, and
tumorigenesis.
Length = 137
Score = 27.6 bits (61), Expect = 3.8
Identities = 10/58 (17%), Positives = 16/58 (27%), Gaps = 1/58 (1%)
Query: 188 GMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGP-GMPMGPGGQPMGMMGKNPSGI 244
G G G G ++P G +GP P G + + +
Sbjct: 64 SPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRASSQNNL 121
Score = 27.6 bits (61), Expect = 4.2
Identities = 13/77 (16%), Positives = 18/77 (23%), Gaps = 2/77 (2%)
Query: 163 SEAGARGGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPG 222
AG G +SS + S M M G + P + GP G
Sbjct: 45 GYAGPGAMNGMSSSSMNGMSPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGG 104
Query: 223 PGM--PMGPGGQPMGMM 237
+
Sbjct: 105 SLSPNMSMSRASSQNNL 121
Score = 27.2 bits (60), Expect = 5.9
Identities = 14/63 (22%), Positives = 17/63 (26%), Gaps = 6/63 (9%)
Query: 186 AVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIK 245
M M G P M + G G G MGMMG + G
Sbjct: 32 TSQGSANSSMNMSGYAGPGAMNGMSSSSMNGMSPGYGGAGSP------MGMMGMSSMGTS 85
Query: 246 TNI 248
+
Sbjct: 86 LSP 88
>gnl|CDD|215271 PLN02491, PLN02491, carotenoid 9,10(9',10')-cleavage dioxygenase.
Length = 545
Score = 28.7 bits (64), Expect = 3.9
Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 30/76 (39%)
Query: 12 LESVVMKLNEIKSQLVGMLFKIDHDRDSLNWPTFLDNFALLSGQFSAIAKVLSHDKTPPI 71
LE +++KL SQ L++ LSG F+ + D+TPP+
Sbjct: 34 LEKLIVKLMHDASQ-------------PLHY---------LSGNFAPV-----RDETPPV 66
Query: 72 RNLTV---LPLILNPE 84
++L V LP LN E
Sbjct: 67 KDLPVRGHLPECLNGE 82
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 28.1 bits (63), Expect = 4.3
Identities = 16/56 (28%), Positives = 16/56 (28%), Gaps = 1/56 (1%)
Query: 199 GPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPG-GQPMGMMGKNPSGIKTNIKAAAA 253
G P V P P G G G P G GQ G P AA
Sbjct: 90 GAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPGSFLGGAAQTAA 145
>gnl|CDD|212017 cd11627, HR1_Ste20-like, Protein kinase C-related kinase homology
region 1 (HR1) Rho-binding domain of
Schizosaccharomyces pombe Ste20-like proteins. This
group is composed of predominantly uncharacterized
fungal proteins, which contain two known domains: HR1
at the N-terminal region and REM (Ras exchanger motif)
at the C-terminal region. One member protein from
Schizosaccharomyces pombe is named Ste16 while its gene
is called ste20 (a target of rapamycin complex 2
subunit). It is a subunit in the protein kinase TOR
complexes in fission yeast. The REM domain is usually
found in nucleotide exchange factors for Ras-like small
GTPases. HR1 domains are anti-parallel coiled-coil
(ACC) domains that bind small GTPases from the Rho
family.
Length = 71
Score = 26.2 bits (58), Expect = 4.3
Identities = 10/26 (38%), Positives = 15/26 (57%), Gaps = 2/26 (7%)
Query: 1 MQREEKQLESSLESVVMKLNEIKSQL 26
QR ++ES L S K+ ++ SQL
Sbjct: 41 DQR--ARVESELNSSNRKIAQLTSQL 64
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 28.1 bits (63), Expect = 5.2
Identities = 19/49 (38%), Positives = 25/49 (51%), Gaps = 7/49 (14%)
Query: 138 KQAATFNKVITHVLDIVSKAREEWESEAG---ARGGTGQTSSLTDTHSL 183
KQ N +I+H+ I+ K +E EA AR G G SL DT +L
Sbjct: 174 KQIPQ-NSIISHLKTILEKEGVSYEPEALEILARSGNG---SLRDTLTL 218
>gnl|CDD|215113 PLN02168, PLN02168, copper ion binding / pectinesterase.
Length = 545
Score = 28.0 bits (62), Expect = 5.4
Identities = 12/49 (24%), Positives = 19/49 (38%), Gaps = 3/49 (6%)
Query: 205 GPGMMVAPGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSGIKTNIKAAAA 253
G ++ P P PVGP P+ P + I+ ++ AA
Sbjct: 291 GVALIRYPNSPLDPVGPL---PLAPALHDYFSSVEQALSIRMDLNVGAA 336
>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763). This
eukaryotic family of proteins has no known function.
Length = 91
Score = 26.6 bits (59), Expect = 5.8
Identities = 12/32 (37%), Positives = 13/32 (40%)
Query: 212 PGGPGRPVGPGPGMPMGPGGQPMGMMGKNPSG 243
GG G P PG G MG G G +G
Sbjct: 59 DGGRGGPGPPGGGRRMGRIGGGGGPSRPPMAG 90
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 27.5 bits (61), Expect = 6.0
Identities = 11/38 (28%), Positives = 11/38 (28%), Gaps = 2/38 (5%)
Query: 200 PGAPVGPGMMVAPGGPG--RPVGPGPGMPMGPGGQPMG 235
P P P G P P MP G PM
Sbjct: 132 PLQPPKPDPGGLAPGLPGYPPQTPQALMPYGQPRPPMM 169
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 28.0 bits (62), Expect = 6.9
Identities = 19/61 (31%), Positives = 24/61 (39%), Gaps = 1/61 (1%)
Query: 200 PGAPVGPGMMVAPGGPGRPVGP-GPGMPMGPGGQPMGMMGKNPSGIKTNIKAAAAIHPYS 258
P P+ P P P P GP P +P+G P G + + P K AA P
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883
Query: 259 R 259
R
Sbjct: 2884 R 2884
>gnl|CDD|182670 PRK10715, flk, flagella biosynthesis regulator; Provisional.
Length = 335
Score = 27.7 bits (62), Expect = 7.2
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 213 GGPGRPVGPGPGMPMGPGGQPMGM 236
G PG+P G G P G QP+
Sbjct: 6 GPPGQPPGGGRNPPSTAGDQPLSP 29
>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
Members of this family belong to the collagen
superfamily. Collagens are generally extracellular
structural proteins involved in formation of connective
tissue structure. The alignment contains 20 copies of
the G-X-Y repeat that forms a triple helix. The first
position of the repeat is glycine, the second and third
positions can be any residue but are frequently proline
and hydroxyproline. Collagens are post translationally
modified by proline hydroxylase to form the
hydroxyproline residues. Defective hydroxylation is the
cause of scurvy. Some members of the collagen
superfamily are not involved in connective tissue
structure but share the same triple helical structure.
Length = 60
Score = 25.1 bits (56), Expect = 8.0
Identities = 18/34 (52%), Positives = 18/34 (52%)
Query: 200 PGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGGQP 233
PG P PG PG PG P PGP P GP G P
Sbjct: 6 PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 39
>gnl|CDD|197548 smart00157, PRP, Major prion protein. The prion protein is a major
component of scrapie-associated fibrils in
Creutzfeldt-Jakob disease, kuru, Gerstmann-Straussler
syndrome and bovine spongiform encephalopathy.
Length = 218
Score = 27.1 bits (60), Expect = 8.3
Identities = 21/47 (44%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 198 PGPGAPVGPGMMVAPGGPGRPVGPGPGMPMGPG-GQPMGMMGKNPSG 243
PG G+P G GG G+P G G G P G G GQP G P G
Sbjct: 18 PGQGSPGGNRYPPQGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHG 64
>gnl|CDD|153333 cd07649, F-BAR_GAS7, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Growth Arrest Specific
protein 7. F-BAR domains are dimerization modules that
bind and bend membranes and are found in proteins
involved in membrane dynamics and actin reorganization.
Growth Arrest Specific protein 7 (GAS7) is mainly
expressed in the brain and is required for neurite
outgrowth. It may also play a role in the protection and
migration of embryonic stem cells. Treatment-related
acute myeloid leukemia (AML) has been reported resulting
from mixed-lineage leukemia (MLL)-GAS7 translocations as
a complication of primary cancer treatment. GAS7
contains an N-terminal SH3 domain, followed by a WW
domain, and a central F-BAR domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 233
Score = 27.3 bits (60), Expect = 8.6
Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 111 TKQEPDVEHKLIQLEHKAASLSYDAAQK---QAATFNKVITHVLDIVSKAREEWESE 164
+++ D+E K QLE K ++ + + +K ++ + +D+ ++A+ +W E
Sbjct: 131 LERQKDLEGKTQQLEIKLSNKTEEDIKKARRKSTQAGDDLMRCVDLYNQAQSKWFEE 187
>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
Length = 466
Score = 27.2 bits (60), Expect = 8.9
Identities = 22/74 (29%), Positives = 27/74 (36%), Gaps = 16/74 (21%)
Query: 186 AVGMGRGLKMIM----PGPGAPVGPGMMVA-------PGGPGRP-VGPGP----GMPMGP 229
A G+G GL ++ G GAP G + PG P P P G P GP
Sbjct: 5 AAGLGLGLWSLLWLLCAGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGP 64
Query: 230 GGQPMGMMGKNPSG 243
P K+P
Sbjct: 65 ASPPKSGPPKSPPA 78
>gnl|CDD|163682 cd07944, DRE_TIM_HOA_like, 4-hydroxy-2-oxovalerate aldolase-like,
N-terminal catalytic TIM barrel domain. This family of
bacterial enzymes is sequence-similar to
4-hydroxy-2-oxovalerate aldolase (HOA) but its exact
function is unknown. This family includes the
Bacteroides vulgatus Bvu_2661 protein and belongs to the
DRE-TIM metallolyase superfamily. DRE-TIM metallolyases
include 2-isopropylmalate synthase (IPMS),
alpha-isopropylmalate synthase (LeuA),
3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate
synthase, citramalate synthase, 4-hydroxy-2-oxovalerate
aldolase, re-citrate synthase, transcarboxylase 5S,
pyruvate carboxylase, AksA, and FrbC. These members all
share a conserved triose-phosphate isomerase (TIM)
barrel domain consisting of a core beta(8)-alpha(8)
motif with the eight parallel beta strands forming an
enclosed barrel surrounded by eight alpha helices. The
domain has a catalytic center containing a divalent
cation-binding site formed by a cluster of invariant
residues that cap the core of the barrel. In addition,
the catalytic site includes three invariant residues -
an aspartate (D), an arginine (R), and a glutamate (E) -
which is the basis for the domain name "DRE-TIM".
Length = 266
Score = 27.1 bits (61), Expect = 8.9
Identities = 4/27 (14%), Positives = 9/27 (33%)
Query: 232 QPMGMMGKNPSGIKTNIKAAAAIHPYS 258
M + G + + ++ I P
Sbjct: 128 NLMAISGYSDEELLELLELVNEIKPDV 154
>gnl|CDD|232988 TIGR00468, pheS, phenylalanyl-tRNA synthetase, alpha subunit. Most
phenylalanyl-tRNA synthetases are heterodimeric, with 2
alpha (pheS) and 2 beta (pheT) subunits. This model
describes the alpha subunit, which shows some similarity
to class II aminoacyl-tRNA ligases. Mitochondrial
phenylalanyl-tRNA synthetase is a single polypeptide
chain, active as a monomer, and similar to this chain
rather than to the beta chain, but excluded from this
model. An interesting feature of the alignment of all
sequences captured by this model is a deep split between
non-spirochete bacterial examples and all other
examples; supporting this split is a relative deletion
of about 50 residues in the former set between two
motifs well conserved throughout the alignment [Protein
synthesis, tRNA aminoacylation].
Length = 293
Score = 27.3 bits (61), Expect = 9.1
Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 21/84 (25%)
Query: 10 SSLESVVMKLNEIKSQLVGMLFKID---------HDRDSLNWP----------TF-LDNF 49
SL + ++EI+ +G+ F + + D+LN P TF + +
Sbjct: 69 GSLHPLTRVIDEIRDIFLGLGFTEETGPEVETDFWNFDALNIPQDHPARDMQDTFYIKDR 128
Query: 50 ALLSGQFSAI-AKVLSHDKTPPIR 72
LL +A+ + + + PPIR
Sbjct: 129 LLLRTHTTAVQLRTMEEQEKPPIR 152
>gnl|CDD|221598 pfam12484, PE_PPE_C, Polymorphic PE/PPE proteins C terminal. This
domain family is found in bacteria, and is approximately
90 amino acids in length. The family is found in
association with pfam00823. There is a conserved SVP
sequence motif. There is a single completely conserved
residue W that may be functionally important. The
proteins in this family are PE/PPE proteins implicated
in immunostimulation and virulence.
Length = 81
Score = 25.8 bits (57), Expect = 9.1
Identities = 9/32 (28%), Positives = 9/32 (28%)
Query: 200 PGAPVGPGMMVAPGGPGRPVGPGPGMPMGPGG 231
A PG VA PGMP
Sbjct: 29 AAAAALPGAGVAAAAEAAAGAVLPGMPGAAAA 60
>gnl|CDD|173848 cd01159, NcnH, Naphthocyclinone hydroxylase. Naphthocyclinone is
an aromatic polyketide and an antibiotic, which is
active against Gram-positive bacteria. Polyketides are
secondary metabolites, which have important biological
functions such as antitumor, immunosupressive or
antibiotic activities. NcnH is a hydroxylase involved in
the biosynthesis of naphthocyclinone and possibly other
polyketides.
Length = 370
Score = 27.3 bits (61), Expect = 9.7
Identities = 15/58 (25%), Positives = 21/58 (36%), Gaps = 1/58 (1%)
Query: 165 AGARGGTGQTSSLTDTHSLVAAVGMGRGLKMIMPGPGAPVGPGMMVAPGGPGRPVGPG 222
A A G +S+ THS + A + + + G APGG V G
Sbjct: 63 AEACGSAAWVASIVATHSRMLAAFPPEAQEEVWGDGPDTLLAGS-YAPGGRAERVDGG 119
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.134 0.390
Gapped
Lambda K H
0.267 0.0613 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,381,983
Number of extensions: 1310507
Number of successful extensions: 1295
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1175
Number of HSP's successfully gapped: 91
Length of query: 259
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 164
Effective length of database: 6,723,972
Effective search space: 1102731408
Effective search space used: 1102731408
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 58 (26.4 bits)