RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy447
(417 letters)
>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain.
Length = 96
Score = 201 bits (514), Expect = 1e-64
Identities = 76/96 (79%), Positives = 87/96 (90%)
Query: 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFV 182
KPPYSYI+LITMAIQ SP KMLTLSEIYQ+IMDLFP+YRQN+Q WQNSIRH+LS N CF+
Sbjct: 1 KPPYSYIALITMAIQQSPEKMLTLSEIYQWIMDLFPYYRQNKQGWQNSIRHNLSLNKCFI 60
Query: 183 KVPRTPDKPGKGSFWTLHKDSGNMFENGCYLRRQKR 218
KVPR+PDKPGKGS+WTL +S NMFENG YL+R+KR
Sbjct: 61 KVPRSPDKPGKGSYWTLDPESENMFENGKYLKRRKR 96
>gnl|CDD|214627 smart00339, FH, FORKHEAD. FORKHEAD, also known as a "winged
helix".
Length = 89
Score = 167 bits (425), Expect = 1e-51
Identities = 62/89 (69%), Positives = 73/89 (82%)
Query: 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFV 182
KPPYSYI+LI MAI +SP K LTLSEIY++I D FP+YR+N+ WQNSIRH+LS NDCFV
Sbjct: 1 KPPYSYIALIAMAILSSPDKRLTLSEIYKWIEDNFPYYRENRAGWQNSIRHNLSLNDCFV 60
Query: 183 KVPRTPDKPGKGSFWTLHKDSGNMFENGC 211
KVPR D+PGKGS+WTL + NMFENG
Sbjct: 61 KVPREGDRPGKGSYWTLDPAAENMFENGN 89
>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix". FH is
named for the Drosophila fork head protein, a
transcription factor which promotes terminal rather than
segmental development. This family of transcription
factor domains, which bind to B-DNA as monomers, are
also found in the Hepatocyte nuclear factor (HNF)
proteins, which provide tissue-specific gene regulation.
The structure contains 2 flexible loops or "wings" in
the C-terminal region, hence the term winged helix.
Length = 78
Score = 163 bits (414), Expect = 6e-50
Identities = 54/78 (69%), Positives = 63/78 (80%)
Query: 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFV 182
KPPYSY +LI MAIQ+SP K LTLSEIY++I D FP++R WQNSIRH+LS N CFV
Sbjct: 1 KPPYSYSALIAMAIQSSPEKRLTLSEIYKWISDNFPYFRDAPAGWQNSIRHNLSLNKCFV 60
Query: 183 KVPRTPDKPGKGSFWTLH 200
KVPR PD+PGKGS+WTL
Sbjct: 61 KVPREPDEPGKGSYWTLD 78
>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
[Transcription].
Length = 610
Score = 89.1 bits (221), Expect = 4e-19
Identities = 60/205 (29%), Positives = 87/205 (42%), Gaps = 6/205 (2%)
Query: 13 SPGMTPSYSMNSVINSCSPQSGSTGTPLGGFSSNMHSMGSINGMNGPNCMSTGSMGYSSN 72
P + S S+ + ST TP+ S+ S + T + SS
Sbjct: 233 IPADSSSNLDVSLGHH--ISQPSTHTPVLDNHSSGEENISRINNSSQIDSPTPNYRMSSI 290
Query: 73 MNAACMGGINSYNTALTTGRAELAENSPSPGALQRVARADKTYRRSYTHAKPPYSYISLI 132
+ + N + +++SP P + +KP +SY + I
Sbjct: 291 --DSSVNPSRLANN--KDEGRKGSKSSPVPKDAAPPSTLSDLSADVNRTSKPAFSYANSI 346
Query: 133 TMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPG 192
T AI +SP+ +TLSEIY +I P+YR WQNSIRH+LS N F KVPR+ +PG
Sbjct: 347 TQAILSSPSGKMTLSEIYSWISSNLPYYRHKPTAWQNSIRHNLSLNKSFEKVPRSASQPG 406
Query: 193 KGSFWTLHKDSGNMFENGCYLRRQK 217
KG FW + E+ R K
Sbjct: 407 KGCFWKIDYSYIYEKESKRNPRSPK 431
Score = 74.1 bits (182), Expect = 3e-14
Identities = 39/84 (46%), Positives = 52/84 (61%)
Query: 120 THAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFND 179
T+ PPYSY + +AI NSP K LTLS+IY +I + F +Y + RWQNSIRH+LS ND
Sbjct: 83 TYYVPPYSYATGRGLAILNSPDKPLTLSKIYTWIHNTFFYYAKVVSRWQNSIRHNLSLND 142
Query: 180 CFVKVPRTPDKPGKGSFWTLHKDS 203
F+K+ KG FW++
Sbjct: 143 AFIKIEGRNGAKVKGHFWSIGPGH 166
>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
Length = 279
Score = 45.2 bits (107), Expect = 3e-05
Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 1/46 (2%)
Query: 239 HHHPTHHHHHHSH-HTHHNSHNTHSHQSHDAGGKKSPETKPHLSEL 283
HHH H H H H H H + H+ H A + + H +++
Sbjct: 121 HHHDHDHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHERAHANDI 166
Score = 42.1 bits (99), Expect = 3e-04
Identities = 12/36 (33%), Positives = 15/36 (41%)
Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPE 275
+ HH H H H H H+ H H H G + E
Sbjct: 118 ENMHHHDHDHDHDHDHEHHHDHGHHHHHEHGATAEE 153
Score = 41.7 bits (98), Expect = 4e-04
Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 1/38 (2%)
Query: 231 HHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDA 268
HH H H HHH H HH HH+ H + + DA
Sbjct: 121 HHHDHDHDHDHDHEHHHDHGHH-HHHEHGATAEEYQDA 157
Score = 31.3 bits (71), Expect = 0.80
Identities = 10/34 (29%), Positives = 12/34 (35%)
Query: 231 HHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ 264
H HH H HHHH H +H+
Sbjct: 126 DHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHE 159
>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region. The region
described in this family is found towards the N-terminus
of various eukaryotic fork head/HNF-3-related
transcription factors (which contain the pfam00250
domain). These proteins play key roles in embryogenesis,
maintenance of differentiated cell states, and
tumorigenesis.
Length = 137
Score = 40.7 bits (95), Expect = 3e-04
Identities = 36/132 (27%), Positives = 49/132 (37%), Gaps = 20/132 (15%)
Query: 9 TSLGSPGMTPSYSMNSVINSCSPQSGSTGTPLGGFSSNMHSMGSINGMNGPNCMSTGSMG 68
S S YSMNS+ + S G++NGM+ + M+ S G
Sbjct: 8 ESYSSVSGGMVYSMNSMNTYGPMNTSQGSANSSMNMSGYAGPGAMNGMS-SSSMNGMSPG 66
Query: 69 YSSNMNAACMGGINSYNTALTT------------------GRAELAENSPSPGALQRVAR 110
Y + M G++S T+L+ + S L R AR
Sbjct: 67 YGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRASSQNNLNR-AR 125
Query: 111 ADKTYRRSYTHA 122
KTYRRSYTHA
Sbjct: 126 DPKTYRRSYTHA 137
>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796). This
bacterial family of proteins has no known function.
Length = 161
Score = 40.0 bits (94), Expect = 7e-04
Identities = 12/37 (32%), Positives = 13/37 (35%)
Query: 228 RQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ 264
T S H HH H HH HH+ H H
Sbjct: 70 TLTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAGHS 106
Score = 37.3 bits (87), Expect = 0.004
Identities = 12/35 (34%), Positives = 14/35 (40%)
Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
VS H H H H HH+ H+ H H H
Sbjct: 71 LTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAGH 105
Score = 35.7 bits (83), Expect = 0.016
Identities = 10/37 (27%), Positives = 14/37 (37%)
Query: 226 VIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHS 262
+ + +S H HH H H H + H HS
Sbjct: 70 TLTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAGHS 106
Score = 29.9 bits (68), Expect = 1.4
Identities = 9/27 (33%), Positives = 11/27 (40%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQS 265
HH HH HH H H + H+
Sbjct: 87 DHHDHDHHDHHDDHEHAGHSDFHAEYQ 113
Score = 28.4 bits (64), Expect = 5.1
Identities = 7/35 (20%), Positives = 11/35 (31%)
Query: 236 SPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGG 270
+ T H HH+ + H H+ G
Sbjct: 70 TLTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAG 104
>gnl|CDD|236558 PRK09545, znuA, high-affinity zinc transporter periplasmic
component; Reviewed.
Length = 311
Score = 40.4 bits (95), Expect = 0.001
Identities = 12/36 (33%), Positives = 14/36 (38%)
Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPE 275
H HH H H H S H H ++ SPE
Sbjct: 117 AHDDHHDDDHDHAGHEKSDEDHHHGEYNMHIWLSPE 152
Score = 38.1 bits (89), Expect = 0.005
Identities = 9/39 (23%), Positives = 13/39 (33%), Gaps = 2/39 (5%)
Query: 223 KKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTH 261
K +++ H H H H HH +N H
Sbjct: 110 KPLLMKGAHDDHHDDDHDH--AGHEKSDEDHHHGEYNMH 146
Score = 35.8 bits (83), Expect = 0.037
Identities = 6/28 (21%), Positives = 8/28 (28%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
H H H H H+ + H
Sbjct: 119 DDHHDDDHDHAGHEKSDEDHHHGEYNMH 146
Score = 31.9 bits (73), Expect = 0.60
Identities = 6/26 (23%), Positives = 7/26 (26%), Gaps = 1/26 (3%)
Query: 243 THHHHHHSH-HTHHNSHNTHSHQSHD 267
H HH H H + H
Sbjct: 116 GAHDDHHDDDHDHAGHEKSDEDHHHG 141
>gnl|CDD|238501 cd01019, ZnuA, Zinc binding protein ZnuA. These proteins have been
shown to function as initial receptors in the ABC uptake
of Zn2+. They belong to the TroA superfamily of
periplasmic metal binding proteins that share a distinct
fold and ligand binding mechanism. They are comprised
of two globular subdomains connected by a single helix
and bind their specific ligands in the cleft between
these domains. A typical TroA protein is comprised of
two globular subdomains connected by a single helix and
can bind the metal ion in the cleft between these
domains. In addition, these proteins sometimes have a
low complexity region containing a metal-binding
histidine-rich motif (repetitive HDH sequence).
Length = 286
Score = 40.0 bits (94), Expect = 0.001
Identities = 10/48 (20%), Positives = 15/48 (31%)
Query: 220 KDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHD 267
K +V+ + +H H H H H H+ H D
Sbjct: 73 GRKKGKVLTLAKLIDLKTLEDGASHGDHEHDHEHAHGEHDGHEEGGLD 120
Score = 36.6 bits (85), Expect = 0.015
Identities = 11/41 (26%), Positives = 13/41 (31%), Gaps = 5/41 (12%)
Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPHL 280
H H H + H H H+ GG PHL
Sbjct: 88 LKTLEDGASHGDHEHDHEHAHGEHDGHEEGGL-----DPHL 123
Score = 35.8 bits (83), Expect = 0.032
Identities = 8/28 (28%), Positives = 9/28 (32%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
H H H H H H+ H H
Sbjct: 95 ASHGDHEHDHEHAHGEHDGHEEGGLDPH 122
Score = 32.7 bits (75), Expect = 0.31
Identities = 5/31 (16%), Positives = 7/31 (22%)
Query: 231 HHKSVSPTHHHPTHHHHHHSHHTHHNSHNTH 261
+ H H H H + H
Sbjct: 92 EDGASHGDHEHDHEHAHGEHDGHEEGGLDPH 122
>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region. SelP is the
only known eukaryotic selenoprotein that contains
multiple selenocysteine (Sec) residues, and accounts for
more than 50% of the selenium content of rat and human
plasma. It is thought to be glycosylated. SelP may have
antioxidant properties. It can attach to epithelial
cells, and may protect vascular endothelial cells
against peroxynitrite toxicity. The high selenium
content of SelP suggests that it may be involved in
selenium intercellular transport or storage. The
promoter structure of bovine SelP suggest that it may be
involved in countering heavy metal intoxication, and may
also have a developmental function. The N-terminal
region of SelP can exist independently of the C terminal
region. Zebrafish selenoprotein Pb lacks the C terminal
Sec-rich region, and a protein encoded by the rat SelP
gene and lacking this region has also been reported.
N-terminal region contains a conserved SecxxCys motif,
which is similar to the CysxxCys found in thioredoxins.
It is speculated that the N terminal region may adopt a
thioredoxin fold and catalyze redox reactions. The
N-terminal region also contains a His-rich region, which
is thought to mediate heparin binding. Binding to
heparan proteoglycans could account for the membrane
binding properties of SelP. The function of the
bacterial members of this family is uncharcterised.
Length = 238
Score = 39.1 bits (91), Expect = 0.002
Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 1/69 (1%)
Query: 221 DDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ-SHDAGGKKSPETKPH 279
D+ K+V T +K V HHH H H H H+ H H +S + P
Sbjct: 163 DECKKVTLATVNKPVEAEPRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQPDPD 222
Query: 280 LSELAMLGL 288
GL
Sbjct: 223 KPTEPPSGL 231
Score = 35.2 bits (81), Expect = 0.042
Identities = 15/52 (28%), Positives = 20/52 (38%)
Query: 228 RQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPH 279
RQ H S H HHH H HH ++ ++ S Q + P H
Sbjct: 182 RQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQPDPDKPTEPPSGLHH 233
Score = 27.9 bits (62), Expect = 8.1
Identities = 14/46 (30%), Positives = 15/46 (32%), Gaps = 20/46 (43%)
Query: 231 HHKSVSPTHHHPTHHHHHHS--------------------HHTHHN 256
HH+ HHHP HHH S H HHN
Sbjct: 191 HHEHQGHAHHHPHGHHHPGSNSHSESQQPDPDKPTEPPSGLHHHHN 236
>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
Length = 137
Score = 36.5 bits (84), Expect = 0.007
Identities = 9/45 (20%), Positives = 15/45 (33%), Gaps = 2/45 (4%)
Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPET 276
H++ P + HHH H + H+ Q + P
Sbjct: 40 HRTWDPADYL--DRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHH 82
Score = 31.9 bits (72), Expect = 0.26
Identities = 6/36 (16%), Positives = 7/36 (19%), Gaps = 2/36 (5%)
Query: 230 THHKSVSP--THHHPTHHHHHHSHHTHHNSHNTHSH 263
+ H H HH H H
Sbjct: 46 ADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPH 81
Score = 28.0 bits (62), Expect = 5.4
Identities = 4/42 (9%), Positives = 9/42 (21%)
Query: 225 EVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
+ + H +H + HH +
Sbjct: 25 DGRAENQHIQREGGNHRTWDPADYLDRPHHHPHRHQQDDHHL 66
>gnl|CDD|215173 PLN02304, PLN02304, probable pectinesterase.
Length = 379
Score = 34.1 bits (78), Expect = 0.14
Identities = 9/27 (33%), Positives = 11/27 (40%)
Query: 234 SVSPTHHHPTHHHHHHSHHTHHNSHNT 260
S +P HHH HH HH +
Sbjct: 26 SSKIATSYPHSIHHHSHHHHHHKHPDD 52
Score = 32.5 bits (74), Expect = 0.42
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 233 KSVSPTHHHPTHHHHHHSHHTHHN 256
K + H HH HHH HH H +
Sbjct: 28 KIATSYPHSIHHHSHHHHHHKHPD 51
Score = 30.6 bits (69), Expect = 1.4
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 243 THHHHHHSHHTHHNSHNTHS 262
T + H HH+HH+ H+ H
Sbjct: 31 TSYPHSIHHHSHHHHHHKHP 50
Score = 30.2 bits (68), Expect = 1.8
Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 5/30 (16%)
Query: 244 HHHHHHSHHTHHNSHNTHSHQSHDAGGKKS 273
+ H H H HH H H+ D G K S
Sbjct: 33 YPHSIHHHSHHH-----HHHKHPDDGKKVS 57
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 33.9 bits (78), Expect = 0.14
Identities = 7/42 (16%), Positives = 15/42 (35%), Gaps = 3/42 (7%)
Query: 230 THHKSVSPTHHHPTHHHHHHSHHTHHNSH---NTHSHQSHDA 268
KS + HH + H S + ++ + +D+
Sbjct: 84 LKSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSSSAHRRRNDS 125
Score = 31.2 bits (71), Expect = 0.94
Identities = 14/69 (20%), Positives = 25/69 (36%), Gaps = 3/69 (4%)
Query: 209 NGCYLRRQKRFKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDA 268
+ R KR K ++ + T KS S H HH H S T + +
Sbjct: 61 SPSRSRLHKRKKSSRRSPMSDTLLKSKSSAH---LLHHQSTRSHRRSKSGTTSPRKPSSS 117
Query: 269 GGKKSPETK 277
++ +++
Sbjct: 118 AHRRRNDSE 126
>gnl|CDD|238500 cd01018, ZntC, Metal binding protein ZntC. These proteins are
predicted to function as initial receptors in ABC
transport of metal ions. They belong to the TroA
superfamily of helical backbone metal receptor proteins
that share a distinct fold and ligand binding mechanism.
They are comprised of two globular subdomains connected
by a long alpha helix and bind their specific ligands in
the cleft between these domains. In addition, many of
these proteins possess a metal-binding histidine-rich
motif (repetitive HDH sequence).
Length = 266
Score = 33.1 bits (76), Expect = 0.22
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 239 HHHPTHHHHHHSHHTHHNSH 258
HHH H H H HH +++ H
Sbjct: 94 HHHHHHGEHEHHHHGNYDPH 113
Score = 31.6 bits (72), Expect = 0.70
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 244 HHHHHHSHHTHHNSHNTHSH 263
HHHHHH H HH+ N H
Sbjct: 94 HHHHHHGEHEHHHHGNYDPH 113
Score = 30.4 bits (69), Expect = 1.7
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 233 KSVSPTHHHPTHHHHHHSH-HTHHNSHNTH 261
K ++ HHHHH H H HH +++ H
Sbjct: 84 KGITLIPMADHHHHHHGEHEHHHHGNYDPH 113
Score = 28.5 bits (64), Expect = 6.3
Identities = 20/88 (22%), Positives = 35/88 (39%), Gaps = 10/88 (11%)
Query: 242 PTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPHLSELAMLGLTKEDAMSAGLLHS 301
P HHHH H H + H+ + +D SP ++E L + D +A
Sbjct: 90 PMADHHHHHHGEHEHHHHGN----YDPHIWLSPANAKIMAENIYEALAELDPQNATY--- 142
Query: 302 DFQQN--SYHQSFHQQDSELAAMVASGR 327
+Q N + DSE+ +++ +
Sbjct: 143 -YQANLDALLAELDALDSEIRTILSKLK 169
>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
family. The immune response of the purple sea urchin
appears to be more complex than previously believed in
that it uses immune-related gene families homologous to
vertebrate Toll-like and NOD/NALP-like receptor families
as well as C-type lectins and a rudimentary complement
system. In addition, the species also produces this
unusual family of mRNAs, also known as 185/333, which is
strongly upregulated in response to pathogen challenge.
Length = 312
Score = 32.3 bits (73), Expect = 0.41
Identities = 10/41 (24%), Positives = 14/41 (34%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPH 279
H H HH H+ ++H G + ET H
Sbjct: 177 HPHHHRHHGHNHFGRKPFGGRPFGRRNHTEGHQGHNETGDH 217
Score = 31.9 bits (72), Expect = 0.52
Identities = 16/87 (18%), Positives = 28/87 (32%), Gaps = 5/87 (5%)
Query: 189 DKPGKGSFWTLHKDSGNMFENGCYLRRQKRFKDDKKEVIRQTHHKSVSPTHHHPTH--HH 246
D PG+ + E + +R + ++D + H + P
Sbjct: 140 DGPGRRHHGHRQGPPQDRPEEQPFGQRNESSEEDGRPHPHHHRHHGHNHFGRKPFGGRPF 199
Query: 247 HHHSHHTHHNSHNT---HSHQSHDAGG 270
+H H HN H H+ H+ G
Sbjct: 200 GRRNHTEGHQGHNETGDHPHRHHNKTG 226
>gnl|CDD|237322 PRK13261, ureE, urease accessory protein UreE; Provisional.
Length = 159
Score = 31.1 bits (71), Expect = 0.56
Identities = 6/16 (37%), Positives = 7/16 (43%)
Query: 240 HHPTHHHHHHSHHTHH 255
+ HH HSH H
Sbjct: 144 YGGAFRHHGHSHDHSH 159
Score = 30.7 bits (70), Expect = 0.88
Identities = 5/20 (25%), Positives = 7/20 (35%)
Query: 239 HHHPTHHHHHHSHHTHHNSH 258
H H+H +SH
Sbjct: 140 ERGAYGGAFRHHGHSHDHSH 159
Score = 29.1 bits (66), Expect = 2.6
Identities = 5/15 (33%), Positives = 5/15 (33%)
Query: 238 THHHPTHHHHHHSHH 252
HH H H H
Sbjct: 144 YGGAFRHHGHSHDHS 158
>gnl|CDD|236005 PRK07376, PRK07376, NAD(P)H-quinone oxidoreductase subunit F;
Validated.
Length = 673
Score = 31.9 bits (73), Expect = 0.62
Identities = 10/32 (31%), Positives = 13/32 (40%)
Query: 223 KKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTH 254
+KE++ HH HHHHS H
Sbjct: 466 QKELLAAADAPGHHEGHHEEHGEHHHHSGSPH 497
Score = 30.8 bits (70), Expect = 1.4
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 3/43 (6%)
Query: 219 FKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTH 261
F+ + KE+ Q + + H HH H H HH+S + H
Sbjct: 458 FRGNDKEL--QKELLAAADAPGHHEGHHEEHGEH-HHHSGSPH 497
Score = 29.6 bits (67), Expect = 4.2
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 241 HPTHHHHHHSHHTHHNSHNTHSHQS 265
P HH HH H H+ H+ H+S
Sbjct: 475 APGHHEGHHEEHGEHHHHSGSPHES 499
>gnl|CDD|237323 PRK13263, ureE, urease accessory protein UreE; Provisional.
Length = 206
Score = 31.3 bits (71), Expect = 0.65
Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 2/33 (6%)
Query: 240 HHPTHHHHHHSHHTHHNSHN-THSHQSHDAGGK 271
HH H HSH H + H+ H H H G
Sbjct: 167 VFHEHHGHSHSHSDHDHDHDHDHDHV-HGPGCG 198
Score = 30.1 bits (68), Expect = 1.7
Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
Query: 228 RQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAG 269
R H + + + H H H+H +S + H H HD
Sbjct: 150 RHGHDATFAEDYALAQQVFHEHHGHSHSHSDHDHDH-DHDHD 190
Score = 27.8 bits (62), Expect = 8.6
Identities = 10/36 (27%), Positives = 11/36 (30%)
Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHD 267
H S +H H H H H H H H
Sbjct: 171 HHGHSHSHSDHDHDHDHDHDHVHGPGCGHGHHHHHR 206
Score = 27.8 bits (62), Expect = 9.3
Identities = 12/40 (30%), Positives = 14/40 (35%)
Query: 226 VIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQS 265
V + H S S + H H H H H H H H
Sbjct: 167 VFHEHHGHSHSHSDHDHDHDHDHDHVHGPGCGHGHHHHHR 206
>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter. The ZIP family consists of
zinc transport proteins and many putative metal
transporters. The main contribution to this family is
from the Arabidopsis thaliana ZIP protein family these
proteins are responsible for zinc uptake in the plant.
Also found within this family are C. elegans proteins of
unknown function which are annotated as being similar to
human growth arrest inducible gene product, although
this protein in not found within this family.
Length = 314
Score = 31.6 bits (72), Expect = 0.73
Identities = 8/27 (29%), Positives = 9/27 (33%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQS 265
H HHHHH H S +
Sbjct: 138 EHGEGPHHHHHEGHEKGESDKESGLRV 164
Score = 29.6 bits (67), Expect = 3.0
Identities = 5/35 (14%), Positives = 9/35 (25%)
Query: 230 THHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ 264
+ P HHHH + + +
Sbjct: 131 ESNVVHHEHGEGPHHHHHEGHEKGESDKESGLRVR 165
Score = 28.9 bits (65), Expect = 4.8
Identities = 7/31 (22%), Positives = 7/31 (22%)
Query: 230 THHKSVSPTHHHPTHHHHHHSHHTHHNSHNT 260
H P HHH H S
Sbjct: 136 HHEHGEGPHHHHHEGHEKGESDKESGLRVRD 166
Score = 28.9 bits (65), Expect = 5.2
Identities = 7/38 (18%), Positives = 13/38 (34%)
Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAG 269
+ + HH H H HH+ + ++G
Sbjct: 124 DEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESG 161
Score = 28.5 bits (64), Expect = 7.2
Identities = 5/33 (15%), Positives = 8/33 (24%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGK 271
+ H HH H S + +
Sbjct: 133 NVVHHEHGEGPHHHHHEGHEKGESDKESGLRVR 165
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 31.1 bits (70), Expect = 1.1
Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 3/66 (4%)
Query: 14 PGMTPSYSMNSVINSCSPQSGST---GTPLGGFSSNMHSMGSINGMNGPNCMSTGSMGYS 70
G++P+ +M S +N+ SP GG + + G++N + N M+ G +
Sbjct: 253 NGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENSMNQGMTPSA 312
Query: 71 SNMNAA 76
S +N
Sbjct: 313 SMINLN 318
>gnl|CDD|223597 COG0523, COG0523, Putative GTPases (G3E family) [General function
prediction only].
Length = 323
Score = 30.7 bits (70), Expect = 1.2
Identities = 5/26 (19%), Positives = 7/26 (26%)
Query: 237 PTHHHPTHHHHHHSHHTHHNSHNTHS 262
P H H H ++ S
Sbjct: 204 VGRTSPLFSDHQHDHEHDEGGISSFS 229
Score = 30.0 bits (68), Expect = 2.5
Identities = 7/33 (21%), Positives = 9/33 (27%)
Query: 226 VIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSH 258
+ V T + H H H H S
Sbjct: 195 EGLFDLDRDVGRTSPLFSDHQHDHEHDEGGISS 227
>gnl|CDD|226908 COG4531, ZnuA, ABC-type Zn2+ transport system, periplasmic
component/surface adhesin [Inorganic ion transport and
metabolism].
Length = 318
Score = 30.5 bits (69), Expect = 1.7
Identities = 26/142 (18%), Positives = 41/142 (28%), Gaps = 14/142 (9%)
Query: 219 FKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKP 278
K + R+ H H H H H HH+ H K+
Sbjct: 108 LPGVKPLLFREAHDHEEEHDHGHDHAGADHKGDHDHHHEGEYDMHLWLSPAIAKAVAAAI 167
Query: 279 HLSELAMLGLTKEDAMSAGLLHSDFQQN--SYHQSFHQQDSELAAMVASGRCHPHLIPPD 336
L + D +A N + D ++ +A + P + D
Sbjct: 168 AKK------LAELDPQNAAKY----DANLKDFEAQLAALDKKVGEELAPVKGKPFFVFHD 217
Query: 337 HYNHHLKQEYGASAVNHPFPVS 358
Y + + YG + H F VS
Sbjct: 218 AYG-YFENAYGLKPLGH-FTVS 237
>gnl|CDD|224151 COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion
transport and metabolism].
Length = 296
Score = 30.4 bits (69), Expect = 1.8
Identities = 7/20 (35%), Positives = 8/20 (40%)
Query: 243 THHHHHHSHHTHHNSHNTHS 262
H HHSH H N +
Sbjct: 1 MHGAAHHSHGHDHAHDNPRN 20
>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
ABF1 is a sequence-specific DNA binding protein involved
in transcription activation, gene silencing and
initiation of DNA replication. ABF1 is known to remodel
chromatin, and it is proposed that it mediates its
effects on transcription and gene expression by
modifying local chromatin architecture. These functions
require a conserved stretch of 20 amino acids in the
C-terminal region of ABF1 (amino acids 639 to 662 S.
cerevisiae). The N-terminal two thirds of the protein
are necessary for DNA binding, and the N-terminus (amino
acids 9 to 91 in S. cerevisiae) is thought to contain a
novel zinc-finger motif which may stabilise the protein
structure.
Length = 508
Score = 30.5 bits (68), Expect = 1.9
Identities = 16/44 (36%), Positives = 17/44 (38%), Gaps = 6/44 (13%)
Query: 229 QTHHKSVSPTHHH---PTHHHHHHSHHTHHNSHNTH---SHQSH 266
Q HH P HH P H H H HH H + HQ H
Sbjct: 295 QHHHHPSIPAHHQHQLPEGHQHDHQHHHQHQQQDLQHQEQHQQH 338
Score = 28.9 bits (64), Expect = 5.3
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 241 HPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPE 275
HH H+ H HH+ HN H++ A ++ +
Sbjct: 99 GGHHHGHNMHAHHHHHHHNGHTNGHGLAEDDRAED 133
Score = 28.5 bits (63), Expect = 7.0
Identities = 12/28 (42%), Positives = 14/28 (50%)
Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHD 267
HH H+ H H HH HHN H + D
Sbjct: 101 HHHGHNMHAHHHHHHHNGHTNGHGLAED 128
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.9 bits (67), Expect = 3.4
Identities = 19/116 (16%), Positives = 32/116 (27%), Gaps = 11/116 (9%)
Query: 203 SGNMFENGCYLRRQKRFK--DDKKEVIRQTHHKSVSPTHHHP----THHHHHHSHHTHHN 256
S + + Q K QT S H HH + HH+
Sbjct: 221 SKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHS 280
Query: 257 SHNTHSHQ-----SHDAGGKKSPETKPHLSELAMLGLTKEDAMSAGLLHSDFQQNS 307
H Q + + + + + S+ + A S G S ++S
Sbjct: 281 KHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHS 336
>gnl|CDD|238499 cd01017, AdcA, Metal binding protein AcdA. These proteins have
been shown to function in the ABC uptake of Zn2+ and
Mn2+ and in competence for genetic transformation and
adhesion. The AcdA proteins belong to the TroA
superfamily of helical backbone metal receptor proteins
that share a distinct fold and ligand binding mechanism.
They are comprised of two globular subdomains connected
by a long alpha helix and they bind their ligand in the
cleft between these domains. In addition, many of these
proteins have a low complexity region containing metal
binding histidine-rich motif (repetitive HDH sequence).
Length = 282
Score = 29.2 bits (66), Expect = 4.0
Identities = 8/18 (44%), Positives = 10/18 (55%)
Query: 244 HHHHHHSHHTHHNSHNTH 261
H H HSH HH ++ H
Sbjct: 97 EHDHDHSHSHHHGDYDPH 114
Score = 28.8 bits (65), Expect = 5.2
Identities = 6/21 (28%), Positives = 8/21 (38%)
Query: 238 THHHPTHHHHHHSHHTHHNSH 258
H H H HH ++ H
Sbjct: 94 GGAEHDHDHSHSHHHGDYDPH 114
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 29.4 bits (67), Expect = 4.5
Identities = 9/10 (90%), Positives = 9/10 (90%)
Query: 217 KRFKDDKKEV 226
KRFKDD KEV
Sbjct: 704 KRFKDDVKEV 713
>gnl|CDD|185214 PRK15314, PRK15314, outer membrane protein RatB; Provisional.
Length = 2435
Score = 29.4 bits (65), Expect = 5.1
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 46 NMHSMGSINGMNGPNCMSTGSMGYSSNMNAACMGGINSYNTALTTG 91
N+HS+ ++NG++ PN T +M + N + A G + + L G
Sbjct: 760 NVHSVNALNGLSVPNASFTVTMSHGKNRDNATTGFTDPSDGTLVMG 805
>gnl|CDD|227443 COG5112, UFD2, U1-like Zn-finger-containing protein [General
function prediction only].
Length = 126
Score = 27.7 bits (61), Expect = 5.6
Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 2/56 (3%)
Query: 171 IRHSLSFNDCFVKVPRTPDKPGKGSFWTLHKDSGNMFENGCYLRRQKRFKDDKKEV 226
I++ LS + K+P P+ PG G + + F L K+ K K+
Sbjct: 31 IKNDLSTKESQKKLPYDPELPGLGQHYCIECA--RYFITEKALMEHKKGKVHKRRA 84
>gnl|CDD|236748 PRK10737, PRK10737, FKBP-type peptidyl-prolyl cis-trans isomerase;
Provisional.
Length = 196
Score = 28.0 bits (62), Expect = 6.8
Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 227 IRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGK 271
IR+ + ++ H H H HHH H + H H H+ GG+
Sbjct: 139 IREATEEELAHGHVHGAHDHHHDHDHDGCCGGHGHDH-GHEHGGE 182
>gnl|CDD|172935 PRK14460, PRK14460, ribosomal RNA large subunit methyltransferase
N; Provisional.
Length = 354
Score = 28.2 bits (63), Expect = 7.8
Identities = 13/44 (29%), Positives = 19/44 (43%), Gaps = 6/44 (13%)
Query: 63 STGSMGYSSNMNAACMGGINSYNTALTTGRAELAENSPSPGALQ 106
STG+MG+ NM + G + R L +N P L+
Sbjct: 120 STGTMGFERNMTMGEILG------QVLVAREHLGDNGPDHPILR 157
>gnl|CDD|217749 pfam03824, NicO, High-affinity nickel-transport protein. High
affinity nickel transporters involved in the
incorporation of nickel into H2-uptake hydrogenase and
urease enzymes. Essential for the expression of
catalytically active hydrogenase and urease. Ion uptake
is dependent on proton motive force. HoxN in Alcaligenes
eutrophus is thought to be an integral membrane protein
with seven transmembrane helices. The family also
includes a cobalt transporter.
Length = 278
Score = 28.1 bits (63), Expect = 8.1
Identities = 3/41 (7%), Positives = 7/41 (17%)
Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPH 279
+ + H + G + E
Sbjct: 121 AALQKRGLLSRILGPRLVTKHWHMYPVGGLGFDTATEVALL 161
>gnl|CDD|216419 pfam01297, SBP_bac_9, Periplasmic solute binding protein family.
This family includes periplasmic solute binding proteins
such as TroA that interacts with an ATP-binding cassette
transport system in Treponema pallidum.
Length = 263
Score = 28.1 bits (63), Expect = 8.3
Identities = 6/24 (25%), Positives = 7/24 (29%), Gaps = 1/24 (4%)
Query: 235 VSPTHHHPTHHHHHHSHHTHHNSH 258
S H H HH + H
Sbjct: 79 ASEGIDLH-HGPEHEDHHGGGDPH 101
Score = 28.1 bits (63), Expect = 9.3
Identities = 6/18 (33%), Positives = 7/18 (38%)
Query: 244 HHHHHHSHHTHHNSHNTH 261
HH H HH + H
Sbjct: 84 DLHHGPEHEDHHGGGDPH 101
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.126 0.383
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,231,530
Number of extensions: 1825357
Number of successful extensions: 2873
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2355
Number of HSP's successfully gapped: 133
Length of query: 417
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 318
Effective length of database: 6,546,556
Effective search space: 2081804808
Effective search space used: 2081804808
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.1 bits)