RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy447
         (417 letters)



>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain. 
          Length = 96

 Score =  201 bits (514), Expect = 1e-64
 Identities = 76/96 (79%), Positives = 87/96 (90%)

Query: 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFV 182
           KPPYSYI+LITMAIQ SP KMLTLSEIYQ+IMDLFP+YRQN+Q WQNSIRH+LS N CF+
Sbjct: 1   KPPYSYIALITMAIQQSPEKMLTLSEIYQWIMDLFPYYRQNKQGWQNSIRHNLSLNKCFI 60

Query: 183 KVPRTPDKPGKGSFWTLHKDSGNMFENGCYLRRQKR 218
           KVPR+PDKPGKGS+WTL  +S NMFENG YL+R+KR
Sbjct: 61  KVPRSPDKPGKGSYWTLDPESENMFENGKYLKRRKR 96


>gnl|CDD|214627 smart00339, FH, FORKHEAD.  FORKHEAD, also known as a "winged
           helix".
          Length = 89

 Score =  167 bits (425), Expect = 1e-51
 Identities = 62/89 (69%), Positives = 73/89 (82%)

Query: 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFV 182
           KPPYSYI+LI MAI +SP K LTLSEIY++I D FP+YR+N+  WQNSIRH+LS NDCFV
Sbjct: 1   KPPYSYIALIAMAILSSPDKRLTLSEIYKWIEDNFPYYRENRAGWQNSIRHNLSLNDCFV 60

Query: 183 KVPRTPDKPGKGSFWTLHKDSGNMFENGC 211
           KVPR  D+PGKGS+WTL   + NMFENG 
Sbjct: 61  KVPREGDRPGKGSYWTLDPAAENMFENGN 89


>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix".  FH is
           named for the Drosophila fork head protein, a
           transcription factor which promotes terminal rather than
           segmental development. This family of transcription
           factor domains, which bind to B-DNA as monomers, are
           also found in the Hepatocyte nuclear factor (HNF)
           proteins, which provide tissue-specific gene regulation.
           The structure contains 2 flexible loops or "wings" in
           the C-terminal region, hence the term winged helix.
          Length = 78

 Score =  163 bits (414), Expect = 6e-50
 Identities = 54/78 (69%), Positives = 63/78 (80%)

Query: 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFV 182
           KPPYSY +LI MAIQ+SP K LTLSEIY++I D FP++R     WQNSIRH+LS N CFV
Sbjct: 1   KPPYSYSALIAMAIQSSPEKRLTLSEIYKWISDNFPYFRDAPAGWQNSIRHNLSLNKCFV 60

Query: 183 KVPRTPDKPGKGSFWTLH 200
           KVPR PD+PGKGS+WTL 
Sbjct: 61  KVPREPDEPGKGSYWTLD 78


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
           [Transcription].
          Length = 610

 Score = 89.1 bits (221), Expect = 4e-19
 Identities = 60/205 (29%), Positives = 87/205 (42%), Gaps = 6/205 (2%)

Query: 13  SPGMTPSYSMNSVINSCSPQSGSTGTPLGGFSSNMHSMGSINGMNGPNCMSTGSMGYSSN 72
            P  + S    S+ +       ST TP+    S+     S    +      T +   SS 
Sbjct: 233 IPADSSSNLDVSLGHH--ISQPSTHTPVLDNHSSGEENISRINNSSQIDSPTPNYRMSSI 290

Query: 73  MNAACMGGINSYNTALTTGRAELAENSPSPGALQRVARADKTYRRSYTHAKPPYSYISLI 132
              + +      N        + +++SP P      +            +KP +SY + I
Sbjct: 291 --DSSVNPSRLANN--KDEGRKGSKSSPVPKDAAPPSTLSDLSADVNRTSKPAFSYANSI 346

Query: 133 TMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPG 192
           T AI +SP+  +TLSEIY +I    P+YR     WQNSIRH+LS N  F KVPR+  +PG
Sbjct: 347 TQAILSSPSGKMTLSEIYSWISSNLPYYRHKPTAWQNSIRHNLSLNKSFEKVPRSASQPG 406

Query: 193 KGSFWTLHKDSGNMFENGCYLRRQK 217
           KG FW +        E+    R  K
Sbjct: 407 KGCFWKIDYSYIYEKESKRNPRSPK 431



 Score = 74.1 bits (182), Expect = 3e-14
 Identities = 39/84 (46%), Positives = 52/84 (61%)

Query: 120 THAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFND 179
           T+  PPYSY +   +AI NSP K LTLS+IY +I + F +Y +   RWQNSIRH+LS ND
Sbjct: 83  TYYVPPYSYATGRGLAILNSPDKPLTLSKIYTWIHNTFFYYAKVVSRWQNSIRHNLSLND 142

Query: 180 CFVKVPRTPDKPGKGSFWTLHKDS 203
            F+K+        KG FW++    
Sbjct: 143 AFIKIEGRNGAKVKGHFWSIGPGH 166


>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
          Length = 279

 Score = 45.2 bits (107), Expect = 3e-05
 Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 1/46 (2%)

Query: 239 HHHPTHHHHHHSH-HTHHNSHNTHSHQSHDAGGKKSPETKPHLSEL 283
           HHH   H H H H H H + H+ H      A   +    + H +++
Sbjct: 121 HHHDHDHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHERAHANDI 166



 Score = 42.1 bits (99), Expect = 3e-04
 Identities = 12/36 (33%), Positives = 15/36 (41%)

Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPE 275
            +  HH H H H   H  H+ H H  H   G  + E
Sbjct: 118 ENMHHHDHDHDHDHDHEHHHDHGHHHHHEHGATAEE 153



 Score = 41.7 bits (98), Expect = 4e-04
 Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 1/38 (2%)

Query: 231 HHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDA 268
           HH      H H   HHH H HH HH+ H   + +  DA
Sbjct: 121 HHHDHDHDHDHDHEHHHDHGHH-HHHEHGATAEEYQDA 157



 Score = 31.3 bits (71), Expect = 0.80
 Identities = 10/34 (29%), Positives = 12/34 (35%)

Query: 231 HHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ 264
            H       HH  H HHHH  H         +H+
Sbjct: 126 DHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHE 159


>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region.  The region
           described in this family is found towards the N-terminus
           of various eukaryotic fork head/HNF-3-related
           transcription factors (which contain the pfam00250
           domain). These proteins play key roles in embryogenesis,
           maintenance of differentiated cell states, and
           tumorigenesis.
          Length = 137

 Score = 40.7 bits (95), Expect = 3e-04
 Identities = 36/132 (27%), Positives = 49/132 (37%), Gaps = 20/132 (15%)

Query: 9   TSLGSPGMTPSYSMNSVINSCSPQSGSTGTPLGGFSSNMHSMGSINGMNGPNCMSTGSMG 68
            S  S      YSMNS+       +           S     G++NGM+  + M+  S G
Sbjct: 8   ESYSSVSGGMVYSMNSMNTYGPMNTSQGSANSSMNMSGYAGPGAMNGMS-SSSMNGMSPG 66

Query: 69  YSSNMNAACMGGINSYNTALTT------------------GRAELAENSPSPGALQRVAR 110
           Y    +   M G++S  T+L+                           + S   L R AR
Sbjct: 67  YGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRASSQNNLNR-AR 125

Query: 111 ADKTYRRSYTHA 122
             KTYRRSYTHA
Sbjct: 126 DPKTYRRSYTHA 137


>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796).  This
           bacterial family of proteins has no known function.
          Length = 161

 Score = 40.0 bits (94), Expect = 7e-04
 Identities = 12/37 (32%), Positives = 13/37 (35%)

Query: 228 RQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ 264
             T     S       H HH H HH HH+ H    H 
Sbjct: 70  TLTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAGHS 106



 Score = 37.3 bits (87), Expect = 0.004
 Identities = 12/35 (34%), Positives = 14/35 (40%)

Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
              VS        H H H  H HH+ H+ H H  H
Sbjct: 71  LTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAGH 105



 Score = 35.7 bits (83), Expect = 0.016
 Identities = 10/37 (27%), Positives = 14/37 (37%)

Query: 226 VIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHS 262
            + +   +S     H   HH H H  H   + H  HS
Sbjct: 70  TLTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAGHS 106



 Score = 29.9 bits (68), Expect = 1.4
 Identities = 9/27 (33%), Positives = 11/27 (40%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQS 265
            HH   HH HH  H H    + H+   
Sbjct: 87  DHHDHDHHDHHDDHEHAGHSDFHAEYQ 113



 Score = 28.4 bits (64), Expect = 5.1
 Identities = 7/35 (20%), Positives = 11/35 (31%)

Query: 236 SPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGG 270
           + T             H HH+  +   H  H+  G
Sbjct: 70  TLTEVSLESSLFGDHDHDHHDHDHHDHHDDHEHAG 104


>gnl|CDD|236558 PRK09545, znuA, high-affinity zinc transporter periplasmic
           component; Reviewed.
          Length = 311

 Score = 40.4 bits (95), Expect = 0.001
 Identities = 12/36 (33%), Positives = 14/36 (38%)

Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPE 275
            H  HH   H H  H  S   H H  ++     SPE
Sbjct: 117 AHDDHHDDDHDHAGHEKSDEDHHHGEYNMHIWLSPE 152



 Score = 38.1 bits (89), Expect = 0.005
 Identities = 9/39 (23%), Positives = 13/39 (33%), Gaps = 2/39 (5%)

Query: 223 KKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTH 261
           K  +++  H       H H    H       HH  +N H
Sbjct: 110 KPLLMKGAHDDHHDDDHDH--AGHEKSDEDHHHGEYNMH 146



 Score = 35.8 bits (83), Expect = 0.037
 Identities = 6/28 (21%), Positives = 8/28 (28%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
             H    H H  H      H+   +  H
Sbjct: 119 DDHHDDDHDHAGHEKSDEDHHHGEYNMH 146



 Score = 31.9 bits (73), Expect = 0.60
 Identities = 6/26 (23%), Positives = 7/26 (26%), Gaps = 1/26 (3%)

Query: 243 THHHHHHSH-HTHHNSHNTHSHQSHD 267
             H  HH   H H     +     H 
Sbjct: 116 GAHDDHHDDDHDHAGHEKSDEDHHHG 141


>gnl|CDD|238501 cd01019, ZnuA, Zinc binding protein ZnuA. These proteins have been
           shown to function as initial receptors in the ABC uptake
           of Zn2+.  They belong to the TroA superfamily of
           periplasmic metal binding proteins that share a distinct
           fold and ligand binding mechanism.  They are comprised
           of two globular subdomains connected by a single helix
           and bind their specific ligands in the cleft between
           these domains.  A typical TroA protein is comprised of
           two globular subdomains connected by a single helix and
           can bind the metal ion in the cleft between these
           domains. In addition, these proteins sometimes have a
           low complexity region containing a metal-binding
           histidine-rich motif (repetitive HDH sequence).
          Length = 286

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 10/48 (20%), Positives = 15/48 (31%)

Query: 220 KDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHD 267
              K +V+       +       +H  H H H   H  H+ H     D
Sbjct: 73  GRKKGKVLTLAKLIDLKTLEDGASHGDHEHDHEHAHGEHDGHEEGGLD 120



 Score = 36.6 bits (85), Expect = 0.015
 Identities = 11/41 (26%), Positives = 13/41 (31%), Gaps = 5/41 (12%)

Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPHL 280
                    H  H H + H    H  H+ GG       PHL
Sbjct: 88  LKTLEDGASHGDHEHDHEHAHGEHDGHEEGGL-----DPHL 123



 Score = 35.8 bits (83), Expect = 0.032
 Identities = 8/28 (28%), Positives = 9/28 (32%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
             H  H H H   H  H+ H       H
Sbjct: 95  ASHGDHEHDHEHAHGEHDGHEEGGLDPH 122



 Score = 32.7 bits (75), Expect = 0.31
 Identities = 5/31 (16%), Positives = 7/31 (22%)

Query: 231 HHKSVSPTHHHPTHHHHHHSHHTHHNSHNTH 261
              +    H H   H H           + H
Sbjct: 92  EDGASHGDHEHDHEHAHGEHDGHEEGGLDPH 122


>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region.  SelP is the
           only known eukaryotic selenoprotein that contains
           multiple selenocysteine (Sec) residues, and accounts for
           more than 50% of the selenium content of rat and human
           plasma. It is thought to be glycosylated. SelP may have
           antioxidant properties. It can attach to epithelial
           cells, and may protect vascular endothelial cells
           against peroxynitrite toxicity. The high selenium
           content of SelP suggests that it may be involved in
           selenium intercellular transport or storage. The
           promoter structure of bovine SelP suggest that it may be
           involved in countering heavy metal intoxication, and may
           also have a developmental function. The N-terminal
           region of SelP can exist independently of the C terminal
           region. Zebrafish selenoprotein Pb lacks the C terminal
           Sec-rich region, and a protein encoded by the rat SelP
           gene and lacking this region has also been reported.
           N-terminal region contains a conserved SecxxCys motif,
           which is similar to the CysxxCys found in thioredoxins.
           It is speculated that the N terminal region may adopt a
           thioredoxin fold and catalyze redox reactions. The
           N-terminal region also contains a His-rich region, which
           is thought to mediate heparin binding. Binding to
           heparan proteoglycans could account for the membrane
           binding properties of SelP. The function of the
           bacterial members of this family is uncharcterised.
          Length = 238

 Score = 39.1 bits (91), Expect = 0.002
 Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 1/69 (1%)

Query: 221 DDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ-SHDAGGKKSPETKPH 279
           D+ K+V   T +K V         HHH H  H  H  H+ H H         +S +  P 
Sbjct: 163 DECKKVTLATVNKPVEAEPRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQPDPD 222

Query: 280 LSELAMLGL 288
                  GL
Sbjct: 223 KPTEPPSGL 231



 Score = 35.2 bits (81), Expect = 0.042
 Identities = 15/52 (28%), Positives = 20/52 (38%)

Query: 228 RQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPH 279
           RQ H    S   H    HHH H HH   ++ ++ S Q       + P    H
Sbjct: 182 RQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQPDPDKPTEPPSGLHH 233



 Score = 27.9 bits (62), Expect = 8.1
 Identities = 14/46 (30%), Positives = 15/46 (32%), Gaps = 20/46 (43%)

Query: 231 HHKSVSPTHHHPTHHHHHHS--------------------HHTHHN 256
           HH+     HHHP  HHH  S                     H HHN
Sbjct: 191 HHEHQGHAHHHPHGHHHPGSNSHSESQQPDPDKPTEPPSGLHHHHN 236


>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
          Length = 137

 Score = 36.5 bits (84), Expect = 0.007
 Identities = 9/45 (20%), Positives = 15/45 (33%), Gaps = 2/45 (4%)

Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPET 276
           H++  P  +      HHH H    + H+    Q       + P  
Sbjct: 40  HRTWDPADYL--DRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHH 82



 Score = 31.9 bits (72), Expect = 0.26
 Identities = 6/36 (16%), Positives = 7/36 (19%), Gaps = 2/36 (5%)

Query: 230 THHKSVSP--THHHPTHHHHHHSHHTHHNSHNTHSH 263
             +        H H    HH          H    H
Sbjct: 46  ADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPH 81



 Score = 28.0 bits (62), Expect = 5.4
 Identities = 4/42 (9%), Positives = 9/42 (21%)

Query: 225 EVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSH 266
           +   +  H      +H       +     HH   +       
Sbjct: 25  DGRAENQHIQREGGNHRTWDPADYLDRPHHHPHRHQQDDHHL 66


>gnl|CDD|215173 PLN02304, PLN02304, probable pectinesterase.
          Length = 379

 Score = 34.1 bits (78), Expect = 0.14
 Identities = 9/27 (33%), Positives = 11/27 (40%)

Query: 234 SVSPTHHHPTHHHHHHSHHTHHNSHNT 260
           S      +P   HHH  HH HH   + 
Sbjct: 26  SSKIATSYPHSIHHHSHHHHHHKHPDD 52



 Score = 32.5 bits (74), Expect = 0.42
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 233 KSVSPTHHHPTHHHHHHSHHTHHN 256
           K  +   H   HH HHH HH H +
Sbjct: 28  KIATSYPHSIHHHSHHHHHHKHPD 51



 Score = 30.6 bits (69), Expect = 1.4
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query: 243 THHHHHHSHHTHHNSHNTHS 262
           T + H   HH+HH+ H+ H 
Sbjct: 31  TSYPHSIHHHSHHHHHHKHP 50



 Score = 30.2 bits (68), Expect = 1.8
 Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 5/30 (16%)

Query: 244 HHHHHHSHHTHHNSHNTHSHQSHDAGGKKS 273
           + H  H H  HH     H H+  D G K S
Sbjct: 33  YPHSIHHHSHHH-----HHHKHPDDGKKVS 57


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 393 and 442 amino
           acids in length.
          Length = 301

 Score = 33.9 bits (78), Expect = 0.14
 Identities = 7/42 (16%), Positives = 15/42 (35%), Gaps = 3/42 (7%)

Query: 230 THHKSVSPTHHHPTHHHHHHSHHTHHNSH---NTHSHQSHDA 268
              KS +   HH +   H  S     +     ++   + +D+
Sbjct: 84  LKSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSSSAHRRRNDS 125



 Score = 31.2 bits (71), Expect = 0.94
 Identities = 14/69 (20%), Positives = 25/69 (36%), Gaps = 3/69 (4%)

Query: 209 NGCYLRRQKRFKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDA 268
           +    R  KR K  ++  +  T  KS S  H     HH     H    S  T   +   +
Sbjct: 61  SPSRSRLHKRKKSSRRSPMSDTLLKSKSSAH---LLHHQSTRSHRRSKSGTTSPRKPSSS 117

Query: 269 GGKKSPETK 277
             ++  +++
Sbjct: 118 AHRRRNDSE 126


>gnl|CDD|238500 cd01018, ZntC, Metal binding protein ZntC.  These proteins are
           predicted to function as initial receptors in ABC
           transport of metal ions.  They belong to the TroA
           superfamily of helical backbone metal receptor proteins
           that share a distinct fold and ligand binding mechanism.
            They are comprised of two globular subdomains connected
           by a long alpha helix and bind their specific ligands in
           the cleft between these domains.  In addition, many of
           these proteins possess a metal-binding histidine-rich
           motif (repetitive HDH sequence).
          Length = 266

 Score = 33.1 bits (76), Expect = 0.22
 Identities = 9/20 (45%), Positives = 12/20 (60%)

Query: 239 HHHPTHHHHHHSHHTHHNSH 258
           HHH  H  H H HH +++ H
Sbjct: 94  HHHHHHGEHEHHHHGNYDPH 113



 Score = 31.6 bits (72), Expect = 0.70
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 244 HHHHHHSHHTHHNSHNTHSH 263
           HHHHHH  H HH+  N   H
Sbjct: 94  HHHHHHGEHEHHHHGNYDPH 113



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 233 KSVSPTHHHPTHHHHHHSH-HTHHNSHNTH 261
           K ++       HHHHH  H H HH +++ H
Sbjct: 84  KGITLIPMADHHHHHHGEHEHHHHGNYDPH 113



 Score = 28.5 bits (64), Expect = 6.3
 Identities = 20/88 (22%), Positives = 35/88 (39%), Gaps = 10/88 (11%)

Query: 242 PTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPHLSELAMLGLTKEDAMSAGLLHS 301
           P   HHHH H  H + H+ +    +D     SP     ++E     L + D  +A     
Sbjct: 90  PMADHHHHHHGEHEHHHHGN----YDPHIWLSPANAKIMAENIYEALAELDPQNATY--- 142

Query: 302 DFQQN--SYHQSFHQQDSELAAMVASGR 327
            +Q N  +        DSE+  +++  +
Sbjct: 143 -YQANLDALLAELDALDSEIRTILSKLK 169


>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
           family.  The immune response of the purple sea urchin
           appears to be more complex than previously believed in
           that it uses immune-related gene families homologous to
           vertebrate Toll-like and NOD/NALP-like receptor families
           as well as C-type lectins and a rudimentary complement
           system. In addition, the species also produces this
           unusual family of mRNAs, also known as 185/333, which is
           strongly upregulated in response to pathogen challenge.
          Length = 312

 Score = 32.3 bits (73), Expect = 0.41
 Identities = 10/41 (24%), Positives = 14/41 (34%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPH 279
           H H   HH H+              ++H  G +   ET  H
Sbjct: 177 HPHHHRHHGHNHFGRKPFGGRPFGRRNHTEGHQGHNETGDH 217



 Score = 31.9 bits (72), Expect = 0.52
 Identities = 16/87 (18%), Positives = 28/87 (32%), Gaps = 5/87 (5%)

Query: 189 DKPGKGSFWTLHKDSGNMFENGCYLRRQKRFKDDKKEVIRQTHHKSVSPTHHHPTH--HH 246
           D PG+           +  E   + +R +  ++D +       H   +     P      
Sbjct: 140 DGPGRRHHGHRQGPPQDRPEEQPFGQRNESSEEDGRPHPHHHRHHGHNHFGRKPFGGRPF 199

Query: 247 HHHSHHTHHNSHNT---HSHQSHDAGG 270
              +H   H  HN    H H+ H+  G
Sbjct: 200 GRRNHTEGHQGHNETGDHPHRHHNKTG 226


>gnl|CDD|237322 PRK13261, ureE, urease accessory protein UreE; Provisional.
          Length = 159

 Score = 31.1 bits (71), Expect = 0.56
 Identities = 6/16 (37%), Positives = 7/16 (43%)

Query: 240 HHPTHHHHHHSHHTHH 255
           +     HH HSH   H
Sbjct: 144 YGGAFRHHGHSHDHSH 159



 Score = 30.7 bits (70), Expect = 0.88
 Identities = 5/20 (25%), Positives = 7/20 (35%)

Query: 239 HHHPTHHHHHHSHHTHHNSH 258
                     H  H+H +SH
Sbjct: 140 ERGAYGGAFRHHGHSHDHSH 159



 Score = 29.1 bits (66), Expect = 2.6
 Identities = 5/15 (33%), Positives = 5/15 (33%)

Query: 238 THHHPTHHHHHHSHH 252
                 HH H H H 
Sbjct: 144 YGGAFRHHGHSHDHS 158


>gnl|CDD|236005 PRK07376, PRK07376, NAD(P)H-quinone oxidoreductase subunit F;
           Validated.
          Length = 673

 Score = 31.9 bits (73), Expect = 0.62
 Identities = 10/32 (31%), Positives = 13/32 (40%)

Query: 223 KKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTH 254
           +KE++           HH     HHHHS   H
Sbjct: 466 QKELLAAADAPGHHEGHHEEHGEHHHHSGSPH 497



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 3/43 (6%)

Query: 219 FKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTH 261
           F+ + KE+  Q    + +    H   HH  H  H HH+S + H
Sbjct: 458 FRGNDKEL--QKELLAAADAPGHHEGHHEEHGEH-HHHSGSPH 497



 Score = 29.6 bits (67), Expect = 4.2
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 241 HPTHHHHHHSHHTHHNSHNTHSHQS 265
            P HH  HH  H  H+ H+   H+S
Sbjct: 475 APGHHEGHHEEHGEHHHHSGSPHES 499


>gnl|CDD|237323 PRK13263, ureE, urease accessory protein UreE; Provisional.
          Length = 206

 Score = 31.3 bits (71), Expect = 0.65
 Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 2/33 (6%)

Query: 240 HHPTHHHHHHSHHTHHNSHN-THSHQSHDAGGK 271
               HH H HSH  H + H+  H H  H  G  
Sbjct: 167 VFHEHHGHSHSHSDHDHDHDHDHDHV-HGPGCG 198



 Score = 30.1 bits (68), Expect = 1.7
 Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 1/42 (2%)

Query: 228 RQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAG 269
           R  H  + +  +       H H  H+H +S + H H  HD  
Sbjct: 150 RHGHDATFAEDYALAQQVFHEHHGHSHSHSDHDHDH-DHDHD 190



 Score = 27.8 bits (62), Expect = 8.6
 Identities = 10/36 (27%), Positives = 11/36 (30%)

Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHD 267
           H   S +H    H H H   H H        H  H 
Sbjct: 171 HHGHSHSHSDHDHDHDHDHDHVHGPGCGHGHHHHHR 206



 Score = 27.8 bits (62), Expect = 9.3
 Identities = 12/40 (30%), Positives = 14/40 (35%)

Query: 226 VIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQS 265
           V  + H  S S + H   H H H   H     H  H H  
Sbjct: 167 VFHEHHGHSHSHSDHDHDHDHDHDHVHGPGCGHGHHHHHR 206


>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter.  The ZIP family consists of
           zinc transport proteins and many putative metal
           transporters. The main contribution to this family is
           from the Arabidopsis thaliana ZIP protein family these
           proteins are responsible for zinc uptake in the plant.
           Also found within this family are C. elegans proteins of
           unknown function which are annotated as being similar to
           human growth arrest inducible gene product, although
           this protein in not found within this family.
          Length = 314

 Score = 31.6 bits (72), Expect = 0.73
 Identities = 8/27 (29%), Positives = 9/27 (33%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQS 265
            H    HHHHH  H    S      + 
Sbjct: 138 EHGEGPHHHHHEGHEKGESDKESGLRV 164



 Score = 29.6 bits (67), Expect = 3.0
 Identities = 5/35 (14%), Positives = 9/35 (25%)

Query: 230 THHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQ 264
             +         P HHHH        +  +    +
Sbjct: 131 ESNVVHHEHGEGPHHHHHEGHEKGESDKESGLRVR 165



 Score = 28.9 bits (65), Expect = 4.8
 Identities = 7/31 (22%), Positives = 7/31 (22%)

Query: 230 THHKSVSPTHHHPTHHHHHHSHHTHHNSHNT 260
            H     P HHH   H    S          
Sbjct: 136 HHEHGEGPHHHHHEGHEKGESDKESGLRVRD 166



 Score = 28.9 bits (65), Expect = 5.2
 Identities = 7/38 (18%), Positives = 13/38 (34%)

Query: 232 HKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAG 269
            +       +  HH H    H HH+  +       ++G
Sbjct: 124 DEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESG 161



 Score = 28.5 bits (64), Expect = 7.2
 Identities = 5/33 (15%), Positives = 8/33 (24%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGK 271
           +     H     HH H       S +      +
Sbjct: 133 NVVHHEHGEGPHHHHHEGHEKGESDKESGLRVR 165


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 31.1 bits (70), Expect = 1.1
 Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 3/66 (4%)

Query: 14  PGMTPSYSMNSVINSCSPQSGST---GTPLGGFSSNMHSMGSINGMNGPNCMSTGSMGYS 70
            G++P+ +M S +N+ SP            GG  +   + G++N  +  N M+ G    +
Sbjct: 253 NGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENSMNQGMTPSA 312

Query: 71  SNMNAA 76
           S +N  
Sbjct: 313 SMINLN 318


>gnl|CDD|223597 COG0523, COG0523, Putative GTPases (G3E family) [General function
           prediction only].
          Length = 323

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 5/26 (19%), Positives = 7/26 (26%)

Query: 237 PTHHHPTHHHHHHSHHTHHNSHNTHS 262
                P    H H H       ++ S
Sbjct: 204 VGRTSPLFSDHQHDHEHDEGGISSFS 229



 Score = 30.0 bits (68), Expect = 2.5
 Identities = 7/33 (21%), Positives = 9/33 (27%)

Query: 226 VIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSH 258
                  + V  T    + H H H H     S 
Sbjct: 195 EGLFDLDRDVGRTSPLFSDHQHDHEHDEGGISS 227


>gnl|CDD|226908 COG4531, ZnuA, ABC-type Zn2+ transport system, periplasmic
           component/surface adhesin [Inorganic ion transport and
           metabolism].
          Length = 318

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 26/142 (18%), Positives = 41/142 (28%), Gaps = 14/142 (9%)

Query: 219 FKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKP 278
               K  + R+ H       H H      H   H HH+      H        K+     
Sbjct: 108 LPGVKPLLFREAHDHEEEHDHGHDHAGADHKGDHDHHHEGEYDMHLWLSPAIAKAVAAAI 167

Query: 279 HLSELAMLGLTKEDAMSAGLLHSDFQQN--SYHQSFHQQDSELAAMVASGRCHPHLIPPD 336
                    L + D  +A         N   +       D ++   +A  +  P  +  D
Sbjct: 168 AKK------LAELDPQNAAKY----DANLKDFEAQLAALDKKVGEELAPVKGKPFFVFHD 217

Query: 337 HYNHHLKQEYGASAVNHPFPVS 358
            Y  + +  YG   + H F VS
Sbjct: 218 AYG-YFENAYGLKPLGH-FTVS 237


>gnl|CDD|224151 COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion
           transport and metabolism].
          Length = 296

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 7/20 (35%), Positives = 8/20 (40%)

Query: 243 THHHHHHSHHTHHNSHNTHS 262
            H   HHSH   H   N  +
Sbjct: 1   MHGAAHHSHGHDHAHDNPRN 20


>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
           ABF1 is a sequence-specific DNA binding protein involved
           in transcription activation, gene silencing and
           initiation of DNA replication. ABF1 is known to remodel
           chromatin, and it is proposed that it mediates its
           effects on transcription and gene expression by
           modifying local chromatin architecture. These functions
           require a conserved stretch of 20 amino acids in the
           C-terminal region of ABF1 (amino acids 639 to 662 S.
           cerevisiae). The N-terminal two thirds of the protein
           are necessary for DNA binding, and the N-terminus (amino
           acids 9 to 91 in S. cerevisiae) is thought to contain a
           novel zinc-finger motif which may stabilise the protein
           structure.
          Length = 508

 Score = 30.5 bits (68), Expect = 1.9
 Identities = 16/44 (36%), Positives = 17/44 (38%), Gaps = 6/44 (13%)

Query: 229 QTHHKSVSPTHHH---PTHHHHHHSHHTHHNSHNTH---SHQSH 266
           Q HH    P HH    P  H H H HH  H   +      HQ H
Sbjct: 295 QHHHHPSIPAHHQHQLPEGHQHDHQHHHQHQQQDLQHQEQHQQH 338



 Score = 28.9 bits (64), Expect = 5.3
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 241 HPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPE 275
              HH H+   H HH+ HN H++    A   ++ +
Sbjct: 99  GGHHHGHNMHAHHHHHHHNGHTNGHGLAEDDRAED 133



 Score = 28.5 bits (63), Expect = 7.0
 Identities = 12/28 (42%), Positives = 14/28 (50%)

Query: 240 HHPTHHHHHHSHHTHHNSHNTHSHQSHD 267
           HH  H+ H H HH HHN H      + D
Sbjct: 101 HHHGHNMHAHHHHHHHNGHTNGHGLAED 128


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 29.9 bits (67), Expect = 3.4
 Identities = 19/116 (16%), Positives = 32/116 (27%), Gaps = 11/116 (9%)

Query: 203 SGNMFENGCYLRRQKRFK--DDKKEVIRQTHHKSVSPTHHHP----THHHHHHSHHTHHN 256
           S  +      +  Q         K    QT     S           H HH    + HH+
Sbjct: 221 SKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHS 280

Query: 257 SHNTHSHQ-----SHDAGGKKSPETKPHLSELAMLGLTKEDAMSAGLLHSDFQQNS 307
            H     Q      + +   +   +  + S+      +   A S G   S   ++S
Sbjct: 281 KHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHS 336


>gnl|CDD|238499 cd01017, AdcA, Metal binding protein AcdA.  These proteins have
           been shown to function in the ABC uptake of Zn2+ and
           Mn2+ and in competence for genetic transformation and
           adhesion.  The AcdA proteins belong to the TroA
           superfamily of helical backbone metal receptor proteins
           that share a distinct fold and ligand binding mechanism.
            They are comprised of two globular subdomains connected
           by a long alpha helix and they bind their ligand in the
           cleft between these domains.  In addition, many of these
           proteins have a low complexity region containing metal
           binding histidine-rich motif (repetitive HDH sequence).
          Length = 282

 Score = 29.2 bits (66), Expect = 4.0
 Identities = 8/18 (44%), Positives = 10/18 (55%)

Query: 244 HHHHHHSHHTHHNSHNTH 261
            H H HSH  HH  ++ H
Sbjct: 97  EHDHDHSHSHHHGDYDPH 114



 Score = 28.8 bits (65), Expect = 5.2
 Identities = 6/21 (28%), Positives = 8/21 (38%)

Query: 238 THHHPTHHHHHHSHHTHHNSH 258
                 H H H  HH  ++ H
Sbjct: 94  GGAEHDHDHSHSHHHGDYDPH 114


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 29.4 bits (67), Expect = 4.5
 Identities = 9/10 (90%), Positives = 9/10 (90%)

Query: 217 KRFKDDKKEV 226
           KRFKDD KEV
Sbjct: 704 KRFKDDVKEV 713


>gnl|CDD|185214 PRK15314, PRK15314, outer membrane protein RatB; Provisional.
          Length = 2435

 Score = 29.4 bits (65), Expect = 5.1
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 46  NMHSMGSINGMNGPNCMSTGSMGYSSNMNAACMGGINSYNTALTTG 91
           N+HS+ ++NG++ PN   T +M +  N + A  G  +  +  L  G
Sbjct: 760 NVHSVNALNGLSVPNASFTVTMSHGKNRDNATTGFTDPSDGTLVMG 805


>gnl|CDD|227443 COG5112, UFD2, U1-like Zn-finger-containing protein [General
           function prediction only].
          Length = 126

 Score = 27.7 bits (61), Expect = 5.6
 Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 2/56 (3%)

Query: 171 IRHSLSFNDCFVKVPRTPDKPGKGSFWTLHKDSGNMFENGCYLRRQKRFKDDKKEV 226
           I++ LS  +   K+P  P+ PG G  + +       F     L   K+ K  K+  
Sbjct: 31  IKNDLSTKESQKKLPYDPELPGLGQHYCIECA--RYFITEKALMEHKKGKVHKRRA 84


>gnl|CDD|236748 PRK10737, PRK10737, FKBP-type peptidyl-prolyl cis-trans isomerase;
           Provisional.
          Length = 196

 Score = 28.0 bits (62), Expect = 6.8
 Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 227 IRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGK 271
           IR+   + ++  H H  H HHH   H      + H H  H+ GG+
Sbjct: 139 IREATEEELAHGHVHGAHDHHHDHDHDGCCGGHGHDH-GHEHGGE 182


>gnl|CDD|172935 PRK14460, PRK14460, ribosomal RNA large subunit methyltransferase
           N; Provisional.
          Length = 354

 Score = 28.2 bits (63), Expect = 7.8
 Identities = 13/44 (29%), Positives = 19/44 (43%), Gaps = 6/44 (13%)

Query: 63  STGSMGYSSNMNAACMGGINSYNTALTTGRAELAENSPSPGALQ 106
           STG+MG+  NM    + G       +   R  L +N P    L+
Sbjct: 120 STGTMGFERNMTMGEILG------QVLVAREHLGDNGPDHPILR 157


>gnl|CDD|217749 pfam03824, NicO, High-affinity nickel-transport protein.  High
           affinity nickel transporters involved in the
           incorporation of nickel into H2-uptake hydrogenase and
           urease enzymes. Essential for the expression of
           catalytically active hydrogenase and urease. Ion uptake
           is dependent on proton motive force. HoxN in Alcaligenes
           eutrophus is thought to be an integral membrane protein
           with seven transmembrane helices. The family also
           includes a cobalt transporter.
          Length = 278

 Score = 28.1 bits (63), Expect = 8.1
 Identities = 3/41 (7%), Positives = 7/41 (17%)

Query: 239 HHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPH 279
                             + + H +     G   + E    
Sbjct: 121 AALQKRGLLSRILGPRLVTKHWHMYPVGGLGFDTATEVALL 161


>gnl|CDD|216419 pfam01297, SBP_bac_9, Periplasmic solute binding protein family.
           This family includes periplasmic solute binding proteins
           such as TroA that interacts with an ATP-binding cassette
           transport system in Treponema pallidum.
          Length = 263

 Score = 28.1 bits (63), Expect = 8.3
 Identities = 6/24 (25%), Positives = 7/24 (29%), Gaps = 1/24 (4%)

Query: 235 VSPTHHHPTHHHHHHSHHTHHNSH 258
            S       H   H  HH   + H
Sbjct: 79  ASEGIDLH-HGPEHEDHHGGGDPH 101



 Score = 28.1 bits (63), Expect = 9.3
 Identities = 6/18 (33%), Positives = 7/18 (38%)

Query: 244 HHHHHHSHHTHHNSHNTH 261
             HH   H  HH   + H
Sbjct: 84  DLHHGPEHEDHHGGGDPH 101


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.126    0.383 

Gapped
Lambda     K      H
   0.267   0.0630    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,231,530
Number of extensions: 1825357
Number of successful extensions: 2873
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2355
Number of HSP's successfully gapped: 133
Length of query: 417
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 318
Effective length of database: 6,546,556
Effective search space: 2081804808
Effective search space used: 2081804808
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.1 bits)