Query         021872
Match_columns 306
No_of_seqs    211 out of 399
Neff          3.7 
Searched_HMMs 46136
Date          Fri Mar 29 06:19:01 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/021872.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/021872hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF14379 Myb_CC_LHEQLE:  MYB-CC  99.9 1.7E-25 3.6E-30  163.9   6.8   51  136-186     1-51  (51)
  2 PLN03162 golden-2 like transcr  99.9 5.1E-24 1.1E-28  207.1   6.6   62   36-98    231-292 (526)
  3 TIGR01557 myb_SHAQKYF myb-like  99.9 4.7E-22   1E-26  147.8   6.5   56   40-95      1-56  (57)
  4 PF00249 Myb_DNA-binding:  Myb-  97.4 0.00043 9.2E-09   48.6   5.4   48   42-93      1-48  (48)
  5 smart00426 TEA TEA domain.      90.8    0.27 5.8E-06   38.7   3.2   46   44-91      5-67  (68)
  6 PF14379 Myb_CC_LHEQLE:  MYB-CC  89.9    0.85 1.8E-05   34.1   5.0   34  151-185     6-39  (51)
  7 PF15235 GRIN_C:  G protein-reg  78.0     1.7 3.6E-05   38.4   2.3   20  157-176    70-89  (137)
  8 TIGR02894 DNA_bind_RsfA transc  68.5      27 0.00059   31.7   7.7   52   37-94     43-94  (161)
  9 PF01519 DUF16:  Protein of unk  66.7      26 0.00056   29.7   6.7   20  161-180    70-89  (102)
 10 smart00501 BRIGHT BRIGHT, ARID  66.1     5.7 0.00012   31.3   2.7   47   47-94     32-85  (93)
 11 PF12776 Myb_DNA-bind_3:  Myb/S  64.2     9.4  0.0002   29.4   3.5   55   44-98      1-67  (96)
 12 smart00717 SANT SANT  SWI3, AD  62.0      38 0.00082   21.8   5.7   44   43-91      2-45  (49)
 13 PF01285 TEA:  TEA/ATTS domain   61.6     5.5 0.00012   40.6   2.2   55   38-92     45-112 (431)
 14 cd00167 SANT 'SWI3, ADA2, N-Co  60.6      39 0.00084   21.5   5.5   44   44-92      1-44  (45)
 15 cd07646 I-BAR_IMD_IRSp53 Inver  54.9 1.1E+02  0.0023   29.4   9.4   69  137-208    66-143 (232)
 16 cd07644 I-BAR_IMD_BAIAP2L2 Inv  48.3 1.3E+02  0.0028   28.6   8.7   67  138-208    65-141 (215)
 17 cd07645 I-BAR_IMD_BAIAP2L1 Inv  36.5   3E+02  0.0066   26.4   9.3   69  137-208    64-141 (226)
 18 KOG2856 Adaptor protein PACSIN  36.3 1.7E+02  0.0036   30.5   7.9   13  283-295   398-410 (472)
 19 cd08783 Death_MALT1 Death doma  34.5      66  0.0014   27.1   4.1   49   40-92     35-84  (97)
 20 PF01388 ARID:  ARID/BRIGHT DNA  30.1      29 0.00063   26.9   1.3   46   47-93     36-88  (92)
 21 PF07384 DUF1497:  Protein of u  29.4      45 0.00097   25.4   2.1   22   43-64     36-57  (59)
 22 PF00435 Spectrin:  Spectrin re  23.1   3E+02  0.0065   20.0   6.6   49  159-210    42-90  (105)
 23 PRK10803 tol-pal system protei  22.5 2.5E+02  0.0053   26.6   6.2   40  144-184    54-94  (263)
 24 PF08127 Propeptide_C1:  Peptid  22.4      53  0.0012   23.2   1.3   35   49-90      1-35  (41)
 25 PF09535 Gmx_para_CXXCG:  Prote  21.6      44 0.00096   32.0   1.0   12   50-61    214-225 (237)
 26 KOG3841 TEF-1 and related tran  20.3      67  0.0014   33.2   2.0   56   40-97     74-146 (455)

No 1  
>PF14379 Myb_CC_LHEQLE:  MYB-CC type transfactor, LHEQLE motif
Probab=99.92  E-value=1.7e-25  Score=163.86  Aligned_cols=51  Identities=88%  Similarity=1.125  Sum_probs=48.9

Q ss_pred             CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh
Q 021872          136 GYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALN  186 (306)
Q Consensus       136 ~~qi~EALr~QmEVQrrLhEQLEVQR~LQlRIEaQGKYLqsiLekaq~~L~  186 (306)
                      +++|+||||+||||||||||||||||+||+|||||||||++|||||+++++
T Consensus         1 g~~i~EALr~QmEvQrrLhEQLEvQr~Lqlrieaqgkyl~~ilek~~~~~s   51 (51)
T PF14379_consen    1 GMQITEALRMQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQKALS   51 (51)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC
Confidence            578999999999999999999999999999999999999999999999863


No 2  
>PLN03162 golden-2 like transcription factor; Provisional
Probab=99.89  E-value=5.1e-24  Score=207.11  Aligned_cols=62  Identities=48%  Similarity=0.748  Sum_probs=58.9

Q ss_pred             cCCCCCCcccCHHHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhhhhhcccc
Q 021872           36 TTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQA   98 (306)
Q Consensus        36 s~~~K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQKyRl~~~~   98 (306)
                      ...+|+||+||+|||++||+||++|| .++||||+||++|+|+|||++||||||||||+.+++
T Consensus       231 ~g~KKpRLrWTpELH~rFVeAV~qLG-~dKATPK~ILelMnV~GLTRenVKSHLQKYRl~rk~  292 (526)
T PLN03162        231 PGKKKAKVDWTPELHRRFVHAVEQLG-VEKAFPSRILELMGVQCLTRHNIASHLQKYRSHRRH  292 (526)
T ss_pred             CCCCCCcccCCHHHHHHHHHHHHHhC-cCccchHHHHHHcCCCCcCHHHHHHHHHHHHHhccc
Confidence            34789999999999999999999999 799999999999999999999999999999998874


No 3  
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class. This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by Pfam model pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif.
Probab=99.86  E-value=4.7e-22  Score=147.78  Aligned_cols=56  Identities=63%  Similarity=0.997  Sum_probs=54.4

Q ss_pred             CCCcccCHHHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhhhhhc
Q 021872           40 KPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLG   95 (306)
Q Consensus        40 K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQKyRl~   95 (306)
                      |+|++||+|+|++|++||+.||+.+.||||.|+++|++++||+.||+|||||||+.
T Consensus         1 k~r~~WT~eeh~~Fl~ai~~~G~g~~a~pk~I~~~~~~~~lT~~qV~SH~QKy~~k   56 (57)
T TIGR01557         1 KPRVVWTEDLHDRFLQAVQKLGGPDWATPKRILELMVVDGLTRDQVASHLQKYRLK   56 (57)
T ss_pred             CCCCCCCHHHHHHHHHHHHHhCCCcccchHHHHHHcCCCCCCHHHHHHHHHHHHcc
Confidence            78999999999999999999998899999999999999999999999999999985


No 4  
>PF00249 Myb_DNA-binding:  Myb-like DNA-binding domain;  InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins. These belong to the SANT domain family that specifically recognise the sequence YAAC(G/T)G [, ]. In myb, one of the most conserved regions consisting of three tandem repeats has been shown to be involved in DNA-binding [].; PDB: 1X41_A 2XAF_B 2XAG_B 2XAH_B 2UXN_B 2Y48_B 2XAQ_B 2X0L_B 2IW5_B 2XAJ_B ....
Probab=97.39  E-value=0.00043  Score=48.62  Aligned_cols=48  Identities=31%  Similarity=0.439  Sum_probs=41.5

Q ss_pred             CcccCHHHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhhhh
Q 021872           42 RLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYR   93 (306)
Q Consensus        42 RlrWT~ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQKyR   93 (306)
                      |-.||+|=++.|++||.++|. +  .-+.|-+.|+ .+-|..++++|.++|+
T Consensus         1 r~~Wt~eE~~~l~~~v~~~g~-~--~W~~Ia~~~~-~~Rt~~qc~~~~~~~~   48 (48)
T PF00249_consen    1 RGPWTEEEDEKLLEAVKKYGK-D--NWKKIAKRMP-GGRTAKQCRSRYQNLL   48 (48)
T ss_dssp             S-SS-HHHHHHHHHHHHHSTT-T--HHHHHHHHHS-SSSTHHHHHHHHHHHT
T ss_pred             CCCCCHHHHHHHHHHHHHhCC-c--HHHHHHHHcC-CCCCHHHHHHHHHhhC
Confidence            568999999999999999993 2  6899999998 8999999999999985


No 5  
>smart00426 TEA TEA domain.
Probab=90.77  E-value=0.27  Score=38.69  Aligned_cols=46  Identities=22%  Similarity=0.375  Sum_probs=29.9

Q ss_pred             ccCHHHHHHHHHHHHHhCCCCCCChh-HHHHh---hC-----------CCCC--CHHHHHHhhhh
Q 021872           44 RWTAELHDRFVDAVTQLGGPDKATPK-TIMRT---MG-----------VKGL--TLYHLKSHLQK   91 (306)
Q Consensus        44 rWT~ELH~rFV~AV~qLGG~~kAtPK-~Il~l---M~-----------v~gL--T~~hVkSHLQK   91 (306)
                      .|.++|-..|++|+...-  ...+-| .+...   .|           ..|.  |..+|.||+|.
T Consensus         5 vWp~~lE~Af~~aL~~~~--~~g~~kik~~~r~k~~gRNelIs~YI~~~tGk~Rt~KQVsShIQv   67 (68)
T smart00426        5 VWSPDIEQAFQEALAIYP--PCGRRKIILSDEGKMYGRNELIARYIKLRTGKTRTRKQVSSHIQV   67 (68)
T ss_pred             cCcHHHHHHHHHHHHHcC--ccCcccchhhhcCcccchhHHHHHHHHHHhCCccchhhhcchhee
Confidence            699999999999998775  222222 22211   01           2454  77899999984


No 6  
>PF14379 Myb_CC_LHEQLE:  MYB-CC type transfactor, LHEQLE motif
Probab=89.90  E-value=0.85  Score=34.11  Aligned_cols=34  Identities=44%  Similarity=0.507  Sum_probs=27.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHh
Q 021872          151 RRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKAL  185 (306)
Q Consensus       151 rrLhEQLEVQR~LQlRIEaQGKYLqsiLekaq~~L  185 (306)
                      --|..|+||||+|.=.+|.| |-||.-+|...+-|
T Consensus         6 EALr~QmEvQrrLhEQLEvQ-r~Lqlrieaqgkyl   39 (51)
T PF14379_consen    6 EALRMQMEVQRRLHEQLEVQ-RHLQLRIEAQGKYL   39 (51)
T ss_pred             HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhHHH
Confidence            56889999999999999999 67887777665655


No 7  
>PF15235 GRIN_C:  G protein-regulated inducer of neurite outgrowth C-terminus
Probab=77.95  E-value=1.7  Score=38.40  Aligned_cols=20  Identities=20%  Similarity=0.400  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHHHhhHHHHH
Q 021872          157 LEVQRRLQLRIEAQGKYLQS  176 (306)
Q Consensus       157 LEVQR~LQlRIEaQGKYLqs  176 (306)
                      +.||+||+++||+|++....
T Consensus        70 ~AIQkHLE~qi~e~~~q~~~   89 (137)
T PF15235_consen   70 MAIQKHLERQIEEHERQRAP   89 (137)
T ss_pred             HHHHHHHHHHHHHhhhcccc
Confidence            34999999999999987754


No 8  
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=68.47  E-value=27  Score=31.67  Aligned_cols=52  Identities=23%  Similarity=0.326  Sum_probs=37.5

Q ss_pred             CCCCCCcccCHHHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhhhhh
Q 021872           37 TDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRL   94 (306)
Q Consensus        37 ~~~K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQKyRl   94 (306)
                      |..-..|||+..+-..+.+||..-- -.+-.++..     ...||+.+|-+-||.|..
T Consensus        43 TsAACGFRWNs~VRkqY~~~i~~AK-kqRk~~~~~-----~~~ltl~~vI~fLq~l~~   94 (161)
T TIGR02894        43 TAAACGFRWNAYVRKQYEEAIELAK-KQRKELKRE-----AGSLTLQDVISFLQNLKT   94 (161)
T ss_pred             cHHHhcchHHHHHHHHHHHHHHHHH-HHHhccccC-----cccCCHHHHHHHHHHHHh
Confidence            4567899999999999999997532 111111111     266999999999998863


No 9  
>PF01519 DUF16:  Protein of unknown function DUF16;  InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=66.70  E-value=26  Score=29.72  Aligned_cols=20  Identities=50%  Similarity=0.637  Sum_probs=18.0

Q ss_pred             HHHHHHHHHhhHHHHHHHHH
Q 021872          161 RRLQLRIEAQGKYLQSILEK  180 (306)
Q Consensus       161 R~LQlRIEaQGKYLqsiLek  180 (306)
                      +.||.+|.+||+-|++|++.
T Consensus        70 kel~~e~k~qgktL~~I~~~   89 (102)
T PF01519_consen   70 KELQVEQKAQGKTLQLILKT   89 (102)
T ss_dssp             HHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            78999999999999999843


No 10 
>smart00501 BRIGHT BRIGHT, ARID (A/T-rich interaction domain) domain. DNA-binding domain containing a helix-turn-helix structure
Probab=66.10  E-value=5.7  Score=31.29  Aligned_cols=47  Identities=30%  Similarity=0.464  Sum_probs=36.3

Q ss_pred             HHHHHHHHHHHHHhCCCCCCC----hhHHHHhhCCCCC---CHHHHHHhhhhhhh
Q 021872           47 AELHDRFVDAVTQLGGPDKAT----PKTIMRTMGVKGL---TLYHLKSHLQKYRL   94 (306)
Q Consensus        47 ~ELH~rFV~AV~qLGG~~kAt----PK~Il~lM~v~gL---T~~hVkSHLQKyRl   94 (306)
                      -+|++.|. +|..+||.+..+    =+.|.+.||++.-   ...++|+|.+||=+
T Consensus        32 vdL~~Ly~-~V~~~GG~~~v~~~~~W~~Va~~lg~~~~~~~~~~~lk~~Y~k~L~   85 (93)
T smart00501       32 LDLYRLYR-LVQERGGYDQVTKDKKWKEIARELGIPDTSTSAASSLRKHYERYLL   85 (93)
T ss_pred             CcHHHHHH-HHHHccCHHHHcCCCCHHHHHHHhCCCcccchHHHHHHHHHHHHhH
Confidence            38999998 588999987544    3678899998752   35689999998844


No 11 
>PF12776 Myb_DNA-bind_3:  Myb/SANT-like DNA-binding domain;  InterPro: IPR024752 This domain, found in a range of uncharacterised proteins, may be related to Myb/SANT-like DNA binding domains.
Probab=64.18  E-value=9.4  Score=29.39  Aligned_cols=55  Identities=16%  Similarity=0.257  Sum_probs=36.7

Q ss_pred             ccCHHHHHHHHHHHHHh---CCC-CCCCh-----hHHHHhhCC---CCCCHHHHHHhhhhhhhcccc
Q 021872           44 RWTAELHDRFVDAVTQL---GGP-DKATP-----KTIMRTMGV---KGLTLYHLKSHLQKYRLGKQA   98 (306)
Q Consensus        44 rWT~ELH~rFV~AV~qL---GG~-~kAtP-----K~Il~lM~v---~gLT~~hVkSHLQKyRl~~~~   98 (306)
                      +||++..+-||+.+-+.   |.- ....+     ..|.+.++-   -.+|..||++|+...|..-+.
T Consensus         1 ~Wt~~~~~~ll~~~~e~~~~g~~~~~~~fk~~~w~~i~~~~~~~~~~~~t~~qlknk~~~lk~~y~~   67 (96)
T PF12776_consen    1 SWTPEMTRFLLDLLIEQINKGNRPTNGGFKKEGWNNIAEEFNEKTGLNYTKKQLKNKWKTLKKDYRI   67 (96)
T ss_pred             CCChHHHHHHHHHHHHHHHhCCCCCCCCcCHHHHHHHHHHHHHHhCCcccHHHHHHHHHHHHHHHHH
Confidence            69999999999988443   433 13333     446665552   346889999998876664443


No 12 
>smart00717 SANT SANT  SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
Probab=61.99  E-value=38  Score=21.83  Aligned_cols=44  Identities=18%  Similarity=0.324  Sum_probs=33.6

Q ss_pred             cccCHHHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhh
Q 021872           43 LRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQK   91 (306)
Q Consensus        43 lrWT~ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQK   91 (306)
                      -.||++=...|+.+|.++| .  ..-+.|-+.|+  +=|..+|+.+...
T Consensus         2 ~~Wt~~E~~~l~~~~~~~g-~--~~w~~Ia~~~~--~rt~~~~~~~~~~   45 (49)
T smart00717        2 GEWTEEEDELLIELVKKYG-K--NNWEKIAKELP--GRTAEQCRERWNN   45 (49)
T ss_pred             CCCCHHHHHHHHHHHHHHC-c--CCHHHHHHHcC--CCCHHHHHHHHHH
Confidence            4699999999999999999 1  23566766664  7788888877654


No 13 
>PF01285 TEA:  TEA/ATTS domain family;  InterPro: IPR000818 Transcriptional enhancer activators are nuclear proteins that contain a TEA/ATTSdomain, a DNA-binding region of 66-68 amino acids. The TEA/ATTS domain is found in the N-termini of certain gene regulatory proteins, such as the Simian virus 40 (SV40) enhancer factor TEF-1, yeast trans-acting factor TEC-1 (which is required for TY1 enhancer activity), and the Aspergillus abaA regulatory gene product. SV40 and retroviral enhancers, and those to which TEF-1, TEC-1 and abaA proteins bind, contain GT-IIC sites: the TEA/ATTS domain may therefore recognise and bind such sites. Secondary structure predictions suggest the presence of 3 helices, but have not confirmed the presence of the helix-turn-helix motif characteristic of many DNA-binding proteins: DNA-binding may therefore be effected by a different mechanism [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3L15_B 2HZD_A 3KYS_A 3JUA_C 4EAZ_B.
Probab=61.55  E-value=5.5  Score=40.61  Aligned_cols=55  Identities=18%  Similarity=0.255  Sum_probs=29.4

Q ss_pred             CCCCCcccCHHHHHHHHHHHHHhCCCCCCChh----------HHHHhhCC-CC--CCHHHHHHhhhhh
Q 021872           38 DPKPRLRWTAELHDRFVDAVTQLGGPDKATPK----------TIMRTMGV-KG--LTLYHLKSHLQKY   92 (306)
Q Consensus        38 ~~K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK----------~Il~lM~v-~g--LT~~hVkSHLQKy   92 (306)
                      +.+..-+|++|+...|++|+...---...+-+          -|-+-+.. .|  =|+.+|.||+|..
T Consensus        45 ~~~~~~vw~~~~e~af~~al~~~~~~g~~k~~~~~~~~grn~li~~yi~~~tg~~rt~kqvsshiqvl  112 (431)
T PF01285_consen   45 DGDGEGVWPPDIEQAFQEALAIYPPCGRRKLSDEGKMYGRNELISDYIKLKTGKTRTRKQVSSHIQVL  112 (431)
T ss_dssp             -GGGS--S-HHHHHHHHHHHHHS-SSS---HHHH-----THHHHHHHHHHHHS----SHHHHHHHHHH
T ss_pred             CCCCCCCCCHHHHHHHHHHHHhCCCCCCcccccccccccchhHHHHHHHHHhCcccchhHHHHHHHHH
Confidence            45678899999999999999776411122211          11111110 23  3778999999988


No 14 
>cd00167 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Probab=60.57  E-value=39  Score=21.53  Aligned_cols=44  Identities=20%  Similarity=0.363  Sum_probs=33.6

Q ss_pred             ccCHHHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhhh
Q 021872           44 RWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKY   92 (306)
Q Consensus        44 rWT~ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQKy   92 (306)
                      .||++=+..|+.++.++|-   ..-+.|-+.|+  +=|..+|+.|..++
T Consensus         1 ~Wt~eE~~~l~~~~~~~g~---~~w~~Ia~~~~--~rs~~~~~~~~~~~   44 (45)
T cd00167           1 PWTEEEDELLLEAVKKYGK---NNWEKIAKELP--GRTPKQCRERWRNL   44 (45)
T ss_pred             CCCHHHHHHHHHHHHHHCc---CCHHHHHhHcC--CCCHHHHHHHHHHh
Confidence            4999999999999999992   24567777764  46778888876543


No 15 
>cd07646 I-BAR_IMD_IRSp53 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Insulin Receptor tyrosine kinase Substrate p53. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. IRSp53 (Insulin Receptor tyrosine kinase Substrate p53) is also known as BAIAP2 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2). It is a scaffolding protein that takes part in many signaling pathways including Cdc42-induced filopodia formation, Rac-mediated lamellipodia extension, and spine morphogenesis. IRSp53 exists as multiple splicing variants that differ mainly at the C-termini. One variant (T-form) is expressed exclusively in human breast cancer cells. The gene encoding IRSp53 is a putative susceptibility gene for Gilles de la Tourette syndrome. IRSp53 contains an N-terminal IMD, a CRIB (Cdc42 and Rac interactive binding motif), an SH3 domain, and a WASP 
Probab=54.92  E-value=1.1e+02  Score=29.40  Aligned_cols=69  Identities=28%  Similarity=0.414  Sum_probs=50.4

Q ss_pred             cchHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHhhHHHHHHHHHHHHHhhHHHHhhhcHHHHHHHHHHHHh
Q 021872          137 YQVTEALRVQMEVQRRLHEQLEVQ---------RRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAI  207 (306)
Q Consensus       137 ~qi~EALr~QmEVQrrLhEQLEVQ---------R~LQlRIEaQGKYLqsiLekaq~~L~~~~~~~~gle~a~~eLs~l~~  207 (306)
                      -.|..||.-=-+|+|.++.+||++         ..|+-++|..-||+...+.+=   ...+-....++|-+++||-.+--
T Consensus        66 keLG~~L~~m~~~hr~i~~~le~~lk~Fh~eli~pLE~k~E~D~k~i~a~~Kky---~~e~k~k~~sleK~qseLKKlRr  142 (232)
T cd07646          66 KELGDVLFQMAEVHRQIQNQLEEMLKSFHNELLTQLEQKVELDSRYLTAALKKY---QTEHRSKGESLEKCQAELKKLRK  142 (232)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH
Confidence            345666644347777777777644         368899999999999877543   33466777899999999988765


Q ss_pred             h
Q 021872          208 K  208 (306)
Q Consensus       208 ~  208 (306)
                      +
T Consensus       143 K  143 (232)
T cd07646         143 K  143 (232)
T ss_pred             h
Confidence            5


No 16 
>cd07644 I-BAR_IMD_BAIAP2L2 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 2. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. This group is composed of uncharacterized proteins known as BAIAP2L2 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 2). They contain an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus. The related proteins, BAIAP2L1 and IRSp53, function as regulators of membrane dynamics and the actin cytoskeleton. The IMD domain binds and bundles actin filaments, binds membranes and produces membrane protrusions, and interacts with the small GTPase Rac.
Probab=48.33  E-value=1.3e+02  Score=28.63  Aligned_cols=67  Identities=25%  Similarity=0.295  Sum_probs=44.4

Q ss_pred             chHHHHHHHH-HHHHHHHHHHHHH---------HHHHHHHHHhhHHHHHHHHHHHHHhhHHHHhhhcHHHHHHHHHHHHh
Q 021872          138 QVTEALRVQM-EVQRRLHEQLEVQ---------RRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAI  207 (306)
Q Consensus       138 qi~EALr~Qm-EVQrrLhEQLEVQ---------R~LQlRIEaQGKYLqsiLekaq~~L~~~~~~~~gle~a~~eLs~l~~  207 (306)
                      .+.++| ||| ||+|+|+.|||+.         .+|.-..|---||++.-+.+=   ...|-.-+.+||-+.+||-.+--
T Consensus        65 ~LG~vL-mqisev~r~i~~~le~~lk~FH~ell~~LEkk~elD~kyi~~s~KkY---q~E~r~k~dsleK~~selkk~rr  140 (215)
T cd07644          65 SLGEIL-IQMSETQRKLSADLEVVFQTFHVDLLQHMDKNTKLDMQFIEDSRRVY---ELEYRHRAANLEKCMSELWRMER  140 (215)
T ss_pred             HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH---HHHHHHhhhhHHHHHHHHHHHHH
Confidence            355555 555 9999999999854         345556666778887766443   33456666778877776666544


Q ss_pred             h
Q 021872          208 K  208 (306)
Q Consensus       208 ~  208 (306)
                      +
T Consensus       141 k  141 (215)
T cd07644         141 Q  141 (215)
T ss_pred             h
Confidence            4


No 17 
>cd07645 I-BAR_IMD_BAIAP2L1 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 1. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. BAIAP2L1 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 1) is also known as IRTKS (Insulin Receptor Tyrosine Kinase Substrate). It is widely expressed, serves as a substrate for the insulin receptor, and binds the small GTPase Rac. It plays a role in regulating the actin cytoskeleton and colocalizes with F-actin, cortactin, VASP, and vinculin. BAIAP2L1 expression leads to the formation of short actin bundles, distinct from filopodia-like protrusions induced by the expression of the related protein IRSp53. It contains an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus. The IMD domain of 
Probab=36.54  E-value=3e+02  Score=26.38  Aligned_cols=69  Identities=19%  Similarity=0.326  Sum_probs=52.0

Q ss_pred             cchHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHhhHHHHHHHHHHHHHhhHHHHhhhcHHHHHHHHHHHHh
Q 021872          137 YQVTEALRVQMEVQRRLHEQLEVQ---------RRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAI  207 (306)
Q Consensus       137 ~qi~EALr~QmEVQrrLhEQLEVQ---------R~LQlRIEaQGKYLqsiLekaq~~L~~~~~~~~gle~a~~eLs~l~~  207 (306)
                      ..|.++|--=-||+|+++.|||..         ..|.-.+|..-||+...+.+=   ...+-.-..+||-+.++|--+--
T Consensus        64 keLG~~L~qi~ev~r~i~~~le~~lK~Fh~Ell~~LE~k~elD~kyi~a~~Kky---q~E~k~k~dsLeK~~seLKK~RR  140 (226)
T cd07645          64 KELGHVLMEISDVHKKLNDSLEENFKKFHREIIAELERKTDLDVKYMTATLKRY---QTEHKNKLDSLEKSQADLKKIRR  140 (226)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHh
Confidence            345666643348999998887733         467889999999999999763   33466777899999999888876


Q ss_pred             h
Q 021872          208 K  208 (306)
Q Consensus       208 ~  208 (306)
                      +
T Consensus       141 K  141 (226)
T cd07645         141 K  141 (226)
T ss_pred             c
Confidence            6


No 18 
>KOG2856 consensus Adaptor protein PACSIN [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport; Cytoskeleton]
Probab=36.32  E-value=1.7e+02  Score=30.46  Aligned_cols=13  Identities=31%  Similarity=0.751  Sum_probs=10.0

Q ss_pred             CCCCCCCCCcccc
Q 021872          283 GNGESLPLEGNMR  295 (306)
Q Consensus       283 ~~~~~~~~~~~~r  295 (306)
                      +|||+.|++|+-.
T Consensus       398 ~nGdsnPfdd~a~  410 (472)
T KOG2856|consen  398 SNGDSNPFDDDAK  410 (472)
T ss_pred             CCCCCCCCccccc
Confidence            5789999987743


No 19 
>cd08783 Death_MALT1 Death domain similar to that found in Mucosa-associated lymphoid tissue-lymphoma-translocation gene 1. Death domain (DD) similar to that found in Malt1 (mucosa-associated lymphoid tissue-lymphoma-translocation gene 1). Malt1, together with  Bcl10 (B-cell lymphoma 10), are the integral components of the CBM signalosome. They associate with CARD9 to form M-CBM (CBM complex in myeloid immune cells) and with CARMA1 to form L-CBM (CBM complex in lymphoid immune cells), to mediate activation of NF-kB and MAPK by ITAM-coupled receptors expressed on immune cells. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins 
Probab=34.46  E-value=66  Score=27.12  Aligned_cols=49  Identities=20%  Similarity=0.448  Sum_probs=39.7

Q ss_pred             CCCcccCH-HHHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhhhh
Q 021872           40 KPRLRWTA-ELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKY   92 (306)
Q Consensus        40 K~RlrWT~-ELH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQKy   92 (306)
                      .+|||-++ ||+++-...+..-|    .--|..|+.|++.|=|+.++-.-||+-
T Consensus        35 ~~~fr~S~~el~~cslkvl~p~g----SPsk~LL~~~~~rg~Tv~~Ll~~L~~M   84 (97)
T cd08783          35 RGRFRLSCLDLEQCSLKVLEPEG----SPSRSLLKLLGERGCTVTELSEFLQAM   84 (97)
T ss_pred             CCccccCHHHHHHHHHHHhcCCC----CchHHHHHHHHHcCCcHHHHHHHHHHh
Confidence            34888888 88888887776555    234889999999999999999998864


No 20 
>PF01388 ARID:  ARID/BRIGHT DNA binding domain;  InterPro: IPR001606 Members of the recently discovered ARID (AT-rich interaction domain; also known as BRIGHT domain)) family of DNA-binding proteins are found in fungi and invertebrate and vertebrate metazoans. ARID-encoding genes are involved in a variety of biological processes including embryonic development, cell lineage gene regulation and cell cycle control. Although the specific roles of this domain and of ARID-containing proteins in transcriptional regulation are yet to be elucidated, they include both positive and negative transcriptional regulation and a likely involvement in the modification of chromatin structure []. The basic structure of the ARID domain domain appears to be a series of six alpha-helices separated by beta-strands, loops, or turns, but the structured region may extend to an additional helix at either or both ends of the basic six. Based on primary sequence homology, they can be partitioned into three structural classes: Minimal ARID proteins that consist of a core domain formed by six alpha helices; ARID proteins that supplement the core domain with an N-terminal alpha-helix; and Extended-ARID proteins, which contain the core domain and additional alpha-helices at their N- and C-termini. The human SWI-SNF complex protein p270 is an ARID family member with non-sequence-specific DNA binding activity. The ARID consensus and other structural features are common to both p270 and yeast SWI1, suggesting that p270 is a human counterpart of SWI1 []. The approximately 100-residue ARID sequence is present in a series of proteins strongly implicated in the regulation of cell growth, development, and tissue-specific gene expression. Although about a dozen ARID proteins can be identified from database searches, to date, only Bright (a regulator of B-cell-specific gene expression), dead ringer (a Drosophila melanogaster gene product required for normal development), and MRF-2 (which represses expression from the Cytomegalovirus enhancer) have been analyzed directly in regard to their DNA binding properties. Each binds preferentially to AT-rich sites. In contrast, p270 shows no sequence preference in its DNA binding activity, thereby demonstrating that AT-rich binding is not an intrinsic property of ARID domains and that ARID family proteins may be involved in a wider range of DNA interactions [].; GO: 0003677 DNA binding, 0005622 intracellular; PDB: 1C20_A 1KQQ_A 2JRZ_A 2LM1_A 2YQE_A 2JXJ_A 2EH9_A 2CXY_A 2LI6_A 1KN5_A ....
Probab=30.07  E-value=29  Score=26.87  Aligned_cols=46  Identities=33%  Similarity=0.503  Sum_probs=32.1

Q ss_pred             HHHHHHHHHHHHHhCCCCCCCh----hHHHHhhCCCCC-C--HHHHHHhhhhhh
Q 021872           47 AELHDRFVDAVTQLGGPDKATP----KTIMRTMGVKGL-T--LYHLKSHLQKYR   93 (306)
Q Consensus        47 ~ELH~rFV~AV~qLGG~~kAtP----K~Il~lM~v~gL-T--~~hVkSHLQKyR   93 (306)
                      -+|+..|. +|...||.++.+-    ..|-+.||++.- +  ...++.|..+|=
T Consensus        36 vDL~~Ly~-~V~~~GG~~~V~~~~~W~~va~~lg~~~~~~~~~~~L~~~Y~~~L   88 (92)
T PF01388_consen   36 VDLYKLYK-AVMKRGGFDKVTKNKKWREVARKLGFPPSSTSAAQQLRQHYEKYL   88 (92)
T ss_dssp             -SHHHHHH-HHHHHTSHHHHHHHTTHHHHHHHTTS-TTSCHHHHHHHHHHHHHT
T ss_pred             CcHHHHHH-HHHhCcCcccCcccchHHHHHHHhCCCCCCCcHHHHHHHHHHHHh
Confidence            37888886 5668999876554    468888998653 2  257888888774


No 21 
>PF07384 DUF1497:  Protein of unknown function (DUF1497);  InterPro: IPR009972 This entry is represented by Bacteriophage r1t, Orf17. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage and bacterial proteins of around 59 residues in length. Members of this family seem to be found exclusively in Lactococcus lactis and the bacteriophages that infect this organism. The function of this family is unknown.
Probab=29.41  E-value=45  Score=25.41  Aligned_cols=22  Identities=23%  Similarity=0.635  Sum_probs=19.1

Q ss_pred             cccCHHHHHHHHHHHHHhCCCC
Q 021872           43 LRWTAELHDRFVDAVTQLGGPD   64 (306)
Q Consensus        43 lrWT~ELH~rFV~AV~qLGG~~   64 (306)
                      -++..|+|..|-+-|..|||-+
T Consensus        36 ~kfnqem~aefheri~klggk~   57 (59)
T PF07384_consen   36 NKFNQEMQAEFHERIKKLGGKN   57 (59)
T ss_pred             hHhhHHHHHHHHHHHHHhcccc
Confidence            3578999999999999999854


No 22 
>PF00435 Spectrin:  Spectrin repeat;  InterPro: IPR002017 Spectrin repeats [] are found in several proteins involved in cytoskeletal structure. These include spectrin alpha and beta subunits [, ], alpha-actinin [] and dystrophin. The spectrin repeat forms a three-helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.; GO: 0005515 protein binding; PDB: 1HCI_A 1QUU_A 3FB2_B 1S35_A 1U5P_A 1U4Q_A 1CUN_B 1YDI_B 3EDV_A 1AJ3_A ....
Probab=23.13  E-value=3e+02  Score=19.97  Aligned_cols=49  Identities=24%  Similarity=0.338  Sum_probs=26.9

Q ss_pred             HHHHHHHHHHHhhHHHHHHHHHHHHHhhHHHHhhhcHHHHHHHHHHHHhhhh
Q 021872          159 VQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVS  210 (306)
Q Consensus       159 VQR~LQlRIEaQGKYLqsiLekaq~~L~~~~~~~~gle~a~~eLs~l~~~~~  210 (306)
                      -.+.++--|..+..-+..|.+.++.-....   +..-...+..+.+|.....
T Consensus        42 ~~~~~~~ei~~~~~~l~~l~~~~~~L~~~~---~~~~~~i~~~~~~l~~~w~   90 (105)
T PF00435_consen   42 KHKELQEEIESRQERLESLNEQAQQLIDSG---PEDSDEIQEKLEELNQRWE   90 (105)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT---HTTHHHHHHHHHHHHHHHH
T ss_pred             HHhhhhhHHHHHHHHHHHHHHHHHHHHHcC---CCcHHHHHHHHHHHHHHHH
Confidence            334445556666677777777776653322   3333445555666555433


No 23 
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=22.45  E-value=2.5e+02  Score=26.58  Aligned_cols=40  Identities=18%  Similarity=0.341  Sum_probs=31.1

Q ss_pred             HHHHHHHHHHHH-HHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q 021872          144 RVQMEVQRRLHE-QLEVQRRLQLRIEAQGKYLQSILEKACKA  184 (306)
Q Consensus       144 r~QmEVQrrLhE-QLEVQR~LQlRIEaQGKYLqsiLekaq~~  184 (306)
                      +.+.|+|.+|.+ |-|| +.|.=.||.+.--|+.|.+++.+-
T Consensus        54 ~~~~~l~~ql~~lq~ev-~~LrG~~E~~~~~l~~~~~rq~~~   94 (263)
T PRK10803         54 QLLTQLQQQLSDNQSDI-DSLRGQIQENQYQLNQVVERQKQI   94 (263)
T ss_pred             HHHHHHHHHHHHHHHHH-HHHhhHHHHHHHHHHHHHHHHHHH
Confidence            456788888866 4445 357788999999999999988774


No 24 
>PF08127 Propeptide_C1:  Peptidase family C1 propeptide;  InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=22.39  E-value=53  Score=23.23  Aligned_cols=35  Identities=23%  Similarity=0.273  Sum_probs=20.7

Q ss_pred             HHHHHHHHHHHhCCCCCCChhHHHHhhCCCCCCHHHHHHhhh
Q 021872           49 LHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQ   90 (306)
Q Consensus        49 LH~rFV~AV~qLGG~~kAtPK~Il~lM~v~gLT~~hVkSHLQ   90 (306)
                      |-+.||+-||...-..+|-.-       .++.|..+|++.|-
T Consensus         1 Lsde~I~~IN~~~~tWkAG~N-------F~~~~~~~ik~LlG   35 (41)
T PF08127_consen    1 LSDEFIDYINSKNTTWKAGRN-------FENTSIEYIKRLLG   35 (41)
T ss_dssp             S-HHHHHHHHHCT-SEEE-----------SSB-HHHHHHCS-
T ss_pred             CCHHHHHHHHcCCCcccCCCC-------CCCCCHHHHHHHcC
Confidence            457899999998766666543       46777777776654


No 25 
>PF09535 Gmx_para_CXXCG:  Protein of unknown function (Gmx_para_CXXCG);  InterPro: IPR011750 This entry consists of at least 10 paralogous proteins from Myxococcus xanthus that lack detectable sequence similarity to any other protein family. An imperfectly conserved CXXCG motif, a probable binding site, appears twice in the multiple sequence alignment.
Probab=21.58  E-value=44  Score=32.05  Aligned_cols=12  Identities=58%  Similarity=1.057  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHhC
Q 021872           50 HDRFVDAVTQLG   61 (306)
Q Consensus        50 H~rFV~AV~qLG   61 (306)
                      -+|||+||++||
T Consensus       214 TERFVeAv~rL~  225 (237)
T PF09535_consen  214 TERFVEAVQRLG  225 (237)
T ss_pred             eHHHHHHHHhcC
Confidence            479999999999


No 26 
>KOG3841 consensus TEF-1 and related transcription factor, TEAD family [Transcription]
Probab=20.32  E-value=67  Score=33.18  Aligned_cols=56  Identities=20%  Similarity=0.265  Sum_probs=36.0

Q ss_pred             CCCcccCHHHHHHHHHHHHHhCCCCCCChhHHHH--------------hhCC---CCCCHHHHHHhhhhhhhccc
Q 021872           40 KPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMR--------------TMGV---KGLTLYHLKSHLQKYRLGKQ   97 (306)
Q Consensus        40 K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK~Il~--------------lM~v---~gLT~~hVkSHLQKyRl~~~   97 (306)
                      -.-=+|+++.-+.|.+|+...-  .--+-|-|+.              -+..   +-=|+.+|.||.|..-..+.
T Consensus        74 daegvWSpdIEqsFqEALaiyp--pcGrrKIilsdegkmyGRNELIarYIKlrtgktRTrKQVSSHIQVlarrk~  146 (455)
T KOG3841|consen   74 DAEGVWSPDIEQSFQEALAIYP--PCGRRKIILSDEGKMYGRNELIARYIKLRTGKTRTRKQVSSHIQVLARRKL  146 (455)
T ss_pred             ccccccChhHHHHHHHHHhhcC--CCCceeEEEccCccccchHHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHH
Confidence            3445899999999999997653  1123333332              2222   33478999999997654443


Done!