Query         029056
Match_columns 200
No_of_seqs    158 out of 461
Neff          4.9 
Searched_HMMs 46136
Date          Fri Mar 29 06:47:09 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029056.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029056hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0324 Uncharacterized conser 100.0 8.8E-57 1.9E-61  382.4  13.6  158   22-179    12-171 (214)
  2 PF05903 Peptidase_C97:  PPPDE  100.0 1.1E-47 2.3E-52  311.7  10.3  136   24-159     1-149 (151)
  3 PF04970 LRAT:  Lecithin retino  94.8   0.011 2.3E-07   46.0   0.8   35   94-130    85-119 (125)
  4 PF05608 DUF778:  Protein of un  93.3    0.23   5E-06   40.5   5.7   40   96-135    75-114 (136)
  5 PF09601 DUF2459:  Protein of u  72.0     4.7  0.0001   33.7   3.5   42   94-136    97-161 (173)
  6 KOG2673 Uncharacterized conser  70.2     4.1   9E-05   39.3   3.1   44  127-176   186-231 (485)
  7 PF13387 DUF4105:  Domain of un  59.6      13 0.00028   30.6   3.7   53   88-140    98-154 (176)
  8 PF04046 PSP:  PSP;  InterPro:   52.0      13 0.00028   25.3   2.1   26  126-151     5-32  (48)
  9 smart00581 PSP proline-rich do  46.7      18 0.00038   25.3   2.2   25  127-151    10-36  (54)
 10 PF11931 DUF3449:  Domain of un  43.8     7.7 0.00017   33.5   0.0   22  136-157    85-106 (196)
 11 PF08405 Calici_PP_N:  Viral po  42.8      35 0.00076   31.8   4.1   94   47-156    22-120 (358)
 12 TIGR02117 chp_urease_rgn conse  42.5      30 0.00064   29.9   3.4   40   93-133   126-189 (208)
 13 PF15643 Tox-PL-2:  Papain fold  40.4      21 0.00046   27.8   2.0   44   97-147     2-45  (100)
 14 KOG3150 Uncharacterized conser  38.5      59  0.0013   27.6   4.4   56   82-137    74-132 (182)
 15 PF04603 Mog1:  Ran-interacting  38.5      10 0.00022   30.9  -0.0   28  149-176     5-33  (141)
 16 cd02998 PDI_a_ERp38 PDIa famil  31.0      20 0.00042   25.3   0.4   45   99-143    35-80  (105)
 17 PF13964 Kelch_6:  Kelch motif   30.7      47   0.001   21.0   2.2   20   47-66      3-22  (50)
 18 PF07646 Kelch_2:  Kelch motif;  25.7      72  0.0016   20.3   2.4   17   48-64      4-20  (49)
 19 PF12419 DUF3670:  SNF2 Helicas  24.4      81  0.0018   25.1   2.9   27   82-108    71-97  (141)
 20 PF04412 DUF521:  Protein of un  24.0      74  0.0016   30.2   3.0   45   93-138   268-316 (400)
 21 cd01781 AF6_RA_repeat2 Ubiquit  22.1 2.2E+02  0.0048   22.1   4.8   52   70-122     2-55  (100)
 22 smart00460 TGc Transglutaminas  21.0      60  0.0013   21.4   1.4   23  121-143     7-29  (68)
 23 TIGR01514 NAPRTase nicotinate   21.0      43 0.00093   31.7   0.8   43   98-140    46-89  (394)
 24 PF01344 Kelch_1:  Kelch motif;  20.4      99  0.0021   18.9   2.2   19   48-66      4-22  (47)

No 1  
>KOG0324 consensus Uncharacterized conserved protein [Function unknown]
Probab=100.00  E-value=8.8e-57  Score=382.35  Aligned_cols=158  Identities=57%  Similarity=1.097  Sum_probs=152.1

Q ss_pred             CcceEEEEEeeCCCcchhhhhhcceeeeeeEEEeCeEEEeccccCCCCCeEEeCCCCCCCCceEEEEEeccccCCHHHHH
Q 029056           22 NHALLYLNVYDLTPLNNYLYWFGLGIFHSGIEAHGLEYGFGAHEYRTSGVFEVEPKSCPGFIFRRSVLLGTTNLSRAEFR  101 (200)
Q Consensus        22 ~~~~V~LnVYDLs~~n~~l~~lG~GI~HTgVeV~G~EY~FG~~~~~~sGI~~~~P~~~pg~~~r~si~LG~T~lt~~e~~  101 (200)
                      ...+|+|||||+++.|.|++|+|+|||||||||||+||+||+|+++.+|||+++|+.+|+++||++|.||.|++++++|+
T Consensus        12 ~~~~v~lnvyd~~~~n~y~~~lGlGIfHSgIeV~g~EyayG~h~~~~sGIfe~~P~~~~~f~fr~sI~lG~Td~~~~~v~   91 (214)
T KOG0324|consen   12 GRVPVYLNVYDLTPINKYLYWLGLGIFHSGIEVHGVEYAYGAHEYPSSGIFEVEPGNCPEFTFRKSILLGSTDLTEDDVR   91 (214)
T ss_pred             cceeeeeeeecceehhhhhhhhcceeEeeeEEEeceeeeccccccCCCCeEeeCCCCCCCCceeEEEEecCCCCCHHHHH
Confidence            67899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHhccCCCCCcccccccccchHHHHHHHHhcCCCCChHHHHHhhhh--ccccccCCccccccccccCCCCCCCCCCC
Q 029056          102 SFMEHLSGKYHGDTYHLIAKNCNHFTEEVCMQLTGKCIPGWVNRMARLG--SFCNCLLPESIQITAVAHLPDRPTYSVPR  179 (200)
Q Consensus       102 ~~l~~L~~~f~g~tY~LL~rNCNHFsd~l~~~L~gk~IP~wInrla~iG--~~~~cllp~~~~~~~v~~~~~~~~~~~~~  179 (200)
                      +||++|+++|+|++||||.|||||||+++|++|+||+||+||||||++|  .+++|++|.....+++.+.+...+..+++
T Consensus        92 ~~le~L~~ey~G~~YhL~~kNCNHFsn~la~~Ltgk~IP~winrLa~~~~~~~~~~~~p~~~~~t~~~~~~~~~~~~~~~  171 (214)
T KOG0324|consen   92 RILEELSEEYRGNSYHLLTKNCNHFSNELALQLTGKKIPSWVNRLARAGLCSLCNCLLPMLQNLTPVVLASSVVERFDEE  171 (214)
T ss_pred             HHHHHHHhhcCCceehhhhhccchhHHHHHHHHcCCCccHHHHHHHHHhhhhHHhhcchhhhcCCccccccccccccCcc
Confidence            9999999999999999999999999999999999999999999999999  88899999999999999888877766554


No 2  
>PF05903 Peptidase_C97:  PPPDE putative peptidase domain;  InterPro: IPR008580 This domain consists of the N-terminal portion of several eukaryotic sequences. The function of this domain is unknown.; PDB: 2WP7_A 3EBQ_A.
Probab=100.00  E-value=1.1e-47  Score=311.71  Aligned_cols=136  Identities=51%  Similarity=0.984  Sum_probs=105.1

Q ss_pred             ceEEEEEeeCCCcch------hhhhhcc---eeeeeeEEEeCeEEEeccccCCCCCeEEeCCCC-CCCCceEEEEEeccc
Q 029056           24 ALLYLNVYDLTPLNN------YLYWFGL---GIFHSGIEAHGLEYGFGAHEYRTSGVFEVEPKS-CPGFIFRRSVLLGTT   93 (200)
Q Consensus        24 ~~V~LnVYDLs~~n~------~l~~lG~---GI~HTgVeV~G~EY~FG~~~~~~sGI~~~~P~~-~pg~~~r~si~LG~T   93 (200)
                      .+|+||||||++.|.      ...++|.   |||||||||+|+||+||+++.+.+||++++|++ .+++.|++++.||+|
T Consensus         1 h~V~L~vYDL~~~~~~~~~~~~~~~lG~~~~Gi~HtgV~v~G~Ey~fg~~~~~~~GI~~~~P~~~~~~~~~~~~i~lG~T   80 (151)
T PF05903_consen    1 HPVYLNVYDLSPINNGMARQLSLMWLGLQIDGIYHTGVEVYGKEYAFGGHDDPDSGIFECPPGHTSPGGTPRESIELGET   80 (151)
T ss_dssp             -EEEEEEEETT---TTHHHHHHHHHCSS-----EEEEEEETTEEEEEET-----TECEEESTT-STT--S-SEEEEEEEE
T ss_pred             CeEEEEEEECccccchhHHHhhHhhhCCccCceEEEEEEEccEEEEecccccccCcceEccCcCCCCCcceEEEEeCCCc
Confidence            489999999996654      4568998   999999999999999999999889999999998 788899999999999


Q ss_pred             cCCHHHHHHHHHHhccCCCCCcccccccccchHHHHHHHHhcCCCCChHHHHHhhhhc---cccccCCc
Q 029056           94 NLSRAEFRSFMEHLSGKYHGDTYHLIAKNCNHFTEEVCMQLTGKCIPGWVNRMARLGS---FCNCLLPE  159 (200)
Q Consensus        94 ~lt~~e~~~~l~~L~~~f~g~tY~LL~rNCNHFsd~l~~~L~gk~IP~wInrla~iG~---~~~cllp~  159 (200)
                      .+++++|+++|++|+++|++++||||.|||||||++||++|+|++||+||||+|+++.   +..|++|.
T Consensus        81 ~~~~~~~~~~l~~l~~~~~~~~Y~Ll~~NCNhFs~~l~~~L~g~~iP~~i~~~a~~~~~~p~~~~l~p~  149 (151)
T PF05903_consen   81 TLSEEEFEEILRSLSREFTGDSYHLLNRNCNHFSDALCQFLTGKPIPSWINRLARIALSSPFGQMLLPM  149 (151)
T ss_dssp             ---HHHHHHHHHHHHTT-SGGG-BTTTBSHHHHHHHHHHHHHS----HHHHTHHHHHHTSHHHHHHCC-
T ss_pred             cCCHHHHHHHHHHHHhhccCCcchhhhhhhhHHHHHHHHHhCCCCCCHHHHhhhHHhcccchhhhhCcC
Confidence            9999999999999999999999999999999999999999999999999999999887   56788875


No 3  
>PF04970 LRAT:  Lecithin retinol acyltransferase;  InterPro: IPR007053 This entry represents a conserved sequence region found in proteins from viruses, bacteria and eukaryotes. It contains a well-conserved NCEHF motif, though its function in these proteins is unknown.; PDB: 2KYT_A 4DOT_A 4FA0_A.
Probab=94.84  E-value=0.011  Score=45.98  Aligned_cols=35  Identities=29%  Similarity=0.408  Sum_probs=17.5

Q ss_pred             cCCHHHHHHHHHHhccCCCCCcccccccccchHHHHH
Q 029056           94 NLSRAEFRSFMEHLSGKYHGDTYHLIAKNCNHFTEEV  130 (200)
Q Consensus        94 ~lt~~e~~~~l~~L~~~f~g~tY~LL~rNCNHFsd~l  130 (200)
                      ..+.+++.+-.+++-.+  ...|||+.+||=||+.-.
T Consensus        85 ~~~~~~iv~rA~~~lg~--~~~Y~l~~nNCEhFa~~c  119 (125)
T PF04970_consen   85 PFPPEEIVERAESRLGK--EFEYNLLFNNCEHFATWC  119 (125)
T ss_dssp             -S-HHHHHHHHHHTTT---EESS---HHHHHHHHHHH
T ss_pred             CCCHHHHHHHHHHHHcC--CCccCCCcCCHHHHHHHH
Confidence            44445544433333221  338999999999998754


No 4  
>PF05608 DUF778:  Protein of unknown function (DUF778);  InterPro: IPR008496 This family consists of several eukaryotic proteins of unknown function.
Probab=93.28  E-value=0.23  Score=40.45  Aligned_cols=40  Identities=18%  Similarity=0.440  Sum_probs=34.4

Q ss_pred             CHHHHHHHHHHhccCCCCCcccccccccchHHHHHHHHhc
Q 029056           96 SRAEFRSFMEHLSGKYHGDTYHLIAKNCNHFTEEVCMQLT  135 (200)
Q Consensus        96 t~~e~~~~l~~L~~~f~g~tY~LL~rNCNHFsd~l~~~L~  135 (200)
                      +.+..++.|++-.++|....|||+..||.+|...+...|.
T Consensus        75 ~~~~wD~Av~~a~~~y~~r~yNlf~~NCHSfVA~aLN~m~  114 (136)
T PF05608_consen   75 GAESWDDAVQKASEEYKHRMYNLFTDNCHSFVANALNRMR  114 (136)
T ss_pred             cHHHHHHHHHHHHHHHhhCceeeeccCcHHHHHHHHHhcc
Confidence            5667777888888999999999999999999988877764


No 5  
>PF09601 DUF2459:  Protein of unknown function (DUF2459);  InterPro: IPR011727 This conserved hypothetical protein of unknown function is predominantly found in proteobacteria. Its function is unknown and its genome context is not well-conserved. It is found amid urease genes in at least one species.
Probab=71.98  E-value=4.7  Score=33.75  Aligned_cols=42  Identities=24%  Similarity=0.499  Sum_probs=31.7

Q ss_pred             cCCHHHHHHHHHHhccCCCCC-----------------------cccccccccchHHHHHHHHhcC
Q 029056           94 NLSRAEFRSFMEHLSGKYHGD-----------------------TYHLIAKNCNHFTEEVCMQLTG  136 (200)
Q Consensus        94 ~lt~~e~~~~l~~L~~~f~g~-----------------------tY~LL~rNCNHFsd~l~~~L~g  136 (200)
                      .+++++++++++.++..|.-+                       +|||+ ++||+-+++..+...-
T Consensus        97 ~ls~~~y~~L~~~I~~sf~~~~~g~~~~i~~~y~~~d~Fy~A~G~Y~l~-~TCNtWta~~L~aaG~  161 (173)
T PF09601_consen   97 RLSEAQYRRLVAFIRASFQRDADGRPIPIGPGYGPDDAFYEAKGRYSLF-NTCNTWTARALKAAGL  161 (173)
T ss_pred             EcCHHHHHHHHHHHHHHhccCCCCCeEEeccccCCCCeeEeeccceEee-cCcHHHHHHHHHHcCC
Confidence            679999999998887655543                       36654 7999999998776543


No 6  
>KOG2673 consensus Uncharacterized conserved protein, contains PSP domain [Function unknown]
Probab=70.21  E-value=4.1  Score=39.34  Aligned_cols=44  Identities=23%  Similarity=0.329  Sum_probs=34.5

Q ss_pred             HHHHHHHh--cCCCCChHHHHHhhhhccccccCCccccccccccCCCCCCCC
Q 029056          127 TEEVCMQL--TGKCIPGWVNRMARLGSFCNCLLPESIQITAVAHLPDRPTYS  176 (200)
Q Consensus       127 sd~l~~~L--~gk~IP~wInrla~iG~~~~cllp~~~~~~~v~~~~~~~~~~  176 (200)
                      +..+..+|  .+..||.|++|+-++|      .|+.+...+.++++...-|+
T Consensus       186 S~~~R~al~l~~~d~P~~~yRMR~lG------YPPg~L~~s~~e~s~i~if~  231 (485)
T KOG2673|consen  186 SGNTRSALGLSPGDPPEWKYRMRRLG------YPPGYLRKSDMEVSGIKIFS  231 (485)
T ss_pred             chhHHHhhcCCCCCchHHHHHHhhcc------CCchhhhhhhcccccceeec
Confidence            34444444  4689999999999999      78887778889988887777


No 7  
>PF13387 DUF4105:  Domain of unknown function (DUF4105)
Probab=59.57  E-value=13  Score=30.57  Aligned_cols=53  Identities=21%  Similarity=0.274  Sum_probs=40.4

Q ss_pred             EEeccccCCHHHHHHHHHHhcc----CCCCCcccccccccchHHHHHHHHhcCCCCC
Q 029056           88 VLLGTTNLSRAEFRSFMEHLSG----KYHGDTYHLIAKNCNHFTEEVCMQLTGKCIP  140 (200)
Q Consensus        88 i~LG~T~lt~~e~~~~l~~L~~----~f~g~tY~LL~rNCNHFsd~l~~~L~gk~IP  140 (200)
                      |..=...++++|.+.++..|.+    .-++-.||.+..||=.-.-++.....++.+|
T Consensus        98 v~~y~LnLs~ee~~~l~~~l~e~~~~~~~~~~Y~f~~~NCat~i~~~l~~~~~~~l~  154 (176)
T PF13387_consen   98 VWEYPLNLSPEEKQRLFRHLWENANPENRPYRYNFFTDNCATRIRDLLDKARPGSLP  154 (176)
T ss_pred             EEEEEeeCCHHHHHHHHHHHHHhccccccceeehhhhcchHHHHHHHHHHHcCCCee
Confidence            3333458899999999887743    2355699999999999998888888876444


No 8  
>PF04046 PSP:  PSP;  InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins.
Probab=51.99  E-value=13  Score=25.28  Aligned_cols=26  Identities=35%  Similarity=0.599  Sum_probs=21.2

Q ss_pred             HHHHHHHHhc--CCCCChHHHHHhhhhc
Q 029056          126 FTEEVCMQLT--GKCIPGWVNRMARLGS  151 (200)
Q Consensus       126 Fsd~l~~~L~--gk~IP~wInrla~iG~  151 (200)
                      .|+++.++|.  ...+|.||.++.++|-
T Consensus         5 lS~~LR~ALg~~~~~~PPwl~~M~~~G~   32 (48)
T PF04046_consen    5 LSDELREALGMQENDPPPWLYRMRRLGY   32 (48)
T ss_pred             cCHHHHHHcCCCCCCCChHHHHHHhcCC
Confidence            4677888875  4679999999999983


No 9  
>smart00581 PSP proline-rich domain in spliceosome associated proteins.
Probab=46.66  E-value=18  Score=25.26  Aligned_cols=25  Identities=36%  Similarity=0.571  Sum_probs=20.7

Q ss_pred             HHHHHHHhc--CCCCChHHHHHhhhhc
Q 029056          127 TEEVCMQLT--GKCIPGWVNRMARLGS  151 (200)
Q Consensus       127 sd~l~~~L~--gk~IP~wInrla~iG~  151 (200)
                      |+++.++|.  ...+|.|+.++.++|-
T Consensus        10 S~~LR~ALG~~~~~pPPWl~~Mq~~G~   36 (54)
T smart00581       10 SDELREALGLPPGQPPPWLYRMRRLGY   36 (54)
T ss_pred             CHHHHHHcCCCCCCCChHHHHHHHHCC
Confidence            678888886  3579999999999984


No 10 
>PF11931 DUF3449:  Domain of unknown function (DUF3449);  InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=43.81  E-value=7.7  Score=33.49  Aligned_cols=22  Identities=36%  Similarity=0.759  Sum_probs=0.0

Q ss_pred             CCCCChHHHHHhhhhccccccC
Q 029056          136 GKCIPGWVNRMARLGSFCNCLL  157 (200)
Q Consensus       136 gk~IP~wInrla~iG~~~~cll  157 (200)
                      |||||-|+-.|.-++.-+.|=|
T Consensus        85 GkPIPyWLYKLhGL~~ey~CEI  106 (196)
T PF11931_consen   85 GKPIPYWLYKLHGLGVEYKCEI  106 (196)
T ss_dssp             ----------------------
T ss_pred             CCcccHHHHHHhCCCCeeeeEe
Confidence            7999999999999988777754


No 11 
>PF08405 Calici_PP_N:  Viral polyprotein N-terminal;  InterPro: IPR013614 This domain is found at the N terminus of non-structural viral polyproteins of the Caliciviridae subfamily. ; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0017111 nucleoside-triphosphatase activity, 0044419 interspecies interaction between organisms
Probab=42.75  E-value=35  Score=31.82  Aligned_cols=94  Identities=28%  Similarity=0.433  Sum_probs=53.0

Q ss_pred             eeeeeEEEeCeEEEeccccCC---CCCeEEeCCCCCCCCceEEEEEeccccCCHHHHHHHHHHhccCCCCCccccccccc
Q 029056           47 IFHSGIEAHGLEYGFGAHEYR---TSGVFEVEPKSCPGFIFRRSVLLGTTNLSRAEFRSFMEHLSGKYHGDTYHLIAKNC  123 (200)
Q Consensus        47 I~HTgVeV~G~EY~FG~~~~~---~sGI~~~~P~~~pg~~~r~si~LG~T~lt~~e~~~~l~~L~~~f~g~tY~LL~rNC  123 (200)
                      +||-||-|. +-+.+|-|.-+   +.|-+..+|=+.+    =+.+..-+-.++.+++++.+   .++|   -|..+.+||
T Consensus        22 ~yHYaIYi~-~G~~lgvh~p~aai~~a~i~l~~ls~~----WRvvy~P~~~~~~~~L~~l~---ge~~---PY~a~~nNC   90 (358)
T PF08405_consen   22 IYHYAIYIG-KGLVLGVHSPGAAISIATIELEPLSLW----WRVVYTPRQRLSYDQLRKLE---GEKF---PYAAFTNNC   90 (358)
T ss_pred             eEEEEEEec-CCeEEeecCcchhceeeeEEEeecccc----cccccCCCCCCCHHHHHHhc---CCCC---Cchhhccch
Confidence            999999864 45677766432   2344455554433    11222222355666644432   3455   489999999


Q ss_pred             chHHHHHHHHhcCCCCChHHHHH--hhhhcccccc
Q 029056          124 NHFTEEVCMQLTGKCIPGWVNRM--ARLGSFCNCL  156 (200)
Q Consensus       124 NHFsd~l~~~L~gk~IP~wInrl--a~iG~~~~cl  156 (200)
                      =|||   |+-|.-+.  .|++|-  ++-|.|..+.
T Consensus        91 y~fc---c~vl~L~D--~wl~rr~v~r~~~F~~P~  120 (358)
T PF08405_consen   91 YTFC---CWVLGLND--SWLKRRLVSRTGPFYHPY  120 (358)
T ss_pred             HHHh---HhhcCCCh--HHHhhccccccCCCCCCC
Confidence            9999   66553221  366653  3456655543


No 12 
>TIGR02117 chp_urease_rgn conserved hypothetical protein. This conserved hypothetical protein of unknown function is found in several Proteobacteria. Its function is unknown and its genome context is not well-conserved. It is found amid urease genes in at least one species.
Probab=42.46  E-value=30  Score=29.88  Aligned_cols=40  Identities=18%  Similarity=0.445  Sum_probs=27.4

Q ss_pred             ccCCHHHHHHHHHHhccCCCC------------------------CcccccccccchHHHHHHHH
Q 029056           93 TNLSRAEFRSFMEHLSGKYHG------------------------DTYHLIAKNCNHFTEEVCMQ  133 (200)
Q Consensus        93 T~lt~~e~~~~l~~L~~~f~g------------------------~tY~LL~rNCNHFsd~l~~~  133 (200)
                      ..+++++++++++.+++.|.-                        ..|||+ ++||+-+.+..+.
T Consensus       126 l~vs~~qy~~L~~~I~~sf~~~~~g~~~~l~~~~yg~~d~Fy~A~G~Y~l~-~TCNtWta~aL~a  189 (208)
T TIGR02117       126 LLVSENQYNRLMDFISASFVRDAEGRVIPLPGGIYGDSDAFYAANGRYNAL-NTCNTWTAAALRS  189 (208)
T ss_pred             EEcCHHHHHHHHHHHHHhcCcCCCCCceecCCCCCCCCceeEeeeeeEEee-ccchHHHHHHHHH
Confidence            378899999888766532221                        125654 7999999987664


No 13 
>PF15643 Tox-PL-2:  Papain fold toxin 2
Probab=40.45  E-value=21  Score=27.80  Aligned_cols=44  Identities=20%  Similarity=0.344  Sum_probs=37.3

Q ss_pred             HHHHHHHHHHhccCCCCCcccccccccchHHHHHHHHhcCCCCChHHHHHh
Q 029056           97 RAEFRSFMEHLSGKYHGDTYHLIAKNCNHFTEEVCMQLTGKCIPGWVNRMA  147 (200)
Q Consensus        97 ~~e~~~~l~~L~~~f~g~tY~LL~rNCNHFsd~l~~~L~gk~IP~wInrla  147 (200)
                      +.+++..|..++.+|.       ---|--|++++.++|..++||..+-+|-
T Consensus         2 ~~~~~Q~I~~I~~~f~-------~~qC~~cA~Al~~~L~~~gI~Gk~i~l~   45 (100)
T PF15643_consen    2 PAEVRQQIGKIASRFK-------IFQCVECASALKQFLKQAGIPGKIIRLY   45 (100)
T ss_pred             HHHHHHHHHHhhcccC-------ceehHHHHHHHHHHHHHCCCCceEEEEE
Confidence            5678888888888874       3469999999999999999999887774


No 14 
>KOG3150 consensus Uncharacterized conserved protein [Function unknown]
Probab=38.49  E-value=59  Score=27.64  Aligned_cols=56  Identities=13%  Similarity=0.174  Sum_probs=40.9

Q ss_pred             CceEEEEEeccccC---CHHHHHHHHHHhccCCCCCcccccccccchHHHHHHHHhcCC
Q 029056           82 FIFRRSVLLGTTNL---SRAEFRSFMEHLSGKYHGDTYHLIAKNCNHFTEEVCMQLTGK  137 (200)
Q Consensus        82 ~~~r~si~LG~T~l---t~~e~~~~l~~L~~~f~g~tY~LL~rNCNHFsd~l~~~L~gk  137 (200)
                      +.+-+-+.|+.+..   .+...++.+++-+++|...+|||+--||..|-.-...+|.=+
T Consensus        74 G~paRY~ql~p~~~~~~g~~~wD~Av~~as~~y~hr~hNi~cdNCHShVA~aLn~mry~  132 (182)
T KOG3150|consen   74 GPPARYIQLDPEKVCGPGARTWDNAVSKASREYKHRTHNIFCDNCHSHVANALNRMRYG  132 (182)
T ss_pred             CCcceeEEeChhheeCCCCchHHHHHHHHHHHhhhcccceeeccHHHHHHHHHHHhhcC
Confidence            33444455554432   456777888988999999999999999999987766666533


No 15 
>PF04603 Mog1:  Ran-interacting Mog1 protein;  InterPro: IPR007681 Segregation of nuclear and cytoplasmic processes facilitates regulation of many eukaryotic cellular functions such as gene expression and cell cycle progression. Trafficking through the nuclear pore requires a number of highly conserved soluble factors that escort macromolecular substrates into and out of the nucleus. The Mog1 protein has been shown to interact with RanGTP, which stimulates guanine nucleotide release, suggesting Mog1 regulates the nuclear transport functions of Ran []. The human homologue of Mog1 is thought to be alternatively spliced.; PDB: 1JHS_A 1EQ6_A.
Probab=38.45  E-value=10  Score=30.88  Aligned_cols=28  Identities=21%  Similarity=0.455  Sum_probs=19.0

Q ss_pred             hhccccccCCcccc-ccccccCCCCCCCC
Q 029056          149 LGSFCNCLLPESIQ-ITAVAHLPDRPTYS  176 (200)
Q Consensus       149 iG~~~~cllp~~~~-~~~v~~~~~~~~~~  176 (200)
                      .|-++.|.||..++ ++.+|+.||.+|.=
T Consensus         5 yGGAi~~~lP~~f~DvS~~RqVPD~QEVf   33 (141)
T PF04603_consen    5 YGGAITCDLPSGFIDVSDFRQVPDNQEVF   33 (141)
T ss_dssp             GGGTEEEEE-TTEEEGGGTS---TTEEEE
T ss_pred             cCceEEEeCCCCccchhhceeCCCCCEEe
Confidence            46778899998854 89999999998753


No 16 
>cd02998 PDI_a_ERp38 PDIa family, endoplasmic reticulum protein 38 (ERp38) subfamily; composed of proteins similar to the P5-like protein first isolated from alfalfa, which contains two redox active TRX (a) domains at the N-terminus, like human P5, and a C-terminal domain with homology to the C-terminal domain of ERp29, unlike human P5. The cDNA clone of this protein (named G1) was isolated from an alfalfa cDNA library by screening with human protein disulfide isomerase (PDI) cDNA. The G1 protein is constitutively expressed in all major organs of the plant and its expression is induced by treatment with tunicamycin, indicating that it may be a glucose-regulated protein. The G1 homolog in the eukaryotic social amoeba Dictyostelium discoideum is also described as a P5-like protein, which is located in the endoplasmic reticulum (ER) despite the absence of an ER-retrieval signal. G1 homologs from Aspergillus niger and Neurospora crassa have also been characterized, and are named TIGA and ER
Probab=31.00  E-value=20  Score=25.28  Aligned_cols=45  Identities=9%  Similarity=0.152  Sum_probs=33.6

Q ss_pred             HHHHHHHHhccCCC-CCcccccccccchHHHHHHHHhcCCCCChHH
Q 029056           99 EFRSFMEHLSGKYH-GDTYHLIAKNCNHFTEEVCMQLTGKCIPGWV  143 (200)
Q Consensus        99 e~~~~l~~L~~~f~-g~tY~LL~rNCNHFsd~l~~~L~gk~IP~wI  143 (200)
                      ++...++.+++.+. ...+.++.=||+....++|+.+....+|..+
T Consensus        35 ~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~~~~i~~~P~~~   80 (105)
T cd02998          35 NLAPEYEKLAAVFANEDDVVIAKVDADEANKDLAKKYGVSGFPTLK   80 (105)
T ss_pred             hhChHHHHHHHHhCCCCCEEEEEEECCCcchhhHHhCCCCCcCEEE
Confidence            44556666666665 3467888899999767889999888888854


No 17 
>PF13964 Kelch_6:  Kelch motif
Probab=30.71  E-value=47  Score=21.04  Aligned_cols=20  Identities=30%  Similarity=0.371  Sum_probs=16.3

Q ss_pred             eeeeeEEEeCeEEEeccccC
Q 029056           47 IFHSGIEAHGLEYGFGAHEY   66 (200)
Q Consensus        47 I~HTgVeV~G~EY~FG~~~~   66 (200)
                      ..|+.+.+.|+-|.|||...
T Consensus         3 ~~~s~v~~~~~iyv~GG~~~   22 (50)
T PF13964_consen    3 YGHSAVVVGGKIYVFGGYDN   22 (50)
T ss_pred             ccCEEEEECCEEEEECCCCC
Confidence            36888999999999998643


No 18 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=25.71  E-value=72  Score=20.25  Aligned_cols=17  Identities=35%  Similarity=0.434  Sum_probs=15.1

Q ss_pred             eeeeEEEeCeEEEeccc
Q 029056           48 FHSGIEAHGLEYGFGAH   64 (200)
Q Consensus        48 ~HTgVeV~G~EY~FG~~   64 (200)
                      -|+.+++.++-|-|||.
T Consensus         4 ~hs~~~~~~kiyv~GG~   20 (49)
T PF07646_consen    4 GHSAVVLDGKIYVFGGY   20 (49)
T ss_pred             ceEEEEECCEEEEECCc
Confidence            38889999999999986


No 19 
>PF12419 DUF3670:  SNF2 Helicase protein ;  InterPro: IPR022138  This domain family is found in bacteria, archaea and eukaryotes, and is approximately 140 amino acids in length. The family is found in association with PF00271 from PFAM, PF00176 from PFAM. Most of the proteins in this family are annotated as SNF2 helicases but there is little accompanying literature to confirm this. 
Probab=24.38  E-value=81  Score=25.13  Aligned_cols=27  Identities=30%  Similarity=0.487  Sum_probs=23.7

Q ss_pred             CceEEEEEeccccCCHHHHHHHHHHhc
Q 029056           82 FIFRRSVLLGTTNLSRAEFRSFMEHLS  108 (200)
Q Consensus        82 ~~~r~si~LG~T~lt~~e~~~~l~~L~  108 (200)
                      ..|+=++.||...+|++||+++++.=+
T Consensus        71 ~~f~W~lalGd~~Ls~eEf~~L~~~~~   97 (141)
T PF12419_consen   71 LDFDWELALGDEELSEEEFEQLVEQKR   97 (141)
T ss_pred             hcceEEEEECCEECCHHHHHHHHHcCC
Confidence            568889999999999999999997544


No 20 
>PF04412 DUF521:  Protein of unknown function (DUF521);  InterPro: IPR007506 This is a group of hypothetical proteins.
Probab=24.03  E-value=74  Score=30.20  Aligned_cols=45  Identities=22%  Similarity=0.456  Sum_probs=36.8

Q ss_pred             ccCCHHHHHHHHHHhccCCCCCcccccccccchHH----HHHHHHhcCCC
Q 029056           93 TNLSRAEFRSFMEHLSGKYHGDTYHLIAKNCNHFT----EEVCMQLTGKC  138 (200)
Q Consensus        93 T~lt~~e~~~~l~~L~~~f~g~tY~LL~rNCNHFs----d~l~~~L~gk~  138 (200)
                      ..++.+++++..+++. .-..+.-|++.=-|+|||    .++++.|-||+
T Consensus       268 i~i~~~dl~~~~~~l~-~~~~~~~D~V~lGcPH~S~~El~~ia~ll~gr~  316 (400)
T PF04412_consen  268 ITITDADLEEVYEELN-TAGDEKVDLVALGCPHLSLEELREIAELLEGRK  316 (400)
T ss_pred             EEeCHHHHHHHHHHhc-cCCCCCCCEEEECCCCCCHHHHHHHHHHHhCCC
Confidence            3668899999999987 666778899999999999    55677777765


No 21 
>cd01781 AF6_RA_repeat2 Ubiquitin domain of AT-6, second repeat. The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
Probab=22.06  E-value=2.2e+02  Score=22.07  Aligned_cols=52  Identities=19%  Similarity=0.306  Sum_probs=37.0

Q ss_pred             CeEEeCCCCC-CCCceEEEEEeccccCCHHHHHHHHHHhc-cCCCCCcccccccc
Q 029056           70 GVFEVEPKSC-PGFIFRRSVLLGTTNLSRAEFRSFMEHLS-GKYHGDTYHLIAKN  122 (200)
Q Consensus        70 GI~~~~P~~~-pg~~~r~si~LG~T~lt~~e~~~~l~~L~-~~f~g~tY~LL~rN  122 (200)
                      |+..+-|+.. |+..| ++|.+-..+.+++-+.+.|++.. ++-.++.|.|..-+
T Consensus         2 G~LKIYg~~L~~~~~Y-KSIlvt~~~~a~~vV~eALeKygL~~e~p~~Y~LveV~   55 (100)
T cd01781           2 GTLKIYGGSLVPTRPY-KTILLSINDNADRIVGEALEKYGLEKSDPDDYCLVEVS   55 (100)
T ss_pred             CeEEEcCCcccCCCCe-EEEEecCCccHHHHHHHHHHHhCCCccCccceEEEEEe
Confidence            4455555543 44444 79999999999999999998775 55667788776543


No 22 
>smart00460 TGc Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Probab=21.01  E-value=60  Score=21.41  Aligned_cols=23  Identities=26%  Similarity=0.411  Sum_probs=20.2

Q ss_pred             cccchHHHHHHHHhcCCCCChHH
Q 029056          121 KNCNHFTEEVCMQLTGKCIPGWV  143 (200)
Q Consensus       121 rNCNHFsd~l~~~L~gk~IP~wI  143 (200)
                      -+|.+|+..++..|-..+||.-+
T Consensus         7 G~C~~~a~l~~~llr~~GIpar~   29 (68)
T smart00460        7 GTCGEFAALFVALLRSLGIPARV   29 (68)
T ss_pred             eeeHHHHHHHHHHHHHCCCCeEE
Confidence            47999999999999999999844


No 23 
>TIGR01514 NAPRTase nicotinate phosphoribosyltransferase. This model represents nicotinate phosphoribosyltransferase, the first enzyme in the salvage pathway of NAD biosynthesis from nicontinate (niacin). Members are primary proteobacterial but also include yeasts and Methanosarcina acetivorans. A related family, apparently non-overlapping in species distribution, is TIGR01513. Members of that family differ in substantially in sequence and have a long C-terminal extension missing from this family, but are proposed also to act as nicotinate phosphoribosyltransferase (see model TIGR01513).
Probab=20.99  E-value=43  Score=31.73  Aligned_cols=43  Identities=9%  Similarity=0.145  Sum_probs=38.2

Q ss_pred             HHHHHHHHHhcc-CCCCCcccccccccchHHHHHHHHhcCCCCC
Q 029056           98 AEFRSFMEHLSG-KYHGDTYHLIAKNCNHFTEEVCMQLTGKCIP  140 (200)
Q Consensus        98 ~e~~~~l~~L~~-~f~g~tY~LL~rNCNHFsd~l~~~L~gk~IP  140 (200)
                      .++++.|+.|.. .|+.+...-|++||+.|.+++..+|-+-+++
T Consensus        46 ~gl~~~i~~l~~l~ft~eel~yL~~~~~~~~~~fl~~L~~frf~   89 (394)
T TIGR01514        46 EALREEISALGNLRFTDDEIEYLKQELPYLKSDYIDYLRNFRFH   89 (394)
T ss_pred             HHHHHHHHHHHHCCCCHHHHHHHHhccCCCCHHHHHHHHhCCCC
Confidence            688888988875 8999999999999999999999999876654


No 24 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=20.35  E-value=99  Score=18.93  Aligned_cols=19  Identities=16%  Similarity=0.125  Sum_probs=15.8

Q ss_pred             eeeeEEEeCeEEEeccccC
Q 029056           48 FHSGIEAHGLEYGFGAHEY   66 (200)
Q Consensus        48 ~HTgVeV~G~EY~FG~~~~   66 (200)
                      .|+.+.+.++=|-+||...
T Consensus         4 ~~~~~~~~~~iyv~GG~~~   22 (47)
T PF01344_consen    4 GHAAVVVGNKIYVIGGYDG   22 (47)
T ss_dssp             SEEEEEETTEEEEEEEBES
T ss_pred             cCEEEEECCEEEEEeeecc
Confidence            5788888999999998654


Done!