Query         psy6915
Match_columns 127
No_of_seqs    115 out of 427
Neff          6.5 
Searched_HMMs 46136
Date          Fri Aug 16 21:41:44 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6915.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6915hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2435|consensus              100.0 3.7E-32   8E-37  213.3  10.3  119    2-120   201-319 (323)
  2 PF08547 CIA30:  Complex I inte 100.0 4.7E-29   1E-33  182.9   9.6   91    1-95     67-157 (157)
  3 PF03425 CBM_11:  Carbohydrate   99.0 1.6E-09 3.4E-14   80.9   8.4   92    1-102    72-169 (178)
  4 PF04620 FlaA:  Flagellar filam  94.3       1 2.2E-05   35.1  10.6  100    4-107   109-213 (217)
  5 COG3506 Uncharacterized conser  78.1      28  0.0006   26.5   9.4   32   69-100   149-180 (189)
  6 PF14330 DUF4387:  Domain of un  54.2     2.6 5.7E-05   29.1  -1.0   17   82-98      7-23  (99)
  7 smart00210 TSPN Thrombospondin  54.1      85  0.0019   23.0   9.1   61   42-104   115-184 (184)
  8 PHA01795 hypothetical protein   52.5 1.2E+02  0.0026   24.6   8.1   77   14-101   124-214 (280)
  9 PF13598 DUF4139:  Domain of un  52.5      22 0.00047   28.2   3.8   47   14-63     14-60  (317)
 10 COG0835 CheW Chemotaxis signal  41.8   1E+02  0.0022   22.5   5.8   49   72-127    14-62  (165)
 11 COG5007 Predicted transcriptio  41.5      31 0.00067   22.9   2.6   20    6-25     18-37  (80)
 12 KOG3186|consensus               40.3      14 0.00031   28.2   1.0   73   43-118    41-116 (202)
 13 PF03295 Pox_TAA1:  Poxvirus tr  37.3     4.2 9.1E-05   25.7  -1.9   32   95-126     3-34  (63)
 14 cd07374 CYTH-like_Pase CYTH-li  37.1      36 0.00079   24.5   2.7   23    3-25     49-71  (174)
 15 PF14508 GH97_N:  Glycosyl-hydr  35.9      83  0.0018   24.7   4.8   36    2-42     79-115 (259)
 16 PF09496 CENP-O:  Cenp-O kineto  35.2      87  0.0019   20.8   4.1   23    4-26      2-28  (90)
 17 COG3025 Uncharacterized conser  31.4      51  0.0011   28.4   3.0   24    3-26     51-74  (432)
 18 PF01562 Pep_M12B_propep:  Repr  31.3 1.7E+02  0.0037   19.9   8.0   59    3-66     29-87  (131)
 19 PF13049 DUF3910:  Protein of u  31.3 1.5E+02  0.0032   19.8   4.6   49    2-55     24-76  (93)
 20 TIGR02231 conserved hypothetic  30.5      49  0.0011   28.5   2.8   46   15-63    210-255 (525)
 21 COG0335 RplS Ribosomal protein  28.4      60  0.0013   23.0   2.5   20    1-20     43-64  (115)
 22 PF03944 Endotoxin_C:  delta en  28.0 1.3E+02  0.0028   21.3   4.3   91    6-102    41-141 (143)
 23 smart00712 PUR DNA/RNA-binding  27.3 1.6E+02  0.0035   18.3   4.7   43    9-57      7-49  (63)
 24 cd05163 TRRAP TRansformation/t  27.1      74  0.0016   25.0   3.1   19    6-24     20-39  (253)
 25 PF10636 hemP:  Hemin uptake pr  27.0 1.3E+02  0.0028   17.1   3.3   19    6-24     14-32  (38)
 26 PF09137 Glucodextran_N:  Gluco  26.4      95  0.0021   25.0   3.6   18   69-86     44-61  (269)
 27 cd08694 C2_Dock-A C2 domains f  24.9 3.2E+02   0.007   21.0   6.2   86   11-100    34-135 (196)
 28 cd07756 CYTH-like_Pase_CHAD Un  24.4      84  0.0018   23.7   2.9   23    3-25     50-72  (197)
 29 cd05171 PIKKc_ATM Ataxia telan  24.4      83  0.0018   25.0   3.0   21    6-26     19-40  (279)
 30 cd07762 CYTH-like_Pase_1 Uncha  22.6      98  0.0021   22.9   2.9   22    4-25     45-66  (180)
 31 KOG3658|consensus               22.5 2.1E+02  0.0046   26.3   5.4   54    2-67     62-115 (764)
 32 PF13385 Laminin_G_3:  Concanav  21.0 2.5E+02  0.0055   18.3   7.6  102    3-110    50-156 (157)
 33 cd05170 PIKKc_SMG1 Suppressor   20.9 1.1E+02  0.0024   24.7   3.1   22    5-26     18-40  (307)

No 1  
>KOG2435|consensus
Probab=99.97  E-value=3.7e-32  Score=213.28  Aligned_cols=119  Identities=60%  Similarity=1.094  Sum_probs=116.2

Q ss_pred             cceEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecCCCCCCCccCccEEEE
Q psy6915           2 YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGL   81 (127)
Q Consensus         2 y~gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~~~~ld~~~I~~iG~   81 (127)
                      |+.|.||||||||.|.++|+++..++..|+++|+|.+.|++||-||.++|||+.|.+|++||+++.+-.|++.+|.+|||
T Consensus       201 fn~L~LrvRGDGRsy~inihte~~~dq~wndsys~flft~gGp~wq~~KIPfSKff~t~kGriqDrq~e~nl~~vssig~  280 (323)
T KOG2435|consen  201 FNTLYLRVRGDGRSYMINIHTETDFDQRWNDSYSYFLFTRGGPYWQEVKIPFSKFFFTNKGRIQDRQHELNLDKVSSIGF  280 (323)
T ss_pred             cceEEEEEecCCceEEEEecCccchhhhcccceeeEEecCCCCceeEEecchhhheeccccceeecccccCccceeeEeE
Confidence            78999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EeecCCCeeEEEEEcEEEEEeCCCchhhhhhhhccccce
Q psy6915          82 SLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENE  120 (127)
Q Consensus        82 ~i~~kq~G~F~LeI~~I~~~~~~~~~~~~~y~~~~~~~~  120 (127)
                      .++++..|||.||||.|+++.+++|+|+|+||+|..+++
T Consensus       281 sl~dk~dGpF~LEIDfIGv~~d~~H~EdFayE~y~~p~~  319 (323)
T KOG2435|consen  281 SLADKVDGPFFLEIDFIGVFTDPAHTEDFAYENYPEPNP  319 (323)
T ss_pred             EEeeccCCcceeeEEEEEEecCCCcccceeeeccccccc
Confidence            999999999999999999999999999999999998875


No 2  
>PF08547 CIA30:  Complex I intermediate-associated protein 30 (CIA30);  InterPro: IPR013857  Mitochondrial complex I intermediate-associated protein 30 (CIA30) is present in human and mouse, and also in Schizosaccharomyces pombe (Fission yeast) which does not contain the NADH dehydrogenase component of complex I, or many of the other essential subunits. This means it is possible that it is not directly involved in oxidative phosphorylation [, ]. 
Probab=99.96  E-value=4.7e-29  Score=182.95  Aligned_cols=91  Identities=43%  Similarity=0.748  Sum_probs=85.6

Q ss_pred             CcceEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecCCCCCCCccCccEEE
Q psy6915           1 MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIG   80 (127)
Q Consensus         1 ~y~gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~~~~ld~~~I~~iG   80 (127)
                      .|+||+|+||||||+|+++|+++....   ..+|++.|.|+++ +|++|+|||++|+|++||++++++++||+++|++||
T Consensus        67 ~y~~l~l~vrgdGr~Y~~~l~~~~~~~---~~~y~~~f~t~~~-~w~~v~iPFs~F~~~~rG~~v~~~~~l~~~~I~~ig  142 (157)
T PF08547_consen   67 GYDGLELRVRGDGRTYKVNLRTDNDEP---SDSYQARFQTPPG-EWQTVRIPFSDFVPTFRGRPVDDAPPLDPSRIRSIG  142 (157)
T ss_pred             CCcEEEEEEEcCCceEEEEEEeCCCCC---CceEEEEEeccCC-ccEEEEEEHHHCceeeCCcccCCCCCcChHHceEEE
Confidence            489999999999999999999998742   5699999999866 799999999999999999999999999999999999


Q ss_pred             EEeecCCCeeEEEEE
Q psy6915          81 LSLVDQNNGPFQLEL   95 (127)
Q Consensus        81 ~~i~~kq~G~F~LeI   95 (127)
                      |+++++++|||+|+|
T Consensus       143 ~~~~~~~~G~F~L~I  157 (157)
T PF08547_consen  143 FMISDKQEGPFELEI  157 (157)
T ss_pred             EEEecCCCCCEEEeC
Confidence            999999999999997


No 3  
>PF03425 CBM_11:  Carbohydrate binding domain (family 11);  InterPro: IPR005087 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM11 from CAZY which binds both beta-1,4-glucan and beta-1,3-1,4-mixed linked glucans.; GO: 0008810 cellulase activity, 0030245 cellulose catabolic process; PDB: 1V0A_A.
Probab=99.02  E-value=1.6e-09  Score=80.92  Aligned_cols=92  Identities=21%  Similarity=0.340  Sum_probs=55.5

Q ss_pred             CcceEEEEEeeCC--ceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceee----ecceecCCCCCCCcc
Q psy6915           1 MYNHLVIKCRGDG--RSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQ----SKGRIQDIQNPLDTD   74 (127)
Q Consensus         1 ~y~gl~lrvrGDG--r~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t----~rG~~~~~~~~ld~~   74 (127)
                      .|+||++.+||||  ++..|.+++...     ...|.+.+....  +|++|+|||++|.+.    --|...  ..+||++
T Consensus        72 ~~~gl~Fw~k~dgs~~~l~vqi~d~~~-----~e~~~~~~~~~~--~W~~V~IPF~~f~~~~~~~p~g~~~--~~~ldl~  142 (178)
T PF03425_consen   72 GYGGLSFWIKGDGSGNKLRVQIKDGGD-----YEYWEASFTDSS--TWKTVEIPFSDFTQRPDYQPGGWGA--DGTLDLT  142 (178)
T ss_dssp             T--EEEEEEEE------EEEEEEEE-E-----EEEEEEEE---S--S-EEEEEEGGG-EE--S---TT------SS--TT
T ss_pred             cCCcEEEEEEcCCCCcEEEEEEecCCc-----ceeeEeecCCCC--cCEEEEEEHHHcccccccCCCCCCc--ccccChH
Confidence            4899999999665  666666666651     347887777633  599999999999971    112111  2578999


Q ss_pred             CccEEEEEeecCCCeeEEEEEcEEEEEe
Q psy6915          75 RISSIGLSLVDQNNGPFQLELDYIGIEY  102 (127)
Q Consensus        75 ~I~~iG~~i~~kq~G~F~LeI~~I~~~~  102 (127)
                      +|.+|.|.+.+. .+++.|.||.|++|.
T Consensus       143 ~v~~~~~~~~~~-~~~~~~~iDdI~l~~  169 (178)
T PF03425_consen  143 NVWEFAFYVNGG-GGAGTFYIDDIRLYG  169 (178)
T ss_dssp             SEEEEEEEESSS----EEEEEEEEEEE-
T ss_pred             HcEEEEEEEcCC-CceeEEEEEeEEEEe
Confidence            999999998854 445999999999994


No 4  
>PF04620 FlaA:  Flagellar filament outer layer protein Flaa;  InterPro: IPR006714 Periplasmic flagella are the organelles of spirochete mobility, and are structurally different from the flagella of other motile bacteria. They reside inside the cell within the periplasmic space, and confer mobility in viscous gel-like media such as connective tissue []. The flagella are composed of an outer sheath of FlaA proteins and a core filament of FlaB proteins. Each species usually has several FlaA protein species [].; GO: 0001539 ciliary or flagellar motility, 0030288 outer membrane-bounded periplasmic space
Probab=94.26  E-value=1  Score=35.11  Aligned_cols=100  Identities=13%  Similarity=0.192  Sum_probs=70.2

Q ss_pred             eEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecCCCC--CCCccCccEEEE
Q psy6915           4 HLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQN--PLDTDRISSIGL   81 (127)
Q Consensus         4 gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~~~--~ld~~~I~~iG~   81 (127)
                      .|.+-|.|+|..+.|.+.-.+...    ..++..+-.-.-..|+.+..+.-...|.-|-+.+...+  +.....|+=.||
T Consensus       109 ~I~vWV~G~n~~h~L~v~lrD~~G----~~~~l~~G~L~f~GWK~L~~~iP~~ip~~r~r~~q~~~~yp~~~~~l~f~gf  184 (217)
T PF04620_consen  109 SISVWVYGDNYPHWLEVLLRDAKG----EVHQLPLGSLNFDGWKNLTVNIPPYIPDVRNRILQQYPRYPRSDPGLKFVGF  184 (217)
T ss_pred             EEEEEEECCCCCceEEEEEEcCCC----CEEEEEeeeecCCceeEEEEECCCCCCccceeeeeecccCCCCCCceEEEEE
Confidence            577888999977666665554432    24555554333446999999999999866555544222  334455888899


Q ss_pred             Eeec---CCCeeEEEEEcEEEEEeCCCch
Q psy6915          82 SLVD---QNNGPFQLELDYIGIEYDPNHT  107 (127)
Q Consensus        82 ~i~~---kq~G~F~LeI~~I~~~~~~~~~  107 (127)
                      .|..   ...|+|-+-+|.+++..|..+.
T Consensus       185 rI~~dp~~~~g~f~iYfDdlkv~tD~~~v  213 (217)
T PF04620_consen  185 RIDRDPDERGGDFYIYFDDLKVLTDKFVV  213 (217)
T ss_pred             EEECChhhcCCCEEEEEeeeEEEEEEEEE
Confidence            9975   3689999999999999887655


No 5  
>COG3506 Uncharacterized conserved protein [Function unknown]
Probab=78.11  E-value=28  Score=26.48  Aligned_cols=32  Identities=16%  Similarity=0.446  Sum_probs=27.1

Q ss_pred             CCCCccCccEEEEEeecCCCeeEEEEEcEEEE
Q psy6915          69 NPLDTDRISSIGLSLVDQNNGPFQLELDYIGI  100 (127)
Q Consensus        69 ~~ld~~~I~~iG~~i~~kq~G~F~LeI~~I~~  100 (127)
                      +++..++..++|+|+...|.+-|+......++
T Consensus       149 apfp~~~~v~vGvmacSP~r~gf~v~F~e~sL  180 (189)
T COG3506         149 APFPTAKVVSVGVMACSPQRSGFEVTFSEMSL  180 (189)
T ss_pred             CCCcccceEEEEeEEecCCcCceEEEEeeeec
Confidence            68888999999999999888889887766554


No 6  
>PF14330 DUF4387:  Domain of unknown function (DUF4387)
Probab=54.21  E-value=2.6  Score=29.08  Aligned_cols=17  Identities=41%  Similarity=0.825  Sum_probs=14.2

Q ss_pred             EeecCCCeeEEEEEcEE
Q psy6915          82 SLVDQNNGPFQLELDYI   98 (127)
Q Consensus        82 ~i~~kq~G~F~LeI~~I   98 (127)
                      +|..|++|||+|.+|=|
T Consensus         7 vIRSKNAGPf~lT~DI~   23 (99)
T PF14330_consen    7 VIRSKNAGPFELTFDII   23 (99)
T ss_pred             HHhccCCCCcEEEEEEE
Confidence            56789999999999844


No 7  
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=54.07  E-value=85  Score=23.03  Aligned_cols=61  Identities=28%  Similarity=0.375  Sum_probs=37.0

Q ss_pred             CCCceEEEEEeCCCceeee--cceecCCC-----C--CCCccCccEEEEEeecCCCeeEEEEEcEEEEEeCC
Q psy6915          42 GGPYWQITKIPFSKFFLQS--KGRIQDIQ-----N--PLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDP  104 (127)
Q Consensus        42 ~~~~W~tv~iPfs~F~~t~--rG~~~~~~-----~--~ld~~~I~~iG~~i~~kq~G~F~LeI~~I~~~~~~  104 (127)
                      .+++|..|.|-.+.=.-+.  -++.+...     .  .++++.+.-+|-...+  .++|+-.|.++.++.++
T Consensus       115 ~dg~WH~lal~V~~~~v~LyvDC~~~~~~~l~~~~~~~~~~~g~~~~g~~~~~--~~~f~G~lq~l~i~~dp  184 (184)
T smart00210      115 ADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGQPPIDTDGIEVRGAQAAD--RKPFQGDLQQLKIVCDP  184 (184)
T ss_pred             ccCCceEEEEEEeCCEEEEEECCccccceecCCcccccccccceEEEeeccCC--CCcceEEeEEEEEecCC
Confidence            3558998888887665542  34333321     1  3444444444433332  46899999999999875


No 8  
>PHA01795 hypothetical protein
Probab=52.52  E-value=1.2e+02  Score=24.64  Aligned_cols=77  Identities=16%  Similarity=0.241  Sum_probs=41.1

Q ss_pred             ceEEEEEEcCCCCCC----cccccEEEEEEeCCCCceEEEEEeCCCceee----------ecceecCCCCCCCccCccEE
Q psy6915          14 RSYMLNLHTPGDFDV----TWGDMFTYALYTRGGPYWQITKIPFSKFFLQ----------SKGRIQDIQNPLDTDRISSI   79 (127)
Q Consensus        14 r~Y~l~L~~~~~~~~----~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t----------~rG~~~~~~~~ld~~~I~~i   79 (127)
                      ++|++.+...-..+.    .-+..|.+.+      +++|++|||.+=.|+          ++|.+.-+  .|+..-  -|
T Consensus       124 kRykV~~~~~~~~~~~g~~~~g~~~~~sl------~FeT~eLPfgEsi~~si~l~~~s~~Y~Gtvp~s--Qle~~~--~i  193 (280)
T PHA01795        124 YRFKVILDDTINYEFIGKTSQGLKYNLSF------TLRTAELPFGETKPKDITLSGGSFAYRGTAKAS--QLEWPF--II  193 (280)
T ss_pred             eEEEEEecCccCHHHccccCCceEEEEEE------EEEEEccccccCCCCcEEEecceEEeCCcccCc--cccCcE--EE
Confidence            788887754432210    0022343333      478999999887665          45553222  232222  33


Q ss_pred             EEEeecCCCeeEEEEEcEEEEE
Q psy6915          80 GLSLVDQNNGPFQLELDYIGIE  101 (127)
Q Consensus        80 G~~i~~kq~G~F~LeI~~I~~~  101 (127)
                      -|.+. +..+.|.|++..-...
T Consensus       194 ~it~k-~~s~sf~i~ttg~~w~  214 (280)
T PHA01795        194 ELTPS-GGQTNFYIEIDGRRFE  214 (280)
T ss_pred             EEEEe-cCCccEEEEECCcEEE
Confidence            34444 3366888887755544


No 9  
>PF13598 DUF4139:  Domain of unknown function (DUF4139)
Probab=52.49  E-value=22  Score=28.25  Aligned_cols=47  Identities=21%  Similarity=0.203  Sum_probs=38.3

Q ss_pred             ceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecce
Q psy6915          14 RSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGR   63 (127)
Q Consensus        14 r~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~   63 (127)
                      =.|.++|.+.+..   -...|.+.+....|..|+.|.|-|+.-.|...+.
T Consensus        14 p~Y~l~l~~~~~~---~~l~~~a~V~q~TGeDW~~v~LtLsT~~P~~~~~   60 (317)
T PF13598_consen   14 PSYDLRLDTETGT---VDLTYWAEVRQNTGEDWNNVKLTLSTARPSRGGT   60 (317)
T ss_pred             EEEEEEEeCCCCe---EEEEEEEEEEcCCCCCccCceEEEEeCCCccccc
Confidence            5799999886543   2468999999888889999999999999887654


No 10 
>COG0835 CheW Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]
Probab=41.78  E-value=1e+02  Score=22.49  Aligned_cols=49  Identities=12%  Similarity=0.162  Sum_probs=35.8

Q ss_pred             CccCccEEEEEeecCCCeeEEEEEcEEEEEeCCCchhhhhhhhccccceEEeeecC
Q psy6915          72 DTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW  127 (127)
Q Consensus        72 d~~~I~~iG~~i~~kq~G~F~LeI~~I~~~~~~~~~~~~~y~~~~~~~~~~~~~~~  127 (127)
                      ......-++|.+.+   ..|.+.|..|+=.......-..+-    -|++|.|++||
T Consensus        14 ~~~~~~~l~F~Lg~---e~y~i~i~~V~EI~~~~~iT~vP~----ap~~v~GViNl   62 (165)
T COG0835          14 GENELEFLGFRLGD---EEYGIDILKVREIVGYPPITPVPN----APPYVLGVINL   62 (165)
T ss_pred             cCCceEEEEEEECC---EEEEEEeeeEEEEecCCCcccCCC----CCccEEEEEec
Confidence            44566777888875   789999988886665555444443    68899999996


No 11 
>COG5007 Predicted transcriptional regulator, BolA superfamily [Transcription]
Probab=41.46  E-value=31  Score=22.89  Aligned_cols=20  Identities=15%  Similarity=0.267  Sum_probs=17.0

Q ss_pred             EEEEeeCCceEEEEEEcCCC
Q psy6915           6 VIKCRGDGRSYMLNLHTPGD   25 (127)
Q Consensus         6 ~lrvrGDGr~Y~l~L~~~~~   25 (127)
                      ++.|+|||..|++++=++.-
T Consensus        18 ~v~V~Gdg~Hf~vi~Vs~~F   37 (80)
T COG5007          18 EVEVEGDGSHFQVIAVSEEF   37 (80)
T ss_pred             EEEEecCCceEEEEEehHhh
Confidence            67899999999999977753


No 12 
>KOG3186|consensus
Probab=40.26  E-value=14  Score=28.19  Aligned_cols=73  Identities=22%  Similarity=0.323  Sum_probs=54.1

Q ss_pred             CCceEEEEEeCCCceeeecceecCCCCCCCccCccEEEEEeecC---CCeeEEEEEcEEEEEeCCCchhhhhhhhcccc
Q psy6915          43 GPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQ---NNGPFQLELDYIGIEYDPNHTEEFAYEMYLFE  118 (127)
Q Consensus        43 ~~~W~tv~iPfs~F~~t~rG~~~~~~~~ld~~~I~~iG~~i~~k---q~G~F~LeI~~I~~~~~~~~~~~~~y~~~~~~  118 (127)
                      .+-|+...=|+..+..--   ...-.+.|...++.++.++|.+|   ....|.++++++....+..+.=..+||||-|.
T Consensus        41 ~pv~~S~hP~Ln~Yi~~v---l~s~~elL~~~~v~~v~l~i~~KE~r~~E~yvf~~~s~p~~~~~~d~~l~~~eq~mRa  116 (202)
T KOG3186|consen   41 TPVWQSRHPILNEYIEEV---LQSCIELLQTGSVHQVALSIINKEQREEERYVFSTDSIPIIPDFLDEFLDAYEQYMRA  116 (202)
T ss_pred             CceeeecCchHHHHHHHH---HHHHHHHHhcCCeeEEEEEEEccccChhhheEeeccCCCcCchhHHHHHHHHHHHHHH
Confidence            346888777777765431   12223567888999999999987   44579999999999888777656779999764


No 13 
>PF03295 Pox_TAA1:  Poxvirus trans-activator protein A1 C-terminal;  InterPro: IPR004975 Late transcription factor VLTF-2, acts with RNA polymerase to initiate transcription from late gene promoters [].
Probab=37.33  E-value=4.2  Score=25.70  Aligned_cols=32  Identities=16%  Similarity=0.086  Sum_probs=26.3

Q ss_pred             EcEEEEEeCCCchhhhhhhhccccceEEeeec
Q psy6915          95 LDYIGIEYDPNHTEEFAYEMYLFENEVRGIVN  126 (127)
Q Consensus        95 I~~I~~~~~~~~~~~~~y~~~~~~~~~~~~~~  126 (127)
                      |.++-|.+++.-....+-++|..|++|..|+|
T Consensus         3 ik~~vALREEPKisLLPLv~Y~~Pe~Vi~iIN   34 (63)
T PF03295_consen    3 IKSHVALREEPKISLLPLVFYEDPEEVINIIN   34 (63)
T ss_pred             ccceeeeccCCcceEEeeeeccCHHHHHHHHH
Confidence            56677888888888888889999999887766


No 14 
>cd07374 CYTH-like_Pase CYTH-like (also known as triphosphate tunnel metalloenzyme (TTM)-like) Phosphatases. CYTH-like superfamily enzymes hydrolyze triphosphate-containing substrates and require metal cations as cofactors. They have a unique active site located at the center of an eight-stranded antiparallel beta barrel tunnel (the triphosphate tunnel). The name CYTH originated from the gene designation for bacterial class IV adenylyl cyclases (CyaB), and from thiamine triphosphatase. Class IV adenylate cyclases catalyze the conversion of ATP to 3',5'-cyclic AMP (cAMP) and PPi. Thiamine triphosphatase is a soluble cytosolic enzyme which converts thiamine triphosphate to thiamine diphosphate. This domain superfamily also contains RNA triphosphatases, membrane-associated polyphosphate polymerases, tripolyphosphatases, nucleoside triphosphatases, nucleoside tetraphosphatases and other proteins with unknown functions.
Probab=37.11  E-value=36  Score=24.50  Aligned_cols=23  Identities=26%  Similarity=0.381  Sum_probs=19.4

Q ss_pred             ceEEEEEeeCCceEEEEEEcCCC
Q psy6915           3 NHLVIKCRGDGRSYMLNLHTPGD   25 (127)
Q Consensus         3 ~gl~lrvrGDGr~Y~l~L~~~~~   25 (127)
                      .+|+||++.+|+.|.+.||+...
T Consensus        49 ~~lrlR~r~~~~~~~~TlK~~~~   71 (174)
T cd07374          49 AGLRLRRRTGGADAGWHLKLPGG   71 (174)
T ss_pred             CCcEEEEEcCCCccEEEEEccCC
Confidence            46889998888999999998864


No 15 
>PF14508 GH97_N:  Glycosyl-hydrolase 97 N-terminal; PDB: 3A24_A 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A.
Probab=35.92  E-value=83  Score=24.69  Aligned_cols=36  Identities=33%  Similarity=0.589  Sum_probs=25.4

Q ss_pred             cceEEEEEee-CCceEEEEEEcCCCCCCcccccEEEEEEeCC
Q psy6915           2 YNHLVIKCRG-DGRSYMLNLHTPGDFDVTWGDMFTYALYTRG   42 (127)
Q Consensus         2 y~gl~lrvrG-DGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~   42 (127)
                      |+.+.|.+++ +|+++.|.+|.-+.     +.+|++.+..++
T Consensus        79 yne~~l~~~~~~~~~l~l~fRaydd-----GvAfRY~~p~~~  115 (259)
T PF14508_consen   79 YNELTLSFKNKYGRRLNLEFRAYDD-----GVAFRYELPEQG  115 (259)
T ss_dssp             EEEEEEEECCCCCEEEEEEEEEETT-----EEEEEEEE--BT
T ss_pred             EEEEEEEEecCCCceeEEEEEEEcC-----CEEEEEEECCCC
Confidence            7788999987 46688888887764     358888777543


No 16 
>PF09496 CENP-O:  Cenp-O kinetochore centromere component;  InterPro: IPR018464 Chromosome segregation in eukaryotes requires the kinetochore, a multi-protein structure that assembles on centromeric DNA, and which acts to link chromosomes to spindle microtubules. Kinetochore structure and composition is highly conserved among vertebrates. The inner kinetochore is essential for kinetochore assembly, and is involved in chromosome segregation via regulation of the spindle. Inner kinetochore components include the multi-subunit CENP-H/I complex, which may function, in part, in directing centromere protein A (CENP-A) deposition to centromeres, where CENP-A is a centromere-specific histone H3 variant required for the organisation of centromeric chromatin during interphase. The CENP-H/I complex contains three functional classes of proteins [, ]:    CENP-H class (includes CENP-H, -I, -K, -L) CENP-M class (includes CENP-M) CENP-O class (includes CENP-O, -P, -Q, -R, -50)   The CENP-O class proteins form a stable complex and are required for proper kinetochore function. They are involved in the prevention of premature sister chromatid separation during recovery from spindle damage []. CENP-O mediates the attachment of the centromere to the mitotic spindle by forming essential interactions between the microtubule-associated outer kinetochore proteins and the centromere-associated inner kinetochore proteins. It has been shown to be involved in chromosome segregation via regulation of the spindle in both yeast [] and human [].; GO: 0007059 chromosome segregation, 0051301 cell division, 0000775 chromosome, centromeric region, 0005634 nucleus; PDB: 3ZXU_C.
Probab=35.17  E-value=87  Score=20.79  Aligned_cols=23  Identities=17%  Similarity=0.303  Sum_probs=14.4

Q ss_pred             eEEEEEeeC-C---ceEEEEEEcCCCC
Q psy6915           4 HLVIKCRGD-G---RSYMLNLHTPGDF   26 (127)
Q Consensus         4 gl~lrvrGD-G---r~Y~l~L~~~~~~   26 (127)
                      ||++++... |   +.|-+.|+.....
T Consensus         2 GiR~di~~~~g~f~~pYYviL~~~~~~   28 (90)
T PF09496_consen    2 GIRFDIFSREGRFLEPYYVILKRPSKS   28 (90)
T ss_dssp             EEEE--EETTTEE---EEEEEEE-SSS
T ss_pred             EEEEEEEcCCCCCCCceEEEEEecCCC
Confidence            889999877 7   6889999887654


No 17 
>COG3025 Uncharacterized conserved protein [Function unknown]
Probab=31.43  E-value=51  Score=28.37  Aligned_cols=24  Identities=25%  Similarity=0.509  Sum_probs=21.3

Q ss_pred             ceEEEEEeeCCceEEEEEEcCCCC
Q psy6915           3 NHLVIKCRGDGRSYMLNLHTPGDF   26 (127)
Q Consensus         3 ~gl~lrvrGDGr~Y~l~L~~~~~~   26 (127)
                      .++-||||+.|..|..+||+.+..
T Consensus        51 ~~~gLRIR~~~~~y~~TlKtaG~v   74 (432)
T COG3025          51 HDMGLRIRREGGQYEQTLKTAGGV   74 (432)
T ss_pred             CCceEEEeccCCeEEEEEEecCcc
Confidence            467899999999999999999865


No 18 
>PF01562 Pep_M12B_propep:  Reprolysin family propeptide;  InterPro: IPR002870 This signature covers the region of the propeptide for members of the MEROPS peptidase family M12B (clan MA(M), adamalysin family). The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins, which mediate cell-cell or cell-matrix interactions.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis
Probab=31.35  E-value=1.7e+02  Score=19.90  Aligned_cols=59  Identities=19%  Similarity=0.272  Sum_probs=38.5

Q ss_pred             ceEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecC
Q psy6915           3 NHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQD   66 (127)
Q Consensus         3 ~gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~   66 (127)
                      +.+.++++..|+.|.|+|+.+...-.   ..+........+ . .+...|...=.-.|+|.+..
T Consensus        29 ~~~~~~~~a~g~~~~L~L~~n~~l~~---~~~~v~~~~~~g-~-~~~~~~~~~~~c~Y~G~V~g   87 (131)
T PF01562_consen   29 SHLSYSFRAFGREFHLDLEPNHSLFS---PGFVVEVRNEDG-T-REESSPSHQEHCHYQGHVEG   87 (131)
T ss_pred             ceEEEEEEECCEEEEEEEEECCcccC---CCCEEEEEcccc-c-ccccCCccccceEEEeeeCC
Confidence            45788999999999999999876532   345555554333 2 24555544444457888655


No 19 
>PF13049 DUF3910:  Protein of unknown function (DUF3910)
Probab=31.32  E-value=1.5e+02  Score=19.79  Aligned_cols=49  Identities=18%  Similarity=0.332  Sum_probs=27.1

Q ss_pred             cceEE--EEEeeCCceEEEEEEcCCCCCCcccccEEEEEE-eCCCCc-eEEEEEeCCC
Q psy6915           2 YNHLV--IKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALY-TRGGPY-WQITKIPFSK   55 (127)
Q Consensus         2 y~gl~--lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~-t~~~~~-W~tv~iPfs~   55 (127)
                      |++..  +.+.+|.++|++.+-..+..     ..|...-+ +++|.+ -=-|.|||..
T Consensus        24 yda~~idfsl~~ddnryklivlk~e~~-----~hyk~vqyg~kpgsqkpfpidipf~~   76 (93)
T PF13049_consen   24 YDATSIDFSLENDDNRYKLIVLKHEEN-----THYKFVQYGIKPGSQKPFPIDIPFEQ   76 (93)
T ss_pred             eeeeEEEEEeccCCCeEEEEEEeccCc-----ceeEEEEeccCcCCCCCccccccchh
Confidence            45444  45578999999887655432     24554433 333311 1246667653


No 20 
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=30.49  E-value=49  Score=28.49  Aligned_cols=46  Identities=20%  Similarity=0.157  Sum_probs=32.4

Q ss_pred             eEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecce
Q psy6915          15 SYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGR   63 (127)
Q Consensus        15 ~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~   63 (127)
                      .|.++|.+....   -+..|.+.+....|.+|..|.|-|+.-.|...+.
T Consensus       210 ~Ydlrl~~~~~~---~~l~~~A~V~Q~TGeDW~~V~LtLsTa~P~~~~~  255 (525)
T TIGR02231       210 SYDARLDTGAPT---VELTYLAEIRQSTGEDWSDIPLVLSTAQPSLGGR  255 (525)
T ss_pred             eeEEEecCCCce---EEEEEEEEEEeCCCCCCCCCcEEEeccCCCCCCC
Confidence            566666554321   1346777777666778999999999999987553


No 21 
>COG0335 RplS Ribosomal protein L19 [Translation, ribosomal structure and biogenesis]
Probab=28.44  E-value=60  Score=23.01  Aligned_cols=20  Identities=25%  Similarity=0.576  Sum_probs=14.8

Q ss_pred             CcceEEEEEeeCC--ceEEEEE
Q psy6915           1 MYNHLVIKCRGDG--RSYMLNL   20 (127)
Q Consensus         1 ~y~gl~lrvrGDG--r~Y~l~L   20 (127)
                      +|+|+.|+.+|.|  .+|.++=
T Consensus        43 ~FeGvVia~r~~G~~~tftvRk   64 (115)
T COG0335          43 AFEGVVIARRGRGISETFTVRK   64 (115)
T ss_pred             eeeEEEEEECCCCccceEEEEE
Confidence            5899999999888  4554443


No 22 
>PF03944 Endotoxin_C:  delta endotoxin;  InterPro: IPR005638 This family contains insecticidal toxins produced by Bacillus species of bacteria. During spore formation the bacteria produce crystals of this protein. When an insect ingests these proteins, they are activated by proteolytic cleavage. The N terminus is cleaved in all of the proteins and a C-terminal extension is cleaved in some members. Once activated, the endotoxin binds to the gut epithelium and causes cell lysis by the formation of cation-selective channels, which leads to death. The activated region of the delta toxin is composed of three distinct structural domains: an N-terminal helical bundle domain (IPR005639 from INTERPRO) involved in membrane insertion and pore formation; a beta-sheet central domain (IPR001178 from INTERPRO) involved in receptor binding; and a C-terminal beta-sandwich domain that interacts with the N-terminal domain to form a channel [, ]. This entry represents the conserved C-terminal domain.; PDB: 1DLC_A 1JI6_A 1W99_A 1CIY_A 1I5P_A 2C9K_A 3EB7_A.
Probab=28.01  E-value=1.3e+02  Score=21.29  Aligned_cols=91  Identities=9%  Similarity=0.074  Sum_probs=44.6

Q ss_pred             EEEEe---eCCceEEEEEEcCCCCCCc-----ccccE--EEEEEeCCCCceEEEEEeCCCceeeecceecCCCCCCCccC
Q psy6915           6 VIKCR---GDGRSYMLNLHTPGDFDVT-----WGDMF--TYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDR   75 (127)
Q Consensus         6 ~lrvr---GDGr~Y~l~L~~~~~~~~~-----~~~~y--~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~~~~ld~~~   75 (127)
                      .++|.   .-.++|++++|-....+..     +....  ...+..... ....+.+..++|...--...    -.+....
T Consensus        41 ~~~v~~~~~~~~~YrIRiRYAs~~~~~~~i~~~~~~~~~~~~~~~T~~-~~~~~~~~y~~F~y~~~~~~----~~~~~~~  115 (143)
T PF03944_consen   41 KIRVTINNSSSQKYRIRIRYASNSNGTLSISINNSSGNLSFNFPSTMS-NGDNLTLNYESFQYVEFPTP----FTFSSNQ  115 (143)
T ss_dssp             EEEEEESSSSTEEEEEEEEEEESS-EEEEEEETTEEEECEEEE--SSS-TTGGCCETGGG-EEEEESSE----EEESTSE
T ss_pred             EEEEEecCCCCceEEEEEEEEECCCcEEEEEECCccceeeeecccccc-CCCccccccceeEeeecCce----EEecCCC
Confidence            66666   3448899999877654310     11111  333443322 12235555667766532111    1233333


Q ss_pred             ccEEEEEeecCCCeeEEEEEcEEEEEe
Q psy6915          76 ISSIGLSLVDQNNGPFQLELDYIGIEY  102 (127)
Q Consensus        76 I~~iG~~i~~kq~G~F~LeI~~I~~~~  102 (127)
                      ...+-+.+..... .=.|-||.|++..
T Consensus       116 ~~~~~i~i~~~~~-~~~v~IDkIEFIP  141 (143)
T PF03944_consen  116 SITITISIQNISS-NGNVYIDKIEFIP  141 (143)
T ss_dssp             EEEEEEEEESSTT-TS-EEEEEEEEEE
T ss_pred             ceEEEEEEEecCC-CCeEEEEeEEEEe
Confidence            4556666664422 2678889999864


No 23 
>smart00712 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria.
Probab=27.33  E-value=1.6e+02  Score=18.33  Aligned_cols=43  Identities=5%  Similarity=-0.023  Sum_probs=28.1

Q ss_pred             EeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCce
Q psy6915           9 CRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFF   57 (127)
Q Consensus         9 vrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~   57 (127)
                      ++-+.|+|.+.|+.+...      .|---....++..=.+|.||.+++.
T Consensus         7 l~~~~k~fyfDvk~N~rG------~fLrIsE~~~~~~r~~I~lp~~~~~   49 (63)
T smart00712        7 LTVENKRFYFDVKENRRG------RFLRISEVKNNGGRSSITVPEQGAA   49 (63)
T ss_pred             EEeCCcEEEEEecccCCc------cEEEEEEecCCCCceEEEEEHHHHH
Confidence            466889999999988742      3443333434423578888876553


No 24 
>cd05163 TRRAP TRansformation/tRanscription domain-Associated Protein (TRRAP), pseudokinase domain; The TRRAP catalytic domain is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. TRRAP shows some similarity to members of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily in that it contains a FATC (FRAP, ATM and TRRAP, C-terminal) domain and has a large molecular weight. Unlike PIKK proteins, however, it contains an inactive PI3K-like pseudokinase domain, which lacks the conserved residues necessary for ATP binding and catalytic activity. TRRAP also contains many motifs that may be critical for protein-protein interactions. TRRAP is a common component of many histone acetyltransferase (HAT) complexes, and is responsible for the recruitment of these complexes to chromatin during transcription, replicat
Probab=27.10  E-value=74  Score=24.99  Aligned_cols=19  Identities=32%  Similarity=0.597  Sum_probs=14.2

Q ss_pred             EEEEee-CCceEEEEEEcCC
Q psy6915           6 VIKCRG-DGRSYMLNLHTPG   24 (127)
Q Consensus         6 ~lrvrG-DGr~Y~l~L~~~~   24 (127)
                      +|.++| ||+.|.+-++...
T Consensus        20 ri~i~gsdG~~y~fLvk~~~   39 (253)
T cd05163          20 RLTIRGHDGSIYPFLVQYPA   39 (253)
T ss_pred             EEEEECCCCCEEEEEEecCC
Confidence            355664 9999999998654


No 25 
>PF10636 hemP:  Hemin uptake protein hemP;  InterPro: IPR019600  This entry represents bacterial proteins that are involved in the uptake of the iron source hemin []. ; PDB: 2JRA_B 2LOJ_A.
Probab=27.01  E-value=1.3e+02  Score=17.11  Aligned_cols=19  Identities=21%  Similarity=0.480  Sum_probs=15.3

Q ss_pred             EEEEeeCCceEEEEEEcCC
Q psy6915           6 VIKCRGDGRSYMLNLHTPG   24 (127)
Q Consensus         6 ~lrvrGDGr~Y~l~L~~~~   24 (127)
                      ++.+.-+|..|.+++....
T Consensus        14 ev~I~H~g~~Y~LR~Tr~g   32 (38)
T PF10636_consen   14 EVRIEHGGQIYRLRITRQG   32 (38)
T ss_dssp             EEEEEETTEEEEEEEETTT
T ss_pred             EEEEEeCCeEEEeeEccCC
Confidence            5778889999999986553


No 26 
>PF09137 Glucodextran_N:  Glucodextranase, domain N;  InterPro: IPR015220 Glucodextranase domain N, uniquely found in bacterial and archaeal glucoamylases and glucodextranases, adopts a structure consisting of 17 antiparallel beta-strands. These beta-strands are divided into two beta-sheets, and one of the beta-sheets is wrapped by an extended polypeptide, which appears to stabilise the domain. This domain, together with glycoside hydrolase domain A (IPR011613 from INTERPRO), is mainly involved with catalytic activity, hydrolysing alpha-1,6-glucosidic linkages of dextran to release beta-D-glucose from the non-reducing end via an inverting reaction mechanism []. ; PDB: 1UG9_A 1ULV_A 1LF6_B 1LF9_A.
Probab=26.44  E-value=95  Score=24.99  Aligned_cols=18  Identities=22%  Similarity=0.386  Sum_probs=13.9

Q ss_pred             CCCCccCccEEEEEeecC
Q psy6915          69 NPLDTDRISSIGLSLVDQ   86 (127)
Q Consensus        69 ~~ld~~~I~~iG~~i~~k   86 (127)
                      |.+|..+|+.|+|++.|+
T Consensus        44 P~iD~~qiRdl~fiVtDg   61 (269)
T PF09137_consen   44 PRIDTPQIRDLQFIVTDG   61 (269)
T ss_dssp             SSTT-EEEEEEEEEEE-S
T ss_pred             cCCCCccceeeEEEEEcC
Confidence            678889999999999863


No 27 
>cd08694 C2_Dock-A C2 domains found in Dedicator Of CytoKinesis (Dock) class A proteins. Dock-A is one of 4 classes of Dock family proteins.  The members here include: Dock180/Dock1, Dock2, and Dock5.  Most of these members have been shown to be GEFs specific for Rac.  Dock5 has not been well characterized to date, but most likely also is a GEF specific for Rac. In addition to the C2 domain (AKA Dock homology region (DHR)-1, CED-5, Dock180, MBC-zizimin homology (CZH) 1) and the DHR-2 (AKA CZH2, or Docker), which all Dock180-related proteins have, Dock-A members contain a proline-rich region and a SH3 domain upstream of the C2 domain. DHR-2 has the catalytic activity for Rac and/or Cdc42, but is structurally unrelated to the DH domain. The C2/DHR-1 domains of Dock180 and Dock4 have been shown to bind phosphatidylinositol-3, 4, 5-triphosphate (PtdIns(3,4,5)P3). The C2 domain was first identified in PKC. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangemen
Probab=24.88  E-value=3.2e+02  Score=20.97  Aligned_cols=86  Identities=16%  Similarity=0.255  Sum_probs=41.7

Q ss_pred             eCCceEEEEEEcCCCCCCcccccEEEEE-EeCCCCce-EE--EEEeCCCceee-----ecceecCCCCCCC-ccCccEEE
Q psy6915          11 GDGRSYMLNLHTPGDFDVTWGDMFTYAL-YTRGGPYW-QI--TKIPFSKFFLQ-----SKGRIQDIQNPLD-TDRISSIG   80 (127)
Q Consensus        11 GDGr~Y~l~L~~~~~~~~~~~~~y~~~f-~t~~~~~W-~t--v~iPfs~F~~t-----~rG~~~~~~~~ld-~~~I~~iG   80 (127)
                      .||..-.-.+-......  ....|...+ +...+|.| ++  |.||.+.|.-.     +|=+-..+.+  | +...-.++
T Consensus        34 ~~G~~i~~~i~~gsg~~--~~se~~S~V~Yh~~~P~W~EtIKl~lP~~~~~~~HL~FtfrH~S~~~~k--d~~e~pfg~s  109 (196)
T cd08694          34 EDGKIIPGVISLGAGEE--PIDEYKSVIYYQVDKPKWFETFKVAIPIEDFKSSHLRFTFKHRSSNEAK--DKSEKPFALS  109 (196)
T ss_pred             CCCCCCceeEEcCCCCC--cceeEEEEEEeecCCCCCceeEEEecChhhCCCeEEEEEEEeecccccc--CCCCCceEEE
Confidence            46755443343333222  123555554 56677889 34  55677766443     3332211111  1 12222333


Q ss_pred             EEee---cC---CCeeEEEEEcEEEE
Q psy6915          81 LSLV---DQ---NNGPFQLELDYIGI  100 (127)
Q Consensus        81 ~~i~---~k---q~G~F~LeI~~I~~  100 (127)
                      ++=.   ++   +.|++.|-|-.+.-
T Consensus       110 ~lpL~~~~gt~l~dG~H~L~vYK~d~  135 (196)
T cd08694         110 FVKLMQENGTTLTDGEHDLIVYKVDA  135 (196)
T ss_pred             EEeeeccCCcEEccCCEEEEEEEecc
Confidence            3211   22   68999999887764


No 28 
>cd07756 CYTH-like_Pase_CHAD Uncharacterized subgroup of the CYTH-like superfamily having an associated CHAD domain. This subgroup belongs to the CYTH-like (also known as triphosphate tunnel metalloenzyme (TTM)-like) superfamily. Members of this superfamily hydrolyze triphosphate-containing substrates, require metal cations as cofactors, and have a unique active site located at the center of an eight-stranded antiparallel beta barrel tunnel (the triphosphate tunnel). A number of proteins in this subgroup also contain a C-terminal CHAD (Conserved Histidine Alpha-helical Domain) domain which may participate in metal chelation or act as a phosphor-acceptor. The name CYTH originated from the gene designation for bacterial class IV adenylyl cyclases (CyaB) and from thiamine triphosphatase. Class IV adenylate cyclases catalyze the conversion of ATP to 3',5'-cyclic AMP (cAMP) and PPi. Thiamine triphosphatase is a soluble cytosolic enzyme which converts thiamine triphosphate to thiamine diphosp
Probab=24.45  E-value=84  Score=23.70  Aligned_cols=23  Identities=22%  Similarity=0.551  Sum_probs=18.6

Q ss_pred             ceEEEEEeeCCceEEEEEEcCCC
Q psy6915           3 NHLVIKCRGDGRSYMLNLHTPGD   25 (127)
Q Consensus         3 ~gl~lrvrGDGr~Y~l~L~~~~~   25 (127)
                      .++.||+|..|..|.++||....
T Consensus        50 ~~~aLRiR~~~~~~~~TlK~~~~   72 (197)
T cd07756          50 AGIALRVRREGGQWVQTLKTAGS   72 (197)
T ss_pred             CCCEEEEEeeCCeEEEEEeeCCc
Confidence            46788998878889999998753


No 29 
>cd05171 PIKKc_ATM Ataxia telangiectasia mutated (ATM), catalytic domain; The ATM catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. ATM is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). ATM contains a FAT (FRAP, ATM and TRRAP) domain, a catalytic domain, and a FATC domain at the C-terminus. ATM is critical in the response to DNA double strand breaks (DSBs) caused by radiation. It is activated at the site of a DSB and phosphorylates key substrates that trigger pathways that regulate DNA repair and cell cycle checkpoints at the G1/S, S phase, and G2/M transi
Probab=24.42  E-value=83  Score=24.97  Aligned_cols=21  Identities=38%  Similarity=0.657  Sum_probs=16.7

Q ss_pred             EEEEee-CCceEEEEEEcCCCC
Q psy6915           6 VIKCRG-DGRSYMLNLHTPGDF   26 (127)
Q Consensus         6 ~lrvrG-DGr~Y~l~L~~~~~~   26 (127)
                      +|.++| ||+.|.+-++..+..
T Consensus        19 ri~~~gsdG~~y~fl~K~~dDl   40 (279)
T cd05171          19 IITCVGSDGKKYKQLLKGGDDD   40 (279)
T ss_pred             EEEEECCCCCEEEEEecCCCcc
Confidence            566766 999999999987653


No 30 
>cd07762 CYTH-like_Pase_1 Uncharacterized subgroup 1 of the CYTH-like superfamily. Enzymes belonging to the CYTH-like (also known as triphosphate tunnel metalloenzyme (TTM)-like) superfamily hydrolyze triphosphate-containing substrates, require metal cations as cofactors, and have a unique active site located at the center of an eight-stranded antiparallel beta barrel tunnel (the triphosphate tunnel). The name CYTH originated from the gene designation for bacterial class IV adenylyl cyclases (CyaB) and from thiamine triphosphatase. Class IV adenylate cyclases catalyze the conversion of ATP to 3',5'-cyclic AMP (cAMP) and PPi. Thiamine triphosphatase is a soluble cytosolic enzyme which converts thiamine triphosphate to thiamine diphosphate. This domain superfamily also contains RNA triphosphatases, membrane-associated polyphosphate polymerases, tripolyphosphatases, nucleoside triphosphatases, nucleoside tetraphosphatases and other proteins with unknown functions. Proteins of this subgroup
Probab=22.57  E-value=98  Score=22.89  Aligned_cols=22  Identities=23%  Similarity=0.385  Sum_probs=17.3

Q ss_pred             eEEEEEeeCCceEEEEEEcCCC
Q psy6915           4 HLVIKCRGDGRSYMLNLHTPGD   25 (127)
Q Consensus         4 gl~lrvrGDGr~Y~l~L~~~~~   25 (127)
                      +..||||..|..+.++|+....
T Consensus        45 ~~aLRiR~~~~~~~~TlK~~~~   66 (180)
T cd07762          45 HSALRIREKEGKAELTLKVPQE   66 (180)
T ss_pred             CcEEEEEeeCCeEEEEEeeCCC
Confidence            5677888777889999998764


No 31 
>KOG3658|consensus
Probab=22.53  E-value=2.1e+02  Score=26.33  Aligned_cols=54  Identities=24%  Similarity=0.388  Sum_probs=40.5

Q ss_pred             cceEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecCC
Q psy6915           2 YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDI   67 (127)
Q Consensus         2 y~gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~   67 (127)
                      .++|.|+.-.=||.|.|.|+.+...     .+-.+.+++..++    +.+-.++|   |-|.+..+
T Consensus        62 ~~~l~l~F~a~~R~F~l~L~~~~~~-----f~~~fkv~~~dg~----~d~~~~~~---YtG~v~ge  115 (764)
T KOG3658|consen   62 ASTLALTFSALGRHFRLRLHPDTST-----FSHNFKVYTVDGP----IDVDPSHF---YTGHVVGE  115 (764)
T ss_pred             cceEEEEEeecCceEEEEEeccccc-----hhccceEEeecCC----ccccccce---eeeEEecC
Confidence            5688899999999999999988532     2455566776664    77888888   56777664


No 32 
>PF13385 Laminin_G_3:  Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=21.01  E-value=2.5e+02  Score=18.32  Aligned_cols=102  Identities=14%  Similarity=0.129  Sum_probs=46.5

Q ss_pred             ceEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCC--ceeeecceecCCCCCCCc-cCccEE
Q psy6915           3 NHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSK--FFLQSKGRIQDIQNPLDT-DRISSI   79 (127)
Q Consensus         3 ~gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~--F~~t~rG~~~~~~~~ld~-~~I~~i   79 (127)
                      +++.|.+.++| ...+.+......    ...+......+.+ +|..|.+-++.  ...+.-|..+...+.-.. .....-
T Consensus        50 ~~~~l~~~~~~-~l~~~~~~~~~~----~~~~~~~~~~~~~-~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~~~~~~~~  123 (157)
T PF13385_consen   50 GGFGLFINNNG-RLRFYIGNGGGG----NYSFSSDSNLPDN-KWHHLALTYDGSTVTLYVNGELVGSSTIPSNISLNSNG  123 (157)
T ss_dssp             EEEEEEEETTS-EEEEEETTSEEE----SS-EE-BS---TT--EEEEEEEEETTEEEEEETTEEETTCTEESSSSTTSCC
T ss_pred             CEEEEEEECCC-EEEEEEeCCCce----eEEEecCcccCCC-CEEEEEEEEECCeEEEEECCEEEEeEeccCCcCCCCcc
Confidence            35666665444 555555444311    0122222333344 89888877753  333445666553221111 112233


Q ss_pred             EEEeecC--CCeeEEEEEcEEEEEeCCCchhhh
Q psy6915          80 GLSLVDQ--NNGPFQLELDYIGIEYDPNHTEEF  110 (127)
Q Consensus        80 G~~i~~k--q~G~F~LeI~~I~~~~~~~~~~~~  110 (127)
                      .+.|...  ...+|...|+.|.+|...-..+|+
T Consensus       124 ~~~iG~~~~~~~~~~g~i~~~~i~~~aLt~~eI  156 (157)
T PF13385_consen  124 PLFIGGSGGGSSPFNGYIDDLRIYNRALTAEEI  156 (157)
T ss_dssp             EEEESS-STT--B-EEEEEEEEEESS---HHHH
T ss_pred             eEEEeecCCCCCceEEEEEEEEEECccCCHHHc
Confidence            3444432  357899999999999766555543


No 33 
>cd05170 PIKKc_SMG1 Suppressor of morphogenetic effect on genitalia-1 (SMG-1), catalytic domain; The SMG-1 catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. SMG-1 is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). In addition to its catalytic domain, SMG-1 contains a FATC (FRAP, ATM and TRRAP, C-terminal) domain at the C-terminus. SMG-1 plays a critical role in the mRNA surveillance mechanism known as non-sense mediated mRNA decay (NMD). NMD protects the cells from the accumulation of aberrant mRNAs with premature termination codons (PTCs) generated by geno
Probab=20.89  E-value=1.1e+02  Score=24.67  Aligned_cols=22  Identities=23%  Similarity=0.410  Sum_probs=17.2

Q ss_pred             EEEEEee-CCceEEEEEEcCCCC
Q psy6915           5 LVIKCRG-DGRSYMLNLHTPGDF   26 (127)
Q Consensus         5 l~lrvrG-DGr~Y~l~L~~~~~~   26 (127)
                      .+|.++| ||+.|.+-++..+..
T Consensus        18 kri~~~gsDG~~y~fLlK~~dDL   40 (307)
T cd05170          18 KKLAFLGSDGKKYTYLFKGREDL   40 (307)
T ss_pred             eEEEEECCCCCEEEEEecCCCcc
Confidence            3577777 999999999976443


Done!