Query         002685
Match_columns 893
No_of_seqs    366 out of 1366
Neff          4.4 
Searched_HMMs 46136
Date          Fri Mar 29 05:03:32 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/002685.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/002685hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF06087 Tyr-DNA_phospho:  Tyro 100.0 5.5E-54 1.2E-58  483.3   0.6  321  414-797     2-358 (443)
  2 KOG2031 Tyrosyl-DNA phosphodie 100.0 3.1E-48 6.7E-53  428.0  14.3  330  397-804    63-414 (519)
  3 PF00498 FHA:  FHA domain;  Int  99.5 6.8E-14 1.5E-18  118.7   6.6   67   71-174     1-67  (68)
  4 cd00060 FHA Forkhead associate  99.1 5.1E-10 1.1E-14   99.9  10.8   79   60-175    13-92  (102)
  5 TIGR03354 VI_FHA type VI secre  99.0 9.4E-10   2E-14  124.1   7.7   74   61-174    17-94  (396)
  6 KOG1882 Transcriptional regula  98.9 4.8E-10   1E-14  117.4   4.4   97   57-177   182-278 (293)
  7 COG1716 FOG: FHA domain [Signa  98.9 2.8E-09   6E-14  106.6   9.2   68   69-176    89-156 (191)
  8 PLN02927 antheraxanthin epoxid  98.7 2.2E-08 4.7E-13  119.5   7.7   97   55-197   541-649 (668)
  9 smart00240 FHA Forkhead associ  98.7 2.9E-08 6.2E-13   79.8   5.5   52   71-158     1-52  (52)
 10 COG3456 Predicted component of  98.4 3.9E-07 8.4E-12  102.4   6.1   71   64-174    22-95  (430)
 11 KOG1881 Anion exchanger adapto  97.9 3.8E-06 8.2E-11   99.2   1.5   93   55-174   163-255 (793)
 12 KOG0615 Serine/threonine prote  97.8   2E-05 4.3E-10   89.1   4.3   90   61-176    57-147 (475)
 13 KOG1880 Nuclear inhibitor of p  97.2 0.00015 3.4E-09   78.9   1.7   86   56-177    26-111 (337)
 14 TIGR02500 type_III_yscD type I  95.7   0.026 5.7E-07   64.7   7.8   46   49-99      3-50  (410)
 15 PF06087 Tyr-DNA_phospho:  Tyro  95.7  0.0067 1.5E-07   70.1   2.9   76  796-871   156-267 (443)
 16 TIGR01663 PNK-3'Pase polynucle  91.5    0.34 7.3E-06   57.8   6.5   79   61-179    25-103 (526)
 17 KOG0245 Kinesin-like protein [  91.3    0.51 1.1E-05   59.3   7.8   76   62-175   470-547 (1221)
 18 PF08797 HIRAN:  HIRAN domain;   87.1     0.1 2.2E-06   48.8  -1.7   63  712-776    29-91  (107)
 19 KOG2293 Daxx-interacting prote  80.2       4 8.8E-05   48.5   6.9   74   64-174   443-522 (547)
 20 KOG1892 Actin filament-binding  73.1      10 0.00023   48.0   7.9   85   51-176   360-447 (1629)
 21 PF00614 PLDc:  Phospholipase D  72.0       3 6.6E-05   31.1   2.0   24  525-551     3-26  (28)
 22 smart00155 PLDc Phospholipase   70.0     4.4 9.5E-05   29.6   2.5   24  525-551     3-26  (28)
 23 cd00138 PLDc Phospholipase D.   62.0     9.9 0.00022   37.5   4.1   37  525-564   112-149 (176)
 24 PF13091 PLDc_2:  PLD-like doma  58.8     7.3 0.00016   36.2   2.4   36  526-564    74-110 (126)
 25 TIGR02868 CydC thiol reductant  53.6     5.8 0.00013   46.6   1.0   67  191-281   350-416 (529)
 26 cd01764 Urm1 Urm1-like ubuitin  47.7      16 0.00036   34.1   2.8   27  151-177    62-90  (94)
 27 PRK13657 cyclic beta-1,2-gluca  46.0     8.4 0.00018   46.0   0.8   67  191-280   350-416 (588)
 28 TIGR02988 YaaA_near_RecF S4 do  44.8      25 0.00053   29.7   3.2   25  149-174    34-58  (59)
 29 PF01479 S4:  S4 domain;  Inter  44.1      19  0.0004   28.9   2.3   24  148-172    25-48  (48)
 30 PRK13912 nuclease NucT; Provis  42.3      23  0.0005   36.2   3.2   93  450-555    50-145 (177)
 31 smart00363 S4 S4 RNA-binding d  41.2      36 0.00078   26.7   3.5   28  147-175    24-51  (60)
 32 TIGR01687 moaD_arch MoaD famil  40.1      31 0.00067   31.0   3.3   28  149-176    56-83  (88)
 33 cd00754 MoaD Ubiquitin domain   38.1      33 0.00073   29.9   3.1   27  148-176    49-75  (80)
 34 COG1188 Ribosome-associated he  33.5      49  0.0011   31.9   3.6   28  150-179    35-62  (100)
 35 cd00165 S4 S4/Hsp/ tRNA synthe  32.7      53  0.0011   26.5   3.3   28  147-175    24-51  (70)
 36 PRK01777 hypothetical protein;  32.4      46 0.00099   31.5   3.2   28  146-175    47-74  (95)
 37 TIGR02857 CydD thiol reductant  31.9      18 0.00038   42.6   0.5   52  190-262   336-387 (529)
 38 COG2501 S4-like RNA binding pr  28.6      56  0.0012   29.9   2.9   26  147-174    35-61  (73)
 39 PRK11507 ribosome-associated p  27.1      69  0.0015   29.1   3.2   26  147-174    35-61  (70)
 40 cd03292 ABC_FtsE_transporter F  26.9      34 0.00073   35.1   1.5   48  190-258    15-62  (214)
 41 TIGR00958 3a01208 Conjugate Tr  26.3      22 0.00047   43.9  -0.0   53  190-263   495-547 (711)
 42 PF09014 Sushi_2:  Beta-2-glyco  25.9 1.1E+02  0.0024   28.8   4.4   39  151-195    13-54  (85)
 43 PF02824 TGS:  TGS domain;  Int  25.8      58  0.0013   27.9   2.5   27  147-175    33-59  (60)
 44 cd01666 TGS_DRG_C TGS_DRG_C:    25.3      57  0.0012   29.6   2.4   21  153-175    54-74  (75)
 45 TIGR01193 bacteriocin_ABC ABC-  25.1      27 0.00059   42.8   0.5   51  191-262   489-539 (708)
 46 PF09565 RE_NgoFVII:  NgoFVII r  24.8      60  0.0013   36.6   3.0   27  526-552    80-107 (296)
 47 COG1124 DppF ABC-type dipeptid  24.6      26 0.00056   38.6   0.1   52  191-263    22-73  (252)
 48 PF02597 ThiS:  ThiS family;  I  23.8      57  0.0012   28.1   2.1   28  149-176    44-72  (77)
 49 PRK06944 sulfur carrier protei  23.7      86  0.0019   26.6   3.2   26  151-176    33-60  (65)
 50 TIGR02673 FtsE cell division A  23.6      47   0.001   34.1   1.8   48  191-259    17-64  (214)
 51 PRK10348 ribosome-associated h  23.1 1.2E+02  0.0027   30.5   4.5   29  148-178    33-61  (133)
 52 COG1559 Aminodeoxychorismate l  23.0 1.6E+02  0.0035   33.9   6.0   94  750-846    76-174 (342)
 53 PRK11174 cysteine/glutathione   22.9      29 0.00063   41.5   0.1   50  190-261   364-413 (588)
 54 PRK11176 lipid transporter ATP  22.8      28 0.00062   41.4   0.0   50  191-261   358-407 (582)
 55 TIGR00960 3a0501s02 Type II (G  22.8      48   0.001   34.2   1.7   49  191-260    18-66  (216)
 56 TIGR02982 heterocyst_DevA ABC   22.7      37 0.00079   35.3   0.8   48  191-259    20-67  (220)
 57 PRK10790 putative multidrug tr  22.5      38 0.00082   40.6   1.0   51  191-262   356-406 (592)
 58 PF13275 S4_2:  S4 domain; PDB:  22.4      42 0.00091   29.9   1.0   27  146-174    30-57  (65)
 59 PRK06437 hypothetical protein;  22.3 1.1E+02  0.0024   26.8   3.6   26  149-176    37-62  (67)
 60 TIGR03873 F420-0_ABC_ATP propo  21.5      53  0.0012   34.9   1.8   33  213-259    31-63  (256)
 61 TIGR01682 moaD molybdopterin c  21.5 1.1E+02  0.0023   27.3   3.4   24  151-176    52-75  (80)
 62 cd00565 ThiS ThiaminS ubiquiti  21.3      96  0.0021   26.6   3.0   27  150-176    32-60  (65)
 63 PRK10908 cell division protein  21.2      46 0.00099   34.5   1.2   49  190-259    16-64  (222)
 64 PF14451 Ub-Mut7C:  Mut7-C ubiq  21.1   1E+02  0.0022   28.5   3.2   26  149-176    50-75  (81)
 65 PRK11160 cysteine/glutathione   20.6      38 0.00081   40.7   0.4   51  191-262   355-405 (574)
 66 TIGR03797 NHPM_micro_ABC2 NHPM  20.5      34 0.00074   41.8   0.0   52  190-262   467-518 (686)
 67 cd03256 ABC_PhnC_transporter A  20.2      40 0.00088   35.2   0.5   49  190-259    15-63  (241)
 68 PRK13644 cbiO cobalt transport  20.2      59  0.0013   35.3   1.8   48  191-259    17-64  (274)
 69 PRK06488 sulfur carrier protei  20.1      82  0.0018   27.1   2.3   25  152-176    34-60  (65)

No 1  
>PF06087 Tyr-DNA_phospho:  Tyrosyl-DNA phosphodiesterase;  InterPro: IPR010347 Covalent intermediates between topoisomerase I and DNA can become dead-end complexes that lead to cell death. Tyrosyl-DNA phosphodiesterase can hydrolyse the bond between topoisomerase I and DNA [].; GO: 0008081 phosphoric diester hydrolase activity, 0006281 DNA repair, 0005634 nucleus; PDB: 3SQ8_A 3SQ5_B 3SQ3_A 1Q32_D 3SQ7_A 1QZQ_A 1RGU_B 1RG2_B 1MU9_B 1RFI_B ....
Probab=100.00  E-value=5.5e-54  Score=483.34  Aligned_cols=321  Identities=24%  Similarity=0.383  Sum_probs=209.1

Q ss_pred             CcEEEEEeccCCCC-CCCCCccccccccCCchhHhHHHHhhcccchhhhhhhcCCC-C-CCcEEEEEccCccccccCcCC
Q 002685          414 KKFYLNRLEFMDLT-SLNYDVISLPELLYPVESISRMFIATFTSDILWFMSYCEIP-S-HLPVTIACHNTERCWSTSADK  490 (893)
Q Consensus       414 spFyLnrI~~lp~~-~~n~dtISL~DLLgP~~~L~~allfnF~~DidWLLs~f~~~-r-~lpVtIVvh~~~~~wk~e~~~  490 (893)
                      .+|++|++.+++.. ..+.++|||+|||++ ++|++||+||||+|++|||++|+.. + .+||+++.+......    ..
T Consensus         2 ~~~~~~~i~~~~~~~~~~~~~itl~dil~~-~~l~~~~~~nf~~D~~wll~~~~~~~~~~~~i~~v~g~~~~~~----~~   76 (443)
T PF06087_consen    2 FKLYLTTIYDLPPRSNNNPDTITLEDILGD-PDLEEALLFNFMIDLDWLLSQFPPSTRKNIPITIVHGTKDPPD----KR   76 (443)
T ss_dssp             CEEEEBTTTTS-GG--GTTTEE-HHHHCSG-TTEEEEEEE-SSEEHHHHHCCS-CCGTTCEEEEEECTSEEHHH----HH
T ss_pred             cceEEeeecCCCccccCCCCcEeHHHHcCC-ccHHHHHhhhheeeHHHHHHhCCHhhcccceEEEEeCCCcchh----hh
Confidence            36899999999987 347999999999997 6899999999999999999999842 2 468888644222211    11


Q ss_pred             cCCCCCCCCCCeEEEcCCCCccccccccccCCCcccccceEEEEEeCCc-EEEEEeCCCCCHHhhccccCeEEeecC-CC
Q 002685          491 RTSVPYPNFPNLVLVFPPFPESVAFGENCKKQGIACHHPKLFVLQREDS-IRVIVTSANLGAKQWNAVTNTVWWQDF-PR  568 (893)
Q Consensus       491 rl~~~~~~~pNv~li~PpMPe~I~FG~d~~~~g~GtHHSKMmIL~Y~D~-LRVVI~TANLi~~DW~~~TQ~VWiQDF-Pr  568 (893)
                      .+......++|+++|.|+||  ++||         |||||||||+|+|+ ||||||||||++.||+.+||+||+||| |+
T Consensus        77 ~~~~~~~~~~nv~~~~~~mp--~~~g---------~hHsKm~ll~y~~~~lRVvI~TaNl~~~Dw~~~~q~vw~~d~lP~  145 (443)
T PF06087_consen   77 EIRQQAAIYPNVKLIFPPMP--IPFG---------THHSKMMLLFYEDGSLRVVIPTANLTPYDWNNKTQGVWIQDFLPR  145 (443)
T ss_dssp             HHHHHHCCHTTEEEEEE-----STT-----------B--EEEEEEETTCEEEEEEESS-BSHHHHCSSB-EEEE---E-B
T ss_pred             hhhhhcccCCCeEEEccCCC--cccc---------cccceeEEEEeCCccEEEEEECCCCCHHHHCCcceeEEEecccCc
Confidence            11112347899999999999  4788         99999999999988 999999999999999999999999999 99


Q ss_pred             CCCCCccccccCCCcccccCCCCCcHHHHHHHHHHHhccCCCchhHHHHhhcccccccccEEEEEecCCcccCCCCCCcC
Q 002685          569 RCAPDYLSLFVQTPVEEINQDSRSDFTAQLASFMASLVIDVPSQAHWIVELTKYDFASAAGHLVASVPGIHSYRNPNLSE  648 (893)
Q Consensus       569 l~~~~~~~~~~~~~~~e~~~~s~sdFk~dL~~YL~sy~~~~Ps~~~~i~~L~kyDFSsakv~LVaSVPG~H~g~~~~~~~  648 (893)
                      ++.....           ....+++|+.+|++||.+|+.  +....|++.|++||||+++|+||+||||+|. .+.    
T Consensus       146 ~~~~~~~-----------~~~~~~~F~~dL~~yL~~y~~--~~~~~~~~~l~~yDFS~~~v~lV~SvPG~h~-~~~----  207 (443)
T PF06087_consen  146 LPSSKSS-----------SEESGSRFKKDLVAYLNSYGK--SPLDKLIERLRKYDFSSARVHLVASVPGKHK-EDK----  207 (443)
T ss_dssp             -ECTS-S-------------SSTTHHHHHHHHHHHTT----HHHHHCHHHHHTEE-CCGTSEEEEE-SECCC-GGG----
T ss_pred             ccccccc-----------cCCCCCchHHHHHHHHHHhCC--cchhHHHHHHHhcCCccCceEEEeccCcccc-CCC----
Confidence            9752111           135789999999999999974  3366899999999999999999999999999 222    


Q ss_pred             CcccCCCCCCcccccccccccceeccc----cCcceeEEeecccccchhhhhH-HHhhhhh-hccchhhh--hh-----c
Q 002685          649 STYSKPVDHGAIWSSDGKFLGSVETSV----VGLSHLFRTAADSNGTQIKKLA-AFLGKSF-SAHGMLKI--LL-----R  715 (893)
Q Consensus       649 ~~~G~~rl~n~~~~~~~s~LgSv~~s~----~gls~l~~q~~DSs~asLg~~~-~~L~k~~-~~~~~~E~--v~-----~  715 (893)
                      ..||+++|        .+.|.......    ....+++.|+  ||+++++... +||...+ ..+.....  ..     .
T Consensus       208 ~~~G~~~L--------~~~L~~~~~~~~~~~~~~~~~~~Q~--SSIGs~~~~~~~Wl~~~f~~sl~~~~~~~~~~~~~~~  277 (443)
T PF06087_consen  208 DKWGHMRL--------RKVLKRLGLPSNKDKDKESDIVCQF--SSIGSLGSSPKDWLTSEFLTSLYPASFSSPSTPSSKS  277 (443)
T ss_dssp             GGSHHHHH--------HHHHHHCCTT---TTCCCCEEEEE---SBB---SSSTTTTTTTHHHHHCCHHCCT------HHH
T ss_pred             cchhHHHH--------HHHHHhccccccCCcCCCCeEEEEc--ccccccCcchhhhHHHHHHHHHhhccccccccccccc
Confidence            26997655        44444433332    1246788888  8898887555 3765433 22221100  00     0


Q ss_pred             cCCCCCCCCCCeEEEecCCCccc-------CCce-----------eeccCcchhcccccCCCCCcccceeecccchHHHH
Q 002685          716 RNTNVPADANAVSVLVPNPDELS-------DGDC-----------IQLGFMPRDIAKWVSPLWDIGFIRFSGFISRDEVL  777 (893)
Q Consensus       716 sn~n~P~~~~~~~IIfPT~deVr-------sg~s-----------kQL~fL~~~l~kW~s~~~D~~~~~f~g~~~p~~~l  777 (893)
                      .....+...++++|||||.++|+       +|++           .+..||+++||+|.++..                 
T Consensus       278 ~~~~~~~~~~~~~IvfPT~e~Vr~S~~G~~~ggsi~~~~~~~~~~~~~~~l~~~~~~w~~~~~-----------------  340 (443)
T PF06087_consen  278 SSSQQENNRPPLKIVFPTVEEVRNSPEGYNGGGSIPFKYKWWEPNFPQEWLRPYFHKWYASDD-----------------  340 (443)
T ss_dssp             HHHHCCHHTEEEEEE--BCHHHCTSTTGGGGGGGSB--HC--GHHCCHHHHHHCCE-EEEC-T-----------------
T ss_pred             cccccccCCCCceEECCCHHHHhhCccCCcCceeEEecchhccccchHHHHHHHHhhhccccc-----------------
Confidence            00011123578999999999999       4444           223689999999998833                 


Q ss_pred             HHHhhcccceeEEEEeeccC
Q 002685          778 AAALEGISKKVQLILHVSQG  797 (893)
Q Consensus       778 ~~a~g~~~~~~ql~~~~~~~  797 (893)
                        ..||..+.+|+|+|++-.
T Consensus       341 --~~~R~~~~pH~K~y~~~~  358 (443)
T PF06087_consen  341 --PSGRSRAPPHIKTYMRFS  358 (443)
T ss_dssp             --GCTTTTS-B--EEEEEEE
T ss_pred             --cCCCCCcCcceEEEEEec
Confidence              678999999999999765


No 2  
>KOG2031 consensus Tyrosyl-DNA phosphodiesterase [Replication, recombination and repair]
Probab=100.00  E-value=3.1e-48  Score=428.02  Aligned_cols=330  Identities=19%  Similarity=0.275  Sum_probs=244.0

Q ss_pred             CCCCCcccCCCCCCCCCCcEEEEEeccC-CCCCCCCCccccccccCC-chhHhHHHHhhcccchhhhhhhcC-CCCCCcE
Q 002685          397 GSMDKSKSLGSSCSPPGKKFYLNRLEFM-DLTSLNYDVISLPELLYP-VESISRMFIATFTSDILWFMSYCE-IPSHLPV  473 (893)
Q Consensus       397 ~~~~~~~~~~~~~~~~gspFyLnrI~~l-p~~~~n~dtISL~DLLgP-~~~L~~allfnF~~DidWLLs~f~-~~r~lpV  473 (893)
                      .+.+...+...+|.|-+  ++++.+.+. -..+.+...+++.+||.+ .+++..+++||||+|++||+.+|+ .++..|+
T Consensus        63 ~e~~~~~~~~~~~~p~~--~~ft~v~~~s~~~~~s~~s~sl~~il~~i~g~~v~silfsfmvdid~Lv~~y~~~~~~~~i  140 (519)
T KOG2031|consen   63 QEKDDSERILTNDNPKG--AVFTTVKGDSVPRYDSMGSVSLMEILADIFGTPVNSILFSFMVDIDWLVGQYPPSVRIKPI  140 (519)
T ss_pred             cccccHHHHhccCCccc--ccccccccccccccCcccchHHHHHHHHhhCCchhheEEEEEeEHHHHHhhCcchhccCce
Confidence            34444457777887776  666776662 222334567787777763 246899999999999999999998 4688888


Q ss_pred             EEEEccCccccccCcCCcCCCCCCCCCCeEEEcCCCCccccccccccCCCcccccceEEEEEeCCcEEEEEeCCCCCHHh
Q 002685          474 TIACHNTERCWSTSADKRTSVPYPNFPNLVLVFPPFPESVAFGENCKKQGIACHHPKLFVLQREDSIRVIVTSANLGAKQ  553 (893)
Q Consensus       474 tIVvh~~~~~wk~e~~~rl~~~~~~~pNv~li~PpMPe~I~FG~d~~~~g~GtHHSKMmIL~Y~D~LRVVI~TANLi~~D  553 (893)
                      |++ |++..      ..++........+++++.+.||  ||||         |||+|||+|+|++++|||||||||++.|
T Consensus       141 ~l~-~G~~d------~~~~~~~~K~~~l~~~~~~~Lp--ipF~---------thHtKm~~l~y~~G~rvvv~taNl~~~D  202 (519)
T KOG2031|consen  141 TLV-HGEPD------EARLLAQTKAPILVTVKLASLP--IPFG---------THHTKMIILFYEEGCRVVVHTANLIHDD  202 (519)
T ss_pred             EEE-ecCCc------hHHHHhhhhccceeeeeccccc--cccc---------ccccceEEEeccCccEEEEecCCcceec
Confidence            875 55431      1122223344559999999999  8999         9999999999999999999999999999


Q ss_pred             hccccCeEEeecCCCCCCCCccccccCCCcccccCCCCCcHHHHHHHHHHHhccCCCchhHHHHhhcccccccccEEEEE
Q 002685          554 WNAVTNTVWWQDFPRRCAPDYLSLFVQTPVEEINQDSRSDFTAQLASFMASLVIDVPSQAHWIVELTKYDFASAAGHLVA  633 (893)
Q Consensus       554 W~~~TQ~VWiQDFPrl~~~~~~~~~~~~~~~e~~~~s~sdFk~dL~~YL~sy~~~~Ps~~~~i~~L~kyDFSsakv~LVa  633 (893)
                      |+++||++|++++++....              ....++.|+.||++||.+|.  .|.++.|++.|++||||.++|+||+
T Consensus       203 w~~ktQ~~w~sp~~~~~~~--------------~g~~~~~Fk~DLi~YL~~Y~--~~~l~~~i~~lkk~DfS~i~v~fIg  266 (519)
T KOG2031|consen  203 WNNKTQGFWCSPLLKIGDK--------------KGVSPTGFKQDLIEYLNSYR--LPQLKEWIASLKKVDFSAINVRFIG  266 (519)
T ss_pred             ccccccceeecCCcccccC--------------CCCCCCchHHHHHHHHHHhc--cchhHHHHHHHHhcchhhceEEEEe
Confidence            9999999999976665421              13578999999999999995  4888999999999999999999999


Q ss_pred             ecCCcccCCCCCCcCCcccCCCCCCcccccccccccceeccc-cCcceeEEeecccccchhhhhH-HHhhhhh-hccchh
Q 002685          634 SVPGIHSYRNPNLSESTYSKPVDHGAIWSSDGKFLGSVETSV-VGLSHLFRTAADSNGTQIKKLA-AFLGKSF-SAHGML  710 (893)
Q Consensus       634 SVPG~H~g~~~~~~~~~~G~~rl~n~~~~~~~s~LgSv~~s~-~gls~l~~q~~DSs~asLg~~~-~~L~k~~-~~~~~~  710 (893)
                      ||||++.+....    +|||.+|        ++.|.-..++. ..-...+-|.  ||+++||+.- +|+...+ ..+.  
T Consensus       267 StPG~f~gs~~~----~WGh~kL--------~kiL~~~~~~~~~~r~~~v~q~--sS~gsl~~~~~~~~~~~f~~~l~--  330 (519)
T KOG2031|consen  267 STPGKFQGSGLL----SWGHNKL--------KKILKEHAASPYLERTPVVGQS--SSIGSLGSLWSAWFIGDFVESLA--  330 (519)
T ss_pred             ecCCcccCcccc----cccHHHH--------HHHHHhhccCcccccCceeeee--eccccccchhhhhhhhhhccchh--
Confidence            999998877544    6999776        22332211110 0111233344  5566666443 4444333 3221  


Q ss_pred             hhhhccCCCCCCCCCCeEEEecCCCccc-------CCce---------eeccCcchhcccccCCCCCcccceeecccchH
Q 002685          711 KILLRRNTNVPADANAVSVLVPNPDELS-------DGDC---------IQLGFMPRDIAKWVSPLWDIGFIRFSGFISRD  774 (893)
Q Consensus       711 E~v~~sn~n~P~~~~~~~IIfPT~deVr-------sg~s---------kQL~fL~~~l~kW~s~~~D~~~~~f~g~~~p~  774 (893)
                          +.++..-..++.+.+||||.++||       +|+|         +|. ||..|||||.+.                
T Consensus       331 ----kdt~~~gk~~~~~yiIfPTveeVrtS~~G~~~Ggsipy~~~~~~kq~-~lk~y~~kW~A~----------------  389 (519)
T KOG2031|consen  331 ----KDTTPPGKLRPPFYIIFPTVEEVRTSLLGYAGGGSLPYGKNTNEKQP-WLKKYLCKWKAM----------------  389 (519)
T ss_pred             ----hccCCCCCCCCCeeEEcccHHHhhccccccccCceecccchhhhhhH-HHHHHHHhhhhh----------------
Confidence                222222223466999999999999       6666         555 999999999999                


Q ss_pred             HHHHHHhhcccceeEEEEeeccCCChhHHH
Q 002685          775 EVLAAALEGISKKVQLILHVSQGPKFSDIS  804 (893)
Q Consensus       775 ~~l~~a~g~~~~~~ql~~~~~~~~~f~~~~  804 (893)
                           +.||+++++|+++||+-.++|+.|+
T Consensus       390 -----~s~R~ramPHiKtYmr~~~d~q~l~  414 (519)
T KOG2031|consen  390 -----DSRRSRAMPHIKTYMRFNLDDQKLA  414 (519)
T ss_pred             -----hhhccccCCcceeeeeecCCCCEEE
Confidence                 8899999999999999999998874


No 3  
>PF00498 FHA:  FHA domain;  InterPro: IPR000253 The forkhead-associated (FHA) domain [] is a phosphopeptide recognition domain found in many regulatory proteins. It displays specificity for phosphothreonine-containing epitopes but will also recognise phosphotyrosine with relatively high affinity. It spans approximately 80-100 amino acid residues folded into an 11-stranded beta sandwich, which sometimes contain small helical insertions between the loops connecting the strands [].  To date, genes encoding FHA-containing proteins have been identified in eubacterial and eukaryotic but not archaeal genomes. The domain is present in a diverse range of proteins, such as kinases, phosphatases, kinesins, transcription factors, RNA-binding proteins and metabolic enzymes which partake in many different cellular processes - DNA repair, signal transduction, vesicular transport and protein degradation are just a few examples.; GO: 0005515 protein binding; PDB: 1LGQ_B 1LGP_A 2CSW_A 2PIE_A 3FM8_A 3MDB_B 3GQS_B 1UHT_A 1WLN_A 3POA_A ....
Probab=99.47  E-value=6.8e-14  Score=118.68  Aligned_cols=67  Identities=43%  Similarity=0.762  Sum_probs=57.8

Q ss_pred             EEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcccccccccccccccc
Q 002685           71 YSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDEGFSRVMVSLNG  150 (893)
Q Consensus        71 ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~~L~~~~~S~NG  150 (893)
                      |+|||++ .||++++++.|||+||.|.++..        +            .|      +|+|      +    .|+||
T Consensus         1 ~~iGR~~-~~di~l~~~~iSr~Ha~i~~~~~--------~------------~~------~i~d------~----~s~ng   43 (68)
T PF00498_consen    1 VTIGRSP-DCDIVLPDPSISRRHARISFDDD--------G------------QF------YIED------L----GSTNG   43 (68)
T ss_dssp             EEEESST-TSSEEETSTTSSTTSEEEEEETT--------E------------EE------EEEE------S----SSSS-
T ss_pred             CEEcCCC-CCCEEECCHheeeeeeEEEEece--------e------------eE------EEEe------C----CCCCc
Confidence            6899998 99999999999999999996532        1            14      7888      3    79999


Q ss_pred             eEECCEEccCCCeeeCCCCCEEEE
Q 002685          151 VFVNGIRLKSGIVRELNAGDEVLF  174 (893)
Q Consensus       151 TfVNG~RI~k~~~~~L~~GD~I~~  174 (893)
                      |||||+++.+++.++|.+||+|.|
T Consensus        44 t~vng~~l~~~~~~~L~~gd~i~~   67 (68)
T PF00498_consen   44 TFVNGQRLGPGEPVPLKDGDIIRF   67 (68)
T ss_dssp             EEETTEEESSTSEEEE-TTEEEEE
T ss_pred             EEECCEEcCCCCEEECCCCCEEEc
Confidence            999999999999999999999998


No 4  
>cd00060 FHA Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53,  Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Probab=99.11  E-value=5.1e-10  Score=99.87  Aligned_cols=79  Identities=38%  Similarity=0.558  Sum_probs=66.0

Q ss_pred             ceeEEecCCCCEEEcCCCCCC-CEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcccc
Q 002685           60 VDSLTLEPDRPYSIGRASINC-DFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEED  138 (893)
Q Consensus        60 ~~~i~L~~~~~ytIGR~~~~c-Divi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~  138 (893)
                      ...+.|.++..++|||++ .| |++|.++.|||+||+|.++..       ++             +      .+.|    
T Consensus        13 ~~~~~l~~~~~~~iGr~~-~~~~i~l~~~~iS~~H~~i~~~~~-------~~-------------~------~~~~----   61 (102)
T cd00060          13 GRRYYLDPGGTYTIGRDS-DNCDIVLDDPSVSRRHAVIRYDGD-------GG-------------V------VLID----   61 (102)
T ss_pred             eeEEEECCCCeEEECcCC-CcCCEEcCCCCeeCcceEEEEcCC-------CC-------------E------EEEE----
Confidence            458888885689999999 88 999999999999999997642       11             2      3444    


Q ss_pred             cccccccccccceEECCEEccCCCeeeCCCCCEEEEE
Q 002685          139 EGFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       139 ~~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                      .      .|.|||||||+++.++....|.+||+|.|+
T Consensus        62 ~------~s~~g~~vn~~~~~~~~~~~l~~gd~i~ig   92 (102)
T cd00060          62 L------GSTNGTFVNGQRVSPGEPVRLRDGDVIRLG   92 (102)
T ss_pred             C------CCCCCeEECCEECCCCCcEECCCCCEEEEC
Confidence            2      799999999999998778999999999994


No 5  
>TIGR03354 VI_FHA type VI secretion system FHA domain protein. Members of this protein family are FHA (forkhead-associated) domain-containing proteins that are part of type VI secretion loci in a considerable number of bacteria, most of which are known pathogens. Species include Pseudomonas aeruginosa PAO1, Aeromonas hydrophila, Yersinia pestis, Burkholderia mallei, etc.
Probab=98.96  E-value=9.4e-10  Score=124.13  Aligned_cols=74  Identities=35%  Similarity=0.612  Sum_probs=64.2

Q ss_pred             eeEEecCCCCEEEcCCCCCCCEEecCCc--cccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcccc
Q 002685           61 DSLTLEPDRPYSIGRASINCDFTFDNRL--VSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEED  138 (893)
Q Consensus        61 ~~i~L~~~~~ytIGR~~~~cDivi~d~~--VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~  138 (893)
                      ..+.+..+ .++|||++ +||++|+|+.  ||++||+|.++         +|            .|      +|+|    
T Consensus        17 ~~~~f~~~-~~~IGR~~-~~d~~l~d~~~~VS~~Ha~I~~~---------~g------------~~------~l~D----   63 (396)
T TIGR03354        17 AQKTFGTN-GGTIGRSE-DCDWVLPDPERHVSGRHARIRYR---------DG------------AY------LLTD----   63 (396)
T ss_pred             eEEEECCC-CEEEecCC-CCCEEeCCCCCCcchhhcEEEEE---------CC------------EE------EEEE----
Confidence            36777764 59999999 9999999998  99999999954         23            25      7889    


Q ss_pred             cccccccccccceEEC--CEEccCCCeeeCCCCCEEEE
Q 002685          139 EGFSRVMVSLNGVFVN--GIRLKSGIVRELNAGDEVLF  174 (893)
Q Consensus       139 ~~L~~~~~S~NGTfVN--G~RI~k~~~~~L~~GD~I~~  174 (893)
                             .|+||||||  |.+|.++..+.|.+||+|.|
T Consensus        64 -------lStNGT~VN~sg~~l~~~~~~~L~~GD~I~i   94 (396)
T TIGR03354        64 -------LSTNGVFLNGSGSPLGRGNPVRLEQGDRLRL   94 (396)
T ss_pred             -------CCCCCeEECCCCCCCCCCCceEcCCCCEEEE
Confidence                   599999999  99999988889999999999


No 6  
>KOG1882 consensus Transcriptional regulator SNIP1, contains FHA domain [Signal transduction mechanisms]
Probab=98.95  E-value=4.8e-10  Score=117.41  Aligned_cols=97  Identities=25%  Similarity=0.254  Sum_probs=78.3

Q ss_pred             CCcceeEEecCCCCEEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcc
Q 002685           57 GSPVDSLTLEPDRPYSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEE  136 (893)
Q Consensus        57 g~~~~~i~L~~~~~ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~  136 (893)
                      ++++..+.+.....|.+||...-.||-|++|..|+|||+|+|......-  .||+            ..+++..+|.|  
T Consensus       182 ~e~l~~l~iHrqs~yL~gRerkIaDi~idhpScSKQHaviQyR~v~~~r--~dGt------------~grrvkpYiiD--  245 (293)
T KOG1882|consen  182 YEVLPVLYIHRQSCYLDGRERKIADIPIDHPSCSKQHAVIQYRLVEFTR--ADGT------------VGRRVKPYIID--  245 (293)
T ss_pred             CcccchheeeeeeeeecCceeeeeccCCCCccccccceeeeeeeccccc--CCCc------------cceeeeeEEEe--
Confidence            4444566777778899999888899999999999999999997654322  4652            33445557889  


Q ss_pred             cccccccccccccceEECCEEccCCCeeeCCCCCEEEEEeC
Q 002685          137 EDEGFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLFVCR  177 (893)
Q Consensus       137 ~~~~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~~g  177 (893)
                          |    +|+||||+|..+|...++.+|..+|+|.|+..
T Consensus       246 ----L----gS~NgTfLNnk~IepqRYyEL~ekDvlkfgfs  278 (293)
T KOG1882|consen  246 ----L----GSGNGTFLNNKVIEPQRYYELREKDVLKFGFS  278 (293)
T ss_pred             ----c----CCCCcceecCcccCchheeeeecCceeeeccc
Confidence                4    99999999999999999999999999999543


No 7  
>COG1716 FOG: FHA domain [Signal transduction mechanisms]
Probab=98.94  E-value=2.8e-09  Score=106.59  Aligned_cols=68  Identities=32%  Similarity=0.510  Sum_probs=57.8

Q ss_pred             CCEEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcccccccccccccc
Q 002685           69 RPYSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDEGFSRVMVSL  148 (893)
Q Consensus        69 ~~ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~~L~~~~~S~  148 (893)
                      ..++|||++ ++|+++++..|||+||.|..+.        +             ++      +++|      |    .|+
T Consensus        89 ~~~tigr~~-~~~i~~~~~~vSR~Ha~l~~~~--------~-------------~~------~~~d------~----~S~  130 (191)
T COG1716          89 PVTTIGRDP-DNDIVLDDDVVSRRHAELRREG--------N-------------EV------FLED------L----GST  130 (191)
T ss_pred             ceEEeccCC-CCCEEcCCCccccceEEEEEeC--------C-------------ce------EEEE------C----CCC
Confidence            479999998 9999999999999999998432        2             23      6778      4    899


Q ss_pred             cceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          149 NGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       149 NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                      |||||||.++...  +.|.+||.|.|+.
T Consensus       131 nGt~vn~~~v~~~--~~l~~gd~i~i~~  156 (191)
T COG1716         131 NGTYVNGEKVRQR--VLLQDGDVIRLGG  156 (191)
T ss_pred             cceEECCeEccCc--EEcCCCCEEEECc
Confidence            9999999999973  6799999999943


No 8  
>PLN02927 antheraxanthin epoxidase/zeaxanthin epoxidase
Probab=98.70  E-value=2.2e-08  Score=119.52  Aligned_cols=97  Identities=23%  Similarity=0.390  Sum_probs=71.4

Q ss_pred             CCCCcceeEEec--CCCCEEEcCCCCCCCE-----EecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhc
Q 002685           55 ATGSPVDSLTLE--PDRPYSIGRASINCDF-----TFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKK  127 (893)
Q Consensus        55 ~~g~~~~~i~L~--~~~~ytIGR~~~~cDi-----vi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~  127 (893)
                      ..+...+.+.|.  .+.+++|||.+ +||+     ||+++.||++||+|.+         .++            .|   
T Consensus       541 ~~~~~~~~~~l~~~~~~p~~iG~~~-~~~~~~~~i~i~~~~vS~~Ha~i~~---------~~~------------~~---  595 (668)
T PLN02927        541 DDCCVSETLCLTKDEDQPCIVGSEP-DQDFPGMRIVIPSSQVSKMHARVIY---------KDG------------AF---  595 (668)
T ss_pred             CCCcccceeeeecCCCCCeEecCCC-CcCCCCceEEecCCccChhHeEEEE---------ECC------------EE---
Confidence            334444567772  23689999999 9996     9999999999999984         344            25   


Q ss_pred             ccccccCcccccccccccccccceEECCEE---cc--CCCeeeCCCCCEEEEEeCCCccccccceeEEEEEEEEE
Q 002685          128 RDKSLEDEEEDEGFSRVMVSLNGVFVNGIR---LK--SGIVRELNAGDEVLFVCRDESLCRLQTRIGFLILGIIF  197 (893)
Q Consensus       128 ~~~~i~D~~~~~~L~~~~~S~NGTfVNG~R---I~--k~~~~~L~~GD~I~~~~g~~~~~~~~~~igf~v~~i~~  197 (893)
                         +|+|      |    +|+|||||||.+   |.  .+.++.|++||+|.|+-.++        +.|.|+.+..
T Consensus       596 ---~~~D------l----~S~nGT~v~~~~~~r~~~~p~~~~~l~~~d~I~~g~~~~--------~~fr~~~~~~  649 (668)
T PLN02927        596 ---FLMD------L----RSEHGTYVTDNEGRRYRATPNFPARFRSSDIIEFGSDKK--------AAFRVKVIRK  649 (668)
T ss_pred             ---EEEE------C----CCCCccEEeCCCCceEecCCCCceEeCCCCEEEeCCCcc--------eeEEEEeecC
Confidence               8999      4    999999996665   54  45568899999999944322        2466665543


No 9  
>smart00240 FHA Forkhead associated domain. Found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain.
Probab=98.68  E-value=2.9e-08  Score=79.79  Aligned_cols=52  Identities=44%  Similarity=0.800  Sum_probs=42.8

Q ss_pred             EEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcccccccccccccccc
Q 002685           71 YSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDEGFSRVMVSLNG  150 (893)
Q Consensus        71 ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~~L~~~~~S~NG  150 (893)
                      ++|||.+..||++++++.||+.||.|.++..        +            .|      +++|      +    .|+||
T Consensus         1 ~~iGr~~~~~~i~~~~~~vs~~H~~i~~~~~--------~------------~~------~i~d------~----~s~~g   44 (52)
T smart00240        1 VTIGRSSEDCDIQLPGPSISRRHAEIVYDGG--------G------------RF------YLID------L----GSTNG   44 (52)
T ss_pred             CEeCCCCCCCCEEeCCCCcchhHcEEEECCC--------C------------eE------EEEE------C----CCCCC
Confidence            4799998679999999999999999997532        1            13      6778      2    67999


Q ss_pred             eEECCEEc
Q 002685          151 VFVNGIRL  158 (893)
Q Consensus       151 TfVNG~RI  158 (893)
                      |||||++|
T Consensus        45 t~vng~~v   52 (52)
T smart00240       45 TFVNGKRI   52 (52)
T ss_pred             eeECCEEC
Confidence            99999986


No 10 
>COG3456 Predicted component of the type VI protein secretion system, contains a FHA domain [Intracellular trafficking, secretion, and vesicular    transport; Signal transduction mechanisms]
Probab=98.38  E-value=3.9e-07  Score=102.45  Aligned_cols=71  Identities=42%  Similarity=0.753  Sum_probs=59.5

Q ss_pred             EecCCCCEEEcCCCCCCCEEecC--CccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCccccccc
Q 002685           64 TLEPDRPYSIGRASINCDFTFDN--RLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDEGF  141 (893)
Q Consensus        64 ~L~~~~~ytIGR~~~~cDivi~d--~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~~L  141 (893)
                      .+.. +...|||++ +||-+|+|  +.||++||.|.++         +|.            |      +|+|       
T Consensus        22 ~f~~-~~g~IGrs~-dcdW~i~D~~~~VS~~Hc~I~~~---------dg~------------f------~L~D-------   65 (430)
T COG3456          22 LFDR-GGGVIGRSP-DCDWQIDDPERFVSKQHCTISYR---------DGG------------F------CLTD-------   65 (430)
T ss_pred             hhhc-CCcccccCC-CCCccccCcccccchhheEEEec---------CCe------------E------EEEe-------
Confidence            3433 347999999 99999997  6899999999953         442            6      8899       


Q ss_pred             ccccccccceEECCEEccCCCe-eeCCCCCEEEE
Q 002685          142 SRVMVSLNGVFVNGIRLKSGIV-RELNAGDEVLF  174 (893)
Q Consensus       142 ~~~~~S~NGTfVNG~RI~k~~~-~~L~~GD~I~~  174 (893)
                          .|.|||||||..+..|.. ..|+.||+|.|
T Consensus        66 ----tS~g~l~VNgs~~~~g~~~~RLqqGd~i~i   95 (430)
T COG3456          66 ----TSNGGLLVNGSDLPLGEGSARLQQGDEILI   95 (430)
T ss_pred             ----cCCCceeecccccCCCCCccccccCCEEee
Confidence                789999999999987766 88999999999


No 11 
>KOG1881 consensus Anion exchanger adaptor protein Kanadaptin, contains FHA domain [General function prediction only]
Probab=97.90  E-value=3.8e-06  Score=99.22  Aligned_cols=93  Identities=23%  Similarity=0.365  Sum_probs=74.0

Q ss_pred             CCCCcceeEEecCCCCEEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccC
Q 002685           55 ATGSPVDSLTLEPDRPYSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLED  134 (893)
Q Consensus        55 ~~g~~~~~i~L~~~~~ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D  134 (893)
                      ..|..+++.+|.....|+|||.. .||+.+.||.|||.||.|.+....     .++    |-.+--+ .|      +|.|
T Consensus       163 Keg~iiet~~l~~~~~~~fgr~~-~cD~~~eHpsISr~h~vlQy~~~~-----~~~----p~~s~~~-g~------~i~d  225 (793)
T KOG1881|consen  163 KEGAIIETEDLKGAAACLFGRLG-GCDVALEHPSISRFHAVLQYKASG-----PDD----PCASNGE-GW------YIYD  225 (793)
T ss_pred             ccCceeeeeecccceeEEecccC-CCccccccCcccccceeeeccCCC-----CCc----cccCCCC-ce------EEee
Confidence            35778889999988889999999 999999999999999999965432     121    0000001 14      7778


Q ss_pred             cccccccccccccccceEECCEEccCCCeeeCCCCCEEEE
Q 002685          135 EEEDEGFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLF  174 (893)
Q Consensus       135 ~~~~~~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~  174 (893)
                            |    +|+.|||+|..||.+..+..++.|+++.|
T Consensus       226 ------l----gsThgt~~NK~rvppk~yir~~Vg~v~~f  255 (793)
T KOG1881|consen  226 ------L----GSTHGTFLNKDRVPPKVYIRDRVGHVARF  255 (793)
T ss_pred             ------c----cccccceeccccCCCcchhhhhHHHHHHh
Confidence                  4    99999999999999999999999999988


No 12 
>KOG0615 consensus Serine/threonine protein kinase Chk2 and related proteins [Cell cycle control, cell division, chromosome partitioning]
Probab=97.76  E-value=2e-05  Score=89.14  Aligned_cols=90  Identities=33%  Similarity=0.469  Sum_probs=67.9

Q ss_pred             eeEEecCCCCEEEcCCCCCCCEEecCCccccceEEEEEccCcceEE-ecCCcccccccccccchhhhcccccccCccccc
Q 002685           61 DSLTLEPDRPYSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIY-VLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDE  139 (893)
Q Consensus        61 ~~i~L~~~~~ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy-~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~  139 (893)
                      ..+.+..+ .|++||.+ .||++++...+|..|..+.+-....++. ..++         .=+.+    +++++|     
T Consensus        57 ~~~d~~nd-~f~fGR~~-~~d~~ln~~~~s~~~~~i~~~~~~~~~~f~~dr---------~~~sn----~~y~~D-----  116 (475)
T KOG0615|consen   57 KSIDLAND-EFTFGRGD-SCDAPLNLNNVSNKHFKILLYNKISKIHFRIDR---------DKNSN----RVYLHD-----  116 (475)
T ss_pred             ccceeccc-eEEecCCC-cccccccCccccccchheeeeeeeeeeeecccC---------CCccc----eEEEEe-----
Confidence            45666654 69999999 9999999999999999988642211221 1111         10111    448889     


Q ss_pred             ccccccccccceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          140 GFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       140 ~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                            .|+||||||-++++++..++|++||+|.|..
T Consensus       117 ------hS~nGT~VN~e~i~k~~~r~lkN~dei~is~  147 (475)
T KOG0615|consen  117 ------HSRNGTFVNDEMIGKGLSRILKNGDEISISI  147 (475)
T ss_pred             ------cccCcccccHhHhhccccccccCCCEEEecc
Confidence                  7999999999999999999999999999844


No 13 
>KOG1880 consensus Nuclear inhibitor of phosphatase-1 [General function prediction only]
Probab=97.17  E-value=0.00015  Score=78.91  Aligned_cols=86  Identities=24%  Similarity=0.348  Sum_probs=67.9

Q ss_pred             CCCcceeEEecCCCCEEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCc
Q 002685           56 TGSPVDSLTLEPDRPYSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDE  135 (893)
Q Consensus        56 ~g~~~~~i~L~~~~~ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~  135 (893)
                      ++.+.+.+.+. ++.|.+||.+..||++|++..+||+|+-+.+-        ..+           +      ++++.| 
T Consensus        26 d~li~kl~idd-kr~y~Fgrn~q~~df~idh~scSrvhaa~vyh--------khl-----------~------~~~lid-   78 (337)
T KOG1880|consen   26 DKLIQKLIIDD-KRRYLFGRNHQTCDFVIDHASCSRVHAALVYH--------KHL-----------S------RIFLID-   78 (337)
T ss_pred             chhHHHHHhhh-hhhhhhccCCCccceEeecchhhhhHhhhhhh--------hcc-----------c------eEEEEE-
Confidence            45444555555 46799999999999999999999999987731        111           1      126778 


Q ss_pred             ccccccccccccccceEECCEEccCCCeeeCCCCCEEEEEeC
Q 002685          136 EEDEGFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLFVCR  177 (893)
Q Consensus       136 ~~~~~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~~g  177 (893)
                           |    +|+-|||+...|++..+.++|..|-...|+.+
T Consensus        79 -----l----~s~hgtf~g~~rL~~~~p~~l~i~~~~~fgas  111 (337)
T KOG1880|consen   79 -----L----GSTHGTFLGNERLEPHKPVQLEIGSTFHFGAS  111 (337)
T ss_pred             -----c----cCCcceeeeeeeeccCCCccccCCceEEEecc
Confidence                 4    89999999999999999999999999999554


No 14 
>TIGR02500 type_III_yscD type III secretion apparatus protein, YscD/HrpQ family. This family represents a conserved protein of bacterial type III secretion systems. Gene symbols are variable from species to species. Members are designated YscD in Yersinia, HrpQ in Pseudomonas syringae, and EscD in enteropathogenic Escherichia coli. In the Chlamydiae, this model describes the C-terminal 400 residues of a longer protein.
Probab=95.70  E-value=0.026  Score=64.70  Aligned_cols=46  Identities=22%  Similarity=0.455  Sum_probs=38.6

Q ss_pred             eeEEecC-CCCcceeEEecCCCCEEEc-CCCCCCCEEecCCccccceEEEEEc
Q 002685           49 LPLVSTA-TGSPVDSLTLEPDRPYSIG-RASINCDFTFDNRLVSRQHCQILFD   99 (893)
Q Consensus        49 ~~lis~~-~g~~~~~i~L~~~~~ytIG-R~~~~cDivi~d~~VSr~Ha~I~fd   99 (893)
                      +-++++. .|.   .+.|.+++ ++|| |++ +||+++.|+.||++|+.|..+
T Consensus         3 lrvl~G~~~G~---~~~L~~g~-~~iG~~~~-~~di~L~d~~~~~~h~~l~v~   50 (410)
T TIGR02500         3 LRVLSGPHRGA---ELPLPEGN-LVLGTDAA-DCDIVLSDGGIAAVHVSLHVR   50 (410)
T ss_pred             EEEecCCCCCc---EEECCCCc-eEeccCCC-CcEEEeCCCCccchheEEEEc
Confidence            4566677 565   67888875 9999 998 999999999999999999854


No 15 
>PF06087 Tyr-DNA_phospho:  Tyrosyl-DNA phosphodiesterase;  InterPro: IPR010347 Covalent intermediates between topoisomerase I and DNA can become dead-end complexes that lead to cell death. Tyrosyl-DNA phosphodiesterase can hydrolyse the bond between topoisomerase I and DNA [].; GO: 0008081 phosphoric diester hydrolase activity, 0006281 DNA repair, 0005634 nucleus; PDB: 3SQ8_A 3SQ5_B 3SQ3_A 1Q32_D 3SQ7_A 1QZQ_A 1RGU_B 1RG2_B 1MU9_B 1RFI_B ....
Probab=95.67  E-value=0.0067  Score=70.10  Aligned_cols=76  Identities=32%  Similarity=0.445  Sum_probs=54.6

Q ss_pred             cCCCh-hHHHhhhchhhHHHHHHHHHHhHh-----------------------hhchhhHHHHhhccCCCc----ccccc
Q 002685          796 QGPKF-SDISKMMQIEHVVALSSLIASIQR-----------------------CRGLWRLQEVLGQYRWPE----LQESD  847 (893)
Q Consensus       796 ~~~~f-~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~----~~~~~  847 (893)
                      .+.+| .++...+..........|+..|++                       .+|+.||.++|.++.++.    ..+.+
T Consensus       156 ~~~~F~~dL~~yL~~y~~~~~~~~~~~l~~yDFS~~~v~lV~SvPG~h~~~~~~~G~~~L~~~L~~~~~~~~~~~~~~~~  235 (443)
T PF06087_consen  156 SGSRFKKDLVAYLNSYGKSPLDKLIERLRKYDFSSARVHLVASVPGKHKEDKDKWGHMRLRKVLKRLGLPSNKDKDKESD  235 (443)
T ss_dssp             STTHHHHHHHHHHHTT--HHHHHCHHHHHTEE-CCGTSEEEEE-SECCCGGGGGSHHHHHHHHHHHCCTT---TTCCCCE
T ss_pred             CCCchHHHHHHHHHHhCCcchhHHHHHHHhcCCccCceEEEeccCccccCCCcchhHHHHHHHHHhccccccCCcCCCCe
Confidence            57777 456666655554445556666655                       599999999999999987    88999


Q ss_pred             eeeeccCcccc--------cHHHHHHHHHhhc
Q 002685          848 FSYGSSSIGSI--------NAQFLAAFAAASG  871 (893)
Q Consensus       848 ~~~~~~~~~~~--------~~~~~~~~~~~~~  871 (893)
                      |+|-+|||||.        +..|+.+++.++.
T Consensus       236 ~~~Q~SSIGs~~~~~~~Wl~~~f~~sl~~~~~  267 (443)
T PF06087_consen  236 IVCQFSSIGSLGSSPKDWLTSEFLTSLYPASF  267 (443)
T ss_dssp             EEEE-SBB---SSSTTTTTTTHHHHHCCHHCC
T ss_pred             EEEEcccccccCcchhhhHHHHHHHHHhhccc
Confidence            99999999977        6779998888775


No 16 
>TIGR01663 PNK-3'Pase polynucleotide 5'-kinase 3'-phosphatase. Note that the EC number for the kinase function is: 2.7.1.78
Probab=91.51  E-value=0.34  Score=57.75  Aligned_cols=79  Identities=22%  Similarity=0.274  Sum_probs=63.1

Q ss_pred             eeEEecCCCCEEEcCCCCCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCcccccc
Q 002685           61 DSLTLEPDRPYSIGRASINCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDEG  140 (893)
Q Consensus        61 ~~i~L~~~~~ytIGR~~~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~~  140 (893)
                      ..|+|.. ....|||.|   +..|.|..-||+..++.-|-..       |             +     |.+.-    + 
T Consensus        25 ~~~~~~~-~~~~~gr~p---et~i~d~~cs~~qv~l~a~~~~-------~-------------~-----v~~k~----l-   70 (526)
T TIGR01663        25 HFIHLDA-GALFLGRGP---ETGIRDRKCSKRQIELQADLEK-------A-------------T-----VALKQ----L-   70 (526)
T ss_pred             CeeccCC-CceEEccCc---ccccchhhhchhhheeeecccC-------c-------------e-----EEEEE----c-
Confidence            4778775 568899999   7788899999999998865332       1             1     24444    3 


Q ss_pred             cccccccccceEECCEEccCCCeeeCCCCCEEEEEeCCC
Q 002685          141 FSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLFVCRDE  179 (893)
Q Consensus       141 L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~~g~~  179 (893)
                            ..|-+-|||+-+.++...+|++||.+.|+.|..
T Consensus        71 ------g~np~~~~~~~~~~~~~~~l~~g~~l~~v~~~~  103 (526)
T TIGR01663        71 ------GVNPCGTGGLELKPGGEGELGHGDLLEIVNGLH  103 (526)
T ss_pred             ------cCCCcccCceEecCCCeeeecCCCEEEEecccc
Confidence                  459999999999999999999999999988764


No 17 
>KOG0245 consensus Kinesin-like protein [Cytoskeleton]
Probab=91.33  E-value=0.51  Score=59.26  Aligned_cols=76  Identities=24%  Similarity=0.405  Sum_probs=53.4

Q ss_pred             eEEecCCCCEEEcCCC--CCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCccccc
Q 002685           62 SLTLEPDRPYSIGRAS--INCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEEDE  139 (893)
Q Consensus        62 ~i~L~~~~~ytIGR~~--~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~~~  139 (893)
                      .+.+-++...+|||.+  ...||+++-..|--+||.|.-++... +               |.--      -.+|     
T Consensus       470 LlY~ikeG~TrVG~~~a~~~~DI~LsG~~I~~qHC~i~~~~g~~-~---------------vtl~------p~e~-----  522 (1221)
T KOG0245|consen  470 LLYYIKEGETRVGREDASSRQDIVLSGQLIREQHCSIRNEGGND-V---------------VTLE------PCED-----  522 (1221)
T ss_pred             EEEEeccCceecCCCCcccCCceEecchhhhhhceEEEecCCCc-e---------------EEec------cCCc-----
Confidence            3444455678999876  46899999999999999998332210 0               0000      2233     


Q ss_pred             ccccccccccceEECCEEccCCCeeeCCCCCEEEEE
Q 002685          140 GFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       140 ~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                               --|||||..|..-  ..|+.||+|.++
T Consensus       523 ---------aetyVNGk~v~ep--~qL~~GdRiilG  547 (1221)
T KOG0245|consen  523 ---------AETYVNGKLVTEP--TQLRSGDRIILG  547 (1221)
T ss_pred             ---------cceeEccEEcCCc--ceeccCCEEEEc
Confidence                     3499999999974  579999999993


No 18 
>PF08797 HIRAN:  HIRAN domain;  InterPro: IPR014905 The HIRAN protein (HIP116, Rad5p N-terminal) is found in the N-terminal regions of the SWI2/SNF2 proteins typified by HIP116 and Rad5p. HIRAN is found as a standalone protein in several bacteria and prophages, or fused to other catalytic domains, such as a nuclease of the restriction endonuclease fold and TDP1-like DNA phosphoesterases, in the eukaryotes []. It has been predicted that this protein functions as a DNA-binding domain that probably recognises features associated with damaged DNA or stalled replication forks [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides; PDB: 2L1I_A 3K2Y_C.
Probab=87.15  E-value=0.1  Score=48.80  Aligned_cols=63  Identities=35%  Similarity=0.592  Sum_probs=44.6

Q ss_pred             hhhccCCCCCCCCCCeEEEecCCCcccCCceeeccCcchhcccccCCCCCcccceeecccchHHH
Q 002685          712 ILLRRNTNVPADANAVSVLVPNPDELSDGDCIQLGFMPRDIAKWVSPLWDIGFIRFSGFISRDEV  776 (893)
Q Consensus       712 ~v~~sn~n~P~~~~~~~IIfPT~deVrsg~skQL~fL~~~l~kW~s~~~D~~~~~f~g~~~p~~~  776 (893)
                      ..+.+..+.|.|.+++++.....-....  ..++|||++..++|.+++-|.+.+.|.|.+...+.
T Consensus        29 v~L~re~~N~~D~nAi~v~~~~~~~~~~--~~~iGylp~~~a~~l~~Lld~~~~~~~~~v~~~~~   91 (107)
T PF08797_consen   29 VVLVREPDNPYDPNAIKVYTIDRFSNSR--GREIGYLPREIAKWLSPLLDSGGVKFEGTVVFVPD   91 (107)
T ss_dssp             EEEEEETT-TTHCCCCCEEETT--------BGCCCCCHHHHHHHHHHHCCT-TEEEEEEEECCCT
T ss_pred             EEEEEcCCCCCCcccEEEEEeecccccC--CCEEEEecHHHHHHHHHHHhCCCcEEEEEEEECCc
Confidence            4445555666678888888877532222  24899999999999999999999999999876555


No 19 
>KOG2293 consensus Daxx-interacting protein MSP58/p78, contains FHA domain [Transcription; Signal transduction mechanisms]
Probab=80.22  E-value=4  Score=48.50  Aligned_cols=74  Identities=23%  Similarity=0.302  Sum_probs=54.1

Q ss_pred             EecCCCCEEEcCCCCCCCEEec------CCccccceEEEEEccCcceEEecCCcccccccccccchhhhcccccccCccc
Q 002685           64 TLEPDRPYSIGRASINCDFTFD------NRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKRDKSLEDEEE  137 (893)
Q Consensus        64 ~L~~~~~ytIGR~~~~cDivi~------d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~~~~i~D~~~  137 (893)
                      ++...+.+++||+..+|-+=||      .-.|||+.++|...        .+|            -|      +|..   
T Consensus       443 h~mrk~EVtlGRat~d~~VDIDLgkegpatKISRRQa~IkL~--------n~G------------sF------~IkN---  493 (547)
T KOG2293|consen  443 HYMRKKEVTLGRATGDLKVDIDLGKEGPATKISRRQALIKLK--------NDG------------SF------FIKN---  493 (547)
T ss_pred             hhhcCcceEeeccCCCcceeeeccccCccceeeccceeEEec--------cCC------------cE------Eecc---
Confidence            3444578999999976666555      36899999998731        344            25      6766   


Q ss_pred             ccccccccccccceEECCEEccCCCeeeCCCCCEEEE
Q 002685          138 DEGFSRVMVSLNGVFVNGIRLKSGIVRELNAGDEVLF  174 (893)
Q Consensus       138 ~~~L~~~~~S~NGTfVNG~RI~k~~~~~L~~GD~I~~  174 (893)
                       +       ..--.||||.+|..|+.+.|++.-.|.|
T Consensus       494 -l-------GK~~I~vng~~l~~gq~~~L~~nclveI  522 (547)
T KOG2293|consen  494 -L-------GKRSILVNGGELDRGQKVILKNNCLVEI  522 (547)
T ss_pred             -C-------cceeEEeCCccccCCceEEeccCcEEEE
Confidence             3       3355899999999999988887777766


No 20 
>KOG1892 consensus Actin filament-binding protein Afadin [Cytoskeleton]
Probab=73.09  E-value=10  Score=47.97  Aligned_cols=85  Identities=20%  Similarity=0.338  Sum_probs=58.9

Q ss_pred             EEecCCCCcc-eeEEecCCCCEEEcCCC-CCCCEEecCCccccceEEEEEccCcceEEecCCcccccccccccchhhhcc
Q 002685           51 LVSTATGSPV-DSLTLEPDRPYSIGRAS-INCDFTFDNRLVSRQHCQILFDSSERKIYVLDGTFLLPAFSSVVNEFRKKR  128 (893)
Q Consensus        51 lis~~~g~~~-~~i~L~~~~~ytIGR~~-~~cDivi~d~~VSr~Ha~I~fd~~~~~iy~~~g~~~~~~~~~~v~~~r~~~  128 (893)
                      +.-+.+|.-- ..|.|..+ +.-+|-.. .++.|.+.-|.|-.+||-|.+         .+||.                
T Consensus       360 ve~s~dG~~s~~ri~L~~~-vtEVGs~~~~~~~iqLfGP~IqprHc~it~---------meGVv----------------  413 (1629)
T KOG1892|consen  360 VELSPDGSDSRKRIRLQLS-VTEVGSEKLDDNSIQLFGPGIQPRHCDITN---------MEGVV----------------  413 (1629)
T ss_pred             EEEcCCCCCcceeEEeccC-ceeccccccCCcceeeeCCCCCccccchhh---------ccceE----------------
Confidence            3334455432 56777764 56788877 346899999999999998873         45632                


Q ss_pred             cccccCcccccccccccccc-cceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          129 DKSLEDEEEDEGFSRVMVSL-NGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       129 ~~~i~D~~~~~~L~~~~~S~-NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                        .++-           .++ -=|||||.+|..-  ..|..|+.|.|+.
T Consensus       414 --TvTP-----------~~~DA~t~VnGh~isqt--tiL~~G~~v~fGa  447 (1629)
T KOG1892|consen  414 --TVTP-----------RSMDAETYVNGHRISQT--TILQSGMKVQFGA  447 (1629)
T ss_pred             --Eecc-----------cccchhhhccceecchh--hhhccCCEEEecc
Confidence              2222           122 3499999999874  6799999999953


No 21 
>PF00614 PLDc:  Phospholipase D Active site motif;  InterPro: IPR001736 Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs [, , , ].; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3HSI_C.
Probab=71.96  E-value=3  Score=31.12  Aligned_cols=24  Identities=33%  Similarity=0.468  Sum_probs=13.8

Q ss_pred             ccccceEEEEEeCCcEEEEEeCCCCCH
Q 002685          525 ACHHPKLFVLQREDSIRVIVTSANLGA  551 (893)
Q Consensus       525 GtHHSKMmIL~Y~D~LRVVI~TANLi~  551 (893)
                      ++||+|++++--+   +..|.++||..
T Consensus         3 ~~~H~K~~vvD~~---~a~vGg~nl~~   26 (28)
T PF00614_consen    3 GSHHQKFVVVDDR---VAFVGGANLCD   26 (28)
T ss_dssp             BEE---EEEETTT---EEEEE---SSH
T ss_pred             cceeeEEEEEcCC---EEEECceecCC
Confidence            4899999998433   88999999864


No 22 
>smart00155 PLDc Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are  activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic  acid from phosphatidylcholine, which may be essential for the formation  of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways.  PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to  possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid,  and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not be
Probab=69.95  E-value=4.4  Score=29.55  Aligned_cols=24  Identities=29%  Similarity=0.394  Sum_probs=19.5

Q ss_pred             ccccceEEEEEeCCcEEEEEeCCCCCH
Q 002685          525 ACHHPKLFVLQREDSIRVIVTSANLGA  551 (893)
Q Consensus       525 GtHHSKMmIL~Y~D~LRVVI~TANLi~  551 (893)
                      +.+|+|+|+.-.   -.++|.|+||..
T Consensus         3 ~~~H~K~~v~D~---~~~~iGs~N~~~   26 (28)
T smart00155        3 GVLHTKLMIVDD---EIAYIGSANLDG   26 (28)
T ss_pred             CcEEeEEEEEcC---CEEEEeCccCCC
Confidence            479999999844   379999999964


No 23 
>cd00138 PLDc Phospholipase D. Active site motifs; The PLD superfamily includes enzymes involved in signal transduction, lipid biosynthesis, endonucleases and open reading frames in pathogenic viruses and bacteria.  PLD hydrolyzes the terminal phosphodiester bond of phospholipids to phosphatidic acid and a hydrophilic constituent. Phosphatidic acid is a compound that is heavily involved in signal transduction.  The common features of the family members are that they can bind to a phosphodiester moiety, and that most of these enzymes are active as bi-lobed monomers or dimers.
Probab=62.02  E-value=9.9  Score=37.48  Aligned_cols=37  Identities=22%  Similarity=0.228  Sum_probs=28.3

Q ss_pred             ccccceEEEEEeCCcEEEEEeCCCCCHHhhc-cccCeEEee
Q 002685          525 ACHHPKLFVLQREDSIRVIVTSANLGAKQWN-AVTNTVWWQ  564 (893)
Q Consensus       525 GtHHSKMmIL~Y~D~LRVVI~TANLi~~DW~-~~TQ~VWiQ  564 (893)
                      +.+|+|+||+=   .=.++|.|+|+....|. +.+.++.+.
T Consensus       112 ~~~H~K~~iiD---~~~~~vGS~N~~~~~~~~~~e~~~~~~  149 (176)
T cd00138         112 GVLHTKLVIVD---DETAYIGSANLDGRSLTLNSEVGVVIY  149 (176)
T ss_pred             cceeeeEEEEc---CCEEEEECCcCChhhhhhhcceEEEEe
Confidence            48999999994   34899999999999885 344455444


No 24 
>PF13091 PLDc_2:  PLD-like domain; PDB: 2ZE4_A 2ZE9_A 1BYS_A 1BYR_A 1V0T_A 1V0U_A 1V0V_A 1V0S_A 1V0R_A 1V0W_A ....
Probab=58.76  E-value=7.3  Score=36.16  Aligned_cols=36  Identities=25%  Similarity=0.358  Sum_probs=22.5

Q ss_pred             cccceEEEEEeCCcEEEEEeCCCCCHHhhccc-cCeEEee
Q 002685          526 CHHPKLFVLQREDSIRVIVTSANLGAKQWNAV-TNTVWWQ  564 (893)
Q Consensus       526 tHHSKMmIL~Y~D~LRVVI~TANLi~~DW~~~-TQ~VWiQ  564 (893)
                      .+|.|+++.-  +. .++|.|+|++...|..- +.++.+.
T Consensus        74 ~~H~K~~i~d--~~-~~iiGS~N~t~~~~~~n~E~~~~~~  110 (126)
T PF13091_consen   74 RLHAKFYIID--DK-VAIIGSANLTSSSFRRNYELGVIID  110 (126)
T ss_dssp             -B--EEEEET--TT-EEEEES--CSCCCSCTSEEEEEEEE
T ss_pred             CCCcceEEec--Cc-cEEEcCCCCCcchhcCCcceEEEEE
Confidence            6899999993  22 99999999999988644 4344444


No 25 
>TIGR02868 CydC thiol reductant ABC exporter, CydC subunit. The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain (pfam00664) and a C-terminal ATP-binding domain (pfam00005). In E. coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in export of redox-active thiol compounds such as cysteine and glutathione. The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this model are all either found in the gammproteobacterial context or the CydABCD context. All members of this family scoring above trusted at the time of its creation were from genomes which encode a cytochrome bd complex.
Probab=53.59  E-value=5.8  Score=46.61  Aligned_cols=67  Identities=16%  Similarity=0.106  Sum_probs=47.9

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHHHHhhhhhc
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLLSQCRNILN  270 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~~~~r~il~  270 (893)
                      ++++++|+   +..++.+    .+.|.|+    |||+          ++..-..||+++..|.|.-.+..+-+. ++-|.
T Consensus       350 vL~~isl~---i~~G~~v----aIvG~SG----sGKS----------TLl~lL~g~~~p~~G~I~i~g~~i~~~-~~~lr  407 (529)
T TIGR02868       350 VLDGVSLD---LPPGERV----AILGPSG----SGKS----------TLLMLLTGLLDPLQGEVTLDGVSVSSL-QDELR  407 (529)
T ss_pred             eeecceEE---EcCCCEE----EEECCCC----CCHH----------HHHHHHhcCCCCCCcEEEECCEEhhhH-HHHHH
Confidence            56667666   4433332    4679999    9999          999999999999999998887666554 44443


Q ss_pred             CCCchhhhhhh
Q 002685          271 SNDPVSCIQQI  281 (893)
Q Consensus       271 s~dp~s~~~~~  281 (893)
                      +.  |+|+.|-
T Consensus       408 ~~--i~~V~Q~  416 (529)
T TIGR02868       408 RR--ISVFAQD  416 (529)
T ss_pred             hh--eEEEccC
Confidence            22  6776553


No 26 
>cd01764 Urm1 Urm1-like ubuitin domain. Urm1 (Ubiquitin-Related Modifier1)  The Urm1 fold, like those of two closely related proteins MoaD (molybdopterin synthase) and ThiS (sulfur carrier protein), is similar to that of ubiquitin although there is little or no sequence similarity. The C-terminal glycines of Urm1 are conjugated to an E1-like protein Uba4 as part of a novel conjugation system in yeast.  The Urm1 fold is found only in eukaryotes.
Probab=47.69  E-value=16  Score=34.12  Aligned_cols=27  Identities=26%  Similarity=0.409  Sum_probs=22.0

Q ss_pred             eEECCEEcc--CCCeeeCCCCCEEEEEeC
Q 002685          151 VFVNGIRLK--SGIVRELNAGDEVLFVCR  177 (893)
Q Consensus       151 TfVNG~RI~--k~~~~~L~~GD~I~~~~g  177 (893)
                      ++|||+.+.  .|....|++||+|.|...
T Consensus        62 VlvN~~di~~l~g~~t~L~dgD~v~i~P~   90 (94)
T cd01764          62 VLINDTDWELLGEEDYILEDGDHVVFIST   90 (94)
T ss_pred             EEECCccccccCCcccCCCCcCEEEEECC
Confidence            679999885  456678999999999653


No 27 
>PRK13657 cyclic beta-1,2-glucan ABC transporter; Provisional
Probab=45.96  E-value=8.4  Score=46.01  Aligned_cols=67  Identities=15%  Similarity=0.119  Sum_probs=47.2

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHHHHhhhhhc
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLLSQCRNILN  270 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~~~~r~il~  270 (893)
                      ++++|+++   +..++.+    .+.|+|+    +||+          +++.-..|+++|..|.|.-.+..+-+.=+.-+.
T Consensus       350 iL~~inl~---i~~G~~v----~IvG~sG----sGKS----------TLl~lL~gl~~p~~G~I~i~g~~i~~~~~~~~r  408 (588)
T PRK13657        350 GVEDVSFE---AKPGQTV----AIVGPTG----AGKS----------TLINLLQRVFDPQSGRILIDGTDIRTVTRASLR  408 (588)
T ss_pred             eecceeEE---ECCCCEE----EEECCCC----CCHH----------HHHHHHhcCcCCCCCEEEECCEEhhhCCHHHHH
Confidence            56677776   4444443    4679999    9999          999999999999999998888766543333333


Q ss_pred             CCCchhhhhh
Q 002685          271 SNDPVSCIQQ  280 (893)
Q Consensus       271 s~dp~s~~~~  280 (893)
                      +.  |+|+.|
T Consensus       409 ~~--i~~v~Q  416 (588)
T PRK13657        409 RN--IAVVFQ  416 (588)
T ss_pred             hh--eEEEec
Confidence            33  566644


No 28 
>TIGR02988 YaaA_near_RecF S4 domain protein YaaA. This small protein has a single S4 domain (pfam01479), as do bacterial ribosomal protein S4, some pseudouridine synthases, tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein family are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members of this family that are not near recF are found instead near dnaA and/or dnaN, the usual neighbors of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.
Probab=44.75  E-value=25  Score=29.74  Aligned_cols=25  Identities=32%  Similarity=0.379  Sum_probs=19.9

Q ss_pred             cceEECCEEccCCCeeeCCCCCEEEE
Q 002685          149 NGVFVNGIRLKSGIVRELNAGDEVLF  174 (893)
Q Consensus       149 NGTfVNG~RI~k~~~~~L~~GD~I~~  174 (893)
                      +.++|||..+.+. ...|..||+|.|
T Consensus        34 G~V~VNg~~~~~~-~~~l~~Gd~v~i   58 (59)
T TIGR02988        34 NEVLVNGELENRR-GKKLYPGDVIEI   58 (59)
T ss_pred             CCEEECCEEccCC-CCCCCCCCEEEe
Confidence            6799999998432 247999999986


No 29 
>PF01479 S4:  S4 domain;  InterPro: IPR002942 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yeast protein containing a pseudouridine synthetase and a deaminase domain, bacterial tyrosyl-tRNA synthetases, and a number of uncharacterised, small proteins that may be involved in translation regulation []. The S4 domain probably mediates binding to RNA.; GO: 0003723 RNA binding; PDB: 3BBU_A 1DM9_B 2K6P_A 3U5G_E 3U5C_E 3IZB_D 2XZM_D 2XZN_D 3O30_E 3O2Z_E ....
Probab=44.11  E-value=19  Score=28.88  Aligned_cols=24  Identities=29%  Similarity=0.288  Sum_probs=19.4

Q ss_pred             ccceEECCEEccCCCeeeCCCCCEE
Q 002685          148 LNGVFVNGIRLKSGIVRELNAGDEV  172 (893)
Q Consensus       148 ~NGTfVNG~RI~k~~~~~L~~GD~I  172 (893)
                      .++++|||+.+.... ..+..||+|
T Consensus        25 ~g~V~VNg~~v~~~~-~~v~~~d~I   48 (48)
T PF01479_consen   25 QGRVKVNGKVVKDPS-YIVKPGDVI   48 (48)
T ss_dssp             TTTEEETTEEESSTT-SBESTTEEE
T ss_pred             CCEEEECCEEEcCCC-CCCCCcCCC
Confidence            478999999999442 468999987


No 30 
>PRK13912 nuclease NucT; Provisional
Probab=42.28  E-value=23  Score=36.22  Aligned_cols=93  Identities=16%  Similarity=0.186  Sum_probs=52.4

Q ss_pred             HHhhcccchhhhhhhcC--CCCCCcEEEEEccCccccccCcCCcCCCCCCCCCCeEEEcC-CCCccccccccccCCCccc
Q 002685          450 FIATFTSDILWFMSYCE--IPSHLPVTIACHNTERCWSTSADKRTSVPYPNFPNLVLVFP-PFPESVAFGENCKKQGIAC  526 (893)
Q Consensus       450 llfnF~~DidWLLs~f~--~~r~lpVtIVvh~~~~~wk~e~~~rl~~~~~~~pNv~li~P-pMPe~I~FG~d~~~~g~Gt  526 (893)
                      .+..|.+..+.+.+.+-  ..|++.|-|+.+.....   ............++|+.+..- +++  ...+     ...+.
T Consensus        50 ~i~~Y~~~~~~i~~aL~~Aa~RGV~VrIlld~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~--~~~~-----~~~~~  119 (177)
T PRK13912         50 KIAIYSFTHKDIAKALKSAAKRGVKISIIYDYESNH---NNDQSTIGYLDKYPNIKVCLLKGLK--AKNG-----KYYGI  119 (177)
T ss_pred             EEEEEEEchHHHHHHHHHHHHCCCEEEEEEeCcccc---CcchhHHHHHHhCCCceEEEecCcc--ccCc-----ccccc
Confidence            34567777777776653  24888888887743210   000000000124556655421 111  0001     12358


Q ss_pred             ccceEEEEEeCCcEEEEEeCCCCCHHhhc
Q 002685          527 HHPKLFVLQREDSIRVIVTSANLGAKQWN  555 (893)
Q Consensus       527 HHSKMmIL~Y~D~LRVVI~TANLi~~DW~  555 (893)
                      +|.|+||+   |+-.++|.|+|+...-+.
T Consensus       120 ~H~K~~vi---D~~~~~iGS~N~t~~s~~  145 (177)
T PRK13912        120 MHQKVAII---DDKIVVLGSANWSKNAFE  145 (177)
T ss_pred             cceeEEEE---cCCEEEEeCCCCChhHhc
Confidence            99999998   555699999999976544


No 31 
>smart00363 S4 S4 RNA-binding domain.
Probab=41.20  E-value=36  Score=26.73  Aligned_cols=28  Identities=21%  Similarity=0.226  Sum_probs=21.1

Q ss_pred             cccceEECCEEccCCCeeeCCCCCEEEEE
Q 002685          147 SLNGVFVNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       147 S~NGTfVNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                      ..++++|||+++... ...|..||.|.+.
T Consensus        24 ~~g~i~vng~~~~~~-~~~l~~gd~i~~~   51 (60)
T smart00363       24 EQGRVKVNGKKVTKP-SYIVKPGDVISVR   51 (60)
T ss_pred             HcCCEEECCEEecCC-CeEeCCCCEEEEc
Confidence            346799999999332 2568999999883


No 32 
>TIGR01687 moaD_arch MoaD family protein, archaeal. Members of this family appear to be archaeal versions of MoaD, subunit 1 of molybdopterin converting factor. This model has been split from the bacterial/eukaryotic equivalog model TIGR01682 because the presence of two members of this family in a substantial number of archaeal species suggests that roles might not be interchangeable.
Probab=40.09  E-value=31  Score=31.05  Aligned_cols=28  Identities=32%  Similarity=0.390  Sum_probs=21.1

Q ss_pred             cceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          149 NGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       149 NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                      --++|||+.+.......|++||+|.|..
T Consensus        56 ~~v~vN~~~v~~~~~~~l~dgdev~i~P   83 (88)
T TIGR01687        56 VIILVNGRNVDWGLGTELKDGDVVAIFP   83 (88)
T ss_pred             EEEEECCEecCccCCCCCCCCCEEEEeC
Confidence            4567888888765324799999999954


No 33 
>cd00754 MoaD Ubiquitin domain of MoaD-like proteins. MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit  is inserted into the lare subunit to form the active site.  The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group.
Probab=38.07  E-value=33  Score=29.87  Aligned_cols=27  Identities=30%  Similarity=0.361  Sum_probs=21.4

Q ss_pred             ccceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          148 LNGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       148 ~NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                      .--++|||+.+...  ..|++||+|.|..
T Consensus        49 ~~~v~vNg~~v~~~--~~l~~gD~v~i~p   75 (80)
T cd00754          49 RVRIAVNGEYVRLD--TPLKDGDEVAIIP   75 (80)
T ss_pred             cEEEEECCeEcCCC--cccCCCCEEEEeC
Confidence            34578899988854  5799999999964


No 34 
>COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
Probab=33.54  E-value=49  Score=31.92  Aligned_cols=28  Identities=32%  Similarity=0.444  Sum_probs=23.4

Q ss_pred             ceEECCEEccCCCeeeCCCCCEEEEEeCCC
Q 002685          150 GVFVNGIRLKSGIVRELNAGDEVLFVCRDE  179 (893)
Q Consensus       150 GTfVNG~RI~k~~~~~L~~GD~I~~~~g~~  179 (893)
                      -+.|||.+++.+  .+++.||+|.|..++.
T Consensus        35 rV~vNG~~aKpS--~~VK~GD~l~i~~~~~   62 (100)
T COG1188          35 RVKVNGQRAKPS--KEVKVGDILTIRFGNK   62 (100)
T ss_pred             eEEECCEEcccc--cccCCCCEEEEEeCCc
Confidence            477999999876  4699999999988763


No 35 
>cd00165 S4 S4/Hsp/ tRNA synthetase RNA-binding domain; The domain surface is populated by conserved, charged residues that define a likely RNA-binding site;  Found in stress proteins, ribosomal proteins and tRNA synthetases; This may imply a hitherto unrecognized functional similarity between these three protein classes.
Probab=32.70  E-value=53  Score=26.53  Aligned_cols=28  Identities=21%  Similarity=0.251  Sum_probs=21.4

Q ss_pred             cccceEECCEEccCCCeeeCCCCCEEEEE
Q 002685          147 SLNGVFVNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       147 S~NGTfVNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                      ..++++|||+++... ...+..||+|.+.
T Consensus        24 ~~g~V~vn~~~~~~~-~~~v~~~d~i~i~   51 (70)
T cd00165          24 KHGHVLVNGKVVTKP-SYKVKPGDVIEVD   51 (70)
T ss_pred             HcCCEEECCEEccCC-ccCcCCCCEEEEc
Confidence            457899999999432 2468999999884


No 36 
>PRK01777 hypothetical protein; Validated
Probab=32.42  E-value=46  Score=31.53  Aligned_cols=28  Identities=25%  Similarity=0.160  Sum_probs=24.0

Q ss_pred             ccccceEECCEEccCCCeeeCCCCCEEEEE
Q 002685          146 VSLNGVFVNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       146 ~S~NGTfVNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                      .+.+-+.|||+.+.-.  ..|++||+|.|.
T Consensus        47 ~~~~~vgI~Gk~v~~d--~~L~dGDRVeIy   74 (95)
T PRK01777         47 LAKNKVGIYSRPAKLT--DVLRDGDRVEIY   74 (95)
T ss_pred             cccceEEEeCeECCCC--CcCCCCCEEEEe
Confidence            4668899999999876  469999999994


No 37 
>TIGR02857 CydD thiol reductant ABC exporter, CydD subunit. Unfortunately, the gene symbol nomenclature adopted based on this operon in B. subtilis assigns cydC to the third gene in the operon where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologs in this family in accordance with the precedence of publication of the E. coli name, CydD
Probab=31.87  E-value=18  Score=42.65  Aligned_cols=52  Identities=21%  Similarity=0.091  Sum_probs=39.3

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHH
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLL  262 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~  262 (893)
                      .++++++|+   +..++.+    .+.|.|+    |||+          +++.-..||+++..|.|.-.+..+-
T Consensus       336 ~il~~i~l~---i~~G~~~----~ivG~sG----sGKS----------TL~~ll~g~~~~~~G~I~~~g~~i~  387 (529)
T TIGR02857       336 PALRPVSFT---VPPGERV----ALVGPSG----AGKS----------TLLNLLLGFVDPTEGSIAVNGVPLA  387 (529)
T ss_pred             ccccceeEE---ECCCCEE----EEECCCC----CCHH----------HHHHHHhcCCCCCCcEEEECCEehh
Confidence            356667666   4433332    4679999    9999          9999999999999999987776543


No 38 
>COG2501 S4-like RNA binding protein [Replication, recombination, and repair]
Probab=28.64  E-value=56  Score=29.89  Aligned_cols=26  Identities=35%  Similarity=0.415  Sum_probs=21.0

Q ss_pred             cccceEECCEEc-cCCCeeeCCCCCEEEE
Q 002685          147 SLNGVFVNGIRL-KSGIVRELNAGDEVLF  174 (893)
Q Consensus       147 S~NGTfVNG~RI-~k~~~~~L~~GD~I~~  174 (893)
                      +-+.++|||+.- +++  ..|++||+|.|
T Consensus        35 ~eg~V~vNGe~EtRRg--kKlr~gd~V~i   61 (73)
T COG2501          35 AEGEVKVNGEVETRRG--KKLRDGDVVEI   61 (73)
T ss_pred             HCCeEEECCeeeeccC--CEeecCCEEEE
Confidence            557899999965 445  45999999999


No 39 
>PRK11507 ribosome-associated protein; Provisional
Probab=27.11  E-value=69  Score=29.08  Aligned_cols=26  Identities=27%  Similarity=0.308  Sum_probs=20.3

Q ss_pred             cccceEECCEEc-cCCCeeeCCCCCEEEE
Q 002685          147 SLNGVFVNGIRL-KSGIVRELNAGDEVLF  174 (893)
Q Consensus       147 S~NGTfVNG~RI-~k~~~~~L~~GD~I~~  174 (893)
                      ..+.+.|||+-- +++  ..|.+||+|.|
T Consensus        35 ~eg~V~VNGeve~rRg--kKl~~GD~V~~   61 (70)
T PRK11507         35 AEGQVKVDGAVETRKR--CKIVAGQTVSF   61 (70)
T ss_pred             HcCceEECCEEecccC--CCCCCCCEEEE
Confidence            447799999944 455  45999999999


No 40 
>cd03292 ABC_FtsE_transporter FtsE is a hydrophilic nucleotide-binding protein that binds FtsX to form a heterodimeric ATP-binding cassette (ABC)-type transporter that associates with the bacterial inner membrane.  The FtsE/X transporter is thought to be involved in cell division and is important for assembly or stability of the septal ring.
Probab=26.94  E-value=34  Score=35.08  Aligned_cols=48  Identities=13%  Similarity=-0.067  Sum_probs=35.0

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechh
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRA  258 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~  258 (893)
                      .++..++|+   +..+..+    .+.|.|+    |||+          +++....|+.++..|.|.-.+
T Consensus        15 ~~l~~~sl~---i~~G~~~----~i~G~nG----sGKS----------TLl~~l~G~~~~~~G~i~~~g   62 (214)
T cd03292          15 AALDGINIS---ISAGEFV----FLVGPSG----AGKS----------TLLKLIYKEELPTSGTIRVNG   62 (214)
T ss_pred             eeeeeeEEE---EcCCCEE----EEECCCC----CCHH----------HHHHHHhcCCCCCceEEEECC
Confidence            356777776   3332322    4669999    9999          999999999999888776554


No 41 
>TIGR00958 3a01208 Conjugate Transporter-2 (CT2) Family protein.
Probab=26.30  E-value=22  Score=43.88  Aligned_cols=53  Identities=17%  Similarity=0.140  Sum_probs=39.4

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHHH
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLLS  263 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~~  263 (893)
                      .++++++|+   +..++.+    .+.|.|+    |||+          +++.-..|++++..|.|.-.+..+-+
T Consensus       495 ~vL~~isl~---i~~Ge~v----aIvG~SG----sGKS----------TLl~lL~gl~~p~~G~I~idg~~i~~  547 (711)
T TIGR00958       495 PVLKGLTFT---LHPGEVV----ALVGPSG----SGKS----------TVAALLQNLYQPTGGQVLLDGVPLVQ  547 (711)
T ss_pred             ccccCceEE---EcCCCEE----EEECCCC----CCHH----------HHHHHHHhccCCCCCEEEECCEEHHh
Confidence            345666665   4433332    4669999    9999          99999999999999999888776543


No 42 
>PF09014 Sushi_2:  Beta-2-glycoprotein-1 fifth domain;  InterPro: IPR015104 The fifth domain of beta-2-glycoprotein-1 (b2GP-1) is composed of four well-defined anti-parallel beta-strands and two short alpha-helices, as well as a long highly flexible loop. It plays an important role in the binding of b2GP-1 to negatively charged compounds and subsequent capture for binding of anti-b2GP-1 antibodies []. ; PDB: 1C1Z_A 3OP8_B 2KRI_A 1QUB_A 1G4G_A 1G4F_A.
Probab=25.89  E-value=1.1e+02  Score=28.80  Aligned_cols=39  Identities=23%  Similarity=0.428  Sum_probs=26.0

Q ss_pred             eEECCEEccC--CCeeeCCCCCEEEEEeCC-CccccccceeEEEEEEE
Q 002685          151 VFVNGIRLKS--GIVRELNAGDEVLFVCRD-ESLCRLQTRIGFLILGI  195 (893)
Q Consensus       151 TfVNG~RI~k--~~~~~L~~GD~I~~~~g~-~~~~~~~~~igf~v~~i  195 (893)
                      ++.||+|+.-  -..-.+.+||.|.|.|.+ +..|      |+.+...
T Consensus        13 Vly~g~k~~i~d~~~~~v~Hge~VsffCknkekkC------sy~~~~q   54 (85)
T PF09014_consen   13 VLYNGEKVWIQDLFKNGVLHGEIVSFFCKNKEKKC------SYTVDAQ   54 (85)
T ss_dssp             EEETTEEEEHHHHTTT-BETT-EEEEEEEETTTTE------EEEEEEE
T ss_pred             EEECCEEechhhcccCceeeCCEEEEEEcCCcccC------CCccccE
Confidence            4467776621  111137899999999988 6677      8888877


No 43 
>PF02824 TGS:  TGS domain;  InterPro: IPR004095  The TGS domain is present in a number of enzymes, for example, in threonyl-tRNA synthetase (ThrRS), GTPase, and guanosine-3',5'-bis(diphosphate) 3'-pyrophosphohydrolase (SpoT) []. The TGS domain is also present at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochaete Borrelia burgdorferi).  TGS is a small domain that consists of ~50 amino acid residues and is predicted to possess a predominantly beta-sheet structure. There is no direct information on the functions of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role []. ; PDB: 1WXQ_A 1NYR_B 1NYQ_B 2KMM_A 1WWT_A 1TKE_A 1TKG_A 1TJE_A 1QF6_A 1TKY_A ....
Probab=25.77  E-value=58  Score=27.90  Aligned_cols=27  Identities=26%  Similarity=0.279  Sum_probs=21.5

Q ss_pred             cccceEECCEEccCCCeeeCCCCCEEEEE
Q 002685          147 SLNGVFVNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       147 S~NGTfVNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                      ..-+..|||+++.-.  .+|.+||+|.|.
T Consensus        33 ~~~~A~Vng~~vdl~--~~L~~~d~v~ii   59 (60)
T PF02824_consen   33 RAVAAKVNGQLVDLD--HPLEDGDVVEII   59 (60)
T ss_dssp             CEEEEEETTEEEETT--SBB-SSEEEEEE
T ss_pred             heeEEEEcCEECCCC--CCcCCCCEEEEE
Confidence            446778999999986  579999999985


No 44 
>cd01666 TGS_DRG_C TGS_DRG_C:   DRG (developmentally regulated GTP-binding protein) represents a family of GTP-binding proteins that includes two members, DRG1 and DRG2. DRG1 and DRG2 have a C-terminal TGS domain (named after the ThrRS, GTPase, and SpoT proteins where it occurs) with a predominantly beta-sheet structure. The function of TGS is unknown but its presence in two types of regulatory proteins (the DRG GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role.
Probab=25.33  E-value=57  Score=29.58  Aligned_cols=21  Identities=24%  Similarity=0.166  Sum_probs=18.3

Q ss_pred             ECCEEccCCCeeeCCCCCEEEEE
Q 002685          153 VNGIRLKSGIVRELNAGDEVLFV  175 (893)
Q Consensus       153 VNG~RI~k~~~~~L~~GD~I~~~  175 (893)
                      ++|+|++..  ++|++||+|.|.
T Consensus        54 ~~gq~Vgl~--~~L~d~DvVeI~   74 (75)
T cd01666          54 HSPQRVGLD--HVLEDEDVVQIV   74 (75)
T ss_pred             CCCeECCCC--CEecCCCEEEEe
Confidence            489999987  679999999985


No 45 
>TIGR01193 bacteriocin_ABC ABC-type bacteriocin transporter. This model describes ABC-type bacteriocin transporter. The amino terminal domain (pfam03412) processes the N-terminal leader peptide from the bacteriocin while C-terminal domains resemble ABC transporter membrane protein and ATP-binding cassette domain. In general, bacteriocins are agents which are responsible for killing or inhibiting the closely related species or even different strains of the same species. Bacteriocins are usually encoded by bacterial plasmids. Bacteriocins are named after the species and hence in literature one encounters various names e.g., leucocin from Leuconostic geldium; pedicocin from Pedicoccus acidilactici; sakacin from Lactobacillus sake etc.
Probab=25.08  E-value=27  Score=42.84  Aligned_cols=51  Identities=16%  Similarity=0.146  Sum_probs=38.5

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLL  262 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~  262 (893)
                      ++++++|+   +..+..+    .+.|.|+    |||+          +++.-..|+++|..|.|.-.+..+-
T Consensus       489 iL~~isl~---i~~G~~v----aIvG~SG----sGKS----------TLlklL~gl~~p~~G~I~idg~~i~  539 (708)
T TIGR01193       489 ILSDISLT---IKMNSKT----TIVGMSG----SGKS----------TLAKLLVGFFQARSGEILLNGFSLK  539 (708)
T ss_pred             ceeceeEE---ECCCCEE----EEECCCC----CCHH----------HHHHHHhccCCCCCcEEEECCEEHH
Confidence            45566665   4433332    4669999    9999          9999999999999999988886653


No 46 
>PF09565 RE_NgoFVII:  NgoFVII restriction endonuclease;  InterPro: IPR019065 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].   This domain is found in NgoFVII restriction endonuclease, which recognises GCSGC but cleavage site is unknown. It is also found as the C-terminal domain of the res subunit of some type III restriction endonucleases. 
Probab=24.83  E-value=60  Score=36.58  Aligned_cols=27  Identities=26%  Similarity=0.450  Sum_probs=23.3

Q ss_pred             cccceEEEEEeCC-cEEEEEeCCCCCHH
Q 002685          526 CHHPKLFVLQRED-SIRVIVTSANLGAK  552 (893)
Q Consensus       526 tHHSKMmIL~Y~D-~LRVVI~TANLi~~  552 (893)
                      .-|+||.+...++ -.+++|.||||...
T Consensus        80 ~~HgKlY~f~k~g~~~~a~IGSANfS~~  107 (296)
T PF09565_consen   80 PYHGKLYIFSKNGKPFRAYIGSANFSQI  107 (296)
T ss_pred             CcccEEEEEecCCCceEEEEeecccccc
Confidence            3699999997764 69999999999985


No 47 
>COG1124 DppF ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]
Probab=24.63  E-value=26  Score=38.63  Aligned_cols=52  Identities=15%  Similarity=0.138  Sum_probs=38.8

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHHH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLLS  263 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~~  263 (893)
                      .+++|.|+   |.-+..+    .+.|.|+    |||+          +|-...-|+.++..|.|.-.++-+..
T Consensus        22 ~l~~VS~~---i~~Ge~l----givGeSG----sGKS----------TL~r~l~Gl~~p~~G~I~~~G~~~~~   73 (252)
T COG1124          22 ALNNVSLE---IERGETL----GIVGESG----SGKS----------TLARLLAGLEKPSSGSILLDGKPLAP   73 (252)
T ss_pred             hhcceeEE---ecCCCEE----EEEcCCC----CCHH----------HHHHHHhcccCCCCceEEECCcccCc
Confidence            56666665   3332332    3449999    9999          88888999999999999998876655


No 48 
>PF02597 ThiS:  ThiS family;  InterPro: IPR003749 ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in Escherichia coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activation process. Sulphur is transferred from ThiI to ThiS in a reaction catalysed by IscS []. MoaD, a protein involved in sulphur transfer during molybdopterin synthesis, is about the same length and shows limited sequence similarity to ThiS. Both have the conserved GG at the COOH end.; PDB: 1JW9_D 1JWB_D 1JWA_D 3BII_D 1NVI_D 1FMA_D 1FM0_D 2QIE_G 2Q5W_D 2K5P_A ....
Probab=23.82  E-value=57  Score=28.14  Aligned_cols=28  Identities=36%  Similarity=0.440  Sum_probs=20.6

Q ss_pred             cceEECCEEccC-CCeeeCCCCCEEEEEe
Q 002685          149 NGVFVNGIRLKS-GIVRELNAGDEVLFVC  176 (893)
Q Consensus       149 NGTfVNG~RI~k-~~~~~L~~GD~I~~~~  176 (893)
                      --++|||+.+.+ +....|++||+|.|..
T Consensus        44 ~~v~vN~~~v~~~~~~~~l~~gD~V~i~p   72 (77)
T PF02597_consen   44 VAVAVNGEIVPDDGLDTPLKDGDEVAILP   72 (77)
T ss_dssp             EEEEETTEEEGGGTTTSBEETTEEEEEEE
T ss_pred             EEEEECCEEcCCccCCcCcCCCCEEEEEC
Confidence            446778888877 2225699999999964


No 49 
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=23.69  E-value=86  Score=26.61  Aligned_cols=26  Identities=38%  Similarity=0.394  Sum_probs=18.7

Q ss_pred             eEECCEEccCC--CeeeCCCCCEEEEEe
Q 002685          151 VFVNGIRLKSG--IVRELNAGDEVLFVC  176 (893)
Q Consensus       151 TfVNG~RI~k~--~~~~L~~GD~I~~~~  176 (893)
                      +.|||+-+.+.  ....|++||+|.|+.
T Consensus        33 v~vN~~~v~~~~~~~~~L~~gD~vei~~   60 (65)
T PRK06944         33 VAVNGDFVARTQHAARALAAGDRLDLVQ   60 (65)
T ss_pred             EEECCEEcCchhcccccCCCCCEEEEEe
Confidence            45777777653  235699999999964


No 50 
>TIGR02673 FtsE cell division ATP-binding protein FtsE. This model describes FtsE, a member of the ABC transporter ATP-binding protein family. This protein, and its permease partner FtsX, localize to the division site. In a number of species, the ftsEX gene pair is located next to FtsY, the signal recognition particle-docking protein.
Probab=23.60  E-value=47  Score=34.09  Aligned_cols=48  Identities=10%  Similarity=0.000  Sum_probs=33.9

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRAS  259 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~  259 (893)
                      ++++++|+   +..+..    +.+.|.++    |||+          +++....|..++..|-|.-.+.
T Consensus        17 il~~is~~---i~~G~~----~~l~G~nG----sGKS----------TLl~~i~Gl~~~~~G~i~~~g~   64 (214)
T TIGR02673        17 ALHDVSLH---IRKGEF----LFLTGPSG----AGKT----------TLLKLLYGALTPSRGQVRIAGE   64 (214)
T ss_pred             eecceeEE---EcCCCE----EEEECCCC----CCHH----------HHHHHHhCCCCCCCceEEECCE
Confidence            46666666   333222    24569999    9999          8999999999888887765543


No 51 
>PRK10348 ribosome-associated heat shock protein Hsp15; Provisional
Probab=23.14  E-value=1.2e+02  Score=30.50  Aligned_cols=29  Identities=17%  Similarity=0.152  Sum_probs=23.0

Q ss_pred             ccceEECCEEccCCCeeeCCCCCEEEEEeCC
Q 002685          148 LNGVFVNGIRLKSGIVRELNAGDEVLFVCRD  178 (893)
Q Consensus       148 ~NGTfVNG~RI~k~~~~~L~~GD~I~~~~g~  178 (893)
                      ..-+.|||++.+.+  .++..||+|.|..+.
T Consensus        33 ~G~V~vnG~~~Kps--~~V~~gd~l~v~~~~   61 (133)
T PRK10348         33 GGKVHYNGQRSKPS--KIVELNATLTLRQGN   61 (133)
T ss_pred             CCCEEECCEECCCC--CccCCCCEEEEEECC
Confidence            36689999996665  568999999996654


No 52 
>COG1559 Aminodeoxychorismate lyase [Coenzyme transport and metabolism]
Probab=23.04  E-value=1.6e+02  Score=33.93  Aligned_cols=94  Identities=21%  Similarity=0.196  Sum_probs=69.3

Q ss_pred             hhcccccCCCC--CcccceeecccchHHHHHHHhhcccceeEEEEeeccCCChhHHHhhhchhhHHH---HHHHHHHhHh
Q 002685          750 RDIAKWVSPLW--DIGFIRFSGFISRDEVLAAALEGISKKVQLILHVSQGPKFSDISKMMQIEHVVA---LSSLIASIQR  824 (893)
Q Consensus       750 ~~l~kW~s~~~--D~~~~~f~g~~~p~~~l~~a~g~~~~~~ql~~~~~~~~~f~~~~~~~~~~~~~~---~~~~~~~~~~  824 (893)
                      ++..|-.....  ..|+|++.+-..+.|.|..-..+......+.  +-||-.+.||.+.++.+.+..   .-..++.+..
T Consensus        76 ~~~~~~~~~~~~lkaG~Y~l~~~ms~~~il~~L~~Gk~~~~~~~--ipEG~t~~~i~~~l~~~~~l~~~~~~~~~~~~~~  153 (342)
T COG1559          76 RLYLKYNPAAGSLKAGEYRLRPGMSAADILKLLESGKSAQFKVT--IPEGYTLKQIAKKLAEAPVLDGVLDGLAIAEIAA  153 (342)
T ss_pred             HHHHHhcCccCCcCCceeecCCCCCHHHHHHHHhcCccccceEe--cCCCccHHHHHHHHhcchhhhhhhhhhhhHHHHH
Confidence            33444444444  6799999999999999998877765554444  999999999999888655432   2345677788


Q ss_pred             hhchhhHHHHhhccCCCccccc
Q 002685          825 CRGLWRLQEVLGQYRWPELQES  846 (893)
Q Consensus       825 ~~~~~~~~~~~~~~~~~~~~~~  846 (893)
                      .++.| -.+-|..|-||++-+-
T Consensus       154 ~~p~~-~~~~~EG~l~PdTY~~  174 (342)
T COG1559         154 RLPKL-DKERLEGYLFPDTYEF  174 (342)
T ss_pred             hcCCC-cCCCCceeecCceeee
Confidence            88888 5567788889998653


No 53 
>PRK11174 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed
Probab=22.88  E-value=29  Score=41.46  Aligned_cols=50  Identities=20%  Similarity=0.078  Sum_probs=37.2

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHH
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFL  261 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l  261 (893)
                      .++++++|+   +..++.+    .+.|.|+    |||+          ++..-..||+ |..|.|.-.+..+
T Consensus       364 ~vL~~i~l~---i~~G~~v----aIvG~SG----sGKS----------TL~~lL~g~~-p~~G~I~i~g~~i  413 (588)
T PRK11174        364 TLAGPLNFT---LPAGQRI----ALVGPSG----AGKT----------SLLNALLGFL-PYQGSLKINGIEL  413 (588)
T ss_pred             eeeeeeEEE---EcCCCEE----EEECCCC----CCHH----------HHHHHHhcCC-CCCcEEEECCEec
Confidence            456777777   4433332    5779999    9999          9999999999 7778777766554


No 54 
>PRK11176 lipid transporter ATP-binding/permease protein; Provisional
Probab=22.83  E-value=28  Score=41.40  Aligned_cols=50  Identities=24%  Similarity=0.269  Sum_probs=38.0

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFL  261 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l  261 (893)
                      ++++++++   +..++.+    .+.|.|+    |||+          ++..-..|+++|..|-|.-.+..+
T Consensus       358 il~~i~l~---i~~G~~~----aIvG~sG----sGKS----------TLl~ll~gl~~p~~G~I~i~g~~i  407 (582)
T PRK11176        358 ALRNINFK---IPAGKTV----ALVGRSG----SGKS----------TIANLLTRFYDIDEGEILLDGHDL  407 (582)
T ss_pred             cccCceEE---eCCCCEE----EEECCCC----CCHH----------HHHHHHHhccCCCCceEEECCEEh
Confidence            45666666   4433332    4679999    9999          999999999999999998877654


No 55 
>TIGR00960 3a0501s02 Type II (General) Secretory Pathway (IISP) Family protein.
Probab=22.80  E-value=48  Score=34.20  Aligned_cols=49  Identities=18%  Similarity=0.071  Sum_probs=35.8

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASF  260 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~  260 (893)
                      +++.++|+   +..+..    +.+.|.|+    |||+          +++.-..|..++..|.|.-++..
T Consensus        18 il~~isl~---i~~Ge~----~~i~G~nG----sGKS----------TLl~~l~Gl~~~~~G~i~~~g~~   66 (216)
T TIGR00960        18 ALDNLNFH---ITKGEM----VFLVGHSG----AGKS----------TFLKLILGIEKPTRGKIRFNGQD   66 (216)
T ss_pred             EEEeeEEE---EcCCCE----EEEECCCC----CCHH----------HHHHHHhCCCCCCceEEEECCEe
Confidence            56666666   332222    24669999    9999          99999999999988888766643


No 56 
>TIGR02982 heterocyst_DevA ABC exporter ATP-binding subunit, DevA family. Members of this protein family are found mostly in the Cyanobacteria, but also in the Planctomycetes. Cyanobacterial examples are involved in heterocyst formation, by which some fraction of members of the colony undergo a developmental change and become capable of nitrogen fixation. The DevBCA proteins are thought export of either heterocyst-specific glycolipids or an enzyme essential for formation of the laminated layer found in heterocysts.
Probab=22.66  E-value=37  Score=35.27  Aligned_cols=48  Identities=10%  Similarity=0.090  Sum_probs=35.1

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRAS  259 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~  259 (893)
                      +++.++|+   +..+..    +.+.|.++    +||+          +++.-..|+.++..|-|.-.+.
T Consensus        20 il~~vs~~---i~~G~~----~~I~G~nG----sGKS----------tLl~~l~G~~~~~~G~i~~~g~   67 (220)
T TIGR02982        20 VLFDINLE---INPGEI----VILTGPSG----SGKT----------TLLTLIGGLRSVQEGSLKVLGQ   67 (220)
T ss_pred             EEeeeEEE---EcCCCE----EEEECCCC----CCHH----------HHHHHHhCCCCCCCeEEEECCE
Confidence            56667666   333222    24669999    9999          8999999999998887876554


No 57 
>PRK10790 putative multidrug transporter membrane\ATP-binding components; Provisional
Probab=22.53  E-value=38  Score=40.60  Aligned_cols=51  Identities=22%  Similarity=0.169  Sum_probs=39.2

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLL  262 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~  262 (893)
                      ++++++|+   +..+..+    .+.|.|+    +||.          +++.-..|+++|..|.|.-.+..+-
T Consensus       356 il~~i~l~---i~~Ge~i----aIvG~SG----sGKS----------TLl~lL~gl~~p~~G~I~idg~~i~  406 (592)
T PRK10790        356 VLQNINLS---VPSRGFV----ALVGHTG----SGKS----------TLASLLMGYYPLTEGEIRLDGRPLS  406 (592)
T ss_pred             eeeceeEE---EcCCCEE----EEECCCC----CCHH----------HHHHHHhcccCCCCceEEECCEEhh
Confidence            46666666   4433332    4669999    9999          9999999999999999988876553


No 58 
>PF13275 S4_2:  S4 domain; PDB: 1P9K_A.
Probab=22.39  E-value=42  Score=29.88  Aligned_cols=27  Identities=33%  Similarity=0.402  Sum_probs=16.5

Q ss_pred             ccccceEECCEEcc-CCCeeeCCCCCEEEE
Q 002685          146 VSLNGVFVNGIRLK-SGIVRELNAGDEVLF  174 (893)
Q Consensus       146 ~S~NGTfVNG~RI~-k~~~~~L~~GD~I~~  174 (893)
                      -..+.++|||+... ++  ..|.+||+|.|
T Consensus        30 I~~g~V~VNGe~e~rrg--~Kl~~GD~V~~   57 (65)
T PF13275_consen   30 IQEGEVKVNGEVETRRG--KKLRPGDVVEI   57 (65)
T ss_dssp             HHHHHHEETTB----SS------SSEEEEE
T ss_pred             HHcCceEECCEEccccC--CcCCCCCEEEE
Confidence            34578999999664 44  46999999999


No 59 
>PRK06437 hypothetical protein; Provisional
Probab=22.25  E-value=1.1e+02  Score=26.81  Aligned_cols=26  Identities=27%  Similarity=0.364  Sum_probs=20.8

Q ss_pred             cceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          149 NGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       149 NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                      =.+.+||+.|...  ..|++||+|.|+.
T Consensus        37 vaV~vNg~iv~~~--~~L~dgD~Veiv~   62 (67)
T PRK06437         37 YVVIVNGSPVLED--HNVKKEDDVLILE   62 (67)
T ss_pred             EEEEECCEECCCc--eEcCCCCEEEEEe
Confidence            3567899999854  6799999999953


No 60 
>TIGR03873 F420-0_ABC_ATP proposed F420-0 ABC transporter, ATP-binding protein. This small clade of ABC-type transporter ATP-binding protein components is found as a three gene cassette along with a periplasmic substrate-binding protein (TIGR03868) and a permease (TIGR03869). The organisms containing this cassette are all Actinobacteria and all contain numerous genes requiring the coenzyme F420. This model was defined based on five such organisms, four of which are lacking all F420 biosynthetic capability save the final side-chain polyglutamate attachment step (via the gene cofE: TIGR01916). In Jonesia denitrificans DSM 20603 and marine actinobacterium PHSC20C1 this cassette is in an apparent operon with the cofE gene and, in PHSC20C1, also with a F420-dependent glucose-6-phosphate dehydrogenase (TIGR03554). Based on these observations we propose that this ATP-binding protein is a component of an F420-0 (that is, F420 lacking only the polyglutamate tail) transporter.
Probab=21.54  E-value=53  Score=34.94  Aligned_cols=33  Identities=9%  Similarity=-0.101  Sum_probs=27.3

Q ss_pred             eeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhH
Q 002685          213 ASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRAS  259 (893)
Q Consensus       213 ~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~  259 (893)
                      .+.|.++    +||+          +++.-..|..++..|.|.-.+.
T Consensus        31 ~i~G~nG----sGKS----------TLl~~i~G~~~~~~G~i~~~g~   63 (256)
T TIGR03873        31 GLLGPNG----SGKS----------TLLRLLAGALRPDAGTVDLAGV   63 (256)
T ss_pred             EEECCCC----CCHH----------HHHHHHcCCCCCCCCEEEECCE
Confidence            3559999    9999          9999999999998887776554


No 61 
>TIGR01682 moaD molybdopterin converting factor, subunit 1, non-archaeal. The C-terminal Gly-Gly of this model is critical to function.
Probab=21.46  E-value=1.1e+02  Score=27.28  Aligned_cols=24  Identities=42%  Similarity=0.473  Sum_probs=18.8

Q ss_pred             eEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          151 VFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       151 TfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                      ++|||+.+...  ..|++||+|.|..
T Consensus        52 v~vn~~~v~~~--~~l~dgDevai~P   75 (80)
T TIGR01682        52 VAVNEEYVTDD--ALLNEGDEVAFIP   75 (80)
T ss_pred             EEECCEEcCCC--cCcCCCCEEEEeC
Confidence            56788877754  5799999999964


No 62 
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria.  The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=21.35  E-value=96  Score=26.63  Aligned_cols=27  Identities=26%  Similarity=0.346  Sum_probs=20.4

Q ss_pred             ceEECCEEccCC--CeeeCCCCCEEEEEe
Q 002685          150 GVFVNGIRLKSG--IVRELNAGDEVLFVC  176 (893)
Q Consensus       150 GTfVNG~RI~k~--~~~~L~~GD~I~~~~  176 (893)
                      .+.|||+-|.+.  ....|++||+|.|..
T Consensus        32 ~V~vNg~~v~~~~~~~~~L~~gD~V~ii~   60 (65)
T cd00565          32 AVALNGEIVPRSEWASTPLQDGDRIEIVT   60 (65)
T ss_pred             EEEECCEEcCHHHcCceecCCCCEEEEEE
Confidence            466899988774  124699999999953


No 63 
>PRK10908 cell division protein FtsE; Provisional
Probab=21.18  E-value=46  Score=34.54  Aligned_cols=49  Identities=14%  Similarity=0.142  Sum_probs=35.1

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhH
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRAS  259 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~  259 (893)
                      .+++.|+|+   +..+..+    .+.|.++    +||+          +++.-..|..++..|-|.-.+.
T Consensus        16 ~~l~~vsl~---i~~Ge~~----~i~G~nG----sGKS----------TLl~~l~G~~~~~~G~i~~~g~   64 (222)
T PRK10908         16 QALQGVTFH---MRPGEMA----FLTGHSG----AGKS----------TLLKLICGIERPSAGKIWFSGH   64 (222)
T ss_pred             eEEeeeeEE---EcCCCEE----EEECCCC----CCHH----------HHHHHHhCCCCCCceEEEECCE
Confidence            356667666   3332322    4669999    9999          9999999999988887766554


No 64 
>PF14451 Ub-Mut7C:  Mut7-C ubiquitin
Probab=21.14  E-value=1e+02  Score=28.45  Aligned_cols=26  Identities=27%  Similarity=0.310  Sum_probs=21.7

Q ss_pred             cceEECCEEccCCCeeeCCCCCEEEEEe
Q 002685          149 NGVFVNGIRLKSGIVRELNAGDEVLFVC  176 (893)
Q Consensus       149 NGTfVNG~RI~k~~~~~L~~GD~I~~~~  176 (893)
                      .-.+|||+.+.-.  ..+++||+|.+..
T Consensus        50 ~~i~vNG~~v~~~--~~~~~Gd~v~V~P   75 (81)
T PF14451_consen   50 GLILVNGRPVDFD--YRLKDGDRVAVYP   75 (81)
T ss_pred             EEEEECCEECCCc--ccCCCCCEEEEEe
Confidence            5578999999886  5699999999853


No 65 
>PRK11160 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed
Probab=20.57  E-value=38  Score=40.70  Aligned_cols=51  Identities=10%  Similarity=0.128  Sum_probs=38.4

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLL  262 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~  262 (893)
                      ++++++|+   +..+..+    .+.|.|+    |||+          ++..-..||+++..|.|.-.+..+-
T Consensus       355 il~~i~~~---i~~G~~~----aivG~sG----sGKS----------TL~~ll~g~~~p~~G~I~i~g~~i~  405 (574)
T PRK11160        355 VLKGLSLQ---IKAGEKV----ALLGRTG----CGKS----------TLLQLLTRAWDPQQGEILLNGQPIA  405 (574)
T ss_pred             ceecceEE---ECCCCEE----EEECCCC----CCHH----------HHHHHHhcCCCCCCceEEECCEEhh
Confidence            45666666   3433332    4669999    9999          9999999999999999988876543


No 66 
>TIGR03797 NHPM_micro_ABC2 NHPM bacteriocin system ABC transporter, ATP-binding protein. Members of this protein family are ABC transporter ATP-binding subunits, part of a three-gene putative bacteriocin transport operon. The other subunits include another ATP-binding subunit (TIGR03796), which has an N-terminal propeptide cleavage domain, and an HlyD homolog (TIGR03794). In a number of genomes, a conserved propeptide sequence with a classic Gly-Gly motif
Probab=20.46  E-value=34  Score=41.85  Aligned_cols=52  Identities=15%  Similarity=0.119  Sum_probs=40.0

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhHHHH
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRASFLL  262 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~~l~  262 (893)
                      .++++|+|+   +..++.+    .+.|.|+    |||+          ++..--.||+.|..|.|.-.+..+-
T Consensus       467 ~vL~~isl~---i~~Ge~v----aIvG~sG----sGKS----------TLlklL~gl~~p~~G~I~idg~~i~  518 (686)
T TIGR03797       467 LILDDVSLQ---IEPGEFV----AIVGPSG----SGKS----------TLLRLLLGFETPESGSVFYDGQDLA  518 (686)
T ss_pred             cceeeeEEE---ECCCCEE----EEECCCC----CCHH----------HHHHHHhCCCCCCCCEEEECCEEcC
Confidence            456677776   4433432    5679999    9999          9999999999999999988876553


No 67 
>cd03256 ABC_PhnC_transporter ABC-type phosphate/phosphonate transport system.  Phosphonates are a class of organophosphorus compounds characterized by a chemically stable carbon-to-phosphorus (C-P) bond.  Phosphonates are widespread among naturally occurring compounds in all kingdoms of wildlife, but only procaryotic microorganisms are able to cleave this bond.  Certain bacteria such as E. coli can use alkylphosphonates as a phosphorus source.  ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.
Probab=20.23  E-value=40  Score=35.18  Aligned_cols=49  Identities=8%  Similarity=0.004  Sum_probs=35.7

Q ss_pred             EEEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhH
Q 002685          190 FLILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRAS  259 (893)
Q Consensus       190 f~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~  259 (893)
                      .++++++|+   +..+..    +.+.|.|+    |||+          +++....|+.++..|.|+-.+.
T Consensus        15 ~~l~~vs~~---i~~Ge~----~~i~G~nG----sGKS----------TLl~~l~Gl~~~~~G~i~~~g~   63 (241)
T cd03256          15 KALKDVSLS---INPGEF----VALIGPSG----AGKS----------TLLRCLNGLVEPTSGSVLIDGT   63 (241)
T ss_pred             EEEecceEE---EcCCCE----EEEECCCC----CCHH----------HHHHHHhCCcCCCCceEEECCE
Confidence            356777776   333222    24669999    9999          9999999999988887776554


No 68 
>PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional
Probab=20.17  E-value=59  Score=35.27  Aligned_cols=48  Identities=10%  Similarity=0.095  Sum_probs=35.3

Q ss_pred             EEEEEEEEEeeeeCCceeeceeeeccccCCcccCCCccceeeeeecCCccCCCCCCCcccccceechhH
Q 002685          191 LILGIIFKEEVVLKRPRLVGTMASSGHSQGSVSSGTRSKRVFALRENDVSNPDSVFPKLKRRDIFGRAS  259 (893)
Q Consensus       191 ~v~~i~~~~~~v~g~k~l~~~~~f~g~s~g~~~~g~~~krvfa~~~~~~~~~~~~f~~~~~~~~~~~~~  259 (893)
                      ++++|+|+   +..+.-    +.+.|.++    +||+          +++....|..++..|.|.-++.
T Consensus        17 ~l~~vsl~---i~~Ge~----~~i~G~nG----sGKS----------TLl~~l~Gl~~p~~G~i~~~g~   64 (274)
T PRK13644         17 ALENINLV---IKKGEY----IGIIGKNG----SGKS----------TLALHLNGLLRPQKGKVLVSGI   64 (274)
T ss_pred             eeeeeEEE---EeCCCE----EEEECCCC----CCHH----------HHHHHHhcCCCCCCceEEECCE
Confidence            46677776   333222    24669999    9999          9999999999998888876554


No 69 
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=20.13  E-value=82  Score=27.06  Aligned_cols=25  Identities=24%  Similarity=0.282  Sum_probs=18.1

Q ss_pred             EECCEEccCCC--eeeCCCCCEEEEEe
Q 002685          152 FVNGIRLKSGI--VRELNAGDEVLFVC  176 (893)
Q Consensus       152 fVNG~RI~k~~--~~~L~~GD~I~~~~  176 (893)
                      -|||+-|.+..  ...|++||+|.|+.
T Consensus        34 avN~~iv~~~~~~~~~L~dgD~Ieiv~   60 (65)
T PRK06488         34 AVNGELVHKEARAQFVLHEGDRIEILS   60 (65)
T ss_pred             EECCEEcCHHHcCccccCCCCEEEEEE
Confidence            46777777632  35699999999954


Done!