Query         gi|254780612|ref|YP_003065025.1| putative transmembrane protein [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 503
No_of_seqs    421 out of 678
Neff          10.1
Searched_HMMs 39220
Date          Mon May 30 02:59:27 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i 254780612.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PHA02107 hypothetical protein   79.9     4.6 0.00012   20.0   5.1   49  433-481    62-124 (216)
  2 pfam03725 RNase_PH_C 3' exorib  63.0      11 0.00028   17.4   5.1   58  427-495     2-62  (67)
  3 cd04619 CBS_pair_6 The CBS dom  61.9      10 0.00026   17.6   3.4   44  435-481    65-109 (114)
  4 KOG2915 consensus               59.8      12 0.00029   17.2   3.4   76  377-473   159-238 (314)
  5 pfam08016 PKD_channel Polycyst  58.5      13 0.00033   16.9   5.2   58  415-478   101-158 (423)
  6 KOG0933 consensus               44.6     7.1 0.00018   18.7   0.3   32  439-480    46-81  (1174)
  7 PRK09532 DNA polymerase III su  44.1      19  0.0005   15.7   2.5   17  443-460   568-584 (874)
  8 TIGR02129 hisA_euk phosphoribo  43.0      11 0.00027   17.5   1.0   16  462-477   219-234 (274)
  9 smart00709 Zpr1 Duplicated dom  42.2      23 0.00058   15.2   3.0   63  360-422    38-108 (160)
 10 COG4738 Predicted transcriptio  42.2     9.5 0.00024   17.8   0.6   20  448-467    55-74  (124)
 11 cd03600 CLECT_thrombomodulin_l  41.8     5.1 0.00013   19.7  -0.8   38  435-472    55-94  (141)
 12 PRK01033 imidazole glycerol ph  41.7     6.1 0.00016   19.1  -0.4   69  412-481    18-90  (253)
 13 TIGR02992 ectoine_eutC ectoine  40.7      12  0.0003   17.2   0.9   13  428-440    62-74  (326)
 14 TIGR03549 conserved hypothetic  40.4      24 0.00062   15.0   2.6   98  359-500   120-224 (718)
 15 TIGR00311 aIF-2beta translatio  40.2      16 0.00042   16.2   1.6   22    3-24      1-22  (134)
 16 pfam07249 Cerato-platanin Cera  40.1      22 0.00055   15.4   2.2   33  468-503    65-97  (119)
 17 cd03592 CLECT_selectins_like C  39.8      11 0.00027   17.5   0.6   37  426-472    36-75  (115)
 18 pfam10369 ALS_ss_C Small subun  39.6      25 0.00064   14.9   4.0   46  429-477    27-72  (75)
 19 PRK08637 hypothetical protein;  38.3      26 0.00067   14.8   3.4   60  436-496   322-384 (386)
 20 TIGR01279 DPOR_bchN light-inde  38.3      10 0.00026   17.6   0.3  131  346-498   140-281 (458)
 21 PRK05412 nucleotide-binding pr  37.6      21 0.00055   15.4   1.9   28  446-474   133-160 (161)
 22 cd03595 CLECT_chondrolectin_li  37.0     6.4 0.00016   19.0  -0.9   39  435-473    62-101 (149)
 23 PRK02145 consensus              36.0      12  0.0003   17.2   0.3   65  412-477    19-87  (257)
 24 cd03375 TPP_OGFOR Thiamine pyr  35.8     7.2 0.00018   18.7  -0.8   56  406-476    54-110 (193)
 25 TIGR02354 thiF_fam2 thiamine b  35.6      18 0.00047   15.9   1.2   26  432-459   157-182 (200)
 26 COG3204 Uncharacterized protei  35.3      20  0.0005   15.6   1.4  117  368-486   142-278 (316)
 27 smart00116 CBS Domain in cysta  35.2      28 0.00072   14.5   2.2   29  451-479    11-40  (49)
 28 pfam03367 zf-ZPR1 ZPR1 zinc-fi  35.0      29 0.00075   14.4   3.0   62  362-423    41-110 (160)
 29 PRK01659 consensus              34.7      12  0.0003   17.2   0.2   57  423-479    28-88  (252)
 30 KOG3047 consensus               34.7      22 0.00056   15.3   1.5   32  430-461    88-120 (157)
 31 cd04606 CBS_pair_Mg_transporte  34.4      30 0.00076   14.4   2.7   78  394-481    24-103 (109)
 32 PRK00830 consensus              34.3      17 0.00043   16.1   0.9   84  407-491    17-112 (273)
 33 TIGR03578 EF_0831 conserved hy  32.3      32 0.00082   14.1   2.3   21  480-500    11-31  (96)
 34 COG2876 AroA 3-deoxy-D-arabino  32.3      16  0.0004   16.3   0.5   22  453-474   140-161 (286)
 35 PRK12702 mannosyl-3-phosphogly  32.3      32 0.00082   14.1   4.3   54  439-502   227-298 (302)
 36 PRK00748 1-(5-phosphoribosyl)-  32.0      22 0.00057   15.3   1.2   25  453-477   176-201 (241)
 37 PRK04281 consensus              31.9      20 0.00051   15.6   1.0   42  436-477    44-86  (254)
 38 pfam09837 DUF2064 Uncharacteri  30.6      23  0.0006   15.1   1.1   20  435-454    97-116 (121)
 39 cd04638 CBS_pair_25 The CBS do  30.3      35 0.00089   13.9   3.2   43  435-480    58-100 (106)
 40 pfam06089 Asparaginase_II L-as  30.3      35 0.00089   13.9   2.6   31  429-466   267-297 (325)
 41 PRK02621 consensus              29.2      31 0.00078   14.3   1.5   42  436-477    44-86  (254)
 42 PRK13597 imidazole glycerol ph  28.8      32 0.00082   14.2   1.6   66  412-478    19-88  (252)
 43 PRK03220 consensus              28.8      35  0.0009   13.9   1.8   65  412-477    19-87  (257)
 44 PRK10076 pyruvate formate lyas  28.8      37 0.00094   13.7   4.3   32  438-469    91-137 (213)
 45 KOG1383 consensus               28.7      37 0.00094   13.7   2.4   36  437-472   382-417 (491)
 46 pfam11217 DUF3013 Protein of u  28.7      29 0.00075   14.4   1.4   25  392-416    31-57  (160)
 47 PRK12595 bifunctional 3-deoxy-  28.2      33 0.00085   14.0   1.6   33  437-474   202-234 (360)
 48 COG2987 HutU Urocanate hydrata  27.5      26 0.00066   14.8   0.9   29  438-466   360-396 (561)
 49 pfam12281 DUF3620 Protein of u  27.2      39   0.001   13.6   2.0  110  364-480    47-159 (216)
 50 PRK02747 consensus              26.5      21 0.00053   15.5   0.2   67  412-479    18-88  (257)
 51 pfam07946 DUF1682 Protein of u  26.4      28 0.00072   14.5   0.9   15  470-484   212-226 (322)
 52 cd04495 BRCA2DBD_OB3 BRCA2DBD_  26.3      14 0.00035   16.7  -0.7   40  454-498    57-96  (100)
 53 PRK13415 flagella biosynthesis  26.2      41   0.001   13.4   2.1   40  407-460   105-144 (216)
 54 KOG1693 consensus               26.2      14 0.00036   16.7  -0.7  103  363-465    60-164 (209)
 55 smart00310 PTBI Phosphotyrosin  26.0      41   0.001   13.4   3.4   68  434-502    25-98  (98)
 56 PRK13396 3-deoxy-7-phosphohept  25.8      40   0.001   13.5   1.6   37  432-474   181-217 (352)
 57 pfam11322 DUF3124 Protein of u  25.8      41  0.0011   13.4   2.2   17  374-391    25-41  (126)
 58 KOG0526 consensus               25.6      42  0.0011   13.3   3.4   96  372-475   141-243 (615)
 59 pfam03425 CBM_11 Carbohydrate   25.5      34 0.00087   14.0   1.2   31  385-415   135-168 (170)
 60 PRK05211 consensus              25.2     9.8 0.00025   17.7  -1.6   65  412-477     9-77  (248)
 61 KOG3883 consensus               25.0      43  0.0011   13.3   2.4   84  392-478    41-131 (198)
 62 TIGR00400 mgtE magnesium trans  24.5      44  0.0011   13.2   4.5   76  395-479   171-247 (460)
 63 cd04731 HisF The cyclase subun  24.1      13 0.00033   16.9  -1.1   40  437-476    42-82  (243)
 64 cd03596 CLECT_tetranectin_like  24.1      31  0.0008   14.2   0.8   25  438-472    59-86  (129)
 65 pfam03180 Lipoprotein_9 NLPA l  24.0      45  0.0011   13.2   2.8   28  358-385    96-128 (236)
 66 pfam00869 Flavi_glycoprot Flav  24.0      17 0.00042   16.1  -0.6   38  437-482   193-238 (293)
 67 PRK13585 1-(5-phosphoribosyl)-  24.0      40   0.001   13.5   1.3   40  437-476    46-86  (240)
 68 PRK07006 isocitrate dehydrogen  23.9      37 0.00093   13.8   1.1   18  454-472   279-296 (409)
 69 cd07136 ALDH_YwdH-P39616 Bacil  23.7      18 0.00046   15.9  -0.5   16  456-471   423-438 (449)
 70 COG1628 Endonuclease V homolog  23.4      31 0.00079   14.3   0.7   84  406-498    22-124 (185)
 71 COG4148 ModC ABC-type molybdat  23.2      35 0.00088   13.9   0.9   30  444-473   156-187 (352)
 72 cd04732 HisA HisA.  Phosphorib  23.0      45  0.0012   13.1   1.5   40  436-475    43-83  (234)
 73 cd02072 Glm_B12_BD B12 binding  22.8      47  0.0012   13.0   2.9   38  435-472    47-88  (128)
 74 PRK07374 dnaE DNA polymerase I  22.7      47  0.0012   13.0   2.4   17  464-480  1123-1140(1171)
 75 PRK02079 pyrroloquinoline quin  22.5      48  0.0012   13.0   1.7   22  445-466    66-87  (88)
 76 KOG3107 consensus               22.4      48  0.0012   13.0   1.5   24  379-402   180-203 (468)
 77 KOG3972 consensus               22.3      47  0.0012   13.0   1.4   24    3-26     81-104 (252)
 78 pfam10794 DUF2606 Protein of u  22.1      48  0.0012   12.9   3.1   78  391-468    37-128 (131)
 79 cd04643 CBS_pair_30 The CBS do  22.0      49  0.0012   12.9   3.3   41  435-480    69-110 (116)
 80 PRK06826 dnaE DNA polymerase I  21.9      49  0.0012   12.9   2.3   40  457-497  1107-1148(1157)
 81 COG3814 Uncharacterized protei  21.9      44  0.0011   13.2   1.2   12  434-445    75-86  (157)
 82 TIGR03572 WbuZ glycosyl amidat  21.9      16  0.0004   16.3  -1.1   65  412-477    18-86  (232)
 83 pfam00948 Flavi_NS1 Flavivirus  21.8      46  0.0012   13.1   1.3   51  360-411   183-252 (355)
 84 CHL00073 chlN photochlorophyll  21.7      35  0.0009   13.9   0.7   12  394-405   415-426 (457)
 85 PRK06273 ferredoxin; Provision  21.5      45  0.0011   13.1   1.2   25  406-440    55-83  (163)
 86 TIGR02318 phosphono_phnM phosp  21.4      50  0.0013   12.8   3.1   51  437-503   162-215 (391)
 87 pfam03452 Anp1 Anp1. The membe  21.4      50  0.0013   12.8   2.6   77  392-470    55-156 (270)
 88 cd04582 CBS_pair_ABC_OpuCA_ass  20.9      51  0.0013   12.7   3.5   33  448-480    67-100 (106)
 89 TIGR01858 tag_bisphos_ald clas  20.9      38 0.00098   13.6   0.7   44  457-500   192-247 (282)
 90 cd01339 LDH-like_MDH L-lactate  20.8      23 0.00059   15.1  -0.4   45  380-424   235-287 (300)
 91 cd03557 L-arabinose_isomerase   20.7      52  0.0013   12.7   2.2  108  331-443   270-400 (484)
 92 PRK05673 dnaE DNA polymerase I  20.5      52  0.0013   12.7   2.4   44  458-501  1092-1138(1143)
 93 COG2014 Uncharacterized conser  20.4      22 0.00055   15.4  -0.6   23  458-480    98-121 (250)
 94 COG4742 Predicted transcriptio  20.2      18 0.00046   15.9  -1.1   22  431-452   207-228 (260)

No 1  
>PHA02107 hypothetical protein
Probab=79.90  E-value=4.6  Score=20.03  Aligned_cols=49  Identities=22%  Similarity=0.171  Sum_probs=35.9

Q ss_pred             EEECCCEE------------EEECCC-CCHHHH-HHHHHHHCCCCEEEEEEECCCCEEEEEEC
Q ss_conf             67469828------------998178-825688-77999730781168788379967899961
Q gi|254780612|r  433 FVISKNSY------------LISLKG-SEEDPF-RNSKILEEYRFIDIPITYRSGQKILFTID  481 (503)
Q Consensus       433 ~kv~~~~f------------~v~l~~-~~~~~~-~n~~ll~~~~w~dip~~y~ngrra~~~~e  481 (503)
                      +||.+++|            .||||- ++.|-. .-|+||+.++-+----.-+|||--|.+|-
T Consensus        62 ~KI~PN~~N~~A~~~~sV~G~IW~NT~FEsDC~~~RL~~L~~YG~~V~i~~~~NG~N~iV~FS  124 (216)
T PHA02107         62 AKISPNHINLAAVGTHSVCGTIWVNTSFESDCVTNRLQTLEAYGVFVEVARRTNGSNHVVTFS  124 (216)
T ss_pred             CCCCCCEEEEECCCCEEEEEEEEEECHHHHHHHHHHHHHHHHHCEEEEEEECCCCCCCEEEEE
T ss_conf             313777044411133688778997223267789989999986072899984158865169999


No 2  
>pfam03725 RNase_PH_C 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase) contains two tandem copies of the domain. PNPase is involved in mRNA degradation in a 3'-5' direction. The exosome is a 3'-5' exoribonuclease complex that is required for 3' processing of the 5.8S rRNA. Three of its five protein components contain a copy of this domain. A hypothetical protein from S. pombe appears to belong to an uncharacterized subfamily. This subfamily is found in both eukaryotes and archaebacteria.
Probab=62.96  E-value=11  Score=17.41  Aligned_cols=58  Identities=12%  Similarity=0.068  Sum_probs=32.4

Q ss_pred             CCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCH---HHHHHHHHH
Q ss_conf             311001674698289981788256887799973078116878837996789996158768---999999999
Q gi|254780612|r  427 LIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTIDKGKKG---ADIFKSAIM  495 (503)
Q Consensus       427 ~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g---~~~f~~a~~  495 (503)
                      +.+++.+=+-||.|+|-++.-+...        ..+.+  .|+| ||.+-++++=|+..+   +.-|.++|.
T Consensus         2 ~v~avtvg~i~~~~i~dpt~~Ee~~--------~~~~l--~v~~-~~~~~~v~~~~~g~~~l~~~~l~~~l~   62 (67)
T pfam03725         2 PVAAVTVGKIDGELVLDPTLEEESL--------ADSDL--TIVV-AGTGDIVALMKEGGGGLTEEELLEALE   62 (67)
T ss_pred             CEEEEEEEEECCEEEECCCHHHHHH--------HCCEE--EEEE-ECCCCEEEEECCCCCCCCHHHHHHHHH
T ss_conf             8899999999999999989678633--------14736--8999-689999999918997859999999999


No 3  
>cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria.  The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair.  The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here.  It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=61.89  E-value=10  Score=17.63  Aligned_cols=44  Identities=5%  Similarity=0.068  Sum_probs=32.8

Q ss_pred             ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEEC
Q ss_conf             4698289981788256887799973078116878837996-7899961
Q gi|254780612|r  435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTID  481 (503)
Q Consensus       435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~e  481 (503)
                      -|.+.+-+--.+.   ...=+++|.+++.-.+|++..||| .+|||+-
T Consensus        65 Mt~~vitv~~~~~---i~~a~~~M~~~~i~~lPVVD~~~~~vGiit~~  109 (114)
T cd04619          65 MTRAVVSCRPGDL---LHDVWQVMKQRGLKNIPVVDENARPLGVLNAR  109 (114)
T ss_pred             CCCCCEEECCCCC---HHHHHHHHHHCCCCEEEEEECCCEEEEEEEHH
T ss_conf             1589889999890---99999988876985889990799599999935


No 4  
>KOG2915 consensus
Probab=59.83  E-value=12  Score=17.25  Aligned_cols=76  Identities=16%  Similarity=0.144  Sum_probs=46.4

Q ss_pred             EEEEECCCCCCCCCE-EEEEEEEEE-CCC-C-CCCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHH
Q ss_conf             999951789877781-288999971-688-8-873211032310213321246631100167469828998178825688
Q gi|254780612|r  377 ASMTLKCNADISLSI-THVMEIMFS-FPK-E-SQDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPF  452 (503)
Q Consensus       377 ~~~~~rrn~d~slpa-sh~~e~~f~-~~~-~-~~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~  452 (503)
                      ++++-|-=--..+.. |++++-+|. +|. . .-.--.+                     +-..+|-.|+-.|-.=+-.+
T Consensus       159 vt~~hrDVc~~GF~~ks~~aDaVFLDlPaPw~AiPha~~---------------------~lk~~g~r~csFSPCIEQvq  217 (314)
T KOG2915         159 VTVTHRDVCGSGFLIKSLKADAVFLDLPAPWEAIPHAAK---------------------ILKDEGGRLCSFSPCIEQVQ  217 (314)
T ss_pred             EEEEEEECCCCCCCCCCCCCCEEEECCCCHHHHHHHHHH---------------------HHHHCCCEEEECCHHHHHHH
T ss_conf             489996415677353134235699758980552233677---------------------75406825996217899999


Q ss_pred             HHHHHHHCCCCEEEEEEECCC
Q ss_conf             779997307811687883799
Q gi|254780612|r  453 RNSKILEEYRFIDIPITYRSG  473 (503)
Q Consensus       453 ~n~~ll~~~~w~dip~~y~ng  473 (503)
                      |-.++|+.++||||-|+---+
T Consensus       218 rtce~l~~~gf~~i~~vEv~~  238 (314)
T KOG2915         218 RTCEALRSLGFIEIETVEVLL  238 (314)
T ss_pred             HHHHHHHHCCCCEEEEEEEEH
T ss_conf             999999867982379987603


No 5  
>pfam08016 PKD_channel Polycystin cation channel. This family contains the cation channel region of PKD1 and PKD2 proteins.
Probab=58.51  E-value=13  Score=16.89  Aligned_cols=58  Identities=12%  Similarity=0.183  Sum_probs=44.6

Q ss_pred             CEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEE
Q ss_conf             1021332124663110016746982899817882568877999730781168788379967899
Q gi|254780612|r  415 ISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILF  478 (503)
Q Consensus       415 ~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~  478 (503)
                      -..+..+.-.|.+-.|.-.-=..|=|.+-|+....+-.+.|+-||..+|||      .+-||++
T Consensus       101 w~y~~~~~l~~~~~~G~~~~Y~gGGYv~~L~~~~~~~~~~l~~L~~~~WiD------~~TRAvf  158 (423)
T pfam08016       101 RIYSPPNLLMGAWHWGHLASYPSGGYVVLLPLSREESLKRLAYLQDHNWLD------RRTRAVF  158 (423)
T ss_pred             CEEECHHHHCCCCEEEEEEEECCCCEEEECCCCHHHHHHHHHHHHHCCCCC------CCCCEEE
T ss_conf             755177881887517677898899889978999999999999998768700------0452799


No 6  
>KOG0933 consensus
Probab=44.57  E-value=7.1  Score=18.72  Aligned_cols=32  Identities=16%  Similarity=0.354  Sum_probs=23.8

Q ss_pred             EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCC----CEEEEEE
Q ss_conf             28998178825688779997307811687883799----6789996
Q gi|254780612|r  439 SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSG----QKILFTI  480 (503)
Q Consensus       439 ~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ng----rra~~~~  480 (503)
                      .|..|++|+..-|+.|||=|          +|.||    -+|..|+
T Consensus        46 CFvLGI~nl~~VRA~nlqeL----------Iyk~GQAGiTkAsVsV   81 (1174)
T KOG0933          46 CFVLGITNLSQVRASNLQEL----------IYKNGQAGITKASVSV   81 (1174)
T ss_pred             HHHHCCCHHHHHHHHHHHHH----------HHHCCCCCCEEEEEEE
T ss_conf             99982452888888789999----------9846866642579999


No 7  
>PRK09532 DNA polymerase III subunit alpha; Reviewed
Probab=44.15  E-value=19  Score=15.66  Aligned_cols=17  Identities=18%  Similarity=0.249  Sum_probs=5.8

Q ss_pred             ECCCCCHHHHHHHHHHHC
Q ss_conf             817882568877999730
Q gi|254780612|r  443 SLKGSEEDPFRNSKILEE  460 (503)
Q Consensus       443 ~l~~~~~~~~~n~~ll~~  460 (503)
                      ||.++..= .+=+.|+++
T Consensus       568 GL~~Lt~I-~~a~~lI~~  584 (874)
T PRK09532        568 GLKNLTMI-QKTADLIKE  584 (874)
T ss_pred             CCCHHHHH-HHHHHHHHH
T ss_conf             55768899-999999998


No 8  
>TIGR02129 hisA_euk phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase; InterPro: IPR011858    This enzyme acts in the biosynthesis of histidine and has been characterised in Saccharomyces cerevisiae  and Arabidopsis  where it complements the Escherichia coli HisA gene. In eukaryotes the gene is known as HIS6. In bacteria, this gene is found in Fibrobacter succinogenes, presumably due to lateral gene transfer from plants in the rumen gut.; GO: 0003949 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase activity, 0000105 histidine biosynthetic process.
Probab=43.04  E-value=11  Score=17.50  Aligned_cols=16  Identities=44%  Similarity=0.821  Sum_probs=12.4

Q ss_pred             CCEEEEEEECCCCEEE
Q ss_conf             8116878837996789
Q gi|254780612|r  462 RFIDIPITYRSGQKIL  477 (503)
Q Consensus       462 ~w~dip~~y~ngrra~  477 (503)
                      ..+||||||+.|=|-|
T Consensus       219 ~~~~ip~tYAGGakS~  234 (274)
T TIGR02129       219 VYDDIPVTYAGGAKSI  234 (274)
T ss_pred             CCCCCCEEEECCCEEH
T ss_conf             8752431772786556


No 9  
>smart00709 Zpr1 Duplicated domain in the epidermal growth factor- and elongation factor-1alpha-binding protein Zpr1. Also present in archaeal proteins.
Probab=42.24  E-value=23  Score=15.18  Aligned_cols=63  Identities=11%  Similarity=0.161  Sum_probs=41.7

Q ss_pred             CCCEEEEEEECCCCCEEEEEEECCCCCC------CCCEEE-EEEEEEECCCCC-CCCHHHCCCCEECCCCC
Q ss_conf             6742367630787771799995178987------778128-899997168888-73211032310213321
Q gi|254780612|r  360 KGLVIKGDIPMIDNAFSASMTLKCNADI------SLSITH-VMEIMFSFPKES-QDAVVDLRRISMRKTDN  422 (503)
Q Consensus       360 ~~~aira~i~iP~~~~~~~~~~rrn~d~------slpash-~~e~~f~~~~~~-~~~v~~v~~~~~k~~e~  422 (503)
                      .-.-|+.--.|.++|.+.+|.++--.|.      |=-|+- +=||-|.++.+. +|-|.||-||+++--++
T Consensus        38 r~~ev~~~~~~~p~G~r~tl~V~~~~DL~r~VvkS~tati~IPEL~~ei~pg~~~G~vTTVEGlL~r~~~~  108 (160)
T smart00709       38 RNNEVKSGGAIEPKGTRITLKVESPEDLNRDVVKSETATISIPELDLEIPPGPLGGFITTVEGLLSRVREV  108 (160)
T ss_pred             CCEEEEECCCCCCCEEEEEEEECCHHHCCCEEEECCCEEEEEEEEEEEECCCCCCCEEECHHHHHHHHHHH
T ss_conf             52378887666896189999988979908607972865899854303766977686798189999999999


No 10 
>COG4738 Predicted transcriptional regulator [Transcription]
Probab=42.18  E-value=9.5  Score=17.82  Aligned_cols=20  Identities=15%  Similarity=0.179  Sum_probs=14.9

Q ss_pred             CHHHHHHHHHHHCCCCEEEE
Q ss_conf             25688779997307811687
Q gi|254780612|r  448 EEDPFRNSKILEEYRFIDIP  467 (503)
Q Consensus       448 ~~~~~~n~~ll~~~~w~dip  467 (503)
                      ..++.--+.+||+++|||+=
T Consensus        55 QPEVSiAMr~Lre~gWV~~R   74 (124)
T COG4738          55 QPEVSIAMRYLRENGWVDER   74 (124)
T ss_pred             CCHHHHHHHHHHHCCCCCHH
T ss_conf             83068999999875630067


No 11 
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved.  TM exerts anti-fibrinolytic and anti-inflammatory activity.  TM also regulates blood coagulation in the anticoagulant protein C pathway.  In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM.  TM also plays a key role in tumor biology.  It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma.  Loss of TM expression correlates with advanced stage and poor prognosis.  Loss of function of TM func
Probab=41.84  E-value=5.1  Score=19.72  Aligned_cols=38  Identities=11%  Similarity=0.087  Sum_probs=24.2

Q ss_pred             ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEE--EEEECC
Q ss_conf             46982899817882568877999730781168--788379
Q gi|254780612|r  435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDI--PITYRS  472 (503)
Q Consensus       435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~di--p~~y~n  472 (503)
                      .++..|||||++......-.+..|+.-.|+|=  ++.|.|
T Consensus        55 ~~~~~~WIGL~~~~~~~~~~~~~~~~f~W~~g~~~~~y~n   94 (141)
T cd03600          55 RGSLRLWIGLQREPRQCSDPSLPLRGFSWVTGDQDTDFSN   94 (141)
T ss_pred             CCCCEEEEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCC
T ss_conf             6552177631013444556656656537788998777667


No 12 
>PRK01033 imidazole glycerol phosphate synthase subunit HisF; Provisional
Probab=41.66  E-value=6.1  Score=19.15  Aligned_cols=69  Identities=12%  Similarity=0.164  Sum_probs=48.5

Q ss_pred             CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEEEEC
Q ss_conf             323102133212466311001---67469828998178825688779997307-81168788379967899961
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILFTID  481 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~~~e  481 (503)
                      |+.+.||... --|+|+.-+-   -.=.|..++|-|+.....+..|+.++++- +=+++||.|..|=|.+-.++
T Consensus        18 Vr~~~f~~~~-~~gdP~~~ak~f~~~GadelhivDld~a~~g~~~n~~~I~~I~~~~~ipi~vGGGIrs~e~~~   90 (253)
T PRK01033         18 VKTVKFKDPK-YVGDPINAVRIFNEKEADELIVLDIDASRKGREPNYELIENLASECFMPLCYGGGIKTVEQAK   90 (253)
T ss_pred             EECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCHHHHH
T ss_conf             9785577674-888999999999987999899994745424880169999999987699889868812168889


No 13 
>TIGR02992 ectoine_eutC ectoine utilization protein EutC; InterPro: IPR014334   Members of this entry are EutA, a predicted arylmalonate decarboxylase found in a conserved ectoine utilization operon of species that include Mesorhizobium loti, Silicibacter pomeroyi, Agrobacterium tumefaciens, Pseudomonas putida and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine). Members belong to the ornithine cyclodeaminase/mu-crystallin family..
Probab=40.74  E-value=12  Score=17.17  Aligned_cols=13  Identities=15%  Similarity=0.276  Sum_probs=9.8

Q ss_pred             CCCEEEEECCCEE
Q ss_conf             1100167469828
Q gi|254780612|r  428 IDSNIFVISKNSY  440 (503)
Q Consensus       428 l~~~~~kv~~~~f  440 (503)
                      |.+-+.||++|||
T Consensus        62 ld~FAiK~SPGFF   74 (326)
T TIGR02992        62 LDGFAIKVSPGFF   74 (326)
T ss_pred             CCCCEEEECCCCC
T ss_conf             7565155257723


No 14 
>TIGR03549 conserved hypothetical protein TIGR03549. This family consists of remarkably well-conserved proteins from gamma and beta Proteobacteria, heavily skewed towards organisms of marine environments. This family has an OsmC-like N-terminal domain. It shares a central domain, modeled by pfam02624 and TIGR00702, with other families of smaller proteins. The function is unknown. Fifteen of the first sixteen members of this family are from selenouridine-positive genomes, but this correlation may be fortuitous.
Probab=40.41  E-value=24  Score=14.99  Aligned_cols=98  Identities=16%  Similarity=0.211  Sum_probs=62.3

Q ss_pred             CCCCEEEEEEECCCCCEEEEEEECCCCCCCCCEEEEEEEEEECCCCC--CCCHHHCCCC----EECCCCCCCCCCCCCE-
Q ss_conf             76742367630787771799995178987778128899997168888--7321103231----0213321246631100-
Q gi|254780612|r  359 LKGLVIKGDIPMIDNAFSASMTLKCNADISLSITHVMEIMFSFPKES--QDAVVDLRRI----SMRKTDNSPSVLIDSN-  431 (503)
Q Consensus       359 ~~~~aira~i~iP~~~~~~~~~~rrn~d~slpash~~e~~f~~~~~~--~~~v~~v~~~----~~k~~e~~~g~~l~~~-  431 (503)
                      +++-.|. .|+.=|.+-.+.|+++...|.+         +||..++-  -..|.|++++    .||-.|.+=-.|+-.| 
T Consensus       120 ~p~f~ie-~v~~l~~da~~~l~~~~~~~~~---------TfI~GKDa~LE~sIa~m~~~L~~LGf~IEeaSWlNpVpnVW  189 (718)
T TIGR03549       120 GPEFQIE-TVENLDADAQALLMGTPDSGAS---------TYIEGKDLPLEQTIANMTAILADLGMKIEIASWRNIVPNVW  189 (718)
T ss_pred             CCCEEEE-EHHCCCHHHHHHHCCCCCCCCC---------EEECCCCCCHHHHHHHHHHHHHHCCCCCEEHHHCCCCCCCC
T ss_conf             8844775-5302163578772578898874---------13237874389999999999987486311054337888720


Q ss_pred             EEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHHHHHC
Q ss_conf             167469828998178825688779997307811687883799678999615876899999999998522
Q gi|254780612|r  432 IFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTIDKGKKGADIFKSAIMQWENR  500 (503)
Q Consensus       432 ~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g~~~f~~a~~~w~~~  500 (503)
                      +|.|.|                          -|-||.++||        ||.+-+.|...|++..--+
T Consensus       190 SvHIRD--------------------------~~cP~cFTNG--------KGatk~aALASALGE~~ER  224 (718)
T TIGR03549       190 SLHIRD--------------------------AASPMCFTNG--------KGATKESALCSALGEFIER  224 (718)
T ss_pred             EEEEEC--------------------------CCCCCCCCCC--------CCCCHHHHHHHHHHHHHHH
T ss_conf             587515--------------------------8885220378--------7501888999999999998


No 15 
>TIGR00311 aIF-2beta translation initiation factor aIF-2, beta subunit, putative; InterPro: IPR004458   Archaeal transcription initiation factor 2 is, like its eukaryotic homologue, a heterotrimeric protein with alpha, beta and gamma subunits . It is thought play an essential role in the recognition of the correct codon for the start of translation, similar to the role of eukaryotic transcription initiation factor. The eukaryotic factor forms a ternary complex with the methionine initiator tRNA and GTP, which binds to the 40S ribosomal subunit as part of the 43S preinitiation complex . The beta and gamma subunits are responsible for recruiting the initiator tRNA and GTP, while the alpha subunit is involved in the regulation of the translation initiation process. The beta subunit has also been shown to interact with other transcription factors. All three subunits of the archaeal and eukaryotic factors are well conserved among the diverse species of eukaryotes and archaea, but do not occur in prokaryotes.    The archaeal beta subunit has an unfolded N-terminal domain, a mixed alpha/beta core domain and a C-terminal zinc finger . The N-terminal region is thought to interact with the gamma subunit, while the central and C-terminal domains are thought to provide RNA-binding sites.; GO: 0003743 translation initiation factor activity, 0006413 translational initiation.
Probab=40.20  E-value=16  Score=16.18  Aligned_cols=22  Identities=32%  Similarity=0.708  Sum_probs=18.6

Q ss_pred             CHHHHHHHHHHCCCCCCCHHHH
Q ss_conf             1799999886167765600013
Q gi|254780612|r    3 DFILVIQRAVDNLPENTPERRS   24 (503)
Q Consensus         3 D~~~~l~kai~~l~~~tpe~r~   24 (503)
                      ||-.||+||++-|++..=+.+.
T Consensus         1 dYe~LLeRA~~~lpd~vf~r~~   22 (134)
T TIGR00311         1 DYEELLERAIEQLPDKVFERKE   22 (134)
T ss_pred             CHHHHHHHHHHHCCCCCCCCCC
T ss_conf             9578899988736841013154


No 16 
>pfam07249 Cerato-platanin Cerato-platanin. This family contains a number of fungal cerato-platanin phytotoxic proteins approximately 150 residues long. Cerato-platanin contains four cysteine residues that form two disulphide bonds.
Probab=40.12  E-value=22  Score=15.36  Aligned_cols=33  Identities=24%  Similarity=0.418  Sum_probs=23.5

Q ss_pred             EEECCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCC
Q ss_conf             883799678999615876899999999998522369
Q gi|254780612|r  468 ITYRSGQKILFTIDKGKKGADIFKSAIMQWENRSNN  503 (503)
Q Consensus       468 ~~y~ngrra~~~~ekg~~g~~~f~~a~~~w~~~~~~  503 (503)
                      |+|.+...-||+|.....|   ||=+..|-...||+
T Consensus        65 ltyng~sI~vlaID~A~~G---Fni~~~Am~~LT~g   97 (119)
T pfam07249        65 VTYNGNTIFVLAIDSAGGG---FNIAKTAMNELTNG   97 (119)
T ss_pred             EEECCEEEEEEEEECCCCC---CEECHHHHHHHHCC
T ss_conf             8989928999998158876---16529999987577


No 17 
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins:  P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins:  P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration.  L- sel is expressed constitutively on most leukocytes.  P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets.  E- sels are present on endothelial cells.  Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=39.80  E-value=11  Score=17.47  Aligned_cols=37  Identities=16%  Similarity=0.298  Sum_probs=22.0

Q ss_pred             CCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEE---EEEEECC
Q ss_conf             6311001674698289981788256887799973078116---8788379
Q gi|254780612|r  426 VLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFID---IPITYRS  472 (503)
Q Consensus       426 ~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~d---ip~~y~n  472 (503)
                      +-|..+.-+..+++|+|||+++..          +-.|+|   -++.|.|
T Consensus        36 ~~l~~~~~~~~~~~~WiG~~d~~~----------~~~W~~~~g~~~~y~n   75 (115)
T cd03592          36 ALLNGFALKYNLGYYWIDGNDINN----------EGTWVDTDKKELEYKN   75 (115)
T ss_pred             HHHHHHHHCCCCCCEEEEEECCCC----------CCEEEECCCCCCCCCC
T ss_conf             999999861288758987153488----------7879939899545076


No 18 
>pfam10369 ALS_ss_C Small subunit of acetolactate synthase. ALS_ss_C is the C-terminal half of a family of proteins which are the small subunits of acetolactate synthase. Acetolactate synthase is a tetrameric enzyme, containing probably two large and two small subunits, which catalyses the first step in branched-chain amino acid biosynthesis. This reaction is sensitive to certain herbicides.
Probab=39.60  E-value=25  Score=14.91  Aligned_cols=46  Identities=22%  Similarity=0.381  Sum_probs=37.4

Q ss_pred             CCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEE
Q ss_conf             1001674698289981788256887799973078116878837996789
Q gi|254780612|r  429 DSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKIL  477 (503)
Q Consensus       429 ~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~  477 (503)
                      -|-.+-+++++++|=|++.....+.-+.+|+..+-+.   +-+.|+-||
T Consensus        27 ~a~ivd~~~~~~iiE~tG~~~kid~fi~~l~~~gI~E---ivRtG~vAi   72 (75)
T pfam10369        27 RAKIVDVTPDSYTIELTGDPEKIDAFIDLLRPFGILE---VVRTGVVAI   72 (75)
T ss_pred             CCEEEEECCCEEEEEEECCHHHHHHHHHHHHHHCCEE---EEECCHHHC
T ss_conf             9899970388899999399899999999876609799---995387342


No 19 
>PRK08637 hypothetical protein; Provisional
Probab=38.34  E-value=26  Score=14.78  Aligned_cols=60  Identities=17%  Similarity=0.197  Sum_probs=38.9

Q ss_pred             CCCEE-EEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEE--EEECCCCCHHHHHHHHHHH
Q ss_conf             69828-9981788256887799973078116878837996789--9961587689999999999
Q gi|254780612|r  436 SKNSY-LISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKIL--FTIDKGKKGADIFKSAIMQ  496 (503)
Q Consensus       436 ~~~~f-~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~--~~~ekg~~g~~~f~~a~~~  496 (503)
                      ..||| +|.|++...+..| .-||.+.+-.=|+.--.++|.++  |+.|+=.-+-+++.+++..
T Consensus       322 ~~G~F~~i~~~~~~~e~l~-~~Lle~~~V~~v~~g~~~iRIs~aals~~nI~~~~~~I~~al~~  384 (386)
T PRK08637        322 NSGYFMCLKLKDVDAEELR-VHLLEKYGIGVIALNETDLRIAFSCVEEEDIPHVFDSIAKAIDD  384 (386)
T ss_pred             CCEEEEEECCCCCCHHHHH-HHHHHCCCEEEEECCCCCEEEEEEECCHHHHHHHHHHHHHHHHH
T ss_conf             7615886479989999999-99986395899955999779999858899999999999999985


No 20 
>TIGR01279 DPOR_bchN light-independent protochlorophyllide reductase, N subunit; InterPro: IPR005970    Synonym: dark protochlorophyllide reductase    Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. The light-independent (dark) form of protochlorophyllide reductase plays a key role in the ability of gymnosperms, algae, and photosynthetic bacteria to form chlorophyll in the dark. Genetic and sequence analyses have indicated that dark protochlorophyllide reductase consists of three protein subunits that exhibit significant sequence similarity to the three subunits of nitrogenase, which catalyzes the reductive formation of ammonia from dinitrogen. Dark protochlorophyllide reductase activity was shown to be dependent on the presence of all three subunits, ATP, and the reductant dithionite.    This enzyme describes the N subunit of the dark form protochlorophyllide reductase, a nitrogenase-like enzyme . ; GO: 0016730 oxidoreductase activity acting on iron-sulfur proteins as donors, 0015995 chlorophyll biosynthetic process, 0019685 photosynthesis dark reaction.
Probab=38.26  E-value=10  Score=17.58  Aligned_cols=131  Identities=12%  Similarity=0.047  Sum_probs=76.8

Q ss_pred             CCCCEEECCCCCCCCCCEEEEEEECCCCCEEEEEEECCCCCCCCCEEEEE---EEEEECCCCCCCCHHHCCCCEECCCCC
Q ss_conf             62212304577777674236763078777179999517898777812889---999716888873211032310213321
Q gi|254780612|r  346 KILWSLQQEKSQGLKGLVIKGDIPMIDNAFSASMTLKCNADISLSITHVM---EIMFSFPKESQDAVVDLRRISMRKTDN  422 (503)
Q Consensus       346 sVvWs~~~es~~~~~~~aira~i~iP~~~~~~~~~~rrn~d~slpash~~---e~~f~~~~~~~~~v~~v~~~~~k~~e~  422 (503)
                      +|--.|+-.=+..   .-+.|+=.. |++-.-.|||=|..|..-=-|...   .|+|+      |+|.+.=-=.|+..=.
T Consensus       140 ~VL~AL~P~~P~s---a~apa~e~~-E~~~~~~~~FG~~~~~EKvese~~~~~~lv~~------GSv~D~~~~~l~~e~~  209 (458)
T TIGR01279       140 TVLAALVPFCPKS---AEAPAEEKV-ERKKQKLLSFGKKADKEKVESEAVDQRKLVLV------GSVNDIVADQLRLELK  209 (458)
T ss_pred             HHHHHHCCCCCCC---CCCCCCCCH-HHHCCHHHHCCCHHHHHHHHHHCCCCCCEEEE------ECCCCHHHHHHHHHHH
T ss_conf             9999631257887---769898711-22122121136301122221000278656899------4355123677999998


Q ss_pred             CCCCCCCCE-EEE-------ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHH
Q ss_conf             246631100-167-------469828998178825688779997307811687883799678999615876899999999
Q gi|254780612|r  423 SPSVLIDSN-IFV-------ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTIDKGKKGADIFKSAI  494 (503)
Q Consensus       423 ~~g~~l~~~-~~k-------v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g~~~f~~a~  494 (503)
                      --|-+..|. |.+       |.+|++++.|+=+=.+-.  .+|.|+|.==-|          ==-|==|+.|.+.|=++|
T Consensus       210 ~LGi~~~~flPa~~~~~lP~i~pgt~~a~~qPyL~~ta--~~L~~eR~~~~L----------~~~FPfGPDGT~~f~e~i  277 (458)
T TIGR01279       210 QLGIPVVGFLPASRFTELPVIGPGTVVAPLQPYLSDTA--TTLRRERGAKVL----------SALFPFGPDGTRRFLEAI  277 (458)
T ss_pred             HCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHH--HHHHHHCCCEEE----------CCCCCCCCCHHHHHHHHH
T ss_conf             37886434277433455768798616983374188999--999850266561----------157898851689999999


Q ss_pred             HHHH
Q ss_conf             9985
Q gi|254780612|r  495 MQWE  498 (503)
Q Consensus       495 ~~w~  498 (503)
                      .+--
T Consensus       278 a~~f  281 (458)
T TIGR01279       278 AAEF  281 (458)
T ss_pred             HHHH
T ss_conf             9985


No 21 
>PRK05412 nucleotide-binding protein; Reviewed
Probab=37.63  E-value=21  Score=15.37  Aligned_cols=28  Identities=11%  Similarity=0.178  Sum_probs=23.5

Q ss_pred             CCCHHHHHHHHHHHCCCCEEEEEEECCCC
Q ss_conf             88256887799973078116878837996
Q gi|254780612|r  446 GSEEDPFRNSKILEEYRFIDIPITYRSGQ  474 (503)
Q Consensus       446 ~~~~~~~~n~~ll~~~~w~dip~~y~ngr  474 (503)
                      -...|.+.=++|||+.+ ||+||.|.|=|
T Consensus       133 KkrDDLQ~vi~llk~~d-~~~pLQF~NfR  160 (161)
T PRK05412        133 KKRDDLQAVIALLRGAD-LGQPLQFNNFR  160 (161)
T ss_pred             CCHHHHHHHHHHHHHCC-CCCCCEECCCC
T ss_conf             98889999999998327-79971133688


No 22 
>cd03595 CLECT_chondrolectin_like C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CLECT_chondrolectin_like: C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  CHODL is predominantly expressed in muscle cells and is associated with T-cell maturation.  Various alternatively spliced isoforms have been of CHODL have been identified.  The transmembrane form of CHODL is localized in the ER-Golgi apparatus.  Layilin is widely expressed in different cell types.  The extracellular CTLD of layilin binds hyaluronan (HA), a major constituent of the extracellular matrix (ECM).  The cytoplasmic tail of layilin binds various members of the band 4.1/ERM superfamily (talin, radixin, and merlin).  The ERM proteins are cytoskeleton-membrane l
Probab=36.98  E-value=6.4  Score=19.01  Aligned_cols=39  Identities=23%  Similarity=0.297  Sum_probs=22.0

Q ss_pred             ECCCEEEEECCCCCHHHHHHHHHHHCCCCEE-EEEEECCC
Q ss_conf             4698289981788256887799973078116-87883799
Q gi|254780612|r  435 ISKNSYLISLKGSEEDPFRNSKILEEYRFID-IPITYRSG  473 (503)
Q Consensus       435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~d-ip~~y~ng  473 (503)
                      .+++.|||||.+....-......-..-.|.| -|+.|+|-
T Consensus        62 ~~~~~~WIGL~~~~~~~~~~~~~~~~f~W~DGs~~~y~nW  101 (149)
T cd03595          62 ASDGDFWIGLRRSSQYNVTSSACSSLYYWLDGSISTFRNW  101 (149)
T ss_pred             CCCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCCCCCC
T ss_conf             6666347876622666544477774059779981143576


No 23 
>PRK02145 consensus
Probab=36.00  E-value=12  Score=17.22  Aligned_cols=65  Identities=9%  Similarity=0.107  Sum_probs=32.0

Q ss_pred             CCCCEECCCCCCCCCCCCCE---EEEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE
Q ss_conf             32310213321246631100---167469828998178825688779997307-8116878837996789
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSN---IFVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL  477 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~---~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~  477 (503)
                      |+++.++... .-|+||.-+   ...=.|..|+|-|+.....+..|+.|++.- .=++|||.|..|=|.+
T Consensus        19 Vkg~~~~~~~-~~gdP~~~a~~~~~~GadelhivDld~a~~~~~~~~~~I~~i~~~~~iPi~vGGGIrs~   87 (257)
T PRK02145         19 VKGVNFVELR-DAGDPVEIARRYDEQGADELTFLDITATSDGRDLILPIIEAVASQVFIPLTVGGGVRAV   87 (257)
T ss_pred             EECCCCCCEE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCH
T ss_conf             8577777738-88899999999998799989999788876675408999999996568748962773046


No 24 
>cd03375 TPP_OGFOR Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor. In the TCA cycle, OGFOR catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. In the reductive tricarboxylic acid cycle found in the anaerobic autotroph Hydrogenobacter thermophilus, OGFOR catalyzes the reductive carboxylation of succinyl-CoA to produce 2-oxoglutarate. Thauera aromatica OGFOR has been shown to provide reduced ferredoxin to benzoyl-CoA reductase, a key enzyme in the anaerobic metabolism of aromatic compounds. OGFOR is dependent on TPP and a divalent metal cation for activity.
Probab=35.79  E-value=7.2  Score=18.67  Aligned_cols=56  Identities=14%  Similarity=-0.081  Sum_probs=34.5

Q ss_pred             CCCHHHCCCCEECCCCCCCCCCCCCEEEEECCC-EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEE
Q ss_conf             732110323102133212466311001674698-28998178825688779997307811687883799678
Q gi|254780612|r  406 QDAVVDLRRISMRKTDNSPSVLIDSNIFVISKN-SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKI  476 (503)
Q Consensus       406 ~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~-~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra  476 (503)
                      |.++.--.||.+       +.+=.-|-+-+-|| ||-|||+.+-....||..++        -|+|.|+--+
T Consensus        54 G~a~~~A~G~k~-------a~p~~~Vva~~GDG~~~~iG~~~l~~A~~rn~~i~--------~iv~DN~~yg  110 (193)
T cd03375          54 GRALAVATGVKL-------ANPDLTVIVVSGDGDLAAIGGNHFIHAARRNIDIT--------VIVHNNQIYG  110 (193)
T ss_pred             CCCHHHHHHHHH-------HCCCCEEEEEEECCHHHHCCHHHHHHHHHHCCCEE--------EEEECCCCEE
T ss_conf             160888867987-------59997399997065576456188999998099749--------9998584044


No 25 
>TIGR02354 thiF_fam2 thiamine biosynthesis protein ThiF; InterPro: IPR012729    Members of the HesA/MoeB/ThiF family of proteins (IPR000594 from INTERPRO) include a number of members encoded in the midst of thiamine biosynthetic operons. This mix of known and putative ThiF proteins shows a deep split in phylogenetic trees. The Escherichia coli ThiF and MoeB proteins are seemingly more closely related than the E. coli ThiF and Campylobacter (for example) ThiF. This entry represents the divergent clade of putative ThiF proteins as found in Campylobacter..
Probab=35.64  E-value=18  Score=15.85  Aligned_cols=26  Identities=19%  Similarity=0.238  Sum_probs=17.5

Q ss_pred             EEEECCCEEEEECCCCCHHHHHHHHHHH
Q ss_conf             1674698289981788256887799973
Q gi|254780612|r  432 IFVISKNSYLISLKGSEEDPFRNSKILE  459 (503)
Q Consensus       432 ~~kv~~~~f~v~l~~~~~~~~~n~~ll~  459 (503)
                      +=||++.||++|  |...+...=..||-
T Consensus       157 trKI~k~FY~cG--Dg~~~ak~G~gLma  182 (200)
T TIGR02354       157 TRKISKRFYVCG--DGKSDAKSGRGLMA  182 (200)
T ss_pred             CCEECCCCEEEE--CCCCCCCCCCCCCC
T ss_conf             121146303874--47222001631025


No 26 
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.34  E-value=20  Score=15.62  Aligned_cols=117  Identities=15%  Similarity=0.125  Sum_probs=66.9

Q ss_pred             EECCCCCEEEEEEECCCCCCCCCEEEEEEEEEECC---C---CC-CCCHHHCCCCEECCCCCC--------CCC-CCCC-
Q ss_conf             30787771799995178987778128899997168---8---88-732110323102133212--------466-3110-
Q gi|254780612|r  368 IPMIDNAFSASMTLKCNADISLSITHVMEIMFSFP---K---ES-QDAVVDLRRISMRKTDNS--------PSV-LIDS-  430 (503)
Q Consensus       368 i~iP~~~~~~~~~~rrn~d~slpash~~e~~f~~~---~---~~-~~~v~~v~~~~~k~~e~~--------~g~-~l~~-  430 (503)
                      .-|=|+.....-.|.=+.|+.+=  -+..+.|.|.   .   +| |=+-+-+.+-.|=..|-.        ++. .|.. 
T Consensus       142 fvi~dER~~~l~~~~vd~~t~~~--~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P~~I~~~~~~~~~l~~~  219 (316)
T COG3204         142 FVIVDERDRALYLFTVDADTTVI--SAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNPIGIFEVTQSPSSLSVH  219 (316)
T ss_pred             EEEEEHHCCEEEEEEECCCCCEE--ECCCEEEECCCCCCCCCCCEEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCC
T ss_conf             99886122658999986896187--2132587525568877572334417788648999813882899981488533220


Q ss_pred             EEEEECCC--EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEECCCCCH
Q ss_conf             01674698--289981788256887799973078116878837996-789996158768
Q gi|254780612|r  431 NIFVISKN--SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTIDKGKKG  486 (503)
Q Consensus       431 ~~~kv~~~--~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~ekg~~g  486 (503)
                      ++.-.++.  |||-.+|+++-+...|..|...-.-=-|--+...|. +-+|++-||--|
T Consensus       220 ~~~~~~~~~~~f~~DvSgl~~~~~~~~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~g  278 (316)
T COG3204         220 ASLDPTADRDLFVLDVSGLEFNAITNSLLVLSDESRRLLEVDLSGEVIELLSLTKGNHG  278 (316)
T ss_pred             CCCCCCCCCCEEEECCCCCEECCCCCCEEEEECCCCEEEEEECCCCEEEEEEECCCCCC
T ss_conf             24582101262762034415348888089986477238998358872543772267677


No 27 
>smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
Probab=35.24  E-value=28  Score=14.53  Aligned_cols=29  Identities=17%  Similarity=0.198  Sum_probs=22.9

Q ss_pred             HHHHHHHHHCCCCEEEEEEECCCCE-EEEE
Q ss_conf             8877999730781168788379967-8999
Q gi|254780612|r  451 PFRNSKILEEYRFIDIPITYRSGQK-ILFT  479 (503)
Q Consensus       451 ~~~n~~ll~~~~w~dip~~y~ngrr-a~~~  479 (503)
                      ...=+++|+++++-.+|+|..+||- .|||
T Consensus        11 v~~a~~~m~~~~i~~lPVVd~~~~lvGiit   40 (49)
T smart00116       11 LEEALELLREHGIRRLPVVDEEGRLVGIVT   40 (49)
T ss_pred             HHHHHHHHHHHCCCEEEEECCCCCEEEEEE
T ss_conf             999999999809985769989991999988


No 28 
>pfam03367 zf-ZPR1 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other mitogens, ZPR1 accumulates in the nucleolus. ZPR1 interacts with the cytoplasmic domain of the inactive EGF receptor (EGFR) and is thought to inhibit the basal protein tyrosine kinase activity of EGFR. This interaction is disrupted when cells are treated with EGF, though by themselves, inactive EGFRs are not sufficient to sequester ZPR1 to the cytoplasm. Upon stimulation by EGF, ZPR1 directly binds the eukaryotic translation elongation factor-1alpha (eEF-1alpha) to form ZPR1/eEF-1alpha complexes. These move into the nucleus, localising particularly at the nucleolus. Indeed, the interaction between ZPR1 and eEF-1alpha has been shown to be essential for normal cellular proliferation, and ZPR1 is thought to be involved 
Probab=34.98  E-value=29  Score=14.42  Aligned_cols=62  Identities=8%  Similarity=0.103  Sum_probs=40.0

Q ss_pred             CEEEEEEECCCCCEEEEEEECCCCCC------CCCEEE-EEEEEEECCCCC-CCCHHHCCCCEECCCCCC
Q ss_conf             42367630787771799995178987------778128-899997168888-732110323102133212
Q gi|254780612|r  362 LVIKGDIPMIDNAFSASMTLKCNADI------SLSITH-VMEIMFSFPKES-QDAVVDLRRISMRKTDNS  423 (503)
Q Consensus       362 ~aira~i~iP~~~~~~~~~~rrn~d~------slpash-~~e~~f~~~~~~-~~~v~~v~~~~~k~~e~~  423 (503)
                      .-|+.--.|.++|.+.+|.+..-.|.      |=-|+- +=||-|.+|... +|-|.||-||+++--++=
T Consensus        41 ~ev~~~~~~~p~g~~~~l~V~~~~DL~r~VvkS~tati~IPEL~~eip~~~~~G~iTtVEG~L~~~~d~L  110 (160)
T pfam03367        41 NEVQSGGEIEPKGTRYTLKVEDEEDLNRDVVKSETASISIPELGLEIPGPALGGRFTTVEGLLERVIEGL  110 (160)
T ss_pred             EEEEECCCCCCCEEEEEEEECCHHHHCCEEEECCCEEEEEEEEEEEECCCCCCCEEEEHHHHHHHHHHHH
T ss_conf             4888976669970899999889798156179768538996555258669876868980999999999999


No 29 
>PRK01659 consensus
Probab=34.71  E-value=12  Score=17.16  Aligned_cols=57  Identities=11%  Similarity=0.099  Sum_probs=33.1

Q ss_pred             CCCCCCCCEEE---EECCCEEEEECCCCCHHHHHHHHHHHCCC-CEEEEEEECCCCEEEEE
Q ss_conf             24663110016---74698289981788256887799973078-11687883799678999
Q gi|254780612|r  423 SPSVLIDSNIF---VISKNSYLISLKGSEEDPFRNSKILEEYR-FIDIPITYRSGQKILFT  479 (503)
Q Consensus       423 ~~g~~l~~~~~---kv~~~~f~v~l~~~~~~~~~n~~ll~~~~-w~dip~~y~ngrra~~~  479 (503)
                      .-|+|+.-.-.   .=.|..++|-|+.....+..|+.|+++-. =+.|||.|..|=|.+=.
T Consensus        28 ~~gDP~~~ak~~~~~Gad~ihivDld~a~~g~~~n~~~I~~i~~~~~ipi~vGGGIrs~e~   88 (252)
T PRK01659         28 DVGDPVEIAAAYNEAGADELVFLDITATHEGRKTMVDVVEKVAAKVFIPLTVGGGISSVKD   88 (252)
T ss_pred             ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCEECHHH
T ss_conf             8789999999999879999999946766568864899999999756974799633200688


No 30 
>KOG3047 consensus
Probab=34.69  E-value=22  Score=15.27  Aligned_cols=32  Identities=16%  Similarity=0.179  Sum_probs=22.9

Q ss_pred             CEEEEECCCEEEE-ECCCCCHHHHHHHHHHHCC
Q ss_conf             0016746982899-8178825688779997307
Q gi|254780612|r  430 SNIFVISKNSYLI-SLKGSEEDPFRNSKILEEY  461 (503)
Q Consensus       430 ~~~~kv~~~~f~v-~l~~~~~~~~~n~~ll~~~  461 (503)
                      -+-|+..|||||- -|-+.=-..+|.++||++.
T Consensus        88 ~i~VaL~~~fflElkLadAiKf~DRK~dlLkel  120 (157)
T KOG3047          88 HIVVALCDDFFLELKLADAIKFCDRKMDLLKEL  120 (157)
T ss_pred             EEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHH
T ss_conf             477775066211001998998887767999999


No 31 
>cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE.  MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=34.38  E-value=30  Score=14.36  Aligned_cols=78  Identities=17%  Similarity=0.361  Sum_probs=49.4

Q ss_pred             EEEEEEECCC-CCCCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECC
Q ss_conf             8999971688-887321103231021332124663110016746982899817882568877999730781168788379
Q gi|254780612|r  394 VMEIMFSFPK-ESQDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRS  472 (503)
Q Consensus       394 ~~e~~f~~~~-~~~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~n  472 (503)
                      -+.-+|++-+ +---|+.....|....    ...++.-+   ++.+++.|.-+.   ...-=+.+|++.++--||++..|
T Consensus        24 ~~~~i~Vvd~~~~l~G~vt~~dll~~~----~~~~v~~i---M~~~~~~v~~~~---~~~~a~~~m~~~~~~~lPVVd~~   93 (109)
T cd04606          24 TIYYIYVVDEEGRLLGVVSLRDLLLAD----PDTPVSDI---MDTDVISVSADD---DQEEVARLFEKYDLLALPVVDEE   93 (109)
T ss_pred             EEEEEEEECCCCEEEEEEEHHHHHHCC----CCCCHHHH---HCCCEEECCCCC---CHHHHHHHHHHHCCCEEEEECCC
T ss_conf             177999997999799999867722418----88716663---002136337999---89999999998527604689889


Q ss_pred             CC-EEEEEEC
Q ss_conf             96-7899961
Q gi|254780612|r  473 GQ-KILFTID  481 (503)
Q Consensus       473 gr-ra~~~~e  481 (503)
                      || ..+||+.
T Consensus        94 ~~lvGiIt~~  103 (109)
T cd04606          94 GRLVGIITVD  103 (109)
T ss_pred             CEEEEEEEHH
T ss_conf             9799999968


No 32 
>PRK00830 consensus
Probab=34.29  E-value=17  Score=16.13  Aligned_cols=84  Identities=12%  Similarity=0.069  Sum_probs=57.4

Q ss_pred             CCHHHCCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEEEEC-
Q ss_conf             32110323102133212466311001---67469828998178825688779997307-81168788379967899961-
Q gi|254780612|r  407 DAVVDLRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILFTID-  481 (503)
Q Consensus       407 ~~v~~v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~~~e-  481 (503)
                      .++.=|+|+.||.+- --|+|+.-+-   -+=.|-.|+|-|+....-+..|+.|+++- .=+.+||.|..|=|.+=.++ 
T Consensus        17 k~~rlvKg~~f~~~~-y~gdP~~~ak~~~~~gadelhivDld~a~~g~~~~~~~I~~i~~~~~~pi~vGGGIrs~e~~~~   95 (273)
T PRK00830         17 PGGRVVKGVEFVQLR-YAGDPVELAKRYYEDGADELVFLDITASHEGRATMIDVIERTAEEVFIPLTVGGGIRSIEDIRQ   95 (273)
T ss_pred             CCCEEEECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEECCEEECCCHHH
T ss_conf             379898586776875-7889999999999879998999953246468842799999999866995896088437732899


Q ss_pred             -------CCCCHHHHHH
Q ss_conf             -------5876899999
Q gi|254780612|r  482 -------KGKKGADIFK  491 (503)
Q Consensus       482 -------kg~~g~~~f~  491 (503)
                             |=.-|..||+
T Consensus        96 ll~~GadkVvIgS~a~~  112 (273)
T PRK00830         96 ILRAGADKVSVNTAAVK  112 (273)
T ss_pred             HHHCCCCEEECHHHHHH
T ss_conf             99769863983798985


No 33 
>TIGR03578 EF_0831 conserved hypothetical protein EF_0831/AHA_3912. Members of this family of small (about 100 amino acid), relatively rare proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.
Probab=32.33  E-value=32  Score=14.13  Aligned_cols=21  Identities=24%  Similarity=0.325  Sum_probs=15.0

Q ss_pred             ECCCCCHHHHHHHHHHHHHHC
Q ss_conf             615876899999999998522
Q gi|254780612|r  480 IDKGKKGADIFKSAIMQWENR  500 (503)
Q Consensus       480 ~ekg~~g~~~f~~a~~~w~~~  500 (503)
                      =-||-.-++||++||+.-|.+
T Consensus        11 ~GkG~tKe~Afa~a~s~iq~~   31 (96)
T TIGR03578        11 SGKGNTKQLAFASALSKIQKK   31 (96)
T ss_pred             ECCCCCHHHHHHHHHHHHHHH
T ss_conf             847863899999999999999


No 34 
>COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]
Probab=32.30  E-value=16  Score=16.30  Aligned_cols=22  Identities=23%  Similarity=0.357  Sum_probs=19.7

Q ss_pred             HHHHHHHCCCCEEEEEEECCCC
Q ss_conf             7799973078116878837996
Q gi|254780612|r  453 RNSKILEEYRFIDIPITYRSGQ  474 (503)
Q Consensus       453 ~n~~ll~~~~w~dip~~y~ngr  474 (503)
                      -|+.|||+.+-++.||+.++|-
T Consensus       140 QNF~LLke~G~~~kPvLLKRg~  161 (286)
T COG2876         140 QNFALLKEVGRQNKPVLLKRGL  161 (286)
T ss_pred             HHHHHHHHHCCCCCCEEEECCC
T ss_conf             5169999823559976972474


No 35 
>PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed
Probab=32.29  E-value=32  Score=14.13  Aligned_cols=54  Identities=15%  Similarity=0.294  Sum_probs=38.3

Q ss_pred             EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCC------------------CEEEEEECCCCCHHHHHHHHHHHHHHC
Q ss_conf             28998178825688779997307811687883799------------------678999615876899999999998522
Q gi|254780612|r  439 SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSG------------------QKILFTIDKGKKGADIFKSAIMQWENR  500 (503)
Q Consensus       439 ~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ng------------------rra~~~~ekg~~g~~~f~~a~~~w~~~  500 (503)
                      ..-|||-|.++|..    +|   .+-||+||-+|=                  ++-.+|=.-|+.|   .|+|+..|..+
T Consensus       227 v~tIaLGDspNDi~----ML---eaaD~aVvI~~p~~~~l~~~~~~~~~~~~~~~~~~a~~~GP~G---W~eaV~~~L~~  296 (302)
T PRK12702        227 IKALGIGCSPPDLA----FL---RWSEQKVVLPSPIADSLWKEALRLGGPEVQPQWQLAQLPGPEG---WNEVVLMWLEE  296 (302)
T ss_pred             CEEEEECCCHHHHH----HH---HHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCHH---HHHHHHHHHHH
T ss_conf             54999718776599----99---8588769968998776655444567766676347668998467---99999999986


Q ss_pred             CC
Q ss_conf             36
Q gi|254780612|r  501 SN  502 (503)
Q Consensus       501 ~~  502 (503)
                      ++
T Consensus       297 ~~  298 (302)
T PRK12702        297 TD  298 (302)
T ss_pred             HH
T ss_conf             51


No 36 
>PRK00748 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated
Probab=32.04  E-value=22  Score=15.27  Aligned_cols=25  Identities=24%  Similarity=0.320  Sum_probs=15.4

Q ss_pred             HHHHHHHCC-CCEEEEEEECCCCEEE
Q ss_conf             779997307-8116878837996789
Q gi|254780612|r  453 RNSKILEEY-RFIDIPITYRSGQKIL  477 (503)
Q Consensus       453 ~n~~ll~~~-~w~dip~~y~ngrra~  477 (503)
                      =|+.|++.- .+++||+.|..|=+.+
T Consensus       176 ~d~~l~~~i~~~~~ipviasGGv~s~  201 (241)
T PRK00748        176 PNVELTRELAAATPIPVIASGGVSSL  201 (241)
T ss_pred             CCHHHHHHHHHHCCCCEEEECCCCCH
T ss_conf             89999999998689989998899999


No 37 
>PRK04281 consensus
Probab=31.92  E-value=20  Score=15.56  Aligned_cols=42  Identities=19%  Similarity=0.293  Sum_probs=21.1

Q ss_pred             CCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE
Q ss_conf             69828998178825688779997307-8116878837996789
Q gi|254780612|r  436 SKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL  477 (503)
Q Consensus       436 ~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~  477 (503)
                      .|-.++|=|+....-+..|+.++++- .=+.|||.|..|=|-+
T Consensus        44 adelhivDld~a~~~~~~~~~~I~~i~~~~~vpi~vGGGIrs~   86 (254)
T PRK04281         44 ADELTFLDITASSDNRDTILHIIEEVAGQVFIPLTVGGGVRTV   86 (254)
T ss_pred             CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCEEEC
T ss_conf             9999999688987775308999999985079628997775451


No 38 
>pfam09837 DUF2064 Uncharacterized protein conserved in bacteria (DUF2064). This domain, found in various prokaryotic proteins, has no known function.
Probab=30.55  E-value=23  Score=15.10  Aligned_cols=20  Identities=25%  Similarity=0.433  Sum_probs=13.1

Q ss_pred             ECCCEEEEECCCCCHHHHHH
Q ss_conf             46982899817882568877
Q gi|254780612|r  435 ISKNSYLISLKGSEEDPFRN  454 (503)
Q Consensus       435 v~~~~f~v~l~~~~~~~~~n  454 (503)
                      -.-||+||||+....+.++|
T Consensus        97 ~DGGy~LiG~~~~~~~lF~~  116 (121)
T pfam09837        97 EDGGYYLLGLRRPTPDLFEG  116 (121)
T ss_pred             CCCCEEEEEECCCCHHHCCC
T ss_conf             88999999747778887079


No 39 
>cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria.  The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair.  The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here.  It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=30.34  E-value=35  Score=13.91  Aligned_cols=43  Identities=14%  Similarity=0.254  Sum_probs=24.9

Q ss_pred             ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEE
Q ss_conf             4698289981788256887799973078116878837996789996
Q gi|254780612|r  435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTI  480 (503)
Q Consensus       435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~  480 (503)
                      .+.+++.|.-   +.....=+++|.+++--.+|++..+.=.+|||.
T Consensus        58 Mt~~~~tv~~---~~~~~~a~~~m~~~~i~~lpVvd~~~lvGiIt~  100 (106)
T cd04638          58 MTRDPPTVSP---DDDVKEAAKLMVENNIRRVPVVDDGKLVGIVTV  100 (106)
T ss_pred             HCCCCEECCC---CCCHHHHHHHHHHCCCCEEEEEECCEEEEEEEH
T ss_conf             3589736799---491999999998769759999999999999997


No 40 
>pfam06089 Asparaginase_II L-asparaginase II. This family consists of several bacterial L-asparaginase II proteins. L-asparaginase (EC:3.5.1.1) catalyses the hydrolysis of L-asparagine to L-aspartate and ammonium. Rhizobium etli possesses two asparaginases: asparaginase I, which is thermostable and constitutive, and asparaginase II, which is thermolabile, induced by asparagine and repressed by the carbon source.
Probab=30.26  E-value=35  Score=13.90  Aligned_cols=31  Identities=10%  Similarity=0.067  Sum_probs=20.5

Q ss_pred             CCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEE
Q ss_conf             10016746982899817882568877999730781168
Q gi|254780612|r  429 DSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDI  466 (503)
Q Consensus       429 ~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~di  466 (503)
                      .|+++||.||-.       ..--.-=+.||+..+|||-
T Consensus       267 ~gialKi~DGs~-------Ra~~~~~~~~L~~lg~~~~  297 (325)
T pfam06089       267 LGIALKIEDGAT-------RAAEAVAAALLRQLGVLDP  297 (325)
T ss_pred             EEEEEEECCCCH-------HHHHHHHHHHHHHCCCCCH
T ss_conf             599999637857-------5799999999998599986


No 41 
>PRK02621 consensus
Probab=29.23  E-value=31  Score=14.30  Aligned_cols=42  Identities=14%  Similarity=0.202  Sum_probs=23.5

Q ss_pred             CCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE
Q ss_conf             69828998178825688779997307-8116878837996789
Q gi|254780612|r  436 SKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL  477 (503)
Q Consensus       436 ~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~  477 (503)
                      .|..++|-|......+..|++|+++- .=+.|||.|..|=|-+
T Consensus        44 ad~lhivDld~a~~~~~~~~~~I~~i~~~~~ipi~vGGGIrs~   86 (254)
T PRK02621         44 ADELVFLDITATHEGRATLIDVVYRTAEQVFIPLTVGGGISSL   86 (254)
T ss_pred             CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCEEEH
T ss_conf             9999998266765675428999999998679858996335357


No 42 
>PRK13597 imidazole glycerol phosphate synthase subunit HisF; Provisional
Probab=28.78  E-value=32  Score=14.16  Aligned_cols=66  Identities=9%  Similarity=0.131  Sum_probs=36.6

Q ss_pred             CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEE
Q ss_conf             323102133212466311001---67469828998178825688779997307-81168788379967899
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILF  478 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~  478 (503)
                      |+++.++..- --|+|+.-+-   -.=.|..|+|=|+.....+..|+.++++- .=+.|||.|..|=|.+=
T Consensus        19 Vkg~~~~~~~-~~gdP~~~a~~~~~~Gad~lhlvDld~a~~~~~~n~~~I~~i~~~~~vpiqvGGGIrs~e   88 (252)
T PRK13597         19 VKGVNFVNLR-DAGDPVEAARAYDEAGADELVFLDISATHEERAILLDVVARVAERVFIPLTVGGGVRSLE   88 (252)
T ss_pred             EECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHH
T ss_conf             8787777867-888999999999986999999995646666866379999999862698289847713089


No 43 
>PRK03220 consensus
Probab=28.77  E-value=35  Score=13.86  Aligned_cols=65  Identities=8%  Similarity=0.141  Sum_probs=43.2

Q ss_pred             CCCCEECCCCCCCCCCCCCEEE---EECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE
Q ss_conf             3231021332124663110016---7469828998178825688779997307-8116878837996789
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSNIF---VISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL  477 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~~~---kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~  477 (503)
                      |+++.++... .-|+||.-.-.   .=.|-+++|-|+.....+..|+++++.- .=+.+||.|..|=|-+
T Consensus        19 Vkg~~~~~~~-~~gdP~~~a~~~~~~G~d~lhivDld~a~~g~~~n~~~I~~i~~~~~~pi~vGGGIrs~   87 (257)
T PRK03220         19 VKGVNFENLR-DAGDPVELAAVYDAEGADELTFLDVTASSSGRATMLDVVRRTAEQVFIPLTVGGGVRTV   87 (257)
T ss_pred             EECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCH
T ss_conf             8577777867-88899999999998699989999088875676307999999985069648984785879


No 44 
>PRK10076 pyruvate formate lyase II activase; Provisional
Probab=28.77  E-value=37  Score=13.73  Aligned_cols=32  Identities=19%  Similarity=0.264  Sum_probs=16.6

Q ss_pred             CEEEEECCCCCHHHH------------HHHHHHHCCC---CEEEEEE
Q ss_conf             828998178825688------------7799973078---1168788
Q gi|254780612|r  438 NSYLISLKGSEEDPF------------RNSKILEEYR---FIDIPIT  469 (503)
Q Consensus       438 ~~f~v~l~~~~~~~~------------~n~~ll~~~~---w~dip~~  469 (503)
                      |+||+-|...+.++-            .|+..|.+++   ||++|++
T Consensus        91 Dl~L~DiK~~d~~~h~~~TG~~n~~il~Nl~~l~~~~~~v~iR~pvI  137 (213)
T PRK10076         91 DEVLFDLKIMDATQARDVVKMNLPRVLENLRLLVSEGVNVIPRLPLI  137 (213)
T ss_pred             CEEEEEECCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEEEEEE
T ss_conf             98998617798489999979993999999999996799689988677


No 45 
>KOG1383 consensus
Probab=28.73  E-value=37  Score=13.73  Aligned_cols=36  Identities=14%  Similarity=0.202  Sum_probs=24.0

Q ss_pred             CCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECC
Q ss_conf             982899817882568877999730781168788379
Q gi|254780612|r  437 KNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRS  472 (503)
Q Consensus       437 ~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~n  472 (503)
                      +..=+|++++...+..+=.++||.++||==-+..=+
T Consensus       382 p~vs~~~~~s~~~~i~elsd~l~~~GW~lnalq~P~  417 (491)
T KOG1383         382 PLVSFILFGSNDVNIFELSDLLRKKGWILNALQFPA  417 (491)
T ss_pred             CCEEEEECCCCCCCCEEHHHHHHHCCCCCCCCCCCC
T ss_conf             737999716876531114578986386766567887


No 46 
>pfam11217 DUF3013 Protein of unknown function (DUF3013). This bacterial family of proteins with unknown function appear to be restricted to Firmicutes.
Probab=28.72  E-value=29  Score=14.42  Aligned_cols=25  Identities=20%  Similarity=0.282  Sum_probs=17.9

Q ss_pred             EEEEEEEEECC-CCC-CCCHHHCCCCE
Q ss_conf             28899997168-888-73211032310
Q gi|254780612|r  392 THVMEIMFSFP-KES-QDAVVDLRRIS  416 (503)
Q Consensus       392 sh~~e~~f~~~-~~~-~~~v~~v~~~~  416 (503)
                      +|+|||.|++= +|. |--|.+..|+.
T Consensus        31 nH~iEv~f~l~aen~~~v~~~D~~G~~   57 (160)
T pfam11217        31 NHAIELEFTLEAQNKEGIETVDDEGEV   57 (160)
T ss_pred             CCEEEEEEEEEEECCCCEEEECCCCCC
T ss_conf             746999999998254550678687725


No 47 
>PRK12595 bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed
Probab=28.16  E-value=33  Score=14.02  Aligned_cols=33  Identities=15%  Similarity=0.281  Sum_probs=26.6

Q ss_pred             CCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC
Q ss_conf             98289981788256887799973078116878837996
Q gi|254780612|r  437 KNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ  474 (503)
Q Consensus       437 ~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr  474 (503)
                      =|+++||=-|..     |..||++-.=.|.||..+||-
T Consensus       202 vDilqIGARnmq-----Nf~LLk~vg~~~kPVLlKrg~  234 (360)
T PRK12595        202 VDVIQIGARNMQ-----NFELLKAAGRVNKPVLLKRGL  234 (360)
T ss_pred             CCEEEECCHHCC-----CHHHHHHHHCCCCCEEEECCC
T ss_conf             868988841035-----999999986139937960799


No 48 
>COG2987 HutU Urocanate hydratase [Amino acid transport and metabolism]
Probab=27.47  E-value=26  Score=14.78  Aligned_cols=29  Identities=28%  Similarity=0.554  Sum_probs=22.0

Q ss_pred             CEEEEECCCCCHHHHHHHHHHHCC--------CCEEE
Q ss_conf             828998178825688779997307--------81168
Q gi|254780612|r  438 NSYLISLKGSEEDPFRNSKILEEY--------RFIDI  466 (503)
Q Consensus       438 ~~f~v~l~~~~~~~~~n~~ll~~~--------~w~di  466 (503)
                      -|-||+||..+.|.-+-=..++|.        +|||+
T Consensus       360 PFRW~aLSgdpeDi~~tD~~~~el~p~n~~l~~Wid~  396 (561)
T COG2987         360 PFRWVALSGDPEDIYKTDAAVKELFPDNKHLHRWIDM  396 (561)
T ss_pred             CEEEEEECCCHHHHHHHHHHHHHHCCCCHHHHHHHHH
T ss_conf             7157870699788888799999858982889999999


No 49 
>pfam12281 DUF3620 Protein of unknown function (DUF3620). This family of proteins is found in bacteria. Proteins in this family are typically between 281 and 358 amino acids in length. There are two completely conserved residues (G and P) that may be functionally important.
Probab=27.25  E-value=39  Score=13.55  Aligned_cols=110  Identities=17%  Similarity=0.177  Sum_probs=56.2

Q ss_pred             EEEEEECCCCCEEEEEEECCCCCCCCCE-EEEEEEEEE-CCCCCCC-CHHHCCCCEECCCCCCCCCCCCCEEEEECCCEE
Q ss_conf             3676307877717999951789877781-288999971-6888873-211032310213321246631100167469828
Q gi|254780612|r  364 IKGDIPMIDNAFSASMTLKCNADISLSI-THVMEIMFS-FPKESQD-AVVDLRRISMRKTDNSPSVLIDSNIFVISKNSY  440 (503)
Q Consensus       364 ira~i~iP~~~~~~~~~~rrn~d~slpa-sh~~e~~f~-~~~~~~~-~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f  440 (503)
                      --+||.+- ....+.+.|--+.|.+|.. =+-+++.|. +|...++ ...|-+|...-==--.+|+..     +-..--+
T Consensus        47 ~T~DvDla-~~~~~slal~~~~~~~ll~vL~~vd~~F~~vp~~~~~~~~~~~~g~~VdfLtp~~~~~~-----~~~~p~~  120 (216)
T pfam12281        47 ATGDVDLA-QDFRRSLALGDSVRPGLLGLLRSVDLTFEPVPRLSGGFRARNRQGFMVEFLTPSRGGGE-----GEDGPRL  120 (216)
T ss_pred             HCCCHHHH-HHCCCCCCCCCCCCCCHHHHHHHCCCCCEECCCCCCCCEEECCCCEEEEEECCCCCCCC-----CCCCCCC
T ss_conf             10364677-63112313555567539999986498833776557885104158759998468888875-----5578654


Q ss_pred             EEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEE
Q ss_conf             9981788256887799973078116878837996789996
Q gi|254780612|r  441 LISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTI  480 (503)
Q Consensus       441 ~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~  480 (503)
                      |-+|-++....-|+|+.|-..- |+..++|+||+=+.+.+
T Consensus       121 ~~~~~~~~A~~l~~l~~Ll~~P-v~av~l~~~G~~v~v~v  159 (216)
T pfam12281       121 LPALDGLSAQPLRFLDWLLNDP-VRAVVLDRSGRPVRVQV  159 (216)
T ss_pred             CCCCCCCCCCCCCCCCCCCCCC-EEEEEECCCCCEEEEEC
T ss_conf             7654783632235120010598-06899837896789957


No 50 
>PRK02747 consensus
Probab=26.48  E-value=21  Score=15.48  Aligned_cols=67  Identities=7%  Similarity=0.106  Sum_probs=43.4

Q ss_pred             CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEEE
Q ss_conf             323102133212466311001---67469828998178825688779997307-811687883799678999
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILFT  479 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~~  479 (503)
                      |+++.++.. .--|+|+.-+-   -.=.|-.++|-|+....-+..|+.|+++- +=+.|||.|..|=|.+=.
T Consensus        18 Vkg~~~~~~-~~~gdP~~~ak~~~~~Gadelh~vDl~~a~~~~~~~~~lI~~i~~~~~ipi~vGGGIrs~e~   88 (257)
T PRK02747         18 VKGVNFVDL-RDAGDPVEAARAYDAAGADELCFLDITASHENRGTMLDVVARTAEQCFMPLTVGGGVRTVDD   88 (257)
T ss_pred             EECCCCCCE-EECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHHH
T ss_conf             858777770-78889999999999869998999947677567552899999999866998898488207388


No 51 
>pfam07946 DUF1682 Protein of unknown function (DUF1682). The members of this family are all hypothetical eukaryotic proteins of unknown function. One member is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=26.43  E-value=28  Score=14.54  Aligned_cols=15  Identities=7%  Similarity=0.304  Sum_probs=8.1

Q ss_pred             ECCCCEEEEEECCCC
Q ss_conf             379967899961587
Q gi|254780612|r  470 YRSGQKILFTIDKGK  484 (503)
Q Consensus       470 y~ngrra~~~~ekg~  484 (503)
                      ++.-+|-+|+|--=.
T Consensus       212 ~~~~k~l~~~~~lP~  226 (322)
T pfam07946       212 TKPKKRLILSFRLPK  226 (322)
T ss_pred             CCCCEEEEEEEECCC
T ss_conf             667608999997389


No 52 
>cd04495 BRCA2DBD_OB3 BRCA2DBD_OB3: A subfamily of OB folds corresponding to the third OB fold (OB3) of the 800-amino acid C-terminal ssDNA binding domain (DBD) of BRCA2 (breast cancer susceptibility gene 2) protein, called BRCA2DBD. BRCA2 participates in homologous recombination-mediated repair of double-strand DNA breaks. It stimulates the displacement of Replication protein A (RPA), the most abundant eukaryotic ssDNA binding protein. It also facilitates filament formation. Mutations that map throughout the BRCA2 protein are associated with breast cancer susceptibility. BRCA2 is a large nuclear protein and its most conserved region is the C-terminal BRCA2DBD. BRCA2DBD binds ssDNA in vitro, and is composed of five structural domains, three of which are OB folds (OB1, OB2, and OB3). BRCA2DBD OB2 and OB3 are arranged in tandem, and their mode of binding can be considered qualitatively similar to two OB folds of RPA1, DBD-A and DBD-B (the major DBDs of RPA).
Probab=26.31  E-value=14  Score=16.73  Aligned_cols=40  Identities=10%  Similarity=0.136  Sum_probs=25.5

Q ss_pred             HHHHHHCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHHHH
Q ss_conf             799973078116878837996789996158768999999999985
Q gi|254780612|r  454 NSKILEEYRFIDIPITYRSGQKILFTIDKGKKGADIFKSAIMQWE  498 (503)
Q Consensus       454 n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g~~~f~~a~~~w~  498 (503)
                      |||.=.+..| +||+.|..-. ..+|   -.+-+.-|.++|..-+
T Consensus        57 NLQwR~~s~~-~iPtl~A~e~-s~FS---~nPKe~hlQe~~~~Lk   96 (100)
T cd04495          57 NLQWRTESTS-GVPTLFAGEY-STFS---ANPKESHLQEEFNNLK   96 (100)
T ss_pred             CCEEECCCCC-CCCEEEECCC-CEEC---CCCCHHHHHHHHHHHH
T ss_conf             3255046567-7860664032-0214---7940778999999988


No 53 
>PRK13415 flagella biosynthesis protein FliZ; Provisional
Probab=26.23  E-value=41  Score=13.43  Aligned_cols=40  Identities=10%  Similarity=0.265  Sum_probs=28.0

Q ss_pred             CCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHC
Q ss_conf             321103231021332124663110016746982899817882568877999730
Q gi|254780612|r  407 DAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEE  460 (503)
Q Consensus       407 ~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~  460 (503)
                      +.|.|+.|+.+-.|-       .=.-+||.+..|+||.-       -|+|||||
T Consensus       105 ~~i~~lGGtslG~nr-------SvQlvkvG~~l~vvGVG-------e~iqLLke  144 (216)
T PRK13415        105 QYVENIGGTSVGQNR-------SVQLIKVGNRVLVVGVG-------ESIQLLKE  144 (216)
T ss_pred             CHHHHCCCCCCCCCC-------EEEEEEECCEEEEEECC-------CHHHHHHH
T ss_conf             234522674457886-------57899976889999547-------25789986


No 54 
>KOG1693 consensus
Probab=26.17  E-value=14  Score=16.67  Aligned_cols=103  Identities=10%  Similarity=0.018  Sum_probs=61.6

Q ss_pred             EEEEEEECCCCCEEEEEEECCCCCCCCCEEEEEEEEEECCCCCCCCHHHCCCCEECCCCCCCCCCCC--CEEEEECCCEE
Q ss_conf             2367630787771799995178987778128899997168888732110323102133212466311--00167469828
Q gi|254780612|r  363 VIKGDIPMIDNAFSASMTLKCNADISLSITHVMEIMFSFPKESQDAVVDLRRISMRKTDNSPSVLID--SNIFVISKNSY  440 (503)
Q Consensus       363 aira~i~iP~~~~~~~~~~rrn~d~slpash~~e~~f~~~~~~~~~v~~v~~~~~k~~e~~~g~~l~--~~~~kv~~~~f  440 (503)
                      -|-=+|+-||.++=..-+=||-.|-++=|.|.=+-+|-+..+|+..-..+--+.++--++-.--+..  ...+++.=+--
T Consensus        60 DVD~~I~aPdgkvI~~~~kk~~~~~~f~ae~~G~Y~fCFsN~fstf~~Kiv~~~~q~~~~~~~~~~~~~~~~~~~~mena  139 (209)
T KOG1693          60 DVDYDIEAPDGKVIYSEKKKRYDSFLFKAEGKGEYTFCFSNEFSTFSHKIVYMDFQVGEEPPLHPAVSNRDTALTQMENA  139 (209)
T ss_pred             EEEEEEECCCCCEEEECCCCCCCCEEEEEECCEEEEEEECCCCCCCCCEEEEEHHHHCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             25889988999887540102103379998511689999627645642237651024244666673312365488888889


Q ss_pred             EEECCCCCHHHHHHHHHHHCCCCEE
Q ss_conf             9981788256887799973078116
Q gi|254780612|r  441 LISLKGSEEDPFRNSKILEEYRFID  465 (503)
Q Consensus       441 ~v~l~~~~~~~~~n~~ll~~~~w~d  465 (503)
                      .+.....=....||.+-+|-|.|=+
T Consensus       140 ~~~I~~~L~~I~~~q~y~R~RE~rn  164 (209)
T KOG1693         140 IVEIHRALNKIDDTQTYYRLREARN  164 (209)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCC
T ss_conf             9999999999998899998887628


No 55 
>smart00310 PTBI Phosphotyrosine-binding domain (IRS1-like).
Probab=25.97  E-value=41  Score=13.40  Aligned_cols=68  Identities=18%  Similarity=0.164  Sum_probs=51.4

Q ss_pred             EEC-CCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEE-----EEEECCCCCHHHHHHHHHHHHHHCCC
Q ss_conf             746-9828998178825688779997307811687883799678-----99961587689999999999852236
Q gi|254780612|r  434 VIS-KNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKI-----LFTIDKGKKGADIFKSAIMQWENRSN  502 (503)
Q Consensus       434 kv~-~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra-----~~~~ekg~~g~~~f~~a~~~w~~~~~  502 (503)
                      .++ +.+-|+-++.-..-..=.|..||..+-.+--..++.|||+     +++|+ =.-|+.+|+-..++=+++.|
T Consensus        25 ~lt~~~L~L~~~~~~~~l~~wpl~~lRryG~~~~~F~fEaGRrc~~G~G~f~f~-t~~~~~i~~~v~~am~a~k~   98 (98)
T smart00310       25 RLTSETLVLWRLNPRVELVVWPLLSLRRYGRDKNFFFFEAGRRCVSGPGEFTFQ-TVVAQEIFQLVLEAMQAQSN   98 (98)
T ss_pred             EECCCEEEEECCCCCCEEEEEEHHHEEECCCCCCEEEEECCCCCCCCCCEEEEE-CCCHHHHHHHHHHHHHHHCC
T ss_conf             987888999868999508996736623403689999999058878898679998-28499999999999986029


No 56 
>PRK13396 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=25.82  E-value=40  Score=13.50  Aligned_cols=37  Identities=16%  Similarity=0.321  Sum_probs=28.7

Q ss_pred             EEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC
Q ss_conf             1674698289981788256887799973078116878837996
Q gi|254780612|r  432 IFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ  474 (503)
Q Consensus       432 ~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr  474 (503)
                      ..++. |+++||=-|     -+|..||++-.=.|.||..+||-
T Consensus       181 v~~~~-DilQIGARn-----~qNf~LL~~~g~t~kPVllKrg~  217 (352)
T PRK13396        181 IAEVA-DVIQVGARN-----MQNFSLLKKVGAQDKPVLLKRGM  217 (352)
T ss_pred             HHHHC-CEEEECCHH-----HCCHHHHHHHHCCCCEEEECCCC
T ss_conf             98658-889989254-----05999999985469807973788


No 57 
>pfam11322 DUF3124 Protein of unknown function (DUF3124). This bacterial family of proteins has no known function.
Probab=25.76  E-value=41  Score=13.37  Aligned_cols=17  Identities=29%  Similarity=0.462  Sum_probs=11.7

Q ss_pred             CEEEEEEECCCCCCCCCE
Q ss_conf             717999951789877781
Q gi|254780612|r  374 AFSASMTLKCNADISLSI  391 (503)
Q Consensus       374 ~~~~~~~~rrn~d~slpa  391 (503)
                      .|+++|+| ||||.+-|-
T Consensus        25 ~Lt~tLSi-RNtd~~~pi   41 (126)
T pfam11322        25 NLTATLSI-RNTDPSQPI   41 (126)
T ss_pred             EEEEEEEE-ECCCCCCCE
T ss_conf             68999999-749999988


No 58 
>KOG0526 consensus
Probab=25.56  E-value=42  Score=13.35  Aligned_cols=96  Identities=14%  Similarity=0.166  Sum_probs=51.7

Q ss_pred             CCCEEEEEEECCCCCCCCCEEEEEEEEEECCCC-C-CCCHHHC----C-CCEECCCCCCCCCCCCCEEEEECCCEEEEEC
Q ss_conf             777179999517898777812889999716888-8-7321103----2-3102133212466311001674698289981
Q gi|254780612|r  372 DNAFSASMTLKCNADISLSITHVMEIMFSFPKE-S-QDAVVDL----R-RISMRKTDNSPSVLIDSNIFVISKNSYLISL  444 (503)
Q Consensus       372 ~~~~~~~~~~rrn~d~slpash~~e~~f~~~~~-~-~~~v~~v----~-~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l  444 (503)
                      ..+=.++|-|+-|.|  -|+. +|||-|-.|.+ + |-.-+-|    . .+.+-+-+++.|+++.    ..++=++|-=-
T Consensus       141 ~gKNEv~LEFh~nDd--a~~~-LmEmRF~iP~d~~~ged~~~~e~F~~~v~~kAdv~~~~gdaI~----~f~~i~~lTPR  213 (615)
T KOG0526         141 TGKNEVTLEFHQNDD--APVG-LMEMRFHIPEDQEDGEDRDKVEAFYENVLAKADVSSAVGDAIV----SFEEILCLTPR  213 (615)
T ss_pred             CCCCEEEEEEECCCC--CCCC-EEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEE----EEEEEEEECCC
T ss_conf             377517999861677--7743-3899995485212444356788999999986375412477458----76522541577


Q ss_pred             CCCCHHHHHHHHHHHCCCCEEEEEEECCCCE
Q ss_conf             7882568877999730781168788379967
Q gi|254780612|r  445 KGSEEDPFRNSKILEEYRFIDIPITYRSGQK  475 (503)
Q Consensus       445 ~~~~~~~~~n~~ll~~~~w~dip~~y~ngrr  475 (503)
                      --.+-+..++.--|++.. +|-=|-|++=.|
T Consensus       214 GRYdI~iy~t~lrL~GkT-yDyKI~y~SI~r  243 (615)
T KOG0526         214 GRYDIKIYPTFLRLHGKT-YDYKIPYKSINR  243 (615)
T ss_pred             CCCEEEEEHHHHHHCCCC-CCEECCHHHEEE
T ss_conf             620068730134221664-420120331404


No 59 
>pfam03425 CBM_11 Carbohydrate binding domain (family 11).
Probab=25.54  E-value=34  Score=13.98  Aligned_cols=31  Identities=16%  Similarity=0.375  Sum_probs=14.7

Q ss_pred             CCCCCCEEEEEEEEEECCCCCC-CC--HHHCCCC
Q ss_conf             9877781288999971688887-32--1103231
Q gi|254780612|r  385 ADISLSITHVMEIMFSFPKESQ-DA--VVDLRRI  415 (503)
Q Consensus       385 ~d~slpash~~e~~f~~~~~~~-~~--v~~v~~~  415 (503)
                      -+..|=..|+.+|.|....+-+ |.  |+||+.+
T Consensus       135 ~~~~~dL~~i~~l~f~~~~~~~~g~f~iDnI~l~  168 (170)
T pfam03425       135 KDVPLDLDNITALAFQPKGGNSKGSFKIDNVRLI  168 (170)
T ss_pred             CCCEECHHHCEEEEEEECCCCCCCCEEEEEEEEE
T ss_conf             5330373463189998648987565899779971


No 60 
>PRK05211 consensus
Probab=25.25  E-value=9.8  Score=17.74  Aligned_cols=65  Identities=12%  Similarity=0.176  Sum_probs=36.6

Q ss_pred             CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHC-CCCEEEEEEECCCCEEE
Q ss_conf             323102133212466311001---6746982899817882568877999730-78116878837996789
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEE-YRFIDIPITYRSGQKIL  477 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~-~~w~dip~~y~ngrra~  477 (503)
                      |+++.|+ .+.--|+|+..+-   -+=.|-.++|-|......+..|+.++++ ..=+.+||+|..|=|-+
T Consensus         9 Vk~~~f~-~~~~~gDP~~~ak~~~~~gadelhivDld~a~~g~~~n~~~I~~i~~~~~~Pl~vGGGIrs~   77 (248)
T PRK05211          9 VKGVQFR-NHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVAGGIKSV   77 (248)
T ss_pred             EECCCCC-CCEECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCH
T ss_conf             7685766-88677899999999998699989999786776787214999999997679858962780138


No 61 
>KOG3883 consensus
Probab=24.97  E-value=43  Score=13.28  Aligned_cols=84  Identities=14%  Similarity=0.163  Sum_probs=45.3

Q ss_pred             EEEEEEEEECCCCCCCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEE------
Q ss_conf             28899997168888732110323102133212466311001674698289981788256887799973078116------
Q gi|254780612|r  392 THVMEIMFSFPKESQDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFID------  465 (503)
Q Consensus       392 sh~~e~~f~~~~~~~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~d------  465 (503)
                      --|||=+|...=..+.|+...-+|.=-.-=|.--.-|-.--+.-+|+|-||--++...- ..-+.|||-  |||      
T Consensus        41 ~pTiEDiY~~svet~rgarE~l~lyDTaGlq~~~~eLprhy~q~aDafVLVYs~~d~eS-f~rv~llKk--~Idk~KdKK  117 (198)
T KOG3883          41 HPTIEDIYVASVETDRGAREQLRLYDTAGLQGGQQELPRHYFQFADAFVLVYSPMDPES-FQRVELLKK--EIDKHKDKK  117 (198)
T ss_pred             CCCHHHHEEEEEECCCCHHHEEEEEECCCCCCCHHHHHHHHHCCCCEEEEEECCCCHHH-HHHHHHHHH--HHHHCCCCC
T ss_conf             66344224676406878101688865244557600202767105765799963799889-889999999--986255655


Q ss_pred             -EEEEECCCCEEEE
Q ss_conf             -8788379967899
Q gi|254780612|r  466 -IPITYRSGQKILF  478 (503)
Q Consensus       466 -ip~~y~ngrra~~  478 (503)
                       ||||--...|++-
T Consensus       118 EvpiVVLaN~rdr~  131 (198)
T KOG3883         118 EVPIVVLANKRDRA  131 (198)
T ss_pred             CCCEEEEECHHHCC
T ss_conf             12289973001013


No 62 
>TIGR00400 mgtE magnesium transporter; InterPro: IPR006669    This is the MgtE family of magnesium transporters. All the archaebacterial and eukaryotic examples have two copies of the integral membrane region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg^2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport.
Probab=24.54  E-value=44  Score=13.22  Aligned_cols=76  Identities=18%  Similarity=0.377  Sum_probs=59.2

Q ss_pred             EEEEEECCCCC-CCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCC
Q ss_conf             99997168888-73211032310213321246631100167469828998178825688779997307811687883799
Q gi|254780612|r  395 MEIMFSFPKES-QDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSG  473 (503)
Q Consensus       395 ~e~~f~~~~~~-~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ng  473 (503)
                      +-..|+.-+.- -.|+-++..+.+...|.--|+-+      ++++|+..++++-+.|..   .++...+++-+|++..+|
T Consensus       171 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~d~~~~~~---~~~~~~d~~~~p~~~~~~  241 (460)
T TIGR00400       171 IYTLYVTNESKRLTGVLSLRDLILAKPEEYLGDLL------FPDGVSVDGLNDEEEDVA---RLIEKYDFLAVPVVDNEG  241 (460)
T ss_pred             HEEEEEECCCHHHHHHHHHHHHHHCCCHHHHHHHH------HHCCEEEEECCCCHHHHH---HHHHHCCCEEEEEECCCC
T ss_conf             02457633410220001123333035056776654------210013520454046789---887640210200131677


Q ss_pred             CEEEEE
Q ss_conf             678999
Q gi|254780612|r  474 QKILFT  479 (503)
Q Consensus       474 rra~~~  479 (503)
                      |..-+.
T Consensus       242 ~~~g~~  247 (460)
T TIGR00400       242 RLVGIV  247 (460)
T ss_pred             CEEEEE
T ss_conf             168754


No 63 
>cd04731 HisF The cyclase subunit of imidazoleglycerol phosphate synthase (HisF). Imidazole glycerol phosphate synthase (IGPS) catalyzes the fifth step of histidine biosynthesis, the formation of the imidazole ring. IGPS converts N1-(5'-phosphoribulosyl)-formimino-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to imidazole glycerol phosphate (ImGP) and 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR). This conversion involves two tightly coupled reactions in distinct active sites of IGPS. The two catalytic domains can be fused, like in fungi and plants, or peformed by a heterodimer (HisH-glutaminase and HisF-cyclase), like in bacteria.
Probab=24.10  E-value=13  Score=16.88  Aligned_cols=40  Identities=18%  Similarity=0.234  Sum_probs=17.7

Q ss_pred             CCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEE
Q ss_conf             9828998178825688779997307-811687883799678
Q gi|254780612|r  437 KNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKI  476 (503)
Q Consensus       437 ~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra  476 (503)
                      |-.++|-|+.....+..|+.||++- .=+.+||.|..|=|.
T Consensus        42 delhivDld~a~~g~~~n~~~i~~i~~~~~~pi~vGGGIrs   82 (243)
T cd04731          42 DELVFLDITASSEGRETMLDVVERVAEEVFIPLTVGGGIRS   82 (243)
T ss_pred             CEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEEE
T ss_conf             99999706732037700799999999867986899850664


No 64 
>cd03596 CLECT_tetranectin_like C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CLECT_tetranectin_like: C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  TN binds to plasminogen and stimulates activation of plasminogen, playing a key role in the regulation of proteolytic processes.  The TN CTLD binds two calcium ions.  Its calcium free form binds to various kringle-like protein ligands.  Two residues involved in the coordination of calcium are critical for the binding of TN to the fourth kringle (K4) domain of plasminogen (Plg K4).  TN binds the kringle 1-4 form of angiostatin (AST K1-4).  AST K1-4 is a fragment of Plg, commonly found in cancer tissues.  TN inhibits the bin
Probab=24.06  E-value=31  Score=14.22  Aligned_cols=25  Identities=16%  Similarity=0.375  Sum_probs=15.4

Q ss_pred             CEEEEECCCCCHHHHHHHHHHHCCCCEE---EEEEECC
Q ss_conf             8289981788256887799973078116---8788379
Q gi|254780612|r  438 NSYLISLKGSEEDPFRNSKILEEYRFID---IPITYRS  472 (503)
Q Consensus       438 ~~f~v~l~~~~~~~~~n~~ll~~~~w~d---ip~~y~n  472 (503)
                      .-|||||+|...+          -.|++   .||.|.|
T Consensus        59 ~~~WIGl~d~~~~----------g~w~w~~g~~~~~~~   86 (129)
T cd03596          59 WEVWLGINDMVAE----------GKWVDVNGSPISYFN   86 (129)
T ss_pred             CCEEEECCCCCCC----------CCEECCCCCCCCCCC
T ss_conf             6478761112679----------966819998856257


No 65 
>pfam03180 Lipoprotein_9 NLPA lipoprotein. This family of bacterial lipoproteins contains several antigenic members, that may be involved in bacterial virulence. Their precise function is unknown. However they are probably distantly related to pfam00497 which are solute binding proteins.
Probab=24.03  E-value=45  Score=13.16  Aligned_cols=28  Identities=18%  Similarity=0.078  Sum_probs=16.4

Q ss_pred             CCCCCEEEEEEECCCCCEEEE-----EEECCCC
Q ss_conf             776742367630787771799-----9951789
Q gi|254780612|r  358 GLKGLVIKGDIPMIDNAFSAS-----MTLKCNA  385 (503)
Q Consensus       358 ~~~~~aira~i~iP~~~~~~~-----~~~rrn~  385 (503)
                      +|...+|--|.+.=.|.|.+.     ++|+.+.
T Consensus        96 ~Ga~IaIpnD~sN~~RAL~lL~~aGLI~Lk~~~  128 (236)
T pfam03180        96 DGATIAVPNDPSNEGRALLLLEKAGLIKLKDGA  128 (236)
T ss_pred             CCCEEEECCCCCHHHHHHHHHHHCCCEEECCCC
T ss_conf             999898158844299999999988988976899


No 66 
>pfam00869 Flavi_glycoprot Flavivirus glycoprotein, central and dimerization domains.
Probab=23.98  E-value=17  Score=16.14  Aligned_cols=38  Identities=21%  Similarity=0.539  Sum_probs=22.4

Q ss_pred             CCEEEEECCCCCHHHHHHHHHHHCCCCE-EEEEEECCC-------CEEEEEECC
Q ss_conf             9828998178825688779997307811-687883799-------678999615
Q gi|254780612|r  437 KNSYLISLKGSEEDPFRNSKILEEYRFI-DIPITYRSG-------QKILFTIDK  482 (503)
Q Consensus       437 ~~~f~v~l~~~~~~~~~n~~ll~~~~w~-dip~~y~ng-------rra~~~~ek  482 (503)
                      ++++|+.+.        |-..|-+|+|| ||||=++.|       |-.|+.|+-
T Consensus       193 ~~~yl~~~~--------~~~wlVhrdWf~DL~LPW~~~~~~~W~~~e~LVeF~~  238 (293)
T pfam00869       193 SNYYLLEMD--------GKSWLVHRDWFHDLPLPWTHGGATSWRNKERLVEFEE  238 (293)
T ss_pred             HHEEEEEEC--------CCEEEEEHHHHHCCCCCCCCCCCCCCCCCCEEEEECC
T ss_conf             678999976--------9589987557533788851799973514523788669


No 67 
>PRK13585 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional
Probab=23.95  E-value=40  Score=13.46  Aligned_cols=40  Identities=28%  Similarity=0.414  Sum_probs=17.8

Q ss_pred             CCEEEEECCCCCHHHHHHHHHHHC-CCCEEEEEEECCCCEE
Q ss_conf             982899817882568877999730-7811687883799678
Q gi|254780612|r  437 KNSYLISLKGSEEDPFRNSKILEE-YRFIDIPITYRSGQKI  476 (503)
Q Consensus       437 ~~~f~v~l~~~~~~~~~n~~ll~~-~~w~dip~~y~ngrra  476 (503)
                      |-+|+|=|+....-+..|+.++++ ..=+.+||.|..|=|-
T Consensus        46 ~~lhivDLd~a~~g~~~n~~~I~~i~~~~~~pi~vGGGIrs   86 (240)
T PRK13585         46 KTLHLVDLDGAFEGSRKNADIIEKIVEATDVSIQLGGGIRS   86 (240)
T ss_pred             CEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCC
T ss_conf             97999989772118944499999999737977899788587


No 68 
>PRK07006 isocitrate dehydrogenase; Reviewed
Probab=23.90  E-value=37  Score=13.75  Aligned_cols=18  Identities=22%  Similarity=0.108  Sum_probs=12.9

Q ss_pred             HHHHHHCCCCEEEEEEECC
Q ss_conf             7999730781168788379
Q gi|254780612|r  454 NSKILEEYRFIDIPITYRS  472 (503)
Q Consensus       454 n~~ll~~~~w~dip~~y~n  472 (503)
                      =++|++.-.+||+ ||-.|
T Consensus       279 ~~~lv~~P~~fDV-ivt~N  296 (409)
T PRK07006        279 LQQILLRPAEYDV-IATMN  296 (409)
T ss_pred             HHHHHHCCCCCCE-EEECC
T ss_conf             9999639645547-98755


No 69 
>cd07136 ALDH_YwdH-P39616 Bacillus subtilis aldehyde dehydrogenase ywdH-like. Uncharacterized Bacillus subtilis ywdH aldehyde dehydrogenase (locus P39616)  most closely related to the ALDHs and fatty ALDHs of families 3 and 14, and similar sequences, are included in this CD.
Probab=23.74  E-value=18  Score=15.88  Aligned_cols=16  Identities=31%  Similarity=0.783  Sum_probs=12.5

Q ss_pred             HHHHCCCCEEEEEEEC
Q ss_conf             9973078116878837
Q gi|254780612|r  456 KILEEYRFIDIPITYR  471 (503)
Q Consensus       456 ~ll~~~~w~dip~~y~  471 (503)
                      ..|+.+.|||||+.|-
T Consensus       423 tV~~~~~~~~~~~~~p  438 (449)
T cd07136         423 SILKKSTWFDLPLRYP  438 (449)
T ss_pred             EEEECCCCCCCCCCCC
T ss_conf             7998887787570089


No 70 
>COG1628 Endonuclease V homolog [Replication, recombination, and repair]
Probab=23.37  E-value=31  Score=14.27  Aligned_cols=84  Identities=17%  Similarity=0.143  Sum_probs=47.9

Q ss_pred             CCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCC---------------CCEEEEEEE
Q ss_conf             73211032310213321246631100167469828998178825688779997307---------------811687883
Q gi|254780612|r  406 QDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEY---------------RFIDIPITY  470 (503)
Q Consensus       406 ~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~---------------~w~dip~~y  470 (503)
                      ++.-+.+-|..|+.+-     =+.|+.--    -+=|.=.|.-.+..+=+.-++.+               ||+||.-+|
T Consensus        22 ~~~~~~~vG~v~r~~~-----~v~gv~~~----~i~vDG~D~T~~i~~~v~~~~~~~~rvVlLdGIt~aGFNivDi~~l~   92 (185)
T COG1628          22 GDDKVTLVGVVMRGDR-----VVDGVAFS----LITVDGLDVTDAISDMVNRSKRRDLRVVLLDGITFAGFNIVDIEALY   92 (185)
T ss_pred             CCCEEEEEEEEEECCE-----EEEEEEEE----EEEECCCHHHHHHHHHHHHHHCCCCCEEEECCEEECCCEEECHHHHH
T ss_conf             7860899999998886-----88766778----99866823889999999985255611899877203052484289988


Q ss_pred             -CCCCEEEEEECCCCCHHHHHH---HHHHHHH
Q ss_conf             -799678999615876899999---9999985
Q gi|254780612|r  471 -RSGQKILFTIDKGKKGADIFK---SAIMQWE  498 (503)
Q Consensus       471 -~ngrra~~~~ekg~~g~~~f~---~a~~~w~  498 (503)
                       +.|+=-|--+.|++.+|+++.   +-|.-|.
T Consensus        93 ~~tg~PVi~V~~k~P~~e~i~~Al~k~f~d~e  124 (185)
T COG1628          93 KETGLPVIVVYRKKPDIERIESALRKHFDDAE  124 (185)
T ss_pred             HHHCCCEEEEEECCCCHHHHHHHHHHHCCCHH
T ss_conf             76199689999637987899999997588799


No 71 
>COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism]
Probab=23.20  E-value=35  Score=13.92  Aligned_cols=30  Identities=27%  Similarity=0.165  Sum_probs=14.8

Q ss_pred             CCCCCHHHHHHHHHH--HCCCCEEEEEEECCC
Q ss_conf             178825688779997--307811687883799
Q gi|254780612|r  444 LKGSEEDPFRNSKIL--EEYRFIDIPITYRSG  473 (503)
Q Consensus       444 l~~~~~~~~~n~~ll--~~~~w~dip~~y~ng  473 (503)
                      |++++..++|-+-=.  |=++=|+|||+|-+-
T Consensus       156 LaSLD~~RK~EilpylERL~~e~~IPIlYVSH  187 (352)
T COG4148         156 LASLDLPRKREILPYLERLRDEINIPILYVSH  187 (352)
T ss_pred             HHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEC
T ss_conf             03236510467789999987621887899936


No 72 
>cd04732 HisA HisA.  Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase catalyzes the fourth step in histidine biosynthesis, an isomerisation of the aminoaldose moiety of ProFAR to the aminoketose of PRFAR (N-(5'-phospho-D-1'-ribulosylformimino)-5-amino-1-(5''-phospho-ribosyl)-4-imidazolecarboxamide). In bacteria and archaea, ProFAR isomerase is encoded by the HisA gene.
Probab=23.04  E-value=45  Score=13.10  Aligned_cols=40  Identities=23%  Similarity=0.395  Sum_probs=17.6

Q ss_pred             CCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCE
Q ss_conf             69828998178825688779997307-81168788379967
Q gi|254780612|r  436 SKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQK  475 (503)
Q Consensus       436 ~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrr  475 (503)
                      .|.++++=|+.....+..|+.|+++- .=+++||.|..|-|
T Consensus        43 ~d~l~i~DLdaa~~~~~~n~~~I~~I~~~~~~pi~vGGGIr   83 (234)
T cd04732          43 AKWLHVVDLDGAKGGEPVNLELIEEIVKAVGIPVQVGGGIR   83 (234)
T ss_pred             CCEEEEEECCCHHHCCCCHHHHHHHHHHHCCCCEEECCCCC
T ss_conf             99899996753030891159999999976795689737717


No 73 
>cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains.
Probab=22.79  E-value=47  Score=13.00  Aligned_cols=38  Identities=16%  Similarity=0.024  Sum_probs=27.2

Q ss_pred             ECCCEEEEECCCCCH----HHHHHHHHHHCCCCEEEEEEECC
Q ss_conf             469828998178825----68877999730781168788379
Q gi|254780612|r  435 ISKNSYLISLKGSEE----DPFRNSKILEEYRFIDIPITYRS  472 (503)
Q Consensus       435 v~~~~f~v~l~~~~~----~~~~n~~ll~~~~w~dip~~y~n  472 (503)
                      +..+--.|++|++..    +..+.+++|++++--||+|+...
T Consensus        47 ~e~~ad~i~vSsl~g~~~~~~~~l~~~L~e~G~~di~vvvGG   88 (128)
T cd02072          47 IETDADAILVSSLYGHGEIDCKGLREKCDEAGLKDILLYVGG   88 (128)
T ss_pred             HHCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEECC
T ss_conf             873999999823202562489999999996799999899789


No 74 
>PRK07374 dnaE DNA polymerase III subunit alpha; Validated
Probab=22.71  E-value=47  Score=12.99  Aligned_cols=17  Identities=18%  Similarity=0.296  Sum_probs=8.8

Q ss_pred             EEEEEEECCCC-EEEEEE
Q ss_conf             16878837996-789996
Q gi|254780612|r  464 IDIPITYRSGQ-KILFTI  480 (503)
Q Consensus       464 ~dip~~y~ngr-ra~~~~  480 (503)
                      +-|-|.|.|+. ++.|.|
T Consensus      1123 ~pv~~~~~~~~~~~~~~l 1140 (1171)
T PRK07374       1123 IPVVAAVRAGNSVRYVRL 1140 (1171)
T ss_pred             CCEEEEEECCCCEEEEEE
T ss_conf             776999976983699981


No 75 
>PRK02079 pyrroloquinoline quinone biosynthesis protein PqqD; Provisional
Probab=22.54  E-value=48  Score=12.96  Aligned_cols=22  Identities=18%  Similarity=0.386  Sum_probs=19.2

Q ss_pred             CCCCHHHHHHHHHHHCCCCEEE
Q ss_conf             7882568877999730781168
Q gi|254780612|r  445 KGSEEDPFRNSKILEEYRFIDI  466 (503)
Q Consensus       445 ~~~~~~~~~n~~ll~~~~w~di  466 (503)
                      .++..|+-.=|+.|..+.||+|
T Consensus        66 ~~i~~DV~~fL~~a~~k~wi~l   87 (88)
T PRK02079         66 PGLDEDVLEFLEVARAKHWIEL   87 (88)
T ss_pred             HHHHHHHHHHHHHHHHCCCEEC
T ss_conf             1258999999999998819762


No 76 
>KOG3107 consensus
Probab=22.45  E-value=48  Score=12.96  Aligned_cols=24  Identities=17%  Similarity=0.009  Sum_probs=17.7

Q ss_pred             EEECCCCCCCCCEEEEEEEEEECC
Q ss_conf             995178987778128899997168
Q gi|254780612|r  379 MTLKCNADISLSITHVMEIMFSFP  402 (503)
Q Consensus       379 ~~~rrn~d~slpash~~e~~f~~~  402 (503)
                      -.=|||+|+|-|-...+|-+|+|.
T Consensus       180 ~rgkk~~~psp~~ds~~eRVFiWD  203 (468)
T KOG3107         180 GRGKKKTGPSPPGDSTLERVFIWD  203 (468)
T ss_pred             CCCCCCCCCCCCCCCCCEEEEEEE
T ss_conf             556666899899877600588860


No 77 
>KOG3972 consensus
Probab=22.28  E-value=47  Score=13.01  Aligned_cols=24  Identities=21%  Similarity=0.346  Sum_probs=19.6

Q ss_pred             CHHHHHHHHHHCCCCCCCHHHHHH
Q ss_conf             179999988616776560001322
Q gi|254780612|r    3 DFILVIQRAVDNLPENTPERRSHI   26 (503)
Q Consensus         3 D~~~~l~kai~~l~~~tpe~r~~v   26 (503)
                      -||-|||||-.||.-.+.+-|-..
T Consensus        81 ayyklLkka~~GL~si~~d~~~~~  104 (252)
T KOG3972          81 AYYKLLKKAQEGLNSITEDGRLHN  104 (252)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCCCH
T ss_conf             999999999877873585677761


No 78 
>pfam10794 DUF2606 Protein of unknown function, DUF2606. Family of bacterial proteins with unknown function.
Probab=22.12  E-value=48  Score=12.91  Aligned_cols=78  Identities=15%  Similarity=0.221  Sum_probs=53.6

Q ss_pred             EEEEE-EEEEECCCCCCCCHHHCCCCEECCCCCCCCCCC-CCEEE-----------E-ECCCEEEEECCCCCHHHHHHHH
Q ss_conf             12889-999716888873211032310213321246631-10016-----------7-4698289981788256887799
Q gi|254780612|r  391 ITHVM-EIMFSFPKESQDAVVDLRRISMRKTDNSPSVLI-DSNIF-----------V-ISKNSYLISLKGSEEDPFRNSK  456 (503)
Q Consensus       391 ash~~-e~~f~~~~~~~~~v~~v~~~~~k~~e~~~g~~l-~~~~~-----------k-v~~~~f~v~l~~~~~~~~~n~~  456 (503)
                      +|.++ -++|-+-+--|.-|.+|.-++||..+.+..--- +|..-           | +..|-|.|.|-+.+.-..||+.
T Consensus        37 ~~kv~~pVT~hVen~e~qpik~~ei~lmKa~d~~p~Pskeig~~IGKTd~eGkiiWk~~rKG~Yiv~lp~~et~~~r~is  116 (131)
T pfam10794        37 AKKVVNPVTLHVEDAEGQPIKGVEVTLMKAADSDPQPSKEIGEIIGKTDHEGKIIWKSGRKGKYIVVLPKNETPETRNIS  116 (131)
T ss_pred             HCEECCCEEEEEECCCCCCCCCEEEEEEECCCCCCCCCHHHHEEECCCCCCCEEEEECCCCCEEEEEECCCCCEEEEEEE
T ss_conf             41003747999825778932675999996565699974212114455476770887337772089996389850577542


Q ss_pred             HHHCCCCEEEEE
Q ss_conf             973078116878
Q gi|254780612|r  457 ILEEYRFIDIPI  468 (503)
Q Consensus       457 ll~~~~w~dip~  468 (503)
                      |...|.=-.|+.
T Consensus       117 l~~dr~~~~vi~  128 (131)
T pfam10794       117 LIEDRKDHKVIS  128 (131)
T ss_pred             EECCCCCCEEEE
T ss_conf             102677635898


No 79 
>cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria.  The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair.  The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here.  It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=22.00  E-value=49  Score=12.89  Aligned_cols=41  Identities=15%  Similarity=0.138  Sum_probs=28.4

Q ss_pred             ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEE
Q ss_conf             4698289981788256887799973078116878837996-789996
Q gi|254780612|r  435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTI  480 (503)
Q Consensus       435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~  480 (503)
                      .+.++.-|..+..=   ..=+.||.+.+.  ||++..||+ ..|||-
T Consensus        69 M~~~~~tv~~~~~i---~~~~~~m~~~~~--lpVVD~~~~lvGIITr  110 (116)
T cd04643          69 MNTDVPVIIDDADI---EEILHLLIDQPF--LPVVDDDGIFIGIITR  110 (116)
T ss_pred             HCCCCEEEECCCCH---HHHHHHHHHCCE--EEEEECCCEEEEEEEH
T ss_conf             15698899079999---999999987898--9999459999999992


No 80 
>PRK06826 dnaE DNA polymerase III DnaE; Reviewed
Probab=21.91  E-value=49  Score=12.88  Aligned_cols=40  Identities=8%  Similarity=0.042  Sum_probs=20.7

Q ss_pred             HHHCCCCEEEEEEECCCCEEEEEECC--CCCHHHHHHHHHHHH
Q ss_conf             97307811687883799678999615--876899999999998
Q gi|254780612|r  457 ILEEYRFIDIPITYRSGQKILFTIDK--GKKGADIFKSAIMQW  497 (503)
Q Consensus       457 ll~~~~w~dip~~y~ngrra~~~~ek--g~~g~~~f~~a~~~w  497 (503)
                      |.+.++=+.|-|.|.++++ .|-|.+  ...-...|-+.+..-
T Consensus      1107 l~~~~G~~~V~l~~~~~~~-~~~L~~~~~V~~~~~l~~~L~~l 1148 (1157)
T PRK06826       1107 LKKHPGNTPVYLYTEKERK-KFKLDRELWVSLSEELIDNLRNL 1148 (1157)
T ss_pred             HHHCCCCCEEEEEECCCCE-EEECCCCCCCCCCHHHHHHHHHH
T ss_conf             9739799859999868977-99889987467799999999986


No 81 
>COG3814 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.89  E-value=44  Score=13.19  Aligned_cols=12  Identities=8%  Similarity=0.445  Sum_probs=6.5

Q ss_pred             EECCCEEEEECC
Q ss_conf             746982899817
Q gi|254780612|r  434 VISKNSYLISLK  445 (503)
Q Consensus       434 kv~~~~f~v~l~  445 (503)
                      ||+|.+|-|+||
T Consensus        75 ~v~dtgFsv~ls   86 (157)
T COG3814          75 KVTDTGFSVTLS   86 (157)
T ss_pred             EECCCCEEEEEE
T ss_conf             102462489988


No 82 
>TIGR03572 WbuZ glycosyl amidation-associated protein WbuZ. This clade of sequences is highly similar to the HisF protein, but generally represents the second HisF homolog in the genome where the other is an authentic HisF observed in the context of a complete histidine biosynthesis operon. The similarity between these WbuZ sequences and true HisFs is such that often the closest match by BLAST of a WbuZ is a HisF. Only by making a multiple sequence alignment is the homology relationship among the WbuZ sequences made apparent. WbuZ genes are invariably observed in the presence of a homolog of the HisH protein (designated WbuY) and a proposed N-acetyl sugar amidotransferase designated in WbuX in E. coli, IfnA in P. aeriginosa and PseA in C. jejuni. Similarly, this trio of genes is invariably found in the context of saccharide biosynthesis loci. It has been shown that the WbuYZ homologs are not essential components of the activity expressed by WbuX, leading to the proposal that these to pr
Probab=21.85  E-value=16  Score=16.30  Aligned_cols=65  Identities=8%  Similarity=0.103  Sum_probs=43.4

Q ss_pred             CCCCEECCCCCCCCCCCCCE---EEEECCCEEEEECCCCCHHHHHHHHHHHCCC-CEEEEEEECCCCEEE
Q ss_conf             32310213321246631100---1674698289981788256887799973078-116878837996789
Q gi|254780612|r  412 LRRISMRKTDNSPSVLIDSN---IFVISKNSYLISLKGSEEDPFRNSKILEEYR-FIDIPITYRSGQKIL  477 (503)
Q Consensus       412 v~~~~~k~~e~~~g~~l~~~---~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~-w~dip~~y~ngrra~  477 (503)
                      |++..||. +.--|+|+.-+   .-+=.|-.++|=|+.....+..|+.|+++-. =+.|||.|..|=|-+
T Consensus        18 Vk~~~~~~-~~~~gdP~~~ak~~~~~g~d~lhivDld~a~~~~~~n~~~I~~i~~~~~ipi~vGGGIrs~   86 (232)
T TIGR03572        18 VKTVQFKD-PRYIGDPVNAARIYNAKGADELIVLDIDASKRGREPLFELISNLAEECFMPLTVGGGIRSL   86 (232)
T ss_pred             EECCCCCC-CEECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEEECEEEH
T ss_conf             97847877-6578899999999998699999999687643488217999999999729858997133038


No 83 
>pfam00948 Flavi_NS1 Flavivirus non-structural Protein NS1. The NS1 protein is well conserved amongst the flaviviruses. It contains 12 cysteines, and undergoes glycosylation in a similar manner to other NS proteins. Mutational analysis has strongly implied a role for NS1 in the early stages of RNA replication.
Probab=21.80  E-value=46  Score=13.09  Aligned_cols=51  Identities=14%  Similarity=0.116  Sum_probs=28.1

Q ss_pred             CCCEEEEEEECCCCCEEEEEEECCCCCC-------------CCCEEEEEE------EEEECCCCCCCCHHH
Q ss_conf             6742367630787771799995178987-------------778128899------997168888732110
Q gi|254780612|r  360 KGLVIKGDIPMIDNAFSASMTLKCNADI-------------SLSITHVME------IMFSFPKESQDAVVD  411 (503)
Q Consensus       360 ~~~aira~i~iP~~~~~~~~~~rrn~d~-------------slpash~~e------~~f~~~~~~~~~v~~  411 (503)
                      -..||++++.| -.+..|=|-=+.|.--             .-|.|||+.      -..++|.-.||=++.
T Consensus       183 ~GaAVK~~~aV-HtD~~~WmeS~~NgTw~i~~l~~~~~~~C~WP~shTl~~~~v~eS~lfiP~~lgGP~S~  252 (355)
T pfam00948       183 MGAAVKNGRAV-HTDLSYWIESRLNGTWKIERAELGEVKSCTWPETHTLWGDGVLESELFIPVTLAGPRSQ  252 (355)
T ss_pred             EEHEEECCEEE-ECCCCEEEEECCCCEEEEEEEEECCEEEEECCCCCCCCCCCCCCCCEECCCCCCCCCHH
T ss_conf             00211188659-65886278974688199999986130564288761259998440230054302587200


No 84 
>CHL00073 chlN photochlorophyllide reductase subunit N
Probab=21.67  E-value=35  Score=13.86  Aligned_cols=12  Identities=17%  Similarity=0.127  Sum_probs=5.3

Q ss_pred             EEEEEEECCCCC
Q ss_conf             899997168888
Q gi|254780612|r  394 VMEIMFSFPKES  405 (503)
Q Consensus       394 ~~e~~f~~~~~~  405 (503)
                      -|||+|.==.+|
T Consensus       415 SiEl~F~pIHGf  426 (457)
T CHL00073        415 SVEFTFAQIHGF  426 (457)
T ss_pred             EEEEEECCCCCH
T ss_conf             787772043460


No 85 
>PRK06273 ferredoxin; Provisional
Probab=21.53  E-value=45  Score=13.14  Aligned_cols=25  Identities=12%  Similarity=0.149  Sum_probs=15.4

Q ss_pred             C-CCHHHC---CCCEECCCCCCCCCCCCCEEEEECCCEE
Q ss_conf             7-321103---2310213321246631100167469828
Q gi|254780612|r  406 Q-DAVVDL---RRISMRKTDNSPSVLIDSNIFVISKNSY  440 (503)
Q Consensus       406 ~-~~v~~v---~~~~~k~~e~~~g~~l~~~~~kv~~~~f  440 (503)
                      | ||-+||   ..|.|+.-|          |||||||+-
T Consensus        55 GC~GCaNvCPT~AIeM~~~e----------PVkite~~v   83 (163)
T PRK06273         55 GCGGCANACPTKAIEMIPVE----------PVKITETYV   83 (163)
T ss_pred             CCCCCCCCCCCCCEEEEECC----------CEEEECCHH
T ss_conf             65550023887755777656----------567411033


No 86 
>TIGR02318 phosphono_phnM phosphonate metabolism protein PhnM; InterPro: IPR012696    This family consists of proteins from in the PhnM family. PhnM is a protein associated with phosphonate utilization in a number of bacterial species. Pseudomonas stutzeri WM88, has a related protein that is part of a system for the oxidation of phosphites (another form of reduced phosphorous compound), which is not contained in this entry..
Probab=21.42  E-value=50  Score=12.81  Aligned_cols=51  Identities=22%  Similarity=0.244  Sum_probs=33.7

Q ss_pred             CCEEEEECCCCCHHHH--HHHHHHHCCCCEEEEEEECCCCEE-EEEECCCCCHHHHHHHHHHHHHHCCCC
Q ss_conf             9828998178825688--779997307811687883799678-999615876899999999998522369
Q gi|254780612|r  437 KNSYLISLKGSEEDPF--RNSKILEEYRFIDIPITYRSGQKI-LFTIDKGKKGADIFKSAIMQWENRSNN  503 (503)
Q Consensus       437 ~~~f~v~l~~~~~~~~--~n~~ll~~~~w~dip~~y~ngrra-~~~~ekg~~g~~~f~~a~~~w~~~~~~  503 (503)
                      +..-||.|.|+-.=+-  +|+++.|+         |-.|||+ +||       +..|++-|..|++..+|
T Consensus       162 ~~v~L~SlMDHtPGQrQ~~~~~~yr~---------~~~~k~grvls-------d~~~~~~~~~r~~~~~~  215 (391)
T TIGR02318       162 PRVDLISLMDHTPGQRQFRDLEKYRE---------YYQKKRGRVLS-------DDEFDEIVEERIARSAE  215 (391)
T ss_pred             CCCEEEEEECCCCHHHCCHHHHHHHH---------HHHHHHCCCCC-------HHHHHHHHHHHHHHHCC
T ss_conf             97018886068870110112578999---------87245267458-------99999999999997358


No 87 
>pfam03452 Anp1 Anp1. The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins co-localize within the cis Golgi, and that they are physically associated in two distinct complexes.
Probab=21.40  E-value=50  Score=12.81  Aligned_cols=77  Identities=17%  Similarity=0.200  Sum_probs=38.4

Q ss_pred             EEEEEEEEECCCCC-CCCHHHCCCCEECCCCCC--CCCCCCCEEEEECCCEEEEECCCCCH---------------HHHH
Q ss_conf             28899997168888-732110323102133212--46631100167469828998178825---------------6887
Q gi|254780612|r  392 THVMEIMFSFPKES-QDAVVDLRRISMRKTDNS--PSVLIDSNIFVISKNSYLISLKGSEE---------------DPFR  453 (503)
Q Consensus       392 sh~~e~~f~~~~~~-~~~v~~v~~~~~k~~e~~--~g~~l~~~~~kv~~~~f~v~l~~~~~---------------~~~~  453 (503)
                      -|+|+|-|++++.. |-..-.+-...++.....  --....-+.. +..+| .-.+|-...               .+.|
T Consensus        55 h~lI~L~Flv~~~~~~d~t~~~l~~~~~~~Q~~~~~~~~F~~iti-~~kdf-~~~~~q~~~~RH~~~~Q~~RR~~maraR  132 (270)
T pfam03452        55 HELIDLGFLVPDTKEGDKTLEVLAAALKKTQSKGDGEKRFRSITI-LRKDF-GDVEGQSESDRHAFAVQGPRRKLMARAR  132 (270)
T ss_pred             HHHEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE-ECCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHH
T ss_conf             345357887168876316999999999997417876777325799-72775-3102545454301334478999999999


Q ss_pred             HHHHHHCCC-------CEEEEEEE
Q ss_conf             799973078-------11687883
Q gi|254780612|r  454 NSKILEEYR-------FIDIPITY  470 (503)
Q Consensus       454 n~~ll~~~~-------w~dip~~y  470 (503)
                      |--|.....       |+|..|+-
T Consensus       133 N~Ll~~aL~p~~swVlW~DaDiv~  156 (270)
T pfam03452       133 NWLLSTALKPYHSWVLWLDADIVE  156 (270)
T ss_pred             HHHHHHHCCCCCCEEEEEECCHHH
T ss_conf             999998438876469998264322


No 88 
>cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown.  In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi
Probab=20.94  E-value=51  Score=12.75  Aligned_cols=33  Identities=12%  Similarity=-0.060  Sum_probs=25.2

Q ss_pred             CHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEE
Q ss_conf             256887799973078116878837996-789996
Q gi|254780612|r  448 EEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTI  480 (503)
Q Consensus       448 ~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~  480 (503)
                      +.+...=+.||.+++---+|++..+|| ..|||.
T Consensus        67 d~~l~~a~~~M~~~~i~~lPVVD~~grlvGivT~  100 (106)
T cd04582          67 DDDLRIVLSRMFAHDMSWLPCVDEDGRYVGEVTQ  100 (106)
T ss_pred             CCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEH
T ss_conf             9999999999997298762589899909999986


No 89 
>TIGR01858 tag_bisphos_ald class II aldolase, tagatose bisphosphate family; InterPro: IPR011288   This entry describes tagatose-1,6-bisphosphate aldolases, and perhaps other closely related class II aldolases. This tetrameric, Zn2+-dependent enzyme is related to the class II fructose bisphosphate aldolase; fructose 1,6-bisphosphate and tagatose 1,6 bisphosphate differ only in chirality at C4 , .; GO: 0008270 zinc ion binding, 0016832 aldehyde-lyase activity, 0019512 lactose catabolic process via tagatose-6-phosphate.
Probab=20.88  E-value=38  Score=13.61  Aligned_cols=44  Identities=14%  Similarity=0.183  Sum_probs=29.1

Q ss_pred             HHHCCCCEEEEEEECCC--------CEE-EEEECCCCCH---HHHHHHHHHHHHHC
Q ss_conf             97307811687883799--------678-9996158768---99999999998522
Q gi|254780612|r  457 ILEEYRFIDIPITYRSG--------QKI-LFTIDKGKKG---ADIFKSAIMQWENR  500 (503)
Q Consensus       457 ll~~~~w~dip~~y~ng--------rra-~~~~ekg~~g---~~~f~~a~~~w~~~  500 (503)
                      |-+=|.|.|||||.=.+        ||+ .|=|=|=--|   .=+|.+|+++|...
T Consensus       192 L~~IR~~v~vPLVLHGASdvpde~v~~~IeLGi~KvNVATeLKiAFS~AlK~Yf~e  247 (282)
T TIGR01858       192 LAKIREKVDVPLVLHGASDVPDEDVRRTIELGICKVNVATELKIAFSGALKAYFLE  247 (282)
T ss_pred             HHHHHHCCCCCEEECCCCCCCHHHHHHHHHHCCCEEECCHHCCHHHHHHHHHHHHH
T ss_conf             98752203777242177788767899998707413411111003689999999974


No 90 
>cd01339 LDH-like_MDH L-lactate dehydrogenase-like malate dehydrogenase proteins. Members of this subfamily have an LDH-like structure and an MDH enzymatic activity. Some members, like MJ0490 from Methanococcus jannaschii, exhibit both MDH and LDH activities. Tetrameric MDHs, including those from phototrophic bacteria, are more similar to LDHs than to other MDHs. LDH catalyzes the last step of glycolysis in which pyruvate is converted to L-lactate. MDH is one of the key enzymes in the citric acid cycle, facilitating both the conversion of malate to oxaloacetate and replenishing levels of oxalacetate by reductive carboxylation of pyruvate. The LDH-like MDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenas
Probab=20.77  E-value=23  Score=15.14  Aligned_cols=45  Identities=11%  Similarity=0.240  Sum_probs=29.8

Q ss_pred             EECCCCCCCCCEEEEEE-------EEEECCCCC-CCCHHHCCCCEECCCCCCC
Q ss_conf             95178987778128899-------997168888-7321103231021332124
Q gi|254780612|r  380 TLKCNADISLSITHVME-------IMFSFPKES-QDAVVDLRRISMRKTDNSP  424 (503)
Q Consensus       380 ~~rrn~d~slpash~~e-------~~f~~~~~~-~~~v~~v~~~~~k~~e~~~  424 (503)
                      .|.+|...-||+|+.++       +.|.+|--. ..||..|-.+.|-..||.+
T Consensus       235 aIl~d~~~v~pvs~~~~g~yG~~~v~~s~P~~ig~~Gv~~i~~l~L~~~E~~~  287 (300)
T cd01339         235 AILKDKKRVLPCSAYLEGEYGIKDIFVGVPVVLGKNGVEKIIELDLTDEEKEA  287 (300)
T ss_pred             HHHCCCCCEEEEEEEECCCCCCCCEEEEEEEEECCCCEEEECCCCCCHHHHHH
T ss_conf             99647897899999844777860479998999858831898589999999999


No 91 
>cd03557 L-arabinose_isomerase L-Arabinose isomerase (AI) catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion into D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute.
Probab=20.73  E-value=52  Score=12.72  Aligned_cols=108  Identities=11%  Similarity=0.118  Sum_probs=65.3

Q ss_pred             CCCCCCCCCCCCCCCCCCCEEECCCCC--CCCC--CE--EEEEEECCCCCEE--------EEEEECCCCCCCCCEEEEEE
Q ss_conf             345545455544466622123045777--7767--42--3676307877717--------99995178987778128899
Q gi|254780612|r  331 VFINKGRGQSSILSGKILWSLQQEKSQ--GLKG--LV--IKGDIPMIDNAFS--------ASMTLKCNADISLSITHVME  396 (503)
Q Consensus       331 ~~~~e~~~~~~~~~GsVvWs~~~es~~--~~~~--~a--ira~i~iP~~~~~--------~~~~~rrn~d~slpash~~e  396 (503)
                      +.+-|+...-+..+|-.+-+|-.+.-+  +.-+  +|  +|+-=-+ ..+|.        -+|-|.++.+.-|= ||..|
T Consensus       270 Tt~FEdL~Gl~QLPGLAvQrLMa~GYGFggEGDwKTAal~r~mKvm-~~gl~gGtSFMEDYTy~f~~g~~~iLG-AHMLE  347 (484)
T cd03557         270 TTTFEDLHGLKQLPGLAVQRLMAEGYGFGAEGDWKTAALVRAMKVM-SAGLPGGTSFMEDYTYHFEPGNELVLG-AHMLE  347 (484)
T ss_pred             EEHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCCCCCCHHHHHCCCCCCEEEHH-HHCCC
T ss_conf             0012045670016216787786538663444118999999999998-378999822144544028999844303-10055


Q ss_pred             EEEECCCC-------C-C-CCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEE
Q ss_conf             99716888-------8-7-3211032310213321246631100167469828998
Q gi|254780612|r  397 IMFSFPKE-------S-Q-DAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLIS  443 (503)
Q Consensus       397 ~~f~~~~~-------~-~-~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~  443 (503)
                      +-++.-.+       + | |+-.+-.||.|....   |..+..--+-..|.|+||.
T Consensus       348 vcpsIA~~KP~levhPLgIGgk~DP~RLVF~~~~---Gpav~vslvD~G~rFRLiv  400 (484)
T cd03557         348 VCPSIAAAKPRLEVHPLGIGGKEDPARLVFTGKA---GPAVNASLVDMGNRFRLVV  400 (484)
T ss_pred             CCHHHHCCCCEEEECCCCCCCCCCCCEEEEECCC---CCEEEEEEEECCCCEEEEE
T ss_conf             6815536895467615787886886269997698---8879999986588479999


No 92 
>PRK05673 dnaE DNA polymerase III subunit alpha; Validated
Probab=20.50  E-value=52  Score=12.68  Aligned_cols=44  Identities=7%  Similarity=0.080  Sum_probs=22.5

Q ss_pred             HHCCCCEEEEEEECCCC-EEEEEECCC--CCHHHHHHHHHHHHHHCC
Q ss_conf             73078116878837996-789996158--768999999999985223
Q gi|254780612|r  458 LEEYRFIDIPITYRSGQ-KILFTIDKG--KKGADIFKSAIMQWENRS  501 (503)
Q Consensus       458 l~~~~w~dip~~y~ngr-ra~~~~ekg--~~g~~~f~~a~~~w~~~~  501 (503)
                      .+.++=..|-|.|.+.. ...|.|-+.  ..-+..|-+.+..-.+..
T Consensus      1092 ~~~~G~~~v~l~~~~~~~~~~~~l~~~~~v~~~~~l~~~l~~l~G~~ 1138 (1143)
T PRK05673       1092 EPGPGTVPVHLYYQDGDAEAELRLGDRWRVSPSDELLGDLKALLGVE 1138 (1143)
T ss_pred             HHCCCCCEEEEEEECCCEEEEEECCCCCEECCCHHHHHHHHHHHCCC
T ss_conf             70989986999997698279998699852777999999999850976


No 93 
>COG2014 Uncharacterized conserved protein [Function unknown]
Probab=20.38  E-value=22  Score=15.35  Aligned_cols=23  Identities=22%  Similarity=0.209  Sum_probs=15.0

Q ss_pred             HHCCCCEEEEEEECCCC-EEEEEE
Q ss_conf             73078116878837996-789996
Q gi|254780612|r  458 LEEYRFIDIPITYRSGQ-KILFTI  480 (503)
Q Consensus       458 l~~~~w~dip~~y~ngr-ra~~~~  480 (503)
                      |++++|+||-..-.+++ .+.|.+
T Consensus        98 l~~~~~~Dil~li~~~d~IkmI~~  121 (250)
T COG2014          98 LEEANWFDILDLIQRDDKIKMIAE  121 (250)
T ss_pred             HHHCCHHHHHHHHCCCCCEEEEEE
T ss_conf             775346779998747873457775


No 94 
>COG4742 Predicted transcriptional regulator [Transcription]
Probab=20.17  E-value=18  Score=15.90  Aligned_cols=22  Identities=9%  Similarity=0.064  Sum_probs=17.0

Q ss_pred             EEEEECCCEEEEECCCCCHHHH
Q ss_conf             0167469828998178825688
Q gi|254780612|r  431 NIFVISKNSYLISLKGSEEDPF  452 (503)
Q Consensus       431 ~~~kv~~~~f~v~l~~~~~~~~  452 (503)
                      +.+-|||.||+++|-+.+..-+
T Consensus       207 ~~~~VTD~f~~L~Lf~k~G~~d  228 (260)
T COG4742         207 ASLIVTDRFMALSLFDKDGRYD  228 (260)
T ss_pred             EEEEEECHHHHHCCCCCCCCCC
T ss_conf             2688861146400415699807


Done!