Query gi|254780427|ref|YP_003064840.1| hypothetical protein CLIBASIA_01560 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 171 No_of_seqs 112 out of 1735 Neff 7.0 Searched_HMMs 39220 Date Sun May 29 19:58:01 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780427.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00016 putative metalloprote 100.0 0 0 300.1 18.7 153 10-170 1-157 (159) 2 COG0319 Predicted metal-depend 100.0 0 0 294.1 17.0 152 10-171 2-153 (153) 3 PRK13963 unkown domain/putativ 100.0 0 0 290.1 16.1 125 34-169 133-258 (258) 4 TIGR00043 TIGR00043 conserved 100.0 0 0 294.2 11.9 109 55-163 1-123 (123) 5 pfam02130 UPF0054 Uncharacteri 100.0 1.7E-40 4.2E-45 254.2 16.4 110 54-163 31-141 (141) 6 pfam11350 DUF3152 Protein of u 72.9 5.2 0.00013 20.3 3.7 97 53-168 81-186 (189) 7 cd04268 ZnMc_MMP_like Zinc-dep 71.3 5.1 0.00013 20.3 3.4 44 94-145 70-113 (165) 8 cd04276 ZnMc_MMP_like_2 Zinc-d 69.1 9.5 0.00024 18.8 4.9 67 78-144 59-134 (197) 9 COG5149 TOA1 Transcription ini 68.0 3 7.8E-05 21.6 1.7 27 62-88 30-56 (293) 10 TIGR00232 tktlase_bact transke 66.7 3.2 8.1E-05 21.5 1.6 13 129-141 65-77 (675) 11 pfam06262 DUF1025 Domain of un 64.2 6.9 0.00018 19.6 2.9 41 103-144 50-90 (96) 12 smart00235 ZnMc Zinc-dependent 59.5 4.4 0.00011 20.7 1.2 17 129-145 89-105 (140) 13 TIGR01120 rpiB ribose 5-phosph 55.6 7.1 0.00018 19.5 1.7 38 57-113 80-117 (143) 14 cd04281 ZnMc_BMP1_TLD Zinc-dep 53.0 7.2 0.00018 19.5 1.4 17 128-144 89-105 (200) 15 cd00203 ZnMc Zinc-dependent me 50.9 9.7 0.00025 18.7 1.8 21 125-145 95-115 (167) 16 cd04280 ZnMc_astacin_like Zinc 50.7 8.9 0.00023 18.9 1.6 54 77-145 38-93 (180) 17 cd04283 ZnMc_hatching_enzyme Z 49.7 8.9 0.00023 18.9 1.4 38 107-144 55-95 (182) 18 cd04282 ZnMc_meprin Zinc-depen 49.0 10 0.00027 18.5 1.7 57 73-144 80-138 (230) 19 cd04327 ZnMc_MMP_like_3 Zinc-d 48.8 5.7 0.00014 20.1 0.3 23 123-145 89-111 (198) 20 pfam01400 Astacin Astacin (Pep 46.6 9.5 0.00024 18.8 1.2 17 128-144 82-98 (192) 21 cd04278 ZnMc_MMP Zinc-dependen 46.3 15 0.00039 17.6 2.2 108 29-152 24-132 (157) 22 pfam01457 Peptidase_M8 Leishma 45.4 16 0.0004 17.5 2.1 41 96-143 217-257 (569) 23 pfam00413 Peptidase_M10 Matrix 43.2 27 0.00069 16.1 4.2 107 30-151 25-131 (158) 24 smart00731 SprT SprT homologue 42.0 28 0.00072 16.0 7.2 81 65-151 5-88 (146) 25 pfam04450 BSP Plant Basic Secr 36.7 34 0.00087 15.5 5.7 62 102-165 72-133 (200) 26 KOG3714 consensus 32.0 17 0.00043 17.3 0.6 40 106-145 136-178 (411) 27 KOG4584 consensus 30.9 38 0.00097 15.3 2.2 24 146-169 84-107 (348) 28 cd00770 SerRS_core Seryl-tRNA 30.2 24 0.00062 16.4 1.1 51 118-170 137-199 (297) 29 PTZ00337 surface protease GP63 29.6 28 0.0007 16.1 1.3 40 97-145 209-248 (567) 30 COG0021 TktA Transketolase [Ca 29.2 46 0.0012 14.8 2.7 18 65-82 224-242 (663) 31 pfam00587 tRNA-synt_2b tRNA sy 28.5 44 0.0011 14.9 2.2 33 136-170 113-145 (170) 32 pfam07759 DUF1615 Protein of u 24.8 55 0.0014 14.3 2.6 72 68-144 98-177 (318) 33 KOG3618 consensus 24.4 48 0.0012 14.7 1.8 24 117-140 221-244 (1318) 34 TIGR03006 pepcterm_polyde poly 24.4 56 0.0014 14.3 2.5 55 98-165 52-111 (274) 35 PRK05431 seryl-tRNA synthetase 21.5 64 0.0016 14.0 2.6 32 137-170 284-317 (422) 36 PRK00912 ribonuclease P protei 20.4 68 0.0017 13.8 4.4 37 56-92 63-118 (237) 37 pfam08328 ASL_C Adenylosuccina 20.2 43 0.0011 15.0 0.9 13 157-169 61-73 (115) No 1 >PRK00016 putative metalloprotease; Provisional Probab=100.00 E-value=0 Score=300.11 Aligned_cols=153 Identities=42% Similarity=0.647 Sum_probs=126.7 Q ss_pred CEEEEEEECCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCC Q ss_conf 30588885485211113525799999999999974322014667438999985779999999998504898515663255 Q gi|254780427|r 10 RLDLQIAVENALWGDEIHLRTLCEVVFAKAVSNLISKGYFVKENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTA 89 (171) Q Consensus 10 ~i~I~i~ie~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~ 89 (171) +|+|++.++...|... ..+.+. ..+...+.. ......+||||.|||+++|++||++||++|+||||||||+. T Consensus 1 ~m~i~i~~e~~~~~~~-----~~~~~~-~~~~~~~~~--~~~~~~~Elsi~~vd~~~i~~LN~~yr~kd~pTDVLSFp~~ 72 (159) T PRK00016 1 ILDIEIANETGQVSEE-----QFEKWL-ELLLFALDK--LGVQPEAELSIRLVDNAEIRELNLEYRGKDKPTDVLSFPME 72 (159) T ss_pred CEEEEEECCCCCCCHH-----HHHHHH-HHHHHHHHH--CCCCCCEEEEEEEECHHHHHHHHHHHHCCCCCCCEEEEECC T ss_conf 9059974577999768-----999999-999999997--59998817999998989999999999589999867874065 Q ss_pred CCCC----CCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 6677----765653002046518888744666189899999999999899863888998899999999999999986888 Q gi|254780427|r 90 FASS----DGCSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVMEGLERSILEDLGIN 165 (171) Q Consensus 90 ~~~~----~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~~~~~~M~~~E~~iL~~l~i~ 165 (171) .... ...+..+|||||||+++|.+||+++++++.+|++||+|||+|||+||||.+++++++|+++|+++|+++||+ T Consensus 73 ~~~~~~~~~~~~~~~LGDI~Is~e~i~~eA~e~~~s~~~el~~l~iHG~LHLlGYDH~~~~~~~~M~~~E~~iL~~lgi~ 152 (159) T PRK00016 73 EEPELEEPPGEEPPLLGDIVICPEVAEEQAEEQGHSLEAELAHLTVHGILHLLGYDHIEDEEAEEMEALEEEILAAFGLP 152 (159) T ss_pred CCCCCCCCCCCCCCCEEEEEECHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 66554445566765023489769999989998199999999999999999871788798689999999999999976999 Q ss_pred CCCCC Q ss_conf 86447 Q gi|254780427|r 166 DPYEV 170 (171) Q Consensus 166 ~PY~~ 170 (171) |||-. T Consensus 153 ~py~~ 157 (159) T PRK00016 153 DPYIA 157 (159) T ss_pred CCCCC T ss_conf 98867 No 2 >COG0319 Predicted metal-dependent hydrolase [General function prediction only] Probab=100.00 E-value=0 Score=294.08 Aligned_cols=152 Identities=39% Similarity=0.612 Sum_probs=126.0 Q ss_pred CEEEEEEECCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCC Q ss_conf 30588885485211113525799999999999974322014667438999985779999999998504898515663255 Q gi|254780427|r 10 RLDLQIAVENALWGDEIHLRTLCEVVFAKAVSNLISKGYFVKENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTA 89 (171) Q Consensus 10 ~i~I~i~ie~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~ 89 (171) +++|++..++ |..... ....+.+++..++.... .+..|+||+|||+++|++||++||++|+||||||||+. T Consensus 2 ~~~i~~~~~~--~~~~~~----~~~~~~~~~~~~l~~~~---~~~~elsv~~vdd~ei~eLN~~yR~kD~pTDVLSFp~~ 72 (153) T COG0319 2 MMDIEVSDED--GQVRLP----ELKQFEEWLAAALKPEK---ISTAELSVVLVDDAEIRELNLEYRGKDKPTDVLSFPYN 72 (153) T ss_pred CCCEEECCCC--CCCCCH----HHHHHHHHHHHHHHHCC---CCCCEEEEEEECHHHHHHHHHHHCCCCCCCCEEEECCC T ss_conf 7530000277--763620----68999999999986504---88726899995779999999998089988745770588 Q ss_pred CCCCCCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66777656530020465188887446661898999999999998998638889988999999999999999868888644 Q gi|254780427|r 90 FASSDGCSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVMEGLERSILEDLGINDPYE 169 (171) Q Consensus 90 ~~~~~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~~~~~~M~~~E~~iL~~l~i~~PY~ 169 (171) +... ...+.+|||||||+++|.+||.++|+|+++|++||+|||+|||+||||.+++++++|+..|++||..+|+++||. T Consensus 73 e~~~-~~~~~~LGDIvI~~~~~~~qA~e~g~s~~~e~~~l~vHG~LHLlGYDH~~~~e~~~M~~le~~iL~~l~~~~~~~ 151 (153) T COG0319 73 EPGE-AGEPPLLGDIVICPEVAEEQAKEQGHSLERELAHLTIHGILHLLGYDHEEDEEEEEMEELEEEILAALGLSDPYL 151 (153) T ss_pred CCCC-CCCCCCCEEEEECHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 9877-777766500687589999999980998999999999999999716889992589999999999999828775454 Q ss_pred CC Q ss_conf 79 Q gi|254780427|r 170 VD 171 (171) Q Consensus 170 ~~ 171 (171) .+ T Consensus 152 ~~ 153 (153) T COG0319 152 TA 153 (153) T ss_pred CC T ss_conf 79 No 3 >PRK13963 unkown domain/putative metalloprotease fusion protein; Provisional Probab=100.00 E-value=0 Score=290.14 Aligned_cols=125 Identities=38% Similarity=0.583 Sum_probs=111.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCHHEEEEEHHHHHH Q ss_conf 99999999974322014667438999985779999999998504898515663255667776565300204651888874 Q gi|254780427|r 34 VVFAKAVSNLISKGYFVKENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEIIEI 113 (171) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs~~~i~~ 113 (171) ..+.+|+...+. ...||+|+|||.+++|.||++||+||+||||||||+.+.+. ...+||||||++++.+ T Consensus 133 ~~~~~W~~~al~-------~daelTvR~Vd~~E~r~LN~~YRgKDypTNVLSFp~~e~~~----~pllGDIVIC~pVV~r 201 (258) T PRK13963 133 KLIAEWIEPALF-------ADAQLTVRFVGEEEGRTLNHGYRHKDYATNVLTFAYGEEPD----GPVVGDLVLCCPVVEK 201 (258) T ss_pred HHHHHHHHHHHC-------CCCEEEEEEECHHHHHHHHHHHCCCCCCCCEEEECCCCCCC----CCCEEEEEECHHHHHH T ss_conf 999999999856-------79779999858599999999864999898457523888889----9853338962899999 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 4666189899999999999899863888998-8999999999999999868888644 Q gi|254780427|r 114 EANVLGKEFENHLVHLIIHGFLHLLGYDHVD-DKDACVMEGLERSILEDLGINDPYE 169 (171) Q Consensus 114 eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~-~~~~~~M~~~E~~iL~~l~i~~PY~ 169 (171) ||+++++++.+|++||+|||+|||+||||+. +.+++.|+.+|++||+++||+|||+ T Consensus 202 EA~EQgKsl~~H~AHLvVHG~LHLLGYDHe~~d~eA~~ME~lEi~iL~~LGi~nPYk 258 (258) T PRK13963 202 EAREQGKPLVAHYAHLLVHGALHAQGYDHETDDEDAAEMEALETDILAKLGFPNPYR 258 (258) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 999848999999999999999996368888887999999999999999759999999 No 4 >TIGR00043 TIGR00043 conserved hypothetical protein TIGR00043; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions .. Probab=100.00 E-value=0 Score=294.22 Aligned_cols=109 Identities=47% Similarity=0.758 Sum_probs=101.2 Q ss_pred EEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCC----------CCCCC----CCHHEEEEEHHHHHHHHHHHCC Q ss_conf 389999857799999999985048985156632556677----------76565----3002046518888744666189 Q gi|254780427|r 55 VELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASS----------DGCSS----LMLGDIVLAYEIIEIEANVLGK 120 (171) Q Consensus 55 ~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~----------~~~~~----~~LGDIvIs~~~i~~eA~~~~~ 120 (171) ++|||+||||++||+||++||++|+||||||||+.+... ....+ ..+||||||+++|.+||++||+ T Consensus 1 ~~~s~~f~D~~~i~~lN~~yR~~D~pTDVlSFp~~~~~~~~~~Ed~~~~~~~~~~def~~lGDivis~e~~~~~A~e~~~ 80 (123) T TIGR00043 1 IELSLLFVDDEEIQELNREYRDKDYPTDVLSFPYEEEEPFLEDEDEPDAGGFPEKDEFEVLGDIVISLEVAKEQAKEYGH 80 (123) T ss_pred CEEEEEEECCHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHHHHHCCC T ss_conf 91368875748899998997157877743205767678888776643446787310245431122046899999987289 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 8999999999998998638889988999999999999999868 Q gi|254780427|r 121 EFENHLVHLIIHGFLHLLGYDHVDDKDACVMEGLERSILEDLG 163 (171) Q Consensus 121 ~~~~e~~~l~iHG~LHLlGydH~~~~~~~~M~~~E~~iL~~l~ 163 (171) ++.+||+||+|||+||||||||++++++++|+.+|.+||+.+| T Consensus 81 ~~~~El~~l~vHG~LHLlGYDH~~~~e~~~M~~~e~~~L~~~g 123 (123) T TIGR00043 81 SFERELAHLTVHGLLHLLGYDHEEEDEEKEMFALEEEILALLG 123 (123) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999998568999881789999999999998509 No 5 >pfam02130 UPF0054 Uncharacterized protein family UPF0054. Probab=100.00 E-value=1.7e-40 Score=254.20 Aligned_cols=110 Identities=50% Similarity=0.774 Sum_probs=101.5 Q ss_pred CEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCC-CCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 4389999857799999999985048985156632556677765-653002046518888744666189899999999999 Q gi|254780427|r 54 IVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGC-SSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIH 132 (171) Q Consensus 54 ~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~-~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iH 132 (171) .++|+|.|||+++|++||++||++|+||||||||+++...... ...+||||+||+++|.+||.++++++.+|++++++| T Consensus 31 ~~~l~i~~~~~~~~~~LN~~~r~kd~~TDVLSFp~~~~~~~~~~~~~~lGdI~is~~~i~~~a~~~~~~~~~e~~~l~iH 110 (141) T pfam02130 31 DFELSILFVDDEEIRELNREYRGKDKPTDVLSFPMDEPEEGDEPLLLLLGDIVISPETAAREAKEYGHSLERELAHLLVH 110 (141) T ss_pred CEEEEEEEECHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCEEEEEEECHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 95999999888999999999958999875577535567655667776315899819999999999386399999999999 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 8998638889988999999999999999868 Q gi|254780427|r 133 GFLHLLGYDHVDDKDACVMEGLERSILEDLG 163 (171) Q Consensus 133 G~LHLlGydH~~~~~~~~M~~~E~~iL~~l~ 163 (171) |+|||+||||+++.++++|+.+|+++|+.+| T Consensus 111 G~LHLlGyDH~~~~~~~~M~~~E~~il~~~~ 141 (141) T pfam02130 111 GLLHLLGYDHEEDEEAEEMEALEEEILAKLG 141 (141) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 9999827888986899999999999999666 No 6 >pfam11350 DUF3152 Protein of unknown function (DUF3152). Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function. Probab=72.92 E-value=5.2 Score=20.28 Aligned_cols=97 Identities=18% Similarity=0.139 Sum_probs=64.2 Q ss_pred CCEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 74389999857799999999985048985156632556677765653002046518888744666189899999999999 Q gi|254780427|r 53 NIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIH 132 (171) Q Consensus 53 ~~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iH 132 (171) ....+.|.|++-....+|-. +-+ |+.- .....-+.++|...+=.+-|..++-.+..+=.+++=| T Consensus 81 g~~Df~i~Lasp~Tt~~lC~---gl~--t~~e-----------~SC~~~~~VviN~~RW~~ga~~f~gdl~~YR~y~INH 144 (189) T pfam11350 81 GAPDFRISLASPGTTRELCA---GLD--TGGE-----------VSCYNGDRVVINEARWVRGAPPFEGDLGSYRQYVINH 144 (189) T ss_pred CCCCEEEEECCCCCHHHCCC---CCC--CCCE-----------EEECCCCEEEEEHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 99988999778660743255---767--6877-----------8746999799878996536865365899999998703 Q ss_pred HHHHHCCCCCCCH----HHHHHHHHHHHHHHHHCC-----CCCCC Q ss_conf 8998638889988----999999999999999868-----88864 Q gi|254780427|r 133 GFLHLLGYDHVDD----KDACVMEGLERSILEDLG-----INDPY 168 (171) Q Consensus 133 G~LHLlGydH~~~----~~~~~M~~~E~~iL~~l~-----i~~PY 168 (171) -+=|-|||.|+.= .-|-.|. ++.+..-| .+||| T Consensus 145 EVGH~LG~~H~~Cp~~G~~APvMm---QQt~~l~gd~~~C~pN~W 186 (189) T pfam11350 145 EVGHAIGYGHEPCPKPGELAPVMM---QQTFSLSNDGATCRFNPW 186 (189) T ss_pred HHHHHCCCCCCCCCCCCCCCCHHH---HHHHCCCCCCCCEECCCC T ss_conf 232012557766998998372023---100135898661515889 No 7 >cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases. Probab=71.33 E-value=5.1 Score=20.35 Aligned_cols=44 Identities=20% Similarity=0.198 Sum_probs=33.7 Q ss_pred CCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 7656530020465188887446661898999999999998998638889988 Q gi|254780427|r 94 DGCSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDD 145 (171) Q Consensus 94 ~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~ 145 (171) +......++++.+....+.- ..+++..+++|=+-|-||..|.-. T Consensus 70 p~tgeI~~a~v~~~~~~~~~--------~~~~~~~v~~HEvGHtLGL~Hn~~ 113 (165) T cd04268 70 PLTGEILLARVYLYSSFVEY--------SGARLRNTAEHELGHALGLRHNFA 113 (165) T ss_pred CCCCEEEEEEEEEECHHHHH--------HHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 99841885578762039999--------999999999998763505754453 No 8 >cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=69.06 E-value=9.5 Score=18.76 Aligned_cols=67 Identities=18% Similarity=0.190 Sum_probs=49.8 Q ss_pred CCCCCEEEECCCCC---------CCCCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 89851566325566---------77765653002046518888744666189899999999999899863888998 Q gi|254780427|r 78 DKPTNVLSFPTAFA---------SSDGCSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVD 144 (171) Q Consensus 78 d~~TDVLSFP~~~~---------~~~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~ 144 (171) |--.||+.|-.... .++-++..+-+||.+-..++..-...+..-..+-+..+++|-+=|-||..|.= T Consensus 59 DiRyn~Irw~~s~~~~~ayGps~~dPrTGEIl~a~I~~~~~~~~~~~~~~~~~~~~~lr~~~aHEVGHtLGL~HNf 134 (197) T cd04276 59 DIRYNVIRWIHSPNGGWAYGPSVVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNF 134 (197) T ss_pred CCCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 7426689996379876666887579998667765788741776547999998999999999998764031653301 No 9 >COG5149 TOA1 Transcription initiation factor IIA, large chain [Transcription] Probab=68.03 E-value=3 Score=21.62 Aligned_cols=27 Identities=22% Similarity=0.299 Sum_probs=25.5 Q ss_pred ECHHHHHHHHHHHHHCCCCCCEEEECC Q ss_conf 577999999999850489851566325 Q gi|254780427|r 62 TDSHRIETLNFEYRGIDKPTNVLSFPT 88 (171) Q Consensus 62 ~dd~~i~~LN~~yr~kd~~TDVLSFP~ 88 (171) +||+..|+|-.-|..|=.+|||-+||- T Consensus 30 vdd~tlrelqnlwq~kl~~t~vatf~w 56 (293) T COG5149 30 VDDATLRELQNLWQSKLVATDVATFPW 56 (293) T ss_pred CCHHHHHHHHHHHHHHHHHHEEEECCH T ss_conf 648999999999987543202211126 No 10 >TIGR00232 tktlase_bact transketolase; InterPro: IPR005478 Transketolase (EC 2.2.1.1) (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. This group includes two proteins from the yeast Saccharomyces cerevisiae but excludes dihydroxyactetone synthases (formaldehyde transketolases) from various yeasts and the even more distant mammalian transketolases. Among the family of thiamine diphosphate-dependent enzymes that includes transketolases, dihydroxyacetone synthases, pyruvate dehydrogenase E1-beta subunits, and deoxyxylulose-5-phosphate synthases, mammalian and bacterial transketolases seem not to be orthologous. ; GO: 0004802 transketolase activity. Probab=66.67 E-value=3.2 Score=21.50 Aligned_cols=13 Identities=46% Similarity=1.035 Sum_probs=8.4 Q ss_pred HHHHHHHHHCCCC Q ss_conf 9999899863888 Q gi|254780427|r 129 LIIHGFLHLLGYD 141 (171) Q Consensus 129 l~iHG~LHLlGyd 141 (171) +++=++|||.||| T Consensus 65 mLlYSlLHL~GyD 77 (675) T TIGR00232 65 MLLYSLLHLTGYD 77 (675) T ss_pred HHHHHHHHHHCCC T ss_conf 9999999985036 No 11 >pfam06262 DUF1025 Domain of unknown function (DUF1025). Family of bacterial protein with undetermined function. Probab=64.24 E-value=6.9 Score=19.58 Aligned_cols=41 Identities=24% Similarity=0.218 Sum_probs=33.5 Q ss_pred EEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 046518888744666189899999999999899863888998 Q gi|254780427|r 103 DIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVD 144 (171) Q Consensus 103 DIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~ 144 (171) =|+|=-.-+.+.|.+... +.+.+.+.++|-+-|.+|+|+.. T Consensus 50 ~I~lyR~pi~~~~~~~~e-L~~~V~~tviHEiaH~fG~dd~~ 90 (96) T pfam06262 50 RITLYRRPILDEWRDREE-LGELVRHVVIHEIGHHFGLSDDD 90 (96) T ss_pred EEEEEHHHHHHHHCCHHH-HHHHHHHHHHHHHHHHCCCCHHH T ss_conf 998661689988489999-99999999999988770898889 No 12 >smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site. Probab=59.48 E-value=4.4 Score=20.67 Aligned_cols=17 Identities=29% Similarity=0.415 Sum_probs=15.4 Q ss_pred HHHHHHHHHCCCCCCCH Q ss_conf 99998998638889988 Q gi|254780427|r 129 LIIHGFLHLLGYDHVDD 145 (171) Q Consensus 129 l~iHG~LHLlGydH~~~ 145 (171) .++|=++|.||+.|+.. T Consensus 89 ~i~HEigHaLGl~He~~ 105 (140) T smart00235 89 VAAHELGHALGLYHEQS 105 (140) T ss_pred HHHHHHHHHCCCCCCCC T ss_conf 24553453426655457 No 13 >TIGR01120 rpiB ribose 5-phosphate isomerase B; InterPro: IPR004785 Ribose 5-phosphate isomerase (5.3.1.6 from EC) forms a homodimer and catalyses the interconversion of D-ribose 5-phosphate and D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway. This reaction permits the synthesis of ribose from other sugars, as well as the recycling of sugars from nucleotide breakdown. Two unrelated enzymes can catalyse this reaction: RpiA (found in most organisms) and RpiB (found in some bacteria and eukaryotes). RpiB is also involved in metabolism of the rare sugar, allose, in addition to ribose sugars. The structures of RpiA and RpiB are distinct, RpiB having a Rossmann-type alpha/beta/alpha sandwich topology . This entry represents the RpiB enzyme.; GO: 0004751 ribose-5-phosphate isomerase activity, 0006098 pentose-phosphate shunt. Probab=55.61 E-value=7.1 Score=19.49 Aligned_cols=38 Identities=16% Similarity=0.236 Sum_probs=29.2 Q ss_pred EEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCHHEEEEEHHHHHH Q ss_conf 999985779999999998504898515663255667776565300204651888874 Q gi|254780427|r 57 LSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEIIEI 113 (171) Q Consensus 57 isi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs~~~i~~ 113 (171) |-=.+|.|++|.+|-+++ -.||||.| |+=|+..+.+.. T Consensus 80 IRAalC~~~YmA~msRlH----NdaNVLcl---------------Gerv~G~elA~s 117 (143) T TIGR01120 80 IRAALCLDEYMAKMSRLH----NDANVLCL---------------GERVVGVELAKS 117 (143) T ss_pred CEEEECCCHHHHHHHHHH----HHHHHHHH---------------CCEEHHHHHHHH T ss_conf 056632652467863321----00220321---------------574015789999 No 14 >cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning. Probab=52.98 E-value=7.2 Score=19.46 Aligned_cols=17 Identities=24% Similarity=0.628 Sum_probs=15.1 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 99999899863888998 Q gi|254780427|r 128 HLIIHGFLHLLGYDHVD 144 (171) Q Consensus 128 ~l~iHG~LHLlGydH~~ 144 (171) -.++|=++|.||+-|+. T Consensus 89 G~i~HEl~HalGf~HEq 105 (200) T cd04281 89 GIVVHELGHVIGFWHEH 105 (200) T ss_pred CCHHHHHHHHHCCCCCC T ss_conf 30299998875465543 No 15 >cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease. Probab=50.86 E-value=9.7 Score=18.70 Aligned_cols=21 Identities=33% Similarity=0.444 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCH Q ss_conf 999999998998638889988 Q gi|254780427|r 125 HLVHLIIHGFLHLLGYDHVDD 145 (171) Q Consensus 125 e~~~l~iHG~LHLlGydH~~~ 145 (171) +....++|=+.|.||+.|+.. T Consensus 95 ~~~gti~HElgHaLGf~He~s 115 (167) T cd00203 95 EGAQTIAHELGHALGFYHDHD 115 (167) T ss_pred CCCCHHHHHHHHHHCCCCCCC T ss_conf 554248999999848875556 No 16 >cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development. Probab=50.72 E-value=8.9 Score=18.92 Aligned_cols=54 Identities=13% Similarity=0.252 Sum_probs=30.1 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHH--HHHHHHHHHHHHCCCCCCCH Q ss_conf 4898515663255667776565300204651888874466618989999--99999998998638889988 Q gi|254780427|r 77 IDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEIIEIEANVLGKEFENH--LVHLIIHGFLHLLGYDHVDD 145 (171) Q Consensus 77 kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e--~~~l~iHG~LHLlGydH~~~ 145 (171) ....+|-|.|-.. . | |...+-++.-.+..++... -.-.++|=++|.||+-|+.. T Consensus 38 r~~~~~yi~f~~~----~-------g----C~S~vG~~~g~q~isl~~~C~~~g~i~HEl~HalGf~HE~~ 93 (180) T cd04280 38 RTTEKDYIRIVKG----S-------G----CWSYVGRVGGRQVVSLGSGCFSLGTIVHELMHALGFYHEQS 93 (180) T ss_pred CCCCCCEEEEECC----C-------C----EEECCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCCCCCC T ss_conf 8998758999659----9-------7----13615701882368847886867633999998745655434 No 17 >cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion. Probab=49.67 E-value=8.9 Score=18.94 Aligned_cols=38 Identities=21% Similarity=0.376 Sum_probs=23.4 Q ss_pred EHHHHHHHHHHHCCCHHHH---HHHHHHHHHHHHCCCCCCC Q ss_conf 1888874466618989999---9999999899863888998 Q gi|254780427|r 107 AYEIIEIEANVLGKEFENH---LVHLIIHGFLHLLGYDHVD 144 (171) Q Consensus 107 s~~~i~~eA~~~~~~~~~e---~~~l~iHG~LHLlGydH~~ 144 (171) |+..|-++.-.+..++... ..-.++|=++|.||+-|+. T Consensus 55 C~S~vGr~gg~Q~isl~~~gC~~~g~i~HEl~HalGf~HEq 95 (182) T cd04283 55 CWSYIGRQGGRQTVSLQKQGCMYKGIIQHELLHALGFYHEQ 95 (182) T ss_pred EEECCCEECCEEEEEECCCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 85405710981248868999787672078888884555544 No 18 >cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells. Probab=48.96 E-value=10 Score=18.52 Aligned_cols=57 Identities=18% Similarity=0.342 Sum_probs=34.7 Q ss_pred HHHHCCCCCCEEEECCCCCCCCCCCCCCHHEEEEEHHHHHHHHHHHCCCHHHHH--HHHHHHHHHHHCCCCCCC Q ss_conf 985048985156632556677765653002046518888744666189899999--999999899863888998 Q gi|254780427|r 73 EYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEIIEIEANVLGKEFENHL--VHLIIHGFLHLLGYDHVD 144 (171) Q Consensus 73 ~yr~kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~--~~l~iHG~LHLlGydH~~ 144 (171) +|+-....+|-+.|-.. . | |+..|-++.-.+..++..-. .-.++|-++|.||+-|+. T Consensus 80 rF~~r~~e~~yi~~~~~----~-------G----C~S~VG~~gg~Q~vslg~gC~~~G~i~HEl~HaLGf~HEq 138 (230) T cd04282 80 DFKPYEGESNYIFFFKG----S-------G----CWSMVGDQQGGQNLSIGAGCDYKATVEHEFLHALGFYHEQ 138 (230) T ss_pred EEEECCCCCCEEEEECC----C-------C----CCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 14867898755999569----9-------5----1142265388056650788787766799999885566542 No 19 >cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=48.78 E-value=5.7 Score=20.06 Aligned_cols=23 Identities=26% Similarity=0.600 Sum_probs=19.5 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 99999999998998638889988 Q gi|254780427|r 123 ENHLVHLIIHGFLHLLGYDHVDD 145 (171) Q Consensus 123 ~~e~~~l~iHG~LHLlGydH~~~ 145 (171) ..++...++|-+.|.||+.|+.. T Consensus 89 ~~~~~~~~~he~~h~~g~~he~~ 111 (198) T cd04327 89 DPEFSRVVLHEFGHALGFIHEHQ 111 (198) T ss_pred CCCCCEEHHHHHHHHHCCCCCCC T ss_conf 53300100577676642600223 No 20 >pfam01400 Astacin Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family have an amino terminal propeptide which is cleaved to give the active protease domain. All other linked domains are found to the carboxyl terminus of this domain. This family includes: Astacin, a digestive enzyme from Crayfish. Meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain. Proteins involved in morphogenesis, and Tolloid from drosophila. Probab=46.58 E-value=9.5 Score=18.77 Aligned_cols=17 Identities=35% Similarity=0.620 Sum_probs=15.0 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 99999899863888998 Q gi|254780427|r 128 HLIIHGFLHLLGYDHVD 144 (171) Q Consensus 128 ~l~iHG~LHLlGydH~~ 144 (171) -.++|=++|.||+=|+. T Consensus 82 g~i~HEl~HaLGf~HEh 98 (192) T pfam01400 82 GIIVHELGHALGFWHEQ 98 (192) T ss_pred CCCHHHHHHHHCCCCCC T ss_conf 70178888884555544 No 21 >cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases). Probab=46.28 E-value=15 Score=17.58 Aligned_cols=108 Identities=16% Similarity=0.071 Sum_probs=53.8 Q ss_pred HHHHHHHHHHHHHHHHHHH-HCCCCCCEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCHHEEEEE Q ss_conf 5799999999999974322-014667438999985779999999998504898515663255667776565300204651 Q gi|254780427|r 29 RTLCEVVFAKAVSNLISKG-YFVKENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLA 107 (171) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs 107 (171) ......+|..|-...-... .........|.|.|..... ...-.-|++.+||.+..... ..-|+|-+. T Consensus 24 r~~i~~Af~~Ws~v~~l~F~ev~~~~~AdI~i~f~~~~h-----~d~~~fDG~gg~lahA~~p~-------~~~G~ihfd 91 (157) T cd04278 24 RRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNH-----GDGYPFDGPGGTLAHAFFPG-------GIGGDIHFD 91 (157) T ss_pred HHHHHHHHHHHHCCCCCCEEEECCCCCCCEEEEEECCCC-----CCCCCCCCCCCEEEEEECCC-------CCCCEEEEC T ss_conf 999999999986647855598158887878999866766-----89888578888689875688-------866418877 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 888874466618989999999999989986388899889999999 Q gi|254780427|r 108 YEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVME 152 (171) Q Consensus 108 ~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~~~~~~M~ 152 (171) .+--.. .. +-.-..-|..+++|-+=|.||-+|.... .-.|. T Consensus 92 ~~e~w~--~~-~~~~~~~l~~va~HEIGHaLGL~Hs~~~-~siM~ 132 (157) T cd04278 92 DDEQWT--LG-SDSGGTDLFSVAAHEIGHALGLGHSSDP-DSIMY 132 (157) T ss_pred CCCCCC--CC-CCCCCCCHHHHHHHHHHHHCCCCCCCCC-CCEEC T ss_conf 776621--26-7887601899999875654068888999-75333 No 22 >pfam01457 Peptidase_M8 Leishmanolysin. Probab=45.43 E-value=16 Score=17.53 Aligned_cols=41 Identities=17% Similarity=0.261 Sum_probs=31.2 Q ss_pred CCCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 565300204651888874466618989999999999989986388899 Q gi|254780427|r 96 CSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHV 143 (171) Q Consensus 96 ~~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~ 143 (171) ......|-|=|++..+.... .+...+.++|-+.|.|||+.. T Consensus 217 dgRP~vG~iN~~P~~i~~~~-------~~~~vrv~aHEiaHALGFs~~ 257 (569) T pfam01457 217 DGRPAVGVINIPAANITSRY-------DQLVTRVVTHEIAHALGFSVT 257 (569) T ss_pred CCCEEEEEEEECHHHCCCCC-------CCEEEEHHHHHHHHHHCCCHH T ss_conf 89805899841877764135-------541423898888888635898 No 23 >pfam00413 Peptidase_M10 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Probab=43.15 E-value=27 Score=16.15 Aligned_cols=107 Identities=19% Similarity=0.090 Sum_probs=51.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCHHEEEEEHH Q ss_conf 79999999999997432201466743899998577999999999850489851566325566777656530020465188 Q gi|254780427|r 30 TLCEVVFAKAVSNLISKGYFVKENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYE 109 (171) Q Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~eisi~l~dd~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~~~LGDIvIs~~ 109 (171) ..+..+|..|-...-............|.|.|.....- ..-.-|++-.||.+.+.. .. .-|||-+-.+ T Consensus 25 ~~i~~Af~~Ws~v~~l~F~ev~~~~adI~i~F~~~~h~-----d~~~fDg~gg~laha~~P---~~----~~G~ihfd~~ 92 (158) T pfam00413 25 RAIRRAFKVWSEVTPLTFTEVPEGTADIMIGFGRGEHG-----DGYPFDGPGGVLAHAFFP---GP----IGGDIHFDDD 92 (158) T ss_pred HHHHHHHHHHHHHCCCCEEECCCCCCCEEEEECCCCCC-----CCCCCCCCCCEEEECCCC---CC----CCCEEEECCC T ss_conf 99999999998636854797778987789994467679-----988867989867751269---86----5733896578 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH Q ss_conf 887446661898999999999998998638889988999999 Q gi|254780427|r 110 IIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVM 151 (171) Q Consensus 110 ~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~~~~~~M 151 (171) .-.. .....+....+..+++|-+=|.||-+|..... -.| T Consensus 93 e~w~--~~~~~~~g~~l~~va~HEIGHaLGL~Hs~~~~-siM 131 (158) T pfam00413 93 EQWT--VGNESPEGTNLFLVAAHEIGHALGLGHSSDPD-AIM 131 (158) T ss_pred CCCC--CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCC-CEE T ss_conf 6530--58987665115566545402002668899986-677 No 24 >smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function. Probab=42.04 E-value=28 Score=16.04 Aligned_cols=81 Identities=15% Similarity=0.171 Sum_probs=47.6 Q ss_pred HHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCC--HHEEEEEHHHHHHHHH-HHCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999850489851566325566777656530--0204651888874466-6189899999999999899863888 Q gi|254780427|r 65 HRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLM--LGDIVLAYEIIEIEAN-VLGKEFENHLVHLIIHGFLHLLGYD 141 (171) Q Consensus 65 ~~i~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~~~--LGDIvIs~~~i~~eA~-~~~~~~~~e~~~l~iHG~LHLlGyd 141 (171) +-+.++|.+|.+..-|-.=+.|-.-... -.+... -..|-+|+..+..... ....++..|++|+.+|- .=-|+- T Consensus 5 ~~~~~~~~~~F~~~f~~~~i~wn~Rl~t--taG~~~~~~~~I~ln~~ll~~~~~~~l~~tl~HElcH~~~~~--~~~~~~ 80 (146) T smart00731 5 QRLEDASLRVFGRKLPHPKVVWNKRLRK--TGGRCLLKSAEIRLNPKLLTENGRDRLRETLLHELCHAALYL--FGRGYG 80 (146) T ss_pred HHHHHHHHHHHCCCCCCCEEEECCHHHH--HHHHEECCCCEEEECHHHHHHCCHHHHHHHHHHHHHHHHHHH--HCCCCC T ss_conf 9999999999589899958998704630--678100569889978999976748899987899999999999--779898 Q ss_pred CCCHHHHHHH Q ss_conf 9988999999 Q gi|254780427|r 142 HVDDKDACVM 151 (171) Q Consensus 142 H~~~~~~~~M 151 (171) |. ++.+.+ T Consensus 81 Hg--~~fk~~ 88 (146) T smart00731 81 HG--DEWKRW 88 (146) T ss_pred CC--HHHHHH T ss_conf 78--999999 No 25 >pfam04450 BSP Plant Basic Secretory Protein. These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens. Probab=36.69 E-value=34 Score=15.55 Aligned_cols=62 Identities=16% Similarity=0.256 Sum_probs=44.7 Q ss_pred HEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 2046518888744666189899999999999899863888998899999999999999986888 Q gi|254780427|r 102 GDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVMEGLERSILEDLGIN 165 (171) Q Consensus 102 GDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~~~~~~M~~~E~~iL~~l~i~ 165 (171) -+|.+|+.++..-+. +.....|+.=++.|.+-|+.-|+-......-.-+.+-+.+=-+.|+. T Consensus 72 ~eIh~S~~yI~~~~~--~~d~~~Ei~Gvl~HE~~H~~Q~~~~g~~p~~liEGiAD~VRl~ag~~ 133 (200) T pfam04450 72 KEIHLSARYIEGYSA--EGDVKYEVTGVLYHELVHCWQWDGTGRAPGGLIEGIADYVRLKAGYA 133 (200) T ss_pred CEEEEEHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 279973788456666--67478999999998776642037988997358889999999974888 No 26 >KOG3714 consensus Probab=31.96 E-value=17 Score=17.30 Aligned_cols=40 Identities=18% Similarity=0.216 Sum_probs=28.5 Q ss_pred EEHHHHHHHHHH-HCCCHHHHHH--HHHHHHHHHHCCCCCCCH Q ss_conf 518888744666-1898999999--999998998638889988 Q gi|254780427|r 106 LAYEIIEIEANV-LGKEFENHLV--HLIIHGFLHLLGYDHVDD 145 (171) Q Consensus 106 Is~~~i~~eA~~-~~~~~~~e~~--~l~iHG~LHLlGydH~~~ 145 (171) -|+..|-+.... +..++-.... -.++|=++|.||+=|+.. T Consensus 136 gC~S~VGr~gg~~q~~sl~~~C~~~G~i~HEl~HaLGf~Hehs 178 (411) T KOG3714 136 GCYSYVGRRGGGQQLLSLGDGCDRFGTIVHELMHALGFWHEHS 178 (411) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEHHHHHHHHHHHCCCC T ss_conf 7313127778865321117874767610566776641310256 No 27 >KOG4584 consensus Probab=30.90 E-value=38 Score=15.29 Aligned_cols=24 Identities=29% Similarity=0.427 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999999999999868888644 Q gi|254780427|r 146 KDACVMEGLERSILEDLGINDPYE 169 (171) Q Consensus 146 ~~~~~M~~~E~~iL~~l~i~~PY~ 169 (171) .+...|.+.-+++|..+|+.+||+ T Consensus 84 ~~g~~Ll~lRE~~LrE~gF~Diy~ 107 (348) T KOG4584 84 PLGINLLRLREQILRELGFRDIYK 107 (348) T ss_pred CCHHHHHHHHHHHHHHHCCCCHHH T ss_conf 504789999999999708963888 No 28 >cd00770 SerRS_core Seryl-tRNA synthetase (SerRS) class II core catalytic domain. SerRS is responsible for the attachment of serine to the 3' OH group of ribose of the appropriate tRNA. This domain It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. SerRS synthetase is a homodimer. Probab=30.18 E-value=24 Score=16.39 Aligned_cols=51 Identities=18% Similarity=0.317 Sum_probs=31.6 Q ss_pred HCCCHHHHHHHH--HHHHHHHHCCCC----------CCCHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 189899999999--999899863888----------99889999999999999998688886447 Q gi|254780427|r 118 LGKEFENHLVHL--IIHGFLHLLGYD----------HVDDKDACVMEGLERSILEDLGINDPYEV 170 (171) Q Consensus 118 ~~~~~~~e~~~l--~iHG~LHLlGyd----------H~~~~~~~~M~~~E~~iL~~l~i~~PY~~ 170 (171) ++.-+..|-.-. =.||++.+-.|+ .+.+++.++|.....++|+.||+ ||++ T Consensus 137 ~s~cfR~Eags~G~d~~GL~RvhqF~kvE~~~~c~pe~s~~~~e~~~~~~e~~l~~L~l--pyrv 199 (297) T cd00770 137 YSPCFRKEAGSAGRDTRGLFRVHQFEKVEQFVFTKPEESWEELEELISNAEEILQELGL--PYRV 199 (297) T ss_pred CCCHHHHCCCCCCCCCCCEEEEEECCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC--CEEE T ss_conf 46341320377775555457887211003776168788999999999999999998155--0145 No 29 >PTZ00337 surface protease GP63; Provisional Probab=29.64 E-value=28 Score=16.09 Aligned_cols=40 Identities=18% Similarity=0.228 Sum_probs=29.9 Q ss_pred CCCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 6530020465188887446661898999999999998998638889988 Q gi|254780427|r 97 SSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDD 145 (171) Q Consensus 97 ~~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~~~ 145 (171) .....|=+=|.+.++.. .+...|.++|-+.|.|||+...- T Consensus 209 gRP~vGv~N~~P~~i~~---------~~~~vRvaaHEiAHALGFs~~~m 248 (567) T PTZ00337 209 GRPFAAAVNFDPRQIAV---------TNGYVRVAAHELGHALGFARDQF 248 (567) T ss_pred CCEEEEEEECCHHHCCC---------CCHHHHHHHHHHHHHHCCCHHHH T ss_conf 98258997118132234---------40157899999988863688898 No 30 >COG0021 TktA Transketolase [Carbohydrate transport and metabolism] Probab=29.20 E-value=46 Score=14.82 Aligned_cols=18 Identities=33% Similarity=0.329 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHH-CCCCCC Q ss_conf 999999999850-489851 Q gi|254780427|r 65 HRIETLNFEYRG-IDKPTN 82 (171) Q Consensus 65 ~~i~~LN~~yr~-kd~~TD 82 (171) +.|.+-=.+=+. .|+||= T Consensus 224 e~I~~Ai~~Ak~~~dkPtl 242 (663) T COG0021 224 EAIDKAIEEAKASTDKPTL 242 (663) T ss_pred HHHHHHHHHHHHCCCCCEE T ss_conf 9999999999746799738 No 31 >pfam00587 tRNA-synt_2b tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synthetases. Probab=28.53 E-value=44 Score=14.90 Aligned_cols=33 Identities=30% Similarity=0.612 Sum_probs=21.6 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 86388899889999999999999998688886447 Q gi|254780427|r 136 HLLGYDHVDDKDACVMEGLERSILEDLGINDPYEV 170 (171) Q Consensus 136 HLlGydH~~~~~~~~M~~~E~~iL~~l~i~~PY~~ 170 (171) |+.|-+.+....-.+|...-.++|+.+|+ ||+. T Consensus 113 ~~~~~~e~s~~~~~e~~~~~~~~~~~lgl--~~~i 145 (170) T pfam00587 113 EIFGTPEQSEEEDEELLKLAEEILQDLGL--PYRV 145 (170) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHHHCCC--CEEE T ss_conf 99967004299999999999999998799--6799 No 32 >pfam07759 DUF1615 Protein of unknown function (DUF1615). This is a family of proteins of unknown function expressed by various bacterial species. Some members of this family are thought to be lipoproteins. Another member of this family is thought to be involved in photosynthesis. Probab=24.84 E-value=55 Score=14.35 Aligned_cols=72 Identities=18% Similarity=0.225 Sum_probs=43.8 Q ss_pred HHHHHHHHHCCCCCCEEEECCCCCCCCCCCC-CCHHEEEEEHHHHHHHHHHHCCCHHHHHHHH-------HHHHHHHHCC Q ss_conf 9999998504898515663255667776565-3002046518888744666189899999999-------9998998638 Q gi|254780427|r 68 ETLNFEYRGIDKPTNVLSFPTAFASSDGCSS-LMLGDIVLAYEIIEIEANVLGKEFENHLVHL-------IIHGFLHLLG 139 (171) Q Consensus 68 ~~LN~~yr~kd~~TDVLSFP~~~~~~~~~~~-~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l-------~iHG~LHLlG 139 (171) ++|+.-|+..= .. .|.........++ .-+|-.=+|...++..++.+.......+.+. +-.|+-|||| T Consensus 98 kel~~ly~d~i---~~--~p~g~~l~g~~NPV~TgGpMQVSI~fAe~h~r~yp~~~~~siR~evftrrGG~yfG~a~LL~ 172 (318) T pfam07759 98 RDLSLLYDDFI---SA--VPLGKTLFGGLNPVTTGGPMQVSIAFAEQHAKGYPYPVDGSIRDEVFTRRGGVYFGIAHLLG 172 (318) T ss_pred HHHHHHHHHHH---HH--CCCHHHHHCCCCCCCCCCCCEEEHHHHHHCCCCCCCCCCCCHHHHHHHCCCCEEHHHHHHCC T ss_conf 89999999999---74--72413453267875547774654699986341589988640899987306760110798807 Q ss_pred CCCCC Q ss_conf 88998 Q gi|254780427|r 140 YDHVD 144 (171) Q Consensus 140 ydH~~ 144 (171) |-+.- T Consensus 173 Ypa~Y 177 (318) T pfam07759 173 YPADY 177 (318) T ss_pred CCCCC T ss_conf 98888 No 33 >KOG3618 consensus Probab=24.44 E-value=48 Score=14.68 Aligned_cols=24 Identities=29% Similarity=0.421 Sum_probs=17.1 Q ss_pred HHCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 618989999999999989986388 Q gi|254780427|r 117 VLGKEFENHLVHLIIHGFLHLLGY 140 (171) Q Consensus 117 ~~~~~~~~e~~~l~iHG~LHLlGy 140 (171) +.+-+-+.-+.+++.||+.||+|- T Consensus 221 ~aG~~~~~l~~r~~lH~~vHliGV 244 (1318) T KOG3618 221 GAGALHWELLSRGLLHGCVHLIGV 244 (1318) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 457244999999998621665556 No 34 >TIGR03006 pepcterm_polyde polysaccharide deactylase family protein, PEP-CTERM locus subfamily; InterPro: IPR014344 Members of this entry belong to the family of polysaccharide deacetylases. All are found in species that encode the PEP-CTERM/exosortase system and are predicted to act in protein sorting in a number of Gram-negative bacteria, they are also found near the gene coding region for the EpsH homologue that is the putative exosortase gene.. Probab=24.43 E-value=56 Score=14.30 Aligned_cols=55 Identities=24% Similarity=0.305 Sum_probs=44.1 Q ss_pred CCCHHEEEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCC-----CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5300204651888874466618989999999999989986388899-----8899999999999999986888 Q gi|254780427|r 98 SLMLGDIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHV-----DDKDACVMEGLERSILEDLGIN 165 (171) Q Consensus 98 ~~~LGDIvIs~~~i~~eA~~~~~~~~~e~~~l~iHG~LHLlGydH~-----~~~~~~~M~~~E~~iL~~l~i~ 165 (171) ...||=|+==+|.+.+.-.+.||-+.-| ||+|+ +.+++..=-.+=.++|+.|+-. T Consensus 52 FFtLGWvAERyP~LVr~Iv~~GHElASH-------------Gy~H~Rv~~qtp~~Fr~Di~rsK~lLEDl~G~ 111 (274) T TIGR03006 52 FFTLGWVAERYPELVRRIVAAGHELASH-------------GYGHERVTTQTPEEFRADIRRSKKLLEDLSGQ 111 (274) T ss_pred EEECCHHHHCCHHHHHHHHHCCCEEEEC-------------CCHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 4240347540528899999719826532-------------41035554089879999999999997257886 No 35 >PRK05431 seryl-tRNA synthetase; Provisional Probab=21.47 E-value=64 Score=13.96 Aligned_cols=32 Identities=25% Similarity=0.490 Sum_probs=22.9 Q ss_pred HCCCCCCC--HHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 63888998--89999999999999998688886447 Q gi|254780427|r 137 LLGYDHVD--DKDACVMEGLERSILEDLGINDPYEV 170 (171) Q Consensus 137 LlGydH~~--~~~~~~M~~~E~~iL~~l~i~~PY~~ 170 (171) +..|-|.+ .+..+.|...+.++|..||+ ||++ T Consensus 284 ~f~~~~pe~S~~~~e~~~~~~e~i~~~L~l--pyrv 317 (422) T PRK05431 284 LVKFTKPEDSYAELEELTGNAEEILQKLEL--PYRV 317 (422) T ss_pred EEEECCHHHHHHHHHHHHHHHHHHHHHCCC--CCCC T ss_conf 335358677999999999999999987598--5023 No 36 >PRK00912 ribonuclease P protein component 3; Provisional Probab=20.39 E-value=68 Score=13.82 Aligned_cols=37 Identities=14% Similarity=0.237 Sum_probs=27.5 Q ss_pred EEEEEEECHHHHHHHHHHHHH-------------------CCCCCCEEEECCCCCC Q ss_conf 899998577999999999850-------------------4898515663255667 Q gi|254780427|r 56 ELSLVFTDSHRIETLNFEYRG-------------------IDKPTNVLSFPTAFAS 92 (171) Q Consensus 56 eisi~l~dd~~i~~LN~~yr~-------------------kd~~TDVLSFP~~~~~ 92 (171) -+.|...|...++.|=.+||+ .+---||||.|..... T Consensus 63 gvei~~~dp~~~~~l~~k~r~~~dIv~V~g~~~~~nr~A~e~~~VDIL~~p~~~r~ 118 (237) T PRK00912 63 GVEIVEENPSKLRGLVGKFRKKVDVLIVHGGDEKVNRAACENPRVDILNHPYLNRK 118 (237) T ss_pred EEEEEECCHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCC T ss_conf 67997249788778899885248789994787288999872689878846543468 No 37 >pfam08328 ASL_C Adenylosuccinate lyase C-terminal. This domain is found at the C-terminus of adenylosuccinate lyase(ASL; PurB in E. coli). It has been identified in bacteria, eukaryotes and archaea and is found together with the lyase domain pfam00206. ASL catalyses the cleavage of succinylaminoimidazole carboxamide ribotide to aminoimidazole carboxamide ribotide and fumarate and the cleavage of adenylosuccinate to adenylate and fumarate. Probab=20.21 E-value=43 Score=14.96 Aligned_cols=13 Identities=38% Similarity=0.984 Sum_probs=8.0 Q ss_pred HHHHHCCCCCCCC Q ss_conf 9999868888644 Q gi|254780427|r 157 SILEDLGINDPYE 169 (171) Q Consensus 157 ~iL~~l~i~~PY~ 169 (171) -+|.+.|++|||+ T Consensus 61 tiLR~~g~~~~YE 73 (115) T pfam08328 61 TVMRRYGIPNPYE 73 (115) T ss_pred HHHHHCCCCCHHH T ss_conf 9999819998999 Done!