Query gi|254780146|ref|YP_003064559.1| 50S ribosomal protein L1 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 232 No_of_seqs 118 out of 1923 Neff 6.3 Searched_HMMs 39220 Date Sun May 22 23:02:26 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780146.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01169 rplA_bact ribosomal 100.0 0 0 637.0 21.8 227 4-230 1-227 (227) 2 PRK05424 rplA 50S ribosomal pr 100.0 0 0 541.2 24.4 231 2-232 1-231 (232) 3 CHL00129 rpl1 ribosomal protei 100.0 0 0 535.3 24.2 228 2-230 1-228 (229) 4 COG0081 RplA Ribosomal protein 100.0 0 0 501.9 21.8 227 2-228 1-228 (228) 5 pfam00687 Ribosomal_L1 Ribosom 100.0 0 0 451.1 22.2 202 21-222 1-203 (203) 6 PRK04203 rpl1P 50S ribosomal p 100.0 0 0 432.0 20.3 200 23-227 7-216 (216) 7 PTZ00029 60S ribosomal protein 100.0 0 0 380.1 19.2 198 23-225 8-215 (217) 8 PTZ00225 60S ribosomal protein 100.0 0 0 362.1 19.2 198 22-225 7-213 (214) 9 cd00403 Ribosomal_L1 Ribosomal 100.0 0 0 350.0 21.0 200 24-225 1-208 (208) 10 KOG1569 consensus 100.0 0 0 331.5 19.0 210 18-230 86-308 (323) 11 TIGR01170 rplA_mito ribosomal 100.0 7.7E-32 2E-36 217.9 8.2 139 22-160 1-155 (155) 12 KOG1570 consensus 99.9 1.4E-25 3.6E-30 179.1 10.6 197 22-225 8-217 (218) 13 KOG1685 consensus 99.0 1.5E-09 3.9E-14 79.7 7.4 188 34-230 47-263 (343) 14 PTZ00029 60S ribosomal protein 93.5 0.043 1.1E-06 33.5 1.9 26 123-149 114-139 (217) 15 COG1064 AdhP Zn-dependent alco 83.5 2 5.1E-05 23.1 4.3 60 74-137 191-254 (339) 16 PRK13958 N-(5'-phosphoribosyl) 68.7 1.9 4.9E-05 23.2 0.7 22 75-96 55-79 (207) 17 PRK09427 bifunctional indole-3 63.6 2.4 6.2E-05 22.6 0.5 40 52-97 244-283 (459) 18 cd04735 OYE_like_4_FMN Old yel 63.1 8.2 0.00021 19.3 3.1 14 83-96 148-161 (353) 19 COG0135 TrpF Phosphoribosylant 59.3 3.5 9E-05 21.6 0.7 18 140-157 157-175 (208) 20 TIGR02817 adh_fam_1 zinc-bindi 59.0 15 0.00038 17.7 4.8 67 73-139 176-246 (338) 21 PRK01222 N-(5'-phosphoribosyl) 58.6 3.8 9.6E-05 21.4 0.7 34 74-107 56-97 (212) 22 pfam00107 ADH_zinc_N Zinc-bind 57.1 16 0.00041 17.5 4.3 65 74-138 16-86 (131) 23 KOG1588 consensus 51.3 4.1 0.00011 21.1 -0.0 37 54-90 102-145 (259) 24 PRK10222 L-ascorbate-specific 48.7 14 0.00037 17.8 2.4 65 89-157 12-79 (85) 25 cd00405 PRAI Phosphoribosylant 46.5 7.2 0.00018 19.6 0.6 24 73-96 51-77 (203) 26 pfam00697 PRAI N-(5'phosphorib 46.4 7.1 0.00018 19.7 0.5 22 75-96 51-75 (195) 27 PRK13523 NADPH dehydrogenase N 44.3 17 0.00045 17.3 2.3 14 83-96 146-159 (337) 28 cd02803 OYE_like_FMN_family Ol 42.6 27 0.00069 16.1 3.0 13 84-96 146-158 (327) 29 cd05564 PTS_IIB_chitobiose_lic 42.5 27 0.0007 16.0 3.6 67 75-145 1-84 (96) 30 KOG1198 consensus 42.4 27 0.0007 16.0 3.8 63 65-128 176-245 (347) 31 PRK13803 bifunctional phosphor 40.9 10 0.00027 18.6 0.7 23 74-96 59-84 (611) 32 cd04734 OYE_like_3_FMN Old yel 40.7 29 0.00074 15.9 3.8 14 83-96 145-158 (343) 33 TIGR00789 flhB_rel FlhB domain 40.4 29 0.00075 15.8 3.5 52 60-111 2-59 (84) 34 COG4213 XylF ABC-type xylose t 35.3 30 0.00076 15.8 2.3 33 72-104 80-118 (341) 35 TIGR02922 TIGR02922 conserved 33.4 30 0.00076 15.8 2.0 25 60-84 33-57 (69) 36 TIGR01163 rpe ribulose-phospha 32.6 39 0.001 15.1 2.7 85 18-107 89-214 (216) 37 PRK10754 quinone oxidoreductas 30.8 42 0.0011 14.9 3.5 63 75-138 167-235 (327) 38 COG1061 SSL2 DNA or RNA helica 30.5 42 0.0011 14.9 2.5 65 60-124 56-135 (442) 39 TIGR00381 cdhD CO dehydrogenas 29.9 28 0.00071 16.0 1.4 113 23-141 145-274 (401) 40 PRK05347 glutaminyl-tRNA synth 28.7 46 0.0012 14.7 4.8 55 20-75 149-209 (556) 41 COG3259 FrhA Coenzyme F420-red 27.1 27 0.00068 16.1 0.9 103 44-149 18-170 (441) 42 TIGR03152 cyto_c552_HCOOH form 26.9 41 0.001 15.0 1.8 33 88-120 320-352 (439) 43 pfam10302 DUF2407 Putative mem 26.6 47 0.0012 14.6 2.1 25 110-134 120-148 (252) 44 cd04733 OYE_like_2_FMN Old yel 26.2 51 0.0013 14.4 3.0 14 83-96 153-166 (338) 45 cd01820 PAF_acetylesterase_lik 25.8 51 0.0013 14.3 7.1 63 143-205 66-130 (214) 46 TIGR01182 eda 2-dehydro-3-deox 25.6 52 0.0013 14.3 2.5 79 22-110 17-103 (205) 47 TIGR00838 argH argininosuccina 25.4 37 0.00095 15.2 1.4 21 137-159 101-121 (469) 48 PRK10537 voltage-gated potassi 25.2 53 0.0013 14.3 3.0 26 71-96 291-317 (356) 49 pfam08899 DUF1844 Domain of un 25.1 40 0.001 15.0 1.5 50 135-194 19-70 (74) 50 PRK01189 V-type ATP synthase s 25.0 50 0.0013 14.4 2.0 58 75-132 4-67 (104) 51 TIGR01242 26Sp45 26S proteasom 24.3 49 0.0013 14.5 1.8 10 116-125 142-151 (364) 52 pfam05918 API5 Apoptosis inhib 23.8 56 0.0014 14.1 2.9 22 185-207 432-453 (543) 53 COG4844 Uncharacterized protei 23.5 57 0.0015 14.1 2.1 17 180-196 58-74 (78) 54 pfam07021 MetW Methionine bios 22.9 59 0.0015 14.0 3.0 40 81-121 45-85 (193) 55 PRK08306 dipicolinate synthase 22.5 59 0.0015 14.0 4.4 42 75-120 177-219 (296) 56 PRK06512 thiamine-phosphate py 22.2 60 0.0015 13.9 2.1 29 70-98 107-137 (221) 57 TIGR00603 rad25 DNA repair hel 22.1 61 0.0015 13.9 3.8 27 72-98 512-540 (756) 58 pfam10740 DUF2529 Protein of u 22.1 31 0.00078 15.7 0.4 125 1-141 1-144 (172) 59 PRK05396 tdh L-threonine 3-deh 21.9 61 0.0016 13.9 4.5 50 75-124 190-248 (341) 60 PRK09310 aroDE bifunctional 3- 21.8 62 0.0016 13.9 3.1 52 122-178 164-227 (477) 61 COG2089 SpsE Sialic acid synth 21.7 62 0.0016 13.9 2.2 12 23-34 28-39 (347) 62 TIGR02407 ectoine_ectB diamino 21.3 57 0.0015 14.1 1.7 46 23-68 111-167 (413) 63 KOG1500 consensus 21.0 64 0.0016 13.8 3.1 30 111-140 243-281 (517) 64 TIGR01452 PGP_euk phosphoglyco 20.8 64 0.0016 13.7 1.9 38 65-102 81-118 (288) 65 TIGR01211 ELP3 histone acetylt 20.7 48 0.0012 14.5 1.2 31 53-84 57-96 (573) 66 TIGR02437 FadB fatty oxidation 20.6 38 0.00097 15.1 0.7 137 59-200 148-314 (716) 67 TIGR00004 TIGR00004 endoribonu 20.6 65 0.0017 13.7 2.3 42 157-200 27-68 (129) 68 pfam01262 AlaDh_PNT_C Alanine 20.4 66 0.0017 13.7 4.6 109 60-175 7-131 (150) 69 TIGR00730 TIGR00730 conserved 20.3 52 0.0013 14.3 1.3 58 75-142 2-69 (205) 70 cd02395 SF1_like-KH Splicing f 20.2 13 0.00033 18.0 -1.8 14 123-136 12-25 (120) 71 COG0541 Ffh Signal recognition 20.1 31 0.0008 15.7 0.1 62 75-140 292-354 (451) No 1 >TIGR01169 rplA_bact ribosomal protein L1; InterPro: IPR005878 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L1 is the largest protein from the large ribosomal subunit. In Escherichia coli, L1 is known to bind to the 23S rRNA. This model describe s bacterial and chloroplast ribosomal protein L1. Most mitochondrial L1 sequences are sufficiently divergent to be the contained in a different entry (IPR005879 from INTERPRO).; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=100.00 E-value=0 Score=637.00 Aligned_cols=227 Identities=62% Similarity=0.956 Sum_probs=225.2 Q ss_pred CCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHH Q ss_conf 22343798985138476889899999998608789986299999953787645443013799527788654069997504 Q gi|254780146|r 4 KISKRMQQISQGIDRSALYGLSDAVVMLKERATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSS 83 (232) Q Consensus 4 k~~Kr~~~~~~~~~~~~~ysi~eAi~~lk~~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~ 83 (232) |+||||+++.+.+|+++.|+|+||+++||+.+.+|||||||++|+||+||+|+||+|||+|.||||+||..||+|||.|+ T Consensus 1 K~~Kr~~~a~~~vd~~K~Y~~~eAi~LlK~~~~aKFDeTVeva~~LG~DprKsDQ~vRG~vvLP~GtGK~~RVaVfa~G~ 80 (227) T TIGR01169 1 KLSKRYRAAREKVDRNKLYSLDEAIALLKETATAKFDETVEVAIRLGIDPRKSDQQVRGTVVLPHGTGKTVRVAVFAKGE 80 (227) T ss_pred CCHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCH T ss_conf 94178899998510436889889999997326554334178888747588537830511386576778625899971643 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 45768741200002789999975175333468864467788886433200100174223356665579999998601110 Q gi|254780146|r 84 KADEAREAGADIVGGEDLFEIVKGGQIDFDRCIATPDMMPLVGRLGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVD 163 (232) Q Consensus 84 ~~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~klgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~ 163 (232) .+++|++|||||||++||||+|++||+|||.+|||||||+.|+||||+|||||||||||.||||.|+.++|.++|+|+|+ T Consensus 81 ~~~eA~~AGAD~VG~~DLie~Ik~G~~dFDV~IATPDmM~~VGkLG~iLGPRGLMPNPK~GTVT~dva~Av~~~K~G~ve 160 (227) T TIGR01169 81 KAKEAKAAGADYVGSDDLIEKIKKGWLDFDVVIATPDMMRKVGKLGRILGPRGLMPNPKTGTVTADVAKAVKEAKKGRVE 160 (227) T ss_pred HHHHHHHCCCEEECCHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCEE T ss_conf 48889870980444887999995589850258827577688763310047888475359665224379999987279504 Q ss_pred CCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEECHHC Q ss_conf 2235752678753157789899999999999999860866254433779999923897277400100 Q gi|254780146|r 164 FRSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDLSSF 230 (232) Q Consensus 164 ~r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~~~~ 230 (232) ||+||.|+||++||+.||++++|.||+.+++++|.+.||+.+||+||+++||||||||||.||++++ T Consensus 161 fR~DK~G~iH~~~GK~SF~~~~L~eN~~Al~~~i~k~KP~~~KG~Yik~~~lSsTMGPgi~vd~~~~ 227 (227) T TIGR01169 161 FRADKAGNIHAPIGKVSFDEEKLKENLEALLDAIKKAKPSGAKGQYIKNIALSSTMGPGIKVDLSSL 227 (227) T ss_pred EEECCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEECCCCCEEEECCCC T ss_conf 6515851587775266688899999999999999850883012307888898404887347421669 No 2 >PRK05424 rplA 50S ribosomal protein L1; Validated Probab=100.00 E-value=0 Score=541.23 Aligned_cols=231 Identities=60% Similarity=0.967 Sum_probs=228.3 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEEC Q ss_conf 74223437989851384768898999999986087899862999999537876454430137995277886540699975 Q gi|254780146|r 2 VSKISKRMQQISQGIDRSALYGLSDAVVMLKERATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFAT 81 (232) Q Consensus 2 m~k~~Kr~~~~~~~~~~~~~ysi~eAi~~lk~~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~ 81 (232) |+|+||||+++.+.+|+.+.|+++||++++|+.+.+||+||||++++||+|++|++|+|||+|.||||+|++.||||||+ T Consensus 1 m~K~~Kr~~~~~~~~~~~k~y~l~eAi~~lk~~~~~kF~esve~~i~Lnid~kk~dq~irg~v~LPhg~gk~~kV~Vfa~ 80 (232) T PRK05424 1 MAKLSKRYRAAAEKVDRTKLYSLEEAIALVKETATAKFDETVEVAVNLGVDPRKADQQVRGAVVLPHGTGKTVRVAVFAK 80 (232) T ss_pred CCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCCCCCEEEEEECC T ss_conf 99536899999861653566799999999997065799988899999767876677614579989988998638999838 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCC Q ss_conf 04457687412000027899999751753334688644677888864332001001742233566655799999986011 Q gi|254780146|r 82 SSKADEAREAGADIVGGEDLFEIVKGGQIDFDRCIATPDMMPLVGRLGRILGPRGIMPNLRVGTVTTDVATAVRESKSGA 161 (232) Q Consensus 82 ~~~~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~klgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~ 161 (232) ++.+++|++||||+||++|||++|++||++||+|||||+|||+|++|||+|||||||||||.|||++|+.++|+++++|+ T Consensus 81 ~~~~~~Ak~aGAd~vG~~dli~~Ik~g~~~fD~~iAtp~~m~~l~klgkiLGPrGLMPnpK~GTVt~di~~~I~~~k~G~ 160 (232) T PRK05424 81 GEKAEEAKAAGADIVGAEDLIEKIKKGWLDFDVVIATPDMMAVVGKLGKVLGPRGLMPNPKTGTVTPDVAKAVKEAKAGK 160 (232) T ss_pred CHHHHHHHHCCCCEECHHHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 24389999869988751878999866864477777368899999998763074567999998844627999999986893 Q ss_pred CCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEECHHCCC Q ss_conf 10223575267875315778989999999999999986086625443377999992389727740010059 Q gi|254780146|r 162 VDFRSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDLSSFSV 232 (232) Q Consensus 162 i~~r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~~~~~~ 232 (232) ++||+|+.|+||++||+++|+++||.||+.+++++|.+++|.+.||.||+++|||||||||++||+++|+. T Consensus 161 v~~r~dk~g~i~~~IGk~sf~~e~l~eNi~a~l~~i~~~kp~~~kg~~ik~i~lssTMGp~i~i~~~~l~~ 231 (232) T PRK05424 161 VEFRVDKAGIIHAPIGKVSFDAEKLKENLKALLDAIKKAKPATAKGTYIKKVSLSSTMGPGVKVDPASLSA 231 (232) T ss_pred EEEEECCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCCEEECHHHHHC T ss_conf 79998788859998707869989999999999999998098655776587899989999998975788513 No 3 >CHL00129 rpl1 ribosomal protein L1; Reviewed Probab=100.00 E-value=0 Score=535.27 Aligned_cols=228 Identities=45% Similarity=0.768 Sum_probs=225.4 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEEC Q ss_conf 74223437989851384768898999999986087899862999999537876454430137995277886540699975 Q gi|254780146|r 2 VSKISKRMQQISQGIDRSALYGLSDAVVMLKERATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFAT 81 (232) Q Consensus 2 m~k~~Kr~~~~~~~~~~~~~ysi~eAi~~lk~~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~ 81 (232) |+|+||||+++.+.++ ++.|++.||++++|+.+.+||+||||++|+||+|+++++|+|||+|.||||+|++.||||||+ T Consensus 1 m~k~~k~~~~~~~~~~-~k~y~~~eAi~~lk~~~~~kF~Esvdv~i~L~iD~k~~dq~irg~v~LPhg~gK~~kV~Vfa~ 79 (229) T CHL00129 1 MKKRSRRLKELLSKVE-PKLYSPIEAINLLKETATAKFVETAEAHIRLNIDPKYADQQLRTTVVLPKGTGKTVRIAVITN 79 (229) T ss_pred CCCCCHHHHHHHHHCC-CCCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCCCCCEEEEEEEC T ss_conf 9962589999997511-565799999999986276898875899999688987667602578989999998459999918 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCC Q ss_conf 04457687412000027899999751753334688644677888864332001001742233566655799999986011 Q gi|254780146|r 82 SSKADEAREAGADIVGGEDLFEIVKGGQIDFDRCIATPDMMPLVGRLGRILGPRGIMPNLRVGTVTTDVATAVRESKSGA 161 (232) Q Consensus 82 ~~~~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~klgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~ 161 (232) ++.+++|++|||++||++|||++|++||++||+|||||+|||+|++|||+||||||||||+.|||++|+.++|+++++|+ T Consensus 80 g~~~~~Ak~aGAd~vG~~eli~~i~~g~~~fD~~iAtp~~M~~l~kLgriLGPrGlMPnPk~GTvt~di~~av~~~K~g~ 159 (229) T CHL00129 80 GEKINEAKNAGADIVGSEDLIDEIAKGRLDFDLLIATPDMMPKIAKLGRVLGPRGLMPSPKSGTVTTDLAEAINEFKKGK 159 (229) T ss_pred CHHHHHHHHCCCCEECCHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCE T ss_conf 13699998769959864878788757984454687661789999998575074457999999976606999999986890 Q ss_pred CCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEECHHC Q ss_conf 102235752678753157789899999999999999860866254433779999923897277400100 Q gi|254780146|r 162 VDFRSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDLSSF 230 (232) Q Consensus 162 i~~r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~~~~ 230 (232) ++||+|+.|+||++||+++|+++||.||+.+++++|.+++|.+.||.||+++|||||||||++||+++| T Consensus 160 v~~r~dk~g~ih~~IGk~sf~~e~L~eNi~a~~~~i~~~kp~~~kg~~ik~i~issTMGp~i~id~~~l 228 (229) T CHL00129 160 LEYRADKTGIVHVNFGKSNFSEEDLLENLKALQESIDQNRPSGVKGKYWKSFYICSTMGPSIQLDINSL 228 (229) T ss_pred EEEEECCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCCEEEEHHHC T ss_conf 899867888688876277589899999999999999973987658755978999899899889505555 No 4 >COG0081 RplA Ribosomal protein L1 [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=501.86 Aligned_cols=227 Identities=60% Similarity=0.960 Sum_probs=224.2 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEEC Q ss_conf 74223437989851384768898999999986087899862999999537876454430137995277886540699975 Q gi|254780146|r 2 VSKISKRMQQISQGIDRSALYGLSDAVVMLKERATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFAT 81 (232) Q Consensus 2 m~k~~Kr~~~~~~~~~~~~~ysi~eAi~~lk~~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~ 81 (232) |+|.+||++.+.+..|..+.|++.||++++++.+.++|+||||++++|++|++|++|+|||+|.||||+|+..||||||+ T Consensus 1 m~k~~k~~~~~~~~~d~~~~~~i~eai~~~ke~~~~kF~etVevav~L~vD~~k~dq~vrg~VvLP~g~gk~vrV~Vfa~ 80 (228) T COG0081 1 MAKKSKRYRAASEKVDRNKLYSLEEAVKLLKETSKRKFDETVEVAVNLKVDPRKPDQRVRGSVVLPNGTGKTVRVAVFAD 80 (228) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEECCCCCCCCEEEEEECC T ss_conf 95067889999987737755449999999874052575358999997155888654420202878899998407999817 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHCC-CCCCCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 044576874120000278999997517-5333468864467788886433200100174223356665579999998601 Q gi|254780146|r 82 SSKADEAREAGADIVGGEDLFEIVKGG-QIDFDRCIATPDMMPLVGRLGRILGPRGIMPNLRVGTVTTDVATAVRESKSG 160 (232) Q Consensus 82 ~~~~~~Ak~aGa~~vg~~eli~~i~~~-~~~fd~~iAt~~~m~~i~klgriLGPkglMPn~k~GTv~~di~~~i~~~k~g 160 (232) ++.+++|++|||||||++||+++|+++ |.+||+|||+||||+.+++||++||||||||||+.||+++|+.++|+++|+| T Consensus 81 g~~~~~A~~AGad~Vg~edl~e~ik~~r~~~fD~~IAtpd~M~~v~~LG~vLGPRGlMPnPk~gTvt~Dv~~av~~~K~g 160 (228) T COG0081 81 GEKAEEAKAAGADYVGGEDLIELIKNGRAKDFDVFIATPDMMPLVGKLGKVLGPRGLMPNPKTGTVTDDVAKAVEELKKG 160 (228) T ss_pred HHHHHHHHHCCCCEECHHHHHHHHHCCCHHHCCEEEECCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCC T ss_conf 47699998759987318999999717750006789977048899999855417787899987788776899999997567 Q ss_pred CCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEECH Q ss_conf 11022357526787531577898999999999999998608662544337799999238972774001 Q gi|254780146|r 161 AVDFRSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDLS 228 (232) Q Consensus 161 ~i~~r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~~ 228 (232) +++||.|+.|++|++||+++|++++|.||+.++++.|.+.+|+++||+||+++|+|||||||++|+.. T Consensus 161 ~v~~R~dk~g~ih~~iGk~sf~~e~l~eNi~a~l~~i~~~~p~~~kg~~ik~v~vstTMGP~v~v~~~ 228 (228) T COG0081 161 TVEFRADKAGVIHVPIGKVSFDDEKLAENIEALLNAIVKAKPAGAKGQYIKSVYVSTTMGPGVKVDLA 228 (228) T ss_pred CEEEEECCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCCEEECCC T ss_conf 47999779865888752777999999999999999999707677665158679996676897883069 No 5 >pfam00687 Ribosomal_L1 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10. Probab=100.00 E-value=0 Score=451.06 Aligned_cols=202 Identities=61% Similarity=0.983 Sum_probs=198.1 Q ss_pred CCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCHHH Q ss_conf 88989999999860878998629999995378764544301379952778865406999750445768741200002789 Q gi|254780146|r 21 LYGLSDAVVMLKERATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIVGGED 100 (232) Q Consensus 21 ~ysi~eAi~~lk~~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~~e 100 (232) .|+++||++++|+.+.++|+||||++|+|++|++|++++|||+|.|||++|++.||||||+++.+++|+++||++||++| T Consensus 1 ~~~i~eA~~~lk~~~~~kF~esvel~v~l~id~kk~~~~ir~~v~LP~~~gk~~kV~Vf~~~~~~~~Ak~~Ga~~vg~~e 80 (203) T pfam00687 1 LYAIKEALELLKETATRKFDETVELAINLGVDPKKADQRVRGTVVLPHGTGKTVKVAVFAKGEKAKEAKAAGADYVGGDE 80 (203) T ss_pred CCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCCCCCEEEEEECCHHHHHHHHHCCCCEECCHH T ss_conf 98989999999865757988758999995888877764168999867788985289997476779998765998975799 Q ss_pred HHHHHH-CCCCCCCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEECCC Q ss_conf 999975-1753334688644677888864332001001742233566655799999986011102235752678753157 Q gi|254780146|r 101 LFEIVK-GGQIDFDRCIATPDMMPLVGRLGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDFRSEKAGIIHAGIGKV 179 (232) Q Consensus 101 li~~i~-~~~~~fd~~iAt~~~m~~i~klgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~r~~k~g~i~~~IG~~ 179 (232) ++++|+ ++|++||+|||||+||++|++|||+|||||||||||.|||++|+.++|+++++|+++||+++.|++|++||++ T Consensus 81 li~ki~~~~~~~fd~~ia~~~~m~~l~klg~iLGprGlMP~pk~GTv~~di~~~i~~~k~g~v~~r~~k~~~i~~~IG~~ 160 (203) T pfam00687 81 LLEKIKKDGWLDFDVFIATPDMMPQVGKLGRILGPRGLMPNPKTGTVTPDVAKAVKELKSGSVEFRADKAGIIHVPVGKV 160 (203) T ss_pred HHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCEEEEEEECC T ss_conf 99999747875566899858999999988776063557898877877736899999996796289757887799986167 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCC Q ss_conf 7898999999999999998608662544337799999238972 Q gi|254780146|r 180 SFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCG 222 (232) Q Consensus 180 ~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~ 222 (232) +|+++||.||+.+++++|.+++|.+.||.||+++||||||||| T Consensus 161 ~m~~e~i~eNi~a~~~~i~~~~p~~~kg~~ik~v~lksTMGP~ 203 (203) T pfam00687 161 SFDDEKLAENIKAVLNALVKLKPSLKKGWYIKSVYIKSTMGPG 203 (203) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCC T ss_conf 6998999999999999999819653466668579997887987 No 6 >PRK04203 rpl1P 50S ribosomal protein L1P; Reviewed Probab=100.00 E-value=0 Score=431.98 Aligned_cols=200 Identities=31% Similarity=0.516 Sum_probs=187.7 Q ss_pred CHHHHHHH-HHHCCCCCCCCCEEEEEEE-CCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCHHH Q ss_conf 98999999-9860878998629999995-378764544301379952778865406999750445768741200002789 Q gi|254780146|r 23 GLSDAVVM-LKERATARFDETVEIAMNL-GIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIVGGED 100 (232) Q Consensus 23 si~eAi~~-lk~~~~~kf~esvel~i~L-~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~~e 100 (232) .++||++. +++.+.+||+||||++|+| ++|++|++|+|||+|.|||++|++.||||||+++.+++|++|||++||++| T Consensus 7 ~l~eAv~~~~~~~k~rkF~esvel~i~Lk~iDpkk~dq~vrg~v~LP~g~gK~~rV~Vfa~~e~~~~Ak~aGad~Vg~~d 86 (216) T PRK04203 7 KIEEAVKEALEESKPRNFTQSVELIVNLKDLDLKKPENRIDEEVVLPHGRGKEVKIAVIADGELALQAKEAGADYVITRE 86 (216) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCCEEEEECCHHHHHHHHHCCCCEEECHH T ss_conf 99999999998687668874189999616889987665457899899999996389998886669999976999870798 Q ss_pred HHHHHHCC-------CCCCCEEEECHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEE Q ss_conf 99997517-------5333468864467788886-433200100174223356665579999998601110223575267 Q gi|254780146|r 101 LFEIVKGG-------QIDFDRCIATPDMMPLVGR-LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDFRSEKAGII 172 (232) Q Consensus 101 li~~i~~~-------~~~fd~~iAt~~~m~~i~k-lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~r~~k~g~i 172 (232) +++++.++ |.+||+|||||+||++|++ |||+||||||||||+.||+ |+..+|+++++ +++||+|+.|++ T Consensus 87 ~i~ki~~~k~~~kk~~~~fD~~IAtpdmM~~v~k~LGkiLGPrGlMPnPk~gt~--di~~~v~~~k~-~v~~r~dk~~~i 163 (216) T PRK04203 87 ELEELGGNKRAAKKLANEYDFFIAEADLMPLIGRILGPVLGPRGKMPTPVPPNA--DIKPLIERLKN-TVRIRSRDQPTF 163 (216) T ss_pred HHHHHHHHHHHHHHHHHHCCEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCC--CHHHHHHHHHC-CEEEEECCCCEE T ss_conf 999987568888876522588996789999999877664276777898889987--89999999737-569996587667 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEEC Q ss_conf 8753157789899999999999999860866254433779999923897277400 Q gi|254780146|r 173 HAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDL 227 (232) Q Consensus 173 ~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~ 227 (232) |++||+++|++|||.||+.+++++|.+.+|.+. +||+++|||||||||++|+. T Consensus 164 h~~IGk~sf~~e~l~eNi~a~~~~i~~~~p~g~--~~Ik~v~lksTMGp~vkv~~ 216 (216) T PRK04203 164 HVRVGTEDMSPEELAENIEAVLNRIERKLEKGR--QNIKSVYVKTTMGPAVRVEA 216 (216) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHHCCCHH--CCEEEEEEECCCCCCEECCC T ss_conf 533417779999999999999999998384421--10536999899898876259 No 7 >PTZ00029 60S ribosomal protein L1/L10a; Provisional Probab=100.00 E-value=0 Score=380.12 Aligned_cols=198 Identities=23% Similarity=0.368 Sum_probs=181.6 Q ss_pred CHHHHHHHHHH---CCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCHH Q ss_conf 98999999986---087899862999999537876454430137995277886540699975044576874120000278 Q gi|254780146|r 23 GLSDAVVMLKE---RATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIVGGE 99 (232) Q Consensus 23 si~eAi~~lk~---~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~~ 99 (232) +++||++.+.+ .+.+||+||||++|+|+.|+.+++|+|||+|.|||+.|++.||||||+++.+++|++|||+++|.+ T Consensus 8 ~l~~ai~~v~~~~~~kkrkF~ETVel~i~Lk~~d~~~dq~vrg~v~LP~g~gk~~kV~Vfa~g~~~~~Ak~agad~v~~e 87 (217) T PTZ00029 8 TLNKAISAVLEGSKTKKRKFVETVELQIGLKDYDTQRDKRFSGTVVLQNVPRKNMKVCVFGDAVHCEEAQKLGLDYMDIE 87 (217) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCCCCCCEEEEEECHHHHHHHHHCCCCEECHH T ss_conf 99999999986364667788614999996688898767364678986899999867999907177789998599788799 Q ss_pred HHHHHHHCCC------CCCCEEEECHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEE Q ss_conf 9999975175------333468864467788886-433200100174223356665579999998601110223575267 Q gi|254780146|r 100 DLFEIVKGGQ------IDFDRCIATPDMMPLVGR-LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDFRSEKAGII 172 (232) Q Consensus 100 eli~~i~~~~------~~fd~~iAt~~~m~~i~k-lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~r~~k~g~i 172 (232) ||.+..++++ .+||+|||+|++|+++++ ||+.|||+|+||++. |+++|+.+.|+++++ +++||+++.+++ T Consensus 88 dl~~~~k~k~~~kk~a~~~D~fiA~~~~m~~i~r~LGp~L~p~gKmP~~~--~~~~~i~~~i~~~k~-~v~~r~kk~~~i 164 (217) T PTZ00029 88 GMKKFNKNKTLVKKLAKKYDAFLASQVLLPQIPRLLGPGLNKAGKFPTQI--THNDKIEDKINELKS-SIKFQLKKVLCM 164 (217) T ss_pred HHHHHHCCHHHHHHHHHHCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCHHHHHHHHHC-EEEEEECCCCEE T ss_conf 99976325079998763268799755898888887563466679989877--898889999999850-489996786678 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEE Q ss_conf 87531577898999999999999998608662544337799999238972774 Q gi|254780146|r 173 HAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKV 225 (232) Q Consensus 173 ~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~i 225 (232) |++||+.+|++|||.||+.+++++|.+.+|. .+++|+++||||||||+++| T Consensus 165 h~~VG~~~m~~e~l~eNi~~~i~~l~~~l~~--~~~nIk~v~iksTMGp~vrv 215 (217) T PTZ00029 165 GVAVGNVNMTEEELRANIVLAINFLVSLLKK--NWHNIRGLTIKSTMGKPQRI 215 (217) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHHC--CCCCCCEEEEECCCCCCEEC T ss_conf 7877168799899999999999999986213--56772259998899988772 No 8 >PTZ00225 60S ribosomal protein L10a; Provisional Probab=100.00 E-value=0 Score=362.15 Aligned_cols=198 Identities=22% Similarity=0.367 Sum_probs=181.5 Q ss_pred CCHHHHHH-HHHHCCCCCCCCCEEEEEEE-CCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCHH Q ss_conf 89899999-99860878998629999995-37876454430137995277886540699975044576874120000278 Q gi|254780146|r 22 YGLSDAVV-MLKERATARFDETVEIAMNL-GIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIVGGE 99 (232) Q Consensus 22 ysi~eAi~-~lk~~~~~kf~esvel~i~L-~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~~ 99 (232) -++.|||+ .+++.+.++|+||||++|+| ++|+++ +++|+|+|.|||+.|++.+|||||+++.+++|+++||+++|.+ T Consensus 7 ~~l~eai~~~l~~~k~rkF~ESvdl~i~Lk~~D~kk-d~ri~~~v~LP~~~~k~~kv~v~ad~~~~~~Ak~~g~~~v~~e 85 (214) T PTZ00225 7 QTLSEAIQAVLKVDKERKFKESIDLQVNLKNYDPQK-DKRFSGSLKLPNVCRPRMTVCLLCDLVHEDIAKKEGVPTMNQE 85 (214) T ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCC-CCEEEEEEECCCCCCCCCEEEEECCHHHHHHHHHCCCCEECHH T ss_conf 999999999997476778873089999736778986-7641367988899998856999778898999998699772799 Q ss_pred HHHHHHHC------CCCCCCEEEECHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEE Q ss_conf 99999751------75333468864467788886-433200100174223356665579999998601110223575267 Q gi|254780146|r 100 DLFEIVKG------GQIDFDRCIATPDMMPLVGR-LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDFRSEKAGII 172 (232) Q Consensus 100 eli~~i~~------~~~~fd~~iAt~~~m~~i~k-lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~r~~k~g~i 172 (232) ||.+..++ .+.+||+|||++++|+++++ ||++|||+|+||+ .+|+++|+.+.|+++++ ++.||+++.+++ T Consensus 86 dl~~l~k~kk~~kkla~~~D~fia~~~~m~~i~r~LGp~Lg~~gK~P~--~~~~~~di~~~v~~~k~-tVk~r~kk~~~~ 162 (214) T PTZ00225 86 ELKKLNKNKKLVKKMCNQYDAFLCSESIIKTVPRLVGPHMHRMGKFPT--VCSPSESLPDKVVELRS-TVKFQLKKVLCL 162 (214) T ss_pred HHHHHCCCHHHHHHHHHHCCEEEECHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCHHHHHHHHHH-EEEEEECCCCEE T ss_conf 999870026788887742698996578887777762744677899898--77875468999999862-699998887568 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEE Q ss_conf 87531577898999999999999998608662544337799999238972774 Q gi|254780146|r 173 HAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKV 225 (232) Q Consensus 173 ~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~i 225 (232) |++||+++|++|||.||+.+++++|.+..|.+ .++|+++||+|||||+++| T Consensus 163 ~~~VG~~~m~~e~l~eNi~a~l~~l~~~l~kg--~~nIk~vyiKsTMG~~vrv 213 (214) T PTZ00225 163 GTCVGHVEMTEEQLRQNVVMAINFLVSLLKKN--WQNLKSAYIKSTMGKPQRI 213 (214) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHCCC--CCCCCEEEEECCCCCCEEC T ss_conf 88870687998999999999999999871037--6650169998899988707 No 9 >cd00403 Ribosomal_L1 Ribosomal protein L1. The L1 protein, located near the E-site of the ribosome, forms part of the L1 stalk along with 23S rRNA. In bacteria and archaea, L1 functions both as a ribosomal protein that binds rRNA, and as a translation repressor that binds its own mRNA. Like several other large ribosomal subunit proteins, L1 displays RNA chaperone activity. L1 is one of the largest ribosomal proteins. It is composed of two domains that cycle between open and closed conformations via a hinge motion. The RNA-binding site of L1 is highly conserved, with both mRNA and rRNA binding the same binding site. Probab=100.00 E-value=0 Score=350.02 Aligned_cols=200 Identities=47% Similarity=0.783 Sum_probs=190.8 Q ss_pred HHHHHHHHHHCC--CCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCHHHH Q ss_conf 899999998608--789986299999953787645443013799527788654069997504457687412000027899 Q gi|254780146|r 24 LSDAVVMLKERA--TARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIVGGEDL 101 (232) Q Consensus 24 i~eAi~~lk~~~--~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~~el 101 (232) +.||++.+++.+ .++|+|+|+++++|+.+++++++++++.|.|||+++++.+|||||+++.+++|+++||+++|++|| T Consensus 1 ~~~Aik~l~~~~~~~~~~~~~v~l~i~l~~~~~~~~~~~~~~v~LPh~~~~~~~i~v~~~~~~~~~a~~~ga~vvg~~~L 80 (208) T cd00403 1 LEEAIKALKKTSVKKRKFDETVELQINLKKDDRKKDQRIRGTVILPHGLGKDVKVCVFAKDEQAKEAKAAGADVVGGEDL 80 (208) T ss_pred CHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCEEEEEEECCCCCCCCEEEEEECCHHHHHHHHHCCCCEECHHHH T ss_conf 96899999970667788663499999827988987765789998899999860899975788789998769857358999 Q ss_pred HHHHHCCC-----CCCCEEEECHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEE Q ss_conf 99975175-----333468864467788886-433200100174223356665579999998601110223575267875 Q gi|254780146|r 102 FEIVKGGQ-----IDFDRCIATPDMMPLVGR-LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDFRSEKAGIIHAG 175 (232) Q Consensus 102 i~~i~~~~-----~~fd~~iAt~~~m~~i~k-lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~r~~k~g~i~~~ 175 (232) ++++++++ .+||+|||++++|+.+.+ ||++|||||+||+|+.||+++|+.+.|+++++ ++.||.++.++++++ T Consensus 81 ~~~~k~~~~kkl~~~fD~fla~~~~~~~l~~~LGk~l~~k~kmP~~~~~t~~~dl~~~i~~~~~-~~~~~~~~~~~~~v~ 159 (208) T cd00403 81 KKKIKNGEAKKLAKDFDLFLADPRIMMLLPKLLGKVLGPRGKMPNPKTGTVTEDLAKAIEEAKS-SVEFRLDKGGCIHVP 159 (208) T ss_pred HHHHHCCHHHHHHHCCCEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHC-CEEEEECCCCEEEEE T ss_conf 9887563156676328999987899899998751452103788855368774659999998738-489998899748876 Q ss_pred ECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEE Q ss_conf 31577898999999999999998608662544337799999238972774 Q gi|254780146|r 176 IGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKV 225 (232) Q Consensus 176 IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~i 225 (232) ||+++|+++||.||+.+++++|.+++|. .++.||+++||+|||||+++| T Consensus 160 VG~~~m~~e~l~eNi~a~~~~i~~~~p~-~~~~~I~~v~lktt~~~~l~I 208 (208) T cd00403 160 VGKVSMSPEQLVENIEAVINALVKKLPS-KKGQNIKSIYLKTTMGPSLPI 208 (208) T ss_pred EECCCCCHHHHHHHHHHHHHHHHHHCCC-CCCCEEEEEEEECCCCCCEEC T ss_conf 6136699999999999999999984787-567629899998999986479 No 10 >KOG1569 consensus Probab=100.00 E-value=0 Score=331.55 Aligned_cols=210 Identities=34% Similarity=0.601 Sum_probs=193.3 Q ss_pred CCCCCCHHHHHHHHHHCCCCC----CCCCEEEEEEECCCCCC---CCCCCEEEEEECCCCCC--CCCEEEEECH-HHHHH Q ss_conf 476889899999998608789----98629999995378764---54430137995277886--5406999750-44576 Q gi|254780146|r 18 RSALYGLSDAVVMLKERATAR----FDETVEIAMNLGIDPRH---ANQMVRGVVTMPNGTGV--NVRVAVFATS-SKADE 87 (232) Q Consensus 18 ~~~~ysi~eAi~~lk~~~~~k----f~esvel~i~L~id~~k---~~~~irg~v~LPh~~gk--~~kV~Vf~~~-~~~~~ 87 (232) +.++|++++|++++|++.... +.+++.+.+.|++-..+ ....+.+.+..|||++. ..+|+||+++ +..++ T Consensus 86 kr~~y~vq~a~~~~ke~~~~sa~~~pk~~~~l~i~l~~~a~~~~k~~~~~~~~v~vP~Pf~~ge~r~IavFtn~~~kvee 165 (323) T KOG1569 86 KRQIYEVQKAVHLHKELQILSAYNVPKQPVNLRIELNMLAEKETKGVAPLQGSVTVPHPFAHGEIRKIAVFTNDASKVEE 165 (323) T ss_pred CCCHHHHHHHHHHHHHHCCCHHHHCCCCCEEEEEEEEEEEEEEECCCCCCCCEEECCCCCCCCCEEEEEEECCCHHHHHH T ss_conf 75210378888888875642365378865144676302565540255787604753555777855899996188278899 Q ss_pred HHHCCCCCCCHHHHHHHHHCCCC--CCCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCC-CCCC Q ss_conf 87412000027899999751753--33468864467788886433200100174223356665579999998601-1102 Q gi|254780146|r 88 AREAGADIVGGEDLFEIVKGGQI--DFDRCIATPDMMPLVGRLGRILGPRGIMPNLRVGTVTTDVATAVRESKSG-AVDF 164 (232) Q Consensus 88 Ak~aGa~~vg~~eli~~i~~~~~--~fd~~iAt~~~m~~i~klgriLGPkglMPn~k~GTv~~di~~~i~~~k~g-~i~~ 164 (232) |+++||+++||.|||++|++|.+ ||||||||||||++|..|+++|||| ||||+.|||++||+++|+++++| .++| T Consensus 166 ArEaGA~l~GG~dLIkkI~~Gei~~Dyd~~vA~Pdim~~l~~Lr~iL~~r--~Pn~k~gtvg~nipemieeFk~G~~i~~ 243 (323) T KOG1569 166 AREAGAALAGGTDLIKKIKSGEIVADYDFYVAHPDIMPELNRLRKILGPR--FPNPKRGTVGRNIPEMIEEFKNGHEIKF 243 (323) T ss_pred HHHCCCCCCCHHHHHHHHHCCEEEEEECEEEECCHHHHHHHHHHHHHCCC--CCCCCCCCCCCCHHHHHHHHHCCCCCCC T ss_conf 88633001140879998636828873032430632779999998873546--8986658611156999999628764544 Q ss_pred CCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEECHHC Q ss_conf 235752678753157789899999999999999860866254433779999923897277400100 Q gi|254780146|r 165 RSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDLSSF 230 (232) Q Consensus 165 r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~~~~ 230 (232) +.++.+.++++||+++|+++||++||.++++++++++|. .+|.|+.+++|+|+||||+.||++++ T Consensus 244 d~~~~~~~~~~vGkl~mt~e~i~~Ni~a~i~av~t~rp~-~~G~fv~ra~L~Ss~g~~i~l~i~p~ 308 (323) T KOG1569 244 DEERENILQIKVGKLDMTSEQILANIQAVIAAVCTHRPK-ALGPFVVRALLRSSPGEGILLKIDPL 308 (323) T ss_pred CCCCCCEEEEEEEEECCCHHHHHHHHHHHHHHHHHCCCC-CCCCEEEEEEEECCCCCCEEEECCCC T ss_conf 555576146413554388899999999999999832884-45760565676058997626741456 No 11 >TIGR01170 rplA_mito ribosomal protein L1, mitochondrial; InterPro: IPR005879 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L1 is the largest protein from the large ribosomal subunit. In Escherichia coli, L1 is known to bind to the 23S rRNA. This model describes the mitochondrial L1 protein.; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005739 mitochondrion, 0015934 large ribosomal subunit. Probab=99.97 E-value=7.7e-32 Score=217.94 Aligned_cols=139 Identities=37% Similarity=0.675 Sum_probs=126.4 Q ss_pred CCHHHHHHHHHHCCCC---CCCCCEEEEEEECCCCCCCC--CCCEEEEEECCCCCC-CC-CEEEEECHHH-HHHHHHC-C Q ss_conf 8989999999860878---99862999999537876454--430137995277886-54-0699975044-5768741-2 Q gi|254780146|r 22 YGLSDAVVMLKERATA---RFDETVEIAMNLGIDPRHAN--QMVRGVVTMPNGTGV-NV-RVAVFATSSK-ADEAREA-G 92 (232) Q Consensus 22 ysi~eAi~~lk~~~~~---kf~esvel~i~L~id~~k~~--~~irg~v~LPh~~gk-~~-kV~Vf~~~~~-~~~Ak~a-G 92 (232) |++.+|++++|+.+.. .+.++|.|.+.|+....+.. ..|++.+.|||++++ .. |||||+.++. .++|+++ | T Consensus 1 ~~V~~A~~~lk~~~i~~~~~~~~~~~L~~~l~~~~gk~~~~~~F~~~~~fP~pF~~G~~~KIavFT~~~~~~~~~~e~~~ 80 (155) T TIGR01170 1 YKVEKAVKYLKEVSISDFVSPKQSVNLDIELKMELGKESRVVDFSGVVSFPHPFAKGEIRKIAVFTKGASEVEEAREKYG 80 (155) T ss_pred CCCCHHEEEEEEEECCCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCEEEEECCCCHHHHHHHHCCC T ss_conf 93320011310111476357763057754322101465542255103634754567995168984178035678876388 Q ss_pred CCCCCHHHHHHHHHCCCC---CCCEEEECHHHHHHHH-HHHHHHHHHHCCCCCCCCCCCCC---HHHHHHHHHCC Q ss_conf 000027899999751753---3346886446778888-64332001001742233566655---79999998601 Q gi|254780146|r 93 ADIVGGEDLFEIVKGGQI---DFDRCIATPDMMPLVG-RLGRILGPRGIMPNLRVGTVTTD---VATAVRESKSG 160 (232) Q Consensus 93 a~~vg~~eli~~i~~~~~---~fd~~iAt~~~m~~i~-klgriLGPkglMPn~k~GTv~~d---i~~~i~~~k~g 160 (232) |+||||+|||++|.++.+ |||+||||||.|+.|. +++|+|||||+||++|.|||++| |.++|+.+++| T Consensus 81 a~~vGg~dLikKI~~~ei~~sDf~~~~A~P~~~~~L~s~~~~~Lg~Kgl~P~~K~gT~~~~nnil~~~i~~~~~G 155 (155) T TIGR01170 81 ASYVGGEDLIKKIKDGEIKVSDFDYLIAHPDIVPELASQLAKLLGPKGLMPSPKRGTVSDDNNILKSLIEKFKKG 155 (155) T ss_pred CCEECCHHHHCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 322351232201448952211554168711146778999987604676687477787454134579999986169 No 12 >KOG1570 consensus Probab=99.93 E-value=1.4e-25 Score=179.13 Aligned_cols=197 Identities=26% Similarity=0.375 Sum_probs=168.5 Q ss_pred CCHHHHHH----HHHHCCCCCCCCCEEEEEEE-CCCCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCC Q ss_conf 89899999----99860878998629999995-37876454430137995277886540699975044576874120000 Q gi|254780146|r 22 YGLSDAVV----MLKERATARFDETVEIAMNL-GIDPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIV 96 (232) Q Consensus 22 ysi~eAi~----~lk~~~~~kf~esvel~i~L-~id~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~v 96 (232) +++.+|+. ..++.+.++|.||++++++| |.|+.| +.++.|++.|||...+..++|||++..++.+|++.|.+++ T Consensus 8 ~~l~~~v~~i~~~~~~~k~~~~~~tielQi~Lk~ydp~K-dkr~s~~~~l~~~~~p~~k~cv~gd~~h~~~ak~~~~p~m 86 (218) T KOG1570 8 EALREAVAEILLRSKEFKNRNFVETIELQINLKNYDPQK-DKRFSGSVKLPHAPRPKMKVCVIGDQQHCDEAKAIDLPAM 86 (218) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEEECCCCCCHHH-HCCCCCEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCC T ss_conf 899999999985031124765215655541324468677-2765505764788984311455206777787502799987 Q ss_pred CHHHHHHH------HHCCCCCCCEEEECHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCC Q ss_conf 27899999------75175333468864467788886-433200100174223356665579999998601110223575 Q gi|254780146|r 97 GGEDLFEI------VKGGQIDFDRCIATPDMMPLVGR-LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDFRSEKA 169 (232) Q Consensus 97 g~~eli~~------i~~~~~~fd~~iAt~~~m~~i~k-lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~r~~k~ 169 (232) .-++|-+. +++-...||+|||..+...+|.+ ||+-|...|++|++.. -.+|+.+.+++.++ +++++.++. T Consensus 87 ~~d~Lk~lnk~Kklvkklakky~~flase~~ikQlpr~lgp~l~kaGkfps~l~--~~d~l~akv~evk~-t~k~q~kkv 163 (218) T KOG1570 87 DIDALKKLNKNKKLVKKLAKKYDAFLASESLIKQLPRILGPGLSKAGKFPSLLV--HGDSLYAKVEEVKS-TIKFQMKKV 163 (218) T ss_pred CHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC--CHHHHHHHHHHHHH-HHHHHHHHH T ss_conf 879872476443078999876556776689999867654645130024775441--53768999999875-477777766 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCC-CCEEEEEEEEECCCCCEEE Q ss_conf 2678753157789899999999999999860866254-4337799999238972774 Q gi|254780146|r 170 GIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAK-GDYVKRVTLSSTMGCGIKV 225 (232) Q Consensus 170 g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~k-g~~Ik~v~lssTmGp~i~i 225 (232) -+..+.||...|++++|.+||.--++++.+.. .+ +++++.+|++|||||+.++ T Consensus 164 lclaVaVGhv~M~d~el~~NI~l~vnFlVSlL---KknwQNvral~iKst~g~p~~l 217 (218) T KOG1570 164 LCLAVAVGHVGMTDEELVYNIHLAVNFLVSLL---KKNWQNVRALYIKSTMGKPQRL 217 (218) T ss_pred HHEEEEECCCCCCHHHHHHHHHHHHHHHHHHH---HHCCCCHHHEEECCCCCCCEEC T ss_conf 41002525777778998865566455579988---6233012113541688986504 No 13 >KOG1685 consensus Probab=99.00 E-value=1.5e-09 Score=79.67 Aligned_cols=188 Identities=14% Similarity=0.174 Sum_probs=129.7 Q ss_pred CCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCCC----C--CCCCEEEEECHHHH---------HHHHHCCCC---- Q ss_conf 0878998629999995378764544301379952778----8--65406999750445---------768741200---- Q gi|254780146|r 34 RATARFDETVEIAMNLGIDPRHANQMVRGVVTMPNGT----G--VNVRVAVFATSSKA---------DEAREAGAD---- 94 (232) Q Consensus 34 ~~~~kf~esvel~i~L~id~~k~~~~irg~v~LPh~~----g--k~~kV~Vf~~~~~~---------~~Ak~aGa~---- 94 (232) .+..++...+.+..+.. +.....+-.+.+||.+ - +...||+|..+... +.-.++|.. T Consensus 47 ~~~~k~~~~~~~~k~~~----~~~~~n~~~IP~k~~~~~~~~~~~~~~vcLIvkD~~~~~~t~~~~~d~l~~~gi~~itk 122 (343) T KOG1685 47 LEALKNVYKVVLQKNTP----QKVGTNKLKIPLKHLLKKLLLKHDDTDVCLIVKDLQQSYLTEEDAEDLLESEGIKSITK 122 (343) T ss_pred HHHHHHHHHHHHHHHCC----CCCCCCCCCCCCCCHHCCCCCCCCCCCEEEEECCCHHCCCCHHHHHHHHHHCCCCCCEE T ss_conf 03255567789987403----30123301165751101355688997558984370101221789999888729961205 Q ss_pred CCCHHHHHHHHHCCC------CCCCEEEECHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCC-C--HHHHHHHHHCCCCCC Q ss_conf 002789999975175------333468864467788886-433200100174223356665-5--799999986011102 Q gi|254780146|r 95 IVGGEDLFEIVKGGQ------IDFDRCIATPDMMPLVGR-LGRILGPRGIMPNLRVGTVTT-D--VATAVRESKSGAVDF 164 (232) Q Consensus 95 ~vg~~eli~~i~~~~------~~fd~~iAt~~~m~~i~k-lgriLGPkglMPn~k~GTv~~-d--i~~~i~~~k~g~i~~ 164 (232) ++...+|-...+.-+ -.||+|+|.--+++.+.+ |||..-.+.+-|-|-. +.. + +.+.|.+... .+-| T Consensus 123 VI~~~~LK~~yksyeaKr~L~~sfDlflaD~RV~~~LP~llGK~f~q~kk~Pv~i~--l~k~~~~l~~qi~~a~~-~t~~ 199 (343) T KOG1685 123 VISLSKLKRDYKSYEAKRKLLSSFDLFLADDRVIPLLPKLLGKEFYQKKKVPVSIR--LSKKNELLKQQIENACG-STYF 199 (343) T ss_pred EEEHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHHHHHCCCCCCCEEEE--ECCCCHHHHHHHHHHHH-HHEE T ss_conf 65659989998779999988732257877544676778885166403556734788--31661478999999863-5134 Q ss_pred CCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCEEEECHHC Q ss_conf 235752678753157789899999999999999860866254433779999923897277400100 Q gi|254780146|r 165 RSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKAKPSVAKGDYVKRVTLSSTMGCGIKVDLSSF 230 (232) Q Consensus 165 r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~kp~~~kg~~Ik~v~lssTmGp~i~id~~~~ 230 (232) +..+..+.-+.+|.++|+.++|.||+.++++.|....|.+. .+|+++||++.-+|++-|=.+.- T Consensus 200 ~~~~g~~~~i~vg~l~~~~~~l~ENv~~vlk~l~e~~P~~~--~~Irsl~lKt~~s~aLPly~s~~ 263 (343) T KOG1685 200 RLRKGTSSAIKVGHLGNTSEELAENVVAVLKGLSEILPGGW--KNIRSLHLKTSRSPALPLYVSVI 263 (343) T ss_pred ECCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCHH--HHHHHHHHHCCCCCCCCEEECCC T ss_conf 10577535788313466188989999999998998762367--67678763125775220122366 No 14 >PTZ00029 60S ribosomal protein L1/L10a; Provisional Probab=93.48 E-value=0.043 Score=33.46 Aligned_cols=26 Identities=31% Similarity=0.478 Sum_probs=19.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 888864332001001742233566655 Q gi|254780146|r 123 PLVGRLGRILGPRGIMPNLRVGTVTTD 149 (232) Q Consensus 123 ~~i~klgriLGPkglMPn~k~GTv~~d 149 (232) ..++++||+||| ||||++|.+++... T Consensus 114 ~~m~~i~r~LGp-~L~p~gKmP~~~~~ 139 (217) T PTZ00029 114 VLLPQIPRLLGP-GLNKAGKFPTQITH 139 (217) T ss_pred HHHHHHHHHHCC-CCCCCCCCCCCCCC T ss_conf 898888887563-46667998987789 No 15 >COG1064 AdhP Zn-dependent alcohol dehydrogenases [General function prediction only] Probab=83.53 E-value=2 Score=23.07 Aligned_cols=60 Identities=28% Similarity=0.219 Sum_probs=39.8 Q ss_pred CCEEEEECH-HHHHHHHHCCCCCCCH---HHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHHHHHC Q ss_conf 406999750-4457687412000027---899999751753334688644677888864332001001 Q gi|254780146|r 74 VRVAVFATS-SKADEAREAGADIVGG---EDLFEIVKGGQIDFDRCIATPDMMPLVGRLGRILGPRGI 137 (232) Q Consensus 74 ~kV~Vf~~~-~~~~~Ak~aGa~~vg~---~eli~~i~~~~~~fd~~iAt~~~m~~i~klgriLGPkgl 137 (232) .+|.+|..+ +..+.|++.||+++-. +|..+.+++. ||..|-|.. -..+...-+.|.+.|. T Consensus 191 a~Via~~~~~~K~e~a~~lGAd~~i~~~~~~~~~~~~~~---~d~ii~tv~-~~~~~~~l~~l~~~G~ 254 (339) T COG1064 191 AEVIAITRSEEKLELAKKLGADHVINSSDSDALEAVKEI---ADAIIDTVG-PATLEPSLKALRRGGT 254 (339) T ss_pred CEEEEEECCHHHHHHHHHHCCCEEEECCCCHHHHHHHHH---CCEEEECCC-HHHHHHHHHHHHCCCE T ss_conf 969999578779999998488289976781166776734---739999877-4559999998642978 No 16 >PRK13958 N-(5'-phosphoribosyl)anthranilate isomerase; Provisional Probab=68.68 E-value=1.9 Score=23.22 Aligned_cols=22 Identities=9% Similarity=0.234 Sum_probs=9.2 Q ss_pred CEEEEECHH--HH-HHHHHCCCCCC Q ss_conf 069997504--45-76874120000 Q gi|254780146|r 75 RVAVFATSS--KA-DEAREAGADIV 96 (232) Q Consensus 75 kV~Vf~~~~--~~-~~Ak~aGa~~v 96 (232) +|+||.+.+ .. +.+...+.+++ T Consensus 55 ~VgVfvn~~~~~i~~~~~~~~ld~v 79 (207) T PRK13958 55 KVCVMVNPDLTTIEHVLSNTSINTI 79 (207) T ss_pred EEEEEECCCHHHHHHHHHHCCCCEE T ss_conf 6999946979999999985799779 No 17 >PRK09427 bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase; Provisional Probab=63.57 E-value=2.4 Score=22.55 Aligned_cols=40 Identities=25% Similarity=0.318 Sum_probs=28.9 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHCCCCCCC Q ss_conf 8764544301379952778865406999750445768741200002 Q gi|254780146|r 52 DPRHANQMVRGVVTMPNGTGVNVRVAVFATSSKADEAREAGADIVG 97 (232) Q Consensus 52 d~~k~~~~irg~v~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg 97 (232) ....+.+.+|..+ .| ..|||=+++.+.+..|.++|||++| T Consensus 244 r~~d~~~~~r~l~-----~g-~~KICGIT~~eDA~~a~~~GAD~iG 283 (459) T PRK09427 244 AEDDLDLAVRKLI-----LG-ENKVCGLTRPQDAKAAYDAGAVYGG 283 (459) T ss_pred CCCCHHHHHHHHH-----CC-CCCCCCCCCHHHHHHHHHCCCCEEE T ss_conf 7999899999973-----78-9846789989999999984999898 No 18 >cd04735 OYE_like_4_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 4. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=63.07 E-value=8.2 Score=19.29 Aligned_cols=14 Identities=43% Similarity=0.344 Sum_probs=9.4 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 44576874120000 Q gi|254780146|r 83 SKADEAREAGADIV 96 (232) Q Consensus 83 ~~~~~Ak~aGa~~v 96 (232) +.|..|++||.|.| T Consensus 148 ~AA~rA~~AGfDgV 161 (353) T cd04735 148 EATRRAIEAGFDGV 161 (353) T ss_pred HHHHHHHHCCCCEE T ss_conf 99999998399989 No 19 >COG0135 TrpF Phosphoribosylanthranilate isomerase [Amino acid transport and metabolism] Probab=59.28 E-value=3.5 Score=21.55 Aligned_cols=18 Identities=22% Similarity=0.311 Sum_probs=7.7 Q ss_pred CCCCCCCC-CCHHHHHHHH Q ss_conf 22335666-5579999998 Q gi|254780146|r 140 NLRVGTVT-TDVATAVRES 157 (232) Q Consensus 140 n~k~GTv~-~di~~~i~~~ 157 (232) -...|.++ +|+.++|+.. T Consensus 157 ~~LAGGL~p~NV~~ai~~~ 175 (208) T COG0135 157 VMLAGGLNPDNVAEAIALG 175 (208) T ss_pred EEEECCCCHHHHHHHHHHC T ss_conf 7997788989999999835 No 20 >TIGR02817 adh_fam_1 zinc-binding alcohol dehydrogenase family protein; InterPro: IPR014182 Members of this entry form a distinct subset of the larger family of oxidoreductases that includes zinc-binding alcohol dehydrogenases and NADPH:quinone reductases. Some current members of this entry carry designations as putative alginate lyase, however none have been characterised as such.. Probab=58.97 E-value=15 Score=17.67 Aligned_cols=67 Identities=15% Similarity=0.139 Sum_probs=48.7 Q ss_pred CCCEEEEEC-HHHHHHHHHCCCCCCCH--HHHHHHHHCCCC-CCCEEEECHHHHHHHHHHHHHHHHHHCCC Q ss_conf 540699975-04457687412000027--899999751753-33468864467788886433200100174 Q gi|254780146|r 73 NVRVAVFAT-SSKADEAREAGADIVGG--EDLFEIVKGGQI-DFDRCIATPDMMPLVGRLGRILGPRGIMP 139 (232) Q Consensus 73 ~~kV~Vf~~-~~~~~~Ak~aGa~~vg~--~eli~~i~~~~~-~fd~~iAt~~~m~~i~klgriLGPkglMP 139 (232) +.+|..-|. .|..+=.++.||++|-. ..|-.+|++-.+ ..+|+++++--=.....+..+|-|=|++= T Consensus 176 ~ltVIaTASRpEs~~Wv~~LGAH~VIDHskPL~~ql~~L~l~~v~~V~SlT~TDqH~~~ive~laPQGrla 246 (338) T TIGR02817 176 GLTVIATASRPESQEWVLELGAHHVIDHSKPLKAQLEKLGLEAVSYVFSLTHTDQHFKEIVELLAPQGRLA 246 (338) T ss_pred CCEEEEECCCHHHHHHHHHCCCCEEECCCCCHHHHHHHHCCCCCCEEECCCHHHHHHHHHHHHHCCCCCEE T ss_conf 96499972857899999973991886588436899998288998557516607899999999856787400 No 21 >PRK01222 N-(5'-phosphoribosyl)anthranilate isomerase; Provisional Probab=58.59 E-value=3.8 Score=21.38 Aligned_cols=34 Identities=24% Similarity=0.409 Sum_probs=19.4 Q ss_pred CCEEEEECHH--HH-HHHHHCCCCCC---C--HHHHHHHHHC Q ss_conf 4069997504--45-76874120000---2--7899999751 Q gi|254780146|r 74 VRVAVFATSS--KA-DEAREAGADIV---G--GEDLFEIVKG 107 (232) Q Consensus 74 ~kV~Vf~~~~--~~-~~Ak~aGa~~v---g--~~eli~~i~~ 107 (232) .+|+||.+.+ .. +.+++.|.+++ | .++.++.++. T Consensus 56 ~~VgVfv~~~~~~i~~~~~~~~~d~vQlHG~e~~~~~~~l~~ 97 (212) T PRK01222 56 KVVGVFVNASDEEIAEIVEAVGLDLLQLHGDETPEFCRQLKR 97 (212) T ss_pred CEEEEEECCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHH T ss_conf 379999458179999999856998899857437899999997 No 22 >pfam00107 ADH_zinc_N Zinc-binding dehydrogenase. Probab=57.11 E-value=16 Score=17.47 Aligned_cols=65 Identities=28% Similarity=0.273 Sum_probs=38.8 Q ss_pred CCEEEEE-CHHHHHHHHHCCCCCCC---HHHHHHHHHCCCC--CCCEEEECHHHHHHHHHHHHHHHHHHCC Q ss_conf 4069997-50445768741200002---7899999751753--3346886446778888643320010017 Q gi|254780146|r 74 VRVAVFA-TSSKADEAREAGADIVG---GEDLFEIVKGGQI--DFDRCIATPDMMPLVGRLGRILGPRGIM 138 (232) Q Consensus 74 ~kV~Vf~-~~~~~~~Ak~aGa~~vg---~~eli~~i~~~~~--~fd~~iAt~~~m~~i~klgriLGPkglM 138 (232) .+|.+.. +.+..+.|++.||+.+- .++..+.+.+..- -+|+++-+...-..+...=+.+.|.|.+ T Consensus 16 ~~Vi~~~~~~~r~~~a~~lGa~~~i~~~~~~~~~~i~~~~~~~g~d~vid~~g~~~~~~~~~~~~~~~G~i 86 (131) T pfam00107 16 ARVIAVDRSEEKLELAKELGADHVINYRDEDFVERVRELTGGRGVDVVIDCVGAPATLEQALELLRPGGRV 86 (131) T ss_pred CEEEEEECCHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHCCCCEE T ss_conf 87999969889999999759973235332212455654049977649886688666799998753599789 No 23 >KOG1588 consensus Probab=51.28 E-value=4.1 Score=21.12 Aligned_cols=37 Identities=19% Similarity=0.288 Sum_probs=23.5 Q ss_pred CCCCCCCEEEEEECCCCC-------CCCCEEEEECHHHHHHHHH Q ss_conf 645443013799527788-------6540699975044576874 Q gi|254780146|r 54 RHANQMVRGVVTMPNGTG-------VNVRVAVFATSSKADEARE 90 (232) Q Consensus 54 ~k~~~~irg~v~LPh~~g-------k~~kV~Vf~~~~~~~~Ak~ 90 (232) ..++.+|-|-+.=|.|.. ..-||.|...+..-+.||+ T Consensus 102 ~yP~fNFVGRILGPrGnSlkrLe~eTgCki~IrGrgSmrD~~KE 145 (259) T KOG1588 102 EYPKFNFVGRILGPRGNSLKRLEEETGCKIMIRGRGSMRDKAKE 145 (259) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHCCEEEEECCCCCCCHHHH T ss_conf 78887611311068753199999987976999668766665779 No 24 >PRK10222 L-ascorbate-specific enzyme IIB component of PTS; Provisional Probab=48.69 E-value=14 Score=17.76 Aligned_cols=65 Identities=14% Similarity=0.156 Sum_probs=44.3 Q ss_pred HHCCCCCCCHHHHHHHHHCCCCCCCEEEECHHHHHHHH-H-HHHHHHHHHCC-CCCCCCCCCCCHHHHHHHH Q ss_conf 74120000278999997517533346886446778888-6-43320010017-4223356665579999998 Q gi|254780146|r 89 REAGADIVGGEDLFEIVKGGQIDFDRCIATPDMMPLVG-R-LGRILGPRGIM-PNLRVGTVTTDVATAVRES 157 (232) Q Consensus 89 k~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~-k-lgriLGPkglM-Pn~k~GTv~~di~~~i~~~ 157 (232) ++.|.+..-..--+-..+.+--.||.+||+.-+..++. . =+.+||-+++| || +.++.+.+.|++. T Consensus 12 ~k~gi~~~~~SCavgd~K~~l~~~DIIVaS~HL~~Ei~~~~~~~vlGV~Nml~~n----~fg~eL~~vIk~~ 79 (85) T PRK10222 12 TQSNIDHTVNSCAVGEYKSELSGADIIIASTHIAGEITVTGNKYVVGVRNMLSPA----DFGPKLLEVIKEH 79 (85) T ss_pred HHCCCCCHHHHHHHHHHHCCCCCCCEEEECHHHHHHCCCCCCCEEEEHHHCCCHH----HHHHHHHHHHHHH T ss_conf 9829952100203565543467777898706657542678892688302036866----7789999999985 No 25 >cd00405 PRAI Phosphoribosylanthranilate isomerase (PRAI) catalyzes the fourth step of the tryptophan biosynthesis, the conversion of N-(5'- phosphoribosyl)-anthranilate (PRA) to 1-(o-carboxyphenylamino)- 1-deoxyribulose 5-phosphate (CdRP). Most PRAIs are monomeric, monofunctional and thermolabile, but in some thermophile organisms PRAI is dimeric for reasons of stability and in others it is fused to other components of the tryptophan biosynthesis pathway to form multifunctional enzymes. Probab=46.52 E-value=7.2 Score=19.62 Aligned_cols=24 Identities=38% Similarity=0.486 Sum_probs=11.9 Q ss_pred CCCEEEEECHHH---HHHHHHCCCCCC Q ss_conf 540699975044---576874120000 Q gi|254780146|r 73 NVRVAVFATSSK---ADEAREAGADIV 96 (232) Q Consensus 73 ~~kV~Vf~~~~~---~~~Ak~aGa~~v 96 (232) ..+|+||.+.+. .+.+.+.+.+++ T Consensus 51 ~~~V~Vfvn~~~~~i~~~~~~~~~d~v 77 (203) T cd00405 51 VKRVGVFVNEDLEEILEIAEELGLDVV 77 (203) T ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEE T ss_conf 539999916839999999987699889 No 26 >pfam00697 PRAI N-(5'phosphoribosyl)anthranilate (PRA) isomerase. Probab=46.40 E-value=7.1 Score=19.67 Aligned_cols=22 Identities=32% Similarity=0.421 Sum_probs=9.9 Q ss_pred CEEEEECHH--H-HHHHHHCCCCCC Q ss_conf 069997504--4-576874120000 Q gi|254780146|r 75 RVAVFATSS--K-ADEAREAGADIV 96 (232) Q Consensus 75 kV~Vf~~~~--~-~~~Ak~aGa~~v 96 (232) .|.||.+.+ . .+.+.+.+.+++ T Consensus 51 ~VgVfv~~~~~~i~~~~~~~~~d~v 75 (195) T pfam00697 51 LVGVFVNQPIEEVLRIAQVLGLDVV 75 (195) T ss_pred EEEEECCCCHHHHHHHHHHCCCCEE T ss_conf 5999847868999999983799879 No 27 >PRK13523 NADPH dehydrogenase NamA; Provisional Probab=44.30 E-value=17 Score=17.25 Aligned_cols=14 Identities=43% Similarity=0.700 Sum_probs=8.4 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 44576874120000 Q gi|254780146|r 83 SKADEAREAGADIV 96 (232) Q Consensus 83 ~~~~~Ak~aGa~~v 96 (232) +.|..|++||.|-| T Consensus 146 ~AA~rA~~AGfDgV 159 (337) T PRK13523 146 QAAKRAKEAGFDVI 159 (337) T ss_pred HHHHHHHHCCCCEE T ss_conf 99999998499989 No 28 >cd02803 OYE_like_FMN_family Old yellow enzyme (OYE)-like FMN binding domain. OYE was the first flavin-dependent enzyme identified, however its true physiological role remains elusive to this day. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=42.65 E-value=27 Score=16.06 Aligned_cols=13 Identities=54% Similarity=0.542 Sum_probs=7.7 Q ss_pred HHHHHHHCCCCCC Q ss_conf 4576874120000 Q gi|254780146|r 84 KADEAREAGADIV 96 (232) Q Consensus 84 ~~~~Ak~aGa~~v 96 (232) .+..|++||.|-| T Consensus 146 AA~~A~~AGfDgV 158 (327) T cd02803 146 AARRAKEAGFDGV 158 (327) T ss_pred HHHHHHHCCCCEE T ss_conf 9999998499989 No 29 >cd05564 PTS_IIB_chitobiose_lichenan PTS_IIB_chitobiose_lichenan: subunit IIB of enzyme II (EII) of the N,N-diacetylchitobiose-specific and lichenan-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In these systems, EII is either a lichenan- or an N,N-diacetylchitobiose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. In the chitobiose system, these subunits are expressed as separate proteins from chbA, chbB, and chbC of the chb operon (formerly the cel (cellulose) operon). In the lichenan system, these subunits are expressed from licA, licB, and licC of the lic operon. The lic operon of Bacillus subtilis is required for the transport and degradation of oligomeric beta-glucosides, which are produced by extracellular enzymes on substrates such as lichenan or barley glucan. The lic operon is transcribed from a gammaA-dependent promoter and is inducible by lichenan, lichenan hydrolysate, and cellobiose. The IIB d Probab=42.55 E-value=27 Score=16.05 Aligned_cols=67 Identities=16% Similarity=0.294 Sum_probs=40.0 Q ss_pred CEEEEECH---------HHHHHHHHCCCCC----CCHHHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHHHHH----C Q ss_conf 06999750---------4457687412000----02789999975175333468864467788886433200100----1 Q gi|254780146|r 75 RVAVFATS---------SKADEAREAGADI----VGGEDLFEIVKGGQIDFDRCIATPDMMPLVGRLGRILGPRG----I 137 (232) Q Consensus 75 kV~Vf~~~---------~~~~~Ak~aGa~~----vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~klgriLGPkg----l 137 (232) ||+.+|.+ ...+.|.+.|.++ ++..++-+.+ .+||.++-.|.+--....+-....+.| . T Consensus 1 kIlL~C~~GmSts~l~~km~~~a~~~~~~~~i~A~~~~~~~~~~----~~~DviLl~PQv~y~~~~i~~~~~~~~ipV~v 76 (96) T cd05564 1 KILLVCSAGMSTSILVKKMKKAAEKRGIDAEIEAVPESELEEYI----DDADVVLLGPQVRYMLDEVKKKAAEYGIPVAV 76 (96) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEHHHHHHHH----HCCCEEEECCHHHHHHHHHHHHHHHCCCCEEE T ss_conf 98999599865999999999999974997799996489999774----15999997727999999999986104992796 Q ss_pred CCCCCCCC Q ss_conf 74223356 Q gi|254780146|r 138 MPNLRVGT 145 (232) Q Consensus 138 MPn~k~GT 145 (232) +|.---|+ T Consensus 77 I~~~~Yg~ 84 (96) T cd05564 77 IDMMDYGM 84 (96) T ss_pred ECHHHCCC T ss_conf 07351435 No 30 >KOG1198 consensus Probab=42.40 E-value=27 Score=16.04 Aligned_cols=63 Identities=19% Similarity=0.298 Sum_probs=39.8 Q ss_pred EECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCH---HHHHHHHHCC-CCCCCEE---EECHHHHHHHHHH Q ss_conf 9527788654069997504457687412000027---8999997517-5333468---8644677888864 Q gi|254780146|r 65 TMPNGTGVNVRVAVFATSSKADEAREAGADIVGG---EDLFEIVKGG-QIDFDRC---IATPDMMPLVGRL 128 (232) Q Consensus 65 ~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~---~eli~~i~~~-~~~fd~~---iAt~~~m~~i~kl 128 (232) .|=+-.+ -.+|.+.|..+..+.+++.|||.+-. ++.++++++- ...||++ ++.+.....+..+ T Consensus 176 QlAk~~~-~~~v~t~~s~e~~~l~k~lGAd~vvdy~~~~~~e~~kk~~~~~~DvVlD~vg~~~~~~~~~~l 245 (347) T KOG1198 176 QLAKHAG-AIKVVTACSKEKLELVKKLGADEVVDYKDENVVELIKKYTGKGVDVVLDCVGGSTLTKSLSCL 245 (347) T ss_pred HHHHHCC-CCEEEEEECCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHH T ss_conf 9998749-747999815541689997299651248857799998762278850999888897324323433 No 31 >PRK13803 bifunctional phosphoribosylanthranilate isomerase/tryptophan synthase subunit beta; Provisional Probab=40.88 E-value=10 Score=18.65 Aligned_cols=23 Identities=35% Similarity=0.436 Sum_probs=14.3 Q ss_pred CCEEEEECHHH---HHHHHHCCCCCC Q ss_conf 40699975044---576874120000 Q gi|254780146|r 74 VRVAVFATSSK---ADEAREAGADIV 96 (232) Q Consensus 74 ~kV~Vf~~~~~---~~~Ak~aGa~~v 96 (232) .+|.||++.+. .+.+.+.|.+++ T Consensus 59 k~VGVFVN~~~e~I~~i~~~~~LDiV 84 (611) T PRK13803 59 RPVAVFVDFSAKALESFCKSTGIDVV 84 (611) T ss_pred CEEEEEECCCHHHHHHHHHHCCCCEE T ss_conf 76999969998999999985699979 No 32 >cd04734 OYE_like_3_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 3. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. One member of this subgroup, the Sinorhizobium meliloti stachydrine utilization protein stcD, has been idenified as a putative N-methylproline demethylase. Probab=40.67 E-value=29 Score=15.87 Aligned_cols=14 Identities=29% Similarity=0.337 Sum_probs=9.0 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 44576874120000 Q gi|254780146|r 83 SKADEAREAGADIV 96 (232) Q Consensus 83 ~~~~~Ak~aGa~~v 96 (232) +.|..|++||.|-| T Consensus 145 ~AA~~A~~AGfDgV 158 (343) T cd04734 145 DAARRCQAGGLDGV 158 (343) T ss_pred HHHHHHHHCCCCEE T ss_conf 99999997399889 No 33 >TIGR00789 flhB_rel FlhB domain protein; InterPro: IPR004683 This group describes a short protein (80-93 residues) homologous to the C-terminus of the flagellar biosynthetic protein FlhB. It is found so far only in species that also have FlhB. In a phylogenetic tree based on alignment of both this family and the homologous region of FlhB and its homologs, the members of this family form a monophyletic set.. Probab=40.36 E-value=29 Score=15.84 Aligned_cols=52 Identities=21% Similarity=0.344 Sum_probs=39.5 Q ss_pred CEEEEEECCCCCCCC--CEEEEECHHHH----HHHHHCCCCCCCHHHHHHHHHCCCCC Q ss_conf 013799527788654--06999750445----76874120000278999997517533 Q gi|254780146|r 60 VRGVVTMPNGTGVNV--RVAVFATSSKA----DEAREAGADIVGGEDLFEIVKGGQID 111 (232) Q Consensus 60 irg~v~LPh~~gk~~--kV~Vf~~~~~~----~~Ak~aGa~~vg~~eli~~i~~~~~~ 111 (232) ++..|.|-+-.+++. +|.+=+.|+-| +.||+.|..+.-.++|++....-.++ T Consensus 2 ~~KAVAL~Y~~~~d~AP~V~AsG~GevA~~II~~AK~~gipi~ed~~L~~~L~~l~l~ 59 (84) T TIGR00789 2 IKKAVALKYDKGKDKAPKVVASGVGEVAEKIIELAKKHGIPIKEDEDLVDVLLKLDLD 59 (84) T ss_pred CCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCC T ss_conf 7420013655678868857741676578899999976389976582899987440545 No 34 >COG4213 XylF ABC-type xylose transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=35.28 E-value=30 Score=15.81 Aligned_cols=33 Identities=18% Similarity=0.379 Sum_probs=16.5 Q ss_pred CCCCEEEEEC--H----HHHHHHHHCCCCCCCHHHHHHH Q ss_conf 6540699975--0----4457687412000027899999 Q gi|254780146|r 72 VNVRVAVFAT--S----SKADEAREAGADIVGGEDLFEI 104 (232) Q Consensus 72 k~~kV~Vf~~--~----~~~~~Ak~aGa~~vg~~eli~~ 104 (232) +..+++|+.. + +..++|+++|+.+++.+-||.. T Consensus 80 qg~~vlvi~a~d~~~l~~~i~~A~~~gikViaYDRlI~n 118 (341) T COG4213 80 QGVKVLVIGAIDGGVLSNAVEKAKSEGIKVIAYDRLINN 118 (341) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 599789998135325899999998759949974010026 No 35 >TIGR02922 TIGR02922 conserved hypothetical protein TIGR02922; InterPro: IPR014271 Two members of this family are found in Colwellia psychrerythraea (strain 34H / ATCC BAA-681) and one each in various other species of Colwellia and Shewanella. One member from C. psychrerythraea is of special interest because it is preceded by the same cis-regulatory site as a number of genes that have the PEP-CTERM domain described by IPR013424 from INTERPRO.. Probab=33.41 E-value=30 Score=15.82 Aligned_cols=25 Identities=12% Similarity=0.146 Sum_probs=22.3 Q ss_pred CEEEEEECCCCCCCCCEEEEECHHH Q ss_conf 0137995277886540699975044 Q gi|254780146|r 60 VRGVVTMPNGTGVNVRVAVFATSSK 84 (232) Q Consensus 60 irg~v~LPh~~gk~~kV~Vf~~~~~ 84 (232) =.|.|.+|..|.|-..|.+.|+|+- T Consensus 33 ~~Grv~iP~eFk~GKsIiAV~EG~i 57 (69) T TIGR02922 33 ESGRVVIPQEFKKGKSIIAVCEGEI 57 (69) T ss_pred CCCCEECCCCCCCCCEEEEEECCCE T ss_conf 6257760675668986899824866 No 36 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=32.58 E-value=39 Score=15.07 Aligned_cols=85 Identities=21% Similarity=0.373 Sum_probs=50.7 Q ss_pred CCCCCCHHHHHHHHHHCCCC-----C----------CCCCEEEEEEECCCCCCCCCCCEEEEEECCC------------- Q ss_conf 47688989999999860878-----9----------9862999999537876454430137995277------------- Q gi|254780146|r 18 RSALYGLSDAVVMLKERATA-----R----------FDETVEIAMNLGIDPRHANQMVRGVVTMPNG------------- 69 (232) Q Consensus 18 ~~~~ysi~eAi~~lk~~~~~-----k----------f~esvel~i~L~id~~k~~~~irg~v~LPh~------------- 69 (232) -+-.+-+...++++|+.=.. + +-+.+|+.+-..+.|==+.|+ -+|+- T Consensus 89 ~Ea~~h~~R~l~~Ik~~G~~AG~v~NP~TPl~~~~~~L~~~D~VLlMSVnPGFgGQk-----FIP~~~~Kir~~R~~id~ 163 (216) T TIGR01163 89 AEATEHIHRLLQLIKELGAKAGIVLNPATPLEALEYVLEDVDLVLLMSVNPGFGGQK-----FIPETLEKIRELRKMIDK 163 (216) T ss_pred CCCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCEEEEEEEECCCCCCC-----CHHHHHHHHHHHHHHHHH T ss_conf 377626799999999718970688679999878998987629899887607998841-----105789999999999986 Q ss_pred CCCCCCEEEEECH----HHHHHHHHCCCCCC-------CHH--HHHHHHHC Q ss_conf 8865406999750----44576874120000-------278--99999751 Q gi|254780146|r 70 TGVNVRVAVFATS----SKADEAREAGADIV-------GGE--DLFEIVKG 107 (232) Q Consensus 70 ~gk~~kV~Vf~~~----~~~~~Ak~aGa~~v-------g~~--eli~~i~~ 107 (232) .+.+..+.+..|| +.+.++.+||||++ +.+ |..+.|.. T Consensus 164 ~~~~~~~~ieVDGGv~~~ni~~~~~AGAD~~VaGSaiF~~~s~d~~~~i~~ 214 (216) T TIGR01163 164 LELGLSILIEVDGGVNEDNIAEVAEAGADILVAGSAIFGADSLDYKEAIRS 214 (216) T ss_pred HCCCCCEEEEECCCCCHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHH T ss_conf 027995589971798976799999758989998310208886687999973 No 37 >PRK10754 quinone oxidoreductase, NADPH-dependent; Provisional Probab=30.79 E-value=42 Score=14.89 Aligned_cols=63 Identities=24% Similarity=0.125 Sum_probs=36.4 Q ss_pred CEEEEE-CHHHHHHHHHCCCCCCC---HHHHHHHHHCCC--CCCCEEEECHHHHHHHHHHHHHHHHHHCC Q ss_conf 069997-50445768741200002---789999975175--33346886446778888643320010017 Q gi|254780146|r 75 RVAVFA-TSSKADEAREAGADIVG---GEDLFEIVKGGQ--IDFDRCIATPDMMPLVGRLGRILGPRGIM 138 (232) Q Consensus 75 kV~Vf~-~~~~~~~Ak~aGa~~vg---~~eli~~i~~~~--~~fd~~iAt~~~m~~i~klgriLGPkglM 138 (232) +|.+.+ +.+..+.+++.||+.+- .+++.+++++-. ..+|.++-+..- ..+...-+.|.|.|.+ T Consensus 167 ~Vi~t~~s~~k~e~~~~lGA~~vi~~~~~~~~~~i~~~t~g~gvdvv~D~vG~-~~~~~~~~~l~~~G~i 235 (327) T PRK10754 167 KLIGTVGSAQKAQRALKAGAWQVINYREENIVERVKEITGGKKVRVVYDSVGK-DTWEASLDCLQRRGLM 235 (327) T ss_pred EEEEEECCHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCCEEEEECCCH-HHHHHHHHHHHCCCEE T ss_conf 99999898999999996699999989999999999998689983699989888-9999999986349899 No 38 >COG1061 SSL2 DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair] Probab=30.52 E-value=42 Score=14.86 Aligned_cols=65 Identities=26% Similarity=0.358 Sum_probs=35.9 Q ss_pred CEEEEEECCCCCCCC-----------CEEEEEC-HHHHHHHHHCCCCCCCHHHHHHHHHCCCCCC---CEEEECHHHHHH Q ss_conf 013799527788654-----------0699975-0445768741200002789999975175333---468864467788 Q gi|254780146|r 60 VRGVVTMPNGTGVNV-----------RVAVFAT-SSKADEAREAGADIVGGEDLFEIVKGGQIDF---DRCIATPDMMPL 124 (232) Q Consensus 60 irg~v~LPh~~gk~~-----------kV~Vf~~-~~~~~~Ak~aGa~~vg~~eli~~i~~~~~~f---d~~iAt~~~m~~ 124 (232) =+|.+.||-|.||.. +++|+|. .+.++++.++-....+..+-+..+..+..++ ++.+||-+.+.. T Consensus 56 ~~gvivlpTGaGKT~va~~~~~~~~~~~Lvlv~~~~L~~Qw~~~~~~~~~~~~~~g~~~~~~~~~~~~~i~vat~qtl~~ 135 (442) T COG1061 56 RRGVIVLPTGAGKTVVAAEAIAELKRSTLVLVPTKELLDQWAEALKKFLLLNDEIGIYGGGEKELEPAKVTVATVQTLAR 135 (442) T ss_pred CCEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCEECCCCCCCCCCCEEEEEEHHHHH T ss_conf 86799967999889999999998269889997829999999999997348867660336872335777489998389764 No 39 >TIGR00381 cdhD CO dehydrogenase/acetyl-CoA synthase, delta subunit; InterPro: IPR004486 This is the small subunit of a heterodimer which catalyzes the reaction CO + H2O + Acceptor = CO2 + Reduced acceptor and is involved in the synthesis of acetyl-CoA from CO2 and H2.; GO: 0006730 one-carbon compound metabolic process. Probab=29.88 E-value=28 Score=16.01 Aligned_cols=113 Identities=19% Similarity=0.266 Sum_probs=59.2 Q ss_pred CHHHHHHHHHHCCCCCCCCCEEEEEEE-CCCCCCCCCCCEEEEEECCCCCCCCCE-EEEE-CH--HHHHHHHHCCCCCCC Q ss_conf 989999999860878998629999995-378764544301379952778865406-9997-50--445768741200002 Q gi|254780146|r 23 GLSDAVVMLKERATARFDETVEIAMNL-GIDPRHANQMVRGVVTMPNGTGVNVRV-AVFA-TS--SKADEAREAGADIVG 97 (232) Q Consensus 23 si~eAi~~lk~~~~~kf~esvel~i~L-~id~~k~~~~irg~v~LPh~~gk~~kV-~Vf~-~~--~~~~~Ak~aGa~~vg 97 (232) +-+=|-++.|++-. .- +.||| .-||+=.|..=++.+.+=--.=+-++| +|++ +| +.--+..|+.|+++- T Consensus 145 P~eWArKcVK~fGA----dm--vTiHlIsTdPk~~Dksp~EAaK~~EdvLQAVdvP~viGGSGnpeKDPlVLEkaAEvAE 218 (401) T TIGR00381 145 PAEWARKCVKEFGA----DM--VTIHLISTDPKVKDKSPKEAAKVLEDVLQAVDVPLVIGGSGNPEKDPLVLEKAAEVAE 218 (401) T ss_pred HHHHHHHHHHHHCC----CE--EEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHHHCC T ss_conf 33588888876276----63--8864433788546887224788999876340677577478888667578998843113 Q ss_pred HHHH----------HHHHHCCCCCCCEE-EECHHHHHHHHH-HHHHHHHHHCCCCC Q ss_conf 7899----------99975175333468-864467788886-43320010017422 Q gi|254780146|r 98 GEDL----------FEIVKGGQIDFDRC-IATPDMMPLVGR-LGRILGPRGIMPNL 141 (232) Q Consensus 98 ~~el----------i~~i~~~~~~fd~~-iAt~~~m~~i~k-lgriLGPkglMPn~ 141 (232) |+-. -++|.+...+||+. ||=.-|=....| |-|.|=.+|+||+- T Consensus 219 GeR~lLASAnLdlDy~kia~AA~ky~H~VLsW~~mD~N~qktLNRyLlk~gl~p~d 274 (401) T TIGR00381 219 GERCLLASANLDLDYKKIANAALKYDHVVLSWAIMDINMQKTLNRYLLKRGLKPRD 274 (401) T ss_pred CCHHHHHHCCCCCCHHHHHHHHHHCCCEEEEHHHCCCCHHHHHHHHHHHHHCCCCC T ss_conf 62164541351345789999985349807761112800144676999984148677 No 40 >PRK05347 glutaminyl-tRNA synthetase; Provisional Probab=28.70 E-value=46 Score=14.66 Aligned_cols=55 Identities=13% Similarity=0.137 Sum_probs=41.1 Q ss_pred CCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCEE-----EEEECCCC-CCCCC Q ss_conf 6889899999998608789986299999953787645443013-----79952778-86540 Q gi|254780146|r 20 ALYGLSDAVVMLKERATARFDETVEIAMNLGIDPRHANQMVRG-----VVTMPNGT-GVNVR 75 (232) Q Consensus 20 ~~ysi~eAi~~lk~~~~~kf~esvel~i~L~id~~k~~~~irg-----~v~LPh~~-gk~~k 75 (232) .--|++|.+++.+++..+.|.+- +..+++++|...++..+|. .+.-||+. |.+.+ T Consensus 149 R~~~~eenl~lf~~M~~G~~~eg-~~vlR~K~Dm~~~N~~mRDpv~yRi~~~~H~rtg~k~~ 209 (556) T PRK05347 149 RDRSVEENLDLFERMRAGEFPEG-SAVLRAKIDMASPNINMRDPVLYRIRHAHHHRTGDKWC 209 (556) T ss_pred CCCCHHHHHHHHHHHHCCCCCCC-EEEEEEEEECCCCCCCCCCCEEEEECCCCCCCCCCCEE T ss_conf 56998999999999866887786-28999985148999541245257842468888688226 No 41 >COG3259 FrhA Coenzyme F420-reducing hydrogenase, alpha subunit [Energy production and conversion] Probab=27.14 E-value=27 Score=16.11 Aligned_cols=103 Identities=21% Similarity=0.324 Sum_probs=53.7 Q ss_pred EEEEEECCCCCCCCCCCEEEEEECCCC-----CCC--------CCEEEEECHHHHHHHHHCCCCCCCH--HHHHHH---- Q ss_conf 999995378764544301379952778-----865--------4069997504457687412000027--899999---- Q gi|254780146|r 44 EIAMNLGIDPRHANQMVRGVVTMPNGT-----GVN--------VRVAVFATSSKADEAREAGADIVGG--EDLFEI---- 104 (232) Q Consensus 44 el~i~L~id~~k~~~~irg~v~LPh~~-----gk~--------~kV~Vf~~~~~~~~Ak~aGa~~vg~--~eli~~---- 104 (232) .+.+.++-+-...+. +-.+.=|.|| ||. .|||-+|+-.+.-.|.+|.-...|- +|-.++ T Consensus 18 kv~i~vdd~G~V~~~--~~~it~~RgfEk~~~Gkp~EeaP~i~~RICgiC~~sH~LAs~eA~e~a~gveip~~~~~LREl 95 (441) T COG3259 18 KVTIEVDDDGIVEDA--RFHITEVRGFEKFVLGKPIEEAPRIVPRICGICPASHHLASVEAAEAALGVEIPEEAEKLREL 95 (441) T ss_pred EEEEEECCCCCEEEE--EEEECCCCHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 799998688853335--788414511878853898677357764330057537788899999985287578289999999 Q ss_pred ------HHCCCCCCCEEEECHHH-----------------------HHHHHH-HHHHHHHHHCCC-CCCCCCCCCC Q ss_conf ------75175333468864467-----------------------788886-433200100174-2233566655 Q gi|254780146|r 105 ------VKGGQIDFDRCIATPDM-----------------------MPLVGR-LGRILGPRGIMP-NLRVGTVTTD 149 (232) Q Consensus 105 ------i~~~~~~fd~~iAt~~~-----------------------m~~i~k-lgriLGPkglMP-n~k~GTv~~d 149 (232) |.+-.+.| +++|-||+ +.+++. +-.+||-|+.-| |...|.+..+ T Consensus 96 ~~~g~~i~SHALh~-y~La~PDfl~~~d~~~~~i~~~~e~~k~~i~Lr~~gq~Iv~~lGGk~IHp~~~~pGG~~~~ 170 (441) T COG3259 96 LNIGNIIESHALHL-YFLAAPDFLGPPDADPGMIDEYPELAKEAIRLRKIGQTIVEVLGGKAIHPVNAVPGGVAKN 170 (441) T ss_pred HHHHHHHHHHHHHH-HHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 87777876647788-8860762247866672554442899999999999999999986687778876677744788 No 42 >TIGR03152 cyto_c552_HCOOH formate-dependent cytochrome c nitrite reductase, c552 subunit. Members of this protein family are cytochrome c552, a component of cytochrome c nitrite reductase, which is known more formally as nitrite reductase (cytochrome; ammonia-forming) (EC 1.7.2.2). Nitrate can be reduced by several enzymes. EC 1.7.2.2 reduces nitrite all the way to ammonia, rather than to ammonium hydroxide (nitrite reductase (NAD(P)H), EC 1.7.1.4) or nitric oxide (nitrite reductase (NO-forming), EC 1.7.2.1). Some examples of EC 1.7.2.2 occur in a seven gene system that enables formate-dependent nitrite reduction, but is also found in simpler contexts. Members of this protein family, however, belong to the formate-dependent system. Probab=26.86 E-value=41 Score=14.96 Aligned_cols=33 Identities=24% Similarity=0.311 Sum_probs=14.9 Q ss_pred HHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHH Q ss_conf 874120000278999997517533346886446 Q gi|254780146|r 88 AREAGADIVGGEDLFEIVKGGQIDFDRCIATPD 120 (232) Q Consensus 88 Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~ 120 (232) |.++||.-.-..+.++.|..++.-.||+.|+.+ T Consensus 320 a~~aga~~~em~~~l~~~R~AQwrwDf~~Aens 352 (439) T TIGR03152 320 AWDAGATEAEMKPILQDIRHAQWRWDYAIASHG 352 (439) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHHEEEEECCCC T ss_conf 997699999999999999856552033010256 No 43 >pfam10302 DUF2407 Putative membrane protein (DUF2407). This is a family of proteins found in fungi. The function is not known. There is a characteristic GFDRL sequence motif. Probab=26.59 E-value=47 Score=14.60 Aligned_cols=25 Identities=28% Similarity=0.539 Sum_probs=14.1 Q ss_pred CCCCEEEEC---HHHHHHHHH-HHHHHHH Q ss_conf 333468864---467788886-4332001 Q gi|254780146|r 110 IDFDRCIAT---PDMMPLVGR-LGRILGP 134 (232) Q Consensus 110 ~~fd~~iAt---~~~m~~i~k-lgriLGP 134 (232) .-||.+++. ++-...|.. +-.+.|. T Consensus 120 rGFDRLlsaGFs~~EV~~LR~QF~ai~~~ 148 (252) T pfam10302 120 RGFDRLLSAGFSPAEVSLLRSQFRAIYGS 148 (252) T ss_pred CCHHHHHHCCCCHHHHHHHHHHHHHHHCC T ss_conf 44878987589999999999999987156 No 44 >cd04733 OYE_like_2_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 2. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=26.21 E-value=51 Score=14.39 Aligned_cols=14 Identities=50% Similarity=0.394 Sum_probs=7.9 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 44576874120000 Q gi|254780146|r 83 SKADEAREAGADIV 96 (232) Q Consensus 83 ~~~~~Ak~aGa~~v 96 (232) +.+..|++||.|.| T Consensus 153 ~AA~rA~~AGfDgV 166 (338) T cd04733 153 HAARLAQEAGFDGV 166 (338) T ss_pred HHHHHHHHCCCCEE T ss_conf 99999998399989 No 45 >cd01820 PAF_acetylesterase_like PAF_acetylhydrolase (PAF-AH)_like subfamily of SGNH-hydrolases. Platelet-activating factor (PAF) and PAF-AH are key players in inflammation and in atherosclerosis. PAF-AH is a calcium independent phospholipase A2 which exhibits strong substrate specificity towards PAF, hydrolyzing an acetyl ester at the sn-2 position. PAF-AH also degrades a family of oxidized PAF-like phospholipids with short sn-2 residues. In addition, PAF and PAF-AH are associated with neural migration and mammalian reproduction. Probab=25.76 E-value=51 Score=14.34 Aligned_cols=63 Identities=17% Similarity=0.285 Sum_probs=36.7 Q ss_pred CCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEECCC--CCCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 3566655799999986011102235752678753157--78989999999999999986086625 Q gi|254780146|r 143 VGTVTTDVATAVRESKSGAVDFRSEKAGIIHAGIGKV--SFENKKIEENVLAFVSAVVKAKPSVA 205 (232) Q Consensus 143 ~GTv~~di~~~i~~~k~g~i~~r~~k~g~i~~~IG~~--~m~~e~i~eNi~~~~~~i~~~kp~~~ 205 (232) .|-=++-....+-++.+|.+..-.=+.-++.+..=++ +.+.+++.+|+.++++.|.+..|... T Consensus 66 rGigGd~t~~vL~Rl~~~~l~~~~Pk~Vvl~iGtNDl~~~~~~~~i~~~i~~iv~~ir~~~P~t~ 130 (214) T cd01820 66 FGIGGDRTQNVLWRLENGELDGVNPKVVVLLIGTNNIGHTTTAEEIAEGILAIVEEIREKLPNAK 130 (214) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 25160227779998523100358998899993235546898999999999999999998789987 No 46 >TIGR01182 eda 2-dehydro-3-deoxyphosphogluconate aldolase/4-hydroxy-2-oxoglutarate aldolase; InterPro: IPR000887 4-Hydroxy-2-oxoglutarate aldolase (4.1.3.16 from EC) (KHG-aldolase) catalyzes the interconversion of 4-hydroxy-2-oxoglutarate into pyruvate and glyoxylate. Phospho-2-dehydro-3-deoxygluconate aldolase (4.1.2.14 from EC) (KDPG-aldolase) catalyzes the interconversion of 6-phospho-2-dehydro-3-deoxy-D-gluconate into pyruvate and glyceraldehyde 3-phosphate. These two enzymes are structurally and functionally related . They are both homotrimeric proteins of approximately 220 amino-acid residues. They are class I aldolases whose catalytic mechanism involves the formation of a Schiff-base intermediate between the substrate and the epsilon-amino group of a lysine residue. In both enzymes, an arginine is required for catalytic activity.; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=25.60 E-value=52 Score=14.32 Aligned_cols=79 Identities=18% Similarity=0.196 Sum_probs=44.1 Q ss_pred CCHHHHHHHHHHCC-CCCCCCCEEEEEEECCCCCCCCCCCEE-EEEECCCCCCCCCEEEE--ECHHHHHHHHHCCCCCCC Q ss_conf 89899999998608-789986299999953787645443013-79952778865406999--750445768741200002 Q gi|254780146|r 22 YGLSDAVVMLKERA-TARFDETVEIAMNLGIDPRHANQMVRG-VVTMPNGTGVNVRVAVF--ATSSKADEAREAGADIVG 97 (232) Q Consensus 22 ysi~eAi~~lk~~~-~~kf~esvel~i~L~id~~k~~~~irg-~v~LPh~~gk~~kV~Vf--~~~~~~~~Ak~aGa~~vg 97 (232) -++++|+.+.+..- .+ ...+|+.++.. .+...||- .-..| ++..|.+= -+.++..+|.+|||+++= T Consensus 17 ~~~~~A~~lA~aL~egG--~~~~EvTlRT~----~A~~aI~~l~~~~P----~~~~iGAGTVL~~~Q~~~A~~AGA~F~v 86 (205) T TIGR01182 17 DDVEDALPLAKALIEGG--LRVLEVTLRTP----VALEAIRALRKEVP----KDALIGAGTVLNPEQLRQAVAAGAQFIV 86 (205) T ss_pred CCHHHHHHHHHHHHHCC--CEEEEEEECCC----CHHHHHHHHHHHCC----CCCEECCCCCCCHHHHHHHHHCCCCEEE T ss_conf 67877789999998679--80898851472----16899999997282----3348716764898999999970895787 Q ss_pred HH----HHHHHHHCCCC Q ss_conf 78----99999751753 Q gi|254780146|r 98 GE----DLFEIVKGGQI 110 (232) Q Consensus 98 ~~----eli~~i~~~~~ 110 (232) ++ +|++--++..+ T Consensus 87 SPG~~p~l~~~~~~~~~ 103 (205) T TIGR01182 87 SPGLTPELAKHAKDKGI 103 (205) T ss_pred CCCCCHHHHHHHHHCCC T ss_conf 69788899999850888 No 47 >TIGR00838 argH argininosuccinate lyase; InterPro: IPR009049 This entry represents argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens. ; GO: 0004056 argininosuccinate lyase activity, 0042450 arginine biosynthetic process via ornithine. Probab=25.40 E-value=37 Score=15.20 Aligned_cols=21 Identities=24% Similarity=0.118 Sum_probs=8.9 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 17422335666557999999860 Q gi|254780146|r 137 IMPNLRVGTVTTDVATAVRESKS 159 (232) Q Consensus 137 lMPn~k~GTv~~di~~~i~~~k~ 159 (232) +|=+-++-. |.++.-++-+.+ T Consensus 101 klHTGRSRN--DQVATDlrl~lr 121 (469) T TIGR00838 101 KLHTGRSRN--DQVATDLRLYLR 121 (469) T ss_pred CCCCCCCCC--CHHHHHHHHHHH T ss_conf 124678764--178998999999 No 48 >PRK10537 voltage-gated potassium channel; Provisional Probab=25.20 E-value=53 Score=14.27 Aligned_cols=26 Identities=8% Similarity=0.282 Sum_probs=10.8 Q ss_pred CCCCCEEEEECH-HHHHHHHHCCCCCC Q ss_conf 865406999750-44576874120000 Q gi|254780146|r 71 GVNVRVAVFATS-SKADEAREAGADIV 96 (232) Q Consensus 71 gk~~kV~Vf~~~-~~~~~Ak~aGa~~v 96 (232) +++.+|.+-++. +..+..+.||||.| T Consensus 291 nP~v~IVA~a~~~en~~KlrrAGAD~V 317 (356) T PRK10537 291 SSDVKTVAAVNDSKNLEKIKRVHPDMI 317 (356) T ss_pred CCCCEEEEEECCHHHHHHHHHHCCCEE T ss_conf 899769999676757999986289999 No 49 >pfam08899 DUF1844 Domain of unknown function (DUF1844). This family of proteins are functionally uncharacterized. Probab=25.14 E-value=40 Score=14.99 Aligned_cols=50 Identities=20% Similarity=0.310 Sum_probs=33.4 Q ss_pred HHCCCCCCCCCCCCCHHHHHHHH--HCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 00174223356665579999998--6011102235752678753157789899999999999 Q gi|254780146|r 135 RGIMPNLRVGTVTTDVATAVRES--KSGAVDFRSEKAGIIHAGIGKVSFENKKIEENVLAFV 194 (232) Q Consensus 135 kglMPn~k~GTv~~di~~~i~~~--k~g~i~~r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~ 194 (232) -|++|+|-.|.+..|+..+=..+ .. .++=|+ =|+++-..+++.+|+.+=+ T Consensus 19 LG~~~~P~tg~~~~nl~~Ak~~ID~L~-mLqeKT---------kGNLt~~E~~lL~~~l~eL 70 (74) T pfam08899 19 LGEIPDPETGETRKDLPLAKHTIDLLS-MLQEKT---------KGNLTADEARLLENLLYDL 70 (74) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHH-HHHHHH---------CCCCCHHHHHHHHHHHHHH T ss_conf 689999998882569999999989999-999985---------3699999999999999997 No 50 >PRK01189 V-type ATP synthase subunit F; Provisional Probab=24.99 E-value=50 Score=14.41 Aligned_cols=58 Identities=14% Similarity=0.048 Sum_probs=37.7 Q ss_pred CEEEEECHHHHHHHHHCCCC---CCCHHHHHHHHHCCCC--CCCEEEECHHHHHHHHH-HHHHH Q ss_conf 06999750445768741200---0027899999751753--33468864467788886-43320 Q gi|254780146|r 75 RVAVFATSSKADEAREAGAD---IVGGEDLFEIVKGGQI--DFDRCIATPDMMPLVGR-LGRIL 132 (232) Q Consensus 75 kV~Vf~~~~~~~~Ak~aGa~---~vg~~eli~~i~~~~~--~fd~~iAt~~~m~~i~k-lgriL 132 (232) .|||.++.+...--+=+|.. .+.++++.+++.+... +|...+++.+++..+.+ .-+.+ T Consensus 4 ~IaViG~re~VlGFrL~GI~~t~~a~~~~~~~~i~~l~~~~~~~iIi~se~~~~~l~~~~~~~~ 67 (104) T PRK01189 4 CITVIGERDVVLGFRLLGIGDTIEAEGKDLVKKFLEIFNNPKCKYIFVSESTKNMFDKNTLRSL 67 (104) T ss_pred CEEEECCCCEEEEEEEECCCCEEEECCHHHHHHHHHHHCCCCCCEEEEEHHHHHHHCHHHHHHH T ss_conf 0899838432777787335325770687799999998638984489986377655388889887 No 51 >TIGR01242 26Sp45 26S proteasome subunit P45 family; InterPro: IPR005937 Intracellular proteins, including short-lived proteins such as cyclin, Mos, Myc, p53, NF-kappaB, and IkappaB, are degraded by the ubiquitin-proteasome system. The 26S proteasome (a 2 MDa complex) is made up of two subcomplexes: the 20S proteasome and the regulatory complex. The former is a 700 kDa cylindrical protease complex consisting of four stacks of heptameric rings with 28 subunits (i.e., 7777) with molecular masses of about 20-35 kDa, whereas the latter is a 700-1000 kDa complex consisting of at least 18 subunits with molecular masses of 28-110 kDa, including 6 putative ATPases (Rpt1-Rpt6) and 12 non-ATPase subunits (Rpn1-12). Members of the 26S proteasome subunit P45 family: ATPase p45/Sug1/Rpt6 may be phosphorylated within the proteasome. This phosphorylation event may play a key role in ATP-dependent proteolysis because a good correlation exists between the inhibition pattern of protein kinase inhibitors against the phosphorylation of p45 and that against the ATP-dependent proteolytic activity , . More information about these protein can be found at Protein of the Month: AAA ATPases .; GO: 0016787 hydrolase activity, 0030163 protein catabolic process, 0005634 nucleus, 0005737 cytoplasm. Probab=24.28 E-value=49 Score=14.47 Aligned_cols=10 Identities=20% Similarity=0.577 Sum_probs=4.0 Q ss_pred EECHHHHHHH Q ss_conf 8644677888 Q gi|254780146|r 116 IATPDMMPLV 125 (232) Q Consensus 116 iAt~~~m~~i 125 (232) |-+|++..++ T Consensus 142 lk~PeLF~~v 151 (364) T TIGR01242 142 LKKPELFEEV 151 (364) T ss_pred CCCCHHHHHC T ss_conf 8883167762 No 52 >pfam05918 API5 Apoptosis inhibitory protein 5 (API5). This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterized by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in human chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors. Probab=23.81 E-value=56 Score=14.11 Aligned_cols=22 Identities=23% Similarity=0.439 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999998608662544 Q gi|254780146|r 185 KIEENVLAFVSAVVKAKPSVAKG 207 (232) Q Consensus 185 ~i~eNi~~~~~~i~~~kp~~~kg 207 (232) ....||.+++..+... |+..|| T Consensus 432 kt~nNI~~likdLfh~-pPsfk~ 453 (543) T pfam05918 432 RTCNNILAMTKDLFHS-PPIFKH 453 (543) T ss_pred HHHHHHHHHHHHHCCC-CCCCCC T ss_conf 9985299999886039-985467 No 53 >COG4844 Uncharacterized protein conserved in bacteria [Function unknown] Probab=23.48 E-value=57 Score=14.07 Aligned_cols=17 Identities=18% Similarity=0.397 Sum_probs=13.1 Q ss_pred CCCHHHHHHHHHHHHHH Q ss_conf 78989999999999999 Q gi|254780146|r 180 SFENKKIEENVLAFVSA 196 (232) Q Consensus 180 ~m~~e~i~eNi~~~~~~ 196 (232) .-+++||.+||-++++. T Consensus 58 Get~eeLv~NIY~~i~E 74 (78) T COG4844 58 GETPEELVENIYTFIEE 74 (78) T ss_pred CCCHHHHHHHHHHHHHC T ss_conf 89989999999998740 No 54 >pfam07021 MetW Methionine biosynthesis protein MetW. This family consists of several bacterial and one archaeal methionine biosynthesis MetW proteins. Biosynthesis of methionine from homoserine in Pseudomonas putida takes place in three steps. The first step is the acylation of homoserine to yield an acyl-L-homoserine. This reaction is catalysed by the products of the metXW genes and is equivalent to the first step in enterobacteria, gram-positive bacteria and fungi, except that in these microorganisms the reaction is catalysed by a single polypeptide (the product of the metA gene in Escherichia coli and the met5 gene product in Neurospora crassa). In Pseudomonas putida, as in gram-positive bacteria and certain fungi, the second and third steps are a direct sulfhydrylation that converts the O-acyl-L-homoserine into homocysteine and further methylation to yield methionine. The latter reaction can be mediated by either of the two methionine synthetases present in the cells. Probab=22.86 E-value=59 Score=13.99 Aligned_cols=40 Identities=15% Similarity=0.253 Sum_probs=19.0 Q ss_pred CHHHHHHHHHCCCCCCCHHHHHHHHHC-CCCCCCEEEECHHH Q ss_conf 504457687412000027899999751-75333468864467 Q gi|254780146|r 81 TSSKADEAREAGADIVGGEDLFEIVKG-GQIDFDRCIATPDM 121 (232) Q Consensus 81 ~~~~~~~Ak~aGa~~vg~~eli~~i~~-~~~~fd~~iAt~~~ 121 (232) +.+.+.+|.+.|..++-++ +-+.... ..-.||++|++..+ T Consensus 45 d~~~v~~a~~kg~~Vi~~D-~d~~l~~f~d~sFD~VIls~vL 85 (193) T pfam07021 45 DAAGVAECVAKGLSVIQGD-ADKGLEHFPDKSFDYVILSQTL 85 (193) T ss_pred CHHHHHHHHHCCCCEECCC-HHHCHHHCCCCCCCEEEHHHHH T ss_conf 9999999986479545077-4459745776780378698799 No 55 >PRK08306 dipicolinate synthase subunit A; Reviewed Probab=22.54 E-value=59 Score=13.95 Aligned_cols=42 Identities=29% Similarity=0.161 Sum_probs=29.1 Q ss_pred CEEEEEC-HHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHH Q ss_conf 0699975-044576874120000278999997517533346886446 Q gi|254780146|r 75 RVAVFAT-SSKADEAREAGADIVGGEDLFEIVKGGQIDFDRCIATPD 120 (232) Q Consensus 75 kV~Vf~~-~~~~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~ 120 (232) +|.|.+. .+....|.+.|...+.-++|-+.+. +||+++.|-. T Consensus 177 ~V~V~aR~~~~~a~a~~~G~~~v~~~~L~~~~~----~~D~i~NTIP 219 (296) T PRK08306 177 KVKVGARKSAHLARITEMGYSPVHLSELAEEVG----EADIIFNTIP 219 (296) T ss_pred EEEEEECCHHHHHHHHHCCCEEEEHHHHHHHHC----CCCEEEECCC T ss_conf 699997999999999976985871999997735----6999998985 No 56 >PRK06512 thiamine-phosphate pyrophosphorylase; Provisional Probab=22.22 E-value=60 Score=13.91 Aligned_cols=29 Identities=17% Similarity=-0.074 Sum_probs=15.3 Q ss_pred CCCCCCEEEEEC--HHHHHHHHHCCCCCCCH Q ss_conf 886540699975--04457687412000027 Q gi|254780146|r 70 TGVNVRVAVFAT--SSKADEAREAGADIVGG 98 (232) Q Consensus 70 ~gk~~kV~Vf~~--~~~~~~Ak~aGa~~vg~ 98 (232) .|.+.-|-+.|- -+.+.+|.++||||++- T Consensus 107 lg~~~IIG~~~~~s~~~A~~A~e~GADYv~f 137 (221) T PRK06512 107 HAPKMIVGFGNLRDRHGAMEVGELQPDYLFF 137 (221) T ss_pred HCCCCEEEEECCCCHHHHHHHHHCCCCEEEE T ss_conf 4788678640578899999999739985765 No 57 >TIGR00603 rad25 DNA repair helicase rad25; InterPro: IPR001161 Xeroderma pigmentosum (XP) is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair , . XP-G can be corrected by a 133 Kd nuclear protein, XPGC . XPGC is an acidic protein that confers normal UV resistance in expressing cells . It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms , . XPGC cleaves one strand of the duplex at the border with the single-stranded region . XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases , ; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C-terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents XP group B (XP-B) give rise to both XP and Cockayne syndrome . The DNA/RNA helicase domain IPR001650 from INTERPRO is also present in this group of proteins.; GO: 0003677 DNA binding, 0004003 ATP-dependent DNA helicase activity, 0005524 ATP binding, 0006289 nucleotide-excision repair, 0005634 nucleus. Probab=22.08 E-value=61 Score=13.90 Aligned_cols=27 Identities=19% Similarity=0.294 Sum_probs=14.1 Q ss_pred CCCCEEEEECH--HHHHHHHHCCCCCCCH Q ss_conf 65406999750--4457687412000027 Q gi|254780146|r 72 VNVRVAVFATS--SKADEAREAGADIVGG 98 (232) Q Consensus 72 k~~kV~Vf~~~--~~~~~Ak~aGa~~vg~ 98 (232) +.=||.||+|. ...+-|.+.|=.++-| T Consensus 512 RgDKIIVFsDNVfAL~~YA~kl~KpfIYG 540 (756) T TIGR00603 512 RGDKIIVFSDNVFALKEYAIKLGKPFIYG 540 (756) T ss_pred CCCEEEEECCHHHHHHHHHHHCCCCEEEC T ss_conf 88858994244789999998738965407 No 58 >pfam10740 DUF2529 Protein of unknown function (DUF2529). This family is conserved in the Bacillales. The function is not known. Several members are annotated as being YWJG, a protein expressed downstream of pyrG, a gene encoding for cytidine triphosphate synthetase. Probab=22.07 E-value=31 Score=15.73 Aligned_cols=125 Identities=17% Similarity=0.177 Sum_probs=66.6 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEECCC-----CCCCCCCCEEEEEECCC------ Q ss_conf 9742234379898513847688989999999860878998629999995378-----76454430137995277------ Q gi|254780146|r 1 MVSKISKRMQQISQGIDRSALYGLSDAVVMLKERATARFDETVEIAMNLGID-----PRHANQMVRGVVTMPNG------ 69 (232) Q Consensus 1 mm~k~~Kr~~~~~~~~~~~~~ysi~eAi~~lk~~~~~kf~esvel~i~L~id-----~~k~~~~irg~v~LPh~------ 69 (232) |+|-++-...-+.+.+......++++|-++|-..-.+. -+|.+.=.=.+. .-.+..++.+.-.||.- T Consensus 1 MlKIftTQL~Gif~rI~ekee~~iEdaARlLAQAiige--G~VYi~G~~Em~~v~~eal~g~E~l~~~k~l~~~~~~~~~ 78 (172) T pfam10740 1 MLKIFTTQLTGIFNRITEKEELAIEDGARLLAQAIIGE--GNVYIKGFNEMEAVELEALEGVEPLPSAKPLPLDATEFAD 78 (172) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--CEEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC T ss_conf 91488889999998842667876999999999997269--7799972546998999997287767544337555665100 Q ss_pred CCCCCCEEEEECHH----H---HHHHHHCCCCCCCHHHHHHHHHCCCCC-CCEEEECHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 88654069997504----4---576874120000278999997517533-346886446778888643320010017422 Q gi|254780146|r 70 TGVNVRVAVFATSS----K---ADEAREAGADIVGGEDLFEIVKGGQID-FDRCIATPDMMPLVGRLGRILGPRGIMPNL 141 (232) Q Consensus 70 ~gk~~kV~Vf~~~~----~---~~~Ak~aGa~~vg~~eli~~i~~~~~~-fd~~iAt~~~m~~i~klgriLGPkglMPn~ 141 (232) ....-||++|+... . ++...+.|.++|+-.... +=..++.+ .|++|.+. . |||++|+- T Consensus 79 ~t~tDRVLlfs~~~~d~e~~~~a~~L~~~gi~~v~Vs~~~-~~~~~l~~~~~vhId~~-----------l--~~~LvP~E 144 (172) T pfam10740 79 ITETDRVLLFSRFSNDEEAVALAKKLYDQGIDVVAVSPNA-KEGDSLADHADVHIDLK-----------L--KRPLVPDE 144 (172) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEECCC-CCCCCCHHHEEEEEECC-----------C--CCCCCCCC T ss_conf 7755769997699997899999999998699889981688-88876144402667647-----------7--87866188 No 59 >PRK05396 tdh L-threonine 3-dehydrogenase; Validated Probab=21.92 E-value=61 Score=13.88 Aligned_cols=50 Identities=22% Similarity=0.258 Sum_probs=29.5 Q ss_pred CEEEEE-CHHHHHHHHHCCCCCCC---HHHHHHHHHCCC--CCCCEEEEC---HHHHHH Q ss_conf 069997-50445768741200002---789999975175--333468864---467788 Q gi|254780146|r 75 RVAVFA-TSSKADEAREAGADIVG---GEDLFEIVKGGQ--IDFDRCIAT---PDMMPL 124 (232) Q Consensus 75 kV~Vf~-~~~~~~~Ak~aGa~~vg---~~eli~~i~~~~--~~fd~~iAt---~~~m~~ 124 (232) +|.++. ..+..+.|++.||+.+- .+++.+.+.+-. .-+|.++-. ++.+.+ T Consensus 190 ~Vi~~d~~~~kl~~a~~lGa~~~i~~~~~~~~~~~~~~t~~~G~Dvvid~~G~~~~~~~ 248 (341) T PRK05396 190 HVVITDVNEYRLELARKMGATRAVNVAKEDLRDVMAELGMTEGFDVGLEMSGAPSAFRQ 248 (341) T ss_pred EEEEEECCHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCCCCCEEEEECCCCHHHHHH T ss_conf 89999489999998986499499968850689999997489997699987898999999 No 60 >PRK09310 aroDE bifunctional 3-dehydroquinate dehydratase/shikimate dehydrogenase protein; Reviewed Probab=21.77 E-value=62 Score=13.86 Aligned_cols=52 Identities=19% Similarity=0.202 Sum_probs=30.1 Q ss_pred HHHHHHHHHHHHHHHCC--------CCC----CCCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEECC Q ss_conf 78888643320010017--------422----3356665579999998601110223575267875315 Q gi|254780146|r 122 MPLVGRLGRILGPRGIM--------PNL----RVGTVTTDVATAVRESKSGAVDFRSEKAGIIHAGIGK 178 (232) Q Consensus 122 m~~i~klgriLGPkglM--------Pn~----k~GTv~~di~~~i~~~k~g~i~~r~~k~g~i~~~IG~ 178 (232) |+..|++.|+|+|+ | |+. -+|.++ +.++. .++-+.+.-.++-.|+|.-||+. T Consensus 164 MGe~G~lSRiL~~~--~g~~~t~a~~~~~~~~aPGQ~t--~~~l~-~y~~~~~~~~tk~fgviG~PI~H 227 (477) T PRK09310 164 MGNHGIPSRILSPL--LQNAFNYAAGIGAPPVAPGQLS--LEHLL-FYNYANLSAQSHIYGLIGDPVDR 227 (477) T ss_pred CCCCCEEEEECCCC--CCCCCCCCCCCCCCCCCCCCCC--HHHHH-HHHHCCCCCCCCEEEEECCCHHH T ss_conf 17787021000022--2676412455566668988777--99999-87760568567489998176231 No 61 >COG2089 SpsE Sialic acid synthase [Cell envelope biogenesis, outer membrane] Probab=21.75 E-value=62 Score=13.85 Aligned_cols=12 Identities=17% Similarity=0.232 Sum_probs=6.4 Q ss_pred CHHHHHHHHHHC Q ss_conf 989999999860 Q gi|254780146|r 23 GLSDAVVMLKER 34 (232) Q Consensus 23 si~eAi~~lk~~ 34 (232) +++-|.++++.. T Consensus 28 ~le~A~~lIdaA 39 (347) T COG2089 28 DLERAKELIDAA 39 (347) T ss_pred CHHHHHHHHHHH T ss_conf 589999999999 No 62 >TIGR02407 ectoine_ectB diaminobutyrate--2-oxoglutarate aminotransferase; InterPro: IPR012773 Members of this family of class III pyridoxal-phosphate-dependent aminotransferases are diaminobutyrate--2-oxoglutarate aminotransferase (2.6.1.76 from EC) that catalyze the first step in ectoine biosynthesis from L-aspartate beta-semialdehyde. This family is readily separated phylogenetically from enzymes with the same substrate and product but involved in other process such as siderophore (Q9Z3R2 from SWISSPROT) or 1,3-diaminopropane (P44951 from SWISSPROT) biosynthesis. Ectoine is a compatible solute particularly effective in conferring salt tolerance.; GO: 0030170 pyridoxal phosphate binding, 0047307 diaminobutyrate-pyruvate transaminase activity, 0019491 ectoine biosynthetic process. Probab=21.29 E-value=57 Score=14.06 Aligned_cols=46 Identities=13% Similarity=0.204 Sum_probs=20.6 Q ss_pred CHHHHHHHHHHCCCCC----CCCCE------EEEEEECCCCCCCC-CCCEEEEEECC Q ss_conf 9899999998608789----98629------99999537876454-43013799527 Q gi|254780146|r 23 GLSDAVVMLKERATAR----FDETV------EIAMNLGIDPRHAN-QMVRGVVTMPN 68 (232) Q Consensus 23 si~eAi~~lk~~~~~k----f~esv------el~i~L~id~~k~~-~~irg~v~LPh 68 (232) .+|=||++.|+...|. |++.. =|++.=|=..++.. -+...++.||+ T Consensus 111 aVEsAlKLARKvtGR~nvvaFTNaFHGmt~GsLsvTGN~~~R~~aG~~~~~v~~mPY 167 (413) T TIGR02407 111 AVESALKLARKVTGRSNVVAFTNAFHGMTLGSLSVTGNRFKRQGAGVPLSNVSRMPY 167 (413) T ss_pred HHHHHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCC T ss_conf 799999998503898406885177302348988864334443026744778256876 No 63 >KOG1500 consensus Probab=21.03 E-value=64 Score=13.76 Aligned_cols=30 Identities=30% Similarity=0.421 Sum_probs=19.5 Q ss_pred CCCEEEECHH--------HHHHHHHHHHHHHHHHCC-CC Q ss_conf 3346886446--------778888643320010017-42 Q gi|254780146|r 111 DFDRCIATPD--------MMPLVGRLGRILGPRGIM-PN 140 (232) Q Consensus 111 ~fd~~iAt~~--------~m~~i~klgriLGPkglM-Pn 140 (232) +.|.+|++|- |+-.--.-.|.|.|.|+| |+ T Consensus 243 k~DviISEPMG~mL~NERMLEsYl~Ark~l~P~GkMfPT 281 (517) T KOG1500 243 KVDVIISEPMGYMLVNERMLESYLHARKWLKPNGKMFPT 281 (517) T ss_pred CCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 034787256214111088899999998742877744675 No 64 >TIGR01452 PGP_euk phosphoglycolate/pyridoxal phosphate phosphatase family; InterPro: IPR006349 This family of sequences represent 2-phosphoglycolate phosphatase which is limited to the eukaryotic lineage. PGP is an essential enzyme in the glycolate salvage pathway in higher organisms (photorespiration in plants). Phosphoglycolate results from the oxidase activity of RubisCO in the Calvin cycle when concentrations of carbon dioxide are low relative to oxygen. In mammals, PGP is found in many tissues, notably in red blood cells where P-glycolate is and important activator of the hydrolysis of 2,3-bisphosphoglycerate, a major modifier of the oxygen affinity of hemoglobin. The PGPase enzyme described here is a member of the Haloacid dehalogenase superfamily of hydrolase enzymes. Unlike the bacterial PGP equivalog (IPR006346 from INTERPRO), which is a member of class (subfamily) I, this enzyme is a member of class (subfamily) II . These two enzymes have almost certainly arisen from convergent evolution (although these two ancestors may themselves have diverged from a more distant HAD superfamily progenitor). One of the sequences in the group comes from Chlamydomonas reinhardtii, a photosynthetic alga. The enzyme has been purified and characterised and these data are fully consistent with the assignment of function as a PGPase involved in photorespiration . Another, from Homo sapiens, is supported as a member of the family indirectly. Biochemical characterization of partially purified PGP's from various tissues including red blood cells have been performed while one gene for PGP has been localized to chromosome 16p13.3 . The Homo sapiens sequence maps to chromosome 22. There is indeed a related gene on chromosome 16 (and it is expressed, since EST's are found) which shows 46 0dentity and 59603b08ositives by BLAST2 (E=1e-66). Presumably, these two genes are isoforms and have the same catalytic function while being expressed in different tissues and may be differently regulated. The sequence from Caenorhabditis elegans, is only supported by sequence similarity. This family is closely related to a family of bacterial sequences including the Escherichia coli NagD and Bacillus subtilus AraL genes which are characterised by the ability to hydrolyze para-nitrophenylphosphate (pNPPases or NPPases). The Chlamydomonas reinhardtii PGPase does not catalyze this reaction and so presumably these two groups have different functions and substrate specificities. Many of the genes in this family have been annotated as pNPPases due to this association.; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=20.79 E-value=64 Score=13.74 Aligned_cols=38 Identities=24% Similarity=0.319 Sum_probs=29.9 Q ss_pred EECCCCCCCCCEEEEECHHHHHHHHHCCCCCCCHHHHH Q ss_conf 95277886540699975044576874120000278999 Q gi|254780146|r 65 TMPNGTGVNVRVAVFATSSKADEAREAGADIVGGEDLF 102 (232) Q Consensus 65 ~LPh~~gk~~kV~Vf~~~~~~~~Ak~aGa~~vg~~eli 102 (232) .||-+..-+.+|.|+.+.-..+|-..+|..++|+.--- T Consensus 81 rlPG~~~~~k~VyviGe~Glr~Eld~~Gir~~G~p~~~ 118 (288) T TIGR01452 81 RLPGAIDANKKVYVIGEEGLREELDEKGIRYVGDPSEK 118 (288) T ss_pred HCCCCCCCCCEEEEECCCCHHHHHHHHCCHHCCCCCCC T ss_conf 37788888862899777026889987160211788601 No 65 >TIGR01211 ELP3 histone acetyltransferase, ELP3 family; InterPro: IPR005910 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones . Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. The yeast GCN5 (yGCN5) transcriptional coactivator functions as a histone acetyltransferase (HAT) to promote transcriptional activation. The crystal structure of the yeast histone acetyltransferase Hat1-acetyl coenzyme A (AcCoA) shows that Hat1 has an elongated, curved structure, and the AcCoA molecule is bound in a cleft on the concave surface of the protein, marking the active site of the enzyme. A channel of variable width and depth that runs across the protein is probably the binding site for the histone substrate . The central protein core associated with AcCoA binding that appears to be structurally conserved among a superfamily of N-acetyltransferases, including yeast histone acetyltransferase 1 and Serratia marcescens aminoglycoside 3-N-acetyltransferase . The Saccharomyces cerevisiae member YPL086C has been characterised in vitro as an N-terminal acetyltransferase (2.3.1.48 from EC) for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases. In vivo, ELP3 gene deletion confers typical ELP phenotypes such as slow growth adaptation, slow gene activation, and temperature sensitivity. This suggests a role for the proteins as novel, tightly RNAPII-associated histone acetyltransferases in transcription of DNA packaged in chromatin .. Probab=20.67 E-value=48 Score=14.55 Aligned_cols=31 Identities=13% Similarity=0.396 Sum_probs=16.8 Q ss_pred CCCCCCCCEEEEE---------ECCCCCCCCCEEEEECHHH Q ss_conf 7645443013799---------5277886540699975044 Q gi|254780146|r 53 PRHANQMVRGVVT---------MPNGTGVNVRVAVFATSSK 84 (232) Q Consensus 53 ~~k~~~~irg~v~---------LPh~~gk~~kV~Vf~~~~~ 84 (232) .+||..-|+|+.+ =|||. ..-.||+||.|-. T Consensus 57 r~KPvRTiSGVAVVAvMTsP~~CPHGk-ytGniC~yCPGG~ 96 (573) T TIGR01211 57 RKKPVRTISGVAVVAVMTSPERCPHGK-YTGNICLYCPGGP 96 (573) T ss_pred CCCCCEEECCCCCEEEECCCCCCCCCC-CCCCEEEECCCCC T ss_conf 169960115752001226216888854-2178764268788 No 66 >TIGR02437 FadB fatty oxidation complex, alpha subunit FadB; InterPro: IPR012799 Members of this family represent the alpha subunit of the multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (4.2.1.17 from EC), dodecenoyl-CoA delta-isomerase activity (5.3.3.8 from EC), 3-hydroxyacyl-CoA dehydrogenase (1.1.1.35 from EC) and 3-hydroxybutyryl-CoA epimerase (5.1.2.3 from EC). A representative is Escherichia coli FadB (P21177 from SWISSPROT). This entry excludes the FadJ family represented by P77399 from SWISSPROT.; GO: 0003857 3-hydroxyacyl-CoA dehydrogenase activity, 0004165 dodecenoyl-CoA delta-isomerase activity, 0004300 enoyl-CoA hydratase activity, 0008692 3-hydroxybutyryl-CoA epimerase activity, 0009062 fatty acid catabolic process, 0016507 fatty acid beta-oxidation multienzyme complex. Probab=20.61 E-value=38 Score=15.14 Aligned_cols=137 Identities=16% Similarity=0.290 Sum_probs=79.6 Q ss_pred CCEEEEEECCCCCCCCCEEEEECH--HHHHHHHHCCCC--CCCHHH-------HHHHHHCCCCCCC----------EEEE Q ss_conf 301379952778865406999750--445768741200--002789-------9999751753334----------6886 Q gi|254780146|r 59 MVRGVVTMPNGTGVNVRVAVFATS--SKADEAREAGAD--IVGGED-------LFEIVKGGQIDFD----------RCIA 117 (232) Q Consensus 59 ~irg~v~LPh~~gk~~kV~Vf~~~--~~~~~Ak~aGa~--~vg~~e-------li~~i~~~~~~fd----------~~iA 117 (232) -|-|+|+||.-.|-+.-+=.++.| .-+++|.+-|+- +|..+. +++.-.++++||. .+=. T Consensus 148 GFGG~VRlPR~iGaD~A~E~I~~Gk~~rAe~ALKvg~vDaVV~~~~L~~~A~~~l~~Ai~~~lDwkA~RQ~Kl~~L~L~~ 227 (716) T TIGR02437 148 GFGGTVRLPRVIGADSALEIIASGKEVRAEDALKVGLVDAVVKADKLEAAALQLLKDAINEKLDWKAKRQPKLEPLKLSK 227 (716) T ss_pred CCCCCEECCHHHCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHH T ss_conf 89874405501131036788735741230345421245310456789999999999974003462002445454233678 Q ss_pred CHHHHH-HHHH--HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCC------CCCCCCEEEEEECCCCCCHHHHHH Q ss_conf 446778-8886--4332001001742233566655799999986011102------235752678753157789899999 Q gi|254780146|r 118 TPDMMP-LVGR--LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGAVDF------RSEKAGIIHAGIGKVSFENKKIEE 188 (232) Q Consensus 118 t~~~m~-~i~k--lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~i~~------r~~k~g~i~~~IG~~~m~~e~i~e 188 (232) -..||. .+.| +...=||. .|-|-. ...-|.++-+.-++-.+++ +.-+...-++-|| +=..|..|.. T Consensus 228 ~E~~MsF~~AKG~Va~~AG~~--YPAP~~--Av~~i~~aA~~gR~~AL~~E~~~F~kLAkt~~a~aL~G-~FLNDQ~~Kg 302 (716) T TIGR02437 228 LEAMMSFTTAKGLVAQVAGKH--YPAPVT--AVKTIEKAARFGREEALEIEAKAFVKLAKTEEAKALVG-LFLNDQYVKG 302 (716) T ss_pred HHHHHHHHHHHHHHHHCCCCC--CCCHHH--HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCEEE-HHHHHHHHHH T ss_conf 888887888644543106688--884279--99999998876579999876862011110000000100-1211378877 Q ss_pred HHHHHHHHHHHC Q ss_conf 999999999860 Q gi|254780146|r 189 NVLAFVSAVVKA 200 (232) Q Consensus 189 Ni~~~~~~i~~~ 200 (232) -.+-.-+..... T Consensus 303 ~AK~~~K~a~~~ 314 (716) T TIGR02437 303 KAKKAAKLAKKV 314 (716) T ss_pred HHHHHHHHHHHH T ss_conf 877754323000 No 67 >TIGR00004 TIGR00004 endoribonuclease L-PSP, putative; InterPro: IPR006056 This family includes proteins encoded by the yabJ gene from Bacillus subtilis that is required for adenine-mediated repression of purine biosynthetic genes in vivo and codes for an acid-soluble, 14-kDa protein. The structure of this protein has been solved and the protein is found to form a homotrimer. The trimer has a deep cleft that is a probably ligand binding site and possibly forms an enzymatic active site. However the ligand and reaction are unknown.. Probab=20.57 E-value=65 Score=13.70 Aligned_cols=42 Identities=21% Similarity=0.333 Sum_probs=25.2 Q ss_pred HHCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 86011102235752678753157789899999999999999860 Q gi|254780146|r 157 SKSGAVDFRSEKAGIIHAGIGKVSFENKKIEENVLAFVSAVVKA 200 (232) Q Consensus 157 ~k~g~i~~r~~k~g~i~~~IG~~~m~~e~i~eNi~~~~~~i~~~ 200 (232) |-||||.+-- +.|-+ +.=|+..=-.+|+.+||++++.+=-.. T Consensus 27 f~SGQIPldP-~Tge~-v~~g~~~eQa~Qvl~NlkAiL~aAG~~ 68 (129) T TIGR00004 27 FVSGQIPLDP-ETGEL-VGGGDIAEQAEQVLENLKAILEAAGLS 68 (129) T ss_pred EEECCCCCCC-CCCEE-CCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 8712237589-88757-279878888989999899999982798 No 68 >pfam01262 AlaDh_PNT_C Alanine dehydrogenase/PNT, C-terminal domain. This family now also contains the lysine 2-oxoglutarate reductases. Probab=20.38 E-value=66 Score=13.68 Aligned_cols=109 Identities=21% Similarity=0.252 Sum_probs=57.5 Q ss_pred CEEEEEECCCCCCCCCEEEEECH----HHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEE-----EECHHHH-HHHHH-- Q ss_conf 01379952778865406999750----445768741200002789999975175333468-----8644677-88886-- Q gi|254780146|r 60 VRGVVTMPNGTGVNVRVAVFATS----SKADEAREAGADIVGGEDLFEIVKGGQIDFDRC-----IATPDMM-PLVGR-- 127 (232) Q Consensus 60 irg~v~LPh~~gk~~kV~Vf~~~----~~~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~-----iAt~~~m-~~i~k-- 127 (232) -+|...---+.-++.+|+|++-| ..+..|+..||.+..-+...+......-.|..+ .+++..+ ..+.. T Consensus 7 g~g~l~~~~~gv~pa~vvViG~Gv~G~~A~~~A~~lGa~V~v~D~~~~~l~~~~~~~~~~v~~~~~~~~~~l~~~i~~aD 86 (150) T pfam01262 7 GFGMLLGGAGGVPPAKVVVIGGGVVGLGAAATAKGLGAPVTILDVRPERLEQLDSLFAEFVETDIFSNCEYLAEAIAEAD 86 (150) T ss_pred CCCEEECCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHCCCCCCCHHHCCHHHHHHHHHHCC T ss_conf 83366548899767779998987899999999986799899972999999999986476200166537999999974387 Q ss_pred --HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCC--CCCCCCCCCEEEEE Q ss_conf --4332001001742233566655799999986011--10223575267875 Q gi|254780146|r 128 --LGRILGPRGIMPNLRVGTVTTDVATAVRESKSGA--VDFRSEKAGIIHAG 175 (232) Q Consensus 128 --lgriLGPkglMPn~k~GTv~~di~~~i~~~k~g~--i~~r~~k~g~i~~~ 175 (232) ++-+|=|-.+-|. . |+ .++++.+|.|+ +.+-.|..|++... T Consensus 87 vvIgavl~pg~~aP~--l--It---~~mv~~MkpGsVIVDvaiDqGG~~Ets 131 (150) T pfam01262 87 LVIGTVLIPGARAPK--L--VT---REMVKTMKPGSVIVDVAIDQGGCIETT 131 (150) T ss_pred EEEEEEECCCCCCCC--E--EC---HHHHHHHCCCCEEEEEECCCCCCCCCC T ss_conf 999720317886992--2--07---999984479939999620489966257 No 69 >TIGR00730 TIGR00730 conserved hypothetical protein TIGR00730; InterPro: IPR005269 This family of conserved hypothetical proteins has no known function. . Probab=20.27 E-value=52 Score=14.31 Aligned_cols=58 Identities=22% Similarity=0.206 Sum_probs=30.4 Q ss_pred CEEEEECHHH------HHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHHHHHHHHH--HHHHHH--HHHCCCCCC Q ss_conf 0699975044------5768741200002789999975175333468864467788886--433200--100174223 Q gi|254780146|r 75 RVAVFATSSK------ADEAREAGADIVGGEDLFEIVKGGQIDFDRCIATPDMMPLVGR--LGRILG--PRGIMPNLR 142 (232) Q Consensus 75 kV~Vf~~~~~------~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~k--lgriLG--PkglMPn~k 142 (232) .|||||.... .+.|.+.|+.++ +.||- .=+==.++.+|..+++ .+.-+- -.|..||.. T Consensus 2 ~v~VY~~a~~g~~~~y~~~A~elG~~~~---------~~G~~-lV~GGG~~g~Mg~~a~a~~~~GG~GqvhGiiPnal 69 (205) T TIGR00730 2 TVCVYCGASPGAKAEYLELAAELGAALA---------GAGWG-LVYGGGRVGLMGAIAQAAMEAGGVGQVHGIIPNAL 69 (205) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHHHHH---------HCCCE-EEECCCCCCHHHHHHHHHHCCCCCEEEEECCCCHH T ss_conf 0788627783257899888999999998---------58956-98879871547899888750798402410042013 No 70 >cd02395 SF1_like-KH Splicing factor 1 (SF1) K homology RNA-binding domain (KH). Splicing factor 1 (SF1) specifically recognizes the intron branch point sequence (BPS) UACUAAC in the pre-mRNA transcripts during spliceosome assembly. We show that the KH-QUA2 region of SF1 defines an enlarged KH (hnRNP K) fold which is necessary and sufficient for BPS binding. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. Probab=20.21 E-value=13 Score=18.02 Aligned_cols=14 Identities=57% Similarity=0.982 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 88886433200100 Q gi|254780146|r 123 PLVGRLGRILGPRG 136 (232) Q Consensus 123 ~~i~klgriLGPkg 136 (232) |...-+|++||||| T Consensus 12 P~~Nfvg~ilGprG 25 (120) T cd02395 12 PKYNFVGLILGPRG 25 (120) T ss_pred CCCCEEEEEECCCC T ss_conf 99885678877985 No 71 >COG0541 Ffh Signal recognition particle GTPase [Intracellular trafficking and secretion] Probab=20.09 E-value=31 Score=15.67 Aligned_cols=62 Identities=24% Similarity=0.393 Sum_probs=36.8 Q ss_pred CEEEEECH-HHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEECHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 06999750-4457687412000027899999751753334688644677888864332001001742 Q gi|254780146|r 75 RVAVFATS-SKADEAREAGADIVGGEDLFEIVKGGQIDFDRCIATPDMMPLVGRLGRILGPRGIMPN 140 (232) Q Consensus 75 kV~Vf~~~-~~~~~Ak~aGa~~vg~~eli~~i~~~~~~fd~~iAt~~~m~~i~klgriLGPkglMPn 140 (232) ||+=..|= ...+.|.++- |.--.+++.+++.+|+++++=|+.+-..|.++|.++.+| +.+|- T Consensus 292 RILGMGDv~sLvEk~~~~~-d~e~a~~~~~kl~~g~FtL~Df~~Ql~~m~kmGpl~~ll---~miPG 354 (451) T COG0541 292 RILGMGDVLSLIEKAEEVV-DEEEAEKLAEKLKKGKFTLEDFLEQLEQMKKMGPLSKLL---SMIPG 354 (451) T ss_pred HHCCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHH---HHCCC T ss_conf 8537320999999999865-699999999999849978999999999987338888999---75889 Done!