Query gi|254780446|ref|YP_003064859.1| hypothetical protein CLIBASIA_01655 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 171 No_of_seqs 7 out of 9 Neff 2.7 Searched_HMMs 39220 Date Sun May 29 17:49:28 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780446.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3837 Uncharacterized conser 74.1 1.7 4.4E-05 23.9 1.5 45 105-152 8-55 (161) 2 COG4455 ImpE Protein of avirul 73.6 2.7 6.8E-05 22.7 2.4 70 90-167 44-113 (273) 3 TIGR02314 ABC_MetN D-methionin 62.8 5.2 0.00013 21.0 2.0 71 97-167 11-84 (344) 4 KOG3080 consensus 59.5 5.5 0.00014 20.9 1.7 42 128-169 150-191 (328) 5 pfam05890 Ebp2 Eukaryotic rRNA 56.4 7.7 0.0002 20.0 2.0 43 127-169 84-126 (268) 6 COG4932 Predicted outer membra 54.1 18 0.00045 17.9 3.5 30 22-54 1169-1198(1531) 7 PRK06703 flavodoxin; Provision 53.5 20 0.00052 17.5 4.0 88 75-165 45-138 (151) 8 PRK07308 flavodoxin; Validated 47.4 14 0.00035 18.5 2.1 123 47-169 8-140 (147) 9 pfam09134 Invasin_D3 Invasin, 39.6 32 0.00081 16.4 3.0 35 28-62 25-59 (99) 10 COG4598 HisP ABC-type histidin 39.4 11 0.00028 19.1 0.6 45 85-130 121-167 (256) 11 TIGR02024 FtcD glutamate formi 38.4 35 0.0009 16.1 3.9 34 78-111 43-77 (331) 12 PRK12270 kgd alpha-ketoglutara 37.6 36 0.00092 16.0 4.9 74 21-99 920-1012(1234) 13 COG3495 Uncharacterized protei 37.4 31 0.0008 16.4 2.7 57 36-95 62-143 (166) 14 COG1126 GlnQ ABC-type polar am 29.6 49 0.0013 15.3 2.6 27 35-61 38-66 (240) 15 COG1135 AbcC ABC-type metal io 28.1 52 0.0013 15.1 3.3 20 41-60 20-41 (339) 16 pfam04720 DUF506 Protein of un 27.9 53 0.0013 15.1 3.5 50 67-116 149-201 (218) 17 KOG3674 consensus 27.7 53 0.0013 15.1 2.5 86 76-163 599-695 (696) 18 TIGR03172 probable selenium-de 26.6 43 0.0011 15.6 1.9 42 64-105 112-159 (232) 19 pfam12570 DUF3750 Protein of u 25.0 60 0.0015 14.8 3.1 91 21-134 16-113 (129) 20 KOG4113 consensus 24.7 40 0.001 15.8 1.4 37 26-62 41-83 (99) 21 TIGR02315 ABC_phnC phosphonate 24.2 62 0.0016 14.7 2.3 40 122-169 36-85 (253) 22 pfam00161 RIP Ribosome inactiv 23.7 63 0.0016 14.6 3.8 108 53-169 66-197 (205) 23 pfam08484 Methyltransf_14 C-me 23.7 54 0.0014 15.0 1.9 54 110-165 74-136 (169) 24 pfam08444 Gly_acyl_tr_C Aralky 23.7 53 0.0014 15.1 1.9 75 32-138 3-77 (89) 25 TIGR01225 hutH histidine ammon 22.5 58 0.0015 14.8 1.9 26 145-170 52-80 (529) 26 pfam03393 Pneumo_matrix Pneumo 22.4 17 0.00044 17.9 -0.8 17 139-155 187-203 (252) 27 COG1541 PaaK Coenzyme F390 syn 22.2 35 0.0009 16.1 0.8 19 47-65 97-116 (438) 28 PRK09004 flavodoxin; Provision 21.9 69 0.0018 14.4 2.8 94 75-169 43-140 (146) 29 pfam11704 Folliculin Vesicle c 21.8 69 0.0018 14.4 3.3 46 62-107 42-95 (168) No 1 >COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown] Probab=74.15 E-value=1.7 Score=23.86 Aligned_cols=45 Identities=24% Similarity=0.255 Sum_probs=37.8 Q ss_pred HHHHHCCCCCCCCCCCCCCHHHHHH---HHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 8887368998721234451268999---99983887457886541676641 Q gi|254780446|r 105 EILNEKNMPYPFLGIGKNSNSVAST---LIRCMGLKELAIPNAKIAPYQGI 152 (171) Q Consensus 105 ~~IN~~nL~YPi~Gl~~NSNSva~T---l~r~M~l~~~~ip~~r~~Pg~g~ 152 (171) .+++++++.||- .-+|+++-+| ++++.+|+.-+++..++.||.-. T Consensus 8 ~~~~~k~~~y~~---~f~~~~~~~~~~~lG~~~Gl~~fGvn~~~v~PG~~S 55 (161) T COG3837 8 AVPIEKGPSYPA---PFKSPCALRTRTRLGDALGLKRFGVNLEIVEPGGES 55 (161) T ss_pred CCCCCCCCCCCC---CCCCCHHHHHHHHHHHHCCHHHCCCCEEEECCCCCC T ss_conf 011013658885---002722332345555441721301015886899755 No 2 >COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only] Probab=73.62 E-value=2.7 Score=22.73 Aligned_cols=70 Identities=14% Similarity=0.140 Sum_probs=53.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 689887679899889888736899872123445126899999983887457886541676641013777999999866 Q gi|254780446|r 90 EECVMSRWNAAVHTKEILNEKNMPYPFLGIGKNSNSVASTLIRCMGLKELAIPNAKIAPYQGIILLDDRKINEIQNNW 167 (171) Q Consensus 90 ~e~~~~rwn~~~~~~~~IN~~nL~YPi~Gl~~NSNSva~Tl~r~M~l~~~~ip~~r~~Pg~g~~LL~e~~i~~i~~~~ 167 (171) --|++--|..+.+-++..-++...|-. --|.|++++|||...+.-|- |+-.||+=-- =+++-+.-|++|. T Consensus 44 LlcvaGdw~kAl~Ql~l~a~l~p~~t~------~a~lyr~lir~ea~R~evfa-g~~~Pgflg~-p~p~wva~L~aal 113 (273) T COG4455 44 LLCVAGDWEKALAQLNLAATLSPQDTV------GASLYRHLIRCEAARNEVFA-GGAVPGFLGG-PSPEWVAALLAAL 113 (273) T ss_pred HHHHCCHHHHHHHHHHHHHHCCCCCCH------HHHHHHHHHHHHHHHHHHHC-CCCCCCCCCC-CCHHHHHHHHHHH T ss_conf 886466499999998888652901026------78999999999999999863-6777887699-9888999999987 No 3 >TIGR02314 ABC_MetN D-methionine ABC transporter, ATP-binding protein; InterPro: IPR012692 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Members of this family are the ATP-binding protein of the D-methionine ABC transporter complex. Known members belong to the proteobacteria.; GO: 0005524 ATP binding, 0048474 D-methionine transmembrane transporter activity, 0048473 D-methionine transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=62.80 E-value=5.2 Score=21.02 Aligned_cols=71 Identities=21% Similarity=0.332 Sum_probs=59.4 Q ss_pred HHHHHHHHHHHHHCCCCCC---CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 7989988988873689987---2123445126899999983887457886541676641013777999999866 Q gi|254780446|r 97 WNAAVHTKEILNEKNMPYP---FLGIGKNSNSVASTLIRCMGLKELAIPNAKIAPYQGIILLDDRKINEIQNNW 167 (171) Q Consensus 97 wn~~~~~~~~IN~~nL~YP---i~Gl~~NSNSva~Tl~r~M~l~~~~ip~~r~~Pg~g~~LL~e~~i~~i~~~~ 167 (171) |......++.+++.+|--| ++|.--.|-.--||++||.++-|.+-.-.-++-|...--|.+..+-+-|++. T Consensus 11 f~~~~~~~~al~~~~l~~P~G~i~GviG~sGaGkstlirCvnlle~P~~G~v~vdG~~lt~l~~~~l~~~rr~i 84 (344) T TIGR02314 11 FDQGDKKIQALDNVSLHVPKGQIFGVIGASGAGKSTLIRCVNLLEKPTSGAVIVDGKDLTTLSNAELVKARRQI 84 (344) T ss_pred HHHCCHHHHHHHHCEEECCCCEEEEEEECCCCCCHHHHHHHHHHCCCCCCEEEECCHHHHHCCHHHHHHHHHHC T ss_conf 65022032102101134058607998706887603554434654489886078715344210105788877630 No 4 >KOG3080 consensus Probab=59.50 E-value=5.5 Score=20.89 Aligned_cols=42 Identities=19% Similarity=0.328 Sum_probs=35.0 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 999998388745788654167664101377799999986641 Q gi|254780446|r 128 STLIRCMGLKELAIPNAKIAPYQGIILLDDRKINEIQNNWIR 169 (171) Q Consensus 128 ~Tl~r~M~l~~~~ip~~r~~Pg~g~~LL~e~~i~~i~~~~~~ 169 (171) .++.-.--|+++.+|+-||+-|+-.|.-|+.-..++|+.+|. T Consensus 150 aVl~A~~rL~elgV~~~RP~DYfAEMaKSD~HMqKVr~~Li~ 191 (328) T KOG3080 150 AVLEAFPRLHELGVPFLRPTDYFAEMAKSDEHMQKVRQRLIK 191 (328) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999974887568317999986137899999999999 No 5 >pfam05890 Ebp2 Eukaryotic rRNA processing protein EBP2. This family consists of several Eukaryotic rRNA processing protein EBP2 sequences. Ebp2p is required for the maturation of 25S rRNA and 60S subunit assembly. Ebp2p may be one of the target proteins of Rrs1p for executing the signal to regulate ribosome biogenesis. This family also plays a role in chromosome segregation. Probab=56.42 E-value=7.7 Score=20.01 Aligned_cols=43 Identities=16% Similarity=0.290 Sum_probs=35.3 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9999998388745788654167664101377799999986641 Q gi|254780446|r 127 ASTLIRCMGLKELAIPNAKIAPYQGIILLDDRKINEIQNNWIR 169 (171) Q Consensus 127 a~Tl~r~M~l~~~~ip~~r~~Pg~g~~LL~e~~i~~i~~~~~~ 169 (171) +++..-+.-|...++|+.||.-|+-.|+=+++-.++|++.++. T Consensus 84 ~av~~a~~~L~~~gvp~~RP~DYfAEM~KsD~hM~Kir~kL~~ 126 (268) T pfam05890 84 AAVLEARKLLQKLGVPFKRPDDYFAEMVKSDEHMQKVKKKLLA 126 (268) T ss_pred HHHHHHHHHHHHCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHH T ss_conf 9999999999985998769605999987269999999999999 No 6 >COG4932 Predicted outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=54.08 E-value=18 Score=17.89 Aligned_cols=30 Identities=23% Similarity=0.436 Sum_probs=19.0 Q ss_pred CCEEEEEEEEEECCCCCEEEEEECEEECCCCCE Q ss_conf 200102489998799887545413024477867 Q gi|254780446|r 22 GLFFAHRLLVVFDSNKEFVAELDGLVVNKDGKF 54 (171) Q Consensus 22 ~~~fgH~~~V~~D~~~~~laeL~GLaTsk~G~~ 54 (171) +..+|-.|= |.|+-|++|.| ||.||.+|++ T Consensus 1169 a~LaGA~Fe-l~d~dG~~Vqe--gLtTD~nG~i 1198 (1531) T COG4932 1169 ATLAGAEFE-LQDEDGTLVQE--GLTTDENGKI 1198 (1531) T ss_pred CCCCCCEEE-EECCCCCEEEC--CCEECCCCCE T ss_conf 602574799-87277867521--5300578848 No 7 >PRK06703 flavodoxin; Provisional Probab=53.54 E-value=20 Score=17.54 Aligned_cols=88 Identities=7% Similarity=0.000 Sum_probs=58.5 Q ss_pred HCCCCCCEEEEEECC-HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCC---CCCCCCCCCCCCCC Q ss_conf 013334506998454-68988767989988988873689987212344512689999998388---74578865416766 Q gi|254780446|r 75 LYKEDQEQVVLCSSS-EECVMSRWNAAVHTKEILNEKNMPYPFLGIGKNSNSVASTLIRCMGL---KELAIPNAKIAPYQ 150 (171) Q Consensus 75 ~y~~sq~q~iv~~~~-~e~~~~rwn~~~~~~~~IN~~nL~YPi~Gl~~NSNSva~Tl~r~M~l---~~~~ip~~r~~Pg~ 150 (171) .........+++++. ..+.-..|..+.+.+..++-.++.|-+||+|--| +..|-+++.. .-..++..++.|+. T Consensus 45 ~l~~~d~li~gtsT~G~Ge~P~~~~~f~~~l~~~~l~g~~~avfGlGDs~---Y~~fc~a~~~~~~~l~~~Ga~~v~~~l 121 (151) T PRK06703 45 ELLAYDGIILGSYTWGDGDLPYEAEDFHEDLENIDLSGKKVAVFGSGDTA---YPLFCEAVTIFEERLVERGAELVQEGL 121 (151) T ss_pred HHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCC---CHHHHHHHHHHHHHHHHCCCEECCCCE T ss_conf 98428845999733799868466999999987188899999999843887---477839999999999987998835876 Q ss_pred CCCCCCH--HHHHHHHH Q ss_conf 4101377--79999998 Q gi|254780446|r 151 GIILLDD--RKINEIQN 165 (171) Q Consensus 151 g~~LL~e--~~i~~i~~ 165 (171) -...-|+ +.+++.+. T Consensus 122 ~id~~pd~ded~~~c~~ 138 (151) T PRK06703 122 KIELAPETDEDVEKCSN 138 (151) T ss_pred EEECCCCCHHHHHHHHH T ss_conf 99779998799999999 No 8 >PRK07308 flavodoxin; Validated Probab=47.45 E-value=14 Score=18.51 Aligned_cols=123 Identities=11% Similarity=0.010 Sum_probs=72.4 Q ss_pred EECCCCCEEEECCCCC-------CCEEEEECCC--CHHCCCCCCEEEEEECCHH-HHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 2447786732225800-------0177763045--0101333450699845468-9887679899889888736899872 Q gi|254780446|r 47 VVNKDGKFKPIGYLPS-------DRLKVFEFKY--PCLYKEDQEQVVLCSSSEE-CVMSRWNAAVHTKEILNEKNMPYPF 116 (171) Q Consensus 47 aTsk~G~~~pIGY~~s-------d~Lk~~~~~~--~~~y~~sq~q~iv~~~~~e-~~~~rwn~~~~~~~~IN~~nL~YPi 116 (171) --|++|+..-+--.-. +..++.+... +..+.+.....+++++.-+ +.-.-|..+.+.++.++-.++.|-+ T Consensus 8 ygS~tGnae~~A~~i~~~l~~~G~~v~v~~~~~~~~~~l~~~~~~ii~tsT~G~Ge~Pd~~~~f~~~l~~~~l~~~~~aV 87 (147) T PRK07308 8 YASMTGNTEEIADIVADKLQELGHDVDVDECTTVDASDFEDADIAIVATYTYGDGDLPDEIVDFYEDLADLDLSGKIYGV 87 (147) T ss_pred EECCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCEEEE T ss_conf 98897279999999999999759940761136499768611887999955789997873499999998637889999999 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 12344512689999998388745788654167664101377799999986641 Q gi|254780446|r 117 LGIGKNSNSVASTLIRCMGLKELAIPNAKIAPYQGIILLDDRKINEIQNNWIR 169 (171) Q Consensus 117 ~Gl~~NSNSva~Tl~r~M~l~~~~ip~~r~~Pg~g~~LL~e~~i~~i~~~~~~ 169 (171) ||+|--|-.-+--..+-++-.-..++-.|+.|..-.++-||.+-++--..|.+ T Consensus 88 fglGDs~Y~~FC~a~~~~d~~l~~lGA~~v~e~l~iD~~~e~~~~~~~~~~~~ 140 (147) T PRK07308 88 VGSGDTFYDYFCKSVDDFEAQFALTGATKGAEPVKVDLAAEDEDIERLEAFAE 140 (147) T ss_pred EEECCCCHHHHHHHHHHHHHHHHHCCCEECCCEEEEECCCCCCHHHHHHHHHH T ss_conf 98417872889799999999999779937516189968998345999999999 No 9 >pfam09134 Invasin_D3 Invasin, domain 3. Members of this family adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria. Probab=39.58 E-value=32 Score=16.39 Aligned_cols=35 Identities=23% Similarity=0.377 Sum_probs=30.4 Q ss_pred EEEEEECCCCCEEEEEECEEECCCCCEEEECCCCC Q ss_conf 48999879988754541302447786732225800 Q gi|254780446|r 28 RLLVVFDSNKEFVAELDGLVVNKDGKFKPIGYLPS 62 (171) Q Consensus 28 ~~~V~~D~~~~~laeL~GLaTsk~G~~~pIGY~~s 62 (171) --+++||.+++++.-+-+++.+.+|-...|+-..+ T Consensus 25 LtltlkD~ngnpIsGi~~v~f~~sG~~vt~ssvte 59 (99) T pfam09134 25 LSFVPVDKNGHFISGMQGLSFTQNGVPVSISPITE 59 (99) T ss_pred EEEEEECCCCCCCCCEEEEEEECCCCCEEEECCCC T ss_conf 99998527898132525688723688128753201 No 10 >COG4598 HisP ABC-type histidine transport system, ATPase component [Amino acid transport and metabolism] Probab=39.44 E-value=11 Score=19.06 Aligned_cols=45 Identities=18% Similarity=0.230 Sum_probs=21.7 Q ss_pred EEECCHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCCCCCCHHHHHHH Q ss_conf 98454689887679899889888736899872--12344512689999 Q gi|254780446|r 85 LCSSSEECVMSRWNAAVHTKEILNEKNMPYPF--LGIGKNSNSVASTL 130 (171) Q Consensus 85 v~~~~~e~~~~rwn~~~~~~~~IN~~nL~YPi--~Gl~~NSNSva~Tl 130 (171) |.+-+|.+...+-....+-.-.-+..+ .||. -|=-|---++|+++ T Consensus 121 VLg~~k~ea~e~Ae~~L~kVGi~ek~~-~YP~~LSGGQQQR~aIARaL 167 (256) T COG4598 121 VLGVSKAEAIERAEKYLAKVGIAEKAD-AYPAHLSGGQQQRVAIARAL 167 (256) T ss_pred HHCCCHHHHHHHHHHHHHHHCCHHHHH-CCCCCCCCHHHHHHHHHHHH T ss_conf 655778899999999999717154430-38611474178889999987 No 11 >TIGR02024 FtcD glutamate formiminotransferase; InterPro: IPR004227 This entry represents the formiminotransferase (FT) domain of formiminotransferase-cyclodeaminase (FTCD), which forms a homodimer, with each protomer being comprised of two subdomains. Tetrahydrofolate (THF)-dependent glutamate formiminotransferase is involved in the histidine utilization pathway. This enzyme interconverts L-glutamate and N-formimino-L-glutamate. The enzyme is bifunctional as it also catalyzes the cyclodeaminase reaction on N-formimino-THF, converting it to 5,10-methenyl-THF and releasing ammonia; part of the process of regenerating THF. This model covers enzymes from metazoa as well as Gram-positive bacteria and archaea. In humans, deficiency of this enzyme results in a disease phenotype . The crystal structure of the enzyme has been studied in the context of the catalytic mechanism . ; GO: 0005542 folic acid binding, 0016740 transferase activity, 0008152 metabolic process. Probab=38.40 E-value=35 Score=16.12 Aligned_cols=34 Identities=18% Similarity=0.299 Sum_probs=29.5 Q ss_pred CCCCEEEEEECCHHHHHHH-HHHHHHHHHHHHHCC Q ss_conf 3345069984546898876-798998898887368 Q gi|254780446|r 78 EDQEQVVLCSSSEECVMSR-WNAAVHTKEILNEKN 111 (171) Q Consensus 78 ~sq~q~iv~~~~~e~~~~r-wn~~~~~~~~IN~~n 111 (171) |.++++++|-|+-|++++. .++++.|.+.|-=.+ T Consensus 43 dHNRsV~T~vG~Pe~v~~A~~~~aK~A~eLIDm~~ 77 (331) T TIGR02024 43 DHNRSVITFVGEPEKVVNAALKLAKKAAELIDMRN 77 (331) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99866899865858999999998878676530014 No 12 >PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed Probab=37.58 E-value=36 Score=16.04 Aligned_cols=74 Identities=27% Similarity=0.388 Sum_probs=54.9 Q ss_pred ECCEEEEEEEEEEC-CCCCEEEEEECEEECCCCCEEEE--------------CCCCC--CCEEEEECCCCHHCCCCCCEE Q ss_conf 02001024899987-99887545413024477867322--------------25800--017776304501013334506 Q gi|254780446|r 21 IGLFFAHRLLVVFD-SNKEFVAELDGLVVNKDGKFKPI--------------GYLPS--DRLKVFEFKYPCLYKEDQEQV 83 (171) Q Consensus 21 i~~~fgH~~~V~~D-~~~~~laeL~GLaTsk~G~~~pI--------------GY~~s--d~Lk~~~~~~~~~y~~sq~q~ 83 (171) --|+|+|+-.|+.| ..++..--|+-|+.+++|++--+ ||... +.|-.||-+.--|. +-+|+ T Consensus 920 ~RGTFshRHavl~Dq~t~~~y~PL~~l~~~~q~~f~v~nS~LSE~avLGFEYGYS~~~P~~LviWEAQFGDFa--NGAQv 997 (1234) T PRK12270 920 RRGTFSQRHAVLIDRETGEEFTPLQNLADDSQGKFLVYDSLLSEYAAMGFEYGYSVGNPDALVLWEAQFGDFA--NGAQT 997 (1234) T ss_pred CCCCCCCCCEEEEECCCCCEEEEHHHCCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHH--CCCEE T ss_conf 6543231213564136784785776648365851899815213555555552443368973354065553440--58648 Q ss_pred E--EEECCHHHHHHHHHH Q ss_conf 9--984546898876798 Q gi|254780446|r 84 V--LCSSSEECVMSRWNA 99 (171) Q Consensus 84 i--v~~~~~e~~~~rwn~ 99 (171) | -|..+-| .||.. T Consensus 998 IIDQFIsSgE---~KW~~ 1012 (1234) T PRK12270 998 IIDEFISSGE---AKWGQ 1012 (1234) T ss_pred EEEEEEECHH---HHHHH T ss_conf 9865786177---76641 No 13 >COG3495 Uncharacterized protein conserved in bacteria [Function unknown] Probab=37.42 E-value=31 Score=16.41 Aligned_cols=57 Identities=30% Similarity=0.248 Sum_probs=36.2 Q ss_pred CCCEEEEEECEEECCCCCEEEECCCCCCCEEEEEC-----------------------CCCHHCCCCC-CEEE-EEECCH Q ss_conf 98875454130244778673222580001777630-----------------------4501013334-5069-984546 Q gi|254780446|r 36 NKEFVAELDGLVVNKDGKFKPIGYLPSDRLKVFEF-----------------------KYPCLYKEDQ-EQVV-LCSSSE 90 (171) Q Consensus 36 ~~~~laeL~GLaTsk~G~~~pIGY~~sd~Lk~~~~-----------------------~~~~~y~~sq-~q~i-v~~~~~ 90 (171) .++++.+|||-+|---|-+.|.-- |--+|.+| +++.-+++++ +|++ |.++-| T Consensus 62 ~p~vvk~LdG~~~klpGy~vPle~---d~k~VteFlLVPy~GACiHvPpPPpNQIV~Vk~~kg~km~elyqp~wV~GtlK 138 (166) T COG3495 62 IPTVVKELDGSATKLPGYAVPLER---DGKLVTEFLLVPYTGACIHVPPPPPNQIVLVKPAKGYKMSELYQPVWVTGTLK 138 (166) T ss_pred CCCHHHHCCCCEECCCCCEEECCC---CCCEEEEEEEECCCCCEEECCCCCCCEEEEEECCCCCCHHHHHCEEEEEEEEC T ss_conf 764112226630116860340125---75427889960356721306986988389985698861767302147863202 Q ss_pred HHHHH Q ss_conf 89887 Q gi|254780446|r 91 ECVMS 95 (171) Q Consensus 91 e~~~~ 95 (171) -+.++ T Consensus 139 ~e~~s 143 (166) T COG3495 139 PEIES 143 (166) T ss_pred HHHHH T ss_conf 10222 No 14 >COG1126 GlnQ ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism] Probab=29.64 E-value=49 Score=15.26 Aligned_cols=27 Identities=19% Similarity=0.187 Sum_probs=17.6 Q ss_pred CCCC--EEEEEECEEECCCCCEEEECCCC Q ss_conf 9988--75454130244778673222580 Q gi|254780446|r 35 SNKE--FVAELDGLVVNKDGKFKPIGYLP 61 (171) Q Consensus 35 ~~~~--~laeL~GLaTsk~G~~~pIGY~~ 61 (171) ++|| +|--+|||-+-++|.|.--|... T Consensus 38 GSGKSTlLRclN~LE~~~~G~I~i~g~~~ 66 (240) T COG1126 38 GSGKSTLLRCLNGLEEPDSGSITVDGEDV 66 (240) T ss_pred CCCHHHHHHHHHCCCCCCCCEEEECCEEC T ss_conf 99888999999778688786499998722 No 15 >COG1135 AbcC ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism] Probab=28.09 E-value=52 Score=15.10 Aligned_cols=20 Identities=30% Similarity=0.534 Sum_probs=7.2 Q ss_pred EEEECEEEC-CCCCEE-EECCC Q ss_conf 454130244-778673-22258 Q gi|254780446|r 41 AELDGLVVN-KDGKFK-PIGYL 60 (171) Q Consensus 41 aeL~GLaTs-k~G~~~-pIGY~ 60 (171) .-|++.+-. ..|++- -|||+ T Consensus 20 ~al~~vsL~I~~GeI~GIIG~S 41 (339) T COG1135 20 TALDDVSLEIPKGEIFGIIGYS 41 (339) T ss_pred EEECCCEEEECCCCEEEEECCC T ss_conf 5651406887388689997488 No 16 >pfam04720 DUF506 Protein of unknown function (DUF506). Family of uncharacterized plant proteins. Probab=27.91 E-value=53 Score=15.08 Aligned_cols=50 Identities=24% Similarity=0.174 Sum_probs=38.3 Q ss_pred EEECCCCH-HCCC-CCCEEEEEECCHHHHHHH-HHHHHHHHHHHHHCCCCCCC Q ss_conf 76304501-0133-345069984546898876-79899889888736899872 Q gi|254780446|r 67 VFEFKYPC-LYKE-DQEQVVLCSSSEECVMSR-WNAAVHTKEILNEKNMPYPF 116 (171) Q Consensus 67 ~~~~~~~~-~y~~-sq~q~iv~~~~~e~~~~r-wn~~~~~~~~IN~~nL~YPi 116 (171) -|++-+|+ .|+. -+..|.||-|..|.+-.= =....++.+++.+++|+-|+ T Consensus 149 ~FEiARpt~~Y~~ll~~LP~vfVG~~~rL~~iv~im~~aak~Slk~~gmhlpP 201 (218) T pfam04720 149 EFEIARPTEEYKRILQSLPRVFVGKVERLKQLVRIMCDAAKRSLKKKGMHLPP 201 (218) T ss_pred HEEEECCCHHHHHHHHHCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 52343685899999996998477279999999999999999999977999999 No 17 >KOG3674 consensus Probab=27.69 E-value=53 Score=15.09 Aligned_cols=86 Identities=15% Similarity=0.154 Sum_probs=56.2 Q ss_pred CCCCCCEEEEEECCHHHHHHHHHH-----------HHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 133345069984546898876798-----------998898887368998721234451268999999838874578865 Q gi|254780446|r 76 YKEDQEQVVLCSSSEECVMSRWNA-----------AVHTKEILNEKNMPYPFLGIGKNSNSVASTLIRCMGLKELAIPNA 144 (171) Q Consensus 76 y~~sq~q~iv~~~~~e~~~~rwn~-----------~~~~~~~IN~~nL~YPi~Gl~~NSNSva~Tl~r~M~l~~~~ip~~ 144 (171) +-+.+++-.++.++++-+.+--.+ +.+-++.||++--.-=--.+-|.-||+--|-.|.|+ -++++|+- T Consensus 599 fnsnr~~Tl~l~t~kdt~~~~l~~i~eg~~d~~a~a~~~L~dikEL~kn~fsk~~~Qh~~s~V~~~~~~~~-~~~sF~~~ 677 (696) T KOG3674 599 FNSNRNYTLKLDTNKDTIEHILAAISEGFDDVRAGAIHSLLDIKELQKNQFSKALIQHNNSIVMTCFRSML-GEESFPMP 677 (696) T ss_pred HCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHC-CHHHCCCC T ss_conf 13553269974777544999999998765026513778888699999865526667752006999999841-03406776 Q ss_pred CCCCCCCCCCCCHHHHHHH Q ss_conf 4167664101377799999 Q gi|254780446|r 145 KIAPYQGIILLDDRKINEI 163 (171) Q Consensus 145 r~~Pg~g~~LL~e~~i~~i 163 (171) ++|-.+++..+..+.+++ T Consensus 678 -va~t~ns~~~~~qE~aav 695 (696) T KOG3674 678 -VAPTSNSDVGSIQESAAV 695 (696) T ss_pred -CCCCCCCCCEEEEHHHCC T ss_conf -675210220021111013 No 18 >TIGR03172 probable selenium-dependent hydroxylase accessory protein YqeC. This uncharacterized protein family includes YqeC from Escherichia coli. A phylogenetic profiling analysis shows correlation with SelD, the selenium donor protein, even in species where SelD contributes to neither selenocysteine nor selenouridine biosynthesis. Instead, this family, and families TIGR03309 and TIGR03310 appear to mark selenium-dependent molybdenum hydroxylase maturation systems. Probab=26.59 E-value=43 Score=15.60 Aligned_cols=42 Identities=17% Similarity=0.218 Sum_probs=30.5 Q ss_pred CEEEEECCCCHHCCCCCCEEEEEECC------HHHHHHHHHHHHHHHH Q ss_conf 17776304501013334506998454------6898876798998898 Q gi|254780446|r 64 RLKVFEFKYPCLYKEDQEQVVLCSSS------EECVMSRWNAAVHTKE 105 (171) Q Consensus 64 ~Lk~~~~~~~~~y~~sq~q~iv~~~~------~e~~~~rwn~~~~~~~ 105 (171) .||++.-.+|.....+...+.|++-+ .+++.|||+++.+.+. T Consensus 112 plKaP~~~EPvIP~~t~~VI~V~g~~~lG~pl~~e~vHR~e~~~~i~~ 159 (232) T TIGR03172 112 PLKAPSDHEPVIPKSSTTVIGVAGISVVGEKLDTDIVHRWPEFLALTG 159 (232) T ss_pred CCCCCCCCCCCCCCCCCEEEEEECHHHCCCCCCHHHCCCHHHHHHHHC T ss_conf 164889899947887888999705778298378766069999999868 No 19 >pfam12570 DUF3750 Protein of unknown function (DUF3750). This family of proteins is found in bacteria. Proteins in this family are typically between 175 and 265 amino acids in length. Probab=24.96 E-value=60 Score=14.75 Aligned_cols=91 Identities=23% Similarity=0.251 Sum_probs=47.3 Q ss_pred ECCEEEEEEEEEEC--CCCCEEEEEECEEECCCCCEEEECCCCCCCEEEEECCCCHHCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 02001024899987--9988754541302447786732225800017776304501013334506998454689887679 Q gi|254780446|r 21 IGLFFAHRLLVVFD--SNKEFVAELDGLVVNKDGKFKPIGYLPSDRLKVFEFKYPCLYKEDQEQVVLCSSSEECVMSRWN 98 (171) Q Consensus 21 i~~~fgH~~~V~~D--~~~~~laeL~GLaTsk~G~~~pIGY~~sd~Lk~~~~~~~~~y~~sq~q~iv~~~~~e~~~~rwn 98 (171) .+-++.|--+|+|. ...-..+|+-|- |...-+-.+..|+ .|....|-+ +..-. =. T Consensus 16 rG~fAvH~WivvK~~~a~~y~ryeV~gW-----g~~l~~n~~~pd~--~w~g~~p~l---------~~~~~-------G~ 72 (129) T pfam12570 16 RGIFAVHTWIVVKEAGATAYTRYEVVGW-----GVHVRKNLLAPDG--RWYGGPPRL---------LAAWR-------GE 72 (129) T ss_pred CEEEEEEEEEEEECCCCCCCCEEEEEEC-----CCEEECCCCCCCC--CCCCCCCEE---------EHHHC-------CC T ss_conf 3587887799995279875337998607-----6124414779976--604998657---------04234-------84 Q ss_pred HHHHHHHHHHHCCCCCCCCCC-----CCCCHHHHHHHHHHH Q ss_conf 899889888736899872123-----445126899999983 Q gi|254780446|r 99 AAVHTKEILNEKNMPYPFLGI-----GKNSNSVASTLIRCM 134 (171) Q Consensus 99 ~~~~~~~~IN~~nL~YPi~Gl-----~~NSNSva~Tl~r~M 134 (171) .+...++-|.+-=-.||.-+- |-|||.|..-++|.. T Consensus 73 ~a~~lI~~Ie~a~~~YP~~~~Yr~wPGPNSNTFvawv~r~v 113 (129) T pfam12570 73 EAERLIPKIEAAIASYPYPNEYRAWPGPNSNTFVAWVLRQV 113 (129) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEECCCCCHHHHHHHHHHHC T ss_conf 58999999999998299889721778997077999998658 No 20 >KOG4113 consensus Probab=24.67 E-value=40 Score=15.79 Aligned_cols=37 Identities=30% Similarity=0.357 Sum_probs=21.8 Q ss_pred EEEEEEEEC----CCCCEEEEEECEEEC--CCCCEEEECCCCC Q ss_conf 024899987----998875454130244--7786732225800 Q gi|254780446|r 26 AHRLLVVFD----SNKEFVAELDGLVVN--KDGKFKPIGYLPS 62 (171) Q Consensus 26 gH~~~V~~D----~~~~~laeL~GLaTs--k~G~~~pIGY~~s 62 (171) --+||+++| +|-.+--++.|+--= -+-|.-||||--. T Consensus 41 ~~dF~lVkDmf~FeNVgfSr~v~~~K~LvCADCEkGPiG~~d~ 83 (99) T KOG4113 41 LSDFFLVKDMFAFENVGFSREVPNLKYLVCADCEKGPIGYHDL 83 (99) T ss_pred CCCEEEECCCEEEECCEECCCCCCEEEEEEECCCCCCCEEEEC T ss_conf 4450753132014222011336980699982256587041705 No 21 >TIGR02315 ABC_phnC phosphonate ABC transporter, ATP-binding protein; InterPro: IPR012693 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Phosphonates are a class of phosphorus-containing organic compound with a stable direct C-P bond rather than a C-O-P linkage. A number of bacterial species have operons, typically about 14 genes in size, with genes for ATP-dependent transport of phosphonates, degradation, and regulation of the expression of the system. Members of this protein family are the ATP-binding cassette component of tripartite ABC transporters of phosphonates.; GO: 0005524 ATP binding, 0015416 phosphonate transmembrane-transporting ATPase activity, 0015716 phosphonate transport, 0016020 membrane. Probab=24.20 E-value=62 Score=14.67 Aligned_cols=40 Identities=33% Similarity=0.593 Sum_probs=25.9 Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCC---CCCC------CCCCCHHHHHHHH-HHHHH Q ss_conf 51268999999838874578865416---7664------1013777999999-86641 Q gi|254780446|r 122 NSNSVASTLIRCMGLKELAIPNAKIA---PYQG------IILLDDRKINEIQ-NNWIR 169 (171) Q Consensus 122 NSNSva~Tl~r~M~l~~~~ip~~r~~---Pg~g------~~LL~e~~i~~i~-~~~~~ 169 (171) -|-+==|||+||.+ .=++ |-+| .+++...+|-+.+ ..=+| T Consensus 36 ~SGAGKSTLLR~iN--------rL~~Gdk~~~Geilidf~i~~~g~~i~~~~~~k~LR 85 (253) T TIGR02315 36 PSGAGKSTLLRCIN--------RLVEGDKPSSGEILIDFSILLEGTDITKLRRGKKLR 85 (253) T ss_pred CCCCCHHHHHHHHH--------HHCCCCCCCCCEEEEEEEEEECCCHHHHHHHHHHHH T ss_conf 78872679998775--------302688887650898888887273187675488999 No 22 >pfam00161 RIP Ribosome inactivating protein. Probab=23.73 E-value=63 Score=14.61 Aligned_cols=108 Identities=15% Similarity=0.152 Sum_probs=63.1 Q ss_pred CEEEECCC-CCCCEEEEECC------CCHHCCCCCCEEEEEECCHHHHHHHH-----------HHHHHHHHHHHHCCCCC Q ss_conf 67322258-00017776304------50101333450699845468988767-----------98998898887368998 Q gi|254780446|r 53 KFKPIGYL-PSDRLKVFEFK------YPCLYKEDQEQVVLCSSSEECVMSRW-----------NAAVHTKEILNEKNMPY 114 (171) Q Consensus 53 ~~~pIGY~-~sd~Lk~~~~~------~~~~y~~sq~q~iv~~~~~e~~~~rw-----------n~~~~~~~~IN~~nL~Y 114 (171) +.-=+||. ..+ +.|+|+ ...++.++.....-|.|++..++... ....+++..+...+-. T Consensus 66 NlYvvGy~~~~~--~~y~F~d~~~~~~~~lf~~~~~~~L~f~gsY~~L~~~ag~~r~~i~LG~~~L~~ai~~L~~~~~~- 142 (205) T pfam00161 66 NLYVVGYRNGGG--RWYFFKDAPSDATTNLFPGTTQTTLPFGGSYTSLEGAAGVNRENIPLGRQSLDSAITALYGYDPS- 142 (205) T ss_pred CEEEEEEECCCC--EEEECCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHCCCCCEECCCHHHHHHHHHHHHCCCCC- T ss_conf 738999985898--07983788511336678998456787889888998863887011004779999999999644888- Q ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCCCCC----CCCCCCC--CCCCCCHHHHHHHHHHHHH Q ss_conf 72123445126899999983887457886----5416766--4101377799999986641 Q gi|254780446|r 115 PFLGIGKNSNSVASTLIRCMGLKELAIPN----AKIAPYQ--GIILLDDRKINEIQNNWIR 169 (171) Q Consensus 115 Pi~Gl~~NSNSva~Tl~r~M~l~~~~ip~----~r~~Pg~--g~~LL~e~~i~~i~~~~~~ 169 (171) .+..+.|+.|+.|+-.--++.-| .++..+. ..-..|....-.+++||=+ T Consensus 143 ------~~~~~~a~~llv~IqMv~EAARFkyIe~~v~~~~~~~~~~~p~~~~i~lennW~~ 197 (205) T pfam00161 143 ------TDTKDEARALLVLIQMVAEAARFKYIENLVRTNFDSYRSFTPDPKMISLENNWGR 197 (205) T ss_pred ------CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf ------7407899999999998676887399999998533446774898889999984999 No 23 >pfam08484 Methyltransf_14 C-methyltransferase. This domain is found in bacterial C-methyltransferase proteins, often together with other methyltransferase domains such as pfam08241 or pfam08242. Probab=23.69 E-value=54 Score=15.03 Aligned_cols=54 Identities=20% Similarity=0.311 Sum_probs=40.1 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHCCCCCC---------CCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 6899872123445126899999983887457---------8865416766410137779999998 Q gi|254780446|r 110 KNMPYPFLGIGKNSNSVASTLIRCMGLKELA---------IPNAKIAPYQGIILLDDRKINEIQN 165 (171) Q Consensus 110 ~nL~YPi~Gl~~NSNSva~Tl~r~M~l~~~~---------ip~~r~~Pg~g~~LL~e~~i~~i~~ 165 (171) +.-.+.+.|.|--. =+.||+..++|.... .-.++..||.+.-.++++++.+.+. T Consensus 74 k~~~k~I~gyGAsa--K~~tlln~~~i~~~~I~~i~D~n~~K~gk~~PGt~IpI~s~~~l~~~~p 136 (169) T pfam08484 74 KAAGKSVAGYGASA--KGNTLLNYCGIGEDLLDFIVDRNPLKQGKLTPGTHIPILSPEELKARKP 136 (169) T ss_pred HHCCCEEEEECCCC--HHHHHHHHHCCCHHHHHEEEECCCCCCCCCCCCCCEEECCHHHHHHCCC T ss_conf 98699899976773--0589999966543541224416302468786898967738999955399 No 24 >pfam08444 Gly_acyl_tr_C Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region. This family features the C-terminal region of several mammalian specific aralkyl acyl-CoA:amino acid N-acyltransferase (glycine N-acyltransferase) proteins EC:2.3.1.13. Probab=23.69 E-value=53 Score=15.06 Aligned_cols=75 Identities=15% Similarity=0.274 Sum_probs=48.6 Q ss_pred EECCCCCEEEEEECEEECCCCCEEEECCCCCCCEEEEECCCCHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 98799887545413024477867322258000177763045010133345069984546898876798998898887368 Q gi|254780446|r 32 VFDSNKEFVAELDGLVVNKDGKFKPIGYLPSDRLKVFEFKYPCLYKEDQEQVVLCSSSEECVMSRWNAAVHTKEILNEKN 111 (171) Q Consensus 32 ~~D~~~~~laeL~GLaTsk~G~~~pIGY~~sd~Lk~~~~~~~~~y~~sq~q~iv~~~~~e~~~~rwn~~~~~~~~IN~~n 111 (171) +.||.|++|+= .-+|++||.---+++|++|-|++-- .++..-.+.+- . T Consensus 3 llgpeG~PVSW---~Lmdqtge~rmgyTlPeyR~kG~~~---------------------------~~~~~~~~~L~--~ 50 (89) T pfam08444 3 LLGPEGTPVSW---SLMDQTGELRMAGTLPKYRRQGLMS---------------------------HVIYHQAQYLE--K 50 (89) T ss_pred CCCCCCCEEEE---EEECCCCEEEECCCCHHHHCCCHHH---------------------------HHHHHHHHHHH--H T ss_conf 13899987479---9853742110013377787558699---------------------------99999999998--6 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 998721234451268999999838874 Q gi|254780446|r 112 MPYPFLGIGKNSNSVASTLIRCMGLKE 138 (171) Q Consensus 112 L~YPi~Gl~~NSNSva~Tl~r~M~l~~ 138 (171) ..||+++..--.|-.-..+...++-.+ T Consensus 51 ~g~P~Y~hv~~~N~~~~k~~~~lg~~~ 77 (89) T pfam08444 51 LGFPVYSHVDKANEISQKMSGNLGHVP 77 (89) T ss_pred CCCCCEEEEHHCCHHHHHHHHHCCCEE T ss_conf 799946864115788999999779825 No 25 >TIGR01225 hutH histidine ammonia-lyase; InterPro: IPR005921 Histidine ammonia-lyase deaminates histidine to urocanic acid, the first step in histidine degradation. It is closely related to phenylalanine ammonia-lyase. ; GO: 0004397 histidine ammonia-lyase activity, 0006548 histidine catabolic process, 0005737 cytoplasm. Probab=22.52 E-value=58 Score=14.82 Aligned_cols=26 Identities=23% Similarity=0.406 Sum_probs=20.5 Q ss_pred CCCCCCCC---CCCCHHHHHHHHHHHHHC Q ss_conf 41676641---013777999999866411 Q gi|254780446|r 145 KIAPYQGI---ILLDDRKINEIQNNWIRC 170 (171) Q Consensus 145 r~~Pg~g~---~LL~e~~i~~i~~~~~~~ 170 (171) +|--|||+ ..++.+...+||+|+||- T Consensus 52 GvNTGFG~l~~~~I~~~~~~~LQ~NLvrS 80 (529) T TIGR01225 52 GVNTGFGKLASVRIDREDLAELQRNLVRS 80 (529) T ss_pred EEECCCCCHHHHCCCHHHHHHHHHHHHHH T ss_conf 41155001233016577899987547776 No 26 >pfam03393 Pneumo_matrix Pneumovirus matrix protein. Probab=22.41 E-value=17 Score=17.95 Aligned_cols=17 Identities=65% Similarity=0.989 Sum_probs=11.1 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 57886541676641013 Q gi|254780446|r 139 LAIPNAKIAPYQGIILL 155 (171) Q Consensus 139 ~~ip~~r~~Pg~g~~LL 155 (171) -+|-.+|++|+-|.+|+ T Consensus 187 ~AIT~ArIaPYAGLilv 203 (252) T pfam03393 187 NAITNAKIIPYAGLILV 203 (252) T ss_pred HHHHHCCCCCCCCEEEE T ss_conf 66664212244644999 No 27 >COG1541 PaaK Coenzyme F390 synthetase [Coenzyme metabolism] Probab=22.23 E-value=35 Score=16.12 Aligned_cols=19 Identities=26% Similarity=0.324 Sum_probs=16.1 Q ss_pred EEC-CCCCEEEECCCCCCCE Q ss_conf 244-7786732225800017 Q gi|254780446|r 47 VVN-KDGKFKPIGYLPSDRL 65 (171) Q Consensus 47 aTs-k~G~~~pIGY~~sd~L 65 (171) ||| +||++..+||.+.|.. T Consensus 97 aSSGTTGkPt~~~~t~~D~~ 116 (438) T COG1541 97 ASSGTTGKPTVFGYTAKDIE 116 (438) T ss_pred CCCCCCCCCEEEECCHHHHH T ss_conf 35888899604404889999 No 28 >PRK09004 flavodoxin; Provisional Probab=21.93 E-value=69 Score=14.40 Aligned_cols=94 Identities=12% Similarity=0.094 Sum_probs=54.0 Q ss_pred HCCCCCCEEEEEECCHH--HHHHHHHHHHHHHHH--HHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 01333450699845468--988767989988988--87368998721234451268999999838874578865416766 Q gi|254780446|r 75 LYKEDQEQVVLCSSSEE--CVMSRWNAAVHTKEI--LNEKNMPYPFLGIGKNSNSVASTLIRCMGLKELAIPNAKIAPYQ 150 (171) Q Consensus 75 ~y~~sq~q~iv~~~~~e--~~~~rwn~~~~~~~~--IN~~nL~YPi~Gl~~NSNSva~Tl~r~M~l~~~~ip~~r~~Pg~ 150 (171) ....+....++++++-+ .--+ ...+.+.++. ..-.++.|-+||+|--|-.-|-...+-++=.-..++-.|+.|.. T Consensus 43 ~l~~~~~~iiv~ST~G~Gd~Pdn-a~~F~~~L~~~~~~L~~l~yaVfGlGDs~Y~~Fc~~gk~ld~~L~~lGA~rl~~~~ 121 (146) T PRK09004 43 DLSASGIWLIVSSTHGAGDLPDN-LQPFFEELQEQKPDLSAVRFAAIGIGSSSYDTFCGAIEKLEALLKASGAKQIGEKL 121 (146) T ss_pred HHCCCCEEEEEECCCCCCCCCHH-HHHHHHHHHCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEECCE T ss_conf 70406828999888999978664-99999999736977689779999527756889979999999999987990742657 Q ss_pred CCCCCCHHHHHHHHHHHHH Q ss_conf 4101377799999986641 Q gi|254780446|r 151 GIILLDDRKINEIQNNWIR 169 (171) Q Consensus 151 g~~LL~e~~i~~i~~~~~~ 169 (171) -.+.-....-++.-..|++ T Consensus 122 ~~D~~~~~~~e~~~~~W~~ 140 (146) T PRK09004 122 KIDVLQHDIPEDPAEAWLK 140 (146) T ss_pred EEECCCCCCCHHHHHHHHH T ss_conf 7777899982588999999 No 29 >pfam11704 Folliculin Vesicle coat protein involved in Golgi to plasma membrane transport. In yeast cells this family functions in the regulated delivery of Gap1p (a general amino acid permease) to the cell surface, perhaps as a component of a post-Golgi secretory-vesicle coat complex. Birt-Hogg-Dube (BHD)4 syndrome is an autosomal dominant disorder characterized by hamartomas of skin follicles, lung cysts, spontaneous pneumothorax, and renal cell carcinoma. Folliculin is the protein from the BHD4 gene and is found to have no significant homology to any other human proteins. It is expressed in most tissues. These same symptoms also occur in TSC or tuberous sclerosis complex, suggesting that the same pathway is involved, and it is likely that the target is the down-stream Tor2 - an essential gene. Folliculin appears to bind Tor2, and down-regulation of Tor2 activity leads to up-regulation of nitrogen responsive genes including membrane transporters and amino acid permeases. Probab=21.77 E-value=69 Score=14.38 Aligned_cols=46 Identities=9% Similarity=0.281 Sum_probs=34.5 Q ss_pred CCCEEEEECCCCHHCCCCCC-------EEEEEECCH-HHHHHHHHHHHHHHHHH Q ss_conf 00177763045010133345-------069984546-89887679899889888 Q gi|254780446|r 62 SDRLKVFEFKYPCLYKEDQE-------QVVLCSSSE-ECVMSRWNAAVHTKEIL 107 (171) Q Consensus 62 sd~Lk~~~~~~~~~y~~sq~-------q~iv~~~~~-e~~~~rwn~~~~~~~~I 107 (171) -|..++|++++-|-.+|+++ ..|++.+++ -+.++.|.-+......| T Consensus 42 GD~~~G~~ls~~Fkl~D~~ARG~~R~Ysli~l~~~~~~~L~~~w~fi~~~f~~i 95 (168) T pfam11704 42 GDAIRGYVLSYGFKLKDPNARGNERRYSLIVLADDKMSKLLNNWPFIAEYFGKI 95 (168) T ss_pred ECCCCCEEEEEEEEECCCCCCCCCEEEEEEEEECCHHHHHHHHCHHHHHHHHHH T ss_conf 308788289999982572137644168999995676899987086999999999 Done!