Query gi|254780612|ref|YP_003065025.1| putative transmembrane protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 503 No_of_seqs 421 out of 678 Neff 10.1 Searched_HMMs 39220 Date Mon May 30 02:59:27 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780612.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PHA02107 hypothetical protein 79.9 4.6 0.00012 20.0 5.1 49 433-481 62-124 (216) 2 pfam03725 RNase_PH_C 3' exorib 63.0 11 0.00028 17.4 5.1 58 427-495 2-62 (67) 3 cd04619 CBS_pair_6 The CBS dom 61.9 10 0.00026 17.6 3.4 44 435-481 65-109 (114) 4 KOG2915 consensus 59.8 12 0.00029 17.2 3.4 76 377-473 159-238 (314) 5 pfam08016 PKD_channel Polycyst 58.5 13 0.00033 16.9 5.2 58 415-478 101-158 (423) 6 KOG0933 consensus 44.6 7.1 0.00018 18.7 0.3 32 439-480 46-81 (1174) 7 PRK09532 DNA polymerase III su 44.1 19 0.0005 15.7 2.5 17 443-460 568-584 (874) 8 TIGR02129 hisA_euk phosphoribo 43.0 11 0.00027 17.5 1.0 16 462-477 219-234 (274) 9 smart00709 Zpr1 Duplicated dom 42.2 23 0.00058 15.2 3.0 63 360-422 38-108 (160) 10 COG4738 Predicted transcriptio 42.2 9.5 0.00024 17.8 0.6 20 448-467 55-74 (124) 11 cd03600 CLECT_thrombomodulin_l 41.8 5.1 0.00013 19.7 -0.8 38 435-472 55-94 (141) 12 PRK01033 imidazole glycerol ph 41.7 6.1 0.00016 19.1 -0.4 69 412-481 18-90 (253) 13 TIGR02992 ectoine_eutC ectoine 40.7 12 0.0003 17.2 0.9 13 428-440 62-74 (326) 14 TIGR03549 conserved hypothetic 40.4 24 0.00062 15.0 2.6 98 359-500 120-224 (718) 15 TIGR00311 aIF-2beta translatio 40.2 16 0.00042 16.2 1.6 22 3-24 1-22 (134) 16 pfam07249 Cerato-platanin Cera 40.1 22 0.00055 15.4 2.2 33 468-503 65-97 (119) 17 cd03592 CLECT_selectins_like C 39.8 11 0.00027 17.5 0.6 37 426-472 36-75 (115) 18 pfam10369 ALS_ss_C Small subun 39.6 25 0.00064 14.9 4.0 46 429-477 27-72 (75) 19 PRK08637 hypothetical protein; 38.3 26 0.00067 14.8 3.4 60 436-496 322-384 (386) 20 TIGR01279 DPOR_bchN light-inde 38.3 10 0.00026 17.6 0.3 131 346-498 140-281 (458) 21 PRK05412 nucleotide-binding pr 37.6 21 0.00055 15.4 1.9 28 446-474 133-160 (161) 22 cd03595 CLECT_chondrolectin_li 37.0 6.4 0.00016 19.0 -0.9 39 435-473 62-101 (149) 23 PRK02145 consensus 36.0 12 0.0003 17.2 0.3 65 412-477 19-87 (257) 24 cd03375 TPP_OGFOR Thiamine pyr 35.8 7.2 0.00018 18.7 -0.8 56 406-476 54-110 (193) 25 TIGR02354 thiF_fam2 thiamine b 35.6 18 0.00047 15.9 1.2 26 432-459 157-182 (200) 26 COG3204 Uncharacterized protei 35.3 20 0.0005 15.6 1.4 117 368-486 142-278 (316) 27 smart00116 CBS Domain in cysta 35.2 28 0.00072 14.5 2.2 29 451-479 11-40 (49) 28 pfam03367 zf-ZPR1 ZPR1 zinc-fi 35.0 29 0.00075 14.4 3.0 62 362-423 41-110 (160) 29 PRK01659 consensus 34.7 12 0.0003 17.2 0.2 57 423-479 28-88 (252) 30 KOG3047 consensus 34.7 22 0.00056 15.3 1.5 32 430-461 88-120 (157) 31 cd04606 CBS_pair_Mg_transporte 34.4 30 0.00076 14.4 2.7 78 394-481 24-103 (109) 32 PRK00830 consensus 34.3 17 0.00043 16.1 0.9 84 407-491 17-112 (273) 33 TIGR03578 EF_0831 conserved hy 32.3 32 0.00082 14.1 2.3 21 480-500 11-31 (96) 34 COG2876 AroA 3-deoxy-D-arabino 32.3 16 0.0004 16.3 0.5 22 453-474 140-161 (286) 35 PRK12702 mannosyl-3-phosphogly 32.3 32 0.00082 14.1 4.3 54 439-502 227-298 (302) 36 PRK00748 1-(5-phosphoribosyl)- 32.0 22 0.00057 15.3 1.2 25 453-477 176-201 (241) 37 PRK04281 consensus 31.9 20 0.00051 15.6 1.0 42 436-477 44-86 (254) 38 pfam09837 DUF2064 Uncharacteri 30.6 23 0.0006 15.1 1.1 20 435-454 97-116 (121) 39 cd04638 CBS_pair_25 The CBS do 30.3 35 0.00089 13.9 3.2 43 435-480 58-100 (106) 40 pfam06089 Asparaginase_II L-as 30.3 35 0.00089 13.9 2.6 31 429-466 267-297 (325) 41 PRK02621 consensus 29.2 31 0.00078 14.3 1.5 42 436-477 44-86 (254) 42 PRK13597 imidazole glycerol ph 28.8 32 0.00082 14.2 1.6 66 412-478 19-88 (252) 43 PRK03220 consensus 28.8 35 0.0009 13.9 1.8 65 412-477 19-87 (257) 44 PRK10076 pyruvate formate lyas 28.8 37 0.00094 13.7 4.3 32 438-469 91-137 (213) 45 KOG1383 consensus 28.7 37 0.00094 13.7 2.4 36 437-472 382-417 (491) 46 pfam11217 DUF3013 Protein of u 28.7 29 0.00075 14.4 1.4 25 392-416 31-57 (160) 47 PRK12595 bifunctional 3-deoxy- 28.2 33 0.00085 14.0 1.6 33 437-474 202-234 (360) 48 COG2987 HutU Urocanate hydrata 27.5 26 0.00066 14.8 0.9 29 438-466 360-396 (561) 49 pfam12281 DUF3620 Protein of u 27.2 39 0.001 13.6 2.0 110 364-480 47-159 (216) 50 PRK02747 consensus 26.5 21 0.00053 15.5 0.2 67 412-479 18-88 (257) 51 pfam07946 DUF1682 Protein of u 26.4 28 0.00072 14.5 0.9 15 470-484 212-226 (322) 52 cd04495 BRCA2DBD_OB3 BRCA2DBD_ 26.3 14 0.00035 16.7 -0.7 40 454-498 57-96 (100) 53 PRK13415 flagella biosynthesis 26.2 41 0.001 13.4 2.1 40 407-460 105-144 (216) 54 KOG1693 consensus 26.2 14 0.00036 16.7 -0.7 103 363-465 60-164 (209) 55 smart00310 PTBI Phosphotyrosin 26.0 41 0.001 13.4 3.4 68 434-502 25-98 (98) 56 PRK13396 3-deoxy-7-phosphohept 25.8 40 0.001 13.5 1.6 37 432-474 181-217 (352) 57 pfam11322 DUF3124 Protein of u 25.8 41 0.0011 13.4 2.2 17 374-391 25-41 (126) 58 KOG0526 consensus 25.6 42 0.0011 13.3 3.4 96 372-475 141-243 (615) 59 pfam03425 CBM_11 Carbohydrate 25.5 34 0.00087 14.0 1.2 31 385-415 135-168 (170) 60 PRK05211 consensus 25.2 9.8 0.00025 17.7 -1.6 65 412-477 9-77 (248) 61 KOG3883 consensus 25.0 43 0.0011 13.3 2.4 84 392-478 41-131 (198) 62 TIGR00400 mgtE magnesium trans 24.5 44 0.0011 13.2 4.5 76 395-479 171-247 (460) 63 cd04731 HisF The cyclase subun 24.1 13 0.00033 16.9 -1.1 40 437-476 42-82 (243) 64 cd03596 CLECT_tetranectin_like 24.1 31 0.0008 14.2 0.8 25 438-472 59-86 (129) 65 pfam03180 Lipoprotein_9 NLPA l 24.0 45 0.0011 13.2 2.8 28 358-385 96-128 (236) 66 pfam00869 Flavi_glycoprot Flav 24.0 17 0.00042 16.1 -0.6 38 437-482 193-238 (293) 67 PRK13585 1-(5-phosphoribosyl)- 24.0 40 0.001 13.5 1.3 40 437-476 46-86 (240) 68 PRK07006 isocitrate dehydrogen 23.9 37 0.00093 13.8 1.1 18 454-472 279-296 (409) 69 cd07136 ALDH_YwdH-P39616 Bacil 23.7 18 0.00046 15.9 -0.5 16 456-471 423-438 (449) 70 COG1628 Endonuclease V homolog 23.4 31 0.00079 14.3 0.7 84 406-498 22-124 (185) 71 COG4148 ModC ABC-type molybdat 23.2 35 0.00088 13.9 0.9 30 444-473 156-187 (352) 72 cd04732 HisA HisA. Phosphorib 23.0 45 0.0012 13.1 1.5 40 436-475 43-83 (234) 73 cd02072 Glm_B12_BD B12 binding 22.8 47 0.0012 13.0 2.9 38 435-472 47-88 (128) 74 PRK07374 dnaE DNA polymerase I 22.7 47 0.0012 13.0 2.4 17 464-480 1123-1140(1171) 75 PRK02079 pyrroloquinoline quin 22.5 48 0.0012 13.0 1.7 22 445-466 66-87 (88) 76 KOG3107 consensus 22.4 48 0.0012 13.0 1.5 24 379-402 180-203 (468) 77 KOG3972 consensus 22.3 47 0.0012 13.0 1.4 24 3-26 81-104 (252) 78 pfam10794 DUF2606 Protein of u 22.1 48 0.0012 12.9 3.1 78 391-468 37-128 (131) 79 cd04643 CBS_pair_30 The CBS do 22.0 49 0.0012 12.9 3.3 41 435-480 69-110 (116) 80 PRK06826 dnaE DNA polymerase I 21.9 49 0.0012 12.9 2.3 40 457-497 1107-1148(1157) 81 COG3814 Uncharacterized protei 21.9 44 0.0011 13.2 1.2 12 434-445 75-86 (157) 82 TIGR03572 WbuZ glycosyl amidat 21.9 16 0.0004 16.3 -1.1 65 412-477 18-86 (232) 83 pfam00948 Flavi_NS1 Flavivirus 21.8 46 0.0012 13.1 1.3 51 360-411 183-252 (355) 84 CHL00073 chlN photochlorophyll 21.7 35 0.0009 13.9 0.7 12 394-405 415-426 (457) 85 PRK06273 ferredoxin; Provision 21.5 45 0.0011 13.1 1.2 25 406-440 55-83 (163) 86 TIGR02318 phosphono_phnM phosp 21.4 50 0.0013 12.8 3.1 51 437-503 162-215 (391) 87 pfam03452 Anp1 Anp1. The membe 21.4 50 0.0013 12.8 2.6 77 392-470 55-156 (270) 88 cd04582 CBS_pair_ABC_OpuCA_ass 20.9 51 0.0013 12.7 3.5 33 448-480 67-100 (106) 89 TIGR01858 tag_bisphos_ald clas 20.9 38 0.00098 13.6 0.7 44 457-500 192-247 (282) 90 cd01339 LDH-like_MDH L-lactate 20.8 23 0.00059 15.1 -0.4 45 380-424 235-287 (300) 91 cd03557 L-arabinose_isomerase 20.7 52 0.0013 12.7 2.2 108 331-443 270-400 (484) 92 PRK05673 dnaE DNA polymerase I 20.5 52 0.0013 12.7 2.4 44 458-501 1092-1138(1143) 93 COG2014 Uncharacterized conser 20.4 22 0.00055 15.4 -0.6 23 458-480 98-121 (250) 94 COG4742 Predicted transcriptio 20.2 18 0.00046 15.9 -1.1 22 431-452 207-228 (260) No 1 >PHA02107 hypothetical protein Probab=79.90 E-value=4.6 Score=20.03 Aligned_cols=49 Identities=22% Similarity=0.171 Sum_probs=35.9 Q ss_pred EEECCCEE------------EEECCC-CCHHHH-HHHHHHHCCCCEEEEEEECCCCEEEEEEC Q ss_conf 67469828------------998178-825688-77999730781168788379967899961 Q gi|254780612|r 433 FVISKNSY------------LISLKG-SEEDPF-RNSKILEEYRFIDIPITYRSGQKILFTID 481 (503) Q Consensus 433 ~kv~~~~f------------~v~l~~-~~~~~~-~n~~ll~~~~w~dip~~y~ngrra~~~~e 481 (503) +||.+++| .||||- ++.|-. .-|+||+.++-+----.-+|||--|.+|- T Consensus 62 ~KI~PN~~N~~A~~~~sV~G~IW~NT~FEsDC~~~RL~~L~~YG~~V~i~~~~NG~N~iV~FS 124 (216) T PHA02107 62 AKISPNHINLAAVGTHSVCGTIWVNTSFESDCVTNRLQTLEAYGVFVEVARRTNGSNHVVTFS 124 (216) T ss_pred CCCCCCEEEEECCCCEEEEEEEEEECHHHHHHHHHHHHHHHHHCEEEEEEECCCCCCCEEEEE T ss_conf 313777044411133688778997223267789989999986072899984158865169999 No 2 >pfam03725 RNase_PH_C 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase) contains two tandem copies of the domain. PNPase is involved in mRNA degradation in a 3'-5' direction. The exosome is a 3'-5' exoribonuclease complex that is required for 3' processing of the 5.8S rRNA. Three of its five protein components contain a copy of this domain. A hypothetical protein from S. pombe appears to belong to an uncharacterized subfamily. This subfamily is found in both eukaryotes and archaebacteria. Probab=62.96 E-value=11 Score=17.41 Aligned_cols=58 Identities=12% Similarity=0.068 Sum_probs=32.4 Q ss_pred CCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCH---HHHHHHHHH Q ss_conf 311001674698289981788256887799973078116878837996789996158768---999999999 Q gi|254780612|r 427 LIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTIDKGKKG---ADIFKSAIM 495 (503) Q Consensus 427 ~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g---~~~f~~a~~ 495 (503) +.+++.+=+-||.|+|-++.-+... ..+.+ .|+| ||.+-++++=|+..+ +.-|.++|. T Consensus 2 ~v~avtvg~i~~~~i~dpt~~Ee~~--------~~~~l--~v~~-~~~~~~v~~~~~g~~~l~~~~l~~~l~ 62 (67) T pfam03725 2 PVAAVTVGKIDGELVLDPTLEEESL--------ADSDL--TIVV-AGTGDIVALMKEGGGGLTEEELLEALE 62 (67) T ss_pred CEEEEEEEEECCEEEECCCHHHHHH--------HCCEE--EEEE-ECCCCEEEEECCCCCCCCHHHHHHHHH T ss_conf 8899999999999999989678633--------14736--8999-689999999918997859999999999 No 3 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=61.89 E-value=10 Score=17.63 Aligned_cols=44 Identities=5% Similarity=0.068 Sum_probs=32.8 Q ss_pred ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEEC Q ss_conf 4698289981788256887799973078116878837996-7899961 Q gi|254780612|r 435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTID 481 (503) Q Consensus 435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~e 481 (503) -|.+.+-+--.+. ...=+++|.+++.-.+|++..||| .+|||+- T Consensus 65 Mt~~vitv~~~~~---i~~a~~~M~~~~i~~lPVVD~~~~~vGiit~~ 109 (114) T cd04619 65 MTRAVVSCRPGDL---LHDVWQVMKQRGLKNIPVVDENARPLGVLNAR 109 (114) T ss_pred CCCCCEEECCCCC---HHHHHHHHHHCCCCEEEEEECCCEEEEEEEHH T ss_conf 1589889999890---99999988876985889990799599999935 No 4 >KOG2915 consensus Probab=59.83 E-value=12 Score=17.25 Aligned_cols=76 Identities=16% Similarity=0.144 Sum_probs=46.4 Q ss_pred EEEEECCCCCCCCCE-EEEEEEEEE-CCC-C-CCCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHH Q ss_conf 999951789877781-288999971-688-8-873211032310213321246631100167469828998178825688 Q gi|254780612|r 377 ASMTLKCNADISLSI-THVMEIMFS-FPK-E-SQDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPF 452 (503) Q Consensus 377 ~~~~~rrn~d~slpa-sh~~e~~f~-~~~-~-~~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~ 452 (503) ++++-|-=--..+.. |++++-+|. +|. . .-.--.+ +-..+|-.|+-.|-.=+-.+ T Consensus 159 vt~~hrDVc~~GF~~ks~~aDaVFLDlPaPw~AiPha~~---------------------~lk~~g~r~csFSPCIEQvq 217 (314) T KOG2915 159 VTVTHRDVCGSGFLIKSLKADAVFLDLPAPWEAIPHAAK---------------------ILKDEGGRLCSFSPCIEQVQ 217 (314) T ss_pred EEEEEEECCCCCCCCCCCCCCEEEECCCCHHHHHHHHHH---------------------HHHHCCCEEEECCHHHHHHH T ss_conf 489996415677353134235699758980552233677---------------------75406825996217899999 Q ss_pred HHHHHHHCCCCEEEEEEECCC Q ss_conf 779997307811687883799 Q gi|254780612|r 453 RNSKILEEYRFIDIPITYRSG 473 (503) Q Consensus 453 ~n~~ll~~~~w~dip~~y~ng 473 (503) |-.++|+.++||||-|+---+ T Consensus 218 rtce~l~~~gf~~i~~vEv~~ 238 (314) T KOG2915 218 RTCEALRSLGFIEIETVEVLL 238 (314) T ss_pred HHHHHHHHCCCCEEEEEEEEH T ss_conf 999999867982379987603 No 5 >pfam08016 PKD_channel Polycystin cation channel. This family contains the cation channel region of PKD1 and PKD2 proteins. Probab=58.51 E-value=13 Score=16.89 Aligned_cols=58 Identities=12% Similarity=0.183 Sum_probs=44.6 Q ss_pred CEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEE Q ss_conf 1021332124663110016746982899817882568877999730781168788379967899 Q gi|254780612|r 415 ISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILF 478 (503) Q Consensus 415 ~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~ 478 (503) -..+..+.-.|.+-.|.-.-=..|=|.+-|+....+-.+.|+-||..+||| .+-||++ T Consensus 101 w~y~~~~~l~~~~~~G~~~~Y~gGGYv~~L~~~~~~~~~~l~~L~~~~WiD------~~TRAvf 158 (423) T pfam08016 101 RIYSPPNLLMGAWHWGHLASYPSGGYVVLLPLSREESLKRLAYLQDHNWLD------RRTRAVF 158 (423) T ss_pred CEEECHHHHCCCCEEEEEEEECCCCEEEECCCCHHHHHHHHHHHHHCCCCC------CCCCEEE T ss_conf 755177881887517677898899889978999999999999998768700------0452799 No 6 >KOG0933 consensus Probab=44.57 E-value=7.1 Score=18.72 Aligned_cols=32 Identities=16% Similarity=0.354 Sum_probs=23.8 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCC----CEEEEEE Q ss_conf 28998178825688779997307811687883799----6789996 Q gi|254780612|r 439 SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSG----QKILFTI 480 (503) Q Consensus 439 ~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ng----rra~~~~ 480 (503) .|..|++|+..-|+.|||=| +|.|| -+|..|+ T Consensus 46 CFvLGI~nl~~VRA~nlqeL----------Iyk~GQAGiTkAsVsV 81 (1174) T KOG0933 46 CFVLGITNLSQVRASNLQEL----------IYKNGQAGITKASVSV 81 (1174) T ss_pred HHHHCCCHHHHHHHHHHHHH----------HHHCCCCCCEEEEEEE T ss_conf 99982452888888789999----------9846866642579999 No 7 >PRK09532 DNA polymerase III subunit alpha; Reviewed Probab=44.15 E-value=19 Score=15.66 Aligned_cols=17 Identities=18% Similarity=0.249 Sum_probs=5.8 Q ss_pred ECCCCCHHHHHHHHHHHC Q ss_conf 817882568877999730 Q gi|254780612|r 443 SLKGSEEDPFRNSKILEE 460 (503) Q Consensus 443 ~l~~~~~~~~~n~~ll~~ 460 (503) ||.++..= .+=+.|+++ T Consensus 568 GL~~Lt~I-~~a~~lI~~ 584 (874) T PRK09532 568 GLKNLTMI-QKTADLIKE 584 (874) T ss_pred CCCHHHHH-HHHHHHHHH T ss_conf 55768899-999999998 No 8 >TIGR02129 hisA_euk phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase; InterPro: IPR011858 This enzyme acts in the biosynthesis of histidine and has been characterised in Saccharomyces cerevisiae and Arabidopsis where it complements the Escherichia coli HisA gene. In eukaryotes the gene is known as HIS6. In bacteria, this gene is found in Fibrobacter succinogenes, presumably due to lateral gene transfer from plants in the rumen gut.; GO: 0003949 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase activity, 0000105 histidine biosynthetic process. Probab=43.04 E-value=11 Score=17.50 Aligned_cols=16 Identities=44% Similarity=0.821 Sum_probs=12.4 Q ss_pred CCEEEEEEECCCCEEE Q ss_conf 8116878837996789 Q gi|254780612|r 462 RFIDIPITYRSGQKIL 477 (503) Q Consensus 462 ~w~dip~~y~ngrra~ 477 (503) ..+||||||+.|=|-| T Consensus 219 ~~~~ip~tYAGGakS~ 234 (274) T TIGR02129 219 VYDDIPVTYAGGAKSI 234 (274) T ss_pred CCCCCCEEEECCCEEH T ss_conf 8752431772786556 No 9 >smart00709 Zpr1 Duplicated domain in the epidermal growth factor- and elongation factor-1alpha-binding protein Zpr1. Also present in archaeal proteins. Probab=42.24 E-value=23 Score=15.18 Aligned_cols=63 Identities=11% Similarity=0.161 Sum_probs=41.7 Q ss_pred CCCEEEEEEECCCCCEEEEEEECCCCCC------CCCEEE-EEEEEEECCCCC-CCCHHHCCCCEECCCCC Q ss_conf 6742367630787771799995178987------778128-899997168888-73211032310213321 Q gi|254780612|r 360 KGLVIKGDIPMIDNAFSASMTLKCNADI------SLSITH-VMEIMFSFPKES-QDAVVDLRRISMRKTDN 422 (503) Q Consensus 360 ~~~aira~i~iP~~~~~~~~~~rrn~d~------slpash-~~e~~f~~~~~~-~~~v~~v~~~~~k~~e~ 422 (503) .-.-|+.--.|.++|.+.+|.++--.|. |=-|+- +=||-|.++.+. +|-|.||-||+++--++ T Consensus 38 r~~ev~~~~~~~p~G~r~tl~V~~~~DL~r~VvkS~tati~IPEL~~ei~pg~~~G~vTTVEGlL~r~~~~ 108 (160) T smart00709 38 RNNEVKSGGAIEPKGTRITLKVESPEDLNRDVVKSETATISIPELDLEIPPGPLGGFITTVEGLLSRVREV 108 (160) T ss_pred CCEEEEECCCCCCCEEEEEEEECCHHHCCCEEEECCCEEEEEEEEEEEECCCCCCCEEECHHHHHHHHHHH T ss_conf 52378887666896189999988979908607972865899854303766977686798189999999999 No 10 >COG4738 Predicted transcriptional regulator [Transcription] Probab=42.18 E-value=9.5 Score=17.82 Aligned_cols=20 Identities=15% Similarity=0.179 Sum_probs=14.9 Q ss_pred CHHHHHHHHHHHCCCCEEEE Q ss_conf 25688779997307811687 Q gi|254780612|r 448 EEDPFRNSKILEEYRFIDIP 467 (503) Q Consensus 448 ~~~~~~n~~ll~~~~w~dip 467 (503) ..++.--+.+||+++|||+= T Consensus 55 QPEVSiAMr~Lre~gWV~~R 74 (124) T COG4738 55 QPEVSIAMRYLRENGWVDER 74 (124) T ss_pred CCHHHHHHHHHHHCCCCCHH T ss_conf 83068999999875630067 No 11 >cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved. TM exerts anti-fibrinolytic and anti-inflammatory activity. TM also regulates blood coagulation in the anticoagulant protein C pathway. In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM. TM also plays a key role in tumor biology. It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma. Loss of TM expression correlates with advanced stage and poor prognosis. Loss of function of TM func Probab=41.84 E-value=5.1 Score=19.72 Aligned_cols=38 Identities=11% Similarity=0.087 Sum_probs=24.2 Q ss_pred ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEE--EEEECC Q ss_conf 46982899817882568877999730781168--788379 Q gi|254780612|r 435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDI--PITYRS 472 (503) Q Consensus 435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~di--p~~y~n 472 (503) .++..|||||++......-.+..|+.-.|+|= ++.|.| T Consensus 55 ~~~~~~WIGL~~~~~~~~~~~~~~~~f~W~~g~~~~~y~n 94 (141) T cd03600 55 RGSLRLWIGLQREPRQCSDPSLPLRGFSWVTGDQDTDFSN 94 (141) T ss_pred CCCCEEEEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCC T ss_conf 6552177631013444556656656537788998777667 No 12 >PRK01033 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=41.66 E-value=6.1 Score=19.15 Aligned_cols=69 Identities=12% Similarity=0.164 Sum_probs=48.5 Q ss_pred CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEEEEC Q ss_conf 323102133212466311001---67469828998178825688779997307-81168788379967899961 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILFTID 481 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~~~e 481 (503) |+.+.||... --|+|+.-+- -.=.|..++|-|+.....+..|+.++++- +=+++||.|..|=|.+-.++ T Consensus 18 Vr~~~f~~~~-~~gdP~~~ak~f~~~GadelhivDld~a~~g~~~n~~~I~~I~~~~~ipi~vGGGIrs~e~~~ 90 (253) T PRK01033 18 VKTVKFKDPK-YVGDPINAVRIFNEKEADELIVLDIDASRKGREPNYELIENLASECFMPLCYGGGIKTVEQAK 90 (253) T ss_pred EECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCHHHHH T ss_conf 9785577674-888999999999987999899994745424880169999999987699889868812168889 No 13 >TIGR02992 ectoine_eutC ectoine utilization protein EutC; InterPro: IPR014334 Members of this entry are EutA, a predicted arylmalonate decarboxylase found in a conserved ectoine utilization operon of species that include Mesorhizobium loti, Silicibacter pomeroyi, Agrobacterium tumefaciens, Pseudomonas putida and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine). Members belong to the ornithine cyclodeaminase/mu-crystallin family.. Probab=40.74 E-value=12 Score=17.17 Aligned_cols=13 Identities=15% Similarity=0.276 Sum_probs=9.8 Q ss_pred CCCEEEEECCCEE Q ss_conf 1100167469828 Q gi|254780612|r 428 IDSNIFVISKNSY 440 (503) Q Consensus 428 l~~~~~kv~~~~f 440 (503) |.+-+.||++||| T Consensus 62 ld~FAiK~SPGFF 74 (326) T TIGR02992 62 LDGFAIKVSPGFF 74 (326) T ss_pred CCCCEEEECCCCC T ss_conf 7565155257723 No 14 >TIGR03549 conserved hypothetical protein TIGR03549. This family consists of remarkably well-conserved proteins from gamma and beta Proteobacteria, heavily skewed towards organisms of marine environments. This family has an OsmC-like N-terminal domain. It shares a central domain, modeled by pfam02624 and TIGR00702, with other families of smaller proteins. The function is unknown. Fifteen of the first sixteen members of this family are from selenouridine-positive genomes, but this correlation may be fortuitous. Probab=40.41 E-value=24 Score=14.99 Aligned_cols=98 Identities=16% Similarity=0.211 Sum_probs=62.3 Q ss_pred CCCCEEEEEEECCCCCEEEEEEECCCCCCCCCEEEEEEEEEECCCCC--CCCHHHCCCC----EECCCCCCCCCCCCCE- Q ss_conf 76742367630787771799995178987778128899997168888--7321103231----0213321246631100- Q gi|254780612|r 359 LKGLVIKGDIPMIDNAFSASMTLKCNADISLSITHVMEIMFSFPKES--QDAVVDLRRI----SMRKTDNSPSVLIDSN- 431 (503) Q Consensus 359 ~~~~aira~i~iP~~~~~~~~~~rrn~d~slpash~~e~~f~~~~~~--~~~v~~v~~~----~~k~~e~~~g~~l~~~- 431 (503) +++-.|. .|+.=|.+-.+.|+++...|.+ +||..++- -..|.|++++ .||-.|.+=-.|+-.| T Consensus 120 ~p~f~ie-~v~~l~~da~~~l~~~~~~~~~---------TfI~GKDa~LE~sIa~m~~~L~~LGf~IEeaSWlNpVpnVW 189 (718) T TIGR03549 120 GPEFQIE-TVENLDADAQALLMGTPDSGAS---------TYIEGKDLPLEQTIANMTAILADLGMKIEIASWRNIVPNVW 189 (718) T ss_pred CCCEEEE-EHHCCCHHHHHHHCCCCCCCCC---------EEECCCCCCHHHHHHHHHHHHHHCCCCCEEHHHCCCCCCCC T ss_conf 8844775-5302163578772578898874---------13237874389999999999987486311054337888720 Q ss_pred EEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHHHHHC Q ss_conf 167469828998178825688779997307811687883799678999615876899999999998522 Q gi|254780612|r 432 IFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTIDKGKKGADIFKSAIMQWENR 500 (503) Q Consensus 432 ~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g~~~f~~a~~~w~~~ 500 (503) +|.|.| -|-||.++|| ||.+-+.|...|++..--+ T Consensus 190 SvHIRD--------------------------~~cP~cFTNG--------KGatk~aALASALGE~~ER 224 (718) T TIGR03549 190 SLHIRD--------------------------AASPMCFTNG--------KGATKESALCSALGEFIER 224 (718) T ss_pred EEEEEC--------------------------CCCCCCCCCC--------CCCCHHHHHHHHHHHHHHH T ss_conf 587515--------------------------8885220378--------7501888999999999998 No 15 >TIGR00311 aIF-2beta translation initiation factor aIF-2, beta subunit, putative; InterPro: IPR004458 Archaeal transcription initiation factor 2 is, like its eukaryotic homologue, a heterotrimeric protein with alpha, beta and gamma subunits . It is thought play an essential role in the recognition of the correct codon for the start of translation, similar to the role of eukaryotic transcription initiation factor. The eukaryotic factor forms a ternary complex with the methionine initiator tRNA and GTP, which binds to the 40S ribosomal subunit as part of the 43S preinitiation complex . The beta and gamma subunits are responsible for recruiting the initiator tRNA and GTP, while the alpha subunit is involved in the regulation of the translation initiation process. The beta subunit has also been shown to interact with other transcription factors. All three subunits of the archaeal and eukaryotic factors are well conserved among the diverse species of eukaryotes and archaea, but do not occur in prokaryotes. The archaeal beta subunit has an unfolded N-terminal domain, a mixed alpha/beta core domain and a C-terminal zinc finger . The N-terminal region is thought to interact with the gamma subunit, while the central and C-terminal domains are thought to provide RNA-binding sites.; GO: 0003743 translation initiation factor activity, 0006413 translational initiation. Probab=40.20 E-value=16 Score=16.18 Aligned_cols=22 Identities=32% Similarity=0.708 Sum_probs=18.6 Q ss_pred CHHHHHHHHHHCCCCCCCHHHH Q ss_conf 1799999886167765600013 Q gi|254780612|r 3 DFILVIQRAVDNLPENTPERRS 24 (503) Q Consensus 3 D~~~~l~kai~~l~~~tpe~r~ 24 (503) ||-.||+||++-|++..=+.+. T Consensus 1 dYe~LLeRA~~~lpd~vf~r~~ 22 (134) T TIGR00311 1 DYEELLERAIEQLPDKVFERKE 22 (134) T ss_pred CHHHHHHHHHHHCCCCCCCCCC T ss_conf 9578899988736841013154 No 16 >pfam07249 Cerato-platanin Cerato-platanin. This family contains a number of fungal cerato-platanin phytotoxic proteins approximately 150 residues long. Cerato-platanin contains four cysteine residues that form two disulphide bonds. Probab=40.12 E-value=22 Score=15.36 Aligned_cols=33 Identities=24% Similarity=0.418 Sum_probs=23.5 Q ss_pred EEECCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 883799678999615876899999999998522369 Q gi|254780612|r 468 ITYRSGQKILFTIDKGKKGADIFKSAIMQWENRSNN 503 (503) Q Consensus 468 ~~y~ngrra~~~~ekg~~g~~~f~~a~~~w~~~~~~ 503 (503) |+|.+...-||+|.....| ||=+..|-...||+ T Consensus 65 ltyng~sI~vlaID~A~~G---Fni~~~Am~~LT~g 97 (119) T pfam07249 65 VTYNGNTIFVLAIDSAGGG---FNIAKTAMNELTNG 97 (119) T ss_pred EEECCEEEEEEEEECCCCC---CEECHHHHHHHHCC T ss_conf 8989928999998158876---16529999987577 No 17 >cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. L- sel is expressed constitutively on most leukocytes. P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets. E- sels are present on endothelial cells. Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp Probab=39.80 E-value=11 Score=17.47 Aligned_cols=37 Identities=16% Similarity=0.298 Sum_probs=22.0 Q ss_pred CCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEE---EEEEECC Q ss_conf 6311001674698289981788256887799973078116---8788379 Q gi|254780612|r 426 VLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFID---IPITYRS 472 (503) Q Consensus 426 ~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~d---ip~~y~n 472 (503) +-|..+.-+..+++|+|||+++.. +-.|+| -++.|.| T Consensus 36 ~~l~~~~~~~~~~~~WiG~~d~~~----------~~~W~~~~g~~~~y~n 75 (115) T cd03592 36 ALLNGFALKYNLGYYWIDGNDINN----------EGTWVDTDKKELEYKN 75 (115) T ss_pred HHHHHHHHCCCCCCEEEEEECCCC----------CCEEEECCCCCCCCCC T ss_conf 999999861288758987153488----------7879939899545076 No 18 >pfam10369 ALS_ss_C Small subunit of acetolactate synthase. ALS_ss_C is the C-terminal half of a family of proteins which are the small subunits of acetolactate synthase. Acetolactate synthase is a tetrameric enzyme, containing probably two large and two small subunits, which catalyses the first step in branched-chain amino acid biosynthesis. This reaction is sensitive to certain herbicides. Probab=39.60 E-value=25 Score=14.91 Aligned_cols=46 Identities=22% Similarity=0.381 Sum_probs=37.4 Q ss_pred CCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEE Q ss_conf 1001674698289981788256887799973078116878837996789 Q gi|254780612|r 429 DSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKIL 477 (503) Q Consensus 429 ~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~ 477 (503) -|-.+-+++++++|=|++.....+.-+.+|+..+-+. +-+.|+-|| T Consensus 27 ~a~ivd~~~~~~iiE~tG~~~kid~fi~~l~~~gI~E---ivRtG~vAi 72 (75) T pfam10369 27 RAKIVDVTPDSYTIELTGDPEKIDAFIDLLRPFGILE---VVRTGVVAI 72 (75) T ss_pred CCEEEEECCCEEEEEEECCHHHHHHHHHHHHHHCCEE---EEECCHHHC T ss_conf 9899970388899999399899999999876609799---995387342 No 19 >PRK08637 hypothetical protein; Provisional Probab=38.34 E-value=26 Score=14.78 Aligned_cols=60 Identities=17% Similarity=0.197 Sum_probs=38.9 Q ss_pred CCCEE-EEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEE--EEECCCCCHHHHHHHHHHH Q ss_conf 69828-9981788256887799973078116878837996789--9961587689999999999 Q gi|254780612|r 436 SKNSY-LISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKIL--FTIDKGKKGADIFKSAIMQ 496 (503) Q Consensus 436 ~~~~f-~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~--~~~ekg~~g~~~f~~a~~~ 496 (503) ..||| +|.|++...+..| .-||.+.+-.=|+.--.++|.++ |+.|+=.-+-+++.+++.. T Consensus 322 ~~G~F~~i~~~~~~~e~l~-~~Lle~~~V~~v~~g~~~iRIs~aals~~nI~~~~~~I~~al~~ 384 (386) T PRK08637 322 NSGYFMCLKLKDVDAEELR-VHLLEKYGIGVIALNETDLRIAFSCVEEEDIPHVFDSIAKAIDD 384 (386) T ss_pred CCEEEEEECCCCCCHHHHH-HHHHHCCCEEEEECCCCCEEEEEEECCHHHHHHHHHHHHHHHHH T ss_conf 7615886479989999999-99986395899955999779999858899999999999999985 No 20 >TIGR01279 DPOR_bchN light-independent protochlorophyllide reductase, N subunit; InterPro: IPR005970 Synonym: dark protochlorophyllide reductase Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. The light-independent (dark) form of protochlorophyllide reductase plays a key role in the ability of gymnosperms, algae, and photosynthetic bacteria to form chlorophyll in the dark. Genetic and sequence analyses have indicated that dark protochlorophyllide reductase consists of three protein subunits that exhibit significant sequence similarity to the three subunits of nitrogenase, which catalyzes the reductive formation of ammonia from dinitrogen. Dark protochlorophyllide reductase activity was shown to be dependent on the presence of all three subunits, ATP, and the reductant dithionite. This enzyme describes the N subunit of the dark form protochlorophyllide reductase, a nitrogenase-like enzyme . ; GO: 0016730 oxidoreductase activity acting on iron-sulfur proteins as donors, 0015995 chlorophyll biosynthetic process, 0019685 photosynthesis dark reaction. Probab=38.26 E-value=10 Score=17.58 Aligned_cols=131 Identities=12% Similarity=0.047 Sum_probs=76.8 Q ss_pred CCCCEEECCCCCCCCCCEEEEEEECCCCCEEEEEEECCCCCCCCCEEEEE---EEEEECCCCCCCCHHHCCCCEECCCCC Q ss_conf 62212304577777674236763078777179999517898777812889---999716888873211032310213321 Q gi|254780612|r 346 KILWSLQQEKSQGLKGLVIKGDIPMIDNAFSASMTLKCNADISLSITHVM---EIMFSFPKESQDAVVDLRRISMRKTDN 422 (503) Q Consensus 346 sVvWs~~~es~~~~~~~aira~i~iP~~~~~~~~~~rrn~d~slpash~~---e~~f~~~~~~~~~v~~v~~~~~k~~e~ 422 (503) +|--.|+-.=+.. .-+.|+=.. |++-.-.|||=|..|..-=-|... .|+|+ |+|.+.=-=.|+..=. T Consensus 140 ~VL~AL~P~~P~s---a~apa~e~~-E~~~~~~~~FG~~~~~EKvese~~~~~~lv~~------GSv~D~~~~~l~~e~~ 209 (458) T TIGR01279 140 TVLAALVPFCPKS---AEAPAEEKV-ERKKQKLLSFGKKADKEKVESEAVDQRKLVLV------GSVNDIVADQLRLELK 209 (458) T ss_pred HHHHHHCCCCCCC---CCCCCCCCH-HHHCCHHHHCCCHHHHHHHHHHCCCCCCEEEE------ECCCCHHHHHHHHHHH T ss_conf 9999631257887---769898711-22122121136301122221000278656899------4355123677999998 Q ss_pred CCCCCCCCE-EEE-------ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHH Q ss_conf 246631100-167-------469828998178825688779997307811687883799678999615876899999999 Q gi|254780612|r 423 SPSVLIDSN-IFV-------ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTIDKGKKGADIFKSAI 494 (503) Q Consensus 423 ~~g~~l~~~-~~k-------v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g~~~f~~a~ 494 (503) --|-+..|. |.+ |.+|++++.|+=+=.+-. .+|.|+|.==-| ==-|==|+.|.+.|=++| T Consensus 210 ~LGi~~~~flPa~~~~~lP~i~pgt~~a~~qPyL~~ta--~~L~~eR~~~~L----------~~~FPfGPDGT~~f~e~i 277 (458) T TIGR01279 210 QLGIPVVGFLPASRFTELPVIGPGTVVAPLQPYLSDTA--TTLRRERGAKVL----------SALFPFGPDGTRRFLEAI 277 (458) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHH--HHHHHHCCCEEE----------CCCCCCCCCHHHHHHHHH T ss_conf 37886434277433455768798616983374188999--999850266561----------157898851689999999 Q ss_pred HHHH Q ss_conf 9985 Q gi|254780612|r 495 MQWE 498 (503) Q Consensus 495 ~~w~ 498 (503) .+-- T Consensus 278 a~~f 281 (458) T TIGR01279 278 AAEF 281 (458) T ss_pred HHHH T ss_conf 9985 No 21 >PRK05412 nucleotide-binding protein; Reviewed Probab=37.63 E-value=21 Score=15.37 Aligned_cols=28 Identities=11% Similarity=0.178 Sum_probs=23.5 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEEEECCCC Q ss_conf 88256887799973078116878837996 Q gi|254780612|r 446 GSEEDPFRNSKILEEYRFIDIPITYRSGQ 474 (503) Q Consensus 446 ~~~~~~~~n~~ll~~~~w~dip~~y~ngr 474 (503) -...|.+.=++|||+.+ ||+||.|.|=| T Consensus 133 KkrDDLQ~vi~llk~~d-~~~pLQF~NfR 160 (161) T PRK05412 133 KKRDDLQAVIALLRGAD-LGQPLQFNNFR 160 (161) T ss_pred CCHHHHHHHHHHHHHCC-CCCCCEECCCC T ss_conf 98889999999998327-79971133688 No 22 >cd03595 CLECT_chondrolectin_like C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CLECT_chondrolectin_like: C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CHODL is predominantly expressed in muscle cells and is associated with T-cell maturation. Various alternatively spliced isoforms have been of CHODL have been identified. The transmembrane form of CHODL is localized in the ER-Golgi apparatus. Layilin is widely expressed in different cell types. The extracellular CTLD of layilin binds hyaluronan (HA), a major constituent of the extracellular matrix (ECM). The cytoplasmic tail of layilin binds various members of the band 4.1/ERM superfamily (talin, radixin, and merlin). The ERM proteins are cytoskeleton-membrane l Probab=36.98 E-value=6.4 Score=19.01 Aligned_cols=39 Identities=23% Similarity=0.297 Sum_probs=22.0 Q ss_pred ECCCEEEEECCCCCHHHHHHHHHHHCCCCEE-EEEEECCC Q ss_conf 4698289981788256887799973078116-87883799 Q gi|254780612|r 435 ISKNSYLISLKGSEEDPFRNSKILEEYRFID-IPITYRSG 473 (503) Q Consensus 435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~d-ip~~y~ng 473 (503) .+++.|||||.+....-......-..-.|.| -|+.|+|- T Consensus 62 ~~~~~~WIGL~~~~~~~~~~~~~~~~f~W~DGs~~~y~nW 101 (149) T cd03595 62 ASDGDFWIGLRRSSQYNVTSSACSSLYYWLDGSISTFRNW 101 (149) T ss_pred CCCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCCCCCC T ss_conf 6666347876622666544477774059779981143576 No 23 >PRK02145 consensus Probab=36.00 E-value=12 Score=17.22 Aligned_cols=65 Identities=9% Similarity=0.107 Sum_probs=32.0 Q ss_pred CCCCEECCCCCCCCCCCCCE---EEEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE Q ss_conf 32310213321246631100---167469828998178825688779997307-8116878837996789 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSN---IFVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL 477 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~---~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~ 477 (503) |+++.++... .-|+||.-+ ...=.|..|+|-|+.....+..|+.|++.- .=++|||.|..|=|.+ T Consensus 19 Vkg~~~~~~~-~~gdP~~~a~~~~~~GadelhivDld~a~~~~~~~~~~I~~i~~~~~iPi~vGGGIrs~ 87 (257) T PRK02145 19 VKGVNFVELR-DAGDPVEIARRYDEQGADELTFLDITATSDGRDLILPIIEAVASQVFIPLTVGGGVRAV 87 (257) T ss_pred EECCCCCCEE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCH T ss_conf 8577777738-88899999999998799989999788876675408999999996568748962773046 No 24 >cd03375 TPP_OGFOR Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor. In the TCA cycle, OGFOR catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. In the reductive tricarboxylic acid cycle found in the anaerobic autotroph Hydrogenobacter thermophilus, OGFOR catalyzes the reductive carboxylation of succinyl-CoA to produce 2-oxoglutarate. Thauera aromatica OGFOR has been shown to provide reduced ferredoxin to benzoyl-CoA reductase, a key enzyme in the anaerobic metabolism of aromatic compounds. OGFOR is dependent on TPP and a divalent metal cation for activity. Probab=35.79 E-value=7.2 Score=18.67 Aligned_cols=56 Identities=14% Similarity=-0.081 Sum_probs=34.5 Q ss_pred CCCHHHCCCCEECCCCCCCCCCCCCEEEEECCC-EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEE Q ss_conf 732110323102133212466311001674698-28998178825688779997307811687883799678 Q gi|254780612|r 406 QDAVVDLRRISMRKTDNSPSVLIDSNIFVISKN-SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKI 476 (503) Q Consensus 406 ~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~-~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra 476 (503) |.++.--.||.+ +.+=.-|-+-+-|| ||-|||+.+-....||..++ -|+|.|+--+ T Consensus 54 G~a~~~A~G~k~-------a~p~~~Vva~~GDG~~~~iG~~~l~~A~~rn~~i~--------~iv~DN~~yg 110 (193) T cd03375 54 GRALAVATGVKL-------ANPDLTVIVVSGDGDLAAIGGNHFIHAARRNIDIT--------VIVHNNQIYG 110 (193) T ss_pred CCCHHHHHHHHH-------HCCCCEEEEEEECCHHHHCCHHHHHHHHHHCCCEE--------EEEECCCCEE T ss_conf 160888867987-------59997399997065576456188999998099749--------9998584044 No 25 >TIGR02354 thiF_fam2 thiamine biosynthesis protein ThiF; InterPro: IPR012729 Members of the HesA/MoeB/ThiF family of proteins (IPR000594 from INTERPRO) include a number of members encoded in the midst of thiamine biosynthetic operons. This mix of known and putative ThiF proteins shows a deep split in phylogenetic trees. The Escherichia coli ThiF and MoeB proteins are seemingly more closely related than the E. coli ThiF and Campylobacter (for example) ThiF. This entry represents the divergent clade of putative ThiF proteins as found in Campylobacter.. Probab=35.64 E-value=18 Score=15.85 Aligned_cols=26 Identities=19% Similarity=0.238 Sum_probs=17.5 Q ss_pred EEEECCCEEEEECCCCCHHHHHHHHHHH Q ss_conf 1674698289981788256887799973 Q gi|254780612|r 432 IFVISKNSYLISLKGSEEDPFRNSKILE 459 (503) Q Consensus 432 ~~kv~~~~f~v~l~~~~~~~~~n~~ll~ 459 (503) +=||++.||++| |...+...=..||- T Consensus 157 trKI~k~FY~cG--Dg~~~ak~G~gLma 182 (200) T TIGR02354 157 TRKISKRFYVCG--DGKSDAKSGRGLMA 182 (200) T ss_pred CCEECCCCEEEE--CCCCCCCCCCCCCC T ss_conf 121146303874--47222001631025 No 26 >COG3204 Uncharacterized protein conserved in bacteria [Function unknown] Probab=35.34 E-value=20 Score=15.62 Aligned_cols=117 Identities=15% Similarity=0.125 Sum_probs=66.9 Q ss_pred EECCCCCEEEEEEECCCCCCCCCEEEEEEEEEECC---C---CC-CCCHHHCCCCEECCCCCC--------CCC-CCCC- Q ss_conf 30787771799995178987778128899997168---8---88-732110323102133212--------466-3110- Q gi|254780612|r 368 IPMIDNAFSASMTLKCNADISLSITHVMEIMFSFP---K---ES-QDAVVDLRRISMRKTDNS--------PSV-LIDS- 430 (503) Q Consensus 368 i~iP~~~~~~~~~~rrn~d~slpash~~e~~f~~~---~---~~-~~~v~~v~~~~~k~~e~~--------~g~-~l~~- 430 (503) .-|=|+.....-.|.=+.|+.+= -+..+.|.|. . +| |=+-+-+.+-.|=..|-. ++. .|.. T Consensus 142 fvi~dER~~~l~~~~vd~~t~~~--~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P~~I~~~~~~~~~l~~~ 219 (316) T COG3204 142 FVIVDERDRALYLFTVDADTTVI--SAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNPIGIFEVTQSPSSLSVH 219 (316) T ss_pred EEEEEHHCCEEEEEEECCCCCEE--ECCCEEEECCCCCCCCCCCEEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCC T ss_conf 99886122658999986896187--2132587525568877572334417788648999813882899981488533220 Q ss_pred EEEEECCC--EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEECCCCCH Q ss_conf 01674698--289981788256887799973078116878837996-789996158768 Q gi|254780612|r 431 NIFVISKN--SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTIDKGKKG 486 (503) Q Consensus 431 ~~~kv~~~--~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~ekg~~g 486 (503) ++.-.++. |||-.+|+++-+...|..|...-.-=-|--+...|. +-+|++-||--| T Consensus 220 ~~~~~~~~~~~f~~DvSgl~~~~~~~~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~g 278 (316) T COG3204 220 ASLDPTADRDLFVLDVSGLEFNAITNSLLVLSDESRRLLEVDLSGEVIELLSLTKGNHG 278 (316) T ss_pred CCCCCCCCCCEEEECCCCCEECCCCCCEEEEECCCCEEEEEECCCCEEEEEEECCCCCC T ss_conf 24582101262762034415348888089986477238998358872543772267677 No 27 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=35.24 E-value=28 Score=14.53 Aligned_cols=29 Identities=17% Similarity=0.198 Sum_probs=22.9 Q ss_pred HHHHHHHHHCCCCEEEEEEECCCCE-EEEE Q ss_conf 8877999730781168788379967-8999 Q gi|254780612|r 451 PFRNSKILEEYRFIDIPITYRSGQK-ILFT 479 (503) Q Consensus 451 ~~~n~~ll~~~~w~dip~~y~ngrr-a~~~ 479 (503) ...=+++|+++++-.+|+|..+||- .||| T Consensus 11 v~~a~~~m~~~~i~~lPVVd~~~~lvGiit 40 (49) T smart00116 11 LEEALELLREHGIRRLPVVDEEGRLVGIVT 40 (49) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCEEEEEE T ss_conf 999999999809985769989991999988 No 28 >pfam03367 zf-ZPR1 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other mitogens, ZPR1 accumulates in the nucleolus. ZPR1 interacts with the cytoplasmic domain of the inactive EGF receptor (EGFR) and is thought to inhibit the basal protein tyrosine kinase activity of EGFR. This interaction is disrupted when cells are treated with EGF, though by themselves, inactive EGFRs are not sufficient to sequester ZPR1 to the cytoplasm. Upon stimulation by EGF, ZPR1 directly binds the eukaryotic translation elongation factor-1alpha (eEF-1alpha) to form ZPR1/eEF-1alpha complexes. These move into the nucleus, localising particularly at the nucleolus. Indeed, the interaction between ZPR1 and eEF-1alpha has been shown to be essential for normal cellular proliferation, and ZPR1 is thought to be involved Probab=34.98 E-value=29 Score=14.42 Aligned_cols=62 Identities=8% Similarity=0.103 Sum_probs=40.0 Q ss_pred CEEEEEEECCCCCEEEEEEECCCCCC------CCCEEE-EEEEEEECCCCC-CCCHHHCCCCEECCCCCC Q ss_conf 42367630787771799995178987------778128-899997168888-732110323102133212 Q gi|254780612|r 362 LVIKGDIPMIDNAFSASMTLKCNADI------SLSITH-VMEIMFSFPKES-QDAVVDLRRISMRKTDNS 423 (503) Q Consensus 362 ~aira~i~iP~~~~~~~~~~rrn~d~------slpash-~~e~~f~~~~~~-~~~v~~v~~~~~k~~e~~ 423 (503) .-|+.--.|.++|.+.+|.+..-.|. |=-|+- +=||-|.+|... +|-|.||-||+++--++= T Consensus 41 ~ev~~~~~~~p~g~~~~l~V~~~~DL~r~VvkS~tati~IPEL~~eip~~~~~G~iTtVEG~L~~~~d~L 110 (160) T pfam03367 41 NEVQSGGEIEPKGTRYTLKVEDEEDLNRDVVKSETASISIPELGLEIPGPALGGRFTTVEGLLERVIEGL 110 (160) T ss_pred EEEEECCCCCCCEEEEEEEECCHHHHCCEEEECCCEEEEEEEEEEEECCCCCCCEEEEHHHHHHHHHHHH T ss_conf 4888976669970899999889798156179768538996555258669876868980999999999999 No 29 >PRK01659 consensus Probab=34.71 E-value=12 Score=17.16 Aligned_cols=57 Identities=11% Similarity=0.099 Sum_probs=33.1 Q ss_pred CCCCCCCCEEE---EECCCEEEEECCCCCHHHHHHHHHHHCCC-CEEEEEEECCCCEEEEE Q ss_conf 24663110016---74698289981788256887799973078-11687883799678999 Q gi|254780612|r 423 SPSVLIDSNIF---VISKNSYLISLKGSEEDPFRNSKILEEYR-FIDIPITYRSGQKILFT 479 (503) Q Consensus 423 ~~g~~l~~~~~---kv~~~~f~v~l~~~~~~~~~n~~ll~~~~-w~dip~~y~ngrra~~~ 479 (503) .-|+|+.-.-. .=.|..++|-|+.....+..|+.|+++-. =+.|||.|..|=|.+=. T Consensus 28 ~~gDP~~~ak~~~~~Gad~ihivDld~a~~g~~~n~~~I~~i~~~~~ipi~vGGGIrs~e~ 88 (252) T PRK01659 28 DVGDPVEIAAAYNEAGADELVFLDITATHEGRKTMVDVVEKVAAKVFIPLTVGGGISSVKD 88 (252) T ss_pred ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCEECHHH T ss_conf 8789999999999879999999946766568864899999999756974799633200688 No 30 >KOG3047 consensus Probab=34.69 E-value=22 Score=15.27 Aligned_cols=32 Identities=16% Similarity=0.179 Sum_probs=22.9 Q ss_pred CEEEEECCCEEEE-ECCCCCHHHHHHHHHHHCC Q ss_conf 0016746982899-8178825688779997307 Q gi|254780612|r 430 SNIFVISKNSYLI-SLKGSEEDPFRNSKILEEY 461 (503) Q Consensus 430 ~~~~kv~~~~f~v-~l~~~~~~~~~n~~ll~~~ 461 (503) -+-|+..|||||- -|-+.=-..+|.++||++. T Consensus 88 ~i~VaL~~~fflElkLadAiKf~DRK~dlLkel 120 (157) T KOG3047 88 HIVVALCDDFFLELKLADAIKFCDRKMDLLKEL 120 (157) T ss_pred EEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHH T ss_conf 477775066211001998998887767999999 No 31 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=34.38 E-value=30 Score=14.36 Aligned_cols=78 Identities=17% Similarity=0.361 Sum_probs=49.4 Q ss_pred EEEEEEECCC-CCCCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 8999971688-887321103231021332124663110016746982899817882568877999730781168788379 Q gi|254780612|r 394 VMEIMFSFPK-ESQDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRS 472 (503) Q Consensus 394 ~~e~~f~~~~-~~~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~n 472 (503) -+.-+|++-+ +---|+.....|.... ...++.-+ ++.+++.|.-+. ...-=+.+|++.++--||++..| T Consensus 24 ~~~~i~Vvd~~~~l~G~vt~~dll~~~----~~~~v~~i---M~~~~~~v~~~~---~~~~a~~~m~~~~~~~lPVVd~~ 93 (109) T cd04606 24 TIYYIYVVDEEGRLLGVVSLRDLLLAD----PDTPVSDI---MDTDVISVSADD---DQEEVARLFEKYDLLALPVVDEE 93 (109) T ss_pred EEEEEEEECCCCEEEEEEEHHHHHHCC----CCCCHHHH---HCCCEEECCCCC---CHHHHHHHHHHHCCCEEEEECCC T ss_conf 177999997999799999867722418----88716663---002136337999---89999999998527604689889 Q ss_pred CC-EEEEEEC Q ss_conf 96-7899961 Q gi|254780612|r 473 GQ-KILFTID 481 (503) Q Consensus 473 gr-ra~~~~e 481 (503) || ..+||+. T Consensus 94 ~~lvGiIt~~ 103 (109) T cd04606 94 GRLVGIITVD 103 (109) T ss_pred CEEEEEEEHH T ss_conf 9799999968 No 32 >PRK00830 consensus Probab=34.29 E-value=17 Score=16.13 Aligned_cols=84 Identities=12% Similarity=0.069 Sum_probs=57.4 Q ss_pred CCHHHCCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEEEEC- Q ss_conf 32110323102133212466311001---67469828998178825688779997307-81168788379967899961- Q gi|254780612|r 407 DAVVDLRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILFTID- 481 (503) Q Consensus 407 ~~v~~v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~~~e- 481 (503) .++.=|+|+.||.+- --|+|+.-+- -+=.|-.|+|-|+....-+..|+.|+++- .=+.+||.|..|=|.+=.++ T Consensus 17 k~~rlvKg~~f~~~~-y~gdP~~~ak~~~~~gadelhivDld~a~~g~~~~~~~I~~i~~~~~~pi~vGGGIrs~e~~~~ 95 (273) T PRK00830 17 PGGRVVKGVEFVQLR-YAGDPVELAKRYYEDGADELVFLDITASHEGRATMIDVIERTAEEVFIPLTVGGGIRSIEDIRQ 95 (273) T ss_pred CCCEEEECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEECCEEECCCHHH T ss_conf 379898586776875-7889999999999879998999953246468842799999999866995896088437732899 Q ss_pred -------CCCCHHHHHH Q ss_conf -------5876899999 Q gi|254780612|r 482 -------KGKKGADIFK 491 (503) Q Consensus 482 -------kg~~g~~~f~ 491 (503) |=.-|..||+ T Consensus 96 ll~~GadkVvIgS~a~~ 112 (273) T PRK00830 96 ILRAGADKVSVNTAAVK 112 (273) T ss_pred HHHCCCCEEECHHHHHH T ss_conf 99769863983798985 No 33 >TIGR03578 EF_0831 conserved hypothetical protein EF_0831/AHA_3912. Members of this family of small (about 100 amino acid), relatively rare proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown. Probab=32.33 E-value=32 Score=14.13 Aligned_cols=21 Identities=24% Similarity=0.325 Sum_probs=15.0 Q ss_pred ECCCCCHHHHHHHHHHHHHHC Q ss_conf 615876899999999998522 Q gi|254780612|r 480 IDKGKKGADIFKSAIMQWENR 500 (503) Q Consensus 480 ~ekg~~g~~~f~~a~~~w~~~ 500 (503) =-||-.-++||++||+.-|.+ T Consensus 11 ~GkG~tKe~Afa~a~s~iq~~ 31 (96) T TIGR03578 11 SGKGNTKQLAFASALSKIQKK 31 (96) T ss_pred ECCCCCHHHHHHHHHHHHHHH T ss_conf 847863899999999999999 No 34 >COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism] Probab=32.30 E-value=16 Score=16.30 Aligned_cols=22 Identities=23% Similarity=0.357 Sum_probs=19.7 Q ss_pred HHHHHHHCCCCEEEEEEECCCC Q ss_conf 7799973078116878837996 Q gi|254780612|r 453 RNSKILEEYRFIDIPITYRSGQ 474 (503) Q Consensus 453 ~n~~ll~~~~w~dip~~y~ngr 474 (503) -|+.|||+.+-++.||+.++|- T Consensus 140 QNF~LLke~G~~~kPvLLKRg~ 161 (286) T COG2876 140 QNFALLKEVGRQNKPVLLKRGL 161 (286) T ss_pred HHHHHHHHHCCCCCCEEEECCC T ss_conf 5169999823559976972474 No 35 >PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=32.29 E-value=32 Score=14.13 Aligned_cols=54 Identities=15% Similarity=0.294 Sum_probs=38.3 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCC------------------CEEEEEECCCCCHHHHHHHHHHHHHHC Q ss_conf 28998178825688779997307811687883799------------------678999615876899999999998522 Q gi|254780612|r 439 SYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSG------------------QKILFTIDKGKKGADIFKSAIMQWENR 500 (503) Q Consensus 439 ~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ng------------------rra~~~~ekg~~g~~~f~~a~~~w~~~ 500 (503) ..-|||-|.++|.. +| .+-||+||-+|= ++-.+|=.-|+.| .|+|+..|..+ T Consensus 227 v~tIaLGDspNDi~----ML---eaaD~aVvI~~p~~~~l~~~~~~~~~~~~~~~~~~a~~~GP~G---W~eaV~~~L~~ 296 (302) T PRK12702 227 IKALGIGCSPPDLA----FL---RWSEQKVVLPSPIADSLWKEALRLGGPEVQPQWQLAQLPGPEG---WNEVVLMWLEE 296 (302) T ss_pred CEEEEECCCHHHHH----HH---HHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCHH---HHHHHHHHHHH T ss_conf 54999718776599----99---8588769968998776655444567766676347668998467---99999999986 Q ss_pred CC Q ss_conf 36 Q gi|254780612|r 501 SN 502 (503) Q Consensus 501 ~~ 502 (503) ++ T Consensus 297 ~~ 298 (302) T PRK12702 297 TD 298 (302) T ss_pred HH T ss_conf 51 No 36 >PRK00748 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated Probab=32.04 E-value=22 Score=15.27 Aligned_cols=25 Identities=24% Similarity=0.320 Sum_probs=15.4 Q ss_pred HHHHHHHCC-CCEEEEEEECCCCEEE Q ss_conf 779997307-8116878837996789 Q gi|254780612|r 453 RNSKILEEY-RFIDIPITYRSGQKIL 477 (503) Q Consensus 453 ~n~~ll~~~-~w~dip~~y~ngrra~ 477 (503) =|+.|++.- .+++||+.|..|=+.+ T Consensus 176 ~d~~l~~~i~~~~~ipviasGGv~s~ 201 (241) T PRK00748 176 PNVELTRELAAATPIPVIASGGVSSL 201 (241) T ss_pred CCHHHHHHHHHHCCCCEEEECCCCCH T ss_conf 89999999998689989998899999 No 37 >PRK04281 consensus Probab=31.92 E-value=20 Score=15.56 Aligned_cols=42 Identities=19% Similarity=0.293 Sum_probs=21.1 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE Q ss_conf 69828998178825688779997307-8116878837996789 Q gi|254780612|r 436 SKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL 477 (503) Q Consensus 436 ~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~ 477 (503) .|-.++|=|+....-+..|+.++++- .=+.|||.|..|=|-+ T Consensus 44 adelhivDld~a~~~~~~~~~~I~~i~~~~~vpi~vGGGIrs~ 86 (254) T PRK04281 44 ADELTFLDITASSDNRDTILHIIEEVAGQVFIPLTVGGGVRTV 86 (254) T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCEEEC T ss_conf 9999999688987775308999999985079628997775451 No 38 >pfam09837 DUF2064 Uncharacterized protein conserved in bacteria (DUF2064). This domain, found in various prokaryotic proteins, has no known function. Probab=30.55 E-value=23 Score=15.10 Aligned_cols=20 Identities=25% Similarity=0.433 Sum_probs=13.1 Q ss_pred ECCCEEEEECCCCCHHHHHH Q ss_conf 46982899817882568877 Q gi|254780612|r 435 ISKNSYLISLKGSEEDPFRN 454 (503) Q Consensus 435 v~~~~f~v~l~~~~~~~~~n 454 (503) -.-||+||||+....+.++| T Consensus 97 ~DGGy~LiG~~~~~~~lF~~ 116 (121) T pfam09837 97 EDGGYYLLGLRRPTPDLFEG 116 (121) T ss_pred CCCCEEEEEECCCCHHHCCC T ss_conf 88999999747778887079 No 39 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=30.34 E-value=35 Score=13.91 Aligned_cols=43 Identities=14% Similarity=0.254 Sum_probs=24.9 Q ss_pred ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEE Q ss_conf 4698289981788256887799973078116878837996789996 Q gi|254780612|r 435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTI 480 (503) Q Consensus 435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ 480 (503) .+.+++.|.- +.....=+++|.+++--.+|++..+.=.+|||. T Consensus 58 Mt~~~~tv~~---~~~~~~a~~~m~~~~i~~lpVvd~~~lvGiIt~ 100 (106) T cd04638 58 MTRDPPTVSP---DDDVKEAAKLMVENNIRRVPVVDDGKLVGIVTV 100 (106) T ss_pred HCCCCEECCC---CCCHHHHHHHHHHCCCCEEEEEECCEEEEEEEH T ss_conf 3589736799---491999999998769759999999999999997 No 40 >pfam06089 Asparaginase_II L-asparaginase II. This family consists of several bacterial L-asparaginase II proteins. L-asparaginase (EC:3.5.1.1) catalyses the hydrolysis of L-asparagine to L-aspartate and ammonium. Rhizobium etli possesses two asparaginases: asparaginase I, which is thermostable and constitutive, and asparaginase II, which is thermolabile, induced by asparagine and repressed by the carbon source. Probab=30.26 E-value=35 Score=13.90 Aligned_cols=31 Identities=10% Similarity=0.067 Sum_probs=20.5 Q ss_pred CCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEE Q ss_conf 10016746982899817882568877999730781168 Q gi|254780612|r 429 DSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDI 466 (503) Q Consensus 429 ~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~di 466 (503) .|+++||.||-. ..--.-=+.||+..+|||- T Consensus 267 ~gialKi~DGs~-------Ra~~~~~~~~L~~lg~~~~ 297 (325) T pfam06089 267 LGIALKIEDGAT-------RAAEAVAAALLRQLGVLDP 297 (325) T ss_pred EEEEEEECCCCH-------HHHHHHHHHHHHHCCCCCH T ss_conf 599999637857-------5799999999998599986 No 41 >PRK02621 consensus Probab=29.23 E-value=31 Score=14.30 Aligned_cols=42 Identities=14% Similarity=0.202 Sum_probs=23.5 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE Q ss_conf 69828998178825688779997307-8116878837996789 Q gi|254780612|r 436 SKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL 477 (503) Q Consensus 436 ~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~ 477 (503) .|..++|-|......+..|++|+++- .=+.|||.|..|=|-+ T Consensus 44 ad~lhivDld~a~~~~~~~~~~I~~i~~~~~ipi~vGGGIrs~ 86 (254) T PRK02621 44 ADELVFLDITATHEGRATLIDVVYRTAEQVFIPLTVGGGISSL 86 (254) T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCEEEH T ss_conf 9999998266765675428999999998679858996335357 No 42 >PRK13597 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=28.78 E-value=32 Score=14.16 Aligned_cols=66 Identities=9% Similarity=0.131 Sum_probs=36.6 Q ss_pred CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEE Q ss_conf 323102133212466311001---67469828998178825688779997307-81168788379967899 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILF 478 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~ 478 (503) |+++.++..- --|+|+.-+- -.=.|..|+|=|+.....+..|+.++++- .=+.|||.|..|=|.+= T Consensus 19 Vkg~~~~~~~-~~gdP~~~a~~~~~~Gad~lhlvDld~a~~~~~~n~~~I~~i~~~~~vpiqvGGGIrs~e 88 (252) T PRK13597 19 VKGVNFVNLR-DAGDPVEAARAYDEAGADELVFLDISATHEERAILLDVVARVAERVFIPLTVGGGVRSLE 88 (252) T ss_pred EECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHH T ss_conf 8787777867-888999999999986999999995646666866379999999862698289847713089 No 43 >PRK03220 consensus Probab=28.77 E-value=35 Score=13.86 Aligned_cols=65 Identities=8% Similarity=0.141 Sum_probs=43.2 Q ss_pred CCCCEECCCCCCCCCCCCCEEE---EECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEE Q ss_conf 3231021332124663110016---7469828998178825688779997307-8116878837996789 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSNIF---VISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKIL 477 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~~~---kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~ 477 (503) |+++.++... .-|+||.-.-. .=.|-+++|-|+.....+..|+++++.- .=+.+||.|..|=|-+ T Consensus 19 Vkg~~~~~~~-~~gdP~~~a~~~~~~G~d~lhivDld~a~~g~~~n~~~I~~i~~~~~~pi~vGGGIrs~ 87 (257) T PRK03220 19 VKGVNFENLR-DAGDPVELAAVYDAEGADELTFLDVTASSSGRATMLDVVRRTAEQVFIPLTVGGGVRTV 87 (257) T ss_pred EECCCCCCCE-ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCH T ss_conf 8577777867-88899999999998699989999088875676307999999985069648984785879 No 44 >PRK10076 pyruvate formate lyase II activase; Provisional Probab=28.77 E-value=37 Score=13.73 Aligned_cols=32 Identities=19% Similarity=0.264 Sum_probs=16.6 Q ss_pred CEEEEECCCCCHHHH------------HHHHHHHCCC---CEEEEEE Q ss_conf 828998178825688------------7799973078---1168788 Q gi|254780612|r 438 NSYLISLKGSEEDPF------------RNSKILEEYR---FIDIPIT 469 (503) Q Consensus 438 ~~f~v~l~~~~~~~~------------~n~~ll~~~~---w~dip~~ 469 (503) |+||+-|...+.++- .|+..|.+++ ||++|++ T Consensus 91 Dl~L~DiK~~d~~~h~~~TG~~n~~il~Nl~~l~~~~~~v~iR~pvI 137 (213) T PRK10076 91 DEVLFDLKIMDATQARDVVKMNLPRVLENLRLLVSEGVNVIPRLPLI 137 (213) T ss_pred CEEEEEECCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEEEEEE T ss_conf 98998617798489999979993999999999996799689988677 No 45 >KOG1383 consensus Probab=28.73 E-value=37 Score=13.73 Aligned_cols=36 Identities=14% Similarity=0.202 Sum_probs=24.0 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 982899817882568877999730781168788379 Q gi|254780612|r 437 KNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRS 472 (503) Q Consensus 437 ~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~n 472 (503) +..=+|++++...+..+=.++||.++||==-+..=+ T Consensus 382 p~vs~~~~~s~~~~i~elsd~l~~~GW~lnalq~P~ 417 (491) T KOG1383 382 PLVSFILFGSNDVNIFELSDLLRKKGWILNALQFPA 417 (491) T ss_pred CCEEEEECCCCCCCCEEHHHHHHHCCCCCCCCCCCC T ss_conf 737999716876531114578986386766567887 No 46 >pfam11217 DUF3013 Protein of unknown function (DUF3013). This bacterial family of proteins with unknown function appear to be restricted to Firmicutes. Probab=28.72 E-value=29 Score=14.42 Aligned_cols=25 Identities=20% Similarity=0.282 Sum_probs=17.9 Q ss_pred EEEEEEEEECC-CCC-CCCHHHCCCCE Q ss_conf 28899997168-888-73211032310 Q gi|254780612|r 392 THVMEIMFSFP-KES-QDAVVDLRRIS 416 (503) Q Consensus 392 sh~~e~~f~~~-~~~-~~~v~~v~~~~ 416 (503) +|+|||.|++= +|. |--|.+..|+. T Consensus 31 nH~iEv~f~l~aen~~~v~~~D~~G~~ 57 (160) T pfam11217 31 NHAIELEFTLEAQNKEGIETVDDEGEV 57 (160) T ss_pred CCEEEEEEEEEEECCCCEEEECCCCCC T ss_conf 746999999998254550678687725 No 47 >PRK12595 bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed Probab=28.16 E-value=33 Score=14.02 Aligned_cols=33 Identities=15% Similarity=0.281 Sum_probs=26.6 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC Q ss_conf 98289981788256887799973078116878837996 Q gi|254780612|r 437 KNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ 474 (503) Q Consensus 437 ~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr 474 (503) =|+++||=-|.. |..||++-.=.|.||..+||- T Consensus 202 vDilqIGARnmq-----Nf~LLk~vg~~~kPVLlKrg~ 234 (360) T PRK12595 202 VDVIQIGARNMQ-----NFELLKAAGRVNKPVLLKRGL 234 (360) T ss_pred CCEEEECCHHCC-----CHHHHHHHHCCCCCEEEECCC T ss_conf 868988841035-----999999986139937960799 No 48 >COG2987 HutU Urocanate hydratase [Amino acid transport and metabolism] Probab=27.47 E-value=26 Score=14.78 Aligned_cols=29 Identities=28% Similarity=0.554 Sum_probs=22.0 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCC--------CCEEE Q ss_conf 828998178825688779997307--------81168 Q gi|254780612|r 438 NSYLISLKGSEEDPFRNSKILEEY--------RFIDI 466 (503) Q Consensus 438 ~~f~v~l~~~~~~~~~n~~ll~~~--------~w~di 466 (503) -|-||+||..+.|.-+-=..++|. +|||+ T Consensus 360 PFRW~aLSgdpeDi~~tD~~~~el~p~n~~l~~Wid~ 396 (561) T COG2987 360 PFRWVALSGDPEDIYKTDAAVKELFPDNKHLHRWIDM 396 (561) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 7157870699788888799999858982889999999 No 49 >pfam12281 DUF3620 Protein of unknown function (DUF3620). This family of proteins is found in bacteria. Proteins in this family are typically between 281 and 358 amino acids in length. There are two completely conserved residues (G and P) that may be functionally important. Probab=27.25 E-value=39 Score=13.55 Aligned_cols=110 Identities=17% Similarity=0.177 Sum_probs=56.2 Q ss_pred EEEEEECCCCCEEEEEEECCCCCCCCCE-EEEEEEEEE-CCCCCCC-CHHHCCCCEECCCCCCCCCCCCCEEEEECCCEE Q ss_conf 3676307877717999951789877781-288999971-6888873-211032310213321246631100167469828 Q gi|254780612|r 364 IKGDIPMIDNAFSASMTLKCNADISLSI-THVMEIMFS-FPKESQD-AVVDLRRISMRKTDNSPSVLIDSNIFVISKNSY 440 (503) Q Consensus 364 ira~i~iP~~~~~~~~~~rrn~d~slpa-sh~~e~~f~-~~~~~~~-~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f 440 (503) --+||.+- ....+.+.|--+.|.+|.. =+-+++.|. +|...++ ...|-+|...-==--.+|+.. +-..--+ T Consensus 47 ~T~DvDla-~~~~~slal~~~~~~~ll~vL~~vd~~F~~vp~~~~~~~~~~~~g~~VdfLtp~~~~~~-----~~~~p~~ 120 (216) T pfam12281 47 ATGDVDLA-QDFRRSLALGDSVRPGLLGLLRSVDLTFEPVPRLSGGFRARNRQGFMVEFLTPSRGGGE-----GEDGPRL 120 (216) T ss_pred HCCCHHHH-HHCCCCCCCCCCCCCCHHHHHHHCCCCCEECCCCCCCCEEECCCCEEEEEECCCCCCCC-----CCCCCCC T ss_conf 10364677-63112313555567539999986498833776557885104158759998468888875-----5578654 Q ss_pred EEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEE Q ss_conf 9981788256887799973078116878837996789996 Q gi|254780612|r 441 LISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKILFTI 480 (503) Q Consensus 441 ~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra~~~~ 480 (503) |-+|-++....-|+|+.|-..- |+..++|+||+=+.+.+ T Consensus 121 ~~~~~~~~A~~l~~l~~Ll~~P-v~av~l~~~G~~v~v~v 159 (216) T pfam12281 121 LPALDGLSAQPLRFLDWLLNDP-VRAVVLDRSGRPVRVQV 159 (216) T ss_pred CCCCCCCCCCCCCCCCCCCCCC-EEEEEECCCCCEEEEEC T ss_conf 7654783632235120010598-06899837896789957 No 50 >PRK02747 consensus Probab=26.48 E-value=21 Score=15.48 Aligned_cols=67 Identities=7% Similarity=0.106 Sum_probs=43.4 Q ss_pred CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEEEEE Q ss_conf 323102133212466311001---67469828998178825688779997307-811687883799678999 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKILFT 479 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra~~~ 479 (503) |+++.++.. .--|+|+.-+- -.=.|-.++|-|+....-+..|+.|+++- +=+.|||.|..|=|.+=. T Consensus 18 Vkg~~~~~~-~~~gdP~~~ak~~~~~Gadelh~vDl~~a~~~~~~~~~lI~~i~~~~~ipi~vGGGIrs~e~ 88 (257) T PRK02747 18 VKGVNFVDL-RDAGDPVEAARAYDAAGADELCFLDITASHENRGTMLDVVARTAEQCFMPLTVGGGVRTVDD 88 (257) T ss_pred EECCCCCCE-EECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHHH T ss_conf 858777770-78889999999999869998999947677567552899999999866998898488207388 No 51 >pfam07946 DUF1682 Protein of unknown function (DUF1682). The members of this family are all hypothetical eukaryotic proteins of unknown function. One member is described as being an adipocyte-specific protein, but no evidence of this was found. Probab=26.43 E-value=28 Score=14.54 Aligned_cols=15 Identities=7% Similarity=0.304 Sum_probs=8.1 Q ss_pred ECCCCEEEEEECCCC Q ss_conf 379967899961587 Q gi|254780612|r 470 YRSGQKILFTIDKGK 484 (503) Q Consensus 470 y~ngrra~~~~ekg~ 484 (503) ++.-+|-+|+|--=. T Consensus 212 ~~~~k~l~~~~~lP~ 226 (322) T pfam07946 212 TKPKKRLILSFRLPK 226 (322) T ss_pred CCCCEEEEEEEECCC T ss_conf 667608999997389 No 52 >cd04495 BRCA2DBD_OB3 BRCA2DBD_OB3: A subfamily of OB folds corresponding to the third OB fold (OB3) of the 800-amino acid C-terminal ssDNA binding domain (DBD) of BRCA2 (breast cancer susceptibility gene 2) protein, called BRCA2DBD. BRCA2 participates in homologous recombination-mediated repair of double-strand DNA breaks. It stimulates the displacement of Replication protein A (RPA), the most abundant eukaryotic ssDNA binding protein. It also facilitates filament formation. Mutations that map throughout the BRCA2 protein are associated with breast cancer susceptibility. BRCA2 is a large nuclear protein and its most conserved region is the C-terminal BRCA2DBD. BRCA2DBD binds ssDNA in vitro, and is composed of five structural domains, three of which are OB folds (OB1, OB2, and OB3). BRCA2DBD OB2 and OB3 are arranged in tandem, and their mode of binding can be considered qualitatively similar to two OB folds of RPA1, DBD-A and DBD-B (the major DBDs of RPA). Probab=26.31 E-value=14 Score=16.73 Aligned_cols=40 Identities=10% Similarity=0.136 Sum_probs=25.5 Q ss_pred HHHHHHCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHHHH Q ss_conf 799973078116878837996789996158768999999999985 Q gi|254780612|r 454 NSKILEEYRFIDIPITYRSGQKILFTIDKGKKGADIFKSAIMQWE 498 (503) Q Consensus 454 n~~ll~~~~w~dip~~y~ngrra~~~~ekg~~g~~~f~~a~~~w~ 498 (503) |||.=.+..| +||+.|..-. ..+| -.+-+.-|.++|..-+ T Consensus 57 NLQwR~~s~~-~iPtl~A~e~-s~FS---~nPKe~hlQe~~~~Lk 96 (100) T cd04495 57 NLQWRTESTS-GVPTLFAGEY-STFS---ANPKESHLQEEFNNLK 96 (100) T ss_pred CCEEECCCCC-CCCEEEECCC-CEEC---CCCCHHHHHHHHHHHH T ss_conf 3255046567-7860664032-0214---7940778999999988 No 53 >PRK13415 flagella biosynthesis protein FliZ; Provisional Probab=26.23 E-value=41 Score=13.43 Aligned_cols=40 Identities=10% Similarity=0.265 Sum_probs=28.0 Q ss_pred CCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHC Q ss_conf 321103231021332124663110016746982899817882568877999730 Q gi|254780612|r 407 DAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEE 460 (503) Q Consensus 407 ~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~ 460 (503) +.|.|+.|+.+-.|- .=.-+||.+..|+||.- -|+||||| T Consensus 105 ~~i~~lGGtslG~nr-------SvQlvkvG~~l~vvGVG-------e~iqLLke 144 (216) T PRK13415 105 QYVENIGGTSVGQNR-------SVQLIKVGNRVLVVGVG-------ESIQLLKE 144 (216) T ss_pred CHHHHCCCCCCCCCC-------EEEEEEECCEEEEEECC-------CHHHHHHH T ss_conf 234522674457886-------57899976889999547-------25789986 No 54 >KOG1693 consensus Probab=26.17 E-value=14 Score=16.67 Aligned_cols=103 Identities=10% Similarity=0.018 Sum_probs=61.6 Q ss_pred EEEEEEECCCCCEEEEEEECCCCCCCCCEEEEEEEEEECCCCCCCCHHHCCCCEECCCCCCCCCCCC--CEEEEECCCEE Q ss_conf 2367630787771799995178987778128899997168888732110323102133212466311--00167469828 Q gi|254780612|r 363 VIKGDIPMIDNAFSASMTLKCNADISLSITHVMEIMFSFPKESQDAVVDLRRISMRKTDNSPSVLID--SNIFVISKNSY 440 (503) Q Consensus 363 aira~i~iP~~~~~~~~~~rrn~d~slpash~~e~~f~~~~~~~~~v~~v~~~~~k~~e~~~g~~l~--~~~~kv~~~~f 440 (503) -|-=+|+-||.++=..-+=||-.|-++=|.|.=+-+|-+..+|+..-..+--+.++--++-.--+.. ...+++.=+-- T Consensus 60 DVD~~I~aPdgkvI~~~~kk~~~~~~f~ae~~G~Y~fCFsN~fstf~~Kiv~~~~q~~~~~~~~~~~~~~~~~~~~mena 139 (209) T KOG1693 60 DVDYDIEAPDGKVIYSEKKKRYDSFLFKAEGKGEYTFCFSNEFSTFSHKIVYMDFQVGEEPPLHPAVSNRDTALTQMENA 139 (209) T ss_pred EEEEEEECCCCCEEEECCCCCCCCEEEEEECCEEEEEEECCCCCCCCCEEEEEHHHHCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 25889988999887540102103379998511689999627645642237651024244666673312365488888889 Q ss_pred EEECCCCCHHHHHHHHHHHCCCCEE Q ss_conf 9981788256887799973078116 Q gi|254780612|r 441 LISLKGSEEDPFRNSKILEEYRFID 465 (503) Q Consensus 441 ~v~l~~~~~~~~~n~~ll~~~~w~d 465 (503) .+.....=....||.+-+|-|.|=+ T Consensus 140 ~~~I~~~L~~I~~~q~y~R~RE~rn 164 (209) T KOG1693 140 IVEIHRALNKIDDTQTYYRLREARN 164 (209) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999998899998887628 No 55 >smart00310 PTBI Phosphotyrosine-binding domain (IRS1-like). Probab=25.97 E-value=41 Score=13.40 Aligned_cols=68 Identities=18% Similarity=0.164 Sum_probs=51.4 Q ss_pred EEC-CCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCCEE-----EEEECCCCCHHHHHHHHHHHHHHCCC Q ss_conf 746-9828998178825688779997307811687883799678-----99961587689999999999852236 Q gi|254780612|r 434 VIS-KNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQKI-----LFTIDKGKKGADIFKSAIMQWENRSN 502 (503) Q Consensus 434 kv~-~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngrra-----~~~~ekg~~g~~~f~~a~~~w~~~~~ 502 (503) .++ +.+-|+-++.-..-..=.|..||..+-.+--..++.|||+ +++|+ =.-|+.+|+-..++=+++.| T Consensus 25 ~lt~~~L~L~~~~~~~~l~~wpl~~lRryG~~~~~F~fEaGRrc~~G~G~f~f~-t~~~~~i~~~v~~am~a~k~ 98 (98) T smart00310 25 RLTSETLVLWRLNPRVELVVWPLLSLRRYGRDKNFFFFEAGRRCVSGPGEFTFQ-TVVAQEIFQLVLEAMQAQSN 98 (98) T ss_pred EECCCEEEEECCCCCCEEEEEEHHHEEECCCCCCEEEEECCCCCCCCCCEEEEE-CCCHHHHHHHHHHHHHHHCC T ss_conf 987888999868999508996736623403689999999058878898679998-28499999999999986029 No 56 >PRK13396 3-deoxy-7-phosphoheptulonate synthase; Provisional Probab=25.82 E-value=40 Score=13.50 Aligned_cols=37 Identities=16% Similarity=0.321 Sum_probs=28.7 Q ss_pred EEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC Q ss_conf 1674698289981788256887799973078116878837996 Q gi|254780612|r 432 IFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ 474 (503) Q Consensus 432 ~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr 474 (503) ..++. |+++||=-| -+|..||++-.=.|.||..+||- T Consensus 181 v~~~~-DilQIGARn-----~qNf~LL~~~g~t~kPVllKrg~ 217 (352) T PRK13396 181 IAEVA-DVIQVGARN-----MQNFSLLKKVGAQDKPVLLKRGM 217 (352) T ss_pred HHHHC-CEEEECCHH-----HCCHHHHHHHHCCCCEEEECCCC T ss_conf 98658-889989254-----05999999985469807973788 No 57 >pfam11322 DUF3124 Protein of unknown function (DUF3124). This bacterial family of proteins has no known function. Probab=25.76 E-value=41 Score=13.37 Aligned_cols=17 Identities=29% Similarity=0.462 Sum_probs=11.7 Q ss_pred CEEEEEEECCCCCCCCCE Q ss_conf 717999951789877781 Q gi|254780612|r 374 AFSASMTLKCNADISLSI 391 (503) Q Consensus 374 ~~~~~~~~rrn~d~slpa 391 (503) .|+++|+| ||||.+-|- T Consensus 25 ~Lt~tLSi-RNtd~~~pi 41 (126) T pfam11322 25 NLTATLSI-RNTDPSQPI 41 (126) T ss_pred EEEEEEEE-ECCCCCCCE T ss_conf 68999999-749999988 No 58 >KOG0526 consensus Probab=25.56 E-value=42 Score=13.35 Aligned_cols=96 Identities=14% Similarity=0.166 Sum_probs=51.7 Q ss_pred CCCEEEEEEECCCCCCCCCEEEEEEEEEECCCC-C-CCCHHHC----C-CCEECCCCCCCCCCCCCEEEEECCCEEEEEC Q ss_conf 777179999517898777812889999716888-8-7321103----2-3102133212466311001674698289981 Q gi|254780612|r 372 DNAFSASMTLKCNADISLSITHVMEIMFSFPKE-S-QDAVVDL----R-RISMRKTDNSPSVLIDSNIFVISKNSYLISL 444 (503) Q Consensus 372 ~~~~~~~~~~rrn~d~slpash~~e~~f~~~~~-~-~~~v~~v----~-~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l 444 (503) ..+=.++|-|+-|.| -|+. +|||-|-.|.+ + |-.-+-| . .+.+-+-+++.|+++. ..++=++|-=- T Consensus 141 ~gKNEv~LEFh~nDd--a~~~-LmEmRF~iP~d~~~ged~~~~e~F~~~v~~kAdv~~~~gdaI~----~f~~i~~lTPR 213 (615) T KOG0526 141 TGKNEVTLEFHQNDD--APVG-LMEMRFHIPEDQEDGEDRDKVEAFYENVLAKADVSSAVGDAIV----SFEEILCLTPR 213 (615) T ss_pred CCCCEEEEEEECCCC--CCCC-EEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEE----EEEEEEEECCC T ss_conf 377517999861677--7743-3899995485212444356788999999986375412477458----76522541577 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEEECCCCE Q ss_conf 7882568877999730781168788379967 Q gi|254780612|r 445 KGSEEDPFRNSKILEEYRFIDIPITYRSGQK 475 (503) Q Consensus 445 ~~~~~~~~~n~~ll~~~~w~dip~~y~ngrr 475 (503) --.+-+..++.--|++.. +|-=|-|++=.| T Consensus 214 GRYdI~iy~t~lrL~GkT-yDyKI~y~SI~r 243 (615) T KOG0526 214 GRYDIKIYPTFLRLHGKT-YDYKIPYKSINR 243 (615) T ss_pred CCCEEEEEHHHHHHCCCC-CCEECCHHHEEE T ss_conf 620068730134221664-420120331404 No 59 >pfam03425 CBM_11 Carbohydrate binding domain (family 11). Probab=25.54 E-value=34 Score=13.98 Aligned_cols=31 Identities=16% Similarity=0.375 Sum_probs=14.7 Q ss_pred CCCCCCEEEEEEEEEECCCCCC-CC--HHHCCCC Q ss_conf 9877781288999971688887-32--1103231 Q gi|254780612|r 385 ADISLSITHVMEIMFSFPKESQ-DA--VVDLRRI 415 (503) Q Consensus 385 ~d~slpash~~e~~f~~~~~~~-~~--v~~v~~~ 415 (503) -+..|=..|+.+|.|....+-+ |. |+||+.+ T Consensus 135 ~~~~~dL~~i~~l~f~~~~~~~~g~f~iDnI~l~ 168 (170) T pfam03425 135 KDVPLDLDNITALAFQPKGGNSKGSFKIDNVRLI 168 (170) T ss_pred CCCEECHHHCEEEEEEECCCCCCCCEEEEEEEEE T ss_conf 5330373463189998648987565899779971 No 60 >PRK05211 consensus Probab=25.25 E-value=9.8 Score=17.74 Aligned_cols=65 Identities=12% Similarity=0.176 Sum_probs=36.6 Q ss_pred CCCCEECCCCCCCCCCCCCEE---EEECCCEEEEECCCCCHHHHHHHHHHHC-CCCEEEEEEECCCCEEE Q ss_conf 323102133212466311001---6746982899817882568877999730-78116878837996789 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSNI---FVISKNSYLISLKGSEEDPFRNSKILEE-YRFIDIPITYRSGQKIL 477 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~~---~kv~~~~f~v~l~~~~~~~~~n~~ll~~-~~w~dip~~y~ngrra~ 477 (503) |+++.|+ .+.--|+|+..+- -+=.|-.++|-|......+..|+.++++ ..=+.+||+|..|=|-+ T Consensus 9 Vk~~~f~-~~~~~gDP~~~ak~~~~~gadelhivDld~a~~g~~~n~~~I~~i~~~~~~Pl~vGGGIrs~ 77 (248) T PRK05211 9 VKGVQFR-NHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVAGGIKSV 77 (248) T ss_pred EECCCCC-CCEECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCH T ss_conf 7685766-88677899999999998699989999786776787214999999997679858962780138 No 61 >KOG3883 consensus Probab=24.97 E-value=43 Score=13.28 Aligned_cols=84 Identities=14% Similarity=0.163 Sum_probs=45.3 Q ss_pred EEEEEEEEECCCCCCCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEE------ Q ss_conf 28899997168888732110323102133212466311001674698289981788256887799973078116------ Q gi|254780612|r 392 THVMEIMFSFPKESQDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFID------ 465 (503) Q Consensus 392 sh~~e~~f~~~~~~~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~d------ 465 (503) --|||=+|...=..+.|+...-+|.=-.-=|.--.-|-.--+.-+|+|-||--++...- ..-+.|||- ||| T Consensus 41 ~pTiEDiY~~svet~rgarE~l~lyDTaGlq~~~~eLprhy~q~aDafVLVYs~~d~eS-f~rv~llKk--~Idk~KdKK 117 (198) T KOG3883 41 HPTIEDIYVASVETDRGAREQLRLYDTAGLQGGQQELPRHYFQFADAFVLVYSPMDPES-FQRVELLKK--EIDKHKDKK 117 (198) T ss_pred CCCHHHHEEEEEECCCCHHHEEEEEECCCCCCCHHHHHHHHHCCCCEEEEEECCCCHHH-HHHHHHHHH--HHHHCCCCC T ss_conf 66344224676406878101688865244557600202767105765799963799889-889999999--986255655 Q ss_pred -EEEEECCCCEEEE Q ss_conf -8788379967899 Q gi|254780612|r 466 -IPITYRSGQKILF 478 (503) Q Consensus 466 -ip~~y~ngrra~~ 478 (503) ||||--...|++- T Consensus 118 EvpiVVLaN~rdr~ 131 (198) T KOG3883 118 EVPIVVLANKRDRA 131 (198) T ss_pred CCCEEEEECHHHCC T ss_conf 12289973001013 No 62 >TIGR00400 mgtE magnesium transporter; InterPro: IPR006669 This is the MgtE family of magnesium transporters. All the archaebacterial and eukaryotic examples have two copies of the integral membrane region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg^2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport. Probab=24.54 E-value=44 Score=13.22 Aligned_cols=76 Identities=18% Similarity=0.377 Sum_probs=59.2 Q ss_pred EEEEEECCCCC-CCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 99997168888-73211032310213321246631100167469828998178825688779997307811687883799 Q gi|254780612|r 395 MEIMFSFPKES-QDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSG 473 (503) Q Consensus 395 ~e~~f~~~~~~-~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ng 473 (503) +-..|+.-+.- -.|+-++..+.+...|.--|+-+ ++++|+..++++-+.|.. .++...+++-+|++..+| T Consensus 171 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~d~~~~~~---~~~~~~d~~~~p~~~~~~ 241 (460) T TIGR00400 171 IYTLYVTNESKRLTGVLSLRDLILAKPEEYLGDLL------FPDGVSVDGLNDEEEDVA---RLIEKYDFLAVPVVDNEG 241 (460) T ss_pred HEEEEEECCCHHHHHHHHHHHHHHCCCHHHHHHHH------HHCCEEEEECCCCHHHHH---HHHHHCCCEEEEEECCCC T ss_conf 02457633410220001123333035056776654------210013520454046789---887640210200131677 Q ss_pred CEEEEE Q ss_conf 678999 Q gi|254780612|r 474 QKILFT 479 (503) Q Consensus 474 rra~~~ 479 (503) |..-+. T Consensus 242 ~~~g~~ 247 (460) T TIGR00400 242 RLVGIV 247 (460) T ss_pred CEEEEE T ss_conf 168754 No 63 >cd04731 HisF The cyclase subunit of imidazoleglycerol phosphate synthase (HisF). Imidazole glycerol phosphate synthase (IGPS) catalyzes the fifth step of histidine biosynthesis, the formation of the imidazole ring. IGPS converts N1-(5'-phosphoribulosyl)-formimino-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to imidazole glycerol phosphate (ImGP) and 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR). This conversion involves two tightly coupled reactions in distinct active sites of IGPS. The two catalytic domains can be fused, like in fungi and plants, or peformed by a heterodimer (HisH-glutaminase and HisF-cyclase), like in bacteria. Probab=24.10 E-value=13 Score=16.88 Aligned_cols=40 Identities=18% Similarity=0.234 Sum_probs=17.7 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCEE Q ss_conf 9828998178825688779997307-811687883799678 Q gi|254780612|r 437 KNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQKI 476 (503) Q Consensus 437 ~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrra 476 (503) |-.++|-|+.....+..|+.||++- .=+.+||.|..|=|. T Consensus 42 delhivDld~a~~g~~~n~~~i~~i~~~~~~pi~vGGGIrs 82 (243) T cd04731 42 DELVFLDITASSEGRETMLDVVERVAEEVFIPLTVGGGIRS 82 (243) T ss_pred CEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEEE T ss_conf 99999706732037700799999999867986899850664 No 64 >cd03596 CLECT_tetranectin_like C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CLECT_tetranectin_like: C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TN binds to plasminogen and stimulates activation of plasminogen, playing a key role in the regulation of proteolytic processes. The TN CTLD binds two calcium ions. Its calcium free form binds to various kringle-like protein ligands. Two residues involved in the coordination of calcium are critical for the binding of TN to the fourth kringle (K4) domain of plasminogen (Plg K4). TN binds the kringle 1-4 form of angiostatin (AST K1-4). AST K1-4 is a fragment of Plg, commonly found in cancer tissues. TN inhibits the bin Probab=24.06 E-value=31 Score=14.22 Aligned_cols=25 Identities=16% Similarity=0.375 Sum_probs=15.4 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCCEE---EEEEECC Q ss_conf 8289981788256887799973078116---8788379 Q gi|254780612|r 438 NSYLISLKGSEEDPFRNSKILEEYRFID---IPITYRS 472 (503) Q Consensus 438 ~~f~v~l~~~~~~~~~n~~ll~~~~w~d---ip~~y~n 472 (503) .-|||||+|...+ -.|++ .||.|.| T Consensus 59 ~~~WIGl~d~~~~----------g~w~w~~g~~~~~~~ 86 (129) T cd03596 59 WEVWLGINDMVAE----------GKWVDVNGSPISYFN 86 (129) T ss_pred CCEEEECCCCCCC----------CCEECCCCCCCCCCC T ss_conf 6478761112679----------966819998856257 No 65 >pfam03180 Lipoprotein_9 NLPA lipoprotein. This family of bacterial lipoproteins contains several antigenic members, that may be involved in bacterial virulence. Their precise function is unknown. However they are probably distantly related to pfam00497 which are solute binding proteins. Probab=24.03 E-value=45 Score=13.16 Aligned_cols=28 Identities=18% Similarity=0.078 Sum_probs=16.4 Q ss_pred CCCCCEEEEEEECCCCCEEEE-----EEECCCC Q ss_conf 776742367630787771799-----9951789 Q gi|254780612|r 358 GLKGLVIKGDIPMIDNAFSAS-----MTLKCNA 385 (503) Q Consensus 358 ~~~~~aira~i~iP~~~~~~~-----~~~rrn~ 385 (503) +|...+|--|.+.=.|.|.+. ++|+.+. T Consensus 96 ~Ga~IaIpnD~sN~~RAL~lL~~aGLI~Lk~~~ 128 (236) T pfam03180 96 DGATIAVPNDPSNEGRALLLLEKAGLIKLKDGA 128 (236) T ss_pred CCCEEEECCCCCHHHHHHHHHHHCCCEEECCCC T ss_conf 999898158844299999999988988976899 No 66 >pfam00869 Flavi_glycoprot Flavivirus glycoprotein, central and dimerization domains. Probab=23.98 E-value=17 Score=16.14 Aligned_cols=38 Identities=21% Similarity=0.539 Sum_probs=22.4 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCCE-EEEEEECCC-------CEEEEEECC Q ss_conf 9828998178825688779997307811-687883799-------678999615 Q gi|254780612|r 437 KNSYLISLKGSEEDPFRNSKILEEYRFI-DIPITYRSG-------QKILFTIDK 482 (503) Q Consensus 437 ~~~f~v~l~~~~~~~~~n~~ll~~~~w~-dip~~y~ng-------rra~~~~ek 482 (503) ++++|+.+. |-..|-+|+|| ||||=++.| |-.|+.|+- T Consensus 193 ~~~yl~~~~--------~~~wlVhrdWf~DL~LPW~~~~~~~W~~~e~LVeF~~ 238 (293) T pfam00869 193 SNYYLLEMD--------GKSWLVHRDWFHDLPLPWTHGGATSWRNKERLVEFEE 238 (293) T ss_pred HHEEEEEEC--------CCEEEEEHHHHHCCCCCCCCCCCCCCCCCCEEEEECC T ss_conf 678999976--------9589987557533788851799973514523788669 No 67 >PRK13585 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=23.95 E-value=40 Score=13.46 Aligned_cols=40 Identities=28% Similarity=0.414 Sum_probs=17.8 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHC-CCCEEEEEEECCCCEE Q ss_conf 982899817882568877999730-7811687883799678 Q gi|254780612|r 437 KNSYLISLKGSEEDPFRNSKILEE-YRFIDIPITYRSGQKI 476 (503) Q Consensus 437 ~~~f~v~l~~~~~~~~~n~~ll~~-~~w~dip~~y~ngrra 476 (503) |-+|+|=|+....-+..|+.++++ ..=+.+||.|..|=|- T Consensus 46 ~~lhivDLd~a~~g~~~n~~~I~~i~~~~~~pi~vGGGIrs 86 (240) T PRK13585 46 KTLHLVDLDGAFEGSRKNADIIEKIVEATDVSIQLGGGIRS 86 (240) T ss_pred CEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 97999989772118944499999999737977899788587 No 68 >PRK07006 isocitrate dehydrogenase; Reviewed Probab=23.90 E-value=37 Score=13.75 Aligned_cols=18 Identities=22% Similarity=0.108 Sum_probs=12.9 Q ss_pred HHHHHHCCCCEEEEEEECC Q ss_conf 7999730781168788379 Q gi|254780612|r 454 NSKILEEYRFIDIPITYRS 472 (503) Q Consensus 454 n~~ll~~~~w~dip~~y~n 472 (503) =++|++.-.+||+ ||-.| T Consensus 279 ~~~lv~~P~~fDV-ivt~N 296 (409) T PRK07006 279 LQQILLRPAEYDV-IATMN 296 (409) T ss_pred HHHHHHCCCCCCE-EEECC T ss_conf 9999639645547-98755 No 69 >cd07136 ALDH_YwdH-P39616 Bacillus subtilis aldehyde dehydrogenase ywdH-like. Uncharacterized Bacillus subtilis ywdH aldehyde dehydrogenase (locus P39616) most closely related to the ALDHs and fatty ALDHs of families 3 and 14, and similar sequences, are included in this CD. Probab=23.74 E-value=18 Score=15.88 Aligned_cols=16 Identities=31% Similarity=0.783 Sum_probs=12.5 Q ss_pred HHHHCCCCEEEEEEEC Q ss_conf 9973078116878837 Q gi|254780612|r 456 KILEEYRFIDIPITYR 471 (503) Q Consensus 456 ~ll~~~~w~dip~~y~ 471 (503) ..|+.+.|||||+.|- T Consensus 423 tV~~~~~~~~~~~~~p 438 (449) T cd07136 423 SILKKSTWFDLPLRYP 438 (449) T ss_pred EEEECCCCCCCCCCCC T ss_conf 7998887787570089 No 70 >COG1628 Endonuclease V homolog [Replication, recombination, and repair] Probab=23.37 E-value=31 Score=14.27 Aligned_cols=84 Identities=17% Similarity=0.143 Sum_probs=47.9 Q ss_pred CCCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHHHHHCC---------------CCEEEEEEE Q ss_conf 73211032310213321246631100167469828998178825688779997307---------------811687883 Q gi|254780612|r 406 QDAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLISLKGSEEDPFRNSKILEEY---------------RFIDIPITY 470 (503) Q Consensus 406 ~~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~---------------~w~dip~~y 470 (503) ++.-+.+-|..|+.+- =+.|+.-- -+=|.=.|.-.+..+=+.-++.+ ||+||.-+| T Consensus 22 ~~~~~~~vG~v~r~~~-----~v~gv~~~----~i~vDG~D~T~~i~~~v~~~~~~~~rvVlLdGIt~aGFNivDi~~l~ 92 (185) T COG1628 22 GDDKVTLVGVVMRGDR-----VVDGVAFS----LITVDGLDVTDAISDMVNRSKRRDLRVVLLDGITFAGFNIVDIEALY 92 (185) T ss_pred CCCEEEEEEEEEECCE-----EEEEEEEE----EEEECCCHHHHHHHHHHHHHHCCCCCEEEECCEEECCCEEECHHHHH T ss_conf 7860899999998886-----88766778----99866823889999999985255611899877203052484289988 Q ss_pred -CCCCEEEEEECCCCCHHHHHH---HHHHHHH Q ss_conf -799678999615876899999---9999985 Q gi|254780612|r 471 -RSGQKILFTIDKGKKGADIFK---SAIMQWE 498 (503) Q Consensus 471 -~ngrra~~~~ekg~~g~~~f~---~a~~~w~ 498 (503) +.|+=-|--+.|++.+|+++. +-|.-|. T Consensus 93 ~~tg~PVi~V~~k~P~~e~i~~Al~k~f~d~e 124 (185) T COG1628 93 KETGLPVIVVYRKKPDIERIESALRKHFDDAE 124 (185) T ss_pred HHHCCCEEEEEECCCCHHHHHHHHHHHCCCHH T ss_conf 76199689999637987899999997588799 No 71 >COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=23.20 E-value=35 Score=13.92 Aligned_cols=30 Identities=27% Similarity=0.165 Sum_probs=14.8 Q ss_pred CCCCCHHHHHHHHHH--HCCCCEEEEEEECCC Q ss_conf 178825688779997--307811687883799 Q gi|254780612|r 444 LKGSEEDPFRNSKIL--EEYRFIDIPITYRSG 473 (503) Q Consensus 444 l~~~~~~~~~n~~ll--~~~~w~dip~~y~ng 473 (503) |++++..++|-+-=. |=++=|+|||+|-+- T Consensus 156 LaSLD~~RK~EilpylERL~~e~~IPIlYVSH 187 (352) T COG4148 156 LASLDLPRKREILPYLERLRDEINIPILYVSH 187 (352) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 03236510467789999987621887899936 No 72 >cd04732 HisA HisA. Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase catalyzes the fourth step in histidine biosynthesis, an isomerisation of the aminoaldose moiety of ProFAR to the aminoketose of PRFAR (N-(5'-phospho-D-1'-ribulosylformimino)-5-amino-1-(5''-phospho-ribosyl)-4-imidazolecarboxamide). In bacteria and archaea, ProFAR isomerase is encoded by the HisA gene. Probab=23.04 E-value=45 Score=13.10 Aligned_cols=40 Identities=23% Similarity=0.395 Sum_probs=17.6 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEEEECCCCE Q ss_conf 69828998178825688779997307-81168788379967 Q gi|254780612|r 436 SKNSYLISLKGSEEDPFRNSKILEEY-RFIDIPITYRSGQK 475 (503) Q Consensus 436 ~~~~f~v~l~~~~~~~~~n~~ll~~~-~w~dip~~y~ngrr 475 (503) .|.++++=|+.....+..|+.|+++- .=+++||.|..|-| T Consensus 43 ~d~l~i~DLdaa~~~~~~n~~~I~~I~~~~~~pi~vGGGIr 83 (234) T cd04732 43 AKWLHVVDLDGAKGGEPVNLELIEEIVKAVGIPVQVGGGIR 83 (234) T ss_pred CCEEEEEECCCHHHCCCCHHHHHHHHHHHCCCCEEECCCCC T ss_conf 99899996753030891159999999976795689737717 No 73 >cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains. Probab=22.79 E-value=47 Score=13.00 Aligned_cols=38 Identities=16% Similarity=0.024 Sum_probs=27.2 Q ss_pred ECCCEEEEECCCCCH----HHHHHHHHHHCCCCEEEEEEECC Q ss_conf 469828998178825----68877999730781168788379 Q gi|254780612|r 435 ISKNSYLISLKGSEE----DPFRNSKILEEYRFIDIPITYRS 472 (503) Q Consensus 435 v~~~~f~v~l~~~~~----~~~~n~~ll~~~~w~dip~~y~n 472 (503) +..+--.|++|++.. +..+.+++|++++--||+|+... T Consensus 47 ~e~~ad~i~vSsl~g~~~~~~~~l~~~L~e~G~~di~vvvGG 88 (128) T cd02072 47 IETDADAILVSSLYGHGEIDCKGLREKCDEAGLKDILLYVGG 88 (128) T ss_pred HHCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEECC T ss_conf 873999999823202562489999999996799999899789 No 74 >PRK07374 dnaE DNA polymerase III subunit alpha; Validated Probab=22.71 E-value=47 Score=12.99 Aligned_cols=17 Identities=18% Similarity=0.296 Sum_probs=8.8 Q ss_pred EEEEEEECCCC-EEEEEE Q ss_conf 16878837996-789996 Q gi|254780612|r 464 IDIPITYRSGQ-KILFTI 480 (503) Q Consensus 464 ~dip~~y~ngr-ra~~~~ 480 (503) +-|-|.|.|+. ++.|.| T Consensus 1123 ~pv~~~~~~~~~~~~~~l 1140 (1171) T PRK07374 1123 IPVVAAVRAGNSVRYVRL 1140 (1171) T ss_pred CCEEEEEECCCCEEEEEE T ss_conf 776999976983699981 No 75 >PRK02079 pyrroloquinoline quinone biosynthesis protein PqqD; Provisional Probab=22.54 E-value=48 Score=12.96 Aligned_cols=22 Identities=18% Similarity=0.386 Sum_probs=19.2 Q ss_pred CCCCHHHHHHHHHHHCCCCEEE Q ss_conf 7882568877999730781168 Q gi|254780612|r 445 KGSEEDPFRNSKILEEYRFIDI 466 (503) Q Consensus 445 ~~~~~~~~~n~~ll~~~~w~di 466 (503) .++..|+-.=|+.|..+.||+| T Consensus 66 ~~i~~DV~~fL~~a~~k~wi~l 87 (88) T PRK02079 66 PGLDEDVLEFLEVARAKHWIEL 87 (88) T ss_pred HHHHHHHHHHHHHHHHCCCEEC T ss_conf 1258999999999998819762 No 76 >KOG3107 consensus Probab=22.45 E-value=48 Score=12.96 Aligned_cols=24 Identities=17% Similarity=0.009 Sum_probs=17.7 Q ss_pred EEECCCCCCCCCEEEEEEEEEECC Q ss_conf 995178987778128899997168 Q gi|254780612|r 379 MTLKCNADISLSITHVMEIMFSFP 402 (503) Q Consensus 379 ~~~rrn~d~slpash~~e~~f~~~ 402 (503) -.=|||+|+|-|-...+|-+|+|. T Consensus 180 ~rgkk~~~psp~~ds~~eRVFiWD 203 (468) T KOG3107 180 GRGKKKTGPSPPGDSTLERVFIWD 203 (468) T ss_pred CCCCCCCCCCCCCCCCCEEEEEEE T ss_conf 556666899899877600588860 No 77 >KOG3972 consensus Probab=22.28 E-value=47 Score=13.01 Aligned_cols=24 Identities=21% Similarity=0.346 Sum_probs=19.6 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 179999988616776560001322 Q gi|254780612|r 3 DFILVIQRAVDNLPENTPERRSHI 26 (503) Q Consensus 3 D~~~~l~kai~~l~~~tpe~r~~v 26 (503) -||-|||||-.||.-.+.+-|-.. T Consensus 81 ayyklLkka~~GL~si~~d~~~~~ 104 (252) T KOG3972 81 AYYKLLKKAQEGLNSITEDGRLHN 104 (252) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCH T ss_conf 999999999877873585677761 No 78 >pfam10794 DUF2606 Protein of unknown function, DUF2606. Family of bacterial proteins with unknown function. Probab=22.12 E-value=48 Score=12.91 Aligned_cols=78 Identities=15% Similarity=0.221 Sum_probs=53.6 Q ss_pred EEEEE-EEEEECCCCCCCCHHHCCCCEECCCCCCCCCCC-CCEEE-----------E-ECCCEEEEECCCCCHHHHHHHH Q ss_conf 12889-999716888873211032310213321246631-10016-----------7-4698289981788256887799 Q gi|254780612|r 391 ITHVM-EIMFSFPKESQDAVVDLRRISMRKTDNSPSVLI-DSNIF-----------V-ISKNSYLISLKGSEEDPFRNSK 456 (503) Q Consensus 391 ash~~-e~~f~~~~~~~~~v~~v~~~~~k~~e~~~g~~l-~~~~~-----------k-v~~~~f~v~l~~~~~~~~~n~~ 456 (503) +|.++ -++|-+-+--|.-|.+|.-++||..+.+..--- +|..- | +..|-|.|.|-+.+.-..||+. T Consensus 37 ~~kv~~pVT~hVen~e~qpik~~ei~lmKa~d~~p~Pskeig~~IGKTd~eGkiiWk~~rKG~Yiv~lp~~et~~~r~is 116 (131) T pfam10794 37 AKKVVNPVTLHVEDAEGQPIKGVEVTLMKAADSDPQPSKEIGEIIGKTDHEGKIIWKSGRKGKYIVVLPKNETPETRNIS 116 (131) T ss_pred HCEECCCEEEEEECCCCCCCCCEEEEEEECCCCCCCCCHHHHEEECCCCCCCEEEEECCCCCEEEEEECCCCCEEEEEEE T ss_conf 41003747999825778932675999996565699974212114455476770887337772089996389850577542 Q ss_pred HHHCCCCEEEEE Q ss_conf 973078116878 Q gi|254780612|r 457 ILEEYRFIDIPI 468 (503) Q Consensus 457 ll~~~~w~dip~ 468 (503) |...|.=-.|+. T Consensus 117 l~~dr~~~~vi~ 128 (131) T pfam10794 117 LIEDRKDHKVIS 128 (131) T ss_pred EECCCCCCEEEE T ss_conf 102677635898 No 79 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=22.00 E-value=49 Score=12.89 Aligned_cols=41 Identities=15% Similarity=0.138 Sum_probs=28.4 Q ss_pred ECCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEE Q ss_conf 4698289981788256887799973078116878837996-789996 Q gi|254780612|r 435 ISKNSYLISLKGSEEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTI 480 (503) Q Consensus 435 v~~~~f~v~l~~~~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~ 480 (503) .+.++.-|..+..= ..=+.||.+.+. ||++..||+ ..|||- T Consensus 69 M~~~~~tv~~~~~i---~~~~~~m~~~~~--lpVVD~~~~lvGIITr 110 (116) T cd04643 69 MNTDVPVIIDDADI---EEILHLLIDQPF--LPVVDDDGIFIGIITR 110 (116) T ss_pred HCCCCEEEECCCCH---HHHHHHHHHCCE--EEEEECCCEEEEEEEH T ss_conf 15698899079999---999999987898--9999459999999992 No 80 >PRK06826 dnaE DNA polymerase III DnaE; Reviewed Probab=21.91 E-value=49 Score=12.88 Aligned_cols=40 Identities=8% Similarity=0.042 Sum_probs=20.7 Q ss_pred HHHCCCCEEEEEEECCCCEEEEEECC--CCCHHHHHHHHHHHH Q ss_conf 97307811687883799678999615--876899999999998 Q gi|254780612|r 457 ILEEYRFIDIPITYRSGQKILFTIDK--GKKGADIFKSAIMQW 497 (503) Q Consensus 457 ll~~~~w~dip~~y~ngrra~~~~ek--g~~g~~~f~~a~~~w 497 (503) |.+.++=+.|-|.|.++++ .|-|.+ ...-...|-+.+..- T Consensus 1107 l~~~~G~~~V~l~~~~~~~-~~~L~~~~~V~~~~~l~~~L~~l 1148 (1157) T PRK06826 1107 LKKHPGNTPVYLYTEKERK-KFKLDRELWVSLSEELIDNLRNL 1148 (1157) T ss_pred HHHCCCCCEEEEEECCCCE-EEECCCCCCCCCCHHHHHHHHHH T ss_conf 9739799859999868977-99889987467799999999986 No 81 >COG3814 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.89 E-value=44 Score=13.19 Aligned_cols=12 Identities=8% Similarity=0.445 Sum_probs=6.5 Q ss_pred EECCCEEEEECC Q ss_conf 746982899817 Q gi|254780612|r 434 VISKNSYLISLK 445 (503) Q Consensus 434 kv~~~~f~v~l~ 445 (503) ||+|.+|-|+|| T Consensus 75 ~v~dtgFsv~ls 86 (157) T COG3814 75 KVTDTGFSVTLS 86 (157) T ss_pred EECCCCEEEEEE T ss_conf 102462489988 No 82 >TIGR03572 WbuZ glycosyl amidation-associated protein WbuZ. This clade of sequences is highly similar to the HisF protein, but generally represents the second HisF homolog in the genome where the other is an authentic HisF observed in the context of a complete histidine biosynthesis operon. The similarity between these WbuZ sequences and true HisFs is such that often the closest match by BLAST of a WbuZ is a HisF. Only by making a multiple sequence alignment is the homology relationship among the WbuZ sequences made apparent. WbuZ genes are invariably observed in the presence of a homolog of the HisH protein (designated WbuY) and a proposed N-acetyl sugar amidotransferase designated in WbuX in E. coli, IfnA in P. aeriginosa and PseA in C. jejuni. Similarly, this trio of genes is invariably found in the context of saccharide biosynthesis loci. It has been shown that the WbuYZ homologs are not essential components of the activity expressed by WbuX, leading to the proposal that these to pr Probab=21.85 E-value=16 Score=16.30 Aligned_cols=65 Identities=8% Similarity=0.103 Sum_probs=43.4 Q ss_pred CCCCEECCCCCCCCCCCCCE---EEEECCCEEEEECCCCCHHHHHHHHHHHCCC-CEEEEEEECCCCEEE Q ss_conf 32310213321246631100---1674698289981788256887799973078-116878837996789 Q gi|254780612|r 412 LRRISMRKTDNSPSVLIDSN---IFVISKNSYLISLKGSEEDPFRNSKILEEYR-FIDIPITYRSGQKIL 477 (503) Q Consensus 412 v~~~~~k~~e~~~g~~l~~~---~~kv~~~~f~v~l~~~~~~~~~n~~ll~~~~-w~dip~~y~ngrra~ 477 (503) |++..||. +.--|+|+.-+ .-+=.|-.++|=|+.....+..|+.|+++-. =+.|||.|..|=|-+ T Consensus 18 Vk~~~~~~-~~~~gdP~~~ak~~~~~g~d~lhivDld~a~~~~~~n~~~I~~i~~~~~ipi~vGGGIrs~ 86 (232) T TIGR03572 18 VKTVQFKD-PRYIGDPVNAARIYNAKGADELIVLDIDASKRGREPLFELISNLAEECFMPLTVGGGIRSL 86 (232) T ss_pred EECCCCCC-CEECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEEECEEEH T ss_conf 97847877-6578899999999998699999999687643488217999999999729858997133038 No 83 >pfam00948 Flavi_NS1 Flavivirus non-structural Protein NS1. The NS1 protein is well conserved amongst the flaviviruses. It contains 12 cysteines, and undergoes glycosylation in a similar manner to other NS proteins. Mutational analysis has strongly implied a role for NS1 in the early stages of RNA replication. Probab=21.80 E-value=46 Score=13.09 Aligned_cols=51 Identities=14% Similarity=0.116 Sum_probs=28.1 Q ss_pred CCCEEEEEEECCCCCEEEEEEECCCCCC-------------CCCEEEEEE------EEEECCCCCCCCHHH Q ss_conf 6742367630787771799995178987-------------778128899------997168888732110 Q gi|254780612|r 360 KGLVIKGDIPMIDNAFSASMTLKCNADI-------------SLSITHVME------IMFSFPKESQDAVVD 411 (503) Q Consensus 360 ~~~aira~i~iP~~~~~~~~~~rrn~d~-------------slpash~~e------~~f~~~~~~~~~v~~ 411 (503) -..||++++.| -.+..|=|-=+.|.-- .-|.|||+. -..++|.-.||=++. T Consensus 183 ~GaAVK~~~aV-HtD~~~WmeS~~NgTw~i~~l~~~~~~~C~WP~shTl~~~~v~eS~lfiP~~lgGP~S~ 252 (355) T pfam00948 183 MGAAVKNGRAV-HTDLSYWIESRLNGTWKIERAELGEVKSCTWPETHTLWGDGVLESELFIPVTLAGPRSQ 252 (355) T ss_pred EEHEEECCEEE-ECCCCEEEEECCCCEEEEEEEEECCEEEEECCCCCCCCCCCCCCCCEECCCCCCCCCHH T ss_conf 00211188659-65886278974688199999986130564288761259998440230054302587200 No 84 >CHL00073 chlN photochlorophyllide reductase subunit N Probab=21.67 E-value=35 Score=13.86 Aligned_cols=12 Identities=17% Similarity=0.127 Sum_probs=5.3 Q ss_pred EEEEEEECCCCC Q ss_conf 899997168888 Q gi|254780612|r 394 VMEIMFSFPKES 405 (503) Q Consensus 394 ~~e~~f~~~~~~ 405 (503) -|||+|.==.+| T Consensus 415 SiEl~F~pIHGf 426 (457) T CHL00073 415 SVEFTFAQIHGF 426 (457) T ss_pred EEEEEECCCCCH T ss_conf 787772043460 No 85 >PRK06273 ferredoxin; Provisional Probab=21.53 E-value=45 Score=13.14 Aligned_cols=25 Identities=12% Similarity=0.149 Sum_probs=15.4 Q ss_pred C-CCHHHC---CCCEECCCCCCCCCCCCCEEEEECCCEE Q ss_conf 7-321103---2310213321246631100167469828 Q gi|254780612|r 406 Q-DAVVDL---RRISMRKTDNSPSVLIDSNIFVISKNSY 440 (503) Q Consensus 406 ~-~~v~~v---~~~~~k~~e~~~g~~l~~~~~kv~~~~f 440 (503) | ||-+|| ..|.|+.-| |||||||+- T Consensus 55 GC~GCaNvCPT~AIeM~~~e----------PVkite~~v 83 (163) T PRK06273 55 GCGGCANACPTKAIEMIPVE----------PVKITETYV 83 (163) T ss_pred CCCCCCCCCCCCCEEEEECC----------CEEEECCHH T ss_conf 65550023887755777656----------567411033 No 86 >TIGR02318 phosphono_phnM phosphonate metabolism protein PhnM; InterPro: IPR012696 This family consists of proteins from in the PhnM family. PhnM is a protein associated with phosphonate utilization in a number of bacterial species. Pseudomonas stutzeri WM88, has a related protein that is part of a system for the oxidation of phosphites (another form of reduced phosphorous compound), which is not contained in this entry.. Probab=21.42 E-value=50 Score=12.81 Aligned_cols=51 Identities=22% Similarity=0.244 Sum_probs=33.7 Q ss_pred CCEEEEECCCCCHHHH--HHHHHHHCCCCEEEEEEECCCCEE-EEEECCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 9828998178825688--779997307811687883799678-999615876899999999998522369 Q gi|254780612|r 437 KNSYLISLKGSEEDPF--RNSKILEEYRFIDIPITYRSGQKI-LFTIDKGKKGADIFKSAIMQWENRSNN 503 (503) Q Consensus 437 ~~~f~v~l~~~~~~~~--~n~~ll~~~~w~dip~~y~ngrra-~~~~ekg~~g~~~f~~a~~~w~~~~~~ 503 (503) +..-||.|.|+-.=+- +|+++.|+ |-.|||+ +|| +..|++-|..|++..+| T Consensus 162 ~~v~L~SlMDHtPGQrQ~~~~~~yr~---------~~~~k~grvls-------d~~~~~~~~~r~~~~~~ 215 (391) T TIGR02318 162 PRVDLISLMDHTPGQRQFRDLEKYRE---------YYQKKRGRVLS-------DDEFDEIVEERIARSAE 215 (391) T ss_pred CCCEEEEEECCCCHHHCCHHHHHHHH---------HHHHHHCCCCC-------HHHHHHHHHHHHHHHCC T ss_conf 97018886068870110112578999---------87245267458-------99999999999997358 No 87 >pfam03452 Anp1 Anp1. The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins co-localize within the cis Golgi, and that they are physically associated in two distinct complexes. Probab=21.40 E-value=50 Score=12.81 Aligned_cols=77 Identities=17% Similarity=0.200 Sum_probs=38.4 Q ss_pred EEEEEEEEECCCCC-CCCHHHCCCCEECCCCCC--CCCCCCCEEEEECCCEEEEECCCCCH---------------HHHH Q ss_conf 28899997168888-732110323102133212--46631100167469828998178825---------------6887 Q gi|254780612|r 392 THVMEIMFSFPKES-QDAVVDLRRISMRKTDNS--PSVLIDSNIFVISKNSYLISLKGSEE---------------DPFR 453 (503) Q Consensus 392 sh~~e~~f~~~~~~-~~~v~~v~~~~~k~~e~~--~g~~l~~~~~kv~~~~f~v~l~~~~~---------------~~~~ 453 (503) -|+|+|-|++++.. |-..-.+-...++..... --....-+.. +..+| .-.+|-... .+.| T Consensus 55 h~lI~L~Flv~~~~~~d~t~~~l~~~~~~~Q~~~~~~~~F~~iti-~~kdf-~~~~~q~~~~RH~~~~Q~~RR~~maraR 132 (270) T pfam03452 55 HELIDLGFLVPDTKEGDKTLEVLAAALKKTQSKGDGEKRFRSITI-LRKDF-GDVEGQSESDRHAFAVQGPRRKLMARAR 132 (270) T ss_pred HHHEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE-ECCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 345357887168876316999999999997417876777325799-72775-3102545454301334478999999999 Q ss_pred HHHHHHCCC-------CEEEEEEE Q ss_conf 799973078-------11687883 Q gi|254780612|r 454 NSKILEEYR-------FIDIPITY 470 (503) Q Consensus 454 n~~ll~~~~-------w~dip~~y 470 (503) |--|..... |+|..|+- T Consensus 133 N~Ll~~aL~p~~swVlW~DaDiv~ 156 (270) T pfam03452 133 NWLLSTALKPYHSWVLWLDADIVE 156 (270) T ss_pred HHHHHHHCCCCCCEEEEEECCHHH T ss_conf 999998438876469998264322 No 88 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=20.94 E-value=51 Score=12.75 Aligned_cols=33 Identities=12% Similarity=-0.060 Sum_probs=25.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEEEECCCC-EEEEEE Q ss_conf 256887799973078116878837996-789996 Q gi|254780612|r 448 EEDPFRNSKILEEYRFIDIPITYRSGQ-KILFTI 480 (503) Q Consensus 448 ~~~~~~n~~ll~~~~w~dip~~y~ngr-ra~~~~ 480 (503) +.+...=+.||.+++---+|++..+|| ..|||. T Consensus 67 d~~l~~a~~~M~~~~i~~lPVVD~~grlvGivT~ 100 (106) T cd04582 67 DDDLRIVLSRMFAHDMSWLPCVDEDGRYVGEVTQ 100 (106) T ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEH T ss_conf 9999999999997298762589899909999986 No 89 >TIGR01858 tag_bisphos_ald class II aldolase, tagatose bisphosphate family; InterPro: IPR011288 This entry describes tagatose-1,6-bisphosphate aldolases, and perhaps other closely related class II aldolases. This tetrameric, Zn2+-dependent enzyme is related to the class II fructose bisphosphate aldolase; fructose 1,6-bisphosphate and tagatose 1,6 bisphosphate differ only in chirality at C4 , .; GO: 0008270 zinc ion binding, 0016832 aldehyde-lyase activity, 0019512 lactose catabolic process via tagatose-6-phosphate. Probab=20.88 E-value=38 Score=13.61 Aligned_cols=44 Identities=14% Similarity=0.183 Sum_probs=29.1 Q ss_pred HHHCCCCEEEEEEECCC--------CEE-EEEECCCCCH---HHHHHHHHHHHHHC Q ss_conf 97307811687883799--------678-9996158768---99999999998522 Q gi|254780612|r 457 ILEEYRFIDIPITYRSG--------QKI-LFTIDKGKKG---ADIFKSAIMQWENR 500 (503) Q Consensus 457 ll~~~~w~dip~~y~ng--------rra-~~~~ekg~~g---~~~f~~a~~~w~~~ 500 (503) |-+=|.|.|||||.=.+ ||+ .|=|=|=--| .=+|.+|+++|... T Consensus 192 L~~IR~~v~vPLVLHGASdvpde~v~~~IeLGi~KvNVATeLKiAFS~AlK~Yf~e 247 (282) T TIGR01858 192 LAKIREKVDVPLVLHGASDVPDEDVRRTIELGICKVNVATELKIAFSGALKAYFLE 247 (282) T ss_pred HHHHHHCCCCCEEECCCCCCCHHHHHHHHHHCCCEEECCHHCCHHHHHHHHHHHHH T ss_conf 98752203777242177788767899998707413411111003689999999974 No 90 >cd01339 LDH-like_MDH L-lactate dehydrogenase-like malate dehydrogenase proteins. Members of this subfamily have an LDH-like structure and an MDH enzymatic activity. Some members, like MJ0490 from Methanococcus jannaschii, exhibit both MDH and LDH activities. Tetrameric MDHs, including those from phototrophic bacteria, are more similar to LDHs than to other MDHs. LDH catalyzes the last step of glycolysis in which pyruvate is converted to L-lactate. MDH is one of the key enzymes in the citric acid cycle, facilitating both the conversion of malate to oxaloacetate and replenishing levels of oxalacetate by reductive carboxylation of pyruvate. The LDH-like MDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenas Probab=20.77 E-value=23 Score=15.14 Aligned_cols=45 Identities=11% Similarity=0.240 Sum_probs=29.8 Q ss_pred EECCCCCCCCCEEEEEE-------EEEECCCCC-CCCHHHCCCCEECCCCCCC Q ss_conf 95178987778128899-------997168888-7321103231021332124 Q gi|254780612|r 380 TLKCNADISLSITHVME-------IMFSFPKES-QDAVVDLRRISMRKTDNSP 424 (503) Q Consensus 380 ~~rrn~d~slpash~~e-------~~f~~~~~~-~~~v~~v~~~~~k~~e~~~ 424 (503) .|.+|...-||+|+.++ +.|.+|--. ..||..|-.+.|-..||.+ T Consensus 235 aIl~d~~~v~pvs~~~~g~yG~~~v~~s~P~~ig~~Gv~~i~~l~L~~~E~~~ 287 (300) T cd01339 235 AILKDKKRVLPCSAYLEGEYGIKDIFVGVPVVLGKNGVEKIIELDLTDEEKEA 287 (300) T ss_pred HHHCCCCCEEEEEEEECCCCCCCCEEEEEEEEECCCCEEEECCCCCCHHHHHH T ss_conf 99647897899999844777860479998999858831898589999999999 No 91 >cd03557 L-arabinose_isomerase L-Arabinose isomerase (AI) catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion into D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute. Probab=20.73 E-value=52 Score=12.72 Aligned_cols=108 Identities=11% Similarity=0.118 Sum_probs=65.3 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCC--CCCC--CE--EEEEEECCCCCEE--------EEEEECCCCCCCCCEEEEEE Q ss_conf 345545455544466622123045777--7767--42--3676307877717--------99995178987778128899 Q gi|254780612|r 331 VFINKGRGQSSILSGKILWSLQQEKSQ--GLKG--LV--IKGDIPMIDNAFS--------ASMTLKCNADISLSITHVME 396 (503) Q Consensus 331 ~~~~e~~~~~~~~~GsVvWs~~~es~~--~~~~--~a--ira~i~iP~~~~~--------~~~~~rrn~d~slpash~~e 396 (503) +.+-|+...-+..+|-.+-+|-.+.-+ +.-+ +| +|+-=-+ ..+|. -+|-|.++.+.-|= ||..| T Consensus 270 Tt~FEdL~Gl~QLPGLAvQrLMa~GYGFggEGDwKTAal~r~mKvm-~~gl~gGtSFMEDYTy~f~~g~~~iLG-AHMLE 347 (484) T cd03557 270 TTTFEDLHGLKQLPGLAVQRLMAEGYGFGAEGDWKTAALVRAMKVM-SAGLPGGTSFMEDYTYHFEPGNELVLG-AHMLE 347 (484) T ss_pred EEHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCCCCCCHHHHHCCCCCCEEEHH-HHCCC T ss_conf 0012045670016216787786538663444118999999999998-378999822144544028999844303-10055 Q ss_pred EEEECCCC-------C-C-CCHHHCCCCEECCCCCCCCCCCCCEEEEECCCEEEEE Q ss_conf 99716888-------8-7-3211032310213321246631100167469828998 Q gi|254780612|r 397 IMFSFPKE-------S-Q-DAVVDLRRISMRKTDNSPSVLIDSNIFVISKNSYLIS 443 (503) Q Consensus 397 ~~f~~~~~-------~-~-~~v~~v~~~~~k~~e~~~g~~l~~~~~kv~~~~f~v~ 443 (503) +-++.-.+ + | |+-.+-.||.|.... |..+..--+-..|.|+||. T Consensus 348 vcpsIA~~KP~levhPLgIGgk~DP~RLVF~~~~---Gpav~vslvD~G~rFRLiv 400 (484) T cd03557 348 VCPSIAAAKPRLEVHPLGIGGKEDPARLVFTGKA---GPAVNASLVDMGNRFRLVV 400 (484) T ss_pred CCHHHHCCCCEEEECCCCCCCCCCCCEEEEECCC---CCEEEEEEEECCCCEEEEE T ss_conf 6815536895467615787886886269997698---8879999986588479999 No 92 >PRK05673 dnaE DNA polymerase III subunit alpha; Validated Probab=20.50 E-value=52 Score=12.68 Aligned_cols=44 Identities=7% Similarity=0.080 Sum_probs=22.5 Q ss_pred HHCCCCEEEEEEECCCC-EEEEEECCC--CCHHHHHHHHHHHHHHCC Q ss_conf 73078116878837996-789996158--768999999999985223 Q gi|254780612|r 458 LEEYRFIDIPITYRSGQ-KILFTIDKG--KKGADIFKSAIMQWENRS 501 (503) Q Consensus 458 l~~~~w~dip~~y~ngr-ra~~~~ekg--~~g~~~f~~a~~~w~~~~ 501 (503) .+.++=..|-|.|.+.. ...|.|-+. ..-+..|-+.+..-.+.. T Consensus 1092 ~~~~G~~~v~l~~~~~~~~~~~~l~~~~~v~~~~~l~~~l~~l~G~~ 1138 (1143) T PRK05673 1092 EPGPGTVPVHLYYQDGDAEAELRLGDRWRVSPSDELLGDLKALLGVE 1138 (1143) T ss_pred HHCCCCCEEEEEEECCCEEEEEECCCCCEECCCHHHHHHHHHHHCCC T ss_conf 70989986999997698279998699852777999999999850976 No 93 >COG2014 Uncharacterized conserved protein [Function unknown] Probab=20.38 E-value=22 Score=15.35 Aligned_cols=23 Identities=22% Similarity=0.209 Sum_probs=15.0 Q ss_pred HHCCCCEEEEEEECCCC-EEEEEE Q ss_conf 73078116878837996-789996 Q gi|254780612|r 458 LEEYRFIDIPITYRSGQ-KILFTI 480 (503) Q Consensus 458 l~~~~w~dip~~y~ngr-ra~~~~ 480 (503) |++++|+||-..-.+++ .+.|.+ T Consensus 98 l~~~~~~Dil~li~~~d~IkmI~~ 121 (250) T COG2014 98 LEEANWFDILDLIQRDDKIKMIAE 121 (250) T ss_pred HHHCCHHHHHHHHCCCCCEEEEEE T ss_conf 775346779998747873457775 No 94 >COG4742 Predicted transcriptional regulator [Transcription] Probab=20.17 E-value=18 Score=15.90 Aligned_cols=22 Identities=9% Similarity=0.064 Sum_probs=17.0 Q ss_pred EEEEECCCEEEEECCCCCHHHH Q ss_conf 0167469828998178825688 Q gi|254780612|r 431 NIFVISKNSYLISLKGSEEDPF 452 (503) Q Consensus 431 ~~~kv~~~~f~v~l~~~~~~~~ 452 (503) +.+-|||.||+++|-+.+..-+ T Consensus 207 ~~~~VTD~f~~L~Lf~k~G~~d 228 (260) T COG4742 207 ASLIVTDRFMALSLFDKDGRYD 228 (260) T ss_pred EEEEEECHHHHHCCCCCCCCCC T ss_conf 2688861146400415699807 Done!