Query gi|254780660|ref|YP_003065073.1| hypothetical protein CLIBASIA_02735 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 244 No_of_seqs 153 out of 2227 Neff 6.6 Searched_HMMs 23785 Date Tue May 31 17:22:37 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780660.hhm -d /home/congqian_1/database/pdb/pdb70.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 1lfp_A Hypothetical protein AQ 100.0 0 0 661.3 21.0 244 1-244 1-245 (249) 2 1kon_A Protein YEBC, YEBC; alp 100.0 0 0 655.2 15.2 241 1-244 4-246 (249) 3 1mw7_A Hypothetical protein HP 100.0 0 0 573.0 19.7 231 6-238 4-240 (240) 4 2jsx_A Protein NAPD; TAT, proo 56.5 11 0.00045 17.7 4.7 67 169-240 9-82 (95) 5 3fyb_A Protein of unknown func 42.4 12 0.0005 17.4 2.3 20 214-233 15-34 (104) 6 3lk7_A UDP-N-acetylmuramoylala 41.3 18 0.00076 16.1 4.8 62 48-114 282-343 (451) 7 3lh2_S 4E10_1VI7A_S0_002_N (T8 40.6 9.7 0.00041 18.0 1.6 42 169-210 9-50 (76) 8 2o35_A Hypothetical protein DU 38.8 13 0.00056 17.0 2.1 20 214-233 16-35 (105) 9 2epi_A UPF0045 protein MJ1052; 37.1 8 0.00034 18.6 0.7 54 181-241 28-82 (100) 10 1d1r_A Hypothetical 11.4 KD pr 34.2 23 0.00096 15.3 5.1 70 80-160 31-101 (116) 11 3hno_A Pyrophosphate-dependent 33.5 11 0.00047 17.6 1.0 177 36-223 9-237 (419) 12 2x5o_A UDP-N-acetylmuramoylala 33.0 24 0.001 15.2 3.9 61 49-114 272-332 (439) 13 2f00_A UDP-N-acetylmuramate--L 32.0 25 0.001 15.1 7.8 70 47-118 295-369 (491) 14 2ibo_A Hypothetical protein SP 31.1 13 0.00055 17.1 1.0 45 195-241 31-78 (104) 15 1vk8_A Hypothetical protein TM 30.5 15 0.00062 16.7 1.2 39 201-241 52-91 (106) 16 2w0c_P Protein P3, protein III 30.5 10 0.00043 17.8 0.4 19 2-20 25-43 (104) 17 3lap_A Arginine repressor; arg 30.2 26 0.0011 14.9 4.7 90 101-191 29-136 (170) 18 1lxn_A Hypothetical protein MT 28.1 4.1 0.00017 20.7 -2.0 44 195-241 31-80 (99) 19 2cvi_A 75AA long hypothetical 25.6 32 0.0013 14.3 2.2 69 169-241 5-77 (83) 20 3k85_A D-glycero-D-manno-hepto 24.7 33 0.0014 14.2 2.1 63 148-210 277-343 (357) 21 2ko1_A CTR148A, GTP pyrophosph 24.1 34 0.0014 14.1 5.8 58 174-234 14-76 (88) 22 2wbr_A GW182, gawky, LD47780P; 23.3 33 0.0014 14.2 1.9 31 89-121 3-33 (89) 23 1zl0_A Hypothetical protein PA 22.0 29 0.0012 14.6 1.5 14 16-29 63-76 (311) 24 1b4a_A Arginine repressor; hel 21.9 37 0.0016 13.9 6.4 91 101-192 14-116 (149) 25 1lxj_A YBL001C, hypothetical 1 21.9 6.5 0.00027 19.3 -1.9 52 181-241 28-83 (104) 26 2cve_A Hypothetical protein TT 21.7 37 0.0016 13.8 4.1 52 167-221 127-180 (191) 27 3m16_A Transaldolase; dimer, m 21.1 30 0.0012 14.5 1.3 76 40-118 33-126 (329) 28 2vse_A MTX holotoxin, mosquito 20.6 32 0.0014 14.3 1.4 10 85-94 141-150 (841) 29 1t95_A Hypothetical protein AF 20.4 38 0.0016 13.8 1.7 43 19-73 96-138 (240) 30 1fp1_D Isoliquiritigenin 2'-O- 20.0 28 0.0012 14.7 1.0 10 31-40 40-49 (372) No 1 >1lfp_A Hypothetical protein AQ_1575; NEW fold, thermostability, structural genomics, BSGC structure funded by NIH, protein structure initiative, PSI; 1.72A {Aquifex aeolicus} SCOP: e.39.1.1 Probab=100.00 E-value=0 Score=661.27 Aligned_cols=244 Identities=40% Similarity=0.637 Sum_probs=238.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-CCC Q ss_conf 9774112655646775578889799999999999998189994148999999999996688878999999850377-666 Q gi|254780660|r 1 MAGHSQFKNIMHRKERKDALKSKIFSKLSREITVSAKLSGQNPLENPRLRLAIQNAKNQSMPKENIERAIKKAGSD-DLG 79 (244) Q Consensus 1 maGHsKW~nIkh~K~~~D~~k~k~f~k~~keI~~A~k~gG~dp~~N~~L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~-~~~ 79 (244) ||||||||||||+||++|++||++|+||+|+|++|+|+|||||+.|||||+||++||++||||++|||||+||.|. +.. T Consensus 1 maGHsKW~~Ikh~K~~~D~~k~k~f~k~~r~I~~A~k~GG~dp~~N~~L~~ai~~Ak~~nmPk~~Ie~AIkk~~g~~~~~ 80 (249) T 1lfp_A 1 MAGHSHWAQIKHKKAKVDAQRGKLFSKLIREIIVATRLGGPNPEFNPRLRTAIEQAKKANMPWENIERAIKKGAGELEGE 80 (249) T ss_dssp ----CCSCCSSSSSSCTTTSHHHHHHHHHHHHHHHHHHHCSCGGGCHHHHHHHHHHHHTTCCHHHHHHHHHHHHSCSSSC T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCC T ss_conf 99652666765688898999999999999999999972899976199999999999984999889999998504566666 Q ss_pred CCCEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEECCCCCCCHHHHHHHHCCC Q ss_conf 64023678650599399999944630235899898764256831478860456864472797067753013667751268 Q gi|254780660|r 80 NYTNIRYEGYGPEGVAIIIEALTDNRNRTASSIRSIFTKANGSLGSTGSTTRFFEQIGEIIYHSNIGDSNLAMEVAIESD 159 (244) Q Consensus 80 ~~~~~~yEg~gP~gvaiiVe~lTDN~nRt~~~vr~~f~K~gG~lg~~Gsv~~~F~~~G~i~~~~~~~d~d~~~e~aie~g 159 (244) +|++++|||||||||+|||||||||+|||+++|||+|+|+||+||++|||+|||+|+|+|.++.+..++|.+++.+||+| T Consensus 81 ~~~~~~yEg~gP~GvaiiVe~lTDN~nRt~~~vr~~f~K~gG~lg~~GsV~~~F~~~G~i~~~~~~~~ed~l~e~aie~g 160 (249) T 1lfp_A 81 QFEEVIYEGYAPGGVAVMVLATTDNRNRTTSEVRHVFTKHGGNLGASGCVSYLFERKGYIEVPAKEVSEEELLEKAIEVG 160 (249) T ss_dssp CCEEEEEEEEETTTEEEEEEEEESCHHHHHHHHHHHHHHTTCEECCTTSSGGGEEEEEEEEEEGGGSCHHHHHHHHHHHT T ss_pred CEEEEEEEEECCCCCEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHEECCEEEECCCCCCHHHHHHHHHHCC T ss_conf 60799999987888589999938877789999999998718814788505676101132762588776778899987369 Q ss_pred CCCCCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 75223468825999643201345665420256742221786037610248989999999999875238881311025626 Q gi|254780660|r 160 AFEVLFEDQEYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLNYIRLSNADKVKSIIKMIENLEDDDDVQSVYSNLEI 239 (244) Q Consensus 160 a~Dv~~~d~~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~~V~l~d~e~~~~~~klie~Led~DDVq~VytN~~i 239 (244) |+|++.+++.++|+|+|++|.+|+++|++.++.+++++++|+|+++|+|+++|+++++.+|+++|||+||||+||||++| T Consensus 161 aedve~~e~~~~i~~~~~~~~~v~~~Le~~g~~i~~sei~~~P~~~v~l~dee~~~~~~kLid~Lee~dDVq~Vy~N~ei 240 (249) T 1lfp_A 161 AEDVQPGEEVHIIYTVPEELYEVKENLEKLGVPIEKAQITWKPISTVQINDEETAQKVIKLLNALEELDDVQQVIANFEI 240 (249) T ss_dssp CSEEECCSSEEEEEECGGGHHHHHHHHHTTTCCCSEEEEEEEESSCEECCCHHHHHHHHHHHHHHHTSTTEEEEEECEEC T ss_pred CCCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCEECCCHHHHHHHHHHHHHHHCCCCCCCEEECCCC T ss_conf 62001577518996079999999999987487612020268507834248799999999999987452170125367866 Q ss_pred CCCCC Q ss_conf 62439 Q gi|254780660|r 240 ADDII 244 (244) Q Consensus 240 ~eeil 244 (244) +||+| T Consensus 241 ~ee~~ 245 (249) T 1lfp_A 241 PEEIL 245 (249) T ss_dssp CSCCS T ss_pred CHHHH T ss_conf 99999 No 2 >1kon_A Protein YEBC, YEBC; alpha/beta, two-domains, montreal-kingston bacterial structural genomics initiative, BSGI, structural genomics; 2.20A {Escherichia coli} SCOP: e.39.1.1 Probab=100.00 E-value=0 Score=655.21 Aligned_cols=241 Identities=39% Similarity=0.637 Sum_probs=231.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-CCC Q ss_conf 9774112655646775578889799999999999998189994148999999999996688878999999850377-666 Q gi|254780660|r 1 MAGHSQFKNIMHRKERKDALKSKIFSKLSREITVSAKLSGQNPLENPRLRLAIQNAKNQSMPKENIERAIKKAGSD-DLG 79 (244) Q Consensus 1 maGHsKW~nIkh~K~~~D~~k~k~f~k~~keI~~A~k~gG~dp~~N~~L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~-~~~ 79 (244) ||||||||||||+||++|++||++|+||+|+|++|||+|||||++|+|||+||++||++||||++||||||||.|. ... T Consensus 4 maGHsKW~nIkh~K~a~D~~k~k~f~k~~k~I~~A~k~GG~Dp~~N~~L~~ai~~Ak~~nmPkd~IerAIkk~~g~~~~~ 83 (249) T 1kon_A 4 MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLRAAVDKALSNNMTRDTLNRAIARGVGGDDDA 83 (249) T ss_dssp --CCCSGGGTSSSTTTTTSCHHHHHHHHHHHHHHHHHC-----CCSTTTHHHHHHHHHTTCCHHHHHHHHSCC------C T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCC T ss_conf 87266767765788999999999999999999999971799976299999999999980998799999999625778887 Q ss_pred CCCEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEECCCCCCCHHHHHHHHCCC Q ss_conf 64023678650599399999944630235899898764256831478860456864472797067753013667751268 Q gi|254780660|r 80 NYTNIRYEGYGPEGVAIIIEALTDNRNRTASSIRSIFTKANGSLGSTGSTTRFFEQIGEIIYHSNIGDSNLAMEVAIESD 159 (244) Q Consensus 80 ~~~~~~yEg~gP~gvaiiVe~lTDN~nRt~~~vr~~f~K~gG~lg~~Gsv~~~F~~~G~i~~~~~~~d~d~~~e~aie~g 159 (244) +|++++|||||||||+|||||||||+|||+++||++|+|+||+||++|||+|||+|+|+|.++.. ++|.+++++||+| T Consensus 84 ~~~ei~yEg~gPgGvaiiVe~lTDN~nRt~~~vR~~f~K~gG~Lg~~Gsv~~~F~rkG~i~~~~~--~~d~~~e~aie~G 161 (249) T 1kon_A 84 NMETIIYEGYGPGGTAIMIECLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKG--DEDTIMEAALEAG 161 (249) T ss_dssp CCEEEEEEEEETTTEEEEEEEEESCHHHHHHHHHHHHHTTTCEECCTTSSGGGEEEEEEEEESSS--CHHHHHHHHHHHT T ss_pred CEEEEEEEEECCCCCEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEEEECCC--CHHHHHHHHHHCC T ss_conf 66999999986898489999948987778999998998649963797631523135345631789--9889999999758 Q ss_pred CCCCCC-CCCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 752234-6882599964320134566542025674222178603761024898999999999987523888131102562 Q gi|254780660|r 160 AFEVLF-EDQEYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLNYIRLSNADKVKSIIKMIENLEDDDDVQSVYSNLE 238 (244) Q Consensus 160 a~Dv~~-~d~~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~~V~l~d~e~~~~~~klie~Led~DDVq~VytN~~ 238 (244) |+||+. +++.++|+|+|++|.+|+++|++.++.+++++|+|+|+++|+|++ |+++++.+|++.|||+||||+||||++ T Consensus 162 AeDv~~~d~~~~~i~~~~~~l~~v~~~Le~~g~~~~~sei~~~P~~~v~l~~-e~~~kv~kLie~Led~DDVq~Vy~N~e 240 (249) T 1kon_A 162 AEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSMIPSTKADMDA-ETAPKLMRLIDMLEDCDDVQEVYHNGE 240 (249) T ss_dssp CSEEEECTTSCEEEEEEGGGHHHHHHHHHHTTCCCSEEEEEEEESSCCCCCT-TTSHHHHHHHHHHHHSSSEEEEEECCC T ss_pred CCCCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCH-HHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 4002245778439982787799999999866887056689997588533299-999999999998744447242878885 Q ss_pred CCCCCC Q ss_conf 662439 Q gi|254780660|r 239 IADDII 244 (244) Q Consensus 239 i~eeil 244 (244) |+||+| T Consensus 241 i~ee~l 246 (249) T 1kon_A 241 ISDEVA 246 (249) T ss_dssp CCHHHH T ss_pred CCHHHH T ss_conf 299998 No 3 >1mw7_A Hypothetical protein HP0162; structural genomics, PSI, protein structure initiative, northeast structural genomics consortium, NESG; 2.00A {Helicobacter pylori} SCOP: e.39.1.1 Probab=100.00 E-value=0 Score=572.96 Aligned_cols=231 Identities=33% Similarity=0.434 Sum_probs=220.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 12655646775578889799999999999998189994148999999999996688878999999850377666640236 Q gi|254780660|r 6 QFKNIMHRKERKDALKSKIFSKLSREITVSAKLSGQNPLENPRLRLAIQNAKNQSMPKENIERAIKKAGSDDLGNYTNIR 85 (244) Q Consensus 6 KW~nIkh~K~~~D~~k~k~f~k~~keI~~A~k~gG~dp~~N~~L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~~~~~~~~~~ 85 (244) +|+++||+|+++|++|||+|+||+|+|++|||+|||||+.|||||+||++||++||||++|||||+|+++.+ ++|++++ T Consensus 4 ~~~~~k~~k~~~d~~rsk~f~kl~reI~vA~k~GG~DP~~N~~L~~ai~~Ak~~nmPkd~IerAIkk~~~~~-~~~~e~~ 82 (240) T 1mw7_A 4 AFEYRRAAKEKRWDKMSKVFPKLAKAITLAAKDGGSEPDTNAKLRTAILNAKAQNMPKDNIDAAIKRASSKE-GNLSEIT 82 (240) T ss_dssp -----------------CCHHHHHHHHHHHHHTTCSCGGGCHHHHHHHHHHHHHTCCHHHHHHHHHHTTSTT-CCCEEEE T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCC-CCCEEEE T ss_conf 588898776440788889999999999999971899977299999999999983999899999998505577-8808999 Q ss_pred EEEECCCCEEEEEEEECCCCHHHHHHHHHHHHHC-CCCCCCCCCCHHHHHHCCEEEECCCC-----CCCHHHHHHHHCCC Q ss_conf 7865059939999994463023589989876425-68314788604568644727970677-----53013667751268 Q gi|254780660|r 86 YEGYGPEGVAIIIEALTDNRNRTASSIRSIFTKA-NGSLGSTGSTTRFFEQIGEIIYHSNI-----GDSNLAMEVAIESD 159 (244) Q Consensus 86 yEg~gP~gvaiiVe~lTDN~nRt~~~vr~~f~K~-gG~lg~~Gsv~~~F~~~G~i~~~~~~-----~d~d~~~e~aie~g 159 (244) |||||||||+|||||||||+|||+++||++|+|+ ||+||++|||+|||+|+|+|.++++. .++|.+++.+||+| T Consensus 83 YEg~gP~GvaiiVe~lTDN~nRt~~~vR~~f~K~~Gg~l~~~GSV~~~F~rkG~i~~~~~~~~~~~~~~d~~~~~aie~G 162 (240) T 1mw7_A 83 YEGKANFGVLIIMECMTDNPTRTIANLKSYFNKTQGASIVPNGSLEFMFNRKSVFECLKNEVENLKLSLEDLEFALIDYG 162 (240) T ss_dssp EEEEETTTEEEEEEEEESCHHHHHHHHHHHHTTSTTCEEECTTTTTTSEEEEEEEEEEHHHHHHTTCCHHHHHHHHGGGT T ss_pred EEEECCCCEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEECCCCCCHHHCCHHHHHHHHHHCC T ss_conf 99987997499999957975567999999998628955466785479999758999767741021047778877765047 Q ss_pred CCCCCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 7522346882599964320134566542025674222178603761024898999999999987523888131102562 Q gi|254780660|r 160 AFEVLFEDQEYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLNYIRLSNADKVKSIIKMIENLEDDDDVQSVYSNLE 238 (244) Q Consensus 160 a~Dv~~~d~~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~~V~l~d~e~~~~~~klie~Led~DDVq~VytN~~ 238 (244) |+||+.+++.++|+|+|++|..|+++|++.++.+.+++++|+|+++|+|++ |+++++.+|+|.|||+||||+||||++ T Consensus 163 aeDve~~e~~~~i~~~~~~~~~v~~~Le~~~~~~~~ael~~ip~~~Vel~~-e~~~~~~kLid~Lee~DDVq~VytN~e 240 (240) T 1mw7_A 163 LEELEEVEDKIIIRGDYNSFKLLNEGFESLKLPILKASLQRIATTPIELND-EQMELTEKLLDRIEDDDDVVALYTNIE 240 (240) T ss_dssp EEEEEEETTEEEEEEEGGGHHHHHHHHHHTTCCCSEEEEEEEESSCBCCCH-HHHHHHHHHHHHHHTSTTEEEEEESBC T ss_pred CCCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCEECCH-HHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 410123687306733147689999999864776002478994288353499-999999999998755257360511789 No 4 >2jsx_A Protein NAPD; TAT, proofreading, cytoplasm, chaperone; NMR {Escherichia coli K12} PDB: 2pq4_A Probab=56.49 E-value=11 Score=17.71 Aligned_cols=67 Identities=18% Similarity=0.244 Sum_probs=46.1 Q ss_pred CEEEEECCCCHHHHHHHHHHCCCCCCCCEEEEC--CC--CCEECCCHHHHHHHHHHHHHHHCCCCCCC---CCCCCCCC Q ss_conf 259996432013456654202567422217860--37--61024898999999999987523888131---10256266 Q gi|254780660|r 169 EYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWK--PL--NYIRLSNADKVKSIIKMIENLEDDDDVQS---VYSNLEIA 240 (244) Q Consensus 169 ~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~--P~--~~V~l~d~e~~~~~~klie~Led~DDVq~---VytN~~i~ 240 (244) .+.|.|.|+.+..|+..|.+.. .+|+-.. +. --|.+..+. ...+.+.++.|+.++.|.+ ||+.++-. T Consensus 9 SlVV~~~Pe~~~~V~~~l~~~~----g~EIh~~d~~~GKiVVtiE~~~-~~~~~~~i~~I~~l~GVlsa~lVY~~~e~e 82 (95) T 2jsx_A 9 SLVVQAKSERISDISTQLNAFP----GCEVAVSDAPSGQLIVVVEAED-SETLIQTIESVRNVEGVLAVSLVYHQQEEQ 82 (95) T ss_dssp EEEEEECTTSHHHHHHHHTTST----TEEEEEEETTTTEEEEEEEESS-HHHHHHHHHHHTTSTTEEEEEESSCCCCCC T ss_pred EEEEEECHHHHHHHHHHHHCCC----CCEEEEECCCCCCEEEEEEECC-HHHHHHHHHHHHCCCCCEEEEEEEEEECCC T ss_conf 8999978788999999997599----9789620388970999997398-679999999987699832786778871566 No 5 >3fyb_A Protein of unknown function (DUF1244); hydrocarbon-degrading, structural genomics, PSI-2, protein structure initiative; HET: PEG; 1.80A {Alcanivorax borkumensis SK2} Probab=42.42 E-value=12 Score=17.36 Aligned_cols=20 Identities=20% Similarity=0.539 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999875238881311 Q gi|254780660|r 214 VKSIIKMIENLEDDDDVQSV 233 (244) Q Consensus 214 ~~~~~klie~Led~DDVq~V 233 (244) +..+-+|+.-|.+.-||||| T Consensus 15 AaaFrrL~~HL~~r~DVQNI 34 (104) T 3fyb_A 15 AAAFRHLLRHLDEHKDVQNI 34 (104) T ss_dssp HHHHHHHHHHHHTCTTSCHH T ss_pred HHHHHHHHHHHHHCCCHHHH T ss_conf 99999999998507722688 No 6 >3lk7_A UDP-N-acetylmuramoylalanine--D-glutamate ligase; agalacitae, PSI, MCSG, structural genomics, midwest center for structural genomics; HET: MSE; 1.50A {Streptococcus agalactiae} Probab=41.28 E-value=18 Score=16.09 Aligned_cols=62 Identities=13% Similarity=0.146 Sum_probs=43.3 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCHHHHHHHHH Q ss_conf 9999999999668887899999985037766664023678650599399999944630235899898 Q gi|254780660|r 48 RLRLAIQNAKNQSMPKENIERAIKKAGSDDLGNYTNIRYEGYGPEGVAIIIEALTDNRNRTASSIRS 114 (244) Q Consensus 48 ~L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~~~~~~~~~~yEg~gP~gvaiiVe~lTDN~nRt~~~vr~ 114 (244) .+.+|+..|...++|-+.|..+|++-.+-.. -++-+ ..++|+.+|.+....|..-+...++. T Consensus 282 NalaA~a~a~~lGi~~~~i~~~L~~f~~~~g-R~e~i----~~~~g~~iidD~~~~~~~s~~~al~~ 343 (451) T 3lk7_A 282 NALATIAVAKLAGISNQVIRETLSNFGGVKH-RLQSL----GKVHGISFYNDSKSTNILATQKALSG 343 (451) T ss_dssp HHHHHHHHHHHHTCCHHHHHHHHHHCCCCTT-SSEEE----EEETTEEEEECTTCCSHHHHHHHHHT T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHCCCCHH-HHEEE----ECCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 0899999999869999999999985666501-31577----41388699956774169999999974 No 7 >3lh2_S 4E10_1VI7A_S0_002_N (T88); epitope-scaffold, immune system; 2.65A {Artificial gene} Probab=40.61 E-value=9.7 Score=18.00 Aligned_cols=42 Identities=17% Similarity=0.070 Sum_probs=30.8 Q ss_pred CEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCEECCC Q ss_conf 259996432013456654202567422217860376102489 Q gi|254780660|r 169 EYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLNYIRLSN 210 (244) Q Consensus 169 ~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~~V~l~d 210 (244) .+.+.|++..+..++..|++.++.+.+.+..-.-.-.+.++. T Consensus 9 e~tl~~dY~~~g~l~~~L~~~~~~i~~~~Y~~~V~l~v~vp~ 50 (76) T 3lh2_S 9 EYTLQANWFDITGILWLLGQVDGKIINSDVQAFVLLRVALPA 50 (76) T ss_dssp EEEEEECHHHHHHHHHHHHHTTCEEEEEEEEEEEEEEEEECC T ss_pred EEEEEECCCCHHHHHHHHHHCCCEEEEEEEEEEEEEEEECCH T ss_conf 999994544679999999987979983354207999998488 No 8 >2o35_A Hypothetical protein DUF1244; helix bundle, structural genomics, PSI-2, protein structure initiative; HET: MSE; 2.12A {Sinorhizobium meliloti} SCOP: a.293.1.1 Probab=38.83 E-value=13 Score=17.03 Aligned_cols=20 Identities=25% Similarity=0.542 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999875238881311 Q gi|254780660|r 214 VKSIIKMIENLEDDDDVQSV 233 (244) Q Consensus 214 ~~~~~klie~Led~DDVq~V 233 (244) +..+-+|+.-|.+.-||||| T Consensus 16 AaaFRrL~~HL~~r~DVQNI 35 (105) T 2o35_A 16 AAVFRRLLEHLRERSDVQNI 35 (105) T ss_dssp HHHHHHHHHHHHHTTTSCHH T ss_pred HHHHHHHHHHHHHCCCHHHH T ss_conf 99999999998627730578 No 9 >2epi_A UPF0045 protein MJ1052; NPPSFA, national project on protein structural and functional analyses; 1.70A {Methanocaldococcus jannaschii DSM2661} PDB: 2eky_A Probab=37.12 E-value=8 Score=18.61 Aligned_cols=54 Identities=13% Similarity=0.103 Sum_probs=30.5 Q ss_pred HHHHHHHHCCCCCCCCEEEECCC-CCEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 45665420256742221786037-61024898999999999987523888131102562662 Q gi|254780660|r 181 LTSKKLEEKIGEAQSIKVIWKPL-NYIRLSNADKVKSIIKMIENLEDDDDVQSVYSNLEIAD 241 (244) Q Consensus 181 ~v~~~Le~~~~~~~~sei~~~P~-~~V~l~d~e~~~~~~klie~Led~DDVq~VytN~~i~e 241 (244) .+.+.|++.|+.+ ...|. |.++-+=++.++.+.+..+.+.+ ++.+||+++.|++ T Consensus 28 ~~i~~i~~sGl~y-----~~~pmgT~IEG~~dev~~~i~~~~e~~~~--g~~RV~t~ikId~ 82 (100) T 2epi_A 28 KAIEVFKKYDLKV-----ETNAMGTVLEGDLDEILKAFKEAHSTVLN--DVDRVVSSLKIDE 82 (100) T ss_dssp HHHHHHTTSSCEE-----EEETTEEEEEEEHHHHHHHHHHHHHHHHT--TSSEEEEEEEEEE T ss_pred HHHHHHHHCCCCE-----ECCCCCCEEECCHHHHHHHHHHHHHHHHC--CCCEEEEEEEEEE T ss_conf 9999999759972-----70798157866899999999999999977--9987999999980 No 10 >1d1r_A Hypothetical 11.4 KD protein YCIH in PYRF-OSMB intergenic region; alpha-beta plait, open-faced beta sandwich, ferredoxin-like fold; NMR {Escherichia coli} SCOP: d.64.1.1 Probab=34.15 E-value=23 Score=15.33 Aligned_cols=70 Identities=13% Similarity=0.178 Sum_probs=48.6 Q ss_pred CCCEEEEEEECCCC-EEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEECCCCCCCHHHHHHHHCC Q ss_conf 64023678650599-39999994463023589989876425683147886045686447279706775301366775126 Q gi|254780660|r 80 NYTNIRYEGYGPEG-VAIIIEALTDNRNRTASSIRSIFTKANGSLGSTGSTTRFFEQIGEIIYHSNIGDSNLAMEVAIES 158 (244) Q Consensus 80 ~~~~~~yEg~gP~g-vaiiVe~lTDN~nRt~~~vr~~f~K~gG~lg~~Gsv~~~F~~~G~i~~~~~~~d~d~~~e~aie~ 158 (244) +.-.+++|-.|-+| ..-+|+-|.- ...+++.+.+...-.+|..|||. -+.|.+..+. -+.+.++.++- T Consensus 31 ~~v~i~~e~kgR~GK~VT~V~Gl~~----~~~~lk~l~k~LKk~~g~GGsvk-----~~~I~lQGD~--r~~i~~~L~~~ 99 (116) T 1d1r_A 31 GVVRIQRQTSGRKGKGVCLITGVDL----DDAELTKLAAELKKKCGCGGAVK-----DGVIEIQGDK--RDLLKSLLEAK 99 (116) T ss_dssp CEEEEEECCCSSSSCCCEEEECCCS----CHHHHHHHHHHHTTSSSSCCBCC-----SSCEEECSCC--HHHHHHHHHHH T ss_pred CEEEEEEEECCCCCCEEEEEECCCC----CHHHHHHHHHHHHHHHCCCEEEC-----CCEEEECCCH--HHHHHHHHHHC T ss_conf 1699999836889976999979738----98999999999988725874974-----9999973847--99999999986 Q ss_pred CC Q ss_conf 87 Q gi|254780660|r 159 DA 160 (244) Q Consensus 159 ga 160 (244) |- T Consensus 100 G~ 101 (116) T 1d1r_A 100 GM 101 (116) T ss_dssp TC T ss_pred CC T ss_conf 99 No 11 >3hno_A Pyrophosphate-dependent phosphofructokinase; structural genomics, PSI-2, protein structure initiative; 2.00A {Nitrosospira multiformis atcc 25196} PDB: 3k2q_A Probab=33.50 E-value=11 Score=17.59 Aligned_cols=177 Identities=12% Similarity=0.065 Sum_probs=78.2 Q ss_pred HHHCCCCHHHCHHHHHHHHHHHHCCCC-------HHHHHHHHHHHCCCCCCCCCEEEE------EEE-CCCCEEEEEEEE Q ss_conf 981899941489999999999966888-------789999998503776666402367------865-059939999994 Q gi|254780660|r 36 AKLSGQNPLENPRLRLAIQNAKNQSMP-------KENIERAIKKAGSDDLGNYTNIRY------EGY-GPEGVAIIIEAL 101 (244) Q Consensus 36 ~k~gG~dp~~N~~L~~ai~~Ak~~~mP-------k~~Ie~aIkk~~~~~~~~~~~~~y------Eg~-gP~gvaiiVe~l 101 (244) ++.||+-|-.|+.+|.++..|.+.+.. ++-++..+.. ++.+++- ..+ .-+| .++=-|. T Consensus 9 ~qSGG~apgiNa~i~gvv~~a~~~~~~v~~Vyg~~~G~~Gll~~-------~~i~l~~~~~~~i~~l~~~gG-t~LGssR 80 (419) T 3hno_A 9 AQSGGVTAVINASAAGVIEAARKQSGKIGRIYAGRNGIIGALTE-------DLIDTGQESDAAISALRYTPS-GAFGSCR 80 (419) T ss_dssp EECSSCCSSHHHHHHHHHHHHHHHCSSCCCEEEETTTHHHHHTT-------CEEEGGGSCHHHHHHGGGCCS-CTTCCCC T ss_pred ECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHCC-------CEEECCCCCHHHHHHHCCCCC-CCCCCCC T ss_conf 88878258897999999999998499487997785582456629-------838896135666876445897-6447789 Q ss_pred CC-----CCHHHHHHHHHHHHHCCCC----CCCCCCC------HHHHHH----CCEEEECCCCCCCHHH------HHHHH Q ss_conf 46-----3023589989876425683----1478860------456864----4727970677530136------67751 Q gi|254780660|r 102 TD-----NRNRTASSIRSIFTKANGS----LGSTGST------TRFFEQ----IGEIIYHSNIGDSNLA------MEVAI 156 (244) Q Consensus 102 TD-----N~nRt~~~vr~~f~K~gG~----lg~~Gsv------~~~F~~----~G~i~~~~~~~d~d~~------~e~ai 156 (244) +. ...+....+-..|+|++=. +|..||. +..+.+ .-+|.+++...++-.. ++-|. T Consensus 81 ~k~~~~e~~~~~~~~i~~~l~~~~Id~Li~IGGdgS~~~a~~L~~~~~~~~~~i~vIgIPKTIDNDl~~tD~t~GF~TA~ 160 (419) T 3hno_A 81 YKLKSLEQNRREYERLIEVFKAHDIGYFFYNGGGDSADTCLKVSQLSGTLGYPIQAIHVPKTVDNDLPITDCCPGFGSVA 160 (419) T ss_dssp CC------CHHHHHHHHHHHHHTTEEEEEEEESHHHHHHHHHHHHHHHHTTCCCEEEEEECCTTCCCSSSSSCTTHHHHH T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEECCCCHHHHH T ss_conf 89877531257999999999970998799975807899999999999970899418961243257877633276478899 Q ss_pred CCC-------CCCCCCCC---C---CEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCEECCCHHHHHHHHHHHHH Q ss_conf 268-------75223468---8---2599964320134566542025674222178603761024898999999999987 Q gi|254780660|r 157 ESD-------AFEVLFED---Q---EYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLNYIRLSNADKVKSIIKMIEN 223 (244) Q Consensus 157 e~g-------a~Dv~~~d---~---~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~~V~l~d~e~~~~~~klie~ 223 (244) +.- ..|..... . -+++.....-+-++..+|... .+..-..+++.|..+.++. .-.+.+.+.+.. T Consensus 161 ~~~a~~~~e~~~d~~s~a~s~~rv~IVEvMGR~aGwLAl~a~lA~~-ad~~~~~~iliPE~~~~~~--~~~~~v~~~~~~ 237 (419) T 3hno_A 161 KYIAVSTLEASFDVASMSATSTKVFVLEVMGRHAGWIAAAGGLASS-PEREIPVVILFPEISFDKQ--KFLAKVDSCVKK 237 (419) T ss_dssp HHHHHHHHHHHHHHHHHTTTSCCEEEEEECCSSCCHHHHGGGGGCC-SSSCCCEEEECTTSCCCHH--HHHHHHHHHHHH T ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHC-CCCCCCEEEECCCCCCCHH--HHHHHHHHHHCC T ss_conf 9999999999986666643687389999518765477887676521-6666870786797764677--788899887604 No 12 >2x5o_A UDP-N-acetylmuramoylalanine--D-glutamate ligase; ATP-binding, cell cycle, cell division, cell shape, cell WAL biogenesis/degradation; HET: KCX VSV; 1.46A {Escherichia coli} PDB: 2wjp_A* 2jfg_A* 2jff_A* 2jfh_A* 2uuo_A* 2uup_A* 2vtd_A* 2vte_A* 4uag_A* 1e0d_A* 1uag_A* 1eeh_A* 3uag_A* 2uag_A* Probab=33.00 E-value=24 Score=15.21 Aligned_cols=61 Identities=15% Similarity=0.192 Sum_probs=42.5 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCHHHHHHHHH Q ss_conf 999999999668887899999985037766664023678650599399999944630235899898 Q gi|254780660|r 49 LRLAIQNAKNQSMPKENIERAIKKAGSDDLGNYTNIRYEGYGPEGVAIIIEALTDNRNRTASSIRS 114 (244) Q Consensus 49 L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~~~~~~~~~~yEg~gP~gvaiiVe~lTDN~nRt~~~vr~ 114 (244) +.+|+..|+..++|-+.|..+|++-.+-.. .++ +.+ ..+|+.+|.++.--|...+.+-++. T Consensus 272 alaAia~~~~lGi~~~~i~~~L~~f~gv~~-R~e-~~~---~~~g~~~idD~~a~np~a~~~al~~ 332 (439) T 2x5o_A 272 ALAALALADAAGLPRASSLKALTTFTGLPH-RFE-VVL---EHNGVRWINDSKATNVGSTEAALNG 332 (439) T ss_dssp HHHHHHHHHHTTCCHHHHHHHHHHCCCCTT-SSE-EEE---EETTEEEEECTTCCSHHHHHHHHTT T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHCCCCCC-EEE-EEE---EECCEEEEEECCCCCHHHHHHHHHH T ss_conf 877999999839969999999975779744-268-999---5099899995379998999999996 No 13 >2f00_A UDP-N-acetylmuramate--L-alanine ligase; amide bond ligase, ATPase, bacterial cell WALL; 2.50A {Escherichia coli} Probab=32.01 E-value=25 Score=15.10 Aligned_cols=70 Identities=16% Similarity=0.240 Sum_probs=50.7 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCEEEEE-----EECCCCEEEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 99999999999668887899999985037766664023678-----6505993999999446302358998987642 Q gi|254780660|r 47 PRLRLAIQNAKNQSMPKENIERAIKKAGSDDLGNYTNIRYE-----GYGPEGVAIIIEALTDNRNRTASSIRSIFTK 118 (244) Q Consensus 47 ~~L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~~~~~~~~~~yE-----g~gP~gvaiiVe~lTDN~nRt~~~vr~~f~K 118 (244) ....+|+..|...+++.+.|.++++.-.+-.. -+ ++..| ..+++|..+|++..--|...+.+-++++-.. T Consensus 295 ~NalAAia~a~~lGi~~e~I~~aL~~F~g~~~-R~-E~vg~~~~~~~~~~~~~~~i~DdyahnP~si~a~l~al~~~ 369 (491) T 2f00_A 295 LNAAAAVAVATEEGIDDEAILRALESFQGTGR-RF-DFLGEFPLEPVNGKSGTAMLVDDYGHHPTEVDATIKAARAG 369 (491) T ss_dssp HHHHHHHHHHHHHTCCHHHHHHHHHTCCCBTT-SS-EEEEEEESSSSSSCSSEEEEEEECCCSHHHHHHHHHHHHTT T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC-CE-EEEEEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 34999999999859957887766540478874-14-89987421234237897899984678958999999999965 No 14 >2ibo_A Hypothetical protein SP2199; alpha-beta protein, structural genomics, PSI-2, protein structure initiative; 2.80A {Streptococcus pneumoniae TIGR4} SCOP: d.58.48.1 Probab=31.15 E-value=13 Score=17.05 Aligned_cols=45 Identities=13% Similarity=0.259 Sum_probs=26.4 Q ss_pred CCEEEEC--CC-CCEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 2217860--37-61024898999999999987523888131102562662 Q gi|254780660|r 195 SIKVIWK--PL-NYIRLSNADKVKSIIKMIENLEDDDDVQSVYSNLEIAD 241 (244) Q Consensus 195 ~sei~~~--P~-~~V~l~d~e~~~~~~klie~Led~DDVq~VytN~~i~e 241 (244) .+++.|. |. |.++-+-++.++-+.+..+.+.+ ++.+||+++.|+. T Consensus 31 ~sGl~~~~~pm~T~IEGe~d~vm~~vk~~~e~~~~--g~~RV~t~iKId~ 78 (104) T 2ibo_A 31 TQEVTMVVTPFETVLEGEFDELMRILKEALEVAGQ--EADNVFANVKINV 78 (104) T ss_dssp HSSSEEEECSSCEEEEEEHHHHHHHHHHHHHHHHT--SCSCEEEEEEEEE T ss_pred HCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHC--CCCEEEEEEEEEC T ss_conf 74996797688650327899999999999999977--9987999999976 No 15 >1vk8_A Hypothetical protein TM0486; protein with possible role in cell WALL biogenesis, structural genomics, JCSG, PSI, protein structure initiative; HET: UNL; 1.80A {Thermotoga maritima} SCOP: d.58.48.1 Probab=30.53 E-value=15 Score=16.70 Aligned_cols=39 Identities=10% Similarity=0.127 Sum_probs=21.2 Q ss_pred CCC-CCEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 037-61024898999999999987523888131102562662 Q gi|254780660|r 201 KPL-NYIRLSNADKVKSIIKMIENLEDDDDVQSVYSNLEIAD 241 (244) Q Consensus 201 ~P~-~~V~l~d~e~~~~~~klie~Led~DDVq~VytN~~i~e 241 (244) -|. |.++-+-++-++.+.+..+.+ ..++.+||+++.|++ T Consensus 52 ~pmgT~IEGe~dev~~~vk~~~e~~--~~g~~RV~t~iKId~ 91 (106) T 1vk8_A 52 GPSNTTVEGEFEEIMDRVKELARYL--EQFAKRFVLQLDIDY 91 (106) T ss_dssp CSSCEEEEECHHHHHHHHHHHHHHH--TTTCSEEEEEEEEEE T ss_pred CCCCCEEECCHHHHHHHHHHHHHHH--HCCCCEEEEEEEEEE T ss_conf 6980289778999999999999999--869987999999981 No 16 >2w0c_P Protein P3, protein III; virus, member of PRD1-adeno viral lineage, membrane-containing bacteriophage, virus virion, membrane; 7.00A {Pseudoalteromonas phage PM2} Probab=30.48 E-value=10 Score=17.85 Aligned_cols=19 Identities=21% Similarity=0.102 Sum_probs=15.5 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 7741126556467755788 Q gi|254780660|r 2 AGHSQFKNIMHRKERKDAL 20 (244) Q Consensus 2 aGHsKW~nIkh~K~~~D~~ 20 (244) +|-|+|+.|...|+++|.. T Consensus 25 ~~l~~WA~iEQv~A~K~S~ 43 (104) T 2w0c_P 25 ALISGWARVEQIKAAKAST 43 (104) T ss_dssp HHHHHHHHHHHHHHHHHHS T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 8877678899887765169 No 17 >3lap_A Arginine repressor; arginine repressor, DNA binding, DNA-canavanine ternary complex; HET: GGB; 2.15A {Mycobacterium tuberculosis} PDB: 3fhz_A* 3ere_D* 3laj_A* Probab=30.18 E-value=26 Score=14.89 Aligned_cols=90 Identities=8% Similarity=0.067 Sum_probs=44.8 Q ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEECCCCC------------------CCHHHHHHHHCCCCCC Q ss_conf 4463023589989876425683147886045686447279706775------------------3013667751268752 Q gi|254780660|r 101 LTDNRNRTASSIRSIFTKANGSLGSTGSTTRFFEQIGEIIYHSNIG------------------DSNLAMEVAIESDAFE 162 (244) Q Consensus 101 lTDN~nRt~~~vr~~f~K~gG~lg~~Gsv~~~F~~~G~i~~~~~~~------------------d~d~~~e~aie~ga~D 162 (244) ++.|.-+|..++...+.+.|=. .+...+|-.....|++-.....+ ....-+.-.+.-...+ T Consensus 29 I~~~~I~tQeeL~~~L~~~Gi~-vTQATiSRDlkeL~ivKv~~~~G~~~Y~~~~~~~~~~~~~~~~~~~l~~~~~~~v~~ 107 (170) T 3lap_A 29 LSSAQVRSQNELAALLAAEGIE-VTQATLSRDLEELGAVKLRGADGGTGIYVVPEDGSPVRGVSGGTDRMARLLGELLVS 107 (170) T ss_dssp HHHSCCCSHHHHHHHHHHTTCC-CCHHHHHHHHHHHTCEEECCTTCTTCEEECCC------CCTTHHHHHHHHHHHHCCE T ss_pred HHHCCCCCHHHHHHHHHHCCCC-EEHHHHHHHHHHHCCEEEECCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHEEE T ss_conf 9858978999999999975985-418998988998478786538998899975776774323343689999999977557 Q ss_pred CCCCCCCEEEEECCCCHHHHHHHHHHCCC Q ss_conf 23468825999643201345665420256 Q gi|254780660|r 163 VLFEDQEYIFYCDFNNVGLTSKKLEEKIG 191 (244) Q Consensus 163 v~~~d~~~~i~~~~~~~~~v~~~Le~~~~ 191 (244) +...+..+.|-|.|-.=..+...|....+ T Consensus 108 i~~~~nlvVIkT~pG~A~~va~~ID~~~~ 136 (170) T 3lap_A 108 TDDSGNLAVLRTPPGAAHYLASAIDRAAL 136 (170) T ss_dssp EEEETTEEEEECSTTCHHHHHHHHHHHTC T ss_pred EEEECCEEEEEECCCCHHHHHHHHHHCCC T ss_conf 75107789999379969999999974799 No 18 >1lxn_A Hypothetical protein MTH1187; hypothetical structure, structural genomics, PSI, protein structure initiative; 2.30A {Methanothermobacterthermautotrophicus} SCOP: d.58.48.1 Probab=28.09 E-value=4.1 Score=20.65 Aligned_cols=44 Identities=14% Similarity=0.428 Sum_probs=23.6 Q ss_pred CCEEEEC--CC-CCEECCCHHHHHHHHHHHHHHHCC---CCCCCCCCCCCCCC Q ss_conf 2217860--37-610248989999999999875238---88131102562662 Q gi|254780660|r 195 SIKVIWK--PL-NYIRLSNADKVKSIIKMIENLEDD---DDVQSVYSNLEIAD 241 (244) Q Consensus 195 ~sei~~~--P~-~~V~l~d~e~~~~~~klie~Led~---DDVq~VytN~~i~e 241 (244) .+++.|. |. |.++-++ .+.+..++..+.+. -++.+||+++.|++ T Consensus 31 ~sgl~y~~~pmgT~IEge~---~dev~~~ik~~~e~~~~~G~~RV~t~ikID~ 80 (99) T 1lxn_A 31 KLNVRYEISGMGTLLEAED---LDELMEAVKAAHEAVLQAGSDRVYTTLKIDD 80 (99) T ss_dssp TSSCEEEEETTEEEEEESS---HHHHHHHHHHHHHHHHHTTCSEEEEEEEEEE T ss_pred HCCCCEEECCCCCEEECCC---HHHHHHHHHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 7599837468841797797---9999999999999999769986999999770 No 19 >2cvi_A 75AA long hypothetical regulatory protein ASNC; structural genomics, unknown function; 1.50A {Pyrococcus horikoshii OT3} PDB: 2z4p_A 2e1a_A Probab=25.63 E-value=32 Score=14.34 Aligned_cols=69 Identities=13% Similarity=0.156 Sum_probs=42.9 Q ss_pred CEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCC---CEECCCHHHHHHHHHHH-HHHHCCCCCCCCCCCCCCCC Q ss_conf 259996432013456654202567422217860376---10248989999999999-87523888131102562662 Q gi|254780660|r 169 EYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLN---YIRLSNADKVKSIIKMI-ENLEDDDDVQSVYSNLEIAD 241 (244) Q Consensus 169 ~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~---~V~l~d~e~~~~~~kli-e~Led~DDVq~VytN~~i~e 241 (244) .+.+-|+|.....+...|.+.- ++.+.-.+.=+-. .+...|. +.+.+|+ +.|..+|.|.++-|-+-+.| T Consensus 5 ~i~I~~~~~~~~~v~~~l~~~p-eV~~v~~vtG~~Dli~~v~~~d~---~~l~~~v~~~l~~~~gV~~t~T~ivl~e 77 (83) T 2cvi_A 5 FILMVTAAGKEREVMEKLLAMP-EVKEAYVVYGEYDLIVKVETDTL---KDLDQFITEKIRKMPEIQMTSTMIAILE 77 (83) T ss_dssp EEEEEECTTCHHHHHHHHHTST-TEEEEEECBSSCSEEEEEEESSH---HHHHHHHHTTGGGCTTEEEEEEEEEEEE T ss_pred EEEEEECCCCHHHHHHHHHCCC-CEEEEEEEECCCCEEEEEEECCH---HHHHHHHHHHHHCCCCEEEEEEEEEEEE T ss_conf 9999977899999999997799-97999992389889999998999---9999999998635999837999999983 No 20 >3k85_A D-glycero-D-manno-heptose 1-phosphate kinase; bacteriodes thetaiotaomicron, protein structure initiative II(PSI II), nysgxrc; 2.28A {Bacteroides thetaiotaomicron} Probab=24.69 E-value=33 Score=14.22 Aligned_cols=63 Identities=13% Similarity=0.045 Sum_probs=40.8 Q ss_pred CHHHHHHHHCCCCCCCCCC----CCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCEECCC Q ss_conf 0136677512687522346----88259996432013456654202567422217860376102489 Q gi|254780660|r 148 SNLAMEVAIESDAFEVLFE----DQEYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPLNYIRLSN 210 (244) Q Consensus 148 ~d~~~e~aie~ga~Dv~~~----d~~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~~~V~l~d 210 (244) .|.+.+.+++.||.--..- .+.+.++|+++.-..+.++|++.++.+....+...+..-..+.+ T Consensus 277 ld~l~~~a~~~Ga~gaKlsGAGgGG~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~G~~~~~i~~ 343 (357) T 3k85_A 277 IQEAFDVATGAGAMAGKVSGAGGGGFIMFVVEPTRKEEVVRALNNLNGFVMPFQFIDDGAHGWKIYS 343 (357) T ss_dssp --CCSCTTTTSCCSEEEECCCC---CEEEECCHHHHHHHHHHHHTSSSCBCCCCBCCCCCEEEECCC T ss_pred HHHHHHHHHHCCCEEEEEEEECHHHEEEEEECHHHHHHHHHHHHHCCCEEEEEEECCCCCEEEEEEE T ss_conf 9999999976899699997306102899998764599999999877984999999179951899996 No 21 >2ko1_A CTR148A, GTP pyrophosphokinase; homodimer, alpha+beta, transferase, structural genomics, PSI-2, protein structure initiative; NMR {Chlorobaculum tepidum} PDB: 3ibw_A Probab=24.05 E-value=34 Score=14.14 Aligned_cols=58 Identities=14% Similarity=0.060 Sum_probs=36.3 Q ss_pred ECCCCHHHHHHHHHHCCCCCCCCEEEECCC-----CCEECCCHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 643201345665420256742221786037-----6102489899999999998752388813110 Q gi|254780660|r 174 CDFNNVGLTSKKLEEKIGEAQSIKVIWKPL-----NYIRLSNADKVKSIIKMIENLEDDDDVQSVY 234 (244) Q Consensus 174 ~~~~~~~~v~~~Le~~~~~~~~sei~~~P~-----~~V~l~d~e~~~~~~klie~Led~DDVq~Vy 234 (244) =.|--+..+...|.+.+..+.+....-... -.+++.+.+ .+.++++.|+..++|..|+ T Consensus 14 Dr~GlL~dIt~~is~~~inI~~i~~~~~~~~~~~~i~v~V~d~~---~L~~li~~L~~i~~V~~V~ 76 (88) T 2ko1_A 14 DKNGMTNQITGVISKFDTNIRTIVLNAKDGIFTCNLMIFVKNTD---KLTTLMDKLRKVQGVFTVE 76 (88) T ss_dssp CCTTHHHHHHHHHTTSSSCEEEEEEEECSSEEEEEEEEEESSHH---HHHHHHHHHTTCTTEEEEE T ss_pred CCCCHHHHHHHHHHHCCCEEEEEEEEECCCEEEEEEEEEECCHH---HHHHHHHHHHCCCCCCEEE T ss_conf 78887999999998779739999997049979999999998778---8999999997799987999 No 22 >2wbr_A GW182, gawky, LD47780P; DNA-binding protein, RRM, RBD, TNRC6A, mirnas, P-bodies, argonaute, mRNA decay; NMR {Drosophila melanogaster} Probab=23.25 E-value=33 Score=14.16 Aligned_cols=31 Identities=10% Similarity=0.178 Sum_probs=25.7 Q ss_pred ECCCCEEEEEEEECCCCHHHHHHHHHHHHHCCC Q ss_conf 505993999999446302358998987642568 Q gi|254780660|r 89 YGPEGVAIIIEALTDNRNRTASSIRSIFTKANG 121 (244) Q Consensus 89 ~gP~gvaiiVe~lTDN~nRt~~~vr~~f~K~gG 121 (244) .+|++.-|+|.=|+.+. |..++|.+|+++|- T Consensus 3 ~~~~s~~l~v~NL~~~v--t~~~L~~lf~~~G~ 33 (89) T 2wbr_A 3 MAWGSSWLLLKNLTAQI--DGPTLRTLCMQHGP 33 (89) T ss_dssp CCCCCCEEEEECCCTTC--CCHHHHHHHHHHSC T ss_pred CCCCCCEEEEECCCCCC--CHHHHHHHHHHHCC T ss_conf 88887789997999989--99999999998599 No 23 >1zl0_A Hypothetical protein PA5198; structural genomics, PSI, protein structure initiative, midwest center for structural genomics, MCSG; HET: TLA PEG; 1.10A {Pseudomonas aeruginosa} SCOP: c.8.10.1 c.23.16.7 PDB: 1zrs_A 2aum_A 2aun_A Probab=22.04 E-value=29 Score=14.55 Aligned_cols=14 Identities=0% Similarity=-0.206 Sum_probs=6.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 55788897999999 Q gi|254780660|r 16 RKDALKSKIFSKLS 29 (244) Q Consensus 16 ~~D~~k~k~f~k~~ 29 (244) ..|.+|+.-+..+- T Consensus 63 gt~~~Ra~dl~~a~ 76 (311) T 1zl0_A 63 GTVEQRLEDLHNAF 76 (311) T ss_dssp SCHHHHHHHHHHHH T ss_pred CCHHHHHHHHHHHH T ss_conf 99999999999885 No 24 >1b4a_A Arginine repressor; helix turn helix; 2.50A {Geobacillus stearothermophilus} SCOP: a.4.5.3 d.74.2.1 PDB: 1f9n_A Probab=21.89 E-value=37 Score=13.85 Aligned_cols=91 Identities=9% Similarity=-0.010 Sum_probs=48.6 Q ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEECCCCC------------CCHHHHHHHHCCCCCCCCCCCC Q ss_conf 4463023589989876425683147886045686447279706775------------3013667751268752234688 Q gi|254780660|r 101 LTDNRNRTASSIRSIFTKANGSLGSTGSTTRFFEQIGEIIYHSNIG------------DSNLAMEVAIESDAFEVLFEDQ 168 (244) Q Consensus 101 lTDN~nRt~~~vr~~f~K~gG~lg~~Gsv~~~F~~~G~i~~~~~~~------------d~d~~~e~aie~ga~Dv~~~d~ 168 (244) +..|.-+|..++...+.+.|=+ .+...+|-.....|.+-.+...+ +...-+...+.-....+...+. T Consensus 14 I~~~~I~tQeeL~~~L~~~Gi~-vTQATiSRDlkeL~~vKv~~~~G~~~y~~~~~~~~~~~~~l~~~~~~~v~~i~~~~~ 92 (149) T 1b4a_A 14 IMSNDIETQDELVDRLREAGFN-VTQATVSRDIKEMQLVKVPMANGRYKYSLPSDQRFNPLQKLKRALVDVFIKLDGTGN 92 (149) T ss_dssp HHHSCCCSHHHHHHHHHHTTCC-CCHHHHHHHHHHTTCEEEECSSSCEEEECTTCSSSCHHHHHHHHHHHHEEEEEEETT T ss_pred HHHCCCCCHHHHHHHHHHCCCC-CCHHHHHHHHHHHCCEEEECCCCCEEEEECCCCCCCHHHHHHHHHHHHEEEEECCCC T ss_conf 9858978899999999986985-408998888998398887769996899605555546689999999977268840387 Q ss_pred CEEEEECCCCHHHHHHHHHHCCCC Q ss_conf 259996432013456654202567 Q gi|254780660|r 169 EYIFYCDFNNVGLTSKKLEEKIGE 192 (244) Q Consensus 169 ~~~i~~~~~~~~~v~~~Le~~~~~ 192 (244) .+.+-|.|-.-..+...+....++ T Consensus 93 lvvikT~pG~A~~va~~ID~~~~~ 116 (149) T 1b4a_A 93 LLVLRTLPGNAHAIGVLLDNLDWD 116 (149) T ss_dssp EEEEEESTTCHHHHHHHHHHHTCT T ss_pred EEEEEECCCCHHHHHHHHHHCCCC T ss_conf 899994799589999999847998 No 25 >1lxj_A YBL001C, hypothetical 11.5KDA protein in HTB2-NTH2 intergenic region; hypothetical protein, structural genomics, PSI; 1.80A {Saccharomyces cerevisiae} SCOP: d.58.48.1 Probab=21.86 E-value=6.5 Score=19.25 Aligned_cols=52 Identities=13% Similarity=0.208 Sum_probs=27.2 Q ss_pred HHHHHHHHCCCCCCCCEEEECCC-CCEECCCHHHHHHHHHHHHHHHCC---CCCCCCCCCCCCCC Q ss_conf 45665420256742221786037-610248989999999999875238---88131102562662 Q gi|254780660|r 181 LTSKKLEEKIGEAQSIKVIWKPL-NYIRLSNADKVKSIIKMIENLEDD---DDVQSVYSNLEIAD 241 (244) Q Consensus 181 ~v~~~Le~~~~~~~~sei~~~P~-~~V~l~d~e~~~~~~klie~Led~---DDVq~VytN~~i~e 241 (244) .+...|++.|+.+ ...|. |.++-+ ...+..++..+.+. -++.+||+++.|++ T Consensus 28 ~~i~~i~~sGl~y-----~~~pmgT~IEGe----~dev~~~v~~~~e~~~~~G~~RV~t~ikID~ 83 (104) T 1lxj_A 28 LIEKKIRESPLKS-----TLHSAGTTIEGP----WDDVMGLIGEIHEYGHEKGYVRVHTDIRVGT 83 (104) T ss_dssp HHHHHHHTSSSEE-----EEETTEEEEEEE----HHHHHHHHHHHHHHHHHTTCCEEEEEEEEEC T ss_pred HHHHHHHHCCCCE-----EECCCCCEEECC----HHHHHHHHHHHHHHHHHCCCCEEEEEEEEEC T ss_conf 9999999759976-----766983579778----9999999999999999769985999999861 No 26 >2cve_A Hypothetical protein TTHA1053; COG1739, UPF0029, structural genomics, riken structural genomics/proteomics initiative, RSGI; HET: TLA; 1.60A {Thermus thermophilus HB8} SCOP: d.14.1.11 d.58.11.2 Probab=21.69 E-value=37 Score=13.83 Aligned_cols=52 Identities=15% Similarity=0.144 Sum_probs=33.9 Q ss_pred CCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCC--CCEECCCHHHHHHHHHHH Q ss_conf 8825999643201345665420256742221786037--610248989999999999 Q gi|254780660|r 167 DQEYIFYCDFNNVGLTSKKLEEKIGEAQSIKVIWKPL--NYIRLSNADKVKSIIKMI 221 (244) Q Consensus 167 d~~~~i~~~~~~~~~v~~~Le~~~~~~~~sei~~~P~--~~V~l~d~e~~~~~~kli 221 (244) -..+.|.|++..+..+...|++.++.+.+. .|-.. -.+.++ .++.+.+.+.+ T Consensus 127 ~~~~~i~~~y~~~~~v~~~l~~~~~~i~~~--~~~~~V~~~~~v~-~~~~~~~~~~l 180 (191) T 2cve_A 127 RVGLAFLVPFAEVGRVYALLEARALKAEET--YTPEGVRFALLLP-KPEREGFLRAL 180 (191) T ss_dssp EEEEEEEECGGGHHHHHHHHHHTTCCEEEE--EETTEEEEEEEEE-HHHHHHHHHHH T ss_pred EEEEEEEECHHHHHHHHHHHHHCCCEEEEE--EECCCEEEEEEEC-HHHHHHHHHHH T ss_conf 888999986899899999998779789989--8069889999987-99999999999 No 27 >3m16_A Transaldolase; dimer, molecular replac swiss-model, structural genomics, PSI-2, protein structure initiative; 2.79A {Oleispira antarctica} Probab=21.08 E-value=30 Score=14.54 Aligned_cols=76 Identities=21% Similarity=0.173 Sum_probs=40.4 Q ss_pred CC-CHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-CCCCCCEEEEE---------------EECCCCEEEEEEE-E Q ss_conf 99-94148999999999996688878999999850377-66664023678---------------6505993999999-4 Q gi|254780660|r 40 GQ-NPLENPRLRLAIQNAKNQSMPKENIERAIKKAGSD-DLGNYTNIRYE---------------GYGPEGVAIIIEA-L 101 (244) Q Consensus 40 G~-dp~~N~~L~~ai~~Ak~~~mPk~~Ie~aIkk~~~~-~~~~~~~~~yE---------------g~gP~gvaiiVe~-l 101 (244) +| |-.+||.| +.+|-+..--++.|+.++..+... ....+.+.+++ ...||-|.+=|.. | T Consensus 33 ~p~daTTNPSi---i~ka~~~~~y~~li~~~i~~~~~~~~~~~~~~~~~~~i~d~~~v~~~~ei~~~~~G~VS~EVdp~l 109 (329) T 3m16_A 33 KPVDATTNPSL---ILKAAKLANYQHLIEEAIDWALQIKGNDKNSQTTLENVGDKLAVNIGCEVLTSIPGVISTEVDARL 109 (329) T ss_dssp CCSEEECCHHH---HHHHHTCGGGHHHHHHHHHHHHHHCCC-CCTTHHHHHHHHHHHHHHHHHHHTTCSSCEEEECCGGG T ss_pred CCCCCCCCHHH---HHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHH T ss_conf 99984653899---998526915699999999998750245420677899999999999999996378987688865012 Q ss_pred CCCCHHHHHHHHHHHHH Q ss_conf 46302358998987642 Q gi|254780660|r 102 TDNRNRTASSIRSIFTK 118 (244) Q Consensus 102 TDN~nRt~~~vr~~f~K 118 (244) ..+...|+.+-|.+++- T Consensus 110 s~dt~~~i~~A~~l~~l 126 (329) T 3m16_A 110 SFDTQATVAKARKLIRL 126 (329) T ss_dssp TTCHHHHHHHHHHHHHH T ss_pred CCCHHHHHHHHHHHHHH T ss_conf 16888999999999999 No 28 >2vse_A MTX holotoxin, mosquitocidal toxin; ADP-ribosyltransferase, lectin, ricin-B-like domain; 2.5A {Lysinibacillus sphaericus} PDB: 2vsa_A Probab=20.59 E-value=32 Score=14.28 Aligned_cols=10 Identities=50% Similarity=1.122 Sum_probs=4.6 Q ss_pred EEEEECCCCE Q ss_conf 6786505993 Q gi|254780660|r 85 RYEGYGPEGV 94 (244) Q Consensus 85 ~yEg~gP~gv 94 (244) .||+|+|||+ T Consensus 141 ~~~~~~~~~~ 150 (841) T 2vse_A 141 RYEIFAPGGI 150 (841) T ss_dssp EEEECCSCCE T ss_pred EEEEECCCCC T ss_conf 9997458871 No 29 >1t95_A Hypothetical protein AF0491; shwachman-bodian-diamond syndrome protein orthologue, unknown function; 1.90A {Archaeoglobus fulgidus} SCOP: a.5.8.1 d.235.1.2 d.58.11.3 PDB: 1p9q_C Probab=20.35 E-value=38 Score=13.75 Aligned_cols=43 Identities=21% Similarity=0.326 Sum_probs=25.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 8889799999999999998189994148999999999996688878999999850 Q gi|254780660|r 19 ALKSKIFSKLSREITVSAKLSGQNPLENPRLRLAIQNAKNQSMPKENIERAIKKA 73 (244) Q Consensus 19 ~~k~k~f~k~~keI~~A~k~gG~dp~~N~~L~~ai~~Ak~~~mPk~~Ie~aIkk~ 73 (244) .+|-...-..-|+|.--+...--||.+| ...|...||+|++.+ T Consensus 96 ~eR~~~~e~k~k~Ii~~I~~~~VnP~T~------------~P~p~~rIE~Al~e~ 138 (240) T 1t95_A 96 EQRREMLEAKRKQIINFISRNTIDPRTN------------APHPPSRIERALEEA 138 (240) T ss_dssp HHHHHHHHHHHHHHHHHHHHHEEBTTTT------------BCCCHHHHHHHHHHT T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCC------------CCCCHHHHHHHHHHC T ss_conf 9999999999999999999972298889------------989889999999957 No 30 >1fp1_D Isoliquiritigenin 2'-O-methyltransferase; protein-substrate, protein-product complex; HET: SAH HCC; 1.82A {Medicago sativa} SCOP: a.4.5.29 c.66.1.12 PDB: 1fpq_A* Probab=20.02 E-value=28 Score=14.71 Aligned_cols=10 Identities=10% Similarity=0.096 Sum_probs=4.5 Q ss_pred HHHHHHHHCC Q ss_conf 9999998189 Q gi|254780660|r 31 EITVSAKLSG 40 (244) Q Consensus 31 eI~~A~k~gG 40 (244) -+..|+++|= T Consensus 40 ~L~aAveLgl 49 (372) T 1fp1_D 40 VLNAAIDLNL 49 (372) T ss_dssp HHHHHHHTTH T ss_pred HHHHHHHCCH T ss_conf 9999998794 Done!