Query gi|254780291|ref|YP_003064704.1| hypothetical protein CLIBASIA_00880 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 475 No_of_seqs 13 out of 15 Neff 2.2 Searched_HMMs 23785 Date Tue May 24 15:04:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780291.hhm -d /home/congqian_1/database/pdb/pdb70.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 2nxf_A Putative dimetal phosph 99.4 9E-12 3.8E-16 96.1 13.2 254 161-455 7-314 (322) 2 3ib7_A ICC protein; metallopho 99.2 1.1E-09 4.8E-14 82.2 13.4 197 177-456 53-280 (330) 3 1ute_A Protein (II purple acid 99.2 3E-10 1.3E-14 86.0 10.4 236 162-457 9-293 (313) 4 2qfp_A Purple acid phosphatase 99.1 2.4E-09 1E-13 80.0 13.0 159 185-409 143-317 (424) 5 1xzw_A Purple acid phosphatase 99.1 6.3E-09 2.6E-13 77.3 14.8 180 162-410 129-325 (426) 6 3d03_A Phosphohydrolase; glyce 99.0 3.7E-09 1.6E-13 78.8 11.2 177 177-432 28-215 (274) 7 2yvt_A Hypothetical protein AQ 98.5 1.7E-06 7E-11 61.3 11.6 217 160-452 6-259 (260) 8 1uf3_A Hypothetical protein TT 98.5 6.6E-06 2.8E-10 57.4 13.2 216 160-450 6-225 (228) 9 1ii7_A MRE11 nuclease; RAD50, 97.3 0.00016 6.7E-09 48.2 3.5 68 168-235 19-92 (333) 10 2q8u_A Exonuclease, putative; 97.1 0.002 8.4E-08 41.0 7.8 196 177-445 49-269 (336) 11 2a22_A Vacuolar protein sortin 85.3 1.8 7.6E-05 21.5 7.4 63 391-458 133-205 (215) 12 1z2w_A Vacuolar protein sortin 84.6 1.9 8.1E-05 21.3 6.7 61 392-457 110-180 (192) 13 1su1_A Hypothetical protein YF 81.9 2.5 0.0001 20.6 5.5 66 162-230 28-100 (208) 14 2kkn_A Uncharacterized protein 74.4 3.2 0.00014 19.9 4.1 50 397-451 126-175 (178) 15 1nnw_A Hypothetical protein; s 64.9 6.4 0.00027 17.9 5.0 47 398-449 166-219 (252) 16 3ck2_A Conserved uncharacteriz 58.4 8.3 0.00035 17.1 7.0 57 394-455 100-163 (176) 17 1v77_A PH1877P, hypothetical p 55.7 7.2 0.0003 17.6 2.9 80 157-240 93-179 (212) 18 1s3l_A Hypothetical protein MJ 47.0 13 0.00053 15.9 4.8 60 162-230 28-87 (190) 19 1wb9_A DNA mismatch repair pro 37.4 18 0.00074 15.0 3.5 53 358-413 677-734 (800) 20 1wgo_A VPS10 domain-containing 37.1 8.9 0.00037 16.9 1.0 13 298-310 61-73 (123) 21 3f9u_A Putative exported cytoc 35.6 10 0.00042 16.6 1.0 22 357-378 35-56 (172) 22 2o8b_B DNA mismatch repair pro 32.7 21 0.00087 14.5 4.3 70 357-432 858-932 (1022) 23 3fk8_A Disulphide isomerase; A 32.5 19 0.00078 14.8 2.0 30 352-381 12-43 (133) 24 2kzw_A Uncharacterized protein 29.7 16 0.00067 15.3 1.3 14 296-309 86-99 (145) 25 3jqu_A Collagenase; PKD, beta 26.7 25 0.0011 13.9 1.8 12 299-310 35-46 (87) 26 1qwz_A NPQTN specific sortase 26.5 18 0.00074 15.0 1.0 18 216-233 72-89 (235) 27 1egz_A Endoglucanase Z, EGZ, C 23.9 29 0.0012 13.5 4.9 51 356-406 200-255 (291) 28 2o8b_A DNA mismatch repair pro 23.8 30 0.0012 13.5 4.8 54 357-413 731-789 (934) 29 2w70_A Biotin carboxylase; lig 22.3 32 0.0013 13.3 1.9 65 155-223 93-158 (449) 30 3mx1_A ECO29KIR; type II restr 22.2 26 0.0011 13.9 1.2 53 179-234 128-190 (235) 31 1sen_A Thioredoxin-like protei 22.0 27 0.0011 13.8 1.2 21 357-378 35-55 (164) 32 1w9c_A CRM1 protein, exportin 21.6 16 0.00069 15.2 0.1 17 349-365 263-279 (321) 33 1b4r_A Protein (PKD1_human); P 21.6 26 0.0011 13.9 1.1 12 297-308 25-36 (80) 34 1t71_A Phosphatase, conserved; 21.4 33 0.0014 13.2 2.5 70 163-233 8-77 (281) 35 1jni_A NAPB;, diheme cytochrom 21.4 10 0.00044 16.5 -1.0 26 138-163 40-66 (123) 36 1ogy_B Diheme cytochrome C NAP 20.4 13 0.00055 15.8 -0.6 26 138-163 41-67 (130) No 1 >2nxf_A Putative dimetal phosphatase; dinuclear metal center phosphatase, metalloprotein, metallophosphoesterase, protein structure initiative; 1.70A {Danio rerio} SCOP: d.159.1.12 Probab=99.39 E-value=9e-12 Score=96.06 Aligned_cols=254 Identities=12% Similarity=0.111 Sum_probs=132.1 Q ss_pred CEEEEECCCEE-CC----------------CHHHHHHHHHHHHCCCEEEEEEECCCCCC------CCHHHHHHHHHHH-C Q ss_conf 51555133100-46----------------31468899988611656789760510012------1267788888672-6 Q gi|254780291|r 161 GIAVIADPWYK-AD----------------TPMFVEAINSLKSSKNIILGILTGDMTQS------STTKELKRFYNIY-S 216 (475) Q Consensus 161 ~~~~~~~pw~k-~~----------------~~~~vesinsl~~~~~~~~gIINGDLTeF------G~q~qL~eFr~Vw-n 216 (475) -++||+||-|. .+ ...+.++|..+. ..++.|-|++|||++- +...+++++...+ + T Consensus 7 ~f~~isD~h~~~~~~~~~~~~~~~~~~~~s~~~l~~ai~~~~-~~~~dfVv~~GDl~d~~~~~~~~~~~~~~~~~~~~~~ 85 (322) T 2nxf_A 7 TFGLIADVQYADIEDGENYLRTRRRYYRGSADLLRDAVLQWR-RERVQCVVQLGDIIDGHNRRRDASDRALDTVMAELDA 85 (322) T ss_dssp EEEEECCCCBCSSCCEECTTSSSEECTTHHHHHHHHHHHHHH-HTTCSEEEECSCCBCTHHHHTTCHHHHHHHHHHHHHT T ss_pred EEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH-CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 999983178777776523344303456666999999999973-3799999999987889885236899999999999986 Q ss_pred CCCCEEECCCCCCCCCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEE Q ss_conf 55751302455444788-87667454550000036888999999998776555666542100100134445655210377 Q gi|254780291|r 217 LKFPFFRGLGSQEYIGN-RPCRDPYTLTPSIYGCAFIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISI 295 (475) Q Consensus 217 l~iPv~lGLGNHDYqNN-~dC~~P~~~d~s~n~CA~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei 295 (475) +++|++...||||+... .++........ +..+..+.. ...... T Consensus 86 ~~~p~~~~~GNHD~~~~~~~~~~~~~~~~------------~~~~~~~~~----------~~~~~~-------------- 129 (322) T 2nxf_A 86 CSVDVHHVWGNHEFYNFSRPSLLSSRLNS------------AQRTGTDTG----------SDLIGD-------------- 129 (322) T ss_dssp TCSEEEECCCHHHHHHCCHHHHHTSTTCC------------CC------C----------EECGGG-------------- T ss_pred CCCCEEEECCCCCCCCCCCHHHCCCCCCC------------HHHHCCCCC----------CCCCCC-------------- T ss_conf 59978982687752223300000012341------------222112233----------222577-------------- Q ss_pred EEEEEEEEECCCEEEEEECCCCEEEEECCCCCC-----HHHHHHHHHCCCCCCEE--------EEEE----CCCCHHHHH Q ss_conf 754433430273355540364103675167530-----01222210002565113--------4320----565577887 Q gi|254780291|r 296 SGSQSYSWNIDNVHFIQANYSMFHSVYFNDEWS-----NIFTVAVPEHISKQDLP--------SHVS----NGSEISQWI 358 (475) Q Consensus 296 ~GSLSYSWd~gdvHfVQ~NYpsy~~V~~n~~~~-----~af~Iqv~n~Id~q~l~--------s~Vn----~~~~iskWL 358 (475) .+...+.-..++.+++-++-..+.......... -.+.......+.....+ .... ....-..|| T Consensus 130 ~~~~~~~~~~~~~~~i~ld~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Q~~wl 209 (322) T 2nxf_A 130 DIYAYEFSPAPNFRFVLLDAYDLSVIGREEESEKHTHSWRILTQHNHNLQDLNLPPVSVGLEQRFVKFNGGFSEQQLQWL 209 (322) T ss_dssp TCCCEEEEEETTEEEEECCTTSBCSSSSCTTSHHHHHHHHHHHHHCCCTTCTTSCSCSSSGGGGCSTTCCBCCHHHHHHH T ss_pred CCCEEEEECCCCEEEEEECCCCCCCCCCCCCCCCHHCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 87404764179769999657543444566565210011000012466621003553223555554444556889999999 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCC------HHHHHHHHHHHHHHHC-CCCEEEEEEEECCCCCCCCCCCCCEEEEEEECC Q ss_conf 888884034795799950047720------0389999999999858-961799986423676524655531013897045 Q gi|254780291|r 359 RDDVFQAQREGKYIILFADDIDRF------SSIDQKRMFEKFLTQS-KISTIFTTRFTSSPESYIKDSTGRPVRVYNINK 431 (475) Q Consensus 359 ~~DL~~A~~~gK~IILN~HD~~~~------sSi~~KrmFks~itk~-nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGS 431 (475) ++.|..+++.++++|+.+|.-... .....-..+..++.++ +|.++|.||.|... +...+.| ++++..++ T Consensus 210 ~~~L~~~~~~~~~~iv~~H~p~~~~~~~~~~~~~~~~~~~~~l~~~~~V~~v~~GH~H~~~--~~~~~~g--i~~v~~~~ 285 (322) T 2nxf_A 210 DAVLTLSDHKQERVLIFSHLPVHPCAADPICLAWNHEAVLSVLRSHQSVLCFIAGHDHDGG--RCTDSSG--AQHITLEG 285 (322) T ss_dssp HHHHHHHHHHTCEEEEEESSCCCTTSSCGGGSCTTHHHHHHHHHTCTTEEEEEECSCTTCE--EEECTTS--CEEEECCC T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCC--EEECCCC--CEEEECCE T ss_conf 9999862642874899992189777788763113499999999748984699947747886--3344489--76997780 Q ss_pred CCC-----CCEEEEEECCCEEEEEEEECC Q ss_conf 646-----718999964885999998516 Q gi|254780291|r 432 NSK-----NEFILLEMTPHYINVTAYERR 455 (475) Q Consensus 432 vpk-----NrF~~lem~~~~atITgY~~R 455 (475) +.. +.|..+|+.+-.++|+||-+- T Consensus 286 ~~~~~~~~~~~~~~~~~~d~~~i~~~~~~ 314 (322) T 2nxf_A 286 VIETPPHSHAFATAYLYEDRMVMKGRGRV 314 (322) T ss_dssp GGGCCTTSCEEEEEEECSSEEEEEEEETS T ss_pred ECCCCCCCCCEEEEEEECCEEEEEEEEEE T ss_conf 20268999988999998999999989876 No 2 >3ib7_A ICC protein; metallophosphoesterase, alpha-beta fold, swapped-dimer, hydrolase; HET: BTB; 1.60A {Mycobacterium tuberculosis} PDB: 3ib8_A* 2hy1_A 2hyp_A 2hyo_A Probab=99.15 E-value=1.1e-09 Score=82.20 Aligned_cols=197 Identities=11% Similarity=0.135 Sum_probs=123.3 Q ss_pred HHHHHHHHHH-CCCEEEEEEECCCCCCCCHHHHHHHHHHH-----CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 6889998861-16567897605100121267788888672-----65575130245544478887667454550000036 Q gi|254780291|r 177 FVEAINSLKS-SKNIILGILTGDMTQSSTTKELKRFYNIY-----SLKFPFFRGLGSQEYIGNRPCRDPYTLTPSIYGCA 250 (475) Q Consensus 177 ~vesinsl~~-~~~~~~gIINGDLTeFG~q~qL~eFr~Vw-----nl~iPv~lGLGNHDYqNN~dC~~P~~~d~s~n~CA 250 (475) |.+.+..+.+ ..+.-+-|++||||+.|...++++|++.. .+++|++.=.||||+..+ T Consensus 53 l~~~l~~i~~~~~~pD~vvitGDl~~~g~~~~y~~~~~~l~~~~~~~~~pv~~v~GNHD~~~~----------------- 115 (330) T 3ib7_A 53 LGELLEQLNQSGLRPDAIVFTGDLADKGEPAAYRKLRGLVEPFAAQLGAELVWVMGNHDDRAE----------------- 115 (330) T ss_dssp HHHHHHHHHHHTCCCSEEEECSCCBTTCCHHHHHHHHHHHHHHHHHHTCEEEECCCTTSCHHH----------------- T ss_pred HHHHHHHHHHCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHH----------------- T ss_conf 999999998229899999989877899999999999999999875249977995787764455----------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEECCCCEEEEECCCCCCHH Q ss_conf 88899999999877655566654210010013444565521037775443343027335554036410367516753001 Q gi|254780291|r 251 FIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSWNIDNVHFIQANYSMFHSVYFNDEWSNI 330 (475) Q Consensus 251 ~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSWd~gdvHfVQ~NYpsy~~V~~n~~~~~a 330 (475) |-++..++ .. -.+...|..++++.+++.++-..+.. ..+.. T Consensus 116 ------~~~~~~~~----------------~~------------~~~~~~~~~~~~~~~~~~ldt~~~~~---~~G~~-- 156 (330) T 3ib7_A 116 ------LRKFLLDE----------------AP------------SMAPLDRVCMIDGLRIIVLDTSVPGH---HHGEI-- 156 (330) T ss_dssp ------HHHHHHCC----------------CC------------CCSCCCEEEEETTEEEEECCCCCTTC---CSBCC-- T ss_pred ------HHHHHCCC----------------CC------------CCCCCEEEEEECCCEEEECCCCCCCC---CCCCC-- T ss_conf ------54431013----------------66------------66763047870783266436666787---68855-- Q ss_pred HHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCC------HHH--HHHHHHHHHHHHCCCCE Q ss_conf 2222100025651134320565577887888884034795799950047720------038--99999999998589617 Q gi|254780291|r 331 FTVAVPEHISKQDLPSHVSNGSEISQWIRDDVFQAQREGKYIILFADDIDRF------SSI--DQKRMFEKFLTQSKIST 402 (475) Q Consensus 331 f~Iqv~n~Id~q~l~s~Vn~~~~iskWL~~DL~~A~~~gK~IILN~HD~~~~------sSi--~~KrmFks~itk~nV~A 402 (475) ..+--+||++.|.++ .++++|+.+|-.+.. .++ .....|..++++++|.+ T Consensus 157 --------------------~~~ql~wL~~~L~~~--~~~~~iv~~HHpp~~~~~~~~~~~~~~~~~~l~~ll~~~~v~~ 214 (330) T 3ib7_A 157 --------------------RASQLGWLAEELATP--APDGTILALHHPPIPSVLDMAVTVELRDQAALGRVLRGTDVRA 214 (330) T ss_dssp --------------------CHHHHHHHHHHTTSC--CTTCEEEECSSCSSCCSSGGGGGGSBSCHHHHHHHHTTSSEEE T ss_pred --------------------CHHHHHHHHHHHHHC--CCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCE T ss_conf --------------------999999999988647--7887699981698567775334433444799999997469819 Q ss_pred EEEEEEECCCCCCCCCCCCCEEEEEEECCCC-----------------CCCEEEEEECCCEEEEEEEECCC Q ss_conf 9998642367652465553101389704564-----------------67189999648859999985167 Q gi|254780291|r 403 IFTTRFTSSPESYIKDSTGRPVRVYNINKNS-----------------KNEFILLEMTPHYINVTAYERRG 456 (475) Q Consensus 403 IF~aH~Hqshe~~lec~~GykVPvy~iGSvp-----------------kNrF~~lem~~~~atITgY~~Rd 456 (475) ||.||.|....... ..+|++..+|.. ..-|..+++....+..+.+-..+ T Consensus 215 vl~GH~H~~~~~~~-----~Gi~~~~~pst~~~~~~~~~~~~~~~~~~~~g~~~i~~~~d~~v~~~vp~~~ 280 (330) T 3ib7_A 215 ILAGHLHYSTNATF-----VGIPVSVASATCYTQDLTVAAGGTRGRDGAQGCNLVHVYPDTVVHSVIPLGG 280 (330) T ss_dssp EEECSSSSCEEEEE-----TTEEEEECCCSSCEECTTSCTTCCCEESCSCEEEEEEECSSCEEEEEEECSC T ss_pred EEECCCCCCCEEEE-----CCEEEEEECCCHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECCC T ss_conf 99777880493799-----9999999696253266777888755556899759999959977999997289 No 3 >1ute_A Protein (II purple acid phosphatase); tartrate resistant acid phosphatase, metalloenzyme, uteroferrin, hydrolase; HET: NAG; 1.55A {Sus scrofa} SCOP: d.159.1.1 PDB: 1war_A* 2bq8_X 1qfc_A* 1qhw_A* Probab=99.15 E-value=3e-10 Score=85.99 Aligned_cols=236 Identities=17% Similarity=0.167 Sum_probs=123.1 Q ss_pred EEEEEC----CCEECCCHH---HHHHHHHHHHCCCEEEEEEECCCC-CCCCHHHH-HHHHHHH--------CCCCCEEEC Q ss_conf 155513----310046314---688999886116567897605100-12126778-8888672--------655751302 Q gi|254780291|r 162 IAVIAD----PWYKADTPM---FVEAINSLKSSKNIILGILTGDMT-QSSTTKEL-KRFYNIY--------SLKFPFFRG 224 (475) Q Consensus 162 ~~~~~~----pw~k~~~~~---~vesinsl~~~~~~~~gIINGDLT-eFG~q~qL-~eFr~Vw--------nl~iPv~lG 224 (475) ++|++| +.....++. ..++++......+..|-|..||+. +.|...+. .+|.+.| ...+|+|.- T Consensus 9 f~v~gD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~pdfvl~~GD~~y~~g~~~~~~~~~~~~~~~~~~~~~~~~~P~~~~ 88 (313) T 1ute_A 9 FVAVGDWGGVPNAPFHTAREMANAKAIATTVKTLGADFILSLGDNFYFTGVHDAKDKRFQETFEDVFSDPSLRNVPWHVL 88 (313) T ss_dssp EEEECSCCCCSSTTSSCHHHHHHHHHHHHHHHHHCCSEEEECSCCSTTTCCSSTTCTHHHHHTTTTSCSGGGTTCCEEEC T ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99994699999754451789999999999986369989998998877788875217999999998753356518987984 Q ss_pred CCCCCCCCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEE Q ss_conf 455444788-8766745455000003688899999999877655566654210010013444565521037775443343 Q gi|254780291|r 225 LGSQEYIGN-RPCRDPYTLTPSIYGCAFIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSW 303 (475) Q Consensus 225 LGNHDYqNN-~dC~~P~~~d~s~n~CA~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSW 303 (475) +|||||.+| ..+. .+. . ....++.... .++..|.- T Consensus 89 ~GNHd~~~~~~~~~-------------------------~~~-~--~~~~~~~~~~----------------~y~~~~~~ 124 (313) T 1ute_A 89 AGNHDHLGNVSAQI-------------------------AYS-K--ISKRWNFPSP----------------YYRLRFKI 124 (313) T ss_dssp CCHHHHHSCHHHHH-------------------------HGG-G--TSTTEECCSS----------------SEEEEEEC T ss_pred CCCCCCCCCCCCCC-------------------------CHH-H--CCCCCCCCCC----------------CCCEECCC T ss_conf 47656667632220-------------------------112-1--1545568887----------------64401046 Q ss_pred EC--CCEEEEEECCCCEEEEECCCCCCHHHHHHHHHCCCC-CCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEEECCCCC Q ss_conf 02--733555403641036751675300122221000256-511343205655778878888840347957999500477 Q gi|254780291|r 304 NI--DNVHFIQANYSMFHSVYFNDEWSNIFTVAVPEHISK-QDLPSHVSNGSEISQWIRDDVFQAQREGKYIILFADDID 380 (475) Q Consensus 304 d~--gdvHfVQ~NYpsy~~V~~n~~~~~af~Iqv~n~Id~-q~l~s~Vn~~~~iskWL~~DL~~A~~~gK~IILN~HD~~ 380 (475) .. +++.++.+++..|.... ...... ...+.......+-.+||+++|... ..+.+++.+|.-. T Consensus 125 ~~~~~~~~~~~~d~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~Q~~WL~~~L~~~--~~~~~iv~~hhp~ 189 (313) T 1ute_A 125 PRSNVSVAIFMLDTVTLCGNS-------------DDFVSQQPERPRNLALARTQLAWIKKQLAAA--KEDYVLVAGHYPV 189 (313) T ss_dssp TTSSCEEEEEECCHHHHHCCG-------------GGSTTCSCCSCSCHHHHHHHHHHHHHHHHHC--CCSEEEEECSSCS T ss_pred CCCCCCEEEEECCCEEEEECC-------------CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHC--CCCCEEEEEECCC T ss_conf 788970699981555687303-------------4444456666434223999999999999726--3567389982376 Q ss_pred -----CCHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCC-----------------------CCCCEEEEEEECCC Q ss_conf -----20038999999999985896179998642367652465-----------------------55310138970456 Q gi|254780291|r 381 -----RFSSIDQKRMFEKFLTQSKISTIFTTRFTSSPESYIKD-----------------------STGRPVRVYNINKN 432 (475) Q Consensus 381 -----~~sSi~~KrmFks~itk~nV~AIF~aH~Hqshe~~lec-----------------------~~GykVPvy~iGSv 432 (475) ...+-.....|..++.+++|.++|.||.|. ++++... .-.+.+++...... T Consensus 190 ~~~~~~~~~~~~~~~~~~ll~~~~V~~~~~GH~H~-~~~~~~~~~~~~~~~g~gg~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (313) T 1ute_A 190 WSIAEHGPTHCLVKQLLPLLTTHKVTAYLCGHDHN-LQYLQDENGLGFVLSGAGNFMDPSKKHLRKVPNGYLRFHFGAEN 268 (313) T ss_dssp SCCSSSCCCHHHHHHTHHHHHHTTCSEEEECSSSS-EEEEECTTCCEEEEECBSSCCCCCCTTGGGSCTTCEEEEECCTT T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCC-EEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEECCCC T ss_conf 44466787477889999999976944999588642-16774489978999698856678876566687533432350467 Q ss_pred CCCCEEEEEECCCEEEEEEEECCCC Q ss_conf 4671899996488599999851676 Q gi|254780291|r 433 SKNEFILLEMTPHYINVTAYERRGK 457 (475) Q Consensus 433 pkNrF~~lem~~~~atITgY~~Rd~ 457 (475) ...-|..+++++.+++++.|..-+. T Consensus 269 ~~~gf~~i~v~~~~l~~~~~~~~G~ 293 (313) T 1ute_A 269 SLGGFAYVEITPKEMSVTYIEASGK 293 (313) T ss_dssp SCCEEEEEEECSSCEEEEEEETTSC T ss_pred CCCEEEEEEEECCEEEEEEECCCCC T ss_conf 8663999999899999999949998 No 4 >2qfp_A Purple acid phosphatase; binuclear, Fe-Zn, hydrolase; HET: NAG NDG; 2.20A {Phaseolus vulgaris} SCOP: b.1.12.1 d.159.1.1 PDB: 2qfr_A* 1kbp_A* 3kbp_A* 4kbp_A* Probab=99.10 E-value=2.4e-09 Score=80.02 Aligned_cols=159 Identities=12% Similarity=0.183 Sum_probs=96.3 Q ss_pred HHCCCEEEEEEECCCCC-----CCCHHHHHHHHHHH---CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 61165678976051001-----21267788888672---65575130245544478887667454550000036888999 Q gi|254780291|r 185 KSSKNIILGILTGDMTQ-----SSTTKELKRFYNIY---SLKFPFFRGLGSQEYIGNRPCRDPYTLTPSIYGCAFIAIND 256 (475) Q Consensus 185 ~~~~~~~~gIINGDLTe-----FG~q~qL~eFr~Vw---nl~iPv~lGLGNHDYqNN~dC~~P~~~d~s~n~CA~sav~~ 256 (475) ....+..|-+..||+.. ...+.+.++|.+.. .-.+|++..+|||||..+..-...+.+ - T Consensus 143 ~~~~~pdfvl~~GD~~y~d~~~~~~~~~wd~~~~~~~~~~~~iP~~~~~GNHE~~~~~~~~~~~~~------~------- 209 (424) T 2qfp_A 143 LSPKKGQTVLFVGDLSYADRYPNHDNVRWDTWGRFTERSVAYQPWIWTAGNHEIEFAPEINETEPF------K------- 209 (424) T ss_dssp TCSSCCCEEEECSCCSCGGGSGGGCTHHHHHHHHHHHHHHTTSCEEECCCHHHHCCBGGGTBCSTT------H------- T ss_pred HCCCCCCEEEEECCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCC------E------- T ss_conf 655687659980437650555663036888999888788722965870676532257655667464------0------- Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEECCCCEEEEECCCCCCHHHHHHHH Q ss_conf 99999877655566654210010013444565521037775443343027335554036410367516753001222210 Q gi|254780291|r 257 ISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSWNIDNVHFIQANYSMFHSVYFNDEWSNIFTVAVP 336 (475) Q Consensus 257 M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSWd~gdvHfVQ~NYpsy~~V~~n~~~~~af~Iqv~ 336 (475) .|... |+.-. +.. .-.+..-||.++|++||+.++--.. +.. T Consensus 210 ------~~~~~------f~~p~-----~~~-------~~~~~~yYsfd~G~v~fi~lds~~~----~~~----------- 250 (424) T 2qfp_A 210 ------PFSYR------YHVPY-----EAS-------QSTSPFWYSIKRASAHIIVLSSYSA----YGR----------- 250 (424) T ss_dssp ------HHHHH------CCCCG-----GGG-------TCSSTTSEEEEETTEEEEECCTTSC----CST----------- T ss_pred ------EEECC------CCCCC-----CCC-------CCCCCCEEEEEECCEEEEECCCCCC----CCC----------- T ss_conf ------44211------35776-----666-------7888756999878888996147446----775----------- Q ss_pred HCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC-EEEEECCCC---CCC----HHHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 0025651134320565577887888884034795-799950047---720----03899999999998589617999864 Q gi|254780291|r 337 EHISKQDLPSHVSNGSEISQWIRDDVFQAQREGK-YIILFADDI---DRF----SSIDQKRMFEKFLTQSKISTIFTTRF 408 (475) Q Consensus 337 n~Id~q~l~s~Vn~~~~iskWL~~DL~~A~~~gK-~IILN~HD~---~~~----sSi~~KrmFks~itk~nV~AIF~aH~ 408 (475) +.+--+||++||.++++... -+|+.+|.- ..- .....+..|+.++.+++|..||.||. T Consensus 251 --------------~~~Q~~WL~~dL~~~~r~~~~w~iv~~H~P~ys~~~~~~~~~~~~r~~~~~lf~ky~Vdlvl~GH~ 316 (424) T 2qfp_A 251 --------------GTPQYTWLKKELRKVKRSETPWLIVLMHSPLYNSYNHHFMEGEAMRTKFEAWFVKYKVDVVFAGHV 316 (424) T ss_dssp --------------TSHHHHHHHHHHHHCCTTTCCEEEEECSSCSSCCBSTTTTTTHHHHHHHHHHHHHTTCSEEEECSS T ss_pred --------------CCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCC T ss_conf --------------459999999998753125798699996888654587887666789999999999719759996985 Q ss_pred E Q ss_conf 2 Q gi|254780291|r 409 T 409 (475) Q Consensus 409 H 409 (475) | T Consensus 317 H 317 (424) T 2qfp_A 317 H 317 (424) T ss_dssp S T ss_pred C T ss_conf 6 No 5 >1xzw_A Purple acid phosphatase; hydrolase; HET: NAG FUC MAN; 2.50A {Ipomoea batatas} SCOP: b.1.12.1 d.159.1.1 Probab=99.09 E-value=6.3e-09 Score=77.32 Aligned_cols=180 Identities=13% Similarity=0.175 Sum_probs=102.6 Q ss_pred EEEEECCCEECCCHHHHHHHHHH-HHCCCEEEEEEECCCCCC-C----CHHHHHHHHHHH---CCCCCEEECCCCCCCCC Q ss_conf 15551331004631468899988-611656789760510012-1----267788888672---65575130245544478 Q gi|254780291|r 162 IAVIADPWYKADTPMFVEAINSL-KSSKNIILGILTGDMTQS-S----TTKELKRFYNIY---SLKFPFFRGLGSQEYIG 232 (475) Q Consensus 162 ~~~~~~pw~k~~~~~~vesinsl-~~~~~~~~gIINGDLTeF-G----~q~qL~eFr~Vw---nl~iPv~lGLGNHDYqN 232 (475) +++++|.=...+. ...+..+ ....+..|-|..||+.-- | .+.+.++|.+.. .-.+|++..+|||||.+ T Consensus 129 f~~~GD~G~~~~~---~~~~~~~~~~~~~~dfvl~~GD~~Y~~~~~~~~~~~wd~~~~~~~~~~~~~P~~~~~GNHE~~~ 205 (426) T 1xzw_A 129 FGLIGDIGQTHDS---NTTLTHYEQNSAKGQAVLFMGDLSYSNRWPNHDNNRWDTWGRFSERSVAYQPWIWTAGNHEIDY 205 (426) T ss_dssp EEEECSCTTBHHH---HHHHHHHHHCTTCCSEEEECSCCCCGGGSGGGCTHHHHHHHHHHHHHHTTSCEECCCCGGGCCC T ss_pred EEEECCCCCCCCH---HHHHHHHHHCCCCCCEEEECCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCC T ss_conf 5364787888620---4899999870778758996577430464544215788899877677860496698347545566 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEE Q ss_conf 88766745455000003688899999999877655566654210010013444565521037775443343027335554 Q gi|254780291|r 233 NRPCRDPYTLTPSIYGCAFIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSWNIDNVHFIQ 312 (475) Q Consensus 233 N~dC~~P~~~d~s~n~CA~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSWd~gdvHfVQ 312 (475) +.+-.. ...+. .|... |..... ... -...+-||.++|++||+. T Consensus 206 ~~~~~~---------~~~~~----------~~~~~------f~~~~~----~~~--------~~~~~~Ysf~~g~v~fi~ 248 (426) T 1xzw_A 206 APDIGE---------YQPFV----------PFTNR------YPTPHE----ASG--------SGDPLWYAIKRASAHIIV 248 (426) T ss_dssp BGGGTB---------CSTTH----------HHHHH------SCCCCG----GGT--------CSSTTSEEEEETTEEEEE T ss_pred CCCCCC---------CCCCC----------CHHHC------CCCCCC----CCC--------CCCCCEEEEECCEEEEEE T ss_conf 666555---------44365----------27761------888644----567--------888855999817289999 Q ss_pred ECCCCEEEEECCCCCCHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC-EEEEECCCCC--C-----CHH Q ss_conf 0364103675167530012222100025651134320565577887888884034795-7999500477--2-----003 Q gi|254780291|r 313 ANYSMFHSVYFNDEWSNIFTVAVPEHISKQDLPSHVSNGSEISQWIRDDVFQAQREGK-YIILFADDID--R-----FSS 384 (475) Q Consensus 313 ~NYpsy~~V~~n~~~~~af~Iqv~n~Id~q~l~s~Vn~~~~iskWL~~DL~~A~~~gK-~IILN~HD~~--~-----~sS 384 (475) +|--.... ...+--+||++||..+++... -+|+.+|.-. . ... T Consensus 249 lds~~~~~-----------------------------~~~~Q~~WL~~~L~~~~~~~~~w~iv~~H~P~y~s~~~~~~~~ 299 (426) T 1xzw_A 249 LSSYSGFV-----------------------------KYSPQYKWFTSELEKVNRSETPWLIVLVHAPLYNSYEAHYMEG 299 (426) T ss_dssp CCTTSCCS-----------------------------TTSHHHHHHHHHHHHCCTTTCCEEEEECSSCSSCCBSTTTTTT T ss_pred ECCCCCCC-----------------------------CCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCC T ss_conf 54666876-----------------------------5049999999999975433797599981888643576666677 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEEEEC Q ss_conf 89999999999858961799986423 Q gi|254780291|r 385 IDQKRMFEKFLTQSKISTIFTTRFTS 410 (475) Q Consensus 385 i~~KrmFks~itk~nV~AIF~aH~Hq 410 (475) ...+..|+.++.+++|..||.||.|. T Consensus 300 ~~~r~~l~~lf~~y~Vdlvl~GH~H~ 325 (426) T 1xzw_A 300 EAMRAIFEPYFVYYKVDIVFSGHVHS 325 (426) T ss_dssp HHHHHHHHHHHHHTTCSEEEECSSSS T ss_pred HHHHHHHHHHHHHHCCEEEEECCCCC T ss_conf 88999999999981971999798556 No 6 >3d03_A Phosphohydrolase; glycerophosphodiesterase, metallohydrolase, phosphatase, metal ION; 1.90A {Enterobacter aerogenes} SCOP: d.159.1.11 PDB: 2zoa_A 2zo9_B 2dxn_A 2dxl_A Probab=99.02 E-value=3.7e-09 Score=78.81 Aligned_cols=177 Identities=13% Similarity=0.239 Sum_probs=103.8 Q ss_pred HHHHHHHHHH-CCCEEEEEEECCCCCCCCHHHHHHHHHHH-CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 6889998861-16567897605100121267788888672-655751302455444788876674545500000368889 Q gi|254780291|r 177 FVEAINSLKS-SKNIILGILTGDMTQSSTTKELKRFYNIY-SLKFPFFRGLGSQEYIGNRPCRDPYTLTPSIYGCAFIAI 254 (475) Q Consensus 177 ~vesinsl~~-~~~~~~gIINGDLTeFG~q~qL~eFr~Vw-nl~iPv~lGLGNHDYqNN~dC~~P~~~d~s~n~CA~sav 254 (475) +.+.++.+.. ..+.-+-|+.|||++.|...+++.|++.- .+++|++.=.||||+.++ T Consensus 28 l~~~~~~i~~~~~~~D~vv~tGDl~~~~~~~~y~~~~~~l~~l~~p~~~v~GNHD~~~~--------------------- 86 (274) T 3d03_A 28 NADVVSQLNALRERPDAVVVSGDIVNCGRPEEYQVARQILGSLNYPLYLIPGNHDDKAL--------------------- 86 (274) T ss_dssp HHHHHHHHHTCSSCCSEEEEESCCBSSCCHHHHHHHHHHHTTCSSCEEEECCTTSCHHH--------------------- T ss_pred HHHHHHHHHHCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCHHHH--------------------- T ss_conf 99999999832899999998875788998899999999998728878996788634678--------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEECCCCEEEEECCCCCCHHHHHH Q ss_conf 99999998776555666542100100134445655210377754433430273355540364103675167530012222 Q gi|254780291|r 255 NDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSWNIDNVHFIQANYSMFHSVYFNDEWSNIFTVA 334 (475) Q Consensus 255 ~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSWd~gdvHfVQ~NYpsy~~V~~n~~~~~af~Iq 334 (475) +......+.++ +..+ .+...|+-+.+.++|+-++...+.. ..++ T Consensus 87 --~~~~~~~~~~~------~~~~------------------~~~~~~~~~~~~~~~i~ldt~~~~~---~~~~------- 130 (274) T 3d03_A 87 --FLEYLQPLCPQ------LGSD------------------ANNMRCAVDDFATRLLFIDSSRAGT---SKGW------- 130 (274) T ss_dssp --HHHHHGGGSGG------GCSC------------------GGGCCEEECSSSSEEEECCCCCTTC---SSBC------- T ss_pred --HHHHHHHHCCC------CCCC------------------CCCEEEEEECCCEEEEECCCCCCCC---CCCC------- T ss_conf --89876543012------3567------------------7862699954875898425776788---7640------- Q ss_pred HHHCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEEECCCCCC------CH--HHHHHHHHHHHHHHC-CCCEEEE Q ss_conf 10002565113432056557788788888403479579995004772------00--389999999999858-9617999 Q gi|254780291|r 335 VPEHISKQDLPSHVSNGSEISQWIRDDVFQAQREGKYIILFADDIDR------FS--SIDQKRMFEKFLTQS-KISTIFT 405 (475) Q Consensus 335 v~n~Id~q~l~s~Vn~~~~iskWL~~DL~~A~~~gK~IILN~HD~~~------~s--Si~~KrmFks~itk~-nV~AIF~ 405 (475) + .-+-.+||++.|..+.. +++++.+|-.+- +. ..+....|.+.+.++ +|.+||. T Consensus 131 ----l-----------~~~~l~wL~~~L~~~~~--~~~iv~~Hhpp~~~~~~~~~~~~~~~~~~l~~~l~~~~~V~~vl~ 193 (274) T 3d03_A 131 ----L-----------TDETISWLEAQLFEGGD--KPATIFMHHPPLPLGNAQMDPIACENGHRLLALVERFPSLTRIFC 193 (274) T ss_dssp ----C-----------CHHHHHHHHHHHHHHTT--SCEEEEESSCSSCCSCTTTGGGSBTTTHHHHHHHHHCTTEEEEEE T ss_pred ----C-----------CHHHHHHHHHHHHHCCC--CCEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEE T ss_conf ----1-----------59999999999875106--634999943874678865454246689999999983799329998 Q ss_pred EEEECCCCCCCCCCCCCEEEEEEECCC Q ss_conf 864236765246555310138970456 Q gi|254780291|r 406 TRFTSSPESYIKDSTGRPVRVYNINKN 432 (475) Q Consensus 406 aH~Hqshe~~lec~~GykVPvy~iGSv 432 (475) ||.|....... ..+|++...|. T Consensus 194 GH~H~~~~~~~-----~gi~~~~~pst 215 (274) T 3d03_A 194 GHNHSLTMTQY-----RQALISTLPGT 215 (274) T ss_dssp CSSSSCEEEEE-----TTEEEEECCCS T ss_pred CCCCHHHHEEE-----CCEEEEECCCC T ss_conf 85480350699-----99999984841 No 7 >2yvt_A Hypothetical protein AQ_1956; structural genomics, unknown function, NPPSFA, national project on protein structural and functional analyses; 1.60A {Aquifex aeolicus VF5} SCOP: d.159.1.6 Probab=98.52 E-value=1.7e-06 Score=61.31 Aligned_cols=217 Identities=10% Similarity=0.072 Sum_probs=112.8 Q ss_pred CCEEEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHH---------------------------HHHHH Q ss_conf 551555133100463146889998861165678976051001212677---------------------------88888 Q gi|254780291|r 160 KGIAVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSSTTKE---------------------------LKRFY 212 (475) Q Consensus 160 ~~~~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~q~q---------------------------L~eFr 212 (475) +-|++|.|=--.-+ +.+.+..+...+++-+=++.|||++.|...+ ++++. T Consensus 6 ~~i~~isD~h~~~~---~l~~l~~~~~~~~~D~vl~~GDl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (260) T 2yvt_A 6 RKVLAIKNFKERFD---LLPKLKGVIAEKQPDILVVVGNILKNEALEKEYERAHLARREPNRKVIHENEHYIIETLDKFF 82 (260) T ss_dssp CEEEEEECCTTCGG---GHHHHHHHHHHHCCSEEEEESCCCCCHHHHHHHHHHHHTTCCCCTHHHHHHHHHHHHHHHHHH T ss_pred EEEEEEECCCCCHH---HHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHH T ss_conf 58999953799989---999999988774999999916878999875899999987645455077502663188999999 Q ss_pred HHH-CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCC Q ss_conf 672-6557513024554447888766745455000003688899999999877655566654210010013444565521 Q gi|254780291|r 213 NIY-SLKFPFFRGLGSQEYIGNRPCRDPYTLTPSIYGCAFIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETY 291 (475) Q Consensus 213 ~Vw-nl~iPv~lGLGNHDYqNN~dC~~P~~~d~s~n~CA~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~ 291 (475) +.. +...|++.=.||||.-.... +....... ............ T Consensus 83 ~~~~~~~~~~~~i~GNHD~~~~~~---------------------~~~~~~~~---------~~~~~~~~~~~~------ 126 (260) T 2yvt_A 83 REIGELGVKTFVVPGKNDAPLKIF---------------------LRAAYEAE---------TAYPNIRVLHEG------ 126 (260) T ss_dssp HHHHTTCSEEEEECCTTSCCHHHH---------------------HHHHHHTT---------TTCTTEEECSSE------ T ss_pred HHHHHCCCCEEEECCCCCHHHHHH---------------------HHHHCCCC---------CCCCCCCCCCCE------ T ss_conf 999855996388628861766545---------------------66641642---------123442123424------ Q ss_pred EEEEEEEEEEEEECCCEEEEEECCCCEEEEECCCCCCHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCE Q ss_conf 03777544334302733555403641036751675300122221000256511343205655778878888840347957 Q gi|254780291|r 292 SISISGSQSYSWNIDNVHFIQANYSMFHSVYFNDEWSNIFTVAVPEHISKQDLPSHVSNGSEISQWIRDDVFQAQREGKY 371 (475) Q Consensus 292 ~vei~GSLSYSWd~gdvHfVQ~NYpsy~~V~~n~~~~~af~Iqv~n~Id~q~l~s~Vn~~~~iskWL~~DL~~A~~~gK~ 371 (475) .+...+ +.+-.+...+.- ... ...++ ........-.|++..+.+.. ... T Consensus 127 ~~~~~~---~~~~~~~~~~~~--~~~---------~~~~~---------------~~~~~~~~~~~~~~~~~~~~--~~~ 175 (260) T 2yvt_A 127 FAGWRG---EFEVIGFGGLLT--EHE---------FEEDF---------------VLKYPRWYVEYILKFVNELK--PRR 175 (260) T ss_dssp EEEETT---TEEEEEECSEEE--SSC---------CBSSS---------------SCEEEHHHHHHHGGGGGGSC--CCE T ss_pred EEEECC---CEEEEEECCCCC--CCC---------CCHHH---------------HHHHHHHHHHHHHHHHHHHC--CCC T ss_conf 899539---779998457768--744---------53044---------------44345678999987653202--355 Q ss_pred EEEECCCCCCC---------HHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCEEEEEE Q ss_conf 99950047720---------038999999999985896179998642367652465553101389704564671899996 Q gi|254780291|r 372 IILFADDIDRF---------SSIDQKRMFEKFLTQSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNSKNEFILLEM 442 (475) Q Consensus 372 IILN~HD~~~~---------sSi~~KrmFks~itk~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvpkNrF~~lem 442 (475) +++.+|-.+.. ........+...++..++..++.||.|+.... .++.++++.||+..++|..+|+ T Consensus 176 ~~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~vi~GH~H~~~~~------~g~~~iv~pGs~~~~~y~~i~~ 249 (260) T 2yvt_A 176 LVTIFYTPPIGEFVDRTPEDPKHHGSAVVNTIIKSLNPEVAIVGHVGKGHEL------VGNTIVVNPGEFEEGRYAFLDL 249 (260) T ss_dssp EEEEESSCCSCSSTTCBTTBSCCCSCHHHHHHHHHHCCSEEEECSSCCEEEE------ETTEEEEECCBGGGTEEEEEET T ss_pred EEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCEEE------ECCEEEEECCCCCCCCEEEEEE T ss_conf 0578853873224554532220136799999987619979999544688487------5987999899554491799998 Q ss_pred CCCEEEEEEE Q ss_conf 4885999998 Q gi|254780291|r 443 TPHYINVTAY 452 (475) Q Consensus 443 ~~~~atITgY 452 (475) +++.+.+.-| T Consensus 250 ~~~~~~~~~~ 259 (260) T 2yvt_A 250 TQHKIKLEQF 259 (260) T ss_dssp TTTEEEEEEC T ss_pred CCCEEEEEEC T ss_conf 3996999977 No 8 >1uf3_A Hypothetical protein TT1561; metallo-dependent phosphatases, structural genomics, riken structural genomics/proteomics initiative, RSGI; 2.10A {Thermus thermophilus} SCOP: d.159.1.6 Probab=98.45 E-value=6.6e-06 Score=57.35 Aligned_cols=216 Identities=9% Similarity=0.105 Sum_probs=108.8 Q ss_pred CCEEEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCH-HHHHHHHHHH-CCCCCEEECCCCCCCCCCCCCC Q ss_conf 5515551331004631468899988611656789760510012126-7788888672-6557513024554447888766 Q gi|254780291|r 160 KGIAVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSSTT-KELKRFYNIY-SLKFPFFRGLGSQEYIGNRPCR 237 (475) Q Consensus 160 ~~~~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~q-~qL~eFr~Vw-nl~iPv~lGLGNHDYqNN~dC~ 237 (475) |-|+++.|+--..+ .+ |.+-.+......-+=|+.|||++.|.. .+..++-..- ..+.|++.=.||||..- T Consensus 6 ~~i~~~~d~hg~~~--al-e~~l~~~~~~~~D~vv~~GDl~~~~~~~~~~~~~~~~l~~~~~pv~~v~GNHD~~~----- 77 (228) T 1uf3_A 6 RYILATSNPMGDLE--AL-EKFVKLAPDTGADAIALIGNLMPKAAKSRDYAAFFRILSEAHLPTAYVPGPQDAPI----- 77 (228) T ss_dssp CEEEEEECCTTCHH--HH-HHHHTHHHHHTCSEEEEESCSSCTTCCHHHHHHHHHHHGGGCSCEEEECCTTSCSH----- T ss_pred EEEEEEECCCCCHH--HH-HHHHHHHHHHCCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCHH----- T ss_conf 29999956889999--99-99999877729999998788699987829999999974045775899968998167----- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEECCCC Q ss_conf 74545500000368889999999987765556665421001001344456552103777544334302733555403641 Q gi|254780291|r 238 DPYTLTPSIYGCAFIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSWNIDNVHFIQANYSM 317 (475) Q Consensus 238 ~P~~~d~s~n~CA~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSWd~gdvHfVQ~NYps 317 (475) .+ . +..-..... ...... .-...+.+..+.+.++....+. T Consensus 78 --------~~--~------~~~~~~~~~--------~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~ 117 (228) T 1uf3_A 78 --------WE--Y------LREAANVEL--------VHPEMR----------------NVHETFTFWRGPYLVAGVGGEI 117 (228) T ss_dssp --------HH--H------HHHHHHHHH--------HCTTEE----------------ECBTSEEEETTTEEEEEECSEE T ss_pred --------HH--H------HHHHCCCCC--------CCCCCC----------------CCCCEEEEECCCEEEEECCCCC T ss_conf --------52--0------355314211--------234220----------------0132268841877787417744 Q ss_pred EEEEECCCCCCHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHH--HHHHHHHHH Q ss_conf 036751675300122221000256511343205655778878888840347957999500477200389--999999999 Q gi|254780291|r 318 FHSVYFNDEWSNIFTVAVPEHISKQDLPSHVSNGSEISQWIRDDVFQAQREGKYIILFADDIDRFSSID--QKRMFEKFL 395 (475) Q Consensus 318 y~~V~~n~~~~~af~Iqv~n~Id~q~l~s~Vn~~~~iskWL~~DL~~A~~~gK~IILN~HD~~~~sSi~--~KrmFks~i 395 (475) +.. ...+. ...+-....-..|+.+.+..... ...++-.|..+...... .-..+.+.+ T Consensus 118 ~~~---~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~--~~~i~~~h~~~~~~~~~~~~~~~~~~~~ 176 (228) T 1uf3_A 118 ADE---GEPEE----------------HEALRYPAWVAEYRLKALWELKD--YPKIFLFHTMPYHKGLNEQGSHEVAHLI 176 (228) T ss_dssp ESS---SCCBS----------------SSSCEEEHHHHHHHHGGGGGSCS--CCEEEEESSCBCBTTTBTTSBHHHHHHH T ss_pred CCC---CCCCH----------------HHHHHHHHHHHHHHHHHHHHCCC--CCEEEEEECCCCCCCCCCCCCHHHHHHH T ss_conf 677---77602----------------32334458899999998664035--5148987138876532334625565555 Q ss_pred HHCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCEEEEEECCCEEEEE Q ss_conf 8589617999864236765246555310138970456467189999648859999 Q gi|254780291|r 396 TQSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNSKNEFILLEMTPHYINVT 450 (475) Q Consensus 396 tk~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvpkNrF~~lem~~~~atIT 450 (475) +..++..++.||.|+.++.. ++..+++.|++..|+|..+|++...+... T Consensus 177 ~~~~~~l~~~GH~H~~~~~~------~~~~~in~Gs~~~g~y~i~d~~~~~v~~~ 225 (228) T 1uf3_A 177 KTHNPLLVLVAGKGQKHEML------GASWVVVPGDLSEGEYSLLDLRARKLETG 225 (228) T ss_dssp HHHCCSEEEECCSSCEEEEE------TTEEEEECCBGGGTEEEEEETTTTEEEEE T ss_pred HHCCCCEEEECCCCCCCEEE------CCEEEEECCCCCCCCEEEEEECCCEEEEE T ss_conf 50288679957846661388------98899988876458279999519989998 No 9 >1ii7_A MRE11 nuclease; RAD50, DNA double-strand break repair, DAMP, manganese, replication; HET: DA; 2.20A {Pyrococcus furiosus} SCOP: d.159.1.4 PDB: 3dsc_A* 3dsd_A* 1s8e_A Probab=97.26 E-value=0.00016 Score=48.24 Aligned_cols=68 Identities=18% Similarity=0.269 Sum_probs=45.5 Q ss_pred CCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCC--HHHHHHHHHHH----CCCCCEEECCCCCCCCCCCC Q ss_conf 3100463146889998861165678976051001212--67788888672----65575130245544478887 Q gi|254780291|r 168 PWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSST--TKELKRFYNIY----SLKFPFFRGLGSQEYIGNRP 235 (475) Q Consensus 168 pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~--q~qL~eFr~Vw----nl~iPv~lGLGNHDYqNN~d 235 (475) ++...+...+.+.+-..-..+++-+=++.|||.+.+. ...+..+.... ..++|++.=.||||+..+.. T Consensus 19 ~~~~~~~~~~~~~~v~~a~~~~vD~vli~GDlfd~~~~~~~~~~~~~~~~~~l~~~~ipv~~i~GNHD~~~~~~ 92 (333) T 1ii7_A 19 PQREEEFAEAFKNALEIAVQENVDFILIAGDLFHSSRPSPGTLKKAIALLQIPKEHSIPVFAIEGNHDRTQRGP 92 (333) T ss_dssp HHHHHHHHHHHHHHHHHHHHTTCSEEEEESCSBSSSSCCHHHHHHHHHHHHHHHTTTCCEEEECCTTTCCSSSC T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHCCH T ss_conf 45799999999999999987399999989887889989999999999999999868995999646666210003 No 10 >2q8u_A Exonuclease, putative; structural genomics, joint center for structural genomics, J protein structure initiative, PSI-2, hydrolase; HET: MSE; 2.20A {Thermotoga maritima MSB8} Probab=97.11 E-value=0.002 Score=41.01 Aligned_cols=196 Identities=11% Similarity=0.119 Sum_probs=97.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCC---HHHHHHHHHHH---CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 6889998861165678976051001212---67788888672---65575130245544478887667454550000036 Q gi|254780291|r 177 FVEAINSLKSSKNIILGILTGDMTQSST---TKELKRFYNIY---SLKFPFFRGLGSQEYIGNRPCRDPYTLTPSIYGCA 250 (475) Q Consensus 177 ~vesinsl~~~~~~~~gIINGDLTeFG~---q~qL~eFr~Vw---nl~iPv~lGLGNHDYqNN~dC~~P~~~d~s~n~CA 250 (475) +.+.+-..-...++-+=++.|||++.+. ...+..+.... +-++|+|.=.||||+.+... T Consensus 49 ~l~~i~~~a~~~~~D~vliaGDlf~~~~~~~~~~~~~~~~~~~~l~~~~~v~~i~GNHD~~~~~~--------------- 113 (336) T 2q8u_A 49 ALDKVVEEAEKREVDLILLTGDLLHSRNNPSVVALHDLLDYLKRMMRTAPVVVLPGNHDWKGLKL--------------- 113 (336) T ss_dssp HHHHHHHHHHHHTCSEEEEESCSBSCSSCCCHHHHHHHHHHHHHHHHHSCEEECCC------CHH--------------- T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHC--------------- T ss_conf 99999999997499999989877779998889999999999998742897899457646410001--------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEECCCCEEEEECCCCCCHH Q ss_conf 88899999999877655566654210010013444565521037775443343027335554036410367516753001 Q gi|254780291|r 251 FIAINDISQQINDHYPQIKSIKEFNGDSQRYRNRSWHGETYSISISGSQSYSWNIDNVHFIQANYSMFHSVYFNDEWSNI 330 (475) Q Consensus 251 ~sav~~M~s~I~dy~~q~s~i~efN~Ds~~y~~~~~~~~~~~vei~GSLSYSWd~gdvHfVQ~NYpsy~~V~~n~~~~~a 330 (475) ....... +..........- .+.+ +....+.++++-..|+....+.. T Consensus 114 ------~~~~~~~----------~~~~~~~~~~~~------~~~~-----~~~~~~~v~~~~~~~~~~~~~~~------- 159 (336) T 2q8u_A 114 ------FGNFVTS----------ISSDITFVMSFE------PVDV-----EAKRGQKVRILPFPYPDESEALR------- 159 (336) T ss_dssp ------HHHHHHH----------HCSSEEECCSSS------CEEE-----ECTTSCEEEEEEECCC-------------- T ss_pred ------CHHHHHH----------CCCCCEECCCCC------CEEE-----ECCCCCEEEEECCCCCCCCCCCC------- T ss_conf ------1002221----------045500114556------5688-----70279669994366643000123------- Q ss_pred HHHHHHHCCCCCCEEEEEECCCCHH----HHHHHHHHHCCCCCCEEEEECCCC-CCCHHHHHHH-----HHHHHHHHCCC Q ss_conf 2222100025651134320565577----887888884034795799950047-7200389999-----99999985896 Q gi|254780291|r 331 FTVAVPEHISKQDLPSHVSNGSEIS----QWIRDDVFQAQREGKYIILFADDI-DRFSSIDQKR-----MFEKFLTQSKI 400 (475) Q Consensus 331 f~Iqv~n~Id~q~l~s~Vn~~~~is----kWL~~DL~~A~~~gK~IILN~HD~-~~~sSi~~Kr-----mFks~itk~nV 400 (475) ....+.. +++++-+..+...+.+.|+-+|.. ..+....... .+...+...++ T Consensus 160 ------------------~~~~~~~~~~~~~l~~~~~~~~~~~~~~I~~~H~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (336) T 2q8u_A 160 ------------------KNEGDFRFFLESRLNKLYEEALKKEDFAIFMGHFTVEGLAGYAGIEQGREIIINRALIPSVV 221 (336) T ss_dssp ------------------CCSSHHHHHHHHHHHHHHHHHHTCSSEEEEEEESEETTCC--------CCCEECGGGSCTTS T ss_pred ------------------CCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHCCC T ss_conf ------------------33067889999999999987326466539987542157666675223765455166631386 Q ss_pred CEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCC---------CCEEEEEECCC Q ss_conf 1799986423676524655531013897045646---------71899996488 Q gi|254780291|r 401 STIFTTRFTSSPESYIKDSTGRPVRVYNINKNSK---------NEFILLEMTPH 445 (475) Q Consensus 401 ~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvpk---------NrF~~lem~~~ 445 (475) ..|+-||.|.....- .+-.++|.||... .-|.++|+++. T Consensus 222 d~v~~GH~H~~~~~~------~~~~i~y~Gs~~~~~~~E~~~~kg~~lvdi~~~ 269 (336) T 2q8u_A 222 DYAALGHIHSFREIQ------KQPLTIYPGSLIRIDFGEEADEKGAVFVELKRG 269 (336) T ss_dssp SEEEEESCSSCEEEE------ETTEEEECCCSSCCSGGGTTCCCEEEEEEEETT T ss_pred CEEEECCCCCCEEEC------CCCCEEECCCCCCCCCCCCCCCCEEEEEEECCC T ss_conf 289927775755877------999889738962677251479987999998289 No 11 >2a22_A Vacuolar protein sorting 29; alpha-beta-BETA-alpha sandwich, structural genomics, structural genomics consortium, SGC, protein transport; 2.20A {Cryptosporidium parvum} SCOP: d.159.1.7 Probab=85.29 E-value=1.8 Score=21.52 Aligned_cols=63 Identities=11% Similarity=0.176 Sum_probs=46.7 Q ss_pred HHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCC----------CEEEEEECCCEEEEEEEECCCCC Q ss_conf 999998589617999864236765246555310138970456467----------18999964885999998516763 Q gi|254780291|r 391 FEKFLTQSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNSKN----------EFILLEMTPHYINVTAYERRGKV 458 (475) Q Consensus 391 Fks~itk~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvpkN----------rF~~lem~~~~atITgY~~Rd~t 458 (475) +..+....++..++.||-|..- .+. ..++-++|-||+... -|.++|+++..+++.-|+-++.- T Consensus 133 l~~~~~~~~~dvvv~GHTH~p~---~~~--~~g~l~iNPGS~~~~r~~~~~~~~pS~aiLdi~~~~v~v~~y~l~~~~ 205 (215) T 2a22_A 133 LEQWQRRLDCDILVTGHTHKLR---VFE--KNGKLFLNPGTATGAFSALTPDAPPSFMLMALQGNKVVLYVYDLRDGK 205 (215) T ss_dssp HHHHHHHHTCSEEEECSSCCCE---EEE--ETTEEEEECCCSSCCCCTTSTTCCCEEEEEEEETTEEEEEEEEEETTE T ss_pred HHHHHHCCCCCEEEECCCCCCC---EEE--ECCEEEEECCCCCCCCCCCCCCCCCEEEEEEEECCEEEEEEEEECCCE T ss_conf 9998750289989979989764---799--999999948987777778788888889999998999999999934992 No 12 >1z2w_A Vacuolar protein sorting 29; VPS29, retromer, phosphatase, manganese, protein transport; 2.00A {Mus musculus} SCOP: d.159.1.7 PDB: 1z2x_A 3lh6_A 3lh7_A 1w24_A 2r17_A Probab=84.61 E-value=1.9 Score=21.32 Aligned_cols=61 Identities=11% Similarity=0.159 Sum_probs=43.7 Q ss_pred HHHHHHCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCC--CC--------CEEEEEECCCEEEEEEEECCCC Q ss_conf 999985896179998642367652465553101389704564--67--------1899996488599999851676 Q gi|254780291|r 392 EKFLTQSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNS--KN--------EFILLEMTPHYINVTAYERRGK 457 (475) Q Consensus 392 ks~itk~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvp--kN--------rF~~lem~~~~atITgY~~Rd~ 457 (475) ....+....-.||.||-|.... +.. .+.-|+|-||+. ++ -|-.+|+++..+++.-|+..+. T Consensus 110 ~~~~~~~~~divi~GHTH~p~~---~~~--~~~~iiNPGS~~~pr~~~~~~~~~syaIld~~~~~v~~~~~~l~~~ 180 (192) T 1z2w_A 110 ALLQRQFDVDILISGHTHKFEA---FEH--ENKFYINPGSATGAYNALETNIIPSFVLMDIQASTVVTYVYQLIGD 180 (192) T ss_dssp HHHHHHHSSSEEECCSSCCCEE---EEE--TTEEEEECCCTTCCCCSSCSCCCCEEEEEEEETTEEEEEEEEEETT T ss_pred HHHHHHCCCCEEEECCCCCCEE---EEE--CCEEEEECCCCCCCCCCCCCCCCCEEEEEEEECCEEEEEEEEECCC T ss_conf 9998735899899788785038---999--9999997998888888988888778999998399999999993599 No 13 >1su1_A Hypothetical protein YFCE; structural genomics, phosphoesterase, PSI, protein structure initiative, midwest center for structural genomics, MCSG; 2.25A {Escherichia coli} SCOP: d.159.1.7 Probab=81.94 E-value=2.5 Score=20.63 Aligned_cols=66 Identities=12% Similarity=0.171 Sum_probs=42.9 Q ss_pred EEEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHH------HHH-HHHHCCCCCEEECCCCCCC Q ss_conf 15551331004631468899988611656789760510012126778------888-8672655751302455444 Q gi|254780291|r 162 IAVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSSTTKEL------KRF-YNIYSLKFPFFRGLGSQEY 230 (475) Q Consensus 162 ~~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~q~qL------~eF-r~Vwnl~iPv~lGLGNHDY 230 (475) |++|+|-- .+.+. .+++-....+..+-.=|+.||++..|....+ .+. ....+++.|++.-.||||. T Consensus 28 i~viSDiH--gn~~a-le~vl~~~~~~~~D~ii~lGDlv~~gp~~~~~~~~~~~~~l~~l~~~~~~~~~V~GNhD~ 100 (208) T 1su1_A 28 LMFASDIH--GSLPA-TERVLELFAQSGAQWLVILGDVLNHGPRNALPEGYAPAKVVERLNEVAHKVIAVRGNCDS 100 (208) T ss_dssp EEEECCCT--TBHHH-HHHHHHHHHHHTCSEEEECSCCSCCCTTSCCCTTBCHHHHHHHHHTTGGGEEECCCTTCC T ss_pred EEEEEECC--CCHHH-HHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHCCCHHHHHHHHHCCCCEEEECCCCCH T ss_conf 99993168--99899-999999987569988999166013376520111128099999998549967996477837 No 14 >2kkn_A Uncharacterized protein; protein phosphatase 2A homologue, structural genomics, PSI- 2, protein structure initiative; NMR {Thermotoga maritima} Probab=74.40 E-value=3.2 Score=19.85 Aligned_cols=50 Identities=14% Similarity=0.168 Sum_probs=37.9 Q ss_pred HCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCEEEEEECCCEEEEEE Q ss_conf 5896179998642367652465553101389704564671899996488599999 Q gi|254780291|r 397 QSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNSKNEFILLEMTPHYINVTA 451 (475) Q Consensus 397 k~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvpkNrF~~lem~~~~atITg 451 (475) ..+...++.||-|.... + .-+++-++|-||+..+.|..++++...+++.= T Consensus 126 ~~~~divi~GHTH~~~~---~--~~~~~~iiNPGS~~~~syail~ie~~~v~~ei 175 (178) T 2kkn_A 126 NEKPQVILFGHTHEPED---T--VKAGVRFLNPGSLAEGSYAVLELDGGEVRFEL 175 (178) T ss_dssp SSCCSEEECCSCSSCCE---E--EETTEEEECCCCTTTTEEEEEEEETTEEEEEE T ss_pred HCCCCEEEECCCCCCEE---E--EECCEEEEECCCCCCCEEEEEEEECCEEEEEE T ss_conf 13898899688565218---9--98999999798999986999999899999999 No 15 >1nnw_A Hypothetical protein; structural genomics, PSI, protein structure initiative, southeast collaboratory for structural genomics, secsg; 1.90A {Pyrococcus furiosus} SCOP: d.159.1.5 PDB: 2gju_A Probab=64.88 E-value=6.4 Score=17.87 Aligned_cols=47 Identities=13% Similarity=0.088 Sum_probs=33.1 Q ss_pred CCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCC-------CCCEEEEEECCCEEEE Q ss_conf 896179998642367652465553101389704564-------6718999964885999 Q gi|254780291|r 398 SKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNS-------KNEFILLEMTPHYINV 449 (475) Q Consensus 398 ~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvp-------kNrF~~lem~~~~atI 449 (475) .+...++.||.|....... ....+++.||+- +..|.++|.+...+++ T Consensus 166 ~~~~~vv~GHtH~~~~~~~-----~~~~~iN~Gsvg~~~~~~~~~~y~i~d~~~~~v~~ 219 (252) T 1nnw_A 166 KDYEMLIVASPMYPVDAMT-----RYGRVVCPGSVGFPPGKEHKATFALVDVDTLKPKF 219 (252) T ss_dssp TTSSEEEESTTCSEEEEEE-----TTEEEEEECCSSSCSSSSCCEEEEEEETTTCCEEE T ss_pred CCCCEEEECCCCEEEEEEE-----CCCEEECCCEEEECCCCCCCCEEEEEECCCCEEEE T ss_conf 4662489724744678860-----78368718616007899987779999937987999 No 16 >3ck2_A Conserved uncharacterized protein (predicted phosphoesterase COG0622); structural genomics, predicted phosphodiesterase, PSI-2; HET: SRT; 2.30A {Streptococcus pneumoniae TIGR4} SCOP: d.159.1.7 Probab=58.36 E-value=8.3 Score=17.12 Aligned_cols=57 Identities=11% Similarity=0.138 Sum_probs=41.2 Q ss_pred HHHHCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECCCC-------CCCEEEEEECCCEEEEEEEECC Q ss_conf 9985896179998642367652465553101389704564-------6718999964885999998516 Q gi|254780291|r 394 FLTQSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINKNS-------KNEFILLEMTPHYINVTAYERR 455 (475) Q Consensus 394 ~itk~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGSvp-------kNrF~~lem~~~~atITgY~~R 455 (475) .....+...+|.||-|...... . +.+-++|-||+- ...|-++|.+....++.-|..- T Consensus 100 ~~~~~~~divi~GHTH~p~~~~---~--~~~~iiNPGSvg~pr~~~~~~syail~~~~~~~~v~~~~~d 163 (176) T 3ck2_A 100 WAQEEEAAICLYGHLHVPSAWL---E--GKILFLNPGSISQPRGTIRECLYARVEIDDSYFKVDFLTRD 163 (176) T ss_dssp HHHHTTCSEEECCSSCCEEEEE---E--TTEEEEEECCSSSCCTTCCSCCEEEEEECSSEEEEEEECTT T ss_pred HHHHCCCCEEEECCCCCCEEEE---E--CCEEEEECCCCCCCCCCCCCCEEEEEEEECCEEEEEEEEEC T ss_conf 9985599899968977425999---8--99999935886677899985689999981999999999858 No 17 >1v77_A PH1877P, hypothetical protein PH1877; RNAse P protein, TIM-barrel, RNA binding protein; 1.80A {Pyrococcus horikoshii} SCOP: c.6.3.2 PDB: 2czv_A* Probab=55.66 E-value=7.2 Score=17.55 Aligned_cols=80 Identities=21% Similarity=0.324 Sum_probs=58.2 Q ss_pred CCCCCEEEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCC--CCC-CHHHHHHHHHHH----CCCCCEEECCCCCC Q ss_conf 20155155513310046314688999886116567897605100--121-267788888672----65575130245544 Q gi|254780291|r 157 CHHKGIAVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMT--QSS-TTKELKRFYNIY----SLKFPFFRGLGSQE 229 (475) Q Consensus 157 ~~~~~~~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLT--eFG-~q~qL~eFr~Vw----nl~iPv~lGLGNHD 229 (475) |.+ ++-+++-||..--..-|-.....+-..+++++.|-++.+- ..| +..-+..+++++ ..++|+..+-|.|. T Consensus 93 ~e~-~VDiL~~P~~~r~d~gi~hvlak~A~e~gValEIn~s~ll~~~~~~R~~~l~n~~~ll~L~kky~~piVvsSdAhs 171 (212) T 1v77_A 93 IEK-GVDAIISPWVNRKDPGIDHVLAKLMVKKNVALGFSLRPLLYSNPYERANLLRFMMKAWKLVEKYKVRRFLTSSAQE 171 (212) T ss_dssp HHT-TCSEEECTTTTSSSCSCCHHHHHHHHHHTCEEEEESHHHHHSCHHHHHHHHHHHHHHHHHHHHHTCCEEEECCCSS T ss_pred HHC-CCCEEECCCCCCCCCCCCHHHHHHHHHCCEEEEEECHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCC T ss_conf 717-9548956865467778578999999978918999762764479864789999999999999981998899769998 Q ss_pred CCCCCCCCCCC Q ss_conf 47888766745 Q gi|254780291|r 230 YIGNRPCRDPY 240 (475) Q Consensus 230 YqNN~dC~~P~ 240 (475) - .+++.|. T Consensus 172 ~---~dlrsp~ 179 (212) T 1v77_A 172 K---WDVRYPR 179 (212) T ss_dssp G---GGCCCHH T ss_pred H---HHCCCHH T ss_conf 2---0127999 No 18 >1s3l_A Hypothetical protein MJ0936; phosphodiesterase, nuclease, structural genomics, BSGC structure funded by NIH; 2.40A {Methanocaldococcus jannaschii} SCOP: d.159.1.7 PDB: 1s3m_A 1s3n_A 2ahd_A Probab=46.99 E-value=13 Score=15.95 Aligned_cols=60 Identities=18% Similarity=0.273 Sum_probs=37.4 Q ss_pred EEEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHCCCCCEEECCCCCCC Q ss_conf 155513310046314688999886116567897605100121267788888672655751302455444 Q gi|254780291|r 162 IAVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSSTTKELKRFYNIYSLKFPFFRGLGSQEY 230 (475) Q Consensus 162 ~~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~q~qL~eFr~Vwnl~iPv~lGLGNHDY 230 (475) |++|+|-- .|-+.+ +++-....++++-.=|.-||++..+- ++++. ++..|++--.||||. T Consensus 28 I~iiSDiH--gn~~al-e~vl~~~~~~~~D~vi~lGDiv~~~~---~~~l~---~~~~~~~~V~GNhD~ 87 (190) T 1s3l_A 28 IGIMSDTH--DHLPNI-RKAIEIFNDENVETVIHCGDFVSLFV---IKEFE---NLNANIIATYGNNDG 87 (190) T ss_dssp EEEECCCT--TCHHHH-HHHHHHHHHSCCSEEEECSCCCSTHH---HHHGG---GCSSEEEEECCTTCC T ss_pred EEEEECCC--CCHHHH-HHHHHHHHHCCCCEEEECCCCCCHHH---HHHHH---HCCCCEEEEECCCCC T ss_conf 99998089--996999-99999997559999998788389899---99987---347608997276543 No 19 >1wb9_A DNA mismatch repair protein MUTS; DNA-binding, ATP-binding, complete proteome, DNA binding, DNA repair; HET: DNA ADP; 2.10A {Escherichia coli} SCOP: a.113.1.1 c.37.1.12 c.55.6.1 d.75.2.1 PDB: 1wbb_A* 1e3m_A* 1oh5_A* 1oh6_A* 1oh7_A* 1oh8_A* 1w7a_A* 2wtu_A* 1wbd_A* 1ng9_A* 3k0s_A* Probab=37.38 E-value=18 Score=14.99 Aligned_cols=53 Identities=17% Similarity=0.167 Sum_probs=31.2 Q ss_pred HHHHHHHCCCCCCEEEEECCC-CCCCHHHH----HHHHHHHHHHHCCCCEEEEEEEECCCC Q ss_conf 788888403479579995004-77200389----999999999858961799986423676 Q gi|254780291|r 358 IRDDVFQAQREGKYIILFADD-IDRFSSID----QKRMFEKFLTQSKISTIFTTRFTSSPE 413 (475) Q Consensus 358 L~~DL~~A~~~gK~IILN~HD-~~~~sSi~----~KrmFks~itk~nV~AIF~aH~Hqshe 413 (475) +..=|..|..++++|+ +- +-+.++.| +..+.+.++++.+..++|.+|||+=.+ T Consensus 677 ~~~il~~at~~SLvii---DElGrGTst~DG~aia~avle~l~~~~~~~~lfaTH~~eL~~ 734 (800) T 1wb9_A 677 TANILHNATEYSLVLM---DEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQ 734 (800) T ss_dssp HHHHHHHCCTTEEEEE---ESCCCCSSSSHHHHHHHHHHHHHHHTTCCEEEEECSCGGGGG T ss_pred HHHHHHHCCCCEEEEE---ECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHH T ss_conf 9999985899829998---059999985999999999999998636988999895089988 No 20 >1wgo_A VPS10 domain-containing receptor sorcs2; polycystic kidney disease, PKD, structural genomics, KIAA1329 protein; NMR {Homo sapiens} SCOP: b.1.3.1 Probab=37.12 E-value=8.9 Score=16.93 Aligned_cols=13 Identities=8% Similarity=0.197 Sum_probs=9.9 Q ss_pred EEEEEEECCCEEE Q ss_conf 4433430273355 Q gi|254780291|r 298 SQSYSWNIDNVHF 310 (475) Q Consensus 298 SLSYSWd~gdvHf 310 (475) +++|+|||||-.. T Consensus 61 ~~sy~WDFGDG~~ 73 (123) T 1wgo_A 61 TTKYQVDLGDGFK 73 (123) T ss_dssp SCEEEEECSSSCE T ss_pred CEEEEEEECCCCC T ss_conf 1689999089971 No 21 >3f9u_A Putative exported cytochrome C biogenesis- related protein; structural genomics, PSI-2, protein structure initiative; 2.20A {Bacteroides fragilis nctc 9343} Probab=35.58 E-value=10 Score=16.60 Aligned_cols=22 Identities=14% Similarity=-0.071 Sum_probs=20.2 Q ss_pred HHHHHHHHCCCCCCEEEEECCC Q ss_conf 8788888403479579995004 Q gi|254780291|r 357 WIRDDVFQAQREGKYIILFADD 378 (475) Q Consensus 357 WL~~DL~~A~~~gK~IILN~HD 378 (475) .++..|.+|+.+||+|+|+|.- T Consensus 35 d~~~al~~Ak~~gKpvll~F~A 56 (172) T 3f9u_A 35 DYDLGMEYARQHNKPVMLDFTG 56 (172) T ss_dssp CHHHHHHHHHHTTCCEEEEEEC T ss_pred CHHHHHHHHHHCCCEEEEEEEC T ss_conf 8999999999839939999976 No 22 >2o8b_B DNA mismatch repair protein MSH6; DNA damage response, somatic hypermutation, protein-DNA complex, DNA mispair, cancer, ABC transporter ATPase; HET: DNA ADP; 2.75A {Homo sapiens} PDB: 2o8c_B* 2o8d_B* 2o8e_B* 2o8f_B* Probab=32.74 E-value=21 Score=14.51 Aligned_cols=70 Identities=19% Similarity=0.244 Sum_probs=41.9 Q ss_pred HHHHHHHHCCCCCCEEEEECCC-CCCCHHHH----HHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEECC Q ss_conf 8788888403479579995004-77200389----999999999858961799986423676524655531013897045 Q gi|254780291|r 357 WIRDDVFQAQREGKYIILFADD-IDRFSSID----QKRMFEKFLTQSKISTIFTTRFTSSPESYIKDSTGRPVRVYNINK 431 (475) Q Consensus 357 WL~~DL~~A~~~gK~IILN~HD-~~~~sSi~----~KrmFks~itk~nV~AIF~aH~Hqshe~~lec~~GykVPvy~iGS 431 (475) =+..=|..|..+++++| +- +-+.++.| +-.+.+.+.++.+..++|.+|||+=.+-. .....|..+.... T Consensus 858 e~~~IL~~at~~SLVll---DElGrGTst~dG~aIA~avle~L~~~~~~~~lfaTHy~~L~~~~---~~~~~v~~~~m~~ 931 (1022) T 2o8b_B 858 ETASILMHATAHSLVLV---DELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDY---SQNVAVRLGHMAC 931 (1022) T ss_dssp HHHHHHHHCCTTCEEEE---ECTTTTSCHHHHHHHHHHHHHHHHHTSCCEEEEECCCHHHHHHT---SSCSSEEEEEEEE T ss_pred HHHHHHHHCCCCCEEEE---ECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH---HCCCCEEEEEEEE T ss_conf 99999985899848998---23889988578999999999999863698599968838999866---4084406635788 Q ss_pred C Q ss_conf 6 Q gi|254780291|r 432 N 432 (475) Q Consensus 432 v 432 (475) . T Consensus 932 ~ 932 (1022) T 2o8b_B 932 M 932 (1022) T ss_dssp C T ss_pred E T ss_conf 8 No 23 >3fk8_A Disulphide isomerase; APC61824.1, structural genomics, PSI-2, protein structure initiative, midwest center for structural genomics; 1.30A {Xylella fastidiosa TEMECULA1} Probab=32.54 E-value=19 Score=14.81 Aligned_cols=30 Identities=10% Similarity=0.113 Sum_probs=22.8 Q ss_pred CCHHHHHHHHHHHCCCCCCEEEEECCC--CCC Q ss_conf 557788788888403479579995004--772 Q gi|254780291|r 352 SEISQWIRDDVFQAQREGKYIILFADD--IDR 381 (475) Q Consensus 352 ~~iskWL~~DL~~A~~~gK~IILN~HD--~~~ 381 (475) .+...-+++-|..|+.+||+++|+|.- |+. T Consensus 12 ~~~~~d~~~~l~~a~~~~K~vlv~F~a~WC~~ 43 (133) T 3fk8_A 12 ADAWTQVKKALAAGKRTHKPTLLVFGANWCTD 43 (133) T ss_dssp CCHHHHHHHHHHHHHHHTCCEEEEEECTTCHH T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEEECCCCHH T ss_conf 77288899999999985992999994691930 No 24 >2kzw_A Uncharacterized protein; structural genomics, northeast structural genomics consortiu PSI-2, protein structure initiative; NMR {Methanosarcina mazei} Probab=29.70 E-value=16 Score=15.25 Aligned_cols=14 Identities=14% Similarity=0.316 Sum_probs=11.0 Q ss_pred EEEEEEEEECCCEE Q ss_conf 75443343027335 Q gi|254780291|r 296 SGSQSYSWNIDNVH 309 (475) Q Consensus 296 ~GSLSYSWd~gdvH 309 (475) .+..+|.|||||-. T Consensus 86 ~~~~~~~WdFgdG~ 99 (145) T 2kzw_A 86 ENATSRLWMFGDGN 99 (145) T ss_dssp BSCSEEEECCSSSS T ss_pred CCCCEEEEEECCCC T ss_conf 99758899909998 No 25 >3jqu_A Collagenase; PKD, beta barrel, cell adhesion; 1.40A {Clostridium histolyticum} PDB: 3js7_A Probab=26.71 E-value=25 Score=13.94 Aligned_cols=12 Identities=25% Similarity=0.623 Sum_probs=9.2 Q ss_pred EEEEEECCCEEE Q ss_conf 433430273355 Q gi|254780291|r 299 QSYSWNIDNVHF 310 (475) Q Consensus 299 LSYSWd~gdvHf 310 (475) ++|.|++||--. T Consensus 35 ~~y~W~fgdG~~ 46 (87) T 3jqu_A 35 VSYDWDFGDGAT 46 (87) T ss_dssp EEEEEECSSSCE T ss_pred EEEEEEECCCCE T ss_conf 999999499988 No 26 >1qwz_A NPQTN specific sortase B; beta barrel, transpeptidase, hydrolase; 1.75A {Staphylococcus aureus} SCOP: b.100.1.1 PDB: 1qxa_A 1ng5_A 1qx6_A* Probab=26.46 E-value=18 Score=14.99 Aligned_cols=18 Identities=28% Similarity=0.731 Sum_probs=11.9 Q ss_pred CCCCCEEECCCCCCCCCC Q ss_conf 655751302455444788 Q gi|254780291|r 216 SLKFPFFRGLGSQEYIGN 233 (475) Q Consensus 216 nl~iPv~lGLGNHDYqNN 233 (475) ++..|++.|-.||||-|- T Consensus 72 slnypvlqgktnhdylnl 89 (235) T 1qwz_A 72 SLNYPVLQGKTNHDYLNL 89 (235) T ss_dssp SCEEEEECCSSSSTTTSB T ss_pred CCCCHHHCCCCCCCCCCC T ss_conf 347553247676540436 No 27 >1egz_A Endoglucanase Z, EGZ, CEL5; glycosyl hydrolase, CLAN GH-A, family 5-2, cellulase; 2.30A {Erwinia chrysanthemi} SCOP: c.1.8.3 Probab=23.89 E-value=29 Score=13.50 Aligned_cols=51 Identities=16% Similarity=0.141 Sum_probs=29.8 Q ss_pred HHHHHHHHHCCCCCCEEEEECCC---CCC--CHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 88788888403479579995004---772--0038999999999985896179998 Q gi|254780291|r 356 QWIRDDVFQAQREGKYIILFADD---IDR--FSSIDQKRMFEKFLTQSKISTIFTT 406 (475) Q Consensus 356 kWL~~DL~~A~~~gK~IILN~HD---~~~--~sSi~~KrmFks~itk~nV~AIF~a 406 (475) +++++.+..+...|+|+++.=-- ++. .........|-++++++++.-+|-+ T Consensus 200 ~~~~~~~~~~~~~~~Pv~~gEfG~~~~~~~~~~~~~~~~~~~~~~~~~~igw~~W~ 255 (291) T 1egz_A 200 ESLRNKARQALNNGIALFVTEWGTVNADGNGGVNQTETDAWVTFMRDNNISNANWA 255 (291) T ss_dssp HHHHHHHHHHHHTTCCEEEEEEESSCTTSCSCCCHHHHHHHHHHHHHTTCCEEEEE T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEE T ss_conf 78999999998759985863027867788876489999999999998599789997 No 28 >2o8b_A DNA mismatch repair protein MSH2; DNA damage response, somatic hypermutation, protein-DNA complex, DNA mispair, cancer, ABC transporter ATPase; HET: DNA ADP; 2.75A {Homo sapiens} PDB: 2o8c_A* 2o8d_A* 2o8f_A* 2o8e_A* Probab=23.76 E-value=30 Score=13.48 Aligned_cols=54 Identities=20% Similarity=0.140 Sum_probs=33.6 Q ss_pred HHHHHHHHCCCCCCEEEEECCC-CCCCHHHH----HHHHHHHHHHHCCCCEEEEEEEECCCC Q ss_conf 8788888403479579995004-77200389----999999999858961799986423676 Q gi|254780291|r 357 WIRDDVFQAQREGKYIILFADD-IDRFSSID----QKRMFEKFLTQSKISTIFTTRFTSSPE 413 (475) Q Consensus 357 WL~~DL~~A~~~gK~IILN~HD-~~~~sSi~----~KrmFks~itk~nV~AIF~aH~Hqshe 413 (475) =+..=|..|..+++++| +- +-+.++.| +..+.+.+.++.+..++|+||||+=.+ T Consensus 731 e~~~il~~at~~SLvll---DElgrGT~~~dG~aia~aile~l~~~~~~~~lfaTH~~eL~~ 789 (934) T 2o8b_A 731 ETASILRSATKDSLIII---DELGRGTSTYDGFGLAWAISEYIATKIGAFCMFATHFHELTA 789 (934) T ss_dssp HHHHHHTTCCTTSEEEE---ESCCCSSCHHHHHHHHHHHHHHHHHTTCCEEEEECCCTTSTT T ss_pred HHHHHHHHCCCCEEEEE---ECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHH T ss_conf 99999983899839998---218899985678999999999998647987999885288988 No 29 >2w70_A Biotin carboxylase; ligase, inhibitor, ATP-binding, fatty acid biosynthesis, nucleotide-binding, lipid synthesis, ATP-grAsp domain; HET: L22; 1.77A {Escherichia coli} PDB: 1bnc_A 2j9g_A* 2v58_A* 2v59_A* 2v5a_A* 2vr1_A* 2w6m_A* 1dv1_A* 2w6o_A* 2w6n_A* 2w6q_A* 2w6z_A* 2w6p_A* 2w71_A* 3jzf_A* 3jzi_A* 1dv2_A* 2gps_A 2gpw_A 3g8c_A* ... Probab=22.33 E-value=32 Score=13.30 Aligned_cols=65 Identities=9% Similarity=0.177 Sum_probs=26.5 Q ss_pred CCCCCCCEEEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHH-CCCCCEEE Q ss_conf 0220155155513310046314688999886116567897605100121267788888672-65575130 Q gi|254780291|r 155 LNCHHKGIAVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSSTTKELKRFYNIY-SLKFPFFR 223 (475) Q Consensus 155 ~~~~~~~~~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~q~qL~eFr~Vw-nl~iPv~l 223 (475) ..|...|+..|.-++--++. +-+-+.+-.-.+.+-.-++-| +++.-.+..++.+..- .+++|+.+ T Consensus 93 ~~~~~~Gi~fIGPs~~~i~~--~gDK~~ar~la~~~gvp~ip~--~~~~~~~~~~ea~~~a~~iGyPViI 158 (449) T 2w70_A 93 EQVERSGFIFIGPKAETIRL--MGDKVSAIAAMKKAGVPCVPG--SDGPLGDDMDKNRAIAKRIGYPVII 158 (449) T ss_dssp HHHHHTTCEESSSCHHHHHH--HHSHHHHHHHHHHHTCCBCSB--CSSCCCSCHHHHHHHHHHHCSSEEE T ss_pred HHHHHCCCEEECCCHHHHHH--HCCHHHHHHHHHHCCCCCCCC--CCCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 88998899288889999987--409899999999859996898--7666688599999999866996688 No 30 >3mx1_A ECO29KIR; type II restriction endonuclease, GIY-YIG endonuclease, HYDR; 2.30A {Escherichia coli} PDB: 3mx4_A* 3nic_A* Probab=22.17 E-value=26 Score=13.89 Aligned_cols=53 Identities=15% Similarity=0.241 Sum_probs=32.9 Q ss_pred HHHHHHHHCCC-------EEEEEEECC---CCCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCCCC Q ss_conf 89998861165-------678976051---001212677888886726557513024554447888 Q gi|254780291|r 179 EAINSLKSSKN-------IILGILTGD---MTQSSTTKELKRFYNIYSLKFPFFRGLGSQEYIGNR 234 (475) Q Consensus 179 esinsl~~~~~-------~~~gIINGD---LTeFG~q~qL~eFr~Vwnl~iPv~lGLGNHDYqNN~ 234 (475) |--.|+..+.+ .+|-|+.+. +---+.+.--..|+-+||- ..-|.||||=...+ T Consensus 128 eH~rSI~~~~nLd~~DF~cR~lV~~~~~s~wIpl~Es~LIr~~~P~WN~---~iDGFGnHDPG~gR 190 (235) T 3mx1_A 128 EHGRNIAKTSNLDLCDFSCRFVIFEATGSDMISTVQAALIKIYKPLWNT---VVDGFGNHTPGAGR 190 (235) T ss_dssp HHHHHHHTCSSCCGGGEEEEEEECCSGGGGGHHHHHHHHHHHHCCHHHH---TSCCTTCCCCCSSC T ss_pred HHHHHHHCCCCCCHHHEEEEEEEEECCCCCHHHHHHHHHHHHCCCCHHC---CCCCCCCCCCCCCC T ss_conf 9998774025998667489999982575413568999999860730330---45555578998665 No 31 >1sen_A Thioredoxin-like protein P19; endoplasmic reticulum, RP19, structural genomics, PSI, protein structure initiative; 1.20A {Homo sapiens} SCOP: c.47.1.1 PDB: 2k8v_A Probab=21.97 E-value=27 Score=13.77 Aligned_cols=21 Identities=14% Similarity=0.202 Sum_probs=19.1 Q ss_pred HHHHHHHHCCCCCCEEEEECCC Q ss_conf 8788888403479579995004 Q gi|254780291|r 357 WIRDDVFQAQREGKYIILFADD 378 (475) Q Consensus 357 WL~~DL~~A~~~gK~IILN~HD 378 (475) | +.-|..|..+|||++++|.- T Consensus 35 ~-~eal~~Ak~~~Kpvlv~F~a 55 (164) T 1sen_A 35 L-EDGKKEAAASGLPLMVIIHK 55 (164) T ss_dssp H-HHHHHHHHHHTCCEEEEEEC T ss_pred H-HHHHHHHHHCCCCEEEEECC T ss_conf 9-99999999819979999888 No 32 >1w9c_A CRM1 protein, exportin 1; nuclear protein, nuclear export complex; 2.3A {Homo sapiens} SCOP: a.118.1.19 Probab=21.64 E-value=16 Score=15.19 Aligned_cols=17 Identities=12% Similarity=0.170 Sum_probs=10.2 Q ss_pred ECCCCHHHHHHHHHHHC Q ss_conf 05655778878888840 Q gi|254780291|r 349 SNGSEISQWIRDDVFQA 365 (475) Q Consensus 349 n~~~~iskWL~~DL~~A 365 (475) ++...+.+|+.+-|.+| T Consensus 263 ~n~~~l~e~l~~lL~~a 279 (321) T 1w9c_A 263 NNQIFLQEYVANLLKSA 279 (321) T ss_dssp CHHHHHHHHHHHHHHHH T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 55999999999999876 No 33 >1b4r_A Protein (PKD1_human); PKD domain 1 from human polycystein-1, polycystin (precursor), membrane protein; NMR {Homo sapiens} SCOP: b.1.3.1 Probab=21.59 E-value=26 Score=13.90 Aligned_cols=12 Identities=8% Similarity=0.357 Sum_probs=8.9 Q ss_pred EEEEEEEECCCE Q ss_conf 544334302733 Q gi|254780291|r 297 GSQSYSWNIDNV 308 (475) Q Consensus 297 GSLSYSWd~gdv 308 (475) +..+|+|||||- T Consensus 25 ~~~s~~WdFgDG 36 (80) T 1b4r_A 25 PVTATRWDFGDG 36 (80) T ss_dssp SCSEEEEECCSS T ss_pred CCEEEEEEECCC T ss_conf 950899992899 No 34 >1t71_A Phosphatase, conserved; crystal, X-RAY crystallography, structural genomics, berkeley structural genomics center, BSGC, PSI; 2.10A {Mycoplasma pneumoniae M129} SCOP: d.159.1.9 Probab=21.44 E-value=33 Score=13.19 Aligned_cols=70 Identities=17% Similarity=0.257 Sum_probs=53.9 Q ss_pred EEEECCCEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 55513310046314688999886116567897605100121267788888672655751302455444788 Q gi|254780291|r 163 AVIADPWYKADTPMFVEAINSLKSSKNIILGILTGDMTQSSTTKELKRFYNIYSLKFPFFRGLGSQEYIGN 233 (475) Q Consensus 163 ~~~~~pw~k~~~~~~vesinsl~~~~~~~~gIINGDLTeFG~q~qL~eFr~Vwnl~iPv~lGLGNHDYqNN 233 (475) ..|.|--=+.-..+..+.+-.|++.-++-|-|+||.=+.-|.-=..+-+++..+.++-++- +|||-+.+- T Consensus 8 LfiGDIvG~~Gr~~v~~~Lp~l~~~~~iDfvIaNgENaa~G~Git~~~~~~l~~~GvDviT-~GNH~wd~k 77 (281) T 1t71_A 8 IFLGDVYGKAGRNIIKNNLAQLKSKYQADLVIVNAENTTHGKGLSLKHYEFLKEAGVNYIT-MGNHTWFQK 77 (281) T ss_dssp EEECEEBHHHHHHHHHTTHHHHHHHHTCSEEEEECTBTTTTSSCCHHHHHHHHHHTCCEEE-CCTTTTCCG T ss_pred EEEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHCCCEEE-CCCCCCCCH T ss_conf 9998068888999999980999998289999989853678969899999999971998992-472102563 No 35 >1jni_A NAPB;, diheme cytochrome C NAPB; dihaem cytochrome C, proteolytic fragment, nitrate reductase subunit, oxidoreductase; HET: HEM; 1.25A {Haemophilus influenzae} SCOP: a.138.1.3 Probab=21.39 E-value=10 Score=16.48 Aligned_cols=26 Identities=35% Similarity=0.715 Sum_probs=22.0 Q ss_pred HCCCC-CCCEEEEEEHHHCCCCCCCEE Q ss_conf 23333-686089980542022015515 Q gi|254780291|r 138 LLPHK-QNMDIVVDVNKILNCHHKGIA 163 (475) Q Consensus 138 ~~~~~-~~~~~~~~~~~~~~~~~~~~~ 163 (475) |.||+ .+|.|-.+.|+-|.||-...| T Consensus 40 lIPH~i~gy~It~~~N~Cl~CH~~~~a 66 (123) T 1jni_A 40 MVPHSVANYQVTKNVNQCLNCHSPENS 66 (123) T ss_dssp CCSSCCTTCCEETTEETTHHHHSTTTH T ss_pred CCCCCCCCCEEECCCCCCCCCCCHHHH T ss_conf 687335676552688837458687457 No 36 >1ogy_B Diheme cytochrome C NAPB molecule: nitrate reductase; oxidoreductase; HET: MGD HEC; 3.2A {Rhodobacter sphaeroides} SCOP: a.138.1.3 Probab=20.45 E-value=13 Score=15.81 Aligned_cols=26 Identities=27% Similarity=0.758 Sum_probs=20.7 Q ss_pred HCCCC-CCCEEEEEEHHHCCCCCCCEE Q ss_conf 23333-686089980542022015515 Q gi|254780291|r 138 LLPHK-QNMDIVVDVNKILNCHHKGIA 163 (475) Q Consensus 138 ~~~~~-~~~~~~~~~~~~~~~~~~~~~ 163 (475) +.||+ ..|.|-.+.|+-|.||-...| T Consensus 41 lIPH~iegy~It~~~N~Cl~CH~~~~a 67 (130) T 1ogy_B 41 VIPHSIEGYQLSVNANRCLECHRRQYS 67 (130) T ss_dssp CBCSCCTTCCBSSSCBGGGGTSCCCCT T ss_pred CCCCCCCCCEEECCCCCCCCCCCHHHH T ss_conf 687247785550789826658796346 Done!