Query gi|254780837|ref|YP_003065250.1| putative restriction endonuclease S subunit [Candidatus Liberibacter asiaticus str. psy62] Match_columns 426 No_of_seqs 161 out of 3207 Neff 9.1 Searched_HMMs 39220 Date Mon May 30 02:15:45 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780837.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09737 EcoKI restriction-mod 100.0 0 0 480.6 33.4 398 18-424 3-422 (459) 2 COG0732 HsdS Restriction endon 100.0 3.1E-37 7.9E-42 250.3 26.7 378 25-418 1-391 (391) 3 PRK09737 EcoKI restriction-mod 100.0 2.4E-34 6.2E-39 232.3 14.2 200 16-216 211-422 (459) 4 pfam01420 Methylase_S Type I r 100.0 8.7E-28 2.2E-32 191.5 12.8 160 22-197 1-167 (167) 5 pfam01420 Methylase_S Type I r 99.9 3.7E-21 9.3E-26 150.2 12.1 160 229-405 1-167 (167) 6 COG0732 HsdS Restriction endon 99.7 3.2E-16 8.2E-21 119.5 11.7 188 20-208 193-389 (391) 7 PRK10642 proline/glycine betai 79.2 4.1 0.0001 19.2 4.8 48 366-413 438-485 (490) 8 pfam08549 SWI-SNF_Ssr4 Fungal 70.1 6.8 0.00017 17.8 3.8 37 376-414 367-403 (669) 9 TIGR02777 LigD_PE_dom DNA liga 53.0 2.8 7.1E-05 20.2 -0.7 14 202-215 53-68 (163) 10 pfam04683 Proteasom_Rpn13 Prot 47.9 17 0.00044 15.3 3.9 75 41-132 16-90 (263) 11 pfam12548 DUF3740 Sulfatase pr 45.2 19 0.00048 15.1 3.9 22 389-410 110-131 (144) 12 KOG0018 consensus 42.3 21 0.00053 14.8 6.1 13 358-370 695-707 (1141) 13 pfam07674 consensus 37.9 22 0.00055 14.7 1.9 14 357-370 38-51 (70) 14 pfam08287 DASH_Spc19 Spc19. Sp 37.2 25 0.00064 14.3 6.2 68 136-204 19-89 (150) 15 KOG0353 consensus 35.8 17 0.00043 15.4 1.1 30 84-115 111-140 (695) 16 pfam08946 Osmo_CC Osmosensory 33.4 29 0.00073 13.9 5.3 32 382-413 10-41 (46) 17 PRK04325 hypothetical protein; 32.2 30 0.00077 13.8 6.2 35 173-207 20-54 (74) 18 pfam08810 KapB Kinase associat 29.8 33 0.00084 13.5 4.9 53 354-409 56-110 (113) 19 pfam11802 CENP-K Centromere-as 28.1 35 0.0009 13.4 6.6 10 330-339 244-253 (273) 20 TIGR02424 TF_pcaQ pca operon t 25.0 40 0.001 13.0 2.6 32 353-386 181-212 (300) 21 pfam10398 DUF2443 Protein of u 24.5 41 0.001 13.0 4.3 25 387-411 53-77 (79) 22 pfam12005 DUF3499 Protein of u 23.2 42 0.0011 12.9 1.3 11 228-238 54-64 (125) No 1 >PRK09737 EcoKI restriction-modification system protein HsdS; Provisional Probab=100.00 E-value=0 Score=480.61 Aligned_cols=398 Identities=17% Similarity=0.248 Sum_probs=312.2 Q ss_pred CCCCCCCCEEEECCCEEEEEECCCCCCC--------CCEEEEEEEEEEECCCEEEECCCCCCCCCCCCEEEECCCCEEEE Q ss_conf 3706898869884123998506678887--------75368870000206412331122223466553378428978999 Q gi|254780837|r 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG--------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 (426) Q Consensus 18 ~g~iP~~We~~kL~~i~~i~~g~~~~~~--------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~gDil~s 89 (426) -||||+||+|++||+||++.+|++++++ .+++|+++.++.++...+...............+.+++||||++ T Consensus 3 ~GelP~gW~~~rLgdI~~~~~G~t~~k~~~~~~~~~g~ip~i~~~di~~g~i~~~~~~~~~~~~~~~~~~~~~~gdiLi~ 82 (459) T PRK09737 3 AGKLPEGWVTTHISEICSVIRGVKPPKGYATKSLTMGDLPFIRTTDITNGAVDLTTLPYCMDAPEDVSKYQLQDRDIVIS 82 (459) T ss_pred CCCCCCCCEEEECHHEEEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEECCCCEEECCCHHHHCCCCCCCCEEEEE T ss_conf 99799997899904617982787079755454457897579973315688465356557045253420044789969998 Q ss_pred EECCCCCEEEEECC----CEEEECCEEEECCCC-CCCHHHHHHHCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHH Q ss_conf 42287523798346----489907438714666-7870222221117788988831568716665266634455544332 Q gi|254780837|r 90 KLGPYLRKAIIADF----DGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 (426) Q Consensus 90 ~~g~~~g~~~i~~~----~~~~~~~~~vl~~~~-~~~~fl~y~l~s~~~~~~~~~~~~Gs~~~~i~~~~l~~~~iplPpl 164 (426) ..|+.| .+++... +.+++..+.++++.. +++.||+|+|.|..++.++...++|+++++|+.+++.++.|||||| T Consensus 83 ~~gtiG-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~fl~~~l~s~~~~~~i~~~~~G~~~~~i~~~~l~~~~iPlPPL 161 (459) T PRK09737 83 SAGSIG-FSFLAQNPLPSNVVFASFAIRLKPVNYFSEYFLKHFLESSDYWNQLSLMSAGNAVQNINAQSLSTLTFPIPPI 161 (459) T ss_pred CCCCCE-EEEEECCCCCCCCEEEEEEEEECCCCCCCHHHHHHHHHCHHHHHHHHHHCCCCCCCCCCHHHHCCCEECCCCH T ss_conf 278860-4899776466650476578970565332399999999677999999853676113415888870546068999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHCCCC Q ss_conf 34577999999999988877654333222236889999999986078810002357500068686656433024310134 Q gi|254780837|r 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 (426) Q Consensus 165 ~eQ~kIv~~Ld~~~~~Id~~I~~~~~~i~~l~e~kqali~~~~tk~L~p~~~~k~s~~e~lG~IP~~W~~~~l~~~~~~~ 244 (426) +||+|||++||.++++||++++..++..+++++++|++++++|+++|+++.++ +++|++|+++++++++... T Consensus 162 ~EQkrIv~~Ld~l~~~id~~~~~~e~~~~llk~~rq~iL~~av~g~l~~~~~~--------~~~p~~W~~~~l~~i~~~~ 233 (459) T PRK09737 162 AEQKIIAEKLDTLLAQVDSTKARLEQIPQILKRFRQAVLGAAVSGKLTEEWRW--------PDLPSTWSVHKYSELVDSR 233 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC--------CCCCCCCEEEEHHHHHCCC T ss_conf 99999999999999999999999999999999999999999864877534567--------8998885798868941243 Q ss_pred CCCCC--CCCCCEEEEECCCCCCCCCCCC-CCCEEC-CCCCCCEEECCCCEEEEEECCC--CCEEEEEEECCCCCEEEEC Q ss_conf 56764--4443215774045532465322-121000-4433202422884899742166--4037999502356607741 Q gi|254780837|r 245 NRKNT--KLIESNILSLSYGNIIQKLETR-NMGLKP-ESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITS 318 (426) Q Consensus 245 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~gdil~~~~~~~--~~~~~~~~~~~~~~~~~~~ 318 (426) ..+.. ....+.++++..+++....... ...+.. ........++.||+|+++.++. .+++++.........+.+. T Consensus 234 ~~g~~~k~~~~~~~~~~~~~ni~~~~i~~~~~~~~~~~~~~~~~~l~~GDiL~~~~g~~~~vG~~ai~~~~~~~~~~~~~ 313 (459) T PRK09737 234 LGGMSSKANNFGSATKYLLGNIDQNNVRWFSFDLENSDIERRELKLKLGDVLFCRGGSPEFVGRCAIWKEPQDIPVIYQK 313 (459) T ss_pred CCCCCCCCCCCCCCCEEEECEEECCEEEECCEEEEECHHHHHHCCCCCCCEEEECCCCHHHCEEEEEECCCCCCCEEECC T ss_conf 68868886447987879756242680554000475033234231477898899887874447379997566665536346 Q ss_pred HHEECCCC-CCCHHHHHHHHHCHHHHHHH-HHHC-CCEEEEECHHHHHCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 11430577-68868899997098999999-8625-840410028898247214898899999999999999999999999 Q gi|254780837|r 319 AYMAVKPH-GIDSTYLAWLMRSYDLCKVF-YAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 (426) Q Consensus 319 ~~~~~~~~-~~~~~yl~~~l~s~~~~~~~-~~~~-~g~~~~i~~~~l~~~~i~lP~leeQ~~I~~~l~~~~~~id~li~~ 395 (426) +++.++++ .+++.||+|+|++..++.+. ...+ +++|++||.+.+++++|||||++||++||++||++++.+|++++. T Consensus 314 ~~~~~r~~~~i~~~fl~~~l~s~~~~~~~~~~~~~G~~~~~in~~~i~~~~iplPPL~EQ~~IV~~ld~l~~~~d~l~~~ 393 (459) T PRK09737 314 ALIRARVKDDIIPEYLVYNLKSPDSRNISLSQLFTGTTQKHITGKALANYPIRVPPLEEQAEIVRRVEQLFAYADTIEKQ 393 (459) T ss_pred EEEEEECCHHHCHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 38997232110699999997199999999998604677777898987038744899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCEEECCCC Q ss_conf 99999999999999778862233614876 Q gi|254780837|r 396 IEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 (426) Q Consensus 396 ~~~~i~~L~~lk~sLl~~a~tGki~V~~~ 424 (426) .+++++.|++||+|||++|||||+-=+|| T Consensus 394 ~~~~~~~l~~l~qSiL~kAF~GeLv~qdp 422 (459) T PRK09737 394 VNNALARVNNLTQSILAKAFRGELTAQWR 422 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99999999999999999996874568897 No 2 >COG0732 HsdS Restriction endonuclease S subunits [Defense mechanisms] Probab=100.00 E-value=3.1e-37 Score=250.31 Aligned_cols=378 Identities=19% Similarity=0.240 Sum_probs=245.0 Q ss_pred CEEEECCCEEEEEECCCCCCCCCEEEEEEEE-EEEC--CCEEEECCCCCCCCCCCCEEEECCCCEEEEEECCCCCEEEEE Q ss_conf 8698841239985066788877536887000-0206--412331122223466553378428978999422875237983 Q gi|254780837|r 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLED-VESG--TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 (426) Q Consensus 25 We~~kL~~i~~i~~g~~~~~~~~~~~~~~~~-i~~~--~~~~~~~~~~~~~~~~~~~~~~~~gDil~s~~g~~~g~~~i~ 101 (426) |++++|++++++..|.++.......+..... +... ........................+....+.... .|.+.+. T Consensus 1 w~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 79 (391) T COG0732 1 WKELKLGDIGEIIRGKTPSTESPLNYGSDIPILTVKDVKGRYSVKGNNGIIGKKLSYLIAGLGPLGELVRAG-LGDVAYV 79 (391) T ss_pred CCEEECCCHHHEEECCCCCCCCCCCCCCCCCEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCCCEEEECCC-CCEEEEE T ss_conf 957874743514616677745424555676401021235531121013322223110014545421441133-2202441 Q ss_pred CCCEEEECCEEEECCCCCCCHHHHHHHCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 46489907438714666787022222111778898883156871666526663445554433234577999999999988 Q gi|254780837|r 102 DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 (426) Q Consensus 102 ~~~~~~~~~~~vl~~~~~~~~fl~y~l~s~~~~~~~~~~~~Gs~~~~i~~~~l~~~~iplPpl~eQ~kIv~~Ld~~~~~I 181 (426) +...++++.+..+++++....++++++............+.|+++++++...+.++.||+||++||++||.+|+. + T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~i~~Ppl~eQ~~I~~iL~~----~ 155 (391) T COG0732 80 NGKLSINQGCLSLRPKKKEPKSLFLYLLLKLELKLLKSGAGGSTIPNLSKSSLKEIPIPLPPLEEQQAIAKILST----L 155 (391) T ss_pred CCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHH----H T ss_conf 355235302688615764378999999976544455520356652223498895178789998999999998766----6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHCCCCCCCC--CCCCCCEEEEE Q ss_conf 8776543332222368899999999860788100023575000686866564330243101345676--44443215774 Q gi|254780837|r 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLIESNILSL 259 (426) Q Consensus 182 d~~I~~~~~~i~~l~e~kqali~~~~tk~L~p~~~~k~s~~e~lG~IP~~W~~~~l~~~~~~~~~~~--~~~~~~~~~~~ 259 (426) |++|+..+++++.+++.++++++++++++......+ .++|++|+..++++++.....+. .+......... T Consensus 156 d~~i~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (391) T COG0732 156 DDLIELLEKLIELLEAIKQKLFKELFTKGKYLLNNI--------KSIPEDWKVIKLKDIVEIISGGTKIKPLEKGVNVSY 227 (391) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHC--------CCCCCCCEECCCHHHEEECCCCCCCCCCCCCCCEEE T ss_conf 677899999888999999999999876332200110--------346656401130033021366754555445555021 Q ss_pred CCCCCCCCCCCCC-----CCEECCCCCCCEEECCCCEEEEEECCCCCEEEEEEECCCCCEEEECHHEECCCCCC-C-HHH Q ss_conf 0455324653221-----21000443320242288489974216640379995023566077411143057768-8-688 Q gi|254780837|r 260 SYGNIIQKLETRN-----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-D-STY 332 (426) Q Consensus 260 ~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~gdil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~y 332 (426) ....+........ ............. .++.++.........+..........++.++++..+++... . ..| T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f 305 (391) T COG0732 228 GKNPVLNGGEPNNGDIVKLSRIDNFIYGLKL--SGGKYITLSSSGATIGIAIINNGTLKFILSQGIGILRIENKVEGEKF 305 (391) T ss_pred CCCCCCCCCCCCCCCCEEECCCCHHHHHHHC--CCCCEEEECCCCCEEEEEEECCCCCEEEECCCEEEEEECCCCCHHHH T ss_conf 1344335653467652231145415554430--45752563144424568861466520353275589851553203788 Q ss_pred HHHHHHCHHHHHHHHHHCC-CEEEEECHHHHHCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999709899999986258-404100288982472148988999999999999999999999999999999999999977 Q gi|254780837|r 333 LAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 (426) Q Consensus 333 l~~~l~s~~~~~~~~~~~~-g~~~~i~~~~l~~~~i~lP~leeQ~~I~~~l~~~~~~id~li~~~~~~i~~L~~lk~sLl 411 (426) ++|+|.+..+........+ +++++|+.+.+.+++||+||++||++|+++++..++.++.+... .++++.++..++.+| T Consensus 306 l~~~l~s~~~~~~~~~~~~gs~~~~l~~~~l~~~~i~iPpl~eQ~~i~~~l~~l~~~i~~~~~~-~~~~~~~~~~~~~~l 384 (391) T COG0732 306 LYYYLSSEFFKKIIKKVSKGSTIPRLNKKLLEEIPIPIPPLEEQQKIAEILSALDKLIDILNRK-PAELEALKKQLDYLL 384 (391) T ss_pred HHHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHH T ss_conf 9999726767899998603786553365326431427999999999999999999999985257-778999999999987 Q ss_pred HHHHCCE Q ss_conf 8862233 Q gi|254780837|r 412 AAAVTGQ 418 (426) Q Consensus 412 ~~a~tGk 418 (426) +++++|+ T Consensus 385 ~~l~~~~ 391 (391) T COG0732 385 NKLLTGK 391 (391) T ss_pred HHHHCCC T ss_conf 7755369 No 3 >PRK09737 EcoKI restriction-modification system protein HsdS; Provisional Probab=100.00 E-value=2.4e-34 Score=232.30 Aligned_cols=200 Identities=19% Similarity=0.292 Sum_probs=160.7 Q ss_pred CC-CCCCCCCCEEEECCCEEEEEECCCCCCCC---CEEEEEEEEEEECCCEEEECCCCCCCCCCCCEEEECCCCEEEEEE Q ss_conf 62-37068988698841239985066788877---536887000020641233112222346655337842897899942 Q gi|254780837|r 16 QW-IGAIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL 91 (426) Q Consensus 16 ~w-~g~iP~~We~~kL~~i~~i~~g~~~~~~~---~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~gDil~s~~ 91 (426) +| .+++|++|+|++|+++++...|+..++.+ ..+++.+.++..+...+....... .......+.++.|||||++. T Consensus 211 ~~~~~~~p~~W~~~~l~~i~~~~~~g~~~k~~~~~~~~~~~~~ni~~~~i~~~~~~~~~-~~~~~~~~~l~~GDiL~~~~ 289 (459) T PRK09737 211 EWRWPDLPSTWSVHKYSELVDSRLGGMSSKANNFGSATKYLLGNIDQNNVRWFSFDLEN-SDIERRELKLKLGDVLFCRG 289 (459) T ss_pred CCCCCCCCCCCEEEEHHHHHCCCCCCCCCCCCCCCCCCEEEECEEECCEEEECCEEEEE-CHHHHHHCCCCCCCEEEECC T ss_conf 45678998885798868941243688688864479878797562426805540004750-33234231477898899887 Q ss_pred CC--CCCEEEEEC----CCEEEECCEEEECCC-CCCCHHHHHHHCHHHHHHH-HHHCCCCCCEEEECHHHHHHHHHHHHH Q ss_conf 28--752379834----648990743871466-6787022222111778898-883156871666526663445554433 Q gi|254780837|r 92 GP--YLRKAIIAD----FDGICSTQFLVLQPK-DVLPELLQGWLLSIDVTQR-IEAICEGATMSHADWKGIGNIPMPIPP 163 (426) Q Consensus 92 g~--~~g~~~i~~----~~~~~~~~~~vl~~~-~~~~~fl~y~l~s~~~~~~-~~~~~~Gs~~~~i~~~~l~~~~iplPp 163 (426) |+ ..|++++.. ...+++++++.+|++ +++++||+|+|+|..++++ +...++|++|++|+.++++++.||||| T Consensus 290 g~~~~vG~~ai~~~~~~~~~~~~~~~~~~r~~~~i~~~fl~~~l~s~~~~~~~~~~~~~G~~~~~in~~~i~~~~iplPP 369 (459) T PRK09737 290 GSPEFVGRCAIWKEPQDIPVIYQKALIRARVKDDIIPEYLVYNLKSPDSRNISLSQLFTGTTQKHITGKALANYPIRVPP 369 (459) T ss_pred CCHHHCEEEEEECCCCCCCEEECCEEEEEECCHHHCHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHCCEECCCC T ss_conf 87444737999756666553634638997232110699999997199999999998604677777898987038744899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 23457799999999998887765433322223688999999998607881000 Q gi|254780837|r 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 (426) Q Consensus 164 l~eQ~kIv~~Ld~~~~~Id~~I~~~~~~i~~l~e~kqali~~~~tk~L~p~~~ 216 (426) |+||++||++||.+++++|.+.+..++.++.|++++||+|++||+|+|.|..| T Consensus 370 L~EQ~~IV~~ld~l~~~~d~l~~~~~~~~~~l~~l~qSiL~kAF~GeLv~qdp 422 (459) T PRK09737 370 LEEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 422 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99999999999999999999999999999999999999999996874568897 No 4 >pfam01420 Methylase_S Type I restriction modification DNA specificity domain. This domain is also known as the target recognition domain (TRD). Restriction-modification (R-M) systems protect a bacterial cell against invasion of foreign DNA by endonucleolytic cleavage of DNA that lacks a site specific modification. The host genome is protected from cleavage by methylation of specific nucleotides in the target sites. In type I systems, both restriction and modification activities are present in one heteromeric enzyme complex composed of one DNA specificity subunit (this family), two modification (M) subunits and two restriction (R) subunits. Probab=99.95 E-value=8.7e-28 Score=191.48 Aligned_cols=160 Identities=21% Similarity=0.334 Sum_probs=131.4 Q ss_pred CCCCEEEECCCEEEEEECCCCCCC-----CCEEEEEEEEEEECCCEEEECCCCCCCCCCCCEEEECCCCEEEEEECCCCC Q ss_conf 898869884123998506678887-----753688700002064123311222234665533784289789994228752 Q gi|254780837|r 22 PKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR 96 (426) Q Consensus 22 P~~We~~kL~~i~~i~~g~~~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~gDil~s~~g~~~g 96 (426) |++|++++||+||++.+|+++++. .+++++++.++..+...+. +..+.+.+..|||++++.|++ | T Consensus 1 P~~We~~~Lg~i~~~~~G~~~~~~~~~~~g~ip~i~~~~~~~~~~~~~---------~~~~~~~~~~~dil~~~~g~~-g 70 (167) T pfam01420 1 PVDWEEFKLGEIFEIKKGKTLSKKELRENGKYPYITAGDLNNGVIGGV---------GYIKKKIFPGNSILISSNGSI-G 70 (167) T ss_pred CCCCEEEEEEEEEEEECCCCCCHHHHCCCCCEEEEEEEEECCCCCCEE---------ECCEEEEECCCEEEEECCCCC-E T ss_conf 979899990529999279989877812379816997402204642100---------011257505981686215774-0 Q ss_pred EEEEECCCEEEECCEEEECCC--CCCCHHHHHHHCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 379834648990743871466--678702222211177889888315687166652666344555443323457799999 Q gi|254780837|r 97 KAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 (426) Q Consensus 97 ~~~i~~~~~~~~~~~~vl~~~--~~~~~fl~y~l~s~~~~~~~~~~~~Gs~~~~i~~~~l~~~~iplPpl~eQ~kIv~~L 174 (426) .+++.+.++++++++++++++ ...+.|++|++.+ +..++...++|+++++|+.++++++.||+||++||++||++| T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~g~~~~~i~~~~i~~~~i~~Ppl~EQ~~I~~~l 148 (167) T pfam01420 71 YVFYRDKPFFANQDVKVLIPKNNELLNKFLYLFLKT--ILKKLKKLKFGSGQPKLNKKRLKELKIPLPPLEEQQKIADIL 148 (167) T ss_pred EEEECCCCEEEECCEEEEEECCCCCCHHHHHHHHHH--HHHHHHHHCCCCCCCCCCHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 478608875775235899731113328899999974--057787534665301466788605734489999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999888776543332222368 Q gi|254780837|r 175 IAETVRIDTLITERIRFIELLKE 197 (426) Q Consensus 175 d~~~~~Id~~I~~~~~~i~~l~e 197 (426) + ++|++|+.++++++.|+| T Consensus 149 ~----~ld~~I~~~~~~i~~Lke 167 (167) T pfam01420 149 E----QLDATIELEKKKLAELEE 167 (167) T ss_pred H----HHHHHHHHHHHHHHHHCC T ss_conf 9----999999999999998478 No 5 >pfam01420 Methylase_S Type I restriction modification DNA specificity domain. This domain is also known as the target recognition domain (TRD). Restriction-modification (R-M) systems protect a bacterial cell against invasion of foreign DNA by endonucleolytic cleavage of DNA that lacks a site specific modification. The host genome is protected from cleavage by methylation of specific nucleotides in the target sites. In type I systems, both restriction and modification activities are present in one heteromeric enzyme complex composed of one DNA specificity subunit (this family), two modification (M) subunits and two restriction (R) subunits. Probab=99.86 E-value=3.7e-21 Score=150.25 Aligned_cols=160 Identities=14% Similarity=0.164 Sum_probs=116.9 Q ss_pred CCCCCCCCHHHHCCCCCCCCC----CCCCCEEEEECCCCCCCCCCCCCCCEECCCCCCCEEECCCCEEEEEECCCCCEEE Q ss_conf 665643302431013456764----4443215774045532465322121000443320242288489974216640379 Q gi|254780837|r 229 PDHWEVKPFFALVTELNRKNT----KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 (426) Q Consensus 229 P~~W~~~~l~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gdil~~~~~~~~~~~~ 304 (426) |++|++++|++++...++... ...++++|+++.+++....... ........++.||++++..+..+..+ T Consensus 1 P~~We~~~Lg~i~~~~~G~~~~~~~~~~~g~ip~i~~~~~~~~~~~~------~~~~~~~~~~~~dil~~~~g~~g~~~- 73 (167) T pfam01420 1 PVDWEEFKLGEIFEIKKGKTLSKKELRENGKYPYITAGDLNNGVIGG------VGYIKKKIFPGNSILISSNGSIGYVF- 73 (167) T ss_pred CCCCEEEEEEEEEEEECCCCCCHHHHCCCCCEEEEEEEEECCCCCCE------EECCEEEEECCCEEEEECCCCCEEEE- T ss_conf 97989999052999927998987781237981699740220464210------00112575059816862157740478- Q ss_pred EEEECCCCCEEEECHHEECCCC--CCCHHHHHHHHHCHHHHHHHHH-HCCCEEEEECHHHHHCCEECCCCHHHHHHHHHH Q ss_conf 9950235660774111430577--6886889999709899999986-258404100288982472148988999999999 Q gi|254780837|r 305 LRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 (426) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~yl~~~l~s~~~~~~~~~-~~~g~~~~i~~~~l~~~~i~lP~leeQ~~I~~~ 381 (426) .....+++++++++++++ .+++.|+||++.+. ..++.. ..++++++|+.++++++.||+||++||++||++ T Consensus 74 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~g~~~~~i~~~~i~~~~i~~Ppl~EQ~~I~~~ 147 (167) T pfam01420 74 ----YRDKPFFANQDVKVLIPKNNELLNKFLYLFLKTI--LKKLKKLKFGSGQPKLNKKRLKELKIPLPPLEEQQKIADI 147 (167) T ss_pred ----ECCCCEEEECCEEEEEECCCCCCHHHHHHHHHHH--HHHHHHHCCCCCCCCCCHHHHHCCCCCCCCHHHHHHHHHH T ss_conf ----6088757752358997311133288999999740--5778753466530146678860573448999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999 Q gi|254780837|r 382 INVETARIDVLVEKIEQSIVLLKE 405 (426) Q Consensus 382 l~~~~~~id~li~~~~~~i~~L~~ 405 (426) |+ .+|.+|+.++++++.|+| T Consensus 148 l~----~ld~~I~~~~~~i~~Lke 167 (167) T pfam01420 148 LE----QLDATIELEKKKLAELEE 167 (167) T ss_pred HH----HHHHHHHHHHHHHHHHCC T ss_conf 99----999999999999998478 No 6 >COG0732 HsdS Restriction endonuclease S subunits [Defense mechanisms] Probab=99.70 E-value=3.2e-16 Score=119.47 Aligned_cols=188 Identities=26% Similarity=0.284 Sum_probs=118.7 Q ss_pred CCCCCCEEEECCCEEEEEECCC--CCCCCC--EEEEEEEEEEECCCEEEECCCCCCCCCCC-CEEEECCCCEEEEEECCC Q ss_conf 0689886988412399850667--888775--36887000020641233112222346655-337842897899942287 Q gi|254780837|r 20 AIPKHWKVVPIKRFTKLNTGRT--SESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPY 94 (426) Q Consensus 20 ~iP~~We~~kL~~i~~i~~g~~--~~~~~~--~~~~~~~~i~~~~~~~~~~~~~~~~~~~~-~~~~~~~gDil~s~~g~~ 94 (426) ++|++|+..++++++++.++.. +..... ..+........+................. .......+.+..+..+.. T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 272 (391) T COG0732 193 SIPEDWKVIKLKDIVEIISGGTKIKPLEKGVNVSYGKNPVLNGGEPNNGDIVKLSRIDNFIYGLKLSGGKYITLSSSGAT 272 (391) T ss_pred CCCCCCEECCCHHHEEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCEEECCCCHHHHHHHCCCCCEEEECCCCCE T ss_conf 46656401130033021366754555445555021134433565346765223114541555443045752563144424 Q ss_pred CCEEEEECC--CEEEECCEEEECCCCC-C-CHHHHHHHCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHH Q ss_conf 523798346--4899074387146667-8-70222221117788988831568716665266634455544332345779 Q gi|254780837|r 95 LRKAIIADF--DGICSTQFLVLQPKDV-L-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 (426) Q Consensus 95 ~g~~~i~~~--~~~~~~~~~vl~~~~~-~-~~fl~y~l~s~~~~~~~~~~~~Gs~~~~i~~~~l~~~~iplPpl~eQ~kI 170 (426) .|.+.+... .+++++++..+++.+. . +.|++|+|.+..++........|+++++++.+.+.++.||+||++||++| T Consensus 273 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fl~~~l~s~~~~~~~~~~~~gs~~~~l~~~~l~~~~i~iPpl~eQ~~i 352 (391) T COG0732 273 IGIAIINNGTLKFILSQGIGILRIENKVEGEKFLYYYLSSEFFKKIIKKVSKGSTIPRLNKKLLEEIPIPIPPLEEQQKI 352 (391) T ss_pred EEEEEECCCCCEEEECCCEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHEECCCCCHHHHHHH T ss_conf 56886146652035327558985155320378899997267678999986037865533653264314279999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999988877654333222236889999999986 Q gi|254780837|r 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 (426) Q Consensus 171 v~~Ld~~~~~Id~~I~~~~~~i~~l~e~kqali~~~~t 208 (426) |++|+....+++.+... .+.++.++..+.++++..++ T Consensus 353 ~~~l~~l~~~i~~~~~~-~~~~~~~~~~~~~~l~~l~~ 389 (391) T COG0732 353 AEILSALDKLIDILNRK-PAELEALKKQLDYLLNKLLT 389 (391) T ss_pred HHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999985257-77899999999998777553 No 7 >PRK10642 proline/glycine betaine transporter; Provisional Probab=79.23 E-value=4.1 Score=19.19 Aligned_cols=48 Identities=17% Similarity=0.304 Sum_probs=38.2 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 214898899999999999999999999999999999999999997788 Q gi|254780837|r 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 (426) Q Consensus 366 ~i~lP~leeQ~~I~~~l~~~~~~id~li~~~~~~i~~L~~lk~sLl~~ 413 (426) +=.-|..+.+.+..+.+++..+.+++.++...++++.|+..|+.|..+ T Consensus 438 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (490) T PRK10642 438 KGATPAASDIQEAKEILVEHYDNIEQKIEDIDQEIAELQAKRTRLVQQ 485 (490) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999998899899999987577789888855898999999876888860 No 8 >pfam08549 SWI-SNF_Ssr4 Fungal domain of unknown function (DUF1750). This is a fungal domain of unknown function. Probab=70.06 E-value=6.8 Score=17.82 Aligned_cols=37 Identities=19% Similarity=0.213 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999977886 Q gi|254780837|r 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 (426) Q Consensus 376 ~~I~~~l~~~~~~id~li~~~~~~i~~L~~lk~sLl~~a 414 (426) .+..+.+....+.|++++..+.+.+++++ |.|||.+| T Consensus 367 ~RV~~kva~~~aEIEkmKa~Hakrm~Kfk--r~s~l~~A 403 (669) T pfam08549 367 DRVAKKVADMTAEIEKLKKRHARRMEKFN--RTSALKDA 403 (669) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--CCHHHHHH T ss_conf 99999888889999999999999999863--00278889 No 9 >TIGR02777 LigD_PE_dom DNA ligase D, 3'-phosphoesterase domain; InterPro: IPR014144 Most sequences in this entry contain a 3'-phosphoesterase domain of a multidomain, multifunctional DNA ligase, LigD, which is involved along with bacterial Ku protein in non-homologous end joining - the less common of two general mechanisms of repairing double-stranded breaks in DNA sequences. LigD is variable in architecture, as it lacks this domain in Bacillus subtilis, is permuted in Mycobacterium tuberculosis, and occasionally is encoded by tandem ORFs rather than a multifunctional protein. In a few species (Dehalococcoides ethenogenes and the archaeal genus Methanosarcina), sequences corresponding to the ligase and polymerase domains of LigD are not found, and the role of this protein is unclear.. Probab=52.99 E-value=2.8 Score=20.22 Aligned_cols=14 Identities=50% Similarity=0.774 Sum_probs=9.2 Q ss_pred HHHHHHHHC--CCCCC Q ss_conf 999998607--88100 Q gi|254780837|r 202 LVSYIVTKG--LNPDV 215 (426) Q Consensus 202 li~~~~tk~--L~p~~ 215 (426) |-++||||| |||.. T Consensus 53 LKSWAVPKGPSl~P~~ 68 (163) T TIGR02777 53 LKSWAVPKGPSLDPKD 68 (163) T ss_pred EEEEEECCCCCCCCCC T ss_conf 5612475854576032 No 10 >pfam04683 Proteasom_Rpn13 Proteasome complex subunit Rpn13 ubiquitin receptor. This family was thought originally to be involved in cell-adhesion, but the members are now known to be proteasome subunit Rpn13, a novel ubiquitin receptor. The 26S proteasome is a huge macromolecular protein-degradation machine consisting of a proteolytically active 20S core, in the form of four disc-like proteins, and one or two 19S regulatory particles. The regulatory particle(s) sit on the top and or bottom of the core, de-ubiquitinate the substrate peptides, unfold them and guide them into the narrow channel through the centre of the core. Rpn13 and its homologues dock onto the regulatory particle through the N-terminal region which binds Rpn2. The C-terminal part of the domain binds de-ubiquitinating enzyme Uch37/UCHL5 and enhances its isopeptidase activity. Rpn13 binds ubiquitin via a conserved amino-terminal region called the pleckstrin-like receptor for ubiquitin, termed Pru, domain. The domain for Probab=47.88 E-value=17 Score=15.32 Aligned_cols=75 Identities=16% Similarity=0.132 Sum_probs=36.3 Q ss_pred CCCCCCCEEEEEEEEEEECCCEEEECCCCCCCCCCCCEEEECCCCEEEEEECCCCCEEEEECCCEEEECCEEEECCCCCC Q ss_conf 78887753688700002064123311222234665533784289789994228752379834648990743871466678 Q gi|254780837|r 41 TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 (426) Q Consensus 41 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~gDil~s~~g~~~g~~~i~~~~~~~~~~~~vl~~~~~~ 120 (426) +|...++..++...+=..-..-+..... .......++.+||.-|.++ +-|-+..+++|+.+... T Consensus 16 ~pd~rkG~l~~~~~~d~l~hf~W~~r~~----~~~e~d~i~~p~d~~f~~v------------~~c~tGRVy~Lkf~~s~ 79 (263) T pfam04683 16 TPDPRKGLVYIYQSDDGLIHFQWKDRTS----GTVEDDLIVFPDDAEFKKV------------KQCKTGRVYVLKFKSSD 79 (263) T ss_pred EECCCCEEEEEEECCCCEEEEEEECCCC----CCCCCEEEECCCCEEEEEC------------CCCCCCCEEEEEECCCC T ss_conf 6669983899998788808989853789----9865217874896489988------------87888818999956998 Q ss_pred CHHHHHHHCHHH Q ss_conf 702222211177 Q gi|254780837|r 121 PELLQGWLLSID 132 (426) Q Consensus 121 ~~fl~y~l~s~~ 132 (426) .+ ++|||+.+. T Consensus 80 ~~-~ffWmQe~~ 90 (263) T pfam04683 80 RR-LFFWMQEPK 90 (263) T ss_pred CC-EEEEECCCC T ss_conf 62-489824898 No 11 >pfam12548 DUF3740 Sulfatase protein. This domain family is found in eukaryotes, and is typically between 144 and 173 amino acids in length. The family is found in association with pfam00884. Probab=45.21 E-value=19 Score=15.07 Aligned_cols=22 Identities=32% Similarity=0.490 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999997 Q gi|254780837|r 389 IDVLVEKIEQSIVLLKERRSSF 410 (426) Q Consensus 389 id~li~~~~~~i~~L~~lk~sL 410 (426) ||.+|+.++.+|+.|++.|.-| T Consensus 110 ID~eI~~Lq~KIk~LkevrghL 131 (144) T pfam12548 110 IDMEIETLKDKIKNLKEVKGHL 131 (144) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999899999887 No 12 >KOG0018 consensus Probab=42.30 E-value=21 Score=14.79 Aligned_cols=13 Identities=15% Similarity=0.100 Sum_probs=5.5 Q ss_pred CHHHHHCCEECCC Q ss_conf 2889824721489 Q gi|254780837|r 358 KFEDVKRLPVLVP 370 (426) Q Consensus 358 ~~~~l~~~~i~lP 370 (426) ...++..++-.++ T Consensus 695 ~~~~~~~~k~~l~ 707 (1141) T KOG0018 695 SKLDLEQLKRSLE 707 (1141) T ss_pred HHHHHHHHHHHHH T ss_conf 8988999998788 No 13 >pfam07674 consensus Probab=37.94 E-value=22 Score=14.67 Aligned_cols=14 Identities=21% Similarity=0.617 Sum_probs=7.0 Q ss_pred ECHHHHHCCEECCC Q ss_conf 02889824721489 Q gi|254780837|r 357 LKFEDVKRLPVLVP 370 (426) Q Consensus 357 i~~~~l~~~~i~lP 370 (426) .+.+.+++..|.|| T Consensus 38 ~~~~rlk~~kI~LP 51 (70) T pfam07674 38 FTLERLKEQKIMLP 51 (70) T ss_pred HHHHHHHCCEEEEE T ss_conf 00677604889713 No 14 >pfam08287 DASH_Spc19 Spc19. Spc19 is a component of the DASH complex. The DASH complex associates with the spindle pole body and is important for spindle and kinetochore integrity during cell division. Probab=37.22 E-value=25 Score=14.30 Aligned_cols=68 Identities=12% Similarity=0.176 Sum_probs=30.1 Q ss_pred HHHHCCCCC-CEEEECHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 888315687-166652666344555--443323457799999999998887765433322223688999999 Q gi|254780837|r 136 RIEAICEGA-TMSHADWKGIGNIPM--PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 (426) Q Consensus 136 ~~~~~~~Gs-~~~~i~~~~l~~~~i--plPpl~eQ~kIv~~Ld~~~~~Id~~I~~~~~~i~~l~e~kqali~ 204 (426) .+.....|+ ..|+|+ +.|.--.+ -+|...-|..=.++.+...-.|+.++.+.++.++.|......+.+ T Consensus 19 Si~~Ld~~~~d~pRL~-kvL~t~R~FeLipE~dl~~Ak~~l~~eI~P~i~~Ll~k~e~~l~kl~r~~~tL~~ 89 (150) T pfam08287 19 SLSTLDSGTEDFPRLK-KVLQTKRVFELIPEYDLEAAQLSLLEEIEPQVQTLLSKLEKSLSKLQREEDTLKA 89 (150) T ss_pred HHHHHHHCCCCCHHHH-HHHCCCCEEEECCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999980264658899-9971352012078888999999889861779999999999999999999999999 No 15 >KOG0353 consensus Probab=35.83 E-value=17 Score=15.38 Aligned_cols=30 Identities=17% Similarity=0.307 Sum_probs=15.3 Q ss_pred CCEEEEEECCCCCEEEEECCCEEEECCEEEEC Q ss_conf 97899942287523798346489907438714 Q gi|254780837|r 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 (426) Q Consensus 84 gDil~s~~g~~~g~~~i~~~~~~~~~~~~vl~ 115 (426) .-+|+-.+| ||+......++.++.++..+. T Consensus 111 d~~lil~tg--ggkslcyqlpal~adg~alvi 140 (695) T KOG0353 111 DAFLILPTG--GGKSLCYQLPALCADGFALVI 140 (695) T ss_pred CEEEEEECC--CCCCHHHHHHHHHCCCCEEEE T ss_conf 469998379--961245223587628745761 No 16 >pfam08946 Osmo_CC Osmosensory transporter coiled coil. The osmosensory transporter coiled coil is a C-terminal domain found in various bacterial osmoprotective transporters, such as ProP, Proline/betaine transporter, Proline permease 2 and the citrate proton symporters. It adopts an antiparallel coiled-coil structure, and is essential for osmosensory and osmoprotectant transporter function. Probab=33.38 E-value=29 Score=13.91 Aligned_cols=32 Identities=19% Similarity=0.424 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999997788 Q gi|254780837|r 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 (426) Q Consensus 382 l~~~~~~id~li~~~~~~i~~L~~lk~sLl~~ 413 (426) |......|++.|+.+..+|..|++-|+.|..+ T Consensus 10 L~e~~dnIEqkiedid~eIaeLe~KR~~Lv~q 41 (46) T pfam08946 10 LQEHHDNIEQKIEDIDEQIAELQKKRQRLVDQ 41 (46) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999879997986999999999999999870 No 17 >PRK04325 hypothetical protein; Provisional Probab=32.18 E-value=30 Score=13.79 Aligned_cols=35 Identities=9% Similarity=0.082 Sum_probs=24.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999998887765433322223688999999998 Q gi|254780837|r 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 (426) Q Consensus 173 ~Ld~~~~~Id~~I~~~~~~i~~l~e~kqali~~~~ 207 (426) |-+..+..++..+..+++.|+.|...-..+.+++- T Consensus 20 fqEDlid~LN~tV~rQQqqId~L~~q~~~L~q~~~ 54 (74) T PRK04325 20 FQEDLIDGLNATVARQQQTLDLLQAQLRLLYQQMR 54 (74) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999999999999999999999999999987 No 18 >pfam08810 KapB Kinase associated protein B. This bacterial protein forms an anti-parallel beta sheet with an extending alpha helical region. Probab=29.79 E-value=33 Score=13.54 Aligned_cols=53 Identities=21% Similarity=0.186 Sum_probs=34.5 Q ss_pred EEEECHHHHHCCEECCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 410028898247214898899--99999999999999999999999999999999999 Q gi|254780837|r 354 RQSLKFEDVKRLPVLVPPIKE--QFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 (426) Q Consensus 354 ~~~i~~~~l~~~~i~lP~lee--Q~~I~~~l~~~~~~id~li~~~~~~i~~L~~lk~s 409 (426) +.++....++.+..-+|+.++ |+.+....+.+. +.--+-.++.++.|++|++- T Consensus 56 k~~ip~~~vk~y~geip~Y~~SLq~A~~~~~~~L~---~~~s~~a~~sL~~L~~Lk~e 110 (113) T pfam08810 56 KRNVPPSQVKPYEGEIPDYEESLQQAITKLEEKLK---EDNTEFAEKSLENLQQLKKD 110 (113) T ss_pred HHCCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHH---HCCCHHHHHHHHHHHHHHHH T ss_conf 85398876510678889979999999999999987---20719999999999999987 No 19 >pfam11802 CENP-K Centromere-associated protein K. CENP-K is one of seven new CENP-A-nucleosome distal (CAD) centromere components (the others being CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and CENP-S) that are identified as assembling on the CENP-A nucleosome associated complex, NAC. The CENP-A NAC is essential, as disruption of the complex causes errors of chromosome alignment and segregation that preclude cell survival despite continued centromere-derived mitotic checkpoint signalling. CENP-K is centromere-associated through its interaction with one or more components of the CENP-A NAC. Probab=28.14 E-value=35 Score=13.36 Aligned_cols=10 Identities=30% Similarity=0.634 Sum_probs=5.3 Q ss_pred HHHHHHHHHC Q ss_conf 6889999709 Q gi|254780837|r 330 STYLAWLMRS 339 (426) Q Consensus 330 ~~yl~~~l~s 339 (426) |-|+-.+|+. T Consensus 244 PPYiElLLR~ 253 (273) T pfam11802 244 PPYVELLLRY 253 (273) T ss_pred CHHHHHHHHC T ss_conf 4599999981 No 20 >TIGR02424 TF_pcaQ pca operon transcription factor PcaQ; InterPro: IPR012787 Members of this family are LysR-family transcription factors associated with operons for catabolism of protocatechuate . Members occur only in proteobacteria.; GO: 0003677 DNA binding, 0016563 transcription activator activity, 0019619 protocatechuate catabolic process, 0045941 positive regulation of transcription. Probab=25.00 E-value=40 Score=13.01 Aligned_cols=32 Identities=28% Similarity=0.382 Sum_probs=24.5 Q ss_pred EEEEECHHHHHCCEECCCCHHHHHHHHHHHHHHH Q ss_conf 0410028898247214898899999999999999 Q gi|254780837|r 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 (426) Q Consensus 353 ~~~~i~~~~l~~~~i~lP~leeQ~~I~~~l~~~~ 386 (426) +.|+++...|..++|.+|| ++.-|=.++|.++ T Consensus 181 ~~p~~~~~~L~~yPvl~P~--~g~~IR~~~erll 212 (300) T TIGR02424 181 AAPSLSVASLADYPVLLPP--EGSIIRPLVERLL 212 (300) T ss_pred CCCCCCHHHHHCCCCCCCC--CHHHHHHHHHHHH T ss_conf 7887777887338810486--2226799999999 No 21 >pfam10398 DUF2443 Protein of unknown function (DUF2443). This is a small family of highly conserved proteins from bacteria, in particular Helicobacter species, The structure is a bundle of alpha helices. The function is not known. Probab=24.55 E-value=41 Score=12.95 Aligned_cols=25 Identities=28% Similarity=0.426 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999977 Q gi|254780837|r 387 ARIDVLVEKIEQSIVLLKERRSSFI 411 (426) Q Consensus 387 ~~id~li~~~~~~i~~L~~lk~sLl 411 (426) .++|.-+++..+.|+.|+.+|+.+| T Consensus 53 ~qid~ev~kLK~~IdaLNKiKke~l 77 (79) T pfam10398 53 TQIDEEVEKLKELINALNKIKKELL 77 (79) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9889999999999999999988883 No 22 >pfam12005 DUF3499 Protein of unknown function (DUF3499). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 125 to 163 amino acids in length. Probab=23.19 E-value=42 Score=12.87 Aligned_cols=11 Identities=36% Similarity=0.625 Sum_probs=6.0 Q ss_pred CCCCCCCCCHH Q ss_conf 86656433024 Q gi|254780837|r 228 VPDHWEVKPFF 238 (426) Q Consensus 228 IP~~W~~~~l~ 238 (426) .|.||++.++. T Consensus 54 aP~GW~vvr~~ 64 (125) T pfam12005 54 APLGWEVVRLA 64 (125) T ss_pred CCCCCEEEEEC T ss_conf 88875778604 Done!