Query psy2275
Match_columns 625
No_of_seqs 334 out of 2640
Neff 8.4
Searched_HMMs 46136
Date Fri Aug 16 20:33:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2275.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2275hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG3590 PepO Predicted metallo 100.0 4.5E-50 9.7E-55 415.1 15.2 363 17-562 240-637 (654)
2 KOG3624|consensus 100.0 4.8E-44 1E-48 409.2 17.8 351 31-575 296-687 (687)
3 COG3590 PepO Predicted metallo 100.0 1.9E-37 4E-42 321.6 16.9 262 356-625 170-571 (654)
4 TIGR01762 chlorin-enz chlorina 99.9 1.6E-26 3.4E-31 235.6 13.7 156 189-349 60-246 (288)
5 TIGR02408 ectoine_ThpD ectoine 99.9 2.2E-26 4.7E-31 234.7 12.7 208 114-350 27-248 (277)
6 KOG3624|consensus 99.9 1.3E-25 2.9E-30 257.4 13.9 262 358-624 214-599 (687)
7 KOG3290|consensus 99.9 3.3E-25 7.1E-30 210.7 7.0 152 193-351 105-259 (286)
8 PF05649 Peptidase_M13_N: Pept 99.9 4.2E-23 9.1E-28 221.9 13.2 159 17-175 229-390 (390)
9 PF01431 Peptidase_M13: Peptid 99.9 5.1E-25 1.1E-29 215.6 -4.6 162 408-573 4-205 (206)
10 PF05721 PhyH: Phytanoyl-CoA d 99.9 1.7E-23 3.6E-28 204.6 5.6 199 115-344 4-211 (211)
11 COG5285 Protein involved in bi 99.8 3E-19 6.4E-24 174.5 11.8 129 199-347 98-227 (299)
12 PF01431 Peptidase_M13: Peptid 99.4 3.4E-14 7.4E-19 139.0 1.8 98 526-624 13-122 (206)
13 PF05649 Peptidase_M13_N: Pept 98.9 5.3E-09 1.1E-13 112.4 10.1 134 357-490 162-390 (390)
14 PRK05467 Fe(II)-dependent oxyg 95.6 0.13 2.8E-06 50.7 11.0 75 238-345 93-172 (226)
15 smart00702 P4Hc Prolyl 4-hydro 95.0 0.12 2.7E-06 48.9 8.9 75 238-345 96-173 (178)
16 PF13640 2OG-FeII_Oxy_3: 2OG-F 93.4 0.041 8.9E-07 46.7 1.7 80 238-345 12-95 (100)
17 PF13759 2OG-FeII_Oxy_5: Putat 93.4 0.033 7.1E-07 47.6 1.1 37 309-345 62-98 (101)
18 PF07350 DUF1479: Protein of u 91.6 0.13 2.7E-06 54.8 2.9 83 254-336 235-343 (416)
19 TIGR02466 conserved hypothetic 88.1 0.35 7.7E-06 46.8 2.6 35 311-345 160-194 (201)
20 PLN00052 prolyl 4-hydroxylase; 86.0 3.8 8.3E-05 42.4 8.9 84 239-339 146-241 (310)
21 PF12851 Tet_JBP: Oxygenase do 64.8 14 0.00029 34.9 5.4 26 313-338 128-154 (171)
22 PF08856 DUF1826: Protein of u 64.2 27 0.00058 33.7 7.4 32 312-343 150-191 (196)
23 PHA02813 hypothetical protein; 57.4 17 0.00038 37.8 5.0 65 238-340 107-172 (354)
24 TIGR00568 alkb DNA alkylation 43.0 80 0.0017 29.7 6.7 16 312-327 145-160 (169)
25 COG3128 PiuC Uncharacterized i 42.8 61 0.0013 30.7 5.6 28 312-339 143-170 (229)
26 PHA02869 C4L/C10L-like gene fa 27.5 2E+02 0.0044 30.7 7.2 66 238-340 116-181 (418)
27 PF10835 DUF2573: Protein of u 27.0 1.6E+02 0.0035 23.6 4.9 62 76-142 4-72 (82)
28 PRK09553 tauD taurine dioxygen 24.9 48 0.001 33.8 2.2 37 309-345 230-267 (277)
29 PF10014 2OG-Fe_Oxy_2: 2OG-Fe 24.8 1.4E+02 0.0031 28.6 5.3 23 315-337 155-178 (195)
30 COG2175 TauD Probable taurine 23.7 49 0.0011 33.9 1.9 38 310-347 236-274 (286)
31 KOG1591|consensus 22.9 2.5E+02 0.0054 28.8 6.9 31 239-269 190-227 (289)
32 PF13661 2OG-FeII_Oxy_4: 2OG-F 22.5 1.1E+02 0.0025 23.7 3.5 37 238-275 24-63 (70)
33 COG4340 Uncharacterized protei 20.6 6.4E+02 0.014 24.1 8.2 109 192-339 86-202 (226)
No 1
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=4.5e-50 Score=415.15 Aligned_cols=363 Identities=27% Similarity=0.445 Sum_probs=286.8
Q ss_pred cCCCCccCcccccCCCCCCeEEEcCchhHHHHHHHHhcCChHHHHHHHHHHHHHHhhhhccHHHHHHHHH-hhhhccCcc
Q psy2275 17 GRGLDKPDLSHLSPVKPDEVVNLEAPDFFRSFEPLLKATPKRVLANYLLWQVMSFSSTYLTEEMNNRYFK-FSSTLRGVT 95 (625)
Q Consensus 17 ~~~~~~~~~~~l~~i~~~~~V~v~~~~Yl~~l~~ll~~t~~~~l~nYl~wr~i~~~~~~L~~~~~~~~~~-f~~~l~G~~ 95 (625)
.++-+|.-+-.....-||..|++.+|.|++.+..++.+++...++.|+.|++++..+|||+++++...++ |.++|+|.+
T Consensus 240 ~p~~~w~~~~~~~G~~PD~~iv~~~p~y~~~~~~~~~e~~~~~wKawl~~~~~~~~ap~Lt~dl~~~~f~fy~r~LsG~~ 319 (654)
T COG3590 240 LPGDDWSLLFSALGQLPDKVIVVENPFYLKEFASLLAEENWADWKAWLRWKLIRAAAPYLTEDLVDEHFDFYGRTLSGQP 319 (654)
T ss_pred CCCccHHHHHHHhcCCCCeeEeecCchHHHHHHHHHhcCcHHHHHHHHHHHHHhhcchhhhhHHHHhhhHHHhhhccCCh
Confidence 3445554444444446778999999999999999999999999999999999999999999999999995 679999999
Q ss_pred CCCcchhhHHHHhhh-hhHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHhhcccCCCCChhhHHHHHHHHhhhcccccC
Q psy2275 96 ARKPRWQECVDETKD-LDIAVGALYIRKYFNQDAKANVETMVRLILDETYKYLSTVDWMDPNTRLAAQDKVKAIIPYVAY 174 (625)
Q Consensus 96 ~~~~r~~~C~~~~~~-~~~~lg~lyv~~~f~~~~~~~v~~m~~~Ik~~~~~~l~~~~Wmd~~tk~~a~~Kl~~~~~~ig~ 174 (625)
+..+||+..+.+++. |+.++|.+||+.+|++++++.++++|..++++++.+|++++||.++||++|++||+.+..+|||
T Consensus 320 E~r~rwKr~~~l~~~~~geaiG~~Yv~~~Fpp~aKa~m~~lVa~l~~ayk~Ri~~ldWmtpeTrekAl~KL~~ft~kIGY 399 (654)
T COG3590 320 EARDRWKRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPETREKALEKLNKFTAKIGY 399 (654)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHHHHHHHhhhccccCHHHHHHHHHHHhccccccCC
Confidence 999999999999987 9999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CccccchhhhhhcccccccHHHHHHhhcHhHHHHHHHhcCCcceeeccceeecCCCCCCCcccCCCCCCCCcCCCCCCCc
Q psy2275 175 PQELLGEYLYNKVQELVYDEVFEKYTMHKKLLDYVESFIGPHIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRPAHR 254 (625)
Q Consensus 175 P~~~~~~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~llg~~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~~~~ 254 (625)
|+.-.+-..|.. ..+. ++. ++.+ ..-+
T Consensus 400 P~~wr~y~kL~~----~~~s----l~~-----N~~r----~~~~------------------------------------ 426 (654)
T COG3590 400 PDPWRYYSKLEI----KRDS----LYG-----NVLR----ASAF------------------------------------ 426 (654)
T ss_pred Cchhhhhhhhcc----Cchh----HHH-----HHHH----HHHH------------------------------------
Confidence 963222111110 0000 111 0100 0000
Q ss_pred ceEEEEeeccCcccCCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCcccccccCCcEEEecCCccccCC
Q psy2275 255 IVAAWTAMERIDKDNGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLYMERGDTVFFHPLLIHGSG 334 (625)
Q Consensus 255 ~~t~wi~L~d~~~~nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~~~aGd~l~f~~~~~H~s~ 334 (625)
.|. |....|.+
T Consensus 427 -----------------------~~~-------~~l~K~~k--------------------------------------- 437 (654)
T COG3590 427 -----------------------NHA-------HELSKIGK--------------------------------------- 437 (654)
T ss_pred -----------------------HHh-------hhHHHhCC---------------------------------------
Confidence 000 00011111
Q ss_pred CCCCccccccccccccchhhh-chHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHHhhhcCChhhhcchhhhcc
Q psy2275 335 TNVTKAVLGTSQRSAGRTQRL-ANPVVKAYLEYMVDIAVMMGASRETAQQELTASLQFESRLRNITKSNEEYRKMSKLSN 413 (625)
Q Consensus 335 ~N~s~~~R~~~~~~~~~~~~~-~~~~~~Ay~~~~~~~~~~~g~~~~~a~~~~~~vl~fE~~LA~~~~~~~~~~~~~~~yn 413 (625)
|+.+.+|. .+.+|||||++. -|
T Consensus 438 -------------PVDr~eW~M~pq~VNAYYnp~--------------------------------------------~N 460 (654)
T COG3590 438 -------------PVDRDEWEMPPQTVNAYYNPQ--------------------------------------------KN 460 (654)
T ss_pred -------------CCchhhcCCCHHHhhhhcCCC--------------------------------------------Cc
Confidence 22222333 355899999763 46
Q ss_pred cccHHHHHhhCCCCCCCccCCcccccccchhhHHHHHHHHHhhccchhhHhhcCCCCCCCCHHHHHHHHHHhc-------
Q psy2275 414 LVTISHLEHKYPILGTSFSNDRPSYMNFGSMGWTIGHEITHGFDSTGSRFDKNGTEINWWDPSTREKYKILWL------- 486 (625)
Q Consensus 414 ~~~i~~l~~~~P~l~~~f~~~~p~~l~~g~lG~~ig~e~~~~~d~~~~~f~~~l~~~~W~~~~T~~~a~~k~~------- 486 (625)
.|.+|..++|.|| |+++.+...|||++|.+|||||.|+||+.|+.||..|++++||++++++.|+++-.
T Consensus 461 ~IVFPAAILQ~PF----fd~ea~~a~NYGgIGaVIgHEI~HgFDdqGakfD~~GnL~dWWT~eD~~aF~~~T~~lvaqf~ 536 (654)
T COG3590 461 EIVFPAAILQAPF----FDPEADSAANYGGIGAVIGHEIGHGFDDQGAKFDGDGNLNDWWTDEDAAAFKERTKKLVAQFD 536 (654)
T ss_pred eEeeeHHhcCCCC----CCCCcchhhcccCccceehhhhcccccCCccccCCCCcHHhhcCHHHHHHHHHHHHHHHHHhc
Confidence 7788888899997 99999999999999999999999999999999999999999999999999987620
Q ss_pred ---------CC----------------CCCCCCCCCCCCCCCCcCCChhhHHHHHHhhhhhccCCCccccccccCCCCCC
Q psy2275 487 ---------LH----------------LPWAANRPEEPRLPGLQRFSPRQMFWVSAATSYCGLQRPEDLKDNLLNDEHTP 541 (625)
Q Consensus 487 ---------i~----------------~~~~~~~~~~~~l~gl~~~~~~qlFf~~~a~~~c~~~~~~~~~~~~~~~~~~p 541 (625)
.| .++..++.....-|-++.++..|-||+++|+.|....+|+...+.+..|||+|
T Consensus 537 ~~e~~~~~~~g~~lTvgENIaDlGGl~~Al~Ayk~~~~~~pv~dg~tg~qrfF~~wAqiWR~K~r~e~~~~~l~~DpHsP 616 (654)
T COG3590 537 GYEPEGGKDNGNALTVGENIADLGGLAIALDAYKLSLDPAPVIDGFTGLQRFFLNWAQIWRMKARPEEQQMRLSVDPHSP 616 (654)
T ss_pred CccccccccccceeeeccccccchhHHHHHHHHHHhcCCcccccccchHHHHHHHHHHHHhhccCHHHHHHHHccCCCCC
Confidence 00 00111111111223345688999999999999999999998888889999999
Q ss_pred CceeEeeecccccccccccCC
Q psy2275 542 PEWRAVGSLANSVEFGRDFGT 562 (625)
Q Consensus 542 ~~~r~~g~l~N~~eF~~aFdc 562 (625)
++|||||++.|+.+|.++|||
T Consensus 617 ~~fR~Ng~~~N~~~FyeaFdV 637 (654)
T COG3590 617 AEFRVNGPVRNMDEFYEAFDV 637 (654)
T ss_pred ceeeccCCCCChHHHHHHcCC
Confidence 999999999999999999997
No 2
>KOG3624|consensus
Probab=100.00 E-value=4.8e-44 Score=409.22 Aligned_cols=351 Identities=31% Similarity=0.631 Sum_probs=266.9
Q ss_pred CCCCCeEEEcCchhHHHHHHHHhcCChHHHHHHHHHHHHHHhhhhccHHHHHHHHHhhhhccCccCCCcchhhHHHHhhh
Q psy2275 31 VKPDEVVNLEAPDFFRSFEPLLKATPKRVLANYLLWQVMSFSSTYLTEEMNNRYFKFSSTLRGVTARKPRWQECVDETKD 110 (625)
Q Consensus 31 i~~~~~V~v~~~~Yl~~l~~ll~~t~~~~l~nYl~wr~i~~~~~~L~~~~~~~~~~f~~~l~G~~~~~~r~~~C~~~~~~ 110 (625)
+.....|++.+++|+.+|.++|.+|++++|+|||+|+++..+.+.++...+ ||..|+..+..
T Consensus 296 ~~~~~~~~~~~~~y~~~l~~ii~~t~~~~laNYl~~~~~~~~~~~~~~~~~------------------~~~~Cv~~~~~ 357 (687)
T KOG3624|consen 296 ILPHNEVVVFNPDYLVNLSAIIQSTPKRTLANYLIWRLLDDLLDLLPKPRK------------------RWADCVELVRT 357 (687)
T ss_pred cccCCceEEecHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhcccchhhH------------------HHHHHHHHHHH
Confidence 334447999999999999999999999999999999999988875444321 89999999876
Q ss_pred -hhHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHhhcccCCCCChhhHHHHHHHHhhhcccccCCccccchhhhhhc-c
Q psy2275 111 -LDIAVGALYIRKYFNQDAKANVETMVRLILDETYKYLSTVDWMDPNTRLAAQDKVKAIIPYVAYPQELLGEYLYNKV-Q 188 (625)
Q Consensus 111 -~~~~lg~lyv~~~f~~~~~~~v~~m~~~Ik~~~~~~l~~~~Wmd~~tk~~a~~Kl~~~~~~ig~P~~~~~~~~l~~~-~ 188 (625)
||.++|++|++++|+++.+..+.+|+++||.+|.+.|.+++|||++||+.|++|+++|...||||+...++..+++. .
T Consensus 358 ~~p~a~~~l~~r~~~~~~~~~~~~~m~~~lk~~f~~~l~~~~Wl~~~tr~~a~~K~~~m~~~ig~p~~~~~~~~ld~~y~ 437 (687)
T KOG3624|consen 358 LLPLALGRLYVRNFFDKENKKEVSEMIEDLKKAFEEMLQELDWLDEETRKSAIKKLNAMKKNIGYPDEFLPPGTLDKEYE 437 (687)
T ss_pred hhhhhhHHHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHhCcccCHHHHHHHHHHHHHHHHHcCCHHHhcCchhHHHHHh
Confidence 99999999999999999999999999999999999999999999999999999999999999999988776656553 3
Q ss_pred cccccHHHHHHhhcHhHHHHHHHhcCCcceeeccceeecCCCCCCCcccCCCCCCCCcCCCCCCCcceEEEEeeccCccc
Q psy2275 189 ELVYDEVFEKYTMHKKLLDYVESFIGPHIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRPAHRIVAAWTAMERIDKD 268 (625)
Q Consensus 189 ~~~~~~~~~~~~~~~~l~~~~~~llg~~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~~ 268 (625)
.+...+ .++..+-..+..+....... .. +..
T Consensus 438 ~l~l~~--~~~~~~l~~~~~~~~~~~~~-----~~-~~~----------------------------------------- 468 (687)
T KOG3624|consen 438 NLKLSL--DDYDSNLEILLKLQRRRSEQ-----LE-LRA----------------------------------------- 468 (687)
T ss_pred cccCCh--HHHHHHHHHHHHHHHHHHHH-----HH-Hhc-----------------------------------------
Confidence 322221 11111111110000000000 00 000
Q ss_pred CCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCcccccccCCcEEEecCCccccCCCCCCcccccccccc
Q psy2275 269 NGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLYMERGDTVFFHPLLIHGSGTNVTKAVLGTSQRS 348 (625)
Q Consensus 269 nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~~~~~ 348 (625)
+.+ ...|..
T Consensus 469 -------~~~-----------~~~~~~----------------------------------------------------- 477 (687)
T KOG3624|consen 469 -------PVD-----------PLDWVG----------------------------------------------------- 477 (687)
T ss_pred -------CCC-----------cccccc-----------------------------------------------------
Confidence 000 000100
Q ss_pred ccchhhhchHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHHhhhcCChhhhcchhhhcccccHHHHHhhCCCCC
Q psy2275 349 AGRTQRLANPVVKAYLEYMVDIAVMMGASRETAQQELTASLQFESRLRNITKSNEEYRKMSKLSNLVTISHLEHKYPILG 428 (625)
Q Consensus 349 ~~~~~~~~~~~~~Ay~~~~~~~~~~~g~~~~~a~~~~~~vl~fE~~LA~~~~~~~~~~~~~~~yn~~~i~~l~~~~P~l~ 428 (625)
.+..++|||.+ .-|.++++....+.|+
T Consensus 478 -------~~~~~na~Y~~--------------------------------------------~~N~i~~pa~ilq~P~-- 504 (687)
T KOG3624|consen 478 -------SPAQVNAFYSP--------------------------------------------EKNEIVFPAGLLQPPF-- 504 (687)
T ss_pred -------ccceeeccccC--------------------------------------------CCceEEEehhcccCCC--
Confidence 01123344322 1255566666666776
Q ss_pred CCccCCcccccccchhhHHHHHHHHHhhccchhhHhhcCCCCCCCCHHHHHHHHHHh-----------------------
Q psy2275 429 TSFSNDRPSYMNFGSMGWTIGHEITHGFDSTGSRFDKNGTEINWWDPSTREKYKILW----------------------- 485 (625)
Q Consensus 429 ~~f~~~~p~~l~~g~lG~~ig~e~~~~~d~~~~~f~~~l~~~~W~~~~T~~~a~~k~----------------------- 485 (625)
|+.+.|.|++||++|.+||||++|+||+.|..||+.|+..+||+++|..+|.++.
T Consensus 505 --f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~~D~~Gn~~~ww~~~s~~~f~~~~qC~~~qy~~y~~~~~~ng~~t~~ 582 (687)
T KOG3624|consen 505 --FDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQYDESGNLRDWWDTESESEFQERAQCLVKQYSNYLDPRRLNGSKTLG 582 (687)
T ss_pred --CCcccchhhhhHHHHHHHHHHHhhcccccccccCCCcccccCCCHhHHHHHHHHHHHHHHHHhccCCCcccCCccccc
Confidence 8889999999999999999999999999999999999999999999999888752
Q ss_pred -------cCCCCCCCCC------CCCCC-CCCCcCCChhhHHHHHHhhhhhccCCCccccccccCCCCCCCceeEeeecc
Q psy2275 486 -------LLHLPWAANR------PEEPR-LPGLQRFSPRQMFWVSAATSYCGLQRPEDLKDNLLNDEHTPPEWRAVGSLA 551 (625)
Q Consensus 486 -------~i~~~~~~~~------~~~~~-l~gl~~~~~~qlFf~~~a~~~c~~~~~~~~~~~~~~~~~~p~~~r~~g~l~ 551 (625)
++..+|.+|. ..... +|+++ ++++|+||++||+.||+...|....+-+..++|+|.++||||+++
T Consensus 583 EnIAD~~Gl~~A~~AY~~~~~~~~~~~~~lp~l~-~t~~QLFFl~~Aq~~C~~~~~~~~~~~~~~~~H~~~~~RVng~ls 661 (687)
T KOG3624|consen 583 ENIADNGGLKLAYRAYKKWKLDLSGEPRELPGLD-LTPEQLFFLSYAQFFCSSEDPKKLPESLLTDPHSPESFRVNGVLS 661 (687)
T ss_pred hhhcchhhHHHHHHHHHHHHHhccCCCCCCCCCC-CChhHHHHHHHHHHHhccCChhhhhHHhhcCCCCCcceeEeeecc
Confidence 0111233332 11222 99995 999999999999999999888776666778999999999999999
Q ss_pred cccccccccCC--CCCCCCCCCCCCC
Q psy2275 552 NSVEFGRDFGT--GSRFDKNGTEFNW 575 (625)
Q Consensus 552 N~~eF~~aFdc--Gs~~d~~G~~~~W 575 (625)
|++||+++|+| ||+|+|..+|..|
T Consensus 662 N~~eFa~aF~C~~gS~MnP~~kC~v~ 687 (687)
T KOG3624|consen 662 NSPEFAEAFNCPIGSPMNPEKKCKVW 687 (687)
T ss_pred CcHHHHHhcCCCCCCCCCccccCcCC
Confidence 99999999999 9999999999998
No 3
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=1.9e-37 Score=321.59 Aligned_cols=262 Identities=24% Similarity=0.369 Sum_probs=216.3
Q ss_pred chHHHHHHHHHHHHHHHHcCCChH--HHHHHHHHHHHHHHHHhhhcCChhhhcchhhhcccccHHHHHhhCCCCCC--Cc
Q psy2275 356 ANPVVKAYLEYMVDIAVMMGASRE--TAQQELTASLQFESRLRNITKSNEEYRKMSKLSNLVTISHLEHKYPILGT--SF 431 (625)
Q Consensus 356 ~~~~~~Ay~~~~~~~~~~~g~~~~--~a~~~~~~vl~fE~~LA~~~~~~~~~~~~~~~yn~~~i~~l~~~~P~l~~--~f 431 (625)
+...+.||..++.+++.++|.+++ .+.+.+..|+.||++||+.++...++||....||+.++.+++...|..+| ++
T Consensus 170 ~~e~l~ay~~hv~rml~~~~~~~~~~d~a~~A~~v~a~et~lA~~~~~~~~~rd~~~~Yn~~tf~~l~~~~p~~~w~~~~ 249 (654)
T COG3590 170 HAELLAAYKEHVARMLGLFGFSEEEEDAAKHALRVVALETKLANASWEVVKYRDLYHTYNPATFAELQPELPGDDWSLLF 249 (654)
T ss_pred HHHHHHHHHHHHHHHHHHhCCchhhhhHHHHHHHHHHHHHHHhhhhhHHHHHHhhhcCcChhhHHHhcccCCCccHHHHH
Confidence 456788999999999999998876 68999999999999999999999999999999999999999999888664 22
Q ss_pred c-----C------Ccccccc------------------------------------------------------------
Q psy2275 432 S-----N------DRPSYMN------------------------------------------------------------ 440 (625)
Q Consensus 432 ~-----~------~~p~~l~------------------------------------------------------------ 440 (625)
. | ..|.|+.
T Consensus 250 ~~~G~~PD~~iv~~~p~y~~~~~~~~~e~~~~~wKawl~~~~~~~~ap~Lt~dl~~~~f~fy~r~LsG~~E~r~rwKr~~ 329 (654)
T COG3590 250 SALGQLPDKVIVVENPFYLKEFASLLAEENWADWKAWLRWKLIRAAAPYLTEDLVDEHFDFYGRTLSGQPEARDRWKRAV 329 (654)
T ss_pred HHhcCCCCeeEeecCchHHHHHHHHHhcCcHHHHHHHHHHHHHhhcchhhhhHHHHhhhHHHhhhccCChhhhHHHHHHH
Confidence 1 1 2344432
Q ss_pred ---cchhhHHHHHHHHHh-------------hccchhhHhhcCCCCCCCCHHHHHHHHHHh-----cCCCC--CCCCCC-
Q psy2275 441 ---FGSMGWTIGHEITHG-------------FDSTGSRFDKNGTEINWWDPSTREKYKILW-----LLHLP--WAANRP- 496 (625)
Q Consensus 441 ---~g~lG~~ig~e~~~~-------------~d~~~~~f~~~l~~~~W~~~~T~~~a~~k~-----~i~~~--~~~~~~- 496 (625)
.+.+|.+||..++.. +++++.+|+.++.+++||+++||++|++|+ +|||| |..++.
T Consensus 330 ~l~~~~~geaiG~~Yv~~~Fpp~aKa~m~~lVa~l~~ayk~Ri~~ldWmtpeTrekAl~KL~~ft~kIGYP~~wr~y~kL 409 (654)
T COG3590 330 NLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPETREKALEKLNKFTAKIGYPDPWRYYSKL 409 (654)
T ss_pred HHHHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHHHHHHHhhhccccCHHHHHHHHHHHhccccccCCCchhhhhhhh
Confidence 279999999877765 668899999999999999999999999996 79999 333332
Q ss_pred --CCCCCCCCcCCChhhHH-HHHHhhhhhccCCCccccccccC---------------------------CCCC--CCce
Q psy2275 497 --EEPRLPGLQRFSPRQMF-WVSAATSYCGLQRPEDLKDNLLN---------------------------DEHT--PPEW 544 (625)
Q Consensus 497 --~~~~l~gl~~~~~~qlF-f~~~a~~~c~~~~~~~~~~~~~~---------------------------~~~~--p~~~ 544 (625)
+..+|.+ +-|-+ -+.+++.|.+..+|+++.+|.|+ |++. ..||
T Consensus 410 ~~~~~sl~~-----N~~r~~~~~~~~~l~K~~kPVDr~eW~M~pq~VNAYYnp~~N~IVFPAAILQ~PFfd~ea~~a~NY 484 (654)
T COG3590 410 EIKRDSLYG-----NVLRASAFNHAHELSKIGKPVDRDEWEMPPQTVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANY 484 (654)
T ss_pred ccCchhHHH-----HHHHHHHHHHhhhHHHhCCCCchhhcCCCHHHhhhhcCCCCceEeeeHHhcCCCCCCCCcchhhcc
Confidence 1122211 11222 24567889999999999999863 4443 3599
Q ss_pred eEeeecccccccccccC-CCCCCCCCCCCCCCCChhhHHHHHHHHhhHHHHhcCcccCCCCcccCcc-cc-------hHH
Q psy2275 545 RAVGSLANSVEFGRDFG-TGSRFDKNGTEFNWWDPSTREKYKVKSKCMVDQYGKYVVPEVNGSVNGV-NT-------LVG 615 (625)
Q Consensus 545 r~~g~l~N~~eF~~aFd-cGs~~d~~G~~~~Ww~~~~~~~f~~~~~c~~~qy~~~~~~~~~~~~ng~-~t-------~gg 615 (625)
|+||+|+ +||++|+|| +|++||++||+++|||++|+.+|++|++.+|+||+.|+.. +...||+ +| |||
T Consensus 485 GgIGaVI-gHEI~HgFDdqGakfD~~GnL~dWWT~eD~~aF~~~T~~lvaqf~~~e~~--~~~~~g~~lTvgENIaDlGG 561 (654)
T COG3590 485 GGIGAVI-GHEIGHGFDDQGAKFDGDGNLNDWWTDEDAAAFKERTKKLVAQFDGYEPE--GGKDNGNALTVGENIADLGG 561 (654)
T ss_pred cCcccee-hhhhcccccCCccccCCCCcHHhhcCHHHHHHHHHHHHHHHHHhcCcccc--ccccccceeeeccccccchh
Confidence 9999999 999999999 8999999999999999999999999999999999999976 4555666 88 799
Q ss_pred HHHHHHHHhC
Q psy2275 616 LAIAIAIASS 625 (625)
Q Consensus 616 ~~~a~~a~~~ 625 (625)
|.+||.|||.
T Consensus 562 l~~Al~Ayk~ 571 (654)
T COG3590 562 LAIALDAYKL 571 (654)
T ss_pred HHHHHHHHHH
Confidence 9999999984
No 4
>TIGR01762 chlorin-enz chlorinating enzymes. This model represents a a group of highly homologous enzymes related to dioxygenases which chlorinate amino acid methyl groups. BarB1 and BarB2 are proposed to trichlorinate one of the methyl groups of a leucine residue in the biosynthesis of barbamide in the cyanobacterium Lyngbya majuscula. SyrB2 is proposed to chlorinate the methyl group of threonine in the biosynthesis of syringomycin in Pseudomonas syringae. CmaB is proposed to chlorinate the beta-methyl group of alloisoleucine in the process of ring closure in the biosynthesis of coronamic acid, a component of coronatine also in Pseudomonas syringae.
Probab=99.94 E-value=1.6e-26 Score=235.62 Aligned_cols=156 Identities=20% Similarity=0.299 Sum_probs=117.6
Q ss_pred cccccHHHHHHhhcHhHHHHHHHhcCCcceeeccceeecCCCCCCCcccCCCCCCCCcCCCCC-----------CCcceE
Q psy2275 189 ELVYDEVFEKYTMHKKLLDYVESFIGPHIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRP-----------AHRIVA 257 (625)
Q Consensus 189 ~~~~~~~~~~~~~~~~l~~~~~~llg~~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~-----------~~~~~t 257 (625)
.+..++.|.+++.+|++++++++|+|++++++++.++.|+|+.++ ++||||..||++.+ .+.++|
T Consensus 60 ~~~~~~~~~~l~~~~~l~~~~~~llG~~v~l~~~~~~~K~pg~~~----~~wHQD~~y~~~~~~~~~~~p~~~~~~~~vt 135 (288)
T TIGR01762 60 RHLDDDFLASHICRPEICHRVESILGPNVLCWRTEFFPKYPGDEG----TDWHQADTFANASGKPQLVWPENEEFGGTIT 135 (288)
T ss_pred ecccCHHHHHHhcCHHHHHHHHHHhCCcEEeeeceeeeeCCCCCC----CCCCccCcccccCCcccccccccCCCCCeEE
Confidence 345678899999999999999999999999999999999998653 68999999998753 246899
Q ss_pred EEEeeccCcccCCceeeecCCCCCCCCCcccCC-----CCcc----cCc-cc----ccccc-cC----CCCCCccccccc
Q psy2275 258 AWTAMERIDKDNGCLFVLPRTHRDPGSLLQHEY-----PEWE----GGV-NA----MYHGI-RG----FDSHDKLNLYME 318 (625)
Q Consensus 258 ~wi~L~d~~~~nG~l~vvpGSH~~~~~l~~~~~-----~~w~----~~~-~~----~~~~~-~~----~~~~~~v~~~~~ 318 (625)
+||||+|++.+||||+||||||++ .....+.. ..+. +.. .. .+..+ .+ .++...|+++++
T Consensus 136 ~wiaLdd~t~eNG~L~viPGSH~~-~~~~~~~~~~~~p~~~~~~~~g~~~~~~~~~~~~~l~~d~~~~~~~~~~v~~~lk 214 (288)
T TIGR01762 136 VWTAFTDATIENGCMQFIPGTHNS-MNYDETRRMTFEPDANNSVVKGGVRRGFFGYDYRQLQIDENWKPDEASAVPMQMK 214 (288)
T ss_pred EEEEcccCCcccCCEEEECCCCCC-CCCCcccccccCccccccccccccccccccccchhhcccccCCccccceeeeeeC
Confidence 999999999999999999999995 21111110 0000 000 00 00000 01 123467999999
Q ss_pred CCcEEEecCCccccCCCCCCc-cccccccccc
Q psy2275 319 RGDTVFFHPLLIHGSGTNVTK-AVLGTSQRSA 349 (625)
Q Consensus 319 aGd~l~f~~~~~H~s~~N~s~-~~R~~~~~~~ 349 (625)
|||++|||++|+|||++|+|+ +.|+++-.++
T Consensus 215 aGd~~~f~~~t~HgS~~N~S~~~~R~~~~~ry 246 (288)
T TIGR01762 215 AGQFIIFWSTLMHASYPNSGESQMRMGFASRY 246 (288)
T ss_pred CceEEEECCCceecCCCCCCCCceEEEEEEEE
Confidence 999999999999999999996 4688776554
No 5
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=99.94 E-value=2.2e-26 Score=234.74 Aligned_cols=208 Identities=18% Similarity=0.222 Sum_probs=144.5
Q ss_pred HHHHHHHhhhcChHHHHHHHHHHHHHHHHHHhhcccCCCCChhhHHHHHHHHhhhcccccCCccccchhhhhh-cccccc
Q psy2275 114 AVGALYIRKYFNQDAKANVETMVRLILDETYKYLSTVDWMDPNTRLAAQDKVKAIIPYVAYPQELLGEYLYNK-VQELVY 192 (625)
Q Consensus 114 ~lg~lyv~~~f~~~~~~~v~~m~~~Ik~~~~~~l~~~~Wmd~~tk~~a~~Kl~~~~~~ig~P~~~~~~~~l~~-~~~~~~ 192 (625)
-.||+.++++++++.++.+++-++.+... .++..... + +. .-. ...+.. +..+..
T Consensus 27 ~dGyvvl~~vls~eev~~lr~~i~~~~~~--------~~~~~~~~--~------~~---~~~-----~~~~r~~~~~~~~ 82 (277)
T TIGR02408 27 RDGFLLLENLFSDDEVAALLAEVERMTRD--------PAIVRDEE--A------IT---EPG-----SNAVRSIFEVHVL 82 (277)
T ss_pred HCCEEECcccCCHHHHHHHHHHHHHHHhc--------ccccCCCc--c------ee---cCC-----CCceEEEeccccc
Confidence 37999999999999888777665555431 01110000 0 00 000 001111 122356
Q ss_pred cHHHHHHhhcHhHHHHHHHhcCCcceeeccceeecCCCCCCCcccCCCCCCCCcCCCCC---CCcceEEEEeeccCcccC
Q psy2275 193 DEVFEKYTMHKKLLDYVESFIGPHIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRP---AHRIVAAWTAMERIDKDN 269 (625)
Q Consensus 193 ~~~~~~~~~~~~l~~~~~~llg~~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~---~~~~~t~wi~L~d~~~~n 269 (625)
++.+.+++.+|+|++++++|+|.+++++++.++.||+..|. .++||||..||.... ....+|+||||+|++.+|
T Consensus 83 ~~~~~~l~~~p~l~~~~~~LlG~~~~l~~~~l~~kp~~~g~---~~~WHQD~~~w~~~~~~p~~~~vt~wiaLdD~t~eN 159 (277)
T TIGR02408 83 SPILARLVRDPRVANAARQILGSDVYVHQSRINMKPGFKGT---GFYWHSDFETWHAEDGMPSMRAVSCSIALTDNNETN 159 (277)
T ss_pred CHHHHHHHcChHHHHHHHHHcCCCeEEEeeeeeecCCCCCC---CccCCcCCccccccCCCCCcCeEEEEEEcccCCCCC
Confidence 88999999999999999999999999999988899876654 358999999998653 346899999999999999
Q ss_pred CceeeecCCCCCCCCCcccCC---CCcccCcccccccccC-------CCCCCcccccccCCcEEEecCCccccCCCCCCc
Q psy2275 270 GCLFVLPRTHRDPGSLLQHEY---PEWEGGVNAMYHGIRG-------FDSHDKLNLYMERGDTVFFHPLLIHGSGTNVTK 339 (625)
Q Consensus 270 G~l~vvpGSH~~~~~l~~~~~---~~w~~~~~~~~~~~~~-------~~~~~~v~~~~~aGd~l~f~~~~~H~s~~N~s~ 339 (625)
|||+||||||++ .+..... ..+.......+..+.+ .+..+.++++++|||++|||++|+|||++|+|+
T Consensus 160 G~l~vIPGSH~~--~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~v~~~~~aGDvl~f~~~~~H~S~~N~s~ 237 (277)
T TIGR02408 160 GPLMLVPGSHRT--FISCVGETPRDNYKQSLKKQEYGVPDPVSLTKLADQGGISTFTGKAGSAVWFDCNTMHGSGSNITP 237 (277)
T ss_pred CCEEEecCCCCC--cccCCccccchhhhhhhhhhhcCCCCHHHHHHHHHhCCceeeccCCceEEEEccccccCCCCCCCC
Confidence 999999999995 2211110 0111100000111111 123457889999999999999999999999999
Q ss_pred ccccccccccc
Q psy2275 340 AVLGTSQRSAG 350 (625)
Q Consensus 340 ~~R~~~~~~~~ 350 (625)
++|+++...+.
T Consensus 238 ~~R~~l~l~y~ 248 (277)
T TIGR02408 238 WPRSNVFMVFN 248 (277)
T ss_pred CcceeEEEEEe
Confidence 99999988763
No 6
>KOG3624|consensus
Probab=99.93 E-value=1.3e-25 Score=257.37 Aligned_cols=262 Identities=23% Similarity=0.396 Sum_probs=200.4
Q ss_pred HHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHHhhhcCChhhhcchhhhcccccHHHHHhhCCCCCC--Ccc---
Q psy2275 358 PVVKAYLEYMVDIAVMMGASRETAQQELTASLQFESRLRNITKSNEEYRKMSKLSNLVTISHLEHKYPILGT--SFS--- 432 (625)
Q Consensus 358 ~~~~Ay~~~~~~~~~~~g~~~~~a~~~~~~vl~fE~~LA~~~~~~~~~~~~~~~yn~~~i~~l~~~~P~l~~--~f~--- 432 (625)
..+.+|..+++.++..+|.+++.+++.+..+++||++|++++.+.+.+++...+|+.+++.+++..+|.+++ ++.
T Consensus 214 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 293 (687)
T KOG3624|consen 214 EKLRAYLLLANELLQLLGLDSDEAEEYARLVIELERQLANITKPDENRRELQALYNSVNLAELQKKFPSIDWKQLLRSVL 293 (687)
T ss_pred HHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHhccchhhhhhhHHHHcccccHHHHHHhcccccHHHHhhhhc
Confidence 567889999999999999999999999999999999999999999999999999999999999999997642 111
Q ss_pred -----C-----Ccccccc--------------------------------------------cchhhHHHHHHHHH----
Q psy2275 433 -----N-----DRPSYMN--------------------------------------------FGSMGWTIGHEITH---- 454 (625)
Q Consensus 433 -----~-----~~p~~l~--------------------------------------------~g~lG~~ig~e~~~---- 454 (625)
. ..++|+. ...|+..+++.+++
T Consensus 294 ~~~~~~~~~~~~~~~y~~~l~~ii~~t~~~~laNYl~~~~~~~~~~~~~~~~~~~~~Cv~~~~~~~p~a~~~l~~r~~~~ 373 (687)
T KOG3624|consen 294 GEILPHNEVVVFNPDYLVNLSAIIQSTPKRTLANYLIWRLLDDLLDLLPKPRKRWADCVELVRTLLPLALGRLYVRNFFD 373 (687)
T ss_pred cccccCCceEEecHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhcccchhhHHHHHHHHHHHHhhhhhhHHHHHHHhcC
Confidence 0 1222322 13444444544333
Q ss_pred ---------hhccchhhHhhcCCCCCCCCHHHHHHHHHHh-----cCCCCCCCCCC-C-CCCCCCCcCCChhhHHHH---
Q psy2275 455 ---------GFDSTGSRFDKNGTEINWWDPSTREKYKILW-----LLHLPWAANRP-E-EPRLPGLQRFSPRQMFWV--- 515 (625)
Q Consensus 455 ---------~~d~~~~~f~~~l~~~~W~~~~T~~~a~~k~-----~i~~~~~~~~~-~-~~~l~gl~~~~~~qlFf~--- 515 (625)
.+++++.+|.+.+...+||+++||+.|++|+ .+|+|...... . +..|.++ .++.+.++-.
T Consensus 374 ~~~~~~~~~m~~~lk~~f~~~l~~~~Wl~~~tr~~a~~K~~~m~~~ig~p~~~~~~~~ld~~y~~l-~l~~~~~~~~l~~ 452 (687)
T KOG3624|consen 374 KENKKEVSEMIEDLKKAFEEMLQELDWLDEETRKSAIKKLNAMKKNIGYPDEFLPPGTLDKEYENL-KLSLDDYDSNLEI 452 (687)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhCcccCHHHHHHHHHHHHHHHHHcCCHHHhcCchhHHHHHhcc-cCChHHHHHHHHH
Confidence 3678899999999999999999999999986 68888433222 1 1234444 3444333310
Q ss_pred --HH--hhhhh-ccC---------------------------CCccccccccCCCCCC--CceeEeeecccccccccccC
Q psy2275 516 --SA--ATSYC-GLQ---------------------------RPEDLKDNLLNDEHTP--PEWRAVGSLANSVEFGRDFG 561 (625)
Q Consensus 516 --~~--a~~~c-~~~---------------------------~~~~~~~~~~~~~~~p--~~~r~~g~l~N~~eF~~aFd 561 (625)
.+ .+... ... -|+.+.+..+++.+.| -+|+++|.++ +||++|+||
T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~na~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vi-gHEl~H~FD 531 (687)
T KOG3624|consen 453 LLKLQRRRSEQLELRAPVDPLDWVGSPAQVNAFYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVI-GHELTHGFD 531 (687)
T ss_pred HHHHHHHHHHHHHHhcCCCccccccccceeeccccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHH-HHHHhhccc
Confidence 00 11111 011 1333344444567777 4899999999 999999999
Q ss_pred -CCCCCCCCCCCCCCCChhhHHHHHHHHhhHHHHhcCcccCCCCcccCcccc-------hHHHHHHHHHHh
Q psy2275 562 -TGSRFDKNGTEFNWWDPSTREKYKVKSKCMVDQYGKYVVPEVNGSVNGVNT-------LVGLAIAIAIAS 624 (625)
Q Consensus 562 -cGs~~d~~G~~~~Ww~~~~~~~f~~~~~c~~~qy~~~~~~~~~~~~ng~~t-------~gg~~~a~~a~~ 624 (625)
.|++||+.||.++||+++|.++|.+|+||+++||++|.++ .++||+.| .||+++||+||+
T Consensus 532 ~~G~~~D~~Gn~~~ww~~~s~~~f~~~~qC~~~qy~~y~~~---~~~ng~~t~~EnIAD~~Gl~~A~~AY~ 599 (687)
T KOG3624|consen 532 DQGRQYDESGNLRDWWDTESESEFQERAQCLVKQYSNYLDP---RRLNGSKTLGENIADNGGLKLAYRAYK 599 (687)
T ss_pred ccccccCCCcccccCCCHhHHHHHHHHHHHHHHHHhccCCC---cccCCccccchhhcchhhHHHHHHHHH
Confidence 8999999999999999999999999999999999999987 78999999 599999999998
No 7
>KOG3290|consensus
Probab=99.91 E-value=3.3e-25 Score=210.68 Aligned_cols=152 Identities=45% Similarity=0.810 Sum_probs=131.2
Q ss_pred cHHHH--HHhhcHhHHHHHHHhcCCcceeeccceeecCCCCCCCcccCCCCCCCCcCCCCCCCcceEEEEeeccCcccCC
Q psy2275 193 DEVFE--KYTMHKKLLDYVESFIGPHIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRPAHRIVAAWTAMERIDKDNG 270 (625)
Q Consensus 193 ~~~~~--~~~~~~~l~~~~~~llg~~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~~nG 270 (625)
+|.+. .+...+++++....+.|| .+.+++.+.|+|+.|+.++ .|||..|++++|.+..+.+|+|+++++..||
T Consensus 105 ~p~~~~fty~~k~~~l~~~l~~~~P--vv~qsmyi~K~P~iGg~vt---~HqD~~~~p~~P~d~~~~~W~Ame~~t~~NG 179 (286)
T KOG3290|consen 105 DPDFKLFTYSSKVQVLAVVLGLQGP--VVAQSMYIFKPPDIGGEVT---PHQDLTFFPWDPCDSCVGAWTAMEDITLVNG 179 (286)
T ss_pred CcchhheeeeccccceeeeeccCCc--eeeeEEEEecCCCCCCccC---CcccceecccCchhhhhhhhhhhhhhhhcCC
Confidence 44444 456666677777777774 5555666569999998865 4999999999999999999999999999999
Q ss_pred ceeeecCCCCCCCCCcccCCCCcccCcccccccccCCC-CCCcccccccCCcEEEecCCccccCCCCCCccccccccccc
Q psy2275 271 CLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFD-SHDKLNLYMERGDTVFFHPLLIHGSGTNVTKAVLGTSQRSA 349 (625)
Q Consensus 271 ~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~-~~~~v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~~~~~~ 349 (625)
||+|+||||+ +.+.+|..++|.++++..|..+.+.+ .....+++|++||.++||+.++|||++|+|...|.++-+++
T Consensus 180 CL~~iPGshk--~~l~~~~yp~~sgg~~~~~~~~~~~~~q~~~~~v~mekg~tv~~h~~lvh~S~~NrSe~~R~a~s~h~ 257 (286)
T KOG3290|consen 180 CLQVIPGSHK--GPLLFHRYPDWSGGVNMAYIGIQDYYPQSPRGHVEMEKGDTVFFHPLLVHGSGANRSEGFRKAISCHV 257 (286)
T ss_pred ceEecccccc--CcceeeccCCcccccccceeecccCCccCCccceeecCCcEEEEeeeeeecCCCCcccccccccccce
Confidence 9999999999 89999999999999999888888765 67888999999999999999999999999999999997776
Q ss_pred cc
Q psy2275 350 GR 351 (625)
Q Consensus 350 ~~ 351 (625)
..
T Consensus 258 ~~ 259 (286)
T KOG3290|consen 258 AE 259 (286)
T ss_pred ee
Confidence 44
No 8
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=99.89 E-value=4.2e-23 Score=221.86 Aligned_cols=159 Identities=36% Similarity=0.680 Sum_probs=134.0
Q ss_pred cCCCCccC-cccccC-CCCCCeEEEcCchhHHHHHHHHhcCChHHHHHHHHHHHHHHhhhhccHHHHHHHHHhhhhccCc
Q psy2275 17 GRGLDKPD-LSHLSP-VKPDEVVNLEAPDFFRSFEPLLKATPKRVLANYLLWQVMSFSSTYLTEEMNNRYFKFSSTLRGV 94 (625)
Q Consensus 17 ~~~~~~~~-~~~l~~-i~~~~~V~v~~~~Yl~~l~~ll~~t~~~~l~nYl~wr~i~~~~~~L~~~~~~~~~~f~~~l~G~ 94 (625)
.+.|+|.. |+.+.+ +++++.|+|.+|+||++|.++|+++++++|+||++||++.+++++++.+++.....|...+.|.
T Consensus 229 ~p~i~w~~~l~~~~~~~~~~~~V~v~~~~y~~~l~~ll~~~~~~~l~nYl~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~ 308 (390)
T PF05649_consen 229 YPQIDWDRYLKALFGEVTESDEVIVRSPDYFKKLSELLKQTSKETLANYLGWRLIRELSPYLSSELRDLYFEFYRTLSGV 308 (390)
T ss_dssp STTS-HHHHHHHHCCCCCTTSEEEES-HHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHGGGS-HHHHHHHHHHHTHHH--
T ss_pred HhhhhHHHHHHHHccccCCCceEEEcCcHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHhcCC
Confidence 45566633 344444 6888999999999999999999999999999999999999999999999999999999888999
Q ss_pred cCCCcchhhHHHHhhh-hhHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHhhcccCCCCChhhHHHHHHHHhhhccccc
Q psy2275 95 TARKPRWQECVDETKD-LDIAVGALYIRKYFNQDAKANVETMVRLILDETYKYLSTVDWMDPNTRLAAQDKVKAIIPYVA 173 (625)
Q Consensus 95 ~~~~~r~~~C~~~~~~-~~~~lg~lyv~~~f~~~~~~~v~~m~~~Ik~~~~~~l~~~~Wmd~~tk~~a~~Kl~~~~~~ig 173 (625)
....+||+.|++.++. ||+++|++|++.+++++.++.+.+|+++||++|.++|.+++|||++||+.|++||++|...||
T Consensus 309 ~~~~~~~~~C~~~~~~~l~~~~~~~y~~~~~~~~~~~~v~~~~~~ik~~f~~~i~~~~Wld~~tk~~a~~Kl~~m~~~iG 388 (390)
T PF05649_consen 309 EESRPRWQRCVRHVEKLLPFALGRLYVRRYFDKENKKEVEDMVENIKEAFRERIEESDWLDEETKKEAIEKLNNMKLNIG 388 (390)
T ss_dssp SS---HHHHHHHHHHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHHCT-TTS-HHHHHHHHHHHHH-EEEEE
T ss_pred ccchhhHHHHHHHHHHHHHHHHHhhHhhhcCchhHHHHHHHHHHHHHHHHHHHHhhCCCCCHHHHHHHHHHHHhhhhhcc
Confidence 8888999999999986 999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CC
Q psy2275 174 YP 175 (625)
Q Consensus 174 ~P 175 (625)
||
T Consensus 389 yP 390 (390)
T PF05649_consen 389 YP 390 (390)
T ss_dssp S-
T ss_pred CC
Confidence 98
No 9
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=99.88 E-value=5.1e-25 Score=215.61 Aligned_cols=162 Identities=32% Similarity=0.625 Sum_probs=118.7
Q ss_pred hhhhcccccHHHHHhhCCCCCCCccCCcccccccchhhHHHHHHHHHhhccchhhHhhcCCCCCCCCHHHHHHHHHHh--
Q psy2275 408 MSKLSNLVTISHLEHKYPILGTSFSNDRPSYMNFGSMGWTIGHEITHGFDSTGSRFDKNGTEINWWDPSTREKYKILW-- 485 (625)
Q Consensus 408 ~~~~yn~~~i~~l~~~~P~l~~~f~~~~p~~l~~g~lG~~ig~e~~~~~d~~~~~f~~~l~~~~W~~~~T~~~a~~k~-- 485 (625)
|....|.+.++....+.|+ |+++.|.+++||++|.+|||||+|+||..|..||+.|+..+||++.|++.+.++.
T Consensus 4 Y~~~~N~i~ip~~~l~~P~----f~~~~p~~~~yg~lG~ilahel~hafd~~g~~~D~~g~~~~wWs~~~~~~~~~~~~C 79 (206)
T PF01431_consen 4 YSPRFNSIVIPAGILQPPF----FDPNYPPALNYGGLGFILAHELMHAFDPEGINYDEDGNLRNWWSEESRSNFKERAQC 79 (206)
T ss_dssp EETTTTEEEEEGGGSSTTT------TTS-HHHHHHTHHHHHHHHHHHCTSTTGGGB-TTS-BS--S-HHHHHHHHHHHHH
T ss_pred CCcccCEEEecHHHhCCcc----CCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhhcCcccccccccchhhHHHHHHHHHH
Confidence 4445788999998888887 8999999999999999999999999999999999999999999999999998762
Q ss_pred ------cCCC---------------------------CCCCCC---CCCCCCCCCcCCChhhHHHHHHhhhhhccCCCcc
Q psy2275 486 ------LLHL---------------------------PWAANR---PEEPRLPGLQRFSPRQMFWVSAATSYCGLQRPED 529 (625)
Q Consensus 486 ------~i~~---------------------------~~~~~~---~~~~~l~gl~~~~~~qlFf~~~a~~~c~~~~~~~ 529 (625)
.... +|..+. ..+..+|++++++++|+||++||+.||....+..
T Consensus 80 l~~qy~~~~~~~~~~~~~~~~~~~~~E~iaD~~gl~~a~~ay~~~~~~~~~l~~~~~~t~~QlFF~~~a~~~C~~~~~~~ 159 (206)
T PF01431_consen 80 LRDQYSNYSVPEGNKSVSLNGSDTLNENIADNAGLRLAYRAYRKKSSNEERLPGLEGLTPDQLFFISFAQSFCEKMSPES 159 (206)
T ss_dssp HHHHHHT-BCGGGTTCSB--TTTTHHHHHHHHHHHHHHHHHHHHHHTTT-B-TTCTTB-HHHHHHHHHHHHT-EEE-HHH
T ss_pred HHHHHhcccccccccccccchHHHHHHHHHHHHhHHHHHHHHhhcccccccccccccchhHHHHHHHHHHHHhcCcchhh
Confidence 0111 111111 2345688888899999999999999998776655
Q ss_pred ccccccCCCCCCCceeEeeecccccccccccCC--CCCCCCCCCCC
Q psy2275 530 LKDNLLNDEHTPPEWRAVGSLANSVEFGRDFGT--GSRFDKNGTEF 573 (625)
Q Consensus 530 ~~~~~~~~~~~p~~~r~~g~l~N~~eF~~aFdc--Gs~~d~~G~~~ 573 (625)
.......++|+|..+|||++|+|+++|+++|+| |+.|+|..+|+
T Consensus 160 ~~~~~~~~~h~p~~~RvN~~L~n~~~Fa~aF~C~~gs~Mnp~~kC~ 205 (206)
T PF01431_consen 160 VYRQLQTDPHSPARFRVNGPLSNMPEFAEAFNCPSGSPMNPKKKCR 205 (206)
T ss_dssp HHHHHHH-SS--HHHHHHHHHHT-HHHHHHHT--TTSTTS-SSGG-
T ss_pred hhhhhhcCCCCchHHHHHHHHhcCHHHHHHcCCCCCCCCCCCCCCc
Confidence 544556789999999999999999999999999 99999999886
No 10
>PF05721 PhyH: Phytanoyl-CoA dioxygenase (PhyH); InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=99.88 E-value=1.7e-23 Score=204.56 Aligned_cols=199 Identities=25% Similarity=0.420 Sum_probs=130.0
Q ss_pred HHHHHHhhhcChHHHHHHHHHHHHHHHHHHh-hcccCCCCChhhHHHHHHHHhhhcccccCCccccchhhhhhccccccc
Q psy2275 115 VGALYIRKYFNQDAKANVETMVRLILDETYK-YLSTVDWMDPNTRLAAQDKVKAIIPYVAYPQELLGEYLYNKVQELVYD 193 (625)
Q Consensus 115 lg~lyv~~~f~~~~~~~v~~m~~~Ik~~~~~-~l~~~~Wmd~~tk~~a~~Kl~~~~~~ig~P~~~~~~~~l~~~~~~~~~ 193 (625)
-||++++++++++.++.+++.+.++...... ..... ........ .. ....+..+
T Consensus 4 ~Gyvvi~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~-~~-----~~~~~~~~ 58 (211)
T PF05721_consen 4 DGYVVIRNVLSPEEVERLREELDRLDDRALEPDQDVS-------------------DFFDESFF-GD-----YTEQLAKS 58 (211)
T ss_dssp HSEEEETTSS-HHHHHHHHHHHHHHHHHHTTTTTSCE-------------------EEESTSCC-CT-----CCCCGCCC
T ss_pred CcEEEECCcCCHHHHHHHHHHHHHHHhhhhccccccc-------------------cccccccc-cc-----cccccccc
Confidence 5889999999999888887777776221111 10000 00000000 00 00011246
Q ss_pred HHHHHHhhcH-hHHHHHHHhcCCccee---eccce--eecCCCCCCCcccCCCCCCCCcCCCCCCCcceEEEEeeccCcc
Q psy2275 194 EVFEKYTMHK-KLLDYVESFIGPHIRA---IHSML--INKPPDAGTQTSRHPLHQDLHYFPHRPAHRIVAAWTAMERIDK 267 (625)
Q Consensus 194 ~~~~~~~~~~-~l~~~~~~llg~~~~~---~~~~~--~~k~p~~g~~~~~~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~ 267 (625)
+.+.+.+.++ .+.+++++++|.+... +...+ +.++|+.|... ++||||..||...+....+++||+|+|++.
T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~g~~~--~~wH~D~~~~~~~~~~~~~~~wi~L~d~~~ 136 (211)
T PF05721_consen 59 PNFYDLFLHPPRILDLVRALLGSDVFVQNWLQSMYQDIVKPPGPGAAV--QPWHQDAPYWHTDPPENQLTVWIALDDITP 136 (211)
T ss_dssp HHHHHHHHTHHHHHHHHHHHHTSSEEEE--EEEEEEEEEE-TTTTC-E--EEEBEHHHCSTEESSSCEEEEEEESS-BBT
T ss_pred hhhHHHHhhHHHHHHHHHHhhCCcchhhhhhHHHHHhhhhccccCCCC--CCCCCCCcccccCCccceEEEEEeeccCCc
Confidence 6777888887 9999999999988765 56666 88999877654 589999999987667789999999999999
Q ss_pred cCCceeeecCCCCCCCCCccc--CCCCcccCcccccccccCCCCCCcccccccCCcEEEecCCccccCCCCCCcccccc
Q psy2275 268 DNGCLFVLPRTHRDPGSLLQH--EYPEWEGGVNAMYHGIRGFDSHDKLNLYMERGDTVFFHPLLIHGSGTNVTKAVLGT 344 (625)
Q Consensus 268 ~nG~l~vvpGSH~~~~~l~~~--~~~~w~~~~~~~~~~~~~~~~~~~v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~ 344 (625)
+||||+|+||||++ ...++ .......... ............++++++|||+||||++++|||++|.|+.+|++
T Consensus 137 ~~G~~~v~pGSH~~--~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~Gdvl~~~~~~~H~s~~N~s~~~R~a 211 (211)
T PF05721_consen 137 ENGPLEVVPGSHKW--GVEPHEERFPEEDFPEE--DDEESDEDEDEWVPVPMKAGDVLFFHSRLIHGSGPNTSDDPRRA 211 (211)
T ss_dssp TCTCEEEETTGCCS--CCEEECCCCCCCCCCCC--HHHHHHHHCSGCEEE-BSTTEEEEEETTSEEEEE-B-SSSTEEE
T ss_pred ccCceEeecCCcCC--Ccccccccccccccccc--cccccccccCceEEeecCCCeEEEEcCCccccCCCCCCcCcCCC
Confidence 99999999999995 22211 1111000000 00000112567899999999999999999999999999999975
No 11
>COG5285 Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=99.80 E-value=3e-19 Score=174.55 Aligned_cols=129 Identities=21% Similarity=0.277 Sum_probs=92.0
Q ss_pred HhhcHhHHHHHHHhcC-CcceeeccceeecCCCCCCCcccCCCCCCCCcCCCCCCCcceEEEEeeccCcccCCceeeecC
Q psy2275 199 YTMHKKLLDYVESFIG-PHIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRPAHRIVAAWTAMERIDKDNGCLFVLPR 277 (625)
Q Consensus 199 ~~~~~~l~~~~~~llg-~~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~~nG~l~vvpG 277 (625)
+...++|..+++.+++ .+..++....+.||. ...+.||||..+|...+ ..+|++||||+|+++||||++||||
T Consensus 98 ~~a~~~i~~~~~~~~~~~~~v~~~~~~~~~p~-----~~~t~~HqD~~~~~~~~-~~lV~~wiAl~d~~~dnGat~vvPg 171 (299)
T COG5285 98 LPAAPKILAVTRTLLLRHGHVLWKMPGFQKPG-----AVATRWHQDYPLVSPGY-PALVNAWIALCDFTEDNGATLVVPG 171 (299)
T ss_pred hhhhHHHHHHHHHHhccCCeEEEecCCCCCCc-----ccccccccccccccCCc-cceEEEEEeccccccccCceEEEec
Confidence 4445566666666655 333333333344331 22368999987775544 4589999999999999999999999
Q ss_pred CCCCCCCCcccCCCCcccCcccccccccCCCCCCcccccccCCcEEEecCCccccCCCCCCccccccccc
Q psy2275 278 THRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLYMERGDTVFFHPLLIHGSGTNVTKAVLGTSQR 347 (625)
Q Consensus 278 SH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~~~~ 347 (625)
||+| ..+ ++ |.+ .+.+.+...++++|++||+||||+.|+|+++.|+|+.+|.|+..
T Consensus 172 SH~~-~~~-~~----r~d--------~~~y~~~~~~pv~lekGDallF~~~L~HaA~aNrT~~~R~A~~~ 227 (299)
T COG5285 172 SHKW-DVI-PE----RPD--------HETYLERNAVPVELEKGDALLFNGSLWHAAGANRTSADRVALTL 227 (299)
T ss_pred cccc-ccC-CC----CCC--------ccchhhhcceeeeecCCCEEEEcchhhhhhhcCCCCcccceEEE
Confidence 9996 211 11 111 11233455899999999999999999999999999999998754
No 12
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=99.43 E-value=3.4e-14 Score=139.04 Aligned_cols=98 Identities=32% Similarity=0.638 Sum_probs=76.0
Q ss_pred CCccccccccCCCCCCC--ceeEeeecccccccccccC-CCCCCCCCCCCCCCCChhhHHHHHHHHhhHHHHhcCcccCC
Q psy2275 526 RPEDLKDNLLNDEHTPP--EWRAVGSLANSVEFGRDFG-TGSRFDKNGTEFNWWDPSTREKYKVKSKCMVDQYGKYVVPE 602 (625)
Q Consensus 526 ~~~~~~~~~~~~~~~p~--~~r~~g~l~N~~eF~~aFd-cGs~~d~~G~~~~Ww~~~~~~~f~~~~~c~~~qy~~~~~~~ 602 (625)
-|....+..++++..|. +|+++|.++ +||+.|+|| .|+.+|.+|+.++||++.+.+.|.++++|+++||+++....
T Consensus 13 ip~~~l~~P~f~~~~p~~~~yg~lG~il-ahel~hafd~~g~~~D~~g~~~~wWs~~~~~~~~~~~~Cl~~qy~~~~~~~ 91 (206)
T PF01431_consen 13 IPAGILQPPFFDPNYPPALNYGGLGFIL-AHELMHAFDPEGINYDEDGNLRNWWSEESRSNFKERAQCLRDQYSNYSVPE 91 (206)
T ss_dssp EEGGGSSTTT--TTS-HHHHHHTHHHHH-HHHHHHCTSTTGGGB-TTS-BS--S-HHHHHHHHHHHHHHHHHHHT-BCGG
T ss_pred ecHHHhCCccCCCCCCHHHHHHHHHHHH-HHHHHHHHHHhHhhcCcccccccccchhhHHHHHHHHHHHHHHHhcccccc
Confidence 35556666667777665 899999999 999999999 69999999999999999999999999999999999999875
Q ss_pred CCc--ccCcccc-------hHHHHHHHHHHh
Q psy2275 603 VNG--SVNGVNT-------LVGLAIAIAIAS 624 (625)
Q Consensus 603 ~~~--~~ng~~t-------~gg~~~a~~a~~ 624 (625)
.+. .+||..| ++||++||+||+
T Consensus 92 ~~~~~~~~~~~~~~E~iaD~~gl~~a~~ay~ 122 (206)
T PF01431_consen 92 GNKSVSLNGSDTLNENIADNAGLRLAYRAYR 122 (206)
T ss_dssp GTTCSB--TTTTHHHHHHHHHHHHHHHHHHH
T ss_pred cccccccchHHHHHHHHHHHHhHHHHHHHHh
Confidence 332 6889988 689999999997
No 13
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=98.89 E-value=5.3e-09 Score=112.37 Aligned_cols=134 Identities=22% Similarity=0.419 Sum_probs=103.0
Q ss_pred hHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHHhhhcCChhhhcchhhhcccccHHHHHhhCCCCCC------C
Q psy2275 357 NPVVKAYLEYMVDIAVMMGASRETAQQELTASLQFESRLRNITKSNEEYRKMSKLSNLVTISHLEHKYPILGT------S 430 (625)
Q Consensus 357 ~~~~~Ay~~~~~~~~~~~g~~~~~a~~~~~~vl~fE~~LA~~~~~~~~~~~~~~~yn~~~i~~l~~~~P~l~~------~ 430 (625)
...+.+|..++.+++..+|.+.+++.+.+++|++||..|+++..+.++++++...|+++++.+++..+|.++| +
T Consensus 162 ~~~~~~y~~~v~~~l~~~g~~~~~~~~~~~~i~~~e~~L~~~~~~~~~~~~~~~~~~~~tl~eL~~~~p~i~w~~~l~~~ 241 (390)
T PF05649_consen 162 KEYLQAYREYVREVLKLLGVDDDEASKLADDIVKFEKQLAKLSSEDEERRDPQKLYNPMTLSELQSQYPQIDWDRYLKAL 241 (390)
T ss_dssp HHHHHHHHHHHHHHHHHCSS-HHHHHHHHHHHHHHHHHHHHHS--GGGCT-HHHH-EEEEHHHHHHHSTTS-HHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhhccCChHHHHHHHHHHHHHHHHHHHHhhhhHhhccHHHhcCcccHHHHHHHHhhhhHHHHHHHH
Confidence 4578899999999999999999999999999999999999999999999999999999999999999998865 2
Q ss_pred ccC---------Ccccccc-------------------------------------------------------------
Q psy2275 431 FSN---------DRPSYMN------------------------------------------------------------- 440 (625)
Q Consensus 431 f~~---------~~p~~l~------------------------------------------------------------- 440 (625)
+.. ..|+|++
T Consensus 242 ~~~~~~~~~V~v~~~~y~~~l~~ll~~~~~~~l~nYl~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~C~~~ 321 (390)
T PF05649_consen 242 FGEVTESDEVIVRSPDYFKKLSELLKQTSKETLANYLGWRLIRELSPYLSSELRDLYFEFYRTLSGVEESRPRWQRCVRH 321 (390)
T ss_dssp CCCCCTTSEEEES-HHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHGGGS-HHHHHHHHHHHTHHH--SS---HHHHHHHH
T ss_pred ccccCCCceEEEcCcHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHhcCCccchhhHHHHHHH
Confidence 221 2445544
Q ss_pred -cchhhHHHHHHHHHh-------------hccchhhHhhcCCCCCCCCHHHHHHHHHHh-----cCCCC
Q psy2275 441 -FGSMGWTIGHEITHG-------------FDSTGSRFDKNGTEINWWDPSTREKYKILW-----LLHLP 490 (625)
Q Consensus 441 -~g~lG~~ig~e~~~~-------------~d~~~~~f~~~l~~~~W~~~~T~~~a~~k~-----~i~~~ 490 (625)
...|+.++|+.+++. +++++.+|.+.+...+||+++||+.|.+|+ .||||
T Consensus 322 ~~~~l~~~~~~~y~~~~~~~~~~~~v~~~~~~ik~~f~~~i~~~~Wld~~tk~~a~~Kl~~m~~~iGyP 390 (390)
T PF05649_consen 322 VEKLLPFALGRLYVRRYFDKENKKEVEDMVENIKEAFRERIEESDWLDEETKKEAIEKLNNMKLNIGYP 390 (390)
T ss_dssp HHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHHCT-TTS-HHHHHHHHHHHHH-EEEEES-
T ss_pred HHHHHHHHHHhhHhhhcCchhHHHHHHHHHHHHHHHHHHHHhhCCCCCHHHHHHHHHHHHhhhhhccCC
Confidence 024555555544433 678899999999999999999999999996 68887
No 14
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=95.55 E-value=0.13 Score=50.66 Aligned_cols=75 Identities=16% Similarity=0.090 Sum_probs=50.6
Q ss_pred CCCCCCCCcCCCCC---C-CcceEEEEeeccCc-ccCCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCc
Q psy2275 238 HPLHQDLHYFPHRP---A-HRIVAAWTAMERID-KDNGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDK 312 (625)
Q Consensus 238 ~~~HqD~~y~~~~~---~-~~~~t~wi~L~d~~-~~nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~ 312 (625)
..+|.|..+..... . ...+|+-++|.|.. -++|-|.+-... +.
T Consensus 93 y~~H~D~~~~~~~~~~~~~rs~lS~~lyLnd~~~yeGGEl~~~~~~--------------------------------g~ 140 (226)
T PRK05467 93 YGFHVDNAVRSLPGTGGRVRTDLSATLFLSDPDDYDGGELVIEDTY--------------------------------GE 140 (226)
T ss_pred cCccccCCcccCCCCCcceeEEEEEEEEeCCCCCCcCCceEEecCC--------------------------------Cc
Confidence 46899998765321 1 13689999999865 366665543211 11
Q ss_pred ccccccCCcEEEecCCccccCCCCCCccccccc
Q psy2275 313 LNLYMERGDTVFFHPLLIHGSGTNVTKAVLGTS 345 (625)
Q Consensus 313 v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~~ 345 (625)
..+..++|++|+|.+.++|...|-++.. |.++
T Consensus 141 ~~Vkp~aG~~vlfps~~lH~v~pVt~G~-R~~~ 172 (226)
T PRK05467 141 HRVKLPAGDLVLYPSTSLHRVTPVTRGV-RVAS 172 (226)
T ss_pred EEEecCCCeEEEECCCCceeeeeccCcc-EEEE
Confidence 2345679999999999999999976643 5444
No 15
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=95.02 E-value=0.12 Score=48.91 Aligned_cols=75 Identities=16% Similarity=0.094 Sum_probs=49.6
Q ss_pred CCCCCCCCcCCCCCCCcceEEEEeeccCcccCCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCcccccc
Q psy2275 238 HPLHQDLHYFPHRPAHRIVAAWTAMERIDKDNGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLYM 317 (625)
Q Consensus 238 ~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~~nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~~ 317 (625)
..+|.|....... ..+.+|+.++|.|. .++|.+.+...... ....+.-
T Consensus 96 ~~~H~D~~~~~~~-~~r~~T~~~yLn~~-~~GG~~~f~~~~~~------------------------------~~~~v~P 143 (178)
T smart00702 96 YGPHVDNFEDDEN-GDRIATFLLYLNDV-EEGGELVFPGLGLM------------------------------VCATVKP 143 (178)
T ss_pred ccCcCCCCCCCCC-CCeEEEEEEEeccC-CcCceEEecCCCCc------------------------------cceEEeC
Confidence 3679998654322 35689999999984 45666655442110 1123344
Q ss_pred cCCcEEEecC---CccccCCCCCCccccccc
Q psy2275 318 ERGDTVFFHP---LLIHGSGTNVTKAVLGTS 345 (625)
Q Consensus 318 ~aGd~l~f~~---~~~H~s~~N~s~~~R~~~ 345 (625)
++|++|+|.+ .++|+..|-.+. .|+++
T Consensus 144 ~~G~~v~f~~~~~~~~H~v~pv~~G-~r~~~ 173 (178)
T smart00702 144 KKGDLLFFPSGRGRSLHGVCPVTRG-SRWAI 173 (178)
T ss_pred CCCcEEEEeCCCCCccccCCcceeC-CEEEE
Confidence 6899999998 599999998773 45554
No 16
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=93.43 E-value=0.041 Score=46.73 Aligned_cols=80 Identities=19% Similarity=0.228 Sum_probs=48.3
Q ss_pred CCCCCCCCcCCCCCCCcceEEEEeeccCc--ccCCceeeecCC-CCCCCCCcccCCCCcccCcccccccccCCCCCCccc
Q psy2275 238 HPLHQDLHYFPHRPAHRIVAAWTAMERID--KDNGCLFVLPRT-HRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLN 314 (625)
Q Consensus 238 ~~~HqD~~y~~~~~~~~~~t~wi~L~d~~--~~nG~l~vvpGS-H~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~ 314 (625)
..||.|..+ .....+|+.++|.+-+ .+.|.|.+.+.. +. . ... .+. ...
T Consensus 12 ~~~H~D~~~----~~~~~~t~llyL~~~~~~~~GG~l~~~~~~~~~--------~-------~~~---~~~------~~~ 63 (100)
T PF13640_consen 12 FGPHTDNSY----DPHRRVTLLLYLNDPEWEFEGGELEFYPSKDSD--------D-------VSR---EVE------DFD 63 (100)
T ss_dssp EEEEESSSC----CCSEEEEEEEESS-CS-HCEE--EEETTTS-TS--------S-------TCE---EEG------GGS
T ss_pred EeeeECCCC----CCcceEEEEEEECCCCcccCCCEEEEeccccCC--------C-------cce---EEE------ecc
Confidence 468999855 3457899999999665 677777777643 11 0 000 000 011
Q ss_pred ccccCCcEEEecC-CccccCCCCCCccccccc
Q psy2275 315 LYMERGDTVFFHP-LLIHGSGTNVTKAVLGTS 345 (625)
Q Consensus 315 ~~~~aGd~l~f~~-~~~H~s~~N~s~~~R~~~ 345 (625)
+..++|++|+|.+ .++|+..+......|.++
T Consensus 64 ~~p~~g~~v~F~~~~~~H~v~~v~~~~~R~~l 95 (100)
T PF13640_consen 64 IVPKPGRLVIFPSDNSLHGVTPVGEGGRRYSL 95 (100)
T ss_dssp EE-BTTEEEEEESCTCEEEEEEE-EESEEEEE
T ss_pred ccCCCCEEEEEeCCCCeecCcccCCCCCEEEE
Confidence 1177899999999 999999888444555544
No 17
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=93.41 E-value=0.033 Score=47.63 Aligned_cols=37 Identities=24% Similarity=0.216 Sum_probs=24.7
Q ss_pred CCCcccccccCCcEEEecCCccccCCCCCCccccccc
Q psy2275 309 SHDKLNLYMERGDTVFFHPLLIHGSGTNVTKAVLGTS 345 (625)
Q Consensus 309 ~~~~v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~~ 345 (625)
......+..++||+|||.+.+.|+..||.++.+|-++
T Consensus 62 ~~~~~~~~p~~G~lvlFPs~l~H~v~p~~~~~~Risi 98 (101)
T PF13759_consen 62 NSPYYIVEPEEGDLVLFPSWLWHGVPPNNSDEERISI 98 (101)
T ss_dssp C-SEEEE---TTEEEEEETTSEEEE----SSS-EEEE
T ss_pred cCceEEeCCCCCEEEEeCCCCEEeccCcCCCCCEEEE
Confidence 3456678889999999999999999999999999876
No 18
>PF07350 DUF1479: Protein of unknown function (DUF1479); InterPro: IPR010856 This family consists of several hypothetical Enterobacterial proteins, of around 420 residues in length. Members of this family are often known as YbiU. The function of this family is unknown.; PDB: 2CSG_A 2DBI_A 2DBN_A.
Probab=91.64 E-value=0.13 Score=54.83 Aligned_cols=83 Identities=23% Similarity=0.408 Sum_probs=35.9
Q ss_pred cceEEEEeeccCcccCCceeeecCCCCCCC-----CCc-ccCC-CC----cccCccccccc--------cc-CC----C-
Q psy2275 254 RIVAAWTAMERIDKDNGCLFVLPRTHRDPG-----SLL-QHEY-PE----WEGGVNAMYHG--------IR-GF----D- 308 (625)
Q Consensus 254 ~~~t~wi~L~d~~~~nG~l~vvpGSH~~~~-----~l~-~~~~-~~----w~~~~~~~~~~--------~~-~~----~- 308 (625)
++.+-|++|.++....|.|+|+|.-+.-.. +++ +... .. +....+..+.+ +. .. .
T Consensus 235 R~FQGWlaLS~~~p~eGtL~v~P~~~~atAY~lLRPfF~~~~~~~~~~~~~~~~~~~~~~Ga~pg~~q~~~~~~HPhL~L 314 (416)
T PF07350_consen 235 RTFQGWLALSDTGPGEGTLRVLPLLKLATAYFLLRPFFDPVPSPSESPDYEDDPQTSIFPGAVPGRAQELSPKWHPHLRL 314 (416)
T ss_dssp -SEEEEEESS-B-TTSS-EEE-S-CC-HHHHHHHGGGSTTS--------------TT--TT--TTS-EE--TTT-HH--H
T ss_pred eccchhhccccCCCCCCeEEEecchHHHHHHHHhhhhcCCccccccccccccccCCCCCCCcCCCCCccCCcccCCchhh
Confidence 578899999999999999999998773100 111 1110 00 00000111110 00 00 1
Q ss_pred CCCccccc-ccCCcEEEecCCccccCCCC
Q psy2275 309 SHDKLNLY-MERGDTVFFHPLLIHGSGTN 336 (625)
Q Consensus 309 ~~~~v~~~-~~aGd~l~f~~~~~H~s~~N 336 (625)
....|+++ ++|||.|+|||.++|+-.+.
T Consensus 315 ~~~mv~iP~v~PGD~V~WHcD~iH~Vd~~ 343 (416)
T PF07350_consen 315 EKTMVSIPDVEPGDYVFWHCDLIHAVDPE 343 (416)
T ss_dssp HTT-EE---B-TT-EEEEETT--EEE--B
T ss_pred hcccccCCCCCCCCeEEEeCCcccccccc
Confidence 34455555 99999999999999987664
No 19
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=88.10 E-value=0.35 Score=46.76 Aligned_cols=35 Identities=14% Similarity=0.128 Sum_probs=30.6
Q ss_pred CcccccccCCcEEEecCCccccCCCCCCccccccc
Q psy2275 311 DKLNLYMERGDTVFFHPLLIHGSGTNVTKAVLGTS 345 (625)
Q Consensus 311 ~~v~~~~~aGd~l~f~~~~~H~s~~N~s~~~R~~~ 345 (625)
..+.+.-++|++|+|.+.++|+..||.++..|-++
T Consensus 160 ~~~~v~P~~G~lvlFPS~L~H~v~p~~~~~~RISi 194 (201)
T TIGR02466 160 RFVYVPPQEGRVLLFESWLRHEVPPNESEEERISV 194 (201)
T ss_pred ccEEECCCCCeEEEECCCCceecCCCCCCCCEEEE
Confidence 34456669999999999999999999999999876
No 20
>PLN00052 prolyl 4-hydroxylase; Provisional
Probab=86.05 E-value=3.8 Score=42.42 Aligned_cols=84 Identities=20% Similarity=0.272 Sum_probs=48.8
Q ss_pred CCCCCCCcCC---CCCCCcceEEEEeeccCcccCCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCcccc
Q psy2275 239 PLHQDLHYFP---HRPAHRIVAAWTAMERIDKDNGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNL 315 (625)
Q Consensus 239 ~~HqD~~y~~---~~~~~~~~t~wi~L~d~~~~nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~ 315 (625)
.+|.|...-. .....+..|+.++|.|+. ++|.+.| |....+ ...+.. ..|.. . ...-+.+
T Consensus 146 ~~H~D~~~~~~~~~~gg~R~aTvL~YLndv~-~GGeT~F-P~~~~~--~~~~~~-~~~s~-----------c-~~~gl~V 208 (310)
T PLN00052 146 EPHFDYFHDKINQALGGHRYATVLMYLSTVD-KGGETVF-PNAEGW--ENQPKD-DTFSE-----------C-AHKGLAV 208 (310)
T ss_pred CCCCCccccccccccCCceeEEEEEEeccCC-CCCceec-CCcccc--cccccc-cchhh-----------h-hcCCeEe
Confidence 5699964311 123467999999999986 4555544 433221 000000 01110 0 1123566
Q ss_pred cccCCcEEEecC---------CccccCCCCCCc
Q psy2275 316 YMERGDTVFFHP---------LLIHGSGTNVTK 339 (625)
Q Consensus 316 ~~~aGd~l~f~~---------~~~H~s~~N~s~ 339 (625)
.-++|++|||.+ .++|++.|-...
T Consensus 209 kPkkG~ALlF~nl~~dG~~D~~SlHagcPVi~G 241 (310)
T PLN00052 209 KPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEG 241 (310)
T ss_pred ccCcceEEEEeccCCCCCCCcccccCCCeeecC
Confidence 678999999998 579998887664
No 21
>PF12851 Tet_JBP: Oxygenase domain of the 2OGFeDO superfamily ; InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=64.79 E-value=14 Score=34.89 Aligned_cols=26 Identities=15% Similarity=0.299 Sum_probs=21.2
Q ss_pred ccccccCCcEEEecCC-ccccCCCCCC
Q psy2275 313 LNLYMERGDTVFFHPL-LIHGSGTNVT 338 (625)
Q Consensus 313 v~~~~~aGd~l~f~~~-~~H~s~~N~s 338 (625)
+.+.+.+|||+|+.+. ..||-.|-.+
T Consensus 128 ~~~~~~~GtVl~~~~~~~~Hgvtpv~~ 154 (171)
T PF12851_consen 128 VAFAYQPGTVLIFCAKRELHGVTPVES 154 (171)
T ss_pred EEEecCCCcEEEEcccceeeecCcccC
Confidence 4567889999999999 8998776543
No 22
>PF08856 DUF1826: Protein of unknown function (DUF1826); InterPro: IPR014955 These proteins are functionally uncharacterised.
Probab=64.24 E-value=27 Score=33.68 Aligned_cols=32 Identities=16% Similarity=0.063 Sum_probs=23.8
Q ss_pred cccccccCCcEEEecCC---------ccccCCCCC-Cccccc
Q psy2275 312 KLNLYMERGDTVFFHPL---------LIHGSGTNV-TKAVLG 343 (625)
Q Consensus 312 ~v~~~~~aGd~l~f~~~---------~~H~s~~N~-s~~~R~ 343 (625)
.....+++||+.++.+. ++|+|++-. ++..|.
T Consensus 150 ~~i~~~~~G~vallKG~~w~g~~~~glvHRSP~~~~~~~~RL 191 (196)
T PF08856_consen 150 ARIQQLPTGDVALLKGERWPGNEGAGLVHRSPPISGSGERRL 191 (196)
T ss_pred ccceecCCCCEEEEccCCCCCCCCCceeeCCCCCCCCCCceE
Confidence 34456889999999887 788888877 555554
No 23
>PHA02813 hypothetical protein; Provisional
Probab=57.41 E-value=17 Score=37.78 Aligned_cols=65 Identities=14% Similarity=0.134 Sum_probs=43.7
Q ss_pred CCCCCCCCcCCCCCCCcceEEEEeeccCcccCCceeee-cCCCCCCCCCcccCCCCcccCcccccccccCCCCCCccccc
Q psy2275 238 HPLHQDLHYFPHRPAHRIVAAWTAMERIDKDNGCLFVL-PRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLY 316 (625)
Q Consensus 238 ~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~~nG~l~vv-pGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~ 316 (625)
..||+|..|...... -.+|+.++|.+. .++|-+.+. ++.+ .
T Consensus 107 F~~H~Dg~~~r~k~~-s~~tLLLYLN~~-~~GGeT~f~~~~~t------------------------------------s 148 (354)
T PHA02813 107 FNNHRDFIHFKSKNC-YCYHLVLYLNNT-SKGGNTNIHIKDNT------------------------------------I 148 (354)
T ss_pred cCcccCCceeecCCc-eEEEEEEEEecc-CCCCceEEEcCCCc------------------------------------e
Confidence 368999877654222 578899999997 333333222 1111 1
Q ss_pred ccCCcEEEecCCccccCCCCCCcc
Q psy2275 317 MERGDTVFFHPLLIHGSGTNVTKA 340 (625)
Q Consensus 317 ~~aGd~l~f~~~~~H~s~~N~s~~ 340 (625)
+..|.+++|+-.++|.|.+-.++.
T Consensus 149 I~~g~dlLFdh~l~Heg~~V~sG~ 172 (354)
T PHA02813 149 FSTKNDVLFDKTLNHSSDIITDGE 172 (354)
T ss_pred EeecceEEEecccccCCcEeccCe
Confidence 348999999999999998877754
No 24
>TIGR00568 alkb DNA alkylation damage repair protein AlkB. Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Probab=43.05 E-value=80 Score=29.66 Aligned_cols=16 Identities=25% Similarity=0.358 Sum_probs=13.1
Q ss_pred cccccccCCcEEEecC
Q psy2275 312 KLNLYMERGDTVFFHP 327 (625)
Q Consensus 312 ~v~~~~~aGd~l~f~~ 327 (625)
.+.+.++.||++|+.+
T Consensus 145 ~~~l~L~sGsllvM~G 160 (169)
T TIGR00568 145 PKRLRLHSGDVVIMGG 160 (169)
T ss_pred eEEEEeCCCCEEEECC
Confidence 4567899999999865
No 25
>COG3128 PiuC Uncharacterized iron-regulated protein [Function unknown]
Probab=42.83 E-value=61 Score=30.68 Aligned_cols=28 Identities=14% Similarity=0.234 Sum_probs=23.2
Q ss_pred cccccccCCcEEEecCCccccCCCCCCc
Q psy2275 312 KLNLYMERGDTVFFHPLLIHGSGTNVTK 339 (625)
Q Consensus 312 ~v~~~~~aGd~l~f~~~~~H~s~~N~s~ 339 (625)
...+.++|||+|++.+.++|.-.|-+-+
T Consensus 143 ~h~VklPAGdLVlypStSlH~VtPVTRg 170 (229)
T COG3128 143 NHRVKLPAGDLVLYPSTSLHEVTPVTRG 170 (229)
T ss_pred ceEEeccCCCEEEcccccceeccccccC
Confidence 4567789999999999999998776544
No 26
>PHA02869 C4L/C10L-like gene family protein; Provisional
Probab=27.45 E-value=2e+02 Score=30.72 Aligned_cols=66 Identities=14% Similarity=0.093 Sum_probs=42.4
Q ss_pred CCCCCCCCcCCCCCCCcceEEEEeeccCcccCCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCcccccc
Q psy2275 238 HPLHQDLHYFPHRPAHRIVAAWTAMERIDKDNGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLYM 317 (625)
Q Consensus 238 ~~~HqD~~y~~~~~~~~~~t~wi~L~d~~~~nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~~ 317 (625)
..||+|..|... ...-.+|+.++|.+. .++|-+.+.-+.+ ..+..
T Consensus 116 F~~H~Dg~~~rs-~e~s~~tLLLYLNd~-~~GGET~f~~~~~---------------------------------~sI~p 160 (418)
T PHA02869 116 FARHRDFSTVFS-KNIICVHLLLYLEQP-ETGGETVIYIDNN---------------------------------TSVKL 160 (418)
T ss_pred ccccccCceecC-CCEEEEEEEEEEecc-CCCCceEEEeCCC---------------------------------ceEec
Confidence 468999876544 333468899999998 3444443322111 12334
Q ss_pred cCCcEEEecCCccccCCCCCCcc
Q psy2275 318 ERGDTVFFHPLLIHGSGTNVTKA 340 (625)
Q Consensus 318 ~aGd~l~f~~~~~H~s~~N~s~~ 340 (625)
+.| ++|+-.+.|.|.+-.++.
T Consensus 161 ksg--LLFdh~l~Heg~~V~sG~ 181 (418)
T PHA02869 161 KTD--HLFDKTIEHESITVESGR 181 (418)
T ss_pred CCC--eEeccccccCCcEeecCe
Confidence 566 889999999988877653
No 27
>PF10835 DUF2573: Protein of unknown function (DUF2573); InterPro: IPR020393 This entry contains proteins with no known function.
Probab=27.01 E-value=1.6e+02 Score=23.60 Aligned_cols=62 Identities=18% Similarity=0.236 Sum_probs=38.5
Q ss_pred ccHHHHHHHHHhhhhccCccCC-----CcchhhHHH--HhhhhhHHHHHHHHhhhcChHHHHHHHHHHHHHHHH
Q psy2275 76 LTEEMNNRYFKFSSTLRGVTAR-----KPRWQECVD--ETKDLDIAVGALYIRKYFNQDAKANVETMVRLILDE 142 (625)
Q Consensus 76 L~~~~~~~~~~f~~~l~G~~~~-----~~r~~~C~~--~~~~~~~~lg~lyv~~~f~~~~~~~v~~m~~~Ik~~ 142 (625)
+.+.|..+..+|...+.|.... ...| |+- ...+||....+ -+---+++++.+.+++.+||+-
T Consensus 4 l~eq~dgLveKytELL~Ge~~~e~~EkVk~W--~lYshiaKsMPpL~kH---WN~~~PeaK~~ik~li~~Ik~l 72 (82)
T PF10835_consen 4 LQEQFDGLVEKYTELLLGETSPEMKEKVKQW--ALYSHIAKSMPPLAKH---WNGTYPEAKEEIKELIEEIKQL 72 (82)
T ss_pred HHHHHHHHHHHHHHHHhcCCCHHHHHHHHHH--HHHHHHHHhCcHHHHh---hcccCchHHHHHHHHHHHHHHH
Confidence 3445666677787777775422 2345 433 23458876533 1223467899999999999874
No 28
>PRK09553 tauD taurine dioxygenase; Reviewed
Probab=24.91 E-value=48 Score=33.78 Aligned_cols=37 Identities=5% Similarity=-0.045 Sum_probs=27.6
Q ss_pred CCCcccccccCCcEEEecCC-ccccCCCCCCccccccc
Q psy2275 309 SHDKLNLYMERGDTVFFHPL-LIHGSGTNVTKAVLGTS 345 (625)
Q Consensus 309 ~~~~v~~~~~aGd~l~f~~~-~~H~s~~N~s~~~R~~~ 345 (625)
..-.+....++||+++|+++ ++|++.+...+..|...
T Consensus 230 p~~~~~~~w~~GD~viwDNr~~~H~a~~~~~~~~R~~~ 267 (277)
T PRK09553 230 PEFQVRWRWQPNDVAIWDNRVTQHYANADYLPQRRIMH 267 (277)
T ss_pred cCeEEEEecCCCCEEEEcCcceeEecccCCCCCceEEE
Confidence 34556778999999999986 78998876666566433
No 29
>PF10014 2OG-Fe_Oxy_2: 2OG-Fe dioxygenase; InterPro: IPR018724 Members of this family of hypothetical bacterial proteins have no known function. Some are described as putative biofilm formation or putative agglutination proteins. ; PDB: 3PL0_B.
Probab=24.81 E-value=1.4e+02 Score=28.60 Aligned_cols=23 Identities=22% Similarity=0.211 Sum_probs=15.3
Q ss_pred ccccCCcEEEec-CCccccCCCCC
Q psy2275 315 LYMERGDTVFFH-PLLIHGSGTNV 337 (625)
Q Consensus 315 ~~~~aGd~l~f~-~~~~H~s~~N~ 337 (625)
...++||.++++ ..++|+..|=+
T Consensus 155 ~l~~p~d~l~~~D~~~~H~vtpI~ 178 (195)
T PF10014_consen 155 TLLEPGDTLLVDDRRVWHYVTPIR 178 (195)
T ss_dssp ---STTEEEEEETTTEEEEE--EE
T ss_pred EecCCCCEEEEeCCcceECCCcee
Confidence 346789999888 99999987743
No 30
>COG2175 TauD Probable taurine catabolism dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=23.67 E-value=49 Score=33.92 Aligned_cols=38 Identities=11% Similarity=0.098 Sum_probs=30.0
Q ss_pred CCcccccccCCcEEEecCC-ccccCCCCCCccccccccc
Q psy2275 310 HDKLNLYMERGDTVFFHPL-LIHGSGTNVTKAVLGTSQR 347 (625)
Q Consensus 310 ~~~v~~~~~aGd~l~f~~~-~~H~s~~N~s~~~R~~~~~ 347 (625)
...+....++||++|||++ |+|++..-.++..|+.-++
T Consensus 236 e~~~r~~wq~gDivIwDNr~~lH~a~~~~~~~~R~l~R~ 274 (286)
T COG2175 236 ENQYRHRWQPGDIVIWDNRATLHAATAFYADQRRILHRC 274 (286)
T ss_pred hhEEEEecCCCCEEEEeCcceeeccccCCccceEEEEEe
Confidence 4677888999999999986 6899888888866665433
No 31
>KOG1591|consensus
Probab=22.94 E-value=2.5e+02 Score=28.85 Aligned_cols=31 Identities=13% Similarity=0.178 Sum_probs=20.8
Q ss_pred CCCCCCCc-------CCCCCCCcceEEEEeeccCcccC
Q psy2275 239 PLHQDLHY-------FPHRPAHRIVAAWTAMERIDKDN 269 (625)
Q Consensus 239 ~~HqD~~y-------~~~~~~~~~~t~wi~L~d~~~~n 269 (625)
.+|-|... +......++-|+.++|.|+...+
T Consensus 190 ~~H~D~~~~~~~~~~~~~~~g~RiaT~l~yls~v~~GG 227 (289)
T KOG1591|consen 190 EPHYDYFLPEEDETFNGLNGGNRIATVLMYLSDVEQGG 227 (289)
T ss_pred cccccccccccchhhhhcccCCcceeEEEEecccCCCC
Confidence 45877653 22234467889999999987754
No 32
>PF13661 2OG-FeII_Oxy_4: 2OG-Fe(II) oxygenase superfamily
Probab=22.45 E-value=1.1e+02 Score=23.72 Aligned_cols=37 Identities=19% Similarity=0.136 Sum_probs=23.6
Q ss_pred CCCCCCCCcCCCCCCCcceEEEEeec---cCcccCCceeee
Q psy2275 238 HPLHQDLHYFPHRPAHRIVAAWTAME---RIDKDNGCLFVL 275 (625)
Q Consensus 238 ~~~HqD~~y~~~~~~~~~~t~wi~L~---d~~~~nG~l~vv 275 (625)
..||.|...-.. +..+.+++.|+|. +.+-+.|.|.+.
T Consensus 24 ~~~H~D~~~~~~-~~~r~~t~llYLn~~w~~d~~Gg~~~f~ 63 (70)
T PF13661_consen 24 FGWHVDADPSSS-GKRRFLTLLLYLNEDWDEDFGGGELFFD 63 (70)
T ss_pred eeeeEcCCcccc-ccceeEEEEEEecccccCccCCcEEEEe
Confidence 468999865433 3446789999999 444444444443
No 33
>COG4340 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.65 E-value=6.4e+02 Score=24.07 Aligned_cols=109 Identities=15% Similarity=0.175 Sum_probs=56.0
Q ss_pred ccHHHHHHhhcHhHHHHHHHhcCC-------cceeeccceeecCCCCCCCcccCCCCCCCCcCCCCCCCcceEEEEeecc
Q psy2275 192 YDEVFEKYTMHKKLLDYVESFIGP-------HIRAIHSMLINKPPDAGTQTSRHPLHQDLHYFPHRPAHRIVAAWTAMER 264 (625)
Q Consensus 192 ~~~~~~~~~~~~~l~~~~~~llg~-------~~~~~~~~~~~k~p~~g~~~~~~~~HqD~~y~~~~~~~~~~t~wi~L~d 264 (625)
+.|....++.+|-..+++..+-.. .+-+++..+-..+-..|- ..+-+-|||..=| -+.+-+|.
T Consensus 86 Fepl~da~~~~~~~r~l~aaf~~~~~L~p~~EIe~HQ~Ri~a~~de~gl-paPEG~HqDG~D~---------I~I~~vDR 155 (226)
T COG4340 86 FEPLTDAIADHPVTRGLIAAFELFDPLSPTSEIEMHQFRIEARTDEQGL-PAPEGAHQDGVDW---------IIIMLVDR 155 (226)
T ss_pred eeehhhhhhcCchHHHHHHHHHhcCCCCCcceeeeEEEEEEeecCCcCC-CCCccccccCccE---------EEEEEeee
Confidence 344445556666666665543332 223333333333333332 2345789997421 11223355
Q ss_pred CcccCCceeeecCCCCCCCCCcccCCCCcccCcccccccccCCCCCCcccccccCCcEEEe-cCCccccCCCCCCc
Q psy2275 265 IDKDNGCLFVLPRTHRDPGSLLQHEYPEWEGGVNAMYHGIRGFDSHDKLNLYMERGDTVFF-HPLLIHGSGTNVTK 339 (625)
Q Consensus 265 ~~~~nG~l~vvpGSH~~~~~l~~~~~~~w~~~~~~~~~~~~~~~~~~~v~~~~~aGd~l~f-~~~~~H~s~~N~s~ 339 (625)
-+.+.|-+.+.--+|.. ..-...+.+|++++. +..++|+..|=++.
T Consensus 156 ~NI~gGet~lY~~~~~~-----------------------------p~f~kvl~pGe~~~l~Dh~~~H~~tpi~p~ 202 (226)
T COG4340 156 QNIDGGETDLYAPDGAS-----------------------------PGFFKVLAPGEAVFLDDHRVLHGVTPIVPS 202 (226)
T ss_pred ccccCceEEEEccCCCC-----------------------------cceEEeccCCcEEEeccchhcccccceecc
Confidence 55556655544444441 111233668888776 57899998886543
Done!