Query psy980
Match_columns 587
No_of_seqs 347 out of 660
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 20:01:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy980.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/980hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2443|consensus 100.0 1.1E-67 2.3E-72 541.6 16.7 247 2-257 94-342 (362)
2 KOG2443|consensus 100.0 9.2E-59 2E-63 475.9 9.7 232 314-568 71-342 (362)
3 PF04258 Peptidase_A22B: Signa 100.0 4.7E-57 1E-61 465.4 -0.1 239 4-254 43-298 (298)
4 KOG2442|consensus 100.0 2.9E-46 6.3E-51 396.4 13.2 237 4-261 269-523 (541)
5 PF04258 Peptidase_A22B: Signa 100.0 8.9E-46 1.9E-50 381.8 -0.0 230 314-565 16-298 (298)
6 smart00730 PSN Presenilin, sig 100.0 5.6E-42 1.2E-46 345.9 17.8 207 3-246 32-249 (249)
7 KOG2442|consensus 100.0 1.1E-38 2.4E-43 338.8 6.9 213 325-570 257-521 (541)
8 smart00730 PSN Presenilin, sig 100.0 4.9E-31 1.1E-35 266.6 10.6 181 325-557 21-249 (249)
9 PF06550 DUF1119: Protein of u 99.5 8.5E-14 1.8E-18 142.2 14.4 173 70-246 91-281 (283)
10 COG3389 Uncharacterized protei 99.3 3.5E-12 7.6E-17 126.8 8.1 161 77-244 98-272 (277)
11 PF06550 DUF1119: Protein of u 97.0 0.0026 5.7E-08 66.2 9.1 138 420-558 92-282 (283)
12 PF01080 Presenilin: Presenili 92.5 0.21 4.5E-06 55.0 5.8 56 167-229 325-381 (403)
13 COG3389 Uncharacterized protei 89.4 0.85 1.8E-05 46.8 6.3 80 423-503 95-179 (277)
14 KOG2736|consensus 89.0 0.3 6.5E-06 53.0 2.9 66 167-240 319-384 (406)
15 KOG2736|consensus 29.3 49 0.0011 36.6 3.2 36 512-549 347-382 (406)
16 PF01080 Presenilin: Presenili 24.9 68 0.0015 35.8 3.4 29 512-540 353-381 (403)
17 PF07857 DUF1632: CEO family ( 20.8 3.1E+02 0.0067 28.8 7.1 31 328-358 211-241 (254)
18 TIGR02587 putative integral me 20.1 1.1E+02 0.0024 32.5 3.7 48 200-251 69-126 (271)
No 1
>KOG2443|consensus
Probab=100.00 E-value=1.1e-67 Score=541.65 Aligned_cols=247 Identities=56% Similarity=0.976 Sum_probs=232.0
Q ss_pred HhHHHHHHHHHHHHHHHHHhhhhhccccccCccCCcceEEeeccCCCccccccccccceeEEEEEeccchhHHHHHHHHH
Q psy980 2 INLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVF 81 (587)
Q Consensus 2 in~ll~~Yf~~lGv~al~~~l~p~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~dl~~~~~s~~~ 81 (587)
+| +|+.||++.|+.|+++++.|+++...|...|+.+||+.++++.+ ++|| +..|.+||+.|++++..|+.+
T Consensus 94 ~~-ll~~~ff~~g~~al~~~~~p~~~~~~~~l~p~~~~~~~~~~~~~-~~~~-------~~~~~~Ft~~~iv~~vls~~i 164 (362)
T KOG2443|consen 94 IN-LLTMYFFFLGVIALLSLLDPFINAFKFLLLPMCQYHLLFPRGPG-EKKE-------FICNGKFTRAQIVALVLSSMI 164 (362)
T ss_pred HH-HHHHHHHHHHHHHHHHhhHHHHhhhhhhcCccchhheeeeccCC-cccc-------eeecccccHHHHHHHHHHHHH
Confidence 34 89999999999999999999999777777788889888888776 4443 789999999999999999999
Q ss_pred HhhhhccccchHHhHHHHHHHHHHHHhcccChhHHHHHHHHHHhhhheeeEeccchhhhhcccCCCCeEEEecccccccC
Q psy980 82 GSWYLVKKHWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVFGTNVMVTVAKSFEAPIKLVFPQDLLEHG 161 (587)
Q Consensus 82 ~~~y~~~~~WilnNilgi~~~i~~i~~l~l~s~k~g~iLL~gLf~YDIFwVFgtsVMvtVA~~~d~PikL~~P~~~~~~~ 161 (587)
++||+.++||++||++|+++|+++|+.+|+||+|+|++||.|||+|||||||||||||||||++|+|+|+++|++....+
T Consensus 165 ~v~~ll~~HWl~nN~lgms~~I~~I~~lrL~s~ktgalLL~gLffYDIfwVFgTnVMVtVAt~~D~PikL~fP~~l~~~~ 244 (362)
T KOG2443|consen 165 VVWYLLTKHWLANNLLGMSFCIAGIEFLRLPSLKTGALLLGGLFFYDIFWVFGTNVMVTVATSLDAPIKLVFPQKLLFPG 244 (362)
T ss_pred HHHHHhhhHHHHHhHHHHHHHHHHHHHhcccchhhhHHHHHHHHHHheEEEecCceEEEeecccCCceEEecchhhccCC
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999987665
Q ss_pred ccCCccccccCCcchhhHHHHHHHHHhhhccCCCC--ccchhHhHHHHHHHHHHHHHHHHHhCCCcceehhhhhhhhHHH
Q psy980 162 VSANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRS--NTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYLVPACLGLP 239 (587)
Q Consensus 162 ~~~~~~smLGLGDIViPGifIa~~lRFD~~~~~~~--~~YF~~s~igY~lGL~~T~~vm~~~~~aQPALLYLVP~~l~~~ 239 (587)
..+.+||||||||||+||+|+|+++|||.++++++ ++||+++++||.+||..|+++|++||+|||||||+||+|++.+
T Consensus 245 ~~as~fsMLGLGDIviPGiflAl~lRfD~~k~~~s~~~~YF~~t~i~Y~~gL~~ti~~~~~FkaAQPALLYlVP~~l~~~ 324 (362)
T KOG2443|consen 245 LTASNFSMLGLGDIVIPGIFLALVLRFDIRKKRNSKVRTYFHNTFIAYFLGLLTTIVVLHIFKAAQPALLYLVPACLGPL 324 (362)
T ss_pred CccccceeccccchhhHHHHHHHHHHhhHHHHhcccCceEEEEeHHHHHhhhHHHhhhhhhhhccchhhhhhhHHHHhHH
Confidence 55678999999999999999999999999987654 7899999999999999999999999999999999999999999
Q ss_pred HHHHHHHHhhhhcCCCCC
Q psy980 240 LLIIARISLMDNRYPTAG 257 (587)
Q Consensus 240 ~l~A~~r~e~~~~~~~~~ 257 (587)
+++|.+|||++.+|.-++
T Consensus 325 ll~A~~~gdlk~l~s~~~ 342 (362)
T KOG2443|consen 325 LLMAYWRGDLKVLWSFDE 342 (362)
T ss_pred HHHHHHccchHhhhCccc
Confidence 999999999999999885
No 2
>KOG2443|consensus
Probab=100.00 E-value=9.2e-59 Score=475.91 Aligned_cols=232 Identities=49% Similarity=0.832 Sum_probs=211.4
Q ss_pred HHHHhhhccccceeeeeccCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccCCccCCccccccccCCCccccccCCCCC
Q psy980 314 IFVMHVFKHAQPALLYLFFSKDMINLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSFD 393 (587)
Q Consensus 314 ~~~~~~~~~~~l~l~fKf~~~~~IN~lLt~YF~ilGv~AL~~~l~p~i~~l~p~~~p~~~~~~~~~~~~~~~~k~~~~f~ 393 (587)
.|.+..|++.+++++||.++++++| +++.||++.|+.|+.+++.|+++...+...|+.+||.-+.++.++ |+
T Consensus 71 fPi~~s~tLl~lyl~fk~ls~~~~~-ll~~~ff~~g~~al~~~~~p~~~~~~~~l~p~~~~~~~~~~~~~~--~~----- 142 (362)
T KOG2443|consen 71 FPIIGSCTLLLLYLLFKPLSKELIN-LLTMYFFFLGVIALLSLLDPFINAFKFLLLPMCQYHLLFPRGPGE--KK----- 142 (362)
T ss_pred CCcccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhHHHHhhhhhhcCccchhheeeeccCCc--cc-----
Confidence 7889999999999999999999999 999999999999999999999997777889999998655554322 10
Q ss_pred CCCCccccccccccceeeEEEEecccchhhhhHHHhhhhhhhccccchHHHHHHHHHHHHHHhhcccCchHHHHHHHHHH
Q psy980 394 RGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGL 473 (587)
Q Consensus 394 ~~~~~~~~~~~~e~~~~~~~~~f~~~~~~~~~~~~~~~~~y~~~~~Wil~Nilgi~f~i~~i~~l~l~s~k~g~iLL~gL 473 (587)
..+++.+|+..|++|++.|+.+++||+.+|||++||++|++||+++|+.+||||+|+|++||.||
T Consensus 143 ---------------~~~~~~~Ft~~~iv~~vls~~i~v~~ll~~HWl~nN~lgms~~I~~I~~lrL~s~ktgalLL~gL 207 (362)
T KOG2443|consen 143 ---------------EFICNGKFTRAQIVALVLSSMIVVWYLLTKHWLANNLLGMSFCIAGIEFLRLPSLKTGALLLGGL 207 (362)
T ss_pred ---------------ceeecccccHHHHHHHHHHHHHHHHHHhhhHHHHHhHHHHHHHHHHHHHhcccchhhhHHHHHHH
Confidence 13578999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhhheeeEeeeceeehhhccCCCceeecccc-------------cCccc---------------------------hhHH
Q psy980 474 FIYDIFWVFGTNVMVTVAKSFEAPIKYVHES-------------FKGLT---------------------------QWFS 513 (587)
Q Consensus 474 F~YDIFwVFgt~VMvtVA~~~d~Pikl~~p~-------------~~gl~---------------------------~~~~ 513 (587)
|+|||||||||||||||||++|+|+|+++|+ +.|++ ||..
T Consensus 208 ffYDIfwVFgTnVMVtVAt~~D~PikL~fP~~l~~~~~~as~fsMLGLGDIviPGiflAl~lRfD~~k~~~s~~~~YF~~ 287 (362)
T KOG2443|consen 208 FFYDIFWVFGTNVMVTVATSLDAPIKLVFPQKLLFPGLTASNFSMLGLGDIVIPGIFLALVLRFDIRKKRNSKVRTYFHN 287 (362)
T ss_pred HHHheEEEecCceEEEeecccCCceEEecchhhccCCCccccceeccccchhhHHHHHHHHHHhhHHHHhcccCceEEEE
Confidence 9999999999999999999999999999984 12332 2335
Q ss_pred HHHHHHHHHHHHHHHHHhhccCCceeEeeccccchHHHHHHHhhchHHHHhcccc
Q psy980 514 NFFAWHLGLMATIFVMHVFKHAQPALLYLVPACLGLPLLVALVKGDLSALINVVV 568 (587)
Q Consensus 514 ~~~~Y~~GL~~T~~vm~~fk~aQPALLYLVP~~l~~~~~~A~~rgEl~~~w~~~~ 568 (587)
+++||.+||.+|+++||+||+|||||||+||+|+++++++|+.|||+|++|++.+
T Consensus 288 t~i~Y~~gL~~ti~~~~~FkaAQPALLYlVP~~l~~~ll~A~~~gdlk~l~s~~~ 342 (362)
T KOG2443|consen 288 TFIAYFLGLLTTIVVLHIFKAAQPALLYLVPACLGPLLLMAYWRGDLKVLWSFDE 342 (362)
T ss_pred eHHHHHhhhHHHhhhhhhhhccchhhhhhhHHHHhHHHHHHHHccchHhhhCccc
Confidence 6899999999999999999999999999999999999999999999999999976
No 3
>PF04258 Peptidase_A22B: Signal peptide peptidase; InterPro: IPR007369 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain aspartic endopeptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22B. The peptidases were originally classified by hierarchical homology to the most conserved member - IMPAS 1. They are also known as signal peptide peptidase (SPP) []. They belong to the I-CliP family of peptidases. SPP cleaves cleaves remnant signal peptides left behind in the membrane by the action of signal peptidase and also plays key roles in immune surveillance and the maturation of certain viral proteins []. SPPs do not require cofactors as demonstrated by expression in bacteria and purification of a proteolytically active form. The C-terminal region defines the functional domain, which is in itself sufficient for proteolytic activity []. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 1JUF_C 1INQ_C.
Probab=100.00 E-value=4.7e-57 Score=465.44 Aligned_cols=239 Identities=44% Similarity=0.793 Sum_probs=0.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhccccccCc----cCCcceEEeeccCCCccccccccccceeEEEEEeccchhHHHHHHH
Q psy980 4 LLLTGYFFLLGVLALCYLLSPVISPLVPAA----IPNIPFHLKFDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCS 79 (587)
Q Consensus 4 ~ll~~Yf~~lGv~al~~~l~p~~~~~~p~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~dl~~~~~s~ 79 (587)
.+++.||++.|+.|+++.+.|.+.+..+.. .++.+.++..+++.. +.++.+++..|+++..+|+
T Consensus 43 ~vl~~~f~~~gv~a~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~l~~~~~s~ 110 (298)
T PF04258_consen 43 YVLTVYFCLAGVIALSFLILPFLTYIFPFFPCRSFPWKKWKLSFPRRDI------------KPFSGSFTLSDLLSFLISL 110 (298)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccceEEEEEEcccc------------cceeeeeeHHHHHHHHHHH
Confidence 499999999999999999999998866532 222233221111110 2467889999999999999
Q ss_pred HHHhh-hhccc-cchHHhHHHHHHHHHHHHhcccChhHHHHHHHHHHhhhheeeEe------ccchhhhhccc-CCC---
Q psy980 80 VFGSW-YLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVF------GTNVMVTVAKS-FEA--- 147 (587)
Q Consensus 80 ~~~~~-y~~~~-~WilnNilgi~~~i~~i~~l~l~s~k~g~iLL~gLf~YDIFwVF------gtsVMvtVA~~-~d~--- 147 (587)
.++++ |+.++ ||++||++|+|+|+++++.+|+||+|+++++|+|+|+||||||| |+|||++|||+ .|.
T Consensus 111 ~~~~~w~~~~~~~W~l~nilgi~~~i~~i~~i~l~s~k~~~ilL~~lf~YDif~VF~s~~~~g~svM~~VA~~~~~~~~~ 190 (298)
T PF04258_consen 111 AIAVVWYVYRNEHWILQNILGICFCINIISLIRLPSFKTATILLIGLFLYDIFWVFISPYFFGTSVMVTVATGGFDAPEK 190 (298)
T ss_dssp ----S---------------------------------------------------------------------------
T ss_pred HHHHHHHHhccchHHHHhHHHHHHHHHHHHheeccchHHHHHHHHHHHHHHHHHHHhccccCCCchhhhhhccccccccC
Confidence 88775 66677 99999999999999999999999999999999999999999999 99999999999 888
Q ss_pred -CeEEEecccccccCccCCccccccCCcchhhHHHHHHHHHhhhccCCCCccchhHhHHHHHHHHHHHHHHHHHhCCCcc
Q psy980 148 -PIKLVFPQDLLEHGVSANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQP 226 (587)
Q Consensus 148 -PikL~~P~~~~~~~~~~~~~smLGLGDIViPGifIa~~lRFD~~~~~~~~~YF~~s~igY~lGL~~T~~vm~~~~~aQP 226 (587)
|+|+++|+..+.++..++++||||+||||+||+|+++|+|||.++++++++||.++++||++||++|+++++++|+|||
T Consensus 191 ~P~~l~~P~~~~~~~~~~~~~s~LGlGDIviPGl~i~~~~rfD~~~~~~~~~Yf~~~~~~Y~~Gl~~t~~~~~~~~~~QP 270 (298)
T PF04258_consen 191 LPIKLQFPRFFDSNSSCPKPFSMLGLGDIVIPGLFIAFCLRFDKSRNKSRKPYFIASLIGYALGLLLTFVALHLFKHGQP 270 (298)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CCeEEEEeccccccccCCCCeeEeccchHHHHHHHHHHHHHhhHhhccccchHHHHHHHHHHHHHHHHHHHHHHhCCCCC
Confidence 9999999986533346789999999999999999999999999986567789999999999999999999999999999
Q ss_pred eehhhhhhhhHHHHHHHHHHHhhhhcCC
Q psy980 227 ALLYLVPACLGLPLLIIARISLMDNRYP 254 (587)
Q Consensus 227 ALLYLVP~~l~~~~l~A~~r~e~~~~~~ 254 (587)
||||+||+|+++++++|++|||++..|+
T Consensus 271 ALlylvP~~l~~~~~~a~~r~el~~~w~ 298 (298)
T PF04258_consen 271 ALLYLVPCTLGSVLLVAWIRGELKDFWN 298 (298)
T ss_dssp ----------------------------
T ss_pred eehHHHHHHHHHHHHHHHHhhHHHHhhC
Confidence 9999999999999999999999999985
No 4
>KOG2442|consensus
Probab=100.00 E-value=2.9e-46 Score=396.38 Aligned_cols=237 Identities=31% Similarity=0.539 Sum_probs=194.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhccccccCccCCcceEEeeccCCCccccccccccceeEEEEEeccchhHHHHHHHHH-H
Q psy980 4 LLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVF-G 82 (587)
Q Consensus 4 ~ll~~Yf~~lGv~al~~~l~p~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~dl~~~~~s~~~-~ 82 (587)
++++..|++-|.-++-..+.|.+.+....... ++.. .++.+ + .....++-.++++.+ +
T Consensus 269 ~v~iiif~i~g~~gLy~Cl~~lv~r~~~~~~~--~~~~-~~~l~----------------~--~~~~~l~l~~~Cia~aV 327 (541)
T KOG2442|consen 269 YVLIIIFCIGGAQGLYNCLAALVHRLPYGAAR--FPTL-APRLG----------------N--MSYRLLFLSILCIAVAV 327 (541)
T ss_pred HHHhhheeecccchHHHHHHHHHhhhhhhccc--cccc-ccccC----------------C--hhHHHHHHHHhhhheeE
Confidence 45667777888888888888888765533211 0000 00100 0 011234444555555 4
Q ss_pred hhhhccc---cchHHhHHHHHHHHHHHHhcccChhHHHHHHHHHHhhhheeeEe--------ccchhhhhcccC-----C
Q psy980 83 SWYLVKK---HWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVF--------GTNVMVTVAKSF-----E 146 (587)
Q Consensus 83 ~~y~~~~---~WilnNilgi~~~i~~i~~l~l~s~k~g~iLL~gLf~YDIFwVF--------gtsVMvtVA~~~-----d 146 (587)
+|.++|+ .|++||++|||+|++.++.+|+||+|++++||..+|+||||||| |+|||+.||++- .
T Consensus 328 ~W~v~R~e~~AwilqDvLGIalci~vLk~vRLPnlK~~tiLL~c~f~YDiF~VFitp~~t~~geSVMieVA~G~~s~~Ek 407 (541)
T KOG2442|consen 328 VWAVFRNEDWAWILQDVLGIALCITVLKTVRLPNLKVCTILLLCLFLYDIFFVFITPFITKNGESVMIEVARGPSSTEEK 407 (541)
T ss_pred EEEEeecCchHHHHHhhHhHHHHHHHHHHhcCCchhHHHHHHHHHHHHhhheeeeehhhccCCceEEEEEecCCCCCCCC
Confidence 5777777 59999999999999999999999999999999999999999999 568999999973 5
Q ss_pred CCeEEEecccccc-cCccCCccccccCCcchhhHHHHHHHHHhhhccCCCCccchhHhHHHHHHHHHHHHHHHHHhCCCc
Q psy980 147 APIKLVFPQDLLE-HGVSANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQ 225 (587)
Q Consensus 147 ~PikL~~P~~~~~-~~~~~~~~smLGLGDIViPGifIa~~lRFD~~~~~~~~~YF~~s~igY~lGL~~T~~vm~~~~~aQ 225 (587)
.|+.+++||...+ ...+.++|||||+|||++||++||+|+|||...+|.++.||.++++||.+||++|++++..+|.||
T Consensus 408 iPMlLkVPrl~~s~~~~~~~~~silGFGDIl~PGlLVa~c~RfD~~~~~~~~iYfv~~tvaYgiGLlvTfvaL~LM~~GQ 487 (541)
T KOG2442|consen 408 IPMLLKVPRLFFSVLSDPWGGYSILGFGDILVPGLLVAFCLRFDVQVNSVSNIYFVWSTVAYGIGLLVTFVALVLMKGGQ 487 (541)
T ss_pred cceEEEcchhccccccccCCCeeEeeecccccchHHHHHHHHhhhhccccceeEEehhHHHHHHHHHHHHHHHHHhcCCC
Confidence 8999999997622 123678999999999999999999999999998766889999999999999999999999999999
Q ss_pred ceehhhhhhhhHHHHHHHHHHHhhhhcCCCCCCccc
Q psy980 226 PALLYLVPACLGLPLLIIARISLMDNRYPTAGQRSH 261 (587)
Q Consensus 226 PALLYLVP~~l~~~~l~A~~r~e~~~~~~~~~~~s~ 261 (587)
||||||||++|++.+.+|++|||+++.|.+.+..++
T Consensus 488 PALLYLVP~TL~t~~~lal~R~El~~fWtg~~~~~~ 523 (541)
T KOG2442|consen 488 PALLYLVPCTLGTAVVLALCRGELKKFWTGGSYQKM 523 (541)
T ss_pred ceEEEEechHHHHHHHHHHHHHHHHHHhccCCcccC
Confidence 999999999999999999999999999999976554
No 5
>PF04258 Peptidase_A22B: Signal peptide peptidase; InterPro: IPR007369 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain aspartic endopeptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22B. The peptidases were originally classified by hierarchical homology to the most conserved member - IMPAS 1. They are also known as signal peptide peptidase (SPP) []. They belong to the I-CliP family of peptidases. SPP cleaves cleaves remnant signal peptides left behind in the membrane by the action of signal peptidase and also plays key roles in immune surveillance and the maturation of certain viral proteins []. SPPs do not require cofactors as demonstrated by expression in bacteria and purification of a proteolytically active form. The C-terminal region defines the functional domain, which is in itself sufficient for proteolytic activity []. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 1JUF_C 1INQ_C.
Probab=100.00 E-value=8.9e-46 Score=381.79 Aligned_cols=230 Identities=40% Similarity=0.737 Sum_probs=0.5
Q ss_pred HHHHhhhccccceeeeeccCHHHHHH--HHHHHHHHHHHHHHHHhhhhhcccccCCccCCccccccccCCCccccccCCC
Q psy980 314 IFVMHVFKHAQPALLYLFFSKDMINL--LLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGS 391 (587)
Q Consensus 314 ~~~~~~~~~~~l~l~fKf~~~~~IN~--lLt~YF~ilGv~AL~~~l~p~i~~l~p~~~p~~~~~~~~~~~~~~~~k~~~~ 391 (587)
.+++-++.+.+++.+||+++++++|. ++++||+++|+.|+.+.+.|.+.+..+.. +. ++.+..+ .+. .
T Consensus 16 fpv~~S~~L~gLY~~~k~l~~~~i~~~~vl~~~f~~~gv~a~~~~i~~~l~~~~~~~-~~--~~~~~~~-----~~~--~ 85 (298)
T PF04258_consen 16 FPVVASCVLLGLYFFFKYLDKDLINIIYVLTVYFCLAGVIALSFLILPFLTYIFPFF-PC--RSFPWKK-----WKL--S 85 (298)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--cccccce-----EEE--E
Confidence 67788888999999999999999997 99999999999999999998877654321 00 0000000 000 0
Q ss_pred CCCCCCccccccccccceeeEEEEecccchhhhhHHHhhhhh-hhccc-cchHHHHHHHHHHHHHHhhcccCchHHHHHH
Q psy980 392 FDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVFGSW-YLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVIL 469 (587)
Q Consensus 392 f~~~~~~~~~~~~~e~~~~~~~~~f~~~~~~~~~~~~~~~~~-y~~~~-~Wil~Nilgi~f~i~~i~~l~l~s~k~g~iL 469 (587)
.++ ++. +.++.+++..|+++..+|..++++ |..++ ||++||++|+|+|+++++.+|+||+|+|+++
T Consensus 86 ~~~-------~~~-----~~~~~~~~~~~l~~~~~s~~~~~~w~~~~~~~W~l~nilgi~~~i~~i~~i~l~s~k~~~il 153 (298)
T PF04258_consen 86 FPR-------RDI-----KPFSGSFTLSDLLSFLISLAIAVVWYVYRNEHWILQNILGICFCINIISLIRLPSFKTATIL 153 (298)
T ss_dssp -----------------------------------------S--------------------------------------
T ss_pred EEc-------ccc-----cceeeeeeHHHHHHHHHHHHHHHHHHHhccchHHHHhHHHHHHHHHHHHheeccchHHHHHH
Confidence 000 000 125678888999999999998765 66677 9999999999999999999999999999999
Q ss_pred HHHHHhhheeeEe------eeceeehhhcc-CCC----ceeecccccC-------------ccc----------------
Q psy980 470 LCGLFIYDIFWVF------GTNVMVTVAKS-FEA----PIKYVHESFK-------------GLT---------------- 509 (587)
Q Consensus 470 L~gLF~YDIFwVF------gt~VMvtVA~~-~d~----Pikl~~p~~~-------------gl~---------------- 509 (587)
|+|+|+||||||| |||||++|||+ .|+ |+|+++|+.. |++
T Consensus 154 L~~lf~YDif~VF~s~~~~g~svM~~VA~~~~~~~~~~P~~l~~P~~~~~~~~~~~~~s~LGlGDIviPGl~i~~~~rfD 233 (298)
T PF04258_consen 154 LIGLFLYDIFWVFISPYFFGTSVMVTVATGGFDAPEKLPIKLQFPRFFDSNSSCPKPFSMLGLGDIVIPGLFIAFCLRFD 233 (298)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHhccccCCCchhhhhhccccccccCCCeEEEEeccccccccCCCCeeEeccchHHHHHHHHHHHHHhh
Confidence 9999999999999 99999999999 898 9999998651 221
Q ss_pred ---------hhHHHHHHHHHHHHHHHHHHHhhccCCceeEeeccccchHHHHHHHhhchHHHHhc
Q psy980 510 ---------QWFSNFFAWHLGLMATIFVMHVFKHAQPALLYLVPACLGLPLLVALVKGDLSALIN 565 (587)
Q Consensus 510 ---------~~~~~~~~Y~~GL~~T~~vm~~fk~aQPALLYLVP~~l~~~~~~A~~rgEl~~~w~ 565 (587)
||.++++||++||++|+++++.+|+||||||||||+|+++++++|+.|||++++||
T Consensus 234 ~~~~~~~~~Yf~~~~~~Y~~Gl~~t~~~~~~~~~~QPALlylvP~~l~~~~~~a~~r~el~~~w~ 298 (298)
T PF04258_consen 234 KSRNKSRKPYFIASLIGYALGLLLTFVALHLFKHGQPALLYLVPCTLGSVLLVAWIRGELKDFWN 298 (298)
T ss_dssp -----------------------------------------------------------------
T ss_pred HhhccccchHHHHHHHHHHHHHHHHHHHHHHhCCCCCeehHHHHHHHHHHHHHHHHhhHHHHhhC
Confidence 34467899999999999999999999999999999999999999999999999997
No 6
>smart00730 PSN Presenilin, signal peptide peptidase, family. Presenilin 1 and presenilin 2 are polytopic membrane proteins, whose genes are mutated in some individuals with Alzheimer's disease. Distant homologues, present in eukaryotes and archaea, also contain conserved aspartic acid residues which are predicted to contribute to catalysis. At least one member of this family has been shown to possess signal peptide peptidase activity.
Probab=100.00 E-value=5.6e-42 Score=345.94 Aligned_cols=207 Identities=40% Similarity=0.643 Sum_probs=175.2
Q ss_pred hHHHHHHHHHHHHHHHHHhhhhhccccccCccCCcceEEeeccCCCccccccccccceeEEEEEeccchhHHHHHHHHHH
Q psy980 3 NLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVFG 82 (587)
Q Consensus 3 n~ll~~Yf~~lGv~al~~~l~p~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~dl~~~~~s~~~~ 82 (587)
|+.++.||++.|+.++...+.|..... . + ....+.......+.+
T Consensus 32 ~i~~~~~f~~~~~~~~~~~~~~~~~~~------------------~-~-----------------~~~~~~~~~~~~v~~ 75 (249)
T smart00730 32 VIVLVIYFSSLGVLFLYSLLYPLEVFR------------------V-D-----------------YPTLLILLLNFAVVG 75 (249)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH------------------H-h-----------------HHHHHHHHHHHHHHH
Confidence 788999999999999999988766533 0 0 012233333444667
Q ss_pred hhhhccc-cchHHhHHHHHHHHHHHHhcccChhHHHHHHHHHHhhhheeeEecc----chhhhhcccCCCC-----eEEE
Q psy980 83 SWYLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVFGT----NVMVTVAKSFEAP-----IKLV 152 (587)
Q Consensus 83 ~~y~~~~-~WilnNilgi~~~i~~i~~l~l~s~k~g~iLL~gLf~YDIFwVFgt----sVMvtVA~~~d~P-----ikL~ 152 (587)
+|+.++| ||+.||++|+++|+.+++.+++||+|++++||+++++||+||||+| +|||+|||+.|.| .++.
T Consensus 76 ~~~~~~~~~w~~~~~lgi~~~~~~~~~~~l~~~~~~~iLL~~l~iYDif~Vf~t~~~~~vMv~vA~~~~~~~~~~P~ll~ 155 (249)
T smart00730 76 FWCIHRKGAWIQQDLIGISLCMAILFILRLPSEWTAWILLGALFIYDIFAVFGTPGPLRVMVEVATGRDEPIKVFPALLY 155 (249)
T ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHheeecCCCCchHHhhHhccCCCCcccCChhhc
Confidence 7888887 9999999999999999999999999999999999999999999999 9999999998754 5555
Q ss_pred eccccccc-CccCCccccccCCcchhhHHHHHHHHHhhhccCCCCccchhHhHHHHHHHHHHHHHHHHHhCCCcceehhh
Q psy980 153 FPQDLLEH-GVSANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYL 231 (587)
Q Consensus 153 ~P~~~~~~-~~~~~~~smLGLGDIViPGifIa~~lRFD~~~~~~~~~YF~~s~igY~lGL~~T~~vm~~~~~aQPALLYL 231 (587)
.|+...+. +.+++++++||+||||+||+++++|+|||.+. ++++.||.+|++||.+||+.|+++++.+|+|||||+|+
T Consensus 156 ~p~~~~~~~~~~~~~~~~LGLGDiv~Pgilv~~a~~fd~~~-~~~~~yf~~~~~ay~~GL~~t~~~l~~~~~aqPALlyl 234 (249)
T smart00730 156 VPRLVVSFEDDEEGRFSMLGLGDIVFPGILVASAARFDVSV-RSDSNYFLACFVAYGIGLILTLVLLALFKKAQPALPYL 234 (249)
T ss_pred ccccccccccCCCCccceecCCCeeeHHHHHHHHHHhhhcc-cCCcccHHHHHHHHHHHHHHHHHHHHHhCCCCccHHHH
Confidence 66532111 22457899999999999999999999999875 34678999999999999999999999999999999999
Q ss_pred hhhhhHHHHHHHHHH
Q psy980 232 VPACLGLPLLIIARI 246 (587)
Q Consensus 232 VP~~l~~~~l~A~~r 246 (587)
||+++++.++.|+.|
T Consensus 235 vp~~l~~~~~~~~~r 249 (249)
T smart00730 235 VPFTLVFYLLTALLR 249 (249)
T ss_pred HHHHHHHHHHHHHhC
Confidence 999999999999875
No 7
>KOG2442|consensus
Probab=100.00 E-value=1.1e-38 Score=338.80 Aligned_cols=213 Identities=30% Similarity=0.496 Sum_probs=161.2
Q ss_pred ceeeeeccCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccCCccCCccccccccCCCccccccCCCCCCCCCccccccc
Q psy980 325 PALLYLFFSKDMINLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSFDRGATNEEKKDG 404 (587)
Q Consensus 325 l~l~fKf~~~~~IN~lLt~YF~ilGv~AL~~~l~p~i~~l~p~~~p~~~~~~~~~~~~~~~~k~~~~f~~~~~~~~~~~~ 404 (587)
|+++|+|.| +.-.+++..|++.|+-+|.+|+++.+++.....- +.. .+ .++...
T Consensus 257 LvLlyfF~~--~~V~v~iiif~i~g~~gLy~Cl~~lv~r~~~~~~----------~~~----~~---~~~l~~------- 310 (541)
T KOG2442|consen 257 LVLLYFFYS--YLVYVLIIIFCIGGAQGLYNCLAALVHRLPYGAA----------RFP----TL---APRLGN------- 310 (541)
T ss_pred HHHHHHHHH--HHHHHHhhheeecccchHHHHHHHHHhhhhhhcc----------ccc----cc---ccccCC-------
Confidence 677777744 5778999999999999999999999887532110 000 00 000000
Q ss_pred cccceeeEEEEecccchhhhhHHHhhhhhhhccccchHHHHHHHHHHHHHHhhcccCchHHHHHHHHHHHhhheeeEeee
Q psy980 405 SEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVFGT 484 (587)
Q Consensus 405 ~e~~~~~~~~~f~~~~~~~~~~~~~~~~~y~~~~~Wil~Nilgi~f~i~~i~~l~l~s~k~g~iLL~gLF~YDIFwVFgt 484 (587)
.+++.-...++|+.+|+.+.++......|++||++|||+|++.++.+|+||+|+|++||+++|+|||||||+|
T Consensus 311 -------~~~~~l~l~~~Cia~aV~W~v~R~e~~AwilqDvLGIalci~vLk~vRLPnlK~~tiLL~c~f~YDiF~VFit 383 (541)
T KOG2442|consen 311 -------MSYRLLFLSILCIAVAVVWAVFRNEDWAWILQDVLGIALCITVLKTVRLPNLKVCTILLLCLFLYDIFFVFIT 383 (541)
T ss_pred -------hhHHHHHHHHhhhheeEEEEEeecCchHHHHHhhHhHHHHHHHHHHhcCCchhHHHHHHHHHHHHhhheeeee
Confidence 0111111233344343333333334457999999999999999999999999999999999999999999964
Q ss_pred --------ceeehhhccCC-----CceeecccccC----------------------cc-----------------chhH
Q psy980 485 --------NVMVTVAKSFE-----APIKYVHESFK----------------------GL-----------------TQWF 512 (587)
Q Consensus 485 --------~VMvtVA~~~d-----~Pikl~~p~~~----------------------gl-----------------~~~~ 512 (587)
|||++||+|.+ .|+-+-.|++. |+ .||+
T Consensus 384 p~~t~~geSVMieVA~G~~s~~EkiPMlLkVPrl~~s~~~~~~~~~silGFGDIl~PGlLVa~c~RfD~~~~~~~~iYfv 463 (541)
T KOG2442|consen 384 PFITKNGESVMIEVARGPSSTEEKIPMLLKVPRLFFSVLSDPWGGYSILGFGDILVPGLLVAFCLRFDVQVNSVSNIYFV 463 (541)
T ss_pred hhhccCCceEEEEEecCCCCCCCCcceEEEcchhccccccccCCCeeEeeecccccchHHHHHHHHhhhhccccceeEEe
Confidence 89999999974 58766555421 21 2456
Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCceeEeeccccchHHHHHHHhhchHHHHhcccccc
Q psy980 513 SNFFAWHLGLMATIFVMHVFKHAQPALLYLVPACLGLPLLVALVKGDLSALINVVVSE 570 (587)
Q Consensus 513 ~~~~~Y~~GL~~T~~vm~~fk~aQPALLYLVP~~l~~~~~~A~~rgEl~~~w~~~~~~ 570 (587)
.+++||.+||++|++++.++|.|||||||||||+|++...+|++|||++++||+...|
T Consensus 464 ~~tvaYgiGLlvTfvaL~LM~~GQPALLYLVP~TL~t~~~lal~R~El~~fWtg~~~~ 521 (541)
T KOG2442|consen 464 WSTVAYGIGLLVTFVALVLMKGGQPALLYLVPCTLGTAVVLALCRGELKKFWTGGSYQ 521 (541)
T ss_pred hhHHHHHHHHHHHHHHHHHhcCCCceEEEEechHHHHHHHHHHHHHHHHHHhccCCcc
Confidence 7899999999999999999999999999999999999999999999999999997654
No 8
>smart00730 PSN Presenilin, signal peptide peptidase, family. Presenilin 1 and presenilin 2 are polytopic membrane proteins, whose genes are mutated in some individuals with Alzheimer's disease. Distant homologues, present in eukaryotes and archaea, also contain conserved aspartic acid residues which are predicted to contribute to catalysis. At least one member of this family has been shown to possess signal peptide peptidase activity.
Probab=99.97 E-value=4.9e-31 Score=266.57 Aligned_cols=181 Identities=39% Similarity=0.624 Sum_probs=146.5
Q ss_pred ceeeeeccCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccCCccCCccccccccCCCccccccCCCCCCCCCccccccc
Q psy980 325 PALLYLFFSKDMINLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSFDRGATNEEKKDG 404 (587)
Q Consensus 325 l~l~fKf~~~~~IN~lLt~YF~ilGv~AL~~~l~p~i~~l~p~~~p~~~~~~~~~~~~~~~~k~~~~f~~~~~~~~~~~~ 404 (587)
++++||+.. +.+..+..||++.|+.++...+.+...... .
T Consensus 21 Ll~Ly~~~~--~~~i~~~~~f~~~~~~~~~~~~~~~~~~~~--------------~------------------------ 60 (249)
T smart00730 21 LVLLYKFFK--YLVIVLVIYFSSLGVLFLYSLLYPLEVFRV--------------D------------------------ 60 (249)
T ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------h------------------------
Confidence 567777754 348999999999999999999887533210 0
Q ss_pred cccceeeEEEEecccchhhhhHHHhhhhhhhccc-cchHHHHHHHHHHHHHHhhcccCchHHHHHHHHHHHhhheeeEee
Q psy980 405 SEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVFG 483 (587)
Q Consensus 405 ~e~~~~~~~~~f~~~~~~~~~~~~~~~~~y~~~~-~Wil~Nilgi~f~i~~i~~l~l~s~k~g~iLL~gLF~YDIFwVFg 483 (587)
....+......++.++|+..+| ||+.||++|+++|+++++.+|+||+|++++||+++|+|||||||+
T Consensus 61 ------------~~~~~~~~~~~~v~~~~~~~~~~~w~~~~~lgi~~~~~~~~~~~l~~~~~~~iLL~~l~iYDif~Vf~ 128 (249)
T smart00730 61 ------------YPTLLILLLNFAVVGFWCIHRKGAWIQQDLIGISLCMAILFILRLPSEWTAWILLGALFIYDIFAVFG 128 (249)
T ss_pred ------------HHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHheeec
Confidence 0011223333445567888887 999999999999999999999999999999999999999999999
Q ss_pred e----ceeehhhccCCCce-----eecccc--------------cCccc------------------------hhHHHHH
Q psy980 484 T----NVMVTVAKSFEAPI-----KYVHES--------------FKGLT------------------------QWFSNFF 516 (587)
Q Consensus 484 t----~VMvtVA~~~d~Pi-----kl~~p~--------------~~gl~------------------------~~~~~~~ 516 (587)
| +|||||||+.|.|+ ++..|+ +.|++ ++..+++
T Consensus 129 t~~~~~vMv~vA~~~~~~~~~~P~ll~~p~~~~~~~~~~~~~~~~LGLGDiv~Pgilv~~a~~fd~~~~~~~~yf~~~~~ 208 (249)
T smart00730 129 TPGPLRVMVEVATGRDEPIKVFPALLYVPRLVVSFEDDEEGRFSMLGLGDIVFPGILVASAARFDVSVRSDSNYFLACFV 208 (249)
T ss_pred CCCCchHHhhHhccCCCCcccCChhhcccccccccccCCCCccceecCCCeeeHHHHHHHHHHhhhcccCCcccHHHHHH
Confidence 9 99999999998763 333342 12332 2456789
Q ss_pred HHHHHHHHHHHHHHhhccCCceeEeeccccchHHHHHHHhh
Q psy980 517 AWHLGLMATIFVMHVFKHAQPALLYLVPACLGLPLLVALVK 557 (587)
Q Consensus 517 ~Y~~GL~~T~~vm~~fk~aQPALLYLVP~~l~~~~~~A~~r 557 (587)
||.+||+.|+++++.+|+|||||+|+||+++++.+++|+.|
T Consensus 209 ay~~GL~~t~~~l~~~~~aqPALlylvp~~l~~~~~~~~~r 249 (249)
T smart00730 209 AYGIGLILTLVLLALFKKAQPALPYLVPFTLVFYLLTALLR 249 (249)
T ss_pred HHHHHHHHHHHHHHHhCCCCccHHHHHHHHHHHHHHHHHhC
Confidence 99999999999999999999999999999999999999976
No 9
>PF06550 DUF1119: Protein of unknown function (DUF1119); InterPro: IPR010545 This family consists of several hypothetical archaeal proteins of unknown function.
Probab=99.54 E-value=8.5e-14 Score=142.21 Aligned_cols=173 Identities=23% Similarity=0.308 Sum_probs=130.9
Q ss_pred chhHHHHHHHHHHhhhhccccchHHhHHHHHHHH--HHHHhcccChhHHHHHHHHHHhhhheeeEeccchhhhhccc---
Q psy980 70 HDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAV--NGIELLHLNNVMIGVILLCGLFIYDIFWVFGTNVMVTVAKS--- 144 (587)
Q Consensus 70 ~dl~~~~~s~~~~~~y~~~~~WilnNilgi~~~i--~~i~~l~l~s~k~g~iLL~gLf~YDIFwVFgtsVMvtVA~~--- 144 (587)
.+.++...++.+..+.+..+.|+..|+.|+..+. .++-.+++ +.-.+.+||..+-+||..-||+|++|++.|++
T Consensus 91 ~~~~a~~~ai~~~~~L~~ypEWYviD~~Gil~~aG~aaiFGISl-~~lpaiiLL~iLAVYDaISVYkTkHMltLAegv~d 169 (283)
T PF06550_consen 91 VNIIALILAIALTALLYKYPEWYVIDIAGILMGAGAAAIFGISL-GILPAIILLAILAVYDAISVYKTKHMLTLAEGVMD 169 (283)
T ss_pred HHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHhHHHHHHhhhc-cHHHHHHHHHHHHHhhhhheecchHHHHHHHHHhc
Confidence 4567888888888888899999999999999974 46677888 66677999999999999999999999999997
Q ss_pred CCCCeEEEecccccc----cCc------cCCccccccCCcchhhHHHHHHHHHhhhccCCCCccchhHhHHHHHHHHHHH
Q psy980 145 FEAPIKLVFPQDLLE----HGV------SANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMAT 214 (587)
Q Consensus 145 ~d~PikL~~P~~~~~----~~~------~~~~~smLGLGDIViPGifIa~~lRFD~~~~~~~~~YF~~s~igY~lGL~~T 214 (587)
.+.|+.+++|+.... .+. ++|+-.++|+||.++|.++++-+..|...-.-..-.++ .++=.+|-.+.
T Consensus 170 ~klPilfViP~~~~ySf~~~~~~~~~~~~~r~a~fiGlGD~vmPtILVvSa~~f~~~~~~~~~~lp---alga~~Gtl~g 246 (283)
T PF06550_consen 170 LKLPILFVIPKKRGYSFLKDGFDNREEKEERDAFFIGLGDAVMPTILVVSAAFFLSAPILGGLNLP---ALGAMLGTLAG 246 (283)
T ss_pred cCCceEEEEecccCccccccccccccccccccceEeccchhhhHHHHHHHHHHhccccchhhhhHH---HHHHHHHHHHH
Confidence 589999999997531 111 12345699999999999999999999765211000111 13333444444
Q ss_pred HHHHHH---hCCCcceehhhhhhhhHHHHHHHHHH
Q psy980 215 IFVMHV---FKHAQPALLYLVPACLGLPLLIIARI 246 (587)
Q Consensus 215 ~~vm~~---~~~aQPALLYLVP~~l~~~~l~A~~r 246 (587)
+.++.+ .+++||+|.||.-..+++-++-++..
T Consensus 247 l~vL~~~v~kgrp~aGLP~LN~GaI~Gflig~l~s 281 (283)
T PF06550_consen 247 LAVLLRFVMKGRPQAGLPFLNGGAIAGFLIGALAS 281 (283)
T ss_pred HHHHHHHHHcCCCCCCCCccchhHHHHHHHHHHHc
Confidence 444444 58999999999888888877776643
No 10
>COG3389 Uncharacterized protein conserved in archaea [Function unknown]
Probab=99.32 E-value=3.5e-12 Score=126.82 Aligned_cols=161 Identities=24% Similarity=0.361 Sum_probs=120.6
Q ss_pred HHHHHHhhhhccccchHHhHHHHHHH--HHHHHhcccChhHHHHHHHHHHhhhheeeEeccchhhhhccc---CCCCeEE
Q psy980 77 VCSVFGSWYLVKKHWIANNLFGLAFA--VNGIELLHLNNVMIGVILLCGLFIYDIFWVFGTNVMVTVAKS---FEAPIKL 151 (587)
Q Consensus 77 ~s~~~~~~y~~~~~WilnNilgi~~~--i~~i~~l~l~s~k~g~iLL~gLf~YDIFwVFgtsVMvtVA~~---~d~PikL 151 (587)
+++.+....+.++.|+..|..|.+++ +.++-.+++ ....+..+|..+-+||..-|++|.+|++.|++ .|+|+.+
T Consensus 98 ~aI~~~~lL~~~peWyVid~ag~~la~Giaai~GIsf-gv~pavvlL~~lavYDaIsVYkT~HMIslA~~v~d~~lPmlf 176 (277)
T COG3389 98 LAIGLVYLLYKYPEWYVIDLAGFFLAVGIAAIFGISF-GVLPAVVLLIALAVYDAISVYKTRHMISLAEGVMDLDLPMLF 176 (277)
T ss_pred HHHHHHHhhhhccceEEeehHHHHHHhhHHHhheeec-chHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhcCCceEE
Confidence 33333333446789999999999997 455566777 45567999999999999999999999999997 6999999
Q ss_pred Eecccccc----cC---ccCCccccccCCcchhhHHHHHHHHHhhhccCCCCccchh--HhHHHHHHHHHHHHHHHHHhC
Q psy980 152 VFPQDLLE----HG---VSANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFN--TAFLAYFLGLMATIFVMHVFK 222 (587)
Q Consensus 152 ~~P~~~~~----~~---~~~~~~smLGLGDIViPGifIa~~lRFD~~~~~~~~~YF~--~s~igY~lGL~~T~~vm~~~~ 222 (587)
++|+...- .+ -.+++--|+|+||+++|.|+++-+.-|-.+ +..+|. .++.|-.+|+.+- -+..-+
T Consensus 177 viP~~l~ysf~~~~fe~r~dgna~miG~GDavmPsIlVvSaa~f~~s----~~l~f~~Lpal~GglvGl~vL--~~v~r~ 250 (277)
T COG3389 177 VIPENLAYSFVEDAFENRGDGNAYMIGLGDAVMPSILVVSAAFFLIS----PILAFIVLPALAGGLVGLAVL--YFVNRG 250 (277)
T ss_pred EeecccccceeehhhhcCCCCceEEEeechhhcccceeeehHHhccC----CchhhhhHHHHhccHHHHHHH--HHHhcC
Confidence 99975421 11 134566799999999999999999888554 234444 3566667776654 344458
Q ss_pred CCcceehhhhhhhhHHHHHHHH
Q psy980 223 HAQPALLYLVPACLGLPLLIIA 244 (587)
Q Consensus 223 ~aQPALLYLVP~~l~~~~l~A~ 244 (587)
++||.|.|+.-..+.+-++-+.
T Consensus 251 Rp~pGLP~lN~GaIaGflig~V 272 (277)
T COG3389 251 RPHPGLPFLNTGAIAGFLIGFV 272 (277)
T ss_pred CCCCCCceeccchHHHHHHHHH
Confidence 9999999999887766655433
No 11
>PF06550 DUF1119: Protein of unknown function (DUF1119); InterPro: IPR010545 This family consists of several hypothetical archaeal proteins of unknown function.
Probab=97.04 E-value=0.0026 Score=66.19 Aligned_cols=138 Identities=24% Similarity=0.394 Sum_probs=99.8
Q ss_pred chhhhhHHHhhhhhhhccccchHHHHHHHHHHHHHHhh--cccCchHHHHHHHHHHHhhheeeEeeeceeehhhcc---C
Q psy980 420 DIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIEL--LHLNNVMIGVILLCGLFIYDIFWVFGTNVMVTVAKS---F 494 (587)
Q Consensus 420 ~~~~~~~~~~~~~~y~~~~~Wil~Nilgi~f~i~~i~~--l~l~s~k~g~iLL~gLF~YDIFwVFgt~VMvtVA~~---~ 494 (587)
+..+...+.++.++......|+..|+.|+.++...-.. +++ +.-.+.+||..|=+||-.=|++|.=|+|.|++ .
T Consensus 92 ~~~a~~~ai~~~~~L~~ypEWYviD~~Gil~~aG~aaiFGISl-~~lpaiiLL~iLAVYDaISVYkTkHMltLAegv~d~ 170 (283)
T PF06550_consen 92 NIIALILAIALTALLYKYPEWYVIDIAGILMGAGAAAIFGISL-GILPAIILLAILAVYDAISVYKTKHMLTLAEGVMDL 170 (283)
T ss_pred HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHhHHHHHHhhhc-cHHHHHHHHHHHHHhhhhheecchHHHHHHHHHhcc
Confidence 44567777777777778889999999999998664333 444 45667899999999999999999999999998 4
Q ss_pred CCceeeccccc-----------------------Cccch------------hHHH----------HHHHHHHHHHHHHHH
Q psy980 495 EAPIKYVHESF-----------------------KGLTQ------------WFSN----------FFAWHLGLMATIFVM 529 (587)
Q Consensus 495 d~Pikl~~p~~-----------------------~gl~~------------~~~~----------~~~Y~~GL~~T~~vm 529 (587)
+.|+-++-|+- -|+++ ++.. .++=.+|-++.+.++
T Consensus 171 klPilfViP~~~~ySf~~~~~~~~~~~~~r~a~fiGlGD~vmPtILVvSa~~f~~~~~~~~~~lpalga~~Gtl~gl~vL 250 (283)
T PF06550_consen 171 KLPILFVIPKKRGYSFLKDGFDNREEKEERDAFFIGLGDAVMPTILVVSAAFFLSAPILGGLNLPALGAMLGTLAGLAVL 250 (283)
T ss_pred CCceEEEEecccCccccccccccccccccccceEeccchhhhHHHHHHHHHHhccccchhhhhHHHHHHHHHHHHHHHHH
Confidence 68998877731 12221 1100 123344555555555
Q ss_pred Hh---hccCCceeEeeccccchHHHHHHHhhc
Q psy980 530 HV---FKHAQPALLYLVPACLGLPLLVALVKG 558 (587)
Q Consensus 530 ~~---fk~aQPALLYLVP~~l~~~~~~A~~rg 558 (587)
.. .+++||.|=||--+.+++.++-++..|
T Consensus 251 ~~~v~kgrp~aGLP~LN~GaI~Gflig~l~sg 282 (283)
T PF06550_consen 251 LRFVMKGRPQAGLPFLNGGAIAGFLIGALASG 282 (283)
T ss_pred HHHHHcCCCCCCCCccchhHHHHHHHHHHHcC
Confidence 54 599999999998888888777776554
No 12
>PF01080 Presenilin: Presenilin Alzheimer disease; InterPro: IPR001108 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22A, the type example being presenilin 1 from Homo sapiens (Human). Presenilins are polytopic transmembrane (TM) proteins, mutations in which are associated with the occurrence of early-onset familial Alzheimer's disease, a rare form of the disease that results from a single-gene mutation [, ]. The physiological functions of presenilins are unknown, but they may be related to developmental signalling, apoptotic signal transduction, or processing of selected proteins, such as the beta-amyloid precursor protein(beta-APP). There are a number of subtypes which belong to this presenilin family. That presenilin homologues have been identified in species that do not have an Alzhemier's disease correlate suggests that they may have functions unrelated to the disease, homologues having been identified in mouse, Drosophila melanogaster, Caenorhabditis elegans [] and other members of the eukarya including plants. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 2KR6_A.
Probab=92.51 E-value=0.21 Score=54.95 Aligned_cols=56 Identities=29% Similarity=0.538 Sum_probs=45.2
Q ss_pred cccccCCcchhhHHHHHHHHHh-hhccCCCCccchhHhHHHHHHHHHHHHHHHHHhCCCcceeh
Q psy980 167 FAMLGLGDIVVPGIFIALLLRF-DLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALL 229 (587)
Q Consensus 167 ~smLGLGDIViPGifIa~~lRF-D~~~~~~~~~YF~~s~igY~lGL~~T~~vm~~~~~aQPALL 229 (587)
-.-|||||-|+=+++++-+.++ |.. -..+|++|-.+||++|...+..+|++-|||.
T Consensus 325 ~~klGlGDFiFYs~Lvg~aa~~~~~~-------~~~~~~~ail~Gl~~Tl~~l~~~~~alPALP 381 (403)
T PF01080_consen 325 GIKLGLGDFIFYSVLVGRAAMYGDWN-------TVVACFVAILIGLCLTLLLLAIFRKALPALP 381 (403)
T ss_dssp --SS-TTTHHHHHHHHHHHHHH-TTT-------THHHHHHHHHHHHHHHHHHHHHHT-S-SSSS
T ss_pred ceeecchhHHHHHHHHhHHHhcCCHH-------HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCc
Confidence 3479999999999999988876 442 3568999999999999999999999999997
No 13
>COG3389 Uncharacterized protein conserved in archaea [Function unknown]
Probab=89.44 E-value=0.85 Score=46.85 Aligned_cols=80 Identities=23% Similarity=0.376 Sum_probs=61.0
Q ss_pred hhhHHHhhhhhhhccccchHHHHHHHHHHHHHHhh--cccCchHHHHHHHHHHHhhheeeEeeeceeehhhcc---CCCc
Q psy980 423 CFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIEL--LHLNNVMIGVILLCGLFIYDIFWVFGTNVMVTVAKS---FEAP 497 (587)
Q Consensus 423 ~~~~~~~~~~~y~~~~~Wil~Nilgi~f~i~~i~~--l~l~s~k~g~iLL~gLF~YDIFwVFgt~VMvtVA~~---~d~P 497 (587)
+...+..+....+.+..|..-|..|.+++...-.. +++. .-.+.++|..+=+||-.=|.+|.=|++.|++ .|.|
T Consensus 95 si~~aI~~~~lL~~~peWyVid~ag~~la~Giaai~GIsfg-v~pavvlL~~lavYDaIsVYkT~HMIslA~~v~d~~lP 173 (277)
T COG3389 95 SIGLAIGLVYLLYKYPEWYVIDLAGFFLAVGIAAIFGISFG-VLPAVVLLIALAVYDAISVYKTRHMISLAEGVMDLDLP 173 (277)
T ss_pred HHHHHHHHHHhhhhccceEEeehHHHHHHhhHHHhheeecc-hHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhcCCc
Confidence 44455554444445678999999999887654444 4443 4456789999999999999999999999998 5899
Q ss_pred eeeccc
Q psy980 498 IKYVHE 503 (587)
Q Consensus 498 ikl~~p 503 (587)
+-++-|
T Consensus 174 mlfviP 179 (277)
T COG3389 174 MLFVIP 179 (277)
T ss_pred eEEEee
Confidence 998877
No 14
>KOG2736|consensus
Probab=88.96 E-value=0.3 Score=52.98 Aligned_cols=66 Identities=29% Similarity=0.486 Sum_probs=50.2
Q ss_pred cccccCCcchhhHHHHHHHHHhhhccCCCCccchhHhHHHHHHHHHHHHHHHHHhCCCcceehhhhhhhhHHHH
Q psy980 167 FAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYLVPACLGLPL 240 (587)
Q Consensus 167 ~smLGLGDIViPGifIa~~lRFD~~~~~~~~~YF~~s~igY~lGL~~T~~vm~~~~~aQPALLYLVP~~l~~~~ 240 (587)
---||+||-|+=.+++.-+.-+|.. .==++|++|-.+||++|...+.++|++-|||. .|.+.|.++
T Consensus 319 ~ikLGlGDFIFYSvLvGkAa~~~d~------~TviAC~vaIL~GL~~TL~llsv~~kALPALP--isI~~G~iF 384 (406)
T KOG2736|consen 319 GIKLGLGDFIFYSVLVGKAAAYGDL------NTVIACFVAILIGLCLTLLLLSVFKKALPALP--ISITFGLIF 384 (406)
T ss_pred ceeeccCceEEEEeeccchhhcCCh------HHHHHHHHHHHHHHHHHHHHHHHHhhcCcCCc--hHHHHHHHH
Confidence 4479999999877777766655521 11358999999999999999999999999998 344444443
No 15
>KOG2736|consensus
Probab=29.29 E-value=49 Score=36.55 Aligned_cols=36 Identities=39% Similarity=0.654 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCceeEeeccccchH
Q psy980 512 FSNFFAWHLGLMATIFVMHVFKHAQPALLYLVPACLGL 549 (587)
Q Consensus 512 ~~~~~~Y~~GL~~T~~vm~~fk~aQPALLYLVP~~l~~ 549 (587)
.++++|=.+||.+|...+.++|+|-|||= +|-+.|.
T Consensus 347 iAC~vaIL~GL~~TL~llsv~~kALPALP--isI~~G~ 382 (406)
T KOG2736|consen 347 IACFVAILIGLCLTLLLLSVFKKALPALP--ISITFGL 382 (406)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCcCCc--hHHHHHH
Confidence 46788889999999999999999999984 3444433
No 16
>PF01080 Presenilin: Presenilin Alzheimer disease; InterPro: IPR001108 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22A, the type example being presenilin 1 from Homo sapiens (Human). Presenilins are polytopic transmembrane (TM) proteins, mutations in which are associated with the occurrence of early-onset familial Alzheimer's disease, a rare form of the disease that results from a single-gene mutation [, ]. The physiological functions of presenilins are unknown, but they may be related to developmental signalling, apoptotic signal transduction, or processing of selected proteins, such as the beta-amyloid precursor protein(beta-APP). There are a number of subtypes which belong to this presenilin family. That presenilin homologues have been identified in species that do not have an Alzhemier's disease correlate suggests that they may have functions unrelated to the disease, homologues having been identified in mouse, Drosophila melanogaster, Caenorhabditis elegans [] and other members of the eukarya including plants. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 2KR6_A.
Probab=24.94 E-value=68 Score=35.80 Aligned_cols=29 Identities=34% Similarity=0.615 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCceeE
Q psy980 512 FSNFFAWHLGLMATIFVMHVFKHAQPALL 540 (587)
Q Consensus 512 ~~~~~~Y~~GL~~T~~vm~~fk~aQPALL 540 (587)
.++++|-.+||..|...+.++|++=|||=
T Consensus 353 ~~~~~ail~Gl~~Tl~~l~~~~~alPALP 381 (403)
T PF01080_consen 353 VACFVAILIGLCLTLLLLAIFRKALPALP 381 (403)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHT-S-SSSS
T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCCc
Confidence 35678888999999999999999999983
No 17
>PF07857 DUF1632: CEO family (DUF1632); InterPro: IPR012435 These sequences are found in hypothetical eukaryotic proteins of unknown function. The region concerned is approximately 280 residues long.
Probab=20.75 E-value=3.1e+02 Score=28.77 Aligned_cols=31 Identities=6% Similarity=0.187 Sum_probs=19.3
Q ss_pred eeeccCHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy980 328 LYLFFSKDMINLLLTGYFFLLGVLALCYLLS 358 (587)
Q Consensus 328 ~fKf~~~~~IN~lLt~YF~ilGv~AL~~~l~ 358 (587)
.|.--|++-++++++-|..+.-.-.+...+-
T Consensus 211 ~y~~as~~~ldYvFs~f~GIfltSt~~F~~Y 241 (254)
T PF07857_consen 211 IYPGASQNGLDYVFSHFSGIFLTSTVYFVIY 241 (254)
T ss_pred ccCCCCCcchheeHHHHhhHHHHHHHHHHHH
Confidence 3444566778888887776665555554443
No 18
>TIGR02587 putative integral membrane protein TIGR02587. Members of this family are found in Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Sinorhizobium meliloti, and Gloeobacter violaceus in a conserved two-gene neighborhood. This family, as defined, includes some members of COG4711 but is narrower and strictly bacterial. Members appear to span the membrane seven times.
Probab=20.09 E-value=1.1e+02 Score=32.45 Aligned_cols=48 Identities=15% Similarity=0.261 Sum_probs=34.2
Q ss_pred hhHhHHHHHHHHHHHHHHHHHhCC---Ccce------ehhh-hhhhhHHHHHHHHHHHhhhh
Q psy980 200 FNTAFLAYFLGLMATIFVMHVFKH---AQPA------LLYL-VPACLGLPLLIIARISLMDN 251 (587)
Q Consensus 200 F~~s~igY~lGL~~T~~vm~~~~~---aQPA------LLYL-VP~~l~~~~l~A~~r~e~~~ 251 (587)
..=++.+|.+|++++.+++..+++ ++|- ...+ +|+.+|+. +.|..++.
T Consensus 69 i~eti~ay~Iglv~S~~~L~lfgri~~~~pl~e~Lg~vivl~vP~sIGaA----laR~~L~~ 126 (271)
T TIGR02587 69 VIDTVEAMAIGFVCSAAMLWLFGIITPETSLKEIVGKVAFQGVPFSLGAA----LARQQLGD 126 (271)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCCCCCHHHHHHHHHHHhCcHHHHHH----HHHHHhCC
Confidence 456899999999999999999987 5552 2223 88877654 44555443
Done!