Query psy13063
Match_columns 316
No_of_seqs 140 out of 458
Neff 5.2
Searched_HMMs 46136
Date Fri Aug 16 18:30:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13063.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13063hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2443|consensus 100.0 2.3E-59 5E-64 446.4 12.7 213 26-312 132-349 (362)
2 KOG2442|consensus 100.0 2.3E-58 4.9E-63 454.4 10.5 219 25-308 281-522 (541)
3 PF04258 Peptidase_A22B: Signa 100.0 1.6E-58 3.4E-63 439.9 -0.5 198 40-302 94-298 (298)
4 smart00730 PSN Presenilin, sig 100.0 1.1E-42 2.5E-47 324.2 13.8 180 47-294 62-249 (249)
5 PF06550 DUF1119: Protein of u 99.6 1.2E-15 2.7E-20 144.0 9.2 185 49-295 93-282 (283)
6 COG3389 Uncharacterized protei 99.4 1.5E-13 3.4E-18 126.6 6.9 162 64-292 108-272 (277)
7 KOG2443|consensus 96.6 0.0003 6.6E-09 68.9 -1.4 22 177-198 257-278 (362)
8 PF04258 Peptidase_A22B: Signa 94.8 0.0081 1.8E-07 57.9 0.0 19 177-195 218-236 (298)
9 KOG2736|consensus 93.7 0.11 2.4E-06 51.9 5.4 70 197-290 317-386 (406)
10 KOG2442|consensus 91.9 0.032 6.9E-07 57.3 -1.2 21 177-197 436-456 (541)
11 smart00730 PSN Presenilin, sig 90.1 0.048 1E-06 51.5 -1.8 19 177-195 178-196 (249)
12 PF01080 Presenilin: Presenili 89.4 0.18 3.9E-06 50.9 1.6 65 199-288 325-390 (403)
13 COG3389 Uncharacterized protei 83.1 3.4 7.5E-05 39.2 6.3 35 168-210 198-232 (277)
14 cd06853 GT_WecA_like This subf 62.8 22 0.00049 33.1 6.4 67 248-314 174-240 (249)
15 PRK13743 conjugal transfer pro 59.6 25 0.00054 30.6 5.5 68 246-313 39-125 (141)
16 cd06854 GT_WbpL_WbcO_like The 56.7 27 0.00059 32.8 5.9 64 248-314 176-239 (253)
17 PRK15119 undecaprenyl-phosphat 54.6 36 0.00079 34.0 6.7 63 252-314 218-280 (365)
18 TIGR02380 ECA_wecA undecapreny 37.6 1E+02 0.0022 30.6 6.7 63 252-314 213-276 (346)
19 cd06855 GT_GPT_euk UDP-GlcNAc: 25.8 96 0.0021 29.9 4.3 63 252-315 219-281 (283)
20 PRK14412 membrane protein; Pro 23.5 3.7E+02 0.0081 24.7 7.5 34 280-313 163-196 (198)
21 TIGR02587 putative integral me 21.4 1.6E+02 0.0034 28.7 4.7 49 248-296 69-127 (271)
No 1
>KOG2443|consensus
Probab=100.00 E-value=2.3e-59 Score=446.38 Aligned_cols=213 Identities=52% Similarity=0.829 Sum_probs=191.4
Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHHHHHhhhhhHHHhHHHHHHHHHHHHhcccCchHHHHHHHHHHHHHh
Q psy13063 26 SVYNTTLFRRISFGVCGRFTSAELFSFSLSLFIVCIWVLTGHWLLMDAMGMGLCVAFIAFVRLPSLKVSTLLLTGLLIYD 105 (316)
Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~w~~~~~W~l~Nilgi~~~i~~i~~i~l~s~k~~tiLL~gLf~YD 105 (316)
-+|.+...+|..+++|+++|..|++++..|..++++|++++||++||++|+++|+++|+.+|++|+|+|++||.|||+||
T Consensus 132 ~~~~~~~~~~~~~~~~~~Ft~~~iv~~vls~~i~v~~ll~~HWl~nN~lgms~~I~~I~~lrL~s~ktgalLL~gLffYD 211 (362)
T KOG2443|consen 132 LLFPRGPGEKKEFICNGKFTRAQIVALVLSSMIVVWYLLTKHWLANNLLGMSFCIAGIEFLRLPSLKTGALLLGGLFFYD 211 (362)
T ss_pred eeeeccCCcccceeecccccHHHHHHHHHHHHHHHHHHhhhHHHHHhHHHHHHHHHHHHHhcccchhhhHHHHHHHHHHh
Confidence 34555666667789999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhheeecccccccceeEEeecCCCCCCccchhhcccCCCcccCCCCCCCCceeeccCCCC-----CCcccccccCCCccc
Q psy13063 106 VFWVFFSSYIFSTNVMVKVATRPAENPVGLVAKRFHMGGVARDAPKLSLPGKLVFPSLHQ-----TGHFSMLRGLGDIVM 180 (316)
Q Consensus 106 IF~VF~sp~~f~tsVMvtVAt~~~~~~~~~~~~~~~~~~~~~~~~~l~~Pikl~~P~~~~-----~~~~SmL~glGDiv~ 180 (316)
|||||+ |||||||||+ +|.|+|++||+... ..+||||
T Consensus 212 IfwVFg------TnVMVtVAt~------------------------~D~PikL~fP~~l~~~~~~as~fsML-------- 253 (362)
T KOG2443|consen 212 IFWVFG------TNVMVTVATS------------------------LDAPIKLVFPQKLLFPGLTASNFSML-------- 253 (362)
T ss_pred eEEEec------CceEEEeecc------------------------cCCceEEecchhhccCCCccccceec--------
Confidence 999995 8999999996 57888888887532 2358998
Q ss_pred ccceeeeeehhhhhhhhhhhhcCCCCccccceeeeeeehhhhhhhhhcccCCCCCCCCCCCcccchhHHHHHHHHHHHHH
Q psy13063 181 PGLLLCFVLRYDAYKKSQLLHLGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGETGIPAPRHLSRISYFHCSLIGYFLGLL 260 (316)
Q Consensus 181 Pg~l~~~~lr~d~~~~~~~~~lGlGDIviPGl~ia~~~rfD~~~~~~~~~~~~~~~~~~~~~~r~~YF~~sligY~~GL~ 260 (316)
||||||+||+|+|+++|||.+|+++. .+.+||.++++||.+||.
T Consensus 254 ----------------------GLGDIviPGiflAl~lRfD~~k~~~s--------------~~~~YF~~t~i~Y~~gL~ 297 (362)
T KOG2443|consen 254 ----------------------GLGDIVIPGIFLALVLRFDIRKKRNS--------------KVRTYFHNTFIAYFLGLL 297 (362)
T ss_pred ----------------------cccchhhHHHHHHHHHHhhHHHHhcc--------------cCceEEEEeHHHHHhhhH
Confidence 99999999999999999999887752 123899999999999999
Q ss_pred HHHHHHHHhCcCccchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCCCCCC
Q psy13063 261 TATVSSEIFKAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSK 312 (316)
Q Consensus 261 ~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~ 312 (316)
.|+++|+++|++|||||||||+|+++.+++|++|||++.+|++++.-.++++
T Consensus 298 ~ti~~~~~FkaAQPALLYlVP~~l~~~ll~A~~~gdlk~l~s~~~~~~~~~~ 349 (362)
T KOG2443|consen 298 TTIVVLHIFKAAQPALLYLVPACLGPLLLMAYWRGDLKVLWSFDESTKEESA 349 (362)
T ss_pred HHhhhhhhhhccchhhhhhhHHHHhHHHHHHHHccchHhhhCccccCCCCcH
Confidence 9999999999999999999999999999999999999999999996665554
No 2
>KOG2442|consensus
Probab=100.00 E-value=2.3e-58 Score=454.45 Aligned_cols=219 Identities=37% Similarity=0.698 Sum_probs=194.9
Q ss_pred cccccc--ccccccccccccccc---------chhHHHHHHHHHHHHHHHhhh----hhHHHhHHHHHHHHHHHHhcccC
Q psy13063 25 VSVYNT--TLFRRISFGVCGRFT---------SAELFSFSLSLFIVCIWVLTG----HWLLMDAMGMGLCVAFIAFVRLP 89 (316)
Q Consensus 25 ~~~~~~--~~~~~~~~~~~~~~~---------~~~~~~~~~s~~~~~~w~~~~----~W~l~Nilgi~~~i~~i~~i~l~ 89 (316)
.++||| ++++|.++|.|...+ ...++.-.+|++++++|.+.| +|++||++|||+|+++++.+|+|
T Consensus 281 ~gLy~Cl~~lv~r~~~~~~~~~~~~~~l~~~~~~~l~l~~~Cia~aV~W~v~R~e~~AwilqDvLGIalci~vLk~vRLP 360 (541)
T KOG2442|consen 281 QGLYNCLAALVHRLPYGAARFPTLAPRLGNMSYRLLFLSILCIAVAVVWAVFRNEDWAWILQDVLGIALCITVLKTVRLP 360 (541)
T ss_pred chHHHHHHHHHhhhhhhcccccccccccCChhHHHHHHHHhhhheeEEEEEeecCchHHHHHhhHhHHHHHHHHHHhcCC
Confidence 579997 999999999885322 344455567888999998865 79999999999999999999999
Q ss_pred chHHHHHHHHHHHHHhhhheeecccccc--cceeEEeecCCCCCCccchhhcccCCCcccCCCCCCCCceeeccCCC---
Q psy13063 90 SLKVSTLLLTGLLIYDVFWVFFSSYIFS--TNVMVKVATRPAENPVGLVAKRFHMGGVARDAPKLSLPGKLVFPSLH--- 164 (316)
Q Consensus 90 s~k~~tiLL~gLf~YDIF~VF~sp~~f~--tsVMvtVAt~~~~~~~~~~~~~~~~~~~~~~~~~l~~Pikl~~P~~~--- 164 (316)
|+|++++||..+|+|||||||+||++|+ +|||++||+|+++++ +++||.+.+||+.
T Consensus 361 nlK~~tiLL~c~f~YDiF~VFitp~~t~~geSVMieVA~G~~s~~-------------------EkiPMlLkVPrl~~s~ 421 (541)
T KOG2442|consen 361 NLKVCTILLLCLFLYDIFFVFITPFITKNGESVMIEVARGPSSTE-------------------EKIPMLLKVPRLFFSV 421 (541)
T ss_pred chhHHHHHHHHHHHHhhheeeeehhhccCCceEEEEEecCCCCCC-------------------CCcceEEEcchhcccc
Confidence 9999999999999999999999999998 899999999976552 5679999999976
Q ss_pred --C-CCcccccccCCCcccccceeeeeehhhhhhhhhhhhcCCCCccccceeeeeeehhhhhhhhhcccCCCCCCCCCCC
Q psy13063 165 --Q-TGHFSMLRGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGETGIPAPRH 241 (316)
Q Consensus 165 --~-~~~~SmL~glGDiv~Pg~l~~~~lr~d~~~~~~~~~lGlGDIviPGl~ia~~~rfD~~~~~~~~~~~~~~~~~~~~ 241 (316)
+ .++|||| |+|||++||++++||+|||.+.++.
T Consensus 422 ~~~~~~~~sil------------------------------GFGDIl~PGlLVa~c~RfD~~~~~~-------------- 457 (541)
T KOG2442|consen 422 LSDPWGGYSIL------------------------------GFGDILVPGLLVAFCLRFDVQVNSV-------------- 457 (541)
T ss_pred ccccCCCeeEe------------------------------eecccccchHHHHHHHHhhhhcccc--------------
Confidence 3 3567877 9999999999999999999886652
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhCcCccchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCC
Q psy13063 242 LSRISYFHCSLIGYFLGLLTATVSSEIFKAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIV 308 (316)
Q Consensus 242 ~~r~~YF~~sligY~~GL~~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~ 308 (316)
|..||.++++||++||++|++++.+||.|||||||||||||++...+|++|||++++|+|.-...
T Consensus 458 --~~iYfv~~tvaYgiGLlvTfvaL~LM~~GQPALLYLVP~TL~t~~~lal~R~El~~fWtg~~~~~ 522 (541)
T KOG2442|consen 458 --SNIYFVWSTVAYGIGLLVTFVALVLMKGGQPALLYLVPCTLGTAVVLALCRGELKKFWTGGSYQK 522 (541)
T ss_pred --ceeEEehhHHHHHHHHHHHHHHHHHhcCCCceEEEEechHHHHHHHHHHHHHHHHHHhccCCccc
Confidence 34899999999999999999999999999999999999999999999999999999999998654
No 3
>PF04258 Peptidase_A22B: Signal peptide peptidase; InterPro: IPR007369 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain aspartic endopeptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22B. The peptidases were originally classified by hierarchical homology to the most conserved member - IMPAS 1. They are also known as signal peptide peptidase (SPP) []. They belong to the I-CliP family of peptidases. SPP cleaves cleaves remnant signal peptides left behind in the membrane by the action of signal peptidase and also plays key roles in immune surveillance and the maturation of certain viral proteins []. SPPs do not require cofactors as demonstrated by expression in bacteria and purification of a proteolytically active form. The C-terminal region defines the functional domain, which is in itself sufficient for proteolytic activity []. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 1JUF_C 1INQ_C.
Probab=100.00 E-value=1.6e-58 Score=439.93 Aligned_cols=198 Identities=49% Similarity=0.936 Sum_probs=1.0
Q ss_pred ccccccchhHHHHHHHHHHHHHHHhhh--hhHHHhHHHHHHHHHHHHhcccCchHHHHHHHHHHHHHhhhheeecccccc
Q psy13063 40 VCGRFTSAELFSFSLSLFIVCIWVLTG--HWLLMDAMGMGLCVAFIAFVRLPSLKVSTLLLTGLLIYDVFWVFFSSYIFS 117 (316)
Q Consensus 40 ~~~~~~~~~~~~~~~s~~~~~~w~~~~--~W~l~Nilgi~~~i~~i~~i~l~s~k~~tiLL~gLf~YDIF~VF~sp~~f~ 117 (316)
...+++..++++..+|+++++.|++.+ ||++||++|+|+|+.+++.+|+||+|+++++|+++|+|||||||+||++||
T Consensus 94 ~~~~~~~~~l~~~~~s~~~~~~w~~~~~~~W~l~nilgi~~~i~~i~~i~l~s~k~~~ilL~~lf~YDif~VF~s~~~~g 173 (298)
T PF04258_consen 94 FSGSFTLSDLLSFLISLAIAVVWYVYRNEHWILQNILGICFCINIISLIRLPSFKTATILLIGLFLYDIFWVFISPYFFG 173 (298)
T ss_dssp ---------------------S----------------------------------------------------------
T ss_pred eeeeeeHHHHHHHHHHHHHHHHHHHhccchHHHHhHHHHHHHHHHHHheeccchHHHHHHHHHHHHHHHHHHHhccccCC
Confidence 345788899999999999999988754 799999999999999999999999999999999999999999999999999
Q ss_pred cceeEEeecCCCCCCccchhhcccCCCcccCCCCCCCCceeeccCCC-----CCCcccccccCCCcccccceeeeeehhh
Q psy13063 118 TNVMVKVATRPAENPVGLVAKRFHMGGVARDAPKLSLPGKLVFPSLH-----QTGHFSMLRGLGDIVMPGLLLCFVLRYD 192 (316)
Q Consensus 118 tsVMvtVAt~~~~~~~~~~~~~~~~~~~~~~~~~l~~Pikl~~P~~~-----~~~~~SmL~glGDiv~Pg~l~~~~lr~d 192 (316)
+|||++|||++.+++ .++|+|+++|+.. ..+++|||
T Consensus 174 ~svM~~VA~~~~~~~-------------------~~~P~~l~~P~~~~~~~~~~~~~s~L-------------------- 214 (298)
T PF04258_consen 174 TSVMVTVATGGFDAP-------------------EKLPIKLQFPRFFDSNSSCPKPFSML-------------------- 214 (298)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred Cchhhhhhccccccc-------------------cCCCeEEEEeccccccccCCCCeeEe--------------------
Confidence 999999999855442 3579999999974 35678888
Q ss_pred hhhhhhhhhcCCCCccccceeeeeeehhhhhhhhhcccCCCCCCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHhCcC
Q psy13063 193 AYKKSQLLHLGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGETGIPAPRHLSRISYFHCSLIGYFLGLLTATVSSEIFKAA 272 (316)
Q Consensus 193 ~~~~~~~~~lGlGDIviPGl~ia~~~rfD~~~~~~~~~~~~~~~~~~~~~~r~~YF~~sligY~~GL~~t~~a~~~~~~g 272 (316)
|+|||++||+++++|+|||..+++. +++||.++++||++||++|+++++++|+|
T Consensus 215 ----------GlGDIviPGl~i~~~~rfD~~~~~~----------------~~~Yf~~~~~~Y~~Gl~~t~~~~~~~~~~ 268 (298)
T PF04258_consen 215 ----------GLGDIVIPGLFIAFCLRFDKSRNKS----------------RKPYFIASLIGYALGLLLTFVALHLFKHG 268 (298)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred ----------ccchHHHHHHHHHHHHHhhHhhccc----------------cchHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 9999999999999999999887321 23799999999999999999999999999
Q ss_pred ccchhhhhhhhhHHHHHHHHHHhHHHHhcc
Q psy13063 273 QPALLYLVPFTLLPLLTMAYLKGDLRRMWS 302 (316)
Q Consensus 273 QPALLYLVP~~L~~~~~~A~~RgEl~~~W~ 302 (316)
|||||||||+|+++++++|++|||++++||
T Consensus 269 QPALlylvP~~l~~~~~~a~~r~el~~~w~ 298 (298)
T PF04258_consen 269 QPALLYLVPCTLGSVLLVAWIRGELKDFWN 298 (298)
T ss_dssp ------------------------------
T ss_pred CCeehHHHHHHHHHHHHHHHHhhHHHHhhC
Confidence 999999999999999999999999999997
No 4
>smart00730 PSN Presenilin, signal peptide peptidase, family. Presenilin 1 and presenilin 2 are polytopic membrane proteins, whose genes are mutated in some individuals with Alzheimer's disease. Distant homologues, present in eukaryotes and archaea, also contain conserved aspartic acid residues which are predicted to contribute to catalysis. At least one member of this family has been shown to possess signal peptide peptidase activity.
Probab=100.00 E-value=1.1e-42 Score=324.24 Aligned_cols=180 Identities=41% Similarity=0.694 Sum_probs=156.1
Q ss_pred hhHHHHHHHHHHHHHHHhhh--hhHHHhHHHHHHHHHHHHhcccCchHHHHHHHHHHHHHhhhheeecccccccceeEEe
Q psy13063 47 AELFSFSLSLFIVCIWVLTG--HWLLMDAMGMGLCVAFIAFVRLPSLKVSTLLLTGLLIYDVFWVFFSSYIFSTNVMVKV 124 (316)
Q Consensus 47 ~~~~~~~~s~~~~~~w~~~~--~W~l~Nilgi~~~i~~i~~i~l~s~k~~tiLL~gLf~YDIF~VF~sp~~f~tsVMvtV 124 (316)
........+++++..|.+++ +|++||++|+++|++.++.+++||+|+++++|+++++||+||||.||. +.+|||+|
T Consensus 62 ~~~~~~~~~~~v~~~~~~~~~~~w~~~~~lgi~~~~~~~~~~~l~~~~~~~iLL~~l~iYDif~Vf~t~~--~~~vMv~v 139 (249)
T smart00730 62 PTLLILLLNFAVVGFWCIHRKGAWIQQDLIGISLCMAILFILRLPSEWTAWILLGALFIYDIFAVFGTPG--PLRVMVEV 139 (249)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHheeecCCC--CchHHhhH
Confidence 44566667788888888664 999999999999999999999999999999999999999999999987 88999999
Q ss_pred ecCCCCCCccchhhcccCCCcccCCCCCCCCceeeccCC------CCCCcccccccCCCcccccceeeeeehhhhhhhhh
Q psy13063 125 ATRPAENPVGLVAKRFHMGGVARDAPKLSLPGKLVFPSL------HQTGHFSMLRGLGDIVMPGLLLCFVLRYDAYKKSQ 198 (316)
Q Consensus 125 At~~~~~~~~~~~~~~~~~~~~~~~~~l~~Pikl~~P~~------~~~~~~SmL~glGDiv~Pg~l~~~~lr~d~~~~~~ 198 (316)
|++..++ .+..|+++..|+. .+.+++|+|
T Consensus 140 A~~~~~~-------------------~~~~P~ll~~p~~~~~~~~~~~~~~~~L-------------------------- 174 (249)
T smart00730 140 ATGRDEP-------------------IKVFPALLYVPRLVVSFEDDEEGRFSML-------------------------- 174 (249)
T ss_pred hccCCCC-------------------cccCChhhcccccccccccCCCCcccee--------------------------
Confidence 9974321 1457888888873 223456666
Q ss_pred hhhcCCCCccccceeeeeeehhhhhhhhhcccCCCCCCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHhCcCccchhh
Q psy13063 199 LLHLGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGETGIPAPRHLSRISYFHCSLIGYFLGLLTATVSSEIFKAAQPALLY 278 (316)
Q Consensus 199 ~~~lGlGDIviPGl~ia~~~rfD~~~~~~~~~~~~~~~~~~~~~~r~~YF~~sligY~~GL~~t~~a~~~~~~gQPALLY 278 (316)
|+||||+||+++++|+|||..++. +..||.++++||++||+.|+++++++|+|||||+|
T Consensus 175 ----GLGDiv~Pgilv~~a~~fd~~~~~-----------------~~~yf~~~~~ay~~GL~~t~~~l~~~~~aqPALly 233 (249)
T smart00730 175 ----GLGDIVFPGILVASAARFDVSVRS-----------------DSNYFLACFVAYGIGLILTLVLLALFKKAQPALPY 233 (249)
T ss_pred ----cCCCeeeHHHHHHHHHHhhhcccC-----------------CcccHHHHHHHHHHHHHHHHHHHHHhCCCCccHHH
Confidence 999999999999999999976432 23799999999999999999999999999999999
Q ss_pred hhhhhhHHHHHHHHHH
Q psy13063 279 LVPFTLLPLLTMAYLK 294 (316)
Q Consensus 279 LVP~~L~~~~~~A~~R 294 (316)
+||+++++.+++|+.|
T Consensus 234 lvp~~l~~~~~~~~~r 249 (249)
T smart00730 234 LVPFTLVFYLLTALLR 249 (249)
T ss_pred HHHHHHHHHHHHHHhC
Confidence 9999999999999976
No 5
>PF06550 DUF1119: Protein of unknown function (DUF1119); InterPro: IPR010545 This family consists of several hypothetical archaeal proteins of unknown function.
Probab=99.62 E-value=1.2e-15 Score=144.01 Aligned_cols=185 Identities=22% Similarity=0.327 Sum_probs=134.3
Q ss_pred HHHHHHHHHHHHHHHhhhhhHHHhHHHHHHHHHHHHhcccC-chHHHHHHHHHHHHHhhhheeecccccccceeEEeecC
Q psy13063 49 LFSFSLSLFIVCIWVLTGHWLLMDAMGMGLCVAFIAFVRLP-SLKVSTLLLTGLLIYDVFWVFFSSYIFSTNVMVKVATR 127 (316)
Q Consensus 49 ~~~~~~s~~~~~~w~~~~~W~l~Nilgi~~~i~~i~~i~l~-s~k~~tiLL~gLf~YDIF~VF~sp~~f~tsVMvtVAt~ 127 (316)
..++..+++++...+.+++|+..|+.|+.++.+.-..+.+. +...+.+||+.|-+||..-|| +|++|++.|++
T Consensus 93 ~~a~~~ai~~~~~L~~ypEWYviD~~Gil~~aG~aaiFGISl~~lpaiiLL~iLAVYDaISVY------kTkHMltLAeg 166 (283)
T PF06550_consen 93 IIALILAIALTALLYKYPEWYVIDIAGILMGAGAAAIFGISLGILPAIILLAILAVYDAISVY------KTKHMLTLAEG 166 (283)
T ss_pred HHHHHHHHHHHHHHHhcchHHHHHHHHHHHHhHHHHHHhhhccHHHHHHHHHHHHHhhhhhee------cchHHHHHHHH
Confidence 44566777777777788999999999999999877766665 667799999999999999999 67899999997
Q ss_pred CCCCCccchhhcccCCCcccCCCCCCCCceeeccCCCCCCcccccccCCCcccccceeeeeehhhhh---hhhhhhhcCC
Q psy13063 128 PAENPVGLVAKRFHMGGVARDAPKLSLPGKLVFPSLHQTGHFSMLRGLGDIVMPGLLLCFVLRYDAY---KKSQLLHLGL 204 (316)
Q Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~~l~~Pikl~~P~~~~~~~~SmL~glGDiv~Pg~l~~~~lr~d~~---~~~~~~~lGl 204 (316)
.. .+++|+.++.|+. ++||.+ +.| .+.. ++|+...+|+
T Consensus 167 v~---------------------d~klPilfViP~~---~~ySf~-~~~--------------~~~~~~~~~r~a~fiGl 207 (283)
T PF06550_consen 167 VM---------------------DLKLPILFVIPKK---RGYSFL-KDG--------------FDNREEKEERDAFFIGL 207 (283)
T ss_pred Hh---------------------ccCCceEEEEecc---cCcccc-ccc--------------cccccccccccceEecc
Confidence 43 2578999999983 568887 222 1222 2367789999
Q ss_pred CCccccceeeeeeehhhhhhhhhcccCCCCCCCCCCCcccchhH-HHHHHHHHHHHHHHHHHHHHhCcCccchhhhhhhh
Q psy13063 205 GDIVMPGLLLCFVLRYDAYKKSQLLHLGETGIPAPRHLSRISYF-HCSLIGYFLGLLTATVSSEIFKAAQPALLYLVPFT 283 (316)
Q Consensus 205 GDIviPGl~ia~~~rfD~~~~~~~~~~~~~~~~~~~~~~r~~YF-~~sligY~~GL~~t~~a~~~~~~gQPALLYLVP~~ 283 (316)
||.++|.++++....|.....- .-..++ ..+++|=.+|+.+=+.. ...+++||+|-||.-+.
T Consensus 208 GD~vmPtILVvSa~~f~~~~~~----------------~~~~lpalga~~Gtl~gl~vL~~~-v~kgrp~aGLP~LN~Ga 270 (283)
T PF06550_consen 208 GDAVMPTILVVSAAFFLSAPIL----------------GGLNLPALGAMLGTLAGLAVLLRF-VMKGRPQAGLPFLNGGA 270 (283)
T ss_pred chhhhHHHHHHHHHHhccccch----------------hhhhHHHHHHHHHHHHHHHHHHHH-HHcCCCCCCCCccchhH
Confidence 9999999999988888643110 000121 33355555554332221 23589999999999999
Q ss_pred hHHHHHHHHHHh
Q psy13063 284 LLPLLTMAYLKG 295 (316)
Q Consensus 284 L~~~~~~A~~Rg 295 (316)
++..++-++..|
T Consensus 271 I~Gflig~l~sg 282 (283)
T PF06550_consen 271 IAGFLIGALASG 282 (283)
T ss_pred HHHHHHHHHHcC
Confidence 888877766543
No 6
>COG3389 Uncharacterized protein conserved in archaea [Function unknown]
Probab=99.44 E-value=1.5e-13 Score=126.62 Aligned_cols=162 Identities=24% Similarity=0.337 Sum_probs=127.3
Q ss_pred hhhhhHHHhHHHHHHHHHHHHhcccC-chHHHHHHHHHHHHHhhhheeecccccccceeEEeecCCCCCCccchhhcccC
Q psy13063 64 LTGHWLLMDAMGMGLCVAFIAFVRLP-SLKVSTLLLTGLLIYDVFWVFFSSYIFSTNVMVKVATRPAENPVGLVAKRFHM 142 (316)
Q Consensus 64 ~~~~W~l~Nilgi~~~i~~i~~i~l~-s~k~~tiLL~gLf~YDIF~VF~sp~~f~tsVMvtVAt~~~~~~~~~~~~~~~~ 142 (316)
.++.|+..|..|.+++.+.-..+.++ ....+..+|..+-+||..-|+ ||++|++.|++..
T Consensus 108 ~~peWyVid~ag~~la~Giaai~GIsfgv~pavvlL~~lavYDaIsVY------kT~HMIslA~~v~------------- 168 (277)
T COG3389 108 KYPEWYVIDLAGFFLAVGIAAIFGISFGVLPAVVLLIALAVYDAISVY------KTRHMISLAEGVM------------- 168 (277)
T ss_pred hccceEEeehHHHHHHhhHHHhheeecchHHHHHHHHHHHHHHHHHHH------hHHHHHHHHHHHH-------------
Confidence 35699999999999999987777666 666789999999999999998 6889999999732
Q ss_pred CCcccCCCCCCCCceeeccCCCCCCcccccccCCCcccccceeeeeehhhhhhhhhhhhcCCCCccccceeeeeeehhhh
Q psy13063 143 GGVARDAPKLSLPGKLVFPSLHQTGHFSMLRGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGLGDIVMPGLLLCFVLRYDA 222 (316)
Q Consensus 143 ~~~~~~~~~l~~Pikl~~P~~~~~~~~SmL~glGDiv~Pg~l~~~~lr~d~~~~~~~~~lGlGDIviPGl~ia~~~rfD~ 222 (316)
.+|+||.+++|. +..||+. + ..||+.++++..+.|+||+++|.+++..+.-|-.
T Consensus 169 --------d~~lPmlfviP~---~l~ysf~-~--------------~~fe~r~dgna~miG~GDavmPsIlVvSaa~f~~ 222 (277)
T COG3389 169 --------DLDLPMLFVIPE---NLAYSFV-E--------------DAFENRGDGNAYMIGLGDAVMPSILVVSAAFFLI 222 (277)
T ss_pred --------hcCCceEEEeec---cccccee-e--------------hhhhcCCCCceEEEeechhhcccceeeehHHhcc
Confidence 368999999997 3568887 3 3577778999999999999999999998877743
Q ss_pred hhhhhcccCCCCCCCCCCCcccchhH--HHHHHHHHHHHHHHHHHHHHhCcCccchhhhhhhhhHHHHHHHH
Q psy13063 223 YKKSQLLHLGETGIPAPRHLSRISYF--HCSLIGYFLGLLTATVSSEIFKAAQPALLYLVPFTLLPLLTMAY 292 (316)
Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~r~~YF--~~sligY~~GL~~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~ 292 (316)
.. ..+| ..++.|=.+|+++- -+..-+++||-|-|+.-+++.+.++-+.
T Consensus 223 s~--------------------~l~f~~Lpal~GglvGl~vL--~~v~r~Rp~pGLP~lN~GaIaGflig~V 272 (277)
T COG3389 223 SP--------------------ILAFIVLPALAGGLVGLAVL--YFVNRGRPHPGLPFLNTGAIAGFLIGFV 272 (277)
T ss_pred CC--------------------chhhhhHHHHhccHHHHHHH--HHHhcCCCCCCCceeccchHHHHHHHHH
Confidence 21 1233 23577777777543 3345689999999999888877665443
No 7
>KOG2443|consensus
Probab=96.62 E-value=0.0003 Score=68.88 Aligned_cols=22 Identities=55% Similarity=1.004 Sum_probs=21.2
Q ss_pred Ccccccceeeeeehhhhhhhhh
Q psy13063 177 DIVMPGLLLCFVLRYDAYKKSQ 198 (316)
Q Consensus 177 Div~Pg~l~~~~lr~d~~~~~~ 198 (316)
||||||+++|+++|||.+++++
T Consensus 257 DIviPGiflAl~lRfD~~k~~~ 278 (362)
T KOG2443|consen 257 DIVIPGIFLALVLRFDIRKKRN 278 (362)
T ss_pred chhhHHHHHHHHHHhhHHHHhc
Confidence 8999999999999999999887
No 8
>PF04258 Peptidase_A22B: Signal peptide peptidase; InterPro: IPR007369 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain aspartic endopeptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22B. The peptidases were originally classified by hierarchical homology to the most conserved member - IMPAS 1. They are also known as signal peptide peptidase (SPP) []. They belong to the I-CliP family of peptidases. SPP cleaves cleaves remnant signal peptides left behind in the membrane by the action of signal peptidase and also plays key roles in immune surveillance and the maturation of certain viral proteins []. SPPs do not require cofactors as demonstrated by expression in bacteria and purification of a proteolytically active form. The C-terminal region defines the functional domain, which is in itself sufficient for proteolytic activity []. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 1JUF_C 1INQ_C.
Probab=94.78 E-value=0.0081 Score=57.87 Aligned_cols=19 Identities=53% Similarity=1.121 Sum_probs=0.0
Q ss_pred Ccccccceeeeeehhhhhh
Q psy13063 177 DIVMPGLLLCFVLRYDAYK 195 (316)
Q Consensus 177 Div~Pg~l~~~~lr~d~~~ 195 (316)
|||+||+++++|+|||.++
T Consensus 218 DIviPGl~i~~~~rfD~~~ 236 (298)
T PF04258_consen 218 DIVIPGLFIAFCLRFDKSR 236 (298)
T ss_dssp -------------------
T ss_pred hHHHHHHHHHHHHHhhHhh
Confidence 8999999999999999998
No 9
>KOG2736|consensus
Probab=93.72 E-value=0.11 Score=51.91 Aligned_cols=70 Identities=23% Similarity=0.392 Sum_probs=54.8
Q ss_pred hhhhhcCCCCccccceeeeeeehhhhhhhhhcccCCCCCCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHhCcCccch
Q psy13063 197 SQLLHLGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGETGIPAPRHLSRISYFHCSLIGYFLGLLTATVSSEIFKAAQPAL 276 (316)
Q Consensus 197 ~~~~~lGlGDIviPGl~ia~~~rfD~~~~~~~~~~~~~~~~~~~~~~r~~YF~~sligY~~GL~~t~~a~~~~~~gQPAL 276 (316)
.....||+||-|+=.++++-+.-+|.+ .=-++|.+|-.+||.+|...+.++|++-|||
T Consensus 317 ~r~ikLGlGDFIFYSvLvGkAa~~~d~----------------------~TviAC~vaIL~GL~~TL~llsv~~kALPAL 374 (406)
T KOG2736|consen 317 ERGIKLGLGDFIFYSVLVGKAAAYGDL----------------------NTVIACFVAILIGLCLTLLLLSVFKKALPAL 374 (406)
T ss_pred ccceeeccCceEEEEeeccchhhcCCh----------------------HHHHHHHHHHHHHHHHHHHHHHHHhhcCcCC
Confidence 334567999999888888776666622 1136889999999999999999999999999
Q ss_pred hhhhhhhhHHHHHH
Q psy13063 277 LYLVPFTLLPLLTM 290 (316)
Q Consensus 277 LYLVP~~L~~~~~~ 290 (316)
- +|.++|.+.-.
T Consensus 375 P--isI~~G~iFYF 386 (406)
T KOG2736|consen 375 P--ISITFGLIFYF 386 (406)
T ss_pred c--hHHHHHHHHHH
Confidence 8 57777766543
No 10
>KOG2442|consensus
Probab=91.93 E-value=0.032 Score=57.28 Aligned_cols=21 Identities=52% Similarity=0.980 Sum_probs=19.0
Q ss_pred Ccccccceeeeeehhhhhhhh
Q psy13063 177 DIVMPGLLLCFVLRYDAYKKS 197 (316)
Q Consensus 177 Div~Pg~l~~~~lr~d~~~~~ 197 (316)
||++||+|+++|+|||-..++
T Consensus 436 DIl~PGlLVa~c~RfD~~~~~ 456 (541)
T KOG2442|consen 436 DILVPGLLVAFCLRFDVQVNS 456 (541)
T ss_pred ccccchHHHHHHHHhhhhccc
Confidence 899999999999999987654
No 11
>smart00730 PSN Presenilin, signal peptide peptidase, family. Presenilin 1 and presenilin 2 are polytopic membrane proteins, whose genes are mutated in some individuals with Alzheimer's disease. Distant homologues, present in eukaryotes and archaea, also contain conserved aspartic acid residues which are predicted to contribute to catalysis. At least one member of this family has been shown to possess signal peptide peptidase activity.
Probab=90.09 E-value=0.048 Score=51.50 Aligned_cols=19 Identities=42% Similarity=0.774 Sum_probs=17.7
Q ss_pred Ccccccceeeeeehhhhhh
Q psy13063 177 DIVMPGLLLCFVLRYDAYK 195 (316)
Q Consensus 177 Div~Pg~l~~~~lr~d~~~ 195 (316)
|||+||+++++++|||.++
T Consensus 178 Div~Pgilv~~a~~fd~~~ 196 (249)
T smart00730 178 DIVFPGILVASAARFDVSV 196 (249)
T ss_pred CeeeHHHHHHHHHHhhhcc
Confidence 8999999999999999764
No 12
>PF01080 Presenilin: Presenilin Alzheimer disease; InterPro: IPR001108 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A22 (presenilin family, clan AD): subfamily A22A, the type example being presenilin 1 from Homo sapiens (Human). Presenilins are polytopic transmembrane (TM) proteins, mutations in which are associated with the occurrence of early-onset familial Alzheimer's disease, a rare form of the disease that results from a single-gene mutation [, ]. The physiological functions of presenilins are unknown, but they may be related to developmental signalling, apoptotic signal transduction, or processing of selected proteins, such as the beta-amyloid precursor protein(beta-APP). There are a number of subtypes which belong to this presenilin family. That presenilin homologues have been identified in species that do not have an Alzhemier's disease correlate suggests that they may have functions unrelated to the disease, homologues having been identified in mouse, Drosophila melanogaster, Caenorhabditis elegans [] and other members of the eukarya including plants. ; GO: 0004190 aspartic-type endopeptidase activity, 0016021 integral to membrane; PDB: 2KR6_A.
Probab=89.45 E-value=0.18 Score=50.93 Aligned_cols=65 Identities=26% Similarity=0.428 Sum_probs=43.5
Q ss_pred hhhcCCCCccccceeeeeeehh-hhhhhhhcccCCCCCCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHhCcCccchh
Q psy13063 199 LLHLGLGDIVMPGLLLCFVLRY-DAYKKSQLLHLGETGIPAPRHLSRISYFHCSLIGYFLGLLTATVSSEIFKAAQPALL 277 (316)
Q Consensus 199 ~~~lGlGDIviPGl~ia~~~rf-D~~~~~~~~~~~~~~~~~~~~~~r~~YF~~sligY~~GL~~t~~a~~~~~~gQPALL 277 (316)
..-||+||-|+=+++++-+..+ | ..-..++.+|-.+||.+|++.+.++|++-|||-
T Consensus 325 ~~klGlGDFiFYs~Lvg~aa~~~~-----------------------~~~~~~~~~ail~Gl~~Tl~~l~~~~~alPALP 381 (403)
T PF01080_consen 325 GIKLGLGDFIFYSVLVGRAAMYGD-----------------------WNTVVACFVAILIGLCLTLLLLAIFRKALPALP 381 (403)
T ss_dssp --SS-TTTHHHHHHHHHHHHHH-T-----------------------TTTHHHHHHHHHHHHHHHHHHHHHHT-S-SSSS
T ss_pred ceeecchhHHHHHHHHhHHHhcCC-----------------------HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCc
Confidence 4458999999888777755443 1 123568899999999999999999999999997
Q ss_pred hhhhhhhHHHH
Q psy13063 278 YLVPFTLLPLL 288 (316)
Q Consensus 278 YLVP~~L~~~~ 288 (316)
++..++.+.
T Consensus 382 --isi~~g~~~ 390 (403)
T PF01080_consen 382 --ISIALGLIF 390 (403)
T ss_dssp --S----HHHH
T ss_pred --HHHHHHHHH
Confidence 455555443
No 13
>COG3389 Uncharacterized protein conserved in archaea [Function unknown]
Probab=83.05 E-value=3.4 Score=39.24 Aligned_cols=35 Identities=37% Similarity=0.510 Sum_probs=25.6
Q ss_pred cccccccCCCcccccceeeeeehhhhhhhhhhhhcCCCCcccc
Q psy13063 168 HFSMLRGLGDIVMPGLLLCFVLRYDAYKKSQLLHLGLGDIVMP 210 (316)
Q Consensus 168 ~~SmL~glGDiv~Pg~l~~~~lr~d~~~~~~~~~lGlGDIviP 210 (316)
+-=|. |+||+|||.+++.-+.=|..+. +++=+++|
T Consensus 198 na~mi-G~GDavmPsIlVvSaa~f~~s~-------~l~f~~Lp 232 (277)
T COG3389 198 NAYMI-GLGDAVMPSILVVSAAFFLISP-------ILAFIVLP 232 (277)
T ss_pred ceEEE-eechhhcccceeeehHHhccCC-------chhhhhHH
Confidence 33455 9999999999998766665544 56667777
No 14
>cd06853 GT_WecA_like This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins. WecA is an UDP-N-acetylglucosamine (GlcNAc):undecaprenyl-phosphate (Und-P) GlcNAc-1-phosphate transferase that catalyzes the formation of a phosphodiester bond between a membrane-associated undecaprenyl-phosphate molecule and N-acetylglucosamine 1-phosphate, which is usually donated by a soluble UDP-N-acetylglucosamine precursor. WecA participates in the biosynthesis of O antigen LPS in many enteric bacteria and is also involved in the biosynthesis of enterobacterial common antigen. A conserved short sequence motif and a conserved arginine at a cytosolic loop of this integral membrane protein were shown to be critical in recognition of substrate UDP-N-acetylglucosamine.
Probab=62.76 E-value=22 Score=33.14 Aligned_cols=67 Identities=22% Similarity=0.230 Sum_probs=41.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCcCccchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCCCCCCCC
Q psy13063 248 FHCSLIGYFLGLLTATVSSEIFKAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSKHM 314 (316)
Q Consensus 248 F~~sligY~~GL~~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~~~ 314 (316)
|.-=.-+|.+|.+++..+....+.+|.+.-...|..+....+.-..+-=+++.++|+-..++++.|+
T Consensus 174 FmGD~GS~~lG~~la~~~l~~~~~~~~~~~~~~~~~~~~~p~~D~~~~i~rR~~~g~~~f~~dr~Hl 240 (249)
T cd06853 174 FMGDAGSLFLGFLLAVLSILGTQKSSTAISPVVPLLILAVPLFDTLFVIIRRLLRGRSPFQADRDHL 240 (249)
T ss_pred EeeCchHHHHHHHHHHHHHHHhccCCcchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCchhH
Confidence 4333556777777776666665554433333344445545444444555666789999999899884
No 15
>PRK13743 conjugal transfer protein TrbF; Provisional
Probab=59.55 E-value=25 Score=30.65 Aligned_cols=68 Identities=24% Similarity=0.433 Sum_probs=55.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhC------cCccch----------hhhhhhhhHHH---HHHHHHHhHHHHhccCccc
Q psy13063 246 SYFHCSLIGYFLGLLTATVSSEIFK------AAQPAL----------LYLVPFTLLPL---LTMAYLKGDLRRMWSEPFI 306 (316)
Q Consensus 246 ~YF~~sligY~~GL~~t~~a~~~~~------~gQPAL----------LYLVP~~L~~~---~~~A~~RgEl~~~W~g~~~ 306 (316)
.||-..++-...|.++..++-.+++ ++-|-| .|..|.+|.++ +++|..+.-+.++-+|.++
T Consensus 39 ~Y~~LfiVFl~AG~vLw~vM~~iFd~CIDsWkAdpeLnn~rymWNilMYaIPy~L~Ala~GFlv~~~~~p~~~~i~~~~~ 118 (141)
T PRK13743 39 IYFDLFIVFLTAGIVLWVIMHSIFDACIDSWKADPELNNFRYMWNILMYVIPYTLWALAAGFLVAGVRNPLCELINGGIR 118 (141)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHhhhcChhhhhHHHHHHHHHHHHHHHHHHHHhchhhhhhhhHHHHHHhccee
Confidence 7999999999999999888777765 677877 89999998877 4777788888888888876
Q ss_pred CCCCCCC
Q psy13063 307 IVPPSKH 313 (316)
Q Consensus 307 ~~~~~~~ 313 (316)
+-..+.+
T Consensus 119 ~f~l~r~ 125 (141)
T PRK13743 119 IFRLKRR 125 (141)
T ss_pred eeehhcc
Confidence 6554443
No 16
>cd06854 GT_WbpL_WbcO_like The members of this subfamily catalyze the formation of a phosphodiester bond between a membrane-associated undecaprenyl-phosphate (Und-P) molecule and N-acetylhexosamine 1-phosphate, which is usually donated by a soluble UDP-N-acetylhexosamine precursor. The WbcO/WbpL substrate specificity has not yet been determined, but the structure of their biosynthetic end products implies that UDP-N-acetyl-D-fucosamine (UDP-FucNAc) and/or UDPN-acetyl-D-quinosamine (UDP-QuiNAc) are used. The subgroup of bacterial UDP-HexNAc:polyprenol-P HexNAc-1-P transferases includes the WbcO protein from Yersinia enterocolitica and the WbpL protein from Pseudomonas aeruginosa. These transferases initiate LPS O-antigen biosynthesis. Similar to other GlcNAc/MurNAc-1-P transferase family members, WbpL is a highly hydrophobic protein possessing 11 predicted transmembrane segments.
Probab=56.72 E-value=27 Score=32.77 Aligned_cols=64 Identities=23% Similarity=0.219 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCcCccchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCCCCCCCC
Q psy13063 248 FHCSLIGYFLGLLTATVSSEIFKAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSKHM 314 (316)
Q Consensus 248 F~~sligY~~GL~~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~~~ 314 (316)
|.-=.-+|.+|..++..+....+..|.+ +.|..+....+.-..+-=++++.+|+-...+++.|+
T Consensus 176 FmGD~GS~~lG~~la~~~i~~~~~~~~~---~~~l~~~~~p~~D~~~~i~~R~~~g~~~~~~dr~Hl 239 (253)
T cd06854 176 FMGDVGSTFLGFLLAALLLLLALSGQSP---WAWLLLLSPFLVDATVTLLRRLLRGENIFQAHRKHL 239 (253)
T ss_pred ccccccHHHHHHHHHHHHHHHHhccccH---HHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhH
Confidence 3333556777777777666665555543 334444444444334445666789999999999984
No 17
>PRK15119 undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase; Provisional
Probab=54.61 E-value=36 Score=33.99 Aligned_cols=63 Identities=11% Similarity=0.113 Sum_probs=40.8
Q ss_pred HHHHHHHHHHHHHHHHHhCcCccchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCCCCCCCC
Q psy13063 252 LIGYFLGLLTATVSSEIFKAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSKHM 314 (316)
Q Consensus 252 ligY~~GL~~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~~~ 314 (316)
.-+|.+|.+++..+....+..|++.--+.|..++..-+.-..+-=++++.+|+-...++++|+
T Consensus 218 ~GS~~LGf~la~~~i~~~~~~~~~~~~~~~l~~l~~Pl~D~~~~~~rR~~~g~~~f~adr~Hl 280 (365)
T PRK15119 218 AGSTLIGFTVIWILLETTQGKTHPISPVTALWIIAIPLMDMVAIMYRRLRKGMSPFSPDRQHI 280 (365)
T ss_pred hhHHHHHHHHHHHHHHHhcccccchhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCcH
Confidence 455677777776666666655544333445444444444444555777899999999999994
No 18
>TIGR02380 ECA_wecA undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase. Members of this family are the WecA enzyme of enterobacterial common antigen biosynthesis, undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase. This family represents one narrow clade, and closely related sequences outside this clade may represent enzymes that catalyze the same specific reaction, but in the context of different pathways. A His-rich motif in a cytosolic loop of this integral membrane protein, shown critical to enzymatic activity for WecA (see ref ) is variously present or absent in the clade that includes Bacillus subtilis TagO teichoic acid biosynthesis enzyme, which may catalyze the same reaction as WecA.
Probab=37.61 E-value=1e+02 Score=30.63 Aligned_cols=63 Identities=17% Similarity=0.173 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHHHHHHhCcCc-cchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCCCCCCCC
Q psy13063 252 LIGYFLGLLTATVSSEIFKAAQ-PALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSKHM 314 (316)
Q Consensus 252 ligY~~GL~~t~~a~~~~~~gQ-PALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~~~ 314 (316)
.-+|.+|.+++..+....+..| |...-+.|..++.+-+.-..+-=++++.+|+-...++++|+
T Consensus 213 ~GS~~LGf~la~~~i~~~~~~~~~~~~~~~~l~~l~~Pl~D~~~~i~rR~~~g~~~f~adr~Hl 276 (346)
T TIGR02380 213 AGSTFIGFTVIWLLLLATQGESSPSMRPVTALWLIALPLMDMAAIMLRRIRKGDSPFKPDRQHL 276 (346)
T ss_pred hhhHHHHHHHHHHHHHHHccccccchhHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCccH
Confidence 4456677676666665555443 32222233333333233333333456779999999999984
No 19
>cd06855 GT_GPT_euk UDP-GlcNAc:dolichol-P GlcNAc-1-P transferase (GPT) catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc to dolichol-P to form GlcNAc-P-P-dolichol. The reaction is the first step in the assembly of dolichol-linked oligosaccharide intermediates and is essential for eukaryotic N-glycosylation. GPT activity has been identified in all eukaryotic cells examined to date. A series of six conserved motifs designated A through F, ranging in length from 5 to 13 amino acid residues, has been identified in this family. They have been determined to be important for stable expression, substrate binding, or catalytic activities.
Probab=25.77 E-value=96 Score=29.86 Aligned_cols=63 Identities=19% Similarity=0.232 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHHHHHHHhCcCccchhhhhhhhhHHHHHHHHHHhHHHHhccCcccCCCCCCCCC
Q psy13063 252 LIGYFLGLLTATVSSEIFKAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSKHMD 315 (316)
Q Consensus 252 ligY~~GL~~t~~a~~~~~~gQPALLYLVP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~~~~ 315 (316)
.-++.+|.+++..+... +...+.+.++.|-+.-.++..=...+-.++.=+++-...++++|+|
T Consensus 219 ~GSl~LG~~la~~~i~~-~~~~~~~~~~~p~~~~~~l~~P~~~~~~~r~~~~~~~~~~d~~hl~ 281 (283)
T cd06855 219 TFTYFAGMVFAVVGILG-HFSKTLLLFFIPQIINFLYSLPQLFGIVPCPRHRLPKFNPKTGLLE 281 (283)
T ss_pred hHHHHHHHHHHHHHHHH-hhHHHHHHHHHHHHHHHHHHHhHhheeeeccccCCCCCCCCcCCCC
Confidence 44566676666555443 3333555667777744433322222233334466777888899986
No 20
>PRK14412 membrane protein; Provisional
Probab=23.50 E-value=3.7e+02 Score=24.70 Aligned_cols=34 Identities=12% Similarity=0.125 Sum_probs=23.6
Q ss_pred hhhhhHHHHHHHHHHhHHHHhccCcccCCCCCCC
Q psy13063 280 VPFTLLPLLTMAYLKGDLRRMWSEPFIIVPPSKH 313 (316)
Q Consensus 280 VP~~L~~~~~~A~~RgEl~~~W~g~~~~~~~~~~ 313 (316)
++..+++.+..---|.+++++++|+-....+|+.
T Consensus 163 ~~~~~~~~lii~rHr~NI~Rl~~g~E~k~~~k~~ 196 (198)
T PRK14412 163 VFSLILASLAIFQHRSNIKRLLAGTESKLGQKTE 196 (198)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCCCcccc
Confidence 4455566666666689999999998766554443
No 21
>TIGR02587 putative integral membrane protein TIGR02587. Members of this family are found in Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Sinorhizobium meliloti, and Gloeobacter violaceus in a conserved two-gene neighborhood. This family, as defined, includes some members of COG4711 but is narrower and strictly bacterial. Members appear to span the membrane seven times.
Probab=21.43 E-value=1.6e+02 Score=28.65 Aligned_cols=49 Identities=16% Similarity=0.165 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCc---Cccc-------hhhhhhhhhHHHHHHHHHHhH
Q psy13063 248 FHCSLIGYFLGLLTATVSSEIFKA---AQPA-------LLYLVPFTLLPLLTMAYLKGD 296 (316)
Q Consensus 248 F~~sligY~~GL~~t~~a~~~~~~---gQPA-------LLYLVP~~L~~~~~~A~~RgE 296 (316)
..=++.+|++|++++.+++.++++ ++|. ++=-+|+.+|+.......+++
T Consensus 69 i~eti~ay~Iglv~S~~~L~lfgri~~~~pl~e~Lg~vivl~vP~sIGaAlaR~~L~~~ 127 (271)
T TIGR02587 69 VIDTVEAMAIGFVCSAAMLWLFGIITPETSLKEIVGKVAFQGVPFSLGAALARQQLGDT 127 (271)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCCCCCHHHHHHHHHHHhCcHHHHHHHHHHHhCCC
Confidence 345699999999999999999987 5563 222389999887777766554
Done!