Query psy3106
Match_columns 433
No_of_seqs 342 out of 600
Neff 7.1
Searched_HMMs 46136
Date Fri Aug 16 16:38:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3106.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3106hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00241 ZP Zona pellucida ( 99.8 1.2E-19 2.6E-24 175.5 16.8 195 154-359 16-242 (253)
2 cd01099 PAN_AP_HGF Subfamily o 99.6 9.4E-16 2E-20 122.8 8.7 79 31-111 2-80 (80)
3 PF00100 Zona_pellucida: Zona 99.4 1.7E-12 3.8E-17 124.6 12.8 151 205-358 87-254 (265)
4 PF00024 PAN_1: PAN domain Thi 99.4 8.7E-13 1.9E-17 104.1 6.5 77 33-112 2-79 (79)
5 smart00473 PAN_AP divergent su 98.7 4.6E-08 1E-12 76.5 7.3 76 32-111 3-78 (78)
6 cd01099 PAN_AP_HGF Subfamily o 98.2 3E-06 6.4E-11 67.8 5.9 58 122-180 6-64 (80)
7 PF00024 PAN_1: PAN domain Thi 97.6 5.9E-05 1.3E-09 59.0 3.8 59 121-180 3-61 (79)
8 smart00473 PAN_AP divergent su 97.1 0.0015 3.2E-08 50.6 6.0 54 122-176 6-59 (78)
9 cd01100 APPLE_Factor_XI_like S 95.6 0.017 3.8E-07 44.9 4.0 55 33-89 4-58 (73)
10 cd00129 PAN_APPLE PAN/APPLE-li 95.6 0.036 7.9E-07 44.3 5.9 73 34-110 3-79 (80)
11 PF08276 PAN_2: PAN-like domai 94.7 0.093 2E-06 40.1 5.8 58 29-88 1-61 (66)
12 smart00223 APPLE APPLE domain. 92.9 0.16 3.4E-06 40.5 4.1 49 38-88 6-57 (79)
13 PF14295 PAN_4: PAN domain; PD 92.7 0.079 1.7E-06 37.6 2.1 33 51-85 13-50 (51)
14 cd01098 PAN_AP_plant Plant PAN 91.3 0.35 7.5E-06 37.9 4.5 54 33-89 12-65 (84)
15 PF08276 PAN_2: PAN-like domai 86.9 1.6 3.5E-05 33.2 5.1 52 122-174 7-60 (66)
16 cd00129 PAN_APPLE PAN/APPLE-li 82.4 2.7 5.9E-05 33.5 4.7 36 141-177 25-62 (80)
17 cd01100 APPLE_Factor_XI_like S 71.5 5.7 0.00012 30.6 3.7 36 139-176 23-58 (73)
18 PF08277 PAN_3: PAN-like domai 63.1 17 0.00036 27.5 4.8 39 49-91 15-53 (71)
19 PF05506 DUF756: Domain of unk 62.4 64 0.0014 25.7 8.3 43 253-295 32-77 (89)
20 PF14295 PAN_4: PAN domain; PD 61.9 5.4 0.00012 27.9 1.7 33 139-172 14-50 (51)
21 smart00223 APPLE APPLE domain. 60.2 13 0.00027 29.6 3.7 44 130-175 11-57 (79)
22 cd01098 PAN_AP_plant Plant PAN 51.2 26 0.00056 27.0 4.2 32 141-174 32-63 (84)
23 PTZ00265 multidrug resistance 49.3 8 0.00017 46.9 1.3 15 306-320 598-612 (1466)
24 KOG3441|consensus 37.4 82 0.0018 27.1 5.2 67 250-325 54-124 (149)
25 KOG4574|consensus 35.8 26 0.00057 39.4 2.5 32 35-66 515-547 (1007)
26 COG3485 PcaH Protocatechuate 3 33.5 71 0.0015 30.6 4.8 71 254-331 74-145 (226)
27 KOG4574|consensus 29.4 32 0.0007 38.7 1.9 6 380-385 948-953 (1007)
28 smart00765 MANEC The MANEC dom 28.6 1.1E+02 0.0024 25.2 4.5 52 35-88 10-74 (93)
29 PF02822 Antistasin: Antistasi 28.3 33 0.00072 21.3 1.1 9 310-318 15-23 (26)
30 PF07502 MANEC: MANEC domain; 28.0 1.2E+02 0.0026 24.8 4.6 44 33-78 7-58 (92)
31 PRK05483 rplN 50S ribosomal pr 26.9 1.2E+02 0.0026 26.2 4.7 61 269-329 31-105 (122)
32 PF07502 MANEC: MANEC domain; 24.3 82 0.0018 25.7 3.0 53 122-175 9-73 (92)
33 PF10836 DUF2574: Protein of u 23.9 1E+02 0.0022 25.0 3.3 57 242-303 26-82 (93)
34 PF05688 DUF824: Salmonella re 23.5 98 0.0021 22.1 2.9 18 269-286 7-24 (47)
35 TIGR01067 rplN_bact ribosomal 23.3 1.5E+02 0.0033 25.6 4.6 61 269-329 31-105 (122)
36 smart00605 CW CW domain. 22.1 2.5E+02 0.0055 22.4 5.6 37 51-90 19-55 (94)
37 TIGR03673 rpl14p_arch 50S ribo 20.1 1.6E+02 0.0035 25.8 4.1 61 269-329 44-115 (131)
No 1
>smart00241 ZP Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
Probab=99.83 E-value=1.2e-19 Score=175.50 Aligned_cols=195 Identities=21% Similarity=0.333 Sum_probs=141.2
Q ss_pred CceEEEEecCCCC-ccceEEec--CCCceeeccccccccccccccc------------eeEEEe-eCCceeeeccc-eeE
Q psy3106 154 PYRCHSYDYGDTG-DLVCRLSH--HSRATLVGIQEYRNTVGIEVGN------------RNTVIQ-HHDTIITSSDL-GLA 216 (433)
Q Consensus 154 ~F~Crs~~~~~~~-~~~C~Ls~--~s~~t~~~~~~P~~~v~~~Cg~------------~~vvVq-~hp~~~T~~Dr-~y~ 216 (433)
.|.++.+..+.+. +..|.... .++..+ .++.|+ ..||+ ++|+++ +||.+||..|+ .|+
T Consensus 16 ~~~g~i~~~~l~l~d~~C~~~~~~~~~~~~-~f~~~l----~~CGt~~~~~~~~~~ysn~v~~~~~~~~~itr~~~~~~~ 90 (253)
T smart00241 16 LFPGGIYVKGLYLGDPSCRPVFTDSTSAFV-SFEVPL----NGCGTRRQVNPDGIVYSNTLVVSPFHPGFITRDDRAAYH 90 (253)
T ss_pred CCCCeEEEeeEEeCCCCCCCccccCCCcEE-EEEecc----ccCCCeEEECCCeEEEEEEEEEccCCCCceEecCceEEE
Confidence 6778887776432 56899852 222222 123333 27987 789999 79999999999 999
Q ss_pred EEEeecCCCceeeeceeeccccccccccccceecCCCcEEEEEEcCC--CCccccee---EeCCeEEEEEEEeCC-CCCc
Q psy3106 217 VTCQYDLTNKTVSNEVDLGVQGDVKSALTEEVVVDSPNVAMKITDRS--GADVKPSA---EVGDPLALRFEILDP-NSPY 290 (433)
Q Consensus 217 v~C~y~~~~~~v~~~~~v~~~~~~~~~~~~~~~~~~P~c~~~I~~~~--g~~v~~~a---~vGd~l~~~w~~~~~-~~~~ 290 (433)
++|.|....+ +...+.+.... +..+. ......|.|+|+|+.++ |+|+ ..+ +|||+|+|+|++... +..|
T Consensus 91 ~~C~y~~~~~-~~~~~~~~~~~--~~~~~-~~~~g~~~~~m~l~~~~~~~~~~-~~~~~~~lg~~l~~~~~~~~~~~~~~ 165 (253)
T smart00241 91 FQCFYPENEK-VSLNLDVSTIP--PTELS-SVSEGPPTCSYRLYKDDSFGSPY-QSADYPVLGDPVYHEWSCDGADDPPL 165 (253)
T ss_pred EEEEEeCCCc-eEEEEEecCCC--CCCcc-cccCCCcEEEEEeccCCCCCCcc-cCCCCcccCCeEEEEEEEccCCCCCe
Confidence 9999987555 33333333211 11121 12256789999999753 4555 333 799999999999654 7999
Q ss_pred cEEEEEEEEEcCCC---CceEEEEcCCCCcccccccCcceeccc-cC----cccccccCCcceeEEE-eccCCcCCcc
Q psy3106 291 EIFVRELVAMDGVD---SSEIVLIDSNGCPTDHFIMGPLYKAAD-TG----KFLNCPHEGKCTLTMS-KVSSCPRSSI 359 (433)
Q Consensus 291 ~~~V~~C~a~dg~~---~~~~~lID~~GC~~d~~l~~~~~y~~~-~~----~sl~~~F~~~s~l~~~-~v~~C~~s~~ 359 (433)
+|+|++|+|.++.+ +.++.||| +|||+|+.+++.+.|..+ .. ....++|.+.+.+|++ .+..|.....
T Consensus 166 ~l~v~~C~at~~~~~~~~~~~~lI~-~GC~~d~~~~~~~~~~~~~~~~~~f~~~aF~F~~~~~v~~hC~v~vC~~~~~ 242 (253)
T smart00241 166 GLLVDNCYATPGSDPSSGPKYFIID-NGCPVDGYLDSTIPYNSSPNHYARFSVKVFKFADRSLVYFHCQIRLCDKSDG 242 (253)
T ss_pred EEEEeeEEEcCCCCCCCCCcEEEEE-CccCCCCccccceecCCCCcceEEEEEEEEEecCCCcEEEEEEEEEECCCCC
Confidence 99999999998852 68999999 999999999999999876 21 1346799999999997 8888876544
No 2
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.63 E-value=9.4e-16 Score=122.82 Aligned_cols=79 Identities=34% Similarity=0.538 Sum_probs=72.2
Q ss_pred CCeEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcccccCCCCcccCCCceEEec
Q psy3106 31 RAWCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVAGHGAFEETKDIEYLEN 110 (433)
Q Consensus 31 ~~~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~t~~~p~~f~~~~~~dY~En 110 (433)
..|.|++++++.|+|+++.++.+.|+++|+.+|+.+..|.||||+|++.+++|+|+++++.+. +..+..+.++|||||
T Consensus 2 ~~~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~~~~~C~L~~~~~~~~--~~~~~~~~~~dyyE~ 79 (80)
T cd01099 2 NDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKSKECILSDEDRMSS--GVKLLYDSNVDYYEN 79 (80)
T ss_pred CcccEEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEcCCCEEEEeCCCcccc--ccccccccCceeeec
Confidence 358999999999999999999999999999999998889999999999999999999999876 466666668999998
Q ss_pred C
Q psy3106 111 N 111 (433)
Q Consensus 111 ~ 111 (433)
.
T Consensus 80 ~ 80 (80)
T cd01099 80 K 80 (80)
T ss_pred C
Confidence 4
No 3
>PF00100 Zona_pellucida: Zona pellucida-like domain; InterPro: IPR001507 A large domain, containing around 260 amino acids, has been recognised in a variety of receptor-like eukaryotic glycoproteins []. All of these proteins are mosaic proteins composed of various domains and that all have a large extracellular region followed by either a transmembrane region and a very short cytoplasmic region or by a GPI-anchor. The domain common to all these proteins is located in the C-terminal portion of the extracellular region, and contains 8 conserved Cys residues, which are probably involved in disulphide bond formation. This entry includes a domain found in the zona pellucida proteins. The zona pellucida is a specialised extracellular matrix of mammailian eggs, which plays a crucial role in fertilisation by directly mediating species-restricted recongition between gametes. In mammals, up to four zona pellucida subnits are known, named ZP1-4. The full-length crystal structure of murine ZP3 has been solved []. This entry is also found in other glycoproteins, such as CD105 (also called endoglin) which is the regulatory component of the TGF-beta receptor complex. It is a modulator of cellular responses to TGF-beta 1. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 3D4G_G 3EF7_A 3D4C_A 3NK4_B 3NK3_B 3QW9_A.
Probab=99.41 E-value=1.7e-12 Score=124.61 Aligned_cols=151 Identities=21% Similarity=0.335 Sum_probs=92.5
Q ss_pred CceeeeccceeEEEEeecCCCceeeeceeeccccccccccccceecCCCcEEEEEEcCCC--C---cccceeEeCCeEEE
Q psy3106 205 DTIITSSDLGLAVTCQYDLTNKTVSNEVDLGVQGDVKSALTEEVVVDSPNVAMKITDRSG--A---DVKPSAEVGDPLAL 279 (433)
Q Consensus 205 p~~~T~~Dr~y~v~C~y~~~~~~v~~~~~v~~~~~~~~~~~~~~~~~~P~c~~~I~~~~g--~---~v~~~a~vGd~l~~ 279 (433)
+.+++..|+.|+++|.|......+..... ... ..+............++|++..... . .....+++||+|++
T Consensus 87 ~~i~~~~~~~~~~~C~y~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~m~l~~~~~~~~~~~~~~~~~~vg~~v~~ 163 (265)
T PF00100_consen 87 GNITRDSDRRFNFTCTYSSSQYSVSSGSP-NIS--SVTIHISQESSGSFSFSMRLYKDDSFSSPYSQPPATVEVGDRVYV 163 (265)
T ss_dssp TTECSSS-EEEEEEEEE-EC---------------B-TTT---STT--EEEEEEEESTTTSSCB---SSEEE-TT-EEEE
T ss_pred CceEEecccccccEEEECCccceeccccc-cce--eccccceeccCCccceEEEeccCccccccccCCceeEEeeeeEEE
Confidence 35667789999999999433333322211 110 0010111112334457777776431 1 22256899999999
Q ss_pred EEEEeCCC-CCccEEEEEEEEEcC---CCCceEEEEcCCCCcccccccCcceec---cccCc----ccccccCCcceeEE
Q psy3106 280 RFEILDPN-SPYEIFVRELVAMDG---VDSSEIVLIDSNGCPTDHFIMGPLYKA---ADTGK----FLNCPHEGKCTLTM 348 (433)
Q Consensus 280 ~w~~~~~~-~~~~~~V~~C~a~dg---~~~~~~~lID~~GC~~d~~l~~~~~y~---~~~~~----sl~~~F~~~s~l~~ 348 (433)
+|++.... ..+.|+|++|+|.++ .++.++.|||++|||+|...++.+.+. .+... ...++|.....+|+
T Consensus 164 ~v~~~~~~~~~~~~~v~~C~a~~~~~~~~~~~~~lId~~GC~~~~~~~~~~~~~~~~~~~~~~~f~f~aF~f~~~~~v~i 243 (265)
T PF00100_consen 164 EVSVSPDSSDPLGLFVDSCWATPSSDPDSSPSYTLIDENGCPVDSDDFVTFEYSGPNGDSSRARFSFRAFRFPDSSSVYI 243 (265)
T ss_dssp EEEEEECCC-TEEEEEEEEEEESSSSTTSSSCEEEEBCTTEEGGGCTTTEEEEETTTCCCCEEEEEEE-EEETSSSEEEE
T ss_pred EEEeccccceeeEEEeCEEEEeCCCCcccccccceeccccccccccceeEEEeccccccccEEEEEeeEEEecCCCeEEE
Confidence 99974332 237799999999988 568899999999999999999999987 33211 24568888999999
Q ss_pred E-eccCCcCCc
Q psy3106 349 S-KVSSCPRSS 358 (433)
Q Consensus 349 ~-~v~~C~~s~ 358 (433)
+ .+..|....
T Consensus 244 ~C~v~~C~~~~ 254 (265)
T PF00100_consen 244 HCSVRVCDKSD 254 (265)
T ss_dssp EEEEEEEETTS
T ss_pred EEEEEEECCCC
Confidence 7 888887754
No 4
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=99.38 E-value=8.7e-13 Score=104.05 Aligned_cols=77 Identities=31% Similarity=0.713 Sum_probs=68.3
Q ss_pred eEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcccccCCCCccc-CCCceEEecC
Q psy3106 33 WCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVAGHGAFEE-TKDIEYLENN 111 (433)
Q Consensus 33 ~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~t~~~p~~f~~-~~~~dY~En~ 111 (433)
|.|++++|+.|.|.....+.+.++++|+.+|..+.. .|+||+|++..+.|.|++.++.+. +..+.. ...++|||+.
T Consensus 2 ~~f~~~~~~~l~~~~~~~~~v~s~~~C~~~C~~~~~-~C~s~~y~~~~~~C~L~~~~~~~~--~~~~~~~~~~~~~ye~~ 78 (79)
T PF00024_consen 2 WAFERIPGYRLSGHSIKEINVPSLEECAQLCLNEPR-RCKSFNYDPSSKTCYLSSSDRSSL--PPRLTPSSPNVDYYEKS 78 (79)
T ss_dssp -TEEEEEEEEEESCEEEEEEESSHHHHHHHHHHSTT--ESEEEEETTTTEEEEECSSSSSE--STEEEEEEEEEEEEEEG
T ss_pred CCeEEECCEEEeCCcceEEcCCCHHHHHhhcCcCcc-cCCeEEEECCCCEEEEcCCCCCcc--cceeEeccCcEEEEEec
Confidence 789999999999998888888999999999999876 799999999999999999999985 245555 7899999998
Q ss_pred C
Q psy3106 112 C 112 (433)
Q Consensus 112 C 112 (433)
|
T Consensus 79 C 79 (79)
T PF00024_consen 79 C 79 (79)
T ss_dssp C
T ss_pred C
Confidence 8
No 5
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=98.71 E-value=4.6e-08 Score=76.45 Aligned_cols=76 Identities=26% Similarity=0.439 Sum_probs=61.8
Q ss_pred CeEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcccccCCCCcccCCCceEEecC
Q psy3106 32 AWCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVAGHGAFEETKDIEYLENN 111 (433)
Q Consensus 32 ~~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~t~~~p~~f~~~~~~dY~En~ 111 (433)
...|.++++..|.+++...+.+.++++|+++|+.+ .+.|+||+|.+..+.|+|+.+..... +. ......++|||+.
T Consensus 3 ~~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~-~~~C~s~~y~~~~~~C~l~~~~~~~~--~~-~~~~~~~~~y~~~ 78 (78)
T smart00473 3 DDCFVRLPNTKLPGFSRIVISVASLEECASKCLNS-NCSCRSFTYNNGTKGCLLWSESSLGD--AR-LFPSGGVDLYEKI 78 (78)
T ss_pred CceeEEecCccCCCCcceeEcCCCHHHHHHHhCCC-CCceEEEEEcCCCCEEEEeeCCcccc--ce-ecccCCceeEEeC
Confidence 35799999999987777667889999999999985 58999999998899999999555444 22 4456788999974
No 6
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=98.20 E-value=3e-06 Score=67.76 Aligned_cols=58 Identities=31% Similarity=0.543 Sum_probs=48.7
Q ss_pred eeeecCceecccceeeecccCHHHHHHhhhCC-CceEEEEecCCCCccceEEecCCCcee
Q psy3106 122 FKKLAGRILKTVDSVYQDVASIEECRELCLNS-PYRCHSYDYGDTGDLVCRLSHHSRATL 180 (433)
Q Consensus 122 ~~~~~~~~l~~~d~~~~~v~s~~eC~~~Cl~s-~F~Crs~~~~~~~~~~C~Ls~~s~~t~ 180 (433)
|.+.++++|.+.......+.|+++|+++|+++ +|.||||+|.. ...+|+|+..++.+.
T Consensus 6 f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~-~~~~C~L~~~~~~~~ 64 (80)
T cd01099 6 FVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNY-KSKECILSDEDRMSS 64 (80)
T ss_pred EEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEc-CCCEEEEeCCCcccc
Confidence 55778888877665555678999999999997 99999999997 589999999988544
No 7
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=97.61 E-value=5.9e-05 Score=58.97 Aligned_cols=59 Identities=37% Similarity=0.731 Sum_probs=48.5
Q ss_pred EeeeecCceecccceeeecccCHHHHHHhhhCCCceEEEEecCCCCccceEEecCCCcee
Q psy3106 121 EFKKLAGRILKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHSRATL 180 (433)
Q Consensus 121 ~~~~~~~~~l~~~d~~~~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~~s~~t~ 180 (433)
.|+++.|..|.+......+|.++++|+++|+..+..|++|.|.. ....|.|...++.+.
T Consensus 3 ~f~~~~~~~l~~~~~~~~~v~s~~~C~~~C~~~~~~C~s~~y~~-~~~~C~L~~~~~~~~ 61 (79)
T PF00024_consen 3 AFERIPGYRLSGHSIKEINVPSLEECAQLCLNEPRRCKSFNYDP-SSKTCYLSSSDRSSL 61 (79)
T ss_dssp TEEEEEEEEEESCEEEEEEESSHHHHHHHHHHSTT-ESEEEEET-TTTEEEEECSSSSSE
T ss_pred CeEEECCEEEeCCcceEEcCCCHHHHHhhcCcCcccCCeEEEEC-CCCEEEEcCCCCCcc
Confidence 58888998888864444488999999999999866999999997 588999999888764
No 8
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=97.07 E-value=0.0015 Score=50.61 Aligned_cols=54 Identities=37% Similarity=0.642 Sum_probs=42.3
Q ss_pred eeeecCceecccceeeecccCHHHHHHhhhCCCceEEEEecCCCCccceEEecCC
Q psy3106 122 FKKLAGRILKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHS 176 (433)
Q Consensus 122 ~~~~~~~~l~~~d~~~~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~~s 176 (433)
|.++++..|...+.....+.++++|+++|+++.+.|++|+|.. ....|+|...+
T Consensus 6 f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~~C~s~~y~~-~~~~C~l~~~~ 59 (78)
T smart00473 6 FVRLPNTKLPGFSRIVISVASLEECASKCLNSNCSCRSFTYNN-GTKGCLLWSES 59 (78)
T ss_pred eEEecCccCCCCcceeEcCCCHHHHHHHhCCCCCceEEEEEcC-CCCEEEEeeCC
Confidence 6667777776444333456799999999999779999999987 68899999833
No 9
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=95.55 E-value=0.017 Score=44.93 Aligned_cols=55 Identities=20% Similarity=0.372 Sum_probs=47.0
Q ss_pred eEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCC
Q psy3106 33 WCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMD 89 (433)
Q Consensus 33 ~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~ 89 (433)
.+|+..++..+.|....+..+.+.++|+++|+.+ -.|+.|.|....+.|.|-...
T Consensus 4 ~C~~~~~~~~~~g~d~~~~~~~s~~~Cq~~C~~~--~~C~afT~~~~~~~C~lk~~~ 58 (73)
T cd01100 4 SCFRQGSNVDFRGGDLSTVFASSAEQCQAACTAD--PGCLAFTYNTKSKKCFLKSSE 58 (73)
T ss_pred ccccccCCCccccCCcceeecCCHHHHHHHcCCC--CCceEEEEECCCCeEEcccCC
Confidence 4566667888888877777788999999999986 689999999989999998875
No 10
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=95.55 E-value=0.036 Score=44.27 Aligned_cols=73 Identities=19% Similarity=0.327 Sum_probs=48.0
Q ss_pred EEEEecCeEEeccee--eEeecCCHHHHHHHccc-CCCceeeeEEeeCCCCceEeecCCc-ccccCCCCcccCCCceEEe
Q psy3106 34 CFDRVHGYSLLGFVK--KVGTATSRTDCFEMCLG-ERDFICRSANYNNATGECHMSDMDR-ITVAGHGAFEETKDIEYLE 109 (433)
Q Consensus 34 ~Fer~~g~~L~g~~~--~~~~~~s~~eC~~~C~~-e~~f~CrS~~y~~~~~~C~Ls~~~~-~t~~~p~~f~~~~~~dY~E 109 (433)
.|-+.+|..|..... ....+.+++||+++|++ +.++.|+++.|....+.|.+-..+. ... ..+. +.+.+.|+
T Consensus 3 ~~~~~~g~fl~~~~~klpd~~~~s~~eC~~~Cl~~~~nCsC~Aya~~~~~~gC~~W~~~l~~d~---~~~~-~~g~~Ly~ 78 (80)
T cd00129 3 EFCKSAGTTLIKIALKIKTTKANTADECANRCEKNGLPFSCKAFVFAKARKQCLWFPFNSMSGV---RKEF-SHGFDLYE 78 (80)
T ss_pred eeeecCCeEEEeecccCCcccccCHHHHHHHHhcCCCCCCceeeeccCCCCCeEEecCcchhhH---Hhcc-CCCceeEe
Confidence 456667666642211 12244799999999998 3369999999976678999998877 332 2221 33566665
Q ss_pred c
Q psy3106 110 N 110 (433)
Q Consensus 110 n 110 (433)
+
T Consensus 79 r 79 (80)
T cd00129 79 N 79 (80)
T ss_pred E
Confidence 4
No 11
>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=94.71 E-value=0.093 Score=40.11 Aligned_cols=58 Identities=22% Similarity=0.429 Sum_probs=45.9
Q ss_pred CCCCeEEEEecCeEEecceeeEe-ecCCHHHHHHHcccCCCceeeeEEeeC--CCCceEeecC
Q psy3106 29 CERAWCFDRVHGYSLLGFVKKVG-TATSRTDCFEMCLGERDFICRSANYNN--ATGECHMSDM 88 (433)
Q Consensus 29 C~~~~~Fer~~g~~L~g~~~~~~-~~~s~~eC~~~C~~e~~f~CrS~~y~~--~~~~C~Ls~~ 88 (433)
|+..-.|.++++..|.+.....+ .+.++++|..+|++. =.|.++.|.. ..+.|.|-..
T Consensus 1 C~~~d~F~~l~~~~~p~~~~~~~~~~~s~~~C~~~Cl~n--CsC~Ayay~~~~~~~~C~lW~~ 61 (66)
T PF08276_consen 1 CGSGDGFLKLPNMKLPDFDNAIVDSSVSLEECEKACLSN--CSCTAYAYSNLSGGGGCLLWYG 61 (66)
T ss_pred CcCCCEEEEECCeeCCCCcceeeecCCCHHHHHhhcCCC--CCEeeEEeeccCCCCEEEEEcC
Confidence 44456899999999977655554 679999999999976 5799999975 5689987653
No 12
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=92.88 E-value=0.16 Score=40.52 Aligned_cols=49 Identities=14% Similarity=0.218 Sum_probs=43.5
Q ss_pred ecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCC---ceEeecC
Q psy3106 38 VHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATG---ECHMSDM 88 (433)
Q Consensus 38 ~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~---~C~Ls~~ 88 (433)
..+..+.|...+.+.+.+.++|++.|... -.|+.|.|..... .|.|-..
T Consensus 6 ~~~~df~G~Dl~~~~~~~~~~Cq~~Ct~~--~~C~~FTf~~~~~~~~~C~LK~s 57 (79)
T smart00223 6 YKNVDFRGSDINTVYVPSAQVCQKRCTSH--PRCLFFTFSTNEPPEEKCLLKDS 57 (79)
T ss_pred ccCccccCceeeeeecCCHHHHHHhhcCC--CCccEEEeeCCCCCCCEeEeCcC
Confidence 45777889999999999999999999976 5899999998888 9999876
No 13
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=92.72 E-value=0.079 Score=37.60 Aligned_cols=33 Identities=27% Similarity=0.631 Sum_probs=17.3
Q ss_pred eecCCHHHHHHHcccCCCceeeeEEeeC-----CCCceEe
Q psy3106 51 GTATSRTDCFEMCLGERDFICRSANYNN-----ATGECHM 85 (433)
Q Consensus 51 ~~~~s~~eC~~~C~~e~~f~CrS~~y~~-----~~~~C~L 85 (433)
+.+.+.++|+++|... ..|..+.|.. ..+.|.|
T Consensus 13 ~~~~s~~~C~~~C~~~--~~C~~~~~~~~~~~~~~~~C~L 50 (51)
T PF14295_consen 13 VTASSPEECQAACAAD--PGCQAFTFNPPGCPSSSGRCYL 50 (51)
T ss_dssp -----HHHHHHHHHTS--TT--EEEEETTEE---------
T ss_pred ccCCCHHHHHHHccCC--CCCCEEEEECCCcccccccccC
Confidence 3788999999999976 5799999998 5777776
No 14
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=91.33 E-value=0.35 Score=37.92 Aligned_cols=54 Identities=28% Similarity=0.428 Sum_probs=42.7
Q ss_pred eEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCC
Q psy3106 33 WCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMD 89 (433)
Q Consensus 33 ~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~ 89 (433)
..|..+++..+.+..... ...++++|.++|++. =.|.+|.|....+.|.+-...
T Consensus 12 ~~f~~~~~~~~~~~~~~~-~~~s~~~C~~~Cl~n--CsC~a~~~~~~~~~C~~~~~~ 65 (84)
T cd01098 12 DGFLKLPDVKLPDNASAI-TAISLEECREACLSN--CSCTAYAYNNGSGGCLLWNGL 65 (84)
T ss_pred CEEEEeCCeeCCCchhhh-ccCCHHHHHHHHhcC--CCcceeeecCCCCeEEEEece
Confidence 689888888776554443 778999999999975 689999998667999885444
No 15
>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=86.93 E-value=1.6 Score=33.23 Aligned_cols=52 Identities=33% Similarity=0.606 Sum_probs=35.7
Q ss_pred eeeecCceecccceee-ecccCHHHHHHhhhCCCceEEEEecCCC-CccceEEec
Q psy3106 122 FKKLAGRILKTVDSVY-QDVASIEECRELCLNSPYRCHSYDYGDT-GDLVCRLSH 174 (433)
Q Consensus 122 ~~~~~~~~l~~~d~~~-~~v~s~~eC~~~Cl~s~F~Crs~~~~~~-~~~~C~Ls~ 174 (433)
|.++++..++..+..+ ....++++|+.+|++. =.|.+|+|... +...|.+-.
T Consensus 7 F~~l~~~~~p~~~~~~~~~~~s~~~C~~~Cl~n-CsC~Ayay~~~~~~~~C~lW~ 60 (66)
T PF08276_consen 7 FLKLPNMKLPDFDNAIVDSSVSLEECEKACLSN-CSCTAYAYSNLSGGGGCLLWY 60 (66)
T ss_pred EEEECCeeCCCCcceeeecCCCHHHHHhhcCCC-CCEeeEEeeccCCCCEEEEEc
Confidence 5566666665554333 2446999999999996 13999998741 457898854
No 16
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=82.37 E-value=2.7 Score=33.46 Aligned_cols=36 Identities=17% Similarity=0.580 Sum_probs=29.2
Q ss_pred cCHHHHHHhhhCC--CceEEEEecCCCCccceEEecCCC
Q psy3106 141 ASIEECRELCLNS--PYRCHSYDYGDTGDLVCRLSHHSR 177 (433)
Q Consensus 141 ~s~~eC~~~Cl~s--~F~Crs~~~~~~~~~~C~Ls~~s~ 177 (433)
.+++||+++|+++ .+.|++|.|.. ....|.+-..+.
T Consensus 25 ~s~~eC~~~Cl~~~~nCsC~Aya~~~-~~~gC~~W~~~l 62 (80)
T cd00129 25 NTADECANRCEKNGLPFSCKAFVFAK-ARKQCLWFPFNS 62 (80)
T ss_pred cCHHHHHHHHhcCCCCCCceeeeccC-CCCCeEEecCcc
Confidence 4899999999994 68899999875 245899987655
No 17
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=71.55 E-value=5.7 Score=30.60 Aligned_cols=36 Identities=25% Similarity=0.593 Sum_probs=29.5
Q ss_pred cccCHHHHHHhhhCCCceEEEEecCCCCccceEEecCC
Q psy3106 139 DVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHS 176 (433)
Q Consensus 139 ~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~~s 176 (433)
.+.+.++|++.|+..+ .|+.|.|.. ....|.|-...
T Consensus 23 ~~~s~~~Cq~~C~~~~-~C~afT~~~-~~~~C~lk~~~ 58 (73)
T cd01100 23 FASSAEQCQAACTADP-GCLAFTYNT-KSKKCFLKSSE 58 (73)
T ss_pred ecCCHHHHHHHcCCCC-CceEEEEEC-CCCeEEcccCC
Confidence 3669999999999974 599999985 57899997654
No 18
>PF08277 PAN_3: PAN-like domain; InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs The PAN-3 or CW is a domain associated with a number of Caenorhabditis elegans hypothetical proteins.
Probab=63.14 E-value=17 Score=27.51 Aligned_cols=39 Identities=28% Similarity=0.589 Sum_probs=32.9
Q ss_pred eEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcc
Q psy3106 49 KVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRI 91 (433)
Q Consensus 49 ~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~ 91 (433)
......+.++|+..|... -.|--|-|. .+.|.|......
T Consensus 15 ~~~~~~sw~~Cv~~C~~~--~~C~la~~~--~~~C~~y~~~~i 53 (71)
T PF08277_consen 15 SSTTNTSWDDCVQKCYND--ENCVLAYFD--SGKCYLYNYGSI 53 (71)
T ss_pred ccccCCCHHHHhHHhCCC--CEEEEEEeC--CCCEEEEEcCCE
Confidence 445678999999999987 589999887 889999988764
No 19
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=62.45 E-value=64 Score=25.74 Aligned_cols=43 Identities=16% Similarity=0.343 Sum_probs=33.0
Q ss_pred CcEEEEEEcC---CCCcccceeEeCCeEEEEEEEeCCCCCccEEEE
Q psy3106 253 PNVAMKITDR---SGADVKPSAEVGDPLALRFEILDPNSPYEIFVR 295 (433)
Q Consensus 253 P~c~~~I~~~---~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~ 295 (433)
-.+.++|.+. .+.+..-.++-|+.+.+.|......+-|++.|+
T Consensus 32 ~~~~~~v~~~~y~~~~~~~~~v~ag~~~~~~w~l~~s~gwYDl~v~ 77 (89)
T PF05506_consen 32 AAVTFTVYDNAYGGGGPWTYTVAAGQTVSLTWPLAASGGWYDLTVT 77 (89)
T ss_pred CcEEEEEEeCCcCCCCCEEEEECCCCEEEEEEeecCCCCcEEEEEE
Confidence 3567777762 235666788889999999988556899999886
No 20
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=61.89 E-value=5.4 Score=27.86 Aligned_cols=33 Identities=24% Similarity=0.588 Sum_probs=16.1
Q ss_pred cccCHHHHHHhhhCCCceEEEEecCCC----CccceEE
Q psy3106 139 DVASIEECRELCLNSPYRCHSYDYGDT----GDLVCRL 172 (433)
Q Consensus 139 ~v~s~~eC~~~Cl~s~F~Crs~~~~~~----~~~~C~L 172 (433)
.+.+.++|+++|... -.|..|+|... ....|+|
T Consensus 14 ~~~s~~~C~~~C~~~-~~C~~~~~~~~~~~~~~~~C~L 50 (51)
T PF14295_consen 14 TASSPEECQAACAAD-PGCQAFTFNPPGCPSSSGRCYL 50 (51)
T ss_dssp ----HHHHHHHHHTS-TT--EEEEETTEE---------
T ss_pred cCCCHHHHHHHccCC-CCCCEEEEECCCcccccccccC
Confidence 456999999999975 35999988751 4567765
No 21
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=60.23 E-value=13 Score=29.55 Aligned_cols=44 Identities=20% Similarity=0.430 Sum_probs=31.7
Q ss_pred ecccceeeecccCHHHHHHhhhCCCceEEEEecCCCCcc---ceEEecC
Q psy3106 130 LKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDL---VCRLSHH 175 (433)
Q Consensus 130 l~~~d~~~~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~---~C~Ls~~ 175 (433)
+.+.|.....+.+.++|+++|... -.|+.|.|.. ... .|+|-..
T Consensus 11 f~G~Dl~~~~~~~~~~Cq~~Ct~~-~~C~~FTf~~-~~~~~~~C~LK~s 57 (79)
T smart00223 11 FRGSDINTVYVPSAQVCQKRCTSH-PRCLFFTFST-NEPPEEKCLLKDS 57 (79)
T ss_pred ccCceeeeeecCCHHHHHHhhcCC-CCccEEEeeC-CCCCCCEeEeCcC
Confidence 334444444567999999999975 3599999985 344 8999653
No 22
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=51.18 E-value=26 Score=27.00 Aligned_cols=32 Identities=44% Similarity=0.910 Sum_probs=24.9
Q ss_pred cCHHHHHHhhhCCCceEEEEecCCCCccceEEec
Q psy3106 141 ASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSH 174 (433)
Q Consensus 141 ~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~ 174 (433)
.++++|+++|++. =.|..|.|.. ....|.+-.
T Consensus 32 ~s~~~C~~~Cl~n-CsC~a~~~~~-~~~~C~~~~ 63 (84)
T cd01098 32 ISLEECREACLSN-CSCTAYAYNN-GSGGCLLWN 63 (84)
T ss_pred CCHHHHHHHHhcC-CCcceeeecC-CCCeEEEEe
Confidence 4889999999986 3488888875 367898753
No 23
>PTZ00265 multidrug resistance protein (mdr1); Provisional
Probab=49.26 E-value=8 Score=46.93 Aligned_cols=15 Identities=20% Similarity=0.481 Sum_probs=9.2
Q ss_pred ceEEEEcCCCCcccc
Q psy3106 306 SEIVLIDSNGCPTDH 320 (433)
Q Consensus 306 ~~~~lID~~GC~~d~ 320 (433)
-+++|+||-==++|+
T Consensus 598 P~ILlLDEpTSaLD~ 612 (1466)
T PTZ00265 598 PKILILDEATSSLDN 612 (1466)
T ss_pred CCEEEEeCcccccCH
Confidence 466777766555553
No 24
>KOG3441|consensus
Probab=37.43 E-value=82 Score=27.14 Aligned_cols=67 Identities=25% Similarity=0.325 Sum_probs=42.5
Q ss_pred cCCCcEEEEEEcCCCCcccceeEeCCeEEEEEEEeCCCCCccEEEEEEEEEc----CCCCceEEEEcCCCCcccccccCc
Q psy3106 250 VDSPNVAMKITDRSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVAMD----GVDSSEIVLIDSNGCPTDHFIMGP 325 (433)
Q Consensus 250 ~~~P~c~~~I~~~~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~~C~a~d----g~~~~~~~lID~~GC~~d~~l~~~ 325 (433)
...|.|. .+.... ..+++||.+.+-+.= .-.-.+.|..|.-.. ..+.+.+.|||++|=|+...|--+
T Consensus 54 gr~PrCI-HVYkkr-----gvg~~GDkiLvAIkG---QmkKa~vVGh~~~~k~~~P~fDsNniVLiddnGnPlGtRI~~P 124 (149)
T KOG3441|consen 54 GRLPRCI-HVYKKR-----GVGELGDKILVAIKG---QMKKAYVVGHVHYRKHGVPVFDSNNIVLIDDNGNPLGTRITAP 124 (149)
T ss_pred CCCCceE-EEEecc-----cccccccEEEEEEec---ceeeeEEEEeeccCCCCCcccCCCcEEEECCCCCcccceEecc
Confidence 4566664 343333 367889999877753 333355777765532 123468899999999998766443
No 25
>KOG4574|consensus
Probab=35.75 E-value=26 Score=39.39 Aligned_cols=32 Identities=19% Similarity=0.017 Sum_probs=18.8
Q ss_pred EEEecCeEE-ecceeeEeecCCHHHHHHHcccC
Q psy3106 35 FDRVHGYSL-LGFVKKVGTATSRTDCFEMCLGE 66 (433)
Q Consensus 35 Fer~~g~~L-~g~~~~~~~~~s~~eC~~~C~~e 66 (433)
||..+=+++ +.+.--.+....+++=+..|+.+
T Consensus 515 fe~~klRe~rksedsn~~S~pEied~ai~mLDe 547 (1007)
T KOG4574|consen 515 FEAPKLRELRKSEDSNRLSAPEIEDLAILMLDE 547 (1007)
T ss_pred CccchhhHHhhhcccCcccchhHHHHHHHHhcc
Confidence 555554444 34445555666677777777765
No 26
>COG3485 PcaH Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=33.50 E-value=71 Score=30.64 Aligned_cols=71 Identities=17% Similarity=0.174 Sum_probs=46.4
Q ss_pred cEEEEEEcCCCCcccceeEeCCeEEEEEEEeCCCCCccEEEEEEEE-EcCCCCceEEEEcCCCCcccccccCcceeccc
Q psy3106 254 NVAMKITDRSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVA-MDGVDSSEIVLIDSNGCPTDHFIMGPLYKAAD 331 (433)
Q Consensus 254 ~c~~~I~~~~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~~C~a-~dg~~~~~~~lID~~GC~~d~~l~~~~~y~~~ 331 (433)
.++.+|+|.+|.|| ..|. .-.|.| +..+.|.-.--+=.+ .+.-.|...++.|++|=-.-..|.|....-.+
T Consensus 74 ~l~G~VlD~~G~Pv-~~A~-----VEiWQA-da~GrY~~~~d~~~~~~~~f~g~Gr~~Td~~G~y~F~Ti~Pg~yp~~~ 145 (226)
T COG3485 74 LLEGRVLDGNGRPV-PDAL-----VEIWQA-DADGRYSHPKDSRLAPLPNFNGRGRTITDEDGEYRFRTIKPGPYPWRN 145 (226)
T ss_pred EEEEEEECCCCCCC-CCCE-----EEEEEc-CCCCcccCccccccCcCccccceEEEEeCCCceEEEEEeecccccCCC
Confidence 67889999999998 6665 578999 456666420000000 01233677899999998877788886555444
No 27
>KOG4574|consensus
Probab=29.37 E-value=32 Score=38.73 Aligned_cols=6 Identities=33% Similarity=0.512 Sum_probs=2.4
Q ss_pred CCceee
Q psy3106 380 NGESVN 385 (433)
Q Consensus 380 ng~~~n 385 (433)
+|+.+-
T Consensus 948 n~t~ll 953 (1007)
T KOG4574|consen 948 NGTKLL 953 (1007)
T ss_pred Ccccee
Confidence 344433
No 28
>smart00765 MANEC The MANEC domain was formerly called MANSC. This domain, comprising 8 conserved cysteines, is found in the N terminus of higher multicellular animal membrane and extracellular proteins. It is postulated that this domain may play a role in the formation of protein complexes involving various protease activators and inhibitors. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulfide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase.
Probab=28.60 E-value=1.1e+02 Score=25.15 Aligned_cols=52 Identities=17% Similarity=0.326 Sum_probs=35.3
Q ss_pred EEEecCeEEe-cc----eeeE----eecCCHHHHHHHcccCCCceeeeEEeeC----CCCceEeecC
Q psy3106 35 FDRVHGYSLL-GF----VKKV----GTATSRTDCFEMCLGERDFICRSANYNN----ATGECHMSDM 88 (433)
Q Consensus 35 Fer~~g~~L~-g~----~~~~----~~~~s~~eC~~~C~~e~~f~CrS~~y~~----~~~~C~Ls~~ 88 (433)
|++.++++|. +. ..+. ..+.+.++|..+|=... .|-=|.|.. ..+.|.|..=
T Consensus 10 f~~~~n~ii~t~~S~~~Ga~fL~~p~~~~s~edC~~aCC~~~--~CnlAv~e~~~~~~~~~CyLf~C 74 (93)
T smart00765 10 FRVLENAIIRTEESLSAGARFLKSPIAVNTWEDCVRACCSTP--NCNLAVFELRREDAEGNCYLFNC 74 (93)
T ss_pred cccCCCeeeccchhhhccccccCCccccCCHHHHHHHHcCCC--CCcEEEEeccCCCCCCceEEEEc
Confidence 7778888884 11 1122 46789999999999775 488888852 2466766553
No 29
>PF02822 Antistasin: Antistasin family; InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=28.28 E-value=33 Score=21.26 Aligned_cols=9 Identities=67% Similarity=1.265 Sum_probs=6.3
Q ss_pred EEcCCCCcc
Q psy3106 310 LIDSNGCPT 318 (433)
Q Consensus 310 lID~~GC~~ 318 (433)
..|++||++
T Consensus 15 ~~D~~GC~~ 23 (26)
T PF02822_consen 15 KTDENGCPT 23 (26)
T ss_dssp EE-TTSSEE
T ss_pred cCCCCCCCc
Confidence 568999986
No 30
>PF07502 MANEC: MANEC domain; InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=28.04 E-value=1.2e+02 Score=24.77 Aligned_cols=44 Identities=18% Similarity=0.353 Sum_probs=31.7
Q ss_pred eEEEEecCeEEe-------cce-eeEeecCCHHHHHHHcccCCCceeeeEEeeC
Q psy3106 33 WCFDRVHGYSLL-------GFV-KKVGTATSRTDCFEMCLGERDFICRSANYNN 78 (433)
Q Consensus 33 ~~Fer~~g~~L~-------g~~-~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~ 78 (433)
+.|+..++++|. |.. .....+.+.++|+.+|=..++ |-=|.|..
T Consensus 7 ~~f~~~~d~iIdt~~S~~~GA~fl~~~~v~s~~dC~~aCC~~~~--Cnlav~e~ 58 (92)
T PF07502_consen 7 DDFRSFPDFIIDTEESVENGATFLSSPEVSSAEDCVRACCSTPN--CNLAVFEE 58 (92)
T ss_pred cccccCCCeEEehHHHHHCCCeecCccccCCHHHHHHHhCCCCC--CcEEEEec
Confidence 458888898885 111 122378999999999997754 88888864
No 31
>PRK05483 rplN 50S ribosomal protein L14; Validated
Probab=26.93 E-value=1.2e+02 Score=26.16 Aligned_cols=61 Identities=25% Similarity=0.403 Sum_probs=40.4
Q ss_pred ceeEeCCeEEEEEEEeC-------CCCCccEEEEEEEE---EcC----CCCceEEEEcCCCCcccccccCcceec
Q psy3106 269 PSAEVGDPLALRFEILD-------PNSPYEIFVRELVA---MDG----VDSSEIVLIDSNGCPTDHFIMGPLYKA 329 (433)
Q Consensus 269 ~~a~vGd~l~~~w~~~~-------~~~~~~~~V~~C~a---~dg----~~~~~~~lID~~GC~~d~~l~~~~~y~ 329 (433)
..|.|||.+...+.=.. ++-...+-|+.-.- .|| .+++.+.|||++|=|+-..|++++.+.
T Consensus 31 ~~a~iGD~I~vsVkk~~~~~~~kkg~v~~AvIVrtkk~~~r~dG~~i~F~dNavVLin~~~~p~GTrI~Gpv~~e 105 (122)
T PRK05483 31 RYASIGDVIVVSVKEAIPRGKVKKGDVVKAVVVRTKKGVRRPDGSYIRFDDNAAVLLNNDGEPRGTRIFGPVARE 105 (122)
T ss_pred CccccCCEEEEEEEEcCCCCcccCCCEeeEEEEEeccceecCCCCEEEcCCCEEEEECCCCCEeEeEEeccchHH
Confidence 57888888877664211 12223444555443 233 335788999999999999999998764
No 32
>PF07502 MANEC: MANEC domain; InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=24.33 E-value=82 Score=25.75 Aligned_cols=53 Identities=26% Similarity=0.503 Sum_probs=33.5
Q ss_pred eeeecCceecccceee--------ecccCHHHHHHhhhCCCceEEEEecCCCCc---cc-eEEecC
Q psy3106 122 FKKLAGRILKTVDSVY--------QDVASIEECRELCLNSPYRCHSYDYGDTGD---LV-CRLSHH 175 (433)
Q Consensus 122 ~~~~~~~~l~~~d~~~--------~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~---~~-C~Ls~~ 175 (433)
|..+++.+|..-+++. ..|.+.++|..+|=..+= |-.+++..... .. |.|..=
T Consensus 9 f~~~~d~iIdt~~S~~~GA~fl~~~~v~s~~dC~~aCC~~~~-Cnlav~e~~~~~~~~~~CyLf~C 73 (92)
T PF07502_consen 9 FRSFPDFIIDTEESVENGATFLSSPEVSSAEDCVRACCSTPN-CNLAVFEERPTGSEDNSCYLFNC 73 (92)
T ss_pred cccCCCeEEehHHHHHCCCeecCccccCCHHHHHHHhCCCCC-CcEEEEeccCCCCCCceEEEEEc
Confidence 5556666666533211 267899999999987632 88888764322 23 888664
No 33
>PF10836 DUF2574: Protein of unknown function (DUF2574) ; InterPro: IPR020386 This entry contains proteins with no known function.
Probab=23.90 E-value=1e+02 Score=24.97 Aligned_cols=57 Identities=18% Similarity=0.179 Sum_probs=34.0
Q ss_pred cccccceecCCCcEEEEEEcCCCCcccceeEeCCeEEEEEEEeCCCCCccEEEEEEEEEcCC
Q psy3106 242 SALTEEVVVDSPNVAMKITDRSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVAMDGV 303 (433)
Q Consensus 242 ~~~~~~~~~~~P~c~~~I~~~~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~~C~a~dg~ 303 (433)
.+|.+.+....|+|+++|++..- .-+=|+.++|.=.-...+.+-.=.+.+-++-.+.
T Consensus 26 ATLtIsGrv~~PTCS~dvVn~~~-----QQrCGq~~~~~~~~~~~~tp~~GV~Tevv~vp~d 82 (93)
T PF10836_consen 26 ATLTISGRVSPPTCSTDVVNNQL-----QQRCGQNTYLVNTQNRASTPARGVTTEVVTVPGD 82 (93)
T ss_pred eEEEEcceEcCCcccHHHhccch-----hhhhccceEeeeccccccccccceEEEEEEcCCC
Confidence 35666677889999999997542 2234666554432211133444457777777664
No 34
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=23.49 E-value=98 Score=22.15 Aligned_cols=18 Identities=28% Similarity=0.551 Sum_probs=15.5
Q ss_pred ceeEeCCeEEEEEEEeCC
Q psy3106 269 PSAEVGDPLALRFEILDP 286 (433)
Q Consensus 269 ~~a~vGd~l~~~w~~~~~ 286 (433)
..||+||.|.|.++..+.
T Consensus 7 akaK~Ge~I~ltVt~kda 24 (47)
T PF05688_consen 7 AKAKVGETIPLTVTVKDA 24 (47)
T ss_pred hheecCCeEEEEEEEECC
Confidence 589999999999998653
No 35
>TIGR01067 rplN_bact ribosomal protein L14, bacterial/organelle. This model distinguishes bacterial and most organellar examples of ribosomal protein L14 from all archaeal and eukaryotic forms.
Probab=23.35 E-value=1.5e+02 Score=25.57 Aligned_cols=61 Identities=28% Similarity=0.365 Sum_probs=39.5
Q ss_pred ceeEeCCeEEEEEEEeC-------CCCCccEEEEEEEE---EcC----CCCceEEEEcCCCCcccccccCcceec
Q psy3106 269 PSAEVGDPLALRFEILD-------PNSPYEIFVRELVA---MDG----VDSSEIVLIDSNGCPTDHFIMGPLYKA 329 (433)
Q Consensus 269 ~~a~vGd~l~~~w~~~~-------~~~~~~~~V~~C~a---~dg----~~~~~~~lID~~GC~~d~~l~~~~~y~ 329 (433)
..|.|||.+...+.=.. ++-.+.+-|+.-.- .|| .+++.+.|||++|=|+-..|++++.+.
T Consensus 31 ~~a~iGD~I~vsVk~~~~~~~~kkg~v~~AvIVrtkk~~~r~dG~~i~F~~Na~VLin~~~~p~GTrI~Gpv~~e 105 (122)
T TIGR01067 31 RYATVGDVIVVVVKDAIPNGKVKKGDVVKAVIVRTKKGVRRKDGSYIRFDDNACVLINKNKEPRGTRIFGPVARE 105 (122)
T ss_pred CccccCCEEEEEEEEcCCCCccccccEEEEEEEEeecceEeCCCCEEECCCceEEEECCCCCEeeeEEEccchHH
Confidence 46888888877664211 11223444554433 233 345788999999999999999988764
No 36
>smart00605 CW CW domain.
Probab=22.08 E-value=2.5e+02 Score=22.43 Aligned_cols=37 Identities=19% Similarity=0.395 Sum_probs=28.7
Q ss_pred eecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCc
Q psy3106 51 GTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDR 90 (433)
Q Consensus 51 ~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~ 90 (433)
....+.++|...|... ..|--|-+. ....|.|.....
T Consensus 19 ~~~~sw~~Ci~~C~~~--~~Cvlay~~-~~~~C~~f~~~~ 55 (94)
T smart00605 19 AATLSWDECIQKCYED--SNCVLAYGN-SSETCYLFSYGT 55 (94)
T ss_pred ccCCCHHHHHHHHhCC--CceEEEecC-CCCceEEEEcCC
Confidence 3577899999999976 589866443 368999988864
No 37
>TIGR03673 rpl14p_arch 50S ribosomal protein L14P. Part of the 50S ribosomal subunit. Forms a cluster with proteins L3 and L24e, part of which may contact the 16S rRNA in 2 intersubunit bridges.
Probab=20.13 E-value=1.6e+02 Score=25.82 Aligned_cols=61 Identities=25% Similarity=0.420 Sum_probs=40.6
Q ss_pred ceeEeCCeEEEEEEEeC----CCCCccEEEEEEEE---EcC----CCCceEEEEcCCCCcccccccCcceec
Q psy3106 269 PSAEVGDPLALRFEILD----PNSPYEIFVRELVA---MDG----VDSSEIVLIDSNGCPTDHFIMGPLYKA 329 (433)
Q Consensus 269 ~~a~vGd~l~~~w~~~~----~~~~~~~~V~~C~a---~dg----~~~~~~~lID~~GC~~d~~l~~~~~y~ 329 (433)
.+|.|||.+...+.=.. ++-.+.+-|+.-.- .|| .+++.+.|||++|=|+-..|++++.+.
T Consensus 44 ~~a~iGD~IvvsVK~~~p~~kg~v~kAVIVRtkk~~~R~dGs~i~FddNa~VLin~~~~P~GTRI~GpV~rE 115 (131)
T TIGR03673 44 PCAGVGDMVVVSVKKGTPEMRKQVFKAVVVRQRKEYRRPDGTRVKFEDNAVVIVTPDGEPKGTEIKGPVARE 115 (131)
T ss_pred CccccCCEEEEEEEECCccccCCEeEEEEEEeCcceecCCCcEEEeCCcEEEEECCCCCEeeeEEEccchHH
Confidence 57889998877765211 12234444544433 233 235778999999999999999998765
Done!