Query psy1623
Match_columns 522
No_of_seqs 224 out of 412
Neff 5.8
Searched_HMMs 46136
Date Sat Aug 17 00:11:30 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1623.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1623hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00241 ZP Zona pellucida ( 100.0 8.8E-30 1.9E-34 252.9 24.4 220 129-390 1-239 (253)
2 PF00100 Zona_pellucida: Zona 99.9 5.8E-24 1.3E-28 208.4 22.4 224 129-392 1-254 (265)
3 cd01099 PAN_AP_HGF Subfamily o 99.3 8.1E-12 1.8E-16 104.3 6.5 77 23-108 4-80 (80)
4 PF00024 PAN_1: PAN domain Thi 98.5 1.7E-07 3.6E-12 76.1 5.7 74 23-108 4-78 (79)
5 smart00473 PAN_AP divergent su 98.0 1.6E-05 3.5E-10 64.0 6.3 75 21-108 4-78 (78)
6 cd01100 APPLE_Factor_XI_like S 91.4 0.28 6E-06 39.9 4.1 56 20-84 3-58 (73)
7 cd00129 PAN_APPLE PAN/APPLE-li 91.2 0.35 7.7E-06 40.6 4.6 56 42-107 23-79 (80)
8 PF14295 PAN_4: PAN domain; PD 88.0 0.31 6.7E-06 36.1 1.7 38 41-80 13-50 (51)
9 smart00223 APPLE APPLE domain. 86.0 1.1 2.3E-05 37.6 4.0 47 33-83 11-57 (79)
10 PF08276 PAN_2: PAN-like domai 74.7 5.7 0.00012 31.8 4.5 59 20-85 4-63 (66)
11 cd01098 PAN_AP_plant Plant PAN 49.4 18 0.00038 29.2 3.0 37 42-85 30-66 (84)
12 PF00558 Vpu: Vpu protein; In 36.2 24 0.00052 30.0 1.8 20 502-521 8-27 (81)
13 PF05454 DAG1: Dystroglycan (D 31.0 16 0.00035 38.0 0.0 23 499-522 149-171 (290)
14 PF10177 DUF2371: Uncharacteri 28.3 48 0.001 31.1 2.6 30 490-522 35-64 (141)
15 PF14814 UB2H: Bifunctional tr 27.1 98 0.0021 26.0 4.1 34 357-390 33-66 (85)
16 PF15102 TMEM154: TMEM154 prot 25.8 52 0.0011 31.0 2.4 23 499-522 61-83 (146)
17 PF09680 Tiny_TM_bacill: Prote 24.2 44 0.00096 22.0 1.1 15 502-516 7-21 (24)
18 PF01299 Lamp: Lysosome-associ 22.9 78 0.0017 32.8 3.3 39 484-522 258-296 (306)
19 PF07502 MANEC: MANEC domain; 22.4 1.2E+02 0.0025 26.3 3.7 41 41-83 33-73 (92)
20 COG4064 MtrG Tetrahydromethano 22.3 1.1E+02 0.0024 25.4 3.3 23 493-515 50-72 (75)
21 PF07787 DUF1625: Protein of u 21.8 88 0.0019 31.5 3.3 25 498-522 223-247 (248)
No 1
>smart00241 ZP Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
Probab=99.97 E-value=8.8e-30 Score=252.87 Aligned_cols=220 Identities=22% Similarity=0.370 Sum_probs=169.6
Q ss_pred EeCCCcEEEEEeCCCCceEEEEecCCcc---CceEeccCCceeEEEEecccCCCCCCceee--C--ceEEEEEEEE-ecC
Q psy1623 129 VCSEDGMEFTLRTPEGFLGRIYTYGYYD---RCFYRGNGGTANVLRISGAQGYPDCGTQRY--G--DTMTNIVVVQ-FSD 200 (522)
Q Consensus 129 ~C~~~~m~V~v~t~~pF~G~IYvkg~~~---~C~~~~~~~~~~~l~i~l~~~~~~CG~~~~--~--~~~s~~VVVQ-~hp 200 (522)
+|.+++|.|.|++..+|.|+||+++.+. +|+..+...+...+.|.++ .++||+++. + .+|+|+|++| +|+
T Consensus 1 ~C~~~~m~v~v~~~~~~~g~i~~~~l~l~d~~C~~~~~~~~~~~~~f~~~--l~~CGt~~~~~~~~~~ysn~v~~~~~~~ 78 (253)
T smart00241 1 QCGEDQMVVSVSTDLLFPGGIYVKGLYLGDPSCRPVFTDSTSAFVSFEVP--LNGCGTRRQVNPDGIVYSNTLVVSPFHP 78 (253)
T ss_pred CCCCCEEEEEEEecCCCCCeEEEeeEEeCCCCCCCccccCCCcEEEEEec--cccCCCeEEECCCeEEEEEEEEEccCCC
Confidence 5999999999999999999999999774 8999643333334445444 459999853 2 2799999999 899
Q ss_pred eEeeccce-EEEEEeeeCCCCCeEEEeeccccCCCCCcceeecCccCCCCCceEEEEee---CCCceeee---ecCCeEE
Q psy1623 201 YVQTSRDK-RYNLTCLFRGPGEAVVTSSYMTANSGSPIPIEYLPAENSLSSRVRLQILF---QGRPTNTI---AVGDPLT 273 (522)
Q Consensus 201 ~~vT~~Dr-~y~V~C~y~~~~e~~vt~~~~~~~~g~p~pi~~l~~e~~~~~~v~m~I~~---~G~~v~~a---~VGd~L~ 273 (522)
.++|..|+ .|+++|.|....+ +. ..+... ...+. .+.......+.++|++.. .|+++..+ ++||+|+
T Consensus 79 ~~itr~~~~~~~~~C~y~~~~~-~~-~~~~~~---~~~~~-~~~~~~~g~~~~~m~l~~~~~~~~~~~~~~~~~lg~~l~ 152 (253)
T smart00241 79 GFITRDDRAAYHFQCFYPENEK-VS-LNLDVS---TIPPT-ELSSVSEGPPTCSYRLYKDDSFGSPYQSADYPVLGDPVY 152 (253)
T ss_pred CceEecCceEEEEEEEEeCCCc-eE-EEEEec---CCCCC-CcccccCCCcEEEEEeccCCCCCCcccCCCCcccCCeEE
Confidence 99999999 9999999997433 32 222111 11111 111112234578999985 46677544 6999999
Q ss_pred EEEEEeCCCCceecEEEEEEEEECCCC---CceEEEEcCCCCceecccccCCCCCCCCCCCccccccceeccCCCCCCcc
Q psy1623 274 FKLDSQDGYNYVTDIFATNVIARDPYS---GRSVQLIDRYGYWIIESTFNHLPVNQCSPQRSFSALTVTIFSFRCPVDNF 350 (522)
Q Consensus 274 l~we~~~~~~~~y~i~V~~C~A~Dg~~---~~~~~LIDe~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~ngC~~D~~ 350 (522)
|+|++.......++|+|++|+|.++.+ +.++.||| +| ||+|+.
T Consensus 153 ~~~~~~~~~~~~~~l~v~~C~at~~~~~~~~~~~~lI~-~G---------------------------------C~~d~~ 198 (253)
T smart00241 153 HEWSCDGADDPPLGLLVDNCYATPGSDPSSGPKYFIID-NG---------------------------------CPVDGY 198 (253)
T ss_pred EEEEEccCCCCCeEEEEeeEEEcCCCCCCCCCcEEEEE-Cc---------------------------------cCCCCc
Confidence 999998642358899999999999853 68888998 88 999999
Q ss_pred ccCCeeeccC-CCeEEEEEEeEEcCCCCeEEEEEEEEEccC
Q psy1623 351 VFPALDRSRE-GDGLEARFNAFKIPESNFLVFEATVRTCRD 390 (522)
Q Consensus 351 i~~~l~~~~~-~~~l~a~f~AFKFpds~~V~f~C~V~~C~~ 390 (522)
+++.+.+..+ +..++..|+||||+++..|+|+|+|++|.+
T Consensus 199 ~~~~~~~~~~~~~~~~f~~~aF~F~~~~~v~~hC~v~vC~~ 239 (253)
T smart00241 199 LDSTIPYNSSPNHYARFSVKVFKFADRSLVYFHCQIRLCDK 239 (253)
T ss_pred cccceecCCCCcceEEEEEEEEEecCCCcEEEEEEEEEECC
Confidence 9999998765 678889999999999999999999999985
No 2
>PF00100 Zona_pellucida: Zona pellucida-like domain; InterPro: IPR001507 A large domain, containing around 260 amino acids, has been recognised in a variety of receptor-like eukaryotic glycoproteins []. All of these proteins are mosaic proteins composed of various domains and that all have a large extracellular region followed by either a transmembrane region and a very short cytoplasmic region or by a GPI-anchor. The domain common to all these proteins is located in the C-terminal portion of the extracellular region, and contains 8 conserved Cys residues, which are probably involved in disulphide bond formation. This entry includes a domain found in the zona pellucida proteins. The zona pellucida is a specialised extracellular matrix of mammailian eggs, which plays a crucial role in fertilisation by directly mediating species-restricted recongition between gametes. In mammals, up to four zona pellucida subnits are known, named ZP1-4. The full-length crystal structure of murine ZP3 has been solved []. This entry is also found in other glycoproteins, such as CD105 (also called endoglin) which is the regulatory component of the TGF-beta receptor complex. It is a modulator of cellular responses to TGF-beta 1. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 3D4G_G 3EF7_A 3D4C_A 3NK4_B 3NK3_B 3QW9_A.
Probab=99.92 E-value=5.8e-24 Score=208.42 Aligned_cols=224 Identities=25% Similarity=0.413 Sum_probs=146.7
Q ss_pred EeCCCcEEEEEeCCCCceEEEEec-CCcc--CceE-eccCCceeEEEEecccCCCCCCceeeC---c--eEEEEEEEEec
Q psy1623 129 VCSEDGMEFTLRTPEGFLGRIYTY-GYYD--RCFY-RGNGGTANVLRISGAQGYPDCGTQRYG---D--TMTNIVVVQFS 199 (522)
Q Consensus 129 ~C~~~~m~V~v~t~~pF~G~IYvk-g~~~--~C~~-~~~~~~~~~l~i~l~~~~~~CG~~~~~---~--~~s~~VVVQ~h 199 (522)
+|+++.|.|.|++..+|.|.||+. ++.. +|+. ...+.....++| ++....||+.+.. + .|+++|+++.+
T Consensus 1 ~C~~~~~~v~i~~~~~~~~~i~~~~~~~~~~~C~~~~~~~~~~~~~~~--~~~~~~Cg~~~~~~~~~~~~y~n~i~~~~~ 78 (265)
T PF00100_consen 1 TCGSDSMTVTIKKDYLFTGNIYVSDLHLNDPSCRARSDTNSTHITFSI--PFNSPSCGTTRSIENGDSIVYSNTIVSSQD 78 (265)
T ss_dssp EE-SSEEEEEEESSTTSSS-B-GGGEEETTTTB--EEECTTTEEEEEE--ECCC--CTSEEEE--SSEEEEEEEEEEE--
T ss_pred CcCCCEEEEEEEeccccCCCEEeeeeEECCCCCCCcccCCCcEEEEEE--ecCCCCCcccccccccCcceeeeeeeeecc
Confidence 699999999999999999999984 4443 7999 444444434444 3332249998332 2 58999998876
Q ss_pred C--------eEeeccceEEEEEeeeCCCCCeEEEeeccccCCCCCcceeecCccCCCCCceEEEEeeC-------CCcee
Q psy1623 200 D--------YVQTSRDKRYNLTCLFRGPGEAVVTSSYMTANSGSPIPIEYLPAENSLSSRVRLQILFQ-------GRPTN 264 (522)
Q Consensus 200 p--------~~vT~~Dr~y~V~C~y~~~~e~~vt~~~~~~~~g~p~pi~~l~~e~~~~~~v~m~I~~~-------G~~v~ 264 (522)
. .+++..|..|+++|.|... ...++....... ..++.... .......++|++..+ ..+..
T Consensus 79 ~~~~~~~~~~i~~~~~~~~~~~C~y~~~-~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~m~l~~~~~~~~~~~~~~~ 153 (265)
T PF00100_consen 79 VSFNDSPGGNITRDSDRRFNFTCTYSSS-QYSVSSGSPNIS---SVTIHISQ-ESSGSFSFSMRLYKDDSFSSPYSQPPA 153 (265)
T ss_dssp SSSS---STTECSSS-EEEEEEEEE-EC----------------B-TTT----STT--EEEEEEEESTTTSSCB---SSE
T ss_pred ccccccccCceEEecccccccEEEECCc-cceeccccccce---ecccccee-ccCCccceEEEeccCccccccccCCce
Confidence 4 5777899999999999321 212221111000 00110000 111122477777742 24557
Q ss_pred eeecCCeEEEEEEEeCCCCceecEEEEEEEEECC---CCCceEEEEcCCCCceecccccCCCCCCCCCCCccccccceec
Q psy1623 265 TIAVGDPLTFKLDSQDGYNYVTDIFATNVIARDP---YSGRSVQLIDRYGYWIIESTFNHLPVNQCSPQRSFSALTVTIF 341 (522)
Q Consensus 265 ~a~VGd~L~l~we~~~~~~~~y~i~V~~C~A~Dg---~~~~~~~LIDe~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii 341 (522)
.+++||+|+++|++.....+.+.++|++|+|.++ .++..+.|||++|
T Consensus 154 ~~~vg~~v~~~v~~~~~~~~~~~~~v~~C~a~~~~~~~~~~~~~lId~~G------------------------------ 203 (265)
T PF00100_consen 154 TVEVGDRVYVEVSVSPDSSDPLGLFVDSCWATPSSDPDSSPSYTLIDENG------------------------------ 203 (265)
T ss_dssp EE-TT-EEEEEEEEEECCC-TEEEEEEEEEEESSSSTTSSSCEEEEBCTT------------------------------
T ss_pred eEEeeeeEEEEEEeccccceeeEEEeCEEEEeCCCCcccccccceecccc------------------------------
Confidence 8899999999999876532226699999999998 6678899999999
Q ss_pred cCCCCCCccccCCeeec---cCCCeEEEEEEeEEcCCCCeEEEEEEEEEccCCC
Q psy1623 342 SFRCPVDNFVFPALDRS---REGDGLEARFNAFKIPESNFLVFEATVRTCRDGC 392 (522)
Q Consensus 342 ~ngC~~D~~i~~~l~~~---~~~~~l~a~f~AFKFpds~~V~f~C~V~~C~~~C 392 (522)
||+|...++.+.+. .++..+...|+||||+++..|+|+|+|++|...|
T Consensus 204 ---C~~~~~~~~~~~~~~~~~~~~~~~f~f~aF~f~~~~~v~i~C~v~~C~~~~ 254 (265)
T PF00100_consen 204 ---CPVDSDDFVTFEYSGPNGDSSRARFSFRAFRFPDSSSVYIHCSVRVCDKSD 254 (265)
T ss_dssp ---EEGGGCTTTEEEEETTTCCCCEEEEEEE-EEETSSSEEEEEEEEEEEETTS
T ss_pred ---ccccccceeEEEeccccccccEEEEEeeEEEecCCCeEEEEEEEEEECCCC
Confidence 99999999999987 5677899999999999999999999999999875
No 3
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.27 E-value=8.1e-12 Score=104.29 Aligned_cols=77 Identities=32% Similarity=0.508 Sum_probs=63.5
Q ss_pred cccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceeecCCc
Q psy1623 23 SNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEPTGDF 102 (522)
Q Consensus 23 f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~~~~~ 102 (522)
|..+-..++++.+++++.+.+.|+++|+++|+.+.+|.||||+|+.. ...|+|++.++.+. +..+....++
T Consensus 4 ~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~~-----~~~C~L~~~~~~~~----~~~~~~~~~~ 74 (80)
T cd01099 4 FKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYK-----SKECILSDEDRMSS----GVKLLYDSNV 74 (80)
T ss_pred ccEEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEcC-----CCEEEEeCCCcccc----ccccccccCc
Confidence 33444456778999999999999999999999999999999999864 79999999999754 2334445589
Q ss_pred ceeecc
Q psy1623 103 DFYEKS 108 (522)
Q Consensus 103 dYyE~~ 108 (522)
|||||.
T Consensus 75 dyyE~~ 80 (80)
T cd01099 75 DYYENK 80 (80)
T ss_pred eeeecC
Confidence 999984
No 4
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=98.51 E-value=1.7e-07 Score=76.08 Aligned_cols=74 Identities=31% Similarity=0.551 Sum_probs=58.1
Q ss_pred cccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceee-cCC
Q psy1623 23 SNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEP-TGD 101 (522)
Q Consensus 23 f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~-~~~ 101 (522)
|+++. +. .+.+.....+.+.++++|+++|..+.. .|+||+|+.. .+.|+|++.+..+. +..+.. .++
T Consensus 4 f~~~~-~~-~l~~~~~~~~~v~s~~~C~~~C~~~~~-~C~s~~y~~~-----~~~C~L~~~~~~~~----~~~~~~~~~~ 71 (79)
T PF00024_consen 4 FERIP-GY-RLSGHSIKEINVPSLEECAQLCLNEPR-RCKSFNYDPS-----SKTCYLSSSDRSSL----PPRLTPSSPN 71 (79)
T ss_dssp EEEEE-EE-EEESCEEEEEEESSHHHHHHHHHHSTT--ESEEEEETT-----TTEEEEECSSSSSE----STEEEEEEEE
T ss_pred eEEEC-CE-EEeCCcceEEcCCCHHHHHhhcCcCcc-cCCeEEEECC-----CCEEEEcCCCCCcc----cceeEeccCc
Confidence 66666 23 345566666799999999999999987 8999999864 79999999999764 234555 789
Q ss_pred cceeecc
Q psy1623 102 FDFYEKS 108 (522)
Q Consensus 102 ~dYyE~~ 108 (522)
++|||+.
T Consensus 72 ~~~ye~~ 78 (79)
T PF00024_consen 72 VDYYEKS 78 (79)
T ss_dssp EEEEEEG
T ss_pred EEEEEec
Confidence 9999984
No 5
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=97.99 E-value=1.6e-05 Score=63.95 Aligned_cols=75 Identities=27% Similarity=0.497 Sum_probs=53.1
Q ss_pred cccccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceeecC
Q psy1623 21 PISNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEPTG 100 (522)
Q Consensus 21 ~~f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~~~ 100 (522)
..|..+.+.. + .++......+.++++|+++|+.+ ++.|+||+|+. +.+.|.|++.... .++. ...+.
T Consensus 4 ~~f~~~~~~~-l-~~~~~~~~~~~s~~~C~~~C~~~-~~~C~s~~y~~-----~~~~C~l~~~~~~----~~~~-~~~~~ 70 (78)
T smart00473 4 DCFVRLPNTK-L-PGFSRIVISVASLEECASKCLNS-NCSCRSFTYNN-----GTKGCLLWSESSL----GDAR-LFPSG 70 (78)
T ss_pred ceeEEecCcc-C-CCCcceeEcCCCHHHHHHHhCCC-CCceEEEEEcC-----CCCEEEEeeCCcc----ccce-ecccC
Confidence 4566666332 3 34555456789999999999997 68999999974 2689999993322 2222 45577
Q ss_pred Ccceeecc
Q psy1623 101 DFDFYEKS 108 (522)
Q Consensus 101 ~~dYyE~~ 108 (522)
+.+|||+.
T Consensus 71 ~~~~y~~~ 78 (78)
T smart00473 71 GVDLYEKI 78 (78)
T ss_pred CceeEEeC
Confidence 89999974
No 6
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=91.41 E-value=0.28 Score=39.88 Aligned_cols=56 Identities=18% Similarity=0.258 Sum_probs=40.8
Q ss_pred ccccccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCC
Q psy1623 20 LPISNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRD 84 (522)
Q Consensus 20 ~~~f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d 84 (522)
..+|.++ ... -..++......+.+.++|+++|+.+. .|..|+|+.. .+.|.|....
T Consensus 3 ~~C~~~~-~~~-~~~g~d~~~~~~~s~~~Cq~~C~~~~--~C~afT~~~~-----~~~C~lk~~~ 58 (73)
T cd01100 3 SSCFRQG-SNV-DFRGGDLSTVFASSAEQCQAACTADP--GCLAFTYNTK-----SKKCFLKSSE 58 (73)
T ss_pred ccccccc-CCC-ccccCCcceeecCCHHHHHHHcCCCC--CceEEEEECC-----CCeEEcccCC
Confidence 4566665 111 13345555667899999999999975 7999999843 7999998875
No 7
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=91.17 E-value=0.35 Score=40.61 Aligned_cols=56 Identities=21% Similarity=0.476 Sum_probs=39.2
Q ss_pred eccCHHHHHHHhhc-CCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceeecCCcceeec
Q psy1623 42 TAASLAACERECAE-SRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEPTGDFDFYEK 107 (522)
Q Consensus 42 ~v~S~~~C~~~C~~-e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~~~~~dYyE~ 107 (522)
...+.++|+++|+. +-++.|+.|.|... ...|.+-..+.. +++. .+ ...+.|.|+|
T Consensus 23 ~~~s~~eC~~~Cl~~~~nCsC~Aya~~~~-----~~gC~~W~~~l~-~d~~---~~-~~~g~~Ly~r 79 (80)
T cd00129 23 KANTADECANRCEKNGLPFSCKAFVFAKA-----RKQCLWFPFNSM-SGVR---KE-FSHGFDLYEN 79 (80)
T ss_pred cccCHHHHHHHHhcCCCCCCceeeeccCC-----CCCeEEecCcch-hhHH---hc-cCCCceeEeE
Confidence 34799999999998 44799999999643 357999988873 2222 12 2346677765
No 8
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=87.98 E-value=0.31 Score=36.07 Aligned_cols=38 Identities=26% Similarity=0.545 Sum_probs=17.7
Q ss_pred eeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEe
Q psy1623 41 TTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCEL 80 (522)
Q Consensus 41 ~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~L 80 (522)
+.+.+.++|+++|..+. .|..|+|.........+.|.|
T Consensus 13 ~~~~s~~~C~~~C~~~~--~C~~~~~~~~~~~~~~~~C~L 50 (51)
T PF14295_consen 13 VTASSPEECQAACAADP--GCQAFTFNPPGCPSSSGRCYL 50 (51)
T ss_dssp -----HHHHHHHHHTST--T--EEEEETTEE---------
T ss_pred ccCCCHHHHHHHccCCC--CCCEEEEECCCcccccccccC
Confidence 47889999999999865 699999986321123677876
No 9
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=86.00 E-value=1.1 Score=37.65 Aligned_cols=47 Identities=21% Similarity=0.386 Sum_probs=36.9
Q ss_pred cccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccC
Q psy1623 33 RRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDR 83 (522)
Q Consensus 33 ~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~ 83 (522)
..|.+-+.+.+.+.++|+++|..+. .|+.|+|+.... ....|.|-+.
T Consensus 11 f~G~Dl~~~~~~~~~~Cq~~Ct~~~--~C~~FTf~~~~~--~~~~C~LK~s 57 (79)
T smart00223 11 FRGSDINTVYVPSAQVCQKRCTSHP--RCLFFTFSTNEP--PEEKCLLKDS 57 (79)
T ss_pred ccCceeeeeecCCHHHHHHhhcCCC--CccEEEeeCCCC--CCCEeEeCcC
Confidence 4466777889999999999999875 799999986421 1238999876
No 10
>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=74.69 E-value=5.7 Score=31.77 Aligned_cols=59 Identities=20% Similarity=0.374 Sum_probs=39.7
Q ss_pred ccccccccccccccccceeeee-eccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCc
Q psy1623 20 LPISNNIGVRMRMRRQFIKKST-TAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDA 85 (522)
Q Consensus 20 ~~~f~~vg~~~r~~~~~~~r~~-~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~ 85 (522)
.+.|-++. +.++ +....... ...++++|+.+|+..= .|..|.|.... +.+.|.|-..+.
T Consensus 4 ~d~F~~l~-~~~~-p~~~~~~~~~~~s~~~C~~~Cl~nC--sC~Ayay~~~~---~~~~C~lW~~~L 63 (66)
T PF08276_consen 4 GDGFLKLP-NMKL-PDFDNAIVDSSVSLEECEKACLSNC--SCTAYAYSNLS---GGGGCLLWYGDL 63 (66)
T ss_pred CCEEEEEC-CeeC-CCCcceeeecCCCHHHHHhhcCCCC--CEeeEEeeccC---CCCEEEEEcCEe
Confidence 44566665 3333 33333332 6799999999999964 79999997432 358999877654
No 11
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=49.36 E-value=18 Score=29.20 Aligned_cols=37 Identities=27% Similarity=0.522 Sum_probs=29.1
Q ss_pred eccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCc
Q psy1623 42 TAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDA 85 (522)
Q Consensus 42 ~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~ 85 (522)
...++++|+++|+.. =.|.+|.|... .+.|.+-..+.
T Consensus 30 ~~~s~~~C~~~Cl~n--CsC~a~~~~~~-----~~~C~~~~~~~ 66 (84)
T cd01098 30 TAISLEECREACLSN--CSCTAYAYNNG-----SGGCLLWNGLL 66 (84)
T ss_pred ccCCHHHHHHHHhcC--CCcceeeecCC-----CCeEEEEecee
Confidence 678999999999985 58999999742 58898854443
No 12
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=36.22 E-value=24 Score=30.01 Aligned_cols=20 Identities=15% Similarity=0.321 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHhhhceee
Q psy1623 502 TLFILLTFLIVVTAFAGFYF 521 (522)
Q Consensus 502 ~l~i~~~~~~~~~~~~~~~~ 521 (522)
+++.+.+++|.+++||.+.|
T Consensus 8 ~iialiv~~iiaIvvW~iv~ 27 (81)
T PF00558_consen 8 AIIALIVALIIAIVVWTIVY 27 (81)
T ss_dssp HHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444555665544
No 13
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=30.98 E-value=16 Score=38.01 Aligned_cols=23 Identities=13% Similarity=0.369 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHhhhceeeC
Q psy1623 499 LLTTLFILLTFLIVVTAFAGFYFK 522 (522)
Q Consensus 499 ~~~~l~i~~~~~~~~~~~~~~~~~ 522 (522)
+|.+++|+++|| ++.++|++|||
T Consensus 149 ~IpaVVI~~iLL-IA~iIa~icyr 171 (290)
T PF05454_consen 149 FIPAVVIAAILL-IAGIIACICYR 171 (290)
T ss_dssp ------------------------
T ss_pred HHHHHHHHHHHH-HHHHHHHHhhh
Confidence 445554444444 56666666765
No 14
>PF10177 DUF2371: Uncharacterised conserved protein (DUF2371); InterPro: IPR018787 This family of proteins with no known function is conserved from nematodes to humans. It includes members of the TMEM200 family of transmembrane proteins.
Probab=28.34 E-value=48 Score=31.07 Aligned_cols=30 Identities=27% Similarity=0.315 Sum_probs=23.1
Q ss_pred cccchhhhHHHHHHHHHHHHHHHHHhhhceeeC
Q psy1623 490 CMSNREYYSLLTTLFILLTFLIVVTAFAGFYFK 522 (522)
Q Consensus 490 C~s~~~~~~~~~~l~i~~~~~~~~~~~~~~~~~ 522 (522)
|.|.+ ++++.+.+++++.=.+.|+.++|.+
T Consensus 35 l~s~S---g~~l~lG~lvllvGiaMAv~GYwp~ 64 (141)
T PF10177_consen 35 LCSPS---GLFLLLGILVLLVGIAMAVLGYWPK 64 (141)
T ss_pred EecHH---HHHHHHHHHHHHHhhHhheeecccc
Confidence 55664 8888888888888888888888753
No 15
>PF14814 UB2H: Bifunctional transglycosylase second domain; PDB: 3FWL_A 3VMA_A.
Probab=27.08 E-value=98 Score=26.02 Aligned_cols=34 Identities=18% Similarity=0.247 Sum_probs=22.9
Q ss_pred eccCCCeEEEEEEeEEcCCCCeEEEEEEEEEccC
Q psy1623 357 RSREGDGLEARFNAFKIPESNFLVFEATVRTCRD 390 (522)
Q Consensus 357 ~~~~~~~l~a~f~AFKFpds~~V~f~C~V~~C~~ 390 (522)
|..+++.+...-.+|.|||...=-....|++=.+
T Consensus 33 y~~~g~~i~i~~R~F~F~Dg~e~~~~~~l~f~~~ 66 (85)
T PF14814_consen 33 YSRSGNRIEIYTRGFDFPDGQEPARRVRLTFSGG 66 (85)
T ss_dssp EEEETTEEEEEE--EEETTCEE--EEEEEEEETT
T ss_pred EEEECCEEEEEECCCCCCCCCccCEEEEEEECCC
Confidence 3446778999999999999988777777776543
No 16
>PF15102 TMEM154: TMEM154 protein family
Probab=25.82 E-value=52 Score=30.96 Aligned_cols=23 Identities=30% Similarity=0.655 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHhhhceeeC
Q psy1623 499 LLTTLFILLTFLIVVTAFAGFYFK 522 (522)
Q Consensus 499 ~~~~l~i~~~~~~~~~~~~~~~~~ 522 (522)
.+|-+++|+||||+++++ +.+||
T Consensus 61 IlIP~VLLvlLLl~vV~l-v~~~k 83 (146)
T PF15102_consen 61 ILIPLVLLVLLLLSVVCL-VIYYK 83 (146)
T ss_pred EeHHHHHHHHHHHHHHHh-eeEEe
Confidence 344455555555555444 44443
No 17
>PF09680 Tiny_TM_bacill: Protein of unknown function (Tiny_TM_bacill); InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=24.24 E-value=44 Score=22.02 Aligned_cols=15 Identities=20% Similarity=0.545 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHhh
Q psy1623 502 TLFILLTFLIVVTAF 516 (522)
Q Consensus 502 ~l~i~~~~~~~~~~~ 516 (522)
+|++++.|||+++-+
T Consensus 7 alivVLFILLiIvG~ 21 (24)
T PF09680_consen 7 ALIVVLFILLIIVGA 21 (24)
T ss_pred hhHHHHHHHHHHhcc
Confidence 455555555555543
No 18
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=22.90 E-value=78 Score=32.83 Aligned_cols=39 Identities=13% Similarity=0.005 Sum_probs=20.2
Q ss_pred CCCCcccccchhhhHHHHHHHHHHHHHHHHHhhhceeeC
Q psy1623 484 AIPDSICMSNREYYSLLTTLFILLTFLIVVTAFAGFYFK 522 (522)
Q Consensus 484 ~~~~~~C~s~~~~~~~~~~l~i~~~~~~~~~~~~~~~~~ 522 (522)
.+....|-++..-..+-|++++++.+|++++++|-++.|
T Consensus 258 Fg~a~~C~~D~~~~~vPIaVG~~La~lvlivLiaYli~R 296 (306)
T PF01299_consen 258 FGTAEECSSDDTSDLVPIAVGAALAGLVLIVLIAYLIGR 296 (306)
T ss_pred CCChhcCCcCCccchHHHHHHHHHHHHHHHHHHhheeEe
Confidence 344567877543233334444445555555555655554
No 19
>PF07502 MANEC: MANEC domain; InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=22.41 E-value=1.2e+02 Score=26.30 Aligned_cols=41 Identities=22% Similarity=0.536 Sum_probs=31.7
Q ss_pred eeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccC
Q psy1623 41 TTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDR 83 (522)
Q Consensus 41 ~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~ 83 (522)
..+.+.++|.++|=.... |-=+-|...+.+.+...|.|.+=
T Consensus 33 ~~v~s~~dC~~aCC~~~~--Cnlav~e~~~~~~~~~~CyLf~C 73 (92)
T PF07502_consen 33 PEVSSAEDCVRACCSTPN--CNLAVFEERPTGSEDNSCYLFNC 73 (92)
T ss_pred cccCCHHHHHHHhCCCCC--CcEEEEeccCCCCCCceEEEEEc
Confidence 479999999999998875 99999987654432233999876
No 20
>COG4064 MtrG Tetrahydromethanopterin S-methyltransferase, subunit G [Coenzyme metabolism]
Probab=22.34 E-value=1.1e+02 Score=25.35 Aligned_cols=23 Identities=17% Similarity=0.377 Sum_probs=16.7
Q ss_pred chhhhHHHHHHHHHHHHHHHHHh
Q psy1623 493 NREYYSLLTTLFILLTFLIVVTA 515 (522)
Q Consensus 493 ~~~~~~~~~~l~i~~~~~~~~~~ 515 (522)
....||++++|+++++..+..++
T Consensus 50 IGILYGlVIGlil~~i~~~l~~~ 72 (75)
T COG4064 50 IGILYGLVIGLILCMIYILLGVA 72 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45679999999877766655544
No 21
>PF07787 DUF1625: Protein of unknown function (DUF1625); InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long.
Probab=21.76 E-value=88 Score=31.46 Aligned_cols=25 Identities=16% Similarity=0.461 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhceeeC
Q psy1623 498 SLLTTLFILLTFLIVVTAFAGFYFK 522 (522)
Q Consensus 498 ~~~~~l~i~~~~~~~~~~~~~~~~~ 522 (522)
..++++++.+.+.++++++|-++||
T Consensus 223 ~~~~~~~~s~~lsl~~Ia~aW~~yR 247 (248)
T PF07787_consen 223 LFLVAFIISFSLSLLTIALAWLFYR 247 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHhheeeC
Confidence 3456677888888889999999998
Done!