Query psy10620
Match_columns 589
No_of_seqs 334 out of 709
Neff 7.5
Searched_HMMs 46136
Date Fri Aug 16 22:59:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10620.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10620hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00241 ZP Zona pellucida ( 100.0 5.1E-32 1.1E-36 274.1 25.6 224 254-480 1-251 (253)
2 PF00100 Zona_pellucida: Zona 99.9 2.5E-26 5.4E-31 232.3 21.1 224 254-480 1-263 (265)
3 cd01099 PAN_AP_HGF Subfamily o 99.5 1.4E-14 2.9E-19 120.4 8.2 79 14-94 2-80 (80)
4 PF00024 PAN_1: PAN domain Thi 99.3 7.4E-12 1.6E-16 103.0 6.1 78 16-95 2-79 (79)
5 cd01099 PAN_AP_HGF Subfamily o 98.9 2.5E-09 5.5E-14 88.7 6.3 66 126-207 14-79 (80)
6 smart00473 PAN_AP divergent su 98.5 2.4E-07 5.2E-12 75.5 6.8 75 16-94 4-78 (78)
7 PF00024 PAN_1: PAN domain Thi 98.4 4.1E-07 8.9E-12 74.5 4.8 68 126-209 12-79 (79)
8 smart00473 PAN_AP divergent su 97.1 0.0011 2.4E-08 53.7 5.4 36 134-174 22-57 (78)
9 cd01100 APPLE_Factor_XI_like S 95.4 0.02 4.3E-07 46.4 4.0 54 17-72 5-58 (73)
10 PF08276 PAN_2: PAN-like domai 92.8 0.29 6.2E-06 38.8 5.6 57 12-70 1-60 (66)
11 PF14295 PAN_4: PAN domain; PD 92.6 0.09 1.9E-06 38.9 2.3 33 34-68 13-50 (51)
12 smart00223 APPLE APPLE domain. 92.1 0.2 4.3E-06 41.4 3.9 49 21-71 6-57 (79)
13 cd01098 PAN_AP_plant Plant PAN 91.5 0.34 7.4E-06 39.6 4.8 59 10-71 3-64 (84)
14 cd00129 PAN_APPLE PAN/APPLE-li 85.4 1.8 4E-05 35.8 5.1 38 36-73 24-62 (80)
15 cd00129 PAN_APPLE PAN/APPLE-li 81.6 2.1 4.6E-05 35.4 3.9 38 136-177 24-62 (80)
16 PF14295 PAN_4: PAN domain; PD 78.7 1.7 3.8E-05 31.8 2.4 36 135-172 14-50 (51)
17 cd01100 APPLE_Factor_XI_like S 78.3 3.1 6.6E-05 33.5 3.9 36 135-176 23-58 (73)
18 PF01102 Glycophorin_A: Glycop 78.1 2.9 6.3E-05 37.4 4.0 26 555-580 65-90 (122)
19 smart00223 APPLE APPLE domain. 75.1 4 8.7E-05 33.7 3.8 39 134-175 19-57 (79)
20 PF07213 DAP10: DAP10 membrane 71.8 6.5 0.00014 32.2 4.1 33 555-587 35-67 (79)
21 PF05393 Hum_adeno_E3A: Human 71.7 5.4 0.00012 33.2 3.6 26 558-583 37-62 (94)
22 PF08277 PAN_3: PAN-like domai 67.7 10 0.00022 30.0 4.5 39 32-74 15-53 (71)
23 PF02439 Adeno_E3_CR2: Adenovi 64.8 13 0.00028 26.1 3.8 20 555-574 5-24 (38)
24 PF14575 EphA2_TM: Ephrin type 63.0 7.1 0.00015 31.9 2.8 19 556-574 3-21 (75)
25 PF06365 CD34_antigen: CD34/Po 61.4 5.1 0.00011 39.0 1.9 20 554-573 101-120 (202)
26 PF08276 PAN_2: PAN-like domai 58.7 15 0.00033 28.9 3.9 38 135-176 25-62 (66)
27 PF05454 DAG1: Dystroglycan (D 56.1 3.7 8.1E-05 42.3 0.0 25 558-583 152-176 (290)
28 PF03988 DUF347: Repeat of Unk 53.4 17 0.00036 27.8 3.2 29 556-584 27-55 (55)
29 PF01299 Lamp: Lysosome-associ 51.9 13 0.00027 38.9 3.1 36 546-581 261-298 (306)
30 PF14991 MLANA: Protein melan- 50.4 2.9 6.2E-05 36.5 -1.6 26 554-581 25-50 (118)
31 KOG1094|consensus 49.5 16 0.00035 41.1 3.5 40 547-586 384-423 (807)
32 TIGR01478 STEVOR variant surfa 47.7 22 0.00048 36.3 3.9 26 559-584 262-289 (295)
33 PF04689 S1FA: DNA binding pro 44.8 18 0.00038 28.4 2.1 37 550-586 10-46 (69)
34 PF15102 TMEM154: TMEM154 prot 44.6 5.7 0.00012 36.5 -0.7 6 579-584 83-88 (146)
35 cd01098 PAN_AP_plant Plant PAN 44.3 26 0.00057 28.2 3.3 35 135-175 30-64 (84)
36 PTZ00370 STEVOR; Provisional 43.0 25 0.00053 36.1 3.4 24 558-581 257-282 (296)
37 PF02439 Adeno_E3_CR2: Adenovi 42.1 56 0.0012 23.0 4.0 30 554-583 7-36 (38)
38 PRK00523 hypothetical protein; 39.1 36 0.00078 27.5 3.1 28 556-583 4-31 (72)
39 PF11346 DUF3149: Protein of u 37.8 62 0.0014 23.4 3.9 31 555-585 10-40 (42)
40 COG3763 Uncharacterized protei 35.3 43 0.00092 26.9 2.9 26 558-583 5-30 (71)
41 PF15176 LRR19-TM: Leucine-ric 35.1 63 0.0014 27.8 4.1 33 554-586 19-51 (102)
42 PF01102 Glycophorin_A: Glycop 34.2 54 0.0012 29.5 3.8 29 554-582 67-95 (122)
43 PF10577 UPF0560: Uncharacteri 33.5 48 0.001 38.8 4.2 33 555-587 273-307 (807)
44 PHA03283 envelope glycoprotein 33.1 36 0.00078 37.7 3.0 33 556-588 403-436 (542)
45 PF07502 MANEC: MANEC domain; 33.1 57 0.0012 27.7 3.6 43 17-61 8-58 (92)
46 PF09680 Tiny_TM_bacill: Prote 32.7 25 0.00054 22.0 1.0 15 559-573 7-21 (24)
47 PF05545 FixQ: Cbb3-type cytoc 30.9 55 0.0012 24.1 2.8 19 562-580 14-32 (49)
48 KOG3441|consensus 30.9 2E+02 0.0043 25.7 6.6 65 367-437 55-123 (149)
49 PF05808 Podoplanin: Podoplani 30.1 17 0.00037 34.0 0.0 28 555-582 130-157 (162)
50 PF02480 Herpes_gE: Alphaherpe 30.1 17 0.00037 39.9 0.0 15 251-265 40-54 (439)
51 PRK01844 hypothetical protein; 30.0 55 0.0012 26.5 2.8 21 563-583 10-30 (72)
52 PF06596 PsbX: Photosystem II 29.8 1.2E+02 0.0025 21.6 4.0 24 554-577 7-30 (39)
53 PF02038 ATP1G1_PLM_MAT8: ATP1 29.7 81 0.0018 23.6 3.4 20 555-574 16-35 (50)
54 PF13908 Shisa: Wnt and FGF in 29.4 24 0.00051 33.7 0.8 14 467-480 26-39 (179)
55 smart00765 MANEC The MANEC dom 29.0 70 0.0015 27.3 3.5 55 11-69 5-72 (93)
56 PTZ00382 Variant-specific surf 28.2 16 0.00036 31.2 -0.4 17 563-579 75-91 (96)
57 PF12191 stn_TNFRSF12A: Tumour 27.9 20 0.00043 32.1 0.0 14 558-571 83-96 (129)
58 COG3190 FliO Flagellar biogene 27.1 71 0.0015 29.2 3.4 24 560-583 27-50 (137)
59 PF15102 TMEM154: TMEM154 prot 25.8 37 0.00081 31.3 1.4 10 575-584 82-91 (146)
60 PF07204 Orthoreo_P10: Orthore 25.3 44 0.00096 28.3 1.6 21 561-581 48-68 (98)
61 PRK11486 flagellar biosynthesi 25.0 75 0.0016 28.6 3.1 20 561-581 23-42 (124)
62 PTZ00208 65 kDa invariant surf 25.0 61 0.0013 34.8 2.9 31 555-585 387-417 (436)
63 PF06809 NPDC1: Neural prolife 24.7 35 0.00075 35.4 1.0 25 559-583 204-228 (341)
64 TIGR00847 ccoS cytochrome oxid 24.1 1.2E+02 0.0027 22.8 3.6 25 555-579 3-27 (51)
65 PF00558 Vpu: Vpu protein; In 23.5 95 0.0021 25.8 3.2 12 555-566 6-17 (81)
66 PF03597 CcoS: Cytochrome oxid 23.2 1.1E+02 0.0024 22.4 3.1 24 556-579 3-26 (45)
67 PF12768 Rax2: Cortical protei 23.0 55 0.0012 33.8 2.2 18 563-580 240-257 (281)
68 PF01034 Syndecan: Syndecan do 22.0 29 0.00063 27.3 -0.1 16 555-570 14-29 (64)
69 PF01034 Syndecan: Syndecan do 21.3 32 0.0007 27.1 0.1 21 554-574 9-29 (64)
70 PF05624 LSR: Lipolysis stimul 21.0 67 0.0014 23.5 1.6 16 557-572 6-21 (49)
71 PF05506 DUF756: Domain of unk 20.7 4.8E+02 0.01 21.4 8.1 41 368-408 32-77 (89)
72 PF11884 DUF3404: Domain of un 20.4 84 0.0018 32.0 2.8 13 384-396 80-92 (262)
73 PF05337 CSF-1: Macrophage col 20.2 34 0.00075 34.8 0.0 28 554-581 226-253 (285)
74 PF10873 DUF2668: Protein of u 20.2 50 0.0011 30.3 1.0 31 555-585 62-95 (155)
75 PF05399 EVI2A: Ectropic viral 20.0 1E+02 0.0022 30.3 3.1 36 554-589 130-168 (227)
No 1
>smart00241 ZP Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
Probab=100.00 E-value=5.1e-32 Score=274.12 Aligned_cols=224 Identities=22% Similarity=0.488 Sum_probs=173.1
Q ss_pred EecCCeEEEEEEeCCCCcceEeecCC---CCCCceeecCCceEEEEEEecCCCCCccee---cc-EEEEEEEEe-ecCee
Q psy10620 254 HCKDTRIAVQVRTNKPFNGRIYALGR---SETCNIDVLNSDLFRLDLTMSGQDCNTQSV---TG-VFSNTVVLQ-HHSVV 325 (589)
Q Consensus 254 ~C~~~~m~V~v~t~~~F~G~iy~kg~---~~~C~~~~~~~~~~~l~l~~~~~~Cg~~~~---~g-~ysn~Vvvq-~h~~i 325 (589)
+|.++.|.|.|++..+|.|+||+++. .+.|++.....+...+.|.+++.+||++.. ++ +|+|+|+++ +|+.+
T Consensus 1 ~C~~~~m~v~v~~~~~~~g~i~~~~l~l~d~~C~~~~~~~~~~~~~f~~~l~~CGt~~~~~~~~~~ysn~v~~~~~~~~~ 80 (253)
T smart00241 1 QCGEDQMVVSVSTDLLFPGGIYVKGLYLGDPSCRPVFTDSTSAFVSFEVPLNGCGTRRQVNPDGIVYSNTLVVSPFHPGF 80 (253)
T ss_pred CCCCCEEEEEEEecCCCCCeEEEeeEEeCCCCCCCccccCCCcEEEEEeccccCCCeEEECCCeEEEEEEEEEccCCCCc
Confidence 59999999999999999999999985 369999632222335677778889999753 33 799999999 79999
Q ss_pred eeecCe-EEEEEeEecCCCceeeecceecCCCceeEEe-cCCCCCCcEEEEecC--CCCeeeee---ecCCEEEEEEEEc
Q psy10620 326 MTKADK-IYKVKCTYDMSSKNITFGMMPIRDPEMISIT-SAPEAPPPRIRILDT--KSREVETV---RIGDKLTFRIEIP 398 (589)
Q Consensus 326 ~T~~D~-~~~v~C~y~~~~~~Vs~~~~p~~~~~~~~i~-~~~~~p~~~m~i~~~--~g~~v~~v---~vGd~l~l~~~~~ 398 (589)
+|..|+ .|+++|.|....+ ++..+ .+.......+. ...+.+.++|+|+.. .|+++..+ ++||+|+|+|+++
T Consensus 81 itr~~~~~~~~~C~y~~~~~-~~~~~-~~~~~~~~~~~~~~~g~~~~~m~l~~~~~~~~~~~~~~~~~lg~~l~~~~~~~ 158 (253)
T smart00241 81 ITRDDRAAYHFQCFYPENEK-VSLNL-DVSTIPPTELSSVSEGPPTCSYRLYKDDSFGSPYQSADYPVLGDPVYHEWSCD 158 (253)
T ss_pred eEecCceEEEEEEEEeCCCc-eEEEE-EecCCCCCCcccccCCCcEEEEEeccCCCCCCcccCCCCcccCCeEEEEEEEc
Confidence 999998 9999999997544 43221 11110001111 123457899999853 56776433 7999999999998
Q ss_pred C--CCcccEEEeEEEEEcCCC---CceEEEEeCCCCCCCCCCCCccccc---CceeEEEEEEEEEcccceEEEEEEEEEe
Q psy10620 399 E--ETPYGIFARSCVAMAKDS---KSTFQIIDDEGCPVDPNIFPSFTPD---GNALQSVYEAFRFTESYGVIFQCNVKYC 470 (589)
Q Consensus 399 ~--~~~~~i~V~~C~A~d~~~---~~~~~LID~~GCp~D~~i~~~~~~~---~~~l~~~f~AFKFp~s~~V~f~C~V~vC 470 (589)
+ ++.|+|+|++|||+++.+ ..++.||| +|||+|+.+++...+. +..+++.|+||||++++.|||||+|++|
T Consensus 159 ~~~~~~~~l~v~~C~at~~~~~~~~~~~~lI~-~GC~~d~~~~~~~~~~~~~~~~~~f~~~aF~F~~~~~v~~hC~v~vC 237 (253)
T smart00241 159 GADDPPLGLLVDNCYATPGSDPSSGPKYFIID-NGCPVDGYLDSTIPYNSSPNHYARFSVKVFKFADRSLVYFHCQIRLC 237 (253)
T ss_pred cCCCCCeEEEEeeEEEcCCCCCCCCCcEEEEE-CccCCCCccccceecCCCCcceEEEEEEEEEecCCCcEEEEEEEEEE
Confidence 5 679999999999998763 57999998 9999999887665443 3567899999999999999999999999
Q ss_pred cC----CCCCCCCC
Q psy10620 471 LG----PCEPAVCE 480 (589)
Q Consensus 471 ~~----~C~p~~C~ 480 (589)
.+ .|..+.|.
T Consensus 238 ~~~~~~~C~~~~C~ 251 (253)
T smart00241 238 DKSDGSSCDGPACS 251 (253)
T ss_pred CCCCCCCCCCCcCC
Confidence 85 57666775
No 2
>PF00100 Zona_pellucida: Zona pellucida-like domain; InterPro: IPR001507 A large domain, containing around 260 amino acids, has been recognised in a variety of receptor-like eukaryotic glycoproteins []. All of these proteins are mosaic proteins composed of various domains and that all have a large extracellular region followed by either a transmembrane region and a very short cytoplasmic region or by a GPI-anchor. The domain common to all these proteins is located in the C-terminal portion of the extracellular region, and contains 8 conserved Cys residues, which are probably involved in disulphide bond formation. This entry includes a domain found in the zona pellucida proteins. The zona pellucida is a specialised extracellular matrix of mammailian eggs, which plays a crucial role in fertilisation by directly mediating species-restricted recongition between gametes. In mammals, up to four zona pellucida subnits are known, named ZP1-4. The full-length crystal structure of murine ZP3 has been solved []. This entry is also found in other glycoproteins, such as CD105 (also called endoglin) which is the regulatory component of the TGF-beta receptor complex. It is a modulator of cellular responses to TGF-beta 1. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 3D4G_G 3EF7_A 3D4C_A 3NK4_B 3NK3_B 3QW9_A.
Probab=99.95 E-value=2.5e-26 Score=232.25 Aligned_cols=224 Identities=26% Similarity=0.566 Sum_probs=145.9
Q ss_pred EecCCeEEEEEEeCCCCcceEeecC-C--CCCCce-eecCCceEEEEEEecCCC--CCcce-ec--c--EEEEEEEEeec
Q psy10620 254 HCKDTRIAVQVRTNKPFNGRIYALG-R--SETCNI-DVLNSDLFRLDLTMSGQD--CNTQS-VT--G--VFSNTVVLQHH 322 (589)
Q Consensus 254 ~C~~~~m~V~v~t~~~F~G~iy~kg-~--~~~C~~-~~~~~~~~~l~l~~~~~~--Cg~~~-~~--g--~ysn~Vvvq~h 322 (589)
+|+++.|.|.|++..+|.|.||+.. + .+.|++ ...+...+. |.+++.+ ||+.. .. + +|+|+|+++.+
T Consensus 1 ~C~~~~~~v~i~~~~~~~~~i~~~~~~~~~~~C~~~~~~~~~~~~--~~~~~~~~~Cg~~~~~~~~~~~~y~n~i~~~~~ 78 (265)
T PF00100_consen 1 TCGSDSMTVTIKKDYLFTGNIYVSDLHLNDPSCRARSDTNSTHIT--FSIPFNSPSCGTTRSIENGDSIVYSNTIVSSQD 78 (265)
T ss_dssp EE-SSEEEEEEESSTTSSS-B-GGGEEETTTTB--EEECTTTEEE--EEEECCC--CTSEEEE--SSEEEEEEEEEEE--
T ss_pred CcCCCEEEEEEEeccccCCCEEeeeeEECCCCCCCcccCCCcEEE--EEEecCCCCCcccccccccCcceeeeeeeeecc
Confidence 6999999999999999999988743 3 368999 444445444 4445566 99973 22 2 69999999865
Q ss_pred C--------eeeeecCeEEEEEeEecCCCceeeecceecCCCceeEEecCCCCCCcEEEEecCC---C---CeeeeeecC
Q psy10620 323 S--------VVMTKADKIYKVKCTYDMSSKNITFGMMPIRDPEMISITSAPEAPPPRIRILDTK---S---REVETVRIG 388 (589)
Q Consensus 323 ~--------~i~T~~D~~~~v~C~y~~~~~~Vs~~~~p~~~~~~~~i~~~~~~p~~~m~i~~~~---g---~~v~~v~vG 388 (589)
. .+++..|..|.++|.|.....+++..................+...++|++++.. + .+...+.+|
T Consensus 79 ~~~~~~~~~~i~~~~~~~~~~~C~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~l~~~~~~~~~~~~~~~~~~vg 158 (265)
T PF00100_consen 79 VSFNDSPGGNITRDSDRRFNFTCTYSSSQYSVSSGSPNISSVTIHISQESSGSFSFSMRLYKDDSFSSPYSQPPATVEVG 158 (265)
T ss_dssp SSSS---STTECSSS-EEEEEEEEE-EC------------B-TTT---STT--EEEEEEEESTTTSSCB---SSEEE-TT
T ss_pred ccccccccCceEEecccccccEEEECCccceeccccccceeccccceeccCCccceEEEeccCccccccccCCceeEEee
Confidence 3 5677889999999999333334432211111111111111223345789998642 1 335678999
Q ss_pred CEEEEEEEEcCCC--cccEEEeEEEEEcC---CCCceEEEEeCCCCCCCCCCCCccc-----ccCceeEEEEEEEEEccc
Q psy10620 389 DKLTFRIEIPEET--PYGIFARSCVAMAK---DSKSTFQIIDDEGCPVDPNIFPSFT-----PDGNALQSVYEAFRFTES 458 (589)
Q Consensus 389 d~l~l~~~~~~~~--~~~i~V~~C~A~d~---~~~~~~~LID~~GCp~D~~i~~~~~-----~~~~~l~~~f~AFKFp~s 458 (589)
|+|+++|++..+. ++.++|++|||++. .+...+.|||++|||+|...++.+. .+++.+++.|+||||+++
T Consensus 159 ~~v~~~v~~~~~~~~~~~~~v~~C~a~~~~~~~~~~~~~lId~~GC~~~~~~~~~~~~~~~~~~~~~~~f~f~aF~f~~~ 238 (265)
T PF00100_consen 159 DRVYVEVSVSPDSSDPLGLFVDSCWATPSSDPDSSPSYTLIDENGCPVDSDDFVTFEYSGPNGDSSRARFSFRAFRFPDS 238 (265)
T ss_dssp -EEEEEEEEEECCC-TEEEEEEEEEEESSSSTTSSSCEEEEBCTTEEGGGCTTTEEEEETTTCCCCEEEEEEE-EEETSS
T ss_pred eeEEEEEEeccccceeeEEEeCEEEEeCCCCcccccccceeccccccccccceeEEEeccccccccEEEEEeeEEEecCC
Confidence 9999999986432 27799999999987 4567999999999999988877664 344568899999999999
Q ss_pred ceEEEEEEEEEecCC----CCCCCCC
Q psy10620 459 YGVIFQCNVKYCLGP----CEPAVCE 480 (589)
Q Consensus 459 ~~V~f~C~V~vC~~~----C~p~~C~ 480 (589)
..|||||+|++|... |++ .|+
T Consensus 239 ~~v~i~C~v~~C~~~~~~~C~~-~C~ 263 (265)
T PF00100_consen 239 SSVYIHCSVRVCDKSDGSSCPP-NCS 263 (265)
T ss_dssp SEEEEEEEEEEEETTSSCTTBB--EE
T ss_pred CeEEEEEEEEEECCCCCCcCCC-CCC
Confidence 999999999999976 554 554
No 3
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.55 E-value=1.4e-14 Score=120.40 Aligned_cols=79 Identities=39% Similarity=0.678 Sum_probs=71.0
Q ss_pred CCeeEEEecCeEEecccceEEecCCHHHHHHHhhcCCCceeeeEEeEeccCcEEecCCCccccCcccccccCcCcceeec
Q psy10620 14 RPWAFERVPNKMIRGLDNALIYTSTKEACLAACLNEHRFTCRSVEYNYVTLQCHLSDSDRRTTGQYVQFVDAQGVDYFEN 93 (589)
Q Consensus 14 ~~~~Fer~~~~~L~g~~~~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~~C~Ls~~~~~~~p~~~~~~~~~~~~Y~en 93 (589)
..|.|+++++++|+|+++.++.+.|+++|+.+|+.+.+|.|||++|++.+++|+||++++.+.+.. ...+.++|||||
T Consensus 2 ~~~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~~~~~C~L~~~~~~~~~~~--~~~~~~~dyyE~ 79 (80)
T cd01099 2 NDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKSKECILSDEDRMSSGVK--LLYDSNVDYYEN 79 (80)
T ss_pred CcccEEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEcCCCEEEEeCCCccccccc--cccccCceeeec
Confidence 468999999999999999999999999999999999999999999999999999999999876654 233458999998
Q ss_pred c
Q psy10620 94 L 94 (589)
Q Consensus 94 ~ 94 (589)
.
T Consensus 80 ~ 80 (80)
T cd01099 80 K 80 (80)
T ss_pred C
Confidence 4
No 4
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=99.26 E-value=7.4e-12 Score=102.99 Aligned_cols=78 Identities=37% Similarity=0.692 Sum_probs=67.6
Q ss_pred eeEEEecCeEEecccceEEecCCHHHHHHHhhcCCCceeeeEEeEeccCcEEecCCCccccCcccccccCcCcceeeccc
Q psy10620 16 WAFERVPNKMIRGLDNALIYTSTKEACLAACLNEHRFTCRSVEYNYVTLQCHLSDSDRRTTGQYVQFVDAQGVDYFENLC 95 (589)
Q Consensus 16 ~~Fer~~~~~L~g~~~~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~~C~Ls~~~~~~~p~~~~~~~~~~~~Y~en~C 95 (589)
|.|++++|+.|.|+....+.+.++++|+.+|+.+.. .|+|++|++.++.|.|++.++.+.+..+... ++.++||||.|
T Consensus 2 ~~f~~~~~~~l~~~~~~~~~v~s~~~C~~~C~~~~~-~C~s~~y~~~~~~C~L~~~~~~~~~~~~~~~-~~~~~~ye~~C 79 (79)
T PF00024_consen 2 WAFERIPGYRLSGHSIKEINVPSLEECAQLCLNEPR-RCKSFNYDPSSKTCYLSSSDRSSLPPRLTPS-SPNVDYYEKSC 79 (79)
T ss_dssp -TEEEEEEEEEESCEEEEEEESSHHHHHHHHHHSTT--ESEEEEETTTTEEEEECSSSSSESTEEEEE-EEEEEEEEEGC
T ss_pred CCeEEECCEEEeCCcceEEcCCCHHHHHhhcCcCcc-cCCeEEEECCCCEEEEcCCCCCcccceeEec-cCcEEEEEecC
Confidence 889999999999998888888899999999999995 6999999999999999999999954433322 78999999988
No 5
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=98.90 E-value=2.5e-09 Score=88.75 Aligned_cols=66 Identities=33% Similarity=0.460 Sum_probs=52.3
Q ss_pred cceeeeeeeeeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEeccCCCcCCCCCCccccCCCcccccCccccccc
Q psy10620 126 LHYYVDKELQVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFHLDHKTLPDGPSTYLNAERPLIDDGQRIGSYY 205 (589)
Q Consensus 126 l~~~~d~~~~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~~d~~tl~~~~~~~~~~~~~l~~~~~~~~~y~ 205 (589)
|.+++...+.+.|+++|+++|+.+.+|.||||+|+..+ ++|+|+++++.+.+. . ++. +.++|||
T Consensus 14 l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~~~----~~C~L~~~~~~~~~~---~-------~~~--~~~~dyy 77 (80)
T cd01099 14 LVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKS----KECILSDEDRMSSGV---K-------LLY--DSNVDYY 77 (80)
T ss_pred ecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEcCC----CEEEEeCCCcccccc---c-------ccc--ccCceee
Confidence 34567777778999999999998889999999999875 799999999976542 1 111 1258999
Q ss_pred cc
Q psy10620 206 EN 207 (589)
Q Consensus 206 E~ 207 (589)
||
T Consensus 78 E~ 79 (80)
T cd01099 78 EN 79 (80)
T ss_pred ec
Confidence 97
No 6
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=98.52 E-value=2.4e-07 Score=75.54 Aligned_cols=75 Identities=31% Similarity=0.579 Sum_probs=61.3
Q ss_pred eeEEEecCeEEecccceEEecCCHHHHHHHhhcCCCceeeeEEeEeccCcEEecCCCccccCcccccccCcCcceeecc
Q psy10620 16 WAFERVPNKMIRGLDNALIYTSTKEACLAACLNEHRFTCRSVEYNYVTLQCHLSDSDRRTTGQYVQFVDAQGVDYFENL 94 (589)
Q Consensus 16 ~~Fer~~~~~L~g~~~~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~~C~Ls~~~~~~~p~~~~~~~~~~~~Y~en~ 94 (589)
-.|.++++..|.+++.....+.++++|..+|+.+ ++.|+|++|++.++.|+|+..+....+. ...+.+.+|||+.
T Consensus 4 ~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~-~~~C~s~~y~~~~~~C~l~~~~~~~~~~---~~~~~~~~~y~~~ 78 (78)
T smart00473 4 DCFVRLPNTKLPGFSRIVISVASLEECASKCLNS-NCSCRSFTYNNGTKGCLLWSESSLGDAR---LFPSGGVDLYEKI 78 (78)
T ss_pred ceeEEecCccCCCCcceeEcCCCHHHHHHHhCCC-CCceEEEEEcCCCCEEEEeeCCccccce---ecccCCceeEEeC
Confidence 4699999999998777667889999999999996 5699999998878999999955544443 2456688999974
No 7
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=98.38 E-value=4.1e-07 Score=74.50 Aligned_cols=68 Identities=31% Similarity=0.485 Sum_probs=50.5
Q ss_pred cceeeeeeeeeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEeccCCCcCCCCCCccccCCCcccccCccccccc
Q psy10620 126 LHYYVDKELQVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFHLDHKTLPDGPSTYLNAERPLIDDGQRIGSYY 205 (589)
Q Consensus 126 l~~~~d~~~~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~~d~~tl~~~~~~~~~~~~~l~~~~~~~~~y~ 205 (589)
|.+.....+.+.++++|+++|..+.. .|+||+|+... +.|.|++.++.+.+. . |... .+..+||
T Consensus 12 l~~~~~~~~~v~s~~~C~~~C~~~~~-~C~s~~y~~~~----~~C~L~~~~~~~~~~---~-------~~~~-~~~~~~y 75 (79)
T PF00024_consen 12 LSGHSIKEINVPSLEECAQLCLNEPR-RCKSFNYDPSS----KTCYLSSSDRSSLPP---R-------LTPS-SPNVDYY 75 (79)
T ss_dssp EESCEEEEEEESSHHHHHHHHHHSTT--ESEEEEETTT----TEEEEECSSSSSEST---E-------EEEE-EEEEEEE
T ss_pred EeCCcceEEcCCCHHHHHhhcCcCcc-cCCeEEEECCC----CEEEEcCCCCCcccc---e-------eEec-cCcEEEE
Confidence 33344455688899999999987776 89999999875 799999999888531 1 2221 3568999
Q ss_pred cccc
Q psy10620 206 ENYC 209 (589)
Q Consensus 206 E~~C 209 (589)
||.|
T Consensus 76 e~~C 79 (79)
T PF00024_consen 76 EKSC 79 (79)
T ss_dssp EEGC
T ss_pred EecC
Confidence 9987
No 8
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=97.06 E-value=0.0011 Score=53.66 Aligned_cols=36 Identities=28% Similarity=0.600 Sum_probs=30.5
Q ss_pred eeeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEec
Q psy10620 134 LQVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFH 174 (589)
Q Consensus 134 ~~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~ 174 (589)
..+.++++|+++|+.+ ++.|+||+|+..+ +.|+|++
T Consensus 22 ~~~~s~~~C~~~C~~~-~~~C~s~~y~~~~----~~C~l~~ 57 (78)
T smart00473 22 ISVASLEECASKCLNS-NCSCRSFTYNNGT----KGCLLWS 57 (78)
T ss_pred EcCCCHHHHHHHhCCC-CCceEEEEEcCCC----CEEEEee
Confidence 4578999999999754 7999999999732 6999999
No 9
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=95.40 E-value=0.02 Score=46.38 Aligned_cols=54 Identities=31% Similarity=0.592 Sum_probs=47.4
Q ss_pred eEEEecCeEEecccceEEecCCHHHHHHHhhcCCCceeeeEEeEeccCcEEecCCC
Q psy10620 17 AFERVPNKMIRGLDNALIYTSTKEACLAACLNEHRFTCRSVEYNYVTLQCHLSDSD 72 (589)
Q Consensus 17 ~Fer~~~~~L~g~~~~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~~C~Ls~~~ 72 (589)
+|...++..+.|.+...+.+.+.++|..+|+.+. .|..+.|+...+.|.|-...
T Consensus 5 C~~~~~~~~~~g~d~~~~~~~s~~~Cq~~C~~~~--~C~afT~~~~~~~C~lk~~~ 58 (73)
T cd01100 5 CFRQGSNVDFRGGDLSTVFASSAEQCQAACTADP--GCLAFTYNTKSKKCFLKSSE 58 (73)
T ss_pred cccccCCCccccCCcceeecCCHHHHHHHcCCCC--CceEEEEECCCCeEEcccCC
Confidence 5666788889998887777899999999999999 99999999889999997654
No 10
>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=92.79 E-value=0.29 Score=38.82 Aligned_cols=57 Identities=28% Similarity=0.497 Sum_probs=46.1
Q ss_pred CCCCeeEEEecCeEEecccceEE-ecCCHHHHHHHhhcCCCceeeeEEeEe--ccCcEEecC
Q psy10620 12 CLRPWAFERVPNKMIRGLDNALI-YTSTKEACLAACLNEHRFTCRSVEYNY--VTLQCHLSD 70 (589)
Q Consensus 12 C~~~~~Fer~~~~~L~g~~~~~~-~~~s~~~C~~~Cl~~~~f~CrS~~y~~--~~~~C~Ls~ 70 (589)
|+..-.|-++++..|-+.+...+ .+.++++|..+|+++= .|.++.|.. ..+.|.|-.
T Consensus 1 C~~~d~F~~l~~~~~p~~~~~~~~~~~s~~~C~~~Cl~nC--sC~Ayay~~~~~~~~C~lW~ 60 (66)
T PF08276_consen 1 CGSGDGFLKLPNMKLPDFDNAIVDSSVSLEECEKACLSNC--SCTAYAYSNLSGGGGCLLWY 60 (66)
T ss_pred CcCCCEEEEECCeeCCCCcceeeecCCCHHHHHhhcCCCC--CEeeEEeeccCCCCEEEEEc
Confidence 43334799999999987766665 6799999999999998 899999975 568998754
No 11
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=92.62 E-value=0.09 Score=38.85 Aligned_cols=33 Identities=30% Similarity=0.652 Sum_probs=18.0
Q ss_pred EecCCHHHHHHHhhcCCCceeeeEEeEe-----ccCcEEe
Q psy10620 34 IYTSTKEACLAACLNEHRFTCRSVEYNY-----VTLQCHL 68 (589)
Q Consensus 34 ~~~~s~~~C~~~Cl~~~~f~CrS~~y~~-----~~~~C~L 68 (589)
+.+.+.++|+++|..++ .|..+.|.. ..+.|.|
T Consensus 13 ~~~~s~~~C~~~C~~~~--~C~~~~~~~~~~~~~~~~C~L 50 (51)
T PF14295_consen 13 VTASSPEECQAACAADP--GCQAFTFNPPGCPSSSGRCYL 50 (51)
T ss_dssp -----HHHHHHHHHTST--T--EEEEETTEE---------
T ss_pred ccCCCHHHHHHHccCCC--CCCEEEEECCCcccccccccC
Confidence 47889999999999998 899999987 5677776
No 12
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=92.12 E-value=0.2 Score=41.39 Aligned_cols=49 Identities=24% Similarity=0.472 Sum_probs=44.3
Q ss_pred ecCeEEecccceEEecCCHHHHHHHhhcCCCceeeeEEeEeccC---cEEecCC
Q psy10620 21 VPNKMIRGLDNALIYTSTKEACLAACLNEHRFTCRSVEYNYVTL---QCHLSDS 71 (589)
Q Consensus 21 ~~~~~L~g~~~~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~---~C~Ls~~ 71 (589)
.++..+.|.+...+.+.+.++|...|...+ .|+.+.|+.... .|.|-..
T Consensus 6 ~~~~df~G~Dl~~~~~~~~~~Cq~~Ct~~~--~C~~FTf~~~~~~~~~C~LK~s 57 (79)
T smart00223 6 YKNVDFRGSDINTVYVPSAQVCQKRCTSHP--RCLFFTFSTNEPPEEKCLLKDS 57 (79)
T ss_pred ccCccccCceeeeeecCCHHHHHHhhcCCC--CccEEEeeCCCCCCCEeEeCcC
Confidence 567888999999999999999999999998 999999998877 9999755
No 13
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=91.48 E-value=0.34 Score=39.64 Aligned_cols=59 Identities=24% Similarity=0.408 Sum_probs=46.9
Q ss_pred CCCCCC---eeEEEecCeEEecccceEEecCCHHHHHHHhhcCCCceeeeEEeEeccCcEEecCC
Q psy10620 10 NVCLRP---WAFERVPNKMIRGLDNALIYTSTKEACLAACLNEHRFTCRSVEYNYVTLQCHLSDS 71 (589)
Q Consensus 10 ~~C~~~---~~Fer~~~~~L~g~~~~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~~C~Ls~~ 71 (589)
..|++. ..|..+++..+-+..... ...++++|.++|+++= .|..+.|....+.|.+-..
T Consensus 3 ~~C~~~~~~~~f~~~~~~~~~~~~~~~-~~~s~~~C~~~Cl~nC--sC~a~~~~~~~~~C~~~~~ 64 (84)
T cd01098 3 LNCGGDGSTDGFLKLPDVKLPDNASAI-TAISLEECREACLSNC--SCTAYAYNNGSGGCLLWNG 64 (84)
T ss_pred cccCCCCCCCEEEEeCCeeCCCchhhh-ccCCHHHHHHHHhcCC--CcceeeecCCCCeEEEEec
Confidence 457543 689999988887765444 7789999999999987 9999999766899987533
No 14
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=85.41 E-value=1.8 Score=35.76 Aligned_cols=38 Identities=18% Similarity=0.452 Sum_probs=31.5
Q ss_pred cCCHHHHHHHhhc-CCCceeeeEEeEeccCcEEecCCCc
Q psy10620 36 TSTKEACLAACLN-EHRFTCRSVEYNYVTLQCHLSDSDR 73 (589)
Q Consensus 36 ~~s~~~C~~~Cl~-~~~f~CrS~~y~~~~~~C~Ls~~~~ 73 (589)
..++++|.++|++ +.++.|+++.|....+.|.+-..+.
T Consensus 24 ~~s~~eC~~~Cl~~~~nCsC~Aya~~~~~~gC~~W~~~l 62 (80)
T cd00129 24 ANTADECANRCEKNGLPFSCKAFVFAKARKQCLWFPFNS 62 (80)
T ss_pred ccCHHHHHHHHhcCCCCCCceeeeccCCCCCeEEecCcc
Confidence 4789999999999 4467999999976667899987765
No 15
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=81.57 E-value=2.1 Score=35.40 Aligned_cols=38 Identities=21% Similarity=0.584 Sum_probs=31.0
Q ss_pred eCCHHHHHHHhhc-cCCceeeeEEecCCCCCCccceEEeccCC
Q psy10620 136 VTNEAACRLACEI-ENEFLCRSFLYKGPPIGAQYNCQLFHLDH 177 (589)
Q Consensus 136 ~~s~~~C~~~C~~-~~~f~CrS~~y~~~~~~~~~~C~Ls~~d~ 177 (589)
..+.++|+++|+. +.++.|++|.|.... ..|.+-..+.
T Consensus 24 ~~s~~eC~~~Cl~~~~nCsC~Aya~~~~~----~gC~~W~~~l 62 (80)
T cd00129 24 ANTADECANRCEKNGLPFSCKAFVFAKAR----KQCLWFPFNS 62 (80)
T ss_pred ccCHHHHHHHHhcCCCCCCceeeeccCCC----CCeEEecCcc
Confidence 4689999999976 338999999998653 4699988776
No 16
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=78.66 E-value=1.7 Score=31.77 Aligned_cols=36 Identities=22% Similarity=0.567 Sum_probs=16.6
Q ss_pred eeCCHHHHHHHhhccCCceeeeEEecCCC-CCCccceEE
Q psy10620 135 QVTNEAACRLACEIENEFLCRSFLYKGPP-IGAQYNCQL 172 (589)
Q Consensus 135 ~~~s~~~C~~~C~~~~~f~CrS~~y~~~~-~~~~~~C~L 172 (589)
.+.+.++|+++|. .+-.|..|.|.... .+..+.|.|
T Consensus 14 ~~~s~~~C~~~C~--~~~~C~~~~~~~~~~~~~~~~C~L 50 (51)
T PF14295_consen 14 TASSPEECQAACA--ADPGCQAFTFNPPGCPSSSGRCYL 50 (51)
T ss_dssp ----HHHHHHHHH--TSTT--EEEEETTEE---------
T ss_pred cCCCHHHHHHHcc--CCCCCCEEEEECCCcccccccccC
Confidence 5789999999995 34479999999831 112478877
No 17
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=78.31 E-value=3.1 Score=33.46 Aligned_cols=36 Identities=22% Similarity=0.474 Sum_probs=28.6
Q ss_pred eeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEeccC
Q psy10620 135 QVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFHLD 176 (589)
Q Consensus 135 ~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~~d 176 (589)
.+.+.++|++.|+. +-.|+.|+|+... +.|.|-...
T Consensus 23 ~~~s~~~Cq~~C~~--~~~C~afT~~~~~----~~C~lk~~~ 58 (73)
T cd01100 23 FASSAEQCQAACTA--DPGCLAFTYNTKS----KKCFLKSSE 58 (73)
T ss_pred ecCCHHHHHHHcCC--CCCceEEEEECCC----CeEEcccCC
Confidence 46799999999954 4589999999653 799996543
No 18
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=78.15 E-value=2.9 Score=37.45 Aligned_cols=26 Identities=12% Similarity=0.139 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy10620 555 ILALGVTCCLLILIYVSTIFCYYIKK 580 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~~~~~~~ 580 (589)
+++++|+|+++-++.+++++.|+.||
T Consensus 65 ~i~~Ii~gv~aGvIg~Illi~y~irR 90 (122)
T PF01102_consen 65 AIIGIIFGVMAGVIGIILLISYCIRR 90 (122)
T ss_dssp CHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ceeehhHHHHHHHHHHHHHHHHHHHH
Confidence 34455555554444444433333333
No 19
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=75.13 E-value=4 Score=33.65 Aligned_cols=39 Identities=18% Similarity=0.350 Sum_probs=29.8
Q ss_pred eeeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEecc
Q psy10620 134 LQVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFHL 175 (589)
Q Consensus 134 ~~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~~ 175 (589)
+.+.+.++|++.|. .+-.|+.|.|+....+ ...|.|-+.
T Consensus 19 ~~~~~~~~Cq~~Ct--~~~~C~~FTf~~~~~~-~~~C~LK~s 57 (79)
T smart00223 19 VYVPSAQVCQKRCT--SHPRCLFFTFSTNEPP-EEKCLLKDS 57 (79)
T ss_pred eecCCHHHHHHhhc--CCCCccEEEeeCCCCC-CCEeEeCcC
Confidence 56789999999995 3458999999976521 238999654
No 20
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=71.84 E-value=6.5 Score=32.21 Aligned_cols=33 Identities=15% Similarity=0.225 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc
Q psy10620 555 ILALGVTCCLLILIYVSTIFCYYIKKWMTPRKV 587 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~~~~~~~~~~~~~~ 587 (589)
.++=+|++=+++.++|+++..|..|.++||.++
T Consensus 35 ~LaGiV~~D~vlTLLIv~~vy~car~r~r~~~~ 67 (79)
T PF07213_consen 35 LLAGIVAADAVLTLLIVLVVYYCARPRRRPTQE 67 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCccc
Confidence 555566666666677777777788888888764
No 21
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=71.65 E-value=5.4 Score=33.19 Aligned_cols=26 Identities=23% Similarity=0.418 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 558 LGVTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 558 l~i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
+++++.+|++++|.-+.|++.||+-|
T Consensus 37 ~lvI~~iFil~VilwfvCC~kRkrsR 62 (94)
T PF05393_consen 37 FLVICGIFILLVILWFVCCKKRKRSR 62 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 55677777778888777777776643
No 22
>PF08277 PAN_3: PAN-like domain; InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs The PAN-3 or CW is a domain associated with a number of Caenorhabditis elegans hypothetical proteins.
Probab=67.75 E-value=10 Score=30.01 Aligned_cols=39 Identities=15% Similarity=0.432 Sum_probs=33.4
Q ss_pred eEEecCCHHHHHHHhhcCCCceeeeEEeEeccCcEEecCCCcc
Q psy10620 32 ALIYTSTKEACLAACLNEHRFTCRSVEYNYVTLQCHLSDSDRR 74 (589)
Q Consensus 32 ~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~~~~~C~Ls~~~~~ 74 (589)
......+.++|+..|..+. .|--|-++ .+.|.|...+..
T Consensus 15 ~~~~~~sw~~Cv~~C~~~~--~C~la~~~--~~~C~~y~~~~i 53 (71)
T PF08277_consen 15 SSTTNTSWDDCVQKCYNDE--NCVLAYFD--SGKCYLYNYGSI 53 (71)
T ss_pred ccccCCCHHHHhHHhCCCC--EEEEEEeC--CCCEEEEEcCCE
Confidence 4457889999999999999 99999997 899999876553
No 23
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=64.75 E-value=13 Score=26.06 Aligned_cols=20 Identities=20% Similarity=0.321 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy10620 555 ILALGVTCCLLILIYVSTIF 574 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~ 574 (589)
.+|++++.++-++++++.++
T Consensus 5 ~IaIIv~V~vg~~iiii~~~ 24 (38)
T PF02439_consen 5 TIAIIVAVVVGMAIIIICMF 24 (38)
T ss_pred hhhHHHHHHHHHHHHHHHHH
Confidence 33433333333333333333
No 24
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=63.05 E-value=7.1 Score=31.85 Aligned_cols=19 Identities=26% Similarity=0.399 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy10620 556 LALGVTCCLLILIYVSTIF 574 (589)
Q Consensus 556 ~~l~i~~~ll~~~~i~~~~ 574 (589)
++.+++|++++++++++++
T Consensus 3 i~~~~~g~~~ll~~v~~~~ 21 (75)
T PF14575_consen 3 IASIIVGVLLLLVLVIIVI 21 (75)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred EehHHHHHHHHHHhheeEE
Confidence 3445555555544444433
No 25
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=61.39 E-value=5.1 Score=39.03 Aligned_cols=20 Identities=20% Similarity=0.316 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy10620 554 SILALGVTCCLLILIYVSTI 573 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~ 573 (589)
.+|+|+++|.++++++++..
T Consensus 101 ~lI~lv~~g~~lLla~~~~~ 120 (202)
T PF06365_consen 101 TLIALVTSGSFLLLAILLGA 120 (202)
T ss_pred EEEehHHhhHHHHHHHHHHH
Confidence 47788888855554444444
No 26
>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=58.71 E-value=15 Score=28.88 Aligned_cols=38 Identities=21% Similarity=0.427 Sum_probs=28.3
Q ss_pred eeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEeccC
Q psy10620 135 QVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFHLD 176 (589)
Q Consensus 135 ~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~~d 176 (589)
...++++|+.+|+. +=.|..|.|.... +...|.|-+.+
T Consensus 25 ~~~s~~~C~~~Cl~--nCsC~Ayay~~~~--~~~~C~lW~~~ 62 (66)
T PF08276_consen 25 SSVSLEECEKACLS--NCSCTAYAYSNLS--GGGGCLLWYGD 62 (66)
T ss_pred cCCCHHHHHhhcCC--CCCEeeEEeeccC--CCCEEEEEcCE
Confidence 45899999999974 3479999998632 12689986654
No 27
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=56.14 E-value=3.7 Score=42.33 Aligned_cols=25 Identities=20% Similarity=0.375 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 558 LGVTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 558 l~i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
.+|+.++|++|+|+|++||. |||..
T Consensus 152 aVVI~~iLLIA~iIa~icyr-rkR~G 176 (290)
T PF05454_consen 152 AVVIAAILLIAGIIACICYR-RKRKG 176 (290)
T ss_dssp --------------------------
T ss_pred HHHHHHHHHHHHHHHHHhhh-hhhcc
Confidence 33444444445555555544 44433
No 28
>PF03988 DUF347: Repeat of Unknown Function (DUF347) ; InterPro: IPR007136 This repeat is found as four tandem repeats in a family of bacterial membrane proteins. Each repeat contains two transmembrane regions and a conserved tryptophan.
Probab=53.40 E-value=17 Score=27.76 Aligned_cols=29 Identities=14% Similarity=0.287 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhcCc
Q psy10620 556 LALGVTCCLLILIYVSTIFCYYIKKWMTP 584 (589)
Q Consensus 556 ~~l~i~~~ll~~~~i~~~~~~~~~~~~~~ 584 (589)
++..+..++++.++++++..|++.++.+|
T Consensus 27 lg~~~~~~~~~~~l~~~~~~~~~~~~~~p 55 (55)
T PF03988_consen 27 LGYLISTLIFAALLAVVLALWYRSKRYRP 55 (55)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHhccCC
Confidence 44555666666666667777888887776
No 29
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=51.95 E-value=13 Score=38.89 Aligned_cols=36 Identities=17% Similarity=0.326 Sum_probs=16.5
Q ss_pred ecccCChh--HHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy10620 546 VEPCPTKT--SILALGVTCCLLILIYVSTIFCYYIKKW 581 (589)
Q Consensus 546 ~~~C~s~~--~~~~l~i~~~ll~~~~i~~~~~~~~~~~ 581 (589)
.+.|..-. .++-|+|-++|..|++|+.|..++.|||
T Consensus 261 a~~C~~D~~~~~vPIaVG~~La~lvlivLiaYli~Rrr 298 (306)
T PF01299_consen 261 AEECSSDDTSDLVPIAVGAALAGLVLIVLIAYLIGRRR 298 (306)
T ss_pred hhcCCcCCccchHHHHHHHHHHHHHHHHHHhheeEecc
Confidence 35675422 3444444444444444444444444443
No 30
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=50.37 E-value=2.9 Score=36.55 Aligned_cols=26 Identities=23% Similarity=0.439 Sum_probs=1.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYYIKKW 581 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~~~~~ 581 (589)
.+.+++|+.+|| ++++++.||+-|||
T Consensus 25 EAaGIGiL~VIL--giLLliGCWYckRR 50 (118)
T PF14991_consen 25 EAAGIGILIVIL--GILLLIGCWYCKRR 50 (118)
T ss_dssp ---SSS----------------------
T ss_pred HhccceeHHHHH--HHHHHHhheeeeec
Confidence 355655554444 34445678888876
No 31
>KOG1094|consensus
Probab=49.50 E-value=16 Score=41.13 Aligned_cols=40 Identities=13% Similarity=0.237 Sum_probs=20.1
Q ss_pred cccCChhHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCccc
Q psy10620 547 EPCPTKTSILALGVTCCLLILIYVSTIFCYYIKKWMTPRK 586 (589)
Q Consensus 547 ~~C~s~~~~~~l~i~~~ll~~~~i~~~~~~~~~~~~~~~~ 586 (589)
++|-+++.++.++....+|++++|++++.|..||.+.|++
T Consensus 384 ~~~~~~t~~~~~~f~~if~iva~ii~~~L~R~rr~~~ka~ 423 (807)
T KOG1094|consen 384 KVDGSPTAILIIIFVAIFLIVALIIALMLWRWRRLLSKAS 423 (807)
T ss_pred EEcCCCceehHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 5787777444433334444445555555555444444433
No 32
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=47.67 E-value=22 Score=36.33 Aligned_cols=26 Identities=15% Similarity=0.475 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHH--HHHHhhhhhcCc
Q psy10620 559 GVTCCLLILIYVST--IFCYYIKKWMTP 584 (589)
Q Consensus 559 ~i~~~ll~~~~i~~--~~~~~~~~~~~~ 584 (589)
+|+.++|++++|+. .-.|+.|||-..
T Consensus 262 giaalvllil~vvliiLYiWlyrrRK~s 289 (295)
T TIGR01478 262 GIAALVLIILTVVLIILYIWLYRRRKKS 289 (295)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 34444333333332 235555555433
No 33
>PF04689 S1FA: DNA binding protein S1FA; InterPro: IPR006779 S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=44.84 E-value=18 Score=28.37 Aligned_cols=37 Identities=24% Similarity=0.351 Sum_probs=21.1
Q ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCccc
Q psy10620 550 PTKTSILALGVTCCLLILIYVSTIFCYYIKKWMTPRK 586 (589)
Q Consensus 550 ~s~~~~~~l~i~~~ll~~~~i~~~~~~~~~~~~~~~~ 586 (589)
+++..++.|.|++++++.++--.++..+..+-+-|++
T Consensus 10 lnPGlIVLlvV~g~ll~flvGnyvlY~Yaqk~lpp~k 46 (69)
T PF04689_consen 10 LNPGLIVLLVVAGLLLVFLVGNYVLYVYAQKTLPPKK 46 (69)
T ss_pred CCCCeEEeehHHHHHHHHHHHHHHHHHHHhhcCCCCC
Confidence 3555555566666666655444555555555555554
No 34
>PF15102 TMEM154: TMEM154 protein family
Probab=44.57 E-value=5.7 Score=36.55 Aligned_cols=6 Identities=33% Similarity=0.147 Sum_probs=2.9
Q ss_pred hhhcCc
Q psy10620 579 KKWMTP 584 (589)
Q Consensus 579 ~~~~~~ 584 (589)
|||++|
T Consensus 83 kRkr~K 88 (146)
T PF15102_consen 83 KRKRTK 88 (146)
T ss_pred eecccC
Confidence 454443
No 35
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=44.32 E-value=26 Score=28.20 Aligned_cols=35 Identities=23% Similarity=0.506 Sum_probs=26.8
Q ss_pred eeCCHHHHHHHhhccCCceeeeEEecCCCCCCccceEEecc
Q psy10620 135 QVTNEAACRLACEIENEFLCRSFLYKGPPIGAQYNCQLFHL 175 (589)
Q Consensus 135 ~~~s~~~C~~~C~~~~~f~CrS~~y~~~~~~~~~~C~Ls~~ 175 (589)
...++++|+++|+. +=.|..|.|.... +.|.+-..
T Consensus 30 ~~~s~~~C~~~Cl~--nCsC~a~~~~~~~----~~C~~~~~ 64 (84)
T cd01098 30 TAISLEECREACLS--NCSCTAYAYNNGS----GGCLLWNG 64 (84)
T ss_pred ccCCHHHHHHHHhc--CCCcceeeecCCC----CeEEEEec
Confidence 45789999999974 5689999998642 68988443
No 36
>PTZ00370 STEVOR; Provisional
Probab=42.98 E-value=25 Score=36.08 Aligned_cols=24 Identities=17% Similarity=0.523 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHH--HHHhhhhh
Q psy10620 558 LGVTCCLLILIYVSTI--FCYYIKKW 581 (589)
Q Consensus 558 l~i~~~ll~~~~i~~~--~~~~~~~~ 581 (589)
.+|+.++|++++|+.| ..|+.|||
T Consensus 257 ygiaalvllil~vvliilYiwlyrrR 282 (296)
T PTZ00370 257 YGIAALVLLILAVVLIILYIWLYRRR 282 (296)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3444444444433332 25555554
No 37
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=42.12 E-value=56 Score=23.02 Aligned_cols=30 Identities=23% Similarity=0.301 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
..++-.++|++++++.+..-.|.++|.+++
T Consensus 7 aIIv~V~vg~~iiii~~~~YaCcykk~~~~ 36 (38)
T PF02439_consen 7 AIIVAVVVGMAIIIICMFYYACCYKKHRRQ 36 (38)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHcccccc
Confidence 466667777777777777655555555444
No 38
>PRK00523 hypothetical protein; Provisional
Probab=39.14 E-value=36 Score=27.47 Aligned_cols=28 Identities=21% Similarity=0.231 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 556 LALGVTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 556 ~~l~i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
+++.|+..++++++-+++.+|+.||++.
T Consensus 4 ~~l~I~l~i~~li~G~~~Gffiark~~~ 31 (72)
T PRK00523 4 IGLALGLGIPLLIVGGIIGYFVSKKMFK 31 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444445566777777654
No 39
>PF11346 DUF3149: Protein of unknown function (DUF3149); InterPro: IPR021494 This bacterial family of proteins has no known function.
Probab=37.84 E-value=62 Score=23.35 Aligned_cols=31 Identities=16% Similarity=0.179 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcCcc
Q psy10620 555 ILALGVTCCLLILIYVSTIFCYYIKKWMTPR 585 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~~~~~~~~~~~~ 585 (589)
-++|..+.+|+..+.+.+.+.++..|++..+
T Consensus 10 ~vGL~Sl~vI~~~igm~~~~~~~F~~k~~~~ 40 (42)
T PF11346_consen 10 DVGLMSLIVIVFTIGMGVFFIRYFIRKMKED 40 (42)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
Confidence 4555556666665666665655555555443
No 40
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.32 E-value=43 Score=26.87 Aligned_cols=26 Identities=19% Similarity=0.105 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 558 LGVTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 558 l~i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
++|++.++.+++-++...|+.||.+.
T Consensus 5 lail~ivl~ll~G~~~G~fiark~~~ 30 (71)
T COG3763 5 LAILLIVLALLAGLIGGFFIARKQMK 30 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444455555555566777777643
No 41
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=35.13 E-value=63 Score=27.85 Aligned_cols=33 Identities=15% Similarity=0.208 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCccc
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYYIKKWMTPRK 586 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~~~~~~~~~~ 586 (589)
.+.+++++.+.+.+++++|+=|-+.+|.+..++
T Consensus 19 ~LVGVv~~al~~SlLIalaaKC~~~~k~~~SY~ 51 (102)
T PF15176_consen 19 FLVGVVVTALVTSLLIALAAKCPVWYKYLASYR 51 (102)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhHHHHHHHhccc
Confidence 466666666666666666766766666554443
No 42
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=34.19 E-value=54 Score=29.45 Aligned_cols=29 Identities=14% Similarity=0.135 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhc
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYYIKKWM 582 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~~~~~~ 582 (589)
..|.|+|++-+++++++++.++..+||+-
T Consensus 67 ~~Ii~gv~aGvIg~Illi~y~irR~~Kk~ 95 (122)
T PF01102_consen 67 IGIIFGVMAGVIGIILLISYCIRRLRKKS 95 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHS---
T ss_pred eehhHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 34444444444444556666778887763
No 43
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=33.55 E-value=48 Score=38.76 Aligned_cols=33 Identities=27% Similarity=0.700 Sum_probs=18.2
Q ss_pred HHHHHHHH--HHHHHHHHHHHHHHhhhhhcCcccc
Q psy10620 555 ILALGVTC--CLLILIYVSTIFCYYIKKWMTPRKV 587 (589)
Q Consensus 555 ~~~l~i~~--~ll~~~~i~~~~~~~~~~~~~~~~~ 587 (589)
++.++|+| +|++++++.+.+||.|||-..|+++
T Consensus 273 ~fLl~ILG~~~livl~lL~vLl~yCrrkc~~~r~~ 307 (807)
T PF10577_consen 273 VFLLAILGGTALIVLILLCVLLCYCRRKCLKPRQR 307 (807)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcccCCcccc
Confidence 34444555 3333333335668888877777653
No 44
>PHA03283 envelope glycoprotein E; Provisional
Probab=33.14 E-value=36 Score=37.71 Aligned_cols=33 Identities=18% Similarity=0.294 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh-hhhhcCccccC
Q psy10620 556 LALGVTCCLLILIYVSTIFCYY-IKKWMTPRKVM 588 (589)
Q Consensus 556 ~~l~i~~~ll~~~~i~~~~~~~-~~~~~~~~~~~ 588 (589)
++.+++++.++++++++++|.. ++++++|++.+
T Consensus 403 ~~~~~~~~~~~~~~l~vw~c~~~r~~~~~~y~il 436 (542)
T PHA03283 403 LLAIICTCAALLVALVVWGCILYRRSNRKPYEVL 436 (542)
T ss_pred HHHHHHHHHHHHHHHhhhheeeehhhcCCccccc
Confidence 3333333444455566677776 77777887653
No 45
>PF07502 MANEC: MANEC domain; InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=33.12 E-value=57 Score=27.74 Aligned_cols=43 Identities=21% Similarity=0.437 Sum_probs=33.3
Q ss_pred eEEEecCeEEecccc--------eEEecCCHHHHHHHhhcCCCceeeeEEeEe
Q psy10620 17 AFERVPNKMIRGLDN--------ALIYTSTKEACLAACLNEHRFTCRSVEYNY 61 (589)
Q Consensus 17 ~Fer~~~~~L~g~~~--------~~~~~~s~~~C~~~Cl~~~~f~CrS~~y~~ 61 (589)
.|...++++|.-.+. ....+.+.++|+.+|=..+ .|-=|.|..
T Consensus 8 ~f~~~~d~iIdt~~S~~~GA~fl~~~~v~s~~dC~~aCC~~~--~Cnlav~e~ 58 (92)
T PF07502_consen 8 DFRSFPDFIIDTEESVENGATFLSSPEVSSAEDCVRACCSTP--NCNLAVFEE 58 (92)
T ss_pred ccccCCCeEEehHHHHHCCCeecCccccCCHHHHHHHhCCCC--CCcEEEEec
Confidence 488899999873211 2346899999999999999 588888864
No 46
>PF09680 Tiny_TM_bacill: Protein of unknown function (Tiny_TM_bacill); InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=32.74 E-value=25 Score=22.04 Aligned_cols=15 Identities=20% Similarity=0.401 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHHH
Q psy10620 559 GVTCCLLILIYVSTI 573 (589)
Q Consensus 559 ~i~~~ll~~~~i~~~ 573 (589)
+.+.+||++++|+..
T Consensus 7 alivVLFILLiIvG~ 21 (24)
T PF09680_consen 7 ALIVVLFILLIIVGA 21 (24)
T ss_pred hhHHHHHHHHHHhcc
Confidence 344455555555543
No 47
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=30.93 E-value=55 Score=24.13 Aligned_cols=19 Identities=11% Similarity=0.317 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHHHhhhh
Q psy10620 562 CCLLILIYVSTIFCYYIKK 580 (589)
Q Consensus 562 ~~ll~~~~i~~~~~~~~~~ 580 (589)
+++++.++.++++.|..++
T Consensus 14 ~~v~~~~~F~gi~~w~~~~ 32 (49)
T PF05545_consen 14 GTVLFFVFFIGIVIWAYRP 32 (49)
T ss_pred HHHHHHHHHHHHHHHHHcc
Confidence 3344444444444444443
No 48
>KOG3441|consensus
Probab=30.92 E-value=2e+02 Score=25.67 Aligned_cols=65 Identities=18% Similarity=0.273 Sum_probs=43.4
Q ss_pred CCCcEEEEecCCCCeeeeeecCCEEEEEEEEcCCCcccEEEeEEEEEc----CCCCceEEEEeCCCCCCCCCCCC
Q psy10620 367 APPPRIRILDTKSREVETVRIGDKLTFRIEIPEETPYGIFARSCVAMA----KDSKSTFQIIDDEGCPVDPNIFP 437 (589)
Q Consensus 367 ~p~~~m~i~~~~g~~v~~v~vGd~l~l~~~~~~~~~~~i~V~~C~A~d----~~~~~~~~LID~~GCp~D~~i~~ 437 (589)
.++.-+.++..+| ..++||++-+-+. +.-.-.+.|..|.-.. .-+.+.+.|||++|=|+...|--
T Consensus 55 r~PrCIHVYkkrg----vg~~GDkiLvAIk--GQmkKa~vVGh~~~~k~~~P~fDsNniVLiddnGnPlGtRI~~ 123 (149)
T KOG3441|consen 55 RLPRCIHVYKKRG----VGELGDKILVAIK--GQMKKAYVVGHVHYRKHGVPVFDSNNIVLIDDNGNPLGTRITA 123 (149)
T ss_pred CCCceEEEEeccc----ccccccEEEEEEe--cceeeeEEEEeeccCCCCCcccCCCcEEEECCCCCcccceEec
Confidence 3444577776665 5789999988665 3334456666665432 12346899999999999877643
No 49
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=30.07 E-value=17 Score=33.99 Aligned_cols=28 Identities=18% Similarity=0.204 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhc
Q psy10620 555 ILALGVTCCLLILIYVSTIFCYYIKKWM 582 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~~~~~~~~~ 582 (589)
.|.=+|+|+|+.+.+|.+|.+.++||+.
T Consensus 130 tLVGIIVGVLlaIG~igGIIivvvRKmS 157 (162)
T PF05808_consen 130 TLVGIIVGVLLAIGFIGGIIIVVVRKMS 157 (162)
T ss_dssp ----------------------------
T ss_pred eeeeehhhHHHHHHHHhheeeEEeehhc
Confidence 4555688999999999999988888863
No 50
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=30.06 E-value=17 Score=39.92 Aligned_cols=15 Identities=20% Similarity=0.375 Sum_probs=0.0
Q ss_pred eEEEecCCeEEEEEE
Q psy10620 251 VTVHCKDTRIAVQVR 265 (589)
Q Consensus 251 v~v~C~~~~m~V~v~ 265 (589)
+...|....|++-+.
T Consensus 40 ~~~~C~~~~~~~~l~ 54 (439)
T PF02480_consen 40 VDKACVRKAMEVPLA 54 (439)
T ss_dssp ---------------
T ss_pred eccccccccccccee
Confidence 446777666666555
No 51
>PRK01844 hypothetical protein; Provisional
Probab=30.02 E-value=55 Score=26.47 Aligned_cols=21 Identities=14% Similarity=0.257 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHhhhhhcC
Q psy10620 563 CLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 563 ~ll~~~~i~~~~~~~~~~~~~ 583 (589)
.++.+++-+++.+|+.||++.
T Consensus 10 ~I~~li~G~~~Gff~ark~~~ 30 (72)
T PRK01844 10 GVVALVAGVALGFFIARKYMM 30 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333444444566777777654
No 52
>PF06596 PsbX: Photosystem II reaction centre X protein (PsbX); InterPro: IPR009518 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. The low molecular weight transmembrane protein PsbX found in PSII is associated with the oxygen-evolving complex. Its expression is light-regulated. PsbX appears to be involved in the regulation of the amount of PSII [], and may be involved in the binding or turnover of quinone molecules at the Qb (PsbA) site [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0016020 membrane; PDB: 3ARC_x 3A0H_X 3A0B_X 3PRR_X 1S5L_x 4FBY_j 3PRQ_X 3KZI_X 3BZ2_X 3BZ1_X.
Probab=29.80 E-value=1.2e+02 Score=21.62 Aligned_cols=24 Identities=13% Similarity=0.256 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYY 577 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~ 577 (589)
+++.-+++|.+++++.|.+.+.++
T Consensus 7 nfl~Sl~aG~~iVv~~i~~ali~V 30 (39)
T PF06596_consen 7 NFLLSLVAGAVIVVIPIAGALIFV 30 (39)
T ss_dssp HHHHHHHHHH-HHHHHHHHHHHHH
T ss_pred HHHHHHHhhhhhhhhhhhhheEEE
Confidence 566666777755555555544443
No 53
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=29.71 E-value=81 Score=23.64 Aligned_cols=20 Identities=25% Similarity=0.110 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy10620 555 ILALGVTCCLLILIYVSTIF 574 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~ 574 (589)
..||+++++|+++-++++..
T Consensus 16 igGLi~A~vlfi~Gi~iils 35 (50)
T PF02038_consen 16 IGGLIFAGVLFILGILIILS 35 (50)
T ss_dssp HHHHHHHHHHHHHHHHHHCT
T ss_pred ccchHHHHHHHHHHHHHHHc
Confidence 67788888888766655544
No 54
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=29.41 E-value=24 Score=33.68 Aligned_cols=14 Identities=29% Similarity=0.909 Sum_probs=10.4
Q ss_pred EEEecCCCCCCCCC
Q psy10620 467 VKYCLGPCEPAVCE 480 (589)
Q Consensus 467 V~vC~~~C~p~~C~ 480 (589)
..+|=|.|.--.|=
T Consensus 26 ~~~CCG~C~~ryCC 39 (179)
T PF13908_consen 26 FTFCCGTCSLRYCC 39 (179)
T ss_pred cceecCCccCcchh
Confidence 67888888866664
No 55
>smart00765 MANEC The MANEC domain was formerly called MANSC. This domain, comprising 8 conserved cysteines, is found in the N terminus of higher multicellular animal membrane and extracellular proteins. It is postulated that this domain may play a role in the formation of protein complexes involving various protease activators and inhibitors. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulfide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase.
Probab=28.96 E-value=70 Score=27.29 Aligned_cols=55 Identities=27% Similarity=0.510 Sum_probs=37.6
Q ss_pred CCCCCeeEEEecCeEEecccc-----eE----EecCCHHHHHHHhhcCCCceeeeEEeEe----ccCcEEec
Q psy10620 11 VCLRPWAFERVPNKMIRGLDN-----AL----IYTSTKEACLAACLNEHRFTCRSVEYNY----VTLQCHLS 69 (589)
Q Consensus 11 ~C~~~~~Fer~~~~~L~g~~~-----~~----~~~~s~~~C~~~Cl~~~~f~CrS~~y~~----~~~~C~Ls 69 (589)
.|... |+..+|.+|..... .. ..+.+.++|+.+|=..+ .|-=|.|.. ..++|.|.
T Consensus 5 ~C~~~--f~~~~n~ii~t~~S~~~Ga~fL~~p~~~~s~edC~~aCC~~~--~CnlAv~e~~~~~~~~~CyLf 72 (93)
T smart00765 5 DCLGR--FRVLENAIIRTEESLSAGARFLKSPIAVNTWEDCVRACCSTP--NCNLAVFELRREDAEGNCYLF 72 (93)
T ss_pred ccccc--cccCCCeeeccchhhhccccccCCccccCCHHHHHHHHcCCC--CCcEEEEeccCCCCCCceEEE
Confidence 56432 77788888874322 12 35789999999999999 698888852 23555554
No 56
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=28.19 E-value=16 Score=31.24 Aligned_cols=17 Identities=6% Similarity=0.368 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy10620 563 CLLILIYVSTIFCYYIK 579 (589)
Q Consensus 563 ~ll~~~~i~~~~~~~~~ 579 (589)
+++++.+|+++++|++.
T Consensus 75 ~~~~v~~lv~~l~w~f~ 91 (96)
T PTZ00382 75 VVAVVGGLVGFLCWWFV 91 (96)
T ss_pred hhhHHHHHHHHHhheeE
Confidence 33444444444444443
No 57
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=27.89 E-value=20 Score=32.12 Aligned_cols=14 Identities=21% Similarity=0.263 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHH
Q psy10620 558 LGVTCCLLILIYVS 571 (589)
Q Consensus 558 l~i~~~ll~~~~i~ 571 (589)
+.+++++|++++++
T Consensus 83 ~sal~v~lVl~lls 96 (129)
T PF12191_consen 83 GSALSVVLVLALLS 96 (129)
T ss_dssp --------------
T ss_pred hhHHHHHHHHHHHH
Confidence 34444444433333
No 58
>COG3190 FliO Flagellar biogenesis protein [Cell motility and secretion]
Probab=27.12 E-value=71 Score=29.22 Aligned_cols=24 Identities=8% Similarity=0.299 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 560 VTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 560 i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
+.+.|+++++++.+++|+.||..+
T Consensus 27 ~~gsL~~iL~lil~~~wl~kr~~~ 50 (137)
T COG3190 27 MFGSLILILALILFLAWLVKRLGR 50 (137)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Confidence 444444445555557788887765
No 59
>PF15102 TMEM154: TMEM154 protein family
Probab=25.84 E-value=37 Score=31.31 Aligned_cols=10 Identities=10% Similarity=-0.021 Sum_probs=4.1
Q ss_pred HHhhhhhcCc
Q psy10620 575 CYYIKKWMTP 584 (589)
Q Consensus 575 ~~~~~~~~~~ 584 (589)
+-.+|.+..|
T Consensus 82 ~kRkr~K~~~ 91 (146)
T PF15102_consen 82 YKRKRTKQEP 91 (146)
T ss_pred EeecccCCCC
Confidence 3344444333
No 60
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=25.32 E-value=44 Score=28.34 Aligned_cols=21 Identities=14% Similarity=0.387 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHHhhhhh
Q psy10620 561 TCCLLILIYVSTIFCYYIKKW 581 (589)
Q Consensus 561 ~~~ll~~~~i~~~~~~~~~~~ 581 (589)
.|.++++++|+++++..+.||
T Consensus 48 GGG~iLilIii~Lv~CC~~K~ 68 (98)
T PF07204_consen 48 GGGLILILIIIALVCCCRAKH 68 (98)
T ss_pred cchhhhHHHHHHHHHHhhhhh
Confidence 344444444444444444443
No 61
>PRK11486 flagellar biosynthesis protein FliO; Provisional
Probab=24.99 E-value=75 Score=28.60 Aligned_cols=20 Identities=10% Similarity=0.243 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHhhhhh
Q psy10620 561 TCCLLILIYVSTIFCYYIKKW 581 (589)
Q Consensus 561 ~~~ll~~~~i~~~~~~~~~~~ 581 (589)
.++++++++|.+ +.|++||.
T Consensus 23 ~~L~lVl~lI~~-~aWLlkR~ 42 (124)
T PRK11486 23 GALIGIIALILA-AAWLVKRL 42 (124)
T ss_pred HHHHHHHHHHHH-HHHHHHHc
Confidence 333444444444 45666664
No 62
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=24.97 E-value=61 Score=34.79 Aligned_cols=31 Identities=19% Similarity=0.337 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcCcc
Q psy10620 555 ILALGVTCCLLILIYVSTIFCYYIKKWMTPR 585 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~~~~~~~~~~~~ 585 (589)
+|.|+|+.-..+|++|+++|+.+.||||...
T Consensus 387 ~i~~avl~p~~il~~~~~~~~~~v~rrr~~~ 417 (436)
T PTZ00208 387 MIILAVLVPAIILAIIAVAFFIMVKRRRNSS 417 (436)
T ss_pred HHHHHHHHHHHHHHHHHHHhheeeeeccCCc
Confidence 4444455544444455554444444444433
No 63
>PF06809 NPDC1: Neural proliferation differentiation control-1 protein (NPDC1); InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=24.69 E-value=35 Score=35.44 Aligned_cols=25 Identities=16% Similarity=0.298 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhcC
Q psy10620 559 GVTCCLLILIYVSTIFCYYIKKWMT 583 (589)
Q Consensus 559 ~i~~~ll~~~~i~~~~~~~~~~~~~ 583 (589)
++.++..+.++|+|.+||++.++..
T Consensus 204 v~~cvaG~aAliva~~cW~Rlqr~~ 228 (341)
T PF06809_consen 204 VVCCVAGAAALIVAGYCWYRLQREI 228 (341)
T ss_pred HHHHHHHHHHHHHhhheEEEecccc
Confidence 3334444556667788999876643
No 64
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=24.07 E-value=1.2e+02 Score=22.82 Aligned_cols=25 Identities=24% Similarity=0.235 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhh
Q psy10620 555 ILALGVTCCLLILIYVSTIFCYYIK 579 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~~~~~~ 579 (589)
++.++|-..+++.+++++++.|-.|
T Consensus 3 il~~LIpiSl~l~~~~l~~f~Wavk 27 (51)
T TIGR00847 3 ILTILIPISLLLGGVGLVAFLWSLK 27 (51)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 3444555555555555555655544
No 65
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=23.52 E-value=95 Score=25.76 Aligned_cols=12 Identities=42% Similarity=0.368 Sum_probs=4.0
Q ss_pred HHHHHHHHHHHH
Q psy10620 555 ILALGVTCCLLI 566 (589)
Q Consensus 555 ~~~l~i~~~ll~ 566 (589)
++++++++++++
T Consensus 6 i~~iialiv~~i 17 (81)
T PF00558_consen 6 ILAIIALIVALI 17 (81)
T ss_dssp --HHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 344434443333
No 66
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=23.17 E-value=1.1e+02 Score=22.43 Aligned_cols=24 Identities=17% Similarity=0.311 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhh
Q psy10620 556 LALGVTCCLLILIYVSTIFCYYIK 579 (589)
Q Consensus 556 ~~l~i~~~ll~~~~i~~~~~~~~~ 579 (589)
+.+.|-.++++.+++++++.|-.|
T Consensus 3 l~~lip~sl~l~~~~l~~f~Wavk 26 (45)
T PF03597_consen 3 LYILIPVSLILGLIALAAFLWAVK 26 (45)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHc
Confidence 344444455554555555555443
No 67
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=22.96 E-value=55 Score=33.78 Aligned_cols=18 Identities=17% Similarity=0.606 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHhhhh
Q psy10620 563 CLLILIYVSTIFCYYIKK 580 (589)
Q Consensus 563 ~ll~~~~i~~~~~~~~~~ 580 (589)
++|++.++.+++.+++||
T Consensus 240 ~v~ll~l~Gii~~~~~r~ 257 (281)
T PF12768_consen 240 TVFLLVLIGIILAYIRRR 257 (281)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 334434444455555555
No 68
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=21.95 E-value=29 Score=27.33 Aligned_cols=16 Identities=25% Similarity=0.507 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHH
Q psy10620 555 ILALGVTCCLLILIYV 570 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i 570 (589)
+++-+|+|+++++++|
T Consensus 14 vIaG~Vvgll~ailLI 29 (64)
T PF01034_consen 14 VIAGGVVGLLFAILLI 29 (64)
T ss_dssp ----------------
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4444555555554444
No 69
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=21.30 E-value=32 Score=27.10 Aligned_cols=21 Identities=14% Similarity=0.267 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy10620 554 SILALGVTCCLLILIYVSTIF 574 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~ 574 (589)
.+++-+|+|.++.++++++++
T Consensus 9 ~vlaavIaG~Vvgll~ailLI 29 (64)
T PF01034_consen 9 EVLAAVIAGGVVGLLFAILLI 29 (64)
T ss_dssp ---------------------
T ss_pred hHHHHHHHHHHHHHHHHHHHH
Confidence 455666666665544444433
No 70
>PF05624 LSR: Lipolysis stimulated receptor (LSR); InterPro: IPR008664 This domain consists of mammalian LISCH7 protein homologues. LISCH7 is a liver-specific BHLH-ZIP transcription factor.
Probab=21.05 E-value=67 Score=23.52 Aligned_cols=16 Identities=25% Similarity=0.426 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHH
Q psy10620 557 ALGVTCCLLILIYVST 572 (589)
Q Consensus 557 ~l~i~~~ll~~~~i~~ 572 (589)
.++|+|++|++++|.+
T Consensus 6 ~~iilg~~ll~~Ligi 21 (49)
T PF05624_consen 6 VLIILGALLLLLLIGI 21 (49)
T ss_pred eHHHHHHHHHHHHHHH
Confidence 4667777777665554
No 71
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=20.72 E-value=4.8e+02 Score=21.44 Aligned_cols=41 Identities=12% Similarity=0.185 Sum_probs=29.5
Q ss_pred CCcEEEEecC---CCCe-eeeeecCCEEEEEEEEc-CCCcccEEEe
Q psy10620 368 PPPRIRILDT---KSRE-VETVRIGDKLTFRIEIP-EETPYGIFAR 408 (589)
Q Consensus 368 p~~~m~i~~~---~g~~-v~~v~vGd~l~l~~~~~-~~~~~~i~V~ 408 (589)
-.+.+.+.+. .+.+ -..|.-|+.+.+.|.+. +++-|++.|+
T Consensus 32 ~~~~~~v~~~~y~~~~~~~~~v~ag~~~~~~w~l~~s~gwYDl~v~ 77 (89)
T PF05506_consen 32 AAVTFTVYDNAYGGGGPWTYTVAAGQTVSLTWPLAASGGWYDLTVT 77 (89)
T ss_pred CcEEEEEEeCCcCCCCCEEEEECCCCEEEEEEeecCCCCcEEEEEE
Confidence 3567777762 2333 35899999999999984 5678888776
No 72
>PF11884 DUF3404: Domain of unknown function (DUF3404); InterPro: IPR021821 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with PF02518 from PFAM, PF00512 from PFAM.
Probab=20.36 E-value=84 Score=31.97 Aligned_cols=13 Identities=23% Similarity=0.296 Sum_probs=6.9
Q ss_pred eeecCCEEEEEEE
Q psy10620 384 TVRIGDKLTFRIE 396 (589)
Q Consensus 384 ~v~vGd~l~l~~~ 396 (589)
+-.|+|+|.+...
T Consensus 80 sP~ite~l~FerA 92 (262)
T PF11884_consen 80 SPLITEPLVFERA 92 (262)
T ss_pred CcccccchHHHHH
Confidence 4455666655443
No 73
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=20.17 E-value=34 Score=34.82 Aligned_cols=28 Identities=21% Similarity=0.368 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYYIKKW 581 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~~~~~ 581 (589)
+++-|+|.++||+++++...++|.+|||
T Consensus 226 ~vf~lLVPSiILVLLaVGGLLfYr~rrR 253 (285)
T PF05337_consen 226 FVFYLLVPSIILVLLAVGGLLFYRRRRR 253 (285)
T ss_dssp ----------------------------
T ss_pred ccccccccchhhhhhhccceeeeccccc
Confidence 5677889999999888888777666665
No 74
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=20.15 E-value=50 Score=30.30 Aligned_cols=31 Identities=26% Similarity=0.217 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHhhhhhcCcc
Q psy10620 555 ILALGVTCCLLILIYVSTIF---CYYIKKWMTPR 585 (589)
Q Consensus 555 ~~~l~i~~~ll~~~~i~~~~---~~~~~~~~~~~ 585 (589)
+|+-+|.|++|++.+|++++ |..+++-++.+
T Consensus 62 AIaGIVfgiVfimgvva~i~icvCmc~kn~rgsR 95 (155)
T PF10873_consen 62 AIAGIVFGIVFIMGVVAGIAICVCMCMKNSRGSR 95 (155)
T ss_pred eeeeeehhhHHHHHHHHHHHHHHhhhhhcCCCcc
Confidence 55555777777666666543 66666655543
No 75
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=20.04 E-value=1e+02 Score=30.28 Aligned_cols=36 Identities=11% Similarity=0.206 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhh---cCccccCC
Q psy10620 554 SILALGVTCCLLILIYVSTIFCYYIKKW---MTPRKVMG 589 (589)
Q Consensus 554 ~~~~l~i~~~ll~~~~i~~~~~~~~~~~---~~~~~~~~ 589 (589)
+++-|+|+++||++-.++.+.--+++.+ ++.+|+|+
T Consensus 130 mLIClIIIAVLfLICT~LfLSTVVLANKVS~LKrskQ~g 168 (227)
T PF05399_consen 130 MLICLIIIAVLFLICTLLFLSTVVLANKVSSLKRSKQVG 168 (227)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Done!