Psyllid ID: psy10314


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230---
TCSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER
cccccEEEEEEccccEEEEEEccccEEcccccccEEccccEEEEEEEEcccccccEEEEEEEEccccccEEEEEEEEEEEcccEEEEEEEccccEEEccccccEEEEEEEcccEEEEEEccccEEEEEccccEEEEEEccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
ccccccEEEcccccccEEEEEccccEEEEcccEEEEccccEEEEEEcccccccccEEEEEEEccccccccEEEEEEEEEEccEEEEEEEEEEccEEEEcccccccEEEEEccEEEEEEEcccEEEEEEccccEEEEEEcHHHcccccccccccccccccccccccccEEcccHHHHHHHccccccccccccccccccccHcccHHHHHHHHHHcccHHHHHHccccccHHHcc
tcsqghwkcskeecpgictawgdshfqtfdaknfdfqgACEYVLVKGANEEEEEAFVVVIEVvpcgstgvscsksvkistgskknaesitltrnkplpnsgeyhklsvreAGLFVFVELHDIgvtvqwdkgtrvsvtlspkwknkvkglcgnyndnelddfqspsggisevspvvfgdswrvhtycppsiqiqdtcskhpnrklWAVKQCNLlkselfqpchsevpvdsffer
tcsqghwkcskeeCPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVpcgstgvscsksvkistgskknaesitltrnkplpnsgEYHKLSVREAGLFVFVELHDIGvtvqwdkgtrvsvtlspkwknkvKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSelfqpchsevpvdsffer
TCSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANeeeeeafvvvievvPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER
******WKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCS*****************************YHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDF*****GISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSE*********
TCSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTY***************NRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER
**********KEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGV*****************SITLTRNKPLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER
*C*QGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
TCSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query233 2.2.26 [Sep-21-2011]
P98092 3133 Hemocytin OS=Bombyx mori N/A N/A 0.991 0.073 0.437 1e-58
P98088 5030 Mucin-5AC (Fragments) OS= no N/A 0.952 0.044 0.359 9e-42
Q2PC93 5255 SCO-spondin OS=Gallus gal no N/A 0.961 0.042 0.384 9e-41
Q62635 1513 Mucin-2 (Fragment) OS=Rat no N/A 0.982 0.151 0.364 1e-40
Q80Z19 2680 Mucin-2 (Fragments) OS=Mu no N/A 0.974 0.084 0.370 4e-40
Q9HC84 5762 Mucin-5B OS=Homo sapiens no N/A 0.974 0.039 0.357 2e-39
Q02817 5179 Mucin-2 OS=Homo sapiens G no N/A 0.965 0.043 0.365 4e-39
Q7Z5P9 6254 Mucin-19 OS=Homo sapiens no N/A 0.974 0.036 0.313 1e-34
Q6PZE0 7524 Mucin-19 OS=Mus musculus no N/A 0.987 0.030 0.320 2e-34
A2VEC9 5147 SCO-spondin OS=Homo sapie no N/A 0.948 0.042 0.333 4e-34
>sp|P98092|HMCT_BOMMO Hemocytin OS=Bombyx mori PE=2 SV=1 Back     alignment and function desciption
 Score =  226 bits (575), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 102/233 (43%), Positives = 149/233 (63%), Gaps = 2/233 (0%)

Query: 2   CSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIE 61
           C  G+W C++ +C G+C AWGDSH  TFD   +DF+G C Y+L KGA +  +  F V I+
Sbjct: 233 CKNGNWSCTQRKCAGVCGAWGDSHVNTFDGTQYDFEGVCTYLLAKGAMDGTD-GFDVEIQ 291

Query: 62  VVPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHD 121
            VPCG+TG +CSKSV +  G   N E ++LT+N P+P+  +  ++ +R+AG +VF+++  
Sbjct: 292 NVPCGTTGATCSKSVTLKVGGAGNEEIVSLTKNAPIPDISKLKRIKMRKAGAYVFLDVPS 351

Query: 122 IGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPS-GGISEVSPVVFGDSW 180
           +G+++QWD+G RV V +   W+ +VKGLCGNYN +  DDFQ+PS GG+SE S ++F DSW
Sbjct: 352 LGMSLQWDRGLRVYVKIDTMWQGRVKGLCGNYNGDMRDDFQTPSGGGMSESSALIFADSW 411

Query: 181 RVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER 233
           ++   CP    + D C + P RK WA   C  LK   F  C  EV   ++  R
Sbjct: 412 KLKPTCPKPQPVIDHCKQRPERKEWAQSVCGALKRYPFSLCAGEVGAGAYVAR 464




Adhesive protein and relates to hemostasis or encapsulation of foreign substances for self-defense.
Bombyx mori (taxid: 7091)
>sp|P98088|MUC5A_HUMAN Mucin-5AC (Fragments) OS=Homo sapiens GN=MUC5AC PE=1 SV=3 Back     alignment and function description
>sp|Q2PC93|SSPO_CHICK SCO-spondin OS=Gallus gallus GN=SSPO PE=2 SV=1 Back     alignment and function description
>sp|Q62635|MUC2_RAT Mucin-2 (Fragment) OS=Rattus norvegicus GN=Muc2 PE=1 SV=1 Back     alignment and function description
>sp|Q80Z19|MUC2_MOUSE Mucin-2 (Fragments) OS=Mus musculus GN=Muc2 PE=1 SV=2 Back     alignment and function description
>sp|Q9HC84|MUC5B_HUMAN Mucin-5B OS=Homo sapiens GN=MUC5B PE=1 SV=3 Back     alignment and function description
>sp|Q02817|MUC2_HUMAN Mucin-2 OS=Homo sapiens GN=MUC2 PE=1 SV=2 Back     alignment and function description
>sp|Q7Z5P9|MUC19_HUMAN Mucin-19 OS=Homo sapiens GN=MUC19 PE=1 SV=2 Back     alignment and function description
>sp|Q6PZE0|MUC19_MOUSE Mucin-19 OS=Mus musculus GN=Muc19 PE=2 SV=2 Back     alignment and function description
>sp|A2VEC9|SSPO_HUMAN SCO-spondin OS=Homo sapiens GN=SSPO PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query233
270013885 3944 hypothetical protein TcasGA2_TC012551 [T 0.987 0.058 0.551 5e-76
91090548 3927 PREDICTED: similar to Hemolectin CG7002- 0.987 0.058 0.551 6e-76
340727078 3940 PREDICTED: hemocytin-like [Bombus terres 0.982 0.058 0.554 2e-74
383851337 3801 PREDICTED: hemocytin-like [Megachile rot 0.982 0.060 0.549 7e-74
350402135 3825 PREDICTED: hemocytin-like [Bombus impati 0.974 0.059 0.558 8e-74
328784801 3764 PREDICTED: hemocytin [Apis mellifera] 0.978 0.060 0.556 8e-74
332029742 3753 Hemocytin [Acromyrmex echinatior] 0.987 0.061 0.538 7e-73
242019815 4029 hemolectin, putative [Pediculus humanus 0.995 0.057 0.529 4e-72
380020728 3859 PREDICTED: hemocytin-like [Apis florea] 0.978 0.059 0.539 6e-72
328713073 3821 PREDICTED: hemocytin-like [Acyrthosiphon 0.978 0.059 0.517 3e-70
>gi|270013885|gb|EFA10333.1| hypothetical protein TcasGA2_TC012551 [Tribolium castaneum] Back     alignment and taxonomy information
 Score =  289 bits (740), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 128/232 (55%), Positives = 172/232 (74%), Gaps = 2/232 (0%)

Query: 2    CSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIE 61
            C  G WKC+   C G C+AWGDSHF+TFD K++DFQG C+YVL KG    +  ++ + I+
Sbjct: 1240 CQNGKWKCTDRPCSGECSAWGDSHFKTFDGKHYDFQGQCDYVLSKGVIGSD--SYEISIQ 1297

Query: 62   VVPCGSTGVSCSKSVKISTGSKKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELHD 121
             VPCGS G SCSKSV I   S ++ + +TLTR+KPLP+      ++V + GLFV  E  D
Sbjct: 1298 NVPCGSLGTSCSKSVTIRVKSGEDQDVLTLTRDKPLPDLAPLKHITVHQKGLFVITEAPD 1357

Query: 122  IGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWR 181
            +G+T++WDKGTRV V + P+WK +VKGLCGNYNDN+ DDFQ+PSGG++E S  +FGDSW+
Sbjct: 1358 LGLTIKWDKGTRVYVKIDPRWKERVKGLCGNYNDNDADDFQTPSGGLAEASAKIFGDSWK 1417

Query: 182  VHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPVDSFFER 233
            + +YCP +++I +TC   P+RK+WA+KQC +LKS LF PCHSEVPVD F E+
Sbjct: 1418 LQSYCPEALEITNTCEDRPDRKVWALKQCGVLKSPLFAPCHSEVPVDIFLEK 1469




Source: Tribolium castaneum

Species: Tribolium castaneum

Genus: Tribolium

Family: Tenebrionidae

Order: Coleoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|91090548|ref|XP_971239.1| PREDICTED: similar to Hemolectin CG7002-PA [Tribolium castaneum] Back     alignment and taxonomy information
>gi|340727078|ref|XP_003401878.1| PREDICTED: hemocytin-like [Bombus terrestris] Back     alignment and taxonomy information
>gi|383851337|ref|XP_003701190.1| PREDICTED: hemocytin-like [Megachile rotundata] Back     alignment and taxonomy information
>gi|350402135|ref|XP_003486378.1| PREDICTED: hemocytin-like [Bombus impatiens] Back     alignment and taxonomy information
>gi|328784801|ref|XP_395067.4| PREDICTED: hemocytin [Apis mellifera] Back     alignment and taxonomy information
>gi|332029742|gb|EGI69611.1| Hemocytin [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|242019815|ref|XP_002430354.1| hemolectin, putative [Pediculus humanus corporis] gi|212515478|gb|EEB17616.1| hemolectin, putative [Pediculus humanus corporis] Back     alignment and taxonomy information
>gi|380020728|ref|XP_003694231.1| PREDICTED: hemocytin-like [Apis florea] Back     alignment and taxonomy information
>gi|328713073|ref|XP_001952865.2| PREDICTED: hemocytin-like [Acyrthosiphon pisum] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query233
FB|FBgn0029167 3843 Hml "Hemolectin" [Drosophila m 0.987 0.059 0.447 2.7e-53
UNIPROTKB|Q2PC93 5255 SSPO "SCO-spondin" [Gallus gal 0.957 0.042 0.383 1.7e-41
UNIPROTKB|P98088 5030 MUC5AC "Mucin-5AC" [Homo sapie 0.965 0.044 0.338 1.8e-40
RGD|62001 1851 Muc5ac "mucin 5AC, oligomeric 0.969 0.122 0.365 2.9e-39
RGD|3123 1513 Muc2 "mucin 2, oligomeric mucu 0.982 0.151 0.351 3.4e-39
ZFIN|ZDB-GENE-051114-1 4990 scospondin "subcommissural org 0.965 0.045 0.365 8.4e-39
MGI|MGI:1339364 2680 Muc2 "mucin 2" [Mus musculus ( 0.974 0.084 0.357 1.3e-38
UNIPROTKB|F1PHH5 4996 MUC5B "Uncharacterized protein 0.974 0.045 0.358 9e-38
UNIPROTKB|J9JHJ8 5112 MUC5B "Uncharacterized protein 0.974 0.044 0.358 9.2e-38
UNIPROTKB|G5E5A7 5944 G5E5A7 "Uncharacterized protei 0.527 0.020 0.384 1.9e-37
FB|FBgn0029167 Hml "Hemolectin" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 559 (201.8 bits), Expect = 2.7e-53, Sum P(2) = 2.7e-53
 Identities = 110/246 (44%), Positives = 153/246 (62%)

Query:     2 CSQGHWKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANXXXXXXXXXXXX 61
             C  G+WKCSK  C   C+ WGDSHF TFD  +FDFQGAC+YVL KG              
Sbjct:  1322 CRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACDYVLAKGVFDNGDGFSITIQN 1381

Query:    62 XXPCGSTGVSCSKSVKIS-TGSKK-----NAESITLTR-NK-PLP------NSGEYHKLS 107
                CG+ GV+CSKS++I+ TG  +     +A+S   T  NK P+       NS  ++   
Sbjct:  1382 VL-CGTMGVTCSKSLEIALTGHAEESLLLSADSAYSTDPNKTPIKKLRDSVNSKGHNAFH 1440

Query:   108 VREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSGG 167
             + +AG+FV VE+  + + V+WD+GTRV V L  +W+ KV GLCGNYN N LDD Q+PS G
Sbjct:  1441 IYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQTPSMG 1500

Query:   168 ISEVSPVVFGDSWRVHTYCPPSIQIQDTCSKHPNRKLWAVKQCNLLKSELFQPCHSEVPV 227
             + E SP++FG +W++  +C   +   D C KHP R+ WA  +C  LKS+LF+ CH+EVP+
Sbjct:  1501 L-ETSPMLFGHAWKLQPHCSAPVAPIDACKKHPERETWAQLKCGALKSDLFKECHAEVPL 1559

Query:   228 DSFFER 233
             + F++R
Sbjct:  1560 ERFWKR 1565


GO:0030246 "carbohydrate binding" evidence=ISS
GO:0042803 "protein homodimerization activity" evidence=IDA
GO:0007599 "hemostasis" evidence=IEP;IMP
GO:0005576 "extracellular region" evidence=IDA
GO:0035006 "melanization defense response" evidence=IMP
GO:0042060 "wound healing" evidence=IMP
GO:0006030 "chitin metabolic process" evidence=IEA
GO:0008061 "chitin binding" evidence=IEA
GO:0007155 "cell adhesion" evidence=IEA
GO:0042381 "hemolymph coagulation" evidence=IMP
UNIPROTKB|Q2PC93 SSPO "SCO-spondin" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|P98088 MUC5AC "Mucin-5AC" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
RGD|62001 Muc5ac "mucin 5AC, oligomeric mucus/gel-forming" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
RGD|3123 Muc2 "mucin 2, oligomeric mucus/gel-forming" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-051114-1 scospondin "subcommissural organ spondin" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
MGI|MGI:1339364 Muc2 "mucin 2" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|F1PHH5 MUC5B "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|J9JHJ8 MUC5B "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|G5E5A7 G5E5A7 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query233
smart00216163 smart00216, VWD, von Willebrand factor (vWF) type 4e-40
pfam00094159 pfam00094, VWD, von Willebrand factor type D domai 1e-34
pfam0874274 pfam08742, C8, C8 domain 6e-05
>gnl|CDD|214566 smart00216, VWD, von Willebrand factor (vWF) type D domain Back     alignment and domain information
 Score =  134 bits (340), Expect = 4e-40
 Identities = 55/166 (33%), Positives = 82/166 (49%), Gaps = 9/166 (5%)

Query: 7   WKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCG 66
           W C++EEC   C+  GD H+ TFD   + F G C YVL +      E  F V+++ VPCG
Sbjct: 1   WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDC--SSEPTFSVLLKNVPCG 58

Query: 67  STGVSCSKSVKISTGS------KKNAESITLTRNKPLPNSGEYHKLSVREAGLFVFVELH 120
             G +C KSVK+            N +     +   LP       + +R +G ++ V   
Sbjct: 59  G-GATCLKSVKVELNGDEIELKDDNGKVTVNGQQVSLPYKTSDGSIQIRSSGGYLVVITS 117

Query: 121 DIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSPSG 166
              + V +D  T +SV L  K++ K  GLCGN++    DDF++P G
Sbjct: 118 LGLIQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163


Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. Length = 163

>gnl|CDD|215713 pfam00094, VWD, von Willebrand factor type D domain Back     alignment and domain information
>gnl|CDD|219998 pfam08742, C8, C8 domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 233
smart00216162 VWD von Willebrand factor (vWF) type D domain. Von 100.0
PF00094159 VWD: von Willebrand factor type D domain; InterPro 99.96
smart0083276 C8 C8 domain. This domain contains 8 conserved cys 97.99
KOG4338|consensus 1680 97.95
PF0874274 C8: C8 domain; InterPro: IPR014853 The proteins in 97.48
PF06535161 RGM_N: Repulsive guidance molecule (RGM) N-terminu 97.3
PF07481171 DUF1521: Domain of Unknown Function (DUF1521); Int 92.45
>smart00216 VWD von Willebrand factor (vWF) type D domain Back     alignment and domain information
Probab=100.00  E-value=5.5e-37  Score=237.76  Aligned_cols=152  Identities=36%  Similarity=0.776  Sum_probs=126.9

Q ss_pred             eeeecCCCCeEEEEECCCeeeccCCCeeeeecCceEEEEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCCCccc
Q psy10314          7 WKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNA   86 (233)
Q Consensus         7 w~C~~~~c~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~   86 (233)
                      |.|+..+|+++|.++|++||+||||+.|+|+|.|.|+|+++|..  .++|+|++++.+++.. .++.++|+|.+++    
T Consensus         1 ~~c~~~~~~~~C~v~g~~~~~TFDg~~y~~~g~C~yvL~~~~~~--~~~f~V~~~~~~~~~~-~~~~~~v~v~~~~----   73 (162)
T smart00216        1 WCCTQGECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSS--EPTFSVLLKNVPCGGG-ATCLKSVKVELNG----   73 (162)
T ss_pred             CccccCCCCCEEEEcCCCCeECcCCCEEeecCceEEEEEEECCC--CCCEEEEEEecCCCCC-ceEEEEEEEEECC----
Confidence            88999999999999999999999999999999999999999864  4699999999887653 6789999999984    


Q ss_pred             eEEEEeC--------Cc--ccCCCCCcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCCCCcccceeecCCC
Q psy10314         87 ESITLTR--------NK--PLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDN  156 (233)
Q Consensus        87 ~~i~~~~--------~~--~lp~~~~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~g~~~GLCGn~ng~  156 (233)
                      ..|.+..        |.  .+|.......+.+++.+.++++. +..||+|.|||...++|.+|+.|+|+||||||||||+
T Consensus        74 ~~i~~~~~~~~v~vng~~v~~p~~~~~~~v~~~~~~~~~v~~-~~~gl~v~~dg~~~~~V~~~~~~~g~~~GLCGn~ng~  152 (162)
T smart00216       74 DEIELKDDNGTVTVNGQQVSLPYKTSDGSIQIRSSGGYVVVI-TSLGLQVTFDGLTLLSVQLPSRYRGKTCGLCGNFDGE  152 (162)
T ss_pred             EEEEEEeCCCEEEECCEEeeCCcCcCCceEEEEECceEEEEE-CCCCEEEEECCCcEEEEEECHHHCCCeeEccCCCCCC
Confidence            3444432        22  34554333346666666555554 6899999999887799999999999999999999999


Q ss_pred             CCCCccCCCC
Q psy10314        157 ELDDFQSPSG  166 (233)
Q Consensus       157 ~~dD~~~p~g  166 (233)
                      +.|||++|+|
T Consensus       153 ~~dDf~~p~g  162 (162)
T smart00216      153 PEDDFRTPDG  162 (162)
T ss_pred             ccccccCCCC
Confidence            9999999987



Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.

>PF00094 VWD: von Willebrand factor type D domain; InterPro: IPR001846 A family of growth regulators (originally called cef10, connective tissue growth factor, fisp-12, cyr61, or, alternatively, beta IG-M1 and beta IG-M2), all belong to immediate-early genes expressed after induction by growth factors or certain oncogenes Back     alignment and domain information
>smart00832 C8 C8 domain Back     alignment and domain information
>KOG4338|consensus Back     alignment and domain information
>PF08742 C8: C8 domain; InterPro: IPR014853 The proteins in this entry contained a domain rich in positionally conserved cysteine residues Back     alignment and domain information
>PF06535 RGM_N: Repulsive guidance molecule (RGM) N-terminus; InterPro: IPR010536 This entry represents the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken) Back     alignment and domain information
>PF07481 DUF1521: Domain of Unknown Function (DUF1521); InterPro: IPR011086 This domain of unknown function is found in a limited set of Bradyrhizobium proteins Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query233
1lsh_B319 Lipovitellin (LV-2); vitellogenin, lipoprotein, pl 97.83
1wo9_A35 HI-1, trypsin inhibitor; hydrolase inhibitor; NMR 94.49
2xtt_A36 Protease inhibitor SGPI-1; hydrolase, catalytic me 93.4
>1lsh_B Lipovitellin (LV-2); vitellogenin, lipoprotein, plasma apolipoprote apolipoprotein B, APOB; HET: PLD UPL; 1.90A {Ichthyomyzon unicuspis} SCOP: f.7.1.1 Back     alignment and structure
Probab=97.83  E-value=2.4e-06  Score=71.54  Aligned_cols=60  Identities=18%  Similarity=0.380  Sum_probs=0.0

Q ss_pred             CCCeEEEEECCCeeeccCCCeeee--ecCceEEEEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeC
Q psy10314         13 ECPGICTAWGDSHFQTFDAKNFDF--QGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTG   81 (233)
Q Consensus        13 ~c~~~C~~~G~~h~~TFDg~~y~f--~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~   81 (233)
                      .-.++|.+..+ .++|||++.|..  ++.|.+||++||..  ...|.|++++....      .+.|.|.++
T Consensus       256 ~~~a~C~v~~~-~vtTFdNk~Y~~~mp~sCyhVLaqDc~~--e~kF~VL~Kk~~~n------qkei~Vk~~  317 (319)
T 1lsh_B          256 PVASACELNEQ-SLTTFNGAFFNYDMPESCYHVLAQECSS--RPPFIVLIKLDSER------RISLELQLD  317 (319)
T ss_dssp             -----------------------------------------------------------------------
T ss_pred             ccCceEEEECC-EEEccCCceecCCCCCceeEEEEeeCCC--CceEEEEEeeCCCC------ceEEEEEec
Confidence            34689999987 799999999974  59999999999964  77999999885422      256666664



>1wo9_A HI-1, trypsin inhibitor; hydrolase inhibitor; NMR {Synthetic} SCOP: g.4.1.1 Back     alignment and structure
>2xtt_A Protease inhibitor SGPI-1; hydrolase, catalytic mechanism, inhibition, in vitro evoluti; 0.93A {Schistocerca gregaria} PDB: 1kj0_A 2f91_B 2vu8_I 1kio_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00