Citrus Sinensis ID: 016344
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 391 | ||||||
| 255575726 | 373 | conserved hypothetical protein [Ricinus | 0.933 | 0.978 | 0.622 | 1e-136 | |
| 224138028 | 382 | predicted protein [Populus trichocarpa] | 0.959 | 0.981 | 0.632 | 1e-135 | |
| 356540585 | 381 | PREDICTED: uncharacterized protein LOC10 | 0.938 | 0.963 | 0.598 | 1e-124 | |
| 225427312 | 383 | PREDICTED: uncharacterized protein LOC10 | 0.961 | 0.981 | 0.591 | 1e-123 | |
| 356495558 | 377 | PREDICTED: uncharacterized protein LOC10 | 0.925 | 0.960 | 0.592 | 1e-122 | |
| 357482303 | 374 | hypothetical protein MTR_5g014010 [Medic | 0.884 | 0.925 | 0.592 | 1e-119 | |
| 449461577 | 404 | PREDICTED: uncharacterized protein LOC10 | 0.905 | 0.876 | 0.574 | 1e-111 | |
| 240256408 | 387 | uncharacterized protein [Arabidopsis tha | 0.913 | 0.922 | 0.532 | 1e-102 | |
| 297794583 | 389 | hypothetical protein ARALYDRAFT_494322 [ | 0.882 | 0.886 | 0.497 | 2e-92 | |
| 8885597 | 375 | unnamed protein product [Arabidopsis tha | 0.890 | 0.928 | 0.489 | 1e-84 |
| >gi|255575726|ref|XP_002528762.1| conserved hypothetical protein [Ricinus communis] gi|223531765|gb|EEF33584.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 242/389 (62%), Positives = 297/389 (76%), Gaps = 24/389 (6%)
Query: 1 MSSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVAL 60
MSSS+ KR KP + S SSS + +T PP +LFPSK++ +RLI V+AIASSVA
Sbjct: 1 MSSSSTNKRRKP--------NLSPSSSPTLLTGPPNNLFPSKEEFVRLIAVLAIASSVAF 52
Query: 61 TCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLC 120
TCN +A ++N ++KPFCDSN S ++ C PCP NGEC QGKLEC GYRKH +C
Sbjct: 53 TCNLIATYINPSTKPFCDSNT-----DSFSEFCVPCPENGECTQGKLECAEGYRKHRNIC 107
Query: 121 VEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDN 180
+EDGDINE A +LS WVEN LC AYAQ+LCDG G+IW ++NDIW DL+GH+LM+ F+ DN
Sbjct: 108 IEDGDINERAKKLSEWVENHLCEAYAQYLCDGIGTIWFQDNDIWYDLDGHQLMENFQPDN 167
Query: 181 PVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVP 240
Y+Y K++ ME + R LE RTNS+G KELKCP+L+AEHYKP +CR QW+S HA +I
Sbjct: 168 ATYIYAKRKAMEMIVRLLEIRTNSHGNKELKCPDLVAEHYKPFTCRFRQWISNHAFVIAS 227
Query: 241 VCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLR 300
+CSL+VG +LLL K+ RR Y + R EELYHQVCE+LEENALMSK NGEC+ WVVAS+LR
Sbjct: 228 LCSLVVGAVLLLRKLQRRWYLSARGEELYHQVCEVLEENALMSKQSNGECDSWVVASQLR 287
Query: 301 DHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQVEGSLSSSKMRKKG 360
DHLLLPKERKDPV+WK+VE+LVQEDSRVD+YPKL+KGESKVVWEWQVEGS SS ++RKK
Sbjct: 288 DHLLLPKERKDPVLWKRVEQLVQEDSRVDRYPKLVKGESKVVWEWQVEGSWSSGRIRKK- 346
Query: 361 EASKWRSAEGRDMKFGQQQSPLKAEPKAL 389
EASK +S+E SP KA+ AL
Sbjct: 347 EASKLKSSE----------SPPKAKFDAL 365
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224138028|ref|XP_002326500.1| predicted protein [Populus trichocarpa] gi|222833822|gb|EEE72299.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356540585|ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784375 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|225427312|ref|XP_002282079.1| PREDICTED: uncharacterized protein LOC100243743 [Vitis vinifera] gi|297742158|emb|CBI33945.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356495558|ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779650 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357482303|ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago truncatula] gi|355512772|gb|AES94395.1| hypothetical protein MTR_5g014010 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449461577|ref|XP_004148518.1| PREDICTED: uncharacterized protein LOC101208017 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|240256408|ref|NP_199468.5| uncharacterized protein [Arabidopsis thaliana] gi|332008015|gb|AED95398.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297794583|ref|XP_002865176.1| hypothetical protein ARALYDRAFT_494322 [Arabidopsis lyrata subsp. lyrata] gi|297311011|gb|EFH41435.1| hypothetical protein ARALYDRAFT_494322 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|8885597|dbj|BAA97527.1| unnamed protein product [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 391 | ||||||
| TAIR|locus:2142310 | 387 | AT5G46560 "AT5G46560" [Arabido | 0.841 | 0.850 | 0.531 | 4.2e-95 | |
| UNIPROTKB|H0Y9B7 | 85 | LEMD2 "LEM domain-containing p | 0.163 | 0.752 | 0.323 | 0.0002 | |
| DICTYBASE|DDB_G0293138 | 942 | DDB_G0293138 "Protein SRC1" [D | 0.140 | 0.058 | 0.321 | 0.00062 |
| TAIR|locus:2142310 AT5G46560 "AT5G46560" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 946 (338.1 bits), Expect = 4.2e-95, P = 4.2e-95
Identities = 179/337 (53%), Positives = 232/337 (68%)
Query: 31 MTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNST-SKPFCDSNLLLDSP-QS 88
M EPPQSLFPSK + L+ V+ +A +VA TCN+L+ L+S SK FCDSN +P S
Sbjct: 31 MLEPPQSLFPSKGEFFTLLKVLLVACAVAFTCNFLSKSLSSNPSKSFCDSNF---NPIDS 87
Query: 89 PTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQF 148
D CEPCP NGEC+QGKL+C GY+ LCVEDG+INE+ +L + E ++C +YA
Sbjct: 88 DLDICEPCPINGECYQGKLQCNLGYKNQRNLCVEDGEINESTKKLVGYFERKVCESYAHN 147
Query: 149 LCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMK 208
C GTG+IWV END+W +L + + LD Y + K + +E V LE RTNS G+
Sbjct: 148 ECYGTGTIWVPENDVWTELRSNSFLS--NLDESAYNFLKGKAVEGVTELLEKRTNSNGID 205
Query: 209 ELKCPELLAEHYKPLSCRIHQWVSTHALIIVPXXXXXXXXXXXXWKVHRRRYFAIRVEEL 268
ELKCPE +A+ YKPL+CR+HQW+ H LII ++ R++ F+ RVEEL
Sbjct: 206 ELKCPESVAKSYKPLTCRLHQWILRHILIISSSCAMLVGSAMLRRRIQRKQCFSRRVEEL 265
Query: 269 YHQVCEILEENALMSKSVN-GECEPWVVASRLRDHLLLPKERKDPVIWKKVEELVQEDSR 327
Y QVC+ LEENA+ S S CEPWV+AS LRD+LLLP+ER+DP++W KVEEL++EDSR
Sbjct: 266 YDQVCDFLEENAVASNSAETSNCEPWVIASWLRDYLLLPRERRDPLLWTKVEELIKEDSR 325
Query: 328 VDQYPKLLKGESKVVWEWQVEGSLSSSKMRKKGEASK 364
+D+Y KLLKGE KVVWEWQVEGSLS SK++K+ E K
Sbjct: 326 IDRYEKLLKGEKKVVWEWQVEGSLSLSKLKKQRETQK 362
|
|
| UNIPROTKB|H0Y9B7 LEMD2 "LEM domain-containing protein 2" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0293138 DDB_G0293138 "Protein SRC1" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| eugene3.00290149 | hypothetical protein (382 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 391 | |||
| pfam09402 | 326 | pfam09402, MSC, Man1-Src1p-C-terminal domain | 3e-28 |
| >gnl|CDD|220221 pfam09402, MSC, Man1-Src1p-C-terminal domain | Back alignment and domain information |
|---|
Score = 112 bits (283), Expect = 3e-28
Identities = 64/329 (19%), Positives = 113/329 (34%), Gaps = 70/329 (21%)
Query: 67 NFLNSTSKPFCDSNLLLDS-------PQSPTDSCEPCPSNGECHQG-KLECFHGYRK--- 115
+ FC S S + C PCP N C+ G KL+C G++
Sbjct: 14 YREQKIAVGFCGSGAPTYSLALTPSFLERLKPQCIPCPPNAICYPGLKLKCEPGFKLKPH 73
Query: 116 -------HGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC-DGTGSIWVEENDIWNDL 167
C+ D + E L L + A+ C + S + EN+++
Sbjct: 74 PLSLLGLIPPKCIPDTEKEELVSELVDKALEELRKRNAKHECGEDDLSPGISENELYE-- 131
Query: 168 EGHELMKIFELDNPVYLYTKKRTMETVGRY-----------LESRTNSYGMKELKCPELL 216
L KK + E+ + +
Sbjct: 132 ---------------LLSEKKSPWINEEEFEDLWKAAIKELKENPEVVIRPEPILSKNGD 176
Query: 217 AEH------------YKPLSCRI----HQWVSTHALIIVPVCSLLVGCLLL--LWKVHRR 258
Y PLSCR ++ + L+++ + LL+ + K R
Sbjct: 177 GSTETQKTLRSSSLAYLPLSCRFRRFVRLFLKRYRLLLLGLIILLLVVFYIRYRIKKKRE 236
Query: 259 RYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKE--RKDPVIWK 316
+VEEL ++ + L++ S + EP++ +LRD LL + ++ +W+
Sbjct: 237 E--KAQVEELVKKIIDKLKQQKEASDE-DTSEEPYLPIPQLRDDLLRDEHRLKRRNRLWE 293
Query: 317 KVEELVQEDSRVDQYPKLLKGESKVVWEW 345
KV ++++S V + + GE VWEW
Sbjct: 294 KVVSKLEQNSNVRTRLREIHGEIMRVWEW 322
|
MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C terminal nucleoplasmic region forms a DNA binding winged helix and binds to Smad. This C-terminal tail is also found in S. cerevisiae and is thought to consist of three conserved helices followed by two downstream strands. Length = 326 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 391 | |||
| PF09402 | 334 | MSC: Man1-Src1p-C-terminal domain; InterPro: IPR01 | 100.0 | |
| PF12946 | 37 | EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 | 95.51 |
| >PF09402 MSC: Man1-Src1p-C-terminal domain; InterPro: IPR018996 This entry represents the Inner nuclear membrane proteins MAN1 (also known as LEM domain-containing protein 3) and LEM domain-containing protein 2 (or LEM protein 2) | Back alignment and domain information |
|---|
Probab=100.00 E-value=5e-62 Score=484.04 Aligned_cols=275 Identities=28% Similarity=0.421 Sum_probs=71.2
Q ss_pred CCCCCCCCCCCCCCC------------CCCCCCCCccCCCCceecCC-eeeeCCCceec-----------CCCcccChhh
Q 016344 71 STSKPFCDSNLLLDS------------PQSPTDSCEPCPSNGECHQG-KLECFHGYRKH-----------GKLCVEDGDI 126 (391)
Q Consensus 71 ~~~~~fCdS~~~~~~------------~~~~~p~C~PCP~hAiC~~g-~l~C~~gYvl~-----------~~~CV~D~~k 126 (391)
+...+|||++.+..+ ...++|+|+|||+||+|++| ++.|++||++. +++|++|+++
T Consensus 18 ~~~vgyC~~~~~~~~~~~~~~~~~~~~~~~~~P~C~pCP~~a~C~~~~~~~C~~~y~~~~~~l~~~g~~p~~~Ci~D~~k 97 (334)
T PF09402_consen 18 KIAVGYCGTESPSPSFADDDISVPDWLLENFKPSCEPCPEHAICYPGLKLECEPGYVLKPSPLSLFGLIPPPKCIPDTEK 97 (334)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHH
Confidence 468999999972111 14678999999999999999 99999999998 9999999999
Q ss_pred hHHHHHHHHHHHHHHHHHhhcccccC---CCCcccchhhHHHHhhhhhhhhccCCChHHHHHHHHHHHHHHHhhcccccc
Q 016344 127 NETAGRLSRWVENRLCRAYAQFLCDG---TGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTN 203 (391)
Q Consensus 127 ~~~i~~l~~~i~~~Lr~r~A~~~CG~---~~s~~i~e~dL~~~~~~~~~~k~~~ls~~~fe~l~~~al~~l~~~l~~~~~ 203 (391)
++.+.+|++++.++||++||+++||. ..+.+|+++||++++.+ ++++++++++|+++|..|+..+.+.-+..+.
T Consensus 98 ~~~i~~l~~~~~~~Lr~~~a~~~Cg~~~~~~~~~ls~~el~~~~~~---~~~~~~~~~efe~l~~~a~~~L~~~~ei~~~ 174 (334)
T PF09402_consen 98 EEKIEELAKKILDELRERNAQYECGDSEDDESPGLSEEELKDILSS---KKSPWISDEEFEELWSAALQELKKNPEIIIR 174 (334)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCcHHHHHHHHHh---ccCccccHHHHHHHHHHHHHHHHhCCcEEEe
Confidence 99999999999999999999999993 34678999999999998 7789999999999999999888744332222
Q ss_pred ---------cCCceeeccccccccCccCccchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 016344 204 ---------SYGMKELKCPELLAEHYKPLSCRIHQW----VSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYH 270 (391)
Q Consensus 204 ---------sn~~~~~k~~~s~S~~~i~l~Cr~r~~----i~~~~~~i~~~l~~~v~i~~l~~~~~r~~~e~~~v~~Lv~ 270 (391)
.+.........+++++++||+|++++. +.+++..++++++++++++++++++++++.++++|++||+
T Consensus 175 ~~~~~~~~~~~~~~~~~~~~s~s~~~lpl~C~~~~~i~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~v~~lv~ 254 (334)
T PF09402_consen 175 DDIINSHSSDDSNEKDKYFRSSSLPYLPLKCRLRRQIRQFISRYRLIILGVLILLLLIKYIRYRYRKRREEKARVEELVK 254 (334)
T ss_dssp -----------------------------------------------------------------STHHHHHTTTTTTHH
T ss_pred cccccccccccccCCcEEEEeeCCCccccEEEEehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 001111122234689999999976555 4556666777766777777777778888899999999999
Q ss_pred HHHHHHHHHHhhhhccCCCCCCcccccccccccCCCCC--cCchhhHHHHHHHHhcCCCcceeeeEEcCceeeeeEEeec
Q 016344 271 QVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKE--RKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQVE 348 (391)
Q Consensus 271 ~vl~~L~~q~~~~~~~~~~~~pyl~~~qLRD~LL~~~~--r~r~~LW~kV~k~Ve~nSnVrt~~~ev~GE~~~vWEWig~ 348 (391)
+|+++|++|+..+ ..+..++|||+++||||+||.+.+ +++++||++|+++||+|||||++++|+|||+|+||||||+
T Consensus 255 ~ii~~L~~~~~~~-~~~~~~~p~v~~~qLRD~ll~~~~~~~~~~~lW~~v~~~ve~ns~Vr~~~~e~~Ge~~~vWeWig~ 333 (334)
T PF09402_consen 255 KIIDRLQDQARAS-DPNSSPEPYVSISQLRDDLLPPEHRLKRRNRLWKKVVKKVEENSNVRTEVREVHGEIMRVWEWIGP 333 (334)
T ss_dssp HHHHHHHHHHHHH-TTSS-S-S-B-HHHHHHTT--STTGGG-GHHHHHHHHHHHTT---SEEEEEEETTEEEEEEE----
T ss_pred HHHHHHHHHhhhh-ccCCCCCCCccHHHHHHHhCCcccCHHHHHHHHHHHHHHHHcCCCeeEEEEEECCeEEEEEEecCC
Confidence 9999999998843 344678999999999999998765 3379999999999999999999999999999999999997
Q ss_pred C
Q 016344 349 G 349 (391)
Q Consensus 349 ~ 349 (391)
+
T Consensus 334 ~ 334 (334)
T PF09402_consen 334 N 334 (334)
T ss_dssp -
T ss_pred C
Confidence 5
|
Emerin and MAN1 are LEM domain-containing integral membrane proteins of the vertebrate nuclear envelope []. MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C-terminal nulceoplasmic region forms a DNA binding winged helix and binds to Smad []. LEM protein 2 is an essential protein involved in chromosome segregation and cell division, probably via its interaction with lmn-1, the main component of nuclear lamina. Has some overlapping function with emr-1.; GO: 0005639 integral to nuclear inner membrane; PDB: 2CH0_A. |
| >PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 391 | |||
| 2ch0_A | 133 | Inner nuclear membrane protein MAN1; winged helix | 4e-16 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 2e-04 |
| >2ch0_A Inner nuclear membrane protein MAN1; winged helix motif, DNA, nuclear protein; NMR {Homo sapiens} Length = 133 | Back alignment and structure |
|---|
Score = 73.7 bits (180), Expect = 4e-16
Identities = 19/116 (16%), Positives = 49/116 (42%), Gaps = 5/116 (4%)
Query: 254 KVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPV 313
+ + ++ ++ ++ ++L + + N + +P++ +RD L+ P +RK
Sbjct: 6 RWTKEEEETRQMYDMVVKIIDVLRSHNEACQE-NKDLQPYMPIPHVRDSLIQPHDRKKMK 64
Query: 314 -IWKKVEELVQE-DSRVDQYPKLLKGESKVVWEWQVEGSLSSSKMRKKGEASKWRS 367
+W + + + +SRV + + G +VW W +S + W+
Sbjct: 65 KVWDRAVDFLAANESRVRTETRRIGGADFLVWRWI--QPSASCDKILVIPSKVWQG 118
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 391 | |||
| 2ch0_A | 133 | Inner nuclear membrane protein MAN1; winged helix | 99.88 | |
| 1b9w_A | 95 | Protein (merozoite surface protein 1); MSP-1, cand | 93.84 | |
| 1ob1_C | 99 | Major merozoite surface protein; immune system, im | 91.55 | |
| 1n1i_A | 105 | Merozoite surface protein-1; MSP1, malaria, surfac | 90.64 |
| >2ch0_A Inner nuclear membrane protein MAN1; winged helix motif, DNA, nuclear protein; NMR {Homo sapiens} | Back alignment and structure |
|---|
Probab=99.88 E-value=2e-23 Score=179.76 Aligned_cols=95 Identities=16% Similarity=0.413 Sum_probs=82.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCcccccccccccCCCCCc-CchhhHHHHHHHHhcCCC-cceeee
Q 016344 256 HRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKER-KDPVIWKKVEELVQEDSR-VDQYPK 333 (391)
Q Consensus 256 ~r~~~e~~~v~~Lv~~vl~~L~~q~~~~~~~~~~~~pyl~~~qLRD~LL~~~~r-~r~~LW~kV~k~Ve~nSn-Vrt~~~ 333 (391)
++.+.++++|.+||++|+++|++|+..| .++++.+|||++.||||.||.++.+ ++.++|++|+++||+|+| ||++++
T Consensus 8 ~k~~ee~~~v~~LV~~Iid~L~~q~~~~-~e~~~~~p~l~i~hLRD~ll~~~~r~k~~~lW~kAvk~lE~NesRVRte~~ 86 (133)
T 2ch0_A 8 TKEEEETRQMYDMVVKIIDVLRSHNEAC-QENKDLQPYMPIPHVRDSLIQPHDRKKMKKVWDRAVDFLAANESRVRTETR 86 (133)
T ss_dssp STHHHHHTTTTTTHHHHHHHHHHHHHHH-TTSSCSCSCBCHHHHHHTTCCSTTGGGGHHHHHHHHHHHTTTCCCSEEEEE
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCCceehhhhHhhhcCccchHHHHHHHHHHHHHHHcccchhhhcce
Confidence 3456777889999999999999998876 3445678999999999999988765 559999999999998755 999999
Q ss_pred EEcCceeeeeEEeecCCC
Q 016344 334 LLKGESKVVWEWQVEGSL 351 (391)
Q Consensus 334 ev~GE~~~vWEWig~~~~ 351 (391)
+++||+|+||+|||+.+.
T Consensus 87 ~i~GE~~rvW~Wig~~~~ 104 (133)
T 2ch0_A 87 RIGGADFLVWRWIQPSAS 104 (133)
T ss_dssp EETTEEEEEEECCCCSSS
T ss_pred eecCeEEEEEEecCCCCc
Confidence 999999999999998764
|
| >1b9w_A Protein (merozoite surface protein 1); MSP-1, candidate malaria vaccine, surface antigen; 1.80A {Plasmodium cynomolgi} SCOP: g.3.11.4 g.3.11.4 PDB: 2npr_A | Back alignment and structure |
|---|
| >1ob1_C Major merozoite surface protein; immune system, immunoglobulin/complex, immunoglobulin, antib fragment, MSP1-19, EGF-like domain; 2.90A {Plasmodium falciparum} SCOP: g.3.11.4 g.3.11.4 PDB: 1cej_A 2flg_A | Back alignment and structure |
|---|
| >1n1i_A Merozoite surface protein-1; MSP1, malaria, surface antigen, glycoprotein, EGF domain, cell adhesion; HET: HIS; 2.40A {Plasmodium knowlesi strain H} SCOP: g.3.11.4 g.3.11.4 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 391 | |||
| d1ob1c1 | 45 | Merozoite surface protein 1 (MSP-1) {Malaria paras | 93.91 | |
| d1b9wa1 | 45 | Merozoite surface protein 1 (MSP-1) {Malaria paras | 92.87 | |
| d1gl4a2 | 40 | EGF-like domain of nidogen-1 {Mouse (Mus musculus) | 91.92 | |
| d2vj3a1 | 42 | Neurogenic locus notch homolog protein 1, Notch1 { | 80.05 |
| >d1ob1c1 g.3.11.4 (C:1-45) Merozoite surface protein 1 (MSP-1) {Malaria parasite (Plasmodium falciparum) [TaxId: 5833]} | Back information, alignment and structure |
|---|
class: Small proteins fold: Knottins (small inhibitors, toxins, lectins) superfamily: EGF/Laminin family: Merozoite surface protein 1 (MSP-1) domain: Merozoite surface protein 1 (MSP-1) species: Malaria parasite (Plasmodium falciparum) [TaxId: 5833]
Probab=93.91 E-value=0.015 Score=38.31 Aligned_cols=30 Identities=33% Similarity=0.926 Sum_probs=26.5
Q ss_pred cCCCCceecCC-----eeeeCCCceecCCCcccCh
Q 016344 95 PCPSNGECHQG-----KLECFHGYRKHGKLCVEDG 124 (391)
Q Consensus 95 PCP~hAiC~~g-----~l~C~~gYvl~~~~CV~D~ 124 (391)
-||+||-||.+ +-+|--||...+++||+.+
T Consensus 11 ~~P~NagCfRyldg~EEwrCLL~fKk~~~kCv~~p 45 (45)
T d1ob1c1 11 QCPQNSGCFRHLDEREECKCLLNYKQEGDKCVENP 45 (45)
T ss_dssp CCCTTEEEECCTTSCCEEEECTTEEEETTEEEECS
T ss_pred CCCcCcceeEecCCCeeeeeeeeecccCCCcccCC
Confidence 69999999976 3599999999999999864
|
| >d1b9wa1 g.3.11.4 (A:1-45) Merozoite surface protein 1 (MSP-1) {Malaria parasite (Plasmodium cynomolgi) [TaxId: 5827]} | Back information, alignment and structure |
|---|
| >d1gl4a2 g.3.11.5 (A:359-398) EGF-like domain of nidogen-1 {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
| >d2vj3a1 g.3.11.1 (A:411-452) Neurogenic locus notch homolog protein 1, Notch1 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|