Psyllid ID: psy12980


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------61
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPPIRPAPTYRLPPPYSPP
cccHHHHHHHHHHHHHcccccccEEEEEEEEcccccEEEEEEccccccccccHHHHHHHHHHHccccccccHHHHHHHHcccEEEEEEEEEEcccccccccccccccccccccccccccccccEEEccccEEEEccccccccccccccEEEEEEEEccccEEEccccccccccccccccccEEEEEEEEEEcccccccccccccccccccccccEEccccccEEEEcccEEEEEEccccccccccccEEEEEEEEEccccccccccccEEEEccccEEEEEccccccEEEEEEEEEEEcccccEEEcEEEEEEccccccccEEEEEEEcccccccccHHHHHHHHHHHHHHHcccccccEEEEEEEcccccccccEEEEEEcccccccccccHHHHHHHHHHHHHcccccHHHHHHHcccccEEEEEEEEcccccccccccccccccccccccccccccccccccEEEEEHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccEEccccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
cccHHHHHHHHHHHHHcccccccEEEEEEEEccccccEEEEEccccccccccHHHHHHHHHHHHccccccccHHHHHHccccEEEEEEEEcccccccccccccEEEEccccccccccccccccEEEcccEEEEEEccccEEcccccccEEEEEEEccccccccccccccccccccccccccEEEEEEEEcccccccccccccEEEEEccccccccEcccccccEEEcccEEEEEEccccccccccccccEEEEEEEcccccccccccEEEEcccccEEEEccccccccEEEEEEEEEcccccEEccEEEEEEEcccccccEEEEEEEccccHHccccHHHHHHHHHHHHHHHccccccEEEEEEEEccccccccEEEEEEEcccccccccccHHHHHHHHHHHHcccccccHHHHHHHcccccEEEEEEEEcccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccEEccccHHHHHHHHHccccEEEccHccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
MVDWQIFVYNILLDLygnsgnssyITVRSVvtnkepyvftytneslprnecptaQINNLLEVIsntttgqakPVLTRALGRlmtvknitwrgighcedsppppvvkvklenqvplirnpiDHLEAISGELLvyhvpedtffdhedggtrKLKLHLMTMDRYvisntttgqakPVLTRALGRlmtvknitwrgighcedsppppvvkvklenqvplirnpiDHLEAISGELLvyhvpedtffdhedggtrKLKLHLMtmdrttippthwlqfdaknqefygipqltdlgrREYQLVCkdsnglmandglevivkspgpayysasfsltlarpsstfaqPAAQRAFLEKLAGlfgdrnaskivniefednpqepgksTVVRWANRTlasnvecpettIGLLREVLLTEDDLLQESLVTSLEkdygvkhasviptgiceglktplhtpgverkpaysgkntpanaSVEYLITFVAPLVVIVIMLLCAALIACLLYRrrhtgkmsvgngvdsgilirskgipvifQDELEErlevssgtkspiilkeekpplpppeynrdkndfsrdkndysrsqpnattallsdiddtspyhppppirpaptyrlpppyspp
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYtneslprneCPTAQINNLLEVisntttgqakpVLTRALGRLMTVKNITWRGIghcedsppppvVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVisntttgqakpvlTRALGRLMTVKNITWRGIghcedsppppvVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNiefednpqepgksTVVRWANRTlasnvecpettIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRrhtgkmsvgngvdsgilirSKGIPVIFQDELEERLevssgtkspiilkeekpplpppeynrDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPpirpaptyrlpppyspp
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPlvvivimllcaaliacllYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSpiilkeekpplpppEYNRDKNDFSRDKNDYSRSQPNATTALLSDIDDTSpyhppppirpaptyrlpppyspp
**DWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARP*STFAQPAAQRAFLEKLAGLFGDRNASKIVNIEF**********TVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHT****************NASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDE*************************************************************************************
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDS****VVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLL***DLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVER*********PANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRR******************SKGIPVIFQDE******************************************************************************PPP****
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPPIRPAPTYRLPPPYSPP
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEG**********************ANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEV**GTKSPIILKEEKPPLPPPEYNR*********NDYSRSQPNATTALLS**DDTSPYHPPPPIRPAPTYRLPPPYSPP
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPPIRPAPTYRLPPPYSPP
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query609 2.2.26 [Sep-21-2011]
Q14118895 Dystroglycan OS=Homo sapi yes N/A 0.535 0.364 0.335 1e-40
Q28685895 Dystroglycan OS=Oryctolag yes N/A 0.536 0.365 0.327 4e-40
Q62165893 Dystroglycan OS=Mus muscu yes N/A 0.540 0.368 0.349 5e-40
O18738895 Dystroglycan OS=Bos tauru yes N/A 0.538 0.366 0.314 6e-40
Q9TSZ6892 Dystroglycan OS=Canis fam yes N/A 0.527 0.359 0.316 2e-39
Q29243128 Dystroglycan (Fragment) O no N/A 0.149 0.710 0.340 5e-10
>sp|Q14118|DAG1_HUMAN Dystroglycan OS=Homo sapiens GN=DAG1 PE=1 SV=2 Back     alignment and function desciption
 Score =  168 bits (426), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 119/355 (33%), Positives = 175/355 (49%), Gaps = 29/355 (8%)

Query: 211 NQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQ 270
           NQ P ++N ID ++A  G      +P DTF+DHED  T KLKL L   ++  +    W+Q
Sbjct: 494 NQRPELKNHIDRVDAWVGTYFEVKIPSDTFYDHEDTTTDKLKLTLKLREQQLVGEKSWVQ 553

Query: 271 FDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIV-KSP----GPAYYSASFS 325
           F++ +Q  YG+P  + +G+ EY +   D  GL A D  E+ V + P     PA + A F 
Sbjct: 554 FNSNSQLMYGLPDSSHVGKHEYFMHATDKGGLSAVDAFEIHVHRRPQGDRAPARFKAKF- 612

Query: 326 LTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTL 385
             +  P+        + A ++KLA  FGDRN S I              S VV W N TL
Sbjct: 613 --VGDPALVLNDIHKKIALVKKLAFAFGDRNCSTITLQNITRG------SIVVEWTNNTL 664

Query: 386 ASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLK-----T 440
                CP+  I  L   +  +D   + +   +LE D+     +V  +G C  L+      
Sbjct: 665 PLE-PCPKEQIAGLSRRIAEDDGKPRPAFSNALEPDFKATSITVTGSGSCRHLQFIPVVP 723

Query: 441 PLHTPGVERKPAYSGKNTPANASVE--YLITFVAPLVVIVIMLLCAALIACLLYRRRHTG 498
           P   P  E  P       P  +S +  YL T + P VV+  +LL A +IA + YR++  G
Sbjct: 724 PRRVPS-EAPPTEVPDRDPEKSSEDDVYLHTVI-PAVVVAAILLIAGIIAMICYRKKRKG 781

Query: 499 KMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEY 553
           K+++    D    I+ KG+P+IF DEL++     S +   +IL+EEK PLPPPEY
Sbjct: 782 KLTLE---DQATFIK-KGVPIIFADELDDSKPPPSSSMP-LILQEEKAPLPPPEY 831




Beta-dystroglycan is a transmembrane protein that plays important roles in connecting the extracellular matrix to the cytoskeleton. Acts as a cell adhesion receptor in both muscle and non-muscle tissues. Receptor for both DMD and UTRN and, through these interactions, scaffolds axin to the cytoskeleton. Also functions in cell adhesion-mediated signaling and implicated in cell polarity.
Homo sapiens (taxid: 9606)
>sp|Q28685|DAG1_RABIT Dystroglycan OS=Oryctolagus cuniculus GN=DAG1 PE=1 SV=1 Back     alignment and function description
>sp|Q62165|DAG1_MOUSE Dystroglycan OS=Mus musculus GN=Dag1 PE=1 SV=4 Back     alignment and function description
>sp|O18738|DAG1_BOVIN Dystroglycan OS=Bos taurus GN=DAG1 PE=1 SV=1 Back     alignment and function description
>sp|Q9TSZ6|DAG1_CANFA Dystroglycan OS=Canis familiaris GN=DAG1 PE=3 SV=1 Back     alignment and function description
>sp|Q29243|DAG1_PIG Dystroglycan (Fragment) OS=Sus scrofa GN=DAG1 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query609
91086363 1114 PREDICTED: similar to AGAP005162-PA [Tri 0.658 0.359 0.425 8e-84
307209853 1249 Dystroglycan [Harpegnathos saltator] 0.706 0.344 0.405 1e-83
332016339 1297 Dystroglycan [Acromyrmex echinatior] 0.702 0.329 0.410 1e-83
307191154 1311 Dystroglycan [Camponotus floridanus] 0.709 0.329 0.399 1e-82
322784874 1144 hypothetical protein SINV_05379 [Solenop 0.671 0.357 0.413 3e-82
242015530 1069 conserved hypothetical protein [Pediculu 0.673 0.383 0.413 6e-80
357626981 1181 hypothetical protein KGM_17314 [Danaus p 0.660 0.340 0.400 4e-78
383849507 1327 PREDICTED: uncharacterized protein LOC10 0.679 0.311 0.4 2e-77
350418919 1305 PREDICTED: hypothetical protein LOC10074 0.684 0.319 0.405 6e-77
158292686 1185 AGAP005162-PB [Anopheles gambiae str. PE 0.658 0.338 0.399 3e-76
>gi|91086363|ref|XP_974516.1| PREDICTED: similar to AGAP005162-PA [Tribolium castaneum] gi|270009843|gb|EFA06291.1| hypothetical protein TcasGA2_TC009157 [Tribolium castaneum] Back     alignment and taxonomy information
 Score =  318 bits (814), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 195/458 (42%), Positives = 258/458 (56%), Gaps = 57/458 (12%)

Query: 169  GQAKPVLTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISG 228
            G  K  L+  L   + VK +T  G+G CE +P  PV      N  P +RNP+DH+ A  G
Sbjct: 697  GDPKQELSLLLTPDLRVKKVTKHGLGVCE-TPQVPVTPPT--NFSPFLRNPVDHINATVG 753

Query: 229  ELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLG 288
            ELL++ VP+DTF+D ED  +R L + L+T DR  I  T+WLQFD+KN+EFYGIP+    G
Sbjct: 754  ELLIFKVPDDTFYDPEDTDSRTLNISLLTGDRQAIKSTNWLQFDSKNREFYGIPRKP--G 811

Query: 289  RREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAA-QRAFLEK 347
            R EY LVC DS G  A D LE++V      +Y+  FS+T+  P  TF   A+ QR F+EK
Sbjct: 812  RSEYHLVCVDSGGATATDSLEIVVYPAHKMHYNVEFSMTVETPYDTFVNSASMQRKFVEK 871

Query: 348  LAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTED 407
            L  +FGDRN+S +    F    ++   STV+ W NRTL+++ +CP   I  L  VL    
Sbjct: 872  LQEIFGDRNSSSLYFNPF----KQKDHSTVITWFNRTLSTS-KCPHDEINHLESVLKNHQ 926

Query: 408  DL-LQESLVTSLEKDYGVKHASVIPTGIC--EGLKTPLH---TPGVERKPAYSGKNTPAN 461
            D  +   +   +E ++ V   SV   G C  + L  P+H    P  E+ P  S       
Sbjct: 927  DRGISHQVSRIMEPEFLVSKISVFTMGNCRTKSLPPPVHEDVIPVDEKSPDASA------ 980

Query: 462  ASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIF 521
            A  +YL+TF+ P V+I IML  AAL AC+LYRRR TGK++V    D      +KGIPVIF
Sbjct: 981  AQYQYLMTFIIPAVIISIMLFFAALCACVLYRRRRTGKLNVEE--DGRQSYGNKGIPVIF 1038

Query: 522  QDELEERLEVSSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDKNDYSRSQPNATTALLSD 581
            Q+ELEE+ E  SGTK+P+ILK+EKPPL PPEY++                    +  L  
Sbjct: 1039 QEELEEKPE--SGTKTPVILKDEKPPLAPPEYSK--------------------SGSLKL 1076

Query: 582  IDDTSPYHPPPPI----------RPAPTYRLPPPYSPP 609
             DD+ PY PPPP           +P PTYR PPPY PP
Sbjct: 1077 TDDSEPYQPPPPFTRTDGRQPRPKPTPTYRKPPPYVPP 1114




Source: Tribolium castaneum

Species: Tribolium castaneum

Genus: Tribolium

Family: Tenebrionidae

Order: Coleoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|307209853|gb|EFN86632.1| Dystroglycan [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|332016339|gb|EGI57252.1| Dystroglycan [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|307191154|gb|EFN74852.1| Dystroglycan [Camponotus floridanus] Back     alignment and taxonomy information
>gi|322784874|gb|EFZ11654.1| hypothetical protein SINV_05379 [Solenopsis invicta] Back     alignment and taxonomy information
>gi|242015530|ref|XP_002428406.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513018|gb|EEB15668.1| conserved hypothetical protein [Pediculus humanus corporis] Back     alignment and taxonomy information
>gi|357626981|gb|EHJ76853.1| hypothetical protein KGM_17314 [Danaus plexippus] Back     alignment and taxonomy information
>gi|383849507|ref|XP_003700386.1| PREDICTED: uncharacterized protein LOC100881230 [Megachile rotundata] Back     alignment and taxonomy information
>gi|350418919|ref|XP_003492011.1| PREDICTED: hypothetical protein LOC100745910 [Bombus impatiens] Back     alignment and taxonomy information
>gi|158292686|ref|XP_314057.4| AGAP005162-PB [Anopheles gambiae str. PEST] gi|157017109|gb|EAA09416.5| AGAP005162-PB [Anopheles gambiae str. PEST] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query609
FB|FBgn00340721179 Dg "Dystroglycan" [Drosophila 0.610 0.315 0.340 7.4e-50
UNIPROTKB|F1M8K0893 Dag1 "Protein Dag1" [Rattus no 0.568 0.387 0.305 4.1e-28
MGI|MGI:101864893 Dag1 "dystroglycan 1" [Mus mus 0.568 0.387 0.308 7.2e-28
UNIPROTKB|Q14118895 DAG1 "Dystroglycan" [Homo sapi 0.559 0.381 0.305 9.6e-28
UNIPROTKB|Q28685895 DAG1 "Dystroglycan" [Oryctolag 0.561 0.382 0.299 3.8e-27
UNIPROTKB|F1N7D7895 DAG1 "Dystroglycan" [Bos tauru 0.568 0.386 0.292 1.1e-26
UNIPROTKB|I3LD20877 DAG1 "Dystroglycan" [Sus scrof 0.548 0.380 0.303 1.4e-26
UNIPROTKB|O18738895 DAG1 "Dystroglycan" [Bos tauru 0.499 0.339 0.300 1.4e-26
UNIPROTKB|Q9TSZ6892 DAG1 "Dystroglycan" [Canis lup 0.564 0.385 0.290 2.4e-26
ZFIN|ZDB-GENE-021223-1866 dag1 "dystroglycan 1" [Danio r 0.500 0.352 0.277 5.2e-23
FB|FBgn0034072 Dg "Dystroglycan" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 471 (170.9 bits), Expect = 7.4e-50, Sum P(2) = 7.4e-50
 Identities = 132/388 (34%), Positives = 191/388 (49%)

Query:   185 VKNITWRGIGHCE-DSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDH 243
             +K+IT + IG C+ D       +   +N  P+ RN +D + A  G+LLVY VP DTF+D 
Sbjct:   771 IKSITGQLIGSCQKDLTQVKPTQHMTKNVPPMPRNQVDRVNASLGQLLVYKVPADTFYDA 830

Query:   244 EDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLM 303
              D    +L L L T D   + P HWLQFD+KN+EFYGIP+  D+G  EY LV +DS GL 
Sbjct:   831 NDN---QLTLTLKTRDHLELSPRHWLQFDSKNEEFYGIPKSGDIGSEEYLLVAEDSGGLS 887

Query:   304 ANDGLEVIVKSPGPAY-YSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVN 362
             A+D L V+V SP P   +   F   L+     F     QR F+E++A L GD    +I  
Sbjct:   888 AHDALVVVV-SPAPKRDFGFFFKAYLSIKHERF-NADLQRKFVERVAKLNGDPTTGQIQI 945

Query:   363 IEFEDNPQEPGKSTVVRWANRTL-ASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKD 421
                  +    G  T+V + N TL   +  C E  + + R V L  D  L+E+   +L  +
Sbjct:   946 RSITTHHDSDG--TIVNFYNTTLYRKHNSCREKEVAMTRSVYLNSDLSLREAAKRALGPE 1003

Query:   422 YGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVE-YLITFVAPXXXXXXX 480
               + + SV+P  IC   +  + T  ++  P+   + T  ++  E Y+ITFV P       
Sbjct:  1004 LNLTNFSVVPFSICHHTEN-IDTNQLDYIPSRPEEPTHKSSFGEDYMITFVWPIVIIVAM 1062

Query:   481 XXXXXXXXX-XXYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSXX 539
                         + R+ +GKM +G+  +     R+KGIPVIFQDE EE+ E+  G KS  
Sbjct:  1063 LVAASIIACCLHWCRQRSGKMELGDEEERKSF-RAKGIPVIFQDEYEEKPEI--GNKSPV 1119

Query:   540 XXXXXXXXXXXXEYNRDKNDFSRDKNDY 567
                          YN    +   D +DY
Sbjct:  1120 ILKDEKPPLLPPSYNTSNMNGDNDVDDY 1147


GO:0008307 "structural constituent of muscle" evidence=ISS
GO:0016011 "dystroglycan complex" evidence=ISS;IDA;NAS
GO:0007016 "cytoskeletal anchoring at plasma membrane" evidence=IEA;ISS
GO:0016334 "establishment or maintenance of polarity of follicular epithelium" evidence=IMP
GO:0045197 "establishment or maintenance of epithelial cell apical/basal polarity" evidence=IMP
GO:0007314 "oocyte anterior/posterior axis specification" evidence=IMP
GO:0005509 "calcium ion binding" evidence=IEA
GO:0009925 "basal plasma membrane" evidence=IDA
GO:0016010 "dystrophin-associated glycoprotein complex" evidence=IPI
GO:0007163 "establishment or maintenance of cell polarity" evidence=IMP
GO:0007295 "growth of a germarium-derived egg chamber" evidence=IMP
GO:0030010 "establishment of cell polarity" evidence=IMP
GO:0007411 "axon guidance" evidence=IGI
GO:0046716 "muscle cell homeostasis" evidence=IMP
GO:0005515 "protein binding" evidence=IPI
GO:0007309 "oocyte axis specification" evidence=IMP
GO:0035088 "establishment or maintenance of apical/basal cell polarity" evidence=IMP
GO:0007474 "imaginal disc-derived wing vein specification" evidence=IMP
GO:0030315 "T-tubule" evidence=IDA
GO:0045214 "sarcomere organization" evidence=IMP
GO:0016203 "muscle attachment" evidence=IMP
GO:0060025 "regulation of synaptic activity" evidence=IMP
GO:0007293 "germarium-derived egg chamber formation" evidence=IMP
GO:0031594 "neuromuscular junction" evidence=IDA
GO:0042383 "sarcolemma" evidence=IDA
GO:0030018 "Z disc" evidence=IDA
GO:0006110 "regulation of glycolysis" evidence=IGI
GO:0031532 "actin cytoskeleton reorganization" evidence=IMP
UNIPROTKB|F1M8K0 Dag1 "Protein Dag1" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
MGI|MGI:101864 Dag1 "dystroglycan 1" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q14118 DAG1 "Dystroglycan" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q28685 DAG1 "Dystroglycan" [Oryctolagus cuniculus (taxid:9986)] Back     alignment and assigned GO terms
UNIPROTKB|F1N7D7 DAG1 "Dystroglycan" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|I3LD20 DAG1 "Dystroglycan" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|O18738 DAG1 "Dystroglycan" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q9TSZ6 DAG1 "Dystroglycan" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-021223-1 dag1 "dystroglycan 1" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query609
pfam05454290 pfam05454, DAG1, Dystroglycan (Dystrophin-associat 3e-41
cd1130399 cd11303, Dystroglycan_repeat, Cadherin-like repeat 5e-10
smart0073697 smart00736, CADG, Dystroglycan-type cadherin-like 6e-09
pfam05454290 pfam05454, DAG1, Dystroglycan (Dystrophin-associat 0.001
>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein 1) Back     alignment and domain information
 Score =  150 bits (381), Expect = 3e-41
 Identities = 92/286 (32%), Positives = 136/286 (47%), Gaps = 37/286 (12%)

Query: 343 AFLEKLAGLFGDRNASKIV--NIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLR 400
             ++KLA  FGDRN+S I   +I           S +V W N TL     CP+  + +L 
Sbjct: 23  LLVKKLAFAFGDRNSSTITLRSIT--------KGSIIVEWTNNTLPHE-PCPKEQVAMLS 73

Query: 401 EVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLK-TPLHTPGV--ERKPAYSGKN 457
           + +L  D   +E+   +LE ++ + + SV+ TG C   +  P +       R P     +
Sbjct: 74  KKILDSDGSPREAFKNALEPEFKLTNISVVGTGSCRHTEFIPTNPLDYIPSRTPPTGTPD 133

Query: 458 TPANASVE---YLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRS 514
            P   S E   YL T +  +VV  I+L+ A +IA + YR++  GK+++    +       
Sbjct: 134 RPPEKSSEDDVYLHTVIPAVVVAAILLI-AGIIAMICYRKKRKGKLTL----EDQATFIK 188

Query: 515 KGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDKNDYSRSQPNA 574
           KG+P+IF DEL++  +    +  P+ILKEEKPPLPPPEY       +   N     +   
Sbjct: 189 KGVPIIFADELDDS-KPPPSSSMPLILKEEKPPLPPPEYPNQNVPETTPLNQDLLGE--- 244

Query: 575 TTALLSDIDDTSPYHPPPPI---------RP--APTYRLPPPYSPP 609
            T L  +  +  PY PPPP          RP     YR PPPY PP
Sbjct: 245 YTPLRDEDPNAPPYQPPPPFTTPMEGKGSRPKNMTPYRSPPPYVPP 290


Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in human. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton. [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in mouse brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear. Length = 290

>gnl|CDD|206636 cd11303, Dystroglycan_repeat, Cadherin-like repeat domain of alpha dystroglycan Back     alignment and domain information
>gnl|CDD|214795 smart00736, CADG, Dystroglycan-type cadherin-like domains Back     alignment and domain information
>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein 1) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 609
PF05454290 DAG1: Dystroglycan (Dystrophin-associated glycopro 100.0
KOG3781|consensus559 100.0
smart0073697 CADG Dystroglycan-type cadherin-like domains. Cadh 99.8
PF05454290 DAG1: Dystroglycan (Dystrophin-associated glycopro 99.65
smart0073697 CADG Dystroglycan-type cadherin-like domains. Cadh 99.0
PF05510386 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro 98.76
KOG3781|consensus559 98.76
PF0534549 He_PIG: Putative Ig domain; InterPro: IPR008009 Th 98.31
KOG4482|consensus449 97.9
cd00031199 CA Cadherin repeat domain; Cadherins are glycoprot 96.93
PF12877 684 DUF3827: Domain of unknown function (DUF3827); Int 95.83
PF15102146 TMEM154: TMEM154 protein family 95.5
PF10873155 DUF2668: Protein of unknown function (DUF2668); In 95.44
PF13908179 Shisa: Wnt and FGF inhibitory regulator 95.3
PF12273130 RCR: Chitin synthesis regulation, resistance to Co 93.81
smart0011279 CA Cadherin repeats. Cadherins are glycoproteins i 92.95
PF0002893 Cadherin: Cadherin domain; InterPro: IPR002126 Cad 92.55
PF0243938 Adeno_E3_CR2: Adenovirus E3 region protein CR2; In 91.64
PF04478154 Mid2: Mid2 like cell wall stress sensor; InterPro: 90.22
cd00031199 CA Cadherin repeat domain; Cadherins are glycoprot 89.92
PF0749566 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This regi 89.23
cd0014681 PKD polycystic kidney disease I (PKD) domain; simi 88.6
PF15050133 SCIMP: SCIMP protein 88.53
PF15418132 DUF4625: Domain of unknown function (DUF4625) 87.19
PF01102122 Glycophorin_A: Glycophorin A; InterPro: IPR001195 86.36
PF14991118 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 84.23
PF1224560 Big_3_2: Bacterial Ig-like domain (group 3); Inter 81.87
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5 Back     alignment and domain information
Probab=100.00  E-value=4.4e-82  Score=649.26  Aligned_cols=272  Identities=42%  Similarity=0.692  Sum_probs=1.0

Q ss_pred             eEEEEEecCCCCCcCchHHH-HHHHHHHHHHhCCCCCCcEEEEeeecCCCCCCCcEEEEEEecCCCCCCCCCHHHHHHHH
Q psy12980        322 ASFSLTLARPSSTFAQPAAQ-RAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLR  400 (609)
Q Consensus       322 vef~L~l~~~~~~ft~~~~q-~~LVekLA~llGD~d~s~ItVrsI~~~~~~~~gSTvV~w~N~TLp~~~~CP~~eI~~L~  400 (609)
                      |+|+++|++++++|+++.++ ++||++||.+|||+|+++|+||+|+      +|||+|+|||+||+. +.||+++|++|+
T Consensus         1 ~~F~~~l~~d~~~f~~dv~~ki~lVekLA~~~GD~nts~ItV~sIt------~gstiVtwtNnTLp~-~~CP~eeI~~L~   73 (290)
T PF05454_consen    1 VEFSATLDIDYESFNNDVQRKILLVEKLARLFGDRNTSSITVRSIT------SGSTIVTWTNNTLPT-SPCPKEEIEKLR   73 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             CeeEEEEcCCHHHhhhhHHHHHHHHHHHHHHhCCCCCCeEEEEEec------CCCEEEEEEcCCCCC-CCCCHHHHHHHH
Confidence            68999999999999986544 7899999999999999999999999      899999999999998 899999999999


Q ss_pred             HHHhhccchhhHHHHHhhcccceeeEEeeeeccccCCCCCCCCCCCCCCCCC------CCCCCCCCCCCccEEEEeehHH
Q psy12980        401 EVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPA------YSGKNTPANASVEYLITFVAPL  474 (609)
Q Consensus       401 ~kL~~~dg~~s~~f~~am~peF~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~~------~~~~~~~~na~~~~Llv~ViPa  474 (609)
                      ++|.+++|+++++|+++|+|+|+|++|++.++|+|++.......+.+..|..      +......+.++++||+ ++||+
T Consensus        74 ~~L~~~~g~~~~~f~~am~pef~V~svsv~~~G~C~~~~~ip~~~~~~iP~~~~~t~~p~~~~~~~s~~d~yL~-T~Ipa  152 (290)
T PF05454_consen   74 KRLVDDDGKPSQEFVRAMGPEFKVKSVSVIPIGSCQDTSFIPTPPDSEIPTPAPPTNTPDEPSPKSSFSDDYLH-TFIPA  152 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHhcCCCCcCHHHHHHhCCCCceeEEEEEEeeccCCCccCCCCCcCcCCCCCCCCCCCCCCCCCcccccchHH-HHHHH
Confidence            9999999999999999999999999999999999999766543222223321      1111123444565655 55799


Q ss_pred             HHHHHHHHHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCCCCCCCCCeeccCCCCCCCCCCCC
Q psy12980        475 VVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYN  554 (609)
Q Consensus       475 VIIa~mllia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P~IlkeEkpplpppey~  554 (609)
                      |||++|||||++||||||||||||||++    |||++||+|||||||||||||. ||++++|+||||||||||||||||+
T Consensus       153 VVI~~iLLIA~iIa~icyrrkR~GK~~~----ee~~~f~~KGiPvIF~dElee~-kp~~~~~~P~IlkeEkPPl~pp~y~  227 (290)
T PF05454_consen  153 VVIAAILLIAGIIACICYRRKRKGKMSL----EEQKTFISKGIPVIFQDELEES-KPEPGSKSPVILKEEKPPLPPPEYP  227 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHhhhhhhcccccc----chhHHHHhcCCceecccccccc-CCCCCCCCCeeecccCCCCCCCCCC
Confidence            9999999999999999999999999998    5699999999999999999995 4999999999999999999999999


Q ss_pred             CCCCCCCCCcCccCCCCCCccccccCCCCCCCCCCCCCCC---------C--CCCCCCCCCCCCCC
Q psy12980        555 RDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPPI---------R--PAPTYRLPPPYSPP  609 (609)
Q Consensus       555 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~pppp~---------r--~~~~yr~pppyvpp  609 (609)
                      +++++++++|+.++.+   +..++.++|+|++||||||||         |  +||+||+|||||||
T Consensus       228 ~~~~~~~~p~~~~~~~---~~~~~~~~~~~~~~y~ppp~~~~~~~~~~~~p~~~~~yr~PPpyvpp  290 (290)
T PF05454_consen  228 NSNMPESTPLNQDTLG---EYTPLRDEDPDAPPYQPPPPFTSPGEGRRSRPKNTPPYRSPPPYVPP  290 (290)
T ss_dssp             -------------------------------------------------------S----------
T ss_pred             CCCCCCCCCccccccc---cccccccCCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCCCCCCC
Confidence            9999999988433311   112767777899999999999         3  69999999999998



5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.

>KOG3781|consensus Back     alignment and domain information
>smart00736 CADG Dystroglycan-type cadherin-like domains Back     alignment and domain information
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5 Back     alignment and domain information
>smart00736 CADG Dystroglycan-type cadherin-like domains Back     alignment and domain information
>PF05510 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex Back     alignment and domain information
>KOG3781|consensus Back     alignment and domain information
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins Back     alignment and domain information
>KOG4482|consensus Back     alignment and domain information
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here Back     alignment and domain information
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ] Back     alignment and domain information
>PF15102 TMEM154: TMEM154 protein family Back     alignment and domain information
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [] Back     alignment and domain information
>PF13908 Shisa: Wnt and FGF inhibitory regulator Back     alignment and domain information
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls Back     alignment and domain information
>smart00112 CA Cadherin repeats Back     alignment and domain information
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,] Back     alignment and domain information
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host [] Back     alignment and domain information
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region Back     alignment and domain information
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here Back     alignment and domain information
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators Back     alignment and domain information
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here Back     alignment and domain information
>PF15050 SCIMP: SCIMP protein Back     alignment and domain information
>PF15418 DUF4625: Domain of unknown function (DUF4625) Back     alignment and domain information
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane Back     alignment and domain information
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P Back     alignment and domain information
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query609
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-07
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-05
1u2c_A246 Dystroglycan; IG-like domain, S6 like fold, protei 8e-06
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 54.1 bits (129), Expect = 1e-07
 Identities = 81/546 (14%), Positives = 146/546 (26%), Gaps = 159/546 (29%)

Query: 5   QIFV-------YNILLDLYGNSGNS------SYITVRSVVTNKEPYVFTYTNESLPRNEC 51
           Q FV       Y  L+                YI  R  + N +  VF   N  + R + 
Sbjct: 80  QKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYN-DNQVFAKYN--VSRLQP 136

Query: 52  PTAQINNLLEVISNTT------TGQAKPVLTRALGRLMTVK-----NITWRGIGHCEDSP 100
                  LLE+            G  K  +   +     V+      I W  + +C +SP
Sbjct: 137 YLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLNLKNC-NSP 195

Query: 101 PPPVVKVKLENQVPLIR--------------NPIDHLEAISGE-------------LLVY 133
                +  LE    L+               N    + +I  E             LLV 
Sbjct: 196 -----ETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENCLLVL 250

Query: 134 -HVPEDTFFDHEDGGTRKLKLHLMTMDRYVISNTTTGQAKPVLTRALGRLMTVKN----- 187
            +V     ++  +   + L   L T  + V    +      +        +T        
Sbjct: 251 LNVQNAKAWNAFNLSCKIL---LTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL 307

Query: 188 ITWRGIGHCEDSPP------PPVV------------------KVKLENQVPLIRNPIDHL 223
           + +      +D P       P  +                   V  +    +I + ++ L
Sbjct: 308 LKYLDCR-PQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIESSLNVL 366

Query: 224 EAISGELLVY-----------HVPEDTF----FDHEDGGTRKL--KLHLMTM-DRTTIPP 265
           E    E               H+P        FD        +  KLH  ++ ++     
Sbjct: 367 E--PAEYRKMFDRLSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKES 424

Query: 266 T---HWLQFDAKNQEFYGIPQLTDLGRR---EYQLVCK-DSNGLMAN--DGLEVIVKSPG 316
           T     +  + K +    +     L R     Y +    DS+ L+    D          
Sbjct: 425 TISIPSIYLELKVK----LENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQ--------- 471

Query: 317 PAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRN--ASKIVNIEFEDNPQEPGK 374
              Y  S    +        +   +         +F D      KI +     N      
Sbjct: 472 ---YFYSH---IGHHLKN-IEHPERMTLFRM---VFLDFRFLEQKIRHDSTAWNASGSIL 521

Query: 375 STV--VRWANRTLASNVECPETTIGLLREVLLT---------EDDLLQESLVTSLEKDYG 423
           +T+  +++    +  N    E  +  + + L             DLL+ +L+   E  + 
Sbjct: 522 NTLQQLKFYKPYICDNDPKYERLVNAILDFLPKIEENLICSKYTDLLRIALMAEDEAIFE 581

Query: 424 VKHASV 429
             H  V
Sbjct: 582 EAHKQV 587


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1u2c_A Dystroglycan; IG-like domain, S6 like fold, protein binding; 2.30A {Mus musculus} SCOP: b.1.6.2 d.272.1.1 Length = 246 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query609
1u2c_A246 Dystroglycan; IG-like domain, S6 like fold, protei 99.71
1u2c_A246 Dystroglycan; IG-like domain, S6 like fold, protei 98.83
3q2v_A550 Cadherin-1; cell adhesion, calcium binding; HET: M 98.09
3q2w_A559 Cadherin-2; cell adhesion, calcium binding; HET: M 97.95
3mvs_A210 Cadherin-23, otocadherin; adhesion, extracellular 97.8
2a62_A322 Cadherin-8; extracellular domain, homodimer, calci 97.67
1l3w_A546 EP-cadherin, C-cadherin; cell adhesion, calcium bi 97.67
3q2v_A550 Cadherin-1; cell adhesion, calcium binding; HET: M 97.66
1l3w_A546 EP-cadherin, C-cadherin; cell adhesion, calcium bi 97.64
3q2w_A559 Cadherin-2; cell adhesion, calcium binding; HET: M 97.43
1q55_A880 EP-cadherin, C-cadherin; trans interaction, desmos 97.43
3ubh_A419 Neural-cadherin; cell adhesion; 2.70A {Drosophila 97.37
2yrl_A102 KIAA1837 protein; PKD domain, NPPSFA, national pro 97.34
3ubf_A316 Neural-cadherin; cell adhesion; 2.50A {Drosophila 97.26
1q55_A880 EP-cadherin, C-cadherin; trans interaction, desmos 97.18
3lnd_A207 CDH6 protein, cadherin 6; cell adhesion, cell memb 96.89
3ubh_A419 Neural-cadherin; cell adhesion; 2.70A {Drosophila 96.88
3ppe_A203 Vascular endothelial cadherin; extracellular cadhe 96.85
2a4e_A215 Cadherin-11; dimer, calcium binding, cell adhesion 96.76
3qrb_A213 Cadherin-1; cell adhesion; HET: PGE; 1.80A {Mus mu 96.56
3k5r_A218 Cadherin 13; mouse, structural protein; 2.00A {Mus 96.49
2a62_A322 Cadherin-8; extracellular domain, homodimer, calci 96.46
2ee0_A114 Protocadherin-9; Ca domain, structural genomics, N 95.91
2qvi_A215 Cadherin-2; beta barrel, strand SWAP, domain SWAP, 95.87
3ubf_A316 Neural-cadherin; cell adhesion; 2.50A {Drosophila 95.67
2x2u_A246 Proto-oncogene tyrosine-protein kinase receptor RE 95.56
2e7m_A113 Protein KIAA0319; PKD domain, structural genomics, 95.55
2yhg_A437 SDE_182CT, cellulose-binding protein; hydrolase, g 95.4
3pdd_A190 Glycoside hydrolase, family 9; CBHA, beta-sandwich 95.31
4apx_B242 Protocadherin-15, otocadherin; cell adhesion, hear 95.01
2yst_A119 Protocadherin-7; BH-PCDH, beta-sandwich, structura 94.48
2y72_A85 Collagenase, collagenase G; polycystic kidney dise 94.11
3qc5_X360 Platelet binding protein GSPB; carbohydrate/sugar 93.78
2kpn_A103 Bacillolysin; solution structure, BIG_3 domain, PF 91.28
1zxk_A98 Cadherin-8; strand dimer, cell adhesion; 2.00A {Mu 90.74
2c4x_A260 Endoglucanase, ctcel9D-CEL44A; cellulase ctcel9D-C 90.2
1zvn_A99 Mn-cadherin; strand dimer, ----; 2.16A {Gallus gal 87.86
3mvs_A210 Cadherin-23, otocadherin; adhesion, extracellular 87.13
2omz_B105 Epithelial-cadherin; leucine-rich-repeat, nvasion 86.05
3qrb_A213 Cadherin-1; cell adhesion; HET: PGE; 1.80A {Mus mu 85.57
1ncg_A110 N-cadherin; cell adhesion protein; 2.10A {Mus musc 84.09
2c4x_A260 Endoglucanase, ctcel9D-CEL44A; cellulase ctcel9D-C 83.81
3k6d_A99 T-cadherin; cell adhesion; 1.80A {Xenopus laevis} 83.54
3k6i_A99 T-cadherin; cell adhesion, alternative splicing, c 81.3
3pe9_A98 Fibronectin(III)-like module; CBHA, beta-sandwich, 81.27
3ppe_A203 Vascular endothelial cadherin; extracellular cadhe 81.05
3k6f_A100 T-cadherin; cell adhesion, calcium, cell membrane, 80.88
>1u2c_A Dystroglycan; IG-like domain, S6 like fold, protein binding; 2.30A {Mus musculus} SCOP: b.1.6.2 d.272.1.1 Back     alignment and structure
Probab=99.71  E-value=3.9e-17  Score=163.30  Aligned_cols=138  Identities=13%  Similarity=0.197  Sum_probs=112.2

Q ss_pred             ccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEc
Q psy12980        219 PIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKD  298 (609)
Q Consensus       219 ~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATD  298 (609)
                      .|+++.|.+|+.|+|+||.++|.    |+  .++|+++.++|..||  +||+||+++++|+|+|+..|+|.|.+.|+|+|
T Consensus         7 GipD~~a~vg~~F~~tIP~~~Fs----gd--~~~y~vt~a~g~~LP--sWL~fd~~t~TfsG~P~~~D~G~~~i~VtA~~   78 (246)
T 1u2c_A            7 GIPDGTAVVGRSFRVSIPTDLIA----SS--GEIIKVSAAGKEALP--SWLHWDPHSHILEGLPLDTDKGVHYISVSAAR   78 (246)
T ss_dssp             CCCCEEEETTSCEEEECCHHHHC----CT--TEEEEEEETTSSSCC--TTEEEETTTTEEEECCCGGGCEEEEEEEEEEE
T ss_pred             CCCChhhcCCcceEEECCccccC----CC--ccEEEEEecCCCCCC--cceEEcCCCCeEEeccCCCCcceEEEEEEEee
Confidence            48999999999999999999999    44  456666668899999  99999999999999999999999999999983


Q ss_pred             -----CCCCeeEeeEEEEEECC-C----------------------CCCcceEEEEEecCCCCCcCchHHHHHHHHHHHH
Q psy12980        299 -----SNGLMANDGLEVIVKSP-G----------------------PAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAG  350 (609)
Q Consensus       299 -----s~G~sasd~F~V~V~~~-p----------------------~~~~~vef~L~l~~~~~~ft~~~~q~~LVekLA~  350 (609)
                           +.|..++|.|+|+|.+. .                      ...+...+++.++.+.+++.- ..+..|++++|.
T Consensus        79 ~~~~g~~~~~asdvF~I~V~ped~~ep~~v~~~~~~~~e~~~~~C~~~epvtvltVilDa~l~kl~p-~~Rv~Ll~~~~~  157 (246)
T 1u2c_A           79 LGANGSHVPQTSSVFSIEVYPEDHNEPQSVRAASSDPGEVVPSACAADEPVTVLTVILDADLTKMTP-KQRIDLLNRMQS  157 (246)
T ss_dssp             ECTTSCEEEEEEEEEEEEEESSCCC------------------CCCTTSCEEEEEEEESCCGGGSCH-HHHHHHHHHHHH
T ss_pred             ccCCCCccccccceEEEEEecCCCcchhhhhhhhcccccCccccCCCCCceEEEEEEeeCCchhCCH-HHHHHHHHHHHH
Confidence                 34778999999999931 1                      112223367788888788874 234899999999


Q ss_pred             HhCCCCCCcEEEEeee
Q psy12980        351 LFGDRNASKIVNIEFE  366 (609)
Q Consensus       351 llGD~d~s~ItVrsI~  366 (609)
                      .++.. .+++++..+-
T Consensus       158 f~~v~-~~~~~l~p~~  172 (246)
T 1u2c_A          158 FSEVE-LHNMKLVPVV  172 (246)
T ss_dssp             HHCSC-GGGCEEEECC
T ss_pred             Hhccc-hhheEeeecc
Confidence            99985 4777777665



>1u2c_A Dystroglycan; IG-like domain, S6 like fold, protein binding; 2.30A {Mus musculus} SCOP: b.1.6.2 d.272.1.1 Back     alignment and structure
>3q2v_A Cadherin-1; cell adhesion, calcium binding; HET: MAN; 3.40A {Mus musculus} Back     alignment and structure
>3q2w_A Cadherin-2; cell adhesion, calcium binding; HET: MAN NAG; 3.20A {Mus musculus} Back     alignment and structure
>3mvs_A Cadherin-23, otocadherin; adhesion, extracellular domain, cell adhesion; 1.10A {Mus musculus} PDB: 4apx_A* 2whv_A 2wcp_A* 4aq8_A 4axw_A* 4aqe_A* 4aqa_A* 2wd0_A* 2wbx_A Back     alignment and structure
>2a62_A Cadherin-8; extracellular domain, homodimer, calcium binding, adhesion; 4.50A {Mus musculus} Back     alignment and structure
>1l3w_A EP-cadherin, C-cadherin; cell adhesion, calcium binding, extracellular, ECT metal binding protein; HET: NAG NDG; 3.08A {Xenopus laevis} SCOP: b.1.6.1 b.1.6.1 b.1.6.1 b.1.6.1 b.1.6.1 Back     alignment and structure
>3q2v_A Cadherin-1; cell adhesion, calcium binding; HET: MAN; 3.40A {Mus musculus} Back     alignment and structure
>1l3w_A EP-cadherin, C-cadherin; cell adhesion, calcium binding, extracellular, ECT metal binding protein; HET: NAG NDG; 3.08A {Xenopus laevis} SCOP: b.1.6.1 b.1.6.1 b.1.6.1 b.1.6.1 b.1.6.1 Back     alignment and structure
>3q2w_A Cadherin-2; cell adhesion, calcium binding; HET: MAN NAG; 3.20A {Mus musculus} Back     alignment and structure
>1q55_A EP-cadherin, C-cadherin; trans interaction, desmosome, junction, adhesion, structural protein; HET: NAG NDG; 30.00A {Mus musculus} SCOP: i.20.1.1 PDB: 1q5a_A* 1q5b_A* 1q5c_A* Back     alignment and structure
>3ubh_A Neural-cadherin; cell adhesion; 2.70A {Drosophila melanogaster} Back     alignment and structure
>2yrl_A KIAA1837 protein; PKD domain, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>3ubf_A Neural-cadherin; cell adhesion; 2.50A {Drosophila melanogaster} PDB: 3ubg_A Back     alignment and structure
>1q55_A EP-cadherin, C-cadherin; trans interaction, desmosome, junction, adhesion, structural protein; HET: NAG NDG; 30.00A {Mus musculus} SCOP: i.20.1.1 PDB: 1q5a_A* 1q5b_A* 1q5c_A* Back     alignment and structure
>3lnd_A CDH6 protein, cadherin 6; cell adhesion, cell membrane, membrane, transmembr; 2.82A {Mus musculus} Back     alignment and structure
>3ubh_A Neural-cadherin; cell adhesion; 2.70A {Drosophila melanogaster} Back     alignment and structure
>3ppe_A Vascular endothelial cadherin; extracellular cadherin (EC) domain, beta barrel, IG-domain L domain swapped dimer interface; 2.10A {Gallus gallus} Back     alignment and structure
>2a4e_A Cadherin-11; dimer, calcium binding, cell adhesion; 3.20A {Mus musculus} Back     alignment and structure
>3qrb_A Cadherin-1; cell adhesion; HET: PGE; 1.80A {Mus musculus} PDB: 1ff5_A 3lng_A 2qvf_B 1edh_A 3lne_A 1q1p_A 3lni_A 3q2l_A* 3q2n_A* 3lnh_A 3lnf_A 2o72_A 1suh_A Back     alignment and structure
>3k5r_A Cadherin 13; mouse, structural protein; 2.00A {Mus musculus} PDB: 3k5s_A Back     alignment and structure
>2a62_A Cadherin-8; extracellular domain, homodimer, calcium binding, adhesion; 4.50A {Mus musculus} Back     alignment and structure
>2ee0_A Protocadherin-9; Ca domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>2qvi_A Cadherin-2; beta barrel, strand SWAP, domain SWAP, calcium, cell adhesion, cleavage on PAIR of basic residues, glycoprotein, membrane; 3.01A {Mus musculus} SCOP: b.1.6.1 b.1.6.1 PDB: 1ncj_A Back     alignment and structure
>3ubf_A Neural-cadherin; cell adhesion; 2.50A {Drosophila melanogaster} PDB: 3ubg_A Back     alignment and structure
>2x2u_A Proto-oncogene tyrosine-protein kinase receptor RET; hirschsprung disease, extracellular domain, disease mutation, transferase, glycoprotein; HET: NAG SO4; 2.00A {Homo sapiens} Back     alignment and structure
>2e7m_A Protein KIAA0319; PKD domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>2yhg_A SDE_182CT, cellulose-binding protein; hydrolase, glycoside hydrolase; HET: BTB; 1.08A {Saccharophagus degradans} Back     alignment and structure
>3pdd_A Glycoside hydrolase, family 9; CBHA, beta-sandwich, cellulosome, unknown function; 1.72A {Clostridium thermocellum} PDB: 3pdg_A Back     alignment and structure
>4apx_B Protocadherin-15, otocadherin; cell adhesion, hearing, deafness, CDH23, PCDH15; HET: MES; 1.65A {Mus musculus} PDB: 4aq8_C 4aqa_B* 4aqe_B* 4axw_B* Back     alignment and structure
>2yst_A Protocadherin-7; BH-PCDH, beta-sandwich, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>2y72_A Collagenase, collagenase G; polycystic kidney disease domain, beta barrel, collagen RECO domain, hydrolase; 1.18A {Clostridium histolyticum} PDB: 3jqu_A 3js7_A Back     alignment and structure
>3qc5_X Platelet binding protein GSPB; carbohydrate/sugar binding, sugar binding protein; HET: GOL; 1.40A {Streptococcus gordonii} PDB: 3qc6_X 4i8e_X* 3qd1_X* Back     alignment and structure
>2kpn_A Bacillolysin; solution structure, BIG_3 domain, PF07523, PSI BIG-open NESG, cell WALL, hydrolase, peptidoglycan-anchor; NMR {Bacillus cereus atcc 14579} Back     alignment and structure
>1zxk_A Cadherin-8; strand dimer, cell adhesion; 2.00A {Mus musculus} PDB: 2a4c_A Back     alignment and structure
>2c4x_A Endoglucanase, ctcel9D-CEL44A; cellulase ctcel9D-CEL44A, PKD domain, CBM44, carbohydrate BI module, beta-sandwich proteins, cellulosome; 2.0A {Clostridium thermocellum} PDB: 2c26_A Back     alignment and structure
>1zvn_A Mn-cadherin; strand dimer, ----; 2.16A {Gallus gallus} Back     alignment and structure
>3mvs_A Cadherin-23, otocadherin; adhesion, extracellular domain, cell adhesion; 1.10A {Mus musculus} PDB: 4apx_A* 2whv_A 2wcp_A* 4aq8_A 4axw_A* 4aqe_A* 4aqa_A* 2wd0_A* 2wbx_A Back     alignment and structure
>2omz_B Epithelial-cadherin; leucine-rich-repeat, nvasion protein, IG-like domain, adhesi protein, cell invasion-cell adhesion complex; 1.60A {Homo sapiens} SCOP: b.1.6.1 PDB: 2omt_B 2omu_B 2omv_B 1o6s_B 2omy_B 2omx_B 3ff8_A 3ff7_A 2omw_B Back     alignment and structure
>3qrb_A Cadherin-1; cell adhesion; HET: PGE; 1.80A {Mus musculus} PDB: 1ff5_A 3lng_A 2qvf_B 1edh_A 3lne_A 1q1p_A 3lni_A 3q2l_A* 3q2n_A* 3lnh_A 3lnf_A 2o72_A 1suh_A Back     alignment and structure
>1ncg_A N-cadherin; cell adhesion protein; 2.10A {Mus musculus} SCOP: b.1.6.1 PDB: 1nch_A 1nci_A Back     alignment and structure
>2c4x_A Endoglucanase, ctcel9D-CEL44A; cellulase ctcel9D-CEL44A, PKD domain, CBM44, carbohydrate BI module, beta-sandwich proteins, cellulosome; 2.0A {Clostridium thermocellum} PDB: 2c26_A Back     alignment and structure
>3k6d_A T-cadherin; cell adhesion; 1.80A {Xenopus laevis} PDB: 2v37_A Back     alignment and structure
>3k6i_A T-cadherin; cell adhesion, alternative splicing, calcium, cell membrane, cleavage on PAIR of basic residues, glycoprotein, GPI-anchor, lipoprotein; 1.13A {Gallus gallus} Back     alignment and structure
>3pe9_A Fibronectin(III)-like module; CBHA, beta-sandwich, cellulosome, unknown function; 1.69A {Clostridium thermocellum} Back     alignment and structure
>3ppe_A Vascular endothelial cadherin; extracellular cadherin (EC) domain, beta barrel, IG-domain L domain swapped dimer interface; 2.10A {Gallus gallus} Back     alignment and structure
>3k6f_A T-cadherin; cell adhesion, calcium, cell membrane, cleavage on PAIR of basic residues, glycoprotein, GPI-anchor, lipoprotein, membrane; 1.81A {Mus musculus} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 609
d1u2ca1103 b.1.6.2 (A:58-160) Dystroglycan, N-terminal domain 7e-04
>d1u2ca1 b.1.6.2 (A:58-160) Dystroglycan, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 103 Back     information, alignment and structure

class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: Cadherin-like
family: Dystroglycan, N-terminal domain
domain: Dystroglycan, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 37.3 bits (86), Expect = 7e-04
 Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 14/105 (13%)

Query: 213 VPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFD 272
           VP +   I    A+ G      +P D              + +    +  +P   WL +D
Sbjct: 2   VPTV-VGIPDGTAVVGRSFRVSIPTDLIASSG------EIIKVSAAGKEALP--SWLHWD 52

Query: 273 AKNQEFYGIPQLTDLGR-----REYQLVCKDSNGLMANDGLEVIV 312
             +    G+P  TD G         +L    S+    +    + V
Sbjct: 53  PHSHILEGLPLDTDKGVHYISVSAARLGANGSHVPQTSSVFSIEV 97


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query609
d1u2ca1103 Dystroglycan, N-terminal domain {Mouse (Mus muscul 99.69
d1u2ca1103 Dystroglycan, N-terminal domain {Mouse (Mus muscul 98.66
d1edha2112 E-cadherin (epithelial) {Mouse (Mus musculus) [Tax 91.68
d1l3wa4107 C-cadherin ectodomain {African clawed frog (Xenopu 91.18
d1ncia_102 N-cadherin (neural) {Mouse (Mus musculus) [TaxId: 90.47
d1l3wa2113 C-cadherin ectodomain {African clawed frog (Xenopu 90.3
d2omzb1104 E-cadherin (epithelial) {Human (Homo sapiens) [Tax 87.38
d1op4a_136 N-cadherin (neural) {Mouse (Mus musculus) [TaxId: 86.0
d1ncja2114 N-cadherin (neural) {Mouse (Mus musculus) [TaxId: 85.68
d1l3wa1100 C-cadherin ectodomain {African clawed frog (Xenopu 84.49
d1l3wa5107 C-cadherin ectodomain {African clawed frog (Xenopu 83.0
>d1u2ca1 b.1.6.2 (A:58-160) Dystroglycan, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: Cadherin-like
family: Dystroglycan, N-terminal domain
domain: Dystroglycan, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=99.69  E-value=2.3e-17  Score=144.00  Aligned_cols=94  Identities=18%  Similarity=0.291  Sum_probs=82.1

Q ss_pred             CceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEE
Q psy12980        213 VPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREY  292 (609)
Q Consensus       213 aPvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f  292 (609)
                      +|++ ..|+++.+.+|+.|+|+||+++|.    |+  .++|++...+|.+||  +||+||+.+++|+|+|...++|.|.|
T Consensus         2 ~P~i-~~ipd~~a~~G~~f~~~ip~~~f~----~d--~~t~~~~~~~g~~LP--sWl~fd~~tg~~~GtP~~~~~G~~~i   72 (103)
T d1u2ca1           2 VPTV-VGIPDGTAVVGRSFRVSIPTDLIA----SS--GEIIKVSAAGKEALP--SWLHWDPHSHILEGLPLDTDKGVHYI   72 (103)
T ss_dssp             CCCC-CCCCCEEEETTSCEEEECCHHHHC----CT--TEEEEEEETTSSSCC--TTEEEETTTTEEEECCCGGGCEEEEE
T ss_pred             CCEe-ccCCCceEeCCceEEEeCCCCCcC----CC--ccEEEEEecCCCCCC--ccEEEeccCCEEEEccCCCCceEEEE
Confidence            4666 359999999999999999999995    44  467777778999998  99999999999999999999999999


Q ss_pred             EEEEEcCC-----CCeeEeeEEEEEECC
Q psy12980        293 QLVCKDSN-----GLMANDGLEVIVKSP  315 (609)
Q Consensus       293 ~LtATDs~-----G~sasd~F~V~V~~~  315 (609)
                      .|+|+|.+     |..+.+.|.|.|.+.
T Consensus        73 ~vtAtD~~~~~s~~~~~~dvf~i~V~P~  100 (103)
T d1u2ca1          73 SVSAARLGANGSHVPQTSSVFSIEVYPE  100 (103)
T ss_dssp             EEEEEEECTTSCEEEEEEEEEEEEEESS
T ss_pred             EEEEEeCCCCcccccEEEEEEEEEeCCC
Confidence            99999965     566778999999874



>d1u2ca1 b.1.6.2 (A:58-160) Dystroglycan, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1edha2 b.1.6.1 (A:102-213) E-cadherin (epithelial) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1l3wa4 b.1.6.1 (A:327-433) C-cadherin ectodomain {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure
>d1ncia_ b.1.6.1 (A:) N-cadherin (neural) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1l3wa2 b.1.6.1 (A:101-213) C-cadherin ectodomain {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure
>d1op4a_ b.1.6.1 (A:) N-cadherin (neural) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1ncja2 b.1.6.1 (A:102-215) N-cadherin (neural) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1l3wa1 b.1.6.1 (A:1-100) C-cadherin ectodomain {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure
>d1l3wa5 b.1.6.1 (A:434-540) C-cadherin ectodomain {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure