Domain diagrams of select SEA-containing proteins. Domains or regions with sequence motifs are shown as rectangular boxes. N-terminal signal peptides, transmembrane segments, and EGF domains are shown as yellow, blue, and green unlabeled boxes, respectively. Other domains or regions are labeled in the boxes with their names or abbreviations. The abbreviations are: 7TM - GPCR seven-pass transmembrane domain; ANK - ankyrin repeats; CA - cadherin domain; Cad_C - cadherin cytoplasmic domain; CADG - cadherin-like domain in dystroglycan; CU - cupredoxin domain; D - DUF3454 domain; FG - FG repeat region in nucleoporins; KAZ - Kazal domain; L - LDLa domain; LamG - laminin G domain; ND - N-terminal domain of EpCAM; PTPc - protein phosphatase catalytic domain; S/T - serine- and threonine-rich region; UPAR - UPAR_LY6_2 domain; V - VWC domain. The CADG+SEA modules are highlighted with thick red outlines. Names of SEA domains with the autoproteolysis motif are shown in yellow font. Domain diagrams above and below the dashed lines are drawn at different length scales (indicated by the arrows) to accommodate several large proteins with more than 3,000 amino acid residues. Most of these proteins are human; the exceptions are the sea urchin sperm protein, a Monosiga protein with canonical SEA domains, and the DE-cadherin (shotgun) of D. melanogaster. GenBank accession numbers and protein lengths are shown for each protein.