RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12966
(741 letters)
>gnl|CDD|215878 pfam00362, Integrin_beta, Integrin, beta chain. Integrins have
been found in animals and their homologues have also
been found in cyanobacteria, probably due to horizontal
gene transfer. The sequences repeats have been trimmed
due to an overlap with EGF.
Length = 424
Score = 581 bits (1500), Expect = 0.0
Identities = 208/435 (47%), Positives = 262/435 (60%), Gaps = 42/435 (9%)
Query: 17 DKKRCFQPSLNADDNLEQ--CPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSS 74
+ RC + ++L + C EE + NP + V +N+ L
Sbjct: 25 NSPRC-----DTLESLLEKGCSEEDIENPRSELEVTENKPL------------------- 60
Query: 75 SSGSFSSGGSASGHSEAVQISPQHVSLKLRINEAYRMVVDYAQAEDYPVDLYYLMDLSNS 134
+AVQISPQ V+LKLR E + QAEDYPVDLYYLMDLS S
Sbjct: 61 --------SDKGSGDDAVQISPQKVTLKLRPGEPQTFNLKVRQAEDYPVDLYYLMDLSYS 112
Query: 135 MRDDKDRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVPKNLLEPCA----GC 190
M+DD + L LG L+ M +TSNFRLGFGSFVDK V PYVS VP+ L PC+ GC
Sbjct: 113 MKDDLENLKTLGTDLAKEMANITSNFRLGFGSFVDKTVSPYVSTVPEKLKNPCSSKNPGC 172
Query: 191 AAPYGYHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGGFDAIMQAIVCKEEIGWRDRARR 250
P+G+ +V+SL+ DT F+ +VK +SGNLD PEGGFDAIMQA VC EEIGWR+ ARR
Sbjct: 173 QPPFGFRHVLSLTDDTDLFNEEVKKQKISGNLDAPEGGFDAIMQAAVCGEEIGWRNEARR 232
Query: 251 LLVFSTDAEFHHAGDGKLGGIVKPNDGLCHMDRNGMYTHSTVQDYPSISQINMKVKQNSI 310
LLVF+TDA FH AGDGKLGGIV+PNDG CH+D NG YT ST DYPS+ Q+ K+ +N+I
Sbjct: 233 LLVFTTDAGFHFAGDGKLGGIVEPNDGQCHLDDNGEYTASTTLDYPSVGQLAEKLSENNI 292
Query: 311 NLIFAVTAEQIGVYERLKTHIEGSSSGTLTNDSSNVVDLVKDQYNKISSSVEMK-DTSSS 369
IFAVT + +YE L I GS+ G L++DSSNVV L+KD YNKISS VE++ D
Sbjct: 293 QPIFAVTENVVDLYEELSELIPGSTVGELSSDSSNVVQLIKDAYNKISSKVELEHDNLPD 352
Query: 370 AVKVTYHSKCLNKDGPSKPTAKCDGLKVGTVVHFEIDIEVTACPANRSEWMQTFYIYPVG 429
V V+Y S C + KC +K+G V F +++ CP + TF I P+G
Sbjct: 353 GVSVSYTSDCPGGEEL-PGKGKCSNVKIGDEVSFNVEVTAKECPKEGKKT--TFTIKPLG 409
Query: 430 IDETLRVDLEMQCEC 444
+ L V+LE CEC
Sbjct: 410 FSDELTVELEFICEC 424
>gnl|CDD|197563 smart00187, INB, Integrin beta subunits (N-terminal portion of
extracellular region). Portion of beta integrins that
lies N-terminal to their EGF-like repeats. Integrins are
cell adhesion molecules that mediate cell-extracellular
matrix and cell-cell interactions. They contain both
alpha and beta subunits. Beta integrins are proposed to
have a von Willebrand factor type-A "insert" or "I"
-like domain (although this remains to be confirmed).
Length = 423
Score = 537 bits (1386), Expect = 0.0
Identities = 218/430 (50%), Positives = 268/430 (62%), Gaps = 38/430 (8%)
Query: 20 RCFQPSLNADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSF 79
RC A+ + C E + NPA+ VL+++ L
Sbjct: 27 RC---DTRANLLAKGCSPESIENPASSAEVLEDKPL------------------------ 59
Query: 80 SSGGSASGHSEAVQISPQHVSLKLRINEAYRMVVDYAQAEDYPVDLYYLMDLSNSMRDDK 139
S GS +AVQ+SPQ V LKLR E + QAEDYPVDLYYLMDLS SM+DD
Sbjct: 60 SDKGSGG---QAVQVSPQRVRLKLRPGEPQNFTLTVRQAEDYPVDLYYLMDLSYSMKDDL 116
Query: 140 DRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVPKNLLEPCAG----CAAPYG 195
D L +LGD L+ M+ +TSNFRLGFGSFVDK V P+VS P+ L PC C PYG
Sbjct: 117 DNLKSLGDDLAREMKGLTSNFRLGFGSFVDKTVSPFVSTHPEKLENPCPNYNKTCEPPYG 176
Query: 196 YHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGGFDAIMQAIVCKEEIGWRDRARRLLVFS 255
+ +V+SL+ DT F+ +V +SGNLD PEGGFDAIMQA VC E+IGWR+ ARRLLVFS
Sbjct: 177 FKHVLSLTDDTDEFNEEVGKQRISGNLDAPEGGFDAIMQAAVCTEQIGWREDARRLLVFS 236
Query: 256 TDAEFHHAGDGKLGGIVKPNDGLCHMDRNGMYTHSTVQDYPSISQINMKVKQNSINLIFA 315
TDA FH AGDGKLGGIV PNDG CH+D NG YT ST QDYPSI Q+ K+ +N+IN IFA
Sbjct: 237 TDAGFHFAGDGKLGGIVTPNDGQCHLDNNGEYTMSTTQDYPSIGQLAQKLAENNINPIFA 296
Query: 316 VTAEQIGVYERLKTHIEGSSSGTLTNDSSNVVDLVKDQYNKISSSVEMKDTSS-SAVKVT 374
VT +Q+ +Y+ L I GSS G L+ DSSNVV+L+KD YNKISS VE++D + V VT
Sbjct: 297 VTKKQVSLYKELSALIPGSSVGELSEDSSNVVELIKDAYNKISSRVELEDNALPEGVSVT 356
Query: 375 YHSKCLNKDGPSKPTAKCDGLKVGTVVHFEIDIEVTACPANRSEWMQTFYIYPVGIDETL 434
Y S C T KC+G+K+G V FE+ + T CP E Q+ I PVG ETL
Sbjct: 357 YTSSCPG-GVTGPGTRKCEGVKIGDTVSFEVTVTATKCPPEDQE--QSIRIRPVGFSETL 413
Query: 435 RVDLEMQCEC 444
V++ C+C
Sbjct: 414 EVEVTFLCDC 423
>gnl|CDD|149701 pfam08725, Integrin_b_cyt, Integrin beta cytoplasmic domain.
Integrins are a group of transmembrane proteins which
function as extracellular matrix receptors and in cell
adhesion. Integrins are ubiquitously expressed and are
heterodimeric, each composed of an alpha and beta
subunit. Several variations of the the alpha and beta
subunits exist, and association of different alpha and
beta subunits can have different a different binding
specificity. This domain corresponds to the cytoplasmic
domain of the beta subunit.
Length = 46
Score = 62.2 bits (152), Expect = 1e-12
Identities = 21/31 (67%), Positives = 24/31 (77%)
Query: 709 KILTSIHDRREFAKFEKERMLAKWDTKLNSL 739
K+L +IHDRRE+AKFEKER AKWD N L
Sbjct: 1 KLLVTIHDRREYAKFEKERKKAKWDKGENPL 31
>gnl|CDD|219669 pfam07965, Integrin_B_tail, Integrin beta tail domain. This is the
beta tail domain of the Integrin protein. Integrins are
receptors which are involved in cell-cell and
cell-extracellular matrix interactions.
Length = 84
Score = 48.5 bits (116), Expect = 3e-07
Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 16/80 (20%)
Query: 619 QEFKNCIQCQVLQD--------------RTIVRGGNEAKDE--NLCVYFDEDECKFEYVY 662
E K+C++C+ I+ +E + + C DED+C F + Y
Sbjct: 2 TEKKDCVECRAFGTGPLKDEKNCSRECKDEILTVVDELEKDDSKNCKERDEDDCWFYFTY 61
Query: 663 WYDAQGKIHLRAQQERECPP 682
D GK + Q+ERECP
Sbjct: 62 EEDGSGKYIVYVQKERECPE 81
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 36.6 bits (85), Expect = 0.009
Identities = 15/29 (51%), Positives = 16/29 (55%)
Query: 53 RHLAKASKYHHSQHSSSSSSSSSSGSFSS 81
AK + SSSSSSSSSS S SS
Sbjct: 90 SRSAKKISAKKRRRSSSSSSSSSSSSSSS 118
Score = 32.8 bits (75), Expect = 0.16
Identities = 14/34 (41%), Positives = 16/34 (47%)
Query: 48 SVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSS 81
V ++K SSSSSSSSSS S S
Sbjct: 87 HVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120
Score = 32.8 bits (75), Expect = 0.20
Identities = 15/31 (48%), Positives = 16/31 (51%)
Query: 57 KASKYHHSQHSSSSSSSSSSGSFSSGGSASG 87
A K + SSSSSSSS S SS S S
Sbjct: 92 SAKKISAKKRRRSSSSSSSSSSSSSSSSESS 122
Score = 31.6 bits (72), Expect = 0.43
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 53 RHLAKASKYHHSQHSSSSSSSSSSGSFSSGGSAS 86
++++K ++ SSSSSSS S SS S+
Sbjct: 87 HVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120
Score = 30.9 bits (70), Expect = 0.83
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 53 RHLAKASKYHHSQHSSSSSSSSSSGSFSSGGSAS 86
RH++ S S SSSSSS S SS S+S
Sbjct: 86 RHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSS 119
>gnl|CDD|221039 pfam11234, DUF3036, Protein of unknown function (DUF3036). Some
members in this family of proteins are annotated as
yoaS. Currently no function is known.
Length = 155
Score = 33.0 bits (76), Expect = 0.23
Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 1/42 (2%)
Query: 681 PPHVYILGLVL-GVIGAIVLIGLAFLCLWKILTSIHDRREFA 721
P Y+L VL GV + +A +K+L I F+
Sbjct: 39 PEIAYLLYPVLLGVYLGAIPFYVALYQAFKLLRYIDRNTAFS 80
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 34.1 bits (78), Expect = 0.35
Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 5/78 (6%)
Query: 23 QPSLNADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSG 82
+ S + D + EQ E+ P+ + + SSSS S SSSGS S
Sbjct: 361 KLSSSEDSDEEQATEK---PPSRNTPPSAPSSNPEPAASSSGSSSSSSGSESSSGSDSES 417
Query: 83 GSASGHSEAVQISPQHVS 100
S+S SE + P +
Sbjct: 418 ESSSSDSEENE--PPRTA 433
>gnl|CDD|238119 cd00198, vWFA, Von Willebrand factor type A (vWA) domain was
originally found in the blood coagulation protein von
Willebrand factor (vWF). Typically, the vWA domain is
made up of approximately 200 amino acid residues folded
into a classic a/b para-rossmann type of fold. The vWA
domain, since its discovery, has drawn great interest
because of its widespread occurrence and its involvement
in a wide variety of important cellular functions. These
include basal membrane formation, cell migration, cell
differentiation, adhesion, haemostasis, signaling,
chromosomal stability, malignant transformation and in
immune defenses In integrins these domains form
heterodimers while in vWF it forms multimers. There are
different interaction surfaces of this domain as seen by
the various molecules it complexes with. Ligand binding
in most cases is mediated by the presence of a metal ion
dependent adhesion site termed as the MIDAS motif that
is a characteristic feature of most, if not all A
domains.
Length = 161
Score = 32.5 bits (74), Expect = 0.37
Identities = 28/141 (19%), Positives = 49/141 (34%), Gaps = 27/141 (19%)
Query: 123 VDLYYLMDLSNSMRDDK-DRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVPK 181
D+ +L+D+S SM +K D+ L +S+ R+G +F
Sbjct: 1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNA---------- 50
Query: 182 NLLEPCAGCAAPYGYHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGGFDAIMQAIVCKEE 241
V+ L+ DT + + L G A+ A+ +
Sbjct: 51 ---------------RVVLPLTTDTDKADLLEAIDALKKGLGGGTNIGAALRLALELLKS 95
Query: 242 IGWRDRARRLLVFSTDAEFHH 262
R ARR+++ TD E +
Sbjct: 96 AK-RPNARRVIILLTDGEPND 115
>gnl|CDD|221321 pfam11928, DUF3446, Domain of unknown function (DUF3446). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 80 to 99 amino acids in length. This domain is
found associated with pfam00096. This domain has a
single completely conserved residue P that may be
functionally important.
Length = 84
Score = 30.6 bits (69), Expect = 0.47
Identities = 17/34 (50%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 64 SQHSSSSSSSSSSGSFSSGGSAS-GHSEAVQISP 96
S SSSSSSSSS S S S S SE I
Sbjct: 36 SSSPSSSSSSSSSSSQSPPLSCSVHQSEPSPIYS 69
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.
Members of this protein share two defining regions. One
is a histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is
predicted to perform peptide modifications, likely in
the His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 32.5 bits (74), Expect = 0.47
Identities = 21/64 (32%), Positives = 25/64 (39%), Gaps = 4/64 (6%)
Query: 24 PSLNADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSGG 83
+ DDNLE P + P NV + N S H HSS SS S +G S
Sbjct: 24 EAGLTDDNLE--PTDVELAPLNV--DIPNLLAGHRSHSSHRSHSSHSSHYSGAGGSYSVP 79
Query: 84 SASG 87
S
Sbjct: 80 SGDT 83
Score = 31.0 bits (70), Expect = 1.3
Identities = 14/67 (20%), Positives = 21/67 (31%), Gaps = 1/67 (1%)
Query: 24 PSLN-ADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSG 82
+L A PE + + + ++ L + H S SS S S S
Sbjct: 10 LALGFAPTKGAAAPEAGLTDDNLEPTDVELAPLNVDIPNLLAGHRSHSSHRSHSSHSSHY 69
Query: 83 GSASGHS 89
A G
Sbjct: 70 SGAGGSY 76
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 32.9 bits (75), Expect = 0.85
Identities = 13/46 (28%), Positives = 20/46 (43%)
Query: 56 AKASKYHHSQHSSSSSSSSSSGSFSSGGSASGHSEAVQISPQHVSL 101
A K H +S SS+ + S S+ + S S+ I P +S
Sbjct: 235 AVLLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSR 280
>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
This is a family of conserved proteins of from 500 - 600
residues found from worms to humans. Its function is not
known.
Length = 515
Score = 32.4 bits (74), Expect = 1.0
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 42 NPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSGGS--ASGHSEAVQISP 96
P + S ++ +K S HS S S +SSSSGS ++ GS A+G E + +
Sbjct: 74 KPNDTLSGASSKLSSKHSGSSHSLASVSDRNSSSSGSCANSGSWEAAGMEEPINTTD 130
>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain. This family contains EGF
domains found in a variety of extracellular proteins.
Length = 31
Score = 28.2 bits (63), Expect = 1.2
Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 2/30 (6%)
Query: 456 RNSPNCSGFGTFKC--GLCECDQSHFGRRC 483
S C+G GT G C CD + G C
Sbjct: 2 SASGICNGRGTCVRPCGKCVCDSGYQGATC 31
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 31.8 bits (72), Expect = 1.5
Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 6/50 (12%)
Query: 687 LGLVLGVIGAIVLIGLAFLCLWKILTSIHDRREFAKFEKERMLAKWDTKL 736
L L+L ++GAI +I L LW R+E + + ++R L + +K
Sbjct: 4 LRLILIIVGAIAIIALLVHGLWT------SRKERSSYFRDRPLKRMKSKR 47
>gnl|CDD|237086 PRK12386, PRK12386, fumarate reductase iron-sulfur subunit;
Provisional.
Length = 251
Score = 31.2 bits (71), Expect = 1.5
Identities = 26/78 (33%), Positives = 36/78 (46%), Gaps = 20/78 (25%)
Query: 564 VTVAPLMT-PVYLLREERCARARESVSAVCANVLKIARE--------DIPAGSARNARPV 614
VTV P+ T PV R+ V+ V N K ARE D+ G R +
Sbjct: 86 VTVTPMRTFPV----------IRDLVTDVSFNYEK-AREIPSFTPPKDLQPGEYRMQQVD 134
Query: 615 LEDVQEFKNCIQCQVLQD 632
+E QEF+ CI+C + Q+
Sbjct: 135 VERSQEFRKCIECFLCQN 152
>gnl|CDD|234208 TIGR03436, acidobact_VWFA, VWFA-related Acidobacterial domain.
Members of this family are bacterial domains that
include a region related to the von Willebrand factor
type A (VWFA) domain (pfam00092). These domains are
restricted to, and have undergone a large paralogous
family expansion in, the Acidobacteria, including
Solibacter usitatus and Acidobacterium capsulatum ATCC
51196.
Length = 296
Score = 31.1 bits (71), Expect = 2.1
Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 112 VVDYAQAEDYPVDLYYLMDLSNSMRDDKDRL-SALGDQLSASMQ 154
+ + + D P+ + ++D S SMR+D DR +A L ++
Sbjct: 43 IASFRRETDLPLTVGLVIDTSGSMRNDLDRARAAAIRFLKTVLR 86
>gnl|CDD|222192 pfam13519, VWA_2, von Willebrand factor type A domain.
Length = 172
Score = 30.0 bits (68), Expect = 2.6
Identities = 27/142 (19%), Positives = 49/142 (34%), Gaps = 38/142 (26%)
Query: 124 DLYYLMDLSNSMRDD---KDRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVP 180
DL ++D+S SM RL+ ++ + + + R+G +F
Sbjct: 1 DLVIVLDVSGSMNATDLKPSRLTRAKAAIADLLARLPGD-RVGLIAF------------- 46
Query: 181 KNLLEPCAGCAAPYGYHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGG---FDAIMQAIV 237
AG A + V+ L+ D + +A + + P GG A+ A+
Sbjct: 47 -------AGSA-----YLVLPLTDDRAALAAALPALSPRI---MPGGGTNLAAALALAL- 90
Query: 238 CKEEIGWRDRARRLLVFSTDAE 259
+ +V TD E
Sbjct: 91 --RLLAGAGGGSGAIVLITDGE 110
>gnl|CDD|163683 cd07945, DRE_TIM_CMS, Leptospira interrogans citramalate synthase
(CMS) and related proteins, N-terminal catalytic TIM
barrel domain. Citramalate synthase (CMS) catalyzes the
conversion of pyruvate and acetyl-CoA to (R)-citramalate
in the first dedicated step of the citramalate pathway.
Citramalate is only found in Leptospira interrogans and
a few other microorganisms. This family belongs to the
DRE-TIM metallolyase superfamily. DRE-TIM metallolyases
include 2-isopropylmalate synthase (IPMS),
alpha-isopropylmalate synthase (LeuA),
3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate
synthase, citramalate synthase, 4-hydroxy-2-oxovalerate
aldolase, re-citrate synthase, transcarboxylase 5S,
pyruvate carboxylase, AksA, and FrbC. These members all
share a conserved triose-phosphate isomerase (TIM)
barrel domain consisting of a core beta(8)-alpha(8)
motif with the eight parallel beta strands forming an
enclosed barrel surrounded by eight alpha helices. The
domain has a catalytic center containing a divalent
cation-binding site formed by a cluster of invariant
residues that cap the core of the barrel. In addition,
the catalytic site includes three invariant residues -
an aspartate (D), an arginine (R), and a glutamate (E) -
which is the basis for the domain name "DRE-TIM".
Length = 280
Score = 30.8 bits (70), Expect = 2.6
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 110 RMVVDYAQAEDYPVDLYYLMDLSNSMRDDKDRLSALGDQLS 150
R V++YA V++Y L D SN MRD D + L D LS
Sbjct: 118 REVIEYAIKNGIEVNIY-LEDWSNGMRDSPDYVFQLVDFLS 157
>gnl|CDD|227559 COG5234, CIN1, Beta-tubulin folding cofactor D [Posttranslational
modification, protein turnover, chaperones /
Cytoskeleton].
Length = 993
Score = 30.7 bits (69), Expect = 3.4
Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 1/98 (1%)
Query: 278 LCHMDRNGMYTHSTVQDYPSISQINMKVK-QNSINLIFAVTAEQIGVYERLKTHIEGSSS 336
+ + YP + I+ N+++ I T E+ E + I S
Sbjct: 638 IISFRYQEKICKLLLDIYPQLHSIDYSAPIANALHNIVPFTYEKSESIEEFRKEILNVLS 697
Query: 337 GTLTNDSSNVVDLVKDQYNKISSSVEMKDTSSSAVKVT 374
LT+ +V ++ K+ SS +KD+S + +
Sbjct: 698 NYLTDTRGDVGSWIRKPAMKLMSSFLVKDSSGKKLYII 735
>gnl|CDD|237114 PRK12488, PRK12488, acetate permease; Provisional.
Length = 549
Score = 30.2 bits (68), Expect = 4.2
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 696 AIVLIGLAFLCLWKILTSIHDRREFAKFEKERMLAKW 732
A+ + LAFL W + S+ DR E A E+ R LA++
Sbjct: 500 ALFSMSLAFLSAW--VFSVTDRSERAVAERGRYLAQF 534
>gnl|CDD|220811 pfam10565, NMDAR2_C, N-methyl D-aspartate receptor 2B3 C-terminus.
This domain is found at the C-terminus of many
NMDA-receptor proteins, many of which also carry the
Ligated ion-channel family pfam00060 further upstream as
well as the ANF_receptor family pfam01094. This region
is predicted to be a large extra-cellular domain of the
NMDA receptor proteins, being highly hydrophilic, and is
thought to be integrally involved in the function of the
receptor. The region also carries a number of potential
N-glycosylation sites.
Length = 660
Score = 30.2 bits (68), Expect = 4.7
Identities = 20/69 (28%), Positives = 28/69 (40%), Gaps = 9/69 (13%)
Query: 24 PSLNADDNLEQCPEEYVF--NPANVFSVLDNRHLAKASK-----YHHSQHSSSSSSSSSS 76
SL DN PE+ +F + V N HL K S H H+ + ++ +S
Sbjct: 99 RSLAYSDNPPYLPEDNMFSDYISEVERTFGNLHL-KDSNNYQDHGLH-HHTLNEANPNSL 156
Query: 77 GSFSSGGSA 85
GS SS
Sbjct: 157 GSASSLDGG 165
>gnl|CDD|222404 pfam13829, DUF4191, Domain of unknown function (DUF4191).
Length = 223
Score = 29.4 bits (67), Expect = 5.2
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 685 YILGLVLGVIGAIVLIGLAFLCLW 708
+LG LGV+ V++GL F W
Sbjct: 30 LMLGAFLGVLAVFVVLGLLFGSWW 53
>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
Length = 349
Score = 29.8 bits (67), Expect = 5.9
Identities = 16/35 (45%), Positives = 17/35 (48%)
Query: 49 VLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSGG 83
L H +K S SSS SSS SS S SS G
Sbjct: 253 PLTTFHADSVAKEGRSSSSSSRSSSDSSTSSSSDG 287
>gnl|CDD|221661 pfam12609, DUF3774, Wound-induced protein. This family of proteins
is found in eukaryotes. Proteins in this family are
typically between 81 and 97 amino acids in length. The
proteins in the family are often annotated as
wound-induced proteins however there is little
accompanying literature to confirm this.
Length = 78
Score = 27.4 bits (61), Expect = 6.0
Identities = 13/47 (27%), Positives = 23/47 (48%)
Query: 56 AKASKYHHSQHSSSSSSSSSSGSFSSGGSASGHSEAVQISPQHVSLK 102
A S + ++ ++ S+S + S SS SAS + V+ SL+
Sbjct: 22 ALRSLHQAAKANARRSASQAGKSSSSSSSASAAAAEVKAKQAEESLR 68
>gnl|CDD|178666 PLN03119, PLN03119, putative ADP-ribosylation factor
GTPase-activating protein AGD14; Provisional.
Length = 648
Score = 29.8 bits (66), Expect = 6.7
Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 3/30 (10%)
Query: 22 FQPSLNADD---NLEQCPEEYVFNPANVFS 48
FQPS+ NL+Q P+ Y F PAN F+
Sbjct: 343 FQPSIAPRSPPVNLQQAPQTYSFTPANSFA 372
>gnl|CDD|222050 pfam13332, Fil_haemagg_2, Haemagluttinin repeat.
Length = 170
Score = 28.7 bits (65), Expect = 6.7
Identities = 11/40 (27%), Positives = 17/40 (42%), Gaps = 3/40 (7%)
Query: 51 DNRHLAKASKYHHSQHSSSSSSSSSSG---SFSSGGSASG 87
N +A++ S S++ SS S G + GG G
Sbjct: 55 ANDINLEAAQNTQSTSSTNKSSGGSVGVSVGYGGGGGGLG 94
>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645). These
sequences are derived from a number of hypothetical
plant proteins. The region in question is approximately
270 amino acids long. Some members of this family are
annotated as yeast pheromone receptor proteins AR781 but
no literature was found to support this.
Length = 191
Score = 29.0 bits (65), Expect = 7.4
Identities = 10/19 (52%), Positives = 12/19 (63%)
Query: 67 SSSSSSSSSSGSFSSGGSA 85
SSS +SSS S S+G S
Sbjct: 84 VSSSPASSSRKSSSTGSSK 102
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 27.7 bits (62), Expect = 7.4
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 64 SQHSSSSSSSSSSGSFSSGGSASGHS 89
S +SSSSSS+SS SS S+SG S
Sbjct: 29 SGSNSSSSSSNSSNGGSSSSSSSGDS 54
>gnl|CDD|218233 pfam04732, Filament_head, Intermediate filament head (DNA
binding) region. This family represents the N-terminal
head region of intermediate filaments. Intermediate
filament heads bind DNA. Vimentin heads are able to
alter nuclear architecture and chromatin distribution,
and the liberation of heads by HIV-1 protease liberates
may play an important role in HIV-1 associated
cytopathogenesis and carcinogenesis. Phosphorylation of
the head region can affect filament stability. The head
has been shown to interaction with the rod domain of
the same protein.
Length = 89
Score = 27.5 bits (61), Expect = 8.1
Identities = 12/32 (37%), Positives = 14/32 (43%)
Query: 58 ASKYHHSQHSSSSSSSSSSGSFSSGGSASGHS 89
+S + SSSS S S S SS S S
Sbjct: 21 SSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSL 52
>gnl|CDD|181437 PRK08470, PRK08470, adenylosuccinate lyase; Provisional.
Length = 442
Score = 29.3 bits (66), Expect = 9.0
Identities = 24/82 (29%), Positives = 34/82 (41%), Gaps = 14/82 (17%)
Query: 598 IAREDIPAGSARNARPVLEDVQEFKNCIQCQ----VLQDRTIVRGGNEAKDENLCVYFDE 653
++RED RNA V ED+Q+ K I + LQ DE+L E
Sbjct: 366 VSREDAYKIVQRNAMKVWEDLQQGKAAINEKGESLFLQ--------ALLNDEDLRKSLSE 417
Query: 654 DECK--FEYVYWYDAQGKIHLR 673
+E + F+Y Y+ I R
Sbjct: 418 EEIRACFDYSYYTKNVDAIFKR 439
>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
domain. Spiders use fibroins to make silk strands.
This family includes tubuliform silk fibroins which are
used to protect egg cases. This domain is a structural
domain which is found in repeats of up to 20 in many
individuals (although this is not necessarily the
case). RP1 makes up structural domains in the N
terminal while RP2 makes up structural domains in the C
terminal.
Length = 167
Score = 28.3 bits (63), Expect = 9.8
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 64 SQHSSSSSSSSSSGSFSSGGSAS 86
SQ +SS+SSSSS+ +F+ S++
Sbjct: 4 SQAASSASSSSSASAFAQSLSSA 26
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.133 0.402
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,672,352
Number of extensions: 3374532
Number of successful extensions: 3984
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3912
Number of HSP's successfully gapped: 45
Length of query: 741
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 637
Effective length of database: 6,324,786
Effective search space: 4028888682
Effective search space used: 4028888682
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.1 bits)