RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12966
         (741 letters)



>gnl|CDD|215878 pfam00362, Integrin_beta, Integrin, beta chain.  Integrins have
           been found in animals and their homologues have also
           been found in cyanobacteria, probably due to horizontal
           gene transfer. The sequences repeats have been trimmed
           due to an overlap with EGF.
          Length = 424

 Score =  581 bits (1500), Expect = 0.0
 Identities = 208/435 (47%), Positives = 262/435 (60%), Gaps = 42/435 (9%)

Query: 17  DKKRCFQPSLNADDNLEQ--CPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSS 74
           +  RC     +  ++L +  C EE + NP +   V +N+ L                   
Sbjct: 25  NSPRC-----DTLESLLEKGCSEEDIENPRSELEVTENKPL------------------- 60

Query: 75  SSGSFSSGGSASGHSEAVQISPQHVSLKLRINEAYRMVVDYAQAEDYPVDLYYLMDLSNS 134
                          +AVQISPQ V+LKLR  E     +   QAEDYPVDLYYLMDLS S
Sbjct: 61  --------SDKGSGDDAVQISPQKVTLKLRPGEPQTFNLKVRQAEDYPVDLYYLMDLSYS 112

Query: 135 MRDDKDRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVPKNLLEPCA----GC 190
           M+DD + L  LG  L+  M  +TSNFRLGFGSFVDK V PYVS VP+ L  PC+    GC
Sbjct: 113 MKDDLENLKTLGTDLAKEMANITSNFRLGFGSFVDKTVSPYVSTVPEKLKNPCSSKNPGC 172

Query: 191 AAPYGYHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGGFDAIMQAIVCKEEIGWRDRARR 250
             P+G+ +V+SL+ DT  F+ +VK   +SGNLD PEGGFDAIMQA VC EEIGWR+ ARR
Sbjct: 173 QPPFGFRHVLSLTDDTDLFNEEVKKQKISGNLDAPEGGFDAIMQAAVCGEEIGWRNEARR 232

Query: 251 LLVFSTDAEFHHAGDGKLGGIVKPNDGLCHMDRNGMYTHSTVQDYPSISQINMKVKQNSI 310
           LLVF+TDA FH AGDGKLGGIV+PNDG CH+D NG YT ST  DYPS+ Q+  K+ +N+I
Sbjct: 233 LLVFTTDAGFHFAGDGKLGGIVEPNDGQCHLDDNGEYTASTTLDYPSVGQLAEKLSENNI 292

Query: 311 NLIFAVTAEQIGVYERLKTHIEGSSSGTLTNDSSNVVDLVKDQYNKISSSVEMK-DTSSS 369
             IFAVT   + +YE L   I GS+ G L++DSSNVV L+KD YNKISS VE++ D    
Sbjct: 293 QPIFAVTENVVDLYEELSELIPGSTVGELSSDSSNVVQLIKDAYNKISSKVELEHDNLPD 352

Query: 370 AVKVTYHSKCLNKDGPSKPTAKCDGLKVGTVVHFEIDIEVTACPANRSEWMQTFYIYPVG 429
            V V+Y S C   +       KC  +K+G  V F +++    CP    +   TF I P+G
Sbjct: 353 GVSVSYTSDCPGGEEL-PGKGKCSNVKIGDEVSFNVEVTAKECPKEGKKT--TFTIKPLG 409

Query: 430 IDETLRVDLEMQCEC 444
             + L V+LE  CEC
Sbjct: 410 FSDELTVELEFICEC 424


>gnl|CDD|197563 smart00187, INB, Integrin beta subunits (N-terminal portion of
           extracellular region).  Portion of beta integrins that
           lies N-terminal to their EGF-like repeats. Integrins are
           cell adhesion molecules that mediate cell-extracellular
           matrix and cell-cell interactions. They contain both
           alpha and beta subunits. Beta integrins are proposed to
           have a von Willebrand factor type-A "insert" or "I"
           -like domain (although this remains to be confirmed).
          Length = 423

 Score =  537 bits (1386), Expect = 0.0
 Identities = 218/430 (50%), Positives = 268/430 (62%), Gaps = 38/430 (8%)

Query: 20  RCFQPSLNADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSF 79
           RC      A+   + C  E + NPA+   VL+++ L                        
Sbjct: 27  RC---DTRANLLAKGCSPESIENPASSAEVLEDKPL------------------------ 59

Query: 80  SSGGSASGHSEAVQISPQHVSLKLRINEAYRMVVDYAQAEDYPVDLYYLMDLSNSMRDDK 139
           S  GS     +AVQ+SPQ V LKLR  E     +   QAEDYPVDLYYLMDLS SM+DD 
Sbjct: 60  SDKGSGG---QAVQVSPQRVRLKLRPGEPQNFTLTVRQAEDYPVDLYYLMDLSYSMKDDL 116

Query: 140 DRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVPKNLLEPCAG----CAAPYG 195
           D L +LGD L+  M+ +TSNFRLGFGSFVDK V P+VS  P+ L  PC      C  PYG
Sbjct: 117 DNLKSLGDDLAREMKGLTSNFRLGFGSFVDKTVSPFVSTHPEKLENPCPNYNKTCEPPYG 176

Query: 196 YHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGGFDAIMQAIVCKEEIGWRDRARRLLVFS 255
           + +V+SL+ DT  F+ +V    +SGNLD PEGGFDAIMQA VC E+IGWR+ ARRLLVFS
Sbjct: 177 FKHVLSLTDDTDEFNEEVGKQRISGNLDAPEGGFDAIMQAAVCTEQIGWREDARRLLVFS 236

Query: 256 TDAEFHHAGDGKLGGIVKPNDGLCHMDRNGMYTHSTVQDYPSISQINMKVKQNSINLIFA 315
           TDA FH AGDGKLGGIV PNDG CH+D NG YT ST QDYPSI Q+  K+ +N+IN IFA
Sbjct: 237 TDAGFHFAGDGKLGGIVTPNDGQCHLDNNGEYTMSTTQDYPSIGQLAQKLAENNINPIFA 296

Query: 316 VTAEQIGVYERLKTHIEGSSSGTLTNDSSNVVDLVKDQYNKISSSVEMKDTSS-SAVKVT 374
           VT +Q+ +Y+ L   I GSS G L+ DSSNVV+L+KD YNKISS VE++D +    V VT
Sbjct: 297 VTKKQVSLYKELSALIPGSSVGELSEDSSNVVELIKDAYNKISSRVELEDNALPEGVSVT 356

Query: 375 YHSKCLNKDGPSKPTAKCDGLKVGTVVHFEIDIEVTACPANRSEWMQTFYIYPVGIDETL 434
           Y S C         T KC+G+K+G  V FE+ +  T CP    E  Q+  I PVG  ETL
Sbjct: 357 YTSSCPG-GVTGPGTRKCEGVKIGDTVSFEVTVTATKCPPEDQE--QSIRIRPVGFSETL 413

Query: 435 RVDLEMQCEC 444
            V++   C+C
Sbjct: 414 EVEVTFLCDC 423


>gnl|CDD|149701 pfam08725, Integrin_b_cyt, Integrin beta cytoplasmic domain.
           Integrins are a group of transmembrane proteins which
           function as extracellular matrix receptors and in cell
           adhesion. Integrins are ubiquitously expressed and are
           heterodimeric, each composed of an alpha and beta
           subunit. Several variations of the the alpha and beta
           subunits exist, and association of different alpha and
           beta subunits can have different a different binding
           specificity. This domain corresponds to the cytoplasmic
           domain of the beta subunit.
          Length = 46

 Score = 62.2 bits (152), Expect = 1e-12
 Identities = 21/31 (67%), Positives = 24/31 (77%)

Query: 709 KILTSIHDRREFAKFEKERMLAKWDTKLNSL 739
           K+L +IHDRRE+AKFEKER  AKWD   N L
Sbjct: 1   KLLVTIHDRREYAKFEKERKKAKWDKGENPL 31


>gnl|CDD|219669 pfam07965, Integrin_B_tail, Integrin beta tail domain.  This is the
           beta tail domain of the Integrin protein. Integrins are
           receptors which are involved in cell-cell and
           cell-extracellular matrix interactions.
          Length = 84

 Score = 48.5 bits (116), Expect = 3e-07
 Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 16/80 (20%)

Query: 619 QEFKNCIQCQVLQD--------------RTIVRGGNEAKDE--NLCVYFDEDECKFEYVY 662
            E K+C++C+                    I+   +E + +    C   DED+C F + Y
Sbjct: 2   TEKKDCVECRAFGTGPLKDEKNCSRECKDEILTVVDELEKDDSKNCKERDEDDCWFYFTY 61

Query: 663 WYDAQGKIHLRAQQERECPP 682
             D  GK  +  Q+ERECP 
Sbjct: 62  EEDGSGKYIVYVQKERECPE 81


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 36.6 bits (85), Expect = 0.009
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 53  RHLAKASKYHHSQHSSSSSSSSSSGSFSS 81
              AK       + SSSSSSSSSS S SS
Sbjct: 90  SRSAKKISAKKRRRSSSSSSSSSSSSSSS 118



 Score = 32.8 bits (75), Expect = 0.16
 Identities = 14/34 (41%), Positives = 16/34 (47%)

Query: 48  SVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSS 81
            V        ++K      SSSSSSSSSS S S 
Sbjct: 87  HVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120



 Score = 32.8 bits (75), Expect = 0.20
 Identities = 15/31 (48%), Positives = 16/31 (51%)

Query: 57  KASKYHHSQHSSSSSSSSSSGSFSSGGSASG 87
            A K    +   SSSSSSSS S SS  S S 
Sbjct: 92  SAKKISAKKRRRSSSSSSSSSSSSSSSSESS 122



 Score = 31.6 bits (72), Expect = 0.43
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 53  RHLAKASKYHHSQHSSSSSSSSSSGSFSSGGSAS 86
              ++++K   ++    SSSSSSS S SS  S+ 
Sbjct: 87  HVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120



 Score = 30.9 bits (70), Expect = 0.83
 Identities = 15/34 (44%), Positives = 18/34 (52%)

Query: 53  RHLAKASKYHHSQHSSSSSSSSSSGSFSSGGSAS 86
           RH++  S    S      SSSSSS S SS  S+S
Sbjct: 86  RHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSS 119


>gnl|CDD|221039 pfam11234, DUF3036, Protein of unknown function (DUF3036).  Some
           members in this family of proteins are annotated as
           yoaS. Currently no function is known.
          Length = 155

 Score = 33.0 bits (76), Expect = 0.23
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 1/42 (2%)

Query: 681 PPHVYILGLVL-GVIGAIVLIGLAFLCLWKILTSIHDRREFA 721
           P   Y+L  VL GV    +   +A    +K+L  I     F+
Sbjct: 39  PEIAYLLYPVLLGVYLGAIPFYVALYQAFKLLRYIDRNTAFS 80


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 34.1 bits (78), Expect = 0.35
 Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 5/78 (6%)

Query: 23  QPSLNADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSG 82
           + S + D + EQ  E+    P+            + +       SSSS S SSSGS S  
Sbjct: 361 KLSSSEDSDEEQATEK---PPSRNTPPSAPSSNPEPAASSSGSSSSSSGSESSSGSDSES 417

Query: 83  GSASGHSEAVQISPQHVS 100
            S+S  SE  +  P   +
Sbjct: 418 ESSSSDSEENE--PPRTA 433


>gnl|CDD|238119 cd00198, vWFA, Von Willebrand factor type A (vWA) domain was
           originally found in the blood coagulation protein von
           Willebrand factor (vWF). Typically, the vWA domain is
           made up of approximately 200 amino acid residues folded
           into a classic a/b para-rossmann type of fold. The vWA
           domain, since its discovery, has drawn great interest
           because of its widespread occurrence and its involvement
           in a wide variety of important cellular functions. These
           include basal membrane formation, cell migration, cell
           differentiation, adhesion, haemostasis, signaling,
           chromosomal stability, malignant transformation and in
           immune defenses  In integrins these domains form
           heterodimers while in vWF it forms multimers. There are
           different interaction surfaces of this domain as seen by
           the various molecules it complexes with. Ligand binding
           in most cases is mediated by the presence of a metal ion
           dependent adhesion site termed as the MIDAS motif that
           is a characteristic feature of most, if not all A
           domains.
          Length = 161

 Score = 32.5 bits (74), Expect = 0.37
 Identities = 28/141 (19%), Positives = 49/141 (34%), Gaps = 27/141 (19%)

Query: 123 VDLYYLMDLSNSMRDDK-DRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVPK 181
            D+ +L+D+S SM  +K D+       L +S+       R+G  +F              
Sbjct: 1   ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNA---------- 50

Query: 182 NLLEPCAGCAAPYGYHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGGFDAIMQAIVCKEE 241
                            V+ L+ DT +         +   L G      A+  A+   + 
Sbjct: 51  ---------------RVVLPLTTDTDKADLLEAIDALKKGLGGGTNIGAALRLALELLKS 95

Query: 242 IGWRDRARRLLVFSTDAEFHH 262
              R  ARR+++  TD E + 
Sbjct: 96  AK-RPNARRVIILLTDGEPND 115


>gnl|CDD|221321 pfam11928, DUF3446, Domain of unknown function (DUF3446).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is typically
          between 80 to 99 amino acids in length. This domain is
          found associated with pfam00096. This domain has a
          single completely conserved residue P that may be
          functionally important.
          Length = 84

 Score = 30.6 bits (69), Expect = 0.47
 Identities = 17/34 (50%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 64 SQHSSSSSSSSSSGSFSSGGSAS-GHSEAVQISP 96
          S   SSSSSSSSS S S   S S   SE   I  
Sbjct: 36 SSSPSSSSSSSSSSSQSPPLSCSVHQSEPSPIYS 69


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.
          Members of this protein share two defining regions. One
          is a histidine/serine-rich cluster, typically
          H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
          the context of a pair of radical SAM proteins, HxsB and
          HxsC, and a fourth protein HxsD. The system is
          predicted to perform peptide modifications, likely in
          the His-Xaa-Ser region, to produce some uncharacterized
          natural product.
          Length = 186

 Score = 32.5 bits (74), Expect = 0.47
 Identities = 21/64 (32%), Positives = 25/64 (39%), Gaps = 4/64 (6%)

Query: 24 PSLNADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSGG 83
           +   DDNLE  P +    P NV   + N      S   H  HSS SS  S +G   S  
Sbjct: 24 EAGLTDDNLE--PTDVELAPLNV--DIPNLLAGHRSHSSHRSHSSHSSHYSGAGGSYSVP 79

Query: 84 SASG 87
          S   
Sbjct: 80 SGDT 83



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 14/67 (20%), Positives = 21/67 (31%), Gaps = 1/67 (1%)

Query: 24 PSLN-ADDNLEQCPEEYVFNPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSG 82
           +L  A       PE  + +     + ++   L        + H S SS  S S   S  
Sbjct: 10 LALGFAPTKGAAAPEAGLTDDNLEPTDVELAPLNVDIPNLLAGHRSHSSHRSHSSHSSHY 69

Query: 83 GSASGHS 89
            A G  
Sbjct: 70 SGAGGSY 76


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 32.9 bits (75), Expect = 0.85
 Identities = 13/46 (28%), Positives = 20/46 (43%)

Query: 56  AKASKYHHSQHSSSSSSSSSSGSFSSGGSASGHSEAVQISPQHVSL 101
           A   K H     +S  SS+ + S S+  + S  S+   I P  +S 
Sbjct: 235 AVLLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSR 280


>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
           This is a family of conserved proteins of from 500 - 600
           residues found from worms to humans. Its function is not
           known.
          Length = 515

 Score = 32.4 bits (74), Expect = 1.0
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 42  NPANVFSVLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSGGS--ASGHSEAVQISP 96
            P +  S   ++  +K S   HS  S S  +SSSSGS ++ GS  A+G  E +  + 
Sbjct: 74  KPNDTLSGASSKLSSKHSGSSHSLASVSDRNSSSSGSCANSGSWEAAGMEEPINTTD 130


>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain.  This family contains EGF
           domains found in a variety of extracellular proteins.
          Length = 31

 Score = 28.2 bits (63), Expect = 1.2
 Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 2/30 (6%)

Query: 456 RNSPNCSGFGTFKC--GLCECDQSHFGRRC 483
             S  C+G GT     G C CD  + G  C
Sbjct: 2   SASGICNGRGTCVRPCGKCVCDSGYQGATC 31


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 6/50 (12%)

Query: 687 LGLVLGVIGAIVLIGLAFLCLWKILTSIHDRREFAKFEKERMLAKWDTKL 736
           L L+L ++GAI +I L    LW        R+E + + ++R L +  +K 
Sbjct: 4   LRLILIIVGAIAIIALLVHGLWT------SRKERSSYFRDRPLKRMKSKR 47


>gnl|CDD|237086 PRK12386, PRK12386, fumarate reductase iron-sulfur subunit;
           Provisional.
          Length = 251

 Score = 31.2 bits (71), Expect = 1.5
 Identities = 26/78 (33%), Positives = 36/78 (46%), Gaps = 20/78 (25%)

Query: 564 VTVAPLMT-PVYLLREERCARARESVSAVCANVLKIARE--------DIPAGSARNARPV 614
           VTV P+ T PV           R+ V+ V  N  K ARE        D+  G  R  +  
Sbjct: 86  VTVTPMRTFPV----------IRDLVTDVSFNYEK-AREIPSFTPPKDLQPGEYRMQQVD 134

Query: 615 LEDVQEFKNCIQCQVLQD 632
           +E  QEF+ CI+C + Q+
Sbjct: 135 VERSQEFRKCIECFLCQN 152


>gnl|CDD|234208 TIGR03436, acidobact_VWFA, VWFA-related Acidobacterial domain.
           Members of this family are bacterial domains that
           include a region related to the von Willebrand factor
           type A (VWFA) domain (pfam00092). These domains are
           restricted to, and have undergone a large paralogous
           family expansion in, the Acidobacteria, including
           Solibacter usitatus and Acidobacterium capsulatum ATCC
           51196.
          Length = 296

 Score = 31.1 bits (71), Expect = 2.1
 Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 112 VVDYAQAEDYPVDLYYLMDLSNSMRDDKDRL-SALGDQLSASMQ 154
           +  + +  D P+ +  ++D S SMR+D DR  +A    L   ++
Sbjct: 43  IASFRRETDLPLTVGLVIDTSGSMRNDLDRARAAAIRFLKTVLR 86


>gnl|CDD|222192 pfam13519, VWA_2, von Willebrand factor type A domain. 
          Length = 172

 Score = 30.0 bits (68), Expect = 2.6
 Identities = 27/142 (19%), Positives = 49/142 (34%), Gaps = 38/142 (26%)

Query: 124 DLYYLMDLSNSMRDD---KDRLSALGDQLSASMQEVTSNFRLGFGSFVDKVVMPYVSMVP 180
           DL  ++D+S SM        RL+     ++  +  +  + R+G  +F             
Sbjct: 1   DLVIVLDVSGSMNATDLKPSRLTRAKAAIADLLARLPGD-RVGLIAF------------- 46

Query: 181 KNLLEPCAGCAAPYGYHNVMSLSQDTSRFSAQVKGANVSGNLDGPEGG---FDAIMQAIV 237
                  AG A     + V+ L+ D +  +A +   +       P GG     A+  A+ 
Sbjct: 47  -------AGSA-----YLVLPLTDDRAALAAALPALSPRI---MPGGGTNLAAALALAL- 90

Query: 238 CKEEIGWRDRARRLLVFSTDAE 259
               +         +V  TD E
Sbjct: 91  --RLLAGAGGGSGAIVLITDGE 110


>gnl|CDD|163683 cd07945, DRE_TIM_CMS, Leptospira interrogans citramalate synthase
           (CMS) and related proteins, N-terminal catalytic TIM
           barrel domain.  Citramalate synthase (CMS) catalyzes the
           conversion of pyruvate and acetyl-CoA to (R)-citramalate
           in the first dedicated step of the citramalate pathway. 
           Citramalate is only found in Leptospira interrogans and
           a few other microorganisms.  This family belongs to the
           DRE-TIM metallolyase superfamily.  DRE-TIM metallolyases
           include 2-isopropylmalate synthase (IPMS),
           alpha-isopropylmalate synthase (LeuA),
           3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate
           synthase, citramalate synthase, 4-hydroxy-2-oxovalerate
           aldolase, re-citrate synthase, transcarboxylase 5S,
           pyruvate carboxylase, AksA, and FrbC.  These members all
           share a conserved  triose-phosphate isomerase (TIM)
           barrel domain consisting of a core beta(8)-alpha(8)
           motif with the eight parallel beta strands forming an
           enclosed barrel surrounded by eight alpha helices.  The
           domain has a catalytic center containing a divalent
           cation-binding site formed by a cluster of invariant
           residues that cap the core of the barrel.  In addition,
           the catalytic site includes three invariant residues -
           an aspartate (D), an arginine (R), and a glutamate (E) -
           which is the basis for the domain name "DRE-TIM".
          Length = 280

 Score = 30.8 bits (70), Expect = 2.6
 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 110 RMVVDYAQAEDYPVDLYYLMDLSNSMRDDKDRLSALGDQLS 150
           R V++YA      V++Y L D SN MRD  D +  L D LS
Sbjct: 118 REVIEYAIKNGIEVNIY-LEDWSNGMRDSPDYVFQLVDFLS 157


>gnl|CDD|227559 COG5234, CIN1, Beta-tubulin folding cofactor D [Posttranslational
           modification, protein turnover, chaperones /
           Cytoskeleton].
          Length = 993

 Score = 30.7 bits (69), Expect = 3.4
 Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 1/98 (1%)

Query: 278 LCHMDRNGMYTHSTVQDYPSISQINMKVK-QNSINLIFAVTAEQIGVYERLKTHIEGSSS 336
           +             +  YP +  I+      N+++ I   T E+    E  +  I    S
Sbjct: 638 IISFRYQEKICKLLLDIYPQLHSIDYSAPIANALHNIVPFTYEKSESIEEFRKEILNVLS 697

Query: 337 GTLTNDSSNVVDLVKDQYNKISSSVEMKDTSSSAVKVT 374
             LT+   +V   ++    K+ SS  +KD+S   + + 
Sbjct: 698 NYLTDTRGDVGSWIRKPAMKLMSSFLVKDSSGKKLYII 735


>gnl|CDD|237114 PRK12488, PRK12488, acetate permease; Provisional.
          Length = 549

 Score = 30.2 bits (68), Expect = 4.2
 Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 696 AIVLIGLAFLCLWKILTSIHDRREFAKFEKERMLAKW 732
           A+  + LAFL  W  + S+ DR E A  E+ R LA++
Sbjct: 500 ALFSMSLAFLSAW--VFSVTDRSERAVAERGRYLAQF 534


>gnl|CDD|220811 pfam10565, NMDAR2_C, N-methyl D-aspartate receptor 2B3 C-terminus. 
           This domain is found at the C-terminus of many
           NMDA-receptor proteins, many of which also carry the
           Ligated ion-channel family pfam00060 further upstream as
           well as the ANF_receptor family pfam01094. This region
           is predicted to be a large extra-cellular domain of the
           NMDA receptor proteins, being highly hydrophilic, and is
           thought to be integrally involved in the function of the
           receptor. The region also carries a number of potential
           N-glycosylation sites.
          Length = 660

 Score = 30.2 bits (68), Expect = 4.7
 Identities = 20/69 (28%), Positives = 28/69 (40%), Gaps = 9/69 (13%)

Query: 24  PSLNADDNLEQCPEEYVF--NPANVFSVLDNRHLAKASK-----YHHSQHSSSSSSSSSS 76
            SL   DN    PE+ +F    + V     N HL K S        H  H+ + ++ +S 
Sbjct: 99  RSLAYSDNPPYLPEDNMFSDYISEVERTFGNLHL-KDSNNYQDHGLH-HHTLNEANPNSL 156

Query: 77  GSFSSGGSA 85
           GS SS    
Sbjct: 157 GSASSLDGG 165


>gnl|CDD|222404 pfam13829, DUF4191, Domain of unknown function (DUF4191). 
          Length = 223

 Score = 29.4 bits (67), Expect = 5.2
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 685 YILGLVLGVIGAIVLIGLAFLCLW 708
            +LG  LGV+   V++GL F   W
Sbjct: 30  LMLGAFLGVLAVFVVLGLLFGSWW 53


>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
          Length = 349

 Score = 29.8 bits (67), Expect = 5.9
 Identities = 16/35 (45%), Positives = 17/35 (48%)

Query: 49  VLDNRHLAKASKYHHSQHSSSSSSSSSSGSFSSGG 83
            L   H    +K   S  SSS SSS SS S SS G
Sbjct: 253 PLTTFHADSVAKEGRSSSSSSRSSSDSSTSSSSDG 287


>gnl|CDD|221661 pfam12609, DUF3774, Wound-induced protein.  This family of proteins
           is found in eukaryotes. Proteins in this family are
           typically between 81 and 97 amino acids in length. The
           proteins in the family are often annotated as
           wound-induced proteins however there is little
           accompanying literature to confirm this.
          Length = 78

 Score = 27.4 bits (61), Expect = 6.0
 Identities = 13/47 (27%), Positives = 23/47 (48%)

Query: 56  AKASKYHHSQHSSSSSSSSSSGSFSSGGSASGHSEAVQISPQHVSLK 102
           A  S +  ++ ++  S+S +  S SS  SAS  +  V+      SL+
Sbjct: 22  ALRSLHQAAKANARRSASQAGKSSSSSSSASAAAAEVKAKQAEESLR 68


>gnl|CDD|178666 PLN03119, PLN03119, putative ADP-ribosylation factor
           GTPase-activating protein AGD14; Provisional.
          Length = 648

 Score = 29.8 bits (66), Expect = 6.7
 Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 3/30 (10%)

Query: 22  FQPSLNADD---NLEQCPEEYVFNPANVFS 48
           FQPS+       NL+Q P+ Y F PAN F+
Sbjct: 343 FQPSIAPRSPPVNLQQAPQTYSFTPANSFA 372


>gnl|CDD|222050 pfam13332, Fil_haemagg_2, Haemagluttinin repeat. 
          Length = 170

 Score = 28.7 bits (65), Expect = 6.7
 Identities = 11/40 (27%), Positives = 17/40 (42%), Gaps = 3/40 (7%)

Query: 51 DNRHLAKASKYHHSQHSSSSSSSSSSG---SFSSGGSASG 87
           N    +A++   S  S++ SS  S G    +  GG   G
Sbjct: 55 ANDINLEAAQNTQSTSSTNKSSGGSVGVSVGYGGGGGGLG 94


>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645).  These
           sequences are derived from a number of hypothetical
           plant proteins. The region in question is approximately
           270 amino acids long. Some members of this family are
           annotated as yeast pheromone receptor proteins AR781 but
           no literature was found to support this.
          Length = 191

 Score = 29.0 bits (65), Expect = 7.4
 Identities = 10/19 (52%), Positives = 12/19 (63%)

Query: 67  SSSSSSSSSSGSFSSGGSA 85
            SSS +SSS  S S+G S 
Sbjct: 84  VSSSPASSSRKSSSTGSSK 102


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
          short domain found in bacterial type II/III secretory
          system proteins. The architecture of these proteins
          suggest that this family may be functionally analogous
          to pfam03958.
          Length = 95

 Score = 27.7 bits (62), Expect = 7.4
 Identities = 15/26 (57%), Positives = 18/26 (69%)

Query: 64 SQHSSSSSSSSSSGSFSSGGSASGHS 89
          S  +SSSSSS+SS   SS  S+SG S
Sbjct: 29 SGSNSSSSSSNSSNGGSSSSSSSGDS 54


>gnl|CDD|218233 pfam04732, Filament_head, Intermediate filament head (DNA
          binding) region.  This family represents the N-terminal
          head region of intermediate filaments. Intermediate
          filament heads bind DNA. Vimentin heads are able to
          alter nuclear architecture and chromatin distribution,
          and the liberation of heads by HIV-1 protease liberates
          may play an important role in HIV-1 associated
          cytopathogenesis and carcinogenesis. Phosphorylation of
          the head region can affect filament stability. The head
          has been shown to interaction with the rod domain of
          the same protein.
          Length = 89

 Score = 27.5 bits (61), Expect = 8.1
 Identities = 12/32 (37%), Positives = 14/32 (43%)

Query: 58 ASKYHHSQHSSSSSSSSSSGSFSSGGSASGHS 89
          +S     + SSSS   S S S SS  S S   
Sbjct: 21 SSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSL 52


>gnl|CDD|181437 PRK08470, PRK08470, adenylosuccinate lyase; Provisional.
          Length = 442

 Score = 29.3 bits (66), Expect = 9.0
 Identities = 24/82 (29%), Positives = 34/82 (41%), Gaps = 14/82 (17%)

Query: 598 IAREDIPAGSARNARPVLEDVQEFKNCIQCQ----VLQDRTIVRGGNEAKDENLCVYFDE 653
           ++RED      RNA  V ED+Q+ K  I  +     LQ            DE+L     E
Sbjct: 366 VSREDAYKIVQRNAMKVWEDLQQGKAAINEKGESLFLQ--------ALLNDEDLRKSLSE 417

Query: 654 DECK--FEYVYWYDAQGKIHLR 673
           +E +  F+Y Y+      I  R
Sbjct: 418 EEIRACFDYSYYTKNVDAIFKR 439


>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
          domain.  Spiders use fibroins to make silk strands.
          This family includes tubuliform silk fibroins which are
          used to protect egg cases. This domain is a structural
          domain which is found in repeats of up to 20 in many
          individuals (although this is not necessarily the
          case). RP1 makes up structural domains in the N
          terminal while RP2 makes up structural domains in the C
          terminal.
          Length = 167

 Score = 28.3 bits (63), Expect = 9.8
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 64 SQHSSSSSSSSSSGSFSSGGSAS 86
          SQ +SS+SSSSS+ +F+   S++
Sbjct: 4  SQAASSASSSSSASAFAQSLSSA 26


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.133    0.402 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,672,352
Number of extensions: 3374532
Number of successful extensions: 3984
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3912
Number of HSP's successfully gapped: 45
Length of query: 741
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 637
Effective length of database: 6,324,786
Effective search space: 4028888682
Effective search space used: 4028888682
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.1 bits)