RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6668
         (306 letters)



>gnl|CDD|227455 COG5126, FRQ1, Ca2+-binding protein (EF-Hand superfamily) [Signal
           transduction mechanisms / Cytoskeleton / Cell division
           and chromosome partitioning / General function
           prediction only].
          Length = 160

 Score = 58.9 bits (143), Expect = 1e-10
 Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 2/65 (3%)

Query: 90  YAHYVFNTLDHNKTGFLNFEDFVKGLSI-LCRGSEEEKLRWIFCLYDINGDGVISKDDLY 148
             + +F  +D      ++F +F+  +S+ L RG +EE+LR  F L+D + DG IS  +L 
Sbjct: 57  EINKLFEEIDAG-NETVDFPEFLTVMSVKLKRGDKEEELREAFKLFDKDHDGYISIGELR 115

Query: 149 NIVSS 153
            ++ S
Sbjct: 116 RVLKS 120



 Score = 32.3 bits (74), Expect = 0.13
 Identities = 16/82 (19%), Positives = 31/82 (37%), Gaps = 16/82 (19%)

Query: 122 SEEEKLRWIFCLYDINGDGVISKDDLYNIVSSVYELMGTYAYRAFDSGMVAQRVEYLFQV 181
            + ++L+  F L+D + DG+I +++L  I             R+         +  LF+ 
Sbjct: 17  EQIQELKEAFQLFDRDSDGLIDRNELGKI------------LRSLGFNPSEAEINKLFEE 64

Query: 182 STKGFE----MSILCSISIPRS 199
              G E       L  +S+   
Sbjct: 65  IDAGNETVDFPEFLTVMSVKLK 86



 Score = 29.6 bits (67), Expect = 1.2
 Identities = 17/61 (27%), Positives = 30/61 (49%), Gaps = 5/61 (8%)

Query: 88  IRYAHYVFNTLDHNKTGFLNFEDFVKGLSILC-RGSEEEKLRWIFCLYDINGDGVISKDD 146
           +R A   F   D +  G+++  +  + L  L  R S+EE +  +   YD +GDG I  ++
Sbjct: 94  LREA---FKLFDKDHDGYISIGELRRVLKSLGERLSDEE-VEKLLKEYDEDGDGEIDYEE 149

Query: 147 L 147
            
Sbjct: 150 F 150


>gnl|CDD|238008 cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily
           of calcium sensors and calcium signal modulators; most
           examples in this alignment model have 2 active canonical
           EF hands. Ca2+ binding induces a conformational change
           in the EF-hand motif, leading to the activation or
           inactivation of target proteins. EF-hands tend to occur
           in pairs or higher copy numbers.
          Length = 63

 Score = 53.3 bits (129), Expect = 1e-09
 Identities = 13/63 (20%), Positives = 27/63 (42%)

Query: 90  YAHYVFNTLDHNKTGFLNFEDFVKGLSILCRGSEEEKLRWIFCLYDINGDGVISKDDLYN 149
                F   D +  G ++ ++    L  L  G  EE++  +    D +GDG I  ++   
Sbjct: 1   ELREAFRLFDKDGDGTISADELKAALKSLGEGLSEEEIDEMIREVDKDGDGKIDFEEFLE 60

Query: 150 IVS 152
           +++
Sbjct: 61  LMA 63



 Score = 37.5 bits (88), Expect = 4e-04
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 126 KLRWIFCLYDINGDGVISKDDLYNIVSSVYELM 158
           +LR  F L+D +GDG IS D+L   + S+ E +
Sbjct: 1   ELREAFRLFDKDGDGTISADELKAALKSLGEGL 33


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
          the RNA polymerase II associated Paf1 complex. The Paf1
          complex functions during the elongation phase of
          transcription in conjunction with Spt4-Spt5 and
          Spt16-Pob3i.
          Length = 431

 Score = 50.9 bits (122), Expect = 3e-07
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 23 SELVCRVKYNNTLPDIPFDCKFIAYPF 49
           + + +VKY N LPD PF  K I YPF
Sbjct: 1  QDYIAKVKYTNNLPDPPFPPKLITYPF 27


>gnl|CDD|222177 pfam13499, EF_hand_5, EF-hand domain pair. 
          Length = 60

 Score = 43.5 bits (103), Expect = 3e-06
 Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 4/59 (6%)

Query: 97  TLDHNKTGFLNFEDFVKGLSILCRGSEEEK----LRWIFCLYDINGDGVISKDDLYNIV 151
            LD +  G+++ E+  K L  L     +E+    +   F   D +GDG IS ++    +
Sbjct: 2   LLDKDGDGYIDVEELRKLLKALGLKLTDEEVEELIEADFNEIDKDGDGRISFEEFLEAM 60



 Score = 32.0 bits (73), Expect = 0.047
 Identities = 7/25 (28%), Positives = 15/25 (60%)

Query: 91  AHYVFNTLDHNKTGFLNFEDFVKGL 115
               FN +D +  G ++FE+F++ +
Sbjct: 36  IEADFNEIDKDGDGRISFEEFLEAM 60



 Score = 27.3 bits (61), Expect = 1.9
 Identities = 10/48 (20%), Positives = 20/48 (41%), Gaps = 8/48 (16%)

Query: 133 LYDINGDGVISKDDLYNIVSSVYELMGTYAYRAFDSGMVAQRVEYLFQ 180
           L D +GDG I  ++L  ++ ++   +            V + +E  F 
Sbjct: 2   LLDKDGDGYIDVEELRKLLKALGLKL--------TDEEVEELIEADFN 41


>gnl|CDD|222407 pfam13833, EF_hand_6, EF-hand domain pair. 
          Length = 53

 Score = 42.9 bits (102), Expect = 4e-06
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 102 KTGFLNFEDFVKGLSILCRGSEEEKLRWIFCLYDINGDGVISKDDLYN 149
           + G +  E+  + L++L     EE++  +F  +D +GDG IS ++   
Sbjct: 1   EKGLITREELKRALALLGISLSEEEVDILFREFDTDGDGKISFEEFCV 48


>gnl|CDD|197492 smart00054, EFh, EF-hand, calcium binding motif.  EF-hands are
           calcium-binding motifs that occur at least in pairs.
           Links between disease states and genes encoding
           EF-hands, particularly the S100 subclass, are emerging.
           Each motif consists of a 12 residue loop flanked on
           either side by a 12 residue alpha-helix. EF-hands
           undergo a conformational change unpon binding calcium
           ions.
          Length = 29

 Score = 35.0 bits (82), Expect = 0.002
 Identities = 8/29 (27%), Positives = 19/29 (65%)

Query: 126 KLRWIFCLYDINGDGVISKDDLYNIVSSV 154
           +L+  F L+D +GDG I  ++  +++ ++
Sbjct: 1   ELKEAFRLFDKDGDGKIDFEEFKDLLKAL 29



 Score = 27.7 bits (63), Expect = 0.59
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 94  VFNTLDHNKTGFLNFEDFVKGLSIL 118
            F   D +  G ++FE+F   L  L
Sbjct: 5   AFRLFDKDGDGKIDFEEFKDLLKAL 29


>gnl|CDD|200946 pfam00036, efhand, EF hand.  The EF-hands can be divided into two
           classes: signaling proteins and buffering/transport
           proteins. The first group is the largest and includes
           the most well-known members of the family such as
           calmodulin, troponin C and S100B. These proteins
           typically undergo a calcium-dependent conformational
           change which opens a target binding site. The latter
           group is represented by calbindin D9k and do not undergo
           calcium dependent conformational changes.
          Length = 29

 Score = 34.7 bits (81), Expect = 0.002
 Identities = 8/29 (27%), Positives = 17/29 (58%)

Query: 126 KLRWIFCLYDINGDGVISKDDLYNIVSSV 154
           +L+  F  +D +GDG IS ++   ++  +
Sbjct: 1   ELKEAFKEFDKDGDGKISFEEFKELLKKL 29



 Score = 29.3 bits (67), Expect = 0.20
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query: 94  VFNTLDHNKTGFLNFEDFVKGLS 116
            F   D +  G ++FE+F + L 
Sbjct: 5   AFKEFDKDGDGKISFEEFKELLK 27


>gnl|CDD|205583 pfam13405, EF_hand_4, EF-hand domain. 
          Length = 30

 Score = 32.6 bits (75), Expect = 0.011
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 130 IFCLYDINGDGVISKDDLYNIVSSV 154
            F L+D +GDG IS ++L   + S+
Sbjct: 5   AFKLFDKDGDGYISAEELRKALRSL 29



 Score = 31.0 bits (71), Expect = 0.040
 Identities = 7/27 (25%), Positives = 12/27 (44%)

Query: 92  HYVFNTLDHNKTGFLNFEDFVKGLSIL 118
              F   D +  G+++ E+  K L  L
Sbjct: 3   REAFKLFDKDGDGYISAEELRKALRSL 29


>gnl|CDD|185504 PTZ00184, PTZ00184, calmodulin; Provisional.
          Length = 149

 Score = 35.1 bits (81), Expect = 0.013
 Identities = 17/65 (26%), Positives = 40/65 (61%), Gaps = 7/65 (10%)

Query: 96  NTLDHNKTGFLNFEDFVKGLSILCRGSE----EEKLRWIFCLYDINGDGVISKDDLYNIV 151
           N +D +  G ++F +F   L+++ R  +    EE+++  F ++D +G+G IS  +L +++
Sbjct: 54  NEVDADGNGTIDFPEF---LTLMARKMKDTDSEEEIKEAFKVFDRDGNGFISAAELRHVM 110

Query: 152 SSVYE 156
           +++ E
Sbjct: 111 TNLGE 115


>gnl|CDD|185503 PTZ00183, PTZ00183, centrin; Provisional.
          Length = 158

 Score = 33.9 bits (78), Expect = 0.037
 Identities = 17/57 (29%), Positives = 30/57 (52%)

Query: 94  VFNTLDHNKTGFLNFEDFVKGLSILCRGSEEEKLRWIFCLYDINGDGVISKDDLYNI 150
            F   D +KTG ++ ++  +    L     +E+L+ +    D NGDG IS+++ Y I
Sbjct: 95  AFRLFDDDKTGKISLKNLKRVAKELGETITDEELQEMIDEADRNGDGEISEEEFYRI 151



 Score = 30.4 bits (69), Expect = 0.71
 Identities = 14/69 (20%), Positives = 35/69 (50%), Gaps = 3/69 (4%)

Query: 85  IKQIRYAHYVFNTLDHNKTGFLNFEDFVKGLSILCRGSEEEKLRWIFCLYDINGDGVISK 144
            K+IR A   F+  D + +G ++ ++    +  L    ++E+++ +    D +G G I  
Sbjct: 16  KKEIREA---FDLFDTDGSGTIDPKELKVAMRSLGFEPKKEEIKQMIADVDKDGSGKIDF 72

Query: 145 DDLYNIVSS 153
           ++  +I++ 
Sbjct: 73  EEFLDIMTK 81


>gnl|CDD|215520 PLN02964, PLN02964, phosphatidylserine decarboxylase.
          Length = 644

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 16/59 (27%), Positives = 30/59 (50%)

Query: 89  RYAHYVFNTLDHNKTGFLNFEDFVKGLSILCRGSEEEKLRWIFCLYDINGDGVISKDDL 147
            +A  +   +D+++ G L+F +F   +          K   +F   D+NGDGV++ D+L
Sbjct: 179 SFARRILAIVDYDEDGQLSFSEFSDLIKAFGNLVAANKKEELFKAADLNGDGVVTIDEL 237


>gnl|CDD|205383 pfam13202, EF_hand_3, EF hand. 
          Length = 25

 Score = 27.2 bits (62), Expect = 0.87
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 133 LYDINGDGVISKDDL 147
            +D NGDG ISK++L
Sbjct: 7   QFDTNGDGKISKEEL 21


>gnl|CDD|220646 pfam10240, DUF2464, Protein of unknown function (DUF2464).  This is
           a family of proteins conserved from worms to humans.
           Members have been annotated as FAM125A proteins, but
           their function is unknown.
          Length = 251

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 13/61 (21%), Positives = 23/61 (37%), Gaps = 5/61 (8%)

Query: 220 SLPFSSSVPEGQSSLQAIRPIQSSVPGLKAIRPPAKSMPDLKPLPLRVSEPESNSYHGNG 279
           S+P    VP+ +S  Q  + +            PA  +P L  L    S   +++ +   
Sbjct: 144 SVPAPPPVPKPRSISQGSQSLSLDSAS-----RPAPGLPLLSSLSRGKSATRADATYDTS 198

Query: 280 T 280
           T
Sbjct: 199 T 199


>gnl|CDD|218376 pfam05005, Ocnus, Janus/Ocnus family (Ocnus).  This family is
           comprised of the Ocnus, Janus-A and Janus-B proteins.
           These proteins have been found to be testes specific in
           Drosophila melanogaster.
          Length = 109

 Score = 28.4 bits (64), Expect = 1.8
 Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 2/36 (5%)

Query: 70  DNNCLAGGRCDHKLDIKQIR-YAH-YVFNTLDHNKT 103
              CL GGR +H  D K+I+ Y +   F   DH  T
Sbjct: 61  CTECLGGGRIEHDQDKKKIKVYGYSQGFGRADHAIT 96


>gnl|CDD|240157 cd05031, S-100A10_like, S-100A10_like: S-100A10 domain found in
           proteins similar to S100A10. S100A10 is a member of the
           S100 family of EF-hand superfamily of calcium-binding
           proteins. Note that the S-100 hierarchy, to which this
           S-100A1_like group belongs, contains only S-100 EF-hand
           domains, other EF-hands have been modeled separately.
           S100 proteins are expressed exclusively in vertebrates,
           and are implicated in intracellular and extracellular
           regulatory activities. A unique feature of S100A10 is
           that it contains mutation in both of the calcium binding
           sites, making it calcium insensitive. S100A10 has been
           detected in brain, heart, gastrointestinal tract,
           kidney, liver, lung, spleen, testes, epidermis, aorta,
           and thymus. Structural data supports the homo- and
           hetero-dimeric as well as hetero-tetrameric nature of
           the protein. S100A10 has multiple binding partners in
           its calcium free state and is therefore involved in many
           diverse biological functions.
          Length = 94

 Score = 28.2 bits (63), Expect = 1.8
 Identities = 14/25 (56%), Positives = 17/25 (68%), Gaps = 3/25 (12%)

Query: 98  LDHNKTGFLNFEDF---VKGLSILC 119
           LD N+ G +NFE+F   V GLSI C
Sbjct: 60  LDQNRDGKVNFEEFVSLVAGLSIAC 84


>gnl|CDD|104071 PRK09629, PRK09629, bifunctional thiosulfate
           sulfurtransferase/phosphatidylserine decarboxylase;
           Provisional.
          Length = 610

 Score = 29.3 bits (65), Expect = 3.2
 Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 19/104 (18%)

Query: 185 GFEMSILCSISIPRSPYSLVPEGGEGAPHLNTPLFSLPFSSSVP-EGQSSLQAIRPIQSS 243
           GF   +  ++  PR   +     GE   H +TP+  +P  ++ P E     Q   P Q +
Sbjct: 237 GFTYLVAKALGYPRVK-AYAGSWGEWGNHPDTPV-EVPTVAAAPIEAVEVAQPAAPAQPA 294

Query: 244 VPGLKAIRPPAKSMPDLKPLPLRVSEPE--SNSYHGNGTSGPAV 285
                    PA+++      P+R +EP   S  Y G+ +SGP++
Sbjct: 295 ---------PAETVE-----PVRATEPSKGSQKYSGHSSSGPSM 324


>gnl|CDD|183426 PRK12309, PRK12309, transaldolase/EF-hand domain-containing
           protein; Provisional.
          Length = 391

 Score = 28.9 bits (65), Expect = 3.4
 Identities = 11/35 (31%), Positives = 21/35 (60%), Gaps = 5/35 (14%)

Query: 112 VKGLSILCRGSEEEKLRWIFCLYDINGDGVISKDD 146
           ++G       ++E     IF LYD++GDG I++++
Sbjct: 326 LEGGEAFTHAAQE-----IFRLYDLDGDGFITREE 355


>gnl|CDD|226837 COG4403, LcnDR2, Lantibiotic modifying enzyme [Defense mechanisms].
          Length = 963

 Score = 29.0 bits (65), Expect = 4.4
 Identities = 15/80 (18%), Positives = 21/80 (26%), Gaps = 10/80 (12%)

Query: 69  KDNNCLAGGRCDHKLDIKQIRYAHYVFNTLDHNKTGFLNFEDFVKGLSILCRGSEEEKLR 128
           K+       +     + K      Y+         GF N          LC G  E K +
Sbjct: 394 KEPYMFESEKNLPTFNGKDKELLDYIK----EITEGFRNMYLL------LCSGISEIKEK 443

Query: 129 WIFCLYDINGDGVISKDDLY 148
            I    D     V     +Y
Sbjct: 444 NILAFEDCKIRVVFRNTSVY 463


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 28.8 bits (64), Expect = 4.8
 Identities = 19/75 (25%), Positives = 27/75 (36%), Gaps = 5/75 (6%)

Query: 231 QSSLQAIRPIQSSVPGLKAIRPPAKSMPDLKPLPLRVSEPESNSYHGNGTSGPAVVTAAK 290
           Q +L+  R      P   A   PA   P     P  V  P   S + N +    V +AA+
Sbjct: 52  QRALETARS---DTPATAAAPAPAPQAPTKPAAP--VHAPLKLSANANMSQRQRVASAAE 106

Query: 291 VTLLADSQTQSSFSP 305
             + A +  Q    P
Sbjct: 107 DMIAAMALRQPVSVP 121


>gnl|CDD|238131 cd00213, S-100, S-100: S-100 domain, which represents the largest
           family within the superfamily of proteins carrying the
           Ca-binding EF-hand motif. Note that this S-100 hierarchy
           contains only S-100 EF-hand domains, other EF-hands have
           been modeled separately. S100 proteins are expressed
           exclusively in vertebrates, and are implicated in
           intracellular and extracellular regulatory activities.
           Intracellularly, S100 proteins act as Ca-signaling or
           Ca-buffering proteins. The most unusual characteristic
           of certain S100 proteins is their occurrence in
           extracellular space, where they act in a cytokine-like
           manner through RAGE, the receptor for advanced glycation
           products. Structural data suggest that many S100 members
           exist within cells as homo- or heterodimers and even
           oligomers; oligomerization contributes to their
           functional diversification. Upon binding calcium, most
           S100 proteins change conformation to a more open
           structure exposing a hydrophobic cleft. This hydrophobic
           surface represents the interaction site of S100 proteins
           with their target proteins. There is experimental
           evidence showing that many S100 proteins have multiple
           binding partners with diverse mode of interaction with
           different targets. In addition to S100 proteins (such as
           S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes
           the ''fused'' gene family, a group of calcium binding
           S100-related proteins. The ''fused'' gene family
           includes multifunctional epidermal differentiation
           proteins - profilaggrin, trichohyalin, repetin,
           hornerin, and cornulin; functionally these proteins are
           associated with keratin intermediate filaments and
           partially crosslinked to the cell envelope. These
           ''fused'' gene proteins contain N-terminal sequence with
           two Ca-binding EF-hands motif, which may be associated
           with calcium signaling in epidermal cells and
           autoprocessing in a calcium-dependent manner. In
           contrast to S100 proteins, "fused" gene family proteins
           contain an extraordinary high number of almost perfect
           peptide repeats with regular array of polar and charged
           residues similar to many known cell envelope proteins.
          Length = 88

 Score = 26.7 bits (60), Expect = 5.5
 Identities = 9/29 (31%), Positives = 17/29 (58%), Gaps = 3/29 (10%)

Query: 94  VFNTLDHNKTGFLNFEDFVK---GLSILC 119
           +   LD NK G ++F++F+     L++ C
Sbjct: 56  IMKDLDVNKDGKVDFQEFLVLIGKLAVAC 84


>gnl|CDD|234450 TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein,
           SusC/RagA family.  This model describes a distinctive
           clade among the TonB-linked outer membrane proteins
           (OMP). Members of this family are restricted to the
           Bacteriodetes lineage (except for Gemmatimonas
           aurantiaca T-27 from the novel phylum Gemmatimonadetes)
           and occur in high copy numbers, with over 100 members
           from Bacteroides thetaiotaomicron VPI-5482 alone.
           Published descriptions of members of this family are
           available for RagA from Porphyromonas gingivalis, SusC
           from Bacteroides thetaiotaomicron, and OmpW from
           Bacteroides caccae. Members form pairs with members of
           the SusD/RagB family (pfam07980). Transporter complexes
           including these outer membrane proteins are likely to
           import large degradation products of proteins (e.g.
           RagA) or carbohydrates (e.g. SusC) as nutrients, rather
           than siderophores [Transport and binding proteins,
           Unknown substrate].
          Length = 982

 Score = 28.3 bits (64), Expect = 5.8
 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 5/38 (13%)

Query: 135 DINGDGVISKDDLYNI-VSSVYELMG----TYAYRAFD 167
           D+NGDG I  DD   I  S+  +L      +++Y+ FD
Sbjct: 801 DVNGDGKIDDDDRVPIGYSNTPDLTYGFGNSFSYKGFD 838


>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
           Myosin III has been shown to play a role in  the vision
           process in insects and in hearing in mammals. Myosin
           III, an unconventional myosin, does not form dimers.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 653

 Score = 28.3 bits (63), Expect = 6.4
 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 4/65 (6%)

Query: 64  FIQQQKDNNCLAGGRCDHKLDIKQIRYAHYVFNTLDHNKTGF---LNFEDFVKGLSILCR 120
           F++  K N      + D +  +KQ+RY   +  T    + GF   + F +F++    L  
Sbjct: 549 FVRCIKPNEDRQAKKFDAEKVLKQLRYTG-ILETARIRRQGFSHRILFANFIRRYCFLAY 607

Query: 121 GSEEE 125
             EEE
Sbjct: 608 RFEEE 612


>gnl|CDD|177973 PLN02339, PLN02339, NAD+ synthase (glutamine-hydrolysing).
          Length = 700

 Score = 28.1 bits (63), Expect = 6.9
 Identities = 20/55 (36%), Positives = 22/55 (40%), Gaps = 18/55 (32%)

Query: 100 HNKTGFLNFEDFVKGLSILCRGSE--EEKLRWIFCLY-----DINGDGVISKDDL 147
             K+GFL           L  GS   +E LR     Y     DIN  G ISK DL
Sbjct: 504 RGKSGFL-----------LVLGSANVDEGLRGYLTKYDCSSADINPIGGISKQDL 547


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 28.3 bits (63), Expect = 7.3
 Identities = 25/92 (27%), Positives = 36/92 (39%), Gaps = 26/92 (28%)

Query: 143 SKDDLYNIVSSVYELMGTYAYR--------AFDSGMVAQRVEYLFQVSTKGFEMSILCSI 194
           S  D YN +S  Y  + T  Y+         + + MV +   YLFQ+  K          
Sbjct: 693 SDTDSYNDISEFYREVETQGYKIEFINISEEYINKMVDEGKLYLFQIYNK---------- 742

Query: 195 SIPRSPYSLVPEGGEGAPHLNTPLFSLPFSSS 226
               SP+S      +G P+L+T  F   FS  
Sbjct: 743 --DFSPFS------KGKPNLHTLYFKALFSEE 766


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 27.7 bits (61), Expect = 8.6
 Identities = 24/71 (33%), Positives = 30/71 (42%), Gaps = 10/71 (14%)

Query: 194 ISIPRSPYSLVPEGGEGAPHLNTPLFSLPFSSSVPEGQSSLQAIRPIQSSVPGLKAIRPP 253
            S P  P+ L        P L  P  + P  S  P  QS+LQ  +P +          PP
Sbjct: 290 SSNPPQPFGLAQSQV---PPLPLPSQAQP-HSHTPPSQSALQPQQPPREQP------LPP 339

Query: 254 AKSMPDLKPLP 264
           A SMP +KP P
Sbjct: 340 APSMPHIKPPP 350


>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein.  The frequency clock
           protein, is the central component of the frq-based
           circadian negative feedback loop, regulates various
           aspects of the circadian clock in Neurospora crassa.
           This protein has been shown to interact with itself via
           a coiled-coil.
          Length = 989

 Score = 28.0 bits (62), Expect = 8.8
 Identities = 19/67 (28%), Positives = 26/67 (38%), Gaps = 6/67 (8%)

Query: 170 MVAQRVEYLFQVSTKGFEMSILCSISIPRSPYSLVP------EGGEGAPHLNTPLFSLPF 223
           +V +R+E +F     G  +    S   P    +L P      EG + AP      F    
Sbjct: 298 LVVRRLEQIFTGKISGRNVRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPPSNFITNP 357

Query: 224 SSSVPEG 230
            SSVPE 
Sbjct: 358 FSSVPEP 364


>gnl|CDD|215563 PLN03081, PLN03081, pentatricopeptide (PPR) repeat-containing
           protein; Provisional.
          Length = 697

 Score = 27.5 bits (61), Expect = 9.5
 Identities = 23/96 (23%), Positives = 28/96 (29%), Gaps = 20/96 (20%)

Query: 196 IPRSPYSLVPEGGEGAPHLNTPLFSLP--FSS-----SVPEGQSSLQAIRPIQSSVPGLK 248
           IP + Y  +          N  L   P  FS        P    S        S   GLK
Sbjct: 3   IPLARYQSIRLDEIRDSLSNPRLLHSPRKFSLRGRRTKTPFSSIS------CSSVEQGLK 56

Query: 249 AIRPPAKSMPDLKPLPLRVSEPESNSYHGNGTSGPA 284
                    P LKP P+R+   ES     + T    
Sbjct: 57  P-------RPRLKPEPIRIEVSESKDARLDDTQIRK 85


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.407 

Gapped
Lambda     K      H
   0.267   0.0908    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,235,050
Number of extensions: 1425810
Number of successful extensions: 1007
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1001
Number of HSP's successfully gapped: 41
Length of query: 306
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 209
Effective length of database: 6,635,264
Effective search space: 1386770176
Effective search space used: 1386770176
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.2 bits)