RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3590
         (258 letters)



>gnl|CDD|215648 pfam00003, 7tm_3, 7 transmembrane sweet-taste receptor of 3 GCPR.
           This is a domain of seven transmembrane regions that
           forms the C-terminus of some subclass 3
           G-coupled-protein receptors. It is often associated with
           a downstream cysteine-rich linker domain, NCD3G
           pfam07562, which is the human sweet-taste receptor, and
           the N-terminal domain, ANF_receptor pfam01094. The seven
           TM regions assemble in such a way as to produce a
           docking pocket into which such molecules as cyclamate
           and lactisole have been found to bind and consequently
           confer the taste of sweetness.
          Length = 238

 Score = 58.8 bits (143), Expect = 2e-10
 Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 3/76 (3%)

Query: 1   MFGAFLAWETRHVSIPALNDSKYVGMSVYNVVLMCIMGAAISFVISDKQDASFVIISLFI 60
           +   FLA++ R +     N++K++  S+Y   ++ I    +    + K      +++  I
Sbjct: 163 LLCFFLAFKARKLP-ENFNEAKFITFSMYLFCIIWIAFIPLYLSTNGK--YRDAVLAFAI 219

Query: 61  LFCTTATLCLVFVPKI 76
           L   T  L  +F+PK 
Sbjct: 220 LASATGLLLCIFIPKC 235


>gnl|CDD|222300 pfam13664, DUF4149, Domain of unknown function (DUF4149). 
          Length = 98

 Score = 31.4 bits (72), Expect = 0.13
 Identities = 10/56 (17%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 27 SVYN-VVLMCIMGAAISFVISDKQDASFVIISLFILFCTTATLCLVFVPKIVELKR 81
           VY  + L   +   ++ ++   +   + ++ L +L   T        P+I  LK 
Sbjct: 39 PVYFLLGLALAVLLLLTELLLGGKAERWQLLLLAVLLLLTLLNLFYLTPRITALKD 94


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 32.7 bits (74), Expect = 0.18
 Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 3/46 (6%)

Query: 146 ALPSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKRELLVPNNH 191
             P   PH   P G PPGK   S   P  P   T +  +LL  ++H
Sbjct: 660 GYPFQSPHLGAPSGSPPGKDRDS---PDLPRPTTSLHPKLLSAHHH 702


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 32.6 bits (74), Expect = 0.21
 Identities = 27/108 (25%), Positives = 39/108 (36%), Gaps = 14/108 (12%)

Query: 146 ALPSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKRELLVPNNHGIL---SEDPFYYW 202
           +LP   P    PP  PPG   PS   PP P  + V KR     +  G +   S +P Y  
Sbjct: 3   SLPPGNP---PPPPPPPGFEPPSQPPPPPPPGVNVKKRSRKQLSIVGDILGHSGNPIYSL 59

Query: 203 KALCHALKHLKRLLQSQHY--------EMMLKQTNPVMVRPATHYQVD 242
           +     +K   +               E + K  +P    P+   +VD
Sbjct: 60  RVSDKPVKLGNKAKTLHVLTLKAPIPPEHLRKIQSPCSDMPSVLTKVD 107


>gnl|CDD|222477 pfam13965, SID-1_RNA_chan, dsRNA-gated channel SID-1.  This is a
           family of proteins that are transmembrane dsRNA-gated
           channels. They passively transport dsRNA into cells and
           do not act as ATP-dependent pumps. They are required for
           systemic RNA interference.
          Length = 567

 Score = 32.4 bits (74), Expect = 0.26
 Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 17/110 (15%)

Query: 41  ISFVISDKQDASFVIISLFILFCTTATLCLVFVPKIVELKRNPQGSIDKRIRATLRPMSK 100
           I+F ++  +  S+ + +L ++F    + CL+F+  ++ + RN +  +  R   +  P   
Sbjct: 103 ITFNVTILESISYFVPTLAMVF-FLLSFCLLFIVFVIHIFRNRR-DLRSRDIISFEPSPS 160

Query: 101 TRR-----------DSSELEEKLKDAKMSNMRHRKILAERENELQALLSR 139
            +R           DSSE E  +  A    M    ++ E+ +   + LSR
Sbjct: 161 EQRAMDLQPDQSEEDSSERENDILMADQQIM----VIREKASLYVSDLSR 206


>gnl|CDD|187828 cd09697, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as Csx8 family.
          Length = 441

 Score = 30.3 bits (68), Expect = 0.94
 Identities = 14/41 (34%), Positives = 18/41 (43%), Gaps = 4/41 (9%)

Query: 102 RRDSSELEEKLKDAKMSNMRHRKILAERENEL----QALLS 138
           R    E++E LK AK       K   +  N+L    Q LLS
Sbjct: 352 RNGGKEMDENLKKAKACAKAVVKKFIKELNKLASYKQKLLS 392


>gnl|CDD|233970 TIGR02670, cas_csx8, CRISPR-associated protein Cas8a1/Csx8, subtype
           I.  In three genomes so far, a member of this protein
           appears in the midst of a CRISPR-associated (cas) gene
           operon, immediately upstream of a member of family
           TIGR01875 (CRISPR-associated autoregulator, DevR
           family). The genomes so far are Nocardia farcinica
           IFM10152, Clostridium perfringens SM101, and Clostridium
           tetani E88 [Mobile and extrachromosomal element
           functions, Other].
          Length = 441

 Score = 30.3 bits (68), Expect = 0.94
 Identities = 14/41 (34%), Positives = 18/41 (43%), Gaps = 4/41 (9%)

Query: 102 RRDSSELEEKLKDAKMSNMRHRKILAERENEL----QALLS 138
           R    E++E LK AK       K   +  N+L    Q LLS
Sbjct: 352 RNGGKEMDENLKKAKACAKAVVKKFIKENNKLASYKQKLLS 392


>gnl|CDD|220985 pfam11109, RFamide_26RFa, Orexigenic neuropeptide Qrfp/P518.
           Qrfp/P518 has a direct role in maintaining bone mineral
           density. Qrfp has also found to be important in energy
           homeostasis by regulating appetite and energy
           expenditure in mice. The c-terminal 28 residues are the
           functional 26RFa.
          Length = 132

 Score = 29.1 bits (65), Expect = 1.0
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 159 GQPPGKSLPSALKPPQPEALTVVKREL 185
           G+P     P   + PQP+AL V  REL
Sbjct: 45  GRPFPWGSPGWPRAPQPQALLVTAREL 71


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 30.2 bits (68), Expect = 1.1
 Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 8/58 (13%)

Query: 96  RPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENEL--------QALLSRLGSESL 145
           +   K+ R+  ELEE L+  K    + R +L + E E         QALL  L + +L
Sbjct: 135 KNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDELETSTL 192


>gnl|CDD|233165 TIGR00879, SP, MFS transporter, sugar porter (SP) family.  This
           model represent the sugar porter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 481

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 15/65 (23%), Positives = 27/65 (41%), Gaps = 6/65 (9%)

Query: 3   GAFLAWETRHVSIPALNDSKYV----GMSVYNVVLMCIMGAAISFVISDK--QDASFVII 56
           G  LA         + N   Y     G+ V   ++   +GA  +  +SD+  +  S +II
Sbjct: 48  GGALALPAFEFKFTSANSDSYSSSLWGLVVSIFLVGGFIGALFAGWLSDRFGRKKSLLII 107

Query: 57  SLFIL 61
           +L  +
Sbjct: 108 ALLFV 112


>gnl|CDD|183757 PRK12800, fliF, flagellar MS-ring protein; Reviewed.
          Length = 574

 Score = 29.2 bits (65), Expect = 2.5
 Identities = 28/104 (26%), Positives = 43/104 (41%), Gaps = 23/104 (22%)

Query: 81  RNPQGSIDKRIRATLRPMSKTRR---------DSSELEEKLKDAKMSNMRHRKILAEREN 131
           R  + S ++RIR  L PM+   R         D S +EE     ++ N    K+ +E+ +
Sbjct: 249 RRQESSYNQRIRELLEPMTGPGRVNPEVSVDMDFSVVEEA---RELYNGEPAKLRSEQVS 305

Query: 132 ELQALLSRLGSESLALPSSGPHGKT--PPGQPPGKSLPSALKPP 173
           +         + + A    GP G T   PGQPP  +   A   P
Sbjct: 306 D---------TSTSATGPQGPPGATSNSPGQPPAPAAAGAPGTP 340


>gnl|CDD|218632 pfam05549, Allexi_40kDa, Allexivirus 40kDa protein. 
          Length = 271

 Score = 29.0 bits (65), Expect = 2.6
 Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 84  QGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKI--LAERENELQALLSRLG 141
           QG+   R+R  LR +++T      +E+K++ +  +    + +  L   E  L+AL ++L 
Sbjct: 76  QGTPLSRLRDALRTLTRTVDSIYSVEQKIELSTQTPSSSKLLKKLTSIETSLEALHAKLD 135

Query: 142 SESLAL------PSSGPHGKTPPGQPPGKSLP 167
             + +L      PS  P   +       +SLP
Sbjct: 136 ELTSSLTSTLAGPSVSPTPASSHPTASDRSLP 167


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 1/70 (1%)

Query: 75  KIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQ 134
            I E +R  + + ++R+      + K   +  ELE ++++ +    +  +  AE + EL+
Sbjct: 309 SIAEKERELEDA-EERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE 367

Query: 135 ALLSRLGSES 144
            L + L    
Sbjct: 368 DLRAELEEVD 377



 Score = 27.3 bits (61), Expect = 9.9
 Identities = 18/74 (24%), Positives = 23/74 (31%), Gaps = 18/74 (24%)

Query: 78  ELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALL 137
           E  R     +DK    T       R +  +  EKL+  K              NEL+  L
Sbjct: 367 EDLRAELEEVDKEFAET-------RDELKDYREKLEKLK-----------REINELKREL 408

Query: 138 SRLGSESLALPSSG 151
            RL  E   L    
Sbjct: 409 DRLQEELQRLSEEL 422


>gnl|CDD|240543 cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic
           compound extrusion (MATE)-like proteins similar to
           Bacillus subtilis yoeA.  The integral membrane proteins
           from the MATE family are involved in exporting
           metabolites across the cell membrane and are responsible
           for multidrug resistance (MDR) in many bacteria and
           animals. A number of family members are involved in the
           synthesis of peptidoglycan components in bacteria.
          Length = 431

 Score = 29.0 bits (66), Expect = 3.1
 Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 12/77 (15%)

Query: 24  VGMSVYNVVLMCIMG--AAISFVIS------DKQDASFVIISLFILFCTTA----TLCLV 71
            G  +  ++    MG     + +IS      D++ A   + +L +LF         L L+
Sbjct: 44  NGSPIIFLLTAIGMGLSMGGTILISQYIGAKDEEKAKKAVGTLLVLFLILGIVLTILGLL 103

Query: 72  FVPKIVELKRNPQGSID 88
           F P I+ L   P   ++
Sbjct: 104 FAPPILRLMNTPAEILE 120


>gnl|CDD|112645 pfam03841, SelA, L-seryl-tRNA selenium transferase. 
          Length = 367

 Score = 28.4 bits (64), Expect = 3.6
 Identities = 27/104 (25%), Positives = 40/104 (38%), Gaps = 4/104 (3%)

Query: 76  IVELKRNPQGS---IDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENE 132
           I  LK+NP      +DK   A L    +   +  +L EK+   ++   +  K L  +   
Sbjct: 230 IERLKKNPLKRALRVDKLTLAALEATLRLYLNPEKLYEKIPTLRLLT-QPLKELRAKAER 288

Query: 133 LQALLSRLGSESLALPSSGPHGKTPPGQPPGKSLPSALKPPQPE 176
           LQ LL         +       +   G  P K LPSA    +PE
Sbjct: 289 LQKLLKARLGSGAEVSVEPSFSQIGGGSLPVKELPSAAVTIRPE 332


>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus.  This family represents the
           N-terminus (approximately 200 residues) of the
           proline-rich protein BAT2. BAT2 is similar to other
           proteins with large proline-rich domains, such as some
           nuclear proteins, collagens, elastin, and synapsin.
          Length = 189

 Score = 28.0 bits (62), Expect = 4.1
 Identities = 13/50 (26%), Positives = 16/50 (32%), Gaps = 1/50 (2%)

Query: 144 SLALPSSGPHGKTPPG-QPPGKSLPSALKPPQPEALTVVKRELLVPNNHG 192
           S ALP   P  + P   Q    S P   +P  P      K   +     G
Sbjct: 99  SDALPPQTPESQPPAVSQTSASSPPPPPQPATPLVPGGAKSWAVASAKPG 148


>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
          Length = 857

 Score = 28.7 bits (64), Expect = 4.2
 Identities = 24/79 (30%), Positives = 32/79 (40%)

Query: 79  LKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALLS 138
           L +    +  KR+      +S   R  SELEE+ K  K S    + I AE E    A+  
Sbjct: 429 LMKESDEASKKRLDMLNEELSDKERQYSELEEEWKAEKASLSGTQTIKAELEQAKIAIEQ 488

Query: 139 RLGSESLALPSSGPHGKTP 157
                 LA  S   +GK P
Sbjct: 489 ARRVGDLARMSELQYGKIP 507


>gnl|CDD|236242 PRK08326, PRK08326, ribonucleotide-diphosphate reductase subunit
           beta; Validated.
          Length = 311

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 18/59 (30%), Positives = 25/59 (42%), Gaps = 11/59 (18%)

Query: 166 LPSAL----KPPQPEALTVVKRELLVPNNH---GILSEDPFYYWKALCHALKHLKRLLQ 217
           LP+AL      P PE          V  NH   G+L+E  +Y W+ +C     L  L +
Sbjct: 146 LPAALNRLSTDPSPENQV----RASVTYNHVVEGVLAETGYYAWRKICVTRGILPGLQE 200


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 28.5 bits (63), Expect = 4.3
 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 4/65 (6%)

Query: 75  KIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQ 134
           KI EL R  +  I++  R      +  +R   E E+  + A+ +  +  K    RE +LQ
Sbjct: 282 KIEELARGLRADIERVAREN----ADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQ 337

Query: 135 ALLSR 139
           A  +R
Sbjct: 338 AECAR 342


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 28.4 bits (63), Expect = 4.4
 Identities = 20/115 (17%), Positives = 43/115 (37%), Gaps = 8/115 (6%)

Query: 74  PKIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENEL 133
            +I   +      I+  ++     + + R +  +L  +L+ ++ S M  ++ L   E E 
Sbjct: 648 TQIGHAQAAAHNHIEHHVQKLESDLKQLRAEREQLVAQLEKSQQSLMEFQQELNAAEAER 707

Query: 134 QALLS---RLGSESLALPSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKREL 185
           +  L     L  E+L    S               L ++ K    E + +++RE 
Sbjct: 708 RKHLEENRELKQEALLAAIS-----EKDANIALLELSASKKKKTQEEVALLRREK 757


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 28.3 bits (63), Expect = 4.4
 Identities = 14/86 (16%), Positives = 32/86 (37%), Gaps = 18/86 (20%)

Query: 105 SSELEEKLKD-----AKMSNMRHRKILAERENELQALLSRLGSESLALPSSG-------- 151
           +++++++ ++       ++  + R     R  +L    + L  +  AL ++         
Sbjct: 78  AAQMQKQYEEIRRELDVLN--KQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATGEPV 135

Query: 152 PHGKTPPGQPPGKSLP---SALKPPQ 174
           P     P  P G+  P       PPQ
Sbjct: 136 PQMPASPPGPEGEPQPGNTPVSFPPQ 161


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 28.0 bits (62), Expect = 4.6
 Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 6/58 (10%)

Query: 91  IRATL------RPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALLSRLGS 142
           +RA L            + D  EL+EKL++ +       K L E E E + +  RL  
Sbjct: 117 VRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKR 174


>gnl|CDD|178323 PLN02721, PLN02721, threonine aldolase.
          Length = 353

 Score = 28.1 bits (63), Expect = 4.7
 Identities = 9/48 (18%), Positives = 21/48 (43%), Gaps = 9/48 (18%)

Query: 194 LSEDPFYYWKALCHALKHLKRLLQSQHYEMMLKQTNPVMVRPATHYQV 241
           +++      + LC +L         + + ++L   N   +R  TH+Q+
Sbjct: 296 ITDGSRITAEKLCKSL---------EEHGVLLMPGNSSRIRVVTHHQI 334


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 27.6 bits (62), Expect = 4.8
 Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 9/71 (12%)

Query: 75  KIVELKRNPQGS------IDKRIRATLRPMSKTRRDSSELEEKLKDA---KMSNMRHRKI 125
           +I E+K+N + +      I +  +  + P+ K   +  EL +KLKD    K S    +  
Sbjct: 35  EIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKAR 94

Query: 126 LAERENELQAL 136
           L E E EL+ L
Sbjct: 95  LKELEKELKNL 105


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 28.1 bits (63), Expect = 4.8
 Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 3/44 (6%)

Query: 104 DSSELE---EKLKDAKMSNMRHRKILAERENELQALLSRLGSES 144
           D +EL+   EKLK      M   K L E E ELQ L S++   +
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLT 245


>gnl|CDD|178056 PLN02437, PLN02437, ribonucleoside--diphosphate reductase large
           subunit.
          Length = 813

 Score = 28.2 bits (63), Expect = 4.9
 Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 24/82 (29%)

Query: 63  CTTATLCLVFVPKIVELKRNPQGSIDKRIRATLRPMSKTRR-DSSELEE----------K 111
           C  A++ L   P+ V  K  P  S   ++  +L   SK R  D  +L E          K
Sbjct: 444 CNLASIAL---PRFVREKGVPLESHPSKLVGSLG--SKNRYFDFEKLAEVTAVVTRNLNK 498

Query: 112 LKD--------AKMSNMRHRKI 125
           + D        A+ SNMRHR I
Sbjct: 499 IIDVNYYPVETARRSNMRHRPI 520


>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor.  Mcm10 is a eukaryotic
           DNA replication factor that regulates the stability and
           chromatin association of DNA polymerase alpha.
          Length = 346

 Score = 27.9 bits (62), Expect = 5.5
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 4/56 (7%)

Query: 80  KRNPQGSIDKRIRATLRP-MSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQ 134
           ++N   +  ++      P M K R   SE  +K+  AK S   H  +L E E E Q
Sbjct: 175 EKNASSAKRQKSPEDEEPAMKKRRYLESEEFQKILAAKSS---HTGLLEEAEREAQ 227


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 26.1 bits (58), Expect = 5.5
 Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 12/66 (18%)

Query: 78  ELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMS------NMRHRKILAEREN 131
           ELK+N      K  R   R     +++   L+ +L   K            ++ L + E 
Sbjct: 18  ELKKN------KAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQKLKKLEK 71

Query: 132 ELQALL 137
           +L A+ 
Sbjct: 72  DLDAVK 77


>gnl|CDD|235893 PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase;
           Provisional.
          Length = 397

 Score = 27.9 bits (63), Expect = 5.8
 Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 6/33 (18%)

Query: 99  SKTRR------DSSELEEKLKDAKMSNMRHRKI 125
            K +R      D ++LE +LK+AK +  RH+ I
Sbjct: 146 CKAKRYRYANNDMADLEAQLKEAKEAGARHKLI 178


>gnl|CDD|227331 COG4998, COG4998, Predicted endonuclease (RecB family) [DNA
           replication, recombination, and repair].
          Length = 209

 Score = 27.7 bits (61), Expect = 6.1
 Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 5/81 (6%)

Query: 71  VFVPKIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERE 130
           V V    +L    + ++D+ + + +  +S       E  E LK      M   K  AE+ 
Sbjct: 105 VVVLSTDDLYEAVEEALDRAVTSLVEVLSHCEELEIEALEVLKA-----MAKAKSFAEKA 159

Query: 131 NELQALLSRLGSESLALPSSG 151
           ++L   +  L +    L  +G
Sbjct: 160 SKLGTAVEELAARVDRLKRAG 180


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 27.8 bits (62), Expect = 6.7
 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 11/60 (18%)

Query: 88  DKRIRATLRPMSKTRRDSSELE----------EKLKDAKMSNMRHRKILAERENELQALL 137
            +R     R + +T  +   LE          EKL+       R++++ AE   EL+  L
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELR-ELELAL 229


>gnl|CDD|225833 COG3296, COG3296, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 143

 Score = 26.8 bits (59), Expect = 6.8
 Identities = 17/104 (16%), Positives = 40/104 (38%), Gaps = 9/104 (8%)

Query: 1   MFGAFLAWETRHVSIPALNDSK------YVGMSVYNVVLMCIMGAAISFVISDKQDASFV 54
           + G  + W  +  S P ++          +  ++Y++V   ++ A +    +D      +
Sbjct: 38  LLGPLIVWLLKKDSSPFVDAQGKEALNFQISYTIYSIVSFVLILAGVFLAATDISFIIII 97

Query: 55  --IISLFILFCTTATLCLVFVPKIVELKRNPQGSIDKRIRATLR 96
              ++L  +    A   L  V  I+   +  +G  + R   T+R
Sbjct: 98  GFFLTLIGILSIFALSILSLVLTIIAAIKAYEG-QEYRYPLTIR 140


>gnl|CDD|233146 TIGR00837, araaP, aromatic amino acid transport protein.  The
           Hydroxy/Aromatic Amino Acid Permease (HAAAP) Family-
           tyrosine/tryptophan subfamily (TC 2.A.42.1) The HAAAP
           family includes well characterized aromatic amino
           acid:H+ symport permeases and hydroxy amino acid
           permeases. This subfamily is specific for aromatic amino
           acid transporters and includes the tyrosine permease,
           TyrP, of E. coli, and the tryptophan transporters TnaB
           and Mtr of E. coli [Transport and binding proteins,
           Amino acids, peptides and amines].
          Length = 381

 Score = 27.7 bits (62), Expect = 6.9
 Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 12  HVSIPALNDSKYVGMSVYNVVLMCIMGAAISFVI 45
           H ++P+L   KY   +V  V    ++G+AI+ V+
Sbjct: 194 HGNVPSLY--KYYDGNVKKVKKSILIGSAIALVL 225


>gnl|CDD|212662 cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins.  This
           subfamily consists of uncharacterized hypothetical
           bacterial proteins with similarity to Escherichia coli
           sugar kinase ygcE , whose functional roles are not yet
           clear. Escherichia coli ygcE is recognized by this
           model, but is not present in the alignment as it
           contains a deletion relative to other members of the
           group. These proteins belong to the FGGY family of
           carbohydrate kinases, the monomers of which contain two
           large domains, which are separated by a deep cleft that
           forms the active site. This model includes both the
           N-terminal domain, which adopts a ribonuclease H-like
           fold, and the structurally related C-terminal domain.
          Length = 488

 Score = 27.6 bits (62), Expect = 7.1
 Identities = 10/18 (55%), Positives = 12/18 (66%)

Query: 197 DPFYYWKALCHALKHLKR 214
           DP YYW+ALC A + L  
Sbjct: 45  DPDYYWQALCEACRELWE 62


>gnl|CDD|153304 cd07620, BAR_SH3BP1, The Bin/Amphiphysin/Rvs (BAR) domain of
           SH3-domain Binding Protein 1.  BAR domains are
           dimerization, lipid binding and curvature sensing
           modules found in many different proteins with diverse
           functions. SH3-domain binding protein 1 (SH3BP1 or
           3BP-1) is a Rac GTPase activating protein that inhibits
           Rac-mediated platelet-derived growth factor
           (PDGF)-induced membrane ruffling. SH3BP1 contains an
           N-terminal BAR domain followed by a GAP domain for Rho
           and Rac GTPases and a C-terminal proline-rich domain.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions.
          Length = 257

 Score = 27.2 bits (60), Expect = 7.4
 Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 12/99 (12%)

Query: 92  RATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALLSRLGSESLALPSSG 151
           +  L+P++K   +  +L E LK        ++K  A+   +  +  SR    +   P SG
Sbjct: 106 KDVLQPLNKLSEE--DLPEILK--------NKKQFAKLTTDWNSAKSRSPQAAGRSPRSG 155

Query: 152 PHGKTPPGQPPGKSLPSALKPPQPEALTVVKRELLVPNN 190
              +   G+  G    +  +P + E      R+L    +
Sbjct: 156 GRSEE-VGEHQGIRRANKGEPLKEE-EEECWRKLEQCKD 192


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
           Srsx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srsx is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 257

 Score = 27.2 bits (61), Expect = 7.6
 Identities = 10/68 (14%), Positives = 26/68 (38%), Gaps = 9/68 (13%)

Query: 29  YNVVLMCIMGAAISFVISDKQDASFVIISLFILFCTTATLCLVFVPKIVELKRNPQGSID 88
              +++C    A    ++     +F I +L  +      L +V++  I+  K+  Q +  
Sbjct: 129 DETIIVC----APPLALNGT---AFRIFTLSSVIINVIVL-IVYIILIIIFKKKGQTTNS 180

Query: 89  KRIRATLR 96
              +   +
Sbjct: 181 -DSKKVFK 187


>gnl|CDD|215560 PLN03076, PLN03076, ARF guanine nucleotide exchange factor
           (ARF-GEF); Provisional.
          Length = 1780

 Score = 27.9 bits (62), Expect = 8.0
 Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 9/80 (11%)

Query: 108 LEEKLKDAKMSNMRHRKILAERENELQALLSRLGSESLALPSSGPHGKTPPGQPPGKSLP 167
           LE+ +K+A  S  +H K+     +E +A++ RL S     PS+              SLP
Sbjct: 18  LEKIIKNA--SWRKHSKLA----HECKAVIERLNSPEKNPPSTSSAAADSA---SASSLP 68

Query: 168 SALKPPQPEALTVVKRELLV 187
             L        ++ + EL++
Sbjct: 69  GPLHDGGSIEYSLAESELIL 88


>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
          Length = 214

 Score = 26.9 bits (59), Expect = 8.4
 Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 8/40 (20%)

Query: 137 LSRLGSESLALPSSGPHGKTPPGQPPGKSLPSALKPPQPE 176
           LS  GS     P+SGP      G  PG   P+   PP+P 
Sbjct: 165 LSGPGSR----PTSGP----VAGGLPGGGAPAPPTPPRPG 196


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 27.2 bits (59), Expect = 9.4
 Identities = 10/37 (27%), Positives = 17/37 (45%)

Query: 148 PSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKRE 184
           P+     K  P   PGK+ P+A +  + EA +  +  
Sbjct: 61  PAKESPQKGAPPVTPGKAGPAAAQAGEEEAKSSEEES 97


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 27.4 bits (61), Expect = 9.9
 Identities = 22/78 (28%), Positives = 33/78 (42%), Gaps = 7/78 (8%)

Query: 75  KIVELKRNPQGSIDKRIR----ATLRPMSKTRRDS-SELEEKLKDAKMSNMRHRKILAER 129
           ++ ELK   +     R R      LR   K  +    ELEE+L   +   ++  ++  E 
Sbjct: 567 RLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE-ELLQSLELS-EA 624

Query: 130 ENELQALLSRLGSESLAL 147
           ENEL+     L SE   L
Sbjct: 625 ENELEEAEEELESELEKL 642


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.134    0.406 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,309,305
Number of extensions: 1269325
Number of successful extensions: 1767
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1762
Number of HSP's successfully gapped: 76
Length of query: 258
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 163
Effective length of database: 6,723,972
Effective search space: 1096007436
Effective search space used: 1096007436
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.1 bits)