RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6681
         (537 letters)



>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
           This family contains, amongst other G-protein-coupled
           receptors (GCPRs), members of the opsin family, which
           have been considered to be typical members of the
           rhodopsin superfamily. They share several motifs, mainly
           the seven transmembrane helices, GCPRs of the rhodopsin
           superfamily. All opsins bind a chromophore, such as
           11-cis-retinal. The function of most opsins other than
           the photoisomerases is split into two steps: light
           absorption and G-protein activation. Photoisomerases, on
           the other hand, are not coupled to G-proteins - they are
           thought to generate and supply the chromophore that is
           used by visual opsins.
          Length = 251

 Score =  123 bits (312), Expect = 3e-32
 Identities = 55/259 (21%), Positives = 109/259 (42%), Gaps = 25/259 (9%)

Query: 9   FVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMS-AGCQ 67
            V++  +         + NLA AD    L L   A+               W    A C+
Sbjct: 1   LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGG--------DWPFGDALCK 52

Query: 68  MAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGWSFAILMAVL 127
           + GF+ V++   S+  L  I+++R  AI H +   +  + + A  +++  W  A+L+++ 
Sbjct: 53  LVGFLFVVNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLP 112

Query: 128 PLFGI----SDYRKYATCLPFETTTGVWSLTYIVFLMFINGVAFLILMGCYLKMYCAIRG 183
           PL        +     TCL             ++  +    +  L+++ CY  +   +R 
Sbjct: 113 PLLFSWLRTVEEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRK 172

Query: 184 S----------QAWNSNDSRIAKRMALLVFTDLLCWFPIALVA-LTAICGVHLV-SLEQA 231
                      +  +S + + AK + ++V   +LCW P  +V  L ++C + +   L  A
Sbjct: 173 RARSGASQARAKRSSSKERKAAKMLLVVVVVFVLCWLPYHIVLLLDSLCPLSIWRLLPTA 232

Query: 232 KIFTVFVLPLNSCCNPFLY 250
            + T+++  +NSC NP +Y
Sbjct: 233 LLITLWLAYVNSCLNPIIY 251


>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
           protein; Provisional.
          Length = 335

 Score = 67.5 bits (165), Expect = 3e-12
 Identities = 66/278 (23%), Positives = 121/278 (43%), Gaps = 36/278 (12%)

Query: 1   MLGNGTVVFVLIFARSKIDVPRFLVCNLAAAD--FFMGLYLGLLAVVDASTLGEFRMYAI 58
           ++GN  V++VL   + K  +  +L+ NLA +D  F M L   +              Y I
Sbjct: 55  LVGNIIVIYVLTKTKIKTPMDIYLL-NLAVSDLLFVMTLPFQI-------------YYYI 100

Query: 59  PWQMSAG---CQMAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMV 115
            +Q S G   C++   +  +    S+  + V++++R  AI H +  NK  ++K+   + +
Sbjct: 101 LFQWSFGEFACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSL 160

Query: 116 CGWSFAILMAVLPLFGI----SDYRKYATCLPFETTTGVWSLTYIVFLMFING--VAFLI 169
             W  +I+    P+  +     D+     C+ +   T  W L +I F + I G  +   I
Sbjct: 161 VIWIISIIETT-PILFVYTTKKDHETLICCMFYNNKTMNWKL-FINFEINIIGMLIPLTI 218

Query: 170 LMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWFPIALV-------ALTAICG 222
           L+ CY K+   ++G      N   I K + ++V   ++ W P  +         L    G
Sbjct: 219 LLYCYSKILITLKGINKSKKNKKAI-KLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSG 277

Query: 223 VHLVSLEQAKIFTVFVLPLNSCC-NPFLYAILTKQFKK 259
              V   Q  +    ++ L+ CC NP +YA +++ F K
Sbjct: 278 CKAVKYIQYALHVTEIISLSHCCINPLIYAFVSEFFNK 315


>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
          Length = 323

 Score = 53.4 bits (128), Expect = 1e-07
 Identities = 67/284 (23%), Positives = 128/284 (45%), Gaps = 33/284 (11%)

Query: 1   MLGNGTVVFVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPW 60
           ++GN  V+ VLI  R    V  +L  N+A +D  +      +   D   L E+       
Sbjct: 43  LIGNVLVIAVLIVKRFMFVVDVYLF-NIAMSDLMLVFSFPFIIHND---LNEWIFGEFMC 98

Query: 61  QMSAGCQMAGFMSVLSAELSVFTLAVITLERNYAITHAMHL-NKRLSLKHAGYIMVCGWS 119
           ++  G    GF S      ++F + +I+++R   + +A  + NK +SL  +  + V  W 
Sbjct: 99  KLVLGVYFVGFFS------NMFFVTLISIDRYILVVNATKIKNKSISL--SVLLSVAAWV 150

Query: 120 FAIL--MAVLPLFGISDYRKYATCLPFETTTGVWSL-TYIVFLMFING--VAFLILMGCY 174
            +++  M  + L+ + +      C+ F      +S   +  F + I G  +  +IL+ CY
Sbjct: 151 CSVILSMPAMVLYYVDNTDNLKQCI-FNDYHENFSWSAFFNFEINIFGIVIPLIILIYCY 209

Query: 175 LKMYCAIRGSQAWNSNDSR-IAKRMALLVFTDLLCWFPIALVAL-TAICGVHLVSLE--- 229
            K+   ++  +  N N +R I   + ++ FT +  W P  +V    ++  V L+ +    
Sbjct: 210 SKILYTLKNCK--NKNKTRSIKIILTVVTFTVVF-WVPFNIVLFINSLQSVGLIDIGCYH 266

Query: 230 -QAKIFTVFVLPLNS----CCNPFLYAILTKQFKKDC-VMICKT 267
            +  ++++ +  L S    C NP +YA + K FKK    M C+T
Sbjct: 267 FKKIVYSIDIAELISFVHCCVNPIIYAFVGKNFKKVFKNMFCRT 310


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 47.3 bits (112), Expect = 1e-05
 Identities = 58/304 (19%), Positives = 121/304 (39%), Gaps = 54/304 (17%)

Query: 1   MLGNGTVVFVLIFARSKIDVPRFLVCNLAAAD--FFMGLYLGLLAVVDASTLGEFRMYAI 58
           + GN  ++ +L   + K  +    + NLA +D  F +     +    D    G+F    I
Sbjct: 113 LFGNAAIIMILFCKKIKT-ITDIYIFNLAISDLIFVIDFPFIIYNEFDQWIFGDFMCKVI 171

Query: 59  PWQMSAGCQMAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGW 118
                +     GF S      ++F + +++++R +AI + +   K  +      + +  W
Sbjct: 172 -----SASYYIGFFS------NMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISW 220

Query: 119 SFAILMAVLPLF----------------GISDYRKYATCLPFETTTGVWSLTYIV-FLMF 161
             ++++     F                 IS+Y+   T +       +  L  I+ F + 
Sbjct: 221 ILSLIITSPAYFIFEASNIIFSAQDSNETISNYQ--CTLIEDNEKNNISFLGRILQFEIN 278

Query: 162 ING--VAFLILMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWFPIALVALTA 219
           I G  +  +I   CY+K+   ++  Q   S  ++    +++++   L+CW P+ +V L A
Sbjct: 279 ILGMFIPIIIFAFCYIKIILKLK--QLKKSKKTKSIIIVSIIIICSLICWIPLNIVILFA 336

Query: 220 ---------------ICGVHLVSLEQAKIFTVFVLPLNSCCNPFLYAILTKQFKKDCVMI 264
                          ICG   + L  A +    +   + C NP +Y ++ + F+   +MI
Sbjct: 337 TMYSFKGFNSIISEHICGF--IKLGYAMMLAEAISLTHCCINPLIYTLIGENFRMHLLMI 394

Query: 265 CKTI 268
            + I
Sbjct: 395 FRNI 398


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 37.1 bits (86), Expect = 0.025
 Identities = 41/244 (16%), Positives = 61/244 (25%), Gaps = 30/244 (12%)

Query: 300 DRSSREHFPHPQPCSCNLKLLGERMSKPRPHRPRWSATKKVRRFWDCLHLQRKNHRHIRT 359
            R  R   P     S      G R +          ++                      
Sbjct: 204 PRPPRRSSPISASASSPAPAPG-RSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAP 262

Query: 360 DQYAYQIAEIQQKQHKRASSVCSSENFS-SSRSDSWKQNHHHCGIPMRLLDPKRRASSWL 418
                +I E        +    +S + S   RS S   +    G          RASS  
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSG----PAPSSPRASS-- 316

Query: 419 VTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRS-SNSSDRS----KPRLTRQSAIVDD 473
            +  +S++S+ SS+ + S  S   A +     SRS S S             R       
Sbjct: 317 -SSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAP 375

Query: 474 PDLPNSPARLTVR----------------FLTTIPSAAETSVVNEEESFAILHPTPDASP 517
                S  R T R                          + +     S A     P  +P
Sbjct: 376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435

Query: 518 AGSP 521
           +G P
Sbjct: 436 SGEP 439



 Score = 29.4 bits (66), Expect = 6.4
 Identities = 30/153 (19%), Positives = 44/153 (28%), Gaps = 32/153 (20%)

Query: 375 KRASSVCSSENFSSSRSDSWKQNHHHCGIPM----RLLDPKRRASSWLVTRKTSQDSNLS 430
              +S   S +  SS      +N   C +P      L      AS W         ++ S
Sbjct: 231 DAGASSSDSSSSESSGCGWGPENE--CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288

Query: 431 SSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQSAIVDDPDLPNSPARLTVRFLTT 490
           SS  + S S    S S    S  + SS R+    +            +S +         
Sbjct: 289 SSPRERSPSP---SPSSPG-SGPAPSSPRASSSSSSSRESSSSSTSSSSESS-------- 336

Query: 491 IPSAAETSVVNEEESFAILHPTPDASPAGSPQR 523
                           A + P P  S + SP R
Sbjct: 337 --------------RGAAVSPGPSPSRSPSPSR 355


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 36.1 bits (83), Expect = 0.045
 Identities = 26/143 (18%), Positives = 46/143 (32%), Gaps = 4/143 (2%)

Query: 347 LHLQRKNHRHIRTDQYAYQIAEIQQKQH--KRASSVCSSENFSSSRSDSWKQNHHHCGIP 404
             L+           + +Q   I    H  K A S   +     + S+  + +       
Sbjct: 251 PELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGS 310

Query: 405 MRLLDPKRRASSWLVTRKTS--QDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKP 462
                    A++  +  K+S     +  +  N S  S ++A+ SV   S S N S + + 
Sbjct: 311 KETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRA 370

Query: 463 RLTRQSAIVDDPDLPNSPARLTV 485
             +   A     D     A  TV
Sbjct: 371 SKSSSGARDSKKDASGMSANGTV 393


>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
           subunit [Transcription].
          Length = 548

 Score = 35.0 bits (80), Expect = 0.11
 Identities = 21/111 (18%), Positives = 37/111 (33%), Gaps = 10/111 (9%)

Query: 430 SSSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQS-AIVDDPDLPNSPARLTVRFL 488
            SS N+ +       TS+S I  S       K +  ++  +I D    P +P        
Sbjct: 202 PSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKAPQRDVSISDRATTPIAPG-----VE 256

Query: 489 TTIPSAAETSVVNEEESFAILHP-TP---DASPAGSPQRKKKISLEEKQNS 535
           +   S + T      ++            D S  G+P     +  +E +N 
Sbjct: 257 SASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESEND 307


>gnl|CDD|238504 cd01031, EriC, ClC chloride channel EriC.  This domain is found in
           the EriC chloride transporters that mediate the extreme
           acid resistance response in eubacteria and archaea. This
           response allows bacteria to survive in the acidic
           environments by decarboxylation-linked proton
           utilization. As shown for Escherichia coli EriC, these
           channels can counterbalance the electric current
           produced by the outwardly directed virtual proton pump
           linked to amino acid decarboxylation.  The EriC proteins
           belong to the ClC superfamily of chloride ion channels,
           which share a unique double-barreled architecture and
           voltage-dependent gating mechanism.  The
           voltage-dependent gating is conferred by the permeating
           anion itself, acting as the gating charge. In
           Escherichia coli EriC, a glutamate residue that
           protrudes into the pore is thought to participate in
           gating by binding to a Cl- ion site within the
           selectivity filter.
          Length = 402

 Score = 34.4 bits (80), Expect = 0.14
 Identities = 30/157 (19%), Positives = 56/157 (35%), Gaps = 26/157 (16%)

Query: 21  PRFLVCNLAA---ADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMSAGCQMAGFMSVLSA 77
           P  L+  L A   ADF   L+ GL  V+    L    + +    +  G  +AG +  L  
Sbjct: 167 PLALLTALVASIAADFVSRLFFGLGPVLSIPPLPALPLKSYWLLLLLGI-IAGLLGYL-- 223

Query: 78  ELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGWSFAILMAVLPLFGISDYRK 137
               F  +++  +  Y       L + L       +++ G     L  +LP      +  
Sbjct: 224 ----FNRSLLKSQDLYRKLK--KLPREL------RVLLPGLLIGPLGLLLPEALGGGHG- 270

Query: 138 YATCLPFETTTGVWSLTYIVFLMFINGVAFLILMGCY 174
               L      G +S++ ++ +  +    F+  M  Y
Sbjct: 271 ----LILSLAGGNFSISLLLLIFVLR---FIFTMLSY 300


>gnl|CDD|184922 PRK14958, PRK14958, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 509

 Score = 33.6 bits (76), Expect = 0.32
 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 7/78 (8%)

Query: 458 DRSKPRLTRQSAIVDDPDLPNSPARLT------VRFLTTIPSAAETSVVNEEESFAILHP 511
           DR   +L  Q  ++   DL  SP   T      +R L   PSAA  ++   E   A++ P
Sbjct: 318 DREDVQLFYQIGLIGQRDLAYSPTPQTGFEMTLLRMLAFYPSAA-NAIPETESKEAVVTP 376

Query: 512 TPDASPAGSPQRKKKISL 529
           TP +S    PQ   ++ L
Sbjct: 377 TPKSSETSWPQLLSQLKL 394


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
           short domain found in bacterial type II/III secretory
           system proteins. The architecture of these proteins
           suggest that this family may be functionally analogous
           to pfam03958.
          Length = 95

 Score = 30.8 bits (70), Expect = 0.37
 Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 5/66 (7%)

Query: 403 IPMRLLDPKRRASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKP 462
            P+  L+ KR  SS      TS  S   SS   +S S+++ S++    S SS+    S  
Sbjct: 4   FPVNYLNMKRSGSS-----NTSVTSGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGT 58

Query: 463 RLTRQS 468
           R+T +S
Sbjct: 59  RITTES 64



 Score = 30.4 bits (69), Expect = 0.59
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 424 SQDSNLSSSRNDSSGSANTASTSVSRISRSSNSS 457
           S  S+ SS+   SS S++  S+S +RI+  S S 
Sbjct: 34  SSSSSNSSNGGSSSSSSSGDSSSGTRITTESESD 67


>gnl|CDD|119052 pfam10532, Plant_all_beta, Plant specific N-all beta domain.  This
           domain was identified by Babu and colleagues. It is
           found associated with the WRKY domain pfam03106.
          Length = 114

 Score = 29.8 bits (67), Expect = 1.1
 Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 20/76 (26%)

Query: 460 SKPRLTRQSAIVDDPDLPNSPARLTVRFLTTIPSAAETSVVNEEESFAILHPTPDASPAG 519
           SKP+  R++ IVDD D+        + +LT+         ++EE    +LH     +   
Sbjct: 21  SKPK--RETYIVDDEDV--------LVYLTS---------LDEEGRRPVLH-VEVVNDLE 60

Query: 520 SPQRKKKISLEEKQNS 535
             +R +++S  E+++S
Sbjct: 61  ENERLEQVSRVERRSS 76


>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
           Srx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srx is part of the Srg
           superfamily of chemoreceptors. Chemoperception is one of
           the central senses of soil nematodes like C. elegans
           which are otherwise 'blind' and 'deaf'.
          Length = 275

 Score = 31.0 bits (71), Expect = 1.3
 Identities = 34/160 (21%), Positives = 62/160 (38%), Gaps = 18/160 (11%)

Query: 68  MAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGWSFAILMAVL 127
           + G + +   E+S  T  +I+L R  A+       K  S+K+   I+   W  AI+++ L
Sbjct: 69  IGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIITFIWIIAIIISTL 128

Query: 128 PLFGISDY----RKYATCLPFETTTGVWSLTYIVFLMFINGVAFLILMGC--YLKMYCAI 181
             F +  +     +  T    E         Y  FL  +  V    ++    ++K+    
Sbjct: 129 FYFPLGCHFYYSPESLTWSFDEDPPCAEIGWYGDFLKNLVLVIITNIINVITFIKLRKFS 188

Query: 182 RGSQAWNSNDS--RIAKRM---------ALLVFTDLLCWF 210
           + S   +S +S  R  K +          LL   DL+ +F
Sbjct: 189 KKSSL-SSEESKKRRKKEINFFKQTVIQDLLYLIDLINYF 227


>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
          Length = 456

 Score = 31.3 bits (71), Expect = 1.4
 Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 1/67 (1%)

Query: 268 IEESRVTRG-IGRCRHSSNFSNRQTPGNTNSIVDRSSREHFPHPQPCSCNLKLLGERMSK 326
           I+  R  RG  GR +       +  P         +S +    P     + K  GE+  +
Sbjct: 387 IQNGRQQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRR 446

Query: 327 PRPHRPR 333
            RP +P 
Sbjct: 447 RRPRKPA 453


>gnl|CDD|215858 pfam00324, AA_permease, Amino acid permease. 
          Length = 473

 Score = 31.2 bits (71), Expect = 1.5
 Identities = 12/67 (17%), Positives = 23/67 (34%), Gaps = 3/67 (4%)

Query: 156 IVFLMFINGVAFLILMGCYLKMYCAIRGSQAWNSNDSRIAK---RMALLVFTDLLCWFPI 212
           IVF   +     + L+   L     +R  +A+      I +   +  L  F  +L    +
Sbjct: 357 IVFNFLLAISGLIGLIVWGLISLSHLRFRKAFKYQGRSIDELPFKACLGPFGVILGLAAL 416

Query: 213 ALVALTA 219
             + L  
Sbjct: 417 ITILLIQ 423


>gnl|CDD|219123 pfam06653, Claudin_3, Tight junction protein, Claudin-like.  This
           is a family of probable membrane tight junction,
           Claudin-like, proteins.
          Length = 165

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 23/121 (19%), Positives = 48/121 (39%), Gaps = 14/121 (11%)

Query: 104 RLSLKHAGYIMVCGWSFAILMAVLPLFGISDYRKYATC---LPFETTTGVWSLTYIVFLM 160
           R+ LK  G ++V  +   I+    P +              +PF +T G W L    ++M
Sbjct: 3   RVKLKALGVLVVISFILNIVGVFTPGWITDTGGFLEPSIGIVPFYSTEGGW-LVAASWMM 61

Query: 161 FINGVAFLILMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWFPIALVALTAI 220
           +I+   FL+L+  Y+     ++      +  S   +++       L+    + +  LT +
Sbjct: 62  YISFALFLVLILIYIVAVRKVK-----KNGYSHYQRKLFF-----LIALLSLLIAILTVV 111

Query: 221 C 221
            
Sbjct: 112 A 112


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
           Srsx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srsx is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 257

 Score = 29.9 bits (68), Expect = 2.7
 Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 15/115 (13%)

Query: 140 TCLPFETTTGVWSLTYIVFLMFINGVAFLILMGCYLKMYCAIRG-SQAWNSNDSRIAKRM 198
            C P     G     + +  + IN V  LI+   Y+ +    +   Q  NS+  ++ K +
Sbjct: 134 VCAPPLALNGTAFRIFTLSSVIIN-VIVLIV---YIILIIIFKKKGQTTNSDSKKVFKSL 189

Query: 199 ALLVFTDLLCWFPIALVALTAICGVHLV-SLEQAKIFTV----FVLPLNSCCNPF 248
            + V   +  WF      +     + L    E   I  +    FV    S    F
Sbjct: 190 QVTVIIFIFGWF---TSTIANTVFLLLTEDGEVENIIQMYAGIFVNL--SFSQNF 239


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 564

 Score = 30.6 bits (69), Expect = 2.8
 Identities = 30/152 (19%), Positives = 47/152 (30%), Gaps = 7/152 (4%)

Query: 387 SSSRSDSWKQNHHHCGIPMRLLDPKRRASSW----LVTRKTSQDSNLSSSRNDSS--GSA 440
           S+ R  SW  +     I         R  S+    L  R  ++  +  S     S  G  
Sbjct: 139 SAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSELNLTDRTPAKVRSSRSELGAPSPSGGT 198

Query: 441 NTASTSVSRISRSSNSSDRSKPRLTRQSAIVDDPDLPNSPARLTVRFLTTIPSAAETSVV 500
           +  S+S  R S   +   R    L ++ A++  P  P S +           S++  S  
Sbjct: 199 SCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRA-RSSSAKSPF 257

Query: 501 NEEESFAILHPTPDASPAGSPQRKKKISLEEK 532
                          S   SP+   K S  E 
Sbjct: 258 KSSIQRKATKALSKLSLRASPKDTSKSSKSEV 289


>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
           transduction mechanisms].
          Length = 612

 Score = 30.3 bits (68), Expect = 2.9
 Identities = 27/105 (25%), Positives = 42/105 (40%), Gaps = 6/105 (5%)

Query: 426 DSNLSSSR--NDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQSAIVD-DPDLPNSPAR 482
             N + S   N S+ +  T S  V   SRS+N  +  +   T   + +    +L  +P+ 
Sbjct: 113 PPNHTRSSGNNLSNANVKTLSAPVGEHSRSNNPPNLDQNLDTEPESSISQWGELQLNPSG 172

Query: 483 LTVRFLTTIPSAAETSVVNEEESFAILHPTPDASPAGSPQRKKKI 527
            T   L++ PS   TS   + ES     PT   SP       K+ 
Sbjct: 173 KT---LSSQPSRKPTSENPKSESDNSKLPTSVNSPLPDKSLLKRT 214


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 393 and 442 amino
           acids in length.
          Length = 301

 Score = 30.1 bits (68), Expect = 3.1
 Identities = 19/73 (26%), Positives = 29/73 (39%), Gaps = 20/73 (27%)

Query: 416 SWLVTRKTS-------------------QDSNLSSSRNDSSGSANTASTSVSRISRSSNS 456
           SWLV R++S                   + S+  +SR   SG + T S ++SR    S S
Sbjct: 146 SWLVKRESSTSLVSEDEEDFEREAAREREHSSRRASRRGRSGYS-TPSAALSRRGSRSAS 204

Query: 457 SDRSKPRLTRQSA 469
              S+  L+    
Sbjct: 205 RRGSRADLSMTPL 217


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 30.2 bits (68), Expect = 3.4
 Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 12/89 (13%)

Query: 372 KQHKRASSVCSSENFSSSRSD-SWKQNHHHCGIPMRLLDPKRRASSWLVTRKTSQDSNLS 430
             H R     SS + S+  S  S ++N  H    M        +++ L+ R +       
Sbjct: 194 SPHFRQKF--SSSDTSNGFSYPSIRKNSRHSSNSMPSFP--HSSTAVLLKRHSGSSGA-- 247

Query: 431 SSRNDSSGSANTASTSVSRISRSSNSSDR 459
                S  S+N   +S +  + S++S   
Sbjct: 248 -----SLISSNITPSSSNSEAMSTSSKRP 271


>gnl|CDD|219068 pfam06511, IpaD, Invasion plasmid antigen IpaD.  This family
           consists of several invasion plasmid antigen IpaD
           proteins. Entry of Shigella flexneri into epithelial
           cells and lysis of the phagosome involve the IpaB, IpaC,
           and IpaD proteins, which are secreted by type III
           secretion machinery.
          Length = 337

 Score = 30.0 bits (67), Expect = 3.6
 Identities = 16/119 (13%), Positives = 43/119 (36%), Gaps = 6/119 (5%)

Query: 409 DPKRRASSWLVTRKTSQDSNLSSSRN-DSSGSANTASTSVSRISRSSNSSDRSKPRLTRQ 467
           +     +      ++  ++N SS+   ++     T++ ++  I  S+ +  +   + T  
Sbjct: 2   NITTSTNPISTASRSPNNTNGSSAETVNADVKTTTSNDTLHNIRTSNAALKKELSQQTLT 61

Query: 468 SAIVDDPDLPNSPARLTVRFLTTIPSA-AETSVVNEEESFAILHPTPDASP----AGSP 521
              +++    +S   + V     + +  A++      E   +LH    A       G P
Sbjct: 62  KTPLEEIAEHSSQITMDVNKSAQLLNILAKSGYPISAEQRELLHSGLSAPKEAELDGYP 120


>gnl|CDD|129861 TIGR00779, cad, cadmium resistance transporter (or sequestration)
          family protein.  These proteins are members of the
          Cadmium Resistance (CadD) Family (TC 2.A.77). To date,
          this family of proteins has only been found in
          Gram-positive bacteria. The CadD family includes
          several closely related Staphylococcal proteins
          reported to function in cadmium resistance. Members are
          predicted to span the membrane five times; the
          mechanism of resistance is believed to be export but
          has also been suggested to be binding and sequestration
          in the membrane. Closely related but outside the scope
          of this model is another staphylococcal protein that
          has been reported to possibly function in quaternary
          ammonium ion export. Still more distant are other
          members of the broader LysE family (see Vrljic. et al,
          PMID:10943564) [Transport and binding proteins, Amino
          acids, peptides and amines].
          Length = 193

 Score = 29.4 bits (66), Expect = 3.9
 Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 9/59 (15%)

Query: 7  VVFVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMSAG 65
          V+ ++ FAR+K    +         D ++G YLG + ++  S L  F +  IP +   G
Sbjct: 10 VILLIFFARAKRKEYK---------DIYIGQYLGSIILILVSLLLAFGVNLIPEKWVLG 59


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 29.7 bits (67), Expect = 4.5
 Identities = 31/114 (27%), Positives = 45/114 (39%), Gaps = 12/114 (10%)

Query: 414 ASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSR-ISRSSNSSDRSKPRLTRQSAIVD 472
           +SS   +R  S  S+ SSS + SS  ++T+     R  S SS     S   L + S+   
Sbjct: 2   SSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSP 61

Query: 473 DPDLPNSPARLTVRFLTTIPSAA-------ETSVVNEEESFAILHPTPD-ASPA 518
                   A      L  +  AA       E+++V+  E     HP P  A PA
Sbjct: 62  PSVTLPPAATTQTPQLNPLQRAAAAALDAVESALVSHLER---QHPLPKTADPA 112


>gnl|CDD|218145 pfam04553, Tis11B_N, Tis11B like protein, N terminus.  Members of
           this family always contain a tandem repeat of CCCH zinc
           fingers pfam00642. Tis11B, Tis11D and their homologues
           are thought to be regulatory proteins involved in the
           response to growth factors. The function of the N
           terminus is unknown.
          Length = 109

 Score = 27.8 bits (61), Expect = 5.3
 Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 5/49 (10%)

Query: 412 RRASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRS 460
           RR S   ++ K +Q+  L+S + D SG ++TA +     ++ +   DRS
Sbjct: 49  RRHSVTNLSSKFNQNQFLNSLKMDPSGVSSTALS-----NKENRFRDRS 92


>gnl|CDD|223025 PHA03253, PHA03253, UL35; Provisional.
          Length = 609

 Score = 29.1 bits (65), Expect = 6.3
 Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 8/52 (15%)

Query: 419 VTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKPR-LTRQSA 469
           V  + + +   SSSR   S S +T   S  +       S RS P  + RQ  
Sbjct: 455 VIERLNVNEGRSSSRASPSHSTSTIPYSPPQ-------SGRSTPTSILRQRT 499


>gnl|CDD|218666 pfam05633, DUF793, Protein of unknown function (DUF793).  This
           family consists of several plant proteins of unknown
           function.
          Length = 389

 Score = 29.0 bits (65), Expect = 6.6
 Identities = 20/76 (26%), Positives = 29/76 (38%), Gaps = 10/76 (13%)

Query: 378 SSVCSSENFSSSRSDSWKQNHHHCGIPMRLLDPKRRASSWLVTRKTSQD-------SNLS 430
            S     N S  RS  W  +H   G          R+ SW V+R  S         +NL+
Sbjct: 175 GSGGGHRNRSFGRS--WSFHHRSIGSSGGGTGGHLRSLSWSVSRNWSAAKQLQAMGANLA 232

Query: 431 SSR-NDSSGSANTAST 445
           + R ND++ +   A  
Sbjct: 233 TPRGNDAAATNGLAVP 248


>gnl|CDD|223469 COG0392, COG0392, Predicted integral membrane protein [Function
           unknown].
          Length = 322

 Score = 28.9 bits (65), Expect = 6.6
 Identities = 16/111 (14%), Positives = 29/111 (26%), Gaps = 3/111 (2%)

Query: 100 HLNKRLSLKHAGYIMVCGWSFAILMAVLPLFGISDYRKYATCLPFETTTGVWSLTYIVFL 159
            L  +LSL     + +  +    L  + P    S              +       + FL
Sbjct: 79  SLGVKLSLWK---VALASFIAYFLNNITPAALGSGPAVRYRLYSRLGLSVALVSRILAFL 135

Query: 160 MFINGVAFLILMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWF 210
           +   G+  L+L    L                      + LL+   +L   
Sbjct: 136 LLTFGLGLLLLFLLILLFVPLKLALFLLLLTLLLRILLILLLLAAAILLIL 186


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 29.1 bits (65), Expect = 6.7
 Identities = 28/138 (20%), Positives = 43/138 (31%), Gaps = 20/138 (14%)

Query: 371 QKQHKRASSVCSSENFSSSRSDSWKQNHHHCGI------------------PMRLLDPKR 412
           ++  K  S     E  SSS S S   +HHH                     P   L    
Sbjct: 756 EEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLPSPSSPLSSSS 815

Query: 413 RASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQSAIVD 472
                   ++  +  + SSS    S S+  +S+  S  S+   +    K   T +     
Sbjct: 816 PKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEG--KGSSTSKEHKGS 873

Query: 473 DPDLPNSPARLTVRFLTT 490
             D PN  +   V  L+ 
Sbjct: 874 SGDTPNKASSFPVPPLSN 891


>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5.  Members of this
           family of proteins are involved in maintaining Golgi
           structure. They stimulate the formation of Golgi stacks
           and ribbons, and are involved in intra-Golgi retrograde
           transport. Two main interactions have been
           characterized: one with RAB1A that has been activated by
           GTP-binding and another with isoform CASP of CUTL1.
          Length = 509

 Score = 29.0 bits (65), Expect = 7.0
 Identities = 18/80 (22%), Positives = 25/80 (31%), Gaps = 10/80 (12%)

Query: 408 LDPKRRASSWLVT-RKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTR 466
              +  ASS     R    + +   SR  S  S           SRS +S       L  
Sbjct: 45  WSVRETASSNKARSRSEKWNPDQPGSRVSSPSS------KKDGTSRSLSSQVDD---LAS 95

Query: 467 QSAIVDDPDLPNSPARLTVR 486
             +     DL +  A L +R
Sbjct: 96  AVSSQSSSDLEDELAALKIR 115


>gnl|CDD|224011 COG1086, COG1086, Predicted nucleoside-diphosphate sugar
          epimerases [Cell envelope biogenesis, outer membrane /
          Carbohydrate transport and metabolism].
          Length = 588

 Score = 29.2 bits (66), Expect = 7.4
 Identities = 13/84 (15%), Positives = 26/84 (30%), Gaps = 10/84 (11%)

Query: 8  VFVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMSAGCQ 67
          + +++ A          +  L   +    L+ G   +        F +Y   W+ S    
Sbjct: 7  LLLVLVA--------LALALLLRDELSASLWSGAQLLT-ICVFILFGLYRAVWRASVSDL 57

Query: 68 MAGFMSVLSAELSVFTLAVITLER 91
               +VL++   V  L   T   
Sbjct: 58 FIILKAVLTS-ALVSLLFFSTRTD 80


>gnl|CDD|224198 COG1279, COG1279, Lysine efflux permease [General function
           prediction only].
          Length = 202

 Score = 28.0 bits (63), Expect = 9.3
 Identities = 17/98 (17%), Positives = 36/98 (36%), Gaps = 16/98 (16%)

Query: 113 IMVCGWSFAILMAVLPLFGISDYRKYATCLPFETTTGVWSLTYIVFLMFINGVAFLILMG 172
            ++C  S  +L++   +FG+                 +    +++ ++   G AFL+  G
Sbjct: 40  ALLCAISDIVLIS-AGVFGVGAL--------------IAKSPWLLLIVRWGGAAFLLYYG 84

Query: 173 CYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWF 210
             L +  A RG       +    K   +L+F   +   
Sbjct: 85  -LLALKSAPRGPSQLQVAEFTKLKLKKVLLFALAVTLL 121


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 28.7 bits (64), Expect = 9.6
 Identities = 26/116 (22%), Positives = 41/116 (35%), Gaps = 17/116 (14%)

Query: 422 KTSQDSNLSSSRNDSSGS-----ANTASTSVSRISRSSNSSDRSKPRLTRQ--------S 468
           + +Q+S    + +  S        N   +SVS+++   + S   + R + Q        S
Sbjct: 592 RNTQNSLQIDTLDHESSEQPLEEKNALHSSVSKLNTEESPSKMVEIRPSCQDSVSESGVS 651

Query: 469 AIVDDPDLPNSPARLTVRFLTTIPSAAETSVVNEEESFAILHPTPDASPAGSPQRK 524
             V   D  N      V       S +  S+V  E S  +  PT   SP     RK
Sbjct: 652 TGVSVADESNDSENELVNCA----SPSSLSIVPVEVSPVLKSPTLSVSPRIRNSRK 703


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 28.8 bits (64), Expect = 9.7
 Identities = 21/106 (19%), Positives = 44/106 (41%), Gaps = 3/106 (2%)

Query: 369 IQQKQHKRASSVCSSENFSSSRSDSWKQNHHHCGIPMRLLDPKRRASSWLVTRKTSQDSN 428
            ++++ K+  ++       ++  D +   +     P++  D            K + DS+
Sbjct: 66  RKEEKKKKNLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSS 125

Query: 429 LSSSRNDSSGSANTASTSVSRISRSSNSSD-RSKPRLTR--QSAIV 471
            SSS + S  + +  S+  S  S  SNS   +  P++ +  QS  V
Sbjct: 126 SSSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIKENQSTHV 171


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 28.6 bits (64), Expect = 9.9
 Identities = 13/68 (19%), Positives = 26/68 (38%), Gaps = 1/68 (1%)

Query: 401 CGIPMRLLDPKRRASSWLVTRKTSQDSNLSSSRNDSSGS-ANTASTSVSRISRSSNSSDR 459
               M++   K++ +  L + K    + L    + +S S  +   +    ++    SS R
Sbjct: 231 LKPSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSR 290

Query: 460 SKPRLTRQ 467
             P L R 
Sbjct: 291 RTPSLLRS 298


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.133    0.409 

Gapped
Lambda     K      H
   0.267   0.0690    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,227,526
Number of extensions: 2458239
Number of successful extensions: 3017
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2975
Number of HSP's successfully gapped: 67
Length of query: 537
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 435
Effective length of database: 6,413,494
Effective search space: 2789869890
Effective search space used: 2789869890
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.4 bits)