RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17682
         (490 letters)



>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score =  249 bits (636), Expect = 6e-77
 Identities = 121/316 (38%), Positives = 186/316 (58%), Gaps = 6/316 (1%)

Query: 152 GRINADHALVGGHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDL 211
           G++ ++  ++ G +GP++D+A++P D   + + SED  +  W IP+ G+++ +++P+V L
Sbjct: 62  GKLASNPPILLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHL 121

Query: 212 LYHQRRVGLVLWHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGS 271
             H ++VG+V +HP+A NVL +AG+D  V +W+V  G+ +  I+CH D + S  WN DGS
Sbjct: 122 QGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS 181

Query: 272 KLLTTCKDKKMRIINPRTGAIEEEAVCHEGSKATRAIYLK-SGLIFTTGFSKMSERQYSL 330
            L TT KDKK+ II+PR G I      H  +K+ R ++ K   LI T G SK  +RQ  L
Sbjct: 182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIML 241

Query: 331 RAPDHLDEPIVMVELDTSNGVMFPLYDPDTNLVYLCGKGDSVIRYFEITAEPPFVHYINT 390
                +  P   V+LD S+ +  P +D DTNL+Y+  KG+  IR FE+  E        +
Sbjct: 242 WDTRKMASPYSTVDLDQSSALFIPFFDEDTNLLYIGSKGEGNIRCFELMNERLTFCS--S 299

Query: 391 FQTPDPQRGIGMMPKRGCDVNSCEIGRFFRLNNSGLCQVVSMTVPRKS--ELFQDDLYPD 448
           + + +P +G+ MMPK   D   CEI RF+ L    L   + M +PRK      Q D+YP 
Sbjct: 300 YSSVEPHKGLCMMPKWSLDTRKCEIARFYALTYHSLYT-IQMLLPRKQADSELQVDVYPP 358

Query: 449 TVGDTAALTADEWFAG 464
           T  D  A+TADE+F+G
Sbjct: 359 TFADHPAITADEYFSG 374



 Score = 36.8 bits (85), Expect = 0.024
 Identities = 15/31 (48%), Positives = 24/31 (77%)

Query: 34  TEPVVDLLYHQRRVGLVLWHPTAQNVLLTAG 64
           ++P+V L  H ++VG+V +HP+A NVL +AG
Sbjct: 115 SDPIVHLQGHTKKVGIVSFHPSAMNVLASAG 145



 Score = 29.1 bits (65), Expect = 6.7
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 1   DLYPDTVGDTAALTADEWF 19
           D+YP T  D  A+TADE+F
Sbjct: 354 DVYPPTFADHPAITADEYF 372


>gnl|CDD|220080 pfam08954, DUF1900, Domain of unknown function (DUF1900).  This
           domain is predominantly found in the structural protein
           coronin, and is duplicated in some sequences. It has no
           known function.
          Length = 136

 Score =  218 bits (559), Expect = 4e-70
 Identities = 68/129 (52%), Positives = 90/129 (69%), Gaps = 2/129 (1%)

Query: 339 PIVMVELDTSNGVMFPLYDPDTNLVYLCGKGDSVIRYFEITAEPPFVHYINTFQTPDPQR 398
           P+   ELDTS+GV+ P YD DT L+YL GKGD  IRY+E+T E P++HY++ + + +PQ+
Sbjct: 1   PLATEELDTSSGVLIPFYDEDTGLLYLAGKGDGNIRYYEVTDEEPYLHYLSEYTSTEPQK 60

Query: 399 GIGMMPKRGCDVNSCEIGRFFRLNNSGLCQVVSMTVPRK-SELFQDDLYPDTVGDTAALT 457
           G   +PKR  DV  CEI RF++L N    + +S  VPRK SELFQ+DLYPDT G   ALT
Sbjct: 61  GFCFLPKRALDVKKCEIARFYKLTNDS-IEPISFIVPRKNSELFQEDLYPDTAGTEPALT 119

Query: 458 ADEWFAGNE 466
           A+EW +G  
Sbjct: 120 AEEWLSGKN 128



 Score = 41.0 bits (97), Expect = 2e-04
 Identities = 13/20 (65%), Positives = 15/20 (75%)

Query: 1   DLYPDTVGDTAALTADEWFA 20
           DLYPDT G   ALTA+EW +
Sbjct: 106 DLYPDTAGTEPALTAEEWLS 125


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score =  185 bits (472), Expect = 2e-52
 Identities = 105/328 (32%), Positives = 171/328 (52%), Gaps = 22/328 (6%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGIS-RTLTEPVVDLLYHQRRVGLV 221
           GH   +LD+ ++P    ++ASGSED  ++VW+IP    S + + +P   L  H++++ ++
Sbjct: 72  GHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISII 131

Query: 222 LWHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLTTCKDKK 281
            W+P    ++ ++G D+ V IW++   +   QI   P  + S  WN  G+ L  TC  K 
Sbjct: 132 DWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINM-PKKLSSLKWNIKGNLLSGTCVGKH 190

Query: 282 MRIINPRTGAIEEEAVCHEGSKATRAIYLKSGL------IFTTGFSKMSERQYSLRAPDH 335
           M II+PR   I      H+G K T+ I++  GL      I +TGFSK + R+  L    +
Sbjct: 191 MHIIDPRKQEIASSFHIHDGGKNTKNIWI-DGLGGDDNYILSTGFSKNNMREMKLWDLKN 249

Query: 336 LDEPIVMVELDTSNGVMFPLYDPDTNLVYLCGKGDSVIRYFEITAEPPFVHYINTFQTPD 395
               +V + +D ++  + P YD  T L+YL GKGD   RY++ +     +  +N +++  
Sbjct: 250 TTSALVTMSIDNASAPLIPHYDESTGLIYLIGKGDGNCRYYQHSLGS--IRKVNEYKSCS 307

Query: 396 PQRGIGMMPKRGCDVNSCEIGRFFRLNNSGLCQVVSMTVPRK-SELFQDDLYPDTVGDTA 454
           P R  G +PK+ CDV  CEIGR ++  N+   + +S  VPRK    FQ+DLYP  +    
Sbjct: 308 PFRSFGFLPKQICDVYKCEIGRVYKNENNSSIRPISFYVPRKNPTKFQEDLYPPILMHDP 367

Query: 455 ALTADEWFAG----------NELFQDDL 472
             ++  W  G           +L +DDL
Sbjct: 368 ERSSRNWIDGKDNKMKRINIKDLTEDDL 395


>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 97.8 bits (244), Expect = 8e-23
 Identities = 56/219 (25%), Positives = 95/219 (43%), Gaps = 16/219 (7%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQRRVGLVL 222
           GH GPV D+A S  D   +ASGS D  +++W +  G   RTLT        H   V  V 
Sbjct: 49  GHTGPVRDVAASA-DGTYLASGSSDKTIRLWDLETGECVRTLTG-------HTSYVSSVA 100

Query: 223 WHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLTTCKDKKM 282
           + P  + +L ++  D  + +W+V TG+ +  +  H D V S +++ DG+ + ++ +D  +
Sbjct: 101 FSPDGR-ILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159

Query: 283 RIINPRTGAIEEEAVCHEGSKATRAIYLKSGLIFTTGFSKMSERQYSLRAPDHLDEPIVM 342
           ++ + RTG        H G       +   G    +  S  + + + L     L      
Sbjct: 160 KLWDLRTGKCVATLTGHTGE-VNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH 218

Query: 343 VELDTSNGVMFPLYDPDTNLVYLCGKGDSVIRYFEITAE 381
                 NGV    + PD  L+   G  D  IR +++   
Sbjct: 219 -----ENGVNSVAFSPDGYLL-ASGSEDGTIRVWDLRTG 251



 Score = 86.2 bits (214), Expect = 1e-18
 Identities = 40/140 (28%), Positives = 65/140 (46%), Gaps = 9/140 (6%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQRRVGLVL 222
           GH G V  +A+SP D  ++A+GS D  +KVW +  G + RTL         H   V  V 
Sbjct: 7   GHTGGVTCVAFSP-DGKLLATGSGDGTIKVWDLETGELLRTLKG-------HTGPVRDVA 58

Query: 223 WHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLTTCKDKKM 282
                   L +  SD  + +W++ TGE +  +  H   V S +++ DG  L ++ +DK +
Sbjct: 59  ASA-DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTI 117

Query: 283 RIINPRTGAIEEEAVCHEGS 302
           ++ +  TG        H   
Sbjct: 118 KVWDVETGKCLTTLRGHTDW 137



 Score = 84.3 bits (209), Expect = 4e-18
 Identities = 42/141 (29%), Positives = 70/141 (49%), Gaps = 11/141 (7%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQRRVGLVL 222
           GH   V  +A+SP D   +AS S+D  +K+W +  G    TLT        H   V  V 
Sbjct: 133 GHTDWVNSVAFSP-DGTFVASSSQDGTIKLWDLRTGKCVATLTG-------HTGEVNSVA 184

Query: 223 WHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLTTC-KDKK 281
           + P  +  LL++ SD  + +W++ TG+ +  +  H + V S +++ DG  LL +  +D  
Sbjct: 185 FSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGT 242

Query: 282 MRIINPRTGAIEEEAVCHEGS 302
           +R+ + RTG   +    H  S
Sbjct: 243 IRVWDLRTGECVQTLSGHTNS 263



 Score = 80.1 bits (198), Expect = 1e-16
 Identities = 39/123 (31%), Positives = 60/123 (48%), Gaps = 11/123 (8%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQRRVGLVL 222
           GH G V  +A+SP D   + S S D  +K+W +  G    TL         H+  V  V 
Sbjct: 175 GHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLR-------GHENGVNSVA 226

Query: 223 WHPTAQNVLLTAGS-DNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLTTCKDKK 281
           + P     LL +GS D  + +W++ TGE +  +  H + V S +W+ DG +L +   D  
Sbjct: 227 FSPD--GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGT 284

Query: 282 MRI 284
           +RI
Sbjct: 285 IRI 287



 Score = 63.5 bits (155), Expect = 4e-11
 Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 9/82 (10%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQRRVGLVL 222
           GH+  V  +A+SP D  ++ASGSED  ++VW +  G   +TL+        H   V  + 
Sbjct: 217 GHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQTLSG-------HTNSVTSLA 268

Query: 223 WHPTAQNVLLTAGSDNNVAIWN 244
           W P  +  L +  +D  + IW+
Sbjct: 269 WSPDGK-RLASGSADGTIRIWD 289



 Score = 52.0 bits (125), Expect = 2e-07
 Identities = 30/162 (18%), Positives = 61/162 (37%), Gaps = 10/162 (6%)

Query: 214 HQRRVGLVLWHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKL 273
           H   V  V + P    +L T   D  + +W++ TGE++  ++ H   V   + + DG+ L
Sbjct: 8   HTGGVTCVAFSPD-GKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYL 66

Query: 274 LTTCKDKKMRIINPRTGAIEEEAVCHEGSKATRAI-YLKSGLIFTTGFSKMSERQYSLRA 332
            +   DK +R+ +  TG        H       ++ +   G I ++     + + + +  
Sbjct: 67  ASGSSDKTIRLWDLETGECVRTLTGHT--SYVSSVAFSPDGRILSSSSRDKTIKVWDVE- 123

Query: 333 PDHLDEPIVMVELDTSNGVMFPLYDPDTNLVYLCGKGDSVIR 374
                   +      ++ V    + PD   V      D  I+
Sbjct: 124 ----TGKCLTTLRGHTDWVNSVAFSPDGTFVA-SSSQDGTIK 160



 Score = 36.2 bits (84), Expect = 0.030
 Identities = 13/70 (18%), Positives = 25/70 (35%)

Query: 253 QIECHPDIVYSASWNWDGSKLLTTCKDKKMRIINPRTGAIEEEAVCHEGSKATRAIYLKS 312
            ++ H   V   +++ DG  L T   D  +++ +  TG +      H G     A     
Sbjct: 4   TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADG 63

Query: 313 GLIFTTGFSK 322
             + +    K
Sbjct: 64  TYLASGSSDK 73


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score = 74.4 bits (181), Expect = 3e-14
 Identities = 58/344 (16%), Positives = 121/344 (35%), Gaps = 27/344 (7%)

Query: 25  LTYFSPRTLTEPVVDLLYHQRRVGLVLWHPTAQNVLLTAGKEAEPILISLKDG----YIP 80
           +  +   T  + +  L  H   V  + + P  + +   +  +    L  L+ G     + 
Sbjct: 136 VKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLA 195

Query: 81  PSEKSKSALKVTKRSNVLDKVGGKGGANRN-STANEITISEDYVKEFKEEIRKLKSQQAL 139
                 S+L  +    +L   G   G  R    +    +           +       +L
Sbjct: 196 GHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSL 255

Query: 140 SILPPPSTAPNTGRI------NADHALVGGHKGPVLDIAWSPHDDNVIASGSEDCVVKVW 193
                  ++  T R+      ++    + GH   VL +A+SP D  ++ASGS D  V++W
Sbjct: 256 LAS---GSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSP-DGKLLASGSSDGTVRLW 311

Query: 194 QIPDGGISRTLTEPVVDLLYHQRRVGLVLWHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQ 253
            +  G +  +LT     L  H+  V  + + P    ++     D  + +W++ TG+ +  
Sbjct: 312 DLETGKLLSSLT-----LKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKT 366

Query: 254 IECHPDIVYSASWNWDGSKLLTTCKDKKMRIINPRTGAIEEEAVCHEGSKATRAIYLKSG 313
           +E H + V S S++ DG  + +   D  +R+ +  TG++      H     +        
Sbjct: 367 LEGHSN-VLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGK 425

Query: 314 LIFTTGFSKMSERQYSLRAPDHLDEPIVMVELDTSNGVMFPLYD 357
            + +           ++R  D       +        +     D
Sbjct: 426 SLASGSSDN------TIRLWDLKTSLKSVSFSPDGKVLASKSSD 463



 Score = 69.0 bits (167), Expect = 2e-12
 Identities = 52/224 (23%), Positives = 99/224 (44%), Gaps = 18/224 (8%)

Query: 160 LVGGHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDG-GISRTLTEPVVDLLYHQRRV 218
           L+ GH+  +  IA+SP  +  + SGS D  +K+W + +G  + ++L         H   V
Sbjct: 60  LLRGHEDSITSIAFSPDGEL-LLSGSSDGTIKLWDLDNGEKLIKSLEG------LHDSSV 112

Query: 219 GLVLWHPTAQNVLLTAGS--DNNVAIWNVGT-GEIMIQIECHPDIVYSASWNWDGSKLLT 275
             +       N +L A S  D  V +W++ T G+++  +E H + V S +++ DG  L +
Sbjct: 113 SKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLAS 172

Query: 276 T-CKDKKMRIINPRTGAIEEEAVCHEGSKATRAIYLKSGLIFTTGFSKMSERQYSLRAPD 334
               D  +++ + RTG        H    ++ A     GL+  +G S  + R + L    
Sbjct: 173 GSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLS--- 229

Query: 335 HLDEPIVMVELDTSNGVMFPLYDPDTNLVYLCGKGDSVIRYFEI 378
                ++   L   +  +   + PD +L+      D  IR +++
Sbjct: 230 --TGKLLRSTLSGHSDSVVSSFSPDGSLLASGSS-DGTIRLWDL 270



 Score = 67.8 bits (164), Expect = 4e-12
 Identities = 34/138 (24%), Positives = 61/138 (44%), Gaps = 9/138 (6%)

Query: 158 HALVGGHKGPVLDIAWSPHDDNVIAS--GSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQ 215
            +L G H   V  +A S  D N I     S D  VK+W +       T  + +  L  H 
Sbjct: 102 KSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDL------STPGKLIRTLEGHS 155

Query: 216 RRVGLVLWHPTAQNVLLTAGSDNNVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLT 275
             V  + + P  + +   +  D  + +W++ TG+ +  +  H D V S +++ DG  L+ 
Sbjct: 156 ESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIA 215

Query: 276 TC-KDKKMRIINPRTGAI 292
           +   D  +R+ +  TG +
Sbjct: 216 SGSSDGTIRLWDLSTGKL 233


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 47.3 bits (113), Expect = 2e-07
 Identities = 18/31 (58%), Positives = 24/31 (77%), Gaps = 1/31 (3%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVW 193
           GH GPV  +A+SP D N++ASGS+D  V+VW
Sbjct: 9   GHTGPVTSVAFSP-DGNLLASGSDDGTVRVW 38



 Score = 33.5 bits (77), Expect = 0.011
 Identities = 9/39 (23%), Positives = 22/39 (56%)

Query: 248 GEIMIQIECHPDIVYSASWNWDGSKLLTTCKDKKMRIIN 286
           G+++  ++ H   V S +++ DG+ L +   D  +R+ +
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 30.4 bits (69), Expect = 0.13
 Identities = 9/31 (29%), Positives = 14/31 (45%), Gaps = 1/31 (3%)

Query: 214 HQRRVGLVLWHPTAQNVLLTAGSDNNVAIWN 244
           H   V  V + P   N+L +   D  V +W+
Sbjct: 10  HTGPVTSVAFSPD-GNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 46.5 bits (111), Expect = 3e-07
 Identities = 16/31 (51%), Positives = 22/31 (70%), Gaps = 1/31 (3%)

Query: 163 GHKGPVLDIAWSPHDDNVIASGSEDCVVKVW 193
           GH GPV  +A+SP D   +ASGS+D  +K+W
Sbjct: 10  GHTGPVTSVAFSP-DGKYLASGSDDGTIKLW 39



 Score = 35.0 bits (81), Expect = 0.003
 Identities = 9/40 (22%), Positives = 22/40 (55%)

Query: 247 TGEIMIQIECHPDIVYSASWNWDGSKLLTTCKDKKMRIIN 286
           +GE++  ++ H   V S +++ DG  L +   D  +++ +
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 30.0 bits (68), Expect = 0.23
 Identities = 9/39 (23%), Positives = 15/39 (38%), Gaps = 1/39 (2%)

Query: 206 EPVVDLLYHQRRVGLVLWHPTAQNVLLTAGSDNNVAIWN 244
           E +  L  H   V  V + P     L +   D  + +W+
Sbjct: 3   ELLKTLKGHTGPVTSVAFSPD-GKYLASGSDDGTIKLWD 40


>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
           This is a family of eukaryotic translation initiation
           factors.
          Length = 194

 Score = 39.9 bits (94), Expect = 0.001
 Identities = 30/131 (22%), Positives = 49/131 (37%), Gaps = 27/131 (20%)

Query: 165 KGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLT------EPVVDLLYHQRRV 218
           +GP+ D AWSP       +G E  V+       G +   +T        +  L    R  
Sbjct: 59  EGPIHDFAWSP-------NGKEFAVIY------GYMPAKITFFDLKGNVIHSLGEQPRNT 105

Query: 219 GLVLWHPTAQNVLLTAGSDN---NVAIWNVGTGEIMIQIECHPDIVYSASWNWDGSKLLT 275
             + W P    ++L AG  N    +  W+V   + +   E   +      W+ DG   LT
Sbjct: 106 --IFWSPFG-RLVLLAGFGNLAGQIEFWDVKNKKKIATAE-ASNATDCE-WSPDGRYFLT 160

Query: 276 TCKDKKMRIIN 286
                ++R+ N
Sbjct: 161 ATTSPRLRVDN 171


>gnl|CDD|143380 cd07875, STKc_JNK1, Catalytic domain of the Serine/Threonine
           Kinase, c-Jun N-terminal Kinase 1.  Serine/Threonine
           Kinases (STKs), c-Jun N-terminal kinase 1 (JNK1)
           subfamily, catalytic (c) domain. STKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The
           JNK1 subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. JNKs are mitogen-activated protein kinases
           (MAPKs) that are involved in many stress-activated
           responses including those during inflammation,
           neurodegeneration, apoptosis, and persistent pain
           sensitization, among others. Vetebrates harbor three
           different JNK genes (Jnk1, Jnk2, and Jnk3). JNK1, like
           JNK2, is expressed in every cell and tissue type.
           Initially it was thought that JNK1 and JNK2 were
           functionally redundant as mice deficient in either genes
           (Jnk1 or Jnk2) could survive but disruption of both
           genes resulted in lethality. However, recent studies
           have shown that JNK1 and JNK2 perform distinct functions
           through specific binding partners and substrates. JNK1
           specifically binds with JAMP (JNK1-associated membrane
           protein), which regulates the duration of JNK1 activity
           in response to stimuli. Specific JNK1 substrates include
           Itch and SG10, which are implicated in Th2 responses and
           airway inflammation, and microtubule dynamics and
           axodendritic length, respectively. Mice deficient in
           Jnk1 are protected against arthritis, obesity, type 2
           diabetes, cardiac cell death, and non-alcoholic liver
           disease, suggesting that JNK1 may play roles in the
           pathogenesis of these diseases.
          Length = 364

 Score = 32.3 bits (73), Expect = 0.59
 Identities = 31/108 (28%), Positives = 46/108 (42%), Gaps = 27/108 (25%)

Query: 187 DCVVKVWQIPDGGISRT-----LTEPVVDLLYHQRRVGLVLWHPTAQNVLLTAGSDNNVA 241
           DC +K   I D G++RT     +  P V   Y++           A  V+L  G   NV 
Sbjct: 162 DCTLK---ILDFGLARTAGTSFMMTPYVVTRYYR-----------APEVILGMGYKENVD 207

Query: 242 IWNVGTGEIMIQIECH----PDIVYSASWNWDGSKLLTTCKD--KKMR 283
           IW+VG   IM ++       P   +   WN    +L T C +  KK++
Sbjct: 208 IWSVGC--IMGEMIKGGVLFPGTDHIDQWNKVIEQLGTPCPEFMKKLQ 253


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 32.4 bits (73), Expect = 0.63
 Identities = 22/82 (26%), Positives = 44/82 (53%), Gaps = 15/82 (18%)

Query: 171 IAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEPVVDLLYHQRRVGLVLW---HPTA 227
           I W+ +  + +AS + + VV+VW +         ++ V ++  H++RV    W   + +A
Sbjct: 538 ICWNSYIKSQVASSNFEGVVQVWDV-------ARSQLVTEMKEHEKRV----WSIDYSSA 586

Query: 228 QNVLLTAGSDN-NVAIWNVGTG 248
              LL +GSD+ +V +W++  G
Sbjct: 587 DPTLLASGSDDGSVKLWSINQG 608



 Score = 31.2 bits (70), Expect = 1.3
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 164 HKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDG 198
           H+  V  I +S  D  ++ASGS+D  VK+W I  G
Sbjct: 574 HEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQG 608


>gnl|CDD|241456 cd13302, PH2_Pleckstrin_2, Pleckstrin 2 Pleckstrin homology (PH)
           domain, repeat 2.  Pleckstrin is a protein found in
           platelets. This name is derived from platelet and
           leukocyte C kinase substrate and the KSTR string of
           amino acids. Pleckstrin 2 contains two PH domains and a
           DEP (dishvelled, egl-10, and pleckstrin) domain. Unlike
           pleckstrin 1, pleckstrin 2 does not contain obvious
           sites of PKC phosphorylation. Pleckstrin 2 plays a role
           in actin rearrangement, large lamellipodia and
           peripheral ruffle formation, and may help orchestrate
           cytoskeletal arrangement. The PH domains of pleckstrin 2
           are thought to contribute to lamellipodia formation.
           This cd contains the second PH domain repeat. PH domains
           have diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 109

 Score = 29.8 bits (67), Expect = 1.0
 Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 3/43 (6%)

Query: 372 VIRYFEITAEPPFVHYINTFQTPDPQRGIGMMPKRGCDVNSCE 414
            +R F +  +P ++HY +  +  DP   I +   RGC V + E
Sbjct: 24  KVRKFVLRDDPAYLHYYDPAKGEDPLGAIHL---RGCVVTAVE 63


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 31.4 bits (71), Expect = 1.4
 Identities = 23/104 (22%), Positives = 36/104 (34%), Gaps = 11/104 (10%)

Query: 155 NADHALVGGHKGPVLDIAWSPHDDNVIASGSEDCVVKVWQIPDGGISRTLTEP-VVDLLY 213
              H        PV  ++ SP    + +  +    V+ W  P          P V  L +
Sbjct: 22  EVIHTRFESENWPVAYVSESPLGTYLFSEHAAG--VECWGGPSKAKLVRFRHPDVKYLDF 79

Query: 214 HQRRVGLVLW------HPTAQNVLLTAGSDNNVAIWNVGTGEIM 251
                 LV W       P  +    T  S NNV +W++ +G I+
Sbjct: 80  SPNEKYLVTWSREPIIEPEIEISPFT--SKNNVFVWDIASGMIV 121


>gnl|CDD|143379 cd07874, STKc_JNK3, Catalytic domain of the Serine/Threonine
           Kinase, c-Jun N-terminal Kinase 3.  Serine/Threonine
           Kinases (STKs), c-Jun N-terminal kinase 3 (JNK3)
           subfamily, catalytic (c) domain. STKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The
           JNK3 subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. JNKs are mitogen-activated protein kinases
           (MAPKs) that are involved in many stress-activated
           responses including those during inflammation,
           neurodegeneration, apoptosis, and persistent pain
           sensitization, among others. Vetebrates harbor three
           different JNK genes (Jnk1, Jnk2, and Jnk3). JNK3 is
           expressed primarily in the brain, and to a lesser extent
           in the heart and testis. Mice deficient in Jnk3 are
           protected against kainic acid-induced seizures, stroke,
           sciatic axotomy neural death, and neuronal death due to
           NGF deprivation, oxidative stress, or exposure to
           beta-amyloid peptide. This suggests that JNK3 may play
           roles in the pathogenesis of these diseases.
          Length = 355

 Score = 30.1 bits (67), Expect = 2.8
 Identities = 30/100 (30%), Positives = 42/100 (42%), Gaps = 21/100 (21%)

Query: 187 DCVVKVWQIPDGGISRT-----LTEPVVDLLYHQRRVGLVLWHPTAQNVLLTAGSDNNVA 241
           DC +K   I D G++RT     +  P V   Y++           A  V+L  G   NV 
Sbjct: 155 DCTLK---ILDFGLARTAGTSFMMTPYVVTRYYR-----------APEVILGMGYKENVD 200

Query: 242 IWNVG--TGEIMIQIECHPDIVYSASWNWDGSKLLTTCKD 279
           IW+VG   GE++      P   Y   WN    +L T C +
Sbjct: 201 IWSVGCIMGEMVRHKILFPGRDYIDQWNKVIEQLGTPCPE 240


>gnl|CDD|181784 PRK09334, PRK09334, 30S ribosomal protein S25e; Provisional.
          Length = 86

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 8/35 (22%), Positives = 16/35 (45%)

Query: 100 KVGGKGGANRNSTANEITISEDYVKEFKEEIRKLK 134
           K        +   +  IT+ E+ +K   +E++K K
Sbjct: 6   KKKEGSKTEKEIKSTIITLDEELLKRVAKEVKKEK 40


>gnl|CDD|225928 COG3393, COG3393, Predicted acetyltransferase [General function
           prediction only].
          Length = 268

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 13/43 (30%), Positives = 17/43 (39%), Gaps = 1/43 (2%)

Query: 359 DTNLVYLCGKGDSVIRYFEITAEPPFVHYINTFQTPDPQRGIG 401
            +   +L G G  ++   E  AE P    IN   T    RG G
Sbjct: 176 RSRTYFLEGDGK-IVAKAETAAENPAYAQINGVYTHPEYRGKG 217


>gnl|CDD|213558 TIGR00762, DegV, EDD domain protein, DegV family.  This family of
           proteins is related to DegV of Bacillus subtilis and
           includes paralogous sets in several species (B.
           subtilis, Deinococcus radiodurans, Mycoplasma
           pneumoniae) that are closer in percent identity to each
           than to most homologs from other species. This suggests
           both recent paralogy and diversity of function. DegV
           itself is encoded immediately downstream of DegU, a
           transcriptional regulator of degradation, but is itself
           uncharacterized. Crystallography suggested a
           lipid-binding site, while comparison of the crystal
           structure to dihydroxyacetone kinase and to a mannose
           transporter EIIA domain suggests a conserved domain,
           EDD, with phosphotransferase activity [Unknown function,
           General].
          Length = 275

 Score = 29.4 bits (67), Expect = 4.0
 Identities = 17/81 (20%), Positives = 33/81 (40%), Gaps = 14/81 (17%)

Query: 69  PILISLKDGYIPPSEKSKSALKVTKR--SNVLDKVGGKGGANRNSTANEITIS----EDY 122
           PIL ++ DG + P EK +   K  K+    V + +              + I     E+ 
Sbjct: 185 PIL-TVDDGKLVPIEKVRGRKKAIKKLVELVKEDIKDGKPK-------RVAIIHADAEEE 236

Query: 123 VKEFKEEIRKLKSQQALSILP 143
            +E KE++++    + + I  
Sbjct: 237 AEELKEKLKEKFPVKEILISE 257


>gnl|CDD|217340 pfam03055, RPE65, Retinal pigment epithelial membrane protein.
           This family represents a retinal pigment epithelial
           membrane receptor which is abundantly expressed in
           retinal pigment epithelium, and binds plasma retinal
           binding protein. The family also includes the sequence
           related neoxanthin cleavage enzyme in plants and
           lignostilbene-alpha,beta-dioxygenase in bacteria.
          Length = 469

 Score = 29.1 bits (66), Expect = 6.5
 Identities = 14/54 (25%), Positives = 20/54 (37%), Gaps = 8/54 (14%)

Query: 354 PLYDPDT----NLVYLCGKGDSVIRYFEITAEPPFVHYINTFQTPDPQRGIGMM 403
           P  DP T    N     G     + Y+E+ A+   V  +  F  P P     M+
Sbjct: 164 PKVDPVTGELVNFGLSLGPKPPYLTYYEVDADGKLVREVPIFSLPGP----SMI 213


>gnl|CDD|227691 COG5404, SulA, SOS-response cell division inhibitor, blocks FtsZ
           ring formation [Cell division and chromosome
           partitioning].
          Length = 169

 Score = 28.4 bits (63), Expect = 6.6
 Identities = 11/40 (27%), Positives = 15/40 (37%), Gaps = 6/40 (15%)

Query: 262 YSASWNWDGSKLLTTCKDKKMRIINPRTGAIEEEAVCHEG 301
            SAS++   S +  T         NP T  +  E V  E 
Sbjct: 9   RSASFSSATSNIART------STDNPTTAGLISEVVYSED 42


>gnl|CDD|185735 cd08994, GH43_like_2, Glycosyl hydrolase 43-like family consists of
           hypothetical proteins.  This subfamily mostly contains
           uncharacterized proteins similar to glycosyl hydrolase
           family 43 (GH43) which includes enzymes with
           beta-xylosidase (EC 3.2.1.37) and
           alpha-L-arabinofuranosidase (EC 3.2.1.55) and possibly
           bifunctional xylosidase/arabinofuranosidase activities.
           GH43 are inverting enzymes (i.e. they invert the
           stereochemistry of the anomeric carbon atom of the
           substrate) that have an aspartate as the catalytic
           general base, a glutamate as the catalytic general acid
           and another aspartate that is responsible for pKa
           modulation and orienting the catalytic acid. Many of the
           enzymes in this family display both
           alpha-L-arabinofuranosidase and beta-D-xylosidase
           activity using aryl-glycosides as substrates. A common
           structural feature of GH43 enzymes is a 5-bladed
           beta-propeller domain that contains the catalytic acid
           and catalytic base. A long V-shaped groove, partially
           enclosed at one end, forms a single extended
           substrate-binding surface across the face of the
           propeller.
          Length = 291

 Score = 28.9 bits (65), Expect = 6.8
 Identities = 10/40 (25%), Positives = 14/40 (35%)

Query: 357 DPDTNLVYLCGKGDSVIRYFEITAEPPFVHYINTFQTPDP 396
               N  Y     DS    +   + PPF+   +   T DP
Sbjct: 170 PTKGNRKYGVAIADSPTGPYTKVSGPPFIEPGDNGNTEDP 209


>gnl|CDD|219928 pfam08602, Mgr1, Mgr1-like, i-AAA protease complex subunit.  The S.
           cerevisiae Mgr1 protein has been shown to be required
           for mitochondrial viability in yeast lacking
           mitochondrial DNA. It is a mitochondrial inner membrane
           protein, which interacts with Yme1 and is a new subunit
           of the i-AAA protease complex.
          Length = 362

 Score = 28.7 bits (64), Expect = 7.2
 Identities = 12/57 (21%), Positives = 19/57 (33%), Gaps = 6/57 (10%)

Query: 196 PDGGISRTLTEPVVDLLYHQRRVGLVLWHPTAQNVLLTAGSDNNVAIWNVGTGEIMI 252
             G    +   P       +  +GL LW P      L   SDN   ++ +   +  I
Sbjct: 17  SGGSGDNSKDIPNPAKFIPRPSLGLKLWGP------LVPASDNLPGLYTLVGLQTGI 67


>gnl|CDD|223820 COG0749, PolA, DNA polymerase I - 3'-5' exonuclease and polymerase
           domains [DNA replication, recombination, and repair].
          Length = 593

 Score = 28.8 bits (65), Expect = 7.2
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 271 SKLLTTCKDKKMRIINPRTGAI 292
           +KL +T  D   ++INP TG I
Sbjct: 299 AKLKSTYTDGLPKLINPDTGRI 320


>gnl|CDD|188483 TIGR03968, mycofact_TetR, mycofactocin system transcriptional
           regulator.  Members of this family are TetR family
           putative transcriptional regulators that occur in genome
           contexts near proteins of the mycofactocin system. These
           include the precursor peptide (TIGR03969), a radical SAM
           peptide maturase (TIGR03962), and a putative carrier
           protein (TIGR03967).
          Length = 190

 Score = 28.0 bits (63), Expect = 8.9
 Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 4/40 (10%)

Query: 428 QVVSMTVPRKSELFQDDLYPDTVGDT---AALTA-DEWFA 463
           +V++  V R+  L  DDL P TVG      AL+A ++W A
Sbjct: 129 RVIAEFVARRLGLPPDDLLPRTVGWLTLGVALSAYEQWLA 168


>gnl|CDD|238746 cd01469, vWA_integrins_alpha_subunit, Integrins are a class of
           adhesion receptors that link the extracellular matrix to
           the cytoskeleton and cooperate with growth factor
           receptors to promote celll survival, cell cycle
           progression and cell migration. Integrins consist of an
           alpha and a beta sub-unit. Each sub-unit has a large
           extracellular portion, a single transmembrane segment
           and a short cytoplasmic domain. The N-terminal domains
           of the alpha and beta subunits associate to form the
           integrin headpiece, which contains the ligand binding
           site, whereas the C-terminal segments traverse the
           plasma membrane and mediate interaction with the
           cytoskeleton and with signalling proteins.The VWA
           domains present in the alpha subunits of integrins seem
           to be a chordate specific radiation of the gene family
           being found only in vertebrates. They mediate
           protein-protein interactions.
          Length = 177

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 26/109 (23%), Positives = 46/109 (42%), Gaps = 13/109 (11%)

Query: 23  FRLTYFSPRTLTEPVVDLLYHQRRVGLVLWHPTAQNVLLT--------AGKEAEPILISL 74
           F L  +  RT  EP+  L+ H  ++  +    TA   ++T        A K+A  +L+ +
Sbjct: 54  FTLNEY--RTKEEPLS-LVKHISQLLGLTNTATAIQYVVTELFSESNGARKDATKVLVVI 110

Query: 75  KDGYIPPSEKSKSALKVTKRSNVLDKVGGKGGA-NRNSTANEI-TISED 121
            DG        K  +   +R  ++    G GG   R ++  E+ TI+  
Sbjct: 111 TDGESHDDPLLKDVIPQAEREGIIRYAIGVGGHFQRENSREELKTIASK 159


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.416 

Gapped
Lambda     K      H
   0.267   0.0728    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,244,582
Number of extensions: 2453411
Number of successful extensions: 2282
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2236
Number of HSP's successfully gapped: 48
Length of query: 490
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 389
Effective length of database: 6,457,848
Effective search space: 2512102872
Effective search space used: 2512102872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.3 bits)