RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8880
(141 letters)
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 48.7 bits (117), Expect = 6e-09
Identities = 18/60 (30%), Positives = 31/60 (51%)
Query: 12 SLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHNRI 71
+LK L +++N+ + + GA GL L L N + + +F L +L LDL+ N +
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 48.7 bits (117), Expect = 7e-09
Identities = 21/59 (35%), Positives = 34/59 (57%)
Query: 61 LSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDI 119
L LDL++NR+ + + L NL LDLS N L +++ + + L SL+ L L N++
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 48.3 bits (116), Expect = 9e-09
Identities = 21/58 (36%), Positives = 35/58 (60%)
Query: 36 KLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHN 93
L SL +N++ V+ D +F+ L L LDL+ N + ++S + + L +L +LDLS N
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGN 58
Score = 35.6 bits (83), Expect = 7e-04
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 8 RRLSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQI 47
+ L +LKVL ++ N +S+ A +GL L SL N +
Sbjct: 21 KGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 34.8 bits (81), Expect = 0.001
Identities = 18/57 (31%), Positives = 32/57 (56%)
Query: 84 NLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIGDNVTITTLTLSG 140
NL +LDLS+N L + L +L+ L L N+++ ++P+ ++ +L LSG
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSG 57
>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
kinase; Provisional.
Length = 968
Score = 51.8 bits (124), Expect = 1e-08
Identities = 37/133 (27%), Positives = 60/133 (45%), Gaps = 7/133 (5%)
Query: 10 LSSLKVLSIAHNQFSSVRRGAL---AGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDL 66
+ SL++LS+A N+F G L G +L +L NQ +LS L L L
Sbjct: 451 MPSLQMLSLARNKFF----GGLPDSFGSKRLENLDLSRNQFSGAVPRKLGSLSELMQLKL 506
Query: 67 AHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDV 126
+ N++ L+ L +LDLSHN L + + L +L L N +S P
Sbjct: 507 SENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKN 566
Query: 127 IGDNVTITTLTLS 139
+G+ ++ + +S
Sbjct: 567 LGNVESLVQVNIS 579
Score = 45.2 bits (107), Expect = 2e-06
Identities = 35/118 (29%), Positives = 56/118 (47%)
Query: 10 LSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHN 69
L+SL+ L++A NQ L + L + +N + + L++L+HLDL +N
Sbjct: 187 LTSLEFLTLASNQLVGQIPRELGQMKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLVYN 246
Query: 70 RIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVI 127
+ +SL +L NL L L N L I L+ L L L DN +S P+++
Sbjct: 247 NLTGPIPSSLGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLSDNSLSGEIPELV 304
Score = 43.3 bits (102), Expect = 1e-05
Identities = 28/90 (31%), Positives = 45/90 (50%)
Query: 6 TVRRLSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLD 65
+ LS L L ++ N+ S L+ KL SL HNQ+ SF + LS LD
Sbjct: 494 KLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLD 553
Query: 66 LAHNRIVAVSGASLAHLSNLTTLDLSHNFL 95
L+ N++ +L ++ +L +++SHN L
Sbjct: 554 LSQNQLSGEIPKNLGNVESLVQVNISHNHL 583
Score = 39.1 bits (91), Expect = 4e-04
Identities = 26/87 (29%), Positives = 39/87 (44%)
Query: 9 RLSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAH 68
L+ L ++ NQFS L L +L L N++ + L LDL+H
Sbjct: 473 GSKRLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSH 532
Query: 69 NRIVAVSGASLAHLSNLTTLDLSHNFL 95
N++ AS + + L+ LDLS N L
Sbjct: 533 NQLSGQIPASFSEMPVLSQLDLSQNQL 559
Score = 31.7 bits (72), Expect = 0.12
Identities = 27/91 (29%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 50 LEDHSFRALSTLSHLDLAHNRIV-AVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKS 108
+ D F S+L +L+L++N ++ S+ NL TLDLS+N L + I S
Sbjct: 109 IPDDIFTTSSSLRYLNLSNNNFTGSIPRGSI---PNLETLDLSNNMLSGEIPNDIGSFSS 165
Query: 109 LQELRLDDNDISMVTPDVIGDNVTITTLTLS 139
L+ L L N + P+ + + ++ LTL+
Sbjct: 166 LKVLDLGGNVLVGKIPNSLTNLTSLEFLTLA 196
Score = 30.6 bits (69), Expect = 0.23
Identities = 31/114 (27%), Positives = 48/114 (42%)
Query: 7 VRRLSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDL 66
V +L +L++L + N F+ AL L +L L N+ + + L+ LDL
Sbjct: 304 VIQLQNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDL 363
Query: 67 AHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDIS 120
+ N + L NL L L N L + +SL+ +RL DN S
Sbjct: 364 STNNLTGEIPEGLCSSGNLFKLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFS 417
Score = 30.6 bits (69), Expect = 0.27
Identities = 30/119 (25%), Positives = 50/119 (42%), Gaps = 28/119 (23%)
Query: 11 SSLKVLSIAHNQFS-SVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHN 69
SSL+ L++++N F+ S+ RG++ L+ L DL++N
Sbjct: 118 SSLRYLNLSNNNFTGSIPRGSIPNLETL---------------------------DLSNN 150
Query: 70 RIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIG 128
+ + S+L LDL N L + +T L SL+ L L N + P +G
Sbjct: 151 MLSGEIPNDIGSFSSLKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQLVGQIPRELG 209
Score = 27.5 bits (61), Expect = 2.8
Identities = 23/82 (28%), Positives = 40/82 (48%), Gaps = 3/82 (3%)
Query: 59 STLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRA-LTQDLITPLKSLQELRLDDN 117
S + +DL+ I +++ L + T++LS+N L + D+ T SL+ L L +N
Sbjct: 69 SRVVSIDLSGKNISGKISSAIFRLPYIQTINLSNNQLSGPIPDDIFTTSSSLRYLNLSNN 128
Query: 118 DISMVTPDVIGDNVTITTLTLS 139
+ + P G + TL LS
Sbjct: 129 NFTGSIPR--GSIPNLETLDLS 148
>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
Length = 623
Score = 46.7 bits (111), Expect = 6e-07
Identities = 22/72 (30%), Positives = 35/72 (48%)
Query: 57 ALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDD 116
L L ++L+ N I SL +++L LDLS+N + + L SL+ L L+
Sbjct: 440 KLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNG 499
Query: 117 NDISMVTPDVIG 128
N +S P +G
Sbjct: 500 NSLSGRVPAALG 511
Score = 34.0 bits (78), Expect = 0.018
Identities = 19/62 (30%), Positives = 31/62 (50%)
Query: 34 LDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHN 93
L L S++ N I S ++++L LDL++N SL L++L L+L+ N
Sbjct: 441 LRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGN 500
Query: 94 FL 95
L
Sbjct: 501 SL 502
Score = 29.8 bits (67), Expect = 0.46
Identities = 16/52 (30%), Positives = 24/52 (46%)
Query: 88 LDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIGDNVTITTLTLS 139
L L + LR + I+ L+ LQ + L N I P +G ++ L LS
Sbjct: 423 LGLDNQGLRGFIPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLS 474
Score = 28.2 bits (63), Expect = 1.7
Identities = 16/59 (27%), Positives = 30/59 (50%)
Query: 82 LSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIGDNVTITTLTLSG 140
L +L +++LS N +R + + SL+ L L N + P+ +G ++ L L+G
Sbjct: 441 LRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNG 499
>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
unknown].
Length = 394
Score = 45.7 bits (108), Expect = 2e-06
Identities = 39/111 (35%), Positives = 51/111 (45%), Gaps = 3/111 (2%)
Query: 30 ALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLD 89
L L LTSL +N I + S L LDL+ N+I ++ L +L NL LD
Sbjct: 111 ELLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDNKIESLPS-PLRNLPNLKNLD 169
Query: 90 LSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIGDNVTITTLTLSG 140
LS N L L L++ L +L L L N IS + P I + L LS
Sbjct: 170 LSFNDLSDLP-KLLSNLSNLNNLDLSGNKISDL-PPEIELLSALEELDLSN 218
Score = 42.3 bits (99), Expect = 2e-05
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 6/131 (4%)
Query: 10 LSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHN 69
L +LK L ++ N S + + L+ L L +L N+I L LS L LDL++N
Sbjct: 162 LPNLKNLDLSFNDLSDLPKL-LSNLSNLNNLDLSGNKISDLPPEIEL-LSALEELDLSNN 219
Query: 70 RIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIGD 129
I+ + SL++L NL+ L+LS+N L L + I L +L+ L L +N IS ++ +G
Sbjct: 220 SIIELLS-SLSNLKNLSGLELSNNKLEDLP-ESIGNLSNLETLDLSNNQISSISS--LGS 275
Query: 130 NVTITTLTLSG 140
+ L LSG
Sbjct: 276 LTNLRELDLSG 286
Score = 29.6 bits (66), Expect = 0.53
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 9 RLSSLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAH 68
LS+L+ L +++N + +L+ L L+ L +N++ S LS L LDL++
Sbjct: 207 LLSALEELDLSNNSIIELL-SSLSNLKNLSGLELSNNKL-EDLPESIGNLSNLETLDLSN 264
Query: 69 NRIVAVSGASLAHLSNLTTLDLSHN-FLRALTQDLITPLKSLQELRL 114
N+I ++S SL L+NL LDLS N AL + L L L
Sbjct: 265 NQISSIS--SLGSLTNLRELDLSGNSLSNALPLIALLLLLLELLLNL 309
>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
inhibitor (RI)-like subfamily. LRRs are 20-29 residue
sequence motifs present in many proteins that
participate in protein-protein interactions and have
different functions and cellular locations. LRRs
correspond to structural units consisting of a beta
strand (LxxLxLxxN/CxL conserved pattern) and an alpha
helix. This alignment contains 12 strands corresponding
to 11 full repeats, consistent with the extent observed
in the subfamily acting as Ran GTPase Activating
Proteins (RanGAP1).
Length = 319
Score = 36.9 bits (86), Expect = 0.002
Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 15/102 (14%)
Query: 53 HSFRALSTLSHLDLAHNRI-----VAVSGASLAHLSNLTTLDLSHNFLR-----ALTQDL 102
+A L LDL +N + A++ LA L +L L+L N L AL L
Sbjct: 187 EGLKANCNLEVLDLNNNGLTDEGASALAET-LASLKSLEVLNLGDNNLTDAGAAALASAL 245
Query: 103 ITPLKSLQELRLDDNDI----SMVTPDVIGDNVTITTLTLSG 140
++P SL L L NDI + +V+ + ++ L L G
Sbjct: 246 LSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRG 287
Score = 29.2 bits (66), Expect = 0.64
Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 11/77 (14%)
Query: 54 SFRALSTLSHLDLAHN----RIVAVSGASLAHLS-NLTTLDLSHNFL-----RALTQDLI 103
S S+L L L +N R + + L L L L L N L AL + L
Sbjct: 103 SLLRSSSLQELKLNNNGLGDRGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALR 162
Query: 104 TPLKSLQELRLDDNDIS 120
+ L+EL L +N I
Sbjct: 163 A-NRDLKELNLANNGIG 178
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number
of proteins with diverse functions and cellular
locations. These repeats are usually involved in
protein-protein interactions. Each Leucine Rich Repeat
is composed of a beta-alpha unit. These units form
elongated non-globular structures. Leucine Rich Repeats
are often flanked by cysteine rich domains.
Length = 43
Score = 31.7 bits (73), Expect = 0.012
Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 61 LSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALT 99
L LDL++N+I + L++L NL TLDLS N + L+
Sbjct: 3 LETLDLSNNQITDLPP--LSNLPNLETLDLSGNKITDLS 39
Score = 30.9 bits (71), Expect = 0.026
Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 84 NLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTP 124
NL TLDLS+N + L ++ L +L+ L L N I+ ++P
Sbjct: 2 NLETLDLSNNQITDL--PPLSNLPNLETLDLSGNKITDLSP 40
Score = 29.8 bits (68), Expect = 0.069
Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 12 SLKVLSIAHNQFSSVRRGALAGLDKLTSLSCHHNQI 47
+L+ L +++NQ + + L+ L L +L N+I
Sbjct: 2 NLETLDLSNNQITDL--PPLSNLPNLETLDLSGNKI 35
Score = 27.8 bits (63), Expect = 0.36
Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 2/36 (5%)
Query: 36 KLTSLSCHHNQIDVLEDHSFRALSTLSHLDLAHNRI 71
L +L +NQI L L L LDL+ N+I
Sbjct: 2 NLETLDLSNNQITDLP--PLSNLPNLETLDLSGNKI 35
>gnl|CDD|172730 PRK14242, PRK14242, phosphate transporter ATP-binding protein;
Provisional.
Length = 253
Score = 30.1 bits (68), Expect = 0.27
Identities = 23/75 (30%), Positives = 34/75 (45%), Gaps = 12/75 (16%)
Query: 55 FRALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQ--DLITPLKSLQEL 112
F+AL +S L+ N++ A+ G S S FLR L + DLI + E+
Sbjct: 19 FQALHDIS-LEFEQNQVTALIGPSGCGKST---------FLRCLNRMNDLIPGARVEGEI 68
Query: 113 RLDDNDISMVTPDVI 127
LD +I DV+
Sbjct: 69 LLDGENIYDPHVDVV 83
>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers.
Length = 24
Score = 26.5 bits (60), Expect = 0.64
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 82 LSNLTTLDLSHNFLRALTQDL 102
L NL LDLS+N L +L
Sbjct: 1 LPNLRELDLSNNQLSSLPPGA 21
Score = 24.2 bits (54), Expect = 5.3
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 10 LSSLKVLSIAHNQFSSVRRGALAG 33
L +L+ L +++NQ SS+ GA G
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
subfamily.
Length = 24
Score = 26.5 bits (60), Expect = 0.64
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 82 LSNLTTLDLSHNFLRALTQDL 102
L NL LDLS+N L +L
Sbjct: 1 LPNLRELDLSNNQLSSLPPGA 21
Score = 24.2 bits (54), Expect = 5.3
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 10 LSSLKVLSIAHNQFSSVRRGALAG 33
L +L+ L +++NQ SS+ GA G
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
>gnl|CDD|218275 pfam04812, HNF-1B_C, Hepatocyte nuclear factor 1 (HNF-1), beta
isoform C terminus. This family consists of a region
found within the alpha isoform and at the C terminus of
the beta isoform of the homeobox-containing
transcription factor of HNF-1. Different isoforms of
HNF-1 are generated by the differential use of
polyadenylation sites and by alternative splicing. The
C-terminal region of HNF-1 is responsible for the
activation of transcription. Mutations and polymorphisms
in HNF-1 cause the type 3 form of maturity-onset
diabetes of the young (MODY3).
Length = 237
Score = 29.1 bits (65), Expect = 0.74
Identities = 22/91 (24%), Positives = 38/91 (41%), Gaps = 4/91 (4%)
Query: 38 TSLSCHHNQIDVLEDHSFRALSTLSHLDLAHNRIVAVSGASLAHLSNLT---TLDLSHNF 94
+S H ++ VL+ S L L ++++VSG L +S LT +L S +
Sbjct: 49 SSTISHGDRASVLQQVSPGGLEPSHSLLSTDAKMISVSGGPLPPVSTLTNLHSLSQSSHH 108
Query: 95 LRALTQDLI-TPLKSLQELRLDDNDISMVTP 124
L TQ+LI L + + + +
Sbjct: 109 LHQQTQNLIMASLPGVMAISSLLIAQGLTSS 139
>gnl|CDD|100598 PRK00561, ppnK, inorganic polyphosphate/ATP-NAD kinase;
Provisional.
Length = 259
Score = 28.7 bits (64), Expect = 0.92
Identities = 16/59 (27%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 58 LSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDD 116
+ST ++ + A ++V ++ L ++ DL NF L Q T + L E+++DD
Sbjct: 47 VSTAANYNCAGCKVVGINTGHLGFYTSFNETDLDQNFANKLDQLKFTQI-DLLEVQIDD 104
>gnl|CDD|185268 PRK15370, PRK15370, E3 ubiquitin-protein ligase SlrP; Provisional.
Length = 754
Score = 28.9 bits (64), Expect = 0.93
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 8/62 (12%)
Query: 60 TLSHLDLAHNRIVAVSGASLAHL-SNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDND 118
T+ ++L+ NRI + L S L +LDL HN + L ++L + L+ L + DN
Sbjct: 242 TIQEMELSINRITELP----ERLPSALQSLDLFHNKISCLPENLP---EELRYLSVYDNS 294
Query: 119 IS 120
I
Sbjct: 295 IR 296
Score = 27.4 bits (60), Expect = 3.7
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 61 LSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELR 113
L LD++ N+I V +L +TTLD+S N L L ++L L+ +Q R
Sbjct: 348 LQVLDVSKNQI-TVLPETLP--PTITTLDVSRNALTNLPENLPAALQIMQASR 397
>gnl|CDD|213859 TIGR03767, P_acnes_RR, metallophosphoesterase, PPA1498 family.
This model describes a small collection of probable
metallophosphoresterases, related to pfam00149 but with
long inserts separating some of the shared motifs such
that the homology is apparent only through multiple
sequence alignment. Members of this protein family, in
general, have a Sec-independent TAT (twin-arginine
translocation) signal sequence, N-terminal to the region
modeled by This model. Members include YP_056203.1 from
Propionibacterium acnes KPA171202.
Length = 496
Score = 28.7 bits (64), Expect = 0.99
Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 2/65 (3%)
Query: 43 HHNQIDVLEDHSFRALSTLSHLDL-AHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQD 101
H V +D F ++T SH+D RI+ ++ +S TTL + A
Sbjct: 397 HRRVEGVGKDKGFWEINTASHIDFPQQGRIIELADNQDGTVSIFTTL-IESAAPYAWDAA 455
Query: 102 LITPL 106
+ PL
Sbjct: 456 DLEPL 460
>gnl|CDD|198362 cd07789, FGGY_CsGK_like, Cellulomonas sp. glycerol kinase-like
proteins; belongs to the FGGY family of carbohydrate
kinases. This subgroup corresponds to a small group of
bacterial glycerol kinases (GK) with similarity to
Cellulomonas sp. glycerol kinase (CsGK). CsGK might
exist as a dimer. Its monomer is composed of two large
domains separated by a deep cleft that forms the active
site. This model includes both the N-terminal domain,
which adopts a ribonuclease H-like fold, and the
structurally related C-terminal domain. The regulation
of the catalytic activity of this group has not yet been
examined. Members in this subgroup belong to the FGGY
family of carbohydrate kinases.
Length = 495
Score = 28.5 bits (64), Expect = 1.2
Identities = 18/61 (29%), Positives = 21/61 (34%), Gaps = 4/61 (6%)
Query: 74 VSGASLAHLSNLTTLDLSHNFLRALTQDLITPLKSLQELRLDDNDISMVTPDVIGDNVTI 133
V+ AS L +L TLD L A P L E+R V I V I
Sbjct: 182 VTNASRTMLMDLETLDWDPELLAAFG----IPRSMLPEIRPSSEVYGTVRGRGILAGVPI 237
Query: 134 T 134
Sbjct: 238 A 238
>gnl|CDD|233613 TIGR01877, cas_cas6, CRISPR-associated endoribonuclease Cas6.
CRISPR is a term for Clustered, Regularly Interspaced
Short Palidromic Repeats. A number of protein families
appear only in association with these repeats and are
designated Cas (CRISPR-Associated) proteins. This
broadly distributed, highly divergent Cas family is now
characterized as an endoribonuclease that generates
guide RNAs for host defense against phage and other
invaders. The family contains a C-terminal motif
GXGXXXXXGXG, where the each X between two Gly is
hydrophobic and the spacer XXXXX contains (usually) one
Arg or Lys. The seed alignment for the current version
of this model has gappy columns removed. Members of this
protein family are found associated with several
different CRISPR/cas system subtypes, and consequently
we designate this family Cas6 [Mobile and
extrachromosomal element functions, Other].
Length = 199
Score = 27.7 bits (62), Expect = 1.7
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 5/45 (11%)
Query: 98 LTQDLITPLKSLQELRLDDN-----DISMVTPDVIGDNVTITTLT 137
L + L L L E +L +I ++ P I D VT TL+
Sbjct: 45 LIESLYNGLLKLPEFKLGGLSFEVKEIKVLEPPEITDEVTFKTLS 89
>gnl|CDD|236150 PRK08097, ligB, NAD-dependent DNA ligase LigB; Reviewed.
Length = 562
Score = 27.6 bits (62), Expect = 2.3
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 10/42 (23%)
Query: 103 ITPLKSLQELRLDDNDISMV---------TPDVI-GDNVTIT 134
IT + L+ + LDD +S V D+ GD V ++
Sbjct: 328 ITVVLELEPVMLDDKRVSRVNIGSVRRWQQWDIAPGDQVLVS 369
>gnl|CDD|224267 COG1348, NifH, Nitrogenase subunit NifH (ATPase) [Inorganic ion
transport and metabolism].
Length = 278
Score = 26.9 bits (60), Expect = 3.5
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 103 ITPLKSLQELRLDDNDISMVTPDVIGDNV 131
IT + L+EL + D+ +V DV+GD V
Sbjct: 102 ITAINLLEELGAFEEDLDVVIYDVLGDVV 130
>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
TRaffic INducer (NOSTRIN). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Nitric Oxide Synthase TRaffic
INducer (NOSTRIN) is expressed in endothelial and
epithelial cells and is involved in the regulation,
trafficking and targeting of endothelial NOS (eNOS).
NOSTRIN facilitates the endocytosis of eNOS by
coordinating the functions of dynamin and the
Wiskott-Aldrich syndrome protein (WASP). Increased
expression of NOSTRIN may be correlated to preeclampsia.
NOSTRIN contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
The F-BAR domain of NOSTRIN is necessary and sufficient
for its membrane association and is responsible for its
subcellular localization.
Length = 239
Score = 26.2 bits (58), Expect = 6.3
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 89 DLSHNFLRALTQDLITPLKSLQE 111
D+ N ALT++ I PL+ + +
Sbjct: 76 DIHRNLGSALTEEAIKPLRQVLD 98
>gnl|CDD|223001 PHA03148, PHA03148, hypothetical protein; Provisional.
Length = 289
Score = 26.1 bits (58), Expect = 6.5
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 80 AHLSNLTTLDLSHNFLRALTQDLITPLKSL 109
A + +T ++ SHN LR + I PL+ L
Sbjct: 116 APYNTVTIMNRSHNSLRLVPSLDINPLQHL 145
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The
Polycystin Cation Channel (PCC) Family (TC 1.A.5)
Polycystin is a huge protein of 4303aas. Its repeated
leucine-rich (LRR) segment is found in many proteins.
It contains 16 polycystic kidney disease (PKD) domains,
one LDL-receptor class A domain, one C-type lectin
family domain, and 16-18 putative TMSs in positions
between residues 2200 and 4100. Polycystin-L has been
shown to be a cation (Na+, K+ and Ca2+) channel that is
activated by Ca2+. Two members of the PCC family
(polycystin 1 and 2) are mutated in autosomal dominant
polycystic kidney disease, and polycystin-L is deleted
in mice with renal and retinal defects. Note: this
model is restricted to the amino half for technical
reasons.
Length = 2740
Score = 26.6 bits (58), Expect = 7.0
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 65 DLAHNRIVAVSGASLAHLSNLTTLDLSHN 93
D+++N+I + A+L NL+ +DLS N
Sbjct: 1 DISNNKISTIEEGICANLCNLSEIDLSGN 29
>gnl|CDD|236009 PRK07392, PRK07392, threonine-phosphate decarboxylase; Validated.
Length = 360
Score = 26.1 bits (58), Expect = 7.4
Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 7/70 (10%)
Query: 47 IDVLEDHSFRALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLITPL 106
I L D F+ T + L A + LA L LT L + NFL +Q L
Sbjct: 256 IAALADRDFQQQ-TWAWLPPAREALFQ----GLASLPGLTPLPSAANFLLVQSQGSALQL 310
Query: 107 --KSLQELRL 114
K LQ+ R+
Sbjct: 311 QEKLLQQHRI 320
>gnl|CDD|235112 PRK03244, argD, acetylornithine aminotransferase; Provisional.
Length = 398
Score = 26.0 bits (58), Expect = 7.4
Identities = 14/32 (43%), Positives = 20/32 (62%), Gaps = 6/32 (18%)
Query: 57 ALSTLSHLDLAHNRIV-AVSG--ASLAHLSNL 85
A++ L H AH +V AV+ A+L H+SNL
Sbjct: 53 AVNALGH---AHPAVVEAVTRQLATLGHVSNL 81
>gnl|CDD|223318 COG0240, GpsA, Glycerol-3-phosphate dehydrogenase [Energy
production and conversion].
Length = 329
Score = 26.0 bits (58), Expect = 7.8
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 50 LEDHSFRALSTLSHLDLAHNRIVAVSGASLA 80
LE + R LS + +L N I +SG S A
Sbjct: 108 LEPETGRLLSEIIEEELPDNPIAVLSGPSFA 138
>gnl|CDD|176465 cd01362, Fumarase_classII, Class II fumarases. This subgroup
contains Escherichia coli fumarase C, human
mitochondrial fumarase, and related proteins. It is a
member of the Lyase class I family. Members of this
family for the most part catalyze similar
beta-elimination reactions in which a C-N or C-O bond is
cleaved with the release of fumarate as one of the
products. These proteins are active as tetramers. The
four active sites of the homotetrameric enzyme are each
formed by residues from three different subunits.
Fumarase catalyzes the reversible hydration/dehydration
of fumarate to L-malate during the Krebs cycle.
Length = 455
Score = 25.9 bits (58), Expect = 8.0
Identities = 24/111 (21%), Positives = 37/111 (33%), Gaps = 52/111 (46%)
Query: 52 DHSFRALSTLSHLDLAHNRIVAVSGASLAHLSNLTTLDLSHNFLRALTQDLIT------- 104
+ F AL+ AH+ +V SGA L+ L L+
Sbjct: 258 PNKFEALA-------AHDALVEASGA-----------------LKTLAVSLMKIANDIRW 293
Query: 105 ----PLKSLQELRLDDND-----------------ISMVTPDVIGDNVTIT 134
P L EL L +N+ ++MV V+G++ IT
Sbjct: 294 LGSGPRCGLGELSLPENEPGSSIMPGKVNPTQCEALTMVAAQVMGNDAAIT 344
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 26.0 bits (57), Expect = 9.3
Identities = 21/87 (24%), Positives = 35/87 (40%), Gaps = 14/87 (16%)
Query: 57 ALSTLSHLDLAHNRIVAVSGASL----AHLSNLTTLDLSHNFLRALTQDLITPLK----- 107
+ L +DL+ N I + L A++ NL ++ S F +L + L
Sbjct: 28 MMDELVEVDLSGNTIGTEAMEELCNVIANVRNLRVVNFSDAFTGRDKDELYSNLVMLLKA 87
Query: 108 -----SLQELRLDDNDISMVTPDVIGD 129
LQ++ L DN P+ +GD
Sbjct: 88 LLKCPRLQKVDLSDNAFGSEFPEELGD 114
>gnl|CDD|181611 PRK09014, rfaH, transcriptional activator RfaH; Provisional.
Length = 162
Score = 25.6 bits (57), Expect = 9.5
Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 5/35 (14%)
Query: 100 QDLITPLKSLQELRLDDNDISMVTPDVIGDNVTIT 134
D+I L + ++ D + TP GD V IT
Sbjct: 90 SDVIYQLSVYKPEKIVDPE----TPKP-GDKVIIT 119
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.132 0.369
Gapped
Lambda K H
0.267 0.0777 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,860,536
Number of extensions: 603253
Number of successful extensions: 680
Number of sequences better than 10.0: 1
Number of HSP's gapped: 648
Number of HSP's successfully gapped: 78
Length of query: 141
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 54
Effective length of database: 7,078,804
Effective search space: 382255416
Effective search space used: 382255416
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (24.5 bits)