RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6681
(537 letters)
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs, mainly
the seven transmembrane helices, GCPRs of the rhodopsin
superfamily. All opsins bind a chromophore, such as
11-cis-retinal. The function of most opsins other than
the photoisomerases is split into two steps: light
absorption and G-protein activation. Photoisomerases, on
the other hand, are not coupled to G-proteins - they are
thought to generate and supply the chromophore that is
used by visual opsins.
Length = 251
Score = 123 bits (312), Expect = 3e-32
Identities = 55/259 (21%), Positives = 109/259 (42%), Gaps = 25/259 (9%)
Query: 9 FVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMS-AGCQ 67
V++ + + NLA AD L L A+ W A C+
Sbjct: 1 LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGG--------DWPFGDALCK 52
Query: 68 MAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGWSFAILMAVL 127
+ GF+ V++ S+ L I+++R AI H + + + + A +++ W A+L+++
Sbjct: 53 LVGFLFVVNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLP 112
Query: 128 PLFGI----SDYRKYATCLPFETTTGVWSLTYIVFLMFINGVAFLILMGCYLKMYCAIRG 183
PL + TCL ++ + + L+++ CY + +R
Sbjct: 113 PLLFSWLRTVEEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRK 172
Query: 184 S----------QAWNSNDSRIAKRMALLVFTDLLCWFPIALVA-LTAICGVHLV-SLEQA 231
+ +S + + AK + ++V +LCW P +V L ++C + + L A
Sbjct: 173 RARSGASQARAKRSSSKERKAAKMLLVVVVVFVLCWLPYHIVLLLDSLCPLSIWRLLPTA 232
Query: 232 KIFTVFVLPLNSCCNPFLY 250
+ T+++ +NSC NP +Y
Sbjct: 233 LLITLWLAYVNSCLNPIIY 251
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 67.5 bits (165), Expect = 3e-12
Identities = 66/278 (23%), Positives = 121/278 (43%), Gaps = 36/278 (12%)
Query: 1 MLGNGTVVFVLIFARSKIDVPRFLVCNLAAAD--FFMGLYLGLLAVVDASTLGEFRMYAI 58
++GN V++VL + K + +L+ NLA +D F M L + Y I
Sbjct: 55 LVGNIIVIYVLTKTKIKTPMDIYLL-NLAVSDLLFVMTLPFQI-------------YYYI 100
Query: 59 PWQMSAG---CQMAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMV 115
+Q S G C++ + + S+ + V++++R AI H + NK ++K+ + +
Sbjct: 101 LFQWSFGEFACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSL 160
Query: 116 CGWSFAILMAVLPLFGI----SDYRKYATCLPFETTTGVWSLTYIVFLMFING--VAFLI 169
W +I+ P+ + D+ C+ + T W L +I F + I G + I
Sbjct: 161 VIWIISIIETT-PILFVYTTKKDHETLICCMFYNNKTMNWKL-FINFEINIIGMLIPLTI 218
Query: 170 LMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWFPIALV-------ALTAICG 222
L+ CY K+ ++G N I K + ++V ++ W P + L G
Sbjct: 219 LLYCYSKILITLKGINKSKKNKKAI-KLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSG 277
Query: 223 VHLVSLEQAKIFTVFVLPLNSCC-NPFLYAILTKQFKK 259
V Q + ++ L+ CC NP +YA +++ F K
Sbjct: 278 CKAVKYIQYALHVTEIISLSHCCINPLIYAFVSEFFNK 315
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 53.4 bits (128), Expect = 1e-07
Identities = 67/284 (23%), Positives = 128/284 (45%), Gaps = 33/284 (11%)
Query: 1 MLGNGTVVFVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPW 60
++GN V+ VLI R V +L N+A +D + + D L E+
Sbjct: 43 LIGNVLVIAVLIVKRFMFVVDVYLF-NIAMSDLMLVFSFPFIIHND---LNEWIFGEFMC 98
Query: 61 QMSAGCQMAGFMSVLSAELSVFTLAVITLERNYAITHAMHL-NKRLSLKHAGYIMVCGWS 119
++ G GF S ++F + +I+++R + +A + NK +SL + + V W
Sbjct: 99 KLVLGVYFVGFFS------NMFFVTLISIDRYILVVNATKIKNKSISL--SVLLSVAAWV 150
Query: 120 FAIL--MAVLPLFGISDYRKYATCLPFETTTGVWSL-TYIVFLMFING--VAFLILMGCY 174
+++ M + L+ + + C+ F +S + F + I G + +IL+ CY
Sbjct: 151 CSVILSMPAMVLYYVDNTDNLKQCI-FNDYHENFSWSAFFNFEINIFGIVIPLIILIYCY 209
Query: 175 LKMYCAIRGSQAWNSNDSR-IAKRMALLVFTDLLCWFPIALVAL-TAICGVHLVSLE--- 229
K+ ++ + N N +R I + ++ FT + W P +V ++ V L+ +
Sbjct: 210 SKILYTLKNCK--NKNKTRSIKIILTVVTFTVVF-WVPFNIVLFINSLQSVGLIDIGCYH 266
Query: 230 -QAKIFTVFVLPLNS----CCNPFLYAILTKQFKKDC-VMICKT 267
+ ++++ + L S C NP +YA + K FKK M C+T
Sbjct: 267 FKKIVYSIDIAELISFVHCCVNPIIYAFVGKNFKKVFKNMFCRT 310
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 47.3 bits (112), Expect = 1e-05
Identities = 58/304 (19%), Positives = 121/304 (39%), Gaps = 54/304 (17%)
Query: 1 MLGNGTVVFVLIFARSKIDVPRFLVCNLAAAD--FFMGLYLGLLAVVDASTLGEFRMYAI 58
+ GN ++ +L + K + + NLA +D F + + D G+F I
Sbjct: 113 LFGNAAIIMILFCKKIKT-ITDIYIFNLAISDLIFVIDFPFIIYNEFDQWIFGDFMCKVI 171
Query: 59 PWQMSAGCQMAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGW 118
+ GF S ++F + +++++R +AI + + K + + + W
Sbjct: 172 -----SASYYIGFFS------NMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISW 220
Query: 119 SFAILMAVLPLF----------------GISDYRKYATCLPFETTTGVWSLTYIV-FLMF 161
++++ F IS+Y+ T + + L I+ F +
Sbjct: 221 ILSLIITSPAYFIFEASNIIFSAQDSNETISNYQ--CTLIEDNEKNNISFLGRILQFEIN 278
Query: 162 ING--VAFLILMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWFPIALVALTA 219
I G + +I CY+K+ ++ Q S ++ +++++ L+CW P+ +V L A
Sbjct: 279 ILGMFIPIIIFAFCYIKIILKLK--QLKKSKKTKSIIIVSIIIICSLICWIPLNIVILFA 336
Query: 220 ---------------ICGVHLVSLEQAKIFTVFVLPLNSCCNPFLYAILTKQFKKDCVMI 264
ICG + L A + + + C NP +Y ++ + F+ +MI
Sbjct: 337 TMYSFKGFNSIISEHICGF--IKLGYAMMLAEAISLTHCCINPLIYTLIGENFRMHLLMI 394
Query: 265 CKTI 268
+ I
Sbjct: 395 FRNI 398
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 37.1 bits (86), Expect = 0.025
Identities = 41/244 (16%), Positives = 61/244 (25%), Gaps = 30/244 (12%)
Query: 300 DRSSREHFPHPQPCSCNLKLLGERMSKPRPHRPRWSATKKVRRFWDCLHLQRKNHRHIRT 359
R R P S G R + ++
Sbjct: 204 PRPPRRSSPISASASSPAPAPG-RSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAP 262
Query: 360 DQYAYQIAEIQQKQHKRASSVCSSENFS-SSRSDSWKQNHHHCGIPMRLLDPKRRASSWL 418
+I E + +S + S RS S + G RASS
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSG----PAPSSPRASS-- 316
Query: 419 VTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRS-SNSSDRS----KPRLTRQSAIVDD 473
+ +S++S+ SS+ + S S A + SRS S S R
Sbjct: 317 -SSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAP 375
Query: 474 PDLPNSPARLTVR----------------FLTTIPSAAETSVVNEEESFAILHPTPDASP 517
S R T R + + S A P +P
Sbjct: 376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
Query: 518 AGSP 521
+G P
Sbjct: 436 SGEP 439
Score = 29.4 bits (66), Expect = 6.4
Identities = 30/153 (19%), Positives = 44/153 (28%), Gaps = 32/153 (20%)
Query: 375 KRASSVCSSENFSSSRSDSWKQNHHHCGIPM----RLLDPKRRASSWLVTRKTSQDSNLS 430
+S S + SS +N C +P L AS W ++ S
Sbjct: 231 DAGASSSDSSSSESSGCGWGPENE--CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288
Query: 431 SSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQSAIVDDPDLPNSPARLTVRFLTT 490
SS + S S S S S + SS R+ + +S +
Sbjct: 289 SSPRERSPSP---SPSSPG-SGPAPSSPRASSSSSSSRESSSSSTSSSSESS-------- 336
Query: 491 IPSAAETSVVNEEESFAILHPTPDASPAGSPQR 523
A + P P S + SP R
Sbjct: 337 --------------RGAAVSPGPSPSRSPSPSR 355
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 36.1 bits (83), Expect = 0.045
Identities = 26/143 (18%), Positives = 46/143 (32%), Gaps = 4/143 (2%)
Query: 347 LHLQRKNHRHIRTDQYAYQIAEIQQKQH--KRASSVCSSENFSSSRSDSWKQNHHHCGIP 404
L+ + +Q I H K A S + + S+ + +
Sbjct: 251 PELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGS 310
Query: 405 MRLLDPKRRASSWLVTRKTS--QDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKP 462
A++ + K+S + + N S S ++A+ SV S S N S + +
Sbjct: 311 KETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRA 370
Query: 463 RLTRQSAIVDDPDLPNSPARLTV 485
+ A D A TV
Sbjct: 371 SKSSSGARDSKKDASGMSANGTV 393
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 35.0 bits (80), Expect = 0.11
Identities = 21/111 (18%), Positives = 37/111 (33%), Gaps = 10/111 (9%)
Query: 430 SSSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQS-AIVDDPDLPNSPARLTVRFL 488
SS N+ + TS+S I S K + ++ +I D P +P
Sbjct: 202 PSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKAPQRDVSISDRATTPIAPG-----VE 256
Query: 489 TTIPSAAETSVVNEEESFAILHP-TP---DASPAGSPQRKKKISLEEKQNS 535
+ S + T ++ D S G+P + +E +N
Sbjct: 257 SASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESEND 307
>gnl|CDD|238504 cd01031, EriC, ClC chloride channel EriC. This domain is found in
the EriC chloride transporters that mediate the extreme
acid resistance response in eubacteria and archaea. This
response allows bacteria to survive in the acidic
environments by decarboxylation-linked proton
utilization. As shown for Escherichia coli EriC, these
channels can counterbalance the electric current
produced by the outwardly directed virtual proton pump
linked to amino acid decarboxylation. The EriC proteins
belong to the ClC superfamily of chloride ion channels,
which share a unique double-barreled architecture and
voltage-dependent gating mechanism. The
voltage-dependent gating is conferred by the permeating
anion itself, acting as the gating charge. In
Escherichia coli EriC, a glutamate residue that
protrudes into the pore is thought to participate in
gating by binding to a Cl- ion site within the
selectivity filter.
Length = 402
Score = 34.4 bits (80), Expect = 0.14
Identities = 30/157 (19%), Positives = 56/157 (35%), Gaps = 26/157 (16%)
Query: 21 PRFLVCNLAA---ADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMSAGCQMAGFMSVLSA 77
P L+ L A ADF L+ GL V+ L + + + G +AG + L
Sbjct: 167 PLALLTALVASIAADFVSRLFFGLGPVLSIPPLPALPLKSYWLLLLLGI-IAGLLGYL-- 223
Query: 78 ELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGWSFAILMAVLPLFGISDYRK 137
F +++ + Y L + L +++ G L +LP +
Sbjct: 224 ----FNRSLLKSQDLYRKLK--KLPREL------RVLLPGLLIGPLGLLLPEALGGGHG- 270
Query: 138 YATCLPFETTTGVWSLTYIVFLMFINGVAFLILMGCY 174
L G +S++ ++ + + F+ M Y
Sbjct: 271 ----LILSLAGGNFSISLLLLIFVLR---FIFTMLSY 300
>gnl|CDD|184922 PRK14958, PRK14958, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 509
Score = 33.6 bits (76), Expect = 0.32
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 7/78 (8%)
Query: 458 DRSKPRLTRQSAIVDDPDLPNSPARLT------VRFLTTIPSAAETSVVNEEESFAILHP 511
DR +L Q ++ DL SP T +R L PSAA ++ E A++ P
Sbjct: 318 DREDVQLFYQIGLIGQRDLAYSPTPQTGFEMTLLRMLAFYPSAA-NAIPETESKEAVVTP 376
Query: 512 TPDASPAGSPQRKKKISL 529
TP +S PQ ++ L
Sbjct: 377 TPKSSETSWPQLLSQLKL 394
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 30.8 bits (70), Expect = 0.37
Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 5/66 (7%)
Query: 403 IPMRLLDPKRRASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKP 462
P+ L+ KR SS TS S SS +S S+++ S++ S SS+ S
Sbjct: 4 FPVNYLNMKRSGSS-----NTSVTSGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGT 58
Query: 463 RLTRQS 468
R+T +S
Sbjct: 59 RITTES 64
Score = 30.4 bits (69), Expect = 0.59
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 424 SQDSNLSSSRNDSSGSANTASTSVSRISRSSNSS 457
S S+ SS+ SS S++ S+S +RI+ S S
Sbjct: 34 SSSSSNSSNGGSSSSSSSGDSSSGTRITTESESD 67
>gnl|CDD|119052 pfam10532, Plant_all_beta, Plant specific N-all beta domain. This
domain was identified by Babu and colleagues. It is
found associated with the WRKY domain pfam03106.
Length = 114
Score = 29.8 bits (67), Expect = 1.1
Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 20/76 (26%)
Query: 460 SKPRLTRQSAIVDDPDLPNSPARLTVRFLTTIPSAAETSVVNEEESFAILHPTPDASPAG 519
SKP+ R++ IVDD D+ + +LT+ ++EE +LH +
Sbjct: 21 SKPK--RETYIVDDEDV--------LVYLTS---------LDEEGRRPVLH-VEVVNDLE 60
Query: 520 SPQRKKKISLEEKQNS 535
+R +++S E+++S
Sbjct: 61 ENERLEQVSRVERRSS 76
>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
Srx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srx is part of the Srg
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 275
Score = 31.0 bits (71), Expect = 1.3
Identities = 34/160 (21%), Positives = 62/160 (38%), Gaps = 18/160 (11%)
Query: 68 MAGFMSVLSAELSVFTLAVITLERNYAITHAMHLNKRLSLKHAGYIMVCGWSFAILMAVL 127
+ G + + E+S T +I+L R A+ K S+K+ I+ W AI+++ L
Sbjct: 69 IGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIITFIWIIAIIISTL 128
Query: 128 PLFGISDY----RKYATCLPFETTTGVWSLTYIVFLMFINGVAFLILMGC--YLKMYCAI 181
F + + + T E Y FL + V ++ ++K+
Sbjct: 129 FYFPLGCHFYYSPESLTWSFDEDPPCAEIGWYGDFLKNLVLVIITNIINVITFIKLRKFS 188
Query: 182 RGSQAWNSNDS--RIAKRM---------ALLVFTDLLCWF 210
+ S +S +S R K + LL DL+ +F
Sbjct: 189 KKSSL-SSEESKKRRKKEINFFKQTVIQDLLYLIDLINYF 227
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 31.3 bits (71), Expect = 1.4
Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 1/67 (1%)
Query: 268 IEESRVTRG-IGRCRHSSNFSNRQTPGNTNSIVDRSSREHFPHPQPCSCNLKLLGERMSK 326
I+ R RG GR + + P +S + P + K GE+ +
Sbjct: 387 IQNGRQQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRR 446
Query: 327 PRPHRPR 333
RP +P
Sbjct: 447 RRPRKPA 453
>gnl|CDD|215858 pfam00324, AA_permease, Amino acid permease.
Length = 473
Score = 31.2 bits (71), Expect = 1.5
Identities = 12/67 (17%), Positives = 23/67 (34%), Gaps = 3/67 (4%)
Query: 156 IVFLMFINGVAFLILMGCYLKMYCAIRGSQAWNSNDSRIAK---RMALLVFTDLLCWFPI 212
IVF + + L+ L +R +A+ I + + L F +L +
Sbjct: 357 IVFNFLLAISGLIGLIVWGLISLSHLRFRKAFKYQGRSIDELPFKACLGPFGVILGLAAL 416
Query: 213 ALVALTA 219
+ L
Sbjct: 417 ITILLIQ 423
>gnl|CDD|219123 pfam06653, Claudin_3, Tight junction protein, Claudin-like. This
is a family of probable membrane tight junction,
Claudin-like, proteins.
Length = 165
Score = 30.1 bits (68), Expect = 1.8
Identities = 23/121 (19%), Positives = 48/121 (39%), Gaps = 14/121 (11%)
Query: 104 RLSLKHAGYIMVCGWSFAILMAVLPLFGISDYRKYATC---LPFETTTGVWSLTYIVFLM 160
R+ LK G ++V + I+ P + +PF +T G W L ++M
Sbjct: 3 RVKLKALGVLVVISFILNIVGVFTPGWITDTGGFLEPSIGIVPFYSTEGGW-LVAASWMM 61
Query: 161 FINGVAFLILMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWFPIALVALTAI 220
+I+ FL+L+ Y+ ++ + S +++ L+ + + LT +
Sbjct: 62 YISFALFLVLILIYIVAVRKVK-----KNGYSHYQRKLFF-----LIALLSLLIAILTVV 111
Query: 221 C 221
Sbjct: 112 A 112
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srsx is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 257
Score = 29.9 bits (68), Expect = 2.7
Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 15/115 (13%)
Query: 140 TCLPFETTTGVWSLTYIVFLMFINGVAFLILMGCYLKMYCAIRG-SQAWNSNDSRIAKRM 198
C P G + + + IN V LI+ Y+ + + Q NS+ ++ K +
Sbjct: 134 VCAPPLALNGTAFRIFTLSSVIIN-VIVLIV---YIILIIIFKKKGQTTNSDSKKVFKSL 189
Query: 199 ALLVFTDLLCWFPIALVALTAICGVHLV-SLEQAKIFTV----FVLPLNSCCNPF 248
+ V + WF + + L E I + FV S F
Sbjct: 190 QVTVIIFIFGWF---TSTIANTVFLLLTEDGEVENIIQMYAGIFVNL--SFSQNF 239
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 30.6 bits (69), Expect = 2.8
Identities = 30/152 (19%), Positives = 47/152 (30%), Gaps = 7/152 (4%)
Query: 387 SSSRSDSWKQNHHHCGIPMRLLDPKRRASSW----LVTRKTSQDSNLSSSRNDSS--GSA 440
S+ R SW + I R S+ L R ++ + S S G
Sbjct: 139 SAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSELNLTDRTPAKVRSSRSELGAPSPSGGT 198
Query: 441 NTASTSVSRISRSSNSSDRSKPRLTRQSAIVDDPDLPNSPARLTVRFLTTIPSAAETSVV 500
+ S+S R S + R L ++ A++ P P S + S++ S
Sbjct: 199 SCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRA-RSSSAKSPF 257
Query: 501 NEEESFAILHPTPDASPAGSPQRKKKISLEEK 532
S SP+ K S E
Sbjct: 258 KSSIQRKATKALSKLSLRASPKDTSKSSKSEV 289
>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
transduction mechanisms].
Length = 612
Score = 30.3 bits (68), Expect = 2.9
Identities = 27/105 (25%), Positives = 42/105 (40%), Gaps = 6/105 (5%)
Query: 426 DSNLSSSR--NDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQSAIVD-DPDLPNSPAR 482
N + S N S+ + T S V SRS+N + + T + + +L +P+
Sbjct: 113 PPNHTRSSGNNLSNANVKTLSAPVGEHSRSNNPPNLDQNLDTEPESSISQWGELQLNPSG 172
Query: 483 LTVRFLTTIPSAAETSVVNEEESFAILHPTPDASPAGSPQRKKKI 527
T L++ PS TS + ES PT SP K+
Sbjct: 173 KT---LSSQPSRKPTSENPKSESDNSKLPTSVNSPLPDKSLLKRT 214
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 30.1 bits (68), Expect = 3.1
Identities = 19/73 (26%), Positives = 29/73 (39%), Gaps = 20/73 (27%)
Query: 416 SWLVTRKTS-------------------QDSNLSSSRNDSSGSANTASTSVSRISRSSNS 456
SWLV R++S + S+ +SR SG + T S ++SR S S
Sbjct: 146 SWLVKRESSTSLVSEDEEDFEREAAREREHSSRRASRRGRSGYS-TPSAALSRRGSRSAS 204
Query: 457 SDRSKPRLTRQSA 469
S+ L+
Sbjct: 205 RRGSRADLSMTPL 217
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 30.2 bits (68), Expect = 3.4
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 12/89 (13%)
Query: 372 KQHKRASSVCSSENFSSSRSD-SWKQNHHHCGIPMRLLDPKRRASSWLVTRKTSQDSNLS 430
H R SS + S+ S S ++N H M +++ L+ R +
Sbjct: 194 SPHFRQKF--SSSDTSNGFSYPSIRKNSRHSSNSMPSFP--HSSTAVLLKRHSGSSGA-- 247
Query: 431 SSRNDSSGSANTASTSVSRISRSSNSSDR 459
S S+N +S + + S++S
Sbjct: 248 -----SLISSNITPSSSNSEAMSTSSKRP 271
>gnl|CDD|219068 pfam06511, IpaD, Invasion plasmid antigen IpaD. This family
consists of several invasion plasmid antigen IpaD
proteins. Entry of Shigella flexneri into epithelial
cells and lysis of the phagosome involve the IpaB, IpaC,
and IpaD proteins, which are secreted by type III
secretion machinery.
Length = 337
Score = 30.0 bits (67), Expect = 3.6
Identities = 16/119 (13%), Positives = 43/119 (36%), Gaps = 6/119 (5%)
Query: 409 DPKRRASSWLVTRKTSQDSNLSSSRN-DSSGSANTASTSVSRISRSSNSSDRSKPRLTRQ 467
+ + ++ ++N SS+ ++ T++ ++ I S+ + + + T
Sbjct: 2 NITTSTNPISTASRSPNNTNGSSAETVNADVKTTTSNDTLHNIRTSNAALKKELSQQTLT 61
Query: 468 SAIVDDPDLPNSPARLTVRFLTTIPSA-AETSVVNEEESFAILHPTPDASP----AGSP 521
+++ +S + V + + A++ E +LH A G P
Sbjct: 62 KTPLEEIAEHSSQITMDVNKSAQLLNILAKSGYPISAEQRELLHSGLSAPKEAELDGYP 120
>gnl|CDD|129861 TIGR00779, cad, cadmium resistance transporter (or sequestration)
family protein. These proteins are members of the
Cadmium Resistance (CadD) Family (TC 2.A.77). To date,
this family of proteins has only been found in
Gram-positive bacteria. The CadD family includes
several closely related Staphylococcal proteins
reported to function in cadmium resistance. Members are
predicted to span the membrane five times; the
mechanism of resistance is believed to be export but
has also been suggested to be binding and sequestration
in the membrane. Closely related but outside the scope
of this model is another staphylococcal protein that
has been reported to possibly function in quaternary
ammonium ion export. Still more distant are other
members of the broader LysE family (see Vrljic. et al,
PMID:10943564) [Transport and binding proteins, Amino
acids, peptides and amines].
Length = 193
Score = 29.4 bits (66), Expect = 3.9
Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 9/59 (15%)
Query: 7 VVFVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMSAG 65
V+ ++ FAR+K + D ++G YLG + ++ S L F + IP + G
Sbjct: 10 VILLIFFARAKRKEYK---------DIYIGQYLGSIILILVSLLLAFGVNLIPEKWVLG 59
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 29.7 bits (67), Expect = 4.5
Identities = 31/114 (27%), Positives = 45/114 (39%), Gaps = 12/114 (10%)
Query: 414 ASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSR-ISRSSNSSDRSKPRLTRQSAIVD 472
+SS +R S S+ SSS + SS ++T+ R S SS S L + S+
Sbjct: 2 SSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSP 61
Query: 473 DPDLPNSPARLTVRFLTTIPSAA-------ETSVVNEEESFAILHPTPD-ASPA 518
A L + AA E+++V+ E HP P A PA
Sbjct: 62 PSVTLPPAATTQTPQLNPLQRAAAAALDAVESALVSHLER---QHPLPKTADPA 112
>gnl|CDD|218145 pfam04553, Tis11B_N, Tis11B like protein, N terminus. Members of
this family always contain a tandem repeat of CCCH zinc
fingers pfam00642. Tis11B, Tis11D and their homologues
are thought to be regulatory proteins involved in the
response to growth factors. The function of the N
terminus is unknown.
Length = 109
Score = 27.8 bits (61), Expect = 5.3
Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Query: 412 RRASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRS 460
RR S ++ K +Q+ L+S + D SG ++TA + ++ + DRS
Sbjct: 49 RRHSVTNLSSKFNQNQFLNSLKMDPSGVSSTALS-----NKENRFRDRS 92
>gnl|CDD|223025 PHA03253, PHA03253, UL35; Provisional.
Length = 609
Score = 29.1 bits (65), Expect = 6.3
Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 8/52 (15%)
Query: 419 VTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKPR-LTRQSA 469
V + + + SSSR S S +T S + S RS P + RQ
Sbjct: 455 VIERLNVNEGRSSSRASPSHSTSTIPYSPPQ-------SGRSTPTSILRQRT 499
>gnl|CDD|218666 pfam05633, DUF793, Protein of unknown function (DUF793). This
family consists of several plant proteins of unknown
function.
Length = 389
Score = 29.0 bits (65), Expect = 6.6
Identities = 20/76 (26%), Positives = 29/76 (38%), Gaps = 10/76 (13%)
Query: 378 SSVCSSENFSSSRSDSWKQNHHHCGIPMRLLDPKRRASSWLVTRKTSQD-------SNLS 430
S N S RS W +H G R+ SW V+R S +NL+
Sbjct: 175 GSGGGHRNRSFGRS--WSFHHRSIGSSGGGTGGHLRSLSWSVSRNWSAAKQLQAMGANLA 232
Query: 431 SSR-NDSSGSANTAST 445
+ R ND++ + A
Sbjct: 233 TPRGNDAAATNGLAVP 248
>gnl|CDD|223469 COG0392, COG0392, Predicted integral membrane protein [Function
unknown].
Length = 322
Score = 28.9 bits (65), Expect = 6.6
Identities = 16/111 (14%), Positives = 29/111 (26%), Gaps = 3/111 (2%)
Query: 100 HLNKRLSLKHAGYIMVCGWSFAILMAVLPLFGISDYRKYATCLPFETTTGVWSLTYIVFL 159
L +LSL + + + L + P S + + FL
Sbjct: 79 SLGVKLSLWK---VALASFIAYFLNNITPAALGSGPAVRYRLYSRLGLSVALVSRILAFL 135
Query: 160 MFINGVAFLILMGCYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWF 210
+ G+ L+L L + LL+ +L
Sbjct: 136 LLTFGLGLLLLFLLILLFVPLKLALFLLLLTLLLRILLILLLLAAAILLIL 186
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 29.1 bits (65), Expect = 6.7
Identities = 28/138 (20%), Positives = 43/138 (31%), Gaps = 20/138 (14%)
Query: 371 QKQHKRASSVCSSENFSSSRSDSWKQNHHHCGI------------------PMRLLDPKR 412
++ K S E SSS S S +HHH P L
Sbjct: 756 EEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLPSPSSPLSSSS 815
Query: 413 RASSWLVTRKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTRQSAIVD 472
++ + + SSS S S+ +S+ S S+ + K T +
Sbjct: 816 PKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEG--KGSSTSKEHKGS 873
Query: 473 DPDLPNSPARLTVRFLTT 490
D PN + V L+
Sbjct: 874 SGDTPNKASSFPVPPLSN 891
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been
characterized: one with RAB1A that has been activated by
GTP-binding and another with isoform CASP of CUTL1.
Length = 509
Score = 29.0 bits (65), Expect = 7.0
Identities = 18/80 (22%), Positives = 25/80 (31%), Gaps = 10/80 (12%)
Query: 408 LDPKRRASSWLVT-RKTSQDSNLSSSRNDSSGSANTASTSVSRISRSSNSSDRSKPRLTR 466
+ ASS R + + SR S S SRS +S L
Sbjct: 45 WSVRETASSNKARSRSEKWNPDQPGSRVSSPSS------KKDGTSRSLSSQVDD---LAS 95
Query: 467 QSAIVDDPDLPNSPARLTVR 486
+ DL + A L +R
Sbjct: 96 AVSSQSSSDLEDELAALKIR 115
>gnl|CDD|224011 COG1086, COG1086, Predicted nucleoside-diphosphate sugar
epimerases [Cell envelope biogenesis, outer membrane /
Carbohydrate transport and metabolism].
Length = 588
Score = 29.2 bits (66), Expect = 7.4
Identities = 13/84 (15%), Positives = 26/84 (30%), Gaps = 10/84 (11%)
Query: 8 VFVLIFARSKIDVPRFLVCNLAAADFFMGLYLGLLAVVDASTLGEFRMYAIPWQMSAGCQ 67
+ +++ A + L + L+ G + F +Y W+ S
Sbjct: 7 LLLVLVA--------LALALLLRDELSASLWSGAQLLT-ICVFILFGLYRAVWRASVSDL 57
Query: 68 MAGFMSVLSAELSVFTLAVITLER 91
+VL++ V L T
Sbjct: 58 FIILKAVLTS-ALVSLLFFSTRTD 80
>gnl|CDD|224198 COG1279, COG1279, Lysine efflux permease [General function
prediction only].
Length = 202
Score = 28.0 bits (63), Expect = 9.3
Identities = 17/98 (17%), Positives = 36/98 (36%), Gaps = 16/98 (16%)
Query: 113 IMVCGWSFAILMAVLPLFGISDYRKYATCLPFETTTGVWSLTYIVFLMFINGVAFLILMG 172
++C S +L++ +FG+ + +++ ++ G AFL+ G
Sbjct: 40 ALLCAISDIVLIS-AGVFGVGAL--------------IAKSPWLLLIVRWGGAAFLLYYG 84
Query: 173 CYLKMYCAIRGSQAWNSNDSRIAKRMALLVFTDLLCWF 210
L + A RG + K +L+F +
Sbjct: 85 -LLALKSAPRGPSQLQVAEFTKLKLKKVLLFALAVTLL 121
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 28.7 bits (64), Expect = 9.6
Identities = 26/116 (22%), Positives = 41/116 (35%), Gaps = 17/116 (14%)
Query: 422 KTSQDSNLSSSRNDSSGS-----ANTASTSVSRISRSSNSSDRSKPRLTRQ--------S 468
+ +Q+S + + S N +SVS+++ + S + R + Q S
Sbjct: 592 RNTQNSLQIDTLDHESSEQPLEEKNALHSSVSKLNTEESPSKMVEIRPSCQDSVSESGVS 651
Query: 469 AIVDDPDLPNSPARLTVRFLTTIPSAAETSVVNEEESFAILHPTPDASPAGSPQRK 524
V D N V S + S+V E S + PT SP RK
Sbjct: 652 TGVSVADESNDSENELVNCA----SPSSLSIVPVEVSPVLKSPTLSVSPRIRNSRK 703
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 28.8 bits (64), Expect = 9.7
Identities = 21/106 (19%), Positives = 44/106 (41%), Gaps = 3/106 (2%)
Query: 369 IQQKQHKRASSVCSSENFSSSRSDSWKQNHHHCGIPMRLLDPKRRASSWLVTRKTSQDSN 428
++++ K+ ++ ++ D + + P++ D K + DS+
Sbjct: 66 RKEEKKKKNLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSS 125
Query: 429 LSSSRNDSSGSANTASTSVSRISRSSNSSD-RSKPRLTR--QSAIV 471
SSS + S + + S+ S S SNS + P++ + QS V
Sbjct: 126 SSSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIKENQSTHV 171
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 28.6 bits (64), Expect = 9.9
Identities = 13/68 (19%), Positives = 26/68 (38%), Gaps = 1/68 (1%)
Query: 401 CGIPMRLLDPKRRASSWLVTRKTSQDSNLSSSRNDSSGS-ANTASTSVSRISRSSNSSDR 459
M++ K++ + L + K + L + +S S + + ++ SS R
Sbjct: 231 LKPSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSR 290
Query: 460 SKPRLTRQ 467
P L R
Sbjct: 291 RTPSLLRS 298
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.133 0.409
Gapped
Lambda K H
0.267 0.0690 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,227,526
Number of extensions: 2458239
Number of successful extensions: 3017
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2975
Number of HSP's successfully gapped: 67
Length of query: 537
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 435
Effective length of database: 6,413,494
Effective search space: 2789869890
Effective search space used: 2789869890
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.4 bits)