RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254780722|ref|YP_003065135.1| pilus assembly protein
[Candidatus Liberibacter asiaticus str. psy62]
(187 letters)
>gnl|CDD|34572 COG4965, TadB, Flp pilus assembly protein TadB [Intracellular
trafficking and secretion].
Length = 309
Score = 190 bits (484), Expect = 2e-49
Identities = 87/187 (46%), Positives = 127/187 (67%)
Query: 1 MKHLIKKRTAKFLDDFPNALDIIVRSVRAGLPVSDAVAVIVGQSSDPVRSEFRRVIETQH 60
++ +R +F P ALD+IVR++RAG P+ DA+ + ++ +P+ +EF + + Q
Sbjct: 123 LRSRRARRLKRFGQQLPEALDLIVRALRAGAPLPDALRLAAKETPEPLGTEFTLITDRQQ 182
Query: 61 LGLSLSESISRMVRYMPLQEVSFFSTVISVQSQLGGNLSEALANLSRILRDRKNMKAKVQ 120
LG+ L ++ RM PL+E+ F + VIS+QS+ GGNLSE L NLSR++R+RK MKAKV+
Sbjct: 183 LGIDLPAALLRMYERYPLEELYFLAIVISIQSRHGGNLSELLDNLSRVIRERKKMKAKVR 242
Query: 121 ALSMEAKASAWIIGSLPFCVSTLVYFTSPGYMNVLINDPRGHMLLGVAAAFMLIGIMVMR 180
ALS EA+ SAWI+G+LP V L+Y SP Y++ L DP G LL +A + LIGI++MR
Sbjct: 243 ALSAEARMSAWILGALPLLVGLLIYLLSPDYLSFLWTDPTGRFLLVASAGWELIGILIMR 302
Query: 181 LMINFDV 187
M+NFD
Sbjct: 303 KMLNFDF 309
>gnl|CDD|144175 pfam00482, GSPII_F, Bacterial type II secretion system protein F
domain. The original family covered both the regions
found by the current model. The splitting of the family
has allowed the related FlaJ_arch (archaeal FlaJ family)
to be merged with it. Proteins with this domain in form
a platform for the type II secretion machinery, as well
as the type IV pili and the archaeal flagellae. This
domain seems to show some similarity to pfam00664, but
this may just be due to similarities in the TM helices
(personal obs: C Yeats).
Length = 125
Score = 60.0 bits (146), Expect = 4e-10
Identities = 31/128 (24%), Positives = 62/128 (48%), Gaps = 5/128 (3%)
Query: 19 ALDIIVRSVRAGLPVSDAVAVIVGQSSDP-VRSEFRRVIETQHLGLSLSESISRMVRYMP 77
L + +RAGLP+ +A+ ++ ++ + +R E +R+ E G SLSE+++R P
Sbjct: 1 FLRQLATLLRAGLPLLEALEILAEEAENGPLREELKRIRERLREGGSLSEALART----P 56
Query: 78 LQEVSFFSTVISVQSQLGGNLSEALANLSRILRDRKNMKAKVQALSMEAKASAWIIGSLP 137
L E + + GGNL+E L L+ L +R+ ++ K+ A + + +
Sbjct: 57 LSEFPPLLVALIAAGESGGNLAEVLERLADYLEERRELRRKILAALLYPLILLVVALLVL 116
Query: 138 FCVSTLVY 145
+ ++
Sbjct: 117 LILLAILL 124
>gnl|CDD|32247 COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 320
Score = 40.6 bits (94), Expect = 2e-04
Identities = 34/157 (21%), Positives = 65/157 (41%), Gaps = 5/157 (3%)
Query: 2 KHLIKKRTAKFLDDFPNALDIIVRSVRAGLPVSDAVAVI--VGQSSDPVRSEFRRVIETQ 59
+KKR + + P+ L ++ + AGL ++DA+ + + E R
Sbjct: 161 SLALKKRLKEIARELPDFLRLMAVCLEAGLSLADALKRVADELYGQRILAEELARTTAEL 220
Query: 60 HLGLSLSESISRMVRYMPLQEVSFFSTVISVQSQLGGNLSEALANLSRILRDRKNMKAKV 119
LGLS+ E++ R+ + EV ++++ + GG+L++AL LS L +K
Sbjct: 221 SLGLSIEEALIRLAVRLGSDEVKRVVSLLTQALESGGSLADALRVLSMELSRETRLKL-- 278
Query: 120 QALSMEAKASAWIIGSLPFCVSTLVYFTSPGYMNVLI 156
K I+ L + ++ F + I
Sbjct: 279 -GEIRAGKIGVGILLPLILVIIPVLLFVVILPAILGI 314
>gnl|CDD|31648 COG1459, PulF, Type II secretory pathway, component PulF [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 397
Score = 36.7 bits (85), Expect = 0.004
Identities = 22/95 (23%), Positives = 50/95 (52%), Gaps = 6/95 (6%)
Query: 27 VRAGLPVSDAVAVIVGQSSDPVRSEFRRVIETQHL-GLSLSESISRMVRYMPLQEVSFFS 85
++AGLP+ +A+ ++ Q+ +P + I + G SLSE+++++ P +
Sbjct: 71 LKAGLPLYEALEILADQAPNPKLKQVLTSILEELESGKSLSEALAQLPGVFP----DLYV 126
Query: 86 TVISVQSQLGGNLSEALANLSRILRDRKNMKAKVQ 120
+++ + GNL E L L++ L + ++ K++
Sbjct: 127 AMVAA-GERSGNLDEVLQRLAKYLEKQAALRKKIK 160
>gnl|CDD|31879 COG1693, COG1693, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 325
Score = 28.7 bits (64), Expect = 1.1
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 14 DDFPNALDIIVRSVRAGLPVSDAVAVIVGQS 44
DF + LDI+ + +GL +S V +I G S
Sbjct: 118 SDFEDVLDIVKEVISSGLSISPRVKIIEGDS 148
>gnl|CDD|145269 pfam01995, DUF128, Domain of unknown function DUF128. This
archaebacterial protein family has no known function.
The domain is found duplicated in a protein from
Methanothermobacter thermautotrophicus str. Delta H.
Many of these are attached to an N-terminal winged
helix domain suggesting these are transcriptional
regulators and that this domain has a ligand binding
function.
Length = 236
Score = 28.3 bits (64), Expect = 1.4
Identities = 12/34 (35%), Positives = 16/34 (47%)
Query: 14 DDFPNALDIIVRSVRAGLPVSDAVAVIVGQSSDP 47
DD AL+I+ AGL +SD V + P
Sbjct: 36 DDLEEALEILKEVYEAGLSISDRVKIEDEGEEVP 69
>gnl|CDD|38778 KOG3570, KOG3570, KOG3570, MAPK-activating protein DENN [Signal
transduction mechanisms].
Length = 1588
Score = 27.0 bits (59), Expect = 2.9
Identities = 8/42 (19%), Positives = 24/42 (57%)
Query: 36 AVAVIVGQSSDPVRSEFRRVIETQHLGLSLSESISRMVRYMP 77
+ +++ + + +R + RR++ H+GL S+ I+ ++ +
Sbjct: 1321 SYMLLMKVNKNDIRKKVRRLMGKSHIGLVYSQQINEVLDQLA 1362
>gnl|CDD|31400 COG1207, GlmU, N-acetylglucosamine-1-phosphate uridyltransferase
(contains nucleotidyltransferase and I-patch
acetyltransferase domains) [Cell envelope biogenesis,
outer membrane].
Length = 460
Score = 27.1 bits (60), Expect = 3.2
Identities = 12/52 (23%), Positives = 23/52 (44%)
Query: 94 LGGNLSEALANLSRILRDRKNMKAKVQALSMEAKASAWIIGSLPFCVSTLVY 145
LG N LA RI++ R K + +++ A+ +I G + ++
Sbjct: 227 LGVNDRVQLAEAERIMQRRIAEKLMLAGVTLIDPATTYIRGDVEIGRDVVIE 278
>gnl|CDD|143812 pfam00022, Actin, Actin.
Length = 369
Score = 26.4 bits (59), Expect = 4.4
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 113 KNMKAKVQALSMEAKASAWIIGS 135
+K K+ A E K SAWI GS
Sbjct: 316 SGVKVKIIAPPNERKYSAWIGGS 338
>gnl|CDD|39873 KOG4674, KOG4674, KOG4674, Uncharacterized conserved coiled-coil
protein [Function unknown].
Length = 1822
Score = 26.5 bits (58), Expect = 4.8
Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 5/93 (5%)
Query: 23 IVRSVRAGLPVSDAVAVIVGQSSDPVRSEFRRVIETQHLGLSLSESISRMVRYMPLQEVS 82
I S + SD+ +I S V EF + E Q + L ++ + + E +
Sbjct: 507 ITVSSDSTENESDSEEII---SERLV--EFSNINELQEKNVELLNAVRELAEKLEAAEKT 561
Query: 83 FFSTVISVQSQLGGNLSEALANLSRILRDRKNM 115
T+ ++ + SE +A L + L +++
Sbjct: 562 QDKTLQNILKETINEASEKIAELEKELEEQEQR 594
>gnl|CDD|34025 COG4303, EutB, Ethanolamine ammonia-lyase, large subunit [Amino
acid transport and metabolism].
Length = 453
Score = 25.7 bits (56), Expect = 8.7
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 119 VQALSMEAKASAWIIGSLPFCVSTLVYFTSPGYM 152
++MEA+ PF V+T+V F P Y+
Sbjct: 302 ADQVTMEARCYGLARHFDPFLVNTVVGFIGPEYL 335
>gnl|CDD|35685 KOG0464, KOG0464, KOG0464, Elongation factor G [Translation,
ribosomal structure and biogenesis].
Length = 753
Score = 25.5 bits (55), Expect = 9.4
Identities = 9/69 (13%), Positives = 24/69 (34%), Gaps = 1/69 (1%)
Query: 47 PVRSEF-RRVIETQHLGLSLSESISRMVRYMPLQEVSFFSTVISVQSQLGGNLSEALANL 105
P+ ++ +R + + I R+ ++PL E+ S + + + +
Sbjct: 675 PILADLAQRRAHFEEIDAREDNEIRRICAFLPLAEIEGLSKTLRTLTSGFADFALEFRGY 734
Query: 106 SRILRDRKN 114
+ K
Sbjct: 735 QAMNEHDKM 743
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.325 0.136 0.380
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 2,140,707
Number of extensions: 105403
Number of successful extensions: 412
Number of sequences better than 10.0: 1
Number of HSP's gapped: 408
Number of HSP's successfully gapped: 31
Length of query: 187
Length of database: 6,263,737
Length adjustment: 88
Effective length of query: 99
Effective length of database: 4,362,145
Effective search space: 431852355
Effective search space used: 431852355
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 54 (24.5 bits)