RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780722|ref|YP_003065135.1| pilus assembly protein [Candidatus Liberibacter asiaticus str. psy62] (187 letters) >gnl|CDD|34572 COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]. Length = 309 Score = 190 bits (484), Expect = 2e-49 Identities = 87/187 (46%), Positives = 127/187 (67%) Query: 1 MKHLIKKRTAKFLDDFPNALDIIVRSVRAGLPVSDAVAVIVGQSSDPVRSEFRRVIETQH 60 ++ +R +F P ALD+IVR++RAG P+ DA+ + ++ +P+ +EF + + Q Sbjct: 123 LRSRRARRLKRFGQQLPEALDLIVRALRAGAPLPDALRLAAKETPEPLGTEFTLITDRQQ 182 Query: 61 LGLSLSESISRMVRYMPLQEVSFFSTVISVQSQLGGNLSEALANLSRILRDRKNMKAKVQ 120 LG+ L ++ RM PL+E+ F + VIS+QS+ GGNLSE L NLSR++R+RK MKAKV+ Sbjct: 183 LGIDLPAALLRMYERYPLEELYFLAIVISIQSRHGGNLSELLDNLSRVIRERKKMKAKVR 242 Query: 121 ALSMEAKASAWIIGSLPFCVSTLVYFTSPGYMNVLINDPRGHMLLGVAAAFMLIGIMVMR 180 ALS EA+ SAWI+G+LP V L+Y SP Y++ L DP G LL +A + LIGI++MR Sbjct: 243 ALSAEARMSAWILGALPLLVGLLIYLLSPDYLSFLWTDPTGRFLLVASAGWELIGILIMR 302 Query: 181 LMINFDV 187 M+NFD Sbjct: 303 KMLNFDF 309 >gnl|CDD|144175 pfam00482, GSPII_F, Bacterial type II secretion system protein F domain. The original family covered both the regions found by the current model. The splitting of the family has allowed the related FlaJ_arch (archaeal FlaJ family) to be merged with it. Proteins with this domain in form a platform for the type II secretion machinery, as well as the type IV pili and the archaeal flagellae. This domain seems to show some similarity to pfam00664, but this may just be due to similarities in the TM helices (personal obs: C Yeats). Length = 125 Score = 60.0 bits (146), Expect = 4e-10 Identities = 31/128 (24%), Positives = 62/128 (48%), Gaps = 5/128 (3%) Query: 19 ALDIIVRSVRAGLPVSDAVAVIVGQSSDP-VRSEFRRVIETQHLGLSLSESISRMVRYMP 77 L + +RAGLP+ +A+ ++ ++ + +R E +R+ E G SLSE+++R P Sbjct: 1 FLRQLATLLRAGLPLLEALEILAEEAENGPLREELKRIRERLREGGSLSEALART----P 56 Query: 78 LQEVSFFSTVISVQSQLGGNLSEALANLSRILRDRKNMKAKVQALSMEAKASAWIIGSLP 137 L E + + GGNL+E L L+ L +R+ ++ K+ A + + + Sbjct: 57 LSEFPPLLVALIAAGESGGNLAEVLERLADYLEERRELRRKILAALLYPLILLVVALLVL 116 Query: 138 FCVSTLVY 145 + ++ Sbjct: 117 LILLAILL 124 >gnl|CDD|32247 COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and secretion / Intracellular trafficking and secretion]. Length = 320 Score = 40.6 bits (94), Expect = 2e-04 Identities = 34/157 (21%), Positives = 65/157 (41%), Gaps = 5/157 (3%) Query: 2 KHLIKKRTAKFLDDFPNALDIIVRSVRAGLPVSDAVAVI--VGQSSDPVRSEFRRVIETQ 59 +KKR + + P+ L ++ + AGL ++DA+ + + E R Sbjct: 161 SLALKKRLKEIARELPDFLRLMAVCLEAGLSLADALKRVADELYGQRILAEELARTTAEL 220 Query: 60 HLGLSLSESISRMVRYMPLQEVSFFSTVISVQSQLGGNLSEALANLSRILRDRKNMKAKV 119 LGLS+ E++ R+ + EV ++++ + GG+L++AL LS L +K Sbjct: 221 SLGLSIEEALIRLAVRLGSDEVKRVVSLLTQALESGGSLADALRVLSMELSRETRLKL-- 278 Query: 120 QALSMEAKASAWIIGSLPFCVSTLVYFTSPGYMNVLI 156 K I+ L + ++ F + I Sbjct: 279 -GEIRAGKIGVGILLPLILVIIPVLLFVVILPAILGI 314 >gnl|CDD|31648 COG1459, PulF, Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion]. Length = 397 Score = 36.7 bits (85), Expect = 0.004 Identities = 22/95 (23%), Positives = 50/95 (52%), Gaps = 6/95 (6%) Query: 27 VRAGLPVSDAVAVIVGQSSDPVRSEFRRVIETQHL-GLSLSESISRMVRYMPLQEVSFFS 85 ++AGLP+ +A+ ++ Q+ +P + I + G SLSE+++++ P + Sbjct: 71 LKAGLPLYEALEILADQAPNPKLKQVLTSILEELESGKSLSEALAQLPGVFP----DLYV 126 Query: 86 TVISVQSQLGGNLSEALANLSRILRDRKNMKAKVQ 120 +++ + GNL E L L++ L + ++ K++ Sbjct: 127 AMVAA-GERSGNLDEVLQRLAKYLEKQAALRKKIK 160 >gnl|CDD|31879 COG1693, COG1693, Uncharacterized protein conserved in archaea [Function unknown]. Length = 325 Score = 28.7 bits (64), Expect = 1.1 Identities = 12/31 (38%), Positives = 18/31 (58%) Query: 14 DDFPNALDIIVRSVRAGLPVSDAVAVIVGQS 44 DF + LDI+ + +GL +S V +I G S Sbjct: 118 SDFEDVLDIVKEVISSGLSISPRVKIIEGDS 148 >gnl|CDD|145269 pfam01995, DUF128, Domain of unknown function DUF128. This archaebacterial protein family has no known function. The domain is found duplicated in a protein from Methanothermobacter thermautotrophicus str. Delta H. Many of these are attached to an N-terminal winged helix domain suggesting these are transcriptional regulators and that this domain has a ligand binding function. Length = 236 Score = 28.3 bits (64), Expect = 1.4 Identities = 12/34 (35%), Positives = 16/34 (47%) Query: 14 DDFPNALDIIVRSVRAGLPVSDAVAVIVGQSSDP 47 DD AL+I+ AGL +SD V + P Sbjct: 36 DDLEEALEILKEVYEAGLSISDRVKIEDEGEEVP 69 >gnl|CDD|38778 KOG3570, KOG3570, KOG3570, MAPK-activating protein DENN [Signal transduction mechanisms]. Length = 1588 Score = 27.0 bits (59), Expect = 2.9 Identities = 8/42 (19%), Positives = 24/42 (57%) Query: 36 AVAVIVGQSSDPVRSEFRRVIETQHLGLSLSESISRMVRYMP 77 + +++ + + +R + RR++ H+GL S+ I+ ++ + Sbjct: 1321 SYMLLMKVNKNDIRKKVRRLMGKSHIGLVYSQQINEVLDQLA 1362 >gnl|CDD|31400 COG1207, GlmU, N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane]. Length = 460 Score = 27.1 bits (60), Expect = 3.2 Identities = 12/52 (23%), Positives = 23/52 (44%) Query: 94 LGGNLSEALANLSRILRDRKNMKAKVQALSMEAKASAWIIGSLPFCVSTLVY 145 LG N LA RI++ R K + +++ A+ +I G + ++ Sbjct: 227 LGVNDRVQLAEAERIMQRRIAEKLMLAGVTLIDPATTYIRGDVEIGRDVVIE 278 >gnl|CDD|143812 pfam00022, Actin, Actin. Length = 369 Score = 26.4 bits (59), Expect = 4.4 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 113 KNMKAKVQALSMEAKASAWIIGS 135 +K K+ A E K SAWI GS Sbjct: 316 SGVKVKIIAPPNERKYSAWIGGS 338 >gnl|CDD|39873 KOG4674, KOG4674, KOG4674, Uncharacterized conserved coiled-coil protein [Function unknown]. Length = 1822 Score = 26.5 bits (58), Expect = 4.8 Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 5/93 (5%) Query: 23 IVRSVRAGLPVSDAVAVIVGQSSDPVRSEFRRVIETQHLGLSLSESISRMVRYMPLQEVS 82 I S + SD+ +I S V EF + E Q + L ++ + + E + Sbjct: 507 ITVSSDSTENESDSEEII---SERLV--EFSNINELQEKNVELLNAVRELAEKLEAAEKT 561 Query: 83 FFSTVISVQSQLGGNLSEALANLSRILRDRKNM 115 T+ ++ + SE +A L + L +++ Sbjct: 562 QDKTLQNILKETINEASEKIAELEKELEEQEQR 594 >gnl|CDD|34025 COG4303, EutB, Ethanolamine ammonia-lyase, large subunit [Amino acid transport and metabolism]. Length = 453 Score = 25.7 bits (56), Expect = 8.7 Identities = 11/34 (32%), Positives = 17/34 (50%) Query: 119 VQALSMEAKASAWIIGSLPFCVSTLVYFTSPGYM 152 ++MEA+ PF V+T+V F P Y+ Sbjct: 302 ADQVTMEARCYGLARHFDPFLVNTVVGFIGPEYL 335 >gnl|CDD|35685 KOG0464, KOG0464, KOG0464, Elongation factor G [Translation, ribosomal structure and biogenesis]. Length = 753 Score = 25.5 bits (55), Expect = 9.4 Identities = 9/69 (13%), Positives = 24/69 (34%), Gaps = 1/69 (1%) Query: 47 PVRSEF-RRVIETQHLGLSLSESISRMVRYMPLQEVSFFSTVISVQSQLGGNLSEALANL 105 P+ ++ +R + + I R+ ++PL E+ S + + + + Sbjct: 675 PILADLAQRRAHFEEIDAREDNEIRRICAFLPLAEIEGLSKTLRTLTSGFADFALEFRGY 734 Query: 106 SRILRDRKN 114 + K Sbjct: 735 QAMNEHDKM 743 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.325 0.136 0.380 Gapped Lambda K H 0.267 0.0788 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 2,140,707 Number of extensions: 105403 Number of successful extensions: 412 Number of sequences better than 10.0: 1 Number of HSP's gapped: 408 Number of HSP's successfully gapped: 31 Length of query: 187 Length of database: 6,263,737 Length adjustment: 88 Effective length of query: 99 Effective length of database: 4,362,145 Effective search space: 431852355 Effective search space used: 431852355 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 54 (24.5 bits)