RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254780721|ref|YP_003065134.1| pilus component protein
[Candidatus Liberibacter asiaticus str. psy62]
(329 letters)
>gnl|CDD|32247 COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 320
Score = 115 bits (288), Expect = 2e-26
Identities = 98/322 (30%), Positives = 148/322 (45%), Gaps = 5/322 (1%)
Query: 8 LFDATELAVAITTAVSIFSLIYAVVIPSLGSGELEKRMKSVAVEREILRKQQITSLQKDS 67
L EL + A +F+L VV+ L EL +K V E + + L S
Sbjct: 1 LLMRAELYLTFLVARPLFALAILVVMAVLKRKELLLLLKGVDREIPRIILLSLIRLLIGS 60
Query: 68 ASSRLRTRDSKSLRNFVKKLNLKAILVDENIVNKLRAAGFRSEYSLNILLVVRLVVPII- 126
S R LR + L L D KL A +R E +L + L L+ +
Sbjct: 61 VSIRELLAFL--LRKIISGLLLFFGGEDSKPRFKLVEALYRLELALPLFLFALLLALFLG 118
Query: 127 FLILGIIWIFGYDKLLEYPFQLRVGAVILIGYVGFCAPSVWISNLVQKRQSSIKRAWPDA 186
L+ I + + P + +LI +G P + S ++KR I R PD
Sbjct: 119 VLLSLPILLLVLGSFIILPLSDFLAIALLISLLGLPLPFILFSLALKKRLKEIARELPDF 178
Query: 187 LDLLLICVESGISIDQALRRVAEDIGGQSVPLSEEMLLTIAELSFLPNRQVAFENFYNRT 246
L L+ +C+E+G+S+ AL+RVA+++ GQ L+EE+ T AELS + + A R
Sbjct: 179 LRLMAVCLEAGLSLADALKRVADELYGQ-RILAEELARTTAELSLGLSIEEALIRLAVRL 237
Query: 247 QMDCVRNATQALIQSDRYGTSIGDSLRVLVSETRSE-RLMEAEKKAASLGPKLTVPMIIF 305
D V+ L Q+ G S+ D+LRVL E E RL E +A +G + +P+I+
Sbjct: 238 GSDEVKRVVSLLTQALESGGSLADALRVLSMELSRETRLKLGEIRAGKIGVGILLPLILV 297
Query: 306 FLPVLILVIIGPAILSIIDTMK 327
+PVL+ V+I PAIL I +
Sbjct: 298 IIPVLLFVVILPAILGIASSGG 319
>gnl|CDD|144175 pfam00482, GSPII_F, Bacterial type II secretion system protein F
domain. The original family covered both the regions
found by the current model. The splitting of the family
has allowed the related FlaJ_arch (archaeal FlaJ family)
to be merged with it. Proteins with this domain in form
a platform for the type II secretion machinery, as well
as the type IV pili and the archaeal flagellae. This
domain seems to show some similarity to pfam00664, but
this may just be due to similarities in the TM helices
(personal obs: C Yeats).
Length = 125
Score = 62.7 bits (153), Expect = 1e-10
Identities = 28/131 (21%), Positives = 54/131 (41%), Gaps = 6/131 (4%)
Query: 186 ALDLLLICVESGISIDQALRRVAEDIGGQSVPLSEEMLLTIAELSFLPNRQVAFENFYNR 245
L L + +G+ + +AL +AE+ + PL EE+ L + A R
Sbjct: 1 FLRQLATLLRAGLPLLEALEILAEEAE--NGPLREELKRIRERLREGGSLSEALA----R 54
Query: 246 TQMDCVRNATQALIQSDRYGTSIGDSLRVLVSETRSERLMEAEKKAASLGPKLTVPMIIF 305
T + ALI + G ++ + L L R + + AA L P + + + +
Sbjct: 55 TPLSEFPPLLVALIAAGESGGNLAEVLERLADYLEERRELRRKILAALLYPLILLVVALL 114
Query: 306 FLPVLILVIIG 316
L +L+ +++
Sbjct: 115 VLLILLAILLP 125
>gnl|CDD|34572 COG4965, TadB, Flp pilus assembly protein TadB [Intracellular
trafficking and secretion].
Length = 309
Score = 54.5 bits (131), Expect = 4e-08
Identities = 41/196 (20%), Positives = 77/196 (39%), Gaps = 8/196 (4%)
Query: 120 RLVVPIIFLILGIIWIFGYDKLLEYPFQLRVGAVILIGYVGFCAPSVWISNLVQKRQSSI 179
+ +I L G + L V V LIG + + +R
Sbjct: 79 VSLARLILLSAGAGLAVAL--VAWLGLSLLVALVALIGAALLPRLVLR--SRRARRLKRF 134
Query: 180 KRAWPDALDLLLICVESGISIDQALRRVAEDIGGQSVPLSEEMLLTIAELSFLPNRQVAF 239
+ P+ALDL++ + +G + ALR A++ PL E L + A
Sbjct: 135 GQQLPEALDLIVRALRAGAPLPDALRLAAKETPE---PLGTEFTLITDRQQLGIDLPAAL 191
Query: 240 ENFYNRTQMDCVRNATQALIQSDRYGTSIGDSLRVLVSETRSERLMEAEKKAASLGPKLT 299
Y R ++ + + R+G ++ + L L R + M+A+ +A S +++
Sbjct: 192 LRMYERYPLEELYFLAIVISIQSRHGGNLSELLDNLSRVIRERKKMKAKVRALSAEARMS 251
Query: 300 VPMIIFFLPVLILVII 315
I+ LP+L+ ++I
Sbjct: 252 A-WILGALPLLVGLLI 266
>gnl|CDD|31669 COG1480, COG1480, Predicted membrane-associated HD superfamily
hydrolase [General function prediction only].
Length = 700
Score = 31.1 bits (70), Expect = 0.47
Identities = 23/115 (20%), Positives = 45/115 (39%), Gaps = 9/115 (7%)
Query: 50 VEREILRKQQITSLQKDSASSRLRTRDSKSLRNFVKKLNLKAIL-----VDENIVNKLRA 104
EIL + D + +++ S VK + I+ + + L
Sbjct: 205 AATEILNSALKPNYVYDEEQTENLRQEALSKVEPVKISKGQIIVKEGEIITDEDYVILDL 264
Query: 105 AGFRSEYSLNILLVVRLVVPIIFLILGIIWIFGYDKLLEYPFQLRVGAVILIGYV 159
G S+NIL ++ L +I +I I+ Y++ + P +LR ++L +
Sbjct: 265 LGL-LSLSVNILPLLGL---LILVIFLILLFALYERRTKSPLKLRNSLLLLYLSL 315
>gnl|CDD|39872 KOG4673, KOG4673, KOG4673, Transcription factor TMF, TATA element
modulatory factor [Transcription].
Length = 961
Score = 29.8 bits (66), Expect = 0.98
Identities = 18/71 (25%), Positives = 32/71 (45%), Gaps = 8/71 (11%)
Query: 40 ELEKRMKSVAVEREILRKQQITSLQKDSASSRLRTRDSKSLRNFVKKLNL------KAIL 93
LEK+++++ ER+ LR++Q K ++ L + + +L K L
Sbjct: 413 TLEKKVQALTKERDALRREQ--KSLKKELAAALLKDELAEKDEIINQLMAEGEKLSKKQL 470
Query: 94 VDENIVNKLRA 104
I+ KLRA
Sbjct: 471 AQSAIIKKLRA 481
>gnl|CDD|143474 cd07322, PriL_PriS_Eukaryotic, Eukaryotic core primase: Large
subunit, PriL. Primases synthesize the RNA primers
required for DNA replication. Primases are grouped into
two classes, bacteria/bacteriophage and
archaeal/eukaryotic. The proteins in the two classes
differ in structure and the replication apparatus
components. Archaeal/eukaryotic core primase is a
heterodimeric enzyme consisting of a small catalytic
subunit (PriS) and a large subunit (PriL). In eukaryotic
organisms, a heterotetrameric enzyme formed by DNA
polymerase alpha, the B subunit and two primase subunits
has primase activity. Although the catalytic activity
resides within PriS, the PriL subunit is essential for
primase function as disruption of the PriL gene in yeast
is lethal. PriL is composed of two structural domains.
Several functions have been proposed for PriL such as
stabilization of the PriS, involvement in synthesis
initiation, improvement of primase processivity,
determination of product size and transfer of the
products to DNA polymerase alpha.
Length = 390
Score = 27.6 bits (62), Expect = 5.5
Identities = 18/73 (24%), Positives = 25/73 (34%), Gaps = 13/73 (17%)
Query: 42 EKRMKSVAVEREILRKQQITSLQKDSASSRLRTRDSKSLRNFVKKLNLKAILVDENIVNK 101
E R V E E+ R RL + L+ F+K L V + +
Sbjct: 55 ELRRWFVRQETELFRY-------------RLELLSLEGLKQFLKSNGLDYQPVSDEEKEE 101
Query: 102 LRAAGFRSEYSLN 114
LR +S SL
Sbjct: 102 LREELLKSASSLK 114
>gnl|CDD|38163 KOG2952, KOG2952, KOG2952, Cell cycle control protein [Cell cycle
control, cell division, chromosome partitioning,
Transcription, Signal transduction mechanisms].
Length = 351
Score = 27.2 bits (60), Expect = 5.7
Identities = 15/89 (16%), Positives = 36/89 (40%), Gaps = 11/89 (12%)
Query: 103 RAAGFRSEYSLNILLVVRLVVPIIFLILGIIWIFGYDKLLEYPFQLRVGAVILIGYVGFC 162
R ++ + +L + ++ ++FL +G+ +F K++E I I Y C
Sbjct: 32 RLPAWQPILTPRTVLPLFFIIGVVFLPIGVGLLFASSKVIE----------ITIRYTD-C 80
Query: 163 APSVWISNLVQKRQSSIKRAWPDALDLLL 191
P+ + +N + Q + + +
Sbjct: 81 IPTGFRTNPSEYIQGHFDQTKSCTITFTV 109
>gnl|CDD|37930 KOG2719, KOG2719, KOG2719, Metalloprotease [General function
prediction only].
Length = 428
Score = 27.1 bits (60), Expect = 6.0
Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 1/69 (1%)
Query: 99 VNKLRAAGFRSEYSLNILLVVRLVVPIIFLILGIIWIFGYDKLLE-YPFQLRVGAVILIG 157
NK + F + ++L+ V L +PI+ I+ I+ FG + + F + ++L
Sbjct: 140 FNKQTLSLFIIDGLKSLLVGVVLTIPIVAAIVMILKKFGPYFFIYLWLFYFVLSLLLLTI 199
Query: 158 YVGFCAPSV 166
Y GF AP
Sbjct: 200 YPGFIAPLF 208
>gnl|CDD|31175 COG0833, LysP, Amino acid transporters [Amino acid transport and
metabolism].
Length = 541
Score = 26.7 bits (59), Expect = 9.8
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 98 IVNKLRAAGF-RSEYSLNILLVVRLVVPIIFLILGIIWIFG 137
++N GF +E+ + + V+ + I F+ILGII I G
Sbjct: 173 LLNLFGVKGFGETEFWFSSIKVLTI---IGFIILGIIIICG 210
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.325 0.139 0.390
Gapped
Lambda K H
0.267 0.0753 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 3,791,209
Number of extensions: 200999
Number of successful extensions: 1044
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1042
Number of HSP's successfully gapped: 32
Length of query: 329
Length of database: 6,263,737
Length adjustment: 94
Effective length of query: 235
Effective length of database: 4,232,491
Effective search space: 994635385
Effective search space used: 994635385
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 58 (26.1 bits)