From PneumoWiki
Jump to navigation Jump to search
PangenomeTIGR4
serotype 4
D39
serotype 2
D39V
serotype 2
Hungary19A-6
serotype 19A
EF3030
serotype 19F
670-6B
serotype 6B
6A-10
serotype 6A
70585
serotype 5
A026
serotype 19F
A66
serotype 3
AP200
serotype 11A
ASP0581
serotype 12F
ATCC 49619
serotype 19F
ATCC 700669
serotype 23F
BM6001
serotype 19F
BVJ1JL
serotype 1
CGSP14
serotype 14
G54
serotype 19F
HU-OH
serotype 3
Hu15
serotype 19A
Hu17
serotype 19A
INV104
serotype 1
INV200
serotype 14
JJA
serotype 14
MDRSPN001
serotype 19F
NCTC7465
serotype 1
NCTC7466
serotype 2
NU83127
serotype 4
OXC141
serotype 3
P1031
serotype 1
R6
serotype 2
SP49
serotype 19A
SPN032672
serotype 1
SPN034156
serotype 3
SPN034183
serotype 3
SPN994038
serotype 3
SPN994039
serotype 3
SPNA45
serotype 3
ST556
serotype 19F
TCH8431/19A
serotype 19A
Taiwan19F-14
serotype 19F
Xen35
serotype 4
gamPNI0373
serotype 1

NCBI: 08-MAR-2025

Summary[edit | edit source]

  • organism: Streptococcus pneumoniae SP49
  • locus tag: BMJ42_RS07930 [old locus tag: BMJ42_01499 ]
  • pan locus tag?: PNEUPAN002831000
  • symbol: BMJ42_RS07930
  • pan gene symbol?: pclA
  • synonym:
  • product: pneumococcal collagen-like adhesin PclA

Genome View[edit | edit source]

Gene[edit | edit source]

General[edit | edit source]

  • type: CDS
  • locus tag: BMJ42_RS07930 [old locus tag: BMJ42_01499 ]
  • symbol: BMJ42_RS07930
  • product: pneumococcal collagen-like adhesin PclA
  • replicon: chromosome
  • strand: -
  • coordinates: 1426423..1432023
  • length: 5601
  • essential: unknown

Accession numbers[edit | edit source]

  • Location: NZ_CP018136 (1426423..1432023) NCBI
  • BioCyc: BMJ42_RS07930 BioCyc
  • MicrobesOnline:
  • PneumoBrowse for strain D39V: SPV_1376 PneumoBrowse

Phenotype[edit | edit source]

Share your knowledge and add information here. [edit]

DNA sequence[edit | edit source]

  • 1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    4381
    4441
    4501
    4561
    4621
    4681
    4741
    4801
    4861
    4921
    4981
    5041
    5101
    5161
    5221
    5281
    5341
    5401
    5461
    5521
    5581
    ATGAAACATTCACATAAAAAATCATTTGACTGGTATAGCATGCAACAACGTTATTCTATT
    CGTAAGTATTACTTTGGTGCAGTTAGTGTCTTGCTCGGTACCGCTTTGGTATTAGGTGCA
    GCAGCTAGTGTCCAAACAGTACAAGCGGAAGAAAACAAACAAGAAACTACCAATAGTATT
    TCTGTTGGTAGGGGAGAAGCAGCTACTAAACTAGCAGAGGTTTCTGCGTCTAATAAAGAG
    AAAACCTATGCAGCTCCAACTGTAGCTAATCCAGTAGAAACGACTCCAGTTAAAACTGGA
    GAAGTTACTAAACCAGCAGAAAAAGTTGAAGAAGCAAAAGACAAAAAAGAGGAAGTAACG
    CATCAAGATGCTATTGACAAGTCAAAATTGTTAACGGCTCTTTCGCGTGCTAAAAAGTTA
    GAAAGCAAGTTATATACAGAAGCAAGTGCTGCTAACTTGCAAACAAGTATCCAAGCTGGT
    CAAAGCTTGCTTGGGAAAGCAGATGCATCTGAAGCTGAATTATCAGCAGCAGAGTCATCT
    ATTCAATCATCTATTATTGGTCTAGAACTTCGTTCTAACTCTAACAAAGGAACTGTATCA
    GAAACTCCTGTAGCGAAGAAAGCTAATATAGTTGAAGCGAAGGAAGAGACTAATCCAGCT
    GCAACAACTGATCGTTCAGCTGTTGATAGTGCTGTTTTGCCAACTAGCACAGCTGCCAAA
    GTAGAAACAACTTCAGCTCCAGCATCTATTAATGAAATCTTGAAACCAGGTTTGAGCCTT
    TCTGATGCTCGCCAAAATCCAGCTATCCGTAAAGAAGACTTGGATAAGGGGTATAGTGGT
    TTTAGAGCAGCACCTGCTCCAACGAACAAACTAGTAACTAATCTAGGAAATAATACTGTT
    TTCACTGATATTAGCAGAGGGTCTCATACTTTCCGTGGTAATGGGAATTCACGTGGTGGA
    AACCCGATTCATTTTGATGTAACAACAACTAGAGTTGGAAATAGAGTAGAATTCTCAATT
    GCATATTCTGGTCCAGGTGAGTTCGTTAATAATAACTTTCTCTTAGACAAAGGGAATGGA
    TTTGGAGAACCATCTCGTGCAACTATTTCAACAAACAGATTAAGAGATCAAACTAAAGAT
    ATAAGAAAGGGTGCAAATTTTGTATCTCATTCAAGCTACAGTTTAACATCTGCTGTTGAA
    ACAAACAGTAATCAAACTATTAGATTTAGTTTACCTATAGCCAACCCTAATGGTGATCTG
    TCTGTTCGATTAAGACCTGTTACTTTTAATGTAGGTCAAGGGGAATCAGGATCTGCAACT
    AGTAATGATCCATATAGTAACTCTAACTATTACTTCAGAGAAAAACCACTCCTCTTGGAT
    GCAAATCCAAATGGTGGTACTAATAATAAGACTGTTTCAGAAGACATTGACTTCCAAACT
    GTCTATCTTCCAACTAGTAAGTTACCAGAAGGTCAAACTAGATTAGTTCGAGAAGGTGAA
    AAAGGACAACGTCAAATCATCTATAAAGTTCATCAATTGGGTAACGAAACAATCCTAGGA
    TTGCCAATTAGTAATTCAGTTATTAAAGAAGTTAAGCCACGTATTATGCAAATTGGTGTG
    GCTAAAGATCTAATCGATACAGTAAAACCACGTGTTGATCAAAATAAAGTCGGTGATACA
    AATAACCTCACTTTCTATCTTGATAACGATGGAAACGGTGTTTATACTGAGGGCGTAGAC
    GAACTTGTTCAAAAAATTGCTATTAAAGATGGTGCTAAAGGTGAAAAAGGAGTCCAAGGC
    GAACGCGGCCTGACTGGAGCGCAAGGTCAGGCAGGTCGTGACGGTGTAACTCCAACCGTA
    ACCGTTAAAGATAATAAAAATGACGGCACTCATACTATCACTATTAACGACGGTAGAGGT
    AATGTTGCAAGTACTGTTGTAAGAGATGGTTTCGATGGCGCAAGTCCATTAGTTGCGACT
    CAACGAAATGAAGCAGATAAGACAACAACTGTTATCTTCTATTATGATCAAAATGGAAAT
    AATGAATTAGATGCTTCTGATAAGAAATTAAAAGAAGTTATTATTGCAGATGGTGCCAAA
    GGTGAACAAGGTCTTCAAGGTCGTGATGGAGCTCAAGGACCAAAAGGCGCAGATGGACAA
    AGAGGACCAGCGGGACCACAAGGACCAAAAGGAGAACAAGGTAATCCAGGAACTCCAGGT
    AAAGATGGAAAATCTCTAATTGCTGTTAAAAATGGAACTGAAACAAAAGTCTATGTAGAG
    GATCCGGCTAGACCAGGACAACCACTTAATCCAAACCAACCATTAGCAACTATCACTGAT
    GGAAGAAATGGTACTAATGGACAATCACCAACAATTACAGCGACACGAAGTGTTCAAAAT
    GGTAAAAATGGTGTATTAGTAACAATTACACCAGTAGGAGGTCGTCCACAAACAACATTT
    GTAGAGGATGGACAAAAAGGTGCTGATGGGAAAACCCCAACAGTAACAATAACTGAGGGG
    CAAAACGGCACACATACATTAACGGTTCATAATCCAGGAAGTCCAGATGTGACAACTACG
    ATCCGTGATGGAGCTACAGGACAAGCAGGTCGTGATGGTAAAGATGTATTAAACGGAAAA
    GTAAATCCACAACCAAACCAAGGTAAAAATGGAGATAAATATATTAATACCGAAACCGGT
    GATGTCTATGTTAAAAACAATGGAAACTGGGATAAAGAAGGCAACATCAAAGGCCCTAAA
    GGGGACAAAGGTGCAGATGGTGCTAAAGGCGAAAAAGGAGACCAAGGCGAACGCGGCCTA
    ACTGGAGCGCAAGGAGCTAAAGGTGCGGATGGCGCAGTAGGTCGTGATGGACGTGACGGT
    AAAGACGTGTTGAACGGCAAAGCTAACCCAGAAGCACATCAAGGTAAAGACGGCGATAAA
    TACGTTAATACAGAAACAGGCGACGTCTTCGTTAAGAATAACGGCAACTGGGATAAAGAG
    GGCAACATCAAAGGCCCTAAGGGTGACAAAGGTGAACGCGGAGAAGATGGTAAGACTCCA
    GAAGTAACTGTAACTCCAGGTAAAGATGGCCATAGTACTGACATTACATTCACTGTTCCA
    GGTAAAGATCCAGTTACAGTAAATGTTAAGGACGGAGAAAATGGTCTGAACGGTAAAACT
    CCAAAAGTTGATTTACTTCGTGTCCAAGGAAAAAACGGAAATCCATCTCATACAATTGTG
    ACATTCTATACAGATGAAAACAATGACGGCAAATATACACCAGGAACTGATGAACTTCTA
    GGTTCAGAAATGATTAAAGATGGTGCTAAAGGCGCGGACGGACGAGATGGTAAATCATTG
    CTTACTGTCAAGGATGGTAAAGAAACTAAAGTTTACCAAGAAGATCCAGCTAACCCAGGA
    CAACCATTAAATCCAGAAAAACCACTTGCGGTAATTAGAGATGGAGTAGATGGAAAATCA
    CCTACAGTTACAGCTGTTCGTAAAGATGAAGCAGGGCATAAAGGTGTAGAAATCACTGTT
    GATAACCATGATGGTTCACAACCAACTACAGTCTTTGTTCAAGATGGTGCTAAGGGAGAA
    ACTGGTGCAACCGGTCAGGATGGACAAACTCCTACAATCACTACTCAACGTGGACAAGAT
    GGCCAAAGCACTGTTGTAACTATCACAACATCAGGTAAAGATCCAGTAACCTTCACTGTA
    AAAGATGGTAAGAATGGTAAAGATGGCCGTGCACCGAAAATCAAAGTAGAAGATATTACT
    TCACCTTCAAGAATTAGACGCGATACAGATGCTGCTGCAACTCCAACGCGTAACGGTATC
    CGTGTTACAGTTTATGATGATGTTAATGACAATGGGGTATACGACGGAGGTGTCGATAAA
    GTATTAAATAGTAAAGATATTTATAACGGTATAGATGGACGTGATGGTTCAGCTCCAACT
    ATTACTACAAAAGATAATGGAGATGGAACACACACTATCACAGTTCAAAATCCTGATGGT
    TCTGAATCAACAACTGTTGTTAAAGATGGTAAAGACGGTAAAACTGCGAATATCACTACA
    ACAGAAAACCCAGATGGAAGCCACACAATTACGGTGACAAATCCAGATGGTTCAACTAAA
    GAAACTGTTGTTAAAAACGGTAAAGACGGTAAGACTCCTAAAGTTGAAGTAACGGATAAC
    AACGATGGAACACACACAGTTAAAGTGACGGATGGAGACGGCAATGTTACCAACGCTATC
    ATCAAAGATGGTAAAGATGGTAAAGCTGCAACAGCAACAACTACTGAAAATCCAGATGGA
    AGCCACACAGTAACAATCACTAACCCAGACGGAACTAAGAATGAGTTTGTTGTTAAGAAT
    GGACGTGACGGTGTTGACGGACGTACTCCAACCGCATCTGTTCGTGATAATGGAGACGGA
    AGTCATACAATCGTTATTACAAATCCAGAAGGTGTGACAACTGAAACCACAGTTCGTGAT
    GGTAAATCACCAAAAGTGACTATAACTGATGAACAAAATGGAACTCATAAGATCTCTGTT
    CTAAATGGTGACGGAACAACTACTGAAACAATCATTACAGATGGTAAATCACCAGTAGCA
    ACAGTTAGAGATAACCAAGATGGTACTTACACTATTCGTGTGGAAAACGGTAATGGTACT
    GTTTCTGAAACCACAGTTCGTGACGGTAAATCACCAACTGCTAAGGTTGTGGATAATGGA
    GATGGAACTCACACTATCACAGTTGTGAACTCAGACGGAACAACTACAACAACTACAGTT
    CGTGATGGTAGAGAACCAAAACTTGAAGTTATTGATAACAACGATGGTTCACACACTATT
    AAAGTGACAGGTGCTGATGGTAAAGGAACGACAACTACAATCTTTGATGGTAAATCACCA
    AAAGCGAACATCGTTGATAACGGAGATGGAACTCATACATTAACAATCGTAGATTCTGAT
    GGTCGTGAATACAAATCTATTATCAAAGATGGTAAAGACGGCAAAGATAGCGTTTCACCA
    ACTGTAACTGTTAAAAATAATAACGATGGAACTCACGTTGTTACAATCACTAATCCAGAT
    GGAAGTAAGACAGAAATGGTGATTAAAGACGGTAAAGATGGTAAATCACCAAAAGTTTCT
    GTTGAAGATAATGGTGATGGTAGTCATACAATCACAATCATCAATTCTGATGGAACTGTG
    ACAAAAACAGTTATTAAAGATGGCAAAGATGGTAGAGATGGACGTGATGGTCGAGACGGC
    AAAGACGGTAAAGATGGAAAATGTGGATGCCAAGACAAACCAGTAACACCATCAAATGAC
    AAACCAGTTCCTCCAACACCAAATGTGCCGACACCAGAAGTACCGGTTAAACCAGTACCA
    GCGGTTCCAGAACAACCAGTAGTACCAACACCGGCTCAACCAGCAACTCCAGTAAATGCT
    AACCCAGTAGCACCAACTACAGGTAAAGAAAACCGTGGGGACAAATTACCTGAAACTGGA
    AGCCAATCTGATTATATCTCTGTTCTTTTAGGTAGCGGTATTCTATTGAGCCTATATGTA
    GGACGAAGAAAAGAAGATTAA
    60
    120
    180
    240
    300
    360
    420
    480
    540
    600
    660
    720
    780
    840
    900
    960
    1020
    1080
    1140
    1200
    1260
    1320
    1380
    1440
    1500
    1560
    1620
    1680
    1740
    1800
    1860
    1920
    1980
    2040
    2100
    2160
    2220
    2280
    2340
    2400
    2460
    2520
    2580
    2640
    2700
    2760
    2820
    2880
    2940
    3000
    3060
    3120
    3180
    3240
    3300
    3360
    3420
    3480
    3540
    3600
    3660
    3720
    3780
    3840
    3900
    3960
    4020
    4080
    4140
    4200
    4260
    4320
    4380
    4440
    4500
    4560
    4620
    4680
    4740
    4800
    4860
    4920
    4980
    5040
    5100
    5160
    5220
    5280
    5340
    5400
    5460
    5520
    5580
    5601

Protein[edit | edit source]

General[edit | edit source]

  • locus tag: BMJ42_RS07930 [old locus tag: BMJ42_01499 ]
  • symbol: BMJ42_RS07930
  • description: pneumococcal collagen-like adhesin PclA
  • length: 1866
  • theoretical pI: 5.77235
  • theoretical MW: 196108
  • GRAVY: -0.812433

Function[edit | edit source]

  • TIGRFAM:
    Streptococcal surface-anchored protein repeat, S. criceti family (TIGR04203; HMM-score: 536.1)
    and 3 more
    gram-positive signal peptide, YSIRK family (TIGR01168; HMM-score: 35.6)
    Cell structure Cell envelope Other LPXTG cell wall anchor domain (TIGR01167; HMM-score: 17)
    Por secretion system C-terminal sorting domain (TIGR04183; HMM-score: 7.9)
  • TheSEED: data available for D39
  • PFAM:
    no clan defined CFSR; Collagen-flanked surface repeat (PF19079; HMM-score: 559.8)
    and 8 more
    Collagen; Collagen triple helix repeat (20 copies) (PF01391; HMM-score: 65.7)
    G5 (CL0593) G5; G5 domain (PF07501; HMM-score: 39.9)
    no clan defined YSIRK_signal; YSIRK type signal peptide (PF04650; HMM-score: 33.3)
    E-set (CL0159) BiPBP_C; Penicillin-Binding Protein C-terminus Family (PF06832; HMM-score: 31.3)
    no clan defined Gram_pos_anchor; LPXTG cell wall anchor motif (PF00746; HMM-score: 14.4)
    DUF4860; Domain of unknown function (DUF4860) (PF16152; HMM-score: 11.3)
    PGDYG; PGDYG protein (PF14083; HMM-score: 10.5)
    E-set (CL0159) Por_Secre_tail; Secretion system C-terminal sorting domain (PF18962; HMM-score: 9.3)

Structure, modifications & cofactors[edit | edit source]

  • domains:
  • modifications:
  • cofactors:
  • effectors:

Localization[edit | edit source]

  • PSORTb: Cellwall
    • Cytoplasmic Score: 0
    • Cytoplasmic Membrane Score: 0
    • Cellwall Score: 9.99
    • Extracellular Score: 0.01
    • Internal Helix: 1
  • DeepLocPro: Cell wall & surface
    • Cytoplasmic Score: 0.0005
    • Cytoplasmic Membrane Score: 0.0013
    • Cell wall & surface Score: 0.9801
    • Extracellular Score: 0.0182
  • SignalP: Signal peptide SP(Sec/SPI) length 49 aa
    • SP(Sec/SPI): 0.947738
    • TAT(Tat/SPI): 0.02451
    • LIPO(Sec/SPII): 0.005982
    • Cleavage Site: CS pos: 49-50. VQA-EE. Pr: 0.9032
  • predicted transmembrane helices (TMHMM): 1

Accession numbers[edit | edit source]

  • GI:
  • RefSeq: WP_050198442 NCBI
  • UniProt:

Protein sequence[edit | edit source]

  • MKHSHKKSFDWYSMQQRYSIRKYYFGAVSVLLGTALVLGAAASVQTVQAEENKQETTNSISVGRGEAATKLAEVSASNKEKTYAAPTVANPVETTPVKTGEVTKPAEKVEEAKDKKEEVTHQDAIDKSKLLTALSRAKKLESKLYTEASAANLQTSIQAGQSLLGKADASEAELSAAESSIQSSIIGLELRSNSNKGTVSETPVAKKANIVEAKEETNPAATTDRSAVDSAVLPTSTAAKVETTSAPASINEILKPGLSLSDARQNPAIRKEDLDKGYSGFRAAPAPTNKLVTNLGNNTVFTDISRGSHTFRGNGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFLLDKGNGFGEPSRATISTNRLRDQTKDIRKGANFVSHSSYSLTSAVETNSNQTIRFSLPIANPNGDLSVRLRPVTFNVGQGESGSATSNDPYSNSNYYFREKPLLLDANPNGGTNNKTVSEDIDFQTVYLPTSKLPEGQTRLVREGEKGQRQIIYKVHQLGNETILGLPISNSVIKEVKPRIMQIGVAKDLIDTVKPRVDQNKVGDTNNLTFYLDNDGNGVYTEGVDELVQKIAIKDGAKGEKGVQGERGLTGAQGQAGRDGVTPTVTVKDNKNDGTHTITINDGRGNVASTVVRDGFDGASPLVATQRNEADKTTTVIFYYDQNGNNELDASDKKLKEVIIADGAKGEQGLQGRDGAQGPKGADGQRGPAGPQGPKGEQGNPGTPGKDGKSLIAVKNGTETKVYVEDPARPGQPLNPNQPLATITDGRNGTNGQSPTITATRSVQNGKNGVLVTITPVGGRPQTTFVEDGQKGADGKTPTVTITEGQNGTHTLTVHNPGSPDVTTTIRDGATGQAGRDGKDVLNGKVNPQPNQGKNGDKYINTETGDVYVKNNGNWDKEGNIKGPKGDKGADGAKGEKGDQGERGLTGAQGAKGADGAVGRDGRDGKDVLNGKANPEAHQGKDGDKYVNTETGDVFVKNNGNWDKEGNIKGPKGDKGERGEDGKTPEVTVTPGKDGHSTDITFTVPGKDPVTVNVKDGENGLNGKTPKVDLLRVQGKNGNPSHTIVTFYTDENNDGKYTPGTDELLGSEMIKDGAKGADGRDGKSLLTVKDGKETKVYQEDPANPGQPLNPEKPLAVIRDGVDGKSPTVTAVRKDEAGHKGVEITVDNHDGSQPTTVFVQDGAKGETGATGQDGQTPTITTQRGQDGQSTVVTITTSGKDPVTFTVKDGKNGKDGRAPKIKVEDITSPSRIRRDTDAAATPTRNGIRVTVYDDVNDNGVYDGGVDKVLNSKDIYNGIDGRDGSAPTITTKDNGDGTHTITVQNPDGSESTTVVKDGKDGKTANITTTENPDGSHTITVTNPDGSTKETVVKNGKDGKTPKVEVTDNNDGTHTVKVTDGDGNVTNAIIKDGKDGKAATATTTENPDGSHTVTITNPDGTKNEFVVKNGRDGVDGRTPTASVRDNGDGSHTIVITNPEGVTTETTVRDGKSPKVTITDEQNGTHKISVLNGDGTTTETIITDGKSPVATVRDNQDGTYTIRVENGNGTVSETTVRDGKSPTAKVVDNGDGTHTITVVNSDGTTTTTTVRDGREPKLEVIDNNDGSHTIKVTGADGKGTTTTIFDGKSPKANIVDNGDGTHTLTIVDSDGREYKSIIKDGKDGKDSVSPTVTVKNNNDGTHVVTITNPDGSKTEMVIKDGKDGKSPKVSVEDNGDGSHTITIINSDGTVTKTVIKDGKDGRDGRDGRDGKDGKDGKCGCQDKPVTPSNDKPVPPTPNVPTPEVPVKPVPAVPEQPVVPTPAQPATPVNANPVAPTTGKENRGDKLPETGSQSDYISVLLGSGILLSLYVGRRKED

Experimental data[edit | edit source]

  • protein localization:
  • interaction partners:

Expression & Regulation[edit | edit source]

Operon[edit | edit source]

Regulation[edit | edit source]

  • regulator:

Transcription[edit | edit source]

  • transcription start site:

Expression data[edit | edit source]

Biological Material[edit | edit source]

Mutants[edit | edit source]

Expression vector[edit | edit source]

lacZ fusion[edit | edit source]

GFP fusion[edit | edit source]

two-hybrid system[edit | edit source]

FLAG-tag construct[edit | edit source]

Antibody[edit | edit source]

Other Information[edit | edit source]

You can add further information about the gene and protein here. [edit]

Literature[edit | edit source]

References[edit | edit source]

Relevant publications[edit | edit source]