Jump to navigation
		Jump to search
		
PangenomeTIGR4
serotype 4
D39serotype 2
D39Vserotype 2
Hungary19A-6serotype 19A
EF3030serotype 19F
670-6Bserotype 6B
6A-10serotype 6A
70585serotype 5
A026serotype 19F
A66serotype 3
AP200serotype 11A
ASP0581serotype 12F
ATCC 49619serotype 19F
ATCC 700669serotype 23F
BM6001serotype 19F
BVJ1JLserotype 1
CGSP14serotype 14
G54serotype 19F
HU-OHserotype 3
Hu15serotype 19A
Hu17serotype 19A
INV104serotype 1
INV200serotype 14
JJAserotype 14
MDRSPN001serotype 19F
NCTC7465serotype 1
NCTC7466serotype 2
NU83127serotype 4
OXC141serotype 3
P1031serotype 1
R6serotype 2
SP49serotype 19A
SPN032672serotype 1
SPN034156serotype 3
SPN034183serotype 3
SPN994038serotype 3
SPN994039serotype 3
SPNA45serotype 3
ST556serotype 19F
TCH8431/19Aserotype 19A
Taiwan19F-14serotype 19F
Xen35serotype 4
gamPNI0373serotype 1
NCBI: 31-JAN-2014
⊟Summary[edit | edit source]
- organism: Streptococcus pneumoniae Hungary19A-6
- locus tag: SPH_2354 [new locus tag: SPH_RS11525 ]
- pan locus tag?: PNEUPAN003850000
- symbol: SPH_2354
- pan gene symbol?: fucY
- synonym:
- product: large secreted protein
⊟Genome View[edit | edit source]
⊟Gene[edit | edit source]
⊟General[edit | edit source]
- type: CDS
- locus tag: SPH_2354 [new locus tag: SPH_RS11525 ]
- symbol: SPH_2354
- product: large secreted protein
- replicon: chromosome
- strand: -
- coordinates: 2162319..2164613
- length: 2295
- essential: unknown
⊟Accession numbers[edit | edit source]
- Location: CP000936 (2162319..2164613) NCBI
- BioCyc: see SPH_RS11525
- MicrobesOnline: 5697173 MicrobesOnline
- PneumoBrowse for strain D39V: SPV_1988 PneumoBrowse
⊟Phenotype[edit | edit source]
Share your knowledge and add information here. [edit]
⊟DNA sequence[edit | edit source]
- 1
 61
 121
 181
 241
 301
 361
 421
 481
 541
 601
 661
 721
 781
 841
 901
 961
 1021
 1081
 1141
 1201
 1261
 1321
 1381
 1441
 1501
 1561
 1621
 1681
 1741
 1801
 1861
 1921
 1981
 2041
 2101
 2161
 2221
 2281GTGAAATTATGGTATAAGAAAGCTGCCGCAAATTGGAATGAAGCCTTGCCGATTGGGAAC
 GGTCATTTAGGTGGTATGATTTATGGTTCAGCTACAAAAGAATGTATTCAACTAAACGAT
 GAGACTATTTGGTATAGAGGAAAGTCAGATAGAAATAATCCAGACTCACTATTGCATCTT
 AAAAAAATTCGGGAATATCTTTTAGATGGAGAAATTCAGAAAGCCGAAGAATTGATAAAG
 TTAACAATGTTTGCTACCCCAAGAGATCAAAGCCACTATGAATTACTTGGGGAACTTTAC
 ATTGAGCATATAGATATTCAGTCTTGTGCTCTTTCATTGTATGAAAGAGAGCTAGATTTA
 GATACAGCTATTTCTAATGTTGTGTTTGAGCCTAATAGTTGTAATTTACAAATAAAAAGA
 GAATATTTTACGAGTTTTAATAAGAATATTTTATGTTGCCGTATAGTGTCATCAGTTCAA
 AACACATTAAATTTAAACATTAATTTGGGTAGAAATAAACGGTTTAATGACGAAGTATCT
 AAACTGGATTCAAGTACAATTTTAATGTCGGCCTCTGCTGGAGGTAGAAAAGGTGTTCAG
 TTTAAAGTAGTATGTCATTCTAAGGTTACGGATGGTGAAGTAAGTGTATTGGGAGAGACA
 ATAGTTATTCGGAATGCTACAGAGGTATTTCTTTATCTCAAATCAATGACGGATTATTGG
 GGAAATATAGATATTTCTTCTCTTCAGGGAGAATTTAGTAGTATTGATTACTTTACAGAA
 AAAGATGAACATGTAAAAAAATATCAGGAGCAATTTAATAGAGTTGATTTTAAACTAGAC
 TATAGTAAAGGTTGTCTTAGCATTCCAACGAATCTACTTCTTGAAAACACTAAAAAGTAT
 AGTAACTACTTGACTAACTTGTTATTTCATTATGGAAGATATCTGTTAATATCGTCTAGT
 CAACCGAATGGTTTACCTGCCAATCTTCAAGGAATATGGTGTGATGAATTAAATCCAATT
 TGGGGTTCTAAATATACGATTAATATTAATACTCAAATGAATTATTGGATTGTAGGTCCA
 TGTGATTTACCAGAAGTAGAATATCCATTATTTGATATGCTCGAAAGAATGAGAGAACCG
 GGAAGACTAACCGCTAAGAAAATGTATGGAGCTAGAGGTTTTACAGCACATCATAATACG
 GATGGTTTTGGCGATACGGCTCCCCAATCTCATGCCATGGGGGCTGCAATTTGGGTATTA
 ACTATTCCATGGTTATGTACTCATATTTGGGAACACTATTTATATTTCCAAGATGAGCGT
 ATTCTTACGGAACATTTTGAAATGATAAAAGAAGCATTTCTTTTCTTTGAAGATTATTTA
 TTTGAGGTGGATGGCTACTTGATGACAGGTCCAAGTGTCTCACCGGAAAATAAATATCGC
 TTAAAAAATGGTATTGAAGGAAATGCTTGTCTATCATCTACAATTGATAATCAAATTTTA
 AGATATTTTTGTGATTCATGCATTGGAATTGCAAAACAATTAGGAGACAATTCGGATTTT
 ATTAGTCGTGTGAAGGAGTTAAAAAAGAAACTACCTAAAACAAAAATAGGTAGTAATGGG
 CAAATCCAAGAATGGTTAGAAGATTATGAAGAAGTAGAGCCTGGGCATAGACACATTTCA
 CCTCTATTTGGGCTTTATCCTTATAATGAGATTGATATTCATAAAACTCCGGAATTAGCA
 GAAGCAGCTAAAATCACTATCAATAGGAGATTATCAAACGCTAACTTTTTATCTTCACAG
 GAGAGGGAGCAAGCGATTAATAATTGGTTAGTAAGTGGTTTGCATGCTAGTACACAAACA
 GGTTGGAGTGCTGCATGGCTGATTCATTTTTTTGCGAGACTATATCAAGGTGAACCTGCT
 TATAACCAGATTAATGGTTTGTTAAATAATGCGACTCTTGGCAATTTATTTCTTGACCAT
 CCACCATTTCAAATTGATGGTAATTTAGGTTTGGTGAGTGGAATTTGTGAATTATTAGTA
 CAGAGCCATCATAATTGGTTATCACTAATTCCAGCTTTACCTTCTGCTTGGTCAGAAGGA
 GAAGTGAAAGGTTTCAGAGTAAGAGGAGGATATAAGGTATCGTTTGCTTGGAAAAATGGG
 GATATAACATTCCTAAAATTGGAAGGAGGAAACAAAGATCAAAAAGTAAGAGTAAGAATA
 TATGGCAAAAATACTGATGTACAAAATATTGAATTGGTATTTAATTCAGAAAAAATTATT
 GAGTTAAATTTTTAG60
 120
 180
 240
 300
 360
 420
 480
 540
 600
 660
 720
 780
 840
 900
 960
 1020
 1080
 1140
 1200
 1260
 1320
 1380
 1440
 1500
 1560
 1620
 1680
 1740
 1800
 1860
 1920
 1980
 2040
 2100
 2160
 2220
 2280
 2295
⊟Protein[edit | edit source]
⊟General[edit | edit source]
- locus tag: SPH_2354 [new locus tag: SPH_RS11525 ]
- symbol: SPH_2354
- description: large secreted protein
- length: 764
- theoretical pI: 6.16188
- theoretical MW: 87324.6
- GRAVY: -0.337173
⊟Function[edit | edit source]
- TIGRFAM: putative PEP-CTERM system TPR-repeat lipoprotein (TIGR02917; HMM-score: 11.6)
- TheSEED  : - putative large secreted protein
 
- PFAM: Gal_mutarotase (CL0103) Glyco_hyd_65N_2; Glycosyl hydrolase family 65, N-terminal domain (PF14498; HMM-score: 189.8)and 2 moreno clan defined DUF6679; Family of unknown function (DUF6679) (PF20384; HMM-score: 12.4)DUF5931; Family of unknown function (DUF5931) (PF19354; HMM-score: 12.3)
⊟Structure, modifications & cofactors[edit | edit source]
- domains:
- modifications:
- cofactors:
- effectors:
⊟Localization[edit | edit source]
- PSORTb: unknown (no significant prediction)- Cytoplasmic Score: 2.5
- Cytoplasmic Membrane Score: 2.5
- Cellwall Score: 2.5
- Extracellular Score: 2.5
- Internal Helices: 0
 
- DeepLocPro: Extracellular- Cytoplasmic Score: 0.1009
- Cytoplasmic Membrane Score: 0.0074
- Cell wall & surface Score: 0.0033
- Extracellular Score: 0.8885
 
- SignalP: no predicted signal peptide- SP(Sec/SPI): 0.029423
- TAT(Tat/SPI): 0.000547
- LIPO(Sec/SPII): 0.003184
 
- predicted transmembrane helices (TMHMM): 0
⊟Accession numbers[edit | edit source]
⊟Protein sequence[edit | edit source]
- MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQSCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGRKGVQFKVVCHSKVTDGEVSVLGETIVIRNATEVFLYLKSMTDYWGNIDISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCLSIPTNLLLENTKKYSNYLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWIVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEHFEMIKEAFLFFEDYLFEVDGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPKTKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLNNATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPALPSAWSEGEVKGFRVRGGYKVSFAWKNGDITFLKLEGGNKDQKVRVRIYGKNTDVQNIELVFNSEKIIELNF
⊟Experimental data[edit | edit source]
- protein localization:
- interaction partners:
⊟Expression & Regulation[edit | edit source]
⊟Operon[edit | edit source]
⊟Regulation[edit | edit source]
- regulator: CcpA regulonCcpA (TF) important in Global catabolite repression; RegPrecise transcription unit transferred from TIGR4 data RegPrecise 
⊟Expression data[edit | edit source]
- PneumoExpress for strain D39V: SPV_1988 PneumoExpress
⊟Biological Material[edit | edit source]
⊟Mutants[edit | edit source]
⊟Expression vector[edit | edit source]
⊟lacZ fusion[edit | edit source]
⊟GFP fusion[edit | edit source]
⊟two-hybrid system[edit | edit source]
⊟FLAG-tag construct[edit | edit source]
⊟Antibody[edit | edit source]
⊟Other Information[edit | edit source]
You can add further information about the gene and protein here. [edit]