Semantic Brand Score

Last updated October 15, 2025

The Semantic Brand Score (SBS) is a measure of brand importance that is calculated on textual data.^[1]^[2]^[3] The measure is rooted in graph theory and partly connected to Keller's^[4] conceptualization of brand equity.^[5] It is calculated by converting texts into word or semantic networks and analyzing three key aspects: the frequency with which a brand name is mentioned (prevalence), the extent to which it is linked to distinctive and uncommon terms in the discourse (diversity), and its potential role as a bridge that connects otherwise unconnected or weakly connected terms or concepts (connectivity).

Definition and calculation

Pre-processing

To compute the Semantic Brand Score, it is necessary to convert the analyzed texts into word networks, i.e., graphs where each node signifies a word. Connections between words are formed based on their co-occurrence within a specified distance threshold (a number of words). Natural language pre-processing is usually conducted to refine texts, which involves tasks such as removing stopwords and applying stemming.^[9] Here is a sample network derived from pre-processing the sentence "The dawn is the appearance of light - usually golden, pink or purple - before sunrise".

The SBS is a composite indicator with three dimensions: prevalence, diversity and connectitivy.^[10]^[11]^[12] SBS measures brand importance, a construct that cannot be understood by examining a single dimension alone.^[5]

Prevalence

Prevalence measures the frequency of brand name usage, indicating how often a brand is explicitly referenced in a corpus. The prevalence factor is associated with brand awareness, suggesting that a brand mentioned frequently in a text is more familiar to its authors.^[10]^[11]^[8] Likewise, frequent mentions of a brand name enhance its recognition and recall among readers.

Diversity

Diversity assesses the variety of words linked with a brand, focusing on textual associations. These textual associations refer to the words used alongside a particular brand or term. Measurement involves employing the degree centrality indicator, reflecting the number of connections a brand node has in the semantic network.^[1] Alternatively, an approach using distinctiveness centrality ^[13] has been proposed, assigning greater significance to unique brand associations and reducing redundancy. The rationale is that distinctive textual associations enrich discussions about a brand, thereby enhancing its memorability.

Diversity can be calculated for the brand node in a word network, i.e., a weighted undirected graph G, made of n nodes and m arcs. If two nodes (terms or concepts), i and j, are not connected, then $w_{ij}=0$ , otherwise the weight of the arc connecting them is $w_{ij}\geq 1$ . In the following, $g_{j}$ is the degree of node j and $I_{(f)}$ is the indicator function which equals 1 if $f=TRUE$ , i.e. if there is an arc connecting nodes i and j.

$DI(i)=\sum _{j=1,j\neq i}^{n}\log _{10}{\frac {n-1}{g_{j}}}I_{(w_{ij}>0)}$ .

Connectivity

Connectivity evaluates a brand's connective power within broader discourse, indicating its capacity to serve as a bridge between various words/concepts (nodes) in the network.^[1]^[2]^[3]^[12] It captures a brand's brokerage power, its ability to connect different words, groups of words, or topics together. The calculation hinges on the weighted betweenness centrality metric.^[3]^[14]

The Semantic Brand Score indicator is given by the sum of the standardized values of prevalence, diversity, and connectivity.^[1]^[10]^[11] SBS standardization is typically performed by subtracting the mean from the raw scores of each dimension and then dividing by the standard deviation.^[3] This process takes into account the scores of all relevant words in the corpus.

References

1 2 3 4 Schlaile, Michael P.; Bogner, Kristina; Muelder, Laura (2021). "It's more than complicated! Using organizational memetics to capture the complexity of organizational culture" . Journal of Business Research. 129: 801–812. doi:10.1016/j.jbusres.2019.09.035.
1 2 Santomauro, Giuseppe; Alderuccio, Daniela; Ambrosino, Fiorenzo; Migliori, Silvio (2021). "Ranking Cryptocurrencies by Brand Importance: A Social Media Analysis in ENEAGRID". In Bitetta, Valerio; Bordino, Ilaria; Ferretti, Andrea; Gullo, Francesco; Ponti, Giovanni; Severini, Lorenzo (eds.). Mining Data for Financial Applications. Lecture Notes in Computer Science. Vol. 12591. Cham: Springer International Publishing. pp. 92–100. doi:10.1007/978-3-030-66981-2_8. ISBN 978-3-030-66981-2.
1 2 3 4 Bashar, Md Abul; Nayak, Richi; Balasubramaniam, Thirunavukarasu (2022-07-25). "Deep learning based topic and sentiment analysis: COVID19 information seeking on social media". Social Network Analysis and Mining. 12 (1): 90. doi:10.1007/s13278-022-00917-5. ISSN 1869-5469. PMC 9312316 . PMID 35911483.
↑ Keller, Kevin Lane (1993). "Conceptualizing, Measuring, and Managing Customer-Based Brand Equity" . Journal of Marketing. 57 (1): 1–22. doi:10.1177/002224299305700101. ISSN 0022-2429.
1 2 Fronzetti Colladon, Andrea (2018). "The Semantic Brand Score". Journal of Business Research. 88: 150–160. arXiv: 2105.05781 . doi:10.1016/j.jbusres.2018.03.026.
↑ Indraccolo, Ugo; Losavio, Ernesto; Carone, Mauro (2023). "Applying graph theory to improve the quality of scientific evidence from textual information: Neural injuries after gynaecologic pelvic surgery for genital prolapse and urinary incontinence" . Neurourology and Urodynamics. 42 (3): 669–679. doi:10.1002/nau.25133. ISSN 0733-2467. PMID 36648454.
↑ Kasia, Parys. "Polish Twitter on immigrants during the 2021 Belarus–European Union border crisis". www.linkedin.com. Retrieved 2024-04-03.
1 2 Das, Sibanjan Debeeprasad; Bala, Pradip Kumar; Das, Sukanta (2024). "Exploiting User-Generated Content in Product Launch Videos to Compute a Launch Score". IEEE Access. 12: 49624–49639. Bibcode:2024IEEEA..1249624D. doi: 10.1109/ACCESS.2024.3381541 . ISSN 2169-3536.
↑ Perkins, Jacob; Fattohi, Faiz (2014). Python 3 text processing with NLTK 3 cookbook. Quick answers to common problems (2nd ed.). Birmingham: Packt Publishing Ltd. ISBN 978-1-78216-785-3.
1 2 3 Bianchino, Antonella; Fusco, Daniela; Pisciottano, Daniele (2021-05-27). "How to Measure the Touristic Competitiveness: A Mixed Mode Model Proposal" (PDF). Athens Journal of Tourism. 8 (2): 131–146. doi:10.30958/ajt.8-2-4.
1 2 3 Beccari, Nicholas; Nicola, Valerio (2019). Brand-generated and Usergenerated content videos on YouTube: characteristics, behavior and user perception (PDF). Milan, Italy: Politecnico di Milano.
1 2 Mercurio, Simona (2024). "What About Corruption? A Text Analytics Method for a Scoping Literature Review". In Giordano, Giuseppe; Misuraca, Michelangelo (eds.). New Frontiers in Textual Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer. pp. 349–359. doi:10.1007/978-3-031-55917-4_28. ISBN 978-3-031-55916-7.
↑ Fronzetti Colladon, Andrea; Naldi, Maurizio (2020-05-22). "Distinctiveness centrality in social networks". PLOS ONE. 15 (5) e0233276. arXiv: 1912.03391 . Bibcode:2020PLoSO..1533276F. doi: 10.1371/journal.pone.0233276 . ISSN 1932-6203. PMC 7244137 . PMID 32442196.
↑ Bashar, Md Abul; Nayak, Richi; Knapman, Gareth; Turnbull, Paul; Fforde, Cressida (December 2023). "An Informed Neural Network for Discovering Historical Documentation Assisting the Repatriation of Indigenous Ancestral Human Remains". Social Science Computer Review. 41 (6): 2293–2317. arXiv: 2303.14475 . doi:10.1177/08944393231158788. ISSN 0894-4393.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:0-1] 1 2 3 4 Schlaile, Michael P.; Bogner, Kristina; Muelder, Laura (2021). "It's more than complicated! Using organizational memetics to capture the complexity of organizational culture" . Journal of Business Research. 129: 801–812. doi:10.1016/j.jbusres.2019.09.035.

[:4-2] 1 2 Santomauro, Giuseppe; Alderuccio, Daniela; Ambrosino, Fiorenzo; Migliori, Silvio (2021). "Ranking Cryptocurrencies by Brand Importance: A Social Media Analysis in ENEAGRID". In Bitetta, Valerio; Bordino, Ilaria; Ferretti, Andrea; Gullo, Francesco; Ponti, Giovanni; Severini, Lorenzo (eds.). Mining Data for Financial Applications. Lecture Notes in Computer Science. Vol. 12591. Cham: Springer International Publishing. pp. 92–100. doi:10.1007/978-3-030-66981-2_8. ISBN 978-3-030-66981-2.

[:1-3] 1 2 3 4 Bashar, Md Abul; Nayak, Richi; Balasubramaniam, Thirunavukarasu (2022-07-25). "Deep learning based topic and sentiment analysis: COVID19 information seeking on social media". Social Network Analysis and Mining. 12 (1): 90. doi:10.1007/s13278-022-00917-5. ISSN 1869-5469. PMC 9312316 . PMID 35911483.

[4] Keller, Kevin Lane (1993). "Conceptualizing, Measuring, and Managing Customer-Based Brand Equity" . Journal of Marketing. 57 (1): 1–22. doi:10.1177/002224299305700101. ISSN 0022-2429.

[:6-5] 1 2 Fronzetti Colladon, Andrea (2018). "The Semantic Brand Score". Journal of Business Research. 88: 150–160. arXiv: 2105.05781 . doi:10.1016/j.jbusres.2018.03.026.

[6] Indraccolo, Ugo; Losavio, Ernesto; Carone, Mauro (2023). "Applying graph theory to improve the quality of scientific evidence from textual information: Neural injuries after gynaecologic pelvic surgery for genital prolapse and urinary incontinence" . Neurourology and Urodynamics. 42 (3): 669–679. doi:10.1002/nau.25133. ISSN 0733-2467. PMID 36648454.

[7] Kasia, Parys. "Polish Twitter on immigrants during the 2021 Belarus–European Union border crisis". www.linkedin.com. Retrieved 2024-04-03.

[:5-8] 1 2 Das, Sibanjan Debeeprasad; Bala, Pradip Kumar; Das, Sukanta (2024). "Exploiting User-Generated Content in Product Launch Videos to Compute a Launch Score". IEEE Access. 12: 49624–49639. Bibcode:2024IEEEA..1249624D. doi: 10.1109/ACCESS.2024.3381541 . ISSN 2169-3536.

[9] Perkins, Jacob; Fattohi, Faiz (2014). Python 3 text processing with NLTK 3 cookbook. Quick answers to common problems (2nd ed.). Birmingham: Packt Publishing Ltd. ISBN 978-1-78216-785-3.

[:2-10] 1 2 3 Bianchino, Antonella; Fusco, Daniela; Pisciottano, Daniele (2021-05-27). "How to Measure the Touristic Competitiveness: A Mixed Mode Model Proposal" (PDF). Athens Journal of Tourism. 8 (2): 131–146. doi:10.30958/ajt.8-2-4.

[:3-11] 1 2 3 Beccari, Nicholas; Nicola, Valerio (2019). Brand-generated and Usergenerated content videos on YouTube: characteristics, behavior and user perception (PDF). Milan, Italy: Politecnico di Milano.

[:7-12] 1 2 Mercurio, Simona (2024). "What About Corruption? A Text Analytics Method for a Scoping Literature Review". In Giordano, Giuseppe; Misuraca, Michelangelo (eds.). New Frontiers in Textual Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer. pp. 349–359. doi:10.1007/978-3-031-55917-4_28. ISBN 978-3-031-55916-7.

[13] Fronzetti Colladon, Andrea; Naldi, Maurizio (2020-05-22). "Distinctiveness centrality in social networks". PLOS ONE. 15 (5) e0233276. arXiv: 1912.03391 . Bibcode:2020PLoSO..1533276F. doi: 10.1371/journal.pone.0233276 . ISSN 1932-6203. PMC 7244137 . PMID 32442196.

[14] Bashar, Md Abul; Nayak, Richi; Knapman, Gareth; Turnbull, Paul; Fforde, Cressida (December 2023). "An Informed Neural Network for Discovering Historical Documentation Assisting the Repatriation of Indigenous Ancestral Human Remains". Social Science Computer Review. 41 (6): 2293–2317. arXiv: 2303.14475 . doi:10.1177/08944393231158788. ISSN 0894-4393.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]