Inverse network sampling to explore online brand allegiance

Peter Grindrod, Desmond J. Higham, Peter Laflin, Amanda Otley, Jonathan A. Ward

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Within the online media universe there are many underlying communities. These may be defined, for example, through politics, location, health, occupation, extracurricular interests or retail habits. Government departments, charities and commercial organisations can benefit greatly from insights about the structure of these communities; the move to customer-centered practices requires knowledge of the customer base. Motivated by this issue, we address the fundamental question of whether a subnetwork looks like a collection of individuals who have effectively been picked at random from the whole, or instead forms a distinctive community with a new, discernible structure. In the former case, to spread a message to the intended user base it may be best to use traditional broadcast media (TV, billboard), whereas in the latter case a more targeted approach could be more effective. In this work, we therefore formalize a concept of testing for substructure and apply it to social interaction data. First, we develop a statistical test to determine whether a given subnetwork (induced subgraph) is likely to have been generated by sampling nodes from the full network uniformly at random. This tackles an interesting inverse alternative to the more widely studied “forward” problem. We then apply the test to a Twitter reciprocated mentions network where a range of brand name based subnetworks are created via tweet content. We correlate the computed results against the independent views of sixteen digital marketing professionals. We conclude that there is great potential for social media based analytics to quantify, compare and interpret on-line brand allegiances systematically, in real time and at large scale.
LanguageEnglish
Number of pages13
JournalEuropean Journal of Applied Mathematics
Early online date23 Feb 2016
DOIs
Publication statusE-pub ahead of print - 23 Feb 2016

Fingerprint

Statistical tests
Marketing
Health
Sampling
Testing
Customers
Forward Problem
Social Media
Social Interaction
Induced Subgraph
Substructure
Statistical test
Correlate
Broadcast
Quantify
Likely
Alternatives
Vertex of a graph
Range of data
Community

Keywords

  • generating function
  • twitter
  • statistics
  • sampling
  • random graph
  • p-value
  • networks
  • mentions

Cite this

Grindrod, Peter ; Higham, Desmond J. ; Laflin, Peter ; Otley, Amanda ; Ward, Jonathan A. / Inverse network sampling to explore online brand allegiance. In: European Journal of Applied Mathematics. 2016.
@article{06c4148613e242458ad33ae55ca6f645,
title = "Inverse network sampling to explore online brand allegiance",
abstract = "Within the online media universe there are many underlying communities. These may be defined, for example, through politics, location, health, occupation, extracurricular interests or retail habits. Government departments, charities and commercial organisations can benefit greatly from insights about the structure of these communities; the move to customer-centered practices requires knowledge of the customer base. Motivated by this issue, we address the fundamental question of whether a subnetwork looks like a collection of individuals who have effectively been picked at random from the whole, or instead forms a distinctive community with a new, discernible structure. In the former case, to spread a message to the intended user base it may be best to use traditional broadcast media (TV, billboard), whereas in the latter case a more targeted approach could be more effective. In this work, we therefore formalize a concept of testing for substructure and apply it to social interaction data. First, we develop a statistical test to determine whether a given subnetwork (induced subgraph) is likely to have been generated by sampling nodes from the full network uniformly at random. This tackles an interesting inverse alternative to the more widely studied “forward” problem. We then apply the test to a Twitter reciprocated mentions network where a range of brand name based subnetworks are created via tweet content. We correlate the computed results against the independent views of sixteen digital marketing professionals. We conclude that there is great potential for social media based analytics to quantify, compare and interpret on-line brand allegiances systematically, in real time and at large scale.",
keywords = "generating function, twitter, statistics, sampling, random graph, p-value, networks, mentions",
author = "Peter Grindrod and Higham, {Desmond J.} and Peter Laflin and Amanda Otley and Ward, {Jonathan A.}",
year = "2016",
month = "2",
day = "23",
doi = "10.1017/S0956792516000085",
language = "English",
journal = "European Journal of Applied Mathematics",
issn = "0956-7925",

}

Inverse network sampling to explore online brand allegiance. / Grindrod, Peter; Higham, Desmond J.; Laflin, Peter; Otley, Amanda; Ward, Jonathan A.

In: European Journal of Applied Mathematics, 23.02.2016.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Inverse network sampling to explore online brand allegiance

AU - Grindrod, Peter

AU - Higham, Desmond J.

AU - Laflin, Peter

AU - Otley, Amanda

AU - Ward, Jonathan A.

PY - 2016/2/23

Y1 - 2016/2/23

N2 - Within the online media universe there are many underlying communities. These may be defined, for example, through politics, location, health, occupation, extracurricular interests or retail habits. Government departments, charities and commercial organisations can benefit greatly from insights about the structure of these communities; the move to customer-centered practices requires knowledge of the customer base. Motivated by this issue, we address the fundamental question of whether a subnetwork looks like a collection of individuals who have effectively been picked at random from the whole, or instead forms a distinctive community with a new, discernible structure. In the former case, to spread a message to the intended user base it may be best to use traditional broadcast media (TV, billboard), whereas in the latter case a more targeted approach could be more effective. In this work, we therefore formalize a concept of testing for substructure and apply it to social interaction data. First, we develop a statistical test to determine whether a given subnetwork (induced subgraph) is likely to have been generated by sampling nodes from the full network uniformly at random. This tackles an interesting inverse alternative to the more widely studied “forward” problem. We then apply the test to a Twitter reciprocated mentions network where a range of brand name based subnetworks are created via tweet content. We correlate the computed results against the independent views of sixteen digital marketing professionals. We conclude that there is great potential for social media based analytics to quantify, compare and interpret on-line brand allegiances systematically, in real time and at large scale.

AB - Within the online media universe there are many underlying communities. These may be defined, for example, through politics, location, health, occupation, extracurricular interests or retail habits. Government departments, charities and commercial organisations can benefit greatly from insights about the structure of these communities; the move to customer-centered practices requires knowledge of the customer base. Motivated by this issue, we address the fundamental question of whether a subnetwork looks like a collection of individuals who have effectively been picked at random from the whole, or instead forms a distinctive community with a new, discernible structure. In the former case, to spread a message to the intended user base it may be best to use traditional broadcast media (TV, billboard), whereas in the latter case a more targeted approach could be more effective. In this work, we therefore formalize a concept of testing for substructure and apply it to social interaction data. First, we develop a statistical test to determine whether a given subnetwork (induced subgraph) is likely to have been generated by sampling nodes from the full network uniformly at random. This tackles an interesting inverse alternative to the more widely studied “forward” problem. We then apply the test to a Twitter reciprocated mentions network where a range of brand name based subnetworks are created via tweet content. We correlate the computed results against the independent views of sixteen digital marketing professionals. We conclude that there is great potential for social media based analytics to quantify, compare and interpret on-line brand allegiances systematically, in real time and at large scale.

KW - generating function

KW - twitter

KW - statistics

KW - sampling

KW - random graph

KW - p-value

KW - networks

KW - mentions

UR - http://journals.cambridge.org/action/displayAbstract?fromPage=online&aid=10197682&fulltextType=RA&fileId=S0956792516000085

U2 - 10.1017/S0956792516000085

DO - 10.1017/S0956792516000085

M3 - Article

JO - European Journal of Applied Mathematics

T2 - European Journal of Applied Mathematics

JF - European Journal of Applied Mathematics

SN - 0956-7925

ER -