Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks

Nguyen, Hung T.; Thai, My T.; Dinh, Thang N.

Computer Science > Social and Information Networks

arXiv:1605.07990 (cs)

[Submitted on 25 May 2016 (v1), last revised 22 Feb 2017 (this version, v3)]

Title:Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks

Authors:Hung T. Nguyen, My T. Thai, Thang N. Dinh

View PDF

Abstract:Influence Maximization (IM), that seeks a small set of key users who spread the influence widely into the network, is a core problem in multiple domains. It finds applications in viral marketing, epidemic control, and assessing cascading failures within complex systems. Despite the huge amount of effort, IM in billion-scale networks such as Facebook, Twitter, and World Wide Web has not been satisfactorily solved. Even the state-of-the-art methods such as TIM+ and IMM may take days on those networks.
In this paper, we propose SSA and D-SSA, two novel sampling frameworks for IM-based viral marketing problems. SSA and D-SSA are up to 1200 times faster than the SIGMOD'15 best method, IMM, while providing the same $(1-1/e-\epsilon)$ approximation guarantee. Underlying our frameworks is an innovative Stop-and-Stare strategy in which they stop at exponential check points to verify (stare) if there is adequate statistical evidence on the solution quality. Theoretically, we prove that SSA and D-SSA are the first approximation algorithms that use (asymptotically) minimum numbers of samples, meeting strict theoretical thresholds characterized for IM. The absolute superiority of SSA and D-SSA are confirmed through extensive experiments on real network data for IM and another topic-aware viral marketing problem, named TVM. The source code is available at this https URL

Comments:	Correct the errors in the proofs for SSA/D-SSA. Update D-SSA to estimate ε(s) instead of δ(s)
Subjects:	Social and Information Networks (cs.SI); Data Structures and Algorithms (cs.DS); Physics and Society (physics.soc-ph)
Cite as:	arXiv:1605.07990 [cs.SI]
	(or arXiv:1605.07990v3 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.1605.07990

Submission history

From: Thang N. Dinh [view email]
[v1] Wed, 25 May 2016 18:15:01 UTC (2,648 KB)
[v2] Wed, 7 Sep 2016 14:40:39 UTC (1,209 KB)
[v3] Wed, 22 Feb 2017 05:15:27 UTC (1,367 KB)

Computer Science > Social and Information Networks

Title:Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators