Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures

Dey, Kuntal; Agrawal, Sahil; Malviya, Rahul; Kaushik, Saroj

Computer Science > Social and Information Networks

arXiv:1605.09338 (cs)

[Submitted on 30 May 2016 (v1), last revised 31 May 2016 (this version, v2)]

Title:Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures

Authors:Kuntal Dey, Sahil Agrawal, Rahul Malviya, Saroj Kaushik

View PDF

Abstract:This paper explores the social quality (goodness) of community structures formed across Twitter users, where social links within the structures are estimated based upon semantic properties of user-generated content (corpus). We examined the overlap of the community structures of the constructed graphs, and followership-based social communities, to find the social goodness of the links constructed. Unigram, bigram and LDA content models were empirically investigated for evaluation of effectiveness, as approximators of underlying social graphs, such that they maintain the {\it community} social property. Impact of content at varying granularities, for the purpose of predicting links while retaining the social community structures, was investigated. 100 discussion topics, spanning over 10 Twitter events, were used for experiments. The unigram language model performed the best, indicating strong similarity of word usage within deeply connected social communities. This observation agrees with the phenomenon of evolution of word usage behavior, that transform individuals belonging to the same community tending to choose the same words, made by Danescu et al. (2013), and raises a question on the literature that use, without validation, LDA for content-based social link prediction over other content models. Also, semantically finer-grained content was observed to be more effective compared to coarser-grained content.

Subjects:	Social and Information Networks (cs.SI)
Cite as:	arXiv:1605.09338 [cs.SI]
	(or arXiv:1605.09338v2 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.1605.09338

Submission history

From: Kuntal Dey [view email]
[v1] Mon, 30 May 2016 17:36:55 UTC (166 KB)
[v2] Tue, 31 May 2016 01:24:20 UTC (166 KB)

Computer Science > Social and Information Networks

Title:Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators