Graduation Year
2018
Document Type
Thesis
Degree
M.S.C.S.
Degree Name
MS in Computer Science (M.S.C.S.)
Degree Granting Department
Computer Science and Engineering
Major Professor
Adriana Iaminitchi, Ph.D.
Committee Member
Yicheng Tu, Ph.D.
Committee Member
John Skvoretz, Ph.D.
Keywords
binary node attribute networks, dk graph models, graph generators, social networks
Abstract
Real graph datasets are important for many science domains, from understanding epidemics to modeling traffic congestion. To facilitate access to realistic graph datasets, researchers proposed various graph generators typically aimed at representing particular graph properties. While many such graph generators exist, there are few techniques for generating graphs where the nodes have binary attributes. Moreover, generating such graphs in which the distribution of the node attributes preserves real-world characteristics is still an open challenge. This thesis introduces Borromean, a graph generating algorithm that creates synthetic graphs with binary node attributes in which the attributes obey an attribute-specific joint degree distribution. We show experimentally the accuracy of the generated graphs in terms of graph size, distribution of attributes, and distance from the original joint degree distribution. We also designed a parallel version of Borromean in order to generate larger graphs and show its performance. Our experiments show that Borromean can generate graphs of hundreds of thousands of nodes in under 30 minutes, and these graphs preserve the distribution of binary node attributes within 40% on average.
Scholar Commons Citation
Gandy, Clayton A., "Borromean: Preserving Binary Node Attribute Distributions in Large Graph Generations" (2018). USF Tampa Graduate Theses and Dissertations.
https://digitalcommons.usf.edu/etd/7293