The Length of Bridge Ties: Structural and Geographic Properties of Online So...Yuto Yamaguchi
?
The document explores the relationship between structural properties of online social ties and spatial distance, indicating that individuals closer in proximity are more likely to establish social connections. Results reveal that social links within the core span shorter distances compared to those in the periphery, and interaction levels are higher among closely connected users. The study highlights how spatial constraints impact network structure and social bonds, particularly emphasizing the roles of tie strength and network positioning.
SocNL: Bayesian Label Propagation with ConfidenceYuto Yamaguchi
?
This document proposes a new algorithm called SocNL for node classification in graphs. SocNL incorporates classification confidence by assigning confidence scores when labeling nodes based on their connections. It assumes that connected nodes will share labels, and nodes with many labeled neighbors can be labeled confidently. SocNL is proven to converge, and is theoretically equivalent to linear programming or Bayesian inference approaches. Empirical tests on three real-world networks show SocNL achieves higher overall accuracy than competing algorithms.
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationYuto Yamaguchi
?
This document presents OMNI-Prop, a new algorithm for node classification on graphs that can handle arbitrary label correlation types. OMNI-Prop calculates variables representing the likelihood of node labels and propagates these values to achieve classification. It runs in linear time per iteration and converges on any graph. Experimental results show OMNI-Prop outperforms other methods on various datasets.
Towards Social User Profiling: Unified and Discriminative Influence Model for...Yuto Yamaguchi
?
The paper presents a unified discriminative influence model (UDI) to infer users' home locations on social networks, specifically addressing challenges of scarce and noisy signals in location data. Using Twitter data, the authors propose two location profiling methods—local and global prediction strategies—and demonstrate that their approach can correctly place 66% of users within a 100-mile error distance. Extensive experiments indicate that the UDI significantly outperforms existing methods for location inference.
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Yuto Yamaguchi
?
The document presents a new online and incremental method for inferring user home locations from social media posts. It exploits spatiotemporal correlations in social streams by extracting local words that are correlated to locations over specific time periods. The proposed Online Location Inference Method (OLIM) divides the map into regions, calculates population distributions, and updates local word distributions and user location distributions incrementally as new posts arrive. An evaluation on a Twitter dataset shows it achieves better accuracy than existing batch methods and has lower computational cost per update.
This document proposes a new tensor decomposition model to handle cases where tensor indices are partially missing. The model treats indices as latent variables and uses a variational MAP-EM algorithm to infer missing indices and learn tensor decomposition parameters. Experiments on synthetic and real-world Twitter data show the model performs well when the number of samples is large or the missing ratio is not very high, outperforming alternative approaches that cannot handle missing indices.
When Does Label Propagation Fail? A View from a Network Generative ModelYuto Yamaguchi
?
The document proposes a Partially Labeled Stochastic Block Model (PLSBM) as a probabilistic generative model for networks with some labeled nodes. It proves the relationship between label propagation (LP) and the Stochastic Block Model (SBM) through PLSBM. Specifically, it shows that the solution to LP is identical to maximum likelihood estimation under PLSBM when the label ratio is uniform, edge probabilities are uniform within and between labels, and the network is assortative. When these conditions do not hold, LP is shown to fail both theoretically and experimentally.
1. The document analyzes patterns in interactive tagging networks on Twitter, contrasting them with traditional resource tagging networks and examining reciprocity.
2. It finds that interactive tagging networks on Twitter consist of broad folksonomies with many users categorizing others, unlike narrow folksonomies where users primarily tag their own resources.
3. Larger multiplicity of tags between users and friendship-related tags are more likely to be reciprocated in interactive tagging networks.