Title: | Simple Models and Classification in Networked Data |
Authors: | Macskassy, Sofus Provost, Foster |
Issue Date: | 2004 |
Publisher: | Stern School of Business, New York University |
Series/Report no.: | CeDER-04-03 |
Abstract: | When entities are linked by explicit relations, classification methods that take advantage of the network can perform substantially better than methods that ignore the network. This paper argues that studies of relational classification in networked data should include simple network-only methods as baselines for comparison, in addition to the non-relational baselines that generally are used. In particular, comparing more complex algorithms with algorithms that only consider the network (and not the features of the entities) allows one to factor out the contribution of the network structure itself to the predictive power of the model. We examine several simple methods for network-only classification on previously used relational data sets, and show that they can perform remarkably well. The results demonstrate that the inclusion of network-only classifiers can shed new light on studies of relational learners. |
URI: | http://hdl.handle.net/2451/14117 |
Appears in Collections: | CeDER Working Papers IOMS: Information Systems Working Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
CeDER-04-03.pdf | 98.37 kB | Adobe PDF | View/Open |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.