Inequality in the Utility of Data: Modeling, Assessment, and Implications Academic Article uri icon


  • ABSTRACT In this study, we discuss the inequality in the utility of data resources–the extent to which records in large datasets differ in their business-value contribution. We introduce analytical tools for modeling and quantifying inequality, and demonstrate their application for assessing inequality in a large data repository. We suggest that the distribution of utility and the magnitude of inequality have important implications for data management such as impacting the design and administration of the data resource, informing data acquisition and retention policies, and prioritizing quality improvement efforts.

publication date

  • January 1, 2008