My projects and collaborations have produced several kinds of original data sets and archives. I strongly believe in sharing data, references, and ideas as soon as they might be of value, even though some would say there’s a risk of being plagiarized.

All original data here have a Creative Commons Attribution-Non Commercial-Share Alike 4.0 Unported license, and are designed to be shared and remixed.  Please let me know if you publish your analysis of the data so that I can learn from your observations.  If you expand or extend your data in some way, please let me know and I’ll help disseminate your work.  Please remember to acknowledge some of the core work of the original contributors when you present your new work.   

First, there are original archives and collections of literature and policy data:

Second, there are replication data sets for articles I have published:

Third, there is the official ICPSR filing of “ICT Diffusion and Distribution Data Set, 1990-2007“.  This is the collection of our work computing gini coefficients for technology access around the world.

Fourth, there are the event history and comparative communication data sets that I have been working on:

Finally, there is the data behind some specific figures and charts.  These are often not raw data sets but the aggregated data that has to be massaged into place to actually make an interpretable figure.  These are not all the figures, just the ones people have requested for reproduction purposes: