randomosity

strikingly random thoughts and 'maximum data existentialisation'

  • Research
    • Conference Papers
    • Datasets
      • 1871 Populations of Ontario
      • Breweries and Distilleries in Ontario, 1914–15
      • Canadian Federal Railway Charters
      • 1871 Tavernkeepers in Huron County
    • Maps
      • 1891 Ontario Census Divisions
      • Admissions from Gaols to Hamilton Asylum
      • Asylums in New Zealand, 1900
      • Asylums in Scotland, 1797–1897
      • Asylums in the Australian Colonies, 1860
      • Asylums in Western Canada, 1911
      • Asylums of England and Wales, 1765–1845
      • Asylums of England and Wales, 1845–1860
      • Asylums of Ireland, 1814–1869
      • Discharge Rate from Hamilton Asylum
      • Duration of Stay for First Admissions to Hamilton Asylum
      • First Admissions to Hamilton Asylum by County
      • Rate of Readmission to Hamilton Asylum
      • Study Context
      • 1841 Settlers Map of Ontario
      • 1851 Essex County by Religion Stated in Census
      • 1848 Circulation Map of Paris
      • Modern Circulation Map of Paris
      • Irish and Indian-Trained Psychiatrists in Canada
      • Asylums in the United States, 1850
    • Other Research Stuff
      • Sir Frank Smith
    • Visual Support Materials
      • 1851 — 1911 Essex County Census District Evolution
      • Guelph Historical GIS
      • Occupational Comparison 1867–2007
      • Pajek Apple Taxonomy
      • Napoleonic Timeline
      • 1878 Guelph Mass Model
  • Gallery
  • Archives
  • About
    • Contact Me
    • Contact Me
    • Curriculum Vitae
    • Ligit Results
    • Movies
    • Stuff
    • Stats
    • Collophon
    • Delicious Tags

Data Source Handbook: A Guide to Public Data

Posted by shawnday on 1 March 2011
Posted in: Social Network Analysis, Text Analysis, Visualization. Tagged: Review. Leave a Comment

data source handbookThe Data Source Hand­book by Pete Warden provides a con­cise and handy guide to some of the main sources of pub­lic data access­ible on the web today. It’s a very short book of 40 pages. This in itself does not stand against the book. These sources are rap­idly chan­ging and com­pil­ing and com­mit­ting an exhaust­ive sur­vey to a prin­ted volume would damn it to almost instant obsol­es­cence. It would also pre­vent any treat­ment of indi­vidual data­sources in any use­ful detail.

 

As it is, Warden is able to pick a select few and identify strengths and avail­able APIs in a use­ful fashion. He organ­ises the type of sources into logical cat­egor­ies and iden­ti­fies some key sources for each:

  • Web­sites
  • People
  • Search terms
  • Loc­a­tions
  • Com­pan­ies
  • IP Addresses
  • Books, films, movies, music and products

He selects the key open pro­viders of data in these areas and sys­tem­at­ic­ally shows how to access the inform­a­tion along with simple pro­gram­matic instruc­tions. In a volume of such lim­ited length you would not expect to find extens­ive instruc­tions or dis­cus­sion — and you won’t. What you have is a very con­cise sur­vey identi­fy­ing the key play­ers and giv­ing a nut­shell indic­a­tion of what you can use the data­sources for.
This is a use­ful and quick ref­er­ence for any­one routinely access­ing, com­pil­ing, aggreg­at­ing or aug­ment­ing their own data­sets. Although very few of the sources iden­ti­fied would be new to most people in the data ana­lysis space, this does provide a use­ful com­pil­a­tion and also handy con­cise reminder of how one might aug­ment a lim­ited data­set quickly in an auto­mated fash­ion.
This is an eas­ily access­ible volume, well organ­ized and with the only major fail­ing that it will be become dated in a pub­lished form. How­ever, as an eBook it is ideal and I would recom­mend it to any­one new to the area of adata visu­al­isa­tion look­ing for some great sample data to access, or to the more seasoned data trav­el­ler look­ing to keep their famili­ar­ity with the wide vari­ety of avail­able data current.

I review for the O'Reilly Blogger Review Program

Share this:

  • Print
  • LinkedIn
  • Twit­ter
  • Google +1
  • Tumblr

Posts navigation

← Mining the Social Web by Matthew A Russell
A Case for the iPad →
Logging In...
Cancel Reply
  • about.me

    Shawn Day

    Shawn Day

    Shawn Day is an entrepreneur, digital historian, economist and blender of the aesthetic and the informative. Raised in Canada, Shawn now works with the Digital Humanities Observatory, a project of the Royal Irish Academy, to leverage Ireland's participation in the emerging practise of digital humanities scholarship. He lectures in Social Computing and the Philosophy of Technology.

    His own research explores the social and economic circumstances of the nineteenth century retail liquor trade and it's impact on family. He applies digital, spatial and social network analysis to the study of the relationships between credit, respectability, and order in the Victorian community. Recent articles have examined the social dimensions of the Victorian public mental hospital using GIS and statistical modeling tools. Shawn has been involved in a number of successful and innovative digital humanities projects throughout Canada. Most recently he has worked with large manuscript census databases in the 1871/1891 census project (University of Guelph). He is a team member of the national TAPoR text analysis portal project, the Canadian Network for Economic History and the Network for Canadian History and the Environment (NiCHE - UWO).

    Shawn has blended his background in management economics with an entrepreneurial ethos to found a number of successful software development ventures in Canada and find a means to leverage this in the academic arena.

  • Twitter Updates

    • RT @DiggingIntoData: And, we're back! Round 3 of the int'l Digging into Data Challenge launches today w/ TEN research sponsors http://t. ... 8 hours ago
    • stallman reminds - Amazon recalls (and embodies) Orwell's '1984' news.cnet.com/8301-13860_3-1… via @CNET 1 day ago
    • Well spotted - thoughtful: “@kcor1964: Why innovation is so hard to achieve management.fortune.cnn.com/2013/01/16/why…” 1 day ago
    • RT @adriansalmon: "There once was a curate from Kew Who kept a small cat in a pew. He taught it to speak Alphabetical Greek, But it neve ... 1 day ago
    • RT @rcahms: Telling Scotland's Story: download the new @ScARFHub booklet & uncover stories from the past bit.ly/VqaCWh 1 day ago
  • Flickr

    			shawnday posted a photo:	OUA Semi-Final FencingOUA Semi-Final Fencing			shawnday posted a photo:	OUA Semi-Final FencingOUA Semi-Final Fencing			shawnday posted a photo:	OUA Semi-Final FencingOUA Semi-Final Fencing			shawnday posted a photo:	OUA Semi-Final FencingOUA Semi-Final Fencing			shawnday posted a photo:	OUA Semi-Final FencingOUA Semi-Final Fencing
    Used tag: mcmaster
  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

  • Pages

    • About
      • Collophon
      • Contact Me
      • Contact Me
      • Curriculum Vitae
      • Delicious Tags
      • Ligit Results
      • Movies
      • Stats
      • Stuff
    • Archives
    • Gallery
    • Research
      • Conference Papers
      • Datasets
        • 1871 Populations of Ontario
        • 1871 Tavernkeepers in Huron County
        • Breweries and Distilleries in Ontario, 1914–15
        • Canadian Federal Railway Charters
      • Maps
        • 1841 Settlers Map of Ontario
        • 1848 Circulation Map of Paris
        • 1851 Essex County by Religion Stated in Census
        • 1891 Ontario Census Divisions
        • Admissions from Gaols to Hamilton Asylum
        • Asylums in New Zealand, 1900
        • Asylums in Scotland, 1797–1897
        • Asylums in the Australian Colonies, 1860
        • Asylums in the United States, 1850
        • Asylums in Western Canada, 1911
        • Asylums of England and Wales, 1765–1845
        • Asylums of England and Wales, 1845–1860
        • Asylums of Ireland, 1814–1869
        • Discharge Rate from Hamilton Asylum
        • Duration of Stay for First Admissions to Hamilton Asylum
        • First Admissions to Hamilton Asylum by County
        • Irish and Indian-Trained Psychiatrists in Canada
        • Modern Circulation Map of Paris
        • Rate of Readmission to Hamilton Asylum
        • Study Context
      • Other Research Stuff
        • Sir Frank Smith
      • Visual Support Materials
        • 1851 — 1911 Essex County Census District Evolution
        • 1878 Guelph Mass Model
        • Guelph Historical GIS
        • Napoleonic Timeline
        • Occupational Comparison 1867–2007
        • Pajek Apple Taxonomy
Proudly powered by WordPress Theme: Parament by Automattic.