De-anonymising public data
Jinfo Blog
9th February 2010
By Anne Jordan
‘De-anonymise’ does not yet appear in the Oxford English Dictionary, but it is a word the editors of that venerable authority on the English language may want to consider adding. I recently came across it in an article about how criminals could misuse public data to uncover personal-level data, a risk that could limit the roll-out of recent government data initiatives.

Criminal activity on the web is nothing new. Last weekend the website of Tata Consultancy Services, one of India’s largest software and services companies, was hacked and a ‘For Sale’ message posted. Google and Twitter have also suffered breaches in the past. Whether carried out as a joke, for political ends, or for criminal purposes, these events can be embarrassing and potentially commercially damaging.

The article reports another risk for website owners to guard against, particularly owners of sites hosting public data sets, such as the UK local and national data initiatives, the Greater London Authority’s Datastore and the UK government’s data.gov.uk. These have been launched since the New Year and were welcomed in LiveWire postings by myself and Michele Bate at http://digbig.com/5bbbmn and http://digbig.com/5bbbmq.

The article in The Guardian (http://digbig.com/5bbbnr) looks at how statistical "de-anonymisation" techniques might limit the roll-out of such public data initiatives. Computer scientists in the US have discovered ways to "re-identify" the names of people included in supposedly anonymous datasets. The example cited involves a movie rental company, but there are more serious implications. The discovery that lists can be "de-anonymised" needs to be included in the debate about how information is released and where to draw the line.

Dr Ian Brown, of the Oxford Internet Institute, believes the discovery raises concerns about initiatives such as data.gov.uk. He says: "They are looking at releasing crime reports down to street level. You have to think about how people might be able to link that back to individuals."