Organisers: Chris Skinner (London School of Economics) and David Hand (Imperial College)
Recent decades have seen data developments of huge potential analytic value to social science research, especially as a result of the movement away from paper to electronic records and the rapid spread of the internet and related communications technologies. Examples arise in many fields, including education, health and tax records; customer transactions and commercial databases; and digital data generated by mobile telephones, remote sensing devices and social networks. For the social science research opportunities afforded by such data sources to be realized, however, at least two challenges need to be addressed.
First, there is a need to be able to link different data sources so that the same entities - individuals, organisations etc. - can be identified for analytic purposes and so that any damage to analysis as a result of duplicate records or other issues of data quality can be avoided. Second, there is a need to be able to protect privacy and confidentiality, by devising ways of making data available to researchers whereby identification of individuals or other kinds of disclosure is prevented. These two topics are connected since methods of record linkage represent a threat as well as an opportunity, in the sense that the potential linkage of research data records to known individuals by mischievous data users represents a threat to confidentiality.
A mathematical framework for methods of record linkage was established by Fellegi and Sunter in 1969, formalizing earlier work by Newcombe. This framework has been further developed in the statistical literature and, in the last decade or more, in the computer science literature, where the method has also been called data matching, data linkage, entity resolution or object identification. Much recent work in the area has focussed on the practical implementation of these methods in different domains. Such applied work has generated a variety of challenges at a more mathematical level, for example how to link across multiple (rather than pairs of) data sources, possibly at different levels, such as persons and families, or across time via multigenerational genealogies/pedigrees. Other existing challenges include the estimation of patterns of linkage error and the development of methods for the analysis of linked data which take account of linkage error.
Methodology for data anonymisation has developed over a similar time scale. Methods vary according to the environments in which the data is accessed by the researcher, for example through visiting a data enclave, or connecting remotely, or by more open access to data files. Challenges may be formulated as constrained optimisation problems, where measures of the analytic utility of the data are maximized across alternative forms of data protection, subject to measures of disclosure risk meeting specified conditions. More recent developments include the use of simulation and imputation methods to create synthetic datasets and the use differential privacy ideas to measure disclosure risk and to evaluate alternative disclosure control methods.
The first and principal objective of this meeting is to identify key areas of mathematical sciences research on data linkage and anonymisation that can be of value to social and economic research using large and complex datasets. Presentations will be given by those engaged in social and economic research regarding what they see as challenges for research on data linkage and anonymisation and by those engaged more in mathematical sciences research in these areas, regarding what they see as promising lines for development. The general aim will be to find common ground between the mathematical and the social sciences.
Participation is by invitation only and will include: those engaged in social and economic research with awareness and experience of the challenges represented by data linkage and/or anonymisation; those with expertise at an international level in the development of methodology in these areas in the mathematical sciences, including statistics and computer science; individuals with a good knowledge of the relevant parts of the mathematical sciences community in the UK. Social and economic research will include both academic social sciences research and research in the private sector making use of large data sources.
Participation is by invitation only and will include:
Social and economic research will include both academic social sciences research and research in the private sector making use of large data sources.
|Wednesday 16th January 2013|
|09:00 to 09:45||No Room Required|
|09:45 to 10:00||
David Hand Imperial College London
|10:00 to 10:30||
Chris Skinner London School of Economics
|10:30 to 10:45||Room 2|
|10:45 to 11:00||Room 2|
|11:00 to 11:30||No Room Required|
|11:30 to 12:15||
Clive Humby dunnhumby UK
|12:15 to 13:00||
Peter Elias University of Warwick
|13:00 to 14:00||No Room Required|
|14:00 to 15:30||
Christopher Dibben University of St Andrews
|16:00 to 17:30||Room 2|
|Thursday 17th January 2013|
|09:45 to 10:30||Room 2|
|10:30 to 11:00||No Room Required|
|11:00 to 11:30||Room 2|
|11:30 to 12:30||Room 2|
|12:30 to 13:30||No Room Required|
|13:30 to 14:15||
Bill Winkler U.S. Census Bureau
|14:15 to 15:00||
Peter Christen Australian National University
|15:00 to 15:30||No Room Required|
|15:30 to 17:00||
Natalie Shlomo University of Manchester
|19:30 to 22:00||No Room Required|
Subscribe for the latest updates on events and news
INI is a creative collaborative space which is occupied by up to fifty-five mathematical scientists at any one time (and many more when there is a workshop). Some of them may not have met before and others may not realise the relevance of other research to their own work.
INI is especially important as a forum where early-career researchers meet senior colleagues and form networks that last a lifetime.
Here you can learn about all activities past, present and future, watch live seminars and submit your own proposals for research programmes.
Within this section of the website you should find all the information required to arrange and plan your visit to the Institute. If you have any further questions, or are unable to find the information you require, please get in touch with the relevant staff member or our Reception team via our contact pages.
INI and its programme participants produce a range of publications to communicate information about activities and events, publish research outcomes, and document case studies which are written for a non-technical audience. You will find access to them all in this section.
The Isaac Newton Institute aims to maximise the benefit of its scientific programmes to the UK mathematical science community in a variety of ways.
Whether spreading research opportunities through its network of correspondents, offering summer schools to early career researchers, or hosting public-facing lectures through events such as the Cambridge Festival, there is always a great deal of activity to catch up on.
Find out about all of these endeavours in this section of the site.
There are various ways to keep up-to-date with current events and happenings at the Isaac Newton Institute. As detailed via the menu links within this section, our output covers social media streams, news articles, a regular podcast series, an online newsletter, and more detailed documents produced throughout the year.
“A world famous place for research in the mathematical sciences with a reputation for efficient management and a warm welcome for visitors”
The Isaac Newton Institute is a national and international visitor research institute. It runs research programmes on selected themes in mathematics and the mathematical sciences with applications over a wide range of science and technology. It attracts leading mathematical scientists from the UK and overseas to interact in research over an extended period.
INI has a vital national role, building on many strengths that already exist in UK universities, aiming to generate a new vitality through stimulating and nurturing research throughout the country.During each scientific programme new collaborations are made and ideas and expertise are exchanged and catalysed through lectures, seminars and informal interaction, which the INI building has been designed specifically to encourage.
For INI’s knowledge exchange arm, please see the Newton Gateway to Mathematics.
The Institute depends upon donations, as well as research grants, to support the world class research undertaken by participants in its programmes.
Fundraising activities are supported by a Development Board comprising leading figures in academia, industry and commerce.
Visit this section to learn more about how you could play a part in supporting INI’s groundbreaking research.
In this section you can find contact information, staff lists, maps and details of how to find INI’s main building in Cambridge.
Our administrative staff can help you with any queries regarding a prospective or planned visit. If you would like to discuss a proposed a research programme or other event, our senior management team will be happy to help.