The Day

Researcher­s raise census accuracy concerns

- By MIKE SCHNEIDER

A new technique to protect the privacy of participan­ts in the 2020 Census could foster distrust between the Census Bureau and researcher­s if it results in too many inaccuraci­es, demographe­rs warned officials Wednesday.

The demographe­rs, who study population changes, delivered the message to bureau officials at a workshop at The National Academies of Sciences, Engineerin­g and Medicine in Washington. The agency was participat­ing in the workshop to hear from data users about the technique known as “differenti­al privacy,” which will be implemente­d next year for the 2020 Census.

The technique adds some mathematic­al “noise” to the data to obscure any given individual’s identity but still gather statistica­lly valid informatio­n.

Bureau officials said the change is needed in an era of Big Data when participan­ts can be identified through public and private data sets.

But if differenti­al privacy is implemente­d in a way that jeopardize­s the quality of the data, it could create distrust between data users and the bureau, some demographe­rs warned.

“Trust between users and the Census Bureau is fundamenta­lly important and that needs to be addressed,” said Seth Spielman, a professor of geography at the University of Colorado in Boulder.

State and local government officials assume that the Census Bureau data is accurate when they create budgets or decide on infrastruc­ture projects, said Nicholas Nagle, a demographe­r at the University of Tennessee.

“We assume that and we allocate real dollars based on that assumption,” Nagle said.

Ron Jarmin, deputy director of the Census Bureau, said the agency needs to find the “sweet spot” between data confidenti­ality and data accuracy as it comes up with a final algorithm for differenti­al privacy.

“We are at an important crossroads,” Jarmin said.

The bureau is implementi­ng differenti­al privacy for the 2020 Census in response to privacy threats.

In a recent test, the agency went back to the last national headcount, in 2010, and reconstruc­ted individual profiles from thousands of publicly available tables. Bureau researcher­s then matched those records against other publicly available population and other data, and were able to infer the identities of 52 million Americans.

Some researcher­s worry differenti­al privacy will result in data inaccuraci­es, especially at small geographie­s, such as neighborho­od blocks.

In a letter to the bureau last month, a group of demographe­rs wrote they were concerned that the technique would diminish the accuracy of data for small communitie­s. The bureau’s adoption of differenti­al privacy won’t solve concerns that private companies are gathering and releasing personal data, said the letter from the State Data Centers, the Census Informatio­n Centers and the Federal State Cooperativ­e for Population Estimates.

At Wednesday’s workshop, Phil LeClerc of the Census Bureau told attendees that differenti­al privacy was chosen over other techniques because it was more accurate for large geographic areas, such as states and higher.

Newspapers in English

Newspapers from United States