Tech

How one data-driven agency — the Census Bureau — found extra value in machine learning

The bureau is piloting ML that flags valuable information employees may not have even been searching for originally.

February 26, 2020

(Getty Images)

Like many agencies, the Census Bureau looks for reductions in expenses and workloads when it makes decisions about machine learning. But the agency has discovered another advantage in the technology: It can find data that employees never knew they needed.

More than 100 different surveys are handled by siloed programs within the Census Bureau, and the capture, instrumentation, processing and summation of the resulting data is “really hard to manage,” said Zachary Whitman, chief data officer, at an AFCEA Bethesda event Wednesday,

The bureau’s dissemination branch exports data in a consolidated system where discovery and preparation is “difficult” for employees, Whitman said. So the agency is piloting ML that flags valuable information employees may not have even been searching for originally.

“How do you get people to translate into information they might not know about but would be very valuable to them?” Whitman said. “That’s where a lot of our AI is coming into play, not only with our search services, but also with our user engagement.”

When users write to the bureau about one of its products — maybe they found the title of a table confusing — a feedback algorithm analyzes their comment. The algorithm classifies positive and negative feedback, who the author is, why they used the tool, whether the comment concerns a feature or a bug, and how their experience might be improved.

That information is then relayed upstream to inform the development of new, customer-driven applications.

When it comes to ML, the bureau continues to try and make the value of a dollar go farther, Whitman said. The process of onboarding data from systems that look and feel differently — and have been operating on their own for years — can only scale up if operations and maintenance costs continue to decline.

“Trying to converge [systems] into a consolidated data model is a lot of work — manual work — because they’ll deliver to a spec that will fail 10, 15, 20 times, and each time it will require someone to go in and help them debug to understand what it is about the XML [formatting] that is failing the data,” Whitman said. “That inefficiency is gross and something that we are desperate to move off of, because we can ultimately never scale to where we need to go.”

The alternative, he added, is killing and refreshing the systems — a process that is much more costly and time-consuming.

How one data-driven agency — the Census Bureau — found extra value in machine learning

More Like This

ICE pursuing privacy approvals related to controversial phone location data

House Modernization panel advances bill to improve CRS’s data access in first-ever markup

GSA administrator: Generative AI tools will be ‘a giant help’ for government services

Top Stories

State Department encouraging workers to use ChatGPT

Federal CIO calls on Congress to fund Technology Modernization Fund

Congressional panel outlines five guardrails for AI use in House

With 2023 tax season in the rearview, IRS commissioner eyes expansion of AI capabilities

CMS’s financial office is using LLM pilot to combat loss of institutional knowledge

Top public sector takeaways from Google Cloud Next 2024

Commerce requests information about AI, open data assets, data dissemination

More Scoops

How the U.S. Census Bureau leveraged cloud services to modernize security

Watchdog calls on Census Bureau to improve cyber incident detection and alerting

Census Bureau shares analytics insights, not data, in pilot with IRS

Census Bureau moving beyond surveys and censuses with cloud-based data ecosystem

Census Survey Explorer helping users find the data they need

Report: Census Bureau should set timeframes for protecting respondents’ data privacy

How agencies like Census Bureau are improving network visibility and operations

Latest Podcasts

How one data-driven agency — the Census Bureau — found extra value in machine learning

AI Week 2024 has Come and Gone

Darryl Peek on Elastic’s role in enhancing public sector search and data analytics with AI and Google collaboration

Danny Werfel on How Automation has Enhanced his Agency’s Operations.

Tech

Defense

Cyber

Acquisition