Tech

The Air Force isn’t shying away from bad data

Taking the bad data with the good makes it easier to decide what to run machine-learning analysis on, according to CTO Frank Konieczny.

By Dave Nyczepir

May 22, 2019

Frank Konieczny speaks April 25, 2019, at the Security Through Innovation Summit presented by McAfee and produced by FedScoop and CyberScoop. (FedScoop)

Air Force analysts are taking a fresh approach to the service’s data by refusing to fix the “bad” material until after it has been presented to leadership, Chief Technology Officer Frank Konieczny says.

An example could be two related but differing datasets about the on-board hardware of aircraft, Konieczny says. The data could be bad for various reasons: It could be stored in an old or obsolete format or system. It could be missing fields or include incorrect information. It could be mismatched, repeat or include typos. Whatever the case, the Air Force is trying to ignore the urge to hide those flaws, he says.

“We want the seniors to actually see the bad data so that they yell at people to get it fixed,” Konieczny said at an Armed Forces Communications and Electronics Association event Tuesday. “Most people like to fix it and then show the seniors the results.”

The Air Force’s Office of Information Dominance is working to clean things up, but for now it has more intelligence, surveillance and reconnaissance data than it can process, Konieczny said. The service needs “a bunch of” fit-for-purpose clouds — ones outside the Joint Enterprise Data Infrastructure general-purpose cloud that will eventually support the entire Department of Defense — with graphic processing units to perform machine-learning faster, he said.

The goal is to avoid having a single data lake that could go stale quickly, Konieczny said, so his team is meta-tagging data where it resides.

“Because we realize we can’t move all the data in the Air Force into a central cloud,” he said. “It’s just impossible. We tried it initially.”

With meta-tagging, users can “dynamically” cherry-pick the “right” data they want while still presenting “wrong” data, Konieczny said.

“We have to do things better with the data faster and look at the bad data,” he said.

The Air Force isn’t shying away from bad data

More Like This

AWS secures $2.6B DHS-wide cloud project

VA clinical staff rushed to use generative AI without oversight, watchdog finds

Let’s build the national data ecosystem America needs

Top Stories

IRS IT department has shrunk 42% under Trump

OPM wants to leverage talent exchanges that would place feds in industry

Fraud-focused bills passed by House follow ‘DOGE playbook,’ privacy experts warn

FBI, DHS emphasize no-drone zones as World Cup kicks off

NIH contracting arm announces sunset of all governmentwide vehicles

More Scoops

Air Force develops new model for battle management to underpin requirements for ABMS

Air Force CIO: ‘We’re not waiting’ for JWCC

Google to create standalone public sector cloud services division

Air Force to bring top-secret data into the mix in next weapons system hackathon

Despite delay, experts not concerned by DOD’s JWCC cloud contract timeline

Air Force positions autonomous drones, networked weapon systems as top priorities

Transportation Command migrating applications to Air Force’s Cloud One

Latest Podcasts

Under Tech Force, OPM wants to send some feds for tours of duty in industry

Oracle wins OPM’s massive governmentwide HR modernization contract

CBP is installing new AI-powered surveillance towers at the southern border

OPM announces new Tech Force partners

Tech

Defense

Cyber

FedScoop TV