What’s happening with the EEO-1 data?

This was originally written on 13 May 2020.

I get asked this question enough that it seemed worth putting the answer in one place.

I work with the Equal Employment Opportunity Commission's EEO-1 establishment-survey data, which surveys large private employers about the racial and sexual composition of their workforces. Historically, researchers gained access to these data via the Intergovernmental Personnel Agreement. This is the law that governs secondment between federal agencies and other employers. Say that you work for the California Parks Service and are going to spend a year in Washington at the Department of the Interior. California is still paying your salary but you get treated as a federal employee for other purposes; this is the type of secondment the IPA handles. In our case, researchers were treated as Commission employees at a salary of $0 per year. We could access the data, and we could be punished to the full extent of the law if we were responsible for a breach.

As the federal government has tightened up its data-security policies, this access method moved out of compliance with the standards. In February 2018 the EEOC's Chief Data Officer announced a moratorium on new IPA agreements while the Commission put a compliant data-access regime in place. Those of us with existing IPAs have been allowed to continue working on the data through extensions.

In December 2018 the EEOC announced a partnership with the National Opinion Research Center at the University of Chicago to build a secure data enclave, through which researchers will be able to work on the Commission's data. The original plan was to bring on some existing researchers as beta testers of the new system in August 2019 and start accepting new applications for access in January 2020.

The beta version of the enclave went into testing in the fall of 2019. I was one of the beta testers. The new system is and will be a considerable improvement over how we accessed the data in the past. There were then delays, though, over beginning the access-request process. The Commission's CDO gave an update to the research community in early March:

In the course of putting the virtual data center together we learned that EEOC does not have any legal authority to share data with the “public” and according to agency lawyers, for Title VII purposes, researchers are considered the public. The IPA process had apparently been used as a workaround.

In a stroke of luck the recently enacted Evidence Act provides agencies with a way to share data with researchers. Under this law, approved researcher are deemed “agents” enabling them to use the data while imposing felony sanctions (jail time and large fines) for deliberate misuse of the data. These are stronger sanctions than the Title VII protections.

However the agency must be a statistical agency or be a designated statistical unit within an agency in order to avail themselves of this provision of the law.

We have decided - with the full support of the Chair - to seek official OMB recognition as a statistical unit for OEDA’s Information and Data Access Division. Just last week we received the authorization from the Office of the Chair to move forward with the application process.

Currently we have begun to develop the application and expect to submit it to OMB in May. Our hope is to receive approval by late summer. We’re hoping to have everything up and ready for researchers in late Fall.

Once and for all we will have in place the actual legal authority to share data, the data sharing process will be governed by the same CIPSEA provisions governing other federal statistical agencies, the application for access to a secure virtual research data center will be open to any qualified researcher, and appropriate strong disclosure controls will be in place.

Please thank all of the researchers for their patience. A superior long term solution for all will be in place once we’re finished.

Of course, within hours of this email, the shit hit the fan with respect to the pandemic.

So, that's what's happening with the EEO-1 data. At present, the EEOC is not accepting applications for access. They probably will be before the end of the calendar year. How much of a wrench COVID-19 has thrown into the works I cannot say, but everyone involved has been acting in good faith.

I write all of this here because, every few weeks, I get an email from an academic colleague asking what the deal is with accessing these data. This is reasonable--I'm happy to be one of the "authorities" on these data! But rather than typing the same thing over and over, I thought I'd just write it once and direct people to it.

Obviously, the second I know more, I'll write a new post!

Previous
Previous

The pernicious problem of the perfect paper

Next
Next

Refactoring a project