South African Apartheid Economy Data Archive¶
This archive contains digitized historical datasets covering the South African economy during apartheid (1948-1994). It is designed to support quantitative research into the economic mechanisms of apartheid's rise, transformation, and collapse.
What Research Can This Data Support?¶
The datasets here enable analysis of several interconnected questions about apartheid South Africa:
- Labor market shocks and mechanization: How did the 1974 mine labor crisis propagate through the economy, tripling Black mine wages and accelerating agricultural mechanization? (Agricultural census + TEBA recruiting panel)
- Electoral politics of labor market change: How did exposure to Black labor market competition drive white electoral realignment? Districts east of the Eiselen Line — where farmers competed directly with mines for Black labor — shifted +6.3pp toward right-wing parties by 1987 (Elections + district-level economic data)
- Industrial action and regime change: Did strikes in the 1980s shift investor expectations about apartheid's durability? Political strikes generated 2.4x larger stock market reactions than economic strikes (Strike panel + share prices + sanctions data)
- Structural transformation: How did the economy shift from primary production to manufacturing, and how did racial employment barriers evolve? (Manpower survey + manufacturing census)
Datasets at a Glance¶
District-Level Panels (joinable on magisterial district)¶
These datasets share a common geographic unit — the 279 magisterial districts — and can be merged into a single panel.
| Dataset | Years | Level | Records | Key Variables |
|---|---|---|---|---|
| Agricultural Census | 1949-1990 | District | ~3,000 district-years | Employment by race, wages, mechanization |
| TEBA Mine Recruiting | 1953-1975 | District | ~800 district-years | Mine recruits by origin district |
| Population | 1904-1996 | District | 279 per year | Population by race, urban/rural |
| Forced Removals | 1960s-1980s | District | 631 events | Removal counts by category |
| Elections | 1938-1992 | Constituency | 2,000+ results | Votes by party, turnout |
National and Sectoral Panels¶
| Dataset | Years | Level | Records | Key Variables |
|---|---|---|---|---|
| Strike Panel | 1978-1990 | Incident | 4,100+ strikes | Company, union, workers, outcome |
| Manpower Survey | 1965-1994 | Sector-occupation | 165,583 | Employment by race, gender, sector |
| Mining Census | 1964-1987 | Mine | 67+ files | Gold production, costs, employment |
| Manufacturing Census | 1948-1985 | Region | 23+ files | Output, employment by industry |
| Industrial Wages | 1973-1988 | Industry-occupation | 825 files | Gazetted wage rates |
Financial and Macro¶
| Dataset | Years | Records | Key Variables |
|---|---|---|---|
| Sanctions & Macro | 1968-2015 | ~40,000 daily obs | Gold price, exchange rates, bond yields, credit spreads |
| Share Prices | 1973-2015 | 1,000+ equities | JSE market values, price-to-book |
See the full Data Catalog for all 1,900+ files across 24 categories.
Origin and Quality¶
These datasets were digitized from archival PDFs — primarily South African government census publications, TEBA/NRC mine recruiting records, and academic working papers. The original digitization was performed by oDesk contractors in 2013-14, with extensive validation and correction in 2025-26.
All data traces back to source PDFs through a documented provenance chain. See Provenance & Validation for error rates, correction logs, and validation scripts.
Data quality varies by year
The oDesk contractors sometimes introduced transcription errors (1-23% error rates depending on year). The agricultural census years 1949-1976 have been validated; other datasets are at various stages. Always check the provenance documentation before using a dataset.
Associated Research¶
- Boone & Wilse-Samson — "The Political Economy of Labor Market Shocks: Evidence from South Africa's 1974 Mining Crisis." Key finding: the 1974 foreign labor shock tripled Black mine wages and drove national agricultural mechanization, but the district-level DiD is specification-sensitive (robust in pooled OLS, null in TWFE). The Eiselen Line electoral divergence (+6.3pp right-wing by 1987) is the lead empirical result.
- Naidu, Turban & Wilse-Samson — "Striking at Apartheid's Economic Heart: Labor Unrest as Information Revelation in Authoritarian Capitalism." Key finding: 2,680 strikes matched to JSE returns show -0.35% mean abnormal returns; political strikes 2.4x larger; sanctions complementarity 3.4x. Null on political targeting of connected firms.
- Wilse-Samson — The Economic Contradictions of Apartheid (book manuscript, ~67,000 words, 23 figures)
Access¶
CSV data files are not in the git repository (too large). To request access, contact Laurence Wilse-Samson at NYU Wagner.
Code, documentation, and panel-building scripts are all in the GitHub repository.
License¶
CC-BY-4.0. See Citation for how to cite.
Source Copyright
The underlying source documents (government census publications, TEBA/NRC records) may be subject to separate copyright. This license applies to the digitized datasets, code, and documentation created by the project team.