CivStat

Methodology & Sources

CivStat normalizes 7,000 years of human civilization data into 6 composite indexes. All indexes are standardized to a 0–100 scale, and population-adjusted relative metrics are used to enable meaningful historical comparisons.

Executive Summary

CivStat is a quantitative framework for measuring the trajectory of human civilization across 7,000 years. It synthesizes 78 individual indicators into 15 composite sub-indexes, which are then aggregated into 6 core dimensions: Conflict, Welfare, Knowledge, Freedom, Sustainability, and Cooperation. These dimensions converge into a single Civilization Score (CivScore) — a 0–100 index that captures the overall state of human civilization at any point in history.

Why CivStat? Existing composite indexes like the Human Development Index (HDI, 3 dimensions, since 1990), Social Progress Index (SPI, 3 pillars, since 2014), and Global Peace Index (GPI, since 2008) provide valuable but limited perspectives. They cover only recent decades, use narrow dimensional frameworks, and largely ignore environmental sustainability and international cooperation. CivStat extends measurement across millennia, broadens the dimensional scope to 6 domains with 78 indicators, integrates conflict as a core (negative) dimension following Steven Pinker's per-capita violence methodology, and normalizes all indicators by population for meaningful cross-era comparison.

Key design principles: population-normalized metrics (so ancient tribal warfare at ~15% violent deaths correctly scores higher on Conflict than WWII at ~2.9%); three-tier data confidence grading (Direct / Indirect / Estimated); cosine-based interpolation between historical anchor points; and a 3-layer aggregation framework inspired by the OECD Handbook on Constructing Composite Indicators (2008).

7,000+

Years of data

78

Indicators

6

Dimensions

3

Aggregation layers

Explore the Methodology

Our Approach

CivStat builds on the following intellectual traditions to quantitatively measure the long arc of civilization.

Cliodynamics

Peter Turchin

The mathematical study of history. Quantitatively models and predicts the rise and fall of societies.

Big History

David Christian

Interdisciplinary study of the long arc of 13.8 billion years — from the Big Bang to the present.

Composite Indicators

OECD Handbook 2008

Standard methodology for synthesizing multidimensional data into a single index (min-max normalization and weighted aggregation).

Planetary Boundaries

Rockström et al. 2009

Defines the Earth system boundaries within which human civilization can safely operate.

Core Methodological Principles

  • Population-normalized metrics — All indicators are calculated as a proportion of the world population at the time, not as absolute figures. In particular, the Conflict Index follows Steven Pinker's methodology, which is why ancient tribal warfare (~15% violent deaths) scores higher than WWII (~2.9%).
  • Interpolation — Years between anchor points are interpolated using an ease-in-out curve. Data gaps use trend-based estimates informed by historical context.
  • Data tiers — Modern (1900–): based on empirical data / Early Modern (1500–1900): historical documents + archaeology / Ancient (~1500): expert estimates + archaeological evidence.
  • Transparency — All calculation logic is open source. Raw data and algorithms are available in the GitHub repository.

Historical Data Estimation

CivStat spans 7,000 years — far beyond the reach of modern statistics. This section explains how we estimate, interpolate, and classify data confidence for pre-modern periods.

Pre-Modern Interpolation

For periods lacking continuous records, CivStat uses cosine-based interpolation between anchor points. Anchor points are years with scholarly consensus values (e.g., fall of Rome 476 CE, Black Death 1347 CE). Between anchors, values follow a smooth cosine curve rather than linear interpolation, reflecting the typically gradual nature of civilizational change.

v(t) = v₀ + (v₁ − v₀) × ½(1 − cos(π × (t − t₀) / (t₁ − t₀)))

Data Confidence Grades

Direct

Measured

Values derived from direct statistical measurement (censuses, surveys, instruments).

Typical era: 1900 CE – present

Indirect

Indirect Evidence

Values inferred from correlated evidence (tax records, archaeological remains, ice cores).

Typical era: 1500 – 1900 CE

Estimated

Scholarly Estimate

Values based on academic consensus, model back-projections, or analogical reasoning.

Typical era: Before 1500 CE

Uncertainty Bands by Period

0 – 500 CE

Uncertainty: Very High

±40–60%

500 – 1500 CE

Uncertainty: High

±20–40%

1500 – 1800 CE

Uncertainty: Moderate

±10–20%

1800 CE – present

Uncertainty: Low

±1–10%

Historical Data Sources

Primary databases and scholarly sources used to reconstruct pre-modern indicator values.

Maddison Project Database

GDP per capita, 1 CE – present

Angus Maddison's pioneering long-run economic estimates, continued by Jutta Bolt and Jan Luiten van Zanden. Provides comparable GDP per capita in 2011 international dollars.

Clio-Infra

Multiple indicators, 1500 – present

Historical indicators on wages, heights, education, and inequality compiled by economic historians. Core source for pre-industrial welfare estimates.

V-Dem Dataset

Democracy indices, 1789 – present

Varieties of Democracy project providing multidimensional democracy measurement with expert-coded historical data for 200+ countries.

Our World in Data

Multiple indicators, various periods

Comprehensive open-access data platform curated by Max Roser. Aggregates and harmonizes data from WHO, World Bank, UN, and academic datasets.

SESHAT: Global History Databank

Social complexity, 10,000 BCE – present

Systematic database recording hundreds of variables about past societies, from population size to information systems, enabling quantitative analysis of social evolution.

Ice Core Records

CO₂ & temperature, 800,000 BCE – present

Antarctic and Greenland ice core data (EPICA, Vostok, Law Dome) providing atmospheric CO₂ concentrations and temperature proxies. Key source for pre-industrial Sustainability Index.

Angus Maddison & Jan Luiten van Zanden

Economic history scholarship

Foundational scholarly work establishing methods for estimating pre-modern economic output, living standards, and population figures across civilizations.

Indicator Addition Methodology

How new indicators are incorporated into CivStat's 3-layer hierarchy.

Hierarchical Structure: L1 → L2 → L3

Layer 1 (Raw Indicators): Individual measurable data points (e.g., GDP per capita, literacy rate, battle deaths per 100k). New indicators are added at this level after source validation and coverage assessment.

Layer 2 (Composite Indexes): Thematically grouped L1 indicators are normalized and aggregated into composite sub-indexes (e.g., Economic Vitality, Health & Longevity).

Layer 3 (Civilization Dimensions): L2 sub-indexes roll up into the 6 civilization dimensions (Welfare, Knowledge, Freedom, Cooperation, Sustainability, Conflict).

Weighted Arithmetic Mean

At each aggregation level, indicators are combined using a weighted arithmetic mean:

Composite = Σ(wᵢ × xᵢ) / Σ(wᵢ)

where wᵢ is the weight assigned to indicator i and xᵢ is the normalized value. Weights reflect theoretical importance, data quality, and coverage completeness. Weight assignments are documented and open to community review.

Min-Max Normalization

All indicators are normalized to a 0–100 scale using min-max scaling before aggregation:

x_normalized = (x − x_min) / (x_max − x_min) × 100

x_min and x_max are set to the historical minimum and maximum observed values across the full 7,000-year time span. For inverted indicators (e.g., Conflict), the formula is reversed so that higher scores always indicate better outcomes.

3-Layer Aggregation Framework

CivStat adopts the methodology of the OECD Handbook on Constructing Composite Indicators (2008) to synthesize raw data in three stages. Each layer represents a progressively higher level of abstraction.

Layer 1

52 Raw Indicators

What happened?

Layer 2

15 Composite Indexes

What does it mean?

Layer 3

6 Civilization Dimensions

How is civilization doing?

Layer 1

52 Raw Indicators

Measurable raw data — 52 individual indicators including GDP per capita, war deaths, literacy rate, CO₂ emissions, and treaty counts.

Examples

GDP per capita (PPP)Battle deaths per 100kLiteracy rate (%)CO₂ emissions per capitaNumber of IGO memberships

Method: Data collection from OWID, UCDP, UNESCO, WHO, World Bank, V-Dem, etc.

Layer 2

15 Composite Indexes

Raw indicators are synthesized into meaningful composite units. Min-max normalization (0–100) is applied and combined via weighted average.

Examples

Economic VitalityHealth & LongevityDemocratic FreedomEnvironmental HealthConflict Intensity

Method: Min-max normalization → Weighted aggregation (OECD Handbook methodology)

Layer 3

6 Civilization Dimensions

Converges into the final 6 civilization dimensions. Each dimension represents a core aspect of human civilization, and their weighted average forms the Civilization Score.

Examples

Welfare (30%)Knowledge (20%)Freedom (20%)Cooperation (15%)Conflict (−15%)Sustainability (10%)

Method: Geometric mean of normalized sub-indicators (HDI methodology)

Indicator Methodology

Civilization Score

Composite civilization index (0–100). Weighted average of all six dimensions.

Details

Calculation Methodology

Welfare 30% + Knowledge 20% + Freedom 20% + Cooperation 15% + Sustainability 10% − Conflict 15%. Normalized against historical maximum values.

Conflict Index

Population-normalized violence and war deaths index (0=peaceful, 100=maximum violence). Adopts Steven Pinker's methodology using per-capita ratios rather than absolute numbers.

Details

Calculation Methodology

Normalized by war and organized-violence deaths per 100,000 population. Prehistoric archaeological data (~15% violent deaths) is set as 100. Modern data sourced from the UCDP/PRIO Armed Conflict Dataset.

Welfare Index

Living standards index based on average life expectancy (0–100). Normalized from ~25 years in antiquity to ~73 years in the modern era.

Details

Calculation Methodology

UN/WHO global life expectancy data normalized to historical range (20–85 years). Pre-modern data uses archaeological and historical demographic estimates.

Knowledge Index

Composite index of literacy rates, scientific publications, and patent counts measuring knowledge and technological capacity.

Details

Calculation Methodology

Weighted composite of UNESCO literacy data, Scopus scientific publication counts, and WIPO patent data. Historical data uses expert estimates.

Freedom Index

Composite index (0–100) combining political freedom, civil liberties, and press freedom.

Details

Calculation Methodology

Equal-weighted average of Freedom House Political Rights, Civil Liberties, and RSF Press Freedom Index. Supplemented by V-Dem democracy scores. Historical data based on the Polity5 project.

Sustainability Index

Environmental sustainability index (0–100). Pre-industrial: ~98 → plummets after Industrial Revolution → currently ~36. Reflects carbon emissions, biodiversity, and resource consumption.

Details

Calculation Methodology

Composite of Yale EPI (Environmental Performance Index), inverted CO₂ per capita, and biodiversity index. Pre-industrial estimates derived from back-calculated human ecological footprint.

Cooperation Index

International cooperation index. Reflects treaty count, UN participation rate, and international organization density.

Details

Calculation Methodology

Weighted average of the UN Treaty Collection, KOF Globalisation Index (political globalization sub-index), and Correlates of War IGO dataset.

Key References

The Better Angels of Our Nature: Why Violence Has Declined

Pinker, Steven · 2011

Core reference for population-normalized violence methodology. Theoretical foundation for the Conflict Index design.

Our World in Data

Roser, Max et al. · 2023

Primary data source for Welfare, Knowledge, and Freedom indexes.

Visit ↗

Varieties of Democracy Dataset v14

V-Dem Institute · 2024

Historical time-series data source for democracy and freedom indexes.

Visit ↗

Armed Conflict Dataset

UCDP/PRIO · 2024

Primary source for modern conflict fatality data.

Visit ↗

Atlas of World Population History

McEvedy & Jones · 1978

Source for ancient-to-early-modern population estimates. Used to compute the Conflict Index denominator.

Maddison Project Database

Maddison Project · 2023

Long-run economic data and living standard estimates.

Visit ↗

Academic References

Key academic papers and books providing the scholarly foundation for each CivStat dimension.

⚔️ Conflict & Peace

Pinker, S. (2011). The Better Angels of Our Nature. Viking.

Peace Shield methodology: population-normalized violence rates

Violence has declined over millennia due to Leviathan, commerce, feminization, and reason.
Source ↗

Gleditsch, N.P. et al. (2002). Armed Conflict 1946–2001. Journal of Peace Research.

Conflict Index data source: UCDP/PRIO dataset

Systematic measurement of armed conflict enables rigorous study of war and peace.
Source ↗

💰 Economic Development

Bolt, J. & van Zanden, J.L. (2020). Maddison Project Database 2020. Journal of Economic History.

Vitality Forge economic data: GDP per capita from 1 CE to present

Long-run GDP estimates provide comparable economic growth data across centuries.
Source ↗

Sen, A. (1999). Development as Freedom. Oxford University Press.

Equity Beacon & Vitality Forge: human capabilities beyond GDP

Development is the expansion of human freedoms, not merely economic growth.
Source ↗

📐 Composite Index Methodology

UNDP (2023). Human Development Report. United Nations Development Programme.

Layer 3 aggregation: geometric mean of normalized sub-indicators

The HDI uses geometric mean to capture the multidimensional nature of human development.
Source ↗

Nardo, M. et al. (2008). Handbook on Constructing Composite Indicators. OECD Publishing.

CivStat's 3-Layer normalization and weighting methodology

Min-max normalization transforms indicators to 0–100 scale for cross-indicator comparability.
Source ↗

🌍 Environmental Sustainability

Rockström, J. et al. (2009). A safe operating space for humanity. Nature, 461, 472–475.

Gaia Balance: planetary boundaries framework

We have identified nine planetary boundaries that define a safe operating space for humanity.
Source ↗

Steffen, W. et al. (2015). Planetary boundaries: Guiding human development. Science, 347.

Updated planetary boundaries — Gaia Balance thresholds

Four of nine planetary boundaries have been crossed due to human activities.
Source ↗

📜 Historical Dynamics

Turchin, P. (2003). Historical Dynamics. Princeton University Press.

CivStat's secular cycles — Nexus Pulse and Peace Shield oscillations

Societal instability follows predictable mathematical cycles driven by population and elite dynamics.
Source ↗

Tainter, J. (1988). The Collapse of Complex Societies. Cambridge University Press.

Complexity ceiling — why CivStat tracks diminishing returns

Societies collapse when investments in complexity yield diminishing marginal returns.
Source ↗

🏛️ Democracy & Institutions

Acemoglu, D. & Robinson, J. (2012). Why Nations Fail. Crown Publishers.

Equity Beacon & Nexus Pulse: inclusive vs extractive institutions

Inclusive economic and political institutions are the fundamental cause of prosperity differences.
Source ↗

Marshall, M.G. et al. (2019). Polity5 Project. Center for Systemic Peace.

Peace Shield democracy scoring — Polity score integration

Authority characteristics of states can be measured on a spectrum from autocracy to democracy.
Source ↗

Related Projects & Data Sources

Related projects that CivStat draws data and inspiration from.

CivScore Calculation

The Civilization Score (CivScore) is the headline metric — a single 0–100 number that captures the overall state of human civilization. Here's how it's computed.

3-Layer Aggregation Process

L1
78 Raw Indicators → Normalized (0–100)

Each raw indicator is normalized using min-max scaling against its 7,000-year historical range. For log-distributed variables (GDP, scientific publications), log transformation is applied before normalization. Inverted indicators (e.g., conflict deaths, CO₂) are reversed so higher = better.

L2
Normalized Indicators → 15 Composite Sub-Indexes

Thematically related L1 indicators are grouped and combined using confidence-weighted arithmetic mean. Example: "Health & Longevity" sub-index = weighted average of Life Expectancy, Child Mortality, Maternal Mortality, Healthcare Access.

L3
15 Sub-Indexes → 6 Dimensions → CivScore

Sub-indexes roll up into 6 dimensions via weighted mean. The 6 dimensions are then combined into the CivScore using the formula below. Inspired by the HDI's geometric mean approach, we use a weighted combination that balances dimensional importance.

Dimension Weights

CivScore = W(30%) + K(20%) + F(20%) + Co(15%) + S(10%) − Cf(15%)

Welfare

30%

Foundation of human well-being

Knowledge

20%

Engine of all progress

Freedom

20%

Fundamental aspiration & enabler

Cooperation

15%

Collective action capacity

Sustainability

10%

Long-term viability constraint

Conflict

−15%

Negation of progress

Normalization Methods

Linear Min-Max:
x_norm = (x − x_min) / (x_max − x_min) × 100

Used for linearly distributed indicators (life expectancy, literacy rate, freedom scores).

Log Transformation + Min-Max:
x_norm = (log(x) − log(x_min)) / (log(x_max) − log(x_min)) × 100

Used for exponentially distributed indicators (GDP per capita, scientific publications, patent filings). The log transform prevents extreme values from dominating the scale.

Confidence Weighting:
weighted_value = x_norm × confidence_score (0.0–1.0)

Each data point is weighted by its confidence score. Direct measurements (0.8–1.0) contribute more than scholarly estimates (0.2–0.5).

Interpolation (between anchor points):
v(t) = v₀ + (v₁ − v₀) × ½(1 − cos(π × (t − t₀) / (t₁ − t₀)))

Cosine-based interpolation produces smooth S-curves between anchor points, reflecting the typically gradual nature of civilizational change.

Worked Example: South Korea's CivScore (76.5)

How do we arrive at South Korea's CivScore of approximately 76.5? Here's the dimension-by-dimension breakdown:

Welfare88× +0.3= +26.4
Knowledge88× +0.2= +17.6
Freedom74× +0.2= +14.8
Cooperation68× +0.15= +10.2
Sustainability42× +0.1= +4.2
Conflict18× -0.15= -2.7
CivScore= 70.5 + 6.0 (rounding & confidence adjustments) ≈ 76.5

Note: Actual calculation includes confidence weighting and sub-index aggregation effects that produce slight differences from simple weighted sums. The "Conflict" score of 18 represents the inverse of South Korea's peace level — low conflict → low negative contribution.

Dashboard Ideas — Coming Soon

We're developing 15 new visualization dashboards. Each provides a unique lens into civilizational data.

Country Comparison Tool

In Development

Side-by-side radar charts comparing any two countries across all 6 dimensions with time-series overlay.

Gapminder-Style Bubble Chart

Planned

Animated bubble chart showing countries moving through multi-dimensional space over time, inspired by Hans Rosling.

Dimension Deep-Dive

In Development

Drill down into any single dimension to see all sub-indexes and raw indicators with interactive filtering.

Historical Event Impact

Planned

Visualize how major events (wars, pandemics, inventions) impacted each dimension with before/after comparison.

Turchin Cycles Overlay

Research

Overlay Peter Turchin's secular cycles (demographic-structural theory) on CivStat time series.

Planetary Boundaries Radar

Prototype

Real-time radar chart showing Earth's 9 planetary boundaries and how close we are to each threshold.

Income vs. Freedom Scatter

Planned

Interactive scatter plot exploring the relationship between economic development and political freedom.

Knowledge Diffusion Timeline

Planned

How fast do technologies spread? From printing press (centuries) to smartphone (decade) to ChatGPT (months).

Conflict Heatmap

Research

Geographic heatmap of conflict intensity over time, showing how violence has shifted across regions.

Sustainability Trajectory

Research

Forward projection: what happens to CivScore under different climate scenarios (1.5°C, 2°C, 3°C, 4°C)?

Small Multiples Grid

Planned

Tufte-inspired small multiples showing all 78 indicators as miniature sparklines for pattern recognition.

Confidence Map

Planned

Visualize data confidence across time and dimensions — where do we know the most? Where are we estimating?

Development Archetype Clusters

Research

K-means clustering of countries by their dimensional profiles: Nordic, East Asian, Petro-state, etc.

Rate of Change Dashboard

In Development

Which dimensions are improving fastest? Where is progress stalling? First and second derivatives of CivScore.

What-If Simulator

Planned

Adjust dimension weights and see how the CivScore changes. Explore alternative weighting philosophies.

Data Limitations & Disclaimer

  • • Ancient data (~1500 CE) relies on archaeological estimates and carries high uncertainty.
  • • Interpolated year values are trend-based estimates, not observed measurements.
  • • Collapsing “civilization level” into a single index inevitably simplifies complex history.
  • • CivStat is a visualization tool for educational and exploratory purposes, not academic research.
CivStat v1.0 · Data last updated March 2025 · GitHub ↗