“The consequences for human welfare involved in questions like these are simply staggering: Once one starts to think about them, it is hard to think about anything else.” — Robert E. Lucas Jr., On the Mechanics of Economic Development, 1988
The previous chapters addressed how we measure economic activity in the short and medium run. This chapter steps back and asks a more fundamental question: what determines the level of economic activity in the long run — the standard of living that a country can sustain not just this year but over decades and generations?
This is not merely an academic exercise. The gap in living standards between the world’s richest and poorest countries is staggering: real income per person in the United States is roughly fifty times that of Malawi, sixty times that of Niger. A person born in Norway today can expect to live more than thirty years longer than one born in the Central African Republic. These differences are not explained by short-run fluctuations in aggregate demand; they reflect profound, persistent differences in productive capacity — in the amount of physical capital workers have at their disposal, the level of education and health they bring to their work, and the state of the technology they use. Growth theory attempts to explain why these productive capacities differ so dramatically and what, if anything, can change them.
5.1 What Growth Theory Is Explaining¶
Before presenting models, it is worth being precise about what we are trying to explain. The relevant outcome variable is real output per worker, or equivalently real output per capita in an economy where the employment rate is approximately constant. We care about output per worker rather than total output because a country is not richer simply by having more people — what matters for living standards is how much each person can produce.
Over the very long run, output per worker in currently advanced economies has grown at roughly 2% per year. This sounds modest, but compound growth at 2% per year means a doubling of living standards every 35 years. The standard of living of a typical American in 2000 was roughly eight times higher than in 1900, and roughly sixty times higher than in 1800. This sustained improvement is the central fact that any growth theory must explain.
Alongside this long-run trend, several robust regularities appear when comparing growth experiences across countries. These are the Kaldor stylized facts (Kaldor, 1961):
Output per worker grows at a roughly constant rate over long periods.
The capital-to-labor ratio grows at a roughly constant rate.
The real interest rate (return on capital) is roughly constant over long periods.
The capital-to-output ratio is roughly constant.
Labor and capital shares of national income are roughly constant.
Growth rates of output per worker differ substantially across countries.
Facts 1–5 together describe what growth theorists call a balanced growth path — a trajectory on which all quantities grow at constant rates and all ratios remain approximately stable. Fact 6 is the puzzle: if balanced growth is the long-run attractor, why do some countries grow along a high-income balanced path while others are stuck at a low-income one?
Definition (Balanced Growth Path). A balanced growth path is a trajectory of the economy on which output per effective worker , capital per effective worker , and consumption per effective worker are all constant. Here is the level of labor-augmenting technology and the size of the labor force. Output per actual worker grows along the balanced path at the rate of technological progress, consistent with Kaldor facts 1 and 2.
5.2 The Solow–Swan Growth Model¶
The first rigorous model of long-run growth is due independently to Robert Solow (1956) and Trevor Swan (1956). Its central insight is deceptively simple: capital accumulation alone cannot sustain indefinite growth, because of diminishing returns; sustained growth in output per worker requires sustained improvements in technology.
To see why, consider the production function. The economy produces a single good using capital and labor augmented by the level of technology :
The function is assumed to exhibit constant returns to scale: doubling both inputs doubles output, for any . It also exhibits diminishing marginal returns to each input: , , meaning each additional unit of capital adds positive but declining amounts to output. The Inada conditions — and — ensure that production cannot take place with zero capital and that capital is never so productive that the economy would want to accumulate it without limit.
The reason technology enters as — multiplying labor rather than capital — is that this specification of labor-augmenting (or Harrod-neutral) technological progress is the only one consistent with a balanced growth path when preferences are of the constant-relative-risk-aversion form. Technology augmenting capital would violate the constancy of factor shares (Kaldor fact 5); technology entering in a Hicks-neutral form also creates difficulties. This is not a minor technical point: the form of technological progress determines whether a balanced growth path exists, and it is a constraint on model-building with empirical content.
Definition (Effective Labor and Intensive Form). Define effective labor as — the product of the technology level and the raw number of workers. Define capital per effective worker and output per effective worker . Dividing both sides of the production function by , the intensive form of production is:
Working in intensive form collapses a model with two growing variables ( and ) into a model with one stationary variable , which is much easier to analyze.
The economy’s saving rate is, in the basic Solow model, taken as exogenous — households save a fixed fraction of their income regardless of the interest rate or their age. This is the model’s key simplifying assumption; it will be relaxed in the Ramsey–Cass–Koopmans model of Section 5.3. With exogenous saving, the capital accumulation equation in intensive form is:
where is the population growth rate, is the rate of technological progress, and is the depreciation rate of capital. The term is the effective depreciation: even with zero physical depreciation, would shrink if population grows (each worker has less capital) or if technology improves (each unit of effective labor is endowed with less capital). The effective depreciation rate plays an important role throughout.
The intuition for this equation is worth spelling out. The change in capital per effective worker equals investment per effective worker (, since all saving is invested in a closed economy) minus the amount of investment needed just to keep constant as the labor force grows and technology improves. If investment exceeds break-even, rises; if it falls short, falls.
The Steady State¶
The steady state of the Solow model is the value at which capital per effective worker is constant: . Setting the right-hand side to zero:
This says that investment per effective worker exactly covers effective depreciation. The steady state exists and is unique under the Inada conditions, because — the average product of capital, scaled by — is declining from infinity (as ) to zero (as ), and it crosses the horizontal line exactly once.
With the Cobb–Douglas production function , , the steady state has a closed form:
These formulas have important implications. A higher saving rate raises and — countries that invest more are richer in the long run. A higher population growth rate or depreciation rate lowers and — these forces continuously dilute the capital stock. The parameter (the capital share in income, approximately one-third in most countries) governs the sensitivity of income to saving: because , doubling the saving rate raises steady-state income by only about 41%, not 100%.
Transitional Dynamics and Conditional Convergence¶
Away from the steady state, converges to monotonically. Linearizing around the steady state:
The solution is : the gap between current capital and the steady state shrinks exponentially at rate . With and , , giving a half-life of years. This means the Solow model predicts that economies converge to their steady states fairly slowly — a prediction consistent with observed growth patterns.
This convergence is conditional: countries converge to their own steady states, not to a common one. A poor country that also has a low saving rate and high population growth will have a low and will not catch up to a rich country with high and low . The conditional convergence hypothesis — that countries converge in growth rates after conditioning on the determinants of the steady state — is strongly supported empirically (Mankiw, Romer, and Weil, 1992). Unconditional convergence — the idea that all poor countries simply grow faster than rich ones — is not supported in the broad cross-section of countries, though it holds within groups of similarly structured economies such as the OECD.
The Golden Rule of Capital Accumulation¶
A natural question is: which saving rate maximizes household welfare? Consumption per effective worker on the balanced growth path is . Since is an increasing function of , raising simultaneously increases output ( rises) and decreases the fraction of output available for consumption ( falls). The saving rate that maximizes steady-state consumption — the Golden Rule saving rate — balances these two forces:
Verbally: at the Golden Rule, the marginal product of capital equals the effective depreciation rate. Since the net marginal product of capital , the Golden Rule condition is equivalent to .
An economy is dynamically efficient if , meaning the return on capital exceeds what is needed to maintain at the Golden Rule level. Dynamically efficient economies have not overaccumulated capital — households are not saving too much relative to the welfare-maximizing amount. An economy with is dynamically inefficient: it has accumulated so much capital that reducing investment would allow more consumption at every future date, benefiting every generation. Most empirical evidence suggests that advanced economies are dynamically efficient (Abel et al., 1989).
5.3 The Ramsey–Cass–Koopmans Model¶
The Solow model’s exogenous saving rate is its principal limitation. Households in the Solow model behave as if they follow a mechanical saving rule rather than choosing optimally. This is unsatisfying both intellectually — we should derive saving behavior from preferences — and practically, because it prevents us from analyzing how saving responds to changes in policy, taxation, or the return on capital.
The Ramsey–Cass–Koopmans (RCK) model endogenizes saving by replacing the mechanical saving rule with explicit intertemporal optimization by an infinitely-lived representative household. The “infinitely lived” assumption is a mathematical convenience: it can be justified either literally (people care about their descendants as much as themselves) or as an approximation to an overlapping-generations model where bequests are operative.
The representative household has members and maximizes the present discounted value of utility:
where is the pure rate of time preference — the rate at which the household discounts future utility relative to present utility, holding consumption constant. The requirement ensures that the utility integral converges. The per-period utility function satisfies the standard conditions: (more consumption is better), (diminishing marginal utility), and the Inada conditions , . The standard parametric form is the constant relative risk aversion (CRRA) utility function:
Definition (CRRA Utility). The CRRA utility function is:
with the limiting case when . The parameter is simultaneously the coefficient of relative risk aversion (how much a household dislikes gambles over consumption) and the inverse of the elasticity of intertemporal substitution (EIS) . The EIS measures how responsive the household’s consumption growth is to the real interest rate: a high (low EIS) means households are reluctant to substitute consumption across time, so even a high interest rate induces little additional saving. The CRRA family is the only class of utility functions consistent with balanced growth (King, Plosser, and Rebelo, 1988).
Writing the household’s problem in intensive form with , the economy’s resource constraint is:
the same as in the Solow model but with replacing . The household now chooses the entire path to maximize utility subject to this constraint. This is an optimal control problem: the state variable is (capital accumulated from past decisions) and the control variable is (the decision made at each instant).
The current-value Hamiltonian is:
where is the costate variable — the shadow price of capital in utility units, measuring how much lifetime utility increases if the capital stock is marginally increased.
The Pontryagin maximum principle provides three conditions for optimality. The first-order condition for :
The costate equation:
The transversality condition:
The transversality condition rules out “Ponzi schemes” in which the household accumulates capital without limit, intending to consume it all at infinite time. It says that the present value of the capital stock must approach zero as the horizon extends to infinity — the household cannot plan to die with positive assets (that would mean it consumed too little) or negative assets (that would mean it consumed on borrowed time forever).
Differentiating the first-order condition with respect to time and substituting the costate equation yields the Euler equation for consumption:
In per-capita terms, this simplifies to:
where is the net marginal product of capital — the real return on saving. This is the fundamental equation of consumption theory. It says: consumption grows when the real return on saving exceeds the rate of impatience. If , it is worthwhile to sacrifice present consumption for future consumption; if , the household prefers to consume now rather than wait.
The Steady State and Phase-Plane Analysis¶
The steady state requires both and . From the Euler equation, requires:
This determines the steady-state capital stock independently of the saving rate — a sharp contrast with the Solow model, where the steady state depends on the exogenous . In the RCK model, the steady-state capital stock is pinned down by preference parameters (, ) and technology (, ). The corresponding steady-state consumption is determined by substituting into the resource constraint with : .
Comparing the RCK steady state with the Golden Rule: since (assuming , which holds for standard parameter values), we have . The optimizing economy holds less capital than the Golden Rule would prescribe. Why? Because households are impatient () and therefore discount future consumption. Accumulating more capital to reach the Golden Rule would require sacrificing present consumption, which households do not find worthwhile given their impatience. The RCK model thus endorses dynamic efficiency: rational households will never allow the economy to overaccumulate capital.
The dynamics of the two-dimensional system are analyzed using a phase diagram. The locus is a hump-shaped curve in space (the resource constraint evaluated at ). The locus is a vertical line at . The equilibrium is a saddle point: there are four regions of the phase plane corresponding to different combinations of rising/falling and , and the stable manifold — called the saddle path — is the unique trajectory that converges to the steady state. Since is a free variable (the household can choose its initial consumption), the economy jumps immediately to the saddle path at and then travels along it to the steady state. This jump is not a discontinuity in behavior; it is the household choosing its initial consumption optimally, given that it knows the optimal path forward.
5.4 Exogenous Versus Endogenous Growth¶
In both the Solow and RCK models, the long-run growth rate of per-capita income equals — the rate of technological progress. This is exogenous growth: the engine of long-run improvement in living standards (technology) is taken as given by the model, falling like manna from heaven. The model explains the level of income but not the source of growth. This is a significant limitation. If we want to know whether subsidizing research, opening trade, or improving education will raise a country’s long-run growth rate, a model with exogenous technology can only say “not if technology is exogenous” — which is circular.
Definition (Exogenous Growth). A growth model exhibits exogenous growth if the long-run growth rate of per-capita income is determined by variables — typically the rate of technological progress — that are taken as parameters outside the model. Policy can change the level of the balanced growth path but not the long-run growth rate.
Definition (Endogenous Growth). A growth model exhibits endogenous growth if the long-run growth rate of per-capita income is determined within the model — by the choices of households, firms, and the government regarding saving, education, research, and other investments. In endogenous growth models, policy can permanently raise the growth rate, not merely the level of income.
The distinction has profound policy implications. Under exogenous growth, a policy that permanently raises the saving rate shifts the balanced growth path upward but does not change its slope: the economy reaches a permanently higher level of income but eventually resumes growing at the same rate . Under endogenous growth, a policy that permanently raises investment in knowledge or technology can permanently raise the growth rate — the slope of the income path changes. Whether this distinction corresponds to something empirically important is one of the central debates in growth economics.
The AK Model¶
The simplest model of endogenous growth eliminates diminishing returns by assuming a linear production function:
where is interpreted broadly to include both physical capital and human capital — knowledge, skills, and organizational capabilities that can be accumulated without limit. With saving rate , capital accumulation is . If , output grows permanently at rate . Crucially, depends on the saving rate : policies that raise saving raise the permanent growth rate. This is the defining feature of endogenous growth.
The AK model’s linearity is, of course, a strong assumption. It requires that the external returns to capital (broadly defined) exactly offset the private diminishing returns, so that the social production function is linear. Whether this holds empirically — whether knowledge and human capital accumulation generate sufficient externalities to prevent long-run diminishing returns — is an empirical question on which the evidence is mixed.
Romer’s Knowledge Accumulation Model¶
A more satisfying approach to endogenous growth endogenizes the process of technological innovation itself. Romer (1990) models technology as the accumulated stock of designs — blueprints for new producer goods — produced by profit-seeking researchers. This makes technological progress the result of intentional investment decisions rather than an unexplained gift.
In the Romer model, the economy has three sectors. A research sector employs skilled labor to produce new designs; the flow of new designs is governed by the knowledge production function:
where is a productivity parameter. Crucially, the stock of existing knowledge enters multiplicatively: standing on the shoulders of giants, today’s researchers are more productive when more knowledge already exists. This generates the standing-on-shoulders effect that makes knowledge accumulation self-reinforcing.
A goods sector uses the existing stock of designs, along with human labor and unskilled labor , to produce final output:
where is the quantity of intermediate input (embodied in design ). Each intermediate good is produced by a monopolist who holds the patent on the corresponding design — this monopoly profit is the reward that motivates researchers to invest in generating new designs.
In symmetric equilibrium, the long-run growth rate is , which is endogenous: it depends on the share of human capital devoted to research, , which is in turn affected by policies like R&D subsidies, intellectual property protection, and education spending. The model provides a rigorous foundation for policies aimed at stimulating innovation.
Jones (1995) identified a problem with the Romer model as stated: the scale effect. Because , doubling the population of researchers doubles the long-run growth rate. But over the past century, the number of scientists and engineers engaged in R&D in the United States has grown by more than tenfold, while the long-run growth rate of per-capita income has remained roughly constant at 2%. The scale effect prediction is decisively rejected. Semi-endogenous growth models (Jones, 1995b) resolve this by introducing diminishing returns in knowledge production: with . In the resulting model, the long-run growth rate is independent of the level of R&D investment (it depends only on population growth), but the transition path — and hence long-run income levels — depends on policy.
5.5 The Empirics of Convergence¶
The Solow model generates a specific, testable prediction about the growth rate of countries. Countries that are currently below their steady state should grow faster than countries that are at or above it. Converting the linearized dynamics to per-capita output growth rates gives the convergence regression:
where the coefficient on initial income is negative: countries that start poor (low ) grow faster, converging toward richer countries. The constant absorbs the determinants of the steady state.
A crucial distinction governs the interpretation of this regression. Unconditional convergence — the hypothesis that poor countries simply grow faster regardless of their structural characteristics — would require the coefficient on to be negative in an unrestricted regression with no additional controls. This hypothesis is decisively rejected in cross-country data: poor countries are not, on average, growing faster than rich ones. Countries like Korea and Taiwan were poor in 1960 and have grown rapidly; countries like Zambia and Haiti were also poor in 1960 and have not grown at all. Unconditional convergence confuses the dynamics within an economy (toward its own steady state) with the dynamics across economies (toward a common steady state).
Conditional convergence — the hypothesis that countries converge to their own steady states, which differ across countries — is strongly supported. When the regression controls for the determinants of the steady state (saving rates, population growth, human capital), the coefficient on initial income becomes significantly negative. Mankiw, Romer, and Weil (1992) show that an augmented Solow model with human capital explains roughly 80% of the cross-country variation in per-capita income when these controls are included, and yields estimates of the physical and human capital shares (, ) consistent with factor income shares observed in national accounts.
The implication is that long-run differences in income are primarily explained by differences in steady-state characteristics — saving rates, human capital investment, population growth — rather than by idiosyncratic country-specific factors. Whether the determinants of these steady-state characteristics are themselves explainable (by institutions, geography, culture, historical accident) is the subject of Chapter 33.
Next: Chapter 6 — Macroeconomic Data and Sources