Formal MDP: Consumption Stage (cons_stage)¶
no-port-cons¶
Source:
packages/dolo/examples/models/doloplus/port-with-shocks/library/cons_stage.yaml
Rosetta Stone¶
| DDSL abstract | This stage |
|---|---|
| Stage name | cons_stage |
| Arrival state \(x_{\prec}\) | \(m\) |
| Decision state \(x\) | \(m\) |
| Continuation state \(x_{\succ}\) | \(a\) |
| Control | \(c\) |
| Exogenous shock | (none) |
Model¶
A household enters the stage with cash-on-hand \(m \in \mathbb{R}_{++}\) and chooses consumption \(c\). There are no shocks within this stage; any stochastic elements (income, returns) are handled by preceding stages in the period.
State and action spaces. The state space is \(X_m \coloneqq \mathbb{R}_{++}\) and the action space is \(\mathbb{R}_+\). The feasibility correspondence is
Transitions. The arrival-to-decision transition is the identity \(m_d = m\). After the consumption choice, end-of-period assets are
Preferences. The per-period reward is CRRA utility \(u(c) \coloneqq \frac{c^{1-\rho}}{1-\rho}\), with risk-aversion parameter \(\rho > 0\), and the discount factor is \(\beta \in (0,1)\).
Bellman Equation¶
Since the arrival-to-decision transition is identity and there are no shocks, the arrival and decision value functions coincide. The Bellman equation for this stage is:
where \(v_{\succ} \colon \mathbb{R}_+ \to \mathbb{R}\) is the continuation value function (supplied by the next stage or the next-period arrival value through inter-stage wiring). The optimal policy is
The arrival value function is the identity:
Marginal value. By the envelope theorem, the marginal value at the decision perch is
Since the arrival transition is identity, \(v'_{\prec}(m) = v'(m)\).
First-Order Conditions and EGM Representation¶
At an interior optimum the Euler equation holds:
The endogenous grid method (EGM) exploits this by working on the continuation grid. Given \(v'_{\succ}(a)\) for each grid point \(a\):
- Inverse Euler. Recover consumption on the continuation grid:
- Reverse transition. Recover the endogenous decision-grid point:
This produces pairs \((\hat{m}(a), \hat{c}(a))\) on an endogenous grid, which are interpolated onto a regular \(m\)-grid to obtain \(c^*(m)\).
Forward Operator (Population Dynamics)¶
Given a distribution \(\mu\) over arrival states \(m\), the forward operator pushes \(\mu\) through the optimal policy. Since the stage is deterministic:
and the continuation-state distribution is \(\mu_{\succ} = (m \mapsto m - c^*(m))_\# \, \mu\).
Calibration¶
| Symbol | Value | Description |
|---|---|---|
| \(\beta\) | 0.96 | discount factor |
| \(\rho\) | 2.0 | CRRA risk aversion |