Do-calculus

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Do-calculus is a set of mathematical rules devised by Judea Pearl in 1995 to determine whether causal effects can be identified from observational data under specific assumptions encoded in a causal graph. It provides a systematic method for transforming expressions involving the do-operator (representing interventions) into expressions involving only observable probabilities, enabling the identification of causal relationships.

Definition and purpose

[edit | edit source]

Causal queries involving interventions (e.g., P(ydo(x))) are considered identifiable if they can be expressed using observational data alone, independent of unmeasured parameters. The do-calculus achieves this by leveraging graphical criteria from directed acyclic graphs (DAGs) to remove do-operators through algebraic manipulations.[1]

The three rules of Do-calculus

[edit | edit source]

The rules[2] apply to a causal graph 𝒢 and assume the Markov condition holds:

Rule 1: Insertion/deletion of observations

[edit | edit source]
P(ydo(x),z,w)=P(ydo(x),w)if YZX,W in 𝒢X

This rule allows the removal of irrelevant observations (Z) if they are d-separated from Y given X and W in the graph where incoming edges to X are removed.

Rule 2: Action/observation exchange

[edit | edit source]
P(ydo(x),do(z),w)=P(ydo(x),z,w)if YZX,W in 𝒢XZ_

This rule permits replacing an intervention (do(z)) with an observation (z) if Y and Z are *d*-separated in the graph where outgoing edges from Z are removed.

Rule 3: Insertion/deletion of interventions

[edit | edit source]
P(ydo(x),do(z),w)=P(ydo(x),w)if YZX,W in 𝒢XZ(W)

This rule removes irrelevant interventions (do(z)) if Y and Z are d-separated in a graph modified to block paths through Z.

Applications

[edit | edit source]

Do-calculus can be applied to various domains within causal inference such as mediation analysis in decomposing direct and indirect effects.[3][4] It can be used for meta-synthesis to combine the results from heterogeneous studies.[3][5]

Completeness

[edit | edit source]

The do-calculus is considered complete: if repeated application of the rules cannot eliminate the do-operator, the causal effect is not identifiable. This result was formalized in 2006 by Huang, Valtorta, Shpitser, and Pearl.[3]

Criticism

[edit | edit source]

Critics have pointed out that other frameworks, such as structural equation modeling (SEM) or Bayesian networks, may offer more intuitive approaches to causal inference for certain applications. These methods often emphasize parameter estimation rather than identifiability, which can be more relevant for applied research.[6]

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ a b c Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  6. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

Lua error in Module:Authority_control at line 153: attempt to index field 'wikibase' (a nil value).