CLADDER: A Benchmark to Assess Causal Reasoning Capabilities of Language Models
Proximal Causal Inference With Text Data
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Teaching Transformers Causal Reasoning through Axiomatic Training
C2P: Featuring Large Language Models with Causal Rea- soning