Generative causal testing to bridge data-driven models and scientific theories in language neuroscience

1 October 2024

Alexander Huth

Abstract

Representations from large language models are highly effective at predicting BOLD fMRI responses to language stimuli. However, these representations are largely opaque: it is unclear what features of the language stimulus drive the response in each brain area. We present generative causal testing (GCT), a framework for generating concise explanations of language selectivity in the brain from predictive models and then testing those explanations in follow-up experiments using LLM-generatedthis http URLapproach is successful at explaining selectivity both in individual voxels and cortical regions of interest (ROIs), including newly identified microROIs in prefrontal cortex. We show that explanatory accuracy is closely related to the predictive power and stability of the underlying predictive models. Finally, we show that GCT can dissect fine-grained differences between brain areas with similar functional selectivity. These results demonstrate that LLMs can be used to bridge the widening gap between data-driven models and formal scientific theories.

View on arXiv

@article{antonello2025_2410.00812,
  title={ Generative causal testing to bridge data-driven models and scientific theories in language neuroscience },
  author={ Richard Antonello and Chandan Singh and Shailee Jain and Aliyah Hsu and Sihang Guo and Jianfeng Gao and Bin Yu and Alexander Huth },
  journal={arXiv preprint arXiv:2410.00812},
  year={ 2025 }
}

Comments on this paper