Resources for mini-projects#

We’ll update this page with more resources as we go!

General:#

Language / brain datasets:#

  • Li et al. (2022) - Le Petit Prince multilingual naturalistic fMRI corpus (French, Chinese, English) paper, data

  • Nastase et al. (2021) - The “Narratives” fMRI dataset for evaluating models of naturalistic language comprehension paper, data

  • Gwilliams et al. (2022) - MEG-MASC: a high-quality magneto-encephalography dataset for evaluating natural speech processing paper, data

  • Wehbe et al. (2014) - Simultaneously Uncovering the Patterns of Brain Regions Involved in Different Story Reading Subprocesses paper, data (we will work with this data in lab 2)

  • Pereira et al. (2018) - Toward a universal decoder of linguistic meaning from brain activation (fMRI, words/sentences & pictures) paper, data

  • Broderick et al. (2018) - Electrophysiological Correlates of Semantic Dissimilarity Reflect the Comprehension of Natural, Narrative Speech paper, data

  • Huth et al. (2016). Natural speech reveals the semantic maps that tile human cerebral cortex paper, data

  • More datasets on Nora Hollenstein’s Cognitive NLP wiki

Music / brain datasets:#

  • Stober et al. (2015) - Towards Music Imagery Information Retrieval: Introducing the OpenMIIR Dataset of EEG Recordings from Music Perception and Imagination paper, data

  • Di Liberto et al. (2020) - Cortical encoding of melodic expectations in human temporal cortex paper, data

Music information retrieval datasets:#

Memory, vision & more#