Note

This page is a reference documentation. It only explains the class signature, and not how to use it. Please refer to the user guide for the big picture.

nidl.datasets.BaseNumpyDataset¶

class nidl.datasets.BaseNumpyDataset(root, patterns, channels, split='train', targets=None, target_mapping=None, transforms=None, mask=None, withdraw_subjects=None)[source]¶

Bases: BaseDataset

Neuroimaging dataset that uses numpy arrays and memory mapping.

Parameters:

root: str: the location where are stored the data.
patterns: str or list of str: the relative locations (no path names matching allowed in specified pattern) of the numpy array to be loaded.
channels: str or list of str, default=None: the name of the channels.
split: str, default ‘train’: define the split to be considered.
targets: str or list of str, default=None: the dataset will also return these tabular data.
target_mapping: dict, default None: optionaly, define a dictionary specifying different replacement values for different existing values. See pandas DataFrame.replace documentation for more information.
transforms: callable, default None: a function that can be called to augment the input images.
mask: str, default None: optionnaly, mask the input data using this numpy array.
withdraw_subjects: list of str, default None: optionaly, provide a list of subjects to remove from the dataset.

Raises:

FileNotFoundError: If the mandatorry input files are not found.
KeyError: If the mandatory key are not found.
UserWarning: If missing data are found.

Notes

A ‘participants.tsv’ file containing subject information (including the requested targets) is expected at the root. A ‘<split>.tsv’ file containg the subject to include is expected at the root.

__init__(root, patterns, channels, split='train', targets=None, target_mapping=None, transforms=None, mask=None, withdraw_subjects=None)[source]¶

get_data(idx)[source]¶: Proper data indexing.