Data Structure
MassVision allows users to import modality independent MSI data in a structured CSV or hierarchical HDF5 formats for analysis. These format ensures compatibility with any rectilinear MSI data where all pixels share a common list of ions. The following instructions assume the MSI data has MxN pixels and L ions per pixel.
HDF5 file structure
The HDF5 should have two datasets, one for peak intensities named peaks, and one for ions named mz. The content of the datasets are the following Numpy arrays:
peaks: A three-dimensional MSI data as Numpy array with shape (MxNxL) (float or integer)
mz: A one-dimensional vector data as Numpy array with shape (L,) (float)
CSV file structure
The CSV should be created based on a spectrum-per-row architecture. The CSV file will have M×N+1 rows (one header row and M×N rows for pixel spectra) and L+2 columns (two location indices and L abundance values):
Header Row The first row (header) contains:
M: Number of pixels in height (integer)
N: Number of pixels in width (integer)
L m/z values: The m/z values corresponding to L ions (float)
Pixel Data Rows Each subsequent row corresponds to a pixel on the M×N grid. Each row contains:
i: The pixel’s y-coordinate (integer, range 0 to M-1, 0 being the top)
j: The pixel’s x-coordinate (integer, range 0 to N-1, 0 being the left)
Ion intensities: The abundance values for the L ions at this pixel (no specific type or range constraints)
Example For a 3x2 pixel MSI data the spatial indexing of the pixels will look like
(0,0) |
(0,1) |
(1,0) |
(1,1) |
(2,0) |
(2,1) |
Assume the data contains 4 ions with m/z of 281.0375, 494.2507, 600.324, and 831.5288, the CSV structure will look like
3 |
2 |
281.0375 |
494.2507 |
600.324 |
831.5288 |
|---|---|---|---|---|---|
0 |
0 |
26 |
59 |
9 |
133 |
0 |
1 |
45 |
32 |
0 |
24 |
1 |
0 |
0 |
0 |
77 |
0 |
1 |
1 |
112 |
60 |
18 |
72 |
2 |
0 |
0 |
28 |
38 |
22 |
2 |
1 |
0 |
0 |
0 |
18 |
By following this format, users can import MSI data from diverse modalities into MassVision for analysis and visualization.