Opening MHD Model Files with PsiData

Opening MHD Model Files with PsiData#

Explore a MAS radial-magnetic-field file through the PsiData() interface: inspect metadata attributes, trace the connections to psi_io._models, psi_io._mesh, and psi_io._units, and observe the lazy-loading and caching behavior.

This example demonstrates:

  1. Opening a MAS HDF5 file and exploring the reader’s metadata attributes.

  2. The role of psi_io._models in defining physical quantity properties.

  3. How psi_io._mesh encodes Yee-grid stagger positions for each quantity.

  4. How psi_io._units supplies the MAS code-unit normalization factors.

  5. Lazy loading — no data leaves the disk until explicitly requested.

  6. Automatic caching of full-array reads for quick re-access.

Note

PsiData() is the only public symbol exported by psi_io.mhd_io. HDF4 (.hdf) and HDF5 (.h5) files are supported transparently; the file extension selects the I/O backend.

from pathlib import Path
from psi_io import data
from psi_io.mhd_io import PsiData

Opening a file

PsiData() takes a path to any PSI MAS or POT3D HDF file. No data is read at this point — only the filename is parsed and minimal HDF metadata is inspected to identify the quantity, units, and mesh code.

To define the metadata data of the given input file, the reader follows a hierarchy of inference steps to determine the values of the following core attributes:

'quantity'

Canonical lower-case quantity identifier.

'sequence'

Integer time-step sequence number.

'unit'

Code-to-physical unit for this quantity, as an Unit or a string parseable by it.

'scalar'

True if the quantity is a scalar field; False for vector components.

'mesh'

Mesh code (MeshCodeType) describing data staggering.

If these values are not explicitly included in the PsiData() constructor the reader falls back to reading the HDF metadata attributes (if present) and then parsing the filename according to the PSI filename schema. The reader then cross-references the quantity against the canonical properties defined in psi_io._models to infer the remaining metadata attributes.

br_filepath = data.get_3d_data()
print(f"Filename : {Path(br_filepath).name}")
reader = PsiData(br_filepath)
Filename : br.h5

Core metadata attributes

The quantity and sequence attributes are extracted from the filename stem using the PSI filename schema (e.g. br001001.h5 gives quantity='br', sequence=1001). Since the provided filename does not contain an explicit sequence number, the reader defaults to sequence=0.

print(f"quantity  : {reader.quantity!r}")
print(f"sequence  : {reader.sequence}")
print(f"ndim      : {reader.ndim}")
print(f"shape     : {reader.shape}  (Nφ × Nθ × Nr in HDF storage order)")
quantity  : 'br'
sequence  : 0
ndim      : 3
shape     : (181, 100, 151)  (Nφ × Nθ × Nr in HDF storage order)

Connection to psi_io._models

The props attribute is a Props dataclass stored in psi_io._models, which bundles the canonical name, description, native unit, and mesh code for every recognised PSI quantity.

print(f"description : {reader.description}")
print(f"props.name  : {reader.props.name}")
description : MAS Magnetic Field (Radial Component)
props.name  : br

Connection to psi_io._mesh

The mesh attribute is a tuple of Mesh enum members (one per spatial axis in physical (r, θ, φ) order) that encode the Yee-grid stagger position of the field quantity.

For the radial magnetic field br, the field is face-centred in the radial direction (half-mesh) and cell-centred in both angular directions (main-mesh):

from psi_io._mesh import Mesh
print(f"mesh : {reader.mesh}")
print(f"  r  → {reader.mesh[0].name}")
print(f"  θ  → {reader.mesh[1].name}")
print(f"  φ  → {reader.mesh[2].name}")
mesh : (<Mesh.HALF: 1>, <Mesh.MAIN: 0>, <Mesh.MAIN: 0>)
  r  → HALF
  θ  → MAIN
  φ  → MAIN

Connection to psi_io._units

The unit attribute is one of the custom MAS normalization units defined in psi_io._units. Multiplying a code-unit value by this factor converts it to physical CGS units. Here, MAS_b represents approximately 2.2 Gauss per code unit.

from psi_io._units import MAS_b
print(f"unit     : {reader.unit}")
print(f"MAS_b    : {MAS_b}")
print(f"in Gauss : {reader.unit.to('G'):.4f}")
unit     : MAS_b
MAS_b    : MAS_b
in Gauss : 2.2069

Coordinate scale readers

The scales attribute is a Scales(r, t, p) named tuple; each element is itself a lightweight reader. Calling read() on a scale returns the 1-D coordinate array as a Quantity.

r_scale = reader.scales.r.read()
t_scale = reader.scales.t.read()
p_scale = reader.scales.p.read()
print(f"r scale  : shape={r_scale.shape}  range=[{r_scale[0]:.5f}, {r_scale[-1]:.5f}]")
print(f"θ scale  : shape={t_scale.shape}  range=[{t_scale[0]:.5f}, {t_scale[-1]:.5f}]")
print(f"φ scale  : shape={p_scale.shape}  range=[{p_scale[0]:.5f}, {p_scale[-1]:.5f}]")
r scale  : shape=(151,)  range=[0.99956 PSI_rsun, 30.51165 PSI_rsun]
θ scale  : shape=(100,)  range=[0.00000 PSI_angle, 3.14159 PSI_angle]
φ scale  : shape=(181,)  range=[0.00000 PSI_angle, 6.28319 PSI_angle]

Lazy loading

Reading the coordinate scales above did not load the main data array. The is_cached property confirms the primary dataset has not yet been transferred from disk:

print(f"is_cached before read : {reader.is_cached}")
is_cached before read : False

Calling read() with no arguments loads the full dataset. Because no spatial restrictions are applied, the result is stored in the reader’s internal cache:

data_arr, r, t, p = reader.read()
print(f"data shape : {data_arr.shape}  (Nφ × Nθ × Nr)")
print(f"data unit  : {data_arr.unit}")
print(f"is_cached  : {reader.is_cached}")
data shape : (181, 100, 151)  (Nφ × Nθ × Nr)
data unit  : MAS_b
is_cached  : True

Subsequent full-array calls return the cached copy without a second disk read. The cache is populated only for unrestricted reads; any partial read (i.e. any call that restricts at least one axis) bypasses and never updates the cache.

Total running time of the script: (0 minutes 0.013 seconds)

Gallery generated by Sphinx-Gallery