Resampling to lon-lat grid#

This notebook shows a quick way to resample HEALPix data to a lon/lat grid using nearest neighbors.

Anti-Aliasing#

Nearest-Neighbor remapping can lead to aliasing: along steep gradients (e.g. the temperature difference between land and water along the Amazon river), data is picked seemingly randomly from either side of the gradient (it depends on how the source and target grids fit onto each other on the very fine scale). While this effect likely averages out when analyzing larger areas, it can disturb the local scale. Several methods exist to overcome this problem. A simple approximative way is supersampling, where we first interpolate to a finer grid and then average the interpolated data back to our target grid. Uniform supersampling can be implemented as follows:

supersampling = {"lon": 4, "lat": 4}
idx = get_nn_lon_lat_index(
    2**zoom,
    np.linspace(-70, -55, supersampling["lon"] * 300),
    np.linspace(5, 20, supersampling["lat"] * 300),
)
tas_lon_lat_aa = ds.tas.isel(time=0, cell=idx).coarsen(supersampling).mean()
tas_lon_lat_aa.plot()
None

../../_images/a420df36b1b279672fb1eb5b1ff5f3b46a75994dd0c6e90b889297e8ce3efa0d.png

While the output barely changes for uniform areas, regions with gradients apprear much smoother now.

Saving to disk#

Of course, any data remapped this way can be saved to disk by the usual means of xarray I/O methods, including netCDF and zarr formats. If data is opened using dask (remember to use some chunks definition while opening), it is possible to perform the regridding lazily, chunk by chunk while writing the output.

Selecting zoom level automatically#

If we want a simple automatic way of selecting an appropriate zoom level, we can also compute HEALPix indices for multiple zoom levels and observe how many unique index values we obtain:

If we have only a few unique values, a single model output pixel will end up in many pixels in the lon/lat projection: the HEALPix resolution is too coarse for the desired lon/lat grid.
If we have as many unique values as lon/lat pixels, every lon/lat pixel will get data from a different model output pixel, but we might skip a bunch of model pixels, thus subsample the output and might see aliasing effects.

So to choose an appropriate zoom level, we might want to search for the zoom where the unique_fraction goes towards 1 but not necessarily be exactly 1.

Selecting a good zoom level comes with multiple advantages:

We don’t load excessive amounts of data -> our code becomes faster
Using pre-aggregated data can reduce aliasing effects (if the output hierarchy uses area averaging)

@lambda f: np.vectorize(f, excluded={1, 2})
def unique_fraction(nside, lons, lats):
    idx = get_nn_lon_lat_index(nside, lons, lats)
    return np.unique(idx).size / idx.size

Let’s try this function for several zoom levels and different grids:

import matplotlib.pylab as plt

zoom = np.arange(15)

grids = [
    (np.linspace(-70, -55, 300), np.linspace(5, 20, 300), "Carribean, fine"),
    (np.linspace(-70, -55, 100), np.linspace(5, 20, 100), "Carribean, coarse"),
    (np.linspace(-180, 180, 360), np.linspace(-90, 90, 180), "Globe, 1° by 1°"),
]

for lons, lats, label in grids:
    plt.plot(zoom, unique_fraction(2**zoom, lons, lats), label=label)

plt.xlabel("zoom")
plt.ylabel("unique fraction")
plt.legend()
None

../../_images/065807cdb6a5e0b72a86174eb74e2930a1e3b263172f8b5bba3c678291acd011.png

As the change between 0 and 1 typically is rather steep, we could get a simple criterion for “approaching 1” by just picking the first zoom level where unique_fraction > 0.5 and use that as a suggested zoom level:

for lons, lats, label in grids:
    print(f"{label:30s} {np.argmax(unique_fraction(2**zoom, lons, lats) > 0.5)}")

Carribean, fine                10
Carribean, coarse              9
Globe, 1° by 1°                6

So based on this criterion, the best zoom level for our initial example would indeed be 10. We can also see that level 6 could be appropriate for a global 1° by 1° grid as suggested in the HEALPix intro.

Resampling to lon-lat grid#

Remapping via source indices#

Remapping model output#

Anti-Aliasing#

Saving to disk#

Selecting zoom level automatically#

This Page