34 Matching Annotations
  1. Jul 2024
  2. notebooksharing.space notebooksharing.space
    1. are excited to announce the extension of the Xarray data model to support vector geometries

      The announcement has already happened with Xvec :) . Instead I would phrase as "describing how the Xarray ecosystem supports vector geometries".

      Also I'd mention xvec in this first paragraph.

    2. Spatial indexing

      This is quite powerful, I'd add a bit about .xvec.query in the introduction.

      AND here, you are showing how the geometry dimension is special, and enables nice interfaces

    3. Computation and grouping along a time dimension

      The time dimension is a speicfic example here. More importantly, you can do the "usual" multi-dimensional array things along the non-geometry dimensions. I would make that point.

    4. The above operation interpolated the ERA5 data onto the coordinates from europe_ds but in the process we lost the data variables describing the name and country of each city. Add those onto the interpolated vector data cube and drop the level coordinate variable, which we don't need.

      If you set them as coords, I bet you wouldn't need this.

    5. lat(geometry)float6441.33 41.32 41.11 ... 50.18 50.57array([41.3275 , 41.3230556, 41.1125 , ..., 50.75 , 50.1833333, 50.5666667])lon

      lat, lon are redundant with "geometry", I'd drop them to illustrate your points better

    6. points

      To me, Polygons of interest are where things get interesting.

      Points are reasonably handled without these geometries. But a polygon is a meaningful increase in metadata associated with an observation.

    7. Vector datasets are frequently treated as 'flat' or where the spatial dimension is the only required functional dimension; but what happens when vector datasets contain additional dimensions like time?

      nice

    8. raster data is viewed as a cube, while vector data is discussed as a data frame.

      When you use this kind of construction, always use the same verb i.e. "viewed", and the same ordering. For example - BAD: "A is better than B, while D is worse than C" - GOOD: "A is better than B, and C is better than D".

      When you use the same ordering and minimize differences, it becomes a lot easier for the brain to parse.

  3. Jun 2024
  4. notebooksharing.space notebooksharing.space
    1. idx_variables.update(index.create_variables(variables))

      index is a tuple, but should be PandasIndex. You've copied too much code over :) Write it from scratch.

  5. Jul 2022
  6. notebooksharing.space notebooksharing.space
    1. these are the resources I've produced or contributed to this summer,

      I agree with moving this up. This is your main "product" IMO. And to write these you developed the skills in (1)

      It'd also be nice to mention the NASA project, so it shows that your work is feeding in to a bigger effort in the area.

    2. working with xarray,

      and "designing analysis pipelines for nD array datasets".

      Usually the word thrown around for "using less for loops" is "vectorization" but that's a bit too much jargon.

    3. specialized knowledge is passed down from advisor to advisee, or in small university classes

      Nice place to bring in a personal story if you have one.