Raster basics

Raster datasets represent geographic features by dividing the world into discrete square or rectangular cells laid out in a grid. Each cell has a value that is used to represent some characteristic of that location, such as temperature, elevation, or a spectral value.

raster dataset diagram

Raster datasets are commonly used for representing and managing imagery, digital elevation models, and numerous other phenomena. Often rasters are used as a way to represent point, line, and polygon features. In the example below, you can see how a series of polygons would be represented as a raster dataset.

A vector diagram represented as a raster

Rasters can be used to represent all geographic information (features, images, and surfaces), and they have a rich set of analytic geoprocessing operators. In addition to being a universal data type for holding imagery in GIS, rasters are also heavily used to represent features, enabling all geographic objects to be used in raster-based modeling and analysis.

Rasters in the geodatabase

A raster is a set of cells arranged in rows and columns and is a commonly used dataset in GIS. Users typically employ many raster files, yet many users see an increasing need to manage raster data, along with their other geographic information, in a DBMS. The geodatabase provides a very effective means for raster data management in both file and ArcSDE geodatabases.

Raster management strategies

Two data management strategies for rasters are important:

  • Raster provisioning—Getting raster datasets quickly "into play" in your GIS means you will most likely use them as is, typically as a series of raster files. This can be a series of independent files, or you can use a technology like the Image extension to ArcGIS Server to manage and serve these existing datasets as a collection.
  • Rasters in the geodatabase—This strategy is useful when you want to manage rasters, add behavior, and control the schema; want to manage a well-defined set of raster datasets as part of your DBMS; need to get high performance without loss of content and information (no compression); and want one data architecture for managing all your content.

Geographic properties of raster data

Four geographic properties are typically recorded for all raster datasets. These become useful for georeferencing and help explain how raster data files are structured. This concept is important to understand: it helps explain how rasters are stored and managed in the geodatabase.

Raster datasets have a special way of defining geographic location. Once the cells or pixels can be accurately georeferenced, it's easy to have an ordered list of all the cell values in a raster. This means that each raster dataset typically has a header record holding its geographic properties, and the body of the content is simply an ordered list of cell values.

Geographic properties for a raster typically include

  • Its coordinate system
  • A reference coordinate or x,y location (typically the upper left or the lower left corner of the raster)
  • A cell size
  • The count of rows and columns

This information can be used to find the location of any specific cell. By having this information available, the raster data structure lists all the cell values in order from the upper left cell along each row to the lower right cell, as illustrated below.

Diagram of cell values

The raster block table in the geodatabase

Raster data is typically much larger in size than features and requires a side table for storage. For example, a typical orthoimage can have as many as 6,700 rows by 7,600 columns (more than 50 million cell values).

To get high performance with these larger raster datasets, a geodatabase raster is cut up into smaller tiles (referred to as blocks) with a typical size of around 128 rows by 128 columns or 256 x 256. These smaller blocks are then held in a side table for each raster. Each separate tile is held in a separate row in a block table as shown below.

Block table view diagram

This simple structure means that only the blocks for an extent need to be fetched when they are needed instead of the entire image. In addition, resampled blocks used to build raster pyramids can be stored and managed in the same block table as additional rows.

This enables rasters of enormous sizes to be managed in a DBMS; produce very high performance; and provide multiuser, secure access.

Extending rasters

Rasters are heavily and increasingly used in GIS applications. The geodatabase can manage rasters for many purposes: as individual datasets, logical collections of datasets, and picture attributes in tables.

A number of geodatabase capabilities enable users to extend how they manage their raster information as follows:

Use

If you need to

Raster datasets

Manage very large, continuous image datasets and mosaicked rasters.

diagram of raster

Mosaic Datasets

A mosaic dataset is a data model that is a hybrid of a raster catalog and raster dataset, representing an on-the-fly view of a raster catalog. They allow you to store, manage, view, and query collections of raster image data.

Mosaic dataset

Raster catalogs

Accomplish a number of purposes, including

  • Manage a tiled raster layer, where each tile is a separate raster.
    Tile diagram
  • Manage any series of rasters in a DBMS.
    Series of rasters diagram
  • Manage a raster time series.

    Time series diagram

Raster attribute columns in tables

Store pictures or scanned documents as attributes in tables.

Rasters as attributes diagram

9/18/2012