Data Quality: Assessment
This week in GIS 5935, I concluded the last of a three-part module on data quality in GIS. This module focused on data assessment where we were tasked with determining completeness of two different road networks by summarizing our analysis in textual, visual, and numerical terms. Completeness is one of the five components of data quality where it essentially refers to "how well the data set captures all the features" (Boldstad and Mason 2022:611). Further, assessment of completeness can help inform the analyst on important information missing in a data set that might be critical for a spatial analysis project.
For this assignment, I assessed completeness by examining differences in road length for a section of Washington County, Oregon in two different data sets: TIGER Roads (U.S. Census Bureau) and Street Centerlines (Washington County). In addition, I used a grid consisting of 297 1-km by 1-km polygons to serve as base units of analysis. For the analysis I used various tools and functions in ArcGIS including Clip, Summarize Within, Spatial Join, Calculate Field, and Calculate Geometry.
The total length of roads (km) was higher for the TIGER data. In the grid polygon, this consisted of 11,253 km of roads in the TIGER layer with less for the County layer (10,671.2). Of the 297 grid polygons, TIGER data was more complete in 162 with the County data more complete in 132. In three grid sections, the lengths were identical.
In addition, I calculated a percentage that reflected the differences in length between the two data sets across the grid. This involved subtracting the total length of TIGER roads from the total length of the County centerlines, dividing by the length of the County centerlines (as the base), the multiplying by 100%. The map choropleth map below shows how the differences are symbolized where I establish three distinct categories:
-197.5-10, -9.9-10, and -10.1 to 80.9. Here I wanted to show at opposite ends, greater than 10% differences as well as those with less than 10%:
I found this assignment interesting and helpful in allowing me to better understand aspects of data quality, especially the component of completeness, along with further enhancing my cartographic design skills.
GIS Fundamentals: A First Text on Geographic Information Systems, 7th edition. Minnesota: Eider
Press.
Comments
Post a Comment