Geocoding is the process of assigning a geographic coordinates to a description of location-related information. The most common use is the attribution of geographic coordinates (latitude and longitude) to a street or postal address or address geocoding. In contrast, reverse geocoding is the reverse process that transforms a pair of geographic coordinates into an address.
Geocoding begins when data in text or tabular form is compared to a reference data table that includes defined map coordinates. When the input data is matched to the reference data, the corresponding map coordinates are assigned to the input data. Reference data is typically based on a segmented street centerline layer that contain information on house number ranges. Geographic coordinates are then interpolated from the estimated location where the address number falls on the segment. For example, if a road segment contain the address range 100 – 119 and runs west to east and the address attribute is 109, then the geographic location would be roughly 50% of the way along the segment on the odd side of the street.
The quality and accuracy of the geocoded data depends on understanding the reference table and data, the methods in which the matches are being produced and the given accuracy once a match is found.
Knowing the format required for the geocoder that you intend to use is critical to getting identical matches. Accuracy depends on the nature of input data including the format and “cleanliness”. For example, input data that includes misspellings, special characters (such as “ \ % # ?) and abbreviations often result in inaccuracy and mismatches. Be prepared to refine your data and re-geocode your data as errors or typos may be found during the process. Finally, it is important to perform a quality control check on your geocoding results by comparing the address locations against other data sources, such as street basemaps.
Geocoding can be done on a case by case basis or in batches using an open-source or commercial geocoding service. There are many batch geocoding services online that are free up to a pre-set level while others that charge a fee. As geocoding becomes more valuable, batch geocoding large numbers of addresses has become costly and geocoding batch services limited.
ESRI’s World Geocoding Service is available to ASU faculty, staff and students. However, the service uses 40 credits per 1,000 address, which is pulled from the ASU University credit pool. Currently, each user in the ASU community has the ability to geocode up to 25,000 addresses. An alternative to utilizing a pre-configured geocoding service is to create your own address locator. Address locators are based on an address locator style that defines reference data used and rules for address format and parsing depending on locator style. Recently, Erica Quintana, a Policy Analyst in the Morrison Institute for Public Policy, created composite address locators for all of Arizona using the Census TIGER files. She has agreed to share them with the ASU community. The address locators and supporting files can be found here.
Aside from using the address locators from ASU, there are many geocoding service options. Here are a few to consider:
These are just a few of the geocoding methods out there to you started. Once you started, the sky is the limit. Happy geocoding!
by Jill Sherwood