Fast, Inclusive Searches for Geographic Names Using Digraphs

Techniques and Methods 7-A1



An algorithm specifies how to quickly identify names that approximately match any specified name when searching a list or database of geographic names. Based on comparisons of the digraphs (ordered letter pairs) contained in geographic names, this algorithmic technique identifies approximately matching names by applying an artificial but useful measure of name similarity. A digraph index enables computer name searches that are carried out using this technique to be fast enough for deployment in a Web application. This technique, which is a member of the class of n-gram algorithms, is related to, but distinct from, the soundex, PHONIX, and metaphone phonetic algorithms. Despite this technique's tendency to return some counterintuitive approximate matches, it is an effective aid for fast, inclusive searches for geographic names when the exact name sought, or its correct spelling, is unknown.

Additional publication details

Publication type Report
Publication Subtype USGS Numbered Series
Title Fast, Inclusive Searches for Geographic Names Using Digraphs
Series title Techniques and Methods
Series number 7-A1
DOI 10.3133/tm7A1
Edition -
Year Published 2008
Language ENGLISH
Publisher Geological Survey (U.S.)
Contributing office(s) Eastern Geographic Science Center
Description iv, 6 p.
Larger Work Type Report
Larger Work Subtype USGS Numbered Series
Larger Work Title Chapter 1 of Book 7, Automated Data Processing and Computations of Section A, Algorithms
Online Only (Y/N) Y