Glossary term
Glossary term
Evaluation and Benchmarks
A measurement of how similar two text strings are to each other. In machine learning, edit distance is useful for the following reasons:
Edit distance is easy to compute.
Edit distance can compare two strings known to be similar to each other.
Edit distance can determine the degree to which different strings are similar to a given string.
Several definitions of edit distance exist, each using different string operations. See Levenshtein distance for an example.
Created for this library
A search team uses edit distance to suggest spelling corrections for misspelled product names in the search bar.
A localization team uses edit distance to compare translations against approved glossaries when reviewing automated translations.
A data quality team uses edit distance to match noisy company names across vendor lists for deduplication.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License