Chapter 16. String Algorithms

16.1 Exact Matching

Given a text string and a pattern string, determine whether the pattern occurs in the text and, if so, find all positions where the match begins.
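
The problem statement above can be sketched as a brute-force baseline. This is a minimal illustration rather than a recommended implementation: the name `find_all` is hypothetical, positions are 0-based, and treating an empty pattern as matching at every position is a convention assumed here.

```python
def find_all(text: str, pattern: str) -> list[int]:
    """Return every 0-based index at which pattern begins in text."""
    n, m = len(text), len(pattern)
    if m == 0:
        # Assumed convention: the empty pattern matches at every position.
        return list(range(n + 1))
    positions = []
    # Try each possible starting position; the range is empty when m > n.
    for start in range(n - m + 1):
        if text[start:start + m] == pattern:
            positions.append(start)
    return positions


print(find_all("abracadabra", "abra"))  # [0, 7]
```

Overlapping occurrences are reported too, so searching "aaaa" for "aa" yields three positions. The worst-case cost is proportional to the product of the two lengths, which is the behavior the later recipes aim to improve.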

16.2 Knuth-Morris-Pratt (KMP)

The naive exact matching algorithm repeatedly compares the same characters after every mismatch.
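
One way to avoid those repeated comparisons is to precompute, for each pattern prefix, how far the pattern can shift without losing characters already matched. The sketch below assumes the common prefix-function formulation of KMP; the names `prefix_function` and `kmp_search` are hypothetical, and returning no matches for an empty pattern is a convention chosen for this example.

```python
def prefix_function(pattern: str) -> list[int]:
    """fail[i] is the length of the longest proper prefix of
    pattern[:i + 1] that is also a suffix of it."""
    fail = [0] * len(pattern)
    k = 0  # length of the currently matched border
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]  # fall back to the next shorter border
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail


def kmp_search(text: str, pattern: str) -> list[int]:
    """Return all 0-based start positions of pattern in text."""
    if not pattern:
        return []  # assumed convention for the empty pattern
    fail = prefix_function(pattern)
    matches = []
    k = 0  # number of pattern characters matched so far
    for i, ch in enumerate(text):
        while k > 0 and ch != pattern[k]:
            k = fail[k - 1]  # reuse what is already known to match
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            matches.append(i - k + 1)
            k = fail[k - 1]  # keep going to find overlapping matches
    return matches


print(kmp_search("abababa", "aba"))  # [0, 2, 4]
```

The text index `i` never moves backward, which is why the total work stays linear in the combined length of text and pattern.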

16.3 Z Algorithm

Many string algorithms need to answer the question:

> How many characters match between a string prefix and a substring starting at a particular position?
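
The answer to that question for every position at once is usually called the Z array. The following is a hedged sketch of the standard linear-time construction; the name `z_function` is hypothetical, and setting `z[0]` to the full string length is one of several conventions in use.

```python
def z_function(s: str) -> list[int]:
    """z[i] is the length of the longest common prefix of s and s[i:]."""
    n = len(s)
    z = [0] * n
    if n == 0:
        return z
    z[0] = n  # assumed convention: the whole string matches itself
    left = right = 0  # s[left:right] is the rightmost segment known to match a prefix
    for i in range(1, n):
        if i < right:
            # Reuse the value computed for the mirrored position inside the segment.
            z[i] = min(right - i, z[i - left])
        # Extend the match by direct comparison.
        while i + z[i] < n and s[z[i]] == s[i + z[i]]:
            z[i] += 1
        if i + z[i] > right:
            left, right = i, i + z[i]
    return z


print(z_function("aabxaab"))  # [7, 1, 0, 0, 3, 1, 0]
```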

16.4 Rabin-Karp

Suppose you need to search for a pattern inside a large text.

16.5 Boyer-Moore Overview

Most exact matching algorithms process the text from left to right.

16.6 Trie Matching

Suppose you need to search for many patterns simultaneously.

16.7 Aho-Corasick

Suppose you need to search a text for thousands of patterns simultaneously.

16.8 Suffix Arrays

Suppose you need to answer many substring queries against the same text.

16.9 Longest Common Prefix (LCP) Arrays

In the previous recipe, you built a suffix array and used it to perform efficient substring searches.

16.10 Suffix Automata

Suppose you need to answer questions such as:

- Does a substring occur in the text?

16.11 Palindromic Trees (Eertrees)

Many string algorithms focus on prefixes, suffixes, or arbitrary substrings.

16.12 Manacher Algorithm

You need to find palindromic substrings efficiently.
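
As one concrete instance of that task, the sketch below finds the longest palindromic substring with Manacher's algorithm. It assumes the common trick of interleaving a separator so that even and odd lengths are handled uniformly; the name `longest_palindrome` and the choice to return the leftmost longest palindrome are assumptions made for illustration.

```python
def longest_palindrome(s: str) -> str:
    """Return the leftmost longest palindromic substring of s."""
    # Interleave a separator so every palindrome in t has odd length.
    # The separator never needs to be absent from s: separators sit only at
    # even indices of t, and mirrored indices always share parity.
    t = "#" + "#".join(s) + "#"
    n = len(t)
    radius = [0] * n  # radius[i]: palindrome around i spans t[i - r : i + r + 1]
    center = right = 0  # palindrome with the rightmost boundary found so far
    for i in range(n):
        if i < right:
            # Start from the mirrored position's radius, clipped to the boundary.
            radius[i] = min(right - i, radius[2 * center - i])
        # Extend by direct comparison.
        while (i - radius[i] - 1 >= 0 and i + radius[i] + 1 < n
               and t[i - radius[i] - 1] == t[i + radius[i] + 1]):
            radius[i] += 1
        if i + radius[i] > right:
            center, right = i, i + radius[i]
    best = max(range(n), key=lambda i: radius[i])
    # A radius in t equals the palindrome's length in s.
    start = (best - radius[best]) // 2
    return s[start:start + radius[best]]


print(longest_palindrome("babad"))  # bab
```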

16.13 Edit Distance

Exact matching assumes that strings must be identical.
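
To make the contrast with exact matching concrete, here is a minimal sketch of the classic dynamic-programming edit distance. It assumes the Levenshtein variant, where insertion, deletion, and substitution each cost 1; the name `edit_distance` is hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute, or keep if equal
            ))
        prev = curr
    return prev[-1]


print(edit_distance("kitten", "sitting"))  # 3
```

Keeping only two rows reduces memory to the length of the second string, at the cost of not being able to recover the actual sequence of edits.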

16.14 Longest Common Substring

Given two strings, find the longest contiguous block of characters that appears in both.
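
A straightforward way to solve this is dynamic programming over pairs of end positions. The sketch below is a quadratic-time illustration, not the fastest known approach; the name `longest_common_substring` is hypothetical, and ties are resolved in favor of the occurrence that ends earliest in the first string.

```python
def longest_common_substring(a: str, b: str) -> str:
    """Return one longest string that occurs contiguously in both a and b."""
    best_len = 0
    best_end = 0  # end index (exclusive) of the best match within a
    # prev[j] is the length of the common suffix of a[:i - 1] and b[:j].
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the common suffix ending just before these characters.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len = curr[j]
                    best_end = i
        prev = curr
    return a[best_end - best_len:best_end]


print(longest_common_substring("xabcdy", "zzabcdq"))  # abcd
```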

16.15 Lexicographic Order

Many string algorithms depend on a precise ordering of strings.

16.16 String Hashing

Many string algorithms repeatedly compare substrings.

16.17 Compressed Strings

Many strings contain repeated structure.

16.18 Unicode and Normalization

Many string algorithms assume that a string is simply a sequence of characters.

16.19 Token Streams

Most string algorithms operate on characters.

16.20 Choosing the Right String Algorithm

This chapter introduced a large collection of string-processing techniques:

- Naive matching
- KMP
- Z Algorithm

16.21 Text Indexing

A single pattern search scans one text once.

16.22 Building a Search Engine

Throughout this chapter, we studied individual string-processing techniques:

- Tokenization
- Tries
- Hashing

16.23 Complexity Analysis

String algorithms often look deceptively simple.

16.24 Testing String Algorithms

String algorithms are easy to implement incorrectly.

16.25 Real-World String Processing Patterns

After learning dozens of string algorithms, a natural question remains:

> What do real systems actually do?