Start of Tutorial > Start of Trail > Start of Lesson |
Search
Feedback Form |
Until now, we've only been interested in whether or not a match is found at some location within a particular input string. We never cared about where in the string the match was taking place.You can make your pattern matches more precise by specifying such information with boundary matchers. For example, maybe you're interested in finding a particular word, but only if it appears at the beginning or end of a line. Or maybe you want to know if the match is taking place on a word boundary, or at the end of the previous match.
The following table lists and explains all the boundary matchers.
The following examples demonstrate the use of boundary matchers
Boundary Matchers ^
The beginning of a line $
The end of a line \b
A word boundary \B
A non-word boundary \A
The beginning of the input \G
The end of the previous match \Z
The end of the input but for the final terminator, if any \z
The end of the input ^
and$
. As noted above,^
matches the beginning of a line, and$
matches the end.The first example is successful because the pattern occupies the entire input string. The second example fails because the input string contains extra whitespace at the beginning. The third example specifies an expression that allows for unlimited white space, followed by "dog" on the end of the line. The fourth example requires "dog" to be present at the beginning of a line followed by an unlimited number of word characters.Current REGEX is: ^dog$ Current INPUT is: dog I found the text "dog" starting at index 0 and ending at index 3. Current REGEX is: ^dog$ Current INPUT is: dog No match found. Current REGEX is: \s*dog$ Current INPUT is: dog I found the text " dog" starting at index 0 and ending at index 15. Current REGEX is: ^dog\w* Current INPUT is: dogblahblah I found the text "dogblahblah" starting at index 0 and ending at index 11.To check if a pattern begins and ends on a word boundary (as opposed to a substring within a longer string), just use
\b
on either side; for example,\bdog\b
To match the expression on a non-word boundary, useCurrent REGEX is: \bdog\b Current INPUT is: The dog plays in the yard. I found the text "dog" starting at index 4 and ending at index 7. Current REGEX is: \bdog\b Current INPUT is: The doggie plays in the yard. No match found.\B
instead:To require the match to occur only at the end of the previous match, useCurrent REGEX is: \bdog\B Current INPUT is: The dog plays in the yard. No match found. Current REGEX is: \bdog\B Current INPUT is: The doggie plays in the yard. I found the text "dog" starting at index 4 and ending at index 7.\G
:Here the second example finds only one match, because the second occurrence of "dog" does not start at the end of the previous match.Current REGEX is: dog // Without \G Current INPUT is: dog dog I found the text "dog" starting at index 0 and ending at index 3. I found the text "dog" starting at index 4 and ending at index 7. Current REGEX is: \Gdog // With \G Current INPUT is: dog dog I found the text "dog" starting at index 0 and ending at index 3.
Start of Tutorial > Start of Trail > Start of Lesson |
Search
Feedback Form |
Copyright 1995-2004 Sun Microsystems, Inc. All rights reserved.