regex

Struct Match

Source
pub struct Match<'h> { /* private fields */ }
Expand description

Represents a single match of a regex in a haystack.

A Match contains both the start and end byte offsets of the match and the actual substring corresponding to the range of those byte offsets. It is guaranteed that start <= end. When start == end, the match is empty.

Since this Match can only be produced by the top-level Regex APIs that only support searching UTF-8 encoded strings, the byte offsets for a Match are guaranteed to fall on valid UTF-8 codepoint boundaries. That is, slicing a &str with Match::range is guaranteed to never panic.

Values with this type are created by Regex::find or Regex::find_iter. Other APIs can create Match values too. For example, Captures::get.

The lifetime parameter 'h refers to the lifetime of the matched of the haystack that this match was produced from.

§Numbering

The byte offsets in a Match form a half-open interval. That is, the start of the range is inclusive and the end of the range is exclusive. For example, given a haystack abcFOOxyz and a match of FOO, its byte offset range starts at 3 and ends at 6. 3 corresponds to F and 6 corresponds to x, which is one past the end of the match. This corresponds to the same kind of slicing that Rust uses.

For more on why this was chosen over other schemes (aside from being consistent with how Rust the language works), see this discussion and Dijkstra’s note on a related topic.

§Example

This example shows the value of each of the methods on Match for a particular search.

use regex::Regex;

let re = Regex::new(r"\p{Greek}+").unwrap();
let hay = "Greek: αβγδ";
let m = re.find(hay).unwrap();
assert_eq!(7, m.start());
assert_eq!(15, m.end());
assert!(!m.is_empty());
assert_eq!(8, m.len());
assert_eq!(7..15, m.range());
assert_eq!("αβγδ", m.as_str());

Implementations§

Source§

impl<'h> Match<'h>

Source

pub fn start(&self) -> usize

Returns the byte offset of the start of the match in the haystack. The start of the match corresponds to the position where the match begins and includes the first byte in the match.

It is guaranteed that Match::start() <= Match::end().

This is guaranteed to fall on a valid UTF-8 codepoint boundary. That is, it will never be an offset that appears between the UTF-8 code units of a UTF-8 encoded Unicode scalar value. Consequently, it is always safe to slice the corresponding haystack using this offset.

Source

pub fn end(&self) -> usize

Returns the byte offset of the end of the match in the haystack. The end of the match corresponds to the byte immediately following the last byte in the match. This means that &slice[start..end] works as one would expect.

It is guaranteed that Match::start() <= Match::end().

This is guaranteed to fall on a valid UTF-8 codepoint boundary. That is, it will never be an offset that appears between the UTF-8 code units of a UTF-8 encoded Unicode scalar value. Consequently, it is always safe to slice the corresponding haystack using this offset.

Source

pub fn is_empty(&self) -> bool

Returns true if and only if this match has a length of zero.

Note that an empty match can only occur when the regex itself can match the empty string. Here are some examples of regexes that can all match the empty string: ^, ^$, \b, a?, a*, a{0}, (foo|\d+|quux)?.

Source

pub fn len(&self) -> usize

Returns the length, in bytes, of this match.

Source

pub fn range(&self) -> Range<usize>

Returns the range over the starting and ending byte offsets of the match in the haystack.

It is always correct to slice the original haystack searched with this range. That is, because the offsets are guaranteed to fall on valid UTF-8 boundaries, the range returned is always valid.

Source

pub fn as_str(&self) -> &'h str

Returns the substring of the haystack that matched.

Trait Implementations§

Source§

impl<'h> Clone for Match<'h>

Source§

fn clone(&self) -> Match<'h>

Returns a copy of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl<'h> Debug for Match<'h>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<'h> From<Match<'h>> for &'h str

Source§

fn from(m: Match<'h>) -> &'h str

Converts to this type from the input type.
Source§

impl<'h> From<Match<'h>> for Range<usize>

Source§

fn from(m: Match<'h>) -> Range<usize>

Converts to this type from the input type.
Source§

impl<'h> PartialEq for Match<'h>

Source§

fn eq(&self, other: &Match<'h>) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl<'h> Copy for Match<'h>

Source§

impl<'h> Eq for Match<'h>

Source§

impl<'h> StructuralPartialEq for Match<'h>

Auto Trait Implementations§

§

impl<'h> Freeze for Match<'h>

§

impl<'h> RefUnwindSafe for Match<'h>

§

impl<'h> Send for Match<'h>

§

impl<'h> Sync for Match<'h>

§

impl<'h> Unpin for Match<'h>

§

impl<'h> UnwindSafe for Match<'h>

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dst: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

Layout§

Note: Most layout information is completely unstable and may even differ between compilations. The only exception is types with certain repr(...) attributes. Please see the Rust Reference's “Type Layout” chapter for details on type layout guarantees.

Size: 32 bytes