#[non_exhaustive]pub struct WordBreakOptions<'a> {
pub content_locale: Option<&'a LanguageIdentifier>,
pub invariant_options: WordBreakInvariantOptions,
}Expand description
Options to tailor word breaking behavior.
Fields (Non-exhaustive)§
This struct is marked as non-exhaustive
Non-exhaustive structs could have additional fields added in future. Therefore, non-exhaustive structs cannot be constructed in external crates using the traditional
Struct { .. } syntax; cannot be matched against without a wildcard ..; and struct update syntax will not work.content_locale: Option<&'a LanguageIdentifier>Content locale for word segmenter
If you know the language of the text being segmented, provide it here in order to produce higher quality breakpoints.
§Examples
Normally, a colon character ‘:’ is a word separator:
use icu::segmenter::WordSegmenter;
let segmenter = WordSegmenter::new_auto(Default::default());
let breakpoints: Vec<usize> = segmenter.segment_str("EU:ssa").collect();
assert_eq!(&breakpoints, &[0, 2, 3, 6]);But not in Finnish, where it is used for loanwords:
use icu::locale::langid;
use icu::segmenter::options::WordBreakOptions;
use icu::segmenter::WordSegmenter;
let mut options = WordBreakOptions::default();
let langid = &langid!("fi");
options.content_locale = Some(langid);
let segmenter = WordSegmenter::try_new_auto(options).unwrap();
let breakpoints: Vec<usize> =
segmenter.as_borrowed().segment_str("EU:ssa").collect();
assert_eq!(&breakpoints, &[0, 6]);invariant_options: WordBreakInvariantOptionsOptions independent of the locale
Trait Implementations§
Source§impl<'a> Clone for WordBreakOptions<'a>
impl<'a> Clone for WordBreakOptions<'a>
Source§fn clone(&self) -> WordBreakOptions<'a>
fn clone(&self) -> WordBreakOptions<'a>
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl<'a> Debug for WordBreakOptions<'a>
impl<'a> Debug for WordBreakOptions<'a>
Source§impl<'a> Default for WordBreakOptions<'a>
impl<'a> Default for WordBreakOptions<'a>
Source§fn default() -> WordBreakOptions<'a>
fn default() -> WordBreakOptions<'a>
Returns the “default value” for a type. Read more
Source§impl<'a> PartialEq for WordBreakOptions<'a>
impl<'a> PartialEq for WordBreakOptions<'a>
impl<'a> Copy for WordBreakOptions<'a>
impl<'a> Eq for WordBreakOptions<'a>
impl<'a> StructuralPartialEq for WordBreakOptions<'a>
Auto Trait Implementations§
impl<'a> Freeze for WordBreakOptions<'a>
impl<'a> RefUnwindSafe for WordBreakOptions<'a>
impl<'a> Send for WordBreakOptions<'a>
impl<'a> Sync for WordBreakOptions<'a>
impl<'a> Unpin for WordBreakOptions<'a>
impl<'a> UnwindSafe for WordBreakOptions<'a>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more