Package 'morphemepiece.data' reference manual

Title:	Data for Morpheme Tokenization
Description:	Provides data about morphemes, the smallest units of meaning in a language.
Authors:	Jonathan Bratt [aut] (ORCID: <https://orcid.org/0000-0003-2859-0076>), Jon Harmon [aut, cre] (ORCID: <https://orcid.org/0000-0003-4781-4346>), Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning [cph]
Maintainer:	Jon Harmon <[email protected]>
License:	Apache License (>= 2)
Version:	1.2.0
Built:	2026-05-21 08:04:14 UTC
Source:	https://github.com/macmillancontentscience/morphemepiece.data

Load a Morphemepiece Lookup

Description

A morphemepiece lookup is a named character vector. The names of the vector are the words, and the values are the space-separated morpheme breakdowns of those words.

Usage

morphemepiece_lookup()
morphemepiece_lookup()

Value

A named character vector.

Examples

head(morphemepiece_lookup())
head(morphemepiece_lookup())

Load a Morphemepiece Vocabulary

Description

A morphemepiece vocabulary is a named integer vector with class "morphemepiece_vocabulary". The names of the vector are the morphemes, and the values are the integer identifiers of those tokens. The vocabulary is 0-indexed for compatibility with Python implementations.

Usage

morphemepiece_vocab()
morphemepiece_vocab()

Value

A morphemepiece_vocabulary.

Examples

head(morphemepiece_vocab())
head(morphemepiece_vocab())

Package 'morphemepiece.data'

Help Index

Load a Morphemepiece Lookup

Description

Usage

Value

Examples

Load a Morphemepiece Vocabulary

Description

Usage

Value

Examples