Package: morphemepiece 1.2.3

Jonathan Bratt

morphemepiece: Morpheme Tokenization

Tokenize text into morphemes. The morphemepiece algorithm uses a lookup table to determine the morpheme breakdown of words, and falls back on a modified wordpiece tokenization algorithm for words not found in the lookup table.
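
The lookup-then-fallback flow described above can be sketched roughly as follows. This is a toy illustration in base R, not the package's implementation: `toy_lookup`, `toy_vocab`, `wordpiece_fallback()`, and `toy_tokenize_word()` are hypothetical names invented here, the "##" continuation marker is an assumed convention, and the fallback is a plain greedy longest-match wordpiece pass rather than the modified variant the package uses.

```r
# Toy sketch only: NOT the morphemepiece implementation.
# All names and data below are hypothetical.

# Hypothetical lookup table: word -> known morpheme breakdown.
toy_lookup <- list(
  walked = c("walk", "##ed")
)

# Hypothetical vocabulary used by the fallback; "##" marks a
# piece that continues a word (an assumed convention).
toy_vocab <- c("walk", "talk", "##ing", "##s", "##ed")

# Plain greedy longest-match-first wordpiece (the standard variant,
# swapped in for the package's modified algorithm).
wordpiece_fallback <- function(word, vocab, unk_token = "[UNK]") {
  pieces <- character(0)
  start <- 1
  n <- nchar(word)
  while (start <= n) {
    end <- n
    match <- NA_character_
    # Try the longest remaining substring first, then shrink it.
    while (end >= start) {
      candidate <- substr(word, start, end)
      if (start > 1) candidate <- paste0("##", candidate)
      if (candidate %in% vocab) {
        match <- candidate
        break
      }
      end <- end - 1
    }
    # If no piece matches, the whole word becomes the unknown token.
    if (is.na(match)) return(unk_token)
    pieces <- c(pieces, match)
    start <- end + 1
  }
  pieces
}

# Look the word up first; fall back to wordpiece only on a miss.
toy_tokenize_word <- function(word, lookup = toy_lookup, vocab = toy_vocab) {
  if (!is.null(lookup[[word]])) {
    return(lookup[[word]])
  }
  wordpiece_fallback(word, vocab)
}

toy_tokenize_word("walked")   # found in the lookup table
toy_tokenize_word("talking")  # not in the lookup; split by the fallback
toy_tokenize_word("xyz")      # no matching pieces; unknown token
```

In the package itself, the exported `morphemepiece_tokenize()` function performs the tokenization; the sketch only illustrates the order of the two steps.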

Authors: Jonathan Bratt [aut, cre], Jon Harmon [aut], Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning [cph]

morphemepiece_1.2.3.tar.gz
morphemepiece_1.2.3.zip (r-4.5), morphemepiece_1.2.3.zip (r-4.4), morphemepiece_1.2.3.zip (r-4.3)
morphemepiece_1.2.3.tgz (r-4.5-any), morphemepiece_1.2.3.tgz (r-4.4-any), morphemepiece_1.2.3.tgz (r-4.3-any)
morphemepiece_1.2.3.tar.gz (r-4.5-noble), morphemepiece_1.2.3.tar.gz (r-4.4-noble)
morphemepiece_1.2.3.tgz (r-4.4-emscripten), morphemepiece_1.2.3.tgz (r-4.3-emscripten)
morphemepiece.pdf | morphemepiece.html
morphemepiece/json (API)
NEWS

# Install 'morphemepiece' in R:
install.packages('morphemepiece', repos = c('https://macmillancontentscience.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/macmillancontentscience/morphemepiece/issues

5.04 score, 11 stars, 8 scripts, 311 downloads, 10 exports, 38 dependencies

Last updated 3 years ago from: bc071b1a03. Checks: 3 OK, 5 NOTE. Indexed: yes.

Target           Result  Latest binary
Doc / Vignettes  OK      Feb 26 2025
R-4.5-win        NOTE    Feb 26 2025
R-4.5-mac        NOTE    Feb 26 2025
R-4.5-linux      NOTE    Feb 26 2025
R-4.4-win        NOTE    Feb 26 2025
R-4.4-mac        NOTE    Feb 26 2025
R-4.3-win        OK      Feb 26 2025
R-4.3-mac        OK      Feb 26 2025

Exports: load_lookup, load_or_retrieve_lookup, load_or_retrieve_vocab, load_vocab, morphemepiece_cache_dir, morphemepiece_lookup, morphemepiece_tokenize, morphemepiece_vocab, prepare_vocab, set_morphemepiece_cache_dir

Dependencies: bit, bit64, cachem, cli, clipr, cpp11, crayon, digest, dlr, fansi, fastmap, fastmatch, fs, glue, hms, lifecycle, magrittr, memoise, morphemepiece.data, piecemaker, pillar, pkgconfig, prettyunits, progress, purrr, R6, rappdirs, readr, rlang, stringi, stringr, tibble, tidyselect, tzdb, utf8, vctrs, vroom, withr

Generating a Vocabulary and Lookup

Rendered from generating_vocab.Rmd using knitr::rmarkdown on Feb 26 2025.

Last update: 2021-10-26
Started: 2021-07-29

Testing the fall-through algorithm

Rendered from algorithm_test.Rmd using knitr::rmarkdown on Feb 26 2025.

Last update: 2021-09-06
Started: 2021-07-29