Package: stringdist 0.9.12

Mark van der Loo

stringdist: Approximate String Matching, Fuzzy Text Search, and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Also offers fuzzy text search based on various string distance measures. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well. Reference: MPJ van der Loo (2014) <doi:10.32614/RJ-2014-011>.

Authors:Mark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb], Chris Muir [ctb], Johannes Gruber [ctb], Brian Ripley [ctb]

stringdist.pdf |stringdist.html
stringdist/json (API)

# Install stringdist in R:
install.packages('stringdist', repos = c('', ''))

Peer review:

Bug tracker:

Uses libs:
  • openmp– GCC OpenMP (GOMP) support library


19 exports 314 stars 8.95 score 0 dependencies 218 dependents

Last updated 2 months agofrom:6da3a6c58795808e1e95237a11beac28b380bfe9



RJournal 6 111-122 (2014)

Rendered fromRJournal_6_111-122-2014.Rnwusingutils::Sweaveon May 23 2024.

Last update: 2018-08-29
Started: 2018-08-29

stringdist C/C++ API

Rendered fromstringdist_C-Cpp_api.Rnwusingutils::Sweaveon May 23 2024.

Last update: 2018-08-29
Started: 2018-08-29