Package: stringi 1.8.4.9001

Marek Gagolewski

stringi: Fast and Portable Character String Processing Facilities

A collection of character string/text/natural language processing tools for pattern searching (e.g., with 'Java'-like regular expressions or the 'Unicode' collation algorithm), random string generation, case mapping, string transliteration, concatenation, sorting, padding, wrapping, Unicode normalisation, date-time formatting and parsing, and many more. They are fast, consistent, convenient, and - thanks to 'ICU' (International Components for Unicode) - portable across all locales and platforms. Documentation about 'stringi' is provided via its website at <https://stringi.gagolewski.com/> and the paper by Gagolewski (2022, <doi:10.18637/jss.v103.i02>).
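A few of the facilities named above can be sketched as follows. This is a minimal illustration, assuming 'stringi' is installed; the input strings and variable names are made up for the example.

```r
library(stringi)

# Pattern searching with ICU regular expressions; NA inputs yield NA
has_comma <- stri_detect_regex(c("spam, eggs", "bacon", NA), ",")
# TRUE FALSE NA

# Case mapping: full Unicode mapping, so the sharp s (U+00DF) becomes "SS"
upper <- stri_trans_toupper("stra\u00dfe")
# "STRASSE"

# Padding to a fixed width
padded <- stri_pad_left("42", width = 5, pad = "0")
# "00042"

# Unicode normalisation: "e" + combining acute accent composes to one code point
composed <- stri_trans_nfc("e\u0301")
n_code_points <- stri_length(composed)
# 1

# Locale-aware sorting based on the Unicode collation algorithm
sorted <- stri_sort(c("b", "C", "a"), locale = "en_US")
# "a" "b" "C"
```

Note that the sort is case-insensitive at the primary level, unlike a plain byte-wise comparison, which would place "C" first.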

Authors: Marek Gagolewski [aut, cre, cph], Bartek Tartanus [ctb], and others; Unicode, Inc. and others

stringi_1.8.4.9001.tar.gz
stringi_1.8.4.9001.zip (r-4.5) stringi_1.8.4.9001.zip (r-4.4) stringi_1.8.4.9001.zip (r-4.3)
stringi_1.8.4.9001.tgz (r-4.4-x86_64) stringi_1.8.4.9001.tgz (r-4.4-arm64) stringi_1.8.4.9001.tgz (r-4.3-x86_64) stringi_1.8.4.9001.tgz (r-4.3-arm64)
stringi_1.8.4.9001.tar.gz (r-4.5-noble) stringi_1.8.4.9001.tar.gz (r-4.4-noble)
stringi_1.8.4.9001.tgz (r-4.4-emscripten) stringi_1.8.4.9001.tgz (r-4.3-emscripten)
stringi.pdf | stringi.html
stringi/json (API)
NEWS

# Install 'stringi' in R:
install.packages('stringi', repos = c('https://fastverse.r-universe.dev', 'https://cloud.r-project.org'))
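After installation, a quick check along the following lines can confirm that the package loads and show which ICU build it is using. This is only a sketch; the variable names are illustrative.

```r
library(stringi)

# Version of the installed package
installed_version <- packageVersion("stringi")
print(installed_version)

# One-line summary of the package's default settings (locale, encoding, ICU)
info <- stri_info(short = TRUE)
cat(info, "\n")
```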

Bug tracker: https://github.com/gagolews/stringi/issues

Uses libs:
  • icu – International Components for Unicode
  • c++ – GNU Standard C++ Library v3

icu icu4c natural-language-processing nlp regex regexp string-manipulation stringi stringr text text-processing tidy-data unicode

18.53 score 304 stars 8.3k packages 10k scripts 994k downloads 34 mentions 256 exports 0 dependencies

Last updated 4 months ago from: 1304a6c35b. Checks: OK: 4, NOTE: 5. Indexed: no.

Target | Result | Date
Doc / Vignettes | OK | Nov 08 2024
R-4.5-win-x86_64 | NOTE | Nov 08 2024
R-4.5-linux-x86_64 | NOTE | Nov 08 2024
R-4.4-win-x86_64 | NOTE | Nov 08 2024
R-4.4-mac-x86_64 | NOTE | Nov 08 2024
R-4.4-mac-aarch64 | NOTE | Nov 08 2024
R-4.3-win-x86_64 | OK | Nov 08 2024
R-4.3-mac-x86_64 | OK | Nov 08 2024
R-4.3-mac-aarch64 | OK | Nov 08 2024

Exports: %s!=% %s!==% %s*% %s+% %s<% %s<=% %s==% %s===% %s>% %s>=% %s$% %stri!=% %stri!==% %stri*% %stri+% %stri<% %stri<=% %stri==% %stri===% %stri>% %stri>=% %stri$% stri_c stri_c_list stri_cmp stri_cmp_eq stri_cmp_equiv stri_cmp_ge stri_cmp_gt stri_cmp_le stri_cmp_lt stri_cmp_neq stri_cmp_nequiv stri_coll stri_compare stri_conv stri_count stri_count_boundaries stri_count_charclass stri_count_coll stri_count_fixed stri_count_regex stri_count_words stri_datetime_add stri_datetime_add<- stri_datetime_create stri_datetime_fields stri_datetime_format stri_datetime_fstr stri_datetime_now stri_datetime_parse stri_datetime_symbols stri_detect stri_detect_charclass stri_detect_coll stri_detect_fixed stri_detect_regex stri_dup stri_duplicated stri_duplicated_any stri_enc_detect stri_enc_detect2 stri_enc_fromutf32 stri_enc_get stri_enc_info stri_enc_isascii stri_enc_isutf16be stri_enc_isutf16le stri_enc_isutf32be stri_enc_isutf32le stri_enc_isutf8 stri_enc_list stri_enc_mark stri_enc_set stri_enc_toascii stri_enc_tonative stri_enc_toutf32 stri_enc_toutf8 stri_encode stri_endswith stri_endswith_charclass stri_endswith_coll stri_endswith_fixed stri_escape_unicode stri_extract stri_extract_all stri_extract_all_boundaries stri_extract_all_charclass stri_extract_all_coll stri_extract_all_fixed stri_extract_all_regex stri_extract_all_words stri_extract_first stri_extract_first_boundaries stri_extract_first_charclass stri_extract_first_coll stri_extract_first_fixed stri_extract_first_regex stri_extract_first_words stri_extract_last stri_extract_last_boundaries stri_extract_last_charclass stri_extract_last_coll stri_extract_last_fixed stri_extract_last_regex stri_extract_last_words stri_flatten stri_info stri_isempty stri_join stri_join_list stri_length stri_list2matrix stri_locale_get stri_locale_info stri_locale_list stri_locale_set stri_locate stri_locate_all stri_locate_all_boundaries stri_locate_all_charclass stri_locate_all_coll stri_locate_all_fixed stri_locate_all_regex stri_locate_all_words stri_locate_first stri_locate_first_boundaries stri_locate_first_charclass stri_locate_first_coll stri_locate_first_fixed stri_locate_first_regex stri_locate_first_words stri_locate_last stri_locate_last_boundaries stri_locate_last_charclass stri_locate_last_coll stri_locate_last_fixed stri_locate_last_regex stri_locate_last_words stri_match stri_match_all stri_match_all_regex stri_match_first stri_match_first_regex stri_match_last stri_match_last_regex stri_na2empty stri_numbytes stri_omit_empty stri_omit_empty_na stri_omit_na stri_opts_brkiter stri_opts_collator stri_opts_fixed stri_opts_regex stri_order stri_pad stri_pad_both stri_pad_left stri_pad_right stri_paste stri_paste_list stri_printf stri_rand_lipsum stri_rand_shuffle stri_rand_strings stri_rank stri_read_lines stri_read_raw stri_remove_empty stri_remove_empty_na stri_remove_na stri_replace stri_replace_all stri_replace_all_charclass stri_replace_all_coll stri_replace_all_fixed stri_replace_all_regex stri_replace_first stri_replace_first_charclass stri_replace_first_coll stri_replace_first_fixed stri_replace_first_regex stri_replace_last stri_replace_last_charclass stri_replace_last_coll stri_replace_last_fixed stri_replace_last_regex stri_replace_na stri_replace_rstr stri_reverse stri_sort stri_sort_key stri_split stri_split_boundaries stri_split_charclass stri_split_coll stri_split_fixed stri_split_lines stri_split_lines1 stri_split_regex stri_sprintf stri_startswith stri_startswith_charclass stri_startswith_coll stri_startswith_fixed stri_stats_general stri_stats_latex stri_string_format stri_sub stri_sub_all stri_sub_all_replace stri_sub_all<- stri_sub_replace stri_sub_replace_all stri_sub<- stri_subset stri_subset_charclass stri_subset_charclass<- stri_subset_coll stri_subset_coll<- stri_subset_fixed stri_subset_fixed<- stri_subset_regex stri_subset_regex<- stri_subset<- stri_timezone_get stri_timezone_info stri_timezone_list stri_timezone_set stri_trans_casefold stri_trans_char stri_trans_general stri_trans_isnfc stri_trans_isnfd stri_trans_isnfkc stri_trans_isnfkc_casefold stri_trans_isnfkd stri_trans_list stri_trans_nfc stri_trans_nfd stri_trans_nfkc stri_trans_nfkc_casefold stri_trans_nfkd stri_trans_tolower stri_trans_totitle stri_trans_toupper stri_trim stri_trim_both stri_trim_left stri_trim_right stri_unescape_unicode stri_unique stri_width stri_wrap stri_write_lines
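Many of the exported names above share an operation prefix and differ only in a search-engine suffix (_regex, _fixed, _coll, _charclass). The sketch below illustrates how those variants and a few of the binary operators behave; the sample vector and variable names are made up for the example.

```r
library(stringi)

x <- c("apple pie", "Banana split", "cherry.tart")

# _fixed: the pattern is taken literally, so "." is just a dot
by_fixed <- stri_detect_fixed(x, ".")
# FALSE FALSE TRUE

# _regex: ICU regular expression; here, strings starting with A-Z
by_regex <- stri_detect_regex(x, "^[A-Z]")
# FALSE TRUE FALSE

# _charclass: a single Unicode character class; here, any uppercase letter
by_class <- stri_detect_charclass(x, "\\p{Lu}")
# FALSE TRUE FALSE

# _coll: locale-sensitive matching; strength = 1 ignores case differences
by_coll <- stri_detect_coll(x, "BANANA", strength = 1)
# FALSE TRUE FALSE

# The generic wrapper selects the engine through a named argument
by_generic <- stri_detect(x, fixed = ".")

# Binary operators: concatenation, comparison, and duplication
joined <- "a" %s+% "b"     # "ab"
is_less <- "a" %s<% "b"    # TRUE
tripled <- "ab" %s*% 3     # "ababab"
```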

Dependencies: none

Readme and manuals

Help Manual

Help page | Topics
Concatenate Two Character Vectors | %s+% %stri+% operator_add operator_plus oper_plus
Compare Strings with or without Collation | %s!=% %s!==% %s<% %s<=% %s==% %s===% %s>% %s>=% %stri!=% %stri!==% %stri<% %stri<=% %stri==% %stri===% %stri>% %stri>=% operator_compare oper_compare oper_comparison
C-Style Formatting with 'stri_sprintf' as a Binary Operator | %s$% %stri$% operator_dollar oper_dollar
Passing Arguments to Functions in 'stringi' | about_arguments arguments stringi-arguments
Character Encodings and 'stringi' | about_encoding encoding stringi-encoding
Locales and 'stringi' | about_locale locale stringi-locale
String Searching | about_search search stringi-search
Text Boundary Analysis in 'stringi' | about_search_boundaries search_boundaries stringi-search-boundaries
Character Classes in 'stringi' | about_search_charclass search_charclass stringi-search-charclass
Locale-Sensitive Text Searching in 'stringi' | about_search_coll search_coll stringi-search-coll
Locale-Insensitive Fixed Pattern Matching in 'stringi' | about_search_fixed search_fixed stringi-search-fixed
Regular Expressions in 'stringi' | about_search_regex search_regex stringi-search-regex
Fast and Portable Character String Processing in R | -package stringi-package about_stringi stringi
Compare Strings with or without Collation | stri_cmp stri_cmp_eq stri_cmp_equiv stri_cmp_ge stri_cmp_gt stri_cmp_le stri_cmp_lt stri_cmp_neq stri_cmp_nequiv stri_compare
Count the Number of Pattern Occurrences | stri_count stri_count_charclass stri_count_coll stri_count_fixed stri_count_regex
Count the Number of Text Boundaries | stri_count_boundaries stri_count_words
Date and Time Arithmetic | stri_datetime_add stri_datetime_add<-
Create a Date-Time Object | stri_datetime_create
Get Values for Date and Time Fields | stri_datetime_fields
Date and Time Formatting and Parsing | stri_datetime_format stri_datetime_parse
Convert 'strptime'-Style Format Strings | stri_datetime_fstr
Get Current Date and Time | stri_datetime_now
List Localizable Date-Time Formatting Data | stri_datetime_symbols
Detect Pattern Occurrences | stri_detect stri_detect_charclass stri_detect_coll stri_detect_fixed stri_detect_regex
Duplicate Strings | %s*% %stri*% operator_multiply oper_multiply stri_dup
Determine Duplicated Elements | stri_duplicated stri_duplicated_any
Detect Character Set and Language | stri_enc_detect
[DEPRECATED] Detect Locale-Sensitive Character Encoding | stri_enc_detect2
Convert From UTF-32 | stri_enc_fromutf32
Query a Character Encoding | stri_enc_info
Check If a Data Stream Is Possibly in ASCII | stri_enc_isascii
Check If a Data Stream Is Possibly in UTF-16 or UTF-32 | stri_enc_isutf16be stri_enc_isutf16le stri_enc_isutf32be stri_enc_isutf32le
Check If a Data Stream Is Possibly in UTF-8 | stri_enc_isutf8
List Known Character Encodings | stri_enc_list
Get Declared Encodings of Each String | stri_enc_mark
Set or Get Default Character Encoding in 'stringi' | stri_enc_get stri_enc_set
Convert To ASCII | stri_enc_toascii
Convert Strings To Native Encoding | stri_enc_tonative
Convert Strings To UTF-32 | stri_enc_toutf32
Convert Strings To UTF-8 | stri_enc_toutf8
Convert Strings Between Given Encodings | stri_conv stri_encode
Escape Unicode Code Points | stri_escape_unicode
Extract Pattern Occurrences | stri_extract stri_extract_all stri_extract_all_charclass stri_extract_all_coll stri_extract_all_fixed stri_extract_all_regex stri_extract_first stri_extract_first_charclass stri_extract_first_coll stri_extract_first_fixed stri_extract_first_regex stri_extract_last stri_extract_last_charclass stri_extract_last_coll stri_extract_last_fixed stri_extract_last_regex
Extract Data Between Text Boundaries | stri_extract_all_boundaries stri_extract_all_words stri_extract_first_boundaries stri_extract_first_words stri_extract_last_boundaries stri_extract_last_words
Flatten a String | stri_flatten
Query Default Settings for 'stringi' | stri_info
Determine if a String is of Length Zero | stri_isempty
Concatenate Character Vectors | stri_c stri_join stri_paste
Concatenate Strings in a List | stri_c_list stri_join_list stri_paste_list
Count the Number of Code Points | stri_length
Convert a List to a Character Matrix | stri_list2matrix
Query Given Locale | stri_locale_info
List Available Locales | stri_locale_list
Set or Get Default Locale in 'stringi' | stri_locale_get stri_locale_set
Locate Pattern Occurrences | stri_locate stri_locate_all stri_locate_all_charclass stri_locate_all_coll stri_locate_all_fixed stri_locate_all_regex stri_locate_first stri_locate_first_charclass stri_locate_first_coll stri_locate_first_fixed stri_locate_first_regex stri_locate_last stri_locate_last_charclass stri_locate_last_coll stri_locate_last_fixed stri_locate_last_regex
Locate Text Boundaries | stri_locate_all_boundaries stri_locate_all_words stri_locate_first_boundaries stri_locate_first_words stri_locate_last_boundaries stri_locate_last_words
Extract Regex Pattern Matches, Together with Capture Groups | stri_match stri_match_all stri_match_all_regex stri_match_first stri_match_first_regex stri_match_last stri_match_last_regex
Replace NAs with Empty Strings | stri_na2empty
Count the Number of Bytes | stri_numbytes
Generate a List with BreakIterator Settings | stri_opts_brkiter
Generate a List with Collator Settings | stri_coll stri_opts_collator
Generate a List with Fixed Pattern Search Engine's Settings | stri_opts_fixed
Generate a List with Regex Matcher Settings | stri_opts_regex
Ordering Permutation | stri_order
Pad (Center/Left/Right Align) a String | stri_pad stri_pad_both stri_pad_left stri_pad_right
A Lorem Ipsum Generator | stri_rand_lipsum
Randomly Shuffle Code Points in Each String | stri_rand_shuffle
Generate Random Strings | stri_rand_strings
Ranking | stri_rank
Read Text Lines from a Text File | stri_read_lines
Read Text File as Raw | stri_read_raw
Remove All Empty Strings from a Character Vector | stri_omit_empty stri_omit_empty_na stri_omit_na stri_remove_empty stri_remove_empty_na stri_remove_na
Replace Pattern Occurrences | stri_replace stri_replace_all stri_replace_all_charclass stri_replace_all_coll stri_replace_all_fixed stri_replace_all_regex stri_replace_first stri_replace_first_charclass stri_replace_first_coll stri_replace_first_fixed stri_replace_first_regex stri_replace_last stri_replace_last_charclass stri_replace_last_coll stri_replace_last_fixed stri_replace_last_regex
Replace Missing Values in a Character Vector | stri_replace_na
Convert gsub-Style Replacement Strings | stri_replace_rstr
Reverse Each String | stri_reverse
String Sorting | stri_sort
Sort Keys | stri_sort_key
Split a String By Pattern Matches | stri_split stri_split_charclass stri_split_coll stri_split_fixed stri_split_regex
Split a String at Text Boundaries | stri_split_boundaries
Split a String Into Text Lines | stri_split_lines stri_split_lines1
Format Strings | stri_printf stri_sprintf stri_string_format
Determine if the Start or End of a String Matches a Pattern | stri_endswith stri_endswith_charclass stri_endswith_coll stri_endswith_fixed stri_startswith stri_startswith_charclass stri_startswith_coll stri_startswith_fixed
General Statistics for a Character Vector | stri_stats_general
Statistics for a Character Vector Containing LaTeX Commands | stri_stats_latex
Extract a Substring From or Replace a Substring In a Character Vector | stri_sub stri_sub<- stri_sub_replace
Extract or Replace Multiple Substrings | stri_sub_all stri_sub_all<- stri_sub_all_replace stri_sub_replace_all
Select Elements that Match a Given Pattern | stri_subset stri_subset<- stri_subset_charclass stri_subset_charclass<- stri_subset_coll stri_subset_coll<- stri_subset_fixed stri_subset_fixed<- stri_subset_regex stri_subset_regex<-
Set or Get Default Time Zone in 'stringi' | stri_timezone_get stri_timezone_set
Query a Given Time Zone | stri_timezone_info
List Available Time Zone Identifiers | stri_timezone_list
Translate Characters | stri_trans_char
General Text Transforms, Including Transliteration | stri_trans_general
List Available Text Transforms and Transliterators | stri_trans_list
Perform or Check For Unicode Normalization | stri_trans_isnfc stri_trans_isnfd stri_trans_isnfkc stri_trans_isnfkc_casefold stri_trans_isnfkd stri_trans_nfc stri_trans_nfd stri_trans_nfkc stri_trans_nfkc_casefold stri_trans_nfkd
Transform Strings with Case Mapping or Folding | stri_trans_casefold stri_trans_tolower stri_trans_totitle stri_trans_toupper
Trim Characters from the Left and/or Right Side of a String | stri_trim stri_trim_both stri_trim_left stri_trim_right
Un-escape All Escape Sequences | stri_unescape_unicode
Extract Unique Elements | stri_unique
Determine the Width of Code Points | stri_width
Word Wrap Text to Format Paragraphs | stri_wrap
Write Text Lines to a Text File | stri_write_lines
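
As a rough orientation to the help topics listed above, the following sketch exercises a handful of them. The inputs and variable names are illustrative only, and the date example assumes the "en_US" locale and the UTC time zone are available in the local ICU build.

```r
library(stringi)

# Replace Pattern Occurrences: reorder date parts via capture groups ($1, $2, ...)
swapped <- stri_replace_all_regex("2024-11-08", "(\\d+)-(\\d+)-(\\d+)", "$3/$2/$1")
# "08/11/2024"

# Extract Regex Pattern Matches: whole match first, then each capture group
m <- stri_match_first_regex("key=value", "(\\w+)=(\\w+)")
# 1 x 3 character matrix: "key=value" "key" "value"

# Split a String By Pattern Matches: one list element per input string
parts <- stri_split_fixed("a,b,c", ",")
# list(c("a", "b", "c"))

# Extract a Substring: positions are counted in code points
prefix <- stri_sub("stringi", from = 1, to = 6)
# "string"

# Trim Characters: strips white space on both sides by default
trimmed <- stri_trim_both("  hello  ")
# "hello"

# Count the Number of Text Boundaries: words found by ICU's break iterator
n_words <- stri_count_words("The quick brown fox")
# 4

# Date and Time Formatting: ICU pattern letters rather than strptime codes
stamp <- stri_datetime_format(
  stri_datetime_create(2024, 11, 8, tz = "UTC"),
  "uuuu-MM-dd", tz = "UTC", locale = "en_US"
)
# "2024-11-08"
```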