Technically this returns the number of "code points", in a string. One code point usually corresponds to one character, but not always. For example, an u with a umlaut might be represented as a single character or as the combination a u and an umlaut.

str_length(string)

Arguments

string

Input vector. Either a character vector, or something coercible to one.

Value

A numeric vector giving number of characters (code points) in each element of the character vector. Missing string have missing length.

See also

stringi::stri_length() which this function wraps.

Examples

str_length(letters)
#> [1] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
str_length(NA)
#> [1] NA
str_length(factor("abc"))
#> [1] 3
str_length(c("i", "like", "programming", NA))
#> [1] 1 4 11 NA
# Two ways of representing a u with an umlaut u1 <- "\u00fc" u2 <- stringi::stri_trans_nfd(u1) # The print the same: u1
#> [1] "ü"
u2
#> [1] "ü"
# But have a different length str_length(u1)
#> [1] 1
str_length(u2)
#> [1] 2
# Even though they have the same number of characters str_count(u1)
#> [1] 1
#> [1] 1