| Encoding {base} | R Documentation |
Read or Set the Declared Encodings for a Character Vector
Description
Read or set the declared encodings for a character vector.
Usage
Encoding(x)
Encoding(x) <- value
Arguments
| x | A character vector. |
| value | A character vector of positive length. |
Details
As of R 2.5.0, character strings in R can be declared to be in
"latin1" or "UTF-8". These declarations can be read by
Encoding, which returns a character vector of values
"latin1", "UTF-8" or "unknown". They can also be set with the
replacement form, in which case value is recycled as needed and
any other values are silently treated as "unknown".
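The reading and setting behaviour described above can be sketched as follows. This is a minimal illustration, not part of the original page: the variable names and the string "no-such-encoding" are made up for the example, and the printed form of the strings depends on the locale.

```r
# Two strings whose bytes are intended as latin1
# (\xE7 is c-cedilla and \xE9 is e-acute in that encoding).
x <- c("fa\xE7ile", "caf\xE9")

# A length-one value is recycled across every element of x.
Encoding(x) <- "latin1"
enc <- Encoding(x)
enc

# A value other than "latin1" or "UTF-8" is accepted without a warning
# and the string is treated as "unknown".
# "no-such-encoding" is a hypothetical value chosen for illustration.
y <- x[1]
Encoding(y) <- "no-such-encoding"
Encoding(y)
```

Setting a declaration does not change the bytes of the string; it only changes how R is told to interpret them.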
There are other ways for character strings to acquire a declared
encoding apart from explicitly setting it. The functions
scan, read.table, readLines,
parse and source have an encoding
argument that is used to declare encodings. iconv
declares the encoding of its result from its to argument, and
console input in suitable locales is also declared.
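As a hedged sketch of two of these routes, the code below writes the latin1 bytes of a word to a temporary file, reads them back with the encoding argument of readLines, and then converts the result with iconv. The temporary file and the variable names are assumptions made for the example.

```r
# Write the latin1 bytes of "facile" with a c-cedilla (0xE7), plus a newline,
# to a temporary file so the example does not depend on any existing file.
tmp <- tempfile()
writeBin(as.raw(c(0x66, 0x61, 0xE7, 0x69, 0x6C, 0x65, 0x0A)), tmp)

# The encoding argument declares the strings read; it does not re-encode them.
lines <- readLines(tmp, encoding = "latin1")
Encoding(lines)

# The result of iconv carries a declared encoding as well.
utf8 <- iconv(lines, from = "latin1", to = "UTF-8")
Encoding(utf8)

unlink(tmp)
```

Pure ASCII strings are not expected to pick up a declaration this way, so the example uses a string that contains a non-ASCII byte.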
Value
A character vector of the same length as x, with elements
"latin1", "UTF-8" or "unknown".
Examples
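## x is intended to be in latin1 (\xE7 is c-cedilla in that encoding)
x <- "fa\xE7ile"
Encoding(x)              # read the declaration before setting one
Encoding(x) <- "latin1"  # declare the bytes to be latin1
x
## convert to UTF-8; the result of iconv carries a declared encoding
xx <- iconv(x, "latin1", "UTF-8")
Encoding(c(x, xx))       # each element keeps its own declaration
c(x, xx)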