csv - fread (data.table in R) with specification of encoding -
could not find proper answer in previous questions , answers problem: 1. have 2.3 gb csv file contains 2.4 million rows of hebrew text, coded in ascii. since talking big file, fread preferable encoding? idea how read csv file coded in ascii avoid famous "embedded nul in string" error?
thank you
as of august 25th case linked david arenburg closed, , functionality included in available version of data.table. encoding parameter can used when calling fread:
text <- fread(file, encoding = 'utf-8')
ascii not explicit encoding option, ascii valid utf-8, can specify utf-8 when want read hebrew text.
Comments
Post a Comment