Clarified in man xorriso the roles of character sets
This commit is contained in:
@ -9,7 +9,7 @@
|
||||
.\" First parameter, NAME, should be all caps
|
||||
.\" Second parameter, SECTION, should be 1-8, maybe w/ subsection
|
||||
.\" other parameters are allowed: see man(7), man(1)
|
||||
.TH XORRISO 1 "Version 1.3.5, Dec 28, 2013"
|
||||
.TH XORRISO 1 "Version 1.3.5, Jan 02, 2014"
|
||||
.\" Please adjust this date whenever revising the manpage.
|
||||
.\"
|
||||
.\" Some roff macros, for reference:
|
||||
@ -3308,30 +3308,42 @@ on differently nationalized terminals.
|
||||
The meanings of byte codes are defined in \fBcharacter sets\fR which have
|
||||
names. Shell command iconv \-l lists them.
|
||||
.br
|
||||
Character sets should not matter as long as only english alphanumeric
|
||||
The file names on hard disk are assumed to be encoded by the
|
||||
\fBlocal character set\fR which is also used for the communication
|
||||
with the user.
|
||||
Byte codes 32 to 126 of the local character set must match the US\-ASCII
|
||||
characters of the same code. ISO\-8859 and UTF\-8 fulfill this demand.
|
||||
.br
|
||||
By default, \fBxorriso\fR uses the character set as told by
|
||||
shell command "locale" with argument "charmap". This may be influenced
|
||||
by environment variables LC_ALL, LC_CTYPE, or LANG and should match the
|
||||
expectations of the terminal.
|
||||
In some situations it may be necessary to set it by command \-local_charset.
|
||||
.br
|
||||
Local character sets should not matter as long as only english alphanumeric
|
||||
characters are used for file names or as long as all writers and readers
|
||||
of the media use the same character set.
|
||||
of the media use the same local character set.
|
||||
Outside these constraints it may be necessary to let \fBxorriso\fR
|
||||
convert byte codes.
|
||||
convert byte codes from and to other character sets.
|
||||
.br
|
||||
There is an input conversion from input character set to the local character
|
||||
set which applies when an ISO image gets loaded. A conversion from local
|
||||
character set to the output character set is performed when an
|
||||
image tree gets written. The sets can be defined independently by commands
|
||||
The Rock Ridge file names in ISO filesystems are assumed to be
|
||||
encoded by the \fBinput character set\fR.
|
||||
The Rock Ridge file names which get written with ISO filesystems will be
|
||||
encoded by the \fBoutput character set\fR.
|
||||
.br
|
||||
The sets can be defined independently by commands
|
||||
\-in_charset and \-out_charset. Normally one will have both identical, if ever.
|
||||
Other than the local character set, these two character sets may deviate
|
||||
from US\-ASCII.
|
||||
.br
|
||||
If conversions are desired then \fBxorriso\fR needs to know the name of the
|
||||
local character set. \fBxorriso\fR can inquire the same info as
|
||||
shell command
|
||||
"locale" with argument "charmap". This may be influenced by environment
|
||||
variables LC_ALL, LC_CTYPE, or LANG and should match the expectations of
|
||||
the terminal.
|
||||
The output character sets for Joliet and HFS+ are not influenced by these
|
||||
commands. Joliet uses output character set UCS\-2 or UTF\-16. HFS+ uses UTF\-16.
|
||||
.br
|
||||
The default output charset is the local character set of the terminal where
|
||||
\fBxorriso\fR runs. So by default no conversion happens between local
|
||||
filesystem
|
||||
names and emerging names in the image. The situation stays ambigous and the
|
||||
reader has to riddle what character set was used.
|
||||
names and emerging Rock Ridge names in the image. The situation stays
|
||||
ambigous and the reader has to riddle what character set was used.
|
||||
.br
|
||||
By command \-auto_charset it is possible to attribute the output charset name
|
||||
to the image. This makes the situation unambigous. But if your terminal
|
||||
|
Reference in New Issue
Block a user