src/cmd/tcs/ex5.utf - plan9 - Git at Google

 .tr -\(hy
 .TL
 Hello World
 .br
 or
 .br
 Καλημέρα κόσμε
 .br
 or
 .br
 こんにちは 世界
 .AU
 Rob Pike
 Ken Thompson
 .AI
 .MH
 .AB
 Plan 9 from Bell Labs has recently been converted from ASCII
 to an ASCII-compatible variant of Unicode, a 16-bit character set.
 In this paper we explain the reasons for the change,
 describe the character set and representation we chose,
 and present the programming models and software changes
 that support the new text format.
 Although we stopped short of full internationalization\(emfor
 example, system error messages are in Unixese, not Japanese\(emwe
 believe Plan 9 is the first system to treat the representation
 of all major languages on a uniform, equal footing throughout all its
 software.
 .AE
 .SH
 Introduction
 .PP
 The world is multilingual but most computer systems
 are based on English and ASCII or worse.
 The pending release of Plan 9 [Pike90], a new distributed operating
 system from Bell Laboratories, seemed a good occasion
 to correct this chauvinism.
 It is easier to make such deep changes when building new systems than
 by retrofitting old ones.
 .PP
 The ANSI C standard [ANSIC] contains some guidance on the matter of
 `wide' and `multi-byte' characters but falls far short of
 solving the myriad associated problems.
 We could find no literature on how to convert a
 .I system
 to larger character sets, although some individual
 .I programs
 have been converted.
 This paper reports what we discovered as we
 explored the problem of representing multilingual
	.tr -\(hy
	.TL
	Hello World
	.br
	or
	.br
	Καλημέρα κόσμε
	.br
	or
	.br
	こんにちは世界
	.AU
	Rob Pike
	Ken Thompson
	.AI
	.MH
	.AB
	Plan 9 from Bell Labs has recently been converted from ASCII
	to an ASCII-compatible variant of Unicode, a 16-bit character set.
	In this paper we explain the reasons for the change,
	describe the character set and representation we chose,
	and present the programming models and software changes
	that support the new text format.
	Although we stopped short of full internationalization\(emfor
	example, system error messages are in Unixese, not Japanese\(emwe
	believe Plan 9 is the first system to treat the representation
	of all major languages on a uniform, equal footing throughout all its
	software.
	.AE
	.SH
	Introduction
	.PP
	The world is multilingual but most computer systems
	are based on English and ASCII or worse.
	The pending release of Plan 9 [Pike90], a new distributed operating
	system from Bell Laboratories, seemed a good occasion
	to correct this chauvinism.
	It is easier to make such deep changes when building new systems than
	by retrofitting old ones.
	.PP
	The ANSI C standard [ANSIC] contains some guidance on the matter of
	`wide' and `multi-byte' characters but falls far short of
	solving the myriad associated problems.
	We could find no literature on how to convert a
	.I system
	to larger character sets, although some individual
	.I programs
	have been converted.
	This paper reports what we discovered as we
	explored the problem of representing multilingual