Category Unicode

Arabic, C programming (miscellaneous), Unicode, Unicode

Unicode for the impatient (Part 3: UTF-8 bits, bytes and C code)

I promised to finish the series on Unicode and UTF-8 so here is the final instalment, better late than never. Before reading this article I suggest that you read Part 1 and Part 2 which cover some important background. As…

Graham Douglas
October 17, 2011
2 Comments

Unicode

Unicode for the impatient (Part 2: UTF-X, what is it?)

Introduction As mentioned in the previous post, a vast amount of information on Unicode is widely available so I won’t waste your reading time by repeating it here. What I will try to add is some additional explanations to fill…

Graham Douglas
February 18, 2011
1 Comment

Unicode, Unicode

From Unicode code points to Arabic text

In Unicode, the range (in hex) 0600 to 06FF is used for Arabic characters. Each value in the range 0600 to 06FF is referred to as a code point. In simple terms, just think of it as a number allocated…

Graham Douglas
February 16, 2011

Unicode

Unicode for the impatient (Part 1: updated)

Over the last few weeks I have been reading extensively about Unicode, OpenType, UTF-8 plus a whole set of technologies related to typesetting complex scripts (Arabic) with LuaTeX. It is, for sure, a pretty complex picture so I thought I…

Graham Douglas
February 15, 2011
2 Comments