<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>unicode &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/unicode/</link>
	<description>Feed of posts on WordPress.com tagged "unicode"</description>
	<pubDate>Thu, 26 Nov 2009 05:32:43 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[උන්ගේ ආච්චිලාගේ සම්මාන | Their Granmas’ awards ]]></title>
<link>http://taboosubjects.wordpress.com/2009/11/17/1986/</link>
<pubDate>Tue, 17 Nov 2009 14:38:43 +0000</pubDate>
<dc:creator>Taboo</dc:creator>
<guid>http://taboosubjects.wordpress.com/2009/11/17/1986/</guid>
<description><![CDATA[මේ රාජ්‍ය ආයතන මාර්ගයෙන් &#8220;ඊ-නිර්මාණකරුවන්ට&#8221; ප්‍රදානය කරනු ලබන්නේය කියන &#8220;සම්මාන]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://taboosubjects.wordpress.com/files/2009/11/chamara1.jpg"><img class="alignnone size-full wp-image-2419" title="chamara" src="http://taboosubjects.wordpress.com/files/2009/11/chamara1.jpg" alt="" width="500" height="327" /></a><a href="http://taboosubjects.wordpress.com/files/2009/11/dscn0602.jpg"></a></p>
<p><span style="font-size:14pt;color:#002060;line-height:115%;font-family:&#34;" lang="EN-US"></p>
<p>මේ රාජ්‍ය ආයතන මාර්ගයෙන් &#8220;ඊ-නිර්මාණකරුවන්ට&#8221; ප්‍රදානය කරනු ලබන්නේය කියන &#8220;සම්මාන&#8221; ගැන ලියැවෙන තෙවන සටහනය. මේ සටහන ලිවීමට ආසන්න හේතුව <a href="http://sinhala.kalingasblog.com/2009/11/16/sbu-official-notice-about-bestweb_lk" target="_blank">සිංහල බ්ලොග්කරුවන්ගේ සංසදයේ මිතුරන් එකිනෙකා විසින් තම තමන්ගේ බ්ලොග්වල පුන පුනා වහෙන් ඔරෝ බාසාවෙන් පළ කරනු ලබන නිවේදනයකි</a>. මේ වන විට දස වතාවකට වඩා පුනරුච්චාරණය කොට ඇති මේ නිවේදනය උගුරට හොරා බෙහෙත් කෑමකි. ඒ ගැන ලේඛකයා පුදුම නොවේ. තමන් කොයිතරම් නිග්‍රහයට පත් කළද, තොරතුරු තාක්ෂණ නියෝජිතායතනය (Information and Communication Technology Agency හෙවත් ICTA) හා මේ සිද්ධියට ඍජුව වගකිවයුතු <a href="http://www.nic.lk" target="_blank">.lk වසම් අධිකාරිය </a>බාර ආචාර්ය ගිහාන් ඩයස් තරහ නොකැරගෙන සිටීම යෙහෙකිය ඔවුන් කල්පනා කරනවා වන්නට පිළිවන. එවැනි බැඳීම් නැති හෙයින් කිව යුත්ත කෙළින්ම කීමටය මේ කෙටි සටහන.</p>
<p>රාජ්‍ය ආයතන මාර්ගයෙන් &#8220;ඊ-නිර්මාණකරුවන්ට&#8221; හෝ බ්ලොග්කරුවන්ට දෙනු ලබන්නේය කියන සම්මාන තනිකරම මගෝඩි ක්‍රියාවලියකි. මෙවන් &#8220;සම්මාන&#8221; දීම සඳහා මේ කිසිම ආයතනයකට සදාචාරාත්මක අයිතියක් නැත. අද සිංහල භාෂාවෙන් නිර්මාණය වෙමින් පවතින කිසිම ඩිජිටල් අන්තර්ගතයක් මොවුන්ගේ මැදිහත්වීමකින්, උත්තේජනයකින් ඇති වූවක් නොවේ.</p>
<p>Kottu.org හා &#8220;සිංහල බ්ලොග් කියවනය&#8221; වැනි සින්ඩිකේටරයන් බිහිව ඇත්තේ මේ කිසිම ආයතනයක සම්මාදමක් නැතිව තනිකරම පුද්ගලික උවමනාව මත තනි පුද්ගලයන්ගේ වියදමිනි. <a href="http://blog.malinthe.com" target="_blank">මලින්ත සමරකෝන්</a>, බුද්ධික ජයසිංහ, <a href="http://sinhala.kalingasblog.com" target="_blank">කාලිංග ඇතුලත්මුදලි</a>, <a href="http://blog.sudaraka.com" target="_blank">සුපුන් සුදාරක </a>ඇතුළු පිරිසක් සිංහල බ්ලොග් කියවනය නිර්මාණයෙහිද, <a href="http://indi.ca" target="_blank">ඉන්ඩි සමරජීව </a>Kottu.org නිර්මාණය කිරීමෙහිද ප්‍රධානත්වයක් ගෙන කටයුතු කළයි අසා ඇත්තෙමි. (මේ නම් සඳහන් කෙරෙන්නේ ඔවුන් කළ සද්කාර්යය අගය කළයුතු නිසාය. මේ කිසිවකු මම නමින් මිස නොදනිමි. වරදක් ඇතොත් නිවැරදි කෙරෙත්වා!) සම්මාන දෙනවා නම් දිය යුත්තේ ඔවුන්ටය. මේ කර ඇති කාර්යයම ඊට ප්‍රමාණවත්ය. ඊට තව තරග අවශ්‍ය නොවේ.</p>
<p>විදේශාධාර මිලියන ගණනින් නාස්තිකරමින් තොරතුරු තාක්ෂණ නියෝජිතායතනය (ICTA) විසින් කරන ලද එකම කටයුත්ත යුනිකෝඩ් සම්මතය හඳුන්නා දීමක් පමණක් විය හැකිය. එහෙත් යුනිකෝඩයට බොහෝ ඉහත සිංහල අන්තර්ගතයන් (තිබස් බඳු වෙනත් ෆොන්ට භාවිතාවෙන්) බිහිවූ අතර කන්දක් විලි ලා වැදූ මී පැටවකු බඳු, තවමත් යංසයන් රේපයන් එල්ලා හැලෙන යුනිකෝඩය නැතිවද අද සිදුවෙමින් පවතින පරිවර්තනය වීමට ඉඩ තිබුණේය. (උදාහරණයක් හැටියට ඉතාමත් ඉහළ පාඨක ප්‍රතිචාරයක් ලබන ලංකාදීප, ලක්බිම හා ලංකා ඊ-නිව්ස් මංගත පුවත්පත් තවමත් යුනිකෝඩය භාවිතා නොකරයි.)</p>
<p>තොරතුරු තාක්ෂණ නියෝජිතායතනය හා .lk වසම් අධිකාරිය &#8220;ඊ-නිර්මාණකරුවන්ට&#8221;, බ්ලොග්කරුවන්ට &#8220;සම්මාන&#8221; දීමට මැදිහත් වන්නේ එකම හේතුවක් නිසාය. ඒ දැනට වර්ධනය වෙමින් පවතින සිංහල ඩිජිටල් අන්තර්ගතයන් පිළබඳ ගෞරවයෙන් කොටසක් තමන් ඩැහැ ගැනීම සඳහාය. මේ නිර්ලජ්ජිත ක්‍රියාවලියකි. තමන්ගේ කිසිම සම්මාදමකින් තොරව අනුන් කළ දෙයක ගෞරවය ගැනීමට කැස කවන්නෝ අපේ පිළිකුලට මිස ගෞරවයට පාත්‍ර වෙත්ද?</p>
<p><strong>උන්ගේ ආච්චිලාගේ සම්මාන උන්ටම තියාගන්නට කියමු. අප මෙතෙක් කල් ඩිජිටල් අන්තර්ගතයන් නිර්මාණය කළේ උන්ගේ සම්මාන වෙනුවෙන්ද? නැත. අපි අපේ ගමන යමු. ඊට කාගෙන් හෝ කිහිලිකරු අවශ්‍ය නොවේ.</strong><br />
</span><span style="font-size:14pt;color:#002060;line-height:115%;" lang="EN-US"></span></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[bestweb.lk හා myblog.lk සම්බන්ධව සිංහල බ්ලොග්කරුවන්ගේ සංසදයේ නිල ප්‍රකාශය]]></title>
<link>http://harshadewa.wordpress.com/2009/11/16/sbu-official-notice-about-bestweb_lk/</link>
<pubDate>Mon, 16 Nov 2009 06:55:25 +0000</pubDate>
<dc:creator>Harshadewa Ariyasinghe</dc:creator>
<guid>http://harshadewa.wordpress.com/2009/11/16/sbu-official-notice-about-bestweb_lk/</guid>
<description><![CDATA[ශ්‍රී ලංකා .LK වසම් ලේඛකාධිකාරිය (LK Domain Registry) මගින් මීට ටික කලකට පෙර Most Favourite Sri Lank]]></description>
<content:encoded><![CDATA[ශ්‍රී ලංකා .LK වසම් ලේඛකාධිකාරිය (LK Domain Registry) මගින් මීට ටික කලකට පෙර Most Favourite Sri Lank]]></content:encoded>
</item>
<item>
<title><![CDATA[How to uppecase [lowercase] strings in Delphi 2009+ ?]]></title>
<link>http://sergworks.wordpress.com/2009/11/15/how-to-uppecase-lowercase-strings-in-delphi-2009/</link>
<pubDate>Sun, 15 Nov 2009 10:42:01 +0000</pubDate>
<dc:creator>Serg</dc:creator>
<guid>http://sergworks.wordpress.com/2009/11/15/how-to-uppecase-lowercase-strings-in-delphi-2009/</guid>
<description><![CDATA[The answer is not so obvious as it was before. In the time immemorial strings were ASCII strings, an]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>The answer is not so obvious as it was before.</p>
<p>In the time immemorial strings were ASCII strings, and since then we have UpperCase function. The function is  just replaces the lowercase &#8216;a&#8217;..&#8217;z&#8217; characters by their uppercase &#8216;A&#8217;..&#8217;Z&#8217; versions. Though nowadays the UpperCase is a Unicode function it still &#8216;uppercases&#8217; only ASCII characters. That is good but that is not all uppercase functionality we need.</p>
<p>The early Delphi versions introduced ANSI strings. The uppercase conversion on ANSI strings includes local characters besides ASCII &#8216;a&#8217;..&#8217;z&#8217; and consequntly is locale-dependent. The only locale supported by pre-unicode Delphi versions is system locale. The ANSIUpperCase function was introduced to uppercase ANSI strings using system locale. That is not so good as it may be, since sometimes we need a locale different from system locale, but at least is clear and unambiguous.</p>
<p>Now in Delphi2009 we can explicitly define the ANSI locale in Ansistring type, and that is great. The ANSI uppercase conversion is locale-sensitive, and now we can perform ANSI string uppercase conversions on any locale by just calling AnsiUpperCase function, right? The answer is &#8211; NO.</p>
<p>First, in Delphi 2009 the AnsiUpperCase is unicode function. I understand the [compatibility] reasons why the unicode function has Ansi prefix, but I prefer it would never happen.</p>
<p>Second, the real ANSI version of AnsiUpperCase function defined in Ansistrings unit is a wrapper for WinAPI CharUpperBuffA function, and still uses the current system locale and ignores the locale associated with the string type.</p>
<p>As a result it is better to avoid using AnsiUpperCase at all. The AnsiUpperCase function is just a backward compatibility issue.</p>
<p>Nowadays the defaul string type is Unicode string. The unicode string can include characters of all different locales at once, so the unicode strings are again locale-indendent as ASCII strings were, right? Well, it is almost right. But to be absolutely correct, the answer is still NO.</p>
<p>The only problem I know that makes Unicode uppercase conversion locale-dependent is the case of dotless i (ı, $0131) and dotted I (İ, $0130). In most languages the upper of i ($69) is I ($49), but in turkish locale i ($69) maps to İ ($0130). Similarly in turkish the lower of I ($49) is ı ($0131).</p>
<p>Now let us go back to Delphi. The new string format containes no locale information for unicode strings. We have 2 functions to make the unicode uppercase conversion &#8211; the already mentioned SysUtils.AnsiUpperCase introduced for backward compatibility and new ToUpper function defined in Character unit. The SysUtils.AnsiUpperCase is a wrapper for WinAPI CharUpperBuffW function and ignores locale-specific issues. The CharUpperBuffW works much better than its ANSI analog CharUpperBuffA, but still can&#8217;t help with &#8220;turkish case&#8221;. The Character.ToUpper function is a wrapper for LCMapString function and is locale-dependent, but the locale parameter is set to system locale. So if you need &#8220;turkish uppercase&#8221; on the system with different locale (or want to make the result of uppercase independent from system locale) you must write your own LCMapString wrapper.</p>
<p>BTW: the early Delphi 2009 releases containes side-effect bug in ToUpper and ToLower implementations. The bug was fixed in Update 3 (build 12.0.3420.21218).</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode, UTF-8 and Coroutines.]]></title>
<link>http://udoprog.wordpress.com/2009/11/14/unicode-utf-8-and-coroutines/</link>
<pubDate>Sat, 14 Nov 2009 01:41:00 +0000</pubDate>
<dc:creator>udoprog</dc:creator>
<guid>http://udoprog.wordpress.com/2009/11/14/unicode-utf-8-and-coroutines/</guid>
<description><![CDATA[I felt like getting my hands dirty in the unicode swamp, and actually writing myself a proper UTF-8 ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I felt like getting my hands dirty in the unicode swamp, and actually writing myself a proper UTF-8 decoder.</p>
<p>I am putting this code public to argue a couple of points on good coding practice.</p>
<p>But as a backstory, i recommend <a href="http://www.chiark.greenend.org.uk/~sgtatham/coroutines.html">the following about couroutines</a> and why they are one of the dirtiest hacks that exists to date in ANSI C.</p>
<p>The actual code is small, and consistent. The &#8216;nasty&#8217; goto statement for handling decoding errors I believe is a nice touch which in this case only contributes to clarity and consistency.</p>
<p>What do you think?</p>
<pre>
<div id="foo">
</div>
</pre>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Khmer Unicode Mac]]></title>
<link>http://nokorsoft.wordpress.com/2009/11/13/khmer-unicode-mac/</link>
<pubDate>Fri, 13 Nov 2009 03:19:40 +0000</pubDate>
<dc:creator>nokorsoft</dc:creator>
<guid>http://nokorsoft.wordpress.com/2009/11/13/khmer-unicode-mac/</guid>
<description><![CDATA[Khmer Unicode for Apple Mac computer Nokorsoft will release Khmer Unicode MacOSX Lite V2.0. Khmer Un]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><h1>Khmer Unicode for Apple Mac computer</h1>
<p>Nokorsoft will release Khmer Unicode MacOSX Lite V2.0. Khmer Unicode Mac OSX Lite enables Mac OS X users to read and write Khmer Unicode with most software on Mac OS machine.</p>
<p><img class="aligncenter size-medium wp-image-366" title="khmerunicode4mac" src="http://nokorsoft.wordpress.com/files/2009/11/khmerunicode4mac.png?w=300" alt="khmerunicode4mac" width="300" height="212" /></p>
<p><img class="aligncenter size-full wp-image-353" title="khmeros" src="http://nokorsoft.wordpress.com/files/2009/11/khmeros.jpg" alt="khmeros" width="600" height="412" /></p>
<p>What are in the package?</p>
<p>- In version 2.o, we removed the old Khmer Unicode font and replaced it by a Khmer Unicode font created  by Mr. Danh Hong, I do have his authorization to release this package.</p>
<p>- NiDA Khmer Keyboard Layout Version 1.0 &#38; Version 2.0.</p>
<p>- Documents on how to use Khmer Unicode MacOSX can be found in Applications folder.</p>
<p>To get better Khmer Unicode fonts that will work with Microsoft Word and Adobe products, please contact Nokorsoft or Mr. Danh Hong via email: <a href="mailto:dnhhng@gmail.com">dnhhng@gmail.com</a></p>
<p>- <strong>Download Link:</strong></p>
<p><strong>Khmer Unicode MacOSX Lite Version 2.o will be available in near future.<br />
</strong></p>
<p>Password to unlock zip file is: <strong>nokorsoft.com</strong></p>
<p><strong>- File Size: </strong>889 KB</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode]]></title>
<link>http://max1957.wordpress.com/2009/11/09/unicode/</link>
<pubDate>Mon, 09 Nov 2009 14:11:00 +0000</pubDate>
<dc:creator>max1957</dc:creator>
<guid>http://max1957.wordpress.com/2009/11/09/unicode/</guid>
<description><![CDATA[Per i non informatici: Unicode è un sistema di codifica che assegna un numero univoco ad ogni caratt]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Per i non informatici: <strong>Unicode</strong> è un sistema di codifica che assegna un numero univoco ad ogni carattere usato per la scrittura di testi, in maniera indipendente dalla lingua, dalla piattaforma informatica e dal programma utilizzati (via <a href="http://it.wikipedia.org/wiki/Unicode" target="_blank">Wikipedia</a>).  In<a href="http://t-a-w.blogspot.com/2008/12/funny-characters-in-unicode.html" target="_blank"> questo articolo</a> potrete scoprire le meraviglie che contiene come ad esempio il carattere 0E5B (Khomut in Thai)</p>
<p><a href="http://t-a-w.blogspot.com/2008/12/funny-characters-in-unicode.html"><img class="alignnone" title="Unicode" src="http://4.bp.blogspot.com/_IYGc_MWwkfw/ST8M2rug1qI/AAAAAAAAAcU/vExFkJaetGU/s400/khomut.png" alt="" width="145" height="98" /></a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Module 1 - Declaring, initializing and using variables in Java]]></title>
<link>http://tecnoesis.wordpress.com/2009/11/07/module-1-declaring-initializing-and-using-variables/</link>
<pubDate>Sat, 07 Nov 2009 17:53:12 +0000</pubDate>
<dc:creator>Rajani Ramsagar</dc:creator>
<guid>http://tecnoesis.wordpress.com/2009/11/07/module-1-declaring-initializing-and-using-variables/</guid>
<description><![CDATA[  Module 1: Declaring, initializing and using variables.    Objectives Identify the uses for variabl]]></description>
<content:encoded><![CDATA[  Module 1: Declaring, initializing and using variables.    Objectives Identify the uses for variabl]]></content:encoded>
</item>
<item>
<title><![CDATA[ASCII, EXTENDED CHARACTER SET, UNICODE]]></title>
<link>http://foreverma.wordpress.com/2009/11/06/ascii-extended-character-set-unicode/</link>
<pubDate>Fri, 06 Nov 2009 09:56:29 +0000</pubDate>
<dc:creator>foreverma</dc:creator>
<guid>http://foreverma.wordpress.com/2009/11/06/ascii-extended-character-set-unicode/</guid>
<description><![CDATA[karakter-katakter dari ASCII, EXTENDED CHARACTER SET, UNICODE telah saya kumpulkan, silahkan lihat ]]></description>
<content:encoded><![CDATA[karakter-katakter dari ASCII, EXTENDED CHARACTER SET, UNICODE telah saya kumpulkan, silahkan lihat ]]></content:encoded>
</item>
<item>
<title><![CDATA[Javascript and special characters]]></title>
<link>http://zniper.wordpress.com/2009/11/06/javascript-and-special-characters/</link>
<pubDate>Fri, 06 Nov 2009 09:40:49 +0000</pubDate>
<dc:creator>Zniper</dc:creator>
<guid>http://zniper.wordpress.com/2009/11/06/javascript-and-special-characters/</guid>
<description><![CDATA[I met a problem of displaying special characters of german when working with Javascript. Here is an ]]></description>
<content:encoded><![CDATA[I met a problem of displaying special characters of german when working with Javascript. Here is an ]]></content:encoded>
</item>
<item>
<title><![CDATA[Scrivere elfico con linux]]></title>
<link>http://bethelwhite.wordpress.com/2009/11/02/scrivere-elfico-con-linux/</link>
<pubDate>Mon, 02 Nov 2009 19:47:15 +0000</pubDate>
<dc:creator>caladyon</dc:creator>
<guid>http://bethelwhite.wordpress.com/2009/11/02/scrivere-elfico-con-linux/</guid>
<description><![CDATA[Un carattere elfico Sin da quando lessi &#8220;Il signore degli anelli&#8221; ho la fissa della scri]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><div id="attachment_39" class="wp-caption alignleft" style="width: 295px"><a href="http://bethelwhite.wordpress.com/files/2009/11/dettagli-carattere.png"><img class="size-full wp-image-39" title="Un carattere elfico" src="http://bethelwhite.wordpress.com/files/2009/11/dettagli-carattere.png" alt="Un carattere elfico" width="285" height="310" /></a><p class="wp-caption-text">Un carattere elfico</p></div>
<p>Sin da quando lessi &#8220;Il signore degli anelli&#8221; ho la fissa della scrittura con le <strong>tengwar</strong>, cioè i caratteri elfici descritti in una delle appendici di quel romanzo. Esistono da anni vari strumenti (per vari sistemi operativi) per raggiungere questo scopo, ma tutti usano particolari font che mappano le lettere dell&#8217;alfabeto latino con le suddette tengwar. Invece il mio obiettivo era utilizzare i caratteri destinati a questo uso definiti nelle tabelle di <strong>Unicode</strong>, ammesso che ci fossero.<!--more--></p>
<p><em>Perché</em>? I motivi sono presto detti: le soluzioni esistenti presuppongono che chi scrive il documento elfico abbia installato almeno un font adatto (e che faccia lo stesso chi riceve il documento); inoltre la mappatura di cui sopra non è standard; e poi la codifica caratteri universale, dopo che  mi ha causato tanto stress sul lavoro, avrà qualche risvolto positivo negli hobby di una persona qualsiasi, o no? <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>Passo a descrivere la <strong>soluzione</strong>, vediamo dopo se ho centrato questi obiettivi&#8230;</p>
<p>Io ho lavorato con Ubuntu Jaunty, ma tutto quello che ho fatto dovrebbe essere fattibile su qualsiasi linux con poche modifiche e anche su windows o mac con qualche modifica in più (<em>in teoria</em>!).</p>
<p>Per prima cosa, conviene provare se i caratteri elfici sono <strong>già</strong> funzionanti sul proprio sistema (<em>tentar non nuoce, ma non ci sperate</em>): apriamo un qualsiasi browser (che è già aperto se leggete questa pagina) e dirigiamoci sulla pagina di <a title="Tengwar Unicode Test" href="http://www.code2000.net/tengwartest.htm" target="_blank">Prova delle Tengwar Unicode</a>. Se le prime due righe sono in elfico, siete a cavallo, altrimenti vedrete dei quadrati o altri simboli e questo vuol dire che vi manca il font giusto.</p>
<p>Io ho scaricato il <strong>font</strong> gratuito Code2001 da <a title="Unicode Fonts" href="http://www.wazu.jp/gallery/Fonts_Tengwar.html" target="_blank">Tengwar Unicode Fonts</a> e l&#8217;ho subito installato nel mio sistema:</p>
<ul>
<li>copiato il file CODE2001.TTF in /usr/local/share/fonts/ (ma anche ~/.fonts può andare)</li>
<li>lanciato il comando sudo fc-cache -fv per aggiornare la cache dei font</li>
</ul>
<p>A questo punto è opportuno ricaricare la pagina di test (eventualmente dopo aver chiuso e riaperto il browser) per vedere che i caratteri <strong>appaiono</strong> correttamente!</p>
<p>Bene, il sistema a questo punto permette di visualizzare e utilizzare le tegwar&#8230; ma come si può fare?</p>
<div id="attachment_37" class="wp-caption alignright" style="width: 160px"><a href="http://bethelwhite.wordpress.com/files/2009/11/tengwar-tabella.png"><img class="size-thumbnail wp-image-37" title="Tabella caratteri aperta sulle tengwar" src="http://bethelwhite.wordpress.com/files/2009/11/tengwar-tabella.png?w=150" alt="Tabella caratteri aperta sulle tengwar" width="150" height="104" /></a><p class="wp-caption-text">Tabella caratteri aperta sulle tengwar</p></div>
<p>Lanciando la <strong>Tabella caratteri</strong> (Menu Applicazioni &#8211; Accessori), i nuovi caratteri possono essere trovati in:</p>
<ul>
<li>Area Uso Privato, a partire dal carattere U+E000, oppure</li>
<li>Area-A a uso privato supplementare, a partire dal carattere U+F2000.</li>
</ul>
<p>Sono gli stessi caratteri ripetuti: sul mio portatile preferisco la seconda area, perché nella prima alcuni codici sono utilizzati anche da altri simboli e al momento di utilizzarli devo comunque selezionare il font Code2001. Dalla figura si vede che questo font contiene anche le rune de Il signore degli anelli.</p>
<p>Se, come me, state usando <strong>Gnome</strong>, allora è possibile usare la seguente combinazione di caratteri:</p>
<p>CTRL+MAIUSC+U, codice esadecimale, INVIO</p>
<p>È sufficiente ricordarsi a memoria il codice corrispondente al simbolo desiderato&#8230; io che sono troppo vecchio per queste cose (<em>è una citazione?</em>) mi sono fatto un foglio di calcolo con i <strong>codici</strong>. <a href="http://bethelwhite.wordpress.com/files/2009/10/tengwar-unicode.pdf">PDF dei codici unicode delle tengwar</a></p>
<div id="attachment_38" class="wp-caption alignnone" style="width: 520px"><a href="http://bethelwhite.wordpress.com/files/2009/11/tengwar-gedit-passwd.png"><img class="size-full wp-image-38" title="GEdit per scrivere elfico in passwd" src="http://bethelwhite.wordpress.com/files/2009/11/tengwar-gedit-passwd.png" alt="GEdit per scrivere elfico in passwd" width="510" height="72" /></a><p class="wp-caption-text">GEdit per scrivere elfico in passwd</p></div>
<p>Una volta imparato, si possono scrivere questi caratteri dovunque, anche in un file di sistema come /etc/passwd.</p>
<div id="attachment_34" class="wp-caption alignright" style="width: 357px"><a href="http://bethelwhite.wordpress.com/files/2009/10/tengwar-1.png"><img class="size-full wp-image-34" title="Nome in elfico nel pannello" src="http://bethelwhite.wordpress.com/files/2009/10/tengwar-1.png" alt="Nome in elfico nel pannello" width="347" height="95" /></a><p class="wp-caption-text">Nome in elfico nel pannello</p></div>
<p>Avendo fatto come nella figura, il mio nome in elfico appare sia nella finestra di login di GDM sia nel pannello in alto a destra.</p>
<div id="attachment_35" class="wp-caption alignleft" style="width: 102px"><a href="http://bethelwhite.wordpress.com/files/2009/10/tengwar-2.png"><img class="size-full wp-image-35" title="Applet Tavolozza caratteri elfici" src="http://bethelwhite.wordpress.com/files/2009/10/tengwar-2.png" alt="Applet Tavolozza caratteri elfici" width="92" height="230" /></a><p class="wp-caption-text">Applet Tavolozza caratteri elfici</p></div>
<p>Per aiutarmi nella scrittura, mi sono configurato nel pannello anche l&#8217;<strong>applet</strong> Tavolozza caratteri, come in figura, anche se è sicuramente migliorabile (per ora ho messo alcune vocali elfiche e la e con la dieresi che serve per scrivere Lastë).</p>
<p>A questo punto il limite è l&#8217;immaginazione.:)</p>
<p>All&#8217;inizio dell&#8217;articolo esponevo quali potessero essere i motivi per tutto questo lavoro che ho fatto&#8230; beh, posso dire che sono abbastanza <strong>soddisfatto</strong>.</p>
<p>E&#8217; necessario installare almeno un font che implementi i caratteri elfici: credo che non si possa fare a meno di questo&#8230; così come si devono installare i font adatti per le scritture orientali.</p>
<p>Sfruttando la codifica Unicode, si possono scrivere tutti i caratteri (latini, elfici, giapponesi, russi, ecc&#8230;) in <strong>qualsiasi posto</strong>, anche un semplice file di testo, dato non c&#8217;è bisogno di formattere il testo, cosa che invece era necessaria prima di Unicode.</p>
<p>Un problema può essere il cambio di codici quando le tengwar verranno accettate definitivamente in Unicode (ma non mi sembra ci sia fretta, <a title="Proposta di Tengwar in Unicode" href="http://anubis.dkuug.dk/JTC1/SC2/WG2/docs/n1641/n1641.htm" target="_blank">la proposta</a> risale al 1997). Fino ad allora c&#8217;è il rischio che quelli che io vedo come caratteri elfici, qualcun&#8217;altro veda associati a simboli differenti: in effetti le aree in cui sono attualmente inseriti sono denominate per &#8220;uso privato&#8221;!</p>
<p>A questo punto cosa manca? Sicuramente sento la mancanza delle &#8220;<strong>modalità automatiche</strong>&#8221; di scrittura elfica (quenya, sindarin, inglese, italiano&#8230;) che sono una caratteristica dei programmi esistenti di scrittura con le tengwar, ma come ho detto precedentemente questi usano i normali caratteri latini che diventano caratteri elfici grazie a vari font. Forse bastano pochi interventi di programmazione per adattare un programma esistente.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Setting Up Your Own Jython 2.5.1 Environment]]></title>
<link>http://dbaktiar.wordpress.com/2009/11/02/setting-up-your-own-jython-2-5-1-environment/</link>
<pubDate>Mon, 02 Nov 2009 07:38:23 +0000</pubDate>
<dc:creator>dbaktiar</dc:creator>
<guid>http://dbaktiar.wordpress.com/2009/11/02/setting-up-your-own-jython-2-5-1-environment/</guid>
<description><![CDATA[In this post I&#8217;ll show you how to setup your own Jython environment. Prerequisite to this, you]]></description>
<content:encoded><![CDATA[In this post I&#8217;ll show you how to setup your own Jython environment. Prerequisite to this, you]]></content:encoded>
</item>
<item>
<title><![CDATA[Font Finder]]></title>
<link>http://yaytay.wordpress.com/2009/10/30/font-finder/</link>
<pubDate>Fri, 30 Oct 2009 07:11:57 +0000</pubDate>
<dc:creator>yaytay</dc:creator>
<guid>http://yaytay.wordpress.com/2009/10/30/font-finder/</guid>
<description><![CDATA[I wrote this a while ago purely to find out whether any of the fonts on my PC had a glyph for the Un]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I wrote this a while ago purely to find out whether any of the fonts on my PC had a glyph for the Unicode &#8220;Sound Recording Copyright&#8221; symbol (<span>&#8471;</span>).</p>
<p><img src="http://yaytay.wordpress.com/files/2009/10/fontfinder1.png" alt="Font Finder UI" /></p>
<p>You can search by typing in either of the top boxes, searching by the name of the character in the top left box or going directly by unicode character point in the top right box.</p>
<p>You can download Font Finder <a href="http://www.mediafire.com/?sharekey=8a7b0784f61236847432d3c9683f450ae04e75f6e8ebb871">here</a>.</p>
<p>Have a play, and let me know if you have any problems with it.<br />
It&#8217;s a .net 3.5 app, so you&#8217;ll need to get that if you haven&#8217;t already got it.</p>
<p>With thanks to Michael Kaplan for providing useful input over on his blog: <a href="http://blogs.msdn.com/michkap/archive/2008/07/01/8674304.aspx">http://blogs.msdn.com/michkap/archive/2008/07/01/8674304.aspx</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Neue Probleme mit griechischem Text in LaTeX]]></title>
<link>http://kapaneus.wordpress.com/2009/10/29/neue-probleme-mit-griechischem-text-in-latex/</link>
<pubDate>Thu, 29 Oct 2009 13:50:40 +0000</pubDate>
<dc:creator>kapaneus</dc:creator>
<guid>http://kapaneus.wordpress.com/2009/10/29/neue-probleme-mit-griechischem-text-in-latex/</guid>
<description><![CDATA[Griechischen Text in ein handelsübliches LaTeX-Dokument einzubinden, war schon seit langem ein PITA.]]></description>
<content:encoded><![CDATA[Griechischen Text in ein handelsübliches LaTeX-Dokument einzubinden, war schon seit langem ein PITA.]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode Brothers Eps.5]]></title>
<link>http://zumix.wordpress.com/2009/10/28/unicode-brothers-eps-5/</link>
<pubDate>Wed, 28 Oct 2009 16:01:47 +0000</pubDate>
<dc:creator>Zumix</dc:creator>
<guid>http://zumix.wordpress.com/2009/10/28/unicode-brothers-eps-5/</guid>
<description><![CDATA[]]></description>
<content:encoded><![CDATA[]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode Brothers Eps.4]]></title>
<link>http://zumix.wordpress.com/2009/10/28/unicode-brothers-eps-4/</link>
<pubDate>Tue, 27 Oct 2009 23:56:13 +0000</pubDate>
<dc:creator>Zumix</dc:creator>
<guid>http://zumix.wordpress.com/2009/10/28/unicode-brothers-eps-4/</guid>
<description><![CDATA[]]></description>
<content:encoded><![CDATA[]]></content:encoded>
</item>
<item>
<title><![CDATA[Yes we can use Khmer Unicode on Apple Mac &amp; iPhone]]></title>
<link>http://nokorsoft.wordpress.com/2009/10/27/we-can-use-khmer-unicode-on-apple-mac-iphone/</link>
<pubDate>Tue, 27 Oct 2009 15:06:29 +0000</pubDate>
<dc:creator>nokorsoft</dc:creator>
<guid>http://nokorsoft.wordpress.com/2009/10/27/we-can-use-khmer-unicode-on-apple-mac-iphone/</guid>
<description><![CDATA[អ្នកណាហ៊ានថាមិនអាចប្រើភាសាខ្មែរក្នុង Apple iMac, iPod, iPhone ទៀតគឺខុស ខុស ខុស Xenotypeលែងជាដំណោះស្រ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><h1>អ្នកណាហ៊ានថាមិនអាចប្រើភាសាខ្មែរក្នុង</h1>
<h1>Apple iMac, iPod, iPhone</h1>
<h1>ទៀតគឺខុស ខុស ខុស</h1>
<p>Xenotypeលែងជាដំណោះស្រាយតែមួយគត់ទៀតហើយ  ដើម្បីមើលភាសាខ្មែរលើប្រព័ន្ធ ដំណើរការApple Mac OS។</p>
<p>លោកហុងមកពីKhmer OS បានស្រាវជ្រាវឃើញវិធីអាចអោយយើង ប្រើអក្សរខ្មែរយូនីកូដ ក្នុងកម្មវិធី Microsoft Office 2008, FireFox, Safari, Adobe&#8217;s Products (Indesign) ជាដើម ក្នុងប្រព័ន្ធដំណើរការ MAC OSបានហើយ។</p>
<p><img class="aligncenter size-medium wp-image-334" title="Khmer Unicode Mac Word 08" src="http://nokorsoft.wordpress.com/files/2009/10/picture-1.png?w=300" alt="Khmer Unicode Mac Word 08" width="300" height="131" /></p>
<p>ប៉ុន្តែយើងមិនទាន់ដឹងថាពុម្ពអក្សរខ្មែរទាំងនោះនឹងអាចដាក់ អោយបងប្អូនប្រើនៅពេលណាទេ ព្រោះពុម្ពអក្សរទាំងនៅស្ថិតក្នុង ដំណាក់កាលសាកល្បងនៅឡើយ។</p>
<p>ចំណែកឯលោកUponastar វិញបានចេញកម្មវិធីកញ្ជប់ភាសាខ្មែរសំរាប់ iPod Touch / iPhone រួមមានទាំងពុម្ពអក្សរខ្មែរយូនីកូដ ក្តារចុច បញ្ជីប្រើប្រាស់ ជាភាសាជាដើម។</p>
<p>សូមទៅកាន់គេហទំព័រ <a href="http://www.iphonekhmer.com" target="_blank">http://www.iphonekhmer.com</a></p>
<p><img class="aligncenter size-medium wp-image-335" title="IMG_00061" src="http://nokorsoft.wordpress.com/files/2009/10/img_00061.png?w=200" alt="IMG_00061" width="200" height="300" /></p>
<p><img class="aligncenter size-medium wp-image-337" title="1.7" src="http://nokorsoft.wordpress.com/files/2009/10/1-7.png?w=300" alt="1.7" width="300" height="201" /></p>
<p>Pics&#8217; Source: iphonekhmer.com</p>
<p><span style="color:#ff0000;"><br />
</span></p>
<p>ទាយយកកម្មវិធី​Khmer Unicode MacOSX Lite V2.o ពី <a href="http://nokorsoft.wordpress.com/2009/11/13/khmer-unicode-mac/" target="_blank">http://nokorsoft.wordpress.com/2009/11/13/khmer-unicode-mac/</a></p>
<p>&#160;</p>
<p>ដូចនេះទោះបីជាមិនបាន១០០%  យើងអាចនៅតែអាចនិយាយបានថា</p>
<p><span style="color:#ff0000;"><strong>យើងអាចប្រើខ្មែរយូនីកូដលើប្រព័ន្ធ Apple Mac OS X និង iPhone OS បានហើយ។</strong> </span></p>
<p>Mr. Hong from Khmer OS has found a way to make Khmer OS Unicode fonts display correctly with software runing under Mac OS X such as MS Office 2008, Firefox, Safari, Adobe InDesing; however, the font development process is still in testing stage. We do not know when  and how the fonts will be released.</p>
<p>On the other hand, Mr. Uponastar released Khmer Localization Package including Fonts, Keyboard, Khmer Menu, Khmer Contact, Weather and Theme for iPod Touch &#38; iPod Phone. Please visit <a href="http://www.iphonekhmer.com/" target="_blank">http://www.iphonekhmer.com</a> for more details.</p>
<p>Please download Khmer Unicode MacOSX Lite Version 2.o  from <a href="http://nokorsoft.wordpress.com/2009/11/13/khmer-unicode-mac/" target="_blank">http://nokorsoft.wordpress.com/2009/11/13/khmer-unicode-mac/</a></p>
<p><strong>So all together, Yes we can use Khmer Unicode fonts on Apple Mac OS X &#38; iPhone OS . It is not a dream anymore. </strong></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode Brothers Eps.3]]></title>
<link>http://zumix.wordpress.com/2009/10/26/unicode-brothers-eps-3/</link>
<pubDate>Mon, 26 Oct 2009 17:22:21 +0000</pubDate>
<dc:creator>Zumix</dc:creator>
<guid>http://zumix.wordpress.com/2009/10/26/unicode-brothers-eps-3/</guid>
<description><![CDATA[]]></description>
<content:encoded><![CDATA[]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode Brothers Eps.2]]></title>
<link>http://zumix.wordpress.com/2009/10/25/unicode-brothers-eps-2/</link>
<pubDate>Sun, 25 Oct 2009 16:51:08 +0000</pubDate>
<dc:creator>Zumix</dc:creator>
<guid>http://zumix.wordpress.com/2009/10/25/unicode-brothers-eps-2/</guid>
<description><![CDATA[]]></description>
<content:encoded><![CDATA[]]></content:encoded>
</item>
<item>
<title><![CDATA[Unicode Brothers Eps.1]]></title>
<link>http://zumix.wordpress.com/2009/10/24/unicode-brothers-eps-1/</link>
<pubDate>Sat, 24 Oct 2009 16:21:57 +0000</pubDate>
<dc:creator>Zumix</dc:creator>
<guid>http://zumix.wordpress.com/2009/10/24/unicode-brothers-eps-1/</guid>
<description><![CDATA[]]></description>
<content:encoded><![CDATA[]]></content:encoded>
</item>
<item>
<title><![CDATA[Categorically Yours &amp; Typographical Musings†]]></title>
<link>http://range.wordpress.com/2009/10/24/categorically-yours-typographical-musings%e2%80%a0/</link>
<pubDate>Sat, 24 Oct 2009 09:56:39 +0000</pubDate>
<dc:creator>range</dc:creator>
<guid>http://range.wordpress.com/2009/10/24/categorically-yours-typographical-musings%e2%80%a0/</guid>
<description><![CDATA[Okami&#39;s Lamy fountain pens I One of the marks of being a good prof is when they see that student]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><div class="wp-caption alignleft" style="width: 330px"><a href="http://okami-whatever.blogspot.com/2009/10/featured-pen-lamy-safari.html"><img src="http://3.bp.blogspot.com/_dcGUM5wwl2A/St2imJgnCyI/AAAAAAAAAsA/a3ZvW6XvP00/s400/IMG_1625.jpg" alt="Okamis Lamy fountain pens" width="320" height="240" /></a><p class="wp-caption-text">Okami&#39;s Lamy fountain pens</p></div>
<p style="text-align:center;"><strong>I</strong></p>
<p>One of the marks of being a good prof is when they see that students didn&#8217;t understand something and go over it again. My algebra prof has made a habit of this, especially when he goes over stuff in class too quickly. <a href="http://range.wordpress.com/2009/10/16/mathematical-teaching-methods/">This happened last week and left me quite furious</a>. Someone must have mentioned something to the prof², and he went over what we saw in the last hour again. This took an hour out of the three-hour class. I appreciated. I realized that there were some undergrads in our class, who weren&#8217;t familiar with some of the more abstract concepts of <a href="http://en.wikipedia.org/wiki/Category_theory" target="_blank">category theory</a>³. It was an issue. Class was great today. I was drinking my strong milk tea and noting stuff down. We saw direct sums and <a href="http://en.wikipedia.org/wiki/Universal_property" target="_blank">(co)universal objects</a>. Having proofs done with commutative diagrams is so elegant and simple.</p>
<p><!--more--></p>
<div class="wp-caption alignleft" style="width: 210px"><a href="http://en.wikipedia.org/wiki/Portal:Category_theory/Selected_picture"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/8/81/Functor_cone_%28extended%29.svg/200px-Functor_cone_%28extended%29.svg.png" alt="Limit of a diagram is a cone, via Wikipedia" width="200" height="168" /></a><p class="wp-caption-text">Limit of a diagram is a cone, via Wikipedia</p></div>
<p style="text-align:center;"><strong>II</strong></p>
<p>I&#8217;ve got a small term paper due in December. I&#8217;ve decided to do it on the <a href="http://en.wikipedia.org/wiki/Clifford_algebra" target="_blank">Clifford algebras</a> of cubic forms. It&#8217;s supposed to be quite short, but I&#8217;ll pump out quite a bit of stuff so that I can nail an A.</p>
<p>The algebra prof told us that we&#8217;d have a take-home midterm exam. I haven&#8217;t mentioned to too many people that I have all of the solutions to all of the Hungerford problems¹.Well, most of them. I noticed today while I was printing them out that there were a few sections missing. I was definitely feeling good after he explained everything again. Not that I didn&#8217;t understand it beforehand, but it made a difference that he cared enough to go through it once more, at normal speed.</p>
<p style="text-align:center;"><strong>III</strong></p>
<div class="wp-caption alignleft" style="width: 330px"><a href="http://en.wikipedia.org/wiki/Keyboard_layout#Canadian_French"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/41/KB_Canadian_Multilingual_Standard.svg/400px-KB_Canadian_Multilingual_Standard.svg.png" alt="Canadiatn Multilingual Standard keyboard layout, via Wikipedia" width="320" height="106" /></a><p class="wp-caption-text">Canadian Multilingual Standard keyboard layout, via Wikipedia</p></div>
<p>I always use a <a href="http://en.wikipedia.org/wiki/File:KB_Canadian_Multilingual_Standard.svg" target="_blank">Canadian Multilingual keyboard</a>. For some reason, I&#8217;ve been blindly hitting a combination of keys that would give me the multiplication sign &#8216;×&#8217;. I was trying to figure out how this happened when I found this page on Wikipedia. I&#8217;ve kind of become interested in the Unicode characters which are available on standard keyboards. My favorites include the superscript numbers I use for footnotes, † → the dagger symbol, ‡ → the double dagger symbol, and this star symbol → ★. Other favorites include the curly brackets → { } and the abreviation for number → №. There&#8217;s the degree symbol → ° as well as the tilde → ~ and the <a href="http://range.wordpress.com/2009/10/03/%e2%80%95/">horizontal bar → ―</a>.</p>
<p style="text-align:center;"><strong>IV</strong></p>
<p>I think that we&#8217;ve completed our intro into category theory. I&#8217;ll have to learn the rest by myself, as we&#8217;re delving into the structure of groups now. I&#8217;m looking forward to that, as well as <a href="http://en.wikipedia.org/wiki/Galois_theory" target="_blank">Galois theory</a> and <a href="http://en.wikipedia.org/wiki/Ring_theory" target="_blank">commutative ring theory</a>.</p>
<p>I really like how some proofs are just completed by commutative diagrams. It&#8217;s just so elegant. I don&#8217;t know why I find these algebraic topics so interesting. They are elegant and abstract. In analysis, we are seeing the Lebesgue measure and integral. The prof keeps saying &#8216;mearable&#8217; instead of saying <em>measurable</em>. He also says &#8216;lebesgue&#8217; by pronouncing the &#8217;s&#8217;, which is silent, as any French-speaker will tell you.</p>
<p>Another weird thing was the word integer. I was convinced that it was pronounced integer with a hard /g/, not a soft /j/. This happens because I speak three languages fluently. My wife&#8217;s favorites include catholic and patio. When I&#8217;m tired, I put the emphasis on the wrong part of the word. She finds it cute.</p>
<p>This is how you pronounce: ĭn&#8217;tĭ-jər.</p>
<p><a href="http://twitter.com/djrange" target="_blank">Follow me on Twitter @djrange</a></p>
<p style="text-align:center;">* * * * *</p>
<p>[¹]: <a href="http://range.wordpress.com/2009/10/01/gtm-hungerfords-algebra-vol-125/"><em>Algebra</em></a>, by Thomas Hungerford, Springer Verlag, GTM Vol 73.<br />
[²]: Not about me, but about the fact that he went too fast and that they didn&#8217;t understand. Towards the end of the class, I mentioned out loud that I didn&#8217;t get most of what he had been explaining because he was going too fast. I was basically just mumbling in my beard.<br />
[³]: The <a href="http://range.wordpress.com/2009/10/16/mathematical-teaching-methods/#yesgirls"><em>yes girls</em>‡</a> are undergrads. I wasn&#8217;t surprised to find this out. For some reason, the prof didn&#8217;t want to get into a too high level of abstraction.</p>
<p>[†]: I thought this was a good play on words.<br />
[‡]: see <a href="http://range.wordpress.com/2009/10/16/mathematical-teaching-methods/#yesgirls">this footnote  in this post</a>.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Asyik, PHP 6 Terbaru Support Karakter Unicode]]></title>
<link>http://bowoblog.wordpress.com/2009/10/22/asyik-php-6-terbaru-support-karakter-unicode/</link>
<pubDate>Thu, 22 Oct 2009 11:38:50 +0000</pubDate>
<dc:creator>bowo</dc:creator>
<guid>http://bowoblog.wordpress.com/2009/10/22/asyik-php-6-terbaru-support-karakter-unicode/</guid>
<description><![CDATA[PHP 6, sebuah upgrade bahasa dinamis open source populer untuk aplikasi web kini telah support denga]]></description>
<content:encoded><![CDATA[PHP 6, sebuah upgrade bahasa dinamis open source populer untuk aplikasi web kini telah support denga]]></content:encoded>
</item>
<item>
<title><![CDATA[What is Unicode ??]]></title>
<link>http://abhilekh.wordpress.com/2009/10/21/what-is-unicode/</link>
<pubDate>Wed, 21 Oct 2009 12:26:26 +0000</pubDate>
<dc:creator>Abhishek Singh</dc:creator>
<guid>http://abhilekh.wordpress.com/2009/10/21/what-is-unicode/</guid>
<description><![CDATA[Computers internally work on numbers. This means that characters need to be coded as numbers. A typi]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Computers internally work on numbers. This means that characters need to be coded as numbers. A typical arrangement is to use numbers from 0 to 255, because this range fits into a basic unit of data storage and transfer, called a (8-bit) byte or octet.</p>
<p>A character code is something that defines how that numbers corresponds to characters. Most of the chracters have the same assignment for numbers 0 to 127, used for character that appear in English as well as in many other languages: the letters a-z plus their uppercase equivalents, the digits 0-9 and a few punctuation marks. Many of the code numbers in in this so-called ASCII set of characters are used for various technical purpose. For French texts, for example, you need additional characters such as accented letters. These can be provided by using code numbers in the range 128-256 in addition to the ASCII range., and this give room for letters used in most other Western European languages as well.</p>
<p>Thus you can use a single character code, called Latin 1, even for a text containing a mixture of English, French, Spanish and German, because these language all use the Latin characters with relatively few additions. However you quickely run out of numbers if you try to cover too many languages within 256 characters. For this reason, different character codes were developed. For example, Latin 1 is for Western Europian Languages, Latin 2 for several languages spoken in Central and Eastern Europe, and additional character codes exists for Greek, Cyrillic, Arabic etc. Character codes that use only the code numbers from 0-255 are called 8-bit codes, since such code numbers can be represented using 8 bits. Things change when you need to combine languages in one document and the languages are fundamentally different in their use of characters.</p>
<p>As you know the numbers of characters needed for Chinese and Japanese is very large. They just would not fit into a set with only 256 characters. Therefore, different strategies are used. For example,2 byte (octects) instead of one might be used for some character. On the other hand the character codes developed for the needs of East Asian Languages do not contain all the characters used in the world.</p>
<p>The solution to such problems and many other problems in the world of growing information exchange, is the introduction of a character codes that gives every character  of every anguage a unique number. This number does not depend upon the language used in the text, the font used to display, the character, the software, the operating system ar the device. It is universal and kept unchanged. The range of possible number is set sufficiently high to cover the current and future needs of all languages.</p>
<p>The solution is called <em>Unicode</em>, and it gives anyone the opportunity to say, &#8220;<em>I want this character displayed and the number is&#8230;.</em>&#8220;, and have herself understood by all systems that support Unicode. This does not always guarantee a success in displaying the character, due to lack of suitable font, but such technical problems are manageable. Much widely used softwares including Windows, Mac OS X, and Linux, has supported Unicode for years. However, to use Unicode , all the relevant components must be &#8220;Unicode Enabled&#8221;.</p>
<p>Resources for Further Study :</p>
<p><span style="color:#333399;">http://www.unicode.org/ </span></p>
<p><span style="color:#333399;">http://en.wikipedia.org/wiki/Unicode </span></p>
<p>-<strong><span style="color:#ff6600;">Abhishek Singh</span></strong></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Question of the Day: SQL Server Central]]></title>
<link>http://enggtech.wordpress.com/2009/10/20/question-of-the-day-sql-server-central/</link>
<pubDate>Tue, 20 Oct 2009 13:11:00 +0000</pubDate>
<dc:creator>Visitor Blogs</dc:creator>
<guid>http://enggtech.wordpress.com/2009/10/20/question-of-the-day-sql-server-central/</guid>
<description><![CDATA[We need to define a column that will contain externally provided identifiers. Their length will vary]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>We need to define a column that will contain externally provided identifiers. Their length will vary from seven to ten characters, all English letters, both upper and lower case mixed with numeric digits, with an even distribution of lengths.</p>
<p>Which of these data types will be more efficient?</p>
<p class="byline">By John Arnott</p>
<p><strong>Answer: </strong>char(10)</p>
<p><strong>Explanation: </strong>A varchar (or nvarchar) column carries two bytes of overhead to hold the actual length of the data. Since the average size of the data is 8.5 characters &#8212; (7+8+9+10) / 4 &#8211;, a varchar data type would take an average of 10.5 bytes for each entry. A column defined as char(10) needs just 10 bytes. As the column will hold only English language letters and digits, we don&#8217;t need the additional bytes required by the national character data types. They use two bytes for each character, so would take 20 bytes for nchar or 19 bytes for nvarchar (average lenghth 8.5 doubled for national characters, then add two bytes to hold the length).</p>
<p>» <a rel="nofollow" href="http://www.sqlservercentral.com/links/1066535/126018" target="_blank">Discuss this question and answer on the forums</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[How to convert unicode code point to the character (binary) with PHP]]></title>
<link>http://adrenalinexp.wordpress.com/2009/10/18/how-to-convert-unicode-code-point-to-the-character-binary-with-php/</link>
<pubDate>Sun, 18 Oct 2009 12:02:27 +0000</pubDate>
<dc:creator>Adrenalin</dc:creator>
<guid>http://adrenalinexp.wordpress.com/2009/10/18/how-to-convert-unicode-code-point-to-the-character-binary-with-php/</guid>
<description><![CDATA[You want to display a unicode code point as the char it actually represent ? For example, display fo]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>You want to display a unicode code point as the char it actually represent ?</p>
<p>For example, display for <a href="http://www.eki.ee/letter/chardata.cgi?ucode=00CE">U+00CE</a> the Î character it represent. (here is the list of all <a href="http://www.eki.ee/letter/chardata.cgi?lang=ro+Romanian&#38;script=latin">romanian special characters</a>)</p>
<p>Quite strange, I didn&#8217;t find a ready to work code for that instantly (as I usually do) <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>For our task, we will need the berlioz&#8217;s <a href="http://php.net/manual/en/function.unicode-encode.php#79829">unicode2utf8</a> (that support 4 bytes utf, initialy I got a function that supported only 3 bytes and got errors, if you need 6 bytes support, see the <a href="http://php.net/manual/en/function.utf8-encode.php">Unicode_to_UTF function</a>) function.</p>
<p>Here is the trick, unicode2utf8 requires as argument an integers, 00CE in our example (and the unicode notation) is hex, everything we need to do is to apply the hexdec function.</p>
<p>Php code:</p>
<pre class="brush: php;">
echo unicode2utf8(hexdec(&#34;00CE&#34;)); // Result: Î

// Or the function that will recognize U+ in front of string, and will skip it to show the character
function unicodeCodePointToChar($str) {
	if (substr($str,0,2) != &#34;U+&#34;) return $str;
	$str = substr($str,2); // Skip U+
	return unicode_to_utf8(array(hexdec($str)));
}
echo unicodeCodePointToChar(&#34;U+00CE&#34;); // Result: Î
</pre>
<p>Why I would ever need that, you will ask, well I need to implement sphinx&#8217;s charset_table convert logic on the user&#8217;s string. Here&#8217;s the map I used, kindly provided by <a href="http://beginrescue.blogspot.com/2008/05/sphinx-charsettable-with-unicode.html">someone</a> on a pastebin.</p>
<p>So if a user search for &#8220;bălan&#8221;, he will actually find both &#8220;bălan&#8221; and &#8220;balan&#8221;.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Compilation of resources for migrate to Delphi 2009/2010 Unicode]]></title>
<link>http://theroadtodelphi.wordpress.com/2009/10/22/compilatation-of-resources-for-delphi-20092010-unicode/</link>
<pubDate>Fri, 23 Oct 2009 03:36:39 +0000</pubDate>
<dc:creator>Rodrigo</dc:creator>
<guid>http://theroadtodelphi.wordpress.com/2009/10/22/compilatation-of-resources-for-delphi-20092010-unicode/</guid>
<description><![CDATA[There are many resources available that you can read and that you will assist in the migration from ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>There are many resources available that you can read and that you will assist in the migration from a old Delphi version to Delphi 2009/2010 (Unicode).</p>
<ul>
<li><a rel="nofollow" href="http://cc.embarcadero.com/item/27398">Unicode Migration Statistics Tool</a> (This utility will hopefully assist you in collecting useful statistics on how hard (or not) it would be to migrate your older applications to Unicode.)</li>
<li><a rel="nofollow" href="http://www.jacobthurman.com/?p=30">Delphi 2009 and Unicode</a></li>
<li><a rel="nofollow" href="http://compaspascal.blogspot.com/2008/10/delphi-2009-strings-explained-by.html">Delphi 2009 strings explained by example</a></li>
<li><a rel="nofollow" href="http://compaspascal.blogspot.com/2009/05/upgrading-major-project-to-delphi-2009.html">Upgrading a major project to Delphi 2009</a></li>
<li><a rel="nofollow" href="http://edn.embarcadero.com/article/images/38980/Delphi%5Fand%5FUnicode.pdf">Delphi and Unicode</a></li>
<li><a rel="nofollow" href="http://www.drbob42.com/examines/examinA7.htm">Dr. Bob Delphi 2009 Unicode</a></li>
<li><a rel="nofollow" href="http://chrisbensen.blogspot.com/2008/09/delphi-2009-unicode-in-type-libraries.html">Delphi 2009 &#8211; Unicode in Type Libraries</a></li>
<li><a rel="nofollow" href="http://blog.marcocantu.com/blog/strings%5Funicode%5Fdelphi%5F2009.html">On Strings and Unicode in Delphi 2009</a></li>
<li><a rel="nofollow" href="http://edn.embarcadero.com/article/38437">Delphi in a Unicode World Part I: What is Unicode, Why do you need it, and How do you work with it in Delphi?</a></li>
<li><a rel="nofollow" href="http://edn.embarcadero.com/article/38498">Delphi in a Unicode World Part II: New RTL Features and Classes to Support Unicode</a></li>
<li><a rel="nofollow" href="http://edn.embarcadero.com/article/38693">Delphi in a Unicode World Part III: Unicodifying Your Code</a></li>
<li><a rel="nofollow" href="http://go2.wordpress.com/?id=725X1342&#38;site=wiert.wordpress.com&#38;url=http%3A%2F%2Fwww.xs4all.nl%2F%7Ejp%2Fblog-dl%2F2009%2FCodeRage4.2009%2F3.Using-Unicode-and-Other-Encodings-in-your-Programs-attendee.zip">CodeRage 4 : Using Unicode and Other Encodings in your Programs</a></li>
</ul>
</div>]]></content:encoded>
</item>

</channel>
</rss>
