Sunday, January 11, 2009

Those iPhone Developers Might be the Funkiest, Schwaginist Geeks in Cupertino, but they Sure Ain’t No Linguists!

Dear gods of the cyberverse, please direct this blog-entry through the right routers,feed aggregators, RSS readers … anything it takes so that those great apple developers in the sky can start to get a taste of the multilingual HUMAN world outside of that apparently monoglot paradise of Cupertino.

I've had my iPhone for almost a year now. Someone actually gave me a 3G version, but I was so attached to my original phone that I sold the 3G version and kept the old one. You see apart from liking the metallic feel of it – something that you just can't find with handheld appliances these days, mine has a huge crack across the screen that I've grown rather fond of. (The crack happened when running to escape the Bangkok rain in mid 2008 – when the device slipped out of the slip-proof iPhone belt-holster that I'd purchased a few days earlier).

Why a Polyglot / linguaphile SHOULD own an iPhone

As we learned in Dale Carnegie Instructor training, when delivering bad news, START WITH THE POSITIVE AS A BUFFER! So here goes.

The first week I had my iPhone I wanted to throw the thing out the window. I'm a tinkerer – and not being an Apple person, I found the iPhone not really tinkerer friendly. Luckily within a few days I was lucky to find a team of enthusiastic iPhone hackers that whetted my palate with the possibilities that the iPhone holds for tinkerers. From that moment on, I started to take to the thing.

Podcasts, Language MP3's, Youtube and more

I quickly found many uses for it – it's actually changed my life. From ChinesePod.com to Japanesepod101.com to news podcasts in different languages to morse simulators to Youtube clips, the iPhone is a language learners dream! If you want to know someone, take a look at their iTunes library. I have MP3's and streams in over 40 languages in my iTunes that I sync with my phone which facilitates my learning every waking moment (that I'm not with clients of course … well, even then some)

Now with AV cables, I don't even have to take my computer to many events that I have to speak at / teach at. I can play the Audio / Video files straight out of my iPhone through the sound system / LCD of the seminar venue – and even record Presentations as video files and then use my iPhone as the presentation device rather than PowerPoint.

There are also some great apps out there that developers are creating for learning languages - one that I particularly like is JapaneseFlip (日本語フリップ) by Andre Khromov. Very simple but good looking App that helps you get through your JLPT levels.

3 Steps to Drive a Polyglot to go Insane and Throw their Freaking iPhone off a Cliff!

I can imagine what the project management meetings in the development process were like:

Project Manager: "Now guys, there are gonna be some folks out there that don't speak 'Merican, so we might have to have some way they can talk to each other in their own gobbledygook"

Development Team Underling: "Ahh… yes boss, I'll get on it. Those Chinese folks down on the corner at the restaurant seem nice, maybe they'll be able to help me get some of their lingo into this thing."

Project Manager: "Great idea! Just make sure you don't make it too easy for 'em. We don't wanna encourage too much of 'their language'… in the long run, better to have 'em speaking 'Merican like the rest of the world! We have those great 'Merican language modules that we use in our Mac OS's … just throw one of those under the hood and it should do the trick"

Development Team Underling: "Umm .. boss… what about people who want to communicate in more than one language at once?"

Project Manager: "Yeah right!" (sarcastically)

Development Team Underling: "Okie Dokie boss – consider it done!"

And so the iPhone language development team was born.

3 Areas of iPhone Linguistic Insanity

  1. Language Input
  2. Predictive Text
  3. Fonts

Language Input

Well, I have to hand it to the team. For the first models, there was only a handful of roman scripted languages that were catered for. Later firmware versions introduced more languages. I wish these US developers would realise that even in the US, you will have people using their devices who speak a whole array of languages. You don't need to release a phone in Thailand, Japan or China to warrant having Thai and Japanese input options. Give the options on installation / updating of the firmware as to what languages should be included – that way you don't eat up memory with unnecessary stuff and everyone is happy.

The way you switch languages is really cumbersome too! In a typical message sent out of my phone, I could have any combination of Thai, Chinese, Japanese, English, Indonesian, Spanish and Nordic languages being used. I just had a look through some of my SMS's and some of them use up to 4 languages in a given message.

Could I suggest a language selection button that will iterate if tapped normally through the input language list similar to the buttons that already open up to display additional characters when held. Rather than a straight tab of letters though, if held it would open up something like a daisy-wheel so you can slide your finger directly to the target language selection – when you release your finger, it's in that language input mode.

That way, if there are only 2 input keyboards running, you can do 1-tap iteration, otherwise people with more than 2 keyboards can also quickly switch between them without iterating through ALL OF THEM … often resulting in the thing hanging and then crashing. At the moment I have 8 keyboards active. So if I change to a language and back to English, a minimum of 9 clicks is needed. Not only that, because you're typing as fast as you can, you often get 'click happy' and iterate past the English keyboard and have to go through them all again. It's a bit 'Ground hog day'ish' guys!

Input Keyboard Layout

I understand that screen real-estate is scarce, but some of the excuses I've seen for Thai keyboards are appalling. Given, I don't think that most of the ones that I've been using have been developed by Apple themselves – most of them 3rd party apps from Eastern Europe, the developers were probably just concerned about getting so many letters into such a small space and knew nothing about typing habits of native speakers of those languages, common letters vs. uncommon letters etc.

Predictive Text

With the original firmware, there was a function in there that allowed you to turn the predictive text off. Since version 2.0, I'll be darned if I can find it anymore!??

Even for monoglots of English, people don't communicate in standard English. There have been papers after papers on the language of messaging, sms language, email English etc.

Here's an example in English:

Desired text:

Ud bttr c if ur gf's up to it, coz ur gona get it otherwise.

If I type that into my iPhone without going back each time and correcting it here's the result:

I'd byte CID ye FFA up to it, cox HR Bina get it otherwise

The funny thing… each time I tried to backspace and fix things up, things progressively changed and got worse (even more changes to what you see above!)

I don't know what these guys could have been thinking. We don't send text messages to be grammatically correct! We send them for efficiency. Not only that, the shorthand used communicates a multitude of emotions, relationships and other 'meta meanings'.

Indonesian Texting

I think Indonesians would have to be right up there with the Filipinos as the number one texters on the planet! Someone could be driving down the freeway at 130kmh, swerving between the cars and still get out an epic text message in a few seconds flat.

As with many other languages, SMS language in Indonesia is a language of its own.

As a linguist, it's fascinating to analyse what's going on. Normal language takes on almost semitic language principles – where words retract into fundamental vowel deprived roots, then inflated with different amended affixes as needed.

Here's another call out to the developers…

INDONESIANS USE THE ROMAN ALPHABET TO TYPE BAHASA INDONESIA…. So there's no need to change a keyboard. The English keyboard would do just fine if there wasn't any freakin' predictive text there to botch it all up!

Not only do Indonesian's use the Roman alphabet, many of them can speak languages other than Indonesian!... so in a given message, you might have Bahasa Indonesia, Sundanese, Javanese, Mandarin, Hakka and English!

But let's get back to Indonesian SMS'ing. Here's a sample sentence:

SMS Bahasa Indonesia

Luh !... lg ngapain skrg? Skr aq lg ngopi2 ama tmn. Mo bareng2 ama kt gak?

SMS on iPhone after Predictive Text interferes

Lun ! of ngapain sir ? sir aa of NGO 2 AMA Tbk . Mo naren 2 AMA it Tak?

Colloquial Bahasa Indonesia

(of original message)

Elu! Lagi ngapain sekarang? Sekarang aku lagi ngopi-ngopi sama teman. Mau bareng-bareng sama kita enggak?

Standard Bahasa Indonesia

(of original message)

Hai! Sedang apa sekarang? Sekarang saya sedang duduk-duduk minum kopi dengan teman saya. Apakah anda ingin berjalan-jalan dengan kami?

English (with poetic license)

Hey dog! What'cha up to? I'm sitting here chillin' (over coffee) with a couple of friends. You wanna come out with us?

Thai and other languages

This applies to ALL languages. Sometimes depending on someone's phone and language upbringing, Thais might send a message in Thai – or write their version of Thai using the Roman alphabet. There is no standard to the spelling used, so developing a dictionary would be useless. Each person has their own 'flavour'.

It gets worse, the predictive text started to think it would be smart by 'learning' Thai letter order patterns. Well it thought it did… it applied 'English' logic to Thai and the results were terrible. Vowels and consonants were all swapped around to make the language look totally alien.

Other notes on Input Languages

The technology is out there guys… so how bout it…add a few languages into the iPhone official 'flock' – Arabic, Thai, Hindi / Devanagari, Tamil and other scripts that although they might be a little foreign to Cupertino, there are still billions of people outside of that 'hood' that use the languages.

Fonts

My last note is on Fonts.

On my Window's system, I love the MS Arial Unicode font – although it lacks some scripts like Burmese, Khmer, Tibeten and several others, it is fantastic in that the point size for Thai, Chinese, Arabic and Latin are all the same. I can even call a whole range of IPA characters up and they'll display on any page.

The iPhone's font mapping is bizarre with languages like Arabic. Arabic is a complex class script where the letters' forms will change depending on where they are in the word, with diacritics above and below the letters. For some reason, when reading pages on the web in Arabic / Urdu / Farsi etc, only the initial position letter forms are used. This again, makes the language look extremely bizarre. While it's possible to make out what the words are (kinda) it's very cumbersome and not in line with that slick 'Apple' image that I could imagine that you guys would be trying to shoot for.

APPLE EXECUTIVES TAKE NOTE

You are about to launch the iPhone here in Thailand legally. I'm not sure what you're doing in the way of your 'official' Thai input option, but if I were you, I'd seriously consider giving the Thai language issues an overhaul.

I'm happy to hire my consulting services out to you to make sure you don't piss the SE Asian texting community off any more than they already are.

Actually – I've changed my mind … hire me and we can sit down over a coffee in Cupertino and give the WHOLE FREAKIN' LANGUAGE thing an overhaul.

I've walked into Apple stores all over the world over the past couple of years and challenged the salespeople to convert me to a Mac user. I need a system that will allow me to use all the languages I use with ease – Microsoft actually got the whole predictive text thing right with Mandarin and Windows leaves Mac's OS in the dust when it comes to speed of typing up a Chinese document.

Both Apple and Windows do a dismal job in allowing us to type in Cantonese and other Chinese languages / dialects. I'm honestly surprised that one of the over a BILLION speakers of non-Mandarin Chinese languages hasn't mentioned it to you before.

CONVERT ME! MAKE ME A DISCIPLE! No one has been able to do it yet - and I still sit here with my PC - Linux and Windows.

Share on Facebook

Friday, January 02, 2009

Jazz Lessons on Language - Improvisation 101 - Stuart Jay Raj’s Indic Script Compass

Find Your Way Around Indic Based Writing Systems (Devanagari, Tamil, Panjabi (Gurmukhi), Burmese, Bahasa Bali (Balinese), Bahasa Jawa (Javanese), Thai and Khmer (Cambodian) Without a GPS

Download Full A3 Size Chart Here

Theme and Variation

The reason I love playing and listening to jazz so much is because you're able to get into the soul of the players by hearing how they improvise on a theme. Language is not that much different.

I've noticed what could be described as almost an unquestioning deific reverence and fear paid to the writing systems of Indic language systems by their users (and learners of them). Whether it's the Devanagari script used to write amongst other languages, Sanskrit, Hindi, Marathi and Nepali, or the scripts of other languages in India and surrounding countries like Panjabi, Tamil, Telugu or as far down as Southeast Asia including Thai, Lao, Burmese, Khmer (Cambodian), Balinese orJavanese, when you start to scratch the surface of the origins, grammar or even just the reasoning of certain things that would seem strange to outsiders, the eyes of your average Joe who's spoken one of those languages since birth will tend to glaze over. (Now THAT was a long sentence!)

That's the Problem with Turtles… It really shouldn't be that way!

Those Indians were pretty cluey back in the day. They developed an alphabet thousands of years ago that was a virtual GPS for the human mouth!

Just like the theme and variation in Jazz, all the languages that I've mentioned above (and many more) have used this basic map as their base - a reference guide and then coloured it with the sound filters of their own respective languages.

To draw an analogy from English, take a common sentence that we use every day like:

'That's the problem with turtles'

This is the 'base' or 'theme'. A variation might have those same letters read as:

'Dat's da pwoblem wif toitles'

For many, the whole concept that 'Dat's' had come from 'That's' was never explained, leaving not only learners, but also native speakers of the language in a haze as to 'why' certain things are about the language and just accepting that it's something beyond the comprehension of mere mortals.

Part of my Mnidcraft course is to get a good grounding in the principles of Indic based scripts and develop some level of proficiency in identifying several the scripts including Devanagari, Thai, Khmer and Burmese.

I developed my 'Indic Consonant Glyphs' and the 'Stuart Jay Raj's Indic Script Consonant Compass' as a tool to accelerate the acquisition / learning of scripts based on the Indic sound system themes and variations.

Don't Count The Letters!

I'll often hear people measuring the complexity of a language by the number of letters in its alphabet. This to me is absurd as the practice of judging a presentation's quality by its PowerPoint slide count! (Another pet hate of mine… which led me to designing my 'Present!' presentations workshop who's subtitle is 'Bringing Presentations Back to The Presenter').

It's this body count (or in this case consonants + vowel count) that has turned many a learner off from learning languages like Thai, Sanskrit, Khmer and Burmese.

What Language Would You Prefer to Learn?

Here's the blurb….

Wadafrackizet

Soowizy

Welcome to your first day of learning Wadafrackizet!

Our language only has 18 letters in its alphabet, but they're used to write a total of 90 different sounds give or take a few.

The order of the alphabet has no specific logic to it, nor do the shapes of the letters really mean anything.

Not only that!... Wadafrackizet has an extremely comprehensive set of rules that you must follow.

Well, the general rule of thumb is that you follow those rules with the exception of the times that you don't follow them – which happens quite often.

The joy of learning Wadafrackizet is finding out just when those times are that the rules aren't applied by yourself.

Welcome to your first day of learning Soowizy!

Soowizy's writing system follows an ingeniously simple line of logic that plots each letter in a systematic fashion to key points of the mouth, one syllable at a time.

This way, you don't even really need to remember an 'alphabetical order' so to speak. Where the letter comes from in your mouth is where it lies in the alphabet.

Not only that, many of the letters actually 'look' the way they 'sound'!

The good news is that once you've learned this very simple system based on only 5 points of the mouth , you can tweak it here and there enabling you to learn over 50 other languages that work on the same operating system!

I don't know about you, but I don't think I'd be lining up to learn Wadafrackizet! (Ok, I admit it – I probably would)… but honestly, Soowizy sounds just so … EASY!

The Cardinal Points of the Mouth

As you read through the brief explanation to each of the following glyphs, place your tongue / mouth in the position and hold it there. Once you can do it for each glyph, move your gaze from one glyph to another at random and practice locking your mouth into the correct position according to the glyph that you're looking at.

Drilling yourself for about 2 minutes like this should be enough to start to develop some long lasting muscle memory.

Gutteral

Palatal

Cerebral

Dental

Labial

Back of the throat

Flattened Tongue on the Roof of the Mouth (Palate)

Rolled Back (Retroflex) Tongue on the Roof of the Mouth

Tongue on Teeth

Lips

대한민국! (Dae Han Min Kuk!)

Now, I know the Korean speakers out there must be thinking "Hmmm… some these symbols seem eerily familiar!".

Koreans are very proud of their writing system 'Han Geul' – 한 글 .. . and so they should be. Developed by King Sejong the Great (1418-1450) in 1444, it is one of the easiest scripts to learn on the planet! (I learned it on a bus ride across Seoul one sunny afternoon in 1996J ).

To add value to learning the Korean alphabet, I suggest spending an extra 10 minutes to learn how to touch type in Korean also. The keys are very logically set out between:

  • Left hand: Consonants
  • Right hand: Vowels

Each syllable in Han Geul is broken into a beginning (initial) sound, vowel and final sound component (if necessary). The symbols used to represent each of the sounds were based on the various parts of the mouth from which they were produced.

Korea Meets India

I have combined this principle with the basic principle of the Indic Scripts 'Map of the Mouth' logic to develop my own symbols.

It's interesting to note that the sound for 'r' / 'l' in Korean is uncannily similar to many of the symbols used to represent these same sounds in Indic (Brahmi) based scripts that display a 'rolling tongue' in one rendering or another. When I'm learning a new Indic script, this is actually one of the first memory points I will look for – I will peg the similarity or difference of its 'r' sound to the ones that I already know. I can't include all the samples in this text based medium as the fonts probably won't render properly on your computer. You can see them on the main 'Indic Consonant Compass' chart though.

Another shape that is very similar include the 'base voice' symbol which in many scripts is a circular shape of some sort.

The 'y' symbol which in Korean is normally 'two prongs' heading in the vowel direction. In the Indic scripts, the 'y' sound is usual a 2 or 3 pronged shape which I imagine is a representation of the way the tongue interacts with the palate.

Here is a sample:

Comparison of Similar Sound Shapes in Korean, Devanagari, Thai, Tamil and Gurmukhi (Panjabi)

Sound

Korean

Devanagari

Thai

Tamil

Gurmukhi

r

ya

The Key Actions of the Mouth

Stopped Throat

Aspirated

Voiced

Voiced Aspirated

Nasal

Consonant Starts with the Throat Closed

Throat opens and puffs air or a 'h' sound over the consonant. In some languages, the 'h' isn't as accented as others

Voice Resonates Over the Consonant

Voice Resonates Over the Consonant AND puffs air over the consonant (which opens the throat)

Sound is Directed Through the Nose

Semi Vowel

Sibilant

'H' Aspirate

Voice Base

Not Quite a Consonant Not Quite a Vowel –

Letters in this category are 'fluid' versions where 'full contanct' isn't really made with the 'cardinal point'. Think of it in English – is 'y' REALLY a consonant? (despite what your teachers told you)... or is it a vowel?

'S' Hissing Sound

'H' Sound

Open Throat

Root Symbol that Signifies the Voicebox

Colour Coding

I have also colour coded each category within the sound system to give a colourful representation of the textual glyph. There are 5 base colours that correspond to each of the 5 cardinal points of the mouth (see above).

When comparing the sound shifts from the original base letter to the target language letter, you can either follow the 'glyph' transliterations in the octagon adjacent the target letter, or just look at the colour changes. Getting a visual and emotional representation of these sound shifts through colour is another device that I find really useful in 'embedding' the language within me.

Ready to Go!

"Stuart Jay Raj's Indic Script Consonant Compass v1.0 – Southeast Asian Version"

There were so many languages to choose from. I broke this chart down to 8 scripts that I think would cover a good portion of the globe. The languages / scripts with my reasoning for choosing them are:

Language

Reason

Devanagari

  • Devanagari is used to write a slew of Indic languages including Hindi, Marathi, Nepali and has also become the standard that Sanskrit is rendered in – although Sanskrit writings can commonly be found in local scripts like Tamil, Sinhalese, Burmese, Thai, Khmer, Balinese etc.

  • Given Devanagari's popularity and the fact that it has letters to represent each of the base sounds, I chose it as the base script.

Tamil

  • Tamil is not only popular in India, Sri Lanka and other surrounding countries, but it is also very widely spoken in Singapore and Malaysia (even reaching official language status in Singapore)

  • Given that I am in Southeast Asia, I wanted to develop this chart in particular to help as bigger cross section of the SE Asian community as possible. I think it would be great if this chart could spur people living in countries where Tamil is prevalent to have a go at learning the script. It will open up a whole new world around them.

  • Another reason that I chose Tamil is that it is from the Dravdian stream of languages – and is indeed a very ancient script. You will notice that it's different from the other scripts in the chart in that there are very few base letters. The natural rules that govern how sounds change when preceding / followed by another letter will 'colour' the letter to give it different sounds – Sandhi. – E.g. in English – 'What did you do' is pronounced 'wo dije du' (very rough transliteration without IPA fonts!)

  • Tamil is the 'What did you do' version where the others are more the 'wo dije du' versions.

Panjabi

  • This is the Gurmukhi script used to write Panjabi. There are many Panjabi speakers all over Southeast Asia, so I thought it would be nice to include this script. In Thailand, there are many Thai born Indians that speak Panjabi at home, but cannot read the Gurmukhi script. Now with this comparison chart, I hope that the links to Thai can be made and again, spur some people that otherwise wouldn't onto learning the script … which will in turn open a new world up in the area of literature, religion, culture and getting in the good-books with Grandma and Grandpa!

Burmese

  • Another SE Asian Language. There are fascinating things that happen phonetically with Burmese. I have really been getting into learning Burmese lately and am loving every minute of it. The script is very easy to learn once you have one of the other scripts down – and understanding what's going on phonetically vs. script will go a long way to help you start to get the language 'into your body'.

  • There are also many Burmese in Thailand that cannot read or write Thai. Given the right guidance, it shouldn't take more than a few hours for Burmese who have lived in Thailand for any amount of time to start to link the Burmese and Thai Scripts together and open their eyes up to even more of the world around them.

Balinese

  • Balinese is a beautiful script… not that the others aren't! Sadly, it's a dying art in Indonesia. While 'Bahasa Daerah' – regional languages are taught in Primary school, I have found very small percentage of people that are really proficient at reading and writing Balinese. Hopefully this will help a resurgence J

  • Another reason that I've included both Balinese and Javanese is so that Balinese and Javanese speakers can see the similarities in the scripts! In my humble opinion, the two scripts are basically different fonts of the same script. While Balinese is very fluid and has more of an 'ancient free' feel, Javanese is more 'boxy' and looks more suitable for machine printed media.

Javanese

  • I love Javanese – and part of the language of course is the script.

  • Apart from that, I have pretty much the same reasoning for including Javanese as I did Balinese.

Thai

  • After having taught Thai for many years, the Script still takes the number one position for learners' 'obstacles in learning Thai'. I have put this together as one more aid for demystifying the script and in doing so, helping learners of Thai get over a big hurdle.

  • I wrote a post a couple of years ago that was a Fast-track guide for Indians to Learn Thai. This chart could be thought of as an upgrade to that to Indians living, working or looking at coming to Thailand.
  • Once you get this structure down in you subconscious, tone rules will become second nature!... it's built into the system!

Khmer

  • Being a neighbour to Thailand, much of Thailand's language, culture and customs have flowed from similar streams as Cambodia.

  • Just take a look at any temple in Thailand, or on the roof of most Taxis driving around Bangkok, you'll often see Sanskrit and Pali writings in the Mon / Khmer script. Despite this, very few Thais can read the script.
  • Just like the tone rules for Thai are part and parcel of this 5x5 structure, so too are the 'consonant classes' of Cambodian. Where the Mid and High classes for Thai are the first and second rows + the 'h', sibilants and base throat, the Cambodian 'oo' and 'oa' classes are broken down almost identically.

  • This is my little effort in 'bridging the gap' :)

Download "Stuart Jay Raj's Indic Script Consonant Compass v1.0 – Southeast Asian Version"

As my New Years gift to you for 2009, rather than giving you a cruddy jpg or bmp version (like you see above) and waiting for you to come to my Mnidcraft workshop to receive a full high resolution version, I have included a PDF version of my "Indic Script Consonant Compass – v1.0 – Southeast Asian Version". I have created it using vector graphics, embedding the fonts where possible and expanding any other fonts that could not be embedded. What this means is that you are free to print out the chart to enormous sizes and stick it all over your home, office, toilet and any other place you see fit. Getting one or more of these scripts under your belt is an investment that you won't regret.

Just the Consonants Ma'am

Remember, these are only the consonsants... and admittedly, there are a couple of rare consonants in each script that I haven't included on the chart as it would in my opinion add unnecessary complexity to learning the scripts. There is also some funky stuff happening with the Tamil that I'm working at ironing out in future versions. If there are any Tamil experts out there, drop me a line!

You will also notice that I haven't touched on vowels. … stay tuned… that will come in a later installment.

I also have some surprises planned for this whole 'Indic' kick that I'm on. If time permits, I'm sure that you will see the bizarre fruits that my brain has borne in the near future.

Good Luck!

Share on Facebook