Chinese Input method editors
bababardwan
June 13, 2010 at 12:49 AM posted in General DiscussionI'm guessing like most poddies I've been using pinyin input editors to write hanzi. This is fine most of the time except when one is looking for a rare character under an entry like yi that can have several dozen entries to look through. One trick I've learnt is if you know a larger word that such a character may be part of ...just type that to find it which usually gives a much shorter list and then just delete the bits you don't want. But anyhow,I digress. I've decided it might be an idea to see what I can learn from other input method editors. Pinyin only deals with the phonetics but I'm interested in methods that focus more on the structure of the character. To this end I've just been reading more about cangjie which actually looks pretty simple and I've tried it briefly in an online version. Where I'm a little stuck is getting a canjie ime installed on my windows 7. Articles I've read suggest that canjie should be already available on most computers without having to download anything. I've gone into the language section of the control panel to tick the box for some other keyboards but couldn't find cangjie as an option [perhaps it was only on older versions of windows?] . I did find options like Hong Kong for example and as I've read that cangjie is a favoured method there I ticked the HK box and it is now showing up in my language bar but doesn't seem to work...it still just writes in English. Does anyone have any tips on how to get cangjie running on windows 7? I still have a vista drive I could use if it's a version issue. This post would be good to discuss who is using ime's other than pinyin input and opinions on those ime's.
bababardwan
June 14, 2010 at 09:39 AM
hehe ,very good RJ, I like it. Dangran he's the guy who's been responsible also for me mucking up my tones and unwittingly saying embarrassing things.
RJ
June 14, 2010 at 09:35 AM
All of the above hk, but lately I just learn them as they come up in lessons. It doesn't matter because no matter how many characters you know, when the time comes to read something important, it will no doubt be made up of all characters you don't know. This is one of Murphy's laws. "The Hanzi corollary." Beware.
hkboy
June 14, 2010 at 06:40 AM
You guys have learned so many characters and post in Chinese. Are you learning the characters by going through the cpod lessons or do you have a system - book, skritter, etc.?
bababardwan
June 13, 2010 at 09:17 PM
thanks RJ. Rather than learning the code for characters in that way though I'm approaching it from the opposite angle. That is see if I can work it out myself and then use mdbg if I choose one of the mnemonics wrong. I don't want to memorise a whole set of codes but rather learn how to use the system [but frequent use would no doubt mean a lot of the higher frequency codes would just be memorised anyhow]. Really not sure if I'll stick with this for long or not though. I guess this programme could be an easier fallback than mdbg though so thanks again.
bodawei
June 13, 2010 at 01:08 PM
Barbs
Your thirst for knowledge is pretty damn impressive. I have almost no idea of what you are talking about. :) I am still trying to perfect the input system on my mobile phone, the standard pinyin based one. I am curious about the 5 stroke input system called (I just looked it up) 五笔画 , but there aren't enough hours in my day to make it a serious study. I haven't ever seen a native speaker use it, but they are incredibly fast with the pinyin input system.
I see your point about unusual characters, or perhaps the ability to use both pinyin and cangjie would cover all bases.
bababardwan
June 14, 2010 at 06:38 AM
hang on. Confusing nomenclature. What are you guys using ?:
wubizixing input method [五笔字型输入法] aka wubi xing aka wubi aka Wang ma
OR
wubuhua [五笔画] ?
....sounds like it might be the latter,对不对?
bababardwan
June 14, 2010 at 06:27 AM
here's an online page to practice wubi:
http://www.yuzi.net/tool/Import.htm
bababardwan
June 14, 2010 at 06:13 AM
No problems at all. I'm finding it really interesting that a few of you are using it on your cell phones. I draw the characters on my iphone for sms'ing but have found very little use for it really at this stage living here. I have iced dictionary on the phone and it uses pinyin imput. Are you using wubi for sms'ing or what?
xiaophil
June 14, 2010 at 05:47 AM
Baba
I really wish that I could tell you, but it came preinstalled on my cell. Perhaps you can contact your cell phone manufacturer?
By the way, this dictionary is prett plain Jane. It only gives definitions, no examples, and it is often stumped by idioms. But it is fast when you get used to it.
bababardwan
June 13, 2010 at 09:03 PM
I'm really keen to give this one a shot now. I'll look up how I get it installed unless you or anyone else has a quick tip on the best way to go about installing it.
bababardwan
June 13, 2010 at 09:01 PM
thanks Jason. It's great to get everyone's opinion on this. :)
bababardwan
June 13, 2010 at 08:59 PM
sorry mate,I was just trying out a couple of the ime's and making sure it worked by posting it....it's mostly gibberish. Yeah, I was intending to look into wubi as well. Good suggestion.
bodawei
June 13, 2010 at 04:07 PM
I use my phone for calls and texting, not as a dictionary (mainly because it only works as English-Chinese, optimised for the native Chinese speaker). I use my electronic dictionary mostly these days and will write the character if I can rather than use pinyin (to improve my writing), although as xiao_phil says the pinyin is sometimes faster (depends on the character.) I'm guessing that 5 stroke input is always or at least mostly slower than using an electronic dictionary, and slower than entering pinyin for texting. If I forget the pinyin I find another word - bit of a cheat. Also the pinyin entry is pretty intelligent, throwing up the next likely character.
But I see that it could be satisfying to have 5 stroke input as an option - this has encouraged me to practice. Well, learn it first, then practice.
xiaophil
June 13, 2010 at 02:59 PM
I never tried the canjie, so I wouldn't be able to comment like baba, but wubi is not nearly as hard as I thought it would be. I still use pinyin for inputing if I know the pinyin, though.
RJ
June 13, 2010 at 02:55 PM
yes this canjie is not so intuitive in some cases. I was just about to ask if wubi wouldnt be more useful. Im thinking that one makes more sense for us.
Isnt it 五笔划?
xiaophil
June 13, 2010 at 02:50 PM
Yes, you beat me to it! I was just about to write something very similar. I find that learning 五笔画 has been one of the most useful things I have done while studying Chinese. After learning it, I generally use my phone dictionary much more than my little electronic dictionary with stylus because I find it to be much quicker and less cumbersome. The other benefit is that it is like a poor man's Skritter. My stroke order has vastly improved since using this input method. In the beginning it was somewhat hard because I didn't have an intuitive sense, and there were even times when I stubbornly sat in front of my cell for up to fifteen minutes trying to figure out the stroke order. I can't say I don't have problems anymore, but I usually can figure the stroke order on the first try.
JasonSch
June 13, 2010 at 02:39 PM
I've always found 五笔画 very useful for looking up characters of which I don't know the pinyin in my phone dictionary. It's actually not too difficult to learn, but a basic understanding of stroke types/order is essential. Once you've got that though, it's can be quite useful. (Of course, these days, with apps like Pleco and Nciku, you can just write in the character, but it's somehow less satisfying that way...)
bababardwan
June 13, 2010 at 09:10 PM
btw is it true they're still planning to phase out bopomofo in Taiwan?
bababardwan
June 13, 2010 at 09:09 PM
this bopomofo looks like it also has the potential to be fast as for a character like zheng [5 letters in pinyin] ..you would only need 2 keystrokes plus one for the tone [and as tone is included it must therefore on average cut down about 4/5 of your options ]. I guess how good the ime is at predicting is an important factor too. Google pinyin I found much better in this regard than microsoft pinyin.
bababardwan
June 13, 2010 at 06:13 AM
Ok,just solved one mystery. Changjie is under the Taiwan keyboard options and seems to work fine. I'd still be interested in others experiences with other input methods...both regular users and those who have just tried the other methods.
bababardwan
June 14, 2010 at 09:10 AM
thanks RJ, that's the same one that orangina linked to ,but it's great 'cos now I can read it twice ;)
RJ
June 14, 2010 at 08:44 AM
this guy posts a list of 40 - pretty similar
http://yolearnchinese.com/archives/40-most-common-radicals
bababardwan
June 14, 2010 at 06:07 AM
thanks heaps orangina. That's awesome. A really useful place to start I think. Looks like it'll fill in a couple of holes and is otherwise reassuring. I best just knuckle down and get on with the rest once I've polished up here.
orangina
June 14, 2010 at 05:48 AM
Exactly! This is what I was saying. And even though is is designed for Chinese users, I suppose this way is good for my learning in the long run. At least it has the pinyin when you look up the character.
xiaophil
June 14, 2010 at 05:42 AM
It is disappointing that phone dictionaries don't give pinyin when looking up English words (at least ours), but I guess that's because it really isn't intended for us, but rather Chinese users. I'm sure you have figured this out, but just in case, you can always look up the word using English, then input the result of that search, and finally, voila--there is your pinyin.
orangina
June 14, 2010 at 05:32 AM
I found this site.
Interesting discussion guys... I use 五笔画 occasionally on my phone also. I find my phone dictionary has many words pleco does not, but I wish it would include the pinyin when I look up a word in english. But at least with 五笔画 I have a fighting chance of figuring out unfamiliar characters. Maybe I should use it more often so I can get faster at it. But I wish it were 八笔画... it isn't like my phone doesn't have enough keys for that!
bababardwan
June 14, 2010 at 03:56 AM
thanks RJ. Do you have a link to the most common 43 radicals? Maybe I should start with them. I had been thinking of just starting with one stroke radicals, then 2 and so on.
I'd like to look at those common word lists too.
RJ
June 14, 2010 at 03:18 AM
I did see that and here you can download - its called simplified canjie
http://nats-www.informatik.uni-hamburg.de/~joseph/newton/input_methods/index.html
btw - the most common 43 radicals appear in 74% of the 5000 most common characters. I did make a point of learning these. A lot easier than learning all 214 and it goes a long way. The 5000 most common are those chosen by Ping-gam go in his book in this case. The actual numbers may vary based on which 5000 you choose but the principle is sound.
bababardwan
June 13, 2010 at 09:06 PM
btw I'd really prefer to go for option 3 [which didn't seem to be there and thus may have to be downloaded I guess]...namely speed cangjie. As you may have seen, with speed cangjie you only need to extract the first and last mnemonic and thus it's meant to be very fast.
bababardwan
June 13, 2010 at 08:51 PM
exactly RJ. Yeah, I saw both choices and went for new canjie also. You've also hit on one of the reasons I decided to look into it. Namely ,to date I have just been recognising characters by sight rather than analysing them,mainly because I haven't gotten round to learning the radicals. However as vocabulary builds and the differences between some characters is subtle I am mindful of some advice Pete gave a very long ago that recognising them by sight alone becomes difficult at some point. I think it's time to follow that advice and learn the radicals at least. I was also looking at the 4 corner method. Neither canjie or the 4 corner method specifically uses radicals necessarily but at least it does get one focussing on the structure of the character.
RJ
June 13, 2010 at 02:30 PM
ha - i just figured that one out and was about to let you know. Missed your "I found it" comment. There are two choices actually - canjie and new canjie. I chose new and will play some. I still have some questions re trad vs simplified. Im sure we can flesh all this out but I am thinking this system would help one learn characters. Before now I have always dismissed this as unnecessary. Lets play.
RJ
June 13, 2010 at 03:05 PMBaba
here is a program that will give you the canjie code if you highlight a character. May be useful in learning.
http://www.sfu.ca/~vwchu/cjexplorer.html
Chinese is such a bottomless learning pit. You have two choices. Learn less and less about more and more until you know nothing about everything, or you can learn more and more about less and less until you know everything about nothing.