Recent Topics

1 Jun 06, 2004 08:59    

In Chinese, a character is similar to a word in English, which sometimes has a meaning and sometimes it doesn't. A sentence in Chinese is made up of a series of characters.

Each post has the number of words in it shown next to the post title. If the entire post is in traditional Chinese (UTF-8), the number of word count is 0. That doesn't look quite right to me.

Character (or word) count in Chinese usually means counting the number of characters.

2 Jun 06, 2004 14:28

Interesting.

Is that for traditional or simplified chinese or both? (yeah I know I don't have a clue, you gotta tell me all about it!)

Also are there spaces and/or punctuation? (Those won't count as words I guess).

3 Jun 07, 2004 01:55

Yes, the same rule applies to both traditional and simplified Chinese.

Spaces and punctuations should not be counted as words.

You can see what I meant here: http://www.bauhiniatech.com/blogs/ Posts in English show the correct word count, but if the posts are in Chinese the word/character count is incorrect.

4 Jun 14, 2004 22:00

I'm noticing the same issue with my japanese posts.

With english posts, I'm seeing an accurate word count, but in Japanese, they are VERY inaccurate. I'll have posts that say there is only 1 word or maybe 5 words when the post is actually many lines long.

5 Jun 30, 2004 09:00

LoKa320 wrote:

I'm noticing the same issue with my japanese posts.

With english posts, I'm seeing an accurate word count, but in Japanese, they are VERY inaccurate. I'll have posts that say there is only 1 word or maybe 5 words when the post is actually many lines long.

I have a chinese blog and encount the same problem. LoKa320 found irregular word count. But actually it counted right because it counted english word only. if you have one english word and lots of japanese in a post, the word count will return 1. two english words, return 2; etc.

don't know how about other locales. So it, the word count, should really be named english word count, or something else.

6 Jul 10, 2004 04:25

if not counting words,how about a byte-count?
:oops:


Form is loading...