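By the way, in the Save As dialog of my editor, EmEditor, I can choose UTF-8 or UTF-7, but also Unicode and Unicode big endian. I thought UTF-8 was Unicode, so why is it split into several choices? What exactly is the difference between these four? Can any expert explain? Also, for the three options other than "UTF-7" there is an extra checkbox, "Add a Unicode signature (BOM)". What is that for?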
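A small sketch may help with this question. It assumes the file is saved as UTF-8 and that the mbstring extension is loaded, and it assumes that "Unicode" and "Unicode big endian" in the editor's list mean UTF-16LE and UTF-16BE, which is the usual Windows naming. The helper `strip_utf8_bom` is a hypothetical name introduced only for illustration. All of these are encodings of the same Unicode code points; they differ only in the bytes written to disk, and the BOM is a short byte signature at the start of the file that tells a reader which one was used.

```php
<?php
// Assumption: this source file is UTF-8 and mbstring is available.
$text = '中'; // U+4E2D

// Encoding name => the BOM bytes an editor may write in front of the text.
$encodings = [
    'UTF-8'    => "\xEF\xBB\xBF",
    'UTF-16LE' => "\xFF\xFE", // usually what Windows editors call "Unicode"
    'UTF-16BE' => "\xFE\xFF", // usually "Unicode big endian"
];

foreach ($encodings as $name => $bom) {
    $bytes = mb_convert_encoding($text, $name, 'UTF-8');
    printf("%-8s BOM=%s text=%s\n", $name, bin2hex($bom), bin2hex($bytes));
}

// Hypothetical helper: drop a UTF-8 BOM if the string starts with one.
function strip_utf8_bom(string $s): string
{
    return strncmp($s, "\xEF\xBB\xBF", 3) === 0 ? substr($s, 3) : $s;
}
```

For PHP scripts saved as UTF-8 it is usually safer to leave the BOM option off, because the three BOM bytes in front of `<?php` are sent to the browser as output.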
Strange. I have clearly enabled php_mbstring.dll, but I still get this error:

Fatal error: Call to undefined function: mbstring()

phpinfo() shows that it is already supported:

mbstring
Multibyte Support                      enabled
Japanese support                       enabled
Simplified chinese support             enabled
Traditional chinese support            enabled
Korean support                         enabled
Russian support                        enabled
Multibyte (japanese) regex support     enabled

mbstring extension makes use of "streamable kanji code filter and converter", which is distributed under the GNU Lesser General Public License version 2.1.

Directive                        Local Value   Master Value
mbstring.detect_order            no value      no value
mbstring.encoding_translation    Off           Off
mbstring.func_overload           0             0
mbstring.http_input              pass          pass
mbstring.http_output             pass          pass
mbstring.internal_encoding       ISO-8859-1    no value
mbstring.language                neutral       neutral
mbstring.substitute_character    no value      no value
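A likely reading of this error, judging only from the message: "mbstring" is the name of the extension, and the functions it provides are prefixed with mb_ (mb_convert_encoding is named elsewhere in this thread). As far as the PHP manual documents it, there is no function called mbstring(), so the call fails even though phpinfo() reports the extension as enabled. A minimal check, as a sketch:

```php
<?php
// Is the extension itself loaded? (This is what phpinfo() reports.)
var_dump(extension_loaded('mbstring'));

// The extension name is not a callable function...
var_dump(function_exists('mbstring'));

// ...but the mb_* functions are available once the extension is loaded.
var_dump(function_exists('mb_convert_encoding'));
```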
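iconv("gbk","big5","深圳市南山区") drops the final "区" from the result :( It looks as if the last Chinese character always gets lost. Why?
I tried printing the strings directly without converting them and found that the last Chinese character of the two strings above shows up blank (like a full-width space), but when I switch the display to a Simplified Chinese encoding the last character appears correctly.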
The two encoding parameters are in exactly the opposite order from iconv; the manual explains this in detail.
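To illustrate the parameter order, here is a short sketch. It assumes the remark is about mb_convert_encoding (the function named later in this thread), that the file is saved as UTF-8, and that both the iconv and mbstring extensions are loaded. UTF-16BE is used as the target only because its bytes are easy to read in hex.

```php
<?php
// Assumption: this source file is UTF-8.
$text = '中'; // U+4E2D

// iconv: source encoding first, then target encoding, then the string.
$viaIconv = iconv('UTF-8', 'UTF-16BE', $text);

// mb_convert_encoding: the string first, then TARGET, then SOURCE.
$viaMb = mb_convert_encoding($text, 'UTF-16BE', 'UTF-8');

echo bin2hex($viaIconv), "\n"; // 4e2d
echo bin2hex($viaMb), "\n";    // 4e2d
```

Swapping the two encoding names by accident usually produces garbage or question marks rather than an error, so it is worth checking the order first when output looks wrong.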
That works now.
That is because your 鱼 does not exist in Big5.
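The point above can be checked per character. The sketch below assumes a UTF-8 source file and the mbstring extension; `unconvertible_chars` is a hypothetical helper name, and the round-trip test is just one simple way to detect characters that have no code in the target character set.

```php
<?php
// Hypothetical helper: list the characters of a UTF-8 string that do not
// survive a round trip through the target encoding.
function unconvertible_chars(string $utf8, string $target): array
{
    $missing = [];
    // Split into single characters; fall back to an empty list on bad UTF-8.
    $chars = preg_split('//u', $utf8, -1, PREG_SPLIT_NO_EMPTY) ?: [];
    foreach ($chars as $ch) {
        $encoded = mb_convert_encoding($ch, $target, 'UTF-8');
        $decoded = mb_convert_encoding($encoded, 'UTF-8', $target);
        if ($decoded !== $ch) {
            $missing[] = $ch; // replaced by a substitute such as "?"
        }
    }
    return $missing;
}

print_r(unconvertible_chars('鱼', 'BIG-5')); // Simplified form
print_r(unconvertible_chars('魚', 'BIG-5')); // Traditional form
print_r(unconvertible_chars('深圳市南山区', 'BIG-5'));
```

Running a check like this before converting shows which characters would be lost, instead of discovering a truncated or "?"-filled string afterwards.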
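If I want a program to do the conversion automatically, I still have to use iconv, so won't it drop characters then as well? Frustrating. I wonder whether mbstring can solve this problem.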
Where can I find a Chinese-language manual for mbstring?
Specifically, use mb_convert_encoding or mb_convert_variables.
http://ca3.php.net/manual/en/ref.mbstring.php
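As a rough illustration of the second function: mb_convert_variables converts one or more variables in place, including nested arrays, and returns the detected source encoding (or false on failure). The sketch assumes a UTF-8 source file and the mbstring extension; the variable names and sample data are made up for the example.

```php
<?php
// Assumption: this source file is UTF-8.
$title = '魚';
$row   = ['city' => '深圳', 'tags' => ['魚', 'abc']];

// Target encoding first, then source encoding, then the variables.
// The variables are passed by reference and converted in place.
$detected = mb_convert_variables('BIG-5', 'UTF-8', $title, $row);

var_dump($detected);                  // source encoding name, or false
echo bin2hex($title), "\n";           // Big5 bytes of the title
echo bin2hex($row['tags'][0]), "\n";  // nested values are converted too
```

mb_convert_encoding, by contrast, returns the converted string and leaves its argument untouched, so it tends to be the simpler choice for a single value.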
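Unfortunately, just like iconv, it cannot correctly convert "鱼" into the Traditional "魚"! It turns into "?" instead :(
Or does some expert have a ready-made class or function for this?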
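One way to approach this, sketched under assumptions: converting between encodings only re-encodes the same characters, while 鱼 and 魚 are different characters, so a Simplified-to-Traditional character mapping has to be applied before the Big5 conversion. In the sketch below the function name `simplified_to_traditional` and the two-entry table are hypothetical; a real table needs thousands of entries, and some Simplified characters map to more than one Traditional form (发 can be 發 or 髮), which a plain lookup cannot decide. The file is assumed to be UTF-8 with mbstring loaded.

```php
<?php
// Hypothetical, deliberately tiny mapping table for illustration only.
$map = [
    '鱼' => '魚',
    '区' => '區',
];

// Hypothetical helper: replace each Simplified character found in the table.
// strtr() works on bytes, which is safe here because both sides are UTF-8.
function simplified_to_traditional(string $utf8, array $map): string
{
    return strtr($utf8, $map);
}

// Step 1: map characters while still in UTF-8.
$traditional = simplified_to_traditional('深圳市南山区', $map);

// Step 2: only now change the encoding to Big5.
$big5 = mb_convert_encoding($traditional, 'BIG-5', 'UTF-8');

echo $traditional, "\n";
echo bin2hex($big5), "\n";
```

If the input is GBK, as in the original iconv call, it would first need to be converted to UTF-8 so that the table keys match.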