Inputting Rare Characters / 罕用字輸入
While inputting data, we sometimes encounter characters that cannot be typed with the most common input methods. These characters often belong to the Unicode CJK Unified Ideographs Extension A and Extension B blocks. Ways of solving this problem are listed below.
You can handwrite the character with a mouse or touchpad on this website.
This website contains the Kangxi Dictionary (康熙字典) and supports several ways of searching for characters.
Unifonts must be installed to view Unicode CJK Extension B characters.
Xiao Yao Bi must be installed to input Unicode CJK Extension B characters. Check the official website for details and updates: http://shurufa.ihanzi.cn/ (Note that the official website for this software is available only in Chinese.)
Installing the Cangjie platform (倉頡平台) lets you input Unicode CJK characters, including those in Extensions A, B, C, and D.
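Before installing fonts or input software, it can help to check which Unicode block a problem character actually belongs to. The following is a minimal Python sketch, not part of any tool mentioned here. The function name `cjk_block` is hypothetical, and the ranges are the block boundaries defined by the Unicode Standard for the basic CJK block and Extensions A through D only.

```python
# Block boundaries from the Unicode Standard (inclusive code point ranges).
# Only the blocks mentioned on this page are covered.
CJK_BLOCKS = [
    ("CJK Unified Ideographs", 0x4E00, 0x9FFF),
    ("Extension A", 0x3400, 0x4DBF),
    ("Extension B", 0x20000, 0x2A6DF),
    ("Extension C", 0x2A700, 0x2B73F),
    ("Extension D", 0x2B740, 0x2B81F),
]


def cjk_block(char):
    """Return the name of the CJK block containing char, or None if it is in none of them."""
    code_point = ord(char)
    for name, start, end in CJK_BLOCKS:
        if start <= code_point <= end:
            return name
    return None


if __name__ == "__main__":
    # Print the code point and block for each character in a sample string.
    for ch in "中㙲":
        print(ch, f"U+{ord(ch):04X}", cjk_block(ch))
```

A character reported as Extension B or later lies outside the Basic Multilingual Plane, which is why it needs the additional font and input support described above.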
If you can break the character into component parts, you can use this website to search for it. For example, to input the character "㙲", search for "土雍".
You can search for a character in several ways on this website (e.g., by pinyin or by number of strokes).
This tool, 實用漢字轉拼音 v4.8 (Practical Chinese Character to Pinyin Converter), runs only on Windows. It converts both traditional and simplified Chinese characters to pinyin. No installation is needed: unzip the RAR archive and double-click KTestpinyin.exe to run it. Please see this page for an explanation and screenshots of the software.
The interface of this software is in Simplified Chinese. On machines whose system locale is not Simplified Chinese, you can download the Microsoft AppLocale Utility to run the software in Simplified Chinese.
A known problem with the conversion concerns 曾. The software converts 曾 to "ceng", whereas 曾 should be "zeng" when it is a surname. Similar problems may occur with any Chinese character that has multiple pronunciations. For example, 都 can be read as dou or du; 給 as gei or ji; 車 as che or ju.
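One way to handle this after conversion is to flag ambiguous characters for manual review and to correct known surname readings. The following is a minimal Python sketch under stated assumptions: it is not part of the converter, the names `POLYPHONES`, `SURNAME_READINGS`, `flag_polyphones`, and `correct_surname` are hypothetical, the tables contain only the examples given above, and the surname is assumed to be the first character of the name.

```python
# Characters with more than one reading, taken from the examples above.
# A real table would be much larger.
POLYPHONES = {
    "曾": ("ceng", "zeng"),
    "都": ("dou", "du"),
    "給": ("gei", "ji"),
    "車": ("che", "ju"),
}

# Surname readings that differ from the converter's default output.
SURNAME_READINGS = {"曾": "zeng"}


def flag_polyphones(name):
    """Return (position, character, candidate readings) for each ambiguous character."""
    return [(i, ch, POLYPHONES[ch]) for i, ch in enumerate(name) if ch in POLYPHONES]


def correct_surname(name, syllables):
    """Return a copy of syllables with the first one replaced by the surname reading, if known.

    Assumes a single-character surname in first position.
    """
    corrected = list(syllables)
    if name and corrected and name[0] in SURNAME_READINGS:
        corrected[0] = SURNAME_READINGS[name[0]]
    return corrected


if __name__ == "__main__":
    name = "曾鞏"
    converted = ["ceng", "gong"]  # what the converter is described as producing for 曾
    print(flag_polyphones(name))
    print(correct_surname(name, converted))
```

The flagged list still needs a human decision for characters such as 都 or 車, since the correct reading depends on context that a lookup table cannot see.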
GUESS is software for the visual display of networks. Download and follow the instructions in README.TXT.
PAJEK 2.0, released on 25 October 2010, is Unicode compatible and can be used for both visualization and social network analysis, as can PAJEK 3.14 and later versions. For the latest version of Pajek, please go to the Pajek website.