- Apycom Encoding Table
- Character encoding tools
Encoding validator: used for cleansing corpora that include unexpected characters. Character encoding converter: can optionally emulate non-ASCII characters with ASCII strings.
- Character sets
- Character sets & encodings Tutorial
Tutorial on how to mark up XHTML, HTML and CSS pages with information about character encodings, and how to use character escapes.
- Code Set Overview
- Complex Scripts
FAQs on complex scripts: contextual shaping, character reordering, diacritics, and special justification and word-break rules.
- Computing with Foreign Symbols
- CSets: Supplemental Unicode Mapping Tables
The CSets collection is a set of mapping tables between various character sets and Unicode, intended to provide mappings not included in most character set conversion tools available today. The distribution originated in several projects that involved text encoded in many obscure character encodings. Many of these encodings are not supported by the most frequently used character set conversion tools (e.g. iconv), so this package was put together to provide the encoding information in a simple, consistent format. No program is provided to do the actual conversion between character sets, because of the wide variety of text file formats they appear in. It is up to the developer or user to write their own conversion programs using this data.
- Free foreign fonts
Links to websites where you can download fonts for many different alphabets and writing systems.
- GICAS: Grammatological Informatics based on Corpora of Asian Scripts
Asian scripts (most of the website is in Japanese).
- HTML Character Entities
- IANA charset registry
These are the official names for character sets that may be used in the Internet and may be referred to in Internet documentation.
ICU user guide to character set (and language) detection.
The International Components for Unicode (ICU) library user guide.
- Indic scripts
- International Register of Coded Character Sets
ISO/IEC International Register of Coded Character Sets To Be Used With Escape Sequences
- ISO 8859-1 vs CP-1252
- ISO country codes
Complete list of international country codes, following the ISO 3166 standard. Most of these are also top-level domains. The list shows the worldwide status of internet e-mail accessibility and connectivity.
- ISO-8859 tables
- Malayalam encoding converter
Developing free library support for Malayalam-English transliteration since 1998, facilitating communication in Malayalam over the internet.
- mozdev encoding converters
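The CSets entry above notes that the package ships mapping tables only and leaves the conversion program to the user. As a rough illustration of what such a program might look like, here is a minimal Python sketch. The table layout is an assumption (two whitespace-separated hexadecimal columns, source byte then Unicode code point, with `#` comments, similar to the mapping files published by the Unicode Consortium). The actual CSets file format is not described here and may differ, and the names `parse_mapping`, `decode` and `SAMPLE` are hypothetical.

```python
def parse_mapping(text):
    """Parse lines like '0x80  0x20AC  # EURO SIGN' into {byte: character}.

    Assumed format (not confirmed for CSets): hex byte, hex code point,
    optional '#' comment. Lines without a second column are skipped.
    """
    table = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        if len(fields) < 2:
            continue  # byte listed without a Unicode mapping
        table[int(fields[0], 16)] = chr(int(fields[1], 16))
    return table


def decode(data, table, fallback="\ufffd"):
    """Convert single-byte encoded data to a str, one byte at a time."""
    return "".join(table.get(byte, fallback) for byte in data)


# Hypothetical three-line table, for illustration only.
SAMPLE = """\
# byte   unicode
0x41\t0x0041\t# LATIN CAPITAL LETTER A
0x80\t0x20AC\t# EURO SIGN
0x81\t\t# undefined
"""

if __name__ == "__main__":
    table = parse_mapping(SAMPLE)
    print(decode(b"A\x80\x81", table))
```

Unmapped bytes are replaced with U+FFFD here rather than raising an error. That is a design choice for the sketch, and a stricter converter might prefer to fail loudly. Multi-byte encodings would need a different lookup strategy than the per-byte dictionary shown.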