One of the first and most universal encoding standards is ASCII. It provides tables in which the elements of two languages, byte and symbolic, are systematized. Letters, punctuation marks, and numbers are assigned individual numerical codes. For example, the uppercase letter “B” is assigned the code “066” in the ASCII standard. This code is then related to the binary system: “066” is 01000010 when written in zeros and ones. As a result, each identifier has its own symbol and its byte analogue.
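The mapping for the letter “B” can be checked with a few lines of Python. This is only a minimal sketch that uses the built-in functions `ord`, `chr` and `format`; the variable names are chosen here for illustration.

```python
# Look up the numeric code of a character and write it out in binary.
char = "B"
code = ord(char)             # numeric code assigned to the character
bits = format(code, "08b")   # the same code as eight binary digits

print(char, code, bits)      # B 66 01000010

# The mapping also works in the other direction: bits -> code -> symbol.
restored = chr(int(bits, 2))
print(restored)              # B
```

The leading zero in “066” is only padding; the value itself is the decimal number 66.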
The ASCII standard contains data on the most popular symbols and works to transmit text written in Latin letters. However, users of web resources, applications, software and other IT products are scattered around the world. Therefore, to encode all languages of humanity and any symbol that has ever been used, including emoticons, a standard with broader capabilities for storing symbols and their corresponding codes appeared: Unicode.
It is understood by most computers on the planet and by native speakers of the main world languages. Information converted through Unicode is stored using the UTF-8, UTF-16 or UTF-32 encoding.
Advantages and disadvantages
Unicode is a set of characters taken from all the world’s languages, glyphs, and emoji. The UTF family of encodings defines how a character is represented in binary. UTF-8 allows users to work in a multilingual environment that is compatible with global standards and accepted worldwide.
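To make the difference between the character set and its encodings more concrete, the sketch below encodes the same characters with UTF-8, UTF-16 and UTF-32 and counts the resulting bytes. The sample characters and the helper name `encoded_sizes` are assumptions made for this illustration; the little-endian codec variants are used so that no byte-order mark is added to the count.

```python
# Encode the same characters with three encodings of the UTF family.
samples = [
    "B",            # U+0042, a Latin letter
    "\u00e9",       # U+00E9, e with acute accent
    "\u20ac",       # U+20AC, euro sign
    "\U0001F600",   # U+1F600, grinning face emoji
]

def encoded_sizes(ch):
    """Return how many bytes ch occupies in each encoding."""
    return {
        "utf-8": len(ch.encode("utf-8")),
        "utf-16": len(ch.encode("utf-16-le")),  # -le: no byte-order mark
        "utf-32": len(ch.encode("utf-32-le")),  # -le: no byte-order mark
    }

for ch in samples:
    print(f"U+{ord(ch):04X}", encoded_sizes(ch))
```

UTF-8 uses one to four bytes per character, UTF-16 uses two or four, and UTF-32 always uses four, which is why the same text can have different sizes depending on the encoding chosen.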
Programming languages (PL) support and use encodings differently. Sometimes they can distort Unicode. Disadvantages of Unicode for different PLs and programs:
PHP. This programming language natively supports a 256-character set, i.e. it treats each byte in a string as one character. This happens even if the character takes up more than one byte. For example, a smiley can take up four bytes, and byte-oriented PHP functions such as strlen then count it as four characters instead of one. However, this can be fixed by using the multibyte string functions (the mbstring extension).
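The gap between counting bytes and counting characters can be reproduced outside PHP. The following is a Python analogue, not PHP code: the length of the UTF-8 bytes corresponds to what a byte-oriented function such as PHP's `strlen` reports, and the length of the string corresponds to what a multibyte function such as `mb_strlen` reports. The sample text is an assumption chosen for illustration.

```python
# Python analogue of byte-oriented vs. character-oriented length.
text = "Hi \U0001F600"          # "Hi " followed by the emoji U+1F600

raw = text.encode("utf-8")      # the bytes a byte-oriented function sees
byte_count = len(raw)           # 3 ASCII bytes + 4 bytes for the emoji
char_count = len(text)          # 4 characters (code points)

print(byte_count, char_count)   # 7 4
```

Any logic that cuts or pads strings by length has to use the same unit throughout; otherwise a multi-byte character can be split in the middle.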