Recurrence plot: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
en>Anrnusna
m References: journal name, replaced: The European Physical Journal - Special Topics → European Physical Journal ST using AWB
Removed superfluous second link to the same article.
 
Line 1: Line 1:
{{Citation style|date=September 2009}}<!-- please use <ref> -->
The writer is recognized by the name of Figures Lint. To gather cash is 1 of the things I love most. She is a librarian but she's usually needed her own business. Minnesota is where he's been living for years.<br><br>Stop by my web site [http://www.myprgenie.com/view-publication/weight-loss-getting-you-down-these-tips-can-help weight loss food delivery]
{{infobox character encoding
| name = Shift JIS
| mime = Shift_JIS
| alias =
| standard = JIS X 0208 Appendix 1
| lang = [[Japanese language|Japanese]]
| status =
| prev =
| next =
}}
'''Shift JIS''' ('''Shift Japanese Industrial Standards''', also '''SJIS''', [[MIME]] name '''Shift_JIS''') is a [[character encoding]] for the [[Japanese language]], originally developed by a [[Japan]]ese company called [[ASCII Corporation]] in conjunction with [[Microsoft]] and standardized as '''JIS X 0208 Appendix 1'''.
 
== Description ==
 
Shift JIS is based on character sets defined within [[Japanese Industrial Standards|JIS]] standards [[JIS X 0201]]:1997 (for the single-byte characters) and [[JIS X 0208]]:1997 (for the double byte characters). The lead bytes for the double byte characters are "shifted" around the 64 halfwidth [[katakana]] characters in the single-byte range [[JIS X 0201#Encoded Katakana|0xA1 to 0xDF]]. The single-byte characters [[Hexadecimal|0x]]00 to 0x7F match the [[ASCII]] encoding, except for a [[Japanese yen|yen]] sign (U+00A5) at 0x5C and an [[overline]] (U+203E) at 0x7E in place of the ASCII character set's backslash and tilde respectively.  The single-byte characters from 0xA1 to 0xDF map to the half-width katakana characters found in JIS X 0201. HTML written in Shift JIS can still be interpreted to some extent when incorrectly tagged as ASCII, since the important start and end of HTML tags and fields, <, >, /, " are coded by the same single bytes as in ASCII, not as two-byte sequences.
 
Shift JIS requires an [[8-bit clean]] medium for transmission. It is fully [[backward compatibility|backwards compatible]] with the legacy [[JIS X 0201]] [[single-byte encoding]], meaning it supports [[half-width katakana]] and that any valid JIS X 0201 string is also a valid Shift JIS string. For two-byte characters, however, Shift JIS only guarantees that the first byte will be high bit set (0x80–0xFF); the value of the second byte can be either high or low. Appearance of byte values 0x40–0x7E as second bytes of [[code word]]s makes reliable Shift JIS detection difficult, because same codes are used for ASCII characters. On the other hand, the competing 8-bit format [[Extended Unix Code#EUC-JP|EUC-JP]], which does not support single-byte halfwidth katakana, allows for a much cleaner and direct conversion to and from JIS X 0208 [[code point]]s, as all high bit set bytes are parts of a double-byte character and all codes from ASCII range represent single-byte characters.
 
For a double-byte JIS sequence <math>j_1 j_2</math>,<ref>''j''<sub>1</sub> and ''j''<sub>2</sub> are each in the range 33 to 126 inclusive (i.e., 7-bit character values excluding control characters (0&ndash;31 and 127) and space)</ref> the transformation to the corresponding Shift JIS bytes <math>s_1 s_2</math> is:
 
:<math>s_1 = \begin{cases} \left \lfloor \frac{j_1 + 1}{2} \right \rfloor + 112  & \mbox{if } 33 \le j_1 \le 94 \\
                          \left \lfloor \frac{j_1 + 1}{2} \right \rfloor + 176  & \mbox{if } 95 \le j_1 \le 126            \end{cases}</math>
:<math>s_2 = \begin{cases} j_2 + 31 + \left \lfloor \frac{j_2}{96} \right \rfloor & \mbox{if } j_1 \mbox{ is odd }\\
                          j_2 + 126 & \mbox{if } j_1 \mbox{ is even }
            \end{cases}</math>
 
== Multiple versions ==
Many different versions of Shift JIS exist.
 
There are two areas for expansion: Firstly, JIS X 0208 does not fill the whole 94×94 space encoded for it in Shift JIS, therefore there is room for more characters here — these are really extensions to JIS X 0208 rather than to Shift JIS itself. The most popular extension here is to the Windows-31J, otherwise known as [[Code page 932]], popularized by Microsoft, although Microsoft itself does not recognize the Windows-31J name and instead calls that variation "shift_jis". Secondly, Shift JIS has more encoding space than is needed, for JIS X 0201 and JIS X 0208 and this space can and is used for yet more characters. The space, with lead bytes 0xF5 to 0xF9, is used by Japanese [[mobile phone]] operators for [[Emoji|pictographs]] for use in [[E-mail]], for example. ([[KDDI]] goes further and defines hundreds more in the space with lead bytes 0xF3 and 0xF4).
 
Beyond even this, there have been numerous minor variations made on Shift JIS, with individual characters here and there altered. Most of these extensions and variants have no [[Internet Assigned Numbers Authority|IANA]] registration, so there is much scope for confusion, if the extensions are used. Microsoft Code Page 932 is registered separately from Shift JIS.
 
IBM [[CCSID]] 943 has the same extensions as Code Page 932. As with most code pages and encodings, it is recommended by Microsoft, Apple, the Unicode Consortium and most major operating system makers that [[Unicode]] be used instead.
 
== Shift JIS byte map ==
The chart below gives the detailed meaning of each byte in a Shift JIS encoded stream.
{{Shift-JIS byte map}}
 
== See also ==
* [[Japanese language and computers]]
* [[Mojibake]]
* [[Shift JIS art]]
* [[Code page 932|Microsoft code page 932]]
 
==References==
{{reflist}}
 
==External links==
* [http://www.rikai.com/library/kanjitables/kanji_codes.sjis.shtml Shift-JIS] A table of the non-ASCII part of the codeset.
* [http://www.microsoft.com/globaldev/reference/dbcs/932.htm Microsoft's definition of Code Page 932]
* Forms of Shift-JIS in ICU ([[International Components for Unicode]])
** [http://demo.icu-project.org/icu-bin/convexp?conv=ibm-942 ibm-942 (sjis78)]
** [http://demo.icu-project.org/icu-bin/convexp?conv=ibm-943 ibm-943 (Contains the \u00A5 ↔ \x5C mapping)]
** [http://demo.icu-project.org/icu-bin/convexp?conv=Shift_JIS Shift JIS (Contains the \u005C ↔ \x5C mapping)]
 
{{character encoding}}
 
{{DEFAULTSORT:Shift Jis}}
[[Category:Encodings of Japanese]]

Latest revision as of 03:54, 27 October 2014

The writer is recognized by the name of Figures Lint. To gather cash is 1 of the things I love most. She is a librarian but she's usually needed her own business. Minnesota is where he's been living for years.

Stop by my web site weight loss food delivery