Java: Summary - Character
Character class static methods
Character Class Methods | ||
Character class is used mostly for static methods to test char values. | ||
b = | Character.isDigit(c) | true if c is digit character. |
b = | Character.isLetter(c) | true if c is letter character. |
b = | Character.isLetterOrDigit(c) | true if c is letter or digit. |
b = | Character.isLowerCase(c) | true if c is lowercase char. |
b = | Character.isUpperCase(c) | true if c is uppercase char. |
b = | Character.isWhitespace(c) | true if c is space, tab, .... |
c = | Character.toLowerCase(c) | Lowercase version of c. |
c = | Character.toUpperCase(c) | Uppercase version of c. |
ANSI/ASCII and Extended Latin Sections of Unicode
Unicode attempts to represent the characters in all current human languages, as well as numerous special symbols. The most common implmentation of it uses 16 bits, which is 65,536 characters (many are not yet assigned a graphic). The first 128 codes are identical to ANSI/ASCII (American National Standards Institute / American Standard Code for Information Interchange). Of the ASCII codes, the first 32 are control codes. The first 256 codes are the same as ISO-8859-1 (Latin-1), which includes ASCII of course. Below is a table which shows this common first part of the Unicode character set. The table below is in HTML, so the actual characters that are displayed are determined by your browser, not Java.
+0 | +1 | +2 | +3 | +4 | +5 | +6 | +7 | +8 | +9 | +10 | +11 | +12 | +13 | +14 | +15 | |
32 | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / | |
48 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
64 | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
80 | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
96 | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
112 | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | |
128 | | | | | | | | | | | | | | | | |
144 | | | | | | | | | | | | | | | | |
160 | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | | ® | ¯ | |
176 | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
192 | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
208 | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
224 | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
240 | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
An applet (written in old Java 1 in a rather unique style) that displays the Unicode characters can be found at mindprod.com/jgloss/reuters.html. For more information on Unicode, see www.alanwood.net/unicode/.