Java: Summary - Character

Character class static methods

Character Class Methods
Character class is used mostly for static methods to test char values.
b = Character.isDigit(c) true if c is digit character.
b = Character.isLetter(c) true if c is letter character.
b = Character.isLetterOrDigit(c) true if c is letter or digit.
b = Character.isLowerCase(c) true if c is lowercase char.
b = Character.isUpperCase(c) true if c is uppercase char.
b = Character.isWhitespace(c) true if c is space, tab, ....
c = Character.toLowerCase(c) Lowercase version of c.
c = Character.toUpperCase(c) Uppercase version of c.

ANSI/ASCII and Extended Latin Sections of Unicode

Unicode attempts to represent the characters in all current human languages, as well as numerous special symbols. The most common implmentation of it uses 16 bits, which is 65,536 characters (many are not yet assigned a graphic). The first 128 codes are identical to ANSI/ASCII (American National Standards Institute / American Standard Code for Information Interchange). Of the ASCII codes, the first 32 are control codes. The first 256 codes are the same as ISO-8859-1 (Latin-1), which includes ASCII of course. Below is a table which shows this common first part of the Unicode character set. The table below is in HTML, so the actual characters that are displayed are determined by your browser, not Java.

+0+1+2+3+4+5+6+7+8+9+10+11+12+13+14+15
32 !"#$%&'()*+,-./
480123456789:;<=>?
64@ABCDEFGHIJKLMNO
80PQRSTUVWXYZ[\]^_
96`abcdefghijklmno
112pqrstuvwxyz{|}~
128ƒˆŠŒŽ
144˜šœžŸ
160 ¡¢£¤¥¦§¨©ª«¬­®¯
176°±²³´µ·¸¹º»¼½¾¿
192ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏ
208ÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞß
224àáâãäåæçèéêëìíîï
240ðñòóôõö÷øùúûüýþÿ

An applet (written in old Java 1 in a rather unique style) that displays the Unicode characters can be found at mindprod.com/jgloss/reuters.html. For more information on Unicode, see www.alanwood.net/unicode/.


Copyleft 2005 Fred Swartz