Contents | Prev | Next | Index | Java Language Specification Third Edition |
CHAPTER 4
The Java programming language is a strongly typed language, which means that every variable and every expression has a type that is known at compile time. Types limit the values that a variable (§4.12) can hold or that an expression can produce, limit the operations supported on those values, and determine the meaning of the operations. Strong typing helps detect errors at compile time.
The types of the Java programming language are divided into two categories: primitive types and reference types. The primitive types (§4.2) are the boolean
type and the numeric types. The numeric types are the integral types byte
, short
, int
, long
, and char
, and the floating-point types float
and double
. The reference types (§4.3) are class types, interface types, and array types. There is also a special null type. An object (§4.3.1) is a dynamically created instance of a class type or a dynamically created array. The values of a reference type are references to objects. All objects, including arrays, support the methods of class Object
(§4.3.2). String literals are represented by String
objects (§4.3.3).
Types exist at compile-time. Some types correspond to classes and interfaces, which exist at run-time. The correspondence between types and classes or interfaces is incomplete for two reasons:
A consequence of (2) is the possibility of heap pollution (§4.12.2.1). Under certain conditions, it is possible that a variable of a parameterized type refers to an object that is not of that parameterized type. The variable will always refer to an object that is an instance of a class that implements the parameterized type. See (§4.12.2) for further discussion.
There is also a special null type, the type of the expressionType: PrimitiveType ReferenceType
null
, which has no name. Because the null type has no name, it is impossible to declare a variable of the null type or to cast to the null type. The null reference is the only possible value of an expression of null type. The null reference can always be cast to any reference type. In practice, the programmer can ignore the null type and just pretend that null
is merely a special literal that can be of any reference type.
Primitive values do not share state with other primitive values. A variable whose type is a primitive type always holds a primitive value of that same type. The value of a variable of primitive type can be changed only by assignment operations on that variable (including increment (§15.14.2, §15.15.1) and decrement (§15.14.3, §15.15.2) operators).PrimitiveType: NumericType boolean NumericType: IntegralType FloatingPointType IntegralType: one of byte short int long char FloatingPointType: one of float double
The numeric types are the integral types and the floating-point types.
The integral types are byte
, short
, int
, and long
, whose values are 8-bit, 16-bit, 32-bit and 64-bit signed two's-complement integers, respectively, and char
, whose values are 16-bit unsigned integers representing UTF-16 code units (§3.1).
The floating-point types are float
, whose values include the 32-bit IEEE 754 floating-point numbers, and double
, whose values include the 64-bit IEEE 754 floating-point numbers.
The boolean
type has exactly two values: true
and false.
byte
, from -128 to 127, inclusive
short
, from -32768 to 32767, inclusive
int
, from -2147483648 to 2147483647, inclusive
long
, from -9223372036854775808 to 9223372036854775807, inclusive
char
, from '\u0000'
to '\uffff'
inclusive, that is, from 0 to 65535
boolean
:
<
, <=
, >
, and >=
(§15.20.1)
==
and !=
(§15.21.1)
int
or long
:
+
and -
(§15.15.3, §15.15.4)
*
, /
, and %
(§15.17)
+
and -
(§15.18)
++
, both prefix (§15.15.1) and postfix (§15.14.2)
--
, both prefix (§15.15.2) and postfix (§15.14.3)
<<
, >>
, and >>>
(§15.19)
~
(§15.15.5)
&
, |
, and ^
(§15.22.1)
? :
(§15.25)
+
(§15.18.1), which, when given a String
operand and an integral operand, will convert the integral operand to a String
representing its value in decimal form, and then produce a newly created String
that is the concatenation of the two strings
Byte
, Short
, Integer
, Long
, and Character
.
If an integer operator other than a shift operator has at least one operand of type long
, then the operation is carried out using 64-bit precision, and the result of the numerical operator is of type long
. If the other operand is not long
, it is first widened (§5.1.5) to type long
by numeric promotion (§5.6). Otherwise, the operation is carried out using 32-bit precision, and the result of the numerical operator is of type int
. If either operand is not an int
, it is first widened to type int
by numeric promotion.
The built-in integer operators do not indicate overflow or underflow in any way. Integer operators can throw a NullPointerException
if unboxing conversion (§5.1.8) of a null reference is required. Other than that, the only integer operators that can throw an exception (§11) are the integer divide operator /
(§15.17.2) and the integer remainder operator %
(§15.17.3), which throw an ArithmeticException
if the right-hand operand is zero, and the increment and decrement operators ++
(§15.15.1, §15.15.2) and --
(§15.14.3, §15.14.2), which can throw an OutOfMemoryError
if boxing conversion (§5.1.7) is required and there is not sufficient memory available to perform the conversion.
The example:
produces the output:class Test { public static void main(String[] args) { int i = 1000000; System.out.println(i * i); long l = i; System.out.println(l * l); System.out.println(20296 / (l - i)); } }
and then encounters an-727379968 1000000000000
ArithmeticException
in the division by l
-
i
, because l
-
i
is zero. The first multiplication is performed in 32-bit precision, whereas the second multiplication is a long
multiplication. The value -727379968
is the decimal value of the low 32 bits of the mathematical result, 1000000000000
, which is a value too large for type int
.
Any value of any integral type may be cast to or from any numeric type. There are no casts between integral types and the type boolean
.
float
and double
, which are conceptually associated with the single-precision 32-bit and double-precision 64-bit format IEEE 754 values and operations as specified in IEEE Standard for Binary Floating-Point Arithmetic, ANSI/IEEE Standard 754-1985 (IEEE, New York).
The IEEE 754 standard includes not only positive and negative numbers that consist of a sign and magnitude, but also positive and negative zeros, positive and negative infinities, and special Not-a-Number values (hereafter abbreviated NaN). A NaN value is used to represent the result of certain invalid operations such as dividing zero by zero. NaN constants of both float
and double
type are predefined as Float.NaN
and Double.NaN
.
Every implementation of the Java programming language is required to support two standard sets of floating-point values, called the float value set and the double value set. In addition, an implementation of the Java programming language may support either or both of two extended-exponent floating-point value sets, called the float-extended-exponent value set and the double-extended-exponent value set. These extended-exponent value sets may, under certain circumstances, be used instead of the standard value sets to represent the values of expressions of type float
or double
(§5.1.13, §15.4).
The finite nonzero values of any floating-point value set can all be expressed in the form s · m · 2(e-N+1), where s is +1 or -1, m is a positive integer less than 2N, and e is an integer between Emin = -(2K-1-2) and Emax = 2K-1-1, inclusive, and where N and K are parameters that depend on the value set. Some values can be represented in this form in more than one way; for example, supposing that a value v in a value set might be represented in this form using certain values for s, m, and e, then if it happened that m were even and e were less than 2K-1, one could halve m and increase e by 1 to produce a second representation for the same value v. A representation in this form is called normalized if m 2(N-1); otherwise the representation is said to be denormalized. If a value in a value set cannot be represented in such a way that m 2(N-1), then the value is said to be a denormalized value, because it has no normalized representation.
The constraints on the parameters N and K (and on the derived parameters Emin and Emax) for the two required and two optional floating-point value sets are summarized in Table 4.1.
Parameter | float | float-extended-exponent | double |
double-extended-exponent
N |
24 |
24 |
53 |
53 |
K |
8 |
11 |
11 |
15 |
Emax |
+127 |
+1023 |
+1023 |
+16383 |
Emin |
-126 |
-1022 |
-1022 |
-16382 |
|
---|
Where one or both extended-exponent value sets are supported by an implementation, then for each supported extended-exponent value set there is a specific implementation-dependent constant K, whose value is constrained by Table 4.1; this value K in turn dictates the values for Emin and Emax.
Each of the four value sets includes not only the finite nonzero values that are ascribed to it above, but also NaN values and the four values positive zero, negative zero, positive infinity, and negative infinity.
Note that the constraints in Table 4.1 are designed so that every element of the float value set is necessarily also an element of the float-extended-exponent value set, the double value set, and the double-extended-exponent value set. Likewise, each element of the double value set is necessarily also an element of the double-extended-exponent value set. Each extended-exponent value set has a larger range of exponent values than the corresponding standard value set, but does not have more precision.
The elements of the float value set are exactly the values that can be represented using the single floating-point format defined in the IEEE 754 standard. The elements of the double value set are exactly the values that can be represented using the double floating-point format defined in the IEEE 754 standard. Note, however, that the elements of the float-extended-exponent and double-extended-exponent value sets defined here do not correspond to the values that can be represented using IEEE 754 single extended and double extended formats, respectively.
The float, float-extended-exponent, double, and double-extended-exponent value sets are not types. It is always correct for an implementation of the Java programming language to use an element of the float value set to represent a value of type float
; however, it may be permissible in certain regions of code for an implementation to use an element of the float-extended-exponent value set instead. Similarly, it is always correct for an implementation to use an element of the double value set to represent a value of type double
; however, it may be permissible in certain regions of code for an implementation to use an element of the double-extended-exponent value set instead.
Except for NaN, floating-point values are ordered; arranged from smallest to largest, they are negative infinity, negative finite nonzero values, positive and negative zero, positive finite nonzero values, and positive infinity.
IEEE 754 allows multiple distinct NaN values for each of its single and double floating-point formats. While each hardware architecture returns a particular bit pattern for NaN when a new NaN is generated, a programmer can also create NaNs with different bit patterns to encode, for example, retrospective diagnostic information.
For the most part, the Java platform treats NaN values of a given type as though collapsed into a single canonical value (and hence this specification normally refers to an arbitrary NaN as though to a canonical value). However, version 1.3 the Java platform introduced methods enabling the programmer to distinguish between NaN values: the Float.floatToRawIntBits
and Double.doubleToRawLongBits
methods. The interested reader is referred to the specifications for the Float
and Double
classes for more information.
Positive zero and negative zero compare equal; thus the result of the expression 0.0==-0.0
is true
and the result of 0.0>-0.0
is false
. But other operations can distinguish positive and negative zero; for example, 1.0/0.0
has the value positive infinity, while the value of 1.0/-0.0
is negative infinity.
NaN is unordered, so the numerical comparison operators <
, <=
, >
, and >=
return false
if either or both operands are NaN (§15.20.1). The equality operator ==
returns false
if either operand is NaN, and the inequality operator !=
returns true
if either operand is NaN (§15.21.1). In particular, x!=x
is true
if and only if x
is NaN, and (x<y)
==
!(x>=y)
will be false
if x
or y
is NaN.
Any value of a floating-point type may be cast to or from any numeric type. There are no casts between floating-point types and the type boolean
.
boolean
:
<
, <=
, >
, and >=
(§15.20.1)
==
and !=
(§15.21.1)
float
or double
:
+
and -
(§15.15.3, §15.15.4)
*
, /
, and %
(§15.17)
+
and -
(§15.18.2)
++
, both prefix (§15.15.1) and postfix (§15.14.2)
--
, both prefix (§15.15.2) and postfix (§15.14.3)
? :
(§15.25)
+
(§15.18.1), which, when given a String
operand and a floating-point operand, will convert the floating-point operand to a String
representing its value in decimal form (without information loss), and then produce a newly created String
by concatenating the two strings
Float
, Double
, and Math
.If at least one of the operands to a binary operator is of floating-point type, then the operation is a floating-point operation, even if the other is integral.
If at least one of the operands to a numerical operator is of type double
, then the operation is carried out using 64-bit floating-point arithmetic, and the result of the numerical operator is a value of type double
. (If the other operand is not a double
, it is first widened to type double
by numeric promotion (§5.6).) Otherwise, the operation is carried out using 32-bit floating-point arithmetic, and the result of the numerical operator is a value of type float.
If the other operand is not a float
, it is first widened to type float
by numeric promotion.
Operators on floating-point numbers behave as specified by IEEE 754 (with the exception of the remainder operator (§15.17.3)). In particular, the Java programming language requires support of IEEE 754 denormalized floating-point numbers and gradual underflow, which make it easier to prove desirable properties of particular numerical algorithms. Floating-point operations do not "flush to zero" if the calculated result is a denormalized number.
The Java programming language requires that floating-point arithmetic behave as if every floating-point operator rounded its floating-point result to the result precision. Inexact results must be rounded to the representable value nearest to the infinitely precise result; if the two nearest representable values are equally near, the one with its least significant bit zero is chosen. This is the IEEE 754 standard's default rounding mode known as round to nearest.
The language uses round toward zero when converting a floating value to an integer (§5.1.3), which acts, in this case, as though the number were truncated, discarding the mantissa bits. Rounding toward zero chooses at its result the format's value closest to and no greater in magnitude than the infinitely precise result.
Floating-point operators can throw a NullPointerException
if unboxing conversion (§5.1.8) of a null reference is required. Other than that, the only floating-point operators that can throw an exception (§11) are the increment and decrement operators ++
(§15.15.1, §15.15.2) and --
(§15.14.3, §15.14.2), which can throw an OutOfMemoryError
if boxing conversion (§5.1.7) is required and there is not sufficient memory available to perform the conversion.
An operation that overflows produces a signed infinity, an operation that underflows produces a denormalized value or a signed zero, and an operation that has no mathematically definite result produces NaN. All numeric operations with NaN as an operand produce NaN as a result. As has already been described, NaN is unordered, so a numeric comparison operation involving one or two NaNs returns false
and any !=
comparison involving NaN returns true
, including x!=x
when x
is NaN.
The example program:
produces the output:class Test { public static void main(String[] args) { // An example of overflow: double d = 1e308; System.out.print("overflow produces infinity: "); System.out.println(d + "*10==" + d*10); // An example of gradual underflow: d = 1e-305 * Math.PI; System.out.print("gradual underflow: " + d + "\n "); for (int i = 0; i < 4; i++) System.out.print(" " + (d /= 100000)); System.out.println(); // An example of NaN: System.out.print("0.0/0.0 is Not-a-Number: "); d = 0.0/0.0; System.out.println(d); // An example of inexact results and rounding: System.out.print("inexact results with float:"); for (int i = 0; i < 100; i++) { float z = 1.0f / i; if (z * i != 1.0f) System.out.print(" " + i); } System.out.println(); // Another example of inexact results and rounding: System.out.print("inexact results with double:"); for (int i = 0; i < 100; i++) { double z = 1.0 / i; if (z * i != 1.0) System.out.print(" " + i); } System.out.println(); // An example of cast to integer rounding: System.out.print("cast to int rounds toward 0: "); d = 12345.6; System.out.println((int)d + " " + (int)(-d)); } }
overflow produces infinity: 1.0e+308*10==Infinity gradual underflow: 3.141592653589793E-305 3.1415926535898E-310 3.141592653E-315 3.142E-320 0.0 0.0/0.0 is Not-a-Number: NaN inexact results with float: 0 41 47 55 61 82 83 94 97 inexact results with double: 0 49 98 cast to int rounds toward 0: 12345 -12345
This example demonstrates, among other things, that gradual underflow can result in a gradual loss of precision.
The results when i
is 0
involve division by zero, so that z
becomes positive infinity, and z
*
0
is NaN, which is not equal to 1.0
.
boolean
type represents a logical quantity with two possible values, indicated by the literals true
and false
(§3.10.3). The boolean operators are:
==
and !=
(§15.21.2)
!
(§15.15.6)
&
, ^
, and |
(§15.22.2)
&&
(§15.23) and ||
(§15.24)
? :
(§15.25)
+
(§15.18.1), which, when given a String
operand and a boolean operand, will convert the boolean operand to a String
(either "true"
or "false"
), and then produce a newly created String
that is the concatenation of the two strings
if
statement (§14.9)
while
statement (§14.12)
do
statement (§14.13)
for
statement (§14.14)
boolean
expression also determines which subexpression is evaluated in the conditional ? :
operator (§15.25).
Only boolean
or Boolean
expressions can be used in control flow statements and as the first operand of the conditional operator ? :
. An integer x
can be converted to a boolean
, following the C language convention that any nonzero value is true
, by the expression x!=0
. An object reference obj
can be converted to a boolean
, following the C language convention that any reference other than null
is true
, by the expression obj!=null
.
A cast of a boolean
value to type boolean
or Boolean
is allowed (§5.1.1); no other casts on type boolean
are allowed. A boolean
can be converted to a string by string conversion (§5.4).
ReferenceType: ClassOrInterfaceType TypeVariable ArrayType ClassOrInterfaceType: ClassType InterfaceType ClassType: TypeDeclSpecifier TypeArgumentsopt InterfaceType: TypeDeclSpecifier TypeArgumentsopt TypeDeclSpecifier: TypeName ClassOrInterfaceType . Identifier
A class or interface type consists of a type declaration specifier, optionally followed by type arguments (in which case it is a parameterized type). Type arguments are described in (§4.5.1).TypeName: Identifier TypeName . Identifier TypeVariable: Identifier ArrayType: Type [ ]
A type declaration specifier may be either a type name (§6.5.5), or a class or interface type followed by "." and an identifier. In the latter case, the specifier has the form T.id, where id must be the simple name of an accessible (§6.6) member type ( §8.5, §9.5) of T, or a compile-time error occurs. The specifier denotes that member type.
The sample code:
declares a class typeclass Point { int[] metrics; } interface Move { void move(int deltax, int deltay); }
Point
, an interface type Move
, and uses an array type int[]
(an array of int
) to declare the field metrics
of the class Point
.The reference values (often just references) are pointers to these objects, and a special null reference, which refers to no object.
A class instance is explicitly created by a class instance creation expression (§15.9). An array is explicitly created by an array creation expression (§15.10).
A new class instance is implicitly created when the string concatenation operator + (§15.18.1) is used in a non-constant (§15.28) expression, resulting in a new object of type String
(§4.3.3). A new array object is implicitly created when an array initializer expression (§10.6) is evaluated; this can occur when a class or interface is initialized (§12.4), when a new instance of a class is created (§15.9), or when a local variable declaration statement is executed (§14.4). New objects of the types Boolean, Byte, Short, Character, Integer, Long, Float and Double may be implicitly created by boxing conversion (§5.1.7).
Many of these cases are illustrated in the following example:
which produces the output:class Point { int x, y; Point() { System.out.println("default"); } Point(int x, int y) { this.x = x; this.y = y; } // A Point instance is explicitly created at class initialization time: static Point origin = new Point(0,0); // A String can be implicitly created by a + operator: public String toString() { return "(" + x + "," + y + ")"; } } class Test { public static void main(String[] args) { // A Point is explicitly created using newInstance: Point p = null; try { p = (Point)Class.forName("Point").newInstance(); } catch (Exception e) { System.out.println(e); } // An array is implicitly created by an array constructor: Point a[] = { new Point(0,0), new Point(1,1) }; // Strings are implicitly created by + operators: System.out.println("p: " + p); System.out.println("a: { " + a[0] + ", " + a[1] + " }"); // An array is explicitly created by an array creation expression: String sa[] = new String[2]; sa[0] = "he"; sa[1] = "llo"; System.out.println(sa[0] + sa[1]); } }
The operators on references to objects are:default p: (0,0) a: { (0,0), (1,1) } hello
+
(§15.18.1), which, when given a String
operand and a reference, will convert the reference to a String
by invoking the toString
method of the referenced object (using "null"
if either the reference or the result of toString
is a null reference), and then will produce a newly created String
that is the concatenation of the two strings
instanceof
operator (§15.20.2)
==
and !=
(§15.21.3)
? :
(§15.25).
The example program:
produces the output:class Value { int val; } class Test { public static void main(String[] args) { int i1 = 3; int i2 = i1; i2 = 4; System.out.print("i1==" + i1); System.out.println(" but i2==" + i2); Value v1 = new Value(); v1.val = 5; Value v2 = v1; v2.val = 6; System.out.print("v1.val==" + v1.val); System.out.println(" and v2.val==" + v2.val); } }
becausei1==3 but i2==4 v1.val==6 and v2.val==6
v1.val
and v2.val
reference the same instance variable (§4.12.3) in the one Value
object created by the only new
expression, while i1
and i2
are different variables.See §10 and §15.10 for examples of the creation and use of arrays.
Each object has an associated lock (§17.1), which is used bysynchronized
methods (§8.4.3) and the synchronized
statement (§14.19) to provide control over concurrent access to state by multiple threads (§17).Object
is a superclass (§8.1) of all other classes. A variable of type Object
can hold a reference to the null reference or to any object, whether it is an instance of a class or an array (§10). All class and array types inherit the methods of class Object
, which are summarized here:
The members ofpackage java.lang; public class Object { public final Class<?> getClass() { . . . } public String toString() { . . . } public boolean equals(Object obj) { . . . } public int hashCode() { . . . } protected Object clone() throws CloneNotSupportedException { . . . } public final void wait() throws IllegalMonitorStateException, InterruptedException { . . . } public final void wait(long millis) throws IllegalMonitorStateException, InterruptedException { . . . } public final void wait(long millis, int nanos) { . . . } throws IllegalMonitorStateException, InterruptedException { . . . } public final void notify() { . . . } throws IllegalMonitorStateException public final void notifyAll() { . . . } throws IllegalMonitorStateException protected void finalize() throws Throwable { . . . } }
Object
are as follows:
getClass
returns the Class
object that represents the class of the object. A Class
object exists for each reference type. It can be used, for example, to discover the fully qualified name of a class, its members, its immediate superclass, and any interfaces that it implements. A class method that is declared synchronized
(§8.4.3.6) synchronizes on the lock associated with the Class
object of the class. The method Object.getClass() must be treated specially by a Java compiler. The type of a method invocation e.getClass(), where the expression e has the static type T, is Class<? extends |T|>.
toString
returns a String
representation of the object.
equals
and hashCode
are very useful in hashtables such as java.util.Hashtable
. The method equals
defines a notion of object equality, which is based on value, not reference, comparison.
clone
is used to make a duplicate of an object.
wait
, notify
, and notifyAll
are used in concurrent programming using threads, as described in §17.
finalize
is run just before an object is destroyed and is described in §12.6.
String
represent sequences of Unicode characters. A String
object has a constant (unchanging) value. String literals (§3.10.5) are references to instances of class String
.
The string concatenation operator +
(§15.18.1) implicitly creates a new String
object when the result is not a compile-time constant (§15.28).
At run time, several reference types with the same binary name may be loaded simultaneously by different class loaders. These types may or may not represent the same type declaration. Even if two such types do represent the same type declaration, they are considered distinct.
Two reference types are the same run-time type if:
Type variables have an optional bound, T & I1 ... In. The bound consists of either a type variable, or a class or interface type T possibly followed by further interface types I1 , ..., In. If no bound is given for a type variable,TypeParameter: TypeVariable TypeBoundopt TypeBound: extends ClassOrInterfaceType AdditionalBoundListopt AdditionalBoundList: AdditionalBound AdditionalBoundList AdditionalBound AdditionalBound: & InterfaceType
Object
is assumed. It is a compile-time error if any of the types I1 ... In is a class type or type variable. The erasures (§4.6) of all constituent types of a bound must be pairwise different, or a compile-time error occurs. The order of types in a bound is only significant in that the erasure of a type variable is determined by the first type in its bound, and that a class type or type variable may only appear in the first position. A type variable may not at the same time be a subtype of two interface types which are different parameterizations of the same generic interface.
See section §6.3 for the rules defining the scope of type variables.The members of a type variable X with bound T & I1 ... In are the members of the intersection type (§4.9) T & I1 ... In appearing at the point where the type variable is declared.
Discussion
The following example illustrates what members a type variable has.
The type variable T has the same members as the intersection type C & I, which in turn has the same members as the empty class CT, defined in the same scope with equivalent supertypes. The members of an interface are always public, and therefore always inherited (unless overridden). Hence mI is a member of CT and of T. Among the members of C, all but mCPrivate are inherited by CT, and are therefore members of both CT and T.package TypeVarMembers; class C { void mCDefault() {} public void mCPublic() {} private void mCPrivate() {} protected void mCProtected() {} } class CT extends C implements I {} interface I { void mI(); } <T extends C & I> void test(T t) { t.mI(); // OK t.mCDefault(); // OK t.mCPublic(); // OK t.mCPrivate(); // compile-time error t.mCProtected(); // OK } }
If C had been declared in a different package than T, then the call to mCDefault would give rise to a compile-time error, as that member would not be accessible at the point where T is declared.
Let P = G<T1, ..., Tn> be a parameterized type. It must be the case that, after P is subjected to capture conversion (§5.1.10) resulting in the type G<X1, ..., Xn>, for each actual type argument Xi, 1in , Xi <: Bi[A1 := X1, ..., An := Xn] (§4.10), or a compile time error occurs.
Discussion
Example: Parameterized types.
Vector<String> Seq<Seq<A>> Seq<String>.Zipper<Integer> Collection<Integer> Pair<String,String> // Vector<int> -- illegal, primitive types cannot be arguments // Pair<String> -- illegal, not enough arguments // Pair<String,String,String> -- illegal, too many arguments
Two parameterized types are provably distinct if either of the following conditions hold:
TypeArguments: < ActualTypeArgumentList > ActualTypeArgumentList: ActualTypeArgument ActualTypeArgumentList , ActualTypeArgument
ActualTypeArgument: ReferenceType Wildcard
Wildcard: ? WildcardBoundsOpt WildcardBounds: extends ReferenceType super ReferenceType
Discussion
Examples
void printCollection(Collection<?> c) { // a wildcard collection for (Object o : c) { System.out.println(o); } }
Note that using Collection<Object> as the type of the incoming parameter, c, would not be nearly as useful; the method could only be used with an actual parameter that had type Collection<Object>, which would be quite rare. In contrast, the use of an unbounded wildcard allows any kind of collection to be used as a parameter.
Discussion
Example - Wildcard parameterized types as component types of array types.public Method getMethod(Class<?>[] parameterTypes) { ... }
, where B is the bound.? extends B
Discussion
Example: Bounded wildcards.boolean addAll(Collection<? extends E> c)
Here, the method is declared within the interface Collection<E>, and is designed to add all the elements of its incoming argument to the collection upon which it is invoked. A natural tendency would be to use Collection<E> as the type of c, but this is unnecessarily restrictive. An alternative would be to declare the method itself to be generic:
<T> boolean addAll(Collection<T> c)
This version is sufficiently flexible, but note that the type parameter is used only once in the signature. This reflects the fact that the type parameter is not being used to express any kind of interdependency between the type(s) of the argument(s), the return type and/or throws type. In the absence of such interdependency, generic methods are considered bad style, and wildcards are preferred.
Unlike ordinary type variables declared in a method signature, no type inference is required when using a wildcard. Consequently, it is permissible to declare lower bounds on a wildcard, using the syntax:
, where B is a lower bound.? super B
Discussion
Example: Lower bounds on wildcards.
Here, the referent can be inserted into any queue whose element type is a super type of the type T of the referent.Reference(T referent, ReferenceQueue<? super T> queue);
Two type arguments are provably distinct if neither of the arguments is a type variable or wildcard, and the two arguments are not the same type.
Discussion
The relationship of wildcards to established type theory is an interesting one, which we briefly allude to here.Wildcards are a restricted form of existential types. Given a generic type declaration G<T extends B>, G<?> is roughly analogous to Some X <: B. G<X>.
Readers interested in a more comprehensive discussion should refer to On Variance-Based Subtyping for Parametric Types by Atsushi Igarashi and Mirko Viroli, in the proceedings of the 16th European Conference on Object Oriented Programming (ECOOP 2002).
Wildcards differ in certain details from the constructs described in the aforementioned paper, in particular in the use of capture conversion (§5.1.10) ratther than the close operation described by Igarashi and Viroli. For a formal account of wildcards, see Wild FJ by Mads Torgersen, Erik Ernst and Christian Plesner Hansen, in the 12th workshop on Foundations of Object Oriented Programming (FOOL 2005).
Historically, wildcards are a direct descendant of the work by Atsushi Igarashi and Mirko Viroli. This work itself builds upon earlier work by Kresten Thorup and Mads Torgersen ("Unifying Genericity", ECOOP 99), as well as a long tradition of work on declaration based variance that goes back to Pierre America's work on POOL (OOPSLA 89)
Discussion
This is of no consequence, as it is impossible to access a member of a parameterized type without performing capture conversion (§5.1.10), and it is impossible to use a wildcard type after the keyword new in a class instance creation expression
Discussion
The decision not to make all generic types reifiable is one of the most crucial, and controversial design decisions involving the language's type system.
Ultimately, the most important motivation for this decision is compatibility with existing code.
Naively, the addition of new constructs such as genericity has no implications for pre-existing code. The programming language per se, is compatible with earlier versions as long as every program written in the previous versions retains its meaning in the new version. However, this notion, which may be termed language compatibility, is of purely theoretical interest. Real programs (even trivial ones, such as "Hello World") are composed of several compilation units, some of which are provided by the Java platform (such as elements of java.lang
or java.util
).
In practice then, the minimum requirement is platform compatibillity - that any program written for the prior version of the platform continues to function unchanged in the new platform.
One way to provide platform compatibillity is to leave existing platform functionality unchanged, only adding new functionality. For example, rather than modify the existing Collections hierarchy in java.util
, one might introduce a new library utilizing genericity.
The disadvantages of such a scheme is that it is extremely difficult for pre-existing clients of the Collection library to migrate to the new library. Collections are used to exchange data between independently developed modules; if a vendor decides to switch to the new, generic, library, that vendor must also distribute two versions of their code, to be compatible with their clients. Libraries that are dependent on other vendors code cannot be modified to use genericity until the supplier's library is updated. If two modules are mutually dependent, the changes must be made simultaneously.
Clearly, platform compatibility, as outlined above, does not provide a realistic path for adoption of a pervasive new feature such as genericity. Therefore, the design of the generic type system seeks to support migration compatibility. Migration compatibiliy allows the evolution of existing code to take advantage of generics without imposing dependencies between independently developed software modules.
The price of migration compatibility is that a full and sound reification of the generic type system is not possible, at least while the migration is taking place.
More precisely, a raw type is define to be either:
Discussion
The latter point may not be immediately self evident. Presenting for your consideration, then, the following example:
The type of the member(s) ofclass Outer<T>{ T t; class Inner { T setOuterT(T t1) {t = t1;return t;} } }
Inner
depends on the type parameter of Outer
. If Outer
is raw, Inner
must be treated as raw as well, as their is no valid binding for T
.
This rule applies only to type members that are not inherited. Inherited type members that depend on type variables will be inherited as raw types as a consequence of the rule that the supertypes of a raw type are erased, described later in this section.
Discussion
Another implication of the rules above is that a generic inner class of a raw type can itself only be used as a raw type:
it is not possible to accessclass Outer<T>{ class Inner<S> { S s; } }
Inner
as partially raw type (a "rare" type)
becauseOuter.Inner<Double> x = null; // illegal Double d = x.s;
Outer
itself is raw, so are all its inner classes, including Inner
, and so it is not possible to pass any type parameters to it.
The use of raw types is allowed only as a concession to compatibility of legacy code. The use of raw types in code written after the introduction of genericity into the Java programming language is strongly discouraged. It is possible that future versions of the Java programming language will disallow the use of raw types.
It is a compile-time error to attempt to use a type member of a parameterized type as a raw type.
Discussion
This means that the ban on "rare" types extends to the case where the qualifying type is parameterized, but we attempt to use the inner class as a raw type:
This is the opposite of the case we discussed above. There is no practical justification for this half baked type. In legacy code, no type parameters are used. In non-legacy code, we should use the generic types correctly and pass all the required actual type parameters.Outer<Integer>.Inner x = null; // illegal
Discussion
Variables of a raw type can be assigned from values of any of the type's parametric instances.
For instance, it is possible to assign a Vector<String>
to a Vector
, based on the subtyping rules (§4.10.2).
The reverse assignment from Vector
to Vector<String>
is unsafe (since the raw vector might have had a different element type), but is still permitted using unchecked conversion (§5.1.9) in order to enable interfacing with legacy code. In this case, a compiler will issue an unchecked warning.
The type of a constructor (§8.8), instance method (§8.8, §9.4), or non-static field (§8.3) M of a raw type C that is not inherited from its superclasses or superinterfaces is the erasure of its type in the generic declaration corresponding to C. The type of a static member of a raw type C is the same as its type in the generic declaration corresponding to C.
It is a compile-time error to pass actual type parameters to a non-static type member of a raw type that is not inherited from its superclasses or superinterfaces.
To make sure that potential violations of the typing rules are always flagged, some accesses to members of a raw type will result in warning messages. The rules for generating warnings when accessing members or constructors of raw types are as follows:
The supertype of a class may be a raw type. Member accesses for the class are treated as normal, and member accesses for the supertype are treated as for raw types. In the constructor of the class, calls to super are treated as method calls on a raw type.
Discussion
Example: Raw types.
class Cell<E> E value; Cell (E v) { value=v; } A get() { return value; } void set(E v) { value=v; } } Cell x = new Cell<String>("abc"); x.value; // OK, has type Object x.get(); // OK, has type Object x.set("def"); // unchecked warning
Discussion
For example,
import java.util.*; class NonGeneric { Collection<Number> myNumbers(){return null;} } abstract class RawMembers<T> extends NonGeneric implements Collection<String> { static Collection<NonGeneric> cng = new ArrayList<NonGeneric>(); public static void main(String[] args) { RawMembers rw = null; Collection<Number> cn = rw.myNumbers(); // ok Iterator<String> is = rw.iterator(); // unchecked warning Collection<NonGeneric> cnn = rw.cng; // ok - static member } }
RawMembers<T>
inherits the method
from theIterator<String> iterator()
Collection<String>
superinterface. However, the type RawMembers
inherits iterator()
from the erasure of its superinterface, which means that the return type of the member iterator()
is the erasure of Iterator<<String>, Iterator
. As a result, the attempt to assign to rw.iterator()
requires an unchecked conversion (§5.1.9) from Iterator
to Iterator<String>
, causing an unchecked warning to be issued.
In contrast, the static member cng
retains its full parameterized type even when accessed through a object of raw type (note that access to a static member through an instance is considered bad style and is to be discouraged). The member myNumbers
is inherited from the NonGeneric
(whose erasure is also NonGeneric
) and so retains its full parameterized type.
Discussion
Raw types are closly related to wildcards. Both are based on existential types. Raw types can be thought of as wildcards whose type rules are deliberately unsound, to accommodate interaction with legacy code.
Historically, raw types preceded wildcards; they were first introduced in GJ, and described in the paper Making the future safe for the past: Adding Genericity to the Java Programming Language by Gilad Bracha, Martin Odersky, David Stoutamire, and Philip Wadler, in Proc. of the ACM Conf. on Object-Oriented Programming, Systems, Languages and Applications, (OOPSLA 98) October 1998.
The members of an intersection type T1 & ... & Tn are determined as follows:
Discussion
It is worth dwelling upon the distinction between intersection types and the bounds of type variables. Every type variable bound induces an intersection type. This intersection type is often trivial (i.e., consists of a single type).The form of a bound is restricted (only the first element may be a class or type variable, and only one type variable may appear in the bound) to preclude certain awkward situations coming into existence. However, capture conversion can lead to the creation of type variables whose bounds are more general (e.g., array types).
The subtypes of a type T are all types U such that T is a supertype of U, and the null type. We write T <: S to indicate that that the subtype relation holds between types T and S. T is a proper subtype of S, written T < S, if T <:S and S T. T is a direct subtype of S, written T <1 S, if S >1 T.
Subtyping does not extend through generic types: T <: U does not imply that C<T> <: C<U>.
double >1 float
float >1 long
long >1 int
int >1 char
int >1 short
short >1 byte
Given a type declaration for C<F1,...,Fn>, the direct supertypes of the parameterized type (§4.5) C<F1,...,Fn> are all of the following:
Object
, if C is an interface type with no direct superinterfaces.
C<X1,...,Xn> is the result of applying capture conversion (§5.1.10) to C<R1,...,Rn>.
The direct supertypes of an intersection type (§4.9) T1 & ... & Tn, are Ti, 1in.
The direct supertypes of a type variable (§4.4) are the types listed in its bound.
The direct supertypes of the null type are all reference types other than the null type itself.
In addition to the above rules, a type variable is a direct supertype of its lower bound.
Object >
1 Object[]
Cloneable >
1 Object[]
java.io.Serializable >
1 Object[]
The following code fragment contains one or more instances of most kinds of usage of a type:
import java.util.Random; class MiscMath<T extends Number>{ int divisor; MiscMath(int divisor) { this.divisor = divisor; } float ratio(long l) { try { l /= divisor; } catch (Exception e) { if (e instanceof ArithmeticException) l = Long.MAX_VALUE; else l = 0; } return (float)l; } double gausser() { Random r = new Random(); double[] val = new double[2]; val[0] = r.nextGaussian(); val[1] = r.nextGaussian(); return (val[0] + val[1]) / 2;
} Collection<Number> fromArray(Number[] na) { Collection<Number> cn = new ArrayList<Number>(); for (Number n : na) { cn.add(n) } return cn; } void <S> loop(S s){ this.<S>loop(s);}
In this example, types are used in declarations of the following:}
Random
, imported from the type java.util.Random
of the package java.util
, is declared
divisor
in the class MiscMath
is declared to be of type int
l
of the method ratio
is declared to be of type long
ratio
is declared to be of type float
, and the result of the method gausser
is declared to be of type double
MiscMath
is declared to be of type int
r
and val
of the method gausser
are declared to be of types Random
and double[]
(array of double
)
e
of the catch
clause is declared to be of type Exception
T
has Number
as its declared bound.
r
of method gausser
is initialized by a class instance creation expression that uses the type Random
Number
is used as a type argument in the expression new ArrayList<Number>()
val
of method gausser
is initialized by an array creation expression that creates an array of double
with size 2
loop
calls itself with an explicit type argument S
return
statement of the method ratio
uses the float
type in a cast
instanceof
operator (§15.20.2); here the instanceof
operator tests whether e
is assignment compatible with the type ArithmeticException
.
Types are also used as arguments to parameterized types; here the type Number
is used as an argument in the parameterized type Collection<Number>
.++
(increment) or --
(decrement) operator (§15.14.2, §15.14.3, §15.15.1, §15.15.2).Compatibility of the value of a variable with its type is guaranteed by the design of the Java programming language, as long as a program does not give rise to unchecked warnings (§4.12.2.1). Default values are compatible (§4.12.5) and all assignments to a variable are checked for assignment compatibility (§5.2), usually at compile time, but, in a single case involving arrays, a run-time check is made (§10.10).
Discussion
Note that a variable is not guaranteed to always refer to a subtype of its declared type, but only to subclasses or subinterfaces of the declared type. This is due to the possibility of heap pollution discussed below.
If T is a primitive type, then a variable of type "array of T" can hold a null reference or a reference to any array of type "array of T"; if T is a reference type, then a variable of type "array of T" can hold a null reference or a reference to any array of type "array of S" such that type S is a subclass or subinterface of type T. In addition, a variable of type Object[]
can hold an array of any reference type. A variable of type Object
can hold a null reference or a reference to any object, whether class instance or array.
Discussion
For example, the code:
gives rise to an unchecked warning, because it is not possible to ascertain, either at compile-time (within the limits of the compile-time type checking rules) or at run-time, whether the variable l does indeed refer to aList l = new ArrayList<Number>(); List<String> ls = l; // unchecked warning
List<String>
.
If the code above is executed, heap pollution arises, as the variable ls, declared to be a List<String>
, refers to a value that is not in fact a List<String>
.
The problem cannot be identified at run-time because type variables are not reified, and thus instances do not carry any information at run-time regarding the actual type parameters used to create them.
In a simple example as given above, it may appear that it should be straightforward to identify the situation at compile-time and give a compilation error. However, in the general (and typical) case, the value of the variable l may be the result of an invocation of a separately compiled method, or its value may depend upon arbitrary control flow.
The code above is therefore very atypical, and indeed very bad style.
Assignment from a value of a raw type to a variable of a parameterized type should only be used when combining legacy code which does not make use of parameterized types with more modern code that does.
If no operation that requires an unchecked warning to be issued takes place, heap pollution cannot occur. Note that this does not imply that heap pollution only occurs if an unchecked warning actually occurred. It is possible to run a program where some of the binaries were compiled by a compiler for an older version of the Java programming language, or by a compiler that allows the unchecked warnings to suppressed. This practice is unhealthy at best.
Conversely, it is possible that despite executing code that could (and perhaps did) give rise to an unchecked warning, no heap pollution takes place. Indeed, good programming practice requires that the programmer satisfy herself that despite any unchecked warning, the code is correct and heap pollution will not occur.
The variable will always refer to an object that is an instance of a class that implements the parameterized type.
Discussion
For instance, the value ofl
in the example above is always a List
.
static
within a class declaration (§8.3.1.1), or with or without the keyword static
within an interface declaration (§9.3). A class variable is created when its class or interface is prepared (§12.3.2) and is initialized to a default value (§4.12.5). The class variable effectively ceases to exist when its class or interface is unloaded (§12.7).
static
(§8.3.1.1). If a class T has a field a that is an instance variable, then a new instance variable a is created and initialized to a default value (§4.12.5) as part of each newly created object of class T or of any class that is a subclass of T (§8.1.4). The instance variable effectively ceases to exist when the object of which it is a field is no longer referenced, after any necessary finalization of the object (§12.6) has been completed.
catch
clause of a try
statement (§14.20). The new variable is initialized with the actual object associated with the exception (§11.3, §14.18). The exception-handler parameter effectively ceases to exist when execution of the block associated with the catch
clause is complete.
for
statement (§14.14), a new variable is created for each local variable declared in a local variable declaration statement immediately contained within that block or for
statement. A local variable declaration statement may contain an expression which initializes the variable. The local variable with an initializing expression is not initialized, however, until the local variable declaration statement that declares it is executed. (The rules of definite assignment (§16) prevent the value of a local variable from being used before it has been initialized or otherwise assigned a value.) The local variable effectively ceases to exist when the execution of the block or for
statement is complete.
Were it not for one exceptional situation, a local variable could always be regarded as being created when its local variable declaration statement is executed. The exceptional situation involves the switch
statement (§14.11), where it is possible for control to enter a block but bypass execution of a local variable declaration statement. Because of the restrictions imposed by the rules of definite assignment (§16), however, the local variable declared by such a bypassed local variable declaration statement cannot be used before it has been definitely assigned a value by an assignment expression (§15.26).
The following example contains several different kinds of variables:
class Point { static int numPoints; // numPoints is a class variable int x, y; // x and y are instance variables int[] w = new int[10]; // w[0] is an array component int setX(int x) { // x is a method parameter int oldx = this.x; // oldx is a local variable this.x = x; return oldx; } }
final
. A final variable may only be assigned to once. It is a compile time error if a final variable is assigned to unless it is definitely unassigned (§16) immediately prior to the assignment.A blank final is a final variable whose declaration lacks an initializer.
Once a final
variable has been assigned, it always contains the same value. If a final
variable holds a reference to an object, then the state of the object may be changed by operations on the object, but the variable will always refer to the same object. This applies also to arrays, because arrays are objects; if a final
variable holds a reference to an array, then the components of the array may be changed by operations on the array, but the variable will always refer to the same array.
Declaring a variable final
can serve as useful documentation that its value will not change and can help avoid programming errors.
In the example:
the classclass Point { int x, y; int useCount; Point(int x, int y) { this.x = x; this.y = y; } final static Point origin = new Point(0, 0); }
Point
declares a final class variable origin
. The origin
variable holds a reference to an object that is an instance of class Point
whose coordinates are (0, 0). The value of the variable Point.origin
can never change, so it always refers to the same Point
object, the one created by its initializer. However, an operation on this Point
object might change its state-for example, modifying its useCount
or even, misleadingly, its x
or y
coordinate.
We call a variable, of primitive type or type String
, that is final and initialized with a compile-time constant expression (§15.28) a constant variable. Whether a variable is a constant variable or not may have implications with respect to class initialization (§12.4.1), binary compatibility (§13.1, §13.4.9) and definite assignment (§16).
byte
, the default value is zero, that is, the value of (byte)0
.
short
, the default value is zero, that is, the value of (short)0
.
int
, the default value is zero, that is, 0
.
long
, the default value is zero, that is, 0L
.
float
, the default value is positive zero, that is, 0.0f
.
double
, the default value is positive zero, that is, 0.0d
.
char
, the default value is the null character, that is, '\u0000'
.
boolean
, the default value is false
.
null
.
prints:class Point { static int npoints; int x, y; Point root; } class Test { public static void main(String[] args) { System.out.println("npoints=" + Point.npoints); Point p = new Point(); System.out.println("p.x=" + p.x + ", p.y=" + p.y); System.out.println("p.root=" + p.root); } }
illustrating the default initialization ofnpoints=0 p.x=0, p.y=0 p.root=null
npoints
, which occurs when the class Point
is prepared (§12.3.2), and the default initialization of x
, y
, and root
, which occurs when a new Point
is instantiated. See §12 for a full description of all aspects of loading, linking, and initialization of classes and interfaces, plus a description of the instantiation of classes to make new class instances.
Every object belongs to some particular class: the class that was mentioned in the creation expression that produced the object, the class whose Class
object was used to invoke a reflective method to produce the object, or the String
class for objects implicitly created by the string concatenation operator +
(§15.18.1). This class is called the class of the object. (Arrays also have a class, as described at the end of this section.) An object is said to be an instance of its class and of all superclasses of its class.
Sometimes a variable or expression is said to have a "run-time type". This refers to the class of the object referred to by the value of the variable or expression at run time, assuming that the value is not null
.
The compile time type of a variable is always declared, and the compile time type of an expression can be deduced at compile time. The compile time type limits the possible values that the variable can hold or the expression can produce at run time. If a run-time value is a reference that is not null
, it refers to an object or array that has a class, and that class will necessarily be compatible with the compile-time type.
Even though a variable or expression may have a compile-time type that is an interface type, there are no instances of interfaces. A variable or expression whose type is an interface type can reference any object whose class implements (§8.1.5) that interface.
Here is an example of creating new objects and of the distinction between the type of a variable and the class of an object:
In this example:public interface Colorable { void setColor(byte r, byte g, byte b); } class Point { int x, y; } class ColoredPoint extends Point implements Colorable { byte r, g, b; public void setColor(byte rv, byte gv, byte bv) { r = rv; g = gv; b = bv; } } class Test { public static void main(String[] args) { Point p = new Point(); ColoredPoint cp = new ColoredPoint(); p = cp; Colorable c = cp; } }
p
of the method main
of class Test
has type Point
and is initially assigned a reference to a new instance of class Point
.
cp
similarly has as its type ColoredPoint
, and is initially assigned a reference to a new instance of class ColoredPoint
.
cp
to the variable p
causes p
to hold a reference to a ColoredPoint
object. This is permitted because ColoredPoint
is a subclass of Point
, so the class ColoredPoint
is assignment compatible (§5.2) with the type Point
. A ColoredPoint
object includes support for all the methods of a Point
. In addition to its particular fields r
, g
, and b
, it has the fields of class Point
, namely x
and y
.
c
has as its type the interface type Colorable
, so it can hold a reference to any object whose class implements Colorable
; specifically, it can hold a reference to a ColoredPoint
.
Discussion
Note that an expression such asnew Colorable()
is not valid because it is not possible to create an instance of an interface, only of a class.
getClass
, when invoked for an array object, will return a class object (of class Class
) that represents the class of the array. The classes for arrays have strange names that are not valid identifiers; for example, the class for an array of int
components has the name "[I
" and so the value of the expression:
is the stringnew int[10].getClass().getName()
"[I"
; see the specification of Class.getName
for details.
Contents | Prev | Next | Index | Java Language Specification Third Edition |
Copyright © 1996-2005 Sun Microsystems, Inc.
All rights reserved
Please send any comments or corrections via our feedback form