Generics in Java

Last updated

Generics are a facility of generic programming that were added to the Java programming language in 2004 within version J2SE 5.0. They were designed to extend Java's type system to allow "a type or method to operate on objects of various types while providing compile-time type safety". [1] The aspect compile-time type safety required that parametrically polymorphic functions are not implemented in the Java virtual machine, since type safety is impossible in this case. [2] [3]

Contents

The Java collections framework supports generics to specify the type of objects stored in a collection instance.

In 1998, Gilad Bracha, Martin Odersky, David Stoutamire and Philip Wadler created Generic Java, an extension to the Java language to support generic types. [4] Generic Java was incorporated in Java with the addition of wildcards.

Hierarchy and classification

According to Java Language Specification: [5]

Motivation

The following block of Java code illustrates a problem that exists when not using generics. First, it declares an ArrayList of type Object . Then, it adds a String to the ArrayList. Finally, it attempts to retrieve the added String and cast it to an Integer—an error in logic, as it is not generally possible to cast an arbitrary string to an integer.

finalListv=newArrayList();v.add("test");// A String that cannot be cast to an IntegerfinalIntegeri=(Integer)v.get(0);// Run time error

Although the code is compiled without error, it throws a runtime exception (java.lang.ClassCastException) when executing the third line of code. This type of logic error can be detected during compile time by using generics [7] and is the primary motivation for using them. [6] It defines one or more type variables that act as parameters.

The above code fragment can be rewritten using generics as follows:

finalList<String>v=newArrayList<String>();v.add("test");finalIntegeri=(Integer)v.get(0);// (type error)  compilation-time error

The type parameter String within the angle brackets declares the ArrayList to be constituted of String (a descendant of the ArrayList's generic Object constituents). With generics, it is no longer necessary to cast the third line to any particular type, because the result of v.get(0) is defined as String by the code generated by the compiler.

The logical flaw in the third line of this fragment will be detected as a compile-time error (with J2SE 5.0 or later) because the compiler will detect that v.get(0) returns String instead of Integer. [7] For a more elaborate example, see reference. [9]

Here is a small excerpt from the definition of the interfaces java.util.List and java.util.Iterator in package java.util :

interfaceList<E>{voidadd(Ex);Iterator<E>iterator();}interfaceIterator<E>{Enext();booleanhasNext();}

Generic class definitions

Here is an example of a generic Java class, which can be used to represent individual entries (key to value mappings) in a map:

publicclassEntry<KeyType,ValueType>{privatefinalKeyTypekey;privatefinalValueTypevalue;publicEntry(KeyTypekey,ValueTypevalue){this.key=key;this.value=value;}publicKeyTypegetKey(){returnkey;}publicValueTypegetValue(){returnvalue;}publicStringtoString(){return"("+key+", "+value+")";}}

This generic class could be used in the following ways, for example:

finalEntry<String,String>grade=newEntry<String,String>("Mike","A");finalEntry<String,Integer>mark=newEntry<String,Integer>("Mike",100);System.out.println("grade: "+grade);System.out.println("mark: "+mark);finalEntry<Integer,Boolean>prime=newEntry<Integer,Boolean>(13,true);if(prime.getValue()){System.out.println(prime.getKey()+" is prime.");}else{System.out.println(prime.getKey()+" is not prime.");}

It outputs:

grade: (Mike, A) mark: (Mike, 100) 13 is prime. 

Generic method definitions

Here is an example of a generic method using the generic class above:

publicstatic<Type>Entry<Type,Type>twice(Typevalue){returnnewEntry<Type,Type>(value,value);}

Note: If we remove the first <Type> in the above method, we will get compilation error (cannot find symbol "Type"), since it represents the declaration of the symbol.

In many cases, the user of the method need not indicate the type parameters, as they can be inferred:

finalEntry<String,String>pair=Entry.twice("Hello");

The parameters can be explicitly added if needed:

finalEntry<String,String>pair=Entry.<String>twice("Hello");

The use of primitive types is not allowed, and boxed versions must be used instead:

finalEntry<int,int>pair;// Fails compilation. Use Integer instead.

There is also the possibility to create generic methods based on given parameters.

public<Type>Type[]toArray(Type...elements){returnelements;}

In such cases you can't use primitive types either, e.g.:

Integer[]array=toArray(1,2,3,4,5,6);

Diamond operator

Thanks to type inference, Java SE 7 and above allow the programmer to substitute an empty pair of angle brackets (<>, called the diamond operator) for a pair of angle brackets containing the one or more type parameters that a sufficiently close context implies. [10] Thus, the above code example using Entry can be rewritten as:

finalEntry<String,String>grade=newEntry<>("Mike","A");finalEntry<String,Integer>mark=newEntry<>("Mike",100);System.out.println("grade: "+grade);System.out.println("mark: "+mark);finalEntry<Integer,Boolean>prime=newEntry<>(13,true);if(prime.getValue())System.out.println(prime.getKey()+" is prime.");elseSystem.out.println(prime.getKey()+" is not prime.");

Type wildcards

A type argument for a parameterized type is not limited to a concrete class or interface. Java allows the use of "type wildcards" to serve as type arguments for parameterized types. Wildcards are type arguments in the form "<?>"; optionally with an upper or lower bound. Given that the exact type represented by a wildcard is unknown, restrictions are placed on the type of methods that may be called on an object that uses parameterized types.

Here is an example where the element type of a Collection<E> is parameterized by a wildcard:

finalCollection<?>c=newArrayList<String>();c.add(newObject());// compile-time errorc.add(null);// allowed

Since we don't know what the element type of c stands for, we cannot add objects to it. The add() method takes arguments of type E, the element type of the Collection<E> generic interface. When the actual type argument is ?, it stands for some unknown type. Any method argument value we pass to the add() method would have to be a subtype of this unknown type. Since we don't know what type that is, we cannot pass anything in. The sole exception is null; which is a member of every type. [11]

To specify the upper bound of a type wildcard, the extends keyword is used to indicate that the type argument is a subtype of the bounding class. [12] So List<?extendsNumber> means that the given list contains objects of some unknown type which extends the Number class. For example, the list could be List<Float> or List<Number>. Reading an element from the list will return a Number. Adding null elements is, again, also allowed. [13]

The use of wildcards above adds flexibility [12] since there is not any inheritance relationship between any two parameterized types with concrete type as type argument. Neither List<Number> nor List<Integer> is a subtype of the other; even though Integer is a subtype of Number. [12] So, any method that takes List<Number> as a parameter does not accept an argument of List<Integer>. If it did, it would be possible to insert a Number that is not an Integer into it; which violates type safety. Here is an example that demonstrates how type safety would be violated if List<Integer> were a subtype of List<Number>:

finalList<Integer>ints=newArrayList<>();ints.add(2);finalList<Number>nums=ints;// valid if List<Integer> were a subtype of List<Number> according to substitution rule. nums.add(3.14);finalIntegerx=ints.get(1);// now 3.14 is assigned to an Integer variable!

The solution with wildcards works because it disallows operations that would violate type safety:

finalList<?extendsNumber>nums=ints;// OKnums.add(3.14);// compile-time errornums.add(null);// allowed

To specify the lower bounding class of a type wildcard, the super keyword is used. This keyword indicates that the type argument is a supertype of the bounding class. So, List<?superNumber> could represent List<Number> or List<Object>. Reading from a list defined as List<?superNumber> returns elements of type Object. Adding to such a list requires either elements of type Number, any subtype of Number or null (which is a member of every type).

The mnemonic PECS (Producer Extends, Consumer Super) from the book Effective Java by Joshua Bloch gives an easy way to remember when to use wildcards (corresponding to covariance and contravariance) in Java. [12]

Generics in throws clause

Although exceptions themselves cannot be generic, generic parameters can appear in a throws clause:

public<TextendsThrowable>voidthrowMeConditional(booleanconditional,Texception)throwsT{if(conditional){throwexception;}}

Problems with type erasure

Generics are checked at compile-time for type-correctness. [7] The generic type information is then removed in a process called type erasure. [6] For example, List<Integer> will be converted to the non-generic type List, which ordinarily contains arbitrary objects. The compile-time check guarantees that the resulting code uses the correct type. [7]

Because of type erasure, type parameters cannot be determined at run-time. [6] For example, when an ArrayList is examined at runtime, there is no general way to determine whether, before type erasure, it was an ArrayList<Integer> or an ArrayList<Float>. Many people are dissatisfied with this restriction. [14] There are partial approaches. For example, individual elements may be examined to determine the type they belong to; for example, if an ArrayList contains an Integer, that ArrayList may have been parameterized with Integer (however, it may have been parameterized with any parent of Integer, such as Number or Object).

Demonstrating this point, the following code outputs "Equal":

finalList<Integer>li=newArrayList<>();finalList<Float>lf=newArrayList<>();if(li.getClass()==lf.getClass()){// evaluates to trueSystem.out.println("Equal");}

Another effect of type erasure is that a generic class cannot extend the Throwable class in any way, directly or indirectly: [15]

publicclassGenericException<T>extendsException

The reason why this is not supported is due to type erasure:

try{thrownewGenericException<Integer>();}catch(GenericException<Integer>e){System.err.println("Integer");}catch(GenericException<String>e){System.err.println("String");}

Due to type erasure, the runtime will not know which catch block to execute, so this is prohibited by the compiler.

Java generics differ from C++ templates. Java generics generate only one compiled version of a generic class or function regardless of the number of parameterizing types used. Furthermore, the Java run-time environment does not need to know which parameterized type is used because the type information is validated at compile-time and is not included in the compiled code. Consequently, instantiating a Java class of a parameterized type is impossible because instantiation requires a call to a constructor, which is unavailable if the type is unknown.

For example, the following code cannot be compiled:

<T>TinstantiateElementType(List<T>arg){returnnewT();//causes a compile error}

Because there is only one copy per generic class at runtime, static variables are shared among all the instances of the class, regardless of their type parameter. Consequently, the type parameter cannot be used in the declaration of static variables or in static methods.

Type erasure was implemented in Java to maintain backward compatibility with programs written prior to Java SE5. [7]

Differences from Arrays

There are several important differences between arrays (both primitive arrays and Object arrays), and generics in Java. Two of the major differences, namely, differences in terms of variance and reification.

Covariance, contravariance and invariance

Generics are invariant, whereas arrays are covariant. [6] This is a benefit of using generic when compared to non-generic objects such as arrays. [6] Specifically, generics can help prevent run time exceptions by throwing a compile-time exception to force the developer to fix the code.

For example, if a developer declares an Object[] object and instantiates the object as a new Long[] object, no compile-time exception is thrown (since arrays are covariant). [6] This may give the false impression that the code is correctly written. However, if the developer attempts to add a String to this Long[] object, the program will throw an ArrayStoreException. [6] This run-time exception can be completely avoided if the developer uses generics.

If the developer declares a Collection<Object> object an creates a new instance of this object with return type ArrayList<Long>, the Java compiler will (correctly) throw a compile-time exception to indicate the presence of incompatible types (since generics are invariant). [6] Hence, this avoids potential run-time exceptions. This problem can be fixed by creating an instance of Collection<Object> using ArrayList<Object> object instead. For code using Java SE7 or later versions, the Collection<Object> can be instantiated with an ArrayList<> object using the diamond operator

Reification

Arrays are reified, meaning that an array object enforces its type information at run-time, whereas generics in Java are not reified. [6]

More formally speaking, objects with generic type in Java are non-reifiable types. [6] A non-reifiable type is type whose representation at run-time has less information than its representation at compile-time. [6]

Objects with generic type in Java are non-reifiable due to type erasure. [6] Java only enforces type information at compile-time. After the type information is verified at compile-time, the type information is discarded, and at run-time, the type information will not be available. [6]

Examples of non-reifiable types include List<T> and List<String>, where T is a generic formal parameter. [6]

Project on generics

Project Valhalla is an experimental project to incubate improved Java generics and language features, for future versions potentially from Java 10 onwards. Potential enhancements include: [16]

See also

Citations

  1. Java Programming Language
  2. A ClassCastException can be thrown even in the absence of casts or nulls. "Java and Scala's Type Systems are Unsound" (PDF).
  3. Bloch 2018, pp. 123–125, Chapter §5 Item 27: Eliminate unchecked warnings.
  4. GJ: Generic Java
  5. Java Language Specification, Third Edition by James Gosling, Bill Joy, Guy Steele, Gilad Bracha – Prentice Hall PTR 2005
  6. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Bloch 2018, pp. 126–129, Chapter §5 Item 28: Prefer lists to arrays.
  7. 1 2 3 4 5 6 7 8 Bloch 2018, pp. 117–122, Chapter §5 Item 26: Don't use raw types.
  8. Bloch 2018, pp. 135–138, Chapter §5 Item 30: Favor generic methods.
  9. Gilad Bracha (July 5, 2004). "Generics in the Java Programming Language" (PDF). www.oracle.com.
  10. "Type Inference for Generic Instance Creation".
  11. Gilad Bracha (July 5, 2004). "Generics in the Java Programming Language" (PDF). www.oracle.com. p. 5.
  12. 1 2 3 4 Bloch 2018, pp. 139–145, Chapter §5 Item 31: Use bounded wildcards to increase API flexibility.
  13. Bracha, Gilad. "Wildcards > Bonus > Generics". The Java™ Tutorials. Oracle. ...The sole exception is null, which is a member of every type...
  14. Gafter, Neal (2006-11-05). "Reified Generics for Java" . Retrieved 2010-04-20.
  15. "Java Language Specification, Section 8.1.2". Oracle. Retrieved 24 October 2015.
  16. Goetz, Brian. "Welcome to Valhalla!". OpenJDK mail archive. OpenJDK. Retrieved 12 August 2014.

Related Research Articles

Generic programming is a style of computer programming in which algorithms are written in terms of data types to-be-specified-later that are then instantiated when needed for specific types provided as parameters. This approach, pioneered by the ML programming language in 1973, permits writing common functions or types that differ only in the set of types on which they operate when used, thus reducing duplicate code.

In programming language theory and type theory, polymorphism is the use of a single symbol to represent multiple different types.

In computer programming, a parameter or a formal argument is a special kind of variable used in a subroutine to refer to one of the pieces of data provided as input to the subroutine. These pieces of data are the values of the arguments with which the subroutine is going to be called/invoked. An ordered list of parameters is usually included in the definition of a subroutine, so that, each time the subroutine is called, its arguments for that call are evaluated, and the resulting values can be assigned to the corresponding parameters.

In computer science, boxing is the transformation of placing a primitive type within an object so that the value can be used as a reference. Unboxing is the reverse transformation of extracting the primitive value from its wrapper object. Autoboxing is the term for automatically applying boxing and/or unboxing transformations as needed.

This article compares two programming languages: C# with Java. While the focus of this article is mainly the languages and their features, such a comparison will necessarily also consider some features of platforms and libraries. For a more detailed comparison of the platforms, see Comparison of the Java and .NET platforms.

<span class="mw-page-title-main">Java syntax</span> Set of rules defining correctly structured program

The syntax of Java is the set of rules defining how a Java program is written and interpreted.

Many programming language type systems support subtyping. For instance, if the type Cat is a subtype of Animal, then an expression of type Cat should be substitutable wherever an expression of type Animal is used.

TestNG is a testing framework for the Java programming language created by Cedric_Beust and inspired by JUnit and NUnit. The design goal of TestNG is to cover a wider range of test categories: unit, functional, end-to-end, integration, etc., with more powerful and easy-to-use functionalities.

<span class="mw-page-title-main">Scala (programming language)</span> General-purpose programming language

Scala is a strong statically typed high-level general-purpose programming language that supports both object-oriented programming and functional programming. Designed to be concise, many of Scala's design decisions are intended to address criticisms of Java.

In the Java computer programming language, an annotation is a form of syntactic metadata that can be added to Java source code. Classes, methods, variables, parameters and Java packages may be annotated. Like Javadoc tags, Java annotations can be read from source files. Unlike Javadoc tags, Java annotations can also be embedded in and read from Java class files generated by the Java compiler. This allows annotations to be retained by the Java virtual machine at run-time and read via reflection. It is possible to create meta-annotations out of the existing ones in Java.

<span class="mw-page-title-main">Oxygene (programming language)</span> Object Pascal-based programming language

Oxygene is a programming language developed by RemObjects Software for Microsoft's Common Language Infrastructure, the Java Platform and Cocoa. Oxygene is based on Delphi's Object Pascal, but also has influences from C#, Eiffel, Java, F# and other languages.

<span class="mw-page-title-main">Java collections framework</span> Collections in Java

The Java collections framework is a set of classes and interfaces that implement commonly reusable collection data structures.

The Java programming language and Java software platform have been criticized for design choices including the implementation of generics, forced object-oriented programming, the handling of unsigned numbers, the implementation of floating-point arithmetic, and a history of security vulnerabilities in the primary Java VM implementation, HotSpot. Software written in Java, especially its early versions, has been criticized for its performance compared to software written in other programming languages. Developers have also remarked that differences in various Java implementations must be taken into account when writing complex Java programs that must work with all of them.

In computer programming, an anonymous function is a function definition that is not bound to an identifier. Anonymous functions are often arguments being passed to higher-order functions or used for constructing the result of a higher-order function that needs to return a function. If the function is only used once, or a limited number of times, an anonymous function may be syntactically lighter than using a named function. Anonymous functions are ubiquitous in functional programming languages and other languages with first-class functions, where they fulfil the same role for the function type as literals do for other data types.

This article describes the syntax of the C# programming language. The features described are compatible with .NET Framework and Mono.

This comparison of programming languages (associative arrays) compares the features of associative array data structures or array-lookup processing for over 40 computer programming languages.

In the Java programming language, heap pollution is a situation that arises when a variable of a parameterized type refers to an object that is not of that parameterized type. This situation is normally detected during compilation and indicated with an unchecked warning. Later, during runtime heap pollution will often cause a ClassCastException.

The computer programming language, C#, introduces several new features in version 2.0. These include:

In the Java programming language, the wildcard? is a special kind of type argument that controls the type safety of the use of generic (parameterized) types. It can be used in variable declarations and instantiations as well as in method definitions, but not in the definition of a generic type. This is a form of use-site variance annotation, in contrast with the definition-site variance annotations found in C# and Scala.

Kotlin is a cross-platform, statically typed, general-purpose high-level programming language with type inference. Kotlin is designed to interoperate fully with Java, and the JVM version of Kotlin's standard library depends on the Java Class Library, but type inference allows its syntax to be more concise. Kotlin mainly targets the JVM, but also compiles to JavaScript or native code via LLVM. Language development costs are borne by JetBrains, while the Kotlin Foundation protects the Kotlin trademark.

References