IC211 (Spring 2020)

Data Hiding & Constructors

Separating interface from implementation: supporting vs. enforcing

What we saw last class is that OOP views programs not as a collection of functions, but as a collection of objects, each of which combines data and functions (fields and methods, in Java parlance) in a single bundle. So what is the "interface" and "implementation" we want to separate? The interface is the collection of prototypes for the member functions (methods) we provide for outside use, while the implementation consists of the definitions of those functions, data members (fields), and any helper functions we use to make the object work, but which we don't really intend users of the class to call.

Given what we've seen so far, Java allows us to separate the interface from the implementation, but it doesn't enforce that separation. Programmers who make use of our classes can access data members (fields) and helper functions, even if we don't intend them to. This is an important point. It means that we can hope that folks stay away from our implementation, but we can't rely on it. Why does this matter? Well, for starters, it means reckless programmers can mess up our code — say by setting a data member (field) to an inappropriate value, for example setting a "distance" data member to a distance in miles when you were assuming the value was in kilometers. It also means that you cannot change your implementation without having to worry about breaking the code of folks who rely on your classes. After all, they may have been using elements of the implementation unbeknownst to you.

To truly realize the benefits of OOP (and, indeed, of separation of interface from implementation) we need to enforce this separation. The mechanism for this is access modifiers.

Access Modifiers: public vs. private

Access modifiers allow the programmer to indicate what can be used (called, assigned to, read from) by what parts of a program in a way that is enforced not just by the compiler but, more importantly, by the JVM!

There are three basic modifiers: public, private, and protected. Within the scope of this class, only public and private are meaningful. The access modifier (if present) is the very first thing in a declaration.

data members / fields: If a field of class Foo is marked private, it is inaccessible (for both reading and writing) outside of the class declaration itself. If a field of class Foo is marked public, it is accessible outside of the class too.
member functions / methods: If a member-function/method of class Foo is marked private, it cannot be called outside of the declaration of class Foo. If a member-function/method of class Foo is marked public, it is can be called from any code.
classes: Classes behave differently according to whether they are "nested" (i.e. a class that is defined within another class) or "unnested" (most of the classes we've seen so far). If a nested class is declared private, it can only be instantiated from within the enclosing class. If it is declared public, it can be instantiated from any place the enclosing class can be instantiated from. An unnested class may NOT be declared private. However, if an unnested class is not marked public, it is by default "package private". Package private classes can only be instantiated from within the package the class belongs to ... and since we haven't talked about packages yet, we'll have to leave it at that. If an unnested class is marked public, it can be instantiated from any code.

The golden rule!!!!! We will grade you on this! In most situations, the rule you want to follow is simple: make the class itself and all member-functions (methods) you intend outsiders to use public, make all other member-functions (methods) and all data-members (fields) private. If you do this, you have a well-defined interface (the public methods), and a well-defined implementation (everything else), and their separation is enforced by the compiler and the JVM.

You may review the official Java Tutorial on Access Control here: https://docs.oracle.com/javase/tutorial/java/javaOO/accesscontrol.html

Initializing Objects with Constructors

In order to free those using your class from remembering to call initialization routines, and in order to allow you as a class implementor to be sure that your objects never get corrupted or in a bad state (meaning that the values in data fields are somehow wrong), you need to be able to control the initialization of objects. The main method of doing this is with a constructor.

This is how initialization is almost always done! A constructor is a special kind of instance method: it has no return type, and it is called when an object is created. In fact, "new" basically calls the constructor. Constructors can be overloaded, i.e. multiple constructors can exists, as long as they differ in the type and/or number of parameters they take. Parameters are used to allow the caller input into how the object will be initialized. For example, consider the following program that includes a class Coundown to ... well, to count down!

Countdown.java

public class Countdown {
  private int ctr;

  // This is the constructor!
  public Countdown(int start) {
    ctr = start;
  }

  public String next() {
    if (ctr < 0) {
      return null;
    } else if (ctr == 0) {
      ctr--;
      return "Blastoff!";
    }
    return "" + ctr--;
  }
}

Ex1.java

public class Ex1 {
  public static void main(String[] args) {
    Countdown C = new Countdown(3);  // Calls the constructor!!!
    String resp = C.next();

    while( resp != null ) {
      System.out.println(resp);
      resp = C.next();
    }
  }
}

Notice that the user of the class decides to countdown from 3 instead of 10, or 100, or whatever. Notice that the 3 in the "new" expression becomes the "start" parameter in the constructor. Note: Once one or more constructors are present in a class, only those constructors can be used. For example, with constructor Countdown(int start) present, it's not possible to instantiate a Countdown like this (below) anymore,

Countdown C = new Countdown();

since that would require a Countdown() constructor.

Initializing Objects without a specified constructor

What happens if you don't include a constructor in your code? The correct answer is that a default constructor is provided to you, but you'll have to wait another week to learn how that happens (through inheritance). For now, we can just talk about how your class data fields are initialized:

default initialization: Unless you specify otherwise, when an object is instantiated, all data-members of primitive type are initialized to 0/0.0/false, and all reference data-members are set to null.
initialization in the field declaration: This sounds attractive, but is not generally what you want. To give an example, in the following code, the field ctr is initialized to 10, so that every instance that gets created has its ctr field set to 10.
```
public class Countdown {
  private int ctr = 10;
  public String next() {
    if( ctr < 0 ) {
      return null;
    } else if( ctr == 0 ) {
      ctr--;
      return "Blastoff!";
    }
    return "" + ctr--;
  }
}
```

A properly designed "Batter" class

Since good "data hiding" design says keep data "private", here is a good implementation of the Batter class from last lecture:

Batter.java

public class Batter {
  private int hits;
  private int atBats;

  public void record(String outcomes) {
    for (int i = 0; i < outcomes.length(); i++) {
      if (outcomes.charAt(i) == 'h') {
        hits++;
      }

      if (outcomes.charAt(i) != 'w') {
        atBats++;
      }
    }
  }

  public double average() {
    return (double)hits / atBats;
  }
}

We can see one benefit of following the data-hiding stricture right away. No matter how anyone else uses this class, no matter what they do with it, no matter how they rely on it, if I change the class to this to the following, it all still works!

Batter2.java

public class Batter2 {
  private int hits;
  private int outs;
  private int walks;

  public void record(String outcomes) {
    for (int i = 0; i < outcomes.length(); i++) {
      if (outcomes.charAt(i) == 'h') {
        hits++;
      }

      if (outcomes.charAt(i) == 'o') {
        outs++;
      }

      if (outcomes.charAt(i) == 'w') {
        walks++;
      }
    }
  }

  public double average() {
    return (double)hits / (hits + outs);
  }
}

How do I know that? Because the only things that changed (the fields and the definitions of methods) were things that no code outsde the class could ever make use of, access, or touch in anyway. So these changes could not possibly affect any outside code!

"`static`" methods and fields

Access levels (public/private) control who can access something. We now discuss a different modifier, static, that determines if a field/method is specific to an instantiated object, or generalized and not specific to any one object.

Up to this point, we've made a big deal that all member fields in a class have separate copies in each instance of the class, so if we have two variables: Point one,two; then one.x is a different variable from two.x. There is an exception to this. We can declare a member field as static. In that case there is a single variable that is shared between all instances.

class Point {
  int x,y;
  static int num;
}

In this case one.num is the same variable as two.num. If I change one.num, I have changed two.num as well. This is handy for information that should be shared across all objects of the class. Imagine you wanted to establish a unique ID number to each object. Keeping a static field for the next free ID would be handy. But, what is the value of num initially? We can't initialize it because we don't have an object yet in order to name it. The answer is to use the name of the class not the name of a variable instance of the class. We could make num 0 by saying Point.num=0; Note that this is only valid for static fields.

Stylistically, since we can always access num via the class name instead of a variable name, it has become preferred that we always do. This way it signals to the reader of our code that this is not a regular member field.

We've seen the keyword static also applied to methods. A static method still is a member method of that class, but like static fields, it is not associated with any particular object. What this means is that inside this method, you cannot access any member fields that are not static. Consider adding the following to Point:

public static int foo() {
  return x; // ERROR ERROR ERROR!
}

If foo were not static, we could happily do one.foo(), and it would access the x field of the object one. But, since this is static, we would access this function as Point.foo(). So which object's x do we access? It's not clear! The compiler agrees with your confusion. For this reason, we would get a compile time error: "non-static variable x cannot be referenced from a static context." However, there is no impediment to accessing other static fields or functions:

public static int foo() {
  return num; // the field 'num' is declared static above
}

Static methods are used in 2 cases:

When we only want to access static fields in the class, like the example above.
When we don't need to access any of the state of an object because all the information we need is in the arguments. This is usually for utility functions that are best called in a structured/procedural way. The built in math functions are all examples of that: Math.pow(3,2);