Abstract Data Types and Objects
Two fundamental approaches to data abstraction
14 Mar 2020
There are three main methods of representing data which developers are likely to encounter: abstract data types, algebraic data types, and objects. Abstract data types (frequently abbreviated “ADTs”) are likely familiar to developers with a computer science background, and algebraic data types (unfortunately also abbreviated “ADTs”) are likely familiar to developers with a functional programming background. Objects are a concept that most developers are extremely familiar with on a practical level, but many struggle to precisely define. These three approaches each have distinct tradeoffs, and familiarity with their characteristics can enable developers to choose the most advantageous approach for a given problem. This post will introduce and compare two of these, abstract data types and objects. I do the same for algebraic data types in the sequel to this post, here. Example code will be written in TypeScript.
This post is heavily based on one of my favorite papers, “On Understanding Data Abstraction, Revisited” by William Cook. I have tried to pull out and present the concepts which are most likely to be relevant to the average JavaScript developer while omitting more abstract mathematical concerns, but for those who are motivated, I highly recommend reading the original paper and its companion essay.
Abstract data types: representing data opaquely #
An abstract data type is a model for data consisting of values and operations, whose concrete structure is hidden. For example, a Set
abstract data type is defined as having operations like add
, remove
, and has
. Each of these operations mutates or returns a Set
value, but the means by which they do so, or by which a Set
is represented, is hidden from consumers of the Set
type. To be strictly correct, the word “model” is an operative part of this definition: an abstract data type is an idea, describing a data type that could be represented in different languages. Colloquially, it’s common to use the term “abstract data type” to refer both to the formal definition of the data type as well as to concrete implementations of it. I will use the colloquial definition, and refer to software components which hide their internal structure as abstract data types.
There are several abstract data types built into the JavaScript language: Set
, Map
, and Array
are good examples. Developers have the ability to create and manipulate these, but are not able to access, modify, or extend their underlying representations. (Assuming we are limited by the constraints of good sense and decency and don’t try to modify the prototypes of things that we shouldn’t.) This leads to a fundamental tradeoff which characterizes abstract data types: by making the implementation of the data opaque, we have gained modularity at the expense of extensibility.
Objects: representing data through composable interfaces #
Cook’s definition of objects and object-oriented software discards many things which are normally considered essential to the paradigm, such as classes, mutable state, and inheritance. His definition instead centers on a total dedication to encapsulation. This is the definition that we will use here: a system is properly built out of objects if every function or method in the system only has access to the internals of a single abstraction. A method on an instance of a Foo
class can only access to its own instance state. Any interaction with another object, even another instance of class Foo
, must be done through an interface. In fact, it should be impossible to tell whether any object is of class Foo
or not! This is an implementation detail, which should be hidden by abstractions. Only the interfaces implemented by Foo
may be observed. Language features which allow you to break an object’s encapsulation, such as using instanceof
to figure out what concrete type of object is providing the interface being used, are completely disallowed.
This definition is clearly based on established object-oriented principles: “maintain encapsulation” and “code to an interface, not an implementation” are well-known sayings among object-oriented programmers. This definition is much stricter, though, and a great deal of code which is often referred to as “object oriented” would be more accurately described as “imperative, using classes” under this definition. Cook calls the guiding principle of his object-oriented programming “autognosis”, meaning “only having knowledge of one’s self”. As we’ll see, implementations which follow autognosis have a very different set of costs and benefits than ones which do not.
An abstract data type representation of a set #
To make the nature of and distinction between abstract data types and objects more concrete, we’ll implement the same data structure in each. The structure in question will be a set holding numbers.
The easiest implementation of this set as an abstract data type (though not the most efficient) is as a sorted array. This is what we’ll use here. We’ll provide five functions which operate on these sets: empty
will create an empty set, add
will add an item to a set, isEmpty
will tell us whether a set is empty, has
will tell us whether a set contains a given number, and union
will merge two sets together. Here is our implementation:
A few things to note about this implementation:
- Our
NumberSet
type is immutable; ourinsert
andunion
functions return a newNumberSet
without modifying their inputs. This shows that abstract data types are compatible with purity and functional programming, despite the fact that the ones built into JavaScript are mutable. - Our
empty
andisEmpty
functions run in constant time; our other functions run in linear time on the size of the set. A less naive implementation could reduce these to logarithmic time. - Our
union
function absolutely does not respect autognosis: it takes in two different sets, and operates on the internals of both, iterating through them as arrays. This solution is not object-oriented, by the definition we’re using.
This implementation does have one caveat– it’s not truly abstract. This is because TypeScript types are structural, so to the rest of a program which uses this file, it’s clear that NumberSet
is actually Array<number>
. This could be a problem. Say we wanted to add a size
function to our sets, which returned the number of distinct elements they contained. Here’s an obvious and efficient implementation:
This function assumes that there’s no duplicates in the array, because we enforce that invariant with our add
and union
functions. If a user were to access the array directly, though, they could insert duplicate elements, which would make our size
function return the wrong result. Maintaining abstraction is thus essential for our abstract data type to stay extensible.
There are a couple of ways we can enforce this abstraction. One would be to lean on the type system and use brands, a common trick for sneaking additional information into a type. We could also enforce encapsulation using classes:
The logic is the same as before; the difference now is that state is private, and that only instances of NumberSet
can view each other’s contents
.1 None of the aforementioned properties have changed.
It might seem strange to say that this implementation isn’t “object oriented”, because we’re using classes, and our implementation looks an awful lot like examples that you may have been told were OO. Using our current definition, though, OO is about much larger architectural properties than whether the class
keyword is used as an abstraction mechanism. As we’ll see next, an autognostic OO solution looks and behaves radically differently.
An object implementation of a set #
To implement NumberSet
with objects, we first need an interface for the objects that we’ll be working with. Here is this interface:
This interface has four of the five functions which we implemented for our abstract data type. Rather than providing a way to get an empty set as a method on our interface, we’ll create a class for empty sets.
The implementation of isEmpty
and has
are obvious. An empty set always returns true
when asked if it is empty, and false
when asked if it contains a given item. The union of an empty set and another set is just the other set. Our add
method may be more puzzling, however. By definition, an empty set doesn’t hold anything, so Empty
shouldn’t have any logic around holding items. We thus make a new class which is responsible for adding items to a set, called Insert
, and make Empty
delegate responsibility for adding items to the Insert
class. Here’s the definition of Insert
:
The Insert
set is constructed by providing a number and another set, and is responsible for representing that set with the given number added to it. Insert
only takes responsibility for this one number; if asked whether it contains a different number with has
, Insert
will pass responsibility for determining this off to its contained set.
In a similar vein, Insert
doesn’t deal with taking the union of sets. Instead, we pass that off to a class which has that as its single responsibility, called Union
. Here’s its implementation:
Compare our Union
object to the union
implementation of our abstract data type. That implementation combined two sets by being aware of each of their internal structures. This one implements its has
and isEmpty
functions by delegating responsibility to each of the sets which it wraps. It interacts with these other sets through their public NumberSetI
interface, without knowing what concrete class is involved. This means that our Union
class respects autognosis.
Just as with our abstract data type implementation, all of our data here is immutable. The add
and union
methods don’t modify the sets they’re called on, but rather return a new set. While people usually think of object-oriented programming as being based on mutability, objects can be very useful without any mutable state.
Writing autognostic code in TypeScript #
Cook provides two rules for writing this style of OO code in Java, and these may serve as guidelines for TypeScript as well. Here they are verbatim:
Classes only as constructors A class name may only be used after the keyword new.
No primitive equality The program must not use primitive equality (==). Primitive equality exposes representation and prevents simulation of one object by another.
In particular, classes may not be used as types to declare members, method arguments or return values. Only interfaces may be used as types. Also, classes may not be used in casts or to test with
instanceof
.
I would not take these as instructions for writing all code in TypeScript. Rather, they are guard rails to follow in sections of your codebase where you want the full extensibility benefits that objects bring. For a more thorough exploration of the importance of keeping types separate from classes, I recommend the talk “It is Possible to do OOP in Java” by Kevlin Henney.2
Tradeoffs between abstract data types and objects #
Abstract data types and objects have radically different characteristics, both in their implementations and in their usage in a larger system.
Looking first at these two implementations in isolation, we can see that their performance characteristics are radically different. For our abstract data type, isEmpty
was a constant time operation, and the runtime of union
, add
and has
were based on the number of items in the set. For our object implementation, union
and add
are constant time operations, while the runtime of has
and isEmpty
are based on the total number of methods that have been called on our set. In general, many performance optimizations are based on reducing some complex or redundant structure to an equivalent but smaller case. When we follow autognosis, this approach is not available, because we don’t get to inspect the concrete structure that we’ve created. For example, when we take the union
of two sets in the abstract data type implementation we eliminate any duplicate entries. If we take the union of the same sets over and over, our set will not grow, and later calls to has
will not be any slower. In our object implementation, Union
has no way to know whether the sets its combining overlap, and so can’t perform any kind of simplification. Calling union
repeatedly on the same sets will result in more and more Union
nodes being added, and later calls to has
on this set being slower.
There is a justification for this cost: object implementations can be easily extended in ways that are impossible for abstract data types. The internal structure of an abstract data type determines and limits what data it can represent, but objects can represent anything for which we can define interface methods. For instance, our abstract data type implementation has no way to represent infinite sets, but our object implementation can represent them easily:
Everything
is the set which contains all numbers, while Even
contains every even integer. Their implementations are trivial, and they can be used with all of our existing sets. If the sets we’ve written so far were published as a library, callers of the library would be free to implement their own sets, without needing anything from our library but the interface definition. The set of all prime numbers, the set of all numbers in a range, or any other set you’d like could be written and would interoperate with each other.
Objects are clearly superior to abstract data types when it comes to adding new representations, but this doesn’t mean that they are more extensible in every way. Consider adding a smallestIntegerAbove
function, which takes a number, and returns the smallest integer which is contained in the set that is larger than the provided number, or null
if no such integer exists. Adding this to the abstract data type requires adding one additional function:
To add this method to our object implementation, though, we’d have to modify every NumberSetI
instance that we’ve implemented. Empty
, Insert
, Union
, Everything
and Odd
will all have different implementations of smallestIntegerAbove
. Worse, imagine that we had published both of these set implementations on npm
. Adding a new function to the abstract data type implementation would be a minor change; our users could upgrade and just ignore the new function if they don’t want to use it. For the object implementation, extending the interface expected from objects would be a breaking change. Upgrading would require users to write additional code in order to stay compatible with other set implementations.
Abstract data types are thus easily extensible in terms of operations, but not in terms of representations, while objects are easily extensible in terms of representations, but not in terms of operations. This tradeoff of operational extensibility versus representational extensibility is known as “the expression problem”. There are solutions to this problem which enable both kinds of extensibility for objects in a modular way, but they are much more involved than the easy representational extensibility which we have right now.
Abstract data types and objects both provide data abstraction. This means that, from the perspective of a user consuming them, data is defined in terms of “what can be done with this data” rather than “what structure this data has.” For abstract data types, this abstraction is achieved by implementing a structure for data internally, and hiding this structure from users. For objects, no consistent structure exists, and data is represented as the composition of the behaviors which appear on the object’s interface. The individual objects which are combined to form a single data structure are as isolated from each other as the implementations of abstract data types are from the users who consume these types. Techniques which break the abstraction provided by object an object’s interface, such as the use of instanceof
, are thus strictly forbidden.
This fundamental distinction leads to a number of tradeoffs. Abstract data types are much easier to optimize than objects, but are impossible to extend without access to their internal implementation. Objects may be freely extended by creating additional representations which conform to the object’s published interface, even by users who are consuming the objects from a library. The use of concrete structures inside of abstract data types makes extending these structures with additional representations difficult, but a library author has a great deal of freedom to add new operations. In contrast, despite the ease with which objects may be extended representationally, adding new operations to them tends to require sweeping changes.
The two concepts are complementary, and one’s strength tends to be the other’s weakness. When you need data abstraction, knowing their tradeoffs is immensely important for choosing the appropriate tool for a given situation.
The capabilities of these two are vast, but they aren’t the only approaches to representing data. Algebraic data types are a third option, which represent data as a composable, concrete structure. I introduce them and contrast them with abstract data types and objects here.
This post is a reworking of one which I originally published on Medium. You can view my old Medium posts here.
-
Technically it’s still possible to break encapsulation here, by using casts or untyped JavaScript. A lot of things that should be forbidden by good sense and manners are technically possible, though. Here, we’ll make the assumption that our developers aren’t out to break our architecture by hopping over guard rails. ↩
-
The title of this talk is taken from a line in the Cook paper. ↩