tag:blogger.com,1999:blog-32575062946748196412024-03-08T01:30:17.675-05:00Natural SoftwareMorgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.comBlogger29125tag:blogger.com,1999:blog-3257506294674819641.post-62821001318598543862010-12-10T14:42:00.000-05:002010-12-10T14:42:18.082-05:00Klingon ScalaEnglish is a "<a href="http://en.wikipedia.org/wiki/Subject_Verb_Object">subject-verb-object</a>" language. This denotes the ordinary order of words in an English sentence. For example, "He loves her," reveals his feelings, but not hers. When we write code in an object oriented language, we tend to choose words that reflect this practice.<br />
<br />
In <a href="http://www.scala-lang.org/">Scala</a>, given a Set <b><i>ns</i></b> of Integers, we can use the Set's <b><i>contains</i></b> method to ask whether a given number is in the Set.<br />
<br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> val ns = Set(8, 15, 17)</span><br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> println(ns contains 42) //false</span><br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> println(ns contains 17) //true</span><br />
<br />
In a sense, <i><b>contains</b></i> is a binary operator that carries ordered pairs to Booleans. We emphasize that, unlike other binary operators such as `+`, this one is not commutative. A snippet like "42 contains ns" would mean something else entirely, and doesn't even compile.<br />
<br />
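The pairs-to-Booleans reading can be made literal by lifting <b><i>contains</i></b> into a function value. A small illustrative sketch (the names <i>member</i> and <i>onPairs</i> are my own, not from the post):

```scala
object ContainsAsOperator {
  val ns = Set(8, 15, 17)

  // `contains` lifted to a function value: a binary operation
  // taking a Set and an Int.
  val member: (Set[Int], Int) => Boolean = _ contains _

  // Tupled, it is literally a map from ordered pairs to Booleans.
  val onPairs: ((Set[Int], Int)) => Boolean = member.tupled

  def main(args: Array[String]): Unit = {
    println(onPairs((ns, 17))) // true
    println(onPairs((ns, 42))) // false
  }
}
```

Because the pair is ordered, swapping the components gives an expression with a different (here, ill-formed) type, which is the non-commutativity point made above.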
<span class="Apple-style-span" style="color: purple;"><span class="Apple-style-span" style="font-size: large;"><span class="Apple-style-span" style="color: black;">A </span><a href="http://c2.com/cgi/wiki?EmbeddedDomainSpecificLanguage">DSL</a><span class="Apple-style-span" style="color: black;"> with ∈</span></span></span><br />
<br />
Klingon (<a href="http://en.wikipedia.org/wiki/Klingon_language">tlhIngan</a>) is an "<a href="http://en.wikipedia.org/wiki/Object_Verb_Subject">object-verb-subject</a>" language. Translating word-by-word from such a language, "Her loves he," or more properly, "She is-loved-by him," again tells us about his feelings, but not hers. Sometimes when writing code, it would be easier on the reader to shuffle the order of our operands.<br />
<br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> println(42 `∈` ns)</span><br />
<br />
This of course does not compile because there is no such <b><i>`∈`</i></b> method of integers. However, when the gain in readability is worth the effort, Scala offers a way to write such expressive code.<br />
<br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> class MyElement[X](x :X) {</span><br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> def `∈`(xs :Set[X]) = xs contains x</span><br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> }</span><br />
<span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"> implicit def toMyElement[X](x :X) = new MyElement(x)</span><br />
<br />
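Put together, the definitions above make the wishful <b><i>`∈`</i></b> syntax compile. A runnable sketch (the enclosing objects are my own scaffolding, not from the post):

```scala
import scala.language.implicitConversions

object KlingonSyntax {
  // Wrapper that gives any value an `∈` method asking Set membership.
  class MyElement[X](x: X) {
    def `∈`(xs: Set[X]): Boolean = xs contains x
  }
  // The implicit conversion that wraps values on demand.
  implicit def toMyElement[X](x: X): MyElement[X] = new MyElement(x)
}

object Demo {
  import KlingonSyntax._ // `∈` is usable only where this import is in scope

  def main(args: Array[String]): Unit = {
    val ns = Set(8, 15, 17)
    println(42 `∈` ns) // false
    println(17 `∈` ns) // true
  }
}
```

Note that code outside the scope of the import sees plain integers with no `∈` method, which is the scope control discussed below.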
This approach contrasts a bit with <a href="http://en.wikipedia.org/wiki/Monkey_patch">monkey patching</a> found in other languages. On one hand, the Scala approach tends to be a bit more verbose, since a new class is defined. On the other hand, Scala allows careful control over the modification's scope. Instead of globally altering the integer type as monkey patching would do, Scala affects the code only where the <b><i>implicit</i></b> function is imported.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-90524600215970985962010-12-03T10:25:00.019-05:002010-12-03T11:34:19.245-05:00Scala Duck Typing, AlmostThere are a couple of different approaches to type systems, and I'm not talking about the whole static vs. dynamic thing. The <a href="http://en.wikipedia.org/wiki/Nominative_type_system">nominative</a> approach requires subtypes to extend base types explicitly. The <a href="http://en.wikipedia.org/wiki/Structural_type_system">structural</a> approach allows types to be equivalent if they merely have the same methods. 
Scala supports both.<div><br /></div><div><div><span class="Apple-style-span" style="color:#3333FF;"><b>Nominative Example</b></span></div><div><br /></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>trait Printable { </span><span class="Apple-style-span" style="font-family: 'courier new'; ">def print :Unit </span><span class="Apple-style-span" style="font-family: 'courier new'; ">}</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><br /></span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>class Nominative extends Printable {</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>def print { println("Nominative") }</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>}</span></div></div><div><br /></div><div>If we define a function that accepts <i>Printable</i> instances, then it will happily accept <i>Nominative</i> instances, too.</div><div><br /></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>def nominative(p :Printable) = {</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>p.print</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>}</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><br /></span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>nominative(new 
Nominative)</span></div><div><br /></div><div><div><span class="Apple-style-span" style="color:#3333FF;"><b>Structural Example</b></span></div><div><div><br /></div><div>Because our <i>Structural</i> class below does not explicitly extend <i>Printable</i>, the compiler does not let us pass its instances into the <i>nominative</i> function, even though it has a suitable <i>print </i>method. And sometimes, that's exactly the kind of type safety we want.</div><div><br /></div></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>class Structural {</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>def print { println("Structural") }</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>}</span></div></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><br /></span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>nominative(new Structural) // does NOT compile</span></div><div><br /></div><div>But, other times it isn't. Scala is powerful enough to support structural types, whose definitions look like traits but without names. 
We can use the <i>type</i> keyword to give our structural type an alias.</div><div><br /></div><div><span class="Apple-tab-span" style="white-space:pre"><span class="Apple-style-span" style="font-family:'courier new';"> </span></span><span class="Apple-style-span" style="font-family:'courier new';">type CanPrint = { def print :Unit }</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><br /></span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>def structural(p :CanPrint) = {</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>p.print</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>}</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><br /></span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>structural(new Nominative)</span></div><div><span class="Apple-style-span" style="font-family:'courier new';"><span class="Apple-tab-span" style="white-space:pre"> </span>structural(new Structural) // compiles!</span></div><div><br /></div><div>A nice feature here is that our <i>Structural</i> class was defined before <i>CanPrint</i>, so structural typing is useful when we must adapt old code to a new purpose.</div><div><br /></div><div><span class="Apple-style-span" style="color:#3333FF;"><b>An Interesting Idiom</b></span></div><div><br /></div><div>Finally, let's consider an interesting non-legacy case. Suppose we want structural typing, and we also have full control over our class definitions. 
It sure would be nice to be able to have the compiler check that our signatures match up.</div><div><br /></div><div>Unfortunately (or perhaps fortunately, since it's not completely clear what it should mean), the following does not compile.</div><div><br /></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space: pre; "> </span>class DoesNotCompile extends CanPrint {</span></div><div><br /></div><div>So instead, let's use the <i>Predef.identity</i> generic function to ensure that our class does indeed have the correct structure.</div><div><br /></div><div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space: pre; "> </span>class AnotherStructural {</span></div></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span><span class="Apple-style-span" style="color:#CC0000;"><b>identity[CanPrint](this)</b></span></span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><br /></span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>def print :Unit = {</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>println("AnotherStructural")</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>}</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><span class="Apple-tab-span" style="white-space:pre"> </span>}</span></div><div><span class="Apple-style-span" style="font-family: 'courier new'; "><br /></span></div><div>If we had misspelled or forgotten the <i>print</i> method, our class would not have compiled.</div><div><span 
class="Apple-style-span" style="font-family: 'courier new'; "></span></div><div><div><br /></div><div><span class="Apple-style-span" style="color:#3333FF;"><b><br /></b></span></div></div>Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-21579329643984232442009-11-23T08:36:00.006-05:002009-11-23T09:27:20.954-05:00AspectJ and Scala<div><div>What is the "atom" of software? If you consider an atom to be the smallest thing with which you can work, while continuing to do chemistry, then what's the software analogue?</div><div><br /></div><div>My first thought was that an atom is a file. I can jar them up to make molecules, and string peptides of them together to make <a href="http://www.osgi.org">OSGi</a> bundles. At some point the inorganic chemistry of programming becomes the protein-rich biochemistry of software engineering.</div><div><br /></div><div>Or maybe an atom is the largest thing that, in isolation, can't possibly have a bug in it. Something like an instruction. Or maybe even a fully unit-tested class or method.</div><div><br /></div><div>But, the history of the atom allows our analogy to grow richer, and weirder. Originally, atoms were a computational aid. They were discovered as a way to predict the outcomes of macroscopic chemical reactions. Even up until around <a href="http://wien.cs.jhu.edu/AnnusMirabilis/AeReserveArticles/eins_brownian.pdf">1905</a>, there were still a handful of practicing chemists who didn't believe in atoms, except as a calculational tool.</div><div><br /></div><div>But real they are, regardless of the intended meanings of the symbols chemists use to denote them. So, atoms feel more like <a href="http://en.wikipedia.org/wiki/Aspect_(computer_science)">aspects</a> to me. 
Always there lurking in a program’s behavior, even if not represented using aspect syntax in the source code.</div><div><br /></div><div>If I have a class implementing the public <a href="http://en.wikipedia.org/wiki/Application_programming_interface">API</a> of some library, then I log all the incoming calls. That logging is an aspect, even though I might have duplicated those <a href="http://www.slf4j.org/">slf4j</a> calls in a dozen places. If I have code that takes care to release resources after I've acquired and used them, then that's another aspect. And if I've forgotten the finally clause somewhere, then that bug is a contaminant in the reactants, which makes my program behave differently than my chemistry equations would predict.</div><div><br /></div><div>The trouble with hand-implemented aspects like repetitive logging calls or finally clauses -- even when you remember all of them -- goes beyond the biz logic pollution that they impose. All that duplicated code permits inconsistencies. For example, the log message in this method here looks a little different than the one over there.</div><div><br /></div><div>And that's a bug escape. Because the log scraper that customer support is using, which you didn't even know about, is going to malfunction on that logging call that's only half a bubble off plumb.</div><div><br /></div><div>It's a bit like isotopes of atoms. Not all carbon atoms are alike. You used a carbon-12 here, but whoops you used carbon-14 over there. And we know that one can decay on you. You used mostly <a href="http://en.wikipedia.org/wiki/Hydrogen-1">protium</a>, but here's a deuterium, so the heavy water that you made from it has measurably different physical properties (like boiling point), even though the chemical properties are the same.</div><div><br /></div><div>Keeping with the analogy, hand-implemented aspects take you out of ordinary chemistry and force you to worry about nuclear and physical effects. 
It would be better to elevate aspects in the code to natively supported compiler constructs, like classes, so everybody is using the same isotopes.</div><div><br /></div><div>That's why using <a href="http://www.eclipse.org/aspectj/">AspectJ</a> and Scala together tops my list of exciting things to do. I think of AspectJ as an external <a href="http://www.martinfowler.com/bliki/DomainSpecificLanguage.html">DSL</a> that allows me to define <a href="http://www.martinfowler.com/bliki/DomainSpecificLanguage.html">pointcuts</a> into my Scala code. Pretty much all my code, including the <a href="http://en.wikipedia.org/wiki/Advice_(computer_science)">advice</a>, continues to be written in Scala itself. The real virtue of AspectJ lies in the <a href="http://en.wikipedia.org/wiki/Aspect-oriented_programming">weaving</a>.</div><div><br /></div><div>And for some elegant work on internal DSL alternatives, refer to the <a href="http://www.jot.fm/issues/issue_2009_11/article5/index.html">paper</a> by <a href="http://www.codecommit.com">Daniel Spiewak</a> and Tian Zhao about an AOP implementation in Scala.</div><div><br /></div><div>So, rather than worrying about polluting my biz logic with code that better belongs in aspects, I'm now on guard against letting my biz logic leak into my aspects. And this is a much happier place to be.</div><div><br /></div><div>Come to think of it, the false promise of object oriented programming was to offer reuse. This never really happened because classes are the wrong size to be reusable. Too small to be independently deployable, and too large to exclude application-specific implementation details. Instead, <a href="http://en.wikipedia.org/wiki/Object-oriented_programming">OO</a> 's importance comes from the organizing principles it champions. But I wonder if aspects, devoid of custom biz logic, might take us closer to reusable software. Components and libraries are reusable in the large. 
But might some group (or <a href="http://www.webelements.com/">period</a>) of little aspect atoms be reusable in the small?</div><div><br /></div></div>Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-61806541477897556752009-10-29T22:44:00.002-04:002009-10-29T22:51:04.971-04:00Principled ConcordionThere are a couple of ways to look at a hiatus from blogging. Either you are so successful that you rationalize you're too busy or important to reflect on all the wonderful things happening, or you've grown too slothful to find something exciting enough to share. I recently went to the No Fluff Just Stuff conference, and it has recharged my batteries, much as it did the last time I attended. Life offers only the palest excuses to avoid thoughtful introspection, or to fail to discover shareworthy things.<br /><span style="font-weight: bold;"><br />Analogies, Analogies</span><br /><br />Lewis Carroll famously asked, why is a raven like a writing desk? The question remains the archetypal example of a riddle deliberately concocted to have no solution. Nevertheless, I love contrived analogies for a couple of reasons. First, they are useful because they can communicate profound ideas economically. Second, they are whimsical and can warp the mind into discovering new ideas.<br /><br />Which brings me to why developing software is like modeling a pendulum. To predict the future behavior of a simple harmonic oscillator, I need to know both the current position and the current velocity. Without two pieces of information, I can't solve the differential equation, and chart the pendulum bob's trajectory. Sorry, but there's just no getting around needing two values. Blame mathematics itself.<br /><br />We often attack software projects like this. We figure out our current state and use it to predict where we need to go. 
This is a bit like taking a snapshot of the customer's expectations or requirements, and marching in the correct direction. The problem is, taking these measurements is really hard. And just a small error in requirements can lead to unsatisfied customers.<br /><br />But, there's another way to solve differential equations. I still need two pieces of information, but they don't both have to be "initial" conditions. In a Dirichlet problem, I'm given an initial and a final position. From these, I can figure out the intermediate positions and the velocities.<br /><br />We should (and the better among us do) develop software like this. By capturing requirements as stories, and expressing them as executable tests, we reduce our measurement errors. Moreover, our trajectory is anchored by the end condition, not merely by initial guesses, so we're less likely to march off into the weeds.<br /><br /><span style="font-weight: bold;">Concordion</span><br /><br />Consequently, I'm becoming increasingly enamored with Concordion. There are many descriptions of the tool available on the web, so I won't parrot them. Instead, I'd like to offer a different perspective, which I hope will not offend the Concordion community.<br /><br />Concordion is an organizing principle, which helps one design acceptance tests and other tests of software. To be pedantic, it's actually an instantiation of such a principle, much as Smalltalk is an example of object oriented programming. My take on it is this: a family of automated tests deserves human-readable views into them, with appropriate encapsulation or elision of distracting details, such as execution order.<br /><br />Concordion is often compared with FitNesse, but infrequently contrasted with it. FitNesse drives tests. I can go to a web page, push a button, and see my test run. Concordion, however, is a view into tests. I go to a web page to see results, which probably came from a continuous integration server. 
This difference is profound.<br /><br />You can find a dozen books on object oriented programming, particularly the older ones, that sing the praises of OO because it permits code reuse. In the real world, reuse turns out to be the least compelling reason to embrace object oriented programming. The real value of OO principles lies in the improved organization of the resulting code. We mean "improved" here for human readability, not necessarily performance or computer efficiency.<br /><br />Analogously, you can find many books about automated testing and the virtues it brings to software development. But a neglected advantage of good automation is that it offers ways to organize tests. Well-presented tests are superior expressions of requirements.<br /><br />With Concordion, I can design web page views into my tests. I leave many details, such as the order in which tests run, to my continuous integration server. For example, tests with similar setup requirements can be grouped together. But I can organize the presentation of the results any way that I want. For example, tests can be organized by sprint, or by module, or by cross-cutting feature.<br /><br />Concordion makes software development look more like a Dirichlet problem, where I can keep the end in mind from the very beginning. Thinking of Concordion not as a tool, but as a principle, will shape how I program. And I still have much to learn about how one does that well.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-30608263562008905782009-06-24T12:01:00.001-04:002009-06-24T12:01:02.872-04:00More Scala Using RAISINLast time, we offered a minimally functional emulation of C#'s using syntax, to manage resources elegantly in Scala. We defined a curried function, whose second argument was a simple block of code. 
We'll refine that approach and try to bring about the remaining goals we set for ourselves for this feature.<br /><blockquote style="font-family: courier new;"><pre><br /> def using[T <% Disposable]<br /> (resource: T)(block: => Unit) = {<br /> try {<br /> block<br /> }<br /> finally {<br /> resource.dispose<br /> }<br /> }<br /></pre></blockquote><br />One problem with our first cut was that the object encapsulating the managed resource had a larger scope than we wanted. Since we constructed our FileHandle instance outside of the block that used it, one could accidentally access it after it had been disposed.<br /><blockquote style="font-family: courier new;"><pre><br /> val handle = new FileHandle("trouble")<br /> using(handle) {<br /> handle.read<br /> handle.write(42)<br /> }<br /> // big trouble below!<br /> handle.read<br /></pre></blockquote><br />What we really need is not to pass a Unit into the using function, but a function that accepts the resource as its argument. In other words, we'd like to be able to make a useful function and pass that as an argument into the using method.<br /><blockquote style="font-family: courier new;"><pre><br /> def useful_function(handle: FileHandle): Unit = {<br /> handle.read<br /> handle.write(42)<br /> }<br /><br /> // pseudo-code to capture the idea<br /> //<br /> using(new FileHandle("good"), useful_function)<br /></pre></blockquote><br />That's the gist of what we want to do, but we don't want all the cruft of declaring the useful function separately. Happily, Scala allows us to use function literals to write the above very economically.<br /><blockquote style="font-family: courier new;"><pre><br /> using(new FileHandle("good")) { handle =><br /> handle.read<br /> handle.write(42)<br /> }<br /> //<br /> // handle is not visible down here and<br /> // can't be abused, Yay<br /></pre></blockquote><br />For this to work, we have to refine our using method. 
All we have to do is change the second argument from type Unit to the function T => Unit, and make sure to call the block with the expected T resource.<br /><blockquote style="font-family: courier new;"><pre><br /> def using[T <% Disposable]<br /> (resource: T)(block: T => Unit) {<br /> try {<br /> block(resource)<br /> }<br /> finally {<br /> resource.dispose<br /> }<br /> }<br /></pre></blockquote><br />Our using function is pretty powerful now. Without any modifications, it works with closures as well as function literals. Let's alter the client code a bit to demonstrate. The following is a closure and not a function literal because i is not defined inside the curly braces demarking the code passed into using.<br /><blockquote style="font-family: courier new;"><pre><br /> def demonstrate_closure(i: Int) = {<br /> using (new FileHandle("simple")) { handle =><br /> handle.read<br /> handle.write(i)<br /> }<br /> }<br /></pre></blockquote><br />Still, there are additional things we can do in the body of our using method. For example, we could take special action if the resource passed in were null. Alternatively, we could wrap the dispose calls inside a try-catch block to prevent them from emitting exceptions.<br /><br />C++ uses compile-time overloading to choose different behaviors for some functions. For example, the new operator comes in different overloaded flavors. One takes a throwaway argument of type nothrow_t to indicate that the desired version of new will return NULL when it fails, instead of throwing an exception.<br /><br />In Scala, a tried and true way to choose different behaviors at compile time is by the import statements. For example, if you want a mutable Set in Scala, you<br /><blockquote style="font-family: courier new;"><pre><br /> import scala.collection.mutable.Set<br /></pre></blockquote><br />This inherits from the same Set trait as the immutable version, so the logic where the class is used is clean. 
Although the C++ nothrow_t concept is interesting, Scala's approach appears to have a better separation of concerns, and results in uncluttered code.<br /><br />If we are so inclined, we can do something analogous with our using method. We could choose to import from one package where the implementation swallows Throwables emitted by dispose. Or, we could import from another where they are allowed to propagate. In other words, we can handle exceptions quite intelligently, and customize our behavior depending on context.<br /><br />Finally, let's consider whether we can avoid needing to nest using clauses, and manage the disposal of multiple resources more elegantly. This is possible, but there's one important subtlety that we have to worry about.<br /><blockquote style="font-family: courier new;"><pre><br />def using[T <% Disposable, U <% Disposable]<br />(resource: T, _resource2: => U)(block: (T,U) => Unit) {<br /> try {<br /> val resource2 = _resource2<br /> try {<br /> block(resource, resource2)<br /> }<br /> finally {<br /> resource2.dispose<br /> }<br /> }<br /> finally {<br /> resource.dispose<br /> }<br />}<br /></pre></blockquote><br />Note that the _resource2 argument is passed by name, and not by value. We don't actually access it until declaring the val resource2 inside the outer try block. This means that if the construction of resource2 fails, we will still call dispose on the other resource.<br /><br />Let's demonstrate this. Suppose our first resource object constructs okay, but the second one throws an exception in its constructor. 
This is standard behavior for a RAISIN class, which disallows partially constructed instances.<br /><blockquote style="font-family: courier new;"><pre><br /> def two_resources() = {<br /> using (new FileHandle("okay"), new FileHandle("bad")) {<br /> (first, second) =><br /> second.write(first.read)<br /> }<br /> }<br /></pre></blockquote><br />If that second FileHandle constructor fires before entering the using method, then we have a resource leak! The first FileHandle is never disposed. But, because we pass the second argument by name, the second constructor does not fire before entering the using method. We're essentially passing a pointer to the constructor into the using function, which calls it.<br /><br />Why pass just the second one by name and not the first one? Did we just get lucky? No. Scala evaluates its arguments from left to right.<br /><br />A consequence of this choice is that we cannot access the _resource2 argument more than once inside the using method. Note that it's accessed exactly once when defining the val resource2. Otherwise, the constructor would be called again and again inside the using method. That would be an even worse resource leak, and would probably malfunction.<br /><br />We've now shown that our C# emulation meets all but one of our goals. This is impressive because the Scala behavior is superior even to C# itself, for example with regard to limiting the scope of variables. The remaining goal is to demonstrate how our using construct can work with legacy classes such as java.io.File that do not extend Disposable. We'll take up this cause in the near future, after a detour into some decidedly non-standard C++. But the punchline is, we had the foresight to use view bounds and not upper bounds, so we're well prepared.<br /><br />In summary, we've shown how to emulate the C# using syntax in Scala, to enable RAISIN style programming. 
We were remarkably successful at bullet-proofing our resource management with surprisingly few lines of code. We handled many edge cases, offered flexibility, and achieved ambitious goals. Along the way, we encountered function literals, closures, pass by name, generics, view bounds, import statements, and (presently) implicits.<br /><br />This was a lovely exercise because so many different aspects of Scala had to come together in harmony. It's clear that API designers must master these features to produce high quality code, but even casual programmers would do well to learn them.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com2tag:blogger.com,1999:blog-3257506294674819641.post-91649728054439340302009-06-17T12:01:00.007-04:002009-06-17T12:01:00.876-04:00Scala Using RAISINLast time, we touched on RAISIN, and considered Java's inability to support this programming style to be an important deficiency of the language. We also promised to explore whether Scala could emulate the C# approach to deterministic destructors. We take up that challenge presently, and we're going to find that a wide variety of Scala features all come together to make this happen.<div><br /></div><div>Implementing RAISIN is a little tougher than our Ruby "unless" modifier, where the task was pretty narrow and well understood. 
So before we begin, let's capture the goals we should set for emulating -- and surpassing -- the C# "using" syntax inside Scala.</div><div><ul><li>Beautiful, readable code</li><li>Obliging the user to do very little</li><li>Handling multiple resources at once</li><li>Preventing stale objects from being accessed</li><li>Prefer immutable & avoid nulls</li><li>Intelligent exception handling</li><li>Flexible enough for arbitrary resources</li></ul></div><div><span class="Apple-style-span" style="color: rgb(51, 102, 255);"><span class="Apple-style-span" style="font-style: italic;">Beautiful, readable code</span></span></div><div><br /></div><div>This is always the prime directive. Suppose we had our <span class="Apple-style-span" style="font-style: italic;">FileHandle </span>class, and we have to get rid of its associated resource after we use it. We should tolerate nothing uglier than what we'd see in C#.<br /><blockquote style="font-family: courier new;"><pre><br /> // Scala wishful thinking<br /> //<br /> val handle = new FileHandle("myfile")<br /> using(handle) {<br /> // Either of the following methods might<br /> // throw, but that's okay.<br /> //<br /> handle.read<br /> handle.write(42)<br /> }<br /></pre></blockquote><br /><span class="Apple-style-span" style="color: rgb(51, 102, 255); "><span class="Apple-style-span" style="font-style: italic; ">Obliging the user to do very little</span></span></div><div><span class="Apple-style-span" style="color: rgb(51, 102, 255); font-style: italic;"><span class="Apple-style-span" style="color: rgb(0, 0, 0); font-style: normal; "><div><br /></div><div>We really want to avoid having to repeat all the try-finally scaffolding in the user's code, which Java would require. We also don't want the user to have to understand the details of how to free up the resources. 
Maybe something as simple as...</div><div><blockquote>import csharp._</blockquote></div></div><div>...should be sufficient to make the using syntax available to the programmer's code.<br /></div><div><br /></div><div><span class="Apple-style-span" style="color: rgb(51, 102, 255); font-style: italic; ">Handling multiple resources at once</span><br /></div><div><div><br /></div><div>Rather than nesting one using clause inside another, it would be nice to follow C#'s practice of allowing multiple resources inside one using statement. This also aligns with the functionality afforded by C++, in which we can put multiple objects on the stack inside the same block, illustrated below.<br /><blockquote style="font-family: courier new;"><pre><br />// C++<br />{<br /> FileHandle const h1 = // details omitted<br /> FileHandle const h2 = // details omitted<br /><br /> // Use h1 and h2 freely here. Even if the<br /> // construction of h2 failed, h1 still<br /> // gets released. That's important<br /> //<br />}<br /></pre></blockquote><br /><span class="Apple-style-span" style="color: rgb(51, 102, 255); "><span class="Apple-style-span" style="font-style: italic; ">Preventing stale objects from being accessed</span></span></div><div><br /></div><div>This is an opportunity for our Scala solution to shine. Reconsidering our first example above, we'd like the handle to have the smallest possible scope.<br /><blockquote style="font-family: courier new;"><pre><br /> val handle = new FileHandle("myfile")<br /> using(handle) {<br /> // Either of the following methods might<br /> // throw, but that's okay.<br /> //<br /> handle.read<br /> handle.write(42)<br /> }<br /><br />// It would be nice if we could somehow make the<br />// compiler prevent spurious accesses of the handle<br />// down here. 
We want to deny access to disposed<br />// objects.<br /></pre></blockquote><br /><span class="Apple-style-span" style="color: rgb(51, 102, 255); "><span class="Apple-style-span" style="font-style: italic; ">Prefer immutable & avoid nulls</span></span></div><div><br /></div><div>We'd like to use val rather than var wherever we can. This is analogous to using Java final when declaring variables. We'd also like to be assured that the resource is constructed correctly, and not null.</div><div><br /></div><div>These desires may compel us to put the initialization, meaning the resource acquisition, somehow inside the using clause where it can be managed well.</div><div><br /></div><div><span class="Apple-style-span" style="color: rgb(51, 102, 255); "><span class="Apple-style-span" style="font-style: italic; ">Intelligent exception handling</span></span></div><div><br /></div><div>It's a well-known coding practice in C++ to code destructors so that they do not emit exceptions. However, no such convention exists for common Java classes. For example, the <span class="Apple-style-span" style="font-style: italic;">java.io.FileInputStream.close</span> method throws <span class="Apple-style-span" style="font-style: italic;">java.io.IOException</span>. We need a way to handle such exceptions intelligently.</div><div><br /></div><div><span class="Apple-style-span" style="color: rgb(51, 102, 255); "><span class="Apple-style-span" style="font-style: italic; ">Flexible enough for arbitrary resources</span></span></div><div><br /></div><div>In C++, any class can have a meaningful destructor, so previously designed classes can be used in the RAISIN style. 
In C#, we're constrained to use only classes that inherit from the <span class="Apple-style-span" style="font-style: italic;">IDisposable </span>interface, and the cleanup has to be done in the dispose method.</div><div><br /></div><div>This means that ordinary classes like <span class="Apple-style-span" style="font-style: italic;">java.io.FileInputStream</span>, which have a close method instead of a dispose method, will pose some difficulties when we try to wrap them in a C#-like "using" clause. Yet, Scala is powerful, and it's a reasonable goal to overcome these limitations.</div><div><br /></div><div>With all these goals in mind, let's not try to bite off too much at once. Last time, our zeroth cut defined a <span class="Apple-style-span" style="font-style: italic;">Disposable </span>trait and a FileHandle that extends it. This time, we'll also want a using function that accepts a <span class="Apple-style-span" style="font-style: italic;">Disposable </span>object and a block of code to be executed.<br /><blockquote style="font-family: courier new;"><pre><br />// First cut...<br />package csharp<br /><br />object Using {<br /> def using[T <% Disposable](resource: T)(block: => Unit) {<br /> try {<br /> block<br /> }<br /> finally {<br /> resource.dispose<br /> }<br /> }<br />}<br /></pre></blockquote><br />There's a lot going on in that method, so let's tease it apart carefully. First, it's a parameterized function, where the resource argument must be of type <span class="Apple-style-span" style="font-style: italic;">T</span>. The <% notation is a view bound. It means that type <span class="Apple-style-span" style="font-style: italic;">T</span> must inherit from <span class="Apple-style-span" style="font-style: italic;">Disposable</span> or be transformable into <span class="Apple-style-span" style="font-style: italic;">Disposable</span> by an implicit.</div><div><br /></div><div>(It's not obvious yet why we need view bounds, or even an upper bound. 
This is just a little adumbration for how we're going to achieve some of our trickier goals, such as "preventing stale objects from being accessed," and "flexible enough for arbitrary resources." We won't get there in this post, but have patience.)</div><div><br /></div><div>Second, the using method has two argument lists, rather than a single list of comma-delimited arguments. Put another way, using is a curried function, as evidenced by two sets of parentheses instead of just one. This syntax allows the second argument to be a block of code in curly braces, rather than something inside using's parentheses.</div><div><br /></div><div>Third, note that the arrow notation implies that the block is passed by name, not by value. This means that the code won't actually execute until block is called inside the try clause of the using method. It does not execute before using is entered.</div></div><div><div><br /></div><div>Since our toy <span class="Apple-style-span" style="font-style: italic;">FileHandle</span> class (defined in a previous post) inherits from <span class="Apple-style-span" style="font-style: italic;">Disposable</span>, we can write the following.</div><br /><blockquote style="font-family: courier new;"><pre><br />import csharp.Using._<br /><br />object Main {<br /><br /> def simple_usage = {<br /> val handle = new FileHandle("simple")<br /> using(handle) {<br /> handle.read<br /> handle.write(42)<br /> }<br /> }<br /><br /> // details omitted<br /></pre></blockquote><br />That's not bad for a first cut. We've achieved our first two goals, but we still have a long way to go in future posts to make progress on the others.</div><div><br /></div><div>In summary, we've taken some steps towards implementing RAISIN in Scala, taking the C# using syntax as a model. Along the way, we've seen view bounds, curried functions, and pass-by-name. 
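For readers who want to run this first cut end to end, the pieces can be assembled into one self-contained sketch. The FileHandle internals below are invented stand-ins (the post omits them), and a plain upper bound stands in for the view bound, since nothing in the first cut needs the implicit view yet.

```scala
// Sketch only: FileHandle's internals are stand-ins, and the view bound
// is simplified to an upper bound (T <: Disposable) for this assembly.
trait Disposable {
  def dispose(): Unit
}

object Using {
  // Curried, with a by-name second parameter, so the call site
  // can pass a { ... } block that runs inside the try.
  def using[T <: Disposable](resource: T)(block: => Unit): Unit =
    try block
    finally resource.dispose()
}

class FileHandle(val name: String) extends Disposable {
  var disposed = false                       // observable only for the demo
  def dispose(): Unit = { disposed = true }
  def read: Int = 17                         // stand-in for real I/O
  def write(i: Int): Unit = ()               // stand-in for real I/O
}

object UsingDemo {
  import Using.using

  // Returns true when dispose fired even though the block threw.
  def disposeFiresOnThrow(): Boolean = {
    val handle = new FileHandle("simple")
    try using(handle) { handle.write(handle.read); sys.error("boom") }
    catch { case _: RuntimeException => () }
    handle.disposed
  }
}
```

The post's view bound (`T <% Disposable`) would additionally admit types merely convertible to Disposable via an implicit; that extra flexibility is what the later goals rely on.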
The latter two language features allow the user's code to be beautiful.</div>Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-69463258588574740302009-06-10T12:01:00.004-04:002009-06-10T12:01:00.609-04:00Software Development ProcessA process is the collection of practices followed in an organization. It identifies the hats worn by people, and the artifacts they produce and consume. It names the responsibilities that the workers fulfill, and the workflows through which their artifacts pass. Also, a process likely includes at least some of the tools used, because automation is a big part of getting things done.<div><br /></div><div>Examples of software development processes include RUP (Rational Unified Process), Scrum, and Waterfall. To make a coding analogy, one might argue that a certain project instantiates a development process just as an object instantiates a class.</div><div><br /></div><div>A process not only reflects the activities of the participants, it also guides their efforts. However, keeping with the coding analogy, the humans are the virtual machine in which the process instance runs. Therefore, people are the heart of any process, and processes are always malleable. Even if a process purports to be rigid, it will not likely be followed for very long.</div><div><br /></div><div>Processes can be documented, but a process description is no more a real process than a virus is a living cell.</div><div><br /></div><div>The metaphor is apt. Practices are captured in memes. For example, champions of test-driven development self-identify as "test infected." Very few developers who have not actually tried TDD and seen that it changes the way code gets designed could have gleaned this effect just from reading a book.</div><div><br /></div><div>A good process will reproduce, evolve, and spread its success far and wide. 
But just as some organisms can't live in some environments, the ecosystem has to be receptive to the practices embraced in a process for them to take root. There are no "best practices." Context is everything.</div><div><br /></div><div>Successful processes arm decision makers with timely information, and offer guidance for resolving problems. As a corollary, the more empowered the workers are, the more freely available such information must be, because there are more decision makers shaping progress. The contrapositive also follows. Without transparency, success rests on the talents of just a few privileged individuals.</div><div><br /></div><div>Useful processes permit the measurement of and influence over:</div><div><ul><li>Quality</li><li>Costs</li><li>Progress</li><li>Growth</li></ul></div><div>By Quality, of course we mean customer satisfaction. What's not quite so obvious is that many people in the organization wear the customer hat for various artifacts and services during development.</div><div><br /></div><div>By Costs, we mean the financial expenditures for salary, tools, training, hardware, and so on. (This is sometimes more difficult than it would appear, because a single software effort could have multiple funders, each interested in different features being developed.)</div><div><br /></div><div>By Progress, we mean the maturing of the artifacts, such as code, documentation, and models, into a consumable or sellable state. Often, the careful monitoring of progress is especially important to certain stakeholders.</div><div><br /></div><div>By Growth, we mean the professional growth of the human beings who are developing goods. 
This includes skills improvement, job satisfaction, value to the organization, and contributions to the profession and the art.</div><div><br /></div>Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-43294595735618395812009-06-03T12:01:00.005-04:002009-06-03T12:01:00.842-04:00Hey Scala, Finalize ThisYears ago, when I moved from C++ to Java, I expected to miss a few things. My daily workhorses like templates & the STL were not available in the new environment. When it came to API design, which is really mini-language design, I could no longer rely on operator overloading. Even little efficiencies like inline functions were denied me.<br /><br />But, it turns out, I didn't really long for any of those things as much as I anticipated. What I really missed, I mean what I felt no programmer could live without, was the deterministic destructor.<br /><br />Bjarne Stroustrup champions RAISIN, Resource Acquisition IS INitialization. The idea is to represent acquired resources as class instances on the stack. So when those objects fall out of scope, a destructor fires and frees the associated resource.<br /><br />A nice advantage of RAISIN is that it doesn't matter how you leave the scope. The code could simply return, or it could emit an exception. The burden falls on the class designer to remember to free up the resources. This is vastly superior to obliging every user of the class to remember to free the resources, in every place where it's used.<br /><br />Note that we're not talking about manual memory management here. Resources include file handles and mutexes and database connections and whatnot. Acquisition occurs when the instance is initialized. 
Let's see an example.<br /><blockquote style="font-family: courier new;"><pre><br />//C++<br />void raisin_example(std::string name)<br />{<br /> FileHandle const handle = FileHandle(name);<br /><br /> // use handle here, maybe some code throws<br /> // an exception, but that's okay<br /> //<br />}<br /></pre></blockquote><br />All the cleanup work is done once and for all in <span style="font-style: italic;">FileHandle::~FileHandle()</span>, and that always fires when the handle object falls out of scope. The simplicity and safety of the above contrasts sharply with Java.<br /><blockquote style="font-family: courier new;"><pre><br />// Java<br />void hardly_raisin(String name)<br />{<br /> FileHandle handle = null;<br /> try<br /> {<br /> handle = new FileHandle(name);<br /><br /> // use handle here, maybe some code throws<br /> // an exception, but that's okay<br /> //<br /> }<br /> finally<br /> {<br /> if (null != handle) handle.close();<br /> }<br />}<br /></pre></blockquote><br />Wow. That's a lot of scaffolding for something that every user has to remember to get right every single time. There's a lot of opportunity for things to go wrong here. We can't even make the handle const (or final), because it would then be out of scope of the finally clause. Java even gets worse if there are several resources, and we have to release them in the reverse order in which they were acquired.<br /><br />Coming to Java required a profound shift in programming style. The lack of support for a good coding practice like RAISIN is one of the serious deficiencies of the language. I used to marvel at how much Java code I had written since leaving C++, as if that was some kind of proof that RAISIN really wasn't so important. But now that I program in Scala, I shudder to think about how many lines of that Java code were just scaffolding.<br /><br />Java's success despite this weakness says something about how important Java's strengths are. 
In other words, in the marketplace, garbage collection apparently trumps RAISIN. Thinking like a scientist, it's fun to speculate about what language features are more valuable than others, using trial by market as a grand laboratory.<br /><br />The C# designers addressed this Java deficiency, after a fashion, by creating <span style="font-style: italic;">Disposable </span>objects, and a convenient language syntax for cleaning them up.<br /><blockquote style="font-family: courier new;"><pre><br />// C#<br />void raisin_example(String name)<br />{<br /> FileHandle handle = new FileHandle(name);<br /> using(handle)<br /> {<br /><br /> // use handle here, maybe some code throws<br /> // an exception, but that's okay<br /> //<br /> }<br />}<br /></pre></blockquote><br />That's not too shabby. All we have to do is oblige the <span style="font-style: italic;">FileHandle</span> class to inherit from the <span style="font-style: italic;">IDisposable</span> interface and implement a dispose method, which executes at the end of the using clause. The clean up code that the C++ programmer would have to put into a destructor goes into the <span style="font-style: italic;">dispose</span> method instead.<br /><br />Like Java, Scala also lacks deterministic destructors. However, Scala is powerful enough to emulate the C# "using" syntax. Let's make a zeroth cut at this in Scala. We'll define a <span style="font-style: italic;">Disposable</span> trait, which we'll use in subsequent posts.<br /><blockquote style="font-family: courier new;"><pre><br />// Disposable.scala<br />package csharp<br /><br />trait Disposable {<br /> def dispose(): Unit<br />}<br /></pre></blockquote><br />Finally, let's contrive a <span style="font-style: italic;">FileHandle</span> class that extends <span style="font-style: italic;">csharp.Disposable</span>. 
We'll use this in subsequent posts, too.<br /><blockquote style="font-family: courier new;"><pre><br />// FileHandle.scala<br />package raisin<br /><br />class FileHandle(name: String) extends csharp.Disposable {<br /><br /> // constructor acquires resource here<br /> //<br /><br /> override def dispose: Unit = {<br /> // release resource here<br /> //<br /> }<br /><br /> // Nice things you can do with a FileHandle<br /> //<br /> def read(): Int = { /* details omitted */ }<br /> def write(i: Int): Unit = { /* details omitted */ }<br />}<br /></pre></blockquote><br />Last time, we showed how the Ruby unless modifier could be implemented in Scala. Next time, we'll take some steps towards bringing RAISIN to Scala.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-52020219557053132192009-05-27T12:01:00.008-04:002009-05-27T12:01:00.202-04:00Ruby Unless ScalaNow that I'm programming primarily in Scala, I find myself missing a couple of cool tricks that Ruby offers. For example, it's neat to be able to put "unless" <a href="http://beust.com/weblog/archives/000387.html">modifiers</a> at the end of a line. Even if you've never seen Ruby (or Perl) before, it's easy to guess what the following code does.<br /><br /><span style="font-family:courier new;"></span><blockquote><span style="font-family:courier new;"># Ruby</span><br /><span style="font-family:courier new;">print total unless total.zero?</span></blockquote><br />It should be no surprise that if the <span style="font-style: italic;">total </span>is zero, then nothing happens. But if the <span style="font-style: italic;">total </span>is not zero, then it's printed.<br /><br />To my knowledge, Scala has no such concept. However, with all the power Scala offers for creating internal <a href="http://en.wikipedia.org/wiki/Domain-specific_language">DSLs</a>, it might be fun to try to emulate this syntax. 
This post is more about demonstrating what can be done with Scala than it is about championing the use of unless modifiers in one's code.<br /><br />My first attempt failed. I thought I would create a <span style="font-style: italic;">RichBoolean</span>, analogous to a <span style="font-style: italic;">RichInt</span>, so that I could effectively add some methods onto the <span style="font-style: italic;">Boolean </span>class. My new class needed an "unless" method, but it had to be right-associative. So, borrowing the trick used in the <span style="font-style: italic;">cons </span>operator, it would have to end in a colon.<br /><blockquote style="font-family: courier new;"><pre><br />class RichBoolean(b: Boolean) {<br /> def unless_:(block: => Unit) {<br /> if (!b) block<br /> }<br />}<br /><br />implicit def booleanToRichBoolean(b: Boolean) = {<br /> new RichBoolean(b)<br />}<br /></pre></blockquote><br />There's a lot going on up there, so let's try to tease it apart and explain it. The <span style="font-style: italic;">RichBoolean </span>class is basically a wrapper around the <span style="font-style: italic;">Boolean </span>class. We've effectively added an "<span style="font-style: italic;">unless_:</span>" method to that class. The implicit function tells the compiler to convert a <span style="font-style: italic;">Boolean </span>into a <span style="font-style: italic;">RichBoolean</span> whenever it appears that someone is trying to call an "<span style="font-style: italic;">unless_:</span>" method on it.<br /><br />This is the standard Scala way to add methods to a class. Some languages are more open, and allow the addition of methods directly after the class is defined. Scala offers the same freedom, but with better control. Unless you're importing the <span style="font-style: italic;">booleanToRichBoolean </span>function, you don't get the automatic conversion. I know that some folks are nervous about implicits. 
But because of this control, I find them safer than open classes.<br /><br />Another noteworthy feature of the above is the arrow symbol. This implies that the block is being passed into the "<span style="font-style: italic;">unless_:</span>" method by name, and not by value. In other words, we hope that the block doesn't get evaluated before "<span style="font-style: italic;">unless_:</span>" executes, but only inside of that method when <span style="font-style: italic;">b</span> is false.<br /><br />Passing by name is a remarkably powerful language feature. To the imperative programmer, it might seem like passing by reference in C++ or Fortran, but it's actually more subtle. We're not passing the address of the result of some calculation. We're actually passing a pointer to the code that computes the result. Methods that accept pass-by-name have the option to skip the calculation entirely, when that makes sense. Consider how efficient that can make a logging API!<br /><br />Finally, Scala forces us to include the underscore in the "<span style="font-style: italic;">unless_:</span>" method name, so that there's no ambiguity about whether the colon symbol is part of the lexeme. There's an important lesson here that's more widely applicable than this example. Never end lexemes with underscores. If they happen to wind up next to a colon, they may run into trouble.<br /><br />I tried to test out this code with a little function. I could pass either true or false into it and see what happened. It compiled fine. It just didn't do what I expected.<br /><blockquote style="font-family: courier new;"><pre><br />def demonstrate_ruby_syntax(flag: Boolean) = {<br /> println("flag is " + flag) unless_: flag<br />}<br /></pre></blockquote><br />Sure enough, <span style="font-style: italic;">flag </span>gets promoted to a <span style="font-style: italic;">RichBoolean</span>, and the infix "<span style="font-style: italic;">unless_:</span>" fires. 
But no matter whether I pass in true or false, the <span style="font-style: italic;">println </span>always executes. This is a little surprising because we passed block by name and not by value.<br /><br />I'm at a bit of a loss to explain this. A little instrumenting showed that the block containing the <span style="font-style: italic;">println </span>statement is executing outside the "<span style="font-style: italic;">unless_:</span>" method, and not inside it.<br /><br />By way of comparison, suppose we used the cons operator (::) to construct a <span style="font-style: italic;">List[Int]</span> as follows...<br /><blockquote style="font-family: courier new;"><pre><br /> val list = 1 / 0 :: Nil<br /></pre></blockquote><br />This blows up because of the divide by zero, but the stack trace reveals that the exception occurs before getting into the cons method. However, the cons method scaladocs say that the argument is passed by value, not by name, so we'd expect exactly that here.<br /><br />So, unable to make my <span style="font-style: italic;">RichBoolean </span>idea work, I next tried to put a wrapper class around the block itself. This had the advantage of letting me get rid of the colon cruft on the unless method name. I also don't think it's any more dangerous, despite the implicit, because the <span style="font-style: italic;">unless </span>method signature admits only a <span style="font-style: italic;">Boolean</span>.<br /><blockquote style="font-family: courier new;"><pre><br />package ruby<br /><br />object Unless {<br /><br /> class UnlessClass(block: => Unit) {<br /> def unless(b: Boolean) = {<br /> if (!b) block<br /> }<br /> }<br /><br /> implicit def<br /> unitToUnlessClass(block: => Unit): UnlessClass = {<br /> new UnlessClass(block)<br /> }<br />}<br /></pre></blockquote><br />Happily, this approach worked. It also demonstrates a neat fact. 
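Before unpacking that fact, here is a small self-contained check, assembled from the snippets above, that the wrapper behaves as claimed. The explicit `new UnlessClass(...)` calls exercise exactly the path the implicit conversion takes, without relying on the conversion being triggered; the counter confirms the by-name block only runs when unless sees false.

```scala
import scala.language.implicitConversions

object Unless {
  // The block is a by-name class parameter: it is not evaluated at
  // construction time, only if unless is called with a false argument.
  class UnlessClass(block: => Unit) {
    def unless(b: Boolean): Unit = if (!b) block
  }

  implicit def unitToUnlessClass(block: => Unit): UnlessClass =
    new UnlessClass(block)
}

object UnlessDemo {
  import Unless._
  var count = 0   // counts how many times the block actually ran

  def run(): Int = {
    new UnlessClass({ count += 1 }).unless(true)   // suppressed: block never evaluates
    new UnlessClass({ count += 1 }).unless(false)  // fires: block evaluates once
    count
  }
}
```

With the implicit in scope, the same calls can be written in the post's infix style, `{ count += 1 } unless flag`, at least in the Scala of the time.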
The implicit function accepts block by name, and the <span style="font-style: italic;">UnlessClass </span>constructor does too. Yet, block doesn't execute until the unless method is called with a false argument. This means that the Scala compiler is smart enough to let the by-name cascade through (at least) two calls.<br /><br />All I have to do now is...<br /><blockquote style="font-family: courier new;"><pre><br />import ruby.Unless._<br /><br />// details omitted...<br /><br /> def demonstrate_ruby_syntax(flag: Boolean) = {<br /> println("flag is " + flag) unless flag<br /> }<br /></pre></blockquote><br />... and my Ruby-esque unless modifier syntax works as expected. The printing only occurs when the flag is false.<br /><br />There's one more enhancement we can consider. Our code only compiled because <span style="font-style: italic;">println </span>returns <span style="font-style: italic;">Unit</span>. But what if we had some other routine that returned some other type? In such a case, we're relying on the side effects, and not the computational result of the function. This is an imperative rather than functional style, but since Scala lives in both worlds, it would still be nice to be able to use the unless modifier syntax. Consider the following contrived example.<br /><blockquote style="font-family: courier new;"><pre><br />def myfunc(flag: Boolean): Int = {<br /> println("myfunc flag is " + flag)<br /> 42<br />}<br /></pre></blockquote><br />Happily, generics can come to our rescue. 
By parameterizing our <span style="font-style: italic;">UnlessClass</span>, we can implicitly convert to it from arbitrary types.<br /><blockquote style="font-family: courier new;"><pre><br />class UnlessClass[T](block: => T) {<br /> def unless(b: Boolean): Unit = {<br /> if (!b) block<br /> }<br />}<br /><br />implicit def<br />toUnlessClass[T](block: => T): UnlessClass[T] = {<br /> new UnlessClass[T](block)<br />}<br /></pre></blockquote><br />Note that our new unless method still returns <span style="font-style: italic;">Unit </span>because we only use this construct where the return value of methods like <span style="font-style: italic;">myfunc </span>are deliberately discarded.<br /><blockquote style="font-family: courier new;"><pre><br /> def demonstrate_ruby_syntax(flag: Boolean) = {<br /> myfunc(flag) unless flag<br /> }<br /></pre></blockquote><br />In summary, by emulating the Ruby unless modifier, we've demonstrated a few of the Scala language features that allow rich DSLs to be created. Along the way we learned about right associativity, implicits, passing by name, and generics.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-37944604271357077322009-05-20T12:01:00.003-04:002009-05-20T12:01:00.842-04:00Overriding Scala def With valLast time, we created a little toy class hierarchy to demonstrate Scala injection. We also illustrated how Scala's powerful type system can keep us out of trouble. This time, we're going to explore some design tradeoffs that emerge from choosing Scala <span style="font-style: italic;">def </span>or Scala <span style="font-style: italic;">val</span>.<br /><br />To review, we created an abstract base class that stores masses, and always reports their values in kilograms. 
We extended that with immutable classes that are initialized with values in various units.<br /><blockquote style="font-family: courier new;"><pre><br />abstract class Mass {<br /> def kilograms: Double<br />}<br /><br />class Kilograms(kg: Double) extends Mass {<br /> def kilograms = kg<br />}<br /><br />class Grams(grams: Double) extends Mass {<br /> def kilograms = grams / 1000.0<br />}<br /></pre></blockquote><br />Suppose that the calculation to convert grams into kilograms was difficult and lengthy. Then the <span style="font-style: italic;">Grams </span>implementation of the <span style="font-style: italic;">kilograms </span>method might get us into trouble, because we'd be repeating that work needlessly every time it was called.<br /><blockquote style="font-family: courier new;"><pre><br />class Grams(grams: Double) extends Mass {<br /> def kilograms: Double = {<br /> Thread.sleep(5000) // pretend to think hard<br /> grams / 1000.0<br /> }<br />}<br /></pre></blockquote><br />The above class constructs instantly, but every time somebody calls the <span style="font-style: italic;">kilograms </span>method on an instance, it takes a long time. This is sad because <span style="font-style: italic;">Grams </span>is immutable. We'd like some way to save the output of the calculation instead of the input.<br /><br />Let's use the <span style="font-style: italic;">javap</span> tool to peer into what Scala is doing under the hood. The constructor argument grams is called a class parameter in Scala-ese. Class parameters used outside of constructors, as <span style="font-style: italic;">grams </span>is used in the <span style="font-style: italic;">kilograms </span>method, become full-fledged private fields of the class. 
Consider the following (edited) snippet.<br /><blockquote style="font-family: courier new;"><pre><br />$ javap -private Grams<br />Compiled from "Grams.scala"<br />public class Grams extends Mass<br /> private final double grams;<br /> public Grams(double);<br /> public double kilograms();<br /></pre></blockquote><br />Amazingly, Scala allows us to override the abstract "<span class="Apple-style-span" style="font-style: italic;">def kilograms</span>" in <span class="Apple-style-span" style="font-style: italic;">Mass</span> with a "<span class="Apple-style-span" style="font-style: italic;">val kilograms</span>" in <span class="Apple-style-span" style="font-style: italic;">Grams</span>. This is a lovely language feature, but it's worth spending a little energy to understand what's going on under the hood.<br /><br />Let's change our kilograms <span style="font-style: italic;">def</span> into a <span style="font-style: italic;">val</span> in our derived classes. The following class is slow to construct, but each call to <span style="font-style: italic;">kilograms</span> completes instantly.<br /><blockquote style="font-family: courier new;"><pre><br />class Grams(grams: Double) extends Mass {<br /> val kilograms: Double = {<br /> Thread.sleep(5000) // pretend to think hard<br /> grams / 1000.0<br /> }<br />}<br /></pre></blockquote><br />Take a moment to digest the tradeoff. The first version is small in memory, containing only one double field, the <span style="font-style: italic;">grams </span>class parameter. It constructs quickly, but each call to <span style="font-style: italic;">kilograms </span>takes a long time. The second version constructs slowly, but all calls to <span style="font-style: italic;">kilograms </span>are quick. 
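The tradeoff can be made concrete by counting evaluations instead of sleeping. In this sketch a counter stands in for the post's five-second pause, and the class names are invented for the demonstration:

```scala
object MassDemo {
  var conversions = 0   // incremented whenever the "expensive" work runs

  abstract class Mass {
    def kilograms: Double
  }

  // First design: cheap construction, recomputes on every call.
  class GramsDef(grams: Double) extends Mass {
    def kilograms: Double = { conversions += 1; grams / 1000.0 }
  }

  // Second design: pays the cost once, during construction.
  class GramsVal(grams: Double) extends Mass {
    val kilograms: Double = { conversions += 1; grams / 1000.0 }
  }

  // Returns (work done by the def version, work done by the val version)
  // after constructing each and calling kilograms twice.
  def compare(): (Int, Int) = {
    conversions = 0
    val d = new GramsDef(500)
    d.kilograms
    d.kilograms
    val defCount = conversions   // 2: recomputed on each call

    conversions = 0
    val v = new GramsVal(500)
    v.kilograms
    v.kilograms
    val valCount = conversions   // 1: computed once, at construction

    (defCount, valCount)
  }
}
```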
We would prefer the first design if we expect the users of the class to call <span class="Apple-style-span" style="font-style: italic;">kilograms </span>no more than once, and the second design if we expect the users to call <span class="Apple-style-span" style="font-style: italic;">kilograms </span>multiple times on each <span class="Apple-style-span" style="font-style: italic;">Grams</span> instance.<br /><br />In the second design, the grams class parameter appears to be used nowhere but in the constructor itself when the "<span class="Apple-style-span" style="font-style: italic;">val kilograms</span>" is defined. So, one might expect that it will not become a real field in the <span class="Apple-style-span" style="font-style: italic;">Grams </span>class. Trusty <span class="Apple-style-span" style="font-style: italic;">javap</span> confirms this suspicion. Consider the following (again edited) snippet.<br /><blockquote style="font-family: courier new;"><pre><br />$ javap -private Grams<br />Compiled from "Grams.scala"<br />public class Grams extends Mass<br /> private final double kilograms;<br /> public Grams(double);<br /> public double kilograms();<br /></pre></blockquote><br />Note that under the hood, despite being declared a <span class="Apple-style-span" style="font-style: italic;">val </span>in the Scala source code, <span class="Apple-style-span" style="font-style: italic;">kilograms </span>is also a method. A moment's reflection (no pun intended) will tell us that it has to be a method. <span class="Apple-style-span" style="font-style: italic;"> Grams </span>is a concrete class that extends an abstract class with a pure virtual <span class="Apple-style-span" style="font-style: italic;">kilograms </span>method. So even though the Scala source hides it, <span class="Apple-style-span" style="font-style: italic;">kilograms </span>is still a method of <span class="Apple-style-span" style="font-style: italic;">Grams</span>.<br /><br />What is that public <span class="Apple-style-span" style="font-style: italic;">kilograms </span>method up to? Again we appeal to <span class="Apple-style-span" style="font-style: italic;">javap</span>, and learn that it's doing nothing except returning the double stored in the private <span class="Apple-style-span" style="font-style: italic;">kilograms </span>field. 
Just as we might have expected.<br /><blockquote style="font-family: courier new;"><pre><br />public double kilograms();<br /> Code:<br /> Stack=2, Locals=1, Args_size=1<br /> 0: aload_0<br /> 1: getfield #30; // Field kilograms:D<br /> 4: dreturn<br /></pre></blockquote><br />The above is much shorter than the previous version, which performed the expensive calculation. Again, we conclude that the criterion for preferring one design over the other rests on the expected usage patterns of our class, as explored above.<br /><br />We should also ask ourselves whether it's possible to delay the expensive calculation, possibly indefinitely, in case it's never needed. This third design would represent the classic programming tradeoff between space and time, and we'll take it up in a later post.<br /><br />In summary, we've seen that it's possible to override a Scala <span style="font-style: italic;">def </span>with a Scala <span style="font-style: italic;">val</span>. Under the hood, the override is still implemented by a method. The <span style="font-style: italic;">javap</span> tool is very useful to help us figure out what's going on, and one would do well to understand the design tradeoffs of each approach. Scala's marriage of object oriented programming with functional programming is made in heaven. We can use inheritance and exploit immutability, enjoying the flexibility to make considered design choices.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com1tag:blogger.com,1999:blog-3257506294674819641.post-38854981581925279222009-05-13T12:01:00.003-04:002009-05-13T12:01:01.318-04:00Sweet Scala InjectionBefore getting back to non-final finals, let's consider a fun diversion. The second most famous equation in Physics is Newton's second law of motion, F = ma. 
When you apply a force F to an object of mass m, it accelerates at rate a.<br /><br />Of course, your arithmetic only gives you the right answer if you're consistent in the measurement system you pick. There are two major systems of units in use. One is the metric system or SI (Système International), formerly called the mks system. The letters stand for meter, kilogram, and second, which are the principal units used to measure length, mass, and time.<br /><br />The other main system in use is also the metric system. (Gotcha.) It's called the cgs system, whose letters stand for centimeter, gram, and second. You have to take care to keep your units straight to use formulas like F = ma. The units for force are named newtons in the mks system and dynes in the cgs system. But if you multiply a gram times an acceleration recorded in meters per second per second, you'll get neither a newton nor a dyne.<br /><br />The loss of the <a href="http://mars.jpl.nasa.gov/msp98/news/mco990930.html">Mars Climate Orbiter</a> is a painful demonstration that units really matter.<br /><br />Since I like to blog about how thinking like a scientist makes me a better coder, I'll mention that it's unnatural to think, "oh, this book weighs two." Such a sentence might be grammatically correct, but without specifying the units, it's meaningless.<br /><br />Scala has an especially thoughtful type system, and we can press it into service to keep our units straight when we do calculations. In this (and the next) post, we'll create a toy program to illustrate one or two Scala goodies.<br /><br />Kilograms and grams both measure mass. It's not too much of a stretch to use the "is-a" relationship in an object oriented language to capture this notion. 
In what follows, <font style="font-style: italic;">Kilograms </font>and <font style="font-style: italic;">Grams </font>inherit from <font style="font-style: italic;">Mass</font>.<br /><blockquote style="font-family: courier new;"><pre><br />abstract class Mass {<br /> def kilograms: Double<br />}<br /><br />class Kilograms(kg: Double) extends Mass {<br /> def kilograms = kg<br />}<br /><br />class Grams(grams: Double) extends Mass {<br /> def kilograms = grams / 1000.0<br />}<br /></pre></blockquote><br />Our base class has a kilograms method that returns the amount of mass in the mks units. All our calculations will be done in mks units, but the programmer is free to initialize a mass variable with either kilograms or grams.<br /><br />Now let's construct a <font style="font-style: italic;">Force</font> class. In a full-fledged example, we'd probably make it an abstract class extended by Newtons and Dynes. But we don't need such a complete solution here to demonstrate the ideas. Give the class an <font style="font-style: italic;">accelerates </font>method, which tells how much the given force in newtons will accelerate a specified mass.<br /><blockquote style="font-family: courier new;"><pre><br />class Force(newtons: Double) {<br /> def accelerates(mass: Mass) =<br /> (newtons / mass.kilograms) + " meters per sec^2"<br />}<br /></pre></blockquote><br />Note that the <font style="font-style: italic;">accelerates </font>method doesn't care whether it's passed a value in kilograms or in grams. All it's demanding is a mass, and since that offers a method to take us into mks-land, we can assuredly report our acceleration in meters per second per second.<br /><br />Now, let's define a force of half a newton, and run a little program to see how much this force will accelerate a couple of masses. 
In each case below, there's no ambiguity about whether each mass is expressed in kilograms or grams, because the units are explicitly specified.<br /><blockquote style="font-family: courier new;"><pre><br />object MyApp extends Application {<br /> val force = new Force(0.5)<br /> println(force accelerates (new Kilograms(4.0)))<br /> println(force accelerates (new Grams(100)))<br /> //<br /> // "0.125 meters per sec^2"<br /> // "5.0 meters per sec^2"<br />}<br /></pre></blockquote><br />The parentheses around the "new Kilograms(4.0)" are actually redundant, but that might surprise a Java programmer. Scala also lets us omit the dot between force and accelerates, which arguably improves readability.<br /><br />So, the above works, but specifying "new Kilograms" everywhere I need to define a mass is a hassle. More importantly, it hurts readability, because there is no "new" anywhere in my mental model of the F = ma equation.<br /><br />Fortunately, Scala offers <font style="font-weight: bold;">injections</font>, which can pretty up the source code. In C++, I can construct an instance on the stack without calling new. Although all instances in Scala live on the heap, I find the syntax reminiscent of C++ constructors.<br /><br />We want to be able to write "Kilograms(4.0)" instead of "new Kilograms(4.0)" when we use our concrete Mass classes. To do this, create a Scala companion object of the same name as the class, and give it an <font style="font-style: italic;">apply</font> method.<br /><blockquote style="font-family: courier new;"><pre><br />object Kilograms {<br /> def apply(kg: Double) = new Kilograms(kg)<br />}<br /><br />object Grams {<br /> def apply(grams: Double) = new Grams(grams)<br />}<br /></pre></blockquote><br />These functions are called injections. Basically, they are factory methods on the companion objects, but we don't need to call apply explicitly. 
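In other words, the sugared call and the explicit call are the same call; the compiler rewrites one into the other. A tiny sketch:

```scala
class Kilograms(kg: Double) {
  def kilograms: Double = kg
}

object Kilograms {
  def apply(kg: Double) = new Kilograms(kg) // the injection
}

val explicit = Kilograms.apply(4.0) // what the compiler sees
val sugared  = Kilograms(4.0)       // what we get to write
```

Both lines construct equivalent instances; only the surface syntax differs.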
This is the same syntactic sugar that allows us to write "List(1, 2, 3)" instead of "new List(1, 2, 3)". It pretties up our code nicely.<br /><blockquote style="font-family: courier new;"><pre><br /> println(force accelerates Kilograms(4.0))<br /> println(force accelerates Grams(100))<br /></pre></blockquote><br />Note that we have made a tradeoff for this sweetness. We had to write more code (the injections) when defining our classes, so we could make life easier on the users of the classes. However, this is almost always the way to go. Readability is important.<br /><br />Readability is also the reason that the accelerates method takes a Mass instance and not a plain Double. The extra word "Kilograms" or "Grams" doesn't help the computer, but it does help the human.<br /><br />(However, the astute reader will have noticed that the kilograms method of the <font style="font-style: italic;">Grams </font>class is inefficient. It performs a double precision floating point calculation every time it is called, even though the instance itself is immutable. If only there were a way to save the result of the calculation instead of the inputs, then we could run faster without worsening our memory footprint. Contemplating this is a topic for another day.)<br /><br />In conclusion, tastefully applied Scala injections enhance readability. And they're more digestible than Martian soil coming towards you at a rate of, uhm, really fast.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-41779502784627246952009-05-06T21:30:00.004-04:002009-05-10T07:53:52.559-04:00And That's Final, Not!This post is about C++ and Java, but it really offers some necessary background material to explore an interesting issue facing Scala. 
What follows is hopefully widely known by C++ and Java veterans, but it's still worth reviewing here so that we're all on the same page when we talk about Scala in the near future.<br /><br />C++ fans are often encouraged not to use <i>#define</i>s for their constants, in part because the preprocessor has no notion of types. For example, <a href="http://en.wikipedia.org/wiki/Scott_Meyers">Scott Meyers</a> champions this idea. Instead of writing...<br /><blockquote style="font-family: courier new;"><pre>#define PI 3.14159</pre></blockquote><br />...which performs a simple textual substitution everywhere in the source file, the following is usually preferred:<br /><blockquote style="font-family: courier new;"><pre>const double PI = 3.14159;</pre></blockquote><br />The latter alternative helps the compiler supply more meaningful error messages. If the preprocessor were used instead, the compiler would never have heard of the lexeme "PI", and couldn't include it in any error messages.<br /><br />We don't have a preprocessor in Java. We also lack a usable const keyword. Instead we use <i>final</i> to describe variables whose values will not change. Unfortunately, just as <i>static</i> has multiple meanings in C++, <i>final</i> has multiple meanings in Java.<br /><br />In C++, methods are non-virtual by default, and must be given a special keyword, <i>virtual</i>, to denote that they are polymorphic. The Java philosophy is different. In Java, methods are virtual by default, and must be given a special keyword, <i>final</i>, to denote that they cannot be overridden. So this keyword pulls double duty in Java: for methods <i>final</i> means non-virtual, and for fields it implies constant.<br /><br />Another difference with C++ is that we don't have standalone variables in Java. We put them inside a class as below. 
In order to explore the issue at the heart of this blog entry, we deliberately do not make the field below <i>static</i>.<br /><blockquote style="font-family: courier new;"><pre><br />public class MyClass extends YourClass<br />{<br /> public final double PI = 3.14159;<br /> //<br /> //... details omitted<br />}<br /></pre></blockquote><br />A nearly equivalent way to define PI would be in a constructor. It's noteworthy that the final fields of a class can only be defined where they are declared, or in a constructor. A "set" method to change a final field would not compile.<br /><blockquote style="font-family: courier new;"><pre><br />public class MyClass extends YourClass<br />{<br /> public final double PI;<br /><br /> public MyClass()<br /> {<br /> PI = 3.14159;<br /> }<br /> //<br /> //... details omitted<br />}<br /></pre></blockquote><br />At first glance, the two ways of defining the final PI in Java appear equivalent. But they are not. In fact, they are different in a crucial way that we'll explore in a subsequent post. Programmers that don't understand when final doesn't really mean final risk writing programs with undesired behavior.<br /><br /><h4>In case you're on an interview...</h4><br />A standard interview question is to ask a candidate to contrast inheritance in C++ and Java. The expected answer includes something like, "Well, C++ has multiple inheritance and Java doesn't."<br /><br />But there's another difference, and folks who make the following observation display a valuable insight into the differences between the languages. "Well, I can truly call a virtual function from a Java constructor, but I can only appear to call a virtual function from a C++ constructor."<br /><br />Let's digest this statement. If I try to call a virtual function from a C++ base class constructor, I'm only going to get the base class's version, not the derived class's version. 
In other words, it's forbidden for a base class constructor to peer down into the code of a class that inherits from it.<br /><blockquote style="font-family: courier new;"><pre><br />// C++<br />class Base<br />{<br />public:<br /> virtual void f();<br /> Base();<br />};<br /><br />void Base::f() { cout << "Base" << endl; }<br />Base::Base() { f(); } // runs Base::f, never Derived::f<br /><br />class Derived : public Base<br />{<br />public:<br /> virtual void f() { cout << "Derived" << endl; }<br />};</pre></blockquote><br />The rules of C++ deny the implementation <i>Derived::f</i> from being executed within <i>Base::Base</i>. In other words, <i>f</i> does not behave as a virtual function when called within a constructor.<br /><br /><h4>But in Java...</h4><br />Such behavior contrasts sharply with Java. In Java, the derived class's implementation does get executed. Consider the ostensibly equivalent program below.<br /><blockquote style="font-family: courier new;"><pre><br />// Java<br />public class Base<br />{<br /> public void f() { System.out.println("Base"); }<br /> public Base() { f(); }<br />}<br /><br />public class Derived extends Base<br />{<br /> @Override public void f()<br /> {<br /> System.out.println("Derived");<br /> }<br />}<br /><br />public class Main<br />{<br /> public static void main(String[] args)<br /> {<br /> new Derived();<br /> //<br /> // "Derived" gets printed, not "Base"<br /> }<br />}</pre></blockquote><br />Java's approach might seem to be an advantage, but it comes with a hefty price. If the <i>Derived</i> class has a constructor, it fires after the base class constructor. That means that the <i>f</i> method of the derived class can execute before the derived class constructor body has run.<br /><br />Reread that and let it sink in. It implies that if the derived class constructor has any initializations to perform or invariants to enforce before <i>Derived::f</i> fires, then we're in trouble. 
Let's demonstrate this with an example.<br /><blockquote style="font-family: courier new;"><pre><br />public abstract class Abstract<br />{<br /> public Abstract() { showPi(); }<br /> public abstract void showPi();<br />}<br /><br />public class Concrete extends Abstract<br />{<br /> final double PI;<br /> public Concrete() { this.PI = 3.14159; }<br /> @Override public void showPi()<br /> {<br /> System.out.println(PI);<br /> }<br />}<br /><br />public class Main<br />{<br /> public static void main(String[] args)<br /> {<br /> new Concrete();<br /> //<br /> // "0.0" gets printed, not "3.14159"<br /> }<br />}</pre></blockquote><br />How can this be? It's as if the <i>final</i> PI value has changed. In fact, that's exactly what has happened. When a new Java object is allocated from the heap, all its fields are zeroed out. So when the constructor of <i>Abstract</i> fires, the memory location where PI lives contains zero. Later on, when the constructor for <i>Concrete</i> fires, that memory location is overwritten by 3.14159. Any subsequent attempts to call <i>showPi</i> will print "3.14159".<br /><br />How serious is this problem in Java? I argue that it's not too serious, as long as developers are trained in this behavior, and they know what to expect. The greater dangers come from language quirks that surprise the coder, or from the clever coder who tries to exploit the poorly lit street corners of the language.<br /><br />There are a few reasons why this problem is not too awful. First, the behavior is still deterministic. The fields of the object are all zeroed out when it's allocated from the heap, so there is no surprising cruft left in those memory addresses. No matter how many times I run my program above, I'm always going to print "0.0" and not some random bits.<br /><br />Second, it's prudent for constructors only to call methods that are themselves final (meaning non-virtual). 
This is a common coding convention, and embracing it leads to code that's easier to understand and maintain. Tools like the <a href="http://fb-contrib.sourceforge.net/">fb-contrib</a> plugin for <a href="http://findbugs.sourceforge.net/">Findbugs</a> can enforce this convention.<br /><br />Finally, classes that extend base classes know what their superclass is. It's a bit difficult to sneak dodgy behavior into a base class without being seen by the designer of the derived class, particularly when your tools will detect it. Consider that the source code of the child class itself will specify the particular base class it extends.<br /><br />How serious these non-final finals are in Scala, however, may be another matter. We'll investigate this in the near future.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-52535316602910527492009-04-29T14:31:00.011-04:002009-05-01T08:06:10.633-04:00Scala Dependency InjectionFirst in a series... This post flows from my attempts to understand and apply the lessons from the "Modular Programming Using Objects" chapter of the <span style="font-style: italic;">Programming in Scala</span> book by Odersky, Spoon, and Venners. As much as I love that book, I felt that the chapter was a bit too terse for my poor brain, and there's a lot of exploration possible for the ideas found there. In other words, maybe all this is obvious to everybody but me, but here goes...<br /><br />I'm a <a href="http://docs.codehaus.org/display/PICO/Constructor+Injection">constructor injection</a> chauvinist. I don't hate setter injection, but I avoid it if I'm able. I do appreciate that how one does <a href="http://www.martinfowler.com/articles/injection.html">inversion of control</a> is somewhat a matter of taste. But a couple of defenses of my preference come to mind.<br /><br />First, I like my finals. 
In Java, member fields assigned in a constructor can be final, and that prevents me from accidentally changing things I shouldn't change. <a href="http://en.wikipedia.org/wiki/Poka-yoke">Poka-yoke</a> has saved many a developer many a time. I think it was the noted software developer <a href="http://www.imdb.com/title/tt0070355/quotes">Harry Callahan</a> who advised us to know our limitations. On second thought, I think he might have been speaking in a different context, but I'm well aware of the kinds of programming mistakes I'm prone to make.<br /><br />Second, a once-used <span style="font-style: italic;">set</span> method looks like dangling cruft. It's usually public so as to be callable by frameworks, so it lessens the signal-to-noise ratio of the class's source code. How? Well, the users of the class must be educated not to call the special set method. I'm troubled by a method with a standard name that suggests a particular usage, but then behaves unpredictably if that usage is attempted.<br /><br />Additionally, the instantiators of the class must be educated to call the special set method, and not to use the class before doing so. It's never a good idea to surprise the coder, and constructors that don't finish constructing will enable partially built objects to exist. Maybe this is just a violation of Poka-yoke again.<br /><br />In Scala, instead of finals, we have vals. And in addition to dependency injection frameworks like Guice or Spring, we have a lovely way within the language to assemble object graphs. It could well be argued that such frameworks are merely clunky compensators for weaknesses in the Java language itself, such as the lack of mixins.<br /><br />Imagine an <span style="font-style: italic;">AutoPilot</span> object that needs to ask questions of a <span style="font-style: italic;">FuelSensor</span> object. 
The fuel sensor has a <span style="font-style: italic;">remaining_liters</span> method that the auto pilot might need to call from time to time. So our object graph comprises an auto pilot object with a pointer to a fuel sensor. This graph has to be instantiated when the program starts.<br /><br /><blockquote style="font-family: courier new;"><pre><br />class FuelSensor {<br /> def remaining_liters: Int = { //blah blah<br /><br />class AutoPilot(<br /> private[this] val fuel_sensor: FuelSensor) {<br /> // blah blah<br /></pre></blockquote><br /><br />A typical Scala approach to dependency injection will encapsulate the initialization of the object graph inside a trait that can be "with"ed into the application.<br /><br /><blockquote style="font-family: courier new;"><pre><br />trait ProductionEnvironment {<br /> val the_fuel_sensor = new FuelSensor()<br /> val the_auto_pilot = new AutoPilot(the_fuel_sensor)<br />}<br /><br />object MyApp extends Application<br /> with ProductionEnvironment { // blah blah<br /></pre></blockquote><br /><br />Of course, one can initialize Scala object graphs using Spring XML files or Guice annotations, but the trait approach has a nice advantage: if you make a spelling mistake, it's a compilation error, not a runtime problem. Eventually, we're going to see that it enjoys other niceties, too.<br /><br />In real life, I'll have many environments. For example, when I want to unit test my auto pilot class, I might do something like the following.<br /><br /><blockquote style="font-family: courier new;"><pre><br />trait AutoPilotTestEnvironment {<br /> val the_fuel_sensor = new FuelSensor {<br /> override def remaining_liters: Int = {<br /> // mock implementation here<br /> }<br /> }<br /> val the_auto_pilot = new AutoPilot(the_fuel_sensor)<br />}<br /></pre></blockquote><br /><br />In the above example, I'm free to use TestNG or ScalaTest if I prefer. 
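To make the payoff concrete, here is a hedged, self-contained sketch of the same pattern. The class bodies, the low_on_fuel query, and the mocked reading of 42 liters are all invented for illustration; only the names and the trait shape come from the post.

```scala
class FuelSensor {
  def remaining_liters: Int = 1000 // pretend to read real hardware
}

// low_on_fuel is a hypothetical query, just to give the test something to check.
class AutoPilot(private[this] val fuel_sensor: FuelSensor) {
  def low_on_fuel: Boolean = fuel_sensor.remaining_liters < 100
}

trait AutoPilotTestEnvironment {
  val the_fuel_sensor = new FuelSensor {
    override def remaining_liters: Int = 42 // mock implementation
  }
  val the_auto_pilot = new AutoPilot(the_fuel_sensor)
}

// Whether driven by TestNG, ScalaTest, or a plain object, the test
// just mixes in the environment and exercises the graph.
object AutoPilotCheck extends AutoPilotTestEnvironment {
  def run(): Unit = assert(the_auto_pilot.low_on_fuel) // 42 < 100
}
```

Misspell the_fuel_sensor anywhere in that trait, and it's the compiler, not a stack trace, that tells you so.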
Moreover, I can opt for a separate <span style="font-style: italic;">MockFuelSensor</span> class instead of an anonymous one inside the trait. Don't let such details be distracting. The real point is that instead of being in XML-heck with Spring, I can create specific environment traits to assemble meaningful object graphs. And the compiler helps me.<br /><br />There's a second concrete advantage of the Scala "in-language" approach to dependency injection (DI). I can use Object Oriented (OO) principles -- that is, the separation of the general from the specific -- to organize different configurations thoughtfully.<br /><br />Suppose, for example, that there were two flavors of fuel sensors. Let's emend our code example a bit. A couple of concrete fuel sensor implementations would inherit from the abstract fuel sensor type.<br /><br /><blockquote style="font-family: courier new;"><pre><br />abstract class FuelSensor {<br /> def remaining_liters: Int<br /> // blah blah<br /><br />class JetFuelSensor extends FuelSensor {<br /> def remaining_liters: Int = { // blah blah<br /><br />class PropellorFuelSensor extends FuelSensor {<br /> def remaining_liters: Int = { // blah blah<br /></pre></blockquote><br /><br />The beauty here is that I can create mixins to mirror the inheritance hierarchy of the objects being initialized. 
Our production environment trait becomes abstract, leaving configuration-specific mixins to handle the varying construction details.<br /><br /><blockquote style="font-family: courier new;"><pre><br />trait ProductionEnvironment {<br /> val the_fuel_sensor: FuelSensor<br /> val the_auto_pilot = new AutoPilot(the_fuel_sensor)<br />}<br /><br />trait JetFuelEnvironment {<br /> val the_fuel_sensor = new JetFuelSensor<br />}<br /><br />trait PropellorFuelEnvironment {<br /> val the_fuel_sensor = new PropellorFuelSensor<br />}<br /><br />object JetApplication extends Application<br /> with JetFuelEnvironment<br /> with ProductionEnvironment { // blah blah<br /></pre></blockquote><br /><br />This feels right. Knowledge about how to construct concrete objects can be collocated with their class definitions, if so desired. I can (no pun intended) mix and match my mixin environments to assemble the exact configuration I want for a given application. Spelling errors are detected early (at compile time).<br /><br />Now for sure, I could do much of this in Spring by including XML fragments inside master configuration files, but I think it's much nicer on the human to use genuine, language-supported OO features. IDE (Integrated Development Environment) support is natural, and that's a big win here.<br /><br />Stay tuned for more thoughts about Scala dependency injection, and for more refinements of our example. We still have to deal with a handful of "real world" considerations as we transform our toy system into an industrial strength solution. The goal of this post was just to throw up a straw man, whom we can clothe in armor as we go along.<br /><br />In summary, Scala's support for mixins offers a nice, in-language way to initialize object graphs. Though perfectly compatible with dependency injection frameworks, Scala offers an approach that enjoys a couple of advantages. First, because configurations are code, certain errors are detected early. 
Second, configuration details can be partitioned meaningfully in traits, and then assembled in a more user-friendly fashion than XML files or annotations.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-78226677029157650122008-12-26T21:35:00.005-05:002008-12-26T22:01:12.739-05:00Software Velocity as a Physical Observable<span style="color: rgb(51, 102, 255);">The Math Professor</span><br /><br />I had a Mathematics professor who once said, “You’ve been using numbers for quite a while, so it’s about time you learned what they really are.” This was in an analysis class where we were learning about real numbers. There are a couple of ways to define real numbers: an axiomatic approach and a construction-based approach. In the former, you write down the fourteen or fifteen properties of real numbers and presume that a set containing such elements exists. In the latter, you build real numbers out of rational numbers.<br /><br />It’s almost true that real numbers are merely sequences of rational numbers. It’s well known that <span style="font-style: italic;">pi</span> or the square root of two are not rational, but there are sequences of rational numbers that converge to these real numbers. However, the sequences are not unique. There are lots and lots of sequences whose limit is <span style="font-style: italic;">pi</span>.<br /><br />For example, just take any sequence that converges to <span style="font-style: italic;">pi</span> and prepend any integer to the beginning of it. Since you have an infinite number of integers to choose from, you have at least an infinite number of sequences that all converge to the same real number. If that strikes you as too much of a cheat, consider that different algorithms exist for approximating <span style="font-style: italic;">pi</span>. 
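To make that concrete, here is a small sketch of two different algorithms that converge to the same real number. The choice of these two particular series is mine, purely for illustration.

```scala
// Leibniz series: pi/4 = 1 - 1/3 + 1/5 - 1/7 + ...  (converges slowly)
def leibnizPi(terms: Int): Double =
  4.0 * (0 until terms).map(k => math.pow(-1, k) / (2 * k + 1)).sum

// Nilakantha series: pi = 3 + 4/(2*3*4) - 4/(4*5*6) + ...  (converges faster)
def nilakanthaPi(terms: Int): Double = {
  var pi = 3.0
  var sign = 1.0
  for (k <- 1 to terms) {
    val n = 2 * k
    pi += sign * 4.0 / (n * (n + 1) * (n + 2))
    sign = -sign
  }
  pi
}
```

Run each with ever more terms and the outputs close in on the same limit, just at very different rates; in the language above, the two sequences belong to the same equivalence class.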
Some converge faster than others, but if you could run them forever, they’d all produce <span style="font-style: italic;">pi</span>.<br /><br />So, an individual sequence doesn’t correspond to a real number, but the set of all sequences with the same limit does. In other words, real numbers are equivalence classes of sequences of rationals. On the one hand, we could have a totally abstract definition of real numbers, based on axioms. (Multiplicative commutativity, the distributive law, and so on.) But on the other hand we could have a more pragmatic definition rooted in equivalence classes of simpler, more mundane objects.<br /><br />There’s something magical about equivalence classes. Consider again that two different algorithms can generate sequences of digits that both converge to <span style="font-style: italic;">pi</span>. If you’ll pardon the pun, <span style="font-style: italic;">pi</span> must be, well, real, if you can arrive at it in different ways. Equivalence classes of concrete things agree with the abstract real number concept, which is otherwise described by axioms alone.<br /><br /><span style="color: rgb(51, 51, 255);">The Physics Professor</span><br /><br />I had a Physics professor who advanced a compelling definition of mass. “Mass,” he said, “is that property of a substance that makes the law of conservation of momentum true.” We all found this delightful. It was a rigorous step up from the circular pseudo-definitions we had seen before. (Mass is quantity of matter. What is matter? Matter is anything that occupies space and has mass. What then is mass, again? Ugh.)<br /><br />As curious Physics majors, the professor’s axiomatic approach to mass appealed to our inner Euclid. First, demand that something like the law of conservation of momentum works, and then see what follows from that. But, in time, I came to feel that it was dangerously abstract. Sure there’s this elegant concept, but why should the real universe obey it? 
Yes, it was the apparent utility of mass and momentum that made them worth studying, but were we doing Physics (where we’re never sure that we’re right) or doing Mathematics (where we’re right but we don’t know what we’re talking about)?<br /><br />Anyway, there are a number of operational definitions of mass. By “operational,” we mean measurements that rely directly on experiments, even if those experiments are only <a href="http://en.wikipedia.org/wiki/Thought_experiment">gedankens</a>. For example, if you pull on an isolated body with a rubber band stretched to a fixed length, then the object’s mass can be operationally defined as the reciprocal of the resulting acceleration.<br /><br />Other operational definitions are possible. Imagine tying a body to a test mass and spinning them about each other in zero-g. The location of the axis on the rope can be used to measure the ratio of the two masses. You could also imagine a mass-measurer made out of an asymmetrical tilt-a-whirl, where the units surprisingly turn out to be seconds-squared.<br /><br />You could repeat many experiments, and the results would -- if you’ll pardon the term -- converge on the true mass as operationally defined. Moreover, different methods of measurement will agree. The different operational definitions produce consistent measures. You know where I'm going with this. I'm about to suggest that they form an informal equivalence class.<br /><br />The abstract definition of mass is real, and it’s complemented by the equivalence class of all operational definitions of mass. This same line of reasoning sheds insight into other physical observables, such as charge, temperature, and so on. I’m struck by the analogy to the two complementary ways to define real numbers. 
Equivalence classes of operational definitions marry real world phenomena to abstract models, where we can flex our mathematical muscles and do interesting work.<br /><br /><span style="color: rgb(51, 102, 255);">Software Velocity</span><br /><br />When tracking progress on a <a href="http://alistair.cockburn.us/Earned-value+and+burn+charts">burn-up chart</a>, agilists concede that the units of velocity are not important. They could be person-days, story-points, or whatever. Nevertheless, the velocity concept has meaning. I imagine that each team is like its own custom experimental apparatus, providing its own operational definition of velocity.<br /><br />But is software velocity real? Yes. It’s an abstract concept whose complement is the equivalence class of all operational definitions of software velocity. The difference between physical mass and software velocity is that the experimental equipment needed to operationally define physical mass is very simple, at least in principle. But the analogous equipment for a software development project lives in the deep structures of the developers’ brains, drawing upon whatever arcane magic causes them to choose which <a href="http://www.agile-software-development.com/2007/12/whats-point-in-estimating.html">Fibonacci</a> number corresponds to the heft of any given feature.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-60576401821985258502008-09-20T12:00:00.000-04:002008-09-20T12:04:16.984-04:00Normative Theorems, Narrative PrinciplesWe need a better word than "spec" to name the documents we write. A specification well-describes things that are specifiable, such as a language grammar or a communication protocol or a reusable library. 
So, if you need to write an <a href="http://en.wikipedia.org/wiki/Ada_%28programming_language%29">Ada</a> compiler, or implement an HTTP client, or use the <a href="http://en.wikipedia.org/wiki/Standard_Template_Library">STL</a>, then the specs for these sorts of things are probably sufficient for you to get your work done.<br /><br />But, that's because you already knew a thing or two about compilers, or <a href="http://en.wikipedia.org/wiki/Inter-process_communication">IPC</a>, or generic programming. If that's the kind of thing that you do in your day job, then I'm happy for you, and a little jealous. A lot of software has to do with modeling poorly understood domains, where there is little agreed-upon common vocabulary or precedent. Some days I get confused about exactly what my job is, because I've strayed into something quite new.<br /><br />If that describes you too, then a normative-only spec doesn't reveal the gist of the problem to be solved. Many of the so-called specs that we write concern "wicked" problems, that is, problems that are so intractable that they can only be stated by solving them. I've often seen requirements documents go on for pages and pages under the guise of rigor, yet fail to convey enough context for why their enumerated and cross-referenced <a href="http://en.wikipedia.org/wiki/Requirement#Documenting_requirements"><span style="font-style: italic;">shall</span></a>s are good ideas.<br /><br />During the actual process of composing such a document, a good writer will discover and answer these "why" questions. Options are weighed and decisions are made. But if the reasoning isn't recorded, then the spec degenerates into solipsist snippets of conversations between the author and himself. That's hardly communication.<br /><br />At work, I try to champion putting more non-normative context into our documents. Unfortunately, I'm really good at losing arguments. 
So, I've started to wonder if any lessons from Mathematics or Physics can shore up my reasoning. I can think of three reasons why normative-only specifications are like Mathematical proofs.<br /><ul><li>They hide the many wrong turns and false starts that their authors suffered while working.</li><li>They are themselves the <span style="font-style: italic;">end </span>product of the effort, not necessarily a <span style="font-style: italic;">means </span>to an end. (Implementing the spec, or applying the theorem, is often left to others.)</li><li>Specs and proofs are both terse, but they convey enough information for their consumers, as long as their consumers have the expertise and context to read them.</li></ul>Somehow, I'm reminded here of <a href="http://nobelprize.org/nobel_prizes/physics/laureates/1965/feynman-bio.html">Richard Feynman</a>'s "Lost Lecture" on the Motion of Planets Around the Sun. He wished to prove that orbits are ellipses without using calculus, and said,<br /><blockquote>I am going to give what I will call an elementary demonstration. [But] elementary does not mean easy to understand. Elementary means that nothing, very little, is required to know ahead of time in order to understand it, except to have an infinite amount of intelligence. It is not necessary to have <span style="font-style: italic;">knowledge</span>, but to have <span style="font-style: italic;">intelligence </span>in order to understand an elementary demonstration.<br /></blockquote>In this sense, our normative software specs need to be <span style="font-style: italic;">elementary</span>. Unfortunately, I have never met a software spec writer as gifted as Feynman. Neither have you.<br /><br />A while back, a pretentious document crossed my desk. It dotted every "i" and crossed every "t." Yet, after several readings, I still had no idea what the point was. 
There was a whole bunch of language that felt like it was adding fault tolerance features to an existing distributed product, but at the same time there was no redundant hardware to make changes worthwhile. So, naturally, I started to inject myself into hallway conversations to get the gist of the project.<br /><br />It turns out that the existing product relied on some third-party software that came with some pretty onerous licensing fees. The whole point of the project was to rework how the system was deployed to save money. If you could revisit a few architectural decisions, then the expensive software could be localized in fewer places, thus saving a few bucks. Knowing that one little gem would have made the spec comprehensible, but it was nowhere to be found in the text because it was "non-normative."<br /><br />Of course, you already know the punchline of the story. After spending a bunch of cash on the new project, somebody finally had the idea to call up the third-party software vendor and try to negotiate a more reasonable price. They agreed, and the re-architecture effort turned out to be a very expensive way to save money. By every criterion, whether dollars spent or risk incurred or opportunities lost, the phone call was the better option. But since not enough people were trusted with the real objective of the project, the optimal solution was discovered too late.<br /><br />In hindsight, calling up the vendor seems obvious, but it wasn't. For those developers, who seldom interacted non-technically, it actually typified out-of-the-box thinking. Creativity can't be scheduled into the spec-writer's block of time, because nobody can think of everything. The deliberate decision to exclude non-normative information from the original spec enabled this mistake.<br /><br />The best solution to a wicked problem is not commanded; it often emerges as it is solved. 
Had enough people understood what they were being asked to do, somebody would have made that phone call sooner. Consider <a href="http://en.wikipedia.org/wiki/Eric_S._Raymond">Eric Raymond</a>'s assertion that "given enough eyeballs, all bugs are shallow." It's interesting to speculate whether there exists some parallel to <a href="http://en.wikipedia.org/wiki/Linus%27s_Law">Linus's Law</a>. Perhaps enough eyeballs tame all wicked problems.<br /><br />I suppose rationale doesn't matter when the spec itself is the work product. A compiler writer doesn't really have to know why Ada has two different functions for mod and remainder. One would just have to know the rules for each, and implement them correctly. But many of our software documents are not work products themselves; they are just the means by which communities build work products.<br /><br />Much of what we do is <a href="http://www.poppendieck.com/wicked.htm">wicked</a>.<br /><br />The analogy to nature came to mind as I thought about this software development question. I can hone my mathematical skills by reading and doing mathematics. Terse proofs are suitable in Mathematics, because anything you need to know is already out there for the taking. (Can you tell whether I'm a <a href="http://en.wikipedia.org/wiki/Philosophy_of_mathematics#Platonism">Platonist</a>?) But Physics doesn't work that way. Or, at least, it doesn't work that way in my brain. One can't do new physics without doing experiments, and the most exciting experiments are done in unfamiliar domains.<br /><br />So, mathematical proofs are like normative specifications, but physical principles are like domain expertise. That surely can't be guessed a priori. Now, suspend your objections for a minute. Yes, Physics has proofs. Yes, Mathematics is at least a quasi-empirical science. Yes, the analogy is bad. But all analogies are bad. 
I'm not really trying to make a point about Mathematics and Physics here; I'm trying to explore how to write better software documents.<br /><br />For example, I think one can reason their way towards the notion that there are more real numbers than there are rationals. (I'm not saying everyone is <a href="http://en.wikipedia.org/wiki/Georg_Cantor">Cantor</a>, but I am saying that bright people can follow his conclusions, and that he didn't need to build a machine to discover them.) This contrasts with Physics. I don't think anybody thought up the <a href="http://en.wikipedia.org/wiki/Quantum_Hall_effect">quantum Hall effect</a> before it was discovered experimentally, even though both quantum mechanics and the <a href="http://en.wikipedia.org/wiki/Hall_effect">Hall effect</a> were already lying around. Physics is wicked.<br /><br />The second most famous equation in Physics is due to Newton, F = ma. With calculus, we economically write the more general form F = dp/dt, where p represents the momentum of the object. In all our ordinary experience, momentum is the product of the object's mass and velocity, p = mv. Newton's second law tells us that forces impart changes to an object's momentum.<br /><br />Embracing the analogy, we can say that F = dp/dt is a normative specification for how the universe works. Strictly speaking, you don't need to convey much else. You don't need an appendix containing the history of Leibniz and Newton. As long as you already know calculus and the definition of p, you're off to the races.<br /><br />Knowing calculus is a technical skill. If you <a href="http://en.wikipedia.org/wiki/Grok">grok</a> <a href="http://en.wikipedia.org/wiki/Method_of_Fluxions">fluxions</a>, then you understand what F = dp/dt really means. Given some measurements, you can make some meaningful, quantitative remarks about an object's behavior.<br /><br />Maybe it's technical skill like being able to read <a href="http://www.uml.org/">UML</a>. 
If you look at the arrows on a class diagram (whether inheritance or composition), then you can tell what depends on what in the picture. If there are cycles, then you might be able to say something intelligent about the quality of the design.<br /><br />Einstein realized that some pretty weird things happen when objects move at speeds that are non-trivial fractions of the speed of light. Our humdrum definition of momentum grows some wrinkles. Our familiar p = mv turns out to be the low velocity approximation of the more general p = mv/sqrt(1-v^2/c^2).<br /><br />The normative formula, F = dp/dt is still quite correct! Even with Einstein's discovery, I wouldn't need to rewrite my normative spec. But it's going to be misunderstood by anybody who only knows the low velocity definition of p. When you start to pull people out of the domain in which they are most comfortable, normative isn't enough.<br /><br />So, if "spec" is too pretentious for what we do, what's the alternative? Our documents need to be more humble. It's a happy coincidence of English that humble and human are such similar words, because when trying to evolve solutions to wicked problems, it's all about people communicating.<br /><br />Specs are perfectly good work products that describe what something has to do, and they have a crucial role in software development. But not every document is an end unto itself. Many are living documents. What could we call those documents, which people use as tools to build a work product?<br /><br />Narratives.<br /><br />Specs are <span style="font-style: italic;">ends </span>of well-defined efforts. Narratives are <span style="font-style: italic;">means </span>of attacking wicked problems. In literature, the best narratives uncover profound truths about the human condition. 
In software development, the best narratives collect the relevant principles needed to attack the problem at hand.<br /><br />Einstein elucidated the principle that the speed of light is constant, regardless of the speed of the observer. This principle drove a rethinking of the definition of momentum, but it only matters for objects outside of our familiar experiences. However, when attacking problems where the principle is relevant, we'd better make it clear. I'm going to start calling what I do in my day job narrative-writing instead of spec-writing, and see where that leads.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-28555049790670301282008-01-06T13:00:00.000-05:002008-01-06T14:07:37.660-05:00Methods Of ProofOver the past couple of weeks or so, I've been playing with some recreational math. It was pointed out to me that the sum of the first <span style="font-style: italic;">n</span> cubes always seemed to be a square. For example, 1^3 + 2^3 + 3^3 = 36 = 6^2. This is probably obvious to the highly <a href="http://en.wikipedia.org/wiki/Numeracy">numerate</a>, but it blew my mind to learn it. After struggling a bit, I could show that the sum of the first <span style="font-style: italic;">n</span> cubes is always the square of the sum of the first <span style="font-style: italic;">n</span> integers. Wow!<br /><br />I came up with a couple of different proofs that were full of tedious algebra, and didn't seem to offer any insights into <span style="font-style: italic;">why </span>such a relation should hold. I became convinced that something so easy to state should have a brief demonstration, possibly geometric, and definitely more intuitive. For quite a while, I couldn't find such a thing.<br /><br />Then it hit me. <a href="http://mathworld.wolfram.com/Induction.html">Induction</a>! 
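Before any proof, the claim itself is cheap to spot-check by brute force; a quick Ruby sketch (a check, not an argument):

```ruby
# Verify that 1^3 + 2^3 + ... + n^3 == (1 + 2 + ... + n)^2
# for the first hundred values of n.
(1..100).each do |n|
  sum_of_cubes  = (1..n).sum { |k| k**3 }
  square_of_sum = (1..n).sum**2
  raise "identity fails at n=#{n}" unless sum_of_cubes == square_of_sum
end
puts "holds for n = 1..100"
```

A check like this proves nothing on its own, but it builds confidence that the induction is worth writing out.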
There are so many methods of proof: reductio ad absurdum, proof by construction, even proof by computer. Once I remembered the technique, I quickly banged out a proof by induction that the sum of the first <span style="font-style: italic;">n</span> cubes is the square of the sum of the first <span style="font-style: italic;">n</span> integers in just a handful of lines.<br /><br />Mathematical proofs to me are a lot like tests in software development. Just as there are varied methods of proof, there are varied mechanisms for testing the deliverables in the software development process. For example, I might test that the requirements are met with formal acceptance testing. In other words, the acceptance test proves the functional spec (or some other document).<br /><br />As another example, someone might say that the unit tests prove the code. As a fan of <a href="http://c2.com/cgi/wiki?TestDrivenDevelopment">test driven development</a>, I would say exactly the reverse. But the point is that different testing mechanisms are suitable for different parts of the development process. Just as a mathematician must be well versed in different methods of proof, we software folks must be well versed in different testing mechanisms.<br /><br />But what tests the <span style="font-style: italic;">design</span>?<br /><br />Sure, a formal inspection can uncover lots of problems in a document, but is that really sufficient to demonstrate that a design is any good? Multithreading issues, resource leaks, inadequate error handling, and bottlenecks are difficult to uncover in a review, except in the most obvious cases. Formal review chauvinists are sure to challenge me on that opinion. But I believe that if humans really could anticipate how complex systems behaved, then there wouldn't be any such phrase as <a href="http://en.wikipedia.org/wiki/Emergence">emergent behavior</a>.<br /><br />So, I feel that we need another "method of proof" suitable for evaluating design fitness. 
Ideally, it would be inexpensive and repeatable, giving it another leg up on formal inspections. It should also be available as early in the development process as possible. I suggest that designs are testable by <span style="font-style: italic;">simulation</span>.<br /><br />The project I've been working on most recently has lots of algorithms with tunable parameters, and one of the open questions is how much hardware firepower we really need to handle realistic customer loads. Although I can imagine that somebody much smarter than I could figure this out from first principles, I wouldn't have much confidence in any design that wasn't vetted by simulations.<br /><br />Happily, with simulation, we don't have to be done before we find out that our design is broken. With languages like Ruby, we can develop margin eaters and simulators rapidly. Design trouble spots can be identified early, maybe even before any production code is written. Of course, this is only possible if the architecture is amenable to simulation.<br /><br />One figure of merit for a good architecture is how early it can support simulations. For example, a recent effort I've shepherded used text over sockets for interprocess communication. There were a lot of fans of <a href="http://java.sun.com/javase/6/docs/platform/serialization/spec/serialTOC.html">Java Object Serialization</a>, but that would have required Java on both sides of the connection. Sticking with text over sockets allowed rapid development of Ruby simulators and placeholders.<br /><br />Testing designs by simulation is not an earth-shattering suggestion. Mature software companies have been using simulation forever, replacing margin eaters with production code as development proceeds. 
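To make the idea concrete, here is a minimal Ruby sketch of the kind of capacity simulation in question; the workload numbers are invented purely for illustration:

```ruby
# Toy capacity simulation: each tick, some tasks arrive; each task
# occupies one worker for a fixed number of ticks. How many workers
# keep the backlog flat? (All numbers here are made up.)
def backlog_after(workers, ticks, arrivals_per_tick, service_ticks)
  queue = 0
  busy_until = Array.new(workers, 0)  # tick at which each worker frees up
  ticks.times do |t|
    queue += arrivals_per_tick
    busy_until.each_with_index do |free_at, i|
      if free_at <= t && queue > 0
        queue -= 1
        busy_until[i] = t + service_ticks
      end
    end
  end
  queue
end

# With 3 arrivals per tick and 2 ticks of work per task,
# 6 workers is the break-even capacity (3 * 2).
puts backlog_after(5, 1000, 3, 2)  # under-provisioned: backlog grows
puts backlog_after(7, 1000, 3, 2)  # backlog stays near zero
```

Even a toy like this answers a design question (how many boxes?) before a line of production code exists, which is the whole point of testing designs by simulation.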
But as an architect, I have profited from the notion that simulation is a kind of test, a method of proof, for designs themselves.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-4412212427305677302007-11-23T15:02:00.001-05:002007-11-23T16:30:28.411-05:00NMI MineThere's an old story about a computer science professor who tried to give his son a middle name of the empty string. Whether true or not, such efforts would surely fail. How could one fill out the birth certificate? The social security forms? There really is no way to indicate that a middle name exists but has no letters in it.<br /><br />I used to work at a big company, complete with photo id badges. Full names were printed on them. If someone lacked a middle name, their badge displayed "NMI" for "No Middle Initial." Naturally, I wondered what would happen if we ever hired someone with a middle name of "NMI." Of course, this never happened. But if I were responsible for the id badge software, you can bet that a person with such a name would be a test case.<br /><br />Programming 101 teaches us to keep data and control separate. But in reality, this lesson is violated all the time. Consider Unix system programming. If you want to open a file descriptor, the <span style="font-style: italic;">open()</span> function returns it, or it returns -1 on failure (with the reason left in errno). Since the failure value is negative and all file descriptors are non-negative, this doesn't seem to get us into too much trouble.<br /><br />But, before you fancy yourself as wise as the designers of Unix, it's worth keeping data and control separate whenever possible. I'm familiar with a software system that schedules tasks on distributed embedded hardware. 
For reporting, the system writes a <a href="http://en.wikipedia.org/wiki/Comma-separated_values">CSV</a> file that can be imported into Excel for friendly display.<br /><br />To schedule a task, a <a href="http://www.w3.org/TR/soap/">SOAP</a> message is sent into the system. This is interpreted, log messages are written, and an appropriate embedded device is chosen. Then the system sends a message of its own to the device, passing along the scheduling information.<br /><br />Scheduled tasks can be edited. But if the start time has already arrived, then the start time can't be changed. Only other details can be changed. In such cases, the update message contains a "zero" as the start time, since null start times were not allowed by the <a href="http://www.w3.org/XML/Schema">XML schema</a>.<br /><br />This use of a magic value, where data masquerades as control, seemed harmless enough at the time. I let it slip by. There were bigger battles to wage, and I felt that I had been saying "no" too frequently anyway. Certainly the designers on the team felt that I had. Unfortunately, this was a mistake.<br /><br />You see, the embedded systems ran Linux. So their "zero" time was the 1970 epoch. However, the outside world was using 1900 as zero time, because that's what Excel uses. I'm embarrassed to say that it took us many days to figure this out.<br /><br />I wish I could blame the delay in fixing this bug on the fact that we have a geographically distributed team. I wish that human language barriers were a suitable excuse. I wish that I could offer something to deflect the cause away from my own bad judgment.<br /><br />But I cannot. The real root cause of this bug was that I allowed an architectural flaw to creep into the system. 
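For the record, the gap between those two zero times is enormous and easy to compute; a Ruby sketch (Excel's serial-date origin has its own famous leap-year quirk, ignored here):

```ruby
require 'date'

unix_zero  = Date.new(1970, 1, 1)  # the epoch on the embedded Linux boxes
excel_zero = Date.new(1900, 1, 1)  # roughly where Excel starts counting days

# A "zero" written under one convention and read under the other
# is off by seven decades -- not a subtle off-by-one.
offset_days = (unix_zero - excel_zero).to_i
puts offset_days                    # 25567 days
puts (offset_days / 365.25).round   # 70 years
```

Seeing the disagreement laid out this plainly is exactly what took us many days to do in our heads.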
Thou shalt not conflate data and control.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-68017978662854757422007-11-14T10:00:00.000-05:002007-11-14T09:32:20.317-05:00Aspect VariationsIn a <a href="http://naturalsoftware.blogspot.com/2007/11/oo-is-to-vector-as-aspect-is-to.html">prior post</a>, I suggested an analogy between different ways of formulating Physics and the contrast between thinking about objects and aspects. If <a href="http://en.wikipedia.org/wiki/Object-oriented_programming">OO</a> is like vectors and matrices, then aspects are like variational principles. There's a cute little follow-up that I'd like to explore presently.<br /><br />I'm very fond of <a href="http://logging.apache.org/log4j/">Log4J</a> and diagnostic contexts. In a word, diagnostic contexts conveniently allow the code to store arbitrary strings in <a href="http://en.wikipedia.org/wiki/Thread-local_storage">thread local storage</a>. So suppose I have a jar that offers functionality to persist to a database or a file. The application level code can set a diagnostic context, and then the log messages written by the jar's code will contain that useful information.<br /><br />In pseudo-code, we might do something like this...<br /><pre><br />import org.apache.log4j.NDC;<br />import mypersistpackage.Persister;<br />//...<br /> NDC.push("current cseq=" + cseq);<br /> Persister.save(foo);<br /></pre><br /><br />Then any Log4J messages written by the <span style="font-style: italic;">Persister.save</span> method will contain the specified cseq number. This is incredibly useful, and we didn't have to modify the <span style="font-style: italic;">Persister</span> source code at all.<br /><br />However, a subtle problem arises when we're using thread pools. 
Whenever the application code makes an <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPC</a>, there's a chance that the thread will get returned to the pool during the call. This means that when the remote procedure returns, we might pick up where we left off in a different thread. As a result, any context held in thread local storage will be stale.<br /><br />In our example, we'd risk printing out the wrong cseq number after making a remote procedure call. The risks of this would increase when the system is under load, which is exactly when log messages are most important.<br /><br />In a prior post, it was suggested that an aspect could advise all RPCs, so that timing information could be gathered. This is an example of a global principle that can be applied across the entire application. We can press that aspect into service to make sure our diagnostic contexts are not mangled when coming back from a remote call.<br /><br />Even though we <span style="font-style: italic;">could </span>do this with our OO helper class described in the prior post, this solution just plain didn't occur to us until we started thinking in aspects. (In fact, we abandoned using diagnostic contexts altogether.) And sure, I <span style="font-style: italic;">could </span>solve the <a href="http://mathworld.wolfram.com/BrachistochroneProblem.html">brachistochrone</a> problem numerically using vectors, but I'm sure that a computed solution wouldn't give me the same insight I'd get by using the calculus of variations.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com1tag:blogger.com,1999:blog-3257506294674819641.post-50763953276845841132007-11-13T09:00:00.000-05:002007-11-14T08:55:55.490-05:00OO is to Vector as Aspect is to ...When beginning the study of Physics, one typically learns about forces, just as Newton framed the subject. Forces are vectors, so you have to learn some trigonometry and linear algebra. 
Trajectories have a <span style="font-style: italic;">locally</span> computable flavor about them, in that they are determined by summing up the forces imparting accelerations to particles. For example, the deflection of a ray of light through a prism is given by <a href="http://scienceworld.wolfram.com/physics/SnellsLaw.html">Snell's Law</a>, which concerns itself only with the place where the light ray hits the glass, not the original source or final destination of the beam.<br /><br />And just when you are comfortable with forces being central to everything, the rug gets pulled out from underneath you.<br /><br />Physicists after Newton reformulated Physics with energy and action as central players. This requires a bit more mathematical sophistication. So as you get farther in the subject, you have to learn variational calculus. Trajectories are now the solutions to boundary value problems, and have a more <span style="font-style: italic;">global </span>nature to them. For example, the deflection of a ray of light through a prism is governed by <a href="http://scienceworld.wolfram.com/physics/FermatsPrinciple.html">Fermat's Principle of Least Time.</a> Out of a family of admissible trajectories, the one nature chooses is determined by a variational principle.<br /><br />That's a very different way of thinking about natural phenomena. I wonder whether there's an analogy here to object oriented and aspect oriented programming.<br /><br />Becoming comfortable with object oriented programming is like learning about vectors and matrices. Objects interact with each other by sending messages that change each other's state. This reminds me of forces imparting accelerations to particles. Direct method calls are like contact forces, and <a href="http://java.sun.com/products/jms/">JMS</a> calls to mind magnetism or electrostatics.<br /><br />The shift in thinking that's required to embrace aspect oriented programming is like learning about the principle of least action. 
For want of a better phrase, cross cutting concerns have a more, well, <span style="font-style: italic;">globally principled </span>feel to them.<br /><br />Consider a problem drawn from my real experiences with a system that makes a number of remote procedure calls. Sometimes, things get bogged down, and it's important to know where the time is getting consumed. The original OO approach was to design a little helper class that callers could use to keep track of how long each <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPC</a> took.<br /><pre><br />MyHelper helper = new MyHelper("rpcMethodA");<br />rpcMethodA();<br />helper.done();<br /></pre><br />The constructor took note of the current time and method name, and then the <i>done</i> method wrote an informative log message recording how long the method took.<br /><br />But, rather than polluting the biz logic with all this bookkeeping, this problem calls out for an aspect oriented solution. A better approach would be to make an aspect that did the timekeeping and logging, and then advise whatever methods you wanted. In my imagination, this feels like imposing a variational principle on the software.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com1tag:blogger.com,1999:blog-3257506294674819641.post-78014641413964877342007-11-12T22:26:00.000-05:002007-11-12T22:43:10.368-05:00Four A's and a ZedA couple of years ago, in an attempt to combat myopia, I tried to collect some thoughts on a software system I had a hand in developing. I came up with a few principles that applied to that code, but there might be something more general in them that’s worth capturing. So with apologies to <a href="http://www.imdb.com/title/tt0109831/">Hugh Grant</a>, here are Four A’s and a Zed.<br /><br /><h4>Availability is Scalability</h4><br />An important lesson learned is never try to bolt on fault tolerance or fault resiliency at the end. One has to design it in from the beginning. 
The twin of this idea is that scalability isn’t accidental either. By scalability, I mean the capability of the architecture to improve some figure of merit (throughput, for example) by throwing more hardware at it.<br /><br />All right, those points are obvious. But what wasn’t obvious (and what might not even be true generally) is the idea that availability and scalability are the same thing.<br /><br /><h4>Architecture By Contract</h4><br />ABC is a term I made up to denote an amalgam of <a href="http://archive.eiffel.com/doc/manuals/technology/contract/">Design By Contract</a> (DBC) and <a href="http://www.pragprog.com/titles/kpiod/index.html">Interface Oriented Programming</a> (IOP).<br /><br /><h4>Adaptors for Protocols, Plug-ins for Logic</h4><br />Our product had to integrate into a number of environments. One mistake we made was to mix business logic into the same code that was handling the communication. A better approach would have been to create adaptors that only handled the protocols, and contained no biz logic. Instead, separate plug-ins would contain customizable logic that could be varied independently.<br /><br /><h4>Asynchronicity Considered Harmful</h4><br />Event-driven systems can allow design decisions to be deferred. Usually, it's better not to defer such decisions, but make the hard choices up front.<br /><br /><h4>Zero Bugs in Zero Slocs</h4><br />Code that you don't have to write can't have bugs in it. Our prowess at the keyboard should not be measured by how many lines of code we write, but how few.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-36964565381224889032007-11-09T20:02:00.000-05:002007-11-09T20:11:08.626-05:00Translating OrcsI've been reading Seamus Heaney's translation of <span style="font-style: italic;">Beowulf</span>. It's fantastic. In both senses of the word. 
I've always liked this kind of stuff, and it's only a coincidence that the movie has just come out. I mean, I would have been reading it anyway.<br /><br />Something early on in the verses has caught my eye. Mr. Heaney translated "orcs" (actually <span style="font-style: italic;">orcneas</span>) as "evil phantoms." I wonder if he was tempted to translate it as simply "orcs." I bet that a lot of Beowulf readers wouldn't be thrown by the word <span style="font-style: italic;">orcs</span>. And those that were would probably be curious enough to look it up.<br /><br />The <a href="http://pragdave.pragprog.com/pragdave/2004/04/end_of_the_know.html">Pragmatic Programmers</a> give a presentation about the Dreyfus Model of communication. Workers (whether in nursing, cooking, or programming) can be divided into five categories ranging from beginner to expert. The less skilled require detailed rules. <span style="font-style: italic;">Bake in the oven at 450 for 30 minutes, then remove the pan using insulated potholders</span>. The more skilled don't require such rules, and embrace intuition. <span style="font-style: italic;">Whip up some fritters</span>.<br /><br />It's suggested that communication across too many levels is difficult. A novice cook would have trouble following the latter directive. An expert would chafe at having to issue the former one. One of the catch phrases when considering the Dreyfus Model is "legalize intuition." In other words, good organizations tend to defer to experts' intuition.<br /><br />I have to quibble with that, though.<br /><br />In our profession, one of the phrases we hear too much is: "I have <span style="font-style: italic;">n</span> years experience, so you have to just trust me." Well, first of all, there's a difference between having twenty years experience and having one year of experience twenty times in a row. 
Most of us overestimate our expertise.<br /><br />But even when the person making that argument really is an expert, I feel that it's still a cop out not to articulate the logic behind one's point of view. If you really are such an expert, you should be able to convey <span style="font-style: italic;">why </span>the solution you advocate is best. I just don't buy that the expert can't make himself understood to the novice.<br /><br />Of course, that doesn't mean that the novice will believe him. Or embrace the direction given. But that's different from being unable to communicate.<br /><br />I've been blessed with a number of very talented professors over the years. A great many were brilliant. None of them met the stereotype of solipsist genius that couldn't teach worth a darn. In fact, the most gifted were exactly the ones who communicated best.<br /><br />Nobel laureate <a href="http://en.wikipedia.org/wiki/Richard_Feynman">Richard Feynman</a> remarked that if a Physics topic could not be explained to freshmen, then physicists really didn't understand the topic.<br /><br />So, to my fellow architects out there, the next time you are charged with putting a little extra effort into defending your point of view, resist the temptation to take it as a challenge to your role in the group. Instead, welcome the opportunity to reify your intuition into a coherent explanation. And trust your audience to be bright enough, or at least curious enough, to know what <span style="font-style: italic;">orcs </span>are.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-734410159575403462007-11-08T19:26:00.000-05:002007-11-08T19:29:25.992-05:00Unscientific MethodsEvery once in a while, you need to <rant><br /><br />Some years back, I was a novice programmer on an important software effort. 
<a href="http://www.netlingo.com/lookup.cfm?term=IIRC">IIRC</a>, it was something like a couple or three dozen coders for six calendar months. There were six milestones, one scheduled at the end of each month. For some ironclad non-technical reasons whose details don't matter here, the project's final deadline absolutely positively could not budge.<br /><br />Well, you know how software development goes, and we completed our first milestone after two months elapsed. The project manager called a meeting. "Don't worry," he reassured us. "We're only a month behind schedule." I sure didn't see it that way, so I had a discreet conversation with my technical lead.<br /><br />"Our estimate for how long it would take to complete the first milestone was off by 100%," I said. "If our other estimates are similarly off, we're <span style="font-style: italic;">not </span>a month behind schedule. We're <span style="font-style: italic;">six</span> months behind!" My tech lead endured my naivete. He reminded me that we had a talented and hardworking group. The first milestone was just a fluke.<br /><br />"Well, sure," I pressed on. "But, <span style="font-style: italic;">think like a scientist</span>. We've performed one experiment. It tells us that our estimating process is off by a factor of two. I really think we need more people." It was a big company. So throwing more bodies at the problem didn't seem unreasonable to me. But it was not to be.<br /><br />We worked hard, put in lots of overtime, and completed our second milestone after two more months elapsed. The project manager was replaced. The new manager gave us the <a href="http://www.phrases.org.uk/meanings/win-one-for-the-gipper.html">Gipper</a> speech, which pretty much alienated everybody who had been slaving away for the past four months. I now had two experiments supporting my view of the schedule.<br /><br />As I saw it, the team was now effectively tasked with completing eight months' work in two months.
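The arithmetic behind that claim can be sketched in a few lines of Java. (The class and method names here are mine, invented for illustration; nothing like this existed on the actual project.)

```java
// A sketch of the "think like a scientist" estimate revision:
// scale the remaining plan by the slip factor observed so far.
class ScheduleForecast
{
    // Observed ratio of actual effort to planned effort so far.
    static double slipFactor(double plannedMonthsSoFar, double actualMonthsSoFar)
    {
        return actualMonthsSoFar / plannedMonthsSoFar;
    }

    // Forecast for the remaining milestones, assuming the slip continues.
    static double forecastRemaining(int remainingMilestones,
                                    double plannedMonthsPerMilestone,
                                    double slipFactor)
    {
        return remainingMilestones * plannedMonthsPerMilestone * slipFactor;
    }

    // How far behind the original total plan the forecast puts us.
    static double monthsBehind(double elapsedMonths,
                               double forecastRemainingMonths,
                               double plannedTotalMonths)
    {
        return elapsedMonths + forecastRemainingMonths - plannedTotalMonths;
    }
}
```

With two of six milestones done after four months, the slip factor is 2.0, the remaining four milestones forecast to eight months against the two months left on the calendar, and the project sits six months behind the plan, not two.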
I voiced this opinion, a little less discreetly this time. But on paper, management's unrevised schedule looked like four months' work in two months. Somehow that was more palatable. As the only one applying the scientific method, I just couldn't make myself believed.<br /><br />Now, I know what you're thinking. You're thinking that they really believed me, but they were wearing their special <a href="http://en.wikipedia.org/wiki/Emerald_%28color%29">emerald colored glasses</a> that get bolted on when they enter the <a href="http://en.wikipedia.org/wiki/L._Frank_Baum">Oz</a> of management. The deadline couldn't move, so they had to keep up the front that we could get the job done in time. After all, <a href="http://memory-alpha.org/en/wiki/Khitomer_Massacre">everyone died at Khitomer</a>, because the alternative would be unthinkable.<br /><br />I might buy that, if it weren't for what happened next.<br /><br />To make up for lost ground, all the programmers were divided into three shifts, and we were each scheduled to come in for our assigned hours. Being young and single, it was little hardship for me to take 3rd shift. We were all working so many hours, there was quite a bit of overlap anyway.<br /><br />That approach might have made sense if we didn't all have workstations of our own on our desks. It might have made sense if programming wasn't such a human activity, one that thrives on good interpersonal <a href="http://naturalsoftware.blogspot.com/2007/11/why-team-dynamics-matters.html">communication</a> among team members. It might have made sense if we were geographically distributed across time zones, and worked non-standard hours to maximize overlap. But none of those things were true.<br /><br />Dividing us into shifts was like a shiny pocket watch that a hypnotist takes out of his pocket to distract someone, in this case, upper management.
I'm pretty sure I expressed this simile to my tech lead, but we were both so bleary-eyed at the time I can't really be sure. If the decision makers could be duped by the illogic of such a bizarre work tactic, they probably didn't really understand why someone who thought like an experimental scientist would call for revising the milestone effort estimates.<br /><br />At least, that's my hypothesis.<br /><br /></rant>Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-9351732114041449842007-11-07T15:55:00.000-05:002007-11-07T16:42:12.041-05:00Passivating ThoughtsBefore a friend of mine became a brilliant linguist, he was a brilliant Linguistics student. I recall the two of us mulling over some thoughts about active and passive voice then. In English, the subject of an active voice sentence is the actor doing the action. <span style="font-weight: bold;">Geoff studies languages.</span> In passive voice, the subject and direct object get swapped. <span style="font-weight: bold;">Languages are studied by Geoff.</span> It turns out that most active voice sentences can be rewritten in passive voice and vice versa.<br /><br />However, there is an interesting class of active voice sentences that cannot be “passivated,” if I may coin that term. <span style="font-weight: bold;">He sank the boat to become a hero.</span> If we try to flip that around, it no longer makes any real sense. <span style="font-weight: bold;">The boat was sunk to become a hero.</span> It’s no longer correct. Surely the sinker and not the boat itself is the hero.<br /><br />There are similarly structured sentences that do admit passivation. <span style="font-weight: bold;">He sank the boat to collect the insurance.</span> This is nearly identical to the previous example. <span style="font-weight: bold;">The boat was sunk to collect the insurance.</span> That works! 
It’s clear that the boat is not the collector of the insurance. A million other examples flow off the tongue. <span style="font-weight: bold;">He drank the Jolt to postpone sleep.</span> We were in college after all. <span style="font-weight: bold;">The Jolt was drunk to postpone sleep.</span><br /><br />So, what’s so special about the hero example? Why does the passivation transformation sometimes fail?<br /><br />After kicking this around some, one of us noticed a difference in the sentences. The verb “collect” can be used in passive voice. <span style="font-weight: bold;">Insurance was collected.</span> But “become” is special. Without taking poetic license with the language, one cannot passivate become. <span style="font-weight: bold;">Hero was become</span>, is not correct. We then conjectured that sentences of the form above could be passivated if and only if their infinitival clause had a valid passive voice form.<br /><br />Armed with a conjecture, we thought we’d run a few more experiments and see how it bears up. There’s no way to say “was remain”, so we predicted that the following sentence could not be passivated. <span style="font-weight: bold;">He shredded the contract to remain a free agent.</span> And sure enough: <span style="font-weight: bold;">The contract was shredded to remain a free agent.</span> This sentence would imply that the contract itself was a free agent, but the active voice form does not. The attempt to passivate the verb “to shred” in this example fails because “to remain” admits no passive voice form.<br /><br />Other experiments also shore up the conjecture. <span style="font-weight: bold;">To dance all night, she chose comfortable shoes.</span> That has a clear meaning. She, and not the shoes, is doing the dancing. But attempting to passivate the sentence fails. <span style="font-weight: bold;">Comfortable shoes were chosen to dance all night.</span> Even if that might be a grammatically correct sentence, the meaning is warped. 
The shoes were not selected to dance (among other dancing candidates). It’s the dancer that dances. Our conjecture correctly predicts this because the construction “were danced” doesn’t make sense. (Although I could dance a jig, which is transitive, and a jig could “be danced,” the flavor of dance used above is intransitive, admitting no direct object, and no passive voice form.)<br /><br />We both celebrated with a <a href="http://www.joltenergy.com/default.aspx">Jolt</a>, but here is where the differences in the way Physics students and Linguistics students look at the world came into play. “We’re done,” I exclaimed. “We looked at the data, formed a conjecture, and tested it with more data. Write it up!” That our conjecture was interesting and useful was enough for me.<br /><br />My friend said something like, “No, we’re not done at all. Now we have to figure out why English obeys the conjecture. What forces could have driven the evolution of the language (or our minds, really) to behave this way? We can‘t just offer the conjecture without justifying it.”<br /><br />This notion floored me. It would never occur to me to ask <span style="font-style: italic;">why </span>F = GMm/r^2. That’s just a useful <a href="http://en.wikipedia.org/wiki/Newton%27s_law_of_universal_gravitation">law</a> that Newton discovered. That it works is enough. <a href="http://en.wikipedia.org/wiki/Hypotheses_non_fingo">Hypotheses non fingo</a>. So, it seemed incredibly ambitious and speculative to try to explain why the linguistics conjecture worked. I was of no further use.<br /><br />In summary, most English active voice sentences can be passivated without changing their meaning or rendering the new sentence ungrammatical. Putting on our Physics hat, we might say that they are invariant under the passivation transformation. However, there are a few sentences that are not, namely the ones with infinitival clauses that admit no passive form. 
This represents an interesting <a href="http://mathworld.wolfram.com/Symmetry.html">broken symmetry</a>.<br /><br />When <a href="http://www.refactoring.com/">refactoring</a> software, one improves the internal structure without breaking the desired external behavior. In the literature, I perceive a couple of approaches to this. Both are compatible, but have different emphases. <a href="http://www.martinfowler.com/">Martin Fowler</a> and others emphasize the importance of comprehensive unit tests that pass before and after modifications are made. <a href="http://csc.noctrl.edu/f/opdyke/">Bill Opdyke</a> and others champion the idea that source code can be transformed in specific, discrete ways that leave behavior unchanged.<br /><br />I feel that both approaches are important. The first approach admits the possibility that the unit test suite is not complete enough. A client somewhere might rely on some behavior that’s not checked by a unit test. So the refactoring attempt could fail. The second approach <span style="font-style: italic;">should </span>always work, but it limits the refactoring repertoire to those actions that your tool can do. (Unless you are incredibly meticulous and can confidently edit the code by hand yourself.)<br /><br />However, I wonder what subtle <a href="http://en.wikipedia.org/wiki/Broken_symmetry">broken symmetries</a>, analogous to the linguistics example above, might still exist when transforming source code according to our profession's ever-growing catalog of refactorings.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com2tag:blogger.com,1999:blog-3257506294674819641.post-291368371733075062007-11-06T08:59:00.000-05:002007-11-06T11:14:23.609-05:00On Being ImitatedA while back, I noticed that my son (who's nearly two) would hold out his arm and stare intently at the back of his wrist whenever someone asked "what time is it?"
No one taught him this; he just picked it up because he observed adults doing it. This struck me with the one-two punch of (1) "Wow he's so observant! That's wonderful!" followed by (2) "Oh-oh, what else is he learning by osmosis? I need to be more careful."<br /><br />I'm familiar with a piece of Java code that fires callbacks in response to receiving certain raw events. The idea is that the raw events themselves don't have enough information to be useful to the ultimate consumers. So this piece of code transforms the data, and then fires a more meaningful callback.<br /><br />Here's how it goes. The <span style="font-style: italic;">Transformer </span>class implements a pair of interfaces. One allows clients to register themselves as callbacks, so they can listen to meaningful events. The class maintains a collection of callbacks into which it fires the transformed messages. The other interface allows <span style="font-style: italic;">Transformer </span>to listen to the raw events. Here's some pseudo-code.<br /><span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"></span><span style="font-family: courier new;"><pre><br />public interface Callback<br />{<br />    void onEvent(String s);<br />}<br /><br />public interface Publisher<br />{<br />    void registerCallback(Callback c);<br />}<br /><br />public interface RawListener<br />{<br />    void onRawEvent(String s);<br />}<br /><br />class Transformer implements Publisher, RawListener<br />{<br />    private final List&lt;Callback&gt; allCallbacks = new ArrayList&lt;Callback&gt;();<br /><br />    public void registerCallback(Callback c)<br />    {<br />        this.allCallbacks.add(c);<br />    }<br /><br />    public void onRawEvent(String s)<br />    {<br />        String transformedString = this.transform(s);<br />        for (Callback c : this.allCallbacks)<br />        {<br />            c.onEvent(transformedString);<br />        }<br />    }<br /><br />    private String transform(String s)<br />    {<br />        // details omitted<br />    }<br />}<br /></pre></span><br />So, clients of the <span style="font-style: italic;">Publisher </span>interface that are interested in receiving events implement the <span style="font-style: italic;">Callback </span>interface, then add themselves into the <span style="font-style: italic;">Transformer</span>'s collection. Some other class fires events into the <span style="font-style: italic;">RawListener </span>interface. The idea is that the implementors of <span style="font-style: italic;">Callback </span>don't have to know anything about the <span style="font-style: italic;">RawListener </span>or the format of the raw events.<br /><br />However, there's a difference between "don't have to know" and "shouldn't know". If the raw event format is subject to change, then we really don't want the implementors of <span style="font-style: italic;">Callback </span>to depend on that in any way. The problem is that once a client has their mitts on a <span style="font-style: italic;">Publisher</span>, they can cast it into a <span style="font-style: italic;">RawListener</span>. Like a population of frogs expanding to fill a new niche in the ecosystem, living code will exploit that.<br /><pre><br /><span style="font-family: courier new;">void cleverAndRisky(Publisher p)</span><br /><span style="font-family: courier new;">{</span><br /><span style="font-family: courier new;">    RawListener listener = (RawListener) p;</span><br /><span style="font-family: courier new;">    String rawEvent = // details omitted</span><br /><span style="font-family: courier new;">    listener.onRawEvent(rawEvent);</span><br /><span style="font-family: courier new;">}</span><br /></pre><br />In <a href="http://www.pluralsight.com/blogs/dbox/">Don Box</a>'s excellent book, <span style="font-style: italic;">Effective COM</span>, he offers some compelling arguments about the dangers of the <span style="font-style: italic;">QueryInterface </span>method, which is basically a cast. (Incidentally, I think the first chapter of his book is among the finest technical writing I've ever read.)
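To make the hazard concrete, here is a minimal, self-contained sketch of the design above. The <span style="font-style: italic;">transform</span> body and the <span style="font-style: italic;">BackDoorCheck</span> class are my own stand-ins (the post omits those details), but the shape of <span style="font-style: italic;">Transformer</span> is as described: any client handed only a <span style="font-style: italic;">Publisher</span> can still discover the raw side.

```java
import java.util.ArrayList;
import java.util.List;

interface Callback { void onEvent(String s); }
interface Publisher { void registerCallback(Callback c); }
interface RawListener { void onRawEvent(String s); }

// The original design: one class implements both interfaces.
class Transformer implements Publisher, RawListener
{
    private final List<Callback> allCallbacks = new ArrayList<Callback>();

    public void registerCallback(Callback c) { allCallbacks.add(c); }

    public void onRawEvent(String s)
    {
        String transformed = transform(s);
        for (Callback c : allCallbacks) { c.onEvent(transformed); }
    }

    // Stand-in body; the real transformation is omitted in the post.
    private String transform(String s) { return "meaningful:" + s; }
}

class BackDoorCheck
{
    // A client holding only a Publisher can still detect -- and then cast to --
    // the RawListener side, coupling itself to the raw event format.
    static boolean canReachRawSide(Publisher p)
    {
        return p instanceof RawListener;
    }
}
```

Under the inner-class version later in this post, the same check comes back false, and a blind <span style="font-style: italic;">(RawListener)</span> cast fails at run time instead of silently handing out the back door.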
Whenever a class implements interfaces used for different purposes, one runs the risk of a client writing brittle code.<br /><br />I've grown fond of using <a href="http://java.sun.com/docs/books/jls/third_edition/html/classes.html#8.1.3">Inner Classes</a> to attack this problem. Consider an improved <span style="font-style: italic;">Transformer </span>implementation below. With this new approach, the <span style="font-style: italic;">cleverAndRisky </span>method above won't ever work. This doesn't complicate the <span style="font-style: italic;">Transformer </span>code too much, and it sets a good example of paying careful attention to what gets exposed.<br /><pre><br /><span style="font-family: courier new;">class Transformer implements Publisher</span><br /><span style="font-family: courier new;">{</span><br /><span style="font-family: courier new;">    private final List&lt;Callback&gt; allCallbacks = new ArrayList&lt;Callback&gt;();</span><br /><span style="font-family: courier new;"></span><br /><span style="font-family: courier new;">    public void registerCallback(Callback c)</span><br /><span style="font-family: courier new;">    {</span><br /><span style="font-family: courier new;">        allCallbacks.add(c);</span><br /><span style="font-family: courier new;">    }</span><br /><span style="font-family: courier new;"></span><br /><span style="font-family: courier new;">    // Only this private inner class sees raw events; an instance of it,</span><br /><span style="font-family: courier new;">    // not the Transformer itself, is handed to the raw event source.</span><br /><span style="font-family: courier new;">    private class RawListenerImpl implements RawListener</span><br /><span style="font-family: courier new;">    {</span><br /><span style="font-family: courier new;">        public void onRawEvent(String s)</span><br /><span style="font-family: courier new;">        {</span><br /><span style="font-family: courier new;">            String transformedString = transform(s);</span><br /><span style="font-family: courier new;">            for (Callback c : allCallbacks)</span><br /><span style="font-family: courier new;">            {</span><br /><span style="font-family: courier new;">                c.onEvent(transformedString);</span><br /><span style="font-family: courier new;">            }</span><br /><span style="font-family: courier new;">        }</span><br /><span style="font-family: courier new;">    }</span><br /><span style="font-family: courier new;"></span><br /><span style="font-family: courier new;">    private String transform(String s)</span><br /><span style="font-family: courier new;">    {</span><br /><span style="font-family: courier new;">        // details omitted</span><br /><span style="font-family: courier new;">    }</span><br /><span style="font-family: courier new;">}</span><br /></pre><br />Now, I'm not saying that you have to treat users of your code like two-year-olds, who will poke their fingers into every dangerous socket you leave open. Programmers are a pretty smart bunch. But I am suggesting that the public part of an API should be given careful thought. It's a subtle point, but the public API includes everything clients can cast my interfaces into.<br /><br />Keeping track of this sort of thing sets a good example. Be mindful of what you do when you are in a leadership position. Take pride when the troops imitate you.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0tag:blogger.com,1999:blog-3257506294674819641.post-9061949269139354052007-11-05T05:18:00.000-05:002007-11-05T05:23:25.279-05:00Noether's TheoremIn my last semester as an undergraduate Physics concentrator, I took an initially promising class that turned out to be very depressing. In it, we learned that energy is not conserved. Now, everyone knows about the law of conservation of energy. The patent office has even been known to deny patents on the grounds that purported inventions resemble <a href="http://en.wikipedia.org/wiki/Perpetual_motion">perpetual motion machines</a>, which would violate the principle. And yet, much in the way that Newton’s laws fail in an Einsteinian cosmos, energy is not really conserved.<br /><br />Here’s how it goes. It’s well accepted that the universe is expanding. This is a colloquial way to express the more precise notion that the distances between everything are getting larger. Space is not expanding into anything larger, in the way that a cake fills up the volume of an oven.
We don’t notice this effect because our lives are incredibly short and because forces like gravity and electrostatics hold familiar objects together despite the expansion.<br /><br />It’s also well known that light propagates in waves, and that the energy in a beam of light depends on the wavelength. More energetic waves have shorter wavelengths. Well, consider a ray of light traveling along in vacuum for a very long time. If left alone long enough, the distances between the crests of its wave will increase because of the expansion of the universe. This increases the wavelength, and robs the beam of energy. Put whimsically, even light gets tired as it ages.<br /><br />Our little group was shaken by this line of reasoning. The course was an elective, and attended by maybe a dozen curious students. I don’t think we could recall a single problem set endured over the years that didn’t rely on energy conservation somewhere. So we just sat there for a few moments digesting this idea. Finally one of my study partners spoke up. “Professor, you’ve just undone the last four years of our lives,” he managed to get out.<br /><br />Some years later, as a graduate student (in Mathematics, no less), I came to understand a far deeper principle. <a href="http://en.wikipedia.org/wiki/Noether%27s_theorem">Noether’s theorem</a> marries conservation laws to symmetries. Symmetry here has a specialized meaning that’s richer than the layman’s definition. If a deep symmetry can be found in nature, then some observable quantity must be conserved.<br /><br />Specifically, if an experiment performed today would demonstrate the same behavior if performed tomorrow, then we say that the laws of Physics are invariant under translation in time. Invariance under time translation is an entirely reasonable and rather timid assumption. It’s an example of a symmetry (in the mathematical sense). Noether’s theorem tells us that this symmetry implies the law of conservation of energy.
Other symmetries imply other conservation laws.<br /><br />Armed with this understanding, I no longer saw conservation laws as fundamental notions; they became natural consequences of mild assumptions about the world. This happy discovery more than made up for the earlier depressing one.<br /><br />With this more profound perspective, it’s not so upsetting to contemplate that energy might not be conserved over time scales that are non-trivial fractions of the age of the universe. In fact, we might even expect it! We’d have to start thinking up <a href="http://en.wikipedia.org/wiki/Thought_experiment">gedankens</a> that violate conservation, like the aging light beam above.<br /><br />Why has all this come to mind when thinking and blogging about software architecture?<br /><br />It comes to mind because over time I’ve found that some “best practices” that I’d embraced previously only make sense in limited contexts. This is a depressing discovery, akin to the feelings I had as an undergraduate described above. It suggests that we can only hope to architect systems on a sandy intellectual foundation.<br /><br />However, whether or not a practice is best, or even good, is not happenstance. Rather, it’s a consequence of some deeper principle when certain assumptions are applied. The analogy to Noether’s theorem is too close not to be struck by it. This is a happy discovery, and I plan to expand on this idea and offer a concrete example or two in subsequent posts.Morgan Creightonhttp://www.blogger.com/profile/10229228897201330840noreply@blogger.com0
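For readers who want to see the machinery behind the energy case, here is the standard one-dimensional sketch (textbook material, not something from the post above). For a Lagrangian $L(q, \dot{q}, t)$, define the energy function and differentiate:

```latex
E \;=\; \dot{q}\,\frac{\partial L}{\partial \dot{q}} \;-\; L,
\qquad
\frac{dE}{dt}
  \;=\; \dot{q}\left(\frac{d}{dt}\frac{\partial L}{\partial \dot{q}}
        \;-\; \frac{\partial L}{\partial q}\right)
        \;-\; \frac{\partial L}{\partial t}
  \;=\; -\,\frac{\partial L}{\partial t}
```

The parenthesized term vanishes by the Euler–Lagrange equation, so if $L$ has no explicit time dependence (time-translation symmetry), then $dE/dt = 0$ and energy is conserved. An expanding universe breaks exactly that symmetry, which is why the aging light beam can lose energy without contradiction.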