Non tail-recursive functions in core.async go blocks

2013-07-17 clojure core.async go

I’ve been using various Go examples and tutorials to take a deeper look into core.async. The CSP pattern is very interesting and powerful; it’s a good move for Clojure to “throw in” with Go and push this style of programming.

core.async works at the s-expression level, where some other JVM solutions (Kilim, Pulsar) do the same at the byte-code level. The main benefit of doing these transforms at the s-expression level is that they are applicable to ClojureScript, where CSP can be a very neat way out of callback hell. David has written about this.

The go macro

Now, one limitation of the go macro is that it can’t “look into” other functions or closures. This difference stands out quite clearly when reading Go code where you can put “go” in front of function calls.

(defn put-all! [vs ch]
  (doseq [v vs] (>! ch v)))

(let [ch (chan)]
  (go (put-all! [42 84] ch))
  [(<!! ch) (<!! ch)])

This causes the Assert failed: >! used not in (go ...) block error. In this case it’s easily fixed, simply move the (go ...) embrace into the put-all! function or inline the body of the function.

(let [ch (chan)]
  (go (doseq [v [42 84]] (>! ch v)))
  [(<!! ch) (<!! ch)])

Please note that inlining an anonymous function does not work (fails with the error as above)

(let [ch (chan)]
  (go ((fn [vs] (doseq [v vs] (>! ch v)))
       [42 84]))
  [(<!! ch) (<!! ch)])

However, if the function you’re inlining is tail recursive, you can use the approach above by using the loop/recur form (which the go macro can translate).

(let [ch (chan)]
  (go (loop [[f & r] [42 84]]   ;; equivalent to (doseq)
        (when f (>! ch f) (recur r))))
  [(<!! ch) (<!! ch)])

I.e. it’s possible to simulate putting go around a function call by inlining it’s body – if it’s tail recursive (or not recursive at all).

Non-tail recursive functions

Now for a more involved example, we want to walk a binary search tree and put the values on a channel. When all results have been put on the channel, we want to close it.

Here’s a simple version of the walker (using the thread blocking >!! call).

(defn walk [tree ch]
  (letfn [(walker [t]
            (when t
              (walker (:left t))
              (>!! ch (:value t))
              (walker (:right t))))]
    (walker tree)
    (close! ch)))

Then we can use the same function to see if 2 trees are identical;

(defn same [t1 t2]
  (let [ch1 (chan)
        ch2 (chan)
        drain #(loop [v (<!! %) res []]
                 (if v (recur (<!! %) (conj res v)) res))]

    (thread (walk t1 ch1))
    (thread (walk t2 ch2))

    (= (drain ch1) (drain ch2))))

How do we convert this to use go blocks? We ideally want to have the entire walker function done by one go process, so that we can close the channel when the that function returns. Embracing the body of that function with (go ..) gives us the problem that the channel immediately closes before we get a chance to put anything on it.

(defn walk [tree ch]
  (letfn [(walker [t]
            (go
             (when t
               (walker (:left t))
               (>! ch (:value t))
               (walker (:right t)))))]
    (walker tree)
    (close! ch)))

Also, we’ll get a new go process for every node in the tree (every recursive call), so not all processing is done in “one place”.

Ok, let’s try to inline the body of the walker function so we can call close! when it’s done.

(defn walk [tree ch]
  (go
   ((fn walker [t]
       (when t
         (walker (:left t))
         (>! ch (:value t))
         (walker (:right t))))
    tree)
   (close! ch)))

Hm, this won’t work because of the restriction of the go macro described above. If the walker function would have been tail recursive, we could have used the loop/recur form where the ((fn ..)) is and this approach would work, but not in this case.

Let’s give up the idea of using one go process per tree and see if we can’t use some kind of synchronisation to not call close! too early.

(defn walk [tree ch]
  (letfn [(walker [t]
            (go
             (when t
               (walker (:left t))
               (>! ch (:value t))
               (walker (:right t)))))]
    (go
     (<! (walker tree))
     (close! ch))))

Here we put the walker and close! calls in another go block and then we wait for the call to walker to “finish” before moving on to close the channel. What it means for the “walker to finish” is that the go block for the top node of the tree terminates (since every node in the tree will have it’s own go block).

This looks promising, but the results in the channel can be in any order (since there are no order guarantees in the scheduling of go processes) – this also means that some of the values might be missing since the “top” go process can be scheduled before a child one.

We need a little bit more synchronisation to arrive at a working solution.

(defn walk [tree ch]
  (letfn [(walker [t]
            (go
             (when t
               (<! (walker (:left t)))
               (>! ch (:value t))
               (<! (walker (:right t))))))]
    (go
     (<! (walker tree))
     (close! ch))))

Now all the nodes will be processed (by different go processes) but in the correct order.

The complete solution can be found here, note that the (thread ...) embrace in the same function has been removed.

Postscript

Some more Go tutorials converted to core.async

A final word of advice: when converting Go examples to core.async, remember not to do Thread/sleep in your go blocks! In Go, the sleep function is integrated into the go routine scheduling; this is not the case in core.async. See here for a deeper explanation of why.