Those are better numbers in that the degradation between the opennsd and tclsh tests is about the same for proc and inline. In other words, each opennsd test is about 1/2 the speed of the standalone test. And the proc case is roughly 1/8th faster than inline in each case (rather than 4x faster as in Alex's results).
While I don't know why you guys get such different numbers, this last set is much more in line with my expectations.
You're right about the threaded vs. non-threaded issue. I'm surprised it would cause that level of degradation, but not surprised that the interpreter would run slower to some degree.