Is this another “Perl indistinguishable from line noise” argument? Don’t know …

… but I do know that the analysis has some … er … flaws. Yeah. Flaws.

I’ll ignore their sample size issue for the moment (though it does go to the size of their error bars … I hope they appreciate the inverse functional relationship between these two).

Take two sets of data with error bars. Put them down on the same graph. The data from each set overlaps within the error bars of the other set.

Are these distinguishable?

No. They aren’t.

Can you draw any meaningful distinction between these data points?

No, you cannot.

Yet, somehow, this is what they did.

Yeah, I know. Lies, damn lies, and statistics. I’ve seen (and corrected/critiqued) my fair share of the same.

They invent a programming language called Randomo. In it, they literally use random characters. They then have novices program in 3 languages, their Quorum, Perl and this Randomo. At the end of the process, the results are scored, analyzed, etc. Charts generated. Error bars drawn.

And thats when they make some pretty classic errors. They take a set of data which is effectively within the error bars of another set of data (its symmetric in this manner actually), and they claim that they can distinguish. Really? Their data, and the analysis say otherwise.

And then, this is where I note the tedious aspect of this … they attack Perl as being, literally, indistinguishable from a line noise language. One they call Randomo. And they do so with a poor interpretation of their data that doesn’t actually support their conclusions.

Yeah.

I took the time to point this out. I was nice about it, I didn’t fisk it, or rip them as others did on this and the other site.

It is kind of sad to see this, as this study, and the concept behind it have potential merit. But they got lost somewhere along the way. Putting out a white paper on this as the beginning of an ongoing study, indicating that preliminary data was available for the first group … yeah, that would have been worth while. This paper … maybe not so much. It looks like a conference submission of some sort.

There are more than a few folks out there who still like to bash on Perl, still think thats cool. You know, its indistinguishable from a line noise language and this paper “proves” it. Thats why this is more dangerous than the usual fanboi tedious arguments. It has something akin to academic heft behind it.

The way I look at it, I am unaware of another language with anything close to Perl’s CPAN library scope/size/variety. R has a far smaller variant (CRAN), there have been passing attempts at this for Python and others. Octaveforge is cool. But the mere fact that there is such a huge resource

The Comprehensive Perl Archive Network (CPAN) currently has 100,828 Perl modules in 23,628 distributions, written by 9,299 authors

… yeah, this rather belies the fundamental point of anyone bashing the language as being indistinguishable from randomly chosen characters. If it were so bad, then why would so many people spend so much time and effort building such useful things from it? Wouldn’t they obviously switch to an easier system? The authors suggest Java (no really, stop laughing) as being easier.

Yeah, I wasn’t impressed with the analysis. They could have done better. And their potshots at Perl, apart from being unwarranted, weren’t supported by their own data. Won’t push them to retract. Did push them to get more data so they can reduce the size of their error bars. And include many more languages in their study.

Viewed 44126 times by 7171 viewers

2 thoughts on “Is this another “Perl indistinguishable from line noise” argument? Don’t know …

  1. Re: comparisons to CPAN — PECL and PEAR are pretty large, and a lot of the items have been so useful (APC, et. al.) that they’ve gotten rolled into PHP’s core, which tends to bring the numbers back down a bit.

    The suggestion to use Java? That’s proof of delusion right there. There is nothing that your average sysadmin hates more than being handed something written in java to run. Java, and java-based apps, are the number one pain in a linux sysadmin’s posterior, no matter how well the understand the language and stack.

  2. @Karl

    My bad in not thinking of PHP. Honestly I do very little there, but this is by history more than anything else. PEAR has been quite useful to me in the past, and PECL I played with a little. None of these are of the size/scope of CPAN, but for such similar languages, with similar issues, yeah, these sorts of code bases would suggest that mebbe novices can learn the language.

    I guess I am really surprised that they didn’t try it in R or Octave. R has a somewhat more … obscure … syntax, though it is very powerful. Octave is a Matlab-like environment, and this is all the rage, not just in scientific circles … we have customers looking to run code with this on large machines.

    The study can have value, its just it needs to avoid taking potshots, and leaping to conclusions that aren’t well supported by their own data.

    I’d even offer to critique their next set of papers if they would like, before publishing. Good science means understanding and appreciating the critique, deciding if/how to respond and then iterating the paper if needed. Ignoring fundamental criticism (sample size, etc.) would be profoundly wrong. Disagreeing with criticism is possible. I’ve had my own … animated discussions … with reviewers before.

Comments are closed.