Smart match versus hash deathmatch

By brian d foy on October 20, 2010 11:53 PM

A couple of days ago, I posted about my answer to a Stackoverflow question asking about the speed of the smart match operator. Smart matches looking for a scalar in an array, can short circuit, so they are pretty speedy.

The trick, however, is if this sort of smart match is faster than a hash lookup. Yes they are and no they aren't. I've updated my Stackoverflow answer with additional benchmarks and a new plot.

Smart matches are faster if you have to create the hash, but slower if you already have the hash.

There's a middle ground in between that I don't care to find. For some number of searches of the hash, the cost of creating the hash amortizes enough that it's faster than a smart match.

It depends on what you are doing, but that's the rub with every benchmark. The numbers aren't the answers to your real question, in this case "Which technique should I use?". They only support a decision once you add context.

6 comments

6 Comments

Phred | October 21, 2010 6:33 PM | Reply

Out of curiosity, why aren't you posting the results here and directing those on stackoverflow to this blog?

brian d foy | October 21, 2010 9:38 PM | Reply

I answer your question on this other website where you didn't ask the question.

Phred | October 21, 2010 10:42 PM | Reply

Thanks - part of me has been wondering where people are talking about Perl now with the lack of write access to use.perl.org.

Phred | October 21, 2010 11:00 PM | Reply

Perhaps I should clarify a bit more. I've started to notice a trend of syndication, the first instance being with posterous.com. I would see tweets which link to posterous.com, which would have a (almost hidden) link to follow to the actual content.

So I've been trying to get a better understanding of why people are syndicating, and if they are posting abridged versions on the intermediate sites (in this case blogs.perl.org, posterous in other cases), and why they are doing that instead of just linking directly to the end site (in this case stackoverflow) with the content.

Normally I would have expected to see an answer like this posted on Perlmonks.org, but that site is so slow that I don't visit it anymore. I was wondering why you chose stackoverflow - and I guess the answer is that is where the question was posted in the first place. Part of me is curious about the psychology of why more Perl Q&A isn't here on blogs.perl.org, but it seems like stackoverflow is growing in popularity, and more suited to this type of conversation.

Hope that makes sense :)

brian d foy | October 21, 2010 11:06 PM | Reply

I don't understand your question. I didn't choose StackOverflow. I didn't make a "choice". There's no exclusion by me reading or posting to StackOverflow. I reject the assumption that I have to limit what I read or where I post.

The questioner chose StackOverflow to ask his question. I don't know why he chose that. If he asked it on Perlmonks, the answer would be on Perlmonks.

Phred | October 22, 2010 1:58 AM | Reply

Sorry for getting off topic with my comments - scratched an itch that didn't belong here.

On topic though, thanks for posting this - good read. Interesting that the search is context dependent, but makes sense when I think about it. If you have to repeatedly search for an element, it might be worth ordering the hash specifically for the search you are running so that you're always looking in the beginning of the dataset, assuming the cost of ordering is relatively small.

Name

Email Address

URL

Remember personal info?

Comments (You may use HTML tags for style)

About brian d foy

I'm the author of Mastering Perl, and the co-author of Learning Perl (6th Edition), Intermediate Perl, Programming Perl (4th Edition) and Effective Perl Programming (2nd Edition).

More info »

brian d foy