Discoverable tests and creating testing standards

By Ovid on March 15, 2013 10:57 AM

I like writing code. I like writing tests. I don't like:

Trying to figure out where the tests are
Writing boilerplate
Finding yet another package without tests

That last one is particularly vexing when you discover that your code is failing because another package doesn't load, but it doesn't have tests. So I fixed that.

Some of what follows is a repeat of things written in previous posts, but it's important enough that it bears repeating.

A couple of months ago I suggested we create "discoverable" tests at work. In short, many of us face the problem of trying to figure out where the tests are in a new project. Worst case is this:

$ ls t/*t | wc -l
312

And you're sitting there weeping because the nice, orderly code in your lib/ directory is tested by a pile 'o junk. Other projects have subdirectories, but those subdirectories often group things by behavior, not package; your tests for lib/Foo/Bar/Baz.pm wind up in t/common/exceptions/, t/common/standards/, t/database/, t/miscellaneous/crap/tests/. Even if you memorize all of those subdirectories, you find yourself adding code that can, arguably, be in multiple subdirectories.

My recommendation was that our modules have one corresponding .t file. The agreement reached was that a module like lib/Foo/Bar.pm would go into t/Foo/Bar.t. I also have the following in my .vimrc:

noremap ,gg :call GotoCorresponding(expand("%"))<cr>                                                                                     

function! GotoCorresponding(module)                                                                                                       
    let file = system("get_corresponding ".a:module)
    if !empty(file)
        let ignore = system("perl /home/cpoe/bin/make_test_stub ".a:module." ".file)
        execute "edit " . file
    else
        echoerr("Cannot find corresponding file for: ".a:module)
    endif
endfunction

The get_corresponding code merely returns lib/Foo/Bar.pm for t/Foo/Bar.t (and vice versa). It's trivial to write. However, the make_test_stub is what I added today. It's a hack (as many good experiments are!), but it does the following:

Exit if we're not about to enter a .t test.
Exit if the .t test exists
Write a subtest for every public function

It looks like this:

use strict;
use warnings;
use autodie ':all';
use Class::Inspector;
use Sub::Information;

my ( $package_file, $test_file ) = @ARGV;

my $package = $package_file;
$package =~ s{\.pm$}{} or exit;
$package =~ s{^lib/}{};
$package =~ s{/}{::}g;

exit if -e $test_file;

eval "use $package";
die $@ if $@;

open my $fh, '>', $test_file;
print $fh <<"END";
use Test::Most;
use $package;

END

my $functions = Class::Inspector->function_refs($package);

my @functions;
foreach my $function (@$functions) {
    my $info = inspect($function);
    my $name = $info->name;
    next if $name =~ /^_/ or $name =~ /^[[:upper:]_]+$/;
    next if $info->package ne $package;
    push @functions => $name;
}

foreach my $function (@functions) {
    print $fh <<"END";
subtest "Verify $function" => sub {
    can_ok '$package', '$function';
};

END
}

print $fh "done_testing;\n";

So now, if I edit a file named lib/Weborama/Collect/Component/Publisher.pm and it has no tests, I hit ,gg in vim and get this:

use Test::Most;     
use Weborama::Collect::Component::Publisher;

subtest "Verify code" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'code';
};

subtest "Verify halt" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'halt';
};

subtest "Verify has_landing_url" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'has_landing_url';
};

subtest "Verify headers" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'headers';
};

subtest "Verify landing_url" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'landing_url';
};

subtest "Verify new" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'new';
};

subtest "Verify response" => sub {
    can_ok 'Weborama::Collect::Component::Publisher', 'response';
};

done_testing;

This has a few benefits:

You start out with working code rather than an empty test file.
Even for modules that have no tests, you'll at least know that you can "use" the module.
Even if you don't update the stub test, you'll get a test failure if someone removes part of the public API.
It's harder to forget to test a particular public function.
Continuing to evolve standards for testing can make our test suites easier to manage.

So I used this to quickly generate some stub tests for one of our projects and a few minutes later I had these extra tests in the test suite (I thought it prudent to obscure these test names):

$ git st --porcelain|grep ^A|cut -d' ' -f3|xargs prove -l
t/xxxxxxxx/xxxxxxx/xxxxxxxxx/xxxxxxxxx.t.................... ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxx/xxxxxxxxx/xxxx/xxxxxxxx.t...... ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxx/xxxxxxxxx/xxxx/xxxxxxxxxxxx.t.. ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxx/xxxx.t......................... ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxxxxxxx.t......................... ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxxxxxxx/xxxx.t.................... ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxxxxxxx/xxxx/xxxxxxxxxx.t......... ok
t/xxxxxxxx/xxxxxxx/xxxxxxxxxxxxxx/xxxx/xxxxxxxxxxxxxx.t..... ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxx/xxxxxxxxx.t................ ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxx/xxxxxxxxx/xxxxxx.t......... ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxx/xxxxxxxxx/xxxxxxxxxx.t..... ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxx/xxxxxxxxx/xxxxxxxxxxx.t.... ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxx/xxxxxxxxx/xxxxxxxx.t....... ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxx/xxxx/xxxxxxxx.t............ ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxxxxxxx.t........................ ok
t/xxxxxxxx/xxxxxxx/xxxxx/xxxx.t............................. ok
All tests successful.
Files=16, Tests=53,  7 wallclock secs ( 0.06 usr  0.01 sys +  6.48 cusr  0.28 csys =  6.83 CPU)
Result: PASS

Naturally, not all tests can be shoe-horned into this one-size fits all methodology, but shortly after I recommended this at the BBC, devs noticed it was much easier to work with the test suite. At my current position, I'm finding that other projects have adopted this pattern and it makes switching between repositories and getting up to speed much easier.

This idea is just an experiment, but I'd love to hear feedback and suggestions.

If you liked this post, don't forget to check out my Beginning Perl book.

12 comments

Tagged as:

perl, testing, tests

12 Comments

Brian Wisti | March 15, 2013 1:11 PM

I like it. Reminds me of ZenTest for Ruby - specifically, the material in this article. It's been a while since I used ZenTest, though. The resemblance may be coincidental.

Ovid replied to comment from Brian Wisti | March 15, 2013 3:10 PM

Brian: completely coincidental. The directory layout I got after noticing how easy it is to find tests when using Test::Class. The stub tests are something I wanted to do for a while and I hacked them together in a few minutes.

Interestingly, for anyone interested in this approach, I should mention that I've twice migrated large, legacy test suites to this new test layout. It takes almost no time at all (well, if you have my vim setup it doesn't :) and the payoff is immediate.

Steven Haryanto | March 15, 2013 4:38 PM

Pretty straightforward recommendation, though not applicable in all cases. Sometimes I organize tests not by module but by case groups. There are also abstract classes and roles which might not be tested at all (heretic!).

Steven Haryanto | March 15, 2013 4:41 PM

*... which might not need to be tested ...

confuseAcat | March 15, 2013 5:12 PM

Very nice.

Here's a little more vimscript, also including the GetCorresponding() function: https://gist.github.com/mannih/16c75db8a94f73c2f75d

Johan Lindstrom | March 15, 2013 8:42 PM

See also Devel::CoverX::Covered for another take on the problem of finding the way in a sprawling test suite.

Devel::CoverX::Covered extracts and stores the relationship between covering test files and covered source files from a Devel::Cover cover_db.

So you can jump from source to test files vice versa. In Emacs, you can also have it highlight sub coverage.

Curtis, I wrote that after investigating Devel::Cover during a Gold Card day when we both worked in Pips (a project at the Beeb). I'm surprised you didn't give that a try; it sounds like it would have been helpful in your situation.

Ovid | March 16, 2013 2:00 PM

Johan: Yes, I love Devel::CoverX::Covered. It's complementary to this technique. I've written about it before, but I should do that again.

mlawren.myopenid.com | April 3, 2013 10:27 PM

It seems that the glob used by MakeMaker when running "make test" for finding tests is "t/*.t", which doesn't discover the tests in subdirectories. Does anyone know of a way of changing that?

Michael G Schwern | April 5, 2013 6:14 PM

I like the naming convention, I try to do that on my projects. Usually I go with one test file per method, especially in untested code methods to too much, so testing Foo::Bar->thing would be t/Foo/Bar/thing.t.

I've tried the can_ok stubs before, it's why can_ok takes a list of methods, and I found it has major drawbacks. It gave the illusion of more test coverage than really existed (caveat, this was before Devel::Cover).

For a dev team that isn't sold on the value of testing, it trained the developers to not trust the tests when they passed. There were a lot of .t files, and they passed, but that didn't tell them much. They had to manually look at each .t file to see if it actually tested anything. It was better to not have a .t file at all and know they were playing without a net.

As for the benefits, "even for modules that have no tests, you'll at least know that you can 'use' the module" is better accomplished with a single t/00compile.t. It runs first and fails fast.

In my particular case, because modules were heavily interdependent and slow to load, it made the test suite very, very slow. Another strike against the value of testing to a team already unconvinced.

The only concrete one is "even if you don't update the stub test, you'll get a test failure if someone removes part of the public API" but IMO it isn't worth creating all those stubs and all those false positive passes.

What is nice about those stubs is it encourages good scoping. However, I'd see more value in an editor macro to expand "method_name" into a subtest stub.

Michael G Schwern replied to comment from mlawren.myopenid.com | April 5, 2013 6:15 PM

I add this to my WriteMakefile.

test        => {
    TESTS => 't/*.t t/*/*.t t/*/*/*.t',
},

mlawren.myopenid.com replied to comment from Michael G Schwern | April 6, 2013 3:01 AM

After a big search through the (unfortunately not well documented) Module::Install code I found the "tests_recursive" function that does the same thing to the required depth automatically.

Ovid replied to comment from Michael G Schwern | April 8, 2013 8:45 AM

Michael,

Thanks for the thoughtful comments. I'll have to take that into consideration.

About Ovid

Freelance Perl/Testing/Agile consultant and trainer. See http://www.allaroundtheworld.fr/ for our services. If you have a problem with Perl, we will solve it for you. And don't forget to buy my book! http://www.amazon.com/Beginning-Perl-Curtis-Poe/dp/1118013840/

More info »

Ovid