blogs.perl.org

[FAILED] 10x faster than LibXML

By yko on May 1, 2013 4:45 PM

Unfortunately, the idea contained fatal flaw. See the following post for explaantions.

Once upon a time I faced a huge pile of HTML files which I had to analyze. Say, there were about 1 000 000 of them. Say, 100 Gb of data.
Most of you would say “It’s not that much!”. And you are right. It’s not.

But then I’ve decided to estimate time required to process that pile of files. I quickly put XPaths of what I was needed together and got a prototype in Web::Scraper. And here I go: ~0.94s per file, i/o overhead not included. That occurred more than 11 days on my laptop. Phew!

5 comments

Why You Should Help Crowd-fund Pinto

By Sawyer X on May 1, 2013 11:13 AM

There are only a few more days left to chip in to sponsoring work on Pinto. If you're still unconvinced or haven't thought about it yet, let me give my point of view on why you should spare a few minutes and a few bucks to sponsor Pinto.

2 comments

Chicago.PM - Beyond grep - Expanding the Programmer Toolset

By preaction on May 1, 2013 12:45 AM

Last week, Andy Lester (author of Land the Tech Job You Love) came to talk about tools to help programmers work more efficiently and the 2.0 release of his Ack search tool.

1 comment

Contribute to Pinto through Paypal or Flattr

By brian d foy on April 30, 2013 5:53 PM

My experiment to crowd fund Jeff Thalhammer's Pinto development is going well. It's 87% of the way there. We need $503 to reach the campaign minimum. We have a week left to get that remaining 13% to get the campaign to "tilt", and I think we can get even more than that. Our secondary money goal is $5,000, all of which goes to Jeff to work on open source features of Pinto. I like $6425 (two perfect squares next to each other). That's 0b0001100100011001 (repeats the bit pattern) or 0x1919 (repeated, and the same prime next to itself).

On Monday Sean Quinlan became the 100th contributor. We have a week left to get 128 contributors. Part of the experiment is to get as many people involved as we can, at any level. I don't care how much you donate: a $1 donation is just as good as $100 when we are counting contributors.

0 comments

Padre 0.98 has FINALLY been released.

By Peter Lavender on April 30, 2013 1:25 PM

Padre, the Perl IDE, is the work of a number of people with the goal of creating an IDE written in Perl itself.

Padre 0.98, according to the Release History page has finally been released 1 year and 1 week after 0.96.

This is a long time between releases. In part this can be put down to me as the Release Manager. As things go, we all have interests and busy times in our lives that can take us away from projects that we give up our free time to contribute to. For me, it's been a case of discovering photography. So instead of looking at code I'm looking at images I have taken.

This has meant, that try as I might, I never focused back on Padre and releasing Padre enough to get the new version out the door.

Well, tonight Padre 0.98 finally hits PAUSE.

1 comment

Small but hideously formed

By David Cantrell on April 29, 2013 3:07 PM

For several years I've been using a shell script that runs once a day from a cron job, and automagically subscribes me to the RSS feeds that rt.cpan creates for my modules. That means that whenever I release a new module, within a day or two I'll be subscribed to its bug reports, and within another day or so I'll start getting bug reports automatically emailed to me.

At some point RT got upgraded and my script broke. When I became aware of it, I fixed it, and I also put it on github. I hope you find it useful.

To run it you will need to set the RTUSER and RTPASS environment variables and possibly edit the variables at the top of the script. I assume that you use rss2email for reading RSS feeds. Use of any other tool for RSS is a bug, but if you wish to be buggy I'm sure you can work around that.

1 comment

Permalink

Is Earley parsing fast enough?

By Jeffrey Kegler on April 29, 2013 12:28 PM

"First we ask, what impact will our algorithm have on the parsing done in production compilers for existing programming languages? The answer is, practically none." -- Jay Earley's Ph.D thesis, p. 122.

[ This is cross posted from its home at the Ocean of Awareness blog.

In the above quote, the inventor of the Earley parsing algorithm poses a question. Is his algorithm fast enough for a production compiler? His answer is a stark "no".

This is the verdict on Earley's that you often hear repeated today, 45 years later. Earley's, it is said, has a too high a "constant factor". Verdicts tends to be repeated more often than examined. This particular verdict originates with the inventor himself. So perhaps it is not astonishing that many treat the dismissal of Earley's on grounds of speed to be as valid today as it was in 1968.

But in the past 45 years, computer technology has changed beyond recognition and researchers have made several significant improvements to Earley's. It is time to reopen this case.

0 comments

Pinto Jam Sessions On IRC This Thursday

By Jeffrey Ryan Thalhammer on April 29, 2013 10:41 AM

I'm on IRC just about all the time (my handle is "thaljef"). But I thought it might be interesting to actually schedule a session and invite people to come in and ask questions about Pinto, suggest a feature, report a bug, or just say "Hi".

So there will be two one-hour jam sessions in the #pinto channel on irc.perl.org this Thursday, May 2. The first will at 14:00 and the second will be at 18:00 (all times GMT). If you haven't used IRC before, this is an excellent guide.

Hope to see you all then!

1 comment

Permalink

Perl 5 Porters Monthly: April 2013

By Perl 5 Porters Summaries on April 28, 2013 10:57 PM

Welcome to Perl 5 Porters Monthly, a summary of the email traffic of the perl5-porters email list. This is the last monthly catch-up. I am planning to do weekly summaries for the week starting April 29, 2013. (But the road to hell is paved with yada yada yada...)

Topics from this month include:

DAVEM TPF Grant Month 2013 report
On eliminating external tools from the release process
Blead on s390x
archlib - (ed. RJBS asks, "Should we keep it or not?")
Status of z/OS/EBCDIC
NWCLARK TPF grant March report
status on 5.18.0 (as of April 24, 2013)

0 comments

Permalink

Using kcachegrind on potion

By Reini Urban on April 28, 2013 5:55 PM

cachegrind gives you information on the callstack and callcount, dependencies and efficiency. You can easily see hotspots in your code.

I use it to check the JIT and objmodel efficiency of potion, which is the vm for p2.

See my first post today Install kcachegrind on MACOSX with ports if you are on a Mac.

cachegrind

The first run with:

$ make bin/potion-s
$ valgrind --tool=callgrind -v --dump-every-bb=10000000 bin/potion-s example/binarytrees.pn
Ctrl-C

generates this sample

 $ open qcachegrind

Open one of the generated callgrind.out.pid.num files.

2 comments

Interview with Michael Schwern

By Gábor Szabó - גאבור סבו on April 28, 2013 11:32 AM

Especially for the weekend readers, here is my
interview with Michael Schwern. Enjoy and share!

1 comment

Permalink

POE::Component::IRC::Plugin::WWW::Reddit::TIL first release

By curtis on April 27, 2013 7:21 AM

Hi all,

Yesterday I released another IRC plugin for POE::Component::IRC: POE::Component::IRC::Plugin::WWW::Reddit::TIL

The plugin simply fetches a random title and link from the front page of Reddit's TodayILearned subreddit.

I used WWW::Shorten::Simple to return bitly links, and Mojo::JSON to decode reddit's API.

example:

curtis: !TIL
ircbot: curtis: TIL that the fighter squadron with the highest number of kills in the Battle of Britain during WWII were actually from Poland, and showed up two months after the battle had begun. http://bit.ly/ZN0qdB

links:

CPAN
Github

0 comments

Permalink

German Perl Workshop 2014 is on its way to Hanover

By burnersk on April 25, 2013 4:15 PM

Hannover.pm is organising the 16th German Perl Workshop 2014 ( GPW 2014 ) in Hanover.

An official act.yapc.eu website is currently in the making and will be published in early June. Give us some time to understand and fully configure its back end.

The gpw2014 will take place from March 26th to 28th 2014 (Wednesday to Friday). The CeBIT will take place from March 11th to 15th, the Hannover Messe will take place from April 7th to 11th. We're smack-dab in the middle of those two big fairs, but hotel rooms will be affordable during that week.

I'll blog about the gpw2014 at least every month to keep you informed. But please also have a look at the official act website for major news.

If you like to chat you can join the IRC channel #gpw (#gpw2014 is for the organisers) on irc.perl.org.

German version: http://www.perl-community.de/bat/poard/thread/18295

0 comments

Permalink

Install kcachegrind on MacOSX with ports

By Reini Urban on April 28, 2013 5:17 PM

Well, you don't want to install kcachegrind with port.

$ sudo port search cachegrind
kcachegrind @0.4.6 (devel)
    KCachegrind - Profiling Visualization

Because building KDE takes hours, and you wont need it other than for cachegrind. But there's a QT variant coming with kcachegrind, called qcachegrind. Maybe ports wants to use this variant. Or not, because kdelibs3 is listed as dependency.

$ sudo port info kcachegrind

kcachegrind @0.4.6, Revision 1 (devel) Variants: universal

Description: KCachegrind visualizes traces generated by profiling, including a tree map and a call graph visualization of the calls happening. It's designed to be fast for very large programs like KDE applications. Homepage: http://kcachegrind.sourceforge.net/

Library Dependencies: kdelibs3
Platforms: darwin
License: unknown
Maintainers: nomaintainer@macports.org

1 comment

Perl 5 Porters Monthly: March 2013

By Perl 5 Porters Summaries on April 28, 2013 6:09 AM

cross posted from my blog

Welcome to Perl 5 Porters Monthly, a summary of the email traffic of the perl5-porters email list.

Topics from this month include:

0 comments

Permalink

Fractal Diamond-Square Terrain Generation in Perl

By Ovid on April 25, 2013 2:49 PM

The title is mostly for search engines for anyone who encounters this in the future.

I have, for no particular reason, decided to implement the fractal diamond-square terrain generation algorithm in Perl. Sometimes it's nice to just play.

1 comment

Learning from other industries, part 1 of n

By David Cantrell on April 25, 2013 12:20 PM

My first job was as a bus conductor, and my second one was as a student trainee in an engineering company - proper engineering, with production lines, big machines, hot things, and "danger of death" notices on equipment. In both of these, safety was an important concern, and especially in the second one it was drilled in to me that safety and quality are closely related and arise from systems, not merely from individual endeavour. While I never completed my degree in manufacturing/systems engineering (I dropped out because I was fed up after too many years in the classroom) I still retain an interest in the subject.

I recently came across the excellent Disastercast podcast by Drew Rae. Of particular interest to programmers is the sixth episode, which looks at the report into a fatal rail crash caused by a poor safety and testing culture.

3 comments

Why I joined Propaganda.pm

By lichtkind on April 24, 2013 6:31 PM

4 reasons:

1: Right man.I was there when it started. At German Perl Workshop this march in Berlin Richard ignited with his inofficial keynote a lot of controversy. All what said wasn't new or IMO just opinion or chatter/not relevant. Later I spoke with him @ the social meeting in the computer game museum. (seriously, is there a better place for such an event?)

During our conversation I found out: he listens to people, he really loves Perl and he's the right kind of Person to do that, with the right experience set. Even if I don't share some of his fews/considerations what is important.

3 comments

Perl 5 Porters Monthly: February 2013

By Perl 5 Porters Summaries on April 28, 2013 4:13 AM

cross posted from my own blog

Welcome to Perl 5 Porters Monthly, a summary of the email traffic of the perl5-porters email list.

Topics from this month include:

0 comments

Permalink

You Are Invited To The Stratopan Beta

By Jeffrey Ryan Thalhammer on April 24, 2013 9:50 AM

Stratopan is a new service for hosting custom repositories of Perl modules in the cloud. Private beta trials will begin early this summer. If you'd like to participate in the trials, please stop by https://stratopan.com and leave us your email address. We'll contact you with all the details when the trials begin.

Stratopan will host both public and private repositories with any combination of proprietary and open source Perl modules. And Stratopan is built on Pinto, the open source tool for creating custom CPAN-like repositories, so it has the same helpful tools for managing your application dependencies.

5 comments

Permalink

[FAILED] 10x faster than LibXML

Why You Should Help Crowd-fund Pinto

Chicago.PM - Beyond grep - Expanding the Programmer Toolset

Contribute to Pinto through Paypal or Flattr

Padre 0.98 has FINALLY been released.

Small but hideously formed

Is Earley parsing fast enough?

Pinto Jam Sessions On IRC This Thursday

Perl 5 Porters Monthly: April 2013

Using kcachegrind on potion

cachegrind

Interview with Michael Schwern

POE::Component::IRC::Plugin::WWW::Reddit::TIL first release

German Perl Workshop 2014 is on its way to Hanover

Install kcachegrind on MacOSX with ports

Perl 5 Porters Monthly: March 2013

Fractal Diamond-Square Terrain Generation in Perl

Learning from other industries, part 1 of n

Why I joined Propaganda.pm

Perl 5 Porters Monthly: February 2013

You Are Invited To The Stratopan Beta

About blogs.perl.org

Search blogs.perl.org

cachegrind

About blogs.perl.org

Search blogs.perl.org

Perl & The Community