Parsing of anime reviews on rec.arts.anime.misc

I have been collecting posts from rec.arts.anime.misc newsgroup, mostly related to anime reviews to prioritize which one to see. The format is more or less the same of (collection of) reviews ...

  • title
  • a paragraph of comments
  • ending with some kind of quality statetment (a second paragraph is rare)

Take ca6u74F3g1tU1@mid.individual.net message for example by GeoffC (Wed, 15 Oct 2014 10:48:16 +0100) ...

Gundam G no Reconguista:
Another Gundam, still with young characters piloting giant robots in space. Not badly done, but I was confused about who was who and what was going on in the first episode, and did not feel sufficiently involved to continue with further episodes.

...

Donten ni Warau:
Meiji Restoration period drama. I saw the first episode in raw. So far, it has not interested me enough to encourage further investigation.

... or, from Dave B in message 8F96CD25-3F91-45E1-9CA9-9AE037696ADF%anthony.baranyi@bell.net in single anime review (9 Oct 2014 00:55:43 GMT; "?" are present possibly due to missing character in the font used in xterm) ...

?Hitsugi no Chaika ? Avenging Battle? is the sequel to the fairly successful action/fantasy series from earlier this year. The new season starts out pretty much where the first season left off
...

All-in-all, the opening of the sequel went very well. It's almost as if there were no break in the story at all. It's good to see this show back on air and I am looking forward to seeing how things develop. My initial rating is B+.

Does somebody have suggestions to (semi-)programmatically parse into title, review, see/drop/possibly-see?

Leave a comment

About parv

user-pic Ranting, blurbing about Perl, among other things.