root/Web-Scraper/trunk/Changes

Revision 2324 (checked in by miyagawa, 13 years ago)

Checking in changes prior to tagging of version 0.12. Changelog diff is:

=== Changes
==================================================================
--- Changes (revision 6880)
+++ Changes (local)
@@ -1,5 +1,11 @@
 Revision history for Perl extension Web::Scraper

+0.12  Thu Aug 30 02:39:44 PDT 2007
+        - Added 's' command to scraper to get the HTML source
+        - You can use $tree variable to deal with the HTML::Element object in scraper shell
+        - Give a graceful error message if the given Selector/XPath doesn't compile
+        - Give a better error when number of args in process() seems wrong
+
 0.11  Tue Aug 28 02:50:01 PDT 2007
         - Supported hash-reference in process values, like
           process "a", "people[]", { link => '@href', name => 'TEXT' };

Revision history for Perl extension Web::Scraper

0.12  Thu Aug 30 02:39:44 PDT 2007
        - Added 's' command to scraper to get the HTML source
        - You can use $tree variable to deal with the HTML::Element object in scraper shell
        - Give a graceful error message if the given Selector/XPath doesn't compile
        - Give a better error when number of args in process() seems wrong

0.11  Tue Aug 28 02:50:01 PDT 2007
        - Supported hash-reference in process values, like
          process "a", "people[]", { link => '@href', name => 'TEXT' };
          See t/09-process_hash.t for its usage.

0.10  Mon Aug 27 00:53:51 PDT 2007
        - result now returns the entire stash if called without keys
        - added bin/scraper CLI

0.09  Wed Aug 15 10:51:14 PDT 2007
        - remove Devel::Leak use from tests

0.08  Tue Aug 14 13:25:16 PDT 2007
        - Call $tree->delete after the callback to avoid memory leaks by TreeBuilder.
          (Thanks to k.daiba for the report)

0.07  Sat May 12 16:23:51 PDT 2007
        - Updated dependencies for HTML::TreeBuilder::XPath

0.06  Sat May 12 15:47:27 PDT 2007
        - Now don't use decoded_content to work with new H::R::Encoding

0.05  Wed May  9 18:21:22 PDT 2007
        - Added (less DSL-ish) Web::Scraper->define(sub { ... }) syntax
        - Fixed bug where the module dies if there's no encoding found in HTTP response headers
        - Added more examples in eg/
        - When we get value using callback, pass HTML::Element object as $_, in addition to $_[0]
          (Suggested by Matt S. Trout)
        - If the expression (1st argument to process()) starts with "/", it's
          treated as a direct XPath and no Selector-to-XPath conversion is done.

0.04  Wed May  9 00:55:32 PDT 2007
        - *API CHANGE* Now scraper {} returns Web::Scraper object and not closure.
          You should call ->scrape() to get the response back.
          (Suggested by Marcus Ramberg)

          I loved the code returning closure, but this is more compatible to
          scrapi.rb API and hopefully less confusing to people.

0.03  Tue May  8 23:04:13 PDT 2007
        - use 'TEXT' rather than 'content' to grab text from element
          to be more compatible with scrapi
        - Added unit tests using Test::Base
        - Refactored internal code for easier reading
        - chained callbacks are now passed HTML::Element, not HTML, to avoid double HTML parsing
        - Implemented callbacks (iterator) API
        - Added 'process_first' to be compatible with scrapi

0.02  Tue May  8 20:03:37 PDT 2007
        - Added dependencies to Makefile.PL

0.01  Tue May  8 04:05:59 2007
        - original version