root/Web-Scraper/trunk


Mode:

Legend:

Added
Modified
Copied or renamed
Date Rev Chgset Author Log Message
(edit) 11/11/07 14:13:04 @2393 [2393] miyagawa bump up HTML::Selector::XPath req
(edit) 10/18/07 09:52:41 @2379 [2379] miyagawa Checking in changes prior to tagging of version 0.22. Changelog diff is: …
(edit) 10/18/07 09:41:34 @2378 [2378] miyagawa print to PAGER if it's set. changed variable names for the generated code.
(edit) 10/04/07 17:42:22 @2371 [2371] miyagawa look to see if textarea preserves newlines
(edit) 10/04/07 17:06:29 @2370 [2370] miyagawa Checking in changes prior to tagging of version 0.21_01. Changelog diff …
(edit) 10/04/07 16:46:13 @2369 [2369] miyagawa fixed example
(edit) 10/04/07 16:43:30 @2368 [2368] miyagawa fixed issues with non-match regexp sub. also handle undef return values
(edit) 10/04/07 16:26:12 @2367 [2367] miyagawa fixed a bug in loop filters
(edit) 10/04/07 16:13:40 @2366 [2366] miyagawa add an experimental filter support!
(edit) 10/04/07 02:38:52 @2364 [2364] miyagawa Checking in changes prior to tagging of version 0.21. Changelog diff is: …
(edit) 10/03/07 16:30:10 @2362 [2362] miyagawa Checking in changes prior to tagging of version 0.20. Changelog diff is: …
(edit) 10/03/07 16:23:45 @2361 [2361] miyagawa add PS store news
(edit) 09/21/07 14:43:33 @2359 [2359] miyagawa Checking in changes prior to tagging of version 0.19. Changelog diff is: …
(edit) 09/21/07 14:40:26 @2358 [2358] miyagawa try to get encoding from META tags as well
(edit) 09/21/07 12:45:28 @2357 [2357] miyagawa revert the fix for now!
(edit) 09/21/07 12:44:30 @2356 [2356] miyagawa make absolute URI return value as a string, not URI object to be …
(edit) 09/21/07 11:49:40 @2354 [2354] miyagawa Checking in changes prior to tagging of version 0.18. Changelog diff is: …
(edit) 09/21/07 11:35:54 @2353 [2353] miyagawa use as_XML instead of as_HTML in RAW
(edit) 09/21/07 11:27:36 @2352 [2352] miyagawa fix absolute URI bug with nested scrapers
(edit) 09/20/07 11:13:46 @2350 [2350] miyagawa Checking in changes prior to tagging of version 0.17. Changelog diff is: …
(edit) 09/19/07 14:47:18 @2348 [2348] miyagawa Checking in changes prior to tagging of version 0.16. Changelog diff is:
(edit) 09/19/07 14:43:39 @2347 [2347] miyagawa support TextNode?. call Term::Encoding
(edit) 09/16/07 13:28:28 @2346 [2346] miyagawa changes date
(edit) 09/16/07 13:25:03 @2344 [2344] miyagawa Checking in changes prior to tagging of version 0.15. Changelog diff is: …
(edit) 09/15/07 10:06:41 @2343 [2343] miyagawa simplify POD example
(edit) 09/15/07 09:49:09 @2342 [2342] miyagawa make user_agent an accessor as well
(edit) 09/15/07 09:46:34 @2341 [2341] miyagawa make UserAgent? variable accessible
(edit) 09/15/07 09:44:12 @2340 [2340] miyagawa don't escape utf-8 characters in WARN and 's' on scraper shell
(edit) 09/15/07 08:07:14 @2338 [2338] miyagawa Checking in changes prior to tagging of version 0.14. Changelog diff is: …
(edit) 09/15/07 08:05:49 @2337 [2337] miyagawa now url is absoltue. yay
(edit) 09/15/07 08:04:10 @2336 [2336] miyagawa added URI absolutification and RAW/HTML getter
(edit) 09/15/07 07:50:03 @2335 [2335] miyagawa no optional end tag
(edit) 09/15/07 07:16:59 @2334 [2334] miyagawa added =~ to the selector
(edit) 09/03/07 23:53:30 @2333 [2333] miyagawa fix Term::Readline usage
(edit) 09/03/07 13:05:36 @2331 [2331] miyagawa Checking in changes prior to tagging of version 0.13. Changelog diff is: …
(edit) 09/03/07 09:00:34 @2330 [2330] miyagawa added rel-tag extractor
(edit) 09/03/07 08:42:59 @2329 [2329] miyagawa add search-cpan.pl example
(edit) 09/03/07 08:35:33 @2328 [2328] miyagawa added WARN handy sub to scraper
(edit) 09/03/07 08:34:05 @2327 [2327] miyagawa added URI to the deps
(edit) 09/03/07 08:32:53 @2326 [2326] miyagawa added 'c' and 'c all' to scraper
(edit) 08/30/07 18:42:00 @2324 [2324] miyagawa Checking in changes prior to tagging of version 0.12. Changelog diff is: …
(edit) 08/28/07 18:52:17 @2318 [2318] miyagawa Checking in changes prior to tagging of version 0.11. Changelog diff is: …
(edit) 08/27/07 17:10:19 @2317 [2317] miyagawa requires YAML for scraper script and tests
(edit) 08/27/07 17:07:17 @2315 [2315] miyagawa Checking in changes prior to tagging of version 0.10. Changelog diff is: …
(edit) 08/16/07 02:53:51 @2311 [2311] miyagawa Checking in changes prior to tagging of version 0.09. Changelog diff is: …
(edit) 08/15/07 05:29:52 @2309 [2309] miyagawa Checking in changes prior to tagging of version 0.08. Changelog diff is: …
(edit) 08/15/07 05:25:48 @2308 [2308] miyagawa add tree->delete to avoid memeory leaks
(edit) 06/25/07 16:08:31 @2293 [2293] miyagawa fixed live test
(edit) 05/13/07 08:25:52 @2262 [2262] miyagawa Checking in changes prior to tagging of version 0.07. Changelog diff is: …
(edit) 05/13/07 08:03:57 @2261 [2261] miyagawa better dependencies for XPath libraries
(edit) 05/13/07 07:49:18 @2259 [2259] miyagawa Checking in changes prior to tagging of version 0.06. Changelog diff is: …
(edit) 05/13/07 07:37:50 @2258 [2258] miyagawa don't use decoded_content to work with new HTTP::Response::Encoding
(edit) 05/13/07 07:34:12 @2257 [2257] miyagawa add live.t for Unicode testing
(edit) 05/10/07 10:25:20 @2254 [2254] miyagawa Checking in changes prior to tagging of version 0.05. Changelog diff is: …
(edit) 05/10/07 10:22:33 @2253 [2253] miyagawa assume default as latin-1 per RFC
(edit) 05/10/07 10:20:13 @2252 [2252] miyagawa if expression starts with /, it's treated as direct XPath expression, not …
(edit) 05/10/07 07:47:05 @2246 [2246] miyagawa store $node to $_ in the callback
(edit) 05/09/07 19:02:17 @2245 [2245] miyagawa add HD trailer extraction code as an example of callback
(edit) 05/09/07 18:49:35 @2244 [2244] miyagawa treat as UTF-8 if there's no encoding found
(edit) 05/09/07 18:22:29 @2243 [2243] miyagawa rename .t
(edit) 05/09/07 17:12:52 @2242 [2242] miyagawa added less-DSLish constructor Web::Scraper->define(sub { ... });
(edit) 05/09/07 16:58:56 @2240 [2240] miyagawa Checking in changes prior to tagging of version 0.04. Changelog diff is: …
(edit) 05/09/07 16:54:30 @2239 [2239] miyagawa API CHANGE: Now scraper {} returns Web::Scraper object, not the closure. …
(edit) 05/09/07 15:12:56 @2235 [2235] miyagawa Checking in changes prior to tagging of version 0.03. Changelog diff is: …
(edit) 05/09/07 15:06:58 @2234 [2234] miyagawa implemented process 'selector', sub { ... } and process_first for that.
(edit) 05/09/07 14:42:43 @2233 [2233] miyagawa refactored get_value to the function. Callback takes HTML::Element, not …
(edit) 05/09/07 14:20:13 @2232 [2232] miyagawa added unit tests
(edit) 05/09/07 13:14:54 @2231 [2231] miyagawa make use of 'TEXT' instead of content, to be more compatible with Scrapi
(edit) 05/09/07 12:04:18 @2229 [2229] miyagawa Checking in changes prior to tagging of version 0.02. Changelog diff is: …
(edit) 05/09/07 11:57:01 @2227 [2227] miyagawa Checking in changes prior to tagging of version 0.01. Changelog diff is:
(edit) 05/09/07 11:55:18 @2225 [2225] miyagawa import Web::Scraper
(add) 05/09/07 11:54:45 @2224 [2224] miyagawa Directory for svk import.