root/Web-Scraper


Date Rev Chgset Author Log Message
(edit) 12/09/08 11:16:49 @2901 [2901] miyagawa Added Web::Scraper::Node wrapper
(edit) 02/02/08 14:23:25 @2423 [2423] miyagawa commit woremacx's patch at …
(edit) 02/02/08 14:15:40 @2422 [2422] miyagawa branching libxml
(edit) 02/02/08 14:14:49 @2421 [2421] miyagawa branches
(edit) 12/07/07 17:11:46 @2401 [2401] miyagawa accept 0 as a return value
(edit) 11/26/07 09:01:42 @2398 [2398] miyagawa Tagging version '0.24' using shipit.
(edit) 11/26/07 09:01:19 @2397 [2397] miyagawa Checking in changes prior to tagging of version 0.24. Changelog diff is: …
(edit) 11/25/07 10:24:03 @2396 [2396] miyagawa Tagging version '0.23' using shipit.
(edit) 11/25/07 10:23:40 @2395 [2395] miyagawa Checking in changes prior to tagging of version 0.23. Changelog diff is: …
(edit) 11/21/07 10:21:01 @2394 [2394] miyagawa accept id() function as XPath not CSS selector
(edit) 11/11/07 14:13:04 @2393 [2393] miyagawa bump up HTML::Selector::XPath req
(edit) 10/18/07 09:53:04 @2380 [2380] miyagawa Tagging version '0.22' using shipit.
(edit) 10/18/07 09:52:41 @2379 [2379] miyagawa Checking in changes prior to tagging of version 0.22. Changelog diff is: …
(edit) 10/18/07 09:41:34 @2378 [2378] miyagawa print to PAGER if it's set. changed variable names for the generated code.
(edit) 10/04/07 18:38:14 @2374 [2374] miyagawa Tagging version '0.21_01' using shipit.
(edit) 10/04/07 18:37:33 @2373 [2373] miyagawa remove
(edit) 10/04/07 18:23:53 @2372 [2372] miyagawa Tagging version '0.21_01' using shipit.
(edit) 10/04/07 17:42:22 @2371 [2371] miyagawa look to see if textarea preserves newlines
(edit) 10/04/07 17:06:29 @2370 [2370] miyagawa Checking in changes prior to tagging of version 0.21_01. Changelog diff …
(edit) 10/04/07 16:46:13 @2369 [2369] miyagawa fixed example
(edit) 10/04/07 16:43:30 @2368 [2368] miyagawa fixed issues with non-match regexp sub. also handle undef return values
(edit) 10/04/07 16:26:12 @2367 [2367] miyagawa fixed a bug in loop filters
(edit) 10/04/07 16:13:40 @2366 [2366] miyagawa add an experimental filter support!
(edit) 10/04/07 02:39:09 @2365 [2365] miyagawa Tagging version '0.21' using shipit.
(edit) 10/04/07 02:38:52 @2364 [2364] miyagawa Checking in changes prior to tagging of version 0.21. Changelog diff is: …
(edit) 10/03/07 16:35:05 @2363 [2363] miyagawa Tagging version '0.20' using shipit.
(edit) 10/03/07 16:30:10 @2362 [2362] miyagawa Checking in changes prior to tagging of version 0.20. Changelog diff is: …
(edit) 10/03/07 16:23:45 @2361 [2361] miyagawa add PS store news
(edit) 09/21/07 14:43:42 @2360 [2360] miyagawa Tagging version '0.19' using shipit.
(edit) 09/21/07 14:43:33 @2359 [2359] miyagawa Checking in changes prior to tagging of version 0.19. Changelog diff is: …
(edit) 09/21/07 14:40:26 @2358 [2358] miyagawa try to get encoding from META tags as well
(edit) 09/21/07 12:45:28 @2357 [2357] miyagawa revert the fix for now!
(edit) 09/21/07 12:44:30 @2356 [2356] miyagawa make absolute URI return value as a string, not URI object to be …
(edit) 09/21/07 11:49:45 @2355 [2355] miyagawa Tagging version '0.18' using shipit.
(edit) 09/21/07 11:49:40 @2354 [2354] miyagawa Checking in changes prior to tagging of version 0.18. Changelog diff is: …
(edit) 09/21/07 11:35:54 @2353 [2353] miyagawa use as_XML instead of as_HTML in RAW
(edit) 09/21/07 11:27:36 @2352 [2352] miyagawa fix absolute URI bug with nested scrapers
(edit) 09/20/07 11:13:53 @2351 [2351] miyagawa Tagging version '0.17' using shipit.
(edit) 09/20/07 11:13:46 @2350 [2350] miyagawa Checking in changes prior to tagging of version 0.17. Changelog diff is: …
(edit) 09/19/07 14:47:28 @2349 [2349] miyagawa Tagging version '0.16' using shipit.
(edit) 09/19/07 14:47:18 @2348 [2348] miyagawa Checking in changes prior to tagging of version 0.16. Changelog diff is:
(edit) 09/19/07 14:43:39 @2347 [2347] miyagawa support TextNode. call Term::Encoding
(edit) 09/16/07 13:28:28 @2346 [2346] miyagawa changes date
(edit) 09/16/07 13:25:21 @2345 [2345] miyagawa Tagging version '0.15' using shipit.
(edit) 09/16/07 13:25:03 @2344 [2344] miyagawa Checking in changes prior to tagging of version 0.15. Changelog diff is: …
(edit) 09/15/07 10:06:41 @2343 [2343] miyagawa simplify POD example
(edit) 09/15/07 09:49:09 @2342 [2342] miyagawa make user_agent an accessor as well
(edit) 09/15/07 09:46:34 @2341 [2341] miyagawa make UserAgent variable accessible
(edit) 09/15/07 09:44:12 @2340 [2340] miyagawa don't escape utf-8 characters in WARN and 's' on scraper shell
(edit) 09/15/07 08:07:45 @2339 [2339] miyagawa Tagging version '0.14' using shipit.
(edit) 09/15/07 08:07:14 @2338 [2338] miyagawa Checking in changes prior to tagging of version 0.14. Changelog diff is: …
(edit) 09/15/07 08:05:49 @2337 [2337] miyagawa now url is absolute. yay
(edit) 09/15/07 08:04:10 @2336 [2336] miyagawa added URI absolutification and RAW/HTML getter
(edit) 09/15/07 07:50:03 @2335 [2335] miyagawa no optional end tag
(edit) 09/15/07 07:16:59 @2334 [2334] miyagawa added =~ to the selector
(edit) 09/03/07 23:53:30 @2333 [2333] miyagawa fix Term::Readline usage
(edit) 09/03/07 13:06:27 @2332 [2332] miyagawa Tagging version '0.13' using shipit.
(edit) 09/03/07 13:05:36 @2331 [2331] miyagawa Checking in changes prior to tagging of version 0.13. Changelog diff is: …
(edit) 09/03/07 09:00:34 @2330 [2330] miyagawa added rel-tag extractor
(edit) 09/03/07 08:42:59 @2329 [2329] miyagawa add search-cpan.pl example
(edit) 09/03/07 08:35:33 @2328 [2328] miyagawa added WARN handy sub to scraper
(edit) 09/03/07 08:34:05 @2327 [2327] miyagawa added URI to the deps
(edit) 09/03/07 08:32:53 @2326 [2326] miyagawa added 'c' and 'c all' to scraper
(edit) 08/30/07 18:42:43 @2325 [2325] miyagawa Tagging version '0.12' using shipit.
(edit) 08/30/07 18:42:00 @2324 [2324] miyagawa Checking in changes prior to tagging of version 0.12. Changelog diff is: …
(edit) 08/28/07 18:52:56 @2319 [2319] miyagawa Tagging version '0.11' using shipit.
(edit) 08/28/07 18:52:17 @2318 [2318] miyagawa Checking in changes prior to tagging of version 0.11. Changelog diff is: …
(edit) 08/27/07 17:10:19 @2317 [2317] miyagawa requires YAML for scraper script and tests
(edit) 08/27/07 17:09:28 @2316 [2316] miyagawa Tagging version '0.10' using shipit.
(edit) 08/27/07 17:07:17 @2315 [2315] miyagawa Checking in changes prior to tagging of version 0.10. Changelog diff is: …
(edit) 08/16/07 02:54:22 @2312 [2312] miyagawa Tagging version '0.09' using shipit.
(edit) 08/16/07 02:53:51 @2311 [2311] miyagawa Checking in changes prior to tagging of version 0.09. Changelog diff is: …
(edit) 08/15/07 05:30:45 @2310 [2310] miyagawa Tagging version '0.08' using shipit.
(edit) 08/15/07 05:29:52 @2309 [2309] miyagawa Checking in changes prior to tagging of version 0.08. Changelog diff is: …
(edit) 08/15/07 05:25:48 @2308 [2308] miyagawa add tree->delete to avoid memory leaks
(edit) 06/25/07 16:08:31 @2293 [2293] miyagawa fixed live test
(edit) 05/13/07 08:26:38 @2263 [2263] miyagawa Tagging version '0.07' using shipit.
(edit) 05/13/07 08:25:52 @2262 [2262] miyagawa Checking in changes prior to tagging of version 0.07. Changelog diff is: …
(edit) 05/13/07 08:03:57 @2261 [2261] miyagawa better dependencies for XPath libraries
(edit) 05/13/07 07:49:43 @2260 [2260] miyagawa Tagging version '0.06' using shipit.
(edit) 05/13/07 07:49:18 @2259 [2259] miyagawa Checking in changes prior to tagging of version 0.06. Changelog diff is: …
(edit) 05/13/07 07:37:50 @2258 [2258] miyagawa don't use decoded_content to work with new HTTP::Response::Encoding
(edit) 05/13/07 07:34:12 @2257 [2257] miyagawa add live.t for Unicode testing
(edit) 05/10/07 10:25:44 @2255 [2255] miyagawa Tagging version '0.05' using shipit.
(edit) 05/10/07 10:25:20 @2254 [2254] miyagawa Checking in changes prior to tagging of version 0.05. Changelog diff is: …
(edit) 05/10/07 10:22:33 @2253 [2253] miyagawa assume default as latin-1 per RFC
(edit) 05/10/07 10:20:13 @2252 [2252] miyagawa if expression starts with /, it's treated as direct XPath expression, not …
(edit) 05/10/07 07:47:05 @2246 [2246] miyagawa store $node to $_ in the callback
(edit) 05/09/07 19:02:17 @2245 [2245] miyagawa add HD trailer extraction code as an example of callback
(edit) 05/09/07 18:49:35 @2244 [2244] miyagawa treat as UTF-8 if there's no encoding found
(edit) 05/09/07 18:22:29 @2243 [2243] miyagawa rename .t
(edit) 05/09/07 17:12:52 @2242 [2242] miyagawa added less-DSLish constructor Web::Scraper->define(sub { ... });
(edit) 05/09/07 16:59:17 @2241 [2241] miyagawa Tagging version '0.04' using shipit.
(edit) 05/09/07 16:58:56 @2240 [2240] miyagawa Checking in changes prior to tagging of version 0.04. Changelog diff is: …
(edit) 05/09/07 16:54:30 @2239 [2239] miyagawa API CHANGE: Now scraper {} returns Web::Scraper object, not the closure. …
(edit) 05/09/07 15:37:20 @2238 [2238] miyagawa Tagging version '0.03' using shipit.
(edit) 05/09/07 15:36:29 @2237 [2237] miyagawa remove
(edit) 05/09/07 15:27:05 @2236 [2236] miyagawa Tagging version '0.03' using shipit.
(edit) 05/09/07 15:12:56 @2235 [2235] miyagawa Checking in changes prior to tagging of version 0.03. Changelog diff is: …
(edit) 05/09/07 15:06:58 @2234 [2234] miyagawa implemented process 'selector', sub { ... } and process_first for that.