1
0
mirror of https://github.com/webrecorder/pywb.git synced 2025-03-15 00:03:28 +01:00

587 Commits

Author SHA1 Message Date
Ilya Kreymer
bee70260ac Merge branch 'develop' 2014-12-13 23:19:52 -08:00
Ilya Kreymer
09861ada2c wb.js: preserve anchor in location redirect
update CHANGELIST
2014-12-13 23:19:04 -08:00
Ilya Kreymer
80c5b45178 wb.js: don't include anchor when comparing current url with expected top
url
2014-12-13 23:12:42 -08:00
Ilya Kreymer
6d5592f890 wb js: add guard around wb js to prevent double-inits (such as from
fragments!)
2014-12-07 23:33:21 -08:00
Ilya Kreymer
5129c83095 update changelist 2014-12-07 21:15:35 -08:00
Ilya Kreymer
0a6838ac2b rewrite: refactor JS rewriters into seperate mixins, allowing for
link only, location only, and link + location JS rewriters.
location-only rewriter is new
js_rewrite_location options: all, location, urls (for now)
2014-12-07 21:13:19 -08:00
Ilya Kreymer
b951b304b6 Fix quotes in CHANGES.rst 2014-12-06 17:28:43 -08:00
Ilya Kreymer
0495423e86 rewrite: add per-collection rewrite options, settable in 'rewrite_opts'
block in each collection. Added rewrite_base to disable rewriting <base>
tag and rewrite_rel_canon to disable rewriting link rel=canon.

Disabling <base> tag rewrite fixex #51 and new system addresses #50 as
well.
2014-12-06 17:16:35 -08:00
Ilya Kreymer
6440e2503f bump version to 0.6.6 2014-12-06 15:22:57 -08:00
Ilya Kreymer
238a45bcbe Merge branch 'develop' for 0.6.5 0.6.5 2014-12-04 23:12:03 -08:00
Ilya Kreymer
d31a4df3a6 add changelist for 0.6.5 2014-12-04 23:10:51 -08:00
Ilya Kreymer
ea89702701 static handler: add default 'application/octet-stream' and only set
guessed mime if not none
2014-12-04 23:02:30 -08:00
Ilya Kreymer
c996e70a6e wburl: detect and decode partially encoded schemes in url, such as http%3A//,
https%A2F2F// before handling further
add additional tests for wburl
2014-11-29 11:13:57 -08:00
Ilya Kreymer
d7eb40af20 rewrite: properly rewrite scheme relative JS-escaped urls:
'\/\/example.com', '\\/\\/example.com/', treat same as '//example.com'
adding http: prefix
2014-11-23 18:56:49 -08:00
Ilya Kreymer
b8b8c30573 cookie_rewriter: add tests for exact cookie rewriter 2014-11-13 09:43:50 -08:00
Ilya Kreymer
20070e95b6 cookie_rewriter: add 'exact' cookie rewriter which never changes the
path/domain
2014-11-13 09:24:34 -08:00
Ilya Kreymer
388f31e08f rewrite: don't rewrite rel=canonical links, need to make rewriting more
configurable (#50)
2014-11-11 15:34:14 -08:00
Ilya Kreymer
49e98e0cdc archiveiterator/cdxindexer: cleaner load path for compressed and
uncompressed, ability to distinguish between chunked and non-chunked
warcs/arcs
Raise error for non-chunked gzip warcs as they can not be indexed for
replay, addressing #48
add 'bad' non-chunked gzip file for testing, using custom ext
2014-11-06 01:32:42 -08:00
Ilya Kreymer
044792f99f bump version to 0.6.5! 2014-11-06 01:28:56 -08:00
Ilya Kreymer
cc776b65bd update README.rst 0.6.4 2014-11-06 00:35:36 -08:00
Ilya Kreymer
71a8abe9c3 Merge branch 'develop' for 0.6.4 2014-11-06 00:34:32 -08:00
Ilya Kreymer
f6053a977b Update changes for 0.6.4 2014-11-05 21:59:54 -08:00
Ilya Kreymer
00121aa165 statusandheaders parsing: properly skip multiline bad headers (missing
header name and ':'), fixes #49
2014-11-05 20:26:23 -08:00
Ilya Kreymer
e4bcef1c8b rewrite: default HTMLParser entityref and charref are treated as plain
data for HTMLRewriter, since they are never rewritten, and to avoid
semicolon ambiguity, since no way to determine if there is a ; or not
at end. Addresses #43
2014-11-04 12:14:00 -08:00
Ilya Kreymer
5e4b830fa7 cdx: ensure cdx file is closed when iterator is done, since cdx files
are opened per-lookup, related to #45
2014-11-04 09:42:53 -08:00
Ilya Kreymer
a3b931b45e regex rewrite: fix js regex (dashes), add additional test case 2014-11-01 15:39:51 -07:00
Ilya Kreymer
841fd3f7b4 warc: add ability to set read block size (def 16384) in archiveiterator 2014-11-01 13:29:37 -07:00
Ilya Kreymer
5be65f2945 rules: better rule def, cleanup spacing 2014-10-30 00:10:39 -07:00
Ilya Kreymer
f14f37d5b1 tests: use httpbin for redirect tests 2014-10-29 09:47:32 -07:00
Ilya Kreymer
61ce53a0e0 warc/cdx: include metadata and resource records in default cdx index
emit 200 and 204 responses for metadata and resource, though write '-'
to cdx (for compatibility for now)
include content-length in resource/metadata records
2014-10-28 10:29:50 -07:00
Ilya Kreymer
c9273ee5ed rewrite: add 'deprefix' support to remove wburl prefix from any query
params
2014-10-26 12:12:37 -07:00
Ilya Kreymer
037cf35eb8 wsgi_wrapper: check for str before decoding err msg 2014-10-25 11:42:44 -07:00
Ilya Kreymer
8441b54192 head_insert: add mod to wombat 2014-10-24 14:13:59 -07:00
Ilya Kreymer
67e94d13f4 handlers/wombat: pass in mod to wombat, ability to customize modifier
for embeds
2014-10-24 12:45:41 -07:00
Ilya Kreymer
9b64194342 bump version to 0.6.4 2014-10-24 12:44:52 -07:00
Ilya Kreymer
a4f9138cb4 update README link 2014-10-21 19:29:27 -07:00
Ilya Kreymer
f394e26cf1 update CHANGES.rst 2014-10-21 19:21:15 -07:00
Ilya Kreymer
05995ad9cf Merge branch 'master' into develop, just README changes 2014-10-21 19:09:31 -07:00
Ilya Kreymer
e8d3965269 pep8 style fixes, remove unused methods 2014-10-21 19:06:16 -07:00
Ilya Kreymer
0a1c053507 Add badge 2014-10-19 08:33:26 -07:00
Ilya Kreymer
dfae25da01 Update README with News! 2014-10-19 08:32:11 -07:00
Ilya Kreymer
1a78fffa22 refactor handlers: simplify handling methods: handle_request() called
for all requests, handle_query() only for url query/calendar, and
handle_replay() only for replay. Improves extensibility of the handling
path
2014-10-19 00:33:32 -07:00
Ilya Kreymer
d99f7f996c urlrewriter refactor: replace get_abs_url and get_timestamp_url with
get_new_ur() which just calls wburl.to_str and applies rewriter prefix
allows creating a new wburl with any component(s) changed
2014-10-19 00:24:00 -07:00
Ilya Kreymer
d01275335b bump version to 0.6.3 2014-10-19 00:19:07 -07:00
Ilya Kreymer
c9c9e9d7ed Add Gratipay link 2014-10-18 17:00:33 -07:00
Ilya Kreymer
e4befd0d85 update README.rst 0.6.2 2014-10-18 15:27:58 -07:00
Ilya Kreymer
729320393a update license statement in js files with github link 2014-10-18 15:18:40 -07:00
Ilya Kreymer
268861b2ea Update README with UI Customization info 2014-10-18 15:14:43 -07:00
Ilya Kreymer
b7d23e4736 Update CHANGES.rst with latest 2014-10-18 14:51:21 -07:00
Ilya Kreymer
7f378c9aab move wb.css include into banner.html for easier overridability 2014-10-18 12:40:02 -07:00