mirror of https://github.com/webrecorder/pywb.git synced 2025-03-15 16:14:48 +01:00

1840 Commits

Author SHA1 Message Date
Ilya Kreymer
42a31bbebf wombat improvements:
- history change check: don't reject urls without a slash, check if new url == origin
- new api: override window.fetch() if it exists
- srcset elem rewriting, <source> element srcset override
- ajax: don't add X-Pywb-Requested-With header if url is a data: url
2016-10-19 21:11:16 -07:00
Ilya Kreymer
8b77f66a10 wb_frame.js: make more safe, check that frame actually exists before accessing 2016-10-19 20:57:56 -07:00
Ilya Kreymer
003d84c371 responseloader: self-redirect: if no status code (eg. for revisits), always parse and look at the actual status code 2016-10-19 11:03:48 -07:00
Ilya Kreymer
7b45df7338 wburl: support for new modifier form: $mod as well as 'mod_' 2016-10-10 17:00:36 -07:00
Ilya Kreymer
06b9e957e6 vidrw: when in proxy mode, use current protocol for vi_ query 2016-10-03 08:17:13 -07:00
Ilya Kreymer
ccc13b427f dockerfile: update to latest pywb
urlrewrite: upstream url avoid adding empty '&'
2016-10-02 11:29:51 -07:00
Ilya Kreymer
28dd799516 wombat: auto-disable notifications and geolocation queries 2016-10-01 21:08:53 -07:00
Ilya Kreymer
b8769c7de0 proxy mode: use js_proxy rewriter for js embedded in html when in proxy mode #198 2016-10-01 21:08:08 -07:00
Ilya Kreymer
e97d2fb517 wombat unrewrite: if given a host-relative url (starting with '/') to extract_orig(), extract as host-relative as well if the host matches the current origin -- maintain host-relative urls when possible 2016-10-01 13:53:59 -07:00
Ilya Kreymer
950c31737c wombat typo: check that __WB_top_frame is not null before using! 2016-09-30 13:49:57 -07:00
Ilya Kreymer
a4efa58d1e proxy mode: add special 'proxy_js' rewriter which defaults to none rewriter, but supports custom rules
from rules.yaml, to avoid inserting WB_wombat_ overrides in proxy mode #198
2016-09-30 11:33:30 -07:00
Ilya Kreymer
2079ce191c header rewriter improvements: better define headers rewritten/prefixed due to content rewrite vs url rewriting
when in proxy mode, don't rewrite headers unless related to content, transfer-encoding or caching (separate settings) #197
2016-09-30 09:02:50 -07:00
Ilya Kreymer
718cd43ae2 client rewrite: improvements for proxy mode
- disable most overrides when in proxy mode
- if using rewrite_url(), keep current scheme, instead of defaulting to http
- use 'window._wb_js' to check init
2016-09-29 15:26:12 -07:00
Ilya Kreymer
bdf4f9bc71 static handler: if 'wsgi.file_wrapper' throws exception, default to streaming directly 2016-09-29 15:23:40 -07:00
Ilya Kreymer
e61078ab96 memento: use replace_header() to avoid double adding Link, Memento-Datetime, Vary when using range request cache 2016-09-29 15:22:44 -07:00
Ilya Kreymer
4cdb99f415 rewrite: strip www redir check: use re.MULTILINE to include urls that may have a \r 2016-09-29 15:20:25 -07:00
Ilya Kreymer
98e8a75920 vidrw: more permissive flash video rewriting: consider any <object> with flashvars, attempt any youtube-dl playlist
bump version to 0.32.2
2016-09-21 11:37:31 -07:00
Ilya Kreymer
64cd408de4 update CHANGES for 0.32.1 2016-09-20 15:44:50 -07:00
Ilya Kreymer
a6a186891e wbrequestresponse: text response: calculate Content-Length from encoded utf-8 bytes, not the original text 2016-09-20 15:44:50 -07:00
Ilya Kreymer
1bb7aa01ce wburl improved scheme detection: use regex to match acceptable scheme before :/, don't treat something like 'a.com/?x=http://' as having a scheme, update tests to check for this 2016-09-20 15:44:50 -07:00
Ilya Kreymer
9a3017bfcd bump version to 0.32.1 2016-09-20 15:44:49 -07:00
Ilya Kreymer
5c499753f8 webrecorder Docker: update Dockerfile to latest pywb, python, starting to use versioning! 2016-09-16 18:43:26 -07:00
Ilya Kreymer
874bef0ab1 Update CHANGES 0.32.0 2016-09-15 14:18:44 -07:00
Ilya Kreymer
dc05d14934 Merge pull request #194 from nlevitt/cli-desc
fix/tweak for cli --help
2016-09-15 14:16:42 -07:00
Ilya Kreymer
c3f98c3d38 Merge branch 'develop' 2016-09-15 14:15:34 -07:00
Ilya Kreymer
d95116885a Update CHANGES for 0.32.0 2016-09-15 14:14:05 -07:00
Ilya Kreymer
86cbb366f3 rules: undo yt rules change (will revisit later) 2016-09-15 10:01:36 -07:00
Ilya Kreymer
0a76a56b91 wombat: edge case: correctly handle <iframe src="javascript:WB_wombat_location=..."> assignment created via JS. custom rewrite_frame_src() added for use with rewrite_elem(), ensures wombat init is inserted first thing after 'javascript:' 2016-09-14 15:44:20 -07:00
Ilya Kreymer
cc65ce914d wombat improvements (2.16):
- rewrite_elem() also rewrite 'poster'
- extract_orig() don't add http:// if nothing extracted
- new override: navigator.sendBeacon() if available
2016-09-14 14:13:59 -07:00
Ilya Kreymer
5fede0fea3 wombat: turn off debugging (accidentally committed) 2016-09-14 13:39:10 -07:00
Ilya Kreymer
1fb6e9b5fa rewrite: url rewriter: don't rewrite relative urls, only those that start with scheme, / or contain ../ #195
update tests to reflect this new behavior
2016-09-14 13:04:46 -07:00
Noah Levitt
1620668363 fix/tweak for cli --help 2016-09-14 09:58:44 -07:00
Ilya Kreymer
70fdaae2b3 rules: rewrite location string for periscope js 2016-09-12 20:07:14 -07:00
Ilya Kreymer
1a37d789ed cdx-api: when using cdx server api, return the 'no captures' 404 error as json if output=json, and as plain text otherwise, instead of as html #193 2016-09-08 18:59:52 -07:00
Ilya Kreymer
f47ae0bb7e rewrite: for rewriting on* attr, add 'window.' before WB_wombat_ as window may not be in scope (if no '.' before WB_wombat) 2016-09-08 18:38:35 -07:00
Ilya Kreymer
6452c72b4f bump versions 2016-09-08 10:31:07 -07:00
Ilya Kreymer
1fe201c528 rewrite: html: rewrite svg <image> tag
client: update textContent after rewrite_style() in rewrite_elem()
2016-09-08 10:06:47 -07:00
Ilya Kreymer
895a01933c wb: allow multiple readystateevent changes, in case data changes (eg. title is available later) 2016-09-02 12:04:30 -07:00
Ilya Kreymer
70a25b6d0f client rewrite: ensure window.open() windows have wombat inited. if they are set to about:blank, use parser from opener to ensure proper relative url resolving 2016-08-20 13:03:17 -04:00
Ilya Kreymer
099a81b786 wb_frame: add support for optional 'wbinfo.outer_prefix' which if set, is used for making the top frame url (#191) 2016-08-20 00:03:21 -04:00
Ilya Kreymer
892ebacead cross-frame improvements: #191
- make hashchange functions use postMessage(), support setting top->replay and replay->top
- special postMessage() option for sending message from top frame -> replay frame
- fix history navigation, mimic top frame history same as replay frame as much as possible
- remove iframe_loaded() callback, using postMessage() notifications only
- include document title in 'load' message
2016-08-19 23:44:15 -04:00
Ilya Kreymer
6af1a7856e top-frame handling: don't access contents of top frame directly to support cross-domain frames
set __WB_top_frame in wombat if is_framed property is true, don't check wbinfo (#191)
2016-08-19 13:59:42 -04:00
Ilya Kreymer
2fb1df34c9 recorder: add upload/streaming support with put_record=stream where the content being uploaded is already in WARC record form 2016-08-12 21:23:25 -04:00
Ilya Kreymer
c8b6a48005 webagg: use prepare_auth() to ensure Authorization header is set for http://user:pass@host urls 2016-08-12 21:22:17 -04:00
Ilya Kreymer
82d3b61523 recorder: catch exception in close_idle_files() if file no longer exists and ensure it's removed 2016-08-12 01:19:30 -04:00
Ilya Kreymer
594aff86d3 webagg: response self-redir: don't check if live, throw correct exception 2016-08-10 00:50:43 -04:00
Ilya Kreymer
92dfcbfcbe rewrite: don't rewrite 'www-authenticate' and 'proxy-authenticate' headers 2016-08-10 00:02:53 -04:00
Ilya Kreymer
cca0c01547 urlrewrite misc fixes:
- ensure content-length is converted to str
- templateview: support optional extensions
- fix test
2016-08-09 19:53:22 -04:00
Ilya Kreymer
b22a29df5f vidrw: also check for 'src' param as well as movie 2016-08-08 19:50:16 -04:00
Ilya Kreymer
c93d7ecafc webagg: Fix loading of url-lookup (url agnostic) revisits, ensure all params passed to cdx lookup, add tests for url-agnostic revisit lookup 2016-08-04 16:53:24 -04:00
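
Note on commit a6a186891e (Content-Length from encoded utf-8 bytes): the sketch below illustrates why the byte length and the text length can differ. It is a minimal illustration only, not the pywb implementation; the function name text_response_headers and its parameters are hypothetical.

```python
def text_response_headers(text, content_type='text/plain; charset=utf-8'):
    """Build headers for a text response (hypothetical helper, not pywb's API).

    Content-Length must count the encoded bytes that go on the wire,
    not the number of characters in the original string.
    """
    body = text.encode('utf-8')
    headers = [
        ('Content-Type', content_type),
        ('Content-Length', str(len(body))),  # byte count, as a str
    ]
    return headers, body


headers, body = text_response_headers('héllo')
# The text has 5 characters, but 'é' encodes to 2 bytes in UTF-8,
# so the header value is larger than len(text).
print(dict(headers)['Content-Length'], len('héllo'))
```

Using len(text) here would under-report the length for any non-ASCII text, and a client could truncate the body.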