mirror of https://github.com/webrecorder/pywb.git synced 2025-04-05 05:15:42 +02:00

1373 Commits

Author SHA1 Message Date
Ilya Kreymer
af37b99e80 wombat: additional fixes/testing: for about:blank/empty iframes, initialize WB_wombat_location, document.WB_wombat_location and WB_wombat_top immediately.
disable redundant rewrites
vidrw: check for null parent node, fix bug with double-add! don't rewrite added elements
2015-05-24 20:17:04 -07:00
Ilya Kreymer
adb9448f27 rules: improved rules for googleplus! 2015-05-22 18:45:50 -07:00
Ilya Kreymer
d5b92dbb3c rules: update rules for yt comments 2015-05-21 17:20:40 -07:00
Ilya Kreymer
179f11198b fuzzy match: look at first occurrence, not last, of match separator
rules: add new rule for yt comments
2015-05-21 23:52:09 +00:00
Ilya Kreymer
35e2e535bb def banner: ensure banner element isn't rewritten! 2015-05-21 12:24:16 -07:00
Ilya Kreymer
a929e96433 wombat: add rewrite_elem() back to main init_dom_override(), check if already overwritten
createElement(): add optional skip arg
2015-05-21 12:11:08 -07:00
Ilya Kreymer
b7e27ba1a8 tests: update tests for keeping scheme-relative, scheme-relative after rewrite #101
remove tests for document.cookie, document.referrer and document.domain rewrite, as this is now handled client-side
2015-05-21 11:38:06 -07:00
Ilya Kreymer
690106bcb4 wombat: more refactoring! enable http/src observer by default, add skip_createElement override
implement document.cookie, document.referrer and document.domain as property overrides instead of WB_wombat rewrites
when a new iframe is loaded, ensure the *document* is reinited with wombat, even if window already has wombat settings
2015-05-21 11:26:54 -07:00
Ilya Kreymer
4983bf4425 rewrite: keep relative scheme after all, to work with scheme + "//..." constructions, #101 2015-05-21 11:26:54 -07:00
Ilya Kreymer
9912a31523 wombat: add prototype-level override for innerHTML and outerHTML 2015-05-21 11:26:54 -07:00
Ilya Kreymer
4e1be5c275 wombat work: add createElement() override, use current protocol instead of original url protocol
refactor init_dom_override() to only check children for fragments, add innerHTML override
2015-05-21 11:26:54 -07:00
Ilya Kreymer
058b25ec5a wombat: test with href overrides 2015-05-21 11:26:54 -07:00
Ilya Kreymer
c5a5d45a58 wombat: experimenting with initializing wombat on iframe init directly, rather than waiting for injected init... 2015-05-21 11:26:54 -07:00
Ilya Kreymer
4603b423f4 bump to 0.9.9 dev 2015-05-21 11:25:31 -07:00
Ilya Kreymer
0223ac0489 rewrite: top rewrite: avoid rewriting 'top(' 2015-05-14 22:32:10 -07:00
Ilya Kreymer
d55bac70c1 update version for 0.9.8 release 2015-05-14 22:32:10 -07:00
Ilya Kreymer
5cf7368f90 default config: set default 'archive_paths' to current directory, to avoid exception on startup 2015-05-14 22:32:10 -07:00
Ilya Kreymer
557f26b852 config: allow custom config.yaml settings for automatic collections.
settings in config.yaml are merged with collection-specific settings, which take precedence
(before, the config.yaml settings were being overwritten) #103
2015-05-14 22:32:09 -07:00
Ilya Kreymer
d2763004dd wombat: for now, disable node observers by default 2015-05-14 22:32:09 -07:00
Ilya Kreymer
d8b11db1e7 wombat:
ajax: always explicitly add X-Requested-With: XMLHttpRequest to ajax requests
mutation obs: don't rewrite <link rel=canonical> with node added observer
2015-05-14 22:32:09 -07:00
Ilya Kreymer
e94b239d84 rewrite: when rewriting scheme-relative urls, if adding an absolute prefix, use the scheme of the prefix
otherwise, keep relative scheme #101
2015-05-14 22:32:09 -07:00
Ilya Kreymer
fd4a0cc9b1 wombat: add extra mutation observer for any nodes added 2015-05-14 22:32:09 -07:00
Ilya Kreymer
1b9ef4e325 html_rewriter: handle parse_comments by rewriting as html, not as js, should address ikreymer/pywb-webrecorder#7 2015-05-14 22:32:09 -07:00
Ilya Kreymer
40f15cf6ea rules: add location rewrite only rule for disqus
wombat: ensure _orig_setAttribute is still set even if setAttribute rewriting disabled!
2015-05-14 22:32:09 -07:00
Ilya Kreymer
86be72b30a query_handler: specify matchType exact for all queries, in case url ends in * 2015-05-14 22:32:09 -07:00
Ilya Kreymer
b2e26eeb27 wombat: remove timezone offset, as Date.now() already UTC 2015-05-14 22:32:09 -07:00
Ilya Kreymer
7cbf43872f wombat: obey _no_rewrite for rewrite_elem() 2015-05-14 22:32:08 -07:00
Ilya Kreymer
15ac7ea1f8 vidrw: just check 'ustream' in url 2015-05-14 22:32:08 -07:00
Ilya Kreymer
7a0ab76a07 vidrw work: limit flashvar parsing to ustream (for now) 2015-05-14 22:32:08 -07:00
Ilya Kreymer
9a90af595c views: don't add head_insert for ajax requests! 2015-05-14 22:32:08 -07:00
Ilya Kreymer
d657a65469 wombat: fix rewrite_attr not actually being rewritten due to incomplete refactor
when is_live, don't add timestamp to rewritten urls (hypothesis/via#11)
2015-05-14 22:32:08 -07:00
Ilya Kreymer
99ff29e283 js regex rewrite: scheme-rel rewrite must be preceded by a quote, not a semicolon, to avoid rewriting ;//comment; as url
add rewrite tests
2015-05-14 22:32:08 -07:00
Ilya Kreymer
838853e52d vidrw work: parse first url from flashvars (experimental), add special case for ustream! 2015-05-14 22:32:08 -07:00
Ilya Kreymer
33f247582f rewrite: HTMLRewriter should insert head_insert at end of stream, if it hasn't
been inserted by the end (and if there was some content written -- don't insert for 0-length responses)
Addresses missing head insert if only head tags are present and no head, as per hypothesis/via#9
2015-05-14 22:32:08 -07:00
Ilya Kreymer
48aa73df38 live_rewrite_handler: add _ignore_proxies() callback to allow ignoring proxy settings by derived class 2015-05-14 22:32:08 -07:00
Ilya Kreymer
5072ed568c fix typos: wombat: fix rewrite not being called on setter
frame_insert: ensure <iframe> has separate close tag
recordloader: ensure length used as string
2015-05-14 22:32:07 -07:00
Ilya Kreymer
3a0a18b4e4 rules: update rules for yt 2015-05-14 22:32:07 -07:00
Ilya Kreymer
72982829bd postMessage: check for rewritten message 2015-05-14 22:32:07 -07:00
Ilya Kreymer
1c61bdcdd7 bump version to 0.9.8-dev
fix vidrw to work with mp_ / inverse framed replay!
2015-05-14 22:32:07 -07:00
Ilya Kreymer
733de9bffe update version to 0.9.7 for release! 2015-04-27 01:01:25 -07:00
Ilya Kreymer
026873e308 rules: add extra fb rule 2015-04-27 00:44:24 -07:00
Ilya Kreymer
160794e4ec wombat: add options to disable worker override, use mutation observers instead of setAttribute() override 2015-04-27 00:43:10 -07:00
Ilya Kreymer
904b4265e9 wombat typo fix 2015-04-26 18:28:05 -07:00
Ilya Kreymer
a88acb7476 wombat work: add option to use MutationObserver instead of setAttribute override
use original setAttribute when calling internally
use querySelectorAll for child element rewrite selection
add check for ':port' being dropped from host url and attempt to add back in
2015-04-24 22:09:08 -07:00
Ilya Kreymer
a91640e464 auto init fix: autoadd collection paths only if archive path is relative and not a url 2015-04-20 18:12:35 -07:00
Ilya Kreymer
d9bd47ea77 wombat: disable always remove slash in extract_orig, as this may break urls that depend
on it (and they are different urls anyway)
2015-04-20 17:46:03 -07:00
Ilya Kreymer
5028901a17 tests: add tests for indexing http custom status/verbs with and without verify #99 2015-04-20 08:58:51 -07:00
Ilya Kreymer
08064f3806 warc load: make http response/request protocol/verb validation optional
enabled for replay, disabled by default for cdx-indexing, though can
be enabled with -v option #99
2015-04-20 08:29:18 -07:00
Ilya Kreymer
28e3cd791b bump version to 0.9.7-dev 2015-04-19 22:47:20 -07:00
Ilya Kreymer
a992909cb5 update README and version for 0.9.6 2015-04-17 22:02:28 -07:00