mirror of https://github.com/webrecorder/pywb.git synced 2025-03-15 08:04:49 +01:00
Ilya Kreymer 921b2eb2e1 improve testing and a few fixes:
archivalrouter: support empty collection, with and without SCRIPT_NAME
cdx: remove cdx source test, including access denied
replay: when content-type present, limit the decompressed stream to content-length
(this ensures last 4 bytes in warc/arc record are not read)
integration tests for identity replay
2014-02-27 18:43:55 -08:00

pywb.cdx package

This package contains the CDX processing suite of the pywb wayback tool suite.

The CDX Server loads, filters and transforms cdx from multiple sources in response to a given query.
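The load, filter and transform steps can be sketched roughly as follows. This is only an illustration of the idea, not the package's actual code: the function names `load_cdx` and `query_cdx`, the sample lines, and the choice of output fields are all hypothetical, and the sketch assumes each source is already sorted, space-delimited cdx text.

```python
import heapq


def load_cdx(lines):
    """Yield stripped, non-empty cdx lines from one (already sorted) source."""
    for line in lines:
        line = line.strip()
        if line:
            yield line


def query_cdx(sources, key_prefix, limit=None):
    """Merge sorted sources, keep lines matching key_prefix, return dicts.

    Hypothetical helper: field names below are illustrative only.
    """
    # heapq.merge lazily combines several sorted inputs into one sorted stream
    merged = heapq.merge(*(load_cdx(source) for source in sources))
    count = 0
    for line in merged:
        # filter: keep only lines whose url key starts with the prefix
        if not line.startswith(key_prefix):
            continue
        # transform: expose the first three space-delimited fields by name
        fields = line.split(' ')
        yield {'urlkey': fields[0],
               'timestamp': fields[1],
               'original': fields[2]}
        count += 1
        if limit is not None and count >= limit:
            return


# Made-up sample data standing in for two cdx files
source_a = [
    'com,example)/ 20140101000000 http://example.com/ text/html 200',
    'org,iana)/ 20140126200624 http://www.iana.org/ text/html 200',
]
source_b = [
    'com,example)/ 20130101000000 http://example.com/ text/html 200',
]

results = list(query_cdx([source_a, source_b], 'com,example)/'))
```

Merging lazily keeps memory use flat even when the sources are large, since only one pending line per source is held at a time.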

Sample App

A very simple reference WSGI app is included.

Run python -m pywb.cdx.wsgi_cdxserver to start the app; stop it with a keyboard interrupt (Ctrl+C).

The default config.yaml points to the sample data directory and uses port 8080.

The domain-specific rules in rules.yaml are also loaded.
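Once the sample app is running, a query URL for it could be assembled along these lines. The port 8080 comes from the default config described above; everything else is an assumption: the `/cdx` path and the helper name `build_cdx_query` are hypothetical, and the `url` and `limit` parameters are borrowed from the wayback-cdx-server feature set rather than confirmed for this app.

```python
from urllib.parse import urlencode


def build_cdx_query(url, host='localhost', port=8080, path='/cdx', **params):
    """Build a query URL for a locally running CDX server.

    Hypothetical helper: the '/cdx' path and parameter names are assumptions,
    not a documented API of the sample app.
    """
    query = {'url': url}
    query.update(params)
    # urlencode percent-escapes the target url so it is safe in a query string
    return 'http://{0}:{1}{2}?{3}'.format(host, port, path, urlencode(query))


query_url = build_cdx_query('http://example.com/', limit=5)
```

The resulting string can be opened in a browser or passed to any HTTP client; adjust the path and parameters to whatever the running app actually exposes.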

CDX Server API Reference

The goal is to provide compatibility with this feature set and more: https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server

TODO