Ilya Kreymer
0784e4e5aa
spin-off warcio!
...
update imports to point to warcio
warcio rename fixes:
- ArcWarcRecord.stream -> raw_stream
- ArcWarcRecord.status_headers -> http_headers
- ArchiveLoadFailed single param init
2017-03-07 10:58:00 -08:00
Ilya Kreymer
1213466afb
warc & recorder refactor: split BaseWARCWriter from MultiWARCWriter, move to warc/warcwriter.py, recorder/multifilewarcwriter.py
...
split indexing functionality from base warc iterator, move to archiveindexer.py
2017-03-01 14:18:44 -08:00
Ilya Kreymer
3faa55906a
warcwriter: attempt to separate warc writing semantics from the recorder
...
use StatusAndHeaders instead of requests CaseInsensitiveDict for consistency
refactor writer api: create_warc_record() for creating new record
copy_warc_record() for copying a full record from a stream
add writer tests, separate from recorder
2017-03-01 12:50:32 -08:00
Ilya Kreymer
36862fd9e9
recorder test: fix warc/revisit cdx test (don't assume exact order with 14-digit timestamp)
2016-11-13 11:46:10 -08:00
Ilya Kreymer
fa247b8fe5
refactor: fix recorder and urlrewrite packages #200
2016-11-08 15:04:22 -08:00
Ilya Kreymer
99e5008ac0
refactor: move newly merged packages to be pywb subpackages
2016-11-08 07:01:33 -08:00