qpdf - qpdf + zopfli patch

Age	Commit message (Collapse)	Author
2022-06-19	Add Pl_Function -- a generic function pipeline	Jay Berkenbilt

2022-06-19	Add qpdfjob_register_progress_reporter	Jay Berkenbilt

2022-06-19	Add QPDFJob::registerProgressReporter	Jay Berkenbilt

2022-06-19	Move C-based ProgressReporter helper into QPDFWriter	Jay Berkenbilt

2022-06-19	Add more flexible funtions to qpdfjob C API	Jay Berkenbilt

2022-06-18	Use the default logger for other writes to stdout/stderr	Jay Berkenbilt
	When there is no context for writing output or error messages, use the default logger.
2022-06-18	Use "save" logger when saving data to standard output	Jay Berkenbilt
	This includes the output PDF, streams from --show-object and attachments from --save-attachment. This also enables --verbose and --progress to work with saving to stdout.
2022-06-18	QPDF, QPDFJob: use QPDFLogger instead of custom output streams	Jay Berkenbilt

2022-06-18	Add and test QPDFLogger class	Jay Berkenbilt

2022-05-31	In json mode, reveal recovered user password when otherwise unavailable	Jay Berkenbilt

2022-05-31	Add additional information when listing attachments	Jay Berkenbilt

2022-05-21	Change default decode level to "none" with --json-output	Jay Berkenbilt

2022-05-21	Add another binary utf8 to JSON test	Jay Berkenbilt

2022-05-21	Allow empty b: binary JSON strings	Jay Berkenbilt

2022-05-21	Code clean up: use range-style for loops wherever possible	m-holger
	Remove variables obsoleted by commit 4f24617.
2022-05-21	Add json to large file test	Jay Berkenbilt

2022-05-20	Exercise object description in tests	Jay Berkenbilt

2022-05-20	Add test for bad data and bad datafile	Jay Berkenbilt

2022-05-20	Test --update-from-json	Jay Berkenbilt

2022-05-20	Test (and fix) handling of dangling references	Jay Berkenbilt

2022-05-20	Explicitly test ignoring unknown keys in JSON input	Jay Berkenbilt

2022-05-20	Make version default to latest for --json-output (like --json)	Jay Berkenbilt

2022-05-20	Round-trip tests with --json-stream-data=file	Jay Berkenbilt

2022-05-20	Tests with manually constructed qpdf json	Jay Berkenbilt

2022-05-20	Add tests for --json-input	Jay Berkenbilt

2022-05-20	Add more names and strings in good13	Jay Berkenbilt
	* native UTF-8 strings * names whose PDF and canonical syntax differ in both dictionary key positions and other positions For json, names are converted both as names and directly when used as dictionary keys.
2022-05-20	Rename all test files: _ to -	Jay Berkenbilt

2022-05-20	Major rework -- see long comments	Jay Berkenbilt
	* Replace --create-from-json=file with --json-input, which causes the regular input to be treated as json. * Eliminate --to-json * In --json=2, bring back "objects" and eliminate "objectinfo". Stream data is never present. * In --json-output=2, write "qpdf-v2" with "objects" and include stream data.
2022-05-20	Back out fluent QPDFObjectHandle methods. Keep the andGet methods.	Jay Berkenbilt
	I decided these were confusing and inconsistent with how JSON works. They muddle the API rather than improving it.
2022-05-20	Parse objects; stream data is not yet handled	Jay Berkenbilt

2022-05-16	Implement top-level qpdf json parsing	Jay Berkenbilt

2022-05-16	Remove offset from missing /Root error	Jay Berkenbilt
	The last offset is irrelevant to not being able to find /Root.
2022-05-14	Split qpdf.test into multiple test suites	Jay Berkenbilt
	This makes it a lot easier to run parts of the test suite.
2022-05-08	Add maxobjectid to JSON	Jay Berkenbilt

2022-05-08	Add --to-json option	Jay Berkenbilt

2022-05-08	Test inline stream data with different decode levels	Jay Berkenbilt

2022-05-08	Test json v2 with invalid stream data	Jay Berkenbilt

2022-05-08	Implement JSON v2 output	Jay Berkenbilt

2022-05-08	Apply script across future v2 test files	Jay Berkenbilt
	There is one unexpected pass in this commit. This script was applied to the files changed in this commit: ---------- #!/usr/bin/env python3 import json import sys def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) data['version'] = 2 objectinfo = {} if 'objectinfo' in data: objectinfo = data['objectinfo'] del data['objectinfo'] if 'objects' not in data: continue qpdf = {'jsonversion': 2, 'pdfversion': '1.3', 'objects': {}} for k, v in data['objects'].items(): is_stream = objectinfo.get(k, {}).get('stream', {}).get('is', False) if k.endswith(' R'): k = 'obj:' + k if is_stream: v = {'stream': {'dict': v}} else: v = {'value': v} qpdf['objects'][k] = v data['qpdf'] = qpdf del data['objects'] print(json_dumps(data)) ----------
2022-05-08	Prepare test suite for json v2	Jay Berkenbilt

2022-05-08	Fix typo in json output key name	Jay Berkenbilt
	moddify -> modify. Also carefully spell checked all remaining keys by splitting them into words and running a spell checker, not just relying on visual proofreading. That was the only one.
2022-05-08	Implement JSON v2 for Stream	Jay Berkenbilt
	Not fully exercised in this commit
2022-05-08	Implement JSON v2 for String	Jay Berkenbilt
	Also refine the herustic for deciding whether to use hexadecimal notation for a string.
2022-05-07	Prepare code for JSON v2	Jay Berkenbilt
	Update getJSON() methods and calls to them
2022-05-07	Objectinfo json: write incrementally and in numeric order	Jay Berkenbilt
	This script was used on test data: ---------- #!/usr/bin/env python3 import json import sys import re def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) if 'objectinfo' not in data: continue trailer = None to_sort = [] for k, v in data['objectinfo'].items(): if k == 'trailer': trailer = v else: m = re.match(r'^(\d+) \d+ R', k) if m: to_sort.append([int(m.group(1)), k, v]) newobjectinfo = {x[1]: x[2] for x in sorted(to_sort)} if trailer is not None: newobjectinfo['trailer'] = trailer data['objectinfo'] = newobjectinfo print(json_dumps(data)) ----------
2022-05-07	Objects json: write incrementally and in numeric order	Jay Berkenbilt
	The following script was used to adjust test data: ---------- #!/usr/bin/env python3 import json import sys import re def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) if 'objects' not in data: continue trailer = None to_sort = [] for k, v in data['objects'].items(): if k == 'trailer': trailer = v else: m = re.match(r'^(\d+) \d+ R', k) if m: to_sort.append([int(m.group(1)), k, v]) newobjects = {x[1]: x[2] for x in sorted(to_sort)} if trailer is not None: newobjects['trailer'] = trailer data['objects'] = newobjects print(json_dumps(data)) ----------
2022-05-07	Top-level json: write incrementally	Jay Berkenbilt
	This commit just changes the order in which fields are written to the json without changing their content. All the json files in the test suite were modified with this script to ensure that we didn't get any changes other than ordering. ---------- #!/usr/bin/env python3 import json import sys def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) newdata = {} for i in ('version', 'parameters', 'pages', 'pagelabels', 'acroform', 'attachments', 'encrypt', 'outlines', 'objects', 'objectinfo'): if i in data: newdata[i] = data[i] print(json_dumps(newdata)) ----------
2022-05-07	Test json against schema only on demand	Jay Berkenbilt
	Testing json against schema requires an in-memory copy, so do it only when requested by the test suite.
2022-05-07	Add next to Pl_String and fix comments	Jay Berkenbilt

2022-05-04	Add new FileInputSource constructors	Jay Berkenbilt