Age | Commit message | Author | |
---|---|---|---|
2018-02-19 | Simplify TokenFilter interface | Jay Berkenbilt | |
Expose Pl_QPDFTokenizer, and have it do more of the work of managing the token filter's pipeline. | |||
2018-02-19 | Detect and report bad tokens in content normalization | Jay Berkenbilt | |
2018-02-19 | Implement TokenFilter and refactor Pl_QPDFTokenizer | Jay Berkenbilt | |
Implement a TokenFilter class and refactor Pl_QPDFTokenizer to use a TokenFilter class called ContentNormalizer. Pl_QPDFTokenizer is now a general filter that passes data through a TokenFilter. |
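
The design described in the last entry, a general driver that passes tokens through a pluggable filter, can be illustrated with a minimal self-contained sketch. This is not QPDF's actual code or API: every name in the `sketch` namespace (`Token`, `TokenFilter`, `OneTokenPerLine`, `run_filter`) is a hypothetical stand-in, and the whitespace-splitting tokenizer is a deliberately simplistic assumption in place of a real PDF tokenizer.

```cpp
#include <cassert>
#include <sstream>
#include <string>

namespace sketch {

// Hypothetical token type: the raw text of one whitespace-delimited word.
struct Token {
    std::string text;
};

// Hypothetical filter interface: receives tokens one at a time and appends
// whatever it wants to emit to the output string.
class TokenFilter {
public:
    virtual ~TokenFilter() = default;
    virtual void handle_token(const Token& token, std::string& out) = 0;
    // Called once after the last token; the default does nothing.
    virtual void handle_eof(std::string& /*out*/) {}
};

// Example filter: writes each token on its own line. This is only a crude
// stand-in for a normalizing filter, not a model of ContentNormalizer.
class OneTokenPerLine : public TokenFilter {
public:
    void handle_token(const Token& token, std::string& out) override {
        out += token.text;
        out += '\n';
    }
};

// Generic driver: tokenizes the input (here, by splitting on whitespace)
// and passes every token through the supplied filter.
inline std::string run_filter(const std::string& input, TokenFilter* filter) {
    assert(filter != nullptr);
    std::string out;
    std::istringstream in(input);
    Token token;
    while (in >> token.text) {
        filter->handle_token(token, out);
    }
    filter->handle_eof(out);
    return out;
}

} // namespace sketch
```

In this sketch the driver knows nothing about what the filter does with the tokens, so a different filter can be swapped in without changing `run_filter`. Passing the string `"q 1 0 0 1 5 5 cm Q"` with a `OneTokenPerLine` filter produces the same nine tokens, one per line.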