Commit Graph

96 Commits

Author SHA1 Message Date
jonaswinkler fca98b411e reorganised settings documentation and added OCR_USER_ARGS 2020-11-29 12:38:32 +01:00
Jonas Winkler a03315102a added image DPI detection to the tesseract parser. 2020-11-25 19:37:48 +01:00
Jonas Winkler d3c13f6c93 removed unused settings. 2020-11-25 19:30:11 +01:00
Jonas Winkler 2d559d330d reworked PDF parser that uses OCRmyPDF and produces archive files. 2020-11-25 14:50:43 +01:00
Jonas Winkler f4013b1343 added setting PAPERLESS_AUTO_LOGIN_USERNAME 2020-11-23 22:50:02 +01:00
Jonas Winkler fec9e54049 new setting: PAPERLESS_OCR_PAGES 2020-11-22 12:54:08 +01:00
Jonas Winkler d1f9f456bb moved the documentation of the config
to the actual docs.
2020-11-19 13:18:57 +01:00
Jonas Winkler d8e6804650 removed all quotes from the config: they are not needed and are confusing for the docker-compose env files, where they are actually not allowed. 2020-11-18 22:42:56 +01:00
Jonas Winkler c487e5f017 a new setting that allows you to skip thumbnail optimization. 2020-11-18 22:42:05 +01:00
Jonas Winkler fef6dd38f9 Merge branch 'dev' into mail_rework 2020-11-17 00:23:10 +01:00
Jonas Winkler 31c4167535 added option for polling 2020-11-16 18:52:13 +01:00
Jonas Winkler f6a926c9b1 added config options to conf example 2020-11-16 18:37:12 +01:00
Jonas Winkler 0b1637da62 first implementation of the mail rework 2020-11-15 23:56:22 +01:00
Jonas Winkler 83f82f3caf added a setting: delete duplicate documents 2020-11-10 01:47:58 +01:00
Jonas Winkler 9d22d9c9b1 added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler 44cd8565b1 removed no-login option since it wasn't working with the new django version anyway. 2020-11-09 15:28:45 +01:00
Jonas Winkler 9067a4f288 added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler dc59e0f257 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler 9ce926df7f updated configuration 2020-11-02 21:59:36 +01:00
Jonas Winkler 9f55fb668d silenced unpaper, optipng for cleaner output
moved parser settings to settings
removed forgiving ocr (now default) since tesseract is plenty accurate even without defining the correct language.
2020-11-01 23:23:42 +01:00
Jonas Winkler 5c4849796b fixed folders, compatible with previous paperless version 2020-10-29 14:30:15 +01:00
Jonas Winkler 322832c9b3 config cleanup 2020-10-29 00:34:37 +01:00
Jonas Winkler c26962f17f changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler 052c1680f3 added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler b0143a280e removed reminders, changed a bunch of stuff 2020-10-20 00:35:39 +02:00
Jonas Winkler 421dab786d Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
Wolf-Bastian Poettner 07bb9aa723 Allow filename to use tags[KEY] and tags[INDEX] 2020-03-28 21:06:17 +00:00
Wolf-Bastian Poettner d8fda64cd0 Tags in filenames can now either be found by key (in case - or _ are
used as delimeter) or by index. Added some more tests.
2020-02-01 20:58:44 +00:00
Wolf-Bastian Poettner 6a671ebd22 Unified PAPERLESS_DIRECTORY_FORMAT and PAPERLESS_FILENAME_FORMAT into
one configuration setting
2020-02-01 20:09:36 +00:00
Wolf-Bastian Pöttner e70d0c53c8 Update paperless.conf.example
Co-Authored-By: Pit <pitkley@googlemail.com>
2020-02-01 14:22:02 +01:00
Wolf-Bastian Poettner 6813805712 Allows to configure directory and filename formats for documents stored in paperless
Default configuration is as before (incrementing numbers), but additional fields can be added at will
2019-12-27 14:25:38 +00:00
Michael Gmelin f0320fb72d Allow configuring transformations to be applied to the filename before
parsing. The motivation was that files produced by a Brother scanner
wouldn't match paperless' expectations. At most one transformation
is applied (first matching). It won't affect the filename on disk.

This is generic enough so that it is useful for various purposes.
In my case it allows me to use the different hardware buttons on
the scanner to use different profiles, feeding one instance of
paperless with documents of multiple entities and tagging them
accordingly.

Example:

PAPERLESS_FILENAME_PARSE_TRANSFORMS=[{"pattern":"^([a-z]+)_(\\d{8})_(\\d{6})_([0-9]+)\\.", "repl":"\\2\\3Z - \\4 - \\1."},{"pattern":"^([a-z]+)_([0-9]+)\\.", "repl":" - \\2 - \\1."}]
2019-08-31 15:01:25 +02:00
Jonas Winkler 32f3876590 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
JensPfeifle ea282c22ba Add GS_BINARY to settings to avoid harcoded call of "gs" 2019-03-03 20:31:52 +01:00
Daniel Quinn 3c78105fd7 Align example conf with real-world defaults 2019-01-27 13:05:56 +00:00
Jonas Winkler 4bd018df07 Merge branch 'master' into dev 2019-01-21 15:07:49 +01:00
Daniel Quinn cea8332038 Merge pull request #468 from ddddavidmartin/document_ocr_always_setting
Reference PAPERLESS_OCR_ALWAYS in example config file.
2018-12-31 14:30:58 +00:00
David Martin 4d13521f36 Reference PAPERLESS_OCR_ALWAYS in example config file.
This setting was introduced when support for retrieving the text layer
from documents was added. Having it in the example config makes it more
clear that it exists.
2018-12-16 18:11:39 +11:00
Jonas Winkler 766109ae4e Merge remote-tracking branch 'upstream/master' 2018-12-11 12:06:15 +01:00
Daniel Quinn d544f269e0 Conform everything to the coding standards
https://paperless.readthedocs.io/en/latest/contributing.html#additional-style-guides
2018-12-01 17:09:12 +00:00
Joshua Taillon 730daa3d6d Merge branch 'master' of github.com:danielquinn/paperless into ENH_filename_date_parsing 2018-11-15 23:17:59 -05:00
Joshua Taillon e1d8744c66 Add option for parsing of date from filename (and associated tests) 2018-11-15 20:32:15 -05:00
David Martin 818780a191 Add PAPERLESS_FORGIVING_OCR option to example config.
It helps having it in the example config as that makes it more clear
that it exists.
2018-10-08 19:38:38 +11:00
Daniel Quinn 750ab5bf85 Use optipng to optimise document thumbnails 2018-10-07 14:56:38 +01:00
Jonas Winkler fb6f2e07c9 Added a bunch of new features:
- Debug mode is now configurable in the configuration file. This way, we don't have to edit versioned files to disable it on production systems.
- Recent correspondents filter (enable in configuration file)
- Document actions: Edit tags and correspondents on multiple documents at once
- Replaced month list filter with date drilldown
- Sortable document count columns on Tag and Correspondent admin
- Last correspondence column on Correspondent admin
- Save and edit next functionality for document editing
2018-09-13 15:19:25 +02:00
Jonas Winkler 7c589f71a4 Fixed a few minor issues. 2018-09-12 16:25:23 +02:00
Jonas Winkler ef0d37985b Merge branch 'master' into dev 2018-09-12 11:47:35 +02:00
Jonas Winkler 17803e7936 fixed settings 2018-09-11 17:30:46 +02:00
Daniel Quinn a86a20ef0f Make the example file contain the default value 2018-09-09 21:16:53 +01:00
Joshua Taillon 22378789e2 add option for inline vs. attachment for document rendering 2018-09-05 22:58:38 -04:00