Commit Graph

36 Commits

Author SHA1 Message Date
Daniel Quinn 857c7ac654 #44: Harmonise environment variables with constant names 2016-03-01 22:39:40 +00:00
Daniel Quinn 26c3781350 #44: Harmonise environment variables with constant names 2016-03-01 22:37:42 +00:00
Daniel Quinn 7d1aa1175f pep8 2016-03-01 19:03:28 +00:00
Daniel Quinn 26fc27da9b Setting appropriate permissions 2016-03-01 18:57:12 +00:00
Daniel Quinn 2fe9b0cbc1 New logging appears to work 2016-02-27 20:18:50 +00:00
Daniel Quinn 1aecb1e63a Compensate for case and format of jpg vs. jpeg 2016-02-23 20:15:13 +00:00
Daniel Quinn 55a7dc2444 pep8 2016-02-21 01:43:48 +00:00
Daniel Quinn 0d46643026 Version bump 2016-02-21 01:24:30 +00:00
Daniel Quinn 17d3a44952 A crude API is in place 2016-02-21 00:55:38 +00:00
Daniel Quinn 809fb8fa1f Moved the default GNUPG home to /tmp for tox-friendliness 2016-02-21 00:29:59 +00:00
Daniel Quinn 422ae9303a pep8 2016-02-21 00:14:50 +00:00
Daniel Quinn cebc44f2c9 API is halfway there 2016-02-16 09:28:34 +00:00
Daniel Quinn aea4af5d3b Version bump and feature update 2016-02-14 17:18:28 +00:00
Daniel Quinn 7843ea5037 Added and implemented a rudimentary logger 2016-02-14 16:09:52 +00:00
Pit Kleyersburg f5beda9c56 Enable parallel OCR processing
At the moment, every page in a PDF will be processed one by one using
tesseract. Since the processing of a single page is independent from every
other page, one can make use of multi-core machines.

This PR introduces a multiprocessing pool to process multiple pages
simultaneously. The amount of threads to use can be specified in the
environment variable `PAPERLESS_OCR_THREADS`. This will default to the
number of cores/hyperthreads Python detects for your system.
2016-02-14 15:57:42 +01:00
Daniel Quinn cef4abc01d version bump 2016-02-11 12:25:12 +00:00
Daniel Quinn 0eaed36420 The 'API' is written but untested 2016-02-08 23:46:16 +00:00
Daniel Quinn d8ad6b589b Added pytest and broke up the consumer into file and mail 2016-02-05 00:23:36 +00:00
Daniel Quinn a70b40f618 Broke the consumer script into separate files and started on a mail consumer 2016-01-30 01:18:52 +00:00
Daniel Quinn 3026593d6c Version bump for automated tagging 2016-01-28 07:29:25 +00:00
Daniel Quinn 286292dbf9 Added some documentation 2016-01-24 20:15:50 -05:00
Daniel Quinn 4f1bf81d5b Better variable names 2016-01-23 03:05:40 +00:00
Daniel Quinn fbbaf9cce0 Organised and documented project settings 2016-01-23 02:28:39 +00:00
the01 4c1ff658d2 add language setting for tesseract 2016-01-21 09:24:13 +01:00
Daniel Quinn 08270b1aaa Having debug off is too much of a pain in the ass 2016-01-17 02:10:07 +00:00
Daniel Quinn d89349f272 No need for the debug flag by default 2016-01-15 18:19:53 +00:00
Daniel Quinn 17615d43cb Fixed a few consumer bugs and added an exporter
Rename exporter to export and fixt some debugging

Account for files not matching the sender/title pattern

Added a safety note

Wrong regex on the name parser

Renamed the command to something slightly less ambiguous
2016-01-15 18:14:42 +00:00
Daniel Quinn 3f92c1df3c People were complaining that they couldn't find requirements.txt 2016-01-11 09:41:43 +00:00
Daniel Quinn 94ce12f9ee Merge branch 'master' of https://github.com/danielquinn/paperless 2016-01-10 23:18:49 +00:00
Daniel Quinn 97e9cc37a9 Set CONSUMPTION_DIR to something user-editable 2016-01-10 23:17:12 +00:00
Daniel Quinn f620762456 Safer place for the database 2016-01-10 22:45:15 +00:00
Daniel Quinn 3c2c7abdf9 Added django extensions 2016-01-10 13:53:18 +00:00
Daniel Quinn fe47f9f07e Better handling of the passphrase 2016-01-10 13:40:26 +00:00
Daniel Quinn f72c515742 Added GPG encryption for the PDFs 2016-01-01 16:13:59 +00:00
Daniel Quinn cb0f7393d6 Prettied up the admin 2015-12-26 13:21:19 +00:00
Daniel Quinn 855ee64097 It works! 2015-12-20 19:23:33 +00:00