paperless-ngx

mirror of https://github.com/paperless-ngx/paperless-ngx.git synced 2026-03-18 06:55:56 +00:00

Author	SHA1	Message	Date
Trenton H	aea2927a02	Feature: Convert Tika parser to the plugin system (#12333 ) * Chore: move Tika parser and tests to paperless/ Move TikaDocumentParser and its tests to the canonical parser package location, matching the pattern established for TextDocumentParser: - src/paperless_tika/parsers.py → src/paperless/parsers/tika.py - src/paperless_tika/tests/test_tika_parser.py → src/paperless/tests/parsers/test_tika_parser.py - src/paperless_tika/tests/samples/ → src/paperless/tests/samples/tika/ Merge tika fixtures (tika_parser, sample_odt_file, sample_docx_file, sample_doc_file, sample_broken_odt) into the shared parsers conftest. Remove the now-empty src/paperless_tika/tests/conftest.py. Content is unchanged — this commit is rename-only so git history is preserved on the moved files. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Feature: Phase 3 — migrate TikaDocumentParser to ParserProtocol Refactor TikaDocumentParser to satisfy ParserProtocol without subclassing the legacy DocumentParser ABC: - Add ClassVars: name, version, author, url - Add supported_mime_types() classmethod (12 Office/ODF/RTF MIME types) - Add score() classmethod — returns None when TIKA_ENABLED is False, 10 otherwise - can_produce_archive = False (PDF is for display, not an OCR archive) - requires_pdf_rendition = True (Office formats need PDF for browser display) - __enter__/__exit__ via ExitStack: TikaClient opened once per parser lifetime and shared across parse() and extract_metadata() calls - extract_metadata() falls back to a short-lived TikaClient when called outside a context manager (legacy view-layer metadata path) - _convert_to_pdf() uses OutputTypeConfig() to honour the database-stored ApplicationConfiguration before falling back to the env-var setting - Rename convert_to_pdf → _convert_to_pdf (private helper) Update paperless_tika/signals.py shim to import from the new module path and drop the legacy logging_group/progress_callback kwargs. 
Update documents/consumer.py to extend the existing TextDocumentParser special cases to also cover TikaDocumentParser (parse/get_thumbnail signatures, __exit__ cleanup). Add TestTikaParserRegistryInterface (7 tests) covering score(), properties, and ParserProtocol isinstance check. Update existing tests to use the new accessor API (get_text, get_date, get_archive_path, _convert_to_pdf). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Fix: update remaining imports and move live Tika tests after parser migration - src/documents/tests/test_parsers.py: import TikaDocumentParser from paperless.parsers.tika (old paperless_tika.parsers no longer exists) - git mv paperless_tika/tests/test_live_tika.py → paperless/tests/parsers/test_live_tika.py to co-locate all Tika tests with the parser; update import and replace old attribute API (tika_parser.text/.archive_path) with accessor methods (get_text/get_archive_path) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Fix: satisfy mypy and pyrefly for TikaDocumentParser Use a TYPE_CHECKING-guarded assert to narrow self._tika_client from TikaClient \| None to TikaClient at the point of use in parse(). The assert is visible to type checkers (TYPE_CHECKING=True) so both mypy and pyrefly accept the subsequent attribute accesses without error; at runtime TYPE_CHECKING is False so the assert never executes and no ruff S101 suppression is required. 
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Fix: require context manager for TikaDocumentParser; clean up client lifecycle - consumer.py: call __enter__ for new-style parsers so _tika_client and _gotenberg_client are set before parse() is invoked - views.py: use `with parser` (via nullcontext for old-style parsers) in get_metadata so extract_metadata always runs inside a context manager - tika.py: GotenbergClient added to ExitStack alongside TikaClient; inline client creation removed from extract_metadata and _convert_to_pdf; __exit__ uses ExitStack.close() instead of __exit__ pass-through Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 15:43:28 -07:00
Trenton H	85fecac401	Fix: don't try to usermod/groupmod when non-root + update docs (#12365 )	2026-03-17 05:15:03 +00:00
Trenton H	f15394fa5c	Fix: Removes the double exec that prevented migrations from running (#12317 )	2026-03-12 12:46:12 -07:00
Trenton H	86fa74c115	Fix: Postgres selection, DBENGINE and migrations (#12299 )	2026-03-11 11:54:24 -07:00
dependabot[bot]	484bef00c1	docker-compose(deps): Bump gotenberg/gotenberg in /docker/compose (#12190 ) Bumps gotenberg/gotenberg from 8.26 to 8.27. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.27' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 10:14:48 -08:00
dependabot[bot]	b9b90ec9f7	docker-compose(deps): Bump nginx in /docker/compose (#12018 ) Bumps nginx from 1.29-alpine to 1.29.5-alpine. --- updated-dependencies: - dependency-name: nginx dependency-version: 1.29.5-alpine dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-06 12:26:29 -08:00
dependabot[bot]	4a5116adf8	docker-compose(deps): Bump gotenberg/gotenberg in /docker/compose (#11979 ) Bumps gotenberg/gotenberg from 8.25 to 8.26. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.26' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-04 13:24:19 -08:00
shamoon	72b861b5eb	Fix: fix broken docker create_classifier command in 2.20.6 (#11965 )	2026-02-01 12:09:30 -08:00
shamoon	c3b036e0d3	Merge branch 'main' into dev	2026-01-31 09:10:33 -08:00
shamoon	6913f9d79c	Fix: fix user checks in management scripts (#11928 )	2026-01-28 13:45:12 -08:00
Trenton H	01b21377af	Chore: Use a local http server instead of external to reduce flakiness (#11916 )	2026-01-28 03:57:12 +00:00
Trenton H	c84f2f04b3	Chore: Switch to a local IMAP server instead of a real email service (#11913 )	2026-01-27 11:35:12 -08:00
shamoon	444ff6951e	Merge branch 'release/v2.20.x' into dev	2026-01-25 16:58:04 -08:00
Trenton H	d0032c18be	Breaking: Remove support for document and thumbnail encryption (#11850 )	2026-01-24 19:29:54 -08:00
Trenton H	94f6b8d36d	Fixes the management scripts under a non-root install where the user ID is something besides 1000 (#11870 )	2026-01-23 16:08:28 -08:00
shamoon	e940764fe0	Feature: Paperless AI (#10319 )	2026-01-13 16:24:42 +00:00
Daniel Rheinbay	67d079fe14	fix: Skip SSL for MariaDB ping in init script (#11491 ) Restore compatibility with MariaDB server versions < 11.4, which do not use SSL by default.	2025-11-28 14:25:57 -08:00
Daniel Rheinbay	ffc56bddda	fix: Add user parameter to MariaDB connection check (#11441 )	2025-11-23 15:03:35 -08:00
Trenton H	a96db50b0a	Feature: Replace duplicated static files with symlinks (#11418 )	2025-11-21 20:07:57 +00:00
Trenton H	bc622d67fc	Chore: Configure pre-commit to format our s6-overlay files (#11414 )	2025-11-19 21:34:29 +00:00
Trenton H	25b5e8fede	Improves the MariaDB wait command to use mariadb-admin ping for a better check if the server is up (#11396 )	2025-11-18 23:45:49 +00:00
dependabot[bot]	4bf681387a	docker-compose(deps): bump gotenberg/gotenberg in /docker/compose (#11393 ) Bumps gotenberg/gotenberg from 8.24 to 8.25. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.25' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-11-18 09:29:39 -08:00
Michael Martin	c3ac102eba	Enhancement: speed-up docker container startup (#11134 ) This alters the retry/backoff logic in the init-wait-for-db script to be more optimistic about database availability. During regular deployment and operations of paperless-ngx, it's common to restart the application server with the database instance already running, so we should optimize for this case. Instead of unconditionally delaying 5 seconds between each connection attempt, start with a minimum delay of 1 second and increase the delay linearly with each attempt, maxing out at 10 seconds. This makes the retry count-based failure mode less practical, so instead we just use a timeout-based approach.* *NOTE: the original implementation would have an effective timeout of 25s. This alters the behavior to 60s. Additionally, this removes an unnecessary 5s delay that was injected in the postgres case. The script uses a more comprehensive connection check for postgres than it does mariadb, so if anything this 5s delay after getting an "ok" response from the DB was extra unnecessary in the postgres case.	2025-11-17 13:11:49 -08:00
shamoon	a206ac78dd	Chore: update Postgres compose volume mount path (#11084 )	2025-10-20 16:18:36 +00:00
dependabot[bot]	7326224888	docker-compose(deps): Bump gotenberg/gotenberg in /docker/compose (#11050 ) Bumps gotenberg/gotenberg from 8.23 to 8.24. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.24' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Trenton H <797416+stumpylog@users.noreply.github.com>	2025-10-15 13:52:58 -07:00
dependabot[bot]	92ee906701	docker-compose(deps): Bump library/postgres in /docker/compose (#10965 ) Bumps library/postgres from 17 to 18. --- updated-dependencies: - dependency-name: library/postgres dependency-version: '18' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-09-30 20:25:02 +00:00
dependabot[bot]	84d85d7a23	docker-compose(deps): Bump gotenberg/gotenberg from 8.22 to 8.23 in /docker/compose (#10812 ) Bumps gotenberg/gotenberg from 8.22 to 8.23. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.23' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-09-09 12:47:25 -07:00
dependabot[bot]	10ccccc987	docker-compose(deps): Bump library/mariadb from 11 to 12 in /docker/compose (#10621 ) Bumps library/mariadb from 11 to 12. --- updated-dependencies: - dependency-name: library/mariadb dependency-version: '12' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-08-27 15:07:50 -07:00
dependabot[bot]	27d72ebb18	docker-compose(deps): Bump gotenberg/gotenberg from 8.20 to 8.22 in /docker/compose (#10687 ) Bumps gotenberg/gotenberg from 8.20 to 8.22. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.22' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-08-27 14:53:35 -07:00
Katrin Leinweber	5410074062	Documentation: copy-edits (#10417 )	2025-07-20 17:27:04 +00:00
Boyuan Yang	f8689c4819	Documentation: Fix URL for PAPERLESS_OCR_LANGUAGE example in docker-compose.env (#10408 )	2025-07-19 02:25:31 +00:00
Trenton H	3d2a3ede71	Chore: Updates dependency groups (#10339 )	2025-07-07 17:37:58 -07:00
shamoon	cc5ba71f06	Chore: remove spaces from run log	2025-06-17 16:02:45 -07:00
dependabot[bot]	bcb0ae1ee5	docker-compose(deps): Bump library/redis from 7 to 8 in /docker/compose (#9879 ) Bumps library/redis from 7 to 8. --- updated-dependencies: - dependency-name: library/redis dependency-version: '8' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-07 16:49:18 +00:00
Trenton H	c83b0bfca6	Fix: Trim off the path portion so the comparison can properly skip (#9839 )	2025-05-01 13:50:25 -07:00
dependabot[bot]	915584551c	docker-compose(deps): bump gotenberg/gotenberg from 8.19 to 8.20 in /docker/compose (#9661 ) Bumps gotenberg/gotenberg from 8.19 to 8.20. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.20' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-04-24 10:06:07 -07:00
shamoon	312bb743b9	Chore: add ymlfmt (#9745 )	2025-04-22 22:20:54 +00:00
Trenton H	ab8c75958d	Fix: Adds better handling during folder checking/creation/permissions for non-root (#9616 ) * Adds better handling during folder checking/creation/permissions for when the image is running as non-root * Prefers the long options to commands	2025-04-14 15:51:57 +00:00
Trenton H	e2860ed36d	Fix: Explicitly set the HOME environment variable for running as root at startup (#9643 ) * Explicitly set the HOME environment for the migrations to fix issue with certificates * Defines the HOME globally when we're running as root for startup	2025-04-14 15:21:45 +00:00
Thom Wiggers	82a5680217	Delete unused docker/docker-entrypoint.sh (#9615 )	2025-04-14 07:39:56 -07:00
Trenton Holmes	f036292b72	Merge remote-tracking branch 'origin/dev'	2025-04-09 14:53:31 -07:00
Trenton H	0fb55f3ae8	Fix: Run migration lock as the correct user (#9604 )	2025-04-09 21:15:38 +00:00
Trenton H	78822f6121	Fix: Adds a warning to the user if their secret file includes a trailing newline (#9601 )	2025-04-09 21:05:26 +00:00
dependabot[bot]	c9bc9acd1a	docker-compose(deps): Bump gotenberg/gotenberg in /docker/compose (#9532 ) Bumps gotenberg/gotenberg from 8.17 to 8.19. --- updated-dependencies: - dependency-name: gotenberg/gotenberg dependency-version: '8.19' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-04-02 18:53:44 +00:00
shamoon	32a7f9cd5a	Enhancement: allow webUI first account signup (#9500 )	2025-03-29 17:12:34 +00:00
Trenton H	9c68100dc0	Fix: Make management commands aware of the container environment (#9499 )	2025-03-26 14:17:10 -07:00
Trenton H	6e694ad9ff	Switch to using uvloop and upgrade granian for some ASGI fixes (#9494 )	2025-03-25 21:41:29 -07:00
Trenton H	9944f81512	Fix: Allow setting of other Granian options (#9360 )	2025-03-11 17:33:56 +00:00
dependabot[bot]	032bada221	docker-compose(deps): Bump library/postgres in /docker/compose (#9353 ) Bumps library/postgres from 16 to 17. --- updated-dependencies: - dependency-name: library/postgres dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-03-10 09:37:53 -07:00
Trenton H	654c9ca273	Feature: Switch webserver to granian (#9218 ) Co-authored-by: shamoon <4887959+shamoon@users.noreply.github.com>	2025-02-28 19:37:45 +00:00


271 Commits