mirror of https://github.com/paperless-ngx/paperless-ngx.git synced 2026-05-07 07:05:24 +00:00

T

Trenton H 701735f6e5 Chore: Drop old signal and unneeded apps, transition to parser registry instead (#12405 )

* refactor: switch consumer and callers to ParserRegistry (Phase 4)

Replace all Django signal-based parser discovery with direct registry
calls. Removes `_parser_cleanup`, `parser_is_new_style` shims, and all
old-style isinstance checks. All parser instantiation now uses the
`with parser_class() as parser:` context manager pattern.

- documents/parsers.py: delegate to get_parser_registry(); drop lru_cache
- documents/consumer.py: use registry + context manager; remove shims
- documents/tasks.py: same pattern
- documents/management/commands/document_thumbnails.py: same pattern
- documents/views.py: get_metadata uses context manager
- documents/checks.py: use get_parser_registry().all_parsers()
- paperless/parsers/registry.py: add all_parsers() public method
- tests: update mocks to target documents.consumer.get_parser_class_for_mime_type

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* refactor: drop get_parser_class_for_mime_type; callers use registry directly

All callers now call get_parser_registry().get_parser_for_file() with
the actual filename and path, enabling score() to use file extension
hints. The MIME-only helper is removed.

- consumer.py: passes self.filename + self.working_copy
- tasks.py: passes document.original_filename + document.source_path
- document_thumbnails.py: same pattern
- views.py: passes Path(file).name + Path(file)
- parsers.py: internal helpers inline the registry call with filename=""
- test_parsers.py: drop TestParserDiscovery (was testing mock behavior);
  TestParserAvailability uses registry directly
- test_consumer.py: mocks switch to documents.consumer.get_parser_registry

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* refactor: remove document_consumer_declaration signal infrastructure

Remove the document_consumer_declaration signal that was previously used
for parser registration. Each parser app no longer connects to this signal,
and the signal declaration itself has been removed from documents/signals.

Changes:
- Remove document_consumer_declaration from documents/signals/__init__.py
- Remove ready() methods and signal imports from all parser app configs
- Delete signal shim files (signals.py) from all parser apps:
  - paperless_tesseract/signals.py
  - paperless_text/signals.py
  - paperless_tika/signals.py
  - paperless_mail/signals.py
  - paperless_remote/signals.py

Parser discovery now happens exclusively through the ParserRegistry
system introduced in the previous refactor phases.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* refactor: remove empty paperless_text and paperless_tika Django apps

After parser classes were moved to paperless/parsers/ in the plugin
refactor, these Django apps contained only empty AppConfig classes
with no models, views, tasks, migrations, or other functionality.

- Remove paperless_text and paperless_tika from INSTALLED_APPS
- Delete empty app directories entirely
- Update pyproject.toml test exclusions
- Clean stale mypy baseline entries for moved parser files

paperless_remote app is retained as it contains meaningful system
checks for Azure AI configuration.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* Moves the checks and tests to the main application and removes the old applications

* Adds a comment to satisy Sonar

* refactor: remove automatic log_summary() call from get_parser_registry()

The summary was logged once per process, causing it to appear repeatedly
during Docker startup (management commands, web server, each Celery
worker subprocess). External parsers are already announced individually
at INFO when discovered; the full summary is redundant noise.
log_summary() is retained on ParserRegistry for manual/debug use.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* Cleans up the duplicate test file/fixture

* Fixes a race condition where webserver threads could race to populate the registry

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-22 06:53:32 -07:00

.devcontainer

Breaking: Remove pybzar as a barcode reader (#12065 )

2026-02-13 08:14:00 -08:00

.github

Chore(deps): Bump the actions group with 2 updates (#12377 )

2026-03-18 06:18:11 +00:00

docker

Fix: don't try to usermod/groupmod when non-root + update docs (#12365 ) (#12391 )

2026-03-18 10:38:45 -07:00

docs

Merge branch 'main' into dev

2026-03-21 02:12:19 -07:00

resources

New -ngx logo 2022

2022-02-26 20:14:24 -08:00

scripts

Chore: Remove some further old items (encryption passphrase and PNG handling) (#12290 )

2026-03-09 22:04:51 +00:00

src

Chore: Drop old signal and unneeded apps, transition to parser registry instead (#12405 )

2026-03-22 06:53:32 -07:00

src-ui

Auto translate strings

2026-03-21 09:26:23 +00:00

.codecov.yml

Breaking: Drop support for Python 3.10 (#12234 )

2026-03-04 15:03:33 -08:00

.dockerignore

Chore: Enable mypy checking in CI (#11991 )

2026-02-03 16:02:33 -08:00

.editorconfig

Breaking: Refactor advanced database settings to allow more user configuration (#12165 )

2026-02-27 14:37:26 -08:00

.env

Chore: Remove unneeded .env entry, revert crowdin action rm, reduce frequency

2023-12-02 08:24:17 -08:00

.gitignore

Chore: move to Zensical for docs (#12011 )

2026-02-07 10:58:55 -08:00

.hadolint.yml

Configure Hadolint in a single location for both hooks and CI

2022-07-19 13:54:33 -07:00

.mypy-baseline.txt

Chore: Drop old signal and unneeded apps, transition to parser registry instead (#12405 )

2026-03-22 06:53:32 -07:00

.pre-commit-config.yaml

Chore(deps): Bump https://github.com/astral-sh/ruff-pre-commit (#12371 )

2026-03-18 06:25:40 +00:00

.prettierrc.js

Chore(deps): Bump the pre-commit-dependencies group with 4 updates (#12323 )

2026-03-12 16:29:57 +00:00

.pyrefly-baseline.json

Chore: Configure pyrefly as an alternative typing tool (#12003 )

2026-02-07 10:33:00 -08:00

.yamlfmt

Chore(deps): Bump bootstrap from 5.3.7 to 5.3.8 in /src-ui (#10740 )

2025-09-03 21:58:53 +00:00

CODE_OF_CONDUCT.md

Chore(deps-dev): Bump the development group across 1 directory with 2 updates (#6851 )

2024-05-29 07:04:01 +00:00

CODEOWNERS

Chore: Switch from pipenv to uv (#9251 )

2025-03-04 16:15:51 +00:00

CONTRIBUTING.md

Breaking: Drop support for Python 3.10 (#12234 )

2026-03-04 15:03:33 -08:00

crowdin.yml

Chore: Implement crowdin GHA (#4706 )

2023-12-01 17:44:33 -08:00

Dockerfile

docker(deps): bump astral-sh/uv (#12265 )

2026-03-10 17:27:06 +00:00

install-paperless-ngx.sh

Chore: fix Postgres compose volume mount path in install script (#11184 )

2025-10-26 14:40:37 +00:00

LICENSE

Initial commit

2015-12-20 12:54:28 +00:00

paperless-ngx.code-workspace

Chore: Enables pylance pytest integration, swaps around some test markers (#11930 )

2026-01-28 23:06:11 +00:00

paperless.conf.example

Feature: support split documents based on tag barcodes (#11645 )

2026-01-29 08:05:33 -08:00

pyproject.toml

Chore: Drop old signal and unneeded apps, transition to parser registry instead (#12405 )

2026-03-22 06:53:32 -07:00

README.md

Documentation: update crowdin links (#9595 )

2025-04-09 08:01:21 -07:00

SECURITY.md

Create SECURITY.md

2024-02-15 23:38:33 -08:00

uv.lock

Merge branch 'main' into dev

2026-03-21 02:12:19 -07:00

zensical.toml

Breaking: Refactor advanced database settings to allow more user configuration (#12165 )

2026-02-27 14:37:26 -08:00

README.md

Paperless-ngx

Paperless-ngx is a document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper.

Paperless-ngx is the official successor to the original Paperless & Paperless-ng projects and is designed to distribute the responsibility of advancing and supporting the project among a team of people. Consider joining us!

Thanks to the generous folks at DigitalOcean, a demo is available at demo.paperless-ngx.com using login demo / demo. Note: demo content is reset frequently and confidential information should not be uploaded.

Features
Getting started
Contributing
Related Projects
Important Note

This project is supported by:

Features

A full list of features and screenshots are available in the documentation.

Getting started

The easiest way to deploy paperless is docker compose. The files in the /docker/compose directory are configured to pull the image from the GitHub container registry.

If you'd like to jump right in, you can configure a docker compose environment with our install script:

bash -c "$(curl -L https://raw.githubusercontent.com/paperless-ngx/paperless-ngx/main/install-paperless-ngx.sh)"

More details and step-by-step guides for alternative installation methods can be found in the documentation.

Migrating from Paperless-ng is easy, just drop in the new docker image! See the documentation on migrating for more details.

Documentation

The documentation for Paperless-ngx is available at https://docs.paperless-ngx.com.

Contributing

If you feel like contributing to the project, please do! Bug fixes, enhancements, visual fixes etc. are always welcome. If you want to implement something big: Please start a discussion about that! The documentation has some basic information on how to get started.

Community Support

People interested in continuing the work on paperless-ngx are encouraged to reach out here on github and in the Matrix Room. If you would like to contribute to the project on an ongoing basis there are multiple teams (frontend, ci/cd, etc) that could use your help so please reach out!

Translation

Paperless-ngx is available in many languages that are coordinated on Crowdin. If you want to help out by translating paperless-ngx into your language, please head over to https://crowdin.com/project/paperless-ngx, and thank you! More details can be found in CONTRIBUTING.md.

Feature Requests

Feature requests can be submitted via GitHub Discussions, you can search for existing ideas, add your own and vote for the ones you care about.

Bugs

For bugs please open an issue or start a discussion if you have questions.

Please see the wiki for a user-maintained list of related projects and software that is compatible with Paperless-ngx.

Important Note

Document scanners are typically used to scan sensitive documents like your social insurance number, tax records, invoices, etc. Paperless-ngx should never be run on an untrusted host because information is stored in clear text without encryption. No guarantees are made regarding security (but we do try!) and you use the app at your own risk. The safest way to run Paperless-ngx is on a local server in your own home with backups in place.

Languages

PostScript 71.7%

Python 15.7%

TypeScript 9.7%

HTML 2.4%

SCSS 0.3%

README.md

Paperless-ngx

Features

Getting started

Documentation

Contributing

Community Support

Translation

Feature Requests

Bugs

Related Projects

Important Note