Self-Hosted⭐⭐PythonGPL-3.0

Paperless-ngx

Scan, OCR, and organize documents into a searchable digital archive

Editor's Take

Paperless-ngx is the project that finally makes going paperless feel achievable rather than aspirational. The workflow is simple: scan a document, and Paperless automatically OCRs it, extracts text, suggests tags and categories using machine learning, and files it in a searchable archive. The email consumption feature is particularly clever — pipe your invoices and receipts through a dedicated email address and they're processed automatically. The search is fast and accurate, making it genuinely useful as a document management system rather than just a digital filing cabinet. What impressed us is how well the ML categorization works after just a few weeks of training. The Docker setup is straightforward, and the web interface is clean and functional. The only real barrier is the initial setup: you need a scanner and some patience to get your existing paper organized. But once it's running, it runs forever.

Best for users who are comfortable following setup instructions or running a self-hosted tool.

Start Here

Why It Stands Out

  • 1Automatic OCR on upload with text search across all documents
  • 2Machine learning auto-categorization with tags and correspondents
  • 3Email consumption — automatically process incoming PDF attachments

Best Use Cases

Go paperless at home

Scan all physical documents and search them digitally from any device

Invoice management

Automatically process and categorize invoices from email

Who Should Try It

individualshome office

Similar Projects

#self-hosted#document-management#ocr#docker