"What’s Your Preferred Self-Hosted Solution for Deep Monitoring (Beyond Simple Page Changes)?"
"What’s Your Preferred Self-Hosted Solution for Deep Monitoring (Beyond Simple Page Changes)?"
Hello! I'm evaluating tools to track changes in:
- Government/legal PDFs (new regulations, court rulings)
- News sites without reliable RSS
- Tender portals
- Property management messages (e.g. service notices)
- Bank terms and policy updates
Current options I've tried:• Huginn — Powerful but requires significant setup, no unified feed • Changedetection-io — good for HTML, limited for documents
Key needs:✓ Local processing (no cloud dependencies)✓ Multi-page PDF support✓ Customizable alert rules✓ Trying to reduce manual monitoring overhead — looking for robust, offline-first approaches
What's working well for others? Especially interested in:
- Solutions combining OCR + text analysis
- Experience with local LLMs for this (NLP, not just diff)
- Creative workarounds you've built
(P.S. Testing a deep scraping + LLM pipeline — if results look promising, will share.)