01 Automated multi-source search across DocumentCloud, CourtListener, Scribd, and DOJ archives.
02 Browser automation via Playwright to handle complex UI interactions and rendered DOM extraction.
03 Built-in support for the RECAP extension to access crowdsourced PACER documents for free.
04 Automatic document organization with a metadata manifest including source URLs and timestamps.
05 GitHub stars: 3
06 Two-phase search strategy that prioritizes direct downloads before engaging browser automation.
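
The two-phase strategy and the metadata manifest described above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names `fetch_two_phase`, `build_manifest`, `ManifestEntry`, and the stub fetchers are all hypothetical, and the real tool presumably uses HTTP requests for the first phase and Playwright for the second. Here both phases are passed in as plain callables so the control flow is visible without any network access.

```python
import json
from dataclasses import dataclass, asdict
from datetime import datetime, timezone
from typing import Callable, List, Optional, Tuple

# Hypothetical fetcher type: returns document bytes, or None on failure.
Fetcher = Callable[[str], Optional[bytes]]


@dataclass
class ManifestEntry:
    """One manifest row (field names are assumptions, not the tool's schema)."""
    filename: str
    source_url: str
    retrieved_at: str  # ISO 8601 timestamp in UTC
    method: str        # "direct" or "browser"


def fetch_two_phase(
    url: str, direct_fetch: Fetcher, browser_fetch: Fetcher
) -> Optional[Tuple[bytes, str]]:
    """Try the cheap direct download first; fall back to browser automation."""
    data = direct_fetch(url)
    if data is not None:
        return data, "direct"
    data = browser_fetch(url)
    if data is not None:
        return data, "browser"
    return None  # both phases failed


def build_manifest(
    urls: List[str], direct_fetch: Fetcher, browser_fetch: Fetcher
) -> List[ManifestEntry]:
    """Fetch each URL and record where and when each document was obtained."""
    entries: List[ManifestEntry] = []
    for index, url in enumerate(urls, start=1):
        result = fetch_two_phase(url, direct_fetch, browser_fetch)
        if result is None:
            continue  # skip documents that neither phase could retrieve
        _, method = result
        entries.append(
            ManifestEntry(
                filename=f"doc_{index:03d}.pdf",
                source_url=url,
                retrieved_at=datetime.now(timezone.utc).isoformat(),
                method=method,
            )
        )
    return entries


# Stub fetchers standing in for real HTTP and browser-automation code.
def stub_direct(url: str) -> Optional[bytes]:
    return b"%PDF-stub" if url.endswith(".pdf") else None


def stub_browser(url: str) -> Optional[bytes]:
    return b"%PDF-stub"


if __name__ == "__main__":
    manifest = build_manifest(
        ["https://example.org/a.pdf", "https://example.org/viewer?id=1"],
        stub_direct,
        stub_browser,
    )
    print(json.dumps([asdict(entry) for entry in manifest], indent=2))
```

Passing the two phases in as functions keeps the fallback logic testable on its own. The browser phase only runs when the direct phase returns nothing, which matches the stated goal of engaging browser automation only when a direct download is not available.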