Release Feb 27, 2026
Cleaner, more accurate content extraction from HTML inputs with fewer irrelevant elements and preserved structured data.
💎 Improved
- Strip non-content elements before parsing to reduce noise and improve extracted content quality.
- Preserve JSON-LD and math scripts so embedded structured data and formulas remain available.