Release Feb 27, 2026

Cleaner, more accurate content extraction from HTML inputs with fewer irrelevant elements and preserved structured data.

💎 Improved

  • Strip non-content elements before parsing to reduce noise and improve extracted content quality.
  • Preserve JSON-LD and math scripts so embedded structured data and formulas remain available.
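
As an illustration of the two items above, here is a minimal sketch of such a pre-parse filter, using only Python's standard-library `html.parser`. It is not the product's implementation. The names `strip_non_content`, `STRIP_TAGS`, and `KEEP_SCRIPT_TYPE_PREFIXES` are hypothetical, and the choice of which elements count as "non-content" and which script types count as "math" is an assumption, since the release note does not list them.

```python
from html.parser import HTMLParser

# Assumption: the release note does not say which elements are stripped.
STRIP_TAGS = {"script", "style", "noscript", "template", "iframe"}
# Assumption: JSON-LD plus MathJax-style script types such as "math/tex".
KEEP_SCRIPT_TYPE_PREFIXES = ("application/ld+json", "math/")


class _ContentFilter(HTMLParser):
    """Re-emits HTML while dropping non-content elements and comments."""

    def __init__(self):
        super().__init__(convert_charrefs=False)
        self.out = []
        self.skip_tag = None   # name of the element currently being dropped
        self.skip_depth = 0    # nesting depth of that same tag name

    def _keep(self, tag, attrs):
        if tag not in STRIP_TAGS:
            return True
        if tag == "script":
            script_type = (dict(attrs).get("type") or "").strip().lower()
            return script_type.startswith(KEEP_SCRIPT_TYPE_PREFIXES)
        return False

    def _emit(self, text):
        if self.skip_tag is None:
            self.out.append(text)

    def handle_starttag(self, tag, attrs):
        if self.skip_tag is not None:
            if tag == self.skip_tag:
                self.skip_depth += 1
        elif self._keep(tag, attrs):
            self.out.append(self.get_starttag_text())
        else:
            self.skip_tag, self.skip_depth = tag, 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> have no matching end tag.
        if self._keep(tag, attrs):
            self._emit(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_tag is None:
            self.out.append(f"</{tag}>")
        elif tag == self.skip_tag:
            self.skip_depth -= 1
            if self.skip_depth == 0:
                self.skip_tag = None

    def handle_data(self, data):
        self._emit(data)

    def handle_entityref(self, name):
        self._emit(f"&{name};")

    def handle_charref(self, name):
        self._emit(f"&#{name};")

    def handle_decl(self, decl):
        self._emit(f"<!{decl}>")

    # Comments are dropped: HTMLParser.handle_comment does nothing by default.


def strip_non_content(html: str) -> str:
    """Return html without non-content elements, keeping JSON-LD and math scripts."""
    parser = _ContentFilter()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


page = (
    "<html><head><style>p{color:red}</style>"
    "<script>track()</script>"
    '<script type="application/ld+json">{"@type": "Article"}</script>'
    "</head><body><p>Hello</p><noscript>Enable JS</noscript></body></html>"
)
print(strip_non_content(page))
```

In this sketch the inline tracking script, the stylesheet, and the `<noscript>` fallback are removed, while the JSON-LD block passes through unchanged so a later extraction step can still read it. A production filter would likely use a full HTML5 parser and a more carefully chosen element list.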