PyPI page
Home page
Author:
None
License:
MIT
Summary:
🚀 Lean Python tool for extracting clean, LLM-optimized markdown from web pages. Handles dynamic content with Playwright + Trafilatura for maximum information extraction efficiency.
Latest version:
0.1.2
Required dependencies:
aiohttp
|
beautifulsoup4
|
click
|
html2text
|
loguru
|
lxml
|
openai
|
pandas
|
playwright
|
python-dotenv
|
tabulate
|
trafilatura
Optional dependencies:
black
|
ipykernel
|
jupyter
|
mypy
|
pre-commit
|
pytest
|
pytest-asyncio
|
pytest-cov
|
pytest-mock
|
ruff
Downloads last day:
18
Downloads last week:
29
Downloads last month:
32