Author: None
License: Apache License 2.0
Summary: MinerU-HTML is a main content extraction tool based on Small Language Models.
Latest version: 1.1.2
Required dependencies: accelerate | beautifulsoup4 | lxml | mineru-webkit | pytest | pytest-asyncio | selectolax | trafilatura | transformers
Optional dependencies: nest-asyncio | openai | vllm
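The required/optional split above can be checked against an installed copy of the package. The following is a minimal sketch using only the Python standard library. It assumes that optional dependencies are declared through extras, so that their `Requires-Dist` entries carry an `extra == "..."` marker. The helper names `split_requirements` and `dependencies_of` are hypothetical and are not part of MinerU-HTML.

```python
import re
from importlib import metadata


def split_requirements(requires):
    """Split Requires-Dist strings into (required, optional) name lists."""
    required, optional = [], []
    for req in requires or []:
        # The project name is the leading run of letters, digits, '.', '_' and '-'.
        name = re.match(r"[A-Za-z0-9._-]+", req.strip()).group(0)
        # Anything after ';' is an environment marker; requirements that are
        # only pulled in by an extra mention `extra` there.
        marker = req.partition(";")[2]
        (optional if "extra" in marker else required).append(name)
    return required, optional


def dependencies_of(dist_name):
    """Return (required, optional) for an installed distribution, or None."""
    try:
        return split_requirements(metadata.requires(dist_name))
    except metadata.PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None
```

Calling `dependencies_of("mineru-html")` would report the split for a local install. That distribution name is an assumption and should be checked against the package's PyPI page before use.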
Downloads last day: 19
Downloads last week: 130
Downloads last month: 383