PyPI Stats

Search

All packages
Top packages

Track packages

deduplication


PyPI page
Home page
Author: Marcnuth
License: Apache License 2.0
Summary: Remove duplicate documents via popular algorithms such as SimHash, SpotSig, Shingling, etc.
Latest version: 0.0.3
Required dependencies: spacy

Downloads last day: 10
Downloads last week: 28
Downloads last month: 104