PyPI Stats

Search

All packages
Top packages

Track packages

nlp-dedup


PyPI page
Home page
Author: Dan Saattrup Nielsen
License: MIT
Summary: Remove duplicates and near-duplicates from text corpora, no matter the scale.
Latest version: 0.1.2
Required dependencies: datasketch | joblib | more-itertools | tqdm

Downloads last day: 5
Downloads last week: 110
Downloads last month: 142