PyPI page
Home page
Author:
Dan Saattrup Nielsen
License:
MIT
Summary:
Remove duplicates and near-duplicates from text corpora, no matter the scale.
Latest version:
0.1.2
Required dependencies:
datasketch
|
joblib
|
more-itertools
|
tqdm
Downloads last day:
5
Downloads last week:
110
Downloads last month:
142