Author:
VictorAut
Summary:
A Python library for near deduplication and record linkage.
Latest version:
0.8.0
Required dependencies:
catalogue | cleanco | datasketch | faker | nameparser | networkx | nltk | pandas | polars | pyarrow | rapidfuzz | scikit-learn | sparse-dot-topn | typing-extensions
Optional dependencies:
dask | modin | pyspark | ray
Downloads last day:
45
Downloads last week:
252
Downloads last month:
988