PyPI page
Home page
Author:
Jean-Baptiste Laval
License:
Apache License 2.0
Summary:
PySin is a toolbox for text retrieval in unstructured documents datasets. It contains both a multi-type text extractor and a search engine. To test them, you can use the medical prescriptions generator that is also provided.
Latest version:
1.6.1
Required dependencies:
beautifulsoup4
|
docx2txt
|
faker
|
fpdf
|
google-cloud-translate
|
pandas
|
path.py
|
pdftotext
|
psycopg2-binary
|
requests
|
sqlalchemy
|
striprtf
|
tqdm
|
unidecode
Downloads last day:
2
Downloads last week:
10
Downloads last month:
59