An open-source Python library for simplifying local testing of Databricks workflows using PySpark and Delta tables. This library enables seamless testing of PySpark processing logic outside Databricks ...
RapidFuzz is a fast string matching library for Python and C++, which is using the string similarity calculations from FuzzyWuzzy. However, there are two aspects that set RapidFuzz apart from ...