analytic Coffee!

Record DeDuplication (record linkage)

Here are some Python-based tools that help find records referring to the same customer, to support your record deduplication initiatives.

libraries

Dedupe (pip install dedupe) https://pypi.org/project/dedupe/

pandas-dedupe (pip install pandas-dedupe) https://pypi.org/project/pandas-dedupe/ 
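
To give a feel for the record linkage problem, here is a minimal sketch that uses only Python's standard library. It is not how Dedupe or pandas-dedupe work internally (Dedupe learns its matching rules from pairs that you label). It simply compares every pair of records with a fuzzy string score and flags pairs above a cutoff. The customer records, the field names, and the 0.85 threshold are all assumptions invented for this illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical customer records, invented for illustration only.
customers = [
    {"id": 1, "name": "Katherine Johnson", "email": "k.johnson@example.com"},
    {"id": 2, "name": "Kathrine Johnson", "email": "k.johnson@example.com"},
    {"id": 3, "name": "Miguel Alvarez", "email": "miguel@example.org"},
    {"id": 4, "name": "Katherine Johnston", "email": "kj@example.net"},
]


def normalize(value):
    """Lowercase the text and collapse repeated whitespace."""
    return " ".join(value.lower().split())


def similarity(a, b):
    """Return a fuzzy similarity score between 0.0 and 1.0."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_candidate_duplicates(records, threshold=0.85):
    """Compare every pair of records and keep the pairs that look alike.

    The score is the plain average of the name and email similarity.
    The threshold is an assumed value; tune it against your own data.
    """
    pairs = []
    for left, right in combinations(records, 2):
        name_score = similarity(left["name"], right["name"])
        email_score = similarity(left["email"], right["email"])
        score = (name_score + email_score) / 2
        if score >= threshold:
            pairs.append((left["id"], right["id"], score))
    return pairs


candidate_pairs = find_candidate_duplicates(customers)
for left_id, right_id, score in candidate_pairs:
    print(f"Possible duplicate: {left_id} and {right_id} (score {score:.2f})")
```

Comparing every pair is fine for a small list but grows quickly with the number of records, and a fixed threshold will miss some true duplicates and flag some false ones. Those are the problems the libraries above are designed to address, so treat this sketch as a way to understand the task and not as a replacement for them.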

articles

Basics of Entity Resolution with Python and Dedupe https://medium.com/district-data-labs/basics-of-entity-resolution-with-python-and-dedupe-bc87440b64d4

  • Dedupe
  • record linkage
  • merge