Posted on February 25th, 2023
We have published a step-by-step tutorial demonstrating the automated identification of inconsistent company name data in a dataset. It shows our Cloud Data Connect product that connects to various data sources and runs match reports, appends similarity keys, generates SQL, and performs several other functions utilizing Interzoid APIs behind the scenes to identify matches.
The tutorial found here provides a walkthrough of the automated generation of a match report of possible duplicate data for a CSV file that is stored in the Cloud in AWS S3. The product works the same for Cloud databases such as Snowflake, AWS RDS, Google Cloud SQL, Microsoft Azure SQL, Postgres, MySQL, and several others. It also works with other data content types such as individual person data and street address data, data that is typically inconsistently represented within databases.