際際滷

際際滷Share a Scribd company logo
Deduplication & Fusioninfo@sparsity-technologies.com
IndexIntroduction
Process
Successful stories
DemoIndexIntroduction
Process
Successful stories
DemoIntroductionBenefitsIdentification of suspected duplicated records inside a databaseMerging of data belonging to several databases with different formats detecting duplicated recordsValidation tools for the detected similarities
IntroductionDeduplication
IntroductionDeduplicationConfigurationAutomatic executionValidation of resultsPersonalized export
IntroductionDeduplicationConfigurationAutomatic executionValidation of resultsPersonalized export
IntroductionFusion
IntroductionFusionConfigurationAutomatic executionValidation of resultsPersonalized export
IntroductionFusionConfigurationAutomatic executionValidation of resultsPersonalized export
IntroductionFeatures
IndexIntroduction
Process
Successful stories
DemoProcessConfigurationsInput data file format: CSV
Select relevant columns to link registers
Relation between columns from different data sources (only when merging)
Assign types to columns to help using the most adequate automatic filtersCSVConfigurationsExecutionValidationExportationExcelPDFXMLCSV
ProcessConfigurationsComparative type: exact value, estimation by text, numerical estimation
Percentage of the importance of each column for the similarity computationCSVConfigurationsExecutionValidationExportation30%35%35% 100% =ExcelPDFXMLCSV
ProcessConfigurationsSpecific percentage for registers with null valued columns
Use filters to make values standard
Available automatic and specific filters for values such as name, dates, address, etcCSVConfigurationsExecutionValidationExportationExcelPDFXMLCSV
ProcessConfigurationsEdit filters (create new filters, delete or update existing ones)
Use of dictionaries: name-converter dictionary (I.e.: Pepe  Jose)

More Related Content

Daurum: Introduction