Skip to main content
NewsRelease Notes

January ’26 Release: Improved matching accuracy, clearer visibility, and more control

By February 1, 2026No Comments

The January ’26 release focuses on improving the reliability and clarity of DataGroomr’s core workflows. This update delivers more accurate duplicate detection, better visibility into matching behavior, and additional options for teams that need deeper control.

Below is a summary of what’s included and why it matters.

Enhanced matching accuracy

Duplicate detection depends on both precision and predictability. In this release, we enhanced Similar/Fuzzy matching to better detect “near matches” that may have been missed previously.

The updated approach:

  • Uses broader blocking logic to identify more candidate records
  • Refines similarity scoring within each cluster for more accurate confidence score values

To avoid unexpected changes:

  • Existing models continue to use Similar v1, preserving current match results
  • New models automatically use the updated Similar logic

This ensures improved accuracy going forward without impacting live or historical behavior.

AI improvements for review and rule guidance

Duplicate Review and Personalized Rule now run on the latest Claude Sonnet 4.5.

This improves consistency and clarity in AI-generated recommendations, particularly for edge cases and nuanced matching scenarios.

Duplicate Review and Personalized Rule

New integration and customization options

Real-time duplicate check API

A new API endpoint allows you to check incoming records for duplicates before they are inserted or updated in Salesforce.

  • Submit records via a POST request
  • Match against your Live Dedupe dataset index
  • Receive confidence scores and likely master records in response

Learn more here: DataGroomr API

This is useful for custom ingestion pipelines, middleware validations, or pre-save checks in external systems.

duplicte check api

Override standard Salesforce merge

Enterprise customers can now replace the standard Salesforce merge with a custom Salesforce Flow.

This allows teams to:

  • Trigger notifications
  • Create follow-up tasks
  • Apply business-specific logic after merges

The option is available directly in the dataset Merge settings.

Override standard Salesforce merge

Clearer information, fewer configuration steps

Several updates improve visibility and reduce the need to open configuration dialogs:

  • Minimum Match Confidence is now displayed on the dataset page
  • Dataset grids support up to 100 display fields (previously 50)
  • Mass merge results clearly distinguish estimated duplicates from records actually retrieved
log browser
  • Dataset filters now show field type icons to reduce selection errors
dataset configuration

These changes make it easier to understand dataset behavior at a glance.

Improved auditability and traceability

This release also improves transparency across import, merge, and audit workflows.

  • Import error logs now include identifying CSV columns
  • Dashboard Merge/Convert history charts support direct drill-down into the Log Browser
  • Exported datasets from the Transfer module include both mapped fields and original CSV columns for reuse

These updates simplify troubleshooting and downstream data operations.

Reliability and usability fixes

The January ’26 release includes a broad set of fixes across Dedupe, Cleanse, Transfer, Audit, and Setup, including:

  • Improved stability for incremental analysis and Live Dedupe grouping
  • Reliable undo and rollback behavior
  • Reduced UI refreshes, and inconsistent counters
  • Multiple layout, sorting, and interaction fixes across the application

Many of these changes are based on production telemetry.

Summary

The DataGroomr January ’26 release strengthens matching accuracy, improves transparency, and expands customization options, while maintaining backward compatibility and predictable behavior. If you have questions about any of these changes or want help applying them to your workflows, our team is happy to help.

Ben Novoselsky

Ben Novoselsky is Chief Technology Officer at DataGroomr, where he leads the design and development of scalable data quality solutions that power reliable analytics, operations, and AI at scale. A hands-on architect with over 25 years of experience, Ben specializes in distributed systems, data integrity, and applied methods for ensuring data reliability at scale. He holds a PhD in Computer Science and a Master’s in Computational and Applied Mathematics from St. Petersburg State University.