Data Cleansing with SSIS Brian Knight Pragmatic Works
[email protected] About the Ugly Guy Speaking • SQL Server MVP • Founder of Pragmatic Works • Co-Founder of BIDN.com, SQLServerCentral.com and SQLShare.com • Written more than a dozen books on SQL Server
Takeaways • Profiling the data • Cleansing and validating with scripts • Fuzzy techniques
Data Profiling
• Retrieves metadata about your data • Help identify data issues • Uses an SSIS Data Profiling Task
Data Profiling Demo
Scripting in SSIS • Tasks • Data Flow – Source – Transform – Destination
Script Transform Demo
Fuzzy Techniques • De-duplicating data • Fuzzy catches misspelt words or variants • Fuzzy Grouping and Fuzzy Lookup
Fuzzy Grouping Demo
The End Already? • Questions http://www.bidn.com/people/brianknight
@BrianKnight
[email protected] http://www.youtube.com/pragmaticworks