Data profiling is all about CCC-AID? (Part 2)
In this part, we apply the principles discussed in the previous part to a sample set of data using the Talend™ open profiler tool.
The tool is simple to install and use. Once it is installed, the Eclipse workbench look and feel, together with the cheat sheets, lets anyone quickly learn the tricks of the trade of profiling. A graphic representation of the Talend™ open profiler.

As shown above, there are predefined profiling rules that can be executed against potential data sources, text files (comma-separated delimited files), and/or existing MDM applications. Once a profiling rule has been run, the graphical presentation of the result is of tremendous value: the CCC-AID criteria are depicted visually, which makes it easy to play the findings back to the business community. As shown below, the empty field values are highlighted, indicating a deficiency in the data acquisition process.
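For readers who want to see the idea behind the empty-field check outside of any tool, here is a minimal sketch in Python. It is not how Talend implements its rules; the function name `profile_empty_fields` and the sample columns are made up for illustration, and it assumes the input is a comma-separated file with a header row.

```python
import csv
import io


def profile_empty_fields(csv_text):
    """Count empty values per column in comma-separated text.

    Hypothetical helper for illustration only.
    Returns {column_name: (empty_count, total_rows)}.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = {name: 0 for name in (reader.fieldnames or [])}
    total = 0
    for row in reader:
        total += 1
        for name in counts:
            value = row.get(name)
            # Treat missing and whitespace-only values as empty.
            if value is None or value.strip() == "":
                counts[name] += 1
    return {name: (empty, total) for name, empty in counts.items()}


# Made-up sample data: one missing name and one missing email.
SAMPLE = """customer_id,name,email
1,Alice,alice@example.com
2,,bob@example.com
3,Carol,
"""

report = profile_empty_fields(SAMPLE)
for column, (empty, total) in report.items():
    print(f"{column}: {empty} of {total} values empty")
```

A column with a high share of empty values is the kind of finding that points back to a gap in the data acquisition process, which is the conversation to have with the business community.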
I am eager for the blogging community to share any insights. This blog is by no means an endorsement of any tool set; it aims to address the pain points of an MDM consultant.
I hope you liked this blog. Please feel free to drop me a note if any of the above material is useful or if you need any support from us. I am currently working on my next blog, which is about Data Modeling in MDM. Till then, adios.




